Query lcl|NC_019404.1_cdsid_YP_006987835.1 [gene=D861_gp62] [protein=putative portal protein] [protein_id=YP_006987835.1] [location=6428..7684] Match_columns 418 No_of_seqs 114 out of 251 Neff 8.0 Searched_HMMs 1612 Date Thu Nov 7 17:33:12 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_17 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_17_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:104338 Length: 422 100.0 2E-130 1E-133 731.9 42.4 415 1-415 1-422 (422) 2 protein:vir:107662 Length: 427 100.0 6E-128 4E-131 718.1 43.0 417 1-418 3-425 (427) 3 protein:vir:79647 Length: 435 100.0 3E-125 2E-128 703.1 42.6 414 1-414 12-435 (435) 4 protein:vir:5249 Length: 437 # 100.0 2E-115 1E-118 649.1 41.1 408 1-418 1-427 (437) 5 protein:vir:96068 Length: 765 100.0 2E-111 2E-114 627.0 39.8 406 1-418 68-511 (765) 6 protein:vir:99563 Length: 862 100.0 3E-110 2E-113 621.4 40.0 406 1-418 97-537 (862) 7 protein:vir:94049 Length: 532 100.0 1E-109 9E-113 617.3 38.8 406 1-418 48-489 (532) 8 protein:vir:107742 Length: 537 100.0 1E-108 7E-112 612.4 38.3 406 1-418 67-516 (537) 9 protein:vir:106716 Length: 698 100.0 2E-108 1E-111 611.4 34.1 407 1-418 82-529 (698) 10 protein:vir:78589 Length: 695 100.0 1E-107 7E-111 606.8 34.5 407 1-418 82-529 (695) 11 protein:vir:3648 Length: 695 # 100.0 1E-107 7E-111 606.8 34.3 407 1-418 82-529 (695) 12 protein:vir:101541 Length: 694 100.0 2E-107 9E-111 606.2 34.4 406 1-418 71-528 (694) 13 protein:vir:80040 Length: 461 100.0 2E-103 1E-106 583.8 40.5 401 1-415 13-461 (461) 14 protein:vir:105782 Length: 449 100.0 1E-98 6E-102 557.8 35.4 397 1-418 19-449 (449) 15 protein:vir:103219 Length: 201 100.0 3.8E-57 2.4E-60 329.9 20.4 194 219-414 1-201 (201) 16 protein:vir:79772 Length: 648 99.9 2.1E-25 1.3E-28 155.9 28.9 382 1-418 62-494 (648) 17 protein:vir:100882 Length: 383 99.9 7.8E-23 4.8E-26 141.8 26.0 355 1-414 19-383 (383) 18 protein:vir:100187 Length: 385 99.8 3.2E-21 2E-24 132.9 26.8 356 1-415 1-385 (385) 19 protein:vir:102118 Length: 409 99.8 4.3E-21 2.7E-24 132.2 26.7 362 1-414 14-409 (409) 20 protein:vir:1380 Length: 422 # 99.8 4.5E-21 2.8E-24 132.1 26.2 373 1-416 1-422 (422) 21 protein:vir:4454 Length: 414 # 99.8 2.6E-20 1.6E-23 127.9 28.0 361 1-418 1-413 (414) 22 protein:vir:4156 Length: 542 # 99.8 4.5E-20 2.8E-23 126.7 28.8 382 1-418 11-443 (542) 23 protein:vir:483 Length: 413 # 99.8 2.6E-20 1.6E-23 127.9 27.5 362 1-418 15-412 (413) 24 protein:vir:102080 Length: 429 99.8 7.9E-20 4.9E-23 125.3 29.8 370 1-416 1-429 (429) 25 protein:vir:102855 Length: 432 99.8 9.8E-20 6.1E-23 124.8 29.4 371 1-416 1-432 (432) 26 protein:vir:107605 Length: 432 99.8 9.8E-20 6.1E-23 124.8 29.4 371 1-416 1-432 (432) 27 protein:vir:105002 Length: 432 99.8 9.8E-20 6.1E-23 124.8 29.4 371 1-416 1-432 (432) 28 protein:vir:80644 Length: 551 99.8 1.1E-19 6.9E-23 124.5 29.0 382 1-418 37-516 (551) 29 protein:vir:81152 Length: 411 99.8 4E-20 2.5E-23 126.9 26.4 370 1-415 1-411 (411) 30 protein:vir:100691 Length: 535 99.8 1.6E-19 9.9E-23 123.6 29.2 387 1-418 21-507 (535) 31 protein:vir:7853 Length: 518 # 99.8 5.5E-20 3.4E-23 126.2 26.6 366 1-418 1-440 (518) 32 protein:vir:3843 Length: 397 # 99.8 5.8E-20 3.6E-23 126.0 26.6 354 1-414 15-397 (397) 33 protein:vir:8100 Length: 466 # 99.8 1.5E-19 9.4E-23 123.8 28.4 387 1-418 19-463 (466) 34 protein:vir:4952 Length: 386 # 99.8 2.1E-19 1.3E-22 123.0 28.3 358 1-413 15-386 (386) 35 protein:vir:1431 Length: 419 # 99.8 3.2E-19 2E-22 122.0 29.2 363 1-418 15-419 (419) 36 protein:vir:97060 Length: 432 99.8 3.5E-19 2.2E-22 121.8 29.3 366 1-418 7-431 (432) 37 protein:vir:4337 Length: 434 # 99.8 6.6E-20 4.1E-23 125.7 25.2 355 1-414 1-434 (434) 38 protein:vir:8418 Length: 409 # 99.8 2.6E-19 1.6E-22 122.5 28.1 362 4-417 1-409 (409) 39 protein:vir:63755 Length: 547 99.8 2.9E-19 1.8E-22 122.2 28.2 382 1-418 18-512 (547) 40 protein:vir:3153 Length: 467 # 99.8 1.6E-19 1E-22 123.6 25.8 348 32-418 1-447 (467) 41 protein:vir:101648 Length: 518 99.8 3.3E-19 2E-22 121.9 26.6 366 1-418 1-440 (518) 42 protein:vir:4194 Length: 540 # 99.8 1.4E-18 8.4E-22 118.5 29.4 380 1-418 8-441 (540) 43 protein:vir:10362 Length: 432 99.8 1.9E-18 1.2E-21 117.8 29.9 366 1-418 7-431 (432) 44 protein:vir:2683 Length: 412 # 99.8 1.1E-18 6.9E-22 119.0 28.4 366 1-415 10-412 (412) 45 protein:vir:5737 Length: 419 # 99.8 6.9E-19 4.3E-22 120.1 26.6 366 1-418 1-413 (419) 46 protein:vir:80796 Length: 574 99.8 3.5E-18 2.2E-21 116.3 29.7 389 1-418 41-524 (574) 47 protein:vir:1326 Length: 457 # 99.8 1.1E-18 7E-22 119.0 26.5 370 4-418 1-444 (457) 48 protein:vir:81072 Length: 432 99.8 4.2E-18 2.6E-21 115.9 29.4 366 1-418 7-431 (432) 49 protein:vir:7407 Length: 392 # 99.8 2.7E-18 1.6E-21 116.9 28.1 364 2-418 1-391 (392) 50 protein:vir:1023 Length: 392 # 99.8 2.2E-18 1.3E-21 117.4 27.6 364 2-418 1-391 (392) 51 protein:vir:3989 Length: 392 # 99.8 2.2E-18 1.3E-21 117.4 27.6 364 2-418 1-391 (392) 52 protein:vir:960 Length: 413 # 99.8 9.2E-19 5.7E-22 119.5 25.5 360 1-413 10-413 (413) 53 protein:vir:100150 Length: 437 99.8 1.9E-18 1.2E-21 117.7 26.9 366 1-418 1-433 (437) 54 protein:vir:1266 Length: 416 # 99.8 4.5E-18 2.8E-21 115.7 28.5 353 2-415 1-416 (416) 55 protein:vir:4598 Length: 416 # 99.8 1.9E-18 1.2E-21 117.7 26.4 361 1-414 13-416 (416) 56 protein:vir:81095 Length: 416 99.8 1.9E-18 1.2E-21 117.7 26.4 361 1-414 13-416 (416) 57 protein:vir:80333 Length: 419 99.8 3.8E-18 2.4E-21 116.1 27.7 362 1-418 15-419 (419) 58 protein:vir:102727 Length: 945 99.7 3.7E-18 2.3E-21 116.2 27.5 380 1-418 70-539 (945) 59 protein:vir:96579 Length: 576 99.7 6.8E-18 4.2E-21 114.7 28.9 382 1-418 25-528 (576) 60 protein:vir:99312 Length: 563 99.7 1.8E-17 1.1E-20 112.4 30.8 382 1-418 23-525 (563) 61 protein:vir:95599 Length: 563 99.7 1.8E-17 1.1E-20 112.4 30.8 382 1-418 23-525 (563) 62 protein:vir:95378 Length: 406 99.7 2.1E-18 1.3E-21 117.5 25.2 351 1-414 16-406 (406) 63 protein:vir:105064 Length: 421 99.7 5.1E-18 3.2E-21 115.4 27.2 367 1-418 1-418 (421) 64 protein:vir:6240 Length: 457 # 99.7 5.4E-18 3.3E-21 115.3 27.1 370 5-418 1-457 (457) 65 protein:vir:94426 Length: 409 99.7 7.7E-18 4.8E-21 114.4 27.8 365 2-415 1-409 (409) 66 protein:vir:93943 Length: 409 99.7 6.7E-18 4.2E-21 114.7 27.2 363 1-415 1-409 (409) 67 protein:vir:93610 Length: 454 99.7 1.3E-17 8.2E-21 113.1 28.3 366 1-418 17-435 (454) 68 protein:vir:9408 Length: 441 # 99.7 5E-18 3.1E-21 115.4 26.0 366 1-414 22-441 (441) 69 protein:vir:79984 Length: 441 99.7 5E-18 3.1E-21 115.4 26.0 366 1-414 22-441 (441) 70 protein:vir:94666 Length: 723 99.7 1.1E-17 6.5E-21 113.7 27.5 368 7-418 1-412 (723) 71 protein:vir:96980 Length: 409 99.7 1.6E-17 1E-20 112.6 28.4 362 1-415 1-409 (409) 72 protein:vir:1884 Length: 424 # 99.7 7E-18 4.3E-21 114.6 24.9 362 1-416 9-424 (424) 73 protein:vir:9702 Length: 406 # 99.7 8.6E-18 5.3E-21 114.1 25.4 358 1-418 11-401 (406) 74 protein:vir:98396 Length: 441 99.7 1.9E-17 1.2E-20 112.2 26.8 361 1-414 3-441 (441) 75 protein:vir:189 Length: 424 # 99.7 1E-17 6.3E-21 113.8 25.3 362 1-416 10-424 (424) 76 protein:vir:4509 Length: 424 # 99.7 5E-17 3.1E-20 110.0 29.0 361 1-414 9-424 (424) 77 protein:vir:4828 Length: 382 # 99.7 4.7E-17 2.9E-20 110.1 28.0 361 5-414 1-382 (382) 78 protein:vir:100249 Length: 431 99.7 3.2E-17 2E-20 111.0 26.9 356 1-413 1-431 (431) 79 protein:vir:9359 Length: 348 # 99.7 3.5E-17 2.2E-20 110.8 26.5 320 45-415 1-348 (348) 80 protein:vir:4854 Length: 386 # 99.7 6.1E-17 3.8E-20 109.5 27.2 359 5-414 1-386 (386) 81 protein:vir:8317 Length: 409 # 99.7 2.4E-17 1.5E-20 111.7 23.9 348 1-416 16-409 (409) 82 protein:vir:4995 Length: 384 # 99.7 1.2E-16 7.3E-20 107.9 26.3 363 1-417 15-384 (384) 83 protein:vir:81218 Length: 423 99.7 1.8E-16 1.1E-19 106.9 26.8 374 5-415 1-423 (423) 84 protein:vir:7987 Length: 456 # 99.7 1.2E-17 7.5E-21 113.3 20.2 375 1-418 21-443 (456) 85 protein:vir:101647 Length: 460 99.7 2.3E-16 1.4E-19 106.3 26.9 377 7-414 1-460 (460) 86 protein:vir:6210 Length: 394 # 99.7 3.1E-16 1.9E-19 105.6 26.4 353 1-415 1-394 (394) 87 protein:vir:1082 Length: 359 # 99.6 3E-16 1.9E-19 105.7 25.0 343 1-395 12-359 (359) 88 protein:vir:3868 Length: 417 # 99.6 3.3E-16 2E-19 105.5 24.8 363 2-418 1-415 (417) 89 protein:vir:105819 Length: 456 99.6 8.2E-17 5.1E-20 108.8 20.4 374 2-418 1-443 (456) 90 protein:vir:102602 Length: 456 99.6 8.2E-17 5.1E-20 108.8 20.4 374 2-418 1-443 (456) 91 protein:vir:9507 Length: 395 # 99.6 3.4E-16 2.1E-19 105.4 23.2 351 4-416 1-395 (395) 92 protein:vir:101289 Length: 395 99.6 3.4E-16 2.1E-19 105.4 23.2 351 4-416 1-395 (395) 93 protein:vir:100650 Length: 395 99.6 3.4E-16 2.1E-19 105.4 23.2 351 4-416 1-395 (395) 94 protein:vir:99452 Length: 651 99.6 1.3E-15 8.1E-19 102.2 26.4 390 1-418 12-536 (651) 95 protein:vir:80134 Length: 403 99.6 1E-15 6.4E-19 102.7 24.7 353 5-414 1-403 (403) 96 protein:vir:99072 Length: 479 99.6 8.5E-16 5.3E-19 103.2 23.6 375 1-418 9-472 (479) 97 protein:vir:104259 Length: 403 99.6 3.1E-15 1.9E-18 100.1 24.6 356 5-414 1-403 (403) 98 protein:vir:9641 Length: 395 # 99.6 4.5E-15 2.8E-18 99.2 25.0 358 5-415 1-395 (395) 99 protein:vir:98444 Length: 434 99.6 1.8E-15 1.1E-18 101.4 22.6 368 21-415 1-434 (434) 100 protein:vir:95965 Length: 385 99.6 5.3E-15 3.3E-18 98.8 25.1 351 4-414 1-385 (385) 101 protein:vir:5961 Length: 503 # 99.6 6.6E-15 4.1E-18 98.3 24.7 367 1-418 12-503 (503) 102 protein:vir:95113 Length: 474 99.6 1.6E-14 9.9E-18 96.2 26.6 368 1-414 43-474 (474) 103 protein:vir:97447 Length: 474 99.6 2.2E-14 1.3E-17 95.5 27.0 374 1-414 43-474 (474) 104 protein:vir:94498 Length: 474 99.6 2.2E-14 1.3E-17 95.5 27.0 374 1-414 43-474 (474) 105 protein:vir:93747 Length: 472 99.6 4.7E-14 2.9E-17 93.7 28.5 382 1-414 40-472 (472) 106 protein:vir:78227 Length: 480 99.5 1.9E-15 1.2E-18 101.3 20.5 393 1-418 1-470 (480) 107 protein:vir:4898 Length: 502 # 99.5 1.8E-14 1.1E-17 96.0 25.8 394 1-418 49-502 (502) 108 protein:vir:95806 Length: 440 99.5 1E-14 6.4E-18 97.2 24.5 383 1-414 10-440 (440) 109 protein:vir:98643 Length: 395 99.5 1.3E-14 8.3E-18 96.6 23.6 358 5-415 1-395 (395) 110 protein:vir:94805 Length: 492 99.5 1.1E-13 6.6E-17 91.7 28.2 382 1-414 35-492 (492) 111 protein:vir:2427 Length: 485 # 99.5 7.3E-15 4.5E-18 98.1 21.6 394 1-418 12-478 (485) 112 protein:vir:97336 Length: 492 99.5 1.8E-13 1.1E-16 90.5 28.9 381 1-414 60-492 (492) 113 protein:vir:1236 Length: 483 # 99.5 1.6E-13 1E-16 90.7 28.6 382 1-414 51-483 (483) 114 protein:vir:105292 Length: 478 99.5 8E-14 4.9E-17 92.4 26.9 381 1-414 42-478 (478) 115 protein:vir:104082 Length: 485 99.5 1.1E-14 6.7E-18 97.1 22.1 389 1-418 8-485 (485) 116 protein:vir:2732 Length: 501 # 99.5 8.1E-14 5E-17 92.3 26.3 390 1-414 56-501 (501) 117 protein:vir:79043 Length: 479 99.5 6E-14 3.7E-17 93.1 25.3 369 1-413 6-479 (479) 118 protein:vir:78537 Length: 480 99.5 8.5E-15 5.3E-18 97.7 20.4 393 1-418 1-470 (480) 119 protein:vir:78310 Length: 376 99.5 3.9E-14 2.4E-17 94.1 23.5 347 5-415 1-376 (376) 120 protein:vir:9871 Length: 429 # 99.5 2E-13 1.2E-16 90.2 27.1 370 1-412 17-429 (429) 121 protein:vir:99522 Length: 470 99.5 1.5E-13 9.2E-17 90.9 26.3 373 1-414 24-470 (470) 122 protein:vir:96494 Length: 501 99.5 1.1E-13 6.9E-17 91.6 25.6 393 1-418 48-501 (501) 123 protein:vir:99781 Length: 511 99.5 8.8E-14 5.5E-17 92.2 24.5 390 1-414 39-511 (511) 124 protein:vir:9306 Length: 511 # 99.5 9.8E-14 6.1E-17 91.9 24.5 389 1-414 39-511 (511) 125 protein:vir:107112 Length: 478 99.5 4.9E-13 3.1E-16 88.1 27.6 380 1-414 42-478 (478) 126 protein:vir:3609 Length: 452 # 99.5 5E-13 3.1E-16 88.0 27.6 367 1-414 33-452 (452) 127 protein:vir:106639 Length: 481 99.5 3.5E-14 2.2E-17 94.4 21.0 391 1-415 29-481 (481) 128 protein:vir:7768 Length: 484 # 99.5 4.5E-14 2.8E-17 93.8 21.5 391 1-417 28-484 (484) 129 protein:vir:9751 Length: 422 # 99.5 9.8E-14 6.1E-17 91.9 23.4 366 1-411 3-422 (422) 130 protein:vir:9568 Length: 410 # 99.5 1.5E-13 9E-17 91.0 24.0 356 4-412 1-410 (410) 131 protein:vir:102950 Length: 471 99.5 2.3E-13 1.4E-16 89.9 25.1 372 1-414 21-471 (471) 132 protein:vir:2341 Length: 488 # 99.4 3.3E-14 2.1E-17 94.5 20.0 389 1-418 1-488 (488) 133 protein:vir:103951 Length: 511 99.4 3.4E-13 2.1E-16 88.9 25.1 383 1-414 39-511 (511) 134 protein:vir:105461 Length: 470 99.4 6.4E-13 4E-16 87.4 26.3 381 1-414 21-470 (470) 135 protein:vir:105889 Length: 474 99.4 7.2E-13 4.5E-16 87.1 26.6 376 2-414 1-474 (474) 136 protein:vir:94101 Length: 474 99.4 7.2E-13 4.5E-16 87.1 26.6 376 2-414 1-474 (474) 137 protein:vir:97171 Length: 512 99.4 7.2E-13 4.5E-16 87.2 26.5 384 1-414 31-512 (512) 138 protein:vir:96240 Length: 511 99.4 3.4E-13 2.1E-16 88.9 24.6 382 1-414 39-511 (511) 139 protein:vir:96366 Length: 511 99.4 4.2E-13 2.6E-16 88.4 24.9 390 1-414 39-511 (511) 140 protein:vir:78805 Length: 511 99.4 4.2E-13 2.6E-16 88.4 24.9 390 1-414 39-511 (511) 141 protein:vir:96179 Length: 468 99.4 3.6E-13 2.2E-16 88.8 24.4 369 1-418 42-461 (468) 142 protein:vir:106571 Length: 499 99.4 2.9E-12 1.8E-15 83.8 29.2 395 1-418 4-491 (499) 143 protein:vir:96839 Length: 474 99.4 4.6E-13 2.9E-16 88.2 24.6 365 1-414 42-474 (474) 144 protein:vir:94742 Length: 409 99.4 9.7E-13 6E-16 86.4 25.8 361 2-401 1-409 (409) 145 protein:vir:733 Length: 453 # 99.4 3.1E-13 1.9E-16 89.1 23.0 368 1-416 33-453 (453) 146 protein:vir:94002 Length: 378 99.4 4.5E-14 2.8E-17 93.8 18.4 322 8-414 1-378 (378) 147 protein:vir:95899 Length: 474 99.4 1.5E-12 9.2E-16 85.4 26.5 371 1-414 43-474 (474) 148 protein:vir:96266 Length: 474 99.4 1.5E-12 9.2E-16 85.4 26.5 371 1-414 43-474 (474) 149 protein:vir:2500 Length: 501 # 99.4 3.5E-13 2.2E-16 88.9 22.8 379 1-418 22-496 (501) 150 protein:vir:94546 Length: 506 99.4 1.1E-12 6.8E-16 86.1 25.5 396 1-416 21-506 (506) 151 protein:vir:99916 Length: 504 99.4 2.8E-12 1.7E-15 84.0 27.4 399 1-415 12-504 (504) 152 protein:vir:78083 Length: 537 99.4 2.8E-12 1.7E-15 84.0 27.3 382 1-418 30-496 (537) 153 protein:vir:102330 Length: 451 99.4 2.4E-12 1.5E-15 84.3 26.9 376 2-414 1-451 (451) 154 protein:vir:3964 Length: 453 # 99.4 2E-12 1.2E-15 84.7 26.0 369 1-414 33-453 (453) 155 protein:vir:4089 Length: 395 # 99.4 1.2E-12 7.2E-16 86.0 24.4 355 1-418 1-395 (395) 156 protein:vir:93867 Length: 378 99.4 9.7E-14 6E-17 91.9 18.3 324 5-414 1-378 (378) 157 protein:vir:78641 Length: 278 99.4 9E-13 5.6E-16 86.6 22.1 263 45-354 1-278 (278) 158 protein:vir:4223 Length: 486 # 99.3 4.9E-13 3E-16 88.1 20.4 396 1-418 29-481 (486) 159 protein:vir:80680 Length: 441 99.3 1.4E-12 8.5E-16 85.6 22.7 376 2-418 1-435 (441) 160 protein:vir:1661 Length: 378 # 99.3 3E-13 1.8E-16 89.3 18.4 322 5-414 1-378 (378) 161 protein:vir:1634 Length: 409 # 99.3 5.5E-12 3.4E-15 82.3 24.6 359 2-401 1-409 (409) 162 protein:vir:38 Length: 496 # N 99.3 1.2E-12 7.4E-16 86.0 20.9 385 1-414 31-496 (496) 163 protein:vir:96738 Length: 505 99.3 1.2E-11 7.2E-15 80.5 26.0 381 1-414 1-505 (505) 164 protein:vir:94869 Length: 378 99.3 2.7E-12 1.7E-15 84.0 21.2 320 1-414 1-378 (378) 165 protein:vir:79538 Length: 502 99.3 4.7E-11 2.9E-14 77.2 27.5 389 1-415 1-502 (502) 166 protein:vir:10321 Length: 495 99.3 2.8E-11 1.7E-14 78.5 26.0 385 1-414 3-495 (495) 167 protein:vir:9922 Length: 489 # 99.3 2.8E-11 1.7E-14 78.5 25.5 389 1-409 32-489 (489) 168 protein:vir:389 Length: 530 # 99.3 8.5E-11 5.3E-14 75.8 28.1 395 1-418 6-530 (530) 169 protein:vir:3420 Length: 533 # 99.2 1.9E-10 1.2E-13 73.9 29.4 396 1-418 9-532 (533) 170 protein:vir:107880 Length: 491 99.2 1.2E-10 7.3E-14 75.0 28.1 364 1-418 5-410 (491) 171 protein:vir:79703 Length: 505 99.2 4.6E-12 2.9E-15 82.7 20.3 398 1-418 1-503 (505) 172 protein:vir:8184 Length: 474 # 99.2 9.2E-11 5.7E-14 75.6 27.3 387 1-418 6-473 (474) 173 protein:vir:80959 Length: 499 99.2 1.5E-11 9.1E-15 80.0 22.1 387 3-414 1-499 (499) 174 protein:vir:9815 Length: 500 # 99.2 3E-11 1.9E-14 78.3 23.2 394 1-418 1-499 (500) 175 protein:vir:3028 Length: 500 # 99.2 3E-11 1.9E-14 78.3 23.2 394 1-418 1-499 (500) 176 protein:vir:858 Length: 378 # 99.2 7.5E-12 4.7E-15 81.6 19.0 319 8-414 1-378 (378) 177 protein:vir:6382 Length: 553 # 99.2 4.7E-10 2.9E-13 71.7 27.5 391 1-414 1-553 (553) 178 protein:vir:79063 Length: 491 99.1 1.8E-09 1.1E-12 68.5 30.2 361 1-418 27-410 (491) 179 protein:vir:95542 Length: 548 99.1 3.9E-10 2.4E-13 72.1 25.0 389 1-418 1-501 (548) 180 protein:vir:78907 Length: 518 99.1 7.7E-11 4.8E-14 76.0 20.9 395 1-418 4-512 (518) 181 protein:vir:267 Length: 348 # 99.1 4E-10 2.5E-13 72.1 23.6 301 1-362 29-348 (348) 182 protein:vir:1587 Length: 508 # 99.1 3.3E-10 2.1E-13 72.6 23.0 390 1-418 1-503 (508) 183 protein:vir:79150 Length: 368 99.0 1.9E-10 1.2E-13 73.9 19.9 326 1-367 20-368 (368) 184 protein:vir:4782 Length: 522 # 99.0 7.1E-10 4.4E-13 70.7 22.1 399 1-415 1-522 (522) 185 protein:vir:2013 Length: 344 # 99.0 2E-09 1.3E-12 68.2 23.0 296 1-356 34-344 (344) 186 protein:vir:5839 Length: 533 # 98.9 9.5E-10 5.9E-13 70.1 20.0 384 1-418 20-473 (533) 187 protein:vir:3780 Length: 345 # 98.9 6.7E-09 4.2E-12 65.4 24.0 300 1-356 28-345 (345) 188 protein:vir:103860 Length: 528 98.9 2E-08 1.2E-11 62.8 29.2 357 1-418 22-456 (528) 189 protein:vir:6058 Length: 344 # 98.9 4.1E-09 2.5E-12 66.6 22.4 296 1-356 34-344 (344) 190 protein:vir:98816 Length: 446 98.9 1.5E-08 9.2E-12 63.5 25.3 381 1-409 18-446 (446) 191 protein:vir:78191 Length: 351 98.9 3.6E-09 2.2E-12 66.9 21.8 302 1-360 12-351 (351) 192 protein:vir:79207 Length: 351 98.9 6.4E-09 3.9E-12 65.5 22.5 301 1-360 12-351 (351) 193 protein:vir:103971 Length: 376 98.8 2.5E-09 1.5E-12 67.8 19.9 300 1-360 64-376 (376) 194 protein:vir:98883 Length: 517 98.8 3.2E-08 2E-11 61.7 24.2 392 1-414 1-517 (517) 195 protein:vir:3743 Length: 345 # 98.8 2.7E-08 1.7E-11 62.1 23.4 299 2-353 1-345 (345) 196 protein:vir:78749 Length: 337 98.8 3.6E-08 2.2E-11 61.4 23.8 289 1-354 28-337 (337) 197 protein:vir:100328 Length: 346 98.7 2.3E-08 1.4E-11 62.4 22.3 297 1-358 32-346 (346) 198 protein:vir:5691 Length: 344 # 98.7 2.3E-08 1.4E-11 62.5 22.2 298 1-360 34-344 (344) 199 protein:vir:98567 Length: 340 98.7 4.2E-08 2.6E-11 61.0 23.1 296 1-357 31-340 (340) 200 protein:vir:1150 Length: 350 # 98.7 1.6E-08 1E-11 63.3 20.9 297 1-357 14-350 (350) 201 protein:vir:99232 Length: 526 98.7 1E-07 6.2E-11 59.0 30.6 365 1-418 11-432 (526) 202 protein:vir:101494 Length: 527 98.6 5.4E-08 3.3E-11 60.4 20.7 379 12-418 1-521 (527) 203 protein:vir:102239 Length: 527 98.6 5.4E-08 3.4E-11 60.4 20.7 379 12-418 1-521 (527) 204 protein:vir:79233 Length: 526 98.5 3.1E-07 1.9E-10 56.3 30.4 364 1-418 11-436 (526) 205 protein:vir:104500 Length: 537 98.5 3.5E-07 2.2E-10 56.0 23.3 383 1-418 25-521 (537) 206 protein:vir:104892 Length: 558 98.5 2.8E-07 1.7E-10 56.5 21.7 379 1-418 22-514 (558) 207 protein:vir:108049 Length: 524 98.4 6.1E-07 3.8E-10 54.7 23.8 386 1-416 33-524 (524) 208 protein:vir:108215 Length: 469 98.4 7.3E-07 4.5E-10 54.2 29.0 380 1-418 4-469 (469) 209 protein:vir:7208 Length: 524 # 98.4 8.3E-07 5.2E-10 53.9 23.6 383 1-416 34-524 (524) 210 protein:vir:103458 Length: 524 98.4 8.5E-07 5.3E-10 53.9 23.6 383 1-416 34-524 (524) 211 protein:vir:99853 Length: 488 98.4 8.8E-07 5.5E-10 53.8 30.6 367 2-418 1-413 (488) 212 protein:vir:103177 Length: 533 98.4 9.7E-07 6E-10 53.6 22.4 382 1-418 15-515 (533) 213 protein:vir:106999 Length: 564 98.4 9.2E-07 5.7E-10 53.7 21.5 383 1-418 20-535 (564) 214 protein:vir:81017 Length: 521 98.4 1E-06 6.4E-10 53.4 22.2 385 1-416 30-521 (521) 215 protein:vir:6896 Length: 523 # 98.3 1.2E-06 7.5E-10 53.0 23.1 385 1-416 34-523 (523) 216 protein:vir:101806 Length: 516 98.3 1.3E-06 7.8E-10 52.9 22.9 382 1-417 29-516 (516) 217 protein:vir:101189 Length: 516 98.3 1.3E-06 7.8E-10 52.9 22.9 382 1-417 29-516 (516) 218 protein:vir:1986 Length: 512 # 98.3 1.8E-06 1.1E-09 52.0 29.5 364 1-418 11-427 (512) 219 protein:vir:97265 Length: 513 98.2 1.9E-06 1.2E-09 52.0 19.7 389 1-418 11-497 (513) 220 protein:vir:100598 Length: 516 98.2 3E-06 1.9E-09 50.8 23.2 383 1-418 10-508 (516) 221 protein:vir:6596 Length: 521 # 98.2 3.1E-06 2E-09 50.7 22.7 385 1-416 30-521 (521) 222 protein:vir:98853 Length: 219 98.1 6.8E-07 4.2E-10 54.4 15.2 200 120-358 1-219 (219) 223 protein:vir:4698 Length: 251 # 98.1 2.9E-07 1.8E-10 56.4 12.5 226 1-262 13-251 (251) 224 protein:vir:7430 Length: 563 # 98.1 5.9E-06 3.6E-09 49.3 24.3 375 12-418 1-525 (563) 225 protein:vir:106282 Length: 521 98.0 6.5E-06 4.1E-09 49.0 24.9 382 1-416 31-521 (521) 226 protein:vir:98265 Length: 524 98.0 8.4E-06 5.2E-09 48.4 24.1 382 1-416 35-524 (524) 227 protein:vir:95149 Length: 501 97.9 1.4E-05 8.7E-09 47.2 22.4 393 1-418 6-498 (501) 228 protein:vir:96783 Length: 488 97.7 3E-05 1.9E-08 45.3 20.8 376 1-416 32-488 (488) 229 protein:vir:78161 Length: 355 97.6 3.5E-05 2.2E-08 45.0 26.7 293 94-418 1-319 (355) 230 protein:vir:101418 Length: 569 97.6 3.8E-05 2.3E-08 44.8 19.2 405 1-418 42-545 (569) 231 protein:vir:79511 Length: 448 97.6 4.2E-05 2.6E-08 44.6 27.3 373 1-418 23-444 (448) 232 protein:vir:94956 Length: 452 97.3 0.0001 6.3E-08 42.5 23.5 371 2-415 1-452 (452) 233 protein:vir:78393 Length: 489 97.3 0.0001 6.5E-08 42.4 22.7 385 1-418 1-477 (489) 234 protein:vir:95254 Length: 488 97.3 0.00012 7.2E-08 42.2 29.3 392 1-418 10-483 (488) 235 protein:vir:5665 Length: 511 # 97.1 0.00018 1.1E-07 41.2 24.6 382 1-416 23-511 (511) 236 protein:vir:80453 Length: 535 97.0 0.00023 1.4E-07 40.6 22.4 390 1-418 37-514 (535) 237 protein:vir:95014 Length: 491 96.8 0.00033 2E-07 39.7 20.3 391 1-418 1-485 (491) 238 protein:vir:78942 Length: 510 96.2 0.00083 5.2E-07 37.5 20.2 375 1-418 32-490 (510) 239 protein:vir:1785 Length: 555 # 95.9 0.0013 8.3E-07 36.3 19.4 376 1-418 20-502 (555) 240 protein:vir:94709 Length: 522 95.5 0.002 1.2E-06 35.4 19.5 385 1-418 27-498 (522) 241 protein:vir:8883 Length: 543 # 95.4 0.0021 1.3E-06 35.3 20.9 387 1-418 29-500 (543) 242 protein:vir:7321 Length: 556 # 94.8 0.0034 2.1E-06 34.1 18.4 386 1-418 24-512 (556) 243 protein:vir:102668 Length: 547 94.4 0.0045 2.8E-06 33.5 17.6 384 1-418 31-521 (547) 244 protein:vir:6322 Length: 510 # 94.4 0.0046 2.9E-06 33.4 20.5 374 1-418 32-490 (510) 245 protein:vir:95315 Length: 559 93.4 0.0078 4.8E-06 32.2 16.3 389 1-418 24-521 (559) 246 protein:vir:2198 Length: 536 # 93.1 0.0088 5.4E-06 31.9 20.3 380 1-418 42-503 (536) 247 protein:vir:103330 Length: 517 92.2 0.012 7.6E-06 31.1 20.1 381 1-418 27-487 (517) 248 protein:vir:3361 Length: 535 # 92.2 0.012 7.7E-06 31.0 21.0 388 1-418 29-498 (535) 249 protein:vir:77981 Length: 448 91.9 0.014 8.5E-06 30.8 31.7 373 1-418 18-444 (448) 250 protein:vir:107822 Length: 555 91.7 0.014 9E-06 30.7 19.5 386 1-418 25-520 (555) 251 protein:vir:107404 Length: 555 91.7 0.014 9E-06 30.7 19.5 386 1-418 25-520 (555) 252 protein:vir:98506 Length: 555 91.7 0.014 9E-06 30.7 19.5 386 1-418 25-520 (555) 253 protein:vir:78696 Length: 542 91.7 0.015 9.2E-06 30.6 24.7 380 1-418 20-513 (542) 254 protein:vir:10447 Length: 536 91.5 0.016 9.6E-06 30.5 20.0 380 1-418 42-503 (536) 255 protein:vir:96988 Length: 516 91.4 0.016 9.8E-06 30.5 19.1 390 1-418 31-516 (516) 256 protein:vir:99672 Length: 532 90.9 0.018 1.1E-05 30.1 22.7 381 1-418 29-507 (532) 257 protein:vir:1538 Length: 535 # 90.4 0.021 1.3E-05 29.8 20.7 388 1-418 29-503 (535) 258 protein:vir:105641 Length: 516 88.1 0.034 2.1E-05 28.6 21.0 389 1-418 31-498 (516) 259 protein:vir:94572 Length: 535 87.1 0.041 2.5E-05 28.2 20.5 386 1-418 30-503 (535) 260 protein:vir:7017 Length: 515 # 86.8 0.043 2.6E-05 28.1 20.2 385 1-418 30-497 (515) 261 protein:vir:100039 Length: 522 86.5 0.045 2.8E-05 28.0 22.9 382 1-418 18-490 (522) 262 protein:vir:80211 Length: 514 79.9 0.1 6.2E-05 26.1 22.2 386 1-418 18-497 (514) 263 protein:vir:103765 Length: 549 75.7 0.14 8.9E-05 25.2 20.6 380 1-418 38-525 (549) 264 protein:vir:80165 Length: 651 74.2 0.16 0.0001 24.9 24.3 386 1-418 15-594 (651) 265 protein:vir:4073 Length: 279 # 35.0 1.3 0.00078 20.0 8.0 249 89-394 1-279 (279) No 1 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=100.00 E-value=1.8e-130 Score=731.87 Aligned_cols=415 Identities=93% Similarity=1.352 Sum_probs=391.8 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCchH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMTQ 80 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~~ 80 (418) |+++|||.|+|+|++++++.++++.++++++++++|++||++|+|||+||+||+|+||+|+|++++.+++++|++|++++ T Consensus 1 ~~~~D~~~n~~~gg~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~~~~~~~~~~~~~l~~~~ 80 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMTQ 80 (422) T ss_pred CccchhhHHHHcCCCCCccccCcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCccccCCCHHHHHHHHHHHhhHHH Confidence 99999999999999999999999999999999999999999999999999999999999999988889999999999999 Q ss_pred HHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcccccc Q lcl|NC_019404. 81 NINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYD 160 (418) Q Consensus 81 ~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~ 160 (418) +|++|++|+|+||+|+|++.++|++.+++||++++++++|+|+++|+++|..+++||.+|+||+|++|+|++.++...++ T Consensus 81 ~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~~fg~P~~y~v~~~~~~~~~~ 160 (422) T protein:vir:10 81 NINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNARFGEPLTYRITTNESDMFYD 160 (422) T ss_pred HHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcccCccccccCcceEEEEecCCCCccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999988777789 Q ss_pred cCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHH Q lcl|NC_019404. 161 VHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARL 240 (418) Q Consensus 161 iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~ 240 (418) |||||||+|.|+++|+++++.+++||+|+|++.||++|++|+++++++++|++++++.|+|+++++++++.++...++++ T Consensus 161 iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~ 240 (422) T protein:vir:10 161 VHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAELCDDSEGFGAARL 240 (422) T ss_pred eccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHhcCCccchHHHHH Confidence 99999999999999999999999999999998899999999999999999999999999999999999998888889999 Q ss_pred HHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHH Q lcl|NC_019404. 241 RLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLID 320 (418) Q Consensus 241 r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~ 320 (418) |+..+.+.+++++++++++++|+|++++++|||++++++++++.||++++||+|+|||+||+|||||||+|++|||++|+ T Consensus 241 r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~i~ 320 (422) T protein:vir:10 241 RLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALETFHKLVD 320 (422) T ss_pred HHHHHHHhcCCccceeEecCCcceEEEecccCChHHHHHHHHHHHHhhhCCCeeeeccCCcccccccchHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCC--- Q lcl|NC_019404. 321 RKRNAELLPILEFLIPFIVNAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKI--- 397 (418) Q Consensus 321 ~~Qe~~l~p~l~~l~~~i~~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~--- 397 (418) ++||+.++|+|++|+++|+++++|+|+|+|||++|+|||||+++++|+++++|+++|+++++|+|+.|+...++.++ T Consensus 321 ~~Qe~~l~p~l~~l~~~i~~s~~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~ 400 (422) T protein:vir:10 321 RKRNAELLPILEFLIPFIVNAEEWSVEFNPLAQESSKDKAEILEKNVNSIAALIAAGAMDIDEARDTLRTIAPEVKINDG 400 (422) T ss_pred HHHHHHHHHHHHHHHHHhcccCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHhhhhcccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999876665554 Q ss_pred -Chhhcccccc--cC-CCcccc Q lcl|NC_019404. 398 -GDNDIQTEES--EL-ITETEV 415 (418) Q Consensus 398 -~~~~~~~~e~--~~-~~e~e~ 415 (418) .+++++..+. ++ +.++++ T Consensus 401 ~~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 401 SVETEVTISETSNDPLEVPTDD 422 (422) T ss_pred CCccccchhhcCCCCCCCCCCC Confidence 3444433332 11 122222 No 2 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=100.00 E-value=5.8e-128 Score=718.13 Aligned_cols=417 Identities=53% Similarity=0.858 Sum_probs=389.3 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCchH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMTQ 80 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~~ 80 (418) +.++|||.|+ +|++.+++.++....+..++++++|++||++|+|||+||+||||+||+|+|++++++++++|++|++++ T Consensus 3 ~~~~d~~~~~-~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~~~~~~~~~~~l~~~~ 81 (427) T protein:vir:10 3 IVKHDGYNDI-FNGGADGSPKPFFMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKMSGVKDEKEFKSLWDSYKLDS 81 (427) T ss_pred ccccchHHHH-hhcCCCCcccCccccCchHHHHHHHHcCchhhhhhccchHHhhcCCccccCccHHHHHHHHHHHhhHHH Confidence 9999999996 677777778888888889999999999999999999999999999999999888889999999999999 Q ss_pred HHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcccccc Q lcl|NC_019404. 81 NINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYD 160 (418) Q Consensus 81 ~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~ 160 (418) +|++|++|+|+||+|+|+++++|+.++++|+++++++++|+|+++|+++|..++.||++|+||+|++|+|+++++...++ T Consensus 82 ~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~s~~fg~P~~y~v~~~~~~~~~~ 161 (427) T protein:vir:10 82 SLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTNARSPRYGEPEIYKVSPGDNMQPYL 161 (427) T ss_pred HHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccccCccccccCcceEEEEecCCCCcceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999887777789 Q ss_pred cCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHH Q lcl|NC_019404. 161 VHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARL 240 (418) Q Consensus 161 iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~ 240 (418) |||||||+|.|+++|+++++.+++||+|+|++++|++|++|+++++++++|++++++.|+|+++++++++.++.+.++++ T Consensus 162 iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~ 241 (427) T protein:vir:10 162 IHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARL 241 (427) T ss_pred EccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCccchHHHHH Confidence 99999999999999999999999999999988899999999999999999999999999999999999999888889999 Q ss_pred HHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHH Q lcl|NC_019404. 241 RLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLID 320 (418) Q Consensus 241 r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~ 320 (418) |+..+++.+++++++++++++|+|++++++|||++++++++++.||++++||+|+|||+||+|||||||+|++|||++|+ T Consensus 242 r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~ 321 (427) T protein:vir:10 242 RLAQVDDNSGVGRAIGIDAETEEYDVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVD 321 (427) T ss_pred HHHHHHHhcCcccceeeecCCCceeEEecccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCC--- Q lcl|NC_019404. 321 RKRNAELLPILEFLIPFIVNAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKI--- 397 (418) Q Consensus 321 ~~Qe~~l~p~l~~l~~~i~~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~--- 397 (418) ++||+.++|+|++|+++|+++++|+|+|+|||++|++|+||+++++|+++++|+++|+++++|+|+.|+..+++.++ T Consensus 322 ~~Qe~~l~p~l~~l~~~i~~s~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~ 401 (427) T protein:vir:10 322 RKREEDYRPLLEFLLPFIVDEEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDG 401 (427) T ss_pred HHHHHHHHHHHHHHHHHhhcCCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhhhccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999877665544 Q ss_pred ---ChhhcccccccCCCccccccC Q lcl|NC_019404. 398 ---GDNDIQTEESELITETEVVIA 418 (418) Q Consensus 398 ---~~~~~~~~e~~~~~e~e~~~~ 418 (418) ++++.++++..+++.+|++-. T Consensus 402 ~~~~~e~~~~~~e~~p~~~e~~~d 425 (427) T protein:vir:10 402 NNINIREPEETTEPEPGLGEKLED 425 (427) T ss_pred ccccccccchhcCCCCCCCCCCCC Confidence 334444433333333333333 No 3 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=100.00 E-value=3.2e-125 Score=703.08 Aligned_cols=414 Identities=55% Similarity=0.893 Sum_probs=381.5 Q ss_pred CccchhhHHHHhc-CCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCch Q lcl|NC_019404. 1 MVKTDSYANIFLG-GSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMT 79 (418) Q Consensus 1 ~~~~D~~~n~~~g-~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~ 79 (418) +.++|||.|.|.+ .++..........+++++|+++|++||++|+|||+||+||||+||+|+|+++.++++++|++|+++ T Consensus 12 ~~~~D~~~~~~~~~~g~~~~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~~~~~~~~~~~l~~~ 91 (435) T protein:vir:79 12 ITKEDGYNEIFGSKDGTFRPNAFYMQRAAFKALSQFYEEDGMARRIVDVIPEEMVTPGFKVDGVKNEKSFKSRWDELRLN 91 (435) T ss_pred chhhcchhhhhcccccccccCcccCCcCCHHHHHHHHhcCchhhhhhccchHHhhcCCceecCCChHHHHHHHHHHhhHH Confidence 9999999997643 233223344555678999999999999999999999999999999999998888999999999999 Q ss_pred HHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccccc Q lcl|NC_019404. 80 QNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFY 159 (418) Q Consensus 80 ~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~ 159 (418) ++|+++++|+|+||+|+|++.++|++++++||+.+|++++|+|+++|+++|..+++||++|+||+|++|+|++.++...+ T Consensus 92 ~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~~dp~sp~fg~P~~y~v~~~~~~~~~ 171 (435) T protein:vir:79 92 AKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHERETNARSVRYGEPKLYKISPGGDIPEF 171 (435) T ss_pred HHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchhhccCCcccccCcceEEEEecCCCCCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999988776778 Q ss_pred ccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHH Q lcl|NC_019404. 160 DVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAAR 239 (418) Q Consensus 160 ~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~ 239 (418) +|||||||+|+|+++|++.++.+++||+|||.+++|++|++|+++++++++|++++++.|+|++++++++++++...+++ T Consensus 172 ~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~ 251 (435) T protein:vir:79 172 FVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAAR 251 (435) T ss_pred EEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHH Confidence 99999999999999999999999999999998789999999999999999999999999999999999999888888899 Q ss_pred HHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHH Q lcl|NC_019404. 240 LRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLI 319 (418) Q Consensus 240 ~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I 319 (418) +|+..+++.+++++.+++++++|+|++++++|+|+++++++++++||++++||+|+|||+||+|||||||+|++|||++| T Consensus 252 ~r~~~~~~~~~~~~~~~i~~~~e~~e~~~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i 331 (435) T protein:vir:79 252 LRLAQVDDESGVGKAIGIDATDEEYEVLNSDVSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLI 331 (435) T ss_pred HHHHHHHHhcCCCCceeEecCCcceEEEecccCCHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHH Confidence 99999999999999999999989999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCCh Q lcl|NC_019404. 320 DRKRNAELLPILEFLIPFIVNAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGD 399 (418) Q Consensus 320 ~~~Qe~~l~p~l~~l~~~i~~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~ 399 (418) +++||+.++|+|++|+++++++++|+|+|+|||++|+||+||+++++|+++++|+++|+|+++|+|+.|+...+.+++.. T Consensus 332 ~~~Qe~~l~p~l~~l~~li~~s~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~ 411 (435) T protein:vir:79 332 DRKRVEDYKPILEFLLPFMISETEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRDTLRSICPDLKIMD 411 (435) T ss_pred HHHHHHHHHHHHHHHHHHhhcCCCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999988777666543 Q ss_pred hh---cccccc------cCCCccc Q lcl|NC_019404. 400 ND---IQTEES------ELITETE 414 (418) Q Consensus 400 ~~---~~~~e~------~~~~e~e 414 (418) ++ +++.++ ..-+|+| T Consensus 412 ~~~~~~~~~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 412 NDNIELPEPEDLDPEPGQEGGLNK 435 (435) T ss_pred cccccCCccccCCCCCCCCCCCCC Confidence 22 222222 1223344 No 4 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=100.00 E-value=2.3e-115 Score=649.07 Aligned_cols=408 Identities=20% Similarity=0.327 Sum_probs=365.9 Q ss_pred CccchhhHHHHhcCCCCcc----ccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH----HHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSE----IYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE----PAFWSR 72 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~----~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~----~~i~~~ 72 (418) |-++|||.|+++|+|+... .++++..+++++|+++|++||++|+|||+||+||+|+||+|+++++. ++++++ T Consensus 1 ~~~~D~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~~~~~~~~~~~ 80 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSNDLNSKQLDLFTKF 80 (437) T ss_pred CchhhhhHhHHhcCCCccccceeecCccccccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecCCCCHHHHHHHHHH Confidence 9999999999998776532 35777788999999999999999999999999999999999875422 468999 Q ss_pred HHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccc-cccccccccccCcceEEEEe Q lcl|NC_019404. 73 WDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ-NREENPRNARFGKPLTYRIT 151 (418) Q Consensus 73 ~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~-~~~~dp~s~~yg~p~~y~i~ 151 (418) |++|++|++|+++++|+||||+|+|++.+ |++++++||++.+.+++|+|+|+|+++|. .+++||++|+||+|++|+|+ T Consensus 81 ~~~l~~~~~l~~a~~~~rl~G~a~i~i~~-d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~p~~y~v~ 159 (437) T protein:vir:52 81 ERSLKLRETLTKALQWSSLYGSVGLLVVT-DSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPNFGRYSEYSIL 159 (437) T ss_pred HHhhcHHHHHHHHHHhcccccceEEEEEe-cCCCcccccccCCceeEEEEechhhccccccccccccccccCcceEEEEe Confidence 99999999999999999999999999877 67889999999999999999999999975 56789999999999999998 Q ss_pred cCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcC Q lcl|NC_019404. 152 TNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDD 231 (418) Q Consensus 152 ~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~ 231 (418) +++. .++|||||||||+|+++| ++.+++||.|+++ .+|++|++|+++++++++|++++++.|+|++++++.++. T Consensus 160 ~~~~--~~~iH~SRii~~~~~~~~---~~~~~~~G~s~le-~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~ 233 (437) T protein:vir:52 160 GGSQ--SITVHHSRLIILNANDAP---LSDNDIWGVSDLE-KIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAA 233 (437) T ss_pred cCCc--ceeEccceeEEecCccCC---CccccccCCchHH-HHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcC Confidence 7643 368999999999999988 4668899999886 599999999999999999999999999999999999887 Q ss_pred cchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHH Q lcl|NC_019404. 232 SEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTA 311 (418) Q Consensus 232 ~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d 311 (418) + ++..+.+|...+...+++++.+++|++ ++|++++++|+|+++++++++++||++++||+|+|||+||+|| |||++| T Consensus 234 ~-~~~~~~~~~~~~~~~~~~~~~~~~d~~-~~~e~~~~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Gl-asge~D 310 (437) T protein:vir:52 234 G-MENEVASVISAVQEIKSATNSLLLDAE-NEYDRKELTFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGL-ASGDED 310 (437) T ss_pred C-cHHHHHHHHHHHHHhcCCCceEEEcCC-cceEEEecCcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccc-cccHHH Confidence 5 567888999999999998899888875 8899999999999999999999999999999999999999999 789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcc------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_019404. 312 LETFHKLIDRKRNAELLPILEFLIPFIVNA------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEAR 385 (418) Q Consensus 312 ~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r 385 (418) +++||++|+++||+.++|++++|+++|+++ ++|+|+|+|||++|++|+||+++++|+++++|+++|+++++|+| T Consensus 311 ~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~r 390 (437) T protein:vir:52 311 IQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGLPADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQIA 390 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHH Confidence 999999999999999999999999999875 48999999999999999999999999999999999999999999 Q ss_pred HHHHhhcCcCCCChhhcccccccCC----CccccccC Q lcl|NC_019404. 386 DTLRTIAPEIKIGDNDIQTEESELI----TETEVVIA 418 (418) Q Consensus 386 ~~l~~~~~~~~~~~~~~~~~e~~~~----~e~e~~~~ 418 (418) +.|+..+.+.+++++++++.++..+ .+.....+ T Consensus 391 ~~L~~~g~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (437) T protein:vir:52 391 NELRESGLFANISAEHIEELKNADEFAGNFEEPEKME 427 (437) T ss_pred HHHHhcCCCCCCCccccccccCCCCCCCccCCCCCCC Confidence 9999998888888777655443211 11111222 No 5 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=100.00 E-value=2.4e-111 Score=627.02 Aligned_cols=406 Identities=16% Similarity=0.187 Sum_probs=345.0 Q ss_pred Cccchhh---------HHHHhcCCC--Cc---cccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcch- Q lcl|NC_019404. 1 MVKTDSY---------ANIFLGGSD--GS---EIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDD- 65 (418) Q Consensus 1 ~~~~D~~---------~n~~~g~~~--~~---~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d- 65 (418) -+++||. .|...|.+. .. ..+..+..+..++++++|++||++|+|||+||+||||+||+|+++++ T Consensus 68 ~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~alY~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e 147 (765) T protein:vir:96 68 SVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAIISQHWLVDKACSMSGEDAARNGWELKSDGRK 147 (765) T ss_pred ceeccccccccccchHHHhhhccCccchhhHHHhhhcccCCccHHHHHHHHhCchhhhhhhcchHHhhcCCceeecCccc Confidence 4566665 332222221 11 12344556677899999999999999999999999999999988543 Q ss_pred -----HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec--CCCcccccccC----CCceEEEEEeecccccc---c Q lcl|NC_019404. 66 -----EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK--DNRALTSPVRE----GAELETVRVYDRTQVKV---Q 131 (418) Q Consensus 66 -----~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~--d~~~l~~pl~~----~~~i~~i~v~~~~~i~~---~ 131 (418) .++|++++++|+++++|+++++|+|+||++++++.++ |+..|++||+. ++++++|++++++++.+ . T Consensus 148 ~~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~ 227 (765) T protein:vir:96 148 LSDEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTA 227 (765) T ss_pred cCHHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccccccccceeeEEEEechhhcccccch Confidence 2568999999999999999999999999999999885 67889999964 47999999999999887 3 Q ss_pred cccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 132 NREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQL 211 (418) Q Consensus 132 ~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l 211 (418) ++..||++|+||+|++|+|++ ++|||||||+|.|.++|+++++.+++||.|+++ .+|++|++|+++++++++| T Consensus 228 e~~~Dp~sp~fg~P~~y~i~g------~~IH~SRli~~~g~~lpd~lk~~~~~~G~Svlq-~~yd~I~~~~~t~~~~a~L 300 (765) T protein:vir:96 228 ESTADPSAEHFYEPDFWIISG------KKYHRSHLVVVRGPQPPDILKPTYIFGGIPLTQ-RIYERVYAAERTANEAPLL 300 (765) T ss_pred hccccccccccCcceeeeecC------ceeccceEEEecCCCchhhhccccCccCccHHH-HHHHHHHHHHHHHHHHHHH Confidence 678899999999999999954 589999999999999999999999999999885 5999999999999999999 Q ss_pred HHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhhhcC Q lcl|NC_019404. 212 LRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGI 291 (418) Q Consensus 212 ~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaas~I 291 (418) ++++++.|+|++++..+ . ..+++.+|+..+.+++++.+.++++++ |+|++++++||||+++++++++.||++++| T Consensus 301 l~k~~~~v~k~~~~~~l-~---~~~~l~~r~~~~~~~r~n~g~~~id~e-e~~e~~s~~lsgl~d~l~~~~~~iAaas~I 375 (765) T protein:vir:96 301 AMSKRTSTIHVDVEKAI-A---NEDAFNARLAFWIANRDNHGVKVIGID-ETMEQFDTNLSDFDSVIMNQYQLVAAIAKT 375 (765) T ss_pred HHHhccceeeechHhhh-c---cHHHHHHHHHHHHHhcCCceeEEecCC-cceeEEecccCCHHHHHHHHHHHHHhhhCC Confidence 99999999999976554 2 245688999999999998888888775 889999999999999999999999999999 Q ss_pred CeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC----CceEEeCCCCCCCHHHHHHHHHHHH Q lcl|NC_019404. 292 HEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNAE----EWSVEFSPLDHESSKDKAEVLEKSV 367 (418) Q Consensus 292 P~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~~----~~~~~f~pL~~~~eke~ae~~~~~a 367 (418) |+|+|||+||+|+|||||+|++|||++|+++||+.++|+|++|+++|+++. +|+|+|+|||++|++||||+++++| T Consensus 376 P~t~LfGqsp~GlnATGe~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~s~~i~~d~~i~FnpL~~~sekEkAei~~k~A 455 (765) T protein:vir:96 376 PATKLLGTSPKGFNATGEHETISYHEELESIQEHIFDPLLERHYLLLAKSESIDVQLEIVWNPVDSTTSQQQAELNNKKA 455 (765) T ss_pred CeeeeccCCcccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcceEEeCCCCCCCHHHHHHHHHHHH Confidence 999999999999999999999999999999999999999999999999864 8999999999999999999999999 Q ss_pred HHHHHHHhCCCCCHHHHHHHHHhhcC--cCCCChhhcccccccCC---CccccccC Q lcl|NC_019404. 368 NSIAALIAAGAMDIKEARDTLRTIAP--EIKIGDNDIQTEESELI---TETEVVIA 418 (418) Q Consensus 368 ~a~~~~~~~g~i~~~e~r~~l~~~~~--~~~~~~~~~~~~e~~~~---~e~e~~~~ 418 (418) +++++|+++|+|+++|+|+.|+.... +..+++++++.+....+ .+.+..-+ T Consensus 456 ea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~~~~~~~~~ 511 (765) T protein:vir:96 456 ATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPENLAELEKAGA 511 (765) T ss_pred HHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccccCCCccccccccCCCc Confidence 99999999999999999999975432 34455555443221111 11111110 No 6 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=100.00 E-value=2.6e-110 Score=621.37 Aligned_cols=406 Identities=14% Similarity=0.151 Sum_probs=346.3 Q ss_pred CccchhhHHHHhcCCCCc--ccc-------Ccc--ccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcc----- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGS--EIY-------GSL--QNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGID----- 64 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~--~~~-------~~~--~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~----- 64 (418) =+.+||+.|.+++.|+.+ +.| +++ ..+..++++++|++||++|+|||+||+||||+||+|.+.. T Consensus 97 ~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gyql~alY~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~ 176 (862) T protein:vir:99 97 GFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGHQACALIAQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEI 176 (862) T ss_pred hhhhhcchhhhhhccccccccccccchhccccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCceEeecCccccc Confidence 245699999886644432 222 111 1234568999999999999999999999999999998632 Q ss_pred ---hHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec--CCCcccccccC----CCceEEEEEeeccccccc---c Q lcl|NC_019404. 65 ---DEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK--DNRALTSPVRE----GAELETVRVYDRTQVKVQ---N 132 (418) Q Consensus 65 ---d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~--d~~~l~~pl~~----~~~i~~i~v~~~~~i~~~---~ 132 (418) ..++|++++++|+++++|+++++|+||||++++++.++ |+..|++||+. +|++++|+||++++++|. + T Consensus 177 ~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~ 256 (862) T protein:vir:99 177 DEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAE 256 (862) T ss_pred CHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhccccccc Confidence 23579999999999999999999999999999888764 67889999974 578999999999998863 5 Q ss_pred ccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 133 REENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLL 212 (418) Q Consensus 133 ~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~ 212 (418) ++.||++|+||+|++|+|++ ++|||||||+|.|+++|+++++.++|||.|+++ .+|++|++|+++++++++|+ T Consensus 257 ~~~Dp~sp~yGkP~~y~I~g------~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe-~iyd~L~~~d~t~~saa~Ll 329 (862) T protein:vir:99 257 STADPSSQFFYEPEFWIISG------QKYHRSHLIIARGPQPADILKPTYIFGGIPLVQ-RIYERVYAAERTANEAPLLA 329 (862) T ss_pred ccccccccccCCceeeeecC------eeeccceeEEecCCCchhhhhccCCccCccHHH-HHHHHHHHHHHHHHHHHHHH Confidence 78999999999999999954 589999999999999999999999999999886 59999999999999999999 Q ss_pred HHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhhhcCC Q lcl|NC_019404. 213 RRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGIH 292 (418) Q Consensus 213 ~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaas~IP 292 (418) +++++.|+|+++++.+. ....+.+|+..+++.+++.+.++++++ |+|++++++|||+++++++++++||++++|| T Consensus 330 ~ka~l~v~ktd~l~~l~----~ed~l~~r~~~~~~~rdN~Gi~liD~e-Ee~e~ls~slSGL~dll~~~~q~IAaas~IP 404 (862) T protein:vir:99 330 MNKRTTAIHTDTAKAIA----NEDKFIQRLMFWVRYRDNHAVKVLGTD-ETMEQFDTSLADFDAVIMGQYQLVASIAKTP 404 (862) T ss_pred HHhccceeechhHhhhc----cHHHHHHHHHHHHhccCcceeEEecCC-CceeEEecccCChHHHHHHHHHHHHhhhCCC Confidence 99999999999887653 245688899999999998888887775 8899999999999999999999999999999 Q ss_pred eeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc----CCceEEeCCCCCCCHHHHHHHHHHHHH Q lcl|NC_019404. 293 EIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNA----EEWSVEFSPLDHESSKDKAEVLEKSVN 368 (418) Q Consensus 293 ~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~----~~~~~~f~pL~~~~eke~ae~~~~~a~ 368 (418) +|+|||+||+|+|||||+|++|||++|+++|++.|+|+|++|+.+++++ .+|+|+|+|||++|++|+||+++++|+ T Consensus 405 ~tiLfGqspaGlnATGE~D~~nYyD~I~s~QE~~L~P~LerL~~li~~~lg~~~d~~ieFnpL~~~sekEkAEi~kk~Ae 484 (862) T protein:vir:99 405 ATKLLGTAPKGFNSTGEFETISYHEELESIQEHVYMPFLQRHYLISRLSLGIQHEIDVVMEPVASMTAQQQADLNKTKAE 484 (862) T ss_pred ceeecccCcccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcceEEeCCCCCCCHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999988753 689999999999999999999999999 Q ss_pred HHHHHHhCCCCCHHHHHHHHHhhcC--cCCCChhhcccccccCC-CccccccC Q lcl|NC_019404. 369 SIAALIAAGAMDIKEARDTLRTIAP--EIKIGDNDIQTEESELI-TETEVVIA 418 (418) Q Consensus 369 a~~~~~~~g~i~~~e~r~~l~~~~~--~~~~~~~~~~~~e~~~~-~e~e~~~~ 418 (418) ++++|+++|+|+++|+|+.|+.... +.++++++++++....+ ...+..-+ T Consensus 485 a~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~ 537 (862) T protein:vir:99 485 GGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKA 537 (862) T ss_pred HHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccC Confidence 9999999999999999999986443 45566666553221111 11111110 No 7 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=100.00 E-value=1.4e-109 Score=617.30 Aligned_cols=406 Identities=17% Similarity=0.216 Sum_probs=346.6 Q ss_pred CccchhhHHHHh---cCCC--Ccc---ccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcc-------h Q lcl|NC_019404. 1 MVKTDSYANIFL---GGSD--GSE---IYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGID-------D 65 (418) Q Consensus 1 ~~~~D~~~n~~~---g~~~--~~~---~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~-------d 65 (418) -...|++.|.++ |.++ .++ .++.+..+..++++++|++||++|+|||+||+||||+||+|.+.+ . T Consensus 48 ~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~~~~~ 127 (532) T protein:vir:94 48 PYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSWPGFPTLALLAQLPEYRTMHETPADECVRAWGKITCSSKDELAADK 127 (532) T ss_pred cccccccccccccccccCcccccccccccccccccchHHHHHHHHcCchhhhhhccchHHHhhCCceEeeCCccccchHH Confidence 233488888763 4333 322 456666778899999999999999999999999999999997632 1 Q ss_pred HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecC---CCccccccc------CCCceEEEEEeeccccccccc-cc Q lcl|NC_019404. 66 EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD---NRALTSPVR------EGAELETVRVYDRTQVKVQNR-EE 135 (418) Q Consensus 66 ~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d---~~~l~~pl~------~~~~i~~i~v~~~~~i~~~~~-~~ 135 (418) .++|+.++++|+++++|+++++|+|+||+|+|++.+++ ..+++.|+. .+|++++|+|+++|+++|... .. T Consensus 128 ~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~ 207 (532) T protein:vir:94 128 ATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNAT 207 (532) T ss_pred HHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEeccCCccccccccccccccccccceeeEEEeechheecccccccc Confidence 24689999999999999999999999999999999963 245565553 467899999999999999865 47 Q ss_pred cccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019404. 136 NPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRK 215 (418) Q Consensus 136 dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~ 215 (418) ||++|+||+|++|++.++ ++|||||||||.|+++|+++++.+++||.|+++ .+|++|++|+++++++++|++++ T Consensus 208 dp~sp~fg~P~~y~v~~g-----~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq-~~~~~l~~~~~t~~~~~~l~~~~ 281 (532) T protein:vir:94 208 DPTLPSFYKPDSWIATSG-----KKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQ-LAMPYVDNWLRTRQSVSDTVKQF 281 (532) T ss_pred cccccccCCceeEEEccC-----eeeccceEEEecCCCchhhhccccccccccHHH-HHHHHHHHHHHHHHHHHHHHHhc Confidence 999999999999998653 579999999999999999999999999999875 69999999999999999999999 Q ss_pred CCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhhhcCCeee Q lcl|NC_019404. 216 QQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGIHEII 295 (418) Q Consensus 216 ~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaas~IP~t~ 295 (418) ++.|+|+ +++++++.+ +...+.+|+..+.+.+++.+.+++++++|+|++++++|+||++++++++++||++++||+|+ T Consensus 282 ~~~v~k~-~~a~~ls~~-~~~~~~~r~~~~~~~~~n~g~~~id~~~e~~e~~~~~lsgl~~~l~~~~~~iAaa~~IP~t~ 359 (532) T protein:vir:94 282 SMTNLAT-DMAQLLAPG-GAQSLDARLQLFNLYRDNRNIGALDKGTEEIQQTNTPLSGLDSLQAQSQEQMAAVSHIPLVK 359 (532) T ss_pred CCceeee-chHHhhcch-hHHHHHHHHHHHHhhcCCccceEEcCCCceeEEEecccCCHHHHHHHHHHHHHhHhCCCeee Confidence 9999999 478887654 57889999999999999999999999889999999999999999999999999999999999 Q ss_pred eeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc------CCceEEeCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019404. 296 LKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNA------EEWSVEFSPLDHESSKDKAEVLEKSVNS 369 (418) Q Consensus 296 L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~------~~~~~~f~pL~~~~eke~ae~~~~~a~a 369 (418) |||+||+|||||||+|+++||++|+++||+.++|+|++|+++|+++ ++|+|+|+|||++|+||+||+++++|++ T Consensus 360 LfG~sp~GlnstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~~d~~~~f~pL~~~s~kEkAei~~~~a~a 439 (532) T protein:vir:94 360 LLGITPNGLNASSDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQIDPGLAWEWSPLMELDDKELAEVRQLNAST 439 (532) T ss_pred eecCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCceEEeCCCCCCCHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999975 4899999999999999999999999999 Q ss_pred HHHHHhCCCCCHHHHHHHHHhhcCcCCC-----ChhhcccccccCCCccccccC Q lcl|NC_019404. 370 IAALIAAGAMDIKEARDTLRTIAPEIKI-----GDNDIQTEESELITETEVVIA 418 (418) Q Consensus 370 ~~~~~~~g~i~~~e~r~~l~~~~~~~~~-----~~~~~~~~e~~~~~e~e~~~~ 418 (418) +++|+++|+|+++|+|+.|+.... .++ ..+++++.++. ..|++-. T Consensus 440 ~~~~~~~Gvi~~~Evr~~l~~~~~-~~~~~~~~~~~~~~~~~~~---~~~~~~~ 489 (532) T protein:vir:94 440 DSTLMELGVIDAKMVQQRLAADPT-SGYAGALGERDELDDVEEI---AKQLMAA 489 (532) T ss_pred HHHHHhcCCCCHHHHHHHHhcCCc-cccccccccccccccccch---hhhhccc Confidence 999999999999999999875332 111 11111111110 0000000 No 8 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=100.00 E-value=1.1e-108 Score=612.39 Aligned_cols=406 Identities=14% Similarity=0.158 Sum_probs=343.8 Q ss_pred CccchhhHHHHhc----CCCC----ccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcch------- Q lcl|NC_019404. 1 MVKTDSYANIFLG----GSDG----SEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDD------- 65 (418) Q Consensus 1 ~~~~D~~~n~~~g----~~~~----~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d------- 65 (418) -|++|++.+.+.+ +++. ...+++...+..++++++|++||++|+|||+||+||+|+||+|+++++ T Consensus 67 ~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~ 146 (537) T protein:vir:10 67 DMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIGHQMCALIATHWLVNKACSQMPRDAMRKGYKIISDDGNELDPKD 146 (537) T ss_pred chhccccccchhhhhhhccccccchhhhhccccCCccHHHHHHHHhCchhhhhhhhhhHHhhcCCceeecCCcccccHHH Confidence 4556665443321 1111 123445556678899999999999999999999999999999987532 Q ss_pred HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec--CCCcccccccC----CCceEEEEEeeccccccc---ccccc Q lcl|NC_019404. 66 EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK--DNRALTSPVRE----GAELETVRVYDRTQVKVQ---NREEN 136 (418) Q Consensus 66 ~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~--d~~~l~~pl~~----~~~i~~i~v~~~~~i~~~---~~~~d 136 (418) .++|++++++|+++++|+++++|+||||+++++|.++ |+..+++||+. +|++++|+|+++|+++|. ++..| T Consensus 147 ~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~d 226 (537) T protein:vir:10 147 AKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSN 226 (537) T ss_pred HHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEeecCcCCcccccccccccccccceeEEEEechhhcccccchhhhcc Confidence 2578999999999999999999999999999999885 77889999974 568999999999999873 57789 Q ss_pred ccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_019404. 137 PRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQ 216 (418) Q Consensus 137 p~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~ 216 (418) |++|+||+|++|+|.+ ++|||||||||+|+++|+++++.+++||.|+++ .+|++|++|+++++++++|+++++ T Consensus 227 p~sp~fg~P~~y~v~g------~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq-~~~~~l~~~~~t~~~~~~l~~~~~ 299 (537) T protein:vir:10 227 PVSMHFYEPTYWLING------KKYHRSHLAIYINDEVVDFLKPSYIYGGVPLPQ-QIMERVYAAERTANEGPMLAMTKR 299 (537) T ss_pred CCccccCCceeeeecC------eEecceeEEEecCCCCchhhhcccCcccccHHH-HHHHHHHHHHHHHHHHHHHHHhcC Confidence 9999999999999943 689999999999999999999999999999885 599999999999999999999999 Q ss_pred CceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhhhcCCeeee Q lcl|NC_019404. 217 QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGIHEIIL 296 (418) Q Consensus 217 ~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaas~IP~t~L 296 (418) +.|+|+++++.+ ++ .+.+.+|+..+++.+++.+.+++++++|+|++++++|+|++++++.++++||++++||+|+| T Consensus 300 ~~v~k~~~~~~l-~~---~~~~~~r~~~~~~~r~n~g~~~id~e~e~~e~~~~~lsgl~~~l~~~~~~iAa~~~IP~t~L 375 (537) T protein:vir:10 300 QTVLKVDAAQVL-AN---KQQFDETMSWWTATRDNYQVRVVDKDNEDVVQIDTTLNDLDKVIMNQYQLVCAIARTPAPKM 375 (537) T ss_pred CceeeechHHhh-cC---HHHHHHHHHHHHhhcCCcceeEecCCCceeEEEeccCCCHHHHHHHHHHHHHhhhCCCceee Confidence 999999987654 32 34678888999999999999999998899999999999999999999999999999999999 Q ss_pred eccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----CCceEEeCCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_019404. 297 KNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNA-----EEWSVEFSPLDHESSKDKAEVLEKSVNSIA 371 (418) Q Consensus 297 ~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~-----~~~~~~f~pL~~~~eke~ae~~~~~a~a~~ 371 (418) ||+||+|||||||+|+++||++|+++|+ .++|+|++++++|+++ .+|+|+|+|||++|+|||||+++++|++++ T Consensus 376 ~G~sp~GlnatGe~D~~~yyd~I~~~Qe-~l~p~l~~l~~ll~~~~~~~~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~ 454 (537) T protein:vir:10 376 LGTVPTGFNSTGDYEEASYHEECESTQD-DMRPLIDRHHQLVCRSHLRKRIRVKVEFPPMDAPKESERADTFLKKMQAAK 454 (537) T ss_pred ccCCccccccchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999 5999999999999875 389999999999999999999999999999 Q ss_pred HHHhCCCCCHHHHHHHHHhhc--CcCCC----Chhhccccccc-CCCcccccc--------C Q lcl|NC_019404. 372 ALIAAGAMDIKEARDTLRTIA--PEIKI----GDNDIQTEESE-LITETEVVI--------A 418 (418) Q Consensus 372 ~~~~~g~i~~~e~r~~l~~~~--~~~~~----~~~~~~~~e~~-~~~e~e~~~--------~ 418 (418) +|+++|+|+++|+|+.|+... .+.++ ++++.++..-+ ...+.|+.- | T Consensus 455 ~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~ 516 (537) T protein:vir:10 455 LAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDEGKPVRIIEDQPAPSEMF 516 (537) T ss_pred HHHHcCCCCHHHHHHHHhccCccccccccCCCChhhhhcccCCccCCcCCCCCCCCCccccC Confidence 999999999999999998642 22333 33333321110 001111111 1 No 9 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=100.00 E-value=1.7e-108 Score=611.45 Aligned_cols=407 Identities=16% Similarity=0.173 Sum_probs=340.2 Q ss_pred CccchhhHHHH--hcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCc--------------- Q lcl|NC_019404. 1 MVKTDSYANIF--LGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGI--------------- 63 (418) Q Consensus 1 ~~~~D~~~n~~--~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~--------------- 63 (418) .--.-+++..+ .|....+-.+-....+-.++.++...++|.+|+++++++++|+|+|+++.+. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~ 161 (698) T protein:vir:10 82 PRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGN 161 (698) T ss_pred ccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccceeccccchhhhhhccccccc Confidence 00001111111 1111111111112223356778888999999999999999999999886422 Q ss_pred -------chHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccc------cCCCceEEEEEeeccccc Q lcl|NC_019404. 64 -------DDEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPV------REGAELETVRVYDRTQVK 129 (418) Q Consensus 64 -------~d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl------~~~~~i~~i~v~~~~~i~ 129 (418) |+.++|++++++|++|.+++++++|+|+|||++++|.++ ++..+++|| ..+|++++|+|+|||+++ T Consensus 162 ~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vt 241 (698) T protein:vir:10 162 AASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVT 241 (698) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEEeecCccccccccccccccccCccceeeeeecccccc Confidence 233579999999999999999999999999999999885 456788888 247899999999999999 Q ss_pred ccccc-ccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 130 VQNRE-ENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLA 208 (418) Q Consensus 130 ~~~~~-~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~ 208 (418) |...+ .||++|+||+|++|+|.+ .+||+||+++|.|+|+|+++++.++|||.|..+ .+|++|.+|++++.++ T Consensus 242 P~~~n~~dP~spdfgkP~~y~V~G------~~IH~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q-~~~e~V~~~~rT~~~v 314 (698) T protein:vir:10 242 PNNYNSINPVADDFYKPSTWWMIG------SEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQ-LAMPYIDNWLRTRQSV 314 (698) T ss_pred cchhhhccchhhccCCCceEEEec------ceecceeEEEecCCCchhhhcchhccCCccHHH-HHHHHHHHHHHHhhhH Confidence 98665 599999999999999964 479999999999999999999999999999775 6999999999999999 Q ss_pred HHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhh Q lcl|NC_019404. 209 TQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVAL 288 (418) Q Consensus 209 ~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaa 288 (418) ++|++++++.++|++ +++.++.+ +...+.+|+..+++++++++.+++|+++|+|++++++||||++++++|+++||++ T Consensus 315 ~~Li~~~~~~~l~~d-la~aL~~g-~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~st~lSGLddVi~qf~q~VAga 392 (698) T protein:vir:10 315 SDIVKQFSVSGILMD-LAQALTPG-ANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPLSGLDALQAQAQEQMSAV 392 (698) T ss_pred HHHHHHhhHHHHHHH-HHHhcCCh-hhHHHHHHHHHHHHhcCccceEEEecCCcceEEEecCcCCHHHHHHHHHHHHHhh Confidence 999999999999986 88888765 4567889999999999999999999878999999999999999999999999999 Q ss_pred hcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc------CCceEEeCCCCCCCHHHHHHH Q lcl|NC_019404. 289 SGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNA------EEWSVEFSPLDHESSKDKAEV 362 (418) Q Consensus 289 s~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~------~~~~~~f~pL~~~~eke~ae~ 362 (418) ++||+|+||||||+|||||||+|++||||+|+++|++.|+|.|++++++|++| .+|+|+|+|||++|++|+||+ T Consensus 393 a~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp~i~~~fnPL~qmtd~EkAeI 472 (698) T protein:vir:10 393 SHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAEA 472 (698) T ss_pred hcCchhhhhccCCcccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHH Confidence 99999999999999999999999999999999999999999999999999987 589999999999999999999 Q ss_pred HHHHHHHHHHHHhCCCCCHHHHHHHHHhh--cCcCC-CChhhcccccccCCCccccccC Q lcl|NC_019404. 363 LEKSVNSIAALIAAGAMDIKEARDTLRTI--APEIK-IGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 363 ~~~~a~a~~~~~~~g~i~~~e~r~~l~~~--~~~~~-~~~~~~~~~e~~~~~e~e~~~~ 418 (418) ++++|+++++|++.|+|+++|+|+.|... +.|.+ ++.+|-+. .+.+++.|++-+ T Consensus 473 ~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~--~~~~~~~~~~~~ 529 (698) T protein:vir:10 473 RYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPG--APADDDIDGVLT 529 (698) T ss_pred HhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCC--CCCCCcchHHHh Confidence 99999999999999999999999999753 55655 33222211 122222333321 No 10 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=100.00 E-value=1.2e-107 Score=606.80 Aligned_cols=407 Identities=16% Similarity=0.168 Sum_probs=337.6 Q ss_pred CccchhhHHHH--hcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCc--------------- Q lcl|NC_019404. 1 MVKTDSYANIF--LGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGI--------------- 63 (418) Q Consensus 1 ~~~~D~~~n~~--~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~--------------- 63 (418) .--.-+++..+ .|....+-.+-....+-.++.++...++|.+|+++++++++|+|+|+++.+. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~ 161 (695) T protein:vir:78 82 PRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGN 161 (695) T ss_pred ccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccceeccccchhhhhhccccccc Confidence 00001111111 1111111111112223356778888999999999999999999999886422 Q ss_pred -------chHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccc------cCCCceEEEEEeeccccc Q lcl|NC_019404. 64 -------DDEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPV------REGAELETVRVYDRTQVK 129 (418) Q Consensus 64 -------~d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl------~~~~~i~~i~v~~~~~i~ 129 (418) |+.++|++++++|++|.+++++++|+|+|||++++|.++ +++.+++|| ..+|++++|+|+|||+++ T Consensus 162 ~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vt 241 (695) T protein:vir:78 162 AASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVT 241 (695) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccc Confidence 233579999999999999999999999999999999885 457788999 247899999999999999 Q ss_pred ccccc-ccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 130 VQNRE-ENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLA 208 (418) Q Consensus 130 ~~~~~-~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~ 208 (418) |...+ .||++|+||+|++|+|.+ ++||+||+++|.|+|+|+++++.+++||.|..+ .+|++|.+|++++.++ T Consensus 242 P~~~n~~dP~spdfgkP~~y~V~G------~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q-~~~e~V~~~~rT~~~v 314 (695) T protein:vir:78 242 PNNYNSINPVADDFYKPSTWWMIG------TEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQ-LAMPYIDNWLRTRQSV 314 (695) T ss_pred cchhhhccchhhccCCCceEEEec------eEEeeeeEEEecCCCchhhhhcccccCcccHHH-HHHHHHHHHHHHHhHH Confidence 98665 599999999999999953 579999999999999999999999999999774 6999999999999999 Q ss_pred HHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhh Q lcl|NC_019404. 209 TQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVAL 288 (418) Q Consensus 209 ~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaa 288 (418) ++|++++++.++|++ +++.+..+ +...+.+|+..+++++++++.+++|+++|+|++++++||||++++.+|+++||++ T Consensus 315 ~~Li~~~~v~~lk~d-la~~L~~g-~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~stslSGLddVi~qf~q~VAga 392 (695) T protein:vir:78 315 SDIVKQFSVSGILMD-LAQALMPG-ANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPLSGLDALQAQAQEQMSAV 392 (695) T ss_pred HHHHHhhhhHHHHHH-HHHhhcCh-hHHHHHHHHHHHHHhcCccceEEEecCCcceEEEecccCCHHHHHHHHHHHHHhh Confidence 999999999999997 88887755 4556888999999999999999999878999999999999999999999999999 Q ss_pred hcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc------CCceEEeCCCCCCCHHHHHHH Q lcl|NC_019404. 289 SGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNA------EEWSVEFSPLDHESSKDKAEV 362 (418) Q Consensus 289 s~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~------~~~~~~f~pL~~~~eke~ae~ 362 (418) ++||+|+|||+||+|||||||+|++||||+|+++|++.|+|.|++++++|++| .+|+|+|+|||+||++|+||+ T Consensus 393 a~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpdi~~~fnPL~qmtd~EkAeI 472 (695) T protein:vir:78 393 SHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAES 472 (695) T ss_pred hcCchhhhhccCCccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHH Confidence 99999999999999999999999999999999999999999999999999987 589999999999999999999 Q ss_pred HHHHHHHHHHHHhCCCCCHHHHHHHHHhh--cCcCC-CChhhcccccccCCCccccccC Q lcl|NC_019404. 363 LEKSVNSIAALIAAGAMDIKEARDTLRTI--APEIK-IGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 363 ~~~~a~a~~~~~~~g~i~~~e~r~~l~~~--~~~~~-~~~~~~~~~e~~~~~e~e~~~~ 418 (418) ++++|+++++|++.|+|+++|+|+.|... +.|.+ ++.+|.+. ...+++..++-+ T Consensus 473 ~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~--~~~~~~~~~~~~ 529 (695) T protein:vir:78 473 RYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPG--VPADDDIDGVLT 529 (695) T ss_pred HhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCC--cCccchhhhhHh Confidence 99999999999999999999999999753 45544 22222110 011111111111 No 11 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=100.00 E-value=1.2e-107 Score=606.78 Aligned_cols=407 Identities=16% Similarity=0.172 Sum_probs=337.4 Q ss_pred CccchhhHHH--HhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCc--------------- Q lcl|NC_019404. 1 MVKTDSYANI--FLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGI--------------- 63 (418) Q Consensus 1 ~~~~D~~~n~--~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~--------------- 63 (418) .--.-+++.. |.|....+-.+-....+-.++.++...++|.+|+++++++++|+|+|+++.+. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~ 161 (695) T protein:vir:36 82 PRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGN 161 (695) T ss_pred ccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccceecccchhhhhhcccccccc Confidence 0000011111 11111111111112223356778888999999999999999999999886422 Q ss_pred -------chHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccc------cCCCceEEEEEeeccccc Q lcl|NC_019404. 64 -------DDEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPV------REGAELETVRVYDRTQVK 129 (418) Q Consensus 64 -------~d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl------~~~~~i~~i~v~~~~~i~ 129 (418) |+.++|++++++|++|.+++++++|+|+|||++++|.++ +++.+++|| ..+|++++|+|+|||+++ T Consensus 162 ~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vt 241 (695) T protein:vir:36 162 AASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVT 241 (695) T ss_pred ccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccc Confidence 223579999999999999999999999999999999885 457788999 247899999999999999 Q ss_pred ccccc-ccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 130 VQNRE-ENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLA 208 (418) Q Consensus 130 ~~~~~-~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~ 208 (418) |...+ .||++|+||+|++|+|.+ ++||+||+++|.|+|+|+++++.+++||.|..+ .+|++|.+|++++.++ T Consensus 242 P~~~n~~dP~spdfgkP~~y~V~G------~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q-~~~e~V~~~~rT~~~v 314 (695) T protein:vir:36 242 PNNYNSINPVADDFYKPSTWWMIG------TEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQ-LAMPYIDNWLRTRQSV 314 (695) T ss_pred cchhhhccchhhccCCCceEEEec------eEEeeeeEEEecCCCchhhhhcccccCcccHHH-HHHHHHHHHHHHHhHH Confidence 98665 599999999999999953 579999999999999999999999999999774 6999999999999999 Q ss_pred HHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhh Q lcl|NC_019404. 209 TQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVAL 288 (418) Q Consensus 209 ~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaa 288 (418) ++|++++++.++|++ +++.+..+ +...+.+|+..+++++++++.+++|+++|+|++++++||||++++.+|+++||++ T Consensus 315 ~~Li~~~~v~~lk~d-la~aL~~g-~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~stslSGLddVi~qf~q~VAga 392 (695) T protein:vir:36 315 SDIVKQFSVSGILMD-LAQALMPG-ANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPLSGLDALQAQAQEQMSAV 392 (695) T ss_pred HHHHHhhhHHHHHHH-HHHhhcCh-hHHHHHHHHHHHHHhcCccceEEEecCCcceEEEecccCCHHHHHHHHHHHHHhh Confidence 999999999999997 88887755 4556888999999999999999999878999999999999999999999999999 Q ss_pred hcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc------CCceEEeCCCCCCCHHHHHHH Q lcl|NC_019404. 289 SGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNA------EEWSVEFSPLDHESSKDKAEV 362 (418) Q Consensus 289 s~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~------~~~~~~f~pL~~~~eke~ae~ 362 (418) ++||+|+|||+||+|||||||+|++||||+|+++|++.|+|.|++++++|++| .+|+|+|+|||+||++|+||+ T Consensus 393 a~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpdi~~~fnPL~qmtd~EkAeI 472 (695) T protein:vir:36 393 SHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAES 472 (695) T ss_pred hcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHH Confidence 99999999999999999999999999999999999999999999999999987 589999999999999999999 Q ss_pred HHHHHHHHHHHHhCCCCCHHHHHHHHHhh--cCcCC-CChhhcccccccCCCccccccC Q lcl|NC_019404. 363 LEKSVNSIAALIAAGAMDIKEARDTLRTI--APEIK-IGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 363 ~~~~a~a~~~~~~~g~i~~~e~r~~l~~~--~~~~~-~~~~~~~~~e~~~~~e~e~~~~ 418 (418) ++++|+++++|++.|+|+++|+|+.|... +.|.+ ++.+|.+. ...+++..++-+ T Consensus 473 ~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~--~~~~~~~~~~~~ 529 (695) T protein:vir:36 473 RYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPG--VPADDDIDGVLT 529 (695) T ss_pred HhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCC--cCccchhhhhHh Confidence 99999999999999999999999999753 45544 22222110 011111111111 No 12 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=100.00 E-value=1.5e-107 Score=606.19 Aligned_cols=406 Identities=16% Similarity=0.177 Sum_probs=338.0 Q ss_pred Cccchh--h--------HHHHhcC-CCCccccCc--cccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCc---- Q lcl|NC_019404. 1 MVKTDS--Y--------ANIFLGG-SDGSEIYGS--LQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGI---- 63 (418) Q Consensus 1 ~~~~D~--~--------~n~~~g~-~~~~~~~~~--~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~---- 63 (418) .+-+|. + +.. ++. ++.....++ ...+-.++.++...++|.+|+++++++++|+|+|+++.+. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~ 149 (694) T protein:vir:10 71 QFEVDVSNYTPRERRAASYA-LDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEK 149 (694) T ss_pred hccccccCCCccccchhhhh-hccCcccccchhhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccceeccccchh Confidence 111111 1 111 221 111122221 1223356778888999999999999999999999886422 Q ss_pred ------------------chHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccc------cCCCceE Q lcl|NC_019404. 64 ------------------DDEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPV------REGAELE 118 (418) Q Consensus 64 ------------------~d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl------~~~~~i~ 118 (418) |+.++|++++++|++|.+++++++|+|+|||++++|.++ +++.+++|| ..+|+++ T Consensus 150 ~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslK 229 (694) T protein:vir:10 150 ADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQ 229 (694) T ss_pred hhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeecCccccccccccccccccCccee Confidence 233579999999999999999999999999999999885 456788998 2478999 Q ss_pred EEEEeecccccccccc-ccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHH Q lcl|NC_019404. 119 TVRVYDRTQVKVQNRE-ENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDS 197 (418) Q Consensus 119 ~i~v~~~~~i~~~~~~-~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~ 197 (418) +|+|+|||+++|...+ .||++|+||+|++|+|.+ ++||+||+++|.|+|+|+++++.+++||.|..+ .+|++ T Consensus 230 Gl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G------~~IH~SRL~~f~g~plPd~LKp~y~~~G~Sv~q-~~~e~ 302 (694) T protein:vir:10 230 GLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIG------TEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQ-LAMPY 302 (694) T ss_pred eeEeecccccccchhhhccchhhccCCCceEEEec------eEEeeeeEEEecCCCchhhhhcccccCcccHHH-HHHHH Confidence 9999999999998665 599999999999999953 579999999999999999999999999999774 69999 Q ss_pred HHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHH Q lcl|NC_019404. 198 IKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAF 277 (418) Q Consensus 198 l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~ 277 (418) |.+|++++.++++|++++++.++|++ +++.+..+ +...+.+|+..+++++++++.+++|+++|+|++++++||||+++ T Consensus 303 V~~~~rT~~~v~~Li~~~~v~~lk~d-la~~L~~g-~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~stslSGLddV 380 (694) T protein:vir:10 303 IDNWLRTRQSVSDIVKQFSVSGILMD-LAQALMPG-ANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPLSGLDAL 380 (694) T ss_pred HHHHHHHHhHHHHHHHhhhhHHHHHH-HHHhhcCh-hHHHHHHHHHHHHHhcCccceEEEecCCcceEEEecccCCHHHH Confidence 99999999999999999999999997 88887755 45668889999999999999999998789999999999999999 Q ss_pred HHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc------CCceEEeCCC Q lcl|NC_019404. 278 LDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNA------EEWSVEFSPL 351 (418) Q Consensus 278 ~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~------~~~~~~f~pL 351 (418) +.+|+++||++++||+|+|||+||+|||||||+|++||||+|+++|++.|+|.|++++++|++| .+|+|+|+|| T Consensus 381 i~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp~i~~~fnPL 460 (694) T protein:vir:10 381 QAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNAL 460 (694) T ss_pred HHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCC Confidence 9999999999999999999999999999999999999999999999999999999999999987 5899999999 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhh--cCcCC-CChhhcccccccCCCccccccC Q lcl|NC_019404. 352 DHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTI--APEIK-IGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 352 ~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~--~~~~~-~~~~~~~~~e~~~~~e~e~~~~ 418 (418) |+||++|+||+++++|+++++|++.|+|+++|+|+.|... +.|.+ ++.+|.+. ...+++..++-+ T Consensus 461 ~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~--~~~~~~~~~~~~ 528 (694) T protein:vir:10 461 RELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPG--VPADDDIDGVLT 528 (694) T ss_pred CCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCC--cCccchhhhhHh Confidence 9999999999999999999999999999999999999753 45544 22222110 011111111111 No 13 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=100.00 E-value=1.8e-103 Score=583.83 Aligned_cols=401 Identities=19% Similarity=0.258 Sum_probs=334.5 Q ss_pred CccchhhHHHHhcCCCC--c-----cccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDG--S-----EIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PAFWS 71 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~--~-----~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~i~~ 71 (418) --.+|++.|++.|.|.+ . ..++++..+++++|.++|++||++|+|||+||++|||+||+|+++++. +++++ T Consensus 13 ~~~a~~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~r~g~~i~~~~~~~~~~~~~ 92 (461) T protein:vir:80 13 DSKIVNRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDMVRAGWSLKTDNKEMKKNIES 92 (461) T ss_pred hhhhhhhhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHhhcCCeeeecCCHHHHHHHHH Confidence 23478999988765432 2 236677778999999999999999999999999999999999987653 46899 Q ss_pred HHHHhCchHHHHHHHHhccccceEEEEEeecCCC----cccccccCC--CceEEEEEeeccccccccccccccccccCcc Q lcl|NC_019404. 72 RWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNR----ALTSPVREG--AELETVRVYDRTQVKVQNREENPRNARFGKP 145 (418) Q Consensus 72 ~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~----~l~~pl~~~--~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p 145 (418) +|++|+++++++++++|+|+||+|+|++.++|+. .+++|+++. +.+++|+|++++++++..+++||++|+||+| T Consensus 93 ~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~~~~l~~~~~~~i~~~~~~~dp~sp~fg~P 172 (461) T protein:vir:80 93 KWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKSIPYINTFNTQKVTQLYLNQDMFSEHFGEV 172 (461) T ss_pred HHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccceeEEEeccccccchhhhcccCcCcccccc Confidence 9999999999999999999999999999998754 467788764 4788999999999999999999999999999 Q ss_pred eEEEEecCC-----------cccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 146 LTYRITTNE-----------SDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRR 214 (418) Q Consensus 146 ~~y~i~~~~-----------~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~ 214 (418) ++|+|.+.. +...++|||||||||.|.++|+ .+||.|+++ ++|++|++|++++.++++|+++ T Consensus 173 ~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~~------~~~G~S~le-~~~~~l~~~~~~~~~~~~l~~~ 245 (461) T protein:vir:80 173 EFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFEG------ETKGRSIFE-SLYDIITVMDTSLWSVGQILYD 245 (461) T ss_pred eEEEEeccccccccccccccCccceEEccccEEEecCCCCCc------cccCcchHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 999997642 2234689999999999999875 478999885 6999999999999999999999 Q ss_pred cCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhhhcCCee Q lcl|NC_019404. 215 KQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGIHEI 294 (418) Q Consensus 215 ~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaas~IP~t 294 (418) ++++++|++++..+... ...+..+++. ..+++.+.+++++ +|+|++++++|+|+++++++++++||++++||+| T Consensus 246 ~~~~v~k~~~l~~~~~~--~~~~~~~~~~---~~~~~~g~~~~d~-~e~~e~~~~~lsgl~~~l~~~~~~iaa~s~iP~t 319 (461) T protein:vir:80 246 FAFKVYKTDDIDALNKD--DKANLTAMLD---FMFRTEALAIIKG-DEQLTKESTNVSGMKDLLDYGWDYLAGAVRMPKT 319 (461) T ss_pred hCCCceecchHHhhhch--HHHHHHHHHH---HhcCCceEEEEcC-CcceEEEecCcCCHHHHHHHHHHHHhhhhcCCee Confidence 99999999998776543 3344555554 4455555555554 5889999999999999999999999999999999 Q ss_pred eeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------CCceEEeCCCCCCCHHHHHHH Q lcl|NC_019404. 295 ILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNA------------EEWSVEFSPLDHESSKDKAEV 362 (418) Q Consensus 295 ~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~------------~~~~~~f~pL~~~~eke~ae~ 362 (418) +|||+|||| +|||++|++|||++|+++||+.++|+|++|+++|+++ .+|+|+|+|||++|+||+||+ T Consensus 320 ~L~G~s~g~-~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~ 398 (461) T protein:vir:80 320 VLKGQEAGT-LTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEV 398 (461) T ss_pred eeecccCCc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHH Confidence 999999955 6799999999999999999999999999999999864 379999999999999999999 Q ss_pred HHHHHHHHHHHHhCCCCCHHHHHHHHHhhc------CcCCCChh--hcccccc--cCCCcccc Q lcl|NC_019404. 363 LEKSVNSIAALIAAGAMDIKEARDTLRTIA------PEIKIGDN--DIQTEES--ELITETEV 415 (418) Q Consensus 363 ~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~------~~~~~~~~--~~~~~e~--~~~~e~e~ 415 (418) ++++|+++++|+++|+|+++|+|+.|+... .+.+++.+ ++...+. ...++.++ T Consensus 399 ~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~g 461 (461) T protein:vir:80 399 RKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDAYAKKNADG 461 (461) T ss_pred HHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhccccccccCCCC Confidence 999999999999999999999999997432 23334333 2211111 22223333 No 14 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=100.00 E-value=1e-98 Score=557.81 Aligned_cols=397 Identities=15% Similarity=0.146 Sum_probs=317.2 Q ss_pred CccchhhHHHHhcCCCCcc----ccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc-Ccc-----hHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSE----IYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID-GID-----DEPAFW 70 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~----~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~-~~~-----d~~~i~ 70 (418) -.+.|+|.|.++|.|+... .||++..+++++|+++|++||++++||++|+++|||.|..|. |.+ ...+++ T Consensus 19 ~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr~~~ia~~iVd~~~d~~~~~~~~i~~g~~~~~~~~~~~~e 98 (449) T protein:vir:10 19 ARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYRRGGIAHGAVEKLVGKCWQTNPEIIEGDDADDSEDETSWE 98 (449) T ss_pred HHHHHHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHhcCchhHHHHHhhhhhhhhcCcccccCccccchhhhHHHH Confidence 2356999999998876532 468899999999999999999999999999999999998773 332 223567 Q ss_pred HHHHHh---CchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceE Q lcl|NC_019404. 71 SRWDDL---EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLT 147 (418) Q Consensus 71 ~~~~~l---~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~ 147 (418) .++++| ++|++++++++|+|+||||+|++.++|+++|++|+++++.|++|+|+++++++|..++.||.+|+||+|++ T Consensus 99 ~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v~d~~~l~~Pl~~~~~i~~i~v~~~~~i~~~~~~~dp~sp~yg~P~~ 178 (449) T protein:vir:10 99 KKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHIRDEKDWNLPATKGRGLQKVSVSWAGSLKVAEWDTGINSKTYGQPKL 178 (449) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEecCCCCCCcccccCcceeeEEeeccccCChhhhhcCCCCCCCCCceE Confidence 777665 67899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecC---CcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc------ Q lcl|NC_019404. 148 YRITTN---ESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA------ 218 (418) Q Consensus 148 y~i~~~---~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~------ 218 (418) |+|++. +.....+||||||++|++.++| |.|.| +++|+.+.+++++..+.++.+.+.... T Consensus 179 y~v~~~~~g~~~~~~~iH~SRl~~~~~~~~~----------g~~~L-~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~ 247 (449) T protein:vir:10 179 WKYTERLPNGSSRRVDIHPDRVFILGDYSED----------AIGFL-EPAYNAFVSLEKVEGGSGESFLKNAARQLNVNF 247 (449) T ss_pred EEEeeeccCCCccceeeccceeEeecCCCCC----------ChhHH-HHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhh Confidence 999843 3334467999999999988765 67755 579999999999999988877764322 Q ss_pred --eeecchHHHhhcCcchHHHHHHHHHHHHHhc-CCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhhhcCCeee Q lcl|NC_019404. 219 --VWKAKGLAELCDDSEGFGAARLRLAQVDNNS-GVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGIHEII 295 (418) Q Consensus 219 --v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~-~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaas~IP~t~ 295 (418) .+++.+++.+++.+ .+.+.+++....... ...+.++++. +++|++++++|+|++++++.+++.+||+++||+|+ T Consensus 248 ~~~~~~~~l~~~~~~~--~e~~~~~~~~~~~~~~~~~~~~~i~~-~~d~~~~~~~~sgl~d~l~~~~q~iaaa~~IP~t~ 324 (449) T protein:vir:10 248 EKEIDFTNLASLYGVS--IDELQDKFNEVAGEINRGNDVLMTTQ-GATVTPLVTSVADPTATYNVNLQTAAAGVDIPTRI 324 (449) T ss_pred hhhhhhhhhhHHhhCC--chHHHHHHHHHHHHHhccchheeecC-CcceEEEecccCChhHHHHHHHHHHHHHhCCCeee Confidence 22455556555433 333344443222221 2233455554 57899999999999999999999999999999999 Q ss_pred eeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc------CCceEEeCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019404. 296 LKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNA------EEWSVEFSPLDHESSKDKAEVLEKSVNS 369 (418) Q Consensus 296 L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~------~~~~~~f~pL~~~~eke~ae~~~~~a~a 369 (418) |||+||+||||| +|.+|||++|+++|+ .|+|.|++|+++|+++ ++|+|+|+|||++|+|||||+++++|++ T Consensus 325 L~Gqsp~glnst--~D~~nyyd~i~~~Q~-~l~p~le~l~~~l~~s~~g~~~~d~~i~f~pL~~~t~kEkAei~k~~A~a 401 (449) T protein:vir:10 325 LIGNQQAERSST--EDQKYFNARCQSRRV-DLSFEIEDFCDKLIELKIIDAVAKKAVIWDDLNEQTGTEKLTNAKTMGEI 401 (449) T ss_pred eeccCccccccc--hhHHHHHHHHHHHHH-hhhHHHHHHHHHHHHhhcCCCCCceeEEeCCCCCCCHHHHHHHHHHHHHH Confidence 999999999986 478999999999997 5999999999999885 5899999999999999999999999999 Q ss_pred HHHHHhCC---CCCHHHHHHHHHhhcCcCCCChhhcccccccCCCccccccC Q lcl|NC_019404. 370 IAALIAAG---AMDIKEARDTLRTIAPEIKIGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 370 ~~~~~~~g---~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e~e~~~~ 418 (418) +++++++| +++++|+|+.+.... +..+.+++++++..++...=-| T Consensus 402 ~~~~~~ag~~~~~~~~EiR~~~~~~~----~~~~~~~~e~~de~~~~~d~~a 449 (449) T protein:vir:10 402 NQTMLGSGDNPAFSREEIRTAAGYDN----DDEEPLGEEDGDEEDKATDSAA 449 (449) T ss_pred HHHHHHccccCCcCHHHHHHHhcccC----CCCCCCCCCCCccccccCCcCC Confidence 99999888 999999998774322 2222222222222222222222 No 15 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=100.00 E-value=3.8e-57 Score=329.90 Aligned_cols=194 Identities=73% Similarity=1.097 Sum_probs=175.6 Q ss_pred eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHHHHHHHHHHHHhhhhcCCeeeeec Q lcl|NC_019404. 219 VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGIHEIILKN 298 (418) Q Consensus 219 v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G 298 (418) |||+++++++++.+ ...+++|+..+.+++++++++++++++|+|++++++||||+++++++++.|||+++||+|+||| T Consensus 1 V~k~~~l~~~~~~~--~~~~~~r~~~~~~~~~~~~~~~ld~~~e~~e~~~~~lsGl~d~l~~~~~~iaa~s~iP~t~LfG 78 (201) T protein:vir:10 1 MWKAKGLADLCDDS--DGAARLRLAQVDNNSGVGQAIGIDADSEEYNVLNSDIGGIDTFLSQKFDRIVALSGIHEIILKG 78 (201) T ss_pred CccchHHHHHhcCC--hHHHHHHHHHHHHhhhhhhhheeecCCcceeeeecCcCChHHHHHHHHHHHHhHhcCchhhhcC Confidence 99999999999865 4578899999999999999999999999999999999999999999999999999999999999 Q ss_pred cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019404. 299 KNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGA 378 (418) Q Consensus 299 ~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~ 378 (418) +||+||||||++|++|||++|+++|++.++|+|++|+++++++++|+|+|+|||++|+|||||+++++|+++++|+++|+ T Consensus 79 ~sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~~~~~~~~~~f~pL~~~s~kekAei~~~~a~a~~~~~~~g~ 158 (201) T protein:vir:10 79 KNVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLPFIVTEQEWSVEFNPLSQVSDKDKSEILEKNVNSVAALIAAGI 158 (201) T ss_pred CCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCceEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHHHHHHHhhcCcCCCChhh----ccccc---ccCCCccc Q lcl|NC_019404. 379 MDIKEARDTLRTIAPEIKIGDND----IQTEE---SELITETE 414 (418) Q Consensus 379 i~~~e~r~~l~~~~~~~~~~~~~----~~~~e---~~~~~e~e 414 (418) ++++|+|+.|+..+.+..++++. ++..+ ++..+|++ T Consensus 159 i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 159 IDADEARDTLRAISTEVKIGEGSIQTEVVINESEDPLDVSANN 201 (201) T ss_pred CCHHHHHHHHHhcCCcCCCCCCCCCccccccccCCCCCCCCCC Confidence 99999999999877766654332 22222 22223344 No 16 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=99.92 E-value=2.1e-25 Score=155.87 Aligned_cols=382 Identities=13% Similarity=0.114 Sum_probs=217.7 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHH--HHHH--HHH- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPA--FWSR--WDD- 75 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~--i~~~--~~~- 75 (418) +.| +| -.+..++.++..+..+ ..++..+.++|..++.++++|+++++++.+.+|.+...++... .... +.+ T Consensus 62 ~~r-~g--~~~~~~~~g~~~~~ep-p~d~~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~~~~ll~rP 137 (648) T protein:vir:79 62 VKR-IG--LAIMDGGGGGRDFEEP-EFDFNEITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNAVEYIRMRFTLMAE 137 (648) T ss_pred HHH-hH--HHHHhhcCCccccccC-CcCHHHHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCccchhhHHHHHhhcc Confidence 111 12 1222222222222211 1267789999999999999999999999999999876543211 1111 111 Q ss_pred ---hCchHHHHHHHHhccccceEEEEEeecC-CCccc--cccc--CCCceEEEEEeeccccccccccccccccccCcceE Q lcl|NC_019404. 76 ---LEMTQNINDAWSWARLFGGAAIVAIVKD-NRALT--SPVR--EGAELETVRVYDRTQVKVQNREENPRNARFGKPLT 147 (418) Q Consensus 76 ---l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~--~pl~--~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~ 147 (418) ....+-++.....-.+||.||+.+.-+. +.++- .++. ....++.+.++++..+.+.. ..||.+.. T Consensus 138 n~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~-------d~~g~~~~ 210 (648) T protein:vir:79 138 ATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKR-------DKFGMIKG 210 (648) T ss_pred CCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEE-------cCCCceee Confidence 1222334445555668999999876532 22111 1111 12345556666655554322 24788888 Q ss_pred EEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchH Q lcl|NC_019404. 148 YRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGL 225 (418) Q Consensus 148 y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l 225 (418) |.+...++.....++++.||||.... +....+|.||+.. +.+.|.....+....+.++.....+ +++++.. T Consensus 211 Y~y~~~g~~~~~~~~~~dIIHik~~~------~~d~~~GlSpi~~-a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~ 283 (648) T protein:vir:79 211 WQQEQEGQDKPQKFKPEDIVHIYYKR------EKGRAFGTPWLLP-ALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLE 283 (648) T ss_pred eEEEecCCceeEEecCccEEEEccCC------CCCCceeccHHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC Confidence 88876655555678999999995321 2345689999975 8899999999999999988877644 3443311 Q ss_pred HHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC------CHHHHHHHHHHHHhhhhcCCeeeeecc Q lcl|NC_019404. 226 AELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG------GIDAFLDKKFDRIVALSGIHEIILKNK 299 (418) Q Consensus 226 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~------gl~~~~~~~~~~iaaas~IP~t~L~G~ 299 (418) .. ..+...+..+++.. ..+.+.+.+..-+++.+.++.. .+.+..++..+.||.+.+||..+| |. T Consensus 284 ~~---~~e~~k~~~e~~~~------~~~~~~i~gg~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lL-G~ 353 (648) T protein:vir:79 284 QE---GFGAEEGEVDLVRG------EVENMDVEGGMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMM-GR 353 (648) T ss_pred cc---chHHHHHHHHHHHH------hcccccccccccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHc-cc Confidence 10 01111112222221 1122222222234444444432 133446778899999999998755 77 Q ss_pred CccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------CCceEEeCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019404. 300 NVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNA----------EEWSVEFSPLDHESSKDKAEVLEKSVNS 369 (418) Q Consensus 300 s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~----------~~~~~~f~pL~~~~eke~ae~~~~~a~a 369 (418) ..++-.++++....+|.++|...|+...+.+...+...+.+. ..+.|+|++|...+++.+ ++. T Consensus 354 ~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~-------a~~ 426 (648) T protein:vir:79 354 GGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKL-------ENQ 426 (648) T ss_pred CCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccceEEEeecccchhhHHHH-------HHH Confidence 766666778888889999999888765555544544443321 135688888887776554 455 Q ss_pred HHHHHhCCCCCHHHHHHHHHhhcCcCCCCh-----------hhcccc-----cc----cCCCccccccC Q lcl|NC_019404. 370 IAALIAAGAMDIKEARDTLRTIAPEIKIGD-----------NDIQTE-----ES----ELITETEVVIA 418 (418) Q Consensus 370 ~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~-----------~~~~~~-----e~----~~~~e~e~~~~ 418 (418) +..++++|++|++|+|+.+ ...|-.+-.+ ...... .+ ...+.+|+.-. T Consensus 427 ~~~l~~~GilT~NEaR~~l-GlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~ 494 (648) T protein:vir:79 427 AVFLYEHNAISEDEMRELI-GRDPVDDGEGRAKMHLQMVTIAQATALAALAPTPAGGSSASASGDKKKK 494 (648) T ss_pred HHHHHhCCCcCHHHHHHHh-CCCCCCCCCCccccccccccchhccccccCCCCCCCCCCCCcccccccc Confidence 6678999999999999876 2222111000 000000 00 00011111110 No 17 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=99.87 E-value=7.8e-23 Score=141.80 Aligned_cols=355 Identities=12% Similarity=0.067 Sum_probs=201.2 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCchH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMTQ 80 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~~ 80 (418) ....+.+...+.++. .+ ...+. .-|.+++.+.++|+..|+++-+-++++....... +...-..+.... T Consensus 19 ~~~~~~~~~~~~~~~-----~~--~~v~~----~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~-ll~~PN~~~t~~ 86 (383) T protein:vir:10 19 YPSNPAFFTTTVGGM-----QL--SYVSA----LSALQNTNVYSVINRIASDVSSAHFKTENTATLN-RLESPSSLIGRF 86 (383) T ss_pred cccchhhhhhhccCc-----cc--cccch----hHhhcchHHHHHHHHHHHhhccCceeecccchhh-hhhCCCCCCCHH Confidence 000011111111100 00 01111 2245678899999999999999999886433222 222222233344 Q ss_pred HHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccccc Q lcl|NC_019404. 81 NINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFY 159 (418) Q Consensus 81 ~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~ 159 (418) .|.+.+.+ ..++|.|++++. .+ ..++.+.++.++.+... +....|.+.....+... T Consensus 87 ~f~~~~~~~l~l~Gn~~~~i~-~~-------------~~~~~p~~~~~v~~~~~---------~~~~~~~~~~~~~~~~~ 143 (383) T protein:vir:10 87 SFWQGALMQLCLSGNDYIPLV-GQ-------------NLEHIPNSDVQINYLPG---------NMGIVYTVLESNDRPKM 143 (383) T ss_pred HHHHHHHHHhhhcCCeEEEEE-cC-------------ceeEeecCcceEEEEEc---------CCceEEEEEEcCCceEE Confidence 45444444 556899998874 22 11233334444432211 11233445444344456 Q ss_pred ccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHH Q lcl|NC_019404. 160 DVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGA 237 (418) Q Consensus 160 ~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~ 237 (418) ++++++||||.+...+ .....+|.|++.. +.+.|.....+......++.....+ ++++++ . +.+.+.... T Consensus 144 ~~~~~evih~r~~~~~----~~~~~~G~s~l~~-~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~--~-~~~~e~~~~ 215 (383) T protein:vir:10 144 VLRQDQMLHFRLMPDP----QYRYLIGRSPLES-LQNALNLDDKASKSNMSAMENQINPAGKLTISN--Y-LSDGKDLES 215 (383) T ss_pred EEcccceEEeccCCCC----cccccccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC--C-CCCHHHHHH Confidence 7999999999643211 1123579999975 8889999999999999988876654 455542 1 222333344 Q ss_pred HHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHH---HHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHH Q lcl|NC_019404. 238 ARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGID---AFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALET 314 (418) Q Consensus 238 ~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~---~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~ 314 (418) ..++++..... .+.+.+++..++.+|+.++.+....+ +..++..+.||.+.|||..+|.+...++.+.+.-+..+. T Consensus 216 ~~~~~~~~~~~-~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~ 294 (383) T protein:vir:10 216 AREEFEKANTG-DNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKA 294 (383) T ss_pred HHHHHHHHhCc-cccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHH Confidence 44555443322 23344555556688999988876554 556777899999999999888655544443332223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh---hccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhh Q lcl|NC_019404. 315 FHKLIDRKRNAELLPILEFLIPFI---VNAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTI 391 (418) Q Consensus 315 y~~~I~~~Qe~~l~p~l~~l~~~i---~~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~ 391 (418) +|. .-|+|+++.+-..+ +....++|++.+|...|.+++ ++++.+++++|++|++|+|+.+.. T Consensus 295 ~~~-------~~l~P~~~~ie~~l~~~l~~~~~~f~~~~l~~~d~~~~-------~~~~~~~~~~G~~t~nE~R~~lg~- 359 (383) T protein:vir:10 295 TYL-------ANLNSYVNPIVDELRLKMNAPDLELDIKDMLDVDDSIL-------INQVSNLAKSGVLGAEQAQFILTR- 359 (383) T ss_pred HHH-------HHHHHHHHHHHHHHHHhhCCceEEeechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhCC- Confidence 332 13677776664443 335678899999998988776 667889999999999999997732 Q ss_pred cCcCCCChhhcccccccC-CCccc Q lcl|NC_019404. 392 APEIKIGDNDIQTEESEL-ITETE 414 (418) Q Consensus 392 ~~~~~~~~~~~~~~e~~~-~~e~e 414 (418) .+..+.+.........+. -+++| T Consensus 360 ~p~~~~d~~~~~~~~~~~~gGd~e 383 (383) T protein:vir:10 360 SGFLPDNLPEFKPLTNETKGGDDK 383 (383) T ss_pred CcccCCcccccCCCcccCCCCCCC Confidence 222222111111111111 23344 No 18 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=99.84 E-value=3.2e-21 Score=132.92 Aligned_cols=356 Identities=12% Similarity=0.066 Sum_probs=201.7 Q ss_pred Cccchhh------------------HHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC Q lcl|NC_019404. 1 MVKTDSY------------------ANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG 62 (418) Q Consensus 1 ~~~~D~~------------------~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~ 62 (418) |==.+.+ ...+.|+. .....+. ..|.+++.+++||+.+|+++-+-++++.- T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~v~~----~~al~~~~v~~~i~~ia~~ia~~p~~v~~ 69 (385) T protein:vir:10 1 MGLLTPRNFNKRKAKNMVYPSNPAFFTTTVGGM-------QLSYVSA----LSALQNTNVYSVINRIASDVASAHFKTEN 69 (385) T ss_pred Cccccchhcccccccccccccchhhhhhhcccc-------CccccCH----HHhhccHHHHHHHHHHHHHHhhCceeeec Confidence 1000100 00000000 0011122 22456788999999999999999998863 Q ss_pred cchHHHHHHHHHHhCchHHHHHHHHhcc-ccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccc Q lcl|NC_019404. 63 IDDEPAFWSRWDDLEMTQNINDAWSWAR-LFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNAR 141 (418) Q Consensus 63 ~~d~~~i~~~~~~l~~~~~~~~a~~~~r-l~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~ 141 (418) .. ...+..+-..+-....|.+.+.+.+ ++|.|++++. ++ ...+.+++++++.+.. | T Consensus 70 ~~-~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~-r~-------------~~~~~p~~~~~v~~~~---~----- 126 (385) T protein:vir:10 70 TA-TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLV-GQ-------------NLEHIPNSDVQINYLP---G----- 126 (385) T ss_pred cc-hhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEE-cC-------------ceeEeecCCceEEEEE---c----- Confidence 22 2223333333445566777777666 5899998874 22 1123344444443321 1 Q ss_pred cCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--e Q lcl|NC_019404. 142 FGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--V 219 (418) Q Consensus 142 yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v 219 (418) +....|.+....++....++++.||||.+...+. ....+|.|++.. +.+.+.....+.......+...... + T Consensus 127 -~~~~~~~~~~~~~~~~~~~~~~eiihik~~~~~~----~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~~~ng~~~~gi 200 (385) T protein:vir:10 127 -NMGIVYTVLESNDRPQMVLRQDQMLHFRLMPDPQ----YRYLIGRSPLES-LQNALNLDDKASKSNMSAMENQINPAGK 200 (385) T ss_pred -CCceEEEEEEcCCceEEEEccccEEEeccCCCCc----ccccccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCcceE Confidence 1123455544444444679999999996432111 123469999975 8899998888888888888775433 4 Q ss_pred eecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHH---HHHHHHHHHHhhhhcCCeeee Q lcl|NC_019404. 220 WKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGID---AFLDKKFDRIVALSGIHEIIL 296 (418) Q Consensus 220 ~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~---~~~~~~~~~iaaas~IP~t~L 296 (418) +++++ . +.+++......++++..... .+.+.+++..++.+|+.++.+...+. +..++....||.+.+||..+| T Consensus 201 l~~~~--~-~~~~e~~~~~~~~~~~~~~~-~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~l 276 (385) T protein:vir:10 201 LTISN--Y-LSDGKDLESAREEFEKANTG-DNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDIL 276 (385) T ss_pred EEeCC--C-CCCHHHHHHHHHHHHHHhCc-cccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHc Confidence 45542 1 12233344445555443322 22344555555678998888776644 456777899999999998777 Q ss_pred eccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 297 KNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VNAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAAL 373 (418) Q Consensus 297 ~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~ 373 (418) .+...++.+.+.-+..+.+|.. .|.|.++.+-..+ +...+++|.+.+|...|.+++ +++++++ T Consensus 277 g~~~~~~~~~sn~eq~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~~f~~~~ll~~d~~~~-------~~~~~~~ 342 (385) T protein:vir:10 277 GGGTSTESQHSNIDQIKATYLA-------NLNSYVNPIVDELRLKMNAPDLELDIKDMLDVDDSAL-------INQVSNL 342 (385) T ss_pred CCccCCCcccccHHHHHHHHHH-------HHHHHHHHHHHHHHHhhCCceEEeechhhhccCHHHH-------HHHHHHH Confidence 5544443332222233334321 2677776665554 335678888889998988765 6788899 Q ss_pred HhCCCCCHHHHHHHHHhhcCcCCCChhh--cccccccCCCcccc Q lcl|NC_019404. 374 IAAGAMDIKEARDTLRTIAPEIKIGDND--IQTEESELITETEV 415 (418) Q Consensus 374 ~~~g~i~~~e~r~~l~~~~~~~~~~~~~--~~~~e~~~~~e~e~ 415 (418) +++|++|++|+|+.+.. .+..+.+... .+....+.-++++. T Consensus 343 ~~~G~~T~NE~R~~~g~-~p~p~~~~~~~~~~~~~~~~g~~~dn 385 (385) T protein:vir:10 343 AKSGVLGAEQAQFILTR-SGFLPDNLPEFKPLTTQVKGGDEGDN 385 (385) T ss_pred HhCCCcCHHHHHHHhCC-CccCCCCCccccCcccccCCCCCCCC Confidence 99999999999987632 2211111111 11111122223333 No 19 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=99.84 E-value=4.3e-21 Score=132.24 Aligned_cols=362 Identities=13% Similarity=0.071 Sum_probs=210.2 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc-CcchH-HH----HHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID-GIDDE-PA----FWSRWD 74 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~-~~~d~-~~----i~~~~~ 74 (418) +-..|.-.-.+.|.+ .+. .+.+.. .+ .+++.++++|+.+|++.-+-++.+. ..++. .. +...+. T Consensus 14 ~~~~~~~~~~~~g~~-~~~-----~~v~~~---~a-l~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~~~~~l~~lL~ 83 (409) T protein:vir:10 14 ISIDDKKILEWLGIN-PSE-----TYVNGK---SC-LKQATVFGCIRILSDNISKLPIKIYQKKDGIKRVPDHYLEYLLK 83 (409) T ss_pred CCCChHHHHHHhcCC-cCc-----ceechh---hh-hccHHHHHHHHHHHHhhhhCceEEEEecCCeeeccCchHHHHHh Confidence 111121111111211 111 112222 22 3578889999999999999888772 11111 11 111121 Q ss_pred ----HhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEE Q lcl|NC_019404. 75 ----DLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYR 149 (418) Q Consensus 75 ----~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~ 149 (418) ..-....|.+.+.+ ..++|.|++++.-+ ..|.+..+.++++.++++... .+... .+.....|. T Consensus 84 ~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~----------~~G~~~~L~~i~~~~V~v~~~-~~~~~-~~~~~~~y~ 151 (409) T protein:vir:10 84 LRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFK----------KNGEIKGLYPLKSDGMKIFVD-DTGLL-NSENNVWYL 151 (409) T ss_pred hccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc----------CCCcEEEEEEEcCCceEEEEc-CCccc-cccceEEEE Confidence 12233455554444 57789999988532 335677889999988876432 12221 122233566 Q ss_pred EecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHH Q lcl|NC_019404. 150 ITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAE 227 (418) Q Consensus 150 i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~ 227 (418) ++...+ ....++++.||||.+... ...+|.|++.. +.+.|.....+......++.....+ ++++++ . T Consensus 152 ~~~~~g-~~~~~~~~evih~r~~~~-------d~~~G~s~i~~-~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~--~ 220 (409) T protein:vir:10 152 YTDDLG-QRHKFMSDEILHFKGLTA-------DGLAGLSVIEL-LNHLIENGKSSETYLNNFFKNGLQVKGLVQYAG--D 220 (409) T ss_pred EEeCCc-eeEEeccccEEEecCcCC-------CCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCC--C Confidence 654433 235799999999964321 23579999975 8899999999999999988876544 556553 1 Q ss_pred hhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCccccc Q lcl|NC_019404. 228 LCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVGGLS 305 (418) Q Consensus 228 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~ 305 (418) ++ ++...+..+++........+.+.+++..++.+|++++.+..+ +-+..++..+.||++.+||..+| |...++-. T Consensus 221 -l~-~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~ 297 (409) T protein:vir:10 221 -LN-PEAEEVFKENFERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQL-NDLDRATH 297 (409) T ss_pred -CC-HHHHHHHHHHHHHHhccccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCCcc Confidence 21 233344455554433322234445555566789988877654 44667889999999999999866 55544445 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hc----cCCceEE--eCCCCCCCHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019404. 306 SSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VN----AEEWSVE--FSPLDHESSKDKAEVLEKSVNSIAALIA 375 (418) Q Consensus 306 stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~----~~~~~~~--f~pL~~~~eke~ae~~~~~a~a~~~~~~ 375 (418) ++.+.....||.. .|.|.++.+-..+ +. ..++.++ ++.|...|.+++ +++++++++ T Consensus 298 ~~~e~~~~~f~~~-------~l~P~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~-------~~~~~~~~~ 363 (409) T protein:vir:10 298 SNITEQNREFYID-------TLQSILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTR-------YESYKEAIQ 363 (409) T ss_pred ccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHH-------HHHHHHHHh Confidence 5667777777764 3778777664443 21 2344455 557777777665 667888999 Q ss_pred CCCCCHHHHHHHHHhhcCcCCCChh--------hccccc-ccCCCccc Q lcl|NC_019404. 376 AGAMDIKEARDTLRTIAPEIKIGDN--------DIQTEE-SELITETE 414 (418) Q Consensus 376 ~g~i~~~e~r~~l~~~~~~~~~~~~--------~~~~~e-~~~~~e~e 414 (418) +|++|++|+|+.+. ..+..+ .|+ -++... ....+.++ T Consensus 364 ~G~~T~NE~R~~lg-l~p~~g-gD~~~~~~n~~~~~~~~~~~~kgGe~ 409 (409) T protein:vir:10 364 NGFKTPNEIRELEE-DEPLEG-GDVLLINGNMIPVKMAGEQYSKGGEK 409 (409) T ss_pred CCCcCHHHHHHHhC-CCCCCC-cCeeeeccCccchhhccccccccCCC Confidence 99999999998763 222111 110 111111 11112222 No 20 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=99.83 E-value=4.5e-21 Score=132.13 Aligned_cols=373 Identities=12% Similarity=0.045 Sum_probs=209.3 Q ss_pred CccchhhHHHHhcCCC--------------Cccc---cCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCc Q lcl|NC_019404. 1 MVKTDSYANIFLGGSD--------------GSEI---YGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGI 63 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~--------------~~~~---~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~ 63 (418) |=-.|.+.+-...... .... +|.....+ ..-..+ ..++.+.++|+.+++++-+-++.+... T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~-v~~~~a-l~~~~v~~ci~~ia~~iA~lp~~~~~~ 78 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFS-VRGKRA-LKENTVYVCTKIRAESIGKLSLKIYKD 78 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhccccCCcc-cchhhh-hccHHHHHHHHHHHHhhhhCceEEEec Confidence 2112222111000000 0000 01000000 011122 246778899999999999998887322 Q ss_pred chH---HHHHHHHH----HhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccc Q lcl|NC_019404. 64 DDE---PAFWSRWD----DLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREE 135 (418) Q Consensus 64 ~d~---~~i~~~~~----~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~ 135 (418) .+. ..+...+. ..-.+..|.+.+.+ -.++|.|++++.- + ..|.+..|.++++.++.+... . T Consensus 79 ~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r-~---------~~G~~~~L~~i~~~~v~~~~~-~ 147 (422) T protein:vir:13 79 KEEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIER-D---------RKGKIIGLYPINSDNVTKIID-D 147 (422) T ss_pred CcccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEECCcceEEEEc-C Confidence 111 11222221 22223455555555 4568999988743 2 335678899999988876543 2 Q ss_pred cccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019404. 136 NPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRK 215 (418) Q Consensus 136 dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~ 215 (418) |.....++. ..|.++..++ ....++++++||+.+.. .....+|.|++.. +.+.+.....+.......++.. T Consensus 148 ~~~~~~~~~-~~y~~~~~~g-~~~~~~~~eiih~~~~~------~~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~f~ng 218 (422) T protein:vir:13 148 DNFLSSLSK-VWYVVTDKNG-KEHKLLPDEMLHFIGDI------TLDGLIGIKPLDY-LRCTIENGRATQEFINKFFKNG 218 (422) T ss_pred Ccceeccce-EEEEEEeCCC-eEEEEcccceEEEcCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhcc Confidence 222223333 4566655433 33579999999997532 2234679999975 8889999999988888888875 Q ss_pred CCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcC Q lcl|NC_019404. 216 QQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGI 291 (418) Q Consensus 216 ~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~I 291 (418) ..+ ++++++ .+ +.+...+.++++...-....+.+.+++..++.+|++++.+..+ +-+...+....||.+.+| T Consensus 219 ~~p~gil~~~~---~l-~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgV 294 (422) T protein:vir:13 219 LSIKGIVQYVG---DL-DEKAKKIFKKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGM 294 (422) T ss_pred CCccEEEEeCC---CC-CHHHHHHHHHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCC Confidence 433 445542 12 2233445555555443332334455555566789888877654 446667888999999999 Q ss_pred CeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----c----cCCceEEeC--CCCCCCHHHHHH Q lcl|NC_019404. 292 HEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV----N----AEEWSVEFS--PLDHESSKDKAE 361 (418) Q Consensus 292 P~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~----~----~~~~~~~f~--pL~~~~eke~ae 361 (418) |..+|.+...+.. ++.++....||.. .|.|++..+-..+- . ..++.|+|+ .|...|.+++ T Consensus 295 pp~~lg~~~~~~~-sn~e~~~~~f~~~-------~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~-- 364 (422) T protein:vir:13 295 KSYHLNDLERATF-NNLTEQQKDFYVT-------TLQSSLTVYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTR-- 364 (422) T ss_pred CHHHhCCCCCCCc-ccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHH-- Confidence 9977754444443 4456666777654 37777766544432 1 235566654 6666666654 Q ss_pred HHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC----------hhhcccccccCCCccccc Q lcl|NC_019404. 362 VLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIG----------DNDIQTEESELITETEVV 416 (418) Q Consensus 362 ~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~----------~~~~~~~e~~~~~e~e~~ 416 (418) ++++++++++|++|++|+|+.+. ..+..+-+ .+...+. ...-+|+-+. T Consensus 365 -----~~~~~~~~~~G~~T~NE~R~~~g-l~p~~ggD~~~~~~n~~~l~~~~~~-~~~~g~~~g~ 422 (422) T protein:vir:13 365 -----YEAYRIGIQGGFIEANEARRREN-LPPVEGGDRLLVNGNMIPIEMAGEQ-YKKGGEKGGK 422 (422) T ss_pred -----HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcCeeeeccCccchhhcccc-cccCCCcCCC Confidence 66788899999999999998763 22211100 0111111 1111111111 No 21 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=99.82 E-value=2.6e-20 Score=127.92 Aligned_cols=361 Identities=12% Similarity=0.045 Sum_probs=204.1 Q ss_pred CccchhhH---------------HHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC--c Q lcl|NC_019404. 1 MVKTDSYA---------------NIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG--I 63 (418) Q Consensus 1 ~~~~D~~~---------------n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~--~ 63 (418) |--.|.+. +.+ |.+. .+. .....+++ .+..++.+.++|+.+|+++-+-++.+.. + T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~-~~~~-~~~--~g~~v~~~----~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~ 72 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAI-GLSY-DTY--TGKQISSQ----RAMRLTAVFSCVRVLAESVGMLPCNLYHLNG 72 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhh-ccCc-ccc--CCceechh----hhhccHHHHHHHHHHHHHhccCceEEEEecC Confidence 33333321 111 1100 000 11112222 3456888999999999999988887731 1 Q ss_pred chH---------HHHHHHHHHhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccc Q lcl|NC_019404. 64 DDE---------PAFWSRWDDLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNR 133 (418) Q Consensus 64 ~d~---------~~i~~~~~~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~ 133 (418) +.. ..+..+-........|.+.+.+ ..++|.|++++. ++ .|.+..+.++++.++++... T Consensus 73 ~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~-~~----------~g~~~~L~~l~~~~v~~~~~ 141 (414) T protein:vir:44 73 SLKQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKV-KA----------FGEVAELLPVDPGCVVPKLN 141 (414) T ss_pred CceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE-eC----------CCcEEEEEEEcCceEEEEEC Confidence 111 1122222223334456555555 456899998774 32 14466788888888765432 Q ss_pred cccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 134 EENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLR 213 (418) Q Consensus 134 ~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~ 213 (418) . .|.+ .|.+...++ ....++++.|+||.+..+ ...+|.|++.. +.+.|.....+......++. T Consensus 142 ~-------~~~~-~y~~~~~~g-~~~~~~~~evih~~~~~~-------d~~~G~s~i~~-~~~~i~~~~~~~~~~~~~f~ 204 (414) T protein:vir:44 142 S-------SWEP-VYQVTFPDG-STDVLSQEDIWHVRTLTL-------DGLVGLNPIAY-AREAISLAAATEEHGARLFS 204 (414) T ss_pred C-------CCcE-EEEEEecCc-eEEEEccccEEEecCCCC-------CCcccccHHHH-HHHHHHHHHHHHHHHHHHHh Confidence 1 2333 455554332 235799999999964331 23579999975 77888888888888888887 Q ss_pred HcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhh Q lcl|NC_019404. 214 RKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALS 289 (418) Q Consensus 214 ~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas 289 (418) ....+ ++++++. + +.+...+..+++........+.+.+++..++.+|+.++.+..+ +.+..+...+.||.+. T Consensus 205 ng~~p~gil~~~~~---l-~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~f 280 (414) T protein:vir:44 205 NGAVTSGVLRTEQT---L-SDQAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLF 280 (414) T ss_pred ccCCCceEEEeCCC---C-CHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHh Confidence 75543 4555531 2 2233444555554433322233345555556788888877654 4466778889999999 Q ss_pred cCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----cc---CCce--EEeCCCCCCCHHHHH Q lcl|NC_019404. 290 GIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV----NA---EEWS--VEFSPLDHESSKDKA 360 (418) Q Consensus 290 ~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~----~~---~~~~--~~f~pL~~~~eke~a 360 (418) +||..+|.+...+. .++.+...+.|+.. .|.|+++.+-..|- .. .++. |.+..|...|.+++ T Consensus 281 gVpp~~l~~~~~~t-~~n~e~~~~~~~~~-------~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~- 351 (414) T protein:vir:44 281 RVPLHMVQNTDRAT-FNNIEELGLGFINY-------SLVPYLTRIEQRINTGLVRKSKQGVFYAKFNAGALLRGDMKSR- 351 (414) T ss_pred CCCHHHhCCCCCCC-cccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCCccccCceEEEEechhhhccCHHHH- Confidence 99998774433333 34456666777764 37888776644442 12 1334 44557777777665 Q ss_pred HHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChh-----h---cccc----cccCCCccccccC Q lcl|NC_019404. 361 EVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDN-----D---IQTE----ESELITETEVVIA 418 (418) Q Consensus 361 e~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~-----~---~~~~----e~~~~~e~e~~~~ 418 (418) ++++++++++|+++++|+|+.+. ..+..+ .|. . .+.. ..+.++.+++.=+ T Consensus 352 ------~~~~~~~~~~G~~t~NE~R~~~g-l~p~~g-gD~~~~~~n~~~~~~~~~~~~~~~~~~~~d~~~ 413 (414) T protein:vir:44 352 ------FEAYATGINWGIYSPNDCRDLED-MNPRPG-GDVYLTPMNMTTKPSDGSKAGKQKDNANADETT 413 (414) T ss_pred ------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCC-cceecccccccccCCccccCCCCCCCCCCCCCC Confidence 66788899999999999998763 211111 110 0 0000 0111111111111 No 22 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=99.82 E-value=4.5e-20 Score=126.67 Aligned_cols=382 Identities=14% Similarity=0.103 Sum_probs=198.9 Q ss_pred CccchhhHHHHhc-----CCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLG-----GSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDD 75 (418) Q Consensus 1 ~~~~D~~~n~~~g-----~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~ 75 (418) +.+-....+.-.+ .......+..+ .++..+.++|..|+.+++||+..+++..+-++.+.+.+.. .+...+-. T Consensus 11 ~~~~~~i~~~~~~s~~~~~~~~~~~~~pp--~~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~~~~~~~~-~l~~~lpN 87 (542) T protein:vir:41 11 LEKYKAIKREEVESQALGETRFEEYVEPK--VNPLVLLSLLQVNPYHASACSIKANDIIRTGYILEGDDEG-VVDEFIRA 87 (542) T ss_pred cccchhhhhccccccccccccCCccccCC--CCHHHHHHHHhhcHHHHHHHHHHHHHHhhCceeeecccch-hhhhhcCC Confidence 2222222111111 11111222222 4678999999999999999999999999999999765432 22222211 Q ss_pred --hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEE--- Q lcl|NC_019404. 76 --LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRI--- 150 (418) Q Consensus 76 --l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i--- 150 (418) ....+-+...+.+-.++|.|++.+.- + ..|.+..|.+++++.+++.....-...-..+....|.. T Consensus 88 ~~~s~~~f~~~~v~~lll~Gnayi~i~r-d---------~~G~~~~L~~l~~~~v~v~~d~~~~~~~~~~~~~~~~~~y~ 157 (542) T protein:vir:41 88 CKPSFEYVLLRALEDLQVFNYCTLEVVR-D---------DRGDPIRFEYIPSHTIRVHKDGSRYRQTWDGVNITHFKDYR 157 (542) T ss_pred CCCCHHHHHHHHHHHHhhcCCeEEEEEE-c---------CCCcEEEEEEEcCcceEEEEcCCeeEeeecCCcceeEEeec Confidence 22334445555567889999998754 2 23567788899888876542111111001122222211 Q ss_pred -----ecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecc Q lcl|NC_019404. 151 -----TTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAK 223 (418) Q Consensus 151 -----~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~ 223 (418) ....+.....+.++.||||.... +....+|.|++.. +...|.....+......++.....+ +++++ T Consensus 158 ~~~~~~~~~g~~~~~~~~~eIiHir~~~------~~~~~~Glspi~~-~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~ 230 (542) T protein:vir:41 158 YEGEINPETGEDQDSVGANELVFIHIPS------PVCSYYGVPRYVS-AAPAILAMQKIDEYNYAFFDNYTIPSYVITVT 230 (542) T ss_pred ccccccccccccccccCcccEEEecCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEeC Confidence 11112222356778899985332 3345689999975 7788887777777777776654433 45555 Q ss_pred hHHH--hhc----CcchHHHHHHHHHHHHH-hcCCcceeEEEc------CCCceeEeecccC--CHHHHHHHHHHHHhhh Q lcl|NC_019404. 224 GLAE--LCD----DSEGFGAARLRLAQVDN-NSGVGQAIGIDA------ESEEYSVLNSDIG--GIDAFLDKKFDRIVAL 288 (418) Q Consensus 224 ~l~~--~~~----~~~~~~~~~~r~~~~~~-~~~~~~~~~~d~------~~e~~~~~~~~~~--gl~~~~~~~~~~iaaa 288 (418) +... ... +.+......+.+...-. ..++.+..++.. ++-+|..++.+.. .+-+......+.||++ T Consensus 231 ~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~a 310 (542) T protein:vir:41 231 GEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAA 310 (542) T ss_pred CccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHH Confidence 3210 000 11112223333322211 123334444431 2224555544432 2345566778999999 Q ss_pred hcCCeeeeeccC-ccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---c---cCCceEEeCCCCCCCHHHHH Q lcl|NC_019404. 289 SGIHEIILKNKN-VGGLS-SSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---N---AEEWSVEFSPLDHESSKDKA 360 (418) Q Consensus 289 s~IP~t~L~G~s-~~gl~-stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~---~~~~~~~f~pL~~~~eke~a 360 (418) .+||..+| |.. .+.+| ++-|.....|+.. .|.|+++.+-..|- . ..++.|+|+...-+.. + T Consensus 311 fgVPp~~l-G~~~~~t~n~sn~Eq~~~~f~~~-------tL~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~~ll~~-d-- 379 (542) T protein:vir:41 311 HMIDPYRL-GIADTGPLGGNFAEVTRRTYYES-------VVRPQQNIISSILTDFFQVKFNPKTRFKFNDETLLES-D-- 379 (542) T ss_pred hCCCHHHh-CcCCCcccccccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcccccCCceEEEecchhhcch-H-- Confidence 99999866 554 45555 3446666666554 36777666544432 1 1357788874332221 1 Q ss_pred HHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCC-------CChhhcccccccCC----CccccccC Q lcl|NC_019404. 361 EVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIK-------IGDNDIQTEESELI----TETEVVIA 418 (418) Q Consensus 361 e~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~-------~~~~~~~~~e~~~~----~e~e~~~~ 418 (418) .++.+..++++|++|++|+|+.|....+... .+...+...+.+.. .+-+--.+ T Consensus 380 -----~~~~~~~~v~~GilT~NE~Re~L~g~~pgdd~~l~p~~~~~~~~~~~~~n~~~~~~~~~~k~~~ 443 (542) T protein:vir:41 380 -----SVRNCALLVQSGVLTPAEARERLFGLDGGPDIFMVPSKGAAKSVKRQERNYEKNQIREIRKIYA 443 (542) T ss_pred -----HHHHHHHHHhCCCCCHHHHHHhhCCCCCCCccccccccccccccccCCcCCCCCchhhhhhccc Confidence 1234556899999999999976632221100 01111111111100 00000111 No 23 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=99.82 E-value=2.6e-20 Score=127.91 Aligned_cols=362 Identities=11% Similarity=0.044 Sum_probs=205.9 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC--cchH-H----HHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG--IDDE-P----AFWSRW 73 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~--~~d~-~----~i~~~~ 73 (418) +.....+.+.+ |++.. +..| ...+. ..|.+++.+.++|+.+|+++-+-++.+.. ++.. . .+..-+ T Consensus 15 ~~~~~~~~~~~-~~~~~-~~~g--~~v~~----~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~~~~~~lL 86 (413) T protein:vir:48 15 VTTPAELAEAI-GLSYD-TYTG--KRISS----QRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTLKTRVVDERLHKLV 86 (413) T ss_pred ccchHHHHHhh-hcCcc-cccC--ceech----hhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcceeecccHHHHHH Confidence 22222223322 21111 0111 11122 23456888999999999999998887731 1111 1 112122 Q ss_pred ----HHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEE Q lcl|NC_019404. 74 ----DDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTY 148 (418) Q Consensus 74 ----~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y 148 (418) ...-.+..|.+.+. +-.++|.|++++.- + .|.+..+.++++..+++.... .+.+ .| T Consensus 87 ~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~-~----------~g~~~~L~~l~~~~v~~~~~~-------~~~~-~y 147 (413) T protein:vir:48 87 SAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVK-A----------LGEVVELLPIDPGCVEPKLNS-------QWQP-VY 147 (413) T ss_pred HhhccCCCCHHHHHHHHHHHHhhcCceEEEEEe-C----------CCcEEEEEEEcCceEEEEEcC-------CceE-EE Confidence 11222334444444 45668999988642 2 245678888888887764321 1222 35 Q ss_pred EEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHH Q lcl|NC_019404. 149 RITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLA 226 (418) Q Consensus 149 ~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~ 226 (418) .+...++. ...++++.|+||.+..+ ...+|.|++.. +.+.|.....+......+++....+ ++++++. T Consensus 148 ~~~~~~g~-~~~~~~~evih~~~~~~-------d~~~G~s~i~~-~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~- 217 (413) T protein:vir:48 148 QVTFPDGS-VDVLTQDEIWHVRTLTL-------DGLVGLNPIAY-AREAISLAAATEEHGARLFGNGAVTSGVLRTEQK- 217 (413) T ss_pred EEEecCce-EEEEccccEEEecCcCC-------CCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC- Confidence 55443322 24689999999965431 23579999975 7899999999998898888876543 4555531 Q ss_pred HhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCH--HHHHHHHHHHHhhhhcCCeeeeeccCcccc Q lcl|NC_019404. 227 ELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI--DAFLDKKFDRIVALSGIHEIILKNKNVGGL 304 (418) Q Consensus 227 ~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl--~~~~~~~~~~iaaas~IP~t~L~G~s~~gl 304 (418) + +.+...+..+++........+.+.+++...+.+|+.++.+..+. .+..+.....||.+.+||..+|.+ ..++- T Consensus 218 --~-~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~-~~~~t 293 (413) T protein:vir:48 218 --L-TPDAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQN-TDRAT 293 (413) T ss_pred --C-CHHHHHHHHHHHHHHhcCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC-CcCCC Confidence 1 22333444555544333323334455555567898888776654 467778889999999999987744 43333 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcc---CCceE--EeCCCCCCCHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019404. 305 SSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VNA---EEWSV--EFSPLDHESSKDKAEVLEKSVNSIAALIA 375 (418) Q Consensus 305 ~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~~---~~~~~--~f~pL~~~~eke~ae~~~~~a~a~~~~~~ 375 (418) .++.++....||.. .|.|+++.+-..+ +.. .++.| .+..|...|.+++ +++++++++ T Consensus 294 ~~n~e~~~~~f~~~-------~i~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~d~~~~-------~~~~~~~~~ 359 (413) T protein:vir:48 294 FNNIEELGLGFINY-------SLVPYLTRIEQRINTGLVRESKQGKFYAKFNAGALLRGDMKSR-------FEAYATGIN 359 (413) T ss_pred cccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHH-------HHHHHHHHh Confidence 34556666777764 3788877764444 212 13444 4557766666654 667888999 Q ss_pred CCCCCHHHHHHHHHhhcCcCCC-------Chhhc----ccccccCCCccccccC Q lcl|NC_019404. 376 AGAMDIKEARDTLRTIAPEIKI-------GDNDI----QTEESELITETEVVIA 418 (418) Q Consensus 376 ~g~i~~~e~r~~l~~~~~~~~~-------~~~~~----~~~e~~~~~e~e~~~~ 418 (418) +|+++++|+|+.+. ..+..+- ..... +..+++.++.++..=| T Consensus 360 ~g~~T~NE~R~~~g-~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 412 (413) T protein:vir:48 360 WGIYSPNDCRDLED-MNPRPGGDVYLTPMNMTTSPSAGDDNGKKKESGDADKTA 412 (413) T ss_pred CCCcCHHHHHHHhC-CCCCCCcceeeccccccccccccccCCCCCCCCCccccC Confidence 99999999998763 2221110 00011 1111112211222222 No 24 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=99.82 E-value=7.9e-20 Score=125.32 Aligned_cols=370 Identities=12% Similarity=0.091 Sum_probs=207.0 Q ss_pred CccchhhHHHHhcCCCCccc-----------cCc---cccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc--Ccc Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEI-----------YGS---LQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID--GID 64 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~-----------~~~---~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~~~ 64 (418) |=-...+.|........... .|. ....+. ..+ .+++.+++||+.+|+++-+-++.+. .++ T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~---~~a-l~~~~v~~~i~~ia~~ia~l~~~~~~~~~~ 76 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKG---KNA-LKVATVFACIKILSESVSKLPLKIYQEDEY 76 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcCCCCcceech---hhh-hccHHHHHHHHHHHHhhccCceEEEEecCC Confidence 22222222211000000000 000 001111 123 3578899999999999999888872 211 Q ss_pred hH-HH----HHHHHH----HhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccc Q lcl|NC_019404. 65 DE-PA----FWSRWD----DLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNRE 134 (418) Q Consensus 65 d~-~~----i~~~~~----~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~ 134 (418) .. .. +...+. .......|.+.+.+ -.++|.|++++.- + ..|.+..|.++++.++++.... T Consensus 77 ~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~---------~~G~~~~L~~i~~~~v~v~~~~ 146 (429) T protein:vir:10 77 GIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEF-D---------RKGKVQALWPIDASKVTVYIDD 146 (429) T ss_pred ceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEEcCceeEEEEcC Confidence 11 11 121221 11223345555555 4678999998753 2 3356778999999888764332 Q ss_pred ccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 135 ENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRR 214 (418) Q Consensus 135 ~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~ 214 (418) .... .++....|.+..++. ...++++.||||.... +....+|.||+.. +.+.|.....+.......+.. T Consensus 147 ~~~~--~~~~~~~~~~~~~g~--~~~~~~~evih~~~~~------~~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~~~n 215 (429) T protein:vir:10 147 VGLL--NSKTKMWYVVNTGGQ--QRVLKPEEILHFKNGI------TLDGLVGVPTMEY-LKSTLENSASADKFINNFYKQ 215 (429) T ss_pred cccc--cccceEEEEEccCCe--EEEEccccEEEecCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhc Confidence 2111 122223455544332 3579999999995321 2344679999975 788899999998889888877 Q ss_pred cCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhc Q lcl|NC_019404. 215 KQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSG 290 (418) Q Consensus 215 ~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~ 290 (418) ...+ ++++++ . + +.+...+..++++.......+.+.+++..++.+|++++.+..+ +-+..++..+.||.+.+ T Consensus 216 g~~~~~il~~~~--~-l-~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fg 291 (429) T protein:vir:10 216 GLQVKGLVQYVG--D-L-NEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFG 291 (429) T ss_pred cCCccEEEEcCC--C-C-CHHHHHHHHHHHHHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhC Confidence 6544 445543 1 2 2233344555555433332333444555556788888876654 34557788999999999 Q ss_pred CCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----h----ccCCceEEeC--CCCCCCHHHHH Q lcl|NC_019404. 291 IHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----V----NAEEWSVEFS--PLDHESSKDKA 360 (418) Q Consensus 291 IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~----~~~~~~~~f~--pL~~~~eke~a 360 (418) ||..+| |...++-.++.++....|+.. .|.|+++.+-..+ + +..++.++|+ .|...|.+++ T Consensus 292 VP~~~l-g~~~~~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~- 362 (429) T protein:vir:10 292 IKMHQL-NDLSKATLNNIEQQQQQFYTD-------TLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTR- 362 (429) T ss_pred CCHHHh-CCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHH- Confidence 999766 444333344556666777653 4788776665544 2 1235556654 7877787765 Q ss_pred HHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChh-----------h-----cc---cccccCCCccccc Q lcl|NC_019404. 361 EVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDN-----------D-----IQ---TEESELITETEVV 416 (418) Q Consensus 361 e~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~-----------~-----~~---~~e~~~~~e~e~~ 416 (418) ++++++++++|++|++|+|+.+. ..+..+ .|+ . .+ +.++...+.+|+- T Consensus 363 ------~~~~~~~~~~G~~T~NE~R~~~g-l~p~~g-gD~~~~~~n~~~~d~~~~~~~k~g~~~~~~~~~~~e~~ 429 (429) T protein:vir:10 363 ------YEAYRTGIQGGFLKPNEARSKED-LPPEAG-GDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 429 (429) T ss_pred ------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCC-cCeeeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 66788899999999999998763 222111 110 0 00 0111111112222 No 25 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=99.81 E-value=9.8e-20 Score=124.80 Aligned_cols=371 Identities=12% Similarity=0.085 Sum_probs=208.1 Q ss_pred CccchhhHHHHhcCCCCccc--------------cC---ccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc-- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEI--------------YG---SLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID-- 61 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~--------------~~---~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~-- 61 (418) |==.|-+.+.|....+.... .| .....+.. . +.+++.+.+||+..|+++-+-++.+. T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~---~-al~~~~v~~~i~~ia~~ia~lp~~~~~~ 76 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGK---N-ALKVATVFACIKILSESVSKLPLKIYQE 76 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchh---h-hhccHHHHHHHHHHHHhhccCceEEEEe Confidence 33333333332111000000 00 00011111 2 24478889999999999999888872 Q ss_pred Ccch-HHH----HHHHHH----HhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccc Q lcl|NC_019404. 62 GIDD-EPA----FWSRWD----DLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ 131 (418) Q Consensus 62 ~~~d-~~~----i~~~~~----~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~ 131 (418) +++. +.. +...+. ..-.+..|.+.+. .-.++|.|++++.- + ..|.+..|.++++.++.+. T Consensus 77 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~---------~~G~~~~L~~i~~~~v~v~ 146 (432) T protein:vir:10 77 DEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEF-D---------RKGKVQALWPIDASKVTVY 146 (432) T ss_pred cCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEEcCceeEEE Confidence 2111 111 222221 1222344444444 45678999998753 2 3456788999999888764 Q ss_pred cccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 132 NREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQL 211 (418) Q Consensus 132 ~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l 211 (418) ....... .++....|.+..++. ...++++.||||.... +...++|.|++.. +.+.|.....+....... T Consensus 147 ~d~~~~~--~~~~~~~y~~~~~g~--~~~~~~~eiih~r~~~------~~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~ 215 (432) T protein:vir:10 147 IDDVGLL--NSKTKMWYVVNTGGQ--QRVLKPEEILHFKNGI------TLDGLVGVPTMEY-LKSTLENSASADKFINNF 215 (432) T ss_pred EcCcccc--cccceEEEEEecCCe--EEEEccccEEEecCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHH Confidence 3221111 122223444544332 3569999999995321 2345679999975 788999999999999998 Q ss_pred HHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCH--HHHHHHHHHHHhh Q lcl|NC_019404. 212 LRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI--DAFLDKKFDRIVA 287 (418) Q Consensus 212 ~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl--~~~~~~~~~~iaa 287 (418) +.....+ ++++++ . + +++...+..+++........+.+.+++...+.+|++++.+..+. -+..++..+.||. T Consensus 216 ~~ng~~p~gil~~~~--~-l-~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 291 (432) T protein:vir:10 216 YKQGLQVKGLVQYVG--D-L-NEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIAT 291 (432) T ss_pred HhccCCccEEEEcCC--C-C-CHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHH Confidence 8876544 445543 1 2 22333445555544333222334444555567898888776554 3667788899999 Q ss_pred hhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hc----cCCceEEe--CCCCCCCHH Q lcl|NC_019404. 288 LSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VN----AEEWSVEF--SPLDHESSK 357 (418) Q Consensus 288 as~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~----~~~~~~~f--~pL~~~~ek 357 (418) +.|||..+| |....|-.++.++....||.. .|+|.+..+-..+ +. ..++.|+| ..|...|.+ T Consensus 292 ~fgVP~~~l-g~~~~~~~s~~e~~~~~~~~~-------~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~ 363 (432) T protein:vir:10 292 AFGIKMHQL-NDLSKATLNNIEQQQQQFYTD-------TLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIK 363 (432) T ss_pred HhCCCHHHh-CCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHH Confidence 999999777 444333334556666677654 4788877765544 21 23455554 478878887 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC----------hhhc--------ccccccCCCccccc Q lcl|NC_019404. 358 DKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIG----------DNDI--------QTEESELITETEVV 416 (418) Q Consensus 358 e~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~----------~~~~--------~~~e~~~~~e~e~~ 416 (418) ++ ++++++++++|++|++|+|+.+. ..|..+-+ .+.. ++.++...+.+|+= T Consensus 364 ~~-------~~~~~~~~~~G~~t~NE~R~~~g-~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 364 TR-------YEAYRTGIQGGFLKPNEARSKED-LPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred HH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeEeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 65 56788899999999999998763 22211110 0000 00011111111222 No 26 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=99.81 E-value=9.8e-20 Score=124.80 Aligned_cols=371 Identities=12% Similarity=0.085 Sum_probs=208.1 Q ss_pred CccchhhHHHHhcCCCCccc--------------cC---ccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc-- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEI--------------YG---SLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID-- 61 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~--------------~~---~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~-- 61 (418) |==.|-+.+.|....+.... .| .....+.. . +.+++.+.+||+..|+++-+-++.+. T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~---~-al~~~~v~~~i~~ia~~ia~lp~~~~~~ 76 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGK---N-ALKVATVFACIKILSESVSKLPLKIYQE 76 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchh---h-hhccHHHHHHHHHHHHhhccCceEEEEe Confidence 33333333332111000000 00 00011111 2 24478889999999999999888872 Q ss_pred Ccch-HHH----HHHHHH----HhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccc Q lcl|NC_019404. 62 GIDD-EPA----FWSRWD----DLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ 131 (418) Q Consensus 62 ~~~d-~~~----i~~~~~----~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~ 131 (418) +++. +.. +...+. ..-.+..|.+.+. .-.++|.|++++.- + ..|.+..|.++++.++.+. T Consensus 77 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~---------~~G~~~~L~~i~~~~v~v~ 146 (432) T protein:vir:10 77 DEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEF-D---------RKGKVQALWPIDASKVTVY 146 (432) T ss_pred cCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEEcCceeEEE Confidence 2111 111 222221 1222344444444 45678999998753 2 3456788999999888764 Q ss_pred cccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 132 NREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQL 211 (418) Q Consensus 132 ~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l 211 (418) ....... .++....|.+..++. ...++++.||||.... +...++|.|++.. +.+.|.....+....... T Consensus 147 ~d~~~~~--~~~~~~~y~~~~~g~--~~~~~~~eiih~r~~~------~~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~ 215 (432) T protein:vir:10 147 IDDVGLL--NSKTKMWYVVNTGGQ--QRVLKPEEILHFKNGI------TLDGLVGVPTMEY-LKSTLENSASADKFINNF 215 (432) T ss_pred EcCcccc--cccceEEEEEecCCe--EEEEccccEEEecCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHH Confidence 3221111 122223444544332 3569999999995321 2345679999975 788999999999999998 Q ss_pred HHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCH--HHHHHHHHHHHhh Q lcl|NC_019404. 212 LRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI--DAFLDKKFDRIVA 287 (418) Q Consensus 212 ~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl--~~~~~~~~~~iaa 287 (418) +.....+ ++++++ . + +++...+..+++........+.+.+++...+.+|++++.+..+. -+..++..+.||. T Consensus 216 ~~ng~~p~gil~~~~--~-l-~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 291 (432) T protein:vir:10 216 YKQGLQVKGLVQYVG--D-L-NEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIAT 291 (432) T ss_pred HhccCCccEEEEcCC--C-C-CHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHH Confidence 8876544 445543 1 2 22333445555544333222334444555567898888776554 3667788899999 Q ss_pred hhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hc----cCCceEEe--CCCCCCCHH Q lcl|NC_019404. 288 LSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VN----AEEWSVEF--SPLDHESSK 357 (418) Q Consensus 288 as~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~----~~~~~~~f--~pL~~~~ek 357 (418) +.|||..+| |....|-.++.++....||.. .|+|.+..+-..+ +. ..++.|+| ..|...|.+ T Consensus 292 ~fgVP~~~l-g~~~~~~~s~~e~~~~~~~~~-------~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~ 363 (432) T protein:vir:10 292 AFGIKMHQL-NDLSKATLNNIEQQQQQFYTD-------TLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIK 363 (432) T ss_pred HhCCCHHHh-CCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHH Confidence 999999777 444333334556666677654 4788877765544 21 23455554 478878887 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC----------hhhc--------ccccccCCCccccc Q lcl|NC_019404. 358 DKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIG----------DNDI--------QTEESELITETEVV 416 (418) Q Consensus 358 e~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~----------~~~~--------~~~e~~~~~e~e~~ 416 (418) ++ ++++++++++|++|++|+|+.+. ..|..+-+ .+.. ++.++...+.+|+= T Consensus 364 ~~-------~~~~~~~~~~G~~t~NE~R~~~g-~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 364 TR-------YEAYRTGIQGGFLKPNEARSKED-LPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred HH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeEeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 65 56788899999999999998763 22211110 0000 00011111111222 No 27 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=99.81 E-value=9.8e-20 Score=124.80 Aligned_cols=371 Identities=12% Similarity=0.085 Sum_probs=208.1 Q ss_pred CccchhhHHHHhcCCCCccc--------------cC---ccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc-- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEI--------------YG---SLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID-- 61 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~--------------~~---~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~-- 61 (418) |==.|-+.+.|....+.... .| .....+.. . +.+++.+.+||+..|+++-+-++.+. T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~---~-al~~~~v~~~i~~ia~~ia~lp~~~~~~ 76 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGK---N-ALKVATVFACIKILSESVSKLPLKIYQE 76 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchh---h-hhccHHHHHHHHHHHHhhccCceEEEEe Confidence 33333333332111000000 00 00011111 2 24478889999999999999888872 Q ss_pred Ccch-HHH----HHHHHH----HhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccc Q lcl|NC_019404. 62 GIDD-EPA----FWSRWD----DLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ 131 (418) Q Consensus 62 ~~~d-~~~----i~~~~~----~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~ 131 (418) +++. +.. +...+. ..-.+..|.+.+. .-.++|.|++++.- + ..|.+..|.++++.++.+. T Consensus 77 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~---------~~G~~~~L~~i~~~~v~v~ 146 (432) T protein:vir:10 77 DEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEF-D---------RKGKVQALWPIDASKVTVY 146 (432) T ss_pred cCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEEcCceeEEE Confidence 2111 111 222221 1222344444444 45678999998753 2 3456788999999888764 Q ss_pred cccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 132 NREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQL 211 (418) Q Consensus 132 ~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l 211 (418) ....... .++....|.+..++. ...++++.||||.... +...++|.|++.. +.+.|.....+....... T Consensus 147 ~d~~~~~--~~~~~~~y~~~~~g~--~~~~~~~eiih~r~~~------~~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~ 215 (432) T protein:vir:10 147 IDDVGLL--NSKTKMWYVVNTGGQ--QRVLKPEEILHFKNGI------TLDGLVGVPTMEY-LKSTLENSASADKFINNF 215 (432) T ss_pred EcCcccc--cccceEEEEEecCCe--EEEEccccEEEecCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHH Confidence 3221111 122223444544332 3569999999995321 2345679999975 788999999999999998 Q ss_pred HHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCH--HHHHHHHHHHHhh Q lcl|NC_019404. 212 LRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI--DAFLDKKFDRIVA 287 (418) Q Consensus 212 ~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl--~~~~~~~~~~iaa 287 (418) +.....+ ++++++ . + +++...+..+++........+.+.+++...+.+|++++.+..+. -+..++..+.||. T Consensus 216 ~~ng~~p~gil~~~~--~-l-~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 291 (432) T protein:vir:10 216 YKQGLQVKGLVQYVG--D-L-NEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIAT 291 (432) T ss_pred HhccCCccEEEEcCC--C-C-CHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHH Confidence 8876544 445543 1 2 22333445555544333222334444555567898888776554 3667788899999 Q ss_pred hhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hc----cCCceEEe--CCCCCCCHH Q lcl|NC_019404. 288 LSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VN----AEEWSVEF--SPLDHESSK 357 (418) Q Consensus 288 as~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~----~~~~~~~f--~pL~~~~ek 357 (418) +.|||..+| |....|-.++.++....||.. .|+|.+..+-..+ +. ..++.|+| ..|...|.+ T Consensus 292 ~fgVP~~~l-g~~~~~~~s~~e~~~~~~~~~-------~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~ 363 (432) T protein:vir:10 292 AFGIKMHQL-NDLSKATLNNIEQQQQQFYTD-------TLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIK 363 (432) T ss_pred HhCCCHHHh-CCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHH Confidence 999999777 444333334556666677654 4788877765544 21 23455554 478878887 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC----------hhhc--------ccccccCCCccccc Q lcl|NC_019404. 358 DKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIG----------DNDI--------QTEESELITETEVV 416 (418) Q Consensus 358 e~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~----------~~~~--------~~~e~~~~~e~e~~ 416 (418) ++ ++++++++++|++|++|+|+.+. ..|..+-+ .+.. ++.++...+.+|+= T Consensus 364 ~~-------~~~~~~~~~~G~~t~NE~R~~~g-~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 364 TR-------YEAYRTGIQGGFLKPNEARSKED-LPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred HH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeEeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 65 56788899999999999998763 22211110 0000 00011111111222 No 28 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.81 E-value=1.1e-19 Score=124.50 Aligned_cols=382 Identities=15% Similarity=0.167 Sum_probs=197.0 Q ss_pred CccchhhHH-----------HHhcCCCCcc-ccCccccCCHHHHHH---HHHcCCccchhhhcchhhhcc---------- Q lcl|NC_019404. 1 MVKTDSYAN-----------IFLGGSDGSE-IYGSLQNQAPTILAS---LYADNALVRRIIDTIPETALA---------- 55 (418) Q Consensus 1 ~~~~D~~~n-----------~~~g~~~~~~-~~~~~~~~~~~~l~~---~Y~~~~~~r~iVd~~a~d~~r---------- 55 (418) =+..|.+.- .+.|.-.... +...+...++..+.. .|..|+++++||++.++...+ T Consensus 37 ~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~ 116 (551) T protein:vir:80 37 QREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEK 116 (551) T ss_pred cccHHHHHHhhccCcceeecccccceecCcccccCccccChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcC Confidence 111111111 1111111111 111233345555544 689999999999999987754 Q ss_pred -CCccccCc--------chH---HHHHHHHHHhCc--------hHHHHH-HHHhccccceEEEEEeecCCCcccccccCC Q lcl|NC_019404. 56 -AGFHIDGI--------DDE---PAFWSRWDDLEM--------TQNIND-AWSWARLFGGAAIVAIVKDNRALTSPVREG 114 (418) Q Consensus 56 -~~~~i~~~--------~d~---~~i~~~~~~l~~--------~~~~~~-a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~ 114 (418) -+|.|.-. .+. ..++.-+.+.+. +..|.+ .+....++|.|++.+.- + .. T Consensus 117 g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~r-d---------~~ 186 (551) T protein:vir:80 117 GVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVF-N---------RN 186 (551) T ss_pred CCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHHHHHHHHHHHHHhcCCEEEEEEE-C---------CC Confidence 23444211 011 123334444432 123444 44455678999887653 2 33 Q ss_pred CceEEEEEeeccccccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHH Q lcl|NC_019404. 115 AELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDI 194 (418) Q Consensus 115 ~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~ 194 (418) |.+..|.++++.+|.+..... -.- .-.+..|..... ++....+.++.||||...+.+. .....+|.||+.. + T Consensus 187 G~~~~L~~l~p~~V~v~~~~~-g~~--~~~~~~y~~~~~-g~~~~~~~~~eiiH~~~n~~~~---~~~~~~G~spi~~-a 258 (551) T protein:vir:80 187 QSMVRFVAKDPTTIFFATTAD-GKI--PDNGNRFVQVID-QKIVATFNAREMAFAVRNPRSD---IYATGYGYPELEI-A 258 (551) T ss_pred CcEEEEEEeCCceeEEEECCc-ccc--ccCceEEEEEeC-CcEEEEEcccceEEecccCCCC---cccccccccHHHH-H Confidence 567889999998887643110 000 001122322222 2233468899999998765432 2234579999975 7 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcce-eEEEcCCCceeEeeccc Q lcl|NC_019404. 195 LDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQA-IGIDAESEEYSVLNSDI 271 (418) Q Consensus 195 ~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~-~~~d~~~e~~~~~~~~~ 271 (418) .+.|.....+......++.....+ ++++++-..+ +.+....+++++...-....+.+. .++.+++-+|+.++.+. T Consensus 259 ~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~l--t~e~~~~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~ 336 (551) T protein:vir:80 259 LKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQ--SQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSA 336 (551) T ss_pred HHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCC--CHHHHHHHHHHHHHHhcCccccCccccccCCCceEEEccCCh Confidence 899999988888888888775543 2444421111 122233444444432222222233 45555555677776655 Q ss_pred CC--HHHHHHHHHHHHhhhhcCCeeeeeccCccc--------cc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-- Q lcl|NC_019404. 272 GG--IDAFLDKKFDRIVALSGIHEIILKNKNVGG--------LS-SSQNTALETFHKLIDRKRNAELLPILEFLIPFI-- 338 (418) Q Consensus 272 ~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~g--------l~-stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i-- 338 (418) .+ +-+...+..+.||.+.+||..+|.-..-++ ++ |+-+.....|+ +..|.|++.++-..| T Consensus 337 ~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~-------~~tL~P~~~~ie~~ln~ 409 (551) T protein:vir:80 337 RDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASK-------NKGLQPLLGFIEDFINK 409 (551) T ss_pred hHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHH-------HHHHHHHHHHHHHHHHh Confidence 43 445577888999999999987663222211 11 12222222333 344778776654433 Q ss_pred --hc--cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhh------------c Q lcl|NC_019404. 339 --VN--AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDND------------I 402 (418) Q Consensus 339 --~~--~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~------------~ 402 (418) +. ...+.|+|+.+...+.+++++ +..++.+|++|++|+|+.+.- .+..+-+|.- . T Consensus 410 ~L~~~~~~~~~f~f~~~~~~~~~~~~~--------~~~~~~~g~lT~NE~R~~~gl-~P~~egGD~~~~~~~~~~~~~~~ 480 (551) T protein:vir:80 410 HIVAEFGDKYTFQFVGGDIKSELESVK--------ILAEKAKVAMTVNEVRKELNL-PGDVIGGDIPLNGVIVQRIGQLM 480 (551) T ss_pred hhccccCCceEEEeeccChhhHHHHHH--------HHHHHhcCCcCHHHHHHHhCC-CCCCCCCceeecccccccccccc Confidence 21 246889999887766655432 223566799999999987632 1211101100 0 Q ss_pred ccc--c------------------ccCCCccccccC Q lcl|NC_019404. 403 QTE--E------------------SELITETEVVIA 418 (418) Q Consensus 403 ~~~--e------------------~~~~~e~e~~~~ 418 (418) ... + ..++++.|---+ T Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 516 (551) T protein:vir:80 481 QQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGK 516 (551) T ss_pred cccCcchhhhhhccccccCcCCCCCCCCCCCCCCcc Confidence 000 0 001111111111 No 29 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=99.81 E-value=4e-20 Score=126.92 Aligned_cols=370 Identities=13% Similarity=0.092 Sum_probs=206.5 Q ss_pred CccchhhHHHHhcCCCCccccC-----cc--ccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc--CcchH----- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYG-----SL--QNQAPTILASLYADNALVRRIIDTIPETALAAGFHID--GIDDE----- 66 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~-----~~--~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~~~d~----- 66 (418) |==.|-+.+.+........... .. ...+.. -..+++-+.++|+.+|+++-+-++.+- +++.. T Consensus 1 MG~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~----~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~ 76 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVDMTNPLLLQWLGVDPDTPR----NQLSEATYFACLKILSESLGKLPLKMYQKTERGIVKSDR 76 (411) T ss_pred CchHHHHHhhccCcccccccchHHHHHHhcCcccChh----hhhccHHHHHHHHHHHHhHhhCceeEEEecCCceeeecc Confidence 1111111111111000000000 00 001111 123567789999999999999888872 22111 Q ss_pred HHHHHHHH----HhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccc Q lcl|NC_019404. 67 PAFWSRWD----DLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNAR 141 (418) Q Consensus 67 ~~i~~~~~----~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~ 141 (418) ..+...+. ..-.+..|.+.+.+ -.++|.|++++..++ |.+..+.++++..+++........... T Consensus 77 ~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-----------g~~~~l~~l~~~~v~~~~~~~~~~~~~ 145 (411) T protein:vir:81 77 EELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSG-----------PQLQALWILPSQYVTIVVDDRGLLGEK 145 (411) T ss_pred cHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-----------CceEEEEEECCceEEEEEcCccccccc Confidence 11222221 22233445555554 567899999875432 446678899988887653321111111 Q ss_pred cCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--e Q lcl|NC_019404. 142 FGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--V 219 (418) Q Consensus 142 yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v 219 (418) ..-.|.+.....+....++++.||||.... +....+|.|++.. +.+.+.....+.......+.....+ + T Consensus 146 --~~~~~~~~~~~~g~~~~~~~~eiih~k~~~------~~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gi 216 (411) T protein:vir:81 146 --NAIWYRYNDPYDGKMYVFRNDEILHFKTSV------TFDGITGLSVRDV-LKHTVDGALESQKFMNNLYKTGLTGKAV 216 (411) T ss_pred --ceEEEEEEecCCceEEEEccccEEEEcCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceE Confidence 122455544333344568999999996322 1234679999975 7789999999999999988776544 3 Q ss_pred eecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeeee Q lcl|NC_019404. 220 WKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIILK 297 (418) Q Consensus 220 ~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L~ 297 (418) +++++ . + +.+...+.++++........+.+.+++..++.+|++++.+.. .+-+..++....||++.+||..+| T Consensus 217 l~~~~--~-l-~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l- 291 (411) T protein:vir:81 217 LEYTG--D-L-NQEARDRLVKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQI- 291 (411) T ss_pred EEeCC--C-C-CHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHh- Confidence 45542 1 2 223344556666554433233344556566678988887664 345677788999999999998766 Q ss_pred ccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--------ccCCc--eEEeCCCCCCCHHHHHHHHHHHH Q lcl|NC_019404. 298 NKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV--------NAEEW--SVEFSPLDHESSKDKAEVLEKSV 367 (418) Q Consensus 298 G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~--------~~~~~--~~~f~pL~~~~eke~ae~~~~~a 367 (418) |...++-.++.+.....|+.. .|.|.++.+-..+- +..+. +|++..|...|.+++ + T Consensus 292 g~~~~~t~~n~e~~~~~f~~~-------~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~-------~ 357 (411) T protein:vir:81 292 NDYEKSSYASAEAQNLAFYVD-------TLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQ-------M 357 (411) T ss_pred CCCCCCCchhHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHH-------H Confidence 555444335566666677654 47888776655442 12344 445556666776654 6 Q ss_pred HHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhh--------cccccccCCCcccc Q lcl|NC_019404. 368 NSIAALIAAGAMDIKEARDTLRTIAPEIKIGDND--------IQTEESELITETEV 415 (418) Q Consensus 368 ~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~--------~~~~e~~~~~e~e~ 415 (418) +++++++++|++|++|+|+.+. ..+..+ +|+- ++.........+|- T Consensus 358 ~~~~~~~~~g~~t~NE~R~~~g-l~p~~g-gD~~~~~~n~~pl~~~~~~~~kgGd~ 411 (411) T protein:vir:81 358 DSLSTAVQNGIMTPNEARDYLD-MPADDY-GNNLMANGNYIPLSMLGANYGKGGDS 411 (411) T ss_pred HHHHHHHhCCCcCHHHHHHHhC-CCCCCC-CCeeeeccCccchhhhhhhhccCCCC Confidence 7888999999999999998763 222111 1110 11100111111111 No 30 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=99.81 E-value=1.6e-19 Score=123.65 Aligned_cols=387 Identities=13% Similarity=0.136 Sum_probs=202.8 Q ss_pred Cccchhh----------------HHHHhc--C--CCCccc---cCccccCCHHHHHHHHHcCCccchhhhcchhhhcc-- Q lcl|NC_019404. 1 MVKTDSY----------------ANIFLG--G--SDGSEI---YGSLQNQAPTILASLYADNALVRRIIDTIPETALA-- 55 (418) Q Consensus 1 ~~~~D~~----------------~n~~~g--~--~~~~~~---~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r-- 55 (418) ..+.=.+ .--+.| . +..... ......+++.++.++|..++++++||++.++.... T Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~~~~~~~~~~~~i~t~~~~va~~~ 100 (535) T protein:vir:10 21 YIELGDYDKDIVNKAIRPGRASARDTVDGIDIADGNVAGQYSVASISDVLSTKKLLKAYADNDIVQAIIRTRTNQVLTYS 100 (535) T ss_pred hHHHhhhhHHHHHhhhhhhhhhhhccccccccccCCcccccccCccccccCHHHHHHHhccChhHHHHHHHHHHHHHHHH Confidence 0000000 000111 0 000011 11223357889999999999999999988877542 Q ss_pred ---------CCcccc---Cc---ch--HH---HHHHHHH----HhC----chHHHHHH-HHhccccce-EEEEEeecCCC Q lcl|NC_019404. 56 ---------AGFHID---GI---DD--EP---AFWSRWD----DLE----MTQNINDA-WSWARLFGG-AAIVAIVKDNR 105 (418) Q Consensus 56 ---------~~~~i~---~~---~d--~~---~i~~~~~----~l~----~~~~~~~a-~~~~rl~G~-~~i~i~~~d~~ 105 (418) .++.+. .+ .. .. .+...+. .+. .+..|... +....++|+ +++++ +++ T Consensus 101 ~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~~i-~r~-- 177 (535) T protein:vir:10 101 NPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQINIERI-FKN-- 177 (535) T ss_pred HHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCceEEEE-EEC-- Confidence 123331 11 11 11 1222222 111 22334443 344566765 55555 333 Q ss_pred cccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccC Q lcl|NC_019404. 106 ALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGW 185 (418) Q Consensus 106 ~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~ 185 (418) ..|.+..|.++++..|.+.. |+... .+-+.+|++..++ ....+.++.||||.+.+.+. ....++ T Consensus 178 -------~~G~~~~L~~l~p~~V~v~~---d~~~~-~~~~~~~~~~~~~--~~~~~~~~eiih~~~~~~~~---~~~~~~ 241 (535) T protein:vir:10 178 -------DSNELDHFNAVDASKVVISY---SPRSK-DQPRKFEQFVSET--KSVKFSERNLTFINYWNLSD---TDRRGY 241 (535) T ss_pred -------CCCcEEEEEEeCCceeEEEE---cCccc-cCceEEEEEecCc--eeEEECcccEEEEeccCCCC---cccccc Confidence 33567789999988887643 11111 1123445554433 23578999999998765433 223457 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCC-cceeEEEcCCC Q lcl|NC_019404. 186 GRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGV-GQAIGIDAESE 262 (418) Q Consensus 186 G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~-~~~~~~d~~~e 262 (418) |.||+.. +.+.|.....+......++.....+ ++++++......+.+..+.+.+.+...-...++ +...++.+++- T Consensus 242 G~Spi~~-~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g~ 320 (535) T protein:vir:10 242 GYSPVEA-SIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKDA 320 (535) T ss_pred cccHHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCCc Confidence 9999975 7899999999999999988775543 667664211111122223333333332222122 23345555556 Q ss_pred ceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccC-ccccccchhHHHHHHHHHHHHHHHH----HHHHHHHHHH Q lcl|NC_019404. 263 EYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKN-VGGLSSSQNTALETFHKLIDRKRNA----ELLPILEFLI 335 (418) Q Consensus 263 ~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s-~~gl~stge~d~~~y~~~I~~~Qe~----~l~p~l~~l~ 335 (418) +|+.++.+..+ +-+...+....||.+.+||..+| |.. .+..++........|.+.++..+.. .|.|++..+- T Consensus 321 ~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~l-G~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie 399 (535) T protein:vir:10 321 KFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEI-NFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIE 399 (535) T ss_pred eEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-ccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 77777766543 44556678899999999999766 655 4444333333334455555555443 3778776665 Q ss_pred HHh----hc--cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC-------hh-- Q lcl|NC_019404. 336 PFI----VN--AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIG-------DN-- 400 (418) Q Consensus 336 ~~i----~~--~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~-------~~-- 400 (418) ..| +. ..++.|+|+.|...+.++++++ .+.+ .+|++|++|+|+.+. ..+..+-+ .. T Consensus 400 ~~ln~~Ll~~~~~~~~f~f~~l~~~d~~~r~~~-------~~~~-~~g~lT~NE~R~~~g-l~piegGD~~~~~~~~~~~ 470 (535) T protein:vir:10 400 QVINDKIMRYVDTDYRFSFTLGDAQDKLQEEQV-------WKLK-LANGYFINEYRKDHG-LKTVDGLDVPGFIGSAENF 470 (535) T ss_pred HHHhhhcccccCCeEEEEeccccccCHHHHHHH-------HHHH-HcCCCCHHHHHHHhC-CCCCCCccccccccchhhc Confidence 444 22 1368899999998888776543 3322 357799999998762 11110000 00 Q ss_pred --------------------hcccccccCCCccccccC Q lcl|NC_019404. 401 --------------------DIQTEESELITETEVVIA 418 (418) Q Consensus 401 --------------------~~~~~e~~~~~e~e~~~~ 418 (418) .+++.+.+ +.+.+..-+ T Consensus 471 ~~~~~~~~~~~p~~~~~~~~~~~~~~~q-~~~~~~~~~ 507 (535) T protein:vir:10 471 INATGFGQPNVPDSSDDSGSTLGERERQ-ERIQHSKDY 507 (535) T ss_pred ccccccccccCCCCCCCccccCCccccC-ccccccccc Confidence 00000000 000000000 No 31 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=99.81 E-value=5.5e-20 Score=126.16 Aligned_cols=366 Identities=11% Similarity=0.084 Sum_probs=207.3 Q ss_pred CccchhhHHHHhcCCCCcc------------------ccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSE------------------IYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID- 61 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~------------------~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~- 61 (418) |+-+ ||....+ ..+.............|..++.+++||+.+++++-+-++.+. T Consensus 1 ~~~~--------~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~ 72 (518) T protein:vir:78 1 MLLA--------NGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) T ss_pred Cccc--------CceeeccchhhhhhhhhhhcccccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEE Confidence 2222 2221110 011111111233345688899999999999999999888872 Q ss_pred -Ccch-HH----HHHHHHHH---hCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccc Q lcl|NC_019404. 62 -GIDD-EP----AFWSRWDD---LEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ 131 (418) Q Consensus 62 -~~~d-~~----~i~~~~~~---l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~ 131 (418) ..+. .. .+..-+.+ .-....|.+.+.. -.++|.|++++.- + ..|.+..|.++++..+++. T Consensus 73 ~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r-~---------~~G~~~~L~~l~p~~Vtv~ 142 (518) T protein:vir:78 73 TSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQK-N---------KSGTPEKLMPMHPSRVAIK 142 (518) T ss_pred EcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEE-c---------CCCcEEEEEEECCCceEEE Confidence 1111 11 11111111 1223445555544 4568999998753 2 3356778899998888765 Q ss_pred cccccccccccCcceEEEEecCCc--ccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 132 NREENPRNARFGKPLTYRITTNES--DMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLAT 209 (418) Q Consensus 132 ~~~~dp~s~~yg~p~~y~i~~~~~--~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~ 209 (418) ... .+....|+++...+ .....++++.||||.+.- +....+|.|++.. +.+.|.....+..... T Consensus 143 ~~~-------~~~~~~y~~~~~~~~~~~~~~~~~~eIiHir~~~------~dg~~~G~Spi~~-~~~~i~~~~aa~~~~~ 208 (518) T protein:vir:78 143 RNS-------RTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFN------PDGLERGLSLMES-LKSTIFSEDSSRNATA 208 (518) T ss_pred EcC-------CCCEEEEEEEecCCccceeEEecCCcEEEecCCC------CCcccccccHHHH-HHHHHHHHHHHHHHHH Confidence 432 12234555553322 233468899999996432 2233469999974 7889999999999998 Q ss_pred HHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHH Q lcl|NC_019404. 210 QLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRI 285 (418) Q Consensus 210 ~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~i 285 (418) .++.....+ ++++++. + +++...+.+++++.......+.+.+++..++.+|+.++.+..+ +-+...+....| T Consensus 209 ~~f~Ng~~p~gvl~~~~~---l-s~e~~~~~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eI 284 (518) T protein:vir:78 209 AMWKNAGRPNLVLRHEKR---L-SPEAQQRLREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEV 284 (518) T ss_pred HHHhcCCCccEEEecCCC---C-CHHHHHHHHHHHHHHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHH Confidence 888776554 5566531 2 2233344555554443332333445555566789888876543 456677888999 Q ss_pred hhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hc---cCCceEEe--CCCCCCCHH Q lcl|NC_019404. 286 VALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VN---AEEWSVEF--SPLDHESSK 357 (418) Q Consensus 286 aaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~---~~~~~~~f--~pL~~~~ek 357 (418) |.+.+||..+| |...++-.++-+.....||.. .|.|++..+-..| +. ..+..++| ..|...|.+ T Consensus 285 a~afgVPp~~l-g~~~~st~sn~e~~~~~f~~~-------tL~P~~~~ie~eln~~L~~~~~~~~~~~fd~~~Llr~D~~ 356 (518) T protein:vir:78 285 CGVYDIAPPIV-HILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWE 356 (518) T ss_pred HHHhCCCHHHh-ccCCCCCchhHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcccccCcceEEeechhhhccCHH Confidence 99999998766 655443334456666667654 3677766664433 21 12444555 477777776 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCC-CChh----------------hcccccccCCC--------- Q lcl|NC_019404. 358 DKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIK-IGDN----------------DIQTEESELIT--------- 411 (418) Q Consensus 358 e~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~-~~~~----------------~~~~~e~~~~~--------- 411 (418) ++ ++++..++++|++|++|+|+.+. ..+-.+ ..++ ..+..+.+... T Consensus 357 ~r-------~~~~~~~~~~G~lT~NE~R~~~g-l~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~ 428 (518) T protein:vir:78 357 AK-------SESTQKMVNSGVATPNEGREIMG-LPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVAS 428 (518) T ss_pred HH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCCceeeecccceecccccccccCCCCCCCCCCCCcccccc Confidence 54 77788999999999999998763 111100 0010 00000000000 Q ss_pred ----c-cccccC Q lcl|NC_019404. 412 ----E-TEVVIA 418 (418) Q Consensus 412 ----e-~e~~~~ 418 (418) + .+.+-+ T Consensus 429 ~~~~~~~~~~~~ 440 (518) T protein:vir:78 429 LDQSPPASVPGL 440 (518) T ss_pred cccCccccCCCC Confidence 0 000000 No 32 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=99.80 E-value=5.8e-20 Score=126.03 Aligned_cols=354 Identities=13% Similarity=0.059 Sum_probs=200.6 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCchH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMTQ 80 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~~ 80 (418) ...-..+...+.++.. ..+.+.. .+.+++.+.+||+.+|+++-+-.+.++.+. ...+..+-...--+. T Consensus 15 ~~~~~~~~~~~~~~~~-------~~~v~~~----~al~~~~V~~~v~~ia~~ia~~p~~~~~~~-~~~l~~~PN~~~s~~ 82 (397) T protein:vir:38 15 SLNDPDWVNFLTGGEA-------QKYVSAD----TALKNSDIFSLIMQLSGDLAMVRYTSESDR-SQSIISNPSVTANGY 82 (397) T ss_pred cCCchhhhhhhcCCcC-------CceechH----HhhccHHHHHHHHHHHHHHhhCcccccccH-HHHHHhcCCCCCCHH Confidence 1111112222111110 1112222 124588899999999999987777655332 223333333333445 Q ss_pred HHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCC--ccc Q lcl|NC_019404. 81 NINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNE--SDM 157 (418) Q Consensus 81 ~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~--~~~ 157 (418) .|.+.+.+ -.++|.|++++.- + ..|.+..+.++++..+++.... .|....|+++... .+. T Consensus 83 ~f~~~~~~~lll~Gna~~~i~r-~---------~~g~~~~l~~l~~~~v~i~~~~-------~~~~~~y~~~~~~~~~~~ 145 (397) T protein:vir:38 83 SFWQGMFAQLLLDGNCYAYRHK-N---------TNGVDLSWEYLRPSQVQPMLLQ-------DGSGLIYNINFDEPAIGY 145 (397) T ss_pred HHHHHHHHHhhhcCCEEEEEEE-C---------CCCcEEEEEEEcCceeEEEEcC-------CCceEEEEEEeccccccc Confidence 56555555 4568999988753 2 3356778889988887664321 2334566665332 233 Q ss_pred ccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchH Q lcl|NC_019404. 158 FYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGF 235 (418) Q Consensus 158 ~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~ 235 (418) ...+.++.||||.... +....+|.|++.. +...|.....+.......+.....+ +++++.. + ..+.. T Consensus 146 ~~~~~~~eiih~~~~~------~~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~---~-~~e~~ 214 (397) T protein:vir:38 146 MENVPAADVIHIRLLS------KNGGKTGISPLSA-LINEQQIKDASNELTLKALKQSVTASAVLTIQKG---G-LLDAE 214 (397) T ss_pred eeEecCccEEEecCCC------CCCccccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC---C-CHHHH Confidence 3568999999996432 2334579999975 7889999888888888888775543 4555531 1 22333 Q ss_pred HHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHH Q lcl|NC_019404. 236 GAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALE 313 (418) Q Consensus 236 ~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~ 313 (418) .+..+++.......+ .+.+++..++.+|+.++.+.. .+.+..+...+.||++.|||..+|.|... +- ++.+ ... T Consensus 215 ~~~~~~~~~~~~~~n-~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~-~~-~~~e-~~~ 290 (397) T protein:vir:38 215 TRIARSKEISKQIHN-SDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGD-QQ-SSIT-QIS 290 (397) T ss_pred HHHHHHHHHHhcccc-cCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-cc-cHHH-HHH Confidence 344445544333333 344444455678888887654 35567889999999999999988765433 22 2223 344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh---hccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHh Q lcl|NC_019404. 314 TFHKLIDRKRNAELLPILEFLIPFI---VNAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRT 390 (418) Q Consensus 314 ~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~ 390 (418) .||. ..|.|++..+-..| +.. ++.+.+..+...+.+ .++++++++++.|+++++|+|+.+.. T Consensus 291 ~~~~-------~~l~P~~~~ie~~ln~~l~~-~~~~~~~~~~~~d~~-------~~~~~~~~~~~~G~~t~nE~R~~lg~ 355 (397) T protein:vir:38 291 GQYA-------KSLNRYVQAIVGELNDKLHA-NISANIRFAIDAMGD-------QYASTISSSVKGGTIAGNQARFILQN 355 (397) T ss_pred HHHH-------HHHHHHHHHHHHHHHHhccC-hhcccccccccCCHH-------HHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 5553 24677776664443 222 233344445555544 45777888999999999999997632 Q ss_pred hcCcCCCC-------------h------hhcccccccCCCccc Q lcl|NC_019404. 391 IAPEIKIG-------------D------NDIQTEESELITETE 414 (418) Q Consensus 391 ~~~~~~~~-------------~------~~~~~~e~~~~~e~e 414 (418) .+..+-+ . ++-....++..++.| T Consensus 356 -~p~~~~d~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 356 -SGYLAKDLPDPEKEPQQAIQLIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred -CCCCCCccccccccccccccccccccCCCCCCCCCCCCCCCC Confidence 1110000 0 000111112222222 No 33 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=99.80 E-value=1.5e-19 Score=123.77 Aligned_cols=387 Identities=13% Similarity=0.063 Sum_probs=214.8 Q ss_pred CccchhhHHHH-------------------hcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc Q lcl|NC_019404. 1 MVKTDSYANIF-------------------LGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID 61 (418) Q Consensus 1 ~~~~D~~~n~~-------------------~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~ 61 (418) +-+..+-.|.. ++++............+ .+.|.+++.+++||+.+++.+-+-++.+. T Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~----~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~ 94 (466) T protein:vir:81 19 IDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLA----TQAYQANGPVFACMLVRQLVFSSVRFRWQ 94 (466) T ss_pred hhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccc----hhhhhccHHHHHHHHHHHHhhccCceEEE Confidence 22223333321 11110000000011111 34467789999999999999999988874 Q ss_pred Ccch--HH-----HHHHHHHH---hCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccc Q lcl|NC_019404. 62 GIDD--EP-----AFWSRWDD---LEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKV 130 (418) Q Consensus 62 ~~~d--~~-----~i~~~~~~---l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~ 130 (418) -.++ .. .+..-+.+ ......|.+.+. +..++|.|++++.-.+ .. ...-+..+.+..+.++++..+.+ T Consensus 95 ~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~-~g-~l~~~~~g~~~~l~~l~~~~v~~ 172 (466) T protein:vir:81 95 RLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGE-FV-RMRPDWVDVVVEERMVRGGRGEL 172 (466) T ss_pred EecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecC-cc-ccccccCcceeEEEEecCcceEE Confidence 2111 11 11111111 223344544444 4566899999875422 21 11112345677888888887766 Q ss_pred ccccccccccccCcceEEEEecCCc---ccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 131 QNREENPRNARFGKPLTYRITTNES---DMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERL 207 (418) Q Consensus 131 ~~~~~dp~s~~yg~p~~y~i~~~~~---~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~ 207 (418) .... |. + ....|+++..+. .....++++.||||.+.+.| ....+|.|++.. +.+.|.....+... T Consensus 173 ~~~~-~~----~-~~~~y~~~~~~~~~~~~~~~~~~~dviHir~~~~~-----~d~~~G~s~i~~-~~~~i~~~~a~~~~ 240 (466) T protein:vir:81 173 GGGQ-LG----W-RKVGYLYTEGGRQSGNESVGFLAEDVVHFAPIPDP-----LASYRGMSWLTP-ILREIRADQAMSKH 240 (466) T ss_pred EEcC-CC----c-eEEEEEEEecCcccccceeeeccccEEEEcCCCCc-----ccccccccHHHH-HHHHHHHHHHHHHH Confidence 5421 11 1 112344433221 12346899999999654322 234579999975 77999999899999 Q ss_pred HHHHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHH Q lcl|NC_019404. 208 ATQLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFD 283 (418) Q Consensus 208 ~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~ 283 (418) ...++...... +++++.. + +++...+.++++......-.+.+.+++..++.+|+.++.+..+ +-+...+..+ T Consensus 241 ~~~~f~ng~~p~gil~~~~~---l-~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~ 316 (466) T protein:vir:81 241 QAKFFDNGATVNLVIKHNPM---A-DPAAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGET 316 (466) T ss_pred HHHHHhcCCCcceEEecCCC---C-CHHHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHH Confidence 99988876644 4565531 2 1233334444444332222233445555556789988877654 4466778999 Q ss_pred HHhhhhcCCeeeeeccCccccccc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcc----CCceEEeC--CC Q lcl|NC_019404. 284 RIVALSGIHEIILKNKNVGGLSSS---QNTALETFHKLIDRKRNAELLPILEFLIPFI---VNA----EEWSVEFS--PL 351 (418) Q Consensus 284 ~iaaas~IP~t~L~G~s~~gl~st---ge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~----~~~~~~f~--pL 351 (418) .||.+.+||..+| |.+.+.-.+| -|...+.||.. .|.|++..+-..| +.. ..+.|+|+ +| T Consensus 317 ~Ia~~fgVPp~~l-G~~~~~~~st~sn~eq~~~~f~~~-------tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~l 388 (466) T protein:vir:81 317 RIAAAAGVPPVIV-GLSEGLAAATYSNYGQARRRLADG-------TAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPF 388 (466) T ss_pred HHHHHhCCCHHHc-ccccCCCccccccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCCcccCcceEEEecchhh Confidence 9999999998654 6554433333 35555666654 3777776664443 221 13455554 88 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhc----CcCCCCh-hhccccccc----CCCccccccC Q lcl|NC_019404. 352 DHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIA----PEIKIGD-NDIQTEESE----LITETEVVIA 418 (418) Q Consensus 352 ~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~----~~~~~~~-~~~~~~e~~----~~~e~e~~~~ 418 (418) ...|.++++++.++.++.+..++++|+ +++|+|..+..-. ..++... +.++..... ..+...+--. T Consensus 389 lr~d~~~r~~~~~~~~~~~~~~~~~g~-t~nE~r~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Gg~~ 463 (466) T protein:vir:81 389 LREDEKDAADIQKVRAETINTLITAGY-EPESVVAAVNSGDLRLLKHTGLTSVQLLPPGVSASASSDTPTSGGADD 463 (466) T ss_pred hccCHHHHHHHHHHHHHHHHHHHHcCC-ChhhccccccCCccccccCCCcchhhhcccccccccCCCCcccCCCCc Confidence 899999999999999999999999995 9999997542111 0111111 111111110 0000111111 No 34 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=99.80 E-value=2.1e-19 Score=122.97 Aligned_cols=358 Identities=13% Similarity=0.024 Sum_probs=200.8 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCchH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMTQ 80 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~~ 80 (418) ..+.+++.+..-++.......+ ...+.. .+.+++.+.++|+.+|+++-+-++.+.... .+.+..+-..+.... T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~--~~v~~~----~al~~~~v~~~i~~ia~~ia~~p~~~~~~~-~~~l~~~PN~~~t~~ 87 (386) T protein:vir:49 15 PINQESFFDIADSDFLASLNSS--EWVSAE----NALKNSDLFSIISQLSNDLATAKITTSRKQ-LQGIVDNPSNNANRF 87 (386) T ss_pred ccchhhhhhhhhccccccccCC--ceechh----hhhccHHHHHHHHHHHHHhhhCceeeccch-hhhhhhccCCCCCHH Confidence 3333333332211111000111 111222 134578889999999999999888886433 233444444444556 Q ss_pred HHHHHHHhcc-ccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEec--CCccc Q lcl|NC_019404. 81 NINDAWSWAR-LFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITT--NESDM 157 (418) Q Consensus 81 ~~~~a~~~~r-l~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~--~~~~~ 157 (418) .|.+.+.+.+ ++|.|++++.-+ ..|.+..+.++++.++++.... .+.+..|.+.. ...+. T Consensus 88 ~f~~~~~~~lll~Gna~~~i~r~----------~~g~~~~l~~i~~~~v~v~~~~-------~~~~~~y~~~~~~~~~~~ 150 (386) T protein:vir:49 88 NFYQSIFAQMLLGGEAFAYRWRN----------DNGRDMKWEYLRPSQVSFNRLD-------NQNGLYYNITFDDPHIAP 150 (386) T ss_pred HHHHHHHHHhhhcCCEEEEEEEC----------CCCcEEEEEEecCceeEEEEcC-------CCceEEEEEEEcCccccc Confidence 6766666665 579999887542 2245678888888887665432 12334555543 22334 Q ss_pred ccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchH Q lcl|NC_019404. 158 FYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGF 235 (418) Q Consensus 158 ~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~ 235 (418) ...++++.||||.... +....+|.|++.. +.+.|.....+.......++....+ ++++++. +. .+.. T Consensus 151 ~~~~~~~evih~~~~~------~~~~~~G~s~l~~-~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~---~~-~~~~ 219 (386) T protein:vir:49 151 KQHVPQNDILHFRLLS------VDGGLTSVSPLMA-LGREFNIQKASDKLTISALKNALNANGILKIKGG---GL-LDFK 219 (386) T ss_pred eeEEccccEEEecCCC------CCCccccccHHHH-HHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCC---CC-hHHH Confidence 4578999999996432 2344579999975 8899999999999999988876544 4555431 11 1111 Q ss_pred HHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHH Q lcl|NC_019404. 236 GAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALE 313 (418) Q Consensus 236 ~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~ 313 (418) ....+.+ .....+.+.+++...+.+|+.++.+... +.+..++..+.||++.+||..+|.+ +.++- ++++ ..+ T Consensus 220 ~~~~~~~---~~~~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~-~~~~~-~~~~-~~~ 293 (386) T protein:vir:49 220 TKVSRSR---QAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGG-DGDQQ-SSLE-MIY 293 (386) T ss_pred HHHHHHH---HHhccCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC-CCCcc-chHH-HHH Confidence 1222211 2222334444455556789988876643 4567889999999999999987754 33322 2333 223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh--ccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhh Q lcl|NC_019404. 314 TFHKLIDRKRNAELLPILEFLIPFIV--NAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTI 391 (418) Q Consensus 314 ~y~~~I~~~Qe~~l~p~l~~l~~~i~--~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~ 391 (418) .+|. ..++|.++.+...+- +...+.++...+...+.+++ +.....++.+|+++++|+|+.|... T Consensus 294 ~~~~-------~~i~~~l~~i~~~~~~~l~~~~~~~~~~~~~~d~~~~-------~~~~~~l~~~g~~t~nE~r~~l~~~ 359 (386) T protein:vir:49 294 NIYF-------KSVSRYLRPFVSEMSKKLSCEVDVDISPAVDPTGSNY-------ISLINSMVKSGTLAQNQGLYILQQA 359 (386) T ss_pred HHHH-------HHHHHHHHHHHHHHHHHhcchhcccchhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHHhhC Confidence 3432 234444444433331 12345555556666665544 4556788999999999999988643 Q ss_pred cCcCC-CC-hhh---cccccccCCCcc Q lcl|NC_019404. 392 APEIK-IG-DND---IQTEESELITET 413 (418) Q Consensus 392 ~~~~~-~~-~~~---~~~~e~~~~~e~ 413 (418) +-..+ +. .++ .+...-+..+++ T Consensus 360 ~~~~~~~~~~~~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:49 360 EILPKELPDGKNPNRTSLKGGEINEQD 386 (386) T ss_pred CCCCCcCcchhccCCCCCCCCCCCCCC Confidence 32111 00 011 000001111111 No 35 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=99.80 E-value=3.2e-19 Score=121.98 Aligned_cols=363 Identities=14% Similarity=0.101 Sum_probs=207.9 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc--CcchH-H----HHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID--GIDDE-P----AFWSRW 73 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~~~d~-~----~i~~~~ 73 (418) -+..+++...++|++.... ..+.+.... ..++-+.+||+.+++++-+-++.+. ..++. . .+...+ T Consensus 15 ~~~~~~~~~~~~g~~~s~~----~~~vt~~~a----l~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~~l~~lL 86 (419) T protein:vir:14 15 QMSAGGWVSALLGSSRSDS----GQVVTPASA----LALTVLQNCVTLLAESIAQLPIELYERSGEDRKPATDHPLYSIL 86 (419) T ss_pred ccCcchhhHHhhcCCCccC----CcccchHHh----hccHHHHHHHHHHHHhhccCceEEEEecCCccccccccHHHHHH Confidence 4555666666666443211 122233222 3567789999999999998888762 22221 1 122222 Q ss_pred H----HhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEE Q lcl|NC_019404. 74 D----DLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTY 148 (418) Q Consensus 74 ~----~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y 148 (418) . ..--...|.+.+.+ -.++|.|++++.- + ..|.+..+.++++..+++.... .|. ..| T Consensus 87 ~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r-~---------~~G~~~~l~pl~~~~v~v~~~~-------~~~-~~y 148 (419) T protein:vir:14 87 KYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDR-D---------SDGVIQGLYPLDNEAVTVMRGS-------DLK-PVY 148 (419) T ss_pred HhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEecCceEEEEECC-------Cce-EEE Confidence 2 12233445555444 5668999988743 2 2356778999999888764321 122 245 Q ss_pred EEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHH Q lcl|NC_019404. 149 RITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLA 226 (418) Q Consensus 149 ~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~ 226 (418) +++... .++.+.|+|+.+.+ ....+|.|++.. +.+.|.....+.......+...... ++++++.. T Consensus 149 ~~~~~~-----~~~~~~i~h~~~~~-------~dg~~G~s~i~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~ 215 (419) T protein:vir:14 149 RVRGSD-----PMPQRLVHHVRWMS-------INGYTGLSPVLL-HANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDA 215 (419) T ss_pred EEccCc-----ccchhheeEecCcC-------CCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCC Confidence 665432 35667788886533 134689999975 7788998888988888888775544 56665322 Q ss_pred HhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCcccc Q lcl|NC_019404. 227 ELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVGGL 304 (418) Q Consensus 227 ~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl 304 (418) ......+..+...++++.....-.+.+.+++..++.+|++++.+..+ +-+...+..+.||.+.|||..+| |...++- T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~l-g~~~~~t 294 (419) T protein:vir:14 216 PALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMV-NELERAT 294 (419) T ss_pred CcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-cCCCCCC Confidence 11111222233344443322222233445555556788888876543 45667788899999999999877 4443333 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---cc----CCceEEe--CCCCCCCHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019404. 305 SSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---NA----EEWSVEF--SPLDHESSKDKAEVLEKSVNSIAALIA 375 (418) Q Consensus 305 ~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~~----~~~~~~f--~pL~~~~eke~ae~~~~~a~a~~~~~~ 375 (418) .++-|...+.||.. .|.|.+..+-..|- .. .++.++| ..|...|.+++ +++++++++ T Consensus 295 ~s~~E~~~~~f~~~-------~L~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~r~d~~~~-------~~~~~~~~~ 360 (419) T protein:vir:14 295 FSNIEHQSLQFVIY-------TLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSR-------YAAYAVGRQ 360 (419) T ss_pred cccHHHHHHHHHHH-------HHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHH-------HHHHHHHHh Confidence 34456666677664 37888777644432 11 2455555 46666666655 677888999 Q ss_pred CCCCCHHHHHHHHHhhcCcCC------------CCh-hhcccccccC----CCccccccC Q lcl|NC_019404. 376 AGAMDIKEARDTLRTIAPEIK------------IGD-NDIQTEESEL----ITETEVVIA 418 (418) Q Consensus 376 ~g~i~~~e~r~~l~~~~~~~~------------~~~-~~~~~~e~~~----~~e~e~~~~ 418 (418) +|++|++|+|+.+. ..|..+ .+. +..+..++++ .+|...+.+ T Consensus 361 ~G~~T~NE~R~~~g-l~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~ 419 (419) T protein:vir:14 361 WGWLSINDIRRLEN-MPPVKGGDIYLSPMNMVDASKPQQLPVGKSEPTKAAIDEIGRILS 419 (419) T ss_pred CCCcCHHHHHHHhC-CCCCCCcCeeeeccccccccccccccCCCCCCccccccchhcccC Confidence 99999999998763 222111 010 0111111111 122222333 No 36 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=99.80 E-value=3.5e-19 Score=121.75 Aligned_cols=366 Identities=13% Similarity=0.137 Sum_probs=205.4 Q ss_pred CccchhhHHHHhcCCC------Cc--------cccCccccCCHH-HHHHHHHcCCccchhhhcchhhhccCCcccc--Cc Q lcl|NC_019404. 1 MVKTDSYANIFLGGSD------GS--------EIYGSLQNQAPT-ILASLYADNALVRRIIDTIPETALAAGFHID--GI 63 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~------~~--------~~~~~~~~~~~~-~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~~ 63 (418) |...+-+...|..... .. ...+.....+.. --...+.+++.+.++|+.+++++-+-++.+- .. T Consensus 7 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~v~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:97 7 LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAVAAMPLMMYMRTP 86 (432) T ss_pred CchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHHHHHHHHhhccCceEEEEecC Confidence 6666665555532110 00 000100000000 0112245788999999999999999888762 21 Q ss_pred ch-HHH----HHHHH----HHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccc Q lcl|NC_019404. 64 DD-EPA----FWSRW----DDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNR 133 (418) Q Consensus 64 ~d-~~~----i~~~~----~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~ 133 (418) +. ..+ +...+ ...-....|.+.+. +..++|.|++++.-++ |.+..+.+++++.+++... T Consensus 87 ~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~-----------g~~~~L~~l~p~~v~v~~~ 155 (432) T protein:vir:97 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD-----------GRIESLQYLANDRLTITTD 155 (432) T ss_pred CCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC-----------CcEEEEEEEcCcceEEEEc Confidence 11 111 11111 12223344555555 4567899998875432 3466788888888876532 Q ss_pred cccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 134 EENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLR 213 (418) Q Consensus 134 ~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~ 213 (418) . .|. ..|++...++ ....++++.|+|+.+.++ ...+|.|++.. +.+.|.....+....+.++. T Consensus 156 ~-------~g~-~~y~~~~~~g-~~~~~~~~~iih~r~~~~-------dg~~G~spi~~-~~~~i~~~~a~~~~~~~~f~ 218 (432) T protein:vir:97 156 T-------KGN-TAYRYRRTDG-QMIDIPRQQIWKIMGYSL-------DGENGLSAIRY-GAQIFGTAIAAEAQAARAFR 218 (432) T ss_pred C-------CCc-EEEEEEecCc-eEEEEccccEEEecCcCC-------CCcccccHHHH-HHHHHHHHHHHHHHHHHHHh Confidence 1 233 3556654433 235789999999965432 23579999975 77889888888888888887 Q ss_pred HcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhh Q lcl|NC_019404. 214 RKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALS 289 (418) Q Consensus 214 ~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas 289 (418) ....+ ++++++. ++ .+......+++.. ..+ .+.+++..++.+|++++.+..+ +-+..++....||.+. T Consensus 219 ng~~~~gil~~~~~---l~-~e~~~~~~~~~~~---~~n-ag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~f 290 (432) T protein:vir:97 219 NGQLQSVYYQIDRF---LT-DDQYDSFSKKVSG---SVE-AGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFF 290 (432) T ss_pred ccCCcceeEecCCC---CC-HHHHHHHHHHHhh---hhc-CCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHh Confidence 75543 6676632 21 2222233333322 222 3344455556789998887654 4456788899999999 Q ss_pred cCCeeeeeccCccccc---cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---c-c---CC--ceEEeCCCCCCCHH Q lcl|NC_019404. 290 GIHEIILKNKNVGGLS---SSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---N-A---EE--WSVEFSPLDHESSK 357 (418) Q Consensus 290 ~IP~t~L~G~s~~gl~---stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~-~---~~--~~~~f~pL~~~~ek 357 (418) +||..+| |....|-. ++-|.....|+.. -|.|+++.+-..|- . . .+ ++|+++.|...|.+ T Consensus 291 gVPp~~l-g~~~~~t~~~~s~~e~~~~~f~~~-------tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~ 362 (432) T protein:vir:97 291 GVPPSMI-GHSSAGTTSWGSGIESQQLGFLTM-------TLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSA 362 (432) T ss_pred CCCHHHc-CCcCCcccccchhHHHHHHHHHHH-------HHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHH Confidence 9998766 65433322 2224444555543 37787776644432 1 1 13 34555577777776 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC-----------hhhc-----ccccccCCCccccccC Q lcl|NC_019404. 358 DKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIG-----------DNDI-----QTEESELITETEVVIA 418 (418) Q Consensus 358 e~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~-----------~~~~-----~~~e~~~~~e~e~~~~ 418 (418) ++ ++++.+++++|++|++|+|+.+. ..+..+-+ .+.. +++.....++.+.-+. T Consensus 363 ~r-------~~~~~~~~~~G~~T~NE~R~~~g-lpp~~g~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~~~~~~~~ 431 (432) T protein:vir:97 363 AR-------SSYYSQLVNNGLMTRDEAREIEG-LPKLGGNAAVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKVS 431 (432) T ss_pred HH-------HHHHHHHHhCCCCCHHHHHHHhC-CCCCCCCcceEeecccccchhhhcccCCCCCCCCCCCccccccc Confidence 65 66788999999999999998763 22211100 0000 0001111222222233 No 37 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=99.80 E-value=6.6e-20 Score=125.74 Aligned_cols=355 Identities=12% Similarity=0.077 Sum_probs=198.5 Q ss_pred Cccc----------------------------hhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhh Q lcl|NC_019404. 1 MVKT----------------------------DSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPET 52 (418) Q Consensus 1 ~~~~----------------------------D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d 52 (418) |.+. +.+.+.+.|+.+.+. ...+.. -..+++.+.+||+.+|++ T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g-----~~v~~~----~al~~~~V~~~i~~ia~~ 71 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRESSSG-----KKVTVD----KAMKLSAVWACVRLISTS 71 (434) T ss_pred CccchhhhhhhcccccchhhhcccccccccCchHHHHHHhcCCccCC-----ceechh----hhhccHHHHHHHHHHHHh Confidence 3222 122222222111100 011111 124577888999999999 Q ss_pred hccCCcccc---CcchHH-----HHHHHH----HHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEE Q lcl|NC_019404. 53 ALAAGFHID---GIDDEP-----AFWSRW----DDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELET 119 (418) Q Consensus 53 ~~r~~~~i~---~~~d~~-----~i~~~~----~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~ 119 (418) +-+-++.+. ++.... .+...+ ...-....|.+.+. +..++|.|++++.- + .|.+.. T Consensus 72 ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~-~----------~G~~~~ 140 (434) T protein:vir:43 72 VAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRR-A----------AGRPAA 140 (434) T ss_pred hhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe-C----------CCcEEE Confidence 999888872 111111 111122 11222344555544 45678999988743 2 244668 Q ss_pred EEEeeccccccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHH Q lcl|NC_019404. 120 VRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIK 199 (418) Q Consensus 120 i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~ 199 (418) +.++++.++++.... .|.+. |.+...++ ....++++.|+||.+.++ ...+|.||+.. +.+.|. T Consensus 141 L~~l~p~~v~~~~~~-------~g~~~-y~~~~~~g-~~~~~~~~eVih~~~~~~-------dg~~G~spi~~-~~~~i~ 203 (434) T protein:vir:43 141 LDFLLPSRVDLECDE-------NGRLK-YFYTTKKG-ARREIERTNMLHIPAFTL-------DGRIGLSAIRY-GVDVFG 203 (434) T ss_pred EEEEcCcceEEEEcC-------CCeEE-EEEEecCc-eEEEEccccEEEecCcCC-------CCccccCHHHH-HHHHHH Confidence 888988888764321 23433 33433322 235799999999965431 23579999975 788898 Q ss_pred HHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHH Q lcl|NC_019404. 200 DYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GID 275 (418) Q Consensus 200 ~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~ 275 (418) ....+......++...... ++++++. ++ ++.....++.++......+ .+.+++..++.+|+.++.+.. .+. T Consensus 204 ~~~~~~~~~~~~f~ng~~~~gil~~~~~---l~-~e~~~~~r~~~~~~~g~~n-ag~~~vl~~g~~~~~l~~~~~d~q~~ 278 (434) T protein:vir:43 204 SVMSAEDAANGTFKNGLLPTVAFKVDRI---LQ-PAQREEFREYVKSVSGAMN-SGRSPVLEQGITPETIGINPVDAQLL 278 (434) T ss_pred HHHHHHHHHHHHHhccCCcceEEecCCC---CC-HHHHHHHHHHHHHhcCccc-cCCccccCCCceEEEccCChhHHHHH Confidence 8888888888888775444 5566532 22 2223333333332221222 233444455678998887765 455 Q ss_pred HHHHHHHHHHhhhhcCCeeeeeccCccc-cc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcc----CCceE Q lcl|NC_019404. 276 AFLDKKFDRIVALSGIHEIILKNKNVGG-LS-SSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VNA----EEWSV 346 (418) Q Consensus 276 ~~~~~~~~~iaaas~IP~t~L~G~s~~g-l~-stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~----~~~~~ 346 (418) +..++..+.||.+.|||..+| |....+ .. ++-+.....|+.. -|.|++..+-..+ +.+ .++.+ T Consensus 279 e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~s~~e~~~~~f~~~-------~L~P~~~~ie~~ln~kL~~~~~~~~~~~ 350 (434) T protein:vir:43 279 ETREHGVIEICRWFGVPPWMI-GQTDKGSNWGTGLEQQMLAFLTF-------SISSITNQIQQCVNKRLLTAPERIRYYA 350 (434) T ss_pred HHHHHHHHHHHHHhCCCHHHh-CCCcCCccccchHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhhhcCceE Confidence 778899999999999998766 655433 22 2224444555543 4788877764444 221 14555 Q ss_pred EeC--CCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCC----------Chhhc------------ Q lcl|NC_019404. 347 EFS--PLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKI----------GDNDI------------ 402 (418) Q Consensus 347 ~f~--pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~----------~~~~~------------ 402 (418) +|+ .|...|.+++ ++++.+++++|++|++|+|+.+. ..+..+- ..+.+ T Consensus 351 ~fd~~~llr~d~~~r-------~~~~~~~~~~G~~T~NE~R~~~g-l~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~ 422 (434) T protein:vir:43 351 EFSLEGFLKADSAGR-------AAWYSTMAQNGFMTRNEGRRKEN-LPELPGGDILTVQSNLVPIDQLGQSNKSQAVRAA 422 (434) T ss_pred EEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeEeeccCccchhhhhccCCCcchhhh Confidence 554 7777777664 77888899999999999998753 2221110 00111 Q ss_pred -ccccccCCCccc Q lcl|NC_019404. 403 -QTEESELITETE 414 (418) Q Consensus 403 -~~~e~~~~~e~e 414 (418) +...++++. +| T Consensus 423 ~~~~~~~~~~-~~ 434 (434) T protein:vir:43 423 LMNWFSQPEP-QE 434 (434) T ss_pred hhccCCCCCC-CC Confidence 111111221 22 No 38 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=99.79 E-value=2.6e-19 Score=122.46 Aligned_cols=362 Identities=11% Similarity=0.017 Sum_probs=199.1 Q ss_pred chhhHHHHhcCCCCccc---cCc----------cccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC-cchHH-- Q lcl|NC_019404. 4 TDSYANIFLGGSDGSEI---YGS----------LQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG-IDDEP-- 67 (418) Q Consensus 4 ~D~~~n~~~g~~~~~~~---~~~----------~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~-~~d~~-- 67 (418) |==|.++|.+....... .+. ....+. .-+.+++.+.++|+.+|+++-+-++.+.- .++.. T Consensus 1 Mgl~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~----~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~ 76 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLTKISGIPSPAEDWAMHGDRPGA----NSAMTLGAFYACVTLLADTVASLSIDAYRKKDNVRIP 76 (409) T ss_pred CchhhhhhcCCCcccccccccccccccchhhccCcccch----hhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcccc Confidence 11122222211111000 000 011111 12245788999999999999998887632 11111 Q ss_pred --HHHHHHH----HhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccccccccc Q lcl|NC_019404. 68 --AFWSRWD----DLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNA 140 (418) Q Consensus 68 --~i~~~~~----~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~ 140 (418) .+...+. ..--+..|.+.+. +..++|.|++++..++ ..|.+..|.++++.++.+.... |. T Consensus 77 ~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~---------~~g~~~~L~~l~p~~v~v~~~~-~~--- 143 (409) T protein:vir:84 77 VSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARD---------EANRPTAIMPIHPDCIHVTDAK-DE--- 143 (409) T ss_pred cchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEEC---------CCCceEEEEEEcCceeEEEEcC-CC--- Confidence 1111221 1222344555555 5567899999886543 3366788889888877654321 11 Q ss_pred ccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc-- Q lcl|NC_019404. 141 RFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA-- 218 (418) Q Consensus 141 ~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~-- 218 (418) .+. ..|.+... .+..++++.|||+.+.. +...++|.|++.. +.+.|.....+.......+.....+ T Consensus 144 -~~~-~~~~~~~~---~g~~~~~~dvih~~~~~------~~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~f~ng~~p~g 211 (409) T protein:vir:84 144 -DGD-WIEPVYRI---DGKVVPNHRIMHIKRYP------VAGCALGMSPIEK-AASAIGLGLAAERYGLRWFRDSANPSG 211 (409) T ss_pred -cce-EEEEEecC---CceEEchhhEEEecCCC------CCcccccccHHHH-HHHHHHHHHHHHHHHHHHHhcCCCccE Confidence 111 11111111 12468999999996543 2334579999975 7888999988888888888775543 Q ss_pred eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeee Q lcl|NC_019404. 219 VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIIL 296 (418) Q Consensus 219 v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L 296 (418) ++++++ . + +++...+..+++... ..+ .+.+++..++.+|+.++.+.. .+-+...+..+.||.+.+||..+| T Consensus 212 il~~~~--~-l-~~e~~~~~~~~~~~~--~~n-~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 284 (409) T protein:vir:84 212 ILSSDA--D-L-TPDQVKQTQKQWIQS--HHN-RRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMI 284 (409) T ss_pred EEecCC--C-C-CHHHHHHHHHHHHHH--hcc-CCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 445543 1 1 222333444444332 233 344455555678888887664 345566788899999999999766 Q ss_pred eccC-ccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--cCC--ceEEeCCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_019404. 297 KNKN-VGGLS-SSQNTALETFHKLIDRKRNAELLPILEFLIPFIVN--AEE--WSVEFSPLDHESSKDKAEVLEKSVNSI 370 (418) Q Consensus 297 ~G~s-~~gl~-stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~--~~~--~~~~f~pL~~~~eke~ae~~~~~a~a~ 370 (418) |.. .+... ++-+....+|+.. -|.|+++.+-..+-+ ..+ ++|+++.|...|.+++ ++++ T Consensus 285 -g~~~~~~~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~l~~~L~~g~~i~fd~~~l~~~d~~~~-------~~~~ 349 (409) T protein:vir:84 285 -GDVEKSTSWGTGIEEQGINFVRH-------TLLPWLRCIEQALDTFLPRGQFVKFNVDGLMRGDVTAR-------FTAY 349 (409) T ss_pred -CCCCCcccccchHHHHHHHHHHH-------HHHHHHHHHHHHHHHhccCCCeEEEechhhhccCHHHH-------HHHH Confidence 543 33332 3234455566543 367777666554421 234 4555667877777665 6788 Q ss_pred HHHHhCCCCCHHHHHHHHHhhcCcCCCC----------hhhcc----cccccCCCcccccc Q lcl|NC_019404. 371 AALIAAGAMDIKEARDTLRTIAPEIKIG----------DNDIQ----TEESELITETEVVI 417 (418) Q Consensus 371 ~~~~~~g~i~~~e~r~~l~~~~~~~~~~----------~~~~~----~~e~~~~~e~e~~~ 417 (418) .+++++|++|++|+|+.+. ..+..+-+ .++.+ ..++++.+++++-= T Consensus 350 ~~~~~~G~~t~NE~R~~~g-~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 350 QMGLQNGIWSVNEVRAWED-APPIPEGDIHLQPMNFVPLGYVPPEEPAQEPQPNSATEGNK 409 (409) T ss_pred HHHHhCCCcCHHHHHHHhC-CCCCCCcceeeecccccccccCCccccCcCCCCCCccCCCC Confidence 8999999999999998763 22211100 01111 11111111111111 No 39 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.79 E-value=2.9e-19 Score=122.23 Aligned_cols=382 Identities=15% Similarity=0.169 Sum_probs=196.8 Q ss_pred Cccchh-------hHHH-------------------HhcCCCCcc-ccCccccCCHHHHH---HHHHcCCccchhhhcch Q lcl|NC_019404. 1 MVKTDS-------YANI-------------------FLGGSDGSE-IYGSLQNQAPTILA---SLYADNALVRRIIDTIP 50 (418) Q Consensus 1 ~~~~D~-------~~n~-------------------~~g~~~~~~-~~~~~~~~~~~~l~---~~Y~~~~~~r~iVd~~a 50 (418) +.+.+. +.|. .+|.-.... +...+...+++++. ..|..++++++||++.+ T Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~~~~~~npiv~~~I~~~a 97 (547) T protein:vir:63 18 VKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVLKKFGGNIILNAIINTRS 97 (547) T ss_pred ccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHHHHhhcCHHHHHHHHHHH Confidence 111111 1111 111111111 11123334666654 46999999999999998 Q ss_pred hhhccC-----------CccccC--------cchH---HHHHHHHHHhCc--------hHHHHHHH-HhccccceEEEEE Q lcl|NC_019404. 51 ETALAA-----------GFHIDG--------IDDE---PAFWSRWDDLEM--------TQNINDAW-SWARLFGGAAIVA 99 (418) Q Consensus 51 ~d~~r~-----------~~~i~~--------~~d~---~~i~~~~~~l~~--------~~~~~~a~-~~~rl~G~~~i~i 99 (418) +...+- +|+|.- ..+. ..++.-+.+.+. +..|.+.+ ....++|.+++.+ T Consensus 98 ~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~~~lv~d~ll~Gn~~~~i 177 (547) T protein:vir:63 98 NQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEK 177 (547) T ss_pred HHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHHHHHHHHHHhhCCEEEEE Confidence 866531 233311 0111 123333444432 22344444 4456789888776 Q ss_pred eecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhh Q lcl|NC_019404. 100 IVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMR 179 (418) Q Consensus 100 ~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~ 179 (418) .- + ..|.+..|.++++..|.+.... |-.-+ ..+..|..... +.....++++.||||.+.++.. T Consensus 178 ~r-d---------~~G~~~~L~~l~p~~V~~~~~~-~g~~~--~~~~~y~~~~~-~~~~~~~~~~eiih~r~n~~~~--- 240 (547) T protein:vir:63 178 VF-N---------RNQSMVRFVAKDPTTIFFATTA-DGKIP--DNGNRFVQVID-QKIVATFNAREMAFAVRNPRSD--- 240 (547) T ss_pred EE-C---------CCCcEEEEEEecCceeEEEECC-ccccc--cCceEEEEEcC-CcEEEEeccccEEEecccCCCC--- Confidence 43 2 3456788999998887664211 10000 11122322222 2233468899999998766433 Q ss_pred hccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCc-ceeE Q lcl|NC_019404. 180 RQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVG-QAIG 256 (418) Q Consensus 180 ~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~-~~~~ 256 (418) ....++|.||+.. +.+.|.....+.......+...... ++++++-.. + +.+....+++.+......-.+. ...+ T Consensus 241 ~~~~~~G~Spi~~-~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~-l-s~e~~~~lk~~~~~~~~G~~nagk~~v 317 (547) T protein:vir:63 241 IYATGYGYPELEI-ALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQ-Q-SQHALEIFKREWKNSLSGINGSWQIPV 317 (547) T ss_pred cccccccccHHHH-HHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCC-C-CHHHHHHHHHHHHHHhcCccccccccc Confidence 2334679999975 7788999988888888888775543 345443111 1 1222333444443322221222 3345 Q ss_pred EEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCccc--------cc-cchhHHHHHHHHHHHHHHHH Q lcl|NC_019404. 257 IDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVGG--------LS-SSQNTALETFHKLIDRKRNA 325 (418) Q Consensus 257 ~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~g--------l~-stge~d~~~y~~~I~~~Qe~ 325 (418) +.+++-+|+.++.+..+ +-+...+..+.||.+.+||..+|.-...++ ++ |+-+.....||. . T Consensus 318 l~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~-------~ 390 (547) T protein:vir:63 318 VSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKN-------K 390 (547) T ss_pred ccCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHH-------H Confidence 54555567777665543 345567788999999999997663222211 11 122333333433 3 Q ss_pred HHHHHHHHHHHHh---hcc---CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCCh Q lcl|NC_019404. 326 ELLPILEFLIPFI---VNA---EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGD 399 (418) Q Consensus 326 ~l~p~l~~l~~~i---~~~---~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~ 399 (418) .|.|++.++-..| +.. .++.|+|+.+...++.++++ +..++.+|++|++|+|+.+. ..+..+-+| T Consensus 391 tL~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~~~~~~~~~~~--------~~~~~~~g~lT~NE~R~~~g-l~P~~egGD 461 (547) T protein:vir:63 391 GLQPLLGFIEDFINKHIVAEFGDKYTFQFVGGDIKSELESVK--------ILAEKAKVAMTVNEVRKELN-LPGDVIGGD 461 (547) T ss_pred HHHHHHHHHHHHHHhhcccccCCceEEEeeccccccHHHHHH--------HHHHHhCCCcCHHHHHHHhC-CCCCCCCCc Confidence 4777776664443 222 46889999888777665543 23466789999999998763 222111111 Q ss_pred hh------------ccc--cccc------------------CCCccccccC Q lcl|NC_019404. 400 ND------------IQT--EESE------------------LITETEVVIA 418 (418) Q Consensus 400 ~~------------~~~--~e~~------------------~~~e~e~~~~ 418 (418) .- ... .+.+ ++++.+-.-+ T Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (547) T protein:vir:63 462 IPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGK 512 (547) T ss_pred eeecccccccccccccccCCccccchhhccccccccCCCCCCCCCCCCCCc Confidence 00 000 0000 1111111111 No 40 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.79 E-value=1.6e-19 Score=123.57 Aligned_cols=348 Identities=12% Similarity=0.163 Sum_probs=187.6 Q ss_pred HHHHHHcCCccchhhhcchhhhccCCccccCc---c--h--HHHHH---HHHHHhC--------------chHHHHHHHH Q lcl|NC_019404. 32 LASLYADNALVRRIIDTIPETALAAGFHIDGI---D--D--EPAFW---SRWDDLE--------------MTQNINDAWS 87 (418) Q Consensus 32 l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~---~--d--~~~i~---~~~~~l~--------------~~~~~~~a~~ 87 (418) |..+-..|+.+++||+..+++..+-++.+.-. + + ...++ ..+.... ...-++..+. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 77777789999999999999999999887411 1 1 11111 1121111 1133445566 Q ss_pred hccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccccc-----------------ccccccC---cceE Q lcl|NC_019404. 88 WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREEN-----------------PRNARFG---KPLT 147 (418) Q Consensus 88 ~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~d-----------------p~s~~yg---~p~~ 147 (418) +..++|.|++++.- + ..|.+..+.++++..+++...... +...+.. .+.+ T Consensus 81 ~l~l~Gn~~i~~~r-~---------~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 150 (467) T protein:vir:31 81 DYEAIGWLTIEILT-Q---------TDGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVF 150 (467) T ss_pred HHHhcCCeEEEEEE-C---------CCCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceeeee Confidence 67779999998753 2 234566777777777655321110 0100000 1112 Q ss_pred EEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchH Q lcl|NC_019404. 148 YRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGL 225 (418) Q Consensus 148 y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l 225 (418) +.......+....+.++.||||... .+....+|.|++.. +.+.|.....+......++...... ++++++ T Consensus 151 ~~~~~~~~~~~~~~~~~diih~r~~------~~~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~- 222 (467) T protein:vir:31 151 VDADDGSTGTSVSNPANELIFKRNH------SPLYPHYGAPDIIP-AVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKG- 222 (467) T ss_pred eeeccccccceeEeccccEEEecCC------CCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEecC- Confidence 2222222334456888999999532 23345689999975 7788888888777777776654432 344432 Q ss_pred HHhhcCcchHHHHHHHHHHHHH-----------hcCCcceeEEEcCCCceeEeeccc----------CCHHHHHHHHHHH Q lcl|NC_019404. 226 AELCDDSEGFGAARLRLAQVDN-----------NSGVGQAIGIDAESEEYSVLNSDI----------GGIDAFLDKKFDR 284 (418) Q Consensus 226 ~~~~~~~~~~~~~~~r~~~~~~-----------~~~~~~~~~~d~~~e~~~~~~~~~----------~gl~~~~~~~~~~ 284 (418) ..+ +.+....+.+.+..... ...+.+..++...+.++..+.+.+ +...+........ T Consensus 223 -~~l-~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~ 300 (467) T protein:vir:31 223 -AEL-TEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHD 300 (467) T ss_pred -cCC-CHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHH Confidence 111 22233333333322111 001222223333334444433322 1234566778888 Q ss_pred HhhhhcCCeeeeeccC-ccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hc-----cCCc--eEEeCCCCC Q lcl|NC_019404. 285 IVALSGIHEIILKNKN-VGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VN-----AEEW--SVEFSPLDH 353 (418) Q Consensus 285 iaaas~IP~t~L~G~s-~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~-----~~~~--~~~f~pL~~ 353 (418) ||++.|||..+| |.. .+..+++-+.....|+.. .|.|++..+-..| +. ..++ +|.+..|.. T Consensus 301 Ia~~fgVpp~~l-G~~~~~~~~s~~e~~~~~f~~~-------~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~ 372 (467) T protein:vir:31 301 ILKVHDVPPVIA-GVVESGAFSTDAEEQRKEFAEE-------TIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDT 372 (467) T ss_pred HHHHhCCCHHHc-ccCCCCCcccCHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhc Confidence 999999998765 665 444444445566666543 3677766654443 11 1344 455568877 Q ss_pred CCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhhc-----------------cccccc----CCCc Q lcl|NC_019404. 354 ESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDNDI-----------------QTEESE----LITE 412 (418) Q Consensus 354 ~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~~-----------------~~~e~~----~~~e 412 (418) .+.+++ +++...++++|++|++|+|+.+. ..| +.++++ ...+++ .+.+ T Consensus 373 ~d~~~~-------~~~~~~~~~~G~~T~NE~R~~~G-l~p---i~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (467) T protein:vir:31 373 KLQDVE-------IASQRVQAMQGLLTVNELRDEFG-FEP---FPEEHVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDR 441 (467) T ss_pred cCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCC---CCcccccCCcccccccccccCCCCcccCcCCCCCCCc Confidence 787665 55677899999999999998763 111 111000 011111 1111 Q ss_pred cccccC Q lcl|NC_019404. 413 TEVVIA 418 (418) Q Consensus 413 ~e~~~~ 418 (418) .|-++. T Consensus 442 ~~~~~~ 447 (467) T protein:vir:31 442 ADEIID 447 (467) T ss_pred ccchHh Confidence 111111 No 41 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=99.78 E-value=3.3e-19 Score=121.90 Aligned_cols=366 Identities=11% Similarity=0.084 Sum_probs=204.0 Q ss_pred CccchhhHHHHhcCCCCcc------------ccCc------cccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSE------------IYGS------LQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID- 61 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~------------~~~~------~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~- 61 (418) |+ +.||....+ .+++ ............|..++.+++||+.++++.-+-++.+. T Consensus 1 ~~--------~~~~~~~~~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~ 72 (518) T protein:vir:10 1 ML--------LANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) T ss_pred Cc--------ccCceeecCchhhhhhhhhhcccccccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEE Confidence 11 123222111 1111 11111122334577899999999999999988777762 Q ss_pred --CcchHHHHHHHHHHh-------CchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccc Q lcl|NC_019404. 62 --GIDDEPAFWSRWDDL-------EMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ 131 (418) Q Consensus 62 --~~~d~~~i~~~~~~l-------~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~ 131 (418) ++.........+..| -....|.+.+. +-.++|.|++++.- + ..|.+..|.++++..+++. T Consensus 73 ~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r-~---------~~G~~~~L~~l~p~~v~v~ 142 (518) T protein:vir:10 73 TSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQK-N---------KSGTPEKLMPMHPSRVAIK 142 (518) T ss_pred EcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEECCCceEEE Confidence 111111111111222 22334555555 45578999988753 2 2356778889998888765 Q ss_pred cccccccccccCcceEEEEecCCc--ccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 132 NREENPRNARFGKPLTYRITTNES--DMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLAT 209 (418) Q Consensus 132 ~~~~dp~s~~yg~p~~y~i~~~~~--~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~ 209 (418) ... .+....|.++.... .....+.++.||||.+.. +....+|.||+.. +.+.|.....+..... T Consensus 143 ~~~-------~~~~~~y~~~~~~~~~~~~~~~~~~eViHir~~s------~dg~~~G~spi~~-a~~~i~~~~a~~~~~~ 208 (518) T protein:vir:10 143 RNS-------RTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFN------PDGLERGLSLMES-LKSTIFSEDSSRNATA 208 (518) T ss_pred EcC-------CCCEEEEEEEecCCccceEEEecCCcEEEecCCC------CCcccccccHHHH-HHHHHHHHHHHHHHHH Confidence 432 11223455554322 222467889999996432 2233469999974 7889999999998988 Q ss_pred HHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHH Q lcl|NC_019404. 210 QLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRI 285 (418) Q Consensus 210 ~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~i 285 (418) .++...... ++++++. ++ .+...+++++++.......+.+.+++..++.+|+.++.+..+ +-+...+..+.| T Consensus 209 ~~f~ng~~p~gil~~~~~---ls-~e~~~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eI 284 (518) T protein:vir:10 209 AMWKNAGRPNLVLRHEKR---LS-EAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEV 284 (518) T ss_pred HHHhcCCCccEEEecCCC---CC-HHHHHHHHHHHHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHH Confidence 888776654 5565531 22 222334444444433322233445555566788888876543 455667888999 Q ss_pred hhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcc---CCceEEe--CCCCCCCHH Q lcl|NC_019404. 286 VALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VNA---EEWSVEF--SPLDHESSK 357 (418) Q Consensus 286 aaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~---~~~~~~f--~pL~~~~ek 357 (418) |.+.+||..+| |...++-.++.+.....||.. -|.|++..+-..+ +.. .++.++| ..|...|.+ T Consensus 285 a~afgVPp~~l-g~~~~~t~sn~eq~~~~f~~~-------tL~P~l~~ie~~ln~~L~~~~~~~~~~~fd~~~llr~D~~ 356 (518) T protein:vir:10 285 CGVYDIAPPIV-HILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWE 356 (518) T ss_pred HHHhCCCHHHh-ccCCCCCchhHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcccccCCceEEEechhhhccCHH Confidence 99999998766 655433334456666677654 3677766664433 211 2445555 477777766 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCC-CChh----------------hcccccccCC---C------ Q lcl|NC_019404. 358 DKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIK-IGDN----------------DIQTEESELI---T------ 411 (418) Q Consensus 358 e~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~-~~~~----------------~~~~~e~~~~---~------ 411 (418) ++ ++++.+++++|++|++|+|+.+. ..+..+ ..++ ..+..+.+.. . T Consensus 357 ~r-------~~~~~~~~~~G~lT~NE~R~~~G-l~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~ 428 (518) T protein:vir:10 357 AK-------SESTQKMVNSGVATPNEGREIMG-LPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVAS 428 (518) T ss_pred HH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCCCeeeecccceecccccccccCCCCCCCCCCCCcccccc Confidence 54 77788999999999999998763 111100 0010 0000000000 0 Q ss_pred -----ccccccC Q lcl|NC_019404. 412 -----ETEVVIA 418 (418) Q Consensus 412 -----e~e~~~~ 418 (418) ..+..-+ T Consensus 429 ~~~~~~~~~~~~ 440 (518) T protein:vir:10 429 LDQSPPTSVPGL 440 (518) T ss_pred ccccccccCCCC Confidence 0000000 No 42 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=99.78 E-value=1.4e-18 Score=118.55 Aligned_cols=380 Identities=16% Similarity=0.106 Sum_probs=200.4 Q ss_pred CccchhhHHHHhcCCCC-------ccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDG-------SEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRW 73 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~-------~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~ 73 (418) +..+.-+.-...-..+. ...+..+ .++..|.++|..++++++||+..++++.+-++.+...+.... .-.- T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pp--~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~~i~~~~~~~~-~~lp 84 (540) T protein:vir:41 8 IKSLEKYRAIKGDTDSQALKEDRFEEYVEPK--VHPLVLLSLLQVNPYHASACSIKANDILRTGYLIDGDDGGVE-ELLR 84 (540) T ss_pred hhhccchhhhhccccccccccCCCCccccCC--CCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCceEecCccchh-hhcc Confidence 33333332222111111 1112222 467889999999999999999999999999999876543211 0000 Q ss_pred HHh-CchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccc------ccc----ccccc Q lcl|NC_019404. 74 DDL-EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNRE------ENP----RNARF 142 (418) Q Consensus 74 ~~l-~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~------~dp----~s~~y 142 (418) ... ...+-+...+..-.++|.|++++.-+ ..|.+..|.++++..+++.... .|. ....| T Consensus 85 N~~~t~~~f~~~~v~dlll~Gnayv~i~r~----------~~G~~~~L~~i~~~~V~v~~~~~~~~~~~d~~~~~~~~~~ 154 (540) T protein:vir:41 85 ACRPSFEFILLQALEDLQVFNYCTLEVVRD----------DQGEPVRLDYIPAHTVRVHRDGSRYMQTWDGIHVTYFKDY 154 (540) T ss_pred CCCCCHHHHHHHHHHHHHhcCCeEEEEEEC----------CCCcEEEEEEeCCcceEEeEcCceeEeeecCceeeeeecc Confidence 111 12233333344566789999987542 2356778888888887653211 011 01112 Q ss_pred CcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--ee Q lcl|NC_019404. 143 GKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VW 220 (418) Q Consensus 143 g~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~ 220 (418) +.+..+. ..++.....+.++.||||.... +....+|.||+.. +...+.....+......+++..... ++ T Consensus 155 ~~~~~~~--~~~g~~~~~~~~~eViHir~~~------~~~~~~G~Spi~~-~~~~i~~~~~~~~~~~~~f~Ng~~p~giL 225 (540) T protein:vir:41 155 RYEGEVN--PDNGEDQDGVGANEIIFIHLPS------PICSYYGVPRYLS-AAPSILAMQKIDEYNYAFFDNYTIPSYVI 225 (540) T ss_pred cccceee--ccccccceeecccceEEecCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEE Confidence 2221111 1122233467888999995331 3345689999975 7778888888888888777665443 45 Q ss_pred ecchHH--HhhcCcchHHHHHHHHHHHHHh-----cCCcceeEEEc------CCCceeEeecccCC--HHHHHHHHHHHH Q lcl|NC_019404. 221 KAKGLA--ELCDDSEGFGAARLRLAQVDNN-----SGVGQAIGIDA------ESEEYSVLNSDIGG--IDAFLDKKFDRI 285 (418) Q Consensus 221 k~~~l~--~~~~~~~~~~~~~~r~~~~~~~-----~~~~~~~~~d~------~~e~~~~~~~~~~g--l~~~~~~~~~~i 285 (418) ++++.. ......+......+++...... ..+.+..++.. ++-+|..++.+..+ +-+......+.| T Consensus 226 ~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eI 305 (540) T protein:vir:41 226 TVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDI 305 (540) T ss_pred EeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHHHHHHH Confidence 554311 1111112222333444333221 12333344432 22356666554433 446677888999 Q ss_pred hhhhcCCeeeeeccCc-cccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hc--cCCceEEeCCCCCCCHH Q lcl|NC_019404. 286 VALSGIHEIILKNKNV-GGLS-SSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VN--AEEWSVEFSPLDHESSK 357 (418) Q Consensus 286 aaas~IP~t~L~G~s~-~gl~-stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~--~~~~~~~f~pL~~~~ek 357 (418) |++.+||..+| |... ++.| |+-+.....||.. .|.|.++.+-..| .. ..++.|+|+.-.-+. T Consensus 306 a~afgVPp~~l-G~~~~~~~n~sn~eq~~~~f~~~-------tL~P~~~~ie~~ln~~L~~~~~~~~~i~f~~~~ll~-- 375 (540) T protein:vir:41 306 AAAHMIDPYRL-GITDVGPLGGNFAEVARRTYYES-------VVRPQQEIVSSVLTDFIQLKLDPGARFVFNEEILME-- 375 (540) T ss_pred HHHhCCCHHHc-CcccCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhhhccCCceEEEecchhhcc-- Confidence 99999998766 7653 3333 4456666677654 3667666654433 11 246788887532222 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcC-------CCChhhccccc----ccCCCccccccC Q lcl|NC_019404. 358 DKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEI-------KIGDNDIQTEE----SELITETEVVIA 418 (418) Q Consensus 358 e~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~-------~~~~~~~~~~e----~~~~~e~e~~~~ 418 (418) .|. +..+.+++++|++|++|+|+.|....+.. +....++...+ .....+.+...+ T Consensus 376 --~D~----~~~~~~lv~~G~lT~NE~Re~L~g~e~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~ 441 (540) T protein:vir:41 376 --SEF----VHNYALLVQCGVLTPSEVREKLFGLDGGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYA 441 (540) T ss_pred --hHH----HHHHHHHHhCCCCCHHHHHHHhCcCcCCCcccccccccccccccccccccCCCCccccccccc Confidence 221 23356789999999999997662211100 01111111110 011111122222 No 43 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=99.78 E-value=1.9e-18 Score=117.76 Aligned_cols=366 Identities=14% Similarity=0.148 Sum_probs=206.7 Q ss_pred CccchhhHHHHhcCCC------Cccc--------cCccccCCHHH-HHHHHHcCCccchhhhcchhhhccCCccc--cCc Q lcl|NC_019404. 1 MVKTDSYANIFLGGSD------GSEI--------YGSLQNQAPTI-LASLYADNALVRRIIDTIPETALAAGFHI--DGI 63 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~------~~~~--------~~~~~~~~~~~-l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i--~~~ 63 (418) |...|-+...|..... .... .+.....+... -...+.+++.+.++|+.++++.-+-++.+ +.. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:10 7 LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTP 86 (432) T ss_pred cchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhhhhCceeEEEecC Confidence 7777777776643211 0000 01010100000 11224578999999999999999988876 222 Q ss_pred ch-HHH--------HHHHHHHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccc Q lcl|NC_019404. 64 DD-EPA--------FWSRWDDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNR 133 (418) Q Consensus 64 ~d-~~~--------i~~~~~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~ 133 (418) +. ..+ +..+-...-....|.+.+. +..++|.|++++.-++ |.+..+.++++.++++... T Consensus 87 ~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~-----------g~~~~L~~l~~~~v~v~~~ 155 (432) T protein:vir:10 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD-----------GRIESLQYLANDRLTITTD 155 (432) T ss_pred CCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC-----------CcEEEEEEEcCCceEEEEc Confidence 21 111 1111122223344555444 4577899998875432 4466888888888876532 Q ss_pred cccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 134 EENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLR 213 (418) Q Consensus 134 ~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~ 213 (418) . .|. ..|++...++ ....++++.|+|+.+.++ +..+|.|++.. +.+.|.....+......++. T Consensus 156 ~-------~g~-~~y~~~~~~g-~~~~~~~~~iih~~~~~~-------dg~~G~spi~~-~~~~i~~~~~~~~~~~~~f~ 218 (432) T protein:vir:10 156 T-------KGN-TAYRYRRTDG-QMIDIPKQQIWKIMGYSL-------DGENGLSAIRY-GAQIFGTAIAAEAQAARAFR 218 (432) T ss_pred C-------CCc-EEEEEEecCc-eEEEEcCccEEEecCCCC-------CCcccccHHHH-HHHHHHHHHHHHHHHHHHHh Confidence 1 233 3555654333 235789999999965432 23569999975 77888888888888888776 Q ss_pred HcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhh Q lcl|NC_019404. 214 RKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALS 289 (418) Q Consensus 214 ~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas 289 (418) ..... ++++++. ++ .+..++..+++.. ..+.++.+++ .++.+|++++.+..+ +-+..++....||.+. T Consensus 219 ng~~~~gil~~~~~---l~-~e~~~~~~~~~~~---~~nag~~~vl-~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~af 290 (432) T protein:vir:10 219 NGQLQSVYYQIDRF---LT-DDQYDSFAKKVSG---SVEAGRAPLL-EGGMDVKSLGLNPVDAQLLQSRQYSVESICRFF 290 (432) T ss_pred cCCCcceEEecCCC---CC-HHHHHHHHHHHhh---hhhCCCceec-CCCceEEEccCChHHHHHHHHHHHHHHHHHHHh Confidence 54433 5565531 21 2223333333332 2223344444 455789888887654 4456788999999999 Q ss_pred cCCeeeeeccCccccc---cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---c-c---CCceEEe--CCCCCCCHH Q lcl|NC_019404. 290 GIHEIILKNKNVGGLS---SSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---N-A---EEWSVEF--SPLDHESSK 357 (418) Q Consensus 290 ~IP~t~L~G~s~~gl~---stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~-~---~~~~~~f--~pL~~~~ek 357 (418) +||..+| |....|-. ++-|.....|+.. -|.|.++.+-..|- . . .++.|+| +.|...|.+ T Consensus 291 gVPp~~l-g~~~~~t~~~~sn~e~~~~~f~~~-------tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~ 362 (432) T protein:vir:10 291 GVPPSMI-GHSSAGTTSWGSGIESQQLGFLSM-------TLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSA 362 (432) T ss_pred CCCHHHc-CCccCCcccccchHHHHHHHHHHH-------HHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHH Confidence 9999765 65543322 3234455566543 37787766644432 1 1 2345555 477777776 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC-----------hhhc-----ccccccCCCccccccC Q lcl|NC_019404. 358 DKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIG-----------DNDI-----QTEESELITETEVVIA 418 (418) Q Consensus 358 e~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~-----------~~~~-----~~~e~~~~~e~e~~~~ 418 (418) ++ ++++++++++|++|++|+|+.+. ..+..+.+ .+.. +++....+++++.... T Consensus 363 ~r-------~~~~~~~~~~G~~T~NE~R~~~g-lppi~g~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~~~~~~~~ 431 (432) T protein:vir:10 363 AR-------SSYYSQLVNNGLMTRDEAREIEG-LPKLGGNAAVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKVS 431 (432) T ss_pred HH-------HHHHHHHHhCCCCCHHHHHHHhC-CCCCCCCcceEeecCcccchhhhcccCCCCCCCCCCCccccccc Confidence 65 66788899999999999998763 22111100 0110 0111112222333333 No 44 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=99.77 E-value=1.1e-18 Score=119.01 Aligned_cols=366 Identities=10% Similarity=0.050 Sum_probs=198.5 Q ss_pred Ccc-chhhHHHHhcCC-CCccccCccccCCHH-HHHHHHHcCCccchhhhcchhhhccCCccccCcch--HHH----HHH Q lcl|NC_019404. 1 MVK-TDSYANIFLGGS-DGSEIYGSLQNQAPT-ILASLYADNALVRRIIDTIPETALAAGFHIDGIDD--EPA----FWS 71 (418) Q Consensus 1 ~~~-~D~~~n~~~g~~-~~~~~~~~~~~~~~~-~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d--~~~----i~~ 71 (418) ..| ..++.|...+.. +....+......++. --...+.+++.+.++|+.+|+++-+-++.+.-..+ ... +.. T Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~~~~~~~lL~~ 89 (412) T protein:vir:26 10 VTRIKKKLIDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVVNTEVSDLLTV 89 (412) T ss_pred hhhhhhhHhhhhhcccccccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeeccccccchHHHHHHh Confidence 111 122222221111 111111111111110 11234567899999999999999998887731111 111 222 Q ss_pred HHHHhCchHHHHH-HHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEE Q lcl|NC_019404. 72 RWDDLEMTQNIND-AWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRI 150 (418) Q Consensus 72 ~~~~l~~~~~~~~-a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i 150 (418) +-..+-.+..|.+ .+.+-.++|.|++++.- + ..|.+..|.++++..+++.... .+.+..|++ T Consensus 90 ~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~---------~~G~~~~L~~l~~~~v~v~~~~-------~~~~~~y~~ 152 (412) T protein:vir:26 90 SPNNSLSSFDFINQIETIRNEKGNAYVLIER-D---------IYHQPSKLFLLNPDVVEMLIEN-------QSRELYYSI 152 (412) T ss_pred hcccCCCHHHHHHHHHHHHhhcCceEEEEEE-C---------CCCcEEEEEEEcCceeEEEEeC-------CCcEEEEEE Confidence 2222333445544 44455778999988753 2 2356778889998888765322 234556777 Q ss_pred ecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhc Q lcl|NC_019404. 151 TTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCD 230 (418) Q Consensus 151 ~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~ 230 (418) +..++. ...+.++.|+||.+.. .....+|.|++.. +.+.+.....+......-.....-.+++.+. . + T Consensus 153 ~~~~g~-~~~~~~~evih~~~~~------~~~~~~G~s~i~~-~~~~i~~~~a~~~~~~~~~~~~~~~i~~~~~--~-l- 220 (412) T protein:vir:26 153 HAATGN-KLIVHNMDMLHFKHIV------ASNMVQGISPIDV-LKNTTDFDNAVRTFNLTEMQKPDSFMLKYGS--N-V- 220 (412) T ss_pred EcCCce-EEEEccccEEEeCCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHhcCCCCceEEecCC--C-C- Confidence 765433 3568999999996432 2245679999975 5565555444433321111111111223321 1 1 Q ss_pred CcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch Q lcl|NC_019404. 231 DSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ 308 (418) Q Consensus 231 ~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg 308 (418) +++.....++++.. ... +.+.+++..++.+|+.++.+.. .+-+..++....||.+.+||..+|.+...+.. ++. T Consensus 221 ~~e~~~~~~~~~~~--~~~-~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~-sn~ 296 (412) T protein:vir:26 221 GKEKRQQVLEDFKQ--YYE-ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNF-AKN 296 (412) T ss_pred CHHHHHHHHHHHHH--Hhh-cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCc-ccH Confidence 12223344444443 223 3445555556678888876654 34556667889999999999987765444433 455 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---c-----cCCceEE--eCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019404. 309 NTALETFHKLIDRKRNAELLPILEFLIPFIV---N-----AEEWSVE--FSPLDHESSKDKAEVLEKSVNSIAALIAAGA 378 (418) Q Consensus 309 e~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~-----~~~~~~~--f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~ 378 (418) |...+.|+.. .|.|.++.+-..+- . ..+..|+ +.+|...|.+++ ++++++++++|+ T Consensus 297 e~~~~~f~~~-------~l~P~~~~ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~-------~~~~~~~~~~G~ 362 (412) T protein:vir:26 297 EELNRFYLQH-------TLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQ-------AEVYFKAVRSGY 362 (412) T ss_pred HHHHHHHHHH-------HHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHH-------HHHHHHHHhCCC Confidence 6666677665 37888877754432 1 1234455 457777777665 677889999999 Q ss_pred CCHHHHHHHHHhhcCcCCCChh-----------hccc----ccccCCCcccc Q lcl|NC_019404. 379 MDIKEARDTLRTIAPEIKIGDN-----------DIQT----EESELITETEV 415 (418) Q Consensus 379 i~~~e~r~~l~~~~~~~~~~~~-----------~~~~----~e~~~~~e~e~ 415 (418) ++++|+|+.+. ..|..+ +|+ ...+ ...-.++++|. T Consensus 363 ~t~NE~R~~~g-l~p~~g-gD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 363 YTINDIREWED-LPPVEG-GDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 412 (412) T ss_pred cCHHHHHHHhC-CCCCCC-cCeeeecccccccccchhhcccccCCCCCcCCC Confidence 99999999873 222111 111 0000 01112233444 No 45 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=99.77 E-value=6.9e-19 Score=120.15 Aligned_cols=366 Identities=13% Similarity=0.086 Sum_probs=204.4 Q ss_pred CccchhhHHHHh----------cCC-CCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc--Ccch-H Q lcl|NC_019404. 1 MVKTDSYANIFL----------GGS-DGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID--GIDD-E 66 (418) Q Consensus 1 ~~~~D~~~n~~~----------g~~-~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~~~d-~ 66 (418) |.=-|-+.+.-. |+. +.....| ...+.+. + .+++.+++||+.+|++.-+-++.+. ..+. . T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~v~~~~---a-l~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~ 74 (419) T protein:vir:57 1 MFIPQFWKGRPSENRVNWQVVPGGMRSSSSQAG--VIITPET---A-LALSAVRACVTLLAESVAQLPCVLYRRTENGGR 74 (419) T ss_pred CcchhhhccCCccccccccccccccccccccCC--ceechHH---h-hccHHHHHHHHHHHHhhccCceEEEEEcCCCce Confidence 211111111100 000 0000111 1112222 2 3467889999999999998888761 1111 1 Q ss_pred -----HHHHHHH----HHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccccc Q lcl|NC_019404. 67 -----PAFWSRW----DDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREEN 136 (418) Q Consensus 67 -----~~i~~~~----~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~d 136 (418) ..+...+ ....-...|.+.+. .-.++|.|++++.- + ..|.+..+.+++++++++.... T Consensus 75 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r-~---------~~G~~~~L~pl~~~~v~v~~~~-- 142 (419) T protein:vir:57 75 EIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDR-N---------GRGDITELIPINPHKVIVLKGP-- 142 (419) T ss_pred eccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEEcCcceEEEECC-- Confidence 1122222 12223344544444 45578999888743 2 3366788999999888764321 Q ss_pred ccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_019404. 137 PRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQ 216 (418) Q Consensus 137 p~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~ 216 (418) .|.+ +|.+...+ ..++++.|+|+.+.+ ....+|.|++.. +...+.....+......++.... T Consensus 143 -----~g~~-~y~~~~~~----~~~~~~~vih~r~~~-------~d~~~G~s~i~~-~~~~i~~~~~~~~~~~~~f~ng~ 204 (419) T protein:vir:57 143 -----DGMP-YYDIPSIG----EILPMRMVHHIKSFS-------LDGYIGTSPIQT-NPDVLGLGIAVEQHAAQVFARGT 204 (419) T ss_pred -----CceE-EEEEcCCc----eEEchhhEEEecCcC-------CCCcccccHHHH-HHHHHHHHHHHHHHHHHHHHccC Confidence 2332 46664332 457889999996432 124679999974 78888888888888888887755 Q ss_pred Cc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCC Q lcl|NC_019404. 217 QA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIH 292 (418) Q Consensus 217 ~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP 292 (418) .+ +++++.........+..+.+.+++........+.+.+++..++.+|+.++.+... +-+..+...+.||++.||| T Consensus 205 ~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP 284 (419) T protein:vir:57 205 TMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVP 284 (419) T ss_pred CccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCC Confidence 44 4555432211112222233444443322222233445555556788888876653 4466778889999999999 Q ss_pred eeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----c---CCceEEe--CCCCCCCHHHHHHHH Q lcl|NC_019404. 293 EIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVN----A---EEWSVEF--SPLDHESSKDKAEVL 363 (418) Q Consensus 293 ~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~----~---~~~~~~f--~pL~~~~eke~ae~~ 363 (418) ..+| |...++-.++-|+....||.. .|.|+++.+-..|-+ . .++.++| ..|...|.+++ T Consensus 285 p~~l-g~~~~~t~sn~e~~~~~f~~~-------~l~P~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~---- 352 (419) T protein:vir:57 285 PHMI-QDLQKSTNNNIEHQGLQYVIY-------TMLAILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSR---- 352 (419) T ss_pred HHHh-CCCCCCccccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHH---- Confidence 9777 444333345556677777764 378887777554421 1 2555555 47777777765 Q ss_pred HHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC----------hhhcccccccCCCccccccC Q lcl|NC_019404. 364 EKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIG----------DNDIQTEESELITETEVVIA 418 (418) Q Consensus 364 ~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~----------~~~~~~~e~~~~~e~e~~~~ 418 (418) ++++++++++|++|++|+|+.+. ..+..+-+ .+..++.....+..+....+ T Consensus 353 ---~~~~~~~~~~G~~T~NE~R~~~g-l~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~ 413 (419) T protein:vir:57 353 ---YESYALGRQWGWLSVNDIRRMEN-LTPIPGGDKYLTPLNMVDSKALTGIGKATPQQLKDIEA 413 (419) T ss_pred ---HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcCeeeeccccccccccccccCCCcccCcchhh Confidence 56777899999999999998763 22211110 01111111222223333333 No 46 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=99.77 E-value=3.5e-18 Score=116.30 Aligned_cols=389 Identities=14% Similarity=0.141 Sum_probs=197.1 Q ss_pred CccchhhHHH-----------HhcCCCCcccc-CccccC---CHHHHHHHHHcCCccchhhhcchhhhcc---------- Q lcl|NC_019404. 1 MVKTDSYANI-----------FLGGSDGSEIY-GSLQNQ---APTILASLYADNALVRRIIDTIPETALA---------- 55 (418) Q Consensus 1 ~~~~D~~~n~-----------~~g~~~~~~~~-~~~~~~---~~~~l~~~Y~~~~~~r~iVd~~a~d~~r---------- 55 (418) |-+.|.+.-. ++|.-.....+ -.+... +...+.+.|..++++++||++.++.+.+ T Consensus 41 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia 120 (574) T protein:vir:80 41 PYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSET 120 (574) T ss_pred CCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCCcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhc Confidence 3332322111 11211111111 111122 3345567788999999999998775542 Q ss_pred -CCccccC--cc------hH---HHHHHHHHHhC--------chHHHHHHHH-hccccceEEEEEeecCCCcccccccCC Q lcl|NC_019404. 56 -AGFHIDG--ID------DE---PAFWSRWDDLE--------MTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREG 114 (418) Q Consensus 56 -~~~~i~~--~~------d~---~~i~~~~~~l~--------~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~ 114 (418) -+|.|.- .+ +. ..+...+.+.. .+..|.+.+. ...++|.+++.+.- + .. T Consensus 121 ~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r-~---------~~ 190 (574) T protein:vir:80 121 GVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVF-D---------KD 190 (574) T ss_pred cCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEE-C---------CC Confidence 3455521 10 00 01222222211 1123444444 45678999987653 2 23 Q ss_pred CceEEEEEeeccccccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHH Q lcl|NC_019404. 115 AELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDI 194 (418) Q Consensus 115 ~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~ 194 (418) |.+..|.++++..|.+......... .+.+.+|++..++ ....+.++.||||...+.+. ....+||.||+.. + T Consensus 191 G~~~~L~pl~p~~V~v~~d~~~~~~--~~~~~y~~~~~g~--~~~~~~~~eiih~~~~~~~~---~~~~~~G~spi~~-a 262 (574) T protein:vir:80 191 GNFIKFDTVDPTTIFLATNGEGKLI--KNGERFVQVIDNR--IVAKFNERELAFAVRNPRAD---IEVGQYGYPELEI-A 262 (574) T ss_pred CcEEEEEEEcCceeEEEEcCccccc--cCceEEEEEeCCc--eEEEEccccEEEEeccCCCC---cccccccccHHHH-H Confidence 5688899999988877532211110 1123345554332 23467889999997665443 2234689999974 8 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHh-cCCcceeEEEcCCCceeEeeccc Q lcl|NC_019404. 195 LDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNN-SGVGQAIGIDAESEEYSVLNSDI 271 (418) Q Consensus 195 ~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~-~~~~~~~~~d~~~e~~~~~~~~~ 271 (418) .+.|.....+.......+.....+ ++++++... + +.+....+++++...-.. .+.+...++.+++-+|..++.+. T Consensus 263 ~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~-l-s~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~ 340 (574) T protein:vir:80 263 LKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQ-Q-SQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSA 340 (574) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC-C-CHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCCh Confidence 899999999999999988876544 355543111 1 122233344444332222 22223345545556777777665 Q ss_pred CC--HHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhh---c---c Q lcl|NC_019404. 272 GG--IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTAL--ETFHKLIDRKRNAELLPILEFLIPFIV---N---A 341 (418) Q Consensus 272 ~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~--~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~---~ 341 (418) .+ +-+...+....||.+.+||..+|.-.+.+++.+++.+.. .+.-..-..+.+..|.|++.++-..|- . . T Consensus 341 ~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~ 420 (574) T protein:vir:80 341 NDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAEFG 420 (574) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcC Confidence 43 456677888999999999997663333333322221111 111111222333447788766654442 1 2 Q ss_pred CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC-----------hhhc-------- Q lcl|NC_019404. 342 EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIG-----------DNDI-------- 402 (418) Q Consensus 342 ~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~-----------~~~~-------- 402 (418) ..+.|+|+...-.+..+++. ...++.+|++|++|+|+.+. ..+..+-+ +... T Consensus 421 ~~~~~~f~~~d~~~~~~~~~--------~~~~~~~G~lT~NE~R~~lg-l~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~ 491 (574) T protein:vir:80 421 EKYQFQFRGGDLSAQLDKLK--------IIEQEGKVFRTVNEIRHDKG-LEPIKGGDVILNGVHIQAIGQALQEEQLEYQ 491 (574) T ss_pred CceEEEecccchhhHHHHHH--------HHHHHhCCccCHHHHHHHhC-CCCCCCCCEeeeccceeecccccccccCCcc Confidence 45778888654433332221 23567899999999999763 11211100 0000 Q ss_pred --------------ccccccCC---CccccccC Q lcl|NC_019404. 403 --------------QTEESELI---TETEVVIA 418 (418) Q Consensus 403 --------------~~~e~~~~---~e~e~~~~ 418 (418) ++++.+.. .+++.-.. T Consensus 492 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~ 524 (574) T protein:vir:80 492 RSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTD 524 (574) T ss_pred chhccccccccccCCCCCCCCCCCCCCcccccc Confidence 00000000 00001011 No 47 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=99.76 E-value=1.1e-18 Score=118.99 Aligned_cols=370 Identities=15% Similarity=0.120 Sum_probs=200.2 Q ss_pred chhhHHHHhcCCCCcc-ccCcc--------------------ccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc- Q lcl|NC_019404. 4 TDSYANIFLGGSDGSE-IYGSL--------------------QNQAPTILASLYADNALVRRIIDTIPETALAAGFHID- 61 (418) Q Consensus 4 ~D~~~n~~~g~~~~~~-~~~~~--------------------~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~- 61 (418) |==+.++| |.+.... ..+.. ...+.. ...+++-+.+||+.+|+++-+-++.+. T Consensus 1 Mg~~~~l~-~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~----~al~~~~V~~~v~~Ia~~iA~lp~~~~~ 75 (457) T protein:vir:13 1 MGFWSALF-GRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPH----DALQVSAVFASVRLLSETIATLPLSTYS 75 (457) T ss_pred Cchhhhhh-cccccccccccccccccccchHHHhhcccccCCceechH----HhhccHHHHHHHHHHHHhhccCceEEEE Confidence 11112211 1111100 00000 011111 223467788999999999999888873 Q ss_pred --CcchHH----HHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccc Q lcl|NC_019404. 62 --GIDDEP----AFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ 131 (418) Q Consensus 62 --~~~d~~----~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~ 131 (418) +...+. .+...+.. +...+-++..+.+..++|.|++++.- + .|.+..|.+++++++++. T Consensus 76 ~~~~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~-~----------~g~~~~l~~l~p~~v~v~ 144 (457) T protein:vir:13 76 KRGGSRKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRW-Q----------GPNIVGLDVLDPTKIHVH 144 (457) T ss_pred ecCCcccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEe-c----------CCcEEEEEEEccCceEEE Confidence 111111 12222221 11223444455556778999988743 2 234567888888888765 Q ss_pred cccccccccccCcceEEEEecCCcc-cccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 132 NREENPRNARFGKPLTYRITTNESD-MFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQ 210 (418) Q Consensus 132 ~~~~dp~s~~yg~p~~y~i~~~~~~-~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~ 210 (418) ....+... +.....|.+...+.. ....+++++|||+.+.. +....+|.|++.. +.+.|.....+...... T Consensus 145 ~~~~~~~~--~~~~~~y~~~~~~~~~~~~~~~~~diih~~~~~------~~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~ 215 (457) T protein:vir:13 145 MVMVDGLR--RKVFEAYDIDADGNEVLLGWFTPRDVLHIPGMM------LPGDFVGCSPISY-ARESIGLALAAQKYGSK 215 (457) T ss_pred EecCCCcc--ceeEEEEEEecCCceeeEEeeCccceEEecCCC------CCCccccccHHHH-HHHHHHHHHHHHHHHHH Confidence 43322221 111234555544332 22357899999996432 2234689999975 77889988888888888 Q ss_pred HHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHh Q lcl|NC_019404. 211 LLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIV 286 (418) Q Consensus 211 l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~ia 286 (418) ++.....+ ++++++ . + +.+...++++++........+.+.+++..++.+|++++.+..+ +-+...+....|| T Consensus 216 ~f~ng~~p~gil~~~~--~-l-s~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 291 (457) T protein:vir:13 216 FFANGAMPGAVVEVPG--T-M-SEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIA 291 (457) T ss_pred HHhcCCCcceEEEcCC--C-C-CHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHH Confidence 88775554 456553 1 2 2333444555555443332333445555556789888877654 4466778889999 Q ss_pred hhhcCCeeeeeccCccccc--cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---cc----CC--ceEEeCCCCCCC Q lcl|NC_019404. 287 ALSGIHEIILKNKNVGGLS--SSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---NA----EE--WSVEFSPLDHES 355 (418) Q Consensus 287 aas~IP~t~L~G~s~~gl~--stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~~----~~--~~~~f~pL~~~~ 355 (418) .+.+||..+| |...++-. +.-+.....|+.. .|.|+++.+-..+- .. .+ ++|.+..|...| T Consensus 292 ~~fgVPp~~l-g~~~~~~~~~sn~eq~~~~f~~~-------tl~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~D 363 (457) T protein:vir:13 292 RIFGVPPHLI-SDATNSTSWGSGLAEQNIAFTMF-------SLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGA 363 (457) T ss_pred HHhCCCHHHc-CCCCCcccccchHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCccccCceeEEeechhhhccC Confidence 9999998766 76644322 2224444455443 47887776654442 22 12 445555887777 Q ss_pred HHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCC-Chh-----------hc--------------ccccccC Q lcl|NC_019404. 356 SKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKI-GDN-----------DI--------------QTEESEL 409 (418) Q Consensus 356 eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~-~~~-----------~~--------------~~~e~~~ 409 (418) .+++ ++++.+++++|++|++|+|+.+. ..+..+. .++ +. +..++.. T Consensus 364 ~~~r-------~~~~~~~~~~G~~T~NE~R~~~g-l~Pi~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (457) T protein:vir:13 364 PKER-------MELWSLGLQNGIYSIDEVRAAED-MTPLPDGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPAEEPDE 435 (457) T ss_pred HHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCcccceeeccccccccccccccccCCCCCCCCCccccCC Confidence 7665 66678899999999999998762 2111110 000 00 0000000 Q ss_pred CCccccccC Q lcl|NC_019404. 410 ITETEVVIA 418 (418) Q Consensus 410 ~~e~e~~~~ 418 (418) ..+.++... T Consensus 436 ~~~~~g~~d 444 (457) T protein:vir:13 436 EPEPEGKPD 444 (457) T ss_pred CCCCCCCCc Confidence 011111111 No 48 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=99.76 E-value=4.2e-18 Score=115.86 Aligned_cols=366 Identities=14% Similarity=0.161 Sum_probs=201.0 Q ss_pred CccchhhHHHHhc------CCCCc--------cccCccccCCHHH-HHHHHHcCCccchhhhcchhhhccCCccc--cCc Q lcl|NC_019404. 1 MVKTDSYANIFLG------GSDGS--------EIYGSLQNQAPTI-LASLYADNALVRRIIDTIPETALAAGFHI--DGI 63 (418) Q Consensus 1 ~~~~D~~~n~~~g------~~~~~--------~~~~~~~~~~~~~-l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i--~~~ 63 (418) |=-.+-..+.+.. ++..+ ...+.....+..- -...+.+++.+.+||+.+|++.-+-++.+ +.. T Consensus 7 mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:81 7 LGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTP 86 (432) T ss_pred cchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhCceeeEEecC Confidence 2112222222211 00000 0111111000000 11234678889999999999999988886 222 Q ss_pred chH-HH----HHHHH----HHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccc Q lcl|NC_019404. 64 DDE-PA----FWSRW----DDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNR 133 (418) Q Consensus 64 ~d~-~~----i~~~~----~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~ 133 (418) +.. .. +...+ ...-....|.+.+. +..++|.|++++.-.+ |.+..|.++++..+++... T Consensus 87 ~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~~-----------g~~~~L~~l~~~~v~v~~~ 155 (432) T protein:vir:81 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD-----------GRIESLQYLANDRLTITTD 155 (432) T ss_pred CcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC-----------CcEEEEEEEcCCceEEEEC Confidence 211 11 11111 11222344555555 4567899988765433 3466788888888776532 Q ss_pred cccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 134 EENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLR 213 (418) Q Consensus 134 ~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~ 213 (418) ..|. ..|.++..++ ....++++.|+||.+.++ +..+|.||+.. +.+.|.....+.......+. T Consensus 156 -------~~g~-~~y~~~~~~g-~~~~~~~~~iih~r~~~~-------dg~~G~spi~~-~~~~i~~~~~~~~~~~~~f~ 218 (432) T protein:vir:81 156 -------PKGN-TAYRYRRTDG-QMIDIPKQQIWKIMGYSL-------DGENGLSAIRY-GAQIFGTAIAAEAQAARAFR 218 (432) T ss_pred -------CCCc-EEEEEEecCc-eEEEEccccEEEecCCCC-------CCcccccHHHH-HHHHHHHHHHHHHHHHHHHh Confidence 1232 3456654433 235789999999975432 23579999975 77888888888888888776 Q ss_pred HcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhh Q lcl|NC_019404. 214 RKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALS 289 (418) Q Consensus 214 ~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas 289 (418) ..... ++++++. ++ .+..+...+++.. ..+.++.+++ .++.+|++++.+..+ +-+..++..+.||.+. T Consensus 219 ng~~~~gil~~~~~---l~-~e~~~~~~~~~~~---~~nag~~~vl-~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~f 290 (432) T protein:vir:81 219 NGQLQSVYYQIDRF---LT-DDQYDSFAKKVSG---SVEAGRAPLL-EGGMDVKSLGLNPVDAQLLQSRQYSVESICRFF 290 (432) T ss_pred cCCCcceEEecCCC---CC-HHHHHHHHHHHhh---hhcCCCceec-CCCceEEEccCCHHHHHHHHHHHHHHHHHHHHh Confidence 64443 5666531 21 2222233333322 2233344444 456789988887654 4466788999999999 Q ss_pred cCCeeeeeccCccccc---cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hc-c---CCceEEe--CCCCCCCHH Q lcl|NC_019404. 290 GIHEIILKNKNVGGLS---SSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VN-A---EEWSVEF--SPLDHESSK 357 (418) Q Consensus 290 ~IP~t~L~G~s~~gl~---stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~-~---~~~~~~f--~pL~~~~ek 357 (418) +||..+| |....|-. ++-|.....|+.. -|.|++..+-..+ +. . .++.++| +.|...|.+ T Consensus 291 gVPp~~l-g~~~~~~~~~~sn~eq~~~~f~~~-------tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~ 362 (432) T protein:vir:81 291 GVPPSMI-GHSSAGTTSWGSGIESQQLGFLTM-------TLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSA 362 (432) T ss_pred CCCHHHc-CCcCCccccccchHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHH Confidence 9998765 65544333 2234455566653 3778776664433 21 1 2344554 477777776 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC-----------hhhc-----ccccccCCCccccccC Q lcl|NC_019404. 358 DKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIG-----------DNDI-----QTEESELITETEVVIA 418 (418) Q Consensus 358 e~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~-----------~~~~-----~~~e~~~~~e~e~~~~ 418 (418) ++ ++++++++++|+++++|+|+.+.- .+..+-+ .+.. +++-.....+.+..+. T Consensus 363 ~r-------~~~~~~~~~~G~~t~NE~R~~~gl-pp~~g~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~n~~~~~~~ 431 (432) T protein:vir:81 363 AR-------SSYYSQLVNNGLMTRDEAREIEGL-PKLGGNAAVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKVS 431 (432) T ss_pred HH-------HHHHHHHHhCCCCCHHHHHHHhCC-CCCCCCcceEeecCcccchhhhccCCCCCCCCCCCCccccccc Confidence 65 677888999999999999997631 1111100 0000 0001112233444444 No 49 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=99.76 E-value=2.7e-18 Score=116.94 Aligned_cols=364 Identities=13% Similarity=0.037 Sum_probs=203.2 Q ss_pred ccchhhHHHHhcCCCC---ccccCccccCCHHH-------------HHHHHHcCCccchhhhcchhhhccCCccccCcch Q lcl|NC_019404. 2 VKTDSYANIFLGGSDG---SEIYGSLQNQAPTI-------------LASLYADNALVRRIIDTIPETALAAGFHIDGIDD 65 (418) Q Consensus 2 ~~~D~~~n~~~g~~~~---~~~~~~~~~~~~~~-------------l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d 65 (418) |-| |+.|.+...... +...++........ -...+.+++.+++||+.+|++.-+-++.+..... T Consensus 1 m~m-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~ 79 (392) T protein:vir:74 1 MIL-PILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN 79 (392) T ss_pred Ccc-hhhhhhhcccCcccccccccccccCchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchh Confidence 111 334444221110 00001000000000 0112345788999999999999888888764332 Q ss_pred HHHHHHHHHHhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCc Q lcl|NC_019404. 66 EPAFWSRWDDLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGK 144 (418) Q Consensus 66 ~~~i~~~~~~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~ 144 (418) ..+...-...-....|.+.+.+ ..++|.|++++.- + ..|.+..+.++++..+++.... +|. T Consensus 80 -~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r-~---------~~G~~~~L~~i~~~~v~v~~~~-------~~~ 141 (392) T protein:vir:74 80 -QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWR-N---------ANGADMKWEYLRPSQVNTYYFE-------YEN 141 (392) T ss_pred -hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEE-C---------CCCcEEEEEEEcCceeEEEEcC-------CCc Confidence 2333333333344556665554 5678999988753 2 2356778999998888655321 233 Q ss_pred ceEEEEecCCcc--cccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--ee Q lcl|NC_019404. 145 PLTYRITTNESD--MFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VW 220 (418) Q Consensus 145 p~~y~i~~~~~~--~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~ 220 (418) ...|+++..+.. ....++++.||||.+.. .....+|.|++.. +.+.|.....+.......+.....+ ++ T Consensus 142 ~~~y~~~~~~~~~~~~~~~~~~evih~~~~~------~~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~f~ng~~p~~il 214 (392) T protein:vir:74 142 GMYYNITFDDPKIEPILQAPQSDLIHMKLLS------IDGGKTGISPLYS-LRRESKIQRASDRLTISSLNSSLNVPGVL 214 (392) T ss_pred eEEEEEEecCCccceeEEEcCccEEEecCCC------CCCccccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEE Confidence 446666644322 23468899999996433 2344679999975 8888998888888888888776543 45 Q ss_pred ecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeeeec Q lcl|NC_019404. 221 KAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIILKN 298 (418) Q Consensus 221 k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L~G 298 (418) ++++- .... ++..+++........+.+.+++..++.+|++++.+.. .+-+...+....||.+.+||..+| | T Consensus 215 ~~~~~--~~~~----~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g 287 (392) T protein:vir:74 215 TVKGG--GLLS----DKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYI-G 287 (392) T ss_pred EeCCC--CCch----HHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-C Confidence 65531 1111 1222222222222223344444455678998887643 456677888899999999998766 4 Q ss_pred cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019404. 299 KNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVN--AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAA 376 (418) Q Consensus 299 ~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~--~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~ 376 (418) ..... ++. .+..+.||. ..|.|.+..+-..+-+ ..++.+.+..+...+.+++ ++.+..++.+ T Consensus 288 ~~~~~-~~~-~e~~~~~~~-------~~l~p~~~~ie~~l~~~l~~~~~~~~~~~~~~d~~~~-------~~~~~~l~~~ 351 (392) T protein:vir:74 288 GQGDQ-QSS-IQQISGMYA-------SALNRYLRPAISELEYKLSDHISVNMRPAIDPLGDNY-------LSTISTATRW 351 (392) T ss_pred CCCCc-ccH-HHHHHHHHH-------HHHHHHHHHHHHHHHHhccchhcccchhhhcCCHHHH-------HHHHHHHHhC Confidence 33222 122 223444433 3467776666444421 2346666667766666544 5678889999 Q ss_pred CCCCHHHHHHHHHhhcCcCCCChhhcccccccCC--CccccccC Q lcl|NC_019404. 377 GAMDIKEARDTLRTIAPEIKIGDNDIQTEESELI--TETEVVIA 418 (418) Q Consensus 377 g~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~--~e~e~~~~ 418 (418) |+++++|+|+.+...+- ...++...|+-+. +-++.--+ T Consensus 352 g~~t~near~~~~~~g~----~pne~r~~enl~~~~~Gd~~~p~ 391 (392) T protein:vir:74 352 GALAENQATFVLQEAGY----IPKDLPAPENTNKKTTGQSNEPV 391 (392) T ss_pred CCcCHHHHHHHHHhCCC----CccccchhcCCCCCCCCCCCCCC Confidence 99999999998754432 1122222221110 00111111 No 50 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=99.76 E-value=2.2e-18 Score=117.42 Aligned_cols=364 Identities=13% Similarity=0.041 Sum_probs=203.2 Q ss_pred ccchhhHHHHhcCCCCc---c---cc---------CccccCCHH-HHHHHHHcCCccchhhhcchhhhccCCccccCcch Q lcl|NC_019404. 2 VKTDSYANIFLGGSDGS---E---IY---------GSLQNQAPT-ILASLYADNALVRRIIDTIPETALAAGFHIDGIDD 65 (418) Q Consensus 2 ~~~D~~~n~~~g~~~~~---~---~~---------~~~~~~~~~-~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d 65 (418) |-| |+.|.+.+..... . .+ +........ --.....+++.+.++|+.+|++.-+-++.+..... T Consensus 1 m~m-~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~ 79 (392) T protein:vir:10 1 MIL-PILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN 79 (392) T ss_pred Ccc-hhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh Confidence 222 3333332211100 0 00 000000000 00111235788999999999999988888764332 Q ss_pred HHHHHHHHHHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCc Q lcl|NC_019404. 66 EPAFWSRWDDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGK 144 (418) Q Consensus 66 ~~~i~~~~~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~ 144 (418) ..+..+-...-....|.+.+. ...++|.|++++.- + ..|.+..+.++++..+++.... .|. T Consensus 80 -~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r-~---------~~g~~~~L~~l~~~~v~~~~~~-------~~~ 141 (392) T protein:vir:10 80 -QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWR-N---------ANGADMKWEYLRPSQVNTYYFE-------YEN 141 (392) T ss_pred -hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEE-C---------CCCcEEEEEEEcCceeEEEEcC-------CCc Confidence 233333233333455555555 45678999988753 2 2356778999998888665421 344 Q ss_pred ceEEEEecCCcc--cccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--ee Q lcl|NC_019404. 145 PLTYRITTNESD--MFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VW 220 (418) Q Consensus 145 p~~y~i~~~~~~--~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~ 220 (418) ...|+++..+.. ....++++.|||+.+.. +....+|.|++.. +.+.|.....+.......++..... ++ T Consensus 142 ~~~y~~~~~~~~~~~~~~~~~~eiih~~~~~------~~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil 214 (392) T protein:vir:10 142 GMYYNITFDDPKIEPILQAPQSDLIHMKLLS------IDGGKTGISPLYS-LRRESKIQRASDRLTISSLNSSLNVPGVL 214 (392) T ss_pred eEEEEEEecCcccceeEEEccccEEEecCCC------CCCccccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEE Confidence 456777654322 23568999999996543 2344679999975 7888988888888888888876544 45 Q ss_pred ecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeeeec Q lcl|NC_019404. 221 KAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIILKN 298 (418) Q Consensus 221 k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L~G 298 (418) ++++- .. ..++.++++........+.+.+++..++.+|++++.+.. .+-+..+...+.||.+.|||..+|.+ T Consensus 215 ~~~~~--~~----~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~ 288 (392) T protein:vir:10 215 TVKGG--GL----LSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGG 288 (392) T ss_pred EeCCC--CC----chHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC Confidence 66531 11 112222322222222223344444445678988887653 45667788889999999999877643 Q ss_pred cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--ccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019404. 299 KNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV--NAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAA 376 (418) Q Consensus 299 ~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~--~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~ 376 (418) +...- + ..+..+.||. ..|.|.+..+-..+- +..++.++...+...+.+++ ++.+..++.+ T Consensus 289 -~~~~~-~-~~~~~~~f~~-------~~l~P~~~~ie~~l~~~L~~~~~~d~~~~~~~d~~~~-------~~~~~~l~~~ 351 (392) T protein:vir:10 289 -QGDQQ-S-SIQQISGMYA-------SALNRYLRPAISELEYKLSDHISVNMRPAIDPLGDNY-------LSTISTATRW 351 (392) T ss_pred -CCCcc-c-HHHHHHHHHH-------HHHHHHHHHHHHHHHHhccccccccchhhhccCHHHH-------HHHHHHHHhC Confidence 32221 2 2233445544 336777666544442 12345566666666665443 5677889999 Q ss_pred CCCCHHHHHHHHHhhcCcCCCChhhcccccccCC--CccccccC Q lcl|NC_019404. 377 GAMDIKEARDTLRTIAPEIKIGDNDIQTEESELI--TETEVVIA 418 (418) Q Consensus 377 g~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~--~e~e~~~~ 418 (418) |+++++|+|+.+...+-. ..++...|+-+. +-++.--+ T Consensus 352 g~~t~nE~r~~l~~~g~~----p~e~r~~e~l~~~~~Gd~~~p~ 391 (392) T protein:vir:10 352 GALAENQATFVLQEAGYI----PKDLPAPENTNKKTTGQSNEPV 391 (392) T ss_pred CCcCHHHHHHHHHhcCCC----ccccchhcCCCCCCCCCCCCCC Confidence 999999999987654421 122222221111 01111111 No 51 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=99.76 E-value=2.2e-18 Score=117.42 Aligned_cols=364 Identities=13% Similarity=0.041 Sum_probs=203.2 Q ss_pred ccchhhHHHHhcCCCCc---c---cc---------CccccCCHH-HHHHHHHcCCccchhhhcchhhhccCCccccCcch Q lcl|NC_019404. 2 VKTDSYANIFLGGSDGS---E---IY---------GSLQNQAPT-ILASLYADNALVRRIIDTIPETALAAGFHIDGIDD 65 (418) Q Consensus 2 ~~~D~~~n~~~g~~~~~---~---~~---------~~~~~~~~~-~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d 65 (418) |-| |+.|.+.+..... . .+ +........ --.....+++.+.++|+.+|++.-+-++.+..... T Consensus 1 m~m-~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~ 79 (392) T protein:vir:39 1 MIL-PILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN 79 (392) T ss_pred Ccc-hhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh Confidence 222 3333332211100 0 00 000000000 00111235788999999999999988888764332 Q ss_pred HHHHHHHHHHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCc Q lcl|NC_019404. 66 EPAFWSRWDDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGK 144 (418) Q Consensus 66 ~~~i~~~~~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~ 144 (418) ..+..+-...-....|.+.+. ...++|.|++++.- + ..|.+..+.++++..+++.... .|. T Consensus 80 -~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r-~---------~~g~~~~L~~l~~~~v~~~~~~-------~~~ 141 (392) T protein:vir:39 80 -QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWR-N---------ANGADMKWEYLRPSQVNTYYFE-------YEN 141 (392) T ss_pred -hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEE-C---------CCCcEEEEEEEcCceeEEEEcC-------CCc Confidence 233333233333455555555 45678999988753 2 2356778999998888665421 344 Q ss_pred ceEEEEecCCcc--cccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--ee Q lcl|NC_019404. 145 PLTYRITTNESD--MFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VW 220 (418) Q Consensus 145 p~~y~i~~~~~~--~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~ 220 (418) ...|+++..+.. ....++++.|||+.+.. +....+|.|++.. +.+.|.....+.......++..... ++ T Consensus 142 ~~~y~~~~~~~~~~~~~~~~~~eiih~~~~~------~~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil 214 (392) T protein:vir:39 142 GMYYNITFDDPKIEPILQAPQSDLIHMKLLS------IDGGKTGISPLYS-LRRESKIQRASDRLTISSLNSSLNVPGVL 214 (392) T ss_pred eEEEEEEecCcccceeEEEccccEEEecCCC------CCCccccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEE Confidence 456777654322 23568999999996543 2344679999975 7888988888888888888876544 45 Q ss_pred ecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeeeec Q lcl|NC_019404. 221 KAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIILKN 298 (418) Q Consensus 221 k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L~G 298 (418) ++++- .. ..++.++++........+.+.+++..++.+|++++.+.. .+-+..+...+.||.+.|||..+|.+ T Consensus 215 ~~~~~--~~----~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~ 288 (392) T protein:vir:39 215 TVKGG--GL----LSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGG 288 (392) T ss_pred EeCCC--CC----chHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC Confidence 66531 11 112222322222222223344444445678988887653 45667788889999999999877643 Q ss_pred cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--ccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019404. 299 KNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV--NAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAA 376 (418) Q Consensus 299 ~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~--~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~ 376 (418) +...- + ..+..+.||. ..|.|.+..+-..+- +..++.++...+...+.+++ ++.+..++.+ T Consensus 289 -~~~~~-~-~~~~~~~f~~-------~~l~P~~~~ie~~l~~~L~~~~~~d~~~~~~~d~~~~-------~~~~~~l~~~ 351 (392) T protein:vir:39 289 -QGDQQ-S-SIQQISGMYA-------SALNRYLRPAISELEYKLSDHISVNMRPAIDPLGDNY-------LSTISTATRW 351 (392) T ss_pred -CCCcc-c-HHHHHHHHHH-------HHHHHHHHHHHHHHHHhccccccccchhhhccCHHHH-------HHHHHHHHhC Confidence 32221 2 2233445544 336777666544442 12345566666666665443 5677889999 Q ss_pred CCCCHHHHHHHHHhhcCcCCCChhhcccccccCC--CccccccC Q lcl|NC_019404. 377 GAMDIKEARDTLRTIAPEIKIGDNDIQTEESELI--TETEVVIA 418 (418) Q Consensus 377 g~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~--~e~e~~~~ 418 (418) |+++++|+|+.+...+-. ..++...|+-+. +-++.--+ T Consensus 352 g~~t~nE~r~~l~~~g~~----p~e~r~~e~l~~~~~Gd~~~p~ 391 (392) T protein:vir:39 352 GALAENQATFVLQEAGYI----PKDLPAPENTNKKTTGQSNEPV 391 (392) T ss_pred CCcCHHHHHHHHHhcCCC----ccccchhcCCCCCCCCCCCCCC Confidence 999999999987654421 122222221111 01111111 No 52 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=99.76 E-value=9.2e-19 Score=119.46 Aligned_cols=360 Identities=12% Similarity=0.128 Sum_probs=195.8 Q ss_pred CccchhhHHHHhcCCC---CccccCcc-ccCC-HHHH-------HHHHHcCCccchhhhcchhhhccCCcccc--CcchH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSD---GSEIYGSL-QNQA-PTIL-------ASLYADNALVRRIIDTIPETALAAGFHID--GIDDE 66 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~---~~~~~~~~-~~~~-~~~l-------~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~~~d~ 66 (418) ++.++=|.+--.-... .......+ ...+ +..+ ...+..++.+.+||+..|.+.-+-++.+- +.+.. T Consensus 10 ~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~~~~~~~~~~~~~ 89 (413) T protein:vir:96 10 DKNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKELISDGYTKLSDSPEVRMAVDCIADLVSNMTIQLMQNGETGD 89 (413) T ss_pred hhcCCccccCCCcchhhhhhccccccccccccchhhHhhhccchhHHHhhchHHHHHHHHHHHhhccCceEEEEecCCCc Confidence 3333333221000000 00000000 0000 1111 11235589999999999999999888872 11111 Q ss_pred HH----HHHHH----HHhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccc Q lcl|NC_019404. 67 PA----FWSRW----DDLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENP 137 (418) Q Consensus 67 ~~----i~~~~----~~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp 137 (418) .. +...+ ...-.+..|.+.+.+ -.++|.|++++.-++ .++.+..+.+++++.+++.... T Consensus 90 ~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~---------~g~~~~~L~~l~~~~v~~~~~~--- 157 (413) T protein:vir:96 90 KRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQV---------SGDKIIGLTPISPYKVTFNVSD--- 157 (413) T ss_pred cccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcC---------CCCceEEEEEecCceeEEEEcC--- Confidence 11 11111 112223445444444 557899998875421 1234667888888888764321 Q ss_pred cccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019404. 138 RNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ 217 (418) Q Consensus 138 ~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~ 217 (418) +. ..|.+...+ .++.++.||||...+-+ ...+.|.|++.. +.+.|.....+......++..... T Consensus 158 -----~~-~~y~~~~~~----~~~~~~evih~k~~~~~-----~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~~~ng~~ 221 (413) T protein:vir:96 158 -----DD-LDYSITFDN----KEYDPSTLLHFVLNPSI-----ERPFIGTGYKVA-LKDIVGNLKQASVTKKGFMASEYM 221 (413) T ss_pred -----Ce-EEEEEeecC----cEEchhhEEEEeccCCC-----CCccccccHHHH-HHHHHHHHHHHHHHHHHHHhccCC Confidence 11 345555433 35788999999644321 233469999975 888999999998889888887654 Q ss_pred c--eeecchHHHhhcCcchHHHHHHHHHHHHHh-cCCcceeEEEcCCCceeEee-cccC--CHHHHHHHHHHHHhhhhcC Q lcl|NC_019404. 218 A--VWKAKGLAELCDDSEGFGAARLRLAQVDNN-SGVGQAIGIDAESEEYSVLN-SDIG--GIDAFLDKKFDRIVALSGI 291 (418) Q Consensus 218 ~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~-~~~~~~~~~d~~~e~~~~~~-~~~~--gl~~~~~~~~~~iaaas~I 291 (418) + ++++++ . + .++...+.+++++..... .+.++.+++.....++..+. .+.. .+-+........||.+.+| T Consensus 222 p~gil~~~~--~-l-~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgV 297 (413) T protein:vir:96 222 PNLIVSVDS--D-S-DELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGV 297 (413) T ss_pred ccEEEEeCC--C-C-CHHHHHHHHHHHHHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCC Confidence 3 556553 1 2 223334455555443322 22344455655544444432 3332 3446667888999999999 Q ss_pred CeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hccCCceEEe--CCCCCCCHHHHHHHHHHH Q lcl|NC_019404. 292 HEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VNAEEWSVEF--SPLDHESSKDKAEVLEKS 366 (418) Q Consensus 292 P~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~~~~~~~f--~pL~~~~eke~ae~~~~~ 366 (418) |..+| |.. +..+....+|+.. .|.|+++.+-..| +..+++.++| ..|...|.+++ T Consensus 298 P~~~l-g~~-----~~~~~~~~~~~~~-------~l~P~~~~ie~~ln~~ll~~~~~~~fd~~~ll~~d~~~~------- 357 (413) T protein:vir:96 298 PAFLL-GVG-----TYNKDEFNNFINT-------KIMSIAQVIQQTYNKLIVEEDMYFSLNPRSLYNYSLTEM------- 357 (413) T ss_pred CHHHc-CCC-----cchHHHHHHHHHH-------HHHHHHHHHHHHHHHhhCCCCcEEEEechhhhccCHHHH------- Confidence 99766 421 1123344555543 3778776665544 3455665555 46666666654 Q ss_pred HHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC----------hhhcccccccCCCcc Q lcl|NC_019404. 367 VNSIAALIAAGAMDIKEARDTLRTIAPEIKIG----------DNDIQTEESELITET 413 (418) Q Consensus 367 a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~----------~~~~~~~e~~~~~e~ 413 (418) ++++..++++|+++++|+|+.+. ..|..+-+ .++..+..+..-+|+ T Consensus 358 ~~~~~~~~~~G~~t~NE~R~~~g-~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 358 VSAGAQMTQLNALRRNEFRNWVG-MPPDAEMDDLLVLENYLQQKDLVNQKKLIQDET 413 (413) T ss_pred HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcceeeecccccchhhcccccCCCCCCC Confidence 66788899999999999998763 22322210 111111112222333 No 53 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=99.76 E-value=1.9e-18 Score=117.71 Aligned_cols=366 Identities=12% Similarity=0.088 Sum_probs=199.6 Q ss_pred Cc-----cchhhHHHHhcC-CCC-c-------cccC-----ccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc Q lcl|NC_019404. 1 MV-----KTDSYANIFLGG-SDG-S-------EIYG-----SLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID 61 (418) Q Consensus 1 ~~-----~~D~~~n~~~g~-~~~-~-------~~~~-----~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~ 61 (418) |= +...+.+.+++- +.. + ...+ .....+.+ -+.+++.+.+||+.++++.-+-++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~~~~~~~~~~~~~g~~v~~~----~al~~~~v~~ci~~Ia~~ia~lp~~~~ 76 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVPISLTDGSFWSAWGGMGSSSGETVTAD----SALQLSAVWSCVRLIAETIATLPLNLY 76 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCcccCCchhHHHhhcccccCCCceechH----hhhccHHHHHHHHHHHHHHhhCceeEE Confidence 10 111111111110 000 0 0000 00111222 235678899999999999998887762 Q ss_pred C--cc-hH-----HHHHHHH----HHhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeecccc Q lcl|NC_019404. 62 G--ID-DE-----PAFWSRW----DDLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQV 128 (418) Q Consensus 62 ~--~~-d~-----~~i~~~~----~~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i 128 (418) - .+ .. ..+...+ ...--...|.+.+.. -.++|.|++++.-++ |.+..|.+++++.+ T Consensus 77 ~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-----------g~~~~L~~l~p~~v 145 (437) T protein:vir:10 77 QTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA-----------GVLIGLELMLPQRT 145 (437) T ss_pred EEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC-----------CcEEEEEEEcCcce Confidence 1 11 10 0111111 112233445555554 467899998875322 45667888888887 Q ss_pred ccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 129 KVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLA 208 (418) Q Consensus 129 ~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~ 208 (418) ++.... -|.+ .|++...++ ....+.++.||||.+..+ ...+|.||+.. +.+.|.....+.... T Consensus 146 ~i~~~~-------~g~~-~y~~~~~~g-~~~~~~~~dIih~r~~~~-------d~~~G~spi~~-~~~~i~~~~~~~~~~ 208 (437) T protein:vir:10 146 TVKRLT-------SGAL-QYTYRNVDG-TVSTLAEDDVFHVRGFSL-------DGLMGLTPIQY-AREVLGNSTAANKTS 208 (437) T ss_pred EEEECC-------CCeE-EEEEEecCc-eEEEEccccEEEecCcCC-------CCcccccHHHH-HHHHHHHHHHHHHHH Confidence 765432 1222 344443322 235689999999964321 23579999975 888898888888888 Q ss_pred HHHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHHHHHHHHHHH Q lcl|NC_019404. 209 TQLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKKFDR 284 (418) Q Consensus 209 ~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~ 284 (418) ..++.....+ ++++++. + +++...+.++++......-.+.+.+++..++.+|++++.+.. .+-+...+.... T Consensus 209 ~~~f~ng~~p~gil~~~~~---l-~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 284 (437) T protein:vir:10 209 ASVFRNGLRPSGVLSTDQI---L-QKEKRAEIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEE 284 (437) T ss_pred HHHHhccCCccEEEEcCCC---C-CHHHHHHHHHHHHHHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHH Confidence 8888775533 5565531 2 233344555555443322223344555555678888887654 456667788899 Q ss_pred HhhhhcCCeeeeeccCccc-c-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---cc----CC--ceEEeCCCCC Q lcl|NC_019404. 285 IVALSGIHEIILKNKNVGG-L-SSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---NA----EE--WSVEFSPLDH 353 (418) Q Consensus 285 iaaas~IP~t~L~G~s~~g-l-~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~~----~~--~~~~f~pL~~ 353 (418) ||.+.+||..+| |...++ . ++.-++....||.. -|.|++..+-..|- .. .. ++|++..|.. T Consensus 285 Ia~~fgVPp~~l-g~~~~~t~~~sn~e~~~~~f~~~-------tl~P~~~~ie~~l~~kll~~~e~~~~~~~fd~~~ll~ 356 (437) T protein:vir:10 285 ICRWYRVPPFMV-GHSEKSTSWGTGIEQQTLGFLTF-------TLRPWLTRIEQAARRSLLRPGERDQFYAEFSVEGLLR 356 (437) T ss_pred HHHHhCCCHHHh-CCCCCcccccchHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccCccccCceEEEEechhhhc Confidence 999999998766 655332 2 23335555666654 37888777655542 11 12 3455557777 Q ss_pred CCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCCh--------hhccccc-ccCCCc-ccccc------ Q lcl|NC_019404. 354 ESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGD--------NDIQTEE-SELITE-TEVVI------ 417 (418) Q Consensus 354 ~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~--------~~~~~~e-~~~~~e-~e~~~------ 417 (418) .|.+++ ++++++++++|++|++|+|+.+. ..+..+.++ -.++... ..+... ++..- T Consensus 357 ~d~~~r-------~~~~~~~~~~G~~T~NE~R~~~g-l~pi~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (437) T protein:vir:10 357 ADSAGR-------AAFYSTMTQNGLMTRDECRAKEN-LPPMGGNAAVLTVQSALLPIDKLGEHTTATAAQDALKAWLYQE 428 (437) T ss_pred cCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCcceEeecCcccchhhccCcCCCcchhccccccCCCC Confidence 777665 55788899999999999998762 222111100 0000000 000000 00000 Q ss_pred ----C Q lcl|NC_019404. 418 ----A 418 (418) Q Consensus 418 ----~ 418 (418) + T Consensus 429 ~~~~~ 433 (437) T protein:vir:10 429 EKTRA 433 (437) T ss_pred CCCCc Confidence 0 No 54 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=99.75 E-value=4.5e-18 Score=115.71 Aligned_cols=353 Identities=12% Similarity=0.082 Sum_probs=208.6 Q ss_pred ccchhh------------------HHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc-- Q lcl|NC_019404. 2 VKTDSY------------------ANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID-- 61 (418) Q Consensus 2 ~~~D~~------------------~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~-- 61 (418) |=.|.+ ...|.|.. .......+... +..++-++++|+.+|++.-+-++.+- T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~v~~~~----al~~~~v~~~i~~Ia~~ia~l~~~~~~~ 71 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRK-----TASGERVSESN----SLVQPDIFACVNVLSDDIAKLPIHTYKR 71 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCcc-----cccCceechhh----hhccHHHHHHHHHHHHhhhhCceEEEEe Confidence 111111 11111100 01111122222 23567789999999999999888762 Q ss_pred CcchH-H--------HHHHHHHHhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccc Q lcl|NC_019404. 62 GIDDE-P--------AFWSRWDDLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ 131 (418) Q Consensus 62 ~~~d~-~--------~i~~~~~~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~ 131 (418) .++.. . .+..+-..+..+..|.+.+.+ -.++|.|++++.-+ ..|.+..+.++++..+++. T Consensus 72 ~~~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~----------~~G~~~~L~~l~~~~v~v~ 141 (416) T protein:vir:12 72 TDGGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFG----------SHGYPEALFPLRPDYTNAY 141 (416) T ss_pred cCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEC----------CCCcEEEEEEECCcceEEE Confidence 21111 1 122222223334455555555 45689999887532 2355778889998888764 Q ss_pred cccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 132 NREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQL 211 (418) Q Consensus 132 ~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l 211 (418) ... .+.+.+|++...+.. ..+++++|+|+.+... ...+|.|++.. +.+.+.....+......+ T Consensus 142 ~~~-------~~~~~~~~~~~~g~~--~~~~~~eiih~~~~~~-------~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~ 204 (416) T protein:vir:12 142 VHP-------TTGMLWYQTVLNGKA--IELYDYEVLHFKGLST-------DGIHGKSPIGV-VREHIGAQAAATKYNAKL 204 (416) T ss_pred EeC-------CCcEEEEEEecCCeE--EEecCccEEEecCcCC-------CCcccccHHHH-HHHHHHHHHHHHHHHHHH Confidence 321 223446666554432 5789999999964321 23579999975 889999999998889888 Q ss_pred HHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhh Q lcl|NC_019404. 212 LRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVA 287 (418) Q Consensus 212 ~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaa 287 (418) +.....+ +++++.. + +.+...+.+++++... + .+.+++..++.+|++++.+..+ +-+...+....||. T Consensus 205 ~~ng~~p~~il~~~~~---~-~~e~~~~~~~~~~~~~---~-~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 276 (416) T protein:vir:12 205 YKNEATPRGILKVPAF---L-DEKPKENVRKEWKRVN---K-VENIAIIDYGLEYQSISMPLQEAQFVESMKFNKAQISM 276 (416) T ss_pred HhcCCCCceEEecCCC---C-CHHHHHHHHHHHHHHh---c-CCCeeecCCCceEEEccCChhhHHHHHHHHHHHHHHHH Confidence 8876544 5566531 2 2333445555554322 2 3444555556789988887764 44778889999999 Q ss_pred hhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcc----CCceEEe--CCCCCCCHH Q lcl|NC_019404. 288 LSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VNA----EEWSVEF--SPLDHESSK 357 (418) Q Consensus 288 as~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~~----~~~~~~f--~pL~~~~ek 357 (418) +.+||..+|.+...+.. ++-+...+.||.. .|.|++..+-..+ +.. .++.|+| +.|...|.+ T Consensus 277 ~fgVPp~~lg~~~~~t~-sn~e~~~~~f~~~-------~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~ 348 (416) T protein:vir:12 277 IYKVPLHKLNELDKATF-SNIEHQSIEYVRN-------TLQPWIVNFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSK 348 (416) T ss_pred HhCCCHHHhCCccCCCc-ccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCchhhcCCceEEeechhhhccCHH Confidence 99999987754444443 4456666677654 4788777665554 211 2455555 466666766 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCCh-----------hhccc--------ccccCCCcccc Q lcl|NC_019404. 358 DKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGD-----------NDIQT--------EESELITETEV 415 (418) Q Consensus 358 e~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~-----------~~~~~--------~e~~~~~e~e~ 415 (418) ++ ++++.+++++|++|++|+|+.+. ..|-.+ .+ +..++ +..-.++.+|+ T Consensus 349 ~~-------~~~~~~~~~~G~~T~NE~R~~~g-l~Pi~g-gd~~~~~~n~~~~~~~~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 349 TQ-------AEYLKTLHETGVLNKDEIRELLE-RNPIEN-GDKYISSLNYVFLDFLEEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred HH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCC-cceeeeccccccccccchhhccccccccCCCCCcCCC Confidence 65 67788899999999999999763 111111 11 00100 11223455566 No 55 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=99.75 E-value=1.9e-18 Score=117.74 Aligned_cols=361 Identities=13% Similarity=0.070 Sum_probs=197.4 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH---HHHHHHH---- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE---PAFWSRW---- 73 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~---~~i~~~~---- 73 (418) -...+.+...+....+.....+ ...+.. -+.+++-+.++|+.+|+++-+-++.+..+... ..+...+ T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~----~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~~~~~~lL~~~P 86 (416) T protein:vir:45 13 QYNEDDLQMMVQTLPGFQGTKL--RQYKDI----EAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRP 86 (416) T ss_pred cCCCcchhHHHHHhccccccCc--cccchh----hhhcchHHHHHHHHHHHhhccCceEEecCccccccchHHHHHhccc Confidence 1111222222210000000000 011111 11234456679999999999988877432211 1111111 Q ss_pred HHhCchHHHHHHHHhc-cccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEe- Q lcl|NC_019404. 74 DDLEMTQNINDAWSWA-RLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRIT- 151 (418) Q Consensus 74 ~~l~~~~~~~~a~~~~-rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~- 151 (418) ..+-....|.+.+.+. .++|.|++++.- + ..|.+..|.+++++++++... ..|.+.+|... T Consensus 87 N~~~t~~~f~~~~~~~lll~Gna~~~i~r-~---------~~G~~~~L~~i~~~~v~v~~~-------~~g~~~~~~~~~ 149 (416) T protein:vir:45 87 NPMYNGYIFKLVVFVSALLTSHGYIEITR-D---------KTGEPMNLTFRKTSEIELKSD-------ARGRLYYFHQRI 149 (416) T ss_pred ccCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEEcCceeEEEEC-------CCccEEEEEEEe Confidence 1222334566666664 578999988743 2 335677888999888875431 23555444332 Q ss_pred -cCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC--ceeecchHHHh Q lcl|NC_019404. 152 -TNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ--AVWKAKGLAEL 228 (418) Q Consensus 152 -~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~--~v~k~~~l~~~ 228 (418) +.+......++++.||||...+ ....+|.|++.. +.+.|.....+......++..... -++++++. T Consensus 150 ~~~~~~~~~~~~~~evihir~~~-------~d~~~G~s~i~~-~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~--- 218 (416) T protein:vir:45 150 DSNGNNIERNVKFEDMLDIKFYS-------LDGINGLSLLDT-LSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV--- 218 (416) T ss_pred cCCCceeEEEEccccEEEeccCC-------CCCccccCHHHH-HHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCC--- Confidence 3333334579999999996432 123579999975 778888888888888888877553 35566631 Q ss_pred hcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCcccccc Q lcl|NC_019404. 229 CDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSS 306 (418) Q Consensus 229 ~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~s 306 (418) +.+.+...+++++++.......+.+.+++..++.+|+.++.+... +-+...+....||++.|||..+| |...++.+. T Consensus 219 ~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~ 297 (416) T protein:vir:45 219 LDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKF-GIETANMSI 297 (416) T ss_pred CCCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCCccH Confidence 122222334445555443322233344555556789988876543 45667788899999999998765 655554321 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---cc--C--CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_019404. 307 SQNTALETFHKLIDRKRNAELLPILEFLIPFIV---NA--E--EWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAM 379 (418) Q Consensus 307 tge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~~--~--~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i 379 (418) + +...+|. ..|.|.+..+-..+- .. . .|+|.++.|...|.+++ ++++++++++|++ T Consensus 298 --~-~~~~~~~-------~~l~P~~~~ie~~ln~~l~~~~~~~~~~f~~~~l~~~D~~~~-------~~~~~~~~~~G~~ 360 (416) T protein:vir:45 298 --T-DANLDYL-------STLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQ-------AEIDKINIDSGKM 360 (416) T ss_pred --H-HHHHHHH-------HHHHHHHHHHHHHHhhhccccccCceEEEechhhhccCHHHH-------HHHHHHHHhCCCc Confidence 2 2222322 236777766644442 21 2 35555667777777665 6678889999999 Q ss_pred CHHHHHHHHHhhcCcCCCCh---------hhccccc-------------ccCCCccc Q lcl|NC_019404. 380 DIKEARDTLRTIAPEIKIGD---------NDIQTEE-------------SELITETE 414 (418) Q Consensus 380 ~~~e~r~~l~~~~~~~~~~~---------~~~~~~e-------------~~~~~e~e 414 (418) |++|+|+.+. ..|..+.+. -.++..+ -+.=+++| T Consensus 361 T~NE~R~~~g-l~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 361 NIDEIRQRDG-LAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred CHHHHHHHhC-CCCCCCCCcceEeecccccccccccccCcccccccccccCCCCCCC Confidence 9999999873 222111110 0000000 01112233 No 56 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=99.75 E-value=1.9e-18 Score=117.74 Aligned_cols=361 Identities=13% Similarity=0.070 Sum_probs=197.4 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH---HHHHHHH---- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE---PAFWSRW---- 73 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~---~~i~~~~---- 73 (418) -...+.+...+....+.....+ ...+.. -+.+++-+.++|+.+|+++-+-++.+..+... ..+...+ T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~----~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~~~~~~lL~~~P 86 (416) T protein:vir:81 13 QYNEDDLQMMVQTLPGFQGTKL--RQYKDI----EAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRP 86 (416) T ss_pred cCCCcchhHHHHHhccccccCc--cccchh----hhhcchHHHHHHHHHHHhhccCceEEecCccccccchHHHHHhccc Confidence 1111222222210000000000 011111 11234456679999999999988877432211 1111111 Q ss_pred HHhCchHHHHHHHHhc-cccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEe- Q lcl|NC_019404. 74 DDLEMTQNINDAWSWA-RLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRIT- 151 (418) Q Consensus 74 ~~l~~~~~~~~a~~~~-rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~- 151 (418) ..+-....|.+.+.+. .++|.|++++.- + ..|.+..|.+++++++++... ..|.+.+|... T Consensus 87 N~~~t~~~f~~~~~~~lll~Gna~~~i~r-~---------~~G~~~~L~~i~~~~v~v~~~-------~~g~~~~~~~~~ 149 (416) T protein:vir:81 87 NPMYNGYIFKLVVFVSALLTSHGYIEITR-D---------KTGEPMNLTFRKTSEIELKSD-------ARGRLYYFHQRI 149 (416) T ss_pred ccCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEEcCceeEEEEC-------CCccEEEEEEEe Confidence 1222334566666664 578999988743 2 335677888999888875431 23555444332 Q ss_pred -cCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC--ceeecchHHHh Q lcl|NC_019404. 152 -TNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ--AVWKAKGLAEL 228 (418) Q Consensus 152 -~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~--~v~k~~~l~~~ 228 (418) +.+......++++.||||...+ ....+|.|++.. +.+.|.....+......++..... -++++++. T Consensus 150 ~~~~~~~~~~~~~~evihir~~~-------~d~~~G~s~i~~-~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~--- 218 (416) T protein:vir:81 150 DSNGNNIERNVKFEDMLDIKFYS-------LDGINGLSLLDT-LSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV--- 218 (416) T ss_pred cCCCceeEEEEccccEEEeccCC-------CCCccccCHHHH-HHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCC--- Confidence 3333334579999999996432 123579999975 778888888888888888877553 35566631 Q ss_pred hcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCcccccc Q lcl|NC_019404. 229 CDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSS 306 (418) Q Consensus 229 ~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~s 306 (418) +.+.+...+++++++.......+.+.+++..++.+|+.++.+... +-+...+....||++.|||..+| |...++.+. T Consensus 219 ~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~ 297 (416) T protein:vir:81 219 LDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKF-GIETANMSI 297 (416) T ss_pred CCCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCCccH Confidence 122222334445555443322233344555556789988876543 45667788899999999998765 655554321 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---cc--C--CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_019404. 307 SQNTALETFHKLIDRKRNAELLPILEFLIPFIV---NA--E--EWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAM 379 (418) Q Consensus 307 tge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~~--~--~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i 379 (418) + +...+|. ..|.|.+..+-..+- .. . .|+|.++.|...|.+++ ++++++++++|++ T Consensus 298 --~-~~~~~~~-------~~l~P~~~~ie~~ln~~l~~~~~~~~~~f~~~~l~~~D~~~~-------~~~~~~~~~~G~~ 360 (416) T protein:vir:81 298 --T-DANLDYL-------STLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQ-------AEIDKINIDSGKM 360 (416) T ss_pred --H-HHHHHHH-------HHHHHHHHHHHHHHhhhccccccCceEEEechhhhccCHHHH-------HHHHHHHHhCCCc Confidence 2 2222322 236777766644442 21 2 35555667777777665 6678889999999 Q ss_pred CHHHHHHHHHhhcCcCCCCh---------hhccccc-------------ccCCCccc Q lcl|NC_019404. 380 DIKEARDTLRTIAPEIKIGD---------NDIQTEE-------------SELITETE 414 (418) Q Consensus 380 ~~~e~r~~l~~~~~~~~~~~---------~~~~~~e-------------~~~~~e~e 414 (418) |++|+|+.+. ..|..+.+. -.++..+ -+.=+++| T Consensus 361 T~NE~R~~~g-l~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 361 NIDEIRQRDG-LAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred CHHHHHHHhC-CCCCCCCCcceEeecccccccccccccCcccccccccccCCCCCCC Confidence 9999999873 222111110 0000000 01112233 No 57 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=99.75 E-value=3.8e-18 Score=116.07 Aligned_cols=362 Identities=14% Similarity=0.082 Sum_probs=202.6 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc--CcchHH-----HHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID--GIDDEP-----AFWSRW 73 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~~~d~~-----~i~~~~ 73 (418) --..+++...+.|+... .. ..+.++... .+++.+.+||+.+|+++-+-++.+. +.+... .+...+ T Consensus 15 ~~~~~~~~~~~~g~~~s--~~--~~~v~~~~a----l~~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL 86 (419) T protein:vir:80 15 QPGSGGWVSALLGSARS--EA--GQVVTPASA----LSLTVLQNCVTLLAESIAQLPVELYERSGDDRKPATDHPLYSIL 86 (419) T ss_pred CCCcchhhHHhhccccc--cc--CcccChHHh----hccHHHHHHHHHHHHhhccCceEEEEecCCCcccccccHHHHHH Confidence 11123333333332211 11 112233222 2477889999999999999888872 111111 122222 Q ss_pred H----HhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEE Q lcl|NC_019404. 74 D----DLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTY 148 (418) Q Consensus 74 ~----~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y 148 (418) . ...-...|.+.+.+ -.++|.|++++.- + ..|.+..|.+++++.+++.... .|.+ .| T Consensus 87 ~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r-~---------~~G~~~~L~~i~~~~v~i~~~~-------~~~~-~y 148 (419) T protein:vir:80 87 KYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDR-D---------QDGVIQGLYPLDNEAVTVMKGP-------DLKP-MY 148 (419) T ss_pred HhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEecCceEEEEECC-------CceE-EE Confidence 2 12233455555555 4668999988743 2 2356788999999988764321 1232 45 Q ss_pred EEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHH Q lcl|NC_019404. 149 RITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLA 226 (418) Q Consensus 149 ~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~ 226 (418) ++.+. ..++++.|+|+...+ ....+|.|++.. +.+.|.....+.......+.....+ ++++++.. T Consensus 149 ~~~~~-----~~~~~~~i~h~~~~~-------~d~~~G~s~i~~-~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~ 215 (419) T protein:vir:80 149 RVAGA-----DPLPQRLVHHVRWMS-------INGYTGLSPVLL-HANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDA 215 (419) T ss_pred EEcCc-----cccchhheEEecCCC-------CCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCC Confidence 55433 247788888886432 233679999975 7788888888888888888775544 55655322 Q ss_pred HhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCcccc Q lcl|NC_019404. 227 ELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVGGL 304 (418) Q Consensus 227 ~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl 304 (418) ....+.+...++.+.++.......+.+.+++...+.+|++++.+..+ +.+...+..+.||.+.|||..+| |...++- T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~ll-g~~~~~t 294 (419) T protein:vir:80 216 PALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMV-NELERAT 294 (419) T ss_pred CcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-cCCCCCC Confidence 11111222222333333322222223445555556788888876643 45677788999999999998766 5443333 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----cc---CCceEEe--CCCCCCCHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019404. 305 SSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV----NA---EEWSVEF--SPLDHESSKDKAEVLEKSVNSIAALIA 375 (418) Q Consensus 305 ~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~----~~---~~~~~~f--~pL~~~~eke~ae~~~~~a~a~~~~~~ 375 (418) .++.+.....||... |.|+++.+-..+- .. .++.++| ..|...|.+++ ++++.++++ T Consensus 295 ~~n~e~~~~~f~~~~-------l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~d~~~~-------~~~~~~~~~ 360 (419) T protein:vir:80 295 FSNIEHQSLQFVIYT-------LLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSR-------YAAYAVGRQ 360 (419) T ss_pred cccHHHHHHHHHHHH-------HHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHH-------HHHHHHHHh Confidence 445566666776653 7887776644432 11 2455555 46666666655 567778999 Q ss_pred CCCCCHHHHHHHHHhhcCcCCCChh----------hcccc-cccCC-------CccccccC Q lcl|NC_019404. 376 AGAMDIKEARDTLRTIAPEIKIGDN----------DIQTE-ESELI-------TETEVVIA 418 (418) Q Consensus 376 ~g~i~~~e~r~~l~~~~~~~~~~~~----------~~~~~-e~~~~-------~e~e~~~~ 418 (418) +|++|++|+|+.+. ..+..+ +|+ +.++. +.... +|-+.+.| T Consensus 361 ~G~~T~NE~R~~~g-~~p~~g-GD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 419 (419) T protein:vir:80 361 WGWLSINDIRRLEN-MPPVKG-GDIYLSPMNMVDASKPQPIPMGKTEPTKAALDEIGRILS 419 (419) T ss_pred CCCcCHHHHHHHhC-CCCCCC-cceeeeccccccccccccccCCCCCchhhhHHHHHhhcC Confidence 99999999998762 211111 111 00000 01111 12223333 No 58 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=99.75 E-value=3.7e-18 Score=116.16 Aligned_cols=380 Identities=12% Similarity=0.073 Sum_probs=201.2 Q ss_pred CccchhhHHHHhc------------CC-CCcc--ccCccccCC-HHHHHHHHHcCCccchhhhcchhhhccCCcccc--- Q lcl|NC_019404. 1 MVKTDSYANIFLG------------GS-DGSE--IYGSLQNQA-PTILASLYADNALVRRIIDTIPETALAAGFHID--- 61 (418) Q Consensus 1 ~~~~D~~~n~~~g------------~~-~~~~--~~~~~~~~~-~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--- 61 (418) +++.....+-|.. .+ .... ....+.... -.-..+....+..+..+|+..+++.-+-++.+- T Consensus 70 ~~kk~~i~~pfkkk~~~~~~d~f~~s~es~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsLPlklYrr~ 149 (945) T protein:vir:10 70 VLKKEKIIVPYNHQEPPFKFNLFEYSPESLMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSKELEIYKHI 149 (945) T ss_pred HHHhhcccccccccccchhhhhhhccCccceecccccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccCceEEEEec Confidence 3333333222210 00 0000 000010000 112334445678899999999999998888761 Q ss_pred --Ccch--------HHHHHHHHHH-------hCchHHHHHHH-HhccccceEEEEEeecCCCcccccccCCCceEEEEEe Q lcl|NC_019404. 62 --GIDD--------EPAFWSRWDD-------LEMTQNINDAW-SWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVY 123 (418) Q Consensus 62 --~~~d--------~~~i~~~~~~-------l~~~~~~~~a~-~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~ 123 (418) +..+ ...+..-+.+ ...|+.|.+.+ ..-.++|.+++++.- + ..|.+..+.++ T Consensus 150 edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiR-d---------~~G~ii~L~pL 219 (945) T protein:vir:10 150 EDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIR-D---------EQGNLVAITPV 219 (945) T ss_pred ccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEE Confidence 1111 0112222222 22344455554 456778999988743 2 33557788999 Q ss_pred eccccccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHH Q lcl|NC_019404. 124 DRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTN 203 (418) Q Consensus 124 ~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~ 203 (418) ++.++.+.... +-+.+..|....++ .....++++.+|++...+.++. ...++|.||+.. +.+.+..... T Consensus 220 dPs~Vti~~dd------DG~~~y~Yv~~idG-~~~~~v~a~DvIlhirn~s~DG---~~~GyGlSPIea-a~~aI~~alA 288 (945) T protein:vir:10 220 DGTTIKPILSE------DTGIVVGYVQEVDG-AIVAHFDKRDVVLFRQNLTPDV---YMYGYSLPPIEI-LYKVILSDIF 288 (945) T ss_pred CCcceEEEEcC------CCcEEEEEEEecCC-ceEEEecCCceEEEeccCCCCc---ccccCCchHHHH-HHHHHHHHHH Confidence 99888765321 11222234333332 2334678888776543332221 123469999975 7788988888 Q ss_pred HHHHHHHHHHHcCC---ceeecchHHHh------hcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC- Q lcl|NC_019404. 204 CERLATQLLRRKQQ---AVWKAKGLAEL------CDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG- 273 (418) Q Consensus 204 ~~~~~~~l~~~~~~---~v~k~~~l~~~------~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g- 273 (418) +....+..+.+.+. -++++++.... .-+.+..++.++.+....... +++..++..++.+|++++.+..+ T Consensus 289 aek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~-NnG~piVLdeGmef~pLs~s~~Da 367 (945) T protein:vir:10 289 IDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGD-YTQVPILSGGKFTWIDFKGKRRDM 367 (945) T ss_pred HHHHHHHHHHhCCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCc-ccccceecCCCceEEEccCChhHH Confidence 88888888766542 25565532110 001222334444444433322 23334445556788888876643 Q ss_pred -HHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---h-c---cCCce Q lcl|NC_019404. 274 -IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---V-N---AEEWS 345 (418) Q Consensus 274 -l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~-~---~~~~~ 345 (418) +.+..+....+||++.|||...| |...++-.|+.+.....|+.. -|.|.+..+-..| + . ..++. T Consensus 368 QfLEsrkfs~eeIArAFGVPP~lL-G~~e~st~SNiEqq~~~Fv~~-------tL~Pil~~IEqeLNrkLl~~~eg~~i~ 439 (945) T protein:vir:10 368 QFKELAEFVARKICAVYQVSPQDV-GILEGSNKATAEVMASLTKAK-------GLEPLMATISKGFDEVVSEFRNEKDIK 439 (945) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHc-ccCCCCCcchHHHHHHHHHHH-------HHHHHHHHHHHHHHHhccccccCceeE Confidence 45678888899999999998777 544333334556667777653 2445444432222 1 1 23688 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCC----------C--Chhh-------cc--- Q lcl|NC_019404. 346 VEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIK----------I--GDND-------IQ--- 403 (418) Q Consensus 346 ~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~----------~--~~~~-------~~--- 403 (418) |+|+.+.-.+.++ +++++++++++|++|++|+|+.+. ..|..+ . .++. .+ T Consensus 440 fdFd~ldl~D~ks-------raEal~kli~sGiLTiNEvRe~lG-LpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~ 511 (945) T protein:vir:10 440 LWFKEDDLEKERD-------WWNIIQGQLNTGFRSINEARMEKG-LEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQL 511 (945) T ss_pred EEecchhccCHHH-------HHHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcceeeeccccccccccccccccCCCCccc Confidence 9999888777654 567888899999999999998762 111100 0 0000 00 Q ss_pred ----cccc--cC--CCcc-----ccccC Q lcl|NC_019404. 404 ----TEES--EL--ITET-----EVVIA 418 (418) Q Consensus 404 ----~~e~--~~--~~e~-----e~~~~ 418 (418) .+++ +. .+|+ |..-+ T Consensus 512 aq~~~dqp~~kGGe~dEns~~psE~kda 539 (945) T protein:vir:10 512 AQAMADQPSQQGGGVDENSSVPSEQKNA 539 (945) T ss_pred ccCCCCCCCCCCCCCCCCCCCCCcccch Confidence 0000 00 0000 11111 No 59 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=99.75 E-value=6.8e-18 Score=114.69 Aligned_cols=382 Identities=11% Similarity=0.121 Sum_probs=194.3 Q ss_pred CccchhhHHHHh-------------cCCCCc-------ccc---CccccC-------CHHHHHHHHHcCCccchhhhcch Q lcl|NC_019404. 1 MVKTDSYANIFL-------------GGSDGS-------EIY---GSLQNQ-------APTILASLYADNALVRRIIDTIP 50 (418) Q Consensus 1 ~~~~D~~~n~~~-------------g~~~~~-------~~~---~~~~~~-------~~~~l~~~Y~~~~~~r~iVd~~a 50 (418) +--.|++.+.|. .+...+ ..+ ++.... +.......|..|+++++||++.+ T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia 104 (576) T protein:vir:96 25 VPIDDGLQANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNNPILNAIILTRS 104 (576) T ss_pred hhcccChhHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhhcCHHHHHHHHHHH Confidence 111333333221 011000 001 111111 22345567888999999999999 Q ss_pred hhhcc-----------CCccccC--c------chHHH---HHHHHHHhCc--------hHHHHHHHH-hccccceEEEEE Q lcl|NC_019404. 51 ETALA-----------AGFHIDG--I------DDEPA---FWSRWDDLEM--------TQNINDAWS-WARLFGGAAIVA 99 (418) Q Consensus 51 ~d~~r-----------~~~~i~~--~------~d~~~---i~~~~~~l~~--------~~~~~~a~~-~~rl~G~~~i~i 99 (418) +.+.+ -++.|.- . .+... +...+..+.. +..|.+.+. ...++|.|++++ T Consensus 105 ~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i 184 (576) T protein:vir:96 105 NQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEK 184 (576) T ss_pred HHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEEEE Confidence 87754 2333311 0 11111 1222222211 233544444 467799999887 Q ss_pred eec-CCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhh Q lcl|NC_019404. 100 IVK-DNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAM 178 (418) Q Consensus 100 ~~~-d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~ 178 (418) ... ++ .|.+..|.++++.+|++..... ...|..+..|.....+ .....+.++.+||+...+.++. T Consensus 185 ~~~rd~---------~g~~~~L~pl~p~~V~v~~~~d---g~~~~~~~~~~~~~~~-~~~~~~~~~dii~~~~~~~~d~- 250 (576) T protein:vir:96 185 VFNKKN---------ATTMDKFIAVDPSTIFYATDKN---GKIIKGGKRFVQVINK-KVVASFTSREMAMGIRNPRTEL- 250 (576) T ss_pred EEecCC---------CCceEEEEEeCCceeEEEECCC---CceeeeeeEEEEecCC-ceEEEecccceEEEeecCCCCc- Confidence 542 22 2557788899988887654211 1112223333322222 2334677888887765554331 Q ss_pred hhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcce-e Q lcl|NC_019404. 179 RRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQA-I 255 (418) Q Consensus 179 ~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~-~ 255 (418) ...++|.||+.. +.+.|.....+.......+...... ++++++-..+ +.+...++++++........+.+. . T Consensus 251 --~~~~~G~Spi~~-a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~l--s~e~~~~lr~~~~~~~~G~~nag~~p 325 (576) T protein:vir:96 251 --SSSGYGLSEVEI-AMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQ--SQRALENFKREWKSSFSGINGSWQVP 325 (576) T ss_pred --ccCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCC--CHHHHHHHHHHHHHHhccccccccce Confidence 224579999975 7789999999988888888776543 4555532111 222333444444433222222232 4 Q ss_pred EEEcCCCceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeeeeccCccc----------cc-cchhHHHHHHHHHHHHH Q lcl|NC_019404. 256 GIDAESEEYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIILKNKNVGG----------LS-SSQNTALETFHKLIDRK 322 (418) Q Consensus 256 ~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L~G~s~~g----------l~-stge~d~~~y~~~I~~~ 322 (418) ++..++.+|+.++.+.. .+-+...+....||.+.+||..+| |....+ ++ ++-+.....|+.. T Consensus 326 ~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~---- 400 (576) T protein:vir:96 326 VVMADDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEI-GFPNRGGATGGKGGNTLNEADPGKKQQQSQNK---- 400 (576) T ss_pred eecCCCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHc-cccccccccccccccccccccHHHHHHHHHHH---- Confidence 55555678888877654 456677888999999999999766 654322 11 2234444444443 Q ss_pred HHHHHHHHHHHHHHHh----hcc--CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCC Q lcl|NC_019404. 323 RNAELLPILEFLIPFI----VNA--EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIK 396 (418) Q Consensus 323 Qe~~l~p~l~~l~~~i----~~~--~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~ 396 (418) .|.|++..+-..| +.. .++.++|. ..+.+.+++.. +....+.+|++|++|+|+.+. ..+..+ T Consensus 401 ---tL~P~~~~ie~~ln~~Ll~~~~~~~~~~f~---r~d~~~~~e~~-----~~~~~~~~G~lT~NE~R~~~g-l~pieg 468 (576) T protein:vir:96 401 ---GLQPLLRFIEDLINTHIISEYSDKYVFQFV---GGDTKSELDKI-----KILQEEVKTYKTVNEARKEKG-LKPIEG 468 (576) T ss_pred ---HHHHHHHHHHHHHHhhhchhccCceEEEec---cCCHHHHHHHH-----HHHHHHhcCccCHHHHHHHhC-CCCCCC Confidence 3777776664443 222 34556654 34444443321 222345689999999998762 211111 Q ss_pred CC---------hhhccc--ccccC---------------------------CCccccccC Q lcl|NC_019404. 397 IG---------DNDIQT--EESEL---------------------------ITETEVVIA 418 (418) Q Consensus 397 ~~---------~~~~~~--~e~~~---------------------------~~e~e~~~~ 418 (418) -+ .-+... ...+. ++..++..+ T Consensus 469 GD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~ 528 (576) T protein:vir:96 469 GDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGRES 528 (576) T ss_pred cceeccccccccccccccCCCCCCccccccccccccccCCCCCCCCCCCCCCCccccccc Confidence 00 000000 00000 001111111 No 60 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=99.75 E-value=1.8e-17 Score=112.44 Aligned_cols=382 Identities=12% Similarity=0.137 Sum_probs=193.7 Q ss_pred Cccchhh-------------HHHHhc---CCC-------------CccccCccccC----CHHHHHHHHHcCCccchhhh Q lcl|NC_019404. 1 MVKTDSY-------------ANIFLG---GSD-------------GSEIYGSLQNQ----APTILASLYADNALVRRIID 47 (418) Q Consensus 1 ~~~~D~~-------------~n~~~g---~~~-------------~~~~~~~~~~~----~~~~l~~~Y~~~~~~r~iVd 47 (418) |---|++ .|.+.. +.. ....+..+..+ +...+.+.|..|+++++||+ T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~ 102 (563) T protein:vir:99 23 VPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIIL 102 (563) T ss_pred eeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHH Confidence 1001111 112111 000 01122222222 33567788999999999999 Q ss_pred cchhhhccC-----------Ccccc-------Ccc-hHH---HHHHHHHHhC---------chHHHHHHHHhccccceEE Q lcl|NC_019404. 48 TIPETALAA-----------GFHID-------GID-DEP---AFWSRWDDLE---------MTQNINDAWSWARLFGGAA 96 (418) Q Consensus 48 ~~a~d~~r~-----------~~~i~-------~~~-d~~---~i~~~~~~l~---------~~~~~~~a~~~~rl~G~~~ 96 (418) +.++.+.+- ++.+. +.+ +.. .++..+..++ .++-+...+....++|.|+ T Consensus 103 t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~ 182 (563) T protein:vir:99 103 TRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVN 182 (563) T ss_pred HHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeE Confidence 999876641 23331 111 111 1222222211 1233344455567899999 Q ss_pred EEEee-cCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccch Q lcl|NC_019404. 97 IVAIV-KDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVP 175 (418) Q Consensus 97 i~i~~-~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp 175 (418) +++.+ ++ ..|.+..|.++++..|++...... .-|.....|.+...+. ....+.++.+||+...+.+ T Consensus 183 ~~~~~~rd---------~~G~~~~L~pl~p~~V~v~~~~~g---~~~~~~~~y~~~~~g~-~~~~~~~~evI~~~~~~~~ 249 (563) T protein:vir:99 183 FEKVFNKN---------NKTKLEKFIAVDPSTIFYATDKKG---KIIKGGKRFVQVVDKR-VVASFTSRELAMGIRNPRT 249 (563) T ss_pred EEEEEEec---------CCCceEEEEEeCCceeEEEECCCC---ceeccceeEEEEeCCc-eeEEecCcceEEEeccCCC Confidence 87654 23 235678899999988877532211 1122223333333222 2235667777766544432 Q ss_pred hhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCce--eecchHHHhhcCcchHHHHHHHHHHHHHhcCC-c Q lcl|NC_019404. 176 NAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAV--WKAKGLAELCDDSEGFGAARLRLAQVDNNSGV-G 252 (418) Q Consensus 176 ~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v--~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~-~ 252 (418) + ....++|.||+.. +.+.|.....+....+..+.....+- +++++-.. + +.+...++++.+........+ + T Consensus 250 d---~~~~~~G~Spi~~-a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~-l-s~e~~~~~~~~~~~~~~G~~nag 323 (563) T protein:vir:99 250 E---LSSSGYGLSEVEI-AMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQ-Q-SQHALENFKREWKSSLSGINGSW 323 (563) T ss_pred C---cccCcccchHHHH-HHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCC-C-CHHHHHHHHHHHHHHhccccccc Confidence 2 2234689999975 78999999999999999888765443 55543111 1 222333444444432222112 2 Q ss_pred ceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccC-cccccc----------chhHHHHHHHHHH Q lcl|NC_019404. 253 QAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKN-VGGLSS----------SQNTALETFHKLI 319 (418) Q Consensus 253 ~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s-~~gl~s----------tge~d~~~y~~~I 319 (418) ...++..++.+|+.++.+... +-+...+....||.+.+||..+| |.. .++..+ +-+.....|+ T Consensus 324 k~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G~~~~~~~~~~~~~ss~~~sn~e~~~~~f~--- 399 (563) T protein:vir:99 324 QIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEI-GFPNRGGATGSKGGSTLNEADPGKKQQQSQ--- 399 (563) T ss_pred cceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-cccccccccccccccchhhccHHHHHHHHH--- Confidence 334555666788888876654 45777889999999999998766 544 222211 1122222333 Q ss_pred HHHHHHHHHHHHHHHHHHh----hcc--CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcC Q lcl|NC_019404. 320 DRKRNAELLPILEFLIPFI----VNA--EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAP 393 (418) Q Consensus 320 ~~~Qe~~l~p~l~~l~~~i----~~~--~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~ 393 (418) +..|.|++..+-..| +.. ..+.++|. ..+.+.+++.. ....++.+|++|++|+|+.+. ..+ T Consensus 400 ----~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~f~---r~D~~~~~e~~-----~~~~~~~~G~lT~NE~R~~~g-l~P 466 (563) T protein:vir:99 400 ----NKGLQPLLRFIEDLVNRHIISEYGDKYTFQFV---GGDTKSATDKL-----NILKLETQIFKTVNEAREEQG-KKP 466 (563) T ss_pred ----HHHHHHHHHHHHHHHHhhhchhcccccEEEec---cCCHHHHHHHH-----HHHHHhcCCccCHHHHHHHhC-CCC Confidence 334778776665444 222 24455553 34555544432 223468899999999998763 211 Q ss_pred cCCCC-----------h-------hh----------c------ccccccCCCccccccC Q lcl|NC_019404. 394 EIKIG-----------D-------ND----------I------QTEESELITETEVVIA 418 (418) Q Consensus 394 ~~~~~-----------~-------~~----------~------~~~e~~~~~e~e~~~~ 418 (418) ..+-+ . .+ + +..+++.++.++.--+ T Consensus 467 i~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (563) T protein:vir:99 467 IEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSND 525 (563) T ss_pred CCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCC Confidence 11100 0 00 0 0000001111111011 No 61 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=99.75 E-value=1.8e-17 Score=112.44 Aligned_cols=382 Identities=12% Similarity=0.137 Sum_probs=193.7 Q ss_pred Cccchhh-------------HHHHhc---CCC-------------CccccCccccC----CHHHHHHHHHcCCccchhhh Q lcl|NC_019404. 1 MVKTDSY-------------ANIFLG---GSD-------------GSEIYGSLQNQ----APTILASLYADNALVRRIID 47 (418) Q Consensus 1 ~~~~D~~-------------~n~~~g---~~~-------------~~~~~~~~~~~----~~~~l~~~Y~~~~~~r~iVd 47 (418) |---|++ .|.+.. +.. ....+..+..+ +...+.+.|..|+++++||+ T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~ 102 (563) T protein:vir:95 23 VPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIIL 102 (563) T ss_pred eeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHH Confidence 1001111 112111 000 01122222222 33567788999999999999 Q ss_pred cchhhhccC-----------Ccccc-------Ccc-hHH---HHHHHHHHhC---------chHHHHHHHHhccccceEE Q lcl|NC_019404. 48 TIPETALAA-----------GFHID-------GID-DEP---AFWSRWDDLE---------MTQNINDAWSWARLFGGAA 96 (418) Q Consensus 48 ~~a~d~~r~-----------~~~i~-------~~~-d~~---~i~~~~~~l~---------~~~~~~~a~~~~rl~G~~~ 96 (418) +.++.+.+- ++.+. +.+ +.. .++..+..++ .++-+...+....++|.|+ T Consensus 103 t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~ 182 (563) T protein:vir:95 103 TRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVN 182 (563) T ss_pred HHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeE Confidence 999876641 23331 111 111 1222222211 1233344455567899999 Q ss_pred EEEee-cCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccch Q lcl|NC_019404. 97 IVAIV-KDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVP 175 (418) Q Consensus 97 i~i~~-~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp 175 (418) +++.+ ++ ..|.+..|.++++..|++...... .-|.....|.+...+. ....+.++.+||+...+.+ T Consensus 183 ~~~~~~rd---------~~G~~~~L~pl~p~~V~v~~~~~g---~~~~~~~~y~~~~~g~-~~~~~~~~evI~~~~~~~~ 249 (563) T protein:vir:95 183 FEKVFNKN---------NKTKLEKFIAVDPSTIFYATDKKG---KIIKGGKRFVQVVDKR-VVASFTSRELAMGIRNPRT 249 (563) T ss_pred EEEEEEec---------CCCceEEEEEeCCceeEEEECCCC---ceeccceeEEEEeCCc-eeEEecCcceEEEeccCCC Confidence 87654 23 235678899999988877532211 1122223333333222 2235667777766544432 Q ss_pred hhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCce--eecchHHHhhcCcchHHHHHHHHHHHHHhcCC-c Q lcl|NC_019404. 176 NAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAV--WKAKGLAELCDDSEGFGAARLRLAQVDNNSGV-G 252 (418) Q Consensus 176 ~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v--~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~-~ 252 (418) + ....++|.||+.. +.+.|.....+....+..+.....+- +++++-.. + +.+...++++.+........+ + T Consensus 250 d---~~~~~~G~Spi~~-a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~-l-s~e~~~~~~~~~~~~~~G~~nag 323 (563) T protein:vir:95 250 E---LSSSGYGLSEVEI-AMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQ-Q-SQHALENFKREWKSSLSGINGSW 323 (563) T ss_pred C---cccCcccchHHHH-HHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCC-C-CHHHHHHHHHHHHHHhccccccc Confidence 2 2234689999975 78999999999999999888765443 55543111 1 222333444444432222112 2 Q ss_pred ceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccC-cccccc----------chhHHHHHHHHHH Q lcl|NC_019404. 253 QAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKN-VGGLSS----------SQNTALETFHKLI 319 (418) Q Consensus 253 ~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s-~~gl~s----------tge~d~~~y~~~I 319 (418) ...++..++.+|+.++.+... +-+...+....||.+.+||..+| |.. .++..+ +-+.....|+ T Consensus 324 k~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G~~~~~~~~~~~~~ss~~~sn~e~~~~~f~--- 399 (563) T protein:vir:95 324 QIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEI-GFPNRGGATGSKGGSTLNEADPGKKQQQSQ--- 399 (563) T ss_pred cceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-cccccccccccccccchhhccHHHHHHHHH--- Confidence 334555666788888876654 45777889999999999998766 544 222211 1122222333 Q ss_pred HHHHHHHHHHHHHHHHHHh----hcc--CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcC Q lcl|NC_019404. 320 DRKRNAELLPILEFLIPFI----VNA--EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAP 393 (418) Q Consensus 320 ~~~Qe~~l~p~l~~l~~~i----~~~--~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~ 393 (418) +..|.|++..+-..| +.. ..+.++|. ..+.+.+++.. ....++.+|++|++|+|+.+. ..+ T Consensus 400 ----~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~f~---r~D~~~~~e~~-----~~~~~~~~G~lT~NE~R~~~g-l~P 466 (563) T protein:vir:95 400 ----NKGLQPLLRFIEDLVNRHIISEYGDKYTFQFV---GGDTKSATDKL-----NILKLETQIFKTVNEAREEQG-KKP 466 (563) T ss_pred ----HHHHHHHHHHHHHHHHhhhchhcccccEEEec---cCCHHHHHHHH-----HHHHHhcCCccCHHHHHHHhC-CCC Confidence 334778776665444 222 24455553 34555544432 223468899999999998763 211 Q ss_pred cCCCC-----------h-------hh----------c------ccccccCCCccccccC Q lcl|NC_019404. 394 EIKIG-----------D-------ND----------I------QTEESELITETEVVIA 418 (418) Q Consensus 394 ~~~~~-----------~-------~~----------~------~~~e~~~~~e~e~~~~ 418 (418) ..+-+ . .+ + +..+++.++.++.--+ T Consensus 467 i~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (563) T protein:vir:95 467 IEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSND 525 (563) T ss_pred CCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCC Confidence 11100 0 00 0 0000001111111011 No 62 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=99.74 E-value=2.1e-18 Score=117.48 Aligned_cols=351 Identities=14% Similarity=0.124 Sum_probs=193.6 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc--CcchHHH--------HH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID--GIDDEPA--------FW 70 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~~~d~~~--------i~ 70 (418) ....+.+...+.++.. ......+. ..+..++.+++||+.+|+++-+-.+.+. .++.... +. T Consensus 16 ~~~~~~~~~~~~~~~~-----~~~~~~~~----~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~ 86 (406) T protein:vir:95 16 IRADTGYVGLFMSGED-----VSFLVPGY----VRLSDNPEVRMAVHKIADLISSMTIYLMQNTEDGDIRIRNELSRKID 86 (406) T ss_pred ccccchhhhhhccCcc-----cCccccCH----HHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeecchHHHHHh Confidence 1111111111111100 00111111 2234689999999999999999988872 2111111 11 Q ss_pred HHHHHhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEE Q lcl|NC_019404. 71 SRWDDLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYR 149 (418) Q Consensus 71 ~~~~~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~ 149 (418) .+-..+-.+..|.+.+.+ ..++|.|++++.+.. +..|.+..+.++++..+.+..... .|+ T Consensus 87 ~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~--------~~~g~~~~l~~i~~~~v~~~~~~~-----------~~~ 147 (406) T protein:vir:95 87 ITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKY--------TADGLIDELVPLTPSKVNFLDTPD-----------GYQ 147 (406) T ss_pred hccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEE--------CCCCcEEEEEEEcCceeEEEEcCC-----------eEE Confidence 111122234455555554 456766655543321 134567788899888887643221 144 Q ss_pred EecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHH Q lcl|NC_019404. 150 ITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAE 227 (418) Q Consensus 150 i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~ 227 (418) +..++ ..+.++.||||.-.+. +...++|.|++.. +.+.+.....+.......+...... +++++.. T Consensus 148 ~~~~~----~~~~~~evih~~~~~~-----~~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~-- 215 (406) T protein:vir:95 148 VLYGG----QTFNYDEVLHFIYNPD-----PERPYIGRGYRVV-LKDIADNLKQATATKKSFMSGKYMPSLIVKVDAA-- 215 (406) T ss_pred EEecc----EEEchhHEEEeeccCC-----CCCCccccCHHHH-HHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-- Confidence 43322 4578899999963221 2234579999975 7889999999999999988776554 5565531 Q ss_pred hhcCcchHHHHHHHHHHHHHh-cCCcceeEEEcCCCceeEee-cccC--CHHHHHHHHHHHHhhhhcCCeeeeeccCccc Q lcl|NC_019404. 228 LCDDSEGFGAARLRLAQVDNN-SGVGQAIGIDAESEEYSVLN-SDIG--GIDAFLDKKFDRIVALSGIHEIILKNKNVGG 303 (418) Q Consensus 228 ~~~~~~~~~~~~~r~~~~~~~-~~~~~~~~~d~~~e~~~~~~-~~~~--gl~~~~~~~~~~iaaas~IP~t~L~G~s~~g 303 (418) ++ ++...+..+++...... .+.++.+++..+.++++++. .+.. .+.+..+...+.||.+.|||..+| |.. T Consensus 216 -l~-~e~~~~~~~~~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~l-g~~--- 289 (406) T protein:vir:95 216 -TA-ELSSEEGRNAVFKKYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLL-GIG--- 289 (406) T ss_pred -CC-HHHHHHHHHHHHHHhccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCC--- Confidence 21 22333344444433322 22344555655555665432 3433 455777889999999999998766 432 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hccCCce--EEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019404. 304 LSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VNAEEWS--VEFSPLDHESSKDKAEVLEKSVNSIAALIAAG 377 (418) Q Consensus 304 l~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~~~~~~--~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g 377 (418) +..+....+||. .-|.|+++.+-..| +...++. |+++.|...|.+++ ++++..++++| T Consensus 290 --~~~~~~~~~~~~-------~~l~P~~~~ie~~l~~~l~~~~~~~~~fd~~~l~~~d~~~~-------~~~~~~l~~~G 353 (406) T protein:vir:95 290 --EFNRDEYNNFIN-------STILPIAKGIEQELTRKLLISPDLYFKFNPRSLYAYDLKEL-------AEVGSNMYVRG 353 (406) T ss_pred --CchHHHHHHHHH-------HHHHHHHHHHHHHHHHhcCCCCCcEEEeechhhhcCCHHHH-------HHHHHHHHhCC Confidence 112344555654 34888887776554 2234554 45556666666654 67788899999 Q ss_pred CCCHHHHHHHHHhhcCcCC-------CC---hhh------ccc-ccccCCCccc Q lcl|NC_019404. 378 AMDIKEARDTLRTIAPEIK-------IG---DND------IQT-EESELITETE 414 (418) Q Consensus 378 ~i~~~e~r~~l~~~~~~~~-------~~---~~~------~~~-~e~~~~~e~e 414 (418) +++++|+|+.+.. .+..+ .. .+. .+. +.+...+++| T Consensus 354 ~~t~NE~R~~~gl-~p~~~gd~~~~~~n~~~~~~~~~~~~~k~g~~~~~~~~~~ 406 (406) T protein:vir:95 354 IMEGNEVRDWLGL-SPKEGLSELVILENYIPLDKIGDQSKLKGGDNSGADGQTD 406 (406) T ss_pred CcCHHHHHHHhCC-CCCCCcceeeeccCccchhhcccccccCCCCCCCCCCCCC Confidence 9999999997732 11111 00 011 111 1122233334 No 63 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=99.74 E-value=5.1e-18 Score=115.39 Aligned_cols=367 Identities=13% Similarity=0.058 Sum_probs=206.5 Q ss_pred Cc------------cchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc--C-cch Q lcl|NC_019404. 1 MV------------KTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID--G-IDD 65 (418) Q Consensus 1 ~~------------~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~-~~d 65 (418) |. ...++-+.+.|+... +........+... ..+++.+++||+.+|++.-+-++.+. . +.. T Consensus 1 m~~~~~~~~~~~~~s~~~~w~~~~~~~~~-~~~~~g~~vt~~~----al~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~ 75 (421) T protein:vir:10 1 MFIPQMFEGKKRSVSGGGFWEAMLGGVRS-SHSKAGVMITPET----ALALSAVRACVTLLAESVAQLPVELYRRDKNGG 75 (421) T ss_pred CCCcchhcccccccCcchhhHHHhhhhcc-CcccCCceechHH----hhccHHHHHHHHHHHHhhccCceEEEEEcCCCc Confidence 21 112222222221111 1111112223332 34678899999999999999888872 1 111 Q ss_pred HH-----HHHHHH----HHhCchHHHHHH-HHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccc Q lcl|NC_019404. 66 EP-----AFWSRW----DDLEMTQNINDA-WSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREE 135 (418) Q Consensus 66 ~~-----~i~~~~----~~l~~~~~~~~a-~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~ 135 (418) .. .+...+ ...--...|.+. +.+..++|.|++++.- + ..|.+..+.++++.++++... T Consensus 76 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r-~---------~~G~~~~L~~l~~~~v~v~~~-- 143 (421) T protein:vir:10 76 RQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDR-D---------GKGYPKELIPINPKKVIVLKG-- 143 (421) T ss_pred eeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEE-c---------CCCcEEEEEEecCceEEEEEC-- Confidence 11 112222 112223445444 4456678999988753 2 235577889999888876432 Q ss_pred cccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019404. 136 NPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRK 215 (418) Q Consensus 136 dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~ 215 (418) ..|.+ +|.+...+ ..+.++.|||+.+... ...+|.||+.. +.+.+.....+......++... T Consensus 144 -----~~g~~-~y~~~~~g----~~~~~~eiih~~~~~~-------d~~~G~spi~~-~~~~i~~~~~~~~~~~~~f~ng 205 (421) T protein:vir:10 144 -----PDGMP-YYEIPEIG----ETLPMRMMHHVKVFSL-------DGYIGSSPIQT-NADVLGLNLAVEEHASAVFRRG 205 (421) T ss_pred -----CCceE-EEEEcCCC----cEEchhhEEEecCcCC-------CCcccccHHHH-HHHHHHHHHHHHHHHHHHHhcC Confidence 12443 45554332 4688899999965431 24579999975 7788988888888888888775 Q ss_pred CCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHHHHHHHHHHHHhhhhcC Q lcl|NC_019404. 216 QQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKKFDRIVALSGI 291 (418) Q Consensus 216 ~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~iaaas~I 291 (418) ... ++++++-..-..+.+...+..+++........+.+.+++..++.+|++++.+.. .+.+..++..+.||.+.+| T Consensus 206 ~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgV 285 (421) T protein:vir:10 206 ATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKI 285 (421) T ss_pred CCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCC Confidence 443 566653221111222233344444443332223344555555678888887664 3455667889999999999 Q ss_pred CeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---cc----CC--ceEEeCCCCCCCHHHHHHH Q lcl|NC_019404. 292 HEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---NA----EE--WSVEFSPLDHESSKDKAEV 362 (418) Q Consensus 292 P~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~~----~~--~~~~f~pL~~~~eke~ae~ 362 (418) |..+| |....+-.++-|.....||.. -|.|++..+-..|- .. .+ ++|+...|...|.+++ T Consensus 286 Pp~~l-g~~~~~t~sn~e~~~~~f~~~-------tl~P~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~d~~~~--- 354 (421) T protein:vir:10 286 PPHMV-QMLAKATNNNIEHQGLQFVMY-------TLLAWLKRHEGALQRDLLLPSERRDLYIEFNVSGLLRGDQKSR--- 354 (421) T ss_pred CHHHc-CCCcCCccccHHHHHHHHHHH-------HHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHH--- Confidence 98766 555444345556666677664 37887776644442 11 23 4455557777777765 Q ss_pred HHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCC-------C---hhhccccc---ccCCCccccccC Q lcl|NC_019404. 363 LEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKI-------G---DNDIQTEE---SELITETEVVIA 418 (418) Q Consensus 363 ~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~-------~---~~~~~~~e---~~~~~e~e~~~~ 418 (418) ++++.+++++|++|++|+|+.+.. .+..+- . .++....+ ....+.+++.+. T Consensus 355 ----~~~~~~~~~~G~~T~NE~R~~~gl-~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~e~d~~~ 418 (421) T protein:vir:10 355 ----YESYALGRQWGWLSVNDIRRMENL-PPIAGGDKYLTPLNMVDSAQIIPGDKKPTAQQMAEIDTIL 418 (421) T ss_pred ----HHHHHHHHhCCCcCHHHHHHHhCC-CCCCCcceeeeccccccccccccCCCCcccccCccccccc Confidence 567888999999999999987632 111110 0 11111111 111111233333 No 64 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=99.74 E-value=5.4e-18 Score=115.26 Aligned_cols=370 Identities=14% Similarity=0.124 Sum_probs=198.6 Q ss_pred hhhHHHHhcCCCCcc-------cc----------Ccc----ccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC- Q lcl|NC_019404. 5 DSYANIFLGGSDGSE-------IY----------GSL----QNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG- 62 (418) Q Consensus 5 D~~~n~~~g~~~~~~-------~~----------~~~----~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~- 62 (418) =|+.+.+.|...... .+ +.. ...+.. .+.+++.+.++|+.+++++-+-++.+.- T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~----~al~~~~v~~~i~~ia~~iA~lp~~~~~~ 76 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPH----DALQVSAVFASVRLLSETIATLPLSTYSK 76 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechH----HhhccHHHHHHHHHHHHhHhhCceEEEEe Confidence 021111112111100 00 000 011111 2345788899999999999998888731 Q ss_pred -cchHHH-----HHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccc Q lcl|NC_019404. 63 -IDDEPA-----FWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQN 132 (418) Q Consensus 63 -~~d~~~-----i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~ 132 (418) ++.... +..-+.+ +...+-++..+.+..++|.|++++.- + .|.+..+.++++.++++.. T Consensus 77 ~~~~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~-~----------~g~~~~l~~l~p~~v~v~~ 145 (457) T protein:vir:62 77 RGGTRKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRW-A----------GPNIAGLDVLDPTKIHVHM 145 (457) T ss_pred cCCccccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEe-C----------CCcEEEEEEEcCcceEEEE Confidence 111111 1111111 22233444444445678999988732 2 2456778888888887654 Q ss_pred ccccccccccCcceEEEEecCCcc-cccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 133 REENPRNARFGKPLTYRITTNESD-MFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQL 211 (418) Q Consensus 133 ~~~dp~s~~yg~p~~y~i~~~~~~-~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l 211 (418) ...+..... ....|.+...+.. ....++++.||||.+.. +....+|.|++.. +.+.|.....+......+ T Consensus 146 ~~~~~~~~~--~~~~y~~~~~g~~~~~~~~~~~eiih~r~~~------~~~~~~G~sp~~~-~~~~i~~~~~~~~~~~~~ 216 (457) T protein:vir:62 146 VMVDGLRRK--VFEAYDIDADGNEVLLGWFTPRDVLHIPGMM------LPGDFVGCSPISY-ARESIGLALAAQKYGAHF 216 (457) T ss_pred eccCCccce--eEEEEEEccCCceeEEEeeCccceEEecCCC------CCCceecccHHHH-HHHHHHHHHHHHHHHHHH Confidence 332222111 1123555443322 22357899999996432 2334679999975 778888888888888888 Q ss_pred HHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhh Q lcl|NC_019404. 212 LRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVA 287 (418) Q Consensus 212 ~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaa 287 (418) ++....+ ++++++ . + +++...++.+++........+.+.+++...+.+|+.++.+..+ +-+...+....||. T Consensus 217 f~ng~~p~gil~~~~--~-l-s~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 292 (457) T protein:vir:62 217 FRNGAMPGAVVEVPG--T-M-SEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIAR 292 (457) T ss_pred HhccCCcceEEEcCC--C-C-CHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHH Confidence 8775543 466653 1 2 2233344444444433222223444555556788888877654 45667788899999 Q ss_pred hhcCCeeeeeccCccccc--cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---cc----CC--ceEEeCCCCCCCH Q lcl|NC_019404. 288 LSGIHEIILKNKNVGGLS--SSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---NA----EE--WSVEFSPLDHESS 356 (418) Q Consensus 288 as~IP~t~L~G~s~~gl~--stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~~----~~--~~~~f~pL~~~~e 356 (418) +.+||..+| |....+-. |.-+.....|+.. .|.|+++.+-..+- .. .+ ++|.+..|...|. T Consensus 293 ~fgVPp~~l-g~~~~~~~~~sn~eq~~~~f~~~-------~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~d~ 364 (457) T protein:vir:62 293 IFGVPPHLI-SDATNSTSWGSGLAEQNIAFTMF-------SLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAP 364 (457) T ss_pred HhCCCHHHc-CCCCCcccccchHHHHHHHHHHH-------HHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCH Confidence 999998755 66544322 2234455556554 37787766644432 21 23 3455558877777 Q ss_pred HHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCC-Ch-----------hhccccc--------------ccCC Q lcl|NC_019404. 357 KDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKI-GD-----------NDIQTEE--------------SELI 410 (418) Q Consensus 357 ke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~-~~-----------~~~~~~e--------------~~~~ 410 (418) +++ ++++.+++++|++|++|+|+.+. ..+-.+. .| ++.++.+ +... T Consensus 365 ~~r-------~~~~~~~~~~G~~T~NE~R~~~g-l~pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (457) T protein:vir:62 365 KER-------MELWSLGLQNGIYSIDEVRAAED-MTPLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPADD 436 (457) T ss_pred HHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCcceeeeccccccccccccccccCCCccCCCCccCCCCC Confidence 765 56677899999999999998763 1111110 00 0000000 0000 Q ss_pred C----------cc---ccccC Q lcl|NC_019404. 411 T----------ET---EVVIA 418 (418) Q Consensus 411 ~----------e~---e~~~~ 418 (418) . ++ .+.-| T Consensus 437 ~~~~~~~~~~d~~~~~~~~~~ 457 (457) T protein:vir:62 437 EEPDNAEGDPDEGETEDDDDA 457 (457) T ss_pred CCCCCCCCCCccccccccccC Confidence 0 00 01111 No 65 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=99.74 E-value=7.7e-18 Score=114.39 Aligned_cols=365 Identities=11% Similarity=0.069 Sum_probs=196.9 Q ss_pred ccchhh----HHHHhcC---CCCccccCcc--ccCCHH-HHHHHHHcCCccchhhhcchhhhccCCccccC--cchHHHH Q lcl|NC_019404. 2 VKTDSY----ANIFLGG---SDGSEIYGSL--QNQAPT-ILASLYADNALVRRIIDTIPETALAAGFHIDG--IDDEPAF 69 (418) Q Consensus 2 ~~~D~~----~n~~~g~---~~~~~~~~~~--~~~~~~-~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~--~~d~~~i 69 (418) |.-+.. .+.+++. .+.++.+++. ...++. --...|.+++.+.++|+.+|+++-+-++.+.- ......+ T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~ 80 (409) T protein:vir:94 1 MAKENIVTRIKKKLIDNWIDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVVNTEV 80 (409) T ss_pred CcccccchhhhhHHhhhhhcCCcccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeecccccchhH Confidence 222221 1122110 0111111111 001111 11234667899999999999999998888731 1111112 Q ss_pred H----HHHHHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCc Q lcl|NC_019404. 70 W----SRWDDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGK 144 (418) Q Consensus 70 ~----~~~~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~ 144 (418) . .+-..+-....|.+.+. +..++|.|++++.- + ..|.+..|.+++++.+++.... .+. T Consensus 81 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~---------~~G~~~~L~~l~~~~v~v~~~~-------~~~ 143 (409) T protein:vir:94 81 SDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIER-D---------IYHQPSKLFLLNPDVVEMLIEN-------QSR 143 (409) T ss_pred HHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEEcCceeEEEEeC-------CCc Confidence 2 22222333455555544 45778999988753 2 3456778889998888765321 234 Q ss_pred ceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecch Q lcl|NC_019404. 145 PLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKG 224 (418) Q Consensus 145 p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~ 224 (418) +.+|.+...++. ...++++.|+||.+.. +....+|.|++.. +.+.+.....+......-.....--+++.+. T Consensus 144 ~~~y~~~~~~g~-~~~~~~~dvih~r~~~------~~~~~~G~s~l~~-~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~ 215 (409) T protein:vir:94 144 ELYYSIHAATGN-KLIVHNMDMLHFKHIV------ASNMVQGISPIDV-LKNTTDFDNAVRTFNLTEMQKPDSFMLKYGS 215 (409) T ss_pred EEEEEEEcCCce-EEEEccccEEEecCCC------CCCccccccHHHH-HHHHHHHHHHHHHHHHHhcCCCCeeEEecCC Confidence 556777655433 3468899999996422 2244579999964 5565555444433321111111111223221 Q ss_pred HHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeeeeccCcc Q lcl|NC_019404. 225 LAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIILKNKNVG 302 (418) Q Consensus 225 l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L~G~s~~ 302 (418) . + +++.....++++.. ... +.+.+++..++.+|+.++.+.. .+-+.......+||.+.+||..+|.+...+ T Consensus 216 --~-l-~~e~~~~~~~~~~~--~~~-~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 288 (409) T protein:vir:94 216 --N-V-GKEKRQQVLEDFKQ--YYE-ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT 288 (409) T ss_pred --C-C-CHHHHHHHHHHHHH--Hhh-cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC Confidence 1 1 12223334444443 233 3445555556678888876654 455667778899999999999877554443 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--------ccCCceEEeC--CCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 303 GLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV--------NAEEWSVEFS--PLDHESSKDKAEVLEKSVNSIAA 372 (418) Q Consensus 303 gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~--------~~~~~~~~f~--pL~~~~eke~ae~~~~~a~a~~~ 372 (418) .. ++-|.....|+.. .|.|.++.+-..+- +..+..|+|+ .|...|.+++ ++++++ T Consensus 289 ~~-sn~e~~~~~f~~~-------~l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~-------~~~~~~ 353 (409) T protein:vir:94 289 NF-AKNEELNRFYLQH-------TLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQ-------AEVYFK 353 (409) T ss_pred Cc-ccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHH-------HHHHHH Confidence 33 4456666677664 37888777654432 1234556654 6666666554 677889 Q ss_pred HHhCCCCCHHHHHHHHHhhcCcCCCChh-----------hcccc----cccCCCcccc Q lcl|NC_019404. 373 LIAAGAMDIKEARDTLRTIAPEIKIGDN-----------DIQTE----ESELITETEV 415 (418) Q Consensus 373 ~~~~g~i~~~e~r~~l~~~~~~~~~~~~-----------~~~~~----e~~~~~e~e~ 415 (418) ++++|++|++|+|+.+. ..|..+ .|+ ...+. +.-.++.+|+ T Consensus 354 ~~~~G~~T~NE~R~~~g-~~p~~g-gD~~~~~~n~~~~~~~~~~~~~~kGG~~n~~e~ 409 (409) T protein:vir:94 354 AVRSGYYTINDIREWED-LPPVEG-GDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred HHhCCCcCHHHHHHHhC-CCCCCC-cCeEeecccccccccchhhcccccCCCCCcCCC Confidence 99999999999998763 222111 111 11111 1111223344 No 66 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=99.74 E-value=6.7e-18 Score=114.73 Aligned_cols=363 Identities=12% Similarity=0.060 Sum_probs=195.7 Q ss_pred Cccchhh-------HHHHhcCCC-CccccCccccCCH-HHHHHHHHcCCccchhhhcchhhhccCCccccCcch--HHHH Q lcl|NC_019404. 1 MVKTDSY-------ANIFLGGSD-GSEIYGSLQNQAP-TILASLYADNALVRRIIDTIPETALAAGFHIDGIDD--EPAF 69 (418) Q Consensus 1 ~~~~D~~-------~n~~~g~~~-~~~~~~~~~~~~~-~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d--~~~i 69 (418) |-..--+ .+-..+..+ ....+......++ ---...|.+++.+.++|+.+|+++-+-++.+.-..+ ...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~~~~~~~~ 80 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVVNTEV 80 (409) T ss_pred CCccchhhhhhhhhhhhhhccccccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeeccccccchH Confidence 2222211 111111111 1111100000000 001123567888999999999999998887732111 1111 Q ss_pred H----HHHHHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCc Q lcl|NC_019404. 70 W----SRWDDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGK 144 (418) Q Consensus 70 ~----~~~~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~ 144 (418) . .+-...-....|.+.+. ...++|.|++++.-+ ..|.+..|.++++..+++.... .+. T Consensus 81 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~----------~~G~~~~L~~l~~~~v~~~~~~-------~~~ 143 (409) T protein:vir:93 81 SDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD----------IYHQPSKLFLLNPDVVEMLIEN-------QSR 143 (409) T ss_pred HHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEEC----------CCCcEEEEEEEcCceeEEEEeC-------CCc Confidence 2 22222233444544444 456789999887532 2355778889888877654321 233 Q ss_pred ceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC---ceee Q lcl|NC_019404. 145 PLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ---AVWK 221 (418) Q Consensus 145 p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~---~v~k 221 (418) +..|.++..++. ...++++.|+||.+.. +....+|.|++.. +.+.+.....+.... +...+. .+++ T Consensus 144 ~~~y~~~~~~g~-~~~~~~~eVih~r~~~------~~~~~~G~s~i~~-~~~~i~~~~~~~~~~---~~~~~~~~~~i~~ 212 (409) T protein:vir:93 144 ELYYSIHAATGN-KLIVHNMDMLHFKHIV------ASNMVQGISPIDV-LKNTTDFDNAVRTFN---LTEMQKPDSFMLK 212 (409) T ss_pred EEEEEEEcCCce-EEEEccccEEEeCCCC------CCCccccccHHHH-HHHHHHHHHHHHHHH---HHhcCCCCceEEe Confidence 456777765433 3568999999996432 2234579999965 556555544443332 222221 1223 Q ss_pred cchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeeeecc Q lcl|NC_019404. 222 AKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIILKNK 299 (418) Q Consensus 222 ~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L~G~ 299 (418) .+. . + +++.....++++.. ... +.+.+++..++.+|++++.+.. .+-+......+.||.+.+||..+|.+. T Consensus 213 ~~~--~-l-~~e~~~~~~~~~~~--~~~-~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~ 285 (409) T protein:vir:93 213 YGS--N-V-GKEKRQQVLEDFKQ--YYE-ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNAR 285 (409) T ss_pred cCC--C-C-CHHHHHHHHHHHHH--Hhh-cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC Confidence 321 1 1 12223344444442 223 3444555556678988876654 345566678899999999999877554 Q ss_pred CccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---c-----cCCceEEeC--CCCCCCHHHHHHHHHHHHHH Q lcl|NC_019404. 300 NVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---N-----AEEWSVEFS--PLDHESSKDKAEVLEKSVNS 369 (418) Q Consensus 300 s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~-----~~~~~~~f~--pL~~~~eke~ae~~~~~a~a 369 (418) ..+.. ++.|+..+.|+... |.|.++.+-..+- . ..++.|+|+ .|...|.+++ +++ T Consensus 286 ~~~~~-sn~e~~~~~f~~~~-------l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~-------~~~ 350 (409) T protein:vir:93 286 SNTNF-AKNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQ-------AEV 350 (409) T ss_pred CCCCc-ccHHHHHHHHHHHH-------HHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHH-------HHH Confidence 44433 45566667777653 7888777654432 1 124556654 6666666554 677 Q ss_pred HHHHHhCCCCCHHHHHHHHHhhcCcCCCChh-----------hccc----ccccCCCcccc Q lcl|NC_019404. 370 IAALIAAGAMDIKEARDTLRTIAPEIKIGDN-----------DIQT----EESELITETEV 415 (418) Q Consensus 370 ~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~-----------~~~~----~e~~~~~e~e~ 415 (418) +++++++|++|++|+|+.+. ..+..+ +|+ ...+ ...-..+++|+ T Consensus 351 ~~~~~~~G~~T~NE~R~~~g-~~p~~g-gD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 351 YFKAVRSGYYTINDIREWED-LPPVEG-GDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred HHHHHhCCCcCHHHHHHHhC-CCCCCC-cCeeeecccccccccchhhcccccCCCCCcCCC Confidence 88999999999999999773 222111 111 0010 01111233444 No 67 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=99.73 E-value=1.3e-17 Score=113.12 Aligned_cols=366 Identities=12% Similarity=0.094 Sum_probs=203.4 Q ss_pred CccchhhHHHHhcCC---CCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC---cchHHH-----H Q lcl|NC_019404. 1 MVKTDSYANIFLGGS---DGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG---IDDEPA-----F 69 (418) Q Consensus 1 ~~~~D~~~n~~~g~~---~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~---~~d~~~-----i 69 (418) .++..+....+-..+ .+.+..|. ..++... .+++-+.+||+.++++.-+-++.+.- +..... + T Consensus 17 ~~~~~~~~~~~~~~~~~~~g~~~~g~--~v~~~~a----l~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~g~~~~~~~~~~ 90 (454) T protein:vir:93 17 DVREAGWTSLFQAVAEPFAGAWQQGV--KADPEAV----LSFHAVFACISLISQDIAKMRLRLMQTDAQGIRRETRRGDI 90 (454) T ss_pred cccchhhhhhhhhhhhhhcchhhcCc--ccChHHh----hccHHHHHHHHHHHHhhccCceEEEEeccCCccchhhhHHH Confidence 333333333221100 01111221 2233322 34567889999999999998888731 111111 1 Q ss_pred HHHHHH---hCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcc Q lcl|NC_019404. 70 WSRWDD---LEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKP 145 (418) Q Consensus 70 ~~~~~~---l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p 145 (418) ..-+.+ .-....|.+.+. +..++|.|++++.-+ ..|.+..|.++++.++++.... -|. T Consensus 91 ~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~----------~~G~~~~L~~i~~~~v~v~~~~-------~g~- 152 (454) T protein:vir:93 91 ARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRN----------ARGQIKELRILDWNRVEPLVAD-------DGE- 152 (454) T ss_pred HHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEEC----------CCCcEEEEEEEcCcceEEEEcC-------CCc- Confidence 111112 222344555555 456789999887542 2356888999999888764321 122 Q ss_pred eEEEEecCCc---ccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC--cee Q lcl|NC_019404. 146 LTYRITTNES---DMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ--AVW 220 (418) Q Consensus 146 ~~y~i~~~~~---~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~--~v~ 220 (418) -.|+++.... .....+.++.||||.... .....+|.|++.. +.+.|.....+......++..... -++ T Consensus 153 ~~y~~~~~~~~~~~~~~~~~~~eViH~k~~~------~~~~~~G~sp~~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil 225 (454) T protein:vir:93 153 VFYRITPDRNCGITEAVTVPAREVIHDRFNC------FFHPLIGLPPVYA-AGLAATQGHHIQENSTSFFRNGGRPSGVI 225 (454) T ss_pred EEEEEEeccccccceeEEecCcceEEeccCC------CCCCceeccHHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEE Confidence 2455543221 223468889999995321 2234579999975 778888888888888887766444 356 Q ss_pred ecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeec Q lcl|NC_019404. 221 KAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKN 298 (418) Q Consensus 221 k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G 298 (418) ++++. + +++...+++++++......+.++.+++ ..+.+|++++.+..+ +-+...+....||.+.|||..+| | T Consensus 226 ~~~~~---l-~~e~~~~~~~~~~~~~~g~n~g~~~vl-~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g 299 (454) T protein:vir:93 226 EIPGS---I-TEENAKKLKSNWDSGYTGENAGKTAIL-SNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKI-G 299 (454) T ss_pred ecCCC---C-CHHHHHHHHHHHHHHhcccccCCceec-cCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHc-C Confidence 76631 2 233344555666554433333334444 445788888876654 33566688899999999998755 6 Q ss_pred cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hccCCceEEe--CCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 299 KNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VNAEEWSVEF--SPLDHESSKDKAEVLEKSVNSIAA 372 (418) Q Consensus 299 ~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~~~~~~~~f--~pL~~~~eke~ae~~~~~a~a~~~ 372 (418) ...++-.++-+...+.|+.. .|.|++..+-..+ +...+..++| +.|...|.+++ ++++.+ T Consensus 300 ~~~~~t~sn~e~~~~~f~~~-------~l~P~~~~ie~~ln~~L~~~~~~~~~f~~~~ll~~D~~~r-------~~~~~~ 365 (454) T protein:vir:93 300 VGQPPSSDNVEALEQQYYSQ-------CLQTLIESIELLLDEALETGENESTEFDVTTLLRMDSERR-------MKTLGD 365 (454) T ss_pred CCCCCcchhHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCCCCcEEEeechhhhccCHHHH-------HHHHHH Confidence 55444445556556666554 4788877664443 2234555555 46666666554 667888 Q ss_pred HHhCCCCCHHHHHHHHHhhcCcCCC----------Chhhccc--ccc-------------cCCCccccccC Q lcl|NC_019404. 373 LIAAGAMDIKEARDTLRTIAPEIKI----------GDNDIQT--EES-------------ELITETEVVIA 418 (418) Q Consensus 373 ~~~~g~i~~~e~r~~l~~~~~~~~~----------~~~~~~~--~e~-------------~~~~e~e~~~~ 418 (418) ++++|+++++|+|+.+. ..+..+- ..+++.. ..+ +..++++.--. T Consensus 366 ~~~~G~~T~NE~R~~~g-l~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 435 (454) T protein:vir:93 366 AVKNTLLTPNEARKREN-LPPLAGGDALYLQQQNYSLEALSRRDAREDPFASSGKTASVPQAVAASDGNKA 435 (454) T ss_pred HHhCCCcCHHHHHHHhC-CCCCCCCCeeeeccCccchHhhhccCcccCCCCCCccCCCCCCCCCCCCCCCC Confidence 99999999999998763 2221110 0001100 000 00000000000 No 68 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=99.73 E-value=5e-18 Score=115.42 Aligned_cols=366 Identities=13% Similarity=0.064 Sum_probs=193.1 Q ss_pred CccchhhHHHHhcCCC--Ccc--------ccCccc-cCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHH-- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSD--GSE--------IYGSLQ-NQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEP-- 67 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~--~~~--------~~~~~~-~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~-- 67 (418) -|...|+....-..+. ... ..+... ......-..+ .+++-+.++|+.+|++.-+-++.+..+.... T Consensus 22 ~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a-l~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~ 100 (441) T protein:vir:94 22 ELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEA-IRHSDIFTAVMMIASDLARMPIRVTVNGQINYS 100 (441) T ss_pred hhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhh-hccHHHHHHHHHHHHhhccCceeeecCcccccc Confidence 2222222110000000 000 000000 0000001111 3455667899999999988888774321111 Q ss_pred -HHHHHH----HHhCchHHHHHHH-HhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccc Q lcl|NC_019404. 68 -AFWSRW----DDLEMTQNINDAW-SWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNAR 141 (418) Q Consensus 68 -~i~~~~----~~l~~~~~~~~a~-~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~ 141 (418) .+...+ ..+-....|.+.+ ....++|.|++++.- + ..|.+..|.+++++.+++... . T Consensus 101 ~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~---------~~G~~~~L~~i~~~~v~v~~d-------~ 163 (441) T protein:vir:94 101 DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITR-D---------KTGEPMNLTFRKTSEIELKSD-------A 163 (441) T ss_pred chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEEcCceeEEEEC-------C Confidence 111111 1111223444444 446778999988743 2 335677889999988876432 1 Q ss_pred cCcceEEEEe--cCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC-- Q lcl|NC_019404. 142 FGKPLTYRIT--TNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ-- 217 (418) Q Consensus 142 yg~p~~y~i~--~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~-- 217 (418) .|.+.++... +.+......++++.||||...++ ...+|.||+.. +.+.|.....+......++..... T Consensus 164 ~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~~-------dg~~G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~ 235 (441) T protein:vir:94 164 RGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSL-------DGINGLSLLDT-LSRTIESDNNGKDFLNNFLRNGTHAG 235 (441) T ss_pred CccEEEEEEEeccCCceeEEEEccccEEEeccCCC-------CCccccCHHHH-HHHHHHHHHHHHHHHHHHHhccCCCc Confidence 3444333222 22333335689999999964321 23579999975 778888888888888888877553 Q ss_pred ceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeee Q lcl|NC_019404. 218 AVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEII 295 (418) Q Consensus 218 ~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~ 295 (418) -++++++ .+.+.+..+.+++++........+.+.+++..++.+|+.++.+... +-+........||.+.+||..+ T Consensus 236 gil~~~~---~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~ 312 (441) T protein:vir:94 236 GILKMKG---VLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHK 312 (441) T ss_pred EEEEcCC---CCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHH Confidence 3556653 1122222233444444433322233445555566789888876543 5566778889999999999976 Q ss_pred eeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcc--CCce--EEeCCCCCCCHHHHHHHHHHHHH Q lcl|NC_019404. 296 LKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VNA--EEWS--VEFSPLDHESSKDKAEVLEKSVN 368 (418) Q Consensus 296 L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~--~~~~--~~f~pL~~~~eke~ae~~~~~a~ 368 (418) | |...++.+. + +...+|. ..|.|.+..+-..| +.. .+.. |+++.|...|.+++ ++ T Consensus 313 l-g~~~~~~s~--~-q~~~~~~-------~tl~P~~~~ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~-------~~ 374 (441) T protein:vir:94 313 F-GIETANMSI--T-DANLDYL-------STLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQ-------AE 374 (441) T ss_pred c-CCCCCCccH--H-HHHHHHH-------HHHHHHHHHHHHHHhhhccccccCceEEeechhhhccCHHHH-------HH Confidence 5 665554332 2 2222222 13677766654433 222 2444 44456666666554 77 Q ss_pred HHHHHHhCCCCCHHHHHHHHHhhcCcCCCCh---------hhcccc-----------cc--cCCCccc Q lcl|NC_019404. 369 SIAALIAAGAMDIKEARDTLRTIAPEIKIGD---------NDIQTE-----------ES--ELITETE 414 (418) Q Consensus 369 a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~---------~~~~~~-----------e~--~~~~e~e 414 (418) ++++++++|++|++|+|+.+. ..|..+.+. -.++.. +. +.=+++| T Consensus 375 ~~~~~i~~G~~T~NE~R~~~g-l~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 375 IDKINIDSGKMNIDEIRQRDG-LAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred HHHHHHhCCCcCHHHHHHHhC-CCCCCCCCcceEeecccccccccccccccccccccccccCCCCCCC Confidence 788999999999999998762 222111100 001000 00 1112222 No 69 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=99.73 E-value=5e-18 Score=115.42 Aligned_cols=366 Identities=13% Similarity=0.064 Sum_probs=193.1 Q ss_pred CccchhhHHHHhcCCC--Ccc--------ccCccc-cCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHH-- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSD--GSE--------IYGSLQ-NQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEP-- 67 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~--~~~--------~~~~~~-~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~-- 67 (418) -|...|+....-..+. ... ..+... ......-..+ .+++-+.++|+.+|++.-+-++.+..+.... T Consensus 22 ~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a-l~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~ 100 (441) T protein:vir:79 22 ELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEA-IRHSDIFTAVMMIASDLARMPIRVTVNGQINYS 100 (441) T ss_pred hhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhh-hccHHHHHHHHHHHHhhccCceeeecCcccccc Confidence 2222222110000000 000 000000 0000001111 3455667899999999988888774321111 Q ss_pred -HHHHHH----HHhCchHHHHHHH-HhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccc Q lcl|NC_019404. 68 -AFWSRW----DDLEMTQNINDAW-SWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNAR 141 (418) Q Consensus 68 -~i~~~~----~~l~~~~~~~~a~-~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~ 141 (418) .+...+ ..+-....|.+.+ ....++|.|++++.- + ..|.+..|.+++++.+++... . T Consensus 101 ~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~---------~~G~~~~L~~i~~~~v~v~~d-------~ 163 (441) T protein:vir:79 101 DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITR-D---------KTGEPMNLTFRKTSEIELKSD-------A 163 (441) T ss_pred chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEEcCceeEEEEC-------C Confidence 111111 1111223444444 446778999988743 2 335677889999988876432 1 Q ss_pred cCcceEEEEe--cCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC-- Q lcl|NC_019404. 142 FGKPLTYRIT--TNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ-- 217 (418) Q Consensus 142 yg~p~~y~i~--~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~-- 217 (418) .|.+.++... +.+......++++.||||...++ ...+|.||+.. +.+.|.....+......++..... T Consensus 164 ~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~~-------dg~~G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~ 235 (441) T protein:vir:79 164 RGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSL-------DGINGLSLLDT-LSRTIESDNNGKDFLNNFLRNGTHAG 235 (441) T ss_pred CccEEEEEEEeccCCceeEEEEccccEEEeccCCC-------CCccccCHHHH-HHHHHHHHHHHHHHHHHHHhccCCCc Confidence 3444333222 22333335689999999964321 23579999975 778888888888888888877553 Q ss_pred ceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeee Q lcl|NC_019404. 218 AVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEII 295 (418) Q Consensus 218 ~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~ 295 (418) -++++++ .+.+.+..+.+++++........+.+.+++..++.+|+.++.+... +-+........||.+.+||..+ T Consensus 236 gil~~~~---~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~ 312 (441) T protein:vir:79 236 GILKMKG---VLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHK 312 (441) T ss_pred EEEEcCC---CCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHH Confidence 3556653 1122222233444444433322233445555566789888876543 5566778889999999999976 Q ss_pred eeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcc--CCce--EEeCCCCCCCHHHHHHHHHHHHH Q lcl|NC_019404. 296 LKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VNA--EEWS--VEFSPLDHESSKDKAEVLEKSVN 368 (418) Q Consensus 296 L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~--~~~~--~~f~pL~~~~eke~ae~~~~~a~ 368 (418) | |...++.+. + +...+|. ..|.|.+..+-..| +.. .+.. |+++.|...|.+++ ++ T Consensus 313 l-g~~~~~~s~--~-q~~~~~~-------~tl~P~~~~ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~-------~~ 374 (441) T protein:vir:79 313 F-GIETANMSI--T-DANLDYL-------STLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQ-------AE 374 (441) T ss_pred c-CCCCCCccH--H-HHHHHHH-------HHHHHHHHHHHHHHhhhccccccCceEEeechhhhccCHHHH-------HH Confidence 5 665554332 2 2222222 13677766654433 222 2444 44456666666554 77 Q ss_pred HHHHHHhCCCCCHHHHHHHHHhhcCcCCCCh---------hhcccc-----------cc--cCCCccc Q lcl|NC_019404. 369 SIAALIAAGAMDIKEARDTLRTIAPEIKIGD---------NDIQTE-----------ES--ELITETE 414 (418) Q Consensus 369 a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~---------~~~~~~-----------e~--~~~~e~e 414 (418) ++++++++|++|++|+|+.+. ..|..+.+. -.++.. +. +.=+++| T Consensus 375 ~~~~~i~~G~~T~NE~R~~~g-l~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 375 IDKINIDSGKMNIDEIRQRDG-LAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred HHHHHHhCCCcCHHHHHHHhC-CCCCCCCCcceEeecccccccccccccccccccccccccCCCCCCC Confidence 788999999999999998762 222111100 001000 00 1112222 No 70 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=99.73 E-value=1.1e-17 Score=113.66 Aligned_cols=368 Identities=11% Similarity=0.088 Sum_probs=197.1 Q ss_pred hHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH-H---HHHHHHH-H---hCc Q lcl|NC_019404. 7 YANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE-P---AFWSRWD-D---LEM 78 (418) Q Consensus 7 ~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-~---~i~~~~~-~---l~~ 78 (418) +..+-.|.|+-. ..+ +.. ....-.+.|..++.+.++|+.++++.-+-++.+...+.+ . .+...+. + .-- T Consensus 1 ~~~~~~~~g~~~-~~~-~~~-~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~~~~l~~lL~~~PN~~~t 77 (723) T protein:vir:94 1 MTTFPSGAGGWN-AWS-ADS-VFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDGELDELHPLSQLWNVMPNRAMP 77 (723) T ss_pred CcccccCCCccc-ccc-ccc-cccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccchhhHHHHHHhhCCCCCCC Confidence 111112222111 111 110 111113457889999999999999999888887432211 1 1222222 1 112 Q ss_pred hHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccc-ccccccccCcceEEEEecCCcc Q lcl|NC_019404. 79 TQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNRE-ENPRNARFGKPLTYRITTNESD 156 (418) Q Consensus 79 ~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~-~dp~s~~yg~p~~y~i~~~~~~ 156 (418) ...|.+.+.. -.++|.+++++...+ .. ..|.+..+.+++++.+.+.... .+... .+..-.|.+...++ T Consensus 78 ~~~f~~~~~~~lll~Gnay~~i~r~~-r~------~~g~p~~l~~l~~~~~~v~~~~~~~~~~--~~~~~~y~~~~~~G- 147 (723) T protein:vir:94 78 AQVLKALSMTRLQLDGQCHLWLNYNG-RT------PAGVPDEIWYVYDRVTTIVATRAADAVP--QAQIIGYVIERTDG- 147 (723) T ss_pred HHHHHHHHHHHHhhcCCeEEEEEecC-Cc------cccceeEEEEecCcceEEeecCCCccce--eeeeeEEEEEecCc- Confidence 3456666665 456799999876532 21 2244556667665444332211 11111 11123455544332 Q ss_pred cccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC--ceeecchHHHhhcCcch Q lcl|NC_019404. 157 MFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ--AVWKAKGLAELCDDSEG 234 (418) Q Consensus 157 ~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~--~v~k~~~l~~~~~~~~~ 234 (418) ....++++.||||.+.- +.+..+|.||+.. +.+.|.....+.......+..... -+++++.+ +.+. T Consensus 148 ~~~~~~~~dIiHir~~~------~~dg~~G~Spi~~-a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~~l-----~~e~ 215 (723) T protein:vir:94 148 VRVPVLADEMLWLRFSD------PYDPLAVMAPWKA-ARAAVDADFYAATWQRQSFKNGARPGGVVNLGDM-----DEQT 215 (723) T ss_pred eeEEecccceEEecCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCC-----CHHH Confidence 23568899999996431 2344579999975 778888888888877777765443 35555432 2223 Q ss_pred HHHHHHHHHHHHHh-cCCcceeEEEc---------CCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCcc Q lcl|NC_019404. 235 FGAARLRLAQVDNN-SGVGQAIGIDA---------ESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVG 302 (418) Q Consensus 235 ~~~~~~r~~~~~~~-~~~~~~~~~d~---------~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~ 302 (418) ..+..+++...-.. .+.+..+++.+ ++-+|+.++.+..+ +-+...+..+.||.+.+||...|+|.++ T Consensus 216 ~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st- 294 (723) T protein:vir:94 216 FTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGST- 294 (723) T ss_pred HHHHHHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCC- Confidence 33444444432221 22334455433 33466666655433 3355677788899999999887766432 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---cc---CCceEEeCC--CCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 303 GLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---NA---EEWSVEFSP--LDHESSKDKAEVLEKSVNSIAALI 374 (418) Q Consensus 303 gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~~---~~~~~~f~p--L~~~~eke~ae~~~~~a~a~~~~~ 374 (418) +++.++....||. ..|.|.++.+-..|- .+ .+++|+|+. |...|.++ +++++..++ T Consensus 295 --~sN~e~~~~~f~~-------~tL~P~~~~ie~~ln~~Ll~~~g~~~~~~f~~~~lLr~D~~~-------r~~~~~~~v 358 (723) T protein:vir:94 295 --YENQAEAKAAVWT-------ETLIPQMEVMASITDLQLLPDIGWTVEWDFNSVPALQEDLEA-------QAGRNQGYL 358 (723) T ss_pred --cccHHHHHHHHHH-------HHHHHHHHHHHHHHhHhhcccccCceEEeecchhhhhcCHHH-------HHHHHHHHH Confidence 2233444455654 337787766655442 12 256778875 45566554 467889999 Q ss_pred hCCCCCHHHHHHHHHhhcCcCCCChhhc---------ccccccCCCcccc---ccC Q lcl|NC_019404. 375 AAGAMDIKEARDTLRTIAPEIKIGDNDI---------QTEESELITETEV---VIA 418 (418) Q Consensus 375 ~~g~i~~~e~r~~l~~~~~~~~~~~~~~---------~~~e~~~~~e~e~---~~~ 418 (418) ++|++|++|+|+.+. ..|-.+ ++.++ -..+...+..+|+ +.| T Consensus 359 ~~G~~T~NE~R~~lg-lpPi~g-Gd~~~~~~p~~~~~a~~~~~~p~~~e~~~~~~~ 412 (723) T protein:vir:94 359 VNDVLMVDEVRATIG-LDPLPG-GIGQMTLTPYRAQFAPAPAPAPAVEEGAARMLA 412 (723) T ss_pred hCCCcCHHHHHHHhC-CCCCCC-CcccceeccccccccCCCCCCccchhhhHhhhh Confidence 999999999998762 222111 11111 1111222222333 112 No 71 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=99.73 E-value=1.6e-17 Score=112.62 Aligned_cols=362 Identities=13% Similarity=0.075 Sum_probs=197.9 Q ss_pred Cccchhh---HHHH----hcCCCCccccCcc--ccCCHH-HHHHHHHcCCccchhhhcchhhhccCCccccCcc--hHHH Q lcl|NC_019404. 1 MVKTDSY---ANIF----LGGSDGSEIYGSL--QNQAPT-ILASLYADNALVRRIIDTIPETALAAGFHIDGID--DEPA 68 (418) Q Consensus 1 ~~~~D~~---~n~~----~g~~~~~~~~~~~--~~~~~~-~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~--d~~~ 68 (418) |-+.--+ .+.+ .+... ++.+++. ...++. --...|.+++.+.++|+.+|+++-+-++.+.-.. .... T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~-~~~~~~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~~~~~~~ 79 (409) T protein:vir:96 1 MAKENIVTRIKKKLIDNWIDQSA-SKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVVNTE 79 (409) T ss_pred CccccchhhhhhHHhhhhhcccc-ccccccccccCccccccchhhHhhhHHHHHHHHHHHHhhhhCceEEeecccccchh Confidence 3332222 1122 12111 1111110 000100 1122356788899999999999998888773111 1111 Q ss_pred HHHHH----HHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccC Q lcl|NC_019404. 69 FWSRW----DDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFG 143 (418) Q Consensus 69 i~~~~----~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg 143 (418) +...+ ...--...|.+.+. +..++|.|++++.- + ..|.+..|.+++++.+++.... .+ T Consensus 80 l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~---------~~G~~~~L~~l~~~~v~v~~~~-------~~ 142 (409) T protein:vir:96 80 VSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIER-D---------IYHQPSKLFLLNPDVVEMLIEN-------QS 142 (409) T ss_pred HHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEE-C---------CCCcEEEEEEEcCceeEEEEeC-------CC Confidence 22222 22223344544444 45678999988743 2 3356778889998888765421 23 Q ss_pred cceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC---cee Q lcl|NC_019404. 144 KPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ---AVW 220 (418) Q Consensus 144 ~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~---~v~ 220 (418) .+..|.+...++. ...+.++.||||.+.. +.+..+|.|++.. +.+.+.....+.... ++.... -++ T Consensus 143 ~~~~y~~~~~~g~-~~~~~~~evih~r~~~------~~~~~~G~s~l~~-~~~~i~~~~~~~~~~---~~~~~~~~~~i~ 211 (409) T protein:vir:96 143 RELYYSIHAATGN-KLIVHNMDMLHFKHIV------ASNMVQGISPIDV-LKNTTDFDNAVRTFN---LTEMQKPDSFML 211 (409) T ss_pred cEEEEEEEcCCce-EEEEccccEEEeCCCC------CCCccccccHHHH-HHHHHHHHHHHHHHH---HHhcCCCceeEE Confidence 4456777654432 3468889999996432 2344679999965 556555444433332 222221 233 Q ss_pred ecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeeeec Q lcl|NC_019404. 221 KAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIILKN 298 (418) Q Consensus 221 k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L~G 298 (418) +.+. .+ +++......+++... . ++.+.+++..++.+|+.++.+.. .+-+........||.+.+||..+|.+ T Consensus 212 ~~~~---~l-~~e~~~~~~~~~~~~--~-~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~ 284 (409) T protein:vir:96 212 KYGS---NV-STEKRQQVLEDFKQY--Y-EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNA 284 (409) T ss_pred ecCC---CC-CHHHHHHHHHHHHHH--h-hcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC Confidence 3321 11 122333444444332 2 33445666666688998887765 34556777889999999999987754 Q ss_pred cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--------cCCceEEeC--CCCCCCHHHHHHHHHHHHH Q lcl|NC_019404. 299 KNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVN--------AEEWSVEFS--PLDHESSKDKAEVLEKSVN 368 (418) Q Consensus 299 ~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~--------~~~~~~~f~--pL~~~~eke~ae~~~~~a~ 368 (418) ...+.. ++.|+..+.|+.. .|.|.++.+-..|-+ ..+..|+|+ .|...|.+++ ++ T Consensus 285 ~~~~~~-s~~e~~~~~f~~~-------~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~-------~e 349 (409) T protein:vir:96 285 RSNTNF-AKNEELNRFYLQH-------TLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQ-------AE 349 (409) T ss_pred CCCCCc-ccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHH-------HH Confidence 333333 4556666677665 378887777554421 234556664 6666666554 77 Q ss_pred HHHHHHhCCCCCHHHHHHHHHhhcCcCCCChh-----------hccc----ccccCCCcccc Q lcl|NC_019404. 369 SIAALIAAGAMDIKEARDTLRTIAPEIKIGDN-----------DIQT----EESELITETEV 415 (418) Q Consensus 369 a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~-----------~~~~----~e~~~~~e~e~ 415 (418) ++++++++|++|++|+|+.+. ..|..+ +|+ ...+ .+.-..+++|+ T Consensus 350 ~~~~~~~~G~~T~NE~R~~~g-~~pi~g-gD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 350 VYFKAVRSGYYTINDIREWED-LPPVEG-GDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred HHHHHHhCCCCCHHHHHHHhC-CCCCCC-cceeeecccccccccchhhcccccCCCCCcCCC Confidence 788999999999999999873 222111 111 1110 11112233444 No 72 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=99.72 E-value=7e-18 Score=114.65 Aligned_cols=362 Identities=13% Similarity=0.104 Sum_probs=206.7 Q ss_pred Cccch-hhHHH----HhcCCC----CccccCc------c--ccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc-C Q lcl|NC_019404. 1 MVKTD-SYANI----FLGGSD----GSEIYGS------L--QNQAPTILASLYADNALVRRIIDTIPETALAAGFHID-G 62 (418) Q Consensus 1 ~~~~D-~~~n~----~~g~~~----~~~~~~~------~--~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~-~ 62 (418) =++.| |+-|- |.|... .+...+. . ...++ ..+.+++.+.+||+.++++.-+-++.+- . T Consensus 9 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~----~~al~~~~v~~cv~~Ia~~iA~lp~~~~~~ 84 (424) T protein:vir:18 9 DLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIND----ERILQISTVWRCVSLISTLTACLPLDVFET 84 (424) T ss_pred eecCCCchHHHHHhhhcccccccccccccccccccccccccccccH----HHhhccHHHHHHHHHHHHhhccCceEEEEe Confidence 11111 22222 222110 0000110 0 01122 2345678889999999999999888872 1 Q ss_pred -cc-hHH------HHHHHHH----HhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccc Q lcl|NC_019404. 63 -ID-DEP------AFWSRWD----DLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVK 129 (418) Q Consensus 63 -~~-d~~------~i~~~~~----~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~ 129 (418) .+ ... .+...+. ..-....|.+.+. +-.++|.|++++.- + ..|.+..|.++++..++ T Consensus 85 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~---------~~G~~~~L~pl~~~~V~ 154 (424) T protein:vir:18 85 DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR-N---------SAGDVISLLPLQSANMD 154 (424) T ss_pred ecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEecCcceE Confidence 11 111 1111121 1222334444444 55678999988742 2 34567789999998887 Q ss_pred cccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 130 VQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLAT 209 (418) Q Consensus 130 ~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~ 209 (418) +... +.+-.|.+..++. ...+.++.||||.+.. ....+|.||+.. +.+.|.....+..... T Consensus 155 v~~~---------~~~~~y~~~~~g~--~~~~~~~eIih~r~~~-------~dg~~G~spi~~-~~~~i~~~~a~~~~~~ 215 (424) T protein:vir:18 155 VKLV---------GKKVVYRYQRDSE--YADFSQKEIFHLKGFG-------FTGLVGLSPIAF-ACKSAGVAVAMEDQQR 215 (424) T ss_pred EEEc---------CCeEEEEEEeCCe--EEEeccccEEEecCcC-------CCCcccccHHHH-HHHHHHHHHHHHHHHH Confidence 6321 2334677765543 2478999999996432 123579999975 7899999999999999 Q ss_pred HHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHH Q lcl|NC_019404. 210 QLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRI 285 (418) Q Consensus 210 ~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~i 285 (418) .++.....+ +++++. ..+ +.+....++++++......+.++.+++ .++.+|++++.+..+ +-+...+..+.| T Consensus 216 ~~f~ng~~p~gil~~~~--~~l-~~e~~~~~~~~~~~~~~g~nag~~~vl-~~g~~~~~l~~~~~d~q~le~~~~~~~~I 291 (424) T protein:vir:18 216 DFFANGAKSPQILSTGE--KVL-TEQQRSQVEENFKEIAGGPVKKRLWIL-EAGFSTSAIGVTPQDAEMMASRKFQVSEL 291 (424) T ss_pred HHHHccCCcceEEEeCC--cCC-CHHHHHHHHHHHHHHhCCcccCCceec-cCCceEEecCCChhHHHHHHHHHHHHHHH Confidence 988876544 555542 112 233344555566544433333344444 456788888776543 455677888999 Q ss_pred hhhhcCCeeeeeccCccc-c-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---c-c---CC--ceEEeCCCCCC Q lcl|NC_019404. 286 VALSGIHEIILKNKNVGG-L-SSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---N-A---EE--WSVEFSPLDHE 354 (418) Q Consensus 286 aaas~IP~t~L~G~s~~g-l-~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~-~---~~--~~~~f~pL~~~ 354 (418) |.+.|||..+| |...++ . +|.-++....|+.. -|.|++..+-..|- . . .+ ++|+++.|... T Consensus 292 a~~fgVPp~~l-g~~~~~t~~~sn~eq~~~~f~~~-------tl~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~ 363 (424) T protein:vir:18 292 ARFFGVPPHLV-GDVEKSTSWGSGIEQQNLGFLQY-------TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRG 363 (424) T ss_pred HHHhCCCHHHh-CCCCCcccccccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhcc Confidence 99999998766 554333 2 23335555566643 47888877744442 2 2 23 34555677777 Q ss_pred CHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhh--------cccccccCCCccccc Q lcl|NC_019404. 355 SSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDND--------IQTEESELITETEVV 416 (418) Q Consensus 355 ~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~--------~~~~e~~~~~e~e~~ 416 (418) |.+++ ++++.+++++|++|++|+|+.+. ..+..+ .|+- +.......+.++++- T Consensus 364 d~~~r-------~~~~~~~~~~G~~T~NE~R~~~g-l~pi~g-GD~~~~~~n~~~l~~~~~~~~p~~~ga 424 (424) T protein:vir:18 364 DSASR-------AAFMKAMGEAGLRTINEMRRTDN-LPPLPG-GDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred CHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCC-cCeeeeccCccchHhhhccCCCccCCC Confidence 77765 66777899999999999998762 222211 1110 111111122222222 No 73 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=99.72 E-value=8.6e-18 Score=114.14 Aligned_cols=358 Identities=14% Similarity=0.101 Sum_probs=190.4 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHH----HHHHHHH-- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEP----AFWSRWD-- 74 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~----~i~~~~~-- 74 (418) -+..|.+-+.++|++..... . ...+ .+++-+.++|+.+|+++-+-.+.+...+... .+...+. T Consensus 11 ~~~~~~~~~~~~~~~~~~~~----~-----~~~A--l~~~~V~~~i~~Ia~~iA~lp~~~~~~~g~~~~~~~~~~lL~~~ 79 (406) T protein:vir:97 11 KVSYDDYISSVLAGDVSQKY----L-----GVSA--LKNSDILTATSIIAGDIARFPLVKKDVNGDIIHDEDINYLLNVK 79 (406) T ss_pred CCCcchHHHHHhcCCCCccc----c-----cchh--hccHHHHHHHHHHHHhhhhCeeEEEecCccccccchHHHHhhcc Confidence 22223333334433221110 0 1111 1345567799999999988777664322111 1222221 Q ss_pred --HhCchHHHHH-HHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEe Q lcl|NC_019404. 75 --DLEMTQNIND-AWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRIT 151 (418) Q Consensus 75 --~l~~~~~~~~-a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~ 151 (418) .+-.+..|.+ .+.+-.++|.|++++.- ++ ..|.+..+.+++++++++... +.|. -.|++. T Consensus 80 PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r-~~--------~~g~~~~L~~i~p~~v~v~~~-------~~~~-~~y~~~ 142 (406) T protein:vir:97 80 STSNASARTWKFAMAVNAILTGNSFSRILR-DP--------KTNQALQFQFYRPSETTVEET-------DNHE-IVYTFT 142 (406) T ss_pred CCCCCCHHHHHHHHHHHHhhcCCeEEEEEe-cC--------CCCeEEEEEEECCCeeEEEEc-------CCce-EEEEEE Confidence 1222334444 45556678999998743 21 235567888888888865422 1222 346665 Q ss_pred cCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc-eeecchHHHhhc Q lcl|NC_019404. 152 TNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA-VWKAKGLAELCD 230 (418) Q Consensus 152 ~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~-v~k~~~l~~~~~ 230 (418) ....+....+.++.||||.+.+ .....|.||+.. +.+.|.....+....+..+...... ++.+.+ ..+ T Consensus 143 ~~~~~~~~~~~~~evih~r~~~-------~dg~~G~spi~~-~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~--~~l- 211 (406) T protein:vir:97 143 DMLTAKQVKCFAHDVIHWKFFS-------HDTILGRSPLLS-LGDEIDLQTGGINTLIKFFKDGFSSGILTMKG--AQL- 211 (406) T ss_pred ecCCceEEEEccccEEEecCCC-------CCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEecC--CCC- Confidence 4434444578899999996432 122459999975 7788888878888787777553322 222221 111 Q ss_pred CcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCHH--HHHHHHHHHHhhhhcCCeeeeeccCccccccch Q lcl|NC_019404. 231 DSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGID--AFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ 308 (418) Q Consensus 231 ~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl~--~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg 308 (418) +.+.....++++....... +.+.+++...+.+|++++.+...+. +...+....||.+.+||..+|.+.+ .+ ++- T Consensus 212 ~~e~~~~~~~~~~~~~~g~-n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~-~~--~~~ 287 (406) T protein:vir:97 212 SGDARQRARQEFEKMREGS-VGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNS-PN--QSV 287 (406) T ss_pred CHHHHHHHHHHHHHHhccc-ccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCC-Cc--chH Confidence 2233445556665443332 3344445455678888876654433 4556678899999999998875432 22 233 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcc---CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCH Q lcl|NC_019404. 309 NTALETFHKLIDRKRNAELLPILEFLIPFI----VNA---EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDI 381 (418) Q Consensus 309 e~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~~---~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~ 381 (418) ++..+.|+.. .|.|++..+-..+ +.. ..+.++|+- .. + .+..++++.+++++|++++ T Consensus 288 e~~~~~f~~~-------~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~-~~-~-------~~~~~~~~~~~~~~g~~T~ 351 (406) T protein:vir:97 288 AQLMEDYVTN-------DLPFYFDAITSELGLKTLNDKDRRLYHIEFDT-RS-V-------TGRNVDEIVKLVNNQILTP 351 (406) T ss_pred HHHHHHHHHH-------HHHHHHHHHHHHHhhhhcChhhccceeEEEec-Cc-c-------chhhHHHHHHHHhCCCcCH Confidence 4455566553 3778776665443 222 245677752 11 1 2344667778999999999 Q ss_pred HHHHHHHHhhcCcCCC-Ch-----------hhcccccccC--CCccccccC Q lcl|NC_019404. 382 KEARDTLRTIAPEIKI-GD-----------NDIQTEESEL--ITETEVVIA 418 (418) Q Consensus 382 ~e~r~~l~~~~~~~~~-~~-----------~~~~~~e~~~--~~e~e~~~~ 418 (418) +|+|+.+.. .+..+. .| +..++..+.. ..+.-..-+ T Consensus 352 NE~R~~~g~-~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~gg~~~~ 401 (406) T protein:vir:97 352 NQGLVELGK-QKSTDPNMDRYQSSLNYVFLDKKEEYQDKVGIKGKGGEVNA 401 (406) T ss_pred HHHHHHhCC-CCCCCCCCCeEeeccCccchhcccccccccccccCCCCCCC Confidence 999998732 211110 01 0111000000 000000001 No 74 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=99.72 E-value=1.9e-17 Score=112.24 Aligned_cols=361 Identities=14% Similarity=0.100 Sum_probs=192.6 Q ss_pred Cccchhh-----------------------------------HHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchh Q lcl|NC_019404. 1 MVKTDSY-----------------------------------ANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRI 45 (418) Q Consensus 1 ~~~~D~~-----------------------------------~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~i 45 (418) ---.|+| ...+....+..... ....+.. . ..+++-+.++ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~---~-al~~~~V~ac 76 (441) T protein:vir:98 3 WYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTK--LRQYKDI---E-AIRHSDIFTA 76 (441) T ss_pred eecCccceeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccC--ccccchh---h-hhccHHHHHH Confidence 0111222 11111000000000 0011111 1 1245566789 Q ss_pred hhcchhhhccCCccccCcchHH---HHHHHH----HHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCce Q lcl|NC_019404. 46 IDTIPETALAAGFHIDGIDDEP---AFWSRW----DDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAEL 117 (418) Q Consensus 46 Vd~~a~d~~r~~~~i~~~~d~~---~i~~~~----~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i 117 (418) |+.+|+++-+-++.+....... .+...+ ...--...|.+++. ...++|.|++++.- + ..|.+ T Consensus 77 v~~Ia~~iA~lpl~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r-~---------~~G~~ 146 (441) T protein:vir:98 77 VMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITR-D---------KTGEP 146 (441) T ss_pred HHHHHHhhccCceEEecCCcccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEE-c---------CCCcE Confidence 9999999998888774321111 111111 11122234444444 45678999988743 2 23567 Q ss_pred EEEEEeeccccccccccccccccccCcceEEEEe--cCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHH Q lcl|NC_019404. 118 ETVRVYDRTQVKVQNREENPRNARFGKPLTYRIT--TNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDIL 195 (418) Q Consensus 118 ~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~--~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~ 195 (418) ..|.+++++.+++.... .|.+.++... ..+......+.++.||||...+ ....+|.||+.. +. T Consensus 147 ~~L~~i~~~~v~v~~~~-------~g~~~~~~~~~~~~~~~~~~~~~~~dviHir~~~-------~dg~~G~spi~~-~~ 211 (441) T protein:vir:98 147 MNLTFRKTSEIELKLDA-------RGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYS-------LDGINGLSLLDT-LS 211 (441) T ss_pred EEEEEEcCceeEEEECC-------CCcEEEEEEEeccCcceeeEEEccccEEEeccCC-------CCCccccCHHHH-HH Confidence 78899999888764421 2444333322 2233334568999999996432 123579999975 77 Q ss_pred HHHHHHHHHHHHHHHHHHHcCC--ceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC Q lcl|NC_019404. 196 DSIKDYTNCERLATQLLRRKQQ--AVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG 273 (418) Q Consensus 196 ~~l~~~~~~~~~~~~l~~~~~~--~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g 273 (418) +.|.....+......++..... -++++++ .+.+.+..+.+++++........+.+.+++..++.+|+.++.+... T Consensus 212 ~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~---~~~~~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d 288 (441) T protein:vir:98 212 RTIESDNNGKDFLNNFLRNGTHAGGILKMKG---VLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEV 288 (441) T ss_pred HHHHHHHHHHHHHHHHHhccCCCcEEEEeCC---CCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhH Confidence 8888888888888888777543 3556653 1122222233444444433222233345555566788888776543 Q ss_pred --HHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcc--CCceE Q lcl|NC_019404. 274 --IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VNA--EEWSV 346 (418) Q Consensus 274 --l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~--~~~~~ 346 (418) +-+...+....||.+.|||..+| |...++.+. ++....|.. .|.|++..+-..| +.+ .+..| T Consensus 289 ~q~~e~r~~~~~~Ia~~fgVPp~~l-g~~~~~~s~--~q~~~~y~~--------tl~P~~~~ie~~ln~~L~~~~~~~~~ 357 (441) T protein:vir:98 289 LKLIRENKSSTREIAGVFGIPLHKF-GIETANMSI--TDANLDYLS--------TLKPYITCVCAELNFKFNDEYVNREF 357 (441) T ss_pred HHHHHHHHHhHHHHHHHhCCCHHHc-CCCCCCccH--HHHHHHHHH--------HHHHHHHHHHHHHHhhccccccCceE Confidence 45667788899999999999766 655554332 222222322 3677766654433 222 34445 Q ss_pred Ee--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC---------hhhccccc--------- Q lcl|NC_019404. 347 EF--SPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIG---------DNDIQTEE--------- 406 (418) Q Consensus 347 ~f--~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~---------~~~~~~~e--------- 406 (418) +| +.|...|.+++ ++++++++++|++|++|+|+.+. ..|..+.+ .-.++..+ T Consensus 358 ~fd~~~llr~d~~~~-------~~~~~~~~~~G~~T~NE~R~~~g-l~pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~ 429 (441) T protein:vir:98 358 KFDTTEIRVVDEKTQ-------AEIDKINIDSGKMNIDEIRQRDG-LAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRA 429 (441) T ss_pred EEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCcceEeecccccccccccccccccccc Confidence 55 46666666554 77788999999999999998762 22211111 00111000 Q ss_pred --c--cCCCccc Q lcl|NC_019404. 407 --S--ELITETE 414 (418) Q Consensus 407 --~--~~~~e~e 414 (418) . +.=+++| T Consensus 430 ~~~~~kgGe~ne 441 (441) T protein:vir:98 430 TDKKLKGGEENE 441 (441) T ss_pred cccccCCCCCCC Confidence 0 1111222 No 75 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=99.72 E-value=1e-17 Score=113.75 Aligned_cols=362 Identities=12% Similarity=0.092 Sum_probs=205.8 Q ss_pred CccchhhHHHH----hcCCC----CccccC------cc--ccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc--C Q lcl|NC_019404. 1 MVKTDSYANIF----LGGSD----GSEIYG------SL--QNQAPTILASLYADNALVRRIIDTIPETALAAGFHID--G 62 (418) Q Consensus 1 ~~~~D~~~n~~----~g~~~----~~~~~~------~~--~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~ 62 (418) |.+-=|+-|-+ .|... .+...+ +. ...+. .-+.+++.+.+||+.+++++-+-++.+- . T Consensus 10 ~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~----~~al~~~~v~~cv~~Ia~~iA~lp~~vy~~~ 85 (424) T protein:vir:18 10 LRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSIND----ERILQISTVWRCVSLISTLTACLPLDVFETD 85 (424) T ss_pred cCCCCchHHHHHhhccccccccccchhhccccccccccccccccH----HHhhccHHHHHHHHHHHHhhccCceEEEEec Confidence 22222333333 22110 000011 10 11122 2245678889999999999998888772 1 Q ss_pred cch-HHH------HHHHHH----HhCchHHHHH-HHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccc Q lcl|NC_019404. 63 IDD-EPA------FWSRWD----DLEMTQNIND-AWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKV 130 (418) Q Consensus 63 ~~d-~~~------i~~~~~----~l~~~~~~~~-a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~ 130 (418) .++ ..+ +...+. ..-....|.+ .+.+-.++|.|++++.- + ..|.+..+.++++..+++ T Consensus 86 ~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~---------~~G~~~~L~~l~~~~v~v 155 (424) T protein:vir:18 86 QNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR-N---------SAGDVISLLPLQSANMDV 155 (424) T ss_pred cCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEecCcceEE Confidence 111 111 111111 1122233444 44456678999988742 2 345677899999888876 Q ss_pred ccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 131 QNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQ 210 (418) Q Consensus 131 ~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~ 210 (418) ... +.+-.|++...+. ...++++.|||+.+.. ....+|.||+.. +.+.|.....+...... T Consensus 156 ~~~---------~~~~~y~~~~~g~--~~~~~~~eVihir~~~-------~dg~~G~spi~~-~~~~i~~~~~~~~~~~~ 216 (424) T protein:vir:18 156 KLV---------GKKVVYRYQRDSE--YADFSQKEIFHLKGFG-------FTGLVGLSPIAF-ACKSAGVAVAMEDQQRD 216 (424) T ss_pred EEc---------CCeEEEEEEeCCe--EEEeccccEEEecCcC-------CCCcccccHHHH-HHHHHHHHHHHHHHHHH Confidence 331 2334677765543 2579999999996432 123579999975 78889998889888988 Q ss_pred HHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHh Q lcl|NC_019404. 211 LLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIV 286 (418) Q Consensus 211 l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~ia 286 (418) ++...... ++++++ ..+ +.+....++++++......+.++.+++ .++.+|+.++.+..+ +-+...+..+.|| T Consensus 217 ~f~ng~~~~gil~~~~--~~l-~~e~~~~~~~~~~~~~~~~nag~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 292 (424) T protein:vir:18 217 FFANGAKSPQILSTGE--KVL-TEQQRSQVEENFKEIAGGPVKKRLWIL-EAGFSTSAIGVTPQDAEMMASRKFQVSELA 292 (424) T ss_pred HHhccCCcceEEEeCC--cCC-CHHHHHHHHHHHHHHhCCcccCCceec-cCCceEEecCCChhHHHHHHHHHHhHHHHH Confidence 88876544 556542 112 233334455555544333333344444 456788888776543 4566778889999 Q ss_pred hhhcCCeeeeeccCccccc--cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---c-c---CCce--EEeCCCCCCC Q lcl|NC_019404. 287 ALSGIHEIILKNKNVGGLS--SSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---N-A---EEWS--VEFSPLDHES 355 (418) Q Consensus 287 aas~IP~t~L~G~s~~gl~--stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~-~---~~~~--~~f~pL~~~~ 355 (418) .+.|||..+| |...++-. |+-++....|+.. -|.|++..+-..|- . . .++. |++..|...| T Consensus 293 ~~fgVPp~~l-g~~~~~t~~~sn~eq~~~~f~~~-------tl~P~~~~ie~~ln~~L~~~~~~~~~~~~fd~~~llr~d 364 (424) T protein:vir:18 293 RFFGVPPHLV-GDVEKSTSWGSGIEQQNLGFLQY-------TLQPYISRWENSIQRWLIPSKDVGRLHAEHNLDGLLRGD 364 (424) T ss_pred HHhCCCHHHh-CCCCCcccccccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccC Confidence 9999998766 65543322 2234455566543 47888877655442 1 2 2344 4556777777 Q ss_pred HHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhh--------cccccccCCCccccc Q lcl|NC_019404. 356 SKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDND--------IQTEESELITETEVV 416 (418) Q Consensus 356 eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~--------~~~~e~~~~~e~e~~ 416 (418) .+++ ++++.++++.|++|++|+|+.+. ..+..+ .|+- +.....+.+.++.+- T Consensus 365 ~~~r-------~~~~~~~~~~G~~T~NE~R~~~g-l~pi~g-gD~~~~~~n~~~l~~~~~~~~~~~n~a 424 (424) T protein:vir:18 365 SASR-------AAFMKAMGESGLRTINEMRRTDN-MPPLPG-GDVAMRQAQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred HHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCC-cCeeeeccCccchhhhhccCCccccCC Confidence 7765 66777889999999999998762 222211 1111 111111112222222 No 76 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=99.72 E-value=5e-17 Score=109.95 Aligned_cols=361 Identities=12% Similarity=0.086 Sum_probs=203.6 Q ss_pred CccchhhHHHHhcC------CCCccc----c----C---ccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc-- Q lcl|NC_019404. 1 MVKTDSYANIFLGG------SDGSEI----Y----G---SLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID-- 61 (418) Q Consensus 1 ~~~~D~~~n~~~g~------~~~~~~----~----~---~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~-- 61 (418) =+..|+...+|.+. .+.+.. . + ...+.+. ..+.+++.+.++|+.+|++.-+-++.+. T Consensus 9 ~~~~~~~~~~~~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~----~~al~~~~v~~cv~~Ia~~iA~lp~~v~~~ 84 (424) T protein:vir:45 9 WLWPEGGRVLLDALFRSKSLENPSTPITGDAVDTDGLFRADVYVSP----ETAMKLAAVYSCIYVLSSSLAQMPLHVMRR 84 (424) T ss_pred eecCcchhHHHHhhccccCCCCCccccchhhhhhhccccCCceech----HHhhccHHHHHHHHHHHHHHhhCceEEEEe Confidence 12224433333221 111000 0 0 0001111 2235678889999999999999888772 Q ss_pred CcchHH-----HHHHHHH----HhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccc Q lcl|NC_019404. 62 GIDDEP-----AFWSRWD----DLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ 131 (418) Q Consensus 62 ~~~d~~-----~i~~~~~----~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~ 131 (418) .++... .+...+. ..-....|.+.+. +..++|.|++++.- + ..|.+..+.+++++.+++. T Consensus 85 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r-~---------~~G~~~~L~~l~~~~v~i~ 154 (424) T protein:vir:45 85 HKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKR-N---------RRGEVISLDCCMPWETTLM 154 (424) T ss_pred cCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEE-c---------CCCcEEEEEEecCceEEEE Confidence 111111 1121221 2223345555555 45667999988743 2 2356778888888877654 Q ss_pred cccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 132 NREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQL 211 (418) Q Consensus 132 ~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l 211 (418) .. .+ .-.|++...++. ..++++.||||.+.. ....+|.||+.. +.+.|.....+......+ T Consensus 155 ~~--------~~-~~~y~~~~~~~~--~~~~~~eVih~r~~~-------~d~~~G~spi~~-~~~~i~~~~~~~~~~~~~ 215 (424) T protein:vir:45 155 NT--------GG-RYTYGLYNEYGA--FAISPDDMIHIRALG-------NNQKMGLSPIMQ-HAETIGMGMSGQKYTESF 215 (424) T ss_pred Ec--------CC-eEEEEEEecCce--EEECcccEEEecCcC-------CCCcccccHHHH-HHHHHHHHHHHHHHHHHH Confidence 31 12 235666654433 469999999996432 124579999975 788899888888888888 Q ss_pred HHHcCC--ceeecchHHHhhcCcchHHHHHHHHHHHHHh-cCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHh Q lcl|NC_019404. 212 LRRKQQ--AVWKAKGLAELCDDSEGFGAARLRLAQVDNN-SGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIV 286 (418) Q Consensus 212 ~~~~~~--~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~-~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~ia 286 (418) +..... -++++++. + +++......+++...... ..+.+.+++..++.+|+.++.+..+ +-+...+..+.|| T Consensus 216 f~ng~~p~gil~~~~~---l-~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 291 (424) T protein:vir:45 216 FSGNARPAGIVSVKSG---L-NKESWGWLKDQWQKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIA 291 (424) T ss_pred HhccCCccEEEEeCCC---C-CHHHHHHHHHHHHHHhccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHH Confidence 877554 35565531 2 222333444444433322 1233445555556788888876544 4467778889999 Q ss_pred hhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hc----cCCceEEeC--CCCCCCH Q lcl|NC_019404. 287 ALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VN----AEEWSVEFS--PLDHESS 356 (418) Q Consensus 287 aas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~----~~~~~~~f~--pL~~~~e 356 (418) .+.+||..+| |...++-.++.|.....|+.. .|.|++..+-..+ +. ..++.|+|+ .|...|. T Consensus 292 ~~fgVPp~~l-g~~~~~t~sn~eq~~~~f~~~-------tL~P~~~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~ 363 (424) T protein:vir:45 292 GIFNIPAHMI-NDLEKATFSNISAQAIQFVRY-------TMMPWVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTP 363 (424) T ss_pred HHhCCCHHHh-CCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhcCChhhhcCCcEEEeechhhhccCH Confidence 9999999766 544333334556566666553 4778777665443 21 235555554 6666666 Q ss_pred HHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCC----------C-hhhcccccccCCCccc Q lcl|NC_019404. 357 KDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKI----------G-DNDIQTEESELITETE 414 (418) Q Consensus 357 ke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~----------~-~~~~~~~e~~~~~e~e 414 (418) +++ ++++++++++|++|++|+|+.+. ..+..+- + .++......+...++| T Consensus 364 ~~r-------~~~~~~~~~~g~~T~NE~R~~~g-l~pi~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 364 QER-------AQFYHFAITDGWMSRNEARAFED-MNPVEGLDEMLVSVNAANPAGDFKPPKNDEGKTNE 424 (424) T ss_pred HHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcceeeecccccccccccCCCCCCCCCCCC Confidence 554 67788899999999999998752 2221110 0 1111111111112222 No 77 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=99.71 E-value=4.7e-17 Score=110.12 Aligned_cols=361 Identities=11% Similarity=-0.009 Sum_probs=196.6 Q ss_pred hhhHHHHhcCCCCc---------cccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHH Q lcl|NC_019404. 5 DSYANIFLGGSDGS---------EIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDD 75 (418) Q Consensus 5 D~~~n~~~g~~~~~---------~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~ 75 (418) =|+.+-..+..... ..+..+..-...-....+.+++.+.++|+.+|+++-+-.+.+..... +.+..+-.. T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~-~~L~~~PN~ 79 (382) T protein:vir:48 1 MPIFNLATESPPDNQGGFFDVVDSDFLASLKGNEWVSAETALRNSDLFSIINQLSNDLATVKLITSRKKL-QGIVDNPSN 79 (382) T ss_pred CccccccccCCcccccccccchhhhccccccCCcccchHhhhccHHHHHHHHHHHHhhccCceeeecchh-hhhhhhcCC Confidence 00000000000000 00000000000111223467888999999999999998888875433 344444444 Q ss_pred hCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCC Q lcl|NC_019404. 76 LEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNE 154 (418) Q Consensus 76 l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~ 154 (418) +--+..|.+.+.+ -.++|.|++++.- | ..|.+..+.++++.++++... ..|....|++...+ T Consensus 80 ~~t~~~f~~~l~~~l~l~Gna~~~i~r-d---------~~G~~~~l~~i~~~~v~v~~~-------~~~~~~~y~~~~~~ 142 (382) T protein:vir:48 80 NANRFNFYQSIFAQMLLGGEAFAYRWR-N---------ENGRDMKWEYLRPSQVSFNRL-------DNKDGIYYNITFDD 142 (382) T ss_pred CCCHHHHHHHHHHHhhhcCCEEEEEEE-C---------CCCcEEEEEEEcCceeEEEEc-------CCCCeEEEEEEecC Confidence 4445556666665 4567989988753 2 235677889999988876542 13445567776443 Q ss_pred c--ccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhc Q lcl|NC_019404. 155 S--DMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCD 230 (418) Q Consensus 155 ~--~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~ 230 (418) . +....++++.||||.... +....+|.|++.. +.+.|.....+.......+.....+ ++++++. +. T Consensus 143 ~~~~~~~~~~~~evih~~~~~------~~~~~~G~s~l~~-~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~---~~ 212 (382) T protein:vir:48 143 PRIPPKQHVPQNDVLHFRLLS------VDGGMTSVSPLMA-LSRELDIQKASGNLTINSLKNALNANGILKIKGG---GL 212 (382) T ss_pred ccccceeEEcCccEEEecCCC------CCCccccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC---CC Confidence 2 233568899999996432 3345689999975 7889998888888888888876654 5566531 11 Q ss_pred CcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCccccccch Q lcl|NC_019404. 231 DSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ 308 (418) Q Consensus 231 ~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg 308 (418) .+...+..+.+. ....+ .+.+++..++.+|++++.+... +.+..+...+.||.+.+||..+| |.+..+- ++ T Consensus 213 -~e~~~~~~~~~~--~~~~n-~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~l-g~~~~~~-~~- 285 (382) T protein:vir:48 213 -LDFKTKLSRSRQ--AMKQM-QGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVV-GGQGDQQ-SS- 285 (382) T ss_pred -hHHHHHHHHHHH--hhccC-CCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcc-cH- Confidence 122222222222 22333 3444444556789988876653 44677888899999999998766 4432221 22 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHH Q lcl|NC_019404. 309 NTALETFHKLIDRKRNAELLPILEFLIPFIVN--AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARD 386 (418) Q Consensus 309 e~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~--~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~ 386 (418) ++..+.|+. ..|.|.+..+-..+-+ ..++.+........+.. .....+..++.+|+++++|+|+ T Consensus 286 ~~~~~~~~~-------~~l~p~~~~i~~~l~~~l~~~~~~~~~~~~~~~~~-------~~~~~~~~l~~~g~~t~~e~r~ 351 (382) T protein:vir:48 286 LEMSSDLYS-------KAVSRYLRPFLSELSQKLSCDVDADIFPAVDPTGS-------NYISRINSLVKTGTLAQNQGLY 351 (382) T ss_pred HHHHHHHHH-------HHHHHHHHHHHHHHHHHhcChhhhhhhhhhccchh-------HHHHHHHHHhhcCccCHHHHHH Confidence 233445543 3467776666544421 11222221111122221 1233456789999999999999 Q ss_pred HHHhhcCcCC--CChhhc-ccccccCCCccc Q lcl|NC_019404. 387 TLRTIAPEIK--IGDNDI-QTEESELITETE 414 (418) Q Consensus 387 ~l~~~~~~~~--~~~~~~-~~~e~~~~~e~e 414 (418) .|...+-.+. ...++. +..+.=-+.+++ T Consensus 352 ~l~~~g~~~~~~~~~~~~~~~~~GGd~~~~~ 382 (382) T protein:vir:48 352 ILQQAEILPKELPNGENPNSTLKGGEEDGQD 382 (382) T ss_pred HHhhCCCCCcchhhhhcCCCCCCCCCCCCCC Confidence 8864432111 011111 111100011122 No 78 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=99.71 E-value=3.2e-17 Score=111.01 Aligned_cols=356 Identities=13% Similarity=0.055 Sum_probs=193.4 Q ss_pred Cccchhh---------------------------------------HHHHhcCCCCccccCccccCCHHHHHHHHHcCCc Q lcl|NC_019404. 1 MVKTDSY---------------------------------------ANIFLGGSDGSEIYGSLQNQAPTILASLYADNAL 41 (418) Q Consensus 1 ~~~~D~~---------------------------------------~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~ 41 (418) |=-.|-| ...+.++++. .| ...+.+ -..+++. T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~---~g--~~v~~~----~al~~~~ 71 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGEL---NG--GTGRET----RALRNMA 71 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCcc---Cc--ceechh----hhhccHH Confidence 1111111 0011111110 01 011111 1235788 Q ss_pred cchhhhcchhhhccCCcccc-CcchH-H----HHHHHHH----HhCchHHHHHH-HHhccccceEEEEEeecCCCccccc Q lcl|NC_019404. 42 VRRIIDTIPETALAAGFHID-GIDDE-P----AFWSRWD----DLEMTQNINDA-WSWARLFGGAAIVAIVKDNRALTSP 110 (418) Q Consensus 42 ~r~iVd~~a~d~~r~~~~i~-~~~d~-~----~i~~~~~----~l~~~~~~~~a-~~~~rl~G~~~i~i~~~d~~~l~~p 110 (418) +.+||+.+++++-+-++.+. .++.. . .+...+. ..-....|.+. +.+..++|.|++++.-++ T Consensus 72 V~~ci~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~------- 144 (431) T protein:vir:10 72 VLRCVTLISGTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG------- 144 (431) T ss_pred HHHHHHHHHHhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC------- Confidence 99999999999998888772 21111 1 1111111 12223345444 455667899999885422 Q ss_pred ccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchH Q lcl|NC_019404. 111 VREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVL 190 (418) Q Consensus 111 l~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l 190 (418) |.+..+.++++..+.+.... .|.+ .|.++..++. ...+.++.|+||.+..+ +..+|.||+ T Consensus 145 ----g~~~~L~pl~~~~v~~~~~~-------~~~~-~y~~~~~~g~-~~~~~~~dViHir~~~~-------dg~~G~spi 204 (431) T protein:vir:10 145 ----NRPIRLIPMDRGSAKGRLTS-------TWQI-VYDYTTPTGD-KIELPAREVFHLRDLSI-------DGVSGVSRV 204 (431) T ss_pred ----CceEEEEEEcCceeEEEEcC-------CCeE-EEEEEeCCce-EEEEchhhEEEecCcCC-------CCcccccHH Confidence 33556778888777654321 1232 4555543332 35689999999964321 235799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEee Q lcl|NC_019404. 191 SSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLN 268 (418) Q Consensus 191 ~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~ 268 (418) .. +.+.|.....+......++...... ++++++ .++ ++...++++++........+.+.+++..++.+|++++ T Consensus 205 ~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~ls-~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~ 279 (431) T protein:vir:10 205 KL-SGNALELAEQAERAASRTFRTGVMAGGAIEVPK---ELS-DNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFS 279 (431) T ss_pred HH-HHHHHHHHHHHHHHHHHHHhccCCccEEEecCC---CCC-HHHHHHHHHHHHHHhcCccccCCceecCCCceEEEcc Confidence 75 7899999999999999988875544 566663 122 2333444555443332222333444444557788887 Q ss_pred cccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcc-- Q lcl|NC_019404. 269 SDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VNA-- 341 (418) Q Consensus 269 ~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~-- 341 (418) .+... +-+...+....||.+.|||..+|.+...+. .++-|.....|+.. -|.|++..+-..+ +++ T Consensus 280 ~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t-~sn~eq~~~~f~~~-------tL~P~~~~ie~~ln~~Ll~~~ 351 (431) T protein:vir:10 280 NTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSW-GSGIEQLAIFFIQY-------GLSHWFVSWEQAAARAFLPEK 351 (431) T ss_pred CChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCc-cccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccChh Confidence 66543 445567778899999999998775433333 34455555666654 3788876664433 222 Q ss_pred --CCceE--EeCCCCCCCHHHHHHHHHHHHHHHHHHHhCC----CCCHHHHHHHHHhhcCcCCC-Chhhc-------ccc Q lcl|NC_019404. 342 --EEWSV--EFSPLDHESSKDKAEVLEKSVNSIAALIAAG----AMDIKEARDTLRTIAPEIKI-GDNDI-------QTE 405 (418) Q Consensus 342 --~~~~~--~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g----~i~~~e~r~~l~~~~~~~~~-~~~~~-------~~~ 405 (418) .++.| ++..|...|.+++ +++++++++.| ++|++|+|+.+. ..+..+. .|+-. ..+ T Consensus 352 ~~~~~~~~fd~~~llr~d~~~r-------~~~~~~~~~~G~~~g~lT~NE~R~~~g-l~p~~~~~gD~~~~p~n~~~~~~ 423 (431) T protein:vir:10 352 MLGQRQFKFNEGALLRGTLNDQ-------AAFFSKALGAGGQSPWMKQNEVREMLD-LPRADDPVADQLRNPMTQKQKGS 423 (431) T ss_pred hcCCceEEEechhhhccCHHHH-------HHHHHHHHhcccccCccCHHHHHHHhC-CCCCCCccccceecccccccCCC Confidence 24444 4556666676655 55666666555 599999998762 2221111 11000 001 Q ss_pred cccCCCcc Q lcl|NC_019404. 406 ESELITET 413 (418) Q Consensus 406 e~~~~~e~ 413 (418) -++++.-+ T Consensus 424 ~~~~p~~~ 431 (431) T protein:vir:10 424 GDEPPATT 431 (431) T ss_pred CCCCCCCC Confidence 11111112 No 79 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=99.70 E-value=3.5e-17 Score=110.82 Aligned_cols=320 Identities=11% Similarity=0.029 Sum_probs=173.7 Q ss_pred hhhcchhhhccCCccccCcchHHHHHHHHHHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEe Q lcl|NC_019404. 45 IIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVY 123 (418) Q Consensus 45 iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~ 123 (418) |-..|.. ..|+.-.+. .+-...+..+-...--...|.+.+. +-.++|.|++++.- + ..|.+..+.++ T Consensus 1 ia~lp~~-~~~~~~~~~-~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r-~---------~~G~~~~L~~l 68 (348) T protein:vir:93 1 MASLPLK-MYEDYKVVN-TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIER-D---------IYHQPSKLFLL 68 (348) T ss_pred CcccceE-eEecCcCcc-cHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEE Confidence 2222222 112111000 0000112111122223344544444 45678999988753 2 23567788899 Q ss_pred eccccccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHH Q lcl|NC_019404. 124 DRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTN 203 (418) Q Consensus 124 ~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~ 203 (418) ++..+++.... .+.+..|.+...++. ...++++.|+||.+.. +....+|.|++.. +.+.+..... T Consensus 69 ~~~~v~~~~~~-------~~~~~~y~~~~~~g~-~~~~~~~eiih~r~~~------~~~~~~G~s~~~~-~~~~i~~~~~ 133 (348) T protein:vir:93 69 NPDVVEMLIEN-------QSRELYYSIHAATGN-KLIVHNMDMLHFKHIV------ASNMVQGISPIDV-LKNTTDFDNA 133 (348) T ss_pred cCCceEEEEeC-------CCcEEEEEEEcCCCe-EEEEccccEEEecCCC------CCCceeeccHHHH-HHHHHHHHHH Confidence 88877654321 234556777655443 3568999999997543 2344679999865 6666655544 Q ss_pred HHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--CHHHHHHHH Q lcl|NC_019404. 204 CERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKK 281 (418) Q Consensus 204 ~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~ 281 (418) +......-.......+++.+. .+ +++.....++++... .. +.+.+++...+.+|+.++.+.. .+.+...+. T Consensus 134 ~~~~~~~~~~~~~~~i~~~~~---~l-~~e~~~~~~~~~~~~--~~-n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~ 206 (348) T protein:vir:93 134 VRTFNLTEMQKPDSFMLKYGS---NV-STEKRQQVLEDFKQY--YE-ENGGILFQEPGVEIEPLPKKYVSEDIVASENLT 206 (348) T ss_pred HHHHHHHhcCCCceeEEecCC---CC-CHHHHHHHHHHHHHH--hh-cCCCeeecCCCceEEEcCCChhHHHHHHHHHHH Confidence 443321111111122233332 11 223344455555443 23 3444555556678998887765 455667788 Q ss_pred HHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----c----CCc--eEEeCCC Q lcl|NC_019404. 282 FDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVN----A----EEW--SVEFSPL 351 (418) Q Consensus 282 ~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~----~----~~~--~~~f~pL 351 (418) ...||++.|||..+|.+...+. .++.++..+.|+..+ |.|.++.+-..|-+ . .+. +|+++.| T Consensus 207 ~~~Ia~~fgVP~~~lg~~~~~~-~~~~e~~~~~~~~~~-------l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l 278 (348) T protein:vir:93 207 RERVANVFQLPSIFLNARSNTN-FAKNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSY 278 (348) T ss_pred HHHHHHHhCCCHHHhCCCCCCC-cccHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhCCcccccCcceEEeechhh Confidence 9999999999998775433333 345666677776654 88888777555421 1 234 4455677 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChh-----------hcccc----cccCCCcccc Q lcl|NC_019404. 352 DHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDN-----------DIQTE----ESELITETEV 415 (418) Q Consensus 352 ~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~-----------~~~~~----e~~~~~e~e~ 415 (418) ...|.+++ |+++++++++|++|++|+|+.+. ..|..+ +|+ ...+. ....++.+|. T Consensus 279 ~~~d~~~~-------a~~~~~~~~~G~~T~NE~R~~~g-~~p~~g-gD~~~~~~n~~~~~~~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 279 LRADSATQ-------AEVYFKAVRSGYYTINDIREWED-LPPVEG-GDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 348 (348) T ss_pred hccCHHHH-------HHHHHHHHhCCCCCHHHHHHHhC-CCCCCC-cCeEeecccccccccchhhcccccCCCCCcCCC Confidence 77777665 66788999999999999999773 222111 111 11111 1111222333 No 80 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=99.70 E-value=6.1e-17 Score=109.46 Aligned_cols=359 Identities=12% Similarity=0.019 Sum_probs=199.0 Q ss_pred hhhHHHHhc-CCCCccccC---------cccc-CCH-HHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHH Q lcl|NC_019404. 5 DSYANIFLG-GSDGSEIYG---------SLQN-QAP-TILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSR 72 (418) Q Consensus 5 D~~~n~~~g-~~~~~~~~~---------~~~~-~~~-~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~ 72 (418) =|+.+.... ........+ .... ... .--...+.+++.+.++|+.+++++-.-++.+.-.. ...+..+ T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~~-~~~l~~~ 79 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASRKQ-LQGIIDN 79 (386) T ss_pred CcccccccccccccccccccccccccchhcccccCCceechhhhhcchHHHHHHHHHHHhhccCceeeccch-hHHHhhc Confidence 000010000 000000000 0000 000 01122346789999999999999999888876322 2334433 Q ss_pred HHHhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEe Q lcl|NC_019404. 73 WDDLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRIT 151 (418) Q Consensus 73 ~~~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~ 151 (418) ....-.+..|.+.+.+ -.++|.|++++.- + ..|.+..+.++++..+++.... .|.+..|++. T Consensus 80 pN~~~t~~~f~~~~~~~lll~Gna~~~i~r-~---------~~g~~~~L~~l~~~~v~v~~~~-------~~~~~~y~~~ 142 (386) T protein:vir:48 80 PSNNANRFNFYQSIFAQMLLGGEAFAYRWR-N---------ENGRDMKWEYLRPSQVSFNRLD-------NKDGIYYNIT 142 (386) T ss_pred CCCCCCHHHHHHHHHHHhhhcCcEEEEEEE-C---------CCCcEEEEEEecCceeEEEEcC-------CCceEEEEEE Confidence 3333334455555554 5668999888754 2 2355778889988888765421 3455677776 Q ss_pred cCCc--ccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHH Q lcl|NC_019404. 152 TNES--DMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAE 227 (418) Q Consensus 152 ~~~~--~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~ 227 (418) ..+. .....+-++.||||.+.. +....+|.|++.. +.+.+.....+......++.....+ ++++++ . T Consensus 143 ~~~~~~~~~~~~~~~evih~~~~~------~~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~--~ 213 (386) T protein:vir:48 143 FDDPRIPPKQHVPQGDVLHFKLLS------VDGGLTSVSPLMA-LSRELNIQKASDKLTLNSLKNALNANGILKIKG--G 213 (386) T ss_pred ecCccccceeEecCccEEEecCCC------CCCceeeccHHHH-HHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC--C Confidence 4432 233457788999996432 2334679999975 7788998888989999888875544 444442 1 Q ss_pred hhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCccccc Q lcl|NC_019404. 228 LCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVGGLS 305 (418) Q Consensus 228 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~ 305 (418) + ..+...+..+.+. .... +.+.+++..++.+|+.++.+... +.+..++..+.||++.+||..+| |.+.. + T Consensus 214 ~--~~e~~~~~~~~~~--~~~~-n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~--~ 285 (386) T protein:vir:48 214 G--LLDFKTKLSRSRQ--AMKQ-MQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVV-GGQGD--Q 285 (386) T ss_pred C--CHHHHHHHHHHHH--Hhhc-CCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCC--c Confidence 1 1122222332222 2233 33444444556789888876653 55777888999999999998766 43322 2 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHH Q lcl|NC_019404. 306 SSQNTALETFHKLIDRKRNAELLPILEFLIPFIVN--AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKE 383 (418) Q Consensus 306 stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~--~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e 383 (418) ++.++....|+.. .|.|+++.+-..|-+ -.++.+++.++...+.. ..+..+..++.+|+++++| T Consensus 286 ~~~e~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~~~~~~~~~~d~~-------~~~~~~~~l~~~g~~t~nE 351 (386) T protein:vir:48 286 QSSLEMSLDLYNK-------AVSRYLRPFLSELSQKLSCDVDADILPAVDPTGS-------NSVSRINSMVKSGTLAQNQ 351 (386) T ss_pred ccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcchhhcchhhhhccChH-------HHHHHHHHHHhCCCcCHHH Confidence 3345555666543 377777666444421 12344444444444443 2355667899999999999 Q ss_pred HHHHHHhhcCcCCCChhhcc-----cc-cccCCCccc Q lcl|NC_019404. 384 ARDTLRTIAPEIKIGDNDIQ-----TE-ESELITETE 414 (418) Q Consensus 384 ~r~~l~~~~~~~~~~~~~~~-----~~-e~~~~~e~e 414 (418) +|+.+....- .+-+....+ .. .-+.. ++| T Consensus 352 ~r~~lg~~~~-~~~~~~~~~~~~~~~~~gGd~~-~~~ 386 (386) T protein:vir:48 352 GLYILQQAEI-LPKELPEGENPNKTTLKGGEIN-GED 386 (386) T ss_pred HHHHhhcCCC-CCccchhhcCCCCCccCCCCCC-CCC Confidence 9998753221 111111111 11 01111 122 No 81 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=99.69 E-value=2.4e-17 Score=111.70 Aligned_cols=348 Identities=14% Similarity=0.024 Sum_probs=189.6 Q ss_pred CccchhhHHHH--------------------------hcCCCCcc--ccCccccCCHHHHHHHHHcCCccchhhhcchhh Q lcl|NC_019404. 1 MVKTDSYANIF--------------------------LGGSDGSE--IYGSLQNQAPTILASLYADNALVRRIIDTIPET 52 (418) Q Consensus 1 ~~~~D~~~n~~--------------------------~g~~~~~~--~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d 52 (418) +--.-|++..+ .+.++... ........+ .+.+..++.+.+||+.+++. T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t----~~~~~~~~~v~acV~~Ia~~ 91 (409) T protein:vir:83 16 LPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQ----DKLRTLIDVAWACIDLNASV 91 (409) T ss_pred cccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccc----hhhHhhhHHHHHHHHHHHHh Confidence 00001111110 01000000 000111111 23355678899999999999 Q ss_pred hccCCccccCc-chHHHHHH----HHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccc Q lcl|NC_019404. 53 ALAAGFHIDGI-DDEPAFWS----RWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQ 127 (418) Q Consensus 53 ~~r~~~~i~~~-~d~~~i~~----~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~ 127 (418) +-+-++.+.-. +..+.... .-..+-.+..|.+.+.+..+.|++++++...+ ..|.+..+.+++++. T Consensus 92 iA~lpl~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~l~~~lllGnay~~~i~r~---------~~G~~~~L~pl~p~~ 162 (409) T protein:vir:83 92 LSSMPIYRMRNGRIIDSVAWMSNPDPEVYTSWQEFAKQLFWDFQLGEAFVLPMAHG---------SDGYPIRFRVVPPWL 162 (409) T ss_pred hccCceEEeeCCccccchhhhcccCCCCCCCHHHHHHHHHHHHhhCCcEEEEEEEC---------CCCcEEEEEEECCcc Confidence 99888876421 11111111 11122345667777777777899988765443 235567888999888 Q ss_pred cccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 128 VKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERL 207 (418) Q Consensus 128 i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~ 207 (418) +++.... | |. ..|++... ..++.||||... .+....+|.||++. +-..+.....+... T Consensus 163 v~v~~~~-~------g~-~~y~~~~~-------~~~~eiiHir~~------~~~~~~~G~spi~~-~~~~i~~~~a~~~~ 220 (409) T protein:vir:83 163 VNVELKK-G------AR-REYRIGGL-------NVTDEILHIRYQ------GNTADAHGHGPLES-AAPRQVVIGLLQKY 220 (409) T ss_pred eEEEEcC-C------ce-EEEEEccc-------cCccceEEeCCC------CCCCCcccccHHHH-HHHHHHHHHHHHHH Confidence 7754321 1 22 34666442 234678998532 23345689999975 67788887777777 Q ss_pred HHHHHHHcCC--ceeecchHHHhhcCcchHHHHHHHHHHHHHhcCC-cceeEEEcCCCceeEeecccCC--HHHHHHHHH Q lcl|NC_019404. 208 ATQLLRRKQQ--AVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGV-GQAIGIDAESEEYSVLNSDIGG--IDAFLDKKF 282 (418) Q Consensus 208 ~~~l~~~~~~--~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~-~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~ 282 (418) ...++..... -++++++ .+ +++.....+++++. ...++ +..+++.+..+.++.++.+..+ +-+...+.. T Consensus 221 ~~~~f~nga~p~gil~~~~---~l-s~e~~~~~~~~~~~--~~~~nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~ 294 (409) T protein:vir:83 221 VQNLAETGGVPLYWLGVER---RL-SETEAVDLMDRWIE--SRSKYAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNE 294 (409) T ss_pred HHHHHhcCCCcceEeecCC---CC-CHHHHHHHHHHHHH--hhCCccCccceecCCcccccccCCCHHHHHHHHHHHhhH Confidence 7777765332 3455553 12 22233445555543 22223 3445555543333445655443 345556778 Q ss_pred HHHhhhhcCCeeeeeccCccccc---cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---ccCCceEE--eCCCCCC Q lcl|NC_019404. 283 DRIVALSGIHEIILKNKNVGGLS---SSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---NAEEWSVE--FSPLDHE 354 (418) Q Consensus 283 ~~iaaas~IP~t~L~G~s~~gl~---stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~~~~~~~~--f~pL~~~ 354 (418) ..||.+.+||.. |+|....+-+ |.-|+....|+.. -|.|++.++-..+- ...+..++ +..|... T Consensus 295 ~eIa~~fgVPp~-llg~~~~~~~~tysn~eq~~~~f~~~-------tL~P~~~~ie~~l~~~Ll~~~~~~~f~~~~llr~ 366 (409) T protein:vir:83 295 ARIAILLGVPPF-LVGLPGATGSLTYSNIEQLFSFHDRS-------SLRPKATAVMAALDRWALPSPQHLELNRDDYTRP 366 (409) T ss_pred HHHHHHhCCCHH-HccCCCCccccccccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhCCCCcEEEeehhhhhcc Confidence 899999999975 5565432211 2235566666643 37787766655442 23334444 4566666 Q ss_pred CHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhhcccccccCCCccccc Q lcl|NC_019404. 355 SSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDNDIQTEESELITETEVV 416 (418) Q Consensus 355 ~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e~e~~ 416 (418) |.++ +++++++++++|++|++|+|+.+ ...+..|. +++ ++=+| T Consensus 367 d~~~-------r~~~~~~~~~~G~lT~NE~R~~~-glpp~~gg--d~l---------~~~gv 409 (409) T protein:vir:83 367 SLVE-------RATAYKIMIEAGVMEPNEARAME-RLHSEAAA--VRL---------SGGGV 409 (409) T ss_pred CHHH-------HHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCC--ccc---------CCCCC Confidence 6655 47789999999999999999864 22111111 111 11112 No 82 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=99.68 E-value=1.2e-16 Score=107.91 Aligned_cols=363 Identities=13% Similarity=0.004 Sum_probs=185.6 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCchH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMTQ 80 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~~ 80 (418) ....++..+.+-.. .....+...+.+. .-+.+++.+.+||+.+++++-+-++.+... ....+..+-..+.... T Consensus 15 ~~~~~~~~~~~~~~--~~~~~~~~~~v~~----~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~-~~~~l~~~PN~~~t~~ 87 (384) T protein:vir:49 15 PSNQDSFFDITDPE--FLDALNGSEWVSA----ETALKNSDLFSIISQLSNDLATAKITTSRK-QLQGIVDNPSNNANRF 87 (384) T ss_pred cccchhhccccchh--hcccccCCceech----hhhhccHHHHHHHHHHHHHHhhCceeeecc-hhhhhhhccCCCCCHH Confidence 11111111110000 0000000011111 123568889999999999999998888643 2233444333444455 Q ss_pred HHHHHHHhc-cccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCC--ccc Q lcl|NC_019404. 81 NINDAWSWA-RLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNE--SDM 157 (418) Q Consensus 81 ~~~~a~~~~-rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~--~~~ 157 (418) .|.+.+... .++|.|++++.- + ..|.+..+.++++..+++.... .+....|++...+ .+. T Consensus 88 ~f~~~l~~~lll~Gna~~~i~r-~---------~~g~~~~L~~l~~~~v~v~~~~-------~~~~~~y~~~~~~~~~~~ 150 (384) T protein:vir:49 88 NFYQSIFAQMLLGGEAFAYRWR-N---------ENGRDMKWEYLRPSQVSFNRLD-------NQNGLYYNITFDDPRIPP 150 (384) T ss_pred HHHHHHHHHhhhcCCeEEEEEE-C---------CCCcEEEEEEEcCceeEEEEcC-------CCceEEEEEEecCccccc Confidence 566666655 557999988754 2 2356778999998888765421 1233466665432 223 Q ss_pred ccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchH Q lcl|NC_019404. 158 FYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGF 235 (418) Q Consensus 158 ~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~ 235 (418) ...++++.||||.+.. +....+|.||+.. +.+.+.....+......++.....+ ++++++.. ...+ T Consensus 151 ~~~~~~~eVih~~~~~------~~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~---~~~~-- 218 (384) T protein:vir:49 151 KQHVPQGDILHFRLLS------VDGGLTSVSPLMA-LGRELNIQKASDKLTLNALKNALNANGILKIKGGG---LLDF-- 218 (384) T ss_pred eeEecCccEEEecCCC------CCCceeeccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC---ChHH-- Confidence 3578999999996432 2334679999974 8899999999999999988876543 45665321 1111 Q ss_pred HHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHH Q lcl|NC_019404. 236 GAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALE 313 (418) Q Consensus 236 ~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~ 313 (418) ..+..........+.+.+++..++.+|+.++.+... +.+..+...+.||.+.|||..+|.+ +.++ +++.+.-.. T Consensus 219 --~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~-~~~~-~~~~~~~~~ 294 (384) T protein:vir:49 219 --KTKQSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGG-EGDK-QSSLEMIYN 294 (384) T ss_pred --HHHHHHHHHhcccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCC-CCCc-cccHHHHHH Confidence 112222222223344445555566889888876654 4567788999999999999887754 3222 233332222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcC Q lcl|NC_019404. 314 TFHKLIDRKRNAELLPILEFLIPFIVNAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAP 393 (418) Q Consensus 314 ~y~~~I~~~Qe~~l~p~l~~l~~~i~~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~ 393 (418) .|...| +..++|+++.+-..+.. ++.+. +...++.+...... ....++.+|+.+..|+++.|...+- T Consensus 295 ~~~~~i----~~~l~pi~~~i~~~l~~----~l~~~----~~~~~~~~~~~~~~-~~~~l~~~~~~t~~e~~~~l~~~g~ 361 (384) T protein:vir:49 295 IYFKAV----SRFLRPFVSELSKKLSC----EVDAD----ILPAVDPTGSNYIG-LINSMVKTGTLAQNQGLYVLQQAEI 361 (384) T ss_pred HHHHHH----HHHHHHHHHHHHHHhch----hhhhh----hhhhhhccchHHHH-HHHHHhhcCcccHHHHHHHHhhCCC Confidence 222222 22355555555444321 11110 01111111111111 1223455556666666555543321 Q ss_pred cCCCChhhcccccccCCCcccccc Q lcl|NC_019404. 394 EIKIGDNDIQTEESELITETEVVI 417 (418) Q Consensus 394 ~~~~~~~~~~~~e~~~~~e~e~~~ 417 (418) .+ -..-.++...+-.-+++..-| T Consensus 362 ~~-ne~r~~~~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 362 LP-KDLPEGETDSTLKGGETNEQY 384 (384) T ss_pred CC-hhHHHHcCCCCCCCCCCCCCC Confidence 11 000011111111112222222 No 83 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=99.67 E-value=1.8e-16 Score=106.86 Aligned_cols=374 Identities=11% Similarity=0.050 Sum_probs=195.2 Q ss_pred hhhHHHH-------hcCCCC---ccccCccccCC-HHHHHHHHHcCCccchhhhcchhhhccCCccc---cCcchHH--- Q lcl|NC_019404. 5 DSYANIF-------LGGSDG---SEIYGSLQNQA-PTILASLYADNALVRRIIDTIPETALAAGFHI---DGIDDEP--- 67 (418) Q Consensus 5 D~~~n~~-------~g~~~~---~~~~~~~~~~~-~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i---~~~~d~~--- 67 (418) =||.+-+ .+.... +.........+ ...+.+.|..++-+++||+.+++++-+-++.+ .++.... T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~~~~~ 80 (423) T protein:vir:81 1 MGFLQKLGLAPSVVATPEPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGGRERVR 80 (423) T ss_pred CchhHhhccccccccCccccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCceeeec Confidence 0111110 000000 00000011111 12567778899999999999999999988876 1221111 Q ss_pred --HHHHHHHH---hCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccc Q lcl|NC_019404. 68 --AFWSRWDD---LEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNAR 141 (418) Q Consensus 68 --~i~~~~~~---l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~ 141 (418) .+..-+.+ +-....|.+++.+ -.++|.|++++.-+.+. .+.+..+++++...+.+.... | . T Consensus 81 ~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~--------~~~~~~l~p~~~~~v~~~~~~-~----~ 147 (423) T protein:vir:81 81 EGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGV--------DTPTLDIRPIPVSWVQRRAYK-D----G 147 (423) T ss_pred cchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCc--------CcceEEEeecccceeeeeecc-C----C Confidence 12212222 2234555555554 45789998877543222 123445556555555443221 1 1 Q ss_pred cCcceEEEEec--CCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc- Q lcl|NC_019404. 142 FGKPLTYRITT--NESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA- 218 (418) Q Consensus 142 yg~p~~y~i~~--~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~- 218 (418) .|.. .|++.. ...+....++++.|||+.+. .+....+|.||+.. +.+.|.....+......++.....+ T Consensus 148 ~~~~-~Y~~~~~~~~~g~~~~~~~~evih~r~~------~~~~~~~G~spi~~-~~~~i~~~~~~~~~~~~~f~ng~~p~ 219 (423) T protein:vir:81 148 WGSL-DYIIIESGDNDGRSVKVPGERVIHRHGY------NPKTMKRGKSPVQS-LRDILGEQIEAAIFRAQMWRNGPRPG 219 (423) T ss_pred Ccce-EEEEEEecCCCceEEEEcccceEEecCC------CCCCccccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCc Confidence 2222 344432 12223357899999999632 12334579999975 7899988888888888888665433 Q ss_pred -eeecchHHHhhc-CcchHHHHHHHHHHHHH-hcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCe Q lcl|NC_019404. 219 -VWKAKGLAELCD-DSEGFGAARLRLAQVDN-NSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHE 293 (418) Q Consensus 219 -v~k~~~l~~~~~-~~~~~~~~~~r~~~~~~-~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~ 293 (418) +++++.....-. +.+......+++...-. .-.+.+.+++..++.+|+.++.+..+ +-+...+....||.+.+||. T Consensus 220 gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp 299 (423) T protein:vir:81 220 MVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINP 299 (423) T ss_pred eEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCH Confidence 555553211000 11122233333333221 11223445555556788888776643 33456677888999999997 Q ss_pred eeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hc-----cCCceEEe--CCCCCCCHHHHHHH Q lcl|NC_019404. 294 IILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VN-----AEEWSVEF--SPLDHESSKDKAEV 362 (418) Q Consensus 294 t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~-----~~~~~~~f--~pL~~~~eke~ae~ 362 (418) .+| |...++-.++-|...+.||.. .|.|.+..+-..+ +. ..++.|+| +.|...|-+++ T Consensus 300 ~~l-g~~~~~t~sn~e~~~~~f~~~-------~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r--- 368 (423) T protein:vir:81 300 TMV-GQLDNANYSNVREFRKALYGD-------NLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEA--- 368 (423) T ss_pred HHh-cCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHH--- Confidence 655 765444444456666667664 3677665554333 21 23455666 46666666554 Q ss_pred HHHHHHHHHHHH-hCCCCCHHHHHHHHHhhcCcCCCCh----hhc-cccccc-CCCcccc Q lcl|NC_019404. 363 LEKSVNSIAALI-AAGAMDIKEARDTLRTIAPEIKIGD----NDI-QTEESE-LITETEV 415 (418) Q Consensus 363 ~~~~a~a~~~~~-~~g~i~~~e~r~~l~~~~~~~~~~~----~~~-~~~e~~-~~~e~e~ 415 (418) ++++++++ +.|++|++|+|+.+. ..+..+-+. ... +.+.++ +-++.|- T Consensus 369 ----~~~~~~~l~~~G~~T~NE~R~~~g-l~p~~gGD~~~~p~n~~~~~~~~~~~~~~~t 423 (423) T protein:vir:81 369 ----AEIKRAAVGNVAWMTINEVRAMDN-LPSIDGGDDLARPLNTEFGDSEDAPGEEVET 423 (423) T ss_pred ----HHHHHHHHhCCCCcCHHHHHHHhC-CCCCCCcceeecccccccCccCCCCCCCCCC Confidence 55566666 469999999998762 222111110 011 111111 1122222 No 84 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=99.67 E-value=1.2e-17 Score=113.33 Aligned_cols=375 Identities=12% Similarity=0.070 Sum_probs=190.3 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHH--HHcCCccchhhhcchhhhccCCccccCcchH---HHHHHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASL--YADNALVRRIIDTIPETALAAGFHIDGIDDE---PAFWSRWDD 75 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~--Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~---~~i~~~~~~ 75 (418) +-|.+-+..-..|-..- ...+. ..+.++... ...+.++++|||..++.++-+|+.+.+++|. +.+.+.|++ T Consensus 21 ~~r~~~l~~Yy~g~~~i-~~~~~---~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~d~~~~~~~~~~~~~ 96 (456) T protein:vir:79 21 MSRVRLLARYSNGDAPL-PELTR---NTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDLALRARRIWRD 96 (456) T ss_pred HHHHHHHHHHHhccCCh-hhcCc---ccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecCCCCCccHHHHHHHHHHh Confidence 11111111111221110 00010 011222222 2345788999999999999999998765432 356677777 Q ss_pred hCchHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeeccccccccccc-------------cccc-c Q lcl|NC_019404. 76 LEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQNREE-------------NPRN-A 140 (418) Q Consensus 76 l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~-------------dp~s-~ 140 (418) -++.....++++.+..||.|++++..++ |.+. +.+++++++.+.+... +... + T Consensus 97 n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~------------i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~d~~~ 164 (456) T protein:vir:79 97 NRMDSVCKQWVKYGLDFGESYLTCWRRDDGTAT------------ITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAES 164 (456) T ss_pred cChhHHHHHHHHHHhhcCeeEEEEeeCCCCceE------------EEEeccceeEEEEcCCCCCceEEEEEEEEecCCce Confidence 7888888999999999999998876642 2221 2233333322221100 0000 0 Q ss_pred ----ccCcceEEEE---e---cCCcccccccCccc------EEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHH Q lcl|NC_019404. 141 ----RFGKPLTYRI---T---TNESDMFYDVHYSR------IHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNC 204 (418) Q Consensus 141 ----~yg~p~~y~i---~---~~~~~~~~~iH~SR------~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~ 204 (418) .|..+..|.+ . ..........+... ..++.|.+ |.. +..+.+|.|.++ ++.+.+.+++.+ T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-pvv--~~~N~~~~gd~e-~v~~liD~~~~~ 240 (456) T protein:vir:79 165 DFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPP-PVV--VYQNPDGMGEVE-PHIDIINRINRA 240 (456) T ss_pred eEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCce-eEE--EecCCCCCchhh-hhHHHHHHHHHH Confidence 0111111110 0 00000000000000 01111111 111 124567888887 477888888887 Q ss_pred HHHHHHHHHHcCCceeecchHHHh--hcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeE-eecccCCHHHHHHHH Q lcl|NC_019404. 205 ERLATQLLRRKQQAVWKAKGLAEL--CDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSV-LNSDIGGIDAFLDKK 281 (418) Q Consensus 205 ~~~~~~l~~~~~~~v~k~~~l~~~--~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~-~~~~~~gl~~~~~~~ 281 (418) +......+..++.....+.+.... ..+..+ . .+......+...+.+....++.++-+ -..++.+..+.++.. T Consensus 241 ~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g--~---~i~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~~~~~~~l~~~ 315 (456) T protein:vir:79 241 ELQLLSTMAIQAFRQRALKSSEHRLPKVDENG--N---AIDYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEH 315 (456) T ss_pred HHHHHHHHHHHhhHHHHHhcCCcccccccccc--c---ccchhhhhhhhccccccCCCCcceeeecccChHHHHHHHHHH Confidence 665554444444444333332110 001111 1 11111111122233333333444433 346678889999999 Q ss_pred HHHHhhhhcCCeeeeeccCccccccchhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhhcc------CCceEEeCCCC Q lcl|NC_019404. 282 FDRIVALSGIHEIILKNKNVGGLSSSQNTAL---ETFHKLIDRKRNAELLPILEFLIPFIVNA------EEWSVEFSPLD 352 (418) Q Consensus 282 ~~~iaaas~IP~t~L~G~s~~gl~stge~d~---~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~------~~~~~~f~pL~ 352 (418) ..+|++.+++|...|.|.+. |.||+.-. ......++.+| ..+++.|++++.+++.- .++++.|.|.. T Consensus 316 i~~i~~~t~~p~~~~~~~~~---N~Sg~Al~~~~~~l~~k~~~~~-~~f~~~l~~~~~l~~~~~g~~~~~~i~v~w~~~~ 391 (456) T protein:vir:79 316 IRQLSSATKTPLPMLMPDSA---NQSAEGAHNIEKGFLFKCEDRL-SIAKIGLEAILVKALQIEGESVEDTVDVSFESPD 391 (456) T ss_pred HHHHHhhcCCChhHhccccc---CcHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCccccceEEeCCCC Confidence 99999999999988876542 33454333 33334444444 45788999998887642 26788999988 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhhcccccccCCCccccccC Q lcl|NC_019404. 353 HESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 353 ~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e~e~~~~ 418 (418) .++..+. |+++.+++++|+++.+.++..| ++++++++..|-+-..++....| T Consensus 392 ~~s~~~~-------ada~~kl~~~G~~~~~~~~~~l-------g~~~~~i~~~e~~r~~~e~~~~~ 443 (456) T protein:vir:79 392 RVTLGEK-------YSAASLAKAAGESWASIRRNIL-------NYNADQIKQDDLDRAREQITLFA 443 (456) T ss_pred CcCHHHH-------HHHHHHHHhcCCChHHHHHhcC-------CCCHHHHHHHHHHHHHHHHHHHh Confidence 8887665 6666777788888876655432 34444443333222222222222 No 85 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=99.67 E-value=2.3e-16 Score=106.31 Aligned_cols=377 Identities=13% Similarity=0.112 Sum_probs=197.3 Q ss_pred hHHHHhc-------CCCC-c----cccCcc---ccCCHH-HHHHHHHcCCccchhhhcchhhhccCCccccC--cchH-H Q lcl|NC_019404. 7 YANIFLG-------GSDG-S----EIYGSL---QNQAPT-ILASLYADNALVRRIIDTIPETALAAGFHIDG--IDDE-P 67 (418) Q Consensus 7 ~~n~~~g-------~~~~-~----~~~~~~---~~~~~~-~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~--~~d~-~ 67 (418) ++|.+.- .+.. . ...|.. ...+.. -..+.|..++.+.++|+.++++.-+-++.+.- .+.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g~~~ 80 (460) T protein:vir:10 1 MANRIIRALRELTGLDNKFNDAFIKYIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKDTKAYQ 80 (460) T ss_pred CchhHHHHHhhhhccCCCchHHHHHhhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccCCccch Confidence 3333210 0000 0 001110 001111 23445788899999999999999988888731 1100 0 Q ss_pred ---H-------H------------------HHHHHHh-------CchHHHHHHHH-hccccceEEEEEeecCCCcccccc Q lcl|NC_019404. 68 ---A-------F------------------WSRWDDL-------EMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPV 111 (418) Q Consensus 68 ---~-------i------------------~~~~~~l-------~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl 111 (418) . + +.....| -....|.+.+. +-.++|.|++++.-.+.. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~------ 154 (460) T protein:vir:10 81 QLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDG------ 154 (460) T ss_pred hhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCC------ Confidence 0 0 0001111 12334444444 566789999887543211 Q ss_pred cCCCceEEEEEeeccccccccc-cccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchH Q lcl|NC_019404. 112 REGAELETVRVYDRTQVKVQNR-EENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVL 190 (418) Q Consensus 112 ~~~~~i~~i~v~~~~~i~~~~~-~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l 190 (418) ...|.+..|.++++.++++... ...+....++ ...|.+..+ +....+.++.||||.....+.. ......+|.||+ T Consensus 155 ~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~-~~~~~~~~~--g~~~~~~~~evih~r~~~~~~~-~~~~~~~G~sp~ 230 (460) T protein:vir:10 155 INAGVPSQMYVLPAHLIKIVLKDDINLLSTDSP-IKSYMLIQG--DQFIEFNEDEVIHTKYANPNFD-LQGSHLYGMSPI 230 (460) T ss_pred ccCceeEEEEEEcCceEEEEEcCCCceeeeeee-eeEEEEecC--ceeEEecccceEEEecCCCCcc-cccCccccccHH Confidence 2346677889999888876532 2222221111 233444433 2336789999999964332111 112345799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEee Q lcl|NC_019404. 191 SSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLN 268 (418) Q Consensus 191 ~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~ 268 (418) .. +.+.|.....+.......+...... +++++. .+ +.+...+.++++........+.+.+++..++.+|+.++ T Consensus 231 ~~-~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~---~l-~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~ 305 (460) T protein:vir:10 231 RA-ILRNINSQNSTIDNNVKTMQNGGVFGFIHGGST---GL-TQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAFTKIS 305 (460) T ss_pred HH-HHHHHHHHHHHHHHHHHHHhcCCCcceeeecCC---CC-CHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEcc Confidence 75 7788888888888888877764433 222221 11 22333444555544433222334445555557888888 Q ss_pred cccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCcc-ccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hc- Q lcl|NC_019404. 269 SDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVG-GLS-SSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VN- 340 (418) Q Consensus 269 ~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~-gl~-stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~- 340 (418) .+... +-+...+..+.||.+.|||..+| |...+ ..+ |+-|.....|+.. .|.|++..+-..+ +. T Consensus 306 ~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~ln~kl~~ 377 (460) T protein:vir:10 306 LNTDELKPFDYLKYDQKAICNALGWSDKLL-NNNEGGGLNTGNLEEERKRVVTD-------NIQPDLVILKQAFDKKFIK 377 (460) T ss_pred CChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCCccccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcC Confidence 76543 45667788899999999999755 54432 222 3445566667664 3677666554332 11 Q ss_pred ----cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcC-CCCh-----------hhccc Q lcl|NC_019404. 341 ----AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEI-KIGD-----------NDIQT 404 (418) Q Consensus 341 ----~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~-~~~~-----------~~~~~ 404 (418) ..++.++|+- ..+.. -+ ....+...++++|++|++|+|+.+. ..+-. +..| ++.++ T Consensus 378 ~~~~~~~~~i~~d~-~~l~~-l~-----~d~~~~~~~~~~g~~T~NE~R~~~g-~~pi~~~~gD~~~~~~n~~~~~~~~~ 449 (460) T protein:vir:10 378 RFKGYENAVIEWDI-SELPE-MQ-----TDMVAMASWLNTIPVTPNEIRIAMK-YETLNQDGMDIVFMPSNKVRIDDVSN 449 (460) T ss_pred cccccCCceEEeec-chhhh-HH-----HHHHHHHHHHhCCCCCHHHHHHHhC-CCCCCCCCCCeeeecccccchhhccc Confidence 2356666642 12211 11 1122334578999999999998762 22210 0111 11111 Q ss_pred -ccccCCCccc Q lcl|NC_019404. 405 -EESELITETE 414 (418) Q Consensus 405 -~e~~~~~e~e 414 (418) ..+..+.+++ T Consensus 450 ~~~~~~~nq~~ 460 (460) T protein:vir:10 450 NLIDSAFNQNQ 460 (460) T ss_pred ccCCCcccCCC Confidence 1122222333 No 86 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=99.66 E-value=3.1e-16 Score=105.62 Aligned_cols=353 Identities=11% Similarity=0.067 Sum_probs=180.7 Q ss_pred CccchhhHHHHhcCCCC----ccccCccccCC--HHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHH----HH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDG----SEIYGSLQNQA--PTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPA----FW 70 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~----~~~~~~~~~~~--~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~----i~ 70 (418) |==.|-|.+.+...... ....+.....+ +..-..+| +++.++++|+.+++++-+-++.+...+.... .. T Consensus 1 MGl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vt~~~al-~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~~~~~~~~~ 79 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEKRGYLDNVLGKSIRYSGVYVTDSNIL-QSSDVYELLQDISNQMVLADIVVEDEFGNEIKDDIAL 79 (394) T ss_pred CchhhhhhhhccCCCCchhhhhhhhhcccccCccccChhhhh-ccHHHHHHHHHHHHhhcccceEEEcCCCcccchhhHH Confidence 21112222211000000 00000000000 01112233 4688999999999999999998854322111 11 Q ss_pred HHHHH---hCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcce Q lcl|NC_019404. 71 SRWDD---LEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPL 146 (418) Q Consensus 71 ~~~~~---l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~ 146 (418) .-+.+ .-....|.+.+.+ ..++|.|++++. +..+.. +..+.+. .| +.+ T Consensus 80 ~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~---~~~~~~-------~~~~~~~-----------~~----~~~--- 131 (394) T protein:vir:62 80 QILRNPNNYLTQSEFIKLMTNTYLLEGETFPILN---GAQIHL-------ASNVFTE-----------LD----DNL--- 131 (394) T ss_pred HHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEe---cceeec-------cccceEE-----------EC----Cce--- Confidence 11111 2223445554444 566899999863 211110 1111110 00 011 Q ss_pred EEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecch Q lcl|NC_019404. 147 TYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKG 224 (418) Q Consensus 147 ~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~ 224 (418) .|.+..+ +.++.++.|+|+.+... ...+|.|++.. +.+.|.....+......++.....+ +++++. T Consensus 132 ~~~~~~~----~~~~~~~eiih~r~~~~-------d~~~G~s~~~~-~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~ 199 (394) T protein:vir:62 132 VEHFNIG----GHEIPPCMIRHVKNIGA-------DHLRGKGILDL-GRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDA 199 (394) T ss_pred EEEEeeC----CEEechhheEEecCcCC-------CCccccChHHH-HHHHHHHHHHHHHHHHHHHHccCCcceEEEeCC Confidence 1222221 25688999999965321 23579999975 7889999999988888888876655 555553 Q ss_pred HHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeec--ccC--CHHHHHHHHHHHHhhhhcCCeeeeeccC Q lcl|NC_019404. 225 LAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNS--DIG--GIDAFLDKKFDRIVALSGIHEIILKNKN 300 (418) Q Consensus 225 l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~--~~~--gl~~~~~~~~~~iaaas~IP~t~L~G~s 300 (418) .. -.......+.++++........+.+.+++...+.+++.... +.. .+-+..++....||.+.+||..+| |.. T Consensus 200 ~~--~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~ 276 (394) T protein:vir:62 200 HI--NPQNGAQSKLINAILDQLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTY-TEL 276 (394) T ss_pred CC--CcCHHHHHHHHHHHHHHhccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHc-CCC Confidence 21 11111122334444333222233344444444465665444 333 345566788899999999999877 432 Q ss_pred ccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcc----CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 301 VGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VNA----EEWSVEFSPLDHESSKDKAEVLEKSVNSIAAL 373 (418) Q Consensus 301 ~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~----~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~ 373 (418) . +|+.+...+.|+.. -|.|++..+-..| +.. ..+.|+|+.+.-++..++ ++++.++ T Consensus 277 ~---~sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~kll~~~~~~~~~~~fd~~~~~~~~~~-------~~~~~~~ 339 (394) T protein:vir:62 277 I---KEDIEKAMMYIHNK-------AVRPIMKNFEDHLSLLFYAQNSGKRIKFKINILDFVTYSNK-------TNIGYNL 339 (394) T ss_pred C---CcCHHHHHHHHHHH-------HHHHHHHHHHHHHhhhhcCccccCceEEEechhhhcCHHHH-------HHHHHHH Confidence 2 23345555666544 3788877764443 222 257889988776666543 5567899 Q ss_pred HhCCCCCHHHHHHHHHhhcCcC-CC-------------ChhhcccccccCCCcccc Q lcl|NC_019404. 374 IAAGAMDIKEARDTLRTIAPEI-KI-------------GDNDIQTEESELITETEV 415 (418) Q Consensus 374 ~~~g~i~~~e~r~~l~~~~~~~-~~-------------~~~~~~~~e~~~~~e~e~ 415 (418) +++|++|++|+|+.+. ..+-. +. +..+-+.+..+.-+++|- T Consensus 340 ~~~g~~T~NE~R~~~g-l~p~~~~~gd~~~~~~n~~~~~~~~~~~~~~kgge~~en 394 (394) T protein:vir:62 340 VRTAITSPDNVADMLG-FPKQNTKESQAIYISNDVTEIGKKEATDGSLGGGEENEN 394 (394) T ss_pred HhCCCcCHHHHHHHhC-CCCCCCCCCCeeecccccccccccccccccCCCCCCCCC Confidence 9999999999998763 22210 10 000011111112222333 No 87 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=99.64 E-value=3e-16 Score=105.66 Aligned_cols=343 Identities=14% Similarity=0.119 Sum_probs=184.5 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCchH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMTQ 80 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~~ 80 (418) ....+.+...+.+++. ...+. ..+... + .+++-+..+|+.++++.-+-.+. .......+..+-..+--.. T Consensus 12 ~~~~~~~~~~~~~~~~--~~~~~--~v~~~~---a-l~~~av~~cv~~ia~~ia~~p~~--~~~~~~~L~~~PN~~~t~~ 81 (359) T protein:vir:10 12 SITPNNYYPFMVQNGS--IVPNS--LVDATE---A-LKNSDLYAVTSLISSDIAGTRFI--GNQVFTSVLNNPSHLTNAF 81 (359) T ss_pred cCCCCcchhhhhcccc--ccCCc--ccCHHH---h-hcchHHHHHHHHHHHhhhcCccc--cchHHHHHhhcccccCCHH Confidence 2223333322222111 11111 122222 1 23555678999999988766552 2222333443333333444 Q ss_pred HHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccccc Q lcl|NC_019404. 81 NINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFY 159 (418) Q Consensus 81 ~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~ 159 (418) .|.+.+. +-.++|.|++++.- + ..|.+..+.++++..+++.... + .-.|+++........ T Consensus 82 ~f~~~~~~~lll~Gnay~~i~r-~---------~~g~~~~l~~l~~~~v~i~~~~--------~-~~~y~~~~~~~~~~~ 142 (359) T protein:vir:10 82 SFWQTAILNLLLNGNVFLAILK-G---------DNSLMKELRLIPSNAITIDLTD--------D-TLTYEVNQFDDYPSA 142 (359) T ss_pred HHHHHHHHhccccCceEEEEEE-C---------CCCeEEEEEEeCCceEEEEEcC--------C-eEEEEEEecCCceEE Confidence 5555555 55678999987742 2 2345677888888777653211 1 134666544334456 Q ss_pred ccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC--ceeecchHHHhhcCcchHHH Q lcl|NC_019404. 160 DVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ--AVWKAKGLAELCDDSEGFGA 237 (418) Q Consensus 160 ~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~--~v~k~~~l~~~~~~~~~~~~ 237 (418) .++++.|+||.....+. .+....+|.||+.. +.+.+.....+.......++.... -+++++.. .+ +.+..+. T Consensus 143 ~~~~~evih~~~~~~~~--~~~dg~~G~spi~~-~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~--~l-~~e~~~~ 216 (359) T protein:vir:10 143 KYNASEMIHVKIMAYGV--DTLHNLVGHSPLES-LTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQG--TL-SSEAKDS 216 (359) T ss_pred EEcccceEEeccCCCCC--CccCccccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC--CC-CHHHHHH Confidence 79999999997643221 12233579999975 778888888888888887766443 35565421 11 2233344 Q ss_pred HHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHH Q lcl|NC_019404. 238 ARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETF 315 (418) Q Consensus 238 ~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y 315 (418) ..++++..... .+.+.+++..++.+|+.++.+... +-+..+...+.||.+.+||..+|.+ .. .-+++.+.-...| T Consensus 217 ~~~~~~~~~~~-~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~-~~-~~~~~~~~~e~~~ 293 (359) T protein:vir:10 217 IRKEFEKANGG-NNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNG-TG-DQQSSLDQIKDLY 293 (359) T ss_pred HHHHHHHHhCc-cccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCC-CC-cccccHHHHHHHH Confidence 55555433222 223334444556788888776544 3356778888999999999987744 22 2223333222223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcC Q lcl|NC_019404. 316 HKLIDRKRNAELLPILEFLIPFIVNAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEI 395 (418) Q Consensus 316 ~~~I~~~Qe~~l~p~l~~l~~~i~~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~ 395 (418) ...+. ..+.|+.+.+-..+.. .+.++...+...+... ....+.+++++|+++++|+|+.|.. .|.. T Consensus 294 ~~~l~----~~l~p~~~~l~~~l~~--~~~~~~~~~~~~d~~~-------~~~~~~~~~~~G~~t~NE~R~~l~~-~pv~ 359 (359) T protein:vir:10 294 VNALN----RFIEPLISELRIKCDS--SIGVDMSPITDYSNSV-------FKADILNWVKEGIIEPTEAKTLLES-KGII 359 (359) T ss_pred HHHHH----HHHHHHHHHHHHHhhh--hhcccchhhhhcCHHH-------HHHHHHHHHhCCCcCHHHHHHHhCC-CCCC Confidence 22221 1245555444333321 2233333333333322 1234567899999999999997732 2222 No 88 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=99.64 E-value=3.3e-16 Score=105.49 Aligned_cols=363 Identities=13% Similarity=0.084 Sum_probs=189.0 Q ss_pred ccchhhHHH--HhcCCCCcc--ccCccccC--CHHHHHHHHHcCCccchhhhcchhhhccCCcccc-CcchH----HHHH Q lcl|NC_019404. 2 VKTDSYANI--FLGGSDGSE--IYGSLQNQ--APTILASLYADNALVRRIIDTIPETALAAGFHID-GIDDE----PAFW 70 (418) Q Consensus 2 ~~~D~~~n~--~~g~~~~~~--~~~~~~~~--~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~-~~~d~----~~i~ 70 (418) |.+ |+-. ......... ..+..-.. .+.... + .+++-+.+||+.++++.-+-.+.+. ...+. ..+. T Consensus 1 m~~--~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~-A-l~~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~~ 76 (417) T protein:vir:38 1 MKL--FRGLATEVDPHWADHLLDSGVIPSFRGGYLGIS-A-LRNSDVLTAVSIVSGDVSRFPLVITDSSTDEVIDLANIE 76 (417) T ss_pred Ccc--ccccccCCCccchhhhcccccccccCCceechh-h-cccHHHHHHHHHHHHhhccCeeEEEEcCCcceeccchHH Confidence 111 1110 000000000 00000000 000111 2 2456677899999999999888773 21111 1112 Q ss_pred HHH----HHhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcc Q lcl|NC_019404. 71 SRW----DDLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKP 145 (418) Q Consensus 71 ~~~----~~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p 145 (418) ..+ ...--...|.+.+. +..++|.|++++.- ++ ..+.+..+.+++++++.+.... -|. T Consensus 77 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r-~~--------~g~~~~~l~~l~p~~v~v~~~~-------~~~- 139 (417) T protein:vir:38 77 YLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVR-DP--------ITNEPAMFEFYAPSQTQVDTSD-------PDN- 139 (417) T ss_pred HHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEE-cC--------CCCEEEEEEEeCCceEEEEEcC-------CCe- Confidence 111 11222334544444 45678999988753 21 1244667788888887653321 122 Q ss_pred eEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecc Q lcl|NC_019404. 146 LTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAK 223 (418) Q Consensus 146 ~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~ 223 (418) -.|+++..++.....++++.||||.+.++ ..++|.|++.. +.+.|.....+.......+...... +++++ T Consensus 140 ~~y~~~~~~~~~~~~~~~~dviH~r~~~~-------d~~~G~s~l~~-~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~ 211 (417) T protein:vir:38 140 IIYRFTPYNSSMQKVCGFEDVIHWKFFSY-------DTIMGRSPLLS-LGDEIGLQESGVSTLQKFFKSGLKGSIIKAKE 211 (417) T ss_pred EEEEEEEcCCcEEEEecCcceEEecCCCC-------CCccccCHHHH-HHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC Confidence 34666655554445678899999965321 23569999975 7788888888888888877664443 44444 Q ss_pred hHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHHHHHHHHHHhhhhcCCeeeeeccCc Q lcl|NC_019404. 224 GLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNV 301 (418) Q Consensus 224 ~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~~~~~~~~iaaas~IP~t~L~G~s~ 301 (418) + .++ .+...+.++++.......+. +..++...+.+|+.++.+..+ +-+...+....||.+.|||..+| |.+. T Consensus 212 ~---~l~-~e~~~~~~~~~~~~~~g~n~-g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~~ 285 (417) T protein:vir:38 212 S---RLS-AEARQKIREDFERAQAGADA-GSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRL-AQNS 285 (417) T ss_pred C---CCC-HHHHHHHHHHHHHHhccccc-CCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHh-CCCC Confidence 2 122 23445566666554433333 344444456788888776543 33456677888999999998777 5433 Q ss_pred cccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcc---CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 302 GGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VNA---EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALI 374 (418) Q Consensus 302 ~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~~---~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~ 374 (418) . +++-++....|+. ..|.|.+..+-..+ +.. .++.|+|+. ..+.. .....+++++ T Consensus 286 ~--~s~~e~~~~~~~~-------~tl~P~~~~ie~~l~~~Ll~~~~~~~~~~~fd~-~~l~~--------~~~~~~~~~~ 347 (417) T protein:vir:38 286 P--NQSVKQLADDYIR-------NDLPFYFEPITSEFELKLLDDAQRHQYCIGFDT-KSVNG--------LPIADVNTAV 347 (417) T ss_pred c--chhHHHHHHHHHH-------HHHHHHHHHHHHHHHhhhcChhhcccceEEech-hhhhH--------HHHHHHHHHH Confidence 2 2334555556654 34777776664443 211 356788862 11211 1223356788 Q ss_pred hCCCCCHHHHHHHHHhhcCcCCCC-hh----------hcc-c-----------ccc--cCCCccccccC Q lcl|NC_019404. 375 AAGAMDIKEARDTLRTIAPEIKIG-DN----------DIQ-T-----------EES--ELITETEVVIA 418 (418) Q Consensus 375 ~~g~i~~~e~r~~l~~~~~~~~~~-~~----------~~~-~-----------~e~--~~~~e~e~~~~ 418 (418) ++|+++++|+|+.+. ..|..+.+ |. +.. . -++ +...+..+--+ T Consensus 348 ~~G~~T~NE~R~~~g-l~pi~~g~~d~~~~~~n~~~~d~~~~~~~~~~~~~kgg~~~~~~~~~~~~~~~ 415 (417) T protein:vir:38 348 NGGLWTGNEGRAELG-KKPLKDPNMDRIQSTLNTVFLDQKEAYQAEHAAELKGGDTNAKGNQNGSGTNA 415 (417) T ss_pred hCCCcCHHHHHHHhC-CCCCCCCCCCeeeecccccccccccccccccccccCCCCCCCCCCCcCCCCcC Confidence 999999999999763 22211110 10 000 0 000 00000001111 No 89 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.63 E-value=8.2e-17 Score=108.77 Aligned_cols=374 Identities=13% Similarity=0.074 Sum_probs=188.5 Q ss_pred ccchhh---HH------------------HHhcCCCCccccCccccCCHHHHHHHH--HcCCccchhhhcchhhhccCCc Q lcl|NC_019404. 2 VKTDSY---AN------------------IFLGGSDGSEIYGSLQNQAPTILASLY--ADNALVRRIIDTIPETALAAGF 58 (418) Q Consensus 2 ~~~D~~---~n------------------~~~g~~~~~~~~~~~~~~~~~~l~~~Y--~~~~~~r~iVd~~a~d~~r~~~ 58 (418) |+...- .+ -..|-..- .+ .+. ..+.++..+. ..+.+++.|||..++.++-+|+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i-~~--~~~-~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~ 76 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPL-PE--LTR-NTSAAWRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-hh--cCc-ccChhhhhhhhhhhcchHHHHHHHHHhhhccCCe Confidence 111111 11 11111100 00 001 1123333322 3467889999999999999999 Q ss_pred cccCcchH---HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeecccccccc-- Q lcl|NC_019404. 59 HIDGIDDE---PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQN-- 132 (418) Q Consensus 59 ~i~~~~d~---~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~~-- 132 (418) .+.+++|. ..+.+.|++-++.....++++.+.+||.|++++..++ +.+. ++++++.++.+.+ T Consensus 77 ~~~~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~------------i~~~~p~~~~~i~d~ 144 (456) T protein:vir:10 77 TVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTAT------------ITADSPETMVVSVDP 144 (456) T ss_pred ecCCCCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceE------------EEEEccceeEEEEcC Confidence 99765432 3466777777888889999999999999998876532 2221 2233333322211 Q ss_pred -----------ccccccc-ccc-------CcceEEEE----ecCCcccccccCcc-----cEEEecCccchhhhhhcccc Q lcl|NC_019404. 133 -----------REENPRN-ARF-------GKPLTYRI----TTNESDMFYDVHYS-----RIHIIDGERVPNAMRRQNDG 184 (418) Q Consensus 133 -----------~~~dp~s-~~y-------g~p~~y~i----~~~~~~~~~~iH~S-----R~i~~~g~~lp~~~~~~~~~ 184 (418) +..++.. +.| +...+|.. ...........+.+ ...++-|.+ |- -+.++. T Consensus 145 ~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-pv--v~~~N~ 221 (456) T protein:vir:10 145 LQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPP-PV--VVYQNP 221 (456) T ss_pred CCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCce-eE--EEecCC Confidence 0001000 000 00011100 00000000000000 001111111 11 123466 Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHH--HhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCC Q lcl|NC_019404. 185 WGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLA--ELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESE 262 (418) Q Consensus 185 ~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~--~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e 262 (418) +|.|.++ ++.+.+.+++.+.......+..++.+...+.+.. ....+..+. .. ......+...+.+....++. T Consensus 222 ~g~gd~e-~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~--~~---~~~~~~~~~~~~~~~~~~~~ 295 (456) T protein:vir:10 222 DGMGEVE-PHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGN--AI---DYASIFEAAPGALWELPPGV 295 (456) T ss_pred CCCchhh-hhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccc--cc---chhhhhhhhccccccCCCCc Confidence 8999997 5789999998887665544444444333333221 111111111 11 11111122223333334445 Q ss_pred ceeEee-cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019404. 263 EYSVLN-SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTAL---ETFHKLIDRKRNAELLPILEFLIPFI 338 (418) Q Consensus 263 ~~~~~~-~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~---~~y~~~I~~~Qe~~l~p~l~~l~~~i 338 (418) ++.+++ .++.+..+.++.....+++.+++|...|.|.+. |.||+.-. ......++.+|+ .+++.+.++..++ T Consensus 296 ~~~q~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~---N~Sg~Ai~~~~~~l~~k~~~~~~-~f~~~l~~~~rl~ 371 (456) T protein:vir:10 296 DIWESQANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA---NQSAEGAHNIEKGFLFKCEDRLS-IAKIGLEAILVKA 371 (456) T ss_pred ceEEecccChhHHHHHHHHHHHHHHhccCCChHHhccccc---ChHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Confidence 554443 456778888999999999999999988866542 33454332 233444444443 5678888888887 Q ss_pred hcc------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhhcccccccCCCc Q lcl|NC_019404. 339 VNA------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDNDIQTEESELITE 412 (418) Q Consensus 339 ~~~------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e 412 (418) +.- .++++.|.+...++..+. |+++.++.++|+++.+.+++.| ++++++++..|-+-..+ T Consensus 372 ~~~~g~~~~~~~~v~w~~~~~~~~~~~-------ada~~kl~~~gi~~~~~~~~~l-------g~~~~~i~~~e~er~~~ 437 (456) T protein:vir:10 372 LQIEGESVEDTVDVSFESPDRVTLGEK-------YSAASLAKAAGESWASIRRNIL-------NYNADQIKQDDLDRARE 437 (456) T ss_pred HHhcCCCcccceeEEecCCCCcCHHHH-------HHHHHHHHHcCCChHHHHHhhC-------CCCHHHHHHHHHHHHHH Confidence 642 367899999988887765 5666677778887766555432 33444333222111111 Q ss_pred cccccC Q lcl|NC_019404. 413 TEVVIA 418 (418) Q Consensus 413 ~e~~~~ 418 (418) +...-| T Consensus 438 e~~~~~ 443 (456) T protein:vir:10 438 QITLFA 443 (456) T ss_pred HHHHHh Confidence 111111 No 90 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.63 E-value=8.2e-17 Score=108.77 Aligned_cols=374 Identities=13% Similarity=0.074 Sum_probs=188.5 Q ss_pred ccchhh---HH------------------HHhcCCCCccccCccccCCHHHHHHHH--HcCCccchhhhcchhhhccCCc Q lcl|NC_019404. 2 VKTDSY---AN------------------IFLGGSDGSEIYGSLQNQAPTILASLY--ADNALVRRIIDTIPETALAAGF 58 (418) Q Consensus 2 ~~~D~~---~n------------------~~~g~~~~~~~~~~~~~~~~~~l~~~Y--~~~~~~r~iVd~~a~d~~r~~~ 58 (418) |+...- .+ -..|-..- .+ .+. ..+.++..+. ..+.+++.|||..++.++-+|+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i-~~--~~~-~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~ 76 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPL-PE--LTR-NTSAAWRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-hh--cCc-ccChhhhhhhhhhhcchHHHHHHHHHhhhccCCe Confidence 111111 11 11111100 00 001 1123333322 3467889999999999999999 Q ss_pred cccCcchH---HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeecccccccc-- Q lcl|NC_019404. 59 HIDGIDDE---PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQN-- 132 (418) Q Consensus 59 ~i~~~~d~---~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~~-- 132 (418) .+.+++|. ..+.+.|++-++.....++++.+.+||.|++++..++ +.+. ++++++.++.+.+ T Consensus 77 ~~~~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~------------i~~~~p~~~~~i~d~ 144 (456) T protein:vir:10 77 TVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTAT------------ITADSPETMVVSVDP 144 (456) T ss_pred ecCCCCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceE------------EEEEccceeEEEEcC Confidence 99765432 3466777777888889999999999999998876532 2221 2233333322211 Q ss_pred -----------ccccccc-ccc-------CcceEEEE----ecCCcccccccCcc-----cEEEecCccchhhhhhcccc Q lcl|NC_019404. 133 -----------REENPRN-ARF-------GKPLTYRI----TTNESDMFYDVHYS-----RIHIIDGERVPNAMRRQNDG 184 (418) Q Consensus 133 -----------~~~dp~s-~~y-------g~p~~y~i----~~~~~~~~~~iH~S-----R~i~~~g~~lp~~~~~~~~~ 184 (418) +..++.. +.| +...+|.. ...........+.+ ...++-|.+ |- -+.++. T Consensus 145 ~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-pv--v~~~N~ 221 (456) T protein:vir:10 145 LQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPP-PV--VVYQNP 221 (456) T ss_pred CCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCce-eE--EEecCC Confidence 0001000 000 00011100 00000000000000 001111111 11 123466 Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHH--HhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCC Q lcl|NC_019404. 185 WGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLA--ELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESE 262 (418) Q Consensus 185 ~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~--~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e 262 (418) +|.|.++ ++.+.+.+++.+.......+..++.+...+.+.. ....+..+. .. ......+...+.+....++. T Consensus 222 ~g~gd~e-~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~--~~---~~~~~~~~~~~~~~~~~~~~ 295 (456) T protein:vir:10 222 DGMGEVE-PHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGN--AI---DYASIFEAAPGALWELPPGV 295 (456) T ss_pred CCCchhh-hhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccc--cc---chhhhhhhhccccccCCCCc Confidence 8999997 5789999998887665544444444333333221 111111111 11 11111122223333334445 Q ss_pred ceeEee-cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019404. 263 EYSVLN-SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTAL---ETFHKLIDRKRNAELLPILEFLIPFI 338 (418) Q Consensus 263 ~~~~~~-~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~---~~y~~~I~~~Qe~~l~p~l~~l~~~i 338 (418) ++.+++ .++.+..+.++.....+++.+++|...|.|.+. |.||+.-. ......++.+|+ .+++.+.++..++ T Consensus 296 ~~~q~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~---N~Sg~Ai~~~~~~l~~k~~~~~~-~f~~~l~~~~rl~ 371 (456) T protein:vir:10 296 DIWESQANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA---NQSAEGAHNIEKGFLFKCEDRLS-IAKIGLEAILVKA 371 (456) T ss_pred ceEEecccChhHHHHHHHHHHHHHHhccCCChHHhccccc---ChHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Confidence 554443 456778888999999999999999988866542 33454332 233444444443 5678888888887 Q ss_pred hcc------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhhcccccccCCCc Q lcl|NC_019404. 339 VNA------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDNDIQTEESELITE 412 (418) Q Consensus 339 ~~~------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e 412 (418) +.- .++++.|.+...++..+. |+++.++.++|+++.+.+++.| ++++++++..|-+-..+ T Consensus 372 ~~~~g~~~~~~~~v~w~~~~~~~~~~~-------ada~~kl~~~gi~~~~~~~~~l-------g~~~~~i~~~e~er~~~ 437 (456) T protein:vir:10 372 LQIEGESVEDTVDVSFESPDRVTLGEK-------YSAASLAKAAGESWASIRRNIL-------NYNADQIKQDDLDRARE 437 (456) T ss_pred HHhcCCCcccceeEEecCCCCcCHHHH-------HHHHHHHHHcCCChHHHHHhhC-------CCCHHHHHHHHHHHHHH Confidence 642 367899999988887765 5666677778887766555432 33444333222111111 Q ss_pred cccccC Q lcl|NC_019404. 413 TEVVIA 418 (418) Q Consensus 413 ~e~~~~ 418 (418) +...-| T Consensus 438 e~~~~~ 443 (456) T protein:vir:10 438 QITLFA 443 (456) T ss_pred HHHHHh Confidence 111111 No 91 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=99.62 E-value=3.4e-16 Score=105.42 Aligned_cols=351 Identities=10% Similarity=0.092 Sum_probs=172.1 Q ss_pred chhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HH----HHHHHHHhC Q lcl|NC_019404. 4 TDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PA----FWSRWDDLE 77 (418) Q Consensus 4 ~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~----i~~~~~~l~ 77 (418) |-=|.++| +...... ++........-....|..++.++++|+.+++++-+-++.+...+.. .. +..+-..+- T Consensus 1 Mg~f~~lf-~~~~~~~-~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~~~ 78 (395) T protein:vir:95 1 MSILEKIF-KTRKDIT-YMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNTDL 78 (395) T ss_pred Cchhhhhh-ccCcccc-ccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhccCcCC Confidence 11122332 2111111 1000000011112446678999999999999999988876432211 11 222222334 Q ss_pred chHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccc Q lcl|NC_019404. 78 MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDM 157 (418) Q Consensus 78 ~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~ 157 (418) .+..|.+++....+.||.++++...++. +.+++.+.+.+.... |+. ...+.+ ..... T Consensus 79 t~~~f~~~~~~~lll~g~~~~~~~~~~~--------------~~~~~~~~~~~~~~~-----~~~--~~~~~~--~~~~~ 135 (395) T protein:vir:95 79 SSDSFWQQVIYKLIYDNEVLIVVSDSKE--------------LLIADSFYREEYALY-----DDI--FKDVTV--KDYTY 135 (395) T ss_pred CHHHHHHHHHHHHhhCCceEEEEecCCC--------------eEecCCccceeEeec-----Ccc--eeEEEE--cCcee Confidence 4566666666666666555544332221 122222222221111 110 011222 12222 Q ss_pred ccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchH Q lcl|NC_019404. 158 FYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGF 235 (418) Q Consensus 158 ~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~ 235 (418) ...+.++.||||... .+....+|.||+.. +...+..... .+.+.+.. +++++. ..+ +++.. T Consensus 136 ~~~~~~~evih~~~~------~~~~~~~G~spi~~-~~~~~~~~~~-------~~~~~~~~~gii~~~~--~~~-~~e~~ 198 (395) T protein:vir:95 136 QRTFTMQEVIYLKYN------NNKVTHFVESLFED-YGKIFGRMIG-------AQLKNYQIRGILKSAS--SAY-DEKNI 198 (395) T ss_pred eeeeccccEEEEccC------CCCcccccchHHHH-HHHHHHHHHH-------HHHhcCCCceEEEeCC--CCC-CHHHH Confidence 346889999999642 23345679999865 5444443322 22232221 233331 111 22222 Q ss_pred HHHHHHHHHHHHhcCC-cceeEEEcCCCceeEeecccCC-------HHHHHHHHHHHHhhhhcCCeeeeeccCccccccc Q lcl|NC_019404. 236 GAARLRLAQVDNNSGV-GQAIGIDAESEEYSVLNSDIGG-------IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSS 307 (418) Q Consensus 236 ~~~~~r~~~~~~~~~~-~~~~~~d~~~e~~~~~~~~~~g-------l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~st 307 (418) .+..+.++........ ...++...++.+|+.++.+..+ +-+...+..++||.+.+||..+|.| + .++ T Consensus 199 ~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~-~----~sn 273 (395) T protein:vir:95 199 EKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG-E----TAD 273 (395) T ss_pred HHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC-c----ccC Confidence 3333333332222122 2234434556788888776543 3455567788899999999987733 1 233 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---c-----cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_019404. 308 QNTALETFHKLIDRKRNAELLPILEFLIPFIV---N-----AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAM 379 (418) Q Consensus 308 ge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~-----~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i 379 (418) -++....||.. .|.|.+..+-..+- . ..+++|.++.|...|.+++ +++.+.++++|++ T Consensus 274 ~e~~~~~~~~~-------~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~-------~~~~~~~~~~G~l 339 (395) T protein:vir:95 274 LEKNTLVFEKF-------CLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQY-------AEAIDKLVSSGSF 339 (395) T ss_pred HHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHH-------HHHHHHHHhCCCc Confidence 35566677653 37777766644432 1 1356788888888887765 6678889999999 Q ss_pred CHHHHHHHHHhhcCcCCC-Chh-----------hcc-----cccccC---CCccccc Q lcl|NC_019404. 380 DIKEARDTLRTIAPEIKI-GDN-----------DIQ-----TEESEL---ITETEVV 416 (418) Q Consensus 380 ~~~e~r~~l~~~~~~~~~-~~~-----------~~~-----~~e~~~---~~e~e~~ 416 (418) +++|+|+.+. ..+..+. .|+ ..+ ..+... ++.+.+- T Consensus 340 t~NE~R~~~g-~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:95 340 TRNEVRIMLG-EEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred CHHHHHHHhC-CCCCCCCCCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 9999998762 2221111 010 000 000000 0000000 No 92 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=99.62 E-value=3.4e-16 Score=105.42 Aligned_cols=351 Identities=10% Similarity=0.092 Sum_probs=172.1 Q ss_pred chhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HH----HHHHHHHhC Q lcl|NC_019404. 4 TDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PA----FWSRWDDLE 77 (418) Q Consensus 4 ~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~----i~~~~~~l~ 77 (418) |-=|.++| +...... ++........-....|..++.++++|+.+++++-+-++.+...+.. .. +..+-..+- T Consensus 1 Mg~f~~lf-~~~~~~~-~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~~~ 78 (395) T protein:vir:10 1 MSILEKIF-KTRKDIT-YMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNTDL 78 (395) T ss_pred Cchhhhhh-ccCcccc-ccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhccCcCC Confidence 11122332 2111111 1000000011112446678999999999999999988876432211 11 222222334 Q ss_pred chHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccc Q lcl|NC_019404. 78 MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDM 157 (418) Q Consensus 78 ~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~ 157 (418) .+..|.+++....+.||.++++...++. +.+++.+.+.+.... |+. ...+.+ ..... T Consensus 79 t~~~f~~~~~~~lll~g~~~~~~~~~~~--------------~~~~~~~~~~~~~~~-----~~~--~~~~~~--~~~~~ 135 (395) T protein:vir:10 79 SSDSFWQQVIYKLIYDNEVLIVVSDSKE--------------LLIADSFYREEYALY-----DDI--FKDVTV--KDYTY 135 (395) T ss_pred CHHHHHHHHHHHHhhCCceEEEEecCCC--------------eEecCCccceeEeec-----Ccc--eeEEEE--cCcee Confidence 4566666666666666555544332221 122222222221111 110 011222 12222 Q ss_pred ccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchH Q lcl|NC_019404. 158 FYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGF 235 (418) Q Consensus 158 ~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~ 235 (418) ...+.++.||||... .+....+|.||+.. +...+..... .+.+.+.. +++++. ..+ +++.. T Consensus 136 ~~~~~~~evih~~~~------~~~~~~~G~spi~~-~~~~~~~~~~-------~~~~~~~~~gii~~~~--~~~-~~e~~ 198 (395) T protein:vir:10 136 QRTFTMQEVIYLKYN------NNKVTHFVESLFED-YGKIFGRMIG-------AQLKNYQIRGILKSAS--SAY-DEKNI 198 (395) T ss_pred eeeeccccEEEEccC------CCCcccccchHHHH-HHHHHHHHHH-------HHHhcCCCceEEEeCC--CCC-CHHHH Confidence 346889999999642 23345679999865 5444443322 22232221 233331 111 22222 Q ss_pred HHHHHHHHHHHHhcCC-cceeEEEcCCCceeEeecccCC-------HHHHHHHHHHHHhhhhcCCeeeeeccCccccccc Q lcl|NC_019404. 236 GAARLRLAQVDNNSGV-GQAIGIDAESEEYSVLNSDIGG-------IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSS 307 (418) Q Consensus 236 ~~~~~r~~~~~~~~~~-~~~~~~d~~~e~~~~~~~~~~g-------l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~st 307 (418) .+..+.++........ ...++...++.+|+.++.+..+ +-+...+..++||.+.+||..+|.| + .++ T Consensus 199 ~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~-~----~sn 273 (395) T protein:vir:10 199 EKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG-E----TAD 273 (395) T ss_pred HHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC-c----ccC Confidence 3333333332222122 2234434556788888776543 3455567788899999999987733 1 233 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---c-----cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_019404. 308 QNTALETFHKLIDRKRNAELLPILEFLIPFIV---N-----AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAM 379 (418) Q Consensus 308 ge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~-----~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i 379 (418) -++....||.. .|.|.+..+-..+- . ..+++|.++.|...|.+++ +++.+.++++|++ T Consensus 274 ~e~~~~~~~~~-------~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~-------~~~~~~~~~~G~l 339 (395) T protein:vir:10 274 LEKNTLVFEKF-------CLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQY-------AEAIDKLVSSGSF 339 (395) T ss_pred HHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHH-------HHHHHHHHhCCCc Confidence 35566677653 37777766644432 1 1356788888888887765 6678889999999 Q ss_pred CHHHHHHHHHhhcCcCCC-Chh-----------hcc-----cccccC---CCccccc Q lcl|NC_019404. 380 DIKEARDTLRTIAPEIKI-GDN-----------DIQ-----TEESEL---ITETEVV 416 (418) Q Consensus 380 ~~~e~r~~l~~~~~~~~~-~~~-----------~~~-----~~e~~~---~~e~e~~ 416 (418) +++|+|+.+. ..+..+. .|+ ..+ ..+... ++.+.+- T Consensus 340 t~NE~R~~~g-~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 340 TRNEVRIMLG-EEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred CHHHHHHHhC-CCCCCCCCCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 9999998762 2221111 010 000 000000 0000000 No 93 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=99.62 E-value=3.4e-16 Score=105.42 Aligned_cols=351 Identities=10% Similarity=0.092 Sum_probs=172.1 Q ss_pred chhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HH----HHHHHHHhC Q lcl|NC_019404. 4 TDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PA----FWSRWDDLE 77 (418) Q Consensus 4 ~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~----i~~~~~~l~ 77 (418) |-=|.++| +...... ++........-....|..++.++++|+.+++++-+-++.+...+.. .. +..+-..+- T Consensus 1 Mg~f~~lf-~~~~~~~-~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~~~ 78 (395) T protein:vir:10 1 MSILEKIF-KTRKDIT-YMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNTDL 78 (395) T ss_pred Cchhhhhh-ccCcccc-ccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhccCcCC Confidence 11122332 2111111 1000000011112446678999999999999999988876432211 11 222222334 Q ss_pred chHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccc Q lcl|NC_019404. 78 MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDM 157 (418) Q Consensus 78 ~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~ 157 (418) .+..|.+++....+.||.++++...++. +.+++.+.+.+.... |+. ...+.+ ..... T Consensus 79 t~~~f~~~~~~~lll~g~~~~~~~~~~~--------------~~~~~~~~~~~~~~~-----~~~--~~~~~~--~~~~~ 135 (395) T protein:vir:10 79 SSDSFWQQVIYKLIYDNEVLIVVSDSKE--------------LLIADSFYREEYALY-----DDI--FKDVTV--KDYTY 135 (395) T ss_pred CHHHHHHHHHHHHhhCCceEEEEecCCC--------------eEecCCccceeEeec-----Ccc--eeEEEE--cCcee Confidence 4566666666666666555544332221 122222222221111 110 011222 12222 Q ss_pred ccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchH Q lcl|NC_019404. 158 FYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGF 235 (418) Q Consensus 158 ~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~ 235 (418) ...+.++.||||... .+....+|.||+.. +...+..... .+.+.+.. +++++. ..+ +++.. T Consensus 136 ~~~~~~~evih~~~~------~~~~~~~G~spi~~-~~~~~~~~~~-------~~~~~~~~~gii~~~~--~~~-~~e~~ 198 (395) T protein:vir:10 136 QRTFTMQEVIYLKYN------NNKVTHFVESLFED-YGKIFGRMIG-------AQLKNYQIRGILKSAS--SAY-DEKNI 198 (395) T ss_pred eeeeccccEEEEccC------CCCcccccchHHHH-HHHHHHHHHH-------HHHhcCCCceEEEeCC--CCC-CHHHH Confidence 346889999999642 23345679999865 5444443322 22232221 233331 111 22222 Q ss_pred HHHHHHHHHHHHhcCC-cceeEEEcCCCceeEeecccCC-------HHHHHHHHHHHHhhhhcCCeeeeeccCccccccc Q lcl|NC_019404. 236 GAARLRLAQVDNNSGV-GQAIGIDAESEEYSVLNSDIGG-------IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSS 307 (418) Q Consensus 236 ~~~~~r~~~~~~~~~~-~~~~~~d~~~e~~~~~~~~~~g-------l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~st 307 (418) .+..+.++........ ...++...++.+|+.++.+..+ +-+...+..++||.+.+||..+|.| + .++ T Consensus 199 ~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~-~----~sn 273 (395) T protein:vir:10 199 EKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG-E----TAD 273 (395) T ss_pred HHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC-c----ccC Confidence 3333333332222122 2234434556788888776543 3455567788899999999987733 1 233 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---c-----cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_019404. 308 QNTALETFHKLIDRKRNAELLPILEFLIPFIV---N-----AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAM 379 (418) Q Consensus 308 ge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~-----~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i 379 (418) -++....||.. .|.|.+..+-..+- . ..+++|.++.|...|.+++ +++.+.++++|++ T Consensus 274 ~e~~~~~~~~~-------~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~-------~~~~~~~~~~G~l 339 (395) T protein:vir:10 274 LEKNTLVFEKF-------CLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQY-------AEAIDKLVSSGSF 339 (395) T ss_pred HHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHH-------HHHHHHHHhCCCc Confidence 35566677653 37777766644432 1 1356788888888887765 6678889999999 Q ss_pred CHHHHHHHHHhhcCcCCC-Chh-----------hcc-----cccccC---CCccccc Q lcl|NC_019404. 380 DIKEARDTLRTIAPEIKI-GDN-----------DIQ-----TEESEL---ITETEVV 416 (418) Q Consensus 380 ~~~e~r~~l~~~~~~~~~-~~~-----------~~~-----~~e~~~---~~e~e~~ 416 (418) +++|+|+.+. ..+..+. .|+ ..+ ..+... ++.+.+- T Consensus 340 t~NE~R~~~g-~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 340 TRNEVRIMLG-EEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred CHHHHHHHhC-CCCCCCCCCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 9999998762 2221111 010 000 000000 0000000 No 94 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=99.62 E-value=1.3e-15 Score=102.19 Aligned_cols=390 Identities=15% Similarity=0.143 Sum_probs=193.8 Q ss_pred Cccchhh--HHHH--------hcCCCCccccCcc-ccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc------Cc Q lcl|NC_019404. 1 MVKTDSY--ANIF--------LGGSDGSEIYGSL-QNQAPTILASLYADNALVRRIIDTIPETALAAGFHID------GI 63 (418) Q Consensus 1 ~~~~D~~--~n~~--------~g~~~~~~~~~~~-~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~------~~ 63 (418) +|.+.+. .+.. ++-.......+-. -..++..|+.+-..+++.++||+..++....-||.+. ++ T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~L~~~~e~~~~~~~~i~~~~~~iag~g~~~~~~~~~~~~ 91 (651) T protein:vir:99 12 KVHVEGLGGEADLAKSPNSTQIPDHRIQSHNVGVNPPYNPDRLAAFLELNETLATGIRKKSRYEVGFGFDLVPAQGVDGD 91 (651) T ss_pred EEEeecccccccccccccccccchhhhcccCCCCCCCCCHHHHHHHHhcChHHHHHHHHHhhhhhccCceeeecccCCCC Confidence 2222211 0000 0000000011111 1127889999999999999999999999999998874 21 Q ss_pred c-hHHHH---HHHHHH-----------hC----chHHHHHHHHhccccceEEEEEeecC-CCcc---cccccC------- Q lcl|NC_019404. 64 D-DEPAF---WSRWDD-----------LE----MTQNINDAWSWARLFGGAAIVAIVKD-NRAL---TSPVRE------- 113 (418) Q Consensus 64 ~-d~~~i---~~~~~~-----------l~----~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l---~~pl~~------- 113 (418) + +...+ ++.|+. ++ ....+..++.....+|.+++=+..++ +.+. ..|... T Consensus 92 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~pv~L~~lp~~~~Rv~~~~ 171 (651) T protein:vir:99 92 DASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGRPVGLAYVPARTVRVRRPQ 171 (651) T ss_pred ccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccchhhhhhcChhheeeeccc Confidence 1 22222 222221 11 11333344444456677776443221 1110 011100 Q ss_pred ---------------C----------------CceEEEEEee-------------cccccccccc--ccccccccCcceE Q lcl|NC_019404. 114 ---------------G----------------AELETVRVYD-------------RTQVKVQNRE--ENPRNARFGKPLT 147 (418) Q Consensus 114 ---------------~----------------~~i~~i~v~~-------------~~~i~~~~~~--~dp~s~~yg~p~~ 147 (418) . ....++.++. ...+++.... .....+.+..+.. T Consensus 172 ~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~ 251 (651) T protein:vir:99 172 NRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIRYREDEESEREPIFVDRET 251 (651) T ss_pred ccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEEeccCcceeeeeeccccee Confidence 0 0000111110 0000000000 0011123344555 Q ss_pred EEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC--ceeecchH Q lcl|NC_019404. 148 YRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ--AVWKAKGL 225 (418) Q Consensus 148 y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~--~v~k~~~l 225 (418) +.++.........+.++.||||.... +....+|.|++.. +...|.....+.......+..... -++++++ T Consensus 252 g~~~~~~~~~~~~~~~~eViHir~~~------~~~g~~G~spl~~-a~~~i~~a~~a~~~~~~~f~NG~~p~gil~~~~- 323 (651) T protein:vir:99 252 GDVTTGDANGLENRPANELIFIPNPS------ILEDDYGVPDWVS-AIRTISADEAAKDYNRDFFDNDTIPRMVIKVTG- 323 (651) T ss_pred eeEEEcCCCceeEecccceEEecCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEecC- Confidence 55554443344567889999995431 2344689999976 778888888888888888776544 3555542 Q ss_pred HHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC----------CCceeEeecccC---CHHHHHHHHHHHHhhhhcCC Q lcl|NC_019404. 226 AELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE----------SEEYSVLNSDIG---GIDAFLDKKFDRIVALSGIH 292 (418) Q Consensus 226 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~----------~e~~~~~~~~~~---gl~~~~~~~~~~iaaas~IP 292 (418) ..+ +.+....+++.++. ...+.+..+++..+ +-+|+.++...+ .+-+...+....||++.+|| T Consensus 324 -~~l-s~e~~~~lr~~~~~--~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~afgVP 399 (651) T protein:vir:99 324 -GEL-SEESKRDLRQMLNG--LREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHEIAKVLEVP 399 (651) T ss_pred -CCC-CHHHHHHHHHHHHH--HhccCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHHhCCC Confidence 111 22334445555543 23344555555432 234555544332 23455677888899999999 Q ss_pred eeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcc-----CC--ceEEeCC--CCCCCHHHHH Q lcl|NC_019404. 293 EIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VNA-----EE--WSVEFSP--LDHESSKDKA 360 (418) Q Consensus 293 ~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~-----~~--~~~~f~p--L~~~~eke~a 360 (418) ..+| |...++-.|+.|.....|+.. .|.|++..+-..| +.. .+ +.|+|+. |...+. T Consensus 400 p~~l-G~~~~~~~sn~E~~~~~f~~~-------tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~---- 467 (651) T protein:vir:99 400 PVKI-GVTDSANRSNSDQQDKDFALE-------VIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGADQPKQEA---- 467 (651) T ss_pred HHHh-ccCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCccccccCceEEEEeccchhhhccH---- Confidence 8655 666554445667677777654 3777766654443 221 23 3455553 544444 Q ss_pred HHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCC-CCh------------hhccccc--ccCCCccccccC Q lcl|NC_019404. 361 EVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIK-IGD------------NDIQTEE--SELITETEVVIA 418 (418) Q Consensus 361 e~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~-~~~------------~~~~~~e--~~~~~e~e~~~~ 418 (418) +.+++++..++++|++|++|+|+.+. ..+..+ ..+ +.....+ ...+..+|..+. T Consensus 468 ---~~~~e~~~~~i~~G~~T~NE~R~~lg-lppi~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~~~~~~~~ 536 (651) T protein:vir:99 468 ---QLAEQRVRAMRLAGVGLVDEAREELG-LDPLGEPYGEMTLSEFEAEVAGDVAGGGETEAVHEPPEENKIG 536 (651) T ss_pred ---HHHHHHHHHHHhCCCcCHHHHHHHhC-CCCCCCccccccccccccccccccccCCCCcccccCccccccc Confidence 45578889999999999999999763 111000 000 0000000 000011111111 No 95 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=99.61 E-value=1e-15 Score=102.73 Aligned_cols=353 Identities=15% Similarity=0.153 Sum_probs=185.7 Q ss_pred hhhHHHHhcCCC--CccccCc------cccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc--CcchHHH----HH Q lcl|NC_019404. 5 DSYANIFLGGSD--GSEIYGS------LQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID--GIDDEPA----FW 70 (418) Q Consensus 5 D~~~n~~~g~~~--~~~~~~~------~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~~~d~~~----i~ 70 (418) =|+.|.|.--.. ....... ..........++ ..++.++.+|+.+|+++-+-++.+- .++.... +. T Consensus 1 Mg~~~~f~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~~~~~~~~ 79 (403) T protein:vir:80 1 MGLFNFFRRKTRSEPTNAISWFLTQEAYDTLAIPGYTRL-SDNPEVRMAVHKIAELISSMTIHLMQNTDNGDIRIKNELS 79 (403) T ss_pred Ccccccccccccccccchhhhhcccccccccccchhhhh-hhhHHHHHHHHHHHHhhhhCceEEEEecCCceeecCChHH Confidence 122333311000 0000000 001111111222 4477889999999999988888762 1111111 11 Q ss_pred HHHH----HhCchHHHHHHHHhccc---cceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccC Q lcl|NC_019404. 71 SRWD----DLEMTQNINDAWSWARL---FGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFG 143 (418) Q Consensus 71 ~~~~----~l~~~~~~~~a~~~~rl---~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg 143 (418) ..+. .+-....|.+.+-+..+ +|.|++++.- + ..|.+..+.++++..+++.... .| T Consensus 80 ~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~-~---------~~g~~~~L~~l~p~~v~~~~~~-------~g 142 (403) T protein:vir:80 80 RKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKY-T---------TSGLIDELIPLAPSKVSFVDTD-------TG 142 (403) T ss_pred HHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEE-c---------CCCcEEEEEEEcCCeeEEEEcC-------Cc Confidence 1121 12234566666655543 4667777642 2 2356778888888887653221 12 Q ss_pred cceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eee Q lcl|NC_019404. 144 KPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWK 221 (418) Q Consensus 144 ~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k 221 (418) .. +++.+ ..+-++.||||...+.| ...+.|.||+. .+.+.+.....+.......+.....+ +++ T Consensus 143 ~~--~~y~~------~~~~~~eiih~~~~~~~-----~~~~~G~s~~~-~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~ 208 (403) T protein:vir:80 143 YQ--IWYQG------KAYNYDEVLHFIVNPDP-----EKPYMGRGYRV-VLKDIVNNLKQATTTKKSFMSGKYMPSLIVK 208 (403) T ss_pred eE--EEEee------cccchhhEEEEeccCCC-----cCccccccHHH-HHHHHHHHHHHHHHHHHHHHhccCCcceEEE Confidence 11 22221 34567889998633322 23345999986 47788888877777777777655433 455 Q ss_pred cchHHHhhcCcchHHHHHHHHHHH-HHhcCCcceeEEEcCCCceeEee-cccC--CHHHHHHHHHHHHhhhhcCCeeeee Q lcl|NC_019404. 222 AKGLAELCDDSEGFGAARLRLAQV-DNNSGVGQAIGIDAESEEYSVLN-SDIG--GIDAFLDKKFDRIVALSGIHEIILK 297 (418) Q Consensus 222 ~~~l~~~~~~~~~~~~~~~r~~~~-~~~~~~~~~~~~d~~~e~~~~~~-~~~~--gl~~~~~~~~~~iaaas~IP~t~L~ 297 (418) ++.. +. .....+.++++... ....+.+..+++.....++++.. .+.. .+-+..+.....||.+.+||..+| T Consensus 209 ~~~~---~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l- 283 (403) T protein:vir:80 209 VDAA---TA-ELSSEEGRNAVFKKYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLL- 283 (403) T ss_pred eCCC---CC-hHHHHHHHHHHHHHHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHc- Confidence 5531 11 12223334433222 22233445555555444444332 3332 445667788889999999998666 Q ss_pred ccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hccCCceEEeC--CCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_019404. 298 NKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VNAEEWSVEFS--PLDHESSKDKAEVLEKSVNSIA 371 (418) Q Consensus 298 G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~~~~~~~~f~--pL~~~~eke~ae~~~~~a~a~~ 371 (418) |. ++..++...+||.. .|.|+++.+-..| +...++.|+|+ .|...|.+++ ++++. T Consensus 284 g~-----~~~~~~~~~~f~~~-------~l~P~~~~ie~~l~~kll~~~~~~~~f~~~~ll~~d~~~~-------~~~~~ 344 (403) T protein:vir:80 284 GV-----GKYDKDEYNNFINS-------TILPIAKGIEQELTRKLLISPDLYFKFNPRSLYAYDLKEL-------AEVGS 344 (403) T ss_pred CC-----CCccHHHHHHHHHH-------HHHHHHHHHHHHHHHhccCCCCcEEEeechhhhccCHHHH-------HHHHH Confidence 43 22223344556543 4788876664443 34567777775 5666666555 66788 Q ss_pred HHHhCCCCCHHHHHHHHHhhcCcCCC------------C---h-hhccc-ccccCCCccc Q lcl|NC_019404. 372 ALIAAGAMDIKEARDTLRTIAPEIKI------------G---D-NDIQT-EESELITETE 414 (418) Q Consensus 372 ~~~~~g~i~~~e~r~~l~~~~~~~~~------------~---~-~~~~~-~e~~~~~e~e 414 (418) +++++|++|++|+|+.+. ..+..+- + . ...+. +.+...+++| T Consensus 345 ~~~~~Gi~t~NE~R~~~g-l~p~~ggd~~~~~~n~~pl~~~~~~~~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 345 NMYVRGLMEGNEVRDWLG-LSPKEGLSELVILENYIPLDKIGDQNKLKGGEKGGADGQTD 403 (403) T ss_pred HHHhCCCcCHHHHHHHhC-CCCCCCCCeEeecccccchhhccchhhccCCCCCCCCCCCC Confidence 899999999999999763 2221110 0 0 00111 1123344455 No 96 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.60 E-value=8.5e-16 Score=103.21 Aligned_cols=375 Identities=9% Similarity=0.100 Sum_probs=185.4 Q ss_pred Cc---------------------cchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcc Q lcl|NC_019404. 1 MV---------------------KTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFH 59 (418) Q Consensus 1 ~~---------------------~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~ 59 (418) |. |.+-+..-..|-..-..............+... ..+.++++|||..++.+.=++|. T Consensus 9 l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~-~~~n~~~~iVd~~~~~l~~~gf~ 87 (479) T protein:vir:99 9 LSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQL-SRKPWMGLMVNSFAQQLIVDGYR 87 (479) T ss_pred CChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHH-hhcCcHHHHHHHHHhhccccccc Confidence 11 111111112222211000000000111122222 24577999999999999888888 Q ss_pred ccCcchHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccc Q lcl|NC_019404. 60 IDGIDDEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRN 139 (418) Q Consensus 60 i~~~~d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s 139 (418) +.+.+..+.+.+.|++-++.....++++.+.+||.|++++.-. .. +.+..+.+ .++++++.++.+.+. |+.. T Consensus 88 ~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~-~~----~~d~~g~~-~i~~~~p~~~~~iyd--d~~~ 159 (479) T protein:vir:99 88 KTGTNENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSG-IS----PLDGTTVA-RIKCIDPRDAFAIWE--DPYW 159 (479) T ss_pred CCCchhhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecC-CC----CcCCCCce-EEEEechhheEEEec--CCcc Confidence 7655555567777777778888899999999999999877521 01 11122222 345555555444321 1111 Q ss_pred ---c----c--------cCcceEE-EEecCCcccc---cccCcc-c--EEEecCccchhhhhhccccCCcchHHHHHHHH Q lcl|NC_019404. 140 ---A----R--------FGKPLTY-RITTNESDMF---YDVHYS-R--IHIIDGERVPNAMRRQNDGWGRSVLSSDILDS 197 (418) Q Consensus 140 ---~----~--------yg~p~~y-~i~~~~~~~~---~~iH~S-R--~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~ 197 (418) + . |.-+..| .+...++... ..-|+= + ++.|..+ +....||.|.++ ++.+. T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~-------~~~~~~g~sd~e-~v~~l 231 (479) T protein:vir:99 160 DEWPKYLLERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNV-------MDLRGVCYGDVE-PLVTV 231 (479) T ss_pred cceeeEEEeecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeecC-------CCcCcCCcchhH-HHHHH Confidence 0 0 0001111 1111111100 111211 1 1222211 122358999996 58899 Q ss_pred HHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEee-cccCCHHH Q lcl|NC_019404. 198 IKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLN-SDIGGIDA 276 (418) Q Consensus 198 l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~-~~~~gl~~ 276 (418) +.+++++.......+..++...+.+.+.... ..... .... ......+.++..+++-++-+++ .++....+ T Consensus 232 iDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~-~~~~~-~~~~-------~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~ 302 (479) T protein:vir:99 232 AKAIDKTGLDILLVQHHQSFQIRWATGLMLP-EGANA-DQEK-------MRFAQESMLISQNEKASFGAIPAAPLDGLLN 302 (479) T ss_pred HHHHHHHHHHHHHHHHHhhchhhhhcCCCcc-ccccc-chhc-------cccccccceeecCCCceEEEecccchHHHHH Confidence 9999999887777777666666555543211 11111 0000 0111233334433333343333 34556667 Q ss_pred HHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHH--HHHHHHHHHHHHHHhhcc---------CCce Q lcl|NC_019404. 277 FLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKR--NAELLPILEFLIPFIVNA---------EEWS 345 (418) Q Consensus 277 ~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Q--e~~l~p~l~~l~~~i~~~---------~~~~ 345 (418) .++....+|++.+++|.. .||.+ | |+||+.-...+...+...+ +..+.+.|++++.+++.- .++. T Consensus 303 ~l~~~i~~i~~~t~~p~~-~~g~~--~-n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~~~~~~i~ 378 (479) T protein:vir:99 303 AYKESLLEFLALAQLPPH-IAGQI--V-NVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTEEATDLDFT 378 (479) T ss_pred HHHHHHHHHhccCCCCHH-Hcccc--c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeee Confidence 788888999999999986 45643 2 3566655544444333322 235678888888877541 1467 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhhccc---cc---------------- Q lcl|NC_019404. 346 VEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDNDIQT---EE---------------- 406 (418) Q Consensus 346 ~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~~~~---~e---------------- 406 (418) +.|.+...++..+. |+++.+++++|.++.+.+...|. ++++.+++. .. T Consensus 379 ~~w~~~~~~s~~~~-------ad~~~kl~~ag~is~et~l~~l~------gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 445 (479) T protein:vir:99 379 ITWQDVTIQSLAQF-------ADAWAKMVESLKIPAEGVWDMIP------NLDQSTVNGWKEIYDREGDFGKYMRKLQNG 445 (479) T ss_pred EEecCCCCCCHHHH-------HHHHHHHHhcCCCCHHHHHHhcC------CCCHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 88998888888765 55566666677777665554321 111111110 00 Q ss_pred -------ccCCCcc--------ccccC Q lcl|NC_019404. 407 -------SELITET--------EVVIA 418 (418) Q Consensus 407 -------~~~~~e~--------e~~~~ 418 (418) ..+.+.+ ++-=| T Consensus 446 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (479) T protein:vir:99 446 PDPAEQRGGPNGATNMQQANNKTGEPA 472 (479) T ss_pred cCcccccCCCCCCCCCCCCCCCCcchh Confidence 0000000 00011 No 97 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=99.58 E-value=3.1e-15 Score=100.10 Aligned_cols=356 Identities=11% Similarity=0.100 Sum_probs=187.4 Q ss_pred hhhHHHHhcCCCCc----ccc---CccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCc----chHH-----H Q lcl|NC_019404. 5 DSYANIFLGGSDGS----EIY---GSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGI----DDEP-----A 68 (418) Q Consensus 5 D~~~n~~~g~~~~~----~~~---~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~----~d~~-----~ 68 (418) =|+.|-+...-+.+ +.. .......+....+.|..++.+.++|+.+|+.+.+-++.+... .+.+ . T Consensus 1 mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~~~~~~~~~~~~ 80 (403) T protein:vir:10 1 MGFKSWITEKLNPGQRIIRDMEPVSHRTNRKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGDKYNIVTYANGVKTKT 80 (403) T ss_pred CcchhhhhhccchhhhhhhcccccccccCCcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEeecccccccccccccch Confidence 13444332110110 000 000011122233667788999999999999999988877311 1111 1 Q ss_pred HHHHHH----HhCchHHHHHHHHhcc-ccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccC Q lcl|NC_019404. 69 FWSRWD----DLEMTQNINDAWSWAR-LFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFG 143 (418) Q Consensus 69 i~~~~~----~l~~~~~~~~a~~~~r-l~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg 143 (418) +...+. ..--...|.+.+...+ ++|.|++++. +. .+.++++..+.+.. | ..+ T Consensus 81 l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~---~~-------------~l~~l~~~~~~v~~---~----~~~ 137 (403) T protein:vir:10 81 LDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWD---GT-------------SLYHVPAALMQVEA---D----ANK 137 (403) T ss_pred HHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe---Cc-------------eeEeecCcceEEEE---c----CCc Confidence 111121 1223356666666555 6788887752 11 13344444333221 0 011 Q ss_pred cceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eee Q lcl|NC_019404. 144 KPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWK 221 (418) Q Consensus 144 ~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k 221 (418) ....| +... ...+.+++++||....+ ...+....+|.||+.. +.+.+.....+.......+...... +++ T Consensus 138 ~~~~~-~~~~----~~~~~~~eiih~~~~~~--~~~~~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~f~ng~~~~gil~ 209 (403) T protein:vir:10 138 FIKKF-IFNN----QINYRVDEIIFIKDNSY--VCGTNSQISGQSRVAT-VIDSLEKRSKMLNFKEKFLDNGTVIGLILE 209 (403) T ss_pred eEEEE-EecC----ceeecccceEEeccccc--ccCCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCcceEEE Confidence 11122 2221 12356778999964332 1123345679999975 7889999888888888877654433 556 Q ss_pred cchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccC--C--HHHHHHHHHHHHhhhhcCCeeeee Q lcl|NC_019404. 222 AKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIG--G--IDAFLDKKFDRIVALSGIHEIILK 297 (418) Q Consensus 222 ~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~--g--l~~~~~~~~~~iaaas~IP~t~L~ 297 (418) +++ .++ ++...+..+++........+.+.+++..++.+|+.++.+.+ + +-+...+....||.+.+||..+| T Consensus 210 ~~~---~l~-~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l- 284 (403) T protein:vir:10 210 TDE---ILN-KKLRERKQEELQLDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLL- 284 (403) T ss_pred eCC---CCC-HHHHHHHHHHHHHHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHc- Confidence 553 122 22333444444443322223333445455677888875443 3 35667788899999999999755 Q ss_pred ccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--cCCceEEeCCC--CCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 298 NKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVN--AEEWSVEFSPL--DHESSKDKAEVLEKSVNSIAAL 373 (418) Q Consensus 298 G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~--~~~~~~~f~pL--~~~~eke~ae~~~~~a~a~~~~ 373 (418) |.+. +++-++....|+.. -|.|.+..+-..+-. ...+.++++.+ ...|.+ +++++++++ T Consensus 285 g~~~---~sn~e~~~~~f~~~-------tl~P~~~~ie~~l~~~L~~~~~~d~~~~~~l~~D~~-------~~~~~~~~~ 347 (403) T protein:vir:10 285 DGGN---NANIRPNIELFYYM-------TIIPMLNKLTSSLTFFFGYKITPNTKEVAALTPDKE-------AEAKHLTSL 347 (403) T ss_pred CCCC---CcCHHHHHHHHHHH-------HHHHHHHHHHHHHHHhcCceeeeccchhhhcccCHH-------HHHHHHHHH Confidence 5432 33445555566643 377877776555422 23455555544 344443 457888899 Q ss_pred HhCCCCCHHHHHHHHHhhcCcC--CC-----C--h-------hhcccccccCCCccc Q lcl|NC_019404. 374 IAAGAMDIKEARDTLRTIAPEI--KI-----G--D-------NDIQTEESELITETE 414 (418) Q Consensus 374 ~~~g~i~~~e~r~~l~~~~~~~--~~-----~--~-------~~~~~~e~~~~~e~e 414 (418) ++.|++|++|+|+.+. ..+-. +. + . ..-+..+++.-+|.| T Consensus 348 ~~~G~lT~NE~R~~~g-l~pi~~~~~d~~~~p~n~~~~~~~~~~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 348 VNNGIITGNEARSELN-LEPLDDEQMNKIRIPANVAGSATGVSGQEGGRPKGSTEGD 403 (403) T ss_pred HhCCCcCHHHHHHHhC-CCCCCcccccccccccccccccccCCCCcCCCCCCCcCCC Confidence 9999999999999862 22110 00 0 0 000111122223333 No 98 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=99.58 E-value=4.5e-15 Score=99.24 Aligned_cols=358 Identities=10% Similarity=0.068 Sum_probs=160.1 Q ss_pred hhhHHHHhcCCCCccc-cCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHH----HHHHHHH-H--- Q lcl|NC_019404. 5 DSYANIFLGGSDGSEI-YGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEP----AFWSRWD-D--- 75 (418) Q Consensus 5 D~~~n~~~g~~~~~~~-~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~----~i~~~~~-~--- 75 (418) =|+.+.|......... ......+ ..-....|..+..+.++|+.+|+++-+-++.+...+... .+...+. + T Consensus 1 Mgl~d~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~~~~~~~~lL~~~PN~ 79 (395) T protein:vir:96 1 MGILDFFSFKKSGTLSDDDSGSTT-SEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTENQKDWLYWINTKANP 79 (395) T ss_pred CcchhhhcCCCCccccccccccch-hhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCccccccchHHHHHhhcCCC Confidence 1333433111111101 1111111 122234566788889999999999999888885332111 1221221 1 Q ss_pred hCchHHHHHHH-HhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCC Q lcl|NC_019404. 76 LEMTQNINDAW-SWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNE 154 (418) Q Consensus 76 l~~~~~~~~a~-~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~ 154 (418) .-....|.+.+ ....++|.|++++. ++.. +.+.+.+..... ..|+ .++.+...+ T Consensus 80 ~~t~~~f~~~l~~~lll~Gna~~~~~-~~~~--------------~~~~~~~~~~~~------~~~~----~~~~v~~~~ 134 (395) T protein:vir:96 80 NQSASQFWVEVVQKLLVDGETLIFVI-PGKG--------------IYVADAFTQDKK------LSGN----KFKVSRVQG 134 (395) T ss_pred CCCHHHHHHHHHHHHhhcCceEEEEE-cCCc--------------eecCCccccccc------cccc----eeeeeeecc Confidence 11234444444 44455788988764 3321 111111111000 0011 111222222 Q ss_pred cccccccCcccEEEecCccchhhhhhccccC-CcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcc Q lcl|NC_019404. 155 SDMFYDVHYSRIHIIDGERVPNAMRRQNDGW-GRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSE 233 (418) Q Consensus 155 ~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~-G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~ 233 (418) ......+.++.|+||.....+.. ......+ +.+.+...+.. +.....+..................+ ...+. T Consensus 135 ~~~~~~~~~~dvih~k~~~~~~~-~~~~~~~~~~~~~~~~~i~-~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~ 207 (395) T protein:vir:96 135 QTYEKIFTFDQVIYLKNDNSDLM-LKVESLWEEYGELLGHVIN-NQKIANQIRFTMTPPKDKVRERAQEN-----SDGGR 207 (395) T ss_pred ceeeeEeccCceEEecccCCccc-cccccccchHHHHHHHHHH-HHHHHHHHHHHhhhcccccccceeec-----cCchh Confidence 22234578889999964332111 1111111 11222221111 11111111122222211111111111 11111 Q ss_pred hHHHHHHHHHHHHHh-cCCcceeEEEcCCCceeEeecccCCH--------HHHHHHHHHHHhhhhcCCeeeeeccCcccc Q lcl|NC_019404. 234 GFGAARLRLAQVDNN-SGVGQAIGIDAESEEYSVLNSDIGGI--------DAFLDKKFDRIVALSGIHEIILKNKNVGGL 304 (418) Q Consensus 234 ~~~~~~~r~~~~~~~-~~~~~~~~~d~~~e~~~~~~~~~~gl--------~~~~~~~~~~iaaas~IP~t~L~G~s~~gl 304 (418) ......+.+...... .+....+++...+.+|+.++.+..+. .++.....+.||.+.|||..+|.| . T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~~----~- 282 (395) T protein:vir:96 208 QPKSDKDFFKRTIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHG----D- 282 (395) T ss_pred hHHHHHHHHHHHHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC----C- Confidence 112222222222212 22233333334446787777665432 222345567899999999987732 1 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---c-----cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019404. 305 SSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---N-----AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAA 376 (418) Q Consensus 305 ~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~-----~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~ 376 (418) .++-+.....||.. .|.|++..+-..+- . ..++.|+|+.|...|.+++ ++++++++++ T Consensus 283 ~sn~e~~~~~f~~~-------~L~P~~~~ie~~l~~~Ll~~~e~~~~~~f~~~~l~~~d~~~~-------~~~~~~~~~~ 348 (395) T protein:vir:96 283 IADNQKNYELLLEG-------PIESLITNIVDGLEYAIFDKSETLEGSFIKVTGLKNYDLFSI-------SSQADKLISS 348 (395) T ss_pred CccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhhhcCceeEeecchhccCHHHH-------HHHHHHHHhC Confidence 23345555667663 37787766654432 1 2467788988888887665 6677889999 Q ss_pred CCCCHHHHHHHHHhhcCcCC-CChh--------hcccccccCCCcccc Q lcl|NC_019404. 377 GAMDIKEARDTLRTIAPEIK-IGDN--------DIQTEESELITETEV 415 (418) Q Consensus 377 g~i~~~e~r~~l~~~~~~~~-~~~~--------~~~~~e~~~~~e~e~ 415 (418) |++|++|+|+.+. ..|-.+ ..|+ .+.+.-.+..+|.|- T Consensus 349 G~~T~NE~R~~~g-l~pi~~~~gD~~~~~~N~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:96 349 GFVFIDEVREEIG-LPELPDGLGKVLYMTKNYESVLERGGEVDEEVET 395 (395) T ss_pred CCcCHHHHHHHhC-CCCCCCCCCceeeecccceechhccCCCCCCCCC Confidence 9999999998763 222111 1111 111111123333333 No 99 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.57 E-value=1.8e-15 Score=101.42 Aligned_cols=368 Identities=11% Similarity=0.033 Sum_probs=178.3 Q ss_pred cCccccCCHHHHHHHHH--cCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCchHHHHHHHHhccccceEEEE Q lcl|NC_019404. 21 YGSLQNQAPTILASLYA--DNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIV 98 (418) Q Consensus 21 ~~~~~~~~~~~l~~~Y~--~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~ 98 (418) |-++. +..++....+ ...+++.|||.+++-+.-+||.....++.+.+.+.|++-++.....++++.+.+||.|+++ T Consensus 1 ~l~~~--~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~ 78 (434) T protein:vir:98 1 MLPKN--AEQAFLDFQRKARTNFCGLIANASVHRLLALGVTGPDGEPDTRASRWWQANRLDSRQKLVWRMAMAQSAGYML 78 (434) T ss_pred CCCCC--ccHHHHHhhhhhhccchHHHHHHHHhhhccCceecCCCchHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEE Confidence 22222 2334444432 3579999999999999888988765555556777788888889999999999999999998 Q ss_pred EeecCCCcccccccCCCceEEEEEeecccccccc-------------cccccccc------ccCcceEEEEecCCcccc- Q lcl|NC_019404. 99 AIVKDNRALTSPVREGAELETVRVYDRTQVKVQN-------------REENPRNA------RFGKPLTYRITTNESDMF- 158 (418) Q Consensus 99 i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~-------------~~~dp~s~------~yg~p~~y~i~~~~~~~~- 158 (418) +..+.+.... .....--|+++++.++.+.+ +..+...- .++....|.......... T Consensus 79 v~~~~~~~~~----~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (434) T protein:vir:98 79 VGAHPTRTED----NGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYARVFFDDTSFPYRTRERTGARLP 154 (434) T ss_pred EecCCCcccc----cCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEEEEEEeCcEEEEEEeeccccccc Confidence 8754321110 00000112333333322211 11100000 001111111110000000 Q ss_pred ---------cccCcccEEEecCccc-hhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHh Q lcl|NC_019404. 159 ---------YDVHYSRIHIIDGERV-PNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAEL 228 (418) Q Consensus 159 ---------~~iH~SR~i~~~g~~l-p~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~ 228 (418) ...|...-+.|..-|+ |..-.+....+|.|.++ ++.+.+.++++++.........++.+...+.+.... T Consensus 155 ~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e-~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~ 233 (434) T protein:vir:98 155 WGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFA-GVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFA 233 (434) T ss_pred cccccceecccccccccCCCCccceEEeccCCCcCcCCcchhh-hHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcc Confidence 0011111111111111 11111222357999996 688999999999887777666666554444332110 Q ss_pred hcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeec---ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccc Q lcl|NC_019404. 229 CDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNS---DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLS 305 (418) Q Consensus 229 ~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~---~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~ 305 (418) ........... .........+.+.+.. +++.+..+. ++.+..+.++....++++.+++|...|.|.. -| T Consensus 234 -~~~~~~~~~~~---~~~~~~~~~~~i~~~~-~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~---~n 305 (434) T protein:vir:98 234 -KRTDPATGMTV---VDQPFVPSPSAVWASE-GENTQFGQLDATDLSGFLKEHASDVRDMLTISQTPTYLYATDL---VN 305 (434) T ss_pred -cccccccccch---hhhhhhccccccccCC-CCCceEEEecCcchHHHHHHHHHHHHHHhcccCCCHHHhcccc---CC Confidence 00011000111 0111111222333322 233443333 4455666777889999999999988775532 13 Q ss_pred cchhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019404. 306 SSQNTA---LETFHKLIDRKRNAELLPILEFLIPFIVNA-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIA 375 (418) Q Consensus 306 stge~d---~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~ 375 (418) +||+.- ....-..++.+|+ .++..+.+++.+++.- .++.+.|.+-..++..+.|+. ++++.+ T Consensus 306 ~Sg~Al~~~~~~l~~k~~~k~~-~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ada-------~~kl~~ 377 (434) T protein:vir:98 306 ISADTIGALDILHVAKVREHIA-SFSEGLESVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVKADA-------ATKLKS 377 (434) T ss_pred hHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCChhheeeeEEecCCCCCCHHHHHHH-------HHHHHh Confidence 344433 3344445555553 4677888888877642 257889999998988877655 444444 Q ss_pred CCC----------CCHHHHHHHHHhhcC----------cCCC-ChhhcccccccCCCcccc Q lcl|NC_019404. 376 AGA----------MDIKEARDTLRTIAP----------EIKI-GDNDIQTEESELITETEV 415 (418) Q Consensus 376 ~g~----------i~~~e~r~~l~~~~~----------~~~~-~~~~~~~~e~~~~~e~e~ 415 (418) +|+ .+++|+.+..++... -.+- .....++++.-+ .+ T Consensus 378 ~g~~~e~~~~~lg~~~~e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~----dg 434 (434) T protein:vir:98 378 IGYPLDVIAEELDESPARVRRIVAGAASQALLAASLLPAPGAPSAGNVPDSGGAV----DG 434 (434) T ss_pred cCCcHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCcccCCC----CC Confidence 443 122232221111000 0000 000001111111 11 No 100 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=99.57 E-value=5.3e-15 Score=98.85 Aligned_cols=351 Identities=9% Similarity=0.086 Sum_probs=172.6 Q ss_pred chhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcc--hHHHHHHHHH----HhC Q lcl|NC_019404. 4 TDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGID--DEPAFWSRWD----DLE 77 (418) Q Consensus 4 ~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~--d~~~i~~~~~----~l~ 77 (418) |-=|.++| +............... .-....|..++.++++|+.++.++.+-++.+.-.+ ....+...+. ..- T Consensus 1 Mg~f~~~f-~~~~~~~~~~~~~~~~-~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~~~~l~~lL~~~PN~~~ 78 (385) T protein:vir:95 1 MGLFDSVF-KRHSELSWMYDLEFLQ-DKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTKEKGTLYYLLNVRPNRNQ 78 (385) T ss_pred Cchhhhhh-ccCcccccccchhhhh-ccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCccccchHHHHHhcccCcCC Confidence 22233333 3221111100000000 01123456788899999999999999888874221 1112222221 122 Q ss_pred chHHHHHH-HHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcc Q lcl|NC_019404. 78 MTQNINDA-WSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESD 156 (418) Q Consensus 78 ~~~~~~~a-~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~ 156 (418) .+..|.+. +.+-.++|.|++++. +++.. + +.. +...+.... ... +.+|.+...... T Consensus 79 t~~~f~~~~~~~l~l~Gna~i~~~-~~~~~----------~----~~~-~~~~~~~~~--~~~-----~~~~~~~~~~~~ 135 (385) T protein:vir:95 79 NAVDFWQKFIFKLIMDNEVLVVKN-DEGHF----------F----VAD-DFEKEDELG--LYS-----HRFTNVLVNDFE 135 (385) T ss_pred CHHHHHHHHHHHHhhcCceEEEEe-cCCCe----------e----ecc-ccccccccc--ccc-----ccceeeeecccc Confidence 23444444 444557899998763 23221 0 000 001111000 000 112222222222 Q ss_pred cccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc-eeecchHHHhhcCcchH Q lcl|NC_019404. 157 MFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA-VWKAKGLAELCDDSEGF 235 (418) Q Consensus 157 ~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~-v~k~~~l~~~~~~~~~~ 235 (418) ....+-++.||||.... .....+|.|++.. +.+.+.....+. .+..+.. +++++.. ...+.+.. T Consensus 136 ~~~~~~~~eiih~~~~~------~~~~~~G~s~~~~-~~~~i~~~~~~~------~~~~~~~g~l~~~~~--~~~~~e~~ 200 (385) T protein:vir:95 136 FKRVFTMDDVIYLKYNN------QKLDAFSLGLFED-YGEIFGRMIDLQ------MLNNQIRGILKVDAT--KFYNKEKQ 200 (385) T ss_pred eeeeeccccEEEecCCC------CCcccccchHHHH-HHHHHHHHHHHH------HhcCCCceEEEeCCc--cCCCHHHH Confidence 22456678899996432 2334579999864 555554332221 1222222 3333321 11122222 Q ss_pred HHHHHHHHHHHH-hcCCcceeEEEcCCCceeEeecccC--------CHHHHHHHHHHHHhhhhcCCeeeeeccCcccccc Q lcl|NC_019404. 236 GAARLRLAQVDN-NSGVGQAIGIDAESEEYSVLNSDIG--------GIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSS 306 (418) Q Consensus 236 ~~~~~r~~~~~~-~~~~~~~~~~d~~~e~~~~~~~~~~--------gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~s 306 (418) ....+++...-. ..+..+.+++..++.+|+.++.... .+.+.......+||.+.+||..+|.| . .+ T Consensus 201 ~~~~~~~~~~~~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~----~-~s 275 (385) T protein:vir:95 201 KELQAYIDTLFDAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLG----E-MA 275 (385) T ss_pred HHHHHHHHHHhhhhhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcC----C-Cc Confidence 333444433322 1234455555556678888875432 34556777888899999999988732 2 23 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---cc-C-----CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019404. 307 SQNTALETFHKLIDRKRNAELLPILEFLIPFIV---NA-E-----EWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAG 377 (418) Q Consensus 307 tge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~~-~-----~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g 377 (418) +-++....||.. .|.|.+..+-..+- .. . .++|+++.|...|.+++ ++++++++++| T Consensus 276 n~e~~~~~~~~~-------~l~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~-------~~~~~~~~~~g 341 (385) T protein:vir:95 276 DLEKTIESYLQF-------CINPLLRKIEAELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKL-------SEAIDKLVASG 341 (385) T ss_pred CHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhhcccceEEEechhhhccCHHHH-------HHHHHHHHhCC Confidence 445555666553 37887777655542 11 1 24556668877777654 67788899999 Q ss_pred CCCHHHHHHHHHhhcCcC-CCChh-----hccccc--ccCCCccc Q lcl|NC_019404. 378 AMDIKEARDTLRTIAPEI-KIGDN-----DIQTEE--SELITETE 414 (418) Q Consensus 378 ~i~~~e~r~~l~~~~~~~-~~~~~-----~~~~~e--~~~~~e~e 414 (418) ++|++|+|+.+. ..|.. +.+++ +....+ ...+..+| T Consensus 342 ~lt~NE~R~~~g-~~p~~~~~gd~~~~~~n~~~~~~~kgge~~~e 385 (385) T protein:vir:95 342 TFTRNQVRIMTG-EEPADDPELDKFIITKNLQSADAFKGGESNEE 385 (385) T ss_pred CcCHHHHHHHhC-CCCCCCCCCceeeecccceecccccCCCCCCC Confidence 999999999773 22211 11111 111111 12222233 No 101 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.56 E-value=6.6e-15 Score=98.33 Aligned_cols=367 Identities=11% Similarity=0.118 Sum_probs=186.6 Q ss_pred CccchhhHHH---------------HhcCCCCccccCccccCCHHHHHHHH----------------------------- Q lcl|NC_019404. 1 MVKTDSYANI---------------FLGGSDGSEIYGSLQNQAPTILASLY----------------------------- 36 (418) Q Consensus 1 ~~~~D~~~n~---------------~~g~~~~~~~~~~~~~~~~~~l~~~Y----------------------------- 36 (418) ++.++++.-- +....... ....+..+| T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~---------~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ 82 (503) T protein:vir:59 12 TEELNEIIVESAKEIAEPDTTMIQKLIDEHNPE---------PLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNN 82 (503) T ss_pred HHhHHHhhhhhhhhccchhHHHHHHHHHhhcHH---------HHHHHHHHhccccchhhccchhcccccccccccccccc Confidence 2222222110 11111000 011111111 Q ss_pred -HcCCccchhhhcchhhhccCCccccCcchH-HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCC Q lcl|NC_019404. 37 -ADNALVRRIIDTIPETALAAGFHIDGIDDE-PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREG 114 (418) Q Consensus 37 -~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~ 114 (418) ..+++++.||+..+..++.+++.++++++. ..+.+.|.+-++...+.++.+.+..||.|++++.++.+. T Consensus 83 ri~~n~~~~ivd~~~~yl~g~~~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg--------- 153 (503) T protein:vir:59 83 RTSHAWHKLFVDQKTQYLVGEPVTFTSDNKTLLEYVNELADDDFDDILNETVKNMSNKGIEYWHPFVDEEG--------- 153 (503) T ss_pred eeecchHHHHHHHHHhhhhcCCeeeccCcHHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEeecCCC--------- Confidence 136789999999999999999999876654 234445555578888999999999999999998774321 Q ss_pred CceEEEEEeeccccccccccccccccccCcceEEEEecCCccc--ccccC-cccEEEec--------------------- Q lcl|NC_019404. 115 AELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDM--FYDVH-YSRIHIID--------------------- 170 (418) Q Consensus 115 ~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~--~~~iH-~SR~i~~~--------------------- 170 (418) .+ .+.++++.++-+.+-+.....+.++ ..+|......+.. ..++| +.++.+|. T Consensus 154 -~~-~i~~~~p~~~~~i~d~~~~~~~~~~-ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (503) T protein:vir:59 154 -EF-DYVIFPAEEMIVVYKDNTRRDILFA-LRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHM 230 (503) T ss_pred -ce-EEEEEccceeEEEEeCCCCCceEEE-EEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccce Confidence 12 2344444443332211111111111 0111111100000 00011 11111111 Q ss_pred ---Ccc-----chhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHH Q lcl|NC_019404. 171 ---GER-----VPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRL 242 (418) Q Consensus 171 ---g~~-----lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~ 242 (418) +.+ +|. ....++.+|.|.++. +-+.+.+++.+....+..+..++..++.+.+.. +.........+ T Consensus 231 ~~~~~~~~~~~vPi-v~~~nn~~~~sd~~~-~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~-----~~~~~~~~~~~ 303 (503) T protein:vir:59 231 TKGGQAIGWGRVPI-IPFKNNEEMVSDLKF-YKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYD-----GENPKEFTANL 303 (503) T ss_pred eecceeccCCccce-EEecCCCCCCcchhh-hHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCC-----ccccchhhhhh Confidence 011 111 122345679998874 889999999998888888888888888776531 11111111111 Q ss_pred HHHHHhcCCcceeEEEcCCCceeEe--ecccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHH---HH Q lcl|NC_019404. 243 AQVDNNSGVGQAIGIDAESEEYSVL--NSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETF---HK 317 (418) Q Consensus 243 ~~~~~~~~~~~~~~~d~~~e~~~~~--~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y---~~ 317 (418) .....+.+.+ +.+.+.+ +.+..++...++.+.+.|...+.+|-.-. + .-+| |.||..-...| .. T Consensus 304 -------~~~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~-~~~~-~~Sg~Ai~~~~~~l~~ 372 (503) T protein:vir:59 304 -------RYHSVIKVSG-DGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSP-E-TIGG-GATGPALENLYALLDL 372 (503) T ss_pred -------hcccceeccC-CCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCc-c-cccc-cccHHHHHHHHHHHHH Confidence 1122333333 3445544 45567888899999999988888885432 1 1112 34454433223 33 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhc------------cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_019404. 318 LIDRKRNAELLPILEFLIPFIVN------------AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEAR 385 (418) Q Consensus 318 ~I~~~Qe~~l~p~l~~l~~~i~~------------~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r 385 (418) .++.+ +..++..|++++.+++. ..++.+.|++-...++++. ++++.+++++|++|.+.+. T Consensus 373 k~~~~-~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~-------~~~~~kl~~~GiiS~et~l 444 (503) T protein:vir:59 373 KANMA-ERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEI-------VQSLVQGVTGGIMSKETAV 444 (503) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCCCCCHHHH-------HHHHHHHHhCCCCchHHHH Confidence 33333 34567888888777643 1258999999999998876 4555666667776665555 Q ss_pred HHHHhhcCcCCCChhhc---------------------ccccc---------cCCCccccccC Q lcl|NC_019404. 386 DTLRTIAPEIKIGDNDI---------------------QTEES---------ELITETEVVIA 418 (418) Q Consensus 386 ~~l~~~~~~~~~~~~~~---------------------~~~e~---------~~~~e~e~~~~ 418 (418) ..|. +..-..+++ ...++ +.+....+-.| T Consensus 445 ~~l~----~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 445 ARNP----FVQDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPNAGAAESGGAGQVS 503 (503) T ss_pred HhCC----CCCCHHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCCCCcccCCCCCCcC Confidence 4321 100000000 00000 01111122222 No 102 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.56 E-value=1.6e-14 Score=96.21 Aligned_cols=368 Identities=10% Similarity=0.041 Sum_probs=184.3 Q ss_pred CccchhhHHHHhcCCCC-cc-ccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH-HHHHHHHHHhC Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDG-SE-IYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE-PAFWSRWDDLE 77 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~-~~-~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-~~i~~~~~~l~ 77 (418) +-+.+=+..-..|-+.- .+ .+..............--.+.+++.||+..+..++.+++.++++++. ..+.+.+.+-+ T Consensus 43 ~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~ 122 (474) T protein:vir:95 43 LDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDESVLKIIHDVLDTR 122 (474) T ss_pred HHHHHHHHHHhcccCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCceeccCchHHHHHHHHHHhcc Confidence 11111111112222110 00 00000000000000001136899999999999999999999876543 23334444456 Q ss_pred chHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccc---------------- Q lcl|NC_019404. 78 MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNAR---------------- 141 (418) Q Consensus 78 ~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~---------------- 141 (418) ....+.++.+.+..||.|++++.++.+.. + .+.++++.++-|.+.+.+...+. T Consensus 123 ~~~~~~e~~~~~~~~G~~~~~v~~d~~~~----------~-~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~ 191 (474) T protein:vir:95 123 WDNKLIDILTATSNKGIDWLQVYINENGE----------M-KLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEF 191 (474) T ss_pred HHHHHHHHHHHHhhcCcEEEEEEecCCCc----------e-EEEEEcccceEEEEcCCCCCceEEEEEEEEEcCeeEEEE Confidence 77888999999999999999987743221 1 12233333322221110000110 Q ss_pred cCc--ceEEEEecCCccc-----------ccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 142 FGK--PLTYRITTNESDM-----------FYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLA 208 (418) Q Consensus 142 yg~--p~~y~i~~~~~~~-----------~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~ 208 (418) |.. ...|....++... ...-|+--. +|.. ...++.+|.|.++ .+.+.+.+++.+.... T Consensus 192 y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-------iPvv-~~~nn~~g~sd~e-~v~~liDa~d~~~S~~ 262 (474) T protein:vir:95 192 WTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGR-------VPFI-AFKNNPEEVSDIW-MYKSLIDAIDKRLSDA 262 (474) T ss_pred EeCCeEEEEEEcCCccccccccCcccccccccccCCCc-------cceE-eecCCCCCCCcHH-HHHHHHHHHHHHHHHH Confidence 000 0112111110000 000121111 1111 1234567899886 4889999999999999 Q ss_pred HHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeE--eecccCCHHHHHHHHHHHHh Q lcl|NC_019404. 209 TQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSV--LNSDIGGIDAFLDKKFDRIV 286 (418) Q Consensus 209 ~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~--~~~~~~gl~~~~~~~~~~ia 286 (418) +..+..++..++...+.. +........ .......+.+++ +.+.+. .+.+.++....++.+.++|. T Consensus 263 ~~~~~~~~~p~lv~~g~~-----~~~~~~~~~-------~~~~~~~i~~~~-~~~~~~l~~~~~~~~~~~~~~~l~~~i~ 329 (474) T protein:vir:95 263 QNMFDESVELIYILKGYE-----GQDLEEFMR-------GLKYYKAINVDG-DGGVETIQVEVPVSSTKEYIDLMRAYIM 329 (474) T ss_pred HHHHHHhcCceeeeecCC-----cccchhhhh-------hhhccceeeccC-CCceeEEeecCCHHHHHHHHHHHHHHHH Confidence 888888888877766532 111111111 111233444444 344544 45667789999999999999 Q ss_pred hhhcCCeeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhcc-------CCceEEeCCCCCCCHH Q lcl|NC_019404. 287 ALSGIHEIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEFLIPFIVNA-------EEWSVEFSPLDHESSK 357 (418) Q Consensus 287 aas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~l~~~i~~~-------~~~~~~f~pL~~~~ek 357 (418) ..+++|-. -++ +.+| |.||..-...|..... ...+..++..+++++.+|+.- .++.+.|++-...+++ T Consensus 330 ~~s~~p~~-~~~-~~~~-n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~v~f~~~~p~d~~ 406 (474) T protein:vir:95 330 EFGQGVDF-QTD-KFGS-APSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLKMDVKDIEISFNFNRMMNDA 406 (474) T ss_pred HHhCCccc-ccc-cccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCCCcCHH Confidence 99999963 222 2223 3456554444433322 333345788888888877532 3678999998888888 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCHHHHHH-------------HHHh-h-------cCcCCCChhhcccccccCCCccc Q lcl|NC_019404. 358 DKAEVLEKSVNSIAALIAAGAMDIKEARD-------------TLRT-I-------APEIKIGDNDIQTEESELITETE 414 (418) Q Consensus 358 e~ae~~~~~a~a~~~~~~~g~i~~~e~r~-------------~l~~-~-------~~~~~~~~~~~~~~e~~~~~e~e 414 (418) |.|++.. ++|+||.+.+.. ++++ . ....+...++.++.+.+.+.+.| T Consensus 407 e~a~~~~----------~~g~iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 407 EQSQIIA----------QSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNDKESE 474 (474) T ss_pred HHHHHHH----------hcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccccCCCCcCCCCCccCCCC Confidence 8776532 235555444433 2211 0 01111111112222222222333 No 103 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.55 E-value=2.2e-14 Score=95.49 Aligned_cols=374 Identities=11% Similarity=0.035 Sum_probs=187.5 Q ss_pred CccchhhHHHHhcCCCCc-cc-cCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH-HHHHHHHHHhC Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGS-EI-YGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE-PAFWSRWDDLE 77 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~-~~-~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-~~i~~~~~~l~ 77 (418) +-|.+-+..-..|-+.-- +. +..............--.+++++.||+..+..++.+++.++++++. ..+.+.|.+-+ T Consensus 43 ~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~ 122 (474) T protein:vir:97 43 LDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLDTR 122 (474) T ss_pred HHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHhcc Confidence 111121222222221100 00 0000000000000000136889999999999999999999876543 34445555557 Q ss_pred chHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccC-------------- Q lcl|NC_019404. 78 MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFG-------------- 143 (418) Q Consensus 78 ~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg-------------- 143 (418) ....+.++.+.+..||.|++++..+.+.. + .+.++++.++-|.+-+.+...+.++ T Consensus 123 ~~~~~~e~~~~~~~~G~~~~~~~~d~~~~----------~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~ 191 (474) T protein:vir:97 123 WDNKLIDILTATSNKGIDWLQVYINENGE----------M-KLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEF 191 (474) T ss_pred HHHHHHHHHHHHhhcCceEEEEEecCCCe----------e-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEE Confidence 88899999999999999999887743221 1 2233344333332211111111111 Q ss_pred -cc-e--EEEEecCCcccc-----cccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 144 -KP-L--TYRITTNESDMF-----YDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRR 214 (418) Q Consensus 144 -~p-~--~y~i~~~~~~~~-----~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~ 214 (418) .+ . .|....+..... ..+.....-+.-| .+|. ....++.+|.|.++ .+.+.+.+++.+....+.-+.. T Consensus 192 yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~vPv-v~~~nn~~g~sd~e-~v~~liDa~n~~~s~~~~~~~~ 268 (474) T protein:vir:97 192 WTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWG-RVPF-IAFKNNPEEVSDIW-MYKSIIDAIDKRLSDAQNMFDE 268 (474) T ss_pred EeCCeEEEEEEcCCccccccccCcCcccccccccCCC-ccce-EEecCCcCCCCcHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 01 1 111111100000 0000000011111 1121 11234567999887 5889999999999998888888 Q ss_pred cCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEee--cccCCHHHHHHHHHHHHhhhhcCC Q lcl|NC_019404. 215 KQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIH 292 (418) Q Consensus 215 ~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~--~~~~gl~~~~~~~~~~iaaas~IP 292 (418) ++...+.+.+.. +........ .......+.+++ +.+.+.++ .+.++....++.+.+.|...+++| T Consensus 269 ~~~~~lv~~g~~-----~~~~~~~~~-------~~~~~~~i~~~~-~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p 335 (474) T protein:vir:97 269 SVELIYILKGYE-----GEDLEEFMR-------GLKYYKAINVDG-DGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGV 335 (474) T ss_pred hcCceeeeecCC-----cccchhhhh-------hhhccceeeccC-CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcc Confidence 888887777532 111111111 111233444444 35555544 566788899999999999999999 Q ss_pred eeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhcc-------CCceEEeCCCCCCCHHHHHHHH Q lcl|NC_019404. 293 EIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEFLIPFIVNA-------EEWSVEFSPLDHESSKDKAEVL 363 (418) Q Consensus 293 ~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~l~~~i~~~-------~~~~~~f~pL~~~~eke~ae~~ 363 (418) -.-. .+.+| |.||..-...|...+. ..++..++..+++++.+++.- .++++.|++-...+++|.|++. T Consensus 336 ~~~~--~~~~~-n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~ 412 (474) T protein:vir:97 336 DFQT--DKFGS-APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQII 412 (474) T ss_pred ccCc--ccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHH Confidence 6432 12223 3456543333333322 333456788888888887541 3578999998888888877653 Q ss_pred HHHHHHHHHHHhCCCCCHHHHHHHH-------------Hh-h------c-CcCCCChhhcccccccCCCccc Q lcl|NC_019404. 364 EKSVNSIAALIAAGAMDIKEARDTL-------------RT-I------A-PEIKIGDNDIQTEESELITETE 414 (418) Q Consensus 364 ~~~a~a~~~~~~~g~i~~~e~r~~l-------------~~-~------~-~~~~~~~~~~~~~e~~~~~e~e 414 (418) . ++|++|.+.+...+ ++ . . +..+...++-++++.+...++| T Consensus 413 ~----------~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 413 A----------QSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred H----------HcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCcccccC Confidence 2 23555554444322 11 0 0 0111111111112222223344 No 104 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.55 E-value=2.2e-14 Score=95.49 Aligned_cols=374 Identities=11% Similarity=0.035 Sum_probs=187.5 Q ss_pred CccchhhHHHHhcCCCCc-cc-cCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH-HHHHHHHHHhC Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGS-EI-YGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE-PAFWSRWDDLE 77 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~-~~-~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-~~i~~~~~~l~ 77 (418) +-|.+-+..-..|-+.-- +. +..............--.+++++.||+..+..++.+++.++++++. ..+.+.|.+-+ T Consensus 43 ~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~ 122 (474) T protein:vir:94 43 LDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLDTR 122 (474) T ss_pred HHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHhcc Confidence 111121222222221100 00 0000000000000000136889999999999999999999876543 34445555557 Q ss_pred chHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccC-------------- Q lcl|NC_019404. 78 MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFG-------------- 143 (418) Q Consensus 78 ~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg-------------- 143 (418) ....+.++.+.+..||.|++++..+.+.. + .+.++++.++-|.+-+.+...+.++ T Consensus 123 ~~~~~~e~~~~~~~~G~~~~~~~~d~~~~----------~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~ 191 (474) T protein:vir:94 123 WDNKLIDILTATSNKGIDWLQVYINENGE----------M-KLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEF 191 (474) T ss_pred HHHHHHHHHHHHhhcCceEEEEEecCCCe----------e-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEE Confidence 88899999999999999999887743221 1 2233344333332211111111111 Q ss_pred -cc-e--EEEEecCCcccc-----cccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 144 -KP-L--TYRITTNESDMF-----YDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRR 214 (418) Q Consensus 144 -~p-~--~y~i~~~~~~~~-----~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~ 214 (418) .+ . .|....+..... ..+.....-+.-| .+|. ....++.+|.|.++ .+.+.+.+++.+....+.-+.. T Consensus 192 yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~vPv-v~~~nn~~g~sd~e-~v~~liDa~n~~~s~~~~~~~~ 268 (474) T protein:vir:94 192 WTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWG-RVPF-IAFKNNPEEVSDIW-MYKSIIDAIDKRLSDAQNMFDE 268 (474) T ss_pred EeCCeEEEEEEcCCccccccccCcCcccccccccCCC-ccce-EEecCCcCCCCcHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 01 1 111111100000 0000000011111 1121 11234567999887 5889999999999998888888 Q ss_pred cCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEee--cccCCHHHHHHHHHHHHhhhhcCC Q lcl|NC_019404. 215 KQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIH 292 (418) Q Consensus 215 ~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~--~~~~gl~~~~~~~~~~iaaas~IP 292 (418) ++...+.+.+.. +........ .......+.+++ +.+.+.++ .+.++....++.+.+.|...+++| T Consensus 269 ~~~~~lv~~g~~-----~~~~~~~~~-------~~~~~~~i~~~~-~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p 335 (474) T protein:vir:94 269 SVELIYILKGYE-----GEDLEEFMR-------GLKYYKAINVDG-DGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGV 335 (474) T ss_pred hcCceeeeecCC-----cccchhhhh-------hhhccceeeccC-CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcc Confidence 888887777532 111111111 111233444444 35555544 566788899999999999999999 Q ss_pred eeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhcc-------CCceEEeCCCCCCCHHHHHHHH Q lcl|NC_019404. 293 EIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEFLIPFIVNA-------EEWSVEFSPLDHESSKDKAEVL 363 (418) Q Consensus 293 ~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~l~~~i~~~-------~~~~~~f~pL~~~~eke~ae~~ 363 (418) -.-. .+.+| |.||..-...|...+. ..++..++..+++++.+++.- .++++.|++-...+++|.|++. T Consensus 336 ~~~~--~~~~~-n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~ 412 (474) T protein:vir:94 336 DFQT--DKFGS-APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQII 412 (474) T ss_pred ccCc--ccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHH Confidence 6432 12223 3456543333333322 333456788888888887541 3578999998888888877653 Q ss_pred HHHHHHHHHHHhCCCCCHHHHHHHH-------------Hh-h------c-CcCCCChhhcccccccCCCccc Q lcl|NC_019404. 364 EKSVNSIAALIAAGAMDIKEARDTL-------------RT-I------A-PEIKIGDNDIQTEESELITETE 414 (418) Q Consensus 364 ~~~a~a~~~~~~~g~i~~~e~r~~l-------------~~-~------~-~~~~~~~~~~~~~e~~~~~e~e 414 (418) . ++|++|.+.+...+ ++ . . +..+...++-++++.+...++| T Consensus 413 ~----------~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 413 A----------QSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred H----------HcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCcccccC Confidence 2 23555554444322 11 0 0 0111111111112222223344 No 105 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.55 E-value=4.7e-14 Score=93.65 Aligned_cols=382 Identities=10% Similarity=0.064 Sum_probs=192.0 Q ss_pred CccchhhHHHHhcCCCC-ccccCccccCCHHHHHHH-HHcCCccchhhhcchhhhccCCccccCcchH-HHHHHHHHHhC Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDG-SEIYGSLQNQAPTILASL-YADNALVRRIIDTIPETALAAGFHIDGIDDE-PAFWSRWDDLE 77 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~-~~~~~~~~~~~~~~l~~~-Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-~~i~~~~~~l~ 77 (418) +-+.+-+...+-|-+.- .+..-............- -..+++++.||+..+...+.+++.+.++++. ....+.|..-+ T Consensus 40 ~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~n~ 119 (472) T protein:vir:93 40 LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLGNR 119 (472) T ss_pred HHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhhhcccCeeeccCChHHHHHHHHHHhcc Confidence 12222222222222110 000000000000000000 0136999999999999999999999876543 22333444447 Q ss_pred chHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccc---------------- Q lcl|NC_019404. 78 MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNAR---------------- 141 (418) Q Consensus 78 ~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~---------------- 141 (418) ....+.++.+.+..||.|++++..+++.. + .+.++++.++.+.+-+.....+. T Consensus 120 ~~~~~~~~~~~~~~~G~~~~~v~~d~d~~----------~-~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~ 188 (472) T protein:vir:93 120 FDDKLHSVLTGASNKGIEWLHPYLDEEGE----------F-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEY 188 (472) T ss_pred HHHHHHHHHHHHhhcCeEEEEEEECCCCc----------e-EEEEEcccceEEEEcCCCCCceEEEEEEEEeecceeEEE Confidence 77888999999999999999887743211 1 12233333322221000000000 Q ss_pred c--CcceEEEEecCCc------c-cccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 142 F--GKPLTYRITTNES------D-MFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLL 212 (418) Q Consensus 142 y--g~p~~y~i~~~~~------~-~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~ 212 (418) | +....|.+..+.. . ....+|. .-+.-| .+|. ....++.+|.|.++ .+.+.+.+++.++...+..+ T Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-~vPv-v~~~nn~~g~s~~e-~v~~liDa~~~~~s~~~~~~ 263 (472) T protein:vir:93 189 WDKVTVNYYVYENGSLIPDYSNNLENSKTHF--STGSWG-KIPF-IPFKNNDLEISDIF-MYKTLIDAYNRRLSDLSNTF 263 (472) T ss_pred EecCeEEEEEEecCeeeeccccccccccccc--ccCCCC-Ccce-EEecCCCCCCCchh-hhHHHHHHHHHHHHHHHHHH Confidence 1 1111222211100 0 0001110 000000 0111 12234568999997 48899999999999888888 Q ss_pred HHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEe--ecccCCHHHHHHHHHHHHhhhhc Q lcl|NC_019404. 213 RRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVL--NSDIGGIDAFLDKKFDRIVALSG 290 (418) Q Consensus 213 ~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~--~~~~~gl~~~~~~~~~~iaaas~ 290 (418) ..++...+...+... .........+ ...+.+.++ ++.+.+.+ +.+.+++...++.+.+.|...++ T Consensus 264 ~~~~~~~~~~~g~~~-----~~~~~~~~~~-------~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~ 330 (472) T protein:vir:93 264 KDSNELTYVLTNYDD-----QELPEFKRLL-------RYYGAIKVS-DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQ 330 (472) T ss_pred HHhcCceeEeecCCc-----ccchhhHHHH-------hhccccccC-CCCcceeEeecCCHHHHHHHHHHHHHHHHHHhC Confidence 888888777765321 1111111111 112233333 33455554 55667899999999999999999 Q ss_pred CCeeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhcc-------CCceEEeCCCCCCCHHHHHH Q lcl|NC_019404. 291 IHEIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEFLIPFIVNA-------EEWSVEFSPLDHESSKDKAE 361 (418) Q Consensus 291 IP~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~l~~~i~~~-------~~~~~~f~pL~~~~eke~ae 361 (418) +|-.-+ + .-+| |.||+.-.-.|...+. ..++..+...+++++++++.- .++.+.|++-...+..+.++ T Consensus 331 ~p~~~~-~-~~~~-n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~~~ 407 (472) T protein:vir:93 331 AVDFSS-D-KFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQ 407 (472) T ss_pred CCCCCc-c-cccc-CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeCCCCCCCHHHHHH Confidence 996533 2 1122 3455543323333322 333445778888888776532 36789999999999999988 Q ss_pred HHHHHHHHHHH---HHhCCCCC-HHHHHHHHHh--------hcCcCCCChhhcccccccCCCccc Q lcl|NC_019404. 362 VLEKSVNSIAA---LIAAGAMD-IKEARDTLRT--------IAPEIKIGDNDIQTEESELITETE 414 (418) Q Consensus 362 ~~~~~a~a~~~---~~~~g~i~-~~e~r~~l~~--------~~~~~~~~~~~~~~~e~~~~~e~e 414 (418) +..+.+.+++. +-..+.++ +++..+++++ .....+...++-++.+.+.+.++| T Consensus 408 ~~~k~~giis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~e 472 (472) T protein:vir:93 408 TAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKESE 472 (472) T ss_pred HHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccCcCcccCCCCCCCCCCCcccCC Confidence 87766543332 22334332 3333333311 111111122222334444444555 No 106 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.55 E-value=1.9e-15 Score=101.34 Aligned_cols=393 Identities=13% Similarity=0.107 Sum_probs=184.6 Q ss_pred CccchhhHHHHhcCC---CC-----cccc-C-----ccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcch- Q lcl|NC_019404. 1 MVKTDSYANIFLGGS---DG-----SEIY-G-----SLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDD- 65 (418) Q Consensus 1 ~~~~D~~~n~~~g~~---~~-----~~~~-~-----~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d- 65 (418) |.|-+-..+.+.... .. .++| | +...-.+.++...-..+.++++||+.+++.+.-+||.+.++++ T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~d~~~ 80 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCceecCCCchh Confidence 444444333221100 00 0011 1 0001123344444445678999999999999888988765433 Q ss_pred HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccc---c---- Q lcl|NC_019404. 66 EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENP---R---- 138 (418) Q Consensus 66 ~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp---~---- 138 (418) .+.+.+-|++-++.....++++.+.+||.|++++....... .+..+.+ .++++++.++.+.+-.... . T Consensus 81 ~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~----~d~~g~~-~i~~~~p~~~~~~~D~~~~~~~~~~i~ 155 (480) T protein:vir:78 81 LEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVES----GDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) T ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCcccc----CCCCCee-EEEEEcccceEEEEcCCCccceEEEEE Confidence 34677777777888999999999999999998876421111 1111211 2334444333321100000 0 Q ss_pred ----ccccCcceEE---------EE--ecCCccccccc------Cc-cc--EEEecCccchhhhhhccccCCcchHHHHH Q lcl|NC_019404. 139 ----NARFGKPLTY---------RI--TTNESDMFYDV------HY-SR--IHIIDGERVPNAMRRQNDGWGRSVLSSDI 194 (418) Q Consensus 139 ----s~~yg~p~~y---------~i--~~~~~~~~~~i------H~-SR--~i~~~g~~lp~~~~~~~~~~G~S~l~~~~ 194 (418) ..+.+.+..+ ++ .++.. ..... |+ .+ |+.|... .....+||.|.++..+ T Consensus 156 ~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~vPvv~f~n~------~~~~~~~G~s~i~~~v 228 (480) T protein:vir:78 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLN-DQWVVDGDVIKHGLGVVPVVPLTND------PRLGNRYGRSEISPEL 228 (480) T ss_pred EEEeecCCCceEEEEEEeCCeEEEEEecCCCc-cccccccccccCCCCCcceEEeecc------cccCCccCcccchhhH Confidence 0011222111 11 11110 00011 11 11 1122111 1223468999887667 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeec---cc Q lcl|NC_019404. 195 LDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNS---DI 271 (418) Q Consensus 195 ~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~---~~ 271 (418) .+.+.++++++...+..+..++.+...+.+........+... ..+.. . .+..+.+. +++....+. ++ T Consensus 229 ~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~---~~~~~---~--~~~~~~~~--~~~~~~~~~~~~~~ 298 (480) T protein:vir:78 229 RKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN---TTLDI---Y--YGRILTLA--SEAAKISEFKAAEL 298 (480) T ss_pred HHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCcccccccccc---chhhh---h--hhhhccCC--CCCceEEecCccCH Confidence 788888888888877777766655544443211000000000 00110 0 11112222 233333333 34 Q ss_pred CCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHH--HHHHHHHHHHHHHHhhcc-------- Q lcl|NC_019404. 272 GGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKR--NAELLPILEFLIPFIVNA-------- 341 (418) Q Consensus 272 ~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Q--e~~l~p~l~~l~~~i~~~-------- 341 (418) ....+.++....++++.+++|...|.|.+. . ++||+.-...+...+...+ +..+.+.|.+++.+++.- T Consensus 299 ~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n-~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~~~~~ 376 (480) T protein:vir:78 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSE-N-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) T ss_pred HHHHHHHHHHHHHHhcccCCChHHhccccC-c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcccc Confidence 456667778888899999999988755432 2 3566654444444333322 234677888887776531 Q ss_pred -CCceEEeCCCCCCCHHHHHHHHHHHHHHHH-------HHHhCCCCCHHHHHHH--HH-hhc----C-cCCCChhhcc-- Q lcl|NC_019404. 342 -EEWSVEFSPLDHESSKDKAEVLEKSVNSIA-------ALIAAGAMDIKEARDT--LR-TIA----P-EIKIGDNDIQ-- 403 (418) Q Consensus 342 -~~~~~~f~pL~~~~eke~ae~~~~~a~a~~-------~~~~~g~i~~~e~r~~--l~-~~~----~-~~~~~~~~~~-- 403 (418) .++.+.|.+-..++..+.++...+.+++.. .+-..|+.. +++.+. ++ +.. + ......++-. T Consensus 377 ~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~-d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 455 (480) T protein:vir:78 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTA-TQREQMRDWDKQETEDMIDTLYSTTKAQADAT 455 (480) T ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCH-hHHHHHHHHHHHHHHHHHHHhhccccccCCCC Confidence 247789998888998887776555444321 112234332 222110 00 000 0 0000000000 Q ss_pred cccccCCCccccccC Q lcl|NC_019404. 404 TEESELITETEVVIA 418 (418) Q Consensus 404 ~~e~~~~~e~e~~~~ 418 (418) ..+...+..+|.-=| T Consensus 456 ~~~~~~~~~~~~~~~ 470 (480) T protein:vir:78 456 PKPTVTETKTETQTS 470 (480) T ss_pred CCCCCCCCCCccccc Confidence 000111111122112 No 107 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.55 E-value=1.8e-14 Score=95.96 Aligned_cols=394 Identities=13% Similarity=0.059 Sum_probs=196.7 Q ss_pred Cc--------cchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--H--- Q lcl|NC_019404. 1 MV--------KTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--P--- 67 (418) Q Consensus 1 ~~--------~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~--- 67 (418) |- |.+-+..-+.|....-..... .........-..+++++.||+..+..++.+++.+++.++. + T Consensus 49 i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~---~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~~~ 125 (502) T protein:vir:48 49 INHHKLRQAPRIQELLDYARGENHDVLKSGR---RKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNEDNSQND 125 (502) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCcccccccc---ccccccccceeecchHHHHHHHHhhhhcccCeeEecCCccchhHHH Confidence 00 111111122232110000000 0000000011347999999999999999999999875432 2 Q ss_pred -HHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcce Q lcl|NC_019404. 68 -AFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPL 146 (418) Q Consensus 68 -~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~ 146 (418) .+.+-|++-++...+.++++.+..||.|++++..+.+. .+ .+.++++.++.+.+-+.....+.++- . T Consensus 126 ~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg----------~~-~i~~~~p~~~~~vydd~~~~~~~~~i-r 193 (502) T protein:vir:48 126 DAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYD----------ET-RIKRLSPLETFVIYDNSLEDNSIAAV-R 193 (502) T ss_pred HHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCC----------ce-EEEEEcccceEEEEcCCCCCceEEEE-E Confidence 24445556688899999999999999999988764221 11 23344444433322110000011110 1 Q ss_pred EEEEecCCc-cccccc-CcccEEEecCc--------------cchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 147 TYRITTNES-DMFYDV-HYSRIHIIDGE--------------RVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQ 210 (418) Q Consensus 147 ~y~i~~~~~-~~~~~i-H~SR~i~~~g~--------------~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~ 210 (418) +|.+..... ....++ -+.++.++.+. .+|. ....++.+|.|.++ .+.+.+.+++.+....+. T Consensus 194 ~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~g~vPv-v~~~nn~~g~sd~e-~v~~liDa~d~~~S~~~~ 271 (502) T protein:vir:48 194 YYNRGTLQNAKDVVEIYTNQHIYTLDASDSFNEISVTPHAFGTVPI-TEFLNNADGIGDYE-TELYLIDLYDSAESDTAN 271 (502) T ss_pred EEEEeecCCcEEEEEEEeCCeEEEEEeCCceeeccceecCCCccce-EEecCCCCCCCchh-hhHHHHHHHHHHHHHHHH Confidence 111110000 000011 11122222211 1121 11234567889887 488999999999999999 Q ss_pred HHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeE--eecccCCHHHHHHHHHHHHhhh Q lcl|NC_019404. 211 LLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSV--LNSDIGGIDAFLDKKFDRIVAL 288 (418) Q Consensus 211 l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~--~~~~~~gl~~~~~~~~~~iaaa 288 (418) .+..++..++.+.+..... .++ .....++....... .....--.+++-+++. .+.+..+....++.+.++|... T Consensus 272 ~~~~~~~~~lv~~g~~~~~-~~~-~~~~~~~~~~~~~~--~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~ 347 (502) T protein:vir:48 272 HMSDMADAILAIYGDLALP-QGM-QASDMKRTRLMQLK--PPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVF 347 (502) T ss_pred HHHHhcCceeeeecCcccc-ccc-chhhhhhcceeecc--ccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHH Confidence 9998888888777643211 111 11111111100000 0000000112234444 4456678999999999999999 Q ss_pred hcCCeeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhcc------------CCceEEeCCCCCC Q lcl|NC_019404. 289 SGIHEIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEFLIPFIVNA------------EEWSVEFSPLDHE 354 (418) Q Consensus 289 s~IP~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~l~~~i~~~------------~~~~~~f~pL~~~ 354 (418) +++|..-+ +.. +| |.||+.-...+..... ..++..++..+++++.+++.- .++.+.|+|-... T Consensus 348 s~~p~~~~-~~~-~~-n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~ 424 (502) T protein:vir:48 348 TNTPDMSD-NHF-SG-NASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPK 424 (502) T ss_pred hCCCCcCc-ccc-cc-CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCc Confidence 99997544 222 22 3456543322222221 333456788888888776431 2468999999999 Q ss_pred CHHHHHHHHHHHHHHHHH---HHhCCCCC-HHHHHHHHHh-hc--C---cC-CCCh---hhcccccccCCCccccccC Q lcl|NC_019404. 355 SSKDKAEVLEKSVNSIAA---LIAAGAMD-IKEARDTLRT-IA--P---EI-KIGD---NDIQTEESELITETEVVIA 418 (418) Q Consensus 355 ~eke~ae~~~~~a~a~~~---~~~~g~i~-~~e~r~~l~~-~~--~---~~-~~~~---~~~~~~e~~~~~e~e~~~~ 418 (418) +.++.|++..+.+..++. +-..+.++ +++-.+++++ .. + .. ...+ ....+.+.....|.|.++- T Consensus 425 d~~e~a~~~~kl~g~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 425 SLYEQVSILNDLGGQVSQETALSLSGLVENPTEELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKETHTDDFERVYE 502 (502) T ss_pred CHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhcccccccccccccCCCccCCCCcCcCCCCC Confidence 999998887666543321 22344443 3333333321 10 0 00 0111 1112223345556666666 No 108 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=99.54 E-value=1e-14 Score=97.25 Aligned_cols=383 Identities=13% Similarity=0.034 Sum_probs=186.9 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcch--HH---HHHHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDD--EP---AFWSRWDD 75 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d--~~---~i~~~~~~ 75 (418) .=|.+-+.....|-+..-..... ...... ......+++++.||+..+..++.+++.++..++ .+ .+..-|++ T Consensus 10 ~~r~~~l~~yy~g~~~~~~~~~~--~~~~~~-~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~~~~l~~~~~~ 86 (440) T protein:vir:95 10 KQRLAILASYAQGDNFSILSGHR--RLDDEK-ADYRVRHKWGGYISSFATGYVIGNPVSIGVMEGGSADQLSTIKDIEWQ 86 (440) T ss_pred HHHHHHHHHHhccCCcccccccc--cccccC-CcceeecchHHHHHHhhhhheeccCceEeeCCCccHHHHHHHHHHHHh Confidence 11222222223332211000000 000000 000124789999999999999999999865332 22 34455666 Q ss_pred hCchHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeeccccccccccccccccc------------- Q lcl|NC_019404. 76 LEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQNREENPRNAR------------- 141 (418) Q Consensus 76 l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~------------- 141 (418) -++...+.++++.+..||.|++++..+. +.+. +.+++++++.+.+-......+. T Consensus 87 n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~------------i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~~~~ 154 (440) T protein:vir:95 87 NDINALNSDLAFDASVYGRAYEYHFRDKDKVDR------------VVLISPLEMFVIRDLTVEQNIIAAVHLPIYADKVN 154 (440) T ss_pred cCHhHHHHHHHHHHhhcCeEEEEEEecCCCceE------------EEEEcccceEEEEcCCCCCceEEEEEEEEecCceE Confidence 7888999999999999999999987743 2221 2223333322211000000010 Q ss_pred ---cCcceEEEEecCCccc-c-----cccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 142 ---FGKPLTYRITTNESDM-F-----YDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLL 212 (418) Q Consensus 142 ---yg~p~~y~i~~~~~~~-~-----~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~ 212 (418) |..-..|++...+... . ..-|+-..| |.. ...++.+|.|.++. +.+.+.+++.+....+..+ T Consensus 155 ~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v-------Pvv-~~~n~~~g~sd~e~-v~~lida~~~~~s~~~~~~ 225 (440) T protein:vir:95 155 MTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDV-------PVV-EWWNNRFRMGDYES-EISLIDAYDAGQSDTANYM 225 (440) T ss_pred EEEEeCCeEEEEEEecCCccceeecceeeccCcee-------eEE-EeeCCCCCCCchhh-hHHHHHHHHHHHHHHHHHH Confidence 1111111111110000 0 011222211 111 12345679998974 8899999999999999888 Q ss_pred HHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhc-CCcceeEEEcCCCceeEe--ecccCCHHHHHHHHHHHHhhhh Q lcl|NC_019404. 213 RRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNS-GVGQAIGIDAESEEYSVL--NSDIGGIDAFLDKKFDRIVALS 289 (418) Q Consensus 213 ~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~-~~~~~~~~d~~~e~~~~~--~~~~~gl~~~~~~~~~~iaaas 289 (418) ..++...+.+++........+... .++....... .........+++.+.+.+ +.+..+....++.+.+.|...+ T Consensus 226 ~~~~~~~~v~~g~~~~~~~~~e~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s 302 (440) T protein:vir:95 226 SDLNDAMLLVKGDLDGIKLSPEDA---AKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFS 302 (440) T ss_pred HHhhcceeeeecccccCCCCccch---hhhhhccceecccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 888888777776432222111111 1111100000 001111111222334444 4556788999999999999999 Q ss_pred cCCeeeeeccCccccccchhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhhc-----------cCCceEEeCCCCCCC Q lcl|NC_019404. 290 GIHEIILKNKNVGGLSSSQNTALETFHK---LIDRKRNAELLPILEFLIPFIVN-----------AEEWSVEFSPLDHES 355 (418) Q Consensus 290 ~IP~t~L~G~s~~gl~stge~d~~~y~~---~I~~~Qe~~l~p~l~~l~~~i~~-----------~~~~~~~f~pL~~~~ 355 (418) ++|..-+ +.. +| |.||+.-...|.. .++.+ +..++..+.+++.+++. ..++++.|++-...+ T Consensus 303 ~~p~~~~-~~~-~~-n~Sg~Al~~~~~~l~~k~~~k-~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~ 378 (440) T protein:vir:95 303 RIPNLDD-DRF-NS-TSSGIALLYKMIGLEQVRKDK-ETYFTKALRRRYELISNIHKAINGPVIEANKLTFTFHPNIPQD 378 (440) T ss_pred CCccccc-ccc-cc-cchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcCCcccccccceEEeCCCCCCC Confidence 9997443 221 12 3345543323332 33333 34567788887777642 135789999999999 Q ss_pred HHHHHHHHHHHHHHHHH---HHhCCCCCHHHHHHHHHhhcCcCCCChhhcccccccCCCccc Q lcl|NC_019404. 356 SKDKAEVLEKSVNSIAA---LIAAGAMDIKEARDTLRTIAPEIKIGDNDIQTEESELITETE 414 (418) Q Consensus 356 eke~ae~~~~~a~a~~~---~~~~g~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e~e 414 (418) +++.|++..+.+..++. +-..+.+++++..+.+.+-.........+.....+...++.| T Consensus 379 ~~~~ad~~~kl~g~iS~et~~~~l~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 379 VWTEIKAYIEAGGEISQETLMENASFTDYKTEHSRILKQGGSSDLEIGQIVGDADVGQADTE 440 (440) T ss_pred HHHHHHHHHHHhccCcHHHHHHhCCCCCcHHHHHHHHHHHHHhhhhHHhhccCCCCCCcCCC Confidence 99998876655433221 122333332221122211100000000111111111222222 No 109 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=99.53 E-value=1.3e-14 Score=96.64 Aligned_cols=358 Identities=9% Similarity=0.074 Sum_probs=159.7 Q ss_pred hhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHH----HHHHHHH----Hh Q lcl|NC_019404. 5 DSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEP----AFWSRWD----DL 76 (418) Q Consensus 5 D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~----~i~~~~~----~l 76 (418) =|+.+.+....+..............-....|..++.+.++|+.+|+++-+-++.+...++.. .+...+. .. T Consensus 1 MGlf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~~~~~~~~lL~~~PN~~ 80 (395) T protein:vir:98 1 MGILDFFSFKKSGTLSDDDSGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLTENQKDWLYWINTKANPN 80 (395) T ss_pred CcchhhhcCCCcccccccccchhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCcccccchHHHHHhhcCCCC Confidence 133333322211111111111111112234455678899999999999999988874322111 1111121 11 Q ss_pred CchHHHHHH-HHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCc Q lcl|NC_019404. 77 EMTQNINDA-WSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNES 155 (418) Q Consensus 77 ~~~~~~~~a-~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~ 155 (418) -....|.+. ..+..++|.|++++.- ++. +.+.+.+..... ..|+. .+.+...+. T Consensus 81 ~t~~~f~~~~~~~lll~Gnayi~~~~-~~~--------------~~~~~~~~~~~~------~~~~~----~~~~~~~~~ 135 (395) T protein:vir:98 81 QSASQFWVEVIQKLLVDGETLIFVIP-GKG--------------IYVADSFTQDKK------ISGSQ----FKVSRVQGQ 135 (395) T ss_pred CCHHHHHHHHHHHHhhcCceEEEEEe-CCc--------------eecCCccccccc------ccCcc----cceeeecCc Confidence 222444444 4445568999988643 321 011111111100 00110 111221111 Q ss_pred ccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHH--HHHHcCCceeecchHHHhhcCcc Q lcl|NC_019404. 156 DMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQ--LLRRKQQAVWKAKGLAELCDDSE 233 (418) Q Consensus 156 ~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~--l~~~~~~~v~k~~~l~~~~~~~~ 233 (418) .....+-++.|+||..... ....++.+++ ...-+.+............ .+................ .... T Consensus 136 ~~~~~~~~~evih~k~~~~------~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 207 (395) T protein:vir:98 136 TYEKTFTFDQVIYLKNDNS------DLMSKVESLW-EEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQENS-DGGR 207 (395) T ss_pred eeeeEecCccEEEecCCCC------Cccccccchh-hhHHHHHHHHHHHHHHHHHHHHhhccccccccccccccC-CcHH Confidence 1123456678999863321 1112232222 2122222222222111111 111111111111100000 0111 Q ss_pred hHHHHHHHHHHHHHh-cCCcceeEEEcCCCceeEeecccC--------CHHHHHHHHHHHHhhhhcCCeeeeeccCcccc Q lcl|NC_019404. 234 GFGAARLRLAQVDNN-SGVGQAIGIDAESEEYSVLNSDIG--------GIDAFLDKKFDRIVALSGIHEIILKNKNVGGL 304 (418) Q Consensus 234 ~~~~~~~r~~~~~~~-~~~~~~~~~d~~~e~~~~~~~~~~--------gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl 304 (418) ......+.+...-.. ......+++...+-+|++++.... .+-++..+....||.+.|||..+|.| .. T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~~----~~ 283 (395) T protein:vir:98 208 QSKSDKDFFKRTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHG----DI 283 (395) T ss_pred HHHHHHHHHHHHHhhhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcC----Cc Confidence 111111222221111 122333443444567877765432 23445566678899999999987732 22 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----h----ccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019404. 305 SSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----V----NAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAA 376 (418) Q Consensus 305 ~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~----~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~ 376 (418) ++-+.....||. ..|.|++..+-..+ + +..++.|+|+.|...|.+++ +++++++++. T Consensus 284 -sn~e~~~~~f~~-------~tl~P~~~~ie~~l~~kll~~~~~~~g~~f~~~~l~~~d~~~~-------~~~~~~~~~~ 348 (395) T protein:vir:98 284 -ADNQKNYELLLE-------GPIESLITNIVDGLEYAIFDKSETLQGSFIKVTGLKNYDLFSI-------SNQADKLISS 348 (395) T ss_pred -ccHHHHHHHHHH-------HHHHHHHHHHHHHHHHhcCChhhhcCcceeeehhhhccCHHHH-------HHHHHHHHhC Confidence 233444555654 34788776665443 1 23467899999988887665 6778889999 Q ss_pred CCCCHHHHHHHHHhhcCcCC-CCh--------hhcccccccCCCcccc Q lcl|NC_019404. 377 GAMDIKEARDTLRTIAPEIK-IGD--------NDIQTEESELITETEV 415 (418) Q Consensus 377 g~i~~~e~r~~l~~~~~~~~-~~~--------~~~~~~e~~~~~e~e~ 415 (418) |++|++|+|+.+. ..|-.+ ..| ..+++...+..++++- T Consensus 349 G~~T~NE~R~~~g-~~Pi~~~~gD~~~~~~n~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:98 349 GFVFIDEVREEIG-LPELPDGLGKVLYMTKNYESVLERGGEVDEEVET 395 (395) T ss_pred CCcCHHHHHHHhC-CCCCCCCCCceeeecccceecccccCCCCCCCCC Confidence 9999999998762 222111 111 1122122222223333 No 110 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.52 E-value=1.1e-13 Score=91.71 Aligned_cols=382 Identities=10% Similarity=0.055 Sum_probs=189.5 Q ss_pred Cc-------------------------cchhhHHHHhcCCCCccccCc-cccCCHHHHHH-HHHcCCccchhhhcchhhh Q lcl|NC_019404. 1 MV-------------------------KTDSYANIFLGGSDGSEIYGS-LQNQAPTILAS-LYADNALVRRIIDTIPETA 53 (418) Q Consensus 1 ~~-------------------------~~D~~~n~~~g~~~~~~~~~~-~~~~~~~~l~~-~Y~~~~~~r~iVd~~a~d~ 53 (418) +. |.+-+..-..|-+.--..... ........... .-..+++++.||+..+..+ T Consensus 35 ~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl 114 (492) T protein:vir:94 35 IVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYI 114 (492) T ss_pred ccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHhhh Confidence 11 111111112222110000000 00000000000 0013799999999999999 Q ss_pred ccCCccccCcchHH-HHHHHHHHhCchHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeeccccccc Q lcl|NC_019404. 54 LAAGFHIDGIDDEP-AFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQ 131 (418) Q Consensus 54 ~r~~~~i~~~~d~~-~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~ 131 (418) +.+++.++++++.. ...+.|.+-+......++.+.+..||.|++++..+. +.. .++++++.++-+. T Consensus 115 ~G~p~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~------------~~~~~~p~~~~~v 182 (492) T protein:vir:94 115 VGKPIAFKHTDDEVVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF------------KLFRVPAEQGIPI 182 (492) T ss_pred cccCceeccCchHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEEEecCCCce------------EEEEEcccceEEE Confidence 99999998765431 223334444677888999999999999999887642 221 1333343333222 Q ss_pred ccccccccccc------------------CcceEEEEecCCc-----ccccccCcccEEEecCccchhhhhhccccCCcc Q lcl|NC_019404. 132 NREENPRNARF------------------GKPLTYRITTNES-----DMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRS 188 (418) Q Consensus 132 ~~~~dp~s~~y------------------g~p~~y~i~~~~~-----~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S 188 (418) +-+.....+.+ .....|.+..+.. ......+....-+.-| .+|. ....++.+|.| T Consensus 183 ~d~~~~~~~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~vPv-v~~~nn~~~~s 260 (492) T protein:vir:94 183 WTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWG-KIPF-IPFKNNDLEIS 260 (492) T ss_pred EcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCC-ccce-EEecCCCCCCC Confidence 10000000101 1111222211100 0000011111111101 1121 11223557899 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeE-- Q lcl|NC_019404. 189 VLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSV-- 266 (418) Q Consensus 189 ~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~-- 266 (418) .++ .+.+.+.+++.+....+..+..++...+...+... .........+ ...+.+.++ ++.+.+. T Consensus 261 d~e-~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~-----~~~~~~~~~~-------~~~~~~~~~-~~~~~~~l~ 326 (492) T protein:vir:94 261 DIF-MYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDD-----QELPEFKRLL-------RYYGAIKVS-DNGGVDTIQ 326 (492) T ss_pred chH-HHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc-----ccchhhHHHH-------hhccceecC-CCCcceeEe Confidence 887 48899999999999988888888888887775421 1111111111 112233333 3344444 Q ss_pred eecccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhcc-- Q lcl|NC_019404. 267 LNSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETF---HKLIDRKRNAELLPILEFLIPFIVNA-- 341 (418) Q Consensus 267 ~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y---~~~I~~~Qe~~l~p~l~~l~~~i~~~-- 341 (418) .+.+.+++...++.+.+.|...+.+|-.-. + .-+| |.||+.-.-.| ...++. ++..++..+++++++++.- T Consensus 327 ~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~-~~~~-n~Sg~Al~~~~~~l~~k~~~-k~~~f~~~l~~~~~li~~~~~ 402 (492) T protein:vir:94 327 VEVPVENSKKYLDELYQKIMLFGQAVDFSS-D-KFGS-APSGVALEFLYTNLNLKADK-LARKAKVAIQELLWFVFEHFD 402 (492) T ss_pred ccCCHHHHHHHHHHHHHHHHHHhCCcCCCc-c-cccc-CchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhc Confidence 556667899999999999999999996433 1 2222 34555432222 223333 3445778888888777531 Q ss_pred -----CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCC-CHHHHHHHHHh--------hcCcCCCChhhccc Q lcl|NC_019404. 342 -----EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAM-DIKEARDTLRT--------IAPEIKIGDNDIQT 404 (418) Q Consensus 342 -----~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i-~~~e~r~~l~~--------~~~~~~~~~~~~~~ 404 (418) .++.+.|++-...++++.+++..+.+..++. +-..+.+ ++++..+.+.+ .....+-..++-++ T Consensus 403 ~~~~~~~i~v~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~ 482 (492) T protein:vir:94 403 IKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADSAQQ 482 (492) T ss_pred CCcccceeeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCcc Confidence 3688999999999999998876665433222 2233333 33333333211 11111111111222 Q ss_pred ccccCCCccc Q lcl|NC_019404. 405 EESELITETE 414 (418) Q Consensus 405 ~e~~~~~e~e 414 (418) ++.+...|+| T Consensus 483 ~~~~~~~e~e 492 (492) T protein:vir:94 483 QERSNNKESE 492 (492) T ss_pred ccCCccccCC Confidence 3333344445 No 111 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.52 E-value=7.3e-15 Score=98.10 Aligned_cols=394 Identities=15% Similarity=0.114 Sum_probs=182.5 Q ss_pred Cccc---hhhHHH--------------HhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCc Q lcl|NC_019404. 1 MVKT---DSYANI--------------FLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGI 63 (418) Q Consensus 1 ~~~~---D~~~n~--------------~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~ 63 (418) .+.. |.|.+. ..|...- .......+.++......+.+++.|||..++-+.-+||.+.++ T Consensus 12 ~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i----~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~ 87 (485) T protein:vir:24 12 ADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRP----EAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQAVEGFRLGDA 87 (485) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHhccCch----hhcCcccchhhhhhhhccchHHHHHHHHhhhhccCceecCCC Confidence 1111 112111 1121110 001111234445555667899999999999998889987654 Q ss_pred chH-HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccc---------- Q lcl|NC_019404. 64 DDE-PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQN---------- 132 (418) Q Consensus 64 ~d~-~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~---------- 132 (418) +.. +.+.+.|++-++.....++++.+.+||.|++++..+.+... ..+..+.. .|+++++.++.+.+ T Consensus 88 ~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~--~~~~~~~~-~i~~~~p~~~~~i~D~~~~~~~~~ 164 (485) T protein:vir:24 88 DEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQID--LGWDPNVP-LIRVEPPTRMYAEIDPRIGRPAKA 164 (485) T ss_pred chhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccc--cccCCCcc-eEEEeccceeEEEeeCCcCceeEE Confidence 333 45677777777888899999999999999999876432111 11111111 12333333322111 Q ss_pred ---ccc-ccc---ccc-cCcceEEEEecCCccc---ccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHH Q lcl|NC_019404. 133 ---REE-NPR---NAR-FGKPLTYRITTNESDM---FYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDY 201 (418) Q Consensus 133 ---~~~-dp~---s~~-yg~p~~y~i~~~~~~~---~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~ 201 (418) +.. +-. .-. |-.=..|++...++.. ...-|+-..+.+.. .+.. ......||.|.++..+.+.+.++ T Consensus 165 ~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~--f~n~-~~~~~~~G~s~i~~~v~~liDa~ 241 (485) T protein:vir:24 165 IRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVP--LPNR-TRLSDLYGTSEITPELRSMTDAA 241 (485) T ss_pred EEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEE--eccC-cccCCcCCcccchhhHHHHHHHH Confidence 000 000 000 1111122222211111 01124332221110 0111 11233589999876677778888 Q ss_pred HHHHHHHHHHHHHcCCceeecchHHHh-hcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeeccc---CCHHHH Q lcl|NC_019404. 202 TNCERLATQLLRRKQQAVWKAKGLAEL-CDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDI---GGIDAF 277 (418) Q Consensus 202 ~~~~~~~~~l~~~~~~~v~k~~~l~~~-~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~---~gl~~~ 277 (418) ++++......+..++.+...+.+.... .....+.. ... .+...+.+.... +++....+.+- ....+. T Consensus 242 ~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~--~~~------~~~~~~~i~~~~-~~~~~~~q~~~~~~e~~~~~ 312 (485) T protein:vir:24 242 ARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETG--QTL------FDAYLARILAFE-DAEGKIQQFSAAELANFTNA 312 (485) T ss_pred HHHHHHHHHHHHhhcchhhhhccCCccccccccccc--cch------hhhcccceeccC-CCCceEEeecccchHHHHHH Confidence 888877777776666655544432210 00000000 000 111122233322 23334334333 445566 Q ss_pred HHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhcc----------CCc Q lcl|NC_019404. 278 LDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFH---KLIDRKRNAELLPILEFLIPFIVNA----------EEW 344 (418) Q Consensus 278 ~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~---~~I~~~Qe~~l~p~l~~l~~~i~~~----------~~~ 344 (418) ++....++|+.+++|...|.|.+.+ ++||+.-.-.+. ..++.+ +..+.+.|++++.+++.- .++ T Consensus 313 l~~~i~~~s~~~~~p~~~fg~~~~n--~~Sg~Al~~~~~~l~~ka~~~-~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i 389 (485) T protein:vir:24 313 LDQIAKQVAAYTGLPPQYLSTAADN--PASAEAIRAAESRLIKKVERK-NAIFGGAWEEAMRLAYRLMKGGDVPPDMLRM 389 (485) T ss_pred HHHHHHHHhcccCCCHHHhccccCc--chHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhcCCCCcccccee Confidence 7777888999999998777544321 245554332222 233333 345678888888876431 267 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHHHHH-------HHHhCCCCCHHHHHH--HHH-hhcC-------cCCCChhhcccccc Q lcl|NC_019404. 345 SVEFSPLDHESSKDKAEVLEKSVNSIA-------ALIAAGAMDIKEARD--TLR-TIAP-------EIKIGDNDIQTEES 407 (418) Q Consensus 345 ~~~f~pL~~~~eke~ae~~~~~a~a~~-------~~~~~g~i~~~e~r~--~l~-~~~~-------~~~~~~~~~~~~e~ 407 (418) .+.|.+-..++..+.|+...+.+++.. .+-..|+. ++++.+ .+. +... ...-....-+..++ T Consensus 390 ~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~-~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~ 468 (485) T protein:vir:24 390 ETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYS-IAEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPN 468 (485) T ss_pred eEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCC-HhHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCC Confidence 889998888999888776655544311 12223433 222211 111 0000 00000000000000 Q ss_pred cCCCccccccC Q lcl|NC_019404. 408 ELITETEVVIA 418 (418) Q Consensus 408 ~~~~e~e~~~~ 418 (418) . -++.+..-| T Consensus 469 ~-~e~~~~~~~ 478 (485) T protein:vir:24 469 P-TPAPKPQPA 478 (485) T ss_pred C-CCCCCCccC Confidence 0 011111111 No 112 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.51 E-value=1.8e-13 Score=90.46 Aligned_cols=381 Identities=10% Similarity=0.061 Sum_probs=190.5 Q ss_pred CccchhhHHHHhcCCCCccccCccccCC--HHHHHH-HHHcCCccchhhhcchhhhccCCccccCcchH-HHHHHHHHHh Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQA--PTILAS-LYADNALVRRIIDTIPETALAAGFHIDGIDDE-PAFWSRWDDL 76 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~--~~~l~~-~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-~~i~~~~~~l 76 (418) .-|.+-+..-..|-+.--.. ....... ...... .-..+.+++.||+..+..++.+++.++++++. ....+.|.+- T Consensus 60 ~~r~~~l~~YY~g~~~i~~~-~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~~~n 138 (492) T protein:vir:97 60 LPEISIGQEYYEQRPDIVKE-PKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLGN 138 (492) T ss_pred HHHHHHHHHHhcccCccccc-cccccccccccccccccccccchHHHHHHHHhhhhcccCceeccCchHHHHHHHHHHhc Confidence 11111111212222110000 0000000 000000 00137999999999999999999999876553 2233344444 Q ss_pred CchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccccccccccc-------------- Q lcl|NC_019404. 77 EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARF-------------- 142 (418) Q Consensus 77 ~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~y-------------- 142 (418) +......++.+.+..||.|++++..+.+. .+ .+.++++.++-+.+-+.....+.+ T Consensus 139 ~~~~~~~~~~~~~~~~G~a~~~v~~d~dg----------~~-~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~ 207 (492) T protein:vir:97 139 RFDDKLHSVLTGASNKGIEWLHPYLDEEG----------EF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 207 (492) T ss_pred cHHHHHHHHHHHHhhcCeEEEEEEecCCC----------ce-EEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEE Confidence 67788889999999999999988764221 11 233344433322211000000111 Q ss_pred ----CcceEEEEecCCcc-------cccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 143 ----GKPLTYRITTNESD-------MFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQL 211 (418) Q Consensus 143 ----g~p~~y~i~~~~~~-------~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l 211 (418) +....|.+.++... ....+|. .-+.-| .+|. ....++.+|.|.++ .+.+.+.+++.+....+.. T Consensus 208 ~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g-~vPv-v~~~nn~~g~sd~e-~v~~liDa~d~~~S~~~~~ 282 (492) T protein:vir:97 208 YWDKVTVNYYVYENGSLIPDYSNNLENSKTHF--STGSWG-KIPF-IPFKNNDLEISDIF-MYKTLIDAYNRRLSDLSNT 282 (492) T ss_pred EEecCeEEEEEEecCeeeeccccccccccccc--ccCCCC-Ccce-EEecCCCCCCCchH-hHHHHHHHHHHHHHHHHHH Confidence 11112222211000 0001110 000000 0111 12233557899887 4889999999999999988 Q ss_pred HHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEe--ecccCCHHHHHHHHHHHHhhhh Q lcl|NC_019404. 212 LRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVL--NSDIGGIDAFLDKKFDRIVALS 289 (418) Q Consensus 212 ~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~--~~~~~gl~~~~~~~~~~iaaas 289 (418) +..++...+.+.+... .........+ ...+.+.+ .++.+.+.+ +.+.++....++.+.+.|...+ T Consensus 283 ~~~~~~~~l~~~g~~~-----~~~~~~~~~~-------~~~~~~~~-~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s 349 (492) T protein:vir:97 283 FKDSNELTYVLKNYDD-----QELPEFKRLL-------RYYGAIKV-SDNGGVDTIQVEVPVENSKKYLDELYQKIMLFG 349 (492) T ss_pred HHHhccceeeeecCCc-----ccchhHHHHH-------hhccceec-CCCCcceeEeccCCHHHHHHHHHHHHHHHHHHh Confidence 8888888888775321 1111111111 11222333 334455554 4566789999999999999999 Q ss_pred cCCeeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhcc-------CCceEEeCCCCCCCHHHHH Q lcl|NC_019404. 290 GIHEIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEFLIPFIVNA-------EEWSVEFSPLDHESSKDKA 360 (418) Q Consensus 290 ~IP~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~l~~~i~~~-------~~~~~~f~pL~~~~eke~a 360 (418) ++|-.-+ ..-+| |.||+.-.-.|..... ...+..++..+++++++++.. .++.+.|++-...++++.+ T Consensus 350 ~~p~~~~--~~~~~-n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~a 426 (492) T protein:vir:97 350 QAVDFSS--DKFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQV 426 (492) T ss_pred CCCCCCc--ccccc-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeEEecCCCCCCHHHHH Confidence 9996433 12222 4456543333333222 333445778888888877542 3678999999999999998 Q ss_pred HHHHHHHHHHHH---HHhCCCC-CHHHHHHHHHh--------hcCcCCCChhhcccccccCCCccc Q lcl|NC_019404. 361 EVLEKSVNSIAA---LIAAGAM-DIKEARDTLRT--------IAPEIKIGDNDIQTEESELITETE 414 (418) Q Consensus 361 e~~~~~a~a~~~---~~~~g~i-~~~e~r~~l~~--------~~~~~~~~~~~~~~~e~~~~~e~e 414 (418) ++..+.+..++. +-..+.+ ++++..+++++ .....+...++-++.+.+...++| T Consensus 427 ~~~~kl~G~iS~et~l~~l~~v~d~~~Eleri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 492 (492) T protein:vir:97 427 QTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQTEYNKQLPNLDDGGADSAQQQERSNNKESE 492 (492) T ss_pred HHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCcccccccccccC Confidence 876665543222 2233333 23333333311 111111122222333333333444 No 113 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.51 E-value=1.6e-13 Score=90.71 Aligned_cols=382 Identities=11% Similarity=0.069 Sum_probs=189.2 Q ss_pred CccchhhHHHHhcCCCC-ccccCccccCCHHHHHH-HHHcCCccchhhhcchhhhccCCccccCcchHH-HHHHHHHHhC Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDG-SEIYGSLQNQAPTILAS-LYADNALVRRIIDTIPETALAAGFHIDGIDDEP-AFWSRWDDLE 77 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~-~~~~~~~~~~~~~~l~~-~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~-~i~~~~~~l~ 77 (418) +-+.+-+..-..|-+.- .+..-............ .-..+++++.||+..+..++.+++.++++++.. ...+.|.+-+ T Consensus 51 ~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~~d~~~~~~l~~~~~n~ 130 (483) T protein:vir:12 51 LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLGNR 130 (483) T ss_pred HHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhhhcccCceeccCChHHHHHHHHHHhcc Confidence 11222222222232110 00000000000000000 001369999999999999999999998765531 2233444446 Q ss_pred chHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccc---ccccc------------ccc Q lcl|NC_019404. 78 MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNRE---ENPRN------------ARF 142 (418) Q Consensus 78 ~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~---~dp~s------------~~y 142 (418) ......++.+.+..||.|++++..+.+. .+ .+.++++.++-+.+-+ ..|.. -.+ T Consensus 131 ~~~~~~~~~~~~~~~G~~y~~v~~d~d~----------~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~ 199 (483) T protein:vir:12 131 FDDKLHSVLTGASNKGIEWLHPYLDEEG----------EF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEY 199 (483) T ss_pred HHHHHHHHHHHHhhCCeEEEEEEEcCCC----------ce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeecceEEEE Confidence 7788889999999999999988774321 11 1233333333222100 00000 000 Q ss_pred ---CcceEEEEecCCc------c-cccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 143 ---GKPLTYRITTNES------D-MFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLL 212 (418) Q Consensus 143 ---g~p~~y~i~~~~~------~-~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~ 212 (418) +....|.+..+.. . ....+|. .-+.-| .+|. ....++.+|.|.++ .+.+.+.+++.+....+..+ T Consensus 200 y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g-~vPv-v~~~nn~~g~sd~e-~v~~liDa~d~~~S~~~~~~ 274 (483) T protein:vir:12 200 WDKVTVNYYVYENGSLIPDYSNNLENSKTHF--STGSWG-KIPF-IPFKNNDLEISDIF-MYKTLIDAYNRRLSDLSNTF 274 (483) T ss_pred EecCeEEEEEEeCCeeeeccccccccccccc--ccCCCC-ccce-EEecCCCCCCCchh-hHHHHHHHHHHHHHHHHHHH Confidence 1111221111100 0 0001110 000000 1111 11223457889887 58899999999988888888 Q ss_pred HHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeE--eecccCCHHHHHHHHHHHHhhhhc Q lcl|NC_019404. 213 RRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSV--LNSDIGGIDAFLDKKFDRIVALSG 290 (418) Q Consensus 213 ~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~--~~~~~~gl~~~~~~~~~~iaaas~ 290 (418) ..++...+.+.+.. ..........+ ...+.+.+. ++.+.+. .+.+.++....++.+.+.|...++ T Consensus 275 ~~~~~~~lv~~g~~-----~~~~~~~~~~~-------~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~ 341 (483) T protein:vir:12 275 KDSNELTYVLTNYD-----DQELPEFKRLL-------RYYGAIKVS-DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQ 341 (483) T ss_pred HHhcCceeeeecCC-----cccchhHHHhh-------hhccccccC-CCCcceEEeecCCHHHHHHHHHHHHHHHHHHhC Confidence 88887777766532 11111111111 112222222 3344544 455667899999999999999999 Q ss_pred CCeeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhcc-------CCceEEeCCCCCCCHHHHHH Q lcl|NC_019404. 291 IHEIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEFLIPFIVNA-------EEWSVEFSPLDHESSKDKAE 361 (418) Q Consensus 291 IP~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~l~~~i~~~-------~~~~~~f~pL~~~~eke~ae 361 (418) +|-.-+ + .-+| |.||+.-.-.|...+. ..++..++..+++++++++.. .++++.|++-...+.++.|+ T Consensus 342 ~p~~~~-~-~~~~-n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~ 418 (483) T protein:vir:12 342 AVDFSS-D-KFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQ 418 (483) T ss_pred CCCCCc-c-cccc-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccceeeEEeCCCCCCCHHHHHH Confidence 997433 2 2222 4456543333333322 334455788888888877532 36789999999999999988 Q ss_pred HHHHHHHHHHH---HHhCCCC-CHHHHHHHHHh--------hcCcCCCChhhcccccccCCCccc Q lcl|NC_019404. 362 VLEKSVNSIAA---LIAAGAM-DIKEARDTLRT--------IAPEIKIGDNDIQTEESELITETE 414 (418) Q Consensus 362 ~~~~~a~a~~~---~~~~g~i-~~~e~r~~l~~--------~~~~~~~~~~~~~~~e~~~~~e~e 414 (418) +..+.+..++. +-..+.+ ++++..+++++ ..+..+...+.-..++...+.|+| T Consensus 419 ~~~kl~GiiS~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 419 TAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKESE 483 (483) T ss_pred HHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccccccCCcccCCCCCcccCC Confidence 76665443222 1223333 23333332211 111111111222333444455555 No 114 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.51 E-value=8e-14 Score=92.39 Aligned_cols=381 Identities=12% Similarity=0.067 Sum_probs=189.3 Q ss_pred CccchhhHHHHhcCCCCcccc--CccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHH--HHHHHHHHh Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIY--GSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEP--AFWSRWDDL 76 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~--~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~--~i~~~~~~l 76 (418) +-+.+-+.+-+.|-+..-... .................+++++.||+..+..++.+++.++++++.. .+.+ +.+- T Consensus 42 ~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~-~~~n 120 (478) T protein:vir:10 42 IDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQH-TLNH 120 (478) T ss_pred HHHHHHHHHHhcCCCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHH-HHhc Confidence 222222222233322110000 0000000000000112468999999999999999999998766542 2333 3345 Q ss_pred CchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcc Q lcl|NC_019404. 77 EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESD 156 (418) Q Consensus 77 ~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~ 156 (418) ++...+.++.+.+..||.|++++..+.+. .+ .+.++++.++.|.+-+.....+.++ ...|...+... T Consensus 121 ~~~~~~~~~~~~~~~~G~~~~~~~~d~~g----------~~-~~~~~~p~~~~~i~d~~~~~~~~~~-v~~~~~~~~~~- 187 (478) T protein:vir:10 121 KWDDKLVDILTAASNKGIEWVQPYVDEEG----------EF-KTFRVPAEQAVPIWTNKERDELQAF-IRVYELDGAER- 187 (478) T ss_pred CHHHHHHHHHHHHHhcCeEEEEEEecCCC----------ee-EEEEEcccceEEEEcCCCCCceEEE-EEEEEecCceE- Confidence 78889999999999999999988764221 12 1333344433332211111111111 11111111000 Q ss_pred ccccc-CcccEEEec------------------------Cc-----cchhhhhhccccCCcchHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 157 MFYDV-HYSRIHIID------------------------GE-----RVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCER 206 (418) Q Consensus 157 ~~~~i-H~SR~i~~~------------------------g~-----~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~ 206 (418) .++ .+.++.++. .. .+|. ....++.+|.|.++. +.+.+.+++.+.. T Consensus 188 --~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv-v~~~n~~~g~sd~~~-v~~liDa~~~~~S 263 (478) T protein:vir:10 188 --VEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPF-IPFKNNPQEVSDLFM-YKTIIDALDKRLS 263 (478) T ss_pred --EEEEeCCeEEEEEEcCCeeeccccccccccccceecccccccCCccce-EEeccCCCCCCcHHH-HHHHHHHHHHHHH Confidence 000 011111110 00 1111 122356689998874 8899999999999 Q ss_pred HHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC-CCcee--EeecccCCHHHHHHHHHH Q lcl|NC_019404. 207 LATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE-SEEYS--VLNSDIGGIDAFLDKKFD 283 (418) Q Consensus 207 ~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~-~e~~~--~~~~~~~gl~~~~~~~~~ 283 (418) ..+..+..++..++.+.+.. +.......... .....+.+.++ +.+.+ ..+.+..+....++.+.+ T Consensus 264 ~~~~~~~~~~~p~~~~~g~~-----~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 331 (478) T protein:vir:10 264 DTQNTFDESVELIYILKGYE-----GEDMKDFMHNL-------KYYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRD 331 (478) T ss_pred HHHHHHHHhhCceeeeecCC-----ccccchhhhhh-------hhcceEEecCCCCCcceEEeecCChHHHHHHHHHHHH Confidence 99988888888887776542 11111111111 11233444322 23444 445567789999999999 Q ss_pred HHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhc-------cCCceEEeCCCCCC Q lcl|NC_019404. 284 RIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEFLIPFIVN-------AEEWSVEFSPLDHE 354 (418) Q Consensus 284 ~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~l~~~i~~-------~~~~~~~f~pL~~~ 354 (418) .|...+++|-.-+.+ -+| |.||..-...|..... ..++..+...+.+++.+++. ..++++.|++-... T Consensus 332 ~i~~~s~~p~~~~~~--~~~-n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~~~~~~~i~i~f~~~~p~ 408 (478) T protein:vir:10 332 YIIEFGQGVDFQQDK--FGN-SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVKVQDIEITFNFNVMV 408 (478) T ss_pred HHHHHhCccccCccc--ccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEecCCCCC Confidence 999999999643311 123 4566654444443322 33345577888888777753 23688999999999 Q ss_pred CHHHHHHHHHHHHHHHHH---HHhCCCC-CHHHHHHHHHhhcCc-----CCCChhhccccccc-CCCccc Q lcl|NC_019404. 355 SSKDKAEVLEKSVNSIAA---LIAAGAM-DIKEARDTLRTIAPE-----IKIGDNDIQTEESE-LITETE 414 (418) Q Consensus 355 ~eke~ae~~~~~a~a~~~---~~~~g~i-~~~e~r~~l~~~~~~-----~~~~~~~~~~~e~~-~~~e~e 414 (418) ++++.|++..+.+..++. +-..|.+ ++++..+++++-... ..+..+...+.+++ ..+..| T Consensus 409 d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 409 NELENSQIAMNSTGLLSKETILSNHAWVEDPVAEMERIEQENIELNQQLPDIEEGLNGEQQRQSENNQPE 478 (478) T ss_pred CHHHHHHHHHHHhCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCCCCCCCCC Confidence 999988876554332211 1222322 233333333211100 00000000001111 111112 No 115 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.51 E-value=1.1e-14 Score=97.14 Aligned_cols=389 Identities=15% Similarity=0.142 Sum_probs=182.0 Q ss_pred Cc-------cchhhHHH--------------HhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcc Q lcl|NC_019404. 1 MV-------KTDSYANI--------------FLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFH 59 (418) Q Consensus 1 ~~-------~~D~~~n~--------------~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~ 59 (418) +. ..+.+.+. ..|-..- ... ..-.+.++........++++||+..++.+.=+||. T Consensus 8 ~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i-~~~---~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~ 83 (485) T protein:vir:10 8 QEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRP-EAI---GVTVPIQMQSLLAHVGYPRLYVDSIAERQAVEGFR 83 (485) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc-hhc---CCCCChhhhhhhhhcCcHHHHHHHHHhhhccccee Confidence 11 11111111 1121110 000 11123344444455689999999999999878888 Q ss_pred ccCcchH-HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccc----- Q lcl|NC_019404. 60 IDGIDDE-PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNR----- 133 (418) Q Consensus 60 i~~~~d~-~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~----- 133 (418) +.++++. +.+.+.|.+-++.....++++.+.+||.|++++..+.+.... ...++. -.|+++++.++.+.+- T Consensus 84 ~~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~--~~~~~~-~~i~~~~p~~~~~~~D~~~~~ 160 (485) T protein:vir:10 84 FGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDL--GWDPNT-PIIRVEPPTRMYAEIDPRIGR 160 (485) T ss_pred cCCCchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCccccc--ccCCCe-eEEEEEccceeEEEEcCCCCc Confidence 7554433 456777777788889999999999999999988764322111 111111 1234444433322110 Q ss_pred -------ccccc-----ccccCcc-eEEEEecCCccc---ccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHH Q lcl|NC_019404. 134 -------EENPR-----NARFGKP-LTYRITTNESDM---FYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDS 197 (418) Q Consensus 134 -------~~dp~-----s~~yg~p-~~y~i~~~~~~~---~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~ 197 (418) ..+.. ...++.+ ..|++...++.. ...-|+-..+.+. +.++.. .....||.|.++..+.+. T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv--~~~n~~-~~~~~~G~s~i~~~v~~l 237 (485) T protein:vir:10 161 VSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVV--PIPNRT-RLSDLYGTSEITPELRSM 237 (485) T ss_pred eeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccEE--Eecccc-ccCCCCCccchhHHHHHH Confidence 00000 0001111 112221111110 0011332211111 001111 123468999887667677 Q ss_pred HHHHHHHHHHHHHHHHHcCCceeecchHHHh-hcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeec---ccCC Q lcl|NC_019404. 198 IKDYTNCERLATQLLRRKQQAVWKAKGLAEL-CDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNS---DIGG 273 (418) Q Consensus 198 l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~-~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~---~~~g 273 (418) +.++++++......+..++.+...+.+...- .....+..... .+...+.+.... +++....+. ++.+ T Consensus 238 iDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~--------~~~~~~~i~~~~-~~d~k~~q~~~~~~~~ 308 (485) T protein:vir:10 238 TDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTL--------FDAYLARILAFE-DAEGKIQQFSAAELAN 308 (485) T ss_pred HHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccccccchh--------hhhcccceeccC-CCCceEEeecccchHH Confidence 7888888777666666666655444432110 00000000101 111122233322 233333333 3445 Q ss_pred HHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhhcc--------- Q lcl|NC_019404. 274 IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALE---TFHKLIDRKRNAELLPILEFLIPFIVNA--------- 341 (418) Q Consensus 274 l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~---~y~~~I~~~Qe~~l~p~l~~l~~~i~~~--------- 341 (418) ..+.++....++|+.+++|...|.|.+. . ++||+.-.. .....++.+| ..+.+.|.++..+++.- T Consensus 309 ~~~~l~~~i~~~~~~~~~p~~~fg~~~~-n-~~Sg~Al~~~~~~l~~k~~~k~-~~f~~~l~~~~~l~~~~~~~~~~~~~ 385 (485) T protein:vir:10 309 FTNALDQIAKQVAAYTGLPPQYLSTAAD-N-PASAEAIRAAESRLIKKVERKN-SIFGGAWEEAMRLAYRMMKGGDVPPD 385 (485) T ss_pred HHHHHHHHHHHHhcccCCCHHHhccccC-c-hhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCCCCccc Confidence 6667777889999999999988755432 2 245554333 3333444443 34678888887776421 Q ss_pred -CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCC--CCCHHHHHHHH----------Hh----h------------c Q lcl|NC_019404. 342 -EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAG--AMDIKEARDTL----------RT----I------------A 392 (418) Q Consensus 342 -~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g--~i~~~e~r~~l----------~~----~------------~ 392 (418) .++.+.|.+-...+..+.|+... +++++| +++.+.+++.| +. . . T Consensus 386 ~~~i~v~w~~~~~~~~~~~ada~~-------kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~ 458 (485) T protein:vir:10 386 MLRMETVWRDPSTPTYAAKADAAS-------KLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTMVD 458 (485) T ss_pred ceeeeEEecCCCCCCHHHHHHHHH-------HHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 25678999888899888766544 444433 44444333221 00 0 0 Q ss_pred CcCCCChhhcccccccCCC-ccccccC Q lcl|NC_019404. 393 PEIKIGDNDIQTEESELIT-ETEVVIA 418 (418) Q Consensus 393 ~~~~~~~~~~~~~e~~~~~-e~e~~~~ 418 (418) +..+.+++.=++.+.++.+ ++.+-=| T Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 459 PNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred cCCCCCCCCCccccccCcCCCCCCCCC Confidence 1111111111111111111 1122222 No 116 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.51 E-value=8.1e-14 Score=92.35 Aligned_cols=390 Identities=12% Similarity=0.069 Sum_probs=191.2 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH------HHHHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE------PAFWSRWD 74 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~------~~i~~~~~ 74 (418) .-|.+-+..-..|-...-..+. .............+++++.||+..+..++.+++.++++++. +.+.+.|+ T Consensus 56 ~~r~~~l~~yY~g~~~~i~~~~---~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~ 132 (501) T protein:vir:27 56 APRIQELLDYARGENHDVLQFG---RRKDREMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDNNSQNDDTIKRIGR 132 (501) T ss_pred HHHHHHHHHHhcCCCccccccC---ccCccccccceeccchHHHHHHHHhhhhcccCeeEecCCccchHHHHHHHHHHHH Confidence 0011111111222111000000 00000000112347999999999999999999999875432 23445566 Q ss_pred HhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCC Q lcl|NC_019404. 75 DLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNE 154 (418) Q Consensus 75 ~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~ 154 (418) +.++...+.++.+.+..||.|++++..+.+.. + .+.++++.++.+.+-+.....+.++ ..+|...... T Consensus 133 ~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~----------~-~i~~~~p~~~~~v~d~~~~~~~~~~-ir~~~~~~~~ 200 (501) T protein:vir:27 133 INDIDSHNRTLIRDLSQTGRAYEVIYRNEYDE----------T-RIKRLNPLETFVIYDNSLEDNSIAA-VRYYNRGTLQ 200 (501) T ss_pred hcChhHHHHHHHHHHhhCCeEEEEEEeCCCCc----------e-EEEEEccceeEEEecCCCCCceEEE-EEEEEeeecC Confidence 67899999999999999999999887753211 1 1333444443332111000011111 0111111000 Q ss_pred cc-ccccc-CcccEEEecCc--------------cchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc Q lcl|NC_019404. 155 SD-MFYDV-HYSRIHIIDGE--------------RVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA 218 (418) Q Consensus 155 ~~-~~~~i-H~SR~i~~~g~--------------~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~ 218 (418) .. ...+| -+.++.+|... .+|. ....++.+|.|.++ .+.+.+.+++.+....+.-+..++.. T Consensus 201 ~~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPv-v~~~nn~~g~sd~e-~v~~liDa~d~~~S~~~~~~~~~~~~ 278 (501) T protein:vir:27 201 NAKDVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPI-TEFLNNVDGIGDYE-TELYLIDLYDSAESDTANHMSDMADA 278 (501) T ss_pred CcEEEEEEEeCCeEEEEEeCCceeeccccccCCCcccE-EEecCCCCCCCchh-hhHHHHHHHHHHHHHHHHHHHHhcCc Confidence 00 00001 11122222110 1121 11234567999897 48899999999999999888888888 Q ss_pred eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEe--ecccCCHHHHHHHHHHHHhhhhcCCeeee Q lcl|NC_019404. 219 VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVL--NSDIGGIDAFLDKKFDRIVALSGIHEIIL 296 (418) Q Consensus 219 v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~--~~~~~gl~~~~~~~~~~iaaas~IP~t~L 296 (418) .+.+.+.... ..++ .....++..... . ...+.......+.++..+ +.+..+++..++.+.+.|...+++|-.-+ T Consensus 279 ~~v~~g~~~~-~~~~-~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~ 354 (501) T protein:vir:27 279 ILAIYGDLAL-PKGM-QASDMKRTRLMQ-L-KPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSD 354 (501) T ss_pred eeeeecCccC-Cccc-chhhhhhcCcee-e-cccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCc Confidence 8877753211 1111 111111110000 0 000000011112234444 45557899999999999999999997433 Q ss_pred eccCccccccchhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhhcc------------CCceEEeCCCCCCCHHHHHHH Q lcl|NC_019404. 297 KNKNVGGLSSSQNTALETFHK--LIDRKRNAELLPILEFLIPFIVNA------------EEWSVEFSPLDHESSKDKAEV 362 (418) Q Consensus 297 ~G~s~~gl~stge~d~~~y~~--~I~~~Qe~~l~p~l~~l~~~i~~~------------~~~~~~f~pL~~~~eke~ae~ 362 (418) +.. +| |.||..-.-.|.. .-...++..++..|++++.+++.- .++.+.|+|-...+.++.|++ T Consensus 355 -~~~-~~-n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~ 431 (501) T protein:vir:27 355 -TNF-SG-NTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSI 431 (501) T ss_pred -ccc-cc-CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHH Confidence 222 22 3456543322222 223344556788888888776531 247799999999999999988 Q ss_pred HHHHHHHHHH--H-HhCCCCC-HHHHHHHHHhhcCc---CC----CCh------hhc-ccccccCCCccc Q lcl|NC_019404. 363 LEKSVNSIAA--L-IAAGAMD-IKEARDTLRTIAPE---IK----IGD------NDI-QTEESELITETE 414 (418) Q Consensus 363 ~~~~a~a~~~--~-~~~g~i~-~~e~r~~l~~~~~~---~~----~~~------~~~-~~~e~~~~~e~e 414 (418) ..+.+..++. + -..+.++ +++-.+++++-... .. +.+ +.. +..+++-+...| T Consensus 432 ~~kl~g~iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 432 LTGLGGQVSQETALSLSGLVESPNEELDKINKEVSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFERAYE 501 (501) T ss_pred HHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhhcCccccccccccCCCCCCccccccccCC Confidence 7776544332 2 2334443 44444444221100 00 000 000 111111122222 No 117 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.50 E-value=6e-14 Score=93.09 Aligned_cols=369 Identities=14% Similarity=0.070 Sum_probs=184.2 Q ss_pred Cccc-------------------------------hhhHHHHhcCCCCc----cccCccccCCHHHHHHHHHcCCccchh Q lcl|NC_019404. 1 MVKT-------------------------------DSYANIFLGGSDGS----EIYGSLQNQAPTILASLYADNALVRRI 45 (418) Q Consensus 1 ~~~~-------------------------------D~~~n~~~g~~~~~----~~~~~~~~~~~~~l~~~Y~~~~~~r~i 45 (418) ||-. +-+.....|-+.-- ...+.........-......+++++.| T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~I 85 (479) T protein:vir:79 6 ISETDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLL 85 (479) T ss_pred ecccceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHH Confidence 1111 11111112211100 000000000000000001237889999 Q ss_pred hhcchhhhccCCccccCcchH-HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEee Q lcl|NC_019404. 46 IDTIPETALAAGFHIDGIDDE-PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYD 124 (418) Q Consensus 46 Vd~~a~d~~r~~~~i~~~~d~-~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~ 124 (418) |+..+...+.+++.++++++. ..+.+.|.+-++...+.++++.+..||.|++++.++.+. .+ .+.+++ T Consensus 86 vd~~~~~l~g~p~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~----------~~-~i~~~~ 154 (479) T protein:vir:79 86 VDQKVGYSVGNPIVFNADDDNLTKLLNDLLGEEFDDTITELYLNASNKGVEWLHPYINRKG----------EF-KYVIIP 154 (479) T ss_pred HHHHHhhhhcCCceeccCCHHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCC----------ce-EEEEEc Confidence 999999999999999886654 345556665588899999999999999999988764321 11 133333 Q ss_pred ccccccccccccccccccCcceEEEEecCCccc--cccc-CcccEEEec-----------------------------Cc Q lcl|NC_019404. 125 RTQVKVQNREENPRNARFGKPLTYRITTNESDM--FYDV-HYSRIHIID-----------------------------GE 172 (418) Q Consensus 125 ~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~--~~~i-H~SR~i~~~-----------------------------g~ 172 (418) +.++-|.+-......+.++ ..+|......+.. ...+ .+.++.+|. +. T Consensus 155 p~~~~~v~d~~~~~~~~~~-ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (479) T protein:vir:79 155 AEEAIPIWDSKRQRELVAF-IRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNK 233 (479) T ss_pred cceeEEEEeCCCCCceEEE-EEEEEEeecCCceEEEEEEEeCCcEEEEEecCCccccccccccccccccccccccccccc Confidence 3333222110000001110 0111111000000 0000 001111110 00 Q ss_pred c-----chhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHH Q lcl|NC_019404. 173 R-----VPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDN 247 (418) Q Consensus 173 ~-----lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~ 247 (418) + +|. ....++.+|.|.++ .+.+.+.+++.+....+.-+..++...+..++.... ........ T Consensus 234 ~~~~~~vPv-v~~~nn~~g~sd~~-~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~-----~~~~~~~~------ 300 (479) T protein:vir:79 234 EQGWGKVPF-IPFKNNEKCVSDLT-FYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGT-----SLQEFIDN------ 300 (479) T ss_pred ccCCCcccE-EEecCCCCCCcchh-hhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCcc-----ccccchhh------ Confidence 0 011 12234567899887 488999999999888888888777776666653211 10111111 Q ss_pred hcCCcceeEEEcCCCceeEee--cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHH--HHHHH Q lcl|NC_019404. 248 NSGVGQAIGIDAESEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKL--IDRKR 323 (418) Q Consensus 248 ~~~~~~~~~~d~~~e~~~~~~--~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~--I~~~Q 323 (418) ......+.++ ++.+++.++ .+..+....++.+.+.|...+.+|..-..+ .| |+||+.-...|... -...+ T Consensus 301 -~~~~~~i~~~-~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~g-n~Sg~Ai~~~~~~l~~k~~~~ 374 (479) T protein:vir:79 301 -IRYYKSIKVD-GGGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQN---TG-DKSGVALKFLYSLLDLKCSKT 374 (479) T ss_pred -hhhccceecC-CCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccccccc---cc-chhHHHHHHHHHHHHHHHHHH Confidence 1122333333 345555544 556688899999999999999999754422 23 34565433333332 22333 Q ss_pred HHHHHHHHHHHHHHhhc-----------cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhc Q lcl|NC_019404. 324 NAELLPILEFLIPFIVN-----------AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIA 392 (418) Q Consensus 324 e~~l~p~l~~l~~~i~~-----------~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~ 392 (418) +..++..+.+++++++. ..++.+.|++-...++++.|++..+. .|++|.+.+.+.|. T Consensus 375 ~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl---------~g~iS~et~l~~l~--- 442 (479) T protein:vir:79 375 EKKFKKAIRELLWFVCEYLKISGNKSYDYKTVQITFNHSMIINEAEKIDMAAKS---------TGIVSDETIVSNHP--- 442 (479) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHH---------hccCcHHHHHHhCC--- Confidence 44577788888777652 13678999999999999887765443 25666655544331 Q ss_pred CcCCCChhh-----------------cccccccCCCcc Q lcl|NC_019404. 393 PEIKIGDND-----------------IQTEESELITET 413 (418) Q Consensus 393 ~~~~~~~~~-----------------~~~~e~~~~~e~ 413 (418) +..-..++ +...+....+|+ T Consensus 443 -~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 443 -WVEDVNDELERLKKQEDTQKEYDDLIPNNQDGVIDET 479 (479) T ss_pred -CCCCHHHHHHHHHHHHHHHHHHHhccCcccCCCcCcC Confidence 11000111 111111222222 No 118 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.50 E-value=8.5e-15 Score=97.73 Aligned_cols=393 Identities=13% Similarity=0.110 Sum_probs=184.2 Q ss_pred CccchhhHHHHhcCCCC--------cccc-Cc-----cccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcch- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDG--------SEIY-GS-----LQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDD- 65 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~--------~~~~-~~-----~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d- 65 (418) |.|-+-+.+.+...... .++| |. ...-.+.++......+.+++.||+..++-+.=+||.+.++++ T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~d~~~ 80 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhccCceecCCCchh Confidence 54444443332110000 0011 10 000123344444456789999999999999888887754433 Q ss_pred HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccccc---ccc--- Q lcl|NC_019404. 66 EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREEN---PRN--- 139 (418) Q Consensus 66 ~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~d---p~s--- 139 (418) .+.+.+-|++-++.....++++.+.+||.|++++.-... .+.+..+.. .++++++.++.+.+-... +.. T Consensus 81 ~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~----~~~d~~~~~-~i~~~~p~~~~~i~D~~~~~~~~~~i~ 155 (480) T protein:vir:78 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDV----ESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) T ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCcc----ccCCCCCee-EEEEEcccceEEEEcCCCccceEEEEE Confidence 345777777778889999999999999999988753210 011122211 233444433332110000 000 Q ss_pred -----cccCcceE---------EEEecCCcc-ccccc------Ccc-c--EEEecCccchhhhhhccccCCcchHHHHHH Q lcl|NC_019404. 140 -----ARFGKPLT---------YRITTNESD-MFYDV------HYS-R--IHIIDGERVPNAMRRQNDGWGRSVLSSDIL 195 (418) Q Consensus 140 -----~~yg~p~~---------y~i~~~~~~-~~~~i------H~S-R--~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~ 195 (418) .+.+.+.. |++...++. ....+ |+= + |++|... .....+||.|.++..+. T Consensus 156 ~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~------~~~~~~~G~sdi~~~i~ 229 (480) T protein:vir:78 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND------PRLGNRYGRSEISPELR 229 (480) T ss_pred EEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecc------cccCCccCccchhHHHH Confidence 01111111 111111110 00001 110 1 1222111 12234689998876577 Q ss_pred HHHHHHHHHHHHHHHHHHHcCCceeecchHHHhh-cCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeec---cc Q lcl|NC_019404. 196 DSIKDYTNCERLATQLLRRKQQAVWKAKGLAELC-DDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNS---DI 271 (418) Q Consensus 196 ~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~-~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~---~~ 271 (418) +.+.++++++......+..++.+...+.+...-. ...... ..+.. ..+..+.+. +++.+..+. ++ T Consensus 230 ~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~----~~~~~-----~~~~~~~~~--~~~~~~~~~~~~~~ 298 (480) T protein:vir:78 230 KVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN----TTLDI-----YYGRILTLA--SEAAKISEFKAAEL 298 (480) T ss_pred HHHHHHHHHHHHHHHHHHhhcchhhhhhCCCcccccccccc----chhhh-----hhhhhccCC--CCCceEEecCccCH Confidence 8888999988888777776666655544322100 000000 00110 011222222 233343443 34 Q ss_pred CCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHhhcc-------- Q lcl|NC_019404. 272 GGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLI--DRKRNAELLPILEFLIPFIVNA-------- 341 (418) Q Consensus 272 ~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I--~~~Qe~~l~p~l~~l~~~i~~~-------- 341 (418) ....+.++....++++.+++|...|.|.+ .. ++||+.-.-.+...+ ...++..+++.|.+++.+++.- T Consensus 299 ~~~~~~l~~~i~~~~~~~~~p~~~fg~~~-~n-~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~ 376 (480) T protein:vir:78 299 RNFAEEMEVFRKEAASITGLPPQYLSSSS-EN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) T ss_pred HHHHHHHHHHHHHHhcccCCCHHHhcccc-Cc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcccc Confidence 45666778888899999999988775533 22 245654333333222 2233345678888888877631 Q ss_pred -CCceEEeCCCCCCCHHHHHHHHHHHHHHH-------HHHHhCCCCCHHHHHH--HHHh-hcC-----cCCCChhhcccc Q lcl|NC_019404. 342 -EEWSVEFSPLDHESSKDKAEVLEKSVNSI-------AALIAAGAMDIKEARD--TLRT-IAP-----EIKIGDNDIQTE 405 (418) Q Consensus 342 -~~~~~~f~pL~~~~eke~ae~~~~~a~a~-------~~~~~~g~i~~~e~r~--~l~~-~~~-----~~~~~~~~~~~~ 405 (418) .++++.|.+-..++..+.|+...+.+++. ..+-..|+.. +++.+ .+++ ... .......+-... T Consensus 377 ~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~-d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (480) T protein:vir:78 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTA-TQREQMRDWDKQETEDMIDTLYSTTKAQADAT 455 (480) T ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCH-hHHHHHHHHHHHHHHHHHHHhhccccCCCccc Confidence 25788999988999988877665554432 1122344432 22111 1110 000 000000000000 Q ss_pred c--ccCCCccccccC Q lcl|NC_019404. 406 E--SELITETEVVIA 418 (418) Q Consensus 406 e--~~~~~e~e~~~~ 418 (418) . ...+.++|.==| T Consensus 456 ~~~~~~~~~~~~~~~ 470 (480) T protein:vir:78 456 PKPTVTETKTETQTS 470 (480) T ss_pred cCCCCCCCCCccCCC Confidence 0 000111111111 No 119 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=99.49 E-value=3.9e-14 Score=94.12 Aligned_cols=347 Identities=12% Similarity=0.099 Sum_probs=162.3 Q ss_pred hhhHHHHhcCCCCcc---ccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HH----HHHHHHH Q lcl|NC_019404. 5 DSYANIFLGGSDGSE---IYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PA----FWSRWDD 75 (418) Q Consensus 5 D~~~n~~~g~~~~~~---~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~----i~~~~~~ 75 (418) =||.+-+.+..+... ........+ ...|..++.+.+||+.+++++-+-++.+...+.. .. +..+-.. T Consensus 1 Mg~f~~l~~~~~~~~~~~~~~~~~~~~----~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~~l~~ll~~~PN~ 76 (376) T protein:vir:78 1 MGFFSELFKRNKEIEWMWDLDFLEDKT----TKVYLKKMALNTCVKHIARTIAKSDFRLKNGETSVRDKLYYKLNIRPNT 76 (376) T ss_pred CchhhhhhccCCccccccchhhccccc----hhhhhhhHHHHHHHHHHHHhhcccceeeccccccccchHHHHHhhcccc Confidence 022222222211111 111111122 2235567889999999999999999888532211 11 2222222 Q ss_pred hCchHHHHHHHHhcc-ccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCC Q lcl|NC_019404. 76 LEMTQNINDAWSWAR-LFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNE 154 (418) Q Consensus 76 l~~~~~~~~a~~~~r-l~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~ 154 (418) .-.+..|.+.+.+.. ++|.|++++.-++ . +.+..+.++.+..+.+. ..+.+...+ T Consensus 77 ~~t~~~f~~~~~~~lll~Gn~~~~~~r~~-~---------~~~~~~~~~~~~~~~~~--------------~~~~~~~~~ 132 (376) T protein:vir:78 77 DMSSSSFWEKVIYKLIYDNECLIVLSDTD-D---------FLIADSYVRKEFAFFPD--------------VFEGVTVKD 132 (376) T ss_pred CCCHHHHHHHHHHHHhHcCcEEEEEEeCC-C---------eeeccceeecccceeee--------------eeeeeeeec Confidence 334455555555544 4688888775332 2 11111112221111111 112222221 Q ss_pred cccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHH-HHHcCCce-eecchHHHhhcCc Q lcl|NC_019404. 155 SDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQL-LRRKQQAV-WKAKGLAELCDDS 232 (418) Q Consensus 155 ~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l-~~~~~~~v-~k~~~l~~~~~~~ 232 (418) ......+.++.|+||.....| ...++.+.. . .+.......... .+..+... ++++. ...+ ++ T Consensus 133 ~~~~~~~~~~evih~~~~~~~------~~~~~~~~~-----~---~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~ 196 (376) T protein:vir:78 133 YRYNRNFSMDDVIFLEYGNER------LSAFTDGMF-----E---DYGELFGKMIRAQMRNFQIRGAVNFKM-AGVA-DK 196 (376) T ss_pred ceeeeeeccccEEEeccCCCC------chhhhhHHH-----H---HHHHHHHHHHHHHHhcCCCceeEEEcc-CCCC-CH Confidence 112235678889998532211 111222211 1 111111111111 22222222 22221 1111 22 Q ss_pred chHHHHHHHHHHHHHhc-CCcceeEEEcCCCceeEeecccCC-------HHHHHHHHHHHHhhhhcCCeeeeeccCcccc Q lcl|NC_019404. 233 EGFGAARLRLAQVDNNS-GVGQAIGIDAESEEYSVLNSDIGG-------IDAFLDKKFDRIVALSGIHEIILKNKNVGGL 304 (418) Q Consensus 233 ~~~~~~~~r~~~~~~~~-~~~~~~~~d~~~e~~~~~~~~~~g-------l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl 304 (418) +......+++...-... +....+++...+.+|+.++.+... +.+........||.+.+||..+|.| . T Consensus 197 e~~~~~~~~~~~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~----~- 271 (376) T protein:vir:78 197 DKQTKLQEYIDKVYASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHG----D- 271 (376) T ss_pred HHHHHHHHHHHHHhccccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCC----C- Confidence 22333444443332221 233445545556789888877643 4556667788899999999987743 1 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hccCCceE--EeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019404. 305 SSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VNAEEWSV--EFSPLDHESSKDKAEVLEKSVNSIAALIAAGA 378 (418) Q Consensus 305 ~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~~~~~~~--~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~ 378 (418) .++-++....||.. -|.|.+..+-..+ +...++.+ ++..+...|.++ +++++++++++|+ T Consensus 272 ~s~~e~~~~~f~~~-------~l~P~~~~ie~~l~~kll~~~~~~~~~~~~~ll~~d~~~-------~~~~~~~~~~~G~ 337 (376) T protein:vir:78 272 MADLSNNMKAYMEY-------CIDPLTKKLEDELNAKLFTFSEFLAGEHIKIIHKKDIIE-------NAEAVDKLVASGS 337 (376) T ss_pred CCCHHHHHHHHHHH-------HHHHHHHHHHHHHHhhhCCcccceecccchhhcccCHHH-------HHHHHHHHHhCCC Confidence 23334445555554 3777776664443 23345544 444666666654 4778889999999 Q ss_pred CCHHHHHHHHHhhcCcCC-CChhhccccc--ccCCCcccc Q lcl|NC_019404. 379 MDIKEARDTLRTIAPEIK-IGDNDIQTEE--SELITETEV 415 (418) Q Consensus 379 i~~~e~r~~l~~~~~~~~-~~~~~~~~~e--~~~~~e~e~ 415 (418) ++++|+|+.+. ..+-.+ ..|+-+.... +-...++++ T Consensus 338 ~t~NE~R~~lg-~~p~~~g~~d~~~~~~n~~~~~~~~e~g 376 (376) T protein:vir:78 338 FNRNEVRELLG-AERVDNPELDKYLITKNYQSADEGGEDG 376 (376) T ss_pred cCHHHHHHHhC-CCCCCCCCCceeeeccCceehhccccCC Confidence 99999998763 222111 1121111100 001123344 No 120 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.49 E-value=2e-13 Score=90.25 Aligned_cols=370 Identities=10% Similarity=0.069 Sum_probs=187.7 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HHHHHHHHHhCc Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PAFWSRWDDLEM 78 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~i~~~~~~l~~ 78 (418) +.|.+-+..-..|-+.--..+........ ....+++++.||+..+..++.+++.++++++. +.+..-+++.++ T Consensus 17 ~~r~~~l~~yy~g~~~il~~~~~~~~~~~-----~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~l~~~~~~n~~ 91 (429) T protein:vir:98 17 NLSYSAYKQLYEGDHAILQQKQKEQYKPD-----NRLVVNFAKYIVDTFNGYFIGVPVQTSHENKQVSNYLELLDGYNDQ 91 (429) T ss_pred HHHHHHHHHHhccccccccccccccCCCc-----ceeecchHHHHHHHHhhhhcccCceeecCChHHHHHHHHHHhhcCH Confidence 23333333333332210000000000000 12246899999999999999999999876543 345556666788 Q ss_pred hHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeeccccccccccccccccccC-------------- Q lcl|NC_019404. 79 TQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFG-------------- 143 (418) Q Consensus 79 ~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg-------------- 143 (418) ...+.++.+.+..||.|++++..+. |.. .+.+++++++.+.+-+.....|.++ T Consensus 92 ~~~~~~~~~~~~~~G~~~~~v~~d~~g~~------------~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~ 159 (429) T protein:vir:98 92 DDNNAELSKICSIYGHGYELVFNDENAEA------------GITYLTPLEAFIVYDDSIRQKPLFAVRYFYNKGGVLEGS 159 (429) T ss_pred hHHHHHHHHHHhhcCeEEEEEEecCCCcE------------EEEEEcccceEEEEeCCCCCceEEEEEEEEecCceEEEE Confidence 8999999999999999999887642 221 2334444444332211000001111 Q ss_pred --cceEEEE-ecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCcee Q lcl|NC_019404. 144 --KPLTYRI-TTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVW 220 (418) Q Consensus 144 --~p~~y~i-~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~ 220 (418) .+..+++ ...+.+ ..+-.+.=+.|.. +|. ....++.+|.|.++ .+.+.+.+++.+....+..+..++.+.+ T Consensus 160 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~g~--vPv-v~~~n~~~g~sd~e-~v~~liD~~d~~~s~~~~~~~~~~~p~~ 233 (429) T protein:vir:98 160 YSDASNITYFKDGEKG--IEIGESEPHPFDG--VPM-IEYVENEERQSLLA-SVVTLINAFNKAISEKANDVEYFADAYL 233 (429) T ss_pred EEeCceEEEEEecCCc--eEecccccccCCc--cce-EEecCCCCCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 1110000 000000 0000000000111 111 12234668999997 5889999999999988888888888877 Q ss_pred ecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEc---CCCceeE--eecccCCHHHHHHHHHHHHhhhhcCCeee Q lcl|NC_019404. 221 KAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDA---ESEEYSV--LNSDIGGIDAFLDKKFDRIVALSGIHEII 295 (418) Q Consensus 221 k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~---~~e~~~~--~~~~~~gl~~~~~~~~~~iaaas~IP~t~ 295 (418) .+.+.. .. .+. ...+ .....+.+.. ++.+.+. .+.+..+....++.+.+.|...+++|..- T Consensus 234 ~i~g~~-----~~-~~~-~~~~-------~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 299 (429) T protein:vir:98 234 KILGAE-----LD-DET-LKSL-------RDTRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVANIS 299 (429) T ss_pred eeecCC-----CC-cch-hhhH-------hhCceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccC Confidence 766421 11 111 1111 1122333322 2223444 45566778889999999999999999643 Q ss_pred eeccCccccccchhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhhcc----------CCceEEeCCCCCCCHHHHHHH Q lcl|NC_019404. 296 LKNKNVGGLSSSQNTALETFHK---LIDRKRNAELLPILEFLIPFIVNA----------EEWSVEFSPLDHESSKDKAEV 362 (418) Q Consensus 296 L~G~s~~gl~stge~d~~~y~~---~I~~~Qe~~l~p~l~~l~~~i~~~----------~~~~~~f~pL~~~~eke~ae~ 362 (418) . +.. | |+||+.-...+.. .++.+ +..++..++++..+++.- .+++++|++-...++.+.|+. T Consensus 300 ~-~~~--g-n~Sg~Al~~~~~~l~~k~~~~-~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~v~f~~~~p~~~~~~a~~ 374 (429) T protein:vir:98 300 D-ESF--G-TASGIALRYRLQAMDNLAKTK-ERKFMSGMNRRYKLIASYPTSKIGPKDWIGIKYKFTRNLPANLLEESQI 374 (429) T ss_pred c-ccc--c-cchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHH Confidence 3 211 2 3456543333333 33334 345677787777776431 257899999999999998877 Q ss_pred HHHHHHHHHH---HHhCCCC-CHHHHHHHHHhh-cCcCCCChhhcccccccCCCc Q lcl|NC_019404. 363 LEKSVNSIAA---LIAAGAM-DIKEARDTLRTI-APEIKIGDNDIQTEESELITE 412 (418) Q Consensus 363 ~~~~a~a~~~---~~~~g~i-~~~e~r~~l~~~-~~~~~~~~~~~~~~e~~~~~e 412 (418) ..+.+..++. +-..|.+ ++++..+.+++- .+.......++..++.+...| T Consensus 375 ~~kl~g~is~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 429 (429) T protein:vir:98 375 AGNLAGIVSEETQVGVLSIVENPQKEIERKNSDKSTLISRQAGGLNGQNTTTILE 429 (429) T ss_pred HHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhcCCCCCCCCC Confidence 6554322111 1122222 122222222110 000000011222222222222 No 121 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.49 E-value=1.5e-13 Score=90.91 Aligned_cols=373 Identities=10% Similarity=0.065 Sum_probs=194.0 Q ss_pred Cc------------------cchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC Q lcl|NC_019404. 1 MV------------------KTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG 62 (418) Q Consensus 1 ~~------------------~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~ 62 (418) -+ |.+-+..-..|-+.-..........+ .. ..+.++++||+..+...+.+++.+.+ T Consensus 24 ~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~----~k--i~~n~~~~Ivd~~~~~l~g~p~~~~~ 97 (470) T protein:vir:99 24 KLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTAPEKETGAD----NR--IVVNSAKYVVDVYNGYFCGIEPKLAL 97 (470) T ss_pred CcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccCcccccCCc----ce--eecchHHHHHHHHhhhhccCCeeEee Confidence 11 11111111222111000000000000 01 13578999999999999999999876 Q ss_pred cchH---HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeecccccccccccccc Q lcl|NC_019404. 63 IDDE---PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQNREENPR 138 (418) Q Consensus 63 ~~d~---~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~ 138 (418) .++. +.+.+.+++-++...+.++++.+..||.|++++.++. +.. .+.+++++++.+.+-+..-. T Consensus 98 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~------------~i~~~~p~~~~~i~d~~~~~ 165 (470) T protein:vir:99 98 LNDSSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARP------------HLMYSSPNHAFIIYDDTVQR 165 (470) T ss_pred CCchhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeE------------EEEEEccceeEEEEcCCCCc Confidence 5443 4567777777889999999999999999999887743 221 13333444332221000000 Q ss_pred ccc--------------------cCcceEEEEecCCcccc-----cccCcccEEEecCccchhhhhhccccCCcchHHHH Q lcl|NC_019404. 139 NAR--------------------FGKPLTYRITTNESDMF-----YDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSD 193 (418) Q Consensus 139 s~~--------------------yg~p~~y~i~~~~~~~~-----~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~ 193 (418) .+. |..-..|.+...+.... ...|+--.+ |+. ...++.+|.|.++. T Consensus 166 ~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v-----Pvv---~~~n~~~g~sd~e~- 236 (470) T protein:vir:99 166 QPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLV-----PAV---EFFENEERQGIFDS- 236 (470) T ss_pred ceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccccCCCcc-----ceE---eecCCCCCCcchHh- Confidence 000 11111222222111110 122331111 111 12345679998874 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEE----cCCCceeEee- Q lcl|NC_019404. 194 ILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGID----AESEEYSVLN- 268 (418) Q Consensus 194 ~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d----~~~e~~~~~~- 268 (418) +.+.+.+++.+....+..+..++...+.+.+.... ....+ +....+ .....+.+. +++.++..++ T Consensus 237 v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~-~~~~g--~~~~~~-------~~~~~~~~~~~~~~~~~~~~~l~~ 306 (470) T protein:vir:99 237 IKTLINALDKVISQKANQVEYFDNAYMYMIGFKLP-EDDEG--NPKFDF-------KNNRVLYVSQLDPDTNPQIGFIAK 306 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcc-ccccc--chhhhh-------hhcceeeecCCCCCCCCcceEEee Confidence 88999999999998888888888888777764211 11111 111111 112223222 2233455554 Q ss_pred -cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhcc--- Q lcl|NC_019404. 269 -SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETF---HKLIDRKRNAELLPILEFLIPFIVNA--- 341 (418) Q Consensus 269 -~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y---~~~I~~~Qe~~l~p~l~~l~~~i~~~--- 341 (418) .+..+....++.+.+.|...+++|-.-+ +.. +| |.||+.-...| ...++.+ +..++..|.+++.+++.. T Consensus 307 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~-~~-n~Sg~Ai~~~~~~l~~k~~~~-~~~~~~~l~~~~~li~~~~~~ 382 (470) T protein:vir:99 307 PDADQMQENLIQHLTDFIFMMAMVPNIQD-KNF-AG-NSSGVALQYKLFAMKNKADSK-ERKFDKSLMQLYRIVLATLFN 382 (470) T ss_pred cCChHHHHHHHHHHHHHHHHHhCCccccc-ccc-cc-CchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhc Confidence 4556888999999999999999996433 222 22 34555433322 2233333 455788888888776421 Q ss_pred --------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCCCHHHHHHHHHhhc-CcCCCCh---hhccccc Q lcl|NC_019404. 342 --------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAMDIKEARDTLRTIA-PEIKIGD---NDIQTEE 406 (418) Q Consensus 342 --------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i~~~e~r~~l~~~~-~~~~~~~---~~~~~~e 406 (418) .++++.|+|-...++.+.|++..+.+.+++. +-..+.+++++..+++++-. +...... ......+ T Consensus 383 ~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~giis~et~l~~l~~vd~~~E~eri~~E~~~~~~~~~~~~~~~d~~~ 462 (470) T protein:vir:99 383 NKQDQELWSELDFKFTRNLPEDMASAIDNAKNAEGIVSKKTQLGMIPDIEPDAEMKQIAKEKADAIKQTQQLSMPIDILK 462 (470) T ss_pred cCCcccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCCcCC Confidence 2678999999999999998887665543321 33455556655444443211 0000000 0111122 Q ss_pred ccCCCccc Q lcl|NC_019404. 407 SELITETE 414 (418) Q Consensus 407 ~~~~~e~e 414 (418) .++.+|+| T Consensus 463 ~d~~~ee~ 470 (470) T protein:vir:99 463 RDNNAEEE 470 (470) T ss_pred CCCCccCC Confidence 23333333 No 122 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.48 E-value=1.1e-13 Score=91.62 Aligned_cols=393 Identities=13% Similarity=0.040 Sum_probs=193.3 Q ss_pred Cc--------cchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcch--HH--- Q lcl|NC_019404. 1 MV--------KTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDD--EP--- 67 (418) Q Consensus 1 ~~--------~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d--~~--- 67 (418) +- |.+-+...+.|....- ...... ...........+++++.||+..+..++.+++.+++.++ .+ T Consensus 48 i~~~~~~~~~r~~~~~~yY~g~~~~i--~~~~~~-~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~~~~~~~~ 124 (501) T protein:vir:96 48 INHHKLRQAPRIQELLDYARGENHDV--LKSGRR-KDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDDNSQND 124 (501) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCcc--cCcccc-CccccccceeecchHHHHHHHHhhhhcccCeeEeeCCccchhHHH Confidence 11 1111111122321100 000000 00000011234789999999999999999999976542 12 Q ss_pred -HHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcce Q lcl|NC_019404. 68 -AFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPL 146 (418) Q Consensus 68 -~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~ 146 (418) .+.+-|++-++...+.++++.+..||.|++++..+.+. .+ .+.++++.++.+.+-+.....+.++ .. T Consensus 125 ~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg----------~~-~i~~~~p~~~~~v~d~~~~~~~~~~-v~ 192 (501) T protein:vir:96 125 DAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYD----------ET-RIKRLSPLETFVIYDNSLEDNSIAA-VR 192 (501) T ss_pred HHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCC----------ce-EEEEEccceeEEEEcCCCCCceEEE-EE Confidence 24455566688899999999999999999988764221 11 2334444443332211101111111 01 Q ss_pred EEEEecCCcc-cccc-cCcccEEEecCc--------------cchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 147 TYRITTNESD-MFYD-VHYSRIHIIDGE--------------RVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQ 210 (418) Q Consensus 147 ~y~i~~~~~~-~~~~-iH~SR~i~~~g~--------------~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~ 210 (418) .|........ ...+ +.+.++.++... .+| .....++.+|.|.++. +.+.+.+++.+....+. T Consensus 193 ~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~g~vP-vv~~~nn~~g~sd~e~-v~~liDa~d~~~s~~~~ 270 (501) T protein:vir:96 193 YYNRGTLQSAKDVVEIYTDEHIYTLDASDDFNEISVTTHAFGTVP-ITEYLNNIDGIGDYET-ELYLIDLYDSAESDTAN 270 (501) T ss_pred EEEeecCCCcEEEEEEEcCCcEEEEeeCCCceeccccccCCCccc-eEEecCCccCCCchhh-hHHHHHHHHHHHHHHHH Confidence 1111110000 0000 112222222110 011 1122345689999974 88999999999999999 Q ss_pred HHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCcee--EeecccCCHHHHHHHHHHHHhhh Q lcl|NC_019404. 211 LLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYS--VLNSDIGGIDAFLDKKFDRIVAL 288 (418) Q Consensus 211 l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~--~~~~~~~gl~~~~~~~~~~iaaa 288 (418) .+..++..++.+.+.... ..++ .....+..... .... .........+.+++ ..+.+.++++..++.+.+.|... T Consensus 271 ~~~~~~~~~l~i~G~~~~-~~~~-~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~ 346 (501) T protein:vir:96 271 HMSDMADAILAIYGDLAL-PKGM-QASDMKRTRLM-QLKP-PKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIF 346 (501) T ss_pred HHHHhcCceeeeeccccc-Cccc-chhhhhhcCee-eecc-cccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHH Confidence 888888888777754211 1111 11111110000 0000 00000111222344 34456678999999999999999 Q ss_pred hcCCeeeeeccCccccccchhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhc------------cCCceEEeCCCCC Q lcl|NC_019404. 289 SGIHEIILKNKNVGGLSSSQNTALETF---HKLIDRKRNAELLPILEFLIPFIVN------------AEEWSVEFSPLDH 353 (418) Q Consensus 289 s~IP~t~L~G~s~~gl~stge~d~~~y---~~~I~~~Qe~~l~p~l~~l~~~i~~------------~~~~~~~f~pL~~ 353 (418) +++|..-+.+ . +| |.||+.-...| ...+ ..++..++..|++++.+++. ..++.+.|+|-.. T Consensus 347 s~~p~~~~~~-~-~~-n~Sg~Al~~~~~~l~~ka-~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p 422 (501) T protein:vir:96 347 TNTPDMSDTN-F-SG-NTSGEALKYKLFGLDQDR-VDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLP 422 (501) T ss_pred hCCcccCccc-c-cc-cchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccceEEeCCCCC Confidence 9999755422 2 12 34555433222 2233 33345677888877777542 1247899999999 Q ss_pred CCHHHHHHHHHHHHHHHHH---HHhCCCC-CHHHHHHHHHhhcCc----CCCC---hhh--cccc-cccCCCccccccC Q lcl|NC_019404. 354 ESSKDKAEVLEKSVNSIAA---LIAAGAM-DIKEARDTLRTIAPE----IKIG---DND--IQTE-ESELITETEVVIA 418 (418) Q Consensus 354 ~~eke~ae~~~~~a~a~~~---~~~~g~i-~~~e~r~~l~~~~~~----~~~~---~~~--~~~~-e~~~~~e~e~~~~ 418 (418) .+.++.|++..+.+.+++. +-..+.+ ++++..+.+++-... .... +.. -.+. .....+|.|.++- T Consensus 423 ~n~~e~ad~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 423 KSLNEQVSILTGLGGQVSQETALSLSGLVESPNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFEREYE 501 (501) T ss_pred cCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHhhccccccchhhcccccCCcCCCCCCCccccccC Confidence 9999998877666543322 2234444 345545444321100 0000 000 0011 1112234455555 No 123 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.48 E-value=8.8e-14 Score=92.16 Aligned_cols=390 Identities=12% Similarity=0.076 Sum_probs=189.4 Q ss_pred Cccchh------------------hHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC Q lcl|NC_019404. 1 MVKTDS------------------YANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG 62 (418) Q Consensus 1 ~~~~D~------------------~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~ 62 (418) ++..+- +..-+.|-+.--.... ......-..--..+++++.||+..+..++.+++.+++ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~---~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~ 115 (511) T protein:vir:99 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELT---RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQD 115 (511) T ss_pred hccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccC---cccccccCcceeecchHHHHHHHHHhhhcccCceeec Confidence 111111 1112222221100000 0000000000123688999999999999999999987 Q ss_pred cchH--HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeeccccccccccccccc Q lcl|NC_019404. 63 IDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQNREENPRN 139 (418) Q Consensus 63 ~~d~--~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s 139 (418) +++. +.+.+-+++-++.....++.+...+||.|++++..+. +.. .+.++++.++-|.+-...... T Consensus 116 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~------------~i~~~~p~~~~~vyd~~~~~~ 183 (511) T protein:vir:99 116 DDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET------------RLYKSDAMSTFVIYDNTIERN 183 (511) T ss_pred CchHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCce------------EEEEEccceeEEEEcCCCCCc Confidence 6543 3556666677899999999999999999999887742 221 233344433333221111111 Q ss_pred cccCcceEEEEecCCcccc-----c-ccCcccEEEecCc--------------------cchhhhhhccccCCcchHHHH Q lcl|NC_019404. 140 ARFGKPLTYRITTNESDMF-----Y-DVHYSRIHIIDGE--------------------RVPNAMRRQNDGWGRSVLSSD 193 (418) Q Consensus 140 ~~yg~p~~y~i~~~~~~~~-----~-~iH~SR~i~~~g~--------------------~lp~~~~~~~~~~G~S~l~~~ 193 (418) |.++ ..+|.+........ . -..+.++.+|... .+|. ....++.+|.|.++ . T Consensus 184 ~~~~-vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv-v~~~nn~~g~sd~e-~ 260 (511) T protein:vir:99 184 SIAG-VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI-TEFSNNERRKGDYE-K 260 (511) T ss_pred eEEE-EEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccce-EEecCCCCCCCchh-h Confidence 1111 11111110000000 0 0111122222100 0111 11233557899887 4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHH-HHHHHHHh-cCCcceeEEEcCCCceeEee--c Q lcl|NC_019404. 194 ILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARL-RLAQVDNN-SGVGQAIGIDAESEEYSVLN--S 269 (418) Q Consensus 194 ~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~-r~~~~~~~-~~~~~~~~~d~~~e~~~~~~--~ 269 (418) +.+.+.+++.+....+..+..++..++.+.+.... .........+ +....... .......- ..++.++..++ . T Consensus 261 v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~d~~~l~~~~ 337 (511) T protein:vir:99 261 VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL--DPVEVRKQKEANVLFLEPTVYADSEGRE-TEGSVDGGYIYKQY 337 (511) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhchhhhhccCccc--Cchhhcccccccceeccccccccccccc-CCCCcceeEEeecC Confidence 88999999999999888888777776666543211 1111001110 00000000 00111111 12223455444 4 Q ss_pred ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHhhcc------ Q lcl|NC_019404. 270 DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKL--IDRKRNAELLPILEFLIPFIVNA------ 341 (418) Q Consensus 270 ~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~--I~~~Qe~~l~p~l~~l~~~i~~~------ 341 (418) +..+....++.+.+.|...+.+|-.-.-+. +| |.||..-...|... -...++..++..|++++++++.- T Consensus 338 ~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~--~g-n~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~ 414 (511) T protein:vir:99 338 DVQGTEAYKDRLNSDIHMFTNTPNMKDDNF--SG-TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRS 414 (511) T ss_pred CHHHHHHHHHHHHHHHHHHhCCcccccccc--cc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 556889999999999999999998544222 22 34555433333222 22333455788888887776421 Q ss_pred -------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCCC-HHHHHHHHHh--------hcCc-----CCC Q lcl|NC_019404. 342 -------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAMD-IKEARDTLRT--------IAPE-----IKI 397 (418) Q Consensus 342 -------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i~-~~e~r~~l~~--------~~~~-----~~~ 397 (418) .++++.|++-...+.++.+++..+.+-.++. +-..+.++ +++..+++++ .... .+. T Consensus 415 ~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~GiiS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 494 (511) T protein:vir:99 415 IDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKNMYQDPRNI 494 (511) T ss_pred cccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccccCCCC Confidence 2578999999999999998876665433321 22233333 3333333321 0011 111 Q ss_pred ChhhcccccccCCCccc Q lcl|NC_019404. 398 GDNDIQTEESELITETE 414 (418) Q Consensus 398 ~~~~~~~~e~~~~~e~e 414 (418) .+++-++.+....+++| T Consensus 495 ~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 495 NDDEQDDSTKDSIDKKE 511 (511) T ss_pred CCCCCCCCCcCcccccC Confidence 11222222223333444 No 124 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.47 E-value=9.8e-14 Score=91.90 Aligned_cols=389 Identities=11% Similarity=0.053 Sum_probs=186.0 Q ss_pred Cc------------------cchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC Q lcl|NC_019404. 1 MV------------------KTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG 62 (418) Q Consensus 1 ~~------------------~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~ 62 (418) .. |.+-+..-+.|-+..-........ ..-...-..+++++.||+..+..++.+++.+++ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~---~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~ 115 (511) T protein:vir:93 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKE---EYMADNRVAHDYASYISDFINGYFLGNPIQYQD 115 (511) T ss_pred hccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcc---cccCcceeecchHHHHHHHHhhhhcccCeeecc Confidence 00 111122222332211000000000 000000123689999999999999999999987 Q ss_pred cchH--HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeeccccccccccccccc Q lcl|NC_019404. 63 IDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQNREENPRN 139 (418) Q Consensus 63 ~~d~--~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s 139 (418) +++. +.+.+-+++-++.....++.+...+||.|++++..+. +.. .+.++++.++-+.+-+..... T Consensus 116 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~------------~i~~~~p~~~~~vydd~~~~~ 183 (511) T protein:vir:93 116 DDKDVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET------------RLYKSDAMSTFVIYDNTIERN 183 (511) T ss_pred CChHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCce------------EEEEEccceeEEEEcCCCCCc Confidence 6543 3455566667899999999999999999999987742 221 123333333322211100001 Q ss_pred cccCcceEEEEecCCcccc-----c-ccCcccEEEecCc--------------------cchhhhhhccccCCcchHHHH Q lcl|NC_019404. 140 ARFGKPLTYRITTNESDMF-----Y-DVHYSRIHIIDGE--------------------RVPNAMRRQNDGWGRSVLSSD 193 (418) Q Consensus 140 ~~yg~p~~y~i~~~~~~~~-----~-~iH~SR~i~~~g~--------------------~lp~~~~~~~~~~G~S~l~~~ 193 (418) |.++ ..+|.......... . -+.+.++.+|... .+|. ....++.+|.|.++ . T Consensus 184 ~~~~-vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv-v~~~nn~~g~gd~e-~ 260 (511) T protein:vir:93 184 SIAG-VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI-TEFSNNERRKGDYE-K 260 (511) T ss_pred eEEE-EEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccce-EEecCCCCCCCchh-h Confidence 1110 00111100000000 0 0011111111100 0111 11234567888887 4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHH-HHHHHH-HHhcCCcceeEEEcCCCceeE--eec Q lcl|NC_019404. 194 ILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAAR-LRLAQV-DNNSGVGQAIGIDAESEEYSV--LNS 269 (418) Q Consensus 194 ~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~-~r~~~~-~~~~~~~~~~~~d~~~e~~~~--~~~ 269 (418) +.+.+.+++.+....+..+..++..++.+.+....- ........ .+.... .........+-. .++.++.. .+. T Consensus 261 v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~ 337 (511) T protein:vir:93 261 VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD--PVEVRKQKEANVLFLEPTVYADSEGRET-EGSVDGGYIYKQY 337 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccC--chhhcccccccceecccccccccccccC-CCCcceeEEeecC Confidence 889999999999998888888887777666532110 00000000 000000 000000111111 12234444 445 Q ss_pred ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhcc----- Q lcl|NC_019404. 270 DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFH---KLIDRKRNAELLPILEFLIPFIVNA----- 341 (418) Q Consensus 270 ~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~---~~I~~~Qe~~l~p~l~~l~~~i~~~----- 341 (418) +..+....++.+.+.|...+.+|..-. +.. +| |.||+.-...|. ..++ .++..++..|++++.+++.. T Consensus 338 ~~~~~~~~~~~L~~~I~~~s~~P~~~~-~~~-~~-n~Sg~Al~~~~~~l~~k~~-~k~~~f~~~l~~~~~li~~~l~~~~ 413 (511) T protein:vir:93 338 DVQGTEAYKDRLNSDIHMFTNTPNMKD-DNF-SG-TQSGEAMKYKLFGLEQRTK-TKEGLFTKGLRRRAKLLETILKNTW 413 (511) T ss_pred CHHHHHHHHHHHHHHHHHHhCCccccc-ccc-cc-cchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcc Confidence 667899999999999999999997543 211 22 345554332222 2333 33456788888887776521 Q ss_pred --------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCCC-HHHHHHHHHh--------hcCcCCCChhh Q lcl|NC_019404. 342 --------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAMD-IKEARDTLRT--------IAPEIKIGDND 401 (418) Q Consensus 342 --------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i~-~~e~r~~l~~--------~~~~~~~~~~~ 401 (418) .++++.|++-...+.+|.+++..+.+..++. +-..+.++ +++..+++++ .....+....+ T Consensus 414 ~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 493 (511) T protein:vir:93 414 SIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRD 493 (511) T ss_pred CcccccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCC Confidence 2468999999999999998876655433222 11223332 3333333211 00001000111 Q ss_pred c-----ccccccCCCccc Q lcl|NC_019404. 402 I-----QTEESELITETE 414 (418) Q Consensus 402 ~-----~~~e~~~~~e~e 414 (418) . ++.+.+...|+| T Consensus 494 ~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 494 INDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCcccccccccC Confidence 1 111112222333 No 125 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=99.46 E-value=4.9e-13 Score=88.05 Aligned_cols=380 Identities=13% Similarity=0.089 Sum_probs=188.7 Q ss_pred CccchhhHHHHhcCCCCc-c-ccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HHHHHHHHHh Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGS-E-IYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PAFWSRWDDL 76 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~-~-~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~i~~~~~~l 76 (418) +=+.+-+..-+.|-+.-- + .........+.......-.+++++.||+..+..++.+++.+.++++. +.+...+ .= T Consensus 42 ~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~~~~~~l~~~~-~n 120 (478) T protein:vir:10 42 IDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTL-NH 120 (478) T ss_pred HHHHHHHHHHhcccccccccchhhhcccccccccccceeccchHHHHHHHHhhhhcccCceeecCChHHHHHHHHHH-hc Confidence 112222222223322100 0 00000000000000111236899999999999999999999876543 2233333 34 Q ss_pred CchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcc Q lcl|NC_019404. 77 EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESD 156 (418) Q Consensus 77 ~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~ 156 (418) ++...+.++.+.+..||.|++++.++.+. .+ .+.++++.++-|.+-+.+...+.++ ...|...+.. T Consensus 121 ~~~~~~~~~~~~~~~~G~~~~~v~~d~~~----------~~-~~~~~~p~~~~~v~d~~~~~~~~~~-ir~~~~~~~~-- 186 (478) T protein:vir:10 121 KWDDKLVDILTAASNKGIEWVQPYVDEEG----------EF-KTFRVPAEQAVPIWTNKERDELQAF-IRVYELDGAE-- 186 (478) T ss_pred cHHHHHHHHHHHHhhCCeEEEEEEecCCC----------ce-EEEEEcccceEEEEcCCCCCceEEE-EEEEeeeCce-- Confidence 67788899999999999999988774221 12 1333444333322211111111111 0111111110 Q ss_pred ccccc-CcccEEEecC------------------------c-----cchhhhhhccccCCcchHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 157 MFYDV-HYSRIHIIDG------------------------E-----RVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCER 206 (418) Q Consensus 157 ~~~~i-H~SR~i~~~g------------------------~-----~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~ 206 (418) ..++ .+.++.++.. . .+|. ....++..|.|.++. +.+.+.+|+.+.. T Consensus 187 -~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv-v~~~n~~~g~sd~e~-v~~liDa~~~~~S 263 (478) T protein:vir:10 187 -RVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPF-IPFKNNPQEVSDLFM-YKTIIDALDKRLS 263 (478) T ss_pred -EEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceE-EEeccCCCCCCcHHH-HHHHHHHHHHHHH Confidence 0001 1112211110 0 0111 122345578898875 8899999999999 Q ss_pred HHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEc-CCCceeEe--ecccCCHHHHHHHHHH Q lcl|NC_019404. 207 LATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDA-ESEEYSVL--NSDIGGIDAFLDKKFD 283 (418) Q Consensus 207 ~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~-~~e~~~~~--~~~~~gl~~~~~~~~~ 283 (418) ..+..+..++...+.+.+.. +.........+ .....+.+.+ ++.+.+.+ +.+..++...++.+.+ T Consensus 264 ~~~~~~~~~~~~~~~~~g~~-----~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 331 (478) T protein:vir:10 264 DTQNTFDESVELIYILKGYE-----GEDMKDFMHNL-------KYYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRD 331 (478) T ss_pred HHHHHHHHhhCcceeeecCC-----cccccchhhhh-------hhCceeEecCCCCCcceEEeecCCHHHHHHHHHHHHH Confidence 99988888888777766532 11101111111 1133444433 23344444 4566789999999999 Q ss_pred HHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhc-------cCCceEEeCCCCCC Q lcl|NC_019404. 284 RIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEFLIPFIVN-------AEEWSVEFSPLDHE 354 (418) Q Consensus 284 ~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~l~~~i~~-------~~~~~~~f~pL~~~ 354 (418) .|...+++|-.-. + +-+| |.||..-...|..... ...+..+++.+++++.+++. ..++++.|++-... T Consensus 332 ~I~~~s~~p~~~~-~-~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~d~~~i~i~f~~~~p~ 408 (478) T protein:vir:10 332 YIIEFGQGVDFQQ-D-KFGN-SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVRVQDIEITFNFNVMV 408 (478) T ss_pred HHHHHhCCcCcCc-c-cccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEeCCCCCC Confidence 9999999996433 2 1122 4456543333333322 33345578888888888763 13688999999999 Q ss_pred CHHHHHHHHHHHHHHHHH---HHhCCCC-CHHHHHHHHHh-------hcCcCCCChhhcccccccCCCccc Q lcl|NC_019404. 355 SSKDKAEVLEKSVNSIAA---LIAAGAM-DIKEARDTLRT-------IAPEIKIGDNDIQTEESELITETE 414 (418) Q Consensus 355 ~eke~ae~~~~~a~a~~~---~~~~g~i-~~~e~r~~l~~-------~~~~~~~~~~~~~~~e~~~~~e~e 414 (418) +++|.+++..+.+..++. +-..+.+ ++++..+++++ ..+..+-+..+ ++.+...+++.| T Consensus 409 ~~~e~~~~~~~~~g~iS~et~i~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d-~~~~~~~d~~~e 478 (478) T protein:vir:10 409 NELENSQIAMNSTGLLSKETILGNHSWVQDPVAEMERIEQENIELNQQLPDIEEGLND-EQQRQSEDNQSE 478 (478) T ss_pred CHHHHHHHHHHHhCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccccCCCCcc-cccccCcCCCCC Confidence 999988776554432222 1223322 33333333321 11111111111 111112222333 No 126 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.46 E-value=5e-13 Score=88.04 Aligned_cols=367 Identities=11% Similarity=0.076 Sum_probs=188.5 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HHHHHHHHHhCc Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PAFWSRWDDLEM 78 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~i~~~~~~l~~ 78 (418) +-|.+-+.+-..|-+.-.. .+.......... ...++++.||+..+..++.+++.+.++++. +.+++-|..-++ T Consensus 33 ~~r~~~~~~Yy~g~~~i~~---~~~~~~~~~~~k--i~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~ 107 (452) T protein:vir:36 33 VARYEYLKNMYLGIMAIDD---EPAKDSWKPDNR--LAVNFTKYIVDTFTGYFNGIPVKKSHSDKEILTKLQEFDNLNDM 107 (452) T ss_pred HHHHHHHHHHhcccccccc---CccccccCccce--eecchHHHHHHHHhhhhcccCceeecCChhHHHHHHHHHhhcCh Confidence 2222222222333221100 000000000111 236899999999999999999999865443 445556666788 Q ss_pred hHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeeccccccccccccccccc---------------- Q lcl|NC_019404. 79 TQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQNREENPRNAR---------------- 141 (418) Q Consensus 79 ~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~---------------- 141 (418) ...+.++++.+..||.|++++..+. +.. .+.++++.++.+.+-+.....+. T Consensus 108 ~~~~~~~~~~~~~~G~~~~~v~~d~~g~~------------~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~ 175 (452) T protein:vir:36 108 EDEESELAKMACIYGRAFEFLYQDEDTQT------------NVVYNSPENMFMVYDDTVKQEPLFAVRYGVDEDKKLQGE 175 (452) T ss_pred hHHHHHHHHHHHhcCeEEEEEEecCCCee------------EEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEE Confidence 8999999999999999999887642 222 12333333332211000000011 Q ss_pred -cCcceEEEEecCCcccc---cccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019404. 142 -FGKPLTYRITTNESDMF---YDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ 217 (418) Q Consensus 142 -yg~p~~y~i~~~~~~~~---~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~ 217 (418) |..-..|.+...+++.. ..-|+--. +|. ....++..|.|.++ .+.+.+.+++.+....+..+..++. T Consensus 176 vyt~~~i~~~~~~~~~~~~~~~~~~~~g~-------iPv-v~~~n~~~g~sd~e-~v~~liDa~d~~~s~~~~~~~~~~~ 246 (452) T protein:vir:36 176 VYTLLETIKISGENDEISFGEGTYNPYPD-------LPV-VEFYFNEERMSIFE-SVISLVNAFNKAISEKANDVDYFSD 246 (452) T ss_pred EEecCeEEEEEEcCCceEEecceeccCCc-------ccE-EEecCCCCCCcchH-HHHHHHHHHHHHHHHHHHHHHHhcC Confidence 11111122211111100 01122111 111 11233456888887 5889999999999999998888888 Q ss_pred ceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCC----CceeEee--cccCCHHHHHHHHHHHHhhhhcC Q lcl|NC_019404. 218 AVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAES----EEYSVLN--SDIGGIDAFLDKKFDRIVALSGI 291 (418) Q Consensus 218 ~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~----e~~~~~~--~~~~gl~~~~~~~~~~iaaas~I 291 (418) +++...+.. + .. .-...+. .+..+.+..++ .++..++ .+.++....++.+.+.|...+++ T Consensus 247 p~~~~~g~~-~----~~--~~~~~~~-------~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~ 312 (452) T protein:vir:36 247 QYLTFLGAA-V----EE--EDLKNIR-------SNRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMV 312 (452) T ss_pred ceeEeecCC-c----Cc--hhhhhhh-------hcceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCc Confidence 877776421 1 11 0111110 12333333322 2344444 55678889999999999999999 Q ss_pred CeeeeeccCccccccchhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhhc----------cCCceEEeCCCCCCCHHH Q lcl|NC_019404. 292 HEIILKNKNVGGLSSSQNTALETFHK---LIDRKRNAELLPILEFLIPFIVN----------AEEWSVEFSPLDHESSKD 358 (418) Q Consensus 292 P~t~L~G~s~~gl~stge~d~~~y~~---~I~~~Qe~~l~p~l~~l~~~i~~----------~~~~~~~f~pL~~~~eke 358 (418) |..- ++. .| |+||+.-...|.. .++.+ +..++..+++++.+++. ..++++.|++-...++.+ T Consensus 313 p~~~-~~~--~g-n~Sg~Al~~~~~~l~~k~~~~-~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~ 387 (452) T protein:vir:36 313 ANIS-DES--FG-SSSGVSLAYKLQAMSNLALSF-QRKFQSSLNSRYKLFCELSTNVSNKDSWKDIEYTFTRNEPKDIKE 387 (452) T ss_pred cccC-ccc--cc-CCcHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCcCHHH Confidence 9632 222 23 4566654333333 33333 34467788887777653 136789999999999999 Q ss_pred HHHHHHHHHHHHHH---HHhCCCCC-HHHHHHHHHhh-c------CcCCCChhhcccccccCCCccc Q lcl|NC_019404. 359 KAEVLEKSVNSIAA---LIAAGAMD-IKEARDTLRTI-A------PEIKIGDNDIQTEESELITETE 414 (418) Q Consensus 359 ~ae~~~~~a~a~~~---~~~~g~i~-~~e~r~~l~~~-~------~~~~~~~~~~~~~e~~~~~e~e 414 (418) .|++..+.+.+++. +-..|.++ +++..+++++- . .....++++.. +...+++.| T Consensus 388 ~a~~~~k~~g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~--~~~~~~~~e 452 (452) T protein:vir:36 388 QAETANILMGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTD--TVVSETNEE 452 (452) T ss_pred HHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccCCCCccc--ccCccccCC Confidence 98876665433221 22333332 33333333211 0 00111111111 112222233 No 127 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.46 E-value=3.5e-14 Score=94.37 Aligned_cols=391 Identities=15% Similarity=0.127 Sum_probs=188.8 Q ss_pred CccchhhHHHH------------------hcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC Q lcl|NC_019404. 1 MVKTDSYANIF------------------LGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG 62 (418) Q Consensus 1 ~~~~D~~~n~~------------------~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~ 62 (418) ++..+-+.+++ .|....-. ...........-......+++++.||+..+..++.+++.+.+ T Consensus 29 ~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~-~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~ 107 (481) T protein:vir:10 29 LLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDIL-AGERRLQKYGDKADHRAVHNYAKYVSRFIVGYLTGNPITITH 107 (481) T ss_pred hcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc-cCccccccccccccceeecchHHHHHHHHHhhhccCCceEec Confidence 33333232222 22211000 000000000000001124689999999999999999998876 Q ss_pred cch--HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccccccccc Q lcl|NC_019404. 63 IDD--EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNA 140 (418) Q Consensus 63 ~~d--~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~ 140 (418) +++ .+.+.+-|++.++...+.++++.+.+||.|++++..+.+. .+ .+.++++.++.+..-+.....+ T Consensus 108 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg----------~~-~i~~~~p~~~~~v~d~~~~~~~ 176 (481) T protein:vir:10 108 QDNQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFED----------RD-TFKVLDPKSTFVVYDQTLDKKV 176 (481) T ss_pred CChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCC----------eE-EEEEEcccceEEEEcCCCCCce Confidence 543 3467777888889999999999999999999988774221 11 2334444443322110000011 Q ss_pred ccCcceEEEEecCCccc--cccc-CcccEEEecCc---------------cchhhhhhccccCCcchHHHHHHHHHHHHH Q lcl|NC_019404. 141 RFGKPLTYRITTNESDM--FYDV-HYSRIHIIDGE---------------RVPNAMRRQNDGWGRSVLSSDILDSIKDYT 202 (418) Q Consensus 141 ~yg~p~~y~i~~~~~~~--~~~i-H~SR~i~~~g~---------------~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~ 202 (418) .++ ...|......... ...+ -+.++.+|... .+|. ....++.+|.|.++ .+.+.+.+++ T Consensus 177 ~~~-i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPv-v~~~n~~~g~~~~~-~v~~lida~~ 253 (481) T protein:vir:10 177 VAG-VRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPI-IEYLNDQFKQGDFE-NVIALIDLYD 253 (481) T ss_pred EEE-EEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccccCCceeE-EEeecCCCCCCchh-hHHHHHHHHH Confidence 111 0111111100000 0001 01112111100 1111 11234567889887 5888999999 Q ss_pred HHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEe--ecccCCHHHHHHH Q lcl|NC_019404. 203 NCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVL--NSDIGGIDAFLDK 280 (418) Q Consensus 203 ~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~--~~~~~gl~~~~~~ 280 (418) .+....+..+..++..++.+.+.... .++....... .... ...........+++.+++.+ +.+..++...++. T Consensus 254 ~~~s~~~~~~~~~~~~~~~~~g~~~~--~~~~~~~~~~--~~~~-~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 328 (481) T protein:vir:10 254 SAQSDTANYMTDLNDAMLAIIGNVDL--DSEDAKAFRD--ANMI-HLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKR 328 (481) T ss_pred HHHHHHHHHHHHhcCceeEeecCcCC--Cccchhhhhh--ccce-eccccccccCCCCCcceeEEeecCCHHHHHHHHHH Confidence 99988888888888888877753221 1111111111 0000 00000001111222344444 4455788999999 Q ss_pred HHHHHhhhhcCCeeeeeccCccccccchhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------CCceE Q lcl|NC_019404. 281 KFDRIVALSGIHEIILKNKNVGGLSSSQNTALE---TFHKLIDRKRNAELLPILEFLIPFIVNA-----------EEWSV 346 (418) Q Consensus 281 ~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~---~y~~~I~~~Qe~~l~p~l~~l~~~i~~~-----------~~~~~ 346 (418) +.+.|...+++|.. -+|.. +| |.||+.-.. .....++.+ +..++..+++++.+++.- .++++ T Consensus 329 l~~~i~~~s~~p~~-~~~~~-~~-n~Sg~Al~~~~~~l~~k~~~~-~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v 404 (481) T protein:vir:10 329 LQNDIHKYTNTPDL-NDEQF-SG-VQSGESMKYKLFGLEQVRAIK-ERLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTI 404 (481) T ss_pred HHHHHHHHhCCccc-ccccc-cc-ccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhccCCCccccceeeE Confidence 99999999999964 33422 22 345553322 233344444 355788888888876531 25789 Q ss_pred EeCCCCCCCHHHHHHHHHHHHHHHH---HHHhCCCCC-HHHHHHHHHh-hcCcCCCC-hhhc--ccccccCCCcccc Q lcl|NC_019404. 347 EFSPLDHESSKDKAEVLEKSVNSIA---ALIAAGAMD-IKEARDTLRT-IAPEIKIG-DNDI--QTEESELITETEV 415 (418) Q Consensus 347 ~f~pL~~~~eke~ae~~~~~a~a~~---~~~~~g~i~-~~e~r~~l~~-~~~~~~~~-~~~~--~~~e~~~~~e~e~ 415 (418) .|+|-...++++.|++..+.+-.++ .+-..+.++ +++..+.+++ ........ ...+ ..++....+++++ T Consensus 405 ~f~~~~~~~~~~~a~~~~kl~g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 405 TFTPNLPKSMMESINAFNALSGGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKRGYGEAFENHLNVDDSNG 481 (481) T ss_pred EeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhccCCccCCCCCCCCCCCC Confidence 9999999999988877555432111 111222222 2222222211 00000000 0000 0011112233444 No 128 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.46 E-value=4.5e-14 Score=93.75 Aligned_cols=391 Identities=14% Similarity=0.124 Sum_probs=181.8 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH-HHHHHHHHHhCch Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE-PAFWSRWDDLEMT 79 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-~~i~~~~~~l~~~ 79 (418) +-|.+-+.--..|-..- . ....-.+.++......+.+++.||+..++-+.-+||.+.++++. ..+.+.|++-++. T Consensus 28 ~~rl~~l~~Yy~G~~~i-~---~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~~~~~~l~~i~~~N~~d 103 (484) T protein:vir:77 28 TQDLGDNTAYYESERRP-D---AVGVTVPQQMQKLLAHVGYPRLYIDAIAARQELEGFRLGGADKADEQLWDWWQANDLD 103 (484) T ss_pred HHHHHHHHHHHhccccc-h---hcccccchhHHhhhhhcCcHHHHHHHHHhhhccCceecCCcchhHHHHHHHHHhcCHh Confidence 11111111111222110 0 01111234555666678999999999999988889887654433 4577778888888 Q ss_pred HHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccc--ccccc-------ccccCcce---- Q lcl|NC_019404. 80 QNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNR--EENPR-------NARFGKPL---- 146 (418) Q Consensus 80 ~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~--~~dp~-------s~~yg~p~---- 146 (418) ....++++.+..||.|++++..+.+.... .... ..-.|+++++.++.+.+- ...+. ..+.+++. T Consensus 104 ~~~~~~~~~a~~~G~a~~~v~~~~~~~~~--~~~~-~~~~i~~~~p~~~~~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~ 180 (484) T protein:vir:77 104 IESTLGHTDSLVHGRSYITISKPDPNIDP--GVDP-EVPIIRVEPPTNLYAQIDPRTRQVMRAIRAIEDEEGNEVIGATL 180 (484) T ss_pred HHHHHHHHHHhhcCceEEEEecCCCCccc--cccc-ccceEEEeccceeEEEecCCCCceEEEEEEEEeecCCcEEEEEE Confidence 99999999999999999887654321110 0000 000122222222211100 00000 00111111 Q ss_pred -----EEEEecCCccc---ccccCcccE---EEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019404. 147 -----TYRITTNESDM---FYDVHYSRI---HIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRK 215 (418) Q Consensus 147 -----~y~i~~~~~~~---~~~iH~SR~---i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~ 215 (418) .|++....+.. ...-|+--. +.|..+ .....+||.|.++..+.+.+.+++.+.......+..+ T Consensus 181 y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~------~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~ 254 (484) T protein:vir:77 181 YLPNNTVIWNREDGQWVQVANVAHNLEMVPVIPIPNR------TRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELM 254 (484) T ss_pred EecCeEEEEEecCCceEeeccccCCCCCcceEEeccc------cccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhh Confidence 12221111110 001133221 222111 1234468999997667777788888877666666555 Q ss_pred CCceeecchHH--HhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeec---ccCCHHHHHHHHHHHHhhhhc Q lcl|NC_019404. 216 QQAVWKAKGLA--ELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNS---DIGGIDAFLDKKFDRIVALSG 290 (418) Q Consensus 216 ~~~v~k~~~l~--~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~---~~~gl~~~~~~~~~~iaaas~ 290 (418) +.+...+.+.. ..... .... ...+.. ..+..+++.+ ++....+. ++.+..+.++.....+|+.++ T Consensus 255 a~p~~~i~G~~~~~~~~~-~~~~--~~~~~~-----~~~~~~~~~~--~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~ 324 (484) T protein:vir:77 255 GVPQRLLFGVKGEELGVD-PETG--QTLFDA-----YLARILAFED--HESKAQQFSAAELRNFVDALDALDRKAAAYTG 324 (484) T ss_pred hhhHHHHhCCCcchhccc-cccc--chhhhh-----hhhhhcccCC--CCceeEeecCCChHHHHHHHHHHHHHHhcccC Confidence 55444433321 11000 0000 001110 0111222222 33333333 344566677788889999999 Q ss_pred CCeeeeeccCccccccchhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhhcc----------CCceEEeCCCCCCCHH Q lcl|NC_019404. 291 IHEIILKNKNVGGLSSSQNTALETFHK---LIDRKRNAELLPILEFLIPFIVNA----------EEWSVEFSPLDHESSK 357 (418) Q Consensus 291 IP~t~L~G~s~~gl~stge~d~~~y~~---~I~~~Qe~~l~p~l~~l~~~i~~~----------~~~~~~f~pL~~~~ek 357 (418) +|..-|.|.+. . ++||+.-...+.. .++.+| ..+.+.+.+++.+++.- .++.+.|.+...++.. T Consensus 325 ~p~~~fg~~~~-n-~~Sg~Al~~~~~~l~~ka~~k~-~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~ 401 (484) T protein:vir:77 325 LPPYYLSFSSE-N-PASAEAIRSSESRLVKTVERKN-KIFGGAWEQAMRVAYKVMNGGDIPPEYYRMESIWRDPSTPTYA 401 (484) T ss_pred CCHHHhccccC-c-chHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCCCcccccccceEEecCCCCCCHH Confidence 99988855442 2 2456544433333 334444 34677788877776531 2467899998899998 Q ss_pred HHHHHHHHHHHHH------H-HHHhCCCCCHHHH-HHHHHh---------hcCcCCC-----Ch-hhcccccccCCCccc Q lcl|NC_019404. 358 DKAEVLEKSVNSI------A-ALIAAGAMDIKEA-RDTLRT---------IAPEIKI-----GD-NDIQTEESELITETE 414 (418) Q Consensus 358 e~ae~~~~~a~a~------~-~~~~~g~i~~~e~-r~~l~~---------~~~~~~~-----~~-~~~~~~e~~~~~e~e 414 (418) +.|+...|.+++- . .+-..|+...+.. .+.+++ .....+. .+ +.-+.++..+..+++ T Consensus 402 ~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (484) T protein:vir:77 402 AKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPETPEPQPNPAEE 481 (484) T ss_pred HHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCCcccccCCCccc Confidence 8877655554431 0 1122333322111 011110 0001111 01 111112222222222 Q ss_pred ccc Q lcl|NC_019404. 415 VVI 417 (418) Q Consensus 415 ~~~ 417 (418) ..= T Consensus 482 ~~~ 484 (484) T protein:vir:77 482 AAA 484 (484) T ss_pred cCC Confidence 111 No 129 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.46 E-value=9.8e-14 Score=91.90 Aligned_cols=366 Identities=12% Similarity=0.046 Sum_probs=197.1 Q ss_pred CccchhhHHHHhcC-CCC---cccc-Cc-----cccCCHHHHHHHHHc-CCccchhhhcchhhhccCCccccCcchHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGG-SDG---SEIY-GS-----LQNQAPTILASLYAD-NALVRRIIDTIPETALAAGFHIDGIDDEPAF 69 (418) Q Consensus 1 ~~~~D~~~n~~~g~-~~~---~~~~-~~-----~~~~~~~~l~~~Y~~-~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i 69 (418) -+..|.|...+..- .+- .++| |. ...-.+.++..+++. .++.++||+.+++-+.=.||... |. .+ T Consensus 3 ~~~i~~L~~~~~~~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~---d~-~l 78 (422) T protein:vir:97 3 YMGMGYLRRKLALFKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRIIFREFTND---DF-NA 78 (422) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccccceeeCC---ch-hH Confidence 22223333322110 000 0111 11 111134566666643 36779999999998777888743 22 25 Q ss_pred HHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccc-------------ccccc Q lcl|NC_019404. 70 WSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ-------------NREEN 136 (418) Q Consensus 70 ~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~-------------~~~~d 136 (418) .+-|.+-++.....++++.+.+||.|++++.-+++. ..|+ ++++++.++... .+..| T Consensus 79 ~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~--~~p~--------i~~~sp~~~~~i~D~~~~~~~~a~~~~~~~ 148 (422) T protein:vir:97 79 WEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAED--GLPK--------MQVIEASKATGILDPTTFLLTEGYAILESD 148 (422) T ss_pred HHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCC--CeeE--------EEEechhhEEEEEeCCCCcceeeEEEEEec Confidence 666777778888889999999999999998543211 1222 222222222111 11111 Q ss_pred ----ccccccCcceE-EEEecCCcccccccCccc---EEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 137 ----PRNARFGKPLT-YRITTNESDMFYDVHYSR---IHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLA 208 (418) Q Consensus 137 ----p~s~~yg~p~~-y~i~~~~~~~~~~iH~SR---~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~ 208 (418) +..-.|..+.. |.+...+. ....-|+-. ++.|..+ ......||.|.+.+++.+.+.++.+++... T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~g~vPvv~~~n~------~~~~~~~G~s~I~e~v~~l~da~~r~~~~~ 221 (422) T protein:vir:97 149 SNGNPTLEAYFTDKDIWYYPKKGK-PYNIKNPTGHPLLVPIIHR------PDAVRPFGRSRITKAGMYHQKAAKRTLERA 221 (422) T ss_pred CCCcEEEEEEEcCceEEEEcCCCc-cccccCCCCCcceEEeccc------CCCccccCccccchhHHHHHHHHHHHHHHH Confidence 11111222222 22222211 111223322 2222222 123457899988667889999999998887 Q ss_pred HHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEc--CCCc--eeEee-cccCCHHHHHHHHHH Q lcl|NC_019404. 209 TQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDA--ESEE--YSVLN-SDIGGIDAFLDKKFD 283 (418) Q Consensus 209 ~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~--~~e~--~~~~~-~~~~gl~~~~~~~~~ 283 (418) .....-++.+...+.|... ++........+. ...+.+.. +++. +.+++ .++.+..+.++.... T Consensus 222 ~~~~e~~a~pqr~i~G~d~---d~~~~~~~~~~~---------~~i~~~~~de~~~~~~v~q~~~~~l~~~~~~l~~~~~ 289 (422) T protein:vir:97 222 EVTAEFYSFPQKYVLGMDP---DAKPMEKWRATV---------STLLEISKDEDGDKPTVGQFTTASMAPFMEHLKMYAS 289 (422) T ss_pred HHHHHHhcchhhhhcccCc---ccccCchhhhhh---------hhhhccCCCCCCCcceeeecCCCChhHHHHHHHHHHH Confidence 7777777777777665421 121111111111 12222322 2222 32222 456677888999999 Q ss_pred HHhhhhcCCeeeeeccCccccccchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------CCceEEeC Q lcl|NC_019404. 284 RIVALSGIHEIILKNKNVGGLSSSQNT---ALETFHKLIDRKRNAELLPILEFLIPFIVNA-----------EEWSVEFS 349 (418) Q Consensus 284 ~iaaas~IP~t~L~G~s~~gl~stge~---d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~-----------~~~~~~f~ 349 (418) ++|+.+++|...|.|++. -++|++. ........++.+|+. ..+.++++..+++.- .++.+.|. T Consensus 290 ~~a~~s~lP~~~lg~~~~--NpsSa~Ai~a~~~~L~~ka~~k~~~-fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~~w~ 366 (422) T protein:vir:97 290 LFAGGSGLTLDDLGFPSD--NPSSVESIKAAHENLRAAGRKAQRS-FSSGFLNVAYIAVCLRDEFPYLRNQFMDTVIKWE 366 (422) T ss_pred HHhcccCCCHHHhccccC--chhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCcccchhhccceEEEc Confidence 999999999988766552 1234443 334455566666654 577888877775421 14679999 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhC--CCCCHHHHHHHHHhhcCcCCCChhhccccc-ccCCC Q lcl|NC_019404. 350 PLDHESSKDKAEVLEKSVNSIAALIAA--GAMDIKEARDTLRTIAPEIKIGDNDIQTEE-SELIT 411 (418) Q Consensus 350 pL~~~~eke~ae~~~~~a~a~~~~~~~--g~i~~~e~r~~l~~~~~~~~~~~~~~~~~e-~~~~~ 411 (418) |....+..+.|++ |+++.+++++ |+.+.+.+++.|. ++.. +..++.-+ -+.++ T Consensus 367 p~~~~~~~s~a~~----aDa~~Kl~~a~~~~~~~~~~~~~lg----~~~~-~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 367 PLFEADANMLTLV----GDGAIKLNQAIPGFMDADVIRDLTG----VKGA-DKPIPAITEVTTDG 422 (422) T ss_pred cCCCCChHHHHHH----HHHHHHHHhhccccccHHHHHHHcC----CCch-hHHHHHHHhhhccC Confidence 8887886666554 7888899888 6788887777651 1111 22222211 12222 No 130 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.45 E-value=1.5e-13 Score=90.97 Aligned_cols=356 Identities=13% Similarity=0.036 Sum_probs=191.9 Q ss_pred chhhHH--H-----HhcCCCCccccCccccCCHHHHHHHHH-cCCccchhhhcchhhhccCCccccCcchHHHHHHHHHH Q lcl|NC_019404. 4 TDSYAN--I-----FLGGSDGSEIYGSLQNQAPTILASLYA-DNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDD 75 (418) Q Consensus 4 ~D~~~n--~-----~~g~~~~~~~~~~~~~~~~~~l~~~Y~-~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~ 75 (418) .|.++. . ..|.. .- .+...-.+.++...++ ..++.++|||.+++-+.=+||... |. .+.+-|.+ T Consensus 1 l~~~~~r~~~~~~yY~g~~-~~---~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~~~---d~-~l~~i~~~ 72 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQH-YE---APTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFAND---DF-NVTEIFDR 72 (410) T ss_pred CCcchhhHHHHHHHhcCCC-Cc---cccchhccHHHHhHHHhhcchhHHHHHHhHhhhccccccCC---Cc-hHHHHHhh Confidence 222211 0 11111 00 0011112345554443 358899999999998888888632 22 35666777 Q ss_pred hCchHHHHHHHHhccccceEEEEEeec-CCCcccccccCCCceEEEEEeecccccccc-------------cccc----c Q lcl|NC_019404. 76 LEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRTQVKVQN-------------REEN----P 137 (418) Q Consensus 76 l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~-------------~~~d----p 137 (418) -++.....++++.+.+||.|++++.-+ ++. |+ |+++++.++...+ +..+ + T Consensus 73 N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~----~~--------i~~~sP~~~~~i~Dp~~~~~~~al~~~~~~~~~~~ 140 (410) T protein:vir:95 73 NNPDIFFDSAILSALIGSCSFVYISKGEDDE----VR--------LQVIESSNATGVIDPITGLLVEGYAVLARDDYNRP 140 (410) T ss_pred cChHHHHHHHHHHHHHhCceeEEEecCCCCc----eE--------EEEEcccceEEEEeCCCCceEEEEEEEEecCCCeE Confidence 888888999999999999999987542 222 22 2333333322111 0000 0 Q ss_pred cccccCc-ceEEEEecCCcccccccCccc---EEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 138 RNARFGK-PLTYRITTNESDMFYDVHYSR---IHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLR 213 (418) Q Consensus 138 ~s~~yg~-p~~y~i~~~~~~~~~~iH~SR---~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~ 213 (418) ..-.+.. -..|++...+.... .-|+-- +++|..++ .....||.|.+.+++.+.++++.+++........ T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~g~vPvV~f~n~~------~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e 213 (410) T protein:vir:95 141 TLEAYFEPNATHFIPKDGEPYS-VTNETGIPLLVPVIHRP------DAVRPFGRSRITRAGMYYQKYAKRTLERADITAE 213 (410) T ss_pred EEEEEEeCCcEEEEeeCCcccc-ccCCCCCcceEEecccc------cCCccCCccccchhHHHHHHHHHHHHHHHHHHHH Confidence 0111111 12223322221111 113221 22332221 2245689998877788888999999888777777 Q ss_pred HcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCC--C--ceeEee-cccCCHHHHHHHHHHHHhhh Q lcl|NC_019404. 214 RKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAES--E--EYSVLN-SDIGGIDAFLDKKFDRIVAL 288 (418) Q Consensus 214 ~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~--e--~~~~~~-~~~~gl~~~~~~~~~~iaaa 288 (418) -++.+...+.|+.. ++.+....... ....+.+..++ + ++-+++ .++.+.-+.++....++|+. T Consensus 214 ~~a~pqr~i~G~d~---d~~~~~~~~~~---------~~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~ 281 (410) T protein:vir:95 214 FYSWPQKYILGLDP---DAEPMEKWKAT---------VSSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGE 281 (410) T ss_pred HhcchhheeeccCC---CCCcCchhhhh---------hhhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHHHHhhh Confidence 77777776666421 11111111111 11223332221 1 232222 35667778889999999999 Q ss_pred hcCCeeeeeccCccccccchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--------c---CCceEEeCCCCCC Q lcl|NC_019404. 289 SGIHEIILKNKNVGGLSSSQNT---ALETFHKLIDRKRNAELLPILEFLIPFIVN--------A---EEWSVEFSPLDHE 354 (418) Q Consensus 289 s~IP~t~L~G~s~~gl~stge~---d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~--------~---~~~~~~f~pL~~~ 354 (418) +++|...|.|++.. .+|++. ........++++|+. ..+.++++..+.+. + .+..+.|.|+..+ T Consensus 282 s~lP~~~lg~~~~N--psSa~Al~a~~~~L~~ka~~k~~~-fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~ 358 (410) T protein:vir:95 282 MGLTLDDLGFVSDN--PSSVEAIKASHENLRLAGRKAQRS-LGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEA 358 (410) T ss_pred cCCCHHHhccccCc--hhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCCc Confidence 99999888765521 233332 334456666677654 57777777776532 1 1356789987766 Q ss_pred CHHHHHHHHHHHHHHHHHHHhC--CCCCHHHHHHHHHhhcCcCCCChhhc---ccccccCCCc Q lcl|NC_019404. 355 SSKDKAEVLEKSVNSIAALIAA--GAMDIKEARDTLRTIAPEIKIGDNDI---QTEESELITE 412 (418) Q Consensus 355 ~eke~ae~~~~~a~a~~~~~~~--g~i~~~e~r~~l~~~~~~~~~~~~~~---~~~e~~~~~e 412 (418) +-...| ..|+++.+++++ |+++.+.+++.| |++++++ ..+|....+| T Consensus 359 ~~~s~a----~~aDa~~Kl~~a~~g~~~~~~~~~~l-------g~~~~~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 359 DANTMT----MIGDGVVKLNQALPGYINAETIRDLT-------GIAGDMSAKPVVSEGGSNGE 410 (410) T ss_pred chhhHH----HHHHHHHHHHHhccCCccHHHHHHhc-------CCChHHHHHHHHHHHHhCCC Confidence 544443 358888888887 677777777655 2333332 2233444444 No 131 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.45 E-value=2.3e-13 Score=89.87 Aligned_cols=372 Identities=11% Similarity=0.003 Sum_probs=185.2 Q ss_pred CccchhhHHHHhcCCCCcc---cc-CccccC------CHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH-HHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSE---IY-GSLQNQ------APTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE-PAF 69 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~---~~-~~~~~~------~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-~~i 69 (418) +-+.+-+..-..|-+.--. .. ...... ...........+++++.||+..+..++.+++.+.+++++ ... T Consensus 21 ~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~~~~~~~~~ 100 (471) T protein:vir:10 21 VSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKKAYALTYPPTFDVDDKKVNDM 100 (471) T ss_pred HHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhhhhhcccCceeccCChHHHHH Confidence 1112222222223221000 00 000000 000000011246899999999999999999999887654 233 Q ss_pred HHHHHHhCchHHHHHHHHhccccceEEEEEeec--CCCcccccccCCCceEEEEEeeccccccccccccccccc------ Q lcl|NC_019404. 70 WSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK--DNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNAR------ 141 (418) Q Consensus 70 ~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~--d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~------ 141 (418) .+.+.+-++.....++.+.+..||.|++++.++ ++... +.++++.++-|.+-......+. T Consensus 101 l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~~------------~~~~~p~~~~~i~d~~~~~~~~~~ir~~ 168 (471) T protein:vir:10 101 IVDVLGDDYERISKQLCVNAGNAGIAWLHVWKDASDNSFR------------YACVDSKEVIPIYSKSLDKKSIGVLRVY 168 (471) T ss_pred HHHHHhcCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCeeE------------EEEEcccceEEEEcCCCCCceEEEEEEE Confidence 344444577888899999999999999988774 22211 2223333222211000000000 Q ss_pred ----------------cCcceEEEEecCCcccccc-----------------cCcccEEEecCccchhhhhhccccCCcc Q lcl|NC_019404. 142 ----------------FGKPLTYRITTNESDMFYD-----------------VHYSRIHIIDGERVPNAMRRQNDGWGRS 188 (418) Q Consensus 142 ----------------yg~p~~y~i~~~~~~~~~~-----------------iH~SR~i~~~g~~lp~~~~~~~~~~G~S 188 (418) |..-..|.+...+...... ......-+.-| .+|. ....++.+|.| T Consensus 169 ~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~iPv-v~~~n~~~~~s 246 (471) T protein:vir:10 169 SSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFG-LVPF-IPFKNNEIETN 246 (471) T ss_pred EeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCC-ceeE-EEeccCCCCCC Confidence 0000111111100000000 00000000000 0111 11233456888 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC----CCce Q lcl|NC_019404. 189 VLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE----SEEY 264 (418) Q Consensus 189 ~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~----~e~~ 264 (418) .++ .+.+.+.+++.+....+..+..++...+.+.+.. +.........+ ...+.+.+... +.++ T Consensus 247 d~e-~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~-----~~~~~~~~~~~-------~~~~~i~~~~~~~~~~~~~ 313 (471) T protein:vir:10 247 DLK-PIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYG-----GQDKQEFLEDL-------KRYKMIKMDNDGMGDQSGV 313 (471) T ss_pred chH-HHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCC-----ccccchhHHHh-------hcCCeEEecCCCCccCccc Confidence 886 4889999999999999988888888877777631 11111111111 11233333322 1234 Q ss_pred e--EeecccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019404. 265 S--VLNSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLI--DRKRNAELLPILEFLIPFIVN 340 (418) Q Consensus 265 ~--~~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I--~~~Qe~~l~p~l~~l~~~i~~ 340 (418) + ..+.+..+.+..++.+.+.|...+++|-.-..+ .| |+||..-...|.... ...++..++..+.+++.+++. T Consensus 314 ~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~---~g-n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 389 (471) T protein:vir:10 314 TTIAIDIPTEARNLILERTKKQIFISGQGVNPETDK---LG-NSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILK 389 (471) T ss_pred eEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCccc---cc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4 444566789999999999999999999753332 23 456665433333322 233345678888888877753 Q ss_pred ------cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHH-------------HHhhcCcCCCChhh Q lcl|NC_019404. 341 ------AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDT-------------LRTIAPEIKIGDND 401 (418) Q Consensus 341 ------~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~-------------l~~~~~~~~~~~~~ 401 (418) ..++.+.|++....+++|.+++..+.+ |+||.+.+.+. +++-.....-...+ T Consensus 390 ~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~---------g~iS~et~~~~~p~v~D~~~E~eri~~E~~~~~~~~~~ 460 (471) T protein:vir:10 390 HLGLSDKLKIKQTWTRNSINNDTEMAQVVSTLA---------TITSRENVAKSNPIVEDWQDELRLQKAEQEGRSEKLYD 460 (471) T ss_pred HhccCCCceeEEEeCCCCCCCHHHHHHHHHHHh---------ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccc Confidence 236889999999999999988755543 45555444432 22110000000111 Q ss_pred cccccccCCCccc Q lcl|NC_019404. 402 IQTEESELITETE 414 (418) Q Consensus 402 ~~~~e~~~~~e~e 414 (418) +.+.+.+ +|-| T Consensus 461 ~~~~~~~--~e~~ 471 (471) T protein:vir:10 461 MEEVEHE--SEVE 471 (471) T ss_pred cCCCCCc--cccC Confidence 2222222 2222 No 132 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.44 E-value=3.3e-14 Score=94.47 Aligned_cols=389 Identities=14% Similarity=0.071 Sum_probs=180.3 Q ss_pred Cccchhh-----------------------HHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCC Q lcl|NC_019404. 1 MVKTDSY-----------------------ANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAG 57 (418) Q Consensus 1 ~~~~D~~-----------------------~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~ 57 (418) |...+.+ .--..|-.. ....+.. .+.+.......+.++++|||..++-+.-.| T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~---i~~~~~~-~~~~~~~~~~~~n~~~~ivd~~a~~l~~~G 76 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNELKSSKAYYDAERR---PDAIGLA-VPLDMRKYLAHVGYPRTYVDAIAERQELEG 76 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc---hhhcCcc-cchhhhhhhhhcchHHHHHHHHHHhhhccc Confidence 1111111 111122111 0001111 133333444568899999999999887777 Q ss_pred ccccC--------cchH---HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecc Q lcl|NC_019404. 58 FHIDG--------IDDE---PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRT 126 (418) Q Consensus 58 ~~i~~--------~~d~---~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~ 126 (418) |.+-. .++. ..+.+-|++-++.....++.+.+.+||.|++++....+. ....+..+. .-|+++++. T Consensus 77 f~~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~--~~~~~~~~~-~~i~~~~p~ 153 (488) T protein:vir:23 77 FRIPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPE--VDFDVDPEV-PLIRVEPPT 153 (488) T ss_pred eeccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcc--cccCCCCCc-ceEEEeccc Confidence 76532 1222 345666777788899999999999999999888653211 111111111 123444444 Q ss_pred ccccccccc--cccc-------cccCcce---------EEEEecCCccc--c-cccCccc---EEEecCccchhhhhhcc Q lcl|NC_019404. 127 QVKVQNREE--NPRN-------ARFGKPL---------TYRITTNESDM--F-YDVHYSR---IHIIDGERVPNAMRRQN 182 (418) Q Consensus 127 ~i~~~~~~~--dp~s-------~~yg~p~---------~y~i~~~~~~~--~-~~iH~SR---~i~~~g~~lp~~~~~~~ 182 (418) ++.+.+-.. .+.. .+.++.. .|++....+.. . ..-|+=. |+.|.++ .... T Consensus 154 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~------~~~~ 227 (488) T protein:vir:23 154 ALYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPVIPISNR------TRLS 227 (488) T ss_pred eeEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcceEEeccc------cccC Confidence 433321100 0000 0111111 11111111110 0 0112211 1112111 1123 Q ss_pred ccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHH--HhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC Q lcl|NC_019404. 183 DGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLA--ELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE 260 (418) Q Consensus 183 ~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~--~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~ 260 (418) .+||.|.++..+.+.+.+++++....+..+..++.....+.+.. ..... .... ...+ +...+.+.+..+ T Consensus 228 ~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~-~~~~--~~~~------~~~~~~v~~~~~ 298 (488) T protein:vir:23 228 DLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGIN-AETG--QRMF------DAYMARILAFEG 298 (488) T ss_pred CcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCccccccc-cccc--chhh------hhhhhhhccCCC Confidence 45899988766777778888888877777766665555444322 11000 0000 0001 111122333334 Q ss_pred CCceeEeec---ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHH--HHHHHHHHHHHHHH Q lcl|NC_019404. 261 SEEYSVLNS---DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDR--KRNAELLPILEFLI 335 (418) Q Consensus 261 ~e~~~~~~~---~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~--~Qe~~l~p~l~~l~ 335 (418) +++....+. ++....+.++....++++.+++|...|.|.+ .. ++||+.-...+...+.. .++..+.+.|.+++ T Consensus 299 g~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~-~n-~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~ 376 (488) T protein:vir:23 299 GEGAHAEQFSAAELRNFVDALDALDRKAASYSGLPPQYLSSSS-DN-PASAEAIKAAESRLVKKVERKNKIFGGAWEQAM 376 (488) T ss_pred CCCceeEecCCCChHHHHHHHHHHHHHHhcccCCCHHHhcccc-Cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455444444 3445666677778899999999987774433 22 24565444333333222 22334677888888 Q ss_pred HHhhcc----------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHH-------------hh- Q lcl|NC_019404. 336 PFIVNA----------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLR-------------TI- 391 (418) Q Consensus 336 ~~i~~~----------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~-------------~~- 391 (418) .+++.- .++.+.|.+-..++..+.++...|.+++. .|+++.+.+++.|. +. T Consensus 377 ~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g-----~~~~s~et~~~~l~~~~d~~~~~~~~~~~~ 451 (488) T protein:vir:23 377 RLAYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANG-----AGLIPRERGWVDMGYTIVEREQMRQWLEQD 451 (488) T ss_pred HHHHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcc-----cccCCHHHHHHhCCCCchHHHHHHHHHHHH Confidence 887531 25788999888899988877655544321 11333333332210 00 Q ss_pred -----cC---cCC--CChhhcccccccCCCccccccC Q lcl|NC_019404. 392 -----AP---EIK--IGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 392 -----~~---~~~--~~~~~~~~~e~~~~~e~e~~~~ 418 (418) +. ..+ ..++...+.+.....+.|.--| T Consensus 452 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 452 QKQGLGLIGSLYGASTPEGKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred HHHHHHHHHHHhccCCCcccCCCCCCCCCCCCCCCCC Confidence 00 000 0000000000001111111122 No 133 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.44 E-value=3.4e-13 Score=88.93 Aligned_cols=383 Identities=13% Similarity=0.096 Sum_probs=186.1 Q ss_pred CccchhhHH------------------HHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC Q lcl|NC_019404. 1 MVKTDSYAN------------------IFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG 62 (418) Q Consensus 1 ~~~~D~~~n------------------~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~ 62 (418) +...+-+.. -..|-+..-.... ......-...-..+.+++.||+..+..++.+++.+++ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~---~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~ 115 (511) T protein:vir:10 39 LQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELT---RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQD 115 (511) T ss_pred ccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccC---cccccccCcceeecchHHHHHHHHhhhhcccCceeec Confidence 222222222 2222221100000 0000000000123689999999999999999999987 Q ss_pred cchH--HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeeccccccccccccccc Q lcl|NC_019404. 63 IDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQNREENPRN 139 (418) Q Consensus 63 ~~d~--~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s 139 (418) +++. +.+..-+++-++.....++.+...+||.|++++..+. +... +.++++.++-+.+-+..... T Consensus 116 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~~------------i~~~~p~~~~~vydd~~~~~ 183 (511) T protein:vir:10 116 DDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDETR------------LYKSDAMSTFVIYDNTIERN 183 (511) T ss_pred CchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceE------------EEEEccceeEEEEcCCCCCc Confidence 6543 3455556666788899999999999999999887742 3221 22233332222211100000 Q ss_pred cc------------------------cCcceEEEEecCCccc--------ccccCcccEEEecCccchhhhhhccccCCc Q lcl|NC_019404. 140 AR------------------------FGKPLTYRITTNESDM--------FYDVHYSRIHIIDGERVPNAMRRQNDGWGR 187 (418) Q Consensus 140 ~~------------------------yg~p~~y~i~~~~~~~--------~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~ 187 (418) |. |..-..|++...+... ...-|+=..+.+ ....++.+|. T Consensus 184 ~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv--------v~f~nn~~g~ 255 (511) T protein:vir:10 184 SIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI--------TEFSNNERRK 255 (511) T ss_pred eEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeE--------EEecCCCCCC Confidence 11 1111112221111100 001122111111 1122345788 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHH-HHHHHH-HhcCCcceeEEEcCCCcee Q lcl|NC_019404. 188 SVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARL-RLAQVD-NNSGVGQAIGIDAESEEYS 265 (418) Q Consensus 188 S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~-r~~~~~-~~~~~~~~~~~d~~~e~~~ 265 (418) |.++ .+.+.+.+++.+....+..+..++..++.+.+.... .......... +..... .......... ..++.++. T Consensus 256 gd~e-~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~d~~ 331 (511) T protein:vir:10 256 GDYE-KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL--DPVEVRKQKEANVLFLEPTVYADSEGRE-TEGSVDGG 331 (511) T ss_pred Cchh-hhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccC--Cchhhccchhccceeccccccccccccc-CCCCccee Confidence 8886 488999999999988888888888777776653211 1111000000 000000 0000111111 11223444 Q ss_pred Ee--ecccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019404. 266 VL--NSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETF---HKLIDRKRNAELLPILEFLIPFIVN 340 (418) Q Consensus 266 ~~--~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y---~~~I~~~Qe~~l~p~l~~l~~~i~~ 340 (418) .+ +.+..+....++.+.+.|...+.+|-.-. +.. +| |.||..-...| ...++ .++..++..|++++++++. T Consensus 332 ~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~-~~~-~~-n~Sg~Al~~~~~~l~~k~~-~k~~~f~~~l~~~~~li~~ 407 (511) T protein:vir:10 332 YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD-DNF-SG-TQSGEAMKYKLFGLEQRTK-TKEGLFTKGLRRRAKLLET 407 (511) T ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCccccc-ccc-cc-cchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 44 45667899999999999999999998543 211 12 33555432222 22233 3345578888888777643 Q ss_pred c-------------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCCC-HHHHHHHHHh--------hcCcC Q lcl|NC_019404. 341 A-------------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAMD-IKEARDTLRT--------IAPEI 395 (418) Q Consensus 341 ~-------------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i~-~~e~r~~l~~--------~~~~~ 395 (418) - .++++.|++-...+..+.+++..+.+-.++. +-..+.++ +++-.+.+++ ....+ T Consensus 408 ~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~G~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~ 487 (511) T protein:vir:10 408 ILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGI 487 (511) T ss_pred HHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhc Confidence 1 2578999999999999998876665432222 12233332 2222222211 00001 Q ss_pred C-----CChhhcccccccCCCccc Q lcl|NC_019404. 396 K-----IGDNDIQTEESELITETE 414 (418) Q Consensus 396 ~-----~~~~~~~~~e~~~~~e~e 414 (418) + ..+++-++++.+...|+| T Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 488 YKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred ccCCCCCCCCCCCCcccCcccccC Confidence 1 111111112222233333 No 134 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.43 E-value=6.4e-13 Score=87.43 Aligned_cols=381 Identities=12% Similarity=0.064 Sum_probs=188.7 Q ss_pred CccchhhHHHHhcCCCCccccCccc----cCCHHHHH--HHHHcCCccchhhhcchhhhccCCccccCcchH--HHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQ----NQAPTILA--SLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PAFWSR 72 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~----~~~~~~l~--~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~i~~~ 72 (418) +-+.+-+..-..|-+.--....... ..-..... ..-..+.+++.||+..+..++-+++.+..+++. +.+.+. T Consensus 21 ~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~G~p~~~~~~d~~~~~~l~~~ 100 (470) T protein:vir:10 21 INNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVASVFPDIDVGKDADNKKIIDV 100 (470) T ss_pred HHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhheeccceeeecCchHHHHHHHHH Confidence 2222223333334322100000000 00000000 001247899999999999999999999765543 344444 Q ss_pred HHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccc---------------cc Q lcl|NC_019404. 73 WDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREE---------------NP 137 (418) Q Consensus 73 ~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~---------------dp 137 (418) +.. +....+.++.+.+..||.|++++.++... .++ +.++++.++-|.+-.. +. T Consensus 101 ~~~-~~~~~~~~l~~~~~~~G~a~~~~y~d~~~----------~~~-~~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~ 168 (470) T protein:vir:10 101 LGD-DRALTLNGLLVDSSNAGRAWLHYWIDEDG----------NFR-YGIIQPDQITPIYATTLDNKLLGILRSYKQLDP 168 (470) T ss_pred Hhh-hHHHHHHHHHHHHhhcCeeEEEEEecCCC----------ceE-EEEEcccceEEEEcCCCCCceEEEEEEEEeeec Confidence 432 56677788889999999999998874322 111 2233333222221100 00 Q ss_pred ccc-------cc--CcceEEEEecCCcccccccCcccEEEe----------------cC-ccchhhhhhccccCCcchHH Q lcl|NC_019404. 138 RNA-------RF--GKPLTYRITTNESDMFYDVHYSRIHII----------------DG-ERVPNAMRRQNDGWGRSVLS 191 (418) Q Consensus 138 ~s~-------~y--g~p~~y~i~~~~~~~~~~iH~SR~i~~----------------~g-~~lp~~~~~~~~~~G~S~l~ 191 (418) ... -| +...+|....... ..+++-+.+.. ++ ..+|. ....++.+|.|.++ T Consensus 169 ~~~~~~~~~e~yt~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv-v~~~nn~~g~sd~e 244 (470) T protein:vir:10 169 DSGKYFTVHEYWTDKEAQFFRTNATDS---TVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPF-IEFSKNKYRLPELN 244 (470) T ss_pred CCceEEEEEEEEcCCcEEEEEeecCcc---eeccccccccccccccccccccccccccCCCeeeE-EEeecCCCCCCchh Confidence 000 00 0111111110000 00000000000 00 00111 11123456888887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEc----CCCcee-- Q lcl|NC_019404. 192 SDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDA----ESEEYS-- 265 (418) Q Consensus 192 ~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~----~~e~~~-- 265 (418) .+.+.+.+|+.++...+.-+..++...+.+.+.. . .+.. +....+ ...+.+.+.. ++-+++ T Consensus 245 -~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~-~-~~~~---~~~~~~-------~~~~~i~~~~~~~~~~~~~~~l 311 (470) T protein:vir:10 245 -KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYG-G-ADLH---QFMNDL-------RKYKSIKINNTGNGDNSGVDKL 311 (470) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCC-c-cccc---hhhhhh-------hhcCeEeccCCCCCcCceeEEE Confidence 4889999999999999999999988888877532 1 1111 111111 1122333322 122344 Q ss_pred EeecccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHhhc--- Q lcl|NC_019404. 266 VLNSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLI--DRKRNAELLPILEFLIPFIVN--- 340 (418) Q Consensus 266 ~~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I--~~~Qe~~l~p~l~~l~~~i~~--- 340 (418) ....+..+....++.+.+.|...+.+|-.-.. ..| |+||..-...|.... ....+..+++.+++++.+|+. T Consensus 312 t~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~---~~g-n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~ 387 (470) T protein:vir:10 312 QIDIPVEARDDALKITRKNIFLFGQGIDPANF---ESS-NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLN 387 (470) T ss_pred eecCChHHHHHHHHHHHHHHHHHhCCCCCCcc---ccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 44456678899999999999999999975332 224 567776544444443 334455678888888887753 Q ss_pred -----cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCC-CHHHHHHHHHhhcCcCCCChhhcccccccCCC Q lcl|NC_019404. 341 -----AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAM-DIKEARDTLRTIAPEIKIGDNDIQTEESELIT 411 (418) Q Consensus 341 -----~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i-~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~ 411 (418) ..++.+.|++-...+++|.|++..+.+..++. +-..+.+ ++++..+++++--.....-.....+.+....+ T Consensus 388 ~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~~g~iS~et~l~~~p~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~d 467 (470) T protein:vir:10 388 FSDADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDKEENDPYSNQADELNGKGVN 467 (470) T ss_pred ccCcccceeeEEeccCCCCCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhccccccCCCCCC Confidence 13678999999999999998876665433222 1223322 23333333321100000000011122222222 Q ss_pred ccc Q lcl|NC_019404. 412 ETE 414 (418) Q Consensus 412 e~e 414 (418) ++| T Consensus 468 de~ 470 (470) T protein:vir:10 468 DEQ 470 (470) T ss_pred CCC Confidence 333 No 135 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.43 E-value=7.2e-13 Score=87.14 Aligned_cols=376 Identities=12% Similarity=0.069 Sum_probs=193.8 Q ss_pred ccchhhHHHHhcCCCCc---------------------ccc-Ccc------ccCCHHHHHHHH---------------Hc Q lcl|NC_019404. 2 VKTDSYANIFLGGSDGS---------------------EIY-GSL------QNQAPTILASLY---------------AD 38 (418) Q Consensus 2 ~~~D~~~n~~~g~~~~~---------------------~~~-~~~------~~~~~~~l~~~Y---------------~~ 38 (418) |++.-|...+.-..-.. .+| +.. .......+...| .. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 34333333221111000 000 000 000000111000 12 Q ss_pred CCccchhhhcchhhhccCCccccCcch---HHH----HHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccc Q lcl|NC_019404. 39 NALVRRIIDTIPETALAAGFHIDGIDD---EPA----FWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPV 111 (418) Q Consensus 39 ~~~~r~iVd~~a~d~~r~~~~i~~~~d---~~~----i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl 111 (418) +++++.||+..+..++.+++.+.++++ .++ +.+.+++-++.....++.+.+..||.|++++.++.+.. T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~----- 155 (474) T protein:vir:10 81 NSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGD----- 155 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCe----- Confidence 789999999999999999999876432 223 33344445788999999999999999999987753221 Q ss_pred cCCCceEEEEEeecccccccccc-cccc-------------------ccccCcceEEEEecCCccccc----ccCcccEE Q lcl|NC_019404. 112 REGAELETVRVYDRTQVKVQNRE-ENPR-------------------NARFGKPLTYRITTNESDMFY----DVHYSRIH 167 (418) Q Consensus 112 ~~~~~i~~i~v~~~~~i~~~~~~-~dp~-------------------s~~yg~p~~y~i~~~~~~~~~----~iH~SR~i 167 (418) + .+.++++.++-+.+-+ .+|. -..|..-..|++...+..... .-|+.-.+ T Consensus 156 -----~-~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~v 229 (474) T protein:vir:10 156 -----I-RIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYN 229 (474) T ss_pred -----e-EEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCcc Confidence 1 2333333333221100 0000 011222233333322211111 12332222 Q ss_pred EecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHH Q lcl|NC_019404. 168 IIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDN 247 (418) Q Consensus 168 ~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~ 247 (418) .+ ....++.+|.|.++. +.+.+.+++.+....+..+..++..++.+.+.. ... +..... T Consensus 230 Pv--------v~~~n~~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~-----~~~--~~~~~~----- 288 (474) T protein:vir:10 230 PL--------FGVPNNKEMIGDAEK-VIHLIDAYDLTMSDASSEISQTRLAYLVLRGMG-----MSE--EMIQET----- 288 (474) T ss_pred ce--------EEecCCCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhhccCC-----CCc--hhhhhh----- Confidence 11 122356689998874 889999999999999988888888877776531 111 111111 Q ss_pred hcCCcceeEEEcCCCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHH--HHHH Q lcl|NC_019404. 248 NSGVGQAIGIDAESEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLI--DRKR 323 (418) Q Consensus 248 ~~~~~~~~~~d~~~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I--~~~Q 323 (418) ...+.+.+..++.+++.++. +..+....++.+.+.|...+++|..-. + ..+| |.||..-...|.... ...+ T Consensus 289 --~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~-~~~~-n~Sg~Al~~~~~~l~~k~~~~ 363 (474) T protein:vir:10 289 --QKSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNS-D-EFNG-NVPIIGMKLKLMALENKCMTF 363 (474) T ss_pred --hhcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccc-c-cccc-cchHHHHHHHHHHHHHHHHHH Confidence 12345555555566776654 456788899999999999999997433 2 1223 456665444444332 2344 Q ss_pred HHHHHHHHHHHHHHhhcc-------------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCC-CHHHHHH Q lcl|NC_019404. 324 NAELLPILEFLIPFIVNA-------------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAM-DIKEARD 386 (418) Q Consensus 324 e~~l~p~l~~l~~~i~~~-------------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i-~~~e~r~ 386 (418) +..++..+++++++++.- .++++.|++-...++++.|++..+.+..++. +-..+.+ ++++..+ T Consensus 364 ~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~d~~~E~e 443 (474) T protein:vir:10 364 ERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLKGQVSERTRLGQSQLVDDVDYELD 443 (474) T ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHH Confidence 556778888877776420 2578999999999999998876654322111 1122222 2322222 Q ss_pred HHHhhcCcCCCChhhc---ccccccCCCccc Q lcl|NC_019404. 387 TLRTIAPEIKIGDNDI---QTEESELITETE 414 (418) Q Consensus 387 ~l~~~~~~~~~~~~~~---~~~e~~~~~e~e 414 (418) .+++-.....-...+. ..++.+...++| T Consensus 444 ri~~E~~e~~~~~~~~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:10 444 EMEKESLEFNDKLPDIDEGDANDKSQNNQSE 474 (474) T ss_pred HHHHHHHHHHhhcccccCCCcCCCCccccCC Confidence 2211000000000000 111111222333 No 136 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.43 E-value=7.2e-13 Score=87.14 Aligned_cols=376 Identities=12% Similarity=0.069 Sum_probs=193.8 Q ss_pred ccchhhHHHHhcCCCCc---------------------ccc-Ccc------ccCCHHHHHHHH---------------Hc Q lcl|NC_019404. 2 VKTDSYANIFLGGSDGS---------------------EIY-GSL------QNQAPTILASLY---------------AD 38 (418) Q Consensus 2 ~~~D~~~n~~~g~~~~~---------------------~~~-~~~------~~~~~~~l~~~Y---------------~~ 38 (418) |++.-|...+.-..-.. .+| +.. .......+...| .. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 34333333221111000 000 000 000000111000 12 Q ss_pred CCccchhhhcchhhhccCCccccCcch---HHH----HHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccc Q lcl|NC_019404. 39 NALVRRIIDTIPETALAAGFHIDGIDD---EPA----FWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPV 111 (418) Q Consensus 39 ~~~~r~iVd~~a~d~~r~~~~i~~~~d---~~~----i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl 111 (418) +++++.||+..+..++.+++.+.++++ .++ +.+.+++-++.....++.+.+..||.|++++.++.+.. T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~----- 155 (474) T protein:vir:94 81 NSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGD----- 155 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCe----- Confidence 789999999999999999999876432 223 33344445788999999999999999999987753221 Q ss_pred cCCCceEEEEEeecccccccccc-cccc-------------------ccccCcceEEEEecCCccccc----ccCcccEE Q lcl|NC_019404. 112 REGAELETVRVYDRTQVKVQNRE-ENPR-------------------NARFGKPLTYRITTNESDMFY----DVHYSRIH 167 (418) Q Consensus 112 ~~~~~i~~i~v~~~~~i~~~~~~-~dp~-------------------s~~yg~p~~y~i~~~~~~~~~----~iH~SR~i 167 (418) + .+.++++.++-+.+-+ .+|. -..|..-..|++...+..... .-|+.-.+ T Consensus 156 -----~-~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~v 229 (474) T protein:vir:94 156 -----I-RIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYN 229 (474) T ss_pred -----e-EEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCcc Confidence 1 2333333333221100 0000 011222233333322211111 12332222 Q ss_pred EecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHH Q lcl|NC_019404. 168 IIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDN 247 (418) Q Consensus 168 ~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~ 247 (418) .+ ....++.+|.|.++. +.+.+.+++.+....+..+..++..++.+.+.. ... +..... T Consensus 230 Pv--------v~~~n~~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~-----~~~--~~~~~~----- 288 (474) T protein:vir:94 230 PL--------FGVPNNKEMIGDAEK-VIHLIDAYDLTMSDASSEISQTRLAYLVLRGMG-----MSE--EMIQET----- 288 (474) T ss_pred ce--------EEecCCCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhhccCC-----CCc--hhhhhh----- Confidence 11 122356689998874 889999999999999988888888877776531 111 111111 Q ss_pred hcCCcceeEEEcCCCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHH--HHHH Q lcl|NC_019404. 248 NSGVGQAIGIDAESEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLI--DRKR 323 (418) Q Consensus 248 ~~~~~~~~~~d~~~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I--~~~Q 323 (418) ...+.+.+..++.+++.++. +..+....++.+.+.|...+++|..-. + ..+| |.||..-...|.... ...+ T Consensus 289 --~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~-~~~~-n~Sg~Al~~~~~~l~~k~~~~ 363 (474) T protein:vir:94 289 --QKSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNS-D-EFNG-NVPIIGMKLKLMALENKCMTF 363 (474) T ss_pred --hhcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccc-c-cccc-cchHHHHHHHHHHHHHHHHHH Confidence 12345555555566776654 456788899999999999999997433 2 1223 456665444444332 2344 Q ss_pred HHHHHHHHHHHHHHhhcc-------------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCC-CHHHHHH Q lcl|NC_019404. 324 NAELLPILEFLIPFIVNA-------------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAM-DIKEARD 386 (418) Q Consensus 324 e~~l~p~l~~l~~~i~~~-------------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i-~~~e~r~ 386 (418) +..++..+++++++++.- .++++.|++-...++++.|++..+.+..++. +-..+.+ ++++..+ T Consensus 364 ~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~d~~~E~e 443 (474) T protein:vir:94 364 ERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLKGQVSERTRLGQSQLVDDVDYELD 443 (474) T ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHH Confidence 556778888877776420 2578999999999999998876654322111 1122222 2322222 Q ss_pred HHHhhcCcCCCChhhc---ccccccCCCccc Q lcl|NC_019404. 387 TLRTIAPEIKIGDNDI---QTEESELITETE 414 (418) Q Consensus 387 ~l~~~~~~~~~~~~~~---~~~e~~~~~e~e 414 (418) .+++-.....-...+. ..++.+...++| T Consensus 444 ri~~E~~e~~~~~~~~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:94 444 EMEKESLEFNDKLPDIDEGDANDKSQNNQSE 474 (474) T ss_pred HHHHHHHHHHhhcccccCCCcCCCCccccCC Confidence 2211000000000000 111111222333 No 137 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.43 E-value=7.2e-13 Score=87.16 Aligned_cols=384 Identities=13% Similarity=0.092 Sum_probs=186.7 Q ss_pred Cc--------------------------cchhhHHHHhcCCCCccccCccccCCHHHHH-HHHHcCCccchhhhcchhhh Q lcl|NC_019404. 1 MV--------------------------KTDSYANIFLGGSDGSEIYGSLQNQAPTILA-SLYADNALVRRIIDTIPETA 53 (418) Q Consensus 1 ~~--------------------------~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~-~~Y~~~~~~r~iVd~~a~d~ 53 (418) |- |.+-+..-..|-+..-.... .. ..... ..-..+++++.||+..+..+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~---~~-~~~~~~~~ki~~n~~k~Ivd~~~~yl 106 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELT---RR-KEEYMADNRVAHDYASYISDFINGYF 106 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccC---cc-cccccCcceeecchHHHHHHHHhhhh Confidence 10 01111111222221000000 00 00000 00123688999999999999 Q ss_pred ccCCccccCcchH--HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeecccccc Q lcl|NC_019404. 54 LAAGFHIDGIDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKV 130 (418) Q Consensus 54 ~r~~~~i~~~~d~--~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~ 130 (418) +.+++.++++++. +.+.+-+++-++...+.++.+...+||.|++++..+. +.. .+.++++.++.+ T Consensus 107 ~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~------------~i~~~~p~~~~~ 174 (512) T protein:vir:97 107 LGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET------------RLYKSDAMSTFV 174 (512) T ss_pred cccCceeccCChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCce------------EEEEEcccceEE Confidence 9999999876543 3566666667889999999999999999999987742 221 123333333322 Q ss_pred ccccccccccccC---------------cc---------eEEEEecCCcccc--------cccCcccEEEecCccchhhh Q lcl|NC_019404. 131 QNREENPRNARFG---------------KP---------LTYRITTNESDMF--------YDVHYSRIHIIDGERVPNAM 178 (418) Q Consensus 131 ~~~~~dp~s~~yg---------------~p---------~~y~i~~~~~~~~--------~~iH~SR~i~~~g~~lp~~~ 178 (418) .+-+.....|.++ .. ..|++...+.... ..-|+-..+ |. . T Consensus 175 iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~v-------Pv-v 246 (512) T protein:vir:97 175 IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERM-------PI-T 246 (512) T ss_pred EEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCccc-------ce-E Confidence 2110000011110 00 1122211111000 001221111 11 1 Q ss_pred hhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHH-HHHHHHHHHhcCCcceeEE Q lcl|NC_019404. 179 RRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAA-RLRLAQVDNNSGVGQAIGI 257 (418) Q Consensus 179 ~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~-~~r~~~~~~~~~~~~~~~~ 257 (418) ...++.+|.|.++ .+.+.+.+++.+....+.-+..++..++.+.+.... ........ ..+...............+ T Consensus 247 ~~~nn~~~~gd~e-~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (512) T protein:vir:97 247 EFSNNERRKGDYE-KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL--DPVEVRKQKEANVLFLEPTVYENRDTGI 323 (512) T ss_pred eecCCCCCCCchh-hhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccC--Cchhhhhhhhcccccccccchhhccccc Confidence 1233557888887 488999999999988888888888777776653211 11110000 0000000000001111111 Q ss_pred E-cCCCceeE--eecccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHH--HHHHHHHHHHHHHH Q lcl|NC_019404. 258 D-AESEEYSV--LNSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKL--IDRKRNAELLPILE 332 (418) Q Consensus 258 d-~~~e~~~~--~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~--I~~~Qe~~l~p~l~ 332 (418) . .++.++.. .+.+..+....++.+.+.|...+++|-.-. +.. +| |.||+.-...|... -...++..++..|+ T Consensus 324 ~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~-~~~-~g-n~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~ 400 (512) T protein:vir:97 324 ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD-DNF-SG-TQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 400 (512) T ss_pred CCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCc-ccc-cc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 12233444 445677899999999999999999998543 211 22 34565433222222 12334456788888 Q ss_pred HHHHHhhcc-------------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCCC-HHHHHHHHHh----- Q lcl|NC_019404. 333 FLIPFIVNA-------------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAMD-IKEARDTLRT----- 390 (418) Q Consensus 333 ~l~~~i~~~-------------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i~-~~e~r~~l~~----- 390 (418) +++.+++.- .++++.|++-...+..+.+++..+.+..++. +-..+.++ +++..+++++ T Consensus 401 ~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~ 480 (512) T protein:vir:97 401 RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 480 (512) T ss_pred HHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH Confidence 887776421 1578999999999999988876665433222 12233332 3332232211 Q ss_pred ---hc--CcCCC---ChhhcccccccCCCccc Q lcl|NC_019404. 391 ---IA--PEIKI---GDNDIQTEESELITETE 414 (418) Q Consensus 391 ---~~--~~~~~---~~~~~~~~e~~~~~e~e 414 (418) .. .+... .+++=++...+...|+| T Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 481 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred HHHHhhcccCCCCCCCCCCCCCCccccccccC Confidence 00 00001 11111111122223333 No 138 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.43 E-value=3.4e-13 Score=88.91 Aligned_cols=382 Identities=12% Similarity=0.091 Sum_probs=185.3 Q ss_pred Cccchh------------------hHHHHhcCCCCccccCccccCCHHHHH-HHHHcCCccchhhhcchhhhccCCcccc Q lcl|NC_019404. 1 MVKTDS------------------YANIFLGGSDGSEIYGSLQNQAPTILA-SLYADNALVRRIIDTIPETALAAGFHID 61 (418) Q Consensus 1 ~~~~D~------------------~~n~~~g~~~~~~~~~~~~~~~~~~l~-~~Y~~~~~~r~iVd~~a~d~~r~~~~i~ 61 (418) ++..+- +..-+.|-+.--..... ...... ..-..+.+++.||+..+..++.+++.++ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~----~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~ 114 (511) T protein:vir:96 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR----RKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQ 114 (511) T ss_pred hccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCc----CcccccCcceeecchHHHHHHHHHhhhccCCceee Confidence 111111 11222232211000000 000000 0012368999999999999999999998 Q ss_pred CcchH--HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeecccccccccccccc Q lcl|NC_019404. 62 GIDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQNREENPR 138 (418) Q Consensus 62 ~~~d~--~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~ 138 (418) ++++. +.+.+-+++-++...+.++.+...+||.|++++..+. +.. .+.++++.++-+.+-+.... T Consensus 115 ~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~------------~i~~~~p~~~~~vydd~~~~ 182 (511) T protein:vir:96 115 DDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET------------RLYKSDAMSTFVIYDNTIER 182 (511) T ss_pred cCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCce------------EEEEEccceeEEEEcCCCCC Confidence 76543 4566667777899999999999999999999887742 321 12333333332221111111 Q ss_pred cccc---------------Ccce---------EEEEecCCccc--------ccccCcccEEEecCccchhhhhhccccCC Q lcl|NC_019404. 139 NARF---------------GKPL---------TYRITTNESDM--------FYDVHYSRIHIIDGERVPNAMRRQNDGWG 186 (418) Q Consensus 139 s~~y---------------g~p~---------~y~i~~~~~~~--------~~~iH~SR~i~~~g~~lp~~~~~~~~~~G 186 (418) .|.+ +... .|++...++.. ...-|+-..+.+ ....++.+| T Consensus 183 ~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv--------v~~~nn~~g 254 (511) T protein:vir:96 183 NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI--------TEFSNNERR 254 (511) T ss_pred ceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceee--------EEecCCCCC Confidence 1111 1111 11111111100 001122111111 112234578 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHH-HHHHHHH-HhcCCcceeEEEcCCCce Q lcl|NC_019404. 187 RSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAAR-LRLAQVD-NNSGVGQAIGIDAESEEY 264 (418) Q Consensus 187 ~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~-~r~~~~~-~~~~~~~~~~~d~~~e~~ 264 (418) .|.++ .+.+.+.+++.+....+..+..++..++.+.+.... + ........ .+..... .........- ...+.++ T Consensus 255 ~gd~e-~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 330 (511) T protein:vir:96 255 KGDYE-KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL-D-PVEVRKQKEANVLFLEPTVYADSEGRE-TEGSVDG 330 (511) T ss_pred CCchh-hhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccC-C-chhhcccccccceeccccccccccccc-CCCCcce Confidence 88887 488999999999999998888888777776653211 0 00000000 0000000 0000111111 1122344 Q ss_pred eEe--ecccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019404. 265 SVL--NSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETF---HKLIDRKRNAELLPILEFLIPFIV 339 (418) Q Consensus 265 ~~~--~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y---~~~I~~~Qe~~l~p~l~~l~~~i~ 339 (418) ..+ +.+..+....++.+.+.|...+.+|-.-..+. +| |.||..-...| ...++. ++..++..+++++++|+ T Consensus 331 ~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~--~~-n~Sg~Al~~~~~~l~~k~~~-k~~~~~~~l~~~~~li~ 406 (511) T protein:vir:96 331 GYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF--SG-TQSGEAMKYKLFGLEQRTKT-KEGLFTKGLRRRAKLLE 406 (511) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc--cc-cchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Confidence 444 45567899999999999999999998544221 12 34555433222 223333 34457788888777764 Q ss_pred cc-------------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCCC-HHHHHHHHHh--------hcCc Q lcl|NC_019404. 340 NA-------------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAMD-IKEARDTLRT--------IAPE 394 (418) Q Consensus 340 ~~-------------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i~-~~e~r~~l~~--------~~~~ 394 (418) .. .++++.|++-...+.++.+++..+.+-.++. +-..+.++ +++..+.+.+ .... T Consensus 407 ~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~G~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~ 486 (511) T protein:vir:96 407 TILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKG 486 (511) T ss_pred HHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhc Confidence 21 2578999999999999988875554332221 11222222 2222222211 0000 Q ss_pred CC-----CChhhcccccccCCCccc Q lcl|NC_019404. 395 IK-----IGDNDIQTEESELITETE 414 (418) Q Consensus 395 ~~-----~~~~~~~~~e~~~~~e~e 414 (418) ++ ..+++=++++.+...|+| T Consensus 487 ~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 487 IYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred cccCCCCCCCCCCCCcccccccccC Confidence 11 011111111122223333 No 139 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.43 E-value=4.2e-13 Score=88.44 Aligned_cols=390 Identities=12% Similarity=0.071 Sum_probs=183.9 Q ss_pred Cccc------------------hhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC Q lcl|NC_019404. 1 MVKT------------------DSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG 62 (418) Q Consensus 1 ~~~~------------------D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~ 62 (418) +... +-+..-+.|-+..-.... ......-...-..+.+++.||+..+..++.+++.+++ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~---~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~ 115 (511) T protein:vir:96 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELT---RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQD 115 (511) T ss_pred hcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccC---cccccccCcceeecchHHHHHHHHhhhhcccCceeec Confidence 1111 111222223221100000 0000000000123588999999999999999999987 Q ss_pred cchH--HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccccCCCceEEEEEeeccccccccccccccc Q lcl|NC_019404. 63 IDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRTQVKVQNREENPRN 139 (418) Q Consensus 63 ~~d~--~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s 139 (418) +++. +.+.+-+++-++.....++.+...+||.|++++..+ ++.. .+.++++.++-+.+-+..... T Consensus 116 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~------------~i~~~~p~~~~~v~dd~~~~~ 183 (511) T protein:vir:96 116 DDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET------------RLYKSDAMSTFIIYDNTVERN 183 (511) T ss_pred CchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCce------------EEEEEcccceEEEEcCCCCCc Confidence 6543 456666777788899999999999999999988774 2322 122333333322211100001 Q ss_pred cccCcceEEEEecCCcccc-----c-ccCcccEEEecCc--------------------cchhhhhhccccCCcchHHHH Q lcl|NC_019404. 140 ARFGKPLTYRITTNESDMF-----Y-DVHYSRIHIIDGE--------------------RVPNAMRRQNDGWGRSVLSSD 193 (418) Q Consensus 140 ~~yg~p~~y~i~~~~~~~~-----~-~iH~SR~i~~~g~--------------------~lp~~~~~~~~~~G~S~l~~~ 193 (418) |.++- .+|.......... . -+.+.++.+|... .+|. ....++.+|.|.++. T Consensus 184 ~~~~v-r~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv-v~~~n~~~g~gd~e~- 260 (511) T protein:vir:96 184 SIAGV-RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPI-TEFSNNERRKGDYEK- 260 (511) T ss_pred eEEEE-EEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccce-EEecCCCCCCCchhh- Confidence 11110 1111110000000 0 0111122221100 0111 112234578888874 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHH-HHHHHHHHhcC-CcceeEEEcCCCcee--Eeec Q lcl|NC_019404. 194 ILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAAR-LRLAQVDNNSG-VGQAIGIDAESEEYS--VLNS 269 (418) Q Consensus 194 ~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~-~r~~~~~~~~~-~~~~~~~d~~~e~~~--~~~~ 269 (418) +.+.+.+++.+....+..+..++..++.+.+.... ......... .+........- .....- ..++.+.. ..+. T Consensus 261 v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~ 337 (511) T protein:vir:96 261 VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL--DPVEVRKQKEANVLFLEPTVYVDAEGRE-TEGSVDGGYIYKQY 337 (511) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhcchhheecCccC--Cchhhcccccccceeccccceecccccc-CCCCcceeEEeecC Confidence 88999999999888888888777776666542211 011000000 00000000000 000000 01122343 3445 Q ss_pred ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHhhc------- Q lcl|NC_019404. 270 DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLI--DRKRNAELLPILEFLIPFIVN------- 340 (418) Q Consensus 270 ~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I--~~~Qe~~l~p~l~~l~~~i~~------- 340 (418) +.++....++.+.+.|...+++|..-. +... | |.||+.-...|.... ...++..++..+++++.+++. T Consensus 338 ~~~~~e~~~~~L~~~I~~~s~~P~~~~-~~~~-~-n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~ 414 (511) T protein:vir:96 338 DVQGTEAYKDRLNSDIHMFTNTPNMKD-DNFS-G-TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRS 414 (511) T ss_pred CHHHHHHHHHHHHHHHHHHhCCccccc-cccc-c-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 667899999999999999999997533 2222 2 345654333333222 223344577788887777632 Q ss_pred ---c---CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCCC-HHHHHHHHHh--------hcCcCCCChhhc Q lcl|NC_019404. 341 ---A---EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAMD-IKEARDTLRT--------IAPEIKIGDNDI 402 (418) Q Consensus 341 ---~---~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i~-~~e~r~~l~~--------~~~~~~~~~~~~ 402 (418) . .++++.|++-...+.++.+++..+.+.+++. +-..+.++ +++..+++.+ ....++....+. T Consensus 415 ~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~ 494 (511) T protein:vir:96 415 IDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDI 494 (511) T ss_pred CccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCC Confidence 1 2578999999999999988876655433222 12233332 2332332221 001111111111 Q ss_pred ----cc-ccccCCCccc Q lcl|NC_019404. 403 ----QT-EESELITETE 414 (418) Q Consensus 403 ----~~-~e~~~~~e~e 414 (418) ++ +..+...|+| T Consensus 495 ~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 495 NDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCccCcccccC Confidence 11 1112223333 No 140 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.43 E-value=4.2e-13 Score=88.44 Aligned_cols=390 Identities=12% Similarity=0.071 Sum_probs=183.9 Q ss_pred Cccc------------------hhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC Q lcl|NC_019404. 1 MVKT------------------DSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG 62 (418) Q Consensus 1 ~~~~------------------D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~ 62 (418) +... +-+..-+.|-+..-.... ......-...-..+.+++.||+..+..++.+++.+++ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~---~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~ 115 (511) T protein:vir:78 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELT---RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQD 115 (511) T ss_pred hcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccC---cccccccCcceeecchHHHHHHHHhhhhcccCceeec Confidence 1111 111222223221100000 0000000000123588999999999999999999987 Q ss_pred cchH--HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccccCCCceEEEEEeeccccccccccccccc Q lcl|NC_019404. 63 IDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRTQVKVQNREENPRN 139 (418) Q Consensus 63 ~~d~--~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s 139 (418) +++. +.+.+-+++-++.....++.+...+||.|++++..+ ++.. .+.++++.++-+.+-+..... T Consensus 116 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~------------~i~~~~p~~~~~v~dd~~~~~ 183 (511) T protein:vir:78 116 DDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET------------RLYKSDAMSTFIIYDNTVERN 183 (511) T ss_pred CchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCce------------EEEEEcccceEEEEcCCCCCc Confidence 6543 456666777788899999999999999999988774 2322 122333333322211100001 Q ss_pred cccCcceEEEEecCCcccc-----c-ccCcccEEEecCc--------------------cchhhhhhccccCCcchHHHH Q lcl|NC_019404. 140 ARFGKPLTYRITTNESDMF-----Y-DVHYSRIHIIDGE--------------------RVPNAMRRQNDGWGRSVLSSD 193 (418) Q Consensus 140 ~~yg~p~~y~i~~~~~~~~-----~-~iH~SR~i~~~g~--------------------~lp~~~~~~~~~~G~S~l~~~ 193 (418) |.++- .+|.......... . -+.+.++.+|... .+|. ....++.+|.|.++. T Consensus 184 ~~~~v-r~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv-v~~~n~~~g~gd~e~- 260 (511) T protein:vir:78 184 SIAGV-RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPI-TEFSNNERRKGDYEK- 260 (511) T ss_pred eEEEE-EEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccce-EEecCCCCCCCchhh- Confidence 11110 1111110000000 0 0111122221100 0111 112234578888874 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHH-HHHHHHHHhcC-CcceeEEEcCCCcee--Eeec Q lcl|NC_019404. 194 ILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAAR-LRLAQVDNNSG-VGQAIGIDAESEEYS--VLNS 269 (418) Q Consensus 194 ~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~-~r~~~~~~~~~-~~~~~~~d~~~e~~~--~~~~ 269 (418) +.+.+.+++.+....+..+..++..++.+.+.... ......... .+........- .....- ..++.+.. ..+. T Consensus 261 v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~ 337 (511) T protein:vir:78 261 VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL--DPVEVRKQKEANVLFLEPTVYVDAEGRE-TEGSVDGGYIYKQY 337 (511) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhcchhheecCccC--Cchhhcccccccceeccccceecccccc-CCCCcceeEEeecC Confidence 88999999999888888888777776666542211 011000000 00000000000 000000 01122343 3445 Q ss_pred ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHhhc------- Q lcl|NC_019404. 270 DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLI--DRKRNAELLPILEFLIPFIVN------- 340 (418) Q Consensus 270 ~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I--~~~Qe~~l~p~l~~l~~~i~~------- 340 (418) +.++....++.+.+.|...+++|..-. +... | |.||+.-...|.... ...++..++..+++++.+++. T Consensus 338 ~~~~~e~~~~~L~~~I~~~s~~P~~~~-~~~~-~-n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~ 414 (511) T protein:vir:78 338 DVQGTEAYKDRLNSDIHMFTNTPNMKD-DNFS-G-TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRS 414 (511) T ss_pred CHHHHHHHHHHHHHHHHHHhCCccccc-cccc-c-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 667899999999999999999997533 2222 2 345654333333222 223344577788887777632 Q ss_pred ---c---CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCCC-HHHHHHHHHh--------hcCcCCCChhhc Q lcl|NC_019404. 341 ---A---EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAMD-IKEARDTLRT--------IAPEIKIGDNDI 402 (418) Q Consensus 341 ---~---~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i~-~~e~r~~l~~--------~~~~~~~~~~~~ 402 (418) . .++++.|++-...+.++.+++..+.+.+++. +-..+.++ +++..+++.+ ....++....+. T Consensus 415 ~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~ 494 (511) T protein:vir:78 415 IDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDI 494 (511) T ss_pred CccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCC Confidence 1 2578999999999999988876655433222 12233332 2332332221 001111111111 Q ss_pred ----cc-ccccCCCccc Q lcl|NC_019404. 403 ----QT-EESELITETE 414 (418) Q Consensus 403 ----~~-~e~~~~~e~e 414 (418) ++ +..+...|+| T Consensus 495 ~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 495 NDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCccCcccccC Confidence 11 1112223333 No 141 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.43 E-value=3.6e-13 Score=88.83 Aligned_cols=369 Identities=12% Similarity=0.070 Sum_probs=185.3 Q ss_pred CccchhhHHHHhcCCCCc----cccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HHHHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGS----EIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PAFWSRWD 74 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~----~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~i~~~~~ 74 (418) +-+.+-+..-..|.+.-- ............--.. ..+++++.||+..+..++.+++.++++++. +.+.+.+ T Consensus 42 ~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~k--i~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~- 118 (468) T protein:vir:96 42 VEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWR--MYTNYHQNLVDQKVAYAVANPVTYGTEDEKSLKTIQEVL- 118 (468) T ss_pred HHHHHHHHHHhcCCCccccccccccccccccccccccc--cccchHHHHHHHHHhhhccCCceeccCChHHHHHHHHHH- Confidence 111111112223322100 0000000000000001 137899999999999999999999876543 2333333 Q ss_pred HhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCC Q lcl|NC_019404. 75 DLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNE 154 (418) Q Consensus 75 ~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~ 154 (418) .=+....+.++.+.+..||.|++++..+.+. .+ .+.++++.++-|.+-+.+...+.++ ...|.+.... T Consensus 119 ~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~----------~~-~i~~~~p~~~~~v~~~~~~~~~~~~-ir~~~~~~~~ 186 (468) T protein:vir:96 119 NHKWDDKLVDILTAASNKGVEWIQPYVDEQG----------EF-KTFRVPAEQAIPIWTNKERDELKAF-IRLYELDGGE 186 (468) T ss_pred hcCHHHHHHHHHHHHhhcCeEEEEEEEcCCC----------ce-EEEEEcccceEEEEcCCCCCceEEE-EEEEEecCce Confidence 2367788889999999999999887764211 11 1333344333222111111111111 0111111100 Q ss_pred ccccccc-CcccEEEec------------------------Cc-----cchhhhhhccccCCcchHHHHHHHHHHHHHHH Q lcl|NC_019404. 155 SDMFYDV-HYSRIHIID------------------------GE-----RVPNAMRRQNDGWGRSVLSSDILDSIKDYTNC 204 (418) Q Consensus 155 ~~~~~~i-H~SR~i~~~------------------------g~-----~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~ 204 (418) . .++ .+.++.++. .. .+|. ....++.+|.|.++. +.+.+.+++.+ T Consensus 187 ~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPv-v~~~n~~~g~sd~e~-v~~liDa~d~~ 261 (468) T protein:vir:96 187 R---VEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPF-IPFKNNPQEVSDLFM-YKTIIDAMDKR 261 (468) T ss_pred E---EEEEeCCeEEEEEEcCCceeecccccccccccceeeccccccCCcccE-EEecCCCCCCCchHH-HHHHHHHHHHH Confidence 0 000 011111110 00 0111 112235678998874 88999999999 Q ss_pred HHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCC-CceeEee--cccCCHHHHHHHH Q lcl|NC_019404. 205 ERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAES-EEYSVLN--SDIGGIDAFLDKK 281 (418) Q Consensus 205 ~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~-e~~~~~~--~~~~gl~~~~~~~ 281 (418) ....+..+..++..++.+.+... .... ..... ......+.+.+++ .+.+.++ .+..+....++.+ T Consensus 262 ~S~~~~~~~~~~~p~lv~~g~~~--~~~~---~~~~~-------~~~~~~i~~~~d~~~~~~~l~~~~~~~~~~~~~~~l 329 (468) T protein:vir:96 262 LSDTQNTFDEATELIYVLKGYEG--EDLE---EFMYN-------LKYYKAINVDGDGSGGVDTIQIDVPVQSAKEYLDML 329 (468) T ss_pred HHHHHHHHHHhcCceeeeecCCc--cccc---hhhhh-------hhcCceEEecCCCCCcceEEeecCChHHHHHHHHHH Confidence 99988888888888777775321 1111 11111 1124445554432 2455554 4556888999999 Q ss_pred HHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhcc-------CCceEEeCCCC Q lcl|NC_019404. 282 FDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEFLIPFIVNA-------EEWSVEFSPLD 352 (418) Q Consensus 282 ~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~l~~~i~~~-------~~~~~~f~pL~ 352 (418) .+.|...+++|-.-. ..-+| |.||+.-...|..... ...+..++..+++++++++.- .++.+.|++-. T Consensus 330 ~~~I~~~s~~p~~~~--~~~~~-n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~i~f~~~~ 406 (468) T protein:vir:96 330 RDYVIEFGQGVDFQQ--DKFGN-SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLSIKVQDVEITFNFNV 406 (468) T ss_pred HHHHHHHhCcccccc--ccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCC Confidence 999999999996422 22233 5567654433333332 333456788888888887642 36789999988 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhhccc---ccccCCCccccccC Q lcl|NC_019404. 353 HESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDNDIQT---EESELITETEVVIA 418 (418) Q Consensus 353 ~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~~~~---~e~~~~~e~e~~~~ 418 (418) ..+++|.|++. .++|++|.+.+.+.+ ++....+++++. +..+....+..+.. T Consensus 407 p~d~~e~a~~~----------~~~g~iS~et~i~~l----~~v~D~~~E~~ri~~E~~~~~~~~~~~~~ 461 (468) T protein:vir:96 407 MVNELEQSQIG----------VNSQYLSKETVVTNH----PWVDDPVAEMERIDQEELALPSIEEGLNG 461 (468) T ss_pred CcCHHHHHHHH----------HhcCCCchHHHHHhC----CCCCCHHHHHHHHHHHHHHHHHHhhccCC Confidence 88888776642 345788877666543 111111112111 00000000000000 No 142 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.42 E-value=2.9e-12 Score=83.85 Aligned_cols=395 Identities=11% Similarity=0.030 Sum_probs=193.0 Q ss_pred Cc----------------------------cchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhh Q lcl|NC_019404. 1 MV----------------------------KTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPET 52 (418) Q Consensus 1 ~~----------------------------~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d 52 (418) .+ |.+-+..-..|-+.-- .... ...........+.+++.||+..+.. T Consensus 4 ~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~---~~~~--~~~~~~~~ki~~n~~~~Iv~~~~~~ 78 (499) T protein:vir:10 4 VIDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIE---KHEF--DNATVEAANVMVNHAKYITDMNVGF 78 (499) T ss_pred chhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchh---cCCc--CcCCCCcceeecchHHHHHHHHhhh Confidence 11 1111111122221100 0000 0000011122367899999999999 Q ss_pred hccCCccccCcchH--HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCC-Ccccccc-----cCCCceEEEEEee Q lcl|NC_019404. 53 ALAAGFHIDGIDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDN-RALTSPV-----REGAELETVRVYD 124 (418) Q Consensus 53 ~~r~~~~i~~~~d~--~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~-~~l~~pl-----~~~~~i~~i~v~~ 124 (418) ++.+++.+.++++. +.+.+-+++-++...+.++.+.+..||.|++++.++.+ .....+. .....-..+.+++ T Consensus 79 l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~ 158 (499) T protein:vir:10 79 MTGNPVKYVAEKGKNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVID 158 (499) T ss_pred hcccCceeecCChhHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEEc Confidence 99999998765543 45666677778889999999999999999999887643 2221111 1111112345566 Q ss_pred ccccccccccccccccccCcceEEEEecC-Ccc--cccc-cCcccEEEecCc--------------------cchhhhhh Q lcl|NC_019404. 125 RTQVKVQNREENPRNARFGKPLTYRITTN-ESD--MFYD-VHYSRIHIIDGE--------------------RVPNAMRR 180 (418) Q Consensus 125 ~~~i~~~~~~~dp~s~~yg~p~~y~i~~~-~~~--~~~~-iH~SR~i~~~g~--------------------~lp~~~~~ 180 (418) ++++-+.+....-..+.++ ..+|..... +.. ...+ .-+.++.+|... .+|. ... T Consensus 159 p~~~~~v~~d~~~~~~~~~-i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv-v~~ 236 (499) T protein:vir:10 159 PRATVVVCDDTVEHDPLFA-VFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGAVPI-IEF 236 (499) T ss_pred ccceEEEecCCCCcceEEE-EEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCccce-EEe Confidence 6554332211100000010 011111100 000 0000 011222222100 0111 112 Q ss_pred ccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEE-c Q lcl|NC_019404. 181 QNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGID-A 259 (418) Q Consensus 181 ~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d-~ 259 (418) .++.+|.|.++ .+.+.+.+++.+....+..+..++..++.+.+.. + ..... ....+ ..+...++. . T Consensus 237 ~n~~~~~~d~e-~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~-~-~~~~~---~~~~~-------~~~~~~~~~~~ 303 (499) T protein:vir:10 237 RNNEERQGDFE-QLISLIDAYNLLQTDRISDKEAFVDALLVTFGFG-L-GDDKD---DIQRL-------KRGAIEAPPRE 303 (499) T ss_pred cCCCCCCCchH-hHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCc-c-ccccc---hhhhh-------hhcceeccCCC Confidence 34567888886 4788899999998888888888888877776521 1 11110 01111 112222222 2 Q ss_pred CCCceeEee--cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHH--HHHHHHHHHHHHHHHH Q lcl|NC_019404. 260 ESEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLI--DRKRNAELLPILEFLI 335 (418) Q Consensus 260 ~~e~~~~~~--~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I--~~~Qe~~l~p~l~~l~ 335 (418) ++.+++.++ .+..+....++.+.+.|...+++|..-. +.- +| |.||..-.-.|.... ....+..+++.+++++ T Consensus 304 ~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~-~g-n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~ 380 (499) T protein:vir:10 304 EGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMND-EKF-MG-NVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRL 380 (499) T ss_pred CCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCc-hhh-cc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 233455554 4567899999999999999999996322 211 22 335554333333322 2333456788888888 Q ss_pred HHhhcc----------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCC-CHHHHHHHHHhh---------c Q lcl|NC_019404. 336 PFIVNA----------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAM-DIKEARDTLRTI---------A 392 (418) Q Consensus 336 ~~i~~~----------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i-~~~e~r~~l~~~---------~ 392 (418) .+++.- .++++.|++-...++.+.++...+.+..++. +-..+.+ ++++..+++.+- . T Consensus 381 ~li~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~ 460 (499) T protein:vir:10 381 KLIQTIVNIKGANDDASGCKISLVANIPSNLSDVVNNVKNADGIIPRKYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQE 460 (499) T ss_pred HHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHh Confidence 887541 2578999999999999988877665432222 1223333 234333333211 0 Q ss_pred CcCCCC-----hhhcccccccCCCccccccC Q lcl|NC_019404. 393 PEIKIG-----DNDIQTEESELITETEVVIA 418 (418) Q Consensus 393 ~~~~~~-----~~~~~~~e~~~~~e~e~~~~ 418 (418) ...+.. .++-++++.+...+.+..-+ T Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (499) T protein:vir:10 461 ALRGQDPDRLELEDKQDDSSENDKEAGSNHN 491 (499) T ss_pred hhccCCCCCCCCCCCCcccCCCCCCCccccc Confidence 111111 11111111111111111111 No 143 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.42 E-value=4.6e-13 Score=88.21 Aligned_cols=365 Identities=11% Similarity=0.049 Sum_probs=187.0 Q ss_pred CccchhhHHHHhcCCCC----ccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HHHHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDG----SEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PAFWSRWD 74 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~----~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~i~~~~~ 74 (418) +-+.+-+..-+.|-+.- ....+.... .... ...--.+++++.||+..+...+.+++.++++++. +.+.+ +. T Consensus 42 ~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~-~~~~-~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~-~~ 118 (474) T protein:vir:96 42 IDDITVGERYYNHDPDVLRLAPKLDNKGEI-DPLK-PDWRMFTNYHQNLVDQKVAYAVANPVTFSSDDDKSLKTIQE-VL 118 (474) T ss_pred HHHHHHHHHHhccCCcchhccchhcccccc-cccc-cchhcccchHHHHHHhhhhhhcccCceeecCchHHHHHHHH-HH Confidence 11122222222232210 000000000 0000 0001136899999999999999999999876543 23332 22 Q ss_pred HhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccccccccccc------------ Q lcl|NC_019404. 75 DLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARF------------ 142 (418) Q Consensus 75 ~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~y------------ 142 (418) .-+......++.+.+..||.|++++.++.+. .+ .+.++++.++-|.+-+.....+.+ T Consensus 119 ~n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~----------~~-~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~ 187 (474) T protein:vir:96 119 NHKWDDKLVDILTAASNKGIEWLQPYIDENG----------EF-KTFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGAER 187 (474) T ss_pred hcCHHHHHHHHHHHHHhcCeeEEEEEecCCC----------ce-EEEEEcccceEEEEcCCCCCceEEEEEEEeecCceE Confidence 3366778888899999999999988774221 11 133333333322211100000100 Q ss_pred ------CcceEEEEecCCcc---------------cccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHH Q lcl|NC_019404. 143 ------GKPLTYRITTNESD---------------MFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDY 201 (418) Q Consensus 143 ------g~p~~y~i~~~~~~---------------~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~ 201 (418) ++...|....+... ....-|+-..+ |. ....++.+|.|.++. +.+.+.++ T Consensus 188 ~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~i-------Pv-v~~~nn~~g~sd~e~-v~~liDa~ 258 (474) T protein:vir:96 188 VEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRV-------PF-IPFKNNPQEMSDLFM-YKTIIDAM 258 (474) T ss_pred EEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCce-------eE-EEeccCCCCCCcHHH-HHHHHHHH Confidence 11111211110000 00011211111 11 112245578888874 88999999 Q ss_pred HHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEee--cccCCHHHHHH Q lcl|NC_019404. 202 TNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLN--SDIGGIDAFLD 279 (418) Q Consensus 202 ~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~--~~~~gl~~~~~ 279 (418) +.+....+..+..++..++.+.+.. +........ .......+.+++++.+++.++ .+..+....++ T Consensus 259 d~~~S~~~~~~~~~~~~~lv~~g~~-----~~~~~~~~~-------~~~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~ 326 (474) T protein:vir:96 259 DKRLSDTQNTFDESTELIYILKGYE-----GQDLDEFMR-------NLKYYKAINVDGDGSGVDTIQIEVPVQSSKEYLD 326 (474) T ss_pred HHHHHHHHHHHHHhccceeeeecCC-----cccccchhh-------hhhcCceEEecCCCCceeEEeecCChHHHHHHHH Confidence 9999999988988888888777532 111111111 112356666766666666655 55678999999 Q ss_pred HHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhc-------cCCceEEeCC Q lcl|NC_019404. 280 KKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEFLIPFIVN-------AEEWSVEFSP 350 (418) Q Consensus 280 ~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~l~~~i~~-------~~~~~~~f~p 350 (418) .+.++|...+++|-.-. + .-+| |.||..-...|...+. ...+..++..+.+++.+|+. ..++.+.|++ T Consensus 327 ~l~~~i~~~s~~p~~~~-~-~~~~-n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~i~i~f~~ 403 (474) T protein:vir:96 327 MLRDYVIEFGQGVDFQQ-D-KFGN-SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLNIKVQDVEITFNF 403 (474) T ss_pred HHHHHHHHHhCCccccc-c-cccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecc Confidence 99999999999997533 2 2222 4456654433443322 33445678888888887753 2367899999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhh--------------cCcCC----CChhhcccccccCCCc Q lcl|NC_019404. 351 LDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTI--------------APEIK----IGDNDIQTEESELITE 412 (418) Q Consensus 351 L~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~--------------~~~~~----~~~~~~~~~e~~~~~e 412 (418) -...+++|.+++. .++|++|.+.+...+... .+... ...+.....++ -+.| T Consensus 404 ~~p~~~~e~~~~~----------~~ag~iS~et~~~~~~~v~d~~~E~~ri~~E~~e~~~~~~~~~~~~~~~~~d-~~~e 472 (474) T protein:vir:96 404 NVMVNELEQSQIG----------VQSQYLSKETVVTNHPWVDDPVAELERIEQDNIDFNKQLPPLEGDANGRAQD-NESE 472 (474) T ss_pred CCCcCHHHHHHHH----------HhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccccccccCC-Cccc Confidence 9988888877642 345677666665433110 00000 00000000000 0001 Q ss_pred cc Q lcl|NC_019404. 413 TE 414 (418) Q Consensus 413 ~e 414 (418) +. T Consensus 473 ~~ 474 (474) T protein:vir:96 473 TN 474 (474) T ss_pred CC Confidence 11 No 144 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.41 E-value=9.7e-13 Score=86.45 Aligned_cols=361 Identities=11% Similarity=0.022 Sum_probs=192.4 Q ss_pred ccchhhHHHHh--cCCCC-----cccc-Cc-----cccCCHHHHHHHHH-cCCccchhhhcchhhhccCCccccCcchHH Q lcl|NC_019404. 2 VKTDSYANIFL--GGSDG-----SEIY-GS-----LQNQAPTILASLYA-DNALVRRIIDTIPETALAAGFHIDGIDDEP 67 (418) Q Consensus 2 ~~~D~~~n~~~--g~~~~-----~~~~-~~-----~~~~~~~~l~~~Y~-~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~ 67 (418) |..+.+.-+.. ..... .++| |. ...-.+.++...|+ ..+++++|||.+++-+.=+||.. +|. T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~---~d~- 76 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFEN---DDF- 76 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcccCcccC---Cch- Confidence 44444433321 11100 0111 11 11113455655554 34789999999999887788763 232 Q ss_pred HHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccccCCCceEEEEEeecc--cccccc--cccc----cc Q lcl|NC_019404. 68 AFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRT--QVKVQN--REEN----PR 138 (418) Q Consensus 68 ~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl~~~~~i~~i~v~~~~--~i~~~~--~~~d----p~ 138 (418) .+++-|.+-++.....++++.+.+||.|++++.-+ ++.+.-.++.+.. -+.+.|+. .+.... +..| +. T Consensus 77 ~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~---~~~i~D~~~~~~~~a~~~~~~d~~~~~~ 153 (409) T protein:vir:94 77 TVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVN---ATGIIDPITGLLTEGYAVLERDENNNVV 153 (409) T ss_pred HHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccce---EEEEEecCCCceeeeEEEEEecCCCceE Confidence 35667777778888899999999999999988643 2222111111110 01112221 011111 1100 00 Q ss_pred ccccCcc-eEEEEecCCcccccccCccc---EEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 139 NARFGKP-LTYRITTNESDMFYDVHYSR---IHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRR 214 (418) Q Consensus 139 s~~yg~p-~~y~i~~~~~~~~~~iH~SR---~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~ 214 (418) .-.++.| +.|++...++.....-|+-. +++|..++ .....||.|.+.+++.+.+.++.+++........- T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~------~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~ 227 (409) T protein:vir:94 154 LEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRP------DAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEF 227 (409) T ss_pred EEEEEecCcEEEEEecCceeEeeeCCCCCcceEEecccc------ccccccCccccchhHHHHHHHHHHHHHHHHHHHHH Confidence 0011111 11222211111112224332 22232221 22456899988777888899999998777777777 Q ss_pred cCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEE--EcCCCc--eeEe-ecccCCHHHHHHHHHHHHhhhh Q lcl|NC_019404. 215 KQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGI--DAESEE--YSVL-NSDIGGIDAFLDKKFDRIVALS 289 (418) Q Consensus 215 ~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~--d~~~e~--~~~~-~~~~~gl~~~~~~~~~~iaaas 289 (418) ++.+...+.|+.. ++.......... +..+.+ +.+++. +.++ ..++.+.-+.++....++|+.+ T Consensus 228 ~a~pqr~i~G~d~---d~~~~~~~~~~~---------~~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~t 295 (409) T protein:vir:94 228 YSFPQKYVTGLSD---DAEPMETWKATV---------SSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGET 295 (409) T ss_pred hcChhheeEecCC---CCcccchhhhhH---------HHhhcCCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhc Confidence 7777766665421 122211111111 111222 122222 3222 2356677788999999999999 Q ss_pred cCCeeeeeccCccccccchhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhhc--------cC---CceEEeCCCCCCC Q lcl|NC_019404. 290 GIHEIILKNKNVGGLSSSQNTAL---ETFHKLIDRKRNAELLPILEFLIPFIVN--------AE---EWSVEFSPLDHES 355 (418) Q Consensus 290 ~IP~t~L~G~s~~gl~stge~d~---~~y~~~I~~~Qe~~l~p~l~~l~~~i~~--------~~---~~~~~f~pL~~~~ 355 (418) ++|...|.|++. -++|++.-. ......++++|+. ..+.++++..+++. .. +..+.|.|+...+ T Consensus 296 ~lP~~~lg~~~~--NpsSa~Al~a~~~~L~~~a~~k~~~-fg~~~~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~~~~ 372 (409) T protein:vir:94 296 GLTLDDLGFVSD--NPSSVEAIKASHENLRLAGRKAQRS-LGAGLLNVAYLAACLRDDAPYLREQFRKTKPKWEPLFEAD 372 (409) T ss_pred CCCHHHhccccC--chhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhCCCCccccccccceEEeccCCCcc Confidence 999988866552 124444333 2334455555543 56777777776532 11 4578899988777 Q ss_pred HHHHHHHHHHHHHHHHHHHhCC--CCCHHHHHHHHHhhcCcCCCChhh Q lcl|NC_019404. 356 SKDKAEVLEKSVNSIAALIAAG--AMDIKEARDTLRTIAPEIKIGDND 401 (418) Q Consensus 356 eke~ae~~~~~a~a~~~~~~~g--~i~~~e~r~~l~~~~~~~~~~~~~ 401 (418) ..+.|++ |+++.+++++| +.+.+.+++.| |+++.| T Consensus 373 ~~~~a~~----aDa~~Kl~~ag~~~~~~~~~~~~l-------G~~~~d 409 (409) T protein:vir:94 373 ASMLSLI----GDGAIKLNQAIPEFINKDTIRDLT-------GIEGGE 409 (409) T ss_pred hHHHHHH----HHHHHHHHHhcccccchhHHHHHc-------CCCCCC Confidence 6666554 89999999999 55666676654 333333 No 145 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.41 E-value=3.1e-13 Score=89.15 Aligned_cols=368 Identities=11% Similarity=0.011 Sum_probs=183.2 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HHHHHHHHHhCc Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PAFWSRWDDLEM 78 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~i~~~~~~l~~ 78 (418) +-|.+-+..-..|.+.-.......... ...-..+++++.||+..+..++.+++.+.++++. +.++.-+++-++ T Consensus 33 ~~r~~~~~~yy~g~~~i~~~~~~~~~~-----~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~ 107 (453) T protein:vir:73 33 VERYEYLGNMYKGIMEISSQKAKDSWK-----PDNRLTNNFAKYIVDTFVGYFNGIPIKKTHDDKSVLEAMQLFDNLNDM 107 (453) T ss_pred HHHHHHHHHHhccccchhcCCCCCccC-----ccceeecchHHHHHHHhhhhhcccCceeecCChHHHHHHHHHHHhcCh Confidence 111111111112221100000000000 0011236899999999999999999999876543 346666667789 Q ss_pred hHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeecccccccc-------------ccccccccc--- Q lcl|NC_019404. 79 TQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQN-------------REENPRNAR--- 141 (418) Q Consensus 79 ~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~~-------------~~~dp~s~~--- 141 (418) .....++.+.+..||.|++++..+. +... +.++++.++.+.+ +..+..... T Consensus 108 ~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~------------i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~ 175 (453) T protein:vir:73 108 EDEESELAKIACVYGRAYELMYQNESTESE------------VIYCSPLNVFMVYDDSIKQKPLFAVYYGFDEEGNLSGT 175 (453) T ss_pred hHHHHHHHHHHHhcCeEEEEEEeCCCCceE------------EEEEcccceEEEEeCCCCceeEEEEEEEEecCceEEEE Confidence 9999999999999999999887743 2221 2222222221111 001100000 Q ss_pred -cCcceEEEEecCCccc---ccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019404. 142 -FGKPLTYRITTNESDM---FYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ 217 (418) Q Consensus 142 -yg~p~~y~i~~~~~~~---~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~ 217 (418) |..-..|++...++.. ...-|+-..+ |+. ...++.+|.|.++ .+.+.+.+++.+....+..+..++. T Consensus 176 vyt~~~i~~~~~~~~~~~~~~~~~~~~g~v-----Pvv---~~~n~~~g~s~~~-~v~~liDa~~~~~S~~~~~~~~~~~ 246 (453) T protein:vir:73 176 VYTLLETISITGKAGEVKFGESTYNVYSDL-----PIV---EYNFNEERQSIFE-PVHSLINSYNKVTSEKANDVEYFSD 246 (453) T ss_pred EEeCCeEEEEEecCCceEEccceeccCCce-----eEE---EecCCCCCCcchh-hHHHHHHHHHHHHHHHHHHHHHhcc Confidence 1111122222221110 0011222211 111 1234567899887 5889999999999999988888887 Q ss_pred ceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeE--eecccCCHHHHHHHHHHHHhhhhcCCeee Q lcl|NC_019404. 218 AVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSV--LNSDIGGIDAFLDKKFDRIVALSGIHEII 295 (418) Q Consensus 218 ~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~--~~~~~~gl~~~~~~~~~~iaaas~IP~t~ 295 (418) ..+.+.+.. +. +.......+...................+.++.. .+.+.++....++.+.+.|...+++|-.- T Consensus 247 ~~l~~~g~~-~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~ 322 (453) T protein:vir:73 247 QYLVFLGAE-VD---EEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANIS 322 (453) T ss_pred ceeeeecCC-CC---chhhhcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccC Confidence 777666431 10 1010111111111111011111112222334444 44556788999999999999999999743 Q ss_pred eeccCccccccchhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhhc----------cCCceEEeCCCCCCCHHHHHHH Q lcl|NC_019404. 296 LKNKNVGGLSSSQNTALETFHKL---IDRKRNAELLPILEFLIPFIVN----------AEEWSVEFSPLDHESSKDKAEV 362 (418) Q Consensus 296 L~G~s~~gl~stge~d~~~y~~~---I~~~Qe~~l~p~l~~l~~~i~~----------~~~~~~~f~pL~~~~eke~ae~ 362 (418) ..+ .| |+||+.-...|... ++.+ +..++..++++..+++. ..+++++|++-...++++.|++ T Consensus 323 ~~~---~g-n~Sg~Al~~~~~~l~~ka~~~-~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~ 397 (453) T protein:vir:73 323 DEN---FG-NSSGVALAYKLQAMSNLALSF-QRKFQSALNRRYSLWSSLSTNASNKDAWKDIEYTFTRNEPKDIKEQAET 397 (453) T ss_pred ccc---cc-CccHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHH Confidence 322 12 34565443333332 3333 34467777777776642 1367899999999999888776 Q ss_pred HHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhhcc---cccc------------cCCCccccc Q lcl|NC_019404. 363 LEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDNDIQ---TEES------------ELITETEVV 416 (418) Q Consensus 363 ~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~~~---~~e~------------~~~~e~e~~ 416 (418) ..+.+ |++|.+.+.+.+ ++..-..++++ .++. .++.+-..+ T Consensus 398 ~~k~~---------giis~et~~~~~----~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 453 (453) T protein:vir:73 398 ANILK---------GITSEETALSVI----SVIPDVQAEMEKIKKKKLLQLSLTRTSNLVRMKQMRGNL 453 (453) T ss_pred HHHHh---------ccCcHHHHHHhC----CCCCCHHHHHHHHHHHHHHHHHHHHhccCCcchhhhcCC Confidence 44332 566665544332 11110111111 1000 111111111 No 146 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=99.41 E-value=4.5e-14 Score=93.75 Aligned_cols=322 Identities=14% Similarity=0.034 Sum_probs=159.3 Q ss_pred HHHHhcCCCCccc--cCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc---Ccc---hH------HHHHHHH Q lcl|NC_019404. 8 ANIFLGGSDGSEI--YGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID---GID---DE------PAFWSRW 73 (418) Q Consensus 8 ~n~~~g~~~~~~~--~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~---~~~---d~------~~i~~~~ 73 (418) .++|...-+.++. .+.....+-..-...+..+..+++||+.+|++.-+-++.+- ..+ +. ..+...+ T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL 80 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVL 80 (378) T ss_pred CCccccchhcccccccCCcceeeeeccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccccccccchHHHHH Confidence 2222111001110 01111111111122333456789999999999999888751 111 11 0112222 Q ss_pred HH----hCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEE Q lcl|NC_019404. 74 DD----LEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTY 148 (418) Q Consensus 74 ~~----l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y 148 (418) .. .-....|.+.+.+ ..++|.|++++..++. .+.+.++.+ T Consensus 81 ~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~---------~g~~~~l~p-------------------------- 125 (378) T protein:vir:94 81 NWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDN---------TGELLDLLF-------------------------- 125 (378) T ss_pred hhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCC---------CceEEEEEe-------------------------- Confidence 21 1123345555544 5667999988755432 122322211 Q ss_pred EEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHh Q lcl|NC_019404. 149 RITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAEL 228 (418) Q Consensus 149 ~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~ 228 (418) .+ ...+++++.||||.+ | .+...|.|++.. +...+..+.+. . . ---++++++. T Consensus 126 ----~~--~~~~~~~~diiH~~~-~-------~~~~~g~s~l~~-~~~~i~~~~~~---~-~-----~~gil~~~~~--- 178 (378) T protein:vir:94 126 ----AD--DKKEYKPEELVRLTS-P-------FYINEDTSILDN-ALASIQTKLEQ---G-K-----LRGLLKINAF--- 178 (378) T ss_pred ----cC--CeeEeeeeeeEEecC-c-------CCccchhHHHHH-HHHHHHHHHhc---c-c-----ccceeeeCCc--- Confidence 00 113567889999963 1 122347787753 55555433211 1 0 0123455421 Q ss_pred hcCcchHHHHHHHHHHHHHhc---CCcceeEEEcCCCceeEeecccCCHH-HHHHHHHHHHhhhhcCCeeeeeccCcccc Q lcl|NC_019404. 229 CDDSEGFGAARLRLAQVDNNS---GVGQAIGIDAESEEYSVLNSDIGGID-AFLDKKFDRIVALSGIHEIILKNKNVGGL 304 (418) Q Consensus 229 ~~~~~~~~~~~~r~~~~~~~~---~~~~~~~~d~~~e~~~~~~~~~~gl~-~~~~~~~~~iaaas~IP~t~L~G~s~~gl 304 (418) ++ .+...+..+++....... .+.+.+++...+.+|++++.+....+ ...++....||.+.|||..+|.|. T Consensus 179 l~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVP~~~l~~~----- 252 (378) T protein:vir:94 179 LD-IDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGT----- 252 (378) T ss_pred CC-HHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC----- Confidence 11 112233344444332221 12333455555678998887665433 334677788999999999887431 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcc-------------CCceEEeCCCCCCCHHHHHHHHHHHHH Q lcl|NC_019404. 305 SSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VNA-------------EEWSVEFSPLDHESSKDKAEVLEKSVN 368 (418) Q Consensus 305 ~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~-------------~~~~~~f~pL~~~~eke~ae~~~~~a~ 368 (418) ..+....+||.. -|.|.+..+-..+ +.+ .++.|++..|...|.+++ ++ T Consensus 253 --~se~~~~~f~~~-------tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~-------~~ 316 (378) T protein:vir:94 253 --ASQEQQIYFYNS-------TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKEL-------ID 316 (378) T ss_pred --hHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHH-------HH Confidence 123444555543 4788876665443 221 135677778887777655 67 Q ss_pred HHHHHHhCCCCCHHHHHHHHHhhcCcCCCC-------------hhhccc---cc-ccCCCccc Q lcl|NC_019404. 369 SIAALIAAGAMDIKEARDTLRTIAPEIKIG-------------DNDIQT---EE-SELITETE 414 (418) Q Consensus 369 a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~-------------~~~~~~---~e-~~~~~e~e 414 (418) ++.+++++|++|++|+|+.+. ..|..+.+ ..+.+. .+ ...++.+| T Consensus 317 ~~~~~~~~G~~T~NE~R~~~g-l~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 317 LYHENINGPIFTQNQLLVKMG-EQPIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred HHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeeeecccccccccchhhcCCcCCCCCCCCCCCC Confidence 788999999999999999763 22221110 000010 01 11222233 No 147 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.41 E-value=1.5e-12 Score=85.43 Aligned_cols=371 Identities=11% Similarity=0.042 Sum_probs=185.4 Q ss_pred CccchhhHHHHhcCCCCccccCcc-ccCCHHHHH-HHHHcCCccchhhhcchhhhccCCccccCcchH-HHHHHHHHHhC Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSL-QNQAPTILA-SLYADNALVRRIIDTIPETALAAGFHIDGIDDE-PAFWSRWDDLE 77 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~-~~~~~~~l~-~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-~~i~~~~~~l~ 77 (418) +-+.+-+..-..|-+.-....... .......-. ..=-.+.+++.||+..+..++.+++.++++++. ....+.|..-+ T Consensus 43 ~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~n~ 122 (474) T protein:vir:95 43 LKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVLDTR 122 (474) T ss_pred HHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHHHhcc Confidence 222222233223322110000000 000000000 000136899999999999999999999876653 22233344447 Q ss_pred chHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccc Q lcl|NC_019404. 78 MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDM 157 (418) Q Consensus 78 ~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~ 157 (418) ....+.++.+....||.|++++..+.+. .+ .+.++++.++-+.+-+.+...+.++ ...|...... T Consensus 123 ~~~~~~~l~~~~~~~G~~~~~~~~d~~~----------~~-~i~~~~p~~~~~v~d~~~~~~~~a~-ir~~~~~~~~--- 187 (474) T protein:vir:95 123 WDNKLIDILTAASNKGIDWLQVYINEDG----------EL-KLFRVPAEQAIPIWTDKEREQLNAF-IRIFTFNGET--- 187 (474) T ss_pred HHHHHHHHHHHHhhCCeEEEEeeeCCCC----------ce-EEEEEcccceEEEEcCCCCCceEEE-EEEEeecCee--- Confidence 8888999999999999999998774221 11 2333444443332211111111111 1111111100 Q ss_pred ccccC-cccEEEec---C----------------------ccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 158 FYDVH-YSRIHIID---G----------------------ERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQL 211 (418) Q Consensus 158 ~~~iH-~SR~i~~~---g----------------------~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l 211 (418) ..++| +.++.+|. + ..+|. ....++..|.|.++ .+.+.+.+++.+....+.. T Consensus 188 ~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv-v~~~nn~~~~~d~e-~v~~liDa~d~~~S~~~~~ 265 (474) T protein:vir:95 188 KVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPF-IAFKNNPEEVSDIW-MYKSFVDAIDKRLSDVQNM 265 (474) T ss_pred EEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccce-EEecCCCCCCCchH-HHHHHHHHHHHHHHHHHHH Confidence 00111 12222221 0 00111 11123456788886 4889999999999988888 Q ss_pred HHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEe--ecccCCHHHHHHHHHHHHhhhh Q lcl|NC_019404. 212 LRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVL--NSDIGGIDAFLDKKFDRIVALS 289 (418) Q Consensus 212 ~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~--~~~~~gl~~~~~~~~~~iaaas 289 (418) +..++...+.+.+.. +.......... .....+.+ .++.+.+.+ +.+.++....++.+.++|...+ T Consensus 266 ~~~~~~p~lv~~g~~-----~~~~~~~~~~~-------~~~~~i~~-~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s 332 (474) T protein:vir:95 266 FDESVELIYILRGYE-----GEDLSEFMEGL-------KYYKAINV-SSDGGVETIQVEVPVASTKEYLDMMRAYIVEFG 332 (474) T ss_pred HHHhhcchhhhcCCC-----cccccchhhhh-------hccceeec-cCCCceeEEeccCCHHHHHHHHHHHHHHHHHHh Confidence 888888777766531 11101111111 11223333 334555554 5566789999999999999999 Q ss_pred cCCeeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhcc-------CCceEEeCCCCCCCHHHHH Q lcl|NC_019404. 290 GIHEIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEFLIPFIVNA-------EEWSVEFSPLDHESSKDKA 360 (418) Q Consensus 290 ~IP~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~l~~~i~~~-------~~~~~~f~pL~~~~eke~a 360 (418) ++|-.-.- +-+| |.||..-...|..... ..++..++..+.+++.+++.- .++++.|++-...++.|.| T Consensus 333 ~~p~~~~~--~~~~-n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a 409 (474) T protein:vir:95 333 QGVDFQTD--KFGS-ATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQS 409 (474) T ss_pred CCcCcccc--cccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHH Confidence 99965432 1222 4456543333332222 334456788888888887642 3678999999999998887 Q ss_pred HHHHHHHHHHHHHHhCCCCCHHHHHHHH-------------Hh-------hcC-cCCCChhhcccccccCCCccc Q lcl|NC_019404. 361 EVLEKSVNSIAALIAAGAMDIKEARDTL-------------RT-------IAP-EIKIGDNDIQTEESELITETE 414 (418) Q Consensus 361 e~~~~~a~a~~~~~~~g~i~~~e~r~~l-------------~~-------~~~-~~~~~~~~~~~~e~~~~~e~e 414 (418) ++.. ++|++|.+.+...+ ++ ... ..+...+.-++.+.....|.| T Consensus 410 ~~~~----------~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 410 QIGA----------QSQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred HHHH----------HcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 7532 23666555444322 10 000 000000000111111112222 No 148 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.41 E-value=1.5e-12 Score=85.43 Aligned_cols=371 Identities=11% Similarity=0.042 Sum_probs=185.4 Q ss_pred CccchhhHHHHhcCCCCccccCcc-ccCCHHHHH-HHHHcCCccchhhhcchhhhccCCccccCcchH-HHHHHHHHHhC Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSL-QNQAPTILA-SLYADNALVRRIIDTIPETALAAGFHIDGIDDE-PAFWSRWDDLE 77 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~-~~~~~~~l~-~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-~~i~~~~~~l~ 77 (418) +-+.+-+..-..|-+.-....... .......-. ..=-.+.+++.||+..+..++.+++.++++++. ....+.|..-+ T Consensus 43 ~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~n~ 122 (474) T protein:vir:96 43 LKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVLDTR 122 (474) T ss_pred HHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHHHhcc Confidence 222222233223322110000000 000000000 000136899999999999999999999876653 22233344447 Q ss_pred chHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccc Q lcl|NC_019404. 78 MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDM 157 (418) Q Consensus 78 ~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~ 157 (418) ....+.++.+....||.|++++..+.+. .+ .+.++++.++-+.+-+.+...+.++ ...|...... T Consensus 123 ~~~~~~~l~~~~~~~G~~~~~~~~d~~~----------~~-~i~~~~p~~~~~v~d~~~~~~~~a~-ir~~~~~~~~--- 187 (474) T protein:vir:96 123 WDNKLIDILTAASNKGIDWLQVYINEDG----------EL-KLFRVPAEQAIPIWTDKEREQLNAF-IRIFTFNGET--- 187 (474) T ss_pred HHHHHHHHHHHHhhCCeEEEEeeeCCCC----------ce-EEEEEcccceEEEEcCCCCCceEEE-EEEEeecCee--- Confidence 8888999999999999999998774221 11 2333444443332211111111111 1111111100 Q ss_pred ccccC-cccEEEec---C----------------------ccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 158 FYDVH-YSRIHIID---G----------------------ERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQL 211 (418) Q Consensus 158 ~~~iH-~SR~i~~~---g----------------------~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l 211 (418) ..++| +.++.+|. + ..+|. ....++..|.|.++ .+.+.+.+++.+....+.. T Consensus 188 ~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv-v~~~nn~~~~~d~e-~v~~liDa~d~~~S~~~~~ 265 (474) T protein:vir:96 188 KVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPF-IAFKNNPEEVSDIW-MYKSFVDAIDKRLSDVQNM 265 (474) T ss_pred EEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccce-EEecCCCCCCCchH-HHHHHHHHHHHHHHHHHHH Confidence 00111 12222221 0 00111 11123456788886 4889999999999988888 Q ss_pred HHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEe--ecccCCHHHHHHHHHHHHhhhh Q lcl|NC_019404. 212 LRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVL--NSDIGGIDAFLDKKFDRIVALS 289 (418) Q Consensus 212 ~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~--~~~~~gl~~~~~~~~~~iaaas 289 (418) +..++...+.+.+.. +.......... .....+.+ .++.+.+.+ +.+.++....++.+.++|...+ T Consensus 266 ~~~~~~p~lv~~g~~-----~~~~~~~~~~~-------~~~~~i~~-~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s 332 (474) T protein:vir:96 266 FDESVELIYILRGYE-----GEDLSEFMEGL-------KYYKAINV-SSDGGVETIQVEVPVASTKEYLDMMRAYIVEFG 332 (474) T ss_pred HHHhhcchhhhcCCC-----cccccchhhhh-------hccceeec-cCCCceeEEeccCCHHHHHHHHHHHHHHHHHHh Confidence 888888777766531 11101111111 11223333 334555554 5566789999999999999999 Q ss_pred cCCeeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhcc-------CCceEEeCCCCCCCHHHHH Q lcl|NC_019404. 290 GIHEIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEFLIPFIVNA-------EEWSVEFSPLDHESSKDKA 360 (418) Q Consensus 290 ~IP~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~l~~~i~~~-------~~~~~~f~pL~~~~eke~a 360 (418) ++|-.-.- +-+| |.||..-...|..... ..++..++..+.+++.+++.- .++++.|++-...++.|.| T Consensus 333 ~~p~~~~~--~~~~-n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a 409 (474) T protein:vir:96 333 QGVDFQTD--KFGS-ATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQS 409 (474) T ss_pred CCcCcccc--cccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHH Confidence 99965432 1222 4456543333332222 334456788888888887642 3678999999999998887 Q ss_pred HHHHHHHHHHHHHHhCCCCCHHHHHHHH-------------Hh-------hcC-cCCCChhhcccccccCCCccc Q lcl|NC_019404. 361 EVLEKSVNSIAALIAAGAMDIKEARDTL-------------RT-------IAP-EIKIGDNDIQTEESELITETE 414 (418) Q Consensus 361 e~~~~~a~a~~~~~~~g~i~~~e~r~~l-------------~~-------~~~-~~~~~~~~~~~~e~~~~~e~e 414 (418) ++.. ++|++|.+.+...+ ++ ... ..+...+.-++.+.....|.| T Consensus 410 ~~~~----------~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 410 QIGA----------QSQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred HHHH----------HcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 7532 23666555444322 10 000 000000000111111112222 No 149 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.40 E-value=3.5e-13 Score=88.89 Aligned_cols=379 Identities=9% Similarity=0.090 Sum_probs=181.4 Q ss_pred CccchhhHHHH---------------------hcCCCCccccCccccCCHHHHHHH-H-HcCCccchhhhcchhhhccCC Q lcl|NC_019404. 1 MVKTDSYANIF---------------------LGGSDGSEIYGSLQNQAPTILASL-Y-ADNALVRRIIDTIPETALAAG 57 (418) Q Consensus 1 ~~~~D~~~n~~---------------------~g~~~~~~~~~~~~~~~~~~l~~~-Y-~~~~~~r~iVd~~a~d~~r~~ 57 (418) .+..+.....+ .|-+.. ... +.. .+.++... . ..+.+++.|||..++-+.-+| T Consensus 22 ~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~-~~~--~~~-~~~~~~~~~~~~v~n~~~~ivd~~a~~l~~~g 97 (501) T protein:vir:25 22 SMSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGR-PEV--PEG-ASDEVKELAKLSVKNVLSLVRDSFAQNLSVVG 97 (501) T ss_pred cCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-hhc--ccc-CChhhhhhHhhhhcChHHHHHHHHHhhhcccc Confidence 22222221111 111100 000 010 11222221 1 235799999999999998889 Q ss_pred ccccCcchHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccc--c Q lcl|NC_019404. 58 FHIDGIDDEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNRE--E 135 (418) Q Consensus 58 ~~i~~~~d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~--~ 135 (418) |.+.+.++.+.+..-|++-++.....++++.+.+||.|++++..+++. | .+++++++++.+.+.+ . T Consensus 98 f~~~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~----~--------~i~~~sp~~~~~iy~D~~~ 165 (501) T protein:vir:25 98 YRNALAKENDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEG----P--------VFRTRSPRQILAVYADPSV 165 (501) T ss_pred eecCCccchHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCC----C--------eEEEeccccEEEEEecCCC Confidence 888665555667777777778888899999999999999887653321 1 1344444443322110 0 Q ss_pred c--cc-------------cc----ccCcceEEEEecCCc---------c---------cccc------cCcccEEEecCc Q lcl|NC_019404. 136 N--PR-------------NA----RFGKPLTYRITTNES---------D---------MFYD------VHYSRIHIIDGE 172 (418) Q Consensus 136 d--p~-------------s~----~yg~p~~y~i~~~~~---------~---------~~~~------iH~SR~i~~~g~ 172 (418) + |. .. .|..-..|.+..... . .... -|+=.++.++ T Consensus 166 ~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv-- 243 (501) T protein:vir:25 166 DAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVV-- 243 (501) T ss_pred CcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccceeeE-- Confidence 0 00 00 011111111111000 0 0000 0110111110 Q ss_pred cchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCc Q lcl|NC_019404. 173 RVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVG 252 (418) Q Consensus 173 ~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~ 252 (418) +..-.+..+++|.|.++ ++.+.+.+++++..........++.+...+.+.. ++ .....+ .. .+ T Consensus 244 --~f~N~~~~~~~g~sdie-~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~-----~~-~~~~~~------~~--~~ 306 (501) T protein:vir:25 244 --RFVNGRDADDMIVGEVA-PLILLQQAINSVNFDRLIVSRFGANPQRVISGWT-----GS-KAEVLK------AS--AL 306 (501) T ss_pred --eccCccccCccccchhh-hhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCC-----CC-ccchhh------hc--cc Confidence 01111223568999997 5888889999888776666655555544333321 11 111110 11 12 Q ss_pred ceeEEEcCCCceeEee-cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHH---HHHHHHHHHHHHHHH Q lcl|NC_019404. 253 QAIGIDAESEEYSVLN-SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALE---TFHKLIDRKRNAELL 328 (418) Q Consensus 253 ~~~~~d~~~e~~~~~~-~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~---~y~~~I~~~Qe~~l~ 328 (418) ..+++.+++-++-+++ .++.+..+.++.....||+.+++|...|.|.+. |.||+.-.. ..-..++.+|+ .+. T Consensus 307 ~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~---N~Sg~Al~~~~~~l~~ka~~k~~-~f~ 382 (501) T protein:vir:25 307 RVWTFEDPEVKAQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKMI---NVSAEALAAAEANQQRKLAAKRE-SFG 382 (501) T ss_pred ceeccCCCCceEEEecccChHHHHHHHHHHHHHHHhhcCCChhhhccccC---ChHHHHHHHHHHHHHHHHHHHHH-HHH Confidence 2233332223333332 345567778889999999999999877765432 334553332 33344444443 467 Q ss_pred HHHHHHHHHhhcc---------CCceEEeCCCCCCCHHHHHHHHHHHHHH---HHHHH-hCCCCCHHHHHHHHHhhc--C Q lcl|NC_019404. 329 PILEFLIPFIVNA---------EEWSVEFSPLDHESSKDKAEVLEKSVNS---IAALI-AAGAMDIKEARDTLRTIA--P 393 (418) Q Consensus 329 p~l~~l~~~i~~~---------~~~~~~f~pL~~~~eke~ae~~~~~a~a---~~~~~-~~g~i~~~e~r~~l~~~~--~ 393 (418) +.|+++..+++.- .++.+.|.+...++..+.|+...|.+++ ...++ ..--++++++.+..+... . T Consensus 383 ~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gis~et~~~~~~g~~~~~ie~~~~~~~e~~ 462 (501) T protein:vir:25 383 ESWEQLLRLAAEMDDDPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGIPIEHLLSMVPGMTQQTIQAIKDSLRGGE 462 (501) T ss_pred HHHHHHHHHHHHHhCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCHHHHHHHcCCCCHHHHHHHHHHHHHHh Confidence 7888887776531 2578899999999998887766555443 11122 222335444322111000 0 Q ss_pred cC---------CCChhhcccccccCCCccccccC Q lcl|NC_019404. 394 EI---------KIGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 394 ~~---------~~~~~~~~~~e~~~~~e~e~~~~ 418 (418) .. +.+...-...+....+++++--+ T Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (501) T protein:vir:25 463 VKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGVN 496 (501) T ss_pred HHHHHHHhhccCcCCCCCCCCCCCccccccccCC Confidence 00 00000001111111122222222 No 150 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=99.40 E-value=1.1e-12 Score=86.15 Aligned_cols=396 Identities=12% Similarity=0.074 Sum_probs=179.9 Q ss_pred Cccc------------------hhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccC Q lcl|NC_019404. 1 MVKT------------------DSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG 62 (418) Q Consensus 1 ~~~~------------------D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~ 62 (418) -++. +-+..-+.|-+..- +..........-......+++++.||+..+...+-+++.+.+ T Consensus 21 ~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i--~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~~ 98 (506) T protein:vir:94 21 NLTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKI--LDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVGNPINVKL 98 (506) T ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc--cccccccccccCCcceeecchHHHHHHHhhhhhcccCceeec Confidence 1111 11111122322100 000000000000001124689999999999999999999987 Q ss_pred cchH--HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccccccccc Q lcl|NC_019404. 63 IDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNA 140 (418) Q Consensus 63 ~~d~--~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~ 140 (418) +++. +.+..-++.-++...+.++.+.+..||.|++++.++.+. .+ .+.++++.++-+.+.+.....| T Consensus 99 ~d~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~----------~~-~i~~~~p~~~~~v~dd~~~~~~ 167 (506) T protein:vir:94 99 PDDGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDN----------EE-HLAKLDPLDTFVIYSTDVDPKP 167 (506) T ss_pred CcchHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCC----------ee-EEEEEcccceEEEecCCCCCce Confidence 6543 445555666688889999999999999999998874221 11 2333444433332211110111 Q ss_pred ccCcceEEEEecCCccc-c-----c-ccCcccEEEecCcc---------------chhhhhhccccCCcchHHHHHHHHH Q lcl|NC_019404. 141 RFGKPLTYRITTNESDM-F-----Y-DVHYSRIHIIDGER---------------VPNAMRRQNDGWGRSVLSSDILDSI 198 (418) Q Consensus 141 ~yg~p~~y~i~~~~~~~-~-----~-~iH~SR~i~~~g~~---------------lp~~~~~~~~~~G~S~l~~~~~~~l 198 (418) .++ ..+|......... . . ..-+.++.++.+.. +|. ....++..|.|.++ .+.+.+ T Consensus 168 ~~~-v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~vPv-v~~~n~~~~~sd~e-~~~~li 244 (506) T protein:vir:94 168 IMA-VRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITTFPV-VEFKNSNFRLGDFE-NVLPLI 244 (506) T ss_pred EEE-EEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCccceeccccccCCccce-EEecCCCCCCCchh-hhHHHH Confidence 111 0111110000000 0 0 00011111111111 111 11122334677776 477888 Q ss_pred HHHHHHHHHHHHHHHHcCCceeecchHHHhhcCc------------chHHHHH-HHHHHHHHhcCCcceeEEEc------ Q lcl|NC_019404. 199 KDYTNCERLATQLLRRKQQAVWKAKGLAELCDDS------------EGFGAAR-LRLAQVDNNSGVGQAIGIDA------ 259 (418) Q Consensus 199 ~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~------------~~~~~~~-~r~~~~~~~~~~~~~~~~d~------ 259 (418) .+|+.+....+.-+..++..++.+.+.......+ .+..... .+..... .....+.+.+.. T Consensus 245 Da~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 323 (506) T protein:vir:94 245 DLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIK-EMKDANMLLLKSGMTVNG 323 (506) T ss_pred HHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHh-hhhhcCeeeecccccccC Confidence 8888888777776666665554444322111000 0000000 0000000 111122222221 Q ss_pred --CCCceeE--eecccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHH Q lcl|NC_019404. 260 --ESEEYSV--LNSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLID--RKRNAELLPILEF 333 (418) Q Consensus 260 --~~e~~~~--~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~--~~Qe~~l~p~l~~ 333 (418) .+.+++. .+.+..+.+..++.+.++|...+++|..-. .+-+| |.||..-...|..... ...+..++..+++ T Consensus 324 ~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~--~~~~~-n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~ 400 (506) T protein:vir:94 324 TQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTD--ENFAS-NSSGVAMQYKVLGTVELASTKRRMFERGLYA 400 (506) T ss_pred ccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCcccccc--ccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1223443 445667899999999999999999997422 12222 4556644433433222 3334456777777 Q ss_pred HHHHhhc-----c-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCCC-HHHHHHHHHh-hc---C Q lcl|NC_019404. 334 LIPFIVN-----A-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAMD-IKEARDTLRT-IA---P 393 (418) Q Consensus 334 l~~~i~~-----~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i~-~~e~r~~l~~-~~---~ 393 (418) ++.+++. . .++++.|++-...++.+.|++..+.+..++. +-..+.++ +++..+.+.+ .. + T Consensus 401 ~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~lp~v~d~~~E~~ri~~E~~~~~~ 480 (506) T protein:vir:94 401 RYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQAGATLPQKYLYQQLPGVTNPQDIVDMMKEQSANGDY 480 (506) T ss_pred HHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHhh Confidence 7766542 1 2467999999999999998876654322211 11222332 3332232221 00 0 Q ss_pred ---cCCCChhhcccccccCCCccccc Q lcl|NC_019404. 394 ---EIKIGDNDIQTEESELITETEVV 416 (418) Q Consensus 394 ---~~~~~~~~~~~~e~~~~~e~e~~ 416 (418) ..+...++-+..+.+-+.++|-+ T Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~e~~ 506 (506) T protein:vir:94 481 SFDQNGVISNDGQTNTTATQTDEEVR 506 (506) T ss_pred cchhhcCCCcccCccccccccccCCC Confidence 01111111111122222223333 No 151 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.40 E-value=2.8e-12 Score=83.95 Aligned_cols=399 Identities=12% Similarity=0.020 Sum_probs=189.0 Q ss_pred Cccchhh-----------HHHHhc-CCCC---cccc-Cc-----cccCCHHHHHHHHHcCCccchhhhcchhhhccCCcc Q lcl|NC_019404. 1 MVKTDSY-----------ANIFLG-GSDG---SEIY-GS-----LQNQAPTILASLYADNALVRRIIDTIPETALAAGFH 59 (418) Q Consensus 1 ~~~~D~~-----------~n~~~g-~~~~---~~~~-~~-----~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~ 59 (418) ..+..|+ .+.... ..+- .++| |. ...-.+.++...+....++++||+++++-+.=+||. T Consensus 12 ~~~~~~l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd~~a~rl~~~Gf~ 91 (504) T protein:vir:99 12 TFRIPELNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVDTLARRCNLESFV 91 (504) T ss_pred ccccCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHHHHHHhhhccceee Confidence 1111111 111100 0000 0011 11 111124556666666788999999999988889998 Q ss_pred ccCcchH-HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeeccccccc------ Q lcl|NC_019404. 60 IDGIDDE-PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQ------ 131 (418) Q Consensus 60 i~~~~d~-~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~------ 131 (418) +.+.++. ..+.+-|.+-++.....++++.+.+||.||+++.-.+ +.+ .|+ |++++++++... T Consensus 92 ~~d~~~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~--~~~--------I~~~sP~~~~~iyD~~~~ 161 (504) T protein:vir:99 92 WPDGDYGSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEP--DSL--------IHVKSAMQATGEWNSRRN 161 (504) T ss_pred CCCCChhhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCc--eeE--------EEEeccceeEEEEeCCCC Confidence 7644332 4567777777888888999999999999998875432 221 111 222222221111 Q ss_pred -------ccccccc----ccccCcce-EEEEecCCcccccccCcccEEEecCccchhhh--hhccccCCcchHHHHHHHH Q lcl|NC_019404. 132 -------NREENPR----NARFGKPL-TYRITTNESDMFYDVHYSRIHIIDGERVPNAM--RRQNDGWGRSVLSSDILDS 197 (418) Q Consensus 132 -------~~~~dp~----s~~yg~p~-~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~--~~~~~~~G~S~l~~~~~~~ 197 (418) .++.|.. .-.++.|. .|.+...+.+. ....+.-|.-|-|+.... ......+|.|.+.+++.+. T Consensus 162 ~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~---~~~~~~~~~~gvPvV~~~n~~~~~~~~G~sei~~~v~~l 238 (504) T protein:vir:99 162 AMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGD---WHADVRTHKLGVPVEVLPYKPREDRPLGSSRITRPVMSL 238 (504) T ss_pred ceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCce---eeeccccCCCCcceEEecccccCccccCcccchhhHHHH Confidence 1111111 11112221 22222111110 001111111122221110 1124568999887778888 Q ss_pred HHHHHHHHHHHHHHHHHcCCceeecchHHHh---hcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEe-ecccCC Q lcl|NC_019404. 198 IKDYTNCERLATQLLRRKQQAVWKAKGLAEL---CDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVL-NSDIGG 273 (418) Q Consensus 198 l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~---~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~-~~~~~g 273 (418) +.++++++........-++.+...+.|...- ..++........+...+..........+..+..-++-++ ..++.+ T Consensus 239 ~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~~ 318 (504) T protein:vir:99 239 QQRALKGCIRMDGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVKQFPASSPQP 318 (504) T ss_pred HHHHHHHHHHHHHHHHHhcchhhhhccCCccccccccccccchhhhhhhhhhcCCCccccccccCccceeeecCCCChHH Confidence 8888888877766666666655555443211 111111111122211111111111111111111223222 234556 Q ss_pred HHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhhc--------c- Q lcl|NC_019404. 274 IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTAL---ETFHKLIDRKRNAELLPILEFLIPFIVN--------A- 341 (418) Q Consensus 274 l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~---~~y~~~I~~~Qe~~l~p~l~~l~~~i~~--------~- 341 (418) ..+.++....+||+.++||..-| |.....-++||+.-. ......++.+|+ .+...++++..+.+. . T Consensus 319 ~~~~l~~~i~~~a~~t~~P~~~l-G~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~-~f~~~l~~~~rla~~~~~~~~~~~~ 396 (504) T protein:vir:99 319 HIEMLEQIAMMFSGETSIPVESL-GFSNRANPTSADAYIASREDLIAEAEGATD-DWSPAFRRSMIRALAIKNGLDRIPP 396 (504) T ss_pred HHHHHHHHHHHHHhhhCCCHHHh-cccccccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCcccc Confidence 77788899999999999997644 654433345665443 334455555554 357778777776532 1 Q ss_pred --CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH-------HH-hCCCCCHHHHHHHHHhhc---------------Cc-- Q lcl|NC_019404. 342 --EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA-------LI-AAGAMDIKEARDTLRTIA---------------PE-- 394 (418) Q Consensus 342 --~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~-------~~-~~g~i~~~e~r~~l~~~~---------------~~-- 394 (418) .++.+.|.+...+|..+.|+...|.+++... +. ..|+ +++|+.+.....- .. T Consensus 397 ~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~-~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~ 475 (504) T protein:vir:99 397 EWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGL-TPQQAKRALAERRRASSVSIIEALNRRQQEAA 475 (504) T ss_pred ccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCC-CHHHHHHHHHHHHHHhhHHHHHHHhcccCCCC Confidence 2467889999999999988877666664321 11 2354 5555543221100 00 Q ss_pred CCCChhhccccccc--------CCCcccc Q lcl|NC_019404. 395 IKIGDNDIQTEESE--------LITETEV 415 (418) Q Consensus 395 ~~~~~~~~~~~e~~--------~~~e~e~ 415 (418) ..-++++-+..|+. ..+.-|+ T Consensus 476 ~~~~~~~~~~~e~a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 476 TAGEDQDQGAGEPPANEPPAALGRPTLVG 504 (504) T ss_pred CCCCCCCcCCCCCCCCCCCccCCCcccCC Confidence 00000000000000 0111111 No 152 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=99.39 E-value=2.8e-12 Score=83.95 Aligned_cols=382 Identities=13% Similarity=0.108 Sum_probs=181.9 Q ss_pred CccchhhHHHHhcCCCC----ccccCccccCCHHHH-HHHHHcCCccchhhhcchhhhccCCccccCcchH-HHHHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDG----SEIYGSLQNQAPTIL-ASLYADNALVRRIIDTIPETALAAGFHIDGIDDE-PAFWSRWD 74 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~----~~~~~~~~~~~~~~l-~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-~~i~~~~~ 74 (418) .-++.-+..-..|-+.- ...++.......... -..--.+.+++.||+..+..++-+++.+++.++. +++...+. T Consensus 30 ~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Ivd~~~~yl~G~Pv~~~~~d~~~~e~~~~l~ 109 (537) T protein:vir:78 30 IKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELVDQLAQYLLSNGVEVKVKDEDNTQLDEILQ 109 (537) T ss_pred HHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHHHHHhhhhcccCceeecCcchhHHHHHHHH Confidence 11111111112222110 000110000000000 0001236899999999999999999999875432 33333443 Q ss_pred Hh---CchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccc---ccccc--------- Q lcl|NC_019404. 75 DL---EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNRE---ENPRN--------- 139 (418) Q Consensus 75 ~l---~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~---~dp~s--------- 139 (418) .+ +......+..+....||.|++++.++.+.. ++ +.++++.++-|.+-+ ..+.. T Consensus 110 ~~~~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~----------~~-~~~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~ 178 (537) T protein:vir:78 110 EYFDEDFQATIDTLVTNASKKGFEGIFARTTSEGK----------LK-FQTVDGLTLIPVFDDYGVLKMIIRWYSEIRYS 178 (537) T ss_pred HHhhccHHHHHHHHHHHHhhcCeeEEEeeecCCCc----------eE-EEEEccceeEEEEcCCCCceeEEEEEeeeecc Confidence 32 445667788888999999999887753221 11 112222221111000 00000 Q ss_pred -----------cccC---cceEEEEecCCccc----ccccCcccEEEec---C-------------------ccchhhhh Q lcl|NC_019404. 140 -----------ARFG---KPLTYRITTNESDM----FYDVHYSRIHIID---G-------------------ERVPNAMR 179 (418) Q Consensus 140 -----------~~yg---~p~~y~i~~~~~~~----~~~iH~SR~i~~~---g-------------------~~lp~~~~ 179 (418) -.++ ....|....++... ...+....+-++. . ..+|. .. T Consensus 179 ~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv-v~ 257 (537) T protein:vir:78 179 TKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPF-QL 257 (537) T ss_pred ccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeeccccccccccccccccccccCCcceeE-EE Confidence 0000 01111111110000 0000000000000 0 00111 11 Q ss_pred hccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEc Q lcl|NC_019404. 180 RQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDA 259 (418) Q Consensus 180 ~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~ 259 (418) ..++-+|.|.++. +-+.+.+|+.+....+..+..+.-.++.+.+.. +....+.+..+ ...+.+.+++ T Consensus 258 f~nn~~~~sd~e~-v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~-----~~~~~~~~~~l-------~~~~~i~v~~ 324 (537) T protein:vir:78 258 LYNNKDGMSDVKR-VKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFS-----GDSTDKLRQNI-------KAKKMIGVNG 324 (537) T ss_pred eccCccCCCchhh-hHHHHHHHHHHHHhhhhHHHHhcCceeeeecCC-----CccchhHHHHH-------hhcCceeecC Confidence 2234467888875 889999999999999999999988888887631 11111112111 1244556666 Q ss_pred CCCceeEee--cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHH--HHHHHHHHHHHHHHHHH Q lcl|NC_019404. 260 ESEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKL--IDRKRNAELLPILEFLI 335 (418) Q Consensus 260 ~~e~~~~~~--~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~--I~~~Qe~~l~p~l~~l~ 335 (418) ++.+++.++ .+..+.+..++.+.+.|-..+..|-+ .....| |+||..-.-.|... -....+..++..|++++ T Consensus 325 d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~---~~~~~g-n~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~ 400 (537) T protein:vir:78 325 DNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNS---TAVGDG-NVTNVVIKSRYTLLAMKARKMETSLRKVLRWCA 400 (537) T ss_pred CCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCC---cccccc-CCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 655555554 45567888899999888877777753 333334 45676444333333 22334455788888887 Q ss_pred HHhhc-----------cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcC---------cC Q lcl|NC_019404. 336 PFIVN-----------AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAP---------EI 395 (418) Q Consensus 336 ~~i~~-----------~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~---------~~ 395 (418) ++|+. ..++.+.|++-...+++|.|++ +.++++.|++|.+.+...+.-..+ .. T Consensus 401 ~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~-------~~~l~~~giiS~eT~l~~~p~vdd~e~ek~~~ee~ 473 (537) T protein:vir:78 401 DMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATT-------RKTEAETEALKIGNIMTVAPRIGDDETLKLIAEEL 473 (537) T ss_pred HHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHH-------HHHHHhcCcchHHHHHHhCCCCCCHHHHHHHHHHH Confidence 77752 1367899999999999888665 444556666666555432210000 00 Q ss_pred CCChhhcccccccCCCccccccC Q lcl|NC_019404. 396 KIGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 396 ~~~~~~~~~~e~~~~~e~e~~~~ 418 (418) ....+++.+...+...+.+.+.. T Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~ 496 (537) T protein:vir:78 474 DLDYNELKDALAEQDAQSLDVSP 496 (537) T ss_pred HhhhhhhhhhhhhhcccccCcCc Confidence 00000000000000011111111 No 153 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=99.39 E-value=2.4e-12 Score=84.30 Aligned_cols=376 Identities=13% Similarity=0.020 Sum_probs=188.6 Q ss_pred ccchhhHHHH-----------------hcCCCC----ccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccc Q lcl|NC_019404. 2 VKTDSYANIF-----------------LGGSDG----SEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHI 60 (418) Q Consensus 2 ~~~D~~~n~~-----------------~g~~~~----~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i 60 (418) ++.+-+..++ .|-+.- ........ ....-......+.+++.||+..+..++.+++.+ T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~--~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~ 78 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDE--NPLRNADNRISHNFHEILVDEKASYMFTYPVLF 78 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccccccccccc--ccccccccccccchHHHHHHhhhhheeccccee Confidence 4444333322 221100 00000000 000000012237999999999999999999998 Q ss_pred cCcchH--HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccccccc Q lcl|NC_019404. 61 DGIDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPR 138 (418) Q Consensus 61 ~~~~d~--~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~ 138 (418) ..+++. ..+.+.+.+=++.....++.+.+..||.|++++.++...... ....+.++ +.++++.++-|.+-+..-. T Consensus 79 ~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~--~~~~~~~~-~~~i~p~~~~~vydd~~~~ 155 (451) T protein:vir:10 79 DIDNNKELNEKVTDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGE--QVTNQTFK-YGVVNTEEIIPIYRNGIER 155 (451) T ss_pred ecCCcHHHHHHHHHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccc--ccccccee-EEEEcccceEEEEcCCCCC Confidence 765443 233444444567788889999999999999998875332211 11223332 3445554443322110000 Q ss_pred ccccCcceEEEEecCCcc--cc-----cc-cCcccEEEecC---c----------------cchhhhhhccccCCcchHH Q lcl|NC_019404. 139 NARFGKPLTYRITTNESD--MF-----YD-VHYSRIHIIDG---E----------------RVPNAMRRQNDGWGRSVLS 191 (418) Q Consensus 139 s~~yg~p~~y~i~~~~~~--~~-----~~-iH~SR~i~~~g---~----------------~lp~~~~~~~~~~G~S~l~ 191 (418) .+.++ ..+|.......+ .. .+ .-+.++.++.. . .+|. ....++..|.|.++ T Consensus 156 ~~~~~-ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv-v~~~nn~~~~~d~e 233 (451) T protein:vir:10 156 ELEAV-IRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPF-VEFSNNIKKQSDLS 233 (451) T ss_pred ceEEE-EEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeE-EEeccCCCCCCchh Confidence 11111 011111000000 00 00 00111122110 0 0111 11123445778786 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEc----CCCceeE- Q lcl|NC_019404. 192 SDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDA----ESEEYSV- 266 (418) Q Consensus 192 ~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~----~~e~~~~- 266 (418) .+-+.+.+|+.+....+..+..++..++++.+... .. ..+....+ ...+.+.+.. ++.+.+. T Consensus 234 -~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~----~~-~~~~~~~~-------~~~~~i~~~~~~~~~~~~~~~l 300 (451) T protein:vir:10 234 -KYKKILDLYDRVMSGFANDLEDIQQIIYILENFGG----ED-TSEFLKEL-------KRYKTIKTETDSEGDSGGLKTM 300 (451) T ss_pred -hHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCc----cc-chhhHHHH-------hhCCeEEecCcCCccCCcceEE Confidence 47889999999999999888888888888775321 11 11111111 1123333332 2223444 Q ss_pred -eecccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHhhc--- Q lcl|NC_019404. 267 -LNSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLI--DRKRNAELLPILEFLIPFIVN--- 340 (418) Q Consensus 267 -~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I--~~~Qe~~l~p~l~~l~~~i~~--- 340 (418) .+.+..+....++.+.+.|...+++|-.-. ...| |+||..-.-.|.... ...++..+++.+++++++++. T Consensus 301 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~---~~~g-n~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~ 376 (451) T protein:vir:10 301 QIEIPTEARKIILEILKKQIYESGQGLQQDT---ENFG-NASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLG 376 (451) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCcccccc---cccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 455677899999999999999999996422 1123 456664433333332 223344577888888888763 Q ss_pred ---cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhhc---ccccc------- Q lcl|NC_019404. 341 ---AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDNDI---QTEES------- 407 (418) Q Consensus 341 ---~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~~---~~~e~------- 407 (418) ..++.+.|++-...+++|.+++..+.+ |+||.+.+...+ ++..-..+++ .+++. T Consensus 377 ~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~---------g~iS~et~~~~~----p~v~d~~~e~~~~~ee~~~~~~~~~ 443 (451) T protein:vir:10 377 VTDYKKIQQTYTRNMMSNDLEDADIATKSV---------GIIPTKIILRHH----PWVDDVEEAEKLYLEEKKIQASKVS 443 (451) T ss_pred CCCccceeEEecCCCCCCHHHHHHHHHHHh---------ccCchHHHHHhC----CCCCCHHHHHHHHHHHHHHHHHHHH Confidence 247889999999999999877655432 566665555432 1111001111 00000 Q ss_pred -cCCCccc Q lcl|NC_019404. 408 -ELITETE 414 (418) Q Consensus 408 -~~~~e~e 414 (418) ....-++ T Consensus 444 ~~~~~~~~ 451 (451) T protein:vir:10 444 DDYNNFTE 451 (451) T ss_pred hhcCCCCC Confidence 0000000 No 154 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.39 E-value=2e-12 Score=84.74 Aligned_cols=369 Identities=11% Similarity=0.039 Sum_probs=186.7 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcch--HHHHHHHHHHhCc Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDD--EPAFWSRWDDLEM 78 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d--~~~i~~~~~~l~~ 78 (418) +-|.+-+..-..|-+.-- ..+..........+ ..++++.||+..+...+.+++.+.++++ .+.+.+-|++-++ T Consensus 33 ~~r~~~~~~yy~g~~~i~---~~~~~~~~~~~~ki--~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~i~~~N~~ 107 (453) T protein:vir:39 33 VARYEYLKNMYRGIMAID---AEPTKDLWKPDNRL--TVNFTKYIVDTFTGYFNGIPVKKSHSDKETLSKLQEFDNLNDM 107 (453) T ss_pred HHHHHHHHHHhhccCchh---cCCCccccCcccee--ecchHHHHHHHHhhhhcccCceeccCChHHHHHHHHHHHhcCh Confidence 222222222223322100 00000000000111 3578999999999999999999976554 3457777777889 Q ss_pred hHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeeccccccccccccccccc---------------- Q lcl|NC_019404. 79 TQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRTQVKVQNREENPRNAR---------------- 141 (418) Q Consensus 79 ~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~---------------- 141 (418) ...+.++++.+..||.|++++..+. +.. .+.++++.++.+.+-+.....+. T Consensus 108 ~~~~~~~~~~~~~~G~~~~~v~~d~~g~~------------~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~ 175 (453) T protein:vir:39 108 EDEESELAKMACIYGRAFELLYQNEETQT------------NVIYNTPENMFMVYDDTIKQEPLFAVRYGYDDDYKLYGE 175 (453) T ss_pred hHHHHHHHHHHhhcCeEEEEEEecCCCce------------EEEEEcccceEEEecCCCCCeEEEEEEEEEeCCeEEEEE Confidence 9999999999999999999887642 221 12233333332221100000010 Q ss_pred -cCcceEEEEecCCcccc---cccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019404. 142 -FGKPLTYRITTNESDMF---YDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ 217 (418) Q Consensus 142 -yg~p~~y~i~~~~~~~~---~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~ 217 (418) |..-..|++...++... ..-|+-..+.++ ...++.+|.|.++ .+.+.+.+++.+....+..+..++. T Consensus 176 ~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv--------~~~n~~~g~sd~e-~v~~liDa~~~~~s~~~~~~~~~~~ 246 (453) T protein:vir:39 176 VYTKETTYALNGTMGFYNMTEQAPNPFDDLPVV--------EFYFNEERMSIFE-SVISLVNAFNKAISEKANDVDYFSD 246 (453) T ss_pred EEeCCeEEEEEecCCceeeecccccCCCceeEE--------EecCCCCCCcchh-hhHHHHHHHHHHHHHHHHHHHHhhC Confidence 11111122222111100 011222222111 1223567899887 4889999999999988888888887 Q ss_pred ceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEE-----cCCCceeE--eecccCCHHHHHHHHHHHHhhhhc Q lcl|NC_019404. 218 AVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGID-----AESEEYSV--LNSDIGGIDAFLDKKFDRIVALSG 290 (418) Q Consensus 218 ~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d-----~~~e~~~~--~~~~~~gl~~~~~~~~~~iaaas~ 290 (418) .++.+.+.. + . ++. ...+.. +..+.+. +++.++.. .+.+..++...++.+.+.|...++ T Consensus 247 p~~~~~g~~-~-~-~~~----~~~~~~-------~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~ 312 (453) T protein:vir:39 247 QYLTFLGAA-V-E-EED----LKNIRS-------NRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTM 312 (453) T ss_pred ceeeeecCC-C-C-chh----hhhhhh-------cceeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhC Confidence 777766421 1 1 111 111110 1111111 12234444 445667889999999999999999 Q ss_pred CCeeeeeccCccccccchhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhc----------cCCceEEeCCCCCCCHH Q lcl|NC_019404. 291 IHEIILKNKNVGGLSSSQNTALETFH---KLIDRKRNAELLPILEFLIPFIVN----------AEEWSVEFSPLDHESSK 357 (418) Q Consensus 291 IP~t~L~G~s~~gl~stge~d~~~y~---~~I~~~Qe~~l~p~l~~l~~~i~~----------~~~~~~~f~pL~~~~ek 357 (418) +|-.-. + ..| |+||+.-...+. ..++.+| ..+...+++++.+++. ..++++.|++-...+.+ T Consensus 313 ~p~~~~-~--~~g-n~Sg~Al~~~~~~l~~ka~~~~-~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~ 387 (453) T protein:vir:39 313 VANISD-E--SFG-SSSGVSLAYKLQAMSNLALSFQ-RKFQSSLNSRYKLYCELSTNVSNKEAWKDIEYTFTRNEPKDIK 387 (453) T ss_pred Cccccc-c--ccc-CChHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCcCHH Confidence 996432 1 112 456664433333 3333333 3467777777776642 13678999999999999 Q ss_pred HHHHHHHHHHHHHHH---HHhCCCCC-HHHHHHHHHhhcCcC----CCChhhccc-ccccCCCccc Q lcl|NC_019404. 358 DKAEVLEKSVNSIAA---LIAAGAMD-IKEARDTLRTIAPEI----KIGDNDIQT-EESELITETE 414 (418) Q Consensus 358 e~ae~~~~~a~a~~~---~~~~g~i~-~~e~r~~l~~~~~~~----~~~~~~~~~-~e~~~~~e~e 414 (418) +.|++..+.+..++. +-..+.++ +++..+++++-.... .-...+.+. ++..++.+.| T Consensus 388 ~~a~~~~kl~g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 388 EQAETANILMGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVPETNEE 453 (453) T ss_pred HHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCcCCC Confidence 998876665533222 22233332 222222222110000 000001111 1112223333 No 155 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=99.38 E-value=1.2e-12 Score=86.02 Aligned_cols=355 Identities=11% Similarity=0.023 Sum_probs=158.0 Q ss_pred CccchhhHHHHhcCCCCc-cccCcc-ccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHH-HHhC Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGS-EIYGSL-QNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRW-DDLE 77 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~-~~~~~~-~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~-~~l~ 77 (418) |==-|-+.+.+ +..... ..+... ...........|..+..+.++|+.++++.-+-++.+...+.. ....+ ..|+ T Consensus 1 Mg~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~~--~~~~~~~lL~ 77 (395) T protein:vir:40 1 MGFKSWVSGFF-NEEQRTLNLTDTVWCSIPSEKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKGEE--VRKKNWYMFN 77 (395) T ss_pred CchHHHHHhhh-cccccccccccchhhccccccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCCcc--ccchHHHHHH Confidence 11111111111 111000 000000 000111223456678889999999999999988887532221 11111 1232 Q ss_pred c-------hHHHHHH-HHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEE Q lcl|NC_019404. 78 M-------TQNINDA-WSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYR 149 (418) Q Consensus 78 ~-------~~~~~~a-~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~ 149 (418) . ...|.++ +..-.|+|.|++++. ++.. .+.+.+.... ....++ ..+. T Consensus 78 ~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~-~~~~---------------~~~~~~~~~~-----~~~~~~----~~~~ 132 (395) T protein:vir:40 78 VEANQNQNATEFWKKAIYKLVYDNEALIFMQ-DEYI---------------YVADSFTKND-----KSLYEN----TYTE 132 (395) T ss_pred hcCCCCCCHHHHHHHHHHHHhhcCceEEEEe-cCce---------------eecCCccccc-----cccccc----eeee Confidence 2 2455554 444556899998763 2211 0111111100 000110 1111 Q ss_pred EecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC-ceeecchHHHh Q lcl|NC_019404. 150 ITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ-AVWKAKGLAEL 228 (418) Q Consensus 150 i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~-~v~k~~~l~~~ 228 (418) +...+......+.++.|+||.... .....++.+.+. .+...+.. ......+..+. .+++++.... T Consensus 133 v~~~~~~~~~~~~~~evih~r~~~------~~~~~~~~~l~~-~~~~~~~~------~~~~~~~~~~~~~~l~~~~~~~- 198 (395) T protein:vir:40 133 VTLKDLTLKKEFKESEVLHLTLNN------ESIKSIIDGFYL-LYGDLLTA------AVNKYKKLNSRKIIVKLKAMFG- 198 (395) T ss_pred eeecCceeeeeeccccEEEeecCC------CCccccchhHHH-HHHHHHHH------HHHHHHhcCCCCceEEEecccC- Confidence 222222222357788999985221 111222332222 12222211 11122222222 2223321111 Q ss_pred hcCcchHHHHHHHHHHHHH-hcCCcceeEEEcCCCceeEeecccCCHH--HHH---HHHHHHHhhhhcCCeeeeeccCcc Q lcl|NC_019404. 229 CDDSEGFGAARLRLAQVDN-NSGVGQAIGIDAESEEYSVLNSDIGGID--AFL---DKKFDRIVALSGIHEIILKNKNVG 302 (418) Q Consensus 229 ~~~~~~~~~~~~r~~~~~~-~~~~~~~~~~d~~~e~~~~~~~~~~gl~--~~~---~~~~~~iaaas~IP~t~L~G~s~~ 302 (418) + +++...+.++++...-. ..++.+.+++..++.+|++++.+..... ++. +.+..+||.+.|||..+|.| T Consensus 199 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~---- 273 (395) T protein:vir:40 199 Q-TPEAEEKLRLMLSERMKKFLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKG---- 273 (395) T ss_pred C-CHHHHHHHHHHHHHHHHHhhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC---- Confidence 1 22223344444443322 2234455555556678999988776533 222 23356899999999987732 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--------ccCCceEE--eCCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 303 GLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV--------NAEEWSVE--FSPLDHESSKDKAEVLEKSVNSIAA 372 (418) Q Consensus 303 gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~--------~~~~~~~~--f~pL~~~~eke~ae~~~~~a~a~~~ 372 (418) . .++.+.....||. ..|.|.++.+-..+- +..++.|+ +.+|...|.+++ ++++.+ T Consensus 274 ~-~sn~e~~~~~f~~-------~~L~P~~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~-------~~~~~~ 338 (395) T protein:vir:40 274 D-TVGLSEQVNSFLM-------FSINPIAEMFTDEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEI-------ASSMDV 338 (395) T ss_pred C-CcCHHHHHHHHHH-------HHHHHHHHHHHHHHHHhcCChhhhcCCceEEEechhhhccCHHHH-------HHHHHH Confidence 1 2223444555654 347777776654432 12234444 457777777665 567888 Q ss_pred HHhCCCCCHHHHHHHHHhhcCcCC-CCh--------hhcccccc---cCCCccccccC Q lcl|NC_019404. 373 LIAAGAMDIKEARDTLRTIAPEIK-IGD--------NDIQTEES---ELITETEVVIA 418 (418) Q Consensus 373 ~~~~g~i~~~e~r~~l~~~~~~~~-~~~--------~~~~~~e~---~~~~e~e~~~~ 418 (418) ++++|+++++|+|+.+. ..|-.+ ..| ..+...++ ..+.+++.--+ T Consensus 339 ~~~~G~~t~NE~R~~~g-~~pi~~~~gD~~~~~~n~~~~~~~~~~~kgge~~~~~~~~ 395 (395) T protein:vir:40 339 LFHIGVNTIDDNLRMIG-REPVMSPETQERFVTKNYAPLGENEEDLKGGDINENKGDS 395 (395) T ss_pred HHhCCCCCHHHHHHHhC-CCCCCCCCCceeeeccccccccccccccCCCCCCCCcCCC Confidence 99999999999998763 222111 111 00000000 00111111111 No 156 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=99.38 E-value=9.7e-14 Score=91.92 Aligned_cols=324 Identities=13% Similarity=0.040 Sum_probs=157.3 Q ss_pred hhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc--Ccc----hH------HHHHHH Q lcl|NC_019404. 5 DSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID--GID----DE------PAFWSR 72 (418) Q Consensus 5 D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~~~----d~------~~i~~~ 72 (418) =||.|-+.+-.+...........++ +-...+..+..+.+||+.+|++.-+-++.+- ..+ +. ..+..- T Consensus 1 Mg~f~~~~~f~~~~~~~~~~~~~~~-~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~l 79 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLNNDTQRVTAW-QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLDEV 79 (378) T ss_pred CccchhhhhhhccccCCCcceeeec-ccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccccccccccccccchHHHH Confidence 0111111100000000111111111 1122333556789999999999999888652 110 10 011212 Q ss_pred HH-H---hCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceE Q lcl|NC_019404. 73 WD-D---LEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLT 147 (418) Q Consensus 73 ~~-~---l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~ 147 (418) +. + .-....|.+.+. +-.++|.|++++..++. .+.+.++.+. T Consensus 80 L~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~---------~g~~~~l~~~------------------------ 126 (378) T protein:vir:93 80 LNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDN---------TGELLDLLFA------------------------ 126 (378) T ss_pred HhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecC---------CceEEEEEec------------------------ Confidence 21 1 112234444444 45568999988754321 1222222110 Q ss_pred EEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHH Q lcl|NC_019404. 148 YRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAE 227 (418) Q Consensus 148 y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~ 227 (418) ....++.++.|+|+.+ | .+...|.|++.. +...+..+-. +. .---++++++. T Consensus 127 --------~~~~~~~~~diih~r~-~-------~~~~~~~s~l~~-~~~~i~~~~~---~~------~~~g~l~~~~~-- 178 (378) T protein:vir:93 127 --------DDKKEYKTEELVRLTS-P-------FYINEDTSILDN-ALASIQTKLE---QG------KLRGLLKINAF-- 178 (378) T ss_pred --------CCeeEeccceeEEecC-c-------cccchhhHHHHH-HHHHHHHHHh---cC------cccceeeeCCc-- Confidence 0123577889999963 1 122236776653 4444433211 10 01123454421 Q ss_pred hhcCcchHHHHHHHHHHHHHhc---CCcceeEEEcCCCceeEeecccCCHH-HHHHHHHHHHhhhhcCCeeeeeccCccc Q lcl|NC_019404. 228 LCDDSEGFGAARLRLAQVDNNS---GVGQAIGIDAESEEYSVLNSDIGGID-AFLDKKFDRIVALSGIHEIILKNKNVGG 303 (418) Q Consensus 228 ~~~~~~~~~~~~~r~~~~~~~~---~~~~~~~~d~~~e~~~~~~~~~~gl~-~~~~~~~~~iaaas~IP~t~L~G~s~~g 303 (418) +. ++...+.++++...-... ++.+.+++..++.+|+.++.+....+ +..++....||.+.+||..+|.| T Consensus 179 -l~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g----- 251 (378) T protein:vir:93 179 -LD-IDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLG----- 251 (378) T ss_pred -CC-HHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcC----- Confidence 11 222333444444433211 12334455555678998887665433 34567889999999999988743 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcc-------------CCceEEeCCCCCCCHHHHHHHHHHHH Q lcl|NC_019404. 304 LSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VNA-------------EEWSVEFSPLDHESSKDKAEVLEKSV 367 (418) Q Consensus 304 l~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~-------------~~~~~~f~pL~~~~eke~ae~~~~~a 367 (418) +..+....+||. .-|.|.+..+-..+ +.+ .++.|+++.|...|.+++ + T Consensus 252 --~~~e~~~~~f~~-------~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~-------~ 315 (378) T protein:vir:93 252 --TATQEQQIYFYN-------STIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKEL-------I 315 (378) T ss_pred --CcHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHH-------H Confidence 122444455543 34778776665443 111 135677778887777665 6 Q ss_pred HHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC-------h---hhc------ccccc-cCCCccc Q lcl|NC_019404. 368 NSIAALIAAGAMDIKEARDTLRTIAPEIKIG-------D---NDI------QTEES-ELITETE 414 (418) Q Consensus 368 ~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~-------~---~~~------~~~e~-~~~~e~e 414 (418) +++.+++++|+++++|+|+.+. ..|..+.+ . ++. +..+. .-++.+| T Consensus 316 ~~~~~~~~~G~~t~NE~R~~~g-l~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 316 DLYHENINGPIFTQNQLLVKMG-EQPIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred HHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeeeeccccccccchhhhcCccCCCCCCCCCCCC Confidence 6788999999999999999763 22211110 0 000 00111 1112233 No 157 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=99.35 E-value=9e-13 Score=86.62 Aligned_cols=263 Identities=11% Similarity=0.004 Sum_probs=145.9 Q ss_pred hhhcchhhhccCCccccCcchHHHHHHHHH----HhCchHHHHHHH-HhccccceEEEEEeecCCCcccccccCCCceEE Q lcl|NC_019404. 45 IIDTIPETALAAGFHIDGIDDEPAFWSRWD----DLEMTQNINDAW-SWARLFGGAAIVAIVKDNRALTSPVREGAELET 119 (418) Q Consensus 45 iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~----~l~~~~~~~~a~-~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~ 119 (418) |-..|.. ..++... ....+...+. .......|.+.+ ..-.++|.|++++.- + ..|.+.. T Consensus 1 ia~l~~~-~~~~~~~-----~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r-~---------~~G~~~~ 64 (278) T protein:vir:78 1 MASLPLK-MYEDYKV-----VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIER-D---------IYHQPSK 64 (278) T ss_pred CccceeE-EEecCcc-----cccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEE-C---------CCCcEEE Confidence 1111111 1111111 0111222221 122233344444 455668999888753 2 2356778 Q ss_pred EEEeeccccccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHH Q lcl|NC_019404. 120 VRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIK 199 (418) Q Consensus 120 i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~ 199 (418) +.+++++++++.... .|.+..|.+...++. ...+.++.||||.... +...++|.|++.. +...+. T Consensus 65 l~~l~~~~v~v~~~~-------~~~~~~y~~~~~~g~-~~~~~~~evih~~~~~------~~~~~~G~s~~~~-~~~~i~ 129 (278) T protein:vir:78 65 LFLLNPDVVEMLIEN-------QSRELYYSIHAATGN-KLIVHNMDMLHFKHIV------ASNMVQGISPIDV-LKNTTD 129 (278) T ss_pred EEEECCceeEEEEcC-------CCceEEEEEEcCCce-EEEEccccEEEECCCC------CCCCeeeccHHHH-HHHHHH Confidence 999999988765432 345667777665443 3578999999995321 2344679999975 777777 Q ss_pred HHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC--HHHH Q lcl|NC_019404. 200 DYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAF 277 (418) Q Consensus 200 ~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g--l~~~ 277 (418) ....+......-.....--+++.++ . + +++......+++... . ++.+.+++..++.+|++++.+..+ +.+. T Consensus 130 ~~~~~~~~~~~~~~~~~~~i~~~~~--~-l-~~e~~~~~~~~~~~~--~-~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~ 202 (278) T protein:vir:78 130 FDNAVRTFNLTEMQKPDSFMLKYGS--N-V-GKEKRQQVLEDFKQY--Y-EENGGILFQEPGVEIEPLPKKYVSEDIVAS 202 (278) T ss_pred HHHHHHHHHHHHhcCCCcEEEEeCC--C-C-CHHHHHHHHHHHHHH--h-ccCCCceecCCCceEEEccCChhHHHHHHH Confidence 6666655543222222222333332 1 1 123334445555432 2 334445555556789988877664 4466 Q ss_pred HHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----c----CCceEEeC Q lcl|NC_019404. 278 LDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVN----A----EEWSVEFS 349 (418) Q Consensus 278 ~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~----~----~~~~~~f~ 349 (418) .+...+.||.+.|||..++ |...++-.++.++..+.||..+ +.|.++.+-..+-+ . .++.|+|+ T Consensus 203 ~~~~~~~Ia~~fgVpp~~l-g~~~~~~~sn~~~~~~~~~~~~-------l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~ 274 (278) T protein:vir:78 203 ENLTRERVANVFQLPSVFL-NARSNTNFAKNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLTKTDREKIGILNLT 274 (278) T ss_pred HHHHHHHHHHHhCCCHHHh-CCCCCCCcccHHHHHHHHHHHH-------HHHHHHHHHHHHHhhcCChhHhcCCceEEEe Confidence 7788999999999997544 6665554556666666776653 77877666555421 1 24556665 Q ss_pred CCCCC Q lcl|NC_019404. 350 PLDHE 354 (418) Q Consensus 350 pL~~~ 354 (418) +..+ T Consensus 275 -~~~l 278 (278) T protein:vir:78 275 -LNLI 278 (278) T ss_pred -cccC Confidence 2222 No 158 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.35 E-value=4.9e-13 Score=88.09 Aligned_cols=396 Identities=13% Similarity=0.084 Sum_probs=178.8 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH-HHHHHHHHHhCch Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE-PAFWSRWDDLEMT 79 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-~~i~~~~~~l~~~ 79 (418) +=|.+-+.--+-|-..- .+.....+.++........++++|||..++-+.-.||.+.+.++. ..+.+.|++-++. T Consensus 29 ~~r~~~l~~YY~G~~~i----~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~~~g~~~~~~~~~~~~~~~i~~~N~~d 104 (486) T protein:vir:42 29 SKDLASNTSYYDAERRP----EAIGVTVPREMQQLLAHVGYPRLYVDSVAERQAVEGFRLGDADEADEELWQWWQANNLD 104 (486) T ss_pred HHHHHHHHHHhcccCcc----hhcccccchhHhhhhhccchHHHHHHHHHhhhcccceecCCCchhHHHHHHHHHhcChh Confidence 00000000001121100 011111344555555567789999999999988888887654443 3467777777888 Q ss_pred HHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccccc--cc-------ccccCcce---- Q lcl|NC_019404. 80 QNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREEN--PR-------NARFGKPL---- 146 (418) Q Consensus 80 ~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~d--p~-------s~~yg~p~---- 146 (418) ....++++.+.+||.|++++..+.+.....+ ......++++++.++.+.+-... +. ..+.+.+. T Consensus 105 ~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~---~~~~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (486) T protein:vir:42 105 IEAPLGYTDAYVHGRSFITISKPDPQLDLGW---DQNVPIIRVEPPTRMHAEIDPRINRVSKAIRVAYDKEGNEIQAATL 181 (486) T ss_pred HHHHHHHHHHhhcCceEEEEecCCccccccc---CCCeeEEEEecccceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEE Confidence 8899999999999999998865422111000 01111223333333222110000 00 00111111 Q ss_pred -----EEEEecCCcccc---cccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc Q lcl|NC_019404. 147 -----TYRITTNESDMF---YDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA 218 (418) Q Consensus 147 -----~y~i~~~~~~~~---~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~ 218 (418) .|++...++... ..-|+-..+.+.. .++. ......||.|.++..+...+.+++++....+.....++.+ T Consensus 182 y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~--~~n~-~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p 258 (486) T protein:vir:42 182 YTPMETIGWFRADGEWAEWFNVPHGLGVVPVVP--LPNR-TRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVP 258 (486) T ss_pred EcCCcEEEEEecCCcEEeecceecCCCCceEEE--eccc-cccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcch Confidence 222222111110 0113322111110 0111 1223358999887667777888888877766665555554 Q ss_pred eeecchHHHh-hcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecc---cCCHHHHHHHHHHHHhhhhcCCee Q lcl|NC_019404. 219 VWKAKGLAEL-CDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSD---IGGIDAFLDKKFDRIVALSGIHEI 294 (418) Q Consensus 219 v~k~~~l~~~-~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~---~~gl~~~~~~~~~~iaaas~IP~t 294 (418) ...+.+...- .....+.. ...+ ....+.+.... +++.+..+.+ +....+.+.....++|+.+++|.. T Consensus 259 ~~~i~G~~~~~~~~~~~~~--~~~~------~~~~~~~~~~~-~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~ 329 (486) T protein:vir:42 259 QRLIFGIKPEEIGVDSETG--QTLF------DAYLARILAFE-DAEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQ 329 (486) T ss_pred HHHhhcCCccccccccccc--cchh------hhhhchhcccC-CCCceEEeecccCHHHHHHHHHHHHHHHhcccCCCHH Confidence 4444332110 00000000 0001 11112222222 2333333333 444566677778889999999988 Q ss_pred eeeccCccccccchhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhhcc----------CCceEEeCCCCCCCHHHHHH Q lcl|NC_019404. 295 ILKNKNVGGLSSSQNTALETFHKL---IDRKRNAELLPILEFLIPFIVNA----------EEWSVEFSPLDHESSKDKAE 361 (418) Q Consensus 295 ~L~G~s~~gl~stge~d~~~y~~~---I~~~Qe~~l~p~l~~l~~~i~~~----------~~~~~~f~pL~~~~eke~ae 361 (418) .|.|.+ .. ++||+.-...+... ++.+ +..+.+.|.+++.+++.- .++.+.|.+-..++..+.|+ T Consensus 330 ~fg~~~-~n-~~Sg~Al~~~~~~l~~ka~~~-~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~~~s~~~~ad 406 (486) T protein:vir:42 330 YLSTAA-DN-PASAEAIRAAESRLIKKVERK-NLMFGGAWEEAMRIAYRIMKGGDVPPDMLRMETVWRDPSTPTYAAKAD 406 (486) T ss_pred Hhcccc-Cc-hhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhcCCCccccceeeeEEecCCCCCCHHHHHH Confidence 775544 22 24565443333332 3333 345678888888776431 25778999998999988877 Q ss_pred HHHHHHHHHH-------HHHhCCCCCHHHH-HHHHHh-h-c------CcC-CCCh-hhcccccccCCCccccccC Q lcl|NC_019404. 362 VLEKSVNSIA-------ALIAAGAMDIKEA-RDTLRT-I-A------PEI-KIGD-NDIQTEESELITETEVVIA 418 (418) Q Consensus 362 ~~~~~a~a~~-------~~~~~g~i~~~e~-r~~l~~-~-~------~~~-~~~~-~~~~~~e~~~~~e~e~~~~ 418 (418) ...+.+++.. .+-..|++..... .+.+.+ . . ... .-+. .+-..+..++....+..-+ T Consensus 407 ~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (486) T protein:vir:42 407 AATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIES 481 (486) T ss_pred HHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCC Confidence 6655554311 1222333322111 011100 0 0 000 0000 0000000011111111111 No 159 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.34 E-value=1.4e-12 Score=85.61 Aligned_cols=376 Identities=11% Similarity=0.070 Sum_probs=178.6 Q ss_pred ccchh---hHH-----------------HHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc Q lcl|NC_019404. 2 VKTDS---YAN-----------------IFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID 61 (418) Q Consensus 2 ~~~D~---~~n-----------------~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~ 61 (418) |+-|. +.. -..|-..- ... ....+.++......+.+++.||+..++-+.-.||... T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i-~~~---~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~g~~~~ 76 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYEGSNRV-RDL---GVAIPPELQRVQTVVSWPGIAVDALEERLDWLGWTNG 76 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc-hhc---CcccchhhhhhhhhcchHHHHHHHHHhhhccccccCC Confidence 22222 111 11221110 000 1112234444555679999999999998877776643 Q ss_pred CcchHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccccCCCceEEEEEeecc--ccccc--ccccc Q lcl|NC_019404. 62 GIDDEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRT--QVKVQ--NREEN 136 (418) Q Consensus 62 ~~~d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl~~~~~i~~i~v~~~~--~i~~~--~~~~d 136 (418) + .+.+.+-|++-++.....++++.+.+||.|++++..+ +|.+.-.++++... +.+.+.. .+... .+..+ T Consensus 77 --d-~~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~---~~i~d~~~~~~~~~~~~~~~~ 150 (441) T protein:vir:80 77 --D-GYGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSPKNC---TGKFSADGSRLDAGLVVQQTC 150 (441) T ss_pred --C-hHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEccceE---EEEEeCCCCceeEEEEEEEEe Confidence 2 2456667777888999999999999999999988764 23322222221110 0011110 01100 00000 Q ss_pred -cc---ccccCcceEEEEecCCcccc----cccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 137 -PR---NARFGKPLTYRITTNESDMF----YDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLA 208 (418) Q Consensus 137 -p~---s~~yg~p~~y~i~~~~~~~~----~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~ 208 (418) .. ..-|..-..|++...+.+.- ..-|+--.+.++ +.+.. .....+||.|.+.+.+.+.+.+++.++... T Consensus 151 ~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv--~~~n~-~~~~~~~G~s~l~~~v~~liDa~~~~~s~~ 227 (441) T protein:vir:80 151 DPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLV--PIVNR-RRTSRIDGRSEITRSIRAYTDEAVRTLLGQ 227 (441) T ss_pred cCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEE--Eeecc-ccCCccCCcccchhhHHHHHHHHHHHHHHH Confidence 00 00011112222211111100 112222111111 00111 122446899988766777888888888887 Q ss_pred HHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEc--CCCceeEeecccCCHH---HHHHHHHH Q lcl|NC_019404. 209 TQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDA--ESEEYSVLNSDIGGID---AFLDKKFD 283 (418) Q Consensus 209 ~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~--~~e~~~~~~~~~~gl~---~~~~~~~~ 283 (418) ...+..++.+.+.+.+.. + .. ...-..+. ..+....+++ +++..+..+.+-++++ +.++.... T Consensus 228 ~~~~~~~~~~~~~i~G~~-~-~~---~~~~~~~~-------~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~ 295 (441) T protein:vir:80 228 SVNRDFYAYPQRWVTGVS-A-DE---FSQPGWVL-------SMASVWAVDKDDDGDTPNVGSFPVNSPTPYSDQMRLLAQ 295 (441) T ss_pred HHHHHhhcCceeeeecCC-c-cc---cccchhhh-------cccccccCCCCCCCCcceeEecCccchHHHHHHHHHHHH Confidence 777777777666665431 1 11 00000111 0112222322 2223444444434444 45666689 Q ss_pred HHhhhhcCCeeeeeccCccccccchhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHhhcc-----------CCceEEeCC Q lcl|NC_019404. 284 RIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLI--DRKRNAELLPILEFLIPFIVNA-----------EEWSVEFSP 350 (418) Q Consensus 284 ~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I--~~~Qe~~l~p~l~~l~~~i~~~-----------~~~~~~f~p 350 (418) .+++.++||...| |.++.. ++||+.-.-.+...+ ...++..+.+.|.+++.+++.- .++++.|++ T Consensus 296 ~~~~~~~~p~~~~-g~~~~~-~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~ 373 (441) T protein:vir:80 296 LTAGEAAVPERYF-GFITSN-PPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFFGDVGLRWRD 373 (441) T ss_pred HHhcccCCCHHHh-ccCCCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccceeeeEEeCC Confidence 9999999998666 444432 234554332222222 2233345678888887776541 256889999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHH--HHHHHHHhhcCcCCCChhhcccccccCC------CccccccC Q lcl|NC_019404. 351 LDHESSKDKAEVLEKSVNSIAALIAAGAMDIK--EARDTLRTIAPEIKIGDNDIQTEESELI------TETEVVIA 418 (418) Q Consensus 351 L~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~--e~r~~l~~~~~~~~~~~~~~~~~e~~~~------~e~e~~~~ 418 (418) -...+..|. |+++.+++++|++..+ .++. .. +.++++++.-+.+-. ...-++++ T Consensus 374 ~~~~~~~e~-------ad~~~kl~~~g~~~~s~~~~~~----~l---~~~~~e~~~~~~e~~e~~~~~~~~~~~~~ 435 (441) T protein:vir:80 374 ASTPTRAAT-------ADAVTKLVGAGILPADSRTVLE----ML---GLDDVQVEAVMRHRAESSDPLAVLAGAIS 435 (441) T ss_pred CCCcCHHHH-------HHHHHHHHhcCcccccHHHHHH----hC---CCCHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 888888776 4556666666764322 2221 11 222222222111110 00111111 No 160 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=99.33 E-value=3e-13 Score=89.25 Aligned_cols=322 Identities=13% Similarity=0.035 Sum_probs=155.5 Q ss_pred hhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc--Ccc----hH------HHHHHH Q lcl|NC_019404. 5 DSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID--GID----DE------PAFWSR 72 (418) Q Consensus 5 D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~~~----d~------~~i~~~ 72 (418) =||.+-.-+-.+........+..++ +-+.....+..+++||+.+|++.-+-++.+- ..+ +. ..+... T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~~~~~~l~~l 79 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNNDTQRVTAW-QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLDEV 79 (378) T ss_pred CccchhhhhhhcccccCCcceeeec-ccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccccccccccccchHHHH Confidence 0111111000000001111111111 1112222456789999999999999888651 110 10 011222 Q ss_pred HH----HhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceE Q lcl|NC_019404. 73 WD----DLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLT 147 (418) Q Consensus 73 ~~----~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~ 147 (418) +. ..-....|.+.+.. -.++|.|++++..++ . .+.+.++.+. T Consensus 80 L~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~-~--------~g~~~~l~~~------------------------ 126 (378) T protein:vir:16 80 LNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDD-N--------TGELLDLLFA------------------------ 126 (378) T ss_pred HhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeec-C--------CceEEEEEec------------------------ Confidence 21 11123445554444 445799998875432 1 1222222110 Q ss_pred EEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC-ceeecchHH Q lcl|NC_019404. 148 YRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ-AVWKAKGLA 226 (418) Q Consensus 148 y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~-~v~k~~~l~ 226 (418) + ...++.++.||||.+. .+..-|.|++.. +...+... +.+ ... -+++++.. T Consensus 127 ------~--~~~~~~~~diih~r~~--------~~~~~~~s~l~~-~~~~i~~~---~~~-------~~~~g~l~~~~~- 178 (378) T protein:vir:16 127 ------D--DKKEYKPEELVRLTSP--------FYINEDTSILDN-ALASIQTK---LEQ-------GKLRGLLKINAF- 178 (378) T ss_pred ------C--CeeEecccceEEecCc--------cCccchhHHHHH-HHHHHHHH---Hhc-------CccceeeEeCCc- Confidence 0 1135677889999631 111235666643 44433222 111 111 23444421 Q ss_pred HhhcCcchHHHHHHHHHHHHHhc---CCcceeEEEcCCCceeEeecccCCHH-HHHHHHHHHHhhhhcCCeeeeeccCcc Q lcl|NC_019404. 227 ELCDDSEGFGAARLRLAQVDNNS---GVGQAIGIDAESEEYSVLNSDIGGID-AFLDKKFDRIVALSGIHEIILKNKNVG 302 (418) Q Consensus 227 ~~~~~~~~~~~~~~r~~~~~~~~---~~~~~~~~d~~~e~~~~~~~~~~gl~-~~~~~~~~~iaaas~IP~t~L~G~s~~ 302 (418) +. ++...+..+++...-... ++.+.+++..++.+|++++.+....+ ...++...+||.+.+||..+|.|. T Consensus 179 --l~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g~--- 252 (378) T protein:vir:16 179 --LD-IDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGT--- 252 (378) T ss_pred --CC-HHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC--- Confidence 11 122233444444433221 12334555555678998887664322 234677889999999999888441 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---cc-------------CCceEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_019404. 303 GLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---NA-------------EEWSVEFSPLDHESSKDKAEVLEKS 366 (418) Q Consensus 303 gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~~-------------~~~~~~f~pL~~~~eke~ae~~~~~ 366 (418) +.+....+||.. -|.|++..+-..+- .+ .++.|+++.|...|.+++ T Consensus 253 ----~~e~~~~~f~~~-------tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~------- 314 (378) T protein:vir:16 253 ----ASQEQQIYFYNS-------TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKEL------- 314 (378) T ss_pred ----chHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHH------- Confidence 123444556543 37787766655442 11 136677788888888765 Q ss_pred HHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCCh-----------hhcc------cccc-cCCCccc Q lcl|NC_019404. 367 VNSIAALIAAGAMDIKEARDTLRTIAPEIKIGD-----------NDIQ------TEES-ELITETE 414 (418) Q Consensus 367 a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~-----------~~~~------~~e~-~~~~e~e 414 (418) ++++.++++.|++|++|+|+.+. ..|..+ +| ++.. ..+. .-++.+| T Consensus 315 ~~~~~~~~~~G~~T~NE~R~~~g-~~p~~g-gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 315 IDLYHENINGPIFTQNQLLVKMG-EQPIEG-GDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred HHHHHHHHhCCCcCHHHHHHHhC-CCCCCC-CCeEeeccccccccchhhhcCccCCCCCCCCCCCC Confidence 66788999999999999999763 222111 11 0000 0111 1122233 No 161 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.32 E-value=5.5e-12 Score=82.32 Aligned_cols=359 Identities=10% Similarity=0.020 Sum_probs=190.6 Q ss_pred ccchhhHHHHhc--CCCC-----cccc-Cc-----cccCCHHHHHHHHH-cCCccchhhhcchhhhccCCccccCcchHH Q lcl|NC_019404. 2 VKTDSYANIFLG--GSDG-----SEIY-GS-----LQNQAPTILASLYA-DNALVRRIIDTIPETALAAGFHIDGIDDEP 67 (418) Q Consensus 2 ~~~D~~~n~~~g--~~~~-----~~~~-~~-----~~~~~~~~l~~~Y~-~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~ 67 (418) |..+.+.-+..- .... .++| |. ...-.+.++...|+ ..++.++|||.+++-+.=+||.. +|. T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~---~d~- 76 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFEN---DDF- 76 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhcccccccC---cch- Confidence 555555443311 1100 0011 11 11113445544443 34788999999999888788763 332 Q ss_pred HHHHHHHHhCchHHHHHHHHhccccceEEEEEeecC-CCcccccccCCCceEEEEEeecc--ccccccccccccccccCc Q lcl|NC_019404. 68 AFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTSPVREGAELETVRVYDRT--QVKVQNREENPRNARFGK 144 (418) Q Consensus 68 ~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~pl~~~~~i~~i~v~~~~--~i~~~~~~~dp~s~~yg~ 144 (418) .+.+-|.+-++.....++++.+.+||.|++++.-++ +.+.-.+..+... +.++|+. .+.......+.. ..|. T Consensus 77 ~l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~---~~i~D~~~~~~~~a~~~~~~d--~~~~ 151 (409) T protein:vir:16 77 TVNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNA---TGIIDPITGLLTEGYAVLERD--ENNN 151 (409) T ss_pred HHHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccce---EEEeecccccceeeeEEEEec--CCCc Confidence 356677777888889999999999999999886432 2211111111110 1112221 111111000000 0111 Q ss_pred ce---------EEEEecCCcccccccCccc---EEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 145 PL---------TYRITTNESDMFYDVHYSR---IHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLL 212 (418) Q Consensus 145 p~---------~y~i~~~~~~~~~~iH~SR---~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~ 212 (418) +. .|++...++.....-|+-- +++|..++ .....||.|.+.+++.+.+.++.+++....-.. T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~------~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~ 225 (409) T protein:vir:16 152 VVLEAHFLPDRTDYYYRDSRNNISIANPTGNPLLVPIIHRP------DAVRPFGRSRITRSGMYWQSNAKRTLERADVTA 225 (409) T ss_pred eEEEEEEecCcEEEEEecCccccceecCCCCcceEEecccc------cccccCCccccchhHHHHHHHHHHHHHHHHHHH Confidence 11 1111111111112234433 33333221 123568999887778888899999988777666 Q ss_pred HHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEE--EcCCCc--eeEe-ecccCCHHHHHHHHHHHHhh Q lcl|NC_019404. 213 RRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGI--DAESEE--YSVL-NSDIGGIDAFLDKKFDRIVA 287 (418) Q Consensus 213 ~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~--d~~~e~--~~~~-~~~~~gl~~~~~~~~~~iaa 287 (418) .-++.+...+.|+.. ++.......... +..+.+ +.+++. +.++ ..++.+.-+.++....++|+ T Consensus 226 e~~a~pqr~i~G~d~---d~~~~~~~~~~~---------~~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~~~a~ 293 (409) T protein:vir:16 226 EFYSFPQKYVTGLSD---DAEPMETWKATV---------SSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAG 293 (409) T ss_pred HHhcChhheeEecCC---CCCccchhhhhh---------hHhhccCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhh Confidence 667777766666421 122211111111 111222 122222 3222 34567788889999999999 Q ss_pred hhcCCeeeeeccCccccccchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------CCceEEeCCCCC Q lcl|NC_019404. 288 LSGIHEIILKNKNVGGLSSSQNT---ALETFHKLIDRKRNAELLPILEFLIPFIVNA-----------EEWSVEFSPLDH 353 (418) Q Consensus 288 as~IP~t~L~G~s~~gl~stge~---d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~-----------~~~~~~f~pL~~ 353 (418) .+++|...|.|++.. -+|++. ........++++|+. ..+.++++..+++.- .++.+.|.|+.. T Consensus 294 ~s~lP~~~lg~~~~N--psSa~Ai~a~~~~L~~ka~~k~~~-fg~~l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~ 370 (409) T protein:vir:16 294 ETGLTLDDLGFVSDN--PSSVEAIKASHENLRLAGRKAQRS-LGAGLLNVAYLAACLRDDVPYLREQFSKTKPKWEPLFE 370 (409) T ss_pred hcCCCHHHcccccCc--hhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCccchhhccceEEecCCCC Confidence 999999888766521 133332 223445566666654 577777776665421 145788998877 Q ss_pred CCHHHHHHHHHHHHHHHHHHHhCCC--CCHHHHHHHHHhhcCcCCCChhh Q lcl|NC_019404. 354 ESSKDKAEVLEKSVNSIAALIAAGA--MDIKEARDTLRTIAPEIKIGDND 401 (418) Q Consensus 354 ~~eke~ae~~~~~a~a~~~~~~~g~--i~~~e~r~~l~~~~~~~~~~~~~ 401 (418) ++....|. .|+++.+++++|. .+.+.+++.| |+++.| T Consensus 371 ~~~~s~a~----~aDa~~Kl~~a~~~~~~~~v~~~~~-------g~~~~d 409 (409) T protein:vir:16 371 ADASMLSL----IGDGAIKLNQAIPEFINKDTIRDLT-------GIKGAE 409 (409) T ss_pred cchhhHHH----HHHHHHHHHhhcccccchhHHHHhc-------cCCCCC Confidence 77555444 5899999999873 3345555543 333333 No 162 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.32 E-value=1.2e-12 Score=85.95 Aligned_cols=385 Identities=11% Similarity=0.115 Sum_probs=179.3 Q ss_pred Cccch-------hhHHHHhcCCCCccccCccccC-CHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HHHH Q lcl|NC_019404. 1 MVKTD-------SYANIFLGGSDGSEIYGSLQNQ-APTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PAFW 70 (418) Q Consensus 1 ~~~~D-------~~~n~~~g~~~~~~~~~~~~~~-~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~i~ 70 (418) +.-.+ -..+...|-+..- ...... .......-.....+++.||+..|.-++.++..|..+++. +.+. T Consensus 31 ~~~~~~~~~~i~~~~~yy~g~~~~~---~~~~~~~~~~~~~~~~~~~n~~k~i~~~~a~~l~~~p~~i~~~d~~~~e~l~ 107 (496) T protein:vir:38 31 VNANDEDYKYIDMWKRLYQGHYAEW---HNLNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFNEKVKINIDDKAAEEFVL 107 (496) T ss_pred CcCCHHHHHHHHHHHHHhcCCCchh---hcchhccCCCccccceeecchHHHHHHHHhhhhhCCcceEeeCChHHHHHHH Confidence 10011 1111111111000 000000 000000011224889999999999999999998765533 3345 Q ss_pred HHHHHhCchHHHHHHHHhccccceEEEEEeecC-CCcccc--------ccc-CCCceEEEEEeeccccccc-----cccc Q lcl|NC_019404. 71 SRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD-NRALTS--------PVR-EGAELETVRVYDRTQVKVQ-----NREE 135 (418) Q Consensus 71 ~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~l~~--------pl~-~~~~i~~i~v~~~~~i~~~-----~~~~ 135 (418) +-++.-++...+.+++..+..+|++++.+.++. +..... |+- ..+.+..+..+........ ++.. T Consensus 108 ~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~~~~y~~le~h~ 187 (496) T protein:vir:38 108 NVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSNDSENVDECVIANSFHKNNKYYTLLEWNE 187 (496) T ss_pred HHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEEEcccceEEEEecCCcEEEEEEEEEEEeCCeEEEEEEEEE Confidence 555666899999999999999999999888763 322111 221 1223332211111111000 0000 Q ss_pred cccccccCcceE--EEEecCCcccccc-----cC----c-------ccEE-EecCccchhhhhhccccCCcchHHHHHHH Q lcl|NC_019404. 136 NPRNARFGKPLT--YRITTNESDMFYD-----VH----Y-------SRIH-IIDGERVPNAMRRQNDGWGRSVLSSDILD 196 (418) Q Consensus 136 dp~s~~yg~p~~--y~i~~~~~~~~~~-----iH----~-------SR~i-~~~g~~lp~~~~~~~~~~G~S~l~~~~~~ 196 (418) .. ...|..++ |..... ...+.. +| + +|.. ++-..+.+. -.....++|.|++.. +.+ T Consensus 188 ~~--~~~~~I~~~~y~~~~~-~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N-~~~~~~p~G~Sd~~~-~~~ 262 (496) T protein:vir:38 188 WQ--GDVYTVTTELYQSDDP-NELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIAN-NKNLTSPLGISVYAN-ALD 262 (496) T ss_pred Ee--CceEEEEEEEEecCCc-cccCccccccccccccccceeecCCCcceEEEecCCccc-ccccCCcCCCchHhh-HHH Confidence 00 00011111 111110 000001 11 1 2221 111111111 122345689999975 889 Q ss_pred HHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC--CCceeEeeccc--C Q lcl|NC_019404. 197 SIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE--SEEYSVLNSDI--G 272 (418) Q Consensus 197 ~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~--~e~~~~~~~~~--~ 272 (418) .+..++.+....+.-+....-.++--..+-....++.+.. ...+. ........+..+.. ...++..+..+ . T Consensus 263 lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~--~~~~~---~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e 337 (496) T protein:vir:38 263 TLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGST--TQYFD---STDEAFFLYQGDQDDNGKAIKDISVEIRST 337 (496) T ss_pred HHHHHHHHHHHHHHHHhhcccceecchHHhhccCCCCCcc--ccCCC---CccceEEEeecCCCcccccceeeccccCHH Confidence 9999998877777666543333332111111111122110 00000 00000111111111 12366665554 3 Q ss_pred CHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhhc--------- Q lcl|NC_019404. 273 GIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTAL---ETFHKLIDRKRNAELLPILEFLIPFIVN--------- 340 (418) Q Consensus 273 gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~---~~y~~~I~~~Qe~~l~p~l~~l~~~i~~--------- 340 (418) .....++.+.+.++..+|+|...| |...+|.. |+.+-. ...+.++..+| +.++..|++++..+++ T Consensus 338 ~~~~~l~~~l~~i~~~~g~~~~~f-~~~~~g~~-tAtei~~~~~~l~~~~~~~~-~~~~~~l~~l~~~il~~~~~~~~~~ 414 (496) T protein:vir:38 338 EFIESINAMLRIYAMQVGLSAGTF-TFDENGLK-TATEVVSEKSETYQTKNSHS-QLIEQGIKEMIVSILEVGKFIEAYS 414 (496) T ss_pred HHHHHHHHHHHHHHHhhCCChhhc-CCCccccc-hHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhc Confidence 456778888899999999998654 65666653 444432 23444455543 4577888777776652 Q ss_pred -----cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhh-------------- Q lcl|NC_019404. 341 -----AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDND-------------- 401 (418) Q Consensus 341 -----~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~-------------- 401 (418) ..++++.|+.-...++.+. ++++.+++++|++|.+.+...+ .+.++++ T Consensus 415 g~~~~~~~i~v~f~d~i~~d~~~~-------~~~~~~~~~~GiiS~et~l~~~------~~~~d~ea~~el~ri~~E~~~ 481 (496) T protein:vir:38 415 GEVVELDTITVDFDDSIAQDEDTT-------INRYTNAKNQGMIPLKIALQRA------WNITEAEADEWAEMLAKEKQA 481 (496) T ss_pred CCCCCccceEEEeCCCCCCCHHHH-------HHHHHHHHhcCCCCHHHHHHhc------CCCChHHHHHHHHHHHHhhhc Confidence 1357899998888887765 4455566677888876664321 1111111 Q ss_pred -ccccc-ccCCCccc Q lcl|NC_019404. 402 -IQTEE-SELITETE 414 (418) Q Consensus 402 -~~~~e-~~~~~e~e 414 (418) +++++ ....+++| T Consensus 482 ~~~~~d~~~~~~~~e 496 (496) T protein:vir:38 482 EMPNNDMNGIFGEEE 496 (496) T ss_pred cCccccccCCCCCCC Confidence 11111 11223333 No 163 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.31 E-value=1.2e-11 Score=80.53 Aligned_cols=381 Identities=10% Similarity=0.075 Sum_probs=200.0 Q ss_pred CccchhhHHH----------------------HhcCCCCccccCc---cccCCHH------------HHHHHHHcCCccc Q lcl|NC_019404. 1 MVKTDSYANI----------------------FLGGSDGSEIYGS---LQNQAPT------------ILASLYADNALVR 43 (418) Q Consensus 1 ~~~~D~~~n~----------------------~~g~~~~~~~~~~---~~~~~~~------------~l~~~Y~~~~~~r 43 (418) |=|+..-.|+ +.+++...+..++ +...+.+ ....+|+.|++++ T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~ 80 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAK 80 (505) T ss_pred CCCCccccchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHH Confidence 2222222221 1122222222222 2222222 2346699999999 Q ss_pred hhhhcchhhhcc-CCccccCc--------ch--HHHHHHHHHHh------------CchHHHHHHHHhccccceEEEEEe Q lcl|NC_019404. 44 RIIDTIPETALA-AGFHIDGI--------DD--EPAFWSRWDDL------------EMTQNINDAWSWARLFGGAAIVAI 100 (418) Q Consensus 44 ~iVd~~a~d~~r-~~~~i~~~--------~d--~~~i~~~~~~l------------~~~~~~~~a~~~~rl~G~~~i~i~ 100 (418) .+|+..+...+= .|+.++.. ++ .++++..|++. .+.+...-+++.....|.|++.+. T Consensus 81 ~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~ 160 (505) T protein:vir:96 81 RFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREH 160 (505) T ss_pred HHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEe Confidence 999999999995 68877531 11 23455555442 233334456666667899988775 Q ss_pred ecCCCcccccccCCCceEEEEEeecccccccccccc---------ccccccCcceEEEEecCCc-----------ccccc Q lcl|NC_019404. 101 VKDNRALTSPVREGAELETVRVYDRTQVKVQNREEN---------PRNARFGKPLTYRITTNES-----------DMFYD 160 (418) Q Consensus 101 ~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~d---------p~s~~yg~p~~y~i~~~~~-----------~~~~~ 160 (418) ..++.+ .| + .|.++++..|....+... ..-..+|+|..|+|...-. ..... T Consensus 161 ~~~~~~--~~------~-~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~r 231 (505) T protein:vir:96 161 RGYPNK--WG------Y-ALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYER 231 (505) T ss_pred ecCCCC--cc------e-EEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccccc Confidence 543221 11 2 367778877754322111 1113479999999953211 11234 Q ss_pred cCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHH---HHHHHHHHcCCceeecchHHHhhcCcchHHH Q lcl|NC_019404. 161 VHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCER---LATQLLRRKQQAVWKAKGLAELCDDSEGFGA 237 (418) Q Consensus 161 iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~---~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~ 237 (418) |-.++|+|+-- .....+.-|.|.|. ++...++++++-.. ..+.+.-.+. -++|.+.-+.-....+.... T Consensus 232 vpa~~vlH~f~------~~r~gQ~RGis~la-pvl~~l~~l~~y~dael~~a~i~A~~a-~fi~~~~~~~~~~~~~~~~~ 303 (505) T protein:vir:96 232 VPADEIIHTFV------PWRPHQNRGIPWTH-ASMVELHHIGEYRKSEMIAAELGAKKV-GFYEQDPEAYDQPPEDDQGE 303 (505) T ss_pred cCHhHhhhhhc------ccCCccccCcchHH-HHHHHHHHHhHHHHHHHHHHHHhhhhe-eeeecCCccCCCccccccCc Confidence 66666776621 12233456888774 56666665555443 3333332222 24454422110000110011 Q ss_pred HHHHHHHHHHhcCCcceeEEEcCCCceeEeecc--cCCHHHHHHHHHHHHhhhhcCCeeeeeccCcc-ccccchhHHHHH Q lcl|NC_019404. 238 ARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSD--IGGIDAFLDKKFDRIVALSGIHEIILKNKNVG-GLSSSQNTALET 314 (418) Q Consensus 238 ~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~--~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~-gl~stge~d~~~ 314 (418) ... .-..+.+.....+++++..+.+ -++..++.......||+.+|||.-.|.|.-.+ .. |+.-..+.. T Consensus 304 ~~~--------~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nY-SS~R~~~~e 374 (505) T protein:vir:96 304 IVE--------EVEAGTYQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNF-SSLRSGELD 374 (505) T ss_pred ccc--------ccCCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccH-HHHHHHHHH Confidence 111 1124556666778999998876 35899999999999999999999988885433 34 344556667 Q ss_pred HHHHHHHHHHHHHH----HHHHHHHHHhhccCC-------------ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019404. 315 FHKLIDRKRNAELL----PILEFLIPFIVNAEE-------------WSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAG 377 (418) Q Consensus 315 y~~~I~~~Qe~~l~----p~l~~l~~~i~~~~~-------------~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g 377 (418) +...++..|...+. |+.+..++..++... ..|..+..-..|. .|.+++....+++| T Consensus 375 ~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP-------~Ke~~a~~~~i~~G 447 (505) T protein:vir:96 375 ERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDP-------AKDSKAHSESIKNR 447 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccCh-------HHHHHHHHHHHHcC Confidence 88888889887654 455554444433221 1222222222333 46788888999999 Q ss_pred CCCHHHHHHHHH--------------hhcCcCCCCh-------hhcccccccCCCccc Q lcl|NC_019404. 378 AMDIKEARDTLR--------------TIAPEIKIGD-------NDIQTEESELITETE 414 (418) Q Consensus 378 ~i~~~e~r~~l~--------------~~~~~~~~~~-------~~~~~~e~~~~~e~e 414 (418) +.|..++..+.. +.....|+.. ..-...+.+..+.+| T Consensus 448 ~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 448 TRSRSSIIRAAGDDPEDVFDEIAWEEQLMRDKGVNPTPPEQESKDATTDEEDDSASDD 505 (505) T ss_pred CCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 999877654310 0111122210 000111111111112 No 164 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=99.28 E-value=2.7e-12 Score=84.02 Aligned_cols=320 Identities=13% Similarity=0.055 Sum_probs=152.4 Q ss_pred Cc---cchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccc-c-Ccc----hH----- Q lcl|NC_019404. 1 MV---KTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHI-D-GID----DE----- 66 (418) Q Consensus 1 ~~---~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i-~-~~~----d~----- 66 (418) |= +.-++....+..+ ....+.+.... ..| .+..+.+||+.+|+++-+-++.+ + ..+ +. T Consensus 1 M~if~~~~~~~~~~~~~~-~~~~~~~~~~~------~~~-~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~ 72 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNND-TQRVTAWQNEA------VEY-TSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMA 72 (378) T ss_pred CchhHHhHhhhhcccccC-cceeeeeecch------hhh-hhHHHHHHHHHHHHhHhhCceeeeeecccccccccccccc Confidence 11 1111111111111 11111111111 112 23457899999999999988765 1 110 10 Q ss_pred -HHHHHHHH----HhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccccccccc Q lcl|NC_019404. 67 -PAFWSRWD----DLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNA 140 (418) Q Consensus 67 -~~i~~~~~----~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~ 140 (418) ..+...+. ..-....|.+.+.+ -.+.|.|+++....+ ..|.+.+.. T Consensus 73 ~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~---------~~g~~~~~~------------------- 124 (378) T protein:vir:94 73 GSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDS---------ETGELLDLL------------------- 124 (378) T ss_pred cchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeC---------CCCcEEEEE------------------- Confidence 01111121 11123455555555 446798998754432 123332221 Q ss_pred ccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC-ce Q lcl|NC_019404. 141 RFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ-AV 219 (418) Q Consensus 141 ~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~-~v 219 (418) ... ..+.+.++.|+|+... . +..-+.+++. .+...+.. .+. ..+. -+ T Consensus 125 ---------~~~----~~~~~~~~dvih~~~~---~-----~~~~~~~~~~-~~~~~~~~---~~~-------~~~~~g~ 172 (378) T protein:vir:94 125 ---------FAN----DKKEYKPEELVRLTSP---F-----YINEDTSILD-NALASIQT---KLE-------QGKLRGL 172 (378) T ss_pred ---------Eec----CcEEechhceeeecCc---C-----CcccchhHHH-HHHHHHHH---HHh-------hCCcccc Confidence 111 1135678889998521 1 1111334443 23332222 111 1111 13 Q ss_pred eecchHHHhhcCcchHHHHHHHHHHHHHh---cCCcceeEEEcCCCceeEeecccCCHH-HHHHHHHHHHhhhhcCCeee Q lcl|NC_019404. 220 WKAKGLAELCDDSEGFGAARLRLAQVDNN---SGVGQAIGIDAESEEYSVLNSDIGGID-AFLDKKFDRIVALSGIHEII 295 (418) Q Consensus 220 ~k~~~l~~~~~~~~~~~~~~~r~~~~~~~---~~~~~~~~~d~~~e~~~~~~~~~~gl~-~~~~~~~~~iaaas~IP~t~ 295 (418) ++++.. ++ .+...+..+++...... -.+.+.+++..++.+|+.++.+...++ +.+++....||.+.|||..+ T Consensus 173 l~~~~~---l~-~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgvPp~~ 248 (378) T protein:vir:94 173 LKINAF---LD-IDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENI 248 (378) T ss_pred eeeCCc---CC-HHHHHHHHHHHHHHHHHhhcccccccceeccCCceEEEccCChHHhhHHHHHHHHHHHHHHhCCCHHH Confidence 444421 11 11222334444433221 122333555556788999988775433 33567788999999999988 Q ss_pred eeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---c-c------------CCceEEeCCCCCCCHHHH Q lcl|NC_019404. 296 LKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV---N-A------------EEWSVEFSPLDHESSKDK 359 (418) Q Consensus 296 L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~---~-~------------~~~~~~f~pL~~~~eke~ 359 (418) |.|.. .+....+||.. -|.|++..+-..|- . . .++.|+++.|...|.+++ T Consensus 249 l~g~~-------~e~~~~~f~~~-------tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~ 314 (378) T protein:vir:94 249 LLGTA-------TQEQQIYFYNS-------TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKEL 314 (378) T ss_pred hcCCc-------hHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHH Confidence 85411 13344455553 47887766544432 1 1 135677788888888765 Q ss_pred HHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC---------h-hhc---cccc---c-cCCCccc Q lcl|NC_019404. 360 AEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIG---------D-NDI---QTEE---S-ELITETE 414 (418) Q Consensus 360 ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~---------~-~~~---~~~e---~-~~~~e~e 414 (418) ++++.+++++|++|++|+|+.+. ..|..+-+ . +.. +... . ..++.+| T Consensus 315 -------~e~~~~~~~~G~~t~NE~R~~~g-~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 315 -------IDLYHENINGPIFTQNQLLVKMG-EQPIEGGDVYIANLNAVAVKNLSDLQGNRKDVTSTDETNNQ 378 (378) T ss_pred -------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeeeecccccchhcchhcccccCCCCCCCCCCCC Confidence 66788899999999999999763 22221110 0 000 0000 1 1122233 No 165 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.28 E-value=4.7e-11 Score=77.22 Aligned_cols=389 Identities=14% Similarity=0.091 Sum_probs=201.8 Q ss_pred Cccch----------------h--hHHHHhcCCCCccccCccccCCHH------------HHHHHHHcCCccchhhhcch Q lcl|NC_019404. 1 MVKTD----------------S--YANIFLGGSDGSEIYGSLQNQAPT------------ILASLYADNALVRRIIDTIP 50 (418) Q Consensus 1 ~~~~D----------------~--~~n~~~g~~~~~~~~~~~~~~~~~------------~l~~~Y~~~~~~r~iVd~~a 50 (418) |=-.| + ....+-+.++..+.-+++...+.. ....+|+.||+++++|+... T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 11111 0 001122222222222222222221 23467999999999999999 Q ss_pred hhhcc-CCccccC----cc---h---HHHHHHHHHH----------hCchHHHHHHHHhccccceEEEEEeecCCCcccc Q lcl|NC_019404. 51 ETALA-AGFHIDG----ID---D---EPAFWSRWDD----------LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTS 109 (418) Q Consensus 51 ~d~~r-~~~~i~~----~~---d---~~~i~~~~~~----------l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~ 109 (418) ...+= .|+.+.. .+ + .++|++.|++ +.+.....-+++.-...|.+++.+.......++ T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~- 159 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLT- 159 (502) T ss_pred HhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccC- Confidence 99995 4766532 11 1 2345666552 344444455666666789999887653322111 Q ss_pred cccCCCce-EEEEEeecccccccccccc-----ccccccCcceEEEEecC---C--cccccccCcccEEEecCccchhhh Q lcl|NC_019404. 110 PVREGAEL-ETVRVYDRTQVKVQNREEN-----PRNARFGKPLTYRITTN---E--SDMFYDVHYSRIHIIDGERVPNAM 178 (418) Q Consensus 110 pl~~~~~i-~~i~v~~~~~i~~~~~~~d-----p~s~~yg~p~~y~i~~~---~--~~~~~~iH~SR~i~~~g~~lp~~~ 178 (418) ....+ ..|.++++..|+....+.+ ..-..+|+|..|+|... . ......|..++|+|+-- . T Consensus 160 ---~g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~------~ 230 (502) T protein:vir:79 160 ---PSAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKF------V 230 (502) T ss_pred ---CCcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhheEEeec------c Confidence 01111 1456778877753322111 11135799999999632 1 11225688899998732 2 Q ss_pred hhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHH-cC-CceeecchHHHhhcCcchHHHHHHHHHHHHHhcCC-ccee Q lcl|NC_019404. 179 RRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRR-KQ-QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGV-GQAI 255 (418) Q Consensus 179 ~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~-~~-~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~-~~~~ 255 (418) ...++.-|.|.|. ++...+++++.-..+...-..- +. .-++|++..........+.... . ...+. .+++ T Consensus 231 ~r~gQ~RGis~la-pvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~-~------~~~~l~pG~i 302 (502) T protein:vir:79 231 RRLHQMRGTSLLS-GVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKEN-E------RELTIQPGII 302 (502) T ss_pred cCCccccCCchHH-HHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCCCCCc-c------ccccccCCcc Confidence 2334556888775 5777776666554333221111 12 2234544222111111110000 0 00112 2333 Q ss_pred E-EEcCCCceeEeecc--cCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHH---- Q lcl|NC_019404. 256 G-IDAESEEYSVLNSD--IGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELL---- 328 (418) Q Consensus 256 ~-~d~~~e~~~~~~~~--~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~---- 328 (418) + ....+++++..+.+ .++..++.......||+++|||.-.|.|.-.+.. |+.-..+.-+...++..|+..+. T Consensus 303 ~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~ny-Ss~R~~~~e~~r~~~~~q~~~~~~~~~ 381 (502) T protein:vir:79 303 YDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTY-SAQRQELVESTDGYLILQDWFIGAVTR 381 (502) T ss_pred ccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccchH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3 34677889988864 4689999999999999999999999999864433 44555666788899999986544 Q ss_pred HHHHHHHHHhhccCCc------------eEEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHH------ Q lcl|NC_019404. 329 PILEFLIPFIVNAEEW------------SVEF--SPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTL------ 388 (418) Q Consensus 329 p~l~~l~~~i~~~~~~------------~~~f--~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l------ 388 (418) |+.+..++..++...+ ...| +..-..|. .|.+++....+++|+.|..++..+. T Consensus 382 pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP-------~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~ 454 (502) T protein:vir:79 382 PMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDP-------VKEAEAWKIQIRGGAATESDWVRAGGRNPDD 454 (502) T ss_pred HHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccCh-------HHHHHHHHHHHHcCCCCHHHHHHHcCCCHHH Confidence 4444444444332211 2233 33223343 4668888899999998877765431 Q ss_pred ------------HhhcCcCC--------CChhhccccc-ccCCCcccc Q lcl|NC_019404. 389 ------------RTIAPEIK--------IGDNDIQTEE-SELITETEV 415 (418) Q Consensus 389 ------------~~~~~~~~--------~~~~~~~~~e-~~~~~e~e~ 415 (418) ++.+-... -....-+..| .+..++.|. T Consensus 455 v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 455 VKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred HHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 11111000 0001111111 122223333 No 166 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.27 E-value=2.8e-11 Score=78.47 Aligned_cols=385 Identities=14% Similarity=0.086 Sum_probs=202.5 Q ss_pred CccchhhH------------HHHhcCCCCccccCcccc-CCH----------HHHHHHHHcCCccchhhhcchhhhccCC Q lcl|NC_019404. 1 MVKTDSYA------------NIFLGGSDGSEIYGSLQN-QAP----------TILASLYADNALVRRIIDTIPETALAAG 57 (418) Q Consensus 1 ~~~~D~~~------------n~~~g~~~~~~~~~~~~~-~~~----------~~l~~~Y~~~~~~r~iVd~~a~d~~r~~ 57 (418) |+-. ||. ..+-|.+.+.+..+.+.. .+. .....+|+.|++++.+|+......+-.| T Consensus 3 ~~~~-~~~a~~~~~~~~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG~G 81 (495) T protein:vir:10 3 MTPS-GYQSLASGLLVPVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVGNG 81 (495) T ss_pred cccc-cccccchhhhhHHHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCCC Confidence 1111 111 112222222222222211 111 1334568999999999999999999889 Q ss_pred ccccCc-ch---HHHHHHHHHH----------hCchHHHHHHHHhccccceEEEEEeecCCC-cccccccCCCceEEEEE Q lcl|NC_019404. 58 FHIDGI-DD---EPAFWSRWDD----------LEMTQNINDAWSWARLFGGAAIVAIVKDNR-ALTSPVREGAELETVRV 122 (418) Q Consensus 58 ~~i~~~-~d---~~~i~~~~~~----------l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~-~l~~pl~~~~~i~~i~v 122 (418) +..... ++ .++++..|++ +.+.....-+++.....|.+++.+...... ...-| ..|.+ T Consensus 82 i~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~-------~~lql 154 (495) T protein:vir:10 82 LTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVP-------LQLQI 154 (495) T ss_pred cccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccc-------eEEEE Confidence 887542 22 2345555543 234444555666667789998877653211 11111 25677 Q ss_pred eeccccccccc-ccccc---------ccccCcceEEEEecCCcc---------cccccCcccEEEecCccchhhhhhccc Q lcl|NC_019404. 123 YDRTQVKVQNR-EENPR---------NARFGKPLTYRITTNESD---------MFYDVHYSRIHIIDGERVPNAMRRQND 183 (418) Q Consensus 123 ~~~~~i~~~~~-~~dp~---------s~~yg~p~~y~i~~~~~~---------~~~~iH~SR~i~~~g~~lp~~~~~~~~ 183 (418) +++.+|+.... ...+. -..+|+|..|+|...-.+ ....|-.++|+|+-. + + ..+ T Consensus 155 iepd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~-~-----r-~gQ 227 (495) T protein:vir:10 155 IEPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTV-L-----T-VRS 227 (495) T ss_pred echhhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEeccc-c-----C-CCc Confidence 88887753221 11111 125899999999632111 124577888988842 2 1 234 Q ss_pred cCCcchHHHHHHH--HHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcc---hHHHHHHHHHHHHHhcCCcceeEEE Q lcl|NC_019404. 184 GWGRSVLSSDILD--SIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSE---GFGAARLRLAQVDNNSGVGQAIGID 258 (418) Q Consensus 184 ~~G~S~l~~~~~~--~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~---~~~~~~~r~~~~~~~~~~~~~~~~d 258 (418) .-|.|.| .++.. .+..|..+.-..+.+.-.+ .-++|.+.......... .......+ ...-..+.+.-. T Consensus 228 ~RGis~l-a~i~~l~~l~~y~dael~~a~i~A~~-~~fi~~~~~~~~~~~~~~~~~~~~~~~~-----~~~l~pG~i~~L 300 (495) T protein:vir:10 228 DAGAPWF-QLLLRLNELDQYEDAELVRKKTAALF-AAFIQEATADSTGGPTIGQPKRSKGGKR-----ITGLNPGTLQYL 300 (495) T ss_pred ccCcchh-HHHHHHHHhhHHHHHHHHHHHHhhhh-eeeeecCCCccccccccCccccccCccc-----ceecCCceeeec Confidence 4466644 34432 3444444444444333222 22344442221111100 00000000 011124556666 Q ss_pred cCCCceeEeecc--cCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHH-----HHHHH Q lcl|NC_019404. 259 AESEEYSVLNSD--IGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAE-----LLPIL 331 (418) Q Consensus 259 ~~~e~~~~~~~~--~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~-----l~p~l 331 (418) ..+++++..+.+ -++..++.......||+..|||.-.|.|.-.+.-=||.-..+..+...+++.|.+. ++|+. T Consensus 301 ~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~ 380 (495) T protein:vir:10 301 QPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVG 380 (495) T ss_pred CCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 778999999875 46899999999999999999999999886544322344455666788888888654 35666 Q ss_pred HHHHHHhhccCCc-------------eEEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHH------- Q lcl|NC_019404. 332 EFLIPFIVNAEEW-------------SVEF--SPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLR------- 389 (418) Q Consensus 332 ~~l~~~i~~~~~~-------------~~~f--~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~------- 389 (418) +..++..++...+ ..+| +..-..|. .|.+++....+++|+.|..++..+.. T Consensus 381 ~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP-------~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~ 453 (495) T protein:vir:10 381 RWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDP-------LKKHLADLGDVRAGFAPISDKQAERGYDMEELF 453 (495) T ss_pred HHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccCh-------HHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHH Confidence 6666655443211 1223 22222333 56788889999999998877654310 Q ss_pred -------hhcCcCCCC----------hhhcccccccCCCccc Q lcl|NC_019404. 390 -------TIAPEIKIG----------DNDIQTEESELITETE 414 (418) Q Consensus 390 -------~~~~~~~~~----------~~~~~~~e~~~~~e~e 414 (418) +.....|+. .....+...+..+++| T Consensus 454 ~q~a~e~~~~~~~Gl~~~~~p~~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 454 DMISDANQLIDEYDLRLDSDPRYVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred HHHHHHHHHHHHcCCCCCCCCCcCCCccCCCCCCCCCCCCCC Confidence 000111111 0111222222233333 No 167 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=99.26 E-value=2.8e-11 Score=78.45 Aligned_cols=389 Identities=12% Similarity=0.080 Sum_probs=182.1 Q ss_pred CccchhhHHHHhcCCCCccccCcc-ccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HHHHHHHHHhC Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSL-QNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PAFWSRWDDLE 77 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~-~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~i~~~~~~l~ 77 (418) +-|.+-+..-+-|.+.--...... ...... . ..+++++.||+..+...+.+++.++++++. +.+..-+++-+ T Consensus 32 ~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~---k--i~~n~~~~iv~~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~ 106 (489) T protein:vir:99 32 LERLKELKRYYLGDNNIKYRPAKTDKYAADN---R--IASDFAKYITVFEQGYMLGVPVEYKNENKDLQAAIDLMSVRNN 106 (489) T ss_pred HHHHHHHHHHhcccCccccccccccccCCcc---e--eecchHHHHHHHHhhhhccCCceeecCChhHHHHHHHHHhhcC Confidence 222233333233332111000000 000000 1 246899999999999999999999865543 34455556667 Q ss_pred chHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccc Q lcl|NC_019404. 78 MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDM 157 (418) Q Consensus 78 ~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~ 157 (418) +...+.++.+.+.+||.|++++.+..+. +..+.+ .+.++++.++.+.+.+.....+.++- .+|.+....+.. T Consensus 107 ~~~~~~~~~~~~~~~G~~~~~v~~~~~~------d~~~~~-~i~~~~p~~~~~v~dd~~~~~~~~~i-~~~~~~~~~~~~ 178 (489) T protein:vir:99 107 EDYHNVKIKTDLSIYGRAYELLTVEKID------DKKTEV-KLYQLPAEQTFVIYDDTYQRNSLMAV-HFYDIDYGSGKR 178 (489) T ss_pred hhHHHHHHHHHHhhCCeEEEEEeeccCc------CCCcce-EEEEEcccceEEEEcCCCCCceEEEE-EEEEEecCCCce Confidence 8888999999999999999887653211 111212 24445555443332111111111110 112221110000 Q ss_pred --cc-ccCcccEEEecCc------------------cchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_019404. 158 --FY-DVHYSRIHIIDGE------------------RVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQ 216 (418) Q Consensus 158 --~~-~iH~SR~i~~~g~------------------~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~ 216 (418) .. -+.+.++.+|... .+|. ....++.+|.|.++ .+.+.+.+++.+....+..+..++ T Consensus 179 ~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPv-v~~~n~~~~~s~~~-~v~~liDa~d~~~s~~~~~~~~~~ 256 (489) T protein:vir:99 179 KQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPV-NEYANNEERTGAYE-SVLDNIDAYDLSQSELANFQQDSV 256 (489) T ss_pred EEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeE-EEeecCCCCCCchh-hhHHHHHHHHHHHHHHHHHHHHhh Confidence 00 0112222222110 0111 11124456888886 478889999998888777777776 Q ss_pred CceeecchHHHhhcCcchHHHHHHHHHHHH-----HhcCCcceeEEEcC------CCc--eeEeecccCCHHHHHHHHHH Q lcl|NC_019404. 217 QAVWKAKGLAELCDDSEGFGAARLRLAQVD-----NNSGVGQAIGIDAE------SEE--YSVLNSDIGGIDAFLDKKFD 283 (418) Q Consensus 217 ~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~-----~~~~~~~~~~~d~~------~e~--~~~~~~~~~gl~~~~~~~~~ 283 (418) ..++.+.+.... ...........+..... ........+.++.. +.+ |-.+..+..+.+..++.+.+ T Consensus 257 ~~~l~i~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 335 (489) T protein:vir:99 257 NALLVIAGNAYT-GADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVA 335 (489) T ss_pred hhhhhhccCCcc-cccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHH Confidence 666665542211 10000000000000000 00000111222111 122 33445566788889999999 Q ss_pred HHhhhhcCCeeeeeccCccccccchhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhcc--------------CCceE Q lcl|NC_019404. 284 RIVALSGIHEIILKNKNVGGLSSSQNTALETFH---KLIDRKRNAELLPILEFLIPFIVNA--------------EEWSV 346 (418) Q Consensus 284 ~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~---~~I~~~Qe~~l~p~l~~l~~~i~~~--------------~~~~~ 346 (418) .|...+++|..-. ...+| |+||+.-...+. ..++.+| ..++..+++++.+++.- .++++ T Consensus 336 ~i~~~s~~p~~~~--~~~~~-n~Sg~Al~~~~~~l~~k~~~k~-~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v 411 (489) T protein:vir:99 336 DILRFTFTPDTQD--MKFSG-VQSGESMKYKLMASDNYREKQE-RLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSI 411 (489) T ss_pred HHHHHhCCccccc--ccccc-cchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcCCccccccccccceE Confidence 9999999996433 12223 456665333333 3333333 45788888887776421 25789 Q ss_pred EeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCCCHHHHHHHHHhh----c-----CcCCCC---hhhcccccccC Q lcl|NC_019404. 347 EFSPLDHESSKDKAEVLEKSVNSIAA---LIAAGAMDIKEARDTLRTI----A-----PEIKIG---DNDIQTEESEL 409 (418) Q Consensus 347 ~f~pL~~~~eke~ae~~~~~a~a~~~---~~~~g~i~~~e~r~~l~~~----~-----~~~~~~---~~~~~~~e~~~ 409 (418) .|++-...+..+.+++..+.+..++. +-..+.++.+++.++++.. . ....+. .++.+..+.+| T Consensus 412 ~f~~~~p~d~~~~~~~~~kl~giis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 412 VFTPNLPQNDNEIVTAAQNLYGIVSDQTIFEILNTVTGVDAEAELKRLKEEADKKQSLPEPRLVGDASGQEEPTAEKP 489 (489) T ss_pred EeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHhccccccccCCCCCCcCCCCCCC Confidence 99998889998888765554322111 1122333322222222110 0 000000 01111111111 No 168 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.26 E-value=8.5e-11 Score=75.78 Aligned_cols=395 Identities=10% Similarity=0.053 Sum_probs=203.0 Q ss_pred CccchhhHHH--Hh----cC-CCCccccCc-cccCCHH------------HHHHHHHcCCccchhhhcchhhhccCCccc Q lcl|NC_019404. 1 MVKTDSYANI--FL----GG-SDGSEIYGS-LQNQAPT------------ILASLYADNALVRRIIDTIPETALAAGFHI 60 (418) Q Consensus 1 ~~~~D~~~n~--~~----g~-~~~~~~~~~-~~~~~~~------------~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i 60 (418) ++..|+.... +. |+ ++..+..++ +...+.. ....+|+.|++++++|+..+...+-.|+.. T Consensus 6 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~~ 85 (530) T protein:vir:38 6 LVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVGSFFRL 85 (530) T ss_pred eecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhCCCcee Confidence 5666653221 11 11 122223332 1222221 234669999999999999999999999877 Q ss_pred cCc-----------ch---HHHHHHHHHH--------------hCchHHHHHHHHhccccceEEEEEeecCCCccccccc Q lcl|NC_019404. 61 DGI-----------DD---EPAFWSRWDD--------------LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVR 112 (418) Q Consensus 61 ~~~-----------~d---~~~i~~~~~~--------------l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~ 112 (418) ... .+ .++++..|.+ +.+.+...-+++.....|.+++.+.......+.-| T Consensus 86 ~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~g~~~~-- 163 (530) T protein:vir:38 86 SYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSDSTRLFR-- 163 (530) T ss_pred eeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeeccCCCCccc-- Confidence 531 11 1345555542 12334444555666668999887765322211112 Q ss_pred CCCceEEEEEeecccccccccc-cc------ccccccCcceEEEEecC----Cc-c------cccccCcccEEEecCccc Q lcl|NC_019404. 113 EGAELETVRVYDRTQVKVQNRE-EN------PRNARFGKPLTYRITTN----ES-D------MFYDVHYSRIHIIDGERV 174 (418) Q Consensus 113 ~~~~i~~i~v~~~~~i~~~~~~-~d------p~s~~yg~p~~y~i~~~----~~-~------~~~~iH~SR~i~~~g~~l 174 (418) ..|.++++..|...... .. ..-..+|+|..|+|... .. . ....++.++|||+-.. T Consensus 164 -----~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~-- 236 (530) T protein:vir:38 164 -----TQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEP-- 236 (530) T ss_pred -----eEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhHeEeeccc-- Confidence 14667777776532211 00 11134799999999532 11 0 1123555578877321 Q ss_pred hhhhhhccccCCcchHHHHHHHHHHHHHHHHHHH---HHHHHHcCCceeecchHHH----hhcCcchH--HHHHHHHHHH Q lcl|NC_019404. 175 PNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLA---TQLLRRKQQAVWKAKGLAE----LCDDSEGF--GAARLRLAQV 245 (418) Q Consensus 175 p~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~---~~l~~~~~~~v~k~~~l~~----~~~~~~~~--~~~~~r~~~~ 245 (418) ....+.-|.|.|. ++...+++++.-..+. +.+.-.+ .-++|.+.... .....++. .......... T Consensus 237 ----~r~gQ~RGis~la-pvl~~l~~l~~y~dael~~a~i~A~~-a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (530) T protein:vir:38 237 ----MEDGQTRGANAFY-SVMEQMKMLDTLQNTQLQSAIVKAMY-AATIESELDTQSAMDFILGADNKEQQSKLTGWLGE 310 (530) T ss_pred ----cCCCcccCCchHH-HHHHHHHHHhHHHHHHHHHHHHhhhh-eeeeeccCCccccccccccCCcccccccccccchh Confidence 2234556888774 5667666665554433 3322211 22344432111 00000000 0000000000 Q ss_pred -------HHhcCCcceeEEEcCCCceeEeecc--cCCHHHHHHHHHHHHhhhhcCCeeeeeccCcc-ccccchhHHHHHH Q lcl|NC_019404. 246 -------DNNSGVGQAIGIDAESEEYSVLNSD--IGGIDAFLDKKFDRIVALSGIHEIILKNKNVG-GLSSSQNTALETF 315 (418) Q Consensus 246 -------~~~~~~~~~~~~d~~~e~~~~~~~~--~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~-gl~stge~d~~~y 315 (418) ....-..+.+.....+++++..+.+ -++..++.......||+++|||...|.|.-.+ .. ||.-..+.-+ T Consensus 311 ~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nY-SS~R~~~~e~ 389 (530) T protein:vir:38 311 MAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSY-STARASANES 389 (530) T ss_pred hhhcccccceeccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccH-HHHHHHHHHH Confidence 0011124556666778889988876 45889999999999999999999999885333 33 4455566778 Q ss_pred HHHHHHHHHHHHHHHHHHH----HHHhhccCC--------------------ceEEeCCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_019404. 316 HKLIDRKRNAELLPILEFL----IPFIVNAEE--------------------WSVEFSPLDHESSKDKAEVLEKSVNSIA 371 (418) Q Consensus 316 ~~~I~~~Qe~~l~p~l~~l----~~~i~~~~~--------------------~~~~f~pL~~~~eke~ae~~~~~a~a~~ 371 (418) ...++..|...+.+.+..+ ++..++... ..|..+..-..|. .|.+++.. T Consensus 390 ~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP-------~Ke~~a~~ 462 (530) T protein:vir:38 390 WAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDG-------LKEVQEAV 462 (530) T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccCh-------HHHHHHHH Confidence 8999999988766554433 333232211 1222233333444 56788888 Q ss_pred HHHhCCCCCHHHHHHHHH--------------hhcCcCCCCh-----hhccc--ccccCCCccccccC Q lcl|NC_019404. 372 ALIAAGAMDIKEARDTLR--------------TIAPEIKIGD-----NDIQT--EESELITETEVVIA 418 (418) Q Consensus 372 ~~~~~g~i~~~e~r~~l~--------------~~~~~~~~~~-----~~~~~--~e~~~~~e~e~~~~ 418 (418) ..+++|+.|..++..+.. ......|+.. ..... ...+.++++..-=| T Consensus 463 ~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 463 MLIEAGLSTYEKECAKRGDDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGVKKSNEEEQDGARAA 530 (530) T ss_pred HHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCCCCCCCCCCCCCCCC Confidence 999999998877654310 0001111111 01000 11111111111111 No 169 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.25 E-value=1.9e-10 Score=73.87 Aligned_cols=396 Identities=10% Similarity=0.037 Sum_probs=204.0 Q ss_pred CccchhhHHH-----Hhc-CC-CCccccCc-cccCCHH------------HHHHHHHcCCccchhhhcchhhhccCCccc Q lcl|NC_019404. 1 MVKTDSYANI-----FLG-GS-DGSEIYGS-LQNQAPT------------ILASLYADNALVRRIIDTIPETALAAGFHI 60 (418) Q Consensus 1 ~~~~D~~~n~-----~~g-~~-~~~~~~~~-~~~~~~~------------~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i 60 (418) ....++.... ..+ +. .+.+..++ +...+.. ....+|+.|++++++|+..+...+-.|+.+ T Consensus 9 ~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~~ 88 (533) T protein:vir:34 9 LLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDHIVGSFFRL 88 (533) T ss_pred hhcccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhCCCcee Confidence 2222322111 122 11 12222222 1222221 334668999999999999999999999987 Q ss_pred cCc----------chH----HHHHHHHHH--------------hCchHHHHHHHHhccccceEEEEEeecCCCccccccc Q lcl|NC_019404. 61 DGI----------DDE----PAFWSRWDD--------------LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVR 112 (418) Q Consensus 61 ~~~----------~d~----~~i~~~~~~--------------l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~ 112 (418) ... +.. +.|++.|++ +.+.+...-+++.....|.+++.+..........| T Consensus 89 ~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~g~~~~-- 166 (533) T protein:vir:34 89 SHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTSSSRLFR-- 166 (533) T ss_pred eeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeeccCCCCccc-- Confidence 542 111 245555533 12334445566666678999888765322111111 Q ss_pred CCCceEEEEEeeccccccccccc-------cccccccCcceEEEEecC--Ccc---------cccccCcccEEEecCccc Q lcl|NC_019404. 113 EGAELETVRVYDRTQVKVQNREE-------NPRNARFGKPLTYRITTN--ESD---------MFYDVHYSRIHIIDGERV 174 (418) Q Consensus 113 ~~~~i~~i~v~~~~~i~~~~~~~-------dp~s~~yg~p~~y~i~~~--~~~---------~~~~iH~SR~i~~~g~~l 174 (418) ..|.++++..|....... -..-..+|+|..|+|... .+. ....++.++|||+-- T Consensus 167 -----~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f~--- 238 (533) T protein:vir:34 167 -----TQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFE--- 238 (533) T ss_pred -----eEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChhHeeeecc--- Confidence 146778887775432210 111235899999999632 111 112366677887731 Q ss_pred hhhhhhccccCCcchHHHHHHHHHHHHHHHHH---HHHHHHHHcCCceeecchHHH----hhcCcchH--H-HHHHHHHH Q lcl|NC_019404. 175 PNAMRRQNDGWGRSVLSSDILDSIKDYTNCER---LATQLLRRKQQAVWKAKGLAE----LCDDSEGF--G-AARLRLAQ 244 (418) Q Consensus 175 p~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~---~~~~l~~~~~~~v~k~~~l~~----~~~~~~~~--~-~~~~r~~~ 244 (418) .....+..|.|.|. ++...+++++.-.. ..+.+.-.+ .-++|.+.... .+...... . ........ T Consensus 239 ---~~r~gQ~RGis~la-pvl~~l~~l~~y~dael~~a~i~A~~-a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (533) T protein:vir:34 239 ---PVEDGQTRGANVFY-SVMEQMKMLDTLQNTQLQSAIVKAMY-AATIESELDTQSAMDFILGANSQEQRERLTGWIGE 313 (533) T ss_pred ---ccCCCcccCCchHH-HHHHHHHHHHHHHHHHHHHHHHhhhh-eeeeecCCCcccccccccCCCcccccccccccchh Confidence 22334556888774 56666665555433 333333222 22344432110 00000000 0 00000000 Q ss_pred HH------HhcCCcceeEEEcCCCceeEeecc--cCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHH Q lcl|NC_019404. 245 VD------NNSGVGQAIGIDAESEEYSVLNSD--IGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFH 316 (418) Q Consensus 245 ~~------~~~~~~~~~~~d~~~e~~~~~~~~--~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~ 316 (418) .. ...-..+.+.....+++++..+.+ -++..++.......||++.|||...|.|.-.+.-=||.-..+..+. T Consensus 314 ~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~ 393 (533) T protein:vir:34 314 IAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESW 393 (533) T ss_pred hhhccCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHH Confidence 00 001123555566778889988865 4689999999999999999999999988643322244555666788 Q ss_pred HHHHHHHHHHHH----HHHHHHHHHhhccCCc--------------------eEEeCCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 317 KLIDRKRNAELL----PILEFLIPFIVNAEEW--------------------SVEFSPLDHESSKDKAEVLEKSVNSIAA 372 (418) Q Consensus 317 ~~I~~~Qe~~l~----p~l~~l~~~i~~~~~~--------------------~~~f~pL~~~~eke~ae~~~~~a~a~~~ 372 (418) ..++..|...+. |+.+..++..++...+ .|..+..-..|. .|.+++... T Consensus 394 r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP-------~Ke~~a~~~ 466 (533) T protein:vir:34 394 AYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDG-------LKEVQEAVM 466 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccCh-------HHHHHHHHH Confidence 889999987654 4445444443333211 222222223333 577888999 Q ss_pred HHhCCCCCHHHHHHHHH--------------hhcCcCCCChh------hcccccccCCCccccccC Q lcl|NC_019404. 373 LIAAGAMDIKEARDTLR--------------TIAPEIKIGDN------DIQTEESELITETEVVIA 418 (418) Q Consensus 373 ~~~~g~i~~~e~r~~l~--------------~~~~~~~~~~~------~~~~~e~~~~~e~e~~~~ 418 (418) .+++|+.|..++..+.. +.....++... .......+.+++.++--| T Consensus 467 ~i~~G~~s~~~~~a~~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~~~~~~~ 532 (533) T protein:vir:34 467 LIEAGLSTYEKECAKRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDSRA 532 (533) T ss_pred HHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCCcccCCC Confidence 99999999877654321 01111222111 111111122222333333 No 170 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.24 E-value=1.2e-10 Score=75.02 Aligned_cols=364 Identities=11% Similarity=0.067 Sum_probs=186.0 Q ss_pred Cccchh-----------hHHHHhcCCCCcccc---CccccC---------CHHHHHHHHHcCCccchhhhcchhhhccCC Q lcl|NC_019404. 1 MVKTDS-----------YANIFLGGSDGSEIY---GSLQNQ---------APTILASLYADNALVRRIIDTIPETALAAG 57 (418) Q Consensus 1 ~~~~D~-----------~~n~~~g~~~~~~~~---~~~~~~---------~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~ 57 (418) ++-.|| +...+.......... ..+..+ +...+..+. +.+-+..++++.....+... T Consensus 5 i~~~~g~p~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~~~~~~iLr~~~~~~~~y~~m~-~D~~i~s~l~~Rk~av~~~~ 83 (491) T protein:vir:10 5 LWVSPTEFVTFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVYRELR-ADAHVGGCVRRRKAAVKALE 83 (491) T ss_pred eeCCCCCccCcccCChHHHHHHHhhhcccccccccCCccchHHHHHhcCCCHHHHHHHh-hChHHHHHHHHHHHHHhCCC Confidence 111222 111111000000000 011111 223333443 57888888898888888888 Q ss_pred ccccCc-ch---HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccccCCCceEEEEEeecccccccc Q lcl|NC_019404. 58 FHIDGI-DD---EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRTQVKVQN 132 (418) Q Consensus 58 ~~i~~~-~d---~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~ 132 (418) |.|+.. ++ .+.+.+.++++.+...+.+.+ .+.+||+|+.=+.-+ ++. .-.+..+...++.++.... T Consensus 84 w~i~~~~~~~~~~e~v~e~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g--------~~~~~~l~~r~~~~f~~d~ 154 (491) T protein:vir:10 84 WGLDRGKAKSRVAKSIADVFADLDLSRIVTEML-DAVLYGYQPMEITWGKVGN--------YIVPIDVVGKPADWFVYDP 154 (491) T ss_pred cEEecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCC--------eeEEEEeeeecccceeecc Confidence 888632 22 245666777787777666665 689999999865432 111 1124456566655544321 Q ss_pred ccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 133 REENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLL 212 (418) Q Consensus 133 ~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~ 212 (418) + +. ..|.. ..+...+..+++.+.+++.... ...++||.+.+ +.||....--..+...-+..+ T Consensus 155 --------~-~~-l~~~~-~~~~~~g~~l~~~k~i~~~~~~------~~~~p~g~gLl-~~~~w~~~fK~~~~~~w~~f~ 216 (491) T protein:vir:10 155 --------E-NQ-LRFRS-KDHWMQGEELPARKFLVPRQEA------TYLNPYGFPDL-SMCFWPTTFKKGGLKFWVQFT 216 (491) T ss_pred --------C-Cc-eEEec-CCCCCCcceecCCCEEEEEecC------CCCCcccchhH-HHHHHHHHHHHHHHHHHHHHH Confidence 1 11 12221 1222234567888888776432 34567888866 568887777777777888889 Q ss_pred HHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeeccc-C-C---HHHHHHHHHHHH Q lcl|NC_019404. 213 RRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDI-G-G---IDAFLDKKFDRI 285 (418) Q Consensus 213 ~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~-~-g---l~~~~~~~~~~i 285 (418) .+++++ +.|++.. ..+.+ ..++.......+....+++ -++.+++.++..- + + -..+++.+-..| T Consensus 217 E~yG~P~~igky~~~-----a~~~e---k~~l~~al~~~~~~a~~vi-P~~~~ie~~ea~~~~g~~~~y~~li~~~d~~I 287 (491) T protein:vir:10 217 EKYGSPMLVGKHPRS-----ASDGE---KNLLLDCLEDMVQDAVAVV-PDDSSIEIKEAAGKTGSADVYERLLHFCRGEV 287 (491) T ss_pred HHcCCCeEEEecCCC-----CCHHH---HHHHHHHHHHHhcCcEEEe-cCCceeEEEecCCCCCChhHHHHHHHHHHHHH Confidence 998855 4565421 11111 2222322333344444444 4568899988754 2 2 345677777777 Q ss_pred hhh-hcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----CCceEEeCCCCCCCHHHH Q lcl|NC_019404. 286 VAL-SGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNA-----EEWSVEFSPLDHESSKDK 359 (418) Q Consensus 286 aaa-s~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~-----~~~~~~f~pL~~~~eke~ 359 (418) +.+ .|--.| ....|+ .|.|+--.....+.+++... .+...+++++.-++.- ....|.|... + T Consensus 288 sk~iLGqtlT---t~~~gs-~a~~~vh~~v~~di~~~D~~-~i~~tln~li~~l~~~N~~~~~~p~f~~~~~------~- 355 (491) T protein:vir:10 288 SIALLGQNQT---TEATST-RASAQAGLEVTDDIRDGDKA-VVSEAMNMLIRWICDLNFDGADRPVFDMWEQ------E- 355 (491) T ss_pred HHHHhhhhcc---cCcccc-hhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCCCcceEEecCc------C- Confidence 743 222211 112222 23344444556666666653 4555666666655421 2244555421 1 Q ss_pred HHHHHHHHHHHHHHHhCCC-CCHHHHHHHHHhhcCcCCCChhhcccccccCCCccccccC Q lcl|NC_019404. 360 AEVLEKSVNSIAALIAAGA-MDIKEARDTLRTIAPEIKIGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 360 ae~~~~~a~a~~~~~~~g~-i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e~e~~~~ 418 (418) +..+..|++++++++.|+ ++.+.+++.+.--.+ ..+....+...+.+. .....| T Consensus 356 -e~~~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~~--~~~~~~~~~~~~~~~--~~~~~~ 410 (491) T protein:vir:10 356 -QVDEIQAGRDQKLTQAGARFTPAYFKRAYNLQDG--DLDERPLPVSAVDTV--GAASFA 410 (491) T ss_pred -chhHHHHHHHHHHHhCCCcCCHHHHHHHhCCCCC--CcCccccccCCCCCc--cccccc Confidence 233567899999999997 888888887631111 111111111111100 000011 No 171 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=99.24 E-value=4.6e-12 Score=82.71 Aligned_cols=398 Identities=11% Similarity=0.084 Sum_probs=196.7 Q ss_pred CccchhhHHHHhcCCCC---ccccC-----ccccCCHH------HHHHHHH---------------------cCCccchh Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDG---SEIYG-----SLQNQAPT------ILASLYA---------------------DNALVRRI 45 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~---~~~~~-----~~~~~~~~------~l~~~Y~---------------------~~~~~r~i 45 (418) |==-|...|++..+... ....+ .....++. .-.++|+ +-.+++.| T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLA 80 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHH Confidence 32333444443221110 00000 00111111 1123342 23788999 Q ss_pred hhcchhhhccCCccccCcch--HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCc-c-------ccccc-CC Q lcl|NC_019404. 46 IDTIPETALAAGFHIDGIDD--EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRA-L-------TSPVR-EG 114 (418) Q Consensus 46 Vd~~a~d~~r~~~~i~~~~d--~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~-l-------~~pl~-~~ 114 (418) |+..|+-++.+...|+.+++ .+.+.+.+++-++...+.+++..+..+|++++-+.++.++. + --|+. .. T Consensus 81 ~~~~A~ll~~e~~~i~~~d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~~~~~i~~v~ad~~~P~~~d~ 160 (505) T protein:vir:79 81 SAKLASLIFNEQCQVTVSDETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVDSGKIKLAWATADQVYPLQADT 160 (505) T ss_pred HHHHHhhhcCCCceeecCChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEeCCceEEEEEcCCeeEEEEEcC Confidence 99999999999888765433 34566667777899999999999999999998877754431 1 11321 12 Q ss_pred CceEEEEEeeccccccc---------cccccccccccCcceEEEEecCCc-ccccccCccc---------EEEecCccch Q lcl|NC_019404. 115 AELETVRVYDRTQVKVQ---------NREENPRNARFGKPLTYRITTNES-DMFYDVHYSR---------IHIIDGERVP 175 (418) Q Consensus 115 ~~i~~i~v~~~~~i~~~---------~~~~dp~s~~yg~p~~y~i~~~~~-~~~~~iH~SR---------~i~~~g~~lp 175 (418) +.+..+.++-+|..... ++.+. .-..|..++..+...++ ..+..|.-+. -+.+.|-+.| T Consensus 161 ~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~--~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p 238 (505) T protein:vir:79 161 NQVNELAIASRTTEVENHRTIYYTLLEFHQW--DHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHP 238 (505) T ss_pred CCeEEEEEEEEEEEecCCcceEEEEEEEEEe--cCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCcc Confidence 33333333322211110 00000 00011111111111111 0111111000 1122222211 Q ss_pred h---------hhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHH Q lcl|NC_019404. 176 N---------AMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVD 246 (418) Q Consensus 176 ~---------~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~ 246 (418) . .-+....++|.|++.. +.+.++.++.+......-+......++--..+-.....+.+...... ..... T Consensus 239 ~f~~~~~~~~N~~~~~splG~S~~~~-~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~~~-~~~fd 316 (505) T protein:vir:79 239 LFAFYRNKGANNKNFTSPMGMSLIDN-SYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQASETH-PPMFD 316 (505) T ss_pred eEEEecCCcccccccCCccCCchhhh-hHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCccccccc-ccCCC Confidence 1 1122235689999975 77999999988877777665544444332211111111111100000 00000 Q ss_pred HhcCCcceeEEEcCCCceeEeeccc--CCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHH---HHHHHHHHHH Q lcl|NC_019404. 247 NNSGVGQAIGIDAESEEYSVLNSDI--GGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTA---LETFHKLIDR 321 (418) Q Consensus 247 ~~~~~~~~~~~d~~~e~~~~~~~~~--~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d---~~~y~~~I~~ 321 (418) ........+..+..+..++..+..+ ....+.++.+++.|+..+|++...| |...+|.. |+.+- .+..|.+++. T Consensus 317 ~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~-~~~~~~~~-TAtei~s~~~~l~~t~~~ 394 (505) T protein:vir:79 317 PDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTF-TTSPSGIQ-TATEVVTNNSQTYQTRSS 394 (505) T ss_pred ccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhc-CCCccccc-hHHHHHHHHhHHHHHHHH Confidence 0001122222333345677777665 3466778889999999999997655 55556653 44333 3456777887 Q ss_pred HHHHHHHHHHHHHHHHhhc--------------------cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCH Q lcl|NC_019404. 322 KRNAELLPILEFLIPFIVN--------------------AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDI 381 (418) Q Consensus 322 ~Qe~~l~p~l~~l~~~i~~--------------------~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~ 381 (418) +|. .++..|+.|+..|++ ..+++|.|++-...|+.+. ++...+++++|+++. T Consensus 395 ~~~-~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~-------~~~~~~~v~~Gi~s~ 466 (505) T protein:vir:79 395 YIT-QVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESK-------RAADLQAVQAQVMPK 466 (505) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHH-------HHHHHHHHHcCCCCH Confidence 764 478888888877753 1257899999888887654 455677888899988 Q ss_pred HHHHHHHHhhcCcCCCChhhcccccccCCCccc------cccC Q lcl|NC_019404. 382 KEARDTLRTIAPEIKIGDNDIQTEESELITETE------VVIA 418 (418) Q Consensus 382 ~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e~e------~~~~ 418 (418) ++++..+ .+.++++....-.++..|+. .-+. T Consensus 467 e~~l~~~------~~~~eeea~~el~ri~~E~~~~~p~~~~~g 503 (505) T protein:vir:79 467 KQFLMRN------YGLDEEEADEWLAQIDAENSTAEPEFNQFG 503 (505) T ss_pred HHHHHhc------CCCChHHHHHHHHHHHHhccccCCCchhcc Confidence 7766321 22333221110011111110 0111 No 172 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.24 E-value=9.2e-11 Score=75.60 Aligned_cols=387 Identities=10% Similarity=0.014 Sum_probs=183.7 Q ss_pred Cccchh-----------hHHHHhcC-CCC---cccc-Cc-----cccCCHHHHHHHHHcCCccchhhhcchhhhccCCcc Q lcl|NC_019404. 1 MVKTDS-----------YANIFLGG-SDG---SEIY-GS-----LQNQAPTILASLYADNALVRRIIDTIPETALAAGFH 59 (418) Q Consensus 1 ~~~~D~-----------~~n~~~g~-~~~---~~~~-~~-----~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~ 59 (418) -.|..| |......- .+- .++| |. ...-.+.++..+-..-++.++|||.+++-..=+||. T Consensus 6 ~~~~~gl~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl~~~Gf~ 85 (474) T protein:vir:81 6 TVRIPSLSNDENALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARRCNLEGFV 85 (474) T ss_pred cCcCCCCChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhhhcccceE Confidence 011111 11111100 000 0111 10 111124455544455788899999999998889998 Q ss_pred ccCcch-HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccc--------- Q lcl|NC_019404. 60 IDGIDD-EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVK--------- 129 (418) Q Consensus 60 i~~~~d-~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~--------- 129 (418) +.+.++ ...+.+-|.+-++.....++++-+.+||.||+++.-+++.+. .|+ |+++++.++. T Consensus 86 ~~d~~~~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~-~~~--------i~~~sp~~~~~~~D~~~~~ 156 (474) T protein:vir:81 86 WPDGDLDSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEP-EAL--------IHVKDASEATGEWNRRRRG 156 (474) T ss_pred CCCCCccchHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCc-eeE--------EEEeccceEEEEEeCCCCc Confidence 754332 235677788888888889999999999999988765422110 111 2222222221 Q ss_pred --cc--ccccccc----ccccCcc-eEEEEecCCcccc----cccCccc--EEEecCccchhhhhhccccCCcchHHHHH Q lcl|NC_019404. 130 --VQ--NREENPR----NARFGKP-LTYRITTNESDMF----YDVHYSR--IHIIDGERVPNAMRRQNDGWGRSVLSSDI 194 (418) Q Consensus 130 --~~--~~~~dp~----s~~yg~p-~~y~i~~~~~~~~----~~iH~SR--~i~~~g~~lp~~~~~~~~~~G~S~l~~~~ 194 (418) .. .+..|.. .-.++-| ..|++........ ..-|+=. ++.|..++ .....+|.|.+.+++ T Consensus 157 ~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvPvV~~~n~~------~~~~~~G~s~i~e~v 230 (474) T protein:vir:81 157 LNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVPAQVLPYKP------APKRPFGQSRITKPM 230 (474) T ss_pred ceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcceEEecccc------cccCcCCccccchhH Confidence 11 0111111 1111111 2222221111110 0112111 22332221 223457999887778 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhh-c--CcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEe-ecc Q lcl|NC_019404. 195 LDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELC-D--DSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVL-NSD 270 (418) Q Consensus 195 ~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~-~--~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~-~~~ 270 (418) .+.+.++.+++....-...-++.+...+.|...-. . ++............+...-....+.......-++-++ ..+ T Consensus 231 ~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~ 310 (474) T protein:vir:81 231 MGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAAS 310 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHhcCCCcccccccccccccccccCCCC Confidence 88888888887776666666666655554433211 1 1111111111111111110100110000001123333 356 Q ss_pred cCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHhhcc------ Q lcl|NC_019404. 271 IGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTA---LETFHKLIDRKRNAELLPILEFLIPFIVNA------ 341 (418) Q Consensus 271 ~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d---~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~------ 341 (418) +.+.-+.+.....++|+.++||..-| |...-.-.+|++.- .......++.+|+. +.+.++++..+.+.- T Consensus 311 l~~~~~~l~~~~~~~a~~t~iP~~~l-G~~~~~np~SaeAi~a~~~~l~~kae~k~~~-fg~~l~~~~rla~~i~~~~~~ 388 (474) T protein:vir:81 311 PDAHWSDINGLAKLFAREASLPDTAV-AISGLSNPTSAESYDASQYELIAEAEGAVDD-FTPALRKAFIRALAMKNKVAI 388 (474) T ss_pred hhHHHHHHHHHHHHHHhhhCCCHHHh-cccccccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhCCCCc Confidence 67788889999999999999998766 42211112334332 23445566666654 677888887776421 Q ss_pred -------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC-CCH-HHHHHHHHhhcCcCCCChhhcccccccCC-C Q lcl|NC_019404. 342 -------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGA-MDI-KEARDTLRTIAPEIKIGDNDIQTEESELI-T 411 (418) Q Consensus 342 -------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~-i~~-~e~r~~l~~~~~~~~~~~~~~~~~e~~~~-~ 411 (418) ..+.+.|.+...+|..+. |+++.+++++|. +.. +-.++. .|+++++++..+.+-. . T Consensus 389 ~~~~~~~~~~~v~W~d~~~~s~a~~-------aDa~~Kl~~a~~~~~~~~~~~~~-------lg~t~~~i~~~~~~~~~~ 454 (474) T protein:vir:81 389 DEIPDEWKSIDAKWRDPRYLSKSAQ-------ADAGMKQLAAVPWLAETEVGLEL-------IGLTPQQARRAMADKRRV 454 (474) T ss_pred cccchhhccceeEecCCCccCHHHH-------HHHHHHHHhcccCCCcHHHHHhh-------cCCCHHHHHHHHHHHHHH Confidence 135678999888888776 555555665552 222 222221 1333333322111100 0 Q ss_pred cc------------ccccC Q lcl|NC_019404. 412 ET------------EVVIA 418 (418) Q Consensus 412 e~------------e~~~~ 418 (418) +. ++.=| T Consensus 455 ~~~~~~~~l~~~~~~~~~a 473 (474) T protein:vir:81 455 QGRGTLQALIDRSNNGATA 473 (474) T ss_pred hHHHHHHHHHhcCCCCCCC Confidence 01 11111 No 173 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.22 E-value=1.5e-11 Score=79.97 Aligned_cols=387 Identities=12% Similarity=0.115 Sum_probs=186.5 Q ss_pred cchhhHHHH------hcCC-CCccc-cCccccCCH------HHHHHHH-----------------------HcCCccchh Q lcl|NC_019404. 3 KTDSYANIF------LGGS-DGSEI-YGSLQNQAP------TILASLY-----------------------ADNALVRRI 45 (418) Q Consensus 3 ~~D~~~n~~------~g~~-~~~~~-~~~~~~~~~------~~l~~~Y-----------------------~~~~~~r~i 45 (418) =.|++.+.+ ||.. .-... ...-...+. .....+| ....+++.| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~i 80 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKVT 80 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHHH Confidence 112221111 1110 00000 000000000 0011111 235889999 Q ss_pred hhcchhhhccCCccccCcchH--HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCC-Cccc--------cccc-C Q lcl|NC_019404. 46 IDTIPETALAAGFHIDGIDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDN-RALT--------SPVR-E 113 (418) Q Consensus 46 Vd~~a~d~~r~~~~i~~~~d~--~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~-~~l~--------~pl~-~ 113 (418) |+..|+-++.++..|+.+++. +.+++.++.-++...+.+++..+..+|++++.+.++.+ ...- -|+- . T Consensus 81 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~a~~~~Pi~~d 160 (499) T protein:vir:80 81 AKYMSKLLFNEKVKINIDDETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSND 160 (499) T ss_pred HHHHHHhhhCCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEEcCCceEEEEec Confidence 999999999999888765543 34555566667999999999999999999998888532 2111 1321 1 Q ss_pred CCceEEEEEeeccccccccccccccc-ccc-CcceEEEEec-----C-Ccccccc---------cCc-------ccE--E Q lcl|NC_019404. 114 GAELETVRVYDRTQVKVQNREENPRN-ARF-GKPLTYRITT-----N-ESDMFYD---------VHY-------SRI--H 167 (418) Q Consensus 114 ~~~i~~i~v~~~~~i~~~~~~~dp~s-~~y-g~p~~y~i~~-----~-~~~~~~~---------iH~-------SR~--i 167 (418) .+.+..+..+........++ +-.-. ... .+-..|+|.. . ....+.. +.+ +|. + T Consensus 161 ~~~~~~~~f~~~~~~~~~~y-~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~ 239 (499) T protein:vir:80 161 SENVDECLIANSFHKNNKYY-KLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFI 239 (499) T ss_pred CCCeEEEEEEEEEeecCeEE-EEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCCccceE Confidence 23333332222111110000 00000 000 0001222221 0 0000111 111 121 1 Q ss_pred EecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHH Q lcl|NC_019404. 168 IIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDN 247 (418) Q Consensus 168 ~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~ 247 (418) +|. .+.+. -+....++|.|++.. +.+.+..++.+....+.-+......++--..+-....+.++.. ...+. . T Consensus 240 ~~~-~~~~N-~~~~~splG~S~~~~-~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~--~~~~~---~ 311 (499) T protein:vir:80 240 YIK-PNIAN-NKNLTSPLGISVYAN-ALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGST--TQYFD---S 311 (499) T ss_pred eec-CCccc-cccCCCccCCchHhh-HHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCc--ccCCC---c Confidence 111 11111 122345689999974 8899999999988887766554444432111111111111111 00000 0 Q ss_pred hcCCcceeEEEcC--CCceeEeeccc--CCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHH---HHHHHHHH Q lcl|NC_019404. 248 NSGVGQAIGIDAE--SEEYSVLNSDI--GGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTAL---ETFHKLID 320 (418) Q Consensus 248 ~~~~~~~~~~d~~--~e~~~~~~~~~--~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~---~~y~~~I~ 320 (418) .......+..+.+ ...++..+..+ ......++.+.+.+...+|+|...| |...+|.. |+.+-. ...+.++. T Consensus 312 ~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~f-g~~~~g~~-TAtei~s~~~~l~~~~~ 389 (499) T protein:vir:80 312 TDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTF-TFDENGLK-TATEVVSEKSETYQTKN 389 (499) T ss_pred ccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhc-CCCcccch-hHHHHHHHHHHHHHHHH Confidence 0111112111111 12366666555 3456778888899999999998654 65666653 444433 33455555 Q ss_pred HHHHHHHHHHHHHHHHHhhc--------------cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHH Q lcl|NC_019404. 321 RKRNAELLPILEFLIPFIVN--------------AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARD 386 (418) Q Consensus 321 ~~Qe~~l~p~l~~l~~~i~~--------------~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~ 386 (418) .+| ..++..|++|+..|++ ..++.+.|++-...++.+. +++..+++.+|++|.+.++. T Consensus 390 ~~~-~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~-------~~~~~~~~~~Gi~S~et~l~ 461 (499) T protein:vir:80 390 SHS-QLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTT-------INRYTTAKNQGMIPLKIALQ 461 (499) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHH-------HHHHHHHHHcCCCCHHHHHh Confidence 554 4578888888777653 1358899999888887654 55666777788888766643 Q ss_pred HHHhhcCcCCCChh---------------hccccc-ccCCCccc Q lcl|NC_019404. 387 TLRTIAPEIKIGDN---------------DIQTEE-SELITETE 414 (418) Q Consensus 387 ~l~~~~~~~~~~~~---------------~~~~~e-~~~~~e~e 414 (418) .+ . +.+++ ++++.+ .-..+|.| T Consensus 462 ~~--~----~~~d~ea~~el~~i~~E~~~~~~~~d~~g~~ge~e 499 (499) T protein:vir:80 462 RA--W----NITEAEADEWAEMLAKEKQAEIPNNDMTGIFGEEE 499 (499) T ss_pred hc--C----CCChHHHHHHHHHHHHHhhcCCCCCCccccCCCCC Confidence 21 1 11111 111111 11223444 No 174 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.21 E-value=3e-11 Score=78.27 Aligned_cols=394 Identities=12% Similarity=0.083 Sum_probs=192.1 Q ss_pred CccchhhHHHHhcC------CCCcccc-CccccCCHH------HHHHHHHc---------------------CCccchhh Q lcl|NC_019404. 1 MVKTDSYANIFLGG------SDGSEIY-GSLQNQAPT------ILASLYAD---------------------NALVRRII 46 (418) Q Consensus 1 ~~~~D~~~n~~~g~------~~~~~~~-~~~~~~~~~------~l~~~Y~~---------------------~~~~r~iV 46 (418) |==-|.+.|.+-.+ .+-.... ..--.+++. .-.++|.. -++++.|| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 33334444443110 0000000 000111221 22234432 27889999 Q ss_pred hcchhhhccCCccccCcch--HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCccc--------ccccC-CC Q lcl|NC_019404. 47 DTIPETALAAGFHIDGIDD--EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALT--------SPVRE-GA 115 (418) Q Consensus 47 d~~a~d~~r~~~~i~~~~d--~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~--------~pl~~-~~ 115 (418) +..|+-++.+...|+.+++ .+.+++.++.-++...+.+++..+...|++++-+.++++...- -|+.- .+ T Consensus 81 ~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~I~~v~ad~~~P~~~d~~ 160 (500) T protein:vir:98 81 KKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDKVRVAFVQAPVFLPLQSNTQ 160 (500) T ss_pred HHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeeEEEEEcCC Confidence 9999999999887765443 2456666777789999999999999999998877665433110 12211 11 Q ss_pred ceEEEEEeeccc--c-------ccccccc-cccccccCcceEEEEecCCc-ccccc-----c----CcccEEEecCccch Q lcl|NC_019404. 116 ELETVRVYDRTQ--V-------KVQNREE-NPRNARFGKPLTYRITTNES-DMFYD-----V----HYSRIHIIDGERVP 175 (418) Q Consensus 116 ~i~~i~v~~~~~--i-------~~~~~~~-dp~s~~yg~p~~y~i~~~~~-~~~~~-----i----H~SR~i~~~g~~lp 175 (418) .+....++-++. . +-.++.. +.. ..|..+...+...+. ..+.. + .+.- .+.|-+.| T Consensus 161 ~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~--~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~--~~~~~~~p 236 (500) T protein:vir:98 161 DVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSS--DDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEA--KVTDVTRP 236 (500) T ss_pred CeEEEEEEEEEeeeecCCceEEEEEEEEEEeCC--ceeEEEEEEEecccccccCcccccccccCCcCcce--EeccCCCc Confidence 111111111100 0 0000000 000 011111111111110 00111 1 1111 11221111 Q ss_pred h--h-------hhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCc-chHHHHHHHHHHH Q lcl|NC_019404. 176 N--A-------MRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDS-EGFGAARLRLAQV 245 (418) Q Consensus 176 ~--~-------~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~-~~~~~~~~r~~~~ 245 (418) . + -.....++|.|++.. +.+.++.++.+......-+......++--..+-.....+ .+..-...++.. T Consensus 237 ~f~~~~~~~~N~~~~~sp~G~S~~~~-~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~- 314 (500) T protein:vir:98 237 IFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRPRFES- 314 (500) T ss_pred cEEEecCCccccccCCCccCCchhhh-hHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCcccCC- Confidence 1 1 122245689999975 889999999998888877765444443322111100000 110000001100 Q ss_pred HHhcCCcceeEE-EcCCCceeEeeccc--CCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhH---HHHHHHHHH Q lcl|NC_019404. 246 DNNSGVGQAIGI-DAESEEYSVLNSDI--GGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNT---ALETFHKLI 319 (418) Q Consensus 246 ~~~~~~~~~~~~-d~~~e~~~~~~~~~--~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~---d~~~y~~~I 319 (418) .+.....+-. ++++..++..+..+ ......++.+.+.++..+|++...| |...+|.. |..+ ..+.-|.++ T Consensus 315 --~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~-~~~~~g~~-TAtei~s~~~~~~~t~ 390 (500) T protein:vir:98 315 --DQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLF-SFDGKSMK-TATEIVSENSDTYQMR 390 (500) T ss_pred --CcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCcccc-ccCcCccc-cHHHHHHHHHHHHHHH Confidence 0000111100 12223477766554 3467778889999999999998655 44445542 4443 334567777 Q ss_pred HHHHHHHHHHHHHHHHHHhhcc--------------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_019404. 320 DRKRNAELLPILEFLIPFIVNA--------------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEAR 385 (418) Q Consensus 320 ~~~Qe~~l~p~l~~l~~~i~~~--------------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r 385 (418) +++|. .++..|+.|+..|+.. .++.+.|++-...|+.+. ++.+.+++.+|+++.++++ T Consensus 391 ~~~~~-~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~-------~~~~~~~v~aGi~s~~~~i 462 (500) T protein:vir:98 391 NSIVA-LVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAE-------LDYWIKVVNAGFGTREMAI 462 (500) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHH-------HHHHHHHHHcCCCCHHHHH Confidence 77765 4788999988887531 257789998777776544 5566778888999888776 Q ss_pred HHHHhhcCcCCCChhh-------ccc---ccccCCCccccccC Q lcl|NC_019404. 386 DTLRTIAPEIKIGDND-------IQT---EESELITETEVVIA 418 (418) Q Consensus 386 ~~l~~~~~~~~~~~~~-------~~~---~e~~~~~e~e~~~~ 418 (418) ..+ . +.++++ +.+ ++.....+..+++- T Consensus 463 ~~~--~----g~~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g 499 (500) T protein:vir:98 463 QKV--L----NVTEEKAQEIAAEINTGIVDEINQQRTDTHLYG 499 (500) T ss_pred Hhc--C----CCCHHHHHHHHHHHHHhccccCCCCCccccccC Confidence 432 1 122221 110 01111112223333 No 175 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.21 E-value=3e-11 Score=78.27 Aligned_cols=394 Identities=12% Similarity=0.083 Sum_probs=192.1 Q ss_pred CccchhhHHHHhcC------CCCcccc-CccccCCHH------HHHHHHHc---------------------CCccchhh Q lcl|NC_019404. 1 MVKTDSYANIFLGG------SDGSEIY-GSLQNQAPT------ILASLYAD---------------------NALVRRII 46 (418) Q Consensus 1 ~~~~D~~~n~~~g~------~~~~~~~-~~~~~~~~~------~l~~~Y~~---------------------~~~~r~iV 46 (418) |==-|.+.|.+-.+ .+-.... ..--.+++. .-.++|.. -++++.|| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 33334444443110 0000000 000111221 22234432 27889999 Q ss_pred hcchhhhccCCccccCcch--HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCccc--------ccccC-CC Q lcl|NC_019404. 47 DTIPETALAAGFHIDGIDD--EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALT--------SPVRE-GA 115 (418) Q Consensus 47 d~~a~d~~r~~~~i~~~~d--~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~--------~pl~~-~~ 115 (418) +..|+-++.+...|+.+++ .+.+++.++.-++...+.+++..+...|++++-+.++++...- -|+.- .+ T Consensus 81 ~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~I~~v~ad~~~P~~~d~~ 160 (500) T protein:vir:30 81 KKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDKVRVAFVQAPVFLPLQSNTQ 160 (500) T ss_pred HHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeeEEEEEcCC Confidence 9999999999887765443 2456666777789999999999999999998877665433110 12211 11 Q ss_pred ceEEEEEeeccc--c-------ccccccc-cccccccCcceEEEEecCCc-ccccc-----c----CcccEEEecCccch Q lcl|NC_019404. 116 ELETVRVYDRTQ--V-------KVQNREE-NPRNARFGKPLTYRITTNES-DMFYD-----V----HYSRIHIIDGERVP 175 (418) Q Consensus 116 ~i~~i~v~~~~~--i-------~~~~~~~-dp~s~~yg~p~~y~i~~~~~-~~~~~-----i----H~SR~i~~~g~~lp 175 (418) .+....++-++. . +-.++.. +.. ..|..+...+...+. ..+.. + .+.- .+.|-+.| T Consensus 161 ~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~--~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~--~~~~~~~p 236 (500) T protein:vir:30 161 DVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSS--DDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEA--KVTDVTRP 236 (500) T ss_pred CeEEEEEEEEEeeeecCCceEEEEEEEEEEeCC--ceeEEEEEEEecccccccCcccccccccCCcCcce--EeccCCCc Confidence 111111111100 0 0000000 000 011111111111110 00111 1 1111 11221111 Q ss_pred h--h-------hhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCc-chHHHHHHHHHHH Q lcl|NC_019404. 176 N--A-------MRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDS-EGFGAARLRLAQV 245 (418) Q Consensus 176 ~--~-------~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~-~~~~~~~~r~~~~ 245 (418) . + -.....++|.|++.. +.+.++.++.+......-+......++--..+-.....+ .+..-...++.. T Consensus 237 ~f~~~~~~~~N~~~~~sp~G~S~~~~-~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~- 314 (500) T protein:vir:30 237 IFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRPRFES- 314 (500) T ss_pred cEEEecCCccccccCCCccCCchhhh-hHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCcccCC- Confidence 1 1 122245689999975 889999999998888877765444443322111100000 110000001100 Q ss_pred HHhcCCcceeEE-EcCCCceeEeeccc--CCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhH---HHHHHHHHH Q lcl|NC_019404. 246 DNNSGVGQAIGI-DAESEEYSVLNSDI--GGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNT---ALETFHKLI 319 (418) Q Consensus 246 ~~~~~~~~~~~~-d~~~e~~~~~~~~~--~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~---d~~~y~~~I 319 (418) .+.....+-. ++++..++..+..+ ......++.+.+.++..+|++...| |...+|.. |..+ ..+.-|.++ T Consensus 315 --~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~-~~~~~g~~-TAtei~s~~~~~~~t~ 390 (500) T protein:vir:30 315 --DQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLF-SFDGKSMK-TATEIVSENSDTYQMR 390 (500) T ss_pred --CcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCcccc-ccCcCccc-cHHHHHHHHHHHHHHH Confidence 0000111100 12223477766554 3467778889999999999998655 44445542 4443 334567777 Q ss_pred HHHHHHHHHHHHHHHHHHhhcc--------------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_019404. 320 DRKRNAELLPILEFLIPFIVNA--------------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEAR 385 (418) Q Consensus 320 ~~~Qe~~l~p~l~~l~~~i~~~--------------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r 385 (418) +++|. .++..|+.|+..|+.. .++.+.|++-...|+.+. ++.+.+++.+|+++.++++ T Consensus 391 ~~~~~-~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~-------~~~~~~~v~aGi~s~~~~i 462 (500) T protein:vir:30 391 NSIVA-LVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAE-------LDYWIKVVNAGFGTREMAI 462 (500) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHH-------HHHHHHHHHcCCCCHHHHH Confidence 77765 4788999988887531 257789998777776544 5566778888999888776 Q ss_pred HHHHhhcCcCCCChhh-------ccc---ccccCCCccccccC Q lcl|NC_019404. 386 DTLRTIAPEIKIGDND-------IQT---EESELITETEVVIA 418 (418) Q Consensus 386 ~~l~~~~~~~~~~~~~-------~~~---~e~~~~~e~e~~~~ 418 (418) ..+ . +.++++ +.+ ++.....+..+++- T Consensus 463 ~~~--~----g~~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g 499 (500) T protein:vir:30 463 QKV--L----NVTEEKAQEIAAEINTGIVDEINQQRTDTHLYG 499 (500) T ss_pred Hhc--C----CCCHHHHHHHHHHHHHhccccCCCCCccccccC Confidence 432 1 122221 110 01111112223333 No 176 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=99.19 E-value=7.5e-12 Score=81.56 Aligned_cols=319 Identities=13% Similarity=0.043 Sum_probs=148.9 Q ss_pred HHHHhc---CCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc---Ccc---hH--H----HHHHH Q lcl|NC_019404. 8 ANIFLG---GSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID---GID---DE--P----AFWSR 72 (418) Q Consensus 8 ~n~~~g---~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~---~~~---d~--~----~i~~~ 72 (418) .|+|-. ........+.....+.. -......+..+++||+.+|++.-+-++.+- ..+ +. . .+... T Consensus 1 M~~f~k~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~l 79 (378) T protein:vir:85 1 MNLFGKVVSFSRGKLNNDTQRVTAWQ-NEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLDEV 79 (378) T ss_pred CchhhhhhhhhhcccccCCcceeeee-ccchhhhhHHHHHHHHHHHHhHhhCceeEEEEeccccccccccccccchHHHH Confidence 222210 00011111111111000 001122446678899999999999888762 110 00 0 11111 Q ss_pred HH----HhCchHHHHHHHH-hccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceE Q lcl|NC_019404. 73 WD----DLEMTQNINDAWS-WARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLT 147 (418) Q Consensus 73 ~~----~l~~~~~~~~a~~-~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~ 147 (418) +. ..-....|.+.+. +-.++|.|++++..++ ..|.+.+.. T Consensus 80 L~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~---------~~g~~~~~~-------------------------- 124 (378) T protein:vir:85 80 LNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDS---------ETGELLDLL-------------------------- 124 (378) T ss_pred HhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecC---------CCceEEEEE-------------------------- Confidence 11 1112234555444 4557899998865433 122232221 Q ss_pred EEEecCCcccccccCcccEEEecCccchhhhhhccccCC-cchHHHHHHHHHHHHHHHHHHHHHHHHHcCC-ceeecchH Q lcl|NC_019404. 148 YRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWG-RSVLSSDILDSIKDYTNCERLATQLLRRKQQ-AVWKAKGL 225 (418) Q Consensus 148 y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G-~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~-~v~k~~~l 225 (418) ... ..+.+.++.+||+... ....| .+.+ ..+...+. .. +..... -++|+++ T Consensus 125 --~~~----~~~~~~~~dvih~~~~---------~~~~~~~~~~-~~a~~~~~---~~-------~~~~~~~g~l~~~~- 177 (378) T protein:vir:85 125 --FAN----DKKEYKPEELVRLVSP---------FYINEDTSIL-DNALASIQ---TK-------LEQGKLRGLLKINA- 177 (378) T ss_pred --ecC----CCEEEcccceEEEecC---------cCccchhhHH-HHHHHHHH---HH-------HhcCCcceEEEeCC- Confidence 111 1134556778877521 11122 2222 22222221 11 111111 2334442 Q ss_pred HHhhcCcchHHHHHHHHHHHHHh---cCCcceeEEEcCCCceeEeecccCCHH-HHHHHHHHHHhhhhcCCeeeeeccCc Q lcl|NC_019404. 226 AELCDDSEGFGAARLRLAQVDNN---SGVGQAIGIDAESEEYSVLNSDIGGID-AFLDKKFDRIVALSGIHEIILKNKNV 301 (418) Q Consensus 226 ~~~~~~~~~~~~~~~r~~~~~~~---~~~~~~~~~d~~~e~~~~~~~~~~gl~-~~~~~~~~~iaaas~IP~t~L~G~s~ 301 (418) .+. .+...+.++++...... -.+.+.+++..++.+|++++.+...++ +.+++..+.||.+.|||..+|.| T Consensus 178 --~l~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~~--- 251 (378) T protein:vir:85 178 --FLD-IDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIELIKSELLTGYFMNENILLG--- 251 (378) T ss_pred --cCC-HHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEeccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcC--- Confidence 111 22233444555443221 123334555555688998887654433 23466678899999999988743 Q ss_pred cccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcc-------------CCceEEeCCCCCCCHHHHHHHHHH Q lcl|NC_019404. 302 GGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI---VNA-------------EEWSVEFSPLDHESSKDKAEVLEK 365 (418) Q Consensus 302 ~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i---~~~-------------~~~~~~f~pL~~~~eke~ae~~~~ 365 (418) +..+....+||. .-|.|.+.++-..+ +++ .++.|+++.|...|.+++ T Consensus 252 ----s~~e~~~~~f~~-------~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~------ 314 (378) T protein:vir:85 252 ----TATQEQQIYFYN-------STIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKEL------ 314 (378) T ss_pred ----CchHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHH------ Confidence 112334444544 34888887765544 111 124566667777777655 Q ss_pred HHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCC---------h-hhcccc------c-ccCCCccc Q lcl|NC_019404. 366 SVNSIAALIAAGAMDIKEARDTLRTIAPEIKIG---------D-NDIQTE------E-SELITETE 414 (418) Q Consensus 366 ~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~---------~-~~~~~~------e-~~~~~e~e 414 (418) ++++.+++++|++|++|+|+.+. ..+..+.+ . ++..+. + +.-++.+| T Consensus 315 -~~~~~~~~~~G~~T~NE~R~~lg-l~p~~gGD~~~~~~N~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:85 315 -IDLYHENINGPIFTQNQLLVKMG-EQPIEGGDIYIANLNAVAVKNLSDLQGSRKDVASTDETNNQ 378 (378) T ss_pred -HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeEeecccccccccchhhcCccCCCCCCCCCCCC Confidence 77888999999999999999763 22211100 0 001000 0 01112233 No 177 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.16 E-value=4.7e-10 Score=71.72 Aligned_cols=391 Identities=10% Similarity=0.042 Sum_probs=197.3 Q ss_pred Cccchhh-----------HH------HHhcCCCC-ccccCc-cccCCH------------HHHHHHHHcCCccchhhhcc Q lcl|NC_019404. 1 MVKTDSY-----------AN------IFLGGSDG-SEIYGS-LQNQAP------------TILASLYADNALVRRIIDTI 49 (418) Q Consensus 1 ~~~~D~~-----------~n------~~~g~~~~-~~~~~~-~~~~~~------------~~l~~~Y~~~~~~r~iVd~~ 49 (418) |+++=+- .. .+-|.++. .+..++ +...+. .....+++.|++++.+|+.. T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~ 80 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQ 80 (553) T ss_pred CcchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 1111000 00 01111111 112221 112111 23346699999999999999 Q ss_pred hhhhccCCccccCc---------c--hH----HHHHHHHHHh--------------CchHHHHHHHHhccccceEEEEEe Q lcl|NC_019404. 50 PETALAAGFHIDGI---------D--DE----PAFWSRWDDL--------------EMTQNINDAWSWARLFGGAAIVAI 100 (418) Q Consensus 50 a~d~~r~~~~i~~~---------~--d~----~~i~~~~~~l--------------~~~~~~~~a~~~~rl~G~~~i~i~ 100 (418) ....+-.|+..+.. + .. +.++..|++. .+.....-+++.....|.+++.+. T Consensus 81 ~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~ 160 (553) T protein:vir:63 81 RDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATAE 160 (553) T ss_pred HHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEee Confidence 99999999887531 1 11 2345444432 233444556666667899988776 Q ss_pred ecCCCcccccccCCCceEEEEEeecccccccccc-------ccccccccCcceEEEEecCCcc----------------c Q lcl|NC_019404. 101 VKDNRALTSPVREGAELETVRVYDRTQVKVQNRE-------ENPRNARFGKPLTYRITTNESD----------------M 157 (418) Q Consensus 101 ~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~-------~dp~s~~yg~p~~y~i~~~~~~----------------~ 157 (418) .........| ..|.++++..|...... .-+--..+|+|..|+|...-.+ . T Consensus 161 ~~~~~~~~~~-------~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~ 233 (553) T protein:vir:63 161 WDRAANRPYA-------TCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQ 233 (553) T ss_pred eccCCCCccc-------ceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccceeeecc Confidence 5321111111 13567788777543221 0111124799999999532111 1 Q ss_pred ccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHH---HHHHHHHHHHHcCCceeecchHHHh----hc Q lcl|NC_019404. 158 FYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTN---CERLATQLLRRKQQAVWKAKGLAEL----CD 230 (418) Q Consensus 158 ~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~---~~~~~~~l~~~~~~~v~k~~~l~~~----~~ 230 (418) ...|+.++|||+-- .....+.-|.|.|. ++...|++++. +.-..+.+.-.+ .-++|.+.-... +. T Consensus 234 ~~~v~a~~vlH~f~------~~r~gQ~RGis~la-pvl~~l~~l~~y~daeL~~a~i~A~~-a~fi~~~~~~~~~~~~~~ 305 (553) T protein:vir:63 234 SKPWGRRQVIHILE------PREPDQSRGIADIV-SGLKDMRMAKRFKEMSLQNAVINASY-AAAIESELPPEFIHSQMS 305 (553) T ss_pred ccccChhHheeccc------ccCCCcccCCchHH-HHHHHHHHHhHHHHHHHHHHHHhhhh-eeeeecCCChhhhhhhcc Confidence 13577788887631 12234456888774 46665555444 433333333222 224454421111 11 Q ss_pred CcchHH-----------HHHHHHHHHHHhcCCcceeEEEcCCCceeEeecc--cCCHHHHHHHHHHHHhhhhcCCeeeee Q lcl|NC_019404. 231 DSEGFG-----------AARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSD--IGGIDAFLDKKFDRIVALSGIHEIILK 297 (418) Q Consensus 231 ~~~~~~-----------~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~--~~gl~~~~~~~~~~iaaas~IP~t~L~ 297 (418) .+.+.. ....-........-..+.+.....+++++..+.+ -++..++.......||+..|||.-.|. T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt 385 (553) T protein:vir:63 306 GGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFT 385 (553) T ss_pred cccccccccccccccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHh Confidence 000000 0000000000011124566666778889888876 458999999999999999999999998 Q ss_pred ccCcc-ccccchhHHHHHHHHHHHHHHHHHHH----HHHHHHHHHhhccCCc-----------------------eEEeC Q lcl|NC_019404. 298 NKNVG-GLSSSQNTALETFHKLIDRKRNAELL----PILEFLIPFIVNAEEW-----------------------SVEFS 349 (418) Q Consensus 298 G~s~~-gl~stge~d~~~y~~~I~~~Qe~~l~----p~l~~l~~~i~~~~~~-----------------------~~~f~ 349 (418) |.-.+ .. |+.-..+.-+...++..|...+. |+.+..++..++...+ .|..+ T Consensus 386 ~D~s~~nY-SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p 464 (553) T protein:vir:63 386 RDFSKANY-SSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGA 464 (553) T ss_pred hhcccccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecC Confidence 86433 34 34455666788888888886543 4444444443332211 12222 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHH------------------HhhcCcCCCC---------hhhc Q lcl|NC_019404. 350 PLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTL------------------RTIAPEIKIG---------DNDI 402 (418) Q Consensus 350 pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l------------------~~~~~~~~~~---------~~~~ 402 (418) ..-..|. .|.+++....+++|+.|..++..+. ++.+-....+ +..- T Consensus 465 ~~~~iDP-------~Ke~~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~ 537 (553) T protein:vir:63 465 SQGQIDQ-------LKETQAAVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAAT 537 (553) T ss_pred CccccCh-------HHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCC Confidence 2222333 5678888899999998887765432 1111100000 0000 Q ss_pred c-ccc---ccCCCccc Q lcl|NC_019404. 403 Q-TEE---SELITETE 414 (418) Q Consensus 403 ~-~~e---~~~~~e~e 414 (418) + .++ .+..++.| T Consensus 538 ~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 538 GIAEDPAAAQTSQQGE 553 (553) T ss_pred CCCCCCCCCCcccccC Confidence 0 011 12222333 No 178 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.11 E-value=1.8e-09 Score=68.50 Aligned_cols=361 Identities=12% Similarity=0.051 Sum_probs=180.5 Q ss_pred CccchhhHHHHhcCCCCccccC-ccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCc--chH--HHHHHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYG-SLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGI--DDE--PAFWSRWDD 75 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~-~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~--~d~--~~i~~~~~~ 75 (418) ..+.+.+.+-..++-...+..- ..+..+...+..+ ..++-+..++++.-...+...|.|+.. +++ +.+.+.+++ T Consensus 27 a~~~~~~~~~~~~~~~p~~~~il~~~~~~~~~y~~m-~~D~~i~s~l~~Rk~av~~~~w~i~~~~~~~~~a~~i~e~l~~ 105 (491) T protein:vir:79 27 ATRARSIDFFALGMYLPNPDPVLKALGKDIRVYREL-RADAHVGGCVRRRKAAVKALEWGLDRGKAKSRVAKSIADVFAD 105 (491) T ss_pred hhhccccccccccccCcchhHHHhhccCCHHHHHHH-hhChHHHHHHHHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhc Confidence 0011111110001000000000 0001122333344 357888888888888888888888632 222 456777778 Q ss_pred hCchHHHHHHHHhccccceEEEEEeec-CCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCC Q lcl|NC_019404. 76 LEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNE 154 (418) Q Consensus 76 l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~ 154 (418) +.+...+.+.+ .+.+||.|+.=+.-+ ++.. -.++.+...++.++... +. +.. .|... .+ T Consensus 106 ~~~~~~i~~~l-da~~~G~s~~Ei~w~~~~g~--------~~~~~l~~r~~~~f~~d-----~~----~~l-~l~~~-~~ 165 (491) T protein:vir:79 106 LDLSRIATEML-DAVLYGYQPMEITWGKVGNY--------IVPIDVVGKPADWFVYD-----PE----NQL-RFRSK-EH 165 (491) T ss_pred CCHHHHHHHHH-HhhhhcceeEEEEEeecCCe--------eeEEeeeeecccceeec-----cC----Cce-EEeec-CC Confidence 87666555554 689999999865432 1111 12345555555544321 11 111 12211 12 Q ss_pred cccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCc Q lcl|NC_019404. 155 SDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDS 232 (418) Q Consensus 155 ~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~ 232 (418) ...+..+.+.+.+++.... ...++||.+.+ +.||....--..+...-+..+.+++++ +.|++.- .. T Consensus 166 ~~~g~~lp~~k~i~~~~~~------~~g~p~g~gLl-~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~-----a~ 233 (491) T protein:vir:79 166 WVQGEELPARKFLVPRQEA------TYLNPYGFPDL-SMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRS-----AS 233 (491) T ss_pred CCCceeecCCCeEEEEecC------CCCCcccchhH-HHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCC-----CC Confidence 2233567777877775432 34567888866 568776666666777778888888865 5565521 11 Q ss_pred chHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeeccc-CC----HHHHHHHHHHHHhhhhcCCeeeeeccC----ccc Q lcl|NC_019404. 233 EGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDI-GG----IDAFLDKKFDRIVALSGIHEIILKNKN----VGG 303 (418) Q Consensus 233 ~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~-~g----l~~~~~~~~~~iaaas~IP~t~L~G~s----~~g 303 (418) +.+ ..++.......+... .++.-++.+++.++..- +| -..+++.+-..|+.+. +||+ .+| T Consensus 234 ~~e---k~~l~~al~~~~~~a-~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i-------LGqtlTt~~~g 302 (491) T protein:vir:79 234 DAE---TNLLLDRLEDMVQDA-VAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIAL-------LGQNQTTEATS 302 (491) T ss_pred HHH---HHHHHHHHHHHhcCe-EEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHH-------hhhhhccCccc Confidence 111 222222222333344 44445568899988763 33 3456666677777432 3443 122 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019404. 304 LSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNA-----EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGA 378 (418) Q Consensus 304 l~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~-----~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~ 378 (418) -.|.|+--.....+.+++..+. +...+++++.-++.- ....|.|.. .| +..+..|++++++++.|+ T Consensus 303 s~a~~~vh~~v~~~i~~~D~~~-i~~tln~li~~l~~~N~~~~~~p~f~~~e------~e--e~~~~~a~~~~~L~~~G~ 373 (491) T protein:vir:79 303 TRASAQAGLEVTDDIRDGDKAI-VVEAMNMLIRWICDLNFDGAARPVFDMWE------QE--QVDEIQAGRDEKLTRAGA 373 (491) T ss_pred chhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCCCcceEeecC------cC--chhHHHHHHHHHHHhCCC Confidence 2234444555566666666543 455556666655431 223344332 22 333567899999999997 Q ss_pred -CCHHHHHHHHHhhcCcCCCChhhcccccccCCCccccccC Q lcl|NC_019404. 379 -MDIKEARDTLRTIAPEIKIGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 379 -i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e~e~~~~ 418 (418) ++.+.+++.+.- +..+.+.+..+...+... .....| T Consensus 374 ~i~~~~~~e~~Gi--p~~~~~e~~~~~~~~~~~--~~~~~~ 410 (491) T protein:vir:79 374 RFTPAYFKRAYNL--QDGDLDERPLPVSAVDAV--GAASFA 410 (491) T ss_pred ccCHHHHHHHhCC--CCCCCCccccCcCccccc--cccccc Confidence 888888876631 111111110110000000 000001 No 179 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.11 E-value=3.9e-10 Score=72.15 Aligned_cols=389 Identities=13% Similarity=0.100 Sum_probs=199.4 Q ss_pred CccchhhH------------------HHHhcCCCCccccCccccCCH------------HHHHHHHHcCCccchhhhcch Q lcl|NC_019404. 1 MVKTDSYA------------------NIFLGGSDGSEIYGSLQNQAP------------TILASLYADNALVRRIIDTIP 50 (418) Q Consensus 1 ~~~~D~~~------------------n~~~g~~~~~~~~~~~~~~~~------------~~l~~~Y~~~~~~r~iVd~~a 50 (418) |=-.|-+. ..+.+.+.+.+..+++...+. .....+|+.|++++++|+... T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 11111111 112333333333333322222 233466999999999999998 Q ss_pred hhhcc-CCcccc----CcchH------HHHHHHHHH----------hCchHHHHHHHHhccccceEEEEEeecCCCcccc Q lcl|NC_019404. 51 ETALA-AGFHID----GIDDE------PAFWSRWDD----------LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTS 109 (418) Q Consensus 51 ~d~~r-~~~~i~----~~~d~------~~i~~~~~~----------l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~ 109 (418) ...+= .|+.++ +.+.. ++|++.|++ +.+.+...-+++.....|.+++.+........ T Consensus 81 ~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~-- 158 (548) T protein:vir:95 81 ERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNY-- 158 (548) T ss_pred HhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccc-- Confidence 88885 455553 22211 234444433 33445555577777778999887765321111 Q ss_pred cccCCCc--eEEEEEeeccccccccccc------cccccccCcceEEEEecCC---------cccccccCcccEEEecCc Q lcl|NC_019404. 110 PVREGAE--LETVRVYDRTQVKVQNREE------NPRNARFGKPLTYRITTNE---------SDMFYDVHYSRIHIIDGE 172 (418) Q Consensus 110 pl~~~~~--i~~i~v~~~~~i~~~~~~~------dp~s~~yg~p~~y~i~~~~---------~~~~~~iH~SR~i~~~g~ 172 (418) ..+. -..|.++++..|....... -..-..+|+|..|+|.... ......|-.++|+|+-- T Consensus 159 ---~~g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA~~VlHif~- 234 (548) T protein:vir:95 159 ---TFATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEAERIIHIAY- 234 (548) T ss_pred ---cCCcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccceeeechhHheeccc- Confidence 0111 1135677777664322111 1111357999999996421 11224577788887631 Q ss_pred cchhhhhhccccCCcchHHHHHHHHHHHHHHHHH---HHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhc Q lcl|NC_019404. 173 RVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCER---LATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNS 249 (418) Q Consensus 173 ~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~---~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~ 249 (418) .....+.-|.|.|. ++...|+.++.-.. ..+.+.-.+ .-++|.+........+ ..... . ... T Consensus 235 -----~~r~gQ~RGvs~la-pvl~~l~~l~~y~dael~~aki~A~~-a~fi~~~~~~~~~~~~-~~~~~-~------~~~ 299 (548) T protein:vir:95 235 -----RKRIGQNRGVPMLH-AVLIRLADLKDYEESERVAARISAAL-AMYIKKGNPDSYTVEP-GKDRK-N------RTI 299 (548) T ss_pred -----ccCCccccCcchHH-HHHHHHHHHhHHHHHHHHHHHHhhhh-eeeeecCCCccccCCC-Ccccc-c------ccc Confidence 12334556888774 56666665554433 333322222 2234444222111111 11000 0 011 Q ss_pred CC-cceeE-EEcCCCceeEeecc--cCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHH Q lcl|NC_019404. 250 GV-GQAIG-IDAESEEYSVLNSD--IGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNA 325 (418) Q Consensus 250 ~~-~~~~~-~d~~~e~~~~~~~~--~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~ 325 (418) .. .++++ ....+++++..+.+ -++..++...+...||+++|||.-.|.|...+.. ||.-..+.-+...++..|.. T Consensus 300 ~~~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~nY-SS~R~~l~e~~r~~~~~q~~ 378 (548) T protein:vir:95 300 PIAPGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDGTY-SAQRQELVEGWLGYDLLQHE 378 (548) T ss_pred cccCCccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccchhH-HHHHHHHHHHHHHHHHHHHH Confidence 12 23333 34667889988865 4689999999999999999999999999864333 44555666788888888876 Q ss_pred HHH----HHHHHHHHHhhccCC------------ceEEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHH Q lcl|NC_019404. 326 ELL----PILEFLIPFIVNAEE------------WSVEF--SPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDT 387 (418) Q Consensus 326 ~l~----p~l~~l~~~i~~~~~------------~~~~f--~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~ 387 (418) .+. |+.+..++..++... +..+| +..-..|. .|.+++....+++|+.|..++..+ T Consensus 379 ~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP-------~Kea~A~~~~i~~Gl~T~~~~~a~ 451 (548) T protein:vir:95 379 FIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINP-------MHEANAWELLVKAGFADEAEVARA 451 (548) T ss_pred HHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccCh-------HHHHHHHHHHHHcCCCCHHHHHHH Confidence 543 444444444433221 23344 22222333 567888899999999998776543 Q ss_pred HH--------------hhcCcCCCCh-----hhcccccccCCCccccccC Q lcl|NC_019404. 388 LR--------------TIAPEIKIGD-----NDIQTEESELITETEVVIA 418 (418) Q Consensus 388 l~--------------~~~~~~~~~~-----~~~~~~e~~~~~e~e~~~~ 418 (418) .. +.....|+.. ........++.++.+..++ T Consensus 452 ~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (548) T protein:vir:95 452 RGRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYL 501 (548) T ss_pred hCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhcc Confidence 10 0001112110 0011111111112211111 No 180 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.11 E-value=7.7e-11 Score=76.03 Aligned_cols=395 Identities=11% Similarity=0.054 Sum_probs=188.4 Q ss_pred CccchhhHHHHhcCCCCcccc-----------Cc------------c-ccCCHHHHHHHHHcCCccchhhhcchhhhccC Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIY-----------GS------------L-QNQAPTILASLYADNALVRRIIDTIPETALAA 56 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~-----------~~------------~-~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~ 56 (418) .++++-+.-.-..+...+..+ +. + .......+.......++++.||+..|+-++.+ T Consensus 4 ~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~ll~~e 83 (518) T protein:vir:78 4 WSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAEYISGK 83 (518) T ss_pred hhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHHHHhhcCC Confidence 333333322222111110000 00 0 00001112222335678999999999999999 Q ss_pred Ccccc--Cc---chH---HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCc--------ccccccCCCceEEE Q lcl|NC_019404. 57 GFHID--GI---DDE---PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRA--------LTSPVREGAELETV 120 (418) Q Consensus 57 ~~~i~--~~---~d~---~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~--------l~~pl~~~~~i~~i 120 (418) ...|+ +. +++ +.+.+.++..+++..+.+++..+...|++++-+.+.+++. .--|+-..|.+..+ T Consensus 84 ~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~~v~ad~~~P~~~~g~~~~~ 163 (518) T protein:vir:78 84 PLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINILNGRPSISVHSSSQFWIDFKNNEPFRF 163 (518) T ss_pred CceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEECCeeEEEEEcCCeeEEEeecCcEEEE Confidence 87663 22 222 3455566777899999999999999998887665544432 11233333444433 Q ss_pred EEeeccccccc--------ccc----ccccccccC--cce--EEEEecCCcccccc-----------cCccc----EEEe Q lcl|NC_019404. 121 RVYDRTQVKVQ--------NRE----ENPRNARFG--KPL--TYRITTNESDMFYD-----------VHYSR----IHII 169 (418) Q Consensus 121 ~v~~~~~i~~~--------~~~----~dp~s~~yg--~p~--~y~i~~~~~~~~~~-----------iH~SR----~i~~ 169 (418) +|........ .+. .+-..-.|+ ..+ .|+-..+. ..... .++.. .... T Consensus 164 -~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~-~v~~~~~~~~~~l~~~~~~~~~~e~~~~~ 241 (518) T protein:vir:78 164 -NFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDK-TTPISAERLPEQITSYLHTNDIQLNHSVS 241 (518) T ss_pred -EEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcc-cccccccccccccccccccccCccceeec Confidence 1111111100 000 000000011 111 11111000 00000 01110 0111 Q ss_pred cCccchh---------hhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHH Q lcl|NC_019404. 170 DGERVPN---------AMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARL 240 (418) Q Consensus 170 ~g~~lp~---------~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~ 240 (418) .|.+.|. -.+....++|.|++.. +.+.++.++.+......-+......++-...+-....++.+.... - T Consensus 242 tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~-~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~-~ 319 (518) T protein:vir:78 242 IGLKSMGAYLINNSPSNTRYPHLNLGESDLSQ-CTNYLFAVDYFFTVYMREGEKTKTKIAASERMFRKKVNKSTDKEE-W 319 (518) T ss_pred cCCccceEEeeccccccccccCCCcCcchHhh-hhHHHHHHHHHHHHHHHHHHhCCceeeechhHhccCCCCCCCccc-c Confidence 1222111 1122345689999974 789999999998888888765544444332221111111111100 0 Q ss_pred HHHHHHHhcCCcceeE--EEcCC---CceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHH-- Q lcl|NC_019404. 241 RLAQVDNNSGVGQAIG--IDAES---EEYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTA-- 311 (418) Q Consensus 241 r~~~~~~~~~~~~~~~--~d~~~---e~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d-- 311 (418) .+.. .......+- .++.. +.++.++..+- .....++.++..+...+|++...| |...+.. |+.+- T Consensus 320 ~fd~---~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tf-g~~~~~~--TATei~s 393 (518) T protein:vir:78 320 SMNV---DEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATF-NLGNREV--KATEIWS 393 (518) T ss_pred ccCC---CCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhc-Ccccccc--cHHHHHH Confidence 0000 000011110 11111 13666666553 456677888888889999988765 6554433 33333 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 312 -LETFHKLIDRKRNAELLPILEFLIPFIVNA----------------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALI 374 (418) Q Consensus 312 -~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~----------------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~ 374 (418) ....|.+++.+|. .++..|+.|+..++.. .+++|+|++-...|+++++ +++++++ T Consensus 394 ~~~~~~~t~~~~~~-~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~-------~~~~~~v 465 (518) T protein:vir:78 394 LQDATVRKIEKKKR-LIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELS-------STLNNMN 465 (518) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHH-------HHHHHHH Confidence 3345777888774 5888888887776421 1478999999999987664 4566677 Q ss_pred hCCCCCHHHHHHHHHhhcCcCCCChhhc-------ccccccCC-CccccccC Q lcl|NC_019404. 375 AAGAMDIKEARDTLRTIAPEIKIGDNDI-------QTEESELI-TETEVVIA 418 (418) Q Consensus 375 ~~g~i~~~e~r~~l~~~~~~~~~~~~~~-------~~~e~~~~-~e~e~~~~ 418 (418) .+|++|.+++.+.+. .+.++++. .++..... ++.+.|-. T Consensus 466 ~aGimS~e~~i~~~~-----~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g 512 (518) T protein:vir:78 466 SALAMSVEEKVKLIH-----PKWEDEEIQAEVKRIYLENAIGEVPDPEAIGG 512 (518) T ss_pred hcCCCCHHHHHHHhC-----CCCCHHHHHHHHHHHHHHhcccCCCCCccccC Confidence 888888877654331 12233221 11111111 11111111 No 181 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=99.08 E-value=4e-10 Score=72.11 Aligned_cols=301 Identities=10% Similarity=0.023 Sum_probs=150.2 Q ss_pred Cc-cchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCch Q lcl|NC_019404. 1 MV-KTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMT 79 (418) Q Consensus 1 ~~-~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~ 79 (418) |+ +.|=+...=.-..+.++.|-.| .++.-|..+++.|+....+|..-.....+ ++. -....+ + T Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~epp--~~~~~La~l~~~n~~h~~~i~~k~N~l~~-~~~--Pn~~~t-----------~ 92 (348) T protein:vir:26 29 VDTNSWMTRYCELFYNDFDDYWEPP--ISLKGLAEIANANGYHGSLLKARANYVAG-RFM--NGGGLP-----------M 92 (348) T ss_pred ecCcchHHHHHHHHhcCCCccccCC--CCHHHHHHHHhhhhhhhhhHhhhhhHHhh-ccc--CCCCCC-----------H Confidence 21 1111111100011112233333 34567778888887777776655544332 111 111111 1 Q ss_pred HHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccccc Q lcl|NC_019404. 80 QNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFY 159 (418) Q Consensus 80 ~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~ 159 (418) ..|.+++..-.++|.|++.+.- + ..|.+..+.++++..++... | | .+|.+...+ ... T Consensus 93 ~~f~~~~~d~ll~Gnay~~~~r-n---------~~G~~~~L~~l~~~~v~~~~---d------~--~~~~~~~~g--~~~ 149 (348) T protein:vir:26 93 YKMNSACWDYFGLGMSAFVKIR-S---------YLKNVIALEPLPMVHMRKRK---N------G--DFVQLLRNN--EQK 149 (348) T ss_pred HHHHHHHHHHHhcCCeEEEEEE-c---------CCCcEEEEEEecCceeEeee---c------C--cEEEEEecC--eEE Confidence 2233444444578999988753 2 23557788888887766532 1 1 235454433 235 Q ss_pred ccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHH Q lcl|NC_019404. 160 DVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGA 237 (418) Q Consensus 160 ~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~ 237 (418) .++++.|+||.+. .+....+|.|++.. +...+..-..+.......+...... ++.+++ ..+ +.+..+. T Consensus 150 ~f~~~dIiHir~~------~~~~~~~Gls~~~~-a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~--~~l-s~e~~~~ 219 (348) T protein:vir:26 150 VFKAKDVIFIPQY------DPQQQIYGLPDYLG-SIQSSLLNRDATLFRRRYYLNGAHMGFIFYATD--PNL-SEADEKA 219 (348) T ss_pred EEcCccEEEEcCC------CCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEecC--CCC-CHHHHHH Confidence 6889999999642 12345679999975 4455555444444444444333222 233332 111 2334445 Q ss_pred HHHHHHHHHHhcCCcceeEEE---cCCCceeEeecccCC----HHHHHHHHHHHHhhhhcCCeeeeeccC---ccccccc Q lcl|NC_019404. 238 ARLRLAQVDNNSGVGQAIGID---AESEEYSVLNSDIGG----IDAFLDKKFDRIVALSGIHEIILKNKN---VGGLSSS 307 (418) Q Consensus 238 ~~~r~~~~~~~~~~~~~~~~d---~~~e~~~~~~~~~~g----l~~~~~~~~~~iaaas~IP~t~L~G~s---~~gl~st 307 (418) +++.++.. ...++.+.+++. ++.+.++......+. .-++.+...+.||++.+||..+ .|.. .++++ + T Consensus 220 lk~~~~~~-~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~l-lGi~~~~~~~~s-n 296 (348) T protein:vir:26 220 LKEKIASS-KGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGM-GGMLPQQGANVP-D 296 (348) T ss_pred HHHHHHHh-cCcccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHH-ccccCCCCCccc-c Confidence 55555442 223344445554 112334444443332 3445567778899999999864 4654 33443 3 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----ccCCceE--EeCCCCCCCHHHHHHH Q lcl|NC_019404. 308 QNTALETFHKLIDRKRNAELLPILEFLIPFIV----NAEEWSV--EFSPLDHESSKDKAEV 362 (418) Q Consensus 308 ge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~----~~~~~~~--~f~pL~~~~eke~ae~ 362 (418) -+...+.|+. +.|.|+++.+-..|- ..+++.+ +|+|..+-++. +.+ T Consensus 297 ~e~~~~~f~~-------~~l~P~~~~ie~~ln~~l~~~~~~~~~fdl~~~~e~~~~--~a~ 348 (348) T protein:vir:26 297 PLKVSQVYDF-------YEVIPVCKRFMDAVNNDPEIPDNLKLKFNLNPGVESANG--SAV 348 (348) T ss_pred HHHHHHHHHH-------HHHHHHHHHHHHHHhhhhCCCCccEEEEecCcccccchh--hcC Confidence 4555666664 347787777655432 2344444 44553322222 222 No 182 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.07 E-value=3.3e-10 Score=72.55 Aligned_cols=390 Identities=13% Similarity=0.096 Sum_probs=193.4 Q ss_pred CccchhhHHHH------hcCCC-CccccCcc-ccCCHH------HHHHHHHc---------------------CCccchh Q lcl|NC_019404. 1 MVKTDSYANIF------LGGSD-GSEIYGSL-QNQAPT------ILASLYAD---------------------NALVRRI 45 (418) Q Consensus 1 ~~~~D~~~n~~------~g~~~-~~~~~~~~-~~~~~~------~l~~~Y~~---------------------~~~~r~i 45 (418) |==-+.+.|+| +|... -....... -..++. ....+|+. -++++.| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i 80 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTA 80 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHH Confidence 22234445554 22110 00000000 011221 22234432 2788999 Q ss_pred hhcchhhhccCCccccC-cchH--HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCc-c-------ccccc-C Q lcl|NC_019404. 46 IDTIPETALAAGFHIDG-IDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRA-L-------TSPVR-E 113 (418) Q Consensus 46 Vd~~a~d~~r~~~~i~~-~~d~--~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~-l-------~~pl~-~ 113 (418) |+..|+-++.+..+++- +++. +.+.+.+++-+++..+.+++..+..+|++++-+.++.+.. + --|+. . T Consensus 81 ~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~~v~ad~~~P~~~d 160 (508) T protein:vir:15 81 ARRIASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDGNHIKIAWVRADQFYPLQSN 160 (508) T ss_pred HHHHHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeCCeeEEEEEcCCeeEEEEEc Confidence 99999999999877763 2222 2456677777899999999999999999998777654321 1 11321 2 Q ss_pred CCceEEEEEeecccccc---------ccccccccccccCcceEEEEecCCc-ccccccCccc---------EEEecCccc Q lcl|NC_019404. 114 GAELETVRVYDRTQVKV---------QNREENPRNARFGKPLTYRITTNES-DMFYDVHYSR---------IHIIDGERV 174 (418) Q Consensus 114 ~~~i~~i~v~~~~~i~~---------~~~~~dp~s~~yg~p~~y~i~~~~~-~~~~~iH~SR---------~i~~~g~~l 174 (418) .+.+..+.++.+..... .++.+.- ...-|..++..+...+. ..+..|.-+. .+.+.|-+- T Consensus 161 ~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~-~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~ 239 (508) T protein:vir:15 161 TNDISEAAIASRTQRTESNQTKYYTLLEFHQWQ-DNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQR 239 (508) T ss_pred CCCeEEEEEEEEEEeecCCCceEEEEEEEEEEe-cCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCCCc Confidence 23343332222211110 0000000 00012222222222110 0011111000 122222221 Q ss_pred hh--h-------hhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhc-CcchHHHHHHHHHH Q lcl|NC_019404. 175 PN--A-------MRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCD-DSEGFGAARLRLAQ 244 (418) Q Consensus 175 p~--~-------~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~-~~~~~~~~~~r~~~ 244 (418) |. + -.....++|.|++.. +.+.++.++.+......-++.....++--. .++. ++++.. .+ T Consensus 240 p~f~y~~~~~~N~~~~~splG~S~~~~-~~~lid~lD~~~s~~~~e~~~~~~~i~v~~---~~l~~d~~~~~----~~-- 309 (508) T protein:vir:15 240 PLFAYFKTPGANNINIESPLGLGVVDN-AKHVLDDINDTHDQFIWEIRLGQKHIAVQP---GMLRFDDEHKP----TF-- 309 (508) T ss_pred ceeEEecCCccccccCCCCcCCchHhh-hHHHHHHHHHHHHHHHHHHHhcccceeech---HHhcCCCCCcc----cc-- Confidence 11 1 112245789999975 789999999998888877754444443322 2222 121110 00 Q ss_pred HHHhcCCcceeEEEcC-CCceeEeeccc--CCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHH---HHHHHHH Q lcl|NC_019404. 245 VDNNSGVGQAIGIDAE-SEEYSVLNSDI--GGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTA---LETFHKL 318 (418) Q Consensus 245 ~~~~~~~~~~~~~d~~-~e~~~~~~~~~--~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d---~~~y~~~ 318 (418) .........+-.+.+ +..++.++.++ ......++.+.+.+...+|++..- ||...+|.. |+.+. .+.-|.+ T Consensus 310 -~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~-f~~~~~~~~-TAtei~s~~~~~~~t 386 (508) T protein:vir:15 310 -DTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGT-FSYSNDGVK-TATEVVSNNSMTYQT 386 (508) T ss_pred -CCCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchh-cccccCccc-cHHHHHHHHHHHHHH Confidence 001111111112222 23476666654 346778888888899999999754 465666653 45443 4556777 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcc----------------------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019404. 319 IDRKRNAELLPILEFLIPFIVNA----------------------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAA 376 (418) Q Consensus 319 I~~~Qe~~l~p~l~~l~~~i~~~----------------------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~ 376 (418) +.++|. .++..|+.++..|++. .+++|.|++-..+|..++ ++.+.+++.+ T Consensus 387 ~~~~~~-~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~-------~~~~~~~v~a 458 (508) T protein:vir:15 387 RSSYLT-MVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQ-------LEEDAKVLAI 458 (508) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHH-------HHHHHHHHhc Confidence 777764 5788888887776531 146789999888887654 5566677888 Q ss_pred CCCCHHHHHHHHHhhcCcCCCChhh----c---ccccccCCCcc--ccccC Q lcl|NC_019404. 377 GAMDIKEARDTLRTIAPEIKIGDND----I---QTEESELITET--EVVIA 418 (418) Q Consensus 377 g~i~~~e~r~~l~~~~~~~~~~~~~----~---~~~e~~~~~e~--e~~~~ 418 (418) |+++.++++..+ .+.++++ + .++......+. -...+ T Consensus 459 Gi~s~e~~i~~~------~g~~deea~~el~ri~~E~~~~~~~~~~~~~~~ 503 (508) T protein:vir:15 459 GALSKQTFLQRN------YGMTDEQAAEELAKIQSEAPTDTFEGGRSAILN 503 (508) T ss_pred CCCCHHHHHHhc------CCCChHHHHHHHHHHHHhccccCccccccccCC Confidence 888887765321 1222221 1 11111000000 00111 No 183 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=99.03 E-value=1.9e-10 Score=73.91 Aligned_cols=326 Identities=10% Similarity=-0.015 Sum_probs=150.2 Q ss_pred CccchhhHHHHhcCCCC-ccccCccccC-CHH---HHHHHHHcCCccchhhhcch-hhhccCCccccCcchH-HHHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDG-SEIYGSLQNQ-APT---ILASLYADNALVRRIIDTIP-ETALAAGFHIDGIDDE-PAFWSRW 73 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~-~~~~~~~~~~-~~~---~l~~~Y~~~~~~r~iVd~~a-~d~~r~~~~i~~~~d~-~~i~~~~ 73 (418) ..+.-.-.+.....+.. ...+|.+... +.. ++...+....+++.-|+.-+ ...++......+.--. .-+..-+ T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~~~~~~~h~~~~~~~~n~l~l~ 99 (368) T protein:vir:79 20 ANTDAPTEHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLARSFRAAAHHSSAVYVKRNILVST 99 (368) T ss_pred ccccCcchhhccccCceEEEEcCCceeecchhhHHHHHHHHhccchhccCcCHHHHHHHHhhccccchhhhhhcchhhhh Confidence 11111101110111100 1112222211 111 11112221112222221100 0011110000000000 0000000 Q ss_pred H---HhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEE Q lcl|NC_019404. 74 D---DLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRI 150 (418) Q Consensus 74 ~---~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i 150 (418) . .+--+..|...+..-.++|.|++.+.-+ ..|.+..+.++++..+...... + .+|.+ T Consensus 100 ~~Pn~~~t~~~f~~l~~d~ll~Gnay~~~~r~----------~~G~~~~L~~l~~~~v~~~~~~--------~--~~~~~ 159 (368) T protein:vir:79 100 FIPHPLLSRATFERLVLDWQVFGNAYLERREN----------VLGGTIRLDTPLAKYVRRGLDL--------N--TYFFV 159 (368) T ss_pred cCCCcCCCHHHHHHHHHHHhhcCCeEEEEEEc----------CCCCEEEEEEeCcccceeeccC--------C--EEEEE Confidence 0 1111233544454556899999887542 3355778888888777543211 1 23334 Q ss_pred ecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHh Q lcl|NC_019404. 151 TTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAEL 228 (418) Q Consensus 151 ~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~ 228 (418) .+.+ ....+.+..|||+... .+....+|.|+++. +...+..-..+.......+...... ++++.+ .. T Consensus 160 ~~~~--~~~~~~~~dIihir~~------~~~~~~yGlsp~~~-a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~--~~ 228 (368) T protein:vir:79 160 QNWQ--QPYTFAAGSVFHLQEP------DINQEVYGLPEYLS-ALNATWLNESATLFRRRYYKNGSHAGFILYMTD--AA 228 (368) T ss_pred ecCC--eEEEEccccEEEecCC------CCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC--CC Confidence 3332 3356888999999532 12335689999975 5567776666666666665544322 233332 11 Q ss_pred hcCcchHHHHHHHHHHHHHhcCCcceeEEEcC---CCceeEeecccCC----HHHHHHHHHHHHhhhhcCCeeeeeccCc Q lcl|NC_019404. 229 CDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE---SEEYSVLNSDIGG----IDAFLDKKFDRIVALSGIHEIILKNKNV 301 (418) Q Consensus 229 ~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~---~e~~~~~~~~~~g----l~~~~~~~~~~iaaas~IP~t~L~G~s~ 301 (418) + +.+..+.+++.++. ...-++.+.+++... .+.++.+.++.+. .-++.+...+.||++.+||.. |+|..+ T Consensus 229 l-~~e~~~~lk~~~~~-~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~-llGi~~ 305 (368) T protein:vir:79 229 Q-KQEDVDTLREAMKS-AKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQ-LMGIIP 305 (368) T ss_pred C-CHHHHHHHHHHHHH-hcCCcccCceeEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHH-HccccC Confidence 2 23334455555543 222334445555421 2334444444432 344567778999999999985 447654 Q ss_pred ccc--ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCceEEeCC--CCCCCHHHHHHHHHHHH Q lcl|NC_019404. 302 GGL--SSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNAEEWSVEFSP--LDHESSKDKAEVLEKSV 367 (418) Q Consensus 302 ~gl--~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~~~~~~~f~p--L~~~~eke~ae~~~~~a 367 (418) ++- .++-+...+.|+. +.|.|+++++-.+.-+-....+.|++ |...|.+.+|+...+-| T Consensus 306 ~~t~~~sn~e~~~~~f~~-------~~l~Pl~~~ie~ln~~l~~e~~rF~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 306 NNTGGFGDVEKAAMVFAR-------NEVKPLQDRLLAINDWIGDEVVRFAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred CCCCccccHHHHHHHHHH-------HHHHHHHHHHHHHHhccCcceeeechhHhhcccccccCCcccccC Confidence 321 1344555566654 35788877775443222233456664 66677777776555555 No 184 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=99.00 E-value=7.1e-10 Score=70.74 Aligned_cols=399 Identities=12% Similarity=0.080 Sum_probs=187.1 Q ss_pred CccchhhHHHHhcCC------CCccccC-ccccCCHH------HHHHHHHc---------------------CCccchhh Q lcl|NC_019404. 1 MVKTDSYANIFLGGS------DGSEIYG-SLQNQAPT------ILASLYAD---------------------NALVRRII 46 (418) Q Consensus 1 ~~~~D~~~n~~~g~~------~~~~~~~-~~~~~~~~------~l~~~Y~~---------------------~~~~r~iV 46 (418) |---+...|.+.-++ .-..... .....++. .-.++|+. -++++.|| T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTAS 80 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHH Confidence 555556656553111 0000000 01111111 11223322 28889999 Q ss_pred hcchhhhccCCccccCcchH--HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCc--------ccccccC-CC Q lcl|NC_019404. 47 DTIPETALAAGFHIDGIDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRA--------LTSPVRE-GA 115 (418) Q Consensus 47 d~~a~d~~r~~~~i~~~~d~--~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~--------l~~pl~~-~~ 115 (418) +..|+-++.+...|+.+++. +.+.+.++.-++...+.+++..+...|++++-+.++.+.. .--|+.- .+ T Consensus 81 ~~~A~lv~~e~~~i~v~d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~~~i~~v~ad~~~P~~~~~~ 160 (522) T protein:vir:47 81 KKIASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYIDGDKVRVAFIQAPVFFPLESNTQ 160 (522) T ss_pred HHHhhhhcCCcceeecCChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcCCceEEEEEcCCceEEEEEcCC Confidence 99999999998887654432 4566667777899999999999998888888776654321 1123311 11 Q ss_pred ceEEEEEeeccc---------ccccc---c-----cccccccc--cCcceEEEEecCCc-ccc-----------cccCcc Q lcl|NC_019404. 116 ELETVRVYDRTQ---------VKVQN---R-----EENPRNAR--FGKPLTYRITTNES-DMF-----------YDVHYS 164 (418) Q Consensus 116 ~i~~i~v~~~~~---------i~~~~---~-----~~dp~s~~--yg~p~~y~i~~~~~-~~~-----------~~iH~S 164 (418) .+....++.+.. ++... + ..+...-. .|..+...+.+... ..+ ..+++. T Consensus 161 ~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~ 240 (522) T protein:vir:47 161 DVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPV 240 (522) T ss_pred ceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccccccccccccCCCCc Confidence 111111111100 00000 0 00010000 11111111111110 001 112222 Q ss_pred cEEEecCccchh--h-------hhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchH Q lcl|NC_019404. 165 RIHIIDGERVPN--A-------MRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGF 235 (418) Q Consensus 165 R~i~~~g~~lp~--~-------~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~ 235 (418) - .+.|-+.|. + -.....++|.|++.. +.+.++..+.+......=+......++=-..+-.....+.+. T Consensus 241 ~--~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~-~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g 317 (522) T protein:vir:47 241 T--VFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQYQRPDG 317 (522) T ss_pred e--EeCCCCcceEEEecCCcccccccCCCcCCchhhh-hHHHHHHHHHHHHHHHHHHHhccceeecchHHhccCCCCCCc Confidence 1 122211111 1 122245789999974 779999888887666655544333333222111111111111 Q ss_pred H-HHHHHHHHHH-HhcCCcceeEEEcCCCceeEeeccc--CCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhH- Q lcl|NC_019404. 236 G-AARLRLAQVD-NNSGVGQAIGIDAESEEYSVLNSDI--GGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNT- 310 (418) Q Consensus 236 ~-~~~~r~~~~~-~~~~~~~~~~~d~~~e~~~~~~~~~--~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~- 310 (418) . .....+.... .++..+.. +.++..++..+..+ ......++.+...|+-.+|++...| |-..+|.. |..+ T Consensus 318 ~~~~~~~fd~~~~~f~~~~~~---~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf-~~~~~~~k-TAtEi 392 (522) T protein:vir:47 318 TIDFRPRFDVEQNVYMQIGGS---SMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMF-TFDGQGMK-TATEI 392 (522) T ss_pred ccccccccCcccceEeecCCC---CCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCcccc-Cccccccc-cHHHH Confidence 1 0010111000 01111111 11223466666554 3466677888888888888887654 55555542 3333 Q ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--------------cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 311 --ALETFHKLIDRKRNAELLPILEFLIPFIVN--------------AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALI 374 (418) Q Consensus 311 --d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~--------------~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~ 374 (418) ..+..|.+++++|. .++..|+.|+..|+. ..++++.|++-..+|..++ ++.+.+++ T Consensus 393 ~s~~~~~~~t~~~~~~-~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~-------~~~~~~~v 464 (522) T protein:vir:47 393 VSENSDTYQMRSSIVA-LVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAE-------LDYWAKMV 464 (522) T ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHH-------HHHHHHHH Confidence 34556778888775 478888888777752 1257889999888887654 44555667 Q ss_pred hCCCCCHHHHHH------------HHHhhcCcCC---CChhhcc--cccccCCCcccc Q lcl|NC_019404. 375 AAGAMDIKEARD------------TLRTIAPEIK---IGDNDIQ--TEESELITETEV 415 (418) Q Consensus 375 ~~g~i~~~e~r~------------~l~~~~~~~~---~~~~~~~--~~e~~~~~e~e~ 415 (418) .+|+++.++++. .|.....+.. ....++. .++.+..+.+|+ T Consensus 465 ~aG~~s~e~~i~~~~g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 465 AAGFSTKKRAIGKTLNISGVEAEKELNAINSELLPMNDAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred hcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhhccCCCCCCCCCCCCCcccccCCCCC Confidence 777777666543 2221111100 0000110 011122222333 No 185 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=98.95 E-value=2e-09 Score=68.23 Aligned_cols=296 Identities=10% Similarity=0.032 Sum_probs=155.8 Q ss_pred Cc-cchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCch Q lcl|NC_019404. 1 MV-KTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMT 79 (418) Q Consensus 1 ~~-~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~ 79 (418) |+ +.|-+.. ++....+.+|..|. ++.-|..+++.|+....+|..-+....+ ++. .....+ + T Consensus 34 v~~~~~~~~~--~~~~~~~~~~~pp~--~~~~la~~~~a~~~h~~~i~~k~n~l~~-~~~--Pn~~lt-----------~ 95 (344) T protein:vir:20 34 VLDRRDILDY--VECISNGRWYEPPV--SFTGLAKSLRAAVHHSSPIYVKRNILAS-TFI--PHPWLS-----------Q 95 (344) T ss_pred ecCcchhhhh--hhhhhcCceecCCC--CHHHHHHHHhhhhhhCccceehhhhHHH-hcc--CCCCCC-----------H Confidence 22 2221111 11111123333333 4556777777777666666544333221 111 000011 1 Q ss_pred HHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccccc Q lcl|NC_019404. 80 QNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFY 159 (418) Q Consensus 80 ~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~ 159 (418) ..|+.++..-.++|.|++.+.- + ..|.+..+.++++..+...... + .+|++...+ ... T Consensus 96 ~~f~~~~~d~ll~Gnay~~i~r-n---------~~G~~~~L~pl~~~~vr~~~~~------~----~~~~~~~~~--~~~ 153 (344) T protein:vir:20 96 QDFSRFVLDFLVFGNAFLEKRY-S---------TTGKVIRLETSPAKYTRRGVEE------D----VYWWVPSFN--EPT 153 (344) T ss_pred HHHHHHHHHHHhcCCeEEEEEE-C---------CCCcEEEEEEcCCceeEeeecC------C----EEEEEccCC--eEE Confidence 1233333334578999998743 2 3356778888887776653211 0 234454432 335 Q ss_pred ccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHH Q lcl|NC_019404. 160 DVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGA 237 (418) Q Consensus 160 ~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~ 237 (418) .+.+..|||+.... +....+|.|++.. +...+..-..+.......+...+.. ++++++ +. + +.+..+. T Consensus 154 ~~~~~eIiHir~~~------~~~~~yGls~~~~-a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d-~~-l-~~e~~~~ 223 (344) T protein:vir:20 154 AFAPGSVFHLLEPD------INQELYGLPEYLS-ALNSAWLNESATLFRRKYYENGAHAGYIMYVTD-AV-Q-DRNDIEM 223 (344) T ss_pred EEcCccEEEeCCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-cC-C-CHHHHHH Confidence 67888999996321 2345689999975 5566666555555555555443322 344432 11 1 2334455 Q ss_pred HHHHHHHHHHhcCCcceeEEEcCC---CceeEeecccCC----HHHHHHHHHHHHhhhhcCCeeeeeccCc---cccccc Q lcl|NC_019404. 238 ARLRLAQVDNNSGVGQAIGIDAES---EEYSVLNSDIGG----IDAFLDKKFDRIVALSGIHEIILKNKNV---GGLSSS 307 (418) Q Consensus 238 ~~~r~~~~~~~~~~~~~~~~d~~~---e~~~~~~~~~~g----l~~~~~~~~~~iaaas~IP~t~L~G~s~---~gl~st 307 (418) ++++++.. ...++...+++..++ +.++...++.+. .-++.....+.||++.+||..+| |..+ +|++ + T Consensus 224 ik~~~~~~-~g~~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~ll-Gi~~~~t~~~~-n 300 (344) T protein:vir:20 224 LRENMVKS-KGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLM-GGKPENVGSLG-D 300 (344) T ss_pred HHHHHHHh-cCCCCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHh-ccCCCCCCccc-c Confidence 55556442 234555667765332 334444444433 34456678899999999999755 7543 3343 3 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--ccCCceEEeCCCCCCCH Q lcl|NC_019404. 308 QNTALETFHKLIDRKRNAELLPILEFLIPFIV--NAEEWSVEFSPLDHESS 356 (418) Q Consensus 308 ge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~--~~~~~~~~f~pL~~~~e 356 (418) -+...+.|+. +.|.|+++++-.+.- ..+.|.|.++.|..-+| T Consensus 301 ~e~~~~~f~~-------~~l~P~~~~~e~in~~lg~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 301 IEKVAKVFVR-------NELIPLQDRIREINGWLGQEVIRFKNYSLDTDND 344 (344) T ss_pred HHHHHHHHHH-------HHHHHHHHHHHHHHHhcCCcccccCccccccCCC Confidence 4555555543 457887776654432 23557788788776666 No 186 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=98.92 E-value=9.5e-10 Score=70.05 Aligned_cols=384 Identities=13% Similarity=0.116 Sum_probs=183.4 Q ss_pred CccchhhHHH-HhcCCC---------------CccccCccccCCHHHHHHHHH----cCCccchhhhcchhhhccC---- Q lcl|NC_019404. 1 MVKTDSYANI-FLGGSD---------------GSEIYGSLQNQAPTILASLYA----DNALVRRIIDTIPETALAA---- 56 (418) Q Consensus 1 ~~~~D~~~n~-~~g~~~---------------~~~~~~~~~~~~~~~l~~~Y~----~~~~~r~iVd~~a~d~~r~---- 56 (418) |-.+.|+.-- ..+|++ .+..+| ...++-.+|-..|+ +|+.+..+|+.++++|+.. T Consensus 20 ~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~g-g~~~n~~eLI~~YR~ma~~~pEVd~AideIvneaiv~d~~~ 98 (533) T protein:vir:58 20 LSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYG-GIEFNRFFLYDMYDRMDYTDPLISTVLDIIADECTIPNENG 98 (533) T ss_pred hchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhc-cccccHHHHHHHHHHhhccCcchhhHHHhhhceeeEecCCC Confidence 1111111000 000100 011111 12234455666654 5799999999999999763 Q ss_pred -CccccCcc--hHHHHHHHHHH-hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccc Q lcl|NC_019404. 57 -GFHIDGID--DEPAFWSRWDD-LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQN 132 (418) Q Consensus 57 -~~~i~~~~--d~~~i~~~~~~-l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~ 132 (418) .+++.-++ -.+++.+.+.+ |++..+-.+.+|.--++|..+.-..++ +++..|+.++.+||..+.... T Consensus 99 ~pV~v~l~~~e~s~~iK~kI~~lldf~~~~~~~fR~WYVDGriy~Hkiik---------~~k~GI~elr~lDPr~i~~vr 169 (533) T protein:vir:58 99 NIVDVVTKDIELAKAILSYLDYVINIEKNAYPIIRNMIKYGDMFLHILEK---------GSDGTIEKFQVVSPYIFSKRY 169 (533) T ss_pred ceeEeecccccccHHHHHHHHHHhcchhhhhHHHHhhhhcceeEEEeccC---------CcccchhhheecCCeeeEEEE Confidence 23332211 11344444443 567766666677666666665544332 145778888999998886643 Q ss_pred c-cccccccccCcceEEEEec-----CCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHH--HHHHHHHHHH Q lcl|NC_019404. 133 R-EENPRNARFGKPLTYRITT-----NESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDI--LDSIKDYTNC 204 (418) Q Consensus 133 ~-~~dp~s~~yg~p~~y~i~~-----~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~--~~~l~~~~~~ 204 (418) . .++ .++|-+++ .......+|+++.|+++..-- ......++.|-|.+++ ++.|+-.+. T Consensus 170 ~~~t~--------~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl-----~d~~~~~iisyLhkAiKp~NQLkmiED- 235 (533) T protein:vir:58 170 NPETD--------TWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKI-----DTNFFPYGRSYLESARAIWNQLRLMED- 235 (533) T ss_pred eeccc--------eEEEeecccccccccCccccccchhheeeeeecc-----ccCCCCceehhhhHHHHHHHHHHHHHH- Confidence 2 122 13333331 122233679999998885331 2223345666665432 445554433 Q ss_pred HHHHHHHHHHcCCc----eeecchHHHhhcC--cchHHHHHHHHHHHHHhcCCcceeEEEcC------------------ Q lcl|NC_019404. 205 ERLATQLLRRKQQA----VWKAKGLAELCDD--SEGFGAARLRLAQVDNNSGVGQAIGIDAE------------------ 260 (418) Q Consensus 205 ~~~~~~l~~~~~~~----v~k~~~l~~~~~~--~~~~~~~~~r~~~~~~~~~~~~~~~~d~~------------------ 260 (418) |-++++.+-. |+-++ ..++-.. .+-...++-++....-+...++.+.-|.. T Consensus 236 ----AlVIYRisRAPeRRvFYID-VGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReG 310 (533) T protein:vir:58 236 ----ALMLYRVVRSVDRRVFYVD-VGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGD 310 (533) T ss_pred ----HHHHHhhcCChhheEEEEe-ecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCC Confidence 3445554432 44333 2232211 11111111122111111222222211111 Q ss_pred --CCceeEeec-ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHH--H Q lcl|NC_019404. 261 --SEEYSVLNS-DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFL--I 335 (418) Q Consensus 261 --~e~~~~~~~-~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l--~ 335 (418) +-+++.+.- +++.++| +..|...+=.|.+||.++|-.++..|-++.=.-|.-.|..+|.+.|..+ .+++++- + T Consensus 311 grgTEI~TLpGg~lgemeD-V~YF~kkLy~ALnVP~sRl~~e~~fgr~~eItRDEiKF~KFI~rLR~rF-~~ll~~qLil 388 (533) T protein:vir:58 311 RRAVEIDILQGSKVDLAED-VEYMLNRLISALKVPKAFIGYEGDVNAKNTLATQDIKFNNTIKRIQGFF-VEELERMVRM 388 (533) T ss_pred CccceeeecCCCCCCcHHH-HHHHHHHHHHHhCCCeeecCCCCCCccchhhhHHHHHHHHHHHHHHHHH-HHHHhccccc Confidence 112333322 3334443 6789999999999999999655544332221114445999999999764 4444431 2 Q ss_pred HHhhccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhh-cccccc----cCC Q lcl|NC_019404. 336 PFIVNAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDND-IQTEES----ELI 410 (418) Q Consensus 336 ~~i~~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~-~~~~e~----~~~ 410 (418) +-++..++|+|+|..=..-+|...+|+...+..+++.+- +.+.-+=++...-...++ ....++ |+++-. +.. T Consensus 389 k~iit~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~d--pyvgk~yi~k~ILr~tde-i~~q~e~ie~E~~~~~~~~~ 465 (533) T protein:vir:58 389 NKEFADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERLK--GWVREDWIYSNILQIPYD-LKPQEEVAEAAGGGGLFDTG 465 (533) T ss_pred ccCcchhheeeeeeccchHHHHHHHHHHHHHHHHHHHhc--chhhHHHHHHHHhcCChh-hhHHHHHHHHhhcCCCCCCC Confidence 334556899999998777778888888887777766642 233333333221111110 000000 110000 000 Q ss_pred CccccccC Q lcl|NC_019404. 411 TETEVVIA 418 (418) Q Consensus 411 ~e~e~~~~ 418 (418) .+++.|=. T Consensus 466 ~~~~e~~~ 473 (533) T protein:vir:58 466 GFGEETTP 473 (533) T ss_pred CcccccCC Confidence 01111111 No 187 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=98.90 E-value=6.7e-09 Score=65.39 Aligned_cols=300 Identities=10% Similarity=0.070 Sum_probs=148.2 Q ss_pred Cccchhh--HHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCc Q lcl|NC_019404. 1 MVKTDSY--ANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEM 78 (418) Q Consensus 1 ~~~~D~~--~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~ 78 (418) .+-++.+ .+.+. ...++.|-.| .++.-|..+++.|+....++..-.....+ .+ ...... - T Consensus 28 ~~~~~~~~y~~~~~--~~~~~~~epp--~~~~~la~l~~~~~~h~~~i~~k~n~l~~-~~--~Pn~~l-----------t 89 (345) T protein:vir:37 28 ISASPALDYVGIGF--DENYNCYLPP--VNRHALAKLPHQNAQHGGILHSRANMVSS-LY--EGGKAL-----------S 89 (345) T ss_pred cccccchhhhhhhh--cCCccccCCC--CCHHHHHHHhhcccccccceeeechHHHh-hc--cCCCCC-----------C Confidence 1111111 11000 0111122222 13455666666666665555433322211 11 100101 1 Q ss_pred hHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcccc Q lcl|NC_019404. 79 TQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMF 158 (418) Q Consensus 79 ~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~ 158 (418) +..|..++..-.+||.|++++.-+ ..|.+..+.++++..+.... |. ..|+....+.+... +.. T Consensus 90 ~~~f~~~~~d~ll~Gnay~~~~rn----------~~G~~~~L~pl~~~~vr~~~---d~--~~~~~~~~~~~~~~--g~~ 152 (345) T protein:vir:37 90 RMDMRALCLNLIQFGDVGLLKVRN----------GFGQVVRLVPLSSLYLRVRK---DG--GYSYLMKKSLYDTA--QEI 152 (345) T ss_pred HHHHHHHHHHHHhcCCeEEEEEEc----------CCCcEEEEEEEcCceeEEEE---eC--CeeEEEEEeEecCC--ceE Confidence 122333444445889999887532 33567788888877765432 11 12333333333222 233 Q ss_pred cccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHH Q lcl|NC_019404. 159 YDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFG 236 (418) Q Consensus 159 ~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~ 236 (418) ..+.+..|||+.+.. +....+|.|+++. +...+..-..+.......+...... ++++++ ..+ +.+..+ T Consensus 153 ~~~~~~dVihir~~~------~~~~~~Gls~~~~-a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d--~~l-~~e~~~ 222 (345) T protein:vir:37 153 YRYDAKDIIFIKLYD------PMQQVYGSPDYVG-GIQSALLNSDATVFRRRYFSNGAHMGFILYSTD--PDL-TEEMEE 222 (345) T ss_pred EEEccccEEEecCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCcceEEEecC--CCC-CHHHHH Confidence 567888999995421 2345689999976 5566666656655555555443322 334332 111 233444 Q ss_pred HHHHHHHHHHHhcCCcceeEEEc---CCCceeEeecccCC----HHHHHHHHHHHHhhhhcCCeeeeeccCcc---cccc Q lcl|NC_019404. 237 AARLRLAQVDNNSGVGQAIGIDA---ESEEYSVLNSDIGG----IDAFLDKKFDRIVALSGIHEIILKNKNVG---GLSS 306 (418) Q Consensus 237 ~~~~r~~~~~~~~~~~~~~~~d~---~~e~~~~~~~~~~g----l~~~~~~~~~~iaaas~IP~t~L~G~s~~---gl~s 306 (418) .++++++.. ...++.+.+++.. ..+.++....+.+. .-++.....+.||++.+||..+ +|..+. |++ T Consensus 223 ~lk~~~~~~-~g~~n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~l-lGi~~~~~~~~~- 299 (345) T protein:vir:37 223 EIARKISES-KGVGNFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGL-SGIIPTNTGGLG- 299 (345) T ss_pred HHHHHHHHh-cCcccccceEEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHH-hCccCCCCCCcc- Confidence 556665542 3344555566542 12334433333332 3345567888999999999864 576543 332 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----ccCCceEEeCCCCCCCH Q lcl|NC_019404. 307 SQNTALETFHKLIDRKRNAELLPILEFLIPFIV----NAEEWSVEFSPLDHESS 356 (418) Q Consensus 307 tge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~----~~~~~~~~f~pL~~~~e 356 (418) +-|...+.|+. +.|.|+++++-..+- ...+..+.|++- ++++ T Consensus 300 ~~e~~~~~f~~-------~~l~P~~~~ie~~ln~~~~~~~~~~i~F~~~-~L~~ 345 (345) T protein:vir:37 300 DPLKYREVYHY-------DEVMPLQEIIAETINQDPEIKNLLKIKFREQ-NFAK 345 (345) T ss_pred cHHHHHHHHHH-------HHHHHHHHHHHHHhhhhccCCCcceEEecch-hhcC Confidence 33445555553 457888777766652 234667788741 1111 No 188 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=98.89 E-value=2e-08 Score=62.84 Aligned_cols=357 Identities=13% Similarity=0.051 Sum_probs=174.4 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHH---------------HHHHHH----HcCCccchhhhcchhhhccCCcccc Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPT---------------ILASLY----ADNALVRRIIDTIPETALAAGFHID 61 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~---------------~l~~~Y----~~~~~~r~iVd~~a~d~~r~~~~i~ 61 (418) -.+..+....+.+ ...+.+||. ....+| .+.+-+..++.+.....+...|.|+ T Consensus 22 ~~~~~~~~~~~~~--------~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~l~~Rk~av~~~~w~I~ 93 (528) T protein:vir:10 22 TAHLAGLAKEFAN--------HPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFAEMSKRKRAVLGLDWTIE 93 (528) T ss_pred hhhhhhhhhhhcc--------cCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhcCCceEe Confidence 1111111111111 111122332 222233 3577788888888888888888886 Q ss_pred Cc--c---hH---HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccccCCCceEEEEEeecccccccc Q lcl|NC_019404. 62 GI--D---DE---PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRTQVKVQN 132 (418) Q Consensus 62 ~~--~---d~---~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~ 132 (418) .. + ++ +.+++.+.++.-+..+..-+-.+.+||+|++=+.-+ ++. .-.++.+...++.++.... T Consensus 94 p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~~lda~~~G~s~~Ei~w~~~~g--------~~~~~~~~~r~~~~f~~~~ 165 (528) T protein:vir:10 94 PPRNASAAEKADAEYLHELLLDLEGIEDLMLDCMDGVGHGYSAIELDWSLQGR--------EWLPQAFDHRPQSWFQLNP 165 (528) T ss_pred cCCCCCHHHHHHHHHHHHHHhCCccHHHHHHHHHhhhhhcceeEEEEEeecCC--------ceeEEEeeeecccceeecc Confidence 32 1 11 224555556654556666667799999999865432 211 1224455555554433211 Q ss_pred ccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 133 REENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLL 212 (418) Q Consensus 133 ~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~ 212 (418) .+.. .+.+..+ ...+..++|.+.+++.+.. ...+.+|.+.+ +.||....--..+...-+..+ T Consensus 166 ---------~~~~-~l~~~~~-~~~g~~l~~~k~iv~~~~~------~~g~p~g~gLl-r~~~w~~~fK~~~~~~w~~f~ 227 (528) T protein:vir:10 166 ---------DDQD-ELRLRDN-SIAGEVLQPFGWIMHKPRS------RSGYVARSGLF-RVLAWPYLFKHYSTADLAEML 227 (528) T ss_pred ---------CCCc-EEeccCC-CCCceeecCCCeEEEeecC------CCCCccccchH-HHHHHHHHHHHhhHHHHHHHH Confidence 1111 1222221 1123457777766665432 34566788866 567776665566666677788 Q ss_pred HHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCH---HHHHHHHHHHHhh Q lcl|NC_019404. 213 RRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI---DAFLDKKFDRIVA 287 (418) Q Consensus 213 ~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl---~~~~~~~~~~iaa 287 (418) .+++++ +.|++.. ..+. -+.++.......+...+++ .-.+.+++.++.+-++. ..+++.+-..||. T Consensus 228 E~yG~P~~igky~~~-----a~~~---ek~~L~~al~~i~~~~~~i-iP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk 298 (528) T protein:vir:10 228 EIYGLPIRLGKYPPG-----TPDE---EKVTLLRAVTGLGHAAAGI-IPESMSIDFQEASKGSAEPFMAMMRWCDDSMSK 298 (528) T ss_pred HHcCCCeEEEecCCC-----CCHH---HHHHHHHHHHHHhhCcEEE-ecCCceeEEeecCCCChhHHHHHHHHHHHHHHH Confidence 888865 5565521 1111 1222333233334344444 44568899988764443 3456666666664 Q ss_pred hhcCCeeeeeccCc--------cccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhhccC-----C----ceEEeC Q lcl|NC_019404. 288 LSGIHEIILKNKNV--------GGLSSSQNTALETFHKLIDRKRNAELLPILE-FLIPFIVNAE-----E----WSVEFS 349 (418) Q Consensus 288 as~IP~t~L~G~s~--------~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~-~l~~~i~~~~-----~----~~~~f~ 349 (418) + ++||+- +|-+|-|+--.....+.+++-.+. +...++ .|+.-++.-. + -+|.|. T Consensus 299 ~-------iLGqtlTs~~~~g~~gS~Alg~vh~~v~~di~~aDa~~-i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~ 370 (528) T protein:vir:10 299 A-------ILGGTLTSQTSESGGGAYALGQVHNEVRHDLLAADARQ-LAATLSRDLLWPLLVLNRSGNLDARRAPRLVFD 370 (528) T ss_pred H-------HhhhhhhccccccccchhhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhCCCCCCCccccceEEec Confidence 3 234432 122222333334555555555543 444443 4655554211 0 133444 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhCCC-CCHHHHHHHHHhhcCcCCCChhhcccccccC-----CCcccccc------ Q lcl|NC_019404. 350 PLDHESSKDKAEVLEKSVNSIAALIAAGA-MDIKEARDTLRTIAPEIKIGDNDIQTEESEL-----ITETEVVI------ 417 (418) Q Consensus 350 pL~~~~eke~ae~~~~~a~a~~~~~~~g~-i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~-----~~e~e~~~------ 417 (418) .- |..|+ +..|++++++++.|+ |+.+.+++.+.--.+. ..+++...+... .......- T Consensus 371 ~~------e~eDl-~~~a~~~~~L~~~G~~i~~~~i~e~~gip~p~---~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (528) T protein:vir:10 371 LK------DRADL-AAMATSLPPLVKLGVQVPVNWVQEQLGIPLPA---NGEAVLGDQAGAGIAQLSRRPGPRIAALAQV 440 (528) T ss_pred CC------CcccH-HHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCC---CCcccccCCCcccccccCccccccccccccc Confidence 32 22232 457899999999998 9999999877421110 111111000000 00000000 Q ss_pred ---------------C Q lcl|NC_019404. 418 ---------------A 418 (418) Q Consensus 418 ---------------~ 418 (418) + T Consensus 441 ~~~~~~~~~~~d~~~~ 456 (528) T protein:vir:10 441 IGPRYRDQEALDQVLA 456 (528) T ss_pred ccccccccchHHHHHH Confidence 0 No 189 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=98.89 E-value=4.1e-09 Score=66.57 Aligned_cols=296 Identities=10% Similarity=0.015 Sum_probs=150.7 Q ss_pred Cc-cchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCch Q lcl|NC_019404. 1 MV-KTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMT 79 (418) Q Consensus 1 ~~-~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~ 79 (418) |+ +.|-+.. ++.-..+..+..|. ++.-|..+++.|+....+|..-+....+ ++. -.... -+ T Consensus 34 v~~~~~~~~~--~~~~~~~~~~~pp~--~~~~la~~~~a~~~h~~~i~~k~n~l~~-~~~--Pn~~~-----------t~ 95 (344) T protein:vir:60 34 VLDRRDILDY--VECISNGRWYEPPI--SFTGLAKSLRAAVHHSSPIYVKRNILAS-TFI--PHPWL-----------SQ 95 (344) T ss_pred ecCCcchhHH--HHhhhcCccccCCC--CHHHHHHHHHhhhhhccchhhhhhHHHh-hcc--CCCCC-----------CH Confidence 11 1111111 11111122232222 3445556666655555554433332211 111 00001 11 Q ss_pred HHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccccc Q lcl|NC_019404. 80 QNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFY 159 (418) Q Consensus 80 ~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~ 159 (418) ..|+..+..-.++|.|++.+.- + ..|.+..|.++++..++..... + .+|++...+ ... T Consensus 96 ~~f~~~~~d~ll~Gnay~~i~r-n---------~~G~~~~L~~l~~~~vr~~~~~------~----~~~~v~~~~--~~~ 153 (344) T protein:vir:60 96 QDFSRFVLDFLVFGNAFLEKRY-S---------TTGKVIRLETSPAKYTRRGVEE------D----VYWWVPSFN--EPT 153 (344) T ss_pred HHHHHHHHHHHhcCCeEEEEEE-C---------CCCcEEEEEEcCcceEEEeecC------C----eEEEEccCC--eEE Confidence 2243333344578999988753 2 2355677888877766553211 1 235554433 235 Q ss_pred ccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC--ceeecchHHHhhcCcchHHH Q lcl|NC_019404. 160 DVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ--AVWKAKGLAELCDDSEGFGA 237 (418) Q Consensus 160 ~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~--~v~k~~~l~~~~~~~~~~~~ 237 (418) .+.+..|||+... .+....+|.|++.. +...+..-..+.......+..... -++++++ .. + +.+..+. T Consensus 154 ~~~~~eIiHir~~------~~~~~~yGlsp~~~-a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~-~~-l-s~e~~~~ 223 (344) T protein:vir:60 154 AFAPGSVFHLLEP------DINQELYGLPEYLS-ALNSAWLNESATLFRRKYYENGAHAGYIMYVTD-AV-Q-DRNDIEM 223 (344) T ss_pred EEcCccEEEEcCC------CCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-cC-C-CHHHHHH Confidence 6788889999632 12345689999975 556676666665555555544433 2444432 11 2 2333445 Q ss_pred HHHHHHHHHHhcCCcceeEEEcCC---CceeEeecccCC----HHHHHHHHHHHHhhhhcCCeeeeeccCc---cccccc Q lcl|NC_019404. 238 ARLRLAQVDNNSGVGQAIGIDAES---EEYSVLNSDIGG----IDAFLDKKFDRIVALSGIHEIILKNKNV---GGLSSS 307 (418) Q Consensus 238 ~~~r~~~~~~~~~~~~~~~~d~~~---e~~~~~~~~~~g----l~~~~~~~~~~iaaas~IP~t~L~G~s~---~gl~st 307 (418) ++++++.. ...++...+++..++ +.++.+.++.+. .-++.+...+.||++.+||..+ +|..+ +|++ + T Consensus 224 ik~~~~~~-~g~~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~l-lGi~~~~t~~~~-n 300 (344) T protein:vir:60 224 LRENMVKS-KGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQL-MGGKPENVGSLG-D 300 (344) T ss_pred HHHHHHHh-cCCCCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHH-hcccCCCCCccc-c Confidence 55555442 234555667775322 334444444433 3455668889999999999874 46543 3343 3 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--cCCceEEeCCCCCCCH Q lcl|NC_019404. 308 QNTALETFHKLIDRKRNAELLPILEFLIPFIVN--AEEWSVEFSPLDHESS 356 (418) Q Consensus 308 ge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~--~~~~~~~f~pL~~~~e 356 (418) -+...+.|+. +.|.|+++++-.+.-+ .+.|.|....|..-+. T Consensus 301 ~e~~~~~f~~-------~~L~Pl~~~~e~ln~~lg~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 301 IEKVAKVFVR-------NELIPLQDRIREINGWLGQEVIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHH-------HHHHHHHHHHHHHHHhcCCcccccCccccCCCCC Confidence 4555556654 4578887776554322 2446666666665555 No 190 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=98.88 E-value=1.5e-08 Score=63.52 Aligned_cols=381 Identities=10% Similarity=0.034 Sum_probs=183.7 Q ss_pred CccchhhHHHHhcCCCC---ccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH--HHHHHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDG---SEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PAFWSRWDD 75 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~---~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~--~~i~~~~~~ 75 (418) +.++|++-= ..|--.. -+..|--....-..++.|.++.+-++.++.+-....++-.|.|+..+++ +.+.+.+.+ T Consensus 18 ~~~~~~~~~-~~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~w~V~p~~~~~a~~v~~~l~~ 96 (446) T protein:vir:98 18 IYAMEHLGL-ATSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKVGPYQHGDKRIKKFIDDQLRN 96 (446) T ss_pred hhccccchh-hcccCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCCceecCccHHHHHHHHHHHhh Confidence 333333211 1110000 0000100000012223444668888889999888888888999865543 246667777 Q ss_pred hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEe------ecccc-------cccccc---ccccc Q lcl|NC_019404. 76 LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVY------DRTQV-------KVQNRE---ENPRN 139 (418) Q Consensus 76 l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~------~~~~i-------~~~~~~---~dp~s 139 (418) +.+...+.. +-.+-.||.|+.=+.-+-......|..-...+..+++. +.+.. +...+. ..|.. T Consensus 97 ~~~~~~~~~-~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 175 (446) T protein:vir:98 97 RAKTWISHC-VKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPLQVMLIANDNGRIVDGDTVTASQYKSGYWVPLP 175 (446) T ss_pred cCchhHHHH-HHHHHhhCceeeeEEEeecccccccchhhccccccccccceeeeccCCccccccccchhhcccccccCcc Confidence 766444444 34556699999855443111111221110111122222 11111 111110 00111 Q ss_pred c-ccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcC-- Q lcl|NC_019404. 140 A-RFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQ-- 216 (418) Q Consensus 140 ~-~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~-- 216 (418) | .++.|..+. ...+....|.+.|++++... ..+.+++|.|.+ +.||-...--..+..--+..+.+++ T Consensus 176 ~~~~~~~~~~~---~~~g~~~~iP~~kfi~~~~~------~~~~~p~G~gLl-r~~~w~~~fK~~~~~~w~~f~E~yG~P 245 (446) T protein:vir:98 176 PYRIGDPPKKV---DVVGSHVRLPSHKRLFINYN------TKGNNPWGTSCL-TSVLDYSIFKRAFRDMMLIALDRYGTP 245 (446) T ss_pred cchhhhhhhhc---ccCcccccccccceEEEEec------CCCCCccccchH-HHHHHHHHHHHhhHHHHHHHHhHcCCc Confidence 1 122222221 11222345777888877533 345678898855 6677766555566666677778876 Q ss_pred CceeecchHHHhhc--CcchHHHHH---HHHHHHHHhcCCcceeEE----EcCCCceeEeecccCC---HHHHHHHHHHH Q lcl|NC_019404. 217 QAVWKAKGLAELCD--DSEGFGAAR---LRLAQVDNNSGVGQAIGI----DAESEEYSVLNSDIGG---IDAFLDKKFDR 284 (418) Q Consensus 217 ~~v~k~~~l~~~~~--~~~~~~~~~---~r~~~~~~~~~~~~~~~~----d~~~e~~~~~~~~~~g---l~~~~~~~~~~ 284 (418) +.+.|.+..++.-. +++..+... +.+....+..+...++++ +-++-+++.++..-++ ...+++.+-.+ T Consensus 246 ~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~~~~~~~i~~~d~~ 325 (446) T protein:vir:98 246 LIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNFSDSFERAISLCDNN 325 (446) T ss_pred eeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCCceEEeeccccCChhhHHHHHHHHHHH Confidence 44667654332111 111111111 122222333333333333 2344578877765443 56778888889 Q ss_pred HhhhhcCCeeeeeccCcc--ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC----C--ceEEe-CCCCCCC Q lcl|NC_019404. 285 IVALSGIHEIILKNKNVG--GLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNAE----E--WSVEF-SPLDHES 355 (418) Q Consensus 285 iaaas~IP~t~L~G~s~~--gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~~----~--~~~~f-~pL~~~~ 355 (418) ||.+.--.... +|++.+ |=++-|+.....+.+.+++-.+..-.-+.+.|+.-++.-. . ....+ .|-.... T Consensus 326 IskaiLg~~Lt-l~~~~~~~GS~ala~vh~~V~~d~~~aDa~~i~~tln~~Li~~l~~lNf~~~~~~~~~~~~~~~~~~~ 404 (446) T protein:vir:98 326 MLMGMGIPNLL-VQNRETTFGTGRASEIQLELFDGKINSIFDTVIHAFTEQVIGNLIRLNFDPALYPLASNTGYITRLPG 404 (446) T ss_pred HHHHHhccccc-ccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccceeccC Confidence 99887776433 355533 3233455555667777777776533333345666554211 0 00000 0111222 Q ss_pred HHHHHHHHHHHHHHHHHHHhCCCCCH---HHHHHHHHhhcCcCCCChhhcccccccC Q lcl|NC_019404. 356 SKDKAEVLEKSVNSIAALIAAGAMDI---KEARDTLRTIAPEIKIGDNDIQTEESEL 409 (418) Q Consensus 356 eke~ae~~~~~a~a~~~~~~~g~i~~---~e~r~~l~~~~~~~~~~~~~~~~~e~~~ 409 (418) |..|+ ++.|++++++++.|++.+ +.+|+.+. |++.++++ T Consensus 405 --e~eDl-~~~a~~~~~L~~~G~~~p~~~~~ire~~g------------iP~~~~~~ 446 (446) T protein:vir:98 405 --RATDL-AALVEAIKQMHDMGFLVDGDKDHIRSITG------------LPDAISST 446 (446) T ss_pred --ChhhH-HHHHHHHHHHHhCCccccccHHHHHHHhC------------cCCCCCCC Confidence 33343 467999999999998654 33665441 33333333 No 191 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=98.88 E-value=3.6e-09 Score=66.87 Aligned_cols=302 Identities=9% Similarity=-0.011 Sum_probs=150.9 Q ss_pred CccchhhHHH--------------------------HhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhc Q lcl|NC_019404. 1 MVKTDSYANI--------------------------FLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETAL 54 (418) Q Consensus 1 ~~~~D~~~n~--------------------------~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~ 54 (418) ..+.+.-.+. +++....+..+-.|. ++.-|..+++.++....+|..-+..-+ T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~--~~~~la~~~~~~~~h~~~l~~k~n~l~ 89 (351) T protein:vir:78 12 FAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPV--SFAGLAKSFRASTHHSSALFFKANVLA 89 (351) T ss_pred CCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecCCC--CHHHHHHHHhhhHhhhhhhhhhhhHHh Confidence 1111111110 111111122222222 355677777777777666654443332 Q ss_pred cCCccccCcchHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccc Q lcl|NC_019404. 55 AAGFHIDGIDDEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNRE 134 (418) Q Consensus 55 r~~~~i~~~~d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~ 134 (418) + ++. -.... -+..|..++..-.+||.|++++.-+ ..|.+..|.++++..+.+.... T Consensus 90 ~-~~~--Pn~~~-----------t~~~f~~~~~d~ll~Gnay~~~~rn----------~~G~~~~L~pl~~~~v~~~~~~ 145 (351) T protein:vir:78 90 S-TFR--PHRWL-----------SRHAFERWALDFLTFGNGYLERRRN----------MVGGTLRLEPALAKYVRRKADF 145 (351) T ss_pred h-ccc--CCCCC-----------CHHHHHHHHHHHHhcCCeEEEEEEC----------CCCCEEEEEEecCcceEEeeeC Confidence 2 111 00111 1222444454556789999887542 2355778888887776553311 Q ss_pred ccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 135 ENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRR 214 (418) Q Consensus 135 ~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~ 214 (418) . .+|.+.+.+ ....+.+..|||+.+. .+....+|.|++.. +...+..-..+.......+.. T Consensus 146 ~----------~~~~~~~~~--~~~~~~~~eVihir~~------~~~~~~yGl~~~~~-a~~si~l~~~a~~~~~~~f~N 206 (351) T protein:vir:78 146 S----------GFVYVNGWQ--ERHEFAPDSVFQLVRP------DINQEVYGLPEYLS-SLHSAWLNESSTLFRRKYYEN 206 (351) T ss_pred C----------eEEEEecCC--eEEEEccccEEEEcCC------CCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhc Confidence 0 123333322 2356788899999632 23345689999975 666666665655555555544 Q ss_pred cCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC---CCceeEeecccCC----HHHHHHHHHHHH Q lcl|NC_019404. 215 KQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE---SEEYSVLNSDIGG----IDAFLDKKFDRI 285 (418) Q Consensus 215 ~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~---~e~~~~~~~~~~g----l~~~~~~~~~~i 285 (418) .... ++++++ ..+ +.+..+.+++.++.. ...++.+.+++... .+.++.+..+.+. .-++.+...+.| T Consensus 207 Ga~pggIl~~~~--~~l-s~e~~~~lr~~~~~~-~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eI 282 (351) T protein:vir:78 207 GSHAGFILYMTD--AAQ-KQDDVDNMRDALKNA-KGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDL 282 (351) T ss_pred cCCCceEEEecC--CCC-CHHHHHHHHHHHHHh-cCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHH Confidence 3322 334332 112 233445566666432 33344455555422 2334444444332 445566778889 Q ss_pred hhhhcCCeeeeeccCcc---ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCceEEeCCCCCCCHHHHH Q lcl|NC_019404. 286 VALSGIHEIILKNKNVG---GLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNAEEWSVEFSPLDHESSKDKA 360 (418) Q Consensus 286 aaas~IP~t~L~G~s~~---gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~~~~~~~f~pL~~~~eke~a 360 (418) |++.+||..++ |..+. |++ +-+...+.|+. +.|.|+++++-.+.-+-..-.|+|++--=+...++| T Consensus 283 a~a~~VPp~ll-Gi~~~~t~~~s-n~e~~~~~f~~-------~~l~P~~~~iee~n~~l~~~~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 283 LAAHRVPPQLL-GIVPSNSGGFG-TPDTAARVFGR-------NEIRPLQARFAELNDWLGDEVVRFDDYEIPPAPVAA 351 (351) T ss_pred HHHhCCCHHHh-cccCCCCCCcc-cHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCccceecChhhhccccccC Confidence 99999998655 76543 333 33444455543 457887777755432212223666643322222233 No 192 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=98.86 E-value=6.4e-09 Score=65.52 Aligned_cols=301 Identities=10% Similarity=0.000 Sum_probs=151.8 Q ss_pred CccchhhHHH--------------------------HhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhc Q lcl|NC_019404. 1 MVKTDSYANI--------------------------FLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETAL 54 (418) Q Consensus 1 ~~~~D~~~n~--------------------------~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~ 54 (418) ..+.+.-.+. +++....+..+-.|. ++.-|..+++.++....+|..-+.+-. T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~--~~~~la~~~~~~~~h~~~l~~k~n~l~ 89 (351) T protein:vir:79 12 FAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPV--SFAGLAKSFRASTHHSSALFFKANVLA 89 (351) T ss_pred CCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecCCC--CHHHHHHHHhhhHhhhhhhhhhhhHHh Confidence 1111111110 111111122222222 355677777777777776654443332 Q ss_pred cCCccccCcchHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccc Q lcl|NC_019404. 55 AAGFHIDGIDDEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNRE 134 (418) Q Consensus 55 r~~~~i~~~~d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~ 134 (418) + ++.- ....+ +..|..++..-.++|.|++++.-+ ..|.+..+.++++..+...... T Consensus 90 ~-~~~P--np~~t-----------~~~f~~~v~d~ll~Gnay~~~~r~----------~~G~~~~L~~l~~~~v~~~~~~ 145 (351) T protein:vir:79 90 S-TFRP--HRWLS-----------RHAFERWALDFLTFGNGYLERRRN----------MVGGTLRLEPALAKYVRRKADF 145 (351) T ss_pred h-cccC--CCCCC-----------HHHHHHHHHHHHhcCCeEEEEEEC----------CCCCEEEEEEeCCcceeeeecC Confidence 2 1110 00111 122444444445789999887542 2356778888888877654321 Q ss_pred ccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 135 ENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRR 214 (418) Q Consensus 135 ~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~ 214 (418) . .+|.+...+ ....+.++.|||+... .+....+|.|+++. +...+..-..+.......+.. T Consensus 146 ~----------~~~~~~~~g--~~~~~~~~eIihir~~------~~~~~~yGl~~~~~-a~~si~l~~~a~~~~~~~f~N 206 (351) T protein:vir:79 146 S----------GFVYVNGWQ--ERHEFEPDSVFQLVRP------DINQEVYGLPEYLS-SLHSAWLNESSTLFRRKYYEN 206 (351) T ss_pred C----------eEEEEecCc--eEEEEcCccEEEeCCC------CCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhc Confidence 1 123333322 2346778899999532 12345689999975 566676665665555555544 Q ss_pred cCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC---CCceeEeecccCC----HHHHHHHHHHHH Q lcl|NC_019404. 215 KQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE---SEEYSVLNSDIGG----IDAFLDKKFDRI 285 (418) Q Consensus 215 ~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~---~e~~~~~~~~~~g----l~~~~~~~~~~i 285 (418) .... ++++++. .+ +.+..+.+++.++. ....++.+.+++... .+.++.+....+. .-++.+...+.| T Consensus 207 Ga~pg~il~~~~~--~l-s~e~~~~lk~~~~~-~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI 282 (351) T protein:vir:79 207 GSHAGFILYMTDA--AQ-KQDDVDNMRDALKN-AKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDL 282 (351) T ss_pred cCCCceEEEecCC--CC-CHHHHHHHHHHHHH-hcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHH Confidence 4332 3344321 12 23345556666654 233344455555422 2334444444432 445666788899 Q ss_pred hhhhcCCeeeeeccCcc---ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cCCceEEeCCCCCCCHHHHH Q lcl|NC_019404. 286 VALSGIHEIILKNKNVG---GLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVN-AEEWSVEFSPLDHESSKDKA 360 (418) Q Consensus 286 aaas~IP~t~L~G~s~~---gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~-~~~~~~~f~pL~~~~eke~a 360 (418) |++.+||..+| |..+. |+ ++-+...+.|+. +.|.|+++++-.+--+ ..+ .++|++---+....+| T Consensus 283 ~~a~~VPp~ll-Gi~~~~t~~~-~n~e~~~~~f~~-------~~l~Pl~~~ie~ln~~lg~~-~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 283 LAAHRVPPQLL-GIVPSNSGGF-GTPDTAARVFGR-------NEIRPLQARFAELNDWLGDE-VVTFDDYEIPPAPVAA 351 (351) T ss_pred HHHhCCCHHHh-cccCCCCCCc-ccHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCcc-eeeeChhhhccccccC Confidence 99999998655 76543 33 334555556654 3477777666543222 222 2567653323222222 No 193 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=98.85 E-value=2.5e-09 Score=67.79 Aligned_cols=300 Identities=10% Similarity=-0.010 Sum_probs=146.6 Q ss_pred Ccc-chhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCch Q lcl|NC_019404. 1 MVK-TDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMT 79 (418) Q Consensus 1 ~~~-~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~ 79 (418) |+. .|-+.. ++....+..+-.|. ++.-|..+++.|+....+|..-+.+..+ ++. -.... -+ T Consensus 64 v~~~~~~~~~--~~~~~~~~~~~pp~--~~~~La~~~~~~~~h~s~l~~k~n~l~~-~~~--Pnp~l-----------T~ 125 (376) T protein:vir:10 64 VMNRAEILDY--VECWSNGEWFEPPV--SFAGLAKSFRASTHHSSALFFKANVLAS-TFR--PHRWL-----------SR 125 (376) T ss_pred ccCcchhhhh--hhhhhcCceecCCC--CHHHHHHHHhhhHHhhhhHHHHhHHHHh-ccC--CCCCC-----------CH Confidence 111 110000 01001111222222 3445666666666666665544443322 111 00000 12 Q ss_pred HHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccccc Q lcl|NC_019404. 80 QNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFY 159 (418) Q Consensus 80 ~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~ 159 (418) ..|.+++..-.++|.|++.+.-+ ..|.+..|.++++.++++... +. .+|++...+ ... T Consensus 126 ~~f~~~v~d~ll~Gnay~~~~rn----------~~G~~~~L~pl~~~~vr~~~d---~~-------~~~~~~~~~--~~~ 183 (376) T protein:vir:10 126 HAFERWALDFLTFGNGYLERRRN----------MVGGTLRLEPALAKYVRRKAD---FN-------GFVYVNGWQ--ERH 183 (376) T ss_pred HHHHHHHHHHHhcCCeEEEEEEC----------CCCCEEEEEEeCCcceEEEee---CC-------eEEEEEcCC--eEE Confidence 23444444556789999877532 345677888888887765421 11 133333322 234 Q ss_pred ccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHH Q lcl|NC_019404. 160 DVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGA 237 (418) Q Consensus 160 ~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~ 237 (418) .+.++.|||+... .+....+|.|+++. +...+..-..+.......+...... ++++.+ ..+ +.+..+. T Consensus 184 ~~~~~eViHir~~------~~~~~~yGls~~~~-a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~d--~~l-~~e~~~~ 253 (376) T protein:vir:10 184 EFEPDSVFQLVRP------DINQEVYGLPEYLS-SLHSAWLNESSTLFRRKYYENGSHAGFILYMTD--AAQ-KQDDVDN 253 (376) T ss_pred EEccccEEEecCC------CCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEecC--CCC-CHHHHHH Confidence 5778889999532 23345689999975 5566665555555555554443322 333332 112 2334445 Q ss_pred HHHHHHHHHHhcCCcceeEEEcC---CC--ceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeeeeccCcc---ccccc Q lcl|NC_019404. 238 ARLRLAQVDNNSGVGQAIGIDAE---SE--EYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIILKNKNVG---GLSSS 307 (418) Q Consensus 238 ~~~r~~~~~~~~~~~~~~~~d~~---~e--~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L~G~s~~---gl~st 307 (418) +++.++.. ...++.+.+++... .+ +|..++.+.. ..-++.+...+.||++.+||.. |+|..+. |+ ++ T Consensus 254 lr~~~~~~-~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~-llGi~~~~t~~~-sn 330 (376) T protein:vir:10 254 MRDALKNA-KGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQ-LLGIVPSNSGGF-GT 330 (376) T ss_pred HHHHHHHh-cCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCc-cc Confidence 55555432 23344455555422 23 3444444332 2345566778899999999985 5587643 33 33 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCceEEeCCCCCCCHHHHH Q lcl|NC_019404. 308 QNTALETFHKLIDRKRNAELLPILEFLIPFIVNAEEWSVEFSPLDHESSKDKA 360 (418) Q Consensus 308 ge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~~~~~~~f~pL~~~~eke~a 360 (418) -|...+.|+. +.|.|+++.+-.+.-+-..-.++|++-.-+.-.++| T Consensus 331 ~eq~~~~f~~-------~~L~Pl~~~ieeln~~L~~~~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 331 PDTAARVFGR-------NEIRPLQARFAELNDWLGEEVVRFDDYEIPPAPVAA 376 (376) T ss_pred HHHHHHHHHH-------HHHHHHHHHHHHHHhhccccccccChhHhhcccccC Confidence 4555555654 347887777655332211223666642222222222 No 194 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.79 E-value=3.2e-08 Score=61.65 Aligned_cols=392 Identities=15% Similarity=0.143 Sum_probs=185.7 Q ss_pred CccchhhHHHH------hcCCCCccccCcc-ccCCHH------HHHHHHH---------------------cCCccchhh Q lcl|NC_019404. 1 MVKTDSYANIF------LGGSDGSEIYGSL-QNQAPT------ILASLYA---------------------DNALVRRII 46 (418) Q Consensus 1 ~~~~D~~~n~~------~g~~~~~~~~~~~-~~~~~~------~l~~~Y~---------------------~~~~~r~iV 46 (418) |=--+.+.|.+ ++..+-......+ -..++. .-.++|+ +-+++++|+ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~ 80 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSA 80 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHH Confidence 33333444433 1111111111100 001110 1112232 226888999 Q ss_pred hcchhhhccCCccccCcc--h-----------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCc-c----- Q lcl|NC_019404. 47 DTIPETALAAGFHIDGID--D-----------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRA-L----- 107 (418) Q Consensus 47 d~~a~d~~r~~~~i~~~~--d-----------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~-l----- 107 (418) ...|+-.+.+...|.-++ . .+.+.+.++.-++...+.+++..+...|++++-+.++.+.. + T Consensus 81 ~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~~~I~~v~a 160 (517) T protein:vir:98 81 DVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVDNGEIEFSWALA 160 (517) T ss_pred HHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEeCCeeEEEEEcC Confidence 999999999876764221 0 12355556666899999999999998898888766643321 1 Q ss_pred --ccccc--CCCceEEEEEeeccc--------cccccccccccccccCcceEEEEec-----C-Ccccccc--------- Q lcl|NC_019404. 108 --TSPVR--EGAELETVRVYDRTQ--------VKVQNREENPRNARFGKPLTYRITT-----N-ESDMFYD--------- 160 (418) Q Consensus 108 --~~pl~--~~~~i~~i~v~~~~~--------i~~~~~~~dp~s~~yg~p~~y~i~~-----~-~~~~~~~--------- 160 (418) --|+. ..+-+....+..-+. ++..++. .+..-.++. ..|+|.. . ....+.+ T Consensus 161 d~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H-~~~~~~~~~-~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~ 238 (517) T protein:vir:98 161 NAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFH-EWEKTEEGE-SLYVITNELYKSDNEGEIGKRIPLEELYEG 238 (517) T ss_pred CeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEE-ecCceeccC-CcEEEEEEEEecCCCccccccccccccccC Confidence 01221 111111111110000 0000000 000000000 0133321 1 1111111 Q ss_pred cCcccEEEecCccchh--hh-------hhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhc- Q lcl|NC_019404. 161 VHYSRIHIIDGERVPN--AM-------RRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCD- 230 (418) Q Consensus 161 iH~SR~i~~~g~~lp~--~~-------~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~- 230 (418) +.++ +.+.|-+-|. ++ .....++|.|++.. +.+.++..+.+......-+......++- + ..++. T Consensus 239 l~~~--~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~-a~~~~d~lD~~~s~~~~e~~~g~~~i~v-p--~~~l~~ 312 (517) T protein:vir:98 239 MQEK--TYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDN-SVSTLKKINDTYDQFWWEIKMGQRTVFV-S--DVMLRT 312 (517) T ss_pred CCcc--eeECCCCcceEEEecCCcccccccCCCCCCchhhh-hHHHHHHHHHHHHHHHHHHHhCCcceec-C--hhhhcc Confidence 1111 2233322221 11 11235789999974 7799999988877776655544443322 1 22221 Q ss_pred --CcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeeccc--CCHHHHHHHHHHHHhhhhcCCeeeeeccCcccccc Q lcl|NC_019404. 231 --DSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDI--GGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSS 306 (418) Q Consensus 231 --~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~--~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~s 306 (418) ++.+.... ..+ .........+..+..+..++..+..+ ......++.+.+.|+..+|+|..-| |....|.. T Consensus 313 ~~~~~g~~~~-~~~---d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~-~~~~~~~k- 386 (517) T protein:vir:98 313 VPDESGMPPP-QVF---DPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTF-SFDGRSMK- 386 (517) T ss_pred ccCCCCcccC-CCC---CcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccc-cccccccc- Confidence 11110000 000 00001111222222233455555554 3466778889999999999998655 66666664 Q ss_pred chhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--------------cCCceEEeCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019404. 307 SQNT---ALETFHKLIDRKRNAELLPILEFLIPFIVN--------------AEEWSVEFSPLDHESSKDKAEVLEKSVNS 369 (418) Q Consensus 307 tge~---d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~--------------~~~~~~~f~pL~~~~eke~ae~~~~~a~a 369 (418) |..+ ..+..|.+++++|. .++..|++|+..|+. ..++++.|++-..+|.++. +++ T Consensus 387 TATEi~s~~~~~~~t~~~~~~-~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~-------~~~ 458 (517) T protein:vir:98 387 TATEIVSENDLTYRTRNDHVY-EVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSAL-------LRF 458 (517) T ss_pred cHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHH-------HHH Confidence 4432 33456778888775 478899888877742 1257899999888887665 444 Q ss_pred HHHHHhCCCCCHHHHHH------------HHHhhcCc-CCCChhh-cccccccCCCccc Q lcl|NC_019404. 370 IAALIAAGAMDIKEARD------------TLRTIAPE-IKIGDND-IQTEESELITETE 414 (418) Q Consensus 370 ~~~~~~~g~i~~~e~r~------------~l~~~~~~-~~~~~~~-~~~~e~~~~~e~e 414 (418) +.+++.+|+++..+++. .+....++ .+.++.. .+..+....+++| T Consensus 459 ~~~~v~aG~ms~~~~i~~~~g~~eeeA~~e~~~i~~E~~~~~~~~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 459 YGQAKTFGFIPTVEAIQRIFKVPKKTAEQWLEEIRKDQIELDPVTISQRAQKRMFGDEE 517 (517) T ss_pred HHHHHhcCCCCHHHHHHHhCCCChHHHHHHHHHHHHhccccCCCCccccccCCCCCCCC Confidence 55667777777666543 33221111 1111111 1222333444555 No 195 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=98.78 E-value=2.7e-08 Score=62.08 Aligned_cols=299 Identities=11% Similarity=0.074 Sum_probs=145.7 Q ss_pred ccchhhHHH---Hh-----------------------cCC--CCccccCccccCCHHHHHHHHHcCCccchhhhcchhhh Q lcl|NC_019404. 2 VKTDSYANI---FL-----------------------GGS--DGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA 53 (418) Q Consensus 2 ~~~D~~~n~---~~-----------------------g~~--~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~ 53 (418) |+...-+.. .+ +.. ..+..+-.| .++..|..+++.|+....++..-.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~epp--~~~~~la~~~~~~~~h~~~i~~k~n~l 78 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLSEITASPALDYVGIGFDENYNCYLPP--VNRHALAKLPHQNAQHGGILHSRANMV 78 (345) T ss_pred CCccccccchhhhcCCCceEEEeecCCcccchhhcccceeeecCCccccCC--CCHHHHHHHhhcchhhcchhhhhhhHH Confidence 111111000 00 000 011111111 235566777777766666654433332 Q ss_pred ccCCccccCcchHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccc Q lcl|NC_019404. 54 LAAGFHIDGIDDEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNR 133 (418) Q Consensus 54 ~r~~~~i~~~~d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~ 133 (418) .+ ++. -.... -+..|.+++..-.++|.|++.+.-+ ..|.+..+.++++..+..... T Consensus 79 ~~-~~~--Pn~~~-----------t~~~f~~~v~d~ll~Gnay~~i~rn----------~~G~~~~L~pl~~~~vr~~~d 134 (345) T protein:vir:37 79 SA-TYE--GGKAL-----------SKMEMRALCLNLIQFGDVGLLKVRN----------GFGQVVRLVPLSSLYLRVHKD 134 (345) T ss_pred hh-ccC--CCCCC-----------CHHHHHHHHHHHHhcCCeEEEEEEC----------CCCCEEEEEEecCceeEEeec Confidence 22 111 00000 1233444444456789999988642 335577788888776654221 Q ss_pred cccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 134 EENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLR 213 (418) Q Consensus 134 ~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~ 213 (418) .. .|.....|.+... +....+.++.|+||.... +....+|.|++.. +...+..-..+.......+. T Consensus 135 ---~~--~~~~~~~~~~~~~--g~~~~~~~~eViHir~~~------~~~~~~Gl~~~~~-a~~si~l~~~a~~~~~~~f~ 200 (345) T protein:vir:37 135 ---GG--YSYLMKKSLYDTA--QEIYRYDAKDIIFIKLYD------PMQQVYGSPDYVG-GIQSALLNSDATVFRRRYFS 200 (345) T ss_pred ---CC--eeEEEeeeeeccC--ceEEEEccccEEEEcCCC------CCCCcccchHHHH-HHHHHHHHHHHHHHHHHHHh Confidence 11 1111112222221 233467889999996421 2334679999875 44555554555555555444 Q ss_pred HcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC---CC--ceeEeecccCC--HHHHHHHHHHH Q lcl|NC_019404. 214 RKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE---SE--EYSVLNSDIGG--IDAFLDKKFDR 284 (418) Q Consensus 214 ~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~---~e--~~~~~~~~~~g--l~~~~~~~~~~ 284 (418) ..... ++++++ ..+ +.+..+.++++++.. ...++.+.+++... .+ ++..++.+... .-++.+...+. T Consensus 201 NGa~~~~Il~~t~--~~l-~~e~~~~lk~~~~~~-~g~~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~d 276 (345) T protein:vir:37 201 NGAHMGFILYSTD--PDL-TEEMEEEIARKISES-KGVGNFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQD 276 (345) T ss_pred ccCCcceEEEeCC--CCC-CHHHHHHHHHHHHHh-cCccccCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHH Confidence 33222 333332 112 233344455555442 23344455555422 12 34444443322 34456778889 Q ss_pred HhhhhcCCeeeeeccCc---cccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----ccCCceEEeCC--CCC Q lcl|NC_019404. 285 IVALSGIHEIILKNKNV---GGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV----NAEEWSVEFSP--LDH 353 (418) Q Consensus 285 iaaas~IP~t~L~G~s~---~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~----~~~~~~~~f~p--L~~ 353 (418) ||++.+||..++ |..+ ++++ +-+...+.|+. +.|.|+++.+-..+- ...+..+.|++ |.. T Consensus 277 I~~a~~VPp~li-Gi~~~~t~~~s-~~e~~~~~f~~-------~~l~P~~~~ie~~ln~~~e~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 277 VLTAHRFPAGLS-GIIPTNTGGLG-DPLKYREVYHY-------DEVMPLQEIIAETINQDPEIKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHhCCCHHHh-ccccCCCCCcc-cHHHHHHHHHH-------HHHHHHHHHHHHHhhhhhccCCcceEEECchhhcC Confidence 999999998654 7654 3443 34555556654 457888777766653 23456677774 222 No 196 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=98.77 E-value=3.6e-08 Score=61.40 Aligned_cols=289 Identities=9% Similarity=0.043 Sum_probs=144.9 Q ss_pred Cccchhh--HHHHhcC--CCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHh Q lcl|NC_019404. 1 MVKTDSY--ANIFLGG--SDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDL 76 (418) Q Consensus 1 ~~~~D~~--~n~~~g~--~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l 76 (418) |+ |+. .+ .++. ...++++-.| .++.-|..+++.|+...+++..-+....+ ++... T Consensus 28 ~~--~~~~~~~-~~~~~~~~~~~~~~pP--~~~~~La~l~~~~~~h~~~L~~k~N~~~~-~f~~~--------------- 86 (337) T protein:vir:78 28 ID--PTAWMTD-YTGVFYNPYGEYYQPP--IDRKGLAKVARANAHHGAILMARRNMVAG-RFTNQ--------------- 86 (337) T ss_pred cc--CcchhHh-hhhhhhccCcceecCC--CCHHHHHHHhhcchhhhhHHHhhhccccc-cCcCc--------------- Confidence 21 111 00 0010 0011222222 24566777777777766666654442222 22211 Q ss_pred CchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcc Q lcl|NC_019404. 77 EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESD 156 (418) Q Consensus 77 ~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~ 156 (418) ++.+..++..-.++|.|++.+.- + ..|.+..|.++++..+.... | | ..|.+..++ T Consensus 87 --~~~~~~~~~d~ll~GNay~~~~r-n---------~~G~~~~L~pl~~~~v~~~~---d------~--~~~~~~~~~-- 141 (337) T protein:vir:78 87 --RATITAFVHNYLQFGDGGLLKLR-N---------SFGQVVGLHPLSSVYLRRRE---D------G--CFVYLQQGK-- 141 (337) T ss_pred --HHHHHHHHHHHHhhCCeEEEEEE-C---------CCCcEEEEEEeCCceeEeee---C------C--eEEEEEcCC-- Confidence 11233334444678999988743 2 23557778888776665432 1 1 123333332 Q ss_pred cccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcch Q lcl|NC_019404. 157 MFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEG 234 (418) Q Consensus 157 ~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~ 234 (418) ....+.++.|+|+.+.. +....+|.|+++. +...+..-..+.......+...... ++.+++. . + +.+. T Consensus 142 ~~~~~~~~eIiHik~~~------~~~~~~Gls~~~~-a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~-~-l-~~e~ 211 (337) T protein:vir:78 142 PNLIYRPDDVIWLAQYD------PEQQVYGMPDYLG-GLQSALLNQDATLFRRRYFLNGAHMGFIFYATDP-N-M-DDDT 211 (337) T ss_pred ceEEECCccEEEECCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC-C-C-CHHH Confidence 23467788899986432 2334579999976 5566766666666666655443322 3333321 1 1 2233 Q ss_pred HHHHHHHHHHHHHhcCCcceeEEEcC---CCceeEeecccCC----HHHHHHHHHHHHhhhhcCCeeeeeccCcccccc- Q lcl|NC_019404. 235 FGAARLRLAQVDNNSGVGQAIGIDAE---SEEYSVLNSDIGG----IDAFLDKKFDRIVALSGIHEIILKNKNVGGLSS- 306 (418) Q Consensus 235 ~~~~~~r~~~~~~~~~~~~~~~~d~~---~e~~~~~~~~~~g----l~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~s- 306 (418) .+.+++.++. ....++.+.+++... .+.++...++.+. .-++.....+.||++.+||..++ |..+.+-++ T Consensus 212 ~~~lk~~~~~-~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~ll-Gi~~~~~~~~ 289 (337) T protein:vir:78 212 EEEMKEMIAN-SKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALA-GIIPTNGGGG 289 (337) T ss_pred HHHHHHHHHH-hcCcccccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHc-ccccCCCcCc Confidence 4445555543 222344555555421 2334444444333 33456678889999999998644 765443222 Q ss_pred --chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc---cCC--ceEEeCCCCCC Q lcl|NC_019404. 307 --SQNTALETFHKLIDRKRNAELLPILEFLIPFIVN---AEE--WSVEFSPLDHE 354 (418) Q Consensus 307 --tge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~---~~~--~~~~f~pL~~~ 354 (418) +-|.....|+. +.|.|+++++-..+-+ ... +.|+++.=-.+ T Consensus 290 ~~n~e~~~~~f~~-------~~L~P~~~~ie~~~n~~ll~~~~~~~f~~~~~~~~ 337 (337) T protein:vir:78 290 LGDPEKYDATYAR-------NEVLPLCELVQDAINSAGLPRALWVTFRETIGAAV 337 (337) T ss_pred cccHHHHHHHHHH-------HHHHHHHHHHHHHHhhhcCChhhceeccccccccC Confidence 23545555554 4578887776655522 221 23444422222 No 197 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=98.75 E-value=2.3e-08 Score=62.45 Aligned_cols=297 Identities=14% Similarity=0.094 Sum_probs=146.7 Q ss_pred Cc-cchhhHHHHhcC-CCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHH---H Q lcl|NC_019404. 1 MV-KTDSYANIFLGG-SDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWD---D 75 (418) Q Consensus 1 ~~-~~D~~~n~~~g~-~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~---~ 75 (418) |+ +.|-+.+ ++. ...+..+-.| .++..|..+++.++.-..++..-..... .-++ . T Consensus 32 ~~~~~~~~~~--~~~~~~~~~~~~pp--~~~~~la~l~~~~~~h~~~i~~k~n~l~----------------~l~~~Pn~ 91 (346) T protein:vir:10 32 VLDRADILNY--LECSAMYEKWYNPP--MSFDGLAKSLRSSTHHESAIITKANILL----------------STCEVDSR 91 (346) T ss_pred ecCchhHHHH--HHHhhcCCceEecC--CCHHHHHHHHHhhhhcchhhhhhhhhHH----------------HHHhCCCC Confidence 11 1111111 111 0011111111 2345566666666654444433222211 1111 1 Q ss_pred hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCc Q lcl|NC_019404. 76 LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNES 155 (418) Q Consensus 76 l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~ 155 (418) +--+..|.+++..-.++|.|++.+.-+ ..|.+..+.+++++.+++.... |- | .|.+...+ T Consensus 92 ~~t~~~f~~~~~d~ll~Gnay~~i~r~----------~~G~~~~L~pl~~~~v~~~~~~-~~----~----~~~~~~~~- 151 (346) T protein:vir:10 92 YLSRRDLSSFVKDYLVFGNAYFEVVRN----------RLGQVQRIESPLAKYVRKGLEA-GQ----F----YYVPQRFD- 151 (346) T ss_pred CCCHHHHHHHHHHHHhcCCeEEEEEEc----------CCCcEEEEEEecCCceEEEEcC-Ce----E----EEEEEccC- Confidence 112334455554556799999887532 3355778888888888763321 11 1 22232222 Q ss_pred ccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcc Q lcl|NC_019404. 156 DMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSE 233 (418) Q Consensus 156 ~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~ 233 (418) +....+-++.|||+.... +....+|.|++.. +...+..-..+.......+...... ++++++ ..+ +.+ T Consensus 152 g~~~~~~~~dIih~r~~~------~~~~~~G~~~~~~-a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d--~~l-~~e 221 (346) T protein:vir:10 152 HQEHEFAKGSIYHLLEPD------INQDIYGLPQYLS-ALQSAWLNESATLFRRKYFLNGAHAGFVFYMSD--ASQ-KQE 221 (346) T ss_pred CeEEEEecccEEEecCCC------CCCCeeeccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC--CCC-CHH Confidence 223567788899995332 2334689999976 5567776666666666665554332 233332 111 233 Q ss_pred hHHHHHHHHHHHHHhcCCcceeEEEcC---CCceeEeecccCC----HHHHHHHHHHHHhhhhcCCeeeeeccCcc---c Q lcl|NC_019404. 234 GFGAARLRLAQVDNNSGVGQAIGIDAE---SEEYSVLNSDIGG----IDAFLDKKFDRIVALSGIHEIILKNKNVG---G 303 (418) Q Consensus 234 ~~~~~~~r~~~~~~~~~~~~~~~~d~~---~e~~~~~~~~~~g----l~~~~~~~~~~iaaas~IP~t~L~G~s~~---g 303 (418) ..+.+++.++.. ...++.+.+++... .+.++....+.+. .-++.+...+.||++.+||..+| |..++ + T Consensus 222 ~~~~i~~~~~~~-~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~ll-G~~~~~~~~ 299 (346) T protein:vir:10 222 DVENIRQQLKQS-KGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLM-GIIPNNTGG 299 (346) T ss_pred HHHHHHHHHHHh-cCccccCceeEecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHh-cccCCCCCC Confidence 344455555432 23344455555432 2334444433333 23345677889999999999754 76543 3 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cCCceEEeCCCCCCCHHH Q lcl|NC_019404. 304 LSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVN-AEEWSVEFSPLDHESSKD 358 (418) Q Consensus 304 l~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~-~~~~~~~f~pL~~~~eke 358 (418) ++ +-+...+.|+. +.|.|+++++-.+.-+ ..+ .++|++-.-+.-+| T Consensus 300 ~s-~~e~~~~~f~~-------~~l~P~~~~iee~n~~L~~e-~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 300 FG-NVADAAEVFFI-------TEIEPLQERLKEFNQWLGQE-VIKFKPSKLLQRTQ 346 (346) T ss_pred cc-cHHHHHHHHHH-------HHHHHHHHHHHHHHhhcccc-eeeechhhhcccCC Confidence 43 34555556654 4578887777553321 122 35666433333332 No 198 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=98.75 E-value=2.3e-08 Score=62.45 Aligned_cols=298 Identities=9% Similarity=0.033 Sum_probs=151.8 Q ss_pred Cc-cchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCch Q lcl|NC_019404. 1 MV-KTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMT 79 (418) Q Consensus 1 ~~-~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~ 79 (418) |+ +.|-+..+ +....+.+|..|. ++.-|..+++.|+....+|..-+....+ ++. -..-.+ + T Consensus 34 v~~~~~~~~~~--~~~~~~~~~~pp~--~~~~la~~~~a~~~h~s~i~~k~n~l~~-~~~--Pnp~~t-----------~ 95 (344) T protein:vir:56 34 VLDRRDILDYV--ECISNGRWYEPPV--SFTGLAKSLRAAVHHSSPIYVKRNILAS-TFI--PHPWLS-----------Q 95 (344) T ss_pred ecCcchhhhHH--HhhhcCccccCCC--CHHHHHHHHhhhhhhCccceehhhhHHh-hcC--CCCCCC-----------H Confidence 21 22222111 1111123333333 4556777877777666666554443222 111 000011 1 Q ss_pred HHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccccc Q lcl|NC_019404. 80 QNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFY 159 (418) Q Consensus 80 ~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~ 159 (418) ..|...+..-.++|.|++.+.- + ..|.+..+.++++..+...... + .+|++...+ ... T Consensus 96 ~~f~~~~~d~ll~Gnay~~~~r-n---------~~G~~~~L~pl~~~~v~~~~~~-~---------~~~~~~~~g--~~~ 153 (344) T protein:vir:56 96 QDFSRFVLDFLVFGNAFLEKRY-S---------TTGKVIRLETSPAKYTRRGVEE-D---------VYWWVPSFN--EPT 153 (344) T ss_pred HHHHHHHHHHHhcCCeEEEEEE-C---------CCCcEEEEEEeCCceeEEeecC-C---------EEEEEecCC--eEE Confidence 1132233334568999988743 2 3356778888887777653211 1 234554433 335 Q ss_pred ccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHH Q lcl|NC_019404. 160 DVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGA 237 (418) Q Consensus 160 ~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~ 237 (418) .+.++.|||+.+.. +....+|.|++.. +...+..-..+.......+...... ++++++ ..+ +.+..+. T Consensus 154 ~~~~~dIiHir~~~------~~~~~~Gls~~~~-a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d--~~l-s~e~~~~ 223 (344) T protein:vir:56 154 AFAPGSVFHLLEPD------INQELYGLPEYLS-ALNSAWLNESATLFRRKYYENGAHAGYIMYVTD--AVQ-DRNDIEM 223 (344) T ss_pred EEcCccEEEECCCC------CCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEecC--CCC-CHHHHHH Confidence 68889999996421 2344689999975 5567776666666666655543322 334432 112 2334455 Q ss_pred HHHHHHHHHHhcCCcceeEEEcCC---CceeEeecccCC----HHHHHHHHHHHHhhhhcCCeeeeeccCc---cccccc Q lcl|NC_019404. 238 ARLRLAQVDNNSGVGQAIGIDAES---EEYSVLNSDIGG----IDAFLDKKFDRIVALSGIHEIILKNKNV---GGLSSS 307 (418) Q Consensus 238 ~~~r~~~~~~~~~~~~~~~~d~~~---e~~~~~~~~~~g----l~~~~~~~~~~iaaas~IP~t~L~G~s~---~gl~st 307 (418) ++++++.. ...++...+++..++ +.++....+.+. .-++.....+.||++.+||..+| |..+ +|++ + T Consensus 224 lk~~~~~~-~g~~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~ll-Gi~~~~t~~~~-n 300 (344) T protein:vir:56 224 LRENMVKS-KGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLM-GGKPENVGSLG-D 300 (344) T ss_pred HHHHHHHh-cCCCCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHh-ccCCCCCCccc-c Confidence 55666542 234556667775322 334444444332 34567788889999999999755 6543 3343 3 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCceEEeCCCCCCCHHHHH Q lcl|NC_019404. 308 QNTALETFHKLIDRKRNAELLPILEFLIPFIVNAEEWSVEFSPLDHESSKDKA 360 (418) Q Consensus 308 ge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~~~~~~~f~pL~~~~eke~a 360 (418) -+...+.|+. +.|.|+++++-.+.-+-..=.+.|++..-..+ .| T Consensus 301 ~eq~~~~f~~-------~tL~Pl~~~ie~~n~~l~~~~~~F~~y~l~~~--~~ 344 (344) T protein:vir:56 301 IEKVAKVFVR-------NELIPLQDRIREINGWIGQEVIRFKNYSLDTD--NG 344 (344) T ss_pred HHHHHHHHHH-------HHHHHHHHHHHHHHhhhccccccCCCcccccc--CC Confidence 4555555544 45778776665432211111244543321111 11 No 199 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=98.73 E-value=4.2e-08 Score=61.02 Aligned_cols=296 Identities=9% Similarity=0.006 Sum_probs=146.6 Q ss_pred Cc-cchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchHHHHHHHHHHhCch Q lcl|NC_019404. 1 MV-KTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDDLEMT 79 (418) Q Consensus 1 ~~-~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~~~i~~~~~~l~~~ 79 (418) ++ +.|-+.+ ++.-..++++-.|. ++.-|..+++.|+....+|..-+....+ ++. ..... .+ T Consensus 31 ~~~~~~~~~~--~~~~~~~~~~~pp~--~~~~la~l~~a~~~h~s~i~~k~n~l~~-~~~--Pn~~l-----------t~ 92 (340) T protein:vir:98 31 VLDKRDILDY--VECISNGKWYEPPV--SFSGLAKSLRSAVHHSSPIYVKRNVLAS-TYI--PHPLL-----------SR 92 (340) T ss_pred ecCcchhhhh--hhhhhcCceecCCC--CHHHHHHHHHhccccchhhhhhhhHHhh-ccC--CCCCC-----------CH Confidence 11 1111111 11111112222222 3456777777777666665544443322 111 01111 11 Q ss_pred HHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCccccc Q lcl|NC_019404. 80 QNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFY 159 (418) Q Consensus 80 ~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~ 159 (418) ..|..++..-.++|.|++.+.- + ..|.+..+.+++++.+..... . . .+|++...+ ... T Consensus 93 ~~f~~~~~d~ll~Gnay~~~~r-n---------~~G~~~~L~pl~~~~vr~~~~---~------~-~~~~~~~~~--~~~ 150 (340) T protein:vir:98 93 QDFSRFALDYLVFGNAFLEQRH-S---------VTGQLIKLLTSPAKYTRRGVD---D------S-VFWFVENFT--QPH 150 (340) T ss_pred HHHHHHHHHHHhcCCeEEEEEE-C---------CCCcEEEEEEeCCceEEEccc---C------c-EEEEEecCC--eEE Confidence 2233344444678999988753 2 335577788887777654321 1 1 356666543 235 Q ss_pred ccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHH Q lcl|NC_019404. 160 DVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGA 237 (418) Q Consensus 160 ~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~ 237 (418) .++++.|+||... .+....+|.|++.. +...+..-..+.......+...... ++.+++ ..+ +.+..+. T Consensus 151 ~~~~~eViHir~~------~~~~~~~Gls~~~~-a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~~--~~l-s~e~~~~ 220 (340) T protein:vir:98 151 EFAPDTVFHLLEP------DINQEIYGLPEYLS-ALNSAWLNESATLFRRKYYQNGAHAGYIMYVTD--PAQ-SATDVES 220 (340) T ss_pred EEccccEEEEcCC------CCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEecC--CCC-CHHHHHH Confidence 6889999999642 12344689999976 4455555444444444444433222 233332 112 2334445 Q ss_pred HHHHHHHHHHhcCCcceeEEEcC---CC--ceeEeecccC--CHHHHHHHHHHHHhhhhcCCeeeeeccCc---cccccc Q lcl|NC_019404. 238 ARLRLAQVDNNSGVGQAIGIDAE---SE--EYSVLNSDIG--GIDAFLDKKFDRIVALSGIHEIILKNKNV---GGLSSS 307 (418) Q Consensus 238 ~~~r~~~~~~~~~~~~~~~~d~~---~e--~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~t~L~G~s~---~gl~st 307 (418) +++.++.. ...++.+.+++... .+ ++..++.+.. ..-++.....+.||++.+||.. |.|..+ +|++ + T Consensus 221 lk~~~~~~-~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~-llGi~~~~t~~~s-n 297 (340) T protein:vir:98 221 LRDAMRNS-KGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQ-LMGGKPENIGSLG-D 297 (340) T ss_pred HHHHHHHh-cCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHH-HhcccCCCCCccc-c Confidence 55555542 33344445555422 23 3444443332 2446677888999999999986 447653 3343 3 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-ccCCceEEeCCCCCCCHH Q lcl|NC_019404. 308 QNTALETFHKLIDRKRNAELLPILEFLIPFIV-NAEEWSVEFSPLDHESSK 357 (418) Q Consensus 308 ge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~-~~~~~~~~f~pL~~~~ek 357 (418) -+...+.|+. +.|.|+++++-.+.- +..++ ++|++-.-++.. T Consensus 298 ~e~~~~~f~~-------~~l~Pl~~~iee~n~~L~~e~-~rF~~~~l~~~d 340 (340) T protein:vir:98 298 VEKVAKVFVR-------NELSPLQDRFREVNDWLGMEV-IRFKEYTLDNPE 340 (340) T ss_pred HHHHHHHHHH-------HHHHHHHHHHHHHHhcccccc-cccCccccccCC Confidence 4555555554 458888777755321 11222 456543322222 No 200 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=98.73 E-value=1.6e-08 Score=63.25 Aligned_cols=297 Identities=11% Similarity=0.030 Sum_probs=141.5 Q ss_pred CccchhhHHHHh---------------------------cCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhh Q lcl|NC_019404. 1 MVKTDSYANIFL---------------------------GGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA 53 (418) Q Consensus 1 ~~~~D~~~n~~~---------------------------g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~ 53 (418) .++.....+..+ +.-..+..+..+ .++.-|..+++.|+....++..-.... T Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp--~~~~~la~~~~~~~~h~~~l~~k~n~l 91 (350) T protein:vir:11 14 TVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPP--LSMEGLAKSVGSSVYLQSGLKFKRNML 91 (350) T ss_pred ccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCC--CCHHHHHHHHhhhhhhccchhhhhhhh Confidence 111111111111 000011222222 234445566655555555554333222 Q ss_pred ccCCccccCcchHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccc Q lcl|NC_019404. 54 LAAGFHIDGIDDEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNR 133 (418) Q Consensus 54 ~r~~~~i~~~~d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~ 133 (418) .+ .+ .-.. +-.+..|..++..-.+||.|++.+.- + ..|.+..+.++++..+..... T Consensus 92 ~~-~~--~Pn~-----------~~t~~~f~~~v~d~ll~Gnay~~~~r-n---------~~G~~~~L~~l~~~~vr~~~~ 147 (350) T protein:vir:11 92 AK-TF--IPHR-----------LLSRATFEQFSLDWLTFGSAYLEQPR-S---------RLGTRMPLQAPLAKYMRRGTD 147 (350) T ss_pred hh-cc--cCCC-----------CCCHHHHHHHHHHHHhcCCeEEEEEE-c---------CCCCEEEEEEeCCceeEeeec Confidence 11 11 0000 11122344444455689999998753 2 235577788888877764321 Q ss_pred cccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 134 EENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLR 213 (418) Q Consensus 134 ~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~ 213 (418) . + .+|++...+ ....+.++.||||.+. .+....+|.|++.. +...+..-..+.......+. T Consensus 148 ~-~---------~~~~~~~~~--~~~~~~~~eVihir~~------~~~~~~yGls~~~~-a~~si~l~~~a~~~~~~~f~ 208 (350) T protein:vir:11 148 L-E---------TFYQVRSWK--DEHEFEKGSVIQLREA------DINQEIYGVPEWFC-ALQSALLNESATLFRRKYYN 208 (350) T ss_pred C-C---------eEEEEeeCC--eEEEECcccEEEeCCC------CCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHh Confidence 1 0 145555433 3357888999999642 22345689999976 55666665555555555554 Q ss_pred HcCC--ceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC---CC--ceeEeecccC--CHHHHHHHHHHH Q lcl|NC_019404. 214 RKQQ--AVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE---SE--EYSVLNSDIG--GIDAFLDKKFDR 284 (418) Q Consensus 214 ~~~~--~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~---~e--~~~~~~~~~~--gl~~~~~~~~~~ 284 (418) .... -++++++ ..+ +.+..+.+++.++.. ...++.+.+++... .+ ++..++.+.. ..-++.+...+. T Consensus 209 NGa~~~gil~~~~--~~l-s~e~~~~l~~~~~~~-~G~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~e 284 (350) T protein:vir:11 209 NGSHAGFILYMTD--AAQ-NEEDIDALRTALKTA-KGPGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDD 284 (350) T ss_pred ccCCCceEEEecC--CCC-CHHHHHHHHHHHHHh-cCccccCceeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHH Confidence 4333 2344442 112 233444555555442 22334445555422 23 3444443332 234566688889 Q ss_pred HhhhhcCCeeeeeccCc---cccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-ccCCceEEeCCCCCCCHH Q lcl|NC_019404. 285 IVALSGIHEIILKNKNV---GGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIV-NAEEWSVEFSPLDHESSK 357 (418) Q Consensus 285 iaaas~IP~t~L~G~s~---~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~-~~~~~~~~f~pL~~~~ek 357 (418) ||++.+||..+| |..+ +|++ +-+...+.|+. +.|.|+++.+-.+.- +..++ +.|++- .++.- T Consensus 285 Ia~a~~VPp~ll-Gi~~~~t~~~s-n~e~~~~~f~~-------~~L~P~~~~ie~ln~~l~~~~-~~F~~~-~~~~l 350 (350) T protein:vir:11 285 QLAGLRVYPQLM-GVVPQNAGGFG-SISDAAAVWAS-------LELAPMQTRLQQVNEMIGEEV-VRFAQF-DAPGL 350 (350) T ss_pred HHHHhCCCHHHh-cccCCCCCCcC-CHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCccc-cccCcc-cccCC Confidence 999999998744 6543 4444 33445555544 346776666544321 11111 223321 11111 No 201 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=98.70 E-value=1e-07 Score=58.96 Aligned_cols=365 Identities=13% Similarity=0.051 Sum_probs=170.1 Q ss_pred CccchhhHHHHh-cCCCCcc-cc-CccccCCHHHH---------------HHHH----HcCCccchhhhcchhhhccCCc Q lcl|NC_019404. 1 MVKTDSYANIFL-GGSDGSE-IY-GSLQNQAPTIL---------------ASLY----ADNALVRRIIDTIPETALAAGF 58 (418) Q Consensus 1 ~~~~D~~~n~~~-g~~~~~~-~~-~~~~~~~~~~l---------------~~~Y----~~~~~~r~iVd~~a~d~~r~~~ 58 (418) -+....+...-+ -.+...+ .. ...+.+||..+ ..+| .+.+-+..++.+.-...+...| T Consensus 11 p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~l~~Rk~av~~~~w 90 (526) T protein:vir:99 11 PIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSKRKRAILGLDW 90 (526) T ss_pred ccccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCCCc Confidence 111121211110 0000001 01 11122333222 2233 3567777777777777777778 Q ss_pred cccCc--c---hH---HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccccCCCceEEEEEeeccccc Q lcl|NC_019404. 59 HIDGI--D---DE---PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRTQVK 129 (418) Q Consensus 59 ~i~~~--~---d~---~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl~~~~~i~~i~v~~~~~i~ 129 (418) .|+.. + ++ +.+++.+.++.-+..+..-+-.+.+||.|+.=+.-+ ++. .-.++.+...++.++. T Consensus 91 ~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~lda~~~G~s~~Eivw~~~~g--------~~~~~~l~~r~~~~f~ 162 (526) T protein:vir:99 91 AVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDALDGIGHGYSCIELEWALQGR--------EWMPLAFHHRPQSWFQ 162 (526) T ss_pred eEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHHHhhhhcceeEEEEEeecCC--------ceeEEEeeeeccccee Confidence 88531 1 11 235555555543444444455699999999866432 111 1123344444443332 Q ss_pred cccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 130 VQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLAT 209 (418) Q Consensus 130 ~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~ 209 (418) .. +. .+ . .+.+.. +...+..++|.+.+.+.+.. ...+++|.+.+ +.||....--..+..--+ T Consensus 163 ~~-----~~---~~-~-~l~~~~-~~~~g~~l~~~k~i~~~~~~------~~g~p~g~gLl-r~~~w~~~fK~~~~~~w~ 224 (526) T protein:vir:99 163 LN-----PE---DQ-N-ELRLRD-NSPAGEALQPFGWIIHRPRA------RSGYVARSGLF-RVLAWPYLFRHYATSDLA 224 (526) T ss_pred ec-----cC---CC-c-EEEecC-CCCCceeecCCCeEEEeecC------CcCCccccchH-HHHHHHHHHHHhhHHHHH Confidence 11 11 11 0 111211 12233457777776665432 34567888866 456665554444556666 Q ss_pred HHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCH---HHHHHHHHHH Q lcl|NC_019404. 210 QLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI---DAFLDKKFDR 284 (418) Q Consensus 210 ~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl---~~~~~~~~~~ 284 (418) ..+.+++++ +.|++.. +.+. .+.++.......+... .++.-++.+++.++..-++. ..+++.+-.. T Consensus 225 ~f~E~yG~P~~igky~~~-----a~~~---ek~~L~~av~~i~~d~-~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~ 295 (526) T protein:vir:99 225 EMLEIYGLPIRLGKYPPG-----TADE---EKATLLRAVTGLGHAA-AGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDA 295 (526) T ss_pred HHHHHcCCceEEEecCCC-----CCHH---HHHHHHHHHHHHhhCc-EEEecCCceeEEeecCCCCHHHHHHHHHHHHHH Confidence 778888855 5565421 1111 1222223233334344 44455568899998764443 3455666666 Q ss_pred HhhhhcCCeeeeeccC-cc-------ccccchhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhccC-----C----ceE Q lcl|NC_019404. 285 IVALSGIHEIILKNKN-VG-------GLSSSQNTALETFHKLIDRKRNAELLPIL-EFLIPFIVNAE-----E----WSV 346 (418) Q Consensus 285 iaaas~IP~t~L~G~s-~~-------gl~stge~d~~~y~~~I~~~Qe~~l~p~l-~~l~~~i~~~~-----~----~~~ 346 (418) |+.+ ++||+ ++ |-+|-|+--.....+.+++-.+. +...+ +.|+.-++.-. + -++ T Consensus 296 Isk~-------iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~-i~~tln~~Li~~l~~~N~~~~~~~~~~p~~ 367 (526) T protein:vir:99 296 ISKA-------VLGGTLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQ-LAATLSRDLLWPLLVLNRPGSPDVRRAPRL 367 (526) T ss_pred HHHH-------HhhhhhccccccCcchhhhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhCCCCcCCccccceE Confidence 6643 33544 11 22222333344555555555443 44444 34666554321 1 134 Q ss_pred EeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC-CCHHHHHHHHHhhcCcCCCChhhc-cccc-ccCCCccccccC Q lcl|NC_019404. 347 EFSPLDHESSKDKAEVLEKSVNSIAALIAAGA-MDIKEARDTLRTIAPEIKIGDNDI-QTEE-SELITETEVVIA 418 (418) Q Consensus 347 ~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~-i~~~e~r~~l~~~~~~~~~~~~~~-~~~e-~~~~~e~e~~~~ 418 (418) +|..- |..|+ +..|++++++++.|+ |+.+++++.+.--.+ . ..+++ .... +.........-+ T Consensus 368 ~~~~~------e~eDl-~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~~--~-~~e~~l~~~~~~~~~~~~~~~~~ 432 (526) T protein:vir:99 368 VFDLR------EQADI-TSMAQSIPALVNVGLEIPSAWVYDKLGIPQP--A-KNEPVLRSAAQPAILSRQHGQRV 432 (526) T ss_pred EeCCC------CcccH-HHHHHHHHHHHhCCCccCHHHHHHHhCCCCC--C-CcccccCCCCCCccccccccccc Confidence 44432 22233 457899999999997 999999987742111 0 11111 0000 000000000000 No 202 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=98.62 E-value=5.4e-08 Score=60.43 Aligned_cols=379 Identities=14% Similarity=0.141 Sum_probs=184.0 Q ss_pred hcCCCC---------ccccCc---------cccCCHHHHHHHHHcC----------Cc---cchhhhcchhhhccC---- Q lcl|NC_019404. 12 LGGSDG---------SEIYGS---------LQNQAPTILASLYADN----------AL---VRRIIDTIPETALAA---- 56 (418) Q Consensus 12 ~g~~~~---------~~~~~~---------~~~~~~~~l~~~Y~~~----------~~---~r~iVd~~a~d~~r~---- 56 (418) |+.+.. +..-++ .+-..|..+..+|..+ +- .|++++.-..-.+-. T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~~~~~~ 80 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLIEAKMRF 80 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhhCCccee Confidence 332211 001111 1122345666677664 11 133333333222222 Q ss_pred ---CccccCc-ch---HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCc----cc-ccccC----------- Q lcl|NC_019404. 57 ---GFHIDGI-DD---EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRA----LT-SPVRE----------- 113 (418) Q Consensus 57 ---~~~i~~~-~d---~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~----l~-~pl~~----------- 113 (418) |.+...+ .+ ++.++.-+++=++..++.++-+|+-+-|.++..+.-+.+++ ++ .++++ T Consensus 81 ~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~f~~ed~d~ 160 (527) T protein:vir:10 81 LGQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTYFPYEDPRY 160 (527) T ss_pred eccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcceeeeeecCCC Confidence 2222111 11 12344445566788899999999999998887776554432 11 12221 Q ss_pred CCceEEEEEeeccccccccc-------------c-ccccccc-cCcc----eEEEE---ecC-----Cc------ccccc Q lcl|NC_019404. 114 GAELETVRVYDRTQVKVQNR-------------E-ENPRNAR-FGKP----LTYRI---TTN-----ES------DMFYD 160 (418) Q Consensus 114 ~~~i~~i~v~~~~~i~~~~~-------------~-~dp~s~~-yg~p----~~y~i---~~~-----~~------~~~~~ 160 (418) .+.+.++...+.|..+...- . .|...|- .|.. .+|.+ +.. .. ..+.. T Consensus 161 ~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~ 240 (527) T protein:vir:10 161 PGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPDDIKKLSTLTE 240 (527) T ss_pred CCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchhhhhhhcCcee Confidence 13455666555565554211 0 1112222 1221 12322 000 00 00011 Q ss_pred cCccc-------EEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcc Q lcl|NC_019404. 161 VHYSR-------IHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSE 233 (418) Q Consensus 161 iH~SR-------~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~ 233 (418) +|+.- ++||++ ..+.+..||.|.|+. +..-+....+++--...++.--+..++.+++....-..++ T Consensus 241 l~~lp~pi~fiPvV~~~t------~p~~~~~WG~S~La~-ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~ 313 (527) T protein:vir:10 241 EEPLPEQITTLPVFHFRG------HPIMNAMFGRSGLAG-LESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGN 313 (527) T ss_pred eecccCCCCccceEeecC------CCccccccChhhHhH-HHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCC Confidence 22211 233322 234456899999975 5555555555554444444445677788776653321111 Q ss_pred hHHHHHHHHHHHHHhcCC-cceeEEEcCCCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhH Q lcl|NC_019404. 234 GFGAARLRLAQVDNNSGV-GQAIGIDAESEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNT 310 (418) Q Consensus 234 ~~~~~~~r~~~~~~~~~~-~~~~~~d~~~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~ 310 (418) ... -.. .+.+.-.+++-.+..++. .+.+..+.++.+++.|+..+++|.+-+...-+++ +.||-. T Consensus 314 ~~~------------~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~-~~SG~A 380 (527) T protein:vir:10 314 MVP------------WTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAV-AESGIA 380 (527) T ss_pred cCc------------cccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCc-CcHHHH Confidence 100 112 234444566667887776 4567888899999999999999998774222333 455554 Q ss_pred HHHHHHHHHHHHHHHHH--HHHHHHHHH-Hh--h--------c-c----CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 311 ALETFHKLIDRKRNAEL--LPILEFLIP-FI--V--------N-A----EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA 372 (418) Q Consensus 311 d~~~y~~~I~~~Qe~~l--~p~l~~l~~-~i--~--------~-~----~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~ 372 (418) -.-.+-..+++-|+..+ +-++.+... .+ + . . -.+.+.|.|....++++. .+...+ T Consensus 381 LeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~av-------ie~v~t 453 (527) T protein:vir:10 381 LDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKR-------FNQLLQ 453 (527) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHH-------HHHHHH Confidence 44444445555555432 333322111 10 0 0 1 135799999987777654 667788 Q ss_pred HHhCCCCCHHHHHHHHHhhcCcCCCChhh---cccc-------cccCC-------------CccccccC Q lcl|NC_019404. 373 LIAAGAMDIKEARDTLRTIAPEIKIGDND---IQTE-------ESELI-------------TETEVVIA 418 (418) Q Consensus 373 ~~~~g~i~~~e~r~~l~~~~~~~~~~~~~---~~~~-------e~~~~-------------~e~e~~~~ 418 (418) ++++|++|.+-|.+.|.+.+- ......+ |.++ +-+.. +++|.-.+ T Consensus 454 L~~aGi~S~~tAv~~L~~~~g-~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d~~ 521 (527) T protein:vir:10 454 LWEAGLIPAKKLTEELSKIMG-FELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQA 521 (527) T ss_pred HHHcCchhHHHHHHHHHhccC-CCChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccc Confidence 999999999999988865431 1111111 1000 00000 01111111 No 203 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=98.62 E-value=5.4e-08 Score=60.43 Aligned_cols=379 Identities=13% Similarity=0.143 Sum_probs=184.0 Q ss_pred hcCCCC---------ccccCc---------cccCCHHHHHHHHHcC----------Cc---cchhhhcchhhhccC---- Q lcl|NC_019404. 12 LGGSDG---------SEIYGS---------LQNQAPTILASLYADN----------AL---VRRIIDTIPETALAA---- 56 (418) Q Consensus 12 ~g~~~~---------~~~~~~---------~~~~~~~~l~~~Y~~~----------~~---~r~iVd~~a~d~~r~---- 56 (418) |+.+.. +..-++ .+-..|..+..+|..+ +- .|++++.-..-.+-. T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~~~~~~ 80 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLIEAKMRF 80 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhhCCccee Confidence 332211 001111 1122345666677664 11 133333333222222 Q ss_pred ---CccccCc-ch---HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCc----cc-ccccC----------- Q lcl|NC_019404. 57 ---GFHIDGI-DD---EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRA----LT-SPVRE----------- 113 (418) Q Consensus 57 ---~~~i~~~-~d---~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~----l~-~pl~~----------- 113 (418) |.+...+ .+ ++.+..-+++=++..++.++-+|+-+-|.++..+.-+.+++ ++ .++++ T Consensus 81 ~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~f~~ed~d~ 160 (527) T protein:vir:10 81 LGQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTYFPYEDPRY 160 (527) T ss_pred eccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcceeeeeecCCC Confidence 2222111 11 12344445566788899999999999998887776554432 11 12221 Q ss_pred CCceEEEEEeeccccccccc-------------c-ccccccc-cCcc----eEEEE---ecC-----Cc------ccccc Q lcl|NC_019404. 114 GAELETVRVYDRTQVKVQNR-------------E-ENPRNAR-FGKP----LTYRI---TTN-----ES------DMFYD 160 (418) Q Consensus 114 ~~~i~~i~v~~~~~i~~~~~-------------~-~dp~s~~-yg~p----~~y~i---~~~-----~~------~~~~~ 160 (418) .+.+.++...+.|..+...- . .|...|- .|.. .+|.+ +.. .. ..+.. T Consensus 161 ~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~ 240 (527) T protein:vir:10 161 PGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPDDIKKLSTLTE 240 (527) T ss_pred CCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchhhhhhhcCcee Confidence 13455666555565554211 0 1112222 1221 12322 000 00 00011 Q ss_pred cCccc-------EEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcc Q lcl|NC_019404. 161 VHYSR-------IHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSE 233 (418) Q Consensus 161 iH~SR-------~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~ 233 (418) +|+.- ++||++ ..+.+..||.|.|+. +..-+....+++--...++.--+..++.+++....-..++ T Consensus 241 l~~lp~pi~fiPvV~~~t------~p~~~~~WG~S~La~-ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~ 313 (527) T protein:vir:10 241 EEPLPEQITTLPVFHFRG------HPIMNAMFGRSGLAG-LESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGN 313 (527) T ss_pred eecccCCCCccceEeecC------CCccccccChhhHhH-HHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCC Confidence 22211 233322 234456899999975 5555555555554444444445677788776653321111 Q ss_pred hHHHHHHHHHHHHHhcCC-cceeEEEcCCCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhH Q lcl|NC_019404. 234 GFGAARLRLAQVDNNSGV-GQAIGIDAESEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNT 310 (418) Q Consensus 234 ~~~~~~~r~~~~~~~~~~-~~~~~~d~~~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~ 310 (418) ... -.. .+.+.-.+++-.+..++. .+.+..+.++.+++.|+..+++|.+-+...-+++ +.||-. T Consensus 314 ~~~------------~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~-~~SG~A 380 (527) T protein:vir:10 314 MVP------------WTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAV-AESGIA 380 (527) T ss_pred cCc------------cccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCc-CcHHHH Confidence 100 112 234444566667887776 4567888899999999999999998774222333 455554 Q ss_pred HHHHHHHHHHHHHHHHH--HHHHHHHHH-Hh--h--------c-c----CCceEEeCCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 311 ALETFHKLIDRKRNAEL--LPILEFLIP-FI--V--------N-A----EEWSVEFSPLDHESSKDKAEVLEKSVNSIAA 372 (418) Q Consensus 311 d~~~y~~~I~~~Qe~~l--~p~l~~l~~-~i--~--------~-~----~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~ 372 (418) -.-.+-..+++-|+..+ +-++.+... .+ + . . -.+.+.|.|....++++. .+.... T Consensus 381 LeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~av-------ie~v~t 453 (527) T protein:vir:10 381 LDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKR-------FAQLLE 453 (527) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHH-------HHHHHH Confidence 44444445555555432 333322111 10 0 0 1 135799999987777654 567778 Q ss_pred HHhCCCCCHHHHHHHHHhhcCcCCCChhhcc---cc-------cccCC-------------CccccccC Q lcl|NC_019404. 373 LIAAGAMDIKEARDTLRTIAPEIKIGDNDIQ---TE-------ESELI-------------TETEVVIA 418 (418) Q Consensus 373 ~~~~g~i~~~e~r~~l~~~~~~~~~~~~~~~---~~-------e~~~~-------------~e~e~~~~ 418 (418) ++++|++|.+-|.+.|.+.+- ......++. ++ +-+.. +++|.-.+ T Consensus 454 L~~aGiiS~etAv~~L~~~~g-~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~ 521 (527) T protein:vir:10 454 LWEAGLIPAKKLTEELSKIMG-FELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQA 521 (527) T ss_pred HHHcCchhHHHHHHHHHhccC-CCchHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccc Confidence 999999999999988865431 111111110 00 00000 01111111 No 204 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=98.55 E-value=3.1e-07 Score=56.28 Aligned_cols=364 Identities=13% Similarity=0.074 Sum_probs=168.5 Q ss_pred CccchhhHHH----HhcCCCCccccCccccCCHHHHHH---------------HH----HcCCccchhhhcchhhhccCC Q lcl|NC_019404. 1 MVKTDSYANI----FLGGSDGSEIYGSLQNQAPTILAS---------------LY----ADNALVRRIIDTIPETALAAG 57 (418) Q Consensus 1 ~~~~D~~~n~----~~g~~~~~~~~~~~~~~~~~~l~~---------------~Y----~~~~~~r~iVd~~a~d~~r~~ 57 (418) -++...+... +.+..+. ......+.+||..+.. +| .+.+-+..++.+--...+... T Consensus 11 p~~~~~~~~~~~~~~~~~~~~-~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s~l~~Rk~av~~~~ 89 (526) T protein:vir:79 11 PIRPQQLREPQTSRLAGLAKE-FAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSKRKRAILGLD 89 (526) T ss_pred ccCccccchhhhhhhhhhhhh-cccCCCCCcCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCCC Confidence 1222222111 1111111 0111222344433322 23 245556666666666666667 Q ss_pred ccccC--cc---hH---HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccccCCCceEEEEEeecccc Q lcl|NC_019404. 58 FHIDG--ID---DE---PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRTQV 128 (418) Q Consensus 58 ~~i~~--~~---d~---~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl~~~~~i~~i~v~~~~~i 128 (418) |.|+- ++ +. +.+++.+.++.-+..+..-+-.+..||.|+.=+.-+ ++.. -.++.+...++.++ T Consensus 90 w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~ldA~~~G~s~~Ei~w~~~~g~--------~~~~~l~~r~~~~F 161 (526) T protein:vir:79 90 WAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDALDGIGHGYSCIELEWALQGRE--------WMPLAFHHRPQSWF 161 (526) T ss_pred ceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHHHhhhhhcceeEEEEEeecCCc--------eeEEEeeeecccce Confidence 77752 11 11 135555556543444555555599999999866432 1111 12334444444333 Q ss_pred ccccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 129 KVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLA 208 (418) Q Consensus 129 ~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~ 208 (418) .. |+.. + . ...+.. +...+..++|-+.+.+.+.. ...+++|.+.+ +.||....--..+...- T Consensus 162 ~~-----~~~~---~-~-~l~~~~-~~~~g~~l~~~k~iv~~~~~------~~g~p~g~gLl-r~~~w~~~fK~~~~~~w 223 (526) T protein:vir:79 162 QL-----NPED---Q-N-ELRLRD-NSPAGEALQPFGWIIHRPRA------RSGYVARSGLF-RVLAWPYLFRHYATSDL 223 (526) T ss_pred Ee-----ccCC---C-c-EEEecC-CCCCceeecCCceEEEeecC------CcCCccccchH-HHHHHHHHHHHhhHHHH Confidence 21 1111 1 0 111111 12223456777766665432 34567888866 55666555444456666 Q ss_pred HHHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCH---HHHHHHHHH Q lcl|NC_019404. 209 TQLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI---DAFLDKKFD 283 (418) Q Consensus 209 ~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl---~~~~~~~~~ 283 (418) +..+.+++++ +.|++.. +.+. -+.++.......+... .++.-++.+++.++..-++. ..+++.+-. T Consensus 224 ~~F~E~yG~P~~igky~~~-----a~~~---ek~~L~~av~~i~~da-~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~ 294 (526) T protein:vir:79 224 AEMLEIYGLPIRLGKYPPG-----TADE---EKATLLRAVTGLGHAA-AGIIPETMAIDFQQAAQGSSEPFLAMMRQSED 294 (526) T ss_pred HHHHHHcCCceEEEecCCC-----CCHH---HHHHHHHHHHHHhcCc-EEEecCCceeEEeecCCCCHHHHHHHHHHHHH Confidence 6778888855 5565421 1111 1223333333334344 44455568899999765443 345556666 Q ss_pred HHhhhhcCCeeeeeccC-cc-------ccccchhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhccC-----C----ce Q lcl|NC_019404. 284 RIVALSGIHEIILKNKN-VG-------GLSSSQNTALETFHKLIDRKRNAELLPIL-EFLIPFIVNAE-----E----WS 345 (418) Q Consensus 284 ~iaaas~IP~t~L~G~s-~~-------gl~stge~d~~~y~~~I~~~Qe~~l~p~l-~~l~~~i~~~~-----~----~~ 345 (418) .||.+ ++||+ ++ |-+|-|+--.....+.+++-.+. +...+ +.|+.-++.-. + -+ T Consensus 295 ~Isk~-------iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~-i~~tln~~Li~~l~~~N~~~~~~~~~~p~ 366 (526) T protein:vir:79 295 AISKA-------VLGGTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQ-LAATLSRDLLWPLLVLNRPGSPDVRRAPR 366 (526) T ss_pred HHHHH-------HhhhhhccccccCcchhhhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhCCCCcCCccccce Confidence 66643 23444 11 22223443445566666665543 44444 34666654321 1 12 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC-CCHHHHHHHHHhhcCcCCCChhhccccccc--CCCccc----cccC Q lcl|NC_019404. 346 VEFSPLDHESSKDKAEVLEKSVNSIAALIAAGA-MDIKEARDTLRTIAPEIKIGDNDIQTEESE--LITETE----VVIA 418 (418) Q Consensus 346 ~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~-i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~--~~~e~e----~~~~ 418 (418) +.|..- |..|+ +..|++++++++.|+ |+.+.+++.+.--.+ + ..+++...... ...... ...+ T Consensus 367 ~~~~~~------e~eDl-~~~a~~~~~L~~~G~~i~~~~i~e~~gip~~--~-~~e~~l~~~~~~~~~~~~~~~~~~~~~ 436 (526) T protein:vir:79 367 LVFDLR------EQADI-TSMAQSIPALVNVGLEIPSAWVYDKLGIPQP--A-KNEPVLRPAAQPAILSRQHGQRVAALA 436 (526) T ss_pred EEeCCC------CcccH-HHHHHHHHHHHhCCCcCCHHHHHHHhCCCCC--C-CchhhccccCCcccccccccccccccc Confidence 444332 22222 457899999999997 899999887642111 1 11111100000 000000 0000 No 205 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=98.53 E-value=3.5e-07 Score=56.00 Aligned_cols=383 Identities=12% Similarity=0.130 Sum_probs=175.6 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHc---CCccchhhhcchhhhccC-----CccccCcc------hH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHIDGID------DE 66 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i~~~~------d~ 66 (418) =-..||-.+.++|+..+...--.....+-.+|-.-||+ +|.+..+|+.++++|+-- .+.+.-++ -. T Consensus 25 ~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK 104 (537) T protein:vir:10 25 KDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECDSAVDDVVNETICGNFDDVPISIDLHNLKQSEKIK 104 (537) T ss_pred CCcccccceeecccccccccccccccchHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHH Confidence 11223334433332222111112233345677777765 899999999999999762 23332111 12 Q ss_pred HHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccccc-------- Q lcl|NC_019404. 67 PAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNRE-------- 134 (418) Q Consensus 67 ~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~-------- 134 (418) ++|.++++. |++..+-.+.+|.--+.|.-+.-..++. + +++..|..++.+|+..+...... T Consensus 105 ~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~-k------~pk~GI~ELr~lDPr~i~~vR~i~~~~~~~~ 177 (537) T protein:vir:10 105 KLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDP-K------KPRQGLVELRYVDPRKIRKVTEYEAKRPEAL 177 (537) T ss_pred HHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeC-C------CccccceeeeeeCCccceeeEeecccCCccc Confidence 456666654 5667777777776666665554444422 1 23456667777777665432111 Q ss_pred --ccccccccCc-ceEEEEecC----CcccccccCcccEEEecCccchhhhhhccccCCcchHHHHH--HHHHHHHHHHH Q lcl|NC_019404. 135 --ENPRNARFGK-PLTYRITTN----ESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDI--LDSIKDYTNCE 205 (418) Q Consensus 135 --~dp~s~~yg~-p~~y~i~~~----~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~--~~~l~~~~~~~ 205 (418) .+-...-+.. -++|.+++. +...+.+|+++ .|+|+..-+-+ .+.+...|-|..++ .+.|+-.+.+ T Consensus 178 ~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~d-AI~y~hSGl~d----~n~~~i~syLhkAiKp~NQLkm~EDA- 251 (537) T protein:vir:10 178 RTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAPD-SIAYCHSGIQD----LNKNMVLSHLHKAIKAVNQLRMIEDS- 251 (537) T ss_pred eEEecceeeeecccceeeeccccccccCCCceeccHh-heeeeccccee----CCCCeeeeeehhhhHHHHhhHHHHhh- Confidence 1111111111 223333321 12234567664 34444333222 22233344343321 1223322222 Q ss_pred HHHHHHHHHcC----CceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC--------------------- Q lcl|NC_019404. 206 RLATQLLRRKQ----QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE--------------------- 260 (418) Q Consensus 206 ~~~~~l~~~~~----~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~--------------------- 260 (418) .++++.+ -.|+=++ .+++... +.+ +-+. ..+...+ +-++-|.. T Consensus 252 ----lVIYRitRAPeRRvFYID-VGnLPk~-KAe-qYlr--~iM~k~K---NklVYDa~TGev~ddrk~msMlEDyWLPR 319 (537) T protein:vir:10 252 ----LVIYRLSRAPERRIFYID-VGNLPKN-KAE-QYLR--EVMGRYR---NKLVYDANTGEIKDDKKFMSMLEDFWLPR 319 (537) T ss_pred ----HHHHhhhccccceEEEEe-cCCCCch-hHH-HHHH--HHHHhcc---ceEEEeccCceecccchhhhhhhhhcccc Confidence 2233321 1233333 2232211 111 1111 1111111 11111211 Q ss_pred -----CCceeEee--cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCcccccc--ch--hHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 261 -----SEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSS--SQ--NTALETFHKLIDRKRNAELLP 329 (418) Q Consensus 261 -----~e~~~~~~--~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~s--tg--e~d~~~y~~~I~~~Qe~~l~p 329 (418) +-+++.+. -+++.++| +..|...+=.+.++|.++|-.+ +|+|- ++ .-|.-.|..+|.+.|..+ .. T Consensus 320 ReGgrgTEItTLpGgqnlgem~D-V~YF~kKLy~aLnVP~SRl~~e--~~f~~Gr~~EItRDEiKF~KFI~RLR~rF-s~ 395 (537) T protein:vir:10 320 REGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETE--TTFNIGRAAEITRDEVKFQKFIARLRKRF-SE 395 (537) T ss_pred cCCCcccceeeccccCCcChHHH-HHHHHHHHHHHhCCCccccCCC--CcccccccchhhHHHHHHHHHHHHHHHHH-HH Confidence 12233333 24555666 4688899999999999999444 45442 21 123356999999998764 44 Q ss_pred HHHHHHHH------hhcc-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHHHHHHHHHhhcCc Q lcl|NC_019404. 330 ILEFLIPF------IVNA-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIA--AGAMDIKEARDTLRTIAPE 394 (418) Q Consensus 330 ~l~~l~~~------i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~--~g~i~~~e~r~~l~~~~~~ 394 (418) ++..+++. ++.. +++.|+|..=..-+|...+|+...+..+++.+-. .-.++.+-++...-.. T Consensus 396 lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~--- 472 (537) T protein:vir:10 396 LFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQ--- 472 (537) T ss_pred HHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhcc--- Confidence 44444332 2222 3567888877777788888888888888777521 1134554444321111 Q ss_pred CCCChhhccccccc----------------------------CCCccccccC Q lcl|NC_019404. 395 IKIGDNDIQTEESE----------------------------LITETEVVIA 418 (418) Q Consensus 395 ~~~~~~~~~~~e~~----------------------------~~~e~e~~~~ 418 (418) +|++|.+.+.. .+++.+..-| T Consensus 473 ---tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (537) T protein:vir:10 473 ---TESEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQT 521 (537) T ss_pred ---CHHHHHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCccc Confidence 11111111111 1111111111 No 206 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=98.51 E-value=2.8e-07 Score=56.52 Aligned_cols=379 Identities=11% Similarity=0.134 Sum_probs=171.0 Q ss_pred Cc---cchhhHHHHhcCCCCccccCc-cccCCHHHHHHHHHc---CCccchhhhcchhhhccC-----CccccCc--c-- Q lcl|NC_019404. 1 MV---KTDSYANIFLGGSDGSEIYGS-LQNQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHIDGI--D-- 64 (418) Q Consensus 1 ~~---~~D~~~n~~~g~~~~~~~~~~-~~~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i~~~--~-- 64 (418) .+ .-||-.++ .++|-.+.+++. ...++-.+|-..|++ +|.+..+|+.++++|+-. .+.+.-+ + T Consensus 22 ~~~p~~ddg~~~~-~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s 100 (558) T protein:vir:10 22 PVPKNNEDGVDNF-ISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEADGAIEDVVNEAIVSDLYDSPVEVELSNLNAS 100 (558) T ss_pred ccCCCccccccce-eccceeeeeecccchhhhHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcc Confidence 11 12222222 122222222221 222345677667764 899999999999999762 2333211 1 Q ss_pred --hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccc-cccc Q lcl|NC_019404. 65 --DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNR-EENP 137 (418) Q Consensus 65 --d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~-~~dp 137 (418) -.++|.++++. |++..+-.+.+|.--+.|.-+.-..++.. +++..|..++.+|+..+....- ...+ T Consensus 101 ~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k-------~pk~GI~ELr~lDPr~i~~Vr~i~~~~ 173 (558) T protein:vir:10 101 NTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTK-------NPQEGIQDLRYIDPLKIKFIRQEKRKP 173 (558) T ss_pred hHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCC-------CccccceeeeeeCcccceeeeeecccc Confidence 23467777665 46677777777766666665544444221 2445666777777766543211 0110 Q ss_pred c------------c--cccCcceEEEEecC-----------CcccccccCcc-------cEEEecCccchhhhhhccccC Q lcl|NC_019404. 138 R------------N--ARFGKPLTYRITTN-----------ESDMFYDVHYS-------RIHIIDGERVPNAMRRQNDGW 185 (418) Q Consensus 138 ~------------s--~~yg~p~~y~i~~~-----------~~~~~~~iH~S-------R~i~~~g~~lp~~~~~~~~~~ 185 (418) . . .+=+--++|.+++. +.+.+.+|+.+ -++..++.-++.++..+-- T Consensus 174 ~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~dAI~y~hSGL~d~~~~~i~syLhkAIK-- 251 (558) T protein:vir:10 174 GNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIAKDSITMCTSGLVDRNKNRVLSYLHKAIK-- 251 (558) T ss_pred ccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeechhheeeecccceecCCCeeeecchHhhH-- Confidence 0 0 00011133333321 11222344443 2333333322222221111 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHc---C-CceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC- Q lcl|NC_019404. 186 GRSVLSSDILDSIKDYTNCERLATQLLRRK---Q-QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE- 260 (418) Q Consensus 186 G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~---~-~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~- 260 (418) ..+.|+-.+.+ .++++. - -.|+=++ .+++... ..+ +-+. ..+...+ +-++-|.. T Consensus 252 --------p~NQLkmlEDA-----lVIYRitRAPERRvFYID-VGnLPk~-KAe-qYlr--~iM~k~K---NklVYDa~T 310 (558) T protein:vir:10 252 --------ALNQLRMIEDS-----LVIYRLSRAPERRIFYID-VGNLPKV-KAE-QYLK--EVMSRYR---NKLVYDANT 310 (558) T ss_pred --------hHHhhHHHHhh-----HHHHhhhccccceEEEEe-cCCCCch-hHH-HHHH--HHHHhcc---ceEEEeccC Confidence 11222222222 222222 1 1233333 2222111 111 1111 1111111 11111111 Q ss_pred -------------------------CCceeEee--cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch--hHH Q lcl|NC_019404. 261 -------------------------SEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ--NTA 311 (418) Q Consensus 261 -------------------------~e~~~~~~--~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg--e~d 311 (418) +-+++.+. -+++.++| +..|...+=.+.+||.++|-.++.-.++.++ .-| T Consensus 311 Gev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~D-V~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRD 389 (558) T protein:vir:10 311 GEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGELSD-VDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRD 389 (558) T ss_pred ceecccchhhhhHhhhcccccCCCCccceeeccccCCcchHHH-HHHHHHHHHHHhCCCccccCCCCcccccccchhhHH Confidence 12233333 24566666 4688899999999999999554332222221 123 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH------hhcc-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--C Q lcl|NC_019404. 312 LETFHKLIDRKRNAELLPILEFLIPF------IVNA-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIA--A 376 (418) Q Consensus 312 ~~~y~~~I~~~Qe~~l~p~l~~l~~~------i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~--~ 376 (418) .-.|..+|.+.|..+ ..++..+++. ++.. +++.|+|..=..-+|...+|+...+..+++.+-. . T Consensus 390 EiKF~KFI~RLR~rF-s~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvG 468 (558) T protein:vir:10 390 ELKFAKFVGRLRKRF-AAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIG 468 (558) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 346999999998754 4444444332 2222 3567888877777888889998888888877632 2 Q ss_pred CCCCHHHHHHHHHhhcCcCCCChhhcccccccCC----------CccccccC Q lcl|NC_019404. 377 GAMDIKEARDTLRTIAPEIKIGDNDIQTEESELI----------TETEVVIA 418 (418) Q Consensus 377 g~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~----------~e~e~~~~ 418 (418) -.+|.+-++...-.. +|++|.+.+..++ ++.+.+.+ T Consensus 469 ky~S~dyi~k~ILr~------tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~ 514 (558) T protein:vir:10 469 KYYSTEYVRKRVLRQ------TDMEIEEIDTQIEDEIQKGIIPDPSQIDPIT 514 (558) T ss_pred cccchHHHHHHHhcc------CHHHHHHHHHHHHHHHhCCCCCCccccChhh Confidence 245666555432111 1222211111100 11111111 No 207 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=98.45 E-value=6.1e-07 Score=54.66 Aligned_cols=386 Identities=12% Similarity=0.090 Sum_probs=177.9 Q ss_pred Cccchh---------hHHHHhcCCCCccccC--ccccCCHHHHHHHHHc---CCccchhhhcchhhhccC-----Ccccc Q lcl|NC_019404. 1 MVKTDS---------YANIFLGGSDGSEIYG--SLQNQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHID 61 (418) Q Consensus 1 ~~~~D~---------~~n~~~g~~~~~~~~~--~~~~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i~ 61 (418) ++.-|. -.|-+.+++-....|+ .+...+-.+|-..|++ +|.+..+|+.++++|+-. .+++. T Consensus 33 ~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~ 112 (524) T protein:vir:10 33 VTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYRNLMNNYEVDNAVQEIVSDAIVYEDDKEVVALN 112 (524) T ss_pred cccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEE Confidence 111110 0111121111112222 1222344566666654 899999999999999762 23332 Q ss_pred Cc--c----hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccc Q lcl|NC_019404. 62 GI--D----DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ 131 (418) Q Consensus 62 ~~--~----d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~ 131 (418) -+ + -.++|.++++. |++..+-.+.+|.--+.|.-+.-..++ |=+++..|..++.+||..+... T Consensus 113 Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid-------~~~pk~GI~Elr~lDPr~i~~v 185 (524) T protein:vir:10 113 LDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHKIIN-------PKKMKDGVQELRRLDPRQVQYI 185 (524) T ss_pred ecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEEEee-------CCCccccceeeeeeCCccceee Confidence 11 1 12456666654 466776667676555566544433332 1224456777777777666432 Q ss_pred -cccccccc--cccCc-ceEEEEecC-----------CcccccccCcccEEEecCccchhhhhhccccCCcchHHHH--H Q lcl|NC_019404. 132 -NREENPRN--ARFGK-PLTYRITTN-----------ESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSD--I 194 (418) Q Consensus 132 -~~~~dp~s--~~yg~-p~~y~i~~~-----------~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~--~ 194 (418) ....++.. ..+.. -++|.++++ +.....+|+.+-|++....-++. ....=.|-|..+ . T Consensus 186 r~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~dAIvy~~SGL~d~-----~~~~i~syLhkAiKp 260 (524) T protein:vir:10 186 REIVTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRAAVVYAHSGLLDC-----CGKNIIGYLQRAIKP 260 (524) T ss_pred eeecccCcccchhhcchhhheeecCCCcccccCcceecCCcceecchhheeeeccCcccC-----CCCceeccchHhhHH Confidence 11111111 01111 223333221 12233567777655443221111 100011212221 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHcC----CceeecchHHHhhcCcchHHHHHHHHHHHHHhcC------CcceeE-------- Q lcl|NC_019404. 195 LDSIKDYTNCERLATQLLRRKQ----QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSG------VGQAIG-------- 256 (418) Q Consensus 195 ~~~l~~~~~~~~~~~~l~~~~~----~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~------~~~~~~-------- 256 (418) .+.|+-.+.+ .++++.+ -.|+=++ .+++... +. ++-+. ..+...++ .++.+- T Consensus 261 ~NQLkm~EDA-----lVIYRitRAPeRRvFYID-VGnlPk~-KA-eqYl~--~im~k~kNKlvYDa~TGev~ddrk~msM 330 (524) T protein:vir:10 261 ANQLKLMEDA-----MVIYRITRAPDRRVFYID-TGNMPSR-KA-AAQMQ--HIMNTMKNRVVYDASTGKIKNQQHNMSM 330 (524) T ss_pred HHhhHHHHhh-----HHHHhhhccccceEEEEe-cCCCCch-hH-HHHHH--HHHHhcCceeEEeccCCeeccchhhhhh Confidence 1223222222 2233321 1223232 2222111 11 11111 01111111 011100 Q ss_pred -----E----EcCCCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCcccccc--ch--hHHHHHHHHHHHH Q lcl|NC_019404. 257 -----I----DAESEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSS--SQ--NTALETFHKLIDR 321 (418) Q Consensus 257 -----~----d~~~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~s--tg--e~d~~~y~~~I~~ 321 (418) + .+.+-+++.+.- +++.++| +..|..-+=.+.++|.++|-.++++|++- ++ .-|.-.|..+|.+ T Consensus 331 lEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF~KFI~r 409 (524) T protein:vir:10 331 TEDYWLQRRDGKAVTEVDTMPGATGMSDMDD-VLYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDELKFAKWIRQ 409 (524) T ss_pred HhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCchhccCCCCccccccccchhhHHHHHHHHHHHH Confidence 0 011122333332 4556666 46888999999999999996666666542 11 1233569999999 Q ss_pred HHHHHHHHHHHHHHHH------hhcc-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHHHHHH Q lcl|NC_019404. 322 KRNAELLPILEFLIPF------IVNA-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIA--AGAMDIKEARD 386 (418) Q Consensus 322 ~Qe~~l~p~l~~l~~~------i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~--~g~i~~~e~r~ 386 (418) .|..+ .+++..+++. ++.. +++.|+|..=..-+|...+|+...+..+++.+-. .-.++.+-++. T Consensus 410 LR~rF-s~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k 488 (524) T protein:vir:10 410 LQNKF-EEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTAMK 488 (524) T ss_pred HHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHH Confidence 98754 4444444332 2222 3567888877777888889998888888887643 22457766665 Q ss_pred HHHhhcCcCCCChhhccccccc------------CCCccccc Q lcl|NC_019404. 387 TLRTIAPEIKIGDNDIQTEESE------------LITETEVV 416 (418) Q Consensus 387 ~l~~~~~~~~~~~~~~~~~e~~------------~~~e~e~~ 416 (418) ..-. .+|++|.+.+.. +++|.|.- T Consensus 489 ~ILr------~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 489 DFLQ------MTDEEINQEAKQIEEESKEARFQNPDEEEEDF 524 (524) T ss_pred HHhc------cCHHHHHHHHHHHHHHhhcCCCCCCChhhhcC Confidence 4322 234343333222 22222222 No 208 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=98.42 E-value=7.3e-07 Score=54.23 Aligned_cols=380 Identities=15% Similarity=0.127 Sum_probs=171.8 Q ss_pred CccchhhHHHH-hcCCCCccccCcc-c--cCCH--------HHHHHHHHcCCccchhhhcchhhhccCCccccCcc--hH Q lcl|NC_019404. 1 MVKTDSYANIF-LGGSDGSEIYGSL-Q--NQAP--------TILASLYADNALVRRIIDTIPETALAAGFHIDGID--DE 66 (418) Q Consensus 1 ~~~~D~~~n~~-~g~~~~~~~~~~~-~--~~~~--------~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~--d~ 66 (418) |++.=..+..+ -.++++-...+.. . .+++ ..++.|-.+.+-+..++++.....++..|.|+..+ ++ T Consensus 4 ~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~w~v~p~~~~~e 83 (469) T protein:vir:10 4 RVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTPWRIRANGASDE 83 (469) T ss_pred cccCCCCccchhhhhhcccccchhhccccccccccccccchHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCCCHH Confidence 33333332211 0111111111110 0 0111 12223334688889999999888888888886322 21 Q ss_pred --HHHHHHHHHh-----------------CchHHHHHHHHhccccceEEEEEeec-CCCcccccccCCCceEEEEEeecc Q lcl|NC_019404. 67 --PAFWSRWDDL-----------------EMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRT 126 (418) Q Consensus 67 --~~i~~~~~~l-----------------~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl~~~~~i~~i~v~~~~ 126 (418) +.+.+.+... .....|.+.+-.+..||.|+.=+.-+ ++... .. .-.+..+...++. T Consensus 84 ~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~-dG---~~~~~~l~~rp~~ 159 (469) T protein:vir:10 84 VTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSP-DG---RFWLRKLAPRPQW 159 (469) T ss_pred HHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccC-CC---ceeeeeeeecCcc Confidence 2233333221 12345566666678899999865432 11100 00 0011122222211 Q ss_pred -----ccccc-----cccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHH Q lcl|NC_019404. 127 -----QVKVQ-----NREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILD 196 (418) Q Consensus 127 -----~i~~~-----~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~ 196 (418) .+.+. .+...|..+.-+.+ |. ....+..+.+.+.|.+.... ...+++|.+.+ +.||. T Consensus 160 ~i~~~~~~~~~~l~~~~~~~~~~~~~~~~--~~----~~~~~~~lp~~k~i~~~~~~------~~g~p~g~gLl-r~~~~ 226 (469) T protein:vir:10 160 TISKFNVAPDGGLESIEQIAPPARTRGSL--YV----ANIAPPEIPVNRLVVYTRNK------RPGQWQGKSIL-RSAYK 226 (469) T ss_pred cceeeeeccCCceeeeeecCccccccccc--cc----CCCCccccccCcEEEEEecC------CCCCcccchhH-HHHHH Confidence 11110 00001110000000 00 01123468888888775432 45667888866 56877 Q ss_pred HHHHHHHHHHHHHHHHHHcC--CceeecchHHHhhcCcchHHHHHHHHHHHHH-hcCCcceeEEEcCCCceeEeecccC- Q lcl|NC_019404. 197 SIKDYTNCERLATQLLRRKQ--QAVWKAKGLAELCDDSEGFGAARLRLAQVDN-NSGVGQAIGIDAESEEYSVLNSDIG- 272 (418) Q Consensus 197 ~l~~~~~~~~~~~~l~~~~~--~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~-~~~~~~~~~~d~~~e~~~~~~~~~~- 272 (418) ...--..+...-+..+.+++ +.+.|.+.. ..+. ..+.+..... .++...+.++..++.+++.++.+-+ T Consensus 227 ~~~fK~~~~~~w~~f~EryG~P~~vgky~~~-----a~~~---ek~~l~~a~~~~~~g~~a~~iip~~~~ie~~ea~g~~ 298 (469) T protein:vir:10 227 HWLLKDKLLRIEAATAERNGMGIPVGTASSA-----TDED---EVRKMAALARSVRGGINAGVGLAQGQILELLGVSGNL 298 (469) T ss_pred HHHHHHHHHHHHHHHHHHcCCcceEEecCCC-----CCHH---HHHHHHHHHHHHhcCCceEEEccCCceEEEeecCCCc Confidence 75555556666777788876 445566521 1111 1222222222 2222344445556678998886532 Q ss_pred -CHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhhcc----C--Cc Q lcl|NC_019404. 273 -GIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPILE-FLIPFIVNA----E--EW 344 (418) Q Consensus 273 -gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~-~l~~~i~~~----~--~~ 344 (418) .-..+++.+-..|+.+.=-. | |..++.+|-.|.|+--.....+.+++..+. +...++ .|++-++.- . -. T Consensus 299 ~~~~~li~~~d~~Isk~iLG~-t-lTs~~~gGS~a~~~vh~ev~~d~~~sDa~~-i~~tln~~li~~l~~lN~g~~~~~P 375 (469) T protein:vir:10 299 PDIRRAIEGHDRSIALSGLAH-F-LNLDGKGGSYALASVLEDPFTQAVHAYATS-ICRIANQHIIEDLVDINFGVDTPAP 375 (469) T ss_pred hHHHHHHHHHHHHHHHHHhcc-c-ccccCccchhhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCCCCcc Confidence 24456666666677443222 1 122222333344555556677777776554 445553 466655431 1 13 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCC-----HHHHHHHHHhhcCcCCCChhhccccccc----C------ Q lcl|NC_019404. 345 SVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMD-----IKEARDTLRTIAPEIKIGDNDIQTEESE----L------ 409 (418) Q Consensus 345 ~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~-----~~e~r~~l~~~~~~~~~~~~~~~~~e~~----~------ 409 (418) +|+|...- +. .+..|++++.+++.|++. .+.+++.+.--.+ ...+...+...+. . T Consensus 376 ~~~~~~~e---~~-----~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~--~~~~~~~~~~~~~~~~~~~~~~~~ 445 (469) T protein:vir:10 376 VLTFDPIG---SR-----QDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSE--LNDTPSAEPEEPAAVPNQSAAPAR 445 (469) T ss_pred EEEecCCC---Cc-----HHHHHHHHHHHHhcCCccCccccHHHHHHHhCCCCC--CCCcccccchhcccCCCCCccccc Confidence 56665432 11 134688999999999954 4455655421101 0000001000000 0 Q ss_pred ---------------CCccccccC Q lcl|NC_019404. 410 ---------------ITETEVVIA 418 (418) Q Consensus 410 ---------------~~e~e~~~~ 418 (418) ....++-.| T Consensus 446 ~~~~~~~~~~~~~~~~~~~~l~da 469 (469) T protein:vir:10 446 TRSSGNADARARAPKADQGVLFDA 469 (469) T ss_pred cCCCCCcccccccCCChHHhhccC Confidence 001111111 No 209 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=98.40 E-value=8.3e-07 Score=53.91 Aligned_cols=383 Identities=13% Similarity=0.124 Sum_probs=180.6 Q ss_pred CccchhhHHHHhcCC-CCccccCccc---------cCCHHHHHHHHHc---CCccchhhhcchhhhccC-----CccccC Q lcl|NC_019404. 1 MVKTDSYANIFLGGS-DGSEIYGSLQ---------NQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHIDG 62 (418) Q Consensus 1 ~~~~D~~~n~~~g~~-~~~~~~~~~~---------~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i~~ 62 (418) -...||-...-...+ ..+.++|... .++-.+|-..|++ +|.+..+|+.++++|+-- .+++.- T Consensus 34 p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L 113 (524) T protein:vir:72 34 PKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYRNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNL 113 (524) T ss_pred ccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEe Confidence 333444322211111 1112222111 2345667666754 899999999999999762 233322 Q ss_pred cc------hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccc Q lcl|NC_019404. 63 ID------DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQN 132 (418) Q Consensus 63 ~~------d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~ 132 (418) ++ -+++|.++++. |++..+-.+.+|.--+.|.-+.-..++. =+++..|+.++.+||..+.... T Consensus 114 ~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~-------k~pk~GI~Elr~lDPr~i~~vr 186 (524) T protein:vir:72 114 DKSKFSPKIKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDP-------KRPKEGIKELRRLDPRQVQYVR 186 (524) T ss_pred cCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeC-------CCccccceeeeeeCCccceeee Confidence 11 12356666654 5667777777776666665554444422 1245567777777777664421 Q ss_pred -ccccccc--ccc-CcceEEEEecC-----------CcccccccCcccEEEecCccchhhhhhccccCCcchHHHH--HH Q lcl|NC_019404. 133 -REENPRN--ARF-GKPLTYRITTN-----------ESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSD--IL 195 (418) Q Consensus 133 -~~~dp~s--~~y-g~p~~y~i~~~-----------~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~--~~ 195 (418) ...++.. ..+ +--++|.++++ ..+...+||.+=+ +|+..-+-+. ....=.|-|..+ .. T Consensus 187 ~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI-~y~hSGL~d~----~~~~i~gyLhkAiKp~ 261 (524) T protein:vir:72 187 EIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAV-VYAHSGLVDC----CGKNIIGYLHRAVKPA 261 (524) T ss_pred eeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhhe-eeeeccceeC----CCCceeccchhhhHhH Confidence 1111111 111 11223333321 1123345665543 3332222111 000001112211 11 Q ss_pred HHHHHHHHHHHHHHHHHHHc---C-CceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC----------- Q lcl|NC_019404. 196 DSIKDYTNCERLATQLLRRK---Q-QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE----------- 260 (418) Q Consensus 196 ~~l~~~~~~~~~~~~l~~~~---~-~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~----------- 260 (418) +.|+-.+.+ -++++. - -.|+=++ .+++... +. ++-+. ..+...+ +-++-|.. T Consensus 262 NQLkmlEDA-----lVIYRitRAPeRRvFYID-vGnlPk~-KA-eqYl~--~im~k~K---NklvYDa~TGev~ddrk~m 328 (524) T protein:vir:72 262 NQLKLLEDA-----VVIYRITRAPDRRVWYVD-TGNMPAR-KA-AEHMQ--HVMNTMK---NRVVYDASTGKIKNQQHNM 328 (524) T ss_pred HhhhHHHhh-----HHHHhhhccccceEEEEe-cCCCCch-hH-HHHHH--HHHHhcC---ceeEEeCCCCeeccchhhh Confidence 222222222 222321 1 1233333 2222111 11 11111 1111111 11111211 Q ss_pred ---------------CCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCcccccc--ch--hHHHHHHHHHH Q lcl|NC_019404. 261 ---------------SEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSS--SQ--NTALETFHKLI 319 (418) Q Consensus 261 ---------------~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~s--tg--e~d~~~y~~~I 319 (418) +-+++.+.- +++.++| +..|...+=.|.++|.++|-+.+++|++- ++ .-|.-.|..+| T Consensus 329 sMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI 407 (524) T protein:vir:72 329 SMTEDYWLQRRDGKAVTEVDTLPGADNTGNMED-IRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFI 407 (524) T ss_pred hhHhhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHH Confidence 122333332 4555666 46888999999999999997777777652 11 12335699999 Q ss_pred HHHHHHHHHHHHHHHHHH------hhcc-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHHHH Q lcl|NC_019404. 320 DRKRNAELLPILEFLIPF------IVNA-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIA--AGAMDIKEA 384 (418) Q Consensus 320 ~~~Qe~~l~p~l~~l~~~------i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~--~g~i~~~e~ 384 (418) .+.|..+ ..++..+++. ++.. +++.|+|..=..-+|...+|+...+..+++.+-. .-.++.+-+ T Consensus 408 ~rLR~rF-s~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi 486 (524) T protein:vir:72 408 RELQHKF-EEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTA 486 (524) T ss_pred HHHHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHH Confidence 9998754 4444444332 2222 3577888877777888889998888888887643 224577766 Q ss_pred HHHHHhhcCcCCCChhhccccccc------------CCCccccc Q lcl|NC_019404. 385 RDTLRTIAPEIKIGDNDIQTEESE------------LITETEVV 416 (418) Q Consensus 385 r~~l~~~~~~~~~~~~~~~~~e~~------------~~~e~e~~ 416 (418) +...-. .+|++|.+.+.. +++|.|.- T Consensus 487 ~k~ILr------~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:72 487 MKDILQ------MTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HHHHhc------cCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 654322 234444333332 22232332 No 210 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=98.40 E-value=8.5e-07 Score=53.87 Aligned_cols=383 Identities=14% Similarity=0.126 Sum_probs=180.7 Q ss_pred CccchhhHHHHhcCC-CCccccCccc---------cCCHHHHHHHHHc---CCccchhhhcchhhhccC-----CccccC Q lcl|NC_019404. 1 MVKTDSYANIFLGGS-DGSEIYGSLQ---------NQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHIDG 62 (418) Q Consensus 1 ~~~~D~~~n~~~g~~-~~~~~~~~~~---------~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i~~ 62 (418) -...||-...-...+ ..+.++|... .++-.+|-..|++ +|.+..+|+.++++|+-- .+++.- T Consensus 34 p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L 113 (524) T protein:vir:10 34 PKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYRNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNL 113 (524) T ss_pred ccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEe Confidence 333444322211111 1112222111 2345667666754 899999999999999762 233322 Q ss_pred cc------hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccc Q lcl|NC_019404. 63 ID------DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQN 132 (418) Q Consensus 63 ~~------d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~ 132 (418) ++ -+++|.++++. |++..+-.+.+|.--+.|.-+.-..++. =+++..|+.++.+||..+.... T Consensus 114 ~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~-------k~pk~GI~Elr~lDPr~i~~vr 186 (524) T protein:vir:10 114 DKSKFSPKIKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDP-------KRPKEGIKELRRLDPRQVQYVR 186 (524) T ss_pred cCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEeeC-------CCccccceeeeeeCCccceeee Confidence 11 12356666654 5667777777776666665554444422 1245567777777777664421 Q ss_pred -ccccccc--ccc-CcceEEEEecC-----------CcccccccCcccEEEecCccchhhhhhccccCCcchHHHH--HH Q lcl|NC_019404. 133 -REENPRN--ARF-GKPLTYRITTN-----------ESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSD--IL 195 (418) Q Consensus 133 -~~~dp~s--~~y-g~p~~y~i~~~-----------~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~--~~ 195 (418) ...++.. ..+ +--++|.++++ ..+...+|+.+=+ +|+..-+-+. ....=.|-|..+ .. T Consensus 187 ~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI-~y~hSGL~d~----~~~~i~gyLhkAiKp~ 261 (524) T protein:vir:10 187 EIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAI-VYAHSGLVDC----CGKNIIGYLHRAVKPA 261 (524) T ss_pred eeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhhe-eeeeccceeC----CCCceeccchhhhHHH Confidence 1111111 111 11223333321 1123345665543 3332222111 000001112211 11 Q ss_pred HHHHHHHHHHHHHHHHHHHc---C-CceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC----------- Q lcl|NC_019404. 196 DSIKDYTNCERLATQLLRRK---Q-QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE----------- 260 (418) Q Consensus 196 ~~l~~~~~~~~~~~~l~~~~---~-~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~----------- 260 (418) +.|+-.+.+ -++++. - -.|+=++ .+++... +. ++-+. ..+...+ +-++-|.. T Consensus 262 NQLkmlEDA-----lVIYRitRAPeRRvFYID-vGnlPk~-KA-eqYl~--~im~k~K---NklvYDa~TGev~ddrk~m 328 (524) T protein:vir:10 262 NQLKLLEDA-----VVIYRITRAPDRRVWYVD-TGNMPAR-KA-AEHMQ--HVMNTMK---NRVVYDASTGKIKNQQHNM 328 (524) T ss_pred HhhhHHHhh-----HHHHhhhccccceEEEEe-cCCCCch-hH-HHHHH--HHHHhcC---ceeEEeCCCCeeccchhhh Confidence 222222222 222322 1 1233333 2222111 11 11111 1111111 11111211 Q ss_pred ---------------CCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCcccccc--ch--hHHHHHHHHHH Q lcl|NC_019404. 261 ---------------SEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSS--SQ--NTALETFHKLI 319 (418) Q Consensus 261 ---------------~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~s--tg--e~d~~~y~~~I 319 (418) +-+++.+.- +++.++| +..|...+=.|.++|.++|-+.+++|++- ++ .-|.-.|..+| T Consensus 329 sMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI 407 (524) T protein:vir:10 329 SMTEDYWLQRRDGKAVTEVDTLPGADNTGNMED-VRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFI 407 (524) T ss_pred hhHhhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHH Confidence 122333332 4555666 46888999999999999997777777652 11 12335699999 Q ss_pred HHHHHHHHHHHHHHHHHH------hhcc-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHHHH Q lcl|NC_019404. 320 DRKRNAELLPILEFLIPF------IVNA-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIA--AGAMDIKEA 384 (418) Q Consensus 320 ~~~Qe~~l~p~l~~l~~~------i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~--~g~i~~~e~ 384 (418) .+.|..+ ..++..+++. ++.. +++.|+|..=..-+|...+|+...+..+++.+-. .-.++.+-+ T Consensus 408 ~rLR~rF-s~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi 486 (524) T protein:vir:10 408 RELQHKF-EEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTA 486 (524) T ss_pred HHHHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHH Confidence 9998754 4444444332 2222 3577888877777888889998888888887643 224577766 Q ss_pred HHHHHhhcCcCCCChhhccccccc------------CCCccccc Q lcl|NC_019404. 385 RDTLRTIAPEIKIGDNDIQTEESE------------LITETEVV 416 (418) Q Consensus 385 r~~l~~~~~~~~~~~~~~~~~e~~------------~~~e~e~~ 416 (418) +...-. .+|++|.+.+.. +++|.|.- T Consensus 487 ~k~ILr------~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 487 MKDILQ------MTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HHHHhc------cCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 654322 234444333322 33333333 No 211 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=98.39 E-value=8.8e-07 Score=53.78 Aligned_cols=367 Identities=10% Similarity=0.078 Sum_probs=172.1 Q ss_pred ccchhhHHHHhc---CCCCcc-ccCccccC-----------CHHHHHHHHHcCCccchhhhcchhhhccCCccccCcch- Q lcl|NC_019404. 2 VKTDSYANIFLG---GSDGSE-IYGSLQNQ-----------APTILASLYADNALVRRIIDTIPETALAAGFHIDGIDD- 65 (418) Q Consensus 2 ~~~D~~~n~~~g---~~~~~~-~~~~~~~~-----------~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d- 65 (418) |.-=.+...+.. +....+ ..+..... +...++.+ .+.+-+..++++.-...++..|+|+..++ T Consensus 1 v~~~~l~~e~at~~~~~d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l-~~D~~i~s~l~~rk~av~~~~w~i~p~~~~ 79 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRPFISGLQVPNDSILQRRGGNDLRVYEEI-LSDAQVKTVWGQRQLAVVSREWKVEAGGDR 79 (488) T ss_pred CCccchhHHHHHHHhhhhhhccccCCCCCCChHHHHhhccCCHHHHHHH-hhChHHHHHHHHHHHHHhcCCceEEcCCCC Confidence 111112211111 000000 00000000 12223344 45788889999999999998999963221 Q ss_pred -H-----HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccccCCCceEEEEEeecccccccccccccc Q lcl|NC_019404. 66 -E-----PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRTQVKVQNREENPR 138 (418) Q Consensus 66 -~-----~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~ 138 (418) + +.+++.++++++...+.+.+ .+.+||.|++=+.-+ ++.. -.+..+...++.++.... . T Consensus 80 ~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~--------~~~~~l~~r~~~~f~~d~-----~ 145 (488) T protein:vir:99 80 PIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRY--------ITLEAIKVRNRRRFRYDQ-----D 145 (488) T ss_pred hHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCe--------eeEeeeeeecccceeecC-----C Confidence 1 34666677777666666655 689999999865432 2111 123445555544433211 1 Q ss_pred ccccCcceEEEEecCCcccccccC-cccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019404. 139 NARFGKPLTYRITTNESDMFYDVH-YSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ 217 (418) Q Consensus 139 s~~yg~p~~y~i~~~~~~~~~~iH-~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~ 217 (418) +... +.. ..+...+..++ |-..+...+. ....++||.+.+ +.||....--..+...-+..+.++++ T Consensus 146 ----~~l~-~~~-~~~~~~g~~lp~~~~~i~~~~~------~~~g~p~g~gLl-~~~~w~~~fK~~~~~~w~~f~E~yG~ 212 (488) T protein:vir:99 146 ----GGLR-LLT-PNNMFEGEPCPAPYFWHFSTGA------DNDDEPYGLGLA-HWLYWPVFFKRNGIKFWLIFLDKFGM 212 (488) T ss_pred ----CceE-Eec-cCCCCCccccccCceEEEEeec------CCCCCcccchHH-HHHHHHHHHHHhhHHHHHHHHHHcCC Confidence 1111 111 11111122332 2222222111 124577899866 56777655555556666677888776 Q ss_pred cee--ecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCH---HHHHHHHHHHHhhhh-cC Q lcl|NC_019404. 218 AVW--KAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI---DAFLDKKFDRIVALS-GI 291 (418) Q Consensus 218 ~v~--k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl---~~~~~~~~~~iaaas-~I 291 (418) ++. |++. .+. ..+...++.......+...++++. .+.+++.++..-++. ..+++.+-.+|+.+. |= T Consensus 213 P~~igky~~------~~a-~~~ek~~l~~av~~~~~~~~~viP-~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGq 284 (488) T protein:vir:99 213 PTAVGRYDD------KTA-TPEDKAKLLAALHAIQTDSAIIMP-AGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLGQ 284 (488) T ss_pred ceeeeecCC------CCC-CHHHHHHHHHHHHHHhcCcEEEec-CCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhhh Confidence 654 4431 111 111223333333334444455554 457899888654443 446777777777542 32 Q ss_pred CeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhcc----CCc-eEEeCCCCCCCHHHHHHHHHH Q lcl|NC_019404. 292 HEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPIL-EFLIPFIVNA----EEW-SVEFSPLDHESSKDKAEVLEK 365 (418) Q Consensus 292 P~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l-~~l~~~i~~~----~~~-~~~f~pL~~~~eke~ae~~~~ 365 (418) | |.++..+|-.|.|+.-.....+.+++..+. +...+ +.|+..++.- ... .+.|....+.+. +. T Consensus 285 --t-lts~~~~Gs~a~~~vh~~v~~d~~~aDa~~-i~~tln~~li~~l~~~N~~~~~~p~~~~~~~e~edl-------~~ 353 (488) T protein:vir:99 285 --V-ASTQGTPGRLGNDDLQADVRLDLVKADADL-ICESFNLGPARWLTEWNFPGAQPPRVYRVIEEPEDI-------TA 353 (488) T ss_pred --h-hcccccccchhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhCcCCcCCceeEecCCCcccH-------HH Confidence 1 223332333345555556677777777654 44444 3466655431 111 233333222222 35 Q ss_pred HHHHHHHHHhC-CC-CCHHHHHHHHHhhcCcCCCChhhcccc-------cccCCC-ccccccC Q lcl|NC_019404. 366 SVNSIAALIAA-GA-MDIKEARDTLRTIAPEIKIGDNDIQTE-------ESELIT-ETEVVIA 418 (418) Q Consensus 366 ~a~a~~~~~~~-g~-i~~~e~r~~l~~~~~~~~~~~~~~~~~-------e~~~~~-e~e~~~~ 418 (418) .|++++++++. |+ ++.+.+++.+.--.+.. .++.... +..... ....+-. T Consensus 354 ~a~~~~~l~~~~G~~i~~~~i~e~~Gip~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 413 (488) T protein:vir:99 354 KAERDEKVFRMSGFRPTRGYVQETYGVEVEST---QAEATAPTPSTEFAEGDQPSDPAAAMAP 413 (488) T ss_pred HHHHHHHHHhhcCCCCCHHHHHHHcCCCCccc---ccccccCCCcccCCCCCCCCCchHHHHH Confidence 57888899886 75 88888887663211110 0000000 000000 0000000 No 212 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=98.38 E-value=9.7e-07 Score=53.56 Aligned_cols=382 Identities=11% Similarity=0.114 Sum_probs=171.1 Q ss_pred Ccc---------chhhHHHHhcCCCCccccC-ccccCCHHHHHHHHHc---CCccchhhhcchhhhccC-----CccccC Q lcl|NC_019404. 1 MVK---------TDSYANIFLGGSDGSEIYG-SLQNQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHIDG 62 (418) Q Consensus 1 ~~~---------~D~~~n~~~g~~~~~~~~~-~~~~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i~~ 62 (418) --+ .||-..+..|+. .+.+.+ .....+-.+|-.-||+ +|.+..+|+.++++|+-- .+.+.- T Consensus 15 ~~~~~s~~~~~~~dg~~~i~~~~~-~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~L 93 (533) T protein:vir:10 15 APKGPSFVQKDNLDGSQPVSGGGY-YGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVNETICGNFDDVPVSVEL 93 (533) T ss_pred cccCCCCCCCCcccccceeecccc-cceeeecccccchHHHHHHHHHHHhhccchhhHHHHhhcceeeecCCCceEEEEe Confidence 111 222222222211 111111 1222345677777765 899999999999999762 233322 Q ss_pred cc------hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccc Q lcl|NC_019404. 63 ID------DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQN 132 (418) Q Consensus 63 ~~------d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~ 132 (418) ++ -.++|.++++. |++..+-.+.+|.--+.|.-+.-..++ |=+++..|..++.+||..+.+.. T Consensus 94 d~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid-------~~~pk~GI~ELr~lDPr~i~~vr 166 (533) T protein:vir:10 94 SNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVID-------PDNPQGGLIELRYIDPRKIRKIN 166 (533) T ss_pred cccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec-------CCCccccceeeeeccccceeeee Confidence 11 12356666654 455555555555444444444333232 12245677788888887766532 Q ss_pred cc----ccc------cccccCc-ceEEEEecC----CcccccccCcccEEEecCccchhhhhhccccCCcchHHHHH--H Q lcl|NC_019404. 133 RE----ENP------RNARFGK-PLTYRITTN----ESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDI--L 195 (418) Q Consensus 133 ~~----~dp------~s~~yg~-p~~y~i~~~----~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~--~ 195 (418) .. .|. ....++. -++|.+++. +...+.+|+++ .|+|+..-+-+ .+.+.=.|-|..++ . T Consensus 167 ~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~vkI~~d-AI~y~hSGl~d----~~~~~i~syLhkAiKp~ 241 (533) T protein:vir:10 167 ETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGLKIAPD-SICYVHSGIMD----LNKNMTLSHLHKAIKAV 241 (533) T ss_pred eeeccCCCccceeecchhhhccceeeeeeccccccccCCCceecchh-heeeeecccee----CCCCceeccchHhHHHH Confidence 11 111 1122333 234444432 12233567664 33443322211 11111112222211 1 Q ss_pred HHHHHHHHHHHHHHHHHHHcC----CceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC----------- Q lcl|NC_019404. 196 DSIKDYTNCERLATQLLRRKQ----QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE----------- 260 (418) Q Consensus 196 ~~l~~~~~~~~~~~~l~~~~~----~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~----------- 260 (418) +.|+-.+.+ .++++.+ -.|+=++ .+++-.. +.+ +-+. ..+...+ +-++-|.. T Consensus 242 NQLkm~EDA-----lVIYRitRAPeRRvFYID-VGnLPk~-KAe-qYlr--~iM~k~K---NklVYDa~TGev~ddrk~m 308 (533) T protein:vir:10 242 NQLRMIEDS-----LVIYRLSRAPERRIFYID-VGNLPKN-KAE-QYLR--EVMGRYR---NKLVYDANTGEIKDDKKFM 308 (533) T ss_pred HhhHHHHhh-----HHHHhhhccccceEEEEe-cCCCCch-hHH-HHHH--HHHHhcc---ceEEEeccCceecccchhh Confidence 222222222 2233321 1233333 2222111 111 1111 1111111 11111111 Q ss_pred ---------------CCceeEee--cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCcccccc--ch--hHHHHHHHHHH Q lcl|NC_019404. 261 ---------------SEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSS--SQ--NTALETFHKLI 319 (418) Q Consensus 261 ---------------~e~~~~~~--~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~s--tg--e~d~~~y~~~I 319 (418) +-+++.+. -+++.++| +..|...+=.+.++|.++|-.+ +|+|- ++ .-|.-.|..+| T Consensus 309 sMlEDyWLPRReGgrgTEItTLpGgqnLgem~D-V~YF~kKLY~aLnVP~SRl~~e--~~f~~Gr~~EItRDEiKF~KFI 385 (533) T protein:vir:10 309 SMLEDFWLPRREGGRGTEITTLPGGQNLGELED-VKYFQKKLYKSLNVPGSRLETE--TTFNVGRAAEITRDEVKFQKFV 385 (533) T ss_pred hhHhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCccccCCC--CcccccccchhhHHHHHHHHHH Confidence 12233333 24555666 4688899999999999999443 45442 11 12335699999 Q ss_pred HHHHHHHHHHHHHHHHHH------hhcc-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH--hCCCCCHHHH Q lcl|NC_019404. 320 DRKRNAELLPILEFLIPF------IVNA-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALI--AAGAMDIKEA 384 (418) Q Consensus 320 ~~~Qe~~l~p~l~~l~~~------i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~--~~g~i~~~e~ 384 (418) .+.|..+ ..++..+++. ++.. +++.|+|..=..-+|...+|+...+..+++.+- -.-.+|.+-+ T Consensus 386 ~RLR~rF-s~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi 464 (533) T protein:vir:10 386 ARLRKRF-SELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYM 464 (533) T ss_pred HHHHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHH Confidence 9998764 4444444332 2222 356788887777778888888888888877752 1223455555 Q ss_pred HHHHHhhcCcCCCChhhccccccc-----------------------CCCccccccC Q lcl|NC_019404. 385 RDTLRTIAPEIKIGDNDIQTEESE-----------------------LITETEVVIA 418 (418) Q Consensus 385 r~~l~~~~~~~~~~~~~~~~~e~~-----------------------~~~e~e~~~~ 418 (418) +...-.. +|++|.+.+.. +.+|..+.-+ T Consensus 465 ~k~ILr~------tDeei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~ 515 (533) T protein:vir:10 465 RRQVLKQ------TDVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGAPA 515 (533) T ss_pred HHHHhcc------CHHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCccc Confidence 4322111 11111111111 1111111111 No 213 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=98.37 E-value=9.2e-07 Score=53.68 Aligned_cols=383 Identities=12% Similarity=0.123 Sum_probs=170.3 Q ss_pred Cc---cchhhHHHHhcCCC--CccccCccccCCHHHHHHHHHc---CCccchhhhcchhhhccC-----CccccCcc--- Q lcl|NC_019404. 1 MV---KTDSYANIFLGGSD--GSEIYGSLQNQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHIDGID--- 64 (418) Q Consensus 1 ~~---~~D~~~n~~~g~~~--~~~~~~~~~~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i~~~~--- 64 (418) ++ ..||... +.||.- .....|.....+-.+|-..|++ +|.+..+|+.++++|+-. .+++.-++ T Consensus 20 ~vpp~~~~~~~~-i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~Av~eIVneaIv~d~~~~pV~vdL~~~~~ 98 (564) T protein:vir:10 20 PVPPNDEASVST-VAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVDSAIDEIVNEFVVNDGDDKPVEVDLQNLEI 98 (564) T ss_pred cccCCcCCChhh-hhccccceeeecccccchhhHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecccCc Confidence 22 2344333 233221 1112222223445577777764 899999999999998752 33343211 Q ss_pred ---hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccc-cccc Q lcl|NC_019404. 65 ---DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQN-REEN 136 (418) Q Consensus 65 ---d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~-~~~d 136 (418) -+++|.++++. |++..+-.+.+|.--+.|..+.-..++. =+++..|+.|+.+||..+.... ..++ T Consensus 99 s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~-------~~pk~GI~eLr~lDPr~i~~vr~i~~~ 171 (564) T protein:vir:10 99 GSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDL-------DNPKKGILELRYIDSLKIRKVRQKLKD 171 (564) T ss_pred chHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeC-------CChhhhhhhhhhhcccceeeeeeeccc Confidence 12456666664 4556555555554444454443333321 1233457777777776555432 1111 Q ss_pred c--cc----------cccCc-ceEEEEecCCc---------------ccccccCcccEEEecCccchhhhhhccccCCcc Q lcl|NC_019404. 137 P--RN----------ARFGK-PLTYRITTNES---------------DMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRS 188 (418) Q Consensus 137 p--~s----------~~yg~-p~~y~i~~~~~---------------~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S 188 (418) + .. -+|+. +++|.+++.+. +...+|+.+=+..- ..-|-+ .+.+.=.| T Consensus 172 ~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~daI~y~-hSGL~d----~~~~~i~g 246 (564) T protein:vir:10 172 VDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIASDAIAQS-TSGLMD----LNKKMTLS 246 (564) T ss_pred cccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeechhhccee-ccccee----CCCCceec Confidence 1 11 12333 56676664321 11234555433222 111111 01111111 Q ss_pred hHHHH--HHHHHHHHHHHHHHHHHHHHHcC----CceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC-- Q lcl|NC_019404. 189 VLSSD--ILDSIKDYTNCERLATQLLRRKQ----QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE-- 260 (418) Q Consensus 189 ~l~~~--~~~~l~~~~~~~~~~~~l~~~~~----~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~-- 260 (418) -|..+ .++.|+-.+.+ .++++.+ -.|+=++ .+++-.. ..+ +-+. ..+...+ +-++-|.. T Consensus 247 yLhkAIKp~NQLkmlEDA-----lVIYRitRAPeRRvFYID-VGnLPk~-KAe-qYlr--~iM~k~K---NklVYDa~TG 313 (564) T protein:vir:10 247 FLHKAIKSLNQLRMIEDS-----LVIYRLSRAPERRIFYID-VGNLPKV-KAE-QYLR--DVMSRYR---NKLVYDGQTG 313 (564) T ss_pred cchhhhHhHHhhHHHHhh-----HHHHhhhccccceEEEEe-cCCCCch-hHH-HHHH--HHHHhcC---ceEEEeccCc Confidence 12211 11222222222 2223221 1233333 2222111 111 1111 1111111 11111111 Q ss_pred ------------------------CCceeEee--cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccc--cch--hH Q lcl|NC_019404. 261 ------------------------SEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLS--SSQ--NT 310 (418) Q Consensus 261 ------------------------~e~~~~~~--~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~--stg--e~ 310 (418) +-+++.+. -+++.++| +..|...+=.+.+||.++|-.+ .+|++ .++ .- T Consensus 314 evrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~D-V~YF~kKLY~aLnVP~SRl~~e-~~~f~~Gr~~EItR 391 (564) T protein:vir:10 314 EIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKD-VEYFKKKLYNSLNLPPSRLTDD-NKAFNLGKSTEILR 391 (564) T ss_pred eecccchhhhhHhhhcccccCCCcccceeeccccCCcchHHH-HHHHHHHHHHHhCCCcccccCC-CceeecccccchhH Confidence 12233333 25566666 4688899999999999999554 33333 222 12 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH------hhcc-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH--h Q lcl|NC_019404. 311 ALETFHKLIDRKRNAELLPILEFLIPF------IVNA-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALI--A 375 (418) Q Consensus 311 d~~~y~~~I~~~Qe~~l~p~l~~l~~~------i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~--~ 375 (418) |.-.|..+|.+.|..+ ..++..+++. ++.. +++.|+|..=..-+|...+|+...+..+++.+- - T Consensus 392 DEiKF~KFI~RLR~rF-s~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyv 470 (564) T protein:vir:10 392 DELKFTKFIGRLRKRF-AQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFV 470 (564) T ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhh Confidence 3346999999998764 4444444332 2222 356788887777778888888888888877752 1 Q ss_pred CCCCCHHHHHHHHHhhcCcCCCChhhcccccc------------cCCC---------ccc-------cccC Q lcl|NC_019404. 376 AGAMDIKEARDTLRTIAPEIKIGDNDIQTEES------------ELIT---------ETE-------VVIA 418 (418) Q Consensus 376 ~g~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~------------~~~~---------e~e-------~~~~ 418 (418) .-.+|.+-++...-.. +|++|.+.+. ++.+ +++ .|-+ T Consensus 471 Gky~S~dyi~k~ILr~------tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~ 535 (564) T protein:vir:10 471 GKYFSTEYIRRKILMQ------TENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQD 535 (564) T ss_pred ccccchHHHHHHHhcc------CHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhcc Confidence 2234555554322111 1111111111 1111 111 1111 No 214 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=98.37 E-value=1e-06 Score=53.40 Aligned_cols=385 Identities=10% Similarity=0.119 Sum_probs=178.3 Q ss_pred Cccch---hhHHHHhcCCCC----ccccCc-----cccCCHHHHHHHHHc---CCccchhhhcchhhhccC-----Cccc Q lcl|NC_019404. 1 MVKTD---SYANIFLGGSDG----SEIYGS-----LQNQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHI 60 (418) Q Consensus 1 ~~~~D---~~~n~~~g~~~~----~~~~~~-----~~~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i 60 (418) ++.-| |-.-+=.+.+.+ +..++. ....+-.+|-..|++ +|.+..+|+.++++|+-. .+++ T Consensus 30 ~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l 109 (521) T protein:vir:81 30 IAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVENAVQNIVNDAIVFEEGHEVVSL 109 (521) T ss_pred cccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEE Confidence 11111 000000001101 111111 112244566666654 899999999999999762 3333 Q ss_pred cCcc------hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccc Q lcl|NC_019404. 61 DGID------DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKV 130 (418) Q Consensus 61 ~~~~------d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~ 130 (418) .-++ -.++|.++++. |++..+-.+.+|.--+.|.-+.-..+++ +++..|..++.+||..+.. T Consensus 110 ~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~--------~pk~GI~Elr~lDPr~i~~ 181 (521) T protein:vir:81 110 NLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIGK--------NPKDGIVELRQLDPRNLEY 181 (521) T ss_pred EecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEEcC--------CccccceeeeeeCCcceee Confidence 2211 12456666654 5667777777776666776666555531 1355677777777766654 Q ss_pred ccc---cccccccccCcc-eEEEEecC-----------CcccccccCcccEEEecCccchhhhhhccccCCcchHHHHH- Q lcl|NC_019404. 131 QNR---EENPRNARFGKP-LTYRITTN-----------ESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDI- 194 (418) Q Consensus 131 ~~~---~~dp~s~~yg~p-~~y~i~~~-----------~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~- 194 (418) ... ...+.-..++.. ++|.++++ +...+.+|+.+=+ +|+..-+- +...+.=.|-|..++ T Consensus 182 vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI-~y~hSGl~----d~~~~~i~syLhkAiK 256 (521) T protein:vir:81 182 VREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAI-TYAHSGLM----DCDDKYIIGYLHRAVK 256 (521) T ss_pred eeeecccccCccceecceeeeeeeecCCccccccceeecCCcceeechhhe-eeeeccce----eCCCCeeeecchhhhH Confidence 321 111111112222 23333221 1122345555433 33322111 111111112222211 Q ss_pred -HHHHHHHHHHHHHHHHHHHHc---C-CceeecchHHHhhcCcchHHHHHHHHHHHHHhcC------Ccce-------eE Q lcl|NC_019404. 195 -LDSIKDYTNCERLATQLLRRK---Q-QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSG------VGQA-------IG 256 (418) Q Consensus 195 -~~~l~~~~~~~~~~~~l~~~~---~-~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~------~~~~-------~~ 256 (418) .+.|+-.+.+ .++++. - -.|+=++ .+++... +.+ +-+. ..+...++ .++. +. T Consensus 257 p~NQLkm~EDA-----lVIYRitRAPeRRvFYID-vGnlpk~-KAe-qYl~--~im~k~kNklvYDa~TGev~ddrk~ms 326 (521) T protein:vir:81 257 PANQLKLLEDA-----MVVYRITRAPERRVFFID-TGNMNNR-KAA-QHMN--SVAQSFKNRVVYDASTGKLKNQQANLS 326 (521) T ss_pred hHHhhHHHHhh-----HHHHhhhccccceEEEEe-cCCCCch-hHH-HHHH--HHHHhcCceeEeecccccccccccccc Confidence 1223322222 223332 1 1233333 2233211 111 1111 11111111 0000 00 Q ss_pred E----------EcCCCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCcccccc--ch--hHHHHHHHHHHH Q lcl|NC_019404. 257 I----------DAESEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSS--SQ--NTALETFHKLID 320 (418) Q Consensus 257 ~----------d~~~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~s--tg--e~d~~~y~~~I~ 320 (418) + .+.+-+++.+.- +++.++| +..|..-+=.|.++|.++|-.++.+|++- ++ .-|.-.|..+|. T Consensus 327 MlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~ 405 (521) T protein:vir:81 327 MTEDYWLQRRDGKAITDVTTLPGASGMSDIDD-IRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIR 405 (521) T ss_pred hhhhhcccccCCCcccceeecccCCCCChHHH-HHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHH Confidence 0 011123444432 5566666 46888999999999999996666666542 11 123356999999 Q ss_pred HHHHHHHHHHHHHHHHH------hhccC-------CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHHHHH Q lcl|NC_019404. 321 RKRNAELLPILEFLIPF------IVNAE-------EWSVEFSPLDHESSKDKAEVLEKSVNSIAALIA--AGAMDIKEAR 385 (418) Q Consensus 321 ~~Qe~~l~p~l~~l~~~------i~~~~-------~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~--~g~i~~~e~r 385 (418) +.|..+ .+++..+++. ++..+ .+.|+|..=..-+|...+|+...+.++++.+-. .-.++.+-++ T Consensus 406 rLR~rF-s~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~ 484 (521) T protein:vir:81 406 TRQSQF-SEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVM 484 (521) T ss_pred HHHHHH-HHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHH Confidence 998754 4444444332 23233 467888877777888889998888888887643 2245777666 Q ss_pred HHHHhhcCcCCCChhhccccccc------------CCCccccc Q lcl|NC_019404. 386 DTLRTIAPEIKIGDNDIQTEESE------------LITETEVV 416 (418) Q Consensus 386 ~~l~~~~~~~~~~~~~~~~~e~~------------~~~e~e~~ 416 (418) ...-. .+|++|.+.+.. ++++.|+- T Consensus 485 k~ILr------~tDeei~~~~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:81 485 RDILK------YTDDQMDTEKKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred HHHhc------cCHHHHHHHHHHHHHHhhCCCCCCCcccccCC Confidence 54322 233333332222 22222222 No 215 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=98.34 E-value=1.2e-06 Score=53.03 Aligned_cols=385 Identities=14% Similarity=0.144 Sum_probs=178.0 Q ss_pred CccchhhHHHHhcCCCC--------ccccCc--cccCCHHHHHHHHHc---CCccchhhhcchhhhccC-----CccccC Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDG--------SEIYGS--LQNQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHIDG 62 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~--------~~~~~~--~~~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i~~ 62 (418) --..||-...-.++..+ .+.|+. +..++-.+|-..|++ +|.+..+|+.++++|+-- .+.+.- T Consensus 34 p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~L 113 (523) T protein:vir:68 34 PKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYRNLMTNYEVDNAVSEIVSDAIVYEDDTEVVSINL 113 (523) T ss_pred cCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHHHHhhccchhhHHHHhhcceeeecCCCceEEEEe Confidence 22233322221111111 111221 222355677667754 899999999999999762 233322 Q ss_pred cc------hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccc Q lcl|NC_019404. 63 ID------DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQN 132 (418) Q Consensus 63 ~~------d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~ 132 (418) ++ -.++|.++++. |++..+-.+.+|.--+.|.-+.-..++. =+++..|+.++.+||..+.... T Consensus 114 d~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~-------k~pk~GI~Elr~lDPr~i~~vr 186 (523) T protein:vir:68 114 DNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDP-------KRPKEGIKELRRLDPRQVQYVR 186 (523) T ss_pred cccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEEEEEEEeeC-------CCccccceeeeeeCCcceeEEE Confidence 11 12456666654 5667777777776666665554444422 1245567777777777664432 Q ss_pred ccccccccc---c-CcceEEEEecCC-----------cccccccCcccEEEecCccchhhhhhccccCCcchHHHH--HH Q lcl|NC_019404. 133 REENPRNAR---F-GKPLTYRITTNE-----------SDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSD--IL 195 (418) Q Consensus 133 ~~~dp~s~~---y-g~p~~y~i~~~~-----------~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~--~~ 195 (418) ...+....+ + +--++|.+++.. .+...+|+.+=+ +|+..-|-+. ....=.|-|..+ .. T Consensus 187 ~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI-~y~hSGL~d~----~~~~i~gyLhkAiKp~ 261 (523) T protein:vir:68 187 EVITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKAAI-VYAHSGLVDC----CGKNIIGYLHRAIKPA 261 (523) T ss_pred eecCCCCcchhhhhhhhhheeeccccccccccccccCCCcceecchhhe-eeeeccceeC----CCCceeccchhhhHHH Confidence 211111111 1 112233333211 123345665543 3332222111 000001112211 11 Q ss_pred HHHHHHHHHHHHHHHHHHHc---C-CceeecchHHHhhcCcchHHHHHHHHHHHHHhcC------CcceeE--------- Q lcl|NC_019404. 196 DSIKDYTNCERLATQLLRRK---Q-QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSG------VGQAIG--------- 256 (418) Q Consensus 196 ~~l~~~~~~~~~~~~l~~~~---~-~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~------~~~~~~--------- 256 (418) +.|+-.+.+ -++++. - -.|+=++ .+++... +. ++-+. ..+...++ .++.+- T Consensus 262 NQLkmlEDA-----lVIYRitRAPeRRvFYID-vGnlPk~-KA-eqYl~--~im~k~kNKlvYDa~TGev~ddrk~msMl 331 (523) T protein:vir:68 262 NQLKLLEDA-----VVIYRITRAPDRRVWYVD-TGNMPSR-KA-AEHMQ--HVMNTMKNRIAYDATTGKIKNQQHIMSMT 331 (523) T ss_pred HhhHHHHhh-----HHHHhhhccccceEEEEe-cCCCCch-hH-HHHHH--HHHHhhcceeEEeccCCeeccchhhhhhH Confidence 222222222 222321 1 1233333 2222111 11 11111 01111111 011100 Q ss_pred ----E----EcCCCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccc--cch--hHHHHHHHHHHHHH Q lcl|NC_019404. 257 ----I----DAESEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLS--SSQ--NTALETFHKLIDRK 322 (418) Q Consensus 257 ----~----d~~~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~--stg--e~d~~~y~~~I~~~ 322 (418) + .+.+-+++.+.- +++.++| +..|..-+=.|.+||.++|-+++ +|++ .++ .-|.-.|..+|.+. T Consensus 332 EDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~~~-~~f~~Gr~~EItRDEikF~KFI~rL 409 (523) T protein:vir:68 332 EDYWLQRRDGKAVTEVDTLPGADNTGNMED-VRWFRNALYMALRIPITRIPSDQ-GGIQFDAGTSITRDELSFGKFIREL 409 (523) T ss_pred hhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHHHHhCCcceeecCCC-cceecccccchhHHHHHHHHHHHHH Confidence 0 011123433332 4555666 46888999999999999996543 4444 222 12335699999999 Q ss_pred HHHHHHHHHHHHHHH------hhcc-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHHHHHHH Q lcl|NC_019404. 323 RNAELLPILEFLIPF------IVNA-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIA--AGAMDIKEARDT 387 (418) Q Consensus 323 Qe~~l~p~l~~l~~~------i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~--~g~i~~~e~r~~ 387 (418) |..+ ..++..+++. ++.. +++.|+|..=..-+|...+|+...+..+++.+-. .-.++.+-++.. T Consensus 410 R~rF-s~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ 488 (523) T protein:vir:68 410 QHKF-EEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKD 488 (523) T ss_pred HHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHH Confidence 8754 4444444332 2222 3567888877777888889998888888887643 224577766654 Q ss_pred HHhhcCcCCCChhhcccccc------------cCCCccccc Q lcl|NC_019404. 388 LRTIAPEIKIGDNDIQTEES------------ELITETEVV 416 (418) Q Consensus 388 l~~~~~~~~~~~~~~~~~e~------------~~~~e~e~~ 416 (418) .-. .+|++|.+.+. ++++|.|.- T Consensus 489 ILr------~tDeei~~~~kqI~~E~k~~~~~~p~~e~~~f 523 (523) T protein:vir:68 489 ILQ------MSDEEIEQEAKQIEEESKEARFQDPDQEQEDF 523 (523) T ss_pred Hhc------cCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 322 23444333332 233333333 No 216 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=98.33 E-value=1.3e-06 Score=52.95 Aligned_cols=382 Identities=12% Similarity=0.097 Sum_probs=176.7 Q ss_pred Cccc---hhhH-------HHHhcCCCCccccCccccCCHHHHHHHHHc---CCccchhhhcchhhhccC-----CccccC Q lcl|NC_019404. 1 MVKT---DSYA-------NIFLGGSDGSEIYGSLQNQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHIDG 62 (418) Q Consensus 1 ~~~~---D~~~-------n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i~~ 62 (418) ++.- ||-. |...||.-+.........++-.+|-..|++ +|.+..+|+.++++|+-. .+++.- T Consensus 29 ~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L 108 (516) T protein:vir:10 29 IATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPEVERAVANIVNEAIVYERGHKVVSLDL 108 (516) T ss_pred ccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEe Confidence 1111 1111 111121111111122233355677777754 899999999999999762 233322 Q ss_pred cc------hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccc Q lcl|NC_019404. 63 ID------DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQN 132 (418) Q Consensus 63 ~~------d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~ 132 (418) ++ -.++|.++++. |++..+-.+.+|.--+.|.-+.--.+ | +++..|..++.+||..+.... T Consensus 109 ~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKii-d--------~~k~GI~Elr~lDPr~i~~vR 179 (516) T protein:vir:10 109 DDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIM-P--------NPKKGIAELRRLDPRFMEYYR 179 (516) T ss_pred cccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEe-c--------CccccceeeeeeCCcceeeEe Confidence 11 12456666665 45566666666655555655433223 2 344566777777776654421 Q ss_pred cc--ccccccc--cCcceEEEEe---------cC--CcccccccCcccEEEecCccchhhhhhccccCCcchHHHH--HH Q lcl|NC_019404. 133 RE--ENPRNAR--FGKPLTYRIT---------TN--ESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSD--IL 195 (418) Q Consensus 133 ~~--~dp~s~~--yg~p~~y~i~---------~~--~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~--~~ 195 (418) .. .|..... -|--++|.++ +. +...+.+|+.|= |+|+..-+-+. +.+.=.|-|..+ .. T Consensus 180 ~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dA-I~y~hSGL~d~----~~~~i~syLhkAiKp~ 254 (516) T protein:vir:10 180 EIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSA-VVYASSGLMDC----SDRGIIGYLHNAVKPA 254 (516) T ss_pred eecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhh-eeeecccceeC----CCCceeeeehhhhHhH Confidence 11 1111110 0111222222 21 112234556553 33333222110 100001112211 11 Q ss_pred HHHHHHHHHHHHHHHHHHHc---C-CceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC----------- Q lcl|NC_019404. 196 DSIKDYTNCERLATQLLRRK---Q-QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE----------- 260 (418) Q Consensus 196 ~~l~~~~~~~~~~~~l~~~~---~-~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~----------- 260 (418) +.|+-.+.+ .++++. - -.|+=++ .+++... +.+ +-+. ..+...+ +-++-|.. T Consensus 255 NQLkm~EDA-----lVIYRitRAPeRRvFYID-vGnlPk~-KAe-qYl~--~im~k~k---NklvYDa~TGev~ddrk~m 321 (516) T protein:vir:10 255 NQLKLLEDA-----MVIYRITRAPERRVFYID-VGNMNNR-KAT-EYVN--GIMQSLK---NRVVYDSNTGTVKNQKRNL 321 (516) T ss_pred HhhHHHHhh-----HHHHhhhccccceEEEEe-cCCCCch-hHH-HHHH--HHHHhcC---ceeEEeCCCCeeccchhhh Confidence 223322222 222321 1 1223333 2222111 111 1111 1111111 11111211 Q ss_pred ---------------CCceeEee--cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCcccc--ccch--hHHHHHHHHHH Q lcl|NC_019404. 261 ---------------SEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGL--SSSQ--NTALETFHKLI 319 (418) Q Consensus 261 ---------------~e~~~~~~--~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl--~stg--e~d~~~y~~~I 319 (418) +-+++.+. -+++.++| +..|..-+=.|.++|.++|-.+++..+ +.++ .-|.-.|..+| T Consensus 322 sMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI 400 (516) T protein:vir:10 322 SMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDD-VRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFV 400 (516) T ss_pred hhHhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHH Confidence 12233333 24555666 468889999999999999976666544 2221 22334699999 Q ss_pred HHHHHHHHHHHHHHHHHH------hhcc-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH--hCCCCCHHHH Q lcl|NC_019404. 320 DRKRNAELLPILEFLIPF------IVNA-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALI--AAGAMDIKEA 384 (418) Q Consensus 320 ~~~Qe~~l~p~l~~l~~~------i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~--~~g~i~~~e~ 384 (418) .+.|..+ .+++..+++. ++.. +++.|+|..=..-+|...+|+...+.++++.+- -...++.+-+ T Consensus 401 ~rLR~rF-s~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi 479 (516) T protein:vir:10 401 VQLQHDF-EEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYV 479 (516) T ss_pred HHHHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHH Confidence 9998754 4444444332 2222 356788887777788888999998888888874 3567888877 Q ss_pred HHHHHhhcCcCCCChhhcccccccCCC----------cccccc Q lcl|NC_019404. 385 RDTLRTIAPEIKIGDNDIQTEESELIT----------ETEVVI 417 (418) Q Consensus 385 r~~l~~~~~~~~~~~~~~~~~e~~~~~----------e~e~~~ 417 (418) +...-. .++++|++.+..++. |.|.-+ T Consensus 480 ~k~ILr------~tDeei~~e~k~I~~E~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 480 MKNILQ------MTEEQIAQEEKQIEQEAGIKRFQNPENEDDF 516 (516) T ss_pred HHHHhc------CCHhhHHHHHHHHHHhhhCCCCCCCCccccC Confidence 754322 233343333322221 222222 No 217 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=98.33 E-value=1.3e-06 Score=52.95 Aligned_cols=382 Identities=12% Similarity=0.097 Sum_probs=176.7 Q ss_pred Cccc---hhhH-------HHHhcCCCCccccCccccCCHHHHHHHHHc---CCccchhhhcchhhhccC-----CccccC Q lcl|NC_019404. 1 MVKT---DSYA-------NIFLGGSDGSEIYGSLQNQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHIDG 62 (418) Q Consensus 1 ~~~~---D~~~-------n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i~~ 62 (418) ++.- ||-. |...||.-+.........++-.+|-..|++ +|.+..+|+.++++|+-. .+++.- T Consensus 29 ~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L 108 (516) T protein:vir:10 29 IATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPEVERAVANIVNEAIVYERGHKVVSLDL 108 (516) T ss_pred ccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEe Confidence 1111 1111 111121111111122233355677777754 899999999999999762 233322 Q ss_pred cc------hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccc Q lcl|NC_019404. 63 ID------DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQN 132 (418) Q Consensus 63 ~~------d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~ 132 (418) ++ -.++|.++++. |++..+-.+.+|.--+.|.-+.--.+ | +++..|..++.+||..+.... T Consensus 109 ~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKii-d--------~~k~GI~Elr~lDPr~i~~vR 179 (516) T protein:vir:10 109 DDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIM-P--------NPKKGIAELRRLDPRFMEYYR 179 (516) T ss_pred cccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEe-c--------CccccceeeeeeCCcceeeEe Confidence 11 12456666665 45566666666655555655433223 2 344566777777776654421 Q ss_pred cc--ccccccc--cCcceEEEEe---------cC--CcccccccCcccEEEecCccchhhhhhccccCCcchHHHH--HH Q lcl|NC_019404. 133 RE--ENPRNAR--FGKPLTYRIT---------TN--ESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSD--IL 195 (418) Q Consensus 133 ~~--~dp~s~~--yg~p~~y~i~---------~~--~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~--~~ 195 (418) .. .|..... -|--++|.++ +. +...+.+|+.|= |+|+..-+-+. +.+.=.|-|..+ .. T Consensus 180 ~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dA-I~y~hSGL~d~----~~~~i~syLhkAiKp~ 254 (516) T protein:vir:10 180 EIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSA-VVYASSGLMDC----SDRGIIGYLHNAVKPA 254 (516) T ss_pred eecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhh-eeeecccceeC----CCCceeeeehhhhHhH Confidence 11 1111110 0111222222 21 112234556553 33333222110 100001112211 11 Q ss_pred HHHHHHHHHHHHHHHHHHHc---C-CceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC----------- Q lcl|NC_019404. 196 DSIKDYTNCERLATQLLRRK---Q-QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE----------- 260 (418) Q Consensus 196 ~~l~~~~~~~~~~~~l~~~~---~-~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~----------- 260 (418) +.|+-.+.+ .++++. - -.|+=++ .+++... +.+ +-+. ..+...+ +-++-|.. T Consensus 255 NQLkm~EDA-----lVIYRitRAPeRRvFYID-vGnlPk~-KAe-qYl~--~im~k~k---NklvYDa~TGev~ddrk~m 321 (516) T protein:vir:10 255 NQLKLLEDA-----MVIYRITRAPERRVFYID-VGNMNNR-KAT-EYVN--GIMQSLK---NRVVYDSNTGTVKNQKRNL 321 (516) T ss_pred HhhHHHHhh-----HHHHhhhccccceEEEEe-cCCCCch-hHH-HHHH--HHHHhcC---ceeEEeCCCCeeccchhhh Confidence 223322222 222321 1 1223333 2222111 111 1111 1111111 11111211 Q ss_pred ---------------CCceeEee--cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCcccc--ccch--hHHHHHHHHHH Q lcl|NC_019404. 261 ---------------SEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGL--SSSQ--NTALETFHKLI 319 (418) Q Consensus 261 ---------------~e~~~~~~--~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl--~stg--e~d~~~y~~~I 319 (418) +-+++.+. -+++.++| +..|..-+=.|.++|.++|-.+++..+ +.++ .-|.-.|..+| T Consensus 322 sMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI 400 (516) T protein:vir:10 322 SMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDD-VRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFV 400 (516) T ss_pred hhHhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHH Confidence 12233333 24555666 468889999999999999976666544 2221 22334699999 Q ss_pred HHHHHHHHHHHHHHHHHH------hhcc-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH--hCCCCCHHHH Q lcl|NC_019404. 320 DRKRNAELLPILEFLIPF------IVNA-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALI--AAGAMDIKEA 384 (418) Q Consensus 320 ~~~Qe~~l~p~l~~l~~~------i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~--~~g~i~~~e~ 384 (418) .+.|..+ .+++..+++. ++.. +++.|+|..=..-+|...+|+...+.++++.+- -...++.+-+ T Consensus 401 ~rLR~rF-s~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi 479 (516) T protein:vir:10 401 VQLQHDF-EEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYV 479 (516) T ss_pred HHHHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHH Confidence 9998754 4444444332 2222 356788887777788888999998888888874 3567888877 Q ss_pred HHHHHhhcCcCCCChhhcccccccCCC----------cccccc Q lcl|NC_019404. 385 RDTLRTIAPEIKIGDNDIQTEESELIT----------ETEVVI 417 (418) Q Consensus 385 r~~l~~~~~~~~~~~~~~~~~e~~~~~----------e~e~~~ 417 (418) +...-. .++++|++.+..++. |.|.-+ T Consensus 480 ~k~ILr------~tDeei~~e~k~I~~E~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 480 MKNILQ------MTEEQIAQEEKQIEQEAGIKRFQNPENEDDF 516 (516) T ss_pred HHHHhc------CCHhhHHHHHHHHHHhhhCCCCCCCCccccC Confidence 754322 233343333322221 222222 No 218 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=98.27 E-value=1.8e-06 Score=52.03 Aligned_cols=364 Identities=12% Similarity=-0.005 Sum_probs=168.4 Q ss_pred CccchhhHHHHhc-CCCC-cccc-CccccCCHHHHH-------------------HHHHcCCccchhhhcchhhhccCCc Q lcl|NC_019404. 1 MVKTDSYANIFLG-GSDG-SEIY-GSLQNQAPTILA-------------------SLYADNALVRRIIDTIPETALAAGF 58 (418) Q Consensus 1 ~~~~D~~~n~~~g-~~~~-~~~~-~~~~~~~~~~l~-------------------~~Y~~~~~~r~iVd~~a~d~~r~~~ 58 (418) =++...+..--.. .+.. .... ...+.+|+..+. .+-.+.+-+..++.+--...+...| T Consensus 11 p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L~~dm~~~D~hi~s~l~~Rk~av~~~~w 90 (512) T protein:vir:19 11 PFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADLAFDMEEKDTHLFSELSKRRLAIQALEW 90 (512) T ss_pred ccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCCCc Confidence 0111111110000 0000 0011 112223433322 2223455666666666666676677 Q ss_pred cccCc--ch--HH----HHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCCcccccccCCCceEEEEEeeccccc Q lcl|NC_019404. 59 HIDGI--DD--EP----AFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRTQVK 129 (418) Q Consensus 59 ~i~~~--~d--~~----~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~~l~~pl~~~~~i~~i~v~~~~~i~ 129 (418) .|+-. ++ .+ .+++.+..+.-+..+..-+-.+.+||.|++=+.-+ ++. ...++.+...++.++. T Consensus 91 ~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~lldA~~~G~s~~Ei~w~~~~g--------~~~~~~~~~r~~~~f~ 162 (512) T protein:vir:19 91 RIAPARDASAQEKKDADMLNEYLHDAAWFEDALFDAGDAILKGYSMQEIEWGWLGK--------MRVPVALHHRDPALFC 162 (512) T ss_pred eEecCCCCCHHHHHHHHHHHHHHhcCCCHHHHHHHHHhhhhhcceeeeeEeeeeCC--------ceeeeeeeeeccccce Confidence 77522 11 11 24445555543445555556799999999855332 111 1234455555655443 Q ss_pred cccccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 130 VQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLAT 209 (418) Q Consensus 130 ~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~ 209 (418) ..... .. ...+..+ ...+..++|.+.+.+...+ ...+++|.+.+ +.||....--..+...-+ T Consensus 163 ~~~~~---------~~-~lr~~~~-~~~G~~l~~~k~i~~~~~~------~~g~p~g~gLl-r~~~w~~~fK~~~~~~w~ 224 (512) T protein:vir:19 163 ANPDN---------LN-ELRLRDA-SYHGLELQPFGWFMHRAKS------RTGYVGTNGLV-RTLIWPFIFKNYSVRDFA 224 (512) T ss_pred eccCC---------Cc-EEEecCC-CCCceeecCCceEEEeccC------CCCCcccccHH-HHHHHHHHHHHHHHHHHH Confidence 22110 00 0111111 1123456666666554332 34567888866 567776766667777778 Q ss_pred HHHHHcCCce--eecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCCH---HHHHHHHHHH Q lcl|NC_019404. 210 QLLRRKQQAV--WKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI---DAFLDKKFDR 284 (418) Q Consensus 210 ~l~~~~~~~v--~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~gl---~~~~~~~~~~ 284 (418) ..+.++++++ .|++.- ..+ ....++.......+.... ++.-++.+++.++..-++. ..+++.+-.. T Consensus 225 ~f~E~yG~P~~igky~~~-----a~~---~ek~~L~~al~~~~~~a~-~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~ 295 (512) T protein:vir:19 225 EFLEIYGLPMRVGKYPTG-----STN---REKATLMQAVMDIGRRAG-GIIPMGMTLDFQSAADGQSDPFMAMIGWAEKA 295 (512) T ss_pred HHHHHcCCCeeEEecCCC-----CCH---HHHHHHHHHHHHHhhCcE-EEecCCceEEEeecCCCCHHHHHHHHHHHHHH Confidence 8888888654 455421 111 122233333333343444 4445567899888754443 3456666666 Q ss_pred HhhhhcCCeeeeeccC-cc-----ccccchhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhccC-----C----ceEEe Q lcl|NC_019404. 285 IVALSGIHEIILKNKN-VG-----GLSSSQNTALETFHKLIDRKRNAELLPIL-EFLIPFIVNAE-----E----WSVEF 348 (418) Q Consensus 285 iaaas~IP~t~L~G~s-~~-----gl~stge~d~~~y~~~I~~~Qe~~l~p~l-~~l~~~i~~~~-----~----~~~~f 348 (418) |+.+ ++||+ ++ |-+|.|+--.....+.+++..+. +...+ +.|+.-++.-. + =.+.| T Consensus 296 Isk~-------iLGqtlTs~~g~~Gs~a~~~vh~ev~~di~~aDa~~-i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f 367 (512) T protein:vir:19 296 ISKA-------ILGGTLTTEAGDKGARSLGEVHDEVRREIRNADVGQ-LARSINRDLIYPLLALNSDSTIDINRLPGIVF 367 (512) T ss_pred HHHH-------HhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhCCCCCCCccccceEEe Confidence 7744 23444 22 22334444555666666666554 44444 44666654211 1 12344 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHhCC-CCCHHHHHHHHHhhcCcCCCChhhcccccccCCCccccccC Q lcl|NC_019404. 349 SPLDHESSKDKAEVLEKSVNSIAALIAAG-AMDIKEARDTLRTIAPEIKIGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 349 ~pL~~~~eke~ae~~~~~a~a~~~~~~~g-~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e~e~~~~ 418 (418) ..-. ..+ .++.++++.++. .| -++.+.+++.+.--.+ . .++++...........+..-+ T Consensus 368 ~~~e------~eD-l~~~a~~~~~l~-~G~~i~~~~i~e~~Gip~~--~-~~e~~~~~~~~~~~~~~~~~~ 427 (512) T protein:vir:19 368 DTSE------AGD-ITALSDAIPKLA-AGMRIPVSWIQEKLHIPQP--V-GDEAVFTIQPVVPDNGSQKEA 427 (512) T ss_pred cCCC------hhh-HHHHHHHHHHHh-cCCCCCHHHHHHHhCCCCC--C-CccccccCCCccccccccccc Confidence 3322 112 245667777775 56 4899999987742111 1 111111111111111111111 No 219 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=98.20 E-value=1.9e-06 Score=51.99 Aligned_cols=389 Identities=9% Similarity=-0.001 Sum_probs=181.1 Q ss_pred CccchhhHH--------HHhcCCCC----ccccCc-cccCCHHHHHHHHH----cCCccchhhhcchhhhccCCccccCc Q lcl|NC_019404. 1 MVKTDSYAN--------IFLGGSDG----SEIYGS-LQNQAPTILASLYA----DNALVRRIIDTIPETALAAGFHIDGI 63 (418) Q Consensus 1 ~~~~D~~~n--------~~~g~~~~----~~~~~~-~~~~~~~~l~~~Y~----~~~~~r~iVd~~a~d~~r~~~~i~~~ 63 (418) -.+.+...+ .+++|... +..|-+ +..-+. +-++.|. ..++.+++|+..+...+|+.++++.+ T Consensus 11 ~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~-~~Y~~rl~rA~~~n~~~~tl~~l~G~vf~k~p~~~~~ 89 (513) T protein:vir:97 11 TTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETD-KGYQERLASAVLLNMVEQTLDTLSGKPFSEPIKLNED 89 (513) T ss_pred cCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCH-HHHHHHHhcccCCChHHHHHHHHhhhhhhcCcccCcC Confidence 111111111 23444321 111211 111111 2222332 25788999999999999999888642 Q ss_pred chHHHHHHHH-HH-----hCchHHHHHHHHhccccceEEEEEeecCCCcc--ccccc-----CCCceEEEEEeeccccc- Q lcl|NC_019404. 64 DDEPAFWSRW-DD-----LEMTQNINDAWSWARLFGGAAIVAIVKDNRAL--TSPVR-----EGAELETVRVYDRTQVK- 129 (418) Q Consensus 64 ~d~~~i~~~~-~~-----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l--~~pl~-----~~~~i~~i~v~~~~~i~- 129 (418) ...++.+.| +. .++.+.++.+++....||.|+|++........ ..|+. ..+..-||..+.+.+|- T Consensus 90 -~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy~~~~~~e~Iin 168 (513) T protein:vir:97 90 -VPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRPYWVMIKPECLLF 168 (513) T ss_pred -chHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCceEEEecHhhhcC Confidence 233444433 22 46789999999999999999999875321111 01110 11111233333332220 Q ss_pred -------------cc-----ccccccccccc---------CcceEEEEecCCcc--cccccCcccEEEecCccc-hhhhh Q lcl|NC_019404. 130 -------------VQ-----NREENPRNARF---------GKPLTYRITTNESD--MFYDVHYSRIHIIDGERV-PNAMR 179 (418) Q Consensus 130 -------------~~-----~~~~dp~s~~y---------g~p~~y~i~~~~~~--~~~~iH~SR~i~~~g~~l-p~~~~ 179 (418) -. ....|.+.... |.-+.|+....+.. ....+|.+.-..+.--|+ +.... T Consensus 169 W~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e~~~~~~g~~~l~~IP~v~~~~~ 248 (513) T protein:vir:97 169 ARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEEWALADEWATGLNYVPLVTFYAD 248 (513) T ss_pred cceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCccccceEEecCCCCcCCceeEEEEecC Confidence 00 00112211100 11112221111111 111222222111111111 11112 Q ss_pred hccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEc Q lcl|NC_019404. 180 RQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDA 259 (418) Q Consensus 180 ~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~ 259 (418) .....-|.+||+..++=.+..|... ..--++++...++++-+.++...- .+. ...+.+..+.+.+ T Consensus 249 ~~~~~~~~pPLl~LA~ln~~hy~~~-Sd~~~il~~~~~P~l~~~G~~~~~--~~~------------i~iG~~~~~~lpe 313 (513) T protein:vir:97 249 RQGFMMGKPPLLDLAHLNVAHWQSA-SDQRHILTVSRFPILACSGASGED--SDP------------VVVGPNKVLYNPD 313 (513) T ss_pred CCCCCCCccchHHHHHHHHHHHhhh-hhHHHHHHhcccceeeeecCCcCC--CCc------------eEeeccccccCCC Confidence 2223348899998777777777444 445567888888888777643210 010 1123344444554 Q ss_pred CCCceeEeecccCCHH---HHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHH---HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 260 ESEEYSVLNSDIGGID---AFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTAL---ETFHKLIDRKRNAELLPILEF 333 (418) Q Consensus 260 ~~e~~~~~~~~~~gl~---~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~---~~y~~~I~~~Qe~~l~p~l~~ 333 (418) ++.++..++.+-+++. +.++...+++..+--.+ | ..+++ +.|++.-. ..=+..++++.. .+...+++ T Consensus 314 ~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga~l---l-~~~~~--~~Ta~a~~~~~~~~~S~L~~~a~-~le~al~~ 386 (513) T protein:vir:97 314 PAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGAEF---L-KRKTG--GQTATARALDSAEATSDLSAMTG-LFEDALAQ 386 (513) T ss_pred CCCcceeeccCchhHHHHHHHHHHHHHHHHHHHHHh---h-ccCCc--cccHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Confidence 4566777777666654 44555556664433322 2 22333 23443322 223344444443 36777888 Q ss_pred HHHHhhc-----cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcC-CCChhhcc-ccc Q lcl|NC_019404. 334 LIPFIVN-----AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEI-KIGDNDIQ-TEE 406 (418) Q Consensus 334 l~~~i~~-----~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~-~~~~~~~~-~~e 406 (418) +++++.. .++.+|+.++-+.....+. ...+++..+++.|.|+-+..++.|++.+... .++++.+. +.. T Consensus 387 ~l~~~a~wlg~~~~~~~v~in~dF~~~~~~~-----~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~ 461 (513) T protein:vir:97 387 ALDITADWLRLGPNGGTVELVKDYDLEEMDA-----PGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEELM 461 (513) T ss_pred HHHHHHHHhCCCCCccEEEeccccCcccCCH-----HHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHHHH Confidence 8888743 2357777776554333222 2356777888999999999998887654321 22211111 111 Q ss_pred ccCC------------------------CccccccC Q lcl|NC_019404. 407 SELI------------------------TETEVVIA 418 (418) Q Consensus 407 ~~~~------------------------~e~e~~~~ 418 (418) ++++ +|.+..=+ T Consensus 462 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 497 (513) T protein:vir:97 462 EEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGG 497 (513) T ss_pred HhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCC Confidence 1111 01111111 No 220 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=98.18 E-value=3e-06 Score=50.83 Aligned_cols=383 Identities=13% Similarity=0.124 Sum_probs=174.2 Q ss_pred Cccchh----------------------hHHHHhcCCC------CccccC-ccccCCHHHHHHHHHc---CCccchhhhc Q lcl|NC_019404. 1 MVKTDS----------------------YANIFLGGSD------GSEIYG-SLQNQAPTILASLYAD---NALVRRIIDT 48 (418) Q Consensus 1 ~~~~D~----------------------~~n~~~g~~~------~~~~~~-~~~~~~~~~l~~~Y~~---~~~~r~iVd~ 48 (418) ..+.|. -...-.|.++ .+.+++ .....+-.+|-.-||+ +|.+..+|+. T Consensus 10 ~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~pEvd~Av~e 89 (516) T protein:vir:10 10 WDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLTNNPEVERAVAN 89 (516) T ss_pred ccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhhhccchhHHHHH Confidence 111111 1110011000 011111 1222344566666654 8999999999 Q ss_pred chhhhcc-----CCccccCcch------HHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccC Q lcl|NC_019404. 49 IPETALA-----AGFHIDGIDD------EPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVRE 113 (418) Q Consensus 49 ~a~d~~r-----~~~~i~~~~d------~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~ 113 (418) ++++|+- ..+++..++- .++|.++++. |++..+-.+.+|.--+.|.-+.--.+ | ++ T Consensus 90 Ivneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKii-d--------~~ 160 (516) T protein:vir:10 90 IVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIM-P--------NP 160 (516) T ss_pred hhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEe-c--------Cc Confidence 9999976 2344432221 2356666665 45566666666655555555433223 2 34 Q ss_pred CCceEEEEEeecccccccccc-c-cc-ccccc-CcceEEEE---------ecC--CcccccccCcccEEEecCccchhhh Q lcl|NC_019404. 114 GAELETVRVYDRTQVKVQNRE-E-NP-RNARF-GKPLTYRI---------TTN--ESDMFYDVHYSRIHIIDGERVPNAM 178 (418) Q Consensus 114 ~~~i~~i~v~~~~~i~~~~~~-~-dp-~s~~y-g~p~~y~i---------~~~--~~~~~~~iH~SR~i~~~g~~lp~~~ 178 (418) +..|..++.+||..+...... . |. ...-+ |--++|.+ ++. +...+.+|+.+= |+|+..-+-+. T Consensus 161 k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~da-I~y~hSGl~d~- 238 (516) T protein:vir:10 161 KEGIVELRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIPRSA-IVYAHSGLQDC- 238 (516) T ss_pred ccceeeeeeeCCcceeeEEeeecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecchhh-eeeeecCcccC- Confidence 566777777777666543111 1 10 00001 11122222 221 111224455442 23332211111 Q ss_pred hhccccCCcchHHHH--HHHHHHHHHHHHHHHHHHHHHc---C-CceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCc Q lcl|NC_019404. 179 RRQNDGWGRSVLSSD--ILDSIKDYTNCERLATQLLRRK---Q-QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVG 252 (418) Q Consensus 179 ~~~~~~~G~S~l~~~--~~~~l~~~~~~~~~~~~l~~~~---~-~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~ 252 (418) ....=.|-|..+ ..+.|+-.+.+ -++++. - -.|+=++ .+++... +.+ +-+. ..+...+ T Consensus 239 ---~~~~i~syLhkAiKp~NQLkm~EDA-----lVIYRitRAPeRRvFYID-VGnLPk~-KAe-qYl~--~iM~k~K--- 302 (516) T protein:vir:10 239 ---SDRGIVGYLHNAVKPANQLKLLEDA-----LVIYRITRAPERRVFYID-VGNMPNR-KAT-EYVN--GIMQSLK--- 302 (516) T ss_pred ---CCCceeceehhhhHhHHhhHHHHhh-----HHHHhhhccccceEEEEe-cCCCCch-hHH-HHHH--HHHHhcC--- Confidence 000001212211 11222222222 222322 1 1233333 2222111 111 1111 1111111 Q ss_pred ceeEEEcC--------------------------CCceeEee--cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCcccc Q lcl|NC_019404. 253 QAIGIDAE--------------------------SEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGL 304 (418) Q Consensus 253 ~~~~~d~~--------------------------~e~~~~~~--~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl 304 (418) +-++-|.. +-+++.+. -+++.++| +..|..-+=.|.++|.++|-.+++..+ T Consensus 303 NklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~SRl~~e~~~~~ 381 (516) T protein:vir:10 303 NRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMDD-VRWFNKKLYEALRIPLSRMPRDDGGMV 381 (516) T ss_pred ceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHHHHhCCCcccccCCCCcee Confidence 11111211 12233333 24555666 468889999999999999976666555 Q ss_pred --ccch--hHHHHHHHHHHHHHHHHHHHHHHHHHHHH------hhcc-------CCceEEeCCCCCCCHHHHHHHHHHHH Q lcl|NC_019404. 305 --SSSQ--NTALETFHKLIDRKRNAELLPILEFLIPF------IVNA-------EEWSVEFSPLDHESSKDKAEVLEKSV 367 (418) Q Consensus 305 --~stg--e~d~~~y~~~I~~~Qe~~l~p~l~~l~~~------i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a 367 (418) +.++ .-|.-.|..+|.+.|..+ .+++..+++. ++.. +++.|+|..=..-+|...+|+...+. T Consensus 382 ~~Gr~~EItRDEiKF~KFI~rLR~rF-s~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl 460 (516) T protein:vir:10 382 IGGQDMAITRDELDFRKFIVQLQHNF-EEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRV 460 (516) T ss_pred eccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHH Confidence 2222 123346999999998754 3333333222 3333 35678888777778888899998888 Q ss_pred HHHHHHH--hCCCCCHHHHHHHHHhhcCcCCCChhhcccccccCCCc-cccccC Q lcl|NC_019404. 368 NSIAALI--AAGAMDIKEARDTLRTIAPEIKIGDNDIQTEESELITE-TEVVIA 418 (418) Q Consensus 368 ~a~~~~~--~~g~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e-~e~~~~ 418 (418) .+++.+- -...++.+-++...-. .++++|++.+..++.| .+++|. T Consensus 461 ~~l~~~dpyvGky~s~~yi~k~ILr------~tDeei~~~~k~I~~E~~~~~~~ 508 (516) T protein:vir:10 461 DALSQIEPYVGKYVSHDYVMKNILQ------MTDEQIAQEEKQIEKEANVKRFQ 508 (516) T ss_pred HHHHHhhhhhccccchHHHHHHHhc------CCHhHHHHHHHHHHHhhhCCCCC Confidence 8888874 3557888877754322 2344443333322222 111221 No 221 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=98.17 E-value=3.1e-06 Score=50.75 Aligned_cols=385 Identities=10% Similarity=0.118 Sum_probs=179.2 Q ss_pred Cccch---hhHHHHh--cC--CCCccccCcc-----ccCCHHHHHHHHHc---CCccchhhhcchhhhccC-----Cccc Q lcl|NC_019404. 1 MVKTD---SYANIFL--GG--SDGSEIYGSL-----QNQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHI 60 (418) Q Consensus 1 ~~~~D---~~~n~~~--g~--~~~~~~~~~~-----~~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i 60 (418) +..-| |-.-+=. ++ ++.+..++.+ ...+-.+|-..|++ +|.+..+|+.++++|+-. .+++ T Consensus 30 ~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l 109 (521) T protein:vir:65 30 IAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVENAVQNIVNDAIVFEEGHEVVSL 109 (521) T ss_pred ccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEE Confidence 22111 1110000 00 0011111111 12244566666654 899999999999999762 3333 Q ss_pred cCcc------hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccc Q lcl|NC_019404. 61 DGID------DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKV 130 (418) Q Consensus 61 ~~~~------d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~ 130 (418) .-++ -.++|.++++. |++..+-.+.+|.--+.|.-+.-..+++ +++..|..++.+||..+.. T Consensus 110 ~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~--------~pk~GI~ELr~lDPr~i~~ 181 (521) T protein:vir:65 110 NLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIGK--------NPKDGIVELRQLDPRNLEY 181 (521) T ss_pred EecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEcC--------CccccceeeeeeCCcceee Confidence 2211 12356666654 5667777777776666776666555531 1355677777777766654 Q ss_pred cccc---ccccccccCcc-eEEEEecC-----------CcccccccCcccEEEecCccchhhhhhccccCCcchHHHHH- Q lcl|NC_019404. 131 QNRE---ENPRNARFGKP-LTYRITTN-----------ESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDI- 194 (418) Q Consensus 131 ~~~~---~dp~s~~yg~p-~~y~i~~~-----------~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~- 194 (418) .... ..+.-..++.. ++|.++++ +...+.+|+.+=+ +|+..-+-+ .+.+.=.|-|..++ T Consensus 182 vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI-~y~hSGl~d----~~~~~i~syLhkAiK 256 (521) T protein:vir:65 182 VREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAI-TYAHSGLMD----CDDKYIIGYLHRAVK 256 (521) T ss_pred eeeecccccCCcceecceeeeeeeecCCcceeccceeecCCcceeechhhe-eeeecccee----CCCCeeeecchhhhH Confidence 3211 11111112222 22222211 1122345555433 333222111 11111112222211 Q ss_pred -HHHHHHHHHHHHHHHHHHHHcC----CceeecchHHHhhcCcchHHHHHHHHHHHHHhcC------Ccce-------eE Q lcl|NC_019404. 195 -LDSIKDYTNCERLATQLLRRKQ----QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSG------VGQA-------IG 256 (418) Q Consensus 195 -~~~l~~~~~~~~~~~~l~~~~~----~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~------~~~~-------~~ 256 (418) .+.|+-.+.+ .++++.+ -.|+=++ .+++... +.+ +-+. ..+...++ .++. +. T Consensus 257 p~NQLkm~EDA-----lVIYRitRAPeRRvFYID-vGnlPk~-KAe-qYl~--~im~k~kNklvYDa~TGev~ddrk~ms 326 (521) T protein:vir:65 257 PANQLKLLEDA-----MVVYRITRAPERRVFFID-TGNMNNR-KAA-QHMN--SVAQSFKNRVVYDASTGKLKNQQANLS 326 (521) T ss_pred hHHhhHHHHhh-----HHHHhhhccccceEEEEe-cCCCCch-hHH-HHHH--HHHHhcCceeEeecccccccccccccc Confidence 1223322222 2233321 1233333 2233211 111 1111 11111111 0000 00 Q ss_pred E----------EcCCCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCcccccc--ch--hHHHHHHHHHHH Q lcl|NC_019404. 257 I----------DAESEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSS--SQ--NTALETFHKLID 320 (418) Q Consensus 257 ~----------d~~~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~s--tg--e~d~~~y~~~I~ 320 (418) + .+.+-+++.+.- +++.++| +..|..-+=.|.++|.++|-.++.+|++- ++ .-|.-.|..+|. T Consensus 327 MlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~ 405 (521) T protein:vir:65 327 MTEDYWLQRRDGKAITDVTTLPGASGMSDIDD-IRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIR 405 (521) T ss_pred hhhhhcccccCCCCccceeecccCCCcChHHH-HHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHHH Confidence 0 011223444432 5566666 46888999999999999996666666542 11 223356999999 Q ss_pred HHHHHHHHHHHHHHHHH------hhccC-------CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHHHHH Q lcl|NC_019404. 321 RKRNAELLPILEFLIPF------IVNAE-------EWSVEFSPLDHESSKDKAEVLEKSVNSIAALIA--AGAMDIKEAR 385 (418) Q Consensus 321 ~~Qe~~l~p~l~~l~~~------i~~~~-------~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~--~g~i~~~e~r 385 (418) +.|..+ .+++..+++. ++..+ .+.|+|..=..-+|...+|+...+.++++.+-. .-.+|.+-++ T Consensus 406 rLR~rF-s~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~ 484 (521) T protein:vir:65 406 TLQSQF-SEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVM 484 (521) T ss_pred HHHHHH-HHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHH Confidence 998764 4444443332 23233 467888877777888889998888888887643 2256777776 Q ss_pred HHHHhhcCcCCCChhhcccccc------------cCCCccccc Q lcl|NC_019404. 386 DTLRTIAPEIKIGDNDIQTEES------------ELITETEVV 416 (418) Q Consensus 386 ~~l~~~~~~~~~~~~~~~~~e~------------~~~~e~e~~ 416 (418) ...-. .+|++|.+.+. +++++.|+- T Consensus 485 k~ILr------~tDeei~~~~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:65 485 RDILK------YTDDQMDTEKKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred HHHhc------cCHHHHHHHHHHHHHhhhCCCCCCCcccccCC Confidence 54322 23333333222 222233333 No 222 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=98.09 E-value=6.8e-07 Score=54.41 Aligned_cols=200 Identities=11% Similarity=0.125 Sum_probs=105.4 Q ss_pred EEEeeccccccccccccccccccCcceEEEEecC---CcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHH Q lcl|NC_019404. 120 VRVYDRTQVKVQNREENPRNARFGKPLTYRITTN---ESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILD 196 (418) Q Consensus 120 i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~---~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~ 196 (418) +++- .| |. -.|++... ..+....++++.|+||.+. .+....+|.||++. +.. T Consensus 1 ~r~~-----------~d------g~-~~y~~~~~~~~~~g~~~~~~~~eilH~r~~------~~~~~~~Glspi~~-a~~ 55 (219) T protein:vir:98 1 MRVC-----------KD------GN-YKYLMKKSLYDTKSEIYEYNKNDVIFIKLY------DPMQQVYGSPDYVG-GIT 55 (219) T ss_pred Ccee-----------ec------Ce-EEEEEecceecCCceeEEeccccEEEecCC------CCCCCcceecHHHH-HHH Confidence 1110 00 11 01212111 1122356899999999653 12344579999975 566 Q ss_pred HHHHHHHHHHHHHHHHHHcCCc--eeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEE-----cCCCceeEeec Q lcl|NC_019404. 197 SIKDYTNCERLATQLLRRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGID-----AESEEYSVLNS 269 (418) Q Consensus 197 ~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d-----~~~e~~~~~~~ 269 (418) .+.....+.......+.....+ |+++++ ..+ +++..+++.+.++.. ...++.+.+++. .++-+|+.++. T Consensus 56 ~i~~~~aa~~~~~~~f~Ng~~p~gil~~~~--~~l-~~e~~~~~~~~~~~~-~g~~n~~~~~l~~~gg~~~G~~~~~~~~ 131 (219) T protein:vir:98 56 SALLNSDATIFRRRYYSNGAHMGFILYSTD--PDM-TEEMEDEIAERIRDS-KGVGNFRSMFVNIAGGHPDGLKVIPIGD 131 (219) T ss_pred HHHHHHHHHHHHHHHHhcCCCCceEEEeCC--CCC-CHHHHHHHHHHHHHh-cCcccccceeEecCCCCccceeEEEccC Confidence 6766655655555555554333 344442 112 223344555555442 222333455554 22345666665 Q ss_pred ccC--CHHHHHHHHHHHHhhhhcCCeeeeeccC---ccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hc Q lcl|NC_019404. 270 DIG--GIDAFLDKKFDRIVALSGIHEIILKNKN---VGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFI----VN 340 (418) Q Consensus 270 ~~~--gl~~~~~~~~~~iaaas~IP~t~L~G~s---~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i----~~ 340 (418) +.. ..-+...+....||.+.+||..+| |.. .++.+ +-+.....|+. ..|.|.++++-..| .. T Consensus 132 ~~~d~qfle~rk~~~~eIa~~fgVPp~~l-G~~~~~~~~~s-n~eq~~~~f~~-------~tL~P~~~~ie~~ln~~~~~ 202 (219) T protein:vir:98 132 TGQKDEFANIKNISAQDVLTSHRFPPGLS-GIIPVNTAGLG-DPLKIREAYQA-------DEVLPLQEIIAESINSDYEI 202 (219) T ss_pred CHHHHHHHHHHHhhHHHHHHHhCCCHHHc-ccccCCCCCcc-CHHHHHHHHHH-------HHHHHHHHHHHHHhhhhhcC Confidence 554 344556677899999999999765 644 23332 33445555554 45889887776665 33 Q ss_pred cCCceEEeCCCCCCCHHH Q lcl|NC_019404. 341 AEEWSVEFSPLDHESSKD 358 (418) Q Consensus 341 ~~~~~~~f~pL~~~~eke 358 (418) ..+..++|+.- .++++. T Consensus 203 ~~~~~~~F~~~-~~~d~~ 219 (219) T protein:vir:98 203 KSALKVNFKQP-EKRDKN 219 (219) T ss_pred CCccEEeecCc-ccccCC Confidence 45777888632 122222 No 223 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=98.05 E-value=2.9e-07 Score=56.45 Aligned_cols=226 Identities=13% Similarity=0.067 Sum_probs=121.4 Q ss_pred CccchhhHHHHhcCCCCccccCc-cccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcchH-------HHHHHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGS-LQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE-------PAFWSR 72 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~-~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d~-------~~i~~~ 72 (418) .-..+.....+.. .....+. ....+. .-+.+++-++.+|+.++++.-+-++.+...... ..+..+ T Consensus 13 ~~~~~~~~~~~~~---~~~~~~~~~~~v~~----~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~~~~~~~~ll~~~ 85 (251) T protein:vir:46 13 QYNEDDLQMMVQT---LPSFQGTKLRQYKD----IEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTR 85 (251) T ss_pred CCCccchhhhhhh---hccccCcCcceech----hhhhccHHHHHHHHHHHHhHhhCceEEeeCccccccchHHHHHhcc Confidence 0000000000000 0000000 000111 123457778899999999999998888532111 112222 Q ss_pred HHHhCchHHHHHHHHh-ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEe Q lcl|NC_019404. 73 WDDLEMTQNINDAWSW-ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRIT 151 (418) Q Consensus 73 ~~~l~~~~~~~~a~~~-~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~ 151 (418) -...--...|.+++.. -.++|.|++++.- + ..|.+..|.++++.++++.... .|.+.++... T Consensus 86 Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r-~---------~~G~~~~L~~i~~~~v~v~~~~-------~g~~~~~~~~ 148 (251) T protein:vir:46 86 PNPMYNGYIFKLVVFVSALLTSHGYIEITR-D---------KTGEPMNLTFRKTSEIELKSDA-------RGRLYYFHQR 148 (251) T ss_pred CCCCCCHHHHHHHHHHHHhhcCCeEEEEEE-C---------CCCcEEEEEEECCceEEEEECC-------CCcEEEEEEE Confidence 2222233455555555 4678999988753 2 2356788999999988765421 2444333222 Q ss_pred --cCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCc--eeecchHHH Q lcl|NC_019404. 152 --TNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQA--VWKAKGLAE 227 (418) Q Consensus 152 --~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~ 227 (418) ....+....+.++.||||.+.++ ...+|.||+.. +.+.|.....+.......+.....+ ++++++ . T Consensus 149 ~~~~~~g~~~~~~~~diiH~r~~~~-------dg~~G~spi~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~ 218 (251) T protein:vir:46 149 IDSNGNNIERNVKFEDMLDIKFYSL-------DGINGLSLLDT-LSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG--V 218 (251) T ss_pred eccCCcceeEEECCccEEEecCcCC-------CCeeecCHHHH-HHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCC--C Confidence 12222335789999999975431 23589999975 7899999999999999888875433 556653 1 Q ss_pred hhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCC Q lcl|NC_019404. 228 LCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESE 262 (418) Q Consensus 228 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e 262 (418) +...+..+++++++...-...++.+. +.++.+| T Consensus 219 -l~~~e~~~~~~~~~~~~~~g~~n~g~-~~~gm~~ 251 (251) T protein:vir:46 219 -LDNKKARDRAREEFPKVLVELNKLGK-LSYSMNQ 251 (251) T ss_pred -CCCHHHHHHHHHHHHHHhcCcccccc-cccccCC Confidence 22222234455555544333233343 3345544 No 224 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=98.05 E-value=5.9e-06 Score=49.27 Aligned_cols=375 Identities=14% Similarity=0.145 Sum_probs=161.6 Q ss_pred hcCCCCccccCcc---------------cc-CCHHHHHHHHHcC------------------CccchhhhcchhhhccCC Q lcl|NC_019404. 12 LGGSDGSEIYGSL---------------QN-QAPTILASLYADN------------------ALVRRIIDTIPETALAAG 57 (418) Q Consensus 12 ~g~~~~~~~~~~~---------------~~-~~~~~l~~~Y~~~------------------~~~r~iVd~~a~d~~r~~ 57 (418) |+........+.+ .+ ..|..+.++|..+ +-+|++|++... -+-.+ T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~il~G~dr~~~~~ps~r~~V~~~~~-~Lg~~ 79 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLVLRGDDSVPILMPSGRKIVEAVHR-FLGVG 79 (563) T ss_pred CCccccccCCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhhcCCCceeeeccchHHHHHHHHHH-hcCCC Confidence 3322111110000 01 1233444444332 347788998554 44556 Q ss_pred ccc--cCcc-h---HHHHHHHH----HHhCchHHHHHHHHhccccceEEEEEeecCCCc----cc-cccc--------CC Q lcl|NC_019404. 58 FHI--DGID-D---EPAFWSRW----DDLEMTQNINDAWSWARLFGGAAIVAIVKDNRA----LT-SPVR--------EG 114 (418) Q Consensus 58 ~~i--~~~~-d---~~~i~~~~----~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~----l~-~pl~--------~~ 114 (418) ..+ +..+ + .++++..+ ++=++..++.++.+|+-+-|.++..+.-+.+++ ++ .+++ .. T Consensus 80 ~~~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~rv~~vDP~~~fp~~dp 159 (563) T protein:vir:74 80 FDYLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERISVDEVDPRQIFLIEDG 159 (563) T ss_pred cEEecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCceEeecCCceeeeccCC Confidence 665 3211 1 13444433 445788999999999999998887665433221 11 1111 12 Q ss_pred CceEEEEEee---cccccccc-------------ccccccccccCc-----ceEEEE-----ecCCc-----ccccccCc Q lcl|NC_019404. 115 AELETVRVYD---RTQVKVQN-------------REENPRNARFGK-----PLTYRI-----TTNES-----DMFYDVHY 163 (418) Q Consensus 115 ~~i~~i~v~~---~~~i~~~~-------------~~~dp~s~~yg~-----p~~y~i-----~~~~~-----~~~~~iH~ 163 (418) +.+.++..++ .|..+... ++..+ -|.- -+.|.. .+... .....+|- T Consensus 160 d~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg---~~~~~~~~dae~w~lg~wd~r~~~~~~~~~~~~~~~~~ 236 (563) T protein:vir:74 160 STVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEG---MFTGRISSELTHWTLGNWDDRGAISDEQARRKEQVRSA 236 (563) T ss_pred CCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCC---CccceeeeccchhccccccccCccchhhhcccchhhhh Confidence 2222222221 22221110 01111 1110 111221 00000 00011222 Q ss_pred cc---------------EEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHh Q lcl|NC_019404. 164 SR---------------IHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAEL 228 (418) Q Consensus 164 SR---------------~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~ 228 (418) +| ++||++ ..+.+..||.|.|.. +..-+.....++--.+.++.-.+..++-+.+.+.. T Consensus 237 ~~d~e~~~LP~pi~~iPiv~~~t------ip~~~s~WG~S~La~-ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p~ 309 (563) T protein:vir:74 237 QHDEEEEELPEPISQLPLYRWRN------KPPQNSSWGTSQLEG-METLAYALNQSLTDEDATIVFQGLGMYVTNASAPV 309 (563) T ss_pred hhhchhhhccccccCccEEEcCC------CCCcccccchhhHHH-HHHHHHHHhhhhhHHHHHHHhcCCCeEEecccccc Confidence 22 223332 234567899999865 44444444444443444444456677776643322 Q ss_pred hcCcchHHHHHHHHHHHHHhcCCcceeEEEcCC-C--ceeEeec--ccCCHHHHHHHHHH-HHhhhhcCCeeeee----c Q lcl|NC_019404. 229 CDDSEGFGAARLRLAQVDNNSGVGQAIGIDAES-E--EYSVLNS--DIGGIDAFLDKKFD-RIVALSGIHEIILK----N 298 (418) Q Consensus 229 ~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~-e--~~~~~~~--~~~gl~~~~~~~~~-~iaaas~IP~t~L~----G 298 (418) .. ...+... ++ -+ -+.++=.+++ + -++.++. +++++..=++.+.+ .++..+++|.+-|. | T Consensus 310 d~---~~g~~~~-w~-----vg-pG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~ 379 (563) T protein:vir:74 310 DP---NTGELTD-WN-----IG-PMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVT 379 (563) T ss_pred cc---ccccccc-cc-----cC-CceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccccc Confidence 11 1111100 10 01 1222222221 2 2555554 34566655776666 57888999998774 2 Q ss_pred cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHH----HHHHhh-------cc---C------------CceEEeCCCC Q lcl|NC_019404. 299 KNVGGLSSSQNTALETFHKLIDRKRNAELLPILEF----LIPFIV-------NA---E------------EWSVEFSPLD 352 (418) Q Consensus 299 ~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~----l~~~i~-------~~---~------------~~~~~f~pL~ 352 (418) ..++|.+= +-.+.-.-..++.++ ..+.-.+.. .+.+++ .. . .+.+.|.|.. T Consensus 380 ~~~SGiAL--eL~L~PL~a~~~ek~-l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~ 456 (563) T protein:vir:74 380 SAESGISL--ELQLKPLLAANEEKE-LEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPM 456 (563) T ss_pred cccchhhh--hhhhhHHHHhhhhhH-HHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCC Confidence 33444321 222222222222221 112222221 222211 11 1 2567899998 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcC--------CCChhhccc---ccccCCCccccccC Q lcl|NC_019404. 353 HESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEI--------KIGDNDIQT---EESELITETEVVIA 418 (418) Q Consensus 353 ~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~--------~~~~~~~~~---~e~~~~~e~e~~~~ 418 (418) ..+..+ ..+-...++++|+|+.+.|.+.|.+.+-.. .+..++|.+ .+-+... .=++-| T Consensus 457 P~d~~~-------vv~~~~tl~~aGiiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~-~~~~~a 525 (563) T protein:vir:74 457 PVNKTQ-------VTQDTLLLQQAHLILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADA-SLGLSA 525 (563) T ss_pred CccHHH-------HHHHHHHHHHcCchhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccC-ccccee Confidence 777654 355667899999999999999997765111 112222222 1100000 000000 No 225 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=98.03 E-value=6.5e-06 Score=49.01 Aligned_cols=382 Identities=14% Similarity=0.128 Sum_probs=178.5 Q ss_pred Cccchhh----------HHHHhcCCCCccccCcc-ccCCHHHHHHHHHc---CCccchhhhcchhhhccC-----Ccccc Q lcl|NC_019404. 1 MVKTDSY----------ANIFLGGSDGSEIYGSL-QNQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHID 61 (418) Q Consensus 1 ~~~~D~~----------~n~~~g~~~~~~~~~~~-~~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i~ 61 (418) ++.-|.- -+.+.+++-.+++++.- ..++-.+|-..|++ +|.+..+|+.++++|+-- .+.+. T Consensus 31 ~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~ma~~pEvd~Av~eIvneaiv~d~~~~pV~i~ 110 (521) T protein:vir:10 31 FAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSLSKYHEVDNAIDEIINDAIVQEDNRDTVYLD 110 (521) T ss_pred cccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHHHhhccchhhHHHhhhcceEEecCCCceEEEE Confidence 1111100 00001111111122211 11345677777754 899999999999999762 23332 Q ss_pred Ccc------hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeeccccccc Q lcl|NC_019404. 62 GID------DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ 131 (418) Q Consensus 62 ~~~------d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~ 131 (418) -++ -+++|.++++. |++..+-.+.+|.--+.|.-+.-..++ |=+++..|..++.+||..+... T Consensus 111 Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid-------~~~pk~GI~Elr~lDPr~i~~v 183 (521) T protein:vir:10 111 LDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSRIYFHKMID-------PARPKDGIKELRLLDPRNVEYY 183 (521) T ss_pred ecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeeEEEEEEee-------CCCccccceeeeeeCCcceeee Confidence 111 12356666664 466777667777555566544433332 1224456777777777665432 Q ss_pred cc-c--cccccccc-CcceEEEEec---------CCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHH--HH Q lcl|NC_019404. 132 NR-E--ENPRNARF-GKPLTYRITT---------NESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDI--LD 196 (418) Q Consensus 132 ~~-~--~dp~s~~y-g~p~~y~i~~---------~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~--~~ 196 (418) .. . .+..-..+ |--++|.+++ +..+...+|+.| .|+|+..-+- ..+.+...|-|..++ .+ T Consensus 184 r~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~d-aI~y~hSGL~----d~~~~~i~syLhkAiKp~N 258 (521) T protein:vir:10 184 RVNLKSNENGNDVYKGVKEFFTYGATEDNRYNISGNSNNLVQIPID-AIVYSHSGKV----DIDGKTIVGYLHNVIKPAN 258 (521) T ss_pred eeecCCCCCcchhhccceeeeeeccCCCceecCCCCCCcceeechh-heeeecccce----eCCCCceeccchhhhHhHH Confidence 11 0 00000001 1112333321 112233567775 4445433222 222233444443322 12 Q ss_pred HHHHHHHHHHHHHHHHHHcC----CceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC------------ Q lcl|NC_019404. 197 SIKDYTNCERLATQLLRRKQ----QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE------------ 260 (418) Q Consensus 197 ~l~~~~~~~~~~~~l~~~~~----~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~------------ 260 (418) .|+-.+.+ .++++.+ -.|+=++ .+++... + +++-+. ..+...++ -++=|.. T Consensus 259 QLkm~EDA-----lVIYRitRAPeRRvFYID-vGnlpk~-K-AeqYl~--~iM~k~kN---klVYDa~TGev~ddrk~ms 325 (521) T protein:vir:10 259 QLKMLEDA-----MVIYRITRAPERRVFYID-VGTMPNK-K-ATQHLN--NVMQGLKN---RVVYDSSTGKVKNSSNNLA 325 (521) T ss_pred hhHHHHhh-----HHHHhhhccccceEEEEe-cCCCCch-h-HHHHHH--HHHHhcCc---eEEEeccCceeccchhhhh Confidence 23322222 2233321 1233333 2222211 1 111111 01111111 1111111 Q ss_pred --------------CCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccc--cch--hHHHHHHHHHHH Q lcl|NC_019404. 261 --------------SEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLS--SSQ--NTALETFHKLID 320 (418) Q Consensus 261 --------------~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~--stg--e~d~~~y~~~I~ 320 (418) +-+++.+.- +++.++| +..|..-+=.|.++|.++|-.+ .+|++ .++ .-|.-.|..+|. T Consensus 326 MlEDyWLpRReGgrgTEI~TLpggqnlgem~D-V~YF~kkLy~aLnVP~sRl~~e-~~~f~~Gr~~EItRDEikF~KFI~ 403 (521) T protein:vir:10 326 MTEDYWLMRRDGKATTEVSTLPGAQSMGEMDD-VRWFNRKLYESMKIPLSRLPQE-GAGVTFGAGNDITRDELQFTKYIR 403 (521) T ss_pred hHhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCccccCCC-CCceecccccchhHHHHHHHHHHH Confidence 122333332 4555666 4688899999999999999544 33332 222 123356999999 Q ss_pred HHHHHHHHHHHHHHHHH------hhcc-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh----CCCCCHHH Q lcl|NC_019404. 321 RKRNAELLPILEFLIPF------IVNA-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIA----AGAMDIKE 383 (418) Q Consensus 321 ~~Qe~~l~p~l~~l~~~------i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~----~g~i~~~e 383 (418) +.|..+ ..++..+++. ++.. +++.|+|..=..-+|...+|+...+..+++.+-- .-.++.+- T Consensus 404 rLR~rF-s~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dy 482 (521) T protein:vir:10 404 GLQQQF-EPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHEY 482 (521) T ss_pred HHHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchHH Confidence 998754 4444444332 2222 3567888877777888889999999888887733 33677777 Q ss_pred HHHHHHhhcCcCCCChhhcccccc------------cCCCccccc Q lcl|NC_019404. 384 ARDTLRTIAPEIKIGDNDIQTEES------------ELITETEVV 416 (418) Q Consensus 384 ~r~~l~~~~~~~~~~~~~~~~~e~------------~~~~e~e~~ 416 (418) ++...-. .+|++|.+.+. ++++|.|+- T Consensus 483 i~k~ILr------~tDeeik~~~k~I~~E~~~~~~~~p~~e~~df 521 (521) T protein:vir:10 483 VMKNILR------MSDEDIKTEREKIDGELKDSVYKNPEDPMEEF 521 (521) T ss_pred HHHHHhc------CCHhHHHHHHHHHHHhhhCCCCCCCcchhhcC Confidence 7754322 23333332222 222232332 No 226 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=97.98 E-value=8.4e-06 Score=48.41 Aligned_cols=382 Identities=12% Similarity=0.114 Sum_probs=180.4 Q ss_pred Cccchhh----------HHHHhcCCCCccccCc--cccCCHHHHHHHHHc---CCccchhhhcchhhhccC-----Cccc Q lcl|NC_019404. 1 MVKTDSY----------ANIFLGGSDGSEIYGS--LQNQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHI 60 (418) Q Consensus 1 ~~~~D~~----------~n~~~g~~~~~~~~~~--~~~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i 60 (418) ++.-|.- -+...||.... .++. ..-.+-.+|-..|++ +|.+..+|+.++++|+-. .+++ T Consensus 35 ~~~p~~~dGa~~i~~~~~~~~~~g~~~~-~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaIv~~~~~~pV~l 113 (524) T protein:vir:98 35 VAPPKNNDGAYEIETDLNNQKYAGVFQQ-FYSGQDPAIQNKEQLINTYRGIMSYPEVENAVSEIIDDAIVNEQGKDIITM 113 (524) T ss_pred ccCCCCCCCceeecCCCCcceecceeee-eccccccccchHHHHHHHHHHHhhccchhhHHHhhhcceeEecCCCceEEE Confidence 2111111 11111111111 1111 111345567666654 899999999999999752 3333 Q ss_pred cCcc------hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccc Q lcl|NC_019404. 61 DGID------DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKV 130 (418) Q Consensus 61 ~~~~------d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~ 130 (418) .-++ -.++|.++++. |++..+-.+.+|.--+.|.-+.-..+++ . .++ .|..++.+||..+.. T Consensus 114 ~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~-~------~~k-GI~ELr~lDPr~i~~ 185 (524) T protein:vir:98 114 DLAKTNFSKAIQDKIVEEFDNVLNIYDFDNMGARLFRDWYVDSRIYFHKIMHK-D------ESK-GIRELRQLDPRCMEL 185 (524) T ss_pred EecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEcC-C------CCc-ceeeeeeeCCcccee Confidence 2211 12356666654 5667777777776667776666665542 1 123 488888888877654 Q ss_pred cc-c---cccccccccCc-ceEEEEecC-----------CcccccccCcccEEEecCccchhhhhhccccCCcchHHHHH Q lcl|NC_019404. 131 QN-R---EENPRNARFGK-PLTYRITTN-----------ESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDI 194 (418) Q Consensus 131 ~~-~---~~dp~s~~yg~-p~~y~i~~~-----------~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~ 194 (418) .. . ..|....-+.. -++|.+++. +.....+|+.+=+++....-++ ..+ +. .|-|..++ T Consensus 186 vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAIvy~hSGL~d-~~~---~i--isyLhkAi 259 (524) T protein:vir:98 186 IRESITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIKIPRSAIVYAHSGLED-CSN---NI--IGYLHRAV 259 (524) T ss_pred eeeccccccccchhhccceeeeeeeccCCCccccccceecCCCceeechhheeeeccCccc-CCC---Ce--eeehhHhh Confidence 21 1 11212222222 234444321 0112246777765544322221 111 10 13233211 Q ss_pred --HHHHHHHHHHHHHHHHHHHHcC----CceeecchHHHhhcCcchHHHHHHHHHHHHHhcC------Cccee------- Q lcl|NC_019404. 195 --LDSIKDYTNCERLATQLLRRKQ----QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSG------VGQAI------- 255 (418) Q Consensus 195 --~~~l~~~~~~~~~~~~l~~~~~----~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~------~~~~~------- 255 (418) .+.|+-.+. |.++++.+ -.|+=++ .+++... +. ++-+. ..+...++ .++.+ T Consensus 260 Kp~NQLkm~ED-----AlVIYRitRAPeRRvFYID-vGnlPk~-KA-eqYl~--~im~k~kNklvYDa~TGevrddrk~m 329 (524) T protein:vir:98 260 KPANQLRLLED-----AMVIYRITRAPERRVFYID-VGQMGGN-KA-TQYVN--NIAQGLKNRVVYDARTGTVKNQQNNL 329 (524) T ss_pred HhHHhhHHHHh-----hHHHHhhhccccceEEEEe-cCCCCch-hH-HHHHH--HHHHhcCceeEeeccCceeecccccc Confidence 122332222 22333321 1233333 2333211 11 11111 11111121 01110 Q ss_pred EE----------EcCCCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccc--cch--hHHHHHHHHHH Q lcl|NC_019404. 256 GI----------DAESEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLS--SSQ--NTALETFHKLI 319 (418) Q Consensus 256 ~~----------d~~~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~--stg--e~d~~~y~~~I 319 (418) .+ .+.+-+++.+.- +++.++| +..|..-+=.+.++|.++|- ++.+|++ .++ .-|.-.|..+| T Consensus 330 sMlEDyWLpRReGgrgTEItTLpggqnlgem~D-V~YF~kkLy~aLnVP~sRl~-~~~~~f~~Gr~~EItRDEiKF~KFI 407 (524) T protein:vir:98 330 SMTEDYWLMRRDGKAITEVSTLPGGQNFSDMDD-IKWFNRKLYEALRVPLSRMP-RDDGGMQIGGGGEITRDELKFSKFI 407 (524) T ss_pred chhhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCceecc-CCCCccccccccchhHHHHHHHHHH Confidence 00 011122333332 4555666 46888999999999999993 2223332 221 22335699999 Q ss_pred HHHHHHHHHHHHHHHHHH------hhccC-------CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh-CC-CCCHHHH Q lcl|NC_019404. 320 DRKRNAELLPILEFLIPF------IVNAE-------EWSVEFSPLDHESSKDKAEVLEKSVNSIAALIA-AG-AMDIKEA 384 (418) Q Consensus 320 ~~~Qe~~l~p~l~~l~~~------i~~~~-------~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~-~g-~i~~~e~ 384 (418) .+.|..+ .+++..+++. ++..+ .+.|+|..=..-+|...+|+...+..+++.+-. -| .++.+-+ T Consensus 408 ~rLR~rF-s~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi 486 (524) T protein:vir:98 408 RTLQIQF-SPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYI 486 (524) T ss_pred HHHHHHH-HHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHH Confidence 9998764 4444443332 23233 467888877777888889998888888887644 23 6777766 Q ss_pred HHHHHhhcCcCCCChhhcccccc------------cCCCccccc Q lcl|NC_019404. 385 RDTLRTIAPEIKIGDNDIQTEES------------ELITETEVV 416 (418) Q Consensus 385 r~~l~~~~~~~~~~~~~~~~~e~------------~~~~e~e~~ 416 (418) +...-. .+|++|.+.+. ++++|.|.- T Consensus 487 ~k~ILr------~tDeei~~~~k~I~~E~k~~~~~~p~~e~~~f 524 (524) T protein:vir:98 487 MKEILR------MSDEDIDEQAKLIEEESKEERFKNPEAEEENF 524 (524) T ss_pred HHHHhc------cCHHHHHHHHHHHHHHHhCCCCcCCccccccC Confidence 654321 23444433322 223333332 No 227 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=97.86 E-value=1.4e-05 Score=47.19 Aligned_cols=393 Identities=13% Similarity=0.085 Sum_probs=176.2 Q ss_pred CccchhhHH--------HHhcCCCC----ccccCc-c-ccCCHH---HHHHHHHc----CCccchhhhcchhhhccCCcc Q lcl|NC_019404. 1 MVKTDSYAN--------IFLGGSDG----SEIYGS-L-QNQAPT---ILASLYAD----NALVRRIIDTIPETALAAGFH 59 (418) Q Consensus 1 ~~~~D~~~n--------~~~g~~~~----~~~~~~-~-~~~~~~---~l~~~Y~~----~~~~r~iVd~~a~d~~r~~~~ 59 (418) -.+.+...+ .+++|... +..|-+ + ....+. +.++.|.. ..+++++++..+.-.+|++.. T Consensus 6 ~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~vf~k~p~ 85 (501) T protein:vir:95 6 FIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQVFMRDPV 85 (501) T ss_pred CCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhhhhcCCcc Confidence 011111111 13444431 112211 1 111111 23333433 477888999999999999988 Q ss_pred ccCcchHHHHHHHHHH--hCchHHHHHHHHhccccceEEEEEeecC--CCcccccc--cCCCceEEEEEeecccc----- Q lcl|NC_019404. 60 IDGIDDEPAFWSRWDD--LEMTQNINDAWSWARLFGGAAIVAIVKD--NRALTSPV--REGAELETVRVYDRTQV----- 128 (418) Q Consensus 60 i~~~~d~~~i~~~~~~--l~~~~~~~~a~~~~rl~G~~~i~i~~~d--~~~l~~pl--~~~~~i~~i~v~~~~~i----- 128 (418) ++.++....+....+. .++.+.++.++.....||.|+|++.... +.....-. ...+..-||..+.+.+| T Consensus 86 ~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~rPy~~~~~~~~IinW~~ 165 (501) T protein:vir:95 86 VKVPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIRPTLYVYSPTEIINWRT 165 (501) T ss_pred eeCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCCcEEEEecHhhhcCcce Confidence 8644333333332322 3688999999999999999999987531 11100000 00111112322222111 Q ss_pred --------------------ccc---------cccccccccccCcceEEEEecCCcccccccC------cccEEEe--cC Q lcl|NC_019404. 129 --------------------KVQ---------NREENPRNARFGKPLTYRITTNESDMFYDVH------YSRIHII--DG 171 (418) Q Consensus 129 --------------------~~~---------~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH------~SR~i~~--~g 171 (418) ... ++...+..+.+..-+.|+-...+...+..+. .+..... .+ T Consensus 166 ~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 245 (501) T protein:vir:95 166 TDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQG 245 (501) T ss_pred eccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCcccccceeeeeccCC Confidence 000 0000111111222233332211110000000 0000000 01 Q ss_pred cc---chhhh-hhcccc--CCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHH Q lcl|NC_019404. 172 ER---VPNAM-RRQNDG--WGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQV 245 (418) Q Consensus 172 ~~---lp~~~-~~~~~~--~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~ 245 (418) .+ +|... ....++ -|.+||+..++=.|..|..... --++++..++++.-+.++...-....... T Consensus 246 ~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd-~~~~l~~~~~P~l~i~G~~~~~~~~~~~~--------- 315 (501) T protein:vir:95 246 KRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSAD-YEESCYIVGQPTPVLIGLTEEWVTNVLKG--------- 315 (501) T ss_pred CcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhH-HHHHHHHcccceeeeeCCcccccccCCCC--------- Confidence 11 12111 112222 3678898888777777766554 55678888888887776543211110000 Q ss_pred HHhcCCcceeEEEcCCCceeEeecccCCH-HHHHHHHHHHHhhh-hcCCeeeeeccCccccccchhHHH---HHHHHHHH Q lcl|NC_019404. 246 DNNSGVGQAIGIDAESEEYSVLNSDIGGI-DAFLDKKFDRIVAL-SGIHEIILKNKNVGGLSSSQNTAL---ETFHKLID 320 (418) Q Consensus 246 ~~~~~~~~~~~~d~~~e~~~~~~~~~~gl-~~~~~~~~~~iaaa-s~IP~t~L~G~s~~gl~stge~d~---~~y~~~I~ 320 (418) ....+.+..+.+. ++.++..++.+-.++ +..++...+++..+ +.+. ..+++ +.|++.-. ..=+..+. T Consensus 316 ~i~~G~~~~~~lP-~~~~~~~ie~~~~~i~~~~l~~l~~~m~~~Ga~ll-----~~~~~--~~Ta~~~~~~~~~~~S~L~ 387 (501) T protein:vir:95 316 SVNFGSRGGIPLP-VGADAKLLQASENTMLKEAMDTKERQMVALGAKLV-----EQKEV--QRTATEAELEAASEGSTLS 387 (501) T ss_pred ceeecccccccCC-CCCceeEEecChhhHHHHHHHHHHHHHHHHHHhhc-----cCCcc--chhHHHHHHHHHHHhHHHH Confidence 0112333334443 445677776654444 44555555555433 3322 22222 22332222 11233333 Q ss_pred HHHHHHHHHHHHHHHHHhhc-----cCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcC Q lcl|NC_019404. 321 RKRNAELLPILEFLIPFIVN-----AEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEI 395 (418) Q Consensus 321 ~~Qe~~l~p~l~~l~~~i~~-----~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~ 395 (418) ..- ..+...+++++++++. .++.+|+.++-+....-+ ...++++..+.++|.|+-++.++.|+..+.. T Consensus 388 ~~a-~~le~al~~~l~~~a~w~g~~~~~~~v~i~~df~~~~~~-----~~~~~al~~~~~~G~is~~t~~~~L~~~~v~- 460 (501) T protein:vir:95 388 SAT-KNVSAAFEWALKWAARWVGQADSGVKFELNTDFDIARMT-----PDERRSLVEEWQKGAITFEEMRTGLRKAGVA- 460 (501) T ss_pred HHH-HHHHHHHHHHHHHHHHHcCCCCCceEEEEecccccccCC-----HHHHHHHHHHHhCCCCcHHHHHHHHHhCCCC- Confidence 333 2367778888887653 244677766655433322 2336788889999999999999999875432 Q ss_pred CCCh----hhcccccccCCCc----cccc-------cC Q lcl|NC_019404. 396 KIGD----NDIQTEESELITE----TEVV-------IA 418 (418) Q Consensus 396 ~~~~----~~~~~~e~~~~~e----~e~~-------~~ 418 (418) ..+. +.|+++.++...- .... .+ T Consensus 461 ~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~~~ 498 (501) T protein:vir:95 461 TEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGDNVG 498 (501) T ss_pred ChhHHHHHHHHHhhhcCcccccccCCCCCCCccccccc Confidence 1121 1111111110000 0000 00 No 228 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=97.67 E-value=3e-05 Score=45.35 Aligned_cols=376 Identities=10% Similarity=0.050 Sum_probs=173.9 Q ss_pred CccchhhHHHHhcCCCCccccCcc------------ccCCH----HHHHHHHH-----cCCccchhhhcchhhhccCCcc Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSL------------QNQAP----TILASLYA-----DNALVRRIIDTIPETALAAGFH 59 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~------------~~~~~----~~l~~~Y~-----~~~~~r~iVd~~a~d~~r~~~~ 59 (418) +++ |++.-.+-. .+..|-+. .+..+ .+.+..|+ ..++.+++++..+.-.+|+.++ T Consensus 32 ~~~-d~g~~~~k~---~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n~~~~tl~~l~G~vfrk~p~ 107 (488) T protein:vir:96 32 RNL-DCVMDNIKR---KKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVNIVNPTMNAITGAVMRREPE 107 (488) T ss_pred Hhh-hhhhHHHHH---hhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCchhHHHHHHhcchhhccCce Confidence 222 332221110 01111111 01100 11122222 2488899999999999999999 Q ss_pred ccCcchHHHHHHHHHH-----hCchHHHHHHHHhccccceEEEEEeecCCCc-ccccccCCCceEEEEEeeccccc---- Q lcl|NC_019404. 60 IDGIDDEPAFWSRWDD-----LEMTQNINDAWSWARLFGGAAIVAIVKDNRA-LTSPVREGAELETVRVYDRTQVK---- 129 (418) Q Consensus 60 i~~~~d~~~i~~~~~~-----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~-l~~pl~~~~~i~~i~v~~~~~i~---- 129 (418) ++.++. .+++.-++. .++.+.++.+++....||.|+|++....... ..+. ...+..-++..+.+.+|- T Consensus 108 ~~~~~~-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ade-~~~~~rPy~~~~~a~~IinW~~ 185 (488) T protein:vir:96 108 FDTMDN-PVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESATMADW-NKGKKLPTAAFYDALHIIDWEV 185 (488) T ss_pred eccCCc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCHHHH-HHhcCCcEEEEechhhhcCcce Confidence 986543 234444443 4678999999999999999999987632110 0000 001111122222221110 Q ss_pred ----------cc-----cccccccccccCcceEEE-------------EecCCcccccccC-----cccEEEecCccchh Q lcl|NC_019404. 130 ----------VQ-----NREENPRNARFGKPLTYR-------------ITTNESDMFYDVH-----YSRIHIIDGERVPN 176 (418) Q Consensus 130 ----------~~-----~~~~dp~s~~yg~p~~y~-------------i~~~~~~~~~~iH-----~SR~i~~~g~~lp~ 176 (418) -. ....|+. +|+.+..|+ ...++.......| +=..|.|. . T Consensus 186 ~~v~G~~~L~~v~lrE~~~~~D~~--~~~~~~~~~~~~l~~g~~~v~~~~~~~~~~e~~~~~~g~~~l~~IP~v-----~ 258 (488) T protein:vir:96 186 EYIDGEEKLTYLSLLEDYQERDGG--TYVSKQRLINHRLVDGLCEFQEVTDDEYSDEWTPVLINSKQSDTIPFF-----L 258 (488) T ss_pred eccCCceeeEEEEEEEEEEeccCC--CcccceEEEEEEEECcEEEEEEEecCCcccceEeecCCCcccCeeEEE-----E Confidence 00 0011221 222222222 1111111111111 11112221 1 Q ss_pred hhhhc-cccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCccee Q lcl|NC_019404. 177 AMRRQ-NDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAI 255 (418) Q Consensus 177 ~~~~~-~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~ 255 (418) ..... ...-|.+||+..++=.|..|.....- -++++.+.++++-+.. ..+ ..+ ....... ...+....+ T Consensus 259 ~~~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~-~~il~~~~~p~lv~~~-~~~-~~~-~~~~~~~------~g~~~~~~~ 328 (488) T protein:vir:96 259 ASSQSNEWCIDSTPLTSLAEISLSIYVMNAYS-NKAMILANEAKWMVDM-GDM-NKT-MASEMNP------LGFTLAGRM 328 (488) T ss_pred EecCCCCCCCCCCchHHHHHHHHHHHhhhhHH-HHHHHhcCCceeeecc-CCC-Ccc-ccccccc------ceeeecccc Confidence 11122 22347889998888888999888776 6667788877654331 111 000 0000000 001111112 Q ss_pred EEEcCCCceeEeecccCCH-HHHHHHHHHHHh-hhhcCCeeeeeccCccccccchhHHHHHH---HHHHHHHHHHHHHHH Q lcl|NC_019404. 256 GIDAESEEYSVLNSDIGGI-DAFLDKKFDRIV-ALSGIHEIILKNKNVGGLSSSQNTALETF---HKLIDRKRNAELLPI 330 (418) Q Consensus 256 ~~d~~~e~~~~~~~~~~gl-~~~~~~~~~~ia-aas~IP~t~L~G~s~~gl~stge~d~~~y---~~~I~~~Qe~~l~p~ 330 (418) ....+..++...+...+.+ +..++...+++. ..+.+.. ++ + +.|+++-...+ +..+++.- ..+... T Consensus 329 ~~~~~~g~~~~~e~~~~~l~~~~l~~l~~qm~~~Ga~l~~-----~~-~--~~Ta~~~~~~~~~~~S~L~~~a-~~le~a 399 (488) T protein:vir:96 329 PYYVKNGDVKVIQAQFSPETENKVEKLFEQAVKVGASLFT-----QQ-S--NETATGAAIRSGSSTASMATLG-NNVEDT 399 (488) T ss_pred cccccCCceeecCCchhHHHHHHHHHHHHHHHHHhHhhcc-----CC-C--cchHHHHHHHHHHhhHHHHHHH-HHHHHH Confidence 2222233455555444433 445555555543 2333332 11 1 22333222111 33333332 346778 Q ss_pred HHHHHHHhhc-c---------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCc-CCCCh Q lcl|NC_019404. 331 LEFLIPFIVN-A---------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPE-IKIGD 399 (418) Q Consensus 331 l~~l~~~i~~-~---------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~-~~~~~ 399 (418) ++++++++.. . .+..|.-|.-+..-..+ ....+++..+.++|.||-+..++.|+..+.- ...+ T Consensus 400 l~~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~~~ld-----~~~~~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~- 473 (488) T protein:vir:96 400 VRNMLRFIMRYFEGTNLYVNPDELVFKLNRDYFDVEVN-----PQMLQVAYAAMMEGNLPQVSWFELLKRARVVRGDMS- 473 (488) T ss_pred HHHHHHHHHHHcCCCCCCcCccceEEEeccCCCCccCC-----HHHHHHHHHHHhcCCCCHHHHHHHHHhCCcCCccCC- Confidence 8888888753 1 23556555433222111 2346788889999999999999999876432 1222 Q ss_pred hhcccccccCCCccccc Q lcl|NC_019404. 400 NDIQTEESELITETEVV 416 (418) Q Consensus 400 ~~~~~~e~~~~~e~e~~ 416 (418) .++.+++++++.=+| T Consensus 474 --~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 474 --KEEFDEHIAELGFGM 488 (488) T ss_pred --HHHHHHHHhhcCCCC Confidence 233344555555555 No 229 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=97.63 E-value=3.5e-05 Score=45.02 Aligned_cols=293 Identities=13% Similarity=0.088 Sum_probs=129.2 Q ss_pred eEEEEEeecCCCcccccccCCCceEEEEEeeccccccccccccccccccCcceEEEEecCCcccccccCcccEEEecCcc Q lcl|NC_019404. 94 GAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQNREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGER 173 (418) Q Consensus 94 ~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~ 173 (418) -.=++....++. -.+..|...++..+ ..+..+ ++-+ ...+...+..+..+..+.+.+.|++... T Consensus 1 v~Eivw~~~~g~---------~~~~~l~~r~~~~~--~~f~~~---~~~~-l~~~~~~~~~g~~~~~lp~~kfi~~~~~- 64 (355) T protein:vir:78 1 MFEQVYRIENGR---------ARLGKLAWRPPRTI--SRFDVA---PDGG-LVAIEQWGVFGKATVRIPVDRLVVFVNE- 64 (355) T ss_pred CeEEEEEeeCCe---------EEEeeeeecCccce--eeeeec---cCCc-eeEEEecCCCCCCcceeccCCEEEEEeC- Confidence 222222222211 01222322222211 111111 1111 1222222222223356788887776532 Q ss_pred chhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHc--CCceeecchHHHhhcCcc-----hHHHHHHHHHHHH Q lcl|NC_019404. 174 VPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRK--QQAVWKAKGLAELCDDSE-----GFGAARLRLAQVD 246 (418) Q Consensus 174 lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~--~~~v~k~~~l~~~~~~~~-----~~~~~~~r~~~~~ 246 (418) ....++||.+.+ +.||-...--.....--+..+.++ .+++.+.+........+. ........+..+. T Consensus 65 -----~~~g~p~G~gLl-r~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~ 138 (355) T protein:vir:78 65 -----REGANWLGQSLL-RQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLA 138 (355) T ss_pred -----CCCCCccchhhH-HHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHH Confidence 235667899866 557776666666666677778887 777877663221111110 0011122222222 Q ss_pred Hhc-CCcceeEEEcCCCceeEeeccc--CCHHHHHHHHHHHHhhhhcCCeeeeeccC-ccccccchhHHHHHHHHHHHHH Q lcl|NC_019404. 247 NNS-GVGQAIGIDAESEEYSVLNSDI--GGIDAFLDKKFDRIVALSGIHEIILKNKN-VGGLSSSQNTALETFHKLIDRK 322 (418) Q Consensus 247 ~~~-~~~~~~~~d~~~e~~~~~~~~~--~gl~~~~~~~~~~iaaas~IP~t~L~G~s-~~gl~stge~d~~~y~~~I~~~ 322 (418) ... +...+.++.-++.+++.++..- ++...+++.+-.+||.+.--. |.-.+.+ -+|-.|-|+.-.....+.+++. T Consensus 139 ~~i~~g~~a~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGq-tlTs~~~~~gGS~Alg~vh~~v~~~~~~aD 217 (355) T protein:vir:78 139 KEFRAGEAAGGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAH-FLTLGGDKSTGSYALGDTFASFFTGSLNAV 217 (355) T ss_pred HHhhCCcceeEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHHhhh-hhccccCCccchhhHHHHHHHHHHHHHHHH Confidence 222 2223444555567888887653 346677777777777554222 1111111 1122233555556677777777 Q ss_pred HHHHHHHHH-HHHHHHhhcc----CC--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHH-HHHHHHh-hc- Q lcl|NC_019404. 323 RNAELLPIL-EFLIPFIVNA----EE--WSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKE-ARDTLRT-IA- 392 (418) Q Consensus 323 Qe~~l~p~l-~~l~~~i~~~----~~--~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e-~r~~l~~-~~- 392 (418) ... +...+ +.|+.-|+.- .. -.|+|.... +. .++.|++++.+++.|++.+++ ..+.+++ .+ T Consensus 218 ~~~-i~~~ln~~li~~l~~lN~~~~~~~P~~~~~~~~---~~-----~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gi 288 (355) T protein:vir:78 218 MKH-IADVTQQHVVEDLVDQNWGPEEPAPRLVPAQLG---KE-----QPVTAEAIRALVECGAFTADPELEKDLRARYGL 288 (355) T ss_pred HHH-HHHHHHHHHHHHHHHhcCCCCCCCCEEEecCcC---hh-----HHHHHHHHHHHHhCCCccccHHHHHHHHHHhCC Confidence 654 44444 4466655421 11 134554321 11 134689999999999876643 2333332 22 Q ss_pred CcCCCChhhcccc-cccCC----CccccccC Q lcl|NC_019404. 393 PEIKIGDNDIQTE-ESELI----TETEVVIA 418 (418) Q Consensus 393 ~~~~~~~~~~~~~-e~~~~----~e~e~~~~ 418 (418) +.....++..... ++.+. ...++.-. T Consensus 289 p~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (355) T protein:vir:78 289 PAPAERDDGADAAAAKAAGRRRAKRLPGQRQ 319 (355) T ss_pred CCCCCCCcccCCccccccccccccccCCccc Confidence 1111111111110 00000 00111100 No 230 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=97.61 E-value=3.8e-05 Score=44.83 Aligned_cols=405 Identities=13% Similarity=0.149 Sum_probs=174.8 Q ss_pred Cccchh--hH-HHHhcCCCC-cccc------Cc-------cccCCHHHHHHHHH---cCCccchhhhcchhhhccC---- Q lcl|NC_019404. 1 MVKTDS--YA-NIFLGGSDG-SEIY------GS-------LQNQAPTILASLYA---DNALVRRIIDTIPETALAA---- 56 (418) Q Consensus 1 ~~~~D~--~~-n~~~g~~~~-~~~~------~~-------~~~~~~~~l~~~Y~---~~~~~r~iVd~~a~d~~r~---- 56 (418) ++-+-| ++ ..++++..+ .+.. |. -...+..|++..|. ..+.+....++.+.-++-- T Consensus 42 ~~s~~g~p~~~~~~~~~~~~~~t~~~D~~~~g~~~~~~~~~~pr~R~qiY~~~eeM~~~p~Ia~AlniHVtaALggde~T 121 (569) T protein:vir:10 42 LFSRAGAPVQLSGFLGGKPGDSGMAGDGLVDGSRFIFDEVQLPEDRLQRYPLLEEMAVYSTIATALNIHITHALSFDKKT 121 (569) T ss_pred EEeecCcchhhhhhhccCccccchhhhhHHHHHHHHhhhccCchhHHHHHHHHHHHhcCchhhhhhhhhhheeecccccc Confidence 222222 11 011221111 1111 10 11134556665543 3555555555554444321 Q ss_pred C--ccc----cCc----chHHHHHHHHHH-hC--chHHHHHHHHhccccceEEEEEeecCCCcc-------------ccc Q lcl|NC_019404. 57 G--FHI----DGI----DDEPAFWSRWDD-LE--MTQNINDAWSWARLFGGAAIVAIVKDNRAL-------------TSP 110 (418) Q Consensus 57 ~--~~i----~~~----~d~~~i~~~~~~-l~--~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l-------------~~p 110 (418) | +-| .++ +.++++.+++.. |. +++.+-...+|.-.||.||.=|..+.++.. =+| T Consensus 122 Gd~vfI~p~~~~~~a~~daakai~~el~~dl~~~iNr~~~~lA~~~~aFGdsYaRiY~~~~~GV~dl~~s~yt~PsfIqp 201 (569) T protein:vir:10 122 GQTFSIVPVHNGNDSDYDAAQALCGELMNDIGRTINKEVAGWAFIMSVFGVAYVRPYAKEGIGITSFECSYYTLPSFIKE 201 (569) T ss_pred cceEEEEeecCCCCCcchHHHHHHHHHHHHHHHHHHHHhhHHHHHHHhhhhhheeeeccCCceeEEEEecccccccccch Confidence 1 222 111 222355555543 32 456666778888899999998877654431 134 Q ss_pred ccCCCceEEEEEeeccccccccccccccc-cccCcceEEEEecCCcccccccCcccE-EEecCccchhhhhhccccCCcc Q lcl|NC_019404. 111 VREGAELETVRVYDRTQVKVQNREENPRN-ARFGKPLTYRITTNESDMFYDVHYSRI-HIIDGERVPNAMRRQNDGWGRS 188 (418) Q Consensus 111 l~~~~~i~~i~v~~~~~i~~~~~~~dp~s-~~yg~p~~y~i~~~~~~~~~~iH~SR~-i~~~g~~lp~~~~~~~~~~G~S 188 (418) +...+...+|.+..+...+-.-.-.+|-+ -.+..|.+=.+ .+. ..||.=.. ....++ .+...-....-+|+| T Consensus 202 FE~g~~tvGF~~~~~~~~~~ti~~l~p~qm~rmKmPrm~~i--~q~---~~v~~g~~~~~L~~d-~~~~~Pi~psn~GgS 275 (569) T protein:vir:10 202 FEVSGNLAGFSGDYLKDASGKMVFADPWAIIPMKIPYWRPK--SNL---MPVHTGHKAYSLLDN-PEERTPIETQNYGTS 275 (569) T ss_pred hhhcCceEEeecccCCccccceeeechhhhhhhcccceeec--ccc---chhhhhhhheeeccc-ccccccccchhhhhH Confidence 44556666665543211110000000000 00222321000 000 01111000 001111 111111112347999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHH--HHHH--cCCceeecchHHHhhcCc--chHHHHHHHHHHHHHhcCCc---------c Q lcl|NC_019404. 189 VLSSDILDSIKDYTNCERLATQ--LLRR--KQQAVWKAKGLAELCDDS--EGFGAARLRLAQVDNNSGVG---------Q 253 (418) Q Consensus 189 ~l~~~~~~~l~~~~~~~~~~~~--l~~~--~~~~v~k~~~l~~~~~~~--~~~~~~~~r~~~~~~~~~~~---------~ 253 (418) -|+. +|+-..+...+..+... +... -++..+.++++.....+. -...+.++|.......+..+ + T Consensus 276 FL~~-ae~pf~~l~~Al~sL~~qri~dSv~~~~Itlnm~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~~~~~~~H 354 (569) T protein:vir:10 276 LLEY-AYEPYMNLRSAIRSLKATRFNASKIDRIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANNMPTVTNT 354 (569) T ss_pred HHHH-HHhHHHHHHHHHHhccchhhHHHHHhHHhhccccCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCcccccccee Confidence 7764 77766655555443211 1110 112223344333322211 23345666666665544432 2 Q ss_pred eeEEEcCCC-----ceeEeecccCCHHHHHHHHHHHHhhhhcCCeeee-eccC-ccccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 254 AIGIDAESE-----EYSVLNSDIGGIDAFLDKKFDRIVALSGIHEIIL-KNKN-VGGLSSSQNTALETFHKLIDRKRNAE 326 (418) Q Consensus 254 ~~~~d~~~e-----~~~~~~~~~~gl~~~~~~~~~~iaaas~IP~t~L-~G~s-~~gl~stge~d~~~y~~~I~~~Qe~~ 326 (418) .+=+.++.- |......+..|+.|++ +-..++|++.||-.+.| |... +|||+..| -|.-.+++.++.. T Consensus 355 ~LPv~gekq~~~tvDt~~~~A~~~gIEdvM-~~~R~LagaLGlD~SMlGwAD~LsGGLGeGG-----~frtSaQaa~RS~ 428 (569) T protein:vir:10 355 LLPIMGDGKGQMTIDTQTIQADINGIEDIL-TYMRQLAAALGLDYTLLGWADQMSGGLGEGG-----FLRTAIQAAMRAS 428 (569) T ss_pred eeeeecCccccccccccccccCcccHHHHH-HHHHHHHhhhccchhHhhHHHHhcccccccH-----HHHHHHHHHHHHH Confidence 222333322 1223333455777764 56678999999999977 3333 88886432 3666676666554 Q ss_pred -HHHHH----HHHHHHhhc--------cC--CceEEeCCCCCCCHHHHHHHHHHHHHHHHHH-------HhCCCCCHHHH Q lcl|NC_019404. 327 -LLPIL----EFLIPFIVN--------AE--EWSVEFSPLDHESSKDKAEVLEKSVNSIAAL-------IAAGAMDIKEA 384 (418) Q Consensus 327 -l~p~l----~~l~~~i~~--------~~--~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~-------~~~g~i~~~e~ 384 (418) +|..+ ++++++=+- +. -|.++|++-.+-=+.|..+++..++++.... -+++++-.+|. T Consensus 429 ~iRqa~~e~in~iidiH~~fKYgevf~~~drP~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~ 508 (569) T protein:vir:10 429 WIQQGVEEFIQRAIDIHLAFKYGKVYPEGDRPYKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDA 508 (569) T ss_pred HHHHHHHHHHHHHHHHHhhhhcCcccCCCCcceEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHH Confidence 44444 444444332 12 3899999987766666666655555554443 34444443433 Q ss_pred H--HHHHhhcCcCCCChhhc-ccccccCCCccccccC Q lcl|NC_019404. 385 R--DTLRTIAPEIKIGDNDI-QTEESELITETEVVIA 418 (418) Q Consensus 385 r--~~l~~~~~~~~~~~~~~-~~~e~~~~~e~e~~~~ 418 (418) . -.|.+.....+-..|-+ .+--.++.+|+-+|.. T Consensus 509 ~m~y~l~d~~~~De~~~e~l~ae~~akp~DEe~~~~~ 545 (569) T protein:vir:10 509 FKRYLFSDVLEIDEKISEALVNELKAKSEDDDHLMDS 545 (569) T ss_pred HHHHHHHHHhhcchhHHHHHHhhcCCCcchhHHHHHH Confidence 2 22222211111011111 1111234444444433 No 231 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=97.58 E-value=4.2e-05 Score=44.56 Aligned_cols=373 Identities=11% Similarity=0.044 Sum_probs=153.8 Q ss_pred CccchhhHHHHhcCCCCccccCc----cccCCHHHHHHHHHcCCccchhhhcchhhhccCCcccc--CcchHH-H----H Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGS----LQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID--GIDDEP-A----F 69 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~----~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~~~d~~-~----i 69 (418) ..+.-+....+.+....+..+.. .+....-++.......+-+..++.+--...++..|.|+ ++++++ + + T Consensus 23 ~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~l~~Rk~av~~~~w~v~p~~~~~~~~~~ae~v 102 (448) T protein:vir:79 23 VPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLSDGTVKNALNYIFGRIRSAKWYVEPASTDPEDIAIAAFI 102 (448) T ss_pred chhhhhhhhhhcccccccccccchhHhhccccchHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHH Confidence 11112222222221111111100 01111112222234478888889998888888888886 222221 1 2 Q ss_pred HHHHHHhC------chHHHHHHHHhccccceEEEEEeec---CCCcc-cccc-cCCCceEEEEEeecccccccccccccc Q lcl|NC_019404. 70 WSRWDDLE------MTQNINDAWSWARLFGGAAIVAIVK---DNRAL-TSPV-REGAELETVRVYDRTQVKVQNREENPR 138 (418) Q Consensus 70 ~~~~~~l~------~~~~~~~a~~~~rl~G~~~i~i~~~---d~~~l-~~pl-~~~~~i~~i~v~~~~~i~~~~~~~dp~ 138 (418) .+.++... -+..+..-+-.+.+||.|++=+.-+ +|.-. .... .+...+..|.+.+-..+..... .++. T Consensus 103 ~~~l~~~~~~~~~~~f~~~~~~~lda~~~G~s~~Eivw~~~~~g~~~~~~l~~r~~~~~~~f~~~~d~~l~~~~~-~~~~ 181 (448) T protein:vir:79 103 HAQLGIDDASVGKYPFGRLFAIYENAYIYGMAAGEIVLTLGADGKLILDKIVPIHPFNIDEVLYDEEGGPKALKL-SGEV 181 (448) T ss_pred HHHhhhhhhhhccCCHHHHHHHHHHhhhhcceeEEEEeeecCCCceecccccccCCccccceeeecCCceEEeec-CCcc Confidence 22222110 1334445556688999999755432 22210 0000 0111121221111000000000 0000 Q ss_pred ccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCC- Q lcl|NC_019404. 139 NARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQ- 217 (418) Q Consensus 139 s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~- 217 (418) -|.+ ....+..+...+++++... .+.++||.+.+ +.||-...--..+...-+..+.++++ T Consensus 182 ---~~~~--------~~~~~~~lP~~~~i~~~~~-------~~g~p~g~gLl-r~~~w~~~fK~~~~~~w~~f~E~yG~P 242 (448) T protein:vir:79 182 ---KGGS--------QFVSGLEIPIWKTVVFLHN-------DDGSFTGQSAL-RAAVPHWLAKRALILLINHGLERFMIG 242 (448) T ss_pred ---cccc--------cCCCccccccceEEEEecC-------ccCCcccchhH-HHHHHHHHHHHHHHHHHHHHHHHcCCc Confidence 0000 0001123456677776422 23567888855 56777665555666666778888874 Q ss_pred -ceeecchHHHhhcCcchHHHHHHHHHHHHH-hcCCcceeEEEcCCCceeEeecccC--CHHHHHHHHHHHHhhhhcCCe Q lcl|NC_019404. 218 -AVWKAKGLAELCDDSEGFGAARLRLAQVDN-NSGVGQAIGIDAESEEYSVLNSDIG--GIDAFLDKKFDRIVALSGIHE 293 (418) Q Consensus 218 -~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~-~~~~~~~~~~d~~~e~~~~~~~~~~--gl~~~~~~~~~~iaaas~IP~ 293 (418) .+.|.+..+ .+.. +.++.+..+.+ ......+.++.-++.+++.++..-+ ...++++.+-..||.+. T Consensus 243 ~~vgky~~ga-----~~~~-~~~~~l~~av~~i~~g~~a~~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk~i---- 312 (448) T protein:vir:79 243 VPTLTIPKSV-----RQGT-KQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARAL---- 312 (448) T ss_pred eEEEecCCCC-----CcCH-HHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCcccHHHHHHHHHHHHHHHH---- Confidence 466765322 1111 12222333222 2222344455556688888887532 35556666666666432 Q ss_pred eeeeccC-c----cccccchhHH-HHHHHHHHHHHHHHHHHHHHH-HHHHHhhc----c-CCc-eEEeCCCCCCCHHHHH Q lcl|NC_019404. 294 IILKNKN-V----GGLSSSQNTA-LETFHKLIDRKRNAELLPILE-FLIPFIVN----A-EEW-SVEFSPLDHESSKDKA 360 (418) Q Consensus 294 t~L~G~s-~----~gl~stge~d-~~~y~~~I~~~Qe~~l~p~l~-~l~~~i~~----~-~~~-~~~f~pL~~~~eke~a 360 (418) +||+ + +|-++.+.++ .....+.+++-.+. +...++ .|+.-++. . ... .|.|... |.. T Consensus 313 ---LGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~-i~~tln~~li~~l~~lNfg~~~~~P~~~f~~~------e~~ 382 (448) T protein:vir:79 313 ---GIDFNTVQLNMGVQAINIGEFVSLTQQTIISLQRE-FASAVNLYLIPKLVLPNWPSATRFPRLTFEME------ERN 382 (448) T ss_pred ---hhhhhccccccchhhhhhhhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCcCCCcEEEecCC------ChH Confidence 2433 1 1211111111 12233344443332 333333 35544432 1 111 3444321 222 Q ss_pred HHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCChhhcccccccCCCc---------cccccC Q lcl|NC_019404. 361 EVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGDNDIQTEESELITE---------TEVVIA 418 (418) Q Consensus 361 e~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e---------~e~~~~ 418 (418) |+ ++.|+++.++++.+.+..+-+++.+..-.+ .+.+.+..-...... +.-.+- T Consensus 383 Dl-~~~a~~~~~l~~~~~~~~~~~~~~~~~p~~----~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 444 (448) T protein:vir:79 383 DF-SAAANLMGMLINAVKDSEDIPTELKALIDA----LPSKMRRALGVVDEVREAVRQPADSRYLYT 444 (448) T ss_pred HH-HHHHHHhhhhhccchhhHHHHHHhhcCCCC----CCCccccccCCCCcccccccCCccccchhh Confidence 33 446888888887765544445543321111 111111111111111 111111 No 232 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=97.30 E-value=0.0001 Score=42.49 Aligned_cols=371 Identities=10% Similarity=0.063 Sum_probs=174.3 Q ss_pred ccch-----hhHH--------HHhcCCCC----ccccCccccCCHHHHHHHHHc----CCccchhhhcchhhhccCCccc Q lcl|NC_019404. 2 VKTD-----SYAN--------IFLGGSDG----SEIYGSLQNQAPTILASLYAD----NALVRRIIDTIPETALAAGFHI 60 (418) Q Consensus 2 ~~~D-----~~~n--------~~~g~~~~----~~~~~~~~~~~~~~l~~~Y~~----~~~~r~iVd~~a~d~~r~~~~i 60 (418) |..+ ...+ .+++|... +..|-....--..+-++.|.. .++++++|+..+...+|+.+.+ T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~k~p~~ 80 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVLDQPPVI 80 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhhcCCcee Confidence 1111 1111 12333322 111211111111222333333 5888999999999999999998 Q ss_pred cCcchHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeec-CCC-ccc---ccc-------cCCCceEEEEEeecccc Q lcl|NC_019404. 61 DGIDDEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNR-ALT---SPV-------REGAELETVRVYDRTQV 128 (418) Q Consensus 61 ~~~~d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~-d~~-~l~---~pl-------~~~~~i~~i~v~~~~~i 128 (418) +.++....+.....-.++.+.++.+++....||.|++++... .+. |.- .|. +..|.+..+.......+ T Consensus 81 ~~p~~l~~~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~W~~~~~g~l~~v~lre~~~~ 160 (452) T protein:vir:94 81 THPDAMSKYFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILNWEEDEDGRLLMVVLREFYTV 160 (452) T ss_pred cccHHHHHHHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcCccccccCCeeEEEEEEEEEE Confidence 765433333222223468899999999999999999998764 221 110 010 11232322111111111 Q ss_pred ccccccccccccccCc------------ceEEEE---ecCCcccccccC----------cccEEEecCccchhhhhhccc Q lcl|NC_019404. 129 KVQNREENPRNARFGK------------PLTYRI---TTNESDMFYDVH----------YSRIHIIDGERVPNAMRRQND 183 (418) Q Consensus 129 ~~~~~~~dp~s~~yg~------------p~~y~i---~~~~~~~~~~iH----------~SR~i~~~g~~lp~~~~~~~~ 183 (418) .|+.. .||. |-.|++ ...++ ....++ +=..|.|. +........ T Consensus 161 ------~d~~d-~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~-~~~~~~~~~~~~~~~~~l~~IP~v----~~~~~~~~~ 228 (452) T protein:vir:94 161 ------RDTAD-RYVQNIRVRYRCLELVDGLLQITVHETQDG-KVWELAKTSTIQNVGVTMDYIPFF----CITPSGLSM 228 (452) T ss_pred ------ecCCC-cccceeEEEEEEEEEeCCeEEEEEEEccCC-ceeeeccceeecCCCcccceeEEE----EEcCCCCCC Confidence 11110 1111 111211 11111 111111 11122221 111122223 Q ss_pred cCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCc Q lcl|NC_019404. 184 GWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEE 263 (418) Q Consensus 184 ~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~ 263 (418) ..|.+||+..++=.+..|..... --++++..++++.-+.++.+ ..+ ...+.+..+.+...+.+ T Consensus 229 ~~~~pPLl~LA~ln~~hy~~~sd-~~~~l~~~~~P~l~~~g~~~----~~~------------i~iG~~~~~~lpe~~~~ 291 (452) T protein:vir:94 229 TPAKPPMIDIVDINYSHYRTSAD-LEHGRHFTGLPTPWITGAES----QST------------MHIGSTKAWVIPEVAAK 291 (452) T ss_pred CCCccchHHHHHHHHHHhcchhH-HHHHHHHcccceeEeecCcC----CCc------------eEecccccccCCCCCCc Confidence 45889999887777788876655 56777888888777765421 111 12244445555544556 Q ss_pred eeEeecccCCH---HHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHH---HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 264 YSVLNSDIGGI---DAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETF---HKLIDRKRNAELLPILEFLIPF 337 (418) Q Consensus 264 ~~~~~~~~~gl---~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y---~~~I~~~Qe~~l~p~l~~l~~~ 337 (418) +..++.+-+++ .+.++...+++..+-. +|+-.++.+ +.|++.....+ ...+.+.- ..+...+++++++ T Consensus 292 ~~yie~~g~~i~~~~~~l~~le~~m~~~Ga----~ll~~~~~~-~~s~ea~~~~~~~~~s~L~~~a-~~~e~al~~~l~~ 365 (452) T protein:vir:94 292 VGFLEFTGQGLQSLEKALSEKQAQLASLSA----RLIDNSTRG-SEATETVKLRYMSETASLKSVT-RAVEALLNKAYSC 365 (452) T ss_pred ceEEccCchhHHHHHHHHHHHHHHHHHHHH----HhhccCCCc-chHHHHHHHHHHHhhHHHHHHH-HHHHHHHHHHHHH Confidence 77777666665 4555555555543322 222222223 22333221112 23333333 2356777888887 Q ss_pred hhc----cCCceEEeCC--CC-CCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCc-CCCChhhcccccc-- Q lcl|NC_019404. 338 IVN----AEEWSVEFSP--LD-HESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPE-IKIGDNDIQTEES-- 407 (418) Q Consensus 338 i~~----~~~~~~~f~p--L~-~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~-~~~~~~~~~~~e~-- 407 (418) +.. ..+..|+.|. .. .++. ...+++..++++|.|+-+..++.|+..+.- .....+.+.++.+ T Consensus 366 ~a~w~g~~~~~~v~~n~dF~~~~~~~--------~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~~i~~E~~~~ 437 (452) T protein:vir:94 366 IMDMESMGGTLNIKLNSAFLDSKLTA--------AELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESMGVIPDPPAP 437 (452) T ss_pred HHHHcCCCCceEEEeccccccccCCH--------HHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHHHHHHHhhcc Confidence 754 1233444332 11 2222 235667788999999999999999875531 1111111111000 Q ss_pred -------cCCCcccc Q lcl|NC_019404. 408 -------ELITETEV 415 (418) Q Consensus 408 -------~~~~e~e~ 415 (418) ++.+-+|. T Consensus 438 ~~~~~~~~~~~~~~~ 452 (452) T protein:vir:94 438 EPSPSNTPPNPSSKA 452 (452) T ss_pred CcccCCCCCCCccCC Confidence 01111222 No 233 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=97.29 E-value=0.0001 Score=42.40 Aligned_cols=385 Identities=11% Similarity=0.044 Sum_probs=175.3 Q ss_pred CccchhhHHH--------------------HhcCCCC--ccccC--ccccCCHHHHHHHHHc----CCccchhhhcchhh Q lcl|NC_019404. 1 MVKTDSYANI--------------------FLGGSDG--SEIYG--SLQNQAPTILASLYAD----NALVRRIIDTIPET 52 (418) Q Consensus 1 ~~~~D~~~n~--------------------~~g~~~~--~~~~~--~~~~~~~~~l~~~Y~~----~~~~r~iVd~~a~d 52 (418) |+++.|-.|. +++|... .+..+ ++-.-..+..++.|.. .++.+++++..+.. T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~ 80 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGS 80 (489) T ss_pred CccCCCccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHHHHHHHhch Confidence 7777766442 2444322 11111 1111111223444442 47789999999999 Q ss_pred hccCCccccCcchHHHHHHHHHH--hCchHHHHHHHHhccccceEEEEEeecCCCccc-ccccCCCceEEEEEeecccc- Q lcl|NC_019404. 53 ALAAGFHIDGIDDEPAFWSRWDD--LEMTQNINDAWSWARLFGGAAIVAIVKDNRALT-SPVREGAELETVRVYDRTQV- 128 (418) Q Consensus 53 ~~r~~~~i~~~~d~~~i~~~~~~--l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~-~pl~~~~~i~~i~v~~~~~i- 128 (418) .+|+...++.++....+....+. .++.+.++.+++....||.|+|++........+ ..-...+..-||..+.+.+| T Consensus 81 vfrk~p~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~~~Ii 160 (489) T protein:vir:78 81 VMRKEPEINIPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYTTENIV 160 (489) T ss_pred hhcCCcceeccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEechhhhc Confidence 99999888654433333333332 467899999999999999999998763211100 00000111112222222111 Q ss_pred -------------ccc----------------------cccccccccccCcceEEEEecCCcccc--cccC-cc-----c Q lcl|NC_019404. 129 -------------KVQ----------------------NREENPRNARFGKPLTYRITTNESDMF--YDVH-YS-----R 165 (418) Q Consensus 129 -------------~~~----------------------~~~~dp~s~~yg~p~~y~i~~~~~~~~--~~iH-~S-----R 165 (418) +-. ++...+......+.+.|+....+.... ..++ .+ . T Consensus 161 nW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~~~l~ 240 (489) T protein:vir:78 161 NWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGESLRG 240 (489) T ss_pred CceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccceeeEEeccCCCCccC Confidence 000 001111111112233343332222111 1121 11 1 Q ss_pred EEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHH Q lcl|NC_019404. 166 IHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQV 245 (418) Q Consensus 166 ~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~ 245 (418) .|.|. +.........-|.+||+..++=.|..|..... --++++..+++++-+.++.+... +......++ T Consensus 241 ~IPfv----~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd-~~~~l~~~~~P~l~i~G~d~~~~--~~~~~~~~~---- 309 (489) T protein:vir:78 241 VIPFT----FIGATNNDATIDDAPLLPLAELNIGHYRNSAD-NEESSFVVGQPTLFIYPGENLTP--QAFKEANPN---- 309 (489) T ss_pred eeeEE----EEecCCCCCCCCcCchHHHHHHHHHHhhhhhH-HHHHHHHcccceeeeecCccCCc--ccccccCcc---- Confidence 22221 01111122233788999888778888877765 56778888888887776432211 000000000 Q ss_pred HHhcCCcceeEEEcCCCceeEeecccCCH-HHHHHHHHHHHhh-hhcCCeeeeeccCccccccchhHHHHHH---HHHHH Q lcl|NC_019404. 246 DNNSGVGQAIGIDAESEEYSVLNSDIGGI-DAFLDKKFDRIVA-LSGIHEIILKNKNVGGLSSSQNTALETF---HKLID 320 (418) Q Consensus 246 ~~~~~~~~~~~~d~~~e~~~~~~~~~~gl-~~~~~~~~~~iaa-as~IP~t~L~G~s~~gl~stge~d~~~y---~~~I~ 320 (418) .-..+....+.+ .++.++..++.+-+++ +..++...+++.. .+.+. ..+ + +.|++.-...+ +..++ T Consensus 310 ~i~~g~~~~~~l-p~~~~~~~ie~~~~~~~r~~l~~le~qm~~lGa~l~-----~~~-~--~~Ta~~~~~~~~~~~S~L~ 380 (489) T protein:vir:78 310 GIKFGSRRGHNL-GYGGSAQLIQAGENNLARQNMLDKEQQAIQIGAQLI-----TPT-Q--QITAQSARIQRGADTSVMA 380 (489) T ss_pred ceeeCCcccccC-CCCCCcceeccCcchHHHHHHHHHHHHHHHHhhhhc-----cCC-c--chhHHHHHHHHHHhhHHHH Confidence 001122222222 2334444554443332 4444444444442 23332 111 1 23333222122 33333 Q ss_pred HHHHHHHHHHHHHHHHHhhc----c--CCceEEeCCCC---CCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhh Q lcl|NC_019404. 321 RKRNAELLPILEFLIPFIVN----A--EEWSVEFSPLD---HESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTI 391 (418) Q Consensus 321 ~~Qe~~l~p~l~~l~~~i~~----~--~~~~~~f~pL~---~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~ 391 (418) +.- ..+...+++++++++. . .+..|.-|.=+ .++. ...+++..++++|.|+.+..++.|+.. T Consensus 381 ~~a-~~~e~al~~~l~~~a~w~G~~~~~~~~i~~n~dF~~~~~d~--------~~~~al~~~~~~G~is~~t~~~~L~~~ 451 (489) T protein:vir:78 381 TIA-RNVSQAYTDALRWVAVMLGKPEDTEVEFRLNMDFFLEPMTA--------QDRAAWMADINAGLLPATAYYAALRKA 451 (489) T ss_pred HHH-HHHHHHHHHHHHHHHHHcCCCCCCceEEEeecccCcccCCH--------HHHHHHHHHHhcCCCCHHHHHHHHHhC Confidence 332 2367778888887754 1 22344333222 2222 236677788899999999999999875 Q ss_pred cCcCCCChhhcccccccCCCccccc---cC Q lcl|NC_019404. 392 APEIKIGDNDIQTEESELITETEVV---IA 418 (418) Q Consensus 392 ~~~~~~~~~~~~~~e~~~~~e~e~~---~~ 418 (418) +.. ..+++++++ +..++.... +. T Consensus 452 gv~-d~~~e~~~~---ei~~~~~~~~~~~~ 477 (489) T protein:vir:78 452 GVT-DWTDADIKD---AVADQPLPVATEVQ 477 (489) T ss_pred CCC-CccHHHHHH---HHhhcCCCcccCCc Confidence 431 223333221 111111110 00 No 234 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=97.25 E-value=0.00012 Score=42.17 Aligned_cols=392 Identities=11% Similarity=0.066 Sum_probs=161.0 Q ss_pred CccchhhHHHHhcCCCCccccCcc-cc--------CCHHHHHHHHHcCCccchhhhcchhhhccCCccccC----cchHH Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSL-QN--------QAPTILASLYADNALVRRIIDTIPETALAAGFHIDG----IDDEP 67 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~-~~--------~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~----~~d~~ 67 (418) =++.--+..++.. ...++.+.. .. .....+..+. +.+-+..++.+.-...+...|.|+- .++.+ T Consensus 10 gl~p~rl~~i~~~--~~~~~~~~~~~~~~~~Lr~~~~~~ly~~m~-~D~hi~s~l~~Rk~av~~~~w~v~p~~~~~~d~~ 86 (488) T protein:vir:95 10 SLPPFRMGEVGSL--GLKVKNGRIYEEPRQALRFPESIKTFQLMM-RDPAVAASVNIIKMFVRKVNWRFVPPKGKEQDPK 86 (488) T ss_pred CCCHHHHHHHHHH--hhccccchhhccchhhhcccchHHHHHHHh-hChHHHHHHHHHHHHHhcCCceEecCCCCchhHH Confidence 1222223333211 111122211 11 1122333343 4788888999988888888888862 22221 Q ss_pred ------HHHHHHHHhCc-hHHHHHHHHhccccceEEEEEeecCCCcccccc---cCCCc--eEEEEEeecccccccccc- Q lcl|NC_019404. 68 ------AFWSRWDDLEM-TQNINDAWSWARLFGGAAIVAIVKDNRALTSPV---REGAE--LETVRVYDRTQVKVQNRE- 134 (418) Q Consensus 68 ------~i~~~~~~l~~-~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl---~~~~~--i~~i~v~~~~~i~~~~~~- 134 (418) .++..++.+.. +..+...+-.+.+||.|+.=+.-+-+.....++ ..+|. ++.|.+.++ .+...+. T Consensus 87 ~~~~a~~v~~~l~~~~~~~~~~i~~~lda~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~i~~Rpq--~~~~~f~~ 164 (488) T protein:vir:95 87 MLERADFFNSLMDDMEHDWADFINSVMSFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAKLPIRNQ--STLDKWYF 164 (488) T ss_pred HHHHHHHHHHHHhccCccHHHHHHHHHHhhcccceeeeeeeeccccccccccccccCCeeeeeeeeecCc--ccccceee Confidence 23344455543 334444444689999999755443221111111 11222 223333222 2111111 Q ss_pred -----------ccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHH Q lcl|NC_019404. 135 -----------ENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTN 203 (418) Q Consensus 135 -----------~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~ 203 (418) +++....+..+..|.. ....+..+.+.+.+.+... ....++||.+.+. .||-...--.. T Consensus 165 d~d~~l~~~~~~~~~~~~~~~~~~~~~---~~~~~~~lP~~kfi~~~~~------~~~g~p~g~gLlr-~~~w~~~fK~~ 234 (488) T protein:vir:95 165 DEDFRRVTGVRQNLRNVSHIAGAINLG---ERPLTRKLPRAKFMLFKYD------DEYGNPEGRSPLL-NAYVPWKYKVQ 234 (488) T ss_pred ccCCCceeecccccccccccccccccc---cccccccccccceEEEeec------CCCCccchhhHHH-HHHHHHHHHHH Confidence 1111111222222211 1112345777787766432 2356788998664 56654433334 Q ss_pred HHHHHHHHHHHc--CCceeecchHHHhhc-Ccch-HHHHHHHHHHHHHh-cCC-cceeEEE------cCCCcee--Eeec Q lcl|NC_019404. 204 CERLATQLLRRK--QQAVWKAKGLAELCD-DSEG-FGAARLRLAQVDNN-SGV-GQAIGID------AESEEYS--VLNS 269 (418) Q Consensus 204 ~~~~~~~l~~~~--~~~v~k~~~l~~~~~-~~~~-~~~~~~r~~~~~~~-~~~-~~~~~~d------~~~e~~~--~~~~ 269 (418) +..--+.-+.++ .+++.+.+. ...+ +.+. ...+.+.+..+... ... ..++++- .+.+.++ ..+. T Consensus 235 ~~~~w~~f~Er~g~g~p~~~~p~--~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l~~~ 312 (488) T protein:vir:95 235 IEEYEAVGVSRDLVGMPKIGLPP--DYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSLVSR 312 (488) T ss_pred HHHHHHHHHHHhcccceeEeecc--CCCCCcccHHHHHHHHHHHHHHHHhhccchhheeeccccccccchhhhhhhcccc Confidence 444444555554 566666542 1111 1111 11223323222211 111 1122221 1111111 1222 Q ss_pred ccC---CHHHHHHHHHHHHhhhhcCCeeeeeccCccccccchhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhcc---- Q lcl|NC_019404. 270 DIG---GIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQNTALETFHKLIDRKRNAELLPIL-EFLIPFIVNA---- 341 (418) Q Consensus 270 ~~~---gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l-~~l~~~i~~~---- 341 (418) .=+ .-..+++..-..||.+.--.. .-.+...+|-+|.|+--.....+.+++-... +...+ +.|+.-++.- T Consensus 313 ~~~~~~~~~~li~~~d~~Isk~iLGqt-LT~~~~~~Gs~Al~~vh~ev~~~i~~aDa~~-i~~tln~~li~~l~~~Nfg~ 390 (488) T protein:vir:95 313 QGAKAYDTGSIIDRYSKQIMMAFMSDV-LAMGQSKYGSFSLADSKTSLLAMSVDILLKQ-IKNVINRDLVAQTYALNMWD 390 (488) T ss_pred ccCCchhHHHHHHHHHHHHHHHHhccc-cccccCcchhhhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCC Confidence 112 244567777777875542221 1112222343444555556677777666554 33333 4465544321 Q ss_pred CC--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHH-HHHHHHh-hcCcCCCChhhccccc-----c----- Q lcl|NC_019404. 342 EE--WSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGAMDIKE-ARDTLRT-IAPEIKIGDNDIQTEE-----S----- 407 (418) Q Consensus 342 ~~--~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e-~r~~l~~-~~~~~~~~~~~~~~~e-----~----- 407 (418) .. -+|.|... |..|. ++.|++++++++.|+.-++. ..+.+++ .+-...-.++...... . T Consensus 391 ~~~~P~~~~~~~------e~~Dl-~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~~~~~~~~~~~~~~~~~ 463 (488) T protein:vir:95 391 DEEHVQITYDDI------ETPDL-EAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADESQPVSEKLSPNSQSRSGDG 463 (488) T ss_pred CCCccEEEecCc------ChhhH-HHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCccccccCCCCCCCCCCcc Confidence 11 23455432 22222 46689999999999866532 2232322 2210000111111000 0 Q ss_pred ---------cCCCccccccC Q lcl|NC_019404. 408 ---------ELITETEVVIA 418 (418) Q Consensus 408 ---------~~~~e~e~~~~ 418 (418) ........-+| T Consensus 464 ~~~~~~~~~~~~~~~~~~~a 483 (488) T protein:vir:95 464 YKTAGEGTAKTPSAKDPSTA 483 (488) T ss_pred cCCCcccCCcccccccchhh Confidence 00000111111 No 235 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=97.09 E-value=0.00018 Score=41.16 Aligned_cols=382 Identities=14% Similarity=0.126 Sum_probs=179.1 Q ss_pred Cc---cchhhH-------HHHhcCCCCccccCccccCCHHHHHHHHHc---CCccchhhhcchhhhccC-----CccccC Q lcl|NC_019404. 1 MV---KTDSYA-------NIFLGGSDGSEIYGSLQNQAPTILASLYAD---NALVRRIIDTIPETALAA-----GFHIDG 62 (418) Q Consensus 1 ~~---~~D~~~-------n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~---~~~~r~iVd~~a~d~~r~-----~~~i~~ 62 (418) ++ .-||-. +..++|+-.+.+.+......-.+|-.-||+ +|.+..+|+.++++|+-- .+++.- T Consensus 23 ~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIvne~iv~d~~~~pV~l~l 102 (511) T protein:vir:56 23 FSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVKELIKSYRALAEYHEVDDAIQEIVDEAIVYENDKEVVWLNL 102 (511) T ss_pred ccCCCCCCCceEEecccccceecceeccccccccCccchHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEe Confidence 11 112211 111222212222222222222467777765 899999999999999762 333321 Q ss_pred c--c----hHHHHHHHHHH----hCchHHHHHHHHhccccceEEEEEeecCCCcccccccCCCceEEEEEeecccccccc Q lcl|NC_019404. 63 I--D----DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQN 132 (418) Q Consensus 63 ~--~----d~~~i~~~~~~----l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~i~~~~ 132 (418) + + -.++|.++++. |++..+-.+.+|.--+.|.-+.-..++ ++..|+.++.+||..+.... T Consensus 103 d~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid----------~k~GI~eLr~lDPr~i~~vr 172 (511) T protein:vir:56 103 DNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILD----------KDNNIIELRPLNPMKMELVR 172 (511) T ss_pred cccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec----------cccceeehhhcCcccchhhh Confidence 1 1 12356666654 466666666666555556544433332 23467777777776654421 Q ss_pred -cccccccc--cc-CcceEEEEecCC------------cccccccCcccEEEecCccchhhhhhccccCCcchHHHHH-- Q lcl|NC_019404. 133 -REENPRNA--RF-GKPLTYRITTNE------------SDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDI-- 194 (418) Q Consensus 133 -~~~dp~s~--~y-g~p~~y~i~~~~------------~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~-- 194 (418) ...++..- -+ +--++|.+++.+ .....+|+.+-+.... .-|-+ -..+.+...|-|..++ T Consensus 173 ~i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~daI~y~h-SGL~d--~~~~~g~i~syLhkAiKp 249 (511) T protein:vir:56 173 EIQKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKDAIVFAH-SGLMR--GCADDPYIIGYLDRAIKP 249 (511) T ss_pred hhhcccccccccccceeeeeEecCCCcccCcccccccccccceeechhheeeec-cccee--ccCCCCeeeccchhhhHH Confidence 11111110 01 113344444321 1123456665543221 11110 0123344455454322 Q ss_pred HHHHHHHHHHHHHHHHHHHHcC----CceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC---------- Q lcl|NC_019404. 195 LDSIKDYTNCERLATQLLRRKQ----QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE---------- 260 (418) Q Consensus 195 ~~~l~~~~~~~~~~~~l~~~~~----~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~---------- 260 (418) .+.|+-.+. |.++++.+ -.|+=++ .+++... . +++-+. ..+...++ -++=|.. T Consensus 250 ~NQLkm~ED-----AlVIYRitRAPeRRvFYID-VGnLPk~-K-AeqYl~--~iM~k~kN---klVYDa~TGev~ddrk~ 316 (511) T protein:vir:56 250 ANQLKMLED-----ALVIYRLARAPERRVFYVD-VGNLPTQ-K-AQQYVN--GIMQNVKN---RVVYDTQTGQVKNTTNA 316 (511) T ss_pred HHhhHHHHh-----hHHHHhhhccccceEEEEe-cCCCCch-h-HHHHHH--HHHHhcCc---eEEEeccCceeccchhh Confidence 122332222 22333321 1233333 2232211 1 111111 11111111 1111111 Q ss_pred ----------------CCceeEee--cccCCHHHHHHHHHHHHhhhhcCCeeeeecc-Ccccccc--ch--hHHHHHHHH Q lcl|NC_019404. 261 ----------------SEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNK-NVGGLSS--SQ--NTALETFHK 317 (418) Q Consensus 261 ----------------~e~~~~~~--~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~-s~~gl~s--tg--e~d~~~y~~ 317 (418) +-+++.+. -+++.++| +..|..-+=.|.++|.++|-.. +.+|+|- ++ .-|.-.|.. T Consensus 317 msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~K 395 (511) T protein:vir:56 317 MSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIED-VLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFTK 395 (511) T ss_pred hhhHhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHHH Confidence 12233333 24555666 4688899999999999999744 3455551 11 123356999 Q ss_pred HHHHHHHHHHHHHHHHHHHH------hhcc-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHH Q lcl|NC_019404. 318 LIDRKRNAELLPILEFLIPF------IVNA-------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIA--AGAMDIK 382 (418) Q Consensus 318 ~I~~~Qe~~l~p~l~~l~~~------i~~~-------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~--~g~i~~~ 382 (418) +|.+.|..+ .+++..+++. ++.. +++.|+|..=..-+|...+|+...+..+++.+-. .-.+|.+ T Consensus 396 FI~RLR~rF-s~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~ 474 (511) T protein:vir:56 396 FVKRLQTKF-ETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHK 474 (511) T ss_pred HHHHHHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchH Confidence 999998754 4444444332 2222 3567888877777888889998888888877632 2245777 Q ss_pred HHHHHHHhhcCcCCCChhhcccccccCCCc---------cccc Q lcl|NC_019404. 383 EARDTLRTIAPEIKIGDNDIQTEESELITE---------TEVV 416 (418) Q Consensus 383 e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e---------~e~~ 416 (418) -++...-. .+|++|.+.+..++.| +|+- T Consensus 475 yi~k~ILr------~tDeei~~~~k~I~~E~k~~~~~~~e~~f 511 (511) T protein:vir:56 475 YIQKNILR------LSDDQITAMQSEIDEEETNPRFQQDDQGF 511 (511) T ss_pred HHHHHHhc------cCHHHHHHHHHHHHHhhcCCCCCCcccCC Confidence 77654322 2344443333322222 2222 No 236 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=96.98 E-value=0.00023 Score=40.57 Aligned_cols=390 Identities=13% Similarity=0.052 Sum_probs=169.7 Q ss_pred CccchhhHH--------HHhcCCCC----ccccCcc--cc-CC--HHHHHHHHH----cCCccchhhhcchhhhccCCcc Q lcl|NC_019404. 1 MVKTDSYAN--------IFLGGSDG----SEIYGSL--QN-QA--PTILASLYA----DNALVRRIIDTIPETALAAGFH 59 (418) Q Consensus 1 ~~~~D~~~n--------~~~g~~~~----~~~~~~~--~~-~~--~~~l~~~Y~----~~~~~r~iVd~~a~d~~r~~~~ 59 (418) -.+.+...+ .+++|... +..|-+. .. .. ..+.++.|. ..++.+++|+..+...+|++.. T Consensus 37 ~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~ 116 (535) T protein:vir:80 37 YQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPI 116 (535) T ss_pred cCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCcc Confidence 011111111 23444322 1112111 11 00 012222222 2578899999999999999988 Q ss_pred ccCcchHHHHHHHHHH--hCchHHHHHHHHhccccceEEEEEeecCC-Cccc--ccccCCCceEEEEEeecccccc---- Q lcl|NC_019404. 60 IDGIDDEPAFWSRWDD--LEMTQNINDAWSWARLFGGAAIVAIVKDN-RALT--SPVREGAELETVRVYDRTQVKV---- 130 (418) Q Consensus 60 i~~~~d~~~i~~~~~~--l~~~~~~~~a~~~~rl~G~~~i~i~~~d~-~~l~--~pl~~~~~i~~i~v~~~~~i~~---- 130 (418) ++.++....+....+. .++.+.++.++.....||.|+|++..... ...+ +. ...+..-||..+.+.+|.- T Consensus 117 ~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade-~~~~~rPy~~~y~ae~IinW~~~ 195 (535) T protein:vir:80 117 RQLPPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQ-KLGLYRPTITLVHPTSIINWRTK 195 (535) T ss_pred eeccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHH-HhcCCCcEEEEechhhccCcccc Confidence 7654333333322222 36889999999999999999999875321 1100 00 0001111222222211100 Q ss_pred ----------c---c--cc-ccccccc--------------cCcceEEEEecCCcc-ccc--------ccCcccEEEecC Q lcl|NC_019404. 131 ----------Q---N--RE-ENPRNAR--------------FGKPLTYRITTNESD-MFY--------DVHYSRIHIIDG 171 (418) Q Consensus 131 ----------~---~--~~-~dp~s~~--------------yg~p~~y~i~~~~~~-~~~--------~iH~SR~i~~~g 171 (418) . + .. .|.+... .++...|.....+.. ... ..|+=..|.|. T Consensus 196 ~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv- 274 (535) T protein:vir:80 196 LVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQ- 274 (535) T ss_pred ccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCCccccccceeecccCCCcccCeeEEE- Confidence 0 0 00 1111111 111222222111110 000 11222233221 Q ss_pred ccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCC Q lcl|NC_019404. 172 ERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGV 251 (418) Q Consensus 172 ~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~ 251 (418) +..........|.+||+..++=.|..|..... --++++..++++.-+.++............ ....+. T Consensus 275 ---~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd-~~~il~~~~~P~l~i~G~~~~~~~~~~~~~--------~i~iG~ 342 (535) T protein:vir:80 275 ---FIGPLDNNADIDHPPLLDLCEVNIGHYRNSAD-YEEMAFVAGQPTAFFTGLTKDWVEDVFKDF--------KVHLGS 342 (535) T ss_pred ---EeecCCCCCCCCccchHHHHHHHHHHhhchhH-HHHHHHHhcCceeeeecCchhhhhcCCCCc--------ceEecC Confidence 01112223345889999888777888776655 556788888888777765433211110000 011233 Q ss_pred cceeEEEcCCCceeEeecccCCH-HHHHHHHHHHHhhhhcCCeeeeeccCccccccchh-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 252 GQAIGIDAESEEYSVLNSDIGGI-DAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQN-TALETFHKLIDRKRNAELLP 329 (418) Q Consensus 252 ~~~~~~d~~~e~~~~~~~~~~gl-~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stge-~d~~~y~~~I~~~Qe~~l~p 329 (418) +..+.+. ++.++..+..+-+++ .+.++...+++...-.-..+ +++++..++.- .+...=+..++..- ..++. T Consensus 343 ~~~~~lP-~~~~~~~~e~~~~~~a~~~l~~~e~qM~~lGa~ll~----~~~~~~Ta~~a~~~~~~~~S~L~~~a-~~le~ 416 (535) T protein:vir:80 343 RAIIPLP-QGATAGILQITPNSVPFEAMTHKESQMIAMGANLLV----KSGGNRTFGEAQQEEASEQSILSACT-KNVSM 416 (535) T ss_pred cccccCC-CCCCcceeeeccchhHHHHHHHHHHHHHHHHHHhhc----cCcccccHHHHHHHHHHHhHHHHHHH-HHHHH Confidence 3333333 334454444433332 33455555555543322222 22333322210 11111123333332 23677 Q ss_pred HHHHHHHHhhcc-------CCceEEeCCCC---CCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcC-CCC Q lcl|NC_019404. 330 ILEFLIPFIVNA-------EEWSVEFSPLD---HESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEI-KIG 398 (418) Q Consensus 330 ~l~~l~~~i~~~-------~~~~~~f~pL~---~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~-~~~ 398 (418) .++++++++..- ++..|..|.=+ .++. ..++++..++++|.|+-+..++.|+..+.-. ..+ T Consensus 417 al~~aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~ld~--------~~~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~ 488 (535) T protein:vir:80 417 AFRKALRWANQFQTGIVNDETVEYNLNTDFPAARLTP--------NERAELILEWQQGAITFKEMRAGLRRAGVASEDDA 488 (535) T ss_pred HHHHHHHHHHHHcCCccCCCceEEEeccccccccCCH--------HHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccc Confidence 788888876521 23455544322 2222 2367778889999999999999997754311 111 Q ss_pred hhh-cccccccCC----Cccc-cccC Q lcl|NC_019404. 399 DND-IQTEESELI----TETE-VVIA 418 (418) Q Consensus 399 ~~~-~~~~e~~~~----~e~e-~~~~ 418 (418) .++ ....+.+.. .-.. .-.+ T Consensus 489 ~eee~~ri~~E~~~~~~~~g~~~d~~ 514 (535) T protein:vir:80 489 KAETEGKATVEFIAKTAAAGKVGDAA 514 (535) T ss_pred hHHHHHHHHhhhhhccccCCCCCCCC Confidence 111 110111100 0000 0001 No 237 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=96.81 E-value=0.00033 Score=39.71 Aligned_cols=391 Identities=10% Similarity=0.017 Sum_probs=177.0 Q ss_pred CccchhhHHH--------------------HhcCCCCc--c-ccCccccC-CHHHHHHHHHc----CCccchhhhcchhh Q lcl|NC_019404. 1 MVKTDSYANI--------------------FLGGSDGS--E-IYGSLQNQ-APTILASLYAD----NALVRRIIDTIPET 52 (418) Q Consensus 1 ~~~~D~~~n~--------------------~~g~~~~~--~-~~~~~~~~-~~~~l~~~Y~~----~~~~r~iVd~~a~d 52 (418) |+++.|-.|. +++|.... + .+-..... ..+..++.|.. .++.+++++..+.. T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~ 80 (491) T protein:vir:95 1 MLTANGQGSGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGS 80 (491) T ss_pred CcccCCccCCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhch Confidence 7777776542 23443221 1 11111111 11222333332 47889999999999 Q ss_pred hccCCccccCcchHHHHHHHHHH--hCchHHHHHHHHhccccceEEEEEeecCCCcccc-cccCCCceEEEEEeecccc- Q lcl|NC_019404. 53 ALAAGFHIDGIDDEPAFWSRWDD--LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTS-PVREGAELETVRVYDRTQV- 128 (418) Q Consensus 53 ~~r~~~~i~~~~d~~~i~~~~~~--l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~-pl~~~~~i~~i~v~~~~~i- 128 (418) .+|+.+.++.++....+....+. .++.+.++.+++....||.|+|++........+. .-...+..-||..+.+.+| T Consensus 81 vfrk~p~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~~~~Ii 160 (491) T protein:vir:95 81 VMRKEPEINIPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPTIAFYTTENIV 160 (491) T ss_pred hhcCCceeeccHHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcEEEEechhhhc Confidence 99999988654433333333332 4678999999999999999999987532111000 0000111123333333222 Q ss_pred -------------ccccc-----cccccccccCcceEEEEe------------------cCCcccc---cccCcccEEEe Q lcl|NC_019404. 129 -------------KVQNR-----EENPRNARFGKPLTYRIT------------------TNESDMF---YDVHYSRIHII 169 (418) Q Consensus 129 -------------~~~~~-----~~dp~s~~yg~p~~y~i~------------------~~~~~~~---~~iH~SR~i~~ 169 (418) +-..+ ..||. ..|++...+++. ..++... ..+|.+.-..+ T Consensus 161 nW~~~~v~g~~~L~~v~l~E~~~~~d~~-~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~~~l 239 (491) T protein:vir:95 161 NWRLTRVGSVNRVTMVVLRETWEYHEPG-NEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQEEVVEIYPDLGESLR 239 (491) T ss_pred CceeeeeCCceeeeEEEEEEeEEeecCC-CCcccceEEEEEEEeecCCCceEEEEEEEcCCCcceeeeeeeeecCCCccc Confidence 10000 01111 124443322221 1111000 01121111011 Q ss_pred cCccchhhh-hhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHh Q lcl|NC_019404. 170 DGERVPNAM-RRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNN 248 (418) Q Consensus 170 ~g~~lp~~~-~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~ 248 (418) .--|+.... ......-|.+||+..++=.|..|..... --++++-.++++.-+.+..+.-. +. ..-.+..... T Consensus 240 ~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd-~~~~l~~~~~P~l~~~G~d~~~~--~~----~~~~~~~~i~ 312 (491) T protein:vir:95 240 GVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSAD-NEESSFVVGQPTLFIYPGDNLTP--QS----FKEANPNGIK 312 (491) T ss_pred CeeEEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhH-HHHHHHHcccceeeeecCcccCc--ch----hhccCcceeE Confidence 000111111 1122233788999888778888877765 56778888888887765432211 10 0000000011 Q ss_pred cCCcceeEEEcCCCceeEeecccCCH-HHHHHHHHHHHhhh-hcCCeeeeeccCccccccchhHHHHHH---HHHHHHHH Q lcl|NC_019404. 249 SGVGQAIGIDAESEEYSVLNSDIGGI-DAFLDKKFDRIVAL-SGIHEIILKNKNVGGLSSSQNTALETF---HKLIDRKR 323 (418) Q Consensus 249 ~~~~~~~~~d~~~e~~~~~~~~~~gl-~~~~~~~~~~iaaa-s~IP~t~L~G~s~~gl~stge~d~~~y---~~~I~~~Q 323 (418) .+.+..+.+ .++.++..++.+-+++ +..++...+++..+ +.+ +..+ + +.|+++-...+ +..+++.- T Consensus 313 ~g~~~~~~l-P~~~~~~~ie~~~~~~~~~~l~~~e~qm~~~Ga~l-----~~~~--~-~~Ta~~~~~~~~~~~S~L~~~a 383 (491) T protein:vir:95 313 FGSRCGHNL-GYGGSAQLIQAGENNLARQNMLDKEQQAIQIGAQL-----ITPS--Q-QITAESARIQRGADTSVMATIA 383 (491) T ss_pred ecCcCCcCC-CCCCccceeecCcchHHHHHHHHHHHHHHHHHHHh-----ccCC--c-chhHHHHHHHHHHhhHHHHHHH Confidence 111222222 2344566666544333 44444444443322 332 2211 1 23433322222 23333332 Q ss_pred HHHHHHHHHHHHHHhhcc------CCceEEeCCCC---CCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCc Q lcl|NC_019404. 324 NAELLPILEFLIPFIVNA------EEWSVEFSPLD---HESSKDKAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPE 394 (418) Q Consensus 324 e~~l~p~l~~l~~~i~~~------~~~~~~f~pL~---~~~eke~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~ 394 (418) ..+...+++++++++.- .+..|..|.=+ .++. ...+++..++++|.|+.+..++.|+..+.. T Consensus 384 -~~~e~al~~~l~~~a~w~G~~~~~~v~i~~n~dF~~~~~~~--------~~~~all~~~~~G~is~~t~~~~L~~~~vl 454 (491) T protein:vir:95 384 -RNVSQAYTDALRWVAMMLGKPEDSEVEFQLNMDFFLQPMTA--------QDRAAWMADINAGLLPATAYYAALRKAGVT 454 (491) T ss_pred -HHHHHHHHHHHHHHHHHcCCCCCCceEEEeecccccccCCH--------HHHHHHHHHHhcCCCCHHHHHHHHHhCCCC Confidence 23567778887776531 23344333221 2222 236777888999999999999999875532 Q ss_pred CCCChhhcccc-ccc-------CCCccccccC Q lcl|NC_019404. 395 IKIGDNDIQTE-ESE-------LITETEVVIA 418 (418) Q Consensus 395 ~~~~~~~~~~~-e~~-------~~~e~e~~~~ 418 (418) ..++|++.+. +++ ++.-.|.+-| T Consensus 455 -~~~~e~~~~~ie~~~~~~~~~~~~~~~~~~~ 485 (491) T protein:vir:95 455 -DWTDEDILNAIEDAPLPSGAVTQVAGEIPQA 485 (491) T ss_pred -CccHHHHHHHHHhcCCCCCccccccccchhh Confidence 3343332111 111 1111222222 No 238 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=96.24 E-value=0.00083 Score=37.47 Aligned_cols=375 Identities=10% Similarity=0.032 Sum_probs=160.4 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhh---ccCCccccCcc-----------h- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA---LAAGFHIDGID-----------D- 65 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~---~r~~~~i~~~~-----------d- 65 (418) +.+-|+-. .++.....-.-+..+.. +-++.++. ..+ .+.||.+.-.+ + T Consensus 32 ~~~~~~~~--------~~~~~~~~~dstg~~a~-----~~LAa~l~----~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~ 94 (510) T protein:vir:78 32 LMVDPMSG--------SRGVVEHDFQSAGALLV-----NNLAAKLA----RSLFPTGIPFFRSELTDAIRREADSRDTDI 94 (510) T ss_pred cccCCCCc--------ccccccCcccchHHHHH-----HHHHHHHH----HhhcCCCCcccccCCChHHhhhcccCcchH Confidence 22222211 11111111111111111 11111111 111 12344442110 0 Q ss_pred ----------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCccccccc-------CCCceEEEEEeecccc Q lcl|NC_019404. 66 ----------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVR-------EGAELETVRVYDRTQV 128 (418) Q Consensus 66 ----------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~-------~~~~i~~i~v~~~~~i 128 (418) ++.+.+.+.+-++...+.++++.--.+|.+.+++.-+++.--.-|+. ..|.+..+ +-+..+ T Consensus 95 ~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~pl~~y~v~~d~~G~vd~i--~rr~~~ 172 (510) T protein:vir:78 95 TEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRDATGRWMDI--VLKQRY 172 (510) T ss_pred HHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEeCCCCeEEEEEcceeEEeeCCCcCeeEE--Eeeeec Confidence 12344556677899999999999888998887765322222223442 33555332 222222 Q ss_pred ccc----cccccccc-----cccCcceEEEEe-cC-Cccc-ccccCc----ccEEEecC-----cc-ch-hhhhhccccC Q lcl|NC_019404. 129 KVQ----NREENPRN-----ARFGKPLTYRIT-TN-ESDM-FYDVHY----SRIHIIDG-----ER-VP-NAMRRQNDGW 185 (418) Q Consensus 129 ~~~----~~~~dp~s-----~~yg~p~~y~i~-~~-~~~~-~~~iH~----SR~i~~~g-----~~-lp-~~~~~~~~~~ 185 (418) ++. .+..+..+ ..+...+.|+.. .. +..+ ..-+|. .++..-.+ .| ++ -+.+..+.-| T Consensus 173 t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~i~~~~~~~~~e~P~~~~Rw~~~~ge~Y 252 (510) T protein:vir:78 173 KSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHY 252 (510) T ss_pred cHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeecCCCCcEEEEEEEecCeeeccccccccccCCeeeeeeeecCCCcc Confidence 211 11111100 011122222221 10 0000 111221 12211111 11 11 2345566689 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCcee Q lcl|NC_019404. 186 GRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYS 265 (418) Q Consensus 186 G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~ 265 (418) |.||.+. +++.++..............++.-..+-.+. .+... ..++ .....+.++-+..+++. T Consensus 253 Grgp~~~-~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p--------~g~~~-~~~l------~~~~~g~~v~g~~~~v~ 316 (510) T protein:vir:78 253 GRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVDE--------AKGAV-VDDY------QDAEMGDYVPGGAEAVR 316 (510) T ss_pred ccchHHH-HHHHHHHHHHHHHHHHHHHHHhhcCCcccCC--------ccccc-hhhh------ccCCCceeecCCccccc Confidence 9999986 8899999999988888777666544444332 11100 0111 11222333455556666 Q ss_pred Eeec----ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch-----hHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 266 VLNS----DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ-----NTALETFHKLIDRKRNAELLPILEFLIP 336 (418) Q Consensus 266 ~~~~----~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg-----e~d~~~y~~~I~~~Qe~~l~p~l~~l~~ 336 (418) .++. +|.-....++.+.+.|..++= .. |..+....+.||- ++-....--.+.+.|.-.+.|.+++.+. T Consensus 317 ~~~~~~~~d~~~~~~~i~~~~~rI~~aF~--~~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 393 (510) T protein:vir:78 317 AYERGDYNKMAAIQQSLQAVVVRLNQAFM--YG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLS 393 (510) T ss_pred ccccCcccchHHHHHHHHHHHHHHHHHHh--hc-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 6553 345556778888888887751 22 3222222344431 1122334445667777788999999988 Q ss_pred Hhhcc-------C---CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh----CCCCCHHHHHHHHHhhcCcCCCChhhc Q lcl|NC_019404. 337 FIVNA-------E---EWSVEFSPLDHESSKDKAEVLEKSVNSIAALIA----AGAMDIKEARDTLRTIAPEIKIGDNDI 402 (418) Q Consensus 337 ~i~~~-------~---~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~----~g~i~~~e~r~~l~~~~~~~~~~~~~~ 402 (418) ++... + ...+++- ..+.-.++++-.....+.++.+.+ .-.|+.+++.+.+.... |++...+ T Consensus 394 il~r~gl~p~p~~~~~~~~v~~i--s~Laraq~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~---Gv~p~~i 468 (510) T protein:vir:78 394 EVDDALLQGLITKQHKPAIETGL--PALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAF---SVDTSQF 468 (510) T ss_pred HHHhccCCCCCcccccceeeecc--cHHHHHHHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHh---CCChhhh Confidence 88642 1 1223322 122222233322222333333222 22377888877765322 2222222 Q ss_pred ccccccCCCccc------cccC Q lcl|NC_019404. 403 QTEESELITETE------VVIA 418 (418) Q Consensus 403 ~~~e~~~~~e~e------~~~~ 418 (418) -..+.+.....+ ..=| T Consensus 469 vrs~eev~a~~~~~~~q~~~~~ 490 (510) T protein:vir:78 469 YKSADELQAEAEEQRRQAAQAQ 490 (510) T ss_pred cCCHHHHHHHHHHHHHHHHHHH Confidence 211111111111 1111 No 239 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=95.87 E-value=0.0013 Score=36.35 Aligned_cols=376 Identities=10% Similarity=0.032 Sum_probs=157.3 Q ss_pred CccchhhHHH-----HhcCCCC-ccccCccccCCHHHHHHHHHcCCccchhhhcchhhh---ccCCccccCcc------- Q lcl|NC_019404. 1 MVKTDSYANI-----FLGGSDG-SEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA---LAAGFHIDGID------- 64 (418) Q Consensus 1 ~~~~D~~~n~-----~~g~~~~-~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~---~r~~~~i~~~~------- 64 (418) .-+..-+... +..-++. +......-.-+..+...- ++.+ .-..+ .+.||.+.-.+ T Consensus 20 e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~-----Laa~----l~~~ltpp~~~WF~l~~~d~~~~~~~ 90 (555) T protein:vir:17 20 LDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNV-----LASK----LMLSLFPVNTSFFKLQINDAEIDNLG 90 (555) T ss_pred HHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHH-----HHHH----HHHhhcCCCCcccccccCHHHHhhcc Confidence 0000011110 0000001 111011111111111111 1111 11111 22455553211 Q ss_pred ---------------hHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCccccccc-------CCCceEEEEE Q lcl|NC_019404. 65 ---------------DEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVR-------EGAELETVRV 122 (418) Q Consensus 65 ---------------d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~-------~~~~i~~i~v 122 (418) -++.+...+.+-++...+.++++.--.+|.+++++.- ++.. .-|+. ..|.+..+ T Consensus 91 ~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~-~~~~-~~pl~~y~v~~d~~G~vd~v-- 166 (555) T protein:vir:17 91 MDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQGK-KNLK-LYPLDRFVVSRDGEGNVMEI-- 166 (555) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEecC-Ccee-EEEcCeEEEeeCCCcCeeEE-- Confidence 1123455566778999999999988889999888753 2221 12332 33444322 Q ss_pred eeccccccccc-----------------ccccccc----------ccCcce--------------EEEEecCCccccccc Q lcl|NC_019404. 123 YDRTQVKVQNR-----------------EENPRNA----------RFGKPL--------------TYRITTNESDMFYDV 161 (418) Q Consensus 123 ~~~~~i~~~~~-----------------~~dp~s~----------~yg~p~--------------~y~i~~~~~~~~~~i 161 (418) +-+..++.... ..+|..+ +...+. .++.....+. .+ T Consensus 167 ~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~~~~~~~~~~~e~~~~---~v 243 (555) T protein:vir:17 167 VTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCRKDGQVKWHQECDGK---VI 243 (555) T ss_pred EeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeecccccCCeeEEEEecCce---ec Confidence 21222221100 0011110 000011 1111111000 00 Q ss_pred -CcccEEEecCcc-c-hhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHH Q lcl|NC_019404. 162 -HYSRIHIIDGER-V-PNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAA 238 (418) Q Consensus 162 -H~SR~i~~~g~~-l-p~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~ 238 (418) |+.+---|...| + +-+.+..+.-||.||.+. +++.++............+.++.-..+..+.-+.. T Consensus 244 ~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~---------- 312 (555) T protein:vir:17 244 PGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEE-FMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATT---------- 312 (555) T ss_pred cccccccCcccCCeeeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc---------- Confidence 111111111112 1 223455667899999986 88999999999999988888876666555421100 Q ss_pred HHHHHHHHHhcCCcceeEEEcCCCceeEeecc----cCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch-----h Q lcl|NC_019404. 239 RLRLAQVDNNSGVGQAIGIDAESEEYSVLNSD----IGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ-----N 309 (418) Q Consensus 239 ~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~----~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg-----e 309 (418) .+. +.....++.++.+..+++..+... |.-+...++.+.+.|.-++-+ +.-+....+.+|- + T Consensus 313 -~~~----~l~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~----~~~~d~~r~TAtEV~~r~~ 383 (555) T protein:vir:17 313 -KPQ----NLALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLM----LQVRQSERTTATEVQATVQ 383 (555) T ss_pred -Ccc----eeecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhh----cCCCCcccchHHHHHHHHH Confidence 000 011122333445555667666543 444556677777777766532 2122333344431 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC-------C-ceEEeC-CCCCCCHHHHHHHHHHHHHHHHHHHhCC--- Q lcl|NC_019404. 310 TALETFHKLIDRKRNAELLPILEFLIPFIVNAE-------E-WSVEFS-PLDHESSKDKAEVLEKSVNSIAALIAAG--- 377 (418) Q Consensus 310 ~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~~-------~-~~~~f~-pL~~~~eke~ae~~~~~a~a~~~~~~~g--- 377 (418) +-...+--.+.+++...+.|++++.+.++.+.. + ..+.+. +|..+... +++ .+..+.++.+.+.+ T Consensus 384 E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~~l~~l~r~--~~~-~~l~~~~~~laq~~~~p 460 (555) T protein:vir:17 384 ELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVAGLWGVGRG--QDK-QQLMEFITTLAQTMGPE 460 (555) T ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceeehHHHHHHH--HHH-HHHHHHHHHHHhhcCch Confidence 222445555666777789999999999887642 1 111111 11111111 222 22333444444432 Q ss_pred ----CCCHHHHHHHHHhhcCcCCCChhhcccccccCCCccccccC Q lcl|NC_019404. 378 ----AMDIKEARDTLRTIAPEIKIGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 378 ----~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e~e~~~~ 418 (418) .|+.+++.+.+... .|++...+-..+.+.....+..=+ T Consensus 461 ~~~d~id~d~~~~~~a~~---~Gv~p~~ivrs~eev~~~rq~~~~ 502 (555) T protein:vir:17 461 IAMKYINPTEFIKRLAAA---QGIDTLQLINSPETMKQLGDQQKQ 502 (555) T ss_pred hHhhcCCHHHHHHHHHHH---cCCChhhhcCCHHHHHHHHHHHHH Confidence 37777777665432 222222221111111100000000 No 240 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=95.47 E-value=0.002 Score=35.38 Aligned_cols=385 Identities=11% Similarity=0.082 Sum_probs=173.2 Q ss_pred CccchhhHHH-----Hhc-CCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcc---------h Q lcl|NC_019404. 1 MVKTDSYANI-----FLG-GSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGID---------D 65 (418) Q Consensus 1 ~~~~D~~~n~-----~~g-~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~---------d 65 (418) .-+..-+... +.. ++..+......-.-+..+...-.. +++...+ .|+ +.||.+...+ . T Consensus 27 e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~La-s~l~~~l--tP~----~~WFrl~~~d~~~~~~~~~~ 99 (522) T protein:vir:94 27 ETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVGARCLNNLA-AKLMLAL--FPQ----SPWMRLTVSEYEAKTLSQDS 99 (522) T ss_pred HHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHH-HHHHhhc--CCC----CcccccccchhhhhccCccc Confidence 1111111111 100 011111111111112222222211 3333333 242 3577774211 0 Q ss_pred -------------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecC-CCc--c-ccccc-------CCCceEEEE Q lcl|NC_019404. 66 -------------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKD-NRA--L-TSPVR-------EGAELETVR 121 (418) Q Consensus 66 -------------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d-~~~--l-~~pl~-------~~~~i~~i~ 121 (418) ++.+...+.+-++...+.++++.--.+|.+++++.-+. +.. + ..|+. ..|.+..+. T Consensus 100 ~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~vd~i~ 179 (522) T protein:vir:94 100 EAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGTYSPMRMYRLVSYVVQRDAFGNILQIV 179 (522) T ss_pred chhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCceeeEEEEEcceEEEeeCCCcCeEEEe Confidence 11244456667899999999998888999998875322 111 1 23542 335443221 Q ss_pred E---eeccccccc---cccccccccccCcceEEEE-ecCCcccccccCcc---cEE-E------ecCcc--chhhhhhcc Q lcl|NC_019404. 122 V---YDRTQVKVQ---NREENPRNARFGKPLTYRI-TTNESDMFYDVHYS---RIH-I------IDGER--VPNAMRRQN 182 (418) Q Consensus 122 v---~~~~~i~~~---~~~~dp~s~~yg~p~~y~i-~~~~~~~~~~iH~S---R~i-~------~~g~~--lp~~~~~~~ 182 (418) - +...+++.. ....|...|+ -..+.|+. ...... ..+|.+ ..+ - |...| ++-+.+..+ T Consensus 180 r~~~~~~~~l~~~~~~~~~~~~~~p~-~~v~v~~~v~~~~~~--~~~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~g 256 (522) T protein:vir:94 180 TIDKVAFSALPEDVKSQLNADDYEPD-TELEVYTHIYRQDDE--YLRYEEVEGIEVTGTDGSYPLTACPYIPVRMVRLDG 256 (522) T ss_pred eeeeccHHhcchHHHHHHhcccCCcc-ceEEEEEEEEeeCCc--eeEEeeccCceecccCCCCccccCCceeeeeeecCC Confidence 1 112222221 1122222221 22233332 221111 112211 111 1 11122 122345566 Q ss_pred ccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCC Q lcl|NC_019404. 183 DGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESE 262 (418) Q Consensus 183 ~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e 262 (418) .-||.||.+. +++.++............+.++.-..+..+.-+ ..+. .++ ..+.++. ++.+..+ T Consensus 257 e~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g----~~~~-----~~~-----~~~~~g~-~v~g~~~ 320 (522) T protein:vir:94 257 EDYGRSYCEE-YLGDLNSLETITEAITKMAKVASKVVGLVNPNG----ITQP-----RRL-----NKAATGE-FVAGRVE 320 (522) T ss_pred CccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhCCceeecccc----cccc-----hhe-----eccCCce-eecCCcc Confidence 6899999986 889999999999999999888776666654211 0000 000 1122233 3445455 Q ss_pred ceeEeec----ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccc-----hhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 263 EYSVLNS----DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSS-----QNTALETFHKLIDRKRNAELLPILEF 333 (418) Q Consensus 263 ~~~~~~~----~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~st-----ge~d~~~y~~~I~~~Qe~~l~p~l~~ 333 (418) ++..+.. +|.-....++.+.+.|..++=+- .+.-+....+.+| .++-...+--.+.+.+...+.|++++ T Consensus 321 ~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r 398 (522) T protein:vir:94 321 DINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLN--SAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRV 398 (522) T ss_pred cceeeecccccchhHHHHHHHHHHHHHHHHHhhh--hhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH Confidence 6665442 45556777888888888777322 1211233445443 12223445556677788889999999 Q ss_pred HHHHhhcc--------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCC------CCCHHHHHHHHHhhcCcCCCCh Q lcl|NC_019404. 334 LIPFIVNA--------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAG------AMDIKEARDTLRTIAPEIKIGD 399 (418) Q Consensus 334 l~~~i~~~--------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g------~i~~~e~r~~l~~~~~~~~~~~ 399 (418) .+.++.+. +.++++|-+. +....|+.-..+..+.++.+.+.+ -|+.+++.+.+.... |++. T Consensus 399 ~~~il~r~g~lP~~p~~~v~v~~~s~--La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~---Gv~~ 473 (522) T protein:vir:94 399 LMNQLQSAGMIPDLPKEAVEPTVSTG--LEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNAL---GIDT 473 (522) T ss_pred HHHHHHhcCCCCCCCcccEEeeEecH--HHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHc---CCCh Confidence 99998653 3466666532 222233222222222233222221 267777776664432 2222 Q ss_pred hhcccccccCC------CccccccC Q lcl|NC_019404. 400 NDIQTEESELI------TETEVVIA 418 (418) Q Consensus 400 ~~~~~~e~~~~------~e~e~~~~ 418 (418) ..+-..+.+.. ...+.+-+ T Consensus 474 ~~ivr~~ee~~~~~~q~~~~~~~~~ 498 (522) T protein:vir:94 474 AGLLLTQDEKIQRMAEQSSQQAVVQ 498 (522) T ss_pred hhccCCHHHHHHHHHHHHHHHHHHH Confidence 22211111100 00111111 No 241 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=95.45 E-value=0.0021 Score=35.32 Aligned_cols=387 Identities=10% Similarity=0.040 Sum_probs=175.6 Q ss_pred CccchhhHH-----HHhcCCCC-ccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcc---------- Q lcl|NC_019404. 1 MVKTDSYAN-----IFLGGSDG-SEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGID---------- 64 (418) Q Consensus 1 ~~~~D~~~n-----~~~g~~~~-~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~---------- 64 (418) .-+..-+.. .+...++. ++.....-.-+..+...-.. +++...+. |+ +.||.+.-.+ T Consensus 29 e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~La-a~l~~~lt--P~----~~WF~l~~~d~~~~~~~~~~ 101 (543) T protein:vir:88 29 ETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAVGARGLNNLS-AKVMLALF--PL----QSWMKLKVSEWQAKQLVSDP 101 (543) T ss_pred HHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHH-HHHHHhhc--CC----CcccccccChHHHhcccCCh Confidence 111111111 11111111 11111111112222222211 23333332 43 3577774211 Q ss_pred -hH-----------HHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCC-----ccc-ccc-------cCCCceEE Q lcl|NC_019404. 65 -DE-----------PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNR-----ALT-SPV-------REGAELET 119 (418) Q Consensus 65 -d~-----------~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~-----~l~-~pl-------~~~~~i~~ 119 (418) +. +.+...+.+.++...+.++++.--.+|.+++++.-+.+. ++. .|+ +..|.+.. T Consensus 102 ~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~v~~ 181 (543) T protein:vir:88 102 SQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASSNSYNPMKLYTLHNHVVQRDAFGNVLQ 181 (543) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCccccceecceEEeEcceEEEeeCCCCCeee Confidence 11 124445666789999999999877899999887543221 111 344 23444432 Q ss_pred EEEeeccccccccc---------cccccccccCcceEEEEe-cCCcccccccCc---ccEEEec-------Ccc--chhh Q lcl|NC_019404. 120 VRVYDRTQVKVQNR---------EENPRNARFGKPLTYRIT-TNESDMFYDVHY---SRIHIID-------GER--VPNA 177 (418) Q Consensus 120 i~v~~~~~i~~~~~---------~~dp~s~~yg~p~~y~i~-~~~~~~~~~iH~---SR~i~~~-------g~~--lp~~ 177 (418) ++-+..++...+ ...-..| |...+.|+.. +........+|. |-.+.-. ..| ++-+ T Consensus 182 --i~r~~~~~~~~l~~~~~~~v~~~~~~~p-~~~~~v~~~V~pr~~~~~~~~~~~~~~~~v~~~~~~~~~~e~P~i~~Rw 258 (543) T protein:vir:88 182 --IVTLDKVAYAALPEDVRNSLSGGQEYKP-EQELEVYTHIYIDDESGDFLSYQEIEGVEVDGSDGQYPQDALPWIAVRW 258 (543) T ss_pred --eeeeeeccHHHHhHHhhHHHHHHhhcCC-ccceEEEEEEEeecCCCcccccccccCeeeecCCCccccccCCceeeee Confidence 222222222111 0001122 2334444432 221111122331 2222111 112 1123 Q ss_pred hhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEE Q lcl|NC_019404. 178 MRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGI 257 (418) Q Consensus 178 ~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~ 257 (418) .+..+.-||.||.+. +++.++............+.++.-..+..+.- +.. ... +......+.++ T Consensus 259 ~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~------g~~-----~~~----~~~~~~~g~~v 322 (543) T protein:vir:88 259 TKRDGEHYGRSHVEE-YLGDLNSLESLNEAMIKFAMISSKVVGLVNPN------GIT-----QVR----RLVKAQTGDFV 322 (543) T ss_pred eecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeeccc------ccc-----chh----hcccCCCceee Confidence 445566799999986 88999999999999988888776666555421 100 000 01111222334 Q ss_pred EcCCCceeEeec----ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch-----hHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 258 DAESEEYSVLNS----DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ-----NTALETFHKLIDRKRNAELL 328 (418) Q Consensus 258 d~~~e~~~~~~~----~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg-----e~d~~~y~~~I~~~Qe~~l~ 328 (418) .+..+++..+.. +|......++.+.+.|.-++=+- -+..+....+.+|- ++-...+--.+.+++...+. T Consensus 323 ~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~ 400 (543) T protein:vir:88 323 AGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLN--SAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQL 400 (543) T ss_pred cCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhh--hhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHH Confidence 444456655433 46667777888888888766322 12223344454431 22224455566777888899 Q ss_pred HHHHHHHHHhhcc--------CCceEEeCC-CCCCCHHHHHHHHHHHHHHHHHHHh---CCCCCHHHHHHHHHhhcCcCC Q lcl|NC_019404. 329 PILEFLIPFIVNA--------EEWSVEFSP-LDHESSKDKAEVLEKSVNSIAALIA---AGAMDIKEARDTLRTIAPEIK 396 (418) Q Consensus 329 p~l~~l~~~i~~~--------~~~~~~f~p-L~~~~eke~ae~~~~~a~a~~~~~~---~g~i~~~e~r~~l~~~~~~~~ 396 (418) |++++.+.++.+. +++++++-+ |-.+.-..+++-....++.+..+.+ .-.|+.+++.+.+.... | T Consensus 401 Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~---G 477 (543) T protein:vir:88 401 PIVRVLLNQLQATQQIPNLPQEAVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAI---G 477 (543) T ss_pred HHHHHHHHHHHhcCCCCCCchhceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHh---C Confidence 9999999988653 345566542 3333333333333333333333333 22367888777765322 2 Q ss_pred CChhhcccccccCCCc-cccccC Q lcl|NC_019404. 397 IGDNDIQTEESELITE-TEVVIA 418 (418) Q Consensus 397 ~~~~~~~~~e~~~~~e-~e~~~~ 418 (418) ++...+-..+++.... .+.+.+ T Consensus 478 v~~~~i~r~~~e~~~~~~q~~~q 500 (543) T protein:vir:88 478 IDTAGLLLTEAEKAQAQSQEMLK 500 (543) T ss_pred CChhhhcCCHHHHHHHHHHHHHH Confidence 2222221111111100 000010 No 242 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=94.83 E-value=0.0034 Score=34.13 Aligned_cols=386 Identities=11% Similarity=0.131 Sum_probs=167.3 Q ss_pred CccchhhHHH-------HhcCCC-Cccc-cCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcch------ Q lcl|NC_019404. 1 MVKTDSYANI-------FLGGSD-GSEI-YGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDD------ 65 (418) Q Consensus 1 ~~~~D~~~n~-------~~g~~~-~~~~-~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d------ 65 (418) .-+..-+... |.+... .+.. ....-.-+..+... -++.++.-..-- ..+.||.+.-.++ T Consensus 24 e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~-----~Las~l~~~ltp-p~~~WF~l~~~d~~~~~~~ 97 (556) T protein:vir:73 24 ESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQR-----ILSSGMMSGITS-PARPWFKLATPDPDMMDYG 97 (556) T ss_pred HHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHH-----HHHHHHHHhhcC-CCCcccccccCcccccchH Confidence 1111111111 111110 0000 11111111111111 111111111000 2345777642211 Q ss_pred ---------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcc---cccc-------cCCCceEEEEEeecc Q lcl|NC_019404. 66 ---------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRAL---TSPV-------REGAELETVRVYDRT 126 (418) Q Consensus 66 ---------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l---~~pl-------~~~~~i~~i~v~~~~ 126 (418) ++.+...+.+-++...+.++++.--.||.+.+++.-+.++.+ .-|+ +..|.+.. |+-+. T Consensus 98 ~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~r~~~~~l~~~~~~~d~~G~vd~--i~r~~ 175 (556) T protein:vir:73 98 PVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQDVIRTMPFPIGSYYLANSPRGSVDT--CIRQF 175 (556) T ss_pred HHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCceEEEEEeecceeEEeeCCCCCeEE--EEEEE Confidence 123445566778899999999988899999998764322221 2333 23344432 22111 Q ss_pred ccc--------------cc---cccccccccccCcceEEEE-ecCCcccc------------cccC----cccEEEecC- Q lcl|NC_019404. 127 QVK--------------VQ---NREENPRNARFGKPLTYRI-TTNESDMF------------YDVH----YSRIHIIDG- 171 (418) Q Consensus 127 ~i~--------------~~---~~~~dp~s~~yg~p~~y~i-~~~~~~~~------------~~iH----~SR~i~~~g- 171 (418) .++ .. .+..+|.... .+.++. .+...... .-+| ..+++.-.| T Consensus 176 ~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~---~~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~~~~~vl~esg~ 252 (556) T protein:vir:73 176 SMTVRQMVQEFGLDNVSTSVKGMWENGTYETW---VEVNHCITPNVNRDSGKMDSKNKPYRSVYFESGGDSDKLLRESGF 252 (556) T ss_pred eccHHHHHHHcCcccCCHHHHHHHhcCCccce---EEEEEEEeccccccccccCcccceEEEEEEEecCCCceecccCCc Confidence 111 11 1112221111 111111 11100000 0011 111221111 Q ss_pred --cc--chhhhhhccccCCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHH Q lcl|NC_019404. 172 --ER--VPNAMRRQNDGWGRS-VLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVD 246 (418) Q Consensus 172 --~~--lp~~~~~~~~~~G~S-~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~ 246 (418) .| ++-+.+..+.-||.| |.+. +++.++............+..+.-..+..+.-. .. ..+.. T Consensus 253 ~e~P~~~~Rw~~~~ge~YGrg~P~~~-~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~------~~-----~~~~~-- 318 (556) T protein:vir:73 253 DEFPILAPRWEVNGEDVYASSCPGML-ALGQVKALQVEQKRKAQLIDKATNPPMVAPTSL------KN-----QRVSL-- 318 (556) T ss_pred ccCCceeeeeeecCCcccccCccHHH-hHHHHHHHHHHHHHHHHHHHHHhcCceeccccc------cc-----cceee-- Confidence 11 122345667789998 7875 789999999999998888888776666554211 00 00110 Q ss_pred HhcCCcceeEEEcCCCceeEee---cccCCHHHHHHHHHHHHhhhhcCCeeeeeccC-ccccccc-----hhHHHHHHHH Q lcl|NC_019404. 247 NNSGVGQAIGIDAESEEYSVLN---SDIGGIDAFLDKKFDRIVALSGIHEIILKNKN-VGGLSSS-----QNTALETFHK 317 (418) Q Consensus 247 ~~~~~~~~~~~d~~~e~~~~~~---~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s-~~gl~st-----ge~d~~~y~~ 317 (418) .-+-.+.....+..+.+..+. .++..+...++.+.+.|..++=......+++. ..-+.+| .++-....-- T Consensus 319 -~pgg~~~~~~~~~~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~ 397 (556) T protein:vir:73 319 -LPGDVTYLDVISGQDGFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGP 397 (556) T ss_pred -ccCccccccCCCCccceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHHHHHHHHHHHHhhH Confidence 001001111223334565543 34445556667778888877755433333332 3334443 2222234445 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcc------------CCceEEeCCCCCCCHHHHHHH---HHHHHHHHHHHHhCC----- Q lcl|NC_019404. 318 LIDRKRNAELLPILEFLIPFIVNA------------EEWSVEFSPLDHESSKDKAEV---LEKSVNSIAALIAAG----- 377 (418) Q Consensus 318 ~I~~~Qe~~l~p~l~~l~~~i~~~------------~~~~~~f~pL~~~~eke~ae~---~~~~a~a~~~~~~~g----- 377 (418) .+.+.+.-.+.|.+++.+.++.+. .+++++|-+. +....++.- ....++.+..+.+.+ T Consensus 398 v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~--La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d 475 (556) T protein:vir:73 398 VLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISV--MAQAQKSIGLTSLSQTVGFIGQLAQFKPEALD 475 (556) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecH--HHHHHHHHHHHHHHHHHHHHHHHhccChhhHh Confidence 566667778999999999988752 2466676543 222222222 234444454554443 Q ss_pred CCCHHHHHHHHHhhcCcCCCChhhcccccccCCCccccccC Q lcl|NC_019404. 378 AMDIKEARDTLRTIAPEIKIGDNDIQTEESELITETEVVIA 418 (418) Q Consensus 378 ~i~~~e~r~~l~~~~~~~~~~~~~~~~~e~~~~~e~e~~~~ 418 (418) .|+.+++.+.+.... |++. .+-..+.+.....+...+ T Consensus 476 ~id~d~~~~~~a~~~---Gvp~-~~irs~eev~~~rq~r~~ 512 (556) T protein:vir:73 476 KLDVDQAIDAFSEMS---GVSP-TVIVPQEQVQGIREERAK 512 (556) T ss_pred cCCHHHHHHHHHHHc---CCCh-hhcCCHHHHHHHHHHHHH Confidence 377777777664322 2221 121111111111111111 No 243 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=94.41 E-value=0.0045 Score=33.46 Aligned_cols=384 Identities=10% Similarity=0.075 Sum_probs=166.1 Q ss_pred Cc--cchhhHHHHhcCCCCccccC-ccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcch------------ Q lcl|NC_019404. 1 MV--KTDSYANIFLGGSDGSEIYG-SLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDD------------ 65 (418) Q Consensus 1 ~~--~~D~~~n~~~g~~~~~~~~~-~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d------------ 65 (418) ++ +.+.+.+ ..+.+....... ..-.-+..+... -++.++.-..-- ..+.||.+.-.+. T Consensus 31 ~lP~~~~~~~~-~~~~~~~~~~~~~~i~dst~~~a~~-----~Las~L~~~ltP-p~~~WF~l~~~d~~~~~~~~v~~~L 103 (547) T protein:vir:10 31 IMPMRSDFFSD-LRSEGSINWNQNREVFDSTAGDGLE-----TLSSSLHGSLTS-PATKWFELAFRDKELNSDDECRKWL 103 (547) T ss_pred hcccccccccC-CCCCcccccccccccccchHHHHHH-----HHHHHHHHhhcC-CCCcccccccCCccccchHHHHHHH Confidence 11 1111111 111111000000 000001111111 111111111000 1235666542211 Q ss_pred ---HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCC--Cc---cccccc-------CCCceEEEEEeecccccc Q lcl|NC_019404. 66 ---EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDN--RA---LTSPVR-------EGAELETVRVYDRTQVKV 130 (418) Q Consensus 66 ---~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~--~~---l~~pl~-------~~~~i~~i~v~~~~~i~~ 130 (418) ++.+...+.+-++...+.++++.--.||.+.+++.-+.+ .. -.-|+. ..|.+.. ++-+..++. T Consensus 104 ~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl~~~~v~~d~~G~v~~--i~r~~~~t~ 181 (547) T protein:vir:10 104 ENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPIQDSYFEEDSRGQVVN--FYRVFRWTP 181 (547) T ss_pred HHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEeecceEEEeeCCCcCeee--eeeeeeccH Confidence 123455667778999999999988889999888754211 11 123332 2344422 121111111 Q ss_pred --------------c---cccccccccccCcceEEEEe-cCC---c-----------------------ccccccCcccE Q lcl|NC_019404. 131 --------------Q---NREENPRNARFGKPLTYRIT-TNE---S-----------------------DMFYDVHYSRI 166 (418) Q Consensus 131 --------------~---~~~~dp~s~~yg~p~~y~i~-~~~---~-----------------------~~~~~iH~SR~ 166 (418) . ....||..+. -+.+.|+.. +.. . +...-++.|.. T Consensus 182 ~qi~~~fg~~~l~~~v~~~~~~~~~~~~-~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~p~~s~~~e~~~~~~~l~esg~ 260 (547) T protein:vir:10 182 AQIYDRFGDEGTPEAIIKKAKEASNQAA-LKQEVVMCVFTRYDKKQNRNAGTVLAPTERPFGKKWILKEGAVQLGEEGGY 260 (547) T ss_pred HHHHHhcCcccCCHHHHHHHhcCCCccc-ceEEEEEEEeeccCCCCCccccceeeccccceeEEEEEecCceeeeecCCc Confidence 1 0112222110 001111100 000 0 00001111211 Q ss_pred EEecCcc--chhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHH Q lcl|NC_019404. 167 HIIDGER--VPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQ 244 (418) Q Consensus 167 i~~~g~~--lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~ 244 (418) ...| ++-+.+..+.-||.||.+. +++.++............+..+.-..+..+.-+. . + .+. T Consensus 261 ---~e~P~~~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~-~---~-------~~~- 324 (547) T protein:vir:10 261 ---YEMPAYAIRWRKSAGSQWGFGPSHL-ALPDVLTANRYVELVLRSSEKVIDPAIMVTERGL-I---S-------DID- 324 (547) T ss_pred ---ccCCeeeeeeeecCCcccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceecccccc-c---c-------cce- Confidence 1112 1223455667799999986 7899999999999988888887766665542111 1 0 111 Q ss_pred HHHhcCCcceeEEEcCCCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccc-----hhHHHHHHHH Q lcl|NC_019404. 245 VDNNSGVGQAIGIDAESEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSS-----QNTALETFHK 317 (418) Q Consensus 245 ~~~~~~~~~~~~~d~~~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~st-----ge~d~~~y~~ 317 (418) -..+++...+..+++..++. +|.-....++.+.+.|..++=.....+ .....+.|| .++-....-- T Consensus 325 -----~~pgg~~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~--~~~~~~TAtEV~~r~~E~~~~LG~ 397 (547) T protein:vir:10 325 -----LGASGLTVVRDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQM--KDSPAMTATEVQVRYELMQRLLGP 397 (547) T ss_pred -----ecCCeeeecCCcccceeeecccchHHHHHHHHHHHHHHHHHhhhhhhhc--CCCccccHHHHHHHHHHHHHHhhH Confidence 12344555566677776654 344456677788888887664332211 123334443 1222344455 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccC---------------CceEEeCCCCCCC-HHHHHHHHHHHHHHHHHHHhCC---- Q lcl|NC_019404. 318 LIDRKRNAELLPILEFLIPFIVNAE---------------EWSVEFSPLDHES-SKDKAEVLEKSVNSIAALIAAG---- 377 (418) Q Consensus 318 ~I~~~Qe~~l~p~l~~l~~~i~~~~---------------~~~~~f~pL~~~~-eke~ae~~~~~a~a~~~~~~~g---- 377 (418) .+++.+...+.|.+++.+.++.+.. ++.++|-+..... ..+.........+.+..+.+.+ T Consensus 398 v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vl 477 (547) T protein:vir:10 398 TLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVL 477 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhh Confidence 5666777788999999999886531 2345544322111 1112222233344444444433 Q ss_pred -CCCHHHHHHHHHhhc-CcCCC--ChhhcccccccCCCccccccC Q lcl|NC_019404. 378 -AMDIKEARDTLRTIA-PEIKI--GDNDIQTEESELITETEVVIA 418 (418) Q Consensus 378 -~i~~~e~r~~l~~~~-~~~~~--~~~~~~~~e~~~~~e~e~~~~ 418 (418) .|+.+++.+.+.... ....+ ++++.+.-. +...+...+-+ T Consensus 478 d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r-~qr~~~~q~~~ 521 (547) T protein:vir:10 478 DIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIR-KNRSQTQQKAE 521 (547) T ss_pred hcCCHHHHHHHHHHHhCCChhccCCHHHHHHHH-HHHHHHHHHHH Confidence 378888877765432 21111 222221100 00001111111 No 244 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=94.36 E-value=0.0046 Score=33.38 Aligned_cols=374 Identities=11% Similarity=0.049 Sum_probs=162.7 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhh---ccCCccccCcc-----------h- Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA---LAAGFHIDGID-----------D- 65 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~---~r~~~~i~~~~-----------d- 65 (418) .++-|+-. .+......-.-+..+.. +-++.++. ..+ .+.||.+.-.+ + T Consensus 32 ~~~~~~~~--------~~~~~~~~~dstg~~a~-----~~LAa~l~----~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~ 94 (510) T protein:vir:63 32 LMVDPMSG--------SRGVVEHDFQSAGALLV-----NNLAAKLA----RSLFPTGIPFFRSELTDAIRREADSRDTDI 94 (510) T ss_pred cCCCCCCc--------cccccCCCccchHHHHH-----HHHHHHHH----hhhcCCCCcccccCCChHHhhcccccchhH Confidence 22222211 11111111111111111 11111211 111 22455543111 0 Q ss_pred ----------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCC-ccccccc-------CCCceEEEEEeeccc Q lcl|NC_019404. 66 ----------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNR-ALTSPVR-------EGAELETVRVYDRTQ 127 (418) Q Consensus 66 ----------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~-~l~~pl~-------~~~~i~~i~v~~~~~ 127 (418) ++.+.+.+.+-++...+.++++.--.||.+.+++. +|+. --.-|+. ..|.+..+ +-+.. T Consensus 95 ~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~-~~~~~~~~~pl~~y~v~~d~~G~vd~i--~rr~~ 171 (510) T protein:vir:63 95 TEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRD-SDAATVVAWSLRSYAVRRDATGRWMDI--VLKQR 171 (510) T ss_pred HHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEc-CCCcEEEEEEcceeEEeeCCCcCeeEE--Eeeee Confidence 11245566777899999999999888999888865 3332 2223442 34554332 22222 Q ss_pred cccccc--------cccccc-cccCcceEEEEecC--Ccccc-cccCc----ccEEEecC-----cc-c-hhhhhhcccc Q lcl|NC_019404. 128 VKVQNR--------EENPRN-ARFGKPLTYRITTN--ESDMF-YDVHY----SRIHIIDG-----ER-V-PNAMRRQNDG 184 (418) Q Consensus 128 i~~~~~--------~~dp~s-~~yg~p~~y~i~~~--~~~~~-~~iH~----SR~i~~~g-----~~-l-p~~~~~~~~~ 184 (418) +++... ..+... ..+.+.+.|+..-. +...+ ..||. .++..-.+ .| + +-+.+..+.- T Consensus 172 ~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~~~~~~~~~~~e~P~~~~Rw~~~~ge~ 251 (510) T protein:vir:63 172 YKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGKEGRWPIHLCPYIVPTWNLAPGEH 251 (510) T ss_pred ccHHHHhHHhhhhhhccccccCCCcceEEEEEEEeecCCCceEEEEEEEecCceeccccccccccCceeeeeeeecCCCc Confidence 221110 011111 11223333332211 10011 11121 12211111 11 1 1234556667 Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCce Q lcl|NC_019404. 185 WGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEY 264 (418) Q Consensus 185 ~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~ 264 (418) ||.||.+. +++.++..............++.-..+-.+. .+... ..++ ..+-++. ++-+..+++ T Consensus 252 YGrgp~~~-~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p--------~g~~~-~~~~-----~~~~~g~-~v~g~~~~v 315 (510) T protein:vir:63 252 YGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVDE--------AKGAV-VDDY-----QDAEMGD-YVPGGAEAV 315 (510) T ss_pred cccchHHH-HHHHHHHHHHHHHHHHHHHHHhccCCcccCc--------ccccc-hhhh-----ccCCCce-eecCCcccc Confidence 99999986 8899999999988888877666444443332 11100 0111 1122233 344555666 Q ss_pred eEeecc----cCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch-----hHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 265 SVLNSD----IGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ-----NTALETFHKLIDRKRNAELLPILEFLI 335 (418) Q Consensus 265 ~~~~~~----~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg-----e~d~~~y~~~I~~~Qe~~l~p~l~~l~ 335 (418) ..++.. |.-+...++.+.+.|..++ ... |..+....+.||- ++-....--.+.+.|.-.+.|.+++.+ T Consensus 316 ~~~~~~~~~d~~~~~~~i~~~~~rI~~af--~~~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~ 392 (510) T protein:vir:63 316 RAYERGDYNKMAAIQQSLQAVVVRLNQAF--MYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCL 392 (510) T ss_pred eeeecCcccchHHHHHHHHHHHHHHHHHH--Hhh-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 665543 4556677888888888876 122 3222223345431 112233444566777778899999998 Q ss_pred HHhhcc-------CCc---eEEeCCCCCCCHHHHHHHHHHHHHHHHHH---Hh-CCCCCHHHHHHHHHhhcCcCCCChhh Q lcl|NC_019404. 336 PFIVNA-------EEW---SVEFSPLDHESSKDKAEVLEKSVNSIAAL---IA-AGAMDIKEARDTLRTIAPEIKIGDND 401 (418) Q Consensus 336 ~~i~~~-------~~~---~~~f~pL~~~~eke~ae~~~~~a~a~~~~---~~-~g~i~~~e~r~~l~~~~~~~~~~~~~ 401 (418) .++... +.+ .+++ +..+....+++-.....+.++.. .+ .--|+.+++.+.+.... |++... T Consensus 393 ~il~r~gl~p~p~~~~~~~~v~~--is~Laraq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~---Gv~p~~ 467 (510) T protein:vir:63 393 SEVDDALLQGLITKQHKPAIETG--LPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAF---SVDTSQ 467 (510) T ss_pred HHHHhccCCCCCchhcccceecc--hhHHHHHHHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHh---CCChhH Confidence 888642 111 1222 11222222333222222222222 12 22477888877765322 233333 Q ss_pred cccccccCCCcccc------ccC Q lcl|NC_019404. 402 IQTEESELITETEV------VIA 418 (418) Q Consensus 402 ~~~~e~~~~~e~e~------~~~ 418 (418) +-..+.+...+.|. .=+ T Consensus 468 ivrs~eev~a~~~~~~qq~~~~~ 490 (510) T protein:vir:63 468 FYKSADELQAEAEQQRQQAAQAQ 490 (510) T ss_pred hcCCHHHHHHHHHHHHHHHHHHH Confidence 32222221111110 000 No 245 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=93.38 E-value=0.0078 Score=32.15 Aligned_cols=389 Identities=12% Similarity=0.146 Sum_probs=160.7 Q ss_pred CccchhhHHH-------HhcCCC-Ccc-ccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcch------ Q lcl|NC_019404. 1 MVKTDSYANI-------FLGGSD-GSE-IYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDD------ 65 (418) Q Consensus 1 ~~~~D~~~n~-------~~g~~~-~~~-~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~d------ 65 (418) .-+..-+... |.+... .+. .....-.-+..+... -++.++.-..-- ..+.||.+.-.++ T Consensus 24 e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~-----~Las~l~~~ltp-p~~~WF~l~~~d~~~~e~~ 97 (559) T protein:vir:95 24 EPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAAR-----TLASGMMSGITS-PARPWFRLATPDPEMMDYG 97 (559) T ss_pred HHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHH-----HHHHHHHHhhcC-CCCcccccccCCccccchH Confidence 1111111111 111110 000 001111111111111 111111111000 1345666643211 Q ss_pred ---------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcc---ccccc-------CCCceEEEEEeecc Q lcl|NC_019404. 66 ---------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRAL---TSPVR-------EGAELETVRVYDRT 126 (418) Q Consensus 66 ---------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l---~~pl~-------~~~~i~~i~v~~~~ 126 (418) ++.+...+.+-++...+.++++.--.||.+++++.-+.++.+ .-|+. ..|.+.. |+-+. T Consensus 98 ~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~~~r~~~~~l~~~~v~~d~~G~vd~--i~r~~ 175 (559) T protein:vir:95 98 PVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDEDIIRTMPFPIGSYYLANSPRGSVDT--CFRKF 175 (559) T ss_pred HHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCceeEEEEeecCeEEEeeCCCCCeEE--EEEeE Confidence 122455666778899999999888899999998864333321 23332 3344432 22211 Q ss_pred cccc--------------c---cccccccccccCcceEEEE-ecCCccccc----------ccC------cccEEEecCc Q lcl|NC_019404. 127 QVKV--------------Q---NREENPRNARFGKPLTYRI-TTNESDMFY----------DVH------YSRIHIIDGE 172 (418) Q Consensus 127 ~i~~--------------~---~~~~dp~s~~yg~p~~y~i-~~~~~~~~~----------~iH------~SR~i~~~g~ 172 (418) .+++ . ....+|...+ -+.|+. .+....... .|| ..+++.-.|. T Consensus 176 ~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~---v~v~~~V~pr~~~~~~~~~~~~~pf~s~~~e~~~~~~~~l~esg~ 252 (559) T protein:vir:95 176 SMTVRQLVQEFGLNNVSESVKSMWESGTYEKW---IEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGF 252 (559) T ss_pred ecCHHHHHHHcCcccCCHHHHHHHhcCCCCCe---EEEEEEEeccccccccccccccceEEEEEEEecCCCceeeecCCc Confidence 2221 1 1122222211 111211 111000000 010 0122221111 Q ss_pred ---c-ch-hhhhhccccCCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHH Q lcl|NC_019404. 173 ---R-VP-NAMRRQNDGWGRS-VLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVD 246 (418) Q Consensus 173 ---~-lp-~~~~~~~~~~G~S-~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~ 246 (418) | ++ -+.+..+.-||.| |... +++.++............+..+.-..+..+.-. .. ..+.. T Consensus 253 ~e~P~~~~Rw~~~~ge~YGrg~P~~~-al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~------~~-----~~~~l-- 318 (559) T protein:vir:95 253 DEFPIMAPRWEVNGEDVYGSSCPGML-ALGPVKALQLLQKRKSQLIDKATNPPMVAPTSL------KN-----QRASL-- 318 (559) T ss_pred ccCCccceeeeecCCccccccchHHH-hhHHHHHHHHHHHHHHHHHHHHhcCceeccccc------cc-----cceee-- Confidence 1 12 2344566679998 7864 789999999999998888888776666554211 00 00110 Q ss_pred HhcCCcceeEEEcC---CCceeEee---cccCCHHHHHHHHHHHHhhhhcCCee-eeeccCccccccc-----hhHHHHH Q lcl|NC_019404. 247 NNSGVGQAIGIDAE---SEEYSVLN---SDIGGIDAFLDKKFDRIVALSGIHEI-ILKNKNVGGLSSS-----QNTALET 314 (418) Q Consensus 247 ~~~~~~~~~~~d~~---~e~~~~~~---~~~~gl~~~~~~~~~~iaaas~IP~t-~L~G~s~~gl~st-----ge~d~~~ 314 (418) ..+++..... .+.+.... ..+..+...++.+.+.|..++=.-.. .|..+....+.+| .++-... T Consensus 319 ----~pgg~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~ 394 (559) T protein:vir:95 319 ----LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLM 394 (559) T ss_pred ----eccceeeeCCCCCcccceeecccccchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHH Confidence 1121211111 13344332 22333334455667777666644322 2323334445443 1222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcc------------CCceEEeCCCCCC-CHHHHHHHHHHHHHHHHHHHhCC---- Q lcl|NC_019404. 315 FHKLIDRKRNAELLPILEFLIPFIVNA------------EEWSVEFSPLDHE-SSKDKAEVLEKSVNSIAALIAAG---- 377 (418) Q Consensus 315 y~~~I~~~Qe~~l~p~l~~l~~~i~~~------------~~~~~~f~pL~~~-~eke~ae~~~~~a~a~~~~~~~g---- 377 (418) .--.+.+.+.-.+.|.+++.+.++.+. .+++++|-+.... -..+..+.....++.+..+.+.+ T Consensus 395 LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~laq~~Pevl 474 (559) T protein:vir:95 395 LGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQVKPEAL 474 (559) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhh Confidence 444466667778999999999998652 2455666432211 11112222234444555554443 Q ss_pred -CCCHHHHHHHHHhhc-CcCCC--ChhhcccccccC--CCccccccC Q lcl|NC_019404. 378 -AMDIKEARDTLRTIA-PEIKI--GDNDIQTEESEL--ITETEVVIA 418 (418) Q Consensus 378 -~i~~~e~r~~l~~~~-~~~~~--~~~~~~~~e~~~--~~e~e~~~~ 418 (418) .|+.+++.+.+.... ...++ ++++......+- -.+...+.+ T Consensus 475 d~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~~q~~~ 521 (559) T protein:vir:95 475 DKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMMA 521 (559) T ss_pred hcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHHHHHHH Confidence 377888777664322 21111 111111000000 000000000 No 246 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=93.10 E-value=0.0088 Score=31.86 Aligned_cols=380 Identities=12% Similarity=0.091 Sum_probs=171.5 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcc---------h------ Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGID---------D------ 65 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~---------d------ 65 (418) ..+.|+ +..+......-.-+..+...-.. +++...+. |+ +.||.+.-.+ + T Consensus 42 ~~~~~~--------~~~~~~~~~~~dst~~~a~~~La-a~l~~~lt--P~----~~WFrl~~~d~~~~~~~~~~~~~~~v 106 (536) T protein:vir:21 42 LFPKDS--------DNASTDYQTPWQAVGARGLNNLA-SKLMLALF--PM----QTWMRLTISEYEAKQLLSDPDGLAKV 106 (536) T ss_pred ccCCCC--------CcccccccccccccHHHHHHHHH-HHHHHhhc--CC----CcccccccChhhhhccccchhhHHHH Confidence 222222 11111111111112333332212 34444442 54 3477774211 0 Q ss_pred -------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCc---c-ccccc-------CCCceEEEEEeeccc Q lcl|NC_019404. 66 -------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRA---L-TSPVR-------EGAELETVRVYDRTQ 127 (418) Q Consensus 66 -------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~---l-~~pl~-------~~~~i~~i~v~~~~~ 127 (418) ++.+...+.+-++...+.++++.--.+|.+++++.-+.+.. . .-|+. ..|.+..+ +-+.. T Consensus 107 ~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i--~r~~~ 184 (536) T protein:vir:21 107 DEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQM--VTRDQ 184 (536) T ss_pred HHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEcCeEEEeeCCCCCeeEE--eeeee Confidence 12345556677899999999998888999998886443321 1 23442 34544332 22222 Q ss_pred cccc----cccccccc-----cccCcceEEEEe-cCCcccccccCc----ccEEEecC------cc-ch-hhhhhccccC Q lcl|NC_019404. 128 VKVQ----NREENPRN-----ARFGKPLTYRIT-TNESDMFYDVHY----SRIHIIDG------ER-VP-NAMRRQNDGW 185 (418) Q Consensus 128 i~~~----~~~~dp~s-----~~yg~p~~y~i~-~~~~~~~~~iH~----SR~i~~~g------~~-lp-~~~~~~~~~~ 185 (418) ++.. .+..+..+ ..+...+.|+.. .........+|. -+++...| .| ++ -+.+....-| T Consensus 185 ~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~Y 264 (536) T protein:vir:21 185 IAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESY 264 (536) T ss_pred ccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEeccCCeeeccccCccccccCCeeeeeeeecCCCcc Confidence 2211 11111111 112223333222 111111111121 12222223 12 11 2344556679 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCcee Q lcl|NC_019404. 186 GRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYS 265 (418) Q Consensus 186 G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~ 265 (418) |.||.+. +++.++...............+.-..+..+. .+... ..++ ..+.++.+ +-+..+++. T Consensus 265 Grgp~~~-~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p--------~g~~~-~~~~-----~~~~~g~~-v~g~~~~v~ 328 (536) T protein:vir:21 265 GRSYIEE-YLGDLRSLENLQEAIVKMSMISSKVIGLVNP--------AGITQ-PRRL-----TKAQTGDF-VTGRPEDIS 328 (536) T ss_pred ccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCcccCc--------ccccc-hhhh-----ccCCCcce-ecCCcccce Confidence 9999986 7899999999988888876665444443331 11100 0011 12223333 333335555 Q ss_pred Eee----cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch-----hHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 266 VLN----SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ-----NTALETFHKLIDRKRNAELLPILEFLIP 336 (418) Q Consensus 266 ~~~----~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg-----e~d~~~y~~~I~~~Qe~~l~p~l~~l~~ 336 (418) .+. .+|......++.+.+.|.-++=+- .+.-+....+.+|- ++-....--.+.+.+...+.|.+++++. T Consensus 329 ~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~ 406 (536) T protein:vir:21 329 FLQLEKQADFTVAKAVSDAIEARLSFAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLK 406 (536) T ss_pred eeeccccccchHHHHHHHHHHHHHHHHHhhh--hcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 443 245556777888888888777221 22223344454431 1122333445566788889999999999 Q ss_pred HhhccC--------CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCC------CCCHHHHHHHHHhh-cC-cCCC--C Q lcl|NC_019404. 337 FIVNAE--------EWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAG------AMDIKEARDTLRTI-AP-EIKI--G 398 (418) Q Consensus 337 ~i~~~~--------~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g------~i~~~e~r~~l~~~-~~-~~~~--~ 398 (418) ++.... .+.+++.+. +....++.-..+..+.++.+.+.+ .|+.+++.+.+.+. +. ..++ + T Consensus 407 il~r~g~lP~~p~~~v~~~~vs~--l~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt 484 (536) T protein:vir:21 407 QLQATQQIPELPKEAVEPTISTG--LEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLT 484 (536) T ss_pred HHHhCCCCCCCChhhccceEEec--HHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCC Confidence 986532 234444321 223334333333344444443332 37788887776542 21 1111 2 Q ss_pred hhhcccccccCCCccccccC Q lcl|NC_019404. 399 DNDIQTEESELITETEVVIA 418 (418) Q Consensus 399 ~~~~~~~e~~~~~e~e~~~~ 418 (418) +++....-.+ ..+...+-+ T Consensus 485 ~eev~~~r~q-~~~~~~~~~ 503 (536) T protein:vir:21 485 EEQKQQKMAQ-QSMQMGMDN 503 (536) T ss_pred HHHHHHHHHH-HHHHHHHHH Confidence 2221111000 000000000 No 247 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=92.24 E-value=0.012 Score=31.07 Aligned_cols=381 Identities=10% Similarity=0.056 Sum_probs=154.9 Q ss_pred CccchhhHHH-----HhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhh---ccCCccccCcc-------- Q lcl|NC_019404. 1 MVKTDSYANI-----FLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA---LAAGFHIDGID-------- 64 (418) Q Consensus 1 ~~~~D~~~n~-----~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~---~r~~~~i~~~~-------- 64 (418) .-+..-+... +..-+...+.. ..-.-+..+.. +-++.++ -..+ .+.||.+.-.+ T Consensus 27 e~~w~e~~~~~lP~~~~~~~~~~~~~-~~~dstg~~a~-----~~LAa~l----~~~ltpp~~~WF~l~~~~~~l~~~~~ 96 (517) T protein:vir:10 27 LSRAENYSRFTLPYLMADVNDDLSSQ-NAWQDDGASAT-----NFLSNKL----SQVLFPAQRSFFRIDLTPEGIKQLDN 96 (517) T ss_pred HHHHHHHHHHhccccccCCCCCcccc-ccccchHHHHH-----HHHHHHH----HHhhcCCCCccccccCCHHHHHhhcc Confidence 0000000000 00000000000 00001111111 1111111 1111 12355443111 Q ss_pred --------------hHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCC--cccccc-------cCCCceEEEE Q lcl|NC_019404. 65 --------------DEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNR--ALTSPV-------REGAELETVR 121 (418) Q Consensus 65 --------------d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~--~l~~pl-------~~~~~i~~i~ 121 (418) -++.+.+.+.+.++...+.++++.--.+|.+++++ .++. --.-|+ +..|.+..+. T Consensus 97 ~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~--~~~~~~~~~~pl~~y~v~~d~~G~v~~iv 174 (517) T protein:vir:10 97 EAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYH--PDKTSPIQAVPLHHYCVRRDNNGTVLDIV 174 (517) T ss_pred CcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEE--eCCCCcEEEEEcCeEEEeeCCCcCeEEEE Confidence 01234556677799999999999988899988765 3322 112233 2345443321 Q ss_pred E---eecccccccccc-------ccccccccCcceEEEEecCCcccccccCcc----cEEEecC-----cc-c-hhhhhh Q lcl|NC_019404. 122 V---YDRTQVKVQNRE-------ENPRNARFGKPLTYRITTNESDMFYDVHYS----RIHIIDG-----ER-V-PNAMRR 180 (418) Q Consensus 122 v---~~~~~i~~~~~~-------~dp~s~~yg~p~~y~i~~~~~~~~~~iH~S----R~i~~~g-----~~-l-p~~~~~ 180 (418) . +..+++...... ..-..| +...+.|+..-........+|.+ ++..-.+ .| + +-+.+. T Consensus 175 rr~~~~~~~l~~~~~~~~~~~~~~~~~~~-~~~v~v~~~v~~~~~~~~~~~~~~d~~~~~~~s~y~~~e~P~~~~Rw~~~ 253 (517) T protein:vir:10 175 FLQEKALETFEPSIRMAIQASRKGKQYKD-KDNVKLYTHAKRTKDGKYLIRQSADDVPVGKESTVTEDKSPFLILTWKRS 253 (517) T ss_pred eeeeccHHHHHHHhhhhcchhhhhhccCC-cCceEEEEEEEEeCCCceEEEEEeCceeeccccccccccCCeeeeeeeec Confidence 1 112222211100 000111 11122222211000000112211 1111111 11 1 123445 Q ss_pred ccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC Q lcl|NC_019404. 181 QNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE 260 (418) Q Consensus 181 ~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~ 260 (418) .+.-||.||.+. +++.++...............+.-..+-.+. ++. ..+. .......+.++-+. T Consensus 254 ~ge~YGrgp~~~-~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~--------~~~---~~~~----~l~~~~~g~~~~g~ 317 (517) T protein:vir:10 254 YGEDYGRGMAED-HAGAFFVIQFLSEALARGMALMADVKYLVKP--------GSY---TDIN----QFVEGGSGAVLHGV 317 (517) T ss_pred CCCCcccchHHH-hHHHHHHHHHHHHHHHHHHHHhccCCcccCc--------ccc---cchh----hccCCCccccccCC Confidence 567899999986 7899999998888887776655444443331 110 0000 01111222233344 Q ss_pred CCceeEeecc----cCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch-----hHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 261 SEEYSVLNSD----IGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ-----NTALETFHKLIDRKRNAELLPIL 331 (418) Q Consensus 261 ~e~~~~~~~~----~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg-----e~d~~~y~~~I~~~Qe~~l~p~l 331 (418) .+++..+... |......++.+.+.|..++=+- .|.-+....+.+|- ++-....--.+.+.|...+.|.+ T Consensus 318 ~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~--~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli 395 (517) T protein:vir:10 318 EGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMME--AMTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPLA 395 (517) T ss_pred cccceeeecccccchhHHHHHHHHHHHHHHHHHhhh--hhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHH Confidence 4556655533 4556677788888888776321 12212334454431 11223344455667777888888 Q ss_pred HHHHHHhhc---cCCceEEeCCCCCCCHHHHHHHHHHHHHH---HHHHHhCC-----CCCHHHHHHHHHhhcCcCCCChh Q lcl|NC_019404. 332 EFLIPFIVN---AEEWSVEFSPLDHESSKDKAEVLEKSVNS---IAALIAAG-----AMDIKEARDTLRTIAPEIKIGDN 400 (418) Q Consensus 332 ~~l~~~i~~---~~~~~~~f~pL~~~~eke~ae~~~~~a~a---~~~~~~~g-----~i~~~e~r~~l~~~~~~~~~~~~ 400 (418) ++++.++.. .+++.+++-+- +....|+.-..+..+. +..+.+.. .|+.+++.+.+.... |++. T Consensus 396 ~r~~~~l~~~l~~~~v~~~~~s~--la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~---Gvp~- 469 (517) T protein:vir:10 396 RWFMNGISSILTSKNVSPTILTG--IEALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQI---SANF- 469 (517) T ss_pred HHHHHHhhhhcCCCCccceeecc--HHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHh---CCCh- Confidence 888877653 23555554322 2233333222233333 33333211 466777776664322 2221 Q ss_pred hcccccccCCCccccccC Q lcl|NC_019404. 401 DIQTEESELITETEVVIA 418 (418) Q Consensus 401 ~~~~~e~~~~~e~e~~~~ 418 (418) .+-..+.+...+.+..-. T Consensus 470 ~~irs~~ev~~~~~~~~~ 487 (517) T protein:vir:10 470 PFFKTQDELNAEAQAQQE 487 (517) T ss_pred hhcCCHHHHHHHHHHHHH Confidence 111111111111110000 No 248 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=92.21 E-value=0.012 Score=31.05 Aligned_cols=388 Identities=11% Similarity=0.039 Sum_probs=173.7 Q ss_pred CccchhhHH-----HHhcCCC-CccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcc---------- Q lcl|NC_019404. 1 MVKTDSYAN-----IFLGGSD-GSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGID---------- 64 (418) Q Consensus 1 ~~~~D~~~n-----~~~g~~~-~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~---------- 64 (418) .-+..-+.. .+.+.++ .++.....-.-+..+...-.. +++...+. |+ +.||.+.-.+ T Consensus 29 e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~La-a~l~~~lt--P~----~~WF~l~~~d~~~~~~~~~~ 101 (535) T protein:vir:33 29 ETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLA-SKLMLALF--PM----QSWMKLTISEYEAKQLVGDP 101 (535) T ss_pred HHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHH-HHHHHhhc--CC----CcccccccChHHHhccccCc Confidence 000111100 1111111 111111111112222222211 33333333 53 3577764211 Q ss_pred -h-----------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCc---cccccc-------CCCceEEEEE Q lcl|NC_019404. 65 -D-----------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRA---LTSPVR-------EGAELETVRV 122 (418) Q Consensus 65 -d-----------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~---l~~pl~-------~~~~i~~i~v 122 (418) . ++.+...+.+.++...+.++++.--.+|.+++++.-+.++. -.-|+. ..|.+..+.. T Consensus 102 ~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r 181 (535) T protein:vir:33 102 DGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRLSSYVVQRDAYGNVLQIVT 181 (535) T ss_pred chHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEcCeeEEeeCCCCCeeEEEe Confidence 0 11244556677899999999998888999988875432221 123442 3454433211 Q ss_pred ---eeccccccccccc---c-ccccccCcceEEEEe-cCCcccccccCcc----cEEE------ecCcc--chhhhhhcc Q lcl|NC_019404. 123 ---YDRTQVKVQNREE---N-PRNARFGKPLTYRIT-TNESDMFYDVHYS----RIHI------IDGER--VPNAMRRQN 182 (418) Q Consensus 123 ---~~~~~i~~~~~~~---d-p~s~~yg~p~~y~i~-~~~~~~~~~iH~S----R~i~------~~g~~--lp~~~~~~~ 182 (418) +..+++....... + .....+..++.|+.. .........+|.+ ++.- |...| ++-+.+..+ T Consensus 182 ~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~g 261 (535) T protein:vir:33 182 RDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDG 261 (535) T ss_pred eEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEeeCCCCcEEEEEEEeCccccccccccccccCCceeeeeeecCC Confidence 1222221111100 0 000112223333221 1100011112211 1100 11111 112344556 Q ss_pred ccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCC Q lcl|NC_019404. 183 DGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESE 262 (418) Q Consensus 183 ~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e 262 (418) .-||.||.+. +++.++............+.++.-..+..+. ++. ..+... ..+ ..+.++.+..+ T Consensus 262 e~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~------~g~-----~~~~~~---~~~-~~g~~v~g~~~ 325 (535) T protein:vir:33 262 ESYGRSYCEE-YLGDLRSLENLQEAIVKMSMISAKVIGLVNP------AGI-----TQPRRL---TKA-QTGDFVPGRRE 325 (535) T ss_pred CccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeecc------ccc-----cchhhc---ccC-CceeeecCCcc Confidence 6799999986 8899999999999999888887665555442 110 011100 111 22234445556 Q ss_pred ceeEeec----ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccc-----hhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 263 EYSVLNS----DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSS-----QNTALETFHKLIDRKRNAELLPILEF 333 (418) Q Consensus 263 ~~~~~~~----~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~st-----ge~d~~~y~~~I~~~Qe~~l~p~l~~ 333 (418) ++..+.. +|......++.+.+.|.-++=+- -+..+....+.+| .++-...+--.+.+++...+.|++++ T Consensus 326 ~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r 403 (535) T protein:vir:33 326 DIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRV 403 (535) T ss_pred cceeeecccccchhHHHHHHHHHHHHHHHHHhhh--hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH Confidence 6666643 45567777888888888776111 1222334445443 12223445566677888889999999 Q ss_pred HHHHhhcc--------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCC------CCCHHHHHHHHHhhcCcCCCCh Q lcl|NC_019404. 334 LIPFIVNA--------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAG------AMDIKEARDTLRTIAPEIKIGD 399 (418) Q Consensus 334 l~~~i~~~--------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g------~i~~~e~r~~l~~~~~~~~~~~ 399 (418) ++.++.+. +.++++|-+.. ....|..-..+..+.++.+.+.+ .|+.+++.+.+.... |++. T Consensus 404 ~~~il~r~g~lP~~p~~~v~~~yis~L--a~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~---Gvp~ 478 (535) T protein:vir:33 404 LLKQLQATSQIPELPKEAVEPTISTGL--EAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAI---GIDT 478 (535) T ss_pred HHHHHHhcCCCCCCCccceeEEEecHH--HHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHc---CCCH Confidence 99998653 35677765432 22233222223333333333322 367777776664322 2222 Q ss_pred hhcccccccCCCc-cccccC Q lcl|NC_019404. 400 NDIQTEESELITE-TEVVIA 418 (418) Q Consensus 400 ~~~~~~e~~~~~e-~e~~~~ 418 (418) ..+-..+.+.... .+.+-+ T Consensus 479 ~~i~~~~ee~~~~~~q~~~~ 498 (535) T protein:vir:33 479 SGILLTDEQKQALMMQDAAQ 498 (535) T ss_pred hHhcCCHHHHHHHHHHHHHH Confidence 2221111110000 000000 No 249 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=91.90 E-value=0.014 Score=30.79 Aligned_cols=373 Identities=12% Similarity=0.068 Sum_probs=142.4 Q ss_pred Cccchhh-----HHHHhcCCCCccccCc----cccC-CHHHHHHHHHcCCccchhhhcchhhhccCCcccc--CcchH-H Q lcl|NC_019404. 1 MVKTDSY-----ANIFLGGSDGSEIYGS----LQNQ-APTILASLYADNALVRRIIDTIPETALAAGFHID--GIDDE-P 67 (418) Q Consensus 1 ~~~~D~~-----~n~~~g~~~~~~~~~~----~~~~-~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~--~~~d~-~ 67 (418) +..++.- ...+.+....+..+.. .+.. ....++.| ...+-+..++++.-...++..|.|+ +++.+ . T Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m-~~D~hi~s~l~~Rk~av~~~~w~v~p~~~~~~d~ 96 (448) T protein:vir:77 18 IDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKM-LSDGTVKNALNYIFGRIRSAKWYVEPASTDPEDI 96 (448) T ss_pred cchhhhhhhccchhhhcccccccccccchhHhhccccchHHHHHH-hhChHHHHHHHHHHHHHhcCCceEecCCCCHHHH Confidence 1111110 0001110000000000 0001 11222333 3477778888888888887778885 22222 1 Q ss_pred H----HHHHHHH-------hCchHHHHHHHHhccccceEEEEEeec---CCCc-ccccc-cCCCceEEEEEeeccccccc Q lcl|NC_019404. 68 A----FWSRWDD-------LEMTQNINDAWSWARLFGGAAIVAIVK---DNRA-LTSPV-REGAELETVRVYDRTQVKVQ 131 (418) Q Consensus 68 ~----i~~~~~~-------l~~~~~~~~a~~~~rl~G~~~i~i~~~---d~~~-l~~pl-~~~~~i~~i~v~~~~~i~~~ 131 (418) + +.+.+.. +.+.. +...+-.+.+||.|++=+.-+ ||.- +.... .+...+..|.+.+-..+... T Consensus 97 ~~ae~v~~~l~~~~~~~~~~~f~~-~i~~~lda~~~G~s~~Eivw~~~~dg~~~~~~l~~r~~~~~~~f~~~~~~~l~~~ 175 (448) T protein:vir:77 97 AIAAFIHAQLGIDDASVGKYPFGR-LFAIYENAYIYGMAAGEIVLTLGADGKLILDKIVPIHPFNIDEVLYDEEGGPKAL 175 (448) T ss_pred HHHHHHHHHhhchhhhhccCCHHH-HHHHHHHhhhhcceeEEEEEeecCCCceeeccccccCCCccceeeeecCCceEEE Confidence 1 2222221 23334 444445799999999854432 2221 00000 01112222211111111000 Q ss_pred cccccccccccCcceEEEEecCCcccccccCcccEEEecCccchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 132 NREENPRNARFGKPLTYRITTNESDMFYDVHYSRIHIIDGERVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQL 211 (418) Q Consensus 132 ~~~~dp~s~~yg~p~~y~i~~~~~~~~~~iH~SR~i~~~g~~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l 211 (418) .. .++ .-|.+. ......+...+++++... ...+++|.+.+ +.||-...--..+..--+.. T Consensus 176 ~~-~~~---~~~~~~--------~~~~~~lP~~~~i~~~~~-------~~g~p~g~gLl-r~~~w~~~fK~~~~~~w~~f 235 (448) T protein:vir:77 176 KL-SGE---VKGGSQ--------FVNGLEIPIWKTVVFLHN-------DDGSFTGQSAL-RAAVPHWLAKRALILLINHG 235 (448) T ss_pred ec-CCc---cccccc--------CCCccccccceEEEEecC-------CcCCcccchHH-HHHHHHHHHHHhhHHHHHHH Confidence 00 000 000000 001123455667766321 24567888866 55666554445555556667 Q ss_pred HHHcCC--ceeecchHHHhhcCcchHHHHHHHHHHHH-HhcCCcceeEEEcCCCceeEeeccc--CCHHHHHHHHHHHHh Q lcl|NC_019404. 212 LRRKQQ--AVWKAKGLAELCDDSEGFGAARLRLAQVD-NNSGVGQAIGIDAESEEYSVLNSDI--GGIDAFLDKKFDRIV 286 (418) Q Consensus 212 ~~~~~~--~v~k~~~l~~~~~~~~~~~~~~~r~~~~~-~~~~~~~~~~~d~~~e~~~~~~~~~--~gl~~~~~~~~~~ia 286 (418) +.++++ .+.|.+..+ .+. .+.++.+..+. +.+....+.++.-++.+++.++..- +...++++..-.+|| T Consensus 236 ~E~yG~P~~vgky~~ga-----~~~-~~~~~~l~~av~~i~~g~~a~~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~Is 309 (448) T protein:vir:77 236 LERFMIGVPTLTIPKSV-----RQG-TKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIA 309 (448) T ss_pred HHHcCCceeEEecCCCC-----CCC-HHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCccCHHHHHHHHHHHHH Confidence 778774 455655321 111 11222222222 2222233444555667888888753 235566777677777 Q ss_pred hhhcCCeeeeeccCccccccchhHH-HHHHHHHHHHHHHHHHHHHH-HHHHHHhhc-c----CCc-eEEeCCCCCCCHHH Q lcl|NC_019404. 287 ALSGIHEIILKNKNVGGLSSSQNTA-LETFHKLIDRKRNAELLPIL-EFLIPFIVN-A----EEW-SVEFSPLDHESSKD 358 (418) Q Consensus 287 aas~IP~t~L~G~s~~gl~stge~d-~~~y~~~I~~~Qe~~l~p~l-~~l~~~i~~-~----~~~-~~~f~pL~~~~eke 358 (418) .+.--. | |.-++.+|.++...++ .....+.+++..+. +...+ +.|+.-++. . ... .|.|...-.. T Consensus 310 k~iLGq-t-lTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~-i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~~e~e---- 382 (448) T protein:vir:77 310 RALGID-F-NTVQLNMGVQAVNIGEFVSLTQQTIISLQRE-FASAVNLYLIPKLVLPNWPGATRFPRLTFEMEERN---- 382 (448) T ss_pred HHHhcc-c-cccccccchhhhhhhhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCCCCCCEEEecCCChh---- Confidence 543222 1 1111222222111112 12334444443332 33333 345554432 1 111 4555432222 Q ss_pred HHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcCcCCCCh---h---hc-cccc-----ccCCCccccccC Q lcl|NC_019404. 359 KAEVLEKSVNSIAALIAAGAMDIKEARDTLRTIAPEIKIGD---N---DI-QTEE-----SELITETEVVIA 418 (418) Q Consensus 359 ~ae~~~~~a~a~~~~~~~g~i~~~e~r~~l~~~~~~~~~~~---~---~~-~~~e-----~~~~~e~e~~~~ 418 (418) |+ ++.|+.+..+++ .+++.+.--.+..+..+ . .. ..++ ...+..++.+.+ T Consensus 383 --Dl-~~~a~~~~~l~~-------~~~~~~~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (448) T protein:vir:77 383 --DF-SAAANLMGMLIN-------AVKDSEDIPTELKALIDALPSKMRRALGVVDEVREAVRQPADSRYLYT 444 (448) T ss_pred --hH-HHHHHHhHHHHH-------HHHHHhcCCccCCcCCCCCchhcccccCCCCCCCchhhcchhhHHHHh Confidence 22 335566666552 23332211000000000 0 00 0000 011112222222 No 250 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=91.73 E-value=0.014 Score=30.66 Aligned_cols=386 Identities=12% Similarity=0.112 Sum_probs=162.6 Q ss_pred CccchhhHHH-------Hh-c-CCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhh-------ccCCccccCcc Q lcl|NC_019404. 1 MVKTDSYANI-------FL-G-GSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA-------LAAGFHIDGID 64 (418) Q Consensus 1 ~~~~D~~~n~-------~~-g-~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~-------~r~~~~i~~~~ 64 (418) .-+..-+... +. + ++.+.......-. +-+.+.+++.|..+ .+.||.+.-.+ T Consensus 25 e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~d-------------st~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d 91 (555) T protein:vir:10 25 MSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILD-------------NTGTRALRVLAAGMMAGMTSPARPWFRLTTSI 91 (555) T ss_pred HHHHHHHHHHhCcccccccCCCCCcchhccccccc-------------ccHHHHHHHHHHHHHHhhcCCCCcccccccCc Confidence 0000111110 11 1 1111111111111 11112222222222 23466654221 Q ss_pred h---------------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcc---ccccc-------CCCceEE Q lcl|NC_019404. 65 D---------------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRAL---TSPVR-------EGAELET 119 (418) Q Consensus 65 d---------------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l---~~pl~-------~~~~i~~ 119 (418) . ++.+...+.+-++...+.++++.--.+|.+.+++.-+.++.+ .-|+. ..|.+.. T Consensus 92 ~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~~~v~~d~~G~vd~ 171 (555) T protein:vir:10 92 PELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGEYAIAADNQGRVNT 171 (555) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecceeEEeeCCCCCEEE Confidence 0 123445666778999999999988899999998764433322 23442 3454432 Q ss_pred E-EEe--eccc---------cccc---cccccccccccCcceEEEE-ecCCcc---------cc---c----ccCcccEE Q lcl|NC_019404. 120 V-RVY--DRTQ---------VKVQ---NREENPRNARFGKPLTYRI-TTNESD---------MF---Y----DVHYSRIH 167 (418) Q Consensus 120 i-~v~--~~~~---------i~~~---~~~~dp~s~~yg~p~~y~i-~~~~~~---------~~---~----~iH~SR~i 167 (418) + +-+ ..++ ++.. ....+|...+ -+.|+. .+.... .+ . ..+..+++ T Consensus 172 i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~---v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl 248 (555) T protein:vir:10 172 LYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQW---VTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTL 248 (555) T ss_pred EEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCce---EEEEEEEeeccCcCcCCCCccccceEEEEEEeccCCcccc Confidence 1 111 1111 1111 1111221111 111111 110000 00 0 01111222 Q ss_pred E---ecCcc-ch-hhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHH Q lcl|NC_019404. 168 I---IDGER-VP-NAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRL 242 (418) Q Consensus 168 ~---~~g~~-lp-~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~ 242 (418) . |...| ++ -+.+....-||.||.+. +++.++..........+.+..+.-..+-.+.-. .. ..+ T Consensus 249 ~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~-~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~------~~-----~~~ 316 (555) T protein:vir:10 249 RESGYRSFRALCPRWALVGGDIYGNSPAME-ALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSA------KN-----QDI 316 (555) T ss_pred ccCCcccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeecccc------cc-----ccc Confidence 1 11112 11 23445566899999986 889999999988888888877655555444211 00 000 Q ss_pred HHHHHhcCCcceeEEEcCCCc-ee---EeecccCCHHHHHHHHHHHHhhhhcCCeeeeecc-Cccccccch-----hHHH Q lcl|NC_019404. 243 AQVDNNSGVGQAIGIDAESEE-YS---VLNSDIGGIDAFLDKKFDRIVALSGIHEIILKNK-NVGGLSSSQ-----NTAL 312 (418) Q Consensus 243 ~~~~~~~~~~~~~~~d~~~e~-~~---~~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~-s~~gl~stg-----e~d~ 312 (418) . ........+..+...+ +. ..+.+|+.+...++.+.+.|..++=......+++ ....+.||. ++-. T Consensus 317 ~----~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~ 392 (555) T protein:vir:10 317 S----TVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKL 392 (555) T ss_pred e----eccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHH Confidence 0 0011111122222222 11 2233566667778888888887774432222222 233344431 2222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------CCceEEeCCCCC-CCHHHHHHHHHHHHHHHHHHHhCC-- Q lcl|NC_019404. 313 ETFHKLIDRKRNAELLPILEFLIPFIVNA------------EEWSVEFSPLDH-ESSKDKAEVLEKSVNSIAALIAAG-- 377 (418) Q Consensus 313 ~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~------------~~~~~~f~pL~~-~~eke~ae~~~~~a~a~~~~~~~g-- 377 (418) ...--...+.+.-.+.|.+++.+.++.+. .+++++|-+... .-..+.+......++.+..+.+.+ T Consensus 393 ~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~ 472 (555) T protein:vir:10 393 LMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPE 472 (555) T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChh Confidence 33444456667778899999999988653 145666654321 111122222233444444444433 Q ss_pred ---CCCHHHHHHHHHhh-cCcCCC--Chhhccccccc-CCCccccccC Q lcl|NC_019404. 378 ---AMDIKEARDTLRTI-APEIKI--GDNDIQTEESE-LITETEVVIA 418 (418) Q Consensus 378 ---~i~~~e~r~~l~~~-~~~~~~--~~~~~~~~e~~-~~~e~e~~~~ 418 (418) .|+.+++.+.+... +...++ +++++..-..+ ........-| T Consensus 473 vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a 520 (555) T protein:vir:10 473 VLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQA 520 (555) T ss_pred hhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHH Confidence 47788877776543 221121 22221110000 0000000000 No 251 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=91.73 E-value=0.014 Score=30.66 Aligned_cols=386 Identities=12% Similarity=0.112 Sum_probs=162.6 Q ss_pred CccchhhHHH-------Hh-c-CCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhh-------ccCCccccCcc Q lcl|NC_019404. 1 MVKTDSYANI-------FL-G-GSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA-------LAAGFHIDGID 64 (418) Q Consensus 1 ~~~~D~~~n~-------~~-g-~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~-------~r~~~~i~~~~ 64 (418) .-+..-+... +. + ++.+.......-. +-+.+.+++.|..+ .+.||.+.-.+ T Consensus 25 e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~d-------------st~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d 91 (555) T protein:vir:10 25 MSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILD-------------NTGTRALRVLAAGMMAGMTSPARPWFRLTTSI 91 (555) T ss_pred HHHHHHHHHHhCcccccccCCCCCcchhccccccc-------------ccHHHHHHHHHHHHHHhhcCCCCcccccccCc Confidence 0000111110 11 1 1111111111111 11112222222222 23466654221 Q ss_pred h---------------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcc---ccccc-------CCCceEE Q lcl|NC_019404. 65 D---------------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRAL---TSPVR-------EGAELET 119 (418) Q Consensus 65 d---------------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l---~~pl~-------~~~~i~~ 119 (418) . ++.+...+.+-++...+.++++.--.+|.+.+++.-+.++.+ .-|+. ..|.+.. T Consensus 92 ~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~~~v~~d~~G~vd~ 171 (555) T protein:vir:10 92 PELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGEYAIAADNQGRVNT 171 (555) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecceeEEeeCCCCCEEE Confidence 0 123445666778999999999988899999998764433322 23442 3454432 Q ss_pred E-EEe--eccc---------cccc---cccccccccccCcceEEEE-ecCCcc---------cc---c----ccCcccEE Q lcl|NC_019404. 120 V-RVY--DRTQ---------VKVQ---NREENPRNARFGKPLTYRI-TTNESD---------MF---Y----DVHYSRIH 167 (418) Q Consensus 120 i-~v~--~~~~---------i~~~---~~~~dp~s~~yg~p~~y~i-~~~~~~---------~~---~----~iH~SR~i 167 (418) + +-+ ..++ ++.. ....+|...+ -+.|+. .+.... .+ . ..+..+++ T Consensus 172 i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~---v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl 248 (555) T protein:vir:10 172 LYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQW---VTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTL 248 (555) T ss_pred EEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCce---EEEEEEEeeccCcCcCCCCccccceEEEEEEeccCCcccc Confidence 1 111 1111 1111 1111221111 111111 110000 00 0 01111222 Q ss_pred E---ecCcc-ch-hhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHH Q lcl|NC_019404. 168 I---IDGER-VP-NAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRL 242 (418) Q Consensus 168 ~---~~g~~-lp-~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~ 242 (418) . |...| ++ -+.+....-||.||.+. +++.++..........+.+..+.-..+-.+.-. .. ..+ T Consensus 249 ~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~-~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~------~~-----~~~ 316 (555) T protein:vir:10 249 RESGYRSFRALCPRWALVGGDIYGNSPAME-ALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSA------KN-----QDI 316 (555) T ss_pred ccCCcccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeecccc------cc-----ccc Confidence 1 11112 11 23445566899999986 889999999988888888877655555444211 00 000 Q ss_pred HHHHHhcCCcceeEEEcCCCc-ee---EeecccCCHHHHHHHHHHHHhhhhcCCeeeeecc-Cccccccch-----hHHH Q lcl|NC_019404. 243 AQVDNNSGVGQAIGIDAESEE-YS---VLNSDIGGIDAFLDKKFDRIVALSGIHEIILKNK-NVGGLSSSQ-----NTAL 312 (418) Q Consensus 243 ~~~~~~~~~~~~~~~d~~~e~-~~---~~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~-s~~gl~stg-----e~d~ 312 (418) . ........+..+...+ +. ..+.+|+.+...++.+.+.|..++=......+++ ....+.||. ++-. T Consensus 317 ~----~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~ 392 (555) T protein:vir:10 317 S----TVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKL 392 (555) T ss_pred e----eccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHH Confidence 0 0011111122222222 11 2233566667778888888887774432222222 233344431 2222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------CCceEEeCCCCC-CCHHHHHHHHHHHHHHHHHHHhCC-- Q lcl|NC_019404. 313 ETFHKLIDRKRNAELLPILEFLIPFIVNA------------EEWSVEFSPLDH-ESSKDKAEVLEKSVNSIAALIAAG-- 377 (418) Q Consensus 313 ~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~------------~~~~~~f~pL~~-~~eke~ae~~~~~a~a~~~~~~~g-- 377 (418) ...--...+.+.-.+.|.+++.+.++.+. .+++++|-+... .-..+.+......++.+..+.+.+ T Consensus 393 ~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~ 472 (555) T protein:vir:10 393 LMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPE 472 (555) T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChh Confidence 33444456667778899999999988653 145666654321 111122222233444444444433 Q ss_pred ---CCCHHHHHHHHHhh-cCcCCC--Chhhccccccc-CCCccccccC Q lcl|NC_019404. 378 ---AMDIKEARDTLRTI-APEIKI--GDNDIQTEESE-LITETEVVIA 418 (418) Q Consensus 378 ---~i~~~e~r~~l~~~-~~~~~~--~~~~~~~~e~~-~~~e~e~~~~ 418 (418) .|+.+++.+.+... +...++ +++++..-..+ ........-| T Consensus 473 vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a 520 (555) T protein:vir:10 473 VLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQA 520 (555) T ss_pred hhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHH Confidence 47788877776543 221121 22221110000 0000000000 No 252 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=91.73 E-value=0.014 Score=30.66 Aligned_cols=386 Identities=12% Similarity=0.112 Sum_probs=162.6 Q ss_pred CccchhhHHH-------Hh-c-CCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhh-------ccCCccccCcc Q lcl|NC_019404. 1 MVKTDSYANI-------FL-G-GSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA-------LAAGFHIDGID 64 (418) Q Consensus 1 ~~~~D~~~n~-------~~-g-~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~-------~r~~~~i~~~~ 64 (418) .-+..-+... +. + ++.+.......-. +-+.+.+++.|..+ .+.||.+.-.+ T Consensus 25 e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~d-------------st~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d 91 (555) T protein:vir:98 25 MSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILD-------------NTGTRALRVLAAGMMAGMTSPARPWFRLTTSI 91 (555) T ss_pred HHHHHHHHHHhCcccccccCCCCCcchhccccccc-------------ccHHHHHHHHHHHHHHhhcCCCCcccccccCc Confidence 0000111110 11 1 1111111111111 11112222222222 23466654221 Q ss_pred h---------------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcc---ccccc-------CCCceEE Q lcl|NC_019404. 65 D---------------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRAL---TSPVR-------EGAELET 119 (418) Q Consensus 65 d---------------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l---~~pl~-------~~~~i~~ 119 (418) . ++.+...+.+-++...+.++++.--.+|.+.+++.-+.++.+ .-|+. ..|.+.. T Consensus 92 ~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~~~v~~d~~G~vd~ 171 (555) T protein:vir:98 92 PELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGEYAIAADNQGRVNT 171 (555) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecceeEEeeCCCCCEEE Confidence 0 123445666778999999999988899999998764433322 23442 3454432 Q ss_pred E-EEe--eccc---------cccc---cccccccccccCcceEEEE-ecCCcc---------cc---c----ccCcccEE Q lcl|NC_019404. 120 V-RVY--DRTQ---------VKVQ---NREENPRNARFGKPLTYRI-TTNESD---------MF---Y----DVHYSRIH 167 (418) Q Consensus 120 i-~v~--~~~~---------i~~~---~~~~dp~s~~yg~p~~y~i-~~~~~~---------~~---~----~iH~SR~i 167 (418) + +-+ ..++ ++.. ....+|...+ -+.|+. .+.... .+ . ..+..+++ T Consensus 172 i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~---v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl 248 (555) T protein:vir:98 172 LYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQW---VTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTL 248 (555) T ss_pred EEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCce---EEEEEEEeeccCcCcCCCCccccceEEEEEEeccCCcccc Confidence 1 111 1111 1111 1111221111 111111 110000 00 0 01111222 Q ss_pred E---ecCcc-ch-hhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHH Q lcl|NC_019404. 168 I---IDGER-VP-NAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRL 242 (418) Q Consensus 168 ~---~~g~~-lp-~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~ 242 (418) . |...| ++ -+.+....-||.||.+. +++.++..........+.+..+.-..+-.+.-. .. ..+ T Consensus 249 ~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~-~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~------~~-----~~~ 316 (555) T protein:vir:98 249 RESGYRSFRALCPRWALVGGDIYGNSPAME-ALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSA------KN-----QDI 316 (555) T ss_pred ccCCcccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeecccc------cc-----ccc Confidence 1 11112 11 23445566899999986 889999999988888888877655555444211 00 000 Q ss_pred HHHHHhcCCcceeEEEcCCCc-ee---EeecccCCHHHHHHHHHHHHhhhhcCCeeeeecc-Cccccccch-----hHHH Q lcl|NC_019404. 243 AQVDNNSGVGQAIGIDAESEE-YS---VLNSDIGGIDAFLDKKFDRIVALSGIHEIILKNK-NVGGLSSSQ-----NTAL 312 (418) Q Consensus 243 ~~~~~~~~~~~~~~~d~~~e~-~~---~~~~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~-s~~gl~stg-----e~d~ 312 (418) . ........+..+...+ +. ..+.+|+.+...++.+.+.|..++=......+++ ....+.||. ++-. T Consensus 317 ~----~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~ 392 (555) T protein:vir:98 317 S----TVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKL 392 (555) T ss_pred e----eccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHH Confidence 0 0011111122222222 11 2233566667778888888887774432222222 233344431 2222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------CCceEEeCCCCC-CCHHHHHHHHHHHHHHHHHHHhCC-- Q lcl|NC_019404. 313 ETFHKLIDRKRNAELLPILEFLIPFIVNA------------EEWSVEFSPLDH-ESSKDKAEVLEKSVNSIAALIAAG-- 377 (418) Q Consensus 313 ~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~------------~~~~~~f~pL~~-~~eke~ae~~~~~a~a~~~~~~~g-- 377 (418) ...--...+.+.-.+.|.+++.+.++.+. .+++++|-+... .-..+.+......++.+..+.+.+ T Consensus 393 ~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~ 472 (555) T protein:vir:98 393 LMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPE 472 (555) T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChh Confidence 33444456667778899999999988653 145666654321 111122222233444444444433 Q ss_pred ---CCCHHHHHHHHHhh-cCcCCC--Chhhccccccc-CCCccccccC Q lcl|NC_019404. 378 ---AMDIKEARDTLRTI-APEIKI--GDNDIQTEESE-LITETEVVIA 418 (418) Q Consensus 378 ---~i~~~e~r~~l~~~-~~~~~~--~~~~~~~~e~~-~~~e~e~~~~ 418 (418) .|+.+++.+.+... +...++ +++++..-..+ ........-| T Consensus 473 vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a 520 (555) T protein:vir:98 473 VLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQA 520 (555) T ss_pred hhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHH Confidence 47788877776543 221121 22221110000 0000000000 No 253 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=91.67 E-value=0.015 Score=30.62 Aligned_cols=380 Identities=11% Similarity=0.057 Sum_probs=159.0 Q ss_pred CccchhhHHHH-----hc-CCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhh---ccCCccccCcc------- Q lcl|NC_019404. 1 MVKTDSYANIF-----LG-GSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA---LAAGFHIDGID------- 64 (418) Q Consensus 1 ~~~~D~~~n~~-----~g-~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~---~r~~~~i~~~~------- 64 (418) .-+..-+.... .. ++..+......-.-+..+.. +-++.+ .-..+ .+.||.+.-.+ T Consensus 20 e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dstg~~a~-----~~Laa~----l~~~ltpp~~~WF~l~~~d~~l~~~~ 90 (542) T protein:vir:78 20 LDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGV-----NALSSK----LMLSLFPIQTSFFKLQINDAEIASVP 90 (542) T ss_pred HHHHHHHHHHhccccCCCCCCcccccccccccchHHHHH-----HHHHHH----HHHhhcCCCCccccccCCHHHHHhhc Confidence 11111111100 00 00001111111111111111 111111 11122 23455553111 Q ss_pred ---h-------------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCccccccc-------CCCceEEEE Q lcl|NC_019404. 65 ---D-------------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVR-------EGAELETVR 121 (418) Q Consensus 65 ---d-------------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~-------~~~~i~~i~ 121 (418) + ++.+...+.+.+....+.++++.--.||.+++++.- +. --.-|+. ..|.+.. T Consensus 91 ~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~-~~-~~~~pl~~y~v~~d~~G~vd~-- 166 (542) T protein:vir:78 91 ELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAGK-KT-LKVYPLDRYVIERDGDGNVIE-- 166 (542) T ss_pred cCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEecC-CC-ceEEecceeEEeeCCCCCeEE-- Confidence 0 123445667779999999999998889999888642 22 1122332 3344432 Q ss_pred Eeecccccccc--------------cccccccc--ccC--------------------cceEEEEec-CCcccccccCcc Q lcl|NC_019404. 122 VYDRTQVKVQN--------------REENPRNA--RFG--------------------KPLTYRITT-NESDMFYDVHYS 164 (418) Q Consensus 122 v~~~~~i~~~~--------------~~~dp~s~--~yg--------------------~p~~y~i~~-~~~~~~~~iH~S 164 (418) |+-+..++... ....+..| .|. .|..++... .+......+..+ T Consensus 167 v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~~~~~s~~~e~~g~~v~~~~~e~ 246 (542) T protein:vir:78 167 IITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQHRWHQECDGKEIKGSRSSS 246 (542) T ss_pred EeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccccCCCeEEEEEEecccccccccccc Confidence 22222222111 00011110 000 010111100 000000000111 Q ss_pred cEEEecCcc--chhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHH Q lcl|NC_019404. 165 RIHIIDGER--VPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRL 242 (418) Q Consensus 165 R~i~~~g~~--lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~ 242 (418) -|...| ++-+.+....-||.||.+. +++.++............+.++.-..+..+.- +. ..+. T Consensus 247 ---g~~~~P~i~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~--------g~---~~~~ 311 (542) T protein:vir:78 247 ---PLKHSPWLPLRFNVVDGESYGRGRVEE-FFGDLSSLDALTRSLIEGSAAAAKVVFMVSPS--------AT---TKPQ 311 (542) T ss_pred ---ccccCCceeeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeeccc--------cc---cchh Confidence 111111 1123445566799999986 88999999999999998888876666555421 10 0110 Q ss_pred HHHHHhcCCcceeEEEcCCCceeEeec----ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch-----hHHHH Q lcl|NC_019404. 243 AQVDNNSGVGQAIGIDAESEEYSVLNS----DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ-----NTALE 313 (418) Q Consensus 243 ~~~~~~~~~~~~~~~d~~~e~~~~~~~----~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg-----e~d~~ 313 (418) . ...+.++. ++.+..+++..+.. +|......++.+.+.|.-++=+ +-.+....+.||- ++-.. T Consensus 312 ~---~~~~~~g~-iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~----~~~~d~~rvTAtEV~~r~~E~~~ 383 (542) T protein:vir:78 312 S---LARAGTGA-IIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLI----LNVRQSERTTATEVREVQMELDR 383 (542) T ss_pred h---cccCCCce-eecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhcc----cccCCcccccHHHHHHHHHHHHH Confidence 0 01122333 34455566655542 3555677788888888877521 1112233333321 11123 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccC--------CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhC-------CC Q lcl|NC_019404. 314 TFHKLIDRKRNAELLPILEFLIPFIVNAE--------EWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAA-------GA 378 (418) Q Consensus 314 ~y~~~I~~~Qe~~l~p~l~~l~~~i~~~~--------~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~-------g~ 378 (418) .+--.+.+.+...+.|++++.+.++.+.. -++++|.+.. ....|+.-..+..+.++.+.+. -. T Consensus 384 ~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~L--a~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~ 461 (542) T protein:vir:78 384 QLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGL--GGVGRGEDRAALIEFMQTVGQAMGPEALQQF 461 (542) T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeeechH--HHHHHHHHHHHHHHHHHHHHHhcCChhHHhc Confidence 34445566777788999999999887542 3455555322 2233332223333333333221 13 Q ss_pred CCHHHHHHHHHhh-cCc-CCC--Chhhcccc--cccCCCccc------cccC Q lcl|NC_019404. 379 MDIKEARDTLRTI-APE-IKI--GDNDIQTE--ESELITETE------VVIA 418 (418) Q Consensus 379 i~~~e~r~~l~~~-~~~-~~~--~~~~~~~~--e~~~~~e~e------~~~~ 418 (418) |+.+++.+.+... +.. ..+ +++++... +.......+ ++.| T Consensus 462 id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a 513 (542) T protein:vir:78 462 IDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLA 513 (542) T ss_pred CCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 6677777666432 221 111 12221110 000000000 1111 No 254 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=91.50 E-value=0.016 Score=30.50 Aligned_cols=380 Identities=12% Similarity=0.090 Sum_probs=170.9 Q ss_pred CccchhhHHHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcc---------h------ Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGID---------D------ 65 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~---------d------ 65 (418) ..+.|+ +..+......-.-+..+...-.. +++...+. |+ +.||.+.-.+ + T Consensus 42 ~~~~~~--------~~~~~~~~~~~dst~~~a~~~La-a~l~~~lt--P~----~~WFrl~~~d~~~~~~~~~~~~~~~v 106 (536) T protein:vir:10 42 LFPKDS--------DNASTDYQTPWQAVGARGLNNLA-SKLMLALF--PM----QTWMRLTISEYEAKQLLSDPDGLAKV 106 (536) T ss_pred ccCCCC--------CcccccccccccccHHHHHHHHH-HHHHhhhc--CC----CcccccccChhhhhccccchhhHHHH Confidence 222222 11111111111112333322211 34444332 54 3477774211 0 Q ss_pred -------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCc---c-ccccc-------CCCceEEEEEeeccc Q lcl|NC_019404. 66 -------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRA---L-TSPVR-------EGAELETVRVYDRTQ 127 (418) Q Consensus 66 -------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~---l-~~pl~-------~~~~i~~i~v~~~~~ 127 (418) ++.+...+.+-++...+.++++.--.+|.+++++.-+.+.. . .-|+. ..|.+..+ +-+.. T Consensus 107 ~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i--~r~~~ 184 (536) T protein:vir:10 107 DEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQM--VTRDQ 184 (536) T ss_pred HHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEcCeEEEeeCCCCCeeEE--eeeee Confidence 12345556677899999999998888999998885443321 1 23442 34544332 22222 Q ss_pred cccc----cccccccc-----cccCcceEEEE-ecCCcccccccCc----ccEEEecC------cc-ch-hhhhhccccC Q lcl|NC_019404. 128 VKVQ----NREENPRN-----ARFGKPLTYRI-TTNESDMFYDVHY----SRIHIIDG------ER-VP-NAMRRQNDGW 185 (418) Q Consensus 128 i~~~----~~~~dp~s-----~~yg~p~~y~i-~~~~~~~~~~iH~----SR~i~~~g------~~-lp-~~~~~~~~~~ 185 (418) ++.. .+..+..+ ..+...+.|+. ..........+|. .++....| .| ++ -+.+....-| T Consensus 185 ~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~Y 264 (536) T protein:vir:10 185 IAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESY 264 (536) T ss_pred ccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEEeecCccccccccccccccCCceeeeeeecCCCcc Confidence 2211 11111110 11222233322 1111111112221 12222222 11 11 2334556679 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCcee Q lcl|NC_019404. 186 GRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYS 265 (418) Q Consensus 186 G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~ 265 (418) |.||.+. +++.++...............+.-..+..+. .+... ..++ ..+.++.+ +-+..+++. T Consensus 265 Grgp~~~-~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p--------~g~~~-~~~~-----~~~~~g~~-v~g~~~~v~ 328 (536) T protein:vir:10 265 GRSYIEE-YLGDLRSLENLQEAIVKMSMISSKVIGLVNP--------AGITQ-PRRL-----TKAQTGDF-VTGRPEDIS 328 (536) T ss_pred ccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCcccCc--------ccccc-hhhh-----ccCCCcce-ecCCcccce Confidence 9999986 7899999999988888876665444443331 11100 0011 12223333 333335555 Q ss_pred Eee----cccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch-----hHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 266 VLN----SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ-----NTALETFHKLIDRKRNAELLPILEFLIP 336 (418) Q Consensus 266 ~~~----~~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg-----e~d~~~y~~~I~~~Qe~~l~p~l~~l~~ 336 (418) .+. .+|......++.+.+.|.-++=+- .+.-+....+.+|- ++-....--.+.+.+...+.|.+++++. T Consensus 329 ~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~ 406 (536) T protein:vir:10 329 FLQLEKQADFTVAKAVSDAIEARLSFAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLK 406 (536) T ss_pred eeeccccccchHHHHHHHHHHHHHHHHHhhh--hcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 443 245556777888888888777221 22223344454431 1122333445566788889999999999 Q ss_pred HhhccC--------CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCC------CCCHHHHHHHHHhh-cC-cCCC--C Q lcl|NC_019404. 337 FIVNAE--------EWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAG------AMDIKEARDTLRTI-AP-EIKI--G 398 (418) Q Consensus 337 ~i~~~~--------~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g------~i~~~e~r~~l~~~-~~-~~~~--~ 398 (418) ++.... .+.+++.+. +....++.-..+..+.++.+.+.+ .|+.+++.+.+... +. ..++ + T Consensus 407 il~r~g~lP~~p~~~v~~~~vs~--l~~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt 484 (536) T protein:vir:10 407 QLQATQQIPELPKEAVEPTISTG--LEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLT 484 (536) T ss_pred HHHhCCCCCCCChhhccceEEec--HHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCCchhhcCC Confidence 986532 234444321 223333333333333444443332 47888888776543 21 1111 2 Q ss_pred hhhcccccccCCCccccccC Q lcl|NC_019404. 399 DNDIQTEESELITETEVVIA 418 (418) Q Consensus 399 ~~~~~~~e~~~~~e~e~~~~ 418 (418) +++....-.+ ..+...+-+ T Consensus 485 ~eev~~~r~q-~~~~~~~~~ 503 (536) T protein:vir:10 485 EEQKQQKMAQ-QSMQMGMDN 503 (536) T ss_pred HHHHHHHHHH-HHHHHHHHH Confidence 2222111000 000000000 No 255 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=91.45 E-value=0.016 Score=30.46 Aligned_cols=390 Identities=9% Similarity=0.077 Sum_probs=156.1 Q ss_pred CccchhhH-----HHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcc----------- Q lcl|NC_019404. 1 MVKTDSYA-----NIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGID----------- 64 (418) Q Consensus 1 ~~~~D~~~-----n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~----------- 64 (418) .-+..-+. +.+..-++...... .-.-+..+.. +-++.++.-..- =..+.||.+.-.+ T Consensus 31 e~~w~e~a~~~lP~~~~~~~~~~~~~~-~~dstg~~a~-----~~LAa~l~~~lt-pp~~~WF~L~~~~~~~~~~~~~~~ 103 (516) T protein:vir:96 31 LDRAKHYSKLTLPYLMNDKGDNETSQN-GWQGVGAQAT-----NHLANKLAQVLF-PAQRSFFRVDLTAQGEKVLNQRGL 103 (516) T ss_pred HHHHHHHHHhhcccccCCCCCccccCC-cccchHHHHH-----HHHHHHHHhhhc-CCCCcccccccChhHHhhccccCc Confidence 00000000 01111111111000 0011111111 111122111100 0123455543111 Q ss_pred -----------hHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccc-------cCCCceEEEEEeecc Q lcl|NC_019404. 65 -----------DEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPV-------REGAELETVRVYDRT 126 (418) Q Consensus 65 -----------d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl-------~~~~~i~~i~v~~~~ 126 (418) -++.+.+.+.+-++...+.+++..--.+|.+++++.-+++ --.-|+ +..|.+.. ++-+. T Consensus 104 ~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~~-~~~~pl~~y~v~~d~~G~v~~--i~rr~ 180 (516) T protein:vir:96 104 KKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKGA-ISAIPMHHYVVNRDTNGDLLD--IILLQ 180 (516) T ss_pred hhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCCC-EEEEEcCeEEEeeCCCCCeee--ehhhh Confidence 0122445566778999999999998889999888742221 112344 23344432 22222 Q ss_pred ccccccccc-----------cccccccCcceEEEEecCCcccccccCc----ccEEEecC-----cc-c-hhhhhhcccc Q lcl|NC_019404. 127 QVKVQNREE-----------NPRNARFGKPLTYRITTNESDMFYDVHY----SRIHIIDG-----ER-V-PNAMRRQNDG 184 (418) Q Consensus 127 ~i~~~~~~~-----------dp~s~~yg~p~~y~i~~~~~~~~~~iH~----SR~i~~~g-----~~-l-p~~~~~~~~~ 184 (418) .++...... +.....+...+.|...-........+|. -++..-.+ .| + +-+.+..+.- T Consensus 181 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~d~~~~~~es~~~~~e~P~~~~Rw~~~~ge~ 260 (516) T protein:vir:96 181 EKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWELKQSADDIPVGKVSKIKSEKLPFIPLTWKRSYGED 260 (516) T ss_pred HhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCceeEEEEEeCceeeccccccccccCCeeeeeeeecCCCC Confidence 222211110 0001112233333221111111111121 11211111 11 1 1234556668 Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCce Q lcl|NC_019404. 185 WGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEY 264 (418) Q Consensus 185 ~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~ 264 (418) ||.||.+. +++.++............+..+.-..+-.+. .+. .... ....-..+.++.+..+++ T Consensus 261 YGrgp~~~-~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p--------~g~---~~~~----~l~~~~~g~i~~g~~~~v 324 (516) T protein:vir:96 261 WGRPLAED-YSGDLFVIQFLSEAVARGAALMADIKYLIRP--------GAQ---TDVD----HFVNSGTGEVVTGVEEDI 324 (516) T ss_pred cccchHHH-hhHHHHHHHHHHHHHHHHHHHhcCCccccCc--------ccc---cchh----hhccCCCceeecCCcccc Confidence 99999986 7899999988888777766555444443321 111 0111 111122234455666677 Q ss_pred eEeecc----cCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch-----hHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 265 SVLNSD----IGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ-----NTALETFHKLIDRKRNAELLPILEFLI 335 (418) Q Consensus 265 ~~~~~~----~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg-----e~d~~~y~~~I~~~Qe~~l~p~l~~l~ 335 (418) ..++.. |..+...++.+.+.|..++=+- .|.-+....+.+|- ++-....--.+.+.|.-.+.|++++++ T Consensus 325 ~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~--~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l 402 (516) T protein:vir:96 325 HIVQLGKYADLTPISAVLEVYTRRIGVVFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGL 402 (516) T ss_pred eeeecCcccchhHHHHHHHHHHHHHHHHHhhh--hhccCCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 766543 4556677888888888776321 12222344454431 111123333445566667777777765 Q ss_pred HHhhcc---CCceEEeC-CCCCCCHHHHHHHHHHHHHHHHHHHhCC-----CCCHHHHHHHHHhh-cCcCCC--Chhhcc Q lcl|NC_019404. 336 PFIVNA---EEWSVEFS-PLDHESSKDKAEVLEKSVNSIAALIAAG-----AMDIKEARDTLRTI-APEIKI--GDNDIQ 403 (418) Q Consensus 336 ~~i~~~---~~~~~~f~-pL~~~~eke~ae~~~~~a~a~~~~~~~g-----~i~~~e~r~~l~~~-~~~~~~--~~~~~~ 403 (418) ..+... ..+.+++- +|..+....+++-...-++.+..+.+.. .|+.+++.+.+... +....+ ++++.. T Consensus 403 ~~~~p~lp~~~v~~~~vs~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~ 482 (516) T protein:vir:96 403 LEAGESFTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMA 482 (516) T ss_pred HhcCCCCccccccceeechHHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHH Confidence 544211 12333322 2222222223332233333333333222 46777777666432 221111 122211 Q ss_pred cccc-cCCCc------------------cccccC Q lcl|NC_019404. 404 TEES-ELITE------------------TEVVIA 418 (418) Q Consensus 404 ~~e~-~~~~e------------------~e~~~~ 418 (418) ..-. ..+.. .+++=| T Consensus 483 ~~~~~~~~~q~~~~~a~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:96 483 QEQEAQMQAQQAQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhhHHhhcccccC Confidence 0000 00000 000000 No 256 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=90.95 E-value=0.018 Score=30.12 Aligned_cols=381 Identities=11% Similarity=0.109 Sum_probs=162.8 Q ss_pred CccchhhHHH-----Hhc-CCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhh---ccCCccccCcc------- Q lcl|NC_019404. 1 MVKTDSYANI-----FLG-GSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA---LAAGFHIDGID------- 64 (418) Q Consensus 1 ~~~~D~~~n~-----~~g-~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~---~r~~~~i~~~~------- 64 (418) .-+..-+... +.. ++.+.+.....-.-+..+.. +-++.++ -..+ .+.||.+.-.+ T Consensus 29 e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~-----~~LAa~L----~~~ltpp~~~WF~l~~~d~~l~~~~ 99 (532) T protein:vir:99 29 ETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGL-----NNLASKL----MLALFPVGSSFFKLNVSELEVKQSI 99 (532) T ss_pred HHHHHHHHHHhhhcccCCCCCcchhhccccccchHHHHH-----HHHHHHH----HHhhcCCCCccccccCCHHHHhccC Confidence 0000000000 000 01111111111111111111 1111111 1112 23455543111 Q ss_pred ----h-----------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCC---c---ccccc-------cCCCc Q lcl|NC_019404. 65 ----D-----------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNR---A---LTSPV-------REGAE 116 (418) Q Consensus 65 ----d-----------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~---~---l~~pl-------~~~~~ 116 (418) + ++.+...+.+-++...+.++++.--.+|.+++++.-++.. . -..|+ +..|. T Consensus 100 ~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~f~~~pl~~y~v~~d~~G~ 179 (532) T protein:vir:99 100 TSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDN 179 (532) T ss_pred CChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccccccCcccceEEEEcCeEEEeeCCCCC Confidence 0 1234456667789999999999988899999988643211 1 11244 23454 Q ss_pred eEEEE---Eeecccccccccc---ccc--cccccCcceEEEE-ecCCcccccccCc---cc-EEEecC------cc-c-h Q lcl|NC_019404. 117 LETVR---VYDRTQVKVQNRE---ENP--RNARFGKPLTYRI-TTNESDMFYDVHY---SR-IHIIDG------ER-V-P 175 (418) Q Consensus 117 i~~i~---v~~~~~i~~~~~~---~dp--~s~~yg~p~~y~i-~~~~~~~~~~iH~---SR-~i~~~g------~~-l-p 175 (418) +..+. -+....++..... ..+ .+| +...+.|+. ........+.+|+ +. +....| .| + + T Consensus 180 v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p-~~~v~v~~~v~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~P~~~~ 258 (532) T protein:vir:99 180 VLQIVTEDKIARAALPEDVRKSLEDAQGDQNP-SEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPV 258 (532) T ss_pred eeeEeeeeeecHHhcChHHHHHhhccccccCC-CcceEEEEEEEecCCCCeeEEEEeecCceecccccccccccCCceee Confidence 53321 1233333322110 000 011 122233321 1111111111111 11 111111 11 1 1 Q ss_pred hhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCccee Q lcl|NC_019404. 176 NAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAI 255 (418) Q Consensus 176 ~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~ 255 (418) -+.+..+.-||.||.+. +++.++...............+.-..+-.+. .+... ..++ ..+.++. T Consensus 259 Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p--------~g~~~-~~~~-----~~~~~g~- 322 (532) T protein:vir:99 259 RLIKMPNEDYGRSFVEE-YLGDLKSLENLYEAIVKMSMISSKVLFFVNP--------NGVTQ-IRRV-----AKANTGD- 322 (532) T ss_pred eeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHcCCCceecc--------ccccc-hhhh-----ccCCCcc- Confidence 23445566799999986 7899999999988888877766544444331 11000 0011 1122333 Q ss_pred EEEcCCCceeEeec----ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch-----hHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 256 GIDAESEEYSVLNS----DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ-----NTALETFHKLIDRKRNAE 326 (418) Q Consensus 256 ~~d~~~e~~~~~~~----~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg-----e~d~~~y~~~I~~~Qe~~ 326 (418) ++-+..+++..+.. +|.-....++.+.+.|..++=+- .+..+....+.+|- ++-...+--.+.+.|... T Consensus 323 ~v~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~ 400 (532) T protein:vir:99 323 FVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN--SAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQEL 400 (532) T ss_pred eecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhh--hcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 33344455555542 35556777888888887776211 12223334454431 222234445566677888 Q ss_pred HHHHHHHHHHHhhccC------------CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhC-----CCCCHHHHHHHHH Q lcl|NC_019404. 327 LLPILEFLIPFIVNAE------------EWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAA-----GAMDIKEARDTLR 389 (418) Q Consensus 327 l~p~l~~l~~~i~~~~------------~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~-----g~i~~~e~r~~l~ 389 (418) +.|.+++.+.++.+.. ++ +.+ +..+...++.+ ..++.++.+.+. -.|+.+++.+.+. T Consensus 401 l~Pli~r~~~il~r~g~lP~~p~~~~~~~i-v~~--is~Laraq~~~---~l~~~~~~laq~~p~~~d~id~d~~~~~~a 474 (532) T protein:vir:99 401 QLPLVKILLKELQATSKIPNLPKEAVEPAI-ATG--LEALGRGHDLN---KLNVFIDYMIKLAGLQDDDINLLDVKMRLA 474 (532) T ss_pred HHHHHHHHHHHHHhcCCCCCCChhhcccce-eec--chHHHHHHHHH---HHHHHHHHHHhhcchhhhhCCHHHHHHHHH Confidence 9999999999986531 11 222 22233333333 333333333332 2577888877765 Q ss_pred hhcCcCCCChhhcccccccCCCcccc-cc------C Q lcl|NC_019404. 390 TIAPEIKIGDNDIQTEESELITETEV-VI------A 418 (418) Q Consensus 390 ~~~~~~~~~~~~~~~~e~~~~~e~e~-~~------~ 418 (418) ... |++...+-..+.+...+.+. .- | T Consensus 475 ~~~---GV~~~~i~r~~ee~~~~~~q~~~~~~~~~a 507 (532) T protein:vir:99 475 NSL---GMDTTGLILTQQDKQAKMAEASTAAGMVTA 507 (532) T ss_pred HHh---CCChhhccCCHHHHHHHHHHHHHHHHHHHH Confidence 432 22222221111111100000 00 0 No 257 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=90.44 E-value=0.021 Score=29.80 Aligned_cols=388 Identities=11% Similarity=0.040 Sum_probs=172.6 Q ss_pred CccchhhHH-----HHhcCCC-CccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcc---------- Q lcl|NC_019404. 1 MVKTDSYAN-----IFLGGSD-GSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGID---------- 64 (418) Q Consensus 1 ~~~~D~~~n-----~~~g~~~-~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~---------- 64 (418) .-+..-+.. .+.+-++ .++.....-.-+..+...-.. +++...+. |+ +.||.+.-.+ T Consensus 29 e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~La-a~l~~~lt--P~----~~WF~l~~~d~~~~~~~~~~ 101 (535) T protein:vir:15 29 ETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLA-SKLMLALF--PM----QSWMKLTISEYEAKQLVGDP 101 (535) T ss_pred HHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHH-HHHHHhhc--CC----CcccccccChHHHhccCCCc Confidence 000111110 1111111 111111111112222222211 34444332 53 3588774211 Q ss_pred -h-----------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCc---cccccc-------CCCceEEEEE Q lcl|NC_019404. 65 -D-----------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRA---LTSPVR-------EGAELETVRV 122 (418) Q Consensus 65 -d-----------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~---l~~pl~-------~~~~i~~i~v 122 (418) + ++.+...+.+.++...+.++++.--.+|.+.+++.-..+.. -.-|+. ..|.+..+.- T Consensus 102 ~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r 181 (535) T protein:vir:15 102 DGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRLSSYVVQRDAYGNVLQIVT 181 (535) T ss_pred chHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEcCeeEEeeCCCCCeeEEEE Confidence 0 11244556677899999999998889999988875332221 122332 3344432211 Q ss_pred ---eeccccccccccc---c-ccccccCcceEEEEe-cCCcccccccCcc----cEE------EecCcc--chhhhhhcc Q lcl|NC_019404. 123 ---YDRTQVKVQNREE---N-PRNARFGKPLTYRIT-TNESDMFYDVHYS----RIH------IIDGER--VPNAMRRQN 182 (418) Q Consensus 123 ---~~~~~i~~~~~~~---d-p~s~~yg~p~~y~i~-~~~~~~~~~iH~S----R~i------~~~g~~--lp~~~~~~~ 182 (418) +..+++....... + .....+-..+.|+.. .........+|.+ ++. .|...| ++-+.+..+ T Consensus 182 ~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~g 261 (535) T protein:vir:15 182 RDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDG 261 (535) T ss_pred eEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEEEEEeeCccccccccccccccCCceeeeeeecCC Confidence 1222222111000 0 000011122223221 1100011111110 110 011111 112344556 Q ss_pred ccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCC Q lcl|NC_019404. 183 DGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESE 262 (418) Q Consensus 183 ~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e 262 (418) .-||.||.+. +++.++............+.++.-..+..+.- +. ..+... ..+ ..+.++.+..+ T Consensus 262 e~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~------g~-----~~~~~l---~~~-~~g~~v~g~~~ 325 (535) T protein:vir:15 262 ESYGRSYCEE-YLGDLRSLENLQEAIVKMSMISAKVIGLVNPA------GI-----TQPRRL---TKA-QTGDFVPGRRE 325 (535) T ss_pred CccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeeccc------cc-----ccchhc---ccC-CceeeecCCcc Confidence 6799999986 88999999999999998888876655554421 10 011000 111 22233445556 Q ss_pred ceeEeec----ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccc-----hhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 263 EYSVLNS----DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSS-----QNTALETFHKLIDRKRNAELLPILEF 333 (418) Q Consensus 263 ~~~~~~~----~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~st-----ge~d~~~y~~~I~~~Qe~~l~p~l~~ 333 (418) ++..+.. +|......++.+.+.|.-++=+- -+..+....+.+| .++-...+--.+.+++...+.|++++ T Consensus 326 ~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r 403 (535) T protein:vir:15 326 DIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRV 403 (535) T ss_pred cceeeecccccchhHHHHHHHHHHHHHHHHHhhh--hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH Confidence 6666643 45567777888888888776111 1222334445443 12223445566677888889999999 Q ss_pred HHHHhhcc--------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCC------CCCHHHHHHHHHhhcCcCCCCh Q lcl|NC_019404. 334 LIPFIVNA--------EEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAG------AMDIKEARDTLRTIAPEIKIGD 399 (418) Q Consensus 334 l~~~i~~~--------~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g------~i~~~e~r~~l~~~~~~~~~~~ 399 (418) ++.++.+. +.++++|-+.. ....|..-..+..+.++.+.+.+ .|+.+++.+.+.... |++. T Consensus 404 ~~~il~r~g~lP~~p~~~v~~~yis~L--a~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~---Gvp~ 478 (535) T protein:vir:15 404 LLKQLQATSQIPELPKEAVEPTISTGL--EAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAI---GIDT 478 (535) T ss_pred HHHHHHhcCCCCCCCccceeEEEecHH--HHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHc---CCCh Confidence 99998653 35667765432 22233222223333333333322 367777776664322 2222 Q ss_pred hhcccccccCC------CccccccC Q lcl|NC_019404. 400 NDIQTEESELI------TETEVVIA 418 (418) Q Consensus 400 ~~~~~~e~~~~------~e~e~~~~ 418 (418) ..+-..+.+.. .+.+.+-+ T Consensus 479 ~~i~~~~eev~~~~~q~~~~~~~~~ 503 (535) T protein:vir:15 479 SGILLTDEQKQALMMQDAAQTGIEN 503 (535) T ss_pred hhhcCCHHHHHHHHHHHHHHHHHHH Confidence 22211111100 00000000 No 258 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=88.10 E-value=0.034 Score=28.60 Aligned_cols=389 Identities=9% Similarity=0.091 Sum_probs=150.6 Q ss_pred CccchhhH-----HHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcc----------- Q lcl|NC_019404. 1 MVKTDSYA-----NIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGID----------- 64 (418) Q Consensus 1 ~~~~D~~~-----n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~----------- 64 (418) .-+..-+. +.+..-++..+... .-.-+..+.. +-++.++.-..- =..+.||.+.-.+ T Consensus 31 e~~w~e~a~~~lP~~~~~~~~~~~~~~-~~dstg~~a~-----~~LAa~l~~~lt-pp~~~WF~L~~~d~~~~~~~~~~~ 103 (516) T protein:vir:10 31 LDRAKHYSKLTLPYLMNDKGDNETSQN-GWQGVGAQAT-----NHLANKLAQVLF-PAQRSFFRVDLTAQGEKVLNQRGL 103 (516) T ss_pred HHHHHHHHHhhcccccCCCCCcccccc-cccchHHHHH-----HHHHHHHHhhhc-CCCCccccccCChhhHhhhhccCc Confidence 00000000 00110000000000 0000111111 111111111100 0112344443111 Q ss_pred -----------hHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccc-------cCCCceEEEEEeecc Q lcl|NC_019404. 65 -----------DEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPV-------REGAELETVRVYDRT 126 (418) Q Consensus 65 -----------d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl-------~~~~~i~~i~v~~~~ 126 (418) -++.+.+.+.+-++...+.+++..--.+|.+++++.-+++- -.-|+ +..|.+..+ +-+. T Consensus 104 ~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~~~-~~~pl~~y~v~~d~~G~v~~i--vrr~ 180 (516) T protein:vir:10 104 KKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKGAI-SAIPMHHYVVNRDTNGDLLDI--ILLQ 180 (516) T ss_pred hhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCCCe-EEEEcCeEEEeeCCCCCeEEE--eeee Confidence 01234456677789999999999988899998877433221 12344 234544332 2122 Q ss_pred cccccc----cc-------ccccccccCcceEEEEecCCcccccccCc--ccEEE-------ecCcc-c-hhhhhhcccc Q lcl|NC_019404. 127 QVKVQN----RE-------ENPRNARFGKPLTYRITTNESDMFYDVHY--SRIHI-------IDGER-V-PNAMRRQNDG 184 (418) Q Consensus 127 ~i~~~~----~~-------~dp~s~~yg~p~~y~i~~~~~~~~~~iH~--SR~i~-------~~g~~-l-p~~~~~~~~~ 184 (418) .++... +. .......+.+.+.|.-.-........+|. ..... |...| + +-+.+..+.- T Consensus 181 ~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~~~~~~~~~~~d~~~~~~~s~~~~~e~P~~~~Rw~~~~ge~ 260 (516) T protein:vir:10 181 EKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLGEGFWELKQSADDIPVGKVSKIKSEKLPFIPLTWKRSYGED 260 (516) T ss_pred cccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecCCCceEEEEeeCceeeccccccccccCCeeeeeeeecCCCC Confidence 222111 00 00111112333333211110000011111 11111 11111 1 1234556668 Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCce Q lcl|NC_019404. 185 WGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEY 264 (418) Q Consensus 185 ~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~ 264 (418) ||.||.+. +++.++............+..+.-..+-.+. .+. ..+. .......+.++.+..+++ T Consensus 261 YGrgp~~~-~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p--------~g~---~~~~----~l~~~~~g~~~~g~~~~v 324 (516) T protein:vir:10 261 WGRPLAED-YSGDLFVIQFLSEAVARGAALMADIKYLIRP--------GAQ---TDVD----HFVNSGTGEVVTGVEEDI 324 (516) T ss_pred cccchHHH-hhHHHHHHHHHHHHHHHHHHHhcCCCcccCc--------ccc---cchh----hhccCCCceeecCCcccc Confidence 99999986 7899999998888887777655544444331 111 0111 111122234455666677 Q ss_pred eEeecc----cCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch-----hHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 265 SVLNSD----IGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ-----NTALETFHKLIDRKRNAELLPILEFLI 335 (418) Q Consensus 265 ~~~~~~----~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg-----e~d~~~y~~~I~~~Qe~~l~p~l~~l~ 335 (418) ..++.. |..+...++.+.+.|..++=+- .|.-+....+.+|- ++-....--.+.+.|.-.+.|.+++.. T Consensus 325 ~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~--~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~ 402 (516) T protein:vir:10 325 HIVQLGKYADLTPISAVLEVYTRRIGVVFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGL 402 (516) T ss_pred eeeecCcccchHHHHHHHHHHHHHHHHHHhhh--hhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 766543 4555677788888887765332 12222333454431 111122333444455556677776664 Q ss_pred HHhhc--cC---CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCC-----CCCHHHHHHHHHh-hcCcCCC--Chhhc Q lcl|NC_019404. 336 PFIVN--AE---EWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAG-----AMDIKEARDTLRT-IAPEIKI--GDNDI 402 (418) Q Consensus 336 ~~i~~--~~---~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g-----~i~~~e~r~~l~~-~~~~~~~--~~~~~ 402 (418) ..+.. ++ +..+ =.+|..+....+++-....++.+..+.+.. .|+.+++.+.+.. .+...++ +++++ T Consensus 403 ~~~~p~~P~~lv~~~~-v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~~~irs~eev 481 (516) T protein:vir:10 403 LEAGDSFTSDLVDPVI-ITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEM 481 (516) T ss_pred HhhCCCCChhhcCcce-ehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCChhccCCHHHH Confidence 33321 11 1111 012222333333333333333333333222 2455554444332 2222221 22222 Q ss_pred ccccccC-CCccccccC Q lcl|NC_019404. 403 QTEESEL-ITETEVVIA 418 (418) Q Consensus 403 ~~~e~~~-~~e~e~~~~ 418 (418) .....+- ..+...|-| T Consensus 482 ~~~r~~~~~~q~~~~~~ 498 (516) T protein:vir:10 482 EQEQEAQMQAQQAQMLE 498 (516) T ss_pred HHHHHHHHHHHHHHHHH Confidence 1110000 001111111 No 259 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=87.14 E-value=0.041 Score=28.20 Aligned_cols=386 Identities=12% Similarity=0.065 Sum_probs=165.7 Q ss_pred CccchhhHHHH-----hc-CCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcc---------- Q lcl|NC_019404. 1 MVKTDSYANIF-----LG-GSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGID---------- 64 (418) Q Consensus 1 ~~~~D~~~n~~-----~g-~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~---------- 64 (418) .-+..-+...+ .. +++..+.....-.-+..+...-.. +++...+. |+ +.||.+.-.+ T Consensus 30 e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~La-a~l~~~lt--P~----~~WF~l~~~d~~~~~~~~~~ 102 (535) T protein:vir:94 30 ETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLA-SKLMLALF--PM----QTWMKLTISEFEAKQLVAQP 102 (535) T ss_pred HHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHHHHHH-HHHHhhhc--CC----CCccccccChhhhhccccch Confidence 11111111110 00 011111111111122223322211 34444442 54 3477763211 Q ss_pred -h-----------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCc---ccccc-------cCCCceEEEEE Q lcl|NC_019404. 65 -D-----------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRA---LTSPV-------REGAELETVRV 122 (418) Q Consensus 65 -d-----------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~---l~~pl-------~~~~~i~~i~v 122 (418) + ++.+...+.+.++...+.++++.--.+|.+++++.-+.+.. -.-|+ +..|.+..+.- T Consensus 103 ~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~y~v~~d~~G~vd~i~r 182 (535) T protein:vir:94 103 AELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEPEGTYNPMKLYRLSSYVVQRDAFGTVLQIVT 182 (535) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCcCcccceEEEEcCeEEEeeCCCCCeEEEEe Confidence 0 01234456667899999999998778999998875432221 12333 23455533211 Q ss_pred ---eeccccccccc---cccc-cccccCcceEEEEe-cCCcccccccCcc---cEE-------EecCcc-c-hhhhhhcc Q lcl|NC_019404. 123 ---YDRTQVKVQNR---EENP-RNARFGKPLTYRIT-TNESDMFYDVHYS---RIH-------IIDGER-V-PNAMRRQN 182 (418) Q Consensus 123 ---~~~~~i~~~~~---~~dp-~s~~yg~p~~y~i~-~~~~~~~~~iH~S---R~i-------~~~g~~-l-p~~~~~~~ 182 (418) +...+++.... .... ..| +-..+.|+.. .......+.+|.+ ..+ -|...| + +-+.+..+ T Consensus 183 ~~~~~~~~l~~~~~~~~~~~~~~~~-~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~g 261 (535) T protein:vir:94 183 LDKTAYAALPEDVRNSMDSSQEHKG-DEMIDVYTHIYLDEESGEYLKYEEIDGVEVEGTDASYPVDACPYIPVRMVRIDG 261 (535) T ss_pred eeeccHHHhhHHHHHHHHhccccCC-CceeEEEEEEEeeCCCCcEEEEEEecCeeeccccccCccccCCceeeeeeecCC Confidence 11222222110 0000 011 1112223221 1111111222211 111 111112 1 12334556 Q ss_pred ccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCC Q lcl|NC_019404. 183 DGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESE 262 (418) Q Consensus 183 ~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e 262 (418) .-||.||.+. +++.++...............+.-..+..+. .+.... .++ ..+-++. ++-+..+ T Consensus 262 e~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p--------~g~~~~-~~~-----~~~~~g~-~v~g~~~ 325 (535) T protein:vir:94 262 ESYGRSYCEE-YLGDLRSLENLQEAIVKMSMISAKVIGLVNP--------AGITQV-RRL-----TKAQTGD-FVSGRPE 325 (535) T ss_pred CccccchHHH-HHHHHHHHHHHHHHHHHHHHHhccCCccccc--------ccccch-hhc-----ccCCCce-eecCCcc Confidence 6799999986 7899999998888777766655433333321 111000 011 1112233 3444456 Q ss_pred ceeEeec----ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccc-----hhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 263 EYSVLNS----DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSS-----QNTALETFHKLIDRKRNAELLPILEF 333 (418) Q Consensus 263 ~~~~~~~----~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~st-----ge~d~~~y~~~I~~~Qe~~l~p~l~~ 333 (418) ++..+.. +|.-....++.+.+.|..++= ...+.......+.+| .++-...+--.+.+.+...+.|.+++ T Consensus 326 ~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~--~~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r 403 (535) T protein:vir:94 326 DISFLQLEKAADFSVARAVSEQIEGRLSYAFM--LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRV 403 (535) T ss_pred cceeeecccccchhHHHHHHHHHHHHHHHHHh--HhhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHH Confidence 6655543 345566778888888887771 222222334445443 12222344455666777889999999 Q ss_pred HHHHhhcc--------CCceEEeC-CCCCCCHHHHHHHHHHHHHHHHHHHhCC------CCCHHHHHHHHHhhcCcCCCC Q lcl|NC_019404. 334 LIPFIVNA--------EEWSVEFS-PLDHESSKDKAEVLEKSVNSIAALIAAG------AMDIKEARDTLRTIAPEIKIG 398 (418) Q Consensus 334 l~~~i~~~--------~~~~~~f~-pL~~~~eke~ae~~~~~a~a~~~~~~~g------~i~~~e~r~~l~~~~~~~~~~ 398 (418) .+.++.+. +.+.+++- +|.+ ..|+.-..+..+.++.+.+.+ .|+.+++.+.+.... |++ T Consensus 404 ~~~il~r~g~lP~~p~~~v~~~~vs~la~---l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~---Gvp 477 (535) T protein:vir:94 404 LLKQLQATNQIPELPKEAVEPTISTGMEA---LGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAI---GID 477 (535) T ss_pred HHHHHHhCCCCCCCChhhccceEeehHHH---HHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHh---CCC Confidence 99998653 22334433 3332 233322223333333333332 467777777665432 222 Q ss_pred hhhcccccccCCCccc------cccC Q lcl|NC_019404. 399 DNDIQTEESELITETE------VVIA 418 (418) Q Consensus 399 ~~~~~~~e~~~~~e~e------~~~~ 418 (418) ...+-..+.+...+.+ .+=+ T Consensus 478 ~~~i~rs~eev~~~~~q~~~~~~~~~ 503 (535) T protein:vir:94 478 TSGILKTPEEKQQEMAEAAQGTAMQN 503 (535) T ss_pred hhhhcCCHHHHHHHHHHHHHHHHHHH Confidence 1222111111111100 0000 No 260 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=86.85 E-value=0.043 Score=28.09 Aligned_cols=385 Identities=10% Similarity=0.075 Sum_probs=153.8 Q ss_pred CccchhhH-----HHHhcCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhh---ccCCccccCcc-------- Q lcl|NC_019404. 1 MVKTDSYA-----NIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA---LAAGFHIDGID-------- 64 (418) Q Consensus 1 ~~~~D~~~-----n~~~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~---~r~~~~i~~~~-------- 64 (418) .-+..-+. +.+..-++..+... .-.-+..+.. +-++.++ -..+ .+.||.+.-.+ T Consensus 30 e~~w~e~~~~tlP~~~~~~~~~~~~~~-~~dstg~~a~-----~~LAa~l----~~~ltpp~~~WF~l~~~d~~~~~l~~ 99 (515) T protein:vir:70 30 LDRAKHFAKLTLPYLMNNKGDNETSQN-GWQGVGAQAT-----NHLANKL----AQVLFPAQRSFFRVDLTAKGEKVLDD 99 (515) T ss_pred HHHHHHHHHHhcccccCCCCCcccccc-cccchHHHHH-----HHHHHHH----HHhhcCCCCcccccccChhhhhcccc Confidence 00000000 01111011111000 0011111111 1111111 1111 22355543100 Q ss_pred ---h-----------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccc-------cCCCceEEEEE- Q lcl|NC_019404. 65 ---D-----------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPV-------REGAELETVRV- 122 (418) Q Consensus 65 ---d-----------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl-------~~~~~i~~i~v- 122 (418) + ++.+.+.+.+.++...+.++++.--.+|.+++++.-+++- -.-|+ +..|.+..+.. T Consensus 100 ~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~~~-~~~pl~~y~v~~d~~G~v~~i~rr 178 (515) T protein:vir:70 100 RGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKGAM-SAVPMHHYVVNRDTNGDLMDVILL 178 (515) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEEeCCCCe-EEEEcCeEEEeeCCCcCeeEEEee Confidence 0 1224455667799999999999988899998887533321 12344 23454443211 Q ss_pred --eeccccccc----------cccccccccccCcceEEEEecCCcccc----cccCcccEEEecC-----cc-c-hhhhh Q lcl|NC_019404. 123 --YDRTQVKVQ----------NREENPRNARFGKPLTYRITTNESDMF----YDVHYSRIHIIDG-----ER-V-PNAMR 179 (418) Q Consensus 123 --~~~~~i~~~----------~~~~dp~s~~yg~p~~y~i~~~~~~~~----~~iH~SR~i~~~g-----~~-l-p~~~~ 179 (418) +..+++... ....+| +...+.|...-...... +.++-.++..-.| .| + +-+.+ T Consensus 179 ~~~t~~~l~~~f~~~~~~~~~~~~~~~----~~~v~i~~~v~~~~~~~~~~~~e~d~~~~~~es~y~~~e~P~~~~Rw~~ 254 (515) T protein:vir:70 179 QEKALRTFDPATRMAIEVGMKGKKCKE----DDNVKLYTHAQYAGEGFWKINQSADDIPVGKESRIKSEKLPFIPLTWKR 254 (515) T ss_pred eeccHHHHHHhhhhhhhhhhhhhhcCC----CCceEEEEEEEecCCCceEEEEecCceeeccccccccccCCceeeeeee Confidence 111222211 011112 22223332221110000 1111111111111 11 1 12345 Q ss_pred hccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEc Q lcl|NC_019404. 180 RQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDA 259 (418) Q Consensus 180 ~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~ 259 (418) ..+.-||.||.+. +++.++............+..+.-..+-.+. ++.. .. ........+.++-+ T Consensus 255 ~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~--------~g~~---~~----~~l~~~~~g~iv~g 318 (515) T protein:vir:70 255 SYGEDWGRPLAED-YSGDLFVIQFLSEAMARGAALMADIKYLIRP--------GSQT---DV----DHFVNSGTGEVITG 318 (515) T ss_pred cCCCCcccchHHH-hhHHHHHHHHHHHHHHHHHHHhcCCCeeeCc--------cccc---ch----hhccccCCceeecC Confidence 5566799999986 8899999998888888877666555544432 1100 00 01111232344556 Q ss_pred CCCceeEeecc----cCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch-----hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 260 ESEEYSVLNSD----IGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ-----NTALETFHKLIDRKRNAELLPI 330 (418) Q Consensus 260 ~~e~~~~~~~~----~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg-----e~d~~~y~~~I~~~Qe~~l~p~ 330 (418) ..+++..+... |..+...++.+.+.|..++=+- -|.-.....+.||- ++-....--.+.+.|.-.+.|+ T Consensus 319 ~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~--~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pl 396 (515) T protein:vir:70 319 VAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPI 396 (515) T ss_pred CcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhh--hhhccCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH Confidence 66677666543 4556677888888887766332 12222333454431 1111223333444555556666 Q ss_pred HHHHHHHhhcc---CCceEEeC-CCCCCCHHHHHHHHHHHHHHHHHHHhC-----CCCCHHHHHHHHHhhc-CcCCC--C Q lcl|NC_019404. 331 LEFLIPFIVNA---EEWSVEFS-PLDHESSKDKAEVLEKSVNSIAALIAA-----GAMDIKEARDTLRTIA-PEIKI--G 398 (418) Q Consensus 331 l~~l~~~i~~~---~~~~~~f~-pL~~~~eke~ae~~~~~a~a~~~~~~~-----g~i~~~e~r~~l~~~~-~~~~~--~ 398 (418) +.++...+... +...+.+- +|..+....+++-....++.+....+. -.|+.+++.+.+.... ...++ + T Consensus 397 i~r~~~~~~p~~P~~~v~~~~vs~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs 476 (515) T protein:vir:70 397 AMWGLQEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKS 476 (515) T ss_pred HHHHHHhhCCCCChhhcccceehhHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCccccCC Confidence 55543333211 12222221 222222222232222233333322222 2467777666654332 22222 2 Q ss_pred hhhccccccc-CCCccccccC Q lcl|NC_019404. 399 DNDIQTEESE-LITETEVVIA 418 (418) Q Consensus 399 ~~~~~~~e~~-~~~e~e~~~~ 418 (418) ++++.....+ ...+...+.| T Consensus 477 ~eev~~~r~q~~~~~~~~~~~ 497 (515) T protein:vir:70 477 EEEMQQEMAQQAQAQQEAMLN 497 (515) T ss_pred HHHHHHHHHHHHHHHHHHHHH Confidence 3332211100 0000011111 No 261 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=86.48 E-value=0.045 Score=27.95 Aligned_cols=382 Identities=12% Similarity=0.102 Sum_probs=162.2 Q ss_pred CccchhhHHHHh---cCCCCccccCccccCCHHHHHHHHHcCCccchhhhcchhhh-------ccCCccccCcch----- Q lcl|NC_019404. 1 MVKTDSYANIFL---GGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA-------LAAGFHIDGIDD----- 65 (418) Q Consensus 1 ~~~~D~~~n~~~---g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~-------~r~~~~i~~~~d----- 65 (418) .-+..-+....+ +........+..+..... .+.+.+.+++.|..+ .+.||.+.-.++ T Consensus 18 e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~--------dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~l~~~ 89 (522) T protein:vir:10 18 LDKAVECSELTLPYLIDDDISSRPNHKSLTVPW--------QSVGAKCCVTLAAKLMLAVLPPQTSFFKLQVRDDKLGEE 89 (522) T ss_pred HHHHHHHHHHhhhcccCCCCCCCcccccccccc--------cchHHHHHHHHHHHHHHhhcCCCCccccccCChHHHhhh Confidence 111111111110 000000000100000000 011122222222222 234666532110 Q ss_pred ----------------HHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCcccccc-------cCCCceEEEEE Q lcl|NC_019404. 66 ----------------EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPV-------REGAELETVRV 122 (418) Q Consensus 66 ----------------~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl-------~~~~~i~~i~v 122 (418) ++.+...+.+.+....+.++++.--.+|.+++++.- ++-. .-|+ +..|.+..+ T Consensus 90 ~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~-~~~~-~~pl~~y~v~~d~~G~vd~i-- 165 (522) T protein:vir:10 90 LDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGK-DGLK-TFPLTRYVINRDGDGNVLEI-- 165 (522) T ss_pred cChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcC-CCce-EEEcceEEEeeCCCCCeeEE-- Confidence 112445566778999999999999999999988753 2211 2344 234555432 Q ss_pred eeccccccccc------------cccccccccCcceEEEEe-cCCcccccccCcc----cEE------EecCcc-ch-hh Q lcl|NC_019404. 123 YDRTQVKVQNR------------EENPRNARFGKPLTYRIT-TNESDMFYDVHYS----RIH------IIDGER-VP-NA 177 (418) Q Consensus 123 ~~~~~i~~~~~------------~~dp~s~~yg~p~~y~i~-~~~~~~~~~iH~S----R~i------~~~g~~-lp-~~ 177 (418) +-+..++.... ..+...| +-..+.|+.. +........+|.+ .+. -|...| ++ -+ T Consensus 166 ~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~-~~~v~v~~~v~p~~~~~~~~~~~~~~~~~~~~~~s~~g~~~~P~~~~Rw 244 (522) T protein:vir:10 166 VTKELISRKVLDIELPEPKPNTGIDESSTT-NDDVTIYTYVKLDKSSGRWVWHQEAFDKIIPDSRSTAPKNASPWLPLRF 244 (522) T ss_pred EeeeeccHHHHHHhcchhccchhhhcccCC-CCceEEEEEEEeeccCCceEEEEccCCccccccccccccccCCceeeee Confidence 22222221110 0111111 1112222221 1100001111111 111 111111 11 23 Q ss_pred hhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEE Q lcl|NC_019404. 178 MRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGI 257 (418) Q Consensus 178 ~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~ 257 (418) .+..+.-||.||.+. +++.++............+.++.-..+-.+.-+.. + +.. ...+ .++.++ T Consensus 245 ~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~----~-------~~~---l~~~-~~~~~v 308 (522) T protein:vir:10 245 NTVDGEDYGRGRVEE-FLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTT----K-------PAT---IAKA-GNGAIV 308 (522) T ss_pred eecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhcCCceeecccccc----c-------ccc---ccCC-CCccee Confidence 344566799999986 88999999999999988888877666655421100 0 000 0112 223334 Q ss_pred EcCCCceeEeec----ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccch-----hHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 258 DAESEEYSVLNS----DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSSQ-----NTALETFHKLIDRKRNAELL 328 (418) Q Consensus 258 d~~~e~~~~~~~----~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~stg-----e~d~~~y~~~I~~~Qe~~l~ 328 (418) -+..+++..+.. +|..+...++.+.+.|..++= .+..+.+..+.+|- ++-....--.+.+.+...+. T Consensus 309 ~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl----~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~ 384 (522) T protein:vir:10 309 QGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFL----VMNVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLI 384 (522) T ss_pred cCCCccceeecccccccchHHHHHHHHHHHHHHHHHh----hccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHH Confidence 455566665553 345567777888888887751 22233345555431 11223333445667777899 Q ss_pred HHHHHHHHHhhccC-------C----ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH--h--CCCCCHHHHHHHHHhh-c Q lcl|NC_019404. 329 PILEFLIPFIVNAE-------E----WSVEFSPLDHESSKDKAEVLEKSVNSIAALI--A--AGAMDIKEARDTLRTI-A 392 (418) Q Consensus 329 p~l~~l~~~i~~~~-------~----~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~--~--~g~i~~~e~r~~l~~~-~ 392 (418) |.+++.+.++.+.. + ..+++-+ .+.-.++++.....++.+...+ . .-.|+.+++.+.+... + T Consensus 385 Pli~r~~~il~r~g~lP~~p~~~~~~~~v~~is--~Laraq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~G 462 (522) T protein:vir:10 385 PYLNRTLLVLQRSNQIPKLPKDIVRPTIVAGVN--ALGRGQDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQG 462 (522) T ss_pred HHHHHHHHHHHhcCCCCCCCccccccccccchh--HHHHHHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhC Confidence 99999999887642 1 1122221 1222233333233333332222 1 1247777777666432 2 Q ss_pred Cc-CCC--ChhhcccccccCCCccccccC Q lcl|NC_019404. 393 PE-IKI--GDNDIQTEESELITETEVVIA 418 (418) Q Consensus 393 ~~-~~~--~~~~~~~~e~~~~~e~e~~~~ 418 (418) .. ..+ +++++.... +...+....-+ T Consensus 463 vp~~~ivrt~eev~~~~-q~~q~~~~~~~ 490 (522) T protein:vir:10 463 IDVLNLVKTEQQLAEEQ-QAAQQQAAQQS 490 (522) T ss_pred CChhhhcCCHHHHHHHH-HHHHHHHHHHH Confidence 11 122 222221110 00000000000 No 262 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=79.93 E-value=0.1 Score=26.07 Aligned_cols=386 Identities=9% Similarity=0.017 Sum_probs=165.1 Q ss_pred CccchhhHHHHh-------cCCCC-ccccCccccCCHHHHHHHHHcCCccchhhhcchhhhccCCccccCcc-------- Q lcl|NC_019404. 1 MVKTDSYANIFL-------GGSDG-SEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGID-------- 64 (418) Q Consensus 1 ~~~~D~~~n~~~-------g~~~~-~~~~~~~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~~r~~~~i~~~~-------- 64 (418) .-+..-+....+ +.+.. +......-.-+..... +-++.++.-..- =..+.||.+.-.+ T Consensus 18 e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~-----~~LAa~l~~~lt-pp~~~WF~l~~~d~~~~~~~~ 91 (514) T protein:vir:80 18 IRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLV-----NNLTAKLALTLF-PPGRPSFQIELDDTLQELAAA 91 (514) T ss_pred HHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHH-----HHHHHHHHhhhc-CCCCcccccccCchhhhhccc Confidence 111111111111 00000 0000000011111111 111111111100 0112355543111 Q ss_pred --------------hHHHHHHHHHHhCchHHHHHHHHhccccceEEEEEeecCCCccccccc-------CCCceEEEEEe Q lcl|NC_019404. 65 --------------DEPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVR-------EGAELETVRVY 123 (418) Q Consensus 65 --------------d~~~i~~~~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l~~pl~-------~~~~i~~i~v~ 123 (418) -++.+.+.+.+-++...+.++++.--.+|.+.+++.-+.+.--.-|+. ..|.+..+ + T Consensus 92 ~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~pl~~y~v~~d~~G~v~~i--~ 169 (514) T protein:vir:80 92 NGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPGTGKMLVWTMQSYTVRRTSHGDPAVV--V 169 (514) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecCCCcEEEEEcCeEEEeeCCCcCeEEE--E Confidence 012344556677899999999999889999988874321211223442 33444322 1 Q ss_pred eccccccccccccc--------ccc-ccCcceEEEEe---cCCcccccccCc----ccEEEecCc---cch----hhhhh Q lcl|NC_019404. 124 DRTQVKVQNREENP--------RNA-RFGKPLTYRIT---TNESDMFYDVHY----SRIHIIDGE---RVP----NAMRR 180 (418) Q Consensus 124 ~~~~i~~~~~~~dp--------~s~-~yg~p~~y~i~---~~~~~~~~~iH~----SR~i~~~g~---~lp----~~~~~ 180 (418) -+..+++.....+. ..+ .+.+.+.|... +.......-+|. .++..-.|. ..| -+.+. T Consensus 170 rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~g~~i~~es~y~~~e~P~i~~Rw~~~ 249 (514) T protein:vir:80 170 LRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKRCAVWHELEGKRVGPESSYPAHLCPYVPVAWNVP 249 (514) T ss_pred eeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecCCCCeEEEEEEeccceeecccCccccccCCeeeeeeEec Confidence 12222221111100 000 01111222211 110000011221 222111111 112 23455 Q ss_pred ccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcC Q lcl|NC_019404. 181 QNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAE 260 (418) Q Consensus 181 ~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~ 260 (418) .+.-||.||.+. +++.++...............+.-..+..+. .+. .... ......++.++-+. T Consensus 250 ~ge~YGrgp~~~-al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~--------~g~---~~~~----~l~~~~~g~~v~g~ 313 (514) T protein:vir:80 250 DGEHYGRGYVEE-YSGDFARLSILSERLGLYEFEALSLLNLVDE--------AKG---GAVD----DYRDAETGDFVPGQ 313 (514) T ss_pred CCCCcccchHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCceeCc--------ccc---cchh----hhcccCCceeecCC Confidence 566799999986 8899999998888888777665554444432 110 0000 01111223344555 Q ss_pred CCceeEeec----ccCCHHHHHHHHHHHHhhhhcCCeeeeec--cCccccccch-----hHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 261 SEEYSVLNS----DIGGIDAFLDKKFDRIVALSGIHEIILKN--KNVGGLSSSQ-----NTALETFHKLIDRKRNAELLP 329 (418) Q Consensus 261 ~e~~~~~~~----~~~gl~~~~~~~~~~iaaas~IP~t~L~G--~s~~gl~stg-----e~d~~~y~~~I~~~Qe~~l~p 329 (418) .+++..++. +|.-+...++.+.+.|.-++ .|+. +....+.+|- ++-....--.+.+.|.-.+.| T Consensus 314 ~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF-----ml~~~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~P 388 (514) T protein:vir:80 314 VGSVASYERGDYNKIAQASASVESIVMRLNRAF-----MYTGQVRDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAP 388 (514) T ss_pred CccceeeecCcccchHHHHHHHHHHHHHHHHHH-----hhhccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHH Confidence 566666654 35555677888888887664 2322 2233354431 111123333455677778899 Q ss_pred HHHHHHHHhhcc----------CCceEEeC-CCCCCCHHHHHHHHHHHHHHHHHHHhCC-----CCCHHHHHHHHHhhcC Q lcl|NC_019404. 330 ILEFLIPFIVNA----------EEWSVEFS-PLDHESSKDKAEVLEKSVNSIAALIAAG-----AMDIKEARDTLRTIAP 393 (418) Q Consensus 330 ~l~~l~~~i~~~----------~~~~~~f~-pL~~~~eke~ae~~~~~a~a~~~~~~~g-----~i~~~e~r~~l~~~~~ 393 (418) .+++.+.++.+. +-..+++- +|.++.-...++....-++.++.+.+.. .|+.+++.+.+.... T Consensus 389 li~r~~~il~r~~~g~lP~~p~~l~~~~~vs~la~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~- 467 (514) T protein:vir:80 389 LAYLTMYEASRGNGGMLLGIAQGVYRPSIITGIPALTRNIETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANN- 467 (514) T ss_pred HHHHHHHHHhhhccCCCCCCCchhhcceeeecHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHh- Confidence 999988877531 12334432 3444444444444445555555555432 377888887764322 Q ss_pred cCCCChhhccccccc--CCCccc-----cccC Q lcl|NC_019404. 394 EIKIGDNDIQTEESE--LITETE-----VVIA 418 (418) Q Consensus 394 ~~~~~~~~~~~~e~~--~~~e~e-----~~~~ 418 (418) |++...+-..++. ...+.+ .+.+ T Consensus 468 --Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~ 497 (514) T protein:vir:80 468 --SVDLSTLSKDPDVVAAEAEQEAALAQQQLD 497 (514) T ss_pred --CCCHhhccCCHHHHHHHHHHHHHHHHHHHH Confidence 2222222111111 111111 1111 No 263 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=75.73 E-value=0.14 Score=25.21 Aligned_cols=380 Identities=11% Similarity=0.095 Sum_probs=161.6 Q ss_pred CccchhhHHHHhcCCCCccccCc-cccCCHHHHHHHHHcCCccchhhhcchhhh-------ccCCccccCcchH------ Q lcl|NC_019404. 1 MVKTDSYANIFLGGSDGSEIYGS-LQNQAPTILASLYADNALVRRIIDTIPETA-------LAAGFHIDGIDDE------ 66 (418) Q Consensus 1 ~~~~D~~~n~~~g~~~~~~~~~~-~~~~~~~~l~~~Y~~~~~~r~iVd~~a~d~-------~r~~~~i~~~~d~------ 66 (418) +=+--.+.+...+.+..+..... .-.-+.. +.+++.|..+ .+.||.+.-.++. T Consensus 38 lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~-------------~a~~~LAs~l~~~ltpp~~~wF~l~~~~~~~~e~~~ 104 (549) T protein:vir:10 38 MPRLDKFGQLPRPDSEKGRERSQKMFDSTAP-------------LALRNFVAAMDSMITPATQLWHRLKTGNDALNEIAS 104 (549) T ss_pred ccccccccccCCCCCCcccccccccccchHH-------------HHHHHHHHHHHhhccCCCCccccccCCccchhhhhH Confidence 11111111111111111111111 1111111 2222222221 2346666432210 Q ss_pred -H--------HHHHHH--HHhCchHHHHHHHHhccccceEEEEEeecCCCcc---ccccc-------CCCceEEE-EEe- Q lcl|NC_019404. 67 -P--------AFWSRW--DDLEMTQNINDAWSWARLFGGAAIVAIVKDNRAL---TSPVR-------EGAELETV-RVY- 123 (418) Q Consensus 67 -~--------~i~~~~--~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~~~l---~~pl~-------~~~~i~~i-~v~- 123 (418) + .+...+ .+-++...+.++++.--.||.+.+++.-+.++.+ ..|+. ..|.+..+ +-+ T Consensus 105 v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~ 184 (549) T protein:vir:10 105 VKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGKGIVYRNVPMQRLWFAENNSGLIDKTHVQWE 184 (549) T ss_pred HHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCCCeeEEEEEEcCeEEEeeCCCCCeEEEEEEee Confidence 1 111211 2457788899999888889999999865433321 23442 34555332 111 Q ss_pred -eccc---------ccccc---ccccccccccCcceEEEEe-cCCcc--------------------cccccCcccEEEe Q lcl|NC_019404. 124 -DRTQ---------VKVQN---REENPRNARFGKPLTYRIT-TNESD--------------------MFYDVHYSRIHII 169 (418) Q Consensus 124 -~~~~---------i~~~~---~~~dp~s~~yg~p~~y~i~-~~~~~--------------------~~~~iH~SR~i~~ 169 (418) ..++ ++... ...+| +-.-+.|+.. +.... ...-++.|.. T Consensus 185 ~t~~ql~~~fg~~~l~~~v~~~~~~~~----~~~~~v~~~V~pr~~~~~~~~~~~~~pf~sv~~e~~~~~il~esg~--- 257 (549) T protein:vir:10 185 LTLRQAAQRFGRENLSPSMQSTLEKDP----EKSAIFYHAVEPRADRDPRKLDGRNMQFASYWLDEGRDRIVQNSGF--- 257 (549) T ss_pred cCHHHHHHhcCcccCCHHHHHHhhcCC----CceEEEEEEeecCCCCCccccccccCceEEEEEEecCCEeeccCCc--- Confidence 1111 11111 11122 1222223221 11000 0011222221 Q ss_pred cCcc--chhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHH Q lcl|NC_019404. 170 DGER--VPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDN 247 (418) Q Consensus 170 ~g~~--lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~ 247 (418) ...| ++-+.+..+.-||.||.+. +++.++............+..+.-..+-.+..+.. +. ... T Consensus 258 ~e~P~~~~Rw~~~~ge~YGrgp~~~-~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~--~~---------~~l--- 322 (549) T protein:vir:10 258 RTFPFAIGRFYVGTDDVYGGSPAYD-AMPDVRMANDMAKTNIRGAQKLVDPPLLANEDGVL--DG---------FDL--- 322 (549) T ss_pred ccCCcceeeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeecccccc--cc---------cee--- Confidence 1111 1223455666899999986 78999999999999998888877666665421111 00 000 Q ss_pred hcCCcceeE-EEcCCCceeEeec--ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccccccc-----hhHHHHHHHHHH Q lcl|NC_019404. 248 NSGVGQAIG-IDAESEEYSVLNS--DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGLSSS-----QNTALETFHKLI 319 (418) Q Consensus 248 ~~~~~~~~~-~d~~~e~~~~~~~--~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~gl~st-----ge~d~~~y~~~I 319 (418) ..+-.+-.. -.+....+.+++. +|.-....++.+.+.|..++=.....+. ..+..+.+| .++-....--.. T Consensus 323 ~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~-~~~~~~TAtEV~~r~~E~~~~LGpv~ 401 (549) T protein:vir:10 323 RSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQIL-VDSGDMTATEVLQRAQEKGVLLAPTL 401 (549) T ss_pred ccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhh-cCCCCccHHHHHHHHHHHHHHhhHHH Confidence 011111111 1223334554433 4455667788888888887744321111 234455554 122223334445 Q ss_pred HHHHHHHHHHHHHHHHHHhhccC--------------CceEEeCCCCCCCHHHHHHHH---HHHHHHHHHHHhCC----- Q lcl|NC_019404. 320 DRKRNAELLPILEFLIPFIVNAE--------------EWSVEFSPLDHESSKDKAEVL---EKSVNSIAALIAAG----- 377 (418) Q Consensus 320 ~~~Qe~~l~p~l~~l~~~i~~~~--------------~~~~~f~pL~~~~eke~ae~~---~~~a~a~~~~~~~g----- 377 (418) .+.+.-.+.|.+++.+.++.... ++.++|-+ .+....+++-. ....+.+..+.+.+ T Consensus 402 ~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis--~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld 479 (549) T protein:vir:10 402 GRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDS--PLNKAMRAGEGAAILQWLQQLGIVSQFDPAAAK 479 (549) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeec--HHHHHHHHHHHHHHHHHHHHHHHHhccChhHHh Confidence 56666678999999998886521 24455433 22222232222 23333333443433 Q ss_pred CCCHHHHHHHHHhh-cCcCCC--Chhhcccccc--cCCCccccccC Q lcl|NC_019404. 378 AMDIKEARDTLRTI-APEIKI--GDNDIQTEES--ELITETEVVIA 418 (418) Q Consensus 378 ~i~~~e~r~~l~~~-~~~~~~--~~~~~~~~e~--~~~~e~e~~~~ 418 (418) .|+.+++.+.+... +...++ +++++..-.. .--...+.+-| T Consensus 480 ~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~~~~ 525 (549) T protein:vir:10 480 VPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQMLA 525 (549) T ss_pred cCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHHHHH Confidence 36777777666432 221111 1222110000 00000000111 No 264 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=74.19 E-value=0.16 Score=24.93 Aligned_cols=386 Identities=9% Similarity=0.041 Sum_probs=152.9 Q ss_pred CccchhhHHHH---------------------------------------hcCCCCccccCccccCCHHHHHHHHHcCCc Q lcl|NC_019404. 1 MVKTDSYANIF---------------------------------------LGGSDGSEIYGSLQNQAPTILASLYADNAL 41 (418) Q Consensus 1 ~~~~D~~~n~~---------------------------------------~g~~~~~~~~~~~~~~~~~~l~~~Y~~~~~ 41 (418) |.-+|.+...+ .+........|...-.+ +. T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~~-----------~~ 83 (651) T protein:vir:80 15 YDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKITT-----------GK 83 (651) T ss_pred hhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCccccC-----------hh Confidence 22222211110 10000111112222111 12 Q ss_pred cchhhhcchhhhc------cCCccccCc--ch-HH----HHHHH----HHHhCchHHHHHHHHhccccceEEEEEeecCC Q lcl|NC_019404. 42 VRRIIDTIPETAL------AAGFHIDGI--DD-EP----AFWSR----WDDLEMTQNINDAWSWARLFGGAAIVAIVKDN 104 (418) Q Consensus 42 ~r~iVd~~a~d~~------r~~~~i~~~--~d-~~----~i~~~----~~~l~~~~~~~~a~~~~rl~G~~~i~i~~~d~ 104 (418) ++..|+.....++ .+|+.+..- .+ .. +++.. +.+-+....+.+++....++|.|++-+.-+.. T Consensus 84 v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~ 163 (651) T protein:vir:80 84 AFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRVE 163 (651) T ss_pred HHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecce Confidence 2222222111111 224555322 11 11 12222 23455667777888889999988874322100 Q ss_pred ----------------Cccccccc-----CCCceEEEEEeeccccccccccccccc------------------------ Q lcl|NC_019404. 105 ----------------RALTSPVR-----EGAELETVRVYDRTQVKVQNREENPRN------------------------ 139 (418) Q Consensus 105 ----------------~~l~~pl~-----~~~~i~~i~v~~~~~i~~~~~~~dp~s------------------------ 139 (418) ......+. ..+.+ .+..++++.+-+.....++.. T Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~-~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~~l~~l~~~g~~~ 242 (651) T protein:vir:80 164 TAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSP-DFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKADILNLLSEGYYY 242 (651) T ss_pred eeeeehheeccccccccccceeeeccceeeecee-EEEEecHHHeeecCCCcCccccceeeeeeeeHHHHHHHHhccccc Confidence 00000000 00000 111222211110000000000 Q ss_pred ---------------ccc----------------C---c---ceEEE-EecCCcccccccC----cccEEEecCccc--- Q lcl|NC_019404. 140 ---------------ARF----------------G---K---PLTYR-ITTNESDMFYDVH----YSRIHIIDGERV--- 174 (418) Q Consensus 140 ---------------~~y----------------g---~---p~~y~-i~~~~~~~~~~iH----~SR~i~~~g~~l--- 174 (418) +++ . + -++|. +...+. ....+| -..|++....+. T Consensus 243 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~-~~~~~~v~~~g~~il~~~~~~~~~~ 321 (651) T protein:vir:80 243 GVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENK-TYHDVVVTIMGNEVLRFEQNPYWCG 321 (651) T ss_pred chhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCC-ceEEEEEEEcCcEEecccccCCCCC Confidence 000 0 0 01111 011100 000011 112333222221 Q ss_pred -hh----hhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHcCCceeecchHHHhhcCcchHHHHHHHHHHHHHhc Q lcl|NC_019404. 175 -PN----AMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNS 249 (418) Q Consensus 175 -p~----~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~ 249 (418) |+ ..+.....||.|+++. +.+..+............+..++-..+....-+ +. .. .. + . T Consensus 322 ~Pf~~~~~~~~~~~~yG~g~~~~-~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~-~~---~~-~~----l------~ 385 (651) T protein:vir:80 322 RPFVIGTYIPTARQPYAMGALQP-NLGMLHELNIITNQRLDNLELAIDQMYTLRSDG-LL---QP-ED----V------Y 385 (651) T ss_pred CCeeeecceecCccccCCChHHH-HhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCc-cc---cH-HH----h------h Confidence 22 2333446799999975 889999999999999999988888887765211 11 11 00 1 1 Q ss_pred CCcceeEEEcCCCceeEeec---ccCCHHHHHHHHHHHHhhhhcCCeeeeeccCccc---cccch-----hHHHHHHHHH Q lcl|NC_019404. 250 GVGQAIGIDAESEEYSVLNS---DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGG---LSSSQ-----NTALETFHKL 318 (418) Q Consensus 250 ~~~~~~~~d~~~e~~~~~~~---~~~gl~~~~~~~~~~iaaas~IP~t~L~G~s~~g---l~stg-----e~d~~~y~~~ 318 (418) ...++++..+..+++..+.. ++.+...+++.+...+.-.++||.. ..|..+.+ .+||+ +.-....-.. T Consensus 386 ~~pg~vi~~~~~~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~-~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v 464 (651) T protein:vir:80 386 TEPGKVFLVSDHGDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNY-VGANAARSGERVTAAEVAAVREAGGNRLSGI 464 (651) T ss_pred cCCCceEEecCCCCceeeccCcccchhHHHHHHHHHHHHHHHhcCChH-HhCCCccchhhccHHHHHHHHHHHHHHHHHH Confidence 22344555555566766654 3445667788888899999999863 44544333 23333 1112233334 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccC------------------------CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019404. 319 IDRKRNAELLPILEFLIPFIVNAE------------------------EWSVEFSPLDHESSKDKAEVLEKSVNSIAALI 374 (418) Q Consensus 319 I~~~Qe~~l~p~l~~l~~~i~~~~------------------------~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~ 374 (418) ++.+++..++|++++++.+++... +++.++. +-.....++.+.....++ +..++ T Consensus 465 ~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~-iv~~g~~~~~~r~~~~~~-l~~~~ 542 (651) T protein:vir:80 465 HKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVR-LVPIGSDHVIERKQYIED-RLTFI 542 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeee-eeeccHHHHHHHHHHHHH-HHHHH Confidence 445555678999999999887531 1222221 112233333333223333 33333 Q ss_pred hCCCCCHH-----HHHHHHHhhcCcCCCC--hhhcccccccCCCc-cccccC Q lcl|NC_019404. 375 AAGAMDIK-----EARDTLRTIAPEIKIG--DNDIQTEESELITE-TEVVIA 418 (418) Q Consensus 375 ~~g~i~~~-----e~r~~l~~~~~~~~~~--~~~~~~~e~~~~~e-~e~~~~ 418 (418) +.....+. .....+....+-.|+. +.-+...+...... .+.... T Consensus 543 q~~~~~p~~~~~~~~~~~~~~l~~~~g~~~~~~~l~~~~q~~~~~~~~~~~~ 594 (651) T protein:vir:80 543 QAVAQVPEMGQLVDYKRILVDLLQHWGFEEPEAYLKQQDQQAPANPQEALLS 594 (651) T ss_pred HhhccCCccchhhhHHHHHHHHHHHcCCCCcHHhcCCCccchhhhhhHHHHh Confidence 33322221 1111121111112221 11122222211111 111111 No 265 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=34.98 E-value=1.3 Score=20.04 Aligned_cols=249 Identities=18% Similarity=0.201 Sum_probs=97.6 Q ss_pred ccccceEEEEEeecCCCcccccccCCCceEEEEEeeccc-cc-------ccccc-cccccc-----cc----CcceEEEE Q lcl|NC_019404. 89 ARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQ-VK-------VQNRE-ENPRNA-----RF----GKPLTYRI 150 (418) Q Consensus 89 ~rl~G~~~i~i~~~d~~~l~~pl~~~~~i~~i~v~~~~~-i~-------~~~~~-~dp~s~-----~y----g~p~~y~i 150 (418) -.+|.-+ +- -.+-++..|+|-||.. +- +++.+ .|.... -| .--+.|++ T Consensus 1 ~~~~~~~------~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (279) T protein:vir:40 1 MSLFNLS------RR--------AEDVSFSTFTVQDPTTDLLLGKLLGLVSYFDNVDYSEASKLEDLFYWALQGKEVYRV 66 (279) T ss_pred Ccccccc------hh--------hcccceeeeeecCcchhHHHHHHHHHHHHhhcccchhhhhhhhhhhhhhccceeehh Confidence 0000000 00 0112344444444321 10 00100 010000 00 01122322 Q ss_pred ecCCcc-cccccCcccEEEec--Cc------cchhhhhhccccCCcchHHHHHHHHHHHHHHHHHHHHHHHHHc-CCc-e Q lcl|NC_019404. 151 TTNESD-MFYDVHYSRIHIID--GE------RVPNAMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLRRK-QQA-V 219 (418) Q Consensus 151 ~~~~~~-~~~~iH~SR~i~~~--g~------~lp~~~~~~~~~~G~S~l~~~~~~~l~~~~~~~~~~~~l~~~~-~~~-v 219 (418) --++-. -.++|+.+..-+.. |. |..+.....+..+|.-+= + . ..+. +-+..+++.=|+-. +++ + T Consensus 67 ~~~~~~~~~~~~~~d~fn~~vr~~~~~~vtVP~~Dv~IieNPlv~v~~e-e-~-~kM~--~la~nai~~KLD~~~qIk~f 141 (279) T protein:vir:40 67 WYGGFKYYAQRVNADQFNIVVREPNRREVTIRTNDYEMLLNPFYGANPQ-R-F-GVMF--GMASNGIGRRLDSQAQIKIY 141 (279) T ss_pred hhhhHHHHHhhcCcchhhhheecCCcceeEeecchhhhhhcchheeccc-h-h-hHHH--HHHHhhhhhhhcccceeeeE Confidence 111000 01223332211110 11 111112222222221110 0 0 0000 00111121111111 222 4 Q ss_pred eecchHHHhhcCcchHHHHHHHHHHHHHhcCCcceeEEEcCCCceeEeecccCC-HHHHHHHHHHHHhhhhcCCeeeeec Q lcl|NC_019404. 220 WKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG-IDAFLDKKFDRIVALSGIHEIILKN 298 (418) Q Consensus 220 ~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~g-l~~~~~~~~~~iaaas~IP~t~L~G 298 (418) +|++.-+-+ .+..++.+.|+..+...-...+++...+.+|++.+++-+.++ +.+=++.+..++....+||..+|.| T Consensus 142 IKTd~d~gl---ee~kekaR~rIk~mlalAk~~nGityid~~ddItQL~kDYStslk~die~lkS~l~Sq~GinekIL~G 218 (279) T protein:vir:40 142 WKTKVSSGL---KEVWDRIRERLTQQQQLAREFNGVSVIGSDDDIKQIQPDYSGSLQNDANLAIEIALSEYGMPRELLYG 218 (279) T ss_pred EecCcchhH---HHHHHHHHHHHHHHHHHHHhcCCeeeecCCceeEeeccccccccHHHHHHHHHHHHhhcCCchhhccc Confidence 566632211 233445666666665554444566666677999999998874 6677889999999999999999998 Q ss_pred cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019404. 299 KNVGGLSSSQNTALETFHKLIDRKRNAELLPILEFLIPFIVNAEEWSVEFSPLDHESSKDKAEVLEKSVNSIAALIAAGA 378 (418) Q Consensus 299 ~s~~gl~stge~d~~~y~~~I~~~Qe~~l~p~l~~l~~~i~~~~~~~~~f~pL~~~~eke~ae~~~~~a~a~~~~~~~g~ 378 (418) + + .|+.+.+||.+ .+.|+|....+-|+.++++-+.| .+-+.+ .|+ T Consensus 219 s------A-tE~q~iAyy~r-------tVePILkQyek~liY~~E~fv~y---~ttta~------------------gg~ 263 (279) T protein:vir:40 219 Q------S-NEVTIIAFAIQ-------KVLPLLKQHDKNIIFNQENFVAY---ISTTAK------------------GGA 263 (279) T ss_pred c------C-chhhhhhHHHh-------hHHHHHHHhcccccchhhhhhhh---heeccc------------------Ccc Confidence 3 3 35677888875 47888888766555444442221 111100 111 Q ss_pred CCHHHHHHHHHhhcCc Q lcl|NC_019404. 379 MDIKEARDTLRTIAPE 394 (418) Q Consensus 379 i~~~e~r~~l~~~~~~ 394 (418) |.......-....+.. T Consensus 264 ~~s~~~~~~~~~~~~~ 279 (279) T protein:vir:40 264 IESKSSKRDSEPVGND 279 (279) T ss_pred cccccccccCCCCCCC Confidence 1110000000000000 Done!