Query lcl|NC_011057.1_cdsid_YP_002014621.1 [gene=PHAEDRUS_10] [protein=gp10] [protein_id=YP_002014621.1] [location=6231..8135]
Match_columns 634
No_of_seqs 22 out of 25
Neff 3.3
Searched_HMMs 1612
Date Thu Nov 7 13:54:58 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_10 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_10_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:102426 Length: 631 100.0 0E+00 0E+00 1865.0 54.3 631 1-634 1-631 (631)
2 protein:vir:107517 Length: 639 100.0 5E-319 3E-322 1765.7 52.2 628 1-634 1-638 (639)
3 protein:vir:97900 Length: 639 100.0 5E-319 3E-322 1765.7 52.2 628 1-634 1-638 (639)
4 protein:vir:8654 Length: 629 # 100.0 7E-316 4E-319 1748.5 51.9 624 2-634 1-629 (629)
5 protein:vir:99088 Length: 629 100.0 9E-316 5E-319 1747.9 52.1 623 2-634 1-629 (629)
6 protein:vir:106027 Length: 629 100.0 1E-313 8E-317 1735.8 50.3 621 2-634 1-628 (629)
7 protein:vir:106491 Length: 646 100.0 9E-275 5E-278 1523.1 46.1 587 1-634 1-628 (646)
8 protein:vir:483 Length: 413 # 99.8 3.5E-19 2.2E-22 121.8 25.2 405 1-526 1-413 (413)
9 protein:vir:101647 Length: 460 99.8 1.6E-18 9.7E-22 118.2 25.5 416 1-528 1-460 (460)
10 protein:vir:102727 Length: 945 99.7 4E-17 2.5E-20 110.5 32.8 507 1-634 56-595 (945)
11 protein:vir:1326 Length: 457 # 99.7 7.7E-19 4.8E-22 119.9 23.3 438 1-526 1-457 (457)
12 protein:vir:101648 Length: 518 99.7 2.7E-17 1.7E-20 111.4 30.6 467 23-592 1-518 (518)
13 protein:vir:4454 Length: 414 # 99.7 3.9E-18 2.4E-21 116.0 25.4 408 1-526 1-414 (414)
14 protein:vir:7853 Length: 518 # 99.7 2.3E-17 1.4E-20 111.8 28.4 470 23-592 1-518 (518)
15 protein:vir:3868 Length: 417 # 99.7 1.5E-17 9.1E-21 112.9 25.8 412 7-527 1-417 (417)
16 protein:vir:93610 Length: 454 99.7 7.5E-17 4.6E-20 109.0 28.8 444 6-551 1-454 (454)
17 protein:vir:102080 Length: 429 99.7 5.4E-17 3.4E-20 109.8 27.2 416 1-517 1-429 (429)
18 protein:vir:100150 Length: 437 99.7 8.9E-17 5.5E-20 108.6 25.2 415 1-528 1-437 (437)
19 protein:vir:4337 Length: 434 # 99.7 2.5E-16 1.6E-19 106.1 27.4 409 9-522 1-434 (434)
20 protein:vir:100249 Length: 431 99.7 6.8E-17 4.2E-20 109.2 24.0 403 1-516 1-431 (431)
21 protein:vir:8418 Length: 409 # 99.7 3E-16 1.8E-19 105.7 27.4 399 1-528 1-409 (409)
22 protein:vir:81072 Length: 432 99.7 5.4E-17 3.4E-20 109.8 23.0 407 4-511 1-432 (432)
23 protein:vir:1380 Length: 422 # 99.7 1.9E-16 1.2E-19 106.8 25.7 409 1-518 1-422 (422)
24 protein:vir:105064 Length: 421 99.7 1E-15 6.2E-19 102.8 28.6 415 1-526 1-421 (421)
25 protein:vir:1266 Length: 416 # 99.6 8.7E-16 5.4E-19 103.1 28.2 408 1-524 1-416 (416)
26 protein:vir:5737 Length: 419 # 99.6 1.6E-16 9.9E-20 107.2 24.1 416 7-535 1-419 (419)
27 protein:vir:80333 Length: 419 99.6 3.6E-16 2.2E-19 105.3 25.4 414 1-550 1-419 (419)
28 protein:vir:6240 Length: 457 # 99.6 4.4E-16 2.7E-19 104.8 25.8 436 1-531 1-457 (457)
29 protein:vir:10362 Length: 432 99.6 3.4E-16 2.1E-19 105.4 24.8 416 1-511 7-432 (432)
30 protein:vir:105002 Length: 432 99.6 2.7E-16 1.7E-19 105.9 24.1 416 1-517 1-432 (432)
31 protein:vir:107605 Length: 432 99.6 2.7E-16 1.7E-19 105.9 24.1 416 1-517 1-432 (432)
32 protein:vir:102855 Length: 432 99.6 2.7E-16 1.7E-19 105.9 24.1 416 1-517 1-432 (432)
33 protein:vir:95378 Length: 406 99.6 2.8E-16 1.7E-19 105.9 23.9 402 1-523 1-406 (406)
34 protein:vir:4509 Length: 424 # 99.6 8.3E-16 5.2E-19 103.3 26.2 412 1-521 1-424 (424)
35 protein:vir:960 Length: 413 # 99.6 6.6E-16 4.1E-19 103.8 25.2 401 1-497 1-413 (413)
36 protein:vir:81152 Length: 411 99.6 4.7E-16 2.9E-19 104.6 24.2 404 1-515 1-411 (411)
37 protein:vir:81218 Length: 423 99.6 1.8E-15 1.1E-18 101.5 27.3 411 1-522 1-423 (423)
38 protein:vir:2683 Length: 412 # 99.6 1.1E-15 7E-19 102.5 25.4 405 1-510 1-412 (412)
39 protein:vir:9702 Length: 406 # 99.6 1.9E-15 1.2E-18 101.3 26.2 401 1-522 1-406 (406)
40 protein:vir:100882 Length: 383 99.6 2.6E-16 1.6E-19 106.0 20.4 372 1-506 1-383 (383)
41 protein:vir:104259 Length: 403 99.6 4.6E-15 2.9E-18 99.2 27.2 394 1-518 1-403 (403)
42 protein:vir:94666 Length: 723 99.6 2.8E-14 1.7E-17 94.9 31.1 507 31-634 1-570 (723)
43 protein:vir:4194 Length: 540 # 99.6 2.3E-14 1.4E-17 95.4 30.1 495 1-630 1-540 (540)
44 protein:vir:8317 Length: 409 # 99.6 4.2E-16 2.6E-19 104.9 20.0 363 1-461 10-409 (409)
45 protein:vir:1884 Length: 424 # 99.6 1.5E-15 9E-19 101.9 22.3 411 1-513 1-424 (424)
46 protein:vir:93943 Length: 409 99.6 5.7E-15 3.5E-18 98.7 25.1 402 1-510 1-409 (409)
47 protein:vir:1431 Length: 419 # 99.6 1.5E-14 9.5E-18 96.3 27.4 411 1-534 1-419 (419)
48 protein:vir:94426 Length: 409 99.6 4.8E-15 3E-18 99.1 24.5 402 1-510 1-409 (409)
49 protein:vir:97060 Length: 432 99.6 1.4E-14 8.5E-18 96.6 26.7 416 1-515 7-432 (432)
50 protein:vir:79772 Length: 648 99.6 1.2E-13 7.6E-17 91.4 31.7 541 1-634 28-624 (648)
51 protein:vir:189 Length: 424 # 99.6 1E-14 6.3E-18 97.3 25.6 407 1-513 1-424 (424)
52 protein:vir:9359 Length: 348 # 99.5 9.9E-15 6.1E-18 97.4 24.6 338 75-510 1-348 (348)
53 protein:vir:3843 Length: 397 # 99.5 4.4E-14 2.7E-17 93.8 27.6 388 7-521 1-397 (397)
54 protein:vir:3153 Length: 467 # 99.5 1.4E-14 8.5E-18 96.6 24.6 406 54-531 1-467 (467)
55 protein:vir:80134 Length: 403 99.5 4E-14 2.5E-17 94.1 26.5 397 1-523 1-403 (403)
56 protein:vir:99312 Length: 563 99.5 5.8E-14 3.6E-17 93.2 27.1 463 1-566 25-563 (563)
57 protein:vir:95599 Length: 563 99.5 5.8E-14 3.6E-17 93.2 27.1 463 1-566 25-563 (563)
58 protein:vir:4156 Length: 542 # 99.5 1.6E-13 1E-16 90.7 28.8 480 1-597 1-542 (542)
59 protein:vir:96579 Length: 576 99.5 2.1E-14 1.3E-17 95.6 23.5 477 1-570 27-576 (576)
60 protein:vir:102118 Length: 409 99.5 5.3E-14 3.3E-17 93.4 25.3 396 8-519 1-409 (409)
61 protein:vir:80644 Length: 551 99.5 3.9E-14 2.4E-17 94.1 24.4 475 1-577 5-551 (551)
62 protein:vir:96980 Length: 409 99.5 6.1E-14 3.8E-17 93.0 23.9 403 1-524 1-409 (409)
63 protein:vir:6210 Length: 394 # 99.5 1.9E-13 1.2E-16 90.3 26.0 385 1-522 1-394 (394)
64 protein:vir:100187 Length: 385 99.5 4.2E-14 2.6E-17 93.9 21.7 374 1-508 1-385 (385)
65 protein:vir:80796 Length: 574 99.4 2.5E-12 1.6E-15 84.2 30.6 481 1-562 41-574 (574)
66 protein:vir:63755 Length: 547 99.4 2.9E-13 1.8E-16 89.3 24.6 479 1-566 1-547 (547)
67 protein:vir:4854 Length: 386 # 99.4 2.6E-13 1.6E-16 89.6 23.4 375 7-510 1-386 (386)
68 protein:vir:8100 Length: 466 # 99.4 6E-13 3.7E-16 87.6 24.4 424 7-518 1-466 (466)
69 protein:vir:78641 Length: 278 99.4 3.4E-13 2.1E-16 89.0 20.9 274 75-417 1-278 (278)
70 protein:vir:100691 Length: 535 99.4 1.4E-11 8.9E-15 80.0 28.8 462 1-536 1-535 (535)
71 protein:vir:81095 Length: 416 99.3 7.2E-12 4.4E-15 81.7 26.6 405 7-521 1-416 (416)
72 protein:vir:4598 Length: 416 # 99.3 7.2E-12 4.4E-15 81.7 26.6 405 7-521 1-416 (416)
73 protein:vir:98396 Length: 441 99.3 4E-12 2.5E-15 83.1 24.2 413 1-521 16-441 (441)
74 protein:vir:9507 Length: 395 # 99.3 3E-12 1.8E-15 83.8 22.1 388 1-525 1-395 (395)
75 protein:vir:100650 Length: 395 99.3 3E-12 1.8E-15 83.8 22.1 388 1-525 1-395 (395)
76 protein:vir:101289 Length: 395 99.3 3E-12 1.8E-15 83.8 22.1 388 1-525 1-395 (395)
77 protein:vir:79984 Length: 441 99.3 2.8E-11 1.7E-14 78.5 26.1 415 1-521 1-441 (441)
78 protein:vir:9408 Length: 441 # 99.3 2.8E-11 1.7E-14 78.5 26.1 415 1-521 1-441 (441)
79 protein:vir:4952 Length: 386 # 99.3 1.3E-11 8E-15 80.3 24.0 373 7-510 1-386 (386)
80 protein:vir:4995 Length: 384 # 99.2 4.7E-12 2.9E-15 82.7 20.4 368 7-461 1-384 (384)
81 protein:vir:95965 Length: 385 99.2 3E-11 1.8E-14 78.3 24.7 372 1-508 1-385 (385)
82 protein:vir:94002 Length: 378 99.2 5E-11 3.1E-14 77.0 25.2 374 1-523 1-378 (378)
83 protein:vir:78310 Length: 376 99.2 6.3E-11 3.9E-14 76.5 24.2 369 1-507 1-376 (376)
84 protein:vir:1661 Length: 378 # 99.2 1.8E-10 1.1E-13 74.0 26.3 374 1-519 1-378 (378)
85 protein:vir:93867 Length: 378 99.2 1.4E-10 8.9E-14 74.5 24.7 372 1-527 1-378 (378)
86 protein:vir:4828 Length: 382 # 99.1 3E-10 1.9E-13 72.8 22.3 374 1-497 1-382 (382)
87 protein:vir:4089 Length: 395 # 99.0 2.2E-09 1.4E-12 68.0 26.7 379 1-526 1-395 (395)
88 protein:vir:7407 Length: 392 # 99.0 3.8E-10 2.3E-13 72.2 21.3 386 1-497 1-392 (392)
89 protein:vir:858 Length: 378 # 99.0 3.5E-09 2.2E-12 67.0 25.9 372 1-519 1-378 (378)
90 protein:vir:1082 Length: 359 # 99.0 4.4E-10 2.8E-13 71.9 20.4 352 1-483 1-359 (359)
91 protein:vir:94869 Length: 378 98.9 8.7E-09 5.4E-12 64.8 25.9 374 1-523 1-378 (378)
92 protein:vir:99452 Length: 651 98.9 8.5E-09 5.3E-12 64.8 25.7 514 1-602 39-651 (651)
93 protein:vir:1023 Length: 392 # 98.9 1.2E-09 7.7E-13 69.4 20.4 386 1-497 1-392 (392)
94 protein:vir:3989 Length: 392 # 98.9 1.2E-09 7.7E-13 69.4 20.4 386 1-497 1-392 (392)
95 protein:vir:9641 Length: 395 # 98.9 8.4E-09 5.2E-12 64.9 23.8 384 1-512 1-395 (395)
96 protein:vir:98643 Length: 395 98.9 1.6E-08 9.8E-12 63.3 24.4 384 1-512 1-395 (395)
97 protein:vir:10321 Length: 495 98.8 2.9E-08 1.8E-11 61.9 24.0 441 1-526 1-495 (495)
98 protein:vir:267 Length: 348 # 98.8 1.1E-09 6.8E-13 69.7 15.4 332 19-436 1-348 (348)
99 protein:vir:1150 Length: 350 # 98.7 2.5E-09 1.5E-12 67.8 15.4 330 9-417 1-350 (350)
100 protein:vir:79150 Length: 368 98.7 2.2E-09 1.3E-12 68.1 14.8 356 1-438 1-368 (368)
101 protein:vir:6058 Length: 344 # 98.7 1.1E-09 6.6E-13 69.8 12.4 330 9-422 1-344 (344)
102 protein:vir:78749 Length: 337 98.6 3.1E-09 1.9E-12 67.2 14.2 328 1-418 1-337 (337)
103 protein:vir:2013 Length: 344 # 98.6 5.5E-09 3.4E-12 65.9 15.0 322 9-422 1-344 (344)
104 protein:vir:3780 Length: 345 # 98.6 2.7E-09 1.7E-12 67.5 13.2 326 10-419 1-345 (345)
105 protein:vir:103971 Length: 376 98.5 4.7E-09 2.9E-12 66.2 12.4 345 1-424 1-376 (376)
106 protein:vir:5691 Length: 344 # 98.5 1.7E-08 1E-11 63.2 14.6 321 9-422 1-344 (344)
107 protein:vir:98567 Length: 340 98.4 4.4E-08 2.7E-11 60.9 14.3 323 1-422 1-340 (340)
108 protein:vir:3743 Length: 345 # 98.3 5.7E-08 3.5E-11 60.3 13.7 315 10-419 1-345 (345)
109 protein:vir:78191 Length: 351 98.3 1.9E-07 1.2E-10 57.5 16.5 331 1-424 1-351 (351)
110 protein:vir:98853 Length: 219 98.3 3.4E-08 2.1E-11 61.5 11.7 215 143-422 1-219 (219)
111 protein:vir:3420 Length: 533 # 98.3 1.9E-06 1.2E-09 51.9 26.7 454 1-531 1-533 (533)
112 protein:vir:6382 Length: 553 # 98.2 2.7E-06 1.6E-09 51.1 32.1 462 1-527 1-553 (553)
113 protein:vir:79207 Length: 351 98.2 1.2E-07 7.5E-11 58.5 12.5 331 1-424 1-351 (351)
114 protein:vir:100328 Length: 346 98.1 1.2E-06 7.4E-10 53.1 17.4 336 9-422 1-346 (346)
115 protein:vir:389 Length: 530 # 98.1 4.6E-06 2.8E-09 49.9 28.0 461 1-530 1-530 (530)
116 protein:vir:95542 Length: 548 98.1 4.6E-06 2.8E-09 49.9 27.4 470 1-538 7-548 (548)
117 protein:vir:96738 Length: 505 98.0 7.1E-06 4.4E-09 48.8 30.9 435 1-515 1-505 (505)
118 protein:vir:79538 Length: 502 98.0 7.4E-06 4.6E-09 48.7 25.2 441 1-521 7-502 (502)
119 protein:vir:107742 Length: 537 92.1 0.013 7.9E-06 31.0 25.3 442 1-537 25-537 (537)
120 protein:vir:94049 Length: 532 91.2 0.017 1E-05 30.3 19.5 449 1-555 1-532 (532)
121 protein:vir:101494 Length: 527 36.6 1.2 0.00072 20.2 19.7 412 1-521 44-527 (527)
122 protein:vir:4698 Length: 251 # 34.0 1.3 0.00082 19.9 11.2 243 1-314 1-251 (251)
123 protein:vir:96068 Length: 765 33.9 1.3 0.00082 19.9 22.6 525 1-634 50-646 (765)
124 protein:vir:102239 Length: 527 31.1 1.5 0.00095 19.6 19.7 412 1-521 44-527 (527)
125 protein:vir:105782 Length: 449 23.7 2.3 0.0014 18.6 23.4 386 1-530 10-449 (449)

No 1
>protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741
Probab=100.00 E-value=0 Score=1865.05 Aligned_cols=631 Identities=98% Similarity=1.439 Sum_probs=619.9

Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE
Q lcl|NC_011057. 
1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |+|++||||||||||++|+++|+||||||+||||++.|||++|++++++||+|||+|||+||||||||||++|||||||| T Consensus 1 ~~a~~~lr~~rrpkg~~~a~~r~L~aAs~~~~dpg~~~~~~~g~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~rL 80 (631) T protein:vir:10 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (631) T ss_pred CCcccceeeeecCCCCCccchhhhhhhhccccchhhhhhhhcCCcccchhhHHHHHHHHhhhhHHHHhhhhhhhhceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) |+||||||||+|||+|+||+|+|+++++||+.|+||++||+|||||+++|||||||+|||+|+||+|++++||||++|++ T Consensus 81 ~as~idpDtg~ptg~iee~~~~~~~v~~~~~~i~gG~lgQ~~llkrl~~~ltV~GE~wiv~l~~p~~~~~~~pd~~~r~~ 160 (631) T protein:vir:10 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (631) T ss_pred EeeeeccCCCCCccccccCCchhHHHHHHHHhcCCCcchHHHHHHHHHhheecccceEEEEEeccCcCCCCCcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_011057. 
161 QEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKS 240 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 240 (634) ++||+||++|||++++++|+.|++|.|+||+|++++|+|||||||||||++|||||||+||++||||+||||+|+|++|| T Consensus 161 ~~W~~vt~~ei~~~~~g~g~~v~lp~g~~h~~~~~~D~l~RiW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaakS 240 (631) T protein:vir:10 161 QEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKS 240 (631) T ss_pred cceeeccHHHHhcccCcccceeecCCCCccceecCCceEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccc Q lcl|NC_011057. 241 RLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVK 320 (634) Q Consensus 241 RL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ik 320 (634) |||||||||||+|||||++++|+++++|++++|+.|+||++||++||||||+|||+||+|+||+||||+++|+|||++|| T Consensus 241 Rl~gnGvlflP~els~P~~~~~~~~~~g~~v~~~~g~pa~~~l~~~l~q~a~tai~De~S~aA~vPii~~~p~E~i~~i~ 320 (631) T protein:vir:10 241 RLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVK 320 (631) T ss_pred HHhhCceeEeccccccCCCCCCCCCcCCccCCccccchhHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011057. 
321 HIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAR 400 (634) Q Consensus 321 Hl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~ 400 (634) ||||+||||+++||||||||+|||||+|||||+|||||+|+||||||||+||+||+||+|+|++||+|||++|||++|++ T Consensus 321 hlkf~~ei~e~aiktR~daI~RlA~glDi~pE~LLGlGsd~NHWsAWqI~dedVrlHI~P~l~lic~AlT~q~Lrp~Le~ 400 (631) T protein:vir:10 321 HIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAR 400 (631) T ss_pred EEeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhh Q lcl|NC_011057. 401 EGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLA 480 (634) Q Consensus 401 eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~la 480 (634) |||||+|||||||+|+|++|||++|||+++||+|+||+|||||++||++|+||||+|+|+|++||+++|++||+|||+|+ T Consensus 401 eGvDp~kYvvW~DaS~Lt~dPdr~deA~qa~drGAIt~eAlrk~lGf~eDd~yd~~t~e~~~~~a~~av~~dpaLip~lA 480 (631) T protein:vir:10 401 EGIDPSKYVVWYDPSQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLA 480 (631) T ss_pred hCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCcCHHHHHHHhcCchhcccCcCchHHHHHHHHHHhhcccCcchhhH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCccHHHHHHHHHHHHHHHhhHHhhcCC Q lcl|NC_011057. 481 PLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGLESGIVDLMVDRALELVGKRRRGRD 560 (634) Q Consensus 481 Pll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~~~a~vdllv~rALelAGkR~Rt~~ 560 (634) ||+.++++.++||+|++ +.++++++++++++++++++||||+++.++.. 
++..+..||++||+|||||||||+|+|+ T Consensus 481 pl~~~~~~~v~~P~~~a-~~~~g~ed~~~~~~~~~g~~epdt~d~~p~~~--~a~~~~~iv~llv~RALelAGkRl~~r~ 557 (631) T protein:vir:10 481 PLIAGVLKQIEFPQQQA-IDSGGNEDTSDADDLDDGEQEPDTEDDDDGTQ--KAGLETGIVDLMVDRALELVGKRRRGRD 557 (631) T ss_pred HHHHHHhhhccCCCCCC-CCCCCCCccccccccccCCCCCCCCCCCCccc--cccchHHHHHHHHHHHHHhhcchhcCCc Confidence 99999999999997754 55668899999999999999999998855544 4556678999999999999999999999 Q ss_pred hhHHHHhhCCChHHhhhhcCCCChhHHHHHHhcccccccHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011057. 561 RETLARLSGVRERDYHRYMDPVPESEVDRLMSGWDSALDDKILLRLGLDPGTIRSAVRRKVMAELTRPVIDVVA 634 (634) Q Consensus 561 R~~~arlr~ip~h~~h~~~~Pv~~~~v~rLi~GWd~~ld~~~~a~~g~Dp~~lr~~v~~~v~~~lt~~vvd~~~ 634 (634) |+.++|+++||.|+||++|+||++++|+|||+|||+++|++++++||+|++|||++|++||+++||++|||||+ T Consensus 558 r~~~ar~~~v~~he~H~~~~Pv~~~ev~rli~gwd~~ld~~~~~~Lg~d~~~lr~~v~a~v~~~lt~~~~~~~~ 631 (631) T protein:vir:10 558 RETLARLSGVRERDYHRYMDPVPESEVDRLMSGWDSALDDKILLRLGLDPGTIRSAVRRKVMAELTRPVIDVVA 631 (631) T ss_pred ccchhHHhcccccccccccCCCCHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=100.00 E-value=4.8e-319 Score=1765.74 Aligned_cols=628 Identities=51% Similarity=0.865 Sum_probs=605.0 Q ss_pred CCCCCcceeEeccCCCCccchh-hhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeee Q lcl|NC_011057. 
1 MAATQSLRLVRRPKGGRPAPSR-ALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCR 79 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~r-al~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (634) ||| |||||||||||++|++|| +||||||+++||++.+||+++++++++||+|||++||+|||||||||||+||||||| T Consensus 1 ma~-~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~r 79 (639) T protein:vir:10 1 MAA-TSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSWRANSCSRTT 79 (639) T ss_pred CCc-cceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhhhhhhhceee Confidence 665 599999999999999996 999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeecccCCCCCCCCC-CCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccc Q lcl|NC_011057. 80 LVASELDENTGLPTGGIS-EDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVR 158 (634) Q Consensus 80 L~aseiD~Dtg~ptG~i~-ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~ 158 (634) ||+||||||||+|||+|. |++|++++++++|+.|+||+|||+|||||+++|||||||+|||+|+||+|+.. ++..+ T Consensus 80 L~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~---~~~~~ 156 (639) T protein:vir:10 80 LIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPV---TGLAA 156 (639) T ss_pred eEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCcccc---Ccccc Confidence 999999999999999997 89999999999999999999999999999999999999999999999998654 66677 Q ss_pred cchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_011057. 
159 TRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANAS 238 (634) Q Consensus 159 ~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 238 (634) ++++||+||++||++++ ++++.|++|||++|+|+++.|+|||||||||||++|||||||+||++||||+||||+|+|++ T Consensus 157 ~~~~W~vvs~~Ei~~~~-~~~~~i~lPdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaa 235 (639) T protein:vir:10 157 PRARWYAVTREEIKSKA-GETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (639) T ss_pred cccceeeeeHHHhcccC-CCeeEeecCCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHH Confidence 89999999999998554 66788999999999999999999999999999999999999999999999999999999999 Q ss_pred HhHhhhCceeeecccccCCCCcCCC----CcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_011057. 239 KSRLIGNGVLFVPHEMSLPAAQGPV----SEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGE 314 (634) Q Consensus 239 ~SRL~gnGvlfvP~e~slP~~~~p~----a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~E 314 (634) |||||||||||||+|||||.+++|+ +..+|++++|+.|.|++++|++||||||+|||+||+|+||+||||+++|+| T Consensus 236 kSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~p~E 315 (639) T protein:vir:10 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (639) T ss_pred HHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEeechH Confidence 9999999999999999999999997 444588999999999999999999999999999999999999999999999 Q ss_pred HhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHH Q lcl|NC_011057. 
315 QIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQIL 394 (634) Q Consensus 315 hi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~l 394 (634) ||+|||||||+||||+++||||||||+|||||+|||||+|||| +|+||||||||+||+||+||+|+|++||+|||++|| T Consensus 316 ~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl-~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~~~L 394 (639) T protein:vir:10 316 HLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDIL 394 (639) T ss_pred HhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeec-ccccceEEEEecccceeeecchhHHHHHHHHHhhHH Confidence 9999999999999999999999999999999999999999999 699999999999999999999999999999999999 Q ss_pred HHHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcc Q lcl|NC_011057. 395 RVTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPT 474 (634) Q Consensus 395 r~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~ 474 (634) |++|++|||||+|||||||+|+|++|||+++||+++||+|+||+||||+|+||++|+||||+|+|+|++||+++|+++|. T Consensus 395 rp~Le~eGvDp~kYvvW~DaS~Lt~dPd~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~~V~~~P~ 474 (639) T protein:vir:10 395 TPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPE 474 (639) T ss_pred HHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHHHHhcCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCccHH---HHHHHHHHHHHHH Q lcl|NC_011057. 475 LIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGLES---GIVDLMVDRALEL 551 (634) Q Consensus 475 Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~~~---a~vdllv~rALel 551 (634) ||++++||+.|.+|+++||+|+++.++++++++++++++++++.|||||+++....++.++... 
+++++||+||||| T Consensus 475 li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga~~~~ePdte~~~~~~~a~~~~~~a~~v~a~~llv~RALel 554 (639) T protein:vir:10 475 LIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDL 554 (639) T ss_pred hhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCCCCCcCCCcccccCCccccCcCchhHHHHHHHHHHHHHHHh Confidence 9999999999999999999999999999999999999999999999999999887777766654 3459999999999 Q ss_pred hhHHh-hcCChhHHHHhhCCChHHhhhhcCCCChhHHHHHHhcccccccHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 552 VGKRR-RGRDRETLARLSGVRERDYHRYMDPVPESEVDRLMSGWDSALDDKILLRLGLDPGTIRSAVRRKVMAELTRPVI 630 (634) Q Consensus 552 AGkR~-Rt~~R~~~arlr~ip~h~~h~~~~Pv~~~~v~rLi~GWd~~ld~~~~a~~g~Dp~~lr~~v~~~v~~~lt~~vv 630 (634) ||||+ ++++|++++++|+||+|+||+||+||++++|+|||+|||++||++++++||+|++|||++|++||+++||++|| T Consensus 555 AGkRr~~~~~r~~~a~~r~vp~he~H~~l~Pv~~~~~~rli~gwd~~ld~~~~a~lg~D~~~lr~~v~~~v~~~lt~~~i 634 (639) T protein:vir:10 555 AGKRRFKVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAGWDTALEDEVVASLGLDNEKLRNAVLATVRRQLTQPLI 634 (639) T ss_pred hcccccCCCChhhHHHhhcCChhHceeecCCCChHHHHHHHHHHHhHHHHHHHHHhCCCHHHHHHHHHHHHHHHHhhhhh Confidence 99998 77899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcC Q lcl|NC_011057. 631 DVVA 634 (634) Q Consensus 631 d~~~ 634 (634) |--. T Consensus 635 ~~ev 638 (639) T protein:vir:10 635 EGEV 638 (639) T ss_pred cccc Confidence 8433 No 3 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=100.00 E-value=4.8e-319 Score=1765.74 Aligned_cols=628 Identities=51% Similarity=0.865 Sum_probs=605.0 Q ss_pred CCCCCcceeEeccCCCCccchh-hhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeee Q lcl|NC_011057. 
1 MAATQSLRLVRRPKGGRPAPSR-ALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCR 79 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~r-al~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (634) ||| |||||||||||++|++|| +||||||+++||++.+||+++++++++||+|||++||+|||||||||||+||||||| T Consensus 1 ma~-~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~r 79 (639) T protein:vir:97 1 MAA-TSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSWRANSCSRTT 79 (639) T ss_pred CCc-cceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhhhhhhhceee Confidence 665 599999999999999996 999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeecccCCCCCCCCC-CCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccc Q lcl|NC_011057. 80 LVASELDENTGLPTGGIS-EDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVR 158 (634) Q Consensus 80 L~aseiD~Dtg~ptG~i~-ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~ 158 (634) ||+||||||||+|||+|. |++|++++++++|+.|+||+|||+|||||+++|||||||+|||+|+||+|+.. ++..+ T Consensus 80 L~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~---~~~~~ 156 (639) T protein:vir:97 80 LIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPV---TGLAA 156 (639) T ss_pred eEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCcccc---Ccccc Confidence 999999999999999997 89999999999999999999999999999999999999999999999998654 66677 Q ss_pred cchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_011057. 
159 TRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANAS 238 (634) Q Consensus 159 ~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 238 (634) ++++||+||++||++++ ++++.|++|||++|+|+++.|+|||||||||||++|||||||+||++||||+||||+|+|++ T Consensus 157 ~~~~W~vvs~~Ei~~~~-~~~~~i~lPdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaa 235 (639) T protein:vir:97 157 PRARWYAVTREEIKSKA-GETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (639) T ss_pred cccceeeeeHHHhcccC-CCeeEeecCCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHH Confidence 89999999999998554 66788999999999999999999999999999999999999999999999999999999999 Q ss_pred HhHhhhCceeeecccccCCCCcCCC----CcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_011057. 239 KSRLIGNGVLFVPHEMSLPAAQGPV----SEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGE 314 (634) Q Consensus 239 ~SRL~gnGvlfvP~e~slP~~~~p~----a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~E 314 (634) |||||||||||||+|||||.+++|+ +..+|++++|+.|.|++++|++||||||+|||+||+|+||+||||+++|+| T Consensus 236 kSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~p~E 315 (639) T protein:vir:97 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (639) T ss_pred HHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEeechH Confidence 9999999999999999999999997 444588999999999999999999999999999999999999999999999 Q ss_pred HhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHH Q lcl|NC_011057. 
315 QIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQIL 394 (634) Q Consensus 315 hi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~l 394 (634) ||+|||||||+||||+++||||||||+|||||+|||||+|||| +|+||||||||+||+||+||+|+|++||+|||++|| T Consensus 316 ~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl-~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~~~L 394 (639) T protein:vir:97 316 HLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDIL 394 (639) T ss_pred HhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeec-ccccceEEEEecccceeeecchhHHHHHHHHHhhHH Confidence 9999999999999999999999999999999999999999999 699999999999999999999999999999999999 Q ss_pred HHHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcc Q lcl|NC_011057. 395 RVTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPT 474 (634) Q Consensus 395 r~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~ 474 (634) |++|++|||||+|||||||+|+|++|||+++||+++||+|+||+||||+|+||++|+||||+|+|+|++||+++|+++|. T Consensus 395 rp~Le~eGvDp~kYvvW~DaS~Lt~dPd~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~~V~~~P~ 474 (639) T protein:vir:97 395 TPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPE 474 (639) T ss_pred HHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHHHHhcCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCccHH---HHHHHHHHHHHHH Q lcl|NC_011057. 475 LIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGLES---GIVDLMVDRALEL 551 (634) Q Consensus 475 Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~~~---a~vdllv~rALel 551 (634) ||++++||+.|.+|+++||+|+++.++++++++++++++++++.|||||+++....++.++... 
+++++||+||||| T Consensus 475 li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga~~~~ePdte~~~~~~~a~~~~~~a~~v~a~~llv~RALel 554 (639) T protein:vir:97 475 LIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDL 554 (639) T ss_pred hhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCCCCCcCCCcccccCCccccCcCchhHHHHHHHHHHHHHHHh Confidence 9999999999999999999999999999999999999999999999999999887777766654 3459999999999 Q ss_pred hhHHh-hcCChhHHHHhhCCChHHhhhhcCCCChhHHHHHHhcccccccHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 552 VGKRR-RGRDRETLARLSGVRERDYHRYMDPVPESEVDRLMSGWDSALDDKILLRLGLDPGTIRSAVRRKVMAELTRPVI 630 (634) Q Consensus 552 AGkR~-Rt~~R~~~arlr~ip~h~~h~~~~Pv~~~~v~rLi~GWd~~ld~~~~a~~g~Dp~~lr~~v~~~v~~~lt~~vv 630 (634) ||||+ ++++|++++++|+||+|+||+||+||++++|+|||+|||++||++++++||+|++|||++|++||+++||++|| T Consensus 555 AGkRr~~~~~r~~~a~~r~vp~he~H~~l~Pv~~~~~~rli~gwd~~ld~~~~a~lg~D~~~lr~~v~~~v~~~lt~~~i 634 (639) T protein:vir:97 555 AGKRRFKVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAGWDTALEDEVVASLGLDNEKLRNAVLATVRRQLTQPLI 634 (639) T ss_pred hcccccCCCChhhHHHhhcCChhHceeecCCCChHHHHHHHHHHHhHHHHHHHHHhCCCHHHHHHHHHHHHHHHHhhhhh Confidence 99998 77899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcC Q lcl|NC_011057. 631 DVVA 634 (634) Q Consensus 631 d~~~ 634 (634) |--. T Consensus 635 ~~ev 638 (639) T protein:vir:97 635 EGEV 638 (639) T ss_pred cccc Confidence 8433 No 4 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=100.00 E-value=6.7e-316 Score=1748.46 Aligned_cols=624 Identities=56% Similarity=0.909 Sum_probs=599.0 Q ss_pred CCCCcceeEeccCCCCccch-hhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 
2 AATQSLRLVRRPKGGRPAPS-RALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 2 ~a~~~lr~vrrp~g~~~a~~-ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) .|++||||||||||++|.+| |+||||||+|+||+++|+|++|++.+++||+|||+|||+|||||||||||+|||||||| T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~rL 80 (629) T protein:vir:86 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQEDAWKAYDAVGELRYYVGWRSSSASRVRL 80 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhccccccchhhhhcCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhceeee Confidence 56899999999999999887 69999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ||||||||||+|||+|+|+++.|+||++||+.|+||++||+|||||+++|||||||+|||+|+|+++ ++||+++++ T Consensus 81 ~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~----~~d~~~~~~ 156 (629) T protein:vir:86 81 IASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKS----RLDSNGNPV 156 (629) T ss_pred EeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCC----ccCCCCcch Confidence 9999999999999999999999999999999999999999999999999999999999999999755 789999999 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_011057. 
161 QEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKS 240 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 240 (634) .+||+||++|||++ ++++.|.+|+|++|+|+++.|+|||||||||||++|||||||+||++||||+||||+|+|++|| T Consensus 157 ~eW~~vt~~ei~~~--~~~~~i~lP~g~~~e~~~~~d~l~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakS 234 (629) T protein:vir:86 157 PEWLALTPEEVRAS--EKKTIIELPTGDKHEFRDGLDGMFRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKS 234 (629) T ss_pred hhheeechHHhhhc--cCceeeEcCCCCcceeeCCCceEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHH Confidence 99999999999866 5569999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhCceeeecccccCCCCcCCCCcCCCCCCC-ccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_011057. 241 RLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIA-PLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDV 319 (634) Q Consensus 241 RL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~-p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~i 319 (634) |||||||||||+|||||++++|...+.++.+. |+.|.|+++||++||||||+|||+||+|+||+||||+++|+|||++| T Consensus 235 RL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~~i 314 (629) T protein:vir:86 235 RLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIKNV 314 (629) T ss_pred HHhhCceeeeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCe Confidence 99999999999999999999998776666665 57999999999999999999999999999999999999999999999 Q ss_pred ceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 
320 KHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLA 399 (634) Q Consensus 320 kHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~ 399 (634) |||||+||||+++||||||||+|||||+|||||+|||||+|+||||||||+||+||+||+|+|++||+|||++|||++|+ T Consensus 315 ~hlkf~~ei~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp~Le 394 (629) T protein:vir:86 315 THLKFDNQVTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQVLRTVLM 394 (629) T ss_pred eEEeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhh Q lcl|NC_011057. 400 REGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPML 479 (634) Q Consensus 400 ~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~l 479 (634) +|||||+|||||||+|+|++|||+++||+++||+|+||+|||||++||+||+||||+|+|+|++||+++|+++|+|||+| T Consensus 395 ~eGiDp~kYvvW~DaS~Lt~dPd~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~Li~~~ 474 (629) T protein:vir:86 395 REGIDPNAYVVWHDASQLTVDPDKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNLLPTL 474 (629) T ss_pred HhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCcCHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcchhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCcc---HHHHHHHHHHHHHHHhhHHh Q lcl|NC_011057. 
480 APLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGL---ESGIVDLMVDRALELVGKRR 556 (634) Q Consensus 480 aPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~---~~a~vdllv~rALelAGkR~ 556 (634) +||+ +.+..++||.|.++++|+.+.+.+++.+++.+++||+||++++++.++++.+ ..++|++||+|||||||||+ T Consensus 475 a~l~-~~~a~~~~P~~~~~~pp~~e~~~~dE~sga~~~~ep~te~d~~~~~a~~aa~~~~~~a~V~llv~RALelAGkR~ 553 (629) T protein:vir:86 475 AVLI-PELADVEFPTPTVALPPAEEQDGDEEASGASRREEPDTEDDAGTDDSDQASLDSRETAMVEALVFRALELAGKRS 553 (629) T ss_pred hhhh-hhhcccccCccCCCCCccccCCCcccccCCCcCCCCCCCCCCcccccCCCCCCCcHHHHHHHHHHHHHHhcCCcC Confidence 9999 8889999999999999988888888888888999999999998887766665 35789999999999999987 Q ss_pred hcCChhHHHHhhCCChHHhhhhcCCCChhHHHHHHhcccccccHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011057. 557 RGRDRETLARLSGVRERDYHRYMDPVPESEVDRLMSGWDSALDDKILLRLGLDPGTIRSAVRRKVMAELTRPVIDVVA 634 (634) Q Consensus 557 Rt~~R~~~arlr~ip~h~~h~~~~Pv~~~~v~rLi~GWd~~ld~~~~a~~g~Dp~~lr~~v~~~v~~~lt~~vvd~~~ 634 (634) |+ |++++++|+||+|+||+||+||++++|+|||+|||++||++++++||+|++|||++|++||+++||++|=--|. T Consensus 554 r~--r~~~a~~r~v~~he~h~~l~Pv~~~~v~rli~gwd~~ld~~~~~~Lg~d~~~lr~~v~a~v~~~lt~~v~~ev~ 629 (629) T protein:vir:86 554 RT--RSLPYELRQLSDRELVRRLEPVRREHVADLIRGWDSMLEERAVQALNMNIPGIRAAVKRAVYGELTKTIDGEVS 629 (629) T ss_pred CC--hhhHHHHhccChhhcceecCCCChHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhcccccccC Confidence 54 99999999999999999999999999999999999999999999999999999999999999999988433334 No 5 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=100.00 E-value=8.7e-316 Score=1747.86 Aligned_cols=623 Identities=57% Similarity=0.915 Sum_probs=598.7 Q ss_pred CCCCcceeEeccCCCCccch-hhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 
2 AATQSLRLVRRPKGGRPAPS-RALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 2 ~a~~~lr~vrrp~g~~~a~~-ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) .|++||||||||||++|.+| |+||||||+|+||+++|+|++|++.+++||+|||+|||+|||||||||||+|||||||| T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~rL 80 (629) T protein:vir:99 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQDDAWKAYDAVGELRYYVGWRSSSASRVRL 80 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhcccccchhhhhhcCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhceeee Confidence 56899999999999999887 69999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ||||||||||+|||+|+|+++.|+||++||+.|+||++||+|||||+++|||||||+|||+|+|+++ ++||+++++ T Consensus 81 ~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~----~~d~~~~~~ 156 (629) T protein:vir:99 81 IASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKS----RLDSNGNPV 156 (629) T ss_pred EeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCC----ccCCCCcch Confidence 9999999999999999999999999999999999999999999999999999999999999999755 789999999 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_011057. 
161 QEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKS 240 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 240 (634) .+||+||++|||++ ++++.|.+|+|++|+|+++.|+|||||||||||++|||||||+||++||||+||||+|+|++|| T Consensus 157 ~eW~~vt~~ei~~~--~~~~~i~lP~g~~~e~~~~~d~l~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakS 234 (629) T protein:vir:99 157 PEWLALTPEEVRAS--EKKTIIELPTGDKHEFRDGLDGMFRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKS 234 (629) T ss_pred hhheeechHHhhhc--cCceeEEcCCCCccceeCCCceEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHH Confidence 99999999999866 5569999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhCceeeecccccCCCCcCCCCcCCCCCCC-ccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_011057. 241 RLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIA-PLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDV 319 (634) Q Consensus 241 RL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~-p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~i 319 (634) |||||||||||+|||||++++|...+.++.+. |+.|.|+++||++||||||+|||+||+|+||+||||+++|+|||++| T Consensus 235 RL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~~i 314 (629) T protein:vir:99 235 RLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIKNV 314 (629) T ss_pred HHhhCceeEeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCe Confidence 99999999999999999999998776666665 57999999999999999999999999999999999999999999999 Q ss_pred ceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 
320 KHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLA 399 (634) Q Consensus 320 kHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~ 399 (634) |||||+||||+++||||||||+|||||+|||||+|||||+|+||||||||+||+||+||+|+|++||+|||++|||++|+ T Consensus 315 ~hlkf~~ei~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp~Le 394 (629) T protein:vir:99 315 THLKFDNQVTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQVLRTVLM 394 (629) T ss_pred eEEeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchhHHHHHHHHHhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhh Q lcl|NC_011057. 400 REGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPML 479 (634) Q Consensus 400 ~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~l 479 (634) +|||||+|||||||+|+|++|||+++||+++||+|+||+|||||++||+||+||||+|+|+|++||+++|+++|+|||+| T Consensus 395 ~eGiDp~kYvvW~DaS~Lt~dPd~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~Li~~~ 474 (629) T protein:vir:99 395 REGIDPNAYVVWHDASQLTVDPDKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNLLPTL 474 (629) T ss_pred HhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcchhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCcc---HHHHHHHHHHHHHHHhhHHh Q lcl|NC_011057. 
480 APLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGL---ESGIVDLMVDRALELVGKRR 556 (634) Q Consensus 480 aPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~---~~a~vdllv~rALelAGkR~ 556 (634) +||+ +.+..++||.|+++++|+.+.+.+++.+++.+++||+||++++++.++++.+ ..++|++||+|||||||||+ T Consensus 475 a~l~-~~~a~~~~P~~~~~~pp~~e~~~~dE~sga~~~~ep~te~d~~~~~a~~aa~~~~~~a~V~llv~RALelAGkR~ 553 (629) T protein:vir:99 475 AVLI-PELADVEFPTPTVALPPAEEQDGDEEASGASRREEPDTEDDAGTDDSDQASLDSRETAMVEALVFRALELAGKRS 553 (629) T ss_pred hhhh-hhhcccccCccCCCCCccccCCCcccccCCCcCCCCCCCCCCcccccCCCCCCCcHHHHHHHHHHHHHHhcCCcC Confidence 9999 8889999999999999988888888888888999999999998877766665 35789999999999999987 Q ss_pred hcCChhHHHHhhCCChHHhhhhcCCCChhHHHHHHhcccccccHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHh-cC Q lcl|NC_011057. 557 RGRDRETLARLSGVRERDYHRYMDPVPESEVDRLMSGWDSALDDKILLRLGLDPGTIRSAVRRKVMAELTRPVIDV-VA 634 (634) Q Consensus 557 Rt~~R~~~arlr~ip~h~~h~~~~Pv~~~~v~rLi~GWd~~ld~~~~a~~g~Dp~~lr~~v~~~v~~~lt~~vvd~-~~ 634 (634) |+ |++++++|+||+|+||+||+||++++|+|||+|||++||++++++||+|++|||++|++||+++||+. ||- |. T Consensus 554 r~--r~~~ar~r~v~~he~h~~l~Pv~~~~i~rli~gwd~~ld~~~~~~Lg~d~~~lr~~v~a~v~~~lt~~-~~~ev~ 629 (629) T protein:vir:99 554 RT--RSLPYELRQLSDRELVRRLEPVRREHVADLIRGWDSMLEERAVQALNMNIPGIRAAVKRAVYGELTKT-IDGEVS 629 (629) T ss_pred CC--hhhHHHHhcCchhhceeecCCCCHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhhh-hccccC Confidence 54 99999999999999999999999999999999999999999999999999999999999999999977 543 33 No 6 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=100.00 E-value=1.4e-313 Score=1735.84 Aligned_cols=621 Identities=55% Similarity=0.933 Sum_probs=590.9 Q ss_pred CCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhh-cccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 
2 AATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKS-TGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 2 ~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~-~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) .|++||||||||||++ + ||+|+|||||+ ||+++|||+ ++.+.+++||+|||+|||+|||||||||||+|||||||| T Consensus 1 ma~~~lrv~rrpk~~p-~-~r~l~aasqp~-~P~~~~~~~~~g~~~~~~WQ~eAW~~~d~VgElryyvgW~~ss~Sr~rL 77 (629) T protein:vir:10 1 MAASTLRVSRRPKGSP-A-RRSLTAASQPM-EPGRTPSRQVAGTVVRTSWQNEAWECMDLVGELRYYVGWRASSCSRVEL 77 (629) T ss_pred CCccceeEEecCCCcc-c-eeeeccccCCC-CcchhhchhhhhhhhhhhhhHHHHHHHHhhhhHHHHhhhhhhhheeeeE Confidence 5689999999999994 3 99999999998 899999998 555579999999999999999999999999999999999 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ||||||||||+|||+|+||+|+|++|++||+.|+||+|||+||+||+++|||||||+|||||.|+.+ +|||++|+ T Consensus 78 ~as~idpDtg~ptg~i~ed~p~~~~v~~~v~~iagG~lGqaqLlkr~~~~ltV~GE~~i~il~~~~~----~pd~~~r~- 152 (629) T protein:vir:10 78 IASELDPDTGKPTGGIRDDDPDGLRFLEIVKTMAGGPLGQAQLQKRAAECLTVPGEHRICLLDQGDK----NPDGSVRH- 152 (629) T ss_pred EEeeecCCCCCCccccccCchhHHHHHHHHHHhcCccchHHHHHHHHHhheeccCceEEEEeecCCC----CCCccccc- Confidence 9999999999999999999999999999999999999999999999999999999999999998554 89999775 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKS 240 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 240 (634) +||+||++||++++. 
+++.|+||+|++|+|+++.|+|||||||||||++|||||||+||++||||+||||+|+|++|| T Consensus 153 -~W~vVt~~Ei~~kg~-g~~~i~lpdg~~he~~~~~D~l~RvW~P~Prr~~e~DSpvra~l~~lrEi~r~tk~i~~aakS 230 (629) T protein:vir:10 153 -NWYVVTNDEVKNKGA-GKTDIELPDGTIHEYSKGRDVMFRVWNPRPRRAKEPDSPVRACLDSLREIIRTTKKIRNASKS 230 (629) T ss_pred -ceeeecHHHhccccC-ceeEEEcCCCceeeeeCCCeeEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHh Confidence 999999999997764 457899999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhCceeeecccccCCCCcCCCCcCCCCCCCcc-ccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_011057. 241 RLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPL-VGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDV 319 (634) Q Consensus 241 RL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~-~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~i 319 (634) |||||||||||+|||||.+++|+....|+.++|+ .|.+++++|++||||||+|||+||+||||+||||+++|+|||+|| T Consensus 231 RL~gnGvlflP~e~slp~~~ap~~~~~Pg~~~p~~~g~aa~d~l~~~l~q~a~aAi~De~S~aA~vPiia~vP~E~l~~i 310 (629) T protein:vir:10 231 RLIGNGVVFLPQELSLPRATAPVADNQPGAPVPIVDGVAAADELSNLLFQTAAAAVDDEDSQAALIPLLATVPGEHLQKI 310 (629) T ss_pred HHhhCceeEeccCcccccccCCCCCCCCcccccccCCCcchHHHHHHHHHHHHhhhcCCCCccceeeeEEeechHHhcCe Confidence 9999999999999999999999988888888775 799999999999999999999999999999999999999999999 Q ss_pred ceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 
320 KHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLA 399 (634) Q Consensus 320 kHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~ 399 (634) |||||+||||+++||||||||+|||||+|||||+|||||+|+||||||||+||+||+||+|+|++||+|||++|||++|+ T Consensus 311 khLkf~~eite~~iktR~daI~RlAmglDispErLLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~Ait~~~Lrp~L~ 390 (629) T protein:vir:10 311 FHLKIGNEITEVEIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVQLHIKPVMEVLCAAIYREVLVATLR 390 (629) T ss_pred eeeeecCchhHHHHhhHHHHHHHHHhccCCChhheeeccCCccceeeEEecccceeeecchHHHHHHHHHHhHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhh Q lcl|NC_011057. 400 REGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPML 479 (634) Q Consensus 400 ~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~l 479 (634) +|||||++||||||+|+|++|||+++||+++||+|+||+||||+|+||+++++|||+|.|+|++||+++|.+||.||++| T Consensus 391 ~eGiDp~~Yvvw~DaS~Lt~dPd~~deA~~a~drGaIt~eAlRr~lG~~~dd~y~~~t~~~~q~~A~~~v~~~P~Li~~~ 470 (629) T protein:vir:10 391 AEGIDPDRYVLWYDASGLTVDPDKTDEATAAKEQGAITHEAYRRYLGLADEDGYDLETLEGAQAWARDAIVADPSLIKVL 470 (629) T ss_pred HhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhccccccCCCcCCcHHHHHHHHHHhcCCCchhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCccH-HHHHHHHHHHHHHHhhHHh-h Q lcl|NC_011057. 480 APLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGLE-SGIVDLMVDRALELVGKRR-R 557 (634) Q Consensus 480 aPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~~-~a~vdllv~rALelAGkR~-R 557 (634) +||+.|.+|.++||+|+++++++++++++++...+ ++||+||++++.++..+.... 
+.++++||+|||||||||+ + T Consensus 471 apll~~~l~~i~~P~p~~a~~~~~~~~~~~E~~~~--~~e~~~e~dA~~a~~~~~~aa~~~A~rllv~RALelAGkRl~~ 548 (629) T protein:vir:10 471 APLLTDELAEIDWPEPPAALPPGEDDQADEEQDTT--GSEPSTEDDAEAAARISSVADMVLAERLLTVRALGLAGKRRVN 548 (629) T ss_pred hhhcCCccccccccCCCCcCCCCCcccCccccCCC--CCCcCCCcchhhcccCCchhhHHHHHHHHHHHHHHHccccccC Confidence 99999999999999999999998877666655444 669999999888777666655 4688999999999999999 5 Q ss_pred cCChhHHHHhhCCChHHhhhhcCCCChhHHHHHHhcccccccHHHHHHhCCCHHH---HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011057. 558 GRDRETLARLSGVRERDYHRYMDPVPESEVDRLMSGWDSALDDKILLRLGLDPGT---IRSAVRRKVMAELTRPVIDVVA 634 (634) Q Consensus 558 t~~R~~~arlr~ip~h~~h~~~~Pv~~~~v~rLi~GWd~~ld~~~~a~~g~Dp~~---lr~~v~~~v~~~lt~~vvd~~~ 634 (634) +++|+.++|+++||+|+||++|+||++++++|||+|||++||++++++||+|++| |+++|++.|+++||++|||--. T Consensus 549 ~rdR~~~ar~~~vp~he~h~~l~Pv~~~~v~rli~gwd~~l~~~~~a~lg~D~~~~~~~~sav~~~v~~~lt~~~~~~ev 628 (629) T protein:vir:10 549 TNDRAQKARLAGIAPHDYHRVMGPVADADIPRLIAGWDEGLEEEALALLGVDSRRTEALRSAVRAQIRRELTMPVVDAEV 628 (629) T ss_pred CCchhhHHHhhcCChhhceeecCCCChhHHHHHHHhhhhHHHHHHHHHhCCChhhhHHHHHHHHHHHHHHhhhhhhcccc Confidence 6799999999999999999999999999999999999999999999999999975 6777999999999999998533 No 7 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=100.00 E-value=8.6e-275 Score=1523.10 Aligned_cols=587 Identities=29% Similarity=0.483 Sum_probs=535.8 Q ss_pred CCCCCcceeEeccCCCCc-------cchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhh Q lcl|NC_011057. 
1 MAATQSLRLVRRPKGGRP-------APSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRAS 73 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~-------a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~ 73 (634) |+ .-||||++| ++||+||||||++..+++.++| ++++++++||+|||+|||+|||||||||||+| T Consensus 1 ~~-------~~rPk~~p~~p~~~~~arrr~LtaAsa~l~~~~~~~~k-t~~~~~~~WQ~eAW~~~d~vpELry~vgW~~~ 72 (646) T protein:vir:10 1 MA-------LLKPKSAPPEPFGAEVARRIALAGATAQVDLGASSSWK-TWKFGNKDWQTEGWRLYDIIPEHHFLAGRIGD 72 (646) T ss_pred Cc-------ccCCCCCCCCcccccccchhhhhhccccccCCCcceee-cCCCcchhhhHHHHHHHhhhhhHhhHhhhhhh Confidence 43 358999999 4568999999999999999877 89999999999999999999999999999999 Q ss_pred ceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCc Q lcl|NC_011057. 74 SCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQP 153 (634) Q Consensus 74 s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~ 153 (634) ||||||||||||| |||+|||++.+ +++++||+.|+||.+||+|||||+++|||||||+|||+ +++...+ T Consensus 73 a~SR~rL~aseid-dtG~~tg~v~~-----~~v~~iv~~~~Gg~~gQ~qlLkr~~~~ltV~GE~wiv~-----~~~~~~~ 141 (646) T protein:vir:10 73 SVAQARLYVTEVD-DTGEETGEVQD-----ERIKRLAAVPLGTGSQRDDNLRLAGLDLAVGGECWIVG-----EGAATSP 141 (646) T ss_pred hhceeeeeeeeec-CCCCCcCccch-----HHHHHHhhhhccchhhHHHHHHHHHhheecccceEEee-----ccccCCC Confidence 9999999999999 99999999998 79999999999999999999999999999999999995 3455555 Q ss_pred ccccccchhceeccHHHHhccCCCcceeeEeCC---CCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhh Q lcl|NC_011057. 
154 DGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPT---GEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRT 230 (634) Q Consensus 154 dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~---g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rt 230 (634) ++ +++.||+||.+||+++ ++++.|++|+ |++|+|++++|+|||||||||||++|||||||+||++||||+|| T Consensus 142 ~~---~~~~W~vvt~~Ev~~t--g~~~~i~~p~~~~g~~~v~~~~~d~lvRiW~P~Prr~~epDSpvra~l~~l~Ei~~l 216 (646) T protein:vir:10 142 EA---AEGSWFVVTGSAISRT--GDEIAVRRPQQRGGSKLVLVDGQDILIRCWRPHPNDTDQADSFTRSAIVPLREIELL 216 (646) T ss_pred CC---CccceeeecHHHhccC--CCeeeeecCccCCCCCcceecCCceEEEEecCCcccccCCcchhHHHHHHHHHHHHh Confidence 55 3788999999999764 6789999998 99999999999999999999999999999999999999999999 Q ss_pred hHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_011057. 231 TKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAG 310 (634) Q Consensus 231 tk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~ 310 (634) ||+|+|++|||||||||||||+|||||.++++.+ ..++|++||||||+|||+||+|+||+||||++ T Consensus 217 t~~I~aaakSRL~GnGvLfvP~e~s~p~~~~~~a--------------~~~~l~~~l~qaa~tAi~De~S~aA~vPiia~ 282 (646) T protein:vir:10 217 TKREFAELDSRLTGAGIMFLPEGVDFPRGEEDPA--------------GLAGFMAYLQRAAAASMADQSRASAMVPIMAT 282 (646) T ss_pred hhHhHHHHHHHHhcCceeeeccccccCCCCCCCc--------------chhHHHHHHHHHHHhhhcCCCCccceeeeEEe Confidence 9999999999999999999999999998875422 36689999999999999999999999999999 Q ss_pred echH---HhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHH Q lcl|NC_011057. 
311 VPGE---QIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQ 387 (634) Q Consensus 311 vP~E---hi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ 387 (634) +|+| |+++||||+|+||||+++||||||||+|||||+|||||+||||| |+||||||||+||+|| ||+|+|++||+ T Consensus 283 ~P~E~i~~~~~ik~l~f~~eite~aiktR~daI~RlA~glDIppE~LLGlg-d~NHWtAWqI~de~vr-HI~P~l~~ic~ 360 (646) T protein:vir:10 283 IPNEMMEHLDKIKPLTFWSELSAEITPMKDKAIARLASSAEIPGEVLTGIG-DANHWTAWLISDEGIR-WIRGYLGLIAD 360 (646) T ss_pred eChHHHhhhhcceeeccCchhhHHHhhhHHHHHHHHHhccCCchhheeecc-ccceeeeeeeccccch-hhhhHHHHHHH Confidence 9998 55688888999999999999999999999999999999999998 9999999999999999 99999999999 Q ss_pred HHHHHHHHHHHHhcCC-ChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHH Q lcl|NC_011057. 388 ALTDQILRVTLAREGI-DPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQ 466 (634) Q Consensus 388 ait~~~lr~~L~~eG~-d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~ 466 (634) |||++|||++|++||| ||++||||||+|+|++|||+++||+++||+|+||+|||||++||++|++| +++|+|++|++ T Consensus 361 AlT~~~Lrp~Le~eGi~dp~kyvvW~DaS~Lt~~pd~~deA~qa~drGAIt~eAlrk~~Gf~~dd~p--t~~E~~~~~~~ 438 (646) T protein:vir:10 361 ALTRGFLRRALESMGVTNPERYAFAFDTSTLASKPNRLDEAIQLHERNLIKDEEVVKAGAFSVDQMP--TVQERAVQILL 438 (646) T ss_pred HHHhhHHHHHHHHcCCCChhHeEEeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhcccccccC--ChHHHHHHHHH Confidence 9999999999999999 99999999999999999999999999999999999999999999999999 78899999999 Q ss_pred HHhhcCcccc--hhh-hhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCccH------ Q lcl|NC_011057. 
467 DAVSKDPTLI--PML-APLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGLE------ 537 (634) Q Consensus 467 d~v~~dp~Li--~~l-aPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~~------ 537 (634) ++|+++|.|| |.+ +++..|.+|+++|| |+++..+++++++ ++.+++.+++||+|++++.++.+..+..+ T Consensus 439 ~~v~~~P~Lil~P~~qa~~~~P~~~~~~lp-p~~~~~~dg~~~~-~e~~g~~~~~E~~~~pda~~~~a~~~~~~~r~~~~ 516 (646) T protein:vir:10 439 GLVKTQPDLILDPAIQAALGLPAVQSVGLP-PTAAQRTDGDLDD-DESEGAPNGGEAPDQPDADEARAITAALDRRIALA 516 (646) T ss_pred HHhcCCccccccchhhccccCCCcCccccC-CcccccccCCCCC-hhhcCCCCCCccCCCCCCCccccccccccccchhh Confidence 9999999998 222 55556777888887 5666667666555 44555777888888998877665443332 Q ss_pred --------------HHHHHHHHHHHHHHhhHHhhcCChhHHHHhhCCChHHhhhhcCCCChhHHHHHHhc-ccccccHHH Q lcl|NC_011057. 538 --------------SGIVDLMVDRALELVGKRRRGRDRETLARLSGVRERDYHRYMDPVPESEVDRLMSG-WDSALDDKI 602 (634) Q Consensus 538 --------------~a~vdllv~rALelAGkR~Rt~~R~~~arlr~ip~h~~h~~~~Pv~~~~v~rLi~G-Wd~~ld~~~ 602 (634) .++||+||+|||||||||+|| +|+.++++|+||+|+||++|+||++++++||+.| ||+++ ++ T Consensus 517 ~~~~~~~~~p~a~~~aav~l~v~RAL~lAG~Rlrt-~~~~~a~~r~vp~he~h~~l~Pv~~~~~~rl~~G~wd~~~--~v 593 (646) T protein:vir:10 517 ARPVLALPSPEAVFNASAKLMILRALELAGGRLTT-PAERRGRWSDVPRHELHHHVGPITPDKARRVTEGAWNHVA--VA 593 (646) T ss_pred hhhhhccccchhHHHHHHHHHHHHHHHhccccccC-chhhhHHhhcCChhhceeecCCCChhhHHHHHhcccccHH--HH Confidence 257999999999999999975 8999999999999999999999999999999999 99997 69 Q ss_pred HHHhCCCHHHHHHHHHHHHHHHHHHHHH---HhcC Q lcl|NC_011057. 603 LLRLGLDPGTIRSAVRRKVMAELTRPVI---DVVA 634 (634) Q Consensus 603 ~a~~g~Dp~~lr~~v~~~v~~~lt~~vv---d~~~ 634 (634) +++||+|++|||++|++|||++||+||= ||+. 
T Consensus 594 ~~~lg~D~~~lr~~v~~~Vr~~lt~g~~~~~~~~~ 628 (646) T protein:vir:10 594 AADLGVDAGELERVLSTYVLELLTRGLRHHDDMLY 628 (646) T ss_pred HHhcCCChHHHHHHHHHHHHHHHhcCCCcccccee Confidence 9999999999999999999999999953 2222 No 8 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=99.77 E-value=3.5e-19 Score=121.79 Aligned_cols=405 Identities=14% Similarity=0.153 Sum_probs=230.7 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHH--HHHHHhhhhhHHHHHhhhhhceeee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTD--AWEAVDLVGELRYYVGWRASSCSRC 78 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~e--AW~~yd~VgELryyvgWr~~s~Sr~ 78 (634) |- =..+.+|.....+...-.+.-... .....|... .++.|-..+-+.-.+.-+++.+|.+ T Consensus 1 ~~---f~~~f~r~~~~~~~~~~~~~~~~~---------------~~~~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~ 62 (413) T protein:vir:48 1 MF---FSGLFQRKSDAPVTTPAELAEAIG---------------LSYDTYTGKRISSQRAMRLTAVYSCVRVLAESVGML 62 (413) T ss_pred Cc---cchhhccCccCCccchHHHHHhhh---------------cCcccccCceechhhhhccHHHHHHHHHHHHhhhhC Confidence 21 113444433322211111111100 000011111 1233334566777788899999999 Q ss_pred eEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccc Q lcl|NC_011057. 79 RLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVR 158 (634) Q Consensus 79 rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~ 158 (634) .+..-+.+.+ |.... .+ ..+..++..-...-+...++++.++.+|-+-|++|+.+. |. + |. 
T Consensus 63 p~~~~~~~~~-----~~~~~--~~-~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~-~~-~-------g~-- 123 (413) T protein:vir:48 63 PCSLYKISGT-----LKTRV--VD-ERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKV-KA-L-------GE-- 123 (413) T ss_pred ceEEEEecCC-----cceee--cc-cHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEE-eC-C-------Cc-- Confidence 9988777744 22211 11 335566666677889999999999999999999998863 32 1 21 Q ss_pred cchhceeccHHHHhccCCCcce---eeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_011057. 159 TRQEWYAVSKEEIKKSNKGSGT---NIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIA 235 (634) Q Consensus 159 ~~~~W~~vt~~Ei~~~~~~~~~---~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 235 (634) + .+.+.+..+.+.....+++. .+..++|..+.|..+.=+-||.-+++ -..--||+..+...+.-..-..+... T Consensus 124 ~-~~L~~l~~~~v~~~~~~~~~~~y~~~~~~g~~~~~~~~evih~~~~~~d---~~~G~s~i~~~~~~i~~~~~~~~~~~ 199 (413) T protein:vir:48 124 V-VELLPIDPGCVEPKLNSQWQPVYQVTFPDGSVDVLTQDEIWHVRTLTLD---GLVGLNPIAYAREAISLAAATEEHGA 199 (413) T ss_pred E-EEEEEEcCceEEEEEcCCceEEEEEEecCceEEEEccccEEEecCcCCC---CcccccHHHHHHHHHHHHHHHHHHHH Confidence 1 23455555555433333332 36778888877654222334432222 24456887777776655555555555 Q ss_pred HHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHH Q lcl|NC_011057. 236 NASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQ 315 (634) Q Consensus 236 na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Eh 315 (634) +..+.-..-.|||-+|+.++- -..+++.+.+.+.-. ...+.+ -|+|+ + . 
T Consensus 200 ~~~~ng~~p~gil~~~~~~~~---------------------e~~~~~~~~~~~~~~-g~~n~g-----~~~vl--~--~ 248 (413) T protein:vir:48 200 RLFGNGAVTSGVLRTEQKLTP---------------------DAYERLKKDFEERHT-GLGNAH-----RPMIL--E--M 248 (413) T ss_pred HHHhccCCcceEEEeCCCCCH---------------------HHHHHHHHHHHHHhc-CccccC-----cceec--C--C Confidence 555555555688877764431 134555555552211 112222 23443 2 2 Q ss_pred hcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHH Q lcl|NC_011057. 316 IKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILR 395 (634) Q Consensus 316 i~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr 395 (634) .-+++-|.+... +.--+++|+..+..||+.|.||| .+||...++|+.++.+....-.+..|.|.++.|+++|++.+|. T Consensus 249 g~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~t~~n~e~~~~~f~~~~i~P~~~~ie~~l~~~L~~ 326 (413) T protein:vir:48 249 GLDWKSMALNAE-DSQFLETRKFQLEEICRLFRVPL-HMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGLVR 326 (413) T ss_pred CceEEeccCChh-HHHHHHHHHHHHHHHHHHhCCCH-HHhCCCcCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 235555554322 22247899999999999999999 7778767889999999999999999999999999999999987 Q ss_pred HHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcC Q lcl|NC_011057. 396 VTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKD 472 (634) Q Consensus 396 ~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~d 472 (634) +.- + ..|.|+||.+.|.. +|..+.+ ..+++.|++|.+++|+++|++.-.|=| T Consensus 327 ~~~---~---~~~~~~fd~~~l~~-~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~ggD------------------ 381 (413) T protein:vir:48 327 ESK---Q---GKFYAKFNAGALLR-GDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGD------------------ 381 (413) T ss_pred ccc---c---CCeEEEEechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc------------------ Confidence 632 1 25789999999854 3544443 348889999999999999996422211 Q ss_pred cccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCC Q lcl|NC_011057. 
473 PTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQ 526 (634) Q Consensus 473 p~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~ 526 (634) --+. |+ ... ++. ..++++..+ .+++.+++| .+ T Consensus 382 ~~~~----~~--------n~~---~~~-~~~~~~~~~----~~~~~~~~~--~~ 413 (413) T protein:vir:48 382 VYLT----PM--------NMT---TSP-SAGDDNGKK----KESGDADKT--AS 413 (413) T ss_pred eeec----cc--------ccc---ccc-cccccCCCC----CCCCCcccc--CC Confidence 0000 00 000 000 111111000 011111111 10 No 9 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=99.75 E-value=1.6e-18 Score=118.19 Aligned_cols=416 Identities=15% Similarity=0.086 Sum_probs=217.1 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhh--hhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFS--KSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRC 78 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~--~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~ 78 (634) ||..- -|+.|+-+ ++--++-..-.. .....+...++..-..+.+-..+-+.-.|.-+++.++.+ T Consensus 1 ~~~~~-~~~~~~~~-------------~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~l 66 (460) T protein:vir:10 1 MANRI-IRALRELT-------------GLDNKFNDAFIKYIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAV 66 (460) T ss_pred CchhH-HHHHhhhh-------------ccCCCchHHHHHhhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhC Confidence 32210 11111111 111110000000 000111222344434444445566666677889999999 Q ss_pred eEEEeeecccCCCCCCCCCCCCccc-------------------------HHHHHHHHhhcCCcchHHHHHHHHHHhhcc Q lcl|NC_011057. 79 RLVASELDENTGLPTGGISEDNTEG-------------------------ERVREIVSKIADGTLGQAALTKRVVECLTV 133 (634) Q Consensus 79 rL~aseiD~Dtg~ptG~i~ed~~~g-------------------------~r~~~iv~~iagG~lGQaqL~kR~~~~LtV 133 (634) -+..-+.+.| |...+..... ..... 
...=...-+...++++.++.+|-+ T Consensus 67 p~~v~~~~~~-----g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-L~~~PN~~~t~~~f~~~~~~~lll 140 (460) T protein:vir:10 67 PYTIKVVKDT-----KAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAF-PLESPNPTQTWADIYSLYKTYMRL 140 (460) T ss_pred ceEEEeccCC-----ccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHH-HHhCCCCCCCHHHHHHHHHHHHhh Confidence 9999999877 4333211000 01111 122255667889999999999999 Q ss_pred ccceEEEEEEecCCCCCCCcccccccchhceeccHHHHhccCCCcc---------eeeEeC-CCCcccccCCCCeEEEee Q lcl|NC_011057. 134 PGELWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSG---------TNIVLP-TGEEHEFVKGTDIIFRVW 203 (634) Q Consensus 134 pGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~---------~~i~lP-~g~~h~~~~~~D~~~RvW 203 (634) -|++|+.+ .|+..+ ...|. + ...+.|..+.+......++ ..+..+ +|..+.|....=+-||.+ T Consensus 141 ~Gnay~~i-~r~~~~---~~~G~--~-~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~~ 213 (460) T protein:vir:10 141 NGNCYFYL-MSPDDG---INAGV--P-SQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKYA 213 (460) T ss_pred cCCeEEEE-EecCCC---ccCce--e-EEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEecC Confidence 99999765 443321 11222 2 2244444444443222221 223444 344455544333557877 Q ss_pred CCCcc---cccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHH Q lcl|NC_011057. 204 IPKPR---KASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAV 280 (634) Q Consensus 204 ~P~pr---ra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~ 280 (634) +|.-. ....--||+.++...+.- ........ .++..||. .|..+-.+... ....+. 
T Consensus 214 ~~~~~~~~~~~~G~sp~~~~~~~i~~----~~~~~~~~-~~~f~ng~--~~~~i~~~~~~--------------l~~e~~ 272 (460) T protein:vir:10 214 NPNFDLQGSHLYGMSPIRAILRNINS----QNSTIDNN-VKTMQNGG--VFGFIHGGSTG--------------LTQPQA 272 (460) T ss_pred CCCcccccCccccccHHHHHHHHHHH----HHHHHHHH-HHHHhcCC--CcceeeecCCC--------------CCHHHH Confidence 77532 223445777766554443 33333222 23344442 22222111000 122355 Q ss_pred HHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccc-- Q lcl|NC_011057. 281 QQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLG-- 358 (634) Q Consensus 281 ~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlg-- 358 (634) .++.+.+. ..+.-.+. +--|+++. ..-+++.|.....- .--+++|+..+..||+.|.||| .|||+. T Consensus 273 ~~~~~~~~----~~~~g~~n--~g~~~vl~----~g~~~~~l~~~~~d-~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~ 340 (460) T protein:vir:10 273 DSLKQRLT----EMDKSPDR--LSQIAGAS----GEIAFTKISLNTDE-LKPFDYLKYDQKAICNALGWSD-KLLNNNEG 340 (460) T ss_pred HHHHHHHH----HHhcCccc--cCCceecC----CCceEEEccCChhH-HHHHHHHHHHHHHHHHHhCCCH-HHhCCCCC Confidence 66666655 22222111 22334432 33455666554332 2238999999999999999999 588863 Q ss_pred cCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCH Q lcl|NC_011057. 359 SQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAING 438 (634) Q Consensus 359 s~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ 438 (634) ++.|+.++.+....-++..|.|.+..|+++|++.+|.+ +-....|.|+||.+.|..--.......+++++|++|. 
T Consensus 341 ~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~-----~~~~~~~~i~~d~~~l~~l~~d~~~~~~~~~~g~~T~ 415 (460) T protein:vir:10 341 GGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKR-----FKGYENAVIEWDISELPEMQTDMVAMASWLNTIPVTP 415 (460) T ss_pred CCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc-----ccccCCceEEeecchhhhHHHHHHHHHHHHhCCCCCH Confidence 34688899999999999999999999999999998854 2245678899999998532122333356899999999 Q ss_pred HHHHHHhCCCc--cccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCC Q lcl|NC_011057. 439 EALRKYLGLGD--DAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDG 516 (634) Q Consensus 439 ealr~~~Gl~e--d~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~ 516 (634) +++|+++|++. +.+-| --++| + -++.++ ...+ T Consensus 416 NE~R~~~g~~pi~~~~gD------------------~~~~~----~---n~~~~~---------~~~~------------ 449 (460) T protein:vir:10 416 NEIRIAMKYETLNQDGMD------------------IVFMP----S---NKVRID---------DVSN------------ 449 (460) T ss_pred HHHHHHhCCCCCCCCCCC------------------eeeec----c---cccchh---------hccc------------ Confidence 99999999973 12222 00000 0 000000 0000 Q ss_pred CCCCCCCCCCCC Q lcl|NC_011057. 517 EHEPDTEDDQDD 528 (634) Q Consensus 517 ~~ePDTe~d~~~ 528 (634) +..|...++.+ T Consensus 450 -~~~~~~~nq~~ 460 (460) T protein:vir:10 450 -NLIDSAFNQNQ 460 (460) T ss_pred -ccCCCcccCCC Confidence 00000001111 No 10 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=99.75 E-value=4e-17 Score=110.46 Aligned_cols=507 Identities=12% Similarity=0.063 Sum_probs=252.2 Q ss_pred CCCCCcceeEe-----------ccCCC--Cccchhhhhhhhc------cCCchhhhhhhhcccCccccccHHHHHHHhhh Q lcl|NC_011057. 1 MAATQSLRLVR-----------RPKGG--RPAPSRALTAASQ------PLPDPSQVFSKSTGISRNSDWQTDAWEAVDLV 61 (634) Q Consensus 1 ~~a~~~lr~vr-----------rp~g~--~~a~~ral~aAs~------~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~V 61 (634) ..---++-|.| -|+.. 
+|.-.+.+..... .+.+|..-+..+ .+++. .-.. T Consensus 56 ~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~es~s~vtsls~pdaf~~vn--Vs~~~---------Alkn 124 (945) T protein:vir:10 56 STVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYSPESLMYLPSISDPDAFFLIN--LFRKY---------RFNN 124 (945) T ss_pred ceeeeeeeeehhhhHHHhhcccccccccccchhhhhhhccCccceecccccCccceeeeh--hhhhh---------hhcc Confidence 00000111111 12221 1111222222111 112222211111 01110 0112 Q ss_pred hhHHHHHhhhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcch----HHHHHHHHHHhhccccce Q lcl|NC_011057. 62 GELRYYVGWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLG----QAALTKRVVECLTVPGEL 137 (634) Q Consensus 62 gELryyvgWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lG----QaqL~kR~~~~LtVpGE~ 137 (634) .-+.-.+.-+++++|.+.+-.-+-+.| |.-.+...+. .....+..+.+. ..--+. -..+++.++.++-+-|.. T Consensus 125 saV~scI~~IA~sIAsLPlklYrr~ed-G~~~~~~kk~-~~~hpL~~LL~r-PNp~mT~~eFwqsFl~~Lv~dLLL~GNA 201 (945) T protein:vir:10 125 DSKLIKVSEIPKKLTSKELEIYKHIED-KHVNYYLKRI-RDARNILEFLER-PDPYFSEVNSWEYLLGMVLDDILTIDRG 201 (945) T ss_pred HHHHHHHHHHHhhhccCceEEEEeccc-Cccccccccc-ccchHHHHHHhC-CCcccChhHHHHHHHHHHHHHHhhcCCe Confidence 334556777999999999887776656 3222222221 222334444432 222233 335899999999999999 Q ss_pred EEEEEEecCCCCCCCcccccccchhceeccHHHHhccCCCcc-ee--eEeC-CCCcccccCCCCeEEEeeCCCccccc-- Q lcl|NC_011057. 138 WIVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSG-TN--IVLP-TGEEHEFVKGTDIIFRVWIPKPRKAS-- 211 (634) Q Consensus 138 wi~il~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~-~~--i~lP-~g~~h~~~~~~D~~~RvW~P~prra~-- 211 (634) |+.+. |... |. .-.++.+....+...-..+| .. +... +|....-....|.++++.+|++..-. 
T Consensus 202 YieIi-Rd~~-------G~---ii~L~pLdPs~Vti~~ddDG~~~y~Yv~~idG~~~~~v~a~DvIlhirn~s~DG~~~G 270 (945) T protein:vir:10 202 AIVKI-RDEQ-------GN---LVAITPVDGTTIKPILSEDTGIVVGYVQEVDGAIVAHFDKRDVVLFRQNLTPDVYMYG 270 (945) T ss_pred EEEEE-ECCC-------Cc---EEEEEEECCcceEEEEcCCCcEEEEEEEecCCceEEEecCCceEEEeccCCCCccccc Confidence 99874 5332 32 23577777766643322222 11 2222 44444444466778788888876433 Q ss_pred CCccchhhhhHHHHHHHhhhHHHHHHH-HhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHH Q lcl|NC_011057. 212 EPDSPVRAVLDSIREIVRTTKTIANAS-KSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQV 290 (634) Q Consensus 212 eaDSPvra~l~~LrEI~rttk~I~na~-~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qv 290 (634) .--||+.++...+.--....+.-.+.. ++..+-.|||-++.+....... + ......+.+++.+.+- T Consensus 271 yGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~-~----------~~LseEq~erlKe~we-- 337 (945) T protein:vir:10 271 YSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDI-Y----------PQLSREQLESIQRQLQ-- 337 (945) T ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCcccccccc-c----------cccCHHHHHHHHHHHH-- Confidence 345888888776665555544444332 4555667888888776543210 0 0112224444444443 Q ss_pred HhhcccCccccccccceeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhh Q lcl|NC_011057. 291 AETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQIS 370 (634) Q Consensus 291 a~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~ 370 (634) .++.-. .+-.|+|+ + ..-+++.|.+... +.--+++|+..+..||..|.||| .+||.+++.|..++.+.. 
T Consensus 338 --e~~sG~---NnG~piVL--d--eGmef~pLs~s~~-DaQfLEsrkfs~eeIArAFGVPP-~lLG~~e~st~SNiEqq~ 406 (945) T protein:vir:10 338 --AIMMGD---YTQVPILS--G--GKFTWIDFKGKRR-DMQFKELAEFVARKICAVYQVSP-QDVGILEGSNKATAEVMA 406 (945) T ss_pred --HHhCCc---ccccceec--C--CCceEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCH-HHcccCCCCCcchHHHHH Confidence 222211 12246654 2 3345666665433 23347999999999999999999 777998899999999999 Q ss_pred hhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCcc Q lcl|NC_011057. 371 DEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDD 450 (634) Q Consensus 371 de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed 450 (634) ..-++-.|.|.+..|+++|++.++... +...|-|.||...+...-++.+....+++.|++|.+++|++.|+..- T Consensus 407 ~~Fv~~tL~Pil~~IEqeLNrkLl~~~------eg~~i~fdFd~ldl~D~ksraEal~kli~sGiLTiNEvRe~lGLpPI 480 (945) T protein:vir:10 407 SLTKAKGLEPLMATISKGFDEVVSEFR------NEKDIKLWFKEDDLEKERDWWNIIQGQLNTGFRSINEARMEKGLEPV 480 (945) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccc------cCceeEEEecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 999999999999999999998664321 12457888888887533233333355789999999999999999653 Q ss_pred ccCCCCCHHHHHHHHHHHhhcCcccch--hhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCC Q lcl|NC_011057. 451 AGYDFTTREGWVMWAQDAVSKDPTLIP--MLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDD 528 (634) Q Consensus 451 ~~yd~~t~Eg~r~wA~d~v~~dp~Li~--~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~ 528 (634) .|-|. -+++ .+.|.-+...+...-.+|+.+. 
...+++..+++.+ |++++ T Consensus 481 eGGD~------------------lli~~nn~~P~d~~~ka~~ga~p~q~aq-------~~~dqp~~kGGe~-dEns~--- 531 (945) T protein:vir:10 481 PWGDV------------------PFSGLRNWKPEDEQAKAQQGAMPPQLAQ-------AMADQPSQQGGGV-DENSS--- 531 (945) T ss_pred CCcce------------------eeeccccccccccccccccCCCCccccc-------CCCCCCCCCCCCC-CCCCC--- Confidence 33220 0111 0112111111111101011111 1111111222211 11111 Q ss_pred cccCCCccHHHHHHHHHHHHHHHhhHHhhcCChhHHHHhhCCChHHhhhhcCCCChhHHHHHHhcccccccHHHHHHhCC Q lcl|NC_011057. 529 DGTQKAGLESGIVDLMVDRALELVGKRRRGRDRETLARLSGVRERDYHRYMDPVPESEVDRLMSGWDSALDDKILLRLGL 608 (634) Q Consensus 529 ~~~~~a~~~~a~vdllv~rALelAGkR~Rt~~R~~~arlr~ip~h~~h~~~~Pv~~~~v~rLi~GWd~~ld~~~~a~~g~ 608 (634) ....+-+....+...+..+|=+.|-.+++. ||. .-|. T Consensus 532 ~psE~kda~~e~~~~l~~~~~~~a~e~i~~-------------------------------~~e------------~~~~ 568 (945) T protein:vir:10 532 VPSEQKNAGLEVLRNLFKSLDANASENLKQ-------------------------------VIE------------LTND 568 (945) T ss_pred CCCcccchHHHHHHHHHHHHHHHHHHHHHH-------------------------------HHh------------hcCC Confidence 111111122233444555554444444422 110 1111 Q ss_pred CH-HHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011057. 609 DP-GTIRSAVRRKVMAELTRPVIDVVA 634 (634) Q Consensus 609 Dp-~~lr~~v~~~v~~~lt~~vvd~~~ 634 (634) |. .+.+..+.+.|+-.=-.+|+..|- T Consensus 569 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 595 (945) T protein:vir:10 569 DNYLKEKELLTRVLKSVGLDSVSEFIE 595 (945) T ss_pred CchhHHHHHHHHHHHHhhhHHHHHHHh Confidence 11 112222222222222224444333 No 11 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=99.75 E-value=7.7e-19 Score=119.89 Aligned_cols=438 Identities=13% Similarity=0.096 Sum_probs=225.1 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 
1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.==. |+.+|.+... ....-.-.-.++ ||...+-..... .+...-.+ ..+ .++-+.-.|.-+++++|.+-| T Consensus 1 Mg~~~--~l~~r~~~~~--~~~~~~~~~~~~-~~~~~~~~~~~~-~g~~V~~~--~al-~~~~V~~~v~~Ia~~iA~lp~ 71 (457) T protein:vir:13 1 MGFWS--ALFGRGHSPA--LDGIEARAWEPY-DPSIYNLGAVAA-SGETVTPH--DAL-QVSAVFASVRLLSETIATLPL 71 (457) T ss_pred Cchhh--hhhccccccc--cccccccccccc-chHHHhhccccc-CCceechH--Hhh-ccHHHHHHHHHHHHhhccCce Confidence 55433 2333332211 000000011122 222111000000 00000000 011 234455567788999999988 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+-+.+ +..+ . ....+..+.+.=..+ +...++++.++.+|-+-|+.|+.| .|..+ . + T Consensus 72 ~~~~~~~~------~~~~-~-~~~~l~~~ln~~~n~-~t~~~f~~~~~~~lll~Gna~~~i-~~~~g-~---------~- 130 (457) T protein:vir:13 72 STYSKRGG------SRKE-I-VTPEWLDYPNAEPGG-MGRIDILSQTVLSLLLQGNAFLAV-RWQGP-N---------I- 130 (457) T ss_pred EEEEecCC------cccc-c-ccchHHHhccccCCC-CCHHHHHHHHHHHHhhcCCeEEEE-EecCC-c---------E- Confidence 77665433 1111 1 112344444444443 677899999999999999999887 44221 1 1 Q ss_pred hhceeccHHHHhcc--CCCc-----ceeeEeCC-CCcccccC-CCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhh Q lcl|NC_011057. 161 QEWYAVSKEEIKKS--NKGS-----GTNIVLPT-GEEHEFVK-GTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTT 231 (634) Q Consensus 161 ~~W~~vt~~Ei~~~--~~~~-----~~~i~lP~-g~~h~~~~-~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rtt 231 (634) ...+.|....+... ..++ ...+.... |....... ..+=+|++=.+++.-...--||+..+...+.=..-.. 
T Consensus 131 ~~l~~l~p~~v~v~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~ 210 (457) T protein:vir:13 131 VGLDVLDPTKIHVHMVMVDGLRRKVFEAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQ 210 (457) T ss_pred EEEEEEccCceEEEEecCCCccceeEEEEEEecCCceeeEEeeCccceEEecCCCCCCccccccHHHHHHHHHHHHHHHH Confidence 22344444333211 1111 01233332 33222211 2334455655555544566778777666665555555 Q ss_pred HHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_011057. 232 KTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGV 311 (634) Q Consensus 232 k~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~v 311 (634) +...+..+.-.+-.|||-+|+.++- -+.+++++.+. ..+.-.+.+- =++|+ T Consensus 211 ~~~~~~f~ng~~p~gil~~~~~ls~---------------------e~~~~~~~~~~----~~~~g~~nag--~~~vl-- 261 (457) T protein:vir:13 211 KYGSKFFANGAMPGAVVEVPGTMSE---------------------EGLARAREAWR----AANSGVDNAH--RVALL-- 261 (457) T ss_pred HHHHHHHhcCCCcceEEEcCCCCCH---------------------HHHHHHHHHHH----HHhcCccccC--cceec-- Confidence 5555555555555688887765431 14555666554 2222222211 12232 Q ss_pred chHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhh--HhhhhhhhhhHHHHhHHHHHHHHH Q lcl|NC_011057. 312 PGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWS--AWQISDEDVQLHIAPVMEIFCQAL 389 (634) Q Consensus 312 P~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~Nhwt--Aw~i~de~v~~hI~P~~~~i~~ai 389 (634) + ..-+++-|.+... 
+.--+++|+..+..||..|.||| .|||...+++.|+ ..+....=++..|.|.++.|+++| T Consensus 262 ~--~g~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~l 337 (457) T protein:vir:13 262 T--EGAKFSKVAMSPD-EAQFLQTRQFQVPEIARIFGVPP-HLISDATNSTSWGSGLAEQNIAFTMFSLRPWLERIEAGF 337 (457) T ss_pred C--CCceEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCH-HHcCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 3345555554322 22347899999999999999999 6779877777754 467777777888999999999999 Q ss_pred HHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHH Q lcl|NC_011057. 390 TDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQ 466 (634) Q Consensus 390 t~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~ 466 (634) ++.+|.+.- ...|.|+||.+.|.. .|..+.+ ..++..|++|.+++|++.|+.--.+- .+-. T Consensus 338 n~~L~~~~~------~~~~~i~fd~~~l~~-~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g-----~~d~---- 401 (457) T protein:vir:13 338 NRLLFAETA------DRFRFVKFNLDEIKR-GAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDG-----LGEK---- 401 (457) T ss_pred HHhhcCccc------cCceeEEeechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC-----cccc---- Confidence 998886521 134678999999843 3443333 35888999999999999999532211 0000 Q ss_pred HHhhcCcccch-hhhhhhhh-hhhcccCCCCCCCCCCCCC---CCCccccCCCCCCCCCCCCCCC Q lcl|NC_011057. 467 DAVSKDPTLIP-MLAPLIAG-VLQQIEFPQQQQAIDSGGN---EDTSDDDNLDDGEHEPDTEDDQ 526 (634) Q Consensus 467 d~v~~dp~Li~-~laPll~p-~~q~~~~P~p~~a~~~~~~---~~~~~d~~~~~~~~ePDTe~d~ 526 (634) -+.| .+.|+.+. ..+. 
-+.|.+..++.++ +.+.++.+.++++.+.|.|+|+ T Consensus 402 -------~~~~~n~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 402 -------YRVPLNLGEVGEEPEPEP--APAPPAIEPPAEEPDEEPEPEGKPDDEGATEEDDEDDA 457 (457) T ss_pred -------eeeccccccccccccccc--cCCCCCCCCCccccCCCCCCCCCCccccCCCCcccccC Confidence 0011 11122110 0111 1112222222111 1122333334444566666666 No 12 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=99.74 E-value=2.7e-17 Score=111.41 Aligned_cols=467 Identities=15% Similarity=0.113 Sum_probs=238.5 Q ss_pred hhhhhhccCCchhhhhhhhcccC-------------ccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeEEEeeecccC Q lcl|NC_011057. 23 ALTAASQPLPDPSQVFSKSTGIS-------------RNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRLVASELDENT 89 (634) Q Consensus 23 al~aAs~~itdp~~~~~~~~~~~-------------~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~Dt 89 (634) -|.|--+.++-|....+.+.... +..+|... .|-..+-+.--|.-+++++|.+.|..-+.+.| T Consensus 1 ~~~~~~~~~~~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~a~~~~~V~acV~~IA~~iA~lpl~l~~~~~~- 76 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGG---IYKNQPWVRTVIAKRAQALARLPVKCMFTSGD- 76 (518) T ss_pred CcccCceeecCchhhhhhhhhhcccccccccceecccccchhhH---HHhhhHHHHHHHHHHHHhhccCceEEEEEcCC- Confidence 44555555555543222221111 11122221 13334556666788999999998888788876 Q ss_pred CCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhceeccHH Q lcl|NC_011057. 90 GLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKE 169 (634) Q Consensus 90 g~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~vt~~ 169 (634) |..+.. ...+..+.. =..--+...++++.++.+|.+-|++|+.+ .|.. +|. ...++.+... 
T Consensus 77 ----~~~~~~---~~~~~~Ll~-~PN~~~t~~~F~~~lv~~lll~Gnay~~i-~r~~-------~G~---~~~L~~l~p~ 137 (518) T protein:vir:10 77 ----TETEES---DTGYAKLLA-DPCEYLDPFAFWEWVASTLDIYGETYLAI-QKNK-------SGT---PEKLMPMHPS 137 (518) T ss_pred ----Cceecc---chHHHHHHc-CCCCCCCHHHHHHHHHHHHhhcCCeEEEE-EECC-------CCc---EEEEEEECCC Confidence 333322 233444443 46777888999999999999999999886 4433 232 2356777666 Q ss_pred HHhccCC--CcceeeEeC--CCCc---ccccCCCCe-EEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhH Q lcl|NC_011057. 170 EIKKSNK--GSGTNIVLP--TGEE---HEFVKGTDI-IFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSR 241 (634) Q Consensus 170 Ei~~~~~--~~~~~i~lP--~g~~---h~~~~~~D~-~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SR 241 (634) .+..... .....+... +|.. .+| +..|+ -||..+|+ .-..--||+.++...+.-..-+.+...+..+.= T Consensus 138 ~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~-~~~eViHir~~s~d--g~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng 214 (518) T protein:vir:10 138 RVAIKRNSRTGRYEYYFQAGAGVGTQLVSF-ADDEVVPIRFFNPD--GLERGLSLMESLKSTIFSEDSSRNATAAMWKNA 214 (518) T ss_pred ceEEEEcCCCCEEEEEEEecCCccceEEEe-cCCcEEEecCCCCC--cccccccHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 6643222 222333332 2221 222 23343 34444433 222344776666555444444433333333332 Q ss_pred hhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccce Q lcl|NC_011057. 242 LIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKH 321 (634) Q Consensus 242 L~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikH 321 (634) ..-.|||-+|+.++ ....+++++.+- ..+.-.+. +--++|+. 
..-+++- T Consensus 215 ~~p~gil~~~~~ls---------------------~e~~~~~k~~~~----~~~~G~~n--ag~v~vL~----~G~~~~~ 263 (518) T protein:vir:10 215 GRPNLVLRHEKRLS---------------------EAAQQRLREQFD----RAHSGSSN--TGKTMVVE----EGMEPIP 263 (518) T ss_pred CCccEEEecCCCCC---------------------HHHHHHHHHHHH----HHhcCccc--cCcceEcC----CCceEEE Confidence 33345666655432 124445555443 22221111 11233332 2344555 Q ss_pred eecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011057. 322 IRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLARE 401 (634) Q Consensus 322 l~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~e 401 (634) |.+... +.--+++|+..+..||..|-||| .+||+..++|+.++.+....-++..|.|.+..|+++|++.++.. ++ T Consensus 264 l~~s~~-D~q~le~r~~~~~eIa~afgVPp-~~lg~~~~~t~sn~eq~~~~f~~~tL~P~l~~ie~~ln~~L~~~-~~-- 338 (518) T protein:vir:10 264 LQLTAV-EMQFIEARQLNREEVCGVYDIAP-PIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQY-WV-- 338 (518) T ss_pred ccCChh-HHHHHHHHHHHHHHHHHHhCCCH-HHhccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-cc-- Confidence 554332 22348999999999999999999 77798778899999999999899999999999999999887654 21 Q ss_pred CCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccc--cCCCCCHHHHHHHHHHHhhcCcccc Q lcl|NC_011057. 402 GIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDA--GYDFTTREGWVMWAQDAVSKDPTLI 476 (634) Q Consensus 402 G~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~--~yd~~t~Eg~r~wA~d~v~~dp~Li 476 (634) ..|-|+||.+.|.. .|..+.+ ..++..|++|.+++|+++||.--. +-| ..--+. 
T Consensus 339 ----~~~~~~fd~~~llr-~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD-------------~~~~~~--- 397 (518) T protein:vir:10 339 ----RKNRMKFDIDDVIQ-PDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD-------------ELYANS--- 397 (518) T ss_pred ----CCceEEEechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCC-------------eeeecc--- Confidence 35778999999843 3544443 458888999999999999996321 111 000011 Q ss_pred hhhhhhhh---hhhhcccCCCCCCCCCCCCCC--CCccccCCCCCCCCCCCCCCCCC---c-----ccCCCccH-----H Q lcl|NC_011057. 477 PMLAPLIA---GVLQQIEFPQQQQAIDSGGNE--DTSDDDNLDDGEHEPDTEDDQDD---D-----GTQKAGLE-----S 538 (634) Q Consensus 477 ~~laPll~---p~~q~~~~P~p~~a~~~~~~~--~~~~d~~~~~~~~ePDTe~d~~~---~-----~~~~a~~~-----~ 538 (634) .+.||-. ...+.-+.|.|+ .++... +-+++++....+.++++.+.... . ...++..+ + T Consensus 398 -n~~pl~~~~~~~~~g~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 473 (518) T protein:vir:10 398 -ALQPLGATPDGAVEGEEAPAPK---RPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHL 473 (518) T ss_pred -cceecccccccccCCCCCCCCC---CCCccccccccccccccCCCCCcccccccccccccchhccccCCCcccccchHH Confidence 1223322 112222223222 121111 11122222333344444321111 0 11222222 1 Q ss_pred HHHHH-------HHHHHHHHhhHHhhcCChhHHHHhhCCChHHhhhhcCCCChhHHHHHHh Q lcl|NC_011057. 539 GIVDL-------MVDRALELVGKRRRGRDRETLARLSGVRERDYHRYMDPVPESEVDRLMS 592 (634) Q Consensus 539 a~vdl-------lv~rALelAGkR~Rt~~R~~~arlr~ip~h~~h~~~~Pv~~~~v~rLi~ 592 (634) .+|.. 
+=.-||.||-|- +-+.-+-|-.|- -.-+.|--+ T Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~------------~~~~~~~~~ 518 (518) T protein:vir:10 474 RAVKGAMGRGKDIKGFALQLAEKY----PDDLEDILLAVQ------------LALAERKDN 518 (518) T ss_pred HHHHHHhhcCccchhHhhhhhhhc----chhHHHHHHHHH------------HhhhhccCC Confidence 22322 223345555441 111111111110 000001000 No 13 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=99.73 E-value=3.9e-18 Score=116.03 Aligned_cols=408 Identities=14% Similarity=0.121 Sum_probs=227.1 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.==. |+.+|...+.......+.-.. .+......+..=.++.+-.++-+.-.|.-+++++|++.+ T Consensus 1 Mg~f~--~lf~r~~~~~~~~~~~~~~~~-------------~~~~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~p~ 65 (414) T protein:vir:44 1 MVFFS--GLFQRKSDAPVTTPAELADAI-------------GLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPC 65 (414) T ss_pred Cchhh--hhhccCccCcccchhhHhHhh-------------ccCccccCCceechhhhhccHHHHHHHHHHHHHhccCce Confidence 43222 455654443322222211110 000000001000122233445566667778999999999 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+.+.+ |. .+ .....+..+...-...-+-..++++.++.+|-+-|++|+.+ .|.. |. 
+ T Consensus 66 ~~~~~~~~-----~~-~~--~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i-~~~~--------g~--~- 125 (414) T protein:vir:44 66 NLYHLNGS-----LK-QR--ATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYK-VKAF--------GE--V- 125 (414) T ss_pred EEEEecCC-----ce-ee--cccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEE-EeCC--------Cc--E- Confidence 88777754 21 11 11234555556567778899999999999999999999876 3432 21 1 Q ss_pred hhceeccHHHHhccCCCcc---eeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSG---TNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANA 237 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~---~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na 237 (634) .+.+.|....+.....+++ -.+..++|....|..+.=+-||..+.+ -..--||+..+...+.-..-..+...+. T Consensus 126 ~~L~~l~~~~v~~~~~~~~~~~y~~~~~~g~~~~~~~~evih~~~~~~d---~~~G~s~i~~~~~~i~~~~~~~~~~~~~ 202 (414) T protein:vir:44 126 AELLPVDPGCVVPKLNSSWEPVYQVTFPDGSTDVLSQEDIWHVRTLTLD---GLVGLNPIAYAREAISLAAATEEHGARL 202 (414) T ss_pred EEEEEEcCceEEEEECCCCcEEEEEEecCceEEEEccccEEEecCCCCC---CcccccHHHHHHHHHHHHHHHHHHHHHH Confidence 2344555444332222222 235566777776654321334433322 2456677777766665555555555555 Q ss_pred HHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhc Q lcl|NC_011057. 238 SKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIK 317 (634) Q Consensus 238 ~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~ 317 (634) .+.-....|||-+|+.++- -..+++++.+. ..+...+. +--|+|+. 
..- T Consensus 203 f~ng~~p~gil~~~~~l~~---------------------e~~~~~~~~~~----~~~~g~~n--~~~~~vl~----~g~ 251 (414) T protein:vir:44 203 FSNGAVTSGVLRTEQTLSD---------------------QAYERLKKDFE----ERHTGLGN--AHRPMILE----MGL 251 (414) T ss_pred HhccCCCceEEEeCCCCCH---------------------HHHHHHHHHHH----HHhcCccc--cCcceecC----CCc Confidence 5555566788877654331 13455555554 33332221 22244432 233 Q ss_pred ccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 318 DVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVT 397 (634) Q Consensus 318 ~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~ 397 (634) +++.|.+.. .+.--+++|+..+..||..|.||| .+||.+.++|+.++.+....-++.-|.|.++.|+++|++.+|.+. T Consensus 252 ~~~~l~~~~-~d~~~~e~~~~~~~~Ia~~fgVpp-~~l~~~~~~t~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~ 329 (414) T protein:vir:44 252 DWKSMALNA-EDSQFLETRKFQLEEICRLFRVPL-HMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGLVRKS 329 (414) T ss_pred eEEEccCCh-HHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc Confidence 556665432 223347889999999999999999 777877788999999999999999999999999999999887652 Q ss_pred HHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcc Q lcl|NC_011057. 398 LAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPT 474 (634) Q Consensus 398 L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~ 474 (634) - . ..|.|.||.+.|..- |..+.+ ..++..|++|.+++|+++|+..-.|-| .. 
.+..|.+ T Consensus 330 ~----~--~~~~i~fd~~~ll~~-d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD-----~~------~~~~n~~ 391 (414) T protein:vir:44 330 K----Q--GVFYAKFNAGALLRG-DMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGD-----VY------LTPMNMT 391 (414) T ss_pred c----c--CceEEEEechhhhcc-CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-----ee------ccccccc Confidence 1 1 256789999998542 333332 348899999999999999996322111 00 0000100 Q ss_pred cchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCC Q lcl|NC_011057. 475 LIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQ 526 (634) Q Consensus 475 Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~ 526 (634) .. |.... ....+.++..+|+ +++ T Consensus 392 ~~----------------~~~~~-~~~~~~~~~~~d~------------~~~ 414 (414) T protein:vir:44 392 TK----------------PSDGS-KAGKQKDNANADE------------TTS 414 (414) T ss_pred cc----------------CCccc-cCCCCCCCCCCCC------------CCC Confidence 00 00000 0000001111111 111 No 14 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=99.72 E-value=2.3e-17 Score=111.77 Aligned_cols=470 Identities=15% Similarity=0.109 Sum_probs=236.1 Q ss_pred hhhhhhccCCchhhhhh---------hh--cc--cCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeEEEeeecccC Q lcl|NC_011057. 23 ALTAASQPLPDPSQVFS---------KS--TG--ISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRLVASELDENT 89 (634) Q Consensus 23 al~aAs~~itdp~~~~~---------~~--~~--~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~Dt 89 (634) -|.|--+-++-|....+ .+ ++ .++..+|... 
.|-..+-+.-.|.-+++++|.+.|..=+-+.+ T Consensus 1 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~---~~~~~~~V~acV~~IA~~iA~lp~~l~~~~~~- 76 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGG---IYKNQPWVRTVIAKRAQALARLPVKCMFTSGD- 76 (518) T ss_pred CcccCceeeccchhhhhhhhhhhcccccceeceecccccchhhH---HhhhhHHHHHHHHHHHHhhccCceEEEEEcCC- Confidence 22222222222221100 00 00 1112222221 23345666777888999999998888777766 Q ss_pred CCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhceeccHH Q lcl|NC_011057. 90 GLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKE 169 (634) Q Consensus 90 g~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~vt~~ 169 (634) |..+.++ ..+..+.. =..--+...++++.++.+|.+-|++|+.+ .|..+ |. ...++.|... T Consensus 77 ----~~~~~~~---~~~~~Ll~-~PN~~~t~~~F~~~lv~~lll~Gnay~~i-~r~~~-------G~---~~~L~~l~p~ 137 (518) T protein:vir:78 77 ----TETEEHD---TGYAKLLA-DPCEYLDPFAFWEWVASTLDIYGETYLAI-QKNKS-------GT---PEKLMPMHPS 137 (518) T ss_pred ----ccccccc---hHHHHHHh-CCCCCCCHHHHHHHHHHHHhhcCCeEEEE-EEcCC-------Cc---EEEEEEECCC Confidence 3333322 23444433 36677888999999999999999999986 45332 32 2346676666 Q ss_pred HHhccCC--CcceeeE--eCCCC--cccccCCCC-eEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHh Q lcl|NC_011057. 170 EIKKSNK--GSGTNIV--LPTGE--EHEFVKGTD-IIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRL 242 (634) Q Consensus 170 Ei~~~~~--~~~~~i~--lP~g~--~h~~~~~~D-~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL 242 (634) .+..... +....+. ..++. +..-.+..| +-||..+|+. -..--||+.++...+.-..-+.+...+..+.-. 
T Consensus 138 ~Vtv~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eIiHir~~~~dg--~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~ 215 (518) T protein:vir:78 138 RVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDG--LERGLSLMESLKSTIFSEDSSRNATAAMWKNAG 215 (518) T ss_pred ceEEEEcCCCCEEEEEEEecCCccceeEEecCCcEEEecCCCCCc--ccccccHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 5543222 2222233 23322 222122344 3355444432 223356766655554444444433333333333 Q ss_pred hhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccccee Q lcl|NC_011057. 243 IGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHI 322 (634) Q Consensus 243 ~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl 322 (634) .-.|||-+|+.++ ....+++.+.+- ..+.-.+. +-=++|+. ..-+++-| T Consensus 216 ~p~gvl~~~~~ls---------------------~e~~~~~k~~~~----~~~~G~~n--ag~~~vL~----~G~~~~~l 264 (518) T protein:vir:78 216 RPNLVLRHEKRLS---------------------PEAQQRLREQFD----RAHAGSSN--TGKTMVVE----EGMEPIPL 264 (518) T ss_pred CccEEEecCCCCC---------------------HHHHHHHHHHHH----HHhcCccc--CCceeEcC----CCceEEec Confidence 3345666665433 113445555554 22221111 12233332 23455555 Q ss_pred ecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011057. 323 RFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREG 402 (634) Q Consensus 323 ~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG 402 (634) .+... +.--+++|+..+..||..|-||| .+||+..++|+.++.+....-++..|.|.+..|.++|++.++.. 
++ T Consensus 265 ~~~~~-d~q~le~r~~~~~eIa~afgVPp-~~lg~~~~st~sn~e~~~~~f~~~tL~P~~~~ie~eln~~L~~~-~~--- 338 (518) T protein:vir:78 265 QLTAV-EMQFIEARQLNREEVCGVYDIAP-PIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQY-WV--- 338 (518) T ss_pred cCChh-HHHHHHHHHHHHHHHHHHhCCCH-HHhccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-cc--- Confidence 55332 33348999999999999999999 67798888999999999988899999999999999999877643 21 Q ss_pred CChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccc--cCCCCCHHHHHHHHHHHhhcCcccch Q lcl|NC_011057. 403 IDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDA--GYDFTTREGWVMWAQDAVSKDPTLIP 477 (634) Q Consensus 403 ~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~--~yd~~t~Eg~r~wA~d~v~~dp~Li~ 477 (634) ..|-|.||.+.|.. +|..+.+ ..+++.|.+|.+++|++.||.--. +-| +-+ +.+ T Consensus 339 ---~~~~~~fd~~~Llr-~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD----~~~-------v~~------ 397 (518) T protein:vir:78 339 ---RKNRMKFDIDDVIQ-PDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD----ELY-------ANS------ 397 (518) T ss_pred ---CcceEEeechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCc----eee-------ecc------ Confidence 35678999999843 4554444 458889999999999999996321 111 000 111 Q ss_pred hhhhhhh---hhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCC---Cc-----ccCCCccHH-----HHH Q lcl|NC_011057. 478 MLAPLIA---GVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQD---DD-----GTQKAGLES-----GIV 541 (634) Q Consensus 478 ~laPll~---p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~---~~-----~~~~a~~~~-----a~v 541 (634) .+.|+-. ...+.-+.|.|+..... ...+-+++++....+.++++.++.. .. 
.+.++..++ .+| T Consensus 398 n~~pl~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (518) T protein:vir:78 398 ALQPLGATPDGAVEGEEAPAPKRPAST-PVASLDQSPPASVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAV 476 (518) T ss_pred cceecccccccccCCCCCCCCCCCCcc-cccccccCccccCCCCCcccccccccccccchhcccCCCCcccccchHHHHH Confidence 1223321 11222233332211111 1111122222233344454432111 10 122222221 223 Q ss_pred HH-------HHHHHHHHhhHHhhcCChhHHHHhhCCChHHhhhhcCCCChhHHHHHHh Q lcl|NC_011057. 542 DL-------MVDRALELVGKRRRGRDRETLARLSGVRERDYHRYMDPVPESEVDRLMS 592 (634) Q Consensus 542 dl-------lv~rALelAGkR~Rt~~R~~~arlr~ip~h~~h~~~~Pv~~~~v~rLi~ 592 (634) .. +=.-||.||-|- +-+.-+-|-.|- -.-+.|--+ T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~------------~~~~~~~~~ 518 (518) T protein:vir:78 477 KGAMGRGKDIKGFALQLAEKY----PDDLEDILLAVQ------------LALAERKDN 518 (518) T ss_pred HHHhhcCCcchhhhhhhhhhc----chhHHHHHHHHH------------HhhhhccCC Confidence 22 223345555441 111111111110 000001000 No 15 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=99.71 E-value=1.5e-17 Score=112.87 Aligned_cols=412 Identities=16% Similarity=0.144 Sum_probs=217.5 Q ss_pred ceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeEEEeeec Q lcl|NC_011057. 7 LRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRLVASELD 86 (634) Q Consensus 7 lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD 86 (634) .++.|..-.. . 
-+.-...+++++.......+....+ .+ .++-+.-.|.-+++.++++.|..-+-+ T Consensus 1 m~~~~~~~~~----~--~~~~~~~~~~~~~~~~~~g~~~~~~--------Al-~~~~V~~cv~~ia~~iA~lp~~~~~~~ 65 (417) T protein:vir:38 1 MKLFRGLATE----V--DPHWADHLLDSGVIPSFRGGYLGIS--------AL-RNSDVLTAVSIVSGDVSRFPLVITDSS 65 (417) T ss_pred CccccccccC----C--CccchhhhcccccccccCCceechh--------hc-ccHHHHHHHHHHHHhhccCeeEEEEcC Confidence 3333321110 0 0111222334444443322211111 22 234455567778999999999886655 Q ss_pred ccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhceec Q lcl|NC_011057. 87 ENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYAV 166 (634) Q Consensus 87 ~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~v 166 (634) .| |.+.+ ..+..++..-..-.+...++++.++.+|-+-|++|+.|. |.+.++ ....++.+ T Consensus 66 ~~-----~~~~~-----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~-r~~~g~---------~~~~l~~l 125 (417) T protein:vir:38 66 TD-----EVIDL-----ANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIV-RDPITN---------EPAMFEFY 125 (417) T ss_pred Cc-----ceecc-----chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEE-EcCCCC---------EEEEEEEe Confidence 44 33322 335556666678888999999999999999999999874 432211 11234444 Q ss_pred cHHHHhccCC-Cccee--eEeCCCCcccccCCCCeE-EEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHh Q lcl|NC_011057. 167 SKEEIKKSNK-GSGTN--IVLPTGEEHEFVKGTDII-FRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRL 242 (634) Q Consensus 167 t~~Ei~~~~~-~~~~~--i~lP~g~~h~~~~~~D~~-~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL 242 (634) ..+.+..... .+... +..++|......+..|++ ||.-.. .-..--||+.++...+.=-.-..+...+..+.=. 
T Consensus 126 ~p~~v~v~~~~~~~~~y~~~~~~~~~~~~~~~~dviH~r~~~~---d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~ 202 (417) T protein:vir:38 126 APSQTQVDTSDPDNIIYRFTPYNSSMQKVCGFEDVIHWKFFSY---DTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGL 202 (417) T ss_pred CCceEEEEEcCCCeEEEEEEEcCCcEEEEecCcceEEecCCCC---CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 4333321111 12222 344555554444445544 443221 2234557766665554433333333333333333 Q ss_pred hhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccccee Q lcl|NC_011057. 243 IGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHI 322 (634) Q Consensus 243 ~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl 322 (634) ...|||-.|+.++ ....+++++.+-+ .+.-.+ +--|+|+. ...+++.| T Consensus 203 ~p~~il~~~~~l~---------------------~e~~~~~~~~~~~----~~~g~n---~g~~~vl~----~g~~~~~l 250 (417) T protein:vir:38 203 KGSIIKAKESRLS---------------------AEARQKIREDFER----AQAGAD---AGSPIIVD----ATMDYQPL 250 (417) T ss_pred CCcEEEEeCCCCC---------------------HHHHHHHHHHHHH----Hhcccc---cCCceecc----CCceEEEc Confidence 4445555544332 1245556665542 222212 22445542 34567777 Q ss_pred ecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011057. 323 RFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREG 402 (634) Q Consensus 323 ~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG 402 (634) .+..+-.. -+++|+..+..||..|.||| .+|| ...+..++-++...-++.-|.|.++.|+++|++.+|.+.. 
+ T Consensus 251 ~~~~~d~q-~le~~~~~~~~Ia~~fgVPp-~~lg--~~~~~s~~e~~~~~~~~~tl~P~~~~ie~~l~~~Ll~~~~---~ 323 (417) T protein:vir:38 251 EVDTNVLN-LINSNNYSTAQIAKALRVPA-YRLA--QNSPNQSVKQLADDYIRNDLPFYFEPITSEFELKLLDDAQ---R 323 (417) T ss_pred cCCHHHHH-HHHHHHhhHHHHHHHhCCCH-HHhC--CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhcChhh---c Confidence 66554333 37899999999999999999 5667 3556788889999999999999999999999999987632 1 Q ss_pred CChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhh Q lcl|NC_011057. 403 IDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPL 482 (634) Q Consensus 403 ~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPl 482 (634) .+|.|.||.+.|..- + ..+-..+++.|++|.+++|+++|+.--.+-+. |++..... +.|+ T Consensus 324 ---~~~~~~fd~~~l~~~-~-~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~-----------d~~~~~~n----~~~~ 383 (417) T protein:vir:38 324 ---HQYCIGFDTKSVNGL-P-IADVNTAVNGGLWTGNEGRAELGKKPLKDPNM-----------DRIQSTLN----TVFL 383 (417) T ss_pred ---ccceEEechhhhhHH-H-HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC-----------Ceeeeccc----cccc Confidence 357788998877421 1 22235588999999999999999964222221 01000000 1111 Q ss_pred hhhhhhcccCC-CCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCC Q lcl|NC_011057. 483 IAGVLQQIEFP-QQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQD 527 (634) Q Consensus 483 l~p~~q~~~~P-~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~ 527 (634) - ..+-. .+.....-+|+.+. +++.+.++++.-. 
T Consensus 384 d-----~~~~~~~~~~~~~kgg~~~~-------~~~~~~~~~~~~~ 417 (417) T protein:vir:38 384 D-----QKEAYQAEHAAELKGGDTNA-------KGNQNGSGTNANS 417 (417) T ss_pred c-----cccccccccccccCCCCCCC-------CCCCcCCCCcCCC Confidence 1 11110 11111111111111 1111111111111 No 16 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=99.71 E-value=7.5e-17 Score=108.99 Aligned_cols=444 Identities=13% Similarity=0.074 Sum_probs=222.9 Q ss_pred cceeEeccCCCCccchhhhhhh-hccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeEEEee Q lcl|NC_011057. 6 SLRLVRRPKGGRPAPSRALTAA-SQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRLVASE 84 (634) Q Consensus 6 ~lr~vrrp~g~~~a~~ral~aA-s~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~ase 84 (634) -+=..||-|......+ .+..+ ...+ .+... ...+++..++=---.-..+ .++-+.-.|.=++++||.+-+..=+ T Consensus 1 ~~~~~~~~~~~~~~~~-~~~~~~~~~~--~~~~~-~~~~g~~~~g~~v~~~~al-~~~~V~~~v~~Ia~~iA~lp~~~~~ 75 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGR-DVREAGWTSL--FQAVA-EPFAGAWQQGVKADPEAVL-SFHAVFACISLISQDIAKMRLRLMQ 75 (454) T ss_pred CCCccccCcccccccc-cccchhhhhh--hhhhh-hhhcchhhcCcccChHHhh-ccHHHHHHHHHHHHhhccCceEEEE Confidence 2333455444332221 11110 1110 00000 0000000000000000111 2244555677789999999888877 Q ss_pred ecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhce Q lcl|NC_011057. 85 LDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWY 164 (634) Q Consensus 85 iD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~ 164 (634) -+.| |...+ ..++ -+.. ...-..-.+.-.++++.++.+|-+-|+.|+.+. |.. +|. 
..+++ T Consensus 76 ~~~~-----g~~~~-~~~~-~~~~-L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~-r~~-------~G~---~~~L~ 136 (454) T protein:vir:93 76 TDAQ-----GIRRE-TRRG-DIAR-LCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKI-RNA-------RGQ---IKELR 136 (454) T ss_pred eccC-----Cccch-hhhH-HHHH-HHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEE-ECC-------CCc---EEEEE Confidence 6755 32222 1222 2333 344667788899999999999999999999874 422 232 23467 Q ss_pred eccHHHHhccCCCcc-eeeEeCC----CCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHH Q lcl|NC_011057. 165 AVSKEEIKKSNKGSG-TNIVLPT----GEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASK 239 (634) Q Consensus 165 ~vt~~Ei~~~~~~~~-~~i~lP~----g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~ 239 (634) .|....++.....+| ..+..-. |....+.-..|=+|++=...+..-..--||+..+...+.-..-..+...+..+ T Consensus 137 ~i~~~~v~v~~~~~g~~~y~~~~~~~~~~~~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ 216 (454) T protein:vir:93 137 ILDWNRVEPLVADDGEVFYRITPDRNCGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFR 216 (454) T ss_pred EEcCcceEEEEcCCCcEEEEEEeccccccceeEEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 777766653322222 2233221 11111112233345552223333445667777766665544444444443334 Q ss_pred hHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_011057. 
240 SRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDV 319 (634) Q Consensus 240 SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~i 319 (634) .-..-.|||-+|+.++- -+.++|.+.+-+.- .+ .+.++ ++|+ + ..-++ T Consensus 217 ng~~p~gil~~~~~l~~---------------------e~~~~~~~~~~~~~-~g-~n~g~-----~~vl--~--~g~~~ 264 (454) T protein:vir:93 217 NGGRPSGVIEIPGSITE---------------------ENAKKLKSNWDSGY-TG-ENAGK-----TAIL--S--NGAKY 264 (454) T ss_pred ccCCccEEEecCCCCCH---------------------HHHHHHHHHHHHHh-cc-cccCC-----ceec--c--CCceE Confidence 33344567777765431 13445555543221 11 22222 2233 2 23466 Q ss_pred ceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 320 KHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLA 399 (634) Q Consensus 320 kHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~ 399 (634) +.|.+... +.--+++|+..+..||..|-||| .+||.+.+.|.+++.+....-++..|.|.+..|+++|++.++.+ T Consensus 265 ~~l~~~~~-d~q~le~~~~~~~~Ia~~fgVPp-~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~L~~~--- 339 (454) T protein:vir:93 265 NPTTFSPV-DSQTVEQLKMTAEIVCSVFRVPA-YKIGVGQPPSSDNVEALEQQYYSQCLQTLIESIELLLDEALETG--- 339 (454) T ss_pred EEcccChh-HHHHHHHHHHHHHHHHHHhCCCH-HHcCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC--- Confidence 66665432 33347899999999999999999 77888788999999999999999999999999999999887643 Q ss_pred hcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccc Q lcl|NC_011057. 400 REGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLI 476 (634) Q Consensus 400 ~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li 476 (634) ..|.|.||.+.|.. .|..+.+ ..+++.|++|.+++|+++|+..-.|-| . 
-++ T Consensus 340 ------~~~~~~f~~~~ll~-~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD-------------~-----~~~ 394 (454) T protein:vir:93 340 ------ENESTEFDVTTLLR-MDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGD-------------A-----LYL 394 (454) T ss_pred ------CCcEEEeechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-------------e-----eee Confidence 34678999999843 3544444 358899999999999999996432211 0 011 Q ss_pred hh-hhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCccHHHHHHHHHHHHHHH Q lcl|NC_011057. 477 PM-LAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGLESGIVDLMVDRALEL 551 (634) Q Consensus 477 ~~-laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~~~a~vdllv~rALel 551 (634) +. ..|+-. .-+....+.|.+.. + +.. ...++.++.|....... .+.-..+.+.++-+.. T Consensus 395 ~~~~~~~~~-~~~~~~~~~~~~~~---~-~~~--------~~~~~~~~~d~~~~~~e---~~~d~~~~~~~~~~~~ 454 (454) T protein:vir:93 395 QQQNYSLEA-LSRRDAREDPFASS---G-KTA--------SVPQAVAASDGNKAITE---TEHDAVKAMFRGILKK 454 (454) T ss_pred ccCccchHh-hhccCcccCCCCCC---c-cCC--------CCCCCCCCCCCCCCccC---CccchhhhhhhhhhcC Confidence 11 011110 00111111110000 0 000 00011111111100000 0001111222222211 No 17 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=99.70 E-value=5.4e-17 Score=109.77 Aligned_cols=416 Identities=13% Similarity=0.119 Sum_probs=227.9 Q ss_pred CCCCCcc-eeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHH-HHHhhhhhHHHHHhhhhhceeee Q lcl|NC_011057. 1 MAATQSL-RLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAW-EAVDLVGELRYYVGWRASSCSRC 78 (634) Q Consensus 1 ~~a~~~l-r~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW-~~yd~VgELryyvgWr~~s~Sr~ 78 (634) |.=-..+ =.-+|.+. . +..++++...+.+-.+. ..+.+.- -. 
..| ..+-++-.+.-++++||++ T Consensus 1 M~~~~~~f~~~~r~~~-~----------~~~~~~~~~~~~~~~g~-~~~~~~v-~~~~al-~~~~v~~~i~~ia~~ia~l 66 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTS-Q----------VIELNKDDEKLLEWLGI-SPSTISV-KGKNAL-KVATVFACIKILSESVSKL 66 (429) T ss_pred CchhhhhhcccccCcc-c----------ccccCCChHHHHHHhcC-CCCccee-chhhhh-ccHHHHHHHHHHHHhhccC Confidence 3221111 00122211 0 11111222222111111 1111110 00 122 2355666777889999999 Q ss_pred eEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccc Q lcl|NC_011057. 79 RLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVR 158 (634) Q Consensus 79 rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~ 158 (634) .+..=+-+++ | . .+ ..+ ..+..++..-+..-+...++++.++.+|-+-|+.|+.+. |... |. T Consensus 67 ~~~~~~~~~~-~----~-~~-~~~-~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~~-------G~-- 128 (429) T protein:vir:10 67 PLKIYQEDEY-G----I-QR-GTK-HYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIE-FDRK-------GK-- 128 (429) T ss_pred ceEEEEecCC-c----e-ee-ccc-cHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC-------Cc-- Confidence 9887665544 2 1 11 111 334555555567778888999999999999999999874 4322 21 Q ss_pred cchhceeccHHHHhccCCC-------ccee-eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhh Q lcl|NC_011057. 159 TRQEWYAVSKEEIKKSNKG-------SGTN-IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRT 230 (634) Q Consensus 159 ~~~~W~~vt~~Ei~~~~~~-------~~~~-i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rt 230 (634) ...++.+....+...... .... ....+|..++|.. +=+|++=.+.+..-..-.||+..+...+.-.... 
T Consensus 129 -~~~L~~i~~~~v~v~~~~~~~~~~~~~~~~~~~~~g~~~~~~~--~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~ 205 (429) T protein:vir:10 129 -VQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKP--EEILHFKNGITLDGLVGVPTMEYLKSTLENSASA 205 (429) T ss_pred -EEEEEEEcCceeEEEEcCcccccccceEEEEEccCCeEEEEcc--ccEEEecCCCCCCCcccccHHHHHHHHHHHHHHH Confidence 234555554444321111 1111 2333566655543 3345553444555556778888888777776666 Q ss_pred hHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_011057. 231 TKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAG 310 (634) Q Consensus 231 tk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~ 310 (634) .+...+..+.-..-.|||-+|+.++- -..+++++.+-+ .-...++.+. ++|+ T Consensus 206 ~~~~~~~~~ng~~~~~il~~~~~l~~---------------------e~~~~~~~~~~~-~~~g~~n~~~-----~~vl- 257 (429) T protein:vir:10 206 DKFINNFYKQGLQVKGLVQYVGDLNE---------------------DAKKVFRENFES-MSSGLQNSHR-----IALM- 257 (429) T ss_pred HHHHHHHHhccCCccEEEEcCCCCCH---------------------HHHHHHHHHHHH-HhccccccCc-----eeec- Confidence 66666655554455677777655431 133445554431 1122232222 2222 Q ss_pred echHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHH Q lcl|NC_011057. 311 VPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALT 390 (634) Q Consensus 311 vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait 390 (634) + ..-+++-|.+.. 
.+.--+++|+..+..||+.|.||| .+||...++|+.++.+....-++..|.|.+..|+++|+ T Consensus 258 -~--~g~~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVP~-~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln 332 (429) T protein:vir:10 258 -P--VGYQFQPISLNM-SDAQFLENTELTIRQIATAFGIKM-HQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMT 332 (429) T ss_pred -C--CCceEEEccCCh-hHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 223555555432 223337899999999999999999 77777778999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHH Q lcl|NC_011057. 391 DQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQD 467 (634) Q Consensus 391 ~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d 467 (634) +.+|-+.-- ...|.|.||.+.|.. +|..+.+ ..++..|++|.+++|+++|++...+-| T Consensus 333 ~kl~~~~~~-----~~g~~~~fd~~~ll~-~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD------------- 393 (429) T protein:vir:10 333 YKLFLDSEL-----DKGFYSKFNVDAILR-ADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD------------- 393 (429) T ss_pred HhhcChhhc-----CCCcEEEeechhhhc-CCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC------------- Confidence 998855321 244668899999844 3444333 448899999999999999996432222 Q ss_pred HhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCC Q lcl|NC_011057. 468 AVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGE 517 (634) Q Consensus 468 ~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~ 517 (634) . -++| + -+..++.. ......+|+++.+.+..+.++. 
T Consensus 394 ~-----~~~~----~---n~~~~d~~--~~~~~k~g~~~~~~~~~~~e~~ 429 (429) T protein:vir:10 394 R-----LLVN----G---NMLPIDMA--GQAYLKGGDTNGEVSKEGNEGN 429 (429) T ss_pred e-----eeec----c---cccchhhc--cccccCCCCCCCCCCCCCCCCC Confidence 0 0111 0 01111110 0111122333222222222221 No 18 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=99.67 E-value=8.9e-17 Score=108.59 Aligned_cols=415 Identities=15% Similarity=0.135 Sum_probs=223.0 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHH-------------HHhhhhhHHHH Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWE-------------AVDLVGELRYY 67 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~-------------~yd~VgELryy 67 (634) |--+-. |++-|++.+ ...-.+ .++ +....|..++|. -+-..+-+.-. T Consensus 1 ~~~~~~-~~~~~~~~~----~~~~~g--~~~-------------s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~c 60 (437) T protein:vir:10 1 MKQGKQ-RALGRIKSS----FLKWLG--VPI-------------SLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSC 60 (437) T ss_pred CCcchh-hhhhhhHHh----hhhhcC--Ccc-------------cCCchhHHHhhcccccCCCceechHhhhccHHHHHH Confidence 221111 111111110 000000 011 111112222331 11133456667 Q ss_pred HhhhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCC Q lcl|NC_011057. 68 VGWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVK 147 (634) Q Consensus 68 vgWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~ 147 (634) +.-+++++|++.|..-+.+.| |.-.+ . ....+..+...-...-+...++.+.++.+|-+-|++|+.+ .|.. 
T Consensus 61 i~~Ia~~ia~lp~~~~~~~~~-----g~~~~-~-~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i-~r~~- 131 (437) T protein:vir:10 61 VRLIAETIATLPLNLYQTKPD-----GTRVL-A-KQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARK-LRSA- 131 (437) T ss_pred HHHHHHHHhhCceeEEEEcCC-----Cceee-c-cccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEE-EecC- Confidence 888999999999988888877 32111 1 1234556566668888999999999999999999999886 4422 Q ss_pred CCCCCcccccccchhceeccHHHHhccCCCcce-e--eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHH Q lcl|NC_011057. 148 GAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGT-N--IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSI 224 (634) Q Consensus 148 ~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~-~--i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~L 224 (634) |. ...++.+..+.+......++. . +..++|....|.. .| +|++=+++. .-..--||+.++...+ T Consensus 132 -------g~---~~~L~~l~p~~v~i~~~~~g~~~y~~~~~~g~~~~~~~-~d-Iih~r~~~~-d~~~G~spi~~~~~~i 198 (437) T protein:vir:10 132 -------GV---LIGLELMLPQRTTVKRLTSGALQYTYRNVDGTVSTLAE-DD-VFHVRGFSL-DGLMGLTPIQYAREVL 198 (437) T ss_pred -------Cc---EEEEEEEcCcceEEEECCCCeEEEEEEecCceEEEEcc-cc-EEEecCcCC-CCcccccHHHHHHHHH Confidence 21 123566655544332222221 2 4456777655543 34 334422221 2244568877777666 Q ss_pred HHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccc Q lcl|NC_011057. 225 REIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAF 304 (634) Q Consensus 225 rEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~ 304 (634) .-..-..+...+..+.-..-.|||-+|+.++- -..+++.+.+-+ .+.-. 
.-+- T Consensus 199 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~---------------------e~~~~~~~~~~~----~~~g~--~nag 251 (437) T protein:vir:10 199 GNSTAANKTSASVFRNGLRPSGVLSTDQILQK---------------------EKRAEIRTDLAE----QFGGA--MQAG 251 (437) T ss_pred HHHHHHHHHHHHHHhccCCccEEEEcCCCCCH---------------------HHHHHHHHHHHH----HhcCc--cccC Confidence 65555555555555555566777777654331 133444444432 22211 1122 Q ss_pred cceeEeechHHhcccceeecCCchhHH-HHHHHHHHHHHHhhhccCChHHhhccccCcchh--hHhhhhhhhhhHHHHhH Q lcl|NC_011057. 305 IPVIAGVPGEQIKDVKHIRFDNEITEV-AIKTRNDAIARLAMGLDVSPERLLGLGSQTNHW--SAWQISDEDVQLHIAPV 381 (634) Q Consensus 305 vPiva~vP~Ehi~~ikHl~f~~d~te~-aiktR~daI~rlA~~~D~~pE~LLGlgs~~Nhw--tAw~i~de~v~~hI~P~ 381 (634) -|+|+. ..-+++-|.+. ..+. -+++|+..+..||+.|.||| .|||...++|.| +..+....=++..|.|. T Consensus 252 ~~~vl~----~g~~~~~l~~~--~~d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~t~~~sn~e~~~~~f~~~tl~P~ 324 (437) T protein:vir:10 252 KTMVLE----AGMKYQAITMN--PGDVQLLETRAFNIEEICRWYRVPP-FMVGHSEKSTSWGTGIEQQTLGFLTFTLRPW 324 (437) T ss_pred cceecc----CCceEEeccCC--hhhHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCcccccchHHHHHHHHHHHHHHHH Confidence 244442 22344545443 3333 38999999999999999999 677887666554 46788888899999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCH Q lcl|NC_011057. 382 MEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTR 458 (634) Q Consensus 382 ~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~ 458 (634) +..|+++|++.+|.+.- + .+|.|-||.+.|.. 
.|..+.+ ..+++.|++|.+++|+++|+..-.|-| T Consensus 325 ~~~ie~~l~~kll~~~e---~---~~~~~~fd~~~ll~-~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~---- 393 (437) T protein:vir:10 325 LTRIEQAARRSLLRPGE---R---DQFYAEFSVEGLLR-ADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNA---- 393 (437) T ss_pred HHHHHHHHHhhccCccc---c---CceEEEEechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCc---- Confidence 99999999999986622 1 24667789998843 4544433 348899999999999999995422111 Q ss_pred HHHHHHHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCC Q lcl|NC_011057. 459 EGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDD 528 (634) Q Consensus 459 Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~ 528 (634) +.+..+..++ |+ ...+-..|..+... ...+.+.++|++ ..|+.. T Consensus 394 --------~~~~~~~~~~----~~-----~~~~~~~~~~~~~~--------~~~~~~~~~~~~-~~~~e~ 437 (437) T protein:vir:10 394 --------AVLTVQSALL----PI-----DKLGEHTTATAAQD--------ALKAWLYQEEKT-RATQER 437 (437) T ss_pred --------ceEeecCccc----ch-----hhccCcCCCcchhc--------cccccCCCCCCC-CccccC Confidence 0011111111 11 11111111111000 000111111221 111111 No 19 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=99.67 E-value=2.5e-16 Score=106.10 Aligned_cols=409 Identities=13% Similarity=0.150 Sum_probs=218.4 Q ss_pred eEeccCCCCccchhhhhhhhccCCchhhhhhhh-cccCccccccHHHHHHH----------------hhhhhHHHHHhhh Q lcl|NC_011057. 9 LVRRPKGGRPAPSRALTAASQPLPDPSQVFSKS-TGISRNSDWQTDAWEAV----------------DLVGELRYYVGWR 71 (634) Q Consensus 9 ~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~-~~~~~~~~WQ~eAW~~y----------------d~VgELryyvgWr 71 (634) .+|..+. ++..|...... ..+.+. 
...+...+| .|..+ =.++-+.-.+.-+ T Consensus 1 ~~~~l~~-------~~~~~~~~~~~--~~~~~~~~~~~~~~~~---~~~~~~g~~~~~g~~v~~~~al~~~~V~~~i~~i 68 (434) T protein:vir:43 1 MSKSLGK-------VLSSATSAPRS--SLFGWGGKTIRLTDGA---FWSQFLGRESSSGKKVTVDKAMKLSAVWACVRLI 68 (434) T ss_pred Cccchhh-------hhhhcccccch--hhhcccccccccCchH---HHHHHhcCCccCCceechhhhhccHHHHHHHHHH Confidence 1222111 11111111100 000000 000001111 12111 1234455678889 Q ss_pred hhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCC Q lcl|NC_011057. 72 ASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPA 151 (634) Q Consensus 72 ~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~ 151 (634) ++++|.+.+..=+-+.| |+..+ ..+ -.+..++..=...-+...++++.++.+|-+-|+.|+.| .|.. T Consensus 69 a~~ia~lp~~~~~~~~~-----g~~~~-~~~-~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i-~~~~----- 135 (434) T protein:vir:43 69 STSVAGLPLGVYERKAD-----GSRVD-ARS-FPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEI-RRAA----- 135 (434) T ss_pred HHhhhhCceEEEEEcCC-----Ccccc-ccc-cHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEE-EeCC----- Confidence 99999998887777766 33322 112 23445555567788899999999999999999999775 4421 Q ss_pred CcccccccchhceeccHHHHhccCCCcce-e--eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHH Q lcl|NC_011057. 152 QPDGSVRTRQEWYAVSKEEIKKSNKGSGT-N--IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIV 228 (634) Q Consensus 152 ~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~-~--i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~ 228 (634) |. .-.++.|..+.++.....+|. . +...+|...+|.. +-||++=.+ +-+-..--||+..+.+.+.-.. 
T Consensus 136 ---G~---~~~L~~l~p~~v~~~~~~~g~~~y~~~~~~g~~~~~~~--~eVih~~~~-~~dg~~G~spi~~~~~~i~~~~ 206 (434) T protein:vir:43 136 ---GR---PAALDFLLPSRVDLECDENGRLKYFYTTKKGARREIER--TNMLHIPAF-TLDGRIGLSAIRYGVDVFGSVM 206 (434) T ss_pred ---Cc---EEEEEEEcCcceEEEEcCCCeEEEEEEecCceEEEEcc--ccEEEecCc-CCCCccccCHHHHHHHHHHHHH Confidence 21 234566666666433222222 2 3445676666653 333443122 2233446688877776666555 Q ss_pred hhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcccccccccee Q lcl|NC_011057. 229 RTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVI 308 (634) Q Consensus 229 rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiv 308 (634) ...+...+..+.-..-.|||-+|+.++- -+.++|++.+-+ +. +..-+--++| T Consensus 207 ~~~~~~~~~f~ng~~~~gil~~~~~l~~---------------------e~~~~~r~~~~~-----~~--g~~nag~~~v 258 (434) T protein:vir:43 207 SAEDAANGTFKNGLLPTVAFKVDRILQP---------------------AQREEFREYVKS-----VS--GAMNSGRSPV 258 (434) T ss_pred HHHHHHHHHHhccCCcceEEecCCCCCH---------------------HHHHHHHHHHHH-----hc--CccccCCccc Confidence 5555555444444445567777765541 134455554421 11 1111222333 Q ss_pred EeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchh--hHhhhhhhhhhHHHHhHHHHHH Q lcl|NC_011057. 309 AGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHW--SAWQISDEDVQLHIAPVMEIFC 386 (634) Q Consensus 309 a~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~Nhw--tAw~i~de~v~~hI~P~~~~i~ 386 (634) + + ..-+++.|.... 
.+.--+++|+..+..||..|-||| .|||.....+.| +.-+....-++..|.|.+..|+ T Consensus 259 l--~--~g~~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~~~~~s~~e~~~~~f~~~~L~P~~~~ie 332 (434) T protein:vir:43 259 L--E--QGITPETIGINP-VDAQLLETREHGVIEICRWFGVPP-WMIGQTDKGSNWGTGLEQQMLAFLTFSISSITNQIQ 332 (434) T ss_pred c--C--CCceEEEccCCh-hHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCcCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 3 2 233555555432 223348899999999999999999 677875555544 4467777778888999999999 Q ss_pred HHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHH Q lcl|NC_011057. 387 QALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVM 463 (634) Q Consensus 387 ~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~ 463 (634) ++|++.+|.+.- . ..|.|-||.+.|.. .|..+.+ ..++..|++|.+++|+++|+..-.+-| T Consensus 333 ~~ln~kL~~~~~----~--~~~~~~fd~~~llr-~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD--------- 396 (434) T protein:vir:43 333 QCVNKRLLTAPE----R--IRYYAEFSLEGFLK-ADSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPGGD--------- 396 (434) T ss_pred HHHHhhcCChhh----h--cCceEEEechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--------- Confidence 999998876522 1 25788999999843 4444443 348899999999999999997533222 Q ss_pred HHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCC Q lcl|NC_011057. 464 WAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDT 522 (634) Q Consensus 464 wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDT 522 (634) ..-....+ .|+ + .++-..++ .+..+.....+++.||.. 
T Consensus 397 ----~~~~~~n~----~~~-~----~~~~~~~~--------~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 397 ----ILTVQSNL----VPI-D----QLGQSNKS--------QAVRAALMNWFSQPEPQE 434 (434) T ss_pred ----eEeeccCc----cch-h----hhhccCCC--------cchhhhhhccCCCCCCCC Confidence 11001111 111 0 00000000 000111111223333432 No 20 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=99.67 E-value=6.8e-17 Score=109.24 Aligned_cols=403 Identities=14% Similarity=0.106 Sum_probs=222.6 Q ss_pred CCCCCcceeEeccCCCCccch-h--hhhhhhccCC-chhhhhhhhcccCccccccHHHHHHHh-------------hhhh Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPS-R--ALTAASQPLP-DPSQVFSKSTGISRNSDWQTDAWEAVD-------------LVGE 63 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~-r--al~aAs~~it-dp~~~~~~~~~~~~~~~WQ~eAW~~yd-------------~VgE 63 (634) |.=-.-|| |.+....+++ + .-..|+++.+ .++..+ .+.+..| ...|-.-. .++- T Consensus 1 Mgl~d~~r---~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-----~~~~~~~-~~~~~~~~~~~g~~v~~~~al~~~~ 71 (431) T protein:vir:10 1 MGLFDFIR---REKQPEAQARPHVEPSFQASTPTTSIPGETF-----EGLDDPR-LKEYIRRGELNGGTGRETRALRNMA 71 (431) T ss_pred Ccchhhhh---cCccccccccccccccccccccccccccccc-----ccccchH-HHHhhccCccCcceechhhhhccHH Confidence 54323333 3222111111 1 1122222222 222222 1111101 11111000 1234 Q ss_pred HHHHHhhhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEE Q lcl|NC_011057. 64 LRYYVGWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILT 143 (634) Q Consensus 64 LryyvgWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~ 143 (634) +.-.+.-+++++|.+.+..=+-| +++ ....+ ..+..+++.-.---+...++.+.++.+|.+-|++|+.| . 
T Consensus 72 V~~ci~~Ia~~iA~lp~~v~~~~-~~~----~~~~~----~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i-~ 141 (431) T protein:vir:10 72 VLRCVTLISGTIGMLPMNLISSD-DSK----QVLTD----DPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARI-V 141 (431) T ss_pred HHHHHHHHHHhhccCceEEEEec-Cce----eeecc----chHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEE-E Confidence 45556778999999988775544 311 22222 33555556667778899999999999999999999987 3 Q ss_pred ecCCCCCCCcccccccchhceeccHHHHhccCCCcc-ee--eEeCCCCcccccCCCCe-EEEeeCCCcccccCCccchhh Q lcl|NC_011057. 144 RPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSG-TN--IVLPTGEEHEFVKGTDI-IFRVWIPKPRKASEPDSPVRA 219 (634) Q Consensus 144 rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~-~~--i~lP~g~~h~~~~~~D~-~~RvW~P~prra~eaDSPvra 219 (634) |.. | ++ -+.+.+....+......++ .. +..++|...+|. ..|+ -||-.+++. ..--||+.. T Consensus 142 r~~--------g--~~-~~L~pl~~~~v~~~~~~~~~~~y~~~~~~g~~~~~~-~~dViHir~~~~dg---~~G~spi~~ 206 (431) T protein:vir:10 142 WSG--------N--RP-IRLIPMDRGSAKGRLTSTWQIVYDYTTPTGDKIELP-AREVFHLRDLSIDG---VSGVSRVKL 206 (431) T ss_pred EcC--------C--ce-EEEEEEcCceeEEEEcCCCeEEEEEEeCCceEEEEc-hhhEEEecCcCCCC---cccccHHHH Confidence 421 2 12 2466666666553222222 22 455677766654 3343 344333332 345678777 Q ss_pred hhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcc Q lcl|NC_011057. 
220 VLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDED 299 (634) Q Consensus 220 ~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~ 299 (634) +...+.=..-..+...+..+.-..-.|||-+|+.++- ...+++++.+- .++.-.+ T Consensus 207 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~---------------------e~~~~~~~~~~----~~~~g~~ 261 (431) T protein:vir:10 207 SGNALELAEQAERAASRTFRTGVMAGGAIEVPKELSD---------------------NAYGRMKASVQ----ENHTGSE 261 (431) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCH---------------------HHHHHHHHHHH----HHhcCcc Confidence 7766655555555555555555555677776654431 14455666554 2222111 Q ss_pred ccccccceeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHH Q lcl|NC_011057. 300 SQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIA 379 (634) Q Consensus 300 S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~ 379 (634) . +-=|+|+ + ..-+++-|.+.. .+.--+++|+..+..||..|.||| .+||...++++.+..|....=++..|. T Consensus 262 n--~g~~~vl--~--~g~~~~~l~~~~-~d~q~le~r~~~~~~Ia~~fgVPp-~~lg~~~~~t~sn~eq~~~~f~~~tL~ 333 (431) T protein:vir:10 262 N--AGSWMLL--E--EGATAKQFSNTA-ASAQQIENRNHQIEEVARMYGVPR-PLLMMDDTSWGSGIEQLAIFFIQYGLS 333 (431) T ss_pred c--cCCceec--C--CCceEEEccCCh-hHHHHHHHHHHhHHHHHHHhCCCH-HHhCCCCCCccccHHHHHHHHHHHHHH Confidence 1 1112222 2 334666676643 233447899999999999999999 777877778888999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccC----CCHHHHHHHhCCCcccc Q lcl|NC_011057. 380 PVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGA----INGEALRKYLGLGDDAG 452 (634) Q Consensus 380 P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~----It~ealr~~~Gl~ed~~ 452 (634) |.+..|+++|++.+|-+.. + ..|-|.||.+.|. 
+.|..+.+ ..++..|+ +|.+++|+++|++--.+ T Consensus 334 P~~~~ie~~ln~~Ll~~~~---~---~~~~~~fd~~~ll-r~d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~ 406 (431) T protein:vir:10 334 HWFVSWEQAAARAFLPEKM---L---GQRQFKFNEGALL-RGTLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADD 406 (431) T ss_pred HHHHHHHHHHHhhccChhh---c---CCceEEEechhhh-ccCHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCC Confidence 9999999999999885422 1 2577889999983 34554444 23565555 99999999999964322 Q ss_pred CCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCC Q lcl|NC_011057. 453 YDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDG 516 (634) Q Consensus 453 yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~ 516 (634) .+.+ ..-.|...+.. .+++ ++.+.. T Consensus 407 ~~gD--------------------------------~~~~p~n~~~~-~~~~------~~p~~~ 431 (431) T protein:vir:10 407 PVAD--------------------------------QLRNPMTQKQK-GSGD------EPPATT 431 (431) T ss_pred cccc--------------------------------ceecccccccC-CCCC------CCCCCC Confidence 2111 00111100000 0000 000000 No 21 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=99.67 E-value=3e-16 Score=105.70 Aligned_cols=399 Identities=16% Similarity=0.170 Sum_probs=217.1 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhc-ccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKST-GISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCR 79 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~-~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (634) |.=-+ |+.++++-. |+.+..+ -+.+++..+.... ..+... .+ ..+-+.-.|.-++++||.+. 
T Consensus 1 Mgl~~--~~f~~~~~~-----~~~~~~~-~~~~~~~~~~~~g~~v~~~~--------al-~~~~v~~~v~~ia~~iA~lp 63 (409) T protein:vir:84 1 MSLFT--RIFSGPSEE-----RTLTKIS-GIPSPAEDWAMHGDRPGANS--------AM-TLGAFYACVTLLADTVASLS 63 (409) T ss_pred Cchhh--hhhcCCCcc-----ccccccc-ccccccchhhccCcccchhh--------hh-ccHHHHHHHHHHHHhhhhCc Confidence 54433 455554321 2222111 1122222211100 000000 01 12334445666899999999 Q ss_pred EEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCccccccc Q lcl|NC_011057. 80 LVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRT 159 (634) Q Consensus 80 L~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~ 159 (634) +..-+-+ |. +.+.+ ..+..+...-..--+...++++.++.+|-+-|+.|+.|..|..++. T Consensus 64 ~~~~~~~-~~----~~~~~-----~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~---------- 123 (409) T protein:vir:84 64 IDAYRKK-DN----VRIPV-----SPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANR---------- 123 (409) T ss_pred eEEEEec-CC----ccccc-----chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCc---------- Confidence 9887755 32 23332 2345556666778889999999999999999999988866643321 Q ss_pred chhceeccHHHHhcc--CCCcceeeEe-CCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_011057. 160 RQEWYAVSKEEIKKS--NKGSGTNIVL-PTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIAN 236 (634) Q Consensus 160 ~~~W~~vt~~Ei~~~--~~~~~~~i~l-P~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 236 (634) ....+.|..+.+... ....+..+.. -.+.-.+|. 
..| +|++=+..+..-..--||+..+...+.=..-..+...+ T Consensus 124 ~~~L~~l~p~~v~v~~~~~~~~~~~~~~~~~~g~~~~-~~d-vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 201 (409) T protein:vir:84 124 PTAIMPIHPDCIHVTDAKDEDGDWIEPVYRIDGKVVP-NHR-IMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLR 201 (409) T ss_pred eEEEEEEcCceeEEEEcCCCcceEEEEEecCCceEEc-hhh-EEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHH Confidence 223455554443322 1122222221 111223342 234 44443344444456778888777766665555555555 Q ss_pred HHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_011057. 237 ASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQI 316 (634) Q Consensus 237 a~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi 316 (634) ..+.-..-.|||-+|+.++- -+.+++.+.+.+. +.. +--++|+ + .. T Consensus 202 ~f~ng~~p~gil~~~~~l~~---------------------e~~~~~~~~~~~~----~~n-----~g~~~vl--~--~g 247 (409) T protein:vir:84 202 WFRDSANPSGILSSDADLTP---------------------DQVKQTQKQWIQS----HHN-----RRLPAVM--S--AG 247 (409) T ss_pred HHhcCCCccEEEecCCCCCH---------------------HHHHHHHHHHHHH----hcc-----CCCeeec--C--CC Confidence 55555556677777764431 1345566655532 222 2224443 3 23 Q ss_pred cccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhh--HhhhhhhhhhHHHHhHHHHHHHHHHHHHH Q lcl|NC_011057. 317 KDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWS--AWQISDEDVQLHIAPVMEIFCQALTDQIL 394 (634) Q Consensus 317 ~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~Nhwt--Aw~i~de~v~~hI~P~~~~i~~ait~~~l 394 (634) .+++-+.+.. 
.+.--+++|+..+..||..|.||| .+||...+.|-|+ ..+....=++..|.|.++.|+++|++.+ T Consensus 248 ~~~~~~~~~~-~d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~L- 324 (409) T protein:vir:84 248 IKWQSVSITP-NESQFLETRSFQRSEIAMWFRIPP-HMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRCIEQALDTFL- 324 (409) T ss_pred ceEEEccCCh-hHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhc- Confidence 3455555432 122338899999999999999999 6778655666544 4666666678889999999999999764 Q ss_pred HHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhc Q lcl|NC_011057. 395 RVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSK 471 (634) Q Consensus 395 r~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~ 471 (634) +..|.|.||.++|.. +|..+.+ ..+++.|++|.+++|+++|+..-.+-| + T Consensus 325 ----------~~g~~i~fd~~~l~~-~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD----~------------ 377 (409) T protein:vir:84 325 ----------PRGQFVKFNVDGLMR-GDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGD----I------------ 377 (409) T ss_pred ----------cCCCeEEEechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc----e------------ Confidence 134778999999854 4555544 448889999999999999997533322 0 Q ss_pred Ccccchh-hhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCC Q lcl|NC_011057. 472 DPTLIPM-LAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDD 528 (634) Q Consensus 472 dp~Li~~-laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~ 528 (634) -++|. +.|+-+ +++ . ++. .+.+|+++.++.. 
T Consensus 378 --~~~~~n~~~~~~--------------~~~-~-~~~--------~~~~~~~~~~gn~ 409 (409) T protein:vir:84 378 --HLQPMNFVPLGY--------------VPP-E-EPA--------QEPQPNSATEGNK 409 (409) T ss_pred --eeeccccccccc--------------CCc-c-ccC--------cCCCCCCccCCCC Confidence 01110 011110 000 0 000 0111111111111 No 22 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=99.66 E-value=5.4e-17 Score=109.77 Aligned_cols=407 Identities=12% Similarity=0.107 Sum_probs=211.6 Q ss_pred CCcceeE---eccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHH-------------HHHHhhhhhHHHH Q lcl|NC_011057. 4 TQSLRLV---RRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDA-------------WEAVDLVGELRYY 67 (634) Q Consensus 4 ~~~lr~v---rrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eA-------------W~~yd~VgELryy 67 (634) -+..|+. -|-|.. -.-+.++..++... ....+|.... ++-+-..+-+.-. T Consensus 1 ~~~~~~mg~f~r~~~~--------~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~ 66 (432) T protein:vir:81 1 MPDEKKLGLFGQLKAM--------FVPPDPVDIGGGQT------FTPVNATARDLGIIISDTGAAVNADAIMRLDAVAAC 66 (432) T ss_pred CCchhhcchhhhhhhh--------cccccccccccccc------cccCccchhhhcccccccCcccchHhhhccHHHHHH Confidence 1222211 000000 00000010000000 0001111111 1112233445567 Q ss_pred HhhhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCC Q lcl|NC_011057. 68 VGWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVK 147 (634) Q Consensus 68 vgWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~ 147 (634) +.-+++++|++.+..-+-+.| |..+. .++ .+..++..=+...+...++++.++.+|.+-|++|+.+. |.. 
T Consensus 67 i~~Ia~~ia~lp~~~y~~~~~-----g~~~~--~~~-~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~-~~~- 136 (432) T protein:vir:81 67 VKLVSQAIAAMPLTMYMRTPD-----GRKEA--VNH-PLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKV-VTD- 136 (432) T ss_pred HHHHHHhhhhCceeeEEecCC-----cceec--ccc-hHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEE-ecC- Confidence 888999999999988777766 33221 122 24444555566778889999999999999999998764 322 Q ss_pred CCCCCcccccccchhceeccHHHHhccCCCcce-e--eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHH Q lcl|NC_011057. 148 GAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGT-N--IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSI 224 (634) Q Consensus 148 ~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~-~--i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~L 224 (634) |. ..+.+.|..+.+......+|. . +...+|...+|.. .| +|++ ...+-....--||+.++.+.+ T Consensus 137 -------g~---~~~L~~l~~~~v~v~~~~~g~~~y~~~~~~g~~~~~~~-~~-iih~-r~~~~dg~~G~spi~~~~~~i 203 (432) T protein:vir:81 137 -------GR---IESLQYLANDRLTITTDPKGNTAYRYRRTDGQMIDIPK-QQ-IWKI-MGYSLDGENGLSAIRYGAQIF 203 (432) T ss_pred -------Cc---EEEEEEEcCCceEEEECCCCcEEEEEEecCceEEEEcc-cc-EEEe-cCCCCCCcccccHHHHHHHHH Confidence 21 123455555554433222222 2 3344666555433 33 3444 222323345568877766655 Q ss_pred HHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccc Q lcl|NC_011057. 225 REIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAF 304 (634) Q Consensus 225 rEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~ 304 (634) .--.-..+...+..+.-....|||-+|+.++- .+.+.+.+-+-. 
..+ +- T Consensus 204 ~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~---------------------e~~~~~~~~~~~-----~~n-----ag 252 (432) T protein:vir:81 204 GTAIAAEAQAARAFRNGQLQSVYYQIDRFLTD---------------------DQYDSFAKKVSG-----SVE-----AG 252 (432) T ss_pred HHHHHHHHHHHHHHhcCCCcceEEecCCCCCH---------------------HHHHHHHHHHhh-----hhc-----CC Confidence 54444444444443333344577777765431 133444443321 111 11 Q ss_pred cceeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcch---hhHhhhhhhhhhHHHHhH Q lcl|NC_011057. 305 IPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNH---WSAWQISDEDVQLHIAPV 381 (634) Q Consensus 305 vPiva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~Nh---wtAw~i~de~v~~hI~P~ 381 (634) =++|+ + ...+++-|.+... +.--+++|+..+..||..|.||| .|||.....+. .+..|....=++..|.|. T Consensus 253 ~~~vl--~--~g~~~~~l~~~~~-d~q~le~~~~~~~~Ia~~fgVPp-~~lg~~~~~~~~~~sn~eq~~~~f~~~tl~P~ 326 (432) T protein:vir:81 253 RAPLL--E--GGMDVKSLGLNPV-DAQLLQSRQYSVESICRFFGVPP-SMIGHSSAGTTSWGSGIESQQLGFLTMTLSPW 326 (432) T ss_pred Cceec--C--CCceEEEccCCHH-HHHHHHHHHHHHHHHHHHhCCCH-HHcCCcCCccccccchHHHHHHHHHHHHHHHH Confidence 13333 2 3345666665443 22347899999999999999999 77887433333 455677777788899999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCH Q lcl|NC_011057. 382 MEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTR 458 (634) Q Consensus 382 ~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~ 458 (634) +..|+++|++.+|.+.- . ..|.|-||.+.|. 
+.|..+.+ ..++..|++|.+++|+++|+..-.|-+ T Consensus 327 ~~~ie~~l~~kLl~~~~---~---~~~~~~fd~~~ll-r~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g~~---- 395 (432) T protein:vir:81 327 LRRIEQSIALNLLSPAE---R---RRYFADFDTSALL-RADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNA---- 395 (432) T ss_pred HHHHHHHHHhhccCccc---c---CceEEEeechhhh-ccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCc---- Confidence 99999999998887632 1 3588899999985 33444333 347889999999999999996422111 Q ss_pred HHHHHHHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCcccc Q lcl|NC_011057. 459 EGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDD 511 (634) Q Consensus 459 Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~ 511 (634) +.+..+..++| +-.. . +-+.|.++.....++++..++ T Consensus 396 --------~~~~~~~~~~p----l~~~--~--~~~~~~~~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 396 --------AVLTVQSAMVP----LDSI--G--LQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred --------ceEeecCcccc----hhhh--c--cCCCCCCCCCCCCcccccccC Confidence 11111112211 1100 0 011111111110111111111 No 23 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=99.66 E-value=1.9e-16 Score=106.77 Aligned_cols=409 Identities=12% Similarity=0.081 Sum_probs=233.7 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHH-HHHhhhhhHHHHHhhhhhceeeee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAW-EAVDLVGELRYYVGWRASSCSRCR 79 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW-~~yd~VgELryyvgWr~~s~Sr~r 79 (634) |.==. |+..|.++......-.....+-+.++++..... + ...++ .--. ..+ ..+-+.-.+.=+++++|++. 
T Consensus 1 MG~f~--~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---g-~~~~~-~v~~~~al-~~~~v~~ci~~ia~~iA~lp 72 (422) T protein:vir:13 1 MGFLR--GLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKF---G-IKLNF-SVRGKRAL-KENTVYVCTKIRAESIGKLS 72 (422) T ss_pred Cchhh--hhhhccCCccchhhhhhhccccccCcchhhhhc---c-ccCCc-ccchhhhh-ccHHHHHHHHHHHHhhhhCc Confidence 54322 233333332211111111222222222211100 0 00001 0000 001 22344555667899999998 Q ss_pred EEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCccccccc Q lcl|NC_011057. 80 LVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRT 159 (634) Q Consensus 80 L~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~ 159 (634) +..-+=. ..+.+ ..+..++..-+.-.+-..++++.++.+|-+-|+.|+.+. |.. +|. T Consensus 73 ~~~~~~~-------~~~~~-----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~-r~~-------~G~--- 129 (422) T protein:vir:13 73 LKIYKDK-------EEYKE-----HELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIE-RDR-------KGK--- 129 (422) T ss_pred eEEEecC-------ccccc-----chHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE-ECC-------CCc--- Confidence 8764411 11221 234555555566778888999999999999999998873 422 231 Q ss_pred chhceeccHHHHhccCCCcc-------ee--eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhh Q lcl|NC_011057. 160 RQEWYAVSKEEIKKSNKGSG-------TN--IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRT 230 (634) Q Consensus 160 ~~~W~~vt~~Ei~~~~~~~~-------~~--i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rt 230 (634) ..+++.|....+......++ .- +..++|...+|.. |=+|++-.+.+..-..-.||+..+.+.+.-.... 
T Consensus 130 ~~~L~~i~~~~v~~~~~~~~~~~~~~~~~y~~~~~~g~~~~~~~--~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~ 207 (422) T protein:vir:13 130 IIGLYPINSDNVTKIIDDDNFLSSLSKVWYVVTDKNGKEHKLLP--DEMLHFIGDITLDGLIGIKPLDYLRCTIENGRAT 207 (422) T ss_pred EEEEEEECCcceEEEEcCCcceeccceEEEEEEeCCCeEEEEcc--cceEEEcCCCCCCCcccccHHHHHHHHHHHHHHH Confidence 23466666666654322222 22 3345666665554 3445555555666667789999999888777777 Q ss_pred hHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_011057. 231 TKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAG 310 (634) Q Consensus 231 tk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~ 310 (634) .+...+..+.-..-.|||-+|+.++- -..+++++.+.+. ....++.+. ++|+ T Consensus 208 ~~~~~~~f~ng~~p~gil~~~~~l~~---------------------e~~~~~~~~~~~~-~~g~~n~~~-----~~vl- 259 (422) T protein:vir:13 208 QEFINKFFKNGLSIKGIVQYVGDLDE---------------------KAKKIFKKEFESM-SNGLENAHS-----ISLL- 259 (422) T ss_pred HHHHHHHHhccCCccEEEEeCCCCCH---------------------HHHHHHHHHHHHH-hcCccccCC-----ceec- Confidence 77777777776677888888765431 1345555555422 111222221 2222 Q ss_pred echHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHH Q lcl|NC_011057. 311 VPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALT 390 (634) Q Consensus 311 vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait 390 (634) + ..-+++-+.+... 
+.=-+++|+..+..||..|.||| .+||...++|+.+..+....-++..|.|.++.|+++|+ T Consensus 260 -~--~g~~~~~l~~~~~-d~q~le~~~~~~~~Ia~~fgVpp-~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~ 334 (422) T protein:vir:13 260 -P--FGYQFQPISLSMA-DAQFLENSKLTKRELAATFGMKS-YHLNDLERATFNNLTEQQKDFYVTTLQSSLTVYEQEIQ 334 (422) T ss_pred -C--CCceeeeccCChh-HHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 3345555554332 22337899999999999999999 77776668899999999888899999999999999999 Q ss_pred HHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHH Q lcl|NC_011057. 391 DQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQD 467 (634) Q Consensus 391 ~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d 467 (634) +.+|.+.-. ...|-|.||.+.|.. +|..+.+ ..+++.|++|.+++|+++|+..-.+-| T Consensus 335 ~~Ll~~~~~-----~~g~~i~fd~~~l~r-~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD------------- 395 (422) T protein:vir:13 335 DKLFSQYET-----LQDVKAEFNVDTILR-SDIKTRYEAYRIGIQGGFIEANEARRRENLPPVEGGD------------- 395 (422) T ss_pred HhhCChhhh-----cCCceEEeechhhhc-CCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC------------- Confidence 998876321 134778999999854 3433333 448999999999999999997533322 Q ss_pred HhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCC Q lcl|NC_011057. 
468 AVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEH 518 (634) Q Consensus 468 ~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ 518 (634) ..-....+ .|+ +..+.+ ...++.++|+ T Consensus 396 ~~~~~~n~----~~l-----~~~~~~---------------~~~~g~~~g~ 422 (422) T protein:vir:13 396 RLLVNGNM----IPI-----EMAGEQ---------------YKKGGEKGGK 422 (422) T ss_pred eeeeccCc----cch-----hhcccc---------------cccCCCcCCC Confidence 11111111 111 111111 0111222222 No 24 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=99.65 E-value=1e-15 Score=102.82 Aligned_cols=415 Identities=14% Similarity=0.143 Sum_probs=229.1 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhh-ccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAAS-QPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCR 79 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs-~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (634) |-...-++-.+++.... .--.++.++. ...+.-++. .+.. ..+ .++-+.-.+.-++++||.+. T Consensus 1 m~~~~~~~~~~~~~s~~-~~w~~~~~~~~~~~~~~g~~------vt~~--------~al-~~~~v~~~i~~Ia~~iA~lp 64 (421) T protein:vir:10 1 MFIPQMFEGKKRSVSGG-GFWEAMLGGVRSSHSKAGVM------ITPE--------TAL-ALSAVRACVTLLAESVAQLP 64 (421) T ss_pred CCCcchhcccccccCcc-hhhHHHhhhhccCcccCCce------echH--------Hhh-ccHHHHHHHHHHHHhhccCc Confidence 55444443333222111 0001222211 111111110 0111 111 34556667778999999999 Q ss_pred EEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCccccccc Q lcl|NC_011057. 80 LVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRT 159 (634) Q Consensus 80 L~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~ 159 (634) +..=+-+.| |+..+ ..++ .+..+...=...-+...++++.++.+|-+-|+.|+.+ .|.. +|. 
T Consensus 65 ~~~~~~~~~-----g~~~~-~~~~-~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i-~r~~-------~G~--- 126 (421) T protein:vir:10 65 VELYRRDKN-----GGRQR-ATDH-PIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSII-DRDG-------KGY--- 126 (421) T ss_pred eEEEEEcCC-----Cceee-cccc-hHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEE-EEcC-------CCc--- Confidence 988788877 33332 1122 3445555557788889999999999999999998886 3432 232 Q ss_pred chhceeccHHHHhccCCCcce-eeEeCC-CCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHH Q lcl|NC_011057. 160 RQEWYAVSKEEIKKSNKGSGT-NIVLPT-GEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANA 237 (634) Q Consensus 160 ~~~W~~vt~~Ei~~~~~~~~~-~i~lP~-g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na 237 (634) ...++.|....+......+|. -+.+.. |. .|. .+-||++=.++ .....--||+..+.+.+.-..-..+...+. T Consensus 127 ~~~L~~l~~~~v~v~~~~~g~~~y~~~~~g~--~~~--~~eiih~~~~~-~d~~~G~spi~~~~~~i~~~~~~~~~~~~~ 201 (421) T protein:vir:10 127 PKELIPINPKKVIVLKGPDGMPYYEIPEIGE--TLP--MRMMHHVKVFS-LDGYIGSSPIQTNADVLGLNLAVEEHASAV 201 (421) T ss_pred EEEEEEecCceEEEEECCCceEEEEEcCCCc--EEc--hhhEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHH Confidence 234666655554432222232 233432 32 222 23344442332 233456788777776666555555555555 Q ss_pred HHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhc Q lcl|NC_011057. 238 SKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIK 317 (634) Q Consensus 238 ~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~ 317 (634) .+.-..-.|||-.|+++.- . ...-+.++|.+.+-+ .+.--+.+-. 
++|+ + ..- T Consensus 202 f~ng~~~~gil~~~~~~~~--~---------------~~~e~~~~~~~~~~~----~~~g~~n~~~--~~vl--~--~g~ 254 (421) T protein:vir:10 202 FRRGATMSGVIERPKEAPA--I---------------KSQEKIDQLLAKWTD----RYSGINNMFS--VALL--Q--EGM 254 (421) T ss_pred HhcCCCccEEEEecCccCc--c---------------CCHHHHHHHHHHHHH----HhcCccccCc--ceec--C--CCc Confidence 5555555677877764421 1 122244455555442 2221112212 2222 2 234 Q ss_pred ccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 318 DVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVT 397 (634) Q Consensus 318 ~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~ 397 (634) +++.|.+.. .+.--+++|+..+..||..|-||| .+||+...+|..+.-+....=++..|.|.+..|+++|++.+|.+. T Consensus 255 ~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~t~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~ 332 (421) T protein:vir:10 255 SYKQMSQDN-EKAQLLQSRQWGVEEVCRLYKIPP-HMVQMLAKATNNNIEHQGLQFVMYTLLAWLKRHEGALQRDLLLPS 332 (421) T ss_pred eEEecCCCh-hHHHHHHHHHHhHHHHHHHhCCCH-HHcCCCcCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcc Confidence 555555433 222348899999999999999999 778987888999999999889999999999999999999887652 Q ss_pred HHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcc Q lcl|NC_011057. 398 LAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPT 474 (634) Q Consensus 398 L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~ 474 (634) ++ ..|.|-||.+.|.. 
.|..+.+ ..+++.|++|.+++|+++|+..-.+-| + - T Consensus 333 ---~~---~~~~v~fd~~~l~~-~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD----~--------------~ 387 (421) T protein:vir:10 333 ---ER---RDLYIEFNVSGLLR-GDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGD----K--------------Y 387 (421) T ss_pred ---cc---CCeEEEEechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc----e--------------e Confidence 21 24678899999854 3444444 347889999999999999996432221 0 0 Q ss_pred cchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCC Q lcl|NC_011057. 475 LIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQ 526 (634) Q Consensus 475 Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~ 526 (634) ++ |+- +...+-+ .+++ +.+.+.++.++.++..+. T Consensus 388 ~~----~~n---~~~~~~~------~~~~-----~~~~~~~~~e~d~~~~~~ 421 (421) T protein:vir:10 388 LT----PLN---MVDSAQI------IPGD-----KKPTAQQMAEIDTILSRT 421 (421) T ss_pred ee----ccc---ccccccc------ccCC-----CCcccccCcccccccccC Confidence 11 111 0000001 1111 111122233333333333 No 25 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=99.65 E-value=8.7e-16 Score=103.14 Aligned_cols=408 Identities=14% Similarity=0.080 Sum_probs=223.7 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |--- |+..+.-+. +... ...++- .... 
-++.....+..=.++-+=..+.+.-.|.-++++||++.+ T Consensus 1 m~~~---~~f~~~~~~--~~~~-------~~~~~~-~~~~-~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~~ 66 (416) T protein:vir:12 1 MLLE---RMFEKRSGS--SDHE-------DGFNNI-LLNM-FGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKLPI 66 (416) T ss_pred Cccc---hhcccccCc--cccC-------ccchhH-HHHh-hcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCce Confidence 2100 111111000 0000 000000 0000 000000000000011111345566677889999999998 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..=+-+++ +... .+++. +..++..-+.-.+...++++.++.+|-+-|++|+.+. |.. +|. . T Consensus 67 ~~~~~~~~------~~~~-~~~~~-l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~-r~~-------~G~---~ 127 (416) T protein:vir:12 67 HTYKRTDG------GIER-KPEHK-SAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQ-FGS-------HGY---P 127 (416) T ss_pred EEEEecCC------cccc-ccccH-HHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE-ECC-------CCc---E Confidence 66554433 2221 22333 3334455567778889999999999999999998863 422 232 3 Q ss_pred hhceeccHHHHhccC--CCcceeeE-eCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSN--KGSGTNIV-LPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANA 237 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~--~~~~~~i~-lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na 237 (634) .++++|....++... .++..-+. ..+|...+|....=+-||-.+++. ..--||+.++...+.-.....+...+. 
T Consensus 128 ~~L~~l~~~~v~v~~~~~~~~~~~~~~~~g~~~~~~~~eiih~~~~~~~~---~~G~s~i~~~~~~i~~~~~~~~~~~~~ 204 (416) T protein:vir:12 128 EALFPLRPDYTNAYVHPTTGMLWYQTVLNGKAIELYDYEVLHFKGLSTDG---IHGKSPIGVVREHIGAQAAATKYNAKL 204 (416) T ss_pred EEEEEECCcceEEEEeCCCcEEEEEEecCCeEEEecCccEEEecCcCCCC---cccccHHHHHHHHHHHHHHHHHHHHHH Confidence 557777777766332 22222223 336666555443324444333332 345688877777776666666666666 Q ss_pred HHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhc Q lcl|NC_011057. 238 SKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIK 317 (634) Q Consensus 238 ~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~ 317 (634) .+.-..-.|||-+|+.++ ....+++++.+-... ..++ ++|+ + ..- T Consensus 205 ~~ng~~p~~il~~~~~~~---------------------~e~~~~~~~~~~~~~-----~~~~-----~~vl--~--~g~ 249 (416) T protein:vir:12 205 YKNEATPRGILKVPAFLD---------------------EKPKENVRKEWKRVN-----KVEN-----IAII--D--YGL 249 (416) T ss_pred HhcCCCCceEEecCCCCC---------------------HHHHHHHHHHHHHHh-----cCCC-----eeec--C--CCc Confidence 666666678887765432 124556666654221 1122 2222 3 233 Q ss_pred ccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 318 DVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVT 397 (634) Q Consensus 318 ~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~ 397 (634) +++-|.+... +.--+++|+.....||+.|.||| .+||...++|+.++-+....-++..|.|.+..|+++|++.+|-+. 
T Consensus 250 ~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~l~~~~ 327 (416) T protein:vir:12 250 EYQSISMPLQ-EAQFVESMKFNKAQISMIYKVPL-HKLNELDKATFSNIEHQSIEYVRNTLQPWIVNFEQELNVKLFLDH 327 (416) T ss_pred eEEEccCChh-hHHHHHHHHHHHHHHHHHhCCCH-HHhCCccCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCch Confidence 5666665432 23347899999999999999999 777766789999999999999999999999999999999988553 Q ss_pred HHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcc Q lcl|NC_011057. 398 LAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPT 474 (634) Q Consensus 398 L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~ 474 (634) -. ...|-|-||.++|. +.|..+.+ ..++++|++|.+++|+++|+..-.+-| .- T Consensus 328 ~~-----~~g~~i~fd~~~l~-~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd------------------~~ 383 (416) T protein:vir:12 328 DQ-----KSGHYVKFNIDSEL-RGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGD------------------KY 383 (416) T ss_pred hh-----cCCceEEeechhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc------------------ee Confidence 21 13467889999984 34444444 458999999999999999996532222 00 Q ss_pred cchh-hhhhhhhhhhcccCC-CCCCCCCCCCCCCCccccCCCCCCCCCCCCC Q lcl|NC_011057. 475 LIPM-LAPLIAGVLQQIEFP-QQQQAIDSGGNEDTSDDDNLDDGEHEPDTED 524 (634) Q Consensus 475 Li~~-laPll~p~~q~~~~P-~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~ 524 (634) +++. +.|+ + ..+-. .+.+. ++..|-|+..+. 
T Consensus 384 ~~~~n~~~~-~----~~~~~~~~~~~--------------~~~~gge~~~~g 416 (416) T protein:vir:12 384 ISSLNYVFL-D----FLEEYQRLKAG--------------GAMKGGDNKNEG 416 (416) T ss_pred eeccccccc-c----ccchhhccccc--------------cccCCCCCcCCC Confidence 1110 0110 0 00000 00000 011111111111 No 26 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=99.65 E-value=1.6e-16 Score=107.20 Aligned_cols=416 Identities=14% Similarity=0.111 Sum_probs=224.2 Q ss_pred ceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeEEEeeec Q lcl|NC_011057. 7 LRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRLVASELD 86 (634) Q Consensus 7 lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD 86 (634) +-+-+.=|+.+..++.. -..+ ++......+..+..-.+ +..+ .++-++-.+.-+++++|.+.+..=+-+ T Consensus 1 m~~~~~~~~~~~~~~~~----~~~~--~~~~~~~~~~~g~~v~~----~~al-~~~~v~~~i~~ia~~ia~lp~~~~~~~ 69 (419) T protein:vir:57 1 MFIPQFWKGRPSENRVN----WQVV--PGGMRSSSSQAGVIITP----ETAL-ALSAVRACVTLLAESVAQLPCVLYRRT 69 (419) T ss_pred CcchhhhccCCcccccc----cccc--ccccccccccCCceech----HHhh-ccHHHHHHHHHHHHhhccCceEEEEEc Confidence 33333323321111100 0000 00000000000000001 1122 234566677889999999999887878 Q ss_pred ccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhceec Q lcl|NC_011057. 87 ENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYAV 166 (634) Q Consensus 87 ~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~v 166 (634) .+ |+.+. ..+ ..+..++..=+.--+...++++.++.+|-+-|+.|+.| .|... |. 
.-.++.| T Consensus 70 ~~-----g~~~~-~~~-~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i-~r~~~-------G~---~~~L~pl 131 (419) T protein:vir:57 70 EN-----GGREI-AFD-HPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLI-DRNGR-------GD---ITELIPI 131 (419) T ss_pred CC-----Cceec-ccc-chHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEE-EECCC-------Cc---EEEEEEE Confidence 77 33222 112 23455555556778888999999999999999998886 35332 32 2345666 Q ss_pred cHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCc Q lcl|NC_011057. 167 SKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNG 246 (634) Q Consensus 167 t~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnG 246 (634) ....+......++..+..-.+....|. .+-||++=+++ .+...--||+.++...+.-.....+...+..+.-..-.| T Consensus 132 ~~~~v~v~~~~~g~~~y~~~~~~~~~~--~~~vih~r~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~g 208 (419) T protein:vir:57 132 NPHKVIVLKGPDGMPYYDIPSIGEILP--MRMVHHIKSFS-LDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSG 208 (419) T ss_pred cCcceEEEECCCceEEEEEcCCceEEc--hhhEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccE Confidence 655555443344444322222222332 24444543333 223445688887777666555555554444444444456 Q ss_pred eeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCC Q lcl|NC_011057. 247 VLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDN 326 (634) Q Consensus 247 vlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~ 326 (634) ||.+|..+.- .......+.|.+.+. .++.-.+. +--++|+ + ..-+++.|.+. 
T Consensus 209 il~~~~~~~~-----------------~~~~e~~~~~~~~~~----~~~~g~~n--ag~~~vl--~--~g~~~~~l~~~- 260 (419) T protein:vir:57 209 VIERPFEAKA-----------------IASQAAVDAILAKWT----ERYGGVRN--AFSVGML--Q--EGMTYKQLSQD- 260 (419) T ss_pred EEEecCcCCc-----------------ccCHHHHHHHHHHHH----HHhccccc--cccceec--C--CCceEEEcCCC- Confidence 7766644321 012234555665554 22222111 2233333 2 23355555542 Q ss_pred chhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChh Q lcl|NC_011057. 327 EITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPS 406 (634) Q Consensus 327 d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~ 406 (634) ..+.--+++|+..+..||..|.||| .+||...++|..++-+....-++.-|.|.++.|.++|++.+|.+.. + . T Consensus 261 ~~d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~---~---~ 333 (419) T protein:vir:57 261 NEKAQLLQSRQYTVNEVCRLYKVPP-HMIQDLQKSTNNNIEHQGLQYVIYTMLAILKRHESAMMRDLLLPSE---R---R 333 (419) T ss_pred hhhHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccc---c---C Confidence 2333347899999999999999999 6777767888888888888888999999999999999999997633 1 2 Q ss_pred HheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhh Q lcl|NC_011057. 407 KYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLI 483 (634) Q Consensus 407 ~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll 483 (634) .|.|.||.+.|.. +|..+.+ ..+++.|++|.+++|.++|+..-.+-| . -++ |+- T Consensus 334 ~~~i~fd~~~ll~-~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD-------------~-----~~~----~~n 390 (419) T protein:vir:57 334 DFYIEFNVSSLLR-GDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGGD-------------K-----YLT----PLN 390 (419) T ss_pred CeEEEEechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-------------e-----eee----ccc Confidence 5889999999844 4544443 347899999999999999996422211 0 011 111 Q ss_pred hhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCc Q lcl|NC_011057. 
484 AGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAG 535 (634) Q Consensus 484 ~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~ 535 (634) . . |...+ . +.+...|+..++....-.. -+ T Consensus 391 ~--------~-~~~~~-~------------~~~~~~~~~~~~~~~~~~~-~~ 419 (419) T protein:vir:57 391 M--------V-DSKAL-T------------GIGKATPQQLKDIEAILCT-RN 419 (419) T ss_pred c--------c-ccccc-c------------cccCCCcccCcchhhhhhc-cC Confidence 0 0 00000 0 0011111111111110000 00 No 27 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=99.64 E-value=3.6e-16 Score=105.28 Aligned_cols=414 Identities=14% Similarity=0.112 Sum_probs=222.8 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhh-hcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSK-STGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCR 79 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~-~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (634) |-=- |..-|..|...+....-.+. +. +...++ ...++.. .|. .++-+.-.|.=+++++|.+. T Consensus 1 m~~~---~~~~~~~~~~~~~~~~~~~~---~~--g~~~s~~~~~v~~~-----~al----~~~~v~~cv~~ia~~ia~lp 63 (419) T protein:vir:80 1 MFFS---RQLLSNLGQTQPGSGGWVSA---LL--GSARSEAGQVVTPA-----SAL----SLTVLQNCVTLLAESIAQLP 63 (419) T ss_pred CCcc---cccccccCcCCCCcchhhHH---hh--cccccccCcccChH-----Hhh----ccHHHHHHHHHHHHhhccCc Confidence 2100 11111112111111100000 00 000000 0001111 111 23445556777899999999 Q ss_pred EEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCccccccc Q lcl|NC_011057. 80 LVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRT 159 (634) Q Consensus 80 L~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~ 159 (634) |..=+.+.| |.. ...++ .+..++..-..--+...++++.++.+|-+-|+.|+.+ .|... |. 
T Consensus 64 ~~~~~~~~~-----~~~--~~~~~-~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i-~r~~~-------G~--- 124 (419) T protein:vir:80 64 VELYERSGD-----DRK--PATDH-PLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFI-DRDQD-------GV--- 124 (419) T ss_pred eEEEEecCC-----Ccc--ccccc-HHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEE-EECCC-------Cc--- Confidence 988888876 321 12223 3556666667778889999999999999999999886 44332 32 Q ss_pred chhceeccHHHHhccCCCccee-eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_011057. 160 RQEWYAVSKEEIKKSNKGSGTN-IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANAS 238 (634) Q Consensus 160 ~~~W~~vt~~Ei~~~~~~~~~~-i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 238 (634) ...++.|....+.....+++.. +.+-++. .....++++.=+++ .+-..--||+..+...+.-.....+...+.. T Consensus 125 ~~~L~~i~~~~v~i~~~~~~~~~y~~~~~~---~~~~~~i~h~~~~~--~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f 199 (419) T protein:vir:80 125 IQGLYPLDNEAVTVMKGPDLKPMYRVAGAD---PLPQRLVHHVRWMS--INGYTGLSPVLLHANAIGHAQAIQQYAGKSF 199 (419) T ss_pred EEEEEEecCceEEEEECCCceEEEEEcCcc---ccchhheEEecCCC--CCCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 2346666666554333332222 3332211 12223443333443 3334567887777666655555555555555 Q ss_pred HhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcc Q lcl|NC_011057. 239 KSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKD 318 (634) Q Consensus 239 ~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ 318 (634) +.-..-.|||.+|+.+.- . ......+.|.+.+- ..+...+.+-. 
++++ + ...+ T Consensus 200 ~ng~~~~gil~~~~~~~~--~---------------~~~~~~~~~~~~~~----~~~~g~~n~g~--~~vl--~--~g~~ 252 (419) T protein:vir:80 200 MNGTALSGVIERPTDAPA--L---------------KDQASVDRITDGWN----AKFGGSGNAKK--VALL--Q--EGMK 252 (419) T ss_pred hcCCCccEEEEecCCCCc--c---------------cCHHHHHHHHHHHH----HHhcCccccCC--ceec--C--CCce Confidence 555666677777664321 0 11224445555544 22322222211 2222 2 3346 Q ss_pred cceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 319 VKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTL 398 (634) Q Consensus 319 ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L 398 (634) ++.|.+. ..+.--+++|+..+..||..|-||| .|||...++|..++-+....-++..|.|.+..|+++|++.+|-+.. T Consensus 253 ~~~l~~s-~~d~q~~e~~~~~~~~Ia~~fgVPp-~llg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~ 330 (419) T protein:vir:80 253 FKPLSMT-NVDAALIDALRLSALDIARIYKIPA-HMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDLLLPSE 330 (419) T ss_pred EEeccCC-hhhHHHHHHHHHHHHHHHHHhCCCH-HHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccc Confidence 6666654 2444467899999999999999999 7778777888888889888888999999999999999998886633 Q ss_pred HhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCccc Q lcl|NC_011057. 399 AREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTL 475 (634) Q Consensus 399 ~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~L 475 (634) + ..|.|.||.+.|.. +|..+.+ ..+++.|++|.+++|+++|+..-.+=| + . 
+ T Consensus 331 ---~---~~~~i~fd~~~l~~-~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gGD----~-~-------------~ 385 (419) T protein:vir:80 331 ---R---KQYFIEYNLAGLLR-GDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGD----I-Y-------------L 385 (419) T ss_pred ---c---CCeEEEEechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc----e-e-------------e Confidence 1 35889999999844 3443333 348889999999999999996422212 0 0 0 Q ss_pred chhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCccHHHHHHHHHHHHHH Q lcl|NC_011057. 476 IPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGLESGIVDLMVDRALE 550 (634) Q Consensus 476 i~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~~~a~vdllv~rALe 550 (634) + |+ -+...+-|.|.+ .+...++++..+. . .|.|. T Consensus 386 ~----~~---n~~~~~~~~~~~------------------~~~~~~~~~~~~~------------~----~~~l~ 419 (419) T protein:vir:80 386 S----PM---NMVDASKPQPIP------------------MGKTEPTKAALDE------------I----GRILS 419 (419) T ss_pred e----cc---cccccccccccc------------------CCCCCchhhhHHH------------H----HhhcC Confidence 0 10 001111111100 0000011100000 0 12221 No 28 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=99.64 E-value=4.4e-16 Score=104.77 Aligned_cols=436 Identities=14% Similarity=0.114 Sum_probs=217.2 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCC---chhhhhhhhcccCccccccHHH--HHHHhhhhhHHHHHhhhhhce Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLP---DPSQVFSKSTGISRNSDWQTDA--WEAVDLVGELRYYVGWRASSC 75 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~it---dp~~~~~~~~~~~~~~~WQ~eA--W~~yd~VgELryyvgWr~~s~ 75 (634) |.==+.|+ .|++.. +. ..+.-... +|.... .....|..+. 
.+-+=.++-+.-.|.-++++| T Consensus 1 Mg~~~~l~--~~~~~~--~~----~~~~~~~~~~~~~~~~~------~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~i 66 (457) T protein:vir:62 1 MGFWSALF--GRGHSP--AL----DAAEGRAWEPYDPSIYN------LGATASSGERVTPHDALQVSAVFASVRLLSETI 66 (457) T ss_pred Cchhhhhh--cccccc--cc----ccccccccccchhhhhh------ccccccCCceechHHhhccHHHHHHHHHHHHhH Confidence 55444332 222221 10 00000000 111100 0000111000 011112344555677788999 Q ss_pred eeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCccc Q lcl|NC_011057. 76 SRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDG 155 (634) Q Consensus 76 Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg 155 (634) |.+.|..=+-+.+ +..+ .+...+..+...- -..+...++++.++.+|-+-|++|+.| .+. ++. T Consensus 67 A~lp~~~~~~~~~------~~~~--~~~~~~~~ll~~p-n~~~t~~~f~~~~~~~l~l~Gna~~~i-~~~-~g~------ 129 (457) T protein:vir:62 67 ATLPLSTYSKRGG------TRKE--IDTPEWLDFPNAE-PGGMGRIDILSQTVLSLLLQGNAFLAV-RWA-GPN------ 129 (457) T ss_pred hhCceEEEEecCC------cccc--ccchHHHHhcccc-CCCCCHHHHHHHHHHHHhhcCCeEEEE-EeC-CCc------ Confidence 9998877655533 1111 1123333433332 334788999999999999999999887 332 211 Q ss_pred ccccchhceeccHHHHhcc--CCCc-----ceeeEeC-CCCcccccC-CCCeEEEeeCCCcccccCCccchhhhhHHHHH Q lcl|NC_011057. 156 SVRTRQEWYAVSKEEIKKS--NKGS-----GTNIVLP-TGEEHEFVK-GTDIIFRVWIPKPRKASEPDSPVRAVLDSIRE 226 (634) Q Consensus 156 ~~~~~~~W~~vt~~Ei~~~--~~~~-----~~~i~lP-~g~~h~~~~-~~D~~~RvW~P~prra~eaDSPvra~l~~LrE 226 (634) ....+.|....+... ...+ ...+... +|..+.... 
..+=||++=.+++.....--||+.++...+.- T Consensus 130 ----~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y~~~~~g~~~~~~~~~~~eiih~r~~~~~~~~~G~sp~~~~~~~i~~ 205 (457) T protein:vir:62 130 ----IAGLDVLDPTKIHVHMVMVDGLRRKVFEAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGL 205 (457) T ss_pred ----EEEEEEEcCcceEEEEeccCCccceeEEEEEEccCCceeEEEeeCccceEEecCCCCCCceecccHHHHHHHHHHH Confidence 122444444333211 1000 0123333 233332211 22334555455554445667887777766655 Q ss_pred HHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccc Q lcl|NC_011057. 227 IVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIP 306 (634) Q Consensus 227 I~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vP 306 (634) ..-..+...+..+.-.+-.|||-+|+.++- -+.+++++.+-+ .+.-.+.+ -=+ T Consensus 206 ~~~~~~~~~~~f~ng~~p~gil~~~~~ls~---------------------e~~~~~~~~~~~----~~~G~~na--g~~ 258 (457) T protein:vir:62 206 ALAAQKYGAHFFRNGAMPGAVVEVPGTMSE---------------------EGLARAREAWRA----ANSGVDNA--HRV 258 (457) T ss_pred HHHHHHHHHHHHhccCCcceEEEcCCCCCH---------------------HHHHHHHHHHHH----HhcCcccc--Ccc Confidence 555555555555555556688887765431 144555555542 22211111 112 Q ss_pred eeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhH--hhhhhhhhhHHHHhHHHH Q lcl|NC_011057. 307 VIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSA--WQISDEDVQLHIAPVMEI 384 (634) Q Consensus 307 iva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtA--w~i~de~v~~hI~P~~~~ 384 (634) +|+ + ..-+++-|.+... +.--+++|+..+..||..|-||| .|||...++|.|++ -+..-.=++..|.|.++. 
T Consensus 259 ~vl--~--~g~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~~~~~sn~eq~~~~f~~~~l~P~~~~ 332 (457) T protein:vir:62 259 ALL--T--EGAKFSKVAMSPD-EAQFLQTRQFQVPEIARIFGVPP-HLISDATNSTSWGSGLAEQNIAFTMFSLRPWLER 332 (457) T ss_pred eec--C--CCceEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCH-HHcCCCCCcccccchHHHHHHHHHHHHHHHHHHH Confidence 222 2 2345555554322 22338999999999999999999 67898777787653 555555667789999999 Q ss_pred HHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHH Q lcl|NC_011057. 385 FCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGW 461 (634) Q Consensus 385 i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~ 461 (634) |+++|++.+|.+.- ...|.|+||.+.|.. .|..+.+ ..++..|++|.+++|++.||.--.+-. T Consensus 333 ie~~ln~~L~~~~~------~~~~~i~fd~~~l~~-~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~------- 398 (457) T protein:vir:62 333 IEAGFNRLLFAETA------DRFRFVKFNLDEIKR-GAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGL------- 398 (457) T ss_pred HHHHHHhhhcCccc------cCceEEEeechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC------- Confidence 99999999886532 245778999999843 3443333 458889999999999999995432210 Q ss_pred HHHHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCC--CCCCCCCccccCCCCCCCCCCCCCCCCCccc Q lcl|NC_011057. 462 VMWAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAID--SGGNEDTSDDDNLDDGEHEPDTEDDQDDDGT 531 (634) Q Consensus 462 r~wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~--~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~ 531 (634) + |--+ .|+---.......++|.++.. 
....++...+...++.+..||.+.......+ T Consensus 399 ---~------D~~~----~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 457 (457) T protein:vir:62 399 ---G------EKYR----VPLNLGEIGEEPEPEPAPAPPAIDPPAEEPADDEEPDNAEGDPDEGETEDDDDA 457 (457) T ss_pred ---c------ceee----eccccccccccccccccCCCccCCCCccCCCCCCCCCCCCCCCccccccccccC Confidence 0 0001 121100111111222221111 0011111111111233334443222221112 No 29 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=99.64 E-value=3.4e-16 Score=105.41 Aligned_cols=416 Identities=12% Similarity=0.135 Sum_probs=216.3 Q ss_pred CCCCCcceeEeccCCCCccch-hhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPS-RALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCR 79 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~-ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (634) |---..++-.-+|.....+.. .+.+......-+++...+++ +.. -. ++-+-.++-+.-.+.-+++++|++. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~-g~~--v~-----~~~al~~~~V~~~i~~Ia~~ia~lp 78 (432) T protein:vir:10 7 LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDT-GAA--VN-----ADAIMRLDAVAACVKLVSQAIAAMP 78 (432) T ss_pred cchhhhhHhhcCCccccccccccccccCcchhhhhccccccc-Ccc--cc-----hhhhhcchHHHHHHHHHHHhhhhCc Confidence 222222222222211000000 01000000000111111110 000 00 1111133455556777899999999 Q ss_pred EEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCccccccc Q lcl|NC_011057. 80 LVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRT 159 (634) Q Consensus 80 L~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~ 159 (634) +..=+-+.| |.... .+ ..+..+...=...-+...++++.++.+|-+-|++|+.+. |.. |. 
T Consensus 79 ~~~y~~~~~-----g~~~~--~~-~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~-~~~--------g~--- 138 (432) T protein:vir:10 79 LTMYMRTPD-----GRKEA--VN-HPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKV-VTD--------GR--- 138 (432) T ss_pred eeEEEecCC-----Ccccc--cc-cHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEE-ecC--------Cc--- Confidence 988777766 32221 11 224444555577789999999999999999999998763 322 21 Q ss_pred chhceeccHHHHhccCCCcce-e--eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_011057. 160 RQEWYAVSKEEIKKSNKGSGT-N--IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIAN 236 (634) Q Consensus 160 ~~~W~~vt~~Ei~~~~~~~~~-~--i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 236 (634) ..+++.|....+......+|. . +...+|...+|.. +-+|++=++ +.....--||+..+.+.+.--....+...+ T Consensus 139 ~~~L~~l~~~~v~v~~~~~g~~~y~~~~~~g~~~~~~~--~~iih~~~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~ 215 (432) T protein:vir:10 139 IESLQYLANDRLTITTDTKGNTAYRYRRTDGQMIDIPK--QQIWKIMGY-SLDGENGLSAIRYGAQIFGTAIAAEAQAAR 215 (432) T ss_pred EEEEEEEcCCceEEEEcCCCcEEEEEEecCceEEEEcC--ccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 234555655555433222222 2 3344666555433 334555222 223345558887777666554445555555 Q ss_pred HHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_011057. 237 ASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQI 316 (634) Q Consensus 237 a~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi 316 (634) .-+.-....|||-+|+.++- .+.++|++-+-. ..+.+. ++|+ + .. 
T Consensus 216 ~f~ng~~~~gil~~~~~l~~---------------------e~~~~~~~~~~~-----~~nag~-----~~vl--~--~g 260 (432) T protein:vir:10 216 AFRNGQLQSVYYQIDRFLTD---------------------DQYDSFAKKVSG-----SVEAGR-----APLL--E--GG 260 (432) T ss_pred HHhcCCCcceEEecCCCCCH---------------------HHHHHHHHHHhh-----hhhCCC-----ceec--C--CC Confidence 44544556677877765431 134555554431 111111 2333 2 23 Q ss_pred cccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccC---cchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHH Q lcl|NC_011057. 317 KDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQ---TNHWSAWQISDEDVQLHIAPVMEIFCQALTDQI 393 (634) Q Consensus 317 ~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~---~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~ 393 (634) -+++-|.+..+ +.--+++|+..+..||..|.||| .|||.... .+..+.-++.-.=++..|.|.+..|+++|++.+ T Consensus 261 ~~~~~l~~~~~-d~q~le~~~~~~~~Ia~afgVPp-~~lg~~~~~t~~~~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL 338 (432) T protein:vir:10 261 MDVKSLGLNPV-DAQLLQSRQYSVESICRFFGVPP-SMIGHSSAGTTSWGSGIESQQLGFLSMTLSPWLRRIEQSIALNL 338 (432) T ss_pred ceEEEccCChH-HHHHHHHHHHHHHHHHHHhCCCH-HHcCCccCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 45666665443 22247899999999999999999 77786433 223455666667778889999999999999999 Q ss_pred HHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhh Q lcl|NC_011057. 394 LRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVS 470 (634) Q Consensus 394 lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~ 470 (634) |.+.- + ..|.|-||.+.|. +.|..+.+ ..++..|++|.+++|+++||..-.|-+ +.+. T Consensus 339 ~~~~~---~---~~~~~~fd~~~ll-~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g~~------------~~~~ 399 (432) T protein:vir:10 339 LSPAE---R---RRYFADFDTSALL-RADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNA------------AVLT 399 (432) T ss_pred cCccc---c---CceEEEeechhhh-ccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCc------------ceEe Confidence 87732 2 3588899999984 33444333 348889999999999999996422211 1111 Q ss_pred cCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCcccc Q lcl|NC_011057. 
471 KDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDD 511 (634) Q Consensus 471 ~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~ 511 (634) .+..+. |+=. +. .-++|.++.....++.++.++ T Consensus 400 ~~~~~~----pl~~--~~--~~~~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 400 VQSAMV----PLDS--IG--LQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred ecCccc----chhh--hc--ccCCCCCCCCCCCcccccccC Confidence 111111 1110 00 011111111110111111111 No 30 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=99.64 E-value=2.7e-16 Score=105.95 Aligned_cols=416 Identities=13% Similarity=0.115 Sum_probs=226.6 Q ss_pred CCCCCccee---EeccCCCCccchhhhhhhhccCCchhhhhhhhcccCc-cccccHHHHHHHhhhhhHHHHHhhhhhcee Q lcl|NC_011057. 1 MAATQSLRL---VRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISR-NSDWQTDAWEAVDLVGELRYYVGWRASSCS 76 (634) Q Consensus 1 ~~a~~~lr~---vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~-~~~WQ~eAW~~yd~VgELryyvgWr~~s~S 76 (634) |.==..|+= .+ +|.+. ..+.+. .+...+.+-.+.+. .-..-.+ ..| ..+.+.-.+.-+++.+| T Consensus 1 M~~~~r~~~~~~~~-~r~~~--~~~~~~-------~~~~~~~~~~g~~~~~~~v~~~--~al-~~~~v~~~i~~ia~~ia 67 (432) T protein:vir:10 1 MKIVDSVKKFFNFE-KRQTS--QVIELN-------KDDEKLLEWLGISPSTISVKGK--NAL-KVATVFACIKILSESVS 67 (432) T ss_pred CChHHHHHHhcCcc-ccCcc--cccccC-------CchHHHHHHhCCCcCccccchh--hhh-ccHHHHHHHHHHHHhhc Confidence 433222210 01 11111 011111 11111111000000 0000000 112 23555666777899999 Q ss_pred eeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccc Q lcl|NC_011057. 77 RCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGS 156 (634) Q Consensus 77 r~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~ 156 (634) ++.+..-+-+++ |. .+ .. ...+..+.+.-+..-+...++++.++.+|.+-|+.|+.+. |... |. 
T Consensus 68 ~lp~~~~~~~~~-----~~-~~-~~-~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~~-------G~ 131 (432) T protein:vir:10 68 KLPLKIYQEDEY-----GI-QR-GT-KHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIE-FDRK-------GK 131 (432) T ss_pred cCceEEEEecCC-----ce-ee-cc-ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC-------Cc Confidence 999887776655 21 11 11 1335555555577788999999999999999999999973 4322 21 Q ss_pred cccchhceeccHHHHhccCC-------CcceeeEe-CCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHH Q lcl|NC_011057. 157 VRTRQEWYAVSKEEIKKSNK-------GSGTNIVL-PTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIV 228 (634) Q Consensus 157 ~~~~~~W~~vt~~Ei~~~~~-------~~~~~i~l-P~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~ 228 (634) ...++.|..+.+..... +....+.. .+|....|.. .| +|++=++.|.....--||+.++...+.-.. T Consensus 132 ---~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~-~e-iih~r~~~~~~~~~G~s~~~~~~~~i~~~~ 206 (432) T protein:vir:10 132 ---VQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKP-EE-ILHFKNGITLDGLVGVPTMEYLKSTLENSA 206 (432) T ss_pred ---EEEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEcc-cc-EEEecCCCCCCCcccccHHHHHHHHHHHHH Confidence 23455555554432111 11122222 3455444433 33 455534455556667899988887777666 Q ss_pred hhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcccccccccee Q lcl|NC_011057. 229 RTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVI 308 (634) Q Consensus 229 rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiv 308 (634) ...+...+..+.-..-.|||-+|+.++- -..+++.+.+-+ .-...++.+. 
++| T Consensus 207 ~~~~~~~~~~~ng~~p~gil~~~~~l~~---------------------e~~~~~~~~~~~-~~~g~~n~~~-----~~v 259 (432) T protein:vir:10 207 SADKFINNFYKQGLQVKGLVQYVGDLNE---------------------DAKKVFRENFES-MSSGLQNSHR-----IAL 259 (432) T ss_pred HHHHHHHHHHhccCCccEEEEcCCCCCH---------------------HHHHHHHHHHHH-HhcccccCCc-----cee Confidence 6666665555555555678877665431 133444444431 1112222221 222 Q ss_pred EeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHH Q lcl|NC_011057. 309 AGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQA 388 (634) Q Consensus 309 a~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~a 388 (634) + + ..-+++.|.+.. .+.--+++|+..+..||+.|.||| .+||...++|..++.+....-++..|.|.+..|+++ T Consensus 260 l--~--~g~~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVP~-~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ 333 (432) T protein:vir:10 260 M--P--VGYQFQPISLNM-SDAQFLENTELTIRQIATAFGIKM-HQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQE 333 (432) T ss_pred c--C--CCceEEEccCCh-hHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 2 234666666543 233348899999999999999999 777877788999999999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHH Q lcl|NC_011057. 389 LTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWA 465 (634) Q Consensus 389 it~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA 465 (634) |++.+|-+.-.. ..|-|.||.+.|.. 
+|..+.+ ..++..|++|.+++|+++|+....|-| T Consensus 334 ln~kLl~~~~~~-----~g~~~~fd~~~l~~-~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD----------- 396 (432) T protein:vir:10 334 MTYKLFLDSELD-----KGFYSKFNVDAILR-ADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD----------- 396 (432) T ss_pred HHHhhcChhhcC-----CCcEEEeechhhhc-CCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----------- Confidence 999988663322 33557899999854 3444433 448889999999999999996432211 Q ss_pred HHHhhcCcccch-hhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCC Q lcl|NC_011057. 466 QDAVSKDPTLIP-MLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGE 517 (634) Q Consensus 466 ~d~v~~dp~Li~-~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~ 517 (634) . -+++ .+.|+= . .. .+...+++++.+....+.++. T Consensus 397 --~-----~~~~~n~~~~~-----~--~~---~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 397 --R-----LLVNGNMLPID-----M--AG---QAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred --e-----Eeecccccchh-----h--cc---ccccCCCCCCCCCCCCCCCCC Confidence 0 0111 001110 0 00 011122322222222211111 No 31 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=99.64 E-value=2.7e-16 Score=105.95 Aligned_cols=416 Identities=13% Similarity=0.115 Sum_probs=226.6 Q ss_pred CCCCCccee---EeccCCCCccchhhhhhhhccCCchhhhhhhhcccCc-cccccHHHHHHHhhhhhHHHHHhhhhhcee Q lcl|NC_011057. 1 MAATQSLRL---VRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISR-NSDWQTDAWEAVDLVGELRYYVGWRASSCS 76 (634) Q Consensus 1 ~~a~~~lr~---vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~-~~~WQ~eAW~~yd~VgELryyvgWr~~s~S 76 (634) |.==..|+= .+ +|.+. ..+.+. .+...+.+-.+.+. 
.-..-.+ ..| ..+.+.-.+.-+++.+| T Consensus 1 M~~~~r~~~~~~~~-~r~~~--~~~~~~-------~~~~~~~~~~g~~~~~~~v~~~--~al-~~~~v~~~i~~ia~~ia 67 (432) T protein:vir:10 1 MKIVDSVKKFFNFE-KRQTS--QVIELN-------KDDEKLLEWLGISPSTISVKGK--NAL-KVATVFACIKILSESVS 67 (432) T ss_pred CChHHHHHHhcCcc-ccCcc--cccccC-------CchHHHHHHhCCCcCccccchh--hhh-ccHHHHHHHHHHHHhhc Confidence 433222210 01 11111 011111 11111111000000 0000000 112 23555666777899999 Q ss_pred eeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccc Q lcl|NC_011057. 77 RCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGS 156 (634) Q Consensus 77 r~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~ 156 (634) ++.+..-+-+++ |. .+ .. ...+..+.+.-+..-+...++++.++.+|.+-|+.|+.+. |... |. T Consensus 68 ~lp~~~~~~~~~-----~~-~~-~~-~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~~-------G~ 131 (432) T protein:vir:10 68 KLPLKIYQEDEY-----GI-QR-GT-KHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIE-FDRK-------GK 131 (432) T ss_pred cCceEEEEecCC-----ce-ee-cc-ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC-------Cc Confidence 999887776655 21 11 11 1335555555577788999999999999999999999973 4322 21 Q ss_pred cccchhceeccHHHHhccCC-------CcceeeEe-CCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHH Q lcl|NC_011057. 157 VRTRQEWYAVSKEEIKKSNK-------GSGTNIVL-PTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIV 228 (634) Q Consensus 157 ~~~~~~W~~vt~~Ei~~~~~-------~~~~~i~l-P~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~ 228 (634) ...++.|..+.+..... +....+.. .+|....|.. .| +|++=++.|.....--||+.++...+.-.. 
T Consensus 132 ---~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~-~e-iih~r~~~~~~~~~G~s~~~~~~~~i~~~~ 206 (432) T protein:vir:10 132 ---VQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKP-EE-ILHFKNGITLDGLVGVPTMEYLKSTLENSA 206 (432) T ss_pred ---EEEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEcc-cc-EEEecCCCCCCCcccccHHHHHHHHHHHHH Confidence 23455555554432111 11122222 3455444433 33 455534455556667899988887777666 Q ss_pred hhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcccccccccee Q lcl|NC_011057. 229 RTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVI 308 (634) Q Consensus 229 rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiv 308 (634) ...+...+..+.-..-.|||-+|+.++- -..+++.+.+-+ .-...++.+. ++| T Consensus 207 ~~~~~~~~~~~ng~~p~gil~~~~~l~~---------------------e~~~~~~~~~~~-~~~g~~n~~~-----~~v 259 (432) T protein:vir:10 207 SADKFINNFYKQGLQVKGLVQYVGDLNE---------------------DAKKVFRENFES-MSSGLQNSHR-----IAL 259 (432) T ss_pred HHHHHHHHHHhccCCccEEEEcCCCCCH---------------------HHHHHHHHHHHH-HhcccccCCc-----cee Confidence 6666665555555555678877665431 133444444431 1112222221 222 Q ss_pred EeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHH Q lcl|NC_011057. 309 AGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQA 388 (634) Q Consensus 309 a~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~a 388 (634) + + ..-+++.|.+.. 
.+.--+++|+..+..||+.|.||| .+||...++|..++.+....-++..|.|.+..|+++ T Consensus 260 l--~--~g~~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVP~-~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ 333 (432) T protein:vir:10 260 M--P--VGYQFQPISLNM-SDAQFLENTELTIRQIATAFGIKM-HQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQE 333 (432) T ss_pred c--C--CCceEEEccCCh-hHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 2 234666666543 233348899999999999999999 777877788999999999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHH Q lcl|NC_011057. 389 LTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWA 465 (634) Q Consensus 389 it~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA 465 (634) |++.+|-+.-.. ..|-|.||.+.|.. +|..+.+ ..++..|++|.+++|+++|+....|-| T Consensus 334 ln~kLl~~~~~~-----~g~~~~fd~~~l~~-~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD----------- 396 (432) T protein:vir:10 334 MTYKLFLDSELD-----KGFYSKFNVDAILR-ADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD----------- 396 (432) T ss_pred HHHhhcChhhcC-----CCcEEEeechhhhc-CCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----------- Confidence 999988663322 33557899999854 3444433 448889999999999999996432211 Q ss_pred HHHhhcCcccch-hhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCC Q lcl|NC_011057. 466 QDAVSKDPTLIP-MLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGE 517 (634) Q Consensus 466 ~d~v~~dp~Li~-~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~ 517 (634) . -+++ .+.|+= . .. .+...+++++.+....+.++. 
T Consensus 397 --~-----~~~~~n~~~~~-----~--~~---~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 397 --R-----LLVNGNMLPID-----M--AG---QAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred --e-----Eeecccccchh-----h--cc---ccccCCCCCCCCCCCCCCCCC Confidence 0 0111 001110 0 00 011122322222222211111 No 32 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=99.64 E-value=2.7e-16 Score=105.95 Aligned_cols=416 Identities=13% Similarity=0.115 Sum_probs=226.6 Q ss_pred CCCCCccee---EeccCCCCccchhhhhhhhccCCchhhhhhhhcccCc-cccccHHHHHHHhhhhhHHHHHhhhhhcee Q lcl|NC_011057. 1 MAATQSLRL---VRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISR-NSDWQTDAWEAVDLVGELRYYVGWRASSCS 76 (634) Q Consensus 1 ~~a~~~lr~---vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~-~~~WQ~eAW~~yd~VgELryyvgWr~~s~S 76 (634) |.==..|+= .+ +|.+. ..+.+. .+...+.+-.+.+. .-..-.+ ..| ..+.+.-.+.-+++.+| T Consensus 1 M~~~~r~~~~~~~~-~r~~~--~~~~~~-------~~~~~~~~~~g~~~~~~~v~~~--~al-~~~~v~~~i~~ia~~ia 67 (432) T protein:vir:10 1 MKIVDSVKKFFNFE-KRQTS--QVIELN-------KDDEKLLEWLGISPSTISVKGK--NAL-KVATVFACIKILSESVS 67 (432) T ss_pred CChHHHHHHhcCcc-ccCcc--cccccC-------CchHHHHHHhCCCcCccccchh--hhh-ccHHHHHHHHHHHHhhc Confidence 433222210 01 11111 011111 11111111000000 0000000 112 23555666777899999 Q ss_pred eeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccc Q lcl|NC_011057. 77 RCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGS 156 (634) Q Consensus 77 r~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~ 156 (634) ++.+..-+-+++ |. .+ .. ...+..+.+.-+..-+...++++.++.+|.+-|+.|+.+. |... |. 
T Consensus 68 ~lp~~~~~~~~~-----~~-~~-~~-~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~~-------G~ 131 (432) T protein:vir:10 68 KLPLKIYQEDEY-----GI-QR-GT-KHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIE-FDRK-------GK 131 (432) T ss_pred cCceEEEEecCC-----ce-ee-cc-ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC-------Cc Confidence 999887776655 21 11 11 1335555555577788999999999999999999999973 4322 21 Q ss_pred cccchhceeccHHHHhccCC-------CcceeeEe-CCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHH Q lcl|NC_011057. 157 VRTRQEWYAVSKEEIKKSNK-------GSGTNIVL-PTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIV 228 (634) Q Consensus 157 ~~~~~~W~~vt~~Ei~~~~~-------~~~~~i~l-P~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~ 228 (634) ...++.|..+.+..... +....+.. .+|....|.. .| +|++=++.|.....--||+.++...+.-.. T Consensus 132 ---~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~-~e-iih~r~~~~~~~~~G~s~~~~~~~~i~~~~ 206 (432) T protein:vir:10 132 ---VQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKP-EE-ILHFKNGITLDGLVGVPTMEYLKSTLENSA 206 (432) T ss_pred ---EEEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEcc-cc-EEEecCCCCCCCcccccHHHHHHHHHHHHH Confidence 23455555554432111 11122222 3455444433 33 455534455556667899988887777666 Q ss_pred hhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcccccccccee Q lcl|NC_011057. 229 RTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVI 308 (634) Q Consensus 229 rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiv 308 (634) ...+...+..+.-..-.|||-+|+.++- -..+++.+.+-+ .-...++.+. 
++| T Consensus 207 ~~~~~~~~~~~ng~~p~gil~~~~~l~~---------------------e~~~~~~~~~~~-~~~g~~n~~~-----~~v 259 (432) T protein:vir:10 207 SADKFINNFYKQGLQVKGLVQYVGDLNE---------------------DAKKVFRENFES-MSSGLQNSHR-----IAL 259 (432) T ss_pred HHHHHHHHHHhccCCccEEEEcCCCCCH---------------------HHHHHHHHHHHH-HhcccccCCc-----cee Confidence 6666665555555555678877665431 133444444431 1112222221 222 Q ss_pred EeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHH Q lcl|NC_011057. 309 AGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQA 388 (634) Q Consensus 309 a~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~a 388 (634) + + ..-+++.|.+.. .+.--+++|+..+..||+.|.||| .+||...++|..++.+....-++..|.|.+..|+++ T Consensus 260 l--~--~g~~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVP~-~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ 333 (432) T protein:vir:10 260 M--P--VGYQFQPISLNM-SDAQFLENTELTIRQIATAFGIKM-HQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQE 333 (432) T ss_pred c--C--CCceEEEccCCh-hHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 2 234666666543 233348899999999999999999 777877788999999999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHH Q lcl|NC_011057. 389 LTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWA 465 (634) Q Consensus 389 it~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA 465 (634) |++.+|-+.-.. ..|-|.||.+.|.. 
+|..+.+ ..++..|++|.+++|+++|+....|-| T Consensus 334 ln~kLl~~~~~~-----~g~~~~fd~~~l~~-~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD----------- 396 (432) T protein:vir:10 334 MTYKLFLDSELD-----KGFYSKFNVDAILR-ADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD----------- 396 (432) T ss_pred HHHhhcChhhcC-----CCcEEEeechhhhc-CCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----------- Confidence 999988663322 33557899999854 3444433 448889999999999999996432211 Q ss_pred HHHhhcCcccch-hhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCC Q lcl|NC_011057. 466 QDAVSKDPTLIP-MLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGE 517 (634) Q Consensus 466 ~d~v~~dp~Li~-~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~ 517 (634) . -+++ .+.|+= . .. .+...+++++.+....+.++. T Consensus 397 --~-----~~~~~n~~~~~-----~--~~---~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 397 --R-----LLVNGNMLPID-----M--AG---QAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred --e-----Eeecccccchh-----h--cc---ccccCCCCCCCCCCCCCCCCC Confidence 0 0111 001110 0 00 011122322222222211111 No 33 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=99.63 E-value=2.8e-16 Score=105.88 Aligned_cols=402 Identities=16% Similarity=0.138 Sum_probs=222.8 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.==.-+ |+.+.... .+......... 
...+ .......-.+.+ + ..+-++-.+.-+++.||++.+ T Consensus 1 Mg~f~~~---~~~~~~~~--~~~~~~~~~~~------~~~~--~~~~~~~~~~~~--~-~~~~v~~~i~~ia~~ia~~~~ 64 (406) T protein:vir:95 1 MGLFDRW---RRTKRKSK--IRADTGYVGLF------MSGE--DVSFLVPGYVRL--S-DNPEVRMAVHKIADLISSMTI 64 (406) T ss_pred Ccchhhh---cccccccc--ccccchhhhhh------ccCc--ccCccccCHHHH--h-hcHHHHHHHHHHHHhhccCce Confidence 5433333 23332211 11111110000 0000 000000111111 2 347777888889999999999 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEE-EEecCCCCCCCccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVI-LTRPVKGAPAQPDGSVRT 159 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~i-l~rp~~~~~~~~dg~~~~ 159 (634) .+-+.+++ |....+ + ........-+..-+...++++.++.+|-+-|+++..+ ..|..+ | + T Consensus 65 ~~~~~~~~-----~~~~~~---~-~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~-------g--~- 125 (406) T protein:vir:95 65 YLMQNTED-----GDIRIR---N-ELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTAD-------G--L- 125 (406) T ss_pred EEEEecCC-----cceeec---c-hHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCC-------C--c- Confidence 99888866 222211 2 2333345667777899999999999988877765443 334222 2 1 Q ss_pred chhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHH Q lcl|NC_011057. 160 RQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASK 239 (634) Q Consensus 160 ~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~ 239 (634) ...++.|....+......++-.+.. +|. +|. 
..|++.--.++++.+...--||+..+.+.+.-.....+...+..+ T Consensus 126 ~~~l~~i~~~~v~~~~~~~~~~~~~-~~~--~~~-~~evih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ 201 (406) T protein:vir:95 126 IDELVPLTPSKVNFLDTPDGYQVLY-GGQ--TFN-YDEVLHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMS 201 (406) T ss_pred EEEEEEEcCceeEEEEcCCeEEEEe-ccE--EEc-hhHEEEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHh Confidence 2335555555444333333322221 221 222 234443335778877777889999999988888888888888888 Q ss_pred hHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_011057. 240 SRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDV 319 (634) Q Consensus 240 SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~i 319 (634) .-....|||-+|+.++-. ..+++.+-+. ..+. ++--+--++|+...++....+ T Consensus 202 ng~~~~~il~~~~~l~~e---------------------~~~~~~~~~~----~~~~--g~~n~~~~~v~~~~~~~~~~~ 254 (406) T protein:vir:95 202 GKYMPSLIVKVDAATAEL---------------------SSEEGRNAVF----KKYL--QATEAGQPWIIPAELLEVEQV 254 (406) T ss_pred ccCCcceEEEeCCCCCHH---------------------HHHHHHHHHH----HHhc--cccccCCceeecCCCcccccc Confidence 888888999888765421 2233333332 1221 222233344454454444433 Q ss_pred ceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 320 KHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLA 399 (634) Q Consensus 320 kHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~ 399 (634) +.+.. .+.--+++|+..+..||+.|.||| .|||.+++ |. 
+....-++..|.|.++.|+++|++.+|.+ T Consensus 255 ~~~~~---~d~q~~e~~~~~~~~Ia~~fgVp~-~~lg~~~~-~~----~~~~~~~~~~l~P~~~~ie~~l~~~l~~~--- 322 (406) T protein:vir:95 255 KPLSL---KDIAINEAVELDKRTVAGMFGVPA-FLLGIGEF-NR----DEYNNFINSTILPIAKGIEQELTRKLLIS--- 322 (406) T ss_pred ccCCh---hHHHHHHHHHHHHHHHHHHhCCCH-HHcCCCCc-hH----HHHHHHHHHHHHHHHHHHHHHHHHhcCCC--- Confidence 33322 233456899999999999999999 77787543 32 23334678889999999999999888753 Q ss_pred hcCCChhHheeeecCcccccCCCchHH---HHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccc Q lcl|NC_011057. 400 REGIDPSKYVVWYDASQLTIDPDKSDE---AKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLI 476 (634) Q Consensus 400 ~eG~d~~~yV~w~DaS~L~~~pd~t~e---A~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li 476 (634) .+|.|+||.+.|.. .|..+. ...+++.|++|.+++|+++|+....+-| ..-....+ T Consensus 323 ------~~~~~~fd~~~l~~-~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~gd-------------~~~~~~n~- 381 (406) T protein:vir:95 323 ------PDLYFKFNPRSLYA-YDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEGLS-------------ELVILENY- 381 (406) T ss_pred ------CCcEEEeechhhhc-CCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-------------eeeeccCc- Confidence 45789999999943 343333 3568999999999999999996432211 00000011 Q ss_pred hhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCC Q lcl|NC_011057. 477 PMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTE 523 (634) Q Consensus 477 ~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe 523 (634) .|+-. ... ..... 
++++++++-.+| T Consensus 382 ---~~~~~-------~~~--~~~~k----------~g~~~~~~~~~~ 406 (406) T protein:vir:95 382 ---IPLDK-------IGD--QSKLK----------GGDNSGADGQTD 406 (406) T ss_pred ---cchhh-------ccc--ccccC----------CCCCCCCCCCCC Confidence 11100 000 00001 111111111111 No 34 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=99.63 E-value=8.3e-16 Score=103.26 Aligned_cols=412 Identities=13% Similarity=0.108 Sum_probs=218.7 Q ss_pred CC--CCCcceeEeccCCCCcc-----chhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhh Q lcl|NC_011057. 1 MA--ATQSLRLVRRPKGGRPA-----PSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRAS 73 (634) Q Consensus 1 ~~--a~~~lr~vrrp~g~~~a-----~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~ 73 (634) |- -.++- +.|.|.... .++.+..-+.+++.-...+...+ .+ +..+ -.+.+-.++-+.-.|.-+++ T Consensus 1 ~~~~~~~~~---~~~~~~~~~~~~lf~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~v---s~~~al~~~~v~~cv~~Ia~ 72 (424) T protein:vir:45 1 MLYCWWAHW---LWPEGGRVLLDALFRSKSLENPSTPITGDAVDTDGLF-RA-DVYV---SPETAMKLAAVYSCIYVLSS 72 (424) T ss_pred CeeEeeece---ecCcchhHHHHhhccccCCCCCccccchhhhhhhccc-cC-Ccee---chHHhhccHHHHHHHHHHHH Confidence 10 00110 111110000 00111111112111000000000 00 0000 00111123344455667889 Q ss_pred ceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCc Q lcl|NC_011057. 74 SCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQP 153 (634) Q Consensus 74 s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~ 153 (634) ++|.+-+..=+-+ + |+..+ ..+ ..+.+++..-+..-+...++.+.++.+|.+-|+.|+.| .|... 
T Consensus 73 ~iA~lp~~v~~~~-~-----~~~~~-~~~-~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i-~r~~~------ 137 (424) T protein:vir:45 73 SLAQMPLHVMRRH-K-----GKVEP-ARD-HPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWV-KRNRR------ 137 (424) T ss_pred HHhhCceEEEEec-C-----Cceee-ccc-chHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEE-EEcCC------ Confidence 9998877655544 3 23332 111 23455556667778899999999999999999999875 34332 Q ss_pred ccccccchhceeccHHHHhccCCCcceeeEeC-CCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhH Q lcl|NC_011057. 154 DGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLP-TGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTK 232 (634) Q Consensus 154 dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP-~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk 232 (634) |. .-.++.+....+.....++...+.+- ++....|.. .| +|++=.+.+ .-..--||+..+.+.+.--.-..+ T Consensus 138 -G~---~~~L~~l~~~~v~i~~~~~~~~y~~~~~~~~~~~~~-~e-Vih~r~~~~-d~~~G~spi~~~~~~i~~~~~~~~ 210 (424) T protein:vir:45 138 -GE---VISLDCCMPWETTLMNTGGRYTYGLYNEYGAFAISP-DD-MIHIRALGN-NQKMGLSPIMQHAETIGMGMSGQK 210 (424) T ss_pred -Cc---EEEEEEecCceEEEEEcCCeEEEEEEecCceEEECc-cc-EEEecCcCC-CCcccccHHHHHHHHHHHHHHHHH Confidence 32 22355554444443333444444333 222333333 33 455534443 234567888877777665555555 Q ss_pred HHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_011057. 
233 TIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVP 312 (634) Q Consensus 233 ~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP 312 (634) ...+..+.-..-.|||-+|+.++- -..+.+++.+-+.-+-..++.+ -++|+ + T Consensus 211 ~~~~~f~ng~~p~gil~~~~~l~~---------------------e~~~~~~~~~~~~~~g~~~n~g-----~~~vl--~ 262 (424) T protein:vir:45 211 YTESFFSGNARPAGIVSVKSGLNK---------------------ESWGWLKDQWQKASQALRRQEN-----KTMLL--P 262 (424) T ss_pred HHHHHHhccCCccEEEEeCCCCCH---------------------HHHHHHHHHHHHHhccccccCC-----ceeEc--C Confidence 555555555555678888776531 1344555555422222112222 22333 2 Q ss_pred hHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHH Q lcl|NC_011057. 313 GEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQ 392 (634) Q Consensus 313 ~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~ 392 (634) ..-+++.|.+.. .+.--+++|+..+..||..|.||| .+||...++|+.++.|....=++..|.|.++.|+++|++. T Consensus 263 --~g~~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~k 338 (424) T protein:vir:45 263 --ADLDYKALTVSP-VDAQIIDMMKLNRSMIAGIFNIPA-HMINDLEKATFSNISAQAIQFVRYTMMPWVTNWEQELNRR 338 (424) T ss_pred --CCceEEEccCCh-hHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 345666666543 222348999999999999999999 6777766789999999999999999999999999999999 Q ss_pred HHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHh Q lcl|NC_011057. 393 ILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAV 469 (634) Q Consensus 393 ~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v 469 (634) +|.+.=. ...|-|.||.+.|.. 
.|..+.+ ..++..|++|.+++|++.|+..-.+-| T Consensus 339 Ll~~~e~-----~~g~~i~fd~~~llr-~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~ggD--------------- 397 (424) T protein:vir:45 339 LFTRAEL-----AAGYYVRFNLTGLLR-GTPQERAQFYHFAITDGWMSRNEARAFEDMNPVEGLD--------------- 397 (424) T ss_pred cCChhhh-----cCCcEEEeechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc--------------- Confidence 9876321 134678899999843 3444333 348889999999999999996422211 Q ss_pred hcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCC-CCCccccCCCCCCCCCC Q lcl|NC_011057. 470 SKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGN-EDTSDDDNLDDGEHEPD 521 (634) Q Consensus 470 ~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~-~~~~~d~~~~~~~~ePD 521 (634) --+. |+ ....+.++ .+..+++ +++.| T Consensus 398 ---~~~~----~~--------------n~~~~~~~~~~~~~~~-----~~~~~ 424 (424) T protein:vir:45 398 ---EMLV----SV--------------NAANPAGDFKPPKNDE-----GKTNE 424 (424) T ss_pred ---eeee----cc--------------cccccccccCCCCCCC-----CCCCC Confidence 0011 11 00001000 1111110 01001 No 35 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=99.63 E-value=6.6e-16 Score=103.81 Aligned_cols=401 Identities=13% Similarity=0.108 Sum_probs=228.1 Q ss_pred CCCCCcce------eEeccCCCCccch--hhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhh Q lcl|NC_011057. 1 MAATQSLR------LVRRPKGGRPAPS--RALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRA 72 (634) Q Consensus 1 ~~a~~~lr------~vrrp~g~~~a~~--ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~ 72 (634) |.--+++| +.|+.+....... .....+..+..++........ .+. 
--.++ ..+-+.-.+.-++ T Consensus 1 ~~~~~~~~~~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-------~~~~~-~~~~v~~cI~~ia 71 (413) T protein:vir:96 1 MPGVSEIRKDKNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKELI-SDG-------YTKLS-DSPEVRMAVDCIA 71 (413) T ss_pred CCccchhhhhhcCCccccCCCcchhhhhhccccccccccccchhhHhhhc-cch-------hHHHh-hchHHHHHHHHHH Confidence 55555555 3344433211111 111111111111111111100 000 00112 2355666678899 Q ss_pred hceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCC Q lcl|NC_011057. 73 SSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQ 152 (634) Q Consensus 73 ~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~ 152 (634) +++|++.+..-+-+.| |....+ + -+..++..=...-+...++++.++.+|-+-|++|+.+. |... T Consensus 72 ~~ia~~~~~~~~~~~~-----~~~~~~---~-~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~-r~~~----- 136 (413) T protein:vir:96 72 DLVSNMTIQLMQNGET-----GDKRIK---N-DLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVK-PQVS----- 136 (413) T ss_pred HhhccCceEEEEecCC-----Cccccc---c-HHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEE-EcCC----- Confidence 9999998888777755 232222 2 24444555567778899999999999999999999864 4322 Q ss_pred cccccccchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEe-eCCCcccccCCccchhhhhHHHHHHHhhh Q lcl|NC_011057. 153 PDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRV-WIPKPRKASEPDSPVRAVLDSIREIVRTT 231 (634) Q Consensus 153 ~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~Rv-W~P~prra~eaDSPvra~l~~LrEI~rtt 231 (634) |. ..-..+.+....|.....++...+....+. .+|. ..|++ ++ ++|++..-..--||+.++.+.+.-..-.. 
T Consensus 137 --g~--~~~~L~~l~~~~v~~~~~~~~~~y~~~~~~-~~~~-~~evi-h~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~ 209 (413) T protein:vir:96 137 --GD--KIIGLTPISPYKVTFNVSDDDLDYSITFDN-KEYD-PSTLL-HFVLNPSIERPFIGTGYKVALKDIVGNLKQAS 209 (413) T ss_pred --CC--ceEEEEEecCceeEEEEcCCeEEEEEeecC-cEEc-hhhEE-EEeccCCCCCccccccHHHHHHHHHHHHHHHH Confidence 21 112345555555544433444444443222 2342 24443 43 57888777778899999988888888888 Q ss_pred HHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_011057. 232 KTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGV 311 (634) Q Consensus 232 k~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~v 311 (634) +...+..+.-..-.|||-+|+.++- -..+++++.+. ..+.-.+. +-=++|+.. T Consensus 210 ~~~~~~~~ng~~p~gil~~~~~l~~---------------------e~~~~~~~~~~----~~~~g~~n--~g~~~vl~~ 262 (413) T protein:vir:96 210 VTKKGFMASEYMPNLIVSVDSDSDE---------------------LSDEEGRENFE----EMYLKRKE--AGKPWIIPE 262 (413) T ss_pred HHHHHHHhccCCccEEEEeCCCCCH---------------------HHHHHHHHHHH----HHhcCccc--cCceeeecC Confidence 8888888888888899988876541 12344444443 22222111 112233333 Q ss_pred chHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHH Q lcl|NC_011057. 312 PGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTD 391 (634) Q Consensus 312 P~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~ 391 (634) .+.-...++-+. -.+.--+++|+..+..||..|.||| .+||.+ +.|. 
+....-++..|.|.++.|+++|++ T Consensus 263 ~~~~~~~~~~~~---~~d~q~~e~~~~~~~~Ia~~fgVP~-~~lg~~-~~~~----~~~~~~~~~~l~P~~~~ie~~ln~ 333 (413) T protein:vir:96 263 GMVNVQQIKPLT---LNDLAINDAVTLDKKTVAGIFGVPA-FLLGVG-TYNK----DEFNNFINTKIMSIAQVIQQTYNK 333 (413) T ss_pred CcccccccccCC---hhHHHHHHHHHHHHHHHHHHhCCCH-HHcCCC-cchH----HHHHHHHHHHHHHHHHHHHHHHHH Confidence 332222222222 2234457899999999999999999 688875 4443 334456888999999999999999 Q ss_pred HHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHH Q lcl|NC_011057. 392 QILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDA 468 (634) Q Consensus 392 ~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~ 468 (634) .+|. ..|-|.||.++|. +.|..+.+ ..+++.|++|.+++|+++|+.-..+-| +-+ T Consensus 334 ~ll~----------~~~~~~fd~~~ll-~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~~gd----~~~------- 391 (413) T protein:vir:96 334 LIVE----------EDMYFSLNPRSLY-NYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPDAEMD----DLL------- 391 (413) T ss_pred hhCC----------CCcEEEEechhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc----eee------- Confidence 8763 2467899999984 34544444 358889999999999999996543322 100 Q ss_pred hhcCcccchhhhhhhhhhhhcccCCCCCC Q lcl|NC_011057. 
469 VSKDPTLIPMLAPLIAGVLQQIEFPQQQQ 497 (634) Q Consensus 469 v~~dp~Li~~laPll~p~~q~~~~P~p~~ 497 (634) +.+| +.|+-+ ....-..+...+ T Consensus 392 ~~~n------~~~~~~-~~~~~~~~~~dt 413 (413) T protein:vir:96 392 VLEN------YLQQKD-LVNQKKLIQDET 413 (413) T ss_pred eccc------ccchhh-cccccCCCCCCC Confidence 0011 122211 000001112212 No 36 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=99.62 E-value=4.7e-16 Score=104.60 Aligned_cols=404 Identities=12% Similarity=0.072 Sum_probs=222.7 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.==.- |.+..+... .. -.+++|. .+.+. +... +-.+ ..+ ..+-+.-.+.-+++++|.+.+ T Consensus 1 MG~~~~--~~~~~~~~~--~~-------~~~~~~~-~~~~~---g~~~-~~~~--~al-~~~~V~~~v~~Ia~~iA~lp~ 61 (411) T protein:vir:81 1 MGWWSR--LTRFFRPRN--ET-------VDMTNPL-LLQWL---GVDP-DTPR--NQL-SEATYFACLKILSESLGKLPL 61 (411) T ss_pred CchHHH--HHhhccCcc--cc-------cccchHH-HHHHh---cCcc-cChh--hhh-ccHHHHHHHHHHHHhHhhCce Confidence 432221 212111111 11 1122222 11111 1000 1000 011 234456677788999999999 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..=+-++| |.... .+ ..+..+.+.=+.--+...++++.++.+|.+-|++|+.+. |.++ .+.+-..-. 
T Consensus 62 ~~~~~~~~-----~~~~~--~~-~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~-r~~g----~~~~l~~l~ 128 (411) T protein:vir:81 62 KMYQKTER-----GIVKS--DR-EELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQ-YSGP----QLQALWILP 128 (411) T ss_pred eEEEecCC-----ceeee--cc-cHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEE-ecCC----ceEEEEEEC Confidence 88888866 33321 11 234444555566778999999999999999999998763 4321 122110011 Q ss_pred hhceeccHHHHhccCCCcc--eeeEeC-CCCcccccCCCCeEEEe-eCCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSG--TNIVLP-TGEEHEFVKGTDIIFRV-WIPKPRKASEPDSPVRAVLDSIREIVRTTKTIAN 236 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~--~~i~lP-~g~~h~~~~~~D~~~Rv-W~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 236 (634) ..++.+..++-........ ..+..+ +|...+|.. .| +|++ +++ +..-..-.||+.++...+.-..-..+...+ T Consensus 129 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~e-iih~k~~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 205 (411) T protein:vir:81 129 SQYVTIVVDDRGLLGEKNAIWYRYNDPYDGKMYVFRN-DE-ILHFKTSV-TFDGITGLSVRDVLKHTVDGALESQKFMNN 205 (411) T ss_pred CceEEEEEcCcccccccceEEEEEEecCCceEEEEcc-cc-EEEEcCCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 1222222221111111111 124444 454444433 33 3444 332 223345778888887777766666666666 Q ss_pred HHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_011057. 237 ASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQI 316 (634) Q Consensus 237 a~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi 316 (634) ..+.-.+-.|||-+|+.++- -..+++++.+. ..+.-.+.+- -++++ + .. 
T Consensus 206 ~f~ng~~p~gil~~~~~l~~---------------------e~~~~~~~~~~----~~~~g~~n~g--~~~vl--~--~g 254 (411) T protein:vir:81 206 LYKTGLTGKAVLEYTGDLNQ---------------------EARDRLVKGFE----QFANGSKNAG--KIIPV--P--LG 254 (411) T ss_pred HHhccCCCceEEEeCCCCCH---------------------HHHHHHHHHHH----HHhcCccccC--Cceec--C--CC Confidence 66666666788877765431 14455555554 2222222221 22332 2 33 Q ss_pred cccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_011057. 317 KDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRV 396 (634) Q Consensus 317 ~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~ 396 (634) -+++-|.+.. .+.--+++|+.....||..|.||| .+||...++|+.++.+....-++..|.|.++.|+++|++.+|.+ T Consensus 255 ~~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~ 332 (411) T protein:vir:81 255 MKLVPLDIKL-TDSQFFELKKYTALQIAAAFGIKP-NQINDYEKSSYASAEAQNLAFYVDTLLYVLKQYEEEITYKILSN 332 (411) T ss_pred ceEEEccCCH-HHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCh Confidence 3555554432 233346889999999999999999 67787778999999999999999999999999999999999877 Q ss_pred HHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCc Q lcl|NC_011057. 397 TLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDP 473 (634) Q Consensus 397 ~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp 473 (634) ... ...|-|.||.+.|.. +|..+.+ ..++..|++|.++.|.++|+.-..+-| ..-.+. T Consensus 333 ~~~-----~~~~~~~fd~~~ll~-~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~ggD-------------~~~~~~ 393 (411) T protein:vir:81 333 DLI-----SQGHYFKFNVNVILR-ADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYGN-------------NLMANG 393 (411) T ss_pred hhc-----CCCcEEEeechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-------------eeeecc Confidence 442 245668899999843 3444333 458899999999999999996433222 000000 Q ss_pred ccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCC Q lcl|NC_011057. 
474 TLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDD 515 (634) Q Consensus 474 ~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~ 515 (634) . +.||=. +. +....++++ T Consensus 394 n----~~pl~~--------------~~------~~~~kgGd~ 411 (411) T protein:vir:81 394 N----YIPLSM--------------LG------ANYGKGGDS 411 (411) T ss_pred C----ccchhh--------------hh------hhhccCCCC Confidence 0 122200 00 000011111 No 37 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=99.62 E-value=1.8e-15 Score=101.46 Aligned_cols=411 Identities=16% Similarity=0.164 Sum_probs=214.2 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.=-.-|. .+|... +.. -+..+.++ .+......+..... ...| ..++-++-.+.=+++++|++-| T Consensus 1 Mg~~~~~~--~~~~~~--~~~-----~~~~~~~~--~~~~~~~~~~~~~~-~~~~---~~~~~v~~~i~~ia~~ia~lp~ 65 (423) T protein:vir:81 1 MGFLQKLG--LAPSVV--ATP-----EPIELVGP--IFESLKLSTKNMTV-EQIW---EDQPHLRTVTTFIARNVASLQL 65 (423) T ss_pred CchhHhhc--cccccc--cCc-----cccccccc--cccccccccchhhH-HHHH---HhhhHHHHHHHHHHHhHhhCce Confidence 33222221 112110 000 00011111 00000000000000 1112 2445566778889999999999 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..=+.+.| |..++ ..++ -+..+.. -+.--+...++++.++.+|-+-|+.|+.+ .|..+ . ++. . 
T Consensus 66 ~~~~~~~d-----g~~~~-~~~~-~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gna~~~i-~rd~~-~----~~~---~ 128 (423) T protein:vir:81 66 QAFERVED-----GGRER-VREG-HLARVCK-LANSDMTMYDLLERTMFDLCLYDEFFWLL-PGDLG-V----DTP---T 128 (423) T ss_pred EEEEEecC-----Cceee-eccc-hHHHHhh-cCCCCCCHHHHHHHHHHHHhhcCCeEEEE-EecCC-c----Ccc---e Confidence 88777777 33322 1222 2445555 37777899999999999999999999775 44322 1 111 0 Q ss_pred hhceeccHHHHhccCCC---cceeeEe-----CCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKG---SGTNIVL-----PTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTK 232 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~---~~~~i~l-----P~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk 232 (634) -..+.+....+...... ....+.. .+|....|.. .+ +|++=++++..-..--||+..+.+.+.=..-..+ T Consensus 129 ~~l~p~~~~~v~~~~~~~~~~~~~Y~~~~~~~~~g~~~~~~~-~e-vih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~ 206 (423) T protein:vir:81 129 LDIRPIPVSWVQRRAYKDGWGSLDYIIIESGDNDGRSVKVPG-ER-VIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAI 206 (423) T ss_pred EEEeecccceeeeeeccCCCcceEEEEEEecCCCceEEEEcc-cc-eEEecCCCCCCccccccHHHHHHHHHHHHHHHHH Confidence 11122222222111111 1122222 1343333322 23 4444466665555567776665554433333333 Q ss_pred HHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_011057. 
233 TIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVP 312 (634) Q Consensus 233 ~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP 312 (634) ...+..+.-..-.|||.+|+.+.-..- ..-+.+.+++-+- ..+.--.+- +--|+|+ + T Consensus 207 ~~~~~f~ng~~p~gvi~~~~~~~~~~l----------------~~e~~~~~~~~~~----~~~~~~~~n-~g~~~vl--~ 263 (423) T protein:vir:81 207 FRAQMWRNGPRPGMVIMRDPESKAGKW----------------DAESRTRFMANLR----ASFSPKSSD-VGGTLLL--E 263 (423) T ss_pred HHHHHHhccCCCceEEEecCcccCccC----------------CHHHHHHHHHHHH----HHhcccccc-CCcceec--C Confidence 333333323333467777766532110 1113333333332 222111111 2223333 2 Q ss_pred hHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHH Q lcl|NC_011057. 313 GEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQ 392 (634) Q Consensus 313 ~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~ 392 (634) ..-+++-|.+.. .+.--+++|+.....||..|.||| .|||...++|..+..+....=++..|.|.+..|+++|++. T Consensus 264 --~g~~~~~l~~s~-~d~q~~e~~~~~~~eIa~~fgVPp-~~lg~~~~~t~sn~e~~~~~f~~~~L~P~~~~ie~~l~~~ 339 (423) T protein:vir:81 264 --DGMKAENFHTTS-KDEQTVETTKLSLQTVAQVYGINP-TMVGQLDNANYSNVREFRKALYGDNLGSWIRIIQDVMNLF 339 (423) T ss_pred --CCceEEeccCCh-hhHHHHHHHHhhHHHHHHHhCCCH-HHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 334666666543 344456889999999999999999 7789877888888999998899999999999999999998 Q ss_pred HHHHHHHhcCCChhHheeeecCcccccCCCchHHH--HH-HH-HccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHH Q lcl|NC_011057. 393 ILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA--KF-AY-ENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDA 468 (634) Q Consensus 393 ~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA--~~-~~-~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~ 468 (634) +|.+ .+++...|.|.||.+.|. ++|..+.+ .+ +. 
..|++|.+++|+++|+....|=| + T Consensus 340 L~~~----~~~~~~~~~~~fd~~~ll-r~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~~gGD----~--------- 401 (423) T protein:vir:81 340 LLPR----VGIDNEKFYFEFNLEEKL-RASFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSIDGGD----D--------- 401 (423) T ss_pred hcCc----cccccCccEEEecchhhh-ccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCCCCcc----e--------- Confidence 8765 356667889999999984 34444333 22 23 45999999999999997533322 0 Q ss_pred hhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCC Q lcl|NC_011057. 469 VSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDT 522 (634) Q Consensus 469 v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDT 522 (634) ++ .| ..-. +++. ++ ..+++-|| T Consensus 402 ------~~---------------~p--~n~~-~~~~----~~----~~~~~~~t 423 (423) T protein:vir:81 402 ------LA---------------RP--LNTE-FGDS----ED----APGEEVET 423 (423) T ss_pred ------ee---------------cc--cccc-cCcc----CC----CCCCCCCC Confidence 00 11 0000 1000 00 11122222 No 38 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=99.62 E-value=1.1e-15 Score=102.55 Aligned_cols=405 Identities=14% Similarity=0.103 Sum_probs=219.0 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.--..=.+++|-|-.-. .+....-...++++..-...+..+. 
..+.+-.++-+.-.+.-++++||.+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~v--------~~~~a~~~~~v~~~i~~ia~~iA~lp~ 70 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLI--DNWIDQSTSKLYDFSPWKNRSFWGV--------INNTLETNETIFSAITKLSNSMASLPL 70 (412) T ss_pred CccchhhhhhhhhhhhHh--hhhhcccccccccccccCCcccccc--------chhhhhccHHHHHHHHHHHHhHhhCce Confidence 544333344444442110 0111111112222221111111110 112333556666677889999999988 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+-.. .+ ...+..+...=+.--+...++++.++.+|-+-|+.|+.+ .|. .+|. . T Consensus 71 ~~~~~~~-------~~------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i-~r~-------~~G~---~ 126 (412) T protein:vir:26 71 KMYEDYK-------VV------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLI-ERD-------IYHQ---P 126 (412) T ss_pred eEeeccc-------cc------cchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEE-EEC-------CCCc---E Confidence 7655221 11 123455555557778899999999999999999999886 342 2332 2 Q ss_pred hhceeccHHHHhccCC--Cccee--eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNK--GSGTN--IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIAN 236 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~--~~~~~--i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 236 (634) .+++.|....+..... +...- +...+|....|.. +=+|++=++++..-..--||+.++...+.=...+.+. 
+ T Consensus 127 ~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~--~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~--~ 202 (412) T protein:vir:26 127 SKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHN--MDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF--N 202 (412) T ss_pred EEEEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEEcc--ccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHH--H Confidence 3577776665543322 22232 3344565555433 3345554555555555668876665544322222111 2 Q ss_pred HHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_011057. 237 ASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQI 316 (634) Q Consensus 237 a~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi 316 (634) ..+....+.||+-.|+.++ .-..+.+.+.+-+. +...+ - ++|+ + .. T Consensus 203 ~~~~~~~~~~i~~~~~~l~---------------------~e~~~~~~~~~~~~----~~~~g----~-~~vl--~--~g 248 (412) T protein:vir:26 203 LTEMQKPDSFMLKYGSNVG---------------------KEKRQQVLEDFKQY----YEENG----G-ILFQ--E--PG 248 (412) T ss_pred HHhcCCCCceEEecCCCCC---------------------HHHHHHHHHHHHHH----hhcCC----C-eeec--C--CC Confidence 2222222333333332221 12334444444322 22111 1 2222 2 33 Q ss_pred cccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_011057. 317 KDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRV 396 (634) Q Consensus 317 ~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~ 396 (634) -+++.|.+... 
+.--+++|+..+..||..|.||| .|||.+.++|..++.+....-++..|.|.+..|+++|++.+|-+ T Consensus 249 ~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~afgVPp-~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kLl~~ 326 (412) T protein:vir:26 249 VEIEPLPKKYV-SEDIVASENLTRERVANVFQLPS-VFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTK 326 (412) T ss_pred ceEEEcCCChh-HHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Confidence 45566654432 23348899999999999999999 77787788899999999999999999999999999999988764 Q ss_pred HHHhcCCChhHheeeecCcccccCCCchH---HHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCc Q lcl|NC_011057. 397 TLAREGIDPSKYVVWYDASQLTIDPDKSD---EAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDP 473 (634) Q Consensus 397 ~L~~eG~d~~~yV~w~DaS~L~~~pd~t~---eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp 473 (634) . ++. ..|.|-||.+.|..- |..+ ....++..|++|.++.|+++|++.-.+-| . T Consensus 327 ~----~~~-~~~~~~fd~~~l~~~-d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD-------------~----- 382 (412) T protein:vir:26 327 T----DRE-KNRYFKFNVKSYLRA-DSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD-------------K----- 382 (412) T ss_pred c----ccc-CcceEEeechhhhcc-CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-------------e----- Confidence 2 221 346688999997542 3322 23458899999999999999996533222 0 Q ss_pred ccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccc Q lcl|NC_011057. 474 TLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDD 510 (634) Q Consensus 474 ~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d 510 (634) +++..-+..++-|.-......+|+++..+. 
T Consensus 383 -------~~~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 383 -------PLISGDLYPIDTPLELRKSLKGGDKNVNES 412 (412) T ss_pred -------eeecccccccccchhhcccccCCCCCcCCC Confidence 000001111122211111122222222211 No 39 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=99.61 E-value=1.9e-15 Score=101.28 Aligned_cols=401 Identities=15% Similarity=0.148 Sum_probs=204.3 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.==.+ |... ..+....+.+-.. .++.. .....+.+| ++-+.-.+.-+++++|++.| T Consensus 1 m~~f~~----~~~~--~~~~~~~~~~~~~--~~~~~------~~~~~~Al~---------~~~V~~~i~~Ia~~iA~lp~ 57 (406) T protein:vir:97 1 MSFFQP----LGTS--KVSYDDYISSVLA--GDVSQ------KYLGVSALK---------NSDILTATSIIAGDIARFPL 57 (406) T ss_pred Cccccc----cCCC--CCCcchHHHHHhc--CCCCc------ccccchhhc---------cHHHHHHHHHHHHhhhhCee Confidence 332211 1111 1011111111100 00000 011112222 13333356778999999999 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) +.-.- | |.+..+ ..+..+.+.-+...+...++++.++.+|.+-|++|+.+ .|..+ .|. . 
T Consensus 58 ~~~~~--~-----g~~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i-~r~~~------~g~---~ 116 (406) T protein:vir:97 58 VKKDV--N-----GDIIHD----EDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRI-LRDPK------TNQ---A 116 (406) T ss_pred EEEec--C-----cccccc----chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEE-EecCC------CCe---E Confidence 76433 3 333332 23445555556788899999999999999999999986 34321 121 2 Q ss_pred hhceeccHHHHhccCC-CcceeeEe--C-CCCcccccCCCCeE-EEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNK-GSGTNIVL--P-TGEEHEFVKGTDII-FRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIA 235 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~-~~~~~i~l--P-~g~~h~~~~~~D~~-~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 235 (634) ..|+.+....+..... +....+.. + +|....| +..|++ ||.. +.+-..--||+.++.+.+.-..-..+... T Consensus 117 ~~L~~i~p~~v~v~~~~~~~~~y~~~~~~~~~~~~~-~~~evih~r~~---~~dg~~G~spi~~~~~~i~~~~a~~~~~~ 192 (406) T protein:vir:97 117 LQFQFYRPSETTVEETDNHEIVYTFTDMLTAKQVKC-FAHDVIHWKFF---SHDTILGRSPLLSLGDEIDLQTGGINTLI 192 (406) T ss_pred EEEEEECCCeeEEEEcCCceEEEEEEecCCceEEEE-ccccEEEecCC---CCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 3466666555543222 22233333 3 3444333 334432 4432 22223456776655555443333333332 Q ss_pred HHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHH Q lcl|NC_011057. 236 NASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQ 315 (634) Q Consensus 236 na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Eh 315 (634) +++.||. .|..+..+..+ ...-..+++++.+- ..+...+ +- -|+|+ +. 
T Consensus 193 -----~~f~ng~--~~~~i~~~~~~--------------l~~e~~~~~~~~~~----~~~~g~n--~g-~~~vl----~~ 240 (406) T protein:vir:97 193 -----KFFKDGF--SSGILTMKGAQ--------------LSGDARQRARQEFE----KMREGSV--GG-SPLVF----DS 240 (406) T ss_pred -----HHHhccC--CCceEEecCCC--------------CCHHHHHHHHHHHH----HHhcccc--cC-ceeec----CC Confidence 3445553 23333322111 12235566666654 2233221 11 22333 24 Q ss_pred hcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHH Q lcl|NC_011057. 316 IKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILR 395 (634) Q Consensus 316 i~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr 395 (634) ..+++.|.+..+... -+++|+..+..||..|.||| .+||. ..+..+..+...+=++..|.|.+..|.++|++.+|. T Consensus 241 g~~~~~l~~~~~d~q-~le~~~~~~~~Ia~afgVPp-~~lg~--~~~~~~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~ 316 (406) T protein:vir:97 241 TMEYTPLEIDTNVLQ-LITSNNFSTAQIAKALRVPS-YKLGV--NSPNQSVAQLMEDYVTNDLPFYFDAITSELGLKTLN 316 (406) T ss_pred CceEEEccCCHHHHH-HHHHHHhhHHHHHHHhCCCH-HHcCC--CCCcchHHHHHHHHHHHHHHHHHHHHHHHHhhhhcC Confidence 557777776544333 46899999999999999999 77785 334456778887777888999999999999998876 Q ss_pred HHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCccc Q lcl|NC_011057. 396 VTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTL 475 (634) Q Consensus 396 ~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~L 475 (634) +.- . ..|.|-||.+.++.. 
+.++...+++.|++|.+++|.++|+....+.+-+..- +..| T Consensus 317 ~~~----~--~~~~i~fd~~~~~~~--~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~---------~~~n--- 376 (406) T protein:vir:97 317 DKD----R--RLYHIEFDTRSVTGR--NVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQ---------SSLN--- 376 (406) T ss_pred hhh----c--cceeEEEecCccchh--hHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEe---------eccC--- Confidence 522 1 357788898765432 2445567899999999999999999753332211100 0000 Q ss_pred chhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCC Q lcl|NC_011057. 476 IPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDT 522 (634) Q Consensus 476 i~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDT 522 (634) +.|+ +..+- ++..... ...+++.++++.++ T Consensus 377 ---~~~~-----~~~~~--~~~~~~~-------~~~gg~~~~~~~~~ 406 (406) T protein:vir:97 377 ---YVFL-----DKKEE--YQDKVGI-------KGKGGEVNAEEDKS 406 (406) T ss_pred ---ccch-----hcccc--ccccccc-------ccCCCCCCCCCCCC Confidence 0111 11111 1100000 00111111111111 No 40 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=99.60 E-value=2.6e-16 Score=106.03 Aligned_cols=372 Identities=12% Similarity=0.105 Sum_probs=204.0 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.==..+. ..|.+..... .+ .++.-. 
...+++....|=+ ++-+-..+-+.-.+.-+++.||.+.+ T Consensus 1 Mg~~~~~~---~~k~~~~~~~-------~~-~~~~~~--~~~~~~~~~~~v~--~~~~l~~~~v~~~i~~ia~~ia~~~~ 65 (383) T protein:vir:10 1 MGLLTPKN---FSKRNAKNMV-------YP-SNPAFF--TTTVGGMQLSYVS--ALSALQNTNVYSVINRIASDVSSAHF 65 (383) T ss_pred CCcccccc---cccccccccc-------cc-cchhhh--hhhccCccccccc--hhHhhcchHHHHHHHHHHHhhccCce Confidence 33221110 1111111000 00 000000 0000000111100 01111234455567778999999887 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+-.. ..+.+. +---+...++++.++.+|-+-|++|+.+. |. +. T Consensus 66 ~~~~~~~-------------------~~ll~~-PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~-~~-------~~------ 111 (383) T protein:vir:10 66 KTENTAT-------------------LNRLES-PSSLIGRFSFWQGALMQLCLSGNDYIPLV-GQ-------NL------ 111 (383) T ss_pred eecccch-------------------hhhhhC-CCCCCCHHHHHHHHHHHhhhcCCeEEEEE-cC-------ce------ Confidence 5432111 111221 33346788999999999999999999874 21 11 Q ss_pred hhceeccHHHHhccCCCccee--eEeC-CCCcccccCCCC-eEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTN--IVLP-TGEEHEFVKGTD-IIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIAN 236 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~--i~lP-~g~~h~~~~~~D-~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 236 (634) .++.++...|.....+++.. +... +|...+|.. 
.| +-||.++|.......--||+.+|...+.=...+.+...+ T Consensus 112 -~~~p~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~-~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~ 189 (383) T protein:vir:10 112 -EHIPNSDVQINYLPGNMGIVYTVLESNDRPKMVLRQ-DQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMS 189 (383) T ss_pred -eEeecCcceEEEEEcCCceEEEEEEcCCceEEEEcc-cceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 13333333333222222222 2223 454555544 44 446777776666566679999999888877777777766 Q ss_pred HHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_011057. 237 ASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQI 316 (634) Q Consensus 237 a~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi 316 (634) ..+.-.+-.|||.+|+.++=+ -..+.+++.+-+. +.-.++- -|+++. .. T Consensus 190 ~f~ng~~~~~il~~~~~~~~~--------------------e~~~~~~~~~~~~----~~~~n~~---~~~vl~----~g 238 (383) T protein:vir:10 190 AMENQINPAGKLTISNYLSDG--------------------KDLESAREEFEKA----NTGDNSG---RLMVLP----DG 238 (383) T ss_pred HHhccCCcceEEEeCCCCCCH--------------------HHHHHHHHHHHHH----hCccccC---CccccC----CC Confidence 666667777888887654311 1344555555322 1111111 223332 34 Q ss_pred cccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhcccc--CcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHH Q lcl|NC_011057. 317 KDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGS--QTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQIL 394 (634) Q Consensus 317 ~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs--~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~l 394 (634) .+++.|.+.......-.++|+..++.||.+|.||| .+||.+. +.++.++-++.. 
.+...|.|.++.|++.|++.+| T Consensus 239 ~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp-~~lg~~~~~~~~~sn~eq~~~-~~~~~l~P~~~~ie~~l~~~l~ 316 (383) T protein:vir:10 239 FDYTQLEMKTDVFKALADNSAYSADQISKAFGVPS-DILGGGTSTESQHSNIDQIKA-TYLANLNSYVNPIVDELRLKMN 316 (383) T ss_pred ceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCH-HHcCCccCCCCccccHHHHHH-HHHHHHHHHHHHHHHHHHHhhC Confidence 56777766554444445899999999999999999 7888643 456667777754 4566899999999999998775 Q ss_pred HHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhc Q lcl|NC_011057. 395 RVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSK 471 (634) Q Consensus 395 r~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~ 471 (634) . +-+.||.+.|.. .|..+.+ ..+++.|++|.+++|+++|+.--.+-| T Consensus 317 ~------------~~~~f~~~~l~~-~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d----------------- 366 (383) T protein:vir:10 317 A------------PDLELDIKDMLD-VDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDN----------------- 366 (383) T ss_pred C------------ceEEeechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCc----------------- Confidence 3 236788888742 2444333 458899999999999999984211111 Q ss_pred CcccchhhhhhhhhhhhcccCCCCC--CCCCCCCCCC Q lcl|NC_011057. 472 DPTLIPMLAPLIAGVLQQIEFPQQQ--QAIDSGGNED 506 (634) Q Consensus 472 dp~Li~~laPll~p~~q~~~~P~p~--~a~~~~~~~~ 506 (634) .|.+. 
.....+|+++ T Consensus 367 --------------------~~~~~~~~~~~~gGd~e 383 (383) T protein:vir:10 367 --------------------LPEFKPLTNETKGGDDK 383 (383) T ss_pred --------------------ccccCCCcccCCCCCCC Confidence 11111 1111112111 No 41 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=99.60 E-value=4.6e-15 Score=99.18 Aligned_cols=394 Identities=14% Similarity=0.112 Sum_probs=196.4 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhh-cccCccccccHH-HHHHHhhhhhHHHHHhhhhhceeee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKS-TGISRNSDWQTD-AWEAVDLVGELRYYVGWRASSCSRC 78 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~-~~~~~~~~WQ~e-AW~~yd~VgELryyvgWr~~s~Sr~ 78 (634) |.--+-|+ ..-+|++..-.. +..+.+.+|... -.+.|-.++.+.--|..+++++|.+ T Consensus 1 mg~~~~~~---------------------~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~ 59 (403) T protein:vir:10 1 MGFKSWIT---------------------EKLNPGQRIIRDMEPVSHRTNRKPFTTGQAYSKIEILNRTANMVIDSAAEC 59 (403) T ss_pred Ccchhhhh---------------------hccchhhhhhhcccccccccCCcccccHHHHHHHHHHHHHHHHHHHHHhhC Confidence 43222221 111222111000 001111111111 0122234567777788999999998 Q ss_pred eEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccc Q lcl|NC_011057. 79 RLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVR 158 (634) Q Consensus 79 rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~ 158 (634) -+...+-... +.-.-.+.. 
..+..+...=..--+...++.+.++.++-+-|++||.+- ++ T Consensus 60 p~~v~~~~~~-~~~~~~~~~-----~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~---~~----------- 119 (403) T protein:vir:10 60 SYTVGDKYNI-VTYANGVKT-----KTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWD---GT----------- 119 (403) T ss_pred ceeEeecccc-ccccccccc-----chHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe---Cc----------- Confidence 7655432111 000011111 223344454577788899999999999999999998751 11 Q ss_pred cchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEee--CCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_011057. 159 TRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVW--IPKPRKASEPDSPVRAVLDSIREIVRTTKTIAN 236 (634) Q Consensus 159 ~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW--~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 236 (634) ..+.+..+.+......++.....-.+....|..+.=+-||-. .+++.....--||+.++...+.-.....+...+ T Consensus 120 ---~l~~l~~~~~~v~~~~~~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 196 (403) T protein:vir:10 120 ---SLYHVPAALMQVEADANKFIKKFIFNNQINYRVDEIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEK 196 (403) T ss_pred ---eeEeecCcceEEEEcCCceEEEEEecCceeecccceEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 111121111111111112222222222222322111223322 234444444557877777766655555555444 Q ss_pred HHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_011057. 237 ASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQI 316 (634) Q Consensus 237 a~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi 316 (634) ..+.-..-.|||-.|+.++= -..+.+++-+- ..+.- +--+-=|+|+. .. 
T Consensus 197 ~f~ng~~~~gil~~~~~l~~---------------------e~~~~~~~~~~----~~~~g--~~n~g~~~vl~----~g 245 (403) T protein:vir:10 197 FLDNGTVIGLILETDEILNK---------------------KLRERKQEELQ----LDYNP--STGQSSVLILD----GG 245 (403) T ss_pred HHhccCCcceEEEeCCCCCH---------------------HHHHHHHHHHH----HHhCC--cccCcceeecC----CC Confidence 44444444566666654431 13444554433 22221 11112233332 23 Q ss_pred cccceeecCCchhHH-HHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHH Q lcl|NC_011057. 317 KDVKHIRFDNEITEV-AIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILR 395 (634) Q Consensus 317 ~~ikHl~f~~d~te~-aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr 395 (634) -+++.+.+.....+. -+++|+..+..||+.|.||| .|||-+ ++.+.-+....-++..|.|.+..|+++|++.+ T Consensus 246 ~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~---~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~~L-- 319 (403) T protein:vir:10 246 MKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQ-VLLDGG---NNANIRPNIELFYYMTIIPMLNKLTSSLTFFF-- 319 (403) T ss_pred ceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCH-HHcCCC---CCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhc-- Confidence 355655554433333 38999999999999999999 777754 44556777888899999999999999999854 Q ss_pred HHHHhcCCChhHheeeecCcccc-cCCCchHHH--H-HHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhc Q lcl|NC_011057. 396 VTLAREGIDPSKYVVWYDASQLT-IDPDKSDEA--K-FAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSK 471 (634) Q Consensus 396 ~~L~~eG~d~~~yV~w~DaS~L~-~~pd~t~eA--~-~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~ 471 (634) | |-|+||.+.+. .++|....+ + .+++.|++|.+++|.++|+..- +.+.+ T Consensus 320 ------~-----~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi-----~~~~~----------- 372 (403) T protein:vir:10 320 ------G-----YKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPL-----DDEQM----------- 372 (403) T ss_pred ------C-----ceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-----Ccccc----------- Confidence 2 45788988763 344544333 3 3788999999999999999641 11111 Q ss_pred CcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCC Q lcl|NC_011057. 
472 DPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEH 518 (634) Q Consensus 472 dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ 518 (634) |--++ |+= ++ ....+..+++..+.+.+ ..|+ T Consensus 373 d~~~~----p~n---~~-------~~~~~~~~~e~~~~~~~--~~g~ 403 (403) T protein:vir:10 373 NKIRI----PAN---VA-------GSATGVSGQEGGRPKGS--TEGD 403 (403) T ss_pred ccccc----ccc---cc-------cccccCCCCcCCCCCCC--cCCC Confidence 11111 110 00 01111111111111111 1111 No 42 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=99.59 E-value=2.8e-14 Score=94.87 Aligned_cols=507 Identities=15% Similarity=0.164 Sum_probs=230.5 Q ss_pred CCchhhhhhhhcccCcccccc--------HHHHHHHhhhhhHHHHHhhhhhceeeeeEEEeeecccCCCCCCCCCCCCcc Q lcl|NC_011057. 31 LPDPSQVFSKSTGISRNSDWQ--------TDAWEAVDLVGELRYYVGWRASSCSRCRLVASELDENTGLPTGGISEDNTE 102 (634) Q Consensus 31 itdp~~~~~~~~~~~~~~~WQ--------~eAW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~ 102 (634) ++ ..++. .+..+.|- .++| -.++-+.--|.-+++++|.+.|..- +.| |.+.++ T Consensus 1 ~~---~~~~~---~g~~~~~~~~~~~~~~~~~~---~~~~~V~acV~~Ia~~iA~lpl~l~--~~~-----~~~~~~--- 61 (723) T protein:vir:94 1 MT---TFPSG---AGGWNAWSADSVFGNGAKGW---SNSAVAYRCISMLANNAASVDLVVR--GPD-----GELDEL--- 61 (723) T ss_pred Cc---ccccC---CCccccccccccccccHHHH---hhhHHHHHHHHHHHHhhccceeEEE--cCC-----Cccchh--- Confidence 22 11111 11122231 1222 2345566677889999999988763 333 444432 Q ss_pred cHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhceeccHH--HHhccCCC--- Q lcl|NC_011057. 103 GERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKE--EIKKSNKG--- 177 (634) Q Consensus 103 g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~vt~~--Ei~~~~~~--- 177 (634) + -+..++..-..--+...++.+.+..+|..-|++|+.+. |... ...| ++.+-|+ +... 
.+.....+ T Consensus 62 ~-~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~r----~~~g--~p~~l~~-l~~~~~~v~~~~~~~~~ 132 (723) T protein:vir:94 62 H-PLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLN-YNGR----TPAG--VPDEIWY-VYDRVTTIVATRAADAV 132 (723) T ss_pred h-HHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEE-ecCC----cccc--ceeEEEE-ecCcceEEeecCCCccc Confidence 2 24555665677788889999999999999999999864 3221 1223 2323333 2211 11111111 Q ss_pred -----cceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecc Q lcl|NC_011057. 178 -----SGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPH 252 (634) Q Consensus 178 -----~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~ 252 (634) -+-.+...+|....|.. .| ||++=.++|..-..--||+..+...+ .+...+.+.. .++..||.. |. T Consensus 133 ~~~~~~~y~~~~~~G~~~~~~~-~d-IiHir~~~~~dg~~G~Spi~~a~~~i----~~~~aa~~~~-~~~f~NG~~--p~ 203 (723) T protein:vir:94 133 PQAQIIGYVIERTDGVRVPVLA-DE-MLWLRFSDPYDPLAVMAPWKAARAAV----DADFYAATWQ-RQSFKNGAR--PG 203 (723) T ss_pred eeeeeeEEEEEecCceeEEecc-cc-eEEecCCCCCCCcccccHHHHHHHHH----HHHHHHHHHH-HHHHhcCCC--cc Confidence 11224445666555543 23 33433334444445667776655444 4444444433 344455421 11 Q ss_pred c-ccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHH----hcccceeecCCc Q lcl|NC_011057. 253 E-MSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQ----IKDVKHIRFDNE 327 (634) Q Consensus 253 e-~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Eh----i~~ikHl~f~~d 327 (634) . ++.|. .+......+.+-+- ..+ .++.-+--|+|+...+.. -+.++.-.++-. 
T Consensus 204 giL~~~~----------------l~~e~~~~~~~~~~----~~~--~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s 261 (723) T protein:vir:94 204 GVVNLGD----------------MDEQTFTKTVAAFR----SQV--EGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMS 261 (723) T ss_pred eEEEcCC----------------CCHHHHHHHHHHHH----HHh--hchhhcCcceeecccccccccccCCceEEEccCC Confidence 1 12221 01123333333332 111 133345556676543211 123333333333 Q ss_pred hh-HHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChh Q lcl|NC_011057. 328 IT-EVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPS 406 (634) Q Consensus 328 ~t-e~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~ 406 (634) .. .--+++|+..+..||+.|-|||..|+| ++++.+..+....-++..|.|.++.|+++|++.+|. .+|. T Consensus 262 ~~D~q~le~r~~~~~eIa~afgVPp~~i~~---~st~sN~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~----~~g~--- 331 (723) T protein:vir:94 262 PAEMDYINSRMHSAEEVMLAFGIRKDALLG---GSTYENQAEAKAAVWTETLIPQMEVMASITDLQLLP----DIGW--- 331 (723) T ss_pred HHHHHHHHHHHHhHHHHHHHhCCChhHcCC---CCCcccHHHHHHHHHHHHHHHHHHHHHHHHhHhhcc----cccC--- Confidence 32 335799999999999999999965544 344555666666667899999999999999998874 3454 Q ss_pred HheeeecCcccc-cCCCchHHH-HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhh Q lcl|NC_011057. 407 KYVVWYDASQLT-IDPDKSDEA-KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIA 484 (634) Q Consensus 407 ~yV~w~DaS~L~-~~pd~t~eA-~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~ 484 (634) +|-|-||..+|. .|...--++ ..++..|++|.+++|+++|+..-.|-|. +.. |-|+-. 
T Consensus 332 ~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~----------------~~~----~~p~~~ 391 (723) T protein:vir:94 332 TVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIG----------------QMT----LTPYRA 391 (723) T ss_pred ceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcc----------------cce----eccccc Confidence 355778888764 332211222 4478899999999999999964332220 001 112211 Q ss_pred hhhhcccCCCCCCCCCCCCCCCCc----cccCCCCCCCCCCCCCCCCCc----ccCCCc-c-----HHHHHHHHHHHHHH Q lcl|NC_011057. 485 GVLQQIEFPQQQQAIDSGGNEDTS----DDDNLDDGEHEPDTEDDQDDD----GTQKAG-L-----ESGIVDLMVDRALE 550 (634) Q Consensus 485 p~~q~~~~P~p~~a~~~~~~~~~~----~d~~~~~~~~ePDTe~d~~~~----~~~~a~-~-----~~a~vdllv~rALe 550 (634) + +. |+|.++ |+.++... .-+..+.+.+.|.+...+.+. ..+.+. . ...+..+++.=|+. T Consensus 392 ~-~a----~~~~~~--p~~~e~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (723) T protein:vir:94 392 Q-FA----PAPAPA--PAVEEGAARMLALLERVAADRPLPELPVRATTVLHHDPGPDPQQTLYERLEALLQPLLVELGRR 464 (723) T ss_pred c-cc----CCCCCC--ccchhhhHhhhhhccccccccCcCCCCCCCCCCCCCCcccCCchhHHHHHHHHHhhhHHHHHHH Confidence 1 00 111111 11110000 000000001111111110000 000011 1 11233456666777 Q ss_pred HhhHHhhcC---ChhH------HHHhhCCChHHhhhh--cCCCChhH-HHHHHhcccccccHHHHHHhCCC--------H Q lcl|NC_011057. 551 LVGKRRRGR---DRET------LARLSGVRERDYHRY--MDPVPESE-VDRLMSGWDSALDDKILLRLGLD--------P 610 (634) Q Consensus 551 lAGkR~Rt~---~R~~------~arlr~ip~h~~h~~--~~Pv~~~~-v~rLi~GWd~~ld~~~~a~~g~D--------p 610 (634) +|+...|+- -|.+ .+.+|.|..+.+-+. +++-.-++ ..+....++.... ..+...-+. . T Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 543 (723) T protein:vir:94 465 QAAVTLREFDLLMRGERAAALWLADVRAVASEAYERGALLAPPDAEEVPPARLTRLDLAPE-ELAVRINVKRIFNARKWV 543 (723) T ss_pred HHHHHHHhhchhhcchHHHHHHHHHHHHHHHhccccceeccccccchhhHHHHHHHHHhhH-HHHHHHHHHHHHHHHHHH Confidence 777544331 1122 247777877666553 33311111 1122222333331 111110000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHh---cC Q lcl|NC_011057. 
611 GTIRSAVRRKVMAELTRPVIDV---VA 634 (634) Q Consensus 611 ~~lr~~v~~~v~~~lt~~vvd~---~~ 634 (634) ......+........-...-+| +. T Consensus 544 ~~~~~~l~~~~~~~~~~~~~~v~~~l~ 570 (723) T protein:vir:94 544 ARTKDTLRGWYETAWRTGGDHVAAQLG 570 (723) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 0011111110000000000111 00 No 43 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=99.59 E-value=2.3e-14 Score=95.41 Aligned_cols=495 Identities=15% Similarity=0.108 Sum_probs=242.2 Q ss_pred CCCC-CcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHH-------------HHHhhhhhHHH Q lcl|NC_011057. 1 MAAT-QSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAW-------------EAVDLVGELRY 66 (634) Q Consensus 1 ~~a~-~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW-------------~~yd~VgELry 66 (634) |--. -++|...|++ .+++.+..+. -|.+..| ++|...+=++- T Consensus 1 ~~~~~~~~~~~~~~~---------------------~~~~~~~~~~---~~~~~~~~~~~pp~~~~~La~~~~~n~~v~s 56 (540) T protein:vir:41 1 MFNYHLSIKSLEKYR---------------------AIKGDTDSQA---LKEDRFEEYVEPKVHPLVLLSLLQVNPYHAS 56 (540) T ss_pred CCCcccChhhccchh---------------------hhhccccccc---cccCCCCccccCCCCHHHHHHHHHhcHHHHH Confidence 2211 1222222211 1111100000 0111112 23333344455 Q ss_pred HHhhhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecC Q lcl|NC_011057. 67 YVGWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPV 146 (634) Q Consensus 67 yvgWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~ 146 (634) -|.=+++.++.+-+..-. + + +... + -+..-...-.++++.++.++.+-|.+|+.+. |.. 
T Consensus 57 cI~~ia~~ia~~~~~i~~-~-~-----~~~~----------~---~lpN~~~t~~~f~~~~v~dlll~Gnayv~i~-r~~ 115 (540) T protein:vir:41 57 ACSIKANDILRTGYLIDG-D-D-----GGVE----------E---LLRACRPSFEFILLQALEDLQVFNYCTLEVV-RDD 115 (540) T ss_pred HHHHHHHHHhcCCceEec-C-c-----cchh----------h---hccCCCCCHHHHHHHHHHHHHhcCCeEEEEE-ECC Confidence 566677777777665411 1 1 1111 1 1233445567899999999999999998763 432 Q ss_pred CCCCCCcccccccchhceeccHHHHhccCCCcceeeEeCCCCcccccC-------------------CCCeEEEeeCCCc Q lcl|NC_011057. 147 KGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVK-------------------GTDIIFRVWIPKP 207 (634) Q Consensus 147 ~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~-------------------~~D~~~RvW~P~p 207 (634) +|. ....+.|....++....++ .-+.+.+|....|.+ ..+-||++=+++| T Consensus 116 -------~G~---~~~L~~i~~~~V~v~~~~~-~~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~~ 184 (540) T protein:vir:41 116 -------QGE---PVRLDYIPAHTVRVHRDGS-RYMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPSP 184 (540) T ss_pred -------CCc---EEEEEEeCCcceEEeEcCc-eeEeeecCceeeeeecccccceeeccccccceeecccceEEecCCCC Confidence 232 2335556666665443222 122233332222111 1233566655666 Q ss_pred ccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHH Q lcl|NC_011057. 208 RKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDML 287 (634) Q Consensus 208 rra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml 287 (634) ..-..--||+.+++..+.-..-..+...+.-+.-..-.|||.+|..++-+..- ..-....+.+.+ T Consensus 185 ~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~---------------~~~~~~~~~~~~ 249 (540) T protein:vir:41 185 ICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMEL---------------GSDGEPTGRTVL 249 (540) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhcc---------------chHHHHHHHHHH Confidence 66667789999988877666555555554444444556788888877764321 111233344444 Q ss_pred HHHHhhcccCccccccccceeEeechHHhcccceeecCCchhH-HHHHHHHHHHHHHhhhccCChHHhhccc--cCcchh Q lcl|NC_011057. 
288 FQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITE-VAIKTRNDAIARLAMGLDVSPERLLGLG--SQTNHW 364 (634) Q Consensus 288 ~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te-~aiktR~daI~rlA~~~D~~pE~LLGlg--s~~Nhw 364 (634) -+.-+.++.. ....+-.|+|+..|+..-+.++...++....+ --+++|+..+..||..|.||| .+||+. ++.|.. T Consensus 250 ~~~~~~~~~g-~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp-~~lG~~~~~~~n~s 327 (540) T protein:vir:41 250 QGLIEDNFKY-LKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDP-YRLGITDVGPLGGN 327 (540) T ss_pred HHHHHHHhcc-ccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCH-HHcCcccCCCCCcc Confidence 4333333322 12356788999988755455666555544333 358899999999999999999 778973 457888 Q ss_pred hHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHH Q lcl|NC_011057. 365 SAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKY 444 (634) Q Consensus 365 tAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~ 444 (634) ++-+....-.+.-|.|.++.|++.|++.++.. .+ ..|-|.||.++|.. +|.....-.+++.|++|.+++|.. T Consensus 328 n~eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~----~~---~~~~i~f~~~~ll~-~D~~~~~~~lv~~G~lT~NE~Re~ 399 (540) T protein:vir:41 328 FAEVARRTYYESVVRPQQEIVSSVLTDFIQLK----LD---PGARFVFNEEILME-SEFVHNYALLVQCGVLTPSEVREK 399 (540) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc----cC---CceEEEecchhhcc-hHHHHHHHHHHhCCCCCHHHHHHH Confidence 99999999889999999999999999865532 22 24778999999854 454443344788899999999975 Q ss_pred h-CCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhcc--cCCCCC------CCCCCCCCCCCccccCCCC Q lcl|NC_011057. 445 L-GLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQI--EFPQQQ------QAIDSGGNEDTSDDDNLDD 515 (634) Q Consensus 445 ~-Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~--~~P~p~------~a~~~~~~~~~~~d~~~~~ 515 (634) + |+.. +.| +-+.|.-.++.+..-|.. +-++|. 
...++.-+++..++.+.++ T Consensus 400 L~g~e~--gdd------------------~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~ 459 (540) T protein:vir:41 400 LFGLDG--GPD------------------MFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLED 459 (540) T ss_pred hCcCcC--CCc------------------ccccccccccccccccccccCCCCccccccccchhcccccCcccccccccc Confidence 4 6532 222 111111111111000111 111111 1112222222223334344 Q ss_pred CCCCCCCCCCCCCcccCCCccHHHHHHHHHHHHHHHhhHHhhcCChhHHHHhhCCChHHhhhhcCCCChhHHHHHHhccc Q lcl|NC_011057. 516 GEHEPDTEDDQDDDGTQKAGLESGIVDLMVDRALELVGKRRRGRDRETLARLSGVRERDYHRYMDPVPESEVDRLMSGWD 595 (634) Q Consensus 516 ~~~ePDTe~d~~~~~~~~a~~~~a~vdllv~rALelAGkR~Rt~~R~~~arlr~ip~h~~h~~~~Pv~~~~v~rLi~GWd 595 (634) ...++|+ .-.+..+.+...+... +-+.|+++.--.--|. .+-||+.- ....+. T Consensus 460 ~~~~~~~-~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~--------~~~~~~~~----------~~~~~~----- 512 (540) T protein:vir:41 460 KKKKIDE-VLSDFRAEAYENGKKM---LSIAGDMGTMSAINRG--------VSMIPPKP----------SNLEAY----- 512 (540) T ss_pred ccccccc-cccccCCccccchhHH---HHHhhhhhhhhhhhcC--------ceecCCCC----------cchHHH----- Confidence 4444442 1111111111111111 3344444443322111 22333211 111110 Q ss_pred ccccHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 
596 SALDDKILLRLGLDPGTIRSAVRRKVMAELTRPVI 630 (634) Q Consensus 596 ~~ld~~~~a~~g~Dp~~lr~~v~~~v~~~lt~~vv 630 (634) -.+|....+.+--.++.|++.-+--.-+ T Consensus 513 -------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 540 (540) T protein:vir:41 513 -------EDLLAASVDDIVERIRHYLYKVIGWREL 540 (540) T ss_pred -------HHHHHhhHHHHHHHHHHHHHHHhhhccC Confidence 0001111111111122222211110000 No 44 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=99.58 E-value=4.2e-16 Score=104.89 Aligned_cols=363 Identities=16% Similarity=0.144 Sum_probs=202.8 Q ss_pred CCCCCc----------------ceeEeccCCCCccchhh-hhhhhccCCchhhhhhhhcccCccccccHHHHHH-----H Q lcl|NC_011057. 1 MAATQS----------------LRLVRRPKGGRPAPSRA-LTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEA-----V 58 (634) Q Consensus 1 ~~a~~~----------------lr~vrrp~g~~~a~~ra-l~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~-----y 58 (634) .++++. +=-.|||.... .++.+ +.+.+ .+++....|...++.- + T Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-------------~~~g~~~~~~~~~~~~~t~~~~ 75 (409) T protein:vir:83 10 IPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEP-EARALPWIRPT-------------AWSGYPESWATPSWGSAQDKLR 75 (409) T ss_pred cccCCCcccccccccccCCCCceeeccCCCcch-hhhhccccccc-------------ccccccccccccCccccchhhH Confidence 222222 23356666533 22221 11111 0111112232222211 1 Q ss_pred hhhhhHHHHHhhhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceE Q lcl|NC_011057. 
59 DLVGELRYYVGWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELW 138 (634) Q Consensus 59 d~VgELryyvgWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~w 138 (634) -.++-+.-.|.=+++++|.+.|+.-+ + | ...+ ....+...-..--+...++++.++.+|.+ |..| T Consensus 76 ~~~~~v~acV~~Ia~~iA~lpl~~~~---~-~----~~~~------~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay 140 (409) T protein:vir:83 76 TLIDVAWACIDLNASVLSSMPIYRMR---N-G----RIID------SVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAF 140 (409) T ss_pred hhhHHHHHHHHHHHHhhccCceEEee---C-C----cccc------chhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcE Confidence 12344445577788999998776543 2 1 1111 11222333455557889999999999988 9999 Q ss_pred EEEEEecCCCCCCCcccccccchhceeccHHHHhccCCCcceeeEeC-CCCc-ccccC--CCCeEEEeeCCCcccccCCc Q lcl|NC_011057. 139 IVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLP-TGEE-HEFVK--GTDIIFRVWIPKPRKASEPD 214 (634) Q Consensus 139 i~il~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP-~g~~-h~~~~--~~D~~~RvW~P~prra~eaD 214 (634) +.++.|.. +|. .-++++|....+ .|++- +|.. +.|.. ..|-||++=..++..-..-- T Consensus 141 ~~~i~r~~-------~G~---~~~L~pl~p~~v---------~v~~~~~g~~~y~~~~~~~~~eiiHir~~~~~~~~~G~ 201 (409) T protein:vir:83 141 VLPMAHGS-------DGY---PIRFRVVPPWLV---------NVELKKGARREYRIGGLNVTDEILHIRYQGNTADAHGH 201 (409) T ss_pred EEEEEECC-------CCc---EEEEEEECCcce---------EEEEcCCceEEEEEccccCccceEEeCCCCCCCCcccc Confidence 88777633 232 234666655433 22222 2221 11111 23456665333343333456 Q ss_pred cchhhhhHHHHHHHhhhHHHHHHHHhHhhhC-----ceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHH Q lcl|NC_011057. 215 SPVRAVLDSIREIVRTTKTIANASKSRLIGN-----GVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQ 289 (634) Q Consensus 215 SPvra~l~~LrEI~rttk~I~na~~SRL~gn-----GvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~q 289 (634) ||+.++... +.+....... 
..++..| |||-+|+.++ .-..++|.+-+.+ T Consensus 202 spi~~~~~~----i~~~~a~~~~-~~~~f~nga~p~gil~~~~~ls---------------------~e~~~~~~~~~~~ 255 (409) T protein:vir:83 202 GPLESAAPR----QVVIGLLQKY-VQNLAETGGVPLYWLGVERRLS---------------------ETEAVDLMDRWIE 255 (409) T ss_pred cHHHHHHHH----HHHHHHHHHH-HHHHHhcCCCcceEeecCCCCC---------------------HHHHHHHHHHHHH Confidence 776655444 4444333332 2344444 4454444322 1133445444432 Q ss_pred HHhhcccCccccccccceeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhh---H Q lcl|NC_011057. 290 VAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWS---A 366 (634) Q Consensus 290 va~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~Nhwt---A 366 (634) +..+ .+-=|+|+..- ++.-|-+.+... +.--+++|+..++.||..|.||| .|||+..+.|.|| . T Consensus 256 ----~~~~----nag~~~il~~g---~~~~~~~~~s~~-d~q~le~r~~~~~eIa~~fgVPp-~llg~~~~~~~~tysn~ 322 (409) T protein:vir:83 256 ----SRSK----YAGHPALVTGG---ATLNQAKSMSAQ-DLSLMELTQFNEARIAILLGVPP-FLVGLPGATGSLTYSNI 322 (409) T ss_pred ----hhCC----ccCccceecCC---cccccccCCCHH-HHHHHHHHHhhHHHHHHHhCCCH-HHccCCCCccccccccH Confidence 2222 22234444322 222233444332 22247899999999999999999 9999866667665 7 Q ss_pred hhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHH Q lcl|NC_011057. 367 WQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRK 443 (634) Q Consensus 367 w~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~ 443 (634) -+....-++..|.|.+..|+++|++.+|.. .+| |-||.+.|. ++|..+.+ ..+++.|++|.+|+|+ T Consensus 323 eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~---------~~~-~~f~~~~ll-r~d~~~r~~~~~~~~~~G~lT~NE~R~ 391 (409) T protein:vir:83 323 EQLFSFHDRSSLRPKATAVMAALDRWALPS---------PQH-LELNRDDYT-RPSLVERATAYKIMIEAGVMEPNEARA 391 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhCCC---------CcE-EEeehhhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHH Confidence 888888899999999999999999987743 234 678888875 33444433 3377889999999999 Q ss_pred HhCCCccccCCCCCHHHH Q lcl|NC_011057. 
444 YLGLGDDAGYDFTTREGW 461 (634) Q Consensus 444 ~~Gl~ed~~yd~~t~Eg~ 461 (634) ..||.-.+|-|--|--|+ T Consensus 392 ~~glpp~~ggd~l~~~gv 409 (409) T protein:vir:83 392 MERLHSEAAAVRLSGGGV 409 (409) T ss_pred HhCCCCCCCCcccCCCCC Confidence 999998888875555555 No 45 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=99.57 E-value=1.5e-15 Score=101.93 Aligned_cols=411 Identities=13% Similarity=0.090 Sum_probs=219.1 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhh------hhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhc Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVF------SKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASS 74 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~------~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s 74 (634) |--+.- -+-=++.+.-=..-++.-.-.... +|.... .....++. .-=...|+ .++-+.-.|.-++++ T Consensus 1 ~~~~~~-~~~~~~~~g~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~v~~~~al----~~~~v~~cv~~Ia~~ 73 (424) T protein:vir:18 1 MEEPKY-TIDLRTNNGWWARLQSWFVGGRLV-TPNQGSQTGPVSAHGHLGDS-SINDERIL----QISTVWRCVSLISTL 73 (424) T ss_pred CCCCcc-eEeecCCCchHHHHHhhhcccccc-cccccccccccccccccccc-cccHHHhh----ccHHHHHHHHHHHHh Confidence 322211 111111111100001111100000 111100 00000000 00001111 122344567889999 Q ss_pred eeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcc Q lcl|NC_011057. 75 CSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPD 154 (634) Q Consensus 75 ~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~d 154 (634) +|.+.+..=+.+.|.|+. +-..++ .+..++..=..--+...++.+.++.+|-+-|+.|+.+ .|.. 
+ T Consensus 74 iA~lp~~~~~~~~~~~~~-----~~~~~~-~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i-~r~~-------~ 139 (424) T protein:vir:18 74 TACLPLDVFETDQNDNRK-----KVDLSN-PLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV-DRNS-------A 139 (424) T ss_pred hccCceEEEEeecCCcee-----eecccc-HHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEE-EECC-------C Confidence 999988887877773321 111122 2344445556667888899999999999999999986 4533 2 Q ss_pred cccccchhceeccHHHHhccCCCcceeeEeC-CCCcccccCCCCe-EEEeeCCCcccccCCccchhhhhHHHHHHHhhhH Q lcl|NC_011057. 155 GSVRTRQEWYAVSKEEIKKSNKGSGTNIVLP-TGEEHEFVKGTDI-IFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTK 232 (634) Q Consensus 155 g~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP-~g~~h~~~~~~D~-~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk 232 (634) |. ...++++....+.....++...+..- +|....|.. .|+ -||-.+++. ..--||+.++.+.+.--.-..+ T Consensus 140 G~---~~~L~pl~~~~V~v~~~~~~~~y~~~~~g~~~~~~~-~eIih~r~~~~dg---~~G~spi~~~~~~i~~~~a~~~ 212 (424) T protein:vir:18 140 GD---VISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQ-KEIFHLKGFGFTG---LVGLSPIAFACKSAGVAVAMED 212 (424) T ss_pred Cc---EEEEEEecCcceEEEEcCCeEEEEEEeCCeEEEecc-ccEEEecCcCCCC---cccccHHHHHHHHHHHHHHHHH Confidence 32 23466776666655444444444443 455444443 443 344333332 3456888877766655555555 Q ss_pred HHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_011057. 233 TIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVP 312 (634) Q Consensus 233 ~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP 312 (634) ...+..+.-..-.|||-+|+.+. .....+++++.+-+. 
+.-+++ -=++|+ + T Consensus 213 ~~~~~f~ng~~p~gil~~~~~~l--------------------~~e~~~~~~~~~~~~----~~g~na---g~~~vl--~ 263 (424) T protein:vir:18 213 QQRDFFANGAKSPQILSTGEKVL--------------------TEQQRSQVEENFKEI----AGGPVK---KRLWIL--E 263 (424) T ss_pred HHHHHHHccCCcceEEEeCCcCC--------------------CHHHHHHHHHHHHHH----hCCccc---CCceec--c Confidence 55555555555557887776531 112455565555422 211222 112233 2 Q ss_pred hHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchh--hHhhhhhhhhhHHHHhHHHHHHHHHH Q lcl|NC_011057. 313 GEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHW--SAWQISDEDVQLHIAPVMEIFCQALT 390 (634) Q Consensus 313 ~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~Nhw--tAw~i~de~v~~hI~P~~~~i~~ait 390 (634) ..-+++-|.+.. .+.--+++|+..+..||..|.||| .|||...+.+.| +..+....-++..|.|.+..|+++|+ T Consensus 264 --~g~~~~~l~~~~-~d~q~le~~~~~~~~Ia~~fgVPp-~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P~~~~ie~~l~ 339 (424) T protein:vir:18 264 --AGFSTSAIGVTP-QDAEMMASRKFQVSELARFFGVPP-HLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQ 339 (424) T ss_pred --CCceEEecCCCh-hHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 234555555542 223338899999999999999999 677875666654 45777777889999999999999999 Q ss_pred HHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHH Q lcl|NC_011057. 391 DQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQD 467 (634) Q Consensus 391 ~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d 467 (634) +.+|.+. +. ..|.|.||.+.|. 
+.|..+.+ ..+++.|++|.+++|+++|+.--.|=| T Consensus 340 ~~L~~~~----~~--~~~~~~fd~~~ll-r~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGD------------- 399 (424) T protein:vir:18 340 RWLIPAK----DV--GRIHAEHNLDGLL-RGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD------------- 399 (424) T ss_pred hhcCCcc----cc--CCeEEEEechhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC------------- Confidence 9988662 22 3588999999984 34544443 448889999999999999996422111 Q ss_pred HhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCC Q lcl|NC_011057. 468 AVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNL 513 (634) Q Consensus 468 ~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~ 513 (634) ..--... +.|+-+ .++.....+.++ T Consensus 400 ~~~~~~n----~~~l~~-----------------~~~~~~p~~~ga 424 (424) T protein:vir:18 400 VAMRQSQ----YVPITD-----------------LGTNKEPRNNGA 424 (424) T ss_pred eeeeccC----ccchHh-----------------hhccCCCccCCC Confidence 0000000 011100 000000011111 No 46 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=99.57 E-value=5.7e-15 Score=98.67 Aligned_cols=402 Identities=14% Similarity=0.126 Sum_probs=212.8 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |. ...|+-|.|.+-.. ..+-.-+..++++.-....+..+. 
.++-+-..+-+.-.|.-+++++|.+.| T Consensus 1 ~~---~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~v--------~~~~~~~~~~V~~ci~~Ia~~ia~lp~ 67 (409) T protein:vir:93 1 MA---KENIVTRIKKKLID--NWIDQSTSKLYDFSPWKNRSFWGV--------INNTLETNETIFSAITKLSNSMASLPL 67 (409) T ss_pred CC---ccchhhhhhhhhhh--hhhccccccccccccccCcccccc--------chhhhhccHHHHHHHHHHHHhhhhCce Confidence 22 22233333221100 011111222222222111111000 112233445566677889999999988 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+=+. .+ + ..+..+...=+.-.+...++++.++.+|-+-|+.|+.+ .|.. +|. . T Consensus 68 ~~~~~~~-------~~-----~-~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i-~r~~-------~G~---~ 123 (409) T protein:vir:93 68 KMYEDYK-------VV-----N-TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLI-ERDI-------YHQ---P 123 (409) T ss_pred eEeeccc-------cc-----c-chHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEE-EECC-------CCc---E Confidence 7754221 11 1 23445555556778889999999999999999999876 3422 232 2 Q ss_pred hhceeccHHHHhccC--CCccee--eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSN--KGSGTN--IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIAN 236 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~--~~~~~~--i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 236 (634) .+++.|..+.+...- .+...- +...+|....|.. +=+|++=++.+..-..--||+.++...+.=...+.+. 
+ T Consensus 124 ~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~g~~~~~~~--~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~--~ 199 (409) T protein:vir:93 124 SKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHN--MDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF--N 199 (409) T ss_pred EEEEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEEcc--ccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHH--H Confidence 346677666554322 222233 3334455444332 3345554555555555668876655544422222111 2 Q ss_pred HHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_011057. 237 ASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQI 316 (634) Q Consensus 237 a~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi 316 (634) ..+....+.||+-.++.++ ....+.+.+.+-+ .+...+. + +|+ + .. T Consensus 200 ~~~~~~~~~~i~~~~~~l~---------------------~e~~~~~~~~~~~----~~~~~g~----~-~vl--~--~g 245 (409) T protein:vir:93 200 LTEMQKPDSFMLKYGSNVG---------------------KEKRQQVLEDFKQ----YYEENGG----I-LFQ--E--PG 245 (409) T ss_pred HHhcCCCCceEEecCCCCC---------------------HHHHHHHHHHHHH----HhhcCCC----e-eec--C--CC Confidence 2222222333433333221 1244455555543 2222221 2 222 2 23 Q ss_pred cccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_011057. 317 KDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRV 396 (634) Q Consensus 317 ~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~ 396 (634) -+++.|.... 
.+.--+++|+..+..||..|.||| .+||.+.++|..+..+....-++..|.|.++.|.++|++.+|-+ T Consensus 246 ~~~~~l~~~~-~d~q~~e~r~~~~~~Ia~~fgVPp-~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~ 323 (409) T protein:vir:93 246 VEIEPLPKKY-VSEDIVASENLTRERVANVFQLPS-VFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTK 323 (409) T ss_pred ceEEEcCCCh-hHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Confidence 3455554332 233448899999999999999999 67787788999999999999999999999999999999988754 Q ss_pred HHHhcCCChhHheeeecCcccccCCCchHH---HHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCc Q lcl|NC_011057. 397 TLAREGIDPSKYVVWYDASQLTIDPDKSDE---AKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDP 473 (634) Q Consensus 397 ~L~~eG~d~~~yV~w~DaS~L~~~pd~t~e---A~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp 473 (634) .+++ ..|.|-||.+.|.. +|..+. ...++..|++|.+++|+++|+..-.+-| . T Consensus 324 ----~~~~-~~~~~~fd~~~ll~-~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD-------------~----- 379 (409) T protein:vir:93 324 ----TDRE-KNRYFKFNVKSYLR-ADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD-------------K----- 379 (409) T ss_pred ----cccc-CcceEEeechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-------------e----- Confidence 2232 34668899998853 233222 2448889999999999999996432222 0 Q ss_pred ccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccc Q lcl|NC_011057. 474 TLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDD 510 (634) Q Consensus 474 ~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d 510 (634) ++...-+..++-+.-......+|+++..+. 
T Consensus 380 -------~~~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 380 -------PLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred -------eeecccccccccchhhcccccCCCCCcCCC Confidence 000000111111111111112122111111 No 47 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=99.57 E-value=1.5e-14 Score=96.33 Aligned_cols=411 Identities=15% Similarity=0.115 Sum_probs=220.3 Q ss_pred CCCCCcceeEeccCCCCccchhh----hhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhcee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRA----LTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCS 76 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ra----l~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~S 76 (634) |.- -|..-+..+........ +.+... +..++. .+.. .|. .++-+.-.|.-++++|| T Consensus 1 ~~~---~r~~~~~~~~~~~~~~~~~~~~~g~~~--s~~~~~------vt~~-----~al----~~~~v~~~v~~ia~~iA 60 (419) T protein:vir:14 1 MFF---SRQLLSNLGQTQMSAGGWVSALLGSSR--SDSGQV------VTPA-----SAL----ALTVLQNCVTLLAESIA 60 (419) T ss_pred Ccc---cccccccccccccCcchhhHHhhcCCC--ccCCcc------cchH-----Hhh----ccHHHHHHHHHHHHhhc Confidence 211 01111111111111111 111100 000000 1111 111 22335556777899999 Q ss_pred eeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccc Q lcl|NC_011057. 77 RCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGS 156 (634) Q Consensus 77 r~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~ 156 (634) .+.|..-+-+.+ |..+ ..++ .+..++..-+..-+...++++.++.+|-+-|+.|+.+ .|.. +|. 
T Consensus 61 ~lp~~~~~~~~~-----~~~~--~~~~-~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i-~r~~-------~G~ 124 (419) T protein:vir:14 61 QLPIELYERSGE-----DRKP--ATDH-PLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFI-DRDS-------DGV 124 (419) T ss_pred cCceEEEEecCC-----cccc--cccc-HHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEE-EECC-------CCc Confidence 999988777755 2222 1222 3455556567778899999999999999999998886 4433 232 Q ss_pred cccchhceeccHHHHhccCCCccee-eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_011057. 157 VRTRQEWYAVSKEEIKKSNKGSGTN-IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIA 235 (634) Q Consensus 157 ~~~~~~W~~vt~~Ei~~~~~~~~~~-i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 235 (634) ...++.|....+.....+++.. +...+... . ..+.++++=.+ +..-..-.||+..+...+.-.....+... T Consensus 125 ---~~~l~pl~~~~v~v~~~~~~~~~y~~~~~~~---~-~~~~i~h~~~~-~~dg~~G~s~i~~~~~~i~~~~~~~~~~~ 196 (419) T protein:vir:14 125 ---IQGLYPLDNEAVTVMRGSDLKPVYRVRGSDP---M-PQRLVHHVRWM-SINGYTGLSPVLLHANAIGHAQAIQQYAG 196 (419) T ss_pred ---EEEEEEecCceEEEEECCCceEEEEEccCcc---c-chhheeEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 2346666655554332222222 22222111 1 12334443222 22334566888777776666666666666 Q ss_pred HHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHH Q lcl|NC_011057. 236 NASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQ 315 (634) Q Consensus 236 na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Eh 315 (634) +..+.-..-.|||-+|+.+.-- ......+.|++.+- ..+.--+. +--++++ + . 
T Consensus 197 ~~f~ng~~p~gil~~~~~~~~~-----------------~~~~~~~~~~~~~~----~~~~g~~n--ag~~~vl--~--~ 249 (419) T protein:vir:14 197 KSFMNGTALSGVIERPKDAPAL-----------------KDQASVDRITDGWN----AKFGGSGN--AKKVALL--Q--E 249 (419) T ss_pred HHHhccCCccEEEEecCCCCcc-----------------cCHHHHHHHHHHHH----HHhcCccc--cCCceec--C--C Confidence 6666556667788776654220 11224445555444 22222111 2222332 2 2 Q ss_pred hcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHH Q lcl|NC_011057. 316 IKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILR 395 (634) Q Consensus 316 i~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr 395 (634) .-+++-|.+. ..+.--+++|+..+..||..|-||| .+||.+.++|..++-+....-++..|.|.+..|+++|++.+|. T Consensus 250 g~~~~~l~~~-~~d~q~~e~~~~~~~~Ia~~fgVpp-~~lg~~~~~t~s~~E~~~~~f~~~~L~P~~~~ie~~l~~kll~ 327 (419) T protein:vir:14 250 GMTFRPLSMT-NVDAALIDALRLSALDIARIYKIPA-HMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDLLL 327 (419) T ss_pred CceEEEccCC-hhhHHHHHHHHHHHHHHHHHhCCCH-HHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Confidence 3456666553 3344567899999999999999999 6778767788888899998889999999999999999998886 Q ss_pred HHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcC Q lcl|NC_011057. 396 VTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKD 472 (634) Q Consensus 396 ~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~d 472 (634) +.- + ..|.|.||.+.|. 
+.|..+.+ ..+++.|++|.+++|.++|++.-.|-| T Consensus 328 ~~~---~---~~~~i~fd~~~l~-r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD------------------ 382 (419) T protein:vir:14 328 PSE---R---KQYFIEYNLAGLL-RGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGD------------------ 382 (419) T ss_pred ccc---c---CCeEEEEechhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC------------------ Confidence 632 1 3588999999984 34544433 348889999999999999996432222 Q ss_pred cccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCC Q lcl|NC_011057. 473 PTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKA 534 (634) Q Consensus 473 p~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a 534 (634) --+. |+- +...+-|.+ .+ .+...+|....+......+ T Consensus 383 ~~~~----~~n---~~~~~~~~~---------~~---------~~~~~~~~~~~~e~~~~l~ 419 (419) T protein:vir:14 383 IYLS----PMN---MVDASKPQQ---------LP---------VGKSEPTKAAIDEIGRILS 419 (419) T ss_pred eeee----ccc---ccccccccc---------cc---------CCCCCCccccccchhcccC Confidence 0011 100 000000000 00 0000000111111111111 No 48 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=99.57 E-value=4.8e-15 Score=99.08 Aligned_cols=402 Identities=14% Similarity=0.125 Sum_probs=211.1 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |. .=+|.-|=|.+-.. ..+.-....+.|+..-+.++..+. 
.-+-|-..+.+.-.+.-+++++|++.+ T Consensus 1 ~~---~~~~~~~~k~~~~~--~~~~~~~~~~~~~~~~~~~~~~~v--------~~~~a~~~~~v~~~i~~Ia~~ia~lp~ 67 (409) T protein:vir:94 1 MA---KENIVTRIKKKLID--NWIDQSASKLYDFSPWKNKSFWGV--------INNTLETNETIFSAITKLSNSMASLPL 67 (409) T ss_pred Cc---ccccchhhhhHHhh--hhhcCCcccccccccccCcccccc--------chhhhhccHHHHHHHHHHHHhhhhCce Confidence 32 11222222221100 001001111112111111110000 011133456666777889999999988 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+=. ...+ ..+..+...=+.-.+-..++++.++.+|-+-|++|+.+. |. .+|. . T Consensus 68 ~~~~~~--------~~~~-----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~-------~~G~---~ 123 (409) T protein:vir:94 68 KMYEDY--------KVVN-----TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIE-RD-------IYHQ---P 123 (409) T ss_pred eEeecc--------cccc-----hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE-EC-------CCCc---E Confidence 764421 1111 224444554467788999999999999999999998763 42 2332 2 Q ss_pred hhceeccHHHHhcc--CCCccee--eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKS--NKGSGTN--IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIAN 236 (634) Q Consensus 161 ~~W~~vt~~Ei~~~--~~~~~~~--i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 236 (634) .+++.|..+.+... ..+...- +...+|....|.. .| +|++=++++..-..--||+.++.+.+.-...+.+. 
+ T Consensus 124 ~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~-~d-vih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~--~ 199 (409) T protein:vir:94 124 SKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHN-MD-MLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF--N 199 (409) T ss_pred EEEEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEEcc-cc-EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHH--H Confidence 35777776655432 2222232 3344566555433 33 45553555555556678887765555432222221 1 Q ss_pred HHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_011057. 237 ASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQI 316 (634) Q Consensus 237 a~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi 316 (634) ....-..+.||+-.|+.++ ....+.+.+.+-+ .+...+. ++|+ + .. T Consensus 200 ~~~~~~~~~~i~~~~~~l~---------------------~e~~~~~~~~~~~----~~~~~g~-----~~vl--~--~g 245 (409) T protein:vir:94 200 LTEMQKPDSFMLKYGSNVG---------------------KEKRQQVLEDFKQ----YYEENGG-----ILFQ--E--PG 245 (409) T ss_pred HHhcCCCCeeEEecCCCCC---------------------HHHHHHHHHHHHH----HhhcCCC-----eeec--C--CC Confidence 1111112333433333221 1234445554443 2222221 2222 2 33 Q ss_pred cccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_011057. 317 KDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRV 396 (634) Q Consensus 317 ~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~ 396 (634) -+++.|.+... +.--+++|+..+..||..|.||| .+||.+.++|..+..+....=++..|.|.++.|.++|++.+|-. 
T Consensus 246 ~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~ 323 (409) T protein:vir:94 246 VEIEPLPKKYV-SEDIVASENLTRERVANVFQLPS-VFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTK 323 (409) T ss_pred ceEEEcCCChh-HHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCc Confidence 45555554432 22347899999999999999999 66787778899999999999999999999999999999988753 Q ss_pred HHHhcCCChhHheeeecCcccccCCCchHH--H-HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCc Q lcl|NC_011057. 397 TLAREGIDPSKYVVWYDASQLTIDPDKSDE--A-KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDP 473 (634) Q Consensus 397 ~L~~eG~d~~~yV~w~DaS~L~~~pd~t~e--A-~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp 473 (634) .+.+ ..|.|-||.+.|..- |..+. + ..++..|++|.+++|.++|+..-.+-| . T Consensus 324 ----~~~~-~~~~i~fd~~~ll~~-d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD-------------~----- 379 (409) T protein:vir:94 324 ----TDRE-KNRYFKFNVKSYLRA-DSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD-------------K----- 379 (409) T ss_pred ----cccc-CcceEEeechhhhcc-CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-------------e----- Confidence 2332 246678999988432 32222 2 448889999999999999996533222 0 Q ss_pred ccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccc Q lcl|NC_011057. 474 TLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDD 510 (634) Q Consensus 474 ~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d 510 (634) +++..-+..++-+.-......+|+++..+. 
T Consensus 380 -------~~~~~n~~~~~~~~~~~~~~kGG~~n~~e~ 409 (409) T protein:vir:94 380 -------PLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred -------EeecccccccccchhhcccccCCCCCcCCC Confidence 000000111111111111111111111111 No 49 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=99.56 E-value=1.4e-14 Score=96.58 Aligned_cols=416 Identities=13% Similarity=0.147 Sum_probs=204.9 Q ss_pred CCCCCcceeEeccCCCCcc-chhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPA-PSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCR 79 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a-~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (634) |---..++-.-+|.....+ ...+.+.-.....+++...+++ +.. - -++-+-..+-+.-.|.-+++++|++. T Consensus 7 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~--v-----~~~~a~~~~aV~~~v~~Ia~~ia~lp 78 (432) T protein:vir:97 7 LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDT-GAA--V-----NADAIMRLDAVAACVKLVSQAVAAMP 78 (432) T ss_pred CchhhhhHhhcCCccccccccccccccCchhhhhhccccccc-Ccc--c-----chHhhhcchHHHHHHHHHHHhhccCc Confidence 2211111111111110000 0011110000000111111000 000 0 01112233455567778999999999 Q ss_pred EEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCccccccc Q lcl|NC_011057. 80 LVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRT 159 (634) Q Consensus 80 L~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~ 159 (634) +..=+-+.| |.... .++ .+..+...=...-+...++++.++.+|-+-|++|+.+. |.. |. 
T Consensus 79 ~~~y~~~~~-----g~~~~--~~~-pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~-~~~--------g~--- 138 (432) T protein:vir:97 79 LMMYMRTPD-----GRKEA--VNH-PLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKV-VTD--------GR--- 138 (432) T ss_pred eEEEEecCC-----Ccccc--ccc-HHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEE-ecC--------Cc--- Confidence 987776766 32221 122 24444455566778999999999999999999998764 321 21 Q ss_pred chhceeccHHHHhccCCCcc-ee--eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_011057. 160 RQEWYAVSKEEIKKSNKGSG-TN--IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIAN 236 (634) Q Consensus 160 ~~~W~~vt~~Ei~~~~~~~~-~~--i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 236 (634) ..+.+.|..+.+......+| .. +...+|...+|.. +-+|++=++ +-.-..--||+..+...+.--....+...+ T Consensus 139 ~~~L~~l~p~~v~v~~~~~g~~~y~~~~~~g~~~~~~~--~~iih~r~~-~~dg~~G~spi~~~~~~i~~~~a~~~~~~~ 215 (432) T protein:vir:97 139 IESLQYLANDRLTITTDTKGNTAYRYRRTDGQMIDIPR--QQIWKIMGY-SLDGENGLSAIRYGAQIFGTAIAAEAQAAR 215 (432) T ss_pred EEEEEEEcCcceEEEEcCCCcEEEEEEecCceEEEEcc--ccEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 12344555554443222222 22 2334565544432 334555222 222234456666554444332223333333 Q ss_pred HHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_011057. 237 ASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQI 316 (634) Q Consensus 237 a~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi 316 (634) ..+.-....|||-+|+.++ .-+.+.|.+.+. ...+.+ =++|+ + .. 
T Consensus 216 ~f~ng~~~~gil~~~~~l~---------------------~e~~~~~~~~~~-----~~~nag-----~~~vl--~--~g 260 (432) T protein:vir:97 216 AFRNGQLQSVYYQIDRFLT---------------------DDQYDSFSKKVS-----GSVEAG-----RAPLL--E--GG 260 (432) T ss_pred HHhccCCcceeEecCCCCC---------------------HHHHHHHHHHHh-----hhhcCC-----Cceec--C--CC Confidence 2232233456776665543 113444554442 111111 12333 2 23 Q ss_pred cccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcch---hhHhhhhhhhhhHHHHhHHHHHHHHHHHHH Q lcl|NC_011057. 317 KDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNH---WSAWQISDEDVQLHIAPVMEIFCQALTDQI 393 (634) Q Consensus 317 ~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~Nh---wtAw~i~de~v~~hI~P~~~~i~~ait~~~ 393 (634) -+++-|.+... +.=-+++|+..+..||..|.||| .|||.....+. .+.-+....=++..|.|.++.|+++|++.+ T Consensus 261 ~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~t~~~~s~~e~~~~~f~~~tl~P~~~~ie~~ln~kL 338 (432) T protein:vir:97 261 MDVKSLGLNPV-DAQLLQSRQYSVESICRFFGVPP-SMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSIALNL 338 (432) T ss_pred ceEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCH-HHcCCcCCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 34555554332 22337889999999999999999 77786433332 445666666677899999999999999999 Q ss_pred HHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhh Q lcl|NC_011057. 394 LRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVS 470 (634) Q Consensus 394 lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~ 470 (634) |.+. ++ ..|.|-||.+.|.. .|..+.+ ..++..|.+|.+++|+++|++--.|=+ +.+. T Consensus 339 l~~~---e~---~~~~~~fd~~~llr-~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g~~------------~~~~ 399 (432) T protein:vir:97 339 LTPA---ER---RRYFADFDTSALLR-ADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNA------------AVLT 399 (432) T ss_pred cCcc---cc---CceEEEeechhhhc-cCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCc------------ceEe Confidence 8773 22 35788999999853 3443333 458899999999999999996432211 1111 Q ss_pred cCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCC Q lcl|NC_011057. 
471 KDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDD 515 (634) Q Consensus 471 ~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~ 515 (634) .+..+. |+-.... -++|.++ +..+++++...+. T Consensus 400 ~~~~~~----pl~~~~~----~~~~~~~----~~~~~~~~~~~~~ 432 (432) T protein:vir:97 400 VQSAMV----PLDSIGL----QASPEPA----SGLGNQQQDKVSK 432 (432) T ss_pred eccccc----chhhhcc----cCCCCCC----CCCCCcccccccC Confidence 111111 2111000 0111111 1111111111011 No 50 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=99.56 E-value=1.2e-13 Score=91.38 Aligned_cols=541 Identities=15% Similarity=0.181 Sum_probs=233.2 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhh------cccCcccccc------HHHHHHHhhhhhHHHHH Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKS------TGISRNSDWQ------TDAWEAVDLVGELRYYV 68 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~------~~~~~~~~WQ------~eAW~~yd~VgELryyv 68 (634) .+---+.|+--.|.+.++..- ....-..||....-|. ++.+...+|. ....+.|..-+-++-.| T Consensus 28 ~~~~~~~~~~~~p~~~~~~~~----~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~~~epp~d~~~l~~l~~~np~V~~aI 103 (648) T protein:vir:79 28 LVLEESMQLGEAPGAMPKGGG----GGGSAKRDPKMSLVKRIGLAIMDGGGGGRDFEEPEFDFNEITSAYNTEGYVRQAV 103 (648) T ss_pred cccccccccCCCccccCCCCc----ccccccccchhHHHHHhHHHHHhhcCCccccccCCcCHHHHHHHHhcChHHHHHH Confidence 111122222222222111111 0011112332222111 0111122232 22346777778888889 Q ss_pred hhhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCC Q lcl|NC_011057. 69 GWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKG 148 (634) Q Consensus 69 gWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~ 148 (634) .-++++++++-++..- +.+. .+++ .+ .+.. 
..-..-.....++++.+..+|-+=|..|+.+ .|.+++ T Consensus 104 ~iia~~ia~l~~~i~~---~~~~---~~~~-~~--~~~l---l~rPn~~~t~~~f~~~l~~~lll~GNAYvei-iRd~~G 170 (648) T protein:vir:79 104 DKYIEMMFKADWDFVS---KNPN---AVEY-IR--MRFT---LMAEATQIPTNQLFIEIAEDLVKYCNVVIAK-SRAKDA 170 (648) T ss_pred HHHHHHHhhCcceEEe---cCCc---cchh-hH--HHHH---hhccCCCCCHHHHHHHHHHHHHhcCCeEEEE-EecCCC Confidence 9999999998876422 2222 1221 11 1111 1123334567799999999999999999976 443332 Q ss_pred CCCC-----cccccccchhceeccHHHHhccCCCcce----eeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhh Q lcl|NC_011057. 149 APAQ-----PDGSVRTRQEWYAVSKEEIKKSNKGSGT----NIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRA 219 (634) Q Consensus 149 ~~~~-----~dg~~~~~~~W~~vt~~Ei~~~~~~~~~----~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra 219 (634) .... .-..+.....++.|...-++......|- .+..++|.+....+. +=||++...++..-..--||+.+ T Consensus 171 ~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~~~Y~y~~~g~~~~~~~~~-~dIIHik~~~~~d~~~GlSpi~~ 249 (648) T protein:vir:79 171 LPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMIKGWQQEQEGQDKPQKFKP-EDIVHIYYKREKGRAFGTPWLLP 249 (648) T ss_pred ccchhhhhhhhccccceeeeEeecCceeEEEEcCCCceeeeEEEecCCceeEEecC-ccEEEEccCCCCCCceeccHHHH Confidence 1100 0000111223334443333322111111 123334333222223 33677765566555567788888 Q ss_pred hhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcc Q lcl|NC_011057. 220 VLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDED 299 (634) Q Consensus 220 ~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~ 299 (634) |.+.|.-.....+.-.+--+.=....|||.+|..-. 
..-..+.+.+-+.+ ++.- T Consensus 250 a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~--------------------~~e~~k~~~e~~~~----~~~~-- 303 (648) T protein:vir:79 250 ALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQE--------------------GFGAEEGEVDLVRG----EVEN-- 303 (648) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCcc--------------------chHHHHHHHHHHHH----hccc-- Confidence 877775544444433332222223345555431100 00122223333321 2211 Q ss_pred ccccccceeEeechHHhcccceeecCCchh--H-HHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhH Q lcl|NC_011057. 300 SQAAFIPVIAGVPGEQIKDVKHIRFDNEIT--E-VAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQL 376 (634) Q Consensus 300 S~AA~vPiva~vP~Ehi~~ikHl~f~~d~t--e-~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~ 376 (634) .+ +.. ...+.+-+.|....+ + --+++|+..+..||..|.||| .+||+..++|..++.+. ...++. T Consensus 304 -----~~-i~g----g~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP-~lLG~~~~ss~stae~~-~~~~~~ 371 (648) T protein:vir:79 304 -----MD-VEG----GMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSE-LMMGRGGTASRSTGDNL-SSDFKD 371 (648) T ss_pred -----cc-ccc----cccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCH-hHcccCCCccchHHHHH-HHHHHH Confidence 00 111 112223333432222 1 247789999999999999999 78899878888888665 455788 Q ss_pred HHHhHHHHHHHHHHHHHHHHHHHhcCCChh---HheeeecCcccccCCCchH--HHHHHHHccCCCHHHHHHHhCCCc-c Q lcl|NC_011057. 377 HIAPVMEIFCQALTDQILRVTLAREGIDPS---KYVVWYDASQLTIDPDKSD--EAKFAYENGAINGEALRKYLGLGD-D 450 (634) Q Consensus 377 hI~P~~~~i~~ait~~~lr~~L~~eG~d~~---~yV~w~DaS~L~~~pd~t~--eA~~~~~~G~It~ealr~~~Gl~e-d 450 (634) .|.|....++..+...+++..|-..++++. .|-+.|+.++|.....++. 
....+++.|++|.++.|++.|+.- + T Consensus 372 ~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~ 451 (648) T protein:vir:79 372 RIKALQKVMATFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAVFLYEHNAISEDEMRELIGRDPVD 451 (648) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhccccccccceEEEeecccchhhHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 899999999999988888887766666542 3455777788765532222 235688999999999999999953 1 Q ss_pred ccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCC-- Q lcl|NC_011057. 451 AGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDD-- 528 (634) Q Consensus 451 ~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~-- 528 (634) .|.+. +.+..++. |. .+ ...+..++..++++...+...+. ...+.+.+|.++... T Consensus 452 ~g~~~-----------~~l~~~~~------~~----~~-~~~~~~~~~~~~~~~~~~a~~eg-~~~e~~~~~~~~~~~g~ 508 (648) T protein:vir:79 452 DGEGR-----------AKMHLQMV------TI----AQ-ATALAALAPTPAGGSSASASGDK-KKKATDNKTKPTNQHGT 508 (648) T ss_pred CCCCc-----------cccccccc------cc----hh-ccccccCCCCCCCCCCCCccccc-cccccCCCCCCCCCCCc Confidence 22110 11111111 00 01 01111111122222221221111 122222333222111 Q ss_pred cccCCCccHHHHHHHHHHHHHHHhhHHhhcCChhHHHHhhCCChHHhhhhcCCCC------------hhHHHHHHhcccc Q lcl|NC_011057. 529 DGTQKAGLESGIVDLMVDRALELVGKRRRGRDRETLARLSGVRERDYHRYMDPVP------------ESEVDRLMSGWDS 596 (634) Q Consensus 529 ~~~~~a~~~~a~vdllv~rALelAGkR~Rt~~R~~~arlr~ip~h~~h~~~~Pv~------------~~~v~rLi~GWd~ 596 (634) ...+...+-..-+..|.--+|+-- | + + +. +..+ ..+||.+..-+- -.--.-+..-|-. 
T Consensus 509 ~~~~~~~~~~~~~~~~~~~~~~~~-~-~-~--~~----~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 578 (648) T protein:vir:79 509 KTSPKKQTNGRHVRYMQEMLLEYT-T-L-N--EA----IKAL-IERYYQYGSKEHLKSINGSLMYTEGRLLELTTQYWGE 578 (648) T ss_pred CCCCccccchhhhhhhhhhhhcch-h-h-h--HH----HhhH-HHHHHHHhHHHHHHhhhhhheeccchhHHHHHHHhhh Confidence 111111111122333333333321 0 0 0 00 0000 112222211000 0000111111221 Q ss_pred cccHHHHHHhCCCH----HHHHHHHHHHH--------HHHHHHHHHHhcC Q lcl|NC_011057. 597 ALDDKILLRLGLDP----GTIRSAVRRKV--------MAELTRPVIDVVA 634 (634) Q Consensus 597 ~ld~~~~a~~g~Dp----~~lr~~v~~~v--------~~~lt~~vvd~~~ 634 (634) ++...--+.. +.||.++...+ -.-.+..|.||+. T Consensus 579 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 624 (648) T protein:vir:79 579 ----EVTEKVRIPFHRMTENLREEVMSTIDKVEGVAEASDIAQAVFDVFT 624 (648) T ss_pred ----hhhceeeeeHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHHHH Confidence 1111111111 11111111110 0112334445444 No 51 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=99.56 E-value=1e-14 Score=97.30 Aligned_cols=407 Identities=14% Similarity=0.123 Sum_probs=217.3 Q ss_pred CCCCCc---c--------eeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHh Q lcl|NC_011057. 1 MAATQS---L--------RLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVG 69 (634) Q Consensus 1 ~~a~~~---l--------r~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvg 69 (634) |--|.- | |+---.+|...+..... ....++.-.+ +-.....+..+.. .++-+.-.|. 
T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~f~~~~~~~~~~~-~~~~~~~~~~--~~~~~~v~~~~al---------~~~~v~~cv~ 68 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQG-SQTGPVSAHG--YLGDSSINDERIL---------QISTVWRCVS 68 (424) T ss_pred CCCCccccccCCCCchHHHHHhhccccccccccch-hhcccccccc--ccccccccHHHhh---------ccHHHHHHHH Confidence 111100 0 00001111111111000 0011121000 0000001111111 2233555677 Q ss_pred hhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCC Q lcl|NC_011057. 70 WRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGA 149 (634) Q Consensus 70 Wr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~ 149 (634) -+++++|.+.|..=+.+.|+|. .+-..++ .+..++..-+.--+...++.+.++.+|-+-|++|+.| .|.. T Consensus 69 ~Ia~~iA~lp~~vy~~~~~~~~-----~~~~~~~-~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i-~r~~--- 138 (424) T protein:vir:18 69 LISTLTACLPLDVFETDQNDNR-----KKVDLSN-PLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV-DRNS--- 138 (424) T ss_pred HHHHhhccCceEEEEeccCCce-----eeecccc-HHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEE-EECC--- Confidence 8999999999988888877332 2111222 2445555556777888999999999999999999986 4533 Q ss_pred CCCcccccccchhceeccHHHHhccCCCcceeeEeC-CCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHH Q lcl|NC_011057. 150 PAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLP-TGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIV 228 (634) Q Consensus 150 ~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP-~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~ 228 (634) +|. ...++++....+.....++...+..- +|...+|.. .|+ |++=.+.+ ....--||+..+...+.--. 
T Consensus 139 ----~G~---~~~L~~l~~~~v~v~~~~~~~~y~~~~~g~~~~~~~-~eV-ihir~~~~-dg~~G~spi~~~~~~i~~~~ 208 (424) T protein:vir:18 139 ----AGD---VISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQ-KEI-FHLKGFGF-TGLVGLSPIAFACKSAGVAV 208 (424) T ss_pred ----CCc---EEEEEEecCcceEEEEcCCeEEEEEEeCCeEEEecc-ccE-EEecCcCC-CCcccccHHHHHHHHHHHHH Confidence 232 23577776666655444444444443 565555544 343 34312222 22345588777655554433 Q ss_pred hhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcccccccccee Q lcl|NC_011057. 229 RTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVI 308 (634) Q Consensus 229 rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiv 308 (634) -..+...+..+.-..-.|||-+|+++. .....+++++.+-+.. .-+++ -=++| T Consensus 209 ~~~~~~~~~f~ng~~~~gil~~~~~~l--------------------~~e~~~~~~~~~~~~~----~~~na---g~~~v 261 (424) T protein:vir:18 209 AMEDQQRDFFANGAKSPQILSTGEKVL--------------------TEQQRSQVEENFKEIA----GGPVK---KRLWI 261 (424) T ss_pred HHHHHHHHHHhccCCcceEEEeCCcCC--------------------CHHHHHHHHHHHHHHh----CCccc---CCcee Confidence 333433333333334456777766531 1124556666654322 11222 11333 Q ss_pred EeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhH--hhhhhhhhhHHHHhHHHHHH Q lcl|NC_011057. 309 AGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSA--WQISDEDVQLHIAPVMEIFC 386 (634) Q Consensus 309 a~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtA--w~i~de~v~~hI~P~~~~i~ 386 (634) + + ..-+++-|.+... 
+.--+++|+..+..||..|.||| .|||.-.+.|.|++ -|....-++..|.|.++.|+ T Consensus 262 l--~--~g~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P~~~~ie 335 (424) T protein:vir:18 262 L--E--AGFSTSAIGVTPQ-DAEMMASRKFQVSELARFFGVPP-HLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWE 335 (424) T ss_pred c--c--CCceEEecCCChh-HHHHHHHHHHhHHHHHHHhCCCH-HHhCCCCCcccccccHHHHHHHHHHHHHHHHHHHHH Confidence 3 2 2345666665432 22337899999999999999999 67787666777554 67777778999999999999 Q ss_pred HHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHH Q lcl|NC_011057. 387 QALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVM 463 (634) Q Consensus 387 ~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~ 463 (634) ++|++.+|-+ .+. ..|.|.||.++|. +.|..+.+ ..+++.|++|.+++|+++|+..-.|=| +- T Consensus 336 ~~ln~~L~~~----~~~--~~~~~~fd~~~ll-r~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD----~~--- 401 (424) T protein:vir:18 336 NSIQRWLIPS----KDV--GRLHAEHNLDGLL-RGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGD----VA--- 401 (424) T ss_pred HHHHhhcCCc----ccc--CCeEEEEechhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC----ee--- Confidence 9999988765 223 3688999999984 34544433 458899999999999999996422211 00 Q ss_pred HHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCC Q lcl|NC_011057. 
464 WAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNL 513 (634) Q Consensus 464 wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~ 513 (634) .+..| +.|+- ..++.....+.++ T Consensus 402 ----~~~~n------~~~l~-----------------~~~~~~~~~~n~a 424 (424) T protein:vir:18 402 ----MRQAQ------YVPIT-----------------DLGTNKEPRNNGA 424 (424) T ss_pred ----eeccC------ccchh-----------------hhhccCCccccCC Confidence 00011 11111 0000000001111 No 52 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=99.55 E-value=9.9e-15 Score=97.36 Aligned_cols=338 Identities=14% Similarity=0.181 Sum_probs=190.0 Q ss_pred eeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcc Q lcl|NC_011057. 75 CSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPD 154 (634) Q Consensus 75 ~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~d 154 (634) +|.+-+..-+=+. .+ ..-+..+.+.=....+...++++.++.+|-+-|++|+.+ .|.. . T Consensus 1 ia~lp~~~~~~~~-------~~------~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i-~r~~-------~ 59 (348) T protein:vir:93 1 MASLPLKMYEDYK-------VV------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLI-ERDI-------Y 59 (348) T ss_pred CcccceEeEecCc-------Cc------ccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEE-EECC-------C Confidence 6665555444111 11 123556666557788899999999999999999999886 4433 2 Q ss_pred cccccchhceeccHHHHhcc--CCCcce--eeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhh Q lcl|NC_011057. 155 GSVRTRQEWYAVSKEEIKKS--NKGSGT--NIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRT 230 (634) Q Consensus 155 g~~~~~~~W~~vt~~Ei~~~--~~~~~~--~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rt 230 (634) |. + -+++.|..+.+... ..+... .+...+|....|.. .| +|++=++++..-..--||+..+...+. 
+ T Consensus 60 G~--~-~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~g~~~~~~~-~e-iih~r~~~~~~~~~G~s~~~~~~~~i~----~ 130 (348) T protein:vir:93 60 HQ--P-SKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHN-MD-MLHFKHIVASNMVQGISPIDVLKNTTD----F 130 (348) T ss_pred Cc--E-EEEEEEcCCceEEEEeCCCcEEEEEEEcCCCeEEEEcc-cc-EEEecCCCCCCceeeccHHHHHHHHHH----H Confidence 32 2 24666665544422 222222 24455677665543 34 455546666555555677766655433 2 Q ss_pred hHHHHHHHHhHh--hhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcccccccccee Q lcl|NC_011057. 231 TKTIANASKSRL--IGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVI 308 (634) Q Consensus 231 tk~I~na~~SRL--~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiv 308 (634) .....+...+.. .+.|++..++.++ .-..+.+.+.+-+. +. ++- . ++| T Consensus 131 ~~~~~~~~~~~~~~~~~~i~~~~~~l~---------------------~e~~~~~~~~~~~~----~~--n~~-~--~~v 180 (348) T protein:vir:93 131 DNAVRTFNLTEMQKPDSFMLKYGSNVS---------------------TEKRQQVLEDFKQY----YE--ENG-G--ILF 180 (348) T ss_pred HHHHHHHHHHhcCCCceeEEecCCCCC---------------------HHHHHHHHHHHHHH----hh--cCC-C--eee Confidence 222222211111 1222332222221 12445555655432 22 111 1 222 Q ss_pred EeechHHhcccceeecCCchhHHH-HHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHH Q lcl|NC_011057. 309 AGVPGEQIKDVKHIRFDNEITEVA-IKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQ 387 (634) Q Consensus 309 a~vP~Ehi~~ikHl~f~~d~te~a-iktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ 387 (634) + + ..-+++.|. ....+.. 
+++|+..+..||..|-||| .|||.+.++|..++.+....-++..|.|.++.|.+ T Consensus 181 l--~--~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~-~~lg~~~~~~~~~~e~~~~~~~~~~l~P~~~~ie~ 253 (348) T protein:vir:93 181 Q--E--PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPS-IFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEE 253 (348) T ss_pred c--C--CCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH Confidence 2 2 233455444 3344444 7899999999999999999 67777778999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHH---HHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHH Q lcl|NC_011057. 388 ALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDE---AKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMW 464 (634) Q Consensus 388 ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~e---A~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~w 464 (634) +|++.+|-+. +.+ ..|-|-||.+.|..- |..+. ...+++.|++|.+++|.++|+..-.+-| T Consensus 254 ~l~~~l~~~~----~~~-~g~~i~fd~~~l~~~-d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD---------- 317 (348) T protein:vir:93 254 EFNRKLLTKT----DRE-KNRYFKFNVKSYLRA-DSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD---------- 317 (348) T ss_pred HHHHhhCCcc----ccc-CcceEEeechhhhcc-CHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcC---------- Confidence 9999988541 222 245577899998542 33332 3558999999999999999996422211 Q ss_pred HHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccc Q lcl|NC_011057. 465 AQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDD 510 (634) Q Consensus 465 A~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d 510 (634) . +++.--+..++.+.-......+|++++++. 
T Consensus 318 ---~------------~~~~~n~~~~~~~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 318 ---K------------PLISGDLYPIDTPLELRKSLKGGDKNVNES 348 (348) T ss_pred ---e------------EeecccccccccchhhcccccCCCCCcCCC Confidence 0 111001111121111111111122111111 No 53 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=99.54 E-value=4.4e-14 Score=93.82 Aligned_cols=388 Identities=14% Similarity=0.140 Sum_probs=200.7 Q ss_pred ceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeEEEeeec Q lcl|NC_011057. 7 LRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRLVASELD 86 (634) Q Consensus 7 lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD 86 (634) .-+.++.|.+. .+..++|+.-..- .+.+. ....--....+ .++-+.-.|.-+++++|.+.+-. + T Consensus 1 M~~f~~~~~~~---------~~~~~~~~~~~~~--~~~~~-~~~~v~~~~al-~~~~V~~~v~~ia~~ia~~p~~~---~ 64 (397) T protein:vir:38 1 MPLLKLNKSHS---------QGFSLNDPDWVNF--LTGGE-AQKYVSADTAL-KNSDIFSLIMQLSGDLAMVRYTS---E 64 (397) T ss_pred Ccchhhhhccc---------CcccCCchhhhhh--hcCCc-CCceechHHhh-ccHHHHHHHHHHHHHHhhCcccc---c Confidence 23344443321 1112223321110 01111 11111111223 35667777888899999988731 1 Q ss_pred ccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhceec Q lcl|NC_011057. 87 ENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYAV 166 (634) Q Consensus 87 ~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~v 166 (634) +..+..+ ..-+.--+...++++.++.+|-+-|++|+.+. |.. +|. 
.-.++.| T Consensus 65 ----------------~~~~~~l-~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~-r~~-------~g~---~~~l~~l 116 (397) T protein:vir:38 65 ----------------SDRSQSI-ISNPSVTANGYSFWQGMFAQLLLDGNCYAYRH-KNT-------NGV---DLSWEYL 116 (397) T ss_pred ----------------ccHHHHH-HhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEE-ECC-------CCc---EEEEEEE Confidence 1223333 34456678899999999999999999998864 432 232 2346666 Q ss_pred cHHHHhcc--CCCcceeeEeC-----CCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHH Q lcl|NC_011057. 167 SKEEIKKS--NKGSGTNIVLP-----TGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASK 239 (634) Q Consensus 167 t~~Ei~~~--~~~~~~~i~lP-----~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~ 239 (634) ....+... ..+....+... .|...+|. ..| ||++=.+.+..-..--||+.+++..+.-..-..+...+..+ T Consensus 117 ~~~~v~i~~~~~~~~~~y~~~~~~~~~~~~~~~~-~~e-iih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ 194 (397) T protein:vir:38 117 RPSQVQPMLLQDGSGLIYNINFDEPAIGYMENVP-AAD-VIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKALK 194 (397) T ss_pred cCceeEEEEcCCCceEEEEEEeccccccceeEec-Ccc-EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 55554322 22222333322 23333333 334 44444444444445678888888777666666666666666 Q ss_pred hHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_011057. 240 SRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDV 319 (634) Q Consensus 240 SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~i 319 (634) .-.+-.|||-+|+.++- -..+++.+.+-. 
.....+.+ -|+|+ + ..+ T Consensus 195 ng~~~~~il~~~~~~~~---------------------e~~~~~~~~~~~--~~~~~n~~-----~~~vl--~----~g~ 240 (397) T protein:vir:38 195 QSVTASAVLTIQKGGLL---------------------DAETRIARSKEI--SKQIHNSD-----GPVVI--D----ALE 240 (397) T ss_pred ccCCccEEEEeCCCCCH---------------------HHHHHHHHHHHH--HhcccccC-----Cceec--C----CCc Confidence 66666788887765432 123334443321 11222222 23443 2 234 Q ss_pred ceeecCCchhHHH-HHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 320 KHIRFDNEITEVA-IKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTL 398 (634) Q Consensus 320 kHl~f~~d~te~a-iktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L 398 (634) +.-.++....+.. +++|+..+..||.+|.||| .+||...+.|.+. .+. ...++..|.|.+..|+++|++.++.. T Consensus 241 ~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~-~~lg~~~~~~~~~-e~~-~~~~~~~l~P~~~~ie~~ln~~l~~~-- 315 (397) T protein:vir:38 241 DYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPD-SYLNGQGDQQSSI-TQI-SGQYAKSLNRYVQAIVGELNDKLHAN-- 315 (397) T ss_pred eEEecCCChhHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCcccHH-HHH-HHHHHHHHHHHHHHHHHHHHHhccCh-- Confidence 4444454444444 8899999999999999999 6666544555433 333 44677789999999999999988764 Q ss_pred HhcCCChhHheeeecCcccccCCCchHHH-HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccch Q lcl|NC_011057. 399 AREGIDPSKYVVWYDASQLTIDPDKSDEA-KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIP 477 (634) Q Consensus 399 ~~eG~d~~~yV~w~DaS~L~~~pd~t~eA-~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~ 477 (634) .+|-+.+. |..|+....++ ..+++.|++|.+++|+++|+..-.+-|. T Consensus 316 -------~~~~~~~~---~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~---------------------- 363 (397) T protein:vir:38 316 -------ISANIRFA---IDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKDL---------------------- 363 (397) T ss_pred -------hccccccc---ccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcc---------------------- Confidence 22322221 11222222222 4588999999999999999853111110 Q ss_pred hhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCC Q lcl|NC_011057. 
478 MLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPD 521 (634) Q Consensus 478 ~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePD 521 (634) |.. +....+ ........+++ .++....+.+.+|+ T Consensus 364 ---~~~----~~~~~~-~~~~~~~~~g~--~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 364 ---PDP----EKEPQQ-AIQLIQQEGGE--NDGNNSDERGSDPE 397 (397) T ss_pred ---ccc----cccccc-cccccccccCC--CCCCCCCCCCCCCC Confidence 000 000000 00111111111 11111222333333 No 54 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.54 E-value=1.4e-14 Score=96.60 Aligned_cols=406 Identities=13% Similarity=0.067 Sum_probs=194.2 Q ss_pred HHHHHhhhhhHHHHHhhhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcC---------CcchHHHHH Q lcl|NC_011057. 54 AWEAVDLVGELRYYVGWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIAD---------GTLGQAALT 124 (634) Q Consensus 54 AW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iag---------G~lGQaqL~ 124 (634) -=++.+.-+-++=.|.=+++.++.+-|.+-.-+...+ . +. .+...+.+.......-- -..-+.+++ T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~-~-~~---~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~ 75 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAED-P-DR---DGEQYERVWDFWFGDDSNWQVGPMESERATATNVL 75 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCccc-c-cc---hhhhhhhHHHHhhccCCCccccchhhHhhHHHHHH Confidence 1112222345666777788888887765422221100 0 01 11111222221111111 123567899 Q ss_pred HHHHHhhccccceEEEEEEecCCCCCCCcccccccchhceeccHHHHhccCCCcc---------eeeEe----------- Q lcl|NC_011057. 125 KRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSG---------TNIVL----------- 184 (634) Q Consensus 125 kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~---------~~i~l----------- 184 (634) +.++.+|.+-|.+||.+. |... |. .-.++.|....|.....+.. .-+.. 
T Consensus 76 ~~~~~~l~l~Gn~~i~~~-r~~~-------G~---~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 144 (467) T protein:vir:31 76 QTAWTDYEAIGWLTIEIL-TQTD-------GT---PTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNG 144 (467) T ss_pred HHHHHHHHhcCCeEEEEE-ECCC-------Cc---EEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeeccc Confidence 999999999999999764 5333 21 12355555444433221110 00000 Q ss_pred ------------CCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhh----Ccee Q lcl|NC_011057. 185 ------------PTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIG----NGVL 248 (634) Q Consensus 185 ------------P~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~g----nGvl 248 (634) +.|....| ..+=+|++=.+++..-..--||+.+++..+. +........++.+.+ .||| T Consensus 145 ~~~~~~~~~~~~~~~~~~~~--~~~diih~r~~~~~~~~~G~s~~~~~~~~i~----~~~~~~~~~~~~f~ng~~p~gil 218 (467) T protein:vir:31 145 DLDPVFVDADDGSTGTSVSN--PANELIFKRNHSPLYPHYGAPDIIPAVKTIR----GDSAAQDYNIDFFENDGVPRIAI 218 (467) T ss_pred ceeeeeeeeccccccceeEe--ccccEEEecCCCCCCCcccccHHHHHHHHHH----HHHHHHHHHHHHHhccCCCceEE Confidence 11222222 2233455556777777778899999877653 333333333333332 3677 Q ss_pred eecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHh----hcccC-ccccccccceeEeechHHhc---ccc Q lcl|NC_011057. 249 FVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAE----TAVED-EDSQAAFIPVIAGVPGEQIK---DVK 320 (634) Q Consensus 249 fvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~----tai~D-e~S~AA~vPiva~vP~Ehi~---~ik 320 (634) .++..+-=| -..+.+.+.+-.--+ ..+.. ++...+--++++....+... +++ T Consensus 219 ~~~~~~l~~--------------------e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~ 278 (467) T protein:vir:31 219 IVKGAELTE--------------------KGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLE 278 (467) T ss_pred EecCcCCCH--------------------HHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEE Confidence 776433110 123333333321111 11110 11112222333333322111 112 Q ss_pred eeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchh-hHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 
321 HIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHW-SAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLA 399 (634) Q Consensus 321 Hl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~Nhw-tAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~ 399 (634) -|+-.+..+.--+++|+..+..||..|.||| .|||+..++|.. ++-++...-.+..|.|.+..|+++|++.++.. T Consensus 279 ~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp-~~lG~~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~--- 354 (467) T protein:vir:31 279 PLTVGIDEEASFLEFRGRNEHDILKVHDVPP-VIAGVVESGAFSTDAEEQRKEFAEETIQPKQHDFGELLYELVHKQ--- 354 (467) T ss_pred eccccChhhHHHHHHHHHHHHHHHHHhCCCH-HHcccCCCCCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch--- Confidence 2222233344558999999999999999999 678986666653 46778887788899999999999999988865 Q ss_pred hcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccc Q lcl|NC_011057. 400 REGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLI 476 (634) Q Consensus 400 ~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li 476 (634) +.+...|-|-||.+.|..- |..+.+ ..+++.|++|.+++|+++|+..- .| + .+. T Consensus 355 --~~~~~~~~i~f~~~~l~~~-d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi--~d----~--------------~~~ 411 (467) T protein:vir:31 355 --GLDAPDWTIEFELAKPDTK-LQDVEIASQRVQAMQGLLTVNELRDEFGFEPF--PE----E--------------HVY 411 (467) T ss_pred --hhccCCceEEEecchhhcc-CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--Cc----c--------------ccc Confidence 3334567778899988644 322222 44889999999999999999521 11 0 000 Q ss_pred hhhhhhhhhhhhcccCCC----CCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCccc Q lcl|NC_011057. 477 PMLAPLIAGVLQQIEFPQ----QQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGT 531 (634) Q Consensus 477 ~~laPll~p~~q~~~~P~----p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~ 531 (634) +. -++. +.+..-..|. 
++....+ +++.++..+.-.++-+.+++..-+..+.+ T Consensus 412 ~~-~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (467) T protein:vir:31 412 GG-ETLV-AEVTGGSGPGGGIGDQIEQLV-EDRADEIIDSYQADLETEQLIEIGANADS 467 (467) T ss_pred CC-cccc-cccccccCCCCcccCcCCCCC-CCcccchHhhhhhccccchhhhhccccCC Confidence 00 0000 0111111110 0000000 00000000000001111111111111111 No 55 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=99.53 E-value=4e-14 Score=94.06 Aligned_cols=397 Identities=17% Similarity=0.176 Sum_probs=200.4 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.= +...||...+.+.... +..... . .....+.+... .+. ..+.+.-.+.-+++++|.+.+ T Consensus 1 Mg~---~~~f~~k~~~~~~~~~-----~~~~~~--~-~~~~~~~~~~~-------~~~-~~~~V~~~I~~ia~~iA~~p~ 61 (403) T protein:vir:80 1 MGL---FNFFRRKTRSEPTNAI-----SWFLTQ--E-AYDTLAIPGYT-------RLS-DNPEVRMAVHKIAELISSMTI 61 (403) T ss_pred Ccc---cccccccccccccchh-----hhhccc--c-cccccccchhh-------hhh-hhHHHHHHHHHHHHhhhhCce Confidence 432 2223332111111000 000000 0 00000000001 111 135566777889999999777 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhcc--ccceEEEEEEecCCCCCCCcccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTV--PGELWIVILTRPVKGAPAQPDGSVR 158 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtV--pGE~wi~il~rp~~~~~~~~dg~~~ 158 (634) ..-.-.++ |....++ .+..+...=....+...++.+.++..+-. -|.+||.+ .|... 
| + T Consensus 62 ~~~~~~~~-----g~~~~~~----~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~-~~~~~-------g--~ 122 (403) T protein:vir:80 62 HLMQNTDN-----GDIRIKN----ELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFP-KYTTS-------G--L 122 (403) T ss_pred EEEEecCC-----ceeecCC----hHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEE-EEcCC-------C--c Confidence 66444433 3222222 23344444477888999999999987554 46666654 33221 2 1 Q ss_pred cchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEee-CCCcccccCCccchhhhhHHHHHHHhhhHHHHHH Q lcl|NC_011057. 159 TRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVW-IPKPRKASEPDSPVRAVLDSIREIVRTTKTIANA 237 (634) Q Consensus 159 ~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW-~P~prra~eaDSPvra~l~~LrEI~rttk~I~na 237 (634) ..+++.|...-+......++..|..- | .+|. .|=+|++- ++.|.....--||+.++.+.+.-.....+...+. T Consensus 123 -~~~L~~l~p~~v~~~~~~~g~~~~y~-~--~~~~--~~eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 196 (403) T protein:vir:80 123 -IDELIPLAPSKVSFVDTDTGYQIWYQ-G--KAYN--YDEVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSF 196 (403) T ss_pred -EEEEEEEcCCeeEEEEcCCceEEEEe-e--cccc--hhhEEEEeccCCCcCccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 23455655444432222222222221 1 1232 23344443 5677777777788777666655555444555555 Q ss_pred HHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhc Q lcl|NC_011057. 238 SKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIK 317 (634) Q Consensus 238 ~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~ 317 (634) .+.-.+..|||-+|+.++=. ..+++++-+++- +... .-+--++++..+..... 
T Consensus 197 ~~ng~~p~~il~~~~~~~~~---------------------~~~~~~~~~~~~----~~~~--~~~g~~~~~~~~~~~~~ 249 (403) T protein:vir:80 197 MSGKYMPSLIVKVDAATAEL---------------------SSEEGRNAVFKK----YLEA--SEAGQPWIIPAELLDVE 249 (403) T ss_pred HhccCCcceEEEeCCCCChH---------------------HHHHHHHHHHHH----Hhhh--hhcCCeeeecccccccc Confidence 55445556677677654311 122233333211 1111 11233344445544445 Q ss_pred ccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 318 DVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVT 397 (634) Q Consensus 318 ~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~ 397 (634) +++-+.+ .+.--+++|+..+..||..|.||| .+||+++..+. ... .-++..|.|.++.|+++|++.+|.+ T Consensus 250 ~~~~l~~---~d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~~~---~~~--~f~~~~l~P~~~~ie~~l~~kll~~- 319 (403) T protein:vir:80 250 QVKPLSL---KDLAIHETVELDKRTVAGIFGVPA-FLLGVGKYDKD---EYN--NFINSTILPIAKGIEQELTRKLLIS- 319 (403) T ss_pred eeccCCH---HHHHHHHHHHHhHHHHHHHhCCCH-HHcCCCCccHH---HHH--HHHHHHHHHHHHHHHHHHHHhccCC- Confidence 5544433 233447899999999999999999 77787543322 111 2556789999999999999988743 Q ss_pred HHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcc Q lcl|NC_011057. 398 LAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPT 474 (634) Q Consensus 398 L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~ 474 (634) .+|-|.||.+.|. +.|..+.+ ..+++.|++|.+++|+++|+....|-| + .-.... T Consensus 320 --------~~~~~~f~~~~ll-~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd----~---------~~~~~n 377 (403) T protein:vir:80 320 --------PDLYFKFNPRSLY-AYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLS----E---------LVILEN 377 (403) T ss_pred --------CCcEEEeechhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----e---------Eeeccc Confidence 4577899999995 44544433 458899999999999999997543322 0 000111 Q ss_pred cchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCC Q lcl|NC_011057. 
475 LIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTE 523 (634) Q Consensus 475 Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe 523 (634) + .||- ..+++.+ ...+++++++-.+| T Consensus 378 ~----~pl~-----------------~~~~~~~--~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 378 Y----IPLD-----------------KIGDQNK--LKGGEKGGADGQTD 403 (403) T ss_pred c----cchh-----------------hccchhh--ccCCCCCCCCCCCC Confidence 1 1110 0011000 01111111111111 No 56 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=99.53 E-value=5.8e-14 Score=93.16 Aligned_cols=463 Identities=12% Similarity=0.114 Sum_probs=207.9 Q ss_pred CCCCCcceeEec---cC----------CCCccch----hhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhh Q lcl|NC_011057. 1 MAATQSLRLVRR---PK----------GGRPAPS----RALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGE 63 (634) Q Consensus 1 ~~a~~~lr~vrr---p~----------g~~~a~~----ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgE 63 (634) .---.+-||+-. |+ +..+|.. .++-+...-+++|+-+. ...++.. -=+.|..-+- T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~-------~~~~l~~-~l~~~~~n~i 96 (563) T protein:vir:99 25 IDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMK-------NEHNLHD-VLKKFGNNPI 96 (563) T ss_pred ccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCC-------CcccHHH-HHHHhhcchH Confidence 000000000000 00 1111110 01111111111111000 0001110 0001100111 Q ss_pred HHHHHhhhhhcee---------------eeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCC----cchHHHHH Q lcl|NC_011057. 
64 LRYYVGWRASSCS---------------RCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADG----TLGQAALT 124 (634) Q Consensus 64 LryyvgWr~~s~S---------------r~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG----~lGQaqL~ 124 (634) |+--+.=+++.++ .++|+-...+ +..++..+..++...+..+.-- ...-.+++ T Consensus 97 ~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~--------~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~ 168 (563) T protein:vir:99 97 LNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAE--------PGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFC 168 (563) T ss_pred HHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCC--------cchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHH Confidence 2211222222222 3444432222 1222222333444444333221 12345899 Q ss_pred HHHHHhhccccceEEEEE-EecCCCCCCCcccccccchhceeccHHHHhccCCCcce-------eeEeCCCCcccccCCC Q lcl|NC_011057. 125 KRVVECLTVPGELWIVIL-TRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGT-------NIVLPTGEEHEFVKGT 196 (634) Q Consensus 125 kR~~~~LtVpGE~wi~il-~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~-------~i~lP~g~~h~~~~~~ 196 (634) ++++.++-+-|.+++.++ .|.+.+. ....+.|....|......++. -+...+|......... T Consensus 169 ~~lv~~lll~Gn~~~~~~~~rd~~G~----------~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~ 238 (563) T protein:vir:99 169 KKIVRDTYIYDQVNFEKVFNKNNKTK----------LEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSR 238 (563) T ss_pred HHHHHHHHhcCCeEEEEEEEecCCCc----------eEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCc Confidence 999999999999888654 4543211 233555555555433222221 1334456555445567 Q ss_pred CeEEEeeCCCcccc--cCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCcc Q lcl|NC_011057. 197 DIIFRVWIPKPRKA--SEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPL 274 (634) Q Consensus 197 D~~~RvW~P~prra--~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~ 274 (634) |+++++.+|.+... 
..--||+.++...+.=..-+.+...+..+.-..-.|||-+|....+ T Consensus 239 evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~l------------------ 300 (563) T protein:vir:99 239 ELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQ------------------ 300 (563) T ss_pred ceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCC------------------ Confidence 88888888876533 2356888887777766666666666666666666778887754333 Q ss_pred ccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCchhHH-HHHHHHHHHHHHhhhccCChHH Q lcl|NC_011057. 275 VGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEV-AIKTRNDAIARLAMGLDVSPER 353 (634) Q Consensus 275 ~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~-aiktR~daI~rlA~~~D~~pE~ 353 (634) ..-+.+.|++.+- .++.-. .-+--+|+|+. +.++...+.....+. -+++|+..+..||..|-||| . T Consensus 301 -s~e~~~~~~~~~~----~~~~G~-~nagk~~~vl~------~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp-~ 367 (563) T protein:vir:99 301 -SQHALENFKREWK----SSLSGI-NGSWQIPVVMA------DDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDP-A 367 (563) T ss_pred -CHHHHHHHHHHHH----HHhccc-cccccceEEcC------CCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCH-H Confidence 1123444554443 222111 11223565542 234444444443333 48999999999999999999 8 Q ss_pred hhcccc-----------CcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCC Q lcl|NC_011057. 354 LLGLGS-----------QTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPD 422 (634) Q Consensus 354 LLGlgs-----------~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd 422 (634) +||+.. +.|..++.+....-++.-|.|.+..|+++|++.+|.. 
.| .+|.|+|+-.+...+-+ T Consensus 368 ~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~----~~---~~~~~~f~r~D~~~~~e 440 (563) T protein:vir:99 368 EIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISE----YG---DKYTFQFVGGDTKSATD 440 (563) T ss_pred HccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh----cc---cccEEEeccCCHHHHHH Confidence 889743 3466677888888899999999999999999998864 22 46888886555443311 Q ss_pred chHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhh-hhhhhh---------------- Q lcl|NC_011057. 423 KSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPML-APLIAG---------------- 485 (634) Q Consensus 423 ~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~l-aPll~p---------------- 485 (634) .. +...+..+|++|.+++|+++||.--.|-|. -+.|.. .++.+. T Consensus 441 ~~-~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (563) T protein:vir:99 441 KL-NILKLETQIFKTVNEAREEQGKKPIEGGDI------------------ILDASFLQGTAQLQQDKQYNDGKQKERLQ 501 (563) T ss_pred HH-HHHHHhcCCccCHHHHHHHhCCCCCCCcce------------------eecccccccccccccccCCCccccchhhh Confidence 11 112357789999999999999964333220 011110 111100 Q ss_pred -hhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCccHHHHHHHHHHHHHHHhhHHhhcCChhHH Q lcl|NC_011057. 486 -VLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGLESGIVDLMVDRALELVGKRRRGRDRETL 564 (634) Q Consensus 486 -~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~~~a~vdllv~rALelAGkR~Rt~~R~~~ 564 (634) ..+...-|...+...+..+..+++.+.+..++.+-+ +...+. ..+ -.|.-.+..+.++. T Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~--~~~----------------~~~~~~~~~~~~~~ 561 (563) T protein:vir:99 502 MMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGD--DNVYRT--QTS----------------NKGQGRKGEKSSDF 561 (563) T ss_pred hcccccCCCCCCCCCCCCCCCCCCccccccccccccc--cccccc--cCc----------------cccccccCcCcccc Confidence 000011111110000000000000000000000000 000000 000 00010110000110 Q ss_pred HH Q lcl|NC_011057. 565 AR 566 (634) Q Consensus 565 ar 566 (634) .. 
T Consensus 562 ~~ 563 (563) T protein:vir:99 562 KH 563 (563) T ss_pred cC Confidence 00 No 57 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=99.53 E-value=5.8e-14 Score=93.16 Aligned_cols=463 Identities=12% Similarity=0.114 Sum_probs=207.9 Q ss_pred CCCCCcceeEec---cC----------CCCccch----hhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhh Q lcl|NC_011057. 1 MAATQSLRLVRR---PK----------GGRPAPS----RALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGE 63 (634) Q Consensus 1 ~~a~~~lr~vrr---p~----------g~~~a~~----ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgE 63 (634) .---.+-||+-. |+ +..+|.. .++-+...-+++|+-+. ...++.. -=+.|..-+- T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~-------~~~~l~~-~l~~~~~n~i 96 (563) T protein:vir:95 25 IDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMK-------NEHNLHD-VLKKFGNNPI 96 (563) T ss_pred ccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCC-------CcccHHH-HHHHhhcchH Confidence 000000000000 00 1111110 01111111111111000 0001110 0001100111 Q ss_pred HHHHHhhhhhcee---------------eeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCC----cchHHHHH Q lcl|NC_011057. 64 LRYYVGWRASSCS---------------RCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADG----TLGQAALT 124 (634) Q Consensus 64 LryyvgWr~~s~S---------------r~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG----~lGQaqL~ 124 (634) |+--+.=+++.++ .++|+-...+ +..++..+..++...+..+.-- ...-.+++ T Consensus 97 ~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~--------~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~ 168 (563) T protein:vir:95 97 LNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAE--------PGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFC 168 (563) T ss_pred HHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCC--------cchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHH Confidence 2211222222222 3444432222 1222222333444444333221 12345899 Q ss_pred HHHHHhhccccceEEEEE-EecCCCCCCCcccccccchhceeccHHHHhccCCCcce-------eeEeCCCCcccccCCC Q lcl|NC_011057. 
125 KRVVECLTVPGELWIVIL-TRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGT-------NIVLPTGEEHEFVKGT 196 (634) Q Consensus 125 kR~~~~LtVpGE~wi~il-~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~-------~i~lP~g~~h~~~~~~ 196 (634) ++++.++-+-|.+++.++ .|.+.+. ....+.|....|......++. -+...+|......... T Consensus 169 ~~lv~~lll~Gn~~~~~~~~rd~~G~----------~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~ 238 (563) T protein:vir:95 169 KKIVRDTYIYDQVNFEKVFNKNNKTK----------LEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSR 238 (563) T ss_pred HHHHHHHHhcCCeEEEEEEEecCCCc----------eEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCc Confidence 999999999999888654 4543211 233555555555433222221 1334456555445567 Q ss_pred CeEEEeeCCCcccc--cCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCcc Q lcl|NC_011057. 197 DIIFRVWIPKPRKA--SEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPL 274 (634) Q Consensus 197 D~~~RvW~P~prra--~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~ 274 (634) |+++++.+|.+... ..--||+.++...+.=..-+.+...+..+.-..-.|||-+|....+ T Consensus 239 evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~l------------------ 300 (563) T protein:vir:95 239 ELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQ------------------ 300 (563) T ss_pred ceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCC------------------ Confidence 88888888876533 2356888887777766666666666666666666778887754333 Q ss_pred ccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCchhHH-HHHHHHHHHHHHhhhccCChHH Q lcl|NC_011057. 275 VGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEV-AIKTRNDAIARLAMGLDVSPER 353 (634) Q Consensus 275 ~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~-aiktR~daI~rlA~~~D~~pE~ 353 (634) ..-+.+.|++.+- .++.-. .-+--+|+|+. +.++...+.....+. -+++|+..+..||..|-||| . 
T Consensus 301 -s~e~~~~~~~~~~----~~~~G~-~nagk~~~vl~------~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp-~ 367 (563) T protein:vir:95 301 -SQHALENFKREWK----SSLSGI-NGSWQIPVVMA------DDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDP-A 367 (563) T ss_pred -CHHHHHHHHHHHH----HHhccc-cccccceEEcC------CCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCH-H Confidence 1123444554443 222111 11223565542 234444444443333 48999999999999999999 8 Q ss_pred hhcccc-----------CcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCC Q lcl|NC_011057. 354 LLGLGS-----------QTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPD 422 (634) Q Consensus 354 LLGlgs-----------~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd 422 (634) +||+.. +.|..++.+....-++.-|.|.+..|+++|++.+|.. .| .+|.|+|+-.+...+-+ T Consensus 368 ~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~----~~---~~~~~~f~r~D~~~~~e 440 (563) T protein:vir:95 368 EIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISE----YG---DKYTFQFVGGDTKSATD 440 (563) T ss_pred HccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh----cc---cccEEEeccCCHHHHHH Confidence 889743 3466677888888899999999999999999998864 22 46888886555443311 Q ss_pred chHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhh-hhhhhh---------------- Q lcl|NC_011057. 423 KSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPML-APLIAG---------------- 485 (634) Q Consensus 423 ~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~l-aPll~p---------------- 485 (634) .. +...+..+|++|.+++|+++||.--.|-|. -+.|.. .++.+. T Consensus 441 ~~-~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (563) T protein:vir:95 441 KL-NILKLETQIFKTVNEAREEQGKKPIEGGDI------------------ILDASFLQGTAQLQQDKQYNDGKQKERLQ 501 (563) T ss_pred HH-HHHHHhcCCccCHHHHHHHhCCCCCCCcce------------------eecccccccccccccccCCCccccchhhh Confidence 11 112357789999999999999964333220 011110 111100 Q ss_pred -hhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCccHHHHHHHHHHHHHHHhhHHhhcCChhHH Q lcl|NC_011057. 
486 -VLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGLESGIVDLMVDRALELVGKRRRGRDRETL 564 (634) Q Consensus 486 -~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~~~a~vdllv~rALelAGkR~Rt~~R~~~ 564 (634) ..+...-|...+...+..+..+++.+.+..++.+-+ +...+. ..+ -.|.-.+..+.++. T Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~--~~~----------------~~~~~~~~~~~~~~ 561 (563) T protein:vir:95 502 MMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGD--DNVYRT--QTS----------------NKGQGRKGEKSSDF 561 (563) T ss_pred hcccccCCCCCCCCCCCCCCCCCCccccccccccccc--cccccc--cCc----------------cccccccCcCcccc Confidence 000011111110000000000000000000000000 000000 000 00010110000110 Q ss_pred HH Q lcl|NC_011057. 565 AR 566 (634) Q Consensus 565 ar 566 (634) .. T Consensus 562 ~~ 563 (563) T protein:vir:95 562 KH 563 (563) T ss_pred cC Confidence 00 No 58 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=99.52 E-value=1.6e-13 Score=90.71 Aligned_cols=480 Identities=14% Similarity=0.114 Sum_probs=229.2 Q ss_pred CC-CCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeee Q lcl|NC_011057. 1 MA-ATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCR 79 (634) Q Consensus 1 ~~-a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (634) |- ---|.|-+=+|+.-... ...++++.. ..+ ......--+|..=+ ++|.+-+-++--|.=+++.++++. 
T Consensus 1 ~~~~~~~i~s~~~~~~i~~~-----~~~s~~~~~--~~~--~~~~~pp~~~~~la-~l~~~n~~v~scI~~ia~~IA~l~ 70 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKRE-----EVESQALGE--TRF--EEYVEPKVNPLVLL-SLLQVNPYHASACSIKANDIIRTG 70 (542) T ss_pred Cccccccccccccchhhhhc-----ccccccccc--ccC--CccccCCCCHHHHH-HHHhhcHHHHHHHHHHHHHHhhCc Confidence 11 01122222223221100 011111100 000 00011111222211 455565666777788888888887 Q ss_pred EEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCccccccc Q lcl|NC_011057. 80 LVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRT 159 (634) Q Consensus 80 L~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~ 159 (634) +-. +.++. . .+..-+....+.-.++++.++.+|-+-|.+|+.+ .|... |. T Consensus 71 ~~~---~~~~~-------------~---~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i-~rd~~-------G~--- 120 (542) T protein:vir:41 71 YIL---EGDDE-------------G---VVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEV-VRDDR-------GD--- 120 (542) T ss_pred eee---ecccc-------------h---hhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEE-EEcCC-------Cc--- Confidence 754 21110 1 1223345566777889999999999999999976 45443 21 Q ss_pred chhceeccHHHHhccCCCcceeeEeCCCC--------------------cccccCCCCeEEEeeCCCcccccCCccchhh Q lcl|NC_011057. 160 RQEWYAVSKEEIKKSNKGSGTNIVLPTGE--------------------EHEFVKGTDIIFRVWIPKPRKASEPDSPVRA 219 (634) Q Consensus 160 ~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~--------------------~h~~~~~~D~~~RvW~P~prra~eaDSPvra 219 (634) ....+.|....|.....+++. +...++. .....+..| +|++=++++..-..--||+.+ T Consensus 121 ~~~L~~l~~~~v~v~~d~~~~-~~~~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~e-IiHir~~~~~~~~~Glspi~~ 198 (542) T protein:vir:41 121 PIRFEYIPSHTIRVHKDGSRY-RQTWDGVNITHFKDYRYEGEINPETGEDQDSVGANE-LVFIHIPSPVCSYYGVPRYVS 198 (542) T ss_pred EEEEEEEcCcceEEEEcCCee-EeeecCCcceeEEeecccccccccccccccccCccc-EEEecCCCCCCCcccccHHHH Confidence 223555555555443322211 1111111 111122234 444544555555566789988 Q ss_pred hhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcc Q lcl|NC_011057. 
220 VLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDED 299 (634) Q Consensus 220 ~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~ 299 (634) ++..+.=-.-..+...+..+.-.+..|||.+|..+.=-... .......+.+.|++.+- ..+. .. T Consensus 199 ~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~-----------~~~~~~e~~~~lk~~~~----~~~~-g~ 262 (542) T protein:vir:41 199 AAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEE-----------DPDGNPTGRTVIQALIE----DNFK-HL 262 (542) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCcccccccc-----------ccccCHHHHHHHHHHHH----HHHh-hh Confidence 88776444444444444444445556788888765321000 00112224444444443 2222 11 Q ss_pred ccccccceeEeechHHhcccceeecCCchh-HHHHHHHHHHHHHHhhhccCChHHhhcccc--CcchhhHhhhhhhhhhH Q lcl|NC_011057. 300 SQAAFIPVIAGVPGEQIKDVKHIRFDNEIT-EVAIKTRNDAIARLAMGLDVSPERLLGLGS--QTNHWSAWQISDEDVQL 376 (634) Q Consensus 300 S~AA~vPiva~vP~Ehi~~ikHl~f~~d~t-e~aiktR~daI~rlA~~~D~~pE~LLGlgs--~~NhwtAw~i~de~v~~ 376 (634) .-.+-.|+|+..|++--+.++...+..... .--++.|+..+..||..|-||| .+||+.. ..|..++.+....-++. T Consensus 263 ~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp-~~lG~~~~~t~n~sn~Eq~~~~f~~~ 341 (542) T protein:vir:41 263 KEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDP-YRLGIADTGPLGGNFAEVTRRTYYES 341 (542) T ss_pred hcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCH-HHhCcCCCcccccccHHHHHHHHHHH Confidence 224457889999886656666555554333 3457899999999999999999 6788843 34778889999999999 Q ss_pred HHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHH-hCCCccccCCC Q lcl|NC_011057. 377 HIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKY-LGLGDDAGYDF 455 (634) Q Consensus 377 hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~-~Gl~ed~~yd~ 455 (634) -|.|.+..|+++|++.++.. .+. .|.|.||..+|.. .|+...+..+++.|++|.+++|.. 
.|++ -+.| T Consensus 342 tL~P~~~~ie~~ln~~L~~~----~~~---~~~~~f~~~~ll~-~d~~~~~~~~v~~GilT~NE~Re~L~g~~--pgdd- 410 (542) T protein:vir:41 342 VVRPQQNIISSILTDFFQVK----FNP---KTRFKFNDETLLE-SDSVRNCALLVQSGVLTPAEARERLFGLD--GGPD- 410 (542) T ss_pred HHHHHHHHHHHHHHhhcccc----cCC---ceEEEecchhhcc-hHHHHHHHHHHhCCCCCHHHHHHhhCCCC--CCCc- Confidence 99999999999999866543 232 4678999999875 466666667889999999999974 4663 2222 Q ss_pred CCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhccc---CCCCCCCCCCC-CCCCCcccc---------CCCCCCCCCCC Q lcl|NC_011057. 456 TTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIE---FPQQQQAIDSG-GNEDTSDDD---------NLDDGEHEPDT 522 (634) Q Consensus 456 ~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~---~P~p~~a~~~~-~~~~~~~d~---------~~~~~~~ePDT 522 (634) +.|.|.-..+- + ++..+ .+....+.... ...+++-++ ...+..++|.- T Consensus 411 -----------------~~l~p~~~~~~-~-~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (542) T protein:vir:41 411 -----------------IFMVPSKGAAK-S-VKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKKKKIDESLA 471 (542) T ss_pred -----------------ccccccccccc-c-cccCCcCCCCCchhhhhhcccccCccccccccccccchhhcccccchhh Confidence 11222111100 0 00000 00000000000 000000000 00011111111 Q ss_pred C--CCCCCc---------------------ccCCCcc-HHHHHHHHHHHHHHHhhHHhhcCChhHHHHhhCCChHHhhhh Q lcl|NC_011057. 523 E--DDQDDD---------------------GTQKAGL-ESGIVDLMVDRALELVGKRRRGRDRETLARLSGVRERDYHRY 578 (634) Q Consensus 523 e--~d~~~~---------------------~~~~a~~-~~a~vdllv~rALelAGkR~Rt~~R~~~arlr~ip~h~~h~~ 578 (634) | .++-++ ..+++-+ .+.--+-|+....| +-..|. .|-++++ T Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~----~~~~~~~ 536 (542) T protein:vir:41 472 EFRAEAYEAGKKMLIIGGDMGSMSALNQGVSVIPSKPLNLERYEELLEASVE-----------DMIGRI----RHYLYKV 536 (542) T ss_pred hhHHhHHhcCceEEEeecCchhhhhhhccceeccCCCcChHHHHHHHHhhHH-----------HHHHHH----HHHHHHH Confidence 0 011110 0011000 00000000000011 011111 1333333 Q ss_pred cCCCChhHHHHHHhccccc Q lcl|NC_011057. 
579 MDPVPESEVDRLMSGWDSA 597 (634) Q Consensus 579 ~~Pv~~~~v~rLi~GWd~~ 597 (634) + ||-.. T Consensus 537 ~-------------~~~~~ 542 (542) T protein:vir:41 537 I-------------GWREL 542 (542) T ss_pred h-------------hhccC Confidence 3 34333 No 59 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=99.51 E-value=2.1e-14 Score=95.59 Aligned_cols=477 Identities=11% Similarity=0.084 Sum_probs=214.7 Q ss_pred CCCCCcceeEeccCCCCc--cchhhh-hhhhcc-----------CCchhhhh-----hhhcccCccccccHHHHHHHhhh Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRP--APSRAL-TAASQP-----------LPDPSQVF-----SKSTGISRNSDWQTDAWEAVDLV 61 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~--a~~ral-~aAs~~-----------itdp~~~~-----~~~~~~~~~~~WQ~eAW~~yd~V 61 (634) |+--..-|+.+.-.-+-. -+-.+. .|.++| .+.|.-.. ........++ +=.+.+.++ T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~n---piv~~~I~~- 102 (576) T protein:vir:96 27 IDDGLQANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNN---PILNAIILT- 102 (576) T ss_pred cccChhHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhhcC---HHHHHHHHH- Confidence 110000011100000000 000000 000111 00110000 0000000011 113444444 Q ss_pred hhHHHHHhhhhhceee-----------eeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCC----cchHHHHHHH Q lcl|NC_011057. 62 GELRYYVGWRASSCSR-----------CRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADG----TLGQAALTKR 126 (634) Q Consensus 62 gELryyvgWr~~s~Sr-----------~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG----~lGQaqL~kR 126 (634) +++++|. +-|.+.-.+.+ +.++. ++..+...+.+.+....-- ...-.++++. 
T Consensus 103 ---------ia~~vA~~~~~~~~~~~~~~~~i~lk~~~-~~~~~---~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~ 169 (576) T protein:vir:96 103 ---------RSNQVAMYCQPSRYNERGLGFEVRMRDLD-AEPGK---KEKEEIKRIENFILNTGRDKDIDRDSFQSFCRK 169 (576) T ss_pred ---------HHHHHHhhhhhhhhccccccceeEEecCc-Cccch---hhhHhhhhHHhhHhhccCCCCCccccHHHHHHH Confidence 4444443 11222223322 22221 1111122233333322211 1234579999 Q ss_pred HHHhhccccceEEEEEE-ecCCCCCCCcccccccchhceeccHHHHhccCCCccee-------eEeCCCCcccccCCCCe Q lcl|NC_011057. 127 VVECLTVPGELWIVILT-RPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTN-------IVLPTGEEHEFVKGTDI 198 (634) Q Consensus 127 ~~~~LtVpGE~wi~il~-rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~-------i~lP~g~~h~~~~~~D~ 198 (634) ++.+|-+-|.+|+.++. |.+. | + .-.++.|...-|......+|.. +...+|.........|+ T Consensus 170 lv~dlll~Gna~~~i~~~rd~~-------g--~-~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~di 239 (576) T protein:vir:96 170 IVRDTYTYDQVNFEKVFNKKNA-------T--T-MDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREM 239 (576) T ss_pred HHHHHHhcCCeEEEEEEecCCC-------C--c-eEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccce Confidence 99999999999998764 3211 1 1 2346666665554333332221 22234443333445677 Q ss_pred EEEeeCCCcccc--cCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCcccc Q lcl|NC_011057. 199 IFRVWIPKPRKA--SEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVG 276 (634) Q Consensus 199 ~~RvW~P~prra--~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g 276 (634) ++++.+|.+... ..--||+.++...+.-..-+.+...+..+.-..-.|||.+|....+ . 
T Consensus 240 i~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~l-------------------s 300 (576) T protein:vir:96 240 AMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQ-------------------S 300 (576) T ss_pred EEEeecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCC-------------------C Confidence 777888876432 3457999998888877777777777777766777888888754433 1 Q ss_pred chHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCch-hHHHHHHHHHHHHHHhhhccCChHHhh Q lcl|NC_011057. 277 EPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEI-TEVAIKTRNDAIARLAMGLDVSPERLL 355 (634) Q Consensus 277 ~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~-te~aiktR~daI~rlA~~~D~~pE~LL 355 (634) .-..+.|.+.+- .++.-. .-+--+|+|+. +.++...+.... +.--+++|+..+..||..|-||| .+| T Consensus 301 ~e~~~~lr~~~~----~~~~G~-~nag~~p~vl~------~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp-~~l 368 (576) T protein:vir:96 301 QRALENFKREWK----SSFSGI-NGSWQVPVVMA------DDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDP-AEI 368 (576) T ss_pred HHHHHHHHHHHH----HHhccc-cccccceeecC------CCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCH-HHc Confidence 113444554443 222211 12233566653 234444444333 34458999999999999999999 777 Q ss_pred cccc-----------CcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCch Q lcl|NC_011057. 356 GLGS-----------QTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKS 424 (634) Q Consensus 356 Glgs-----------~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t 424 (634) |+.. +.|+.++-+....-++.-|.|.+..|+++|++.+|.. 
.| .+|.|+|+-.++...-+.- T Consensus 369 G~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~----~~---~~~~~~f~r~d~~~~~e~~ 441 (576) T protein:vir:96 369 GFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISE----YS---DKYVFQFVGGDTKSELDKI 441 (576) T ss_pred cccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh----cc---CceEEEeccCCHHHHHHHH Confidence 8743 4478888999999999999999999999999988854 22 4688888765543321111 Q ss_pred HHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhh--------------hhhhhcc Q lcl|NC_011057. 425 DEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLI--------------AGVLQQI 490 (634) Q Consensus 425 ~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll--------------~p~~q~~ 490 (634) .......+|.+|.+++|+++||.--.|=|. . + .|.-.+.+.-.. ++..+.. T Consensus 442 -~~~~~~~~G~lT~NE~R~~~gl~piegGD~-----~-------~--~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ 506 (576) T protein:vir:96 442 -KILQEEVKTYKTVNEARKEKGLKPIEGGDV-----L-------L--DGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFL 506 (576) T ss_pred -HHHHHHhcCccCHHHHHHHhCCCCCCCcce-----e-------c--cccccccccccccCCCCCCcccccccccccccc Confidence 111245579999999999999964332220 0 0 000011000000 0000000 Q ss_pred cCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCcc--HHHHHHHHHHHHHHHhhHHhhcC-ChhHHHHh Q lcl|NC_011057. 491 EFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGL--ESGIVDLMVDRALELVGKRRRGR-DRETLARL 567 (634) Q Consensus 491 ~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~--~~a~vdllv~rALelAGkR~Rt~-~R~~~arl 567 (634) .-+.|.+..++..++..+.....+++..+++.--|..........+ -...+..+.. |.+ +-++. T Consensus 507 ~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~--- 573 (576) T protein:vir:96 507 NSPDDEEPQQESTEDKVDGRESNDPTKIDSPVGTDGQLKDQDNVKSQEGSNKGQGTKG----------KGNEKPSDF--- 573 (576) T ss_pred CCCCCCCCCCCCCCCcccccccccCCCCCCccccccccCCCCcccccccccccccccc----------cCCCCcccc--- Confidence 0011111111111111111111112222222211111111111111 0111111111 111 00000 Q ss_pred hCC Q lcl|NC_011057. 
568 SGV 570 (634) Q Consensus 568 r~i 570 (634) ..- T Consensus 574 ~~~ 576 (576) T protein:vir:96 574 KNN 576 (576) T ss_pred cCC Confidence 000 No 60 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=99.50 E-value=5.3e-14 Score=93.36 Aligned_cols=396 Identities=10% Similarity=0.081 Sum_probs=217.3 Q ss_pred eeEeccCCCCccchhhhhhhhccCCchhhhhhhhcc-cCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeEEEeeec Q lcl|NC_011057. 8 RLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTG-ISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRLVASELD 86 (634) Q Consensus 8 r~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~-~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD 86 (634) =+.|+++...... ++ .+|+ ....+... .+...-+...| + ...-++-.+.-+++.+|++.+..=+-+ T Consensus 1 m~f~~~~~~~~~~-~~-------~~~~-~~~~~~g~~~~~~~v~~~~a---l-~~~~v~~~i~~ia~~ia~lp~~~~~~~ 67 (409) T protein:vir:10 1 MLFRKGFKNQSQE-IS-------IDDK-KILEWLGINPSETYVNGKSC---L-KQATVFGCIRILSDNISKLPIKIYQKK 67 (409) T ss_pred CcccccccCcCCC-CC-------CChH-HHHHHhcCCcCcceechhhh---h-ccHHHHHHHHHHHHhhhhCceEEEEec Confidence 2345555443221 11 1111 11111000 00000011112 1 123355566678999999877553322 Q ss_pred ccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhceec Q lcl|NC_011057. 87 ENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYAV 166 (634) Q Consensus 87 ~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~v 166 (634) | |... .. ...+..++..-..--+...++++.++.+|-+-|+.|+.+ .|... |. 
...++.| T Consensus 68 -~-----~~~~--~~-~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i-~r~~~-------G~---~~~L~~i 127 (409) T protein:vir:10 68 -D-----GIKR--VP-DHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVAL-DFKKN-------GE---IKGLYPL 127 (409) T ss_pred -C-----Ceee--cc-CchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEE-EEcCC-------Cc---EEEEEEE Confidence 3 1111 11 134455556667778888999999999999999999986 34332 21 1235555 Q ss_pred cHHHHhccCC-------Cccee--eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHH Q lcl|NC_011057. 167 SKEEIKKSNK-------GSGTN--IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANA 237 (634) Q Consensus 167 t~~Ei~~~~~-------~~~~~--i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na 237 (634) ....+..... ...+. +....|...+|.. .| +|++=++++. ...--||+.++.+.+....-+.+...+. T Consensus 128 ~~~~V~v~~~~~~~~~~~~~~~y~~~~~~g~~~~~~~-~e-vih~r~~~~d-~~~G~s~i~~~~~~i~~~~~~~~~~~~~ 204 (409) T protein:vir:10 128 KSDGMKIFVDDTGLLNSENNVWYLYTDDLGQRHKFMS-DE-ILHFKGLTAD-GLAGLSVIELLNHLIENGKSSETYLNNF 204 (409) T ss_pred cCCceEEEEcCCccccccceEEEEEEeCCceeEEecc-cc-EEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHH Confidence 5444432111 11222 2334566555544 23 3333233332 2345688777777766655555555555 Q ss_pred HHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhc Q lcl|NC_011057. 
238 SKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIK 317 (634) Q Consensus 238 ~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~ 317 (634) .+.-..-.|||-+|+.++- ...+.+++.+-+.- ...++ +--++|+ + -.- T Consensus 205 f~ng~~~~gil~~~~~l~~---------------------e~~~~~~~~~~~~~-~g~~n-----~~~~~vl--~--~g~ 253 (409) T protein:vir:10 205 FKNGLQVKGLVQYAGDLNP---------------------EAEEVFKENFERMS-SGLKN-----AHRIAML--P--IGY 253 (409) T ss_pred HhccCCCcEEEEcCCCCCH---------------------HHHHHHHHHHHHHh-ccccc-----cCCceec--C--CCc Confidence 5555555677766654431 13445555554211 11111 2223333 2 223 Q ss_pred ccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 318 DVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVT 397 (634) Q Consensus 318 ~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~ 397 (634) +++-|.+... +.--+++|+.....||..|.||| .+||...++|..++.+....-++..|.|.++.|+++|++.+|-+. T Consensus 254 ~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~kL~~~~ 331 (409) T protein:vir:10 254 KFEPISQKLV-DAQFLENSQLTIRQIASVFGVKM-HQLNDLDRATHSNITEQNREFYIDTLQSILNMYELEINYKLFLIS 331 (409) T ss_pred eEEEccCChh-hHHHHHHHHHHHHHHHHHhCCCH-HHcCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCch Confidence 5666665432 22337899999999999999999 677876678999999999999999999999999999998876442 Q ss_pred HHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcc Q lcl|NC_011057. 398 LAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPT 474 (634) Q Consensus 398 L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~ 474 (634) - ....|-|-||.+.|.. 
+|..+.+ ..++..|++|.+++|+++|++.-.|-|.- + +..| T Consensus 332 ~-----~~~~~~~~fd~~~ll~-~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~~----~-------~~~n-- 392 (409) T protein:vir:10 332 E-----IKNGFYSKFNVDTILR-ADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDVL----L-------INGN-- 392 (409) T ss_pred h-----ccCCcEEEEechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCee----e-------eccC-- Confidence 2 1245678899999833 4443333 56889999999999999999653332200 0 0000 Q ss_pred cchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCC Q lcl|NC_011057. 475 LIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHE 519 (634) Q Consensus 475 Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~e 519 (634) +.|+ ++.+ +....+|++ T Consensus 393 ----~~~~--------------------~~~~----~~~~kgGe~ 409 (409) T protein:vir:10 393 ----MIPV--------------------KMAG----EQYSKGGEK 409 (409) T ss_pred ----ccch--------------------hhcc----ccccccCCC Confidence 0010 0000 011122333 No 61 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.50 E-value=3.9e-14 Score=94.10 Aligned_cols=475 Identities=15% Similarity=0.091 Sum_probs=219.8 Q ss_pred CCCCCcceeEeccCCCCccchhh-----------------------hhhhhccCCchhhhhhhhcccCccccccH----- Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRA-----------------------LTAASQPLPDPSQVFSKSTGISRNSDWQT----- 52 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ra-----------------------l~aAs~~itdp~~~~~~~~~~~~~~~WQ~----- 52 (634) |+--.++|.+-+++....+--+. ..|=+.++ .++..+... 
-..+..+.+ T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~-~~~~~~~~~--~~~r~~~~~~~~l~ 81 (551) T protein:vir:80 5 LGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPV-IGSMSANPG--FKTKPSIRNNQDLH 81 (551) T ss_pred hhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeeccc-ccceecCcc--cccCccccChhHHH Confidence 77777777777776655444332 11112222 122222111 011111110 Q ss_pred HHHHHHhhhhhHHHHHhhhhhceeeeeEEEeeecccCCCCC--------CC-CCCCCcccHHHHHHHHhhc----CCcch Q lcl|NC_011057. 53 DAWEAVDLVGELRYYVGWRASSCSRCRLVASELDENTGLPT--------GG-ISEDNTEGERVREIVSKIA----DGTLG 119 (634) Q Consensus 53 eAW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~Dtg~pt--------G~-i~ed~~~g~r~~~iv~~ia----gG~lG 119 (634) .-=+.|-.-+-|+=.|.=+++.++.+=..+ .++.+ |.+. .. +..+...-..+.++.+... ..+.. T Consensus 82 ~~~~~~~~npiv~~~I~~ia~~IA~~~~~~-~~~~~-g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s 159 (551) T protein:vir:80 82 GVLKKFGGNIILNAIINTRSNQVSMYCKPA-RHSEK-GVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDS 159 (551) T ss_pred HHHHHhhcCHHHHHHHHHHHHHHhhhhhhh-hhhcC-CCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccch Confidence 001112223555555666666666432222 11211 1110 01 1111111123334333322 12235 Q ss_pred HHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhceeccHHHHhccCCCcc------ee-eEeCCCC-ccc Q lcl|NC_011057. 120 QAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSG------TN-IVLPTGE-EHE 191 (634) Q Consensus 120 QaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~------~~-i~lP~g~-~h~ 191 (634) ..++++.++.+|-+-|.+|+.+ .|... |. ...++.|...-|+.....+| .. +..-+|. ... 
T Consensus 160 ~~~f~~~lv~dlll~Gnay~~i-~rd~~-------G~---~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~ 228 (551) T protein:vir:80 160 FSSFVKKIVRDTYMYDQVNFEK-VFNRN-------QS---MVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVAT 228 (551) T ss_pred HHHHHHHHHHHHHhcCCEEEEE-EECCC-------Cc---EEEEEEeCCceeEEEECCccccccCceEEEEEeCCcEEEE Confidence 6789999999999999998875 34332 32 23466666655543222222 11 2222333 333 Q ss_pred ccCCCCeE-EEeeC-CCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCC Q lcl|NC_011057. 192 FVKGTDII-FRVWI-PKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGE 269 (634) Q Consensus 192 ~~~~~D~~-~RvW~-P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~ 269 (634) | ...|++ ||.|. +++.....--||+.++++.+.-..-..+...+..+.-..-.|||.+|....+ T Consensus 229 ~-~~~eiiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~l------------- 294 (551) T protein:vir:80 229 F-NAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQ------------- 294 (551) T ss_pred E-cccceEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCC------------- Confidence 3 334544 44442 2344444466888887777665555555554444444445566766543222 Q ss_pred CCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCc-hhHHHHHHHHHHHHHHhhhcc Q lcl|NC_011057. 270 EIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNE-ITEVAIKTRNDAIARLAMGLD 348 (634) Q Consensus 270 ~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d-~te~aiktR~daI~rlA~~~D 348 (634) ..-+.+.|++.+- ..+.=. .-+--+||+.. +.++...+.-. .+.--+++|+..++.||..|. 
T Consensus 295 ------t~e~~~~lk~~~~----~~~~G~-~nag~~~vl~~------~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFg 357 (551) T protein:vir:80 295 ------SQHALEIFKREWK----NSLSGI-NGSWQIPVVSA------EDVKFVNMTPSARDMEFEKWLNYLINVISALYG 357 (551) T ss_pred ------CHHHHHHHHHHHH----HHhcCc-cccCccccccC------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhc Confidence 1123445555443 222111 12334566532 12344444322 234458899999999999999 Q ss_pred CChHHhhcccc----------CcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccc Q lcl|NC_011057. 349 VSPERLLGLGS----------QTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLT 418 (634) Q Consensus 349 ~~pE~LLGlgs----------~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~ 418 (634) ||| .+||+.+ +.|++++.+....-++..|.|.+..|+++|++.++.. + | ..|.|.||..++. T Consensus 358 VPp-~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L~~~-~---~---~~~~f~f~~~~~~ 429 (551) T protein:vir:80 358 IDP-AEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAE-F---G---DKYTFQFVGGDIK 429 (551) T ss_pred CCH-HHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc-c---C---CceEEEeeccChh Confidence 999 8888743 3588889999998999999999999999999988753 2 3 4688888865543 Q ss_pred cCCCchHHHHHHHHccCCCHHHHHHHhCCCc-cccCCCCCHHHHHHHHHHHhhcCcccchh-hhhhhhhh--------hh Q lcl|NC_011057. 419 IDPDKSDEAKFAYENGAINGEALRKYLGLGD-DAGYDFTTREGWVMWAQDAVSKDPTLIPM-LAPLIAGV--------LQ 488 (634) Q Consensus 419 ~~pd~t~eA~~~~~~G~It~ealr~~~Gl~e-d~~yd~~t~Eg~r~wA~d~v~~dp~Li~~-laPll~p~--------~q 488 (634) ..-+ ..+...++.+|++|.+++|+++||.- ..|-|. + +.|. +.|+.++. 
.| T Consensus 430 ~~~~-~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~-------------~-----~~~~~~~~~~~~~~~~~~~~~~~ 490 (551) T protein:vir:80 430 SELE-SVKILAEKAKVAMTVNEVRKELNLPGDVIGGDI-------------P-----LNGVIVQRIGQLMQQEQFEHEKQ 490 (551) T ss_pred hHHH-HHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCce-------------e-----ecccccccccccccccCcchhhh Confidence 3211 12234577889999999999999953 222220 0 1110 01111111 11 Q ss_pred cccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCccHHHHHHHHHHHHHHHhhHHhhcCChhHHHHhh Q lcl|NC_011057. 489 QIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGLESGIVDLMVDRALELVGKRRRGRDRETLARLS 568 (634) Q Consensus 489 ~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~~~a~vdllv~rALelAGkR~Rt~~R~~~arlr 568 (634) .-.+.++..++.+..+.+.++.+...+... +.+.| ........ ..|.+..++-... ++ T Consensus 491 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~--~~~~~----~~~~~~~~--------~~~~~~~~~~~~~---~~----- 548 (551) T protein:vir:80 491 QSNLQMLQEQTGNRVSTDVEDIPDGKDTTG--DIGKD----GQRKDKDN--------ANAGKQGMKGDKP---ND----- 548 (551) T ss_pred hhccccccCcCCCCCCCCCCCCCCccccCC--Ccccc----ccccCccc--------cchhhhhcCCCCc---cc----- Confidence 111221111111111110010000000000 00000 00000000 0000111110000 00 Q ss_pred CCChHHhhh Q lcl|NC_011057. 569 GVRERDYHR 577 (634) Q Consensus 569 ~ip~h~~h~ 577 (634) ..+ T Consensus 549 ------~~~ 551 (551) T protein:vir:80 549 ------WQT 551 (551) T ss_pred ------cCC Confidence 000 No 62 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=99.48 E-value=6.1e-14 Score=93.03 Aligned_cols=403 Identities=14% Similarity=0.113 Sum_probs=207.4 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 
1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |. .=.|+-|=|.+- -...+.-.+..+.|+..-+.++..+ -... .+-.++-+.-.+.-+++++|.+.+ T Consensus 1 ~~---~~~~~~~~k~~~--~~~~~~~~~~~~~~~~~~~~~~~~~----v~~~----~a~~~~~V~~ci~~ia~~ia~lp~ 67 (409) T protein:vir:96 1 MA---KENIVTRIKKKL--IDNWIDQSASKLYDFSPWKNKSFWG----VINN----TLETNETIFSAITKLSNSMASLPL 67 (409) T ss_pred Cc---cccchhhhhhHH--hhhhhccccccccccccccCccccc----cchh----hHhhhHHHHHHHHHHHHhhhhCce Confidence 32 222332322210 0001111111222221111111000 0111 122445566667889999999887 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+=. ... ++ .+..+...=+.-.+...++++.++.+|-+-|+.|+.+ .|.. +|. . T Consensus 68 ~~~~~~--------~~~----~~-~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i-~r~~-------~G~---~ 123 (409) T protein:vir:96 68 KMYEDY--------KVV----NT-EVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLI-ERDI-------YHQ---P 123 (409) T ss_pred EEeecc--------ccc----ch-hHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEE-EECC-------CCc---E Confidence 665421 111 11 2334444446667889999999999999999999886 3422 232 2 Q ss_pred hhceeccHHHHhccCCC--ccee--eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKG--SGTN--IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIAN 236 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~--~~~~--i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 236 (634) ...+.+..+.+...... ...- +....|....|.. .| +|++=.+++.....--||+..+...+.-...+.+. 
+ T Consensus 124 ~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~-~e-vih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~--~ 199 (409) T protein:vir:96 124 SKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHN-MD-MLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF--N 199 (409) T ss_pred EEEEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEEcc-cc-EEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHH--H Confidence 34666666655433222 2222 3344566665543 33 45553445555555678876665444322222221 1 Q ss_pred HHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_011057. 237 ASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQI 316 (634) Q Consensus 237 a~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi 316 (634) ..+-...+.+|+-.|+.++ ....+.+.+-+-+ .+++.+. ++++ + .. T Consensus 200 ~~~~~~~~~~i~~~~~~l~---------------------~e~~~~~~~~~~~----~~~n~g~-----~~vl--~--~g 245 (409) T protein:vir:96 200 LTEMQKPDSFMLKYGSNVS---------------------TEKRQQVLEDFKQ----YYEENGG-----ILFQ--E--PG 245 (409) T ss_pred HHhcCCCceeEEecCCCCC---------------------HHHHHHHHHHHHH----HhhcCCC-----eeec--C--CC Confidence 1111111222332333222 1234455555442 2322221 2222 2 34 Q ss_pred cccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_011057. 317 KDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRV 396 (634) Q Consensus 317 ~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~ 396 (634) -+++.|.+.. 
.+.--+++|+..+..||..|.||| .|||...++|..++-+....=++..|.|.++.|.++|++.+|-+ T Consensus 246 ~~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~ 323 (409) T protein:vir:96 246 VEIEPLPKKY-VSEDIVASENLTRERVANVFQLPS-IFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTK 323 (409) T ss_pred ceEEEcCCCh-hHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Confidence 4556565432 233347899999999999999999 67787778899999999999999999999999999999988754 Q ss_pred HHHhcCCChhHheeeecCccccc-CC-CchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcc Q lcl|NC_011057. 397 TLAREGIDPSKYVVWYDASQLTI-DP-DKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPT 474 (634) Q Consensus 397 ~L~~eG~d~~~yV~w~DaS~L~~-~p-d~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~ 474 (634) .+.+ ..|.|-||.+.|.. |. ++.+....+++.|++|.+++|+++|+..-.+-| ..--... T Consensus 324 ----~~~~-~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD-------------~~~~~~n 385 (409) T protein:vir:96 324 ----TDRE-KNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD-------------KPLISGD 385 (409) T ss_pred ----cccc-CcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcc-------------eeeeccc Confidence 2222 24667899998853 22 122223458899999999999999996432222 0000000 Q ss_pred cchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCC Q lcl|NC_011057. 475 LIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTED 524 (634) Q Consensus 475 Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~ 524 (634) + .|+ +-+.-......+|++... |. 
T Consensus 386 ~----~~~--------~~~~~~~~~~~gG~~n~~--------------e~ 409 (409) T protein:vir:96 386 L----YPI--------DTPLELRKSLKGGDKNVN--------------ES 409 (409) T ss_pred c----ccc--------ccchhhcccccCCCCCcC--------------CC Confidence 1 111 111001111111111111 11 No 63 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=99.47 E-value=1.9e-13 Score=90.33 Aligned_cols=385 Identities=12% Similarity=0.051 Sum_probs=192.3 Q ss_pred CCCCCcceeEeccCCCCccchhhhh---hhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALT---AASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSR 77 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~---aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr 77 (634) |.==+-+| +.++..+..+..+. +..... +.-.-+... .| ..+-++-.+.-+++.||. T Consensus 1 MGl~~~~~---~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~vt~~~---al-~~~~v~~~i~~Ia~~iA~ 60 (394) T protein:vir:62 1 MGLRDRFS---NYLFKKAEKRGYLDNVLGKSIRY-------------SGVYVTDSN---IL-QSSDVYELLQDISNQMVL 60 (394) T ss_pred Cchhhhhh---hhccCCCCchhhhhhhhhccccc-------------CccccChhh---hh-ccHHHHHHHHHHHHhhcc Confidence 43222221 11121111211111 111000 000001111 13 235567778889999999 Q ss_pred eeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCccccc Q lcl|NC_011057. 78 CRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSV 157 (634) Q Consensus 78 ~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~ 157 (634) +.+..- +.| |+ .+ . +..+..+... +.--+...++++.++.+|.+-|++|+.+- |. 
+-+ T Consensus 61 lp~~v~--~~~-g~---~~-~----~~~~~~Ll~~-PN~~~t~~~f~~~~~~~lll~Gn~~~~i~-~~-------~~~-- 118 (394) T protein:vir:62 61 ADIVVE--DEF-GN---EI-K----DDIALQILRN-PNNYLTQSEFIKLMTNTYLLEGETFPILN-GA-------QIH-- 118 (394) T ss_pred cceEEE--cCC-Cc---cc-c----hhhHHHHhcc-CCCCCCHHHHHHHHHHHHHhcCCeEEEEe-cc-------eee-- Confidence 988773 433 32 12 1 2334445443 66778888999999999999999999872 21 111 Q ss_pred ccchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHH Q lcl|NC_011057. 158 RTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANA 237 (634) Q Consensus 158 ~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na 237 (634) .|-.++ ++.... +. +..-.+ .++|.. |-+|++=.+.+ ....--||+..+.. .+.+....... T Consensus 119 ----~~~~~~---~~~~~~--~~-~~~~~~-~~~~~~--~eiih~r~~~~-d~~~G~s~~~~~~~----~i~~~~~~~~~ 180 (394) T protein:vir:62 119 ----LASNVF---TELDDN--LV-EHFNIG-GHEIPP--CMIRHVKNIGA-DHLRGKGILDLGRD----TLEGVMSAEKT 180 (394) T ss_pred ----ccccce---EEECCc--eE-EEEeeC-CEEech--hheEEecCcCC-CCccccChHHHHHH----HHHHHHHHHHH Confidence 111111 111111 11 112222 234432 33444422322 22334566555444 44444433333 Q ss_pred HHhHhhhC-----ceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_011057. 238 SKSRLIGN-----GVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVP 312 (634) Q Consensus 238 ~~SRL~gn-----GvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP 312 (634) .+ +++.| |||-+|..++.- ..+.+.+.+.+. ..+.-.+. +-=++|+.. 
T Consensus 181 ~~-~~~~ng~~~~~il~~~~~~~~~-------------------~~~~~~~~~~~~----~~~~g~~n--~g~~~vl~~- 233 (394) T protein:vir:62 181 LT-DKYKKGGLLTFLLNLDAHINPQ-------------------NGAQSKLINAIL----DQLESIDE--ARSVKMIPL- 233 (394) T ss_pred HH-HHHHccCCcceEEEeCCCCCcC-------------------HHHHHHHHHHHH----HHhccccc--cCceeEeeC- Confidence 32 33333 356555543320 112334444443 22222111 222233332 Q ss_pred hHHhcccceeecCCchhH-HHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHH Q lcl|NC_011057. 313 GEQIKDVKHIRFDNEITE-VAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTD 391 (634) Q Consensus 313 ~Ehi~~ikHl~f~~d~te-~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~ 391 (634) ..+++-..+.....+ --+++|+.....||..|.||| .+||-.+. .++-+....-++..|.|.+..|+++|++ T Consensus 234 ---g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~---sn~e~~~~~~~~~~l~P~~~~ie~~l~~ 306 (394) T protein:vir:62 234 ---GKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINV-DTYTELIK---EDIEKAMMYIHNKAVRPIMKNFEDHLSL 306 (394) T ss_pred ---CCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCH-HHcCCCCC---cCHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 234555555554433 367899999999999999999 66774333 3466777778899999999999999999 Q ss_pred HHHHHHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhc Q lcl|NC_011057. 
392 QILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSK 471 (634) Q Consensus 392 ~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~ 471 (634) .+|.+ .+| .+|.|.||...+..--.+.+.+-.++..|++|.+++|+++|++.-.+ ++|=..| +.+ T Consensus 307 kll~~---~~~---~~~~~~fd~~~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~-----~~gd~~~----~~~ 371 (394) T protein:vir:62 307 LFYAQ---NSG---KRIKFKINILDFVTYSNKTNIGYNLVRTAITSPDNVADMLGFPKQNT-----KESQAIY----ISN 371 (394) T ss_pred hhcCc---ccc---CceEEEechhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC-----CCCCeee----ccc Confidence 98866 233 46889999999854323444445588889999999999999964211 1110000 000 Q ss_pred CcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCC Q lcl|NC_011057. 472 DPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDT 522 (634) Q Consensus 472 dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDT 522 (634) + +.|+ +-. +...+. ..+|++.+- T Consensus 372 n------~~~~--------~~~----------~~~~~~----~kgge~~en 394 (394) T protein:vir:62 372 D------VTEI--------GKK----------EATDGS----LGGGEENEN 394 (394) T ss_pred c------cccc--------ccc----------cccccc----CCCCCCCCC Confidence 0 0111 000 000011 111111110 No 64 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=99.46 E-value=4.2e-14 Score=93.90 Aligned_cols=374 Identities=13% Similarity=0.129 Sum_probs=201.8 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.==..+ .+.++..... ..+. 
++ ..+ ...+++....+ .-...++ ..+-++-.+.-++++||++.+ T Consensus 1 Mg~~~~~--~~~~~~~~~~-~~~~--------~~-~~~-~~~~~~~~~~~-v~~~~al-~~~~v~~~i~~ia~~ia~~p~ 65 (385) T protein:vir:10 1 MGLLTPR--NFNKRKAKNM-VYPS--------NP-AFF-TTTVGGMQLSY-VSALSAL-QNTNVYSVINRIASDVASAHF 65 (385) T ss_pred Cccccch--hccccccccc-cccc--------ch-hhh-hhhccccCccc-cCHHHhh-ccHHHHHHHHHHHHHHhhCce Confidence 5422221 1111111110 0000 01 001 11111110000 0111222 335566778889999999877 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) -.-. . ....+.+ =+.--+-..++++.++.+|.+-|++|+.+. |. +- T Consensus 66 ~v~~---~----------------~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~-r~-------~~------ 111 (385) T protein:vir:10 66 KTEN---T----------------ATLNRLE-SPSSLIGRFSFWQGALMQLCLSGNDYIPLV-GQ-------NL------ 111 (385) T ss_pred eeec---c----------------chhhhhh-cCCCCCCHHHHHHHHHHHhhhcCCeEEEEE-cC-------ce------ Confidence 5421 1 0111222 133446778999999999999999999874 31 11 Q ss_pred hhceeccHHHHhccCCCccee--eEeCC-CCcccccCCCC-eEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTN--IVLPT-GEEHEFVKGTD-IIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIAN 236 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~--i~lP~-g~~h~~~~~~D-~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 236 (634) ..+.+...-|+....+++.. +...+ |...+|. 
..| +-||..+|.......--||+..|...+.-.....+...+ T Consensus 112 -~~~p~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~-~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 189 (385) T protein:vir:10 112 -EHIPNSDVQINYLPGNMGIVYTVLESNDRPQMVLR-QDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMS 189 (385) T ss_pred -eEeecCCceEEEEEcCCceEEEEEEcCCceEEEEc-cccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 12333333333322233333 22233 3344443 344 446667766655556679999998888777777777777 Q ss_pred HHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_011057. 237 ASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQI 316 (634) Q Consensus 237 a~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi 316 (634) ..+.-..-.|||-+|..++=+ -..+.+++.+-+ .+.-.++ --|+++ + .. T Consensus 190 ~~~ng~~~~gil~~~~~~~~~--------------------e~~~~~~~~~~~----~~~~~n~---~~~~vl--~--~g 238 (385) T protein:vir:10 190 AMENQINPAGKLTISNYLSDG--------------------KDLESAREEFEK----ANTGDNS---GRLMVL--P--DG 238 (385) T ss_pred HHhccCCcceEEEeCCCCCCH--------------------HHHHHHHHHHHH----HhCcccc---CCcccc--C--CC Confidence 777777777888887654321 134444444432 2211111 122232 2 23 Q ss_pred cccceeecCCchhHHH--HHHHHHHHHHHhhhccCChHHhhccc--cCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHH Q lcl|NC_011057. 317 KDVKHIRFDNEITEVA--IKTRNDAIARLAMGLDVSPERLLGLG--SQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQ 392 (634) Q Consensus 317 ~~ikHl~f~~d~te~a--iktR~daI~rlA~~~D~~pE~LLGlg--s~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~ 392 (634) .+++.|... ..+.. .++|+..+..||..|.||| .+||.+ ++.++.+.-+... .+...|.|.++.|+++|++. 
T Consensus 239 ~~~~~l~~~--~~d~~~l~e~~~~~~~~Ia~~fgVp~-~~lg~~~~~~~~~sn~eq~~~-~~~~~l~P~~~~ie~~l~~~ 314 (385) T protein:vir:10 239 FDYTQLEMK--TDVFKALADNSAYSADQISKAFGVPS-DILGGGTSTESQHSNIDQIKA-TYLANLNSYVNPIVDELRLK 314 (385) T ss_pred ceEEecCCC--hhHHHHHHHHHHHHHHHHHHHhCCCH-HHcCCccCCCcccccHHHHHH-HHHHHHHHHHHHHHHHHHHh Confidence 455555543 33333 4899999999999999999 777753 3556677666544 45557999999999999988 Q ss_pred HHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHh Q lcl|NC_011057. 393 ILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAV 469 (634) Q Consensus 393 ~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v 469 (634) +|.+ -|.||.+.|. ++|..+.+ ..+++.|++|.+++|.++|+.- ++.. T Consensus 315 l~~~------------~~~f~~~~ll-~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p---~p~~------------- 365 (385) T protein:vir:10 315 MNAP------------DLELDIKDML-DVDDSALINQVSNLAKSGVLGAEQAQFILTRSG---FLPD------------- 365 (385) T ss_pred hCCc------------eEEeechhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCc---cCCC------------- Confidence 7532 2678888874 33444444 3388899999999999998731 1100 Q ss_pred hcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCc Q lcl|NC_011057. 470 SKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTS 508 (634) Q Consensus 470 ~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~ 508 (634) ....+..|... ..+|++++. 
T Consensus 366 ------------------~~~~~~~~~~~-~~~g~~~dn 385 (385) T protein:vir:10 366 ------------------NLPEFKPLTTQ-VKGGDEGDN 385 (385) T ss_pred ------------------CCccccCcccc-cCCCCCCCC Confidence 00111112222 222332222 No 65 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=99.45 E-value=2.5e-12 Score=84.19 Aligned_cols=481 Identities=13% Similarity=0.081 Sum_probs=213.9 Q ss_pred CCCCCc--------ceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhh Q lcl|NC_011057. 1 MAATQS--------LRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRA 72 (634) Q Consensus 1 ~~a~~~--------lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~ 72 (634) |.-..+ ..++..|.-.....-... ...-.+.+|.. +++......++..-..|.+++-. -+-=++..++ T Consensus 41 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~l~~~~~~~iv~~~i~~~~~--~V~~~~~~i~ 116 (574) T protein:vir:80 41 PYSMESIEKGMNGKTTAYMQPIIGEMSVNPGY-KTKPSIRNSQD-LHKTLKKFGNNIILNAIINTRSN--QVSMYCKPAR 116 (574) T ss_pred CCCHHHHHHhHhhhcccccchhhhhccccccc-cCcCccCCccc-HHHHHHhhccChhHHHHHHHHHH--HHHHHHHHHH Confidence 000000 001111110000000000 00000001111 11111112233344444444321 1222344445 Q ss_pred hceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCC----cchHHHHHHHHHHhhccccceEEEEEEecCCC Q lcl|NC_011057. 73 SSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADG----TLGQAALTKRVVECLTVPGELWIVILTRPVKG 148 (634) Q Consensus 73 ~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG----~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~ 148 (634) .+.+.+-+.+-.-|.| +.+++.. ..+..++..++..-.-. 
.....++++.++.++-+-|..|+.++ |..+ T Consensus 117 ~~ia~lp~~i~~kd~~-~~~~~~~---~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~-r~~~- 190 (574) T protein:vir:80 117 NSETGVGYEIRLKDIE-AEPTSHD---IANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKV-FDKD- 190 (574) T ss_pred hhhccCceEEEEeccC-CCccchh---hhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEE-ECCC- Confidence 5555554444333433 3333321 12223444444332221 13456899999999999999998764 4332 Q ss_pred CCCCcccccccchhceeccHHHHhccCCC-------cceeeEeCCCCcccccCCCCeEEEeeCCCcc--cccCCccchhh Q lcl|NC_011057. 149 APAQPDGSVRTRQEWYAVSKEEIKKSNKG-------SGTNIVLPTGEEHEFVKGTDIIFRVWIPKPR--KASEPDSPVRA 219 (634) Q Consensus 149 ~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~-------~~~~i~lP~g~~h~~~~~~D~~~RvW~P~pr--ra~eaDSPvra 219 (634) |. ....+.|....|...... +..-+...+|..-......|+++..-+|.+. .-..--||+.+ T Consensus 191 ------G~---~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~~~~~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~ 261 (574) T protein:vir:80 191 ------GN---FIKFDTVDPTTIFLATNGEGKLIKNGERFVQVIDNRIVAKFNERELAFAVRNPRADIEVGQYGYPELEI 261 (574) T ss_pred ------Cc---EEEEEEEcCceeEEEEcCccccccCceEEEEEeCCceEEEEccccEEEEeccCCCCcccccccccHHHH Confidence 21 233555555555432211 1223444555544444556666554555442 22234477776 Q ss_pred hhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcc Q lcl|NC_011057. 220 VLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDED 299 (634) Q Consensus 220 ~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~ 299 (634) +...+.=..-+.+...+..+.-..-.|||-++....+ ..-+...|++.+-.. 
-....+ T Consensus 262 a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~l-------------------s~e~~~~lk~~~~~~-~~G~~n-- 319 (574) T protein:vir:80 262 ALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQ-------------------SQQALDIFRREWRSS-LAGING-- 319 (574) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCC-------------------CHHHHHHHHHHHHHH-hccccc-- Confidence 6666655555555555555554555677777532111 112444555554311 111111 Q ss_pred ccccccceeEeechHHhcccceeecCCc-hhHHHHHHHHHHHHHHhhhccCChHHhhcccc----------CcchhhHhh Q lcl|NC_011057. 300 SQAAFIPVIAGVPGEQIKDVKHIRFDNE-ITEVAIKTRNDAIARLAMGLDVSPERLLGLGS----------QTNHWSAWQ 368 (634) Q Consensus 300 S~AA~vPiva~vP~Ehi~~ikHl~f~~d-~te~aiktR~daI~rlA~~~D~~pE~LLGlgs----------~~NhwtAw~ 368 (634) +--+||+.. +.++...|... .+.--+++|+..+..||..|.||| .+||+.+ +.|..++.+ T Consensus 320 --~g~~~vl~~------~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp-~~lG~~~~~t~~gs~~~~~n~sn~E~ 390 (574) T protein:vir:80 320 --SWQIPVVSA------EDVKFVNMTPSANDMQFEKWLNYLINVISALYGIDP-AEINFPNNGGATGSKGGSLNEGNSKE 390 (574) T ss_pred --cccceeecC------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCH-HHhcccccccccccccccccchhHHH Confidence 223455532 23444444433 334458999999999999999999 8888733 357788999 Q ss_pred hhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCC Q lcl|NC_011057. 369 ISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLG 448 (634) Q Consensus 369 i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ 448 (634) ....-++.-|.|.+..|+++|++.+|... + ..|.|.||..++...-+ ...+.....+|++|.+++|+++||. 
T Consensus 391 ~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~----~---~~~~~~f~~~d~~~~~~-~~~~~~~~~~G~lT~NE~R~~lgl~ 462 (574) T protein:vir:80 391 KMQASQNKGLQPLLRFIEDTVNTYIVAEF----G---EKYQFQFRGGDLSAQLD-KLKIIEQEGKVFRTVNEIRHDKGLE 462 (574) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhc----C---CceEEEecccchhhHHH-HHHHHHHHhCCccCHHHHHHHhCCC Confidence 99999999999999999999999988531 2 46889999888764321 1222346778999999999999997 Q ss_pred ccccCCCCCHHHHHHHHHHHhhcCcccchh-hhhhhhhhhhcccCCCCCCCCCCCCC-CCCccccCCCCCCCCC-CCCCC Q lcl|NC_011057. 449 DDAGYDFTTREGWVMWAQDAVSKDPTLIPM-LAPLIAGVLQQIEFPQQQQAIDSGGN-EDTSDDDNLDDGEHEP-DTEDD 525 (634) Q Consensus 449 ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~-laPll~p~~q~~~~P~p~~a~~~~~~-~~~~~d~~~~~~~~eP-DTe~d 525 (634) .-.|-|. -+.+. +.|+-.+. +.-....+......+.. .....+...+.. ++| +.+.| T Consensus 463 Pi~gGD~------------------~~~~~n~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~p~~~~~d 522 (574) T protein:vir:80 463 PIKGGDV------------------ILNGVHIQAIGQAL-QEEQLEYQRSQDRLNRLLELSGGDVEQPEP-EEPKDSQND 522 (574) T ss_pred CCCCCCE------------------eeeccceeeccccc-ccccCCccchhccccccccccCCCCCCCCC-CCCCCcccc Confidence 5443331 11111 12221111 00000000000000000 000000000000 011 00111 Q ss_pred CCC-cccCCC--c-cHHH--------HH-H-HHH----HHHHHHhhHHhhcCChh Q lcl|NC_011057. 526 QDD-DGTQKA--G-LESG--------IV-D-LMV----DRALELVGKRRRGRDRE 562 (634) Q Consensus 526 ~~~-~~~~~a--~-~~~a--------~v-d-llv----~rALelAGkR~Rt~~R~ 562 (634) ... ....+. . .+.. +. 
| ++= ..+-..-++-++ .+ T Consensus 523 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~ 574 (574) T protein:vir:80 523 TDVSFQDEQQGLNGKSKKVNGKVDDNVGKDGQLKSEENTNSTKHGTDGIK---KE 574 (574) T ss_pred ccchhhhhhhhhccchhhhcCCcccccccccccccccccccccccCcccc---CC Confidence 000 000000 0 0000 00 0 000 011111111111 11 No 66 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.44 E-value=2.9e-13 Score=89.30 Aligned_cols=479 Identities=14% Similarity=0.097 Sum_probs=212.7 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCch------------hhhhhhh------cccCccccccHHHHHHHhhh- Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDP------------SQVFSKS------TGISRNSDWQTDAWEAVDLV- 61 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp------------~~~~~~~------~~~~~~~~WQ~eAW~~yd~V- 61 (634) |---.+||-+-|+|......-++-.-.+.-+.+- .+..+.. ...+.+..|..+ .+|+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~--~~~~l~~ 78 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIR--NNQDLHG 78 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccC--ChhHHHH Confidence 7777777777676655444333311111111000 0000000 001111112222 133321 Q ss_pred --------hhHHHHHhhhhhceeeeeEEEeeeccc-CCC------CCCC-CCCCCcccHHHHHHHHhhc----CCcchHH Q lcl|NC_011057. 62 --------GELRYYVGWRASSCSRCRLVASELDEN-TGL------PTGG-ISEDNTEGERVREIVSKIA----DGTLGQA 121 (634) Q Consensus 62 --------gELryyvgWr~~s~Sr~rL~aseiD~D-tg~------ptG~-i~ed~~~g~r~~~iv~~ia----gG~lGQa 121 (634) +-|+=-+.=+++.++..= ...+++.+ -|. .... +.++......+.+..+... ..+.... 
T Consensus 79 l~~~~~~npiv~~~I~~~a~~ia~~~-~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~ 157 (547) T protein:vir:63 79 VLKKFGGNIILNAIINTRSNQVSMYC-KPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFS 157 (547) T ss_pred HHHHhhcCHHHHHHHHHHHHHHhhhh-hhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHH Confidence 222222223333333210 11111111 000 0011 1111111123334333322 1223557 Q ss_pred HHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhceeccHHHHhccCCCcc------eee-EeCCCCcccccC Q lcl|NC_011057. 122 ALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSG------TNI-VLPTGEEHEFVK 194 (634) Q Consensus 122 qL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~------~~i-~lP~g~~h~~~~ 194 (634) ++++.++.++-+-|.+|+.+. |.. +|. .-.++.|....|+.....++ ..+ ..-+|..-.... T Consensus 158 ~f~~~lv~d~ll~Gn~~~~i~-rd~-------~G~---~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~ 226 (547) T protein:vir:63 158 SFVKKIVRDTYMYDQVNFEKV-FNR-------NQS---MVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFN 226 (547) T ss_pred HHHHHHHHHHHhhCCEEEEEE-ECC-------CCc---EEEEEEecCceeEEEECCccccccCceEEEEEcCCcEEEEec Confidence 899999999999999988754 432 232 23455665555543211111 122 222333222233 Q ss_pred CCCeEEEeeCCC--cccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCC Q lcl|NC_011057. 195 GTDIIFRVWIPK--PRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIA 272 (634) Q Consensus 195 ~~D~~~RvW~P~--prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~ 272 (634) ..|++.-..||. 
+.....--||+.++...+.-..-..+...+..+.-..-.|||.+|....+ T Consensus 227 ~~eiih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~l---------------- 290 (547) T protein:vir:63 227 AREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQ---------------- 290 (547) T ss_pred cccEEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCC---------------- Confidence 455543334443 33333456887777766665555555554444444445567777654332 Q ss_pred ccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCchh-HHHHHHHHHHHHHHhhhccCCh Q lcl|NC_011057. 273 PLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEIT-EVAIKTRNDAIARLAMGLDVSP 351 (634) Q Consensus 273 p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~t-e~aiktR~daI~rlA~~~D~~p 351 (634) ..-+.+.|++.+- ..+.-. .-+--+||+.. +.++...+.-... .--+++|+..++.||..|.||| T Consensus 291 ---s~e~~~~lk~~~~----~~~~G~-~nagk~~vl~~------~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP 356 (547) T protein:vir:63 291 ---SQHALEIFKREWK----NSLSGI-NGSWQIPVVSA------EDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDP 356 (547) T ss_pred ---CHHHHHHHHHHHH----HHhcCc-ccccccccccC------CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 1124445555443 222111 12333566532 2344444543333 3357899999999999999999 Q ss_pred HHhhccccC----------cchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCC Q lcl|NC_011057. 352 ERLLGLGSQ----------TNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDP 421 (634) Q Consensus 352 E~LLGlgs~----------~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~p 421 (634) .+||+.++ .|++++.+....-++..|.|.+..|+++|++.+|.. + | ..|.|.||..++... 
T Consensus 357 -~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L~~~-~---~---~~~~~~f~~~~~~~~- 427 (547) T protein:vir:63 357 -AEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAE-F---G---DKYTFQFVGGDIKSE- 427 (547) T ss_pred -HHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-c---C---CceEEEeeccccccH- Confidence 88887433 478888898888999999999999999999988753 2 3 357888886554332 Q ss_pred CchHHHHHHHHccCCCHHHHHHHhCCCc-cccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhh--------hhcccC Q lcl|NC_011057. 422 DKSDEAKFAYENGAINGEALRKYLGLGD-DAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGV--------LQQIEF 492 (634) Q Consensus 422 d~t~eA~~~~~~G~It~ealr~~~Gl~e-d~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~--------~q~~~~ 492 (634) ....+...++.+|.+|.+++|+++||.- ..|-|.- + +|.-+ .|+.+.. .|.-.+ T Consensus 428 ~~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~----~----------~~~~~---~~~~~~~~~~~~~~~~~~~~~ 490 (547) T protein:vir:63 428 LESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIP----L----------NGVIV---QRIGQLMQQEQFEHEKQQSNL 490 (547) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcee----e----------ccccc---ccccccccccCCccccchhhc Confidence 2222234577889999999999999943 2222210 0 11111 1111100 011111 Q ss_pred CCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCccHHHHHHHHHHHHHHHhhHHhhcCChhHHHH Q lcl|NC_011057. 493 PQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGLESGIVDLMVDRALELVGKRRRGRDRETLAR 566 (634) Q Consensus 493 P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~~~a~vdllv~rALelAGkR~Rt~~R~~~ar 566 (634) +++..++....+.++++.+...+.. .+.++|+.. ..... ..|.+..++-... 
++..- T Consensus 491 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~d~~~----~~~~~--------~~~~~~~~~~~~~---~~~~~ 547 (547) T protein:vir:63 491 QMLQEQTGNRVSTDVEDIPDGKDTT--GDIGKDGQR----KDKDN--------ANAGKQGMKGDKP---NDWQT 547 (547) T ss_pred cccccccCCCCCCCCCCCCCCcccC--CCcCccccc----cCccc--------cchhhhhcCCCCc---cccCC Confidence 1111111111111111111111100 010111000 00000 0000000000000 00000 No 67 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=99.42 E-value=2.6e-13 Score=89.55 Aligned_cols=375 Identities=12% Similarity=0.102 Sum_probs=205.4 Q ss_pred ceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeEEEeeec Q lcl|NC_011057. 7 LRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRLVASELD 86 (634) Q Consensus 7 lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD 86 (634) .++.++.|++..++.-.- .....++++.-..... +...-....++ .++-+.-.+.-++++||.+.+..-+-. T Consensus 1 M~~f~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~---~~~~v~~~~~~----~~~~v~~~i~~ia~~ia~~p~~~~~~~ 72 (386) T protein:vir:48 1 MPIFNITNLATESPPISQ-GGFFDITDPDFLSTLN---GSEWVSAESAL----RNSDLFSIINQLSNDLATVKLTASRKQ 72 (386) T ss_pred Cccccccccccccccccc-ccccccccchhccccc---CCceechhhhh----cchHHHHHHHHHHHhhccCceeeccch Confidence 667777666543322111 1111222322111111 11111111222 346677778889999999988654311 Q ss_pred ccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhceec Q lcl|NC_011057. 87 ENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYAV 166 (634) Q Consensus 87 ~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~v 166 (634) . . . ...-..--+...++++.++.+|-+-|+.|+.+ .|.. +|. 
...|+.| T Consensus 73 ~----------------~---~-l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i-~r~~-------~g~---~~~L~~l 121 (386) T protein:vir:48 73 L----------------Q---G-IIDNPSNNANRFNFYQSIFAQMLLGGEAFAYR-WRNE-------NGR---DMKWEYL 121 (386) T ss_pred h----------------H---H-HhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEE-EECC-------CCc---EEEEEEe Confidence 1 1 1 22334556778899999999999999998875 4432 232 2457777 Q ss_pred cHHHHhccCC--CcceeeE--eCC---CCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHH Q lcl|NC_011057. 167 SKEEIKKSNK--GSGTNIV--LPT---GEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASK 239 (634) Q Consensus 167 t~~Ei~~~~~--~~~~~i~--lP~---g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~ 239 (634) ..+.+..... +...-+. ..+ |...+|. ..| +|++=.+++.....--||+..+...+.-.....+...+..+ T Consensus 122 ~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~-~~e-vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ 199 (386) T protein:vir:48 122 RPSQVSFNRLDNKDGIYYNITFDDPRIPPKQHVP-QGD-VLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLK 199 (386) T ss_pred cCceeEEEEcCCCceEEEEEEecCccccceeEec-Ccc-EEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 6666653322 2222222 222 2222232 234 44554566665555678998888877776666666666656 Q ss_pred hHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_011057. 240 SRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDV 319 (634) Q Consensus 240 SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~i 319 (634) .-....|||-+|+.++- -..+.+.+...+ ...+ +--|+|+. 
..-++ T Consensus 200 ng~~~~~ii~~~~~~~~---------------------e~~~~~~~~~~~----~~~n-----~g~~~vl~----~g~~~ 245 (386) T protein:vir:48 200 NALNANGILKIKGGGLL---------------------DFKTKLSRSRQA----MKQM-----QGGPLVLD----DLEEF 245 (386) T ss_pred ccCCcceEEEeCCCCCH---------------------HHHHHHHHHHHH----hhcC-----CCCceecC----CCceE Confidence 55566677766654321 123333333322 1111 22234432 22344 Q ss_pred ceeecCCchhHH-HHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 320 KHIRFDNEITEV-AIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTL 398 (634) Q Consensus 320 kHl~f~~d~te~-aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L 398 (634) +.|.. ...+. -+++|+..+..||..|.||| .|||. ..+..++++...+-++..|.|.++.|++.|++.++... T Consensus 246 ~~l~~--~~~d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~--~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~- 319 (386) T protein:vir:48 246 TPLEI--KSNVSQLLKQADWTTGQFAKVYGIPE-NVVGG--QGDQQSSLEMSLDLYNKAVSRYLRPFLSELSQKLSCDV- 319 (386) T ss_pred EEcCC--ChhHHHHHHHHHHHHHHHHHHhCCCH-HHhCC--CCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchh- Confidence 44443 33333 48999999999999999999 78885 23345788888889999999999999999999887542 Q ss_pred HhcCCChhHheeeecCcccccCCCchH---HHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCccc Q lcl|NC_011057. 399 AREGIDPSKYVVWYDASQLTIDPDKSD---EAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTL 475 (634) Q Consensus 399 ~~eG~d~~~yV~w~DaS~L~~~pd~t~---eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~L 475 (634) ++|- ...+ ++|... ....+++.|.+|..+.|+.+|..- +. +.|. + T Consensus 320 ---~~~~---~~~~-------~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~---~~--~~~~-~------------- 367 (386) T protein:vir:48 320 ---DADI---LPAV-------DPTGSNSVSRINSMVKSGTLAQNQGLYILQQAE---IL--PKEL-P------------- 367 (386) T ss_pred ---hcch---hhhh-------ccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCC---CC--Cccc-h------------- Confidence 1111 1112 223222 234688999999999999998632 11 1110 0 Q ss_pred chhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccc Q lcl|NC_011057. 
476 IPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDD 510 (634) Q Consensus 476 i~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d 510 (634) ..+....+ | ..|++++.+| T Consensus 368 ----------~~~~~~~~-~-----~~gGd~~~~~ 386 (386) T protein:vir:48 368 ----------EGENPNKT-T-----LKGGEINGED 386 (386) T ss_pred ----------hhcCCCCC-c-----cCCCCCCCCC Confidence 00111111 1 1112221111 No 68 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=99.41 E-value=6e-13 Score=87.59 Aligned_cols=424 Identities=17% Similarity=0.086 Sum_probs=222.6 Q ss_pred ceeEeccCCCCccchh-hhhhhhccC--------------CchhhhhhhhcccCccccccHHH-----HHHHhhhhhHHH Q lcl|NC_011057. 7 LRLVRRPKGGRPAPSR-ALTAASQPL--------------PDPSQVFSKSTGISRNSDWQTDA-----WEAVDLVGELRY 66 (634) Q Consensus 7 lr~vrrp~g~~~a~~r-al~aAs~~i--------------tdp~~~~~~~~~~~~~~~WQ~eA-----W~~yd~VgELry 66 (634) ..+..|=++...++.+ ++.-.++.. ++|.... ...+....|-+.+ ++-|-.++-+.- T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~g~~~~~~~~~g~~v~~~~a~~~~~v~~ 77 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQ---TLAGPSTELAPDTFVGLATQAYQANGPVFA 77 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHH---hhccccccccCccccccchhhhhccHHHHH Confidence 3344443333322221 221111111 1111111 0111112232222 333445677788 Q ss_pred HHhhhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecC Q lcl|NC_011057. 67 YVGWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPV 146 (634) Q Consensus 67 yvgWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~ 146 (634) .|.-++++++++-|..-+-+ | |+..+ ..+..+..+.. 
=.---+-..++++.++.+|-+-|+.|+.| .|.+ T Consensus 78 ~i~~Ia~~ia~lp~~~~~~~-~-----~~~~~--~~~~~~~~L~~-~PN~~~t~~~f~~~l~~~lll~Gnay~~i-~r~~ 147 (466) T protein:vir:81 78 CMLVRQLVFSSVRFRWQRLR-D-----GKPSD--TFGSRDLQILE-TPWKGGTTQDMLSRMIQDADLAGNSYWTI-VDGE 147 (466) T ss_pred HHHHHHHhhccCceEEEEec-C-----Cceee--ccccHHHHHhh-CCCCCCCHHHHHHHHHHHHHhcCCeEEEE-EecC Confidence 88899999999988887765 3 22221 11222344443 35556778899999999999999999987 3433 Q ss_pred CCCCCCcccccccchhceeccHHHHhccCCCccee---e--EeC----CCCcccccCCCCeEEEe-eCCCcccccCCccc Q lcl|NC_011057. 147 KGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTN---I--VLP----TGEEHEFVKGTDIIFRV-WIPKPRKASEPDSP 216 (634) Q Consensus 147 ~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~---i--~lP----~g~~h~~~~~~D~~~Rv-W~P~prra~eaDSP 216 (634) .+. -.++-.. ....++.+..+-+......++.. + ... +++..+| +..| +||+ ..++|.....--|| T Consensus 148 ~g~-l~~~~~g-~~~~l~~l~~~~v~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~-~~~d-viHir~~~~~~d~~~G~s~ 223 (466) T protein:vir:81 148 FVR-MRPDWVD-VVVEERMVRGGRGELGGGQLGWRKVGYLYTEGGRQSGNESVGF-LAED-VVHFAPIPDPLASYRGMSW 223 (466) T ss_pred ccc-cccccCc-ceeEEEEecCcceEEEEcCCCceEEEEEEEecCcccccceeee-cccc-EEEEcCCCCcccccccccH Confidence 211 1111111 13467777776665433222211 1 111 1222222 2234 3444 34566676777888 Q ss_pred hhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhccc Q lcl|NC_011057. 217 VRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVE 296 (634) Q Consensus 217 vra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~ 296 (634) +..+.+.+.-.....+...+..+.-..-.|||-.|+.++- -..+++.+.+.+. +. 
T Consensus 224 i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~l~~---------------------e~~~~~~~~~~~~----~~ 278 (466) T protein:vir:81 224 LTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNPMADP---------------------AAVKKWADEVNSK----HA 278 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCH---------------------HHHHHHHHHHHHH----hc Confidence 8887777665555555555555555555667777665431 1345555555422 21 Q ss_pred CccccccccceeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccC---cchhhHhhhhhhh Q lcl|NC_011057. 297 DEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQ---TNHWSAWQISDED 373 (634) Q Consensus 297 De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~---~NhwtAw~i~de~ 373 (634) ++--+--|+|+ +..-+++.|.+.. .+.--+++|+..+..||..|.||| .+||+-.+ .+..++.++...= T Consensus 279 --g~~n~g~~~vl----~~g~~~~~l~~~~-~d~q~le~~~~~~~~Ia~~fgVPp-~~lG~~~~~~~st~sn~eq~~~~f 350 (466) T protein:vir:81 279 --GVDNAWKNLNL----YPGADADVVGSNL-QEIDFKNVRGGGETRIAAAAGVPP-VIVGLSEGLAAATYSNYGQARRRL 350 (466) T ss_pred --CccccccceEc----CCCceEEEccCCh-hHHHHHHHHHHHHHHHHHHhCCCH-HHcccccCCCccccccHHHHHHHH Confidence 11112223443 2345677776643 233448899999999999999999 88987433 4455678888888 Q ss_pred hhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH--------H-HHHHccCCCHHHHHHH Q lcl|NC_011057. 374 VQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA--------K-FAYENGAINGEALRKY 444 (634) Q Consensus 374 v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA--------~-~~~~~G~It~ealr~~ 444 (634) ++..|.|.++.|+++|++.++.+ .++ .+|-|=||...|....-++... + .+...| ||.+++|.. 
T Consensus 351 ~~~tl~P~~~~ie~~l~~~L~~~---~~~---~~~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g-~t~nE~r~~ 423 (466) T protein:vir:81 351 ADGTAHPLWQNLSGCIGHVMPDM---GPD---VRLWYDADDVPFLREDEKDAADIQKVRAETINTLITAG-YEPESVVAA 423 (466) T ss_pred HHHHHHHHHHHHHHHHHhhcCCc---ccC---cceEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcC-CChhhcccc Confidence 89999999999999999988764 222 2355667888877663333221 1 134556 588888876 Q ss_pred hCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCC Q lcl|NC_011057. 445 LGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEH 518 (634) Q Consensus 445 ~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ 518 (634) ...|+.--+. +.-...++ ..|++.+.. .+.+...+.+++++|. T Consensus 424 ~~~gd~~~~~--------------------------~~~~~~~~--~~~~~~~~~---~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 424 VNSGDLRLLK--------------------------HTGLTSVQ--LLPPGVSAS---ASSDTPTSGGADDNGN 466 (466) T ss_pred ccCCcccccc--------------------------CCCcchhh--hcccccccc---cCCCCcccCCCCcCCC Confidence 5544321100 00000011 111111111 1111111222222222 No 69 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=99.37 E-value=3.4e-13 Score=88.95 Aligned_cols=274 Identities=11% Similarity=0.085 Sum_probs=177.0 Q ss_pred eeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcc Q lcl|NC_011057. 75 CSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPD 154 (634) Q Consensus 75 ~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~d 154 (634) +|++-|.+-+=+.+ . ++ -+..+...-..-.+...++++.++.+|-+-|++|+.+. 
|..+ T Consensus 1 ia~l~~~~~~~~~~--------~----~~-~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~-r~~~------- 59 (278) T protein:vir:78 1 MASLPLKMYEDYKV--------V----NT-EVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIE-RDIY------- 59 (278) T ss_pred CccceeEEEecCcc--------c----cc-HHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEE-ECCC------- Confidence 77777766542211 1 12 23344455566778899999999999999999998864 4332 Q ss_pred cccccchhceeccHHHHhccCCCc--cee--eEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhh Q lcl|NC_011057. 155 GSVRTRQEWYAVSKEEIKKSNKGS--GTN--IVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRT 230 (634) Q Consensus 155 g~~~~~~~W~~vt~~Ei~~~~~~~--~~~--i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rt 230 (634) |. ..+++++..+.+......+ ... +...+|...+|.. +-+|++=+++|.....-.||+.++...+...... T Consensus 60 G~---~~~l~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~--~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~ 134 (278) T protein:vir:78 60 HQ---PSKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHN--MDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAV 134 (278) T ss_pred Cc---EEEEEEECCceeEEEEcCCCceEEEEEEcCCceEEEEcc--ccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHH Confidence 32 2356666655554332222 222 3444565555443 3356665677777777889999998888765554 Q ss_pred hHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_011057. 231 TKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAG 310 (634) Q Consensus 231 tk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~ 310 (634) .+. |..+....+.|++..|+.++- -..+++++.+- ..+...+ . 
+ +++ T Consensus 135 ~~~--~~~~~~~~~~~i~~~~~~l~~---------------------e~~~~~~~~~~----~~~~~~g---~-~-~vl- 181 (278) T protein:vir:78 135 RTF--NLTEMQKPDSFMLKYGSNVGK---------------------EKRQQVLEDFK----QYYEENG---G-I-LFQ- 181 (278) T ss_pred HHH--HHHHhcCCCcEEEEeCCCCCH---------------------HHHHHHHHHHH----HHhccCC---C-c-eec- Confidence 432 444444456666666655431 13445555543 2332211 1 2 222 Q ss_pred echHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHH Q lcl|NC_011057. 311 VPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALT 390 (634) Q Consensus 311 vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait 390 (634) + ...+++.+.+. ..+.--+++|+..++.||.+|.||| .|+|...++|+.++-+....-++..|.|.++.|+++|+ T Consensus 182 -~--~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVpp-~~lg~~~~~~~sn~~~~~~~~~~~~l~P~~~~i~~~ln 256 (278) T protein:vir:78 182 -E--PGVEIEPLPKK-YVSEDIVASENLTRERVANVFQLPS-VFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFN 256 (278) T ss_pred -C--CCceEEEccCC-hhHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 22356666553 2334458889999999999999998 78898788999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCChhHheeeecCccc Q lcl|NC_011057. 391 DQILRVTLAREGIDPSKYVVWYDASQL 417 (634) Q Consensus 391 ~~~lr~~L~~eG~d~~~yV~w~DaS~L 417 (634) +.+|.+.- ++ ..|-|=||.+.| T Consensus 257 ~~L~~~~e----~~-~g~~~~f~~~~l 278 (278) T protein:vir:78 257 RKLLTKTD----RE-KIGILNLTLNLI 278 (278) T ss_pred hhcCChhH----hc-CCceEEEecccC Confidence 99876521 11 346788999999 No 70 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=99.35 E-value=1.4e-11 Score=80.04 Aligned_cols=462 Identities=13% Similarity=0.092 Sum_probs=202.6 Q ss_pred CCCCCcceeEe---ccCCC--------------------CccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHH Q lcl|NC_011057. 
1 MAATQSLRLVR---RPKGG--------------------RPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEA 57 (634) Q Consensus 1 ~~a~~~lr~vr---rp~g~--------------------~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~ 57 (634) ||--.+||-.- --|.+ ..+.+....+.=-.-++|.-.+++..... ....+. -=.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~-l~~~ 78 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTSYIELGDYDKDIVNKAIRPGRASARDTVDGIDIADGNVAGQYSVASISD-VLSTKK-LLKA 78 (535) T ss_pred ChhhHHHHHHHHhhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhhccccccccccCCcccccccCcccc-ccCHHH-HHHH Confidence 33333332100 00000 00011111111111122221111111110 001110 0111 Q ss_pred HhhhhhHHHHHhhhhhc-----e----------eeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcc---- Q lcl|NC_011057. 58 VDLVGELRYYVGWRASS-----C----------SRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTL---- 118 (634) Q Consensus 58 yd~VgELryyvgWr~~s-----~----------Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~l---- 118 (634) |.-.+-++-.+.=+++. | ..++||-...+ +++.- -.+...+.+++..-..-.+ T Consensus 79 ~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~-----~~~~~---~~~~~~l~~lL~~~PN~~~~~~~ 150 (535) T protein:vir:10 79 YADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKV-----MSKAQ---IKRAHEIEDFIYNTGSEYYEWRD 150 (535) T ss_pred hccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCC-----Ccchh---hhhhhHHHHHHHhCCCCCCChhH Confidence 11111222211111111 1 24555532221 12221 1223344444444444333 Q ss_pred hHHHHHHHHHHh-hccccceEEEEEEecCCCCCCCcccccccchhceeccHHHHhccCC----Cccee-eEeCCCCcccc Q lcl|NC_011057. 119 GQAALTKRVVEC-LTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNK----GSGTN-IVLPTGEEHEF 192 (634) Q Consensus 119 GQaqL~kR~~~~-LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~----~~~~~-i~lP~g~~h~~ 192 (634) .-.++++.++.+ |..-|..|+.| .|... |. ...++.|....|..... .++.. +..-+|..-.. 
T Consensus 151 ~~~~~~~~lv~d~l~~~g~ay~~i-~r~~~-------G~---~~~L~~l~p~~V~v~~d~~~~~~~~~~~~~~~~~~~~~ 219 (535) T protein:vir:10 151 TFPRLLTKIINDMYVQDQINIERI-FKNDS-------NE---LDHFNAVDASKVVISYSPRSKDQPRKFEQFVSETKSVK 219 (535) T ss_pred HHHHHHHHHHHHHHhhCCceEEEE-EECCC-------Cc---EEEEEEeCCceeEEEEcCccccCceEEEEEecCceeEE Confidence 334688998887 56666666554 45332 21 22355666555542211 11122 23333333333 Q ss_pred cCCCCeE-EEeeCC-CcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCC Q lcl|NC_011057. 193 VKGTDII-FRVWIP-KPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEE 270 (634) Q Consensus 193 ~~~~D~~-~RvW~P-~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~ 270 (634) ....|++ ||-|++ +...-..--||+.++...+.-..-..+...+..+.=..-.|||-+|..+.-- T Consensus 220 ~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~------------- 286 (535) T protein:vir:10 220 FSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQ------------- 286 (535) T ss_pred ECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcc------------- Confidence 3445644 444432 2233334568888888777777766666666555555566888887653310 Q ss_pred CCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCC Q lcl|NC_011057. 271 IAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVS 350 (634) Q Consensus 271 ~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~ 350 (634) ...-..+.|++.+- ..+.-. +-+--+||+... .-+.+.+.+... 
+.--+++|+..+..||..|.|| T Consensus 287 ----ls~e~~e~lk~~~~----~~~~G~-~nag~~~vl~~~----g~~~~~l~~~~~-D~qfle~~~~~~~eIa~afgVP 352 (535) T protein:vir:10 287 ----ANQMMLAGIRRQWT----SQGSGL-GGAWKIPILAAK----DAKFVNMTQNSR-DMEFDKFLNFMIYDTAAIFQMQ 352 (535) T ss_pred ----cCHHHHHHHHHHHH----HHhcCc-ccccccccccCC----CceEEecCCChh-HHHHHHHHHHHHHHHHHHhCCC Confidence 11113344444333 223221 223455666431 124445554332 3345899999999999999999 Q ss_pred hHHhhccccCcch------------hhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccc Q lcl|NC_011057. 351 PERLLGLGSQTNH------------WSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLT 418 (634) Q Consensus 351 pE~LLGlgs~~Nh------------wtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~ 418 (634) | .+||+..++|+ .++.+....-++..|.|.+..|+++|++.+|.. .|. +|.|.||. -+. T Consensus 353 p-~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~----~~~---~~~f~f~~-l~~ 423 (535) T protein:vir:10 353 P-EEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMRY----VDT---DYRFSFTL-GDA 423 (535) T ss_pred H-HHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc----cCC---eEEEEecc-ccc Confidence 9 88898666554 345556666677789999999999999988853 232 47777774 344 Q ss_pred cCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhh-hhcccCCCCCC Q lcl|NC_011057. 419 IDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGV-LQQIEFPQQQQ 497 (634) Q Consensus 419 ~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~-~q~~~~P~p~~ 497 (634) .|.....++.++.-+|.+|.+++|+++||.--.|=|.. ...+..+ -++.+. .....-|.+.. 
T Consensus 424 ~d~~~r~~~~~~~~~g~lT~NE~R~~~gl~piegGD~~----~~~~~~~-------------~~~~~~~~~~~~~p~~~~ 486 (535) T protein:vir:10 424 QDKLQEEQVWKLKLANGYFINEYRKDHGLKTVDGLDVP----GFIGSAE-------------NFINATGFGQPNVPDSSD 486 (535) T ss_pred cCHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCcccc----ccccchh-------------hcccccccccccCCCCCC Confidence 44333344556777899999999999999754333310 0000000 000000 00000111100 Q ss_pred ----CCCCCCCCCCccc----cCCCCCCCCCCCCCCCCCc--ccCCCcc Q lcl|NC_011057. 498 ----AIDSGGNEDTSDD----DNLDDGEHEPDTEDDQDDD--GTQKAGL 536 (634) Q Consensus 498 ----a~~~~~~~~~~~d----~~~~~~~~ePDTe~d~~~~--~~~~a~~ 536 (634) ..+..+.++..++ +.+..++..|.+.|..... ...-+.+ T Consensus 487 ~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:10 487 DSGSTLGERERQERIQHSKDYEKGKDDPKSPLPKPSESDDVSNNEDADT 535 (535) T ss_pred CccccCCccccCcccccccccccCCCCCCCCCCcCCCCCccccccccCC Confidence 0000000111100 0111222222222111110 0111111 No 71 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=99.34 E-value=7.2e-12 Score=81.69 Aligned_cols=405 Identities=14% Similarity=0.140 Sum_probs=195.3 Q ss_pred ceeEeccCCCCccch-hhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeEEEeee Q lcl|NC_011057. 7 LRLVRRPKGGRPAPS-RALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRLVASEL 85 (634) Q Consensus 7 lr~vrrp~g~~~a~~-ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~asei 85 (634) .-|.++.+....... -.+...-.. 
.++++ +......-.+ ..+ ...-+.-.|.-+++++|++.|..- T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~------~~~~~--~~~~~~~~~~--~al-~~~~v~~cv~~Ia~~iA~~p~~~~-- 67 (416) T protein:vir:81 1 MGIFYKNEKRDLQYNEDDLQMMVQT------LPGFQ--GTKLRQYKDI--EAI-RHSDIFTAVMMIASDLARMPIRVT-- 67 (416) T ss_pred CCcccccccccccCCCcchhHHHHH------hcccc--ccCccccchh--hhh-cchHHHHHHHHHHHhhccCceEEe-- Confidence 334433332111110 001100000 00000 0000000000 000 011122245667889999887542 Q ss_pred cccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhcee Q lcl|NC_011057. 86 DENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYA 165 (634) Q Consensus 86 D~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~ 165 (634) .+ |.+..+ ..+..+...-..--+...++++.++.+|-+-|++|+.+ .|.. +|. ...++. T Consensus 68 -~~-----~~~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i-~r~~-------~G~---~~~L~~ 126 (416) T protein:vir:81 68 -VN-----GQINYS----DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEI-TRDK-------TGE---PMNLTF 126 (416) T ss_pred -cC-----cccccc----chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEE-EECC-------CCc---EEEEEE Confidence 23 333322 23455556667778889999999999999999999885 4532 232 234566 Q ss_pred ccHHHHhccCCCccee---eEeCCCCccc---ccCCCC-eEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_011057. 166 VSKEEIKKSNKGSGTN---IVLPTGEEHE---FVKGTD-IIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANAS 238 (634) Q Consensus 166 vt~~Ei~~~~~~~~~~---i~lP~g~~h~---~~~~~D-~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 238 (634) |....+......+|.. +..-++..+. -.+..| +-||.. +-....--||+..+.+.+.=..-..+...+.. 
T Consensus 127 i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~evihir~~---~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f 203 (416) T protein:vir:81 127 RKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFY---SLDGINGLSLLDTLSRTIESDNNGKDFLNNFL 203 (416) T ss_pred EcCceeEEEECCCccEEEEEEEecCCCceeEEEEccccEEEeccC---CCCCccccCHHHHHHHHHHHHHHHHHHHHHHH Confidence 6555543222222221 1111221111 122334 333432 22334556777766665554444445555555 Q ss_pred HhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcc Q lcl|NC_011057. 239 KSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKD 318 (634) Q Consensus 239 ~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ 318 (634) +.-..-.|||-+|+.++=+ -+.+++++.+.+.- ...+..+ -|+|+ + ..-+ T Consensus 204 ~ng~~~~gil~~~~~~~~~--------------------~~~~~~~~~~~~~~-~g~~nag-----~~~vl--~--~g~~ 253 (416) T protein:vir:81 204 RNGTHAGGILKMKGVLDNK--------------------KARDRAREEFHKSF-SGTKQAG-----KVVVL--D--ESMT 253 (416) T ss_pred hccCCCcEEEEeCCCCCCH--------------------HHHHHHHHHHHHHh-cCccccC-----ceeec--C--CCce Confidence 5555556778776544311 13344444443221 1111111 23333 2 2335 Q ss_pred cceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 319 VKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTL 398 (634) Q Consensus 319 ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L 398 (634) ++.|.+..+.-+ -+++|+.....||..|-||| .|||+. +.|. +..+..- .+...|.|.+..|+++|++.++... 
T Consensus 254 ~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp-~~lg~~-~~~~-~~~~~~~-~~~~~l~P~~~~ie~~ln~~l~~~~- 327 (416) T protein:vir:81 254 FDQLEVDTEVLK-LIRENKSSTREIAGVFGIPL-HKFGIE-TANM-SITDANL-DYLSTLKPYITCVCAELNFKFNDEY- 327 (416) T ss_pred eEeccCCHHHHH-HHHHHHHHHHHHHHHhCCCH-HHcCCC-CCCc-cHHHHHH-HHHHHHHHHHHHHHHHHhhhccccc- Confidence 555554432222 37899999999999999999 678973 4432 3334332 3555799999999999998865431 Q ss_pred HhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCccc Q lcl|NC_011057. 399 AREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTL 475 (634) Q Consensus 399 ~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~L 475 (634) ..|-|.||.+.|..- |..+.+ ..++..|++|.+++|+++||+--.+-|-. ...-+..+ T Consensus 328 -------~~~~~~f~~~~l~~~-D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~-----------~~~~~~n~ 388 (416) T protein:vir:81 328 -------VNREFKFDTTEIRVV-DEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGS-----------IHRVDLNH 388 (416) T ss_pred -------cCceEEEechhhhcc-CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc-----------eEeecccc Confidence 246679999998553 333333 34888999999999999999653333310 00000011 Q ss_pred chhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCC Q lcl|NC_011057. 
476 IPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPD 521 (634) Q Consensus 476 i~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePD 521 (634) .|+ +.++-.++..+- +....-.+|++-+ T Consensus 389 ----~~~-----~~~~~~~~~~~~---------~~~~~~kgGe~n~ 416 (416) T protein:vir:81 389 ----VNI-----ELVDEYQMNKSR---------ATDKKLKGGEENE 416 (416) T ss_pred ----ccc-----ccccccCccccc---------ccccccCCCCCCC Confidence 111 011111110000 0000011121111 No 72 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=99.34 E-value=7.2e-12 Score=81.69 Aligned_cols=405 Identities=14% Similarity=0.140 Sum_probs=195.3 Q ss_pred ceeEeccCCCCccch-hhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeEEEeee Q lcl|NC_011057. 7 LRLVRRPKGGRPAPS-RALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRLVASEL 85 (634) Q Consensus 7 lr~vrrp~g~~~a~~-ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~asei 85 (634) .-|.++.+....... -.+...-.. .++++ +......-.+ ..+ ...-+.-.|.-+++++|++.|..- T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~------~~~~~--~~~~~~~~~~--~al-~~~~v~~cv~~Ia~~iA~~p~~~~-- 67 (416) T protein:vir:45 1 MGIFYKNEKRDLQYNEDDLQMMVQT------LPGFQ--GTKLRQYKDI--EAI-RHSDIFTAVMMIASDLARMPIRVT-- 67 (416) T ss_pred CCcccccccccccCCCcchhHHHHH------hcccc--ccCccccchh--hhh-cchHHHHHHHHHHHhhccCceEEe-- Confidence 334433332111110 001100000 00000 0000000000 000 011122245667889999887542 Q ss_pred cccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhcee Q lcl|NC_011057. 86 DENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYA 165 (634) Q Consensus 86 D~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~ 165 (634) .+ |.+..+ ..+..+...-..--+...++++.++.+|-+-|++|+.+ .|.. +|. ...++. 
T Consensus 68 -~~-----~~~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i-~r~~-------~G~---~~~L~~ 126 (416) T protein:vir:45 68 -VN-----GQINYS----DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEI-TRDK-------TGE---PMNLTF 126 (416) T ss_pred -cC-----cccccc----chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEE-EECC-------CCc---EEEEEE Confidence 23 333322 23455556667778889999999999999999999885 4532 232 234566 Q ss_pred ccHHHHhccCCCccee---eEeCCCCccc---ccCCCC-eEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_011057. 166 VSKEEIKKSNKGSGTN---IVLPTGEEHE---FVKGTD-IIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANAS 238 (634) Q Consensus 166 vt~~Ei~~~~~~~~~~---i~lP~g~~h~---~~~~~D-~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 238 (634) |....+......+|.. +..-++..+. -.+..| +-||.. +-....--||+..+.+.+.=..-..+...+.. T Consensus 127 i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~evihir~~---~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f 203 (416) T protein:vir:45 127 RKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFY---SLDGINGLSLLDTLSRTIESDNNGKDFLNNFL 203 (416) T ss_pred EcCceeEEEECCCccEEEEEEEecCCCceeEEEEccccEEEeccC---CCCCccccCHHHHHHHHHHHHHHHHHHHHHHH Confidence 6555543222222221 1111221111 122334 333432 22334556777766665554444445555555 Q ss_pred HhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcc Q lcl|NC_011057. 
239 KSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKD 318 (634) Q Consensus 239 ~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ 318 (634) +.-..-.|||-+|+.++=+ -+.+++++.+.+.- ...+..+ -|+|+ + ..-+ T Consensus 204 ~ng~~~~gil~~~~~~~~~--------------------~~~~~~~~~~~~~~-~g~~nag-----~~~vl--~--~g~~ 253 (416) T protein:vir:45 204 RNGTHAGGILKMKGVLDNK--------------------KARDRAREEFHKSF-SGTKQAG-----KVVVL--D--ESMT 253 (416) T ss_pred hccCCCcEEEEeCCCCCCH--------------------HHHHHHHHHHHHHh-cCccccC-----ceeec--C--CCce Confidence 5555556778776544311 13344444443221 1111111 23333 2 2335 Q ss_pred cceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 319 VKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTL 398 (634) Q Consensus 319 ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L 398 (634) ++.|.+..+.-+ -+++|+.....||..|-||| .|||+. +.|. +..+..- .+...|.|.+..|+++|++.++... T Consensus 254 ~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp-~~lg~~-~~~~-~~~~~~~-~~~~~l~P~~~~ie~~ln~~l~~~~- 327 (416) T protein:vir:45 254 FDQLEVDTEVLK-LIRENKSSTREIAGVFGIPL-HKFGIE-TANM-SITDANL-DYLSTLKPYITCVCAELNFKFNDEY- 327 (416) T ss_pred eEeccCCHHHHH-HHHHHHHHHHHHHHHhCCCH-HHcCCC-CCCc-cHHHHHH-HHHHHHHHHHHHHHHHHhhhccccc- Confidence 555554432222 37899999999999999999 678973 4432 3334332 3555799999999999998865431 Q ss_pred HhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCccc Q lcl|NC_011057. 399 AREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTL 475 (634) Q Consensus 399 ~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~L 475 (634) ..|-|.||.+.|..- |..+.+ ..++..|++|.+++|+++||+--.+-|-. 
...-+..+ T Consensus 328 -------~~~~~~f~~~~l~~~-D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~-----------~~~~~~n~ 388 (416) T protein:vir:45 328 -------VNREFKFDTTEIRVV-DEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGS-----------IHRVDLNH 388 (416) T ss_pred -------cCceEEEechhhhcc-CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc-----------eEeecccc Confidence 246679999998553 333333 34888999999999999999653333310 00000011 Q ss_pred chhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCC Q lcl|NC_011057. 476 IPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPD 521 (634) Q Consensus 476 i~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePD 521 (634) .|+ +.++-.++..+- +....-.+|++-+ T Consensus 389 ----~~~-----~~~~~~~~~~~~---------~~~~~~kgGe~n~ 416 (416) T protein:vir:45 389 ----VNI-----ELVDEYQMNKSR---------ATDKKLKGGEENE 416 (416) T ss_pred ----ccc-----ccccccCccccc---------ccccccCCCCCCC Confidence 111 011111110000 0000011121111 No 73 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=99.33 E-value=4e-12 Score=83.09 Aligned_cols=413 Identities=14% Similarity=0.116 Sum_probs=193.8 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHh-----hhhhHHHHHhhhhhce Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVD-----LVGELRYYVGWRASSC 75 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd-----~VgELryyvgWr~~s~ 75 (634) --.+..-.+|+-++... -.|++. .+-.+.+... +...+ |+.-.-..|. 
.++-+.-.|.-+++++ T Consensus 16 ~~~~~~~~~~~~~f~~~--e~r~~~---~~~~~~~~~~-~~~~~-----~~~~~~~~~~~~~al~~~~V~acv~~Ia~~i 84 (441) T protein:vir:98 16 RKQSRKELVVVGIFYKN--EKRDLQ---YNEDDLQMMV-QTLPG-----FQGTKLRQYKDIEAIRHSDIFTAVMMIASDL 84 (441) T ss_pred ccchhhhhhcccccccc--cccccc---CCCcchHHHH-HHhhc-----ccccCccccchhhhhccHHHHHHHHHHHHhh Confidence 01111111233332211 011111 0111111111 10000 1100000111 1122223466788899 Q ss_pred eeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCccc Q lcl|NC_011057. 76 SRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDG 155 (634) Q Consensus 76 Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg 155 (634) +.+.|..-+ + |.+..+ ..+..++..-+..-+...++++.++.+|.+-|++|+.| .|.. +| T Consensus 85 A~lpl~~~~---~-----~~~~~~----~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i-~r~~-------~G 144 (441) T protein:vir:98 85 ARMPIRVTV---N-----GQINYS----DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEI-TRDK-------TG 144 (441) T ss_pred ccCceEEec---C-----Cccccc----chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEE-EEcC-------CC Confidence 998876632 3 333322 23455566667888999999999999999999999875 4532 22 Q ss_pred ccccchhceeccHHHHhccCCCcc-eeeEe--CC--CCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhh Q lcl|NC_011057. 156 SVRTRQEWYAVSKEEIKKSNKGSG-TNIVL--PT--GEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRT 230 (634) Q Consensus 156 ~~~~~~~W~~vt~~Ei~~~~~~~~-~~i~l--P~--g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rt 230 (634) . + ...+.+....+......+| ..+.. .+ |......-..+=||++=. ++-.-..--||+..+.+.+.--.-. 
T Consensus 145 ~--~-~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~dviHir~-~~~dg~~G~spi~~~~~~i~~~~a~ 220 (441) T protein:vir:98 145 E--P-MNLTFRKTSEIELKLDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKF-YSLDGINGLSLLDTLSRTIESDNNG 220 (441) T ss_pred c--E-EEEEEEcCceeEEEECCCCcEEEEEEEeccCcceeeEEEccccEEEecc-CCCCCccccCHHHHHHHHHHHHHHH Confidence 1 1 2344444443322211111 11111 11 111111112222344411 1222233456666655555443334 Q ss_pred hHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_011057. 231 TKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAG 310 (634) Q Consensus 231 tk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~ 310 (634) .+...+..+.-..-.|||-+|+.++=+ -+.+.+++.+. .++.-.+ -|-=|+|+ T Consensus 221 ~~~~~~~f~ng~~~~gil~~~~~~~~~--------------------e~~~~~~~~~~----~~~~G~~--nag~~~vl- 273 (441) T protein:vir:98 221 KDFLNNFLRNGTHAGGILKMKGVLDNK--------------------KARDRAREEFH----KSFSGTK--QAGKVVVL- 273 (441) T ss_pred HHHHHHHHhccCCCcEEEEeCCCCCCH--------------------HHHHHHHHHHH----HHhcCcc--ccCcceec- Confidence 444444344444445777776654311 12333433332 2222111 11123333 Q ss_pred echHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHH Q lcl|NC_011057. 311 VPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALT 390 (634) Q Consensus 311 vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait 390 (634) + ..-+++.|.+..+..+ -+++|+..+..||..|.||| .+||+. 
+.| ++..+..- .+...|.|.+..|+++|+ T Consensus 274 -~--~g~~~~~l~~~~~d~q-~~e~r~~~~~~Ia~~fgVPp-~~lg~~-~~~-~s~~q~~~-~y~~tl~P~~~~ie~~ln 345 (441) T protein:vir:98 274 -D--ESMTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPL-HKFGIE-TAN-MSITDANL-DYLSTLKPYITCVCAELN 345 (441) T ss_pred -C--CCceEEEccCChhHHH-HHHHHHHhHHHHHHHhCCCH-HHcCCC-CCC-ccHHHHHH-HHHHHHHHHHHHHHHHHH Confidence 2 3346666665543323 47999999999999999999 677873 433 23334332 344579999999999999 Q ss_pred HHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHH Q lcl|NC_011057. 391 DQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQD 467 (634) Q Consensus 391 ~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d 467 (634) +.++... ..|-|.||.+.|..- |..+.+ ..+++.|++|.+++|+++||+.-.|-|-. T Consensus 346 ~~L~~~~--------~~~~~~fd~~~llr~-d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~----------- 405 (441) T protein:vir:98 346 FKFNDEY--------VNREFKFDTTEIRVV-DEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGS----------- 405 (441) T ss_pred hhccccc--------cCceEEEechhhhcc-CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc----------- Confidence 8865321 235579999998653 444433 34888999999999999999653333300 Q ss_pred HhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCC Q lcl|NC_011057. 468 AVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPD 521 (634) Q Consensus 468 ~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePD 521 (634) ...-+-.. .|+ +..+-.++.. + +..+.. 
..+|++-| T Consensus 406 ~~~~~~n~----~~~-----~~~~~~q~~~---~--~~~~~~----~kgGe~ne 441 (441) T protein:vir:98 406 IHRVDLNH----VNI-----ELVDEYQMNK---S--RATDKK----LKGGEENE 441 (441) T ss_pred eEeecccc----ccc-----cccccccccc---c--cccccc----cCCCCCCC Confidence 00000000 010 0011111100 0 000111 12222222 No 74 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=99.30 E-value=3e-12 Score=83.78 Aligned_cols=388 Identities=13% Similarity=0.119 Sum_probs=182.5 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.= ..+-++..... ..+++ ... +..-..+.|-...-++-.+.-+++.+|++.+ T Consensus 1 Mg~------f~~lf~~~~~~--------~~~~~------~~~-------~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~ 53 (395) T protein:vir:95 1 MSI------LEKIFKTRKDI--------TYMLD------LDM-------IEDLSQQAYVKRLAIDSCIEFVARAVAQSHF 53 (395) T ss_pred Cch------hhhhhccCccc--------ccccc------chh-------ccccchhhhhhhHHHHHHHHHHHHhhcccee Confidence 322 11222211000 00110 000 0000111222345566778889999999987 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+ . +.+.+ +. +......=...-+...++++.++.+|-.-|+.++++ +..++. .. 
T Consensus 54 ~~~~---~-----~~~~~----~~-~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~--~~~~~~---------~~ 109 (395) T protein:vir:95 54 KVLE---G-----NRIQK----ND-VYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVV--SDSKEL---------LI 109 (395) T ss_pred Eecc---C-----Ccccc----ch-HHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEE--ecCCCe---------Ee Confidence 6532 1 12221 22 233334446778889999999999999999887654 222211 11 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKS 240 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 240 (634) ..++.++...+. . .--..+..-++......+..|++ ++=..++.-...-.||+.++...+. +.. ++-+. T Consensus 110 ~~~~~~~~~~~~--~-~~~~~~~~~~~~~~~~~~~~evi-h~~~~~~~~~~~G~spi~~~~~~~~----~~~---~~~~~ 178 (395) T protein:vir:95 110 ADSFYREEYALY--D-DIFKDVTVKDYTYQRTFTMQEVI-YLKYNNNKVTHFVESLFEDYGKIFG----RMI---GAQLK 178 (395) T ss_pred cCCccceeEeec--C-cceeEEEEcCceeeeeeccccEE-EEccCCCCcccccchHHHHHHHHHH----HHH---HHHHh Confidence 122222221111 0 00011222222211112334433 3312233333445677766544432 211 11111 Q ss_pred HhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccc Q lcl|NC_011057. 241 RLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVK 320 (634) Q Consensus 241 RL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ik 320 (634) =..-.|+|.+|+.+. ..-+.+++++.+-+ ...+. +.+ . 
..|+ ..+ ..-+++ T Consensus 179 ~~~~~gii~~~~~~~--------------------~~e~~~~~~~~~~~-~~~~~-~~~-~---~~v~-~l~--~g~~~~ 229 (395) T protein:vir:95 179 NYQIRGILKSASSAY--------------------DEKNIEKLQAFTNK-LFNTF-NKN-Q---LAIA-PLI--EGFDYE 229 (395) T ss_pred cCCCceEEEeCCCCC--------------------CHHHHHHHHHHHHH-Hhccc-ccc-C---cceE-EcC--CCceee Confidence 111225555554321 11244455544432 11111 111 1 1222 222 334455 Q ss_pred eeecCC---chhHH-HHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_011057. 321 HIRFDN---EITEV-AIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRV 396 (634) Q Consensus 321 Hl~f~~---d~te~-aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~ 396 (634) -|.+.. +.++. -+++|+..+..||..|.|||. +|| |+ ..++-+....=++..|.|.+..|+++|++.+|.+ T Consensus 230 ~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~-~l~-~~---~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~ 304 (395) T protein:vir:95 230 ELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPG-LIY-GE---TADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQ 304 (395) T ss_pred eccccccccchhHHHHHHHHHHHHHHHHHHhCCCHH-Hhc-Cc---ccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCh Confidence 555443 33333 488999999999999999995 556 33 3446777777888899999999999999998776 Q ss_pred HHHhcCCChhHheeeecCcccccCCCchH---HHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCc Q lcl|NC_011057. 397 TLAREGIDPSKYVVWYDASQLTIDPDKSD---EAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDP 473 (634) Q Consensus 397 ~L~~eG~d~~~yV~w~DaS~L~~~pd~t~---eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp 473 (634) ... .+| +.||.+.|.. .|..+ ....++..|++|.+++|.++|+.--.+... 
.+-+ +.+| T Consensus 305 ~~~------~~~-~~f~~~~l~~-~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~--d~~~-------~~~n- 366 (395) T protein:vir:95 305 SMY------LKD-TRIEIVGVNK-KDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPEL--DEYL-------ITKN- 366 (395) T ss_pred hhh------ccc-ceecchhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC--ceee-------eccc- Confidence 332 112 3577787743 34333 234488999999999999999964322100 0000 0010 Q ss_pred ccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCC Q lcl|NC_011057. 474 TLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDD 525 (634) Q Consensus 474 ~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d 525 (634) +.|+ ....+.+....+....+|++.+- .| T Consensus 367 -----~~~~-----------------~~~~~~~~~~~~~~~kgg~~~~~-g~ 395 (395) T protein:vir:95 367 -----YEKA-----------------NSGENDEKEKDENTLKGGDEDES-GD 395 (395) T ss_pred -----cccc-----------------cccccccCcccccccCCCCCCCC-CC Confidence 0111 01111111111111222222221 11 No 75 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=99.30 E-value=3e-12 Score=83.78 Aligned_cols=388 Identities=13% Similarity=0.119 Sum_probs=182.5 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.= ..+-++..... ..+++ ... 
+..-..+.|-...-++-.+.-+++.+|++.+ T Consensus 1 Mg~------f~~lf~~~~~~--------~~~~~------~~~-------~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~ 53 (395) T protein:vir:10 1 MSI------LEKIFKTRKDI--------TYMLD------LDM-------IEDLSQQAYVKRLAIDSCIEFVARAVAQSHF 53 (395) T ss_pred Cch------hhhhhccCccc--------ccccc------chh-------ccccchhhhhhhHHHHHHHHHHHHhhcccee Confidence 322 11222211000 00110 000 0000111222345566778889999999987 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+ . +.+.+ +. +......=...-+...++++.++.+|-.-|+.++++ +..++. .. T Consensus 54 ~~~~---~-----~~~~~----~~-~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~--~~~~~~---------~~ 109 (395) T protein:vir:10 54 KVLE---G-----NRIQK----ND-VYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVV--SDSKEL---------LI 109 (395) T ss_pred Eecc---C-----Ccccc----ch-HHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEE--ecCCCe---------Ee Confidence 6532 1 12221 22 233334446778889999999999999999887654 222211 11 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKS 240 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 240 (634) ..++.++...+. . .--..+..-++......+..|++ ++=..++.-...-.||+.++...+. +.. ++-+. 
T Consensus 110 ~~~~~~~~~~~~--~-~~~~~~~~~~~~~~~~~~~~evi-h~~~~~~~~~~~G~spi~~~~~~~~----~~~---~~~~~ 178 (395) T protein:vir:10 110 ADSFYREEYALY--D-DIFKDVTVKDYTYQRTFTMQEVI-YLKYNNNKVTHFVESLFEDYGKIFG----RMI---GAQLK 178 (395) T ss_pred cCCccceeEeec--C-cceeEEEEcCceeeeeeccccEE-EEccCCCCcccccchHHHHHHHHHH----HHH---HHHHh Confidence 122222221111 0 00011222222211112334433 3312233333445677766544432 211 11111 Q ss_pred HhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccc Q lcl|NC_011057. 241 RLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVK 320 (634) Q Consensus 241 RL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ik 320 (634) =..-.|+|.+|+.+. ..-+.+++++.+-+ ...+. +.+ . ..|+ ..+ ..-+++ T Consensus 179 ~~~~~gii~~~~~~~--------------------~~e~~~~~~~~~~~-~~~~~-~~~-~---~~v~-~l~--~g~~~~ 229 (395) T protein:vir:10 179 NYQIRGILKSASSAY--------------------DEKNIEKLQAFTNK-LFNTF-NKN-Q---LAIA-PLI--EGFDYE 229 (395) T ss_pred cCCCceEEEeCCCCC--------------------CHHHHHHHHHHHHH-Hhccc-ccc-C---cceE-EcC--CCceee Confidence 111225555554321 11244455544432 11111 111 1 1222 222 334455 Q ss_pred eeecCC---chhHH-HHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_011057. 321 HIRFDN---EITEV-AIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRV 396 (634) Q Consensus 321 Hl~f~~---d~te~-aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~ 396 (634) -|.+.. +.++. -+++|+..+..||..|.|||. 
+|| |+ ..++-+....=++..|.|.+..|+++|++.+|.+ T Consensus 230 ~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~-~l~-~~---~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~ 304 (395) T protein:vir:10 230 ELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPG-LIY-GE---TADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQ 304 (395) T ss_pred eccccccccchhHHHHHHHHHHHHHHHHHHhCCCHH-Hhc-Cc---ccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCh Confidence 555443 33333 488999999999999999995 556 33 3446777777888899999999999999998776 Q ss_pred HHHhcCCChhHheeeecCcccccCCCchH---HHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCc Q lcl|NC_011057. 397 TLAREGIDPSKYVVWYDASQLTIDPDKSD---EAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDP 473 (634) Q Consensus 397 ~L~~eG~d~~~yV~w~DaS~L~~~pd~t~---eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp 473 (634) ... .+| +.||.+.|.. .|..+ ....++..|++|.+++|.++|+.--.+... .+-+ +.+| T Consensus 305 ~~~------~~~-~~f~~~~l~~-~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~--d~~~-------~~~n- 366 (395) T protein:vir:10 305 SMY------LKD-TRIEIVGVNK-KDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPEL--DEYL-------ITKN- 366 (395) T ss_pred hhh------ccc-ceecchhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC--ceee-------eccc- Confidence 332 112 3577787743 34333 234488999999999999999964322100 0000 0010 Q ss_pred ccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCC Q lcl|NC_011057. 
474 TLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDD 525 (634) Q Consensus 474 ~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d 525 (634) +.|+ ....+.+....+....+|++.+- .| T Consensus 367 -----~~~~-----------------~~~~~~~~~~~~~~~kgg~~~~~-g~ 395 (395) T protein:vir:10 367 -----YEKA-----------------NSGENDEKEKDENTLKGGDEDES-GD 395 (395) T ss_pred -----cccc-----------------cccccccCcccccccCCCCCCCC-CC Confidence 0111 01111111111111222222221 11 No 76 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=99.30 E-value=3e-12 Score=83.78 Aligned_cols=388 Identities=13% Similarity=0.119 Sum_probs=182.5 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.= ..+-++..... ..+++ ... +..-..+.|-...-++-.+.-+++.+|++.+ T Consensus 1 Mg~------f~~lf~~~~~~--------~~~~~------~~~-------~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~ 53 (395) T protein:vir:10 1 MSI------LEKIFKTRKDI--------TYMLD------LDM-------IEDLSQQAYVKRLAIDSCIEFVARAVAQSHF 53 (395) T ss_pred Cch------hhhhhccCccc--------ccccc------chh-------ccccchhhhhhhHHHHHHHHHHHHhhcccee Confidence 322 11222211000 00110 000 0000111222345566778889999999987 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+ . +.+.+ +. +......=...-+...++++.++.+|-.-|+.++++ +..++. .. 
T Consensus 54 ~~~~---~-----~~~~~----~~-~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~--~~~~~~---------~~ 109 (395) T protein:vir:10 54 KVLE---G-----NRIQK----ND-VYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVV--SDSKEL---------LI 109 (395) T ss_pred Eecc---C-----Ccccc----ch-HHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEE--ecCCCe---------Ee Confidence 6532 1 12221 22 233334446778889999999999999999887654 222211 11 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKS 240 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 240 (634) ..++.++...+. . .--..+..-++......+..|++ ++=..++.-...-.||+.++...+. +.. ++-+. T Consensus 110 ~~~~~~~~~~~~--~-~~~~~~~~~~~~~~~~~~~~evi-h~~~~~~~~~~~G~spi~~~~~~~~----~~~---~~~~~ 178 (395) T protein:vir:10 110 ADSFYREEYALY--D-DIFKDVTVKDYTYQRTFTMQEVI-YLKYNNNKVTHFVESLFEDYGKIFG----RMI---GAQLK 178 (395) T ss_pred cCCccceeEeec--C-cceeEEEEcCceeeeeeccccEE-EEccCCCCcccccchHHHHHHHHHH----HHH---HHHHh Confidence 122222221111 0 00011222222211112334433 3312233333445677766544432 211 11111 Q ss_pred HhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccc Q lcl|NC_011057. 241 RLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVK 320 (634) Q Consensus 241 RL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ik 320 (634) =..-.|+|.+|+.+. ..-+.+++++.+-+ ...+. +.+ . 
..|+ ..+ ..-+++ T Consensus 179 ~~~~~gii~~~~~~~--------------------~~e~~~~~~~~~~~-~~~~~-~~~-~---~~v~-~l~--~g~~~~ 229 (395) T protein:vir:10 179 NYQIRGILKSASSAY--------------------DEKNIEKLQAFTNK-LFNTF-NKN-Q---LAIA-PLI--EGFDYE 229 (395) T ss_pred cCCCceEEEeCCCCC--------------------CHHHHHHHHHHHHH-Hhccc-ccc-C---cceE-EcC--CCceee Confidence 111225555554321 11244455544432 11111 111 1 1222 222 334455 Q ss_pred eeecCC---chhHH-HHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_011057. 321 HIRFDN---EITEV-AIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRV 396 (634) Q Consensus 321 Hl~f~~---d~te~-aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~ 396 (634) -|.+.. +.++. -+++|+..+..||..|.|||. +|| |+ ..++-+....=++..|.|.+..|+++|++.+|.+ T Consensus 230 ~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~-~l~-~~---~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~ 304 (395) T protein:vir:10 230 ELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPG-LIY-GE---TADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQ 304 (395) T ss_pred eccccccccchhHHHHHHHHHHHHHHHHHHhCCCHH-Hhc-Cc---ccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCh Confidence 555443 33333 488999999999999999995 556 33 3446777777888899999999999999998776 Q ss_pred HHHhcCCChhHheeeecCcccccCCCchH---HHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCc Q lcl|NC_011057. 397 TLAREGIDPSKYVVWYDASQLTIDPDKSD---EAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDP 473 (634) Q Consensus 397 ~L~~eG~d~~~yV~w~DaS~L~~~pd~t~---eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp 473 (634) ... .+| +.||.+.|.. .|..+ ....++..|++|.+++|.++|+.--.+... 
.+-+ +.+| T Consensus 305 ~~~------~~~-~~f~~~~l~~-~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~--d~~~-------~~~n- 366 (395) T protein:vir:10 305 SMY------LKD-TRIEIVGVNK-KDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPEL--DEYL-------ITKN- 366 (395) T ss_pred hhh------ccc-ceecchhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC--ceee-------eccc- Confidence 332 112 3577787743 34333 234488999999999999999964322100 0000 0010 Q ss_pred ccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCC Q lcl|NC_011057. 474 TLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDD 525 (634) Q Consensus 474 ~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d 525 (634) +.|+ ....+.+....+....+|++.+- .| T Consensus 367 -----~~~~-----------------~~~~~~~~~~~~~~~kgg~~~~~-g~ 395 (395) T protein:vir:10 367 -----YEKA-----------------NSGENDEKEKDENTLKGGDEDES-GD 395 (395) T ss_pred -----cccc-----------------cccccccCcccccccCCCCCCCC-CC Confidence 0111 01111111111111222222221 11 No 77 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=99.27 E-value=2.8e-11 Score=78.48 Aligned_cols=415 Identities=13% Similarity=0.128 Sum_probs=192.5 Q ss_pred CC---------------CCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcc--cCccccccH-HHHHHHhhhh Q lcl|NC_011057. 1 MA---------------ATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTG--ISRNSDWQT-DAWEAVDLVG 62 (634) Q Consensus 1 ~~---------------a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~--~~~~~~WQ~-eAW~~yd~Vg 62 (634) |- ....-=+|+-+|... -.|++. .+-.+.+..++...+ +...+.+-. 
.| + ..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~--e~R~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~a---l-~~~ 71 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKN--EKRDLQ---YNEDDLQMMVQTLPGFQGTKLRQYKDIEA---I-RHS 71 (441) T ss_pred CccccCccccccccccccchhhhhcccccccc--cccccc---CCCcchHHHHHHhcccCcccccccchhhh---h-ccH Confidence 00 000000111111100 001110 011011000000000 000000000 00 0 112 Q ss_pred hHHHHHhhhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEE Q lcl|NC_011057. 63 ELRYYVGWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVIL 142 (634) Q Consensus 63 ELryyvgWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il 142 (634) -+.-.|.-++++++++-|..- .+ |.+..+ ..+..++..-+...+...++++.++.+|-+-|+.|+.+ T Consensus 72 ~V~~cv~~Ia~~iA~lp~~~~---~~-----~~~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i- 138 (441) T protein:vir:79 72 DIFTAVMMIASDLARMPIRVT---VN-----GQINYS----DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEI- 138 (441) T ss_pred HHHHHHHHHHHhhccCceeee---cC-----cccccc----chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEE- Confidence 233346668888888876542 23 333322 23455566667888999999999999999999999886 Q ss_pred EecCCCCCCCcccccccchhceeccHHHHhccCCCccee---eEeCCCCcccc--cCCCCeEEEeeCCCcccccCCccch Q lcl|NC_011057. 143 TRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTN---IVLPTGEEHEF--VKGTDIIFRVWIPKPRKASEPDSPV 217 (634) Q Consensus 143 ~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~---i~lP~g~~h~~--~~~~D~~~RvW~P~prra~eaDSPv 217 (634) .|.. +|. + .+.+.|....+......+|.. +...+|..+.+ .-..+=||++=. 
++-.-..--||+ T Consensus 139 ~r~~-------~G~--~-~~L~~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~-~~~dg~~G~spl 207 (441) T protein:vir:79 139 TRDK-------TGE--P-MNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKF-YSLDGINGLSLL 207 (441) T ss_pred EECC-------CCc--E-EEEEEEcCceeEEEECCCccEEEEEEEeccCCceeEEEEccccEEEecc-CCCCCccccCHH Confidence 4532 232 2 234445444443222122221 11112221111 112222333311 122223345666 Q ss_pred hhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccC Q lcl|NC_011057. 218 RAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVED 297 (634) Q Consensus 218 ra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~D 297 (634) ..+...+.-..-..+...+..+.-..-.|||-+|+.++=+ -+.+++++.+- .++.- T Consensus 208 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~--------------------e~~e~~r~~~~----~~~~G 263 (441) T protein:vir:79 208 DTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNK--------------------KARDRAREEFH----KSFSG 263 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCH--------------------HHHHHHHHHHH----HHhcC Confidence 6555554433333344444444444445777777554311 13334444433 22222 Q ss_pred ccccccccceeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHH Q lcl|NC_011057. 298 EDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLH 377 (634) Q Consensus 298 e~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~h 377 (634) .+. +--|+|+ + ..-+++.|.+..+.-+ -+++|+..+..||..|.||| .|||+. +.| ++..+..-. +... 
T Consensus 264 ~~n--ag~~~vl--~--~G~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp-~~lg~~-~~~-~s~~q~~~~-~~~t 332 (441) T protein:vir:79 264 TKQ--AGKVVVL--D--ESMTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPL-HKFGIE-TAN-MSITDANLD-YLST 332 (441) T ss_pred ccc--cCcceec--C--CCceEEEccCChhHHH-HHHHHHHhHHHHHHHhCCCH-HHcCCC-CCC-ccHHHHHHH-HHHH Confidence 111 1122333 2 3346666665443222 47899999999999999999 678973 444 344554433 5557 Q ss_pred HHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCC Q lcl|NC_011057. 378 IAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYD 454 (634) Q Consensus 378 I~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd 454 (634) |.|.+..|+++|++.++... ..|-|.||.+.|..- |..+.+ ..+++.|++|.+++|+++||+--.|-| T Consensus 333 l~P~~~~ie~eln~kl~~~~--------~~~~~~fd~~~llr~-D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd 403 (441) T protein:vir:79 333 LKPYITCVCAELNFKFNDEY--------VNREFKFDTTEIRVV-DEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGN 403 (441) T ss_pred HHHHHHHHHHHHhhhccccc--------cCceEEeechhhhcc-CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 99999999999998876431 245579999998653 433333 347889999999999999996543333 Q ss_pred CCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCC Q lcl|NC_011057. 455 FTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPD 521 (634) Q Consensus 455 ~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePD 521 (634) -. ...-+-.. .|+ +.++.+++.. .++.+.. 
-.+|++-| T Consensus 404 ~~-----------~~~~~~n~----~~~-----~~~~~~~~~~-----~~~~~~~----~kgGe~~e 441 (441) T protein:vir:79 404 GS-----------IHRVDLNH----VNI-----ELVDEYQMNK-----SRATDKK----LKGGEENE 441 (441) T ss_pred cc-----------eEeecccc----ccc-----cccccccccc-----ccccccc----cCCCCCCC Confidence 10 00000000 111 0011111100 0000111 11222211 No 78 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=99.27 E-value=2.8e-11 Score=78.48 Aligned_cols=415 Identities=13% Similarity=0.128 Sum_probs=192.5 Q ss_pred CC---------------CCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcc--cCccccccH-HHHHHHhhhh Q lcl|NC_011057. 1 MA---------------ATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTG--ISRNSDWQT-DAWEAVDLVG 62 (634) Q Consensus 1 ~~---------------a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~--~~~~~~WQ~-eAW~~yd~Vg 62 (634) |- ....-=+|+-+|... -.|++. .+-.+.+..++...+ +...+.+-. .| + ..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~--e~R~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~a---l-~~~ 71 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKN--EKRDLQ---YNEDDLQMMVQTLPGFQGTKLRQYKDIEA---I-RHS 71 (441) T ss_pred CccccCccccccccccccchhhhhcccccccc--cccccc---CCCcchHHHHHHhcccCcccccccchhhh---h-ccH Confidence 00 000000111111100 001110 011011000000000 000000000 00 0 112 Q ss_pred hHHHHHhhhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEE Q lcl|NC_011057. 
63 ELRYYVGWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVIL 142 (634) Q Consensus 63 ELryyvgWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il 142 (634) -+.-.|.-++++++++-|..- .+ |.+..+ ..+..++..-+...+...++++.++.+|-+-|+.|+.+ T Consensus 72 ~V~~cv~~Ia~~iA~lp~~~~---~~-----~~~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i- 138 (441) T protein:vir:94 72 DIFTAVMMIASDLARMPIRVT---VN-----GQINYS----DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEI- 138 (441) T ss_pred HHHHHHHHHHHhhccCceeee---cC-----cccccc----chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEE- Confidence 233346668888888876542 23 333322 23455566667888999999999999999999999886 Q ss_pred EecCCCCCCCcccccccchhceeccHHHHhccCCCccee---eEeCCCCcccc--cCCCCeEEEeeCCCcccccCCccch Q lcl|NC_011057. 143 TRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTN---IVLPTGEEHEF--VKGTDIIFRVWIPKPRKASEPDSPV 217 (634) Q Consensus 143 ~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~---i~lP~g~~h~~--~~~~D~~~RvW~P~prra~eaDSPv 217 (634) .|.. +|. + .+.+.|....+......+|.. +...+|..+.+ .-..+=||++=. ++-.-..--||+ T Consensus 139 ~r~~-------~G~--~-~~L~~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~-~~~dg~~G~spl 207 (441) T protein:vir:94 139 TRDK-------TGE--P-MNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKF-YSLDGINGLSLL 207 (441) T ss_pred EECC-------CCc--E-EEEEEEcCceeEEEECCCccEEEEEEEeccCCceeEEEEccccEEEecc-CCCCCccccCHH Confidence 4532 232 2 234445444443222122221 11112221111 112222333311 122223345666 Q ss_pred hhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccC Q lcl|NC_011057. 
218 RAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVED 297 (634) Q Consensus 218 ra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~D 297 (634) ..+...+.-..-..+...+..+.-..-.|||-+|+.++=+ -+.+++++.+- .++.- T Consensus 208 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~--------------------e~~e~~r~~~~----~~~~G 263 (441) T protein:vir:94 208 DTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNK--------------------KARDRAREEFH----KSFSG 263 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCH--------------------HHHHHHHHHHH----HHhcC Confidence 6555554433333344444444444445777777554311 13334444433 22222 Q ss_pred ccccccccceeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHH Q lcl|NC_011057. 298 EDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLH 377 (634) Q Consensus 298 e~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~h 377 (634) .+. +--|+|+ + ..-+++.|.+..+.-+ -+++|+..+..||..|.||| .|||+. +.| ++..+..-. +... T Consensus 264 ~~n--ag~~~vl--~--~G~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp-~~lg~~-~~~-~s~~q~~~~-~~~t 332 (441) T protein:vir:94 264 TKQ--AGKVVVL--D--ESMTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPL-HKFGIE-TAN-MSITDANLD-YLST 332 (441) T ss_pred ccc--cCcceec--C--CCceEEEccCChhHHH-HHHHHHHhHHHHHHHhCCCH-HHcCCC-CCC-ccHHHHHHH-HHHH Confidence 111 1122333 2 3346666665443222 47899999999999999999 678973 444 344554433 5557 Q ss_pred HHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCC Q lcl|NC_011057. 378 IAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYD 454 (634) Q Consensus 378 I~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd 454 (634) |.|.+..|+++|++.++... 
..|-|.||.+.|..- |..+.+ ..+++.|++|.+++|+++||+--.|-| T Consensus 333 l~P~~~~ie~eln~kl~~~~--------~~~~~~fd~~~llr~-D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd 403 (441) T protein:vir:94 333 LKPYITCVCAELNFKFNDEY--------VNREFKFDTTEIRVV-DEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGN 403 (441) T ss_pred HHHHHHHHHHHHhhhccccc--------cCceEEeechhhhcc-CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 99999999999998876431 245579999998653 433333 347889999999999999996543333 Q ss_pred CCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCC Q lcl|NC_011057. 455 FTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPD 521 (634) Q Consensus 455 ~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePD 521 (634) -. ...-+-.. .|+ +.++.+++.. .++.+.. -.+|++-| T Consensus 404 ~~-----------~~~~~~n~----~~~-----~~~~~~~~~~-----~~~~~~~----~kgGe~~e 441 (441) T protein:vir:94 404 GS-----------IHRVDLNH----VNI-----ELVDEYQMNK-----SRATDKK----LKGGEENE 441 (441) T ss_pred cc-----------eEeecccc----ccc-----cccccccccc-----ccccccc----cCCCCCCC Confidence 10 00000000 111 0011111100 0000111 11222211 No 79 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=99.27 E-value=1.3e-11 Score=80.28 Aligned_cols=373 Identities=12% Similarity=0.103 Sum_probs=200.5 Q ss_pred ceeEeccCCCCc---cchhhhhhhhccCCchhhhhhhh--cccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeEE Q lcl|NC_011057. 7 LRLVRRPKGGRP---APSRALTAASQPLPDPSQVFSKS--TGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRLV 81 (634) Q Consensus 7 lr~vrrp~g~~~---a~~ral~aAs~~itdp~~~~~~~--~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~ 81 (634) .++.++-+.... ..+..... +.++....... ...+... |+. .+-+.-.+.=+++.+|.+.+. 
T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~v~~~~-----al~----~~~v~~~i~~ia~~ia~~p~~ 67 (386) T protein:vir:49 1 MPIFNITNLATESPPINQESFFD----IADSDFLASLNSSEWVSAEN-----ALK----NSDLFSIISQLSNDLATAKIT 67 (386) T ss_pred CchhhhhccCCCCcccchhhhhh----hhhccccccccCCceechhh-----hhc----cHHHHHHHHHHHHHhhhCcee Confidence 334433332211 11111110 00111111000 0111111 211 233445566788999998887 Q ss_pred EeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccch Q lcl|NC_011057. 82 ASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQ 161 (634) Q Consensus 82 aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~ 161 (634) .-+-+.+ . +..-+..-+...++++.++.+|-+-|+.|+.+. |.. +|. .. T Consensus 68 ~~~~~~~-------------------~-l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~-r~~-------~g~---~~ 116 (386) T protein:vir:49 68 TSRKQLQ-------------------G-IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRW-RND-------NGR---DM 116 (386) T ss_pred eccchhh-------------------h-hhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEE-ECC-------CCc---EE Confidence 6543221 1 122345566788999999999999999999863 422 231 23 Q ss_pred hceeccHHHHhcc--CCCcceeeEe----CCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_011057. 162 EWYAVSKEEIKKS--NKGSGTNIVL----PTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIA 235 (634) Q Consensus 162 ~W~~vt~~Ei~~~--~~~~~~~i~l----P~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 235 (634) .++.|....+... ..+....+.. +.+......+..| +|++=.+.+.....-.||+.++.+.+.=.....+... 
T Consensus 117 ~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~e-vih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~ 195 (386) T protein:vir:49 117 KWEYLRPSQVSFNRLDNQNGLYYNITFDDPHIAPKQHVPQND-ILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTI 195 (386) T ss_pred EEEEecCceeEEEEcCCCceEEEEEEEcCccccceeEEcccc-EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHH Confidence 4566655554322 2222232222 2233333333344 4555455555555677999999998887777888888 Q ss_pred HHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHH Q lcl|NC_011057. 236 NASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQ 315 (634) Q Consensus 236 na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Eh 315 (634) +..+.-.+..|||.+|+.++-. ....+.+...+ . +..+--++++ + . T Consensus 196 ~~~~ng~~~~~il~~~~~~~~~---------------------~~~~~~~~~~~-----~----~~n~g~~~vl--~--~ 241 (386) T protein:vir:49 196 SALKNALNANGILKIKGGGLLD---------------------FKTKVSRSRQA-----M----KQMQGGPLVL--D--D 241 (386) T ss_pred HHHHccCCccEEEEeCCCCChH---------------------HHHHHHHHHHH-----h----ccCCCCceec--C--C Confidence 8888888999999998765431 11222222211 1 1122233444 2 2 Q ss_pred hcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHH Q lcl|NC_011057. 316 IKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILR 395 (634) Q Consensus 316 i~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr 395 (634) ..+++.|.+... +.--+++|+..+..||..|-||| .+||+ +..|+.++-++ ++-.+..|.|.++.|++.|.+.++. 
T Consensus 242 g~~~~~l~~~~~-d~~~~e~~~~~~~~Ia~~fgVPp-~~lg~-~~~~~~~~~~~-~~~~~~~i~~~l~~i~~~~~~~l~~ 317 (386) T protein:vir:49 242 LEDFTPLEIKSN-VAQLLSQADWTTGQFAKVYGIPE-SIVGG-DGDQQSSLEMI-YNIYFKSVSRYLRPFVSEMSKKLSC 317 (386) T ss_pred CceEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCH-HHhCC-CCCccchHHHH-HHHHHHHHHHHHHHHHHHHHHHhcc Confidence 335666654322 22347899999999999999999 77786 56666555555 4556777999999999999888764 Q ss_pred HHHHhcCCChhHheeeecCccccc-CC-CchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCc Q lcl|NC_011057. 396 VTLAREGIDPSKYVVWYDASQLTI-DP-DKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDP 473 (634) Q Consensus 396 ~~L~~eG~d~~~yV~w~DaS~L~~-~p-d~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp 473 (634) + +-||...+.. |+ ++......+++.|++|..+.|.++|-. ++.. .| T Consensus 318 ~-------------~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~---~~~~--~~-------------- 365 (386) T protein:vir:49 318 E-------------VDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQA---EILP--KE-------------- 365 (386) T ss_pred h-------------hcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhC---CCCC--Cc-------------- Confidence 3 2244444332 21 122333458899999999999998631 1211 00 Q ss_pred ccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccc Q lcl|NC_011057. 474 TLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDD 510 (634) Q Consensus 474 ~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d 510 (634) + | ..+....| ...+|+. 
++++ T Consensus 366 -~-----~----~~~~~~~~-----~~~gGd~-~~~~ 386 (386) T protein:vir:49 366 -L-----P----DGKNPNRT-----SLKGGEI-NEQD 386 (386) T ss_pred -C-----c----chhccCCC-----CCCCCCC-CCCC Confidence 0 0 00000011 1112222 2222 No 80 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=99.24 E-value=4.7e-12 Score=82.69 Aligned_cols=368 Identities=12% Similarity=0.109 Sum_probs=199.7 Q ss_pred ceeEe-ccCCCCc--cchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeEEEe Q lcl|NC_011057. 7 LRLVR-RPKGGRP--APSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRLVAS 83 (634) Q Consensus 7 lr~vr-rp~g~~~--a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~as 83 (634) ..+.. ++++... ..+.++. .+.++.....+ .+...-....|++ .+=+.=.|.-+++.+|.+.+... T Consensus 1 Mglf~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~---~~~~~v~~~~al~----~~~V~~~i~~Ia~~ia~l~~~~~ 69 (384) T protein:vir:49 1 MPIFNITNLATESPPSNQDSFF----DITDPEFLDAL---NGSEWVSAETALK----NSDLFSIISQLSNDLATAKITTS 69 (384) T ss_pred CccccccccCcccccccchhhc----cccchhhcccc---cCCceechhhhhc----cHHHHHHHHHHHHHHhhCceeee Confidence 34433 3333221 1112221 23333322111 1111111122222 23344556667899999988765 Q ss_pred eecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhc Q lcl|NC_011057. 84 ELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEW 163 (634) Q Consensus 84 eiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W 163 (634) +-+.+ . ...=+...+...++++.++.+|-+-|++|+.+. |.. +|. 
...+ T Consensus 70 ~~~~~----------------~----l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~-r~~-------~g~---~~~L 118 (384) T protein:vir:49 70 RKQLQ----------------G----IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRW-RNE-------NGR---DMKW 118 (384) T ss_pred cchhh----------------h----hhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEE-ECC-------CCc---EEEE Confidence 42211 1 111244456678999999999999999999863 422 231 2346 Q ss_pred eeccHHHHhccC--CCcceeeEe--CC---CCcccccCCCCeE-EEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_011057. 164 YAVSKEEIKKSN--KGSGTNIVL--PT---GEEHEFVKGTDII-FRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIA 235 (634) Q Consensus 164 ~~vt~~Ei~~~~--~~~~~~i~l--P~---g~~h~~~~~~D~~-~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 235 (634) +.+....++... .+....+.. .+ |....| +..|++ ||. +.+.....--||+.++.+.+.-.....+... T Consensus 119 ~~l~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~-~~~eVih~~~--~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~ 195 (384) T protein:vir:49 119 EYLRPSQVSFNRLDNQNGLYYNITFDDPRIPPKQHV-PQGDILHFRL--LSVDGGLTSVSPLMALGRELNIQKASDKLTL 195 (384) T ss_pred EEEcCceeEEEEcCCCceEEEEEEecCccccceeEe-cCccEEEecC--CCCCCceeeccHHHHHHHHHHHHHHHHHHHH Confidence 666655554321 122222222 22 222233 344544 554 3444444567999999988887777788888 Q ss_pred HHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHH Q lcl|NC_011057. 236 NASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQ 315 (634) Q Consensus 236 na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Eh 315 (634) +..+.-..-.|||-+|+.++... ..++..+. ....+..+ . |+++ + . 
T Consensus 196 ~~~~ng~~~~~il~~~~~~~~~~--------------------~~~~~~~~-----~~~~~n~~---~--~~vl--~--~ 241 (384) T protein:vir:49 196 NALKNALNANGILKIKGGGLLDF--------------------KTKQSRSR-----QAMKQMQG---G--PLVL--D--D 241 (384) T ss_pred HHHhccCCCceEEEeCCCCChHH--------------------HHHHHHHH-----HhcccCCc---c--ceec--C--C Confidence 77777777778888876544310 11111111 11112222 2 3333 2 2 Q ss_pred hcccceeecCCchhHHH-HHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHH Q lcl|NC_011057. 316 IKDVKHIRFDNEITEVA-IKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQIL 394 (634) Q Consensus 316 i~~ikHl~f~~d~te~a-iktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~l 394 (634) .-+++.+... ..+.. +++|+..+..||..|-||| .+||. ...|+.+.-.+ ++..+.+|.|.++-|++.|.+.+. T Consensus 242 g~~~~~l~~~--~~d~q~~e~~~~~~~~Ia~~fgVp~-~~lg~-~~~~~~~~~~~-~~~~~~~i~~~l~pi~~~i~~~l~ 316 (384) T protein:vir:49 242 LEDFTPLEIK--SNVAQLLSQADWTTGQFAKVYGIPE-SVVGG-EGDKQSSLEMI-YNIYFKAVSRFLRPFVSELSKKLS 316 (384) T ss_pred CceEEEccCC--hhhHHHHHHHHHHHHHHHHHhCCCH-HHhCC-CCCccccHHHH-HHHHHHHHHHHHHHHHHHHHHHhc Confidence 3345555443 33333 7899999999999999999 56775 45565555554 344555677777777777777776 Q ss_pred HHHHHhc--CCChhHheeeecCcccccCC-CchHHHHH-HHHccCCCHHHHHHHhCCCccccCCCCCHHHH Q lcl|NC_011057. 395 RVTLARE--GIDPSKYVVWYDASQLTIDP-DKSDEAKF-AYENGAINGEALRKYLGLGDDAGYDFTTREGW 461 (634) Q Consensus 395 r~~L~~e--G~d~~~yV~w~DaS~L~~~p-d~t~eA~~-~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~ 461 (634) ++..-.. -+++..|-++||.+.|..-. 
-.-.|+.+ +.+.|.++ .++|++.|+.--.|= ++-|.| T Consensus 317 ~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~-ne~r~~~~~~p~~gG--d~~~~~ 384 (384) T protein:vir:49 317 CEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP-KDLPEGETDSTLKGG--ETNEQY 384 (384) T ss_pred hhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC-hhHHHHcCCCCCCCC--CCCCCC Confidence 6542111 22455667788888775442 22344444 55568777 889999998432222 233434 No 81 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=99.24 E-value=3e-11 Score=78.31 Aligned_cols=372 Identities=11% Similarity=0.062 Sum_probs=185.2 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.==..| ..|-+ +. +-+.|... +.. .+.+ .|-...-+.-.+.-+++.||++.+ T Consensus 1 Mg~f~~~--f~~~~----~~--------~~~~~~~~-~~~---~~~~---------~a~~~~~v~~~i~~ia~~ia~~p~ 53 (385) T protein:vir:95 1 MGLFDSV--FKRHS----EL--------SWMYDLEF-LQD---KSKK---------AYLKQIALNTVVEMVARTISQSEF 53 (385) T ss_pred Cchhhhh--hccCc----cc--------ccccchhh-hhc---cchh---------hhhhhHHHHHHHHHHHHHHcccce Confidence 4321111 11100 00 11111111 100 1111 111223445567889999999988 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+= |. .+. .-+..+...-....+-..++++.++.+|-+-|++||++ .|.++ .... 
T Consensus 54 ~~~~~----~~---~~~------~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~-~~~~~---------~~~~ 110 (385) T protein:vir:95 54 RVMKN----NT---KEK------GTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVK-NDEGH---------FFVA 110 (385) T ss_pred eeeec----Cc---ccc------chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEE-ecCCC---------eeec Confidence 77541 11 111 12334444446678888999999999999999999864 33221 1122 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKS 240 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 240 (634) ..|...+...+.... -..+...++......+. +-||++=.+.+.-...-.||+..+...+.-. ... T Consensus 111 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~-~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~----------~~~ 176 (385) T protein:vir:95 111 DDFEKEDELGLYSHR---FTNVLVNDFEFKRVFTM-DDVIYLKYNNQKLDAFSLGLFEDYGEIFGRM----------IDL 176 (385) T ss_pred ccccccccccccccc---ceeeeecccceeeeecc-ccEEEecCCCCCcccccchHHHHHHHHHHHH----------HHH Confidence 223333332222111 11122222222111222 3345554455544455667776665443221 111 Q ss_pred HhhhC---ceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhc Q lcl|NC_011057. 241 RLIGN---GVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIK 317 (634) Q Consensus 241 RL~gn---GvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~ 317 (634) .--+| |+|-+|....+ ..-+.+.+++.+-+--..... +--++++ + +... 
T Consensus 177 ~~~~~~~~g~l~~~~~~~~-------------------~~e~~~~~~~~~~~~~~g~~~------~~~~i~~-l--~~g~ 228 (385) T protein:vir:95 177 QMLNNQIRGILKVDATKFY-------------------NKEKQKELQAYIDTLFDAFQN------NTIAVVP-L--TEGL 228 (385) T ss_pred HHhcCCCceEEEeCCccCC-------------------CHHHHHHHHHHHHHHhhhhhh------cCCceEE-c--CCCc Confidence 11233 33333322111 112334444444322111111 1112222 1 2344 Q ss_pred ccceeecCC-----chhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHH Q lcl|NC_011057. 318 DVKHIRFDN-----EITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQ 392 (634) Q Consensus 318 ~ikHl~f~~-----d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~ 392 (634) +++-|++.. .-+.--+++|+.....||..|.|||..| | +|..++.+....-++..|.|.+..|+++|++. T Consensus 229 ~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l-~----~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~ 303 (385) T protein:vir:95 229 AYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLV-L----GEMADLEKTIESYLQFCINPLLRKIEAELNSK 303 (385) T ss_pred eeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHh-c----CCCcCHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 555554432 2234468899999999999999999655 3 34567788888889999999999999999999 Q ss_pred HHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccc--cCCCCCHHHHHHHHHH Q lcl|NC_011057. 393 ILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDA--GYDFTTREGWVMWAQD 467 (634) Q Consensus 393 ~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~--~yd~~t~Eg~r~wA~d 467 (634) +|.+.- . ..|-|.||.+.|. +.|..+.+ ..+++.|++|.+++|.++|+.--. +-| + T Consensus 304 L~~~~~---~---~~~~~~fd~~~l~-~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd----~-------- 364 (385) T protein:vir:95 304 FFYQDE---Y---LNDDMHIKVVGID-KRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELD----K-------- 364 (385) T ss_pred cCChhh---c---ccceEEEechhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCc----e-------- Confidence 987622 1 2334678998874 33443333 348889999999999999996311 111 0 Q ss_pred HhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCc Q lcl|NC_011057. 
468 AVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTS 508 (634) Q Consensus 468 ~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~ 508 (634) -+++ + -++.+ -.+ .+|+.+++ T Consensus 365 ------~~~~----~---n~~~~------~~~-kgge~~~e 385 (385) T protein:vir:95 365 ------FIIT----K---NLQSA------DAF-KGGESNEE 385 (385) T ss_pred ------eeec----c---cceec------ccc-cCCCCCCC Confidence 0000 0 00000 011 11111111 No 82 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=99.22 E-value=5e-11 Score=77.04 Aligned_cols=374 Identities=13% Similarity=0.099 Sum_probs=184.3 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |-==..++=.+|.+ ..+|.. +-..|+.+.= .|.+ .=++-.|.-+++++|.+.+ T Consensus 1 Mg~f~~~~~~~~~~---------------~~~~~~----------~~~~~~~~~~-~~~~-~~v~~~v~~IA~~iA~lp~ 53 (378) T protein:vir:94 1 MNLFGKVVSFSRGK---------------LNNDTQ----------RVTAWQNEAV-EYTS-AFVTNIHNKIANEITKVEF 53 (378) T ss_pred CCccccchhccccc---------------ccCCcc----------eeeeeccchh-HHHH-HHHHHHHHHHHhhhhhCce Confidence 22111111000000 001110 1122443321 1111 1133457889999999887 Q ss_pred EEeeecccCCCCCCCCCC-CCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISE-DNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRT 159 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~e-d~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~ 159 (634) ..=..+...| +.++ ......-+..+..--+.-.+...++++.++.+|-.-|+.||.++.+.. .| +. 
T Consensus 54 ~~~~~~~~~~----~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~-------~g--~~ 120 (378) T protein:vir:94 54 NHVKYKKSDV----GSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDN-------TG--EL 120 (378) T ss_pred eeEEEcccCc----ccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCC-------Cc--eE Confidence 5444443311 1111 111123344455555777888999999999999999999998654311 11 11 Q ss_pred chhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHH Q lcl|NC_011057. 160 RQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASK 239 (634) Q Consensus 160 ~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~ 239 (634) -...|.+.+.+|.. |=+||+=+| -.-..--||...+...+. ++.+ T Consensus 121 ---------------------~~l~p~~~~~~~~~--~diiH~~~~--~~~~~g~s~l~~~~~~i~----------~~~~ 165 (378) T protein:vir:94 121 ---------------------LDLLFADDKKEYKP--EELVRLTSP--FYINEDTSILDNALASIQ----------TKLE 165 (378) T ss_pred ---------------------EEEEecCCeeEeee--eeeEEecCc--CCccchhHHHHHHHHHHH----------HHHh Confidence 11223333334432 234555333 233345566555554432 2222 Q ss_pred hHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_011057. 240 SRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDV 319 (634) Q Consensus 240 SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~i 319 (634) +-- -+|+|-+|..++-. +..++.+.+-+--+......+ +.. ++++ +...++ T Consensus 166 ~~~-~~gil~~~~~l~~~---------------------~~~~~~~~~~~~~~~~~~~~~-~g~--~~vl----~~g~~~ 216 (378) T protein:vir:94 166 QGK-LRGLLKINAFLDID---------------------NTQEYREKALTTIKNMQEGSS-YNG--LTPV----DNKTEI 216 (378) T ss_pred ccc-ccceeeeCCcCCHH---------------------HHHHHHHHHHHHHHHhhcccc-ccc--ceec----CCCceE Confidence 211 24777776654321 222233333221112122111 112 2222 234567 Q ss_pred ceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 
320 KHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLA 399 (634) Q Consensus 320 kHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~ 399 (634) +.|.+...... +..++.....||..|.|||..|-| +.+ -+....-++..|.|.+..|+++|++.+|.+-=. T Consensus 217 ~~l~~~~~~~~--~~~~~~~~~~Ia~~fgVP~~~l~~--~~s-----e~~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~er 287 (378) T protein:vir:94 217 VELKKDYSVLN--KDEIDLIKSELLTGYFMNENILLG--TAS-----QEQQIYFYNSTIIPLLIQLEKELTYKLISTNRR 287 (378) T ss_pred EEccCChhhhh--HHHHHHHHHHHHHHhCCCHHHhcC--ChH-----HHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHh Confidence 77766554444 466778889999999999965533 222 234445677899999999999999999976433 Q ss_pred hcCCChhHhe-eeecCcccccCC--CchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccc Q lcl|NC_011057. 400 REGIDPSKYV-VWYDASQLTIDP--DKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLI 476 (634) Q Consensus 400 ~eG~d~~~yV-~w~DaS~L~~~p--d~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li 476 (634) ..|....-|+ +.||.+.|..-. ++.+....++..|++|.+++|+++|++...+-| +- .+..| T Consensus 288 ~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD----~~-------~~~~n---- 352 (378) T protein:vir:94 288 RVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD----VY-------IANLN---- 352 (378) T ss_pred hhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----ee-------eeccc---- Confidence 3444333333 568888886531 223333568899999999999999997544332 00 00000 Q ss_pred hhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCC Q lcl|NC_011057. 477 PMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTE 523 (634) Q Consensus 477 ~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe 523 (634) +.|+-+ +. ..+.. 
......+.|++-| T Consensus 353 --~~~~~~--------~~--------~~~~~---~~~~~~~~e~~n~ 378 (378) T protein:vir:94 353 --AVAVKN--------LS--------DLQGS---RKDVTSTDETNNQ 378 (378) T ss_pred --cccccc--------ch--------hhcCC---cCCCCCCCCCCCC Confidence 111111 10 00000 0000111111111 No 83 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=99.19 E-value=6.3e-11 Score=76.51 Aligned_cols=369 Identities=13% Similarity=0.124 Sum_probs=184.8 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.==+ |+.+|-+.. .-+.|.. .+.. .+... ++ ...-+.-.+.-++++|+++.+ T Consensus 1 Mg~f~--~l~~~~~~~------------~~~~~~~-~~~~---~~~~~--------~l-~~~~v~~~i~~Ia~~ia~~p~ 53 (376) T protein:vir:78 1 MGFFS--ELFKRNKEI------------EWMWDLD-FLED---KTTKV--------YL-KKMALNTCVKHIARTIAKSDF 53 (376) T ss_pred Cchhh--hhhccCCcc------------ccccchh-hccc---cchhh--------hh-hhHHHHHHHHHHHHhhcccce Confidence 43211 222221100 0000000 0000 00000 01 122344456778999999998 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+ . +...+ ..+..+...-+..-+-..++++.++.+|..-|++|+++ .|.++ | ... 
T Consensus 54 ~~~~--~-------~~~~~----~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~-~r~~~-------~--~~~ 110 (376) T protein:vir:78 54 RLKN--G-------ETSVR----DKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVL-SDTDD-------F--LIA 110 (376) T ss_pred eecc--c-------ccccc----chHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEE-EeCCC-------e--eec Confidence 7642 1 11111 22444455557778899999999999999999999875 44332 2 122 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKS 240 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 240 (634) +.|..-. ..+... ....+.+.++......+..|++.--+ -.+|....+.++.+... ..+.++.++ T Consensus 111 ~~~~~~~-~~~~~~---~~~~~~~~~~~~~~~~~~~evih~~~---------~~~~~~~~~~~~~~~~~--~~~~~~~~~ 175 (376) T protein:vir:78 111 DSYVRKE-FAFFPD---VFEGVTVKDYRYNRNFSMDDVIFLEY---------GNERLSAFTDGMFEDYG--ELFGKMIRA 175 (376) T ss_pred cceeecc-cceeee---eeeeeeeecceeeeeeccccEEEecc---------CCCCchhhhhHHHHHHH--HHHHHHHHH Confidence 3343322 111100 01123333332222223344432212 23456666666655443 335555555 Q ss_pred HhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccc Q lcl|NC_011057. 241 RLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVK 320 (634) Q Consensus 241 RL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ik 320 (634) ...+||+-. .+-+...+ ....-+.+.+++.+-+.-... 
.++.-+++| ++ ...+++ T Consensus 176 ~~~~~~~~~---~~~~~~~~-------------~~~~e~~~~~~~~~~~~~~g~---~~~~~~v~~----l~--~g~~~~ 230 (376) T protein:vir:78 176 QMRNFQIRG---AVNFKMAG-------------VADKDKQTKLQEYIDKVYASF---NNNEIAIVP----QL--EGFNYE 230 (376) T ss_pred HHhcCCCce---eEEEccCC-------------CCCHHHHHHHHHHHHHHhccc---cccCcceEE----cC--CCceEE Confidence 555665411 01110000 011225566666655322211 112222222 22 445566 Q ss_pred eeecCC---chhHH-HHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHH Q lcl|NC_011057. 321 HIRFDN---EITEV-AIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRV 396 (634) Q Consensus 321 Hl~f~~---d~te~-aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~ 396 (634) -+.+.. +.++. -+++|+..+..||..|.|||. ||| |+ ..+.-+....-++..|.|.+..|+++|++.+|.+ T Consensus 231 ~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~-~l~-~~---~s~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~ 305 (376) T protein:vir:78 231 EFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSS-LLH-GD---MADLSNNMKAYMEYCIDPLTKKLEDELNAKLFTF 305 (376) T ss_pred eeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHH-HhC-CC---CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhhCCc Confidence 665544 22332 478999999999999999995 556 33 3445666777788899999999999999998754 Q ss_pred HHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCc Q lcl|NC_011057. 397 TLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDP 473 (634) Q Consensus 397 ~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp 473 (634) .+|.+=||...|. +.|..+.+ ..+++.|++|.+++|+++|++--.+... T Consensus 306 ---------~~~~~~~~~~~ll-~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~------------------ 357 (376) T protein:vir:78 306 ---------SEFLAGEHIKIIH-KKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPEL------------------ 357 (376) T ss_pred ---------ccceecccchhhc-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC------------------ Confidence 4455556655553 33444333 4488999999999999999964221110 Q ss_pred ccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCC Q lcl|NC_011057. 
474 TLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDT 507 (634) Q Consensus 474 ~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~ 507 (634) +..=.|.--..+.. ++++. T Consensus 358 --------------d~~~~~~n~~~~~~-~~e~g 376 (376) T protein:vir:78 358 --------------DKYLITKNYQSADE-GGEDG 376 (376) T ss_pred --------------ceeeeccCceehhc-cccCC Confidence 00000100001111 11111 No 84 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=99.18 E-value=1.8e-10 Score=74.02 Aligned_cols=374 Identities=13% Similarity=0.107 Sum_probs=180.5 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |. +.++-++ -+ ++...+....-..||.++=. |.+ .=+.-.|.-++++||.+.| T Consensus 1 Mg------~f~~~~~-----------~~--------~~~~~~~~~~~~~~~~~~~~-~~~-~~v~~~i~~Ia~~iA~l~~ 53 (378) T protein:vir:16 1 MN------LFGKVVS-----------FS--------RGKLNNDTQRVTAWQNEAVE-YTS-AFVTNIHNKIANEITKVEF 53 (378) T ss_pred Cc------cchhhhh-----------hh--------cccccCCcceeeecccchhh-HHH-HHHHHHHHHHHhhhhhCce Confidence 11 1111110 00 00000111122356655422 211 1133346779999999998 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) -.-+.....+. ..... .....-+..+.+-=..-.+...++.+.++.+|..-|+.||.+.- .+. 
.| T Consensus 54 ~~~~~~~~~~~-~~~~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~-d~~------~g----- 118 (378) T protein:vir:16 54 NHVKYKKSDVG-SDTLI--SMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVF-DDN------TG----- 118 (378) T ss_pred eEEEEcccccc-ccccc--ccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEe-ecC------Cc----- Confidence 55444433111 11110 11123344444444667888999999999999999999998642 111 01 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKS 240 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 240 (634) +-| ...|.+...+|.. .| +|++=+| -.-..--||...++..+ .++.++ T Consensus 119 ~~~------------------~l~~~~~~~~~~~-~d-iih~r~~--~~~~~~~s~l~~~~~~i----------~~~~~~ 166 (378) T protein:vir:16 119 ELL------------------DLLFADDKKEYKP-EE-LVRLTSP--FYINEDTSILDNALASI----------QTKLEQ 166 (378) T ss_pred eEE------------------EEEecCCeeEecc-cc-eEEecCc--cCccchhHHHHHHHHHH----------HHHHhc Confidence 112 1233333334432 22 3343122 22233445555554433 233222 Q ss_pred HhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccc Q lcl|NC_011057. 241 RLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVK 320 (634) Q Consensus 241 RL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ik 320 (634) -. -+|+|-.|..++- ...+++.+.+-+--+....-.+ +.. ++++ +..-+++ T Consensus 167 ~~-~~g~l~~~~~l~~---------------------~~~~~~~~~~~~~~~~~~~~~~-~g~--~~vl----~~g~~~~ 217 (378) T protein:vir:16 167 GK-LRGLLKINAFLDI---------------------DNTQEYREKALTTIKNMQEGSS-YNG--LTPV----DNKTEIV 217 (378) T ss_pred Cc-cceeeEeCCcCCH---------------------HHHHHHHHHHHHHHHHhhcccc-ccc--ceEc----CCCceEE Confidence 21 2366655544331 1223333333322222222111 112 2333 2334566 Q ss_pred eeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011057. 
321 HIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAR 400 (634) Q Consensus 321 Hl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~ 400 (634) .|........ +..++.....||..|.|||..|-| +.. -+....-++..|.|.+..|+++|++.+|.+.-.. T Consensus 218 ~l~~~~~~~~--~~~~~~~~~~Ia~~fgVPp~~l~g--~~~-----e~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~e~~ 288 (378) T protein:vir:16 218 ELKKDYSVLN--KDEIDLIKSELLTGYFMNENILLG--TAS-----QEQQIYFYNSTIIPLLIQLEKELTYKLISTNRRR 288 (378) T ss_pred EccCChhhhh--HHHHHHHHHHHHHHhCCCHHHhcC--Cch-----HHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhh Confidence 6655544333 455667789999999999966533 222 2445555678899999999999999999764434 Q ss_pred cCCChhHhe-eeecCcccccCC--CchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccch Q lcl|NC_011057. 401 EGIDPSKYV-VWYDASQLTIDP--DKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIP 477 (634) Q Consensus 401 eG~d~~~yV-~w~DaS~L~~~p--d~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~ 477 (634) .|.....|. +.||.+.|..-. ++.+....++..|++|.+++|+++|+.--.+-| + -++| T Consensus 289 ~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD----~--------------~~~~ 350 (378) T protein:vir:16 289 VVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD----V--------------YIAN 350 (378) T ss_pred hhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----e--------------Eeec Confidence 454444344 568888876542 222333568999999999999999997533222 0 0011 Q ss_pred -hhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCC Q lcl|NC_011057. 478 -MLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHE 519 (634) Q Consensus 478 -~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~e 519 (634) .+.|+-+ ..... 
+ ...+..+++++..| T Consensus 351 ~n~~~~~~--------~~~~~---~----~~~~~~~~~e~~ne 378 (378) T protein:vir:16 351 LNAVAVKN--------LSDLQ---G----SRKDVTSTDETNNQ 378 (378) T ss_pred cccccccc--------hhhhc---C----ccCCCCCCCCCCCC Confidence 0111110 00000 0 00000011111111 No 85 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=99.16 E-value=1.4e-10 Score=74.54 Aligned_cols=372 Identities=13% Similarity=0.125 Sum_probs=183.5 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhh---hcccCccccccHHHHHHHhhhhhHHHHHhhhhhceee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSK---STGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSR 77 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~---~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr 77 (634) |. +.+.-+ .|+. .+....-..||.+.=. |.+ .=+.-.+.-+++.+|. T Consensus 1 Mg------~f~~~~----------------------~f~~~~~~~~~~~~~~~~~~~~~-~~~-~~v~~~i~~Ia~~iA~ 50 (378) T protein:vir:93 1 MN------LFGKVV----------------------SFSRGKLNNDTQRVTAWQNEAVE-YTS-AFVTNIHNKIANEITK 50 (378) T ss_pred Cc------cchhhh----------------------hhhccccCCCcceeeecccchhH-HHH-HHHHHHHHHHHhhhhh Confidence 11 111111 1110 0111122346654321 111 1122335779999999 Q ss_pred eeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCccccc Q lcl|NC_011057. 78 CRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSV 157 (634) Q Consensus 78 ~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~ 157 (634) +.+-.=.-+.+ |....... .....-+..+++.-..--+...++++.++.+|..-|++||.+... 
.+ .| T Consensus 51 lp~~~~~~~~~-~~~~~~~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~-~~------~g-- 118 (378) T protein:vir:93 51 VEFNHVKYKKS-DVGSDTLI--SMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFD-DN------TG-- 118 (378) T ss_pred CceeeEEEccc-cccccccc--ccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEee-cC------Cc-- Confidence 99854333322 11111111 112233445555556778889999999999999999999876431 11 11 Q ss_pred ccchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHH Q lcl|NC_011057. 158 RTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANA 237 (634) Q Consensus 158 ~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na 237 (634) +-|+. .|.+...+|. .+=+|++-+| -.-...-||...++..+ .++ T Consensus 119 ---~~~~l------------------~~~~~~~~~~--~~diih~r~~--~~~~~~~s~l~~~~~~i----------~~~ 163 (378) T protein:vir:93 119 ---ELLDL------------------LFADDKKEYK--TEELVRLTSP--FYINEDTSILDNALASI----------QTK 163 (378) T ss_pred ---eEEEE------------------EecCCeeEec--cceeEEecCc--cccchhhHHHHHHHHHH----------HHH Confidence 12222 2222222332 2335566554 23333455555444333 222 Q ss_pred HHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhc Q lcl|NC_011057. 238 SKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIK 317 (634) Q Consensus 238 ~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~ 317 (634) .++-- =+|+|=+|..++- .+.+.+.+-+.+--+......+ ... ++++ +... T Consensus 164 ~~~~~-~~g~l~~~~~l~~---------------------~~~~~~~~~~~~~~~~~~~~~~-~~~--~~~l----~~g~ 214 (378) T protein:vir:93 164 LEQGK-LRGLLKINAFLDI---------------------DNTQEYREKALTTIKNMQEGSS-YNG--LTPV----DNKT 214 (378) T ss_pred HhcCc-ccceeeeCCcCCH---------------------HHHHHHHHHHHHHHHHhhcccc-ccc--ceEc----CCCc Confidence 22111 1366655544321 1223333333322222222211 112 2222 2445 Q ss_pred ccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 
318 DVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVT 397 (634) Q Consensus 318 ~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~ 397 (634) +++.|.+...... +..++...+.||..|.|||..|-| +.+ -+....-++..|.|.+..|+++|++.+|.+. T Consensus 215 ~~~~l~~~~~~~~--~~~~~~~~~~Ia~~fgVPp~~l~g--~~~-----e~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~ 285 (378) T protein:vir:93 215 EIVELKKDYSVLN--KDEIDLIKSELLTGYFMNENILLG--TAT-----QEQQIYFYNSTIIPLLIQLEKELTYKLISTN 285 (378) T ss_pred eEEEccCChhhhh--HHHHHHHHHHHHHHhCCCHHHhcC--CcH-----HHHHHHHHHHHHHHHHHHHHHHHHhhcCChh Confidence 6676666554443 466678889999999999966543 222 2444555678899999999999999999775 Q ss_pred HHhcCCChhHhe-eeecCcccccCC--CchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcc Q lcl|NC_011057. 398 LAREGIDPSKYV-VWYDASQLTIDP--DKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPT 474 (634) Q Consensus 398 L~~eG~d~~~yV-~w~DaS~L~~~p--d~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~ 474 (634) =...|.....|+ +.||.+.|..-. ++.+....++..|++|.++.|+++|+..-.|-| +- .+..| T Consensus 286 er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD----~~-------~~~~n-- 352 (378) T protein:vir:93 286 RRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD----VY-------IANLN-- 352 (378) T ss_pred HhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----ee-------eeccc-- Confidence 444455433333 578888886542 223333569999999999999999997544322 00 00000 Q ss_pred cchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCC Q lcl|NC_011057. 475 LIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQD 527 (634) Q Consensus 475 Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~ 527 (634) +.|+-+ +. ..+. .+.+..|+-|++.. 
T Consensus 353 ----~~~~~~--------~~--------~~~~-------~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 353 ----AVAVKN--------LS--------DLQG-------SRKDVTSTDETNNQ 378 (378) T ss_pred ----cccccc--------hh--------hhcC-------ccCCCCCCCCCCCC Confidence 111111 00 0000 01111111111111 No 86 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=99.06 E-value=3e-10 Score=72.77 Aligned_cols=374 Identities=12% Similarity=0.090 Sum_probs=194.8 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |. +.++=++..+..+.. ...+.++...... .+...-.-..++ ..+-+.-.+.-++++||.+.+ T Consensus 1 Mg------~f~~~~~~~~~~~~~----~~~~~~~~~~~~~---~~~~~v~~~~~l----~~~~v~~~i~~ia~~ia~~~~ 63 (382) T protein:vir:48 1 MP------IFNLATESPPDNQGG----FFDVVDSDFLASL---KGNEWVSAETAL----RNSDLFSIINQLSNDLATVKL 63 (382) T ss_pred Cc------cccccccCCcccccc----cccchhhhccccc---cCCcccchHhhh----ccHHHHHHHHHHHHhhccCce Confidence 33 222222222111111 1111111111000 000000001111 234455567778999999988 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) -..+-+.+ . ...-+.--+...++++.++..|-+-|+.|+.+ .|.. +|. . 
T Consensus 64 ~~~~~~~~----------------~----L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i-~rd~-------~G~---~ 112 (382) T protein:vir:48 64 ITSRKKLQ----------------G----IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYR-WRNE-------NGR---D 112 (382) T ss_pred eeecchhh----------------h----hhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEE-EECC-------CCc---E Confidence 76543322 0 12224455678899999999999999999986 3422 232 2 Q ss_pred hhceeccHHHHhcc--CCCcceeeEeC--C---CCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKS--NKGSGTNIVLP--T---GEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKT 233 (634) Q Consensus 161 ~~W~~vt~~Ei~~~--~~~~~~~i~lP--~---g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~ 233 (634) ..++.|..+.+... ..++..-+... + |....| +..|+ |++=.+++.-...-.||+.++...+.-.....+. T Consensus 113 ~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~-~~~ev-ih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~ 190 (382) T protein:vir:48 113 MKWEYLRPSQVSFNRLDNKDGIYYNITFDDPRIPPKQHV-PQNDV-LHFRLLSVDGGMTSVSPLMALSRELDIQKASGNL 190 (382) T ss_pred EEEEEEcCceeEEEEcCCCCeEEEEEEecCccccceeEE-cCccE-EEecCCCCCCccccccHHHHHHHHHHHHHHHHHH Confidence 45777766665432 22233334433 2 222233 33454 4443556655567789999999988777777777 Q ss_pred HHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeech Q lcl|NC_011057. 234 IANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPG 313 (634) Q Consensus 234 I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~ 313 (634) ..+..+.-..-.|||-+|+.++-+ ...++.+.+.+ ..+..+ =++|+. T Consensus 191 ~~~~~~ng~~p~~il~~~~~~~~e---------------------~~~~~~~~~~~----~~~n~g-----~~~vl~--- 237 (382) T protein:vir:48 191 TINSLKNALNANGILKIKGGGLLD---------------------FKTKLSRSRQA----MKQMQG-----GPLVLD--- 237 (382) T ss_pred HHHHHhccCCCceEEEeCCCCChH---------------------HHHHHHHHHHh----hccCCC-----CeeEcC--- Confidence 777777777778899887654321 22233333321 112222 233332 Q ss_pred HHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHH Q lcl|NC_011057. 
314 EQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQI 393 (634) Q Consensus 314 Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~ 393 (634) ..-+++.|.+... +.--+++|+..+..||..|.||| .+||. ++.|..+ -+-...-++..|.|.+..|++.|++.+ T Consensus 238 -~g~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~afgVp~-~~lg~-~~~~~~~-~~~~~~~~~~~l~p~~~~i~~~l~~~l 312 (382) T protein:vir:48 238 -DLEDFTPLEIKSN-VSQLLKQADWTTGQFAKVYGIPD-NVVGG-QGDQQSS-LEMSSDLYSKAVSRYLRPFLSELSQKL 312 (382) T ss_pred -CCceEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCH-HHhCC-CCCcccH-HHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 2235555554432 22237899999999999999999 77785 4555433 344566788899999999999999987 Q ss_pred HHHHHHhcCCChhHheeeecCcccccCCCch-HHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcC Q lcl|NC_011057. 394 LRVTLAREGIDPSKYVVWYDASQLTIDPDKS-DEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKD 472 (634) Q Consensus 394 lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t-~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~d 472 (634) +... +++- ...+| .+.... .....+++.|++|..+.|..++- .|+- +.|..+. .+ T Consensus 313 ~~~~----~~~~---~~~~~-----~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~---~g~~--~~~~~~~-------~~ 368 (382) T protein:vir:48 313 SCDV----DADI---FPAVD-----PTGSNYISRINSLVKTGTLAQNQGLYILQQ---AEIL--PKELPNG-------EN 368 (382) T ss_pred cChh----hhhh---hhhhc-----cchhHHHHHHHHHhhcCccCHHHHHHHHhh---CCCC--Ccchhhh-------hc Confidence 6542 2110 00111 111111 11234778899999999888742 1222 1111110 00 Q ss_pred cccchhhhhhhhhhhhcccCCCCCC Q lcl|NC_011057. 473 PTLIPMLAPLIAGVLQQIEFPQQQQ 497 (634) Q Consensus 473 p~Li~~laPll~p~~q~~~~P~p~~ 497 (634) + . 
|.+++=+- ..+- T Consensus 369 --~----~----~~~~GGd~-~~~~ 382 (382) T protein:vir:48 369 --P----N----STLKGGEE-DGQD 382 (382) T ss_pred --C----C----CCCCCCCC-CCCC Confidence 0 0 11111110 0011 No 87 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=99.05 E-value=2.2e-09 Score=68.04 Aligned_cols=379 Identities=11% Similarity=0.072 Sum_probs=175.6 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHh--------hhhhHHHHHhhhh Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVD--------LVGELRYYVGWRA 72 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd--------~VgELryyvgWr~ 72 (634) |.=-+-+ . ..|.+........+|. .|.... ...-+.-.|.-++ T Consensus 1 Mg~~~~~--~-------------------------~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~~v~~~v~~Ia 51 (395) T protein:vir:40 1 MGFKSWV--S-------------------------GFFNEEQRTLNLTDTV--WCSIPSEKLKELSIKKWAIDSCANKIA 51 (395) T ss_pred CchHHHH--H-------------------------hhhcccccccccccch--hhccccccchhhhhhhHHHHHHHHHHH Confidence 1100000 0 0000000000111121 222111 1233455678889 Q ss_pred hceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCC Q lcl|NC_011057. 73 SSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQ 152 (634) Q Consensus 73 ~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~ 152 (634) +.+|++.+..-+ + |. .+. ..+..+.+.=...-+...++.+.++.+|.+-|++|+++. |.+. 
T Consensus 52 ~~ia~~p~~~~~---~-~~---~~~------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~-~~~~----- 112 (395) T protein:vir:40 52 NTLSCAEVLTYE---K-GE---EVR------KKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQ-DEYI----- 112 (395) T ss_pred HHHhhCceeecc---C-Cc---ccc------chHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEe-cCce----- Confidence 999998887643 2 11 121 123344455577889999999999999999999998762 2110 Q ss_pred cccccccchhceeccHHHHhccCCCcceeeEeCCCCc--ccccCCCCeEEEe-eCCCcccccCCccchhhhhHHHHHHHh Q lcl|NC_011057. 153 PDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEE--HEFVKGTDIIFRV-WIPKPRKASEPDSPVRAVLDSIREIVR 229 (634) Q Consensus 153 ~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~--h~~~~~~D~~~Rv-W~P~prra~eaDSPvra~l~~LrEI~r 229 (634) .....|.+......... ......+|.. .+|. .+-+|++ .++.. -.++....+..+.++.- T Consensus 113 -----~~~~~~~~~~~~~~~~~-----~~~v~~~~~~~~~~~~--~~evih~r~~~~~-----~~~~~~~l~~~~~~~~~ 175 (395) T protein:vir:40 113 -----YVADSFTKNDKSLYENT-----YTEVTLKDLTLKKEFK--ESEVLHLTLNNES-----IKSIIDGFYLLYGDLLT 175 (395) T ss_pred -----eecCCccccccccccce-----eeeeeecCceeeeeec--cccEEEeecCCCC-----ccccchhHHHHHHHHHH Confidence 11223433221111100 1111112221 1222 2333333 12211 12233333344444331 Q ss_pred hhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeE Q lcl|NC_011057. 230 TTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIA 309 (634) Q Consensus 230 ttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva 309 (634) . .+ ++...--.-+|++.+.. .+. ....+.+++++.+-+.-+.... 
.+.-++++ T Consensus 176 ~--~~-~~~~~~~~~~~~l~~~~------~~~-------------~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~vl 228 (395) T protein:vir:40 176 A--AV-NKYKKLNSRKIIVKLKA------MFG-------------QTPEAEEKLRLMLSERMKKFLA-----EGDSALPV 228 (395) T ss_pred H--HH-HHHHhcCCCCceEEEec------ccC-------------CCHHHHHHHHHHHHHHHHHhhc-----cCCceeec Confidence 1 11 11111111223333311 100 0112445555555433222211 12222332 Q ss_pred eechHHhcccceeecCCch-hHHH-HHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHH Q lcl|NC_011057. 310 GVPGEQIKDVKHIRFDNEI-TEVA-IKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQ 387 (634) Q Consensus 310 ~vP~Ehi~~ikHl~f~~d~-te~a-iktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ 387 (634) + ..-+++-|...... .-+. -|+.+++++.||..|.||| .||| |+.+| .-+....-++..|.|.++.|++ T Consensus 229 --~--~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp-~~l~-~~~sn---~e~~~~~f~~~~L~P~~~~ie~ 299 (395) T protein:vir:40 229 --E--DGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPL-GLAK-GDTVG---LSEQVNSFLMFSINPIAEMFTD 299 (395) T ss_pred --C--CCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCH-HHhc-CCCcC---HHHHHHHHHHHHHHHHHHHHHH Confidence 2 33345555543322 2222 2456778899999999999 5666 44444 4566677788899999999999 Q ss_pred HHHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHH Q lcl|NC_011057. 388 ALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMW 464 (634) Q Consensus 388 ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~w 464 (634) +|++.+|-..-.. ..|-|-||.+.|. ++|..+.+ ..++..|++|.++.|+++|+.--.+.+-+.. 
T Consensus 300 ~l~~kLl~~~~~~-----~g~~i~fd~~~ll-~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD~~------ 367 (395) T protein:vir:40 300 EGNRKFYGRDSVL-----ERTYMKLDTTRIK-VQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQER------ 367 (395) T ss_pred HHHHhcCChhhhc-----CCceEEEechhhh-ccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCcee------ Confidence 9999998763211 3456668888884 34554433 3488999999999999999964222111000 Q ss_pred HHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCC Q lcl|NC_011057. 465 AQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQ 526 (634) Q Consensus 465 A~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~ 526 (634) + .|+ -++.++ ...+ ...+|++.|.+.|+ T Consensus 368 ----------~----~~~---n~~~~~---------~~~~--------~~kgge~~~~~~~~ 395 (395) T protein:vir:40 368 ----------F----VTK---NYAPLG---------ENEE--------DLKGGDINENKGDS 395 (395) T ss_pred ----------e----ecc---cccccc---------cccc--------ccCCCCCCCCcCCC Confidence 0 000 000000 0000 01222222222222 No 88 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=99.02 E-value=3.8e-10 Score=72.24 Aligned_cols=386 Identities=10% Similarity=0.056 Sum_probs=198.0 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.-. =+...|+.+.... .+...+.....+++...... 
.+ .+++-- -....-..+-+.=.|.-+++.+|++.+ T Consensus 1 m~m~-~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~---~~-~~g~~v-~~~~al~~~~v~~~v~~ia~~ia~lp~ 72 (392) T protein:vir:74 1 MILP-ILNFINQTNDPPE--AGSVQSYFPDGNDAQIMESL---LG-DNNEWV-SARAALRNSDLFSIILQLSSDLAIVKI 72 (392) T ss_pred Ccch-hhhhhhcccCccc--ccccccccccCchhhhhhhc---cC-CCCccc-chhhhhcchHHHHHHHHHHHhhccCce Confidence 2111 1233333322111 11111222222222211110 11 011100 001111234455567778888888776 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+=+. +.+.+ =..--+...++++.++.+|-+-|+.|+.+. |.. +|. . T Consensus 73 ~~~~~~~----------------~~l~~----~PN~~~t~~~f~~~~~~~lll~Gna~~~i~-r~~-------~G~---~ 121 (392) T protein:vir:74 73 NAEKKKN----------------QGIID----NPSTNANKHGFWQSMFAQLLLGGEAFAYRW-RNA-------NGA---D 121 (392) T ss_pred eeccchh----------------hhhhh----hcCCCCCHHHHHHHHHHHhhhcCCEEEEEE-ECC-------CCc---E Confidence 5432111 11111 133456778999999999999999998863 432 232 2 Q ss_pred hhceeccHHHHhccC--CCcceeeEeC--CCC--cccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSN--KGSGTNIVLP--TGE--EHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTI 234 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~--~~~~~~i~lP--~g~--~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 234 (634) ...+.|..+.++... .++...+... ++. +....+..|+ |++=.+.+.-...--||+.++.+.+.-...+.+.. 
T Consensus 122 ~~L~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~ev-ih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~ 200 (392) T protein:vir:74 122 MKWEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQSDL-IHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLT 200 (392) T ss_pred EEEEEEcCceeEEEEcCCCceEEEEEEecCCccceeEEEcCccE-EEecCCCCCCccccccHHHHHHHHHHHHHHHHHHH Confidence 346666666554332 2222333333 221 2222233443 33333444334456799999988887777777777 Q ss_pred HHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_011057. 235 ANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGE 314 (634) Q Consensus 235 ~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~E 314 (634) .+.-+.-..-.|||-+|+....+. ....++.+-+. ++.-+-=|+|+. T Consensus 201 ~~~f~ng~~p~~il~~~~~~~~~~-------------------~~~~~~~~~~~----------~~~n~g~~~vl~---- 247 (392) T protein:vir:74 201 ISSLNSSLNVPGVLTVKGGGLLSD-------------------KDKASRSRSFM----------KRSRSGGPVVLD---- 247 (392) T ss_pred HHHHhccCCCceEEEeCCCCCchH-------------------HHHHHHHHHHh----------ccccCCCeeecC---- Confidence 777777777788998887654321 11222222221 111222344442 Q ss_pred HhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHH Q lcl|NC_011057. 315 QIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQIL 394 (634) Q Consensus 315 hi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~l 394 (634) ..-+++.|.+...... -+++|+..+..||..|.||| .+||.. ..|..+. 
+-...-++..|.|.++.|+++|++.++ T Consensus 248 ~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp-~~lg~~-~~~~~~~-e~~~~~~~~~l~p~~~~ie~~l~~~l~ 323 (392) T protein:vir:74 248 DLEEFTALEIKSNVAQ-LLSQTDWTSKQYAKVYGLPD-SYIGGQ-GDQQSSI-QQISGMYASALNRYLRPAISELEYKLS 323 (392) T ss_pred CCceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCH-HHhCCC-CCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 3456677766543333 38999999999999999999 677764 3443333 334456788899999999999999876 Q ss_pred HHHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcc Q lcl|NC_011057. 395 RVTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPT 474 (634) Q Consensus 395 r~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~ 474 (634) ... ++|... +++.+.+ .+.+....++..|++|.++.|++++ +.|+.. ..+|. ..| T Consensus 324 ~~~----~~~~~~---~~~~d~~----~~~~~~~~l~~~g~~t~near~~~~---~~g~~p---ne~r~------~en-- 378 (392) T protein:vir:74 324 DHI----SVNMRP---AIDPLGD----NYLSTISTATRWGALAENQATFVLQ---EAGYIP---KDLPA------PEN-- 378 (392) T ss_pred chh----cccchh---hhcCCHH----HHHHHHHHHHhCCCcCHHHHHHHHH---hCCCCc---cccch------hcC-- Confidence 541 222211 1222111 1133445688999999999999873 233331 12222 001 Q ss_pred cchhhhhhhhhhhhcccCCCCCC Q lcl|NC_011057. 475 LIPMLAPLIAGVLQQIEFPQQQQ 497 (634) Q Consensus 475 Li~~laPll~p~~q~~~~P~p~~ 497 (634) | .| +-.+ +--+|.| T Consensus 379 l----~~----~~~G-d~~~p~p 392 (392) T protein:vir:74 379 T----NK----KTTG-QSNEPVP 392 (392) T ss_pred C----CC----CCCC-CCCCCCC Confidence 1 11 1112 2222333 No 89 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=99.00 E-value=3.5e-09 Score=66.96 Aligned_cols=372 Identities=13% Similarity=0.111 Sum_probs=175.2 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 
1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.==..++ ..-+. ...+++...+ .|+.+.=.+- ..-+.=.|.-+++.++.+.| T Consensus 1 M~~f~k~~--~~~~~-------------~~~~~~~~~~----------~~~~~~~~~~--~~~v~~~v~~ia~~iA~lp~ 53 (378) T protein:vir:85 1 MNLFGKVV--SFSRG-------------KLNNDTQRVT----------AWQNEAVEYT--SAFVTNIHNKIANEITKVEF 53 (378) T ss_pred Cchhhhhh--hhhhc-------------ccccCCccee----------eeeccchhhh--hHHHHHHHHHHHHhHhhCce Confidence 32211111 00000 0011111111 1222211111 01123347789999999998 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..=+.+.+.+.+-...+ ..+ .-+..+.+.=..--+...++.+-++.+|-.-|++|+.++.+.. +|. .. T Consensus 54 ~~~~~~~~~~~~~~~~~--~~~-~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~-------~g~--~~ 121 (378) T protein:vir:85 54 NHVKYKKSDVGSDTLIS--MAG-SDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSE-------TGE--LL 121 (378) T ss_pred eEEEEeccccccccccc--ccc-chHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCC-------Cce--EE Confidence 77666655333222111 111 2233444444667789999999999999999999998765422 221 11 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKS 240 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 240 (634) ..|+. .....|... 
|++ ++=+| -.-....++...++ +.|.++.++ T Consensus 122 ~~~~~---------------------~~~~~~~~~-dvi-h~~~~--~~~~~~~~~~~~a~----------~~~~~~~~~ 166 (378) T protein:vir:85 122 DLLFA---------------------NDKKEYKPE-ELV-RLVSP--FYINEDTSILDNAL----------ASIQTKLEQ 166 (378) T ss_pred EEEec---------------------CCCEEEccc-ceE-EEecC--cCccchhhHHHHHH----------HHHHHHHhc Confidence 11211 111123222 322 22122 11122223322222 223333332 Q ss_pred HhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccc-cccccceeEeechHHhccc Q lcl|NC_011057. 241 RLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDS-QAAFIPVIAGVPGEQIKDV 319 (634) Q Consensus 241 RL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S-~AA~vPiva~vP~Ehi~~i 319 (634) --+ +|+|-+|..++-. +.+++.+.+- ..+++..+ .-+-=++++. -..++ T Consensus 167 ~~~-~g~l~~~~~l~~~---------------------~~~~~~~~~~----~~~~~~~~~~~~g~~~vl~----~g~~~ 216 (378) T protein:vir:85 167 GKL-RGLLKINAFLDID---------------------NTQEYREKAL----ATIKNMQEGSSYNGLTPVD----NKTEI 216 (378) T ss_pred CCc-ceEEEeCCcCCHH---------------------HHHHHHHHHH----HHHHHhhcccccccceecC----CCceE Confidence 211 4777766544321 2222333322 22222111 1111233332 23455 Q ss_pred ceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 320 KHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLA 399 (634) Q Consensus 320 kHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~ 399 (634) +-|.+.... ..+++++.....||..|.|||..|-| +.. -+....-++..|.|.+..|.++|++.+|.+.=. 
T Consensus 217 ~~l~~~~~~--~~~~~~~~~~~~Ia~~fgVPp~~l~~--s~~-----e~~~~~f~~~tL~P~~~~ie~~l~~kLl~~~er 287 (378) T protein:vir:85 217 VELKKDYSV--LNKDEIELIKSELLTGYFMNENILLG--TAT-----QEQQIYFYNSTIIPLLIQLEKELTYKLISTNRR 287 (378) T ss_pred EeccCChhh--hhHHHHHHHHHHHHHHhCCCHHHhcC--Cch-----HHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhh Confidence 555554433 34567777778899999999966633 332 233344677889999999999999999876433 Q ss_pred hcCCChhHhe-eeecCcccccCCCchH---HHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCccc Q lcl|NC_011057. 400 REGIDPSKYV-VWYDASQLTIDPDKSD---EAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTL 475 (634) Q Consensus 400 ~eG~d~~~yV-~w~DaS~L~~~pd~t~---eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~L 475 (634) ..|.....|+ +.||.+.|..- |..+ ....++..|++|.+++|+++|+.--.|-| .+ + T Consensus 288 ~~~~~~~~~~~~~f~~~~l~~~-d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~gGD-------------~~-----~ 348 (378) T protein:vir:85 288 RVVKGNLYYERIIVDNQLFKFA-TLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD-------------IY-----I 348 (378) T ss_pred hhhhhccccceeeecchhhhhc-CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-------------eE-----e Confidence 3455444343 57888887543 2222 23458899999999999999996433222 00 0 Q ss_pred ch-hhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCC Q lcl|NC_011057. 476 IP-MLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHE 519 (634) Q Consensus 476 i~-~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~e 519 (634) +| .+.|+ +-+........ 
...+++++..| T Consensus 349 ~~~N~~~~--------~~~~~~~~~~~-------~~~~~~e~~n~ 378 (378) T protein:vir:85 349 ANLNAVAV--------KNLSDLQGSRK-------DVASTDETNNQ 378 (378) T ss_pred eccccccc--------ccchhhcCccC-------CCCCCCCCCCC Confidence 00 01111 11100000000 00001111111 No 90 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=98.98 E-value=4.4e-10 Score=71.86 Aligned_cols=352 Identities=14% Similarity=0.119 Sum_probs=183.4 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.-=. ||.. |+ .+..+..-..-.. +++...+.---+...+ ..+-+.-.|.=+++++|.+++ T Consensus 1 M~~~~-------~f~~-----r~-----~~~~~~~~~~~~~-~~~~~~~~~v~~~~al-~~~av~~cv~~ia~~ia~~p~ 61 (359) T protein:vir:10 1 MSILN-------PFER-----RS-----SITPNNYYPFMVQ-NGSIVPNSLVDATEAL-KNSDLYAVTSLISSDIAGTRF 61 (359) T ss_pred Ccccc-------hhhc-----cc-----cCCCCcchhhhhc-cccccCCcccCHHHhh-cchHHHHHHHHHHHhhhcCcc Confidence 33222 2211 10 0000000000000 0000000000000111 122333345567778888776 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) . + ......+ ..=..--+-..++++.++.+|-.-|+.|+.+ .|..+ |. . 
T Consensus 62 ~-----------------~---~~~~~~L-~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i-~r~~~-------g~---~ 109 (359) T protein:vir:10 62 I-----------------G---NQVFTSV-LNNPSHLTNAFSFWQTAILNLLLNGNVFLAI-LKGDN-------SL---M 109 (359) T ss_pred c-----------------c---chHHHHH-hhcccccCCHHHHHHHHHHhccccCceEEEE-EECCC-------Ce---E Confidence 3 1 1222222 2335556788899999999999999998875 35332 21 2 Q ss_pred hhceeccHHHHhccCCCcceeeEeC---CCCcccccCCCCe-EEEe--eCCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTNIVLP---TGEEHEFVKGTDI-IFRV--WIPKPRKASEPDSPVRAVLDSIREIVRTTKTI 234 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP---~g~~h~~~~~~D~-~~Rv--W~P~prra~eaDSPvra~l~~LrEI~rttk~I 234 (634) ..++.|..+.+.....+++.-+.+. +|...+|.. .|+ -||. +++++-.-..--||+.++...+.-..-..+.. T Consensus 110 ~~l~~l~~~~v~i~~~~~~~~y~~~~~~~~~~~~~~~-~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~ 188 (359) T protein:vir:10 110 KELRLIPSNAITIDLTDDTLTYEVNQFDDYPSAKYNA-SEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLS 188 (359) T ss_pred EEEEEeCCceEEEEEcCCeEEEEEEecCCceEEEEcc-cceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHH Confidence 3455555554443333334433332 344444433 343 2333 33444455567788887777666666556655 Q ss_pred HHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_011057. 235 ANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGE 314 (634) Q Consensus 235 ~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~E 314 (634) .+..+.=..-.|||-+|+.. + ..-+.+++++.+- -...+ .+ +--|+|+. 
T Consensus 189 ~~~f~ng~~~~gil~~~~~~-l-------------------~~e~~~~~~~~~~-~~~~~---~n---~g~~~vl~---- 237 (359) T protein:vir:10 189 LSTLKGALNPTSVVKVPQGT-L-------------------SSEAKDSIRKEFE-KANGG---NN---SGRVMVLD---- 237 (359) T ss_pred HHHHhccCCcceEEEeCCCC-C-------------------CHHHHHHHHHHHH-HHhCc---cc---cCCceecC---- Confidence 55555544556788777542 1 1124455666552 22111 11 11233332 Q ss_pred HhcccceeecCCchhHH-HHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHH Q lcl|NC_011057. 315 QIKDVKHIRFDNEITEV-AIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQI 393 (634) Q Consensus 315 hi~~ikHl~f~~d~te~-aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~ 393 (634) ..-+++-|.+. ..+. -+++|+..+..||..|-||| .+||. .+.++.+.-++. +-...++.|.+.-+++.|...+ T Consensus 238 ~g~~~~~l~~~--~~d~q~le~~~~~~~~Ia~~fgVPp-~~lg~-~~~~~~~~~~~e-~~~~~~l~~~l~p~~~~l~~~l 312 (359) T protein:vir:10 238 QSADFSTVSIN--ADVANYLNSMNWGRTQIAKAFGVSD-SYLNG-TGDQQSSLDQIK-DLYVNALNRFIEPLISELRIKC 312 (359) T ss_pred CCcceeeecCC--HHHHHHHHHHHHHHHHHHHHhCCCH-HHhCC-CCcccccHHHHH-HHHHHHHHHHHHHHHHHHHHHh Confidence 23345555543 3343 47899999999999999999 55663 233333444443 3334457888888888888887 Q ss_pred HHHHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCc Q lcl|NC_011057. 394 LRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDP 473 (634) Q Consensus 394 lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp 473 (634) .++. ++|... .+.||.+.+. .....+++.|++|.+|.|+.+|+.- T Consensus 313 ~~~~----~~~~~~-~~~~d~~~~~------~~~~~~~~~G~~t~NE~R~~l~~~p------------------------ 357 (359) T protein:vir:10 313 DSSI----GVDMSP-ITDYSNSVFK------ADILNWVKEGIIEPTEAKTLLESKG------------------------ 357 (359) T ss_pred hhhh----cccchh-hhhcCHHHHH------HHHHHHHhCCCcCHHHHHHHhCCCC------------------------ Confidence 7663 455444 4556644332 2245689999999999999999952 Q ss_pred ccchhhhhhh Q lcl|NC_011057. 
474 TLIPMLAPLI 483 (634) Q Consensus 474 ~Li~~laPll 483 (634) .+ T Consensus 358 --------v~ 359 (359) T protein:vir:10 358 --------II 359 (359) T ss_pred --------CC Confidence 11 No 91 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=98.94 E-value=8.7e-09 Score=64.78 Aligned_cols=374 Identities=14% Similarity=0.106 Sum_probs=177.1 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |- |..+-+. ..+. .-+.+ .+....|+.+.=. |.+ .-+.-.|.-+++.+|.+.+ T Consensus 1 M~------if~~~~~----~~~~-----~~~~~----------~~~~~~~~~~~~~-~~~-~~v~~~v~~Ia~~iA~lp~ 53 (378) T protein:vir:94 1 MN------LFGKVVS----FSRG-----KLNND----------TQRVTAWQNEAVE-YTS-AFVTNIHNKIANEITKVEF 53 (378) T ss_pred Cc------hhHHhHh----hhhc-----ccccC----------cceeeeeecchhh-hhh-HHHHHHHHHHHHhHhhCce Confidence 22 1111111 0000 00000 0111112222100 111 2244456678999999987 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) -.=+.+...|.+-.... ..++. +..+...=..--+...++.+.++.+|-.-|++||..+.+.. +|. 
T Consensus 54 ~~~~~~~~~~~~~~~~~--~~~~~-l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~-------~g~---- 119 (378) T protein:vir:94 54 NHVKYKKSDVGSDTLIS--MAGSD-LDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSE-------TGE---- 119 (378) T ss_pred eeeeecccccccccccc--cccch-HHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCC-------CCc---- Confidence 54455544333222111 11222 33344445677888999999999999999999998665422 221 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKS 240 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 240 (634) -|+.+. .....+|. .+-+|++= +|-......|+...++..+ .++.++ T Consensus 120 -~~~~~~------------------~~~~~~~~--~~dvih~~--~~~~~~~~~~~~~~~~~~~----------~~~~~~ 166 (378) T protein:vir:94 120 -LLDLLF------------------ANDKKEYK--PEELVRLT--SPFYINEDTSILDNALASI----------QTKLEQ 166 (378) T ss_pred -EEEEEE------------------ecCcEEec--hhceeeec--CcCCcccchhHHHHHHHHH----------HHHHhh Confidence 122111 00111221 12234442 2222333445544443332 222221 Q ss_pred HhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccc-cccceeEeechHHhccc Q lcl|NC_011057. 241 RLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQA-AFIPVIAGVPGEQIKDV 319 (634) Q Consensus 241 RL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~A-A~vPiva~vP~Ehi~~i 319 (634) = .-+|+|-.|..++-. +.+++.+ ..+..+++..+.+ +-=++++. -..++ T Consensus 167 ~-~~~g~l~~~~~l~~~---------------------~~~~~~e----~~~~~~~~~~~~~n~~~~~vl~----~g~~~ 216 (378) T protein:vir:94 167 G-KLRGLLKINAFLDID---------------------NTQEYRE----KALATIKNMQEGSSYNGLTPVD----NKTEI 216 (378) T ss_pred C-CcccceeeCCcCCHH---------------------HHHHHHH----HHHHHHHHhhcccccccceecc----CCceE Confidence 1 235777776654321 1122222 2223333321111 11123332 22234 Q ss_pred ceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 
320 KHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLA 399 (634) Q Consensus 320 kHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~ 399 (634) + .+.....+..+..++....+||..|.|||..|-|- ..+ +....-++..|.|.+..|+++|++.+|-+.=. T Consensus 217 ~--~l~~~~~~~~~~~~~~~~~~Ia~~fgvPp~~l~g~--~~e-----~~~~~f~~~tl~P~~~~ie~~l~~~Ll~~~e~ 287 (378) T protein:vir:94 217 V--ELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGT--ATQ-----EQQIYFYNSTIIPLLIQLEKELTYKLISTNRR 287 (378) T ss_pred E--EccCChHHhhHHHHHHHHHHHHHHhCCCHHHhcCC--chH-----HHHHHHHHHHHHHHHHHHHHHHHhhcCChhHh Confidence 4 34444555556777888899999999999776543 222 33334556789999999999999999866433 Q ss_pred hcCCChhHh-eeeecCcccccCC--CchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccc Q lcl|NC_011057. 400 REGIDPSKY-VVWYDASQLTIDP--DKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLI 476 (634) Q Consensus 400 ~eG~d~~~y-V~w~DaS~L~~~p--d~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li 476 (634) ..|.....| -+.||.+.|..-. ++.+....+++.|++|.+++|+++|+..-.|-| + -++ T Consensus 288 ~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~ggd----~--------------~~~ 349 (378) T protein:vir:94 288 RVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD----V--------------YIA 349 (378) T ss_pred hhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----e--------------eee Confidence 345533333 3578888875432 223333569999999999999999996533322 0 011 Q ss_pred hhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCC Q lcl|NC_011057. 477 PMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTE 523 (634) Q Consensus 477 ~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe 523 (634) | + -++.++.+........ 
+..+++ |++-| T Consensus 350 ~----~---n~~~~~~~~~~~~~~~-------~~~~~~----e~~n~ 378 (378) T protein:vir:94 350 N----L---NAVAVKNLSDLQGNRK-------DVTSTD----ETNNQ 378 (378) T ss_pred c----c---cccchhcchhcccccC-------CCCCCC----CCCCC Confidence 1 0 0111111111100000 000111 11111 No 92 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=98.93 E-value=8.5e-09 Score=64.82 Aligned_cols=514 Identities=13% Similarity=0.053 Sum_probs=196.5 Q ss_pred CC---------CCCcceeEeccCCCCccchhhhhh------------------------------hhccCCchhhhh--- Q lcl|NC_011057. 1 MA---------ATQSLRLVRRPKGGRPAPSRALTA------------------------------ASQPLPDPSQVF--- 38 (634) Q Consensus 1 ~~---------a~~~lr~vrrp~g~~~a~~ral~a------------------------------As~~itdp~~~~--- 38 (634) .. -+..||-..+- .|..+++.-+ |-..+......+ T Consensus 39 ~~~~~~~~p~~~~~~L~~~~e~---~~~~~~~i~~~~~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 115 (651) T protein:vir:99 39 QSHNVGVNPPYNPDRLAAFLEL---NETLATGIRKKSRYEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTG 115 (651) T ss_pred cccCCCCCCCCCHHHHHHHHhc---ChHHHHHHHHHhhhhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhccc Confidence 00 01111111110 0011111110 000000000000 Q ss_pred -hhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceee-eeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCC Q lcl|NC_011057. 39 -SKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSR-CRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADG 116 (634) Q Consensus 39 -~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr-~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG 116 (634) ......-.-..|-...+.-++++|=. |+-=+.+..++ ++|| .+++.+.+=... 
+.........+...-..+ T Consensus 116 ~~~~n~~~t~~~i~~~~~~Dle~tGna--~ieiIrn~~g~pv~L~--~lp~~~~Rv~~~---~~~~~~~~~~ll~~~pn~ 188 (651) T protein:vir:99 116 PNQAKTPATPERVKELARQDYHGVGWL--ALEMLTDIEGRPVGLA--YVPARTVRVRRP---QNRFDQPRHPEEGRYVDG 188 (651) T ss_pred ccccCCCCCHHHHHHHHHHHHHHHhhH--hhhhhhcCccchhhhh--hcChhheeeecc---cccccchhhhhhhccccc Confidence 00000001111222223222222100 01001121111 1221 233332211000 000000011111111111 Q ss_pred cchHHHHHHHHHHhhccccceEEEEEEecCCCC---CCCccccccc----chhceeccHHHHhccCCCcceeeEeCCCCc Q lcl|NC_011057. 117 TLGQAALTKRVVECLTVPGELWIVILTRPVKGA---PAQPDGSVRT----RQEWYAVSKEEIKKSNKGSGTNIVLPTGEE 189 (634) Q Consensus 117 ~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~---~~~~dg~~~~----~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~ 189 (634) ...-... .+++. +..-|..|+.+. ....+. .....+.... .+.|...... ........+...++.. T Consensus 189 ~~~~~~~-~~~~q-~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~v~~~~~~d~~~~~~~~----~~~~~~g~~~~~~~~~ 261 (651) T protein:vir:99 189 DVADIAS-RGYVQ-IRNGNRRYFGEA-GDRYRGQEVVIDESGDEPTIRYREDEESEREPI----FVDRETGDVTTGDANG 261 (651) T ss_pred ccchhHH-HHHHH-HHhcCcceEEEe-eccccceeeeeccCCcceeEEeccCcceeeeee----cccceeeeEEEcCCCc Confidence 1111111 11221 222344454321 111000 0000000000 0001110000 0000011123333333 Q ss_pred ccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCC Q lcl|NC_011057. 190 HEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGE 269 (634) Q Consensus 190 h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~ 269 (634) .+..+..| ||++=.+.+..-..--||+..++..+.=-.-..+...+..+.-....|||.+|... 
T Consensus 262 ~~~~~~~e-ViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~~~~~~--------------- 325 (651) T protein:vir:99 262 LENRPANE-LIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDTIPRMVIKVTGGE--------------- 325 (651) T ss_pred eeEecccc-eEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCC--------------- Confidence 33333344 44554455555556678888888877655555555555555556667778776421 Q ss_pred CCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHH-------hcccceeecCCchhHHHHHHHHHHHHH Q lcl|NC_011057. 270 EIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQ-------IKDVKHIRFDNEITEVAIKTRNDAIAR 342 (634) Q Consensus 270 ~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Eh-------i~~ikHl~f~~d~te~aiktR~daI~r 342 (634) ...-..+.|++.+-+. +. .+-=++|+..++.. .-+++.|.+..--+.--+++|+..+.. T Consensus 326 -----ls~e~~~~lr~~~~~~----~~-----nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~e 391 (651) T protein:vir:99 326 -----LSEESKRDLRQMLNGL----RE-----ESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHE 391 (651) T ss_pred -----CCHHHHHHHHHHHHHH----hc-----cCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHHH Confidence 1122456666666432 22 23345566554322 224444444332233458999999999 Q ss_pred HhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCccccc-CC Q lcl|NC_011057. 343 LAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTI-DP 421 (634) Q Consensus 343 lA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~-~p 421 (634) ||..|-||| .+||+..++|+.++-+....-++..|.|.+..|+++|++.+|.+.. ++...+|-|=||.+.|.. |. 
T Consensus 392 Ia~afgVPp-~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e---~~~~~~i~~ef~~~~llr~D~ 467 (651) T protein:vir:99 392 IAKVLEVPP-VKIGVTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQAL---GVTDWTIEYELRGADQPKQEA 467 (651) T ss_pred HHHHhCCCH-HHhccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc---cccCceEEEEeccchhhhccH Confidence 999999999 8889988999999999999999999999999999999999998844 443345566677777654 43 Q ss_pred CchHHHHH-HHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCC Q lcl|NC_011057. 422 DKSDEAKF-AYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAID 500 (634) Q Consensus 422 d~t~eA~~-~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~ 500 (634) ....++++ ++..|++|.+++|+++||.--. .+ |+ +..|. |+ |...-..+ . T Consensus 468 ~~~~e~~~~~i~~G~~T~NE~R~~lglppi~-----~~-----~g------d~~l~----~~-----~~~~~g~~----~ 518 (651) T protein:vir:99 468 QLAEQRVRAMRLAGVGLVDEAREELGLDPLG-----EP-----YG------EMTLS----EF-----EAEVAGDV----A 518 (651) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC-----Cc-----cc------ccccc----cc-----cccccccc----c Confidence 33333333 7888999999999999995311 10 00 00111 10 00000000 0 Q ss_pred CCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCc-------cHHHH------------------------------HHH Q lcl|NC_011057. 501 SGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAG-------LESGI------------------------------VDL 543 (634) Q Consensus 501 ~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~-------~~~a~------------------------------vdl 543 (634) .+++.+..++. +++....+.|.++-++.-.... ....| |-. T Consensus 519 ~gge~~~~~~~--~~~~~~~~~e~~~~~~~~~~~e~~~~~~v~ss~~~~~gyd~~~~~l~~~f~~~~~~~~~y~y~~v~~ 596 (651) T protein:vir:99 519 GGGETEAVHEP--PEENKIGEREWDTVKSELTTKDPIEQMQFSSSNLDEGLYDFGENELYLSFLRDEGQSSLYAYVDVPA 596 (651) T ss_pred cCCCCcccccC--ccccccccchhhhhhhhhcccchhhhhhHHHHHHHhhcCCCccceEEEEEeecCCCCceeeeeCCCH Confidence 11111111111 1111111111111111100000 00000 111 Q ss_pred HHHHHHHHhhHHhhcCChhH--HHHhhCCChHHhhhhcCCCChhHHHHHHhcccccccHHH Q lcl|NC_011057. 
544 MVDRALELVGKRRRGRDRET--LARLSGVRERDYHRYMDPVPESEVDRLMSGWDSALDDKI 602 (634) Q Consensus 544 lv~rALelAGkR~Rt~~R~~--~arlr~ip~h~~h~~~~Pv~~~~v~rLi~GWd~~ld~~~ 602 (634) -|.++|--|.-.=+=--+.= +.+.+.| .+.|-.+.--+..++..+- |.+. ++| T Consensus 597 ~~~~~~~~a~s~g~~~~~~i~~~~~~~~~--~~~~~~~~~~~~~~~~~~~---~~~~-~~~ 651 (651) T protein:vir:99 597 SEWSALANAGSHGGYHYDNIRLEYPYLEI--TNFHDRLPEGPAPDAGDVP---DGVP-DEI 651 (651) T ss_pred HHHHHHhcCcccceeehhccccccchhhh--hhhhhhCCCCCCCCcCCCC---CCCc-ccC Confidence 24444444332110000000 0000000 1122222111111111111 1111 233 No 93 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=98.91 E-value=1.2e-09 Score=69.42 Aligned_cols=386 Identities=11% Similarity=0.078 Sum_probs=194.4 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.- .=+...++.+-++..+.. ..-.....++.....- .+.....=+. +-.-..+-+.-.|.-+++.+|.+.+ T Consensus 1 m~m-~~f~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~---~~~~~~~v~~--~~al~~~~v~~~i~~ia~~ia~lp~ 72 (392) T protein:vir:10 1 MIL-PILNFINQTNDPPEVGSV--QSYFPDGNDAQIMESL---LGDNNEWVSA--RAALRNSDLFSIILQLSSDLAIVKI 72 (392) T ss_pred Ccc-hhhhhhhccccccccccc--ccccccCchhhhhhhh---cCCCCceech--HHhhccHHHHHHHHHHHHhhccCce Confidence 211 122233333222211110 0000011111111100 0000000011 1111345666677888999998887 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 
81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+=+. +.+ ..=..--+...++++.++.+|-+-|+.|+.+. |.. +|. . T Consensus 73 ~~~~~~~----------------~~l----~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~-r~~-------~g~---~ 121 (392) T protein:vir:10 73 NAEKKKN----------------QGI----IDNPSTNANKHGFWQSMFAQLLLGGEAFAYRW-RNA-------NGA---D 121 (392) T ss_pred eeccchh----------------hhH----hhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEE-ECC-------CCc---E Confidence 5532221 111 11244567888999999999999999998863 422 231 2 Q ss_pred hhceeccHHHHhcc--CCCcceeeEeC--C--CCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKS--NKGSGTNIVLP--T--GEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTI 234 (634) Q Consensus 161 ~~W~~vt~~Ei~~~--~~~~~~~i~lP--~--g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 234 (634) ..++.|....+... ..++...+... + +.+....+..|+ |++=.+.+.-...--||+.++...+.=...+.+.. T Consensus 122 ~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~ei-ih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~ 200 (392) T protein:vir:10 122 MKWEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQSDL-IHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLT 200 (392) T ss_pred EEEEEEcCceeEEEEcCCCceEEEEEEecCcccceeEEEccccE-EEecCCCCCCccccccHHHHHHHHHHHHHHHHHHH Confidence 34556655544322 22223333332 2 223333444554 34434444444556789988888876666666666 Q ss_pred HHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_011057. 235 ANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGE 314 (634) Q Consensus 235 ~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~E 314 (634) .+..+.-..-.|||-+|+..... +....++.+-+. 
++..+-=++|+ | T Consensus 201 ~~~f~ng~~p~gil~~~~~~~~~-------------------~~~~~~~~~~~~----------~~~~~g~~~vl--~-- 247 (392) T protein:vir:10 201 ISSLNSSLNVPGVLTVKGGGLLS-------------------DKDKASRSRSFM----------KRSRSGGPVVL--D-- 247 (392) T ss_pred HHHHhccCCCceEEEeCCCCCch-------------------HHHHHHHHHHHh----------ccccCCCeeec--C-- Confidence 66666666677888877653321 011222222222 11112223343 2 Q ss_pred HhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHH Q lcl|NC_011057. 315 QIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQIL 394 (634) Q Consensus 315 hi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~l 394 (634) ..-+++.|........ -+++|+..+..||..|.||| .+||.. ..+..+ .+-...-++..|.|.++.|+++|++.++ T Consensus 248 ~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVpp-~~lg~~-~~~~~~-~~~~~~f~~~~l~P~~~~ie~~l~~~L~ 323 (392) T protein:vir:10 248 DLEEFTALEIKSNVAQ-LLSQTDWTSKQYAKVYGLPD-SYIGGQ-GDQQSS-IQQISGMYASALNRYLRPAISELEYKLS 323 (392) T ss_pred CCceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCH-HHhCCC-CCcccH-HHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 3345666654433222 37999999999999999999 677863 333322 3334456788999999999999999876 Q ss_pred HHHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcc Q lcl|NC_011057. 395 RVTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPT 474 (634) Q Consensus 395 r~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~ 474 (634) .. .++|... ++ +.+.+ .+.+....+++.|++|.++.|++++ +.|+.. .| +|.+ .+ T Consensus 324 ~~----~~~d~~~-~~--~~d~~----~~~~~~~~l~~~g~~t~nE~r~~l~---~~g~~p--~e-~r~~------e~-- 378 (392) T protein:vir:10 324 DH----ISVNMRP-AI--DPLGD----NYLSTISTATRWGALAENQATFVLQ---EAGYIP--KD-LPAP------EN-- 378 (392) T ss_pred cc----ccccchh-hh--ccCHH----HHHHHHHHHHhCCCcCHHHHHHHHH---hcCCCc--cc-cchh------cC-- Confidence 54 1333221 11 21111 1123345688999999999999872 223321 11 2220 01 Q ss_pred cchhhhhhhhhhhhcccCCCCCC Q lcl|NC_011057. 
475 LIPMLAPLIAGVLQQIEFPQQQQ 497 (634) Q Consensus 475 Li~~laPll~p~~q~~~~P~p~~ 497 (634) | .| .-.+ +-.+|.| T Consensus 379 l----~~----~~~G-d~~~p~p 392 (392) T protein:vir:10 379 T----NK----KTTG-QSNEPVP 392 (392) T ss_pred C----CC----CCCC-CCCCCCC Confidence 1 11 1111 2233333 No 94 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=98.91 E-value=1.2e-09 Score=69.42 Aligned_cols=386 Identities=11% Similarity=0.078 Sum_probs=194.4 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.- .=+...++.+-++..+.. ..-.....++.....- .+.....=+. +-.-..+-+.-.|.-+++.+|.+.+ T Consensus 1 m~m-~~f~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~---~~~~~~~v~~--~~al~~~~v~~~i~~ia~~ia~lp~ 72 (392) T protein:vir:39 1 MIL-PILNFINQTNDPPEVGSV--QSYFPDGNDAQIMESL---LGDNNEWVSA--RAALRNSDLFSIILQLSSDLAIVKI 72 (392) T ss_pred Ccc-hhhhhhhccccccccccc--ccccccCchhhhhhhh---cCCCCceech--HHhhccHHHHHHHHHHHHhhccCce Confidence 211 122233333222211110 0000011111111100 0000000011 1111345666677888999998887 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+=+. +.+ ..=..--+...++++.++.+|-+-|+.|+.+. |.. +|. . 
T Consensus 73 ~~~~~~~----------------~~l----~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~-r~~-------~g~---~ 121 (392) T protein:vir:39 73 NAEKKKN----------------QGI----IDNPSTNANKHGFWQSMFAQLLLGGEAFAYRW-RNA-------NGA---D 121 (392) T ss_pred eeccchh----------------hhH----hhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEE-ECC-------CCc---E Confidence 5532221 111 11244567888999999999999999998863 422 231 2 Q ss_pred hhceeccHHHHhcc--CCCcceeeEeC--C--CCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKS--NKGSGTNIVLP--T--GEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTI 234 (634) Q Consensus 161 ~~W~~vt~~Ei~~~--~~~~~~~i~lP--~--g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 234 (634) ..++.|....+... ..++...+... + +.+....+..|+ |++=.+.+.-...--||+.++...+.=...+.+.. T Consensus 122 ~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~ei-ih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~ 200 (392) T protein:vir:39 122 MKWEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQSDL-IHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLT 200 (392) T ss_pred EEEEEEcCceeEEEEcCCCceEEEEEEecCcccceeEEEccccE-EEecCCCCCCccccccHHHHHHHHHHHHHHHHHHH Confidence 34556655544322 22223333332 2 223333444554 34434444444556789988888876666666666 Q ss_pred HHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_011057. 235 ANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGE 314 (634) Q Consensus 235 ~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~E 314 (634) .+..+.-..-.|||-+|+..... +....++.+-+. 
++..+-=++|+ | T Consensus 201 ~~~f~ng~~p~gil~~~~~~~~~-------------------~~~~~~~~~~~~----------~~~~~g~~~vl--~-- 247 (392) T protein:vir:39 201 ISSLNSSLNVPGVLTVKGGGLLS-------------------DKDKASRSRSFM----------KRSRSGGPVVL--D-- 247 (392) T ss_pred HHHHhccCCCceEEEeCCCCCch-------------------HHHHHHHHHHHh----------ccccCCCeeec--C-- Confidence 66666666677888877653321 011222222222 11112223343 2 Q ss_pred HhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHH Q lcl|NC_011057. 315 QIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQIL 394 (634) Q Consensus 315 hi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~l 394 (634) ..-+++.|........ -+++|+..+..||..|.||| .+||.. ..+..+ .+-...-++..|.|.++.|+++|++.++ T Consensus 248 ~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVpp-~~lg~~-~~~~~~-~~~~~~f~~~~l~P~~~~ie~~l~~~L~ 323 (392) T protein:vir:39 248 DLEEFTALEIKSNVAQ-LLSQTDWTSKQYAKVYGLPD-SYIGGQ-GDQQSS-IQQISGMYASALNRYLRPAISELEYKLS 323 (392) T ss_pred CCceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCH-HHhCCC-CCcccH-HHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 3345666654433222 37999999999999999999 677863 333322 3334456788999999999999999876 Q ss_pred HHHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcc Q lcl|NC_011057. 395 RVTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPT 474 (634) Q Consensus 395 r~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~ 474 (634) .. .++|... ++ +.+.+ .+.+....+++.|++|.++.|++++ +.|+.. .| +|.+ .+ T Consensus 324 ~~----~~~d~~~-~~--~~d~~----~~~~~~~~l~~~g~~t~nE~r~~l~---~~g~~p--~e-~r~~------e~-- 378 (392) T protein:vir:39 324 DH----ISVNMRP-AI--DPLGD----NYLSTISTATRWGALAENQATFVLQ---EAGYIP--KD-LPAP------EN-- 378 (392) T ss_pred cc----ccccchh-hh--ccCHH----HHHHHHHHHHhCCCcCHHHHHHHHH---hcCCCc--cc-cchh------cC-- Confidence 54 1333221 11 21111 1123345688999999999999872 223321 11 2220 01 Q ss_pred cchhhhhhhhhhhhcccCCCCCC Q lcl|NC_011057. 
475 LIPMLAPLIAGVLQQIEFPQQQQ 497 (634) Q Consensus 475 Li~~laPll~p~~q~~~~P~p~~ 497 (634) | .| .-.+ +-.+|.| T Consensus 379 l----~~----~~~G-d~~~p~p 392 (392) T protein:vir:39 379 T----NK----KTTG-QSNEPVP 392 (392) T ss_pred C----CC----CCCC-CCCCCCC Confidence 1 11 1111 2233333 No 95 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=98.88 E-value=8.4e-09 Score=64.87 Aligned_cols=384 Identities=13% Similarity=0.055 Sum_probs=164.4 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.=-.-++--++..... ..+. ..++. .. ....+ ...-+.-.+.-+++.+|.+.+ T Consensus 1 Mgl~d~~~~~~~~~~~~-------------~~~~-~~~~~---~~--------~~~~l-~~~~v~~~i~~Ia~~ia~lp~ 54 (395) T protein:vir:96 1 MGILDFFSFKKSGTLSD-------------DDSG-STTSE---KL--------TNVVL-KEDALYKCVNYLARIISKSTF 54 (395) T ss_pred CcchhhhcCCCCccccc-------------cccc-cchhh---hc--------chhhh-hhHHHHHHHHHHHHhhcccee Confidence 32222221111000000 0000 00000 00 00112 223445567889999999988 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+-+ . .+..+ ..+..+.+.-+.--+...++++.++.+|-.-|++|+.+. |..+ +... 
T Consensus 55 ~v~~~~----~---~~~~~----~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~-~~~~---------~~~~ 113 (395) T protein:vir:96 55 RIKAPE----K---LTENQ----KDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVI-PGKG---------IYVA 113 (395) T ss_pred EEEeCC----c---ccccc----chHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEE-cCCc---------eecC Confidence 765421 1 11111 223444555566778999999999999999999998863 3211 1111 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCC-cccccCCCCe-EEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHH-H Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTNIVLPTGE-EHEFVKGTDI-IFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIAN-A 237 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~-~h~~~~~~D~-~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n-a 237 (634) ..| .... .+. ...- ..+...++. .++| +..|+ -||.=++.-+.. -++++. ..+++..+.-.+.. + T Consensus 114 ~~~-~~~~-~~~-~~~~--~~v~~~~~~~~~~~-~~~dvih~k~~~~~~~~~--~~~~~~----~~~~~~~~~i~~~~~~ 181 (395) T protein:vir:96 114 DAF-TQDK-KLS-GNKF--KVSRVQGQTYEKIF-TFDQVIYLKNDNSDLMLK--VESLWE----EYGELLGHVINNQKIA 181 (395) T ss_pred Ccc-cccc-ccc-ccee--eeeeeccceeeeEe-ccCceEEecccCCccccc--cccccc----hHHHHHHHHHHHHHHH Confidence 122 1111 110 0000 111122222 1222 22332 234322222222 233333 33333332222211 1 Q ss_pred HHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhc Q lcl|NC_011057. 238 SKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIK 317 (634) Q Consensus 238 ~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~ 317 (634) ..+|...++. -+. ..+ .. +....+.-..+..++++ +-...+.. . 
...-++++ + ..- T Consensus 182 ~~~~~~~~~~--~~~-~~~-~~-----------~~~~~~~~~~~~~~~~~-~~~~~~~~--~--~~~~v~~l--~--~g~ 237 (395) T protein:vir:96 182 NQIRFTMTPP--KDK-VRE-RA-----------QENSDGGRQPKSDKDFF-KRTIEKIR--T--ESVVGIPV--T--ANT 237 (395) T ss_pred HHHHHHhhhc--ccc-ccc-ce-----------eeccCchhhHHHHHHHH-HHHHHHhh--c--CCcceEEc--c--CCc Confidence 1234433321 110 000 00 00011111112222221 11112221 1 12222221 2 222 Q ss_pred ccceeecCCch---hH--HHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHH Q lcl|NC_011057. 318 DVKHIRFDNEI---TE--VAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQ 392 (634) Q Consensus 318 ~ikHl~f~~d~---te--~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~ 392 (634) +.+-|+....- -+ --.+++++.+..||..|.||| .||| +|..+..+....=++..|.|.+..|+++|++. T Consensus 238 ~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp-~~l~----~~~sn~e~~~~~f~~~~L~P~~~~ie~~l~~~ 312 (395) T protein:vir:96 238 NYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPI-SLLH----GDIADNQKNYELLLEGPIESLITNIVDGLEYA 312 (395) T ss_pred eeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCH-HHhc----CCCccHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 33333332211 01 123456788999999999999 5555 34556777777788889999999999999999 Q ss_pred HHHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHh Q lcl|NC_011057. 
393 ILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAV 469 (634) Q Consensus 393 ~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v 469 (634) +|.+.-...|+ -|.+ +.| .+.|..+.+ ..+++.|++|.+++|+++|++.-.+ +.|=.- .+ T Consensus 313 Ll~~~e~~~~~-----~f~~--~~l-~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~-----~~gD~~----~~ 375 (395) T protein:vir:96 313 IFDKSETLEGS-----FIKV--TGL-KNYDLFSISSQADKLISSGFVFIDEVREEIGLPELPD-----GLGKVL----YM 375 (395) T ss_pred cCChhhhcCce-----eEee--cch-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC-----CCCcee----ee Confidence 98653221221 2344 343 233444333 5588999999999999999964221 111000 00 Q ss_pred hcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccC Q lcl|NC_011057. 470 SKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDN 512 (634) Q Consensus 470 ~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~ 512 (634) ..| +.|+ ...|++.+++.+. T Consensus 376 ~~N------~~~~-----------------~~~gge~~~~~~~ 395 (395) T protein:vir:96 376 TKN------YESV-----------------LERGGEVDEEVET 395 (395) T ss_pred ccc------ceec-----------------hhccCCCCCCCCC Confidence 000 0111 0011111111111 No 96 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=98.85 E-value=1.6e-08 Score=63.35 Aligned_cols=384 Identities=14% Similarity=0.080 Sum_probs=172.6 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.=-..|. +++-. 
....+.++...+ .+..+ ..+ ...-+.-.+.-++++||.+.+ T Consensus 1 MGlf~~~~---~~~~~----------~~~~~~~~~~~~----------~~~~~--~~~-~~~~v~~~I~~ia~~iA~lp~ 54 (395) T protein:vir:98 1 MGILDFFS---FKKSG----------TLSDDDSGSTTS----------EKLTN--VVL-KEDALYKCVNYLARIISKSTF 54 (395) T ss_pred Ccchhhhc---CCCcc----------cccccccchhhh----------hhcch--hhh-hhHHHHHHHHHHHHHHhhCce Confidence 44333331 11100 000111111111 01111 112 223445567788999999888 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+-+ + +.+. +. .+..++..-+-..+...++++.++.+|.+-|++||++. |.++ +... T Consensus 55 ~~~~~~-~-----~~~~-~~----~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~-~~~~---------~~~~ 113 (395) T protein:vir:98 55 RLKTPE-K-----LTEN-QK----DWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVI-PGKG---------IYVA 113 (395) T ss_pred eEEecC-C-----cccc-cc----hHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEE-eCCc---------eecC Confidence 775533 1 1221 11 23344455566778999999999999999999999864 3211 1111 Q ss_pred hhceeccHHHHhccCCCcceeeEeCCCC-cccccCCCC-eEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTNIVLPTGE-EHEFVKGTD-IIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANAS 238 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~i~lP~g~-~h~~~~~~D-~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 238 (634) ..|. +... +. ...- ..+..-+.. ..+|. ..| +-||..++..+.. -++++...-..+.......+ .+. 
T Consensus 114 ~~~~-~~~~-~~-~~~~--~~~~~~~~~~~~~~~-~~evih~k~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~---~~~ 182 (395) T protein:vir:98 114 DSFT-QDKK-IS-GSQF--KVSRVQGQTYEKTFT-FDQVIYLKNDNSDLMSK--VESLWEEYGELLGHVINNQK---IAN 182 (395) T ss_pred Cccc-cccc-cc-Cccc--ceeeecCceeeeEec-CccEEEecCCCCCcccc--ccchhhhHHHHHHHHHHHHH---HHH Confidence 2222 2110 00 0000 011111111 12222 223 3344344333332 23333333333333222211 122 Q ss_pred HhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcc Q lcl|NC_011057. 239 KSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKD 318 (634) Q Consensus 239 ~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ 318 (634) ..|..+|+-- ....+... ....+.-..+.+++. ++-...+.. ...-+++| . |..-+ T Consensus 183 ~~~~~~~~~~---~~~~~~~~------------~~~~~~~~~~~~~~~-~~~~~~~~~--~~~~~v~~----l--~~g~~ 238 (395) T protein:vir:98 183 QIRFTMIPPK---DKVRERAQ------------ENSDGGRQSKSDKDF-FKRTVEKIR--TESVVGIP----V--TANTN 238 (395) T ss_pred HHHHhhcccc---cccccccc------------ccCCcHHHHHHHHHH-HHHHHhhhh--cCCcceee----c--CCCce Confidence 3344444211 11111100 000111111222222 221122211 11111221 1 23334 Q ss_pred cceeecCC-----chhHHHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHH Q lcl|NC_011057. 319 VKHIRFDN-----EITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQI 393 (634) Q Consensus 319 ikHl~f~~-----d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~ 393 (634) .+-|++.+ .-++=-+++|+..+..||..|.|||. 
+|| +|..+..+....-++..|.|.+..|.++|++.+ T Consensus 239 ~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~-~l~----~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~kl 313 (395) T protein:vir:98 239 YEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPIS-LLH----GDIADNQKNYELLLEGPIESLITNIVDGLEYAI 313 (395) T ss_pred eEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHH-Hhc----CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 44454321 22234578999999999999999995 555 234456666667778889999999999999999 Q ss_pred HHHHHHhcCCChhHheeeecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhh Q lcl|NC_011057. 394 LRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVS 470 (634) Q Consensus 394 lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~ 470 (634) |.+.....|+ .||.+.| .+.|..+.+ ..+++.|++|.+++|+++|++--.+ +.| T Consensus 314 l~~~~~~~g~-------~f~~~~l-~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~-----~~g---------- 370 (395) T protein:vir:98 314 FDKSETLQGS-------FIKVTGL-KNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPD-----GLG---------- 370 (395) T ss_pred CChhhhcCcc-------eeeehhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC-----CCC---------- Confidence 8764433332 1344454 333444333 3478999999999999999963211 110 Q ss_pred cCcccch-hhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccC Q lcl|NC_011057. 471 KDPTLIP-MLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDN 512 (634) Q Consensus 471 ~dp~Li~-~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~ 512 (634) |--+++ .++|+ +..|++.+++.+. 
T Consensus 371 -D~~~~~~n~~~~-----------------~~~gge~~~~~~~ 395 (395) T protein:vir:98 371 -KVLYMTKNYESV-----------------LERGGEVDEEVET 395 (395) T ss_pred -ceeeecccceec-----------------ccccCCCCCCCCC Confidence 000111 00111 1111111111111 No 97 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=98.79 E-value=2.9e-08 Score=61.94 Aligned_cols=441 Identities=14% Similarity=0.101 Sum_probs=207.1 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |-....-.+..++.......+++.-||+.-- ..+.-...+.+.++....+.+....=.|-.--+|-++++....= T Consensus 1 m~~~~~~~~a~~~~~~~~~~~~~y~aa~~~~-----~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~ 75 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLVPVGASAYEGASGGH-----RWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVA 75 (495) T ss_pred CCcccccccccchhhhhHHHhhhhhccccCc-----ccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH Confidence 4433332222222211112223344443211 11100112344455544444433333333334455555443321 Q ss_pred EE--eeecccCCCCCCCCCCCCcccHHHHHHHHhh-----cCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCc Q lcl|NC_011057. 81 VA--SELDENTGLPTGGISEDNTEGERVREIVSKI-----ADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQP 153 (634) Q Consensus 81 ~a--seiD~Dtg~ptG~i~ed~~~g~r~~~iv~~i-----agG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~ 153 (634) ++ +-|-|. -.+ ++..-++++.+.-+.- +.|.+.=.+|.+-+...+-+-||+.+++..++.. 
T Consensus 76 ~vVG~Gi~p~-----~~~-~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~------ 143 (495) T protein:vir:10 76 AAVGNGLTPR-----WRM-KEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLS------ 143 (495) T ss_pred hhcCCCcccc-----cCC-chHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccC------ Confidence 11 223222 111 1111223333333333 3677777778888888889999999988665432 Q ss_pred ccccccchhceeccHHHHhccC-------------------CCcceeeEeC---CCC------cccccC-CCCeEEEeeC Q lcl|NC_011057. 154 DGSVRTRQEWYAVSKEEIKKSN-------------------KGSGTNIVLP---TGE------EHEFVK-GTDIIFRVWI 204 (634) Q Consensus 154 dg~~~~~~~W~~vt~~Ei~~~~-------------------~~~~~~i~lP---~g~------~h~~~~-~~D~~~RvW~ 204 (634) +|.- ..=.-..|..+.|.... .+.-+.|-+- +|+ ..++.. ...-++|+++ T Consensus 144 ~g~~-~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~ 222 (495) T protein:vir:10 144 EGLS-VPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTV 222 (495) T ss_pred CCCc-cceEEEEechhhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEeccc Confidence 1210 00122333333332110 0111111111 111 111111 1234667775 Q ss_pred CCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCcee-eecccccCCCCcCCCCcCCCCCCCccccchHHHHH Q lcl|NC_011057. 205 PKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVL-FVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQL 283 (634) Q Consensus 205 P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvl-fvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l 283 (634) ++|.-..-- |--++| ++ |..+.++..+....-.+.+=+. ||=++. |+.........++... 
.| T Consensus 223 ~r~gQ~RGi--s~la~i--~~-l~~l~~y~dael~~a~i~A~~~~fi~~~~--~~~~~~~~~~~~~~~~--~~------- 286 (495) T protein:vir:10 223 LTVRSDAGA--PWFQLL--LR-LNELDQYEDAELVRKKTAALFAAFIQEAT--ADSTGGPTIGQPKRSK--GG------- 286 (495) T ss_pred cCCCcccCc--chhHHH--HH-HHHhhHHHHHHHHHHHHhhhheeeeecCC--CccccccccCcccccc--Cc------- Confidence 544333322 223333 33 6677777777777666666544 543322 2111110000000000 00 Q ss_pred HHHHHHHHhhcccCccccccccc-eeE-eechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCc Q lcl|NC_011057. 284 TDMLFQVAETAVEDEDSQAAFIP-VIA-GVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQT 361 (634) Q Consensus 284 ~~ml~qva~tai~De~S~AA~vP-iva-~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~ 361 (634) .+.-.+=| .|. --|||.|+-++.= .-+.-...+..-.+|.||.|+.||-|.|.|=-+++ T Consensus 287 ---------------~~~~~l~pG~i~~L~pGe~i~~~~p~----~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~ 347 (495) T protein:vir:10 287 ---------------KRITGLNPGTLQYLQPGQEVKFSNPA----DVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGV 347 (495) T ss_pred ---------------ccceecCCceeeecCCCCeeeeeCCC----CCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccc Confidence 00001112 111 1244444333321 12334456778899999999999999999966799 Q ss_pred chhhHhhhhhhhhhHHHH--h--HHHHHHHHHHHHHHHHHHHhcCCC-------hhHh--eeeecCcccccCCCchHHH- Q lcl|NC_011057. 362 NHWSAWQISDEDVQLHIA--P--VMEIFCQALTDQILRVTLAREGID-------PSKY--VVWYDASQLTIDPDKSDEA- 427 (634) Q Consensus 362 NhwtAw~i~de~v~~hI~--P--~~~~i~~ait~~~lr~~L~~eG~d-------~~~y--V~w~DaS~L~~~pd~t~eA- 427 (634) |++|+-+-.-|..+..-. 
= ++..+|+-|++-||.-++..--|+ ...| +-|.=++-..+||-|.-+| T Consensus 348 nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~ 427 (495) T protein:vir:10 348 NYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLAD 427 (495) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHH Confidence 999999988887665322 1 345689999999999887553343 2344 3487888889999987777 Q ss_pred HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhh-hhhhhhhhcccCCCCCCCCCCCCCCC Q lcl|NC_011057. 428 KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLA-PLIAGVLQQIEFPQQQQAIDSGGNED 506 (634) Q Consensus 428 ~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~la-Pll~p~~q~~~~P~p~~a~~~~~~~~ 506 (634) +.+++.|..|-+++-+.+|.+- .|..++.|.++-. ...+. || + .-|.+.+++.. T Consensus 428 ~~~i~~G~~s~~~~~a~~G~D~--------~~v~~q~a~e~~~-----~~~~Gl~~-~------~~p~~~~~~~~----- 482 (495) T protein:vir:10 428 LGDVRAGFAPISDKQAERGYDM--------EELFDMISDANQL-----IDEYDLRL-D------SDPRYVNGSGA----- 482 (495) T ss_pred HHHHHcCCCCHHHHHHHcCCCH--------HHHHHHHHHHHHH-----HHHcCCCC-C------CCCCcCCCccC----- Confidence 7799999999999999888753 2455555554422 11110 11 0 01111111100 Q ss_pred CccccCCCCCCCCCCCCCCC Q lcl|NC_011057. 507 TSDDDNLDDGEHEPDTEDDQ 526 (634) Q Consensus 507 ~~~d~~~~~~~~ePDTe~d~ 526 (634) .....+++.+++. T Consensus 483 -------~~~~~~~~~~~~e 495 (495) T protein:vir:10 483 -------EQKSVMEAALNNE 495 (495) T ss_pred -------CCCCCCCCCCCCC Confidence 0111111111111 No 98 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=98.76 E-value=1.1e-09 Score=69.69 Aligned_cols=332 Identities=11% Similarity=0.095 Sum_probs=170.6 Q ss_pred cchhhhhhhhccCCchhhhhhhh---cccCccccccHHHHHHHh-hhhhHHHHHhhhhhceee---eeEEEeeecccCCC Q lcl|NC_011057. 
19 APSRALTAASQPLPDPSQVFSKS---TGISRNSDWQTDAWEAVD-LVGELRYYVGWRASSCSR---CRLVASELDENTGL 91 (634) Q Consensus 19 a~~ral~aAs~~itdp~~~~~~~---~~~~~~~~WQ~eAW~~yd-~VgELryyvgWr~~s~Sr---~rL~aseiD~Dtg~ 91 (634) +......++++.-+.+..+|+-. ... ....|=.++|+++- -.|+ |-..-+|+ ++|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~-~~~~~~~~~~~~~~~~~~~------~~epp~~~~~La~l~---------- 63 (348) T protein:vir:26 1 MTEQLIHSHTTDGTESKSVYSFDPNPEPV-DTNSWMTRYCELFYNDFDD------YWEPPISLKGLAEIA---------- 63 (348) T ss_pred CCccccchhhccccCCceEEEecCCCeee-cCcchHHHHHHHHhcCCCc------cccCCCCHHHHHHHH---------- Confidence 11111111111111111222210 000 12234455555542 1121 21111111 1110 Q ss_pred CCCCCCCCCcccHHHHHHHHhh------cCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhcee Q lcl|NC_011057. 92 PTGGISEDNTEGERVREIVSKI------ADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYA 165 (634) Q Consensus 92 ptG~i~ed~~~g~r~~~iv~~i------agG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~ 165 (634) +.++-+-++...-+.| ..+.+.+.++ ++++.++-+=|.+|+.+. |.+. |. .-..+. T Consensus 64 ------~~n~~h~~~i~~k~N~l~~~~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~~~-rn~~-------G~---~~~L~~ 125 (348) T protein:vir:26 64 ------NANGYHGSLLKARANYVAGRFMNGGGLPMYKM-NSACWDYFGLGMSAFVKI-RSYL-------KN---VIALEP 125 (348) T ss_pred ------hhhhhhhhhHhhhhhHHhhcccCCCCCCHHHH-HHHHHHHHhcCCeEEEEE-EcCC-------Cc---EEEEEE Confidence 0112221121111112 4455666666 677888888899998864 6443 21 223555 Q ss_pred ccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhC Q lcl|NC_011057. 166 VSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGN 245 (634) Q Consensus 166 vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gn 245 (634) +....+.+...+. .-....+|+.++|.. +-||++=.|+|.....--||..+++.++.--.-.++.-++--+.=.... 
T Consensus 126 l~~~~v~~~~d~~-~~~~~~~g~~~~f~~--~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg 202 (348) T protein:vir:26 126 LPMVHMRKRKNGD-FVQLLRNNEQKVFKA--KDVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMG 202 (348) T ss_pred ecCceeEeeecCc-EEEEEecCeEEEEcC--ccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Confidence 5556666554332 334556888888755 3345666778877777788988888765432222222112222222344 Q ss_pred ceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecC Q lcl|NC_011057. 246 GVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFD 325 (634) Q Consensus 246 GvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~ 325 (634) |||.++.. . ...-..+.|.+.+-+ ++ ..-..=.+++..|+-.-+.+|-..++ T Consensus 203 ~Il~~~~~------~--------------ls~e~~~~lk~~~~~-~~-------G~~n~~~~~vl~~~g~~~Gi~~~pis 254 (348) T protein:vir:26 203 FIFYATDP------N--------------LSEADEKALKEKIAS-SK-------GIGNFRSMFVNIPNGKEKGIQLIPVG 254 (348) T ss_pred eEEEecCC------C--------------CCHHHHHHHHHHHHH-hc-------CcccccceeEEcCCCCccceeEEEcc Confidence 55655421 0 112245555555532 11 11122345566665333344444443 Q ss_pred C-chhHHHHHHHHHHHHHHhhhccCChHHhhccc--cCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011057. 326 N-EITEVAIKTRNDAIARLAMGLDVSPERLLGLG--SQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREG 402 (634) Q Consensus 326 ~-d~te~aiktR~daI~rlA~~~D~~pE~LLGlg--s~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG 402 (634) . ..+.--+++|+.....||..+-||| .|+|+. +..|..++-+....=++..|.|.++.|+++|++.++ T Consensus 255 ~~~~d~qf~e~k~~t~~dIa~af~VPp-~llGi~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~l~-------- 325 (348) T protein:vir:26 255 DIATKDEFERIKNITAQDIFVGHRFPA-GMGGMLPQQGANVPDPLKVSQVYDFYEVIPVCKRFMDAVNNDPE-------- 325 (348) T ss_pred CChhHHHHHHHHHhhHHHHHHHhCCCH-HHccccCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhhC-------- Confidence 2 2233468899999999999999999 789973 336778888888888999999999999999997542 Q ss_pred CChhHheeeecCcccccCCCchHHHHHHHHccCC Q lcl|NC_011057. 
403 IDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAI 436 (634) Q Consensus 403 ~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~I 436 (634) +. ..+-|+||.+.. .|+++. .+| T Consensus 326 ~~-~~~~~~fdl~~~---~e~~~~-------~a~ 348 (348) T protein:vir:26 326 IP-DNLKLKFNLNPG---VESANG-------SAV 348 (348) T ss_pred CC-CccEEEEecCcc---cccchh-------hcC Confidence 22 445677876532 222222 222 No 99 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=98.70 E-value=2.5e-09 Score=67.79 Aligned_cols=330 Identities=15% Similarity=0.118 Sum_probs=176.6 Q ss_pred eEec--cCCCCccch-hhhhhhhccCCchh-hhhhhhcccCc----cccccHHHHHHHhhhhhHHHHHhhhhhceee--- Q lcl|NC_011057. 9 LVRR--PKGGRPAPS-RALTAASQPLPDPS-QVFSKSTGISR----NSDWQTDAWEAVDLVGELRYYVGWRASSCSR--- 77 (634) Q Consensus 9 ~vrr--p~g~~~a~~-ral~aAs~~itdp~-~~~~~~~~~~~----~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr--- 77 (634) ..|+ +....+++. .+.+...++-+.+. ..|+- +.. ...| ++|-+ |+-|.-+|..--+|+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~p~~v~~~~~------~~~y~-~~~~~~~~~~pp~~~~~l 70 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTF---GDPMPVLDGRG------ILDYL-ECWPNGRWYEPPLSMEGL 70 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEEe---CCceeecCcch------hhHHH-HHhhcCccccCCCCHHHH Confidence 2222 222222222 33333333322111 12221 111 1111 11111 222222343333333 Q ss_pred eeEEEeeecccCCCCCCCCCCCCcccHHHHHHHH------hhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCC Q lcl|NC_011057. 78 CRLVASELDENTGLPTGGISEDNTEGERVREIVS------KIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPA 151 (634) Q Consensus 78 ~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~------~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~ 151 (634) +||+ +.++.+.++....+ -+..+.+...++ ++++..+-+=|.+|+.+. 
|.+ T Consensus 71 a~~~----------------~~~~~h~~~l~~k~n~l~~~~~Pn~~~t~~~f-~~~v~d~ll~Gnay~~~~-rn~----- 127 (350) T protein:vir:11 71 AKSV----------------GSSVYLQSGLKFKRNMLAKTFIPHRLLSRATF-EQFSLDWLTFGSAYLEQP-RSR----- 127 (350) T ss_pred HHHH----------------hhhhhhccchhhhhhhhhhcccCCCCCCHHHH-HHHHHHHHhcCCeEEEEE-EcC----- Confidence 1221 11222211111111 145666777776 467777778899999874 533 Q ss_pred CcccccccchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhh Q lcl|NC_011057. 152 QPDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTT 231 (634) Q Consensus 152 ~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rtt 231 (634) .|. . -..+.+....+++...++..-...++|..++|..+ | ||++=+|+|.....--||..+++.++.--.-.+ T Consensus 128 --~G~--~-~~L~~l~~~~vr~~~~~~~~~~~~~~~~~~~~~~~-e-Vihir~~~~~~~~yGls~~~~a~~si~l~~~a~ 200 (350) T protein:vir:11 128 --LGT--R-MPLQAPLAKYMRRGTDLETFYQVRSWKDEHEFEKG-S-VIQLREADINQEIYGVPEWFCALQSALLNESAT 200 (350) T ss_pred --CCC--E-EEEEEeCCceeEeeecCCeEEEEeeCCeEEEECcc-c-EEEeCCCCCCCCcccccHHHHHHHHHHHHHHHH Confidence 232 2 23555666777766656555566778888888653 3 456656777777777889999888876544444 Q ss_pred HHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_011057. 232 KTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGV 311 (634) Q Consensus 232 k~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~v 311 (634) +.-++.-+.-..-.|||.+|... ...-..+.|.+.+-+ -.+....=.+++.. 
T Consensus 201 ~~~~~~f~NGa~~~gil~~~~~~--------------------ls~e~~~~l~~~~~~--------~~G~~N~~~~~v~~ 252 (350) T protein:vir:11 201 LFRRKYYNNGSHAGFILYMTDAA--------------------QNEEDIDALRTALKT--------AKGPGNFRNLFVYA 252 (350) T ss_pred HHHHHHHhccCCCceEEEecCCC--------------------CCHHHHHHHHHHHHH--------hcCccccCceeeec Confidence 44443333333444667665310 112245556555532 11222223445566 Q ss_pred chHHhcccceeecCCchhHH-HHHHHHHHHHHHhhhccCChHHhhcccc--CcchhhHhhhhhhhhhHHHHhHHHHHHHH Q lcl|NC_011057. 312 PGEQIKDVKHIRFDNEITEV-AIKTRNDAIARLAMGLDVSPERLLGLGS--QTNHWSAWQISDEDVQLHIAPVMEIFCQA 388 (634) Q Consensus 312 P~Ehi~~ikHl~f~~d~te~-aiktR~daI~rlA~~~D~~pE~LLGlgs--~~NhwtAw~i~de~v~~hI~P~~~~i~~a 388 (634) |+-.=+.+|-..++..-.+. -+++|+.....||..+-||| .|+|+-. .+|..++.+....=++..|.|.+..|++ T Consensus 253 ~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp-~llGi~~~~t~~~sn~e~~~~~f~~~~L~P~~~~ie~- 330 (350) T protein:vir:11 253 PNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYP-QLMGVVPQNAGGFGSISDAAAVWASLELAPMQTRLQQ- 330 (350) T ss_pred CCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCH-HHhcccCCCCCCcCCHHHHHHHHHHHHHHHHHHHHHH- Confidence 65333445555555444333 58899999999999999999 7999732 3567888898888999999999999986 Q ss_pred HHHHHHHHHHHhcCCChhHheeeecCccc Q lcl|NC_011057. 389 LTDQILRVTLAREGIDPSKYVVWYDASQL 417 (634) Q Consensus 389 it~~~lr~~L~~eG~d~~~yV~w~DaS~L 417 (634) |.+ +|- .+-+... -|+.++| T Consensus 331 ln~-~l~----~~~~~F~----~~~~~~l 350 (350) T protein:vir:11 331 VNE-MIG----EEVVRFA----QFDAPGL 350 (350) T ss_pred HHh-hcC----ccccccC----cccccCC Confidence 444 331 1112223 4555888 No 100 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=98.69 E-value=2.2e-09 Score=68.08 Aligned_cols=356 Identities=12% Similarity=0.050 Sum_probs=183.9 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhh-----cccCccccccHHHHHHHhhhhhHHHHHhhhhhce Q lcl|NC_011057. 
1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKS-----TGISRNSDWQTDAWEAVDLVGELRYYVGWRASSC 75 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~-----~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~ 75 (634) |. =|=-|+.+-..++..++.++++.+- ++..+.+ +++ ....|.+..| ++|-+ |+-|.-+|-..-| T Consensus 1 m~----~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~fg-~p~~~~~~~~-~~~~~-~~~~~~~~~~~pi 70 (368) T protein:vir:79 1 MS----RNKTRRAARAASAHVRTANTDAPTE---HHTDRAAQAEVFSFG-DPVEVLDRRE-LLDYV-ECMRMGQWYEPPM 70 (368) T ss_pred CC----ccccccchhccCcccccccccCcch---hhccccCceEEEEcC-Cceeecchhh-HHHHH-HHHhccchhccCc Confidence 32 1111211122222222222222211 1111111 112 2223666554 44443 3323333666666 Q ss_pred ee---eeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCC Q lcl|NC_011057. 76 SR---CRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQ 152 (634) Q Consensus 76 Sr---~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~ 152 (634) +. ++|+-+- +.. |++..... . +.. ........+...++ ++++.++-+=|..|+.+. |.+ T Consensus 71 ~~~~la~~~~~~--~~h----~~~~~~~~--n-~l~-l~~~Pn~~~t~~~f-~~l~~d~ll~Gnay~~~~-r~~------ 132 (368) T protein:vir:79 71 PWDGLARSFRAA--AHH----SSAVYVKR--N-ILV-STFIPHPLLSRATF-ERLVLDWQVFGNAYLERR-ENV------ 132 (368) T ss_pred CHHHHHHHHhhc--ccc----chhhhhhc--c-hhh-hhcCCCcCCCHHHH-HHHHHHHhhcCCeEEEEE-EcC------ Confidence 64 2332111 110 11111000 0 111 12346777888887 678889889999998874 432 Q ss_pred cccccccchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhH Q lcl|NC_011057. 153 PDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTK 232 (634) Q Consensus 153 ~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk 232 (634) +|. .-.++.+....+++...++..-+...+|..++|.. 
+-||++=+|+|..-..--||..+++.++.--...++ T Consensus 133 -~G~---~~~L~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~--~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~ 206 (368) T protein:vir:79 133 -LGG---TIRLDTPLAKYVRRGLDLNTYFFVQNWQQPYTFAA--GSVFHLQEPDINQEVYGLPEYLSALNATWLNESATL 206 (368) T ss_pred -CCC---EEEEEEeCcccceeeccCCEEEEEecCCeEEEEcc--ccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHH Confidence 232 23466777777776666666667777888877765 335677788888888888999998877764333333 Q ss_pred HHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_011057. 233 TIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVP 312 (634) Q Consensus 233 ~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP 312 (634) .-++.-+.=..-.|||.+|... ...-..+.|.+.|-+ .+ +...+--++|+ .| T Consensus 207 ~~~~~~~NGa~~~gil~~~~~~--------------------l~~e~~~~lk~~~~~-~~------G~~N~g~~~vl-~~ 258 (368) T protein:vir:79 207 FRRRYYKNGSHAGFILYMTDAA--------------------QKQEDVDTLREAMKS-AK------GPGNFRNLFMY-AP 258 (368) T ss_pred HHHHHHhccCCCceEEEeCCCC--------------------CCHHHHHHHHHHHHH-hc------CCcccCceeEe-cC Confidence 3222222223334566665310 112245556655532 11 11122223333 34 Q ss_pred h--HHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhcccc--CcchhhHhhhhhhhhhHHHHhHHHHHHHH Q lcl|NC_011057. 313 G--EQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGS--QTNHWSAWQISDEDVQLHIAPVMEIFCQA 388 (634) Q Consensus 313 ~--Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs--~~NhwtAw~i~de~v~~hI~P~~~~i~~a 388 (634) + |..-+++.+.+... +.--+++|+.....||..+-||| .|||+.. 
..|..++-+....=++.-|.|.++.|++ T Consensus 259 ~g~~~g~~~~pls~~~~-d~qf~e~k~~~~~eIa~af~VPp-~llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~~ie~- 335 (368) T protein:vir:79 259 NGKKDGIQLLPVSEVAA-KDEFWNIKNVTRDDQLAAHRVPP-QLMGIIPNNTGGFGDVEKAAMVFARNEVKPLQDRLLA- 335 (368) T ss_pred CCCccceeEEEcCCCHH-HHHHHHHHHHhHHHHHHHhCCCH-HHccccCCCCCccccHHHHHHHHHHHHHHHHHHHHHH- Confidence 3 33334555554332 33357899999999999999999 8889832 2457788888888899999999999974 Q ss_pred HHHHHHHHHHHhcCCChhHheeeecCcccccCCCchHHHHHHHHccCCCH Q lcl|NC_011057. 389 LTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKSDEAKFAYENGAING 438 (634) Q Consensus 389 it~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t~eA~~~~~~G~It~ 438 (634) |++ + |. .+++-||...|... |....|..+.+ ++ T Consensus 336 ln~-~----l~-------~e~~rF~~~~l~~~-D~~a~a~~~~r----sa 368 (368) T protein:vir:79 336 IND-W----IG-------DEVVRFAPYALGGH-DQPAAAPGGQR----SA 368 (368) T ss_pred HHh-c----cC-------cceeeechhHhhcc-cccccCCcccc----cC Confidence 443 2 21 13455565555443 22222211111 11 No 101 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=98.66 E-value=1.1e-09 Score=69.77 Aligned_cols=330 Identities=14% Similarity=0.168 Sum_probs=175.2 Q ss_pred eEeccCCCCccchhhhhhhhccCCchhhhhhhhcccC-ccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE---E-Ee Q lcl|NC_011057. 9 LVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGIS-RNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL---V-AS 83 (634) Q Consensus 9 ~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~-~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL---~-as 83 (634) ..||-+.....+...-++.++.+ ..|+-..-.. 
....|..+..+++.- | +|| .--+++.-| + ++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~----~~~~f~~p~~v~~~~~~~~~~~~~~~-~--~~~----~pp~~~~~la~~~~a~ 69 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPKM----EAFTFGEPVPVLDRRDILDYVECISN-G--RWY----EPPISFTGLAKSLRAA 69 (344) T ss_pred CCcccCCCCCchHHhhcCCcCcE----EEEEcCCceeecCCcchhHHHHhhhc-C--ccc----cCCCCHHHHHHHHHhh Confidence 33443321111111111111111 1221110000 112234443333321 3 222 111111100 0 00 Q ss_pred eecccCCCCCCCCCCCCcccHHHHHHHHh-hcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchh Q lcl|NC_011057. 84 ELDENTGLPTGGISEDNTEGERVREIVSK-IADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQE 162 (634) Q Consensus 84 eiD~Dtg~ptG~i~ed~~~g~r~~~iv~~-iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~ 162 (634) .+- +++-. -+..-|... ...+.+...++ ++++.++.+=|..|+.+ .|.. +|. .-+ T Consensus 70 ~~h-------~~~i~-----~k~n~l~~~~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i-~rn~-------~G~---~~~ 125 (344) T protein:vir:60 70 VHH-------SSPIY-----VKRNILASTFIPHPWLSQQDF-SRFVLDFLVFGNAFLEK-RYST-------TGK---VIR 125 (344) T ss_pred hhh-------ccchh-----hhhhHHHhhccCCCCCCHHHH-HHHHHHHHhcCCeEEEE-EECC-------CCc---EEE Confidence 000 00000 111111111 24556777777 78999988999999875 3533 232 234 Q ss_pred ceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHh Q lcl|NC_011057. 163 WYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRL 242 (634) Q Consensus 163 W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL 242 (634) .+.|....+++...++..-++..+|+.++|.. +-||++=+|+|..-..-=||..+++.++. +... ...-+.|. 
T Consensus 126 L~~l~~~~vr~~~~~~~~~~v~~~~~~~~~~~--~eIiHir~~~~~~~~yGlsp~~~a~~si~----l~~~-a~~~~~~~ 198 (344) T protein:vir:60 126 LETSPAKYTRRGVEEDVYWWVPSFNEPTAFAP--GSVFHLLEPDINQELYGLPEYLSALNSAW----LNES-ATLFRRKY 198 (344) T ss_pred EEEcCcceEEEeecCCeEEEEccCCeEEEEcC--ccEEEEcCCCCCCCcccccHHHHHHHHHH----HHHH-HHHHHHHH Confidence 66667777776655554445555788888765 34577778888877778889888877654 2222 12234455 Q ss_pred hhCc-----eeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhc Q lcl|NC_011057. 243 IGNG-----VLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIK 317 (634) Q Consensus 243 ~gnG-----vlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~ 317 (634) ..|| ||.+|.. . ...-..+.|.+.+- +-...-+.=++|+.+|+..-+ T Consensus 199 f~NG~~pg~il~~~~~------~--------------ls~e~~~~ik~~~~--------~~~g~~~~r~~~l~~p~g~~~ 250 (344) T protein:vir:60 199 YENGAHAGYIMYVTDA------V--------------QDRNDIEMLRENMV--------KSKGRNNFKNLFLYAPQGKAD 250 (344) T ss_pred HhccCCCceEEEecCc------C--------------CCHHHHHHHHHHHH--------HhcCCCCCcceEEecCCCCcc Confidence 5554 5555421 0 11123444544442 212223556889999986666 Q ss_pred ccceeecCCchhHH-HHHHHHHHHHHHhhhccCChHHhhccc--cCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHH Q lcl|NC_011057. 318 DVKHIRFDNEITEV-AIKTRNDAIARLAMGLDVSPERLLGLG--SQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQIL 394 (634) Q Consensus 318 ~ikHl~f~~d~te~-aiktR~daI~rlA~~~D~~pE~LLGlg--s~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~l 394 (634) .+|-..++..-.+. -+++|+.....||..+-||| .|||+- +..|+.++-+....=++..|.|.++.|++ |.+ + T Consensus 251 g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp-~llGi~~~~t~~~~n~e~~~~~f~~~~L~Pl~~~~e~-ln~-~- 326 (344) T protein:vir:60 251 GIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPF-QLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIRE-ING-W- 326 (344) T ss_pred ceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCH-HHhcccCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHH-h- Confidence 67766665444433 58999999999999999999 799973 23457778888888899999999999975 443 2 Q ss_pred HHHHHhcCCChhHheeeecCcccccCCC Q lcl|NC_011057. 
395 RVTLAREGIDPSKYVVWYDASQLTIDPD 422 (634) Q Consensus 395 r~~L~~eG~d~~~yV~w~DaS~L~~~pd 422 (634) | |... +-|+...|..|.- T Consensus 327 ---l---g~~~----i~F~~~~l~~~d~ 344 (344) T protein:vir:60 327 ---L---GQEV----IRFKNYSLDTDNG 344 (344) T ss_pred ---c---CCcc----cccCccccCCCCC Confidence 3 4322 2344455555532 No 102 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=98.63 E-value=3.1e-09 Score=67.20 Aligned_cols=328 Identities=13% Similarity=0.108 Sum_probs=170.1 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhh--cccCccccccHHHHHHHh-hhhhHHHHHhhhhhceee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKS--TGISRNSDWQTDAWEAVD-LVGELRYYVGWRASSCSR 77 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~--~~~~~~~~WQ~eAW~~yd-~VgELryyvgWr~~s~Sr 77 (634) |. .+|.. ++-+.+.+++ ..|+-+ .-... ..|=.+..+++. ..|+ |-..-||+ T Consensus 1 m~---------~~~~~-----~~~~~~~~~~----~~~~~~~p~~~~~-~~~~~~~~~~~~~~~~~------~~~pP~~~ 55 (337) T protein:vir:78 1 MT---------KRQQQ-----PAQAAASSPR----PSVVFSMPEAIDP-TAWMTDYTGVFYNPYGE------YYQPPIDR 55 (337) T ss_pred CC---------CcccC-----cccccccCce----eEEEecCcccccC-cchhHhhhhhhhccCcc------eecCCCCH Confidence 11 11111 0111111111 112111 00111 112222222211 1111 11111211 Q ss_pred eeEEEeeecccCCCCCCCCCCCCcccHHHHHH-HHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccc Q lcl|NC_011057. 78 CRLVASELDENTGLPTGGISEDNTEGERVREI-VSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGS 156 (634) Q Consensus 78 ~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~i-v~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~ 156 (634) .-| ...-+.+|-+.++... .+-+.....+..+++++++.++-+=|..|+.+ .|.+. |. 
T Consensus 56 ~~L-------------a~l~~~~~~h~~~L~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~-~rn~~-------G~ 114 (337) T protein:vir:78 56 KGL-------------AKVARANAHHGAILMARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLK-LRNSF-------GQ 114 (337) T ss_pred HHH-------------HHHhhcchhhhhHHHhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEE-EECCC-------Cc Confidence 100 0011233434333332 23334444555678999999998999999875 55432 22 Q ss_pred cccchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_011057. 157 VRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIAN 236 (634) Q Consensus 157 ~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 236 (634) .-..+.+....+++...+ ..-+...+|+.++|.. .| +|++=+|+|..-..--||+.+++.++--=...++.-++ T Consensus 115 ---~~~L~pl~~~~v~~~~d~-~~~~~~~~~~~~~~~~-~e-IiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~ 188 (337) T protein:vir:78 115 ---VVGLHPLSSVYLRRREDG-CFVYLQQGKPNLIYRP-DD-VIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRR 188 (337) T ss_pred ---EEEEEEeCCceeEeeeCC-eEEEEEcCCceEEECC-cc-EEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 234566666777766433 3345556778877765 34 46776777766666678888888776533333333222 Q ss_pred HHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_011057. 237 ASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQI 316 (634) Q Consensus 237 a~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi 316 (634) --+.=..-.|||.+|... 
...-..+.|.+.+ ++-.+....=.+++..|+-.= T Consensus 189 ~f~NGa~p~~il~~~~~~--------------------l~~e~~~~lk~~~--------~~~~G~~n~~~~~v~~~~g~~ 240 (337) T protein:vir:78 189 YFLNGAHMGFIFYATDPN--------------------MDDDTEEEMKEMI--------ANSKGVGNFRSMFVNIPDGKP 240 (337) T ss_pred HHhccCCCceeEEcCCCC--------------------CCHHHHHHHHHHH--------HHhcCcccccceEEEcCCCCc Confidence 222223334566555310 0112444555444 222222334456677776444 Q ss_pred cccceeecCCchh-HHHHHHHHHHHHHHhhhccCChHHhhccccCcchhh---HhhhhhhhhhHHHHhHHHHHHHHHHHH Q lcl|NC_011057. 317 KDVKHIRFDNEIT-EVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWS---AWQISDEDVQLHIAPVMEIFCQALTDQ 392 (634) Q Consensus 317 ~~ikHl~f~~d~t-e~aiktR~daI~rlA~~~D~~pE~LLGlgs~~Nhwt---Aw~i~de~v~~hI~P~~~~i~~ait~~ 392 (634) +.||-..++..-. .--+++|+.....||..+-||| .|+|+=.+.+.|+ +-+....=++..|.|.++.|++++++. T Consensus 241 ~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp-~llGi~~~~~~~~~~n~e~~~~~f~~~~L~P~~~~ie~~~n~~ 319 (337) T protein:vir:78 241 DGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPP-ALAGIIPTNGGGGLGDPEKYDATYARNEVLPLCELVQDAINSA 319 (337) T ss_pred cceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCH-HHcccccCCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 5556666554333 3347899999999999999999 8999833334565 777777888899999999999999753 Q ss_pred HHHHHHHhcCCChhHhe-eeecCcccc Q lcl|NC_011057. 393 ILRVTLAREGIDPSKYV-VWYDASQLT 418 (634) Q Consensus 393 ~lr~~L~~eG~d~~~yV-~w~DaS~L~ 418 (634) ++++.+|+ |-+..+.|. T Consensus 320 ---------ll~~~~~~~f~~~~~~~~ 337 (337) T protein:vir:78 320 ---------GLPRALWVTFRETIGAAV 337 (337) T ss_pred ---------cCChhhceeccccccccC Confidence 23334443 333445555 No 103 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=98.61 E-value=5.5e-09 Score=65.85 Aligned_cols=322 Identities=14% Similarity=0.175 Sum_probs=176.4 Q ss_pred eEeccCCCCccch-hhhhhhhccCCchhhhhhhhc--ccCccccccHHHHHHHhhhhhHHHH----------Hhhhhhce Q lcl|NC_011057. 
9 LVRRPKGGRPAPS-RALTAASQPLPDPSQVFSKST--GISRNSDWQTDAWEAVDLVGELRYY----------VGWRASSC 75 (634) Q Consensus 9 ~vrrp~g~~~a~~-ral~aAs~~itdp~~~~~~~~--~~~~~~~WQ~eAW~~yd~VgELryy----------vgWr~~s~ 75 (634) ..||-+ ..+++. .+.++.++.+ ..|+-.. -. ....|-.+..+++. .|+ || =-.++|.. T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~----~~~~f~~p~~v-~~~~~~~~~~~~~~-~~~--~~~pp~~~~~la~~~~a~~~ 71 (344) T protein:vir:20 1 MSKKKG-KTPQPAAKTMTASGPKM----EAFTFGEPVPV-LDRRDILDYVECIS-NGR--WYEPPVSFTGLAKSLRAAVH 71 (344) T ss_pred CCcccC-CCCcchhhhhhccCCce----EEEEcCCceEe-cCcchhhhhhhhhh-cCc--eecCCCCHHHHHHHHhhhhh Confidence 233322 222222 2333333322 1222110 00 11122232222222 132 22 00111111 Q ss_pred eeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHH-HhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcc Q lcl|NC_011057. 76 SRCRLVASELDENTGLPTGGISEDNTEGERVREIV-SKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPD 154 (634) Q Consensus 76 Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv-~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~d 154 (634) ..-=||+ +..-+. .-+....+...++ ++++..+.+=|..|+.+ .|.+. T Consensus 72 h~~~i~~----------------------k~n~l~~~~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~i-~rn~~------- 120 (344) T protein:vir:20 72 HSSPIYV----------------------KRNILASTFIPHPWLSQQDF-SRFVLDFLVFGNAFLEK-RYSTT------- 120 (344) T ss_pred hCcccee----------------------hhhhHHHhccCCCCCCHHHH-HHHHHHHHhcCCeEEEE-EECCC------- Confidence 1000110 001111 1134556777777 78888888899999976 45332 Q ss_pred cccccchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_011057. 155 GSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTI 234 (634) Q Consensus 155 g~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 234 (634) |. .-.-+.+....+.+...++..-++..+|..++|.. 
+-||++=+|+|..-..--||..+++.++---.-.++ T Consensus 121 G~---~~~L~pl~~~~vr~~~~~~~~~~~~~~~~~~~~~~--~eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~-- 193 (344) T protein:vir:20 121 GK---VIRLETSPAKYTRRGVEEDVYWWVPSFNEPTAFAP--GSVFHLLEPDINQELYGLPEYLSALNSAWLNESATL-- 193 (344) T ss_pred Cc---EEEEEEcCCceeEeeecCCEEEEEccCCeEEEEcC--ccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHH-- Confidence 32 22355556666666655555455566788888765 335667688887777778898888877653322232 Q ss_pred HHHHHhHhhhC-----ceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeE Q lcl|NC_011057. 235 ANASKSRLIGN-----GVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIA 309 (634) Q Consensus 235 ~na~~SRL~gn-----GvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva 309 (634) -+.|...| |||.+|.. . ...-..+.|.+.|- +-...-+.=++|+ T Consensus 194 ---~~~~~f~NGa~p~~Il~~~d~------~--------------l~~e~~~~ik~~~~--------~~~g~~n~r~l~l 242 (344) T protein:vir:20 194 ---FRRKYYENGAHAGYIMYVTDA------V--------------QDRNDIEMLRENMV--------KSKGRNNFKNLFL 242 (344) T ss_pred ---HHHHHHhccCCCceEEEecCc------C--------------CCHHHHHHHHHHHH--------HhcCCCCccceEE Confidence 23344444 45655421 0 01113344444442 2222345678899 Q ss_pred eechHHhcccceeecCCchhHH-HHHHHHHHHHHHhhhccCChHHhhccc--cCcchhhHhhhhhhhhhHHHHhHHHHHH Q lcl|NC_011057. 310 GVPGEQIKDVKHIRFDNEITEV-AIKTRNDAIARLAMGLDVSPERLLGLG--SQTNHWSAWQISDEDVQLHIAPVMEIFC 386 (634) Q Consensus 310 ~vP~Ehi~~ikHl~f~~d~te~-aiktR~daI~rlA~~~D~~pE~LLGlg--s~~NhwtAw~i~de~v~~hI~P~~~~i~ 386 (634) ..|+..-+.+|-..++..-.+- -+++|+.....||..+-||| .|+|+- +.+|..++.+....=++..|.|.++.|. 
T Consensus 243 ~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp-~llGi~~~~t~~~~n~e~~~~~f~~~~l~P~~~~~e 321 (344) T protein:vir:20 243 YAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPF-QLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIR 321 (344) T ss_pred ecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCH-HHhccCCCCCCccccHHHHHHHHHHHHHHHHHHHHH Confidence 9998666677777776544433 48999999999999999999 788973 3356777888888888999999999997 Q ss_pred HHHHHHHHHHHHHhcCCChhHheeeecCcccccCCC Q lcl|NC_011057. 387 QALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPD 422 (634) Q Consensus 387 ~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd 422 (634) + |++ + | |... +-|+-..|..+.+ T Consensus 322 ~-in~-~----l---g~~~----i~F~~~~l~~~d~ 344 (344) T protein:vir:20 322 E-ING-W----L---GQEV----IRFKNYSLDTDND 344 (344) T ss_pred H-HHH-h----c---CCcc----cccCccccccCCC Confidence 5 443 2 3 4432 3355567777755 No 104 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=98.61 E-value=2.7e-09 Score=67.54 Aligned_cols=326 Identities=10% Similarity=0.087 Sum_probs=167.7 Q ss_pred EeccCCCCccchhh-hhhhhccCCchhhhhhhh-----cccCccccccHHHHHHHhhhhhHHHHHhhhhhceee---eeE Q lcl|NC_011057. 10 VRRPKGGRPAPSRA-LTAASQPLPDPSQVFSKS-----TGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSR---CRL 80 (634) Q Consensus 10 vrrp~g~~~a~~ra-l~aAs~~itdp~~~~~~~-----~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr---~rL 80 (634) +++-+..+ ..+ .+++.+.. ..|+.. .+..-.+-|+...+++|+ --+++ ++| T Consensus 1 ~~~~~~~~---~~~~~~~~~~~~----~~f~~~~~~~~~~~~y~~~~~~~~~~~~e-------------pp~~~~~la~l 60 (345) T protein:vir:37 1 MKTNVKTD---NKKGIVIAPIND----RTFSLNEISASPALDYVGIGFDENYNCYL-------------PPVNRHALAKL 60 (345) T ss_pred CCCCcccc---chhhcccCccee----EEeecCCcccccchhhhhhhhcCCccccC-------------CCCCHHHHHHH Confidence 22211111 111 11111111 122221 011112224444444443 11111 111 Q ss_pred E-EeeecccCCCCCCCCCCCCcccHHHHHHHH-hhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccc Q lcl|NC_011057. 
81 V-ASELDENTGLPTGGISEDNTEGERVREIVS-KIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVR 158 (634) Q Consensus 81 ~-aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~-~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~ 158 (634) + ++.+- +++-. -+..-+.. -....-+...++ ++++..+-+=|.+|+.+ .|.+. |. T Consensus 61 ~~~~~~h-------~~~i~-----~k~n~l~~~~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~~-~rn~~-------G~-- 117 (345) T protein:vir:37 61 PHQNAQH-------GGILH-----SRANMVSSLYEGGKALSRMDM-RALCLNLIQFGDVGLLK-VRNGF-------GQ-- 117 (345) T ss_pred hhccccc-------cccee-----eechHHHhhccCCCCCCHHHH-HHHHHHHHhcCCeEEEE-EEcCC-------Cc-- Confidence 1 00000 00000 00000001 123445666666 57778877889999875 45433 32 Q ss_pred cchhceeccHHHHhccCCCcceeeE-----eCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHH Q lcl|NC_011057. 159 TRQEWYAVSKEEIKKSNKGSGTNIV-----LPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKT 233 (634) Q Consensus 159 ~~~~W~~vt~~Ei~~~~~~~~~~i~-----lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~ 233 (634) . -..+.+....+.+...++..-.. ..+|+.++|..+ | ||++=+|+|..-..--||..+++.++.-=...++. T Consensus 118 ~-~~L~pl~~~~vr~~~d~~~~~~~~~~~~~~~g~~~~~~~~-d-Vihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~ 194 (345) T protein:vir:37 118 V-VRLVPLSSLYLRVRKDGGYSYLMKKSLYDTAQEIYRYDAK-D-IIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVF 194 (345) T ss_pred E-EEEEEEcCceeEEEEeCCeeEEEEEeEecCCceEEEEccc-c-EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHH Confidence 2 23445555555554443332211 235777777653 3 56666677777667788988888765432222222 Q ss_pred HHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeech Q lcl|NC_011057. 234 IANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPG 313 (634) Q Consensus 234 I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~ 313 (634) -++.-+.=..-.|||.+|... 
...-..+.|.+.+ ++-......=.+++..|+ T Consensus 195 ~~~~f~NG~~p~~Il~~~d~~--------------------l~~e~~~~lk~~~--------~~~~g~~n~~~~~i~~p~ 246 (345) T protein:vir:37 195 RRRYFSNGAHMGFILYSTDPD--------------------LTEEMEEEIARKI--------SESKGVGNFRSMFVNIAN 246 (345) T ss_pred HHHHHhccCCcceEEEecCCC--------------------CCHHHHHHHHHHH--------HHhcCcccccceEEEcCC Confidence 222222222233566665210 0111334444433 332223344557777776 Q ss_pred HHhcccceeecCCch-hHHHHHHHHHHHHHHhhhccCChHHhhccc--cCcchhhHhhhhhhhhhHHHHhHHHHHHHHHH Q lcl|NC_011057. 314 EQIKDVKHIRFDNEI-TEVAIKTRNDAIARLAMGLDVSPERLLGLG--SQTNHWSAWQISDEDVQLHIAPVMEIFCQALT 390 (634) Q Consensus 314 Ehi~~ikHl~f~~d~-te~aiktR~daI~rlA~~~D~~pE~LLGlg--s~~NhwtAw~i~de~v~~hI~P~~~~i~~ait 390 (634) -.-+.+|-..++..- +.--+++|+-....||..+-||| .|+|+. +.++..++.+....-++..|.|.++.|+++|+ T Consensus 247 g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp-~llGi~~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln 325 (345) T protein:vir:37 247 GHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPA-GLSGIIPTNTGGLGDPLKYREVYHYDEVMPLQEIIAETIN 325 (345) T ss_pred CcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCH-HHhCccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 444555555554432 33358889999999999999999 789983 23455678888888999999999999999998 Q ss_pred HHHHHHHHHhcCCChhHheeeecCccccc Q lcl|NC_011057. 391 DQILRVTLAREGIDPSKYVVWYDASQLTI 419 (634) Q Consensus 391 ~~~lr~~L~~eG~d~~~yV~w~DaS~L~~ 419 (634) +. ++ -+..+++.||.-+|.. 
T Consensus 326 ~~-----~~----~~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 326 QD-----PE----IKNLLKIKFREQNFAK 345 (345) T ss_pred hh-----cc----CCCcceEEecchhhcC Confidence 63 21 1356789999999988 No 105 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=98.52 E-value=4.7e-09 Score=66.24 Aligned_cols=345 Identities=18% Similarity=0.180 Sum_probs=175.8 Q ss_pred CCCCCcc-----------------eeEeccCCCCccchhhhhhhhccCCchhh--hhhhhc--ccCccccccHHHHHHHh Q lcl|NC_011057. 1 MAATQSL-----------------RLVRRPKGGRPAPSRALTAASQPLPDPSQ--VFSKST--GISRNSDWQTDAWEAVD 59 (634) Q Consensus 1 ~~a~~~l-----------------r~vrrp~g~~~a~~ral~aAs~~itdp~~--~~~~~~--~~~~~~~WQ~eAW~~yd 59 (634) |-|-..- +..|| |....+..-+..+.++...-|++ .|+-.. .. ....|..+..+++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~f~fg~p~~v-~~~~~~~~~~~~~~ 78 (376) T protein:vir:10 1 MPARDRPRAARRRRHSFIFIHGVLRMSKR-RSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPV-MNRAEILDYVECWS 78 (376) T ss_pred CCCCccchhhhhhcccchhhcccccchhc-cCCCcccchhhhhHhhhccCcceeEEEEcCCceec-cCcchhhhhhhhhh Confidence 3332222 22222 11111111111111111111111 121100 00 11112222222221 Q ss_pred hhhhHHHHHhhhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHh------hcCCcchHHHHHHHHHHhhcc Q lcl|NC_011057. 60 LVGELRYYVGWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSK------IADGTLGQAALTKRVVECLTV 133 (634) Q Consensus 60 ~VgELryyvgWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~------iagG~lGQaqL~kR~~~~LtV 133 (634) . | +| |--=|+.+ |+ ...-+.++-+.++....+. 
+....+.+.++ ++++..+-+ T Consensus 79 ~-~------~~----------~~pp~~~~-~L--a~~~~~~~~h~s~l~~k~n~l~~~~~Pnp~lT~~~f-~~~v~d~ll 137 (376) T protein:vir:10 79 N-G------EW----------FEPPVSFA-GL--AKSFRASTHHSSALFFKANVLASTFRPHRWLSRHAF-ERWALDFLT 137 (376) T ss_pred c-C------ce----------ecCCCCHH-HH--HHHHhhhHHhhhhHHHHhHHHHhccCCCCCCCHHHH-HHHHHHHHh Confidence 1 1 12 11112211 00 0111233333333332222 24666777776 577888778 Q ss_pred ccceEEEEEEecCCCCCCCcccccccchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCC Q lcl|NC_011057. 134 PGELWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEP 213 (634) Q Consensus 134 pGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~ea 213 (634) =|.+|+.+. |.+ +|. .-..+.|....+++...+++.-+...+|+.++|.. +-+|++=+|+|..--.- T Consensus 138 ~Gnay~~~~-rn~-------~G~---~~~L~pl~~~~vr~~~d~~~~~~~~~~~~~~~~~~--~eViHir~~~~~~~~yG 204 (376) T protein:vir:10 138 FGNGYLERR-RNM-------VGG---TLRLEPALAKYVRRKADFNGFVYVNGWQERHEFEP--DSVFQLVRPDINQEVYG 204 (376) T ss_pred cCCeEEEEE-ECC-------CCC---EEEEEEeCCcceEEEeeCCeEEEEEcCCeEEEEcc--ccEEEecCCCCCCCccc Confidence 899998763 433 332 23466677777776666666667777888888865 33456656777766666 Q ss_pred ccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecc-cccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHh Q lcl|NC_011057. 214 DSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPH-EMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAE 292 (634) Q Consensus 214 DSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~-e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~ 292 (634) =||..+++.++---...++.-++--+.=..-.|||.+|. 
.++ .-..+.|.+.+- T Consensus 205 ls~~~~a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~d~~l~---------------------~e~~~~lr~~~~---- 259 (376) T protein:vir:10 205 LPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQK---------------------QDDVDNMRDALK---- 259 (376) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCC---------------------HHHHHHHHHHHH---- Confidence 789998888766444444333333333333446676653 111 124455555552 Q ss_pred hcccCccccccccceeEeechHHhcccceeecCCch-hHHHHHHHHHHHHHHhhhccCChHHhhcccc--CcchhhHhhh Q lcl|NC_011057. 293 TAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEI-TEVAIKTRNDAIARLAMGLDVSPERLLGLGS--QTNHWSAWQI 369 (634) Q Consensus 293 tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~-te~aiktR~daI~rlA~~~D~~pE~LLGlgs--~~NhwtAw~i 369 (634) +-......=.+++..|+-.-+.+|-..++..- +.--+++|+.....||..+-||| .|+|+-. .+|..++.+. T Consensus 260 ----~~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp-~llGi~~~~t~~~sn~eq~ 334 (376) T protein:vir:10 260 ----NAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPP-QLLGIVPSNSGGFGTPDTA 334 (376) T ss_pred ----HhcCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCH-HHhcccCCCCCCcccHHHH Confidence 21122223345556665333344555454332 23358899999999999999999 7999832 3467888999 Q ss_pred hhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCch Q lcl|NC_011057. 
370 SDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKS 424 (634) Q Consensus 370 ~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t 424 (634) ...=++.-|.|.++.|.+ |++ +| | .+||-||..+|.--.-++ T Consensus 335 ~~~f~~~~L~Pl~~~iee-ln~-~L-------~----~~~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 335 ARVFGRNEIRPLQARFAE-LND-WL-------G----EEVVRFDDYEIPPAPVAA 376 (376) T ss_pred HHHHHHHHHHHHHHHHHH-HHh-hc-------c----ccccccChhHhhcccccC Confidence 998999999999999975 554 22 1 234556666665442333 No 106 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=98.49 E-value=1.7e-08 Score=63.19 Aligned_cols=321 Identities=14% Similarity=0.193 Sum_probs=175.8 Q ss_pred eEeccCCCCccch-hhhhhhhccCCchhhhhhhh--cccCccccccHHHHHHHhhhhhHHHH---Hh-------hhhhce Q lcl|NC_011057. 9 LVRRPKGGRPAPS-RALTAASQPLPDPSQVFSKS--TGISRNSDWQTDAWEAVDLVGELRYY---VG-------WRASSC 75 (634) Q Consensus 9 ~vrrp~g~~~a~~-ral~aAs~~itdp~~~~~~~--~~~~~~~~WQ~eAW~~yd~VgELryy---vg-------Wr~~s~ 75 (634) ..||-+- .+++. ...+++++.+ ..|+-. .-. ....|-.+..+++. .|+ || ++ .++|.. T Consensus 1 ~~~~~~~-~~~~~~~~~~~~~~~~----~~~~~~~p~~v-~~~~~~~~~~~~~~-~~~--~~~pp~~~~~la~~~~a~~~ 71 (344) T protein:vir:56 1 MSKKKGK-TPQPAAKTMTASAPKM----EAFTFGEPVPV-LDRRDILDYVECIS-NGR--WYEPPVSFTGLAKSLRAAVH 71 (344) T ss_pred CCCCCCC-CCchhhHHhhcCCCce----EEEEcCCceee-cCcchhhhHHHhhh-cCc--cccCCCCHHHHHHHHhhhhh Confidence 3333332 22222 2333333333 233221 001 11223333333321 132 22 00 111111 Q ss_pred eeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHH-HhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcc Q lcl|NC_011057. 76 SRCRLVASELDENTGLPTGGISEDNTEGERVREIV-SKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPD 154 (634) Q Consensus 76 Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv-~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~d 154 (634) ..-=||+ +..-+. .-+...-+.+.++ ++++.++.+=|.+|+.+ .|.+. 
T Consensus 72 h~s~i~~----------------------k~n~l~~~~~Pnp~~t~~~f-~~~~~d~ll~Gnay~~~-~rn~~------- 120 (344) T protein:vir:56 72 HSSPIYV----------------------KRNILASTFIPHPWLSQQDF-SRFVLDFLVFGNAFLEK-RYSTT------- 120 (344) T ss_pred hCcccee----------------------hhhhHHhhcCCCCCCCHHHH-HHHHHHHHhcCCeEEEE-EECCC------- Confidence 0000000 011111 1124556777777 78999988999999886 35332 Q ss_pred cccccchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_011057. 155 GSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTI 234 (634) Q Consensus 155 g~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 234 (634) |. .-.-+.+....+.+...++..-++..+|+.++|.. +-||++=+|+|..--.--||..+++.++. +.... T Consensus 121 G~---~~~L~pl~~~~v~~~~~~~~~~~~~~~g~~~~~~~--~dIiHir~~~~~~~~~Gls~~~~a~~si~----l~~~a 191 (344) T protein:vir:56 121 GK---VIRLETSPAKYTRRGVEEDVYWWVPSFNEPTAFAP--GSVFHLLEPDINQELYGLPEYLSALNSAW----LNESA 191 (344) T ss_pred Cc---EEEEEEeCCceeEEeecCCEEEEEecCCeEEEEcC--ccEEEECCCCCCCCcccccHHHHHHHHHH----HHHHH Confidence 32 23355566677777666655556667888888865 44577778888776677788888776654 22222 Q ss_pred HHHHHhHhhhCc-----eeeeccc-ccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcccccccccee Q lcl|NC_011057. 235 ANASKSRLIGNG-----VLFVPHE-MSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVI 308 (634) Q Consensus 235 ~na~~SRL~gnG-----vlfvP~e-~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiv 308 (634) . ....|...|| ||.+|.. ++ .-..+.|.+.+- +-...-+.=++| T Consensus 192 ~-~~~~~~f~NGa~pg~Il~~~d~~ls---------------------~e~~~~lk~~~~--------~~~g~~~~r~l~ 241 (344) T protein:vir:56 192 T-LFRRKYYENGAHAGYIMYVTDAVQD---------------------RNDIEMLRENMV--------KSKGRNNFKNLF 241 (344) T ss_pred H-HHHHHHHhccCCCceEEEecCCCCC---------------------HHHHHHHHHHHH--------HhcCCCCccceE Confidence 2 2344555554 5655531 11 113444444443 212223577889 Q ss_pred EeechHHhcccceeecCCchhH-HHHHHHHHHHHHHhhhccCChHHhhccc--cCcchhhHhhhhhhhhhHHHHhHHHHH Q lcl|NC_011057. 
309 AGVPGEQIKDVKHIRFDNEITE-VAIKTRNDAIARLAMGLDVSPERLLGLG--SQTNHWSAWQISDEDVQLHIAPVMEIF 385 (634) Q Consensus 309 a~vP~Ehi~~ikHl~f~~d~te-~aiktR~daI~rlA~~~D~~pE~LLGlg--s~~NhwtAw~i~de~v~~hI~P~~~~i 385 (634) +..|+..-+.+|-..++..-.+ --+++|+.....||..+-||| .|+|+- +..|..++.++...=++..|.|.++.| T Consensus 242 l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp-~llGi~~~~t~~~~n~eq~~~~f~~~tL~Pl~~~i 320 (344) T protein:vir:56 242 LYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPF-QLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRI 320 (344) T ss_pred EecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCH-HHhccCCCCCCccccHHHHHHHHHHHHHHHHHHHH Confidence 9999866666777777654443 358999999999999999999 799973 224566788888888999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCC Q lcl|NC_011057. 386 CQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPD 422 (634) Q Consensus 386 ~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd 422 (634) ++ +.+. | ..+=+...+|.+ ..+.- T Consensus 321 e~-~n~~-l----~~~~~~F~~y~l-------~~~~~ 344 (344) T protein:vir:56 321 RE-INGW-I----GQEVIRFKNYSL-------DTDNG 344 (344) T ss_pred HH-HHhh-h----ccccccCCCccc-------cccCC Confidence 76 4432 2 122224444444 33311 No 107 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=98.38 E-value=4.4e-08 Score=60.94 Aligned_cols=323 Identities=18% Similarity=0.199 Sum_probs=169.6 Q ss_pred CCCCCcceeEeccCCCCccchhhhhh-hhccCCchhhhhhhh--cccCccccccHHHHHHHhhhhhHHHHHhhhhhceee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTA-ASQPLPDPSQVFSKS--TGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSR 77 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~a-As~~itdp~~~~~~~--~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr 77 (634) |. .|-+. +++ +.++ +++++ ..|+-. .-+. ...|..+.++++.. 
|+ |...-+|+ T Consensus 1 m~--------~~~~~--~~~--~~~~~~~~~~----~~~~~~~p~~~~-~~~~~~~~~~~~~~-~~------~~~pp~~~ 56 (340) T protein:vir:98 1 MS--------KRKPR--KAV--AMTASAPQKM----EAFTFGEPVPVL-DKRDILDYVECISN-GK------WYEPPVSF 56 (340) T ss_pred CC--------CCCCC--ccc--cccccCccce----eEEEcCCceeec-Ccchhhhhhhhhhc-Cc------eecCCCCH Confidence 32 11111 111 1111 11111 233221 1111 22244444444422 31 22222221 Q ss_pred eeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHh------hcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCC Q lcl|NC_011057. 78 CRLVASELDENTGLPTGGISEDNTEGERVREIVSK------IADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPA 151 (634) Q Consensus 78 ~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~------iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~ 151 (634) .=| | .+-+.++-+..+....+. +..+.+.+.++ ++++.++-+=|..|+.++ |.+. T Consensus 57 ~~l-a------------~l~~a~~~h~s~i~~k~n~l~~~~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~~~-rn~~---- 117 (340) T protein:vir:98 57 SGL-A------------KSLRSAVHHSSPIYVKRNVLASTYIPHPLLSRQDF-SRFALDYLVFGNAFLEQR-HSVT---- 117 (340) T ss_pred HHH-H------------HHHHhccccchhhhhhhhHHhhccCCCCCCCHHHH-HHHHHHHHhcCCeEEEEE-ECCC---- Confidence 100 0 000111111111111111 24455666665 678888888899999864 5332 Q ss_pred CcccccccchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhh Q lcl|NC_011057. 152 QPDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTT 231 (634) Q Consensus 152 ~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rtt 231 (634) |. .-..+.+....+.+...++..-....+|.+++|.. 
+-||++=+|+|..-..--||..+++.++.-=.- T Consensus 118 ---G~---~~~L~pl~~~~vr~~~~~~~~~~~~~~~~~~~~~~--~eViHir~~~~~~~~~Gls~~~~a~~si~l~~a-- 187 (340) T protein:vir:98 118 ---GQ---LIKLLTSPAKYTRRGVDDSVFWFVENFTQPHEFAP--DTVFHLLEPDINQEIYGLPEYLSALNSAWLNES-- 187 (340) T ss_pred ---Cc---EEEEEEeCCceEEEcccCcEEEEEecCCeEEEEcc--ccEEEEcCCCCCCCcccccHHHHHHHHHHHHHH-- Confidence 32 23355566667766655544445566788888855 335677678888777778898888776532221 Q ss_pred HHHHHHHHhHhhhCc-----eeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccc Q lcl|NC_011057. 232 KTIANASKSRLIGNG-----VLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIP 306 (634) Q Consensus 232 k~I~na~~SRL~gnG-----vlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vP 306 (634) ...-+.|...|| ||.+|.. . ...-..+.|.+.+ ++-......=. T Consensus 188 ---a~~~~~~~f~NGa~pg~il~~~~~------~--------------ls~e~~~~lk~~~--------~~~~G~~n~~~ 236 (340) T protein:vir:98 188 ---ATLFRRKYYQNGAHAGYIMYVTDP------A--------------QSATDVESLRDAM--------RNSKGLGNFKN 236 (340) T ss_pred ---HHHHHHHHHhccCCCceEEEecCC------C--------------CCHHHHHHHHHHH--------HHhcCccccCc Confidence 222334555555 6666521 0 1112344444433 32222333446 Q ss_pred eeEeechHHhcccceeecCCchh-HHHHHHHHHHHHHHhhhccCChHHhhccc--cCcchhhHhhhhhhhhhHHHHhHHH Q lcl|NC_011057. 307 VIAGVPGEQIKDVKHIRFDNEIT-EVAIKTRNDAIARLAMGLDVSPERLLGLG--SQTNHWSAWQISDEDVQLHIAPVME 383 (634) Q Consensus 307 iva~vP~Ehi~~ikHl~f~~d~t-e~aiktR~daI~rlA~~~D~~pE~LLGlg--s~~NhwtAw~i~de~v~~hI~P~~~ 383 (634) +++..|+-.-+.+|-..++..-. 
.--+++|+..+..||..+-||| .|+|+- +.+|..+.-+....=++..|.|.++ T Consensus 237 ~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp-~llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~ 315 (340) T protein:vir:98 237 LFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPF-QLMGGKPENIGSLGDVEKVAKVFVRNELSPLQD 315 (340) T ss_pred eeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCH-HHhcccCCCCCccccHHHHHHHHHHHHHHHHHH Confidence 77777764445566666654433 3458899999999999999999 799983 2345678889999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCC Q lcl|NC_011057. 384 IFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPD 422 (634) Q Consensus 384 ~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd 422 (634) .|.+ |++ +| ..+ |+-||...|.-. | T Consensus 316 ~iee-~n~-~L----~~e-------~~rF~~~~l~~~-d 340 (340) T protein:vir:98 316 RFRE-VND-WL----GME-------VIRFKEYTLDNP-E 340 (340) T ss_pred HHHH-HHh-cc----ccc-------ccccCccccccC-C Confidence 9986 543 32 112 234444444332 2 No 108 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=98.32 E-value=5.7e-08 Score=60.30 Aligned_cols=315 Identities=11% Similarity=0.125 Sum_probs=166.4 Q ss_pred EeccCCCCccchhhhhhhhccCCchhhhhhhhcc-----cCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeEEEee Q lcl|NC_011057. 10 VRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTG-----ISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRLVASE 84 (634) Q Consensus 10 vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~-----~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~ase 84 (634) .+.-|.+. 
.++.+.+ +++ -..+|+...- ..-..-|+.+.+++| T Consensus 1 ~~~~~~~~---~~~~~~~-~~~--~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~-------------------------- 48 (345) T protein:vir:37 1 MKTNVKTD---NKKGIVI-API--NDRTFSLSEITASPALDYVGIGFDENYNCY-------------------------- 48 (345) T ss_pred CCcccccc---chhhhcC-CCc--eEEEeecCCcccchhhcccceeeecCCccc-------------------------- Confidence 22222211 1111111 111 0112222100 001111111222222 Q ss_pred ecccCCCCCCCC------CCCCcccHHHHHHHHh------hcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCC Q lcl|NC_011057. 85 LDENTGLPTGGI------SEDNTEGERVREIVSK------IADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQ 152 (634) Q Consensus 85 iD~Dtg~ptG~i------~ed~~~g~r~~~iv~~------iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~ 152 (634) +|+-.. -+.++-+.++...-+. +..+-+...++ ++++.++-+=|.+|+.++ |.+ T Consensus 49 ------epp~~~~~la~~~~~~~~h~~~i~~k~n~l~~~~~Pn~~~t~~~f-~~~v~d~ll~Gnay~~i~-rn~------ 114 (345) T protein:vir:37 49 ------LPPVNRHALAKLPHQNAQHGGILHSRANMVSATYEGGKALSKMEM-RALCLNLIQFGDVGLLKV-RNG------ 114 (345) T ss_pred ------cCCCCHHHHHHHhhcchhhcchhhhhhhHHhhccCCCCCCCHHHH-HHHHHHHHhcCCeEEEEE-ECC------ Confidence 222110 1122222222221112 24566777777 567777777899998874 433 Q ss_pred cccccccchhceeccHHHHhccCCCcceeeEeC-----CCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHH Q lcl|NC_011057. 153 PDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLP-----TGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREI 227 (634) Q Consensus 153 ~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP-----~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI 227 (634) .|. . -+.+.+....+.+...++..-.... 
.|+.++|..+ -+|++=+|+|..-..--||..+++.++- T Consensus 115 -~G~--~-~~L~pl~~~~vr~~~d~~~~~~~~~~~~~~~g~~~~~~~~--eViHir~~~~~~~~~Gl~~~~~a~~si~-- 186 (345) T protein:vir:37 115 -FGQ--V-VRLVPLSSLYLRVHKDGGYSYLMKKSLYDTAQEIYRYDAK--DIIFIKLYDPMQQVYGSPDYVGGIQSAL-- 186 (345) T ss_pred -CCC--E-EEEEEecCceeEEeecCCeeEEEeeeeeccCceEEEEccc--cEEEEcCCCCCCCcccchHHHHHHHHHH-- Confidence 232 1 2344455566655544432222211 3667777543 3456656777666666678888776553 Q ss_pred HhhhHHHHHHHHhHhhhCc-----eeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccc Q lcl|NC_011057. 228 VRTTKTIANASKSRLIGNG-----VLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQA 302 (634) Q Consensus 228 ~rttk~I~na~~SRL~gnG-----vlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~A 302 (634) +... ...-+.+...|| ||.++.. . ...-..+.|.+.|- +...-. T Consensus 187 --l~~~-a~~~~~~~f~NGa~~~~Il~~t~~------~--------------l~~e~~~~lk~~~~--------~~~g~~ 235 (345) T protein:vir:37 187 --LNSD-ATVFRRRYFSNGAHMGFILYSTDP------D--------------LTEEMEEEIARKIS--------ESKGVG 235 (345) T ss_pred --HHHH-HHHHHHHHHhccCCcceEEEeCCC------C--------------CCHHHHHHHHHHHH--------HhcCcc Confidence 2221 122234555555 5544421 0 11224455554443 222234 Q ss_pred cccceeEeechHHhcccceeecCCchh-HHHHHHHHHHHHHHhhhccCChHHhhccc--cCcchhhHhhhhhhhhhHHHH Q lcl|NC_011057. 303 AFIPVIAGVPGEQIKDVKHIRFDNEIT-EVAIKTRNDAIARLAMGLDVSPERLLGLG--SQTNHWSAWQISDEDVQLHIA 379 (634) Q Consensus 303 A~vPiva~vP~Ehi~~ikHl~f~~d~t-e~aiktR~daI~rlA~~~D~~pE~LLGlg--s~~NhwtAw~i~de~v~~hI~ 379 (634) +.-++++..|+..-+.+|-..++..-. .--+++|+..+..||..+-||| .|+|+- +.++..++-+....=++..|. 
T Consensus 236 n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp-~liGi~~~~t~~~s~~e~~~~~f~~~~l~ 314 (345) T protein:vir:37 236 NFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPA-GLSGIIPTNTGGLGDPLKYREVYHYDEVM 314 (345) T ss_pred ccCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCH-HHhccccCCCCCcccHHHHHHHHHHHHHH Confidence 556888888875444566666655433 3367889999999999999999 888973 234566777777777888899 Q ss_pred hHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCccccc Q lcl|NC_011057. 380 PVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTI 419 (634) Q Consensus 380 P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~ 419 (634) |.++.|.++|++. ++ -+..|++.||..+|.. T Consensus 315 P~~~~ie~~ln~~-----~e----~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 315 PLQEIIAETINQD-----PE----IKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHhhhh-----hc----cCCcceEEECchhhcC Confidence 9999999999862 21 1257999999999988 No 109 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=98.32 E-value=1.9e-07 Score=57.46 Aligned_cols=331 Identities=18% Similarity=0.165 Sum_probs=172.3 Q ss_pred CCCCCcceeEeccCCCCccch-hh-hhhhhccCCchhhhhhhhc--ccCccccccH---HHHHHHh---------hhhhH Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPS-RA-LTAASQPLPDPSQVFSKST--GISRNSDWQT---DAWEAVD---------LVGEL 64 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~-ra-l~aAs~~itdp~~~~~~~~--~~~~~~~WQ~---eAW~~yd---------~VgEL 64 (634) |. =|=-| +....+++. .+ -+++++.. ..|+-.. -. ....|.. 
|||..-+ ..-+| T Consensus 1 ~~----~~~~~-~~~~~~~~~~~~~~~~~~~~~----~~~~~~~p~~v-~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~ 70 (351) T protein:vir:78 1 MS----KRRSR-APRTFAAAPNPSAGSAAPARA----EVFTFDDPTPV-MNRAEILDYVECWSNGEWFEPPVSFAGLAKS 70 (351) T ss_pred CC----CCCCC-CCCCCCCCCchhhhhccccee----EEEEcCCceee-cCcchhhhhhhhhccCceecCCCCHHHHHHH Confidence 21 11111 111111111 11 11111111 1221100 00 0111222 3331100 00011 Q ss_pred HHHHhhhhhcee-eeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEE Q lcl|NC_011057. 65 RYYVGWRASSCS-RCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILT 143 (634) Q Consensus 65 ryyvgWr~~s~S-r~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~ 143 (634) -....+.++.+. +..++++.+ +...-+.+.++ ++++..+-+=|.+|+.+ . T Consensus 71 ~~~~~~h~~~l~~k~n~l~~~~---------------------------~Pn~~~t~~~f-~~~~~d~ll~Gnay~~~-~ 121 (351) T protein:vir:78 71 FRASTHHSSALFFKANVLASTF---------------------------RPHRWLSRHAF-ERWALDFLTFGNGYLER-R 121 (351) T ss_pred HhhhHhhhhhhhhhhhHHhhcc---------------------------cCCCCCCHHHH-HHHHHHHHhcCCeEEEE-E Confidence 111111111111 111111111 24555667776 56777766779998875 3 Q ss_pred ecCCCCCCCcccccccchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHH Q lcl|NC_011057. 144 RPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDS 223 (634) Q Consensus 144 rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~ 223 (634) |.+ +|. 
.-..+.+....+.+....++.-+...+|.+++|..+ | +|++=+|+|.....-=||..+++.+ T Consensus 122 rn~-------~G~---~~~L~pl~~~~v~~~~~~~~~~~~~~~~~~~~~~~~-e-Vihir~~~~~~~~yGl~~~~~a~~s 189 (351) T protein:vir:78 122 RNM-------VGG---TLRLEPALAKYVRRKADFSGFVYVNGWQERHEFAPD-S-VFQLVRPDINQEVYGLPEYLSSLHS 189 (351) T ss_pred ECC-------CCC---EEEEEEecCcceEEeeeCCeEEEEecCCeEEEEccc-c-EEEEcCCCCCCCcccccHHHHHHHH Confidence 433 332 234666777788777666667777778888888763 4 4566678887776667899999888 Q ss_pred HHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcccccc Q lcl|NC_011057. 224 IREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAA 303 (634) Q Consensus 224 LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA 303 (634) +..-.-.++.-++.-+.-..-.|||.++.. . ...-..+.|.+.+ ++-..... T Consensus 190 i~l~~~a~~~~~~~f~NGa~pggIl~~~~~------~--------------ls~e~~~~lr~~~--------~~~~G~~N 241 (351) T protein:vir:78 190 AWLNESSTLFRRKYYENGSHAGFILYMTDA------A--------------QKQDDVDNMRDAL--------KNAKGPGN 241 (351) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEecCC------C--------------CCHHHHHHHHHHH--------HHhcCccc Confidence 765444444433333333333455655421 0 1112444444444 22122333 Q ss_pred ccceeEeechHHhcccceeecCCch-hHHHHHHHHHHHHHHhhhccCChHHhhccc--cCcchhhHhhhhhhhhhHHHHh Q lcl|NC_011057. 
304 FIPVIAGVPGEQIKDVKHIRFDNEI-TEVAIKTRNDAIARLAMGLDVSPERLLGLG--SQTNHWSAWQISDEDVQLHIAP 380 (634) Q Consensus 304 ~vPiva~vP~Ehi~~ikHl~f~~d~-te~aiktR~daI~rlA~~~D~~pE~LLGlg--s~~NhwtAw~i~de~v~~hI~P 380 (634) .=.+++..|+-.-+.+|-..++..- +.--+++|+-....||..+-||| .|+|+- +.+|+.++.+....=++..|.| T Consensus 242 ~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp-~llGi~~~~t~~~sn~e~~~~~f~~~~l~P 320 (351) T protein:vir:78 242 FRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPP-QLLGIVPSNSGGFGTPDTAARVFGRNEIRP 320 (351) T ss_pred ccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCH-HHhcccCCCCCCcccHHHHHHHHHHHHHHH Confidence 4455666665444555655555443 33457899999999999999999 888983 2355678888888899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCch Q lcl|NC_011057. 381 VMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKS 424 (634) Q Consensus 381 ~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t 424 (634) .++.|++ |++ + | | . +||-||..+|.--.-++ T Consensus 321 ~~~~iee-~n~-~----l---~---~-~~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 321 LQARFAE-LND-W----L---G---D-EVVRFDDYEIPPAPVAA 351 (351) T ss_pred HHHHHHH-HHh-h----c---C---c-cceecChhhhccccccC Confidence 9999986 443 2 2 2 2 24667777776553443 No 110 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=98.28 E-value=3.4e-08 Score=61.54 Aligned_cols=215 Identities=13% Similarity=0.126 Sum_probs=119.0 Q ss_pred EecCCCCCCCcccccccchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhH Q lcl|NC_011057. 143 TRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLD 222 (634) Q Consensus 143 ~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~ 222 (634) +|-.+ || +-||..... .....|+.++|.. |=||++=.|+|..-..-=||+.+|+. 
T Consensus 1 ~r~~~------dg-----~~~y~~~~~------------~~~~~g~~~~~~~--~eilH~r~~~~~~~~~Glspi~~a~~ 55 (219) T protein:vir:98 1 MRVCK------DG-----NYKYLMKKS------------LYDTKSEIYEYNK--NDVIFIKLYDPMQQVYGSPDYVGGIT 55 (219) T ss_pred Cceee------cC-----eEEEEEecc------------eecCCceeEEecc--ccEEEecCCCCCCCcceecHHHHHHH Confidence 23221 22 234443211 1112345555544 44677777888777777889888876 Q ss_pred HHHHHHhhhHHHHHHHHhHhhhCceeeeccc-ccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcccc Q lcl|NC_011057. 223 SIREIVRTTKTIANASKSRLIGNGVLFVPHE-MSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQ 301 (634) Q Consensus 223 ~LrEI~rttk~I~na~~SRL~gnGvlfvP~e-~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~ 301 (634) .+.. ..... .-+.++..||- .|.. +..|... ...-+.+.|.+-+-+ + .++. T Consensus 56 ~i~~----~~aa~-~~~~~~f~Ng~--~p~gil~~~~~~--------------l~~e~~~~~~~~~~~----~---~g~~ 107 (219) T protein:vir:98 56 SALL----NSDAT-IFRRRYYSNGA--HMGFILYSTDPD--------------MTEEMEDEIAERIRD----S---KGVG 107 (219) T ss_pred HHHH----HHHHH-HHHHHHHhcCC--CCceEEEeCCCC--------------CCHHHHHHHHHHHHH----h---cCcc Confidence 5542 22221 12334565652 3333 2222110 011245555555432 1 1222 Q ss_pred ccccceeEeechHHhcccceeecCC-chhHHHHHHHHHHHHHHhhhccCChHHhhccc--cCcchhhHhhhhhhhhhHHH Q lcl|NC_011057. 302 AAFIPVIAGVPGEQIKDVKHIRFDN-EITEVAIKTRNDAIARLAMGLDVSPERLLGLG--SQTNHWSAWQISDEDVQLHI 378 (634) Q Consensus 302 AA~vPiva~vP~Ehi~~ikHl~f~~-d~te~aiktR~daI~rlA~~~D~~pE~LLGlg--s~~NhwtAw~i~de~v~~hI 378 (634) .+ =++++..|+..-+.++...+.- ..+.--+++|+..+..||..|.||| .+||+- +..+..++-++.-.=++..+ T Consensus 108 n~-~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp-~~lG~~~~~~~~~sn~eq~~~~f~~~tL 185 (219) T protein:vir:98 108 NF-RSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPP-GLSGIIPVNTAGLGDPLKIREAYQADEV 185 (219) T ss_pred cc-cceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCH-HHcccccCCCCCccCHHHHHHHHHHHHH Confidence 22 4667777774433344444432 2245578999999999999999999 788862 24556778888888899999 Q ss_pred HhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCC Q lcl|NC_011057. 
379 APVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPD 422 (634) Q Consensus 379 ~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd 422 (634) .|.++.|.++|+++++-+ +..+ +-||.-.++-. | T Consensus 186 ~P~~~~ie~~ln~~~~~~--------~~~~-~~F~~~~~~d~-~ 219 (219) T protein:vir:98 186 LPLQEIIAESINSDYEIK--------SALK-VNFKQPEKRDK-N 219 (219) T ss_pred HHHHHHHHHHhhhhhcCC--------CccE-EeecCcccccC-C Confidence 999999999999875433 2222 23442222222 2 No 111 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=98.26 E-value=1.9e-06 Score=51.90 Aligned_cols=454 Identities=15% Similarity=0.126 Sum_probs=193.8 Q ss_pred CC--CCCcceeEeccCCCCccchhhhhhh---hccCCchhhhhhhh-cccCccccccHHHHHHHhhhhhHHHHHhhhhhc Q lcl|NC_011057. 1 MA--ATQSLRLVRRPKGGRPAPSRALTAA---SQPLPDPSQVFSKS-TGISRNSDWQTDAWEAVDLVGELRYYVGWRASS 74 (634) Q Consensus 1 ~~--a~~~lr~vrrp~g~~~a~~ral~aA---s~~itdp~~~~~~~-~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s 74 (634) |. ++..++- |-+.. +++.+.+. ++.- +. ....+. ...+.+..+......+....=.|----+|-++. T Consensus 1 ~~~p~~~~~~~---~~~~~--~~~~~~~y~~~a~~~-~~-~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~a 73 (533) T protein:vir:34 1 MKTPTIPTLLG---PDGMT--SLREYAGYHGGGSGF-GG-QLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANA 73 (533) T ss_pred CCCchhhhhhc---ccccc--hHHHHHhhhhccCCC-CC-cccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHH Confidence 22 2222221 11111 12222111 1111 10 111111 112233333332222221111111112233333 Q ss_pred eeeee-EEE-eeecccCCCCC---CCCCCCCcccHHHHHHHHhh-------------cCCcchHHHHHHHHHHhhccccc Q lcl|NC_011057. 75 CSRCR-LVA-SELDENTGLPT---GGISEDNTEGERVREIVSKI-------------ADGTLGQAALTKRVVECLTVPGE 136 (634) Q Consensus 75 ~Sr~r-L~a-seiD~Dtg~pt---G~i~ed~~~g~r~~~iv~~i-------------agG~lGQaqL~kR~~~~LtVpGE 136 (634) +.+.. -++ +-|-+. -.|. -++.+ ...+.+.+.++.. 
+.|.+.=.+|.+-+...+-+-|| T Consensus 74 v~~~~~nvVG~Gi~~~-~~p~~~~lg~~~--~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE 150 (533) T protein:vir:34 74 IQLHQDHIVGSFFRLS-HRPSWRYLGIGE--EEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGE 150 (533) T ss_pred HHHHHHHhhCCCceee-eccchhhcCCCh--hHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCc Confidence 32211 111 112111 0000 01111 1123333444333 67888888888888888999999 Q ss_pred eEEEEEEecCCCCCCCcccccccchhceeccHHHHhccCC----------------CcceeeEe----CCCC---ccccc Q lcl|NC_011057. 137 LWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNK----------------GSGTNIVL----PTGE---EHEFV 193 (634) Q Consensus 137 ~wi~il~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~----------------~~~~~i~l----P~g~---~h~~~ 193 (634) +.+.+..++.++. +-+ =+-..|..+.|..... |.-+.|-+ |+|. ..++. T Consensus 151 ~f~~~~~~~~~g~---~~~-----~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~ 222 (533) T protein:vir:34 151 LFVQATWDTSSSR---LFR-----TQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWI 222 (533) T ss_pred eEEEeeeccCCCC---ccc-----eEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCcccccccee Confidence 9999877654321 011 1233444444432111 11111111 1111 00111 Q ss_pred ----C-CCCeEEEeeCCCcccccC--CccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCcee-eecccccCCCCcCCC-C Q lcl|NC_011057. 194 ----K-GTDIIFRVWIPKPRKASE--PDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVL-FVPHEMSLPAAQGPV-S 264 (634) Q Consensus 194 ----~-~~D~~~RvW~P~prra~e--aDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvl-fvP~e~slP~~~~p~-a 264 (634) . +..-++|+++|. |.-| --|..-++|..|+.| .++..+......+.+-+. ||=+...-....... . 
T Consensus 223 ~~~~~v~a~~VlH~f~~~--r~gQ~RGis~lapvl~~l~~l---~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~ 297 (533) T protein:vir:34 223 PRELPGGRASFIHVFEPV--EDGQTRGANVFYSVMEQMKML---DTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILG 297 (533) T ss_pred eeeeccChhHeeeecccc--CCCcccCCchHHHHHHHHHHH---HHHHHHHHHHHHHhhhheeeeecCCCcccccccccC Confidence 0 223466666554 2222 234444455555444 444555555555554443 332221110000000 0 Q ss_pred cCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccc--eeEeechHHhcccceeecCCchhHHHHHHHHHHHHH Q lcl|NC_011057. 265 EVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIP--VIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIAR 342 (634) Q Consensus 265 ~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vP--iva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~r 342 (634) .........+.+.. .. ... .+.+-.-.+-| |+---|||.|+-++-=+ -+.--..+.+..++. T Consensus 298 ~~~~~~~~~~~~~~------~~-----~~~-~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~----p~~~~~~f~~~~lr~ 361 (533) T protein:vir:34 298 ANSQEQRERLTGWI------GE-----IAA-YYAAAPVRLGGAKVPHLMPGDSLNLQTAQD----TDNGYSVFEQSLLRY 361 (533) T ss_pred CCcccccccccccc------hh-----hhh-ccCcceeeccCceeeecCCCCeeeecCCCC----CCCCHHHHHHHHHHH Confidence 00000111111110 00 000 00100001111 12223444333322211 122334577888999 Q ss_pred HhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHh----HHHHHHHHHHHHHHHHHHHhcCCC------------hh Q lcl|NC_011057. 343 LAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAP----VMEIFCQALTDQILRVTLAREGID------------PS 406 (634) Q Consensus 343 lA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P----~~~~i~~ait~~~lr~~L~~eG~d------------~~ 406 (634) ||.|+.||-|.|+|==|++|++|+-+-.-|..+. +.- ++.-+|+-|++.||.-++..-.|+ +. 
T Consensus 362 iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~-~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~ 440 (533) T protein:vir:34 362 IAAGLGVSYEQLSRNYAQMSYSTARASANESWAY-FMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARS 440 (533) T ss_pred HHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHH Confidence 9999999999999943699999998776665543 111 344578889999998877553333 13 Q ss_pred Hh--eeeecCcccccCCCchHHH-HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhh Q lcl|NC_011057. 407 KY--VVWYDASQLTIDPDKSDEA-KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLI 483 (634) Q Consensus 407 ~y--V~w~DaS~L~~~pd~t~eA-~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll 483 (634) .| +-|+=..-..+||-|.-+| +..++.|..|-+++.+.+|.+-+ |..+|.|.++-. T Consensus 441 ~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~--------ev~~q~a~e~~~------------- 499 (533) T protein:vir:34 441 AWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGDDYQ--------EIFAQQVRETME------------- 499 (533) T ss_pred hhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHH--------HHHHHHHHHHHH------------- Confidence 34 5788899999999997777 77999999999999999887532 444454444322 Q ss_pred hhhhhcccCCCC--CCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCccc Q lcl|NC_011057. 
484 AGVLQQIEFPQQ--QQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGT 531 (634) Q Consensus 484 ~p~~q~~~~P~p--~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~ 531 (634) ++..++|.| +......+...+++ ++.+...++ T Consensus 500 ---~~~~gl~~~~~~~~~~~s~~~~~~~-------------~~~~~~~~~ 533 (533) T protein:vir:34 500 ---RRAAGLKPPAWAAAAFESGLRQSTE-------------EEKSDSRAA 533 (533) T ss_pred ---HHhcCCCCCCCCCcCccCCCCCCCC-------------CCcccCCCC Confidence 122222211 11111101111111 111111111 No 112 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=98.20 E-value=2.7e-06 Score=51.15 Aligned_cols=462 Identities=16% Similarity=0.097 Sum_probs=197.4 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCc---hhhhh-hhh-cccCccccccH-------HHHHHHhhhhhHHHHH Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPD---PSQVF-SKS-TGISRNSDWQT-------DAWEAVDLVGELRYYV 68 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itd---p~~~~-~~~-~~~~~~~~WQ~-------eAW~~yd~VgELryyv 68 (634) |.=+. .|.++...... +++++..... .+.- -+..+ .+. ...+.+.++.. +|-+++.--|=.+=++ T Consensus 1 m~~~~-~r~~~~~a~~~-~~~~~~~~~~-~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av 77 (553) T protein:vir:63 1 MTKVT-VRKLSEVTSGR-PEQSASLGGG-GLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAV 77 (553) T ss_pred Ccchh-hhhhccccccc-chhhhhhhcc-cccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHH Confidence 32211 12221111111 1223332211 1110 00000 011 11222222221 1222222111111111 Q ss_pred h-hhhhceee-eeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhh-------------cCCcchHHHHHHHHHHhhcc Q lcl|NC_011057. 69 G-WRASSCSR-CRLVASELDENTGLPTGGISEDNTEGERVREIVSKI-------------ADGTLGQAALTKRVVECLTV 133 (634) Q Consensus 69 g-Wr~~s~Sr-~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~i-------------agG~lGQaqL~kR~~~~LtV 133 (634) . +..|-|.- .++- +.+|.. .. .++.+ ...+.+++.++.. 
+.|.+.=.+|.+-+...+-+ T Consensus 78 ~~~~~nvVG~Gi~~~-~~~~~~-~l--~g~~~--~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~ 151 (553) T protein:vir:63 78 GYQRDSIVGAQYRLN-SMPDIN-VI--PGATE--EWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVK 151 (553) T ss_pred HHHHHhhccCCceee-eccchh-hh--cCCCH--HHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHh Confidence 1 22222221 1111 111100 00 01221 1113333333322 45887777777777888888 Q ss_pred ccceEEEEEEecCCCCCCCcccccccchhceeccHHHHhccCCC-cceeeEeCCCCcccccC-CCCeEEEeeCCCccc-- Q lcl|NC_011057. 134 PGELWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKG-SGTNIVLPTGEEHEFVK-GTDIIFRVWIPKPRK-- 209 (634) Q Consensus 134 pGE~wi~il~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~-~~~~i~lP~g~~h~~~~-~~D~~~RvW~P~prr-- 209 (634) -||+.+.+..++..+. +-+ -.-..|..+.|...... ++- .+-.| -||++ +.-+..+|++-||.. T Consensus 152 dGE~~~~~~~~~~~~~---~~~-----~~lq~ie~drl~~~~~~~~~~--~i~~G--VE~d~~Gr~vaY~i~~~hPgd~~ 219 (553) T protein:vir:63 152 TGEVLATAEWDRAANR---PYA-----TCFQMVSTDRLSNPYQQLDTP--TLRRG--VQYDKRGRPQGYWIQVAHPGDLY 219 (553) T ss_pred CCceEEEeeeccCCCC---ccc-----ceEEEechhhcCCCCCCCCCC--eeEee--eEECCCCceEEEEeeccCCCccc Confidence 9999998876644321 111 12345555555432211 010 00011 12222 222333444444432 Q ss_pred -------------------------------ccCC--ccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCcee-eeccccc Q lcl|NC_011057. 210 -------------------------------ASEP--DSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVL-FVPHEMS 255 (634) Q Consensus 210 -------------------------------a~ea--DSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvl-fvP~e~s 255 (634) .-|. -|.--++|..| ..+.++..+......+++-+. 
||=+++ T Consensus 220 ~~~~~~~~~~r~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l---~~l~~y~daeL~~a~i~A~~a~fi~~~~- 295 (553) T protein:vir:63 220 QMAPDMYKWKFVQQSKPWGRRQVIHILEPREPDQSRGIADIVSGLKDM---RMAKRFKEMSLQNAVINASYAAAIESEL- 295 (553) T ss_pred cccccccceeeeccccccChhHheecccccCCCcccCCchHHHHHHHH---HHHhHHHHHHHHHHHHhhhheeeeecCC- Confidence 2111 23333444444 445555566666555655554 554432 Q ss_pred CCCCcCCCCc---CCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccc--eeEeechHHhcccceeecCCchhH Q lcl|NC_011057. 256 LPAAQGPVSE---VEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIP--VIAGVPGEQIKDVKHIRFDNEITE 330 (634) Q Consensus 256 lP~~~~p~a~---~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vP--iva~vP~Ehi~~ikHl~f~~d~te 330 (634) |+.+..... .+.+......+.. ... ..........-.+-| |+---|||.++-++.=+= +. T Consensus 296 -~~~~~~~~~~~~~~~~~~~~~~~~~-~~~---------~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p----~~ 360 (553) T protein:vir:63 296 -PPEFIHSQMSGGSPNADMVGIFGKY-MDA---------LKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTP----GG 360 (553) T ss_pred -Chhhhhhhccccccccccccccccc-ccc---------cccccccccceeecCceeeecCCCCeeeecCCCCC----CC Confidence 222211110 0011111111000 000 000000000001111 122234443333222211 22 Q ss_pred HHHHHHHHHHHHHhhhccCChHHhhccccCcchhhHhhhhhhhhhHHH---HhHHHHHHHHHHHHHHHHHHHhcCCCh-- Q lcl|NC_011057. 
331 VAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHI---APVMEIFCQALTDQILRVTLAREGIDP-- 405 (634) Q Consensus 331 ~aiktR~daI~rlA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI---~P~~~~i~~ait~~~lr~~L~~eG~d~-- 405 (634) --..+.+..++.+|.|+.||-|.|+|==|++|++|+-+-.-|..+..- .=++..+|+-|++.||.-++..--|+- T Consensus 361 ~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~ 440 (553) T protein:vir:63 361 VGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPP 440 (553) T ss_pred CHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCC Confidence 334567889999999999999999996579999999776666544311 113456899999999988775444431 Q ss_pred -------------hHh--eeeecCcccccCCCchHHH-HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHh Q lcl|NC_011057. 406 -------------SKY--VVWYDASQLTIDPDKSDEA-KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAV 469 (634) Q Consensus 406 -------------~~y--V~w~DaS~L~~~pd~t~eA-~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v 469 (634) ..| +-|+=+.-..+||-|.-+| +.+++.|..|-+..-+++|.+-+ |..+|.|.+.- T Consensus 441 ~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G~D~~--------~v~~q~a~e~~ 512 (553) T protein:vir:63 441 GQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLGGDFR--------KSFAQRAREDA 512 (553) T ss_pred cccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHH--------HHHHHHHHHHH Confidence 123 4588888889999998777 77999999999999999997542 55666666542 Q ss_pred hcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCC Q lcl|NC_011057. 470 SKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQD 527 (634) Q Consensus 470 ~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~ 527 (634) .- +..++|.+.....+.+...+. ..+.++++..++++.++. 
T Consensus 513 ~~----------------~~~Gl~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 513 LL----------------KKYGLTFNLSAKRSLGDGRDA-ATGIAEDPAAAQTSQQGE 553 (553) T ss_pred HH----------------HHcCCCCCCCCccccCCCccc-CCCCCCCCCCCCcccccC Confidence 21 122222111111111111111 111112223333333333 No 113 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=98.17 E-value=1.2e-07 Score=58.52 Aligned_cols=331 Identities=20% Similarity=0.199 Sum_probs=168.7 Q ss_pred CCCCCcceeEeccCCCCccch-hh-hhhhhccCCchhhhhhhhcccC-ccccccH---HHHHHHhhhhhHHHHHhhhhhc Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPS-RA-LTAASQPLPDPSQVFSKSTGIS-RNSDWQT---DAWEAVDLVGELRYYVGWRASS 74 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~-ra-l~aAs~~itdp~~~~~~~~~~~-~~~~WQ~---eAW~~yd~VgELryyvgWr~~s 74 (634) |. =|=-| +....+++. .+ -+++++.. ..|+-..-.. ....|.. |||.. |+ || .-- T Consensus 1 ~~----~~~~~-~~~~~~~~~~~~~~~~~~~~~----~~~~~~~p~~v~~~~~~~~~~~~~~~----~~--~~----~pp 61 (351) T protein:vir:79 1 MS----KRRSR-APRTFAAAPNPSAGSAAPARA----EVFTFDDPTPVMNRAEILDYVECWSN----GE--WF----EPP 61 (351) T ss_pred CC----CCCCC-CCCCCCCCCchhhhhccccee----EEEEcCCceeecCcchhhhhhhhhhc----Cc--ee----cCC Confidence 21 11111 111111111 11 11111111 1221100000 0111222 33311 21 11 111 Q ss_pred eeeeeEEEeeecccCCCCCCCCCCCCcccHHH-----HHHHH-hhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCC Q lcl|NC_011057. 75 CSRCRLVASELDENTGLPTGGISEDNTEGERV-----REIVS-KIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKG 148 (634) Q Consensus 75 ~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~-----~~iv~-~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~ 148 (634) +++.-|- ..-+.++-+.++ ..++. -+....+.+.++ ++++..+-+-|.+|+.+. 
|.+ T Consensus 62 ~~~~~la-------------~~~~~~~~h~~~l~~k~n~l~~~~~Pnp~~t~~~f-~~~v~d~ll~Gnay~~~~-r~~-- 124 (351) T protein:vir:79 62 VSFAGLA-------------KSFRASTHHSSALFFKANVLASTFRPHRWLSRHAF-ERWALDFLTFGNGYLERR-RNM-- 124 (351) T ss_pred CCHHHHH-------------HHHhhhHhhhhhhhhhhhHHhhcccCCCCCCHHHH-HHHHHHHHhcCCeEEEEE-ECC-- Confidence 1110000 000011111111 11111 134566777777 678888888899998763 433 Q ss_pred CCCCcccccccchhceeccHHHHhccCCCcceeeEeCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHH Q lcl|NC_011057. 149 APAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIV 228 (634) Q Consensus 149 ~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~ 228 (634) +|. .-..+.+....+.+...+++.-+...+|..++|.. +-+|++=+|+|.....--||..+++.++--=. T Consensus 125 -----~G~---~~~L~~l~~~~v~~~~~~~~~~~~~~~g~~~~~~~--~eIihir~~~~~~~~yGl~~~~~a~~si~l~~ 194 (351) T protein:vir:79 125 -----VGG---TLRLEPALAKYVRRKADFSGFVYVNGWQERHEFEP--DSVFQLVRPDINQEVYGLPEYLSSLHSAWLNE 194 (351) T ss_pred -----CCC---EEEEEEeCCcceeeeecCCeEEEEecCceEEEEcC--ccEEEeCCCCCCCCcccccHHHHHHHHHHHHH Confidence 232 24577777888887766666677777888888865 33456667888777777789888887765433 Q ss_pred hhhHHHHHHHHhHhhhCc-----eeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCcccccc Q lcl|NC_011057. 229 RTTKTIANASKSRLIGNG-----VLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAA 303 (634) Q Consensus 229 rttk~I~na~~SRL~gnG-----vlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA 303 (634) -.++. ..|+..|| ||.+|.. . ...-..+.|.+.+-+ ++ .... T Consensus 195 ~a~~~-----~~~~f~NGa~pg~il~~~~~------~--------------ls~e~~~~lk~~~~~-~~-------G~~N 241 (351) T protein:vir:79 195 SSTLF-----RRKYYENGSHAGFILYMTDA------A--------------QKQDDVDNMRDALKN-AK-------GPGN 241 (351) T ss_pred HHHHH-----HHHHHhccCCCceEEEecCC------C--------------CCHHHHHHHHHHHHH-hc-------Cccc Confidence 33322 34455554 4554421 0 011244455544421 11 1112 Q ss_pred ccceeEeechHHhcccceeecCCchh-HHHHHHHHHHHHHHhhhccCChHHhhccc--cCcchhhHhhhhhhhhhHHHHh Q lcl|NC_011057. 
304 FIPVIAGVPGEQIKDVKHIRFDNEIT-EVAIKTRNDAIARLAMGLDVSPERLLGLG--SQTNHWSAWQISDEDVQLHIAP 380 (634) Q Consensus 304 ~vPiva~vP~Ehi~~ikHl~f~~d~t-e~aiktR~daI~rlA~~~D~~pE~LLGlg--s~~NhwtAw~i~de~v~~hI~P 380 (634) .=.+++..|+-.-+.+|-..++..-. .--+++|+.....||..+-||| .|+|+. +.+++.++.+....=++.-|.| T Consensus 242 ~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp-~llGi~~~~t~~~~n~e~~~~~f~~~~l~P 320 (351) T protein:vir:79 242 FRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPP-QLLGIVPSNSGGFGTPDTAARVFGRNEIRP 320 (351) T ss_pred cCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCH-HHhcccCCCCCCcccHHHHHHHHHHHHHHH Confidence 23455566654444555555554333 3457899999999999999999 888983 2345667888888888899999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCch Q lcl|NC_011057. 381 VMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKS 424 (634) Q Consensus 381 ~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t 424 (634) .++.|.+ |.+ + | |. +|+-||..+|.--.-++ T Consensus 321 l~~~ie~-ln~-~----l---g~----~~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 321 LQARFAE-LND-W----L---GD----EVVTFDDYEIPPAPVAA 351 (351) T ss_pred HHHHHHH-HHh-h----c---Cc----ceeeeChhhhccccccC Confidence 9999965 443 2 3 32 24566766655432222 No 114 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=98.14 E-value=1.2e-06 Score=53.07 Aligned_cols=336 Identities=14% Similarity=0.140 Sum_probs=168.5 Q ss_pred eEeccCCCCccchhhhhhhhccCCchhhhhhhh--cccCccccccHHHHHHHhhhhhHHHHHhhhhhceeee---eEEEe Q lcl|NC_011057. 9 LVRRPKGGRPAPSRALTAASQPLPDPSQVFSKS--TGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRC---RLVAS 83 (634) Q Consensus 9 ~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~--~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~---rL~as 83 (634) ..|+-+-..+. ...+.+ .+.+ .+|+-. ...- +..|-.+..+++ ..+-+|..--+++. 
+|+ T Consensus 1 m~~~~~~~~~~-~~~~~~-~~~~----~~~~~~~p~~~~-~~~~~~~~~~~~------~~~~~~~~pp~~~~~la~l~-- 65 (346) T protein:vir:10 1 MKKQLRKNLTQ-NDRLQP-QAQT----EIFSFGDPIPVL-DRADILNYLECS------AMYEKWYNPPMSFDGLAKSL-- 65 (346) T ss_pred CCcccCCCCCc-cccccc-ccCe----EEEecCCcceec-CchhHHHHHHHh------hcCCceEecCCCHHHHHHHH-- Confidence 22221111111 111111 1111 122110 0011 111222222221 11112322111111 000 Q ss_pred eecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccchhc Q lcl|NC_011057. 84 ELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEW 163 (634) Q Consensus 84 eiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~~~W 163 (634) +..+.- +++-.. .. .-+..+ -.+..+.+...++. +++.++-+=|..|+.+ .|... |. .-.. T Consensus 66 ~~~~~h----~~~i~~-k~-n~l~~l-~~~Pn~~~t~~~f~-~~~~d~ll~Gnay~~i-~r~~~-------G~---~~~L 126 (346) T protein:vir:10 66 RSSTHH----ESAIIT-KA-NILLST-CEVDSRYLSRRDLS-SFVKDYLVFGNAYFEV-VRNRL-------GQ---VQRI 126 (346) T ss_pred Hhhhhc----chhhhh-hh-hhHHHH-HhCCCCCCCHHHHH-HHHHHHHhcCCeEEEE-EEcCC-------Cc---EEEE Confidence 000000 111100 00 011122 12466778888775 5778888889999876 35332 32 2346 Q ss_pred eeccHHHHhccCCCcceeeE--eCCCCcccccCCCCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhH Q lcl|NC_011057. 164 YAVSKEEIKKSNKGSGTNIV--LPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSR 241 (634) Q Consensus 164 ~~vt~~Ei~~~~~~~~~~i~--lP~g~~h~~~~~~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SR 241 (634) +.+....+......++..+. 
..+|+.++|..+ -+|++=+|+|..-..--||..+++.++.--...++.-++--+.= T Consensus 127 ~pl~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~--dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG 204 (346) T protein:vir:10 127 ESPLAKYVRKGLEAGQFYYVPQRFDHQEHEFAKG--SIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNG 204 (346) T ss_pred EEecCCceEEEEcCCeEEEEEEccCCeEEEEecc--cEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 66777777665545544333 357888888553 35666677777666777888888877655554444433333333 Q ss_pred hhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccce Q lcl|NC_011057. 242 LIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKH 321 (634) Q Consensus 242 L~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikH 321 (634) ..-.|||.+|... ...-..+.|++.+-+ +.. . ...=-+++..|+-.-+.+|- T Consensus 205 ~~~~~il~~~d~~--------------------l~~e~~~~i~~~~~~----~~g---~-~n~~~~~vl~~~~~~~gi~~ 256 (346) T protein:vir:10 205 AHAGFVFYMSDAS--------------------QKQEDVENIRQQLKQ----SKG---V-GNFKNLFVHAPNGKKDGIQI 256 (346) T ss_pred CCCceEEEeCCCC--------------------CCHHHHHHHHHHHHH----hcC---c-cccCceeEecCCCCccceeE Confidence 3344566665311 011244555554432 211 1 11112444555433344444 Q ss_pred eecCCc-hhHHHHHHHHHHHHHHhhhccCChHHhhccc--cCcchhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHH Q lcl|NC_011057. 322 IRFDNE-ITEVAIKTRNDAIARLAMGLDVSPERLLGLG--SQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTL 398 (634) Q Consensus 322 l~f~~d-~te~aiktR~daI~rlA~~~D~~pE~LLGlg--s~~NhwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L 398 (634) ..++.. 
.+.--++.|+.....||..+-||| .|+|+- +.++..++.+....-++.-|.|.++.|++ +.+ .| T Consensus 257 ~pis~~~~d~qf~e~k~~~~~~I~~af~VPp-~llG~~~~~~~~~s~~e~~~~~f~~~~l~P~~~~iee-~n~-----~L 329 (346) T protein:vir:10 257 IPIADVSAKDEFFNIKNVSRDDVLAAHRVPP-QLMGIIPNNTGGFGNVADAAEVFFITEIEPLQERLKE-FNQ-----WL 329 (346) T ss_pred EecCCChhHHHHHHHHHHhHHHHHHHhCCCH-HHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHh-----hc Confidence 444322 233457789999999999999999 688973 23457788888888899999999999986 332 22 Q ss_pred HhcCCChhHheeeecCcccccCCC Q lcl|NC_011057. 399 AREGIDPSKYVVWYDASQLTIDPD 422 (634) Q Consensus 399 ~~eG~d~~~yV~w~DaS~L~~~pd 422 (634) | ..||-|+..+|..-.+ T Consensus 330 ---~----~e~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 330 ---G----QEVIKFKPSKLLQRTQ 346 (346) T ss_pred ---c----cceeeechhhhcccCC Confidence 2 1367788888887744 No 115 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=98.10 E-value=4.6e-06 Score=49.86 Aligned_cols=461 Identities=14% Similarity=0.102 Sum_probs=193.6 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhh-cccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKS-TGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCR 79 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~-~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (634) |--+..+-.-.||.- .+.++..-+|+. - +. ....+. ...+.++.+......+....=.|---.+|-.+++.... T Consensus 1 ~~~~~~~~~~~~~~~--~~~~~~~~~a~~-~-~~-~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 75 (530) T protein:vir:38 1 MKIPSLVGPDGKTSL--REYAGYHGGGGG-F-GG-QLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQ 75 (530) T ss_pred CccceeecCccccch--HHHhhhhcccCC-C-CC-cccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 333222221111110 001111111111 1 00 111111 11222222222222222211111111222222222111 Q ss_pred -EEEe-eecccCCCCC---CCCCC-CCc-ccHHHHHHHHhh---------cCCcchHHHHHHHHHHhhccccceEEEEEE Q lcl|NC_011057. 
80 -LVAS-ELDENTGLPT---GGISE-DNT-EGERVREIVSKI---------ADGTLGQAALTKRVVECLTVPGELWIVILT 143 (634) Q Consensus 80 -L~as-eiD~Dtg~pt---G~i~e-d~~-~g~r~~~iv~~i---------agG~lGQaqL~kR~~~~LtVpGE~wi~il~ 143 (634) -+++ -|-+. -.|. -++.+ .+. -++++...-+.= +.|.+-=.+|.+-+...+-+-||+.+.+.. T Consensus 76 ~nvVG~Gi~~~-~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~ 154 (530) T protein:vir:38 76 DHIVGSFFRLS-YRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATW 154 (530) T ss_pred HHhhCCCceee-eccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeee Confidence 1111 11110 0000 01221 111 123444433321 457777778888888888999999998876 Q ss_pred ecCCCCCCCcccccccchhceeccHHHHhc----------------cCCCcceeeEe----CCCCc-cc------ccC-C Q lcl|NC_011057. 144 RPVKGAPAQPDGSVRTRQEWYAVSKEEIKK----------------SNKGSGTNIVL----PTGEE-HE------FVK-G 195 (634) Q Consensus 144 rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~----------------~~~~~~~~i~l----P~g~~-h~------~~~-~ 195 (634) ++.++.. =+ =+-..|..+.|.. ...|.-+.|-+ |+|.. .. +.. + T Consensus 155 ~~~~g~~---~~-----~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~ 226 (530) T protein:vir:38 155 DSDSTRL---FR-----TQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGG 226 (530) T ss_pred ccCCCCc---cc-----eEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccC Confidence 6543210 00 1122333333321 11111122222 22211 00 111 2 Q ss_pred CCeEEEeeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCcee-eecccccCCCCcC-CCCcCCCCCCCc Q lcl|NC_011057. 196 TDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVL-FVPHEMSLPAAQG-PVSEVEGEEIAP 273 (634) Q Consensus 196 ~D~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvl-fvP~e~slP~~~~-p~a~~~~~~~~p 273 (634) .+-++||++|.----.---|..-++|..|+.| .++..+....-.+.+-+. ||=+.+.-..... +........+.. 
T Consensus 227 a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l---~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~ 303 (530) T protein:vir:38 227 RPSFIHVFEPMEDGQTRGANAFYSVMEQMKML---DTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSK 303 (530) T ss_pred hhHeEeeccccCCCcccCCchHHHHHHHHHHH---hHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCccccccc Confidence 33577777664211111224444455554444 444555555555554443 5544332211110 000000011111 Q ss_pred cccchHHHHHHHHHHHHHhhcccCccccccccc--eeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCCh Q lcl|NC_011057. 274 LVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIP--VIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSP 351 (634) Q Consensus 274 ~~g~~a~~~l~~ml~qva~tai~De~S~AA~vP--iva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~p 351 (634) +.+... .+... .+...-.+-| |+.--|||.|+-++.=+= +.--..+.+..++.||.|+.||- T Consensus 304 ~~~~~~-----------~~~~~-~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p----~~~~~~f~~~~lr~iaaglGi~y 367 (530) T protein:vir:38 304 LTGWLG-----------EMAAY-YSAAPVRLGGARVPHLLPGDSLNLQSAQDT----DNGYSTFEQSLLRYIAAGLGVSY 367 (530) T ss_pred ccccch-----------hhhhc-ccccceeccCceeeecCCCCeeeeeCCCCC----CCCHHHHHHHHHHHHHhhcCCCH Confidence 111100 00000 0111111112 112234444333332221 22334667889999999999999 Q ss_pred HHhhc-cccCcchhhHhhhhhhhhhHHHHh----HHHHHHHHHHHHHHHHHHHhcCCC-h-----------hHh--eeee Q lcl|NC_011057. 352 ERLLG-LGSQTNHWSAWQISDEDVQLHIAP----VMEIFCQALTDQILRVTLAREGID-P-----------SKY--VVWY 412 (634) Q Consensus 352 E~LLG-lgs~~NhwtAw~i~de~v~~hI~P----~~~~i~~ait~~~lr~~L~~eG~d-~-----------~~y--V~w~ 412 (634) |.|+| + |++|++|+-+-.-|..+. 
+.- ++.-+|.-|++.||.-++..--|+ | ..| +-|+ T Consensus 368 e~lt~D~-s~~nYSS~R~~~~e~~r~-~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~ 445 (530) T protein:vir:38 368 EQLSRNY-SQMSYSTARASANESWAY-FMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWI 445 (530) T ss_pred HHHhccc-ccccHHHHHHHHHHHHHH-HHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeee Confidence 99999 5 699999998887776654 222 234466778888888777553333 1 234 5799 Q ss_pred cCcccccCCCchHHH-HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhccc Q lcl|NC_011057. 413 DASQLTIDPDKSDEA-KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIE 491 (634) Q Consensus 413 DaS~L~~~pd~t~eA-~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~ 491 (634) =..-..+||-|.-+| +..++.|..|-+.+.+++|.+-+ |..+|.|.+.-..+ .+ ++. T Consensus 446 ~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~--------~v~~q~a~e~~~~~-----~~---------Gl~ 503 (530) T protein:vir:38 446 GSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGDDYQ--------EIFAQQVRESMERR-----AA---------GLN 503 (530) T ss_pred cCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHH--------HHHHHHHHHHHHHH-----Hc---------CCC Confidence 999999999987776 77999999999999999987532 44555554442211 11 111 Q ss_pred CCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcc Q lcl|NC_011057. 492 FPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDG 530 (634) Q Consensus 492 ~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~ 530 (634) +|...++....+. 
..+++.++|+...+ T Consensus 504 ~~~~~~~~~~~~~------------~~~~~~~~d~~~~a 530 (530) T protein:vir:38 504 PPAWAAAAFEAGV------------KKSNEEEQDGARAA 530 (530) T ss_pred CCCCcccccCCCC------------CCCCCCCCCCCCCC Confidence 2211111111010 01111111111111 No 116 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=98.10 E-value=4.6e-06 Score=49.86 Aligned_cols=470 Identities=13% Similarity=0.091 Sum_probs=207.2 Q ss_pred CCCCCc-ceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeee- Q lcl|NC_011057. 1 MAATQS-LRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRC- 78 (634) Q Consensus 1 ~~a~~~-lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~- 78 (634) +.++-+ -+-.||.++- +..++.-||+..= ....+....+.+..++.....+....=.|-.-.+|-++.+.+. T Consensus 7 ~i~~~sP~~a~~R~~ar--~~~~~y~aa~~~r----~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 7 LLEPLAPELVARRLAAR--EAIQAYEAARPGR----THKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred HhhhcchHHHHHHHHhH--HHhccccccCccc----cccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 111110 0001111110 1112333443211 1111112234455555443333333233333334545554432 Q ss_pred eEEEe----eecccCCCCCCCCCCCCcccHHHHHHHHhh---------cCCcchHHHHHHHHHHhhccccceEEEEEEec Q lcl|NC_011057. 79 RLVAS----ELDENTGLPTGGISEDNTEGERVREIVSKI---------ADGTLGQAALTKRVVECLTVPGELWIVILTRP 145 (634) Q Consensus 79 rL~as----eiD~Dtg~ptG~i~ed~~~g~r~~~iv~~i---------agG~lGQaqL~kR~~~~LtVpGE~wi~il~rp 145 (634) .-+++ -|.|. |.|. |.....++.+.+++. 
+.|.+.=.+|.+-+...+-+-||+.+.+..++ T Consensus 81 ~nvVG~~G~~i~p~---~l~~---d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~ 154 (548) T protein:vir:95 81 ERVVGGSGIGVEPL---PLRL---DGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGR 154 (548) T ss_pred HhccCccccceeee---ecCC---CHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecc Confidence 33333 22322 1111 112223444444444 67888888888888889999999999887765 Q ss_pred CCCCCCCcccccccchhceeccHHHHhccCCCcceeeEeCCCCcccccC-CCCeEEEeeCCCcccccC------------ Q lcl|NC_011057. 146 VKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVK-GTDIIFRVWIPKPRKASE------------ 212 (634) Q Consensus 146 ~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~~-~~D~~~RvW~P~prra~e------------ 212 (634) .++. .-|+..+ -.-..|..+.|.......+.. +-+| -||+. +.-+..+|++.||..... T Consensus 155 ~~~~---~~g~~~~-~~lqliepd~l~~~~~~~~~~--i~~G--IE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA 226 (548) T protein:vir:95 155 VPNY---TFATSVP-FALELLEPDYLPFSYNNLSKG--IVQG--IERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEA 226 (548) T ss_pred cccc---cCCcccc-eEEEEechhhcCCCCCCCCCc--eeee--eEECCCCceEEEEEeecCCCcccccccccceeeech Confidence 4321 0111001 123445555554222211111 1122 13444 344556666666653211 Q ss_pred ----------------CccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCce-eeecccccCCCCcCCCCcCCCCCCCccc Q lcl|NC_011057. 213 ----------------PDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGV-LFVPHEMSLPAAQGPVSEVEGEEIAPLV 275 (634) Q Consensus 213 ----------------aDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGv-lfvP~e~slP~~~~p~a~~~~~~~~p~~ 275 (634) --|..-++|..|+. +.++..+....-.+.+-+ +||=+ ..|....... 
++ T Consensus 227 ~~VlHif~~~r~gQ~RGvs~lapvl~~l~~---l~~y~dael~~aki~A~~a~fi~~--~~~~~~~~~~----~~----- 292 (548) T protein:vir:95 227 ERIIHIAYRKRIGQNRGVPMLHAVLIRLAD---LKDYEESERVAARISAALAMYIKK--GNPDSYTVEP----GK----- 292 (548) T ss_pred hHheecccccCCccccCcchHHHHHHHHHH---HhHHHHHHHHHHHHhhhheeeeec--CCCccccCCC----Cc----- Confidence 11222234444444 344444444444444433 23322 1222211100 00 Q ss_pred cchHHHHHHHHHHHHHhhcccCccccccccce--eEe-echHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChH Q lcl|NC_011057. 276 GEPAVQQLTDMLFQVAETAVEDEDSQAAFIPV--IAG-VPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPE 352 (634) Q Consensus 276 g~~a~~~l~~ml~qva~tai~De~S~AA~vPi--va~-vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE 352 (634) .+...--.+-|= |-. .|||.|+-++- +.-+.---.+.+..+|.||.|+.||-| T Consensus 293 --------------------~~~~~~~~~~pG~iv~~L~pGe~i~~~~p----~~p~~~~~~f~~~~lr~IAaglGipYe 348 (548) T protein:vir:95 293 --------------------DRKNRTIPIAPGMVFDDLEPGEDVGMIES----NRPNPFLEGFRNGQLRMIGAGTRSTYS 348 (548) T ss_pred --------------------ccccccccccCCccccccCCCceeeecCC----CCCCCCHHHHHHHHHHHHHhhcCCCHH Confidence 000000011111 111 24443322221 112334456788899999999999999 Q ss_pred HhhccccCcchhhHhhhhhhhhhHHHHh----HHHHHHHHHHHHHHHHHHHhcCC------ChhHh--eeeecCcccccC Q lcl|NC_011057. 353 RLLGLGSQTNHWSAWQISDEDVQLHIAP----VMEIFCQALTDQILRVTLAREGI------DPSKY--VVWYDASQLTID 420 (634) Q Consensus 353 ~LLGlgs~~NhwtAw~i~de~v~~hI~P----~~~~i~~ait~~~lr~~L~~eG~------d~~~y--V~w~DaS~L~~~ 420 (634) .|.|= .++|++|+-+-.-|..+. 
+.- ++..+|+-|++.||.-++..=-| ++..| +-|.=+.-..+| T Consensus 349 ~ltgD-~s~nYSS~R~~l~e~~r~-~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iD 426 (548) T protein:vir:95 349 SVSRA-YDGTYSAQRQELVEGWLG-YDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWIN 426 (548) T ss_pred HHhcc-cchhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccC Confidence 99996 467999998776665443 221 35678999999999888754333 33444 458878888899 Q ss_pred CCchHHH-HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcC-cccchhhhhhhhhhhhcccCCCCCCC Q lcl|NC_011057. 421 PDKSDEA-KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKD-PTLIPMLAPLIAGVLQQIEFPQQQQA 498 (634) Q Consensus 421 pd~t~eA-~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~d-p~Li~~laPll~p~~q~~~~P~p~~a 498 (634) |-|.-+| +.+++.|..|-++.-+++|.+- .|..+|+|.+.-..+ --|+..--|-.++.-+..+-+++++. T Consensus 427 P~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~--------~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~ 498 (548) T protein:vir:95 427 PMHEANAWELLVKAGFADEAEVARARGRDP--------RELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQK 498 (548) T ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHhCCCH--------HHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhh Confidence 9887776 7799999999999999998753 234455555442211 11111111111111111111111111 Q ss_pred CCCCCCCCCccccCCC---C--CC---CCCCC--CCCCCCcccCCCccHH Q lcl|NC_011057. 499 IDSGGNEDTSDDDNLD---D--GE---HEPDT--EDDQDDDGTQKAGLES 538 (634) Q Consensus 499 ~~~~~~~~~~~d~~~~---~--~~---~ePDT--e~d~~~~~~~~a~~~~ 538 (634) -..++..+-..|+... . 
.| .-||- |.+...+.-.+.+++- T Consensus 499 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (548) T protein:vir:95 499 VYLGVGKMLTADEARELVNRYGAGLPVPGPDFPNESNNGGADGQPSNPDP 548 (548) T ss_pred hccccccccccchhHHhhccCCCCCcCCCCCCCcccccCCCCCCCCCCCC Confidence 1011100000000000 0 00 01221 1111112222222211 No 117 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=98.01 E-value=7.1e-06 Score=48.82 Aligned_cols=435 Identities=12% Similarity=0.064 Sum_probs=196.4 Q ss_pred CCCCCcceeEeccCCCCccchh-h-hhhhhccCCc--hhhhhh-h---hcccCccccccHH-------HHHHHhhhhhHH Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSR-A-LTAASQPLPD--PSQVFS-K---STGISRNSDWQTD-------AWEAVDLVGELR 65 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~r-a-l~aAs~~itd--p~~~~~-~---~~~~~~~~~WQ~e-------AW~~yd~VgELr 65 (634) |+.+.+-+=+.-..-++.++++ + .-.+...|.- ++...+ + ....+.+.+.... |-++|.--|=.+ T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~ 80 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAK 80 (505) T ss_pred CCCCccccchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHH Confidence 5544433221111111111111 0 0001111110 000100 0 0112222322222 222232222222 Q ss_pred HHHh-hhhhcee-e-eeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhh-------cCCcchHHHHHHHHHHhhcccc Q lcl|NC_011057. 66 YYVG-WRASSCS-R-CRLVASELDENTGLPTGGISEDNTEGERVREIVSKI-------ADGTLGQAALTKRVVECLTVPG 135 (634) Q Consensus 66 yyvg-Wr~~s~S-r-~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~i-------agG~lGQaqL~kR~~~~LtVpG 135 (634) =++. +..|-|. . .++. +.++.. 
-+++.++ -++++...-+.- +.|.+.=.+|.+-+...+-+-| T Consensus 81 ~av~~~~~nvVG~~Gi~~~-~~~~~~----~~~~~~~--~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dG 153 (505) T protein:vir:96 81 RFYQLLKNNVIGPKGMTFQ-SRVKRR----NGKPDDR--ANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDG 153 (505) T ss_pred HHHHHHHHHhcCCCcceee-ecCCcc----cccccHH--HHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCC Confidence 2222 3334332 1 1111 112211 1233321 223333333332 4577777778888888889999 Q ss_pred ceEEEEEEecCCCCCCCcccccccchhceeccHHHHhccCCCc-ceeeEeCCCCcccccC-CCCeEEEeeCCCcccccCC Q lcl|NC_011057. 136 ELWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGS-GTNIVLPTGEEHEFVK-GTDIIFRVWIPKPRKASEP 213 (634) Q Consensus 136 E~wi~il~rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~-~~~i~lP~g~~h~~~~-~~D~~~RvW~P~prra~ea 213 (634) |+.+.+..|++. +-+. .-..|..+-|....... ...-.+-+|= ||+. +.-+..+|++-||...... T Consensus 154 E~f~~~~~~~~~-----~~~~-----~lqliepd~l~~~~n~~~~~~~~i~~GI--e~d~~Gr~~aY~i~~~hPgd~~~~ 221 (505) T protein:vir:96 154 EVLVREHRGYPN-----KWGY-----ALQILECDRLDLNYNADLQNGNRIRMSI--ELDAWERPVAYHLLVNHPGDNSYC 221 (505) T ss_pred ceEEEEeecCCC-----Ccce-----EEEEechhhcCCCCCcccCCcCeEEece--EECCCCceEEEEEeecCCCccccc Confidence 998887654321 2221 23445555554221100 0000011111 2333 3445556666665432111 Q ss_pred ------------------------------ccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCcee-eecccccCCCCcCC Q lcl|NC_011057. 214 ------------------------------DSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVL-FVPHEMSLPAAQGP 262 (634) Q Consensus 214 ------------------------------DSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvl-fvP~e~slP~~~~p 262 (634) -|.- ...|.-|..+.++..+....-.+.+=+. 
||=+ ..|+...+ T Consensus 222 ~~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~l---apvl~~l~~l~~y~dael~~a~i~A~~a~fi~~--~~~~~~~~ 296 (505) T protein:vir:96 222 YHYAGQTYERVPADEIIHTFVPWRPHQNRGIPWT---HASMVELHHIGEYRKSEMIAAELGAKKVGFYEQ--DPEAYDQP 296 (505) T ss_pred cccccccccccCHhHhhhhhcccCCccccCcchH---HHHHHHHHHHhHHHHHHHHHHHHhhhheeeeec--CCccCCCc Confidence 1222 2334444455555666665555555443 5522 12211111 Q ss_pred CCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCchhHHHHHHHHHHHHH Q lcl|NC_011057. 263 VSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIAR 342 (634) Q Consensus 263 ~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~r 342 (634) ..+. .|.. .+.-..-+ |.---|||.|+-++.=+= +.--..+.+..+|. T Consensus 297 ~~~~--------~~~~------------------~~~l~pG~--i~~L~pGe~i~~~~~~~p----~~~~~~f~~~~lr~ 344 (505) T protein:vir:96 297 PEDD--------QGEI------------------VEEVEAGT--YQLLPYGIRFKEHKIDHP----HTNFGAFVKSSLRG 344 (505) T ss_pred cccc--------cCcc------------------ccccCCce--eeecCCCCeeeeeCCCCC----CCCHHHHHHHHHHH Confidence 0000 0000 00000111 111234444433322111 12334677889999 Q ss_pred HhhhccCChHHhhccccCcchhhHhhhhhhhhhHHHHh----HHHHHHHHHHHHHHHHHHHhcCC-----ChhHh--eee Q lcl|NC_011057. 343 LAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAP----VMEIFCQALTDQILRVTLAREGI-----DPSKY--VVW 411 (634) Q Consensus 343 lA~~~D~~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P----~~~~i~~ait~~~lr~~L~~eG~-----d~~~y--V~w 411 (634) ||.|+.||-|.|.|==+++|++|+-+-.-|..+. 
+.- ++..+|+-|++.||.-++..--| +++.| +-| T Consensus 345 iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r~-~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w 423 (505) T protein:vir:96 345 VAAGMGPAYNRLAHDLEGVNFSSLRSGELDERDL-YKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAF 423 (505) T ss_pred HHhhcCCCHHHHhcccccccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeee Confidence 9999999999999844689999998877765543 221 34578999999999887754333 35666 668 Q ss_pred ecCcccccCCCchHHH-HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhcc Q lcl|NC_011057. 412 YDASQLTIDPDKSDEA-KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQI 490 (634) Q Consensus 412 ~DaS~L~~~pd~t~eA-~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~ 490 (634) .=+.-..+||-|.-+| +.+++.|..|-+++-+++|.+- .|.++|.|.+.-..+ .+ ++ T Consensus 424 ~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~--------~~v~~q~a~e~~~~~-----~~---------Gl 481 (505) T protein:vir:96 424 QPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDP--------EDVFDEIAWEEQLMR-----DK---------GV 481 (505) T ss_pred ccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCH--------HHHHHHHHHHHHHHH-----Hc---------CC Confidence 8888899999998877 7799999999999999988753 334445444432211 11 01 Q ss_pred cCCCCCCCCCCCCCCCCccccCCCC Q lcl|NC_011057. 491 EFPQQQQAIDSGGNEDTSDDDNLDD 515 (634) Q Consensus 491 ~~P~p~~a~~~~~~~~~~~d~~~~~ 515 (634) .++.+...... ...+++++.+.++ T Consensus 482 ~~~~~~~~~~~-~~~~~~~~~~~d~ 505 (505) T protein:vir:96 482 NPTPPEQESKD-ATTDEEDDSASDD 505 (505) T ss_pred CCCCCCCCCCC-CCCCCCCCCCCCC Confidence 11111111111 1111111111111 No 118 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=98.00 E-value=7.4e-06 Score=48.70 Aligned_cols=441 Identities=14% Similarity=0.101 Sum_probs=194.1 Q ss_pred CCCCCc-ceeEeccCCCCccchhhhhhhhccCCchhhhhhhh-cccCcccccc-------HHHHHHHhhhhhHHHHHh-h Q lcl|NC_011057. 
1 MAATQS-LRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKS-TGISRNSDWQ-------TDAWEAVDLVGELRYYVG-W 70 (634) Q Consensus 1 ~~a~~~-lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~-~~~~~~~~WQ-------~eAW~~yd~VgELryyvg-W 70 (634) +.+.-+ -+-.||.+.- +..++.-||+.. ...++. ...+.++... .+|-++|.--|=.+=++. | T Consensus 7 ~i~~~sP~~~~~R~~ar--~~~~~y~aa~~~-----r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~ 79 (502) T protein:vir:79 7 VIGVFSPGWKAARLRSR--AVIQAYEAVKTT-----RTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKL 79 (502) T ss_pred HHhhcChHHHHHHHhhH--HHHhhccccCcc-----cccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 111100 1111222210 111233333221 111111 1122222222 233334433332222222 4 Q ss_pred hhhceee--eeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhh-----cCCcchHHHHHHHHHHhhccccceEEEEEE Q lcl|NC_011057. 71 RASSCSR--CRLVASELDENTGLPTGGISEDNTEGERVREIVSKI-----ADGTLGQAALTKRVVECLTVPGELWIVILT 143 (634) Q Consensus 71 r~~s~Sr--~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~i-----agG~lGQaqL~kR~~~~LtVpGE~wi~il~ 143 (634) ..|.|.- .++-+ .++-+ .+.+.+ .-+.++.+.-+.- +.|.+.=.+|.+-+...+-+-||+.+.++. T Consensus 80 ~~nvVG~ggi~~~~-~~~~~----~~~~~~--~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~ 152 (502) T protein:vir:79 80 EERVVGKNGIIVEP-HPVLR----NGAIAR--DLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVS 152 (502) T ss_pred HHhhccCCceeeee-ccCCC----ChhHHH--HHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEee Confidence 4454431 22221 11111 112211 1123333332222 567888888888888888999999999877 Q ss_pred ecCCCCCCCcccccccchhceeccHHHHhccCCCccee----eEeC-CCCc---------------c--cccCCCCeEEE Q lcl|NC_011057. 144 RPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTN----IVLP-TGEE---------------H--EFVKGTDIIFR 201 (634) Q Consensus 144 rp~~~~~~~~dg~~~~~~~W~~vt~~Ei~~~~~~~~~~----i~lP-~g~~---------------h--~~~~~~D~~~R 201 (634) .+.+. ..+|...+ =.-..|..+.|.... .++-. |++- .|.+ . 
+++ ...-++| T Consensus 153 ~~~~~---~~~g~~~~-l~lq~iepd~l~~~~-~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rv-pA~~vlH 226 (502) T protein:vir:79 153 GRINS---LTPSAGVH-FWLEALEPDFIPMTS-DESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEV-DAERMLH 226 (502) T ss_pred cccCc---cCCCcccc-eEEEEecchhcCCCC-CCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEe-chhheEE Confidence 43321 11222111 122333444442111 11110 2210 0111 1 111 1223666 Q ss_pred eeCCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCce-eeecccccCCCCcCCCCcCCCCCCCccccchHH Q lcl|NC_011057. 202 VWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGV-LFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAV 280 (634) Q Consensus 202 vW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGv-lfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~ 280 (634) +++|.----.---|..-++|..|+.|- ++..+....-.+.+-+ +||=++. |....+.....+ .+. T Consensus 227 ~f~~~r~gQ~RGis~lapvl~~l~~l~---~~~dael~~a~i~A~~~~fi~~~~--~~~~~~~~~~~~------~~~--- 292 (502) T protein:vir:79 227 LKFVRRLHQMRGTSLLSGVLIRLSALK---EYEDSELTAARIAAALGMYIRKGD--GQSYEPDGNGSK------ENE--- 292 (502) T ss_pred eecccCCccccCCchHHHHHHHHHHHh---HHHHHHHHHHHHhhhheeeeecCC--CcccccccCCCC------Ccc--- Confidence 666532212222344555666555544 3444444444444433 2333222 212111110000 000 Q ss_pred HHHHHHHHHHHhhcccCccccccccc--eeEe-echHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhcc Q lcl|NC_011057. 281 QQLTDMLFQVAETAVEDEDSQAAFIP--VIAG-VPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGL 357 (634) Q Consensus 281 ~~l~~ml~qva~tai~De~S~AA~vP--iva~-vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGl 357 (634) +. -.+-| ||-. -|||.++-++. . 
.-+.-...+-+-.++.||.|++||-|.|+|= T Consensus 293 ----------------~~---~~l~pG~i~~~L~pGe~i~~~~p---~-~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D 349 (502) T protein:vir:79 293 ----------------RE---LTIQPGIIYDDLKPGEEIGMVKS---D-RPNPNLETFRNGQLRAVAAGSRLSFSSTARN 349 (502) T ss_pred ----------------cc---ccccCCccccccCCCceeeeeCC---C-CCCCCHHHHHHHHHHHHHhhcCCCHHHHhcc Confidence 00 00112 1111 23333332221 1 1223335667788899999999999999996 Q ss_pred ccCcchhhHhhhhhhhhhHHHH---hHHHHHHHHHHHHHHHHHHHhcCC------ChhHh--eeeecCcccccCCCchHH Q lcl|NC_011057. 358 GSQTNHWSAWQISDEDVQLHIA---PVMEIFCQALTDQILRVTLAREGI------DPSKY--VVWYDASQLTIDPDKSDE 426 (634) Q Consensus 358 gs~~NhwtAw~i~de~v~~hI~---P~~~~i~~ait~~~lr~~L~~eG~------d~~~y--V~w~DaS~L~~~pd~t~e 426 (634) .+.|++|+-+-.-|..+.--. =++..+|+-|++.||.-++..-.| ++..| +-|.=+.-..+||-|.-+ T Consensus 350 -~s~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~ 428 (502) T protein:vir:79 350 -YNGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAE 428 (502) T ss_pred -ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHH Confidence 456999998777666443111 134578999999999888755333 33555 458888888999998877 Q ss_pred H-HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCC Q lcl|NC_011057. 427 A-KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNE 505 (634) Q Consensus 427 A-~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~ 505 (634) | +.+++.|..|-+++-+..|.+-+ |..+|.|.++-.- ..+ ++.||.-+.....+... T Consensus 429 a~~~~i~~Gl~t~~~~~a~~G~D~~--------~v~~q~a~e~~~~-----~~~---------Gl~~~~~~~~~~~~~~~ 486 (502) T protein:vir:79 429 AWKIQIRGGAATESDWVRAGGRNPD--------DVKRRRKAEIDEN-----RKL---------DLVFDTDPASDKGGSSA 486 (502) T ss_pred HHHHHHHcCCCCHHHHHHHcCCCHH--------HHHHHHHHHHHHH-----HHc---------CCCCCCCCCCCCCCCCC Confidence 7 77999999999999999887532 4455555443221 111 11222111111112222 Q ss_pred CCccccCCCCCCCCCC Q lcl|NC_011057. 
506 DTSDDDNLDDGEHEPD 521 (634) Q Consensus 506 ~~~~d~~~~~~~~ePD 521 (634) +.+++++.+.+++.-| T Consensus 487 ~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 487 ATKRQEPQHTDDQSEE 502 (502) T ss_pred CCCCCCCCCCCCCCCC Confidence 2222222111111101 No 119 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=92.12 E-value=0.013 Score=30.97 Aligned_cols=442 Identities=11% Similarity=0.081 Sum_probs=172.1 Q ss_pred CCC----------CCccee------EeccCCCCccchhhhhh--hhccCCchhhhhhhhcccCccccccH---------- Q lcl|NC_011057. 1 MAA----------TQSLRL------VRRPKGGRPAPSRALTA--ASQPLPDPSQVFSKSTGISRNSDWQT---------- 52 (634) Q Consensus 1 ~~a----------~~~lr~------vrrp~g~~~a~~ral~a--As~~itdp~~~~~~~~~~~~~~~WQ~---------- 52 (634) |++ .-.++. .|.|....|...-++.. |-.-..+....+....+..+.+.+.. T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 104 (537) T protein:vir:10 25 VGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIGH 104 (537) T ss_pred cCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCccH Confidence 111 111221 12333333322211111 11112233344443333333333311 Q ss_pred HHHHHHhhhhhHHHHHhhhhhceeee-eEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhh Q lcl|NC_011057. 53 DAWEAVDLVGELRYYVGWRASSCSRC-RLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECL 131 (634) Q Consensus 53 eAW~~yd~VgELryyvgWr~~s~Sr~-rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~L 131 (634) +.=.+|..-++.|=+|.=.+.-|.|. 
+-+-++ |.+ .++ -+.++.|-+.+- .|.=.+.++.+...- T Consensus 105 ~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~-~~~------~~~-----~~~~~~l~~~~~--~l~~~~~l~~a~~~~ 170 (537) T protein:vir:10 105 QMCALIATHWLVNKACSQMPRDAMRKGYKIISD-DGN------ELD-----PKDAKFIDRYDR--AFNIKKHAIQFVRKG 170 (537) T ss_pred HHHHHHHhCchhhhhhhhhhHHhhcCCceeecC-Ccc------ccc-----HHHHHHHHHHHH--HhhHHHHHHHHHHhc Confidence 11134444455555555555544332 222221 111 111 133344333222 233344455555554 Q ss_pred ccccceEEEEEEecCCCC-CC---Cccccccc--chhceeccH--------HHHhccCCC----cceeeEeCCCCccccc Q lcl|NC_011057. 132 TVPGELWIVILTRPVKGA-PA---QPDGSVRT--RQEWYAVSK--------EEIKKSNKG----SGTNIVLPTGEEHEFV 193 (634) Q Consensus 132 tVpGE~wi~il~rp~~~~-~~---~~dg~~~~--~~~W~~vt~--------~Ei~~~~~~----~~~~i~lP~g~~h~~~ 193 (634) -+=|-.+|+|+..-.++. -+ ++++ +.. ...-.++++ .++.....+ .-..+.+.+-.-| T Consensus 171 rlyG~~~i~i~v~~~D~~~~~~Pl~~~~-i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v~g~~iH--- 246 (537) T protein:vir:10 171 RIFGIRIALFKVDSPDPYYYEKPFNIDG-VMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLINGKKYH--- 246 (537) T ss_pred ccccceEEEEeecCcCCccccccccccc-ccccceeEEEEechhhcccccchhhhccCCccccCCceeeeecCeEec--- Confidence 567999999887422111 01 1121 100 001122222 333221111 1112222221111 Q ss_pred CCCCeEEEe-eCCCcc-----cccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCC Q lcl|NC_011057. 194 KGTDIIFRV-WIPKPR-----KASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVE 267 (634) Q Consensus 194 ~~~D~~~Rv-W~P~pr-----ra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~ 267 (634) -+-+|++ =+|-|. ...--.|-.+.+++.|.=..+++..+..... ..+-.+| .+.+-. 
T Consensus 247 --~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~---~~~~~v~---k~~~~~--------- 309 (537) T protein:vir:10 247 --RSHLAIYINDEVVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAM---TKRQTVL---KVDAAQ--------- 309 (537) T ss_pred --ceeEEEecCCCCchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHH---hcCCcee---eechHH--------- Confidence 2334443 122222 2223577788888887766665554432221 1111111 011100 Q ss_pred CCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhc Q lcl|NC_011057. 268 GEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGL 347 (634) Q Consensus 268 ~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~ 347 (634) -+.+. ++|.+-+-.+. .++|- .=.+++...+|.++.++ +.|+ ++++ +.+.+...||.++ T Consensus 310 -----~l~~~---~~~~~r~~~~~--~~r~n-----~g~~~id~e~e~~e~~~-~~ls-gl~~----~l~~~~~~iAa~~ 368 (537) T protein:vir:10 310 -----VLANK---QQFDETMSWWT--ATRDN-----YQVRVVDKDNEDVVQID-TTLN-DLDK----VIMNQYQLVCAIA 368 (537) T ss_pred -----hhcCH---HHHHHHHHHHH--hhcCC-----cceeEecCCCceeEEEe-ccCC-CHHH----HHHHHHHHHHhhh Confidence 01111 12332222111 22221 11245555557666665 4555 4554 4566667799999 Q ss_pred cCChHHhhcccc-Ccc-----hhhHhhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCC Q lcl|NC_011057. 348 DVSPERLLGLGS-QTN-----HWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDP 421 (634) Q Consensus 348 D~~pE~LLGlgs-~~N-----hwtAw~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~p 421 (634) +||--+|+|... ..| --..|--.=+.+|-++.|.++.|.+-|.+..+ | .+..|-|=| ..|..-. 
T Consensus 369 ~IP~t~L~G~sp~GlnatGe~D~~~yyd~I~~~Qe~l~p~l~~l~~ll~~~~~-------~-~~~~~~i~f--~pL~~~s 438 (537) T protein:vir:10 369 RTPAPKMLGTVPTGFNSTGDYEEASYHEECESTQDDMRPLIDRHHQLVCRSHL-------R-KRIRVKVEF--PPMDAPK 438 (537) T ss_pred CCCceeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-------C-CCcceEEEe--CCCCCCC Confidence 999988988521 222 22233333355566788888877655544322 2 223332212 3443332 Q ss_pred Cc--h-------HHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhcccC Q lcl|NC_011057. 422 DK--S-------DEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIEF 492 (634) Q Consensus 422 d~--t-------~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~~ 492 (634) ++ . +....+++.|+|+.+++|.+++-..+.+|+-=. |-+.+ ....+. T Consensus 439 ~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~-----------------------~~~~~-ed~e~~ 494 (537) T protein:vir:10 439 ESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSIT-----------------------PAMRP-TDAEDI 494 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCcccccccc-----------------------CCCCh-hhhhcc Confidence 22 2 234569999999999999999986655555100 11110 001111 Q ss_pred CCCCCCCCC--CCCCCCc-cccCCCCCCCCCCCCCCCCCcccCCCccH Q lcl|NC_011057. 493 PQQQQAIDS--GGNEDTS-DDDNLDDGEHEPDTEDDQDDDGTQKAGLE 537 (634) Q Consensus 493 P~p~~a~~~--~~~~~~~-~d~~~~~~~~ePDTe~d~~~~~~~~a~~~ 537 (634) +.+....+. .++.+.. 
+..+..+.+.+++ + ++...++.-+ T Consensus 495 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~---~~~~~a~~~~ 537 (537) T protein:vir:10 495 DVDDEGKPVRIIEDQPAPSEMFGATSSGESAN--D---PRDSGAAFED 537 (537) T ss_pred cCCccCCcCCCCCCCCCccccCCCCccccccC--C---CccCccccCC Confidence 111111100 0000000 1111111111111 1 1111111111 No 120 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=91.23 E-value=0.017 Score=30.30 Aligned_cols=449 Identities=18% Similarity=0.201 Sum_probs=163.9 Q ss_pred CCCCC----------cceeEeccCCCCccchhhhhhhhccCCchhhhhhh-------------hcccCccc----cccH- Q lcl|NC_011057. 1 MAATQ----------SLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSK-------------STGISRNS----DWQT- 52 (634) Q Consensus 1 ~~a~~----------~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~-------------~~~~~~~~----~WQ~- 52 (634) ||-.. +|-..-|-+... +.+.++.+|-+-.-||...... ..+..+.+ .|.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~ 79 (532) T protein:vir:94 1 MADTDPTPRPEITYATLQQAQRVDAKR-ATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEA 79 (532) T ss_pred CCCCCCCCCcceehhhhhhHhhhhhhh-hhhhhhhhhhhhhhcccccccccccccccccccccccCcccccccccccccc Confidence 22110 111000000000 0111222222211121110000 00000111 1211 Q ss_pred ------HHHHHHhhhhhHHHHHhhhhhceeeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHH Q lcl|NC_011057. 53 ------DAWEAVDLVGELRYYVGWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKR 126 (634) Q Consensus 53 ------eAW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR 126 (634) +.=.+|..-++.|=+|.=.+.-|-|-=.-...-+.+ .+++ +.+..|-..+- +| .+-.. 
T Consensus 80 ~~~~~~~l~a~Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~------~~~~-----~~~~~i~~~~~--~l---~v~~~ 143 (532) T protein:vir:94 80 TSWPGFPTLALLAQLPEYRTMHETPADECVRAWGKITCSSKD------ELAA-----DKATRITQKLE--QY---NVRTL 143 (532) T ss_pred cccchHHHHHHHHcCchhhhhhccchHHHhhCCceEeeCCcc------ccch-----HHHHHHHHHHH--hh---hHHHH Confidence 111244444454555555555555443333222211 1221 33333222111 12 23445 Q ss_pred HHHhh---ccccceEEEEEEecCCCCCCCc---------cc----c---cccchhceeccHHHHhccCCCcceeeEeCC- Q lcl|NC_011057. 127 VVECL---TVPGELWIVILTRPVKGAPAQP---------DG----S---VRTRQEWYAVSKEEIKKSNKGSGTNIVLPT- 186 (634) Q Consensus 127 ~~~~L---tVpGE~wi~il~rp~~~~~~~~---------dg----~---~~~~~~W~~vt~~Ei~~~~~~~~~~i~lP~- 186 (634) +.+|+ -+=|-.+|+|+.+..+. +.+ ++ . ++..+.|. |+..+..... ..-|+ T Consensus 144 l~~a~~~~rlyG~a~i~i~v~~~~~--~~~~~~p~~l~~~~I~~g~~~~l~vld~~~-v~p~~~~~~d------p~sp~f 214 (532) T protein:vir:94 144 VRTVVIHDQAYGGAHVFPHLKMDGD--SVPADAPLLLSPSFVQRGCLIGFATIEPMW-LSPNAYNATD------PTLPSF 214 (532) T ss_pred HHHHHHhhhcccceEEEEEeccCCc--cccccccccccccccccceeeEEEeechhe-eccccccccc------cccccc Confidence 55555 48899999998863321 111 00 0 00111111 1111111000 00000 Q ss_pred CCcccccC------CCCeEEEe-eCCCccccc-----CCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccc Q lcl|NC_011057. 187 GEEHEFVK------GTDIIFRV-WIPKPRKAS-----EPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEM 254 (634) Q Consensus 187 g~~h~~~~------~~D~~~Rv-W~P~prra~-----eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~ 254 (634) |++..|.. 
.-+-+|++ =+|-|...+ --.|-.+.+++.|+=..++...+..-..+ ..+.++ .+ T Consensus 215 g~P~~y~v~~g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~----~~~~v~--k~ 288 (532) T protein:vir:94 215 YKPDSWIATSGKKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQ----FSMTNL--AT 288 (532) T ss_pred CCceeEEEccCeeeccceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHh----cCCcee--ee Confidence 22222211 12334443 344444332 34677788888887776666555432221 111111 01 Q ss_pred cCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCchhHHHHH Q lcl|NC_011057. 255 SLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIK 334 (634) Q Consensus 255 slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~aik 334 (634) .+ ++.- + . .+ ...+.+-+-.+. ..++ ..=.+++....|.++.++ +.|. +++++--. T Consensus 289 ~~--a~~l-s--~-------~~---~~~~~~r~~~~~--~~~~-----n~g~~~id~~~e~~e~~~-~~ls-gl~~~l~~ 344 (532) T protein:vir:94 289 DM--AQLL-A--P-------GG---AQSLDARLQLFN--LYRD-----NRNIGALDKGTEEIQQTN-TPLS-GLDSLQAQ 344 (532) T ss_pred ch--HHhh-c--c-------hh---HHHHHHHHHHHH--hhcC-----CccceEEcCCCceeEEEe-cccC-CHHHHHHH Confidence 11 0000 0 0 01 122222221111 1111 112244444557777776 6676 57776544 Q ss_pred HHHHHHHHHhhhccCChHHhhccccCc-c-----hhhHhhhhhhhhh-HHHHhHHHHHHHHHHHHHHHHHHHhcCC-Chh Q lcl|NC_011057. 335 TRNDAIARLAMGLDVSPERLLGLGSQT-N-----HWSAWQISDEDVQ-LHIAPVMEIFCQALTDQILRVTLAREGI-DPS 406 (634) Q Consensus 335 tR~daI~rlA~~~D~~pE~LLGlgs~~-N-----hwtAw~i~de~v~-~hI~P~~~~i~~ait~~~lr~~L~~eG~-d~~ 406 (634) +...||.+++||--+|+|..... | --..|--.=+.+| ..+.|+++.|.+-|.+. . .|. 
+++ T Consensus 345 ----~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s----~---~g~~~~d 413 (532) T protein:vir:94 345 ----SQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQLS----E---YGQIDPG 413 (532) T ss_pred ----HHHHHHhHhCCCeeeeecCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----h---cCCCCCC Confidence 44469999999998898852111 1 1122322223344 23567766666555432 2 244 444 Q ss_pred HheeeecCcccccCCCc--h-------HHHHHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccch Q lcl|NC_011057. 407 KYVVWYDASQLTIDPDK--S-------DEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIP 477 (634) Q Consensus 407 ~yV~w~DaS~L~~~pd~--t-------~eA~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~ 477 (634) =++.|-+ |-.-.++ . +.+..+++.|+|+.+++|.++|...+.+|+...++. +-+.. T Consensus 414 ~~~~f~p---L~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~------~~~~~------ 478 (532) T protein:vir:94 414 LAWEWSP---LMELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADPTSGYAGALGER------DELDD------ 478 (532) T ss_pred ceEEeCC---CCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCCccccccccccc------ccccc------ Confidence 3334543 4322222 2 222458999999999999999998777776433220 00000 Q ss_pred hhhhhhhhhhhcccCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCCCCcccCCCccHHHHHHHHHHHHHHHhhHH Q lcl|NC_011057. 478 MLAPLIAGVLQQIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPDTEDDQDDDGTQKAGLESGIVDLMVDRALELVGKR 555 (634) Q Consensus 478 ~laPll~p~~q~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePDTe~d~~~~~~~~a~~~~a~vdllv~rALelAGkR 555 (634) .+....-.+.-...+|.++ + ..+. 
+..+.++|+.++++.+.+.++...+-+ |.| T Consensus 479 --~~~~~~~~~~~~~~~~~~~--~-----~~~~---~~~~~~~d~~~~~~~~~~~~~~~~~~~------------~~~ 532 (532) T protein:vir:94 479 --VEEIAKQLMAAALNPPATA--P-----QTPN---PQPDSEDDQTDNQPDAQADPAQNDQPV------------GNR 532 (532) T ss_pred --ccchhhhhcccccCCCCCC--C-----CCCC---CCCCCCCCCCCCccCCCccccccCCCc------------CCC Confidence 0111000010111111100 0 0000 111112222222111111111110000 000 No 121 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=36.65 E-value=1.2 Score=20.23 Aligned_cols=412 Identities=16% Similarity=0.170 Sum_probs=181.8 Q ss_pred CCCCCcceeEeccCCCCccchhh------hhhhhccCCchhhh--hhhh--cccCccccc-cHHHHHHHh--------hh Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRA------LTAASQPLPDPSQV--FSKS--TGISRNSDW-QTDAWEAVD--------LV 61 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ra------l~aAs~~itdp~~~--~~~~--~~~~~~~~W-Q~eAW~~yd--------~V 61 (634) .--+.+|-+++|+.-..-++... +..+-+.++=|+.- +.+. .....-+.| +.++|.+.+ .. T Consensus 44 ~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vl 123 (527) T protein:vir:10 44 LTNTSDYQVILRGGDEGDQRPIYVPNGEKLIEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIR 123 (527) T ss_pred cCchhheeeecCCccccccceeeehhhHHhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhh Confidence 44456888887665532222223 44444444333322 0000 001111223 457787776 45 Q ss_pred hhHHHHHhhhhhce--eeeeEEEeeecc----------cCCCCCCC--C---C--CCCccc---HHHHHHHHhhcCCcch Q lcl|NC_011057. 
62 GELRYYVGWRASSC--SRCRLVASELDE----------NTGLPTGG--I---S--EDNTEG---ERVREIVSKIADGTLG 119 (634) Q Consensus 62 gELryyvgWr~~s~--Sr~rL~aseiD~----------Dtg~ptG~--i---~--ed~~~g---~r~~~iv~~iagG~lG 119 (634) |-=.|++.|..+.- +|+|+- ++|| +.+.|.|- + + +|...| -|++.|.+.+ |.-| T Consensus 124 GDg~f~l~wD~~k~~~~R~~v~--~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l--~~~g 199 (527) T protein:vir:10 124 GDYVLLLIGDDEKDEGSRLSLH--EVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTL--DDDG 199 (527) T ss_pred cceeEEEeeccCCCcCCCceEe--ecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhc--Cccc Confidence 77789999997653 677753 3442 22444443 1 1 122222 3344443333 1111 Q ss_pred HHHHHHHHHHhhcc-ccceEEEEEEecCCCCCCCcccccccchhce-----eccHHHHhccCCCcceeeEeCCCCccccc Q lcl|NC_011057. 120 QAALTKRVVECLTV-PGELWIVILTRPVKGAPAQPDGSVRTRQEWY-----AVSKEEIKKSNKGSGTNIVLPTGEEHEFV 193 (634) Q Consensus 120 QaqL~kR~~~~LtV-pGE~wi~il~rp~~~~~~~~dg~~~~~~~W~-----~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~ 193 (634) ..| +|. |+.... .+. .++|- ++.++.+++.. +++.+. +-..+..|. T Consensus 200 -----------~~~~~G~--~~yt~~------~w~------lg~w~d~~e~p~~~~~~~~~~--~~~~l~-~lp~pi~fi 251 (527) T protein:vir:10 200 -----------KPVPGGA--IKYTEE------LYE------PGKWDDRPESPLEPDDIKKLS--TLTEEE-PLPEQITTL 251 (527) T ss_pred -----------ccccCcc--eeeeec------eee------ccccccccccccchhhhhhhc--Cceeee-cccCCCCcc Confidence 011 121 111100 000 12342 23344444221 111111 111122222 Q ss_pred CCCCeEEEeeC--CCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCC Q lcl|NC_011057. 194 KGTDIIFRVWI--PKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEI 271 (634) Q Consensus 194 ~~~D~~~RvW~--P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~ 271 (634) + |-.|+ |.|+.+|- -|----.|..+.||-++.--.. .-|+..|+++.-+ ..+. |. +.+|+. 
T Consensus 252 P-----vV~~~t~p~~~~~WG-~S~La~ll~l~deLn~~~Td~s--~is~~sG~Pi~~~-tg~~-~v------d~~G~~- 314 (527) T protein:vir:10 252 P-----VFHFRGHPIMNAMFG-RSGLAGLESLIASVNQTMTDED--LIMVFGGLGFYAT-DSAP-PR------DSRGNM- 314 (527) T ss_pred c-----eEeecCCCccccccC-hhhHhHHHHHHHHHhhhhhHHH--HHHHHhCCceeee-cccc-cc------cccCCc- Confidence 2 22234 33334333 2333355666666644433222 2367789998876 3333 11 122221 Q ss_pred CccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCchhHH--HHHHHHHHHHHHhhhccC Q lcl|NC_011057. 272 APLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEV--AIKTRNDAIARLAMGLDV 349 (634) Q Consensus 272 ~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~--aiktR~daI~rlA~~~D~ 349 (634) .|+. |+--+ |...|.+ .+ +.+-+-+..+ -....+.+.++|+....+ T Consensus 315 ~~~~--------------VgPG~-------------iweL~e~--ak---~~~v~~~~~la~~~~h~~~L~~~l~~vA~~ 362 (527) T protein:vir:10 315 VPWT--------------ISPLG-------------MVEHGQN--NK---IYRVNGVASLEPSQTHMTKAEEAMQQTKGI 362 (527) T ss_pred Cccc--------------cCCce-------------eEecCCC--cc---eeeccchhhhHHHHHHHHHHHHHHHHhhcC Confidence 2211 00000 1111100 00 1111111111 122355666788888888 Q ss_pred ChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHH------HHHHHHHHHHH-------HHHHhcCC-ChhHh----eee Q lcl|NC_011057. 350 SPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEI------FCQALTDQILR-------VTLAREGI-DPSKY----VVW 411 (634) Q Consensus 350 ~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~------i~~ait~~~lr-------~~L~~eG~-d~~~y----V~w 411 (634) |- .=+| .+++.+ -+|.-++++-+.|.+.- +-..+.++++. ++.+..|+ +...+ ++| T Consensus 363 Pa-vA~G---~vD~s~--~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf 436 (527) T protein:vir:10 363 PD-IAVG---VVDAAV--AESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITF 436 (527) T ss_pred Ce-eeec---cccCCc--CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEe Confidence 77 6555 222222 46677888889998763 22333444332 12334444 33332 556 Q ss_pred ecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhh Q lcl|NC_011057. 
412 YDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQ 488 (634) Q Consensus 412 ~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q 488 (634) -|. |-+ |+++.. +.++..|+|+-+...++++ +.||=.+...-.++|..++..+-.+++...+|+..-... T Consensus 437 ~p~--lP~--D~~avie~v~tL~~aGi~S~~tAv~~L~---~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~ 509 (527) T protein:vir:10 437 RDP--KPV--NSEKRFNQLLQLWEAGLIPAKKLTEELS---KIMGFELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAA 509 (527) T ss_pred ccc--CCC--CHHHHHHHHHHHHHcCchhHHHHHHHHH---hccCCCChHHHHHHHHHHHHHHhHHhhhhcCchhhhhcc Confidence 555 222 233222 5699999999999888873 222322344557888899999999999988888753222 Q ss_pred cccCCCCCCCCCCCCCCCCccccCCCCCCCCCC Q lcl|NC_011057. 489 QIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPD 521 (634) Q Consensus 489 ~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePD 521 (634) .-+|| +++.|+.+-. -|= T Consensus 510 ~~g~~------------~~~~d~~~~~---~~~ 527 (527) T protein:vir:10 510 EQGIP------------DEEDDQALNG---QPL 527 (527) T ss_pred ccCCC------------CCCcccccCC---CCC Confidence 22222 1111111000 000 No 122 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=34.01 E-value=1.3 Score=19.93 Aligned_cols=243 Identities=12% Similarity=0.060 Sum_probs=121.0 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhhhHHHHHhhhhhceeeeeE Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (634) |.== . +...|.....+-..-..... .+..+ +...... 
+....+ ..+-+.=.+.-++++++++-| T Consensus 1 MglF-~-~~~~r~~~~~~~~~~~~~~~---------~~~~~--~~~~~~v-~~~~al--~~~~v~~~i~~ia~~iA~lp~ 64 (251) T protein:vir:46 1 MGIF-Y-KNEKRDLQYNEDDLQMMVQT---------LPSFQ--GTKLRQY-KDIEAI--RHSDIFTAVMMIASDLARMPI 64 (251) T ss_pred CCcc-c-cccccccCCCccchhhhhhh---------hcccc--CcCccee-chhhhh--ccHHHHHHHHHHHHhHhhCce Confidence 4321 1 11112111111000111110 00000 0001111 011111 122344445667888888877 Q ss_pred EEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEecCCCCCCCcccccccc Q lcl|NC_011057. 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (634) Q Consensus 81 ~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp~~~~~~~~dg~~~~~ 160 (634) ..-+ + +... .+ ..+..++..-..-.+...++++.++.+|-+-|+.|+.+ .|.. +|. . T Consensus 65 ~~~~---~-----~~~~---~~-~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i-~r~~-------~G~---~ 121 (251) T protein:vir:46 65 RVTV---N-----GQIN---YS-DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEI-TRDK-------TGE---P 121 (251) T ss_pred EEee---C-----cccc---cc-chHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEE-EECC-------CCc---E Confidence 6643 1 1111 11 34455566667778889999999999999999998886 4532 232 3 Q ss_pred hhceeccHHHHhccCCCccee---eEeC----CCCcccccCCCC-eEEEeeCCCcccccCCccchhhhhHHHHHHHhhhH Q lcl|NC_011057. 161 QEWYAVSKEEIKKSNKGSGTN---IVLP----TGEEHEFVKGTD-IIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTK 232 (634) Q Consensus 161 ~~W~~vt~~Ei~~~~~~~~~~---i~lP----~g~~h~~~~~~D-~~~RvW~P~prra~eaDSPvra~l~~LrEI~rttk 232 (634) .+++.|..+.+......+|.. +..- .|....| +..| +-||..+. 
....--||+.++.+.|.-..-+.+ T Consensus 122 ~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~g~~~~~-~~~diiH~r~~~~---dg~~G~spi~~~~~~i~~~~~~~~ 197 (251) T protein:vir:46 122 MNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNV-KFEDMLDIKFYSL---DGINGLSLLDTLSRTIESDNNGKD 197 (251) T ss_pred EEEEEECCceEEEEECCCCcEEEEEEEeccCCcceeEEE-CCccEEEecCcCC---CCeeecCHHHHHHHHHHHHHHHHH Confidence 457777777765443322221 1111 2333334 3334 44554332 235678999999999988888888 Q ss_pred HHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_011057. 233 TIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVP 312 (634) Q Consensus 233 ~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP 312 (634) ...+..+.-..-.|||-+|+.++-+ -+.++|++.+. ..+.-.+ -+..|++ +. T Consensus 198 ~~~~~f~ng~~p~gil~~~~~l~~~--------------------e~~~~~~~~~~----~~~~g~~-n~g~~~~---gm 249 (251) T protein:vir:46 198 FLNNFLRNGTHAGGILKMKGVLDNK--------------------KARDRAREEFP----KVLVELN-KLGKLSY---SM 249 (251) T ss_pred HHHHHHHccCCCcEEEEeCCCCCCH--------------------HHHHHHHHHHH----HHhcCcc-ccccccc---cc Confidence 8888888777778899888765421 12333444333 2222111 1112222 22 Q ss_pred hH Q lcl|NC_011057. 313 GE 314 (634) Q Consensus 313 ~E 314 (634) .| T Consensus 250 ~~ 251 (251) T protein:vir:46 250 NQ 251 (251) T ss_pred CC Confidence 23 No 123 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=33.92 E-value=1.3 Score=19.91 Aligned_cols=525 Identities=13% Similarity=0.111 Sum_probs=185.7 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhcc-CCchhhhhhhhcccCccc----cc---------cHHHHHHHhhhhhHHH Q lcl|NC_011057. 
1 MAATQSLRLVRRPKGGRPAPSRALTAASQP-LPDPSQVFSKSTGISRNS----DW---------QTDAWEAVDLVGELRY 66 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~-itdp~~~~~~~~~~~~~~----~W---------Q~eAW~~yd~VgELry 66 (634) +.=+.-.|-++.++ .+ ...-++-++... ++|--.++..+.+.++.. +| |-. .+|..-+..|= T Consensus 50 ~~~~~~~~~~~~~~-~~-~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~--alY~~~~l~rk 125 (765) T protein:vir:96 50 PEKAPVIRSVKDFL-EP-GLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQAC--AIISQHWLVDK 125 (765) T ss_pred cccCCCCCCCCccc-Cc-ccceeccccccccccchHHHhhhccCccchhhHHHhhhcccCCccHHHH--HHHHhCchhhh Confidence 11122223333222 22 112233333211 111112222211111111 11 111 12333233333 Q ss_pred HHhhhhhce-eeeeEEEeeecccCCCCCCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhccccceEEEEEEec Q lcl|NC_011057. 67 YVGWRASSC-SRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRP 145 (634) Q Consensus 67 yvgWr~~s~-Sr~rL~aseiD~Dtg~ptG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~LtVpGE~wi~il~rp 145 (634) +|.=.+.-| .+-+-+-+ +.+ .+.+ ..-.++....+.+ .=.+.++.+...--+=|-.+|+++..- T Consensus 126 iVd~pAeDa~R~g~~I~~--~~~------e~~~--~~~~~l~~~~~rl-----~v~~~l~ea~~~~RlyGga~i~i~i~~ 190 (765) T protein:vir:96 126 ACSMSGEDAARNGWELKS--DGR------KLSD--EQSALIARRDMEF-----RVKDNLVELNRFKNVFGVRIALFVVES 190 (765) T ss_pred hhhcchHHhhcCCceeec--Ccc------ccCH--HHHHHHHHHHHHh-----hHHHHHHHHHHHhhhceeeEEEEEecc Confidence 333332222 22222222 111 1111 1112333333333 334555555555567788888888741 Q ss_pred CCC----CCCCccc----ccc---c-chhcee-ccHHHHhccCC----CcceeeEeCCCCcccccCCCCeEEEeeCCCc- Q lcl|NC_011057. 146 VKG----APAQPDG----SVR---T-RQEWYA-VSKEEIKKSNK----GSGTNIVLPTGEEHEFVKGTDIIFRVWIPKP- 207 (634) Q Consensus 146 ~~~----~~~~~dg----~~~---~-~~~W~~-vt~~Ei~~~~~----~~~~~i~lP~g~~h~~~~~~D~~~RvW~P~p- 207 (634) .++ .+=++++ .+. . ...|.. ++..|+-.... 
+.-..+.+.+.+-| -.-||++ +..+ T Consensus 191 ~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i~g~~IH-----~SRli~~-~g~~l 264 (765) T protein:vir:96 191 DDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWIISGKKYH-----RSHLVVV-RGPQP 264 (765) T ss_pred cCcchhhccccccccccceeeEEEEechhhcccccchhccccccccccCcceeeeecCceec-----cceEEEe-cCCCc Confidence 110 0001111 000 0 111211 11122211111 11122233222211 1234443 2221 Q ss_pred -cc-----ccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCCCccccchHHH Q lcl|NC_011057. 208 -RK-----ASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQ 281 (634) Q Consensus 208 -rr-----a~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~~p~~g~~a~~ 281 (634) .. ..--.|=.+.|++.|.=..++...+.....+--+ -++.+ .+ .. -+.+ .+ T Consensus 265 pd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~--~v~k~----~~--~~------------~l~~---~~ 321 (765) T protein:vir:96 265 PDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRT--STIHV----DV--EK------------AIAN---ED 321 (765) T ss_pred hhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhcc--ceeee----ch--Hh------------hhcc---HH Confidence 11 1124555677777776666655444433222111 01111 11 00 0111 12 Q ss_pred HHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccc-cC Q lcl|NC_011057. 282 QLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLG-SQ 360 (634) Q Consensus 282 ~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlg-s~ 360 (634) +|.+-+-.+. .+.|=.+ +++..-.|.++.++ +.|+ +++++- +.+...||.+.+||--+|+|.. 
++ T Consensus 322 ~l~~r~~~~~--~~r~n~g------~~~id~ee~~e~~s-~~ls-gl~d~l----~~~~~~iAaas~IP~t~LfGqsp~G 387 (765) T protein:vir:96 322 AFNARLAFWI--ANRDNHG------VKVIGIDETMEQFD-TNLS-DFDSVI----MNQYQLVAAIAKTPATKLLGTSPKG 387 (765) T ss_pred HHHHHHHHHH--HhcCCce------eEEecCCcceeEEe-cccC-CHHHHH----HHHHHHHHhhhCCCeeeeccCCccc Confidence 2332222221 1222222 33433446555555 4565 566644 5556789999999999999974 45 Q ss_pred cchhhHhhh-----hhhhhh-HHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHheeeecCcccccCCCch-------HHH Q lcl|NC_011057. 361 TNHWSAWQI-----SDEDVQ-LHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDASQLTIDPDKS-------DEA 427 (634) Q Consensus 361 ~NhwtAw~i-----~de~v~-~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~w~DaS~L~~~pd~t-------~eA 427 (634) .|-.+=-.+ .=+.+| ..+.|+++.+.+-|- ..++++++=.+.|-+...++ +-++. +.+ T Consensus 388 lnATGe~D~~nYyD~I~s~Qe~~l~p~le~L~~li~--------~s~~i~~d~~i~FnpL~~~s-ekEkAei~~k~Aea~ 458 (765) T protein:vir:96 388 FNATGEHETISYHEELESIQEHIFDPLLERHYLLLA--------KSESIDVQLEIVWNPVDSTT-SQQQAELNNKKAATD 458 (765) T ss_pred ccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HhcCCCCcceEEeCCCCCCC-HHHHHHHHHHHHHHH Confidence 553322221 112223 234555544443332 24678765444454333222 11222 233 Q ss_pred HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhh-hhhcccCCCCCCCC------- Q lcl|NC_011057. 428 KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAG-VLQQIEFPQQQQAI------- 499 (634) Q Consensus 428 ~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p-~~q~~~~P~p~~a~------- 499 (634) ..+++.|+|+.+++|.+++...+.+|+--+.+.- -.+ |+.+| ..+..+-+....+. 
T Consensus 459 ~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~--------e~~--------~~~~pe~~~~~~~~~~~~~~~~~e~~~ 522 (765) T protein:vir:96 459 EIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQA--------ETE--------PGMSPENLAELEKAGAQSAKAKGEAER 522 (765) T ss_pred HHHHhcCCCCHHHHHHHHhccccCCCCCCCcccc--------ccc--------cCCCccccccccCCCcccccccCcccc Confidence 5589999999999999998866666653332210 011 11111 11111111100000 Q ss_pred ------CCCCCCCCccccCCCCCCCCCCCCCCCCC---cccCCCccH-HHHHHHHHHHHHHHhhHHhhcCChhHHHHhhC Q lcl|NC_011057. 500 ------DSGGNEDTSDDDNLDDGEHEPDTEDDQDD---DGTQKAGLE-SGIVDLMVDRALELVGKRRRGRDRETLARLSG 569 (634) Q Consensus 500 ------~~~~~~~~~~d~~~~~~~~ePDTe~d~~~---~~~~~a~~~-~a~vdllv~rALelAGkR~Rt~~R~~~arlr~ 569 (634) ...+..+..+.. ..+.+|..++-... +.+.++.+. .+-......++.+.++|=. ..-..+..-.-. T Consensus 523 ~~a~p~~~eg~~~~~~~~---p~~~~p~~~~~~~~~g~~~~~p~~~~p~~~~~~~~~~~~~~~~~~~-~~a~~~g~~v~~ 598 (765) T protein:vir:96 523 AEAQAGAVEGAGDPVPAA---PRGTKPLAKAAEEGAGEAATPPSRPNPRAELRNLLSDLLSKLEALD-DAQAPDGVDIEQ 598 (765) T ss_pred ccCCCCccCCCCcccccC---CcccCCccccccccCccccCccccccccccchhcccchhhhhhccc-cccccCCCCCCC Confidence 000000100000 01111211111000 111111111 1122333444444444422 111222222234 Q ss_pred CChHHhhhhcCCC-ChhHHHHHHhcccccccHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011057. 570 VRERDYHRYMDPV-PESEVDRLMSGWDSALDDKILLRLGLDPGTIRSAVRRKVMAELTRPVIDVVA 634 (634) Q Consensus 570 ip~h~~h~~~~Pv-~~~~v~rLi~GWd~~ld~~~~a~~g~Dp~~lr~~v~~~v~~~lt~~vvd~~~ 634 (634) .+.+..|+-..|. +.-+..-...+|.....+ .|-..-+.-+-+++.. 
T Consensus 599 ~~~~a~~~a~~ps~a~~~~~~~~~~~~~~P~~------------------~~~~~~~~~~~~~~~i 646 (765) T protein:vir:96 599 DDAPGLKRTSKPSVSGMEPSVFSSNRIVGPRD------------------HSELQRIKVNGITTLI 646 (765) T ss_pred CccchhhhhhccccCCCCCcccCCCCCCCCcc------------------cccccceeecceEeee Confidence 4566666655551 111122234455443321 0101111111122222 No 124 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=31.10 E-value=1.5 Score=19.58 Aligned_cols=412 Identities=17% Similarity=0.175 Sum_probs=181.7 Q ss_pred CCCCCcceeEeccCCCCccchhh------hhhhhccCCchhhh--hhhh--cccC-ccccccHHHHHHHh--------hh Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRA------LTAASQPLPDPSQV--FSKS--TGIS-RNSDWQTDAWEAVD--------LV 61 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ra------l~aAs~~itdp~~~--~~~~--~~~~-~~~~WQ~eAW~~yd--------~V 61 (634) .--+.+|-+++|+.-..-++... +..+-+.++=|+.- +.+. .... .+.=-+.++|.+.+ .. T Consensus 44 ~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vl 123 (527) T protein:vir:10 44 LTNTSDYQVILRGGDEGDQRPIYVPNGEKLIEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIR 123 (527) T ss_pred cCchhheeeecCCccccccceeeehhhHHhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhh Confidence 44456888887665532222223 44444444333322 0000 0011 12223557787776 45 Q ss_pred hhHHHHHhhhhhce--eeeeEEEeeecc----------cCCCCCCC--C---C--CCCccc---HHHHHHHHhhcCCcch Q lcl|NC_011057. 
62 GELRYYVGWRASSC--SRCRLVASELDE----------NTGLPTGG--I---S--EDNTEG---ERVREIVSKIADGTLG 119 (634) Q Consensus 62 gELryyvgWr~~s~--Sr~rL~aseiD~----------Dtg~ptG~--i---~--ed~~~g---~r~~~iv~~iagG~lG 119 (634) |-=.|++.|..+.- +|+|+- ++|| +.+.|.|- + + +|...| -|++.|.+.+ |.-| T Consensus 124 GDg~f~l~wD~~k~~~~R~~v~--~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l--~~~g 199 (527) T protein:vir:10 124 GDYVLLLIGDDEKDEGSRLSLH--EVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTL--DDDG 199 (527) T ss_pred cceeEEEeeccCCCcCCCceEe--ecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhc--Cccc Confidence 77789999997653 677753 3442 22444443 1 1 122222 3344443333 1111 Q ss_pred HHHHHHHHHHhhcc-ccceEEEEEEecCCCCCCCcccccccchhce-----eccHHHHhccCCCcceeeEeCCCCccccc Q lcl|NC_011057. 120 QAALTKRVVECLTV-PGELWIVILTRPVKGAPAQPDGSVRTRQEWY-----AVSKEEIKKSNKGSGTNIVLPTGEEHEFV 193 (634) Q Consensus 120 QaqL~kR~~~~LtV-pGE~wi~il~rp~~~~~~~~dg~~~~~~~W~-----~vt~~Ei~~~~~~~~~~i~lP~g~~h~~~ 193 (634) ..| +|. |+.... .+. .++|- ++.++.+++.. +++.+. +-..+..|. T Consensus 200 -----------~~~~~G~--~~yt~~------~w~------lg~w~d~~e~p~~~~~~~~~~--~~~~l~-~lp~pi~fi 251 (527) T protein:vir:10 200 -----------KPVPGGA--IKYTEE------LYE------PGKWDDRPESPLEPDDIKKLS--TLTEEE-PLPEQITTL 251 (527) T ss_pred -----------ccccCcc--eeeeec------eee------ccccccccccccchhhhhhhc--Cceeee-cccCCCCcc Confidence 011 121 111100 000 12342 23344444221 111111 111122222 Q ss_pred CCCCeEEEeeC--CCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhCceeeecccccCCCCcCCCCcCCCCCC Q lcl|NC_011057. 194 KGTDIIFRVWI--PKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEI 271 (634) Q Consensus 194 ~~~D~~~RvW~--P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvP~e~slP~~~~p~a~~~~~~~ 271 (634) + |-.|+ |.|+.+|- -|----.|..+.||-++.--.. .-|+..|+++.-+ ..+. |. +.+|+. 
T Consensus 252 P-----vV~~~t~p~~~~~WG-~S~La~ll~l~deLn~~~Td~s--~is~~sG~Pi~~~-tg~~-~v------d~~G~~- 314 (527) T protein:vir:10 252 P-----VFHFRGHPIMNAMFG-RSGLAGLESLIASVNQTMTDED--LIMVFGGLGFYAT-DSAP-PR------DSRGNM- 314 (527) T ss_pred c-----eEeecCCCccccccC-hhhHhHHHHHHHHHhhhhhHHH--HHHHHhCCceeee-cccc-cc------cccCCc- Confidence 2 22234 33334333 2333355666666644433222 2367789998876 3333 11 222221 Q ss_pred CccccchHHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCchhHH--HHHHHHHHHHHHhhhccC Q lcl|NC_011057. 272 APLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEV--AIKTRNDAIARLAMGLDV 349 (634) Q Consensus 272 ~p~~g~~a~~~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~--aiktR~daI~rlA~~~D~ 349 (634) .|+. |+--+ |...|.+ .+ +.+-+-+..+ -....+.+.++|+....+ T Consensus 315 ~~~~--------------VgPG~-------------iweL~e~--ak---~~~v~~~~~la~~~~h~~~L~~~l~~vA~~ 362 (527) T protein:vir:10 315 VPWT--------------ISPLG-------------MVEHGQN--NK---IYRVNGVASLEPSQTHMNKAEEAMQQTKGI 362 (527) T ss_pred Cccc--------------cCCce-------------eEecCCC--cc---eeeccchhhhHHHHHHHHHHHHHHHHhhcC Confidence 2211 00000 1111100 01 1111111111 122356667788888888 Q ss_pred ChHHhhccccCcchhhHhhhhhhhhhHHHHhHHHH------HHHHHHHHHHH-------HHHHhcCC-ChhHh----eee Q lcl|NC_011057. 350 SPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEI------FCQALTDQILR-------VTLAREGI-DPSKY----VVW 411 (634) Q Consensus 350 ~pE~LLGlgs~~NhwtAw~i~de~v~~hI~P~~~~------i~~ait~~~lr-------~~L~~eG~-d~~~y----V~w 411 (634) |- .=+| .+++.+ -+|.-++++-+.|.+.- +-..+.++++. ++.+..|+ +...+ ++| T Consensus 363 Pa-vA~G---~vD~s~--~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf 436 (527) T protein:vir:10 363 PD-IAVG---VVDAAV--AESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITF 436 (527) T ss_pred Ce-eeec---cccCCc--CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEe Confidence 77 6555 222222 46677888889998763 22333444332 12334444 33332 556 Q ss_pred ecCcccccCCCchHHH---HHHHHccCCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhh Q lcl|NC_011057. 
412 YDASQLTIDPDKSDEA---KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQ 488 (634) Q Consensus 412 ~DaS~L~~~pd~t~eA---~~~~~~G~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q 488 (634) -|. |-+| +++.. +.++..|+|+-+...++++ +.||=.+...-.++|-.++..+-.+++...+|+..-... T Consensus 437 ~p~--lP~D--~~avie~v~tL~~aGiiS~etAv~~L~---~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~ 509 (527) T protein:vir:10 437 RDP--KPVN--NEKRFAQLLELWEAGLIPAKKLTEELS---KIMGFELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAA 509 (527) T ss_pred ccc--CCCC--HHHHHHHHHHHHHcCchhHHHHHHHHH---hccCCCchHHHHHHHHHHHHHHhHHhhhhcCchhhhhcc Confidence 555 3332 33222 5699999999999888873 222312344446778889999999999988888753222 Q ss_pred cccCCCCCCCCCCCCCCCCccccCCCCCCCCCC Q lcl|NC_011057. 489 QIEFPQQQQAIDSGGNEDTSDDDNLDDGEHEPD 521 (634) Q Consensus 489 ~~~~P~p~~a~~~~~~~~~~~d~~~~~~~~ePD 521 (634) .-+|| +++.|+.+-. -|= T Consensus 510 ~~g~~------------~~~~d~~~~~---~~~ 527 (527) T protein:vir:10 510 EQGIP------------DEEDDQALNG---QPL 527 (527) T ss_pred ccCCC------------CCCcccccCC---CCC Confidence 22222 1111111000 000 No 125 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=23.68 E-value=2.3 Score=18.63 Aligned_cols=386 Identities=18% Similarity=0.197 Sum_probs=139.0 Q ss_pred CCCCCcceeEeccCCCCccchhhhhhhhccCCchhhhhhhhcccCccccccHHHHHHHhhhh----hHHHHHhhhhhcee Q lcl|NC_011057. 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVG----ELRYYVGWRASSCS 76 (634) Q Consensus 1 ~~a~~~lr~vrrp~g~~~a~~ral~aAs~~itdp~~~~~~~~~~~~~~~WQ~eAW~~yd~Vg----ELryyvgWr~~s~S 76 (634) -.|..+.||.|. |-+|+...+-+ | ..+.+.|..-.+ -..+. 
+-.|-.+|++..+- T Consensus 10 ~~~~~~~~~~~~--------rd~l~~~~~gl---g--------~~r~~~~~~~g~--~~~~~~~~l~~~Yr~~~ia~~iV 68 (449) T protein:vir:10 10 NHALNDARMARA--------RMGLMVPTMGL---D--------NKRHSAWCEYGF--PELVTYENLYSLYRRGGIAHGAV 68 (449) T ss_pred hhhcchhHHHHH--------HHHHHHHHhcC---C--------cccchhhhhcCC--cccCCHHHHHHHHhcCchhHHHH Confidence 122233333221 12343332222 1 112222211100 00111 12455566655443 Q ss_pred eeeEEEeeecccC--CCC--CCCCCCCCcccHHHHHHHHhhcCCcchHHHHHHHHHHhhc---cccceEEEEEEecCCCC Q lcl|NC_011057. 77 RCRLVASELDENT--GLP--TGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLT---VPGELWIVILTRPVKGA 149 (634) Q Consensus 77 r~rL~aseiD~Dt--g~p--tG~i~ed~~~g~r~~~iv~~iagG~lGQaqL~kR~~~~Lt---VpGE~wi~il~rp~~~~ 149 (634) .. ++ | ++ .-| +-+.+.+..+.. .++-..+- .+-=.++...+.+++. +-|...|+++.+-++.- T Consensus 69 d~--~~---d-~~~~~~~~i~~g~~~~~~~~~--~~~e~~~~--~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v~d~~~l 138 (449) T protein:vir:10 69 EK--LV---G-KCWQTNPEIIEGDDADDSEDE--TSWEKKSK--QVFTNRLWRSFAEADRRRLVGRYAGILLHIRDEKDW 138 (449) T ss_pred Hh--hh---h-hhhhcCcccccCccccchhhh--HHHHHHHH--HHHHHHHHHHHHHHHHhhhccCcEEEEEEecCCCCC Confidence 31 21 1 10 000 001111111111 11111110 1111244445666654 67878888887633322 Q ss_pred CCC--cccccccchhceec-----cHHHHhccCCCcceeeEeCC-CCccccc--------------CCCCeEEEeeCCCc Q lcl|NC_011057. 150 PAQ--PDGSVRTRQEWYAV-----SKEEIKKSNKGSGTNIVLPT-GEEHEFV--------------KGTDIIFRVWIPKP 207 (634) Q Consensus 150 ~~~--~dg~~~~~~~W~~v-----t~~Ei~~~~~~~~~~i~lP~-g~~h~~~--------------~~~D~~~RvW~P~p 207 (634) ... +++.+ ..-.++ |..++.....+ |+ |++..|. 
-...-+|++ + T Consensus 139 ~~Pl~~~~~i---~~i~v~~~~~i~~~~~~~dp~s-------p~yg~P~~y~v~~~~~g~~~~~~~iH~SRl~~~-~--- 204 (449) T protein:vir:10 139 NLPATKGRGL---QKVSVSWAGSLKVAEWDTGINS-------KTYGQPKLWKYTERLPNGSSRRVDIHPDRVFIL-G--- 204 (449) T ss_pred CcccccCcce---eeEEeeccccCChhhhhcCCCC-------CCCCCceEEEEeeeccCCCccceeeccceeEee-c--- Confidence 111 22221 122222 33333221111 11 3333331 111223333 1 Q ss_pred ccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhhC----ceeeecc-c-ccCCCCcCCCCcCCCCCCCccccchHHH Q lcl|NC_011057. 208 RKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGN----GVLFVPH-E-MSLPAAQGPVSEVEGEEIAPLVGEPAVQ 281 (634) Q Consensus 208 rra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gn----GvlfvP~-e-~slP~~~~p~a~~~~~~~~p~~g~~a~~ 281 (634) .+.....|-.+++.+.| +.+.+.-...+.|.|.+. ++.|.=. . .+|... .+.+.. ...+ T Consensus 205 ~~~~~g~~~L~~~yn~l---~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~-------~~~~~e-----~~~~ 269 (449) T protein:vir:10 205 DYSEDAIGFLEPAYNAF---VSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASL-------YGVSID-----ELQD 269 (449) T ss_pred CCCCCChhHHHHHHHHh---hhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHH-------hhCCch-----HHHH Confidence 11122345566666644 444444444444444321 1221100 0 111110 011111 1112 Q ss_pred HHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCCchhHHHHHHHHHHHHHHhhhccCChHHhhccccCc Q lcl|NC_011057. 282 QLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQT 361 (634) Q Consensus 282 ~l~~ml~qva~tai~De~S~AA~vPiva~vP~Ehi~~ikHl~f~~d~te~aiktR~daI~rlA~~~D~~pE~LLGlgs~~ 361 (634) ++.+++.... .-.++ ++...++..+ ++-..|. +++++--. ....||-+.+||-=+|+|-.... 
T Consensus 270 ~~~~~~~~~~----~~~~~-------~~i~~~~d~~-~~~~~~s-gl~d~l~~----~~q~iaaa~~IP~t~L~Gqsp~g 332 (449) T protein:vir:10 270 KFNEVAGEIN----RGNDV-------LMTTQGATVT-PLVTSVA-DPTATYNV----NLQTAAAGVDIPTRILIGNQQAE 332 (449) T ss_pred HHHHHHHHHh----ccchh-------eeecCCcceE-EEecccC-ChhHHHHH----HHHHHHHHhCCCeeeeeccCccc Confidence 2222211111 11111 1112222221 1123343 66665332 34458999999999999986666 Q ss_pred chhhH----hhhhhhhhhHHHHhHHHHHHHHHHHHHHHHHHHhcCCChhHhee-eecCcccccCCCch-------HHHHH Q lcl|NC_011057. 362 NHWSA----WQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVV-WYDASQLTIDPDKS-------DEAKF 429 (634) Q Consensus 362 NhwtA----w~i~de~v~~hI~P~~~~i~~ait~~~lr~~L~~eG~d~~~yV~-w~DaS~L~~~pd~t-------~eA~~ 429 (634) +-.|. |.-.-...|-++.|+++.|++-|.+. .+ |=.|.+|-| |-+...++- -++. +.+.. T Consensus 333 lnst~D~~nyyd~i~~~Q~~l~p~le~l~~~l~~s----~~---g~~~~d~~i~f~pL~~~t~-kEkAei~k~~A~a~~~ 404 (449) T protein:vir:10 333 RSSTEDQKYFNARCQSRRVDLSFEIEDFCDKLIEL----KI---IDAVAKKAVIWDDLNEQTG-TEKLTNAKTMGEINQT 404 (449) T ss_pred cccchhHHHHHHHHHHHHHhhhHHHHHHHHHHHHh----hc---CCCCCceeEEeCCCCCCCH-HHHHHHHHHHHHHHHH Confidence 55441 11111234555677777776654332 11 212335544 443333322 1222 22234 Q ss_pred HHHcc---CCCHHHHHHHhCCCccccCCCCCHHHHHHHHHHHhhcCcccchhhhhhhhhhhhcccCCCCCCCCCCCCCCC Q lcl|NC_011057. 430 AYENG---AINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLQQIEFPQQQQAIDSGGNED 506 (634) Q Consensus 430 ~~~~G---~It~ealr~~~Gl~ed~~yd~~t~Eg~r~wA~d~v~~dp~Li~~laPll~p~~q~~~~P~p~~a~~~~~~~~ 506 (634) +++.| +|+.+|+|.++|+.-.+++ | ..++++ T Consensus 405 ~~~ag~~~~~~~~EiR~~~~~~~~~~~---------------------------------------~-------~~~e~~ 438 (449) T protein:vir:10 405 MLGSGDNPAFSREEIRTAAGYDNDDEE---------------------------------------P-------LGEEDG 438 (449) T ss_pred HHHccccCCcCHHHHHHHhcccCCCCC---------------------------------------C-------CCCCCC Confidence 66666 9999999998877421110 0 000111 Q ss_pred CccccCCCCCCCCCCCCCCCCCcc Q lcl|NC_011057. 
507 TSDDDNLDDGEHEPDTEDDQDDDG 530 (634) Q Consensus 507 ~~~d~~~~~~~~ePDTe~d~~~~~ 530 (634) ++.++..+ +++ T Consensus 439 de~~~~~d-------------~~a 449 (449) T protein:vir:10 439 DEEDKATD-------------SAA 449 (449) T ss_pred ccccccCC-------------cCC Confidence 11110000 000 Done!