Query lcl|NC_021303.1_cdsid_YP_008051310.1 [gene=8] [protein=portal protein] [protein_id=YP_008051310.1] [location=4440..6353] Match_columns 637 No_of_seqs 18 out of 21 Neff 3.3 Searched_HMMs 1612 Date Thu Nov 7 17:37:36 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_8 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_8_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:107517 Length: 639 100.0 0E+00 0E+00 1944.0 57.2 637 1-637 1-639 (639) 2 protein:vir:97900 Length: 639 100.0 0E+00 0E+00 1944.0 57.2 637 1-637 1-639 (639) 3 protein:vir:106027 Length: 629 100.0 0E+00 0E+00 1834.1 55.7 624 1-637 1-629 (629) 4 protein:vir:99088 Length: 629 100.0 0E+00 0E+00 1817.8 55.3 626 1-637 1-629 (629) 5 protein:vir:8654 Length: 629 # 100.0 0E+00 0E+00 1817.2 55.3 626 1-637 1-629 (629) 6 protein:vir:102426 Length: 631 100.0 6E-321 5E-324 1776.3 54.7 624 1-636 1-631 (631) 7 protein:vir:106491 Length: 646 100.0 3E-286 2E-289 1586.0 49.1 592 1-637 1-627 (646) 8 protein:vir:483 Length: 413 # 99.5 2.8E-14 1.7E-17 94.9 22.9 404 1-531 1-413 (413) 9 protein:vir:1326 Length: 457 # 99.4 7.1E-14 4.4E-17 92.7 21.6 430 1-524 1-457 (457) 10 protein:vir:7853 Length: 518 # 99.4 1.6E-13 9.7E-17 90.8 22.5 474 23-598 1-518 (518) 11 protein:vir:6240 Length: 457 # 99.4 1E-13 6.3E-17 91.8 21.0 437 1-529 1-457 (457) 12 protein:vir:101648 Length: 518 99.4 3.3E-13 2E-16 89.0 23.0 474 23-598 1-518 (518) 13 protein:vir:101647 Length: 460 99.4 2.9E-14 1.8E-17 94.8 16.7 417 1-517 1-460 (460) 14 protein:vir:102080 Length: 429 99.4 3.9E-13 2.4E-16 88.6 22.2 421 1-516 1-429 (429) 15 protein:vir:4454 Length: 414 # 99.4 4.3E-13 2.7E-16 88.4 22.2 405 1-526 1-414 (414) 16 protein:vir:960 Length: 413 # 99.4 3.6E-13 2.2E-16 88.8 20.6 399 1-518 4-413 (413) 17 protein:vir:8418 Length: 409 # 99.4 5.3E-13 3.3E-16 87.9 21.3 402 1-517 1-409 (409) 18 protein:vir:81218 Length: 423 99.3 1.4E-12 8.8E-16 85.5 22.2 413 1-520 1-423 (423) 19 protein:vir:4337 Length: 434 # 99.3 1.4E-12 8.9E-16 85.5 22.2 419 1-522 1-434 (434) 20 protein:vir:1266 Length: 416 # 99.3 1.2E-12 7.5E-16 85.9 19.9 408 1-522 1-416 (416) 21 protein:vir:107605 Length: 432 99.3 3.6E-12 2.2E-15 83.3 22.4 422 1-516 1-432 (432) 22 protein:vir:105002 Length: 432 99.3 3.6E-12 2.2E-15 83.3 22.4 422 1-516 1-432 (432) 23 protein:vir:102855 Length: 432 99.3 3.6E-12 2.2E-15 83.3 22.4 422 1-516 1-432 (432) 24 protein:vir:1380 Length: 422 # 99.3 1.9E-12 1.2E-15 84.8 20.7 405 6-517 1-422 (422) 25 protein:vir:100150 Length: 437 99.3 1.4E-12 8.8E-16 85.5 18.0 421 1-529 1-437 (437) 26 protein:vir:4509 Length: 424 # 99.2 7.4E-12 4.6E-15 81.6 20.4 403 1-522 17-424 (424) 27 protein:vir:102727 Length: 945 99.2 8E-11 5E-14 75.9 25.5 525 1-637 27-623 (945) 28 protein:vir:3868 Length: 417 # 99.2 1E-11 6.2E-15 80.9 20.3 416 6-525 1-417 (417) 29 protein:vir:93610 Length: 454 99.2 3.7E-11 2.3E-14 77.8 22.8 437 5-555 1-454 (454) 30 protein:vir:95378 Length: 406 99.2 1.2E-11 7.3E-15 80.5 19.7 400 1-521 1-406 (406) 31 protein:vir:100249 Length: 431 99.2 2.6E-11 1.6E-14 78.6 21.4 403 1-527 1-431 (431) 32 protein:vir:94666 Length: 723 99.2 3.9E-11 2.4E-14 77.7 22.1 511 31-637 1-617 (723) 33 protein:vir:81152 Length: 411 99.1 2.3E-11 1.4E-14 78.9 19.5 396 18-518 1-411 (411) 34 protein:vir:80333 Length: 419 99.1 1.8E-11 1.1E-14 79.5 18.8 409 1-551 1-419 (419) 35 protein:vir:189 Length: 424 # 99.1 5.3E-11 3.3E-14 76.9 21.1 409 1-512 1-424 (424) 36 protein:vir:10362 Length: 432 99.1 7.4E-11 4.6E-14 76.1 21.9 413 1-524 1-432 (432) 37 protein:vir:5737 Length: 419 # 99.1 2.8E-11 1.7E-14 78.4 19.0 414 6-533 1-419 (419) 38 protein:vir:9702 Length: 406 # 99.1 1.1E-10 6.8E-14 75.2 21.4 398 1-525 1-406 (406) 39 protein:vir:81072 Length: 432 99.1 4.2E-11 2.6E-14 77.5 18.9 413 3-524 1-432 (432) 40 protein:vir:1884 Length: 424 # 99.1 1.2E-10 7.5E-14 75.0 21.1 410 1-512 1-424 (424) 41 protein:vir:1431 Length: 419 # 99.1 2.2E-10 1.4E-13 73.6 22.2 408 1-533 1-419 (419) 42 protein:vir:102118 Length: 409 99.1 6.2E-11 3.8E-14 76.6 18.7 400 1-517 1-409 (409) 43 protein:vir:3153 Length: 467 # 99.0 1.4E-10 8.4E-14 74.7 19.9 413 54-534 1-467 (467) 44 protein:vir:8317 Length: 409 # 99.0 2.5E-11 1.5E-14 78.7 14.9 362 1-461 11-409 (409) 45 protein:vir:100882 Length: 383 99.0 2.7E-10 1.7E-13 73.0 20.0 374 1-508 1-383 (383) 46 protein:vir:4194 Length: 540 # 99.0 1.2E-09 7.5E-13 69.5 22.7 477 1-599 6-540 (540) 47 protein:vir:105064 Length: 421 99.0 7.9E-10 4.9E-13 70.5 21.7 408 1-528 1-421 (421) 48 protein:vir:9359 Length: 348 # 99.0 1.3E-10 8E-14 74.8 17.2 338 75-523 1-348 (348) 49 protein:vir:97060 Length: 432 99.0 9.2E-10 5.7E-13 70.1 21.9 413 1-524 1-432 (432) 50 protein:vir:2683 Length: 412 # 99.0 1.4E-10 8.7E-14 74.6 17.3 403 1-509 1-412 (412) 51 protein:vir:80134 Length: 403 99.0 8.9E-10 5.5E-13 70.2 21.0 393 1-521 1-403 (403) 52 protein:vir:104259 Length: 403 98.9 2.7E-09 1.7E-12 67.6 22.9 391 1-516 1-403 (403) 53 protein:vir:93943 Length: 409 98.9 2E-10 1.2E-13 73.8 16.4 402 1-523 1-409 (409) 54 protein:vir:78641 Length: 278 98.8 4.9E-10 3E-13 71.6 15.8 272 75-417 1-278 (278) 55 protein:vir:3843 Length: 397 # 98.8 4.3E-09 2.7E-12 66.5 20.3 384 1-519 1-397 (397) 56 protein:vir:94426 Length: 409 98.8 1.5E-09 9.1E-13 69.0 17.7 402 1-522 1-409 (409) 57 protein:vir:6210 Length: 394 # 98.8 1.8E-09 1.1E-12 68.5 17.9 383 1-520 1-394 (394) 58 protein:vir:100187 Length: 385 98.8 1.1E-08 6.7E-12 64.3 21.3 374 1-507 1-385 (385) 59 protein:vir:80796 Length: 574 98.8 1.2E-08 7.2E-12 64.1 21.5 490 1-571 27-574 (574) 60 protein:vir:80644 Length: 551 98.8 2.1E-09 1.3E-12 68.1 17.3 479 1-567 5-551 (551) 61 protein:vir:81095 Length: 416 98.8 1.2E-09 7.2E-13 69.6 15.3 402 6-523 1-416 (416) 62 protein:vir:4598 Length: 416 # 98.8 1.2E-09 7.2E-13 69.6 15.3 402 6-523 1-416 (416) 63 protein:vir:79772 Length: 648 98.7 7E-08 4.4E-11 59.8 26.2 524 1-637 1-609 (648) 64 protein:vir:4854 Length: 386 # 98.7 2.1E-09 1.3E-12 68.2 14.7 374 6-491 1-386 (386) 65 protein:vir:4156 Length: 542 # 98.7 3.4E-08 2.1E-11 61.5 21.1 487 1-599 1-542 (542) 66 protein:vir:10321 Length: 495 98.6 3E-08 1.9E-11 61.8 19.4 443 1-517 1-495 (495) 67 protein:vir:96980 Length: 409 98.6 2E-08 1.2E-11 62.9 17.5 402 1-522 1-409 (409) 68 protein:vir:63755 Length: 547 98.6 1.9E-08 1.2E-11 62.9 17.2 477 1-567 1-547 (547) 69 protein:vir:96579 Length: 576 98.6 1.4E-08 8.6E-12 63.7 16.2 484 1-572 27-576 (576) 70 protein:vir:98396 Length: 441 98.5 1.2E-07 7.2E-11 58.6 20.5 412 1-519 1-441 (441) 71 protein:vir:3420 Length: 533 # 98.5 4.2E-08 2.6E-11 61.0 18.1 446 1-531 1-533 (533) 72 protein:vir:6382 Length: 553 # 98.5 2.9E-07 1.8E-10 56.4 22.7 463 1-525 1-553 (553) 73 protein:vir:99312 Length: 563 98.5 1.4E-07 8.7E-11 58.2 20.8 446 1-567 55-563 (563) 74 protein:vir:95599 Length: 563 98.5 1.4E-07 8.7E-11 58.2 20.8 446 1-567 55-563 (563) 75 protein:vir:94002 Length: 378 98.5 1E-07 6.5E-11 58.9 19.3 375 6-521 1-378 (378) 76 protein:vir:4952 Length: 386 # 98.5 6.2E-08 3.8E-11 60.1 16.9 373 6-522 1-386 (386) 77 protein:vir:95965 Length: 385 98.4 1.1E-07 6.7E-11 58.8 18.1 371 1-507 1-385 (385) 78 protein:vir:4995 Length: 384 # 98.4 2.8E-08 1.7E-11 62.0 14.7 369 6-461 1-384 (384) 79 protein:vir:79984 Length: 441 98.4 3.5E-07 2.2E-10 56.0 20.4 406 1-519 23-441 (441) 80 protein:vir:9408 Length: 441 # 98.4 3.5E-07 2.2E-10 56.0 20.4 406 1-519 23-441 (441) 81 protein:vir:8100 Length: 466 # 98.4 7.1E-08 4.4E-11 59.8 16.2 421 6-516 1-466 (466) 82 protein:vir:1661 Length: 378 # 98.4 3E-07 1.9E-10 56.3 19.6 375 6-521 1-378 (378) 83 protein:vir:100691 Length: 535 98.4 8.9E-07 5.5E-10 53.8 23.7 463 1-538 1-535 (535) 84 protein:vir:101289 Length: 395 98.3 5.6E-07 3.5E-10 54.9 19.6 386 1-523 1-395 (395) 85 protein:vir:9507 Length: 395 # 98.3 5.6E-07 3.5E-10 54.9 19.6 386 1-523 1-395 (395) 86 protein:vir:100650 Length: 395 98.3 5.6E-07 3.5E-10 54.9 19.6 386 1-523 1-395 (395) 87 protein:vir:95542 Length: 548 98.3 7.2E-07 4.5E-10 54.3 19.1 456 1-537 1-548 (548) 88 protein:vir:96738 Length: 505 98.2 9.2E-07 5.7E-10 53.7 18.9 425 1-517 1-505 (505) 89 protein:vir:93867 Length: 378 98.2 1.1E-06 6.6E-10 53.3 19.0 375 6-525 1-378 (378) 90 protein:vir:858 Length: 378 # 98.2 5.6E-07 3.5E-10 54.8 17.3 375 1-521 1-378 (378) 91 protein:vir:78310 Length: 376 98.2 1.4E-06 8.9E-10 52.6 19.5 369 1-515 1-376 (376) 92 protein:vir:4828 Length: 382 # 98.2 2E-07 1.2E-10 57.3 14.2 371 1-495 1-382 (382) 93 protein:vir:389 Length: 530 # 98.2 1.2E-06 7.2E-10 53.1 18.3 461 1-525 1-530 (530) 94 protein:vir:94869 Length: 378 98.1 4.2E-06 2.6E-09 50.1 19.6 375 1-521 1-378 (378) 95 protein:vir:7407 Length: 392 # 98.0 2E-06 1.2E-09 51.8 16.9 384 1-497 1-392 (392) 96 protein:vir:3989 Length: 392 # 98.0 3.1E-06 1.9E-09 50.8 17.9 385 1-520 1-392 (392) 97 protein:vir:1023 Length: 392 # 98.0 3.1E-06 1.9E-09 50.8 17.9 385 1-520 1-392 (392) 98 protein:vir:79538 Length: 502 98.0 6.5E-06 4E-09 49.0 20.4 439 1-522 11-502 (502) 99 protein:vir:1150 Length: 350 # 98.0 2.6E-07 1.6E-10 56.7 10.6 337 1-417 1-350 (350) 100 protein:vir:1082 Length: 359 # 97.9 1.1E-05 7E-09 47.7 18.9 351 1-451 1-359 (359) 101 protein:vir:3780 Length: 345 # 97.9 4.2E-07 2.6E-10 55.5 10.5 327 9-419 1-345 (345) 102 protein:vir:6058 Length: 344 # 97.8 1.5E-06 9.3E-10 52.5 12.1 327 8-422 1-344 (344) 103 protein:vir:4089 Length: 395 # 97.7 2.4E-05 1.5E-08 45.9 19.1 370 36-526 1-395 (395) 104 protein:vir:79150 Length: 368 97.6 2.5E-06 1.6E-09 51.3 11.1 355 1-438 1-368 (368) 105 protein:vir:267 Length: 348 # 97.6 3.8E-06 2.4E-09 50.3 11.8 333 18-436 1-348 (348) 106 protein:vir:2013 Length: 344 # 97.6 7E-06 4.3E-09 48.9 12.9 319 8-422 1-344 (344) 107 protein:vir:3743 Length: 345 # 97.5 2E-06 1.2E-09 51.9 9.4 315 9-419 1-345 (345) 108 protein:vir:98567 Length: 340 97.5 7.4E-06 4.6E-09 48.7 12.1 322 1-422 1-340 (340) 109 protein:vir:78191 Length: 351 97.4 1.2E-05 7.2E-09 47.7 12.4 329 8-424 1-351 (351) 110 protein:vir:78749 Length: 337 97.4 2.7E-06 1.7E-09 51.1 8.6 324 9-418 1-337 (337) 111 protein:vir:5691 Length: 344 # 97.4 2E-05 1.2E-08 46.4 13.2 320 8-415 1-344 (344) 112 protein:vir:103971 Length: 376 96.7 7.7E-05 4.8E-08 43.1 11.2 345 1-424 1-376 (376) 113 protein:vir:98643 Length: 395 96.6 0.00046 2.8E-07 38.9 19.2 386 1-526 1-395 (395) 114 protein:vir:98853 Length: 219 96.4 6.7E-05 4.2E-08 43.5 8.9 204 169-427 1-219 (219) 115 protein:vir:100328 Length: 346 96.4 0.00057 3.5E-07 38.4 13.9 334 1-422 1-346 (346) 116 protein:vir:9641 Length: 395 # 96.3 0.00072 4.5E-07 37.8 18.8 383 1-526 1-395 (395) 117 protein:vir:79207 Length: 351 96.2 0.00053 3.3E-07 38.5 12.8 324 8-438 1-351 (351) 118 protein:vir:99452 Length: 651 94.0 0.0056 3.5E-06 32.9 18.3 512 1-604 75-651 (651) 119 protein:vir:94049 Length: 532 93.4 0.0076 4.7E-06 32.2 20.1 444 1-556 1-532 (532) 120 protein:vir:107742 Length: 537 47.3 0.71 0.00044 21.4 21.5 434 1-535 25-537 (537) 121 protein:vir:97265 Length: 513 37.2 1.1 0.0007 20.3 17.7 451 1-522 1-513 (513) 122 protein:vir:4698 Length: 251 # 26.3 2 0.0012 19.0 12.2 243 1-315 1-251 (251) No 1 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=100.00 E-value=0 Score=1944.01 Aligned_cols=637 Identities=99% Similarity=1.405 Sum_probs=630.6 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) ||||||||||||||+||++||++||||||+++||+++|||+++|++|++||+|||++||+||||||||||++|||||||| T Consensus 1 ma~~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~rL 80 (639) T protein:vir:10 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSWRANSCSRTTL 80 (639) T ss_pred CCccceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhhhhhhhceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ||||||||||+|||+|++|++|++++++++|++||||+|||+|||||+++|||||||+|||+|+|++++++||+.+++++ T Consensus 81 ~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~~~ 160 (639) T protein:vir:10 81 IPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (639) T ss_pred EeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeHHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhh Q lcl|NC_021303. 161 WYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVM 240 (637) Q Consensus 161 W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~ 240 (637) ||+||++||++|+|++++|+||||++|||++++|+|||||||||||++|||||||+||++||||+||||+|+|++||||| T Consensus 161 W~vvs~~Ei~~~~~~~~~i~lPdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaakSRl~ 240 (639) T protein:vir:10 161 WYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVM 240 (639) T ss_pred eeeeeHHHhcccCCCeeEeecCCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_021303. 241 NNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKV 320 (637) Q Consensus 241 gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~i 320 (637) ||||||||||||||++++|+|+|++++||++||++.|+|++++||+||||||+|||+||+|+||+||||+++|+|||+|| T Consensus 241 gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~p~E~l~~i 320 (639) T protein:vir:10 241 NNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKV 320 (639) T ss_pred hCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEeechHHhcCe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHH Q lcl|NC_021303. 321 QHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAR 400 (637) Q Consensus 321 kHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~ 400 (637) |||||+|||||++||||||||+||||||||||||||||||+||||||||+||||||||+|+|++||+|||+|||||+|++ T Consensus 321 khl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~~~Lrp~Le~ 400 (639) T protein:vir:10 321 QHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAR 400 (639) T ss_pred eeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEecccceeeecchhHHHHHHHHHhhHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHH Q lcl|NC_021303. 401 EGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYA 480 (637) Q Consensus 401 eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~a 480 (637) |||||+|||||||+|+||+||||||||++|||||+||+|||||||||++++||||+|+|+|++||+++|+++|+||++++ T Consensus 401 eGvDp~kYvvW~DaS~Lt~dPd~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~~V~~~P~li~~~a 480 (639) T protein:vir:10 401 EGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYA 480 (639) T ss_pred hCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHHHHhcCCcchhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccCCCCcCCCCCCCCCCC--CCCCCCCCCCccCCCCCCCcccCCCCcchHHHHHHHHHHHHHHHHhccccc Q lcl|NC_021303. 481 PLLSSQLAGIEFPQPANAIESTREEDD--EDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRF 558 (637) Q Consensus 481 pLl~~~~~~ie~P~p~~a~~~~~~~~d--~~~~a~~g~EPdted~~~~~~~a~~~~~a~~~aa~~llV~rALelAGkRr~ 558 (637) ||++|.+|.++||+|+++.++++++++ ++++++++.|||||+++.+..++++++++.+++++++||+||||||||||. T Consensus 481 pl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga~~~~ePdte~~~~~~~a~~~~~~a~~v~a~~llv~RALelAGkRr~ 560 (639) T protein:vir:10 481 PLLSSQLAGIEFPQPANAIESTREDEEDDEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRF 560 (639) T ss_pred hccCccceecccCCCCCCCCCCCCCCCcccccCCCCCcCCCcccccCCccccCcCchhHHHHHHHHHHHHHHHhhccccc Confidence 999999999999999999999998764 445677778999999999999999999999999999999999999999999 Q ss_pred CCCchhhhhHhhcCchhhhhhhcCCCCHHHHHHHHhcccccccHHHHHHhCCCHHHHHHHHHHHHHHHHHhhhhccccC Q lcl|NC_021303. 559 KVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAGWDTALEDEVVASLGLDNEKLRNAVLATVRRQLTQPLIEGEVV 637 (637) Q Consensus 559 ~~~~~~~~~rlr~ip~h~~h~~~~PV~~~~v~rLi~GWd~~ld~~~~a~lG~Dp~~lr~~v~~~v~~~lt~~vvd~~v~ 637 (637) +++++++++|||+||+|+||+||+||++++|+|||+|||++||++++++||+|++|||++|++||+++||++||||||| T Consensus 561 ~~~~r~~~a~~r~vp~he~H~~l~Pv~~~~~~rli~gwd~~ld~~~~a~lg~D~~~lr~~v~~~v~~~lt~~~i~~ev~ 639 (639) T protein:vir:10 561 KVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAGWDTALEDEVVASLGLDNEKLRNAVLATVRRQLTQPLIEGEVV 639 (639) T ss_pred CCCChhhHHHhhcCChhHceeecCCCChHHHHHHHHHHHhHHHHHHHHHhCCCHHHHHHHHHHHHHHHHhhhhhccccC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=100.00 E-value=0 Score=1944.01 Aligned_cols=637 Identities=99% Similarity=1.405 Sum_probs=630.6 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) ||||||||||||||+||++||++||||||+++||+++|||+++|++|++||+|||++||+||||||||||++|||||||| T Consensus 1 ma~~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~rL 80 (639) T protein:vir:97 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSWRANSCSRTTL 80 (639) T ss_pred CCccceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhhhhhhhceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ||||||||||+|||+|++|++|++++++++|++||||+|||+|||||+++|||||||+|||+|+|++++++||+.+++++ T Consensus 81 ~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~~~ 160 (639) T protein:vir:97 81 IPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (639) T ss_pred EeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeHHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhh Q lcl|NC_021303. 161 WYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVM 240 (637) Q Consensus 161 W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~ 240 (637) ||+||++||++|+|++++|+||||++|||++++|+|||||||||||++|||||||+||++||||+||||+|+|++||||| T Consensus 161 W~vvs~~Ei~~~~~~~~~i~lPdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaakSRl~ 240 (639) T protein:vir:97 161 WYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVM 240 (639) T ss_pred eeeeeHHHhcccCCCeeEeecCCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_021303. 241 NNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKV 320 (637) Q Consensus 241 gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~i 320 (637) ||||||||||||||++++|+|+|++++||++||++.|+|++++||+||||||+|||+||+|+||+||||+++|+|||+|| T Consensus 241 gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~p~E~l~~i 320 (639) T protein:vir:97 241 NNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKV 320 (639) T ss_pred hCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEeechHHhcCe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHH Q lcl|NC_021303. 321 QHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAR 400 (637) Q Consensus 321 kHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~ 400 (637) |||||+|||||++||||||||+||||||||||||||||||+||||||||+||||||||+|+|++||+|||+|||||+|++ T Consensus 321 khl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~~~Lrp~Le~ 400 (639) T protein:vir:97 321 QHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAR 400 (639) T ss_pred eeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEecccceeeecchhHHHHHHHHHhhHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHH Q lcl|NC_021303. 401 EGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYA 480 (637) Q Consensus 401 eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~a 480 (637) |||||+|||||||+|+||+||||||||++|||||+||+|||||||||++++||||+|+|+|++||+++|+++|+||++++ T Consensus 401 eGvDp~kYvvW~DaS~Lt~dPd~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~~V~~~P~li~~~a 480 (639) T protein:vir:97 401 EGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYA 480 (639) T ss_pred hCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHHHHhcCCcchhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccCCCCcCCCCCCCCCCC--CCCCCCCCCCccCCCCCCCcccCCCCcchHHHHHHHHHHHHHHHHhccccc Q lcl|NC_021303. 481 PLLSSQLAGIEFPQPANAIESTREEDD--EDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRF 558 (637) Q Consensus 481 pLl~~~~~~ie~P~p~~a~~~~~~~~d--~~~~a~~g~EPdted~~~~~~~a~~~~~a~~~aa~~llV~rALelAGkRr~ 558 (637) ||++|.+|.++||+|+++.++++++++ ++++++++.|||||+++.+..++++++++.+++++++||+||||||||||. T Consensus 481 pl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga~~~~ePdte~~~~~~~a~~~~~~a~~v~a~~llv~RALelAGkRr~ 560 (639) T protein:vir:97 481 PLLSSQLAGIEFPQPANAIESTREDEEDDEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRF 560 (639) T ss_pred hccCccceecccCCCCCCCCCCCCCCCcccccCCCCCcCCCcccccCCccccCcCchhHHHHHHHHHHHHHHHhhccccc Confidence 999999999999999999999998764 445677778999999999999999999999999999999999999999999 Q ss_pred CCCchhhhhHhhcCchhhhhhhcCCCCHHHHHHHHhcccccccHHHHHHhCCCHHHHHHHHHHHHHHHHHhhhhccccC Q lcl|NC_021303. 559 KVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAGWDTALEDEVVASLGLDNEKLRNAVLATVRRQLTQPLIEGEVV 637 (637) Q Consensus 559 ~~~~~~~~~rlr~ip~h~~h~~~~PV~~~~v~rLi~GWd~~ld~~~~a~lG~Dp~~lr~~v~~~v~~~lt~~vvd~~v~ 637 (637) +++++++++|||+||+|+||+||+||++++|+|||+|||++||++++++||+|++|||++|++||+++||++||||||| T Consensus 561 ~~~~r~~~a~~r~vp~he~H~~l~Pv~~~~~~rli~gwd~~ld~~~~a~lg~D~~~lr~~v~~~v~~~lt~~~i~~ev~ 639 (639) T protein:vir:97 561 KVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAGWDTALEDEVVASLGLDNEKLRNAVLATVRRQLTQPLIEGEVV 639 (639) T ss_pred CCCChhhHHHhhcCChhHceeecCCCChHHHHHHHHHHHhHHHHHHHHHhCCCHHHHHHHHHHHHHHHHhhhhhccccC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=100.00 E-value=0 Score=1834.09 Aligned_cols=624 Identities=54% Similarity=0.951 Sum_probs=608.1 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccc-cccchhhHHHHHhhhhhhHhhHhhhhhcceeeeE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGT-ARNEWQSEAWDFSESIGELSYYISWRANSCSRTT 79 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~-~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (637) ||+|||||||||||+| + ||+|+|||||. +|+++||++.+|. +|++||+|||+|||+|||||||||||+||||||| T Consensus 1 ma~~~lrv~rrpk~~p-~--~r~l~aasqp~-~P~~~~~~~~~g~~~~~~WQ~eAW~~~d~VgElryyvgW~~ss~Sr~r 76 (629) T protein:vir:10 1 MAASTLRVSRRPKGSP-A--RRSLTAASQPM-EPGRTPSRQVAGTVVRTSWQNEAWECMDLVGELRYYVGWRASSCSRVE 76 (629) T ss_pred CCccceeEEecCCCcc-c--eeeeccccCCC-CcchhhchhhhhhhhhhhhhHHHHHHHHhhhhHHHHhhhhhhhheeee Confidence 9999999999999995 3 77999999997 7999999998885 6999999999999999999999999999999999 Q ss_pred EEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccc Q lcl|NC_021303. 80 LIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRA 159 (637) Q Consensus 80 L~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~ 159 (637) |||||||||||+|||+|+ ||+|.|++|++||+.||||+|||+|||||+++|||||||+|||||.|+++++.+ ..++ T Consensus 77 L~as~idpDtg~ptg~i~-ed~p~~~~v~~~v~~iagG~lGqaqLlkr~~~~ltV~GE~~i~il~~~~~~pd~---~~r~ 152 (629) T protein:vir:10 77 LIASELDPDTGKPTGGIR-DDDPDGLRFLEIVKTMAGGPLGQAQLQKRAAECLTVPGEHRICLLDQGDKNPDG---SVRH 152 (629) T ss_pred EEEeeecCCCCCCccccc-cCchhHHHHHHHHHHhcCccchHHHHHHHHHhheeccCceEEEEeecCCCCCCc---cccc Confidence 999999999999999998 899999999999999999999999999999999999999999999999997666 4478 Q ss_pred cceeeeHHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHh Q lcl|NC_021303. 160 RWYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRV 239 (637) Q Consensus 160 ~W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL 239 (637) +||+||++||++|+++++.|+||||++|+|++++|+|||||||||||++|||||||+||++||||+||||+|+|++|||| T Consensus 153 ~W~vVt~~Ei~~kg~g~~~i~lpdg~~he~~~~~D~l~RvW~P~Prr~~e~DSpvra~l~~lrEi~r~tk~i~~aakSRL 232 (629) T protein:vir:10 153 NWYVVTNDEVKNKGAGKTDIELPDGTIHEYSKGRDVMFRVWNPRPRRAKEPDSPVRACLDSLREIIRTTKKIRNASKSRL 232 (629) T ss_pred ceeeecHHHhccccCceeEEEcCCCceeeeeCCCeeEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHhHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcc Q lcl|NC_021303. 240 MNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEK 319 (637) Q Consensus 240 ~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ 319 (637) |||||||||||||||++++|++.++ ||++||.+.|++++++||+||||||+|||+||+|+||+||||+++|+|||+| T Consensus 233 ~gnGvlflP~e~slp~~~ap~~~~~---Pg~~~p~~~g~aa~d~l~~~l~q~a~aAi~De~S~aA~vPiia~vP~E~l~~ 309 (629) T protein:vir:10 233 IGNGVVFLPQELSLPRATAPVADNQ---PGAPVPIVDGVAAADELSNLLFQTAAAAVDDEDSQAALIPLLATVPGEHLQK 309 (629) T ss_pred hhCceeEeccCcccccccCCCCCCC---CcccccccCCCcchHHHHHHHHHHHHhhhcCCCCccceeeeEEeechHHhcC Confidence 9999999999999999999999888 8999999999999999999999999999999999999999999999999999 Q ss_pred cceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHH Q lcl|NC_021303. 320 VQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLL 398 (637) Q Consensus 320 ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L 398 (637) ||||||+|||||++||||||||+|||||||||||||||| ||+||||||||+||||||||+|+|++||+|||++|||++| T Consensus 310 ikhLkf~~eite~~iktR~daI~RlAmglDispErLLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~Ait~~~Lrp~L 389 (629) T protein:vir:10 310 IFHLKIGNEITEVEIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVQLHIKPVMEVLCAAIYREVLVATL 389 (629) T ss_pred eeeeeecCchhHHHHhhHHHHHHHHHhccCCChhheeeccCCccceeeEEecccceeeecchHHHHHHHHHHhHHHHHHH Confidence 999999999999999999999999999999999999999 5999999999999999999999999999999999999999 Q ss_pred HHhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHH Q lcl|NC_021303. 399 AREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAM 478 (637) Q Consensus 399 ~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~ 478 (637) ++|||||+|||||||+|+||+||||||||+++||||+||+|||||||||++++||||||+|+||+||+++|.++|+||++ T Consensus 390 ~~eGiDp~~Yvvw~DaS~Lt~dPd~~deA~~a~drGaIt~eAlRr~lG~~~dd~y~~~t~~~~q~~A~~~v~~~P~Li~~ 469 (629) T protein:vir:10 390 RAEGIDPDRYVLWYDASGLTVDPDKTDEATAAKEQGAITHEAYRRYLGLADEDGYDLETLEGAQAWARDAIVADPSLIKV 469 (629) T ss_pred HHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhccccccCCCcCCcHHHHHHHHHHhcCCCchhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccCCCCcchHHHHHHHHHHHHHHHHhccccc Q lcl|NC_021303. 479 YAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRF 558 (637) Q Consensus 479 ~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~a~~~~~a~~~aa~~llV~rALelAGkRr~ 558 (637) ++||+.+.+++|+||+|++++++++++++++|+++.++||+|||+ +.++++.++.+.+++|++|||+|||||||||++ T Consensus 470 ~apll~~~l~~i~~P~p~~a~~~~~~~~~~~E~~~~~~e~~~e~d--A~~a~~~~~~aa~~~A~rllv~RALelAGkRl~ 547 (629) T protein:vir:10 470 LAPLLTDELAEIDWPEPPAALPPGEDDQADEEQDTTGSEPSTEDD--AEAAARISSVADMVLAERLLTVRALGLAGKRRV 547 (629) T ss_pred hhhhcCCccccccccCCCCcCCCCCcccCccccCCCCCCcCCCcc--hhhcccCCchhhHHHHHHHHHHHHHHHcccccc Confidence 999999999999999999999999999999999999999999988 455667777789999999999999999999999 Q ss_pred CCCchhhhhHhhcCchhhhhhhcCCCCHHHHHHHHhcccccccHHHHHHhCCCHHH---HHHHHHHHHHHHHHhhhhccc Q lcl|NC_021303. 559 KVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAGWDTALEDEVVASLGLDNEK---LRNAVLATVRRQLTQPLIEGE 635 (637) Q Consensus 559 ~~~~~~~~~rlr~ip~h~~h~~~~PV~~~~v~rLi~GWd~~ld~~~~a~lG~Dp~~---lr~~v~~~v~~~lt~~vvd~~ 635 (637) +.++|.+++||+++|+|+||++|+||++++|+|||+|||++||++++++||+|++| |+++|+++|+++||++||||| T Consensus 548 ~~rdR~~~ar~~~vp~he~h~~l~Pv~~~~v~rli~gwd~~l~~~~~a~lg~D~~~~~~~~sav~~~v~~~lt~~~~~~e 627 (629) T protein:vir:10 548 NTNDRAQKARLAGIAPHDYHRVMGPVADADIPRLIAGWDEGLEEEALALLGVDSRRTEALRSAVRAQIRRELTMPVVDAE 627 (629) T ss_pred CCCchhhHHHhhcCChhhceeecCCCChhHHHHHHHhhhhHHHHHHHHHhCCChhhhHHHHHHHHHHHHHHhhhhhhccc Confidence 99999999999999999999999999999999999999999999999999999974 788899999999999999999 Q ss_pred cC Q lcl|NC_021303. 636 VV 637 (637) Q Consensus 636 v~ 637 (637) || T Consensus 628 v~ 629 (629) T protein:vir:10 628 VC 629 (629) T ss_pred cC Confidence 99 No 4 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=100.00 E-value=0 Score=1817.82 Aligned_cols=626 Identities=48% Similarity=0.818 Sum_probs=606.4 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) ||+|||||||||||+||.+||++||||||++++|++.|||+++++.+++||+|||+|||+||||||||||++|||||||| T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~rL 80 (629) T protein:vir:99 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQDDAWKAYDAVGELRYYVGWRSSSASRVRL 80 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhcccccchhhhhhcCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ||||||||||+|||+|+ |+++.+++|++||+.|+||+|||+|||||+++|||||||+|||+++|+++ +.|+...++.+ T Consensus 81 ~as~idpDtg~ptg~i~-e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~-~~d~~~~~~~e 158 (629) T protein:vir:99 81 IASAIDPDTGLPTGSID-EDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKS-RLDSNGNPVPE 158 (629) T ss_pred EeeeecCCCCCCccccC-CCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCC-ccCCCCcchhh Confidence 99999999999999997 89999999999999999999999999999999999999999999999655 67788888999 Q ss_pred ceeeeHHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhh Q lcl|NC_021303. 161 WYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVM 240 (637) Q Consensus 161 W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~ 240 (637) ||+||++|||+|+++ +.|.||+|++|||++++|+|||||||||||++|||||||+||++||||+||||+|+|++||||| T Consensus 159 W~~vt~~ei~~~~~~-~~i~lP~g~~~e~~~~~d~l~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakSRL~ 237 (629) T protein:vir:99 159 WLALTPEEVRASEKK-TIIELPTGDKHEFRDGLDGMFRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKSRLI 237 (629) T ss_pred heeechHHhhhccCc-eeEEcCCCCccceeCCCceEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHh Confidence 999999999988766 5599999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_021303. 241 NNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKV 320 (637) Q Consensus 241 gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~i 320 (637) ||||||||+|||||+.++|++.++ ||+++|++.+.|++++||+||||||+|||+||+|+||+||||+++|+|||++| T Consensus 238 gnGvlflP~e~slP~~~~p~~~n~---pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~~i 314 (629) T protein:vir:99 238 GNGVVFVPHEMSLPSMNAPVASNK---PGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIKNV 314 (629) T ss_pred hCceeEeccCcccCccCCCCCCCC---CCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCe Confidence 999999999999999999999888 89999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHH Q lcl|NC_021303. 321 QHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLA 399 (637) Q Consensus 321 kHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~ 399 (637) |||||+|||||++||||||||+|||||||||||||||| +|+||||||||+||||||||+|+|++||+|||+|||||+|+ T Consensus 315 ~hlkf~~ei~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp~Le 394 (629) T protein:vir:99 315 THLKFDNQVTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQVLRTVLM 394 (629) T ss_pred eEEeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchhHHHHHHHHHhhHHHHHHH Confidence 99999999999999999999999999999999999999 59999999999999999999999999999999999999999 Q ss_pred HhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHH Q lcl|NC_021303. 400 REGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMY 479 (637) Q Consensus 400 ~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~ 479 (637) +|||||+|||||||+|+||+||||||||++|||||+||+||||||+||++|+||||||+|+|+|||+|+|+++|+||++| T Consensus 395 ~eGiDp~kYvvW~DaS~Lt~dPd~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~Li~~~ 474 (629) T protein:vir:99 395 REGIDPNAYVVWHDASQLTVDPDKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNLLPTL 474 (629) T ss_pred HhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcchhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhccccccccCCCCcCCCCCCCCCC--CCCCCCCCCCCccCCCCCCCcccCCCCcchHHHHHHHHHHHHHHHHhcccc Q lcl|NC_021303. 480 APLLSSQLAGIEFPQPANAIESTREED--DEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRR 557 (637) Q Consensus 480 apLl~~~~~~ie~P~p~~a~~~~~~~~--d~~~~a~~g~EPdted~~~~~~~a~~~~~a~~~aa~~llV~rALelAGkRr 557 (637) +||+ +.+.+++||++.++++|+++++ |+++++++|+||+||+++++..+.+++..+...++|++||+|||||||||| T Consensus 475 a~l~-~~~a~~~~P~~~~~~pp~~e~~~~dE~sga~~~~ep~te~d~~~~~a~~aa~~~~~~a~V~llv~RALelAGkR~ 553 (629) T protein:vir:99 475 AVLI-PELADVEFPTPTVALPPAEEQDGDEEASGASRREEPDTEDDAGTDDSDQASLDSRETAMVEALVFRALELAGKRS 553 (629) T ss_pred hhhh-hhhcccccCccCCCCCccccCCCcccccCCCcCCCCCCCCCCcccccCCCCCCCcHHHHHHHHHHHHHHhcCCcC Confidence 9999 8889999999999999998864 666788999999999999888775555555667899999999999999997 Q ss_pred cCCCchhhhhHhhcCchhhhhhhcCCCCHHHHHHHHhcccccccHHHHHHhCCCHHHHHHHHHHHHHHHHHhhhhccccC Q lcl|NC_021303. 558 FKVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAGWDTALEDEVVASLGLDNEKLRNAVLATVRRQLTQPLIEGEVV 637 (637) Q Consensus 558 ~~~~~~~~~~rlr~ip~h~~h~~~~PV~~~~v~rLi~GWd~~ld~~~~a~lG~Dp~~lr~~v~~~v~~~lt~~vvd~~v~ 637 (637) | ++++++|||+||+|+||+||+||++++|+|||+|||++||++++++||+|++|||++|++||+++||+ .|||||| T Consensus 554 r---~r~~~ar~r~v~~he~h~~l~Pv~~~~i~rli~gwd~~ld~~~~~~Lg~d~~~lr~~v~a~v~~~lt~-~~~~ev~ 629 (629) T protein:vir:99 554 R---TRSLPYELRQLSDRELVRRLEPVRREHVADLIRGWDSMLEERAVQALNMNIPGIRAAVKRAVYGELTK-TIDGEVS 629 (629) T ss_pred C---ChhhHHHHhcCchhhceeecCCCCHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhh-hhccccC Confidence 4 68899999999999999999999999999999999999999999999999999999999999999997 5999999 No 5 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=100.00 E-value=0 Score=1817.24 Aligned_cols=626 Identities=48% Similarity=0.819 Sum_probs=606.5 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) ||+|||||||||||+||.+||++||||||++++|++.|||+++++.+++||+|||+|||+||||||||||++|||||||| T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~rL 80 (629) T protein:vir:86 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQEDAWKAYDAVGELRYYVGWRSSSASRVRL 80 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhccccccchhhhhcCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ||||||||||+|||+|+ |+++.+++|++||+.|+||+|||+|||||+++|||||||+|||+++|+++ +.|+...++.+ T Consensus 81 ~as~idpDtg~ptg~i~-e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~-~~d~~~~~~~e 158 (629) T protein:vir:86 81 IASAIDPDTGLPTGSID-EDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKS-RLDSNGNPVPE 158 (629) T ss_pred EeeeecCCCCCCccccC-CCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCC-ccCCCCcchhh Confidence 99999999999999997 89999999999999999999999999999999999999999999999655 67788888999 Q ss_pred ceeeeHHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhh Q lcl|NC_021303. 161 WYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVM 240 (637) Q Consensus 161 W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~ 240 (637) ||+||++|||+|+++ +.|.||+|++|||++++|+|||||||||||++|||||||+||++||||+||||+|+|++||||| T Consensus 159 W~~vt~~ei~~~~~~-~~i~lP~g~~~e~~~~~d~l~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakSRL~ 237 (629) T protein:vir:86 159 WLALTPEEVRASEKK-TIIELPTGDKHEFRDGLDGMFRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKSRLI 237 (629) T ss_pred heeechHHhhhccCc-eeeEcCCCCcceeeCCCceEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHh Confidence 999999999988766 5599999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_021303. 241 NNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKV 320 (637) Q Consensus 241 gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~i 320 (637) ||||||||+|||||+.++|++.++ ||+++|++.+.|++++||+||||||+|||+||+|+||+||||+++|+|||++| T Consensus 238 gnGvlflP~e~slP~~~~p~~~n~---pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~~i 314 (629) T protein:vir:86 238 GNGVVFVPHEMSLPSMNAPVASNK---PGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIKNV 314 (629) T ss_pred hCceeeeccCcccCccCCCCCCCC---CCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCe Confidence 999999999999999999999888 89999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHH Q lcl|NC_021303. 321 QHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLA 399 (637) Q Consensus 321 kHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~ 399 (637) |||||+|||||++||||||||+|||||||||||||||| +|+||||||||+||||||||+|+|++||+|||+|||||+|+ T Consensus 315 ~hlkf~~ei~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp~Le 394 (629) T protein:vir:86 315 THLKFDNQVTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQVLRTVLM 394 (629) T ss_pred eEEeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhHHHHHHH Confidence 99999999999999999999999999999999999999 59999999999999999999999999999999999999999 Q ss_pred HhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHH Q lcl|NC_021303. 400 REGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMY 479 (637) Q Consensus 400 ~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~ 479 (637) +|||||+|||||||+|+||+||||||||++|||||+||+||||||+||++|+||||||+|+|+|||+|+|+++|+||++| T Consensus 395 ~eGiDp~kYvvW~DaS~Lt~dPd~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~Li~~~ 474 (629) T protein:vir:86 395 REGIDPNAYVVWHDASQLTVDPDKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNLLPTL 474 (629) T ss_pred HhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCcCHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcchhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhccccccccCCCCcCCCCCCCCCC--CCCCCCCCCCCccCCCCCCCcccCCCCcchHHHHHHHHHHHHHHHHhcccc Q lcl|NC_021303. 480 APLLSSQLAGIEFPQPANAIESTREED--DEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRR 557 (637) Q Consensus 480 apLl~~~~~~ie~P~p~~a~~~~~~~~--d~~~~a~~g~EPdted~~~~~~~a~~~~~a~~~aa~~llV~rALelAGkRr 557 (637) +||+ +.+.+++||++.+++||+++++ |+++++++|+||+||+++++..+.+++..+...++|++||+|||||||||| T Consensus 475 a~l~-~~~a~~~~P~~~~~~pp~~e~~~~dE~sga~~~~ep~te~d~~~~~a~~aa~~~~~~a~V~llv~RALelAGkR~ 553 (629) T protein:vir:86 475 AVLI-PELADVEFPTPTVALPPAEEQDGDEEASGASRREEPDTEDDAGTDDSDQASLDSRETAMVEALVFRALELAGKRS 553 (629) T ss_pred hhhh-hhhcccccCccCCCCCccccCCCcccccCCCcCCCCCCCCCCcccccCCCCCCCcHHHHHHHHHHHHHHhcCCcC Confidence 9999 8889999999999999998864 666788999999999999888775555555667899999999999999997 Q ss_pred cCCCchhhhhHhhcCchhhhhhhcCCCCHHHHHHHHhcccccccHHHHHHhCCCHHHHHHHHHHHHHHHHHhhhhccccC Q lcl|NC_021303. 558 FKVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAGWDTALEDEVVASLGLDNEKLRNAVLATVRRQLTQPLIEGEVV 637 (637) Q Consensus 558 ~~~~~~~~~~rlr~ip~h~~h~~~~PV~~~~v~rLi~GWd~~ld~~~~a~lG~Dp~~lr~~v~~~v~~~lt~~vvd~~v~ 637 (637) | ++++++|||+||+|+||+||+||++++|+|||+|||++||++++++||+|++|||++|++||+++||++ |||||| T Consensus 554 r---~r~~~a~~r~v~~he~h~~l~Pv~~~~v~rli~gwd~~ld~~~~~~Lg~d~~~lr~~v~a~v~~~lt~~-v~~ev~ 629 (629) T protein:vir:86 554 R---TRSLPYELRQLSDRELVRRLEPVRREHVADLIRGWDSMLEERAVQALNMNIPGIRAAVKRAVYGELTKT-IDGEVS 629 (629) T ss_pred C---ChhhHHHHhccChhhcceecCCCChHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhcc-cccccC Confidence 4 688999999999999999999999999999999999999999999999999999999999999999985 999999 No 6 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=100.00 E-value=5.7e-321 Score=1776.26 Aligned_cols=624 Identities=51% Similarity=0.857 Sum_probs=597.1 Q ss_pred CCCC-cceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeE Q lcl|NC_021303. 1 MAAT-SLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTT 79 (637) Q Consensus 1 ma~~-~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (637) |||+ ||||||||||++|+++| +||||||+++||+++|||+|+.++|++||+|||++||+||||||||||++||||||| T Consensus 1 ~~a~~~lr~~rrpkg~~~a~~r-~L~aAs~~~~dpg~~~~~~~g~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~r 79 (631) T protein:vir:10 1 MAATQSLRLVRRPKGGRPAPSR-ALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCR 79 (631) T ss_pred CCcccceeeeecCCCCCccchh-hhhhhhccccchhhhhhhhcCCcccchhhHHHHHHHHhhhhHHHHhhhhhhhhceee Confidence 7655 99999999999998888 999999999999999999977789999999999999999999999999999999999 Q ss_pred EEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcc---cccccc Q lcl|NC_021303. 80 LIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDP---VTGLAA 156 (637) Q Consensus 80 L~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~---~~~~~~ 156 (637) ||+||||||||+|||+|+ |++|+|+++++||+.|+||+|||+|||||+++|||||||+|||+|+||++++ +|+... T Consensus 80 L~as~idpDtg~ptg~ie-e~~~~~~~v~~~~~~i~gG~lgQ~~llkrl~~~ltV~GE~wiv~l~~p~~~~~~~pd~~~r 158 (631) T protein:vir:10 80 LVASELDENTGLPTGGIS-EDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVR 158 (631) T ss_pred eEeeeeccCCCCCccccc-cCCchhHHHHHHHHhcCCCcchHHHHHHHHHhheecccceEEEEEeccCcCCCCCcccccc Confidence 999999999999999998 8999999999999999999999999999999999999999999999999865 467778 Q ss_pred ccccceeeeHHHhccCC-CceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_021303. 157 PRARWYAVTREEIKSKA-GETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (637) Q Consensus 157 ~~~~W~~vt~~Ei~~k~-g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 235 (637) ++++||+||++||++++ |.++.|++|+|++|+|++++|+|||||||||||++|||||||+||++||||+||||+|+|++ T Consensus 159 ~~~~W~~vt~~ei~~~~~g~g~~v~lp~g~~h~~~~~~D~l~RiW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaa 238 (631) T protein:vir:10 159 TRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIANAS 238 (631) T ss_pred cccceeeccHHHHhcccCcccceeecCCCCccceecCCceEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHH Confidence 89999999999998544 55688999999999999999999999999999999999999999999999999999999999 Q ss_pred HhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_021303. 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (637) Q Consensus 236 ~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~E 315 (637) |||||||||||||+|||||++++|. ...||+++|++.|.|++++|++||||||+|||+||+|+||+||||+++|+| T Consensus 239 kSRl~gnGvlflP~els~P~~~~~~----~~~~g~~v~~~~g~pa~~~l~~~l~q~a~tai~De~S~aA~vPii~~~p~E 314 (631) T protein:vir:10 239 KSRLIGNGVLFVPHEMSLPAAQGPV----SEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGE 314 (631) T ss_pred HHHHhhCceeEeccccccCCCCCCC----CCcCCccCCccccchhHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechH Confidence 9999999999999999999999885 345899999999999999999999999999999999999999999999999 Q ss_pred HhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeechhHHHHHHHHHhHHH Q lcl|NC_021303. 316 HLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDIL 394 (637) Q Consensus 316 hi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~L 394 (637) ||++||||||+|||||++||||||||+|||||||||||||||| +|+||||||||+||||||||+|+|++||+|||+||| T Consensus 315 ~i~~i~hlkf~~ei~e~aiktR~daI~RlA~glDi~pE~LLGlGsd~NHWsAWqI~dedVrlHI~P~l~lic~AlT~q~L 394 (631) T protein:vir:10 315 QIKDVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQIL 394 (631) T ss_pred HhcCeeEEeecCchhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhHH Confidence 9999999999999999999999999999999999999999999 599999999999999999999999999999999999 Q ss_pred HHHHHHhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCch Q lcl|NC_021303. 395 TPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPE 474 (637) Q Consensus 395 r~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~ 474 (637) ||+|++|||||+|||||||+|+||+||||||||++|||||+||+||||||+||++|+||||+|.|+|++||+++|.++|. T Consensus 395 rp~Le~eGvDp~kYvvW~DaS~Lt~dPdr~deA~qa~drGAIt~eAlrk~lGf~eDd~yd~~t~e~~~~~a~~av~~dpa 474 (631) T protein:vir:10 395 RVTLAREGIDPSKYVVWYDPSQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPT 474 (631) T ss_pred HHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCcCHHHHHHHhcCchhcccCcCchHHHHHHHHHHhhcccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCC-CCCCCCCccCCCCCCCcccCCCCcchHHHHHHHHHHHHHHHHh Q lcl|NC_021303. 475 LIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDS-GARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLA 553 (637) Q Consensus 475 Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~-~a~~g~EPdted~~~~~~~a~~~~~a~~~aa~~llV~rALelA 553 (637) ||++++||+.+.+++++||+|.++.++|+++.++++ +++.++||||+|+++...++++. ..+|++||+|||||| T Consensus 475 Lip~lApl~~~~~~~v~~P~~~a~~~~g~ed~~~~~~~~~g~~epdt~d~~p~~~~a~~~-----~~iv~llv~RALelA 549 (631) T protein:vir:10 475 LIPMLAPLIAGVLKQIEFPQQQAIDSGGNEDTSDADDLDDGEQEPDTEDDDDGTQKAGLE-----TGIVDLMVDRALELV 549 (631) T ss_pred cchhhHHHHHHHhhhccCCCCCCCCCCCCCccccccccccCCCCCCCCCCCCccccccch-----HHHHHHHHHHHHHhh Confidence 999999999999999999999999999999987776 56667999999987776655444 257899999999999 Q ss_pred cccccCCCchhhhhHhhcCchhhhhhhcCCCCHHHHHHHHhcccccccHHHHHHhCCCHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_021303. 554 GKRRFKVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAGWDTALEDEVVASLGLDNEKLRNAVLATVRRQLTQPLIE 633 (637) Q Consensus 554 GkRr~~~~~~~~~~rlr~ip~h~~h~~~~PV~~~~v~rLi~GWd~~ld~~~~a~lG~Dp~~lr~~v~~~v~~~lt~~vvd 633 (637) |||++++ ++++++||++++.|+||++|+||++++|+|||+|||+++|++++++||+|++|||++|++||+++||++||| T Consensus 550 GkRl~~r-~r~~~ar~~~v~~he~H~~~~Pv~~~ev~rli~gwd~~ld~~~~~~Lg~d~~~lr~~v~a~v~~~lt~~~~~ 628 (631) T protein:vir:10 550 GKRRRGR-DRETLARLSGVRERDYHRYMDPVPESEVDRLMSGWDSALDDKILLRLGLDPGTIRSAVRRKVMAELTRPVID 628 (631) T ss_pred cchhcCC-cccchhHHhcccccccccccCCCCHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHH Confidence 9998765 699999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred ccc Q lcl|NC_021303. 634 GEV 636 (637) Q Consensus 634 ~~v 636 (637) --- T Consensus 629 ~~~ 631 (631) T protein:vir:10 629 VVA 631 (631) T ss_pred hcC Confidence 333 No 7 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=100.00 E-value=2.8e-286 Score=1586.05 Aligned_cols=592 Identities=27% Similarity=0.457 Sum_probs=551.8 Q ss_pred CCCCcceEEecCCCCCc------ccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcc Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAP------AARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANS 74 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p------~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s 74 (637) || .-||||+|| .+|||+||||||++..+.+.+||. +++.+++||+|||+|||+||||||||||++|| T Consensus 1 ~~------~~rPk~~p~~p~~~~~arrr~LtaAsa~l~~~~~~~~kt-~~~~~~~WQ~eAW~~~d~vpELry~vgW~~~a 73 (646) T protein:vir:10 1 MA------LLKPKSAPPEPFGAEVARRIALAGATAQVDLGASSSWKT-WKFGNKDWQTEGWRLYDIIPEHHFLAGRIGDS 73 (646) T ss_pred Cc------ccCCCCCCCCcccccccchhhhhhccccccCCCcceeec-CCCcchhhhHHHHHHHhhhhhHhhHhhhhhhh Confidence 55 358999999 478999999999999999999995 55668899999999999999999999999999 Q ss_pred eeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccc Q lcl|NC_021303. 75 CSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGL 154 (637) Q Consensus 75 ~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~ 154 (637) |||||||||||| |||.|||++.. +++++||+.|+||++||+|||||+++|||||||+||| +.++.++. T Consensus 74 ~SR~rL~aseid-dtG~~tg~v~~------~~v~~iv~~~~Gg~~gQ~qlLkr~~~~ltV~GE~wiv-----~~~~~~~~ 141 (646) T protein:vir:10 74 VAQARLYVTEVD-DTGEETGEVQD------ERIKRLAAVPLGTGSQRDDNLRLAGLDLAVGGECWIV-----GEGAATSP 141 (646) T ss_pred hceeeeeeeeec-CCCCCcCccch------HHHHHHhhhhccchhhHHHHHHHHHhheecccceEEe-----eccccCCC Confidence 999999999999 99999999974 3899999999999999999999999999999999999 34556666 Q ss_pred ccccccceeeeHHHhccCCCceeEEecCC---CCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_021303. 155 AAPRARWYAVTREEIKSKAGETAEISLPD---GKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKI 231 (637) Q Consensus 155 ~~~~~~W~~vt~~Ei~~k~g~~~~i~lPd---G~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 231 (637) ..++++||+||++|| +++|+++.|++|+ |++|+|++++|+|||||||||||++|||||||+||++||||+||||+| T Consensus 142 ~~~~~~W~vvt~~Ev-~~tg~~~~i~~p~~~~g~~~v~~~~~d~lvRiW~P~Prr~~epDSpvra~l~~l~Ei~~lt~~I 220 (646) T protein:vir:10 142 EAAEGSWFVVTGSAI-SRTGDEIAVRRPQQRGGSKLVLVDGQDILIRCWRPHPNDTDQADSFTRSAIVPLREIELLTKRE 220 (646) T ss_pred CCCccceeeecHHHh-ccCCCeeeeecCccCCCCCcceecCCceEEEEecCCcccccCCcchhHHHHHHHHHHHHhhhHh Confidence 777999999999999 5678999999999 999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_021303. 232 KNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVAS 311 (637) Q Consensus 232 ~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~ 311 (637) +|++|||||||||||||+|||||+++.+. +.+++||+||||||+|||+||+|+||+||||++ T Consensus 221 ~aaakSRL~GnGvLfvP~e~s~p~~~~~~------------------a~~~~l~~~l~qaa~tAi~De~S~aA~vPiia~ 282 (646) T protein:vir:10 221 FAELDSRLTGAGIMFLPEGVDFPRGEEDP------------------AGLAGFMAYLQRAAAASMADQSRASAMVPIMAT 282 (646) T ss_pred HHHHHHHHhcCceeeeccccccCCCCCCC------------------cchhHHHHHHHHHHHhhhcCCCCccceeeeEEe Confidence 99999999999999999999999988542 247799999999999999999999999999999 Q ss_pred echHH---hcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHH Q lcl|NC_021303. 312 VAAEH---LEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQA 388 (637) Q Consensus 312 vP~Eh---i~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~A 388 (637) +|+|. +++||||||+||||+++||||||||+||||||||||||||||+|+||||||||+||||| ||+|+|++||+| T Consensus 283 ~P~E~i~~~~~ik~l~f~~eite~aiktR~daI~RlA~glDIppE~LLGlgd~NHWtAWqI~de~vr-HI~P~l~~ic~A 361 (646) T protein:vir:10 283 IPNEMMEHLDKIKPLTFWSELSAEITPMKDKAIARLASSAEIPGEVLTGIGDANHWTAWLISDEGIR-WIRGYLGLIADA 361 (646) T ss_pred eChHHHhhhhcceeeccCchhhHHHhhhHHHHHHHHHhccCCchhheeeccccceeeeeeeccccch-hhhhHHHHHHHH Confidence 99995 46888888999999999999999999999999999999999999999999999999999 999999999999 Q ss_pred HHhHHHHHHHHHhCC-ChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHH Q lcl|NC_021303. 389 IYNDILTPLLAREGI-DPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAAD 467 (637) Q Consensus 389 it~~~Lr~~L~~eGi-Dp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d 467 (637) ||+|||||+|++||| ||+|||||||+|+||+||||||||+++||||+||+||||||+||+++++| |++|+|+||+++ T Consensus 362 lT~~~Lrp~Le~eGi~dp~kyvvW~DaS~Lt~~pd~~deA~qa~drGAIt~eAlrk~~Gf~~dd~p--t~~E~~~~~~~~ 439 (646) T protein:vir:10 362 LTRGFLRRALESMGVTNPERYAFAFDTSTLASKPNRLDEAIQLHERNLIKDEEVVKAGAFSVDQMP--TVQERAVQILLG 439 (646) T ss_pred HHhhHHHHHHHHcCCCChhHeEEeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhcccccccC--ChHHHHHHHHHH Confidence 999999999999999 99999999999999999999999999999999999999999999999998 899999999999 Q ss_pred HhcCCchhH---HHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCccc--CCCC--------- Q lcl|NC_021303. 468 VVTKNPELI---AMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEA--ASLN--------- 533 (637) Q Consensus 468 ~v~~~P~Li---~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~--a~~~--------- 533 (637) +|++||+|| ..++++..|.++.++||+++.+.+.|+.++|+++++++|+||+|++++....+ +++. T Consensus 440 ~v~~~P~Lil~P~~qa~~~~P~~~~~~lpp~~~~~~dg~~~~~e~~g~~~~~E~~~~pda~~~~a~~~~~~~r~~~~~~~ 519 (646) T protein:vir:10 440 LVKTQPDLILDPAIQAALGLPAVQSVGLPPTAAQRTDGDLDDDESEGAPNGGEAPDQPDADEARAITAALDRRIALAARP 519 (646) T ss_pred HhcCCccccccchhhccccCCCcCccccCCcccccccCCCCChhhcCCCCCCccCCCCCCCccccccccccccchhhhhh Confidence 999999999 66788888899999999999999999999999999999999999986544433 3333 Q ss_pred ------cchHHHHHHHHHHHHHHHHhcccccCCCchhhhhHhhcCchhhhhhhcCCCCHHHHHHHHhc-ccccccHHHHH Q lcl|NC_021303. 534 ------DRAAYLVAERLLVNRALDLAGKRRFKVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAG-WDTALEDEVVA 606 (637) Q Consensus 534 ------~~a~~~aa~~llV~rALelAGkRr~~~~~~~~~~rlr~ip~h~~h~~~~PV~~~~v~rLi~G-Wd~~ld~~~~a 606 (637) +.++++++||+||+|||||||||+| +++++++|||+||+|+||++|+||++++++||+.| ||+++ ++++ T Consensus 520 ~~~~~~p~a~~~aav~l~v~RAL~lAG~Rlr--t~~~~~a~~r~vp~he~h~~l~Pv~~~~~~rl~~G~wd~~~--~v~~ 595 (646) T protein:vir:10 520 VLALPSPEAVFNASAKLMILRALELAGGRLT--TPAERRGRWSDVPRHELHHHVGPITPDKARRVTEGAWNHVA--VAAA 595 (646) T ss_pred hhccccchhHHHHHHHHHHHHHHHhcccccc--CchhhhHHhhcCChhhceeecCCCChhhHHHHHhcccccHH--HHHH Confidence 2377899999999999999999975 58899999999999999999999999999999999 99997 5999 Q ss_pred HhCCCHHHHHHHHHHHHHHHHHhhhhccc-cC Q lcl|NC_021303. 607 SLGLDNEKLRNAVLATVRRQLTQPLIEGE-VV 637 (637) Q Consensus 607 ~lG~Dp~~lr~~v~~~v~~~lt~~vvd~~-v~ 637 (637) +||+|++|||++|++||+++||+||---. |+ T Consensus 596 ~lg~D~~~lr~~v~~~Vr~~lt~g~~~~~~~~ 627 (646) T protein:vir:10 596 DLGVDAGELERVLSTYVLELLTRGLRHHDDML 627 (646) T ss_pred hcCCChHHHHHHHHHHHHHHHhcCCCccccce Confidence 99999999999999999999999963222 22 No 8 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=99.49 E-value=2.8e-14 Score=94.89 Aligned_cols=404 Identities=12% Similarity=0.160 Sum_probs=224.5 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhH--HHHHhhhhhhHhhHhhhhhcceeee Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSE--AWDFSESIGELSYYISWRANSCSRT 78 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~e--AW~~yd~VgELryyvgWr~~s~Sr~ 78 (637) |==. .+.+|.+..+. +.|... ....+++ ...|... .++.+-..+-+.-.+.-+++.+|.+ T Consensus 1 ~~f~--~~f~r~~~~~~--------------~~~~~~-~~~~~~~-~~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~ 62 (413) T protein:vir:48 1 MFFS--GLFQRKSDAPV--------------TTPAEL-AEAIGLS-YDTYTGKRISSQRAMRLTAVYSCVRVLAESVGML 62 (413) T ss_pred Cccc--hhhccCccCCc--------------cchHHH-HHhhhcC-cccccCceechhhhhccHHHHHHHHHHHHhhhhC Confidence 3222 33444332221 111111 1111111 1222221 1344445666777888999999999 Q ss_pred EEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccc Q lcl|NC_021303. 79 TLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPR 158 (637) Q Consensus 79 rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~ 158 (637) .+..-+.+.+ |. .++. .+.+..+.+.=-..-+...++++.++.+|-+-|++|+.+. |..|.+ . T Consensus 63 p~~~~~~~~~-~~----~~~~----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~-~~~g~~-------~ 125 (413) T protein:vir:48 63 PCSLYKISGT-LK----TRVV----DERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKV-KALGEV-------V 125 (413) T ss_pred ceEEEEecCC-cc----eeec----ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEE-eCCCcE-------E Confidence 9988777744 21 2222 2456666655555668888999999999999999998865 434431 2 Q ss_pred ccceeeeHHHhc---cCCCcee-EEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHH Q lcl|NC_021303. 159 ARWYAVTREEIK---SKAGETA-EISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNA 234 (637) Q Consensus 159 ~~W~~vt~~Ei~---~k~g~~~-~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na 234 (637) +-| .+..+.+. ...+..+ .+..++|..++|... -||++=.+.+ .-..--||+..+...+.-..-..+...+. T Consensus 126 ~L~-~l~~~~v~~~~~~~~~~~y~~~~~~g~~~~~~~~--evih~~~~~~-d~~~G~s~i~~~~~~i~~~~~~~~~~~~~ 201 (413) T protein:vir:48 126 ELL-PIDPGCVEPKLNSQWQPVYQVTFPDGSVDVLTQD--EIWHVRTLTL-DGLVGLNPIAYAREAISLAAATEEHGARL 201 (413) T ss_pred EEE-EEcCceEEEEEcCCceEEEEEEecCceEEEEccc--cEEEecCcCC-CCcccccHHHHHHHHHHHHHHHHHHHHHH Confidence 222 23223222 1222222 357788888877653 3333322222 22445688777777665555455555555 Q ss_pred HHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeech Q lcl|NC_021303. 235 AKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAA 314 (637) Q Consensus 235 ~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~ 314 (637) .+.-..-.|||-+|+.++ .-..+.+.+.|.+.-. ...+.+. |+|+ ++ T Consensus 202 ~~ng~~p~gil~~~~~~~-------------------------~e~~~~~~~~~~~~~~-g~~n~g~-----~~vl--~~ 248 (413) T protein:vir:48 202 FGNGAVTSGVLRTEQKLT-------------------------PDAYERLKKDFEERHT-GLGNAHR-----PMIL--EM 248 (413) T ss_pred HhccCCcceEEEeCCCCC-------------------------HHHHHHHHHHHHHHhc-CccccCc-----ceec--CC Confidence 554455567777765322 0134455555543322 1122222 3343 33 Q ss_pred HHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHH Q lcl|NC_021303. 315 EHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDIL 394 (637) Q Consensus 315 Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~L 394 (637) . -+++-|.+...-. --+++|+..+..+|..+-|||..|=+.+++|+-+..+....-++.-|.|.++.||++|++.+| T Consensus 249 g--~~~~~l~~~~~d~-q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~P~~~~ie~~l~~~L~ 325 (413) T protein:vir:48 249 G--LDWKSMALNAEDS-QFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGLV 325 (413) T ss_pred C--ceEEeccCChhHH-HHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 3 3455555432222 237899999999999999988876444567877777777777888899999999999999888 Q ss_pred HHHHHHhCCChHHeEEeecCcccccCCCCCHHHH---HHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcC Q lcl|NC_021303. 395 TPLLAREGIDPTKYILWYDASGLTSDPDLSDEAV---EAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTK 471 (637) Q Consensus 395 r~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~---~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~ 471 (637) .+--. ..|.++||.+.|. .+|..+.|. .++..|.+|-.-.|+++|++.-.|=| . T Consensus 326 ~~~~~------~~~~~~fd~~~l~-~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~ggD-------------~--- 382 (413) T protein:vir:48 326 RESKQ------GKFYAKFNAGALL-RGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGD-------------V--- 382 (413) T ss_pred Ccccc------CCeEEEEechhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-------------e--- Confidence 65432 3688999999984 345444433 37788999999999999997433211 0 Q ss_pred CchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccCC Q lcl|NC_021303. 472 NPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEAAS 531 (637) Q Consensus 472 ~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~a~ 531 (637) -+.| ... +++...++....+ +++.+.+++ ++ T Consensus 383 ------~~~~--------~n~---~~~~~~~~~~~~~------~~~~~~~~~------~~ 413 (413) T protein:vir:48 383 ------YLTP--------MNM---TTSPSAGDDNGKK------KESGDADKT------AS 413 (413) T ss_pred ------eecc--------ccc---cccccccccCCCC------CCCCCcccc------CC Confidence 0000 000 0000111111100 011111111 11 No 9 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=99.44 E-value=7.1e-14 Score=92.68 Aligned_cols=430 Identities=13% Similarity=0.119 Sum_probs=215.9 Q ss_pred CCCCcceEEecCCCCCcccccchheehh----ccccchhhhhhhhcccccccchhhHH--HHHhhhhhhHhhHhhhhhcc Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAAS----QLITDPQKQMKTSLMGTARNEWQSEA--WDFSESIGELSYYISWRANS 74 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs----~~~~~p~~~~k~~~~g~~r~~WQ~eA--W~~yd~VgELryyvgWr~~s 74 (637) |.==+ |+.+|.+... +.... .++ ||.-.+ .+|. .|..+- .+-.=.++-+.-.|.-++++ T Consensus 1 Mg~~~-~l~~r~~~~~-------~~~~~~~~~~~~-~~~~~~---~~~~---~~~g~~V~~~~al~~~~V~~~v~~Ia~~ 65 (457) T protein:vir:13 1 MGFWS-ALFGRGHSPA-------LDGIEARAWEPY-DPSIYN---LGAV---AASGETVTPHDALQVSAVFASVRLLSET 65 (457) T ss_pred Cchhh-hhhccccccc-------cccccccccccc-chHHHh---hccc---ccCCceechHHhhccHHHHHHHHHHHHh Confidence 65432 3444433221 01110 111 222111 0111 110000 01111234455567788999 Q ss_pred eeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccc Q lcl|NC_021303. 75 CSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGL 154 (637) Q Consensus 75 ~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~ 154 (637) +|.+-|..-+-+.+..++ ++ .....++.+.=..+ +...++++.++.+|-+-|+.|+.| .+.+|++ T Consensus 66 iA~lp~~~~~~~~~~~~~-----~~----~~~l~~~ln~~~n~-~t~~~f~~~~~~~lll~Gna~~~i-~~~~g~~---- 130 (457) T protein:vir:13 66 IATLPLSTYSKRGGSRKE-----IV----TPEWLDYPNAEPGG-MGRIDILSQTVLSLLLQGNAFLAV-RWQGPNI---- 130 (457) T ss_pred hccCceEEEEecCCcccc-----cc----cchHHHhccccCCC-CCHHHHHHHHHHHHhhcCCeEEEE-EecCCcE---- Confidence 999988776654332222 22 23455554444443 677899999999999999999887 4544432 Q ss_pred ccccccceeeeHHHhcc----CCC--ce--eEEecC-CCCcccccC-CCceEEEEecCCcccccCCccchhhhhHHHHHH Q lcl|NC_021303. 155 AAPRARWYAVTREEIKS----KAG--ET--AEISLP-DGKTHEFNR-DLDSLVRIWNPRPRKASQATSPVRACLETLREI 224 (637) Q Consensus 155 ~~~~~~W~~vt~~Ei~~----k~g--~~--~~i~lP-dG~~he~~~-~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI 224 (637) .+-| .|..+.+.. .++ .. ...... +|..+.... ..+-||++=.+++.-...--||+..+...+.=. T Consensus 131 ---~~l~-~l~p~~v~v~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~ 206 (457) T protein:vir:13 131 ---VGLD-VLDPTKIHVHMVMVDGLRRKVFEAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLA 206 (457) T ss_pred ---EEEE-EEccCceEEEEecCCCccceeEEEEEEecCCceeeEEeeCccceEEecCCCCCCccccccHHHHHHHHHHHH Confidence 1122 222222210 011 01 111221 233322111 123456665555554456667777666665555 Q ss_pred HhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCcccccc Q lcl|NC_021303. 225 ERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAA 304 (637) Q Consensus 225 ~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA 304 (637) .-..+...+..+.-.+-.|||.+|+.++ ....+.+.+.|.+. +...+.+-. T Consensus 207 ~~~~~~~~~~f~ng~~p~gil~~~~~ls-------------------------~e~~~~~~~~~~~~----~~g~~nag~ 257 (457) T protein:vir:13 207 LAAQKYGSKFFANGAMPGAVVEVPGTMS-------------------------EEGLARAREAWRAA----NSGVDNAHR 257 (457) T ss_pred HHHHHHHHHHHhcCCCcceEEEcCCCCC-------------------------HHHHHHHHHHHHHH----hcCccccCc Confidence 5555555555555555567887776332 11345565555432 222222211 Q ss_pred ccceeEeechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeee--EEeccCceeEeechh Q lcl|NC_021303. 305 YIPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSA--WAIGDEDVQLHIKPV 381 (637) Q Consensus 305 ~vPiva~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsA--W~I~dedVrlHI~P~ 381 (637) ++|+ ++. -+++-|.+... +.--+++|+-.+..+|..+-|||. |||. .+++.|+. .|....=++..|.|. T Consensus 258 --~~vl--~~g--~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~~~sn~eq~~~~f~~~tl~P~ 329 (457) T protein:vir:13 258 --VALL--TEG--AKFSKVAMSPD-EAQFLQTRQFQVPEIARIFGVPPH-LISDATNSTSWGSGLAEQNIAFTMFSLRPW 329 (457) T ss_pred --ceec--CCC--ceEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCHH-HcCCCCCcccccchHHHHHHHHHHHHHHHH Confidence 2232 322 35555554432 222478999999999999999998 5587 46777754 555555667789999 Q ss_pred HHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCch Q lcl|NC_021303. 382 MDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTL 458 (637) Q Consensus 382 me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~ 458 (637) +..|+++|+..+|... + ...|.++||.+.| ...|..+.+ ..++..|.+|-.-.|..+|+..-.+- ... T Consensus 330 ~~~ie~~ln~~L~~~~----~--~~~~~i~fd~~~l-~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g--~~d 400 (457) T protein:vir:13 330 LERIEAGFNRLLFAET----A--DRFRFVKFNLDEI-KRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDG--LGE 400 (457) T ss_pred HHHHHHHHHHhhcCcc----c--cCceeEEeechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC--ccc Confidence 9999999998887542 1 1457889999998 445544433 34778899999999999999643221 001 Q ss_pred HHHHHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCC-----CCCCCCCCccCCCCC Q lcl|NC_021303. 459 DGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDED-----SGARQQREPQTEDER 524 (637) Q Consensus 459 eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~-----~~a~~g~EPdted~~ 524 (637) +.+...-...+.+.|+ .+. -+.|.+..++.+++..+. +...++.|.|.||+. T Consensus 401 ~~~~~~n~~~~~~~~~------------~~~--~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 401 KYRVPLNLGEVGEEPE------------PEP--APAPPAIEPPAEEPDEEPEPEGKPDDEGATEEDDEDDA 457 (457) T ss_pred ceeecccccccccccc------------ccc--cCCCCCCCCCccccCCCCCCCCCCccccCCCCcccccC Confidence 1111111111111111 011 111222222222211111 111122334444432 No 10 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=99.43 E-value=1.6e-13 Score=90.78 Aligned_cols=474 Identities=15% Similarity=0.115 Sum_probs=227.6 Q ss_pred hheehhccccchhhhhhhhccc-ccccchhhH-------HH--HHhhhhhhHhhHhhhhhcceeeeEEEEeeeccccCCC Q lcl|NC_021303. 23 SLTAASQLITDPQKQMKTSLMG-TARNEWQSE-------AW--DFSESIGELSYYISWRANSCSRTTLIPSAIDPDTGLP 92 (637) Q Consensus 23 ~ltAAs~~~~~p~~~~k~~~~g-~~r~~WQ~e-------AW--~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~DtG~P 92 (637) -|-|--|-++.|...-+..... +-...|+.. +| ..|-..+-+.-.|.-+++++|.+.|..=+-+.+. T Consensus 1 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~~~~~--- 77 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDT--- 77 (518) T ss_pred CcccCceeeccchhhhhhhhhhhcccccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEEEcCCc--- Confidence 4445556666665321111100 111112211 22 2344556777778889999999988877777552 Q ss_pred CCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeeeHHHhcc- Q lcl|NC_021303. 93 TGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIKS- 171 (637) Q Consensus 93 tG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~~- 171 (637) ..+ ++ ......+... -.--+-..++++.++.+|.+-|+.|+.+.-...|. +. .++.|..+.+.. T Consensus 78 --~~~-~~---~~~~~~Ll~~-PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~-------~~-~L~~l~p~~Vtv~ 142 (518) T protein:vir:78 78 --ETE-EH---DTGYAKLLAD-PCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGT-------PE-KLMPMHPSRVAIK 142 (518) T ss_pred --ccc-cc---chHHHHHHhC-CCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCc-------EE-EEEEECCCceEEE Confidence 122 12 1334444443 33446777899999999999999999976544443 11 344444444431 Q ss_pred --CCCcee--EEecCCCCcc---cccCCCceE-EEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCc Q lcl|NC_021303. 172 --KAGETA--EISLPDGKTH---EFNRDLDSL-VRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNG 243 (637) Q Consensus 172 --k~g~~~--~i~lPdG~~h---e~~~~~d~l-~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnG 243 (637) ..++.. .+...+|... +|.. .+++ ||..+|+. -..--||+.++...+.-..-+.+...+..+.-..-.| T Consensus 143 ~~~~~~~~~y~~~~~~~~~~~~~~~~~-~eIiHir~~~~dg--~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~g 219 (518) T protein:vir:78 143 RNSRTGRYEYYFQAGAGVGTQLVSFAD-DEVVPIRFFNPDG--LERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNL 219 (518) T ss_pred EcCCCCEEEEEEEecCCccceeEEecC-CcEEEecCCCCCc--ccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccE Confidence 112222 2333333322 2322 2332 44444432 2233566665555444433333333333332233335 Q ss_pred eeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccccee Q lcl|NC_021303. 244 VLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHI 323 (637) Q Consensus 244 vlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHl 323 (637) ||-+|+.++ ....+.+.+.|. ..+...+. +-=++|+ ++. -+++-| T Consensus 220 vl~~~~~ls-------------------------~e~~~~~k~~~~----~~~~G~~n--ag~~~vL--~~G--~~~~~l 264 (518) T protein:vir:78 220 VLRHEKRLS-------------------------PEAQQRLREQFD----RAHAGSSN--TGKTMVV--EEG--MEPIPL 264 (518) T ss_pred EEecCCCCC-------------------------HHHHHHHHHHHH----HHhcCccc--CCceeEc--CCC--ceEEec Confidence 665554221 113344544443 23322211 1123333 222 345555 Q ss_pred ecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhC Q lcl|NC_021303. 324 KFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREG 402 (637) Q Consensus 324 kf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eG 402 (637) .+... +.--+++|+-.+..+|..+-|||..| |+ +++|+-++.+....-++..|.|.+..|+++|++.++.. ++ T Consensus 265 ~~~~~-d~q~le~r~~~~~eIa~afgVPp~~l-g~~~~st~sn~e~~~~~f~~~tL~P~~~~ie~eln~~L~~~-~~--- 338 (518) T protein:vir:78 265 QLTAV-EMQFIEARQLNREEVCGVYDIAPPIV-HILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQY-WV--- 338 (518) T ss_pred cCChh-HHHHHHHHHHHHHHHHHHhCCCHHHh-ccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-cc--- Confidence 55332 23347899999999999999999865 87 46888777777777778889999999999999876532 22 Q ss_pred CChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHH Q lcl|NC_021303. 403 IDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMY 479 (637) Q Consensus 403 iDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~ 479 (637) ..|-+.||.+.| ..+|..+.+ ..++..|.+|-.-.|+.+|+.--.+ +...+.+.+ + .+ T Consensus 339 ---~~~~~~fd~~~L-lr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~--~~gD~~~v~-------~------n~ 399 (518) T protein:vir:78 339 ---RKNRMKFDIDDV-IQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDD--PKADELYAN-------S------AL 399 (518) T ss_pred ---CcceEEeechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--CCCceeeec-------c------cc Confidence 357789999998 445554443 3567789999999999999874321 011111111 1 11 Q ss_pred Hhhhc---cccccccCCCCc-CCCCCCCCCCCCCCCCCCCCCccCCCCCCC---------cccCCCCcchH-HHHHH--- Q lcl|NC_021303. 480 APLLS---SQLAGIEFPQPA-NAIESTREEDDEDSGARQQREPQTEDERST---------EEAASLNDRAA-YLVAE--- 542 (637) Q Consensus 480 apLl~---~~~~~ie~P~p~-~a~~~~~~~~d~~~~a~~g~EPdted~~~~---------~~~a~~~~~a~-~~aa~--- 542 (637) .||-. ...++-+.|.++ ++..+..+.++...++..+-++++.+.... ...++....++ -+-+| T Consensus 400 ~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (518) T protein:vir:78 400 QPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGA 479 (518) T ss_pred eecccccccccCCCCCCCCCCCCcccccccccCccccCCCCCcccccccccccccchhcccCCCCcccccchHHHHHHHH Confidence 22211 122233333332 111111111111122222333443311000 11111111000 01122 Q ss_pred ----HHHHHHHHHHhcccccCCCchhhhhHhhcCchhhhhhhcCCCCHHHHHHHHhcccc Q lcl|NC_021303. 543 ----RLLVNRALDLAGKRRFKVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAGWDT 598 (637) Q Consensus 543 ----~llV~rALelAGkRr~~~~~~~~~~rlr~ip~h~~h~~~~PV~~~~v~rLi~GWd~ 598 (637) +-+-.-||.||-|- +.++.+-|--+--.. ..|- |. T Consensus 480 ~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~------------~~~~----~~ 518 (518) T protein:vir:78 480 MGRGKDIKGFALQLAEKY-----PDDLEDILLAVQLAL------------AERK----DN 518 (518) T ss_pred hhcCCcchhhhhhhhhhc-----chhHHHHHHHHHHhh------------hhcc----CC Confidence 22223345555442 221111110000000 0000 00 No 11 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=99.42 E-value=1e-13 Score=91.81 Aligned_cols=437 Identities=12% Similarity=0.118 Sum_probs=216.3 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHH--HHHhhhhhhHhhHhhhhhcceeee Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEA--WDFSESIGELSYYISWRANSCSRT 78 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eA--W~~yd~VgELryyvgWr~~s~Sr~ 78 (637) |.==+ ++..|++... +..+.....+|... .-..+|+ ..|..+. .+.+=.++-+.-.|.-++++||.+ T Consensus 1 Mg~~~-~l~~~~~~~~-------~~~~~~~~~~~~~~-~~~~~~~--~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~l 69 (457) T protein:vir:62 1 MGFWS-ALFGRGHSPA-------LDAAEGRAWEPYDP-SIYNLGA--TASSGERVTPHDALQVSAVFASVRLLSETIATL 69 (457) T ss_pred Cchhh-hhhccccccc-------cccccccccccchh-hhhhccc--cccCCceechHHhhccHHHHHHHHHHHHhHhhC Confidence 65332 1222222211 11111111111110 0111121 1111100 111112345556677789999999 Q ss_pred EEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccc Q lcl|NC_021303. 79 TLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPR 158 (637) Q Consensus 79 rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~ 158 (637) .|..=+-+.++.+. . + ......+.+.--. .+-..++++.++.+|-+-|++|+.| .+.+|+ + . T Consensus 70 p~~~~~~~~~~~~~-----~-~---~~~~~~ll~~pn~-~~t~~~f~~~~~~~l~l~Gna~~~i-~~~~g~-~------~ 131 (457) T protein:vir:62 70 PLSTYSKRGGTRKE-----I-D---TPEWLDFPNAEPG-GMGRIDILSQTVLSLLLQGNAFLAV-RWAGPN-I------A 131 (457) T ss_pred ceEEEEecCCcccc-----c-c---chHHHHhccccCC-CCCHHHHHHHHHHHHhhcCCeEEEE-EeCCCc-E------E Confidence 88776555332121 1 1 1233444333322 3678899999999999999999887 444443 1 1 Q ss_pred ccceeeeHHHhcc----CCC--cee--EEec-CCCCccc---ccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHh Q lcl|NC_021303. 159 ARWYAVTREEIKS----KAG--ETA--EISL-PDGKTHE---FNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIER 226 (637) Q Consensus 159 ~~W~~vt~~Ei~~----k~g--~~~--~i~l-PdG~~he---~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~r 226 (637) +-| .|....+.. ..+ ... .... -+|..+. |.. +-||++=.+++.....--||+.++...+.-..- T Consensus 132 ~l~-~l~p~~v~v~~~~~~~~~~~~~~~y~~~~~g~~~~~~~~~~--~eiih~r~~~~~~~~~G~sp~~~~~~~i~~~~~ 208 (457) T protein:vir:62 132 GLD-VLDPTKIHVHMVMVDGLRRKVFEAYDIDADGNEVLLGWFTP--RDVLHIPGMMLPGDFVGCSPISYARESIGLALA 208 (457) T ss_pred EEE-EEcCcceEEEEeccCCccceeEEEEEEccCCceeEEEeeCc--cceEEecCCCCCCceecccHHHHHHHHHHHHHH Confidence 122 222222210 010 111 1111 2333322 222 234555445554445667787777766655555 Q ss_pred hhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCcccccccc Q lcl|NC_021303. 227 TTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYI 306 (637) Q Consensus 227 ttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~v 306 (637) ..+...+..+.-.+-.|||-+|+.++ ..+.+.+.+.|.+. +...+.+-. T Consensus 209 ~~~~~~~~f~ng~~p~gil~~~~~ls-------------------------~e~~~~~~~~~~~~----~~G~~nag~-- 257 (457) T protein:vir:62 209 AQKYGAHFFRNGAMPGAVVEVPGTMS-------------------------EEGLARAREAWRAA----NSGVDNAHR-- 257 (457) T ss_pred HHHHHHHHHhccCCcceEEEcCCCCC-------------------------HHHHHHHHHHHHHH----hcCccccCc-- Confidence 55555555555555567887776332 01344455544332 222222111 Q ss_pred ceeEeechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeee--EEeccCceeEeechhHH Q lcl|NC_021303. 307 PLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSA--WAIGDEDVQLHIKPVMD 383 (637) Q Consensus 307 Piva~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsA--W~I~dedVrlHI~P~me 383 (637) ++|+ ++. -+++-|.+... +.--+++|+-.+..+|...-|||. |||. ++++.|++ -|..-.=++-.|.|.+. T Consensus 258 ~~vl--~~g--~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~~~sn~eq~~~~f~~~~l~P~~~ 331 (457) T protein:vir:62 258 VALL--TEG--AKFSKVAMSPD-EAQFLQTRQFQVPEIARIFGVPPH-LISDATNSTSWGSGLAEQNIAFTMFSLRPWLE 331 (457) T ss_pred ceec--CCC--ceEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCHH-HcCCCCCcccccchHHHHHHHHHHHHHHHHHH Confidence 2222 332 35555554332 222388999999999999999997 5687 57787764 33333445667999999 Q ss_pred HHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHH Q lcl|NC_021303. 384 LICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDG 460 (637) Q Consensus 384 ~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg 460 (637) .|+++|+..+|... + ...|.++||.+.| ...|..+.+ ..++..|.+|-.-.|+.+|++.-.+-. ..+. T Consensus 332 ~ie~~ln~~L~~~~----~--~~~~~i~fd~~~l-~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~--~D~~ 402 (457) T protein:vir:62 332 RIEAGFNRLLFAET----A--DRFRFVKFNLDEI-KRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGL--GEKY 402 (457) T ss_pred HHHHHHHhhhcCcc----c--cCceEEEeechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--ccee Confidence 99999998887542 2 2568899999998 344443333 347788999999999999997543310 0011 Q ss_pred HHHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021303. 461 CREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEA 529 (637) Q Consensus 461 ~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~ 529 (637) +... .+.++-+ .-+.-..|.+.+..++.+++.++.+....+.+||.++.....++ T Consensus 403 ~~~~-------------n~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 457 (457) T protein:vir:62 403 RVPL-------------NLGEIGE-EPEPEPAPAPPAIDPPAEEPADDEEPDNAEGDPDEGETEDDDDA 457 (457) T ss_pred eecc-------------ccccccc-cccccccCCCccCCCCccCCCCCCCCCCCCCCCccccccccccC Confidence 1100 0111110 00111112222222333333222233334455555543333222 No 12 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=99.41 E-value=3.3e-13 Score=89.04 Aligned_cols=474 Identities=15% Similarity=0.105 Sum_probs=226.6 Q ss_pred hheehhccccchhhhhhhhcccc-cccchhhH-------HH--HHhhhhhhHhhHhhhhhcceeeeEEEEeeeccccCCC Q lcl|NC_021303. 23 SLTAASQLITDPQKQMKTSLMGT-ARNEWQSE-------AW--DFSESIGELSYYISWRANSCSRTTLIPSAIDPDTGLP 92 (637) Q Consensus 23 ~ltAAs~~~~~p~~~~k~~~~g~-~r~~WQ~e-------AW--~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~DtG~P 92 (637) -|-|--|.++-|...=|...+.. -.-.|+.. +| ..|-..+-+.--|.-+++++|.+.|..-+.+.|.+ T Consensus 1 ~~~~~~~~~~~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~~~~~~-- 78 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE-- 78 (518) T ss_pred CcccCceeecCchhhhhhhhhhcccccccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEEEcCCCc-- Confidence 55555666666653222211111 01112110 11 12334455666788899999999888878776632 Q ss_pred CCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeeeHHHhcc- Q lcl|NC_021303. 93 TGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIKS- 171 (637) Q Consensus 93 tG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~~- 171 (637) .+.. .+.+..+.+. -.--+-..++++.++.+|.+-|++|+.+.-...|. + ..++.|..+.+.. T Consensus 79 ---~~~~----~~~~~~Ll~~-PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~-~-------~~L~~l~p~~v~v~ 142 (518) T protein:vir:10 79 ---TEES----DTGYAKLLAD-PCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGT-P-------EKLMPMHPSRVAIK 142 (518) T ss_pred ---eecc----chHHHHHHcC-CCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-E-------EEEEEECCCceEEE Confidence 2211 2344455443 34456677899999999999999998876544443 1 1344554444431 Q ss_pred ---CCCceeE-EecCCCCc---ccccCCCceE-EEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCc Q lcl|NC_021303. 172 ---KAGETAE-ISLPDGKT---HEFNRDLDSL-VRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNG 243 (637) Q Consensus 172 ---k~g~~~~-i~lPdG~~---he~~~~~d~l-~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnG 243 (637) .++...+ +...+|.. .+|.. .+++ ||..+|+ .-..--||+.++...+.-..-+.+...+..+.=..-.| T Consensus 143 ~~~~~~~~~y~~~~~~~~~~~~~~~~~-~eViHir~~s~d--g~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~g 219 (518) T protein:vir:10 143 RNSRTGRYEYYFQAGAGVGTQLVSFAD-DEVVPIRFFNPD--GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNL 219 (518) T ss_pred EcCCCCEEEEEEEecCCccceEEEecC-CcEEEecCCCCC--cccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccE Confidence 2222221 33333322 23332 2332 4444433 22234467666555444443333333333222233335 Q ss_pred eeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccccee Q lcl|NC_021303. 244 VLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHI 323 (637) Q Consensus 244 vlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHl 323 (637) ||-+|+.++ ....+.|.+.|. ..+...+.+ --++|+ ++. -+++-| T Consensus 220 il~~~~~ls-------------------------~e~~~~~k~~~~----~~~~G~~na--g~v~vL--~~G--~~~~~l 264 (518) T protein:vir:10 220 VLRHEKRLS-------------------------EAAQQRLREQFD----RAHSGSSNT--GKTMVV--EEG--MEPIPL 264 (518) T ss_pred EEecCCCCC-------------------------HHHHHHHHHHHH----HHhcCcccc--CcceEc--CCC--ceEEEc Confidence 665554321 113344444432 223222221 122333 222 344555 Q ss_pred ecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhC Q lcl|NC_021303. 324 KFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREG 402 (637) Q Consensus 324 kf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eG 402 (637) .+... +.--+++|+-.+..+|...-|||..| |+ .++|+-++.+....-++.-|.|.+..|+++|++.++.. ++ T Consensus 265 ~~s~~-D~q~le~r~~~~~eIa~afgVPp~~l-g~~~~~t~sn~eq~~~~f~~~tL~P~l~~ie~~ln~~L~~~-~~--- 338 (518) T protein:vir:10 265 QLTAV-EMQFIEARQLNREEVCGVYDIAPPIV-HILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQY-WV--- 338 (518) T ss_pred cCChh-HHHHHHHHHHHHHHHHHHhCCCHHHh-ccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-cc--- Confidence 54332 22248999999999999999998655 87 46777777777777778889999999999999876643 22 Q ss_pred CChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHH Q lcl|NC_021303. 403 IDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMY 479 (637) Q Consensus 403 iDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~ 479 (637) ..|-+.||.+.| ..+|..+.+ ..++..|.+|-.-.|+.+|++.-++ +...+.+.+ . .+ T Consensus 339 ---~~~~~~fd~~~l-lr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~--~~gD~~~~~---------~----n~ 399 (518) T protein:vir:10 339 ---RKNRMKFDIDDV-IQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDD--PKADELYAN---------S----AL 399 (518) T ss_pred ---CCceEEEechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--CCCCeeeec---------c----cc Confidence 367799999998 345554443 3467779999999999999874321 001111100 0 11 Q ss_pred Hhhh---ccccccccCCCCc-CCCCCCCCCCCCCCCCCCCCCccCCCCC---------CCcccCCCCcchH-HHHHH--- Q lcl|NC_021303. 480 APLL---SSQLAGIEFPQPA-NAIESTREEDDEDSGARQQREPQTEDER---------STEEAASLNDRAA-YLVAE--- 542 (637) Q Consensus 480 apLl---~~~~~~ie~P~p~-~a~~~~~~~~d~~~~a~~g~EPdted~~---------~~~~~a~~~~~a~-~~aa~--- 542 (637) .||- .....+-+.|.++ ++..+..+.++...++..+-++++.+.. -.+..++....++ -+-+| T Consensus 400 ~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (518) T protein:vir:10 400 QPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGA 479 (518) T ss_pred eecccccccccCCCCCCCCCCCCccccccccccccccCCCCCcccccccccccccchhccccCCCcccccchHHHHHHHH Confidence 2221 1122233333332 1111111111111222223333332110 0111111111111 01111 Q ss_pred ----HHHHHHHHHHhcccccCCCchhhhhHhhcCchhhhhhhcCCCCHHHHHHHHhcccc Q lcl|NC_021303. 543 ----RLLVNRALDLAGKRRFKVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAGWDT 598 (637) Q Consensus 543 ----~llV~rALelAGkRr~~~~~~~~~~rlr~ip~h~~h~~~~PV~~~~v~rLi~GWd~ 598 (637) |-+-.-||.||-|- +.++.+-|--+--.. ..|- |. T Consensus 480 ~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~------------~~~~----~~ 518 (518) T protein:vir:10 480 MGRGKDIKGFALQLAEKY-----PDDLEDILLAVQLAL------------AERK----DN 518 (518) T ss_pred hhcCccchhHhhhhhhhc-----chhHHHHHHHHHHhh------------hhcc----CC Confidence 22223345555442 221111110000000 0000 00 No 13 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=99.40 E-value=2.9e-14 Score=94.80 Aligned_cols=417 Identities=14% Similarity=0.092 Sum_probs=214.4 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhc---ccccccchhhHHHHHhhhhhhHhhHhhhhhcceee Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSL---MGTARNEWQSEAWDFSESIGELSYYISWRANSCSR 77 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~---~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr 77 (637) ||.---|+.|+-++.... .. ..|-.+. +.....++..-..+.+-..+-+.-.|.-+++.++. T Consensus 1 ~~~~~~~~~~~~~~~~~~------------~~---~~~~~~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~ 65 (460) T protein:vir:10 1 MANRIIRALRELTGLDNK------------FN---DAFIKYIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVA 65 (460) T ss_pred CchhHHHHHhhhhccCCC------------ch---HHHHHhhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhh Confidence 776655555432221110 00 1111111 11111233333444444556666667888999999 Q ss_pred eEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHh---------------------ccCcccHHHHHHHHHhhhcccc Q lcl|NC_021303. 78 TTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGI---------------------ADGPLGQAALIKRAVECMTVVG 136 (637) Q Consensus 78 ~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~i---------------------AgG~lGqaqLlkr~~~~LtVpG 136 (637) +-+..-+.+.|.+.--.. ...-.........+.+ -..-+-..++++.++.+|-+-| T Consensus 66 lp~~v~~~~~~g~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~G 142 (460) T protein:vir:10 66 VPYTIKVVKDTKAYQQLN---NLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNG 142 (460) T ss_pred CceEEEeccCCccchhhh---hhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcC Confidence 999999988773211000 0000000000111111 1233467789999999999999 Q ss_pred cEEEEEEeecCCccccccccccccceeeeHHHhcc---CCC-------ceeEEecC-CCCcccccCCCceEEEEecCCcc Q lcl|NC_021303. 137 EVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIKS---KAG-------ETAEISLP-DGKTHEFNRDLDSLVRIWNPRPR 205 (637) Q Consensus 137 E~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~~---k~g-------~~~~i~lP-dG~~he~~~~~d~l~RvW~P~pr 205 (637) ++|+.+.-...|. ....+.+-| .|..+.+.. .++ .......+ +|..++|....=+-||.++|.-. T Consensus 143 nay~~i~r~~~~~---~~G~~~~L~-~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~~~~~~~ 218 (460) T protein:vir:10 143 NCYFYLMSPDDGI---NAGVPSQMY-VLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKYANPNFD 218 (460) T ss_pred CeEEEEEecCCCc---cCceeEEEE-EEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEecCCCCcc Confidence 9998766543331 111122222 222222221 111 11112333 44555665544444777776532 Q ss_pred ---cccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHH Q lcl|NC_021303. 206 ---KASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASE 282 (637) Q Consensus 206 ---ra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~ 282 (637) ....--||+.++...+. +........ .++..||. .|..+-.+.+. ......+ T Consensus 219 ~~~~~~~G~sp~~~~~~~i~----~~~~~~~~~-~~~f~ng~--~~~~i~~~~~~------------------l~~e~~~ 273 (460) T protein:vir:10 219 LQGSHLYGMSPIRAILRNIN----SQNSTIDNN-VKTMQNGG--VFGFIHGGSTG------------------LTQPQAD 273 (460) T ss_pred cccCccccccHHHHHHHHHH----HHHHHHHHH-HHHHhcCC--CcceeeecCCC------------------CCHHHHH Confidence 22344577766655443 333333332 23344442 22221111111 1122445 Q ss_pred HHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccC--- Q lcl|NC_021303. 283 QLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMS--- 359 (637) Q Consensus 283 ~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls--- 359 (637) .+.+.|.+. +...+. +--|+++ +++ -+++.|.....-.. -+++|+-.+..+|+.+-|||. |||.. T Consensus 274 ~~~~~~~~~----~~g~~n--~g~~~vl--~~g--~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~ 341 (460) T protein:vir:10 274 SLKQRLTEM----DKSPDR--LSQIAGA--SGE--IAFTKISLNTDELK-PFDYLKYDQKAICNALGWSDK-LLNNNEGG 341 (460) T ss_pred HHHHHHHHH----hcCccc--cCCceec--CCC--ceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHH-HhCCCCCC Confidence 565555433 222111 2233443 333 35556655433222 378999999999999999998 78873 Q ss_pred CcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHH Q lcl|NC_021303. 360 KGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSA 439 (637) Q Consensus 360 ~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~e 439 (637) ++|+-++.+....-++..|.|.+..|+++|++.+|.. +-....|.|+||.+.|..--.....-.++|+.|++|-. T Consensus 342 t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~-----~~~~~~~~i~~d~~~l~~l~~d~~~~~~~~~~g~~T~N 416 (460) T protein:vir:10 342 GLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKR-----FKGYENAVIEWDISELPEMQTDMVAMASWLNTIPVTPN 416 (460) T ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc-----ccccCCceEEeecchhhhHHHHHHHHHHHHhCCCCCHH Confidence 3567788888888899999999999999999988753 12356788999999985322222333468999999999 Q ss_pred HHHHHhcCccc--cCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_021303. 440 ALRRLLNVGED--SGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQRE 517 (637) Q Consensus 440 Alrr~lgl~~d--~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~E 517 (637) -.|+++|++.- .+-| .-+.| ..++.++ ..+ ++..++..+++. T Consensus 417 E~R~~~g~~pi~~~~gD----------------------~~~~~---~n~~~~~---------~~~--~~~~~~~~nq~~ 460 (460) T protein:vir:10 417 EIRIAMKYETLNQDGMD----------------------IVFMP---SNKVRID---------DVS--NNLIDSAFNQNQ 460 (460) T ss_pred HHHHHhCCCCCCCCCCC----------------------eeeec---ccccchh---------hcc--cccCCCcccCCC Confidence 99999999842 2323 00111 0111111 011 111111111111 No 14 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=99.39 E-value=3.9e-13 Score=88.62 Aligned_cols=421 Identities=9% Similarity=0.084 Sum_probs=229.1 Q ss_pred CCCCcceEE--ecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHH-HHhhhhhhHhhHhhhhhcceee Q lcl|NC_021303. 1 MAATSLRVV--RRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAW-DFSESIGELSYYISWRANSCSR 77 (637) Q Consensus 1 ma~~~lr~v--rrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW-~~yd~VgELryyvgWr~~s~Sr 77 (637) |.==. |+. ++.+.++ +..+..+...|.... |...+.+.- -. ..| ..+-++-.+.-++++||+ T Consensus 1 M~~~~-~~f~~~~r~~~~-----------~~~~~~~~~~~~~~~-g~~~~~~~v-~~~~al-~~~~v~~~i~~ia~~ia~ 65 (429) T protein:vir:10 1 MDSVK-KFFNFEKRQTSQ-----------VIELNKDDEKLLEWL-GISPSTISV-KGKNAL-KVATVFACIKILSESVSK 65 (429) T ss_pred Cchhh-hhhcccccCccc-----------ccccCCChHHHHHHh-cCCCCccee-chhhhh-ccHHHHHHHHHHHHhhcc Confidence 33211 111 1111111 112233444554443 322222210 01 122 345667778889999999 Q ss_pred eEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccc Q lcl|NC_021303. 78 TTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAP 157 (637) Q Consensus 78 ~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~ 157 (637) +.+..=+-+++ |.. .+.+ +.+..+.+.=...-+-..++++.++.+|-+-|+.|+.+.-...|. +.+.-.- T Consensus 66 l~~~~~~~~~~-~~~----~~~~----~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~~L~~i 135 (429) T protein:vir:10 66 LPLKIYQEDEY-GIQ----RGTK----HYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGK-VQALWPI 135 (429) T ss_pred CceEEEEecCC-cee----eccc----cHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EEEEEEE Confidence 99887665533 321 1222 456666554445556777899999999999999999977544443 2222221 Q ss_pred cccceeeeHHHhcc-CCCce-eEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_021303. 158 RARWYAVTREEIKS-KAGET-AEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (637) Q Consensus 158 ~~~W~~vt~~Ei~~-k~g~~-~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 235 (637) ..++..+..++... +.+.. .+....+|..++|.. +-||++=.+.+..-..-.||+..+...+.-.....+...+.. T Consensus 136 ~~~~v~v~~~~~~~~~~~~~~~~~~~~~g~~~~~~~--~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~ 213 (429) T protein:vir:10 136 DASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKP--EEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFY 213 (429) T ss_pred cCceeEEEEcCcccccccceEEEEEccCCeEEEEcc--ccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 22223333332211 11122 223445666666654 335665344455555677888888877776666666655555 Q ss_pred HhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_021303. 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (637) Q Consensus 236 ~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~E 315 (637) +.-..-.|||-+|+.++ ....+.+++.|.+. ....++.+. ++|+ ++. T Consensus 214 ~ng~~~~~il~~~~~l~-------------------------~e~~~~~~~~~~~~-~~g~~n~~~-----~~vl--~~g 260 (429) T protein:vir:10 214 KQGLQVKGLVQYVGDLN-------------------------EDAKKVFRENFESM-SSGLQNSHR-----IALM--PVG 260 (429) T ss_pred hccCCccEEEEcCCCCC-------------------------HHHHHHHHHHHHHH-hccccccCc-----eeec--CCC Confidence 55444557777665322 11334555555332 122233222 2222 333 Q ss_pred HhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHH Q lcl|NC_021303. 316 HLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILT 395 (637) Q Consensus 316 hi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr 395 (637) -+++-|.+.. .+.--+++|+-.+..+|..+-|||.-|=++.++|+-++.|....=++..|.|.+..|+++|++.+|- T Consensus 261 --~~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~ 337 (429) T protein:vir:10 261 --YQFQPISLNM-SDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFL 337 (429) T ss_pred --ceEEEccCCh-hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 2455555432 2222378999999999999999997664456788877777777778889999999999999988774 Q ss_pred HHHHHhCCChHHeEEeecCcccccCCCCCHH---HHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCC Q lcl|NC_021303. 396 PLLAREGIDPTKYILWYDASGLTSDPDLSDE---AVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKN 472 (637) Q Consensus 396 ~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tde---A~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~ 472 (637) ..--. ..|.+.||.+.| ..+|..+. ...++..|++|-.-.|+.+|++.-.+=| +.+ T Consensus 338 ~~~~~-----~g~~~~fd~~~l-l~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD----~~~----------- 396 (429) T protein:vir:10 338 DSELD-----KGFYSKFNVDAI-LRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD----RLL----------- 396 (429) T ss_pred hhhcC-----CCcEEEeechhh-hcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC----eee----------- Confidence 33221 446789999988 33443333 3448888999999999999987543312 000 Q ss_pred chhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_021303. 473 PELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQR 516 (637) Q Consensus 473 P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~ 516 (637) .| ..++.++... .....-|+++.+...++.+|+ T Consensus 397 -------~~---~n~~~~d~~~-~~~~k~g~~~~~~~~~~~e~~ 429 (429) T protein:vir:10 397 -------VN---GNMLPIDMAG-QAYLKGGDTNGEVSKEGNEGN 429 (429) T ss_pred -------ec---ccccchhhcc-ccccCCCCCCCCCCCCCCCCC Confidence 01 0111111100 001111222222222222222 No 15 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=99.38 E-value=4.3e-13 Score=88.36 Aligned_cols=405 Identities=14% Similarity=0.140 Sum_probs=222.8 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchh--hHHHHHhhhhhhHhhHhhhhhcceeee Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQ--SEAWDFSESIGELSYYISWRANSCSRT 78 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ--~eAW~~yd~VgELryyvgWr~~s~Sr~ 78 (637) |-==+ |+.+|...++. +. |...... .+++ ...|. .=.++.+-.++-+.-.|.-+++++|.+ T Consensus 1 Mg~f~-~lf~r~~~~~~-~~-------------~~~~~~~-~~~~-~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~ 63 (414) T protein:vir:44 1 MVFFS-GLFQRKSDAPV-TT-------------PAELADA-IGLS-YDTYTGKQISSQRAMRLTAVFSCVRVLAESVGML 63 (414) T ss_pred Cchhh-hhhccCccCcc-cc-------------hhhHhHh-hccC-ccccCCceechhhhhccHHHHHHHHHHHHHhccC Confidence 43221 55565433322 11 1111111 0110 01111 011223334555666677789999999 Q ss_pred EEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccc Q lcl|NC_021303. 79 TLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPR 158 (637) Q Consensus 79 rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~ 158 (637) .+..-+.+.+ |.. .. ..+.+..+.+.=...-+-..++++.++.+|-+-|++|+.+. |.+|.+ . T Consensus 64 p~~~~~~~~~-~~~----~~----~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~-~~~g~~-------~ 126 (414) T protein:vir:44 64 PCNLYHLNGS-LKQ----RA----TGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKV-KAFGEV-------A 126 (414) T ss_pred ceEEEEecCC-cee----ec----ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE-eCCCcE-------E Confidence 9988777754 222 11 23556666665556667888899999999999999998764 555542 1 Q ss_pred ccceeeeHHHhc---cCCCceeE-EecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHH Q lcl|NC_021303. 159 ARWYAVTREEIK---SKAGETAE-ISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNA 234 (637) Q Consensus 159 ~~W~~vt~~Ei~---~k~g~~~~-i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na 234 (637) + .+.|...-+. ..++..++ +..++|...+|....=+-||..+.+ -..--||+..+...+.-..-..+...+. T Consensus 127 ~-L~~l~~~~v~~~~~~~~~~~y~~~~~~g~~~~~~~~evih~~~~~~d---~~~G~s~i~~~~~~i~~~~~~~~~~~~~ 202 (414) T protein:vir:44 127 E-LLPVDPGCVVPKLNSSWEPVYQVTFPDGSTDVLSQEDIWHVRTLTLD---GLVGLNPIAYAREAISLAAATEEHGARL 202 (414) T ss_pred E-EEEEcCceEEEEECCCCcEEEEEEecCceEEEEccccEEEecCCCCC---CcccccHHHHHHHHHHHHHHHHHHHHHH Confidence 1 2222222221 23333333 5667887777765332224433322 2456677777666655555555555555 Q ss_pred HHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeech Q lcl|NC_021303. 235 AKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAA 314 (637) Q Consensus 235 ~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~ 314 (637) .+.-....|||-+|+.++ ....+.+++.|.+ .+...+. +--|+|+ ++ T Consensus 203 f~ng~~p~gil~~~~~l~-------------------------~e~~~~~~~~~~~----~~~g~~n--~~~~~vl--~~ 249 (414) T protein:vir:44 203 FSNGAVTSGVLRTEQTLS-------------------------DQAYERLKKDFEE----RHTGLGN--AHRPMIL--EM 249 (414) T ss_pred HhccCCCceEEEeCCCCC-------------------------HHHHHHHHHHHHH----HhcCccc--cCcceec--CC Confidence 555555677877665322 1134445444433 2322221 2224443 33 Q ss_pred HHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHH Q lcl|NC_021303. 315 EHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDIL 394 (637) Q Consensus 315 Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~L 394 (637) . -+++.|.+... +.--+++|+-.+..+|..+-|||..|=+.+++|+-++.+....-++--|.|.++.|+++|+..+| T Consensus 250 g--~~~~~l~~~~~-d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~ 326 (414) T protein:vir:44 250 G--LDWKSMALNAE-DSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGLV 326 (414) T ss_pred C--ceEEEccCChH-HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 2 34555554322 22347889999999999999999765444578888887777777888899999999999999887 Q ss_pred HHHHHHhCCChHHeEEeecCcccccCCCCCHH--H-HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcC Q lcl|NC_021303. 395 TPLLAREGIDPTKYILWYDASGLTSDPDLSDE--A-VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTK 471 (637) Q Consensus 395 r~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tde--A-~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~ 471 (637) ..--. ..|-+.||.+.|. .+|..+. + ..++..|.+|-.-.|+.+|++.-.+=| . T Consensus 327 ~~~~~------~~~~i~fd~~~ll-~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD-------------~--- 383 (414) T protein:vir:44 327 RKSKQ------GVFYAKFNAGALL-RGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGD-------------V--- 383 (414) T ss_pred Ccccc------CceEEEEechhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-------------e--- Confidence 65221 3678899998883 2333222 2 348888999999999999997433211 0 Q ss_pred CchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021303. 472 NPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERST 526 (637) Q Consensus 472 ~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~ 526 (637) -+. |.-.+..+ .+. .+.+.+.++..+|++++ T Consensus 384 ------~~~------------~~n~~~~~-~~~-----~~~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 384 ------YLT------------PMNMTTKP-SDG-----SKAGKQKDNANADETTS 414 (414) T ss_pred ------ecc------------cccccccC-Ccc-----ccCCCCCCCCCCCCCCC Confidence 000 00000000 000 01111122222333222 No 16 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=99.36 E-value=3.6e-13 Score=88.79 Aligned_cols=399 Identities=14% Similarity=0.133 Sum_probs=219.0 Q ss_pred CCCC----cceEEecCCCCCcccc-cchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcce Q lcl|NC_021303. 1 MAAT----SLRVVRRPKGSAPAAR-RRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSC 75 (637) Q Consensus 1 ma~~----~lr~vrrpk~~~p~~~-r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~ 75 (637) |+.- .+.+.|+.|..+...+ ......+.++..++...+ .+..++... .++ ..+-+.-.+.-+++++ T Consensus 4 ~~~~~~~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-------~~~-~~~~v~~cI~~ia~~i 74 (413) T protein:vir:96 4 VSEIRKDKNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFF-KELISDGYT-------KLS-DSPEVRMAVDCIADLV 74 (413) T ss_pred cchhhhhhcCCccccCCCcchhhhhhccccccccccccchhhH-hhhccchhH-------HHh-hchHHHHHHHHHHHhh Confidence 3221 2344555443221111 111111112221221111 111111000 112 2355666678899999 Q ss_pred eeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccc Q lcl|NC_021303. 76 SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLA 155 (637) Q Consensus 76 Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~ 155 (637) |++.+..-+-+.| |... .+ +....+++.=...-+-..++++.++.+|-+-|++|+.+.-...|..+.. T Consensus 75 a~~~~~~~~~~~~-~~~~-----~~----~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~-- 142 (413) T protein:vir:96 75 SNMTIQLMQNGET-GDKR-----IK----NDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIG-- 142 (413) T ss_pred ccCceEEEEecCC-Cccc-----cc----cHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEE-- Confidence 9998888777655 4332 11 2355555444455677889999999999999999999776555542222 Q ss_pred cccccceeeeHHHhc--cCCCceeEEecCCCCcccccCCCceEEEE-ecCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_021303. 156 APRARWYAVTREEIK--SKAGETAEISLPDGKTHEFNRDLDSLVRI-WNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (637) Q Consensus 156 ~~~~~W~~vt~~Ei~--~k~g~~~~i~lPdG~~he~~~~~d~l~Rv-W~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 232 (637) -| .+..+.|. ...++..+....+| .+|.. +=||++ ++|++..-..--||+.++.+.+.-..-..+... T Consensus 143 ----L~-~l~~~~v~~~~~~~~~~y~~~~~~--~~~~~--~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 213 (413) T protein:vir:96 143 ----LT-PISPYKVTFNVSDDDLDYSITFDN--KEYDP--STLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKK 213 (413) T ss_pred ----EE-EecCceeEEEEcCCeEEEEEeecC--cEEch--hhEEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHH Confidence 12 22222222 12222222222233 23432 234554 578887777788999998888888877888888 Q ss_pred HHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_021303. 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (637) Q Consensus 233 na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~v 312 (637) +..+.-..-.|||-+|+.++ + -..+.+++.+. ..+...+. +-=++|+.. T Consensus 214 ~~~~ng~~p~gil~~~~~l~------~-------------------e~~~~~~~~~~----~~~~g~~n--~g~~~vl~~ 262 (413) T protein:vir:96 214 GFMASEYMPNLIVSVDSDSD------E-------------------LSDEEGRENFE----EMYLKRKE--AGKPWIIPE 262 (413) T ss_pred HHHhccCCccEEEEeCCCCC------H-------------------HHHHHHHHHHH----HHhcCccc--cCceeeecC Confidence 88888788889988887543 0 02344544433 22222111 112233333 Q ss_pred chHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhH Q lcl|NC_021303. 313 AAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYND 392 (637) Q Consensus 313 P~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~ 392 (637) .+.-...++-+.. .+.--+++|+..+..+|..+-|||. |||.++.|. +....-++..|.|.+..||++|++. T Consensus 263 ~~~~~~~~~~~~~---~d~q~~e~~~~~~~~Ia~~fgVP~~-~lg~~~~~~----~~~~~~~~~~l~P~~~~ie~~ln~~ 334 (413) T protein:vir:96 263 GMVNVQQIKPLTL---NDLAINDAVTLDKKTVAGIFGVPAF-LLGVGTYNK----DEFNNFINTKIMSIAQVIQQTYNKL 334 (413) T ss_pred CcccccccccCCh---hHHHHHHHHHHHHHHHHHHhCCCHH-HcCCCcchH----HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 3332222222221 2334578999999999999999986 668754432 2233457778999999999999988 Q ss_pred HHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHH---HHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHh Q lcl|NC_021303. 393 ILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAV---EAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVV 469 (637) Q Consensus 393 ~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~---~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v 469 (637) +|- ..|-+.||.+.| .++|..+.|. .++..|.+|-.-.|+.+|+.-..+=| +.+ + T Consensus 335 ll~----------~~~~~~fd~~~l-l~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~~gd----~~~-------~ 392 (413) T protein:vir:96 335 IVE----------EDMYFSLNPRSL-YNYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPDAEMD----DLL-------V 392 (413) T ss_pred hCC----------CCcEEEEechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc----eee-------e Confidence 752 356789999998 4455444433 67788999999999999987544311 000 0 Q ss_pred cCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_021303. 470 TKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREP 518 (637) Q Consensus 470 ~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EP 518 (637) .+| +.|+ +...+.+...++|. T Consensus 393 ~~n------~~~~----------------------~~~~~~~~~~~~dt 413 (413) T protein:vir:96 393 LEN------YLQQ----------------------KDLVNQKKLIQDET 413 (413) T ss_pred ccc------ccch----------------------hhcccccCCCCCCC Confidence 000 1110 00111111111111 No 17 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=99.36 E-value=5.3e-13 Score=87.88 Aligned_cols=402 Identities=16% Similarity=0.172 Sum_probs=215.2 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.==+ |+.++++... ..+..+. ..+++..+-. .|. .-. ...| + ..+-+.-.|.-++++||.+.+ T Consensus 1 Mgl~~-~~f~~~~~~~------~~~~~~~-~~~~~~~~~~--~g~-~v~-~~~a---l-~~~~v~~~v~~ia~~iA~lp~ 64 (409) T protein:vir:84 1 MSLFT-RIFSGPSEER------TLTKISG-IPSPAEDWAM--HGD-RPG-ANSA---M-TLGAFYACVTLLADTVASLSI 64 (409) T ss_pred Cchhh-hhhcCCCccc------ccccccc-cccccchhhc--cCc-ccc-hhhh---h-ccHHHHHHHHHHHHhhhhCce Confidence 66544 5666654321 2221111 1222222211 110 000 0000 1 123345556678999999999 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+-+ |.|+ + +. +.+.++.+.=-.--+...++++.++.+|-+-|+.|+.|..|..++.+.+...-..+ T Consensus 65 ~~~~~~-~~~~----~--~~----~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~ 133 (409) T protein:vir:84 65 DAYRKK-DNVR----I--PV----SPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPD 133 (409) T ss_pred EEEEec-CCcc----c--cc----chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCc Confidence 877655 3222 2 22 34566655445566788899999999999999999988777665433333221112 Q ss_pred ceeeeHHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhh Q lcl|NC_021303. 161 WYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVM 240 (637) Q Consensus 161 W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~ 240 (637) +.-+.. .....+......- .+...+|.. +=||++=+..+..-..--||+..+...+.=..-..+...+..+.-.. T Consensus 134 ~v~v~~--~~~~~~~~~~~~~-~~~g~~~~~--~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~ 208 (409) T protein:vir:84 134 CIHVTD--AKDEDGDWIEPVY-RIDGKVVPN--HRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSAN 208 (409) T ss_pred eeEEEE--cCCCcceEEEEEe-cCCceEEch--hhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 221211 1111222222221 222234432 33555534444444567788887776666555555555555555555 Q ss_pred cCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_021303. 241 NNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKV 320 (637) Q Consensus 241 gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~i 320 (637) -.|||-+|+.++ ....+.+.+.+++.. .. +--++|+ +++ .++ T Consensus 209 p~gil~~~~~l~-------------------------~e~~~~~~~~~~~~~----~n-----~g~~~vl--~~g--~~~ 250 (409) T protein:vir:84 209 PSGILSSDADLT-------------------------PDQVKQTQKQWIQSH----HN-----RRLPAVM--SAG--IKW 250 (409) T ss_pred ccEEEecCCCCC-------------------------HHHHHHHHHHHHHHh----cc-----CCCeeec--CCC--ceE Confidence 567776665322 113455666555432 22 2224443 333 345 Q ss_pred ceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeee--EEeccCceeEeechhHHHHHHHHHhHHHHHH Q lcl|NC_021303. 321 QHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSA--WAIGDEDVQLHIKPVMDLICQAIYNDILTPL 397 (637) Q Consensus 321 kHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsA--W~I~dedVrlHI~P~me~ic~Ait~~~Lr~~ 397 (637) +-+.+... +.--+++|+..+..+|..+-|||. +||. .++|-|++ .|....=++-.|.|.+..||++|++.+ T Consensus 251 ~~~~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~L---- 324 (409) T protein:vir:84 251 QSVSITPN-ESQFLETRSFQRSEIAMWFRIPPH-MIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRCIEQALDTFL---- 324 (409) T ss_pred EEccCChh-HHHHHHHHHHHHHHHHHHhCCCHH-HhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhc---- Confidence 55554322 222378999999999999999997 5676 45666653 444444466779999999999998653 Q ss_pred HHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCch Q lcl|NC_021303. 398 LAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPE 474 (637) Q Consensus 398 L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~ 474 (637) +..|.|.||.+.| ..+|..+.+ ..+++.|++|-.-.|+.+|++.-.+-| +. T Consensus 325 -------~~g~~i~fd~~~l-~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD----~~-------------- 378 (409) T protein:vir:84 325 -------PRGQFVKFNVDGL-MRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGD----IH-------------- 378 (409) T ss_pred -------cCCCeEEEechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc----ee-------------- Confidence 2357899999998 456655554 447778999999999999998654422 00 Q ss_pred hHHHHHhhhccccccccCCCCcCCCCCCCCCC-CCCCCCCCCCC Q lcl|NC_021303. 475 LIAMYAPLLSSQLAGIEFPQPANAIESTREED-DEDSGARQQRE 517 (637) Q Consensus 475 Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~-d~~~~a~~g~E 517 (637) +.|+ .+..++ ..++.+... .+.+++.+|++ T Consensus 379 ----~~~~---n~~~~~------~~~~~~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 379 ----LQPM---NFVPLG------YVPPEEPAQEPQPNSATEGNK 409 (409) T ss_pred ----eecc---cccccc------cCCccccCcCCCCCCccCCCC Confidence 0010 000011 001111110 11112223333 No 18 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=99.33 E-value=1.4e-12 Score=85.53 Aligned_cols=413 Identities=15% Similarity=0.127 Sum_probs=207.4 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHH-HHhhhhhhHhhHhhhhhcceeeeE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAW-DFSESIGELSYYISWRANSCSRTT 79 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW-~~yd~VgELryyvgWr~~s~Sr~r 79 (637) |.==+ |+-.+|.... ++-+..+.++. ++......+.-.. +.+..++-++-.+.=+++++|++- T Consensus 1 Mg~~~-~~~~~~~~~~--------~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp 64 (423) T protein:vir:81 1 MGFLQ-KLGLAPSVVA--------TPEPIELVGPI-------FESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQ 64 (423) T ss_pred CchhH-hhcccccccc--------Ccccccccccc-------ccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCc Confidence 32211 1111221100 00011111110 0000000111111 112244556777888999999999 Q ss_pred EEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccc Q lcl|NC_021303. 80 LIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRA 159 (637) Q Consensus 80 L~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~ 159 (637) |..=+.+.|.+.. .+.+ +.+..+.+. ..--+-..++++.++.+|-+-|+.|+.+ .|..|........--- T Consensus 65 ~~~~~~~~dg~~~----~~~~----~~~~~ll~~-PN~~~t~~~f~~~~~~~l~l~Gna~~~i-~rd~~~~~~~~~l~p~ 134 (423) T protein:vir:81 65 LQAFERVEDGGRE----RVRE----GHLARVCKL-ANSDMTMYDLLERTMFDLCLYDEFFWLL-PGDLGVDTPTLDIRPI 134 (423) T ss_pred eEEEEEecCCcee----eecc----chHHHHhhc-CCCCCCHHHHHHHHHHHHhhcCCeEEEE-EecCCcCcceEEEeec Confidence 9887777673221 1222 334555543 4455788899999999999999999765 4543321100000000 Q ss_pred cceeeeHHHhccCCCceeEE----ecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_021303. 160 RWYAVTREEIKSKAGETAEI----SLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (637) Q Consensus 160 ~W~~vt~~Ei~~k~g~~~~i----~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 235 (637) ..-.+++...+.-.+.-.+. ...+|...+|.. +-+|++=++++..-..--||+..+.+.+.=..-..+...+.. T Consensus 135 ~~~~v~~~~~~~~~~~~~Y~~~~~~~~~g~~~~~~~--~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f 212 (423) T protein:vir:81 135 PVSWVQRRAYKDGWGSLDYIIIESGDNDGRSVKVPG--ERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMW 212 (423) T ss_pred ccceeeeeeccCCCcceEEEEEEecCCCceEEEEcc--cceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 11112222222111222221 112454444443 234555466666555567776665544433333333332222 Q ss_pred HhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_021303. 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (637) Q Consensus 236 ~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~E 315 (637) +.-..-.|||.+++.+.-... ...+.+.+++-| ++.+..-.+-+. -|+|+ +++ T Consensus 213 ~ng~~p~gvi~~~~~~~~~~l--------------------~~e~~~~~~~~~----~~~~~~~~~n~g-~~~vl--~~g 265 (423) T protein:vir:81 213 RNGPRPGMVIMRDPESKAGKW--------------------DAESRTRFMANL----RASFSPKSSDVG-GTLLL--EDG 265 (423) T ss_pred hccCCCceEEEecCcccCccC--------------------CHHHHHHHHHHH----HHHhccccccCC-cceec--CCC Confidence 222333467777764321111 112333333322 222221111122 22332 332 Q ss_pred HhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeechhHHHHHHHHHhHHH Q lcl|NC_021303. 316 HLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDIL 394 (637) Q Consensus 316 hi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~L 394 (637) -+++-|.+.. .+.--+++|+-.+..+|...-|||. |||. +++++-+..|....=++.-|.|.+..|+++|+..+| T Consensus 266 --~~~~~l~~s~-~d~q~~e~~~~~~~eIa~~fgVPp~-~lg~~~~~t~sn~e~~~~~f~~~~L~P~~~~ie~~l~~~L~ 341 (423) T protein:vir:81 266 --MKAENFHTTS-KDEQTVETTKLSLQTVAQVYGINPT-MVGQLDNANYSNVREFRKALYGDNLGSWIRIIQDVMNLFLL 341 (423) T ss_pred --ceEEeccCCh-hhHHHHHHHHhhHHHHHHHhCCCHH-HhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Confidence 3566666543 3334568888999999999999877 5687 467776777777777777899999999999998876 Q ss_pred HHHHHHhCCChHHeEEeecCcccccCCCCCHH--HHH--HHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhc Q lcl|NC_021303. 395 TPLLAREGIDPTKYILWYDASGLTSDPDLSDE--AVE--AHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVT 470 (637) Q Consensus 395 r~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tde--A~~--a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~ 470 (637) .. .++|-..|.+.||.+.| ..+|..+. +.+ +...|.+|-.-.|+.+|++...|=| T Consensus 342 ~~----~~~~~~~~~~~fd~~~l-lr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~~gGD---------------- 400 (423) T protein:vir:81 342 PR----VGIDNEKFYFEFNLEEK-LRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSIDGGD---------------- 400 (423) T ss_pred Cc----cccccCccEEEecchhh-hccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCCCCcc---------------- Confidence 54 44566789999999998 44554433 332 2245999999999999998654322 Q ss_pred CCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_021303. 471 KNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQT 520 (637) Q Consensus 471 ~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdt 520 (637) ++-.|.... +++.++. .+++.+| T Consensus 401 --------------------~~~~p~n~~-~~~~~~~------~~~~~~t 423 (423) T protein:vir:81 401 --------------------DLARPLNTE-FGDSEDA------PGEEVET 423 (423) T ss_pred --------------------eeecccccc-cCccCCC------CCCCCCC Confidence 111111110 1111111 1111222 No 19 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=99.33 E-value=1.4e-12 Score=85.51 Aligned_cols=419 Identities=14% Similarity=0.131 Sum_probs=219.3 Q ss_pred CCCCcceEEecCCCCCcccccchheeh---hccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceee Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAA---SQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSR 77 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAA---s~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr 77 (637) |+-+--+|+-+=++.++.+ +.+- +-..+|+. .|....++...++ ..--.+-.=.++-+.-.+.-+++++|. T Consensus 1 ~~~~l~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~-~~~~~~g~~~~~g-~~v~~~~al~~~~V~~~i~~ia~~ia~ 74 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSS----LFGWGGKTIRLTDGA-FWSQFLGRESSSG-KKVTVDKAMKLSAVWACVRLISTSVAG 74 (434) T ss_pred Cccchhhhhhhcccccchh----hhcccccccccCchH-HHHHHhcCCccCC-ceechhhhhccHHHHHHHHHHHHhhhh Confidence 6655555544433333211 1111 11112222 1222211110000 000011111234456678889999999 Q ss_pred eEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccc Q lcl|NC_021303. 78 TTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAP 157 (637) Q Consensus 78 ~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~ 157 (637) +.+..=+-+.|.+. +.++ .+.+..+.+.=-..-+-..++++.++.+|-+-|+.|+.| .|.+|. + T Consensus 75 lp~~~~~~~~~g~~----~~~~----~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i-~~~~G~-~------ 138 (434) T protein:vir:43 75 LPLGVYERKADGSR----VDAR----SFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEI-RRAAGR-P------ 138 (434) T ss_pred CceEEEEEcCCCcc----cccc----ccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEE-EeCCCc-E------ Confidence 98887777766221 1222 245666655545666788899999999999999999775 465543 1 Q ss_pred cccceeeeHHHhcc---CCCceeE-EecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_021303. 158 RARWYAVTREEIKS---KAGETAE-ISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKN 233 (637) Q Consensus 158 ~~~W~~vt~~Ei~~---k~g~~~~-i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 233 (637) . .++.|..+.+.. .+|...+ ....+|..++|.. +-||++=.+ +-+-..--||+..+.+.+.-.....+...+ T Consensus 139 ~-~L~~l~p~~v~~~~~~~g~~~y~~~~~~g~~~~~~~--~eVih~~~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~ 214 (434) T protein:vir:43 139 A-ALDFLLPSRVDLECDENGRLKYFYTTKKGARREIER--TNMLHIPAF-TLDGRIGLSAIRYGVDVFGSVMSAEDAANG 214 (434) T ss_pred E-EEEEEcCcceEEEEcCCCeEEEEEEecCceEEEEcc--ccEEEecCc-CCCCccccCHHHHHHHHHHHHHHHHHHHHH Confidence 1 334444444431 2333332 4556777777764 334443122 222344568877776666555555554444 Q ss_pred HHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_021303. 234 AAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVA 313 (637) Q Consensus 234 a~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP 313 (637) ..+.-..-.|||-+|+.++ ....+.|.+.+-+. ..-.+ +--++|+ + T Consensus 215 ~f~ng~~~~gil~~~~~l~-------------------------~e~~~~~r~~~~~~----~g~~n---ag~~~vl--~ 260 (434) T protein:vir:43 215 TFKNGLLPTVAFKVDRILQ-------------------------PAQREEFREYVKSV----SGAMN---SGRSPVL--E 260 (434) T ss_pred HHhccCCcceEEecCCCCC-------------------------HHHHHHHHHHHHHh----cCccc---cCCcccc--C Confidence 4444344456666666433 01334455544221 11111 2223333 3 Q ss_pred hHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccC-Ccceeee--EEeccCceeEeechhHHHHHHHHH Q lcl|NC_021303. 314 AEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMS-KGNHWSA--WAIGDEDVQLHIKPVMDLICQAIY 390 (637) Q Consensus 314 ~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls-~~NHWsA--W~I~dedVrlHI~P~me~ic~Ait 390 (637) +. -+++.|.+... +.--+++|+-.+..+|..+=|||. |||.. ++++|.+ -|....-++.-|.|.+..|+++|+ T Consensus 261 ~g--~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~~~s~~e~~~~~f~~~~L~P~~~~ie~~ln 336 (434) T protein:vir:43 261 QG--ITPETIGINPV-DAQLLETREHGVIEICRWFGVPPW-MIGQTDKGSNWGTGLEQQMLAFLTFSISSITNQIQQCVN 336 (434) T ss_pred CC--ceEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCHH-HhCCCcCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22 34555554332 223488999999999999999987 55874 4666644 455555567779999999999999 Q ss_pred hHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHH---HHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHH Q lcl|NC_021303. 391 NDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAV---EAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAAD 467 (637) Q Consensus 391 ~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~---~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d 467 (637) +.+|.+--. .+|.|-||.+.| .+.|..+.|. .++..|.+|-.-.|+.+|++.-.+=| +.+. T Consensus 337 ~kL~~~~~~------~~~~~~fd~~~l-lr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD----~~~~----- 400 (434) T protein:vir:43 337 KRLLTAPER------IRYYAEFSLEGF-LKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPGGD----ILTV----- 400 (434) T ss_pred hhcCChhhh------cCceEEEechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----eEee----- Confidence 987654222 358899999998 3455554443 37788999999999999998543312 0000 Q ss_pred HhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCC--CCCCCCCCCCCCccCCC Q lcl|NC_021303. 468 VVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREE--DDEDSGARQQREPQTED 522 (637) Q Consensus 468 ~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~--~d~~~~a~~g~EPdted 522 (637) + -.++|+= .++ ..+.. ....+++..++ |.+++ T Consensus 401 ----~----~n~~~~~-----~~~---------~~~~~~~~~~~~~~~~~~-~~~~~ 434 (434) T protein:vir:43 401 ----Q----SNLVPID-----QLG---------QSNKSQAVRAALMNWFSQ-PEPQE 434 (434) T ss_pred ----c----cCccchh-----hhh---------ccCCCcchhhhhhccCCC-CCCCC Confidence 0 0011110 000 00000 01111111111 11111 No 20 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=99.30 E-value=1.2e-12 Score=85.91 Aligned_cols=408 Identities=12% Similarity=0.077 Sum_probs=224.9 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccc-cchhhHHHHHhhhhhhHhhHhhhhhcceeeeE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTAR-NEWQSEAWDFSESIGELSYYISWRANSCSRTT 79 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r-~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (637) |=-. |+.+| |+.. +..+...++ .+... +|... ..+..=.++-+=..+.+.-.|.-++++||++. T Consensus 1 m~~~--~~f~~-~~~~---------~~~~~~~~~--~~~~~-~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~ 65 (416) T protein:vir:12 1 MLLE--RMFEK-RSGS---------SDHEDGFNN--ILLNM-FGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKLP 65 (416) T ss_pred Cccc--hhccc-ccCc---------cccCccchh--HHHHh-hcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCc Confidence 3222 11111 1111 111111111 11111 22111 11111111122234566667888999999999 Q ss_pred EEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccc Q lcl|NC_021303. 80 LIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRA 159 (637) Q Consensus 80 L~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~ 159 (637) +..=+-+++ |.. .+.+ +....++..=..--+-..++++.++.+|-+-|+.|+.+.-...|. -. T Consensus 66 ~~~~~~~~~-~~~----~~~~----~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~--------~~ 128 (416) T protein:vir:12 66 IHTYKRTDG-GIE----RKPE----HKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGY--------PE 128 (416) T ss_pred eEEEEecCC-ccc----cccc----cHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc--------EE Confidence 866554432 322 2222 234444444444567788999999999999999999866433332 12 Q ss_pred cceeeeHHHhc----cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_021303. 160 RWYAVTREEIK----SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (637) Q Consensus 160 ~W~~vt~~Ei~----~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 235 (637) .++.|..+-++ ..++...+....+|...+|....=+-||-.+++. ..--||+.++...+.-.....+...+.. T Consensus 129 ~L~~l~~~~v~v~~~~~~~~~~~~~~~~g~~~~~~~~eiih~~~~~~~~---~~G~s~i~~~~~~i~~~~~~~~~~~~~~ 205 (416) T protein:vir:12 129 ALFPLRPDYTNAYVHPTTGMLWYQTVLNGKAIELYDYEVLHFKGLSTDG---IHGKSPIGVVREHIGAQAAATKYNAKLY 205 (416) T ss_pred EEEEECCcceEEEEeCCCcEEEEEEecCCeEEEecCccEEEecCcCCCC---cccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 44445444443 1223333334457777676654333344343332 3455888777777766666666666666 Q ss_pred HhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_021303. 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (637) Q Consensus 236 ~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~E 315 (637) +.-..-.|||-+|+.++ ....+.+.+.+-.+. ..++ ++|+ ++. T Consensus 206 ~ng~~p~~il~~~~~~~-------------------------~e~~~~~~~~~~~~~-----~~~~-----~~vl--~~g 248 (416) T protein:vir:12 206 KNEATPRGILKVPAFLD-------------------------EKPKENVRKEWKRVN-----KVEN-----IAII--DYG 248 (416) T ss_pred hcCCCCceEEecCCCCC-------------------------HHHHHHHHHHHHHHh-----cCCC-----eeec--CCC Confidence 66666678887765221 124556666664321 1122 2222 443 Q ss_pred HhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHH Q lcl|NC_021303. 316 HLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILT 395 (637) Q Consensus 316 hi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr 395 (637) -+++-|.+... +.--+++|+-....+|..+-|||.-|=+..++|+-++-+....-++.-|.|.+..|+++|++.+|- T Consensus 249 --~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~l~~ 325 (416) T protein:vir:12 249 --LEYQSISMPLQ-EAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIEYVRNTLQPWIVNFEQELNVKLFL 325 (416) T ss_pred --ceEEEccCChh-hHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 35666665432 223478999999999999999997665557788888888887888889999999999999998874 Q ss_pred HHHHHhCCChHHeEEeecCcccccCCCCCHHHH---HHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCC Q lcl|NC_021303. 396 PLLAREGIDPTKYILWYDASGLTSDPDLSDEAV---EAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKN 472 (637) Q Consensus 396 ~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~---~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~ 472 (637) +.-.. ..|-+-||.+.| ...|..+.|. .++++|++|-.-.|+.+|++.-.+=| +.++. T Consensus 326 ~~~~~-----~g~~i~fd~~~l-~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd----~~~~~--------- 386 (416) T protein:vir:12 326 DHDQK-----SGHYVKFNIDSE-LRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGD----KYISS--------- 386 (416) T ss_pred chhhc-----CCceEEeechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc----eeeec--------- Confidence 33221 347788999997 4455555443 58888999999999999998654312 00000 Q ss_pred chhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_021303. 473 PELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTED 522 (637) Q Consensus 473 P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted 522 (637) -.++| ++.. .+...-.+.++..|.|+..|. T Consensus 387 ----~n~~~--------~~~~--------~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 387 ----LNYVF--------LDFL--------EEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred ----ccccc--------cccc--------chhhccccccccCCCCCcCCC Confidence 00111 0100 000000111122233333332 No 21 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=99.30 E-value=3.6e-12 Score=83.33 Aligned_cols=422 Identities=9% Similarity=0.076 Sum_probs=225.4 Q ss_pred CCCCc-c----eEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcce Q lcl|NC_021303. 1 MAATS-L----RVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSC 75 (637) Q Consensus 1 ma~~~-l----r~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~ 75 (637) |.==+ | +. + +|.+++ ... ++.+...+-.+.+++ ..... =-.+..=..+.+.-.+.-+++.+ T Consensus 1 M~~~~r~~~~~~~-~-~r~~~~---~~~-------~~~~~~~~~~~~g~~-~~~~~-v~~~~al~~~~v~~~i~~ia~~i 66 (432) T protein:vir:10 1 MKIVDSVKKFFNF-E-KRQTSQ---VIE-------LNKDDEKLLEWLGIS-PSTIS-VKGKNALKVATVFACIKILSESV 66 (432) T ss_pred CChHHHHHHhcCc-c-ccCccc---ccc-------cCCchHHHHHHhCCC-cCccc-cchhhhhccHHHHHHHHHHHHhh Confidence 43211 1 11 1 122221 011 122233343343222 11110 00111113456666777889999 Q ss_pred eeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccc Q lcl|NC_021303. 76 SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLA 155 (637) Q Consensus 76 Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~ 155 (637) |++.+..-+-+++ |.. .++ .+.+..+.+.=...-+-..++++.++.+|.+-|+.|+.+.-...|. +.+.- T Consensus 67 a~lp~~~~~~~~~-~~~----~~~----~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~~L~ 136 (432) T protein:vir:10 67 SKLPLKIYQEDEY-GIQ----RGT----KHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGK-VQALW 136 (432) T ss_pred ccCceEEEEecCC-cee----ecc----ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EEEEE Confidence 9999887776655 321 122 2456666655455667888999999999999999999987544443 22221 Q ss_pred cccccceeeeHHHhc--cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_021303. 156 APRARWYAVTREEIK--SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKN 233 (637) Q Consensus 156 ~~~~~W~~vt~~Ei~--~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 233 (637) .-...+.-+.+++.. .......+....+|...+|.. +=||++=++.+.....--||+.++...+.-.....+...+ T Consensus 137 ~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~--~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 214 (432) T protein:vir:10 137 PIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKP--EEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINN 214 (432) T ss_pred EEcCceeEEEEcCcccccccceEEEEEecCCeEEEEcc--ccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 111122222222211 111122223344555555544 2355553445556666788988887777666666665555 Q ss_pred HHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_021303. 234 AAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVA 313 (637) Q Consensus 234 a~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP 313 (637) ..+.-..-.|||-+|+.++ ....+.+.+.|.+.- ...++.+. ++| ++ T Consensus 215 ~~~ng~~p~gil~~~~~l~-------------------------~e~~~~~~~~~~~~~-~g~~n~~~-----~~v--l~ 261 (432) T protein:vir:10 215 FYKQGLQVKGLVQYVGDLN-------------------------EDAKKVFRENFESMS-SGLQNSHR-----IAL--MP 261 (432) T ss_pred HHhccCCccEEEEcCCCCC-------------------------HHHHHHHHHHHHHHh-cccccCCc-----cee--cC Confidence 5555455557777765322 012344544443221 12222221 122 23 Q ss_pred hHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHH Q lcl|NC_021303. 314 AEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDI 393 (637) Q Consensus 314 ~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~ 393 (637) ++ -+++.|.+..+ +.--+++|+-.+..+|..+-|||..|=++.++|.-+..+....-++..|.|.+..|+++|++.+ T Consensus 262 ~g--~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kL 338 (432) T protein:vir:10 262 VG--YQFQPISLNMS-DAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKL 338 (432) T ss_pred CC--ceEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 33 35666665432 3334889999999999999999987744456777777777777788899999999999999988 Q ss_pred HHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhc Q lcl|NC_021303. 394 LTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVT 470 (637) Q Consensus 394 Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~ 470 (637) |-..-.. ..|-+.||.+.| ..+|..+.+ ..+...|.+|-.-.|+.+|+..-.+=| +.+. . T Consensus 339 l~~~~~~-----~g~~~~fd~~~l-~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD----~~~~-------~ 401 (432) T protein:vir:10 339 FLDSELD-----KGFYSKFNVDAI-LRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD----RLLV-------N 401 (432) T ss_pred cChhhcC-----CCcEEEeechhh-hcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----eEee-------c Confidence 7543332 335688999988 334444433 337788999999999999987543212 0000 0 Q ss_pred CCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_021303. 471 KNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQR 516 (637) Q Consensus 471 ~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~ 516 (637) -.+.|+ +.-.. ....-++++.+...+..+|+ T Consensus 402 ------~n~~~~--------~~~~~-~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 402 ------GNMLPI--------DMAGQ-AYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred ------ccccch--------hhccc-cccCCCCCCCCCCCCCCCCC Confidence 001111 10000 00111222222212122222 No 22 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=99.30 E-value=3.6e-12 Score=83.33 Aligned_cols=422 Identities=9% Similarity=0.076 Sum_probs=225.4 Q ss_pred CCCCc-c----eEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcce Q lcl|NC_021303. 1 MAATS-L----RVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSC 75 (637) Q Consensus 1 ma~~~-l----r~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~ 75 (637) |.==+ | +. + +|.+++ ... ++.+...+-.+.+++ ..... =-.+..=..+.+.-.+.-+++.+ T Consensus 1 M~~~~r~~~~~~~-~-~r~~~~---~~~-------~~~~~~~~~~~~g~~-~~~~~-v~~~~al~~~~v~~~i~~ia~~i 66 (432) T protein:vir:10 1 MKIVDSVKKFFNF-E-KRQTSQ---VIE-------LNKDDEKLLEWLGIS-PSTIS-VKGKNALKVATVFACIKILSESV 66 (432) T ss_pred CChHHHHHHhcCc-c-ccCccc---ccc-------cCCchHHHHHHhCCC-cCccc-cchhhhhccHHHHHHHHHHHHhh Confidence 43211 1 11 1 122221 011 122233343343222 11110 00111113456666777889999 Q ss_pred eeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccc Q lcl|NC_021303. 76 SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLA 155 (637) Q Consensus 76 Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~ 155 (637) |++.+..-+-+++ |.. .++ .+.+..+.+.=...-+-..++++.++.+|.+-|+.|+.+.-...|. +.+.- T Consensus 67 a~lp~~~~~~~~~-~~~----~~~----~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~~L~ 136 (432) T protein:vir:10 67 SKLPLKIYQEDEY-GIQ----RGT----KHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGK-VQALW 136 (432) T ss_pred ccCceEEEEecCC-cee----ecc----ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EEEEE Confidence 9999887776655 321 122 2456666655455667888999999999999999999987544443 22221 Q ss_pred cccccceeeeHHHhc--cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_021303. 156 APRARWYAVTREEIK--SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKN 233 (637) Q Consensus 156 ~~~~~W~~vt~~Ei~--~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 233 (637) .-...+.-+.+++.. .......+....+|...+|.. +=||++=++.+.....--||+.++...+.-.....+...+ T Consensus 137 ~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~--~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 214 (432) T protein:vir:10 137 PIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKP--EEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINN 214 (432) T ss_pred EEcCceeEEEEcCcccccccceEEEEEecCCeEEEEcc--ccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 111122222222211 111122223344555555544 2355553445556666788988887777666666665555 Q ss_pred HHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_021303. 234 AAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVA 313 (637) Q Consensus 234 a~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP 313 (637) ..+.-..-.|||-+|+.++ ....+.+.+.|.+.- ...++.+. ++| ++ T Consensus 215 ~~~ng~~p~gil~~~~~l~-------------------------~e~~~~~~~~~~~~~-~g~~n~~~-----~~v--l~ 261 (432) T protein:vir:10 215 FYKQGLQVKGLVQYVGDLN-------------------------EDAKKVFRENFESMS-SGLQNSHR-----IAL--MP 261 (432) T ss_pred HHhccCCccEEEEcCCCCC-------------------------HHHHHHHHHHHHHHh-cccccCCc-----cee--cC Confidence 5555455557777765322 012344544443221 12222221 122 23 Q ss_pred hHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHH Q lcl|NC_021303. 314 AEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDI 393 (637) Q Consensus 314 ~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~ 393 (637) ++ -+++.|.+..+ +.--+++|+-.+..+|..+-|||..|=++.++|.-+..+....-++..|.|.+..|+++|++.+ T Consensus 262 ~g--~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kL 338 (432) T protein:vir:10 262 VG--YQFQPISLNMS-DAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKL 338 (432) T ss_pred CC--ceEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 33 35666665432 3334889999999999999999987744456777777777777788899999999999999988 Q ss_pred HHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhc Q lcl|NC_021303. 394 LTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVT 470 (637) Q Consensus 394 Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~ 470 (637) |-..-.. ..|-+.||.+.| ..+|..+.+ ..+...|.+|-.-.|+.+|+..-.+=| +.+. . T Consensus 339 l~~~~~~-----~g~~~~fd~~~l-~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD----~~~~-------~ 401 (432) T protein:vir:10 339 FLDSELD-----KGFYSKFNVDAI-LRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD----RLLV-------N 401 (432) T ss_pred cChhhcC-----CCcEEEeechhh-hcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----eEee-------c Confidence 7543332 335688999988 334444433 337788999999999999987543212 0000 0 Q ss_pred CCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_021303. 471 KNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQR 516 (637) Q Consensus 471 ~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~ 516 (637) -.+.|+ +.-.. ....-++++.+...+..+|+ T Consensus 402 ------~n~~~~--------~~~~~-~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 402 ------GNMLPI--------DMAGQ-AYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred ------ccccch--------hhccc-cccCCCCCCCCCCCCCCCCC Confidence 001111 10000 00111222222212122222 No 23 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=99.30 E-value=3.6e-12 Score=83.33 Aligned_cols=422 Identities=9% Similarity=0.076 Sum_probs=225.4 Q ss_pred CCCCc-c----eEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcce Q lcl|NC_021303. 1 MAATS-L----RVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSC 75 (637) Q Consensus 1 ma~~~-l----r~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~ 75 (637) |.==+ | +. + +|.+++ ... ++.+...+-.+.+++ ..... =-.+..=..+.+.-.+.-+++.+ T Consensus 1 M~~~~r~~~~~~~-~-~r~~~~---~~~-------~~~~~~~~~~~~g~~-~~~~~-v~~~~al~~~~v~~~i~~ia~~i 66 (432) T protein:vir:10 1 MKIVDSVKKFFNF-E-KRQTSQ---VIE-------LNKDDEKLLEWLGIS-PSTIS-VKGKNALKVATVFACIKILSESV 66 (432) T ss_pred CChHHHHHHhcCc-c-ccCccc---ccc-------cCCchHHHHHHhCCC-cCccc-cchhhhhccHHHHHHHHHHHHhh Confidence 43211 1 11 1 122221 011 122233343343222 11110 00111113456666777889999 Q ss_pred eeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccc Q lcl|NC_021303. 76 SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLA 155 (637) Q Consensus 76 Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~ 155 (637) |++.+..-+-+++ |.. .++ .+.+..+.+.=...-+-..++++.++.+|.+-|+.|+.+.-...|. +.+.- T Consensus 67 a~lp~~~~~~~~~-~~~----~~~----~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~~L~ 136 (432) T protein:vir:10 67 SKLPLKIYQEDEY-GIQ----RGT----KHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGK-VQALW 136 (432) T ss_pred ccCceEEEEecCC-cee----ecc----ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EEEEE Confidence 9999887776655 321 122 2456666655455667888999999999999999999987544443 22221 Q ss_pred cccccceeeeHHHhc--cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_021303. 156 APRARWYAVTREEIK--SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKN 233 (637) Q Consensus 156 ~~~~~W~~vt~~Ei~--~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 233 (637) .-...+.-+.+++.. .......+....+|...+|.. +=||++=++.+.....--||+.++...+.-.....+...+ T Consensus 137 ~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~--~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 214 (432) T protein:vir:10 137 PIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKP--EEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINN 214 (432) T ss_pred EEcCceeEEEEcCcccccccceEEEEEecCCeEEEEcc--ccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 111122222222211 111122223344555555544 2355553445556666788988887777666666665555 Q ss_pred HHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_021303. 234 AAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVA 313 (637) Q Consensus 234 a~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP 313 (637) ..+.-..-.|||-+|+.++ ....+.+.+.|.+.- ...++.+. ++| ++ T Consensus 215 ~~~ng~~p~gil~~~~~l~-------------------------~e~~~~~~~~~~~~~-~g~~n~~~-----~~v--l~ 261 (432) T protein:vir:10 215 FYKQGLQVKGLVQYVGDLN-------------------------EDAKKVFRENFESMS-SGLQNSHR-----IAL--MP 261 (432) T ss_pred HHhccCCccEEEEcCCCCC-------------------------HHHHHHHHHHHHHHh-cccccCCc-----cee--cC Confidence 5555455557777765322 012344544443221 12222221 122 23 Q ss_pred hHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHH Q lcl|NC_021303. 314 AEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDI 393 (637) Q Consensus 314 ~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~ 393 (637) ++ -+++.|.+..+ +.--+++|+-.+..+|..+-|||..|=++.++|.-+..+....-++..|.|.+..|+++|++.+ T Consensus 262 ~g--~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kL 338 (432) T protein:vir:10 262 VG--YQFQPISLNMS-DAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKL 338 (432) T ss_pred CC--ceEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 33 35666665432 3334889999999999999999987744456777777777777788899999999999999988 Q ss_pred HHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhc Q lcl|NC_021303. 394 LTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVT 470 (637) Q Consensus 394 Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~ 470 (637) |-..-.. ..|-+.||.+.| ..+|..+.+ ..+...|.+|-.-.|+.+|+..-.+=| +.+. . T Consensus 339 l~~~~~~-----~g~~~~fd~~~l-~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD----~~~~-------~ 401 (432) T protein:vir:10 339 FLDSELD-----KGFYSKFNVDAI-LRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD----RLLV-------N 401 (432) T ss_pred cChhhcC-----CCcEEEeechhh-hcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----eEee-------c Confidence 7543332 335688999988 334444433 337788999999999999987543212 0000 0 Q ss_pred CCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_021303. 471 KNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQR 516 (637) Q Consensus 471 ~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~ 516 (637) -.+.|+ +.-.. ....-++++.+...+..+|+ T Consensus 402 ------~n~~~~--------~~~~~-~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 402 ------GNMLPI--------DMAGQ-AYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred ------ccccch--------hhccc-cccCCCCCCCCCCCCCCCCC Confidence 001111 10000 00111222222212122222 No 24 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=99.29 E-value=1.9e-12 Score=84.84 Aligned_cols=405 Identities=12% Similarity=0.099 Sum_probs=222.1 Q ss_pred ceEEec---CCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHH-HHhhhhhhHhhHhhhhhcceeeeEEE Q lcl|NC_021303. 6 LRVVRR---PKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAW-DFSESIGELSYYISWRANSCSRTTLI 81 (637) Q Consensus 6 lr~vrr---pk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW-~~yd~VgELryyvgWr~~s~Sr~rL~ 81 (637) .=+..+ .|...+..+-......+-+.+++ ..+ .. +|. +.++ .--. ..+ ..+-+.-.+.=+++++|++.+. T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~-~g~-~~~~-~v~~~~al-~~~~v~~ci~~ia~~iA~lp~~ 74 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDS-NFW-EK-FGI-KLNF-SVRGKRAL-KENTVYVCTKIRAESIGKLSLK 74 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcc-hhh-hh-ccc-cCCc-ccchhhhh-ccHHHHHHHHHHHHhhhhCceE Confidence 222222 22222211110000001111110 000 00 110 0000 0000 001 2244555577789999998887 Q ss_pred EeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccc Q lcl|NC_021303. 82 PSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARW 161 (637) Q Consensus 82 aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W 161 (637) .-+ ++ . ++.+ +.+..+.+.=..--+-..++++.++.+|-+-|+.|+.+.-...|. +. .+ T Consensus 75 ~~~---~~-~-----~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~-~~-------~L 133 (422) T protein:vir:13 75 IYK---DK-E-----EYKE----HELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGK-II-------GL 133 (422) T ss_pred EEe---cC-c-----cccc----chHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EE-------EE Confidence 644 11 1 1122 345555554444556778999999999999999999876544443 22 23 Q ss_pred eeeeHHHhcc---------CCCc-eeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_021303. 162 YAVTREEIKS---------KAGE-TAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKI 231 (637) Q Consensus 162 ~~vt~~Ei~~---------k~g~-~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 231 (637) +.|..+.+.. +.+. ...+..++|...+|.. +-+|++-.+.+..-..-.||+..+.+.+.-.....+.. T Consensus 134 ~~i~~~~v~~~~~~~~~~~~~~~~~y~~~~~~g~~~~~~~--~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 211 (422) T protein:vir:13 134 YPINSDNVTKIIDDDNFLSSLSKVWYVVTDKNGKEHKLLP--DEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFI 211 (422) T ss_pred EEECCcceEEEEcCCcceeccceEEEEEEeCCCeEEEEcc--cceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHH Confidence 3333333321 1111 1224566777766665 44566656666666677899999998887777777777 Q ss_pred HHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_021303. 232 KNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVAS 311 (637) Q Consensus 232 ~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~ 311 (637) .+..+.-..-.|||-+|+.++ ....+.+.+.|.+.- ...++.+. ++|+ T Consensus 212 ~~~f~ng~~p~gil~~~~~l~-------------------------~e~~~~~~~~~~~~~-~g~~n~~~-----~~vl- 259 (422) T protein:vir:13 212 NKFFKNGLSIKGIVQYVGDLD-------------------------EKAKKIFKKEFESMS-NGLENAHS-----ISLL- 259 (422) T ss_pred HHHHhccCCccEEEEeCCCCC-------------------------HHHHHHHHHHHHHHh-cCccccCC-----ceec- Confidence 777777667788888876432 013344554444321 11222221 1222 Q ss_pred echHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHh Q lcl|NC_021303. 312 VAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYN 391 (637) Q Consensus 312 vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~ 391 (637) ++. -+++-+.+... +.=-+++|+-.+..+|.-.-|||.-|-+..++|+.+..+....-++..|.|.+..|+++|+. T Consensus 260 -~~g--~~~~~l~~~~~-d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~ 335 (422) T protein:vir:13 260 -PFG--YQFQPISLSMA-DAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKDFYVTTLQSSLTVYEQEIQD 335 (422) T ss_pred -CCC--ceeeeccCChh-HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 332 34454544332 22237899999999999999999655555678888888877777888999999999999999 Q ss_pred HHHHHHHHHhCCChHHeEEeecCcccccCCCCCHH---HHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHH Q lcl|NC_021303. 392 DILTPLLAREGIDPTKYILWYDASGLTSDPDLSDE---AVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADV 468 (637) Q Consensus 392 ~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tde---A~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~ 468 (637) .+|.+.-. . ..|-+.||.+.|. .+|..+. ...+++.|.+|-.-.|+.+|++.-.+-| +.+ T Consensus 336 ~Ll~~~~~----~-~g~~i~fd~~~l~-r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD----~~~------- 398 (422) T protein:vir:13 336 KLFSQYET----L-QDVKAEFNVDTIL-RSDIKTRYEAYRIGIQGGFIEANEARRRENLPPVEGGD----RLL------- 398 (422) T ss_pred hhCChhhh----c-CCceEEeechhhh-cCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC----eee------- Confidence 98765422 1 3477899999883 3343333 3448899999999999999998654422 000 Q ss_pred hcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_021303. 469 VTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQRE 517 (637) Q Consensus 469 v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~E 517 (637) .+ -.++|| + .. ++....+.+.|.+ T Consensus 399 --~~----~n~~~l--------~---------~~--~~~~~~~g~~~g~ 422 (422) T protein:vir:13 399 --VN----GNMIPI--------E---------MA--GEQYKKGGEKGGK 422 (422) T ss_pred --ec----cCccch--------h---------hc--ccccccCCCcCCC Confidence 00 011111 1 00 0011111122222 No 25 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=99.25 E-value=1.4e-12 Score=85.53 Aligned_cols=421 Identities=17% Similarity=0.131 Sum_probs=219.4 Q ss_pred CCCCcceEEecCCCCCcccccchheeh-hccc--cchhhhhhhhcccccccchhhHH--HHHhhhhhhHhhHhhhhhcce Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAA-SQLI--TDPQKQMKTSLMGTARNEWQSEA--WDFSESIGELSYYISWRANSC 75 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAA-s~~~--~~p~~~~k~~~~g~~r~~WQ~eA--W~~yd~VgELryyvgWr~~s~ 75 (637) |=-+-=|++-|.|.+- ..+ ..|+ +|+.- +. ..+|. ..+..+. .+-+-..+-+.-.+.-+++++ T Consensus 1 ~~~~~~~~~~~~~~~~--------~~~~g~~~s~~~~~~-~~-~~~~~--~~~~g~~v~~~~al~~~~v~~ci~~Ia~~i 68 (437) T protein:vir:10 1 MKQGKQRALGRIKSSF--------LKWLGVPISLTDGSF-WS-AWGGM--GSSSGETVTADSALQLSAVWSCVRLIAETI 68 (437) T ss_pred CCcchhhhhhhhHHhh--------hhhcCCcccCCchhH-HH-hhccc--ccCCCceechHhhhccHHHHHHHHHHHHHH Confidence 3322223333332220 000 0111 11111 11 10111 1111111 111223345666788899999 Q ss_pred eeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccc Q lcl|NC_021303. 76 SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLA 155 (637) Q Consensus 76 Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~ 155 (637) |++.|..-+.+.|.++. ..+ .+.+..+.+.=...-+...++++.++.+|-+-|+.|+.+. |.+|.+ T Consensus 69 a~lp~~~~~~~~~g~~~----~~~----~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~g~~----- 134 (437) T protein:vir:10 69 ATLPLNLYQTKPDGTRV----LAK----QHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKL-RSAGVL----- 134 (437) T ss_pred hhCceeEEEEcCCCcee----ecc----ccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEE-ecCCcE----- Confidence 99999888888663221 122 3556666665566778889999999999999999998864 445531 Q ss_pred cccccceeeeHHHhc--c-CCCceeE-EecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_021303. 156 APRARWYAVTREEIK--S-KAGETAE-ISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKI 231 (637) Q Consensus 156 ~~~~~W~~vt~~Ei~--~-k~g~~~~-i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 231 (637) . .++.+..+.+. . ..|...+ +..++|...+|... | ||++=+++. .-..--||+.++...+.-..-..+.. T Consensus 135 --~-~L~~l~p~~v~i~~~~~g~~~y~~~~~~g~~~~~~~~-d-Iih~r~~~~-d~~~G~spi~~~~~~i~~~~~~~~~~ 208 (437) T protein:vir:10 135 --I-GLELMLPQRTTVKRLTSGALQYTYRNVDGTVSTLAED-D-VFHVRGFSL-DGLMGLTPIQYAREVLGNSTAANKTS 208 (437) T ss_pred --E-EEEEEcCcceEEEECCCCeEEEEEEecCceEEEEccc-c-EEEecCcCC-CCcccccHHHHHHHHHHHHHHHHHHH Confidence 1 23444333332 1 2222222 45567776666543 2 444422221 22445678777766665555555555 Q ss_pred HHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_021303. 232 KNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVAS 311 (637) Q Consensus 232 ~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~ 311 (637) .+..+.-..-.|||-+|+.++ ....+.+.+.|.+ .+...+ -+--|+|+. T Consensus 209 ~~~f~ng~~p~gil~~~~~l~-------------------------~e~~~~~~~~~~~----~~~g~~--nag~~~vl~ 257 (437) T protein:vir:10 209 ASVFRNGLRPSGVLSTDQILQ-------------------------KEKRAEIRTDLAE----QFGGAM--QAGKTMVLE 257 (437) T ss_pred HHHHhccCCccEEEEcCCCCC-------------------------HHHHHHHHHHHHH----HhcCcc--ccCcceecc Confidence 555555555667776665322 1133445444433 222111 122244442 Q ss_pred echHHhcccceeecCcchhHH-HHhhHHHHHHHHHhhcCCchhHhhcc-CCcceee--eEEeccCceeEeechhHHHHHH Q lcl|NC_021303. 312 VAAEHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWS--AWAIGDEDVQLHIKPVMDLICQ 387 (637) Q Consensus 312 vP~Ehi~~ikHlkf~~dvtev-aiktR~daI~RlAmglDv~pErLLGl-s~~NHWs--AW~I~dedVrlHI~P~me~ic~ 387 (637) +. -+++-|.+. ..+. -+++|+-.+..+|.-.-|||..| |. .++|-|. ..+....=++.-|.|.+..|++ T Consensus 258 --~g--~~~~~l~~~--~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~e~~~~~f~~~tl~P~~~~ie~ 330 (437) T protein:vir:10 258 --AG--MKYQAITMN--PGDVQLLETRAFNIEEICRWYRVPPFMV-GHSEKSTSWGTGIEQQTLGFLTFTLRPWLTRIEQ 330 (437) T ss_pred --CC--ceEEeccCC--hhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 22 244444443 3333 38999999999999999998766 77 4555554 3566666678889999999999 Q ss_pred HHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHH Q lcl|NC_021303. 388 AIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREF 464 (637) Q Consensus 388 Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~ 464 (637) +|++.+|.+--. .+|.+-||.+.| .++|..+.+ ..+++.|.+|-.-.|+.+|++.-.|=| +-.. T Consensus 331 ~l~~kll~~~e~------~~~~~~fd~~~l-l~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~----~~~~-- 397 (437) T protein:vir:10 331 AARRSLLRPGER------DQFYAEFSVEGL-LRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNA----AVLT-- 397 (437) T ss_pred HHHhhccCcccc------CceEEEEechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCc----ceEe-- Confidence 999998865322 357788999987 445544443 347888999999999999997533211 0000 Q ss_pred HHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021303. 465 AADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEA 529 (637) Q Consensus 465 A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~ 529 (637) +..+ +.|+= ...-..++. .+++.-.+.+.+.|+++-+ .+. T Consensus 398 ----~~~~------~~~~~-----~~~~~~~~~------~~~~~~~~~~~~~~~~~~~----~e~ 437 (437) T protein:vir:10 398 ----VQSA------LLPID-----KLGEHTTAT------AAQDALKAWLYQEEKTRAT----QER 437 (437) T ss_pred ----ecCc------ccchh-----hccCcCCCc------chhccccccCCCCCCCCcc----ccC Confidence 0000 11110 000000000 0000111111122222222 122 No 26 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=99.22 E-value=7.4e-12 Score=81.60 Aligned_cols=403 Identities=13% Similarity=0.047 Sum_probs=214.1 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |-=..| .++..-..| +.+++.-.-.+-..+.+. ..+ -.+.+-.++-+.-.|.-+++++|.+-+ T Consensus 17 ~~~~~l--f~~~~~~~~----------~~~~~~~~~~~~~~~~~~--~~v---s~~~al~~~~v~~cv~~Ia~~iA~lp~ 79 (424) T protein:vir:45 17 VLLDAL--FRSKSLENP----------STPITGDAVDTDGLFRAD--VYV---SPETAMKLAAVYSCIYVLSSSLAQMPL 79 (424) T ss_pred HHHHhh--ccccCCCCC----------ccccchhhhhhhccccCC--cee---chHHhhccHHHHHHHHHHHHHHhhCce Confidence 211111 122111111 111111000000000000 000 001112234455567778899998877 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..=+-+.+ |. .++. .+.+.++++.=...-+-..++++.++.+|.+-|+.|+.+.-...|.+ . . T Consensus 80 ~v~~~~~~-~~----~~~~----~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~-------~-~ 142 (424) T protein:vir:45 80 HVMRRHKG-KV----EPAR----DHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEV-------I-S 142 (424) T ss_pred EEEEecCC-ce----eecc----cchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcE-------E-E Confidence 65554422 11 1122 24566666555556678889999999999999999997654344431 2 2 Q ss_pred ceeeeHHHhc--cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhH Q lcl|NC_021303. 161 WYAVTREEIK--SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSR 238 (637) Q Consensus 161 W~~vt~~Ei~--~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SR 238 (637) ++.+....+. ..+++..+-.-.++..++|... -||++=.+.+ .-..--||+..+.+.+.--.-..+...+..+.- T Consensus 143 L~~l~~~~v~i~~~~~~~~y~~~~~~~~~~~~~~--eVih~r~~~~-d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng 219 (424) T protein:vir:45 143 LDCCMPWETTLMNTGGRYTYGLYNEYGAFAISPD--DMIHIRALGN-NQKMGLSPIMQHAETIGMGMSGQKYTESFFSGN 219 (424) T ss_pred EEEecCceEEEEEcCCeEEEEEEecCceEEECcc--cEEEecCcCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 2223222222 2334433332223334455543 3455534443 234567888877776665555555555555555 Q ss_pred hhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhc Q lcl|NC_021303. 239 VMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLE 318 (637) Q Consensus 239 L~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~ 318 (637) ..-.|||-+|+.++ ....+.+.+.+.+.-+...++.+ -++|+ +++ - T Consensus 220 ~~p~gil~~~~~l~-------------------------~e~~~~~~~~~~~~~~g~~~n~g-----~~~vl--~~g--~ 265 (424) T protein:vir:45 220 ARPAGIVSVKSGLN-------------------------KESWGWLKDQWQKASQALRRQEN-----KTMLL--PAD--L 265 (424) T ss_pred CCccEEEEeCCCCC-------------------------HHHHHHHHHHHHHHhccccccCC-----ceeEc--CCC--c Confidence 55567787776432 01344555555443332222222 22333 333 4 Q ss_pred ccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHH Q lcl|NC_021303. 319 KVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLL 398 (637) Q Consensus 319 ~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L 398 (637) +++.|.+... +.--+++|+-.+..+|...-|||..|=+..++++-++.|....=++.-|.|.+..|+++|+..+|.+.= T Consensus 266 ~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~kLl~~~e 344 (424) T protein:vir:45 266 DYKALTVSPV-DAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMPWVTNWEQELNRRLFTRAE 344 (424) T ss_pred eEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhh Confidence 5666665432 222488999999999999999988765456678877788777778888999999999999998886532 Q ss_pred HHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchh Q lcl|NC_021303. 399 AREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPEL 475 (637) Q Consensus 399 ~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~L 475 (637) .. ..|-+.||.+.| ..+|..+.| ..+...|.+|-.-.|+.+|++.-.+=| . T Consensus 345 ~~-----~g~~i~fd~~~l-lr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~ggD-------------~------- 398 (424) T protein:vir:45 345 LA-----AGYYVRFNLTGL-LRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVEGLD-------------E------- 398 (424) T ss_pred hc-----CCcEEEeechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-------------e------- Confidence 21 347789999998 334444333 337788999999999999998543312 0 Q ss_pred HHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_021303. 476 IAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTED 522 (637) Q Consensus 476 i~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted 522 (637) -+.| ....++.++. +.+..+++.++| T Consensus 399 --~~~~--------------~n~~~~~~~~-----~~~~~~~~~~~~ 424 (424) T protein:vir:45 399 --MLVS--------------VNAANPAGDF-----KPPKNDEGKTNE 424 (424) T ss_pred --eeec--------------cccccccccc-----CCCCCCCCCCCC Confidence 0011 0000010000 000111111111 No 27 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=99.21 E-value=8e-11 Score=75.94 Aligned_cols=525 Identities=14% Similarity=0.092 Sum_probs=244.8 Q ss_pred CCCC--cceEEecCCCC---Ccccccchheehh-----ccccchhhhhhhhcccc----cccchhhHHHHHhh------- Q lcl|NC_021303. 1 MAAT--SLRVVRRPKGS---APAARRRSLTAAS-----QLITDPQKQMKTSLMGT----ARNEWQSEAWDFSE------- 59 (637) Q Consensus 1 ma~~--~lr~vrrpk~~---~p~~~r~~ltAAs-----~~~~~p~~~~k~~~~g~----~r~~WQ~eAW~~yd------- 59 (637) +-|. +| -|-|.- -|----|+|.=-| --+-+-.+++||.+.-. ....|-.+-|++-. T Consensus 27 ~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~es~s~vt 103 (945) T protein:vir:10 27 IKANVDSL---SRGKDYPGFKPLLTYRALAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYSPESLMYLP 103 (945) T ss_pred chhchhhh---hcccCCCCcchhhhhhhhhccceeeeeeeeehhhhHHHhhcccccccccccchhhhhhhccCccceecc Confidence 0000 11 011111 1100111221111 11222333444433221 11223332222110 Q ss_pred -------------------hhhhHhhHhhhhhcceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHH---hccC Q lcl|NC_021303. 60 -------------------SIGELSYYISWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKG---IADG 117 (637) Q Consensus 60 -------------------~VgELryyvgWr~~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~---iAgG 117 (637) ...-+.-.+.-+++++|.+.+-.-+-+.|+..+- .. ......+.+..+.+. .-.+ T Consensus 104 sls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsLPlklYrr~edG~~~~-~~--kk~~~~hpL~~LL~rPNp~mT~ 180 (945) T protein:vir:10 104 SISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSKELEIYKHIEDKHVNY-YL--KRIRDARNILEFLERPDPYFSE 180 (945) T ss_pred cccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccCceEEEEecccCcccc-cc--cccccchHHHHHHhCCCcccCh Confidence 0122344566688999999887776665633332 21 122334555566542 1111 Q ss_pred cccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeeeHHHhcc---CCCceeE--EecCCCCcccccCC Q lcl|NC_021303. 118 PLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIKS---KAGETAE--ISLPDGKTHEFNRD 192 (637) Q Consensus 118 ~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~~---k~g~~~~--i~lPdG~~he~~~~ 192 (637) .-.-..+++.++.++-+-|..|+.+.-..+|. + -.++.+...-+.. ..|+... +..-+|........ T Consensus 181 ~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~-------i-i~L~pLdPs~Vti~~ddDG~~~y~Yv~~idG~~~~~v~a 252 (945) T protein:vir:10 181 VNSWEYLLGMVLDDILTIDRGAIVKIRDEQGN-------L-VAITPVDGTTIKPILSEDTGIVVGYVQEVDGAIVAHFDK 252 (945) T ss_pred hHHHHHHHHHHHHHHhhcCCeEEEEEECCCCc-------E-EEEEEECCcceEEEEcCCCcEEEEEEEecCCceEEEecC Confidence 22223489999999999999999876444443 1 2455555554432 2232222 22224544433345 Q ss_pred CceEEEEecCCccccc--CCccchhhhhHHHHHHHhhhHHHHHHH-HhHhhcCceeeecccCCCCCcccccccccccCCC Q lcl|NC_021303. 193 LDSLVRIWNPRPRKAS--QATSPVRACLETLREIERTTRKIKNAA-KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPG 269 (637) Q Consensus 193 ~d~l~RvW~P~prra~--eaDSPvra~l~~LrEI~rttk~I~na~-~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg 269 (637) .|.++++.+|++..-. .--||+.++...+.--....+.-.+.. ++..+-.|||-++.+....... . T Consensus 253 ~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~----~------- 321 (945) T protein:vir:10 253 RDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDI----Y------- 321 (945) T ss_pred CceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCcccccccc----c------- Confidence 5777778888775433 345888888776655555544443332 4555667888888754332211 0 Q ss_pred cccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcC Q lcl|NC_021303. 270 APVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLD 349 (637) Q Consensus 270 ~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglD 349 (637) ...+....+.|.+.+ +.++.-.+ +-.|+|+ ++. -+++.|.+... +.--+++|+..+.++|...- T Consensus 322 ----~~LseEq~erlKe~w----ee~~sG~N---nG~piVL--deG--mef~pLs~s~~-DaQfLEsrkfs~eeIArAFG 385 (945) T protein:vir:10 322 ----PQLSREQLESIQRQL----QAIMMGDY---TQVPILS--GGK--FTWIDFKGKRR-DMQFKELAEFVARKICAVYQ 385 (945) T ss_pred ----cccCHHHHHHHHHHH----HHHhCCcc---cccceec--CCC--ceEEEccCChh-HHHHHHHHHHHHHHHHHHhC Confidence 001222344444443 33332222 2246654 332 35666665443 22347999999999999999 Q ss_pred CchhHhhcc-CCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCC-CCCHHH Q lcl|NC_021303. 350 VSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDP-DLSDEA 427 (637) Q Consensus 350 v~pErLLGl-s~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dP-D~tdeA 427 (637) |||..| |. +++|.-++.|....-++-.|.|.+..|+++|++.+++.. +...|-+.||...+. |+ .+.+-. T Consensus 386 VPP~lL-G~~e~st~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~------eg~~i~fdFd~ldl~-D~ksraEal 457 (945) T protein:vir:10 386 VSPQDV-GILEGSNKATAEVMASLTKAKGLEPLMATISKGFDEVVSEFR------NEKDIKLWFKEDDLE-KERDWWNII 457 (945) T ss_pred CCHHHc-ccCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc------cCceeEEEecchhcc-CHHHHHHHH Confidence 998876 76 567888888888888888899999999999997654322 125678899988765 33 333333 Q ss_pred HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCC Q lcl|NC_021303. 428 VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDD 507 (637) Q Consensus 428 ~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d 507 (637) ..++..|.+|-.-.|+.+|++.-.|=| +-+ +.. ..+.|.-....+.-.-.+++.+.+.++ T Consensus 458 ~kli~sGiLTiNEvRe~lGLpPIeGGD----~ll-------i~~-----nn~~P~d~~~ka~~ga~p~q~aq~~~d---- 517 (945) T protein:vir:10 458 QGQLNTGFRSINEARMEKGLEPVPWGD----VPF-------SGL-----RNWKPEDEQAKAQQGAMPPQLAQAMAD---- 517 (945) T ss_pred HHHHhCCCcCHHHHHHHhCCCCCCCcc----eee-------ecc-----ccccccccccccccCCCCcccccCCCC---- Confidence 456788999999999999998654322 000 000 011121110000001011111111111 Q ss_pred CCCCCCCCCCccCCCCCCCcccCCCCcchHHHHHHHHHHHHHHHHhcccccCCCchhhhhHhhcCchhhhhhhcCCCCHH Q lcl|NC_021303. 508 EDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRFKVNDAALKTKLRDVPAHEYHRVLPPVRSS 587 (637) Q Consensus 508 ~~~~a~~g~EPdted~~~~~~~a~~~~~a~~~aa~~llV~rALelAGkRr~~~~~~~~~~rlr~ip~h~~h~~~~PV~~~ 587 (637) +....|.+.+++++.+.... ...+...+-+..+|=+.|- | T Consensus 518 --qp~~kGGe~dEns~~psE~k------da~~e~~~~l~~~~~~~a~--------------------------------e 557 (945) T protein:vir:10 518 --QPSQQGGGVDENSSVPSEQK------NAGLEVLRNLFKSLDANAS--------------------------------E 557 (945) T ss_pred --CCCCCCCCCCCCCCCCCccc------chHHHHHHHHHHHHHHHHH--------------------------------H Confidence 11222222222222211111 1111222233333322221 1 Q ss_pred HHHHHHh--ccccccc-----HHHHHHhCCCHHHHHHHHHH------------HHHHHHHhhhhccccC Q lcl|NC_021303. 588 EIPRLIA--GWDTALE-----DEVVASLGLDNEKLRNAVLA------------TVRRQLTQPLIEGEVV 637 (637) Q Consensus 588 ~v~rLi~--GWd~~ld-----~~~~a~lG~Dp~~lr~~v~~------------~v~~~lt~~vvd~~v~ 637 (637) .+..||. |-|-.+. -+++..-|+| .+-..|+. .+.-. -..+|..|.+ T Consensus 558 ~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 623 (945) T protein:vir:10 558 NLKQVIELTNDDNYLKEKELLTRVLKSVGLD--SVSEFIENNSQTDVEVSAKDILSFK-YNSLVEDETI 623 (945) T ss_pred HHHHHHhhcCCCchhHHHHHHHHHHHHhhhH--HHHHHHhcCCccceeechhhhhhhh-hhhhccccce Confidence 2222221 1111111 0344444554 11111100 00000 0112222221 No 28 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=99.20 E-value=1e-11 Score=80.89 Aligned_cols=416 Identities=13% Similarity=0.067 Sum_probs=206.7 Q ss_pred ceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEEEEeee Q lcl|NC_021303. 6 LRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTLIPSAI 85 (637) Q Consensus 6 lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~asei 85 (637) .++.|+.-..-. +.-.....+++-. .+..+| |-. +..+ .++-+.-.|.-+++.++++.|..-+- T Consensus 1 m~~~~~~~~~~~-------~~~~~~~~~~~~~-~~~~g~-----~~~--~~Al-~~~~V~~cv~~ia~~iA~lp~~~~~~ 64 (417) T protein:vir:38 1 MKLFRGLATEVD-------PHWADHLLDSGVI-PSFRGG-----YLG--ISAL-RNSDVLTAVSIVSGDVSRFPLVITDS 64 (417) T ss_pred CccccccccCCC-------ccchhhhcccccc-cccCCc-----eec--hhhc-ccHHHHHHHHHHHHhhccCeeEEEEc Confidence 333332111100 0011111122111 111111 111 1122 23455556788899999999987654 Q ss_pred ccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeee Q lcl|NC_021303. 86 DPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVT 165 (637) Q Consensus 86 D~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt 165 (637) +.| ++. + .+.+..+++.--.--+...++++.++.+|-+-|++|+.|.-...|+.+.....=..++..+. T Consensus 65 ~~~------~~~-~----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~ 133 (417) T protein:vir:38 65 STD------EVI-D----LANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVD 133 (417) T ss_pred CCc------cee-c----cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEE Confidence 433 111 1 13355555544455677888999999999999999998764433432222211111222221 Q ss_pred HHHhccCCCcee-EEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCce Q lcl|NC_021303. 166 REEIKSKAGETA-EISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGV 244 (637) Q Consensus 166 ~~Ei~~k~g~~~-~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGv 244 (637) . ...++-. .+..++|......+..|++ |+=.+ +..-..--||+.++...+.--.-..+...+..+.=....|| T Consensus 134 ~----~~~~~~~y~~~~~~~~~~~~~~~~dvi-H~r~~-~~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~i 207 (417) T protein:vir:38 134 T----SDPDNIIYRFTPYNSSMQKVCGFEDVI-HWKFF-SYDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSII 207 (417) T ss_pred E----cCCCeEEEEEEEcCCcEEEEecCcceE-EecCC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEE Confidence 1 1223333 2455666554333334443 33222 11223355776665554443333333333333333334445 Q ss_pred eeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceee Q lcl|NC_021303. 245 LFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIK 324 (637) Q Consensus 245 lfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlk 324 (637) |-.|+.++ ....+.+++.|.+. +...++ --|+|+ ++. .+++.|. T Consensus 208 l~~~~~l~-------------------------~e~~~~~~~~~~~~----~~g~n~---g~~~vl--~~g--~~~~~l~ 251 (417) T protein:vir:38 208 KAKESRLS-------------------------AEARQKIREDFERA----QAGADA---GSPIIV--DAT--MDYQPLE 251 (417) T ss_pred EEeCCCCC-------------------------HHHHHHHHHHHHHH----hccccc---CCceec--cCC--ceEEEcc Confidence 54443221 12445555555332 222222 244554 332 4677776 Q ss_pred cCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCC Q lcl|NC_021303. 325 FGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGID 404 (637) Q Consensus 325 f~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiD 404 (637) +..+-.. -+++|+-.+..+|.-.-|||..| |- +++.-++-+....-++.-|.|.+..|+++|+..+|.+... T Consensus 252 ~~~~d~q-~le~~~~~~~~Ia~~fgVPp~~l-g~-~~~~s~~e~~~~~~~~~tl~P~~~~ie~~l~~~Ll~~~~~----- 323 (417) T protein:vir:38 252 VDTNVLN-LINSNNYSTAQIAKALRVPAYRL-AQ-NSPNQSVKQLADDYIRNDLPFYFEPITSEFELKLLDDAQR----- 323 (417) T ss_pred CCHHHHH-HHHHHHhhHHHHHHHhCCCHHHh-CC-CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhcChhhc----- Confidence 6554333 37899999999999999999887 53 3334455566666677779999999999999999865432 Q ss_pred hHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhc Q lcl|NC_021303. 405 PTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLS 484 (637) Q Consensus 405 p~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~ 484 (637) .+|.+-||.+.|.. -...+-..+++.|.+|-.-.|+++|++.-.+-+.+. - .+..| +.|+- T Consensus 324 -~~~~~~fd~~~l~~--~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~--~-------~~~~n------~~~~d- 384 (417) T protein:vir:38 324 -HQYCIGFDTKSVNG--LPIADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDR--I-------QSTLN------TVFLD- 384 (417) T ss_pred -ccceEEechhhhhH--HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCe--e-------eeccc------ccccc- Confidence 45788899887631 112334567899999999999999997533211000 0 00000 11111 Q ss_pred cccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCC Q lcl|NC_021303. 485 SQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERS 525 (637) Q Consensus 485 ~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~ 525 (637) ..+-.+.+ ..++. ...++.++++++.+.+|+-+ T Consensus 385 ----~~~~~~~~---~~~~~-kgg~~~~~~~~~~~~~~~~~ 417 (417) T protein:vir:38 385 ----QKEAYQAE---HAAEL-KGGDTNAKGNQNGSGTNANS 417 (417) T ss_pred ----cccccccc---ccccc-CCCCCCCCCCCcCCCCcCCC Confidence 00000000 00000 00111111223333333322 No 29 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=99.19 E-value=3.7e-11 Score=77.78 Aligned_cols=437 Identities=12% Similarity=0.093 Sum_probs=214.3 Q ss_pred cceEEecCCCCCcccccchheeh-hccccchhhhhhhhcccccccch---hhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 5 SLRVVRRPKGSAPAARRRSLTAA-SQLITDPQKQMKTSLMGTARNEW---QSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 5 ~lr~vrrpk~~~p~~~r~~ltAA-s~~~~~p~~~~k~~~~g~~r~~W---Q~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) -+=..||-|......+ .+..+ ...+- +.... ..+|...++= ...|. .++-+.-.|.=++++||.+-+ T Consensus 1 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~--~~~~~-~~~g~~~~g~~v~~~~al----~~~~V~~~v~~Ia~~iA~lp~ 71 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGR--DVREAGWTSLF--QAVAE-PFAGAWQQGVKADPEAVL----SFHAVFACISLISQDIAKMRL 71 (454) T ss_pred CCCccccCcccccccc--cccchhhhhhh--hhhhh-hhcchhhcCcccChHHhh----ccHHHHHHHHHHHHhhccCce Confidence 3334455444433221 22211 11110 00000 0011111100 01122 234456677889999999988 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..=+-+.| |... . .++ +.+..+- .=-.--+-..++++.++.+|-+-|+.|+.+.-...|. +. . T Consensus 72 ~~~~~~~~-g~~~---~-~~~---~~~~~L~-~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~-------~~-~ 134 (454) T protein:vir:93 72 RLMQTDAQ-GIRR---E-TRR---GDIARLC-RRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQ-------IK-E 134 (454) T ss_pred EEEEeccC-Cccc---h-hhh---HHHHHHH-hcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCc-------EE-E Confidence 88666655 2221 1 111 1222221 2223456778899999999999999999877554443 11 3 Q ss_pred ceeeeHHHhcc---CCCceeEEecCC-----CCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_021303. 161 WYAVTREEIKS---KAGETAEISLPD-----GKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (637) Q Consensus 161 W~~vt~~Ei~~---k~g~~~~i~lPd-----G~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 232 (637) ++.|....+.. .+|.-.+....+ |...+|. .+=||++=...+..-..--||+..+...+.-..-..+... T Consensus 135 L~~i~~~~v~v~~~~~g~~~y~~~~~~~~~~~~~~~~~--~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~ 212 (454) T protein:vir:93 135 LRILDWNRVEPLVADDGEVFYRITPDRNCGITEAVTVP--AREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENST 212 (454) T ss_pred EEEEcCcceEEEEcCCCcEEEEEEeccccccceeEEec--CcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHH Confidence 44444444431 222222211111 1122222 2335555223333444566777766666554444444443 Q ss_pred HHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_021303. 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (637) Q Consensus 233 na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~v 312 (637) +..+.-..-.|||-+|+.++ ....+.|.+.|.+.- .+ .+.++ ++|+ T Consensus 213 ~~f~ng~~p~gil~~~~~l~-------------------------~e~~~~~~~~~~~~~-~g-~n~g~-----~~vl-- 258 (454) T protein:vir:93 213 SFFRNGGRPSGVIEIPGSIT-------------------------EENAKKLKSNWDSGY-TG-ENAGK-----TAIL-- 258 (454) T ss_pred HHHhccCCccEEEecCCCCC-------------------------HHHHHHHHHHHHHHh-cc-cccCC-----ceec-- Confidence 33333334446777766332 113445555543322 11 22222 2233 Q ss_pred chHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeechhHHHHHHHHHh Q lcl|NC_021303. 313 AAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYN 391 (637) Q Consensus 313 P~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~ 391 (637) ++. -+++.|.+... +.--+++|+-.+..+|...-|||.. ||. .++|.+++.|....-++..|.|.+..|+++|+. T Consensus 259 ~~g--~~~~~l~~~~~-d~q~le~~~~~~~~Ia~~fgVPp~~-lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~ln~ 334 (454) T protein:vir:93 259 SNG--AKYNPTTFSPV-DSQTVEQLKMTAEIVCSVFRVPAYK-IGVGQPPSSDNVEALEQQYYSQCLQTLIESIELLLDE 334 (454) T ss_pred cCC--ceEEEcccChh-HHHHHHHHHHHHHHHHHHhCCCHHH-cCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333 35666665443 2334789999999999999999975 566 468888888888888889999999999999998 Q ss_pred HHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHH---HHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHH Q lcl|NC_021303. 392 DILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAV---EAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADV 468 (637) Q Consensus 392 ~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~---~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~ 468 (637) .++.+ ..|.+.||.+.| ...|..+.+. .+++.|.+|-.-.|+.+|++.-.|=| +.+ T Consensus 335 ~L~~~---------~~~~~~f~~~~l-l~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD----~~~------- 393 (454) T protein:vir:93 335 ALETG---------ENESTEFDVTTL-LRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGD----ALY------- 393 (454) T ss_pred hhcCC---------CCcEEEeechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----eee------- Confidence 77642 236689999988 3345544443 47888999999999999997543311 000 Q ss_pred hcCCchhHHHHHhhhc-cccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccCCCCcchHHHHHHHHHHH Q lcl|NC_021303. 469 VTKNPELIAMYAPLLS-SQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVN 547 (637) Q Consensus 469 v~~~P~Li~~~apLl~-~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~a~~~~~a~~~aa~~llV~ 547 (637) +..+. .|+-. +..+.-+-|..... .++..++. ....++++..+|++ .+ +.+-+.. T Consensus 394 ~~~~~------~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~d~~~~~~e~~-----~d----------~~~~~~~ 449 (454) T protein:vir:93 394 LQQQN------YSLEALSRRDAREDPFASSG-KTASVPQA--VAASDGNKAITETE-----HD----------AVKAMFR 449 (454) T ss_pred eccCc------cchHhhhccCcccCCCCCCc-cCCCCCCC--CCCCCCCCCccCCc-----cc----------hhhhhhh Confidence 01110 01100 00000110100000 00000000 00111122222211 11 1122222 Q ss_pred HHHHHhcc Q lcl|NC_021303. 548 RALDLAGK 555 (637) Q Consensus 548 rALelAGk 555 (637) +=+ +| T Consensus 450 ~~~---~~ 454 (454) T protein:vir:93 450 GIL---KK 454 (454) T ss_pred hhh---cC Confidence 222 22 No 30 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=99.18 E-value=1.2e-11 Score=80.49 Aligned_cols=400 Identities=15% Similarity=0.100 Sum_probs=210.7 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.==+. .|+.+..+.. +... .....|-+. .+......-.+.+ + ..+-++-.+.-+++.||++.+ T Consensus 1 Mg~f~~--~~~~~~~~~~---~~~~-------~~~~~~~~~-~~~~~~~~~~~~~--~-~~~~v~~~i~~ia~~ia~~~~ 64 (406) T protein:vir:95 1 MGLFDR--WRRTKRKSKI---RADT-------GYVGLFMSG-EDVSFLVPGYVRL--S-DNPEVRMAVHKIADLISSMTI 64 (406) T ss_pred Ccchhh--hccccccccc---cccc-------hhhhhhccC-cccCccccCHHHH--h-hcHHHHHHHHHHHHhhccCce Confidence 543332 2333322211 1111 111111110 0000011111111 2 347778888889999999999 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEE-eecCCccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVL-IRQEKDPVTGLAAPRA 159 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il-~r~~~~~~~~~~~~~~ 159 (637) .+-+.+++ |.- . .+ +....+...=..--+...++++.++.+|-+-|+++..++ .|..++. +.+ T Consensus 65 ~~~~~~~~-~~~----~-~~----~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~------~~~ 128 (406) T protein:vir:95 65 YLMQNTED-GDI----R-IR----NELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGL------IDE 128 (406) T ss_pred EEEEecCC-cce----e-ec----chHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCc------EEE Confidence 99888865 221 1 11 233444444455557888999999999888887755443 3332221 122 Q ss_pred cceeeeHHHhcc-CCCceeEEecCCCCcccccCCCceEEEE-ecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_021303. 160 RWYAVTREEIKS-KAGETAEISLPDGKTHEFNRDLDSLVRI-WNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKS 237 (637) Q Consensus 160 ~W~~vt~~Ei~~-k~g~~~~i~lPdG~~he~~~~~d~l~Rv-W~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 237 (637) -|+ |...-+.. ..+++..+.. +| .+|.. +-||++ .++++.+...--||+..+.+.+.-.....+...+..+. T Consensus 129 l~~-i~~~~v~~~~~~~~~~~~~-~~--~~~~~--~evih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~n 202 (406) T protein:vir:95 129 LVP-LTPSKVNFLDTPDGYQVLY-GG--QTFNY--DEVLHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSG 202 (406) T ss_pred EEE-EcCceeEEEEcCCeEEEEe-cc--EEEch--hHEEEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 222 22222221 1111111111 22 12322 234443 57788777778899999999888888888888888888 Q ss_pred HhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_021303. 238 RVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHL 317 (637) Q Consensus 238 RL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi 317 (637) -....|||-+|+.++-. ..+.+.+-+ ...+.. +--+--++|+...++.. T Consensus 203 g~~~~~il~~~~~l~~e-------------------------~~~~~~~~~----~~~~~g--~~n~~~~~v~~~~~~~~ 251 (406) T protein:vir:95 203 KYMPSLIVKVDAATAEL-------------------------SSEEGRNAV----FKKYLQ--ATEAGQPWIIPAELLEV 251 (406) T ss_pred cCCcceEEEeCCCCCHH-------------------------HHHHHHHHH----HHHhcc--ccccCCceeecCCCccc Confidence 88888898887743211 122232222 222221 11223344444455433 Q ss_pred cccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHH Q lcl|NC_021303. 318 EKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPL 397 (637) Q Consensus 318 ~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~ 397 (637) ..++.+.. .+.--+++|+..+.++|..+-|||. |||.++.+- +....-++-.|.|.+..|+++|+..+|. T Consensus 252 ~~~~~~~~---~d~q~~e~~~~~~~~Ia~~fgVp~~-~lg~~~~~~----~~~~~~~~~~l~P~~~~ie~~l~~~l~~-- 321 (406) T protein:vir:95 252 EQVKPLSL---KDIAINEAVELDKRTVAGMFGVPAF-LLGIGEFNR----DEYNNFINSTILPIAKGIEQELTRKLLI-- 321 (406) T ss_pred cccccCCh---hHHHHHHHHHHHHHHHHHHhCCCHH-HcCCCCchH----HHHHHHHHHHHHHHHHHHHHHHHHhcCC-- Confidence 33333322 2333568999999999999999875 558643221 1112245677999999999999988764 Q ss_pred HHHhCCChHHeEEeecCcccccCCCC---CHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCch Q lcl|NC_021303. 398 LAREGIDPTKYILWYDASGLTSDPDL---SDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPE 474 (637) Q Consensus 398 L~~eGiDp~kYvvw~DaS~Lt~dPD~---tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~ 474 (637) +..|-++||.+.| ...|. .+.+..++..|.+|-.-.|+.+|+..-.+=| +.+ T Consensus 322 -------~~~~~~~fd~~~l-~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~gd----~~~------------- 376 (406) T protein:vir:95 322 -------SPDLYFKFNPRSL-YAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEGLS----ELV------------- 376 (406) T ss_pred -------CCCcEEEeechhh-hcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc----eee------------- Confidence 2447899999998 44554 4445568889999999999999998544312 000 Q ss_pred hHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_021303. 475 LIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTE 521 (637) Q Consensus 475 Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdte 521 (637) .| ...+.++-.........|+.+. ++-.+| T Consensus 377 -----~~---~n~~~~~~~~~~~~~k~g~~~~---------~~~~~~ 406 (406) T protein:vir:95 377 -----IL---ENYIPLDKIGDQSKLKGGDNSG---------ADGQTD 406 (406) T ss_pred -----ec---cCccchhhcccccccCCCCCCC---------CCCCCC Confidence 01 0111111000000011111110 000111 No 31 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=99.18 E-value=2.6e-11 Score=78.57 Aligned_cols=403 Identities=15% Similarity=0.131 Sum_probs=214.1 Q ss_pred CCCCcceEEecCCCCCccccc--c-hheehhccccchhhhhhhhcccccccchhhHHHHH-------------hhhhhhH Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARR--R-SLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDF-------------SESIGEL 64 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r--~-~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~-------------yd~VgEL 64 (637) |.= +...||.|....+.+. . +..|++.....+++.|.+. .+.....|-. .=.++-+ T Consensus 1 Mgl--~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~------~~~~~~~~~~~~~~~g~~v~~~~al~~~~V 72 (431) T protein:vir:10 1 MGL--FDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGL------DDPRLKEYIRRGELNGGTGRETRALRNMAV 72 (431) T ss_pred Ccc--hhhhhcCcccccccccccccccccccccccccccccccc------cchHHHHhhccCccCcceechhhhhccHHH Confidence 542 2223332221111111 1 1222222222233333221 1111111210 0012345 Q ss_pred hhHhhhhhcceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEe Q lcl|NC_021303. 65 SYYISWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLI 144 (637) Q Consensus 65 ryyvgWr~~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~ 144 (637) .-.+.-+++++|.+.+..-+-| ++++ + .. .+.+..+++.=-.--+...++++.++.+|.+-|++|+.|. T Consensus 73 ~~ci~~Ia~~iA~lp~~v~~~~-~~~~----~-~~----~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~- 141 (431) T protein:vir:10 73 LRCVTLISGTIGMLPMNLISSD-DSKQ----V-LT----DDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIV- 141 (431) T ss_pred HHHHHHHHHhhccCceEEEEec-Ccee----e-ec----cchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEE- Confidence 5556778999999988775544 3221 1 12 2456666665555568888999999999999999999875 Q ss_pred ecCCccccccccccccceeeeHHHhcc--C-CCceeE-EecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHH Q lcl|NC_021303. 145 RQEKDPVTGLAAPRARWYAVTREEIKS--K-AGETAE-ISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLET 220 (637) Q Consensus 145 r~~~~~~~~~~~~~~~W~~vt~~Ei~~--k-~g~~~~-i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~ 220 (637) |.+|.+ . ..+.+....+.. . .+.-.+ +..++|...+|... | |||+=++++ .-..--||+..+... T Consensus 142 r~~g~~-------~-~L~pl~~~~v~~~~~~~~~~~y~~~~~~g~~~~~~~~-d-ViHir~~~~-dg~~G~spi~~~~~~ 210 (431) T protein:vir:10 142 WSGNRP-------I-RLIPMDRGSAKGRLTSTWQIVYDYTTPTGDKIELPAR-E-VFHLRDLSI-DGVSGVSRVKLSGNA 210 (431) T ss_pred EcCCce-------E-EEEEEcCceeEEEEcCCCeEEEEEEeCCceEEEEchh-h-EEEecCcCC-CCcccccHHHHHHHH Confidence 434431 1 233344443331 2 222222 56677877666542 3 344322322 123456777777666 Q ss_pred HHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCcc Q lcl|NC_021303. 221 LREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDEN 300 (637) Q Consensus 221 LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~ 300 (637) +.=..-..+...+..+.-..-.|||-+|+.++ ....+.+.+.+.+.-. ..++.+ T Consensus 211 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls-------------------------~e~~~~~~~~~~~~~~-g~~n~g 264 (431) T protein:vir:10 211 LELAEQAERAASRTFRTGVMAGGAIEVPKELS-------------------------DNAYGRMKASVQENHT-GSENAG 264 (431) T ss_pred HHHHHHHHHHHHHHHhccCCccEEEecCCCCC-------------------------HHHHHHHHHHHHHHhc-CccccC Confidence 65555555555555555555567776665322 1134455555433221 111212 Q ss_pred ccccccceeEeechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeec Q lcl|NC_021303. 301 SQAAYIPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIK 379 (637) Q Consensus 301 S~AA~vPiva~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~ 379 (637) . |+|+ ++. -+++-|.+..+ +.--+++|+-.+..+|.-.-|||.-| |. .++++-+..|....=++.-|. T Consensus 265 ~-----~~vl--~~g--~~~~~l~~~~~-d~q~le~r~~~~~~Ia~~fgVPp~~l-g~~~~~t~sn~eq~~~~f~~~tL~ 333 (431) T protein:vir:10 265 S-----WMLL--EEG--ATAKQFSNTAA-SAQQIENRNHQIEEVARMYGVPRPLL-MMDDTSWGSGIEQLAIFFIQYGLS 333 (431) T ss_pred C-----ceec--CCC--ceEEEccCChh-HHHHHHHHHHhHHHHHHHhCCCHHHh-CCCCCCccccHHHHHHHHHHHHHH Confidence 1 2222 333 46666666432 33346899999999999999988755 55 567777777777788888899 Q ss_pred hhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHH---HHHhcC----CcCHHHHHHHhcCccccC Q lcl|NC_021303. 380 PVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAV---EAHDRG----AITSAALRRLLNVGEDSG 452 (637) Q Consensus 380 P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~---~a~drG----aIt~eAlrr~lgl~~d~~ 452 (637) |.+..|+++|++.+|-+- ++ ..|-+.||.+.| ...|..+.+. .++..| .+|-.-.|+.+|++.-.+ T Consensus 334 P~~~~ie~~ln~~Ll~~~---~~---~~~~~~fd~~~l-lr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~ 406 (431) T protein:vir:10 334 HWFVSWEQAAARAFLPEK---ML---GQRQFKFNEGAL-LRGTLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADD 406 (431) T ss_pred HHHHHHHHHHHhhccChh---hc---CCceEEEechhh-hccCHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCC Confidence 999999999998887432 22 357789999998 4456554443 345444 599999999999975433 Q ss_pred CCCCchHHHHHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021303. 453 YDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTE 527 (637) Q Consensus 453 yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~ 527 (637) -+.+ ++-.|....+.++ +++|+ ++. T Consensus 407 ~~gD----------------------------------~~~~p~n~~~~~~-----------~~~~p-----~~~ 431 (431) T protein:vir:10 407 PVAD----------------------------------QLRNPMTQKQKGS-----------GDEPP-----ATT 431 (431) T ss_pred cccc----------------------------------ceecccccccCCC-----------CCCCC-----CCC Confidence 1111 0011111111111 01110 000 No 32 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=99.17 E-value=3.9e-11 Score=77.66 Aligned_cols=511 Identities=17% Similarity=0.195 Sum_probs=228.5 Q ss_pred ccchhhhhhhhcccccccchhhHHH-----HHhhhhhhHhhHhhhhhcceeeeEEEEeeeccccCCCCCcccCCCCcccc Q lcl|NC_021303. 31 ITDPQKQMKTSLMGTARNEWQSEAW-----DFSESIGELSYYISWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQ 105 (637) Q Consensus 31 ~~~p~~~~k~~~~g~~r~~WQ~eAW-----~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~ 105 (637) ++ .+-.+.+| .+.|-..-+ +.|-.++-+.--|.-+++++|.+.|..- +.| | .+. ++ + T Consensus 1 ~~----~~~~~~g~--~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~--~~~-~----~~~-~~----~ 62 (723) T protein:vir:94 1 MT----TFPSGAGG--WNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVR--GPD-G----ELD-EL----H 62 (723) T ss_pred Cc----ccccCCCc--cccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEE--cCC-C----ccc-hh----h Confidence 11 11111111 122321111 1122345566677889999999888663 323 2 222 22 3 Q ss_pred hHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeeeH-HHhc-cCCCc-------e Q lcl|NC_021303. 106 IVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVTR-EEIK-SKAGE-------T 176 (637) Q Consensus 106 rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~-~Ei~-~k~g~-------~ 176 (637) .+.++++.=-.--+...++.+.++.+|..-|++|+.+.-...+ ....|.+-|+.-.+ -.+. .+.+. . T Consensus 63 ~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~----~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~ 138 (723) T protein:vir:94 63 PLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRT----PAGVPDEIWYVYDRVTTIVATRAADAVPQAQII 138 (723) T ss_pred HHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCc----cccceeEEEEecCcceEEeecCCCccceeeeee Confidence 4555555434455677889999999999999999986543111 11223333432110 0111 11111 1 Q ss_pred -eEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCc-----eeeeccc Q lcl|NC_021303. 177 -AEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNG-----VLFVPAE 250 (637) Q Consensus 177 -~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnG-----vlfvPqe 250 (637) -.+...+|...+|... | ||++=.++|..-..--||+..+... +.+...+.+.. .++..|| ||-.|+ T Consensus 139 ~y~~~~~~G~~~~~~~~-d-IiHir~~~~~dg~~G~Spi~~a~~~----i~~~~aa~~~~-~~~f~NG~~p~giL~~~~- 210 (723) T protein:vir:94 139 GYVIERTDGVRVPVLAD-E-MLWLRFSDPYDPLAVMAPWKAARAA----VDADFYAATWQ-RQSFKNGARPGGVVNLGD- 210 (723) T ss_pred EEEEEecCceeEEeccc-c-eEEecCCCCCCCcccccHHHHHHHH----HHHHHHHHHHH-HHHHhcCCCcceEEEcCC- Confidence 1134567766665443 2 4444333444444566777665544 44444444433 3444554 332221 Q ss_pred CCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHh----cccceeecC Q lcl|NC_021303. 251 MSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHL----EKVQHIKFG 326 (637) Q Consensus 251 ~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi----~~ikHlkf~ 326 (637) . +....+.+.+-|. ..+. ++.-+--|+|+...+... +.++...++ T Consensus 211 -----l--------------------~~e~~~~~~~~~~----~~~~--G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~ 259 (723) T protein:vir:94 211 -----M--------------------DEQTFTKTVAAFR----SQVE--GVQNAGRHLLIAGQGSDGGAAGKGATFTSLS 259 (723) T ss_pred -----C--------------------CHHHHHHHHHHHH----HHhh--chhhcCcceeecccccccccccCCceEEEcc Confidence 0 1112333333322 2221 223345566665432211 233443343 Q ss_pred cchhH-HHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCCh Q lcl|NC_021303. 327 NEVTE-VEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDP 405 (637) Q Consensus 327 ~dvte-vaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp 405 (637) ....+ --+++|+-.+..+|+-.=|||.-|+|-+ ++-+..+....-++..|.|.+..|+++|+..+|- .+|. T Consensus 260 ~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~s--t~sN~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~----~~g~-- 331 (723) T protein:vir:94 260 MSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGS--TYENQAEAKAAVWTETLIPQMEVMASITDLQLLP----DIGW-- 331 (723) T ss_pred CCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCC--CcccHHHHHHHHHHHHHHHHHHHHHHHHhHhhcc----cccC-- Confidence 33222 3578999999999999999998776543 2223344444456788999999999999998763 3454 Q ss_pred HHeEEeecCccc-ccCCC-CCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhh Q lcl|NC_021303. 406 TKYILWYDASGL-TSDPD-LSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLL 483 (637) Q Consensus 406 ~kYvvw~DaS~L-t~dPD-~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl 483 (637) +|-+-||...| +-|.. +.+-...++..|.+|-.-.|+.+|++.-.|=|- + -++.|+. T Consensus 332 -~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~----------------~----~~~~p~~ 390 (723) T protein:vir:94 332 -TVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIG----------------Q----MTLTPYR 390 (723) T ss_pred -ceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcc----------------c----ceecccc Confidence 35577887764 44443 233334567889999999999999964433120 0 0111211 Q ss_pred ccccccccCCCCcCCCCCCCCC-CCCCCCCCCCCCccCCC-CCCCc---ccCCCCcc-----hHHHHHHHHHHHHHHHHh Q lcl|NC_021303. 484 SSQLAGIEFPQPANAIESTREE-DDEDSGARQQREPQTED-ERSTE---EAASLNDR-----AAYLVAERLLVNRALDLA 553 (637) Q Consensus 484 ~~~~~~ie~P~p~~a~~~~~~~-~d~~~~a~~g~EPdted-~~~~~---~~a~~~~~-----a~~~aa~~llV~rALelA 553 (637) ..+.+..-|.| +...+.-- ....+. ..+++|..+- .+++. ......+. ...-....+++.=|+.+| T Consensus 391 -~~~a~~~~~~p--~~~e~~~~~~~~~~~-~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 466 (723) T protein:vir:94 391 -AQFAPAPAPAP--AVEEGAARMLALLER-VAADRPLPELPVRATTVLHHDPGPDPQQTLYERLEALLQPLLVELGRRQA 466 (723) T ss_pred -ccccCCCCCCc--cchhhhHhhhhhccc-cccccCcCCCCCCCCCCCCCCcccCCchhHHHHHHHHHhhhHHHHHHHHH Confidence 11111111111 11000000 000000 0011111110 00000 00000000 111112234455556666 Q ss_pred c---------ccccCCCchhhhhHhhcCchhhhhhh--cCC-----CCHH-----------------------------H Q lcl|NC_021303. 554 G---------KRRFKVNDAALKTKLRDVPAHEYHRV--LPP-----VRSS-----------------------------E 588 (637) Q Consensus 554 G---------kRr~~~~~~~~~~rlr~ip~h~~h~~--~~P-----V~~~-----------------------------~ 588 (637) + +|...+... ..+++|.+..+.+-+. +++ |+.. + T Consensus 467 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (723) T protein:vir:94 467 AVTLREFDLLMRGERAAAL-WLADVRAVASEAYERGALLAPPDAEEVPPARLTRLDLAPEELAVRINVKRIFNARKWVAR 545 (723) T ss_pred HHHHHhhchhhcchHHHHH-HHHHHHHHHHhccccceeccccccchhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Confidence 5 221111111 1235566666544442 111 1111 0 Q ss_pred HHHHHhcccc-----cccHHHHHHhC----CCHHHHHHHH----------HHHHH-----HHHHhhhhccccC Q lcl|NC_021303. 589 IPRLIAGWDT-----ALEDEVVASLG----LDNEKLRNAV----------LATVR-----RQLTQPLIEGEVV 637 (637) Q Consensus 589 v~rLi~GWd~-----~ld~~~~a~lG----~Dp~~lr~~v----------~~~v~-----~~lt~~vvd~~v~ 637 (637) ..+.+.+|.. +. +.+...++ +|+..++..+ ...+. ..++.++..|+=. T Consensus 546 ~~~~l~~~~~~~~~~~~-~~v~~~l~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~l~~~~i~~g~~~G~~~ 617 (723) T protein:vir:94 546 TKDTLRGWYETAWRTGG-DHVAAQLGDGFDMDEQVLDELDKRLDVLAGQINATTEAALRAQLLHHGVQQGESV 617 (723) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHhccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCH Confidence 1111122221 11 13334444 4554433221 11111 2234455555555 No 33 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=99.14 E-value=2.3e-11 Score=78.92 Aligned_cols=396 Identities=13% Similarity=0.074 Sum_probs=217.9 Q ss_pred ccccchheeh------hccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEEEEeeeccccCC Q lcl|NC_021303. 18 AARRRSLTAA------SQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTLIPSAIDPDTGL 91 (637) Q Consensus 18 ~~~r~~ltAA------s~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~DtG~ 91 (637) +--..-+..- +-..++|. +-.+.+|.. .. . +..=..+-+.-.+.-+++++|.+.+..=+-++|. T Consensus 1 MG~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~~-~~--~---~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~-- 70 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVDMTNPL--LLQWLGVDP-DT--P---RNQLSEATYFACLKILSESLGKLPLKMYQKTERG-- 70 (411) T ss_pred CchHHHHHhhccCcccccccchHH--HHHHhcCcc-cC--h---hhhhccHHHHHHHHHHHHhHhhCceeEEEecCCc-- Confidence 2211111110 00111111 111111110 00 0 0001223466678889999999998888877663 Q ss_pred CCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeeeHHHhc- Q lcl|NC_021303. 92 PTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIK- 170 (637) Q Consensus 92 PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~- 170 (637) .+.+++ +.+..+.+.=..--+...++++.++.+|.+-|++|+.+. |.+|. +.+.-.-..++..+..++-. T Consensus 71 ---~~~~~~----~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~-r~~g~-~~~l~~l~~~~v~~~~~~~~~ 141 (411) T protein:vir:81 71 ---IVKSDR----EELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQ-YSGPQ-LQALWILPSQYVTIVVDDRGL 141 (411) T ss_pred ---eeeecc----cHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEE-ecCCc-eEEEEEECCceEEEEEcCccc Confidence 222232 345555544344557889999999999999999998765 44443 22221111223333332211 Q ss_pred -c-CCCceeEEecC-CCCcccccCCCceEEEE-ecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceee Q lcl|NC_021303. 171 -S-KAGETAEISLP-DGKTHEFNRDLDSLVRI-WNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLF 246 (637) Q Consensus 171 -~-k~g~~~~i~lP-dG~~he~~~~~d~l~Rv-W~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlf 246 (637) . +..-...+..+ +|..++|..+. ||++ +++ +..-..-.||+.++...+.-..-..+...+..+.-.+-.|||- T Consensus 142 ~~~~~~~~~~~~~~~~g~~~~~~~~e--iih~k~~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 218 (411) T protein:vir:81 142 LGEKNAIWYRYNDPYDGKMYVFRNDE--ILHFKTSV-TFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLE 218 (411) T ss_pred ccccceEEEEEEecCCceEEEEcccc--EEEEcCCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 1 11222234444 56666655433 4444 332 2233457788888877776666666666666666566677887 Q ss_pred ecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecC Q lcl|NC_021303. 247 VPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFG 326 (637) Q Consensus 247 vPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~ 326 (637) +|+.++ ....+.+.+.+.+. +...+.+-. ++++ ++. -+++-|.+. T Consensus 219 ~~~~l~-------------------------~e~~~~~~~~~~~~----~~g~~n~g~--~~vl--~~g--~~~~~l~~~ 263 (411) T protein:vir:81 219 YTGDLN-------------------------QEARDRLVKGFEQF----ANGSKNAGK--IIPV--PLG--MKLVPLDIK 263 (411) T ss_pred eCCCCC-------------------------HHHHHHHHHHHHHH----hcCccccCC--ceec--CCC--ceEEEccCC Confidence 776432 11334455444332 222222222 2232 332 345555443 Q ss_pred cchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCCh Q lcl|NC_021303. 327 NEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDP 405 (637) Q Consensus 327 ~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp 405 (637) . .+.--+++|+.....+|.-.-|||..| |. .++|+-++.|....-++--|.|.+..|+++|++.+|.+.... T Consensus 264 ~-~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~----- 336 (411) T protein:vir:81 264 L-TDSQFFELKKYTALQIAAAFGIKPNQI-NDYEKSSYASAEAQNLAFYVDTLLYVLKQYEEEITYKILSNDLIS----- 336 (411) T ss_pred H-HHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhcC----- Confidence 2 233346889999999999999999866 66 578888888877777888899999999999999988765432 Q ss_pred HHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhh Q lcl|NC_021303. 406 TKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPL 482 (637) Q Consensus 406 ~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apL 482 (637) ..|-+.||.+.| .++|..+.+ ..++..|++|-.-.|.++|++-..+=| +.+ +. -.++|| T Consensus 337 ~~~~~~fd~~~l-l~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~ggD----~~~-------~~------~n~~pl 398 (411) T protein:vir:81 337 QGHYFKFNVNVI-LRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYGN----NLM-------AN------GNYIPL 398 (411) T ss_pred CCcEEEeechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----eee-------ec------cCccch Confidence 456789999998 445554444 457888999999999999987543312 000 00 011221 Q ss_pred hccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCc Q lcl|NC_021303. 483 LSSQLAGIEFPQPANAIESTREEDDEDSGARQQREP 518 (637) Q Consensus 483 l~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EP 518 (637) +.. + ++...|.|- T Consensus 399 --------~~~--------~-------~~~~kgGd~ 411 (411) T protein:vir:81 399 --------SML--------G-------ANYGKGGDS 411 (411) T ss_pred --------hhh--------h-------hhhccCCCC Confidence 100 0 001111111 No 34 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=99.14 E-value=1.8e-11 Score=79.52 Aligned_cols=409 Identities=14% Similarity=0.115 Sum_probs=213.3 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchh-----hHHHHHhhhhhhHhhHhhhhhcce Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQ-----SEAWDFSESIGELSYYISWRANSC 75 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ-----~eAW~~yd~VgELryyvgWr~~s~ 75 (637) |==+ |+.++.- .. +.+.. ..+-...+|...+.+. ..|. .++-+.-.|.=+++++ T Consensus 1 m~~~-----~~~~~~~---~~------~~~~~---~~~~~~~~g~~~s~~~~~v~~~~al----~~~~v~~cv~~ia~~i 59 (419) T protein:vir:80 1 MFFS-----RQLLSNL---GQ------TQPGS---GGWVSALLGSARSEAGQVVTPASAL----SLTVLQNCVTLLAESI 59 (419) T ss_pred CCcc-----ccccccc---Cc------CCCCc---chhhHHhhcccccccCcccChHHhh----ccHHHHHHHHHHHHhh Confidence 3211 1111110 00 00000 0111122222111111 1122 2345666777889999 Q ss_pred eeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccc Q lcl|NC_021303. 76 SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLA 155 (637) Q Consensus 76 Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~ 155 (637) |.+.|..=+.+.|..+ .+.+ +.+..+.+.--.--+-..++++.++.+|-+-|+.|+.+.-...|. T Consensus 60 a~lp~~~~~~~~~~~~-----~~~~----~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~------ 124 (419) T protein:vir:80 60 AQLPVELYERSGDDRK-----PATD----HPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGV------ 124 (419) T ss_pred ccCceEEEEecCCCcc-----cccc----cHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc------ Confidence 9999988888866322 2222 456666665556667888999999999999999999875443342 Q ss_pred cccccceeeeHHHhcc--CCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_021303. 156 APRARWYAVTREEIKS--KAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKN 233 (637) Q Consensus 156 ~~~~~W~~vt~~Ei~~--k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 233 (637) +. ..+.|..+.+.. ...+...... .|... | ...++++.=+++ .+-..--||+..+...+.-.....+...+ T Consensus 125 -~~-~L~~i~~~~v~i~~~~~~~~~y~~-~~~~~-~-~~~~i~h~~~~~--~d~~~G~s~i~~~~~~i~~~~~~~~~~~~ 197 (419) T protein:vir:80 125 -IQ-GLYPLDNEAVTVMKGPDLKPMYRV-AGADP-L-PQRLVHHVRWMS--INGYTGLSPVLLHANAIGHAQAIQQYAGK 197 (419) T ss_pred -EE-EEEEecCceEEEEECCCceEEEEE-cCccc-c-chhheEEecCCC--CCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 12 334443333331 1111222222 22211 1 122333322443 33345678877776666555555555555 Q ss_pred HHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_021303. 234 AAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVA 313 (637) Q Consensus 234 a~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP 313 (637) ..+.-..-.|||.+|+.+. . . ......+.|.+.+.+ .+...+.+-. ++++ + T Consensus 198 ~f~ng~~~~gil~~~~~~~--~-~------------------~~~~~~~~~~~~~~~----~~~g~~n~g~--~~vl--~ 248 (419) T protein:vir:80 198 SFMNGTALSGVIERPTDAP--A-L------------------KDQASVDRITDGWNA----KFGGSGNAKK--VALL--Q 248 (419) T ss_pred HHhcCCCccEEEEecCCCC--c-c------------------cCHHHHHHHHHHHHH----HhcCccccCC--ceec--C Confidence 5555556667777766321 0 0 112244556555543 2222222211 2222 3 Q ss_pred hHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHH Q lcl|NC_021303. 314 AEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDI 393 (637) Q Consensus 314 ~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~ 393 (637) +. .+++.|.+.. .+.--+++|+-.+..+|..+=|||.-|-...++|.-++-+....-++..|.|.+..|+++|+..+ T Consensus 249 ~g--~~~~~l~~s~-~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~kl 325 (419) T protein:vir:80 249 EG--MKFKPLSMTN-VDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDL 325 (419) T ss_pred CC--ceEEeccCCh-hhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 33 4566666542 34445788999999999999998876533355666666666666678889999999999999988 Q ss_pred HHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhc Q lcl|NC_021303. 394 LTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVT 470 (637) Q Consensus 394 Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~ 470 (637) |-+... ..|.+.||.+.| ..+|..+.+ ..+++.|.+|-.-.|+.+|++.-.+=| +. T Consensus 326 l~~~~~------~~~~i~fd~~~l-~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gGD----~~---------- 384 (419) T protein:vir:80 326 LLPSER------KQYFIEYNLAGL-LRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGD----IY---------- 384 (419) T ss_pred cCcccc------CCeEEEEechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc----ee---------- Confidence 755332 368999999997 334443332 337889999999999999997543312 00 Q ss_pred CCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccCCCCcchHHHHHHHHHHHHHH Q lcl|NC_021303. 471 KNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRAL 550 (637) Q Consensus 471 ~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~a~~~~~a~~~aa~~llV~rAL 550 (637) +.|+ .++..+.|++ . .++++.+.++ .. .. + .|.| T Consensus 385 --------~~~~---n~~~~~~~~~------------~-----~~~~~~~~~~-~~----------~~--~-----~~~l 418 (419) T protein:vir:80 385 --------LSPM---NMVDASKPQP------------I-----PMGKTEPTKA-AL----------DE--I-----GRIL 418 (419) T ss_pred --------eecc---cccccccccc------------c-----cCCCCCchhh-hH----------HH--H-----Hhhc Confidence 1110 1111111110 0 0011111000 00 00 0 1111 Q ss_pred H Q lcl|NC_021303. 551 D 551 (637) Q Consensus 551 e 551 (637) . T Consensus 419 ~ 419 (419) T protein:vir:80 419 S 419 (419) T ss_pred C Confidence 1 No 35 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=99.13 E-value=5.3e-11 Score=76.91 Aligned_cols=409 Identities=13% Similarity=0.094 Sum_probs=212.3 Q ss_pred CCCCcceEEecCCCC-------CcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhc Q lcl|NC_021303. 1 MAATSLRVVRRPKGS-------APAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRAN 73 (637) Q Consensus 1 ma~~~lr~vrrpk~~-------~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~ 73 (637) |--+..-+-=+++.. -...++ ..+......+.|- .+.++..| ..+-.+ -+-.++-+.-.|.-+++ T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~f~~~~-~~~~~~~~~~~~~-~~~~~~~~---~~v~~~---~al~~~~v~~cv~~Ia~ 72 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLKSWFVGGR-LVTPNQGSQTGPV-SAHGYLGD---SSINDE---RILQISTVWRCVSLIST 72 (424) T ss_pred CCCCccccccCCCCchHHHHHhhccccc-cccccchhhcccc-cccccccc---ccccHH---HhhccHHHHHHHHHHHH Confidence 322211111111110 000000 0000000001110 01111111 111111 11123345566788999 Q ss_pred ceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccc Q lcl|NC_021303. 74 SCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTG 153 (637) Q Consensus 74 s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~ 153 (637) ++|.+.|..=+.+.|+|... +. ..+.+..+.+.=..--+-..++++.++.+|-+-|++|+.+.-...|. +. T Consensus 73 ~iA~lp~~vy~~~~~~~~~~--~~-----~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~- 143 (424) T protein:vir:18 73 LTACLPLDVFETDQNDNRKK--VD-----LSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGD-VI- 143 (424) T ss_pred hhccCceEEEEeccCCceee--ec-----cccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EE- Confidence 99999998878887754321 11 12446666555555567788899999999999999999875333332 21 Q ss_pred cccccccceeeeHHHhc--cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_021303. 154 LAAPRARWYAVTREEIK--SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKI 231 (637) Q Consensus 154 ~~~~~~~W~~vt~~Ei~--~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 231 (637) .++.+....+. ..++...+-.-.+|...+|... | ||++=.+.+ ....--||+..+...+.--.-..+.. T Consensus 144 ------~L~~l~~~~v~v~~~~~~~~y~~~~~g~~~~~~~~-e-Vihir~~~~-dg~~G~spi~~~~~~i~~~~~~~~~~ 214 (424) T protein:vir:18 144 ------SLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQK-E-IFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQ 214 (424) T ss_pred ------EEEEecCcceEEEEcCCeEEEEEEeCCeEEEeccc-c-EEEecCcCC-CCcccccHHHHHHHHHHHHHHHHHHH Confidence 34444434333 2223322222346766666553 2 344422222 22345578776655544333333333 Q ss_pred HHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_021303. 232 KNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVAS 311 (637) Q Consensus 232 ~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~ 311 (637) .+..+.-..-.|||-+|+.+. .....+.+.+++.+.. .-+++- =++|+ T Consensus 215 ~~~f~ng~~~~gil~~~~~~l------------------------~~e~~~~~~~~~~~~~----~~~nag---~~~vl- 262 (424) T protein:vir:18 215 RDFFANGAKSPQILSTGEKVL------------------------TEQQRSQVEENFKEIA----GGPVKK---RLWIL- 262 (424) T ss_pred HHHHhccCCcceEEEeCCcCC------------------------CHHHHHHHHHHHHHHh----CCcccC---Cceec- Confidence 333333333446776666321 1124455666664432 112221 13333 Q ss_pred echHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeee--EEeccCceeEeechhHHHHHHH Q lcl|NC_021303. 312 VAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSA--WAIGDEDVQLHIKPVMDLICQA 388 (637) Q Consensus 312 vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsA--W~I~dedVrlHI~P~me~ic~A 388 (637) ++. -+++-|.+...- .--+++|+-.+..+|...-|||.-| |. .+++.|.+ -|....-++.-|.|.+..|+++ T Consensus 263 -~~g--~~~~~l~~~~~d-~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~eq~~~~f~~~tl~P~~~~ie~~ 337 (424) T protein:vir:18 263 -EAG--FSTSAIGVTPQD-AEMMASRKFQVSELARFFGVPPHLV-GDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENS 337 (424) T ss_pred -cCC--ceEEecCCChhH-HHHHHHHHHhHHHHHHHhCCCHHHh-CCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 333 355666554322 2237899999999999999987665 76 56777755 4555556778899999999999 Q ss_pred HHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHH Q lcl|NC_021303. 389 IYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFA 465 (637) Q Consensus 389 it~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A 465 (637) |++.+|-+ .+. ..|.+.||.+.| ..+|..+.+ ..++..|.+|-.-.|+.+|++.-.+=| +.+ T Consensus 338 ln~~L~~~----~~~--~~~~~~fd~~~l-lr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD----~~~---- 402 (424) T protein:vir:18 338 IQRWLIPS----KDV--GRLHAEHNLDGL-LRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGD----VAM---- 402 (424) T ss_pred HHhhcCCc----ccc--CCeEEEEechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC----eee---- Confidence 99888654 233 368899999998 455654443 357888999999999999988543311 000 Q ss_pred HHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCC Q lcl|NC_021303. 466 ADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGA 512 (637) Q Consensus 466 ~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a 512 (637) +..| +.||= ..++..+...++| T Consensus 403 ---~~~n------~~~l~----------------~~~~~~~~~~n~a 424 (424) T protein:vir:18 403 ---RQAQ------YVPIT----------------DLGTNKEPRNNGA 424 (424) T ss_pred ---eccC------ccchh----------------hhhccCCccccCC Confidence 0011 01110 0011111222222 No 36 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=99.13 E-value=7.4e-11 Score=76.11 Aligned_cols=413 Identities=12% Similarity=0.159 Sum_probs=208.3 Q ss_pred CCCC-------cceEEecCCCCCcccccchheehhccccchhhhhhhhcccc-cccchhhHHHHHhhhhhhHhhHhhhhh Q lcl|NC_021303. 1 MAAT-------SLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGT-ARNEWQSEAWDFSESIGELSYYISWRA 72 (637) Q Consensus 1 ma~~-------~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~-~r~~WQ~eAW~~yd~VgELryyvgWr~ 72 (637) |-+- .++-.-.|........+.+ ..|+....+.+ ++. ..++ ..=-++-+=..+-+.-.+.-++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~----~~~~s~~g-~~v~~~~al~~~~V~~~i~~Ia 71 (432) T protein:vir:10 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQT----FTPVNATARDL----GIIISDTG-AAVNADAIMRLDAVAACVKLVS 71 (432) T ss_pred CCCCcccchhhhhHhhcCCccccccccccc----cccCcchhhhh----cccccccC-cccchhhhhcchHHHHHHHHHH Confidence 2211 1111111111000000000 01111111111 000 0000 0001122223455666778899 Q ss_pred cceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccc Q lcl|NC_021303. 73 NSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVT 152 (637) Q Consensus 73 ~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~ 152 (637) +++|++.+..=+-+.| |. ..+.+ +.+..+.+.=-..-+...++++.++.+|-+-|++|+.+.- .+|.+ T Consensus 72 ~~ia~lp~~~y~~~~~-g~----~~~~~----~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~-~~g~~-- 139 (432) T protein:vir:10 72 QAIAAMPLTMYMRTPD-GR----KEAVN----HPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVV-TDGRI-- 139 (432) T ss_pred HhhhhCceeEEEecCC-Cc----ccccc----cHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEe-cCCcE-- Confidence 9999999988776766 22 22222 3455555444455688889999999999999999987654 35531 Q ss_pred ccccccccceeeeHHHhc---cCCCceeE-EecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhh Q lcl|NC_021303. 153 GLAAPRARWYAVTREEIK---SKAGETAE-ISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTT 228 (637) Q Consensus 153 ~~~~~~~~W~~vt~~Ei~---~k~g~~~~-i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rtt 228 (637) . ..+.|..+.+. ...|...+ +...+|...+|.. +-|+++=++ +.....--||+..+.+.+.--.... T Consensus 140 -----~-~L~~l~~~~v~v~~~~~g~~~y~~~~~~g~~~~~~~--~~iih~~~~-~~dg~~G~spi~~~~~~i~~~~~~~ 210 (432) T protein:vir:10 140 -----E-SLQYLANDRLTITTDTKGNTAYRYRRTDGQMIDIPK--QQIWKIMGY-SLDGENGLSAIRYGAQIFGTAIAAE 210 (432) T ss_pred -----E-EEEEEcCCceEEEEcCCCcEEEEEEecCceEEEEcC--ccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHH Confidence 1 23333333332 12333333 3455776666654 334554222 2233445578777766655544444 Q ss_pred HHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccce Q lcl|NC_021303. 229 RKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPL 308 (637) Q Consensus 229 k~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPi 308 (637) +...+.-+.-....|||-+|+.++ ....+.|.+-+... .+.+. ++ T Consensus 211 ~~~~~~f~ng~~~~gil~~~~~l~-------------------------~e~~~~~~~~~~~~-----~nag~-----~~ 255 (432) T protein:vir:10 211 AQAARAFRNGQLQSVYYQIDRFLT-------------------------DDQYDSFAKKVSGS-----VEAGR-----AP 255 (432) T ss_pred HHHHHHHhcCCCcceEEecCCCCC-------------------------HHHHHHHHHHHhhh-----hhCCC-----ce Confidence 444444444445567777776432 11344555544321 11111 23 Q ss_pred eEeechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCC--cceee--eEEeccCceeEeechhHHH Q lcl|NC_021303. 309 VASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSK--GNHWS--AWAIGDEDVQLHIKPVMDL 384 (637) Q Consensus 309 va~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~--~NHWs--AW~I~dedVrlHI~P~me~ 384 (637) |+ ++. -+++-|.+..+-.. -+++|+..+..+|.-.-|||. |||..+ ++=|. .-|..-.=++.-|.|.+.. T Consensus 256 vl--~~g--~~~~~l~~~~~d~q-~le~~~~~~~~Ia~afgVPp~-~lg~~~~~t~~~~sn~e~~~~~f~~~tl~P~~~~ 329 (432) T protein:vir:10 256 LL--EGG--MDVKSLGLNPVDAQ-LLQSRQYSVESICRFFGVPPS-MIGHSSAGTTSWGSGIESQQLGFLSMTLSPWLRR 329 (432) T ss_pred ec--CCC--ceEEEccCChHHHH-HHHHHHHHHHHHHHHhCCCHH-HcCCccCCcccccchHHHHHHHHHHHHHHHHHHH Confidence 33 333 35666665443222 478999999999999999885 567742 22232 2333333455679999999 Q ss_pred HHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHH Q lcl|NC_021303. 385 ICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGC 461 (637) Q Consensus 385 ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~ 461 (637) |+++|++.+|.+.-. .+|.+-||.+.| ...|..+.| ..++..|.+|-.-.|+.+|++.-.|=+ T Consensus 330 ie~~ln~kL~~~~~~------~~~~~~fd~~~l-l~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g~~------- 395 (432) T protein:vir:10 330 IEQSIALNLLSPAER------RRYFADFDTSAL-LRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNA------- 395 (432) T ss_pred HHHHHHhhhcCcccc------CceEEEeechhh-hccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCc------- Confidence 999999988876422 468899999998 344444333 347788999999999999997544311 Q ss_pred HHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCC Q lcl|NC_021303. 462 REFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDER 524 (637) Q Consensus 462 r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~ 524 (637) +.+..+-. +.||= .+. +.++|.++...++++.++ ++. T Consensus 396 -----~~~~~~~~----~~pl~--~~~--~~~~~~~~~~~~~~~~~~-------------~~~ 432 (432) T protein:vir:10 396 -----AVLTVQSA----MVPLD--SIG--LQASPEPASGLGNQQQDK-------------VSK 432 (432) T ss_pred -----ceEeecCc----ccchh--hhc--ccCCCCCCCCCCCccccc-------------ccC Confidence 00000100 01110 000 001111111111111111 111 No 37 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=99.12 E-value=2.8e-11 Score=78.44 Aligned_cols=414 Identities=12% Similarity=0.093 Sum_probs=213.3 Q ss_pred ceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEEEEeee Q lcl|NC_021303. 6 LRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTLIPSAI 85 (637) Q Consensus 6 lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~asei 85 (637) .-+-+.=|+.+..++- --..+.+-... ..+..|- .-+ .+..+ .++.++-.+.-+++++|.+.+..=+- T Consensus 1 m~~~~~~~~~~~~~~~-----~~~~~~~~~~~-~~~~~g~-~v~----~~~al-~~~~v~~~i~~ia~~ia~lp~~~~~~ 68 (419) T protein:vir:57 1 MFIPQFWKGRPSENRV-----NWQVVPGGMRS-SSSQAGV-IIT----PETAL-ALSAVRACVTLLAESVAQLPCVLYRR 68 (419) T ss_pred CcchhhhccCCccccc-----ccccccccccc-ccccCCc-eec----hHHhh-ccHHHHHHHHHHHHhhccCceEEEEE Confidence 3333332332211110 00001000000 0000010 001 11122 23556777888999999999988777 Q ss_pred ccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeee Q lcl|NC_021303. 86 DPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVT 165 (637) Q Consensus 86 D~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt 165 (637) +.+.+... +. .+.+.++.+.=..--+...++++.++.+|-+-|+.|+.|.-...|. +. .++.|. T Consensus 69 ~~~g~~~~----~~----~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~-~~-------~L~pl~ 132 (419) T protein:vir:57 69 TENGGREI----AF----DHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGD-IT-------ELIPIN 132 (419) T ss_pred cCCCceec----cc----cchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EE-------EEEEEc Confidence 76733221 12 3456666554455567788899999999999999998876444443 11 223333 Q ss_pred HHHhc--cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCc Q lcl|NC_021303. 166 REEIK--SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNG 243 (637) Q Consensus 166 ~~Ei~--~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnG 243 (637) ..-+. ....+.+.-..+ +..+.|.. +-||++=+++ .+...--||+.++...+.-.....+...+..+.-..-.| T Consensus 133 ~~~v~v~~~~~g~~~y~~~-~~~~~~~~--~~vih~r~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~g 208 (419) T protein:vir:57 133 PHKVIVLKGPDGMPYYDIP-SIGEILPM--RMVHHIKSFS-LDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSG 208 (419) T ss_pred CcceEEEECCCceEEEEEc-CCceEEch--hhEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccE Confidence 22222 122233333333 22233332 3445543332 223445688877777666555555554444444444457 Q ss_pred eeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccccee Q lcl|NC_021303. 244 VLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHI 323 (637) Q Consensus 244 vlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHl 323 (637) ||.+|..+.- . .+....+.|.+.+.+. +...+. +--++|+ ++. -+++.| T Consensus 209 il~~~~~~~~---~------------------~~~e~~~~~~~~~~~~----~~g~~n--ag~~~vl--~~g--~~~~~l 257 (419) T protein:vir:57 209 VIERPFEAKA---I------------------ASQAAVDAILAKWTER----YGGVRN--AFSVGML--QEG--MTYKQL 257 (419) T ss_pred EEEecCcCCc---c------------------cCHHHHHHHHHHHHHH----hccccc--cccceec--CCC--ceEEEc Confidence 7777652210 0 1223455666555443 222111 2223333 333 355555 Q ss_pred ecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCC Q lcl|NC_021303. 324 KFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGI 403 (637) Q Consensus 324 kf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGi 403 (637) .+. ..+.--+++|+..+..+|...-|||..|=+..++|.-++-+....-++--|.|.+..|+++|++.+|.+... T Consensus 258 ~~~-~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~---- 332 (419) T protein:vir:57 258 SQD-NEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGLQYVIYTMLAILKRHESAMMRDLLLPSER---- 332 (419) T ss_pred CCC-hhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc---- Confidence 542 223334789999999999999999887644456666555555555566679999999999999998876432 Q ss_pred ChHHeEEeecCcccccCCCCCHHHH---HHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHH Q lcl|NC_021303. 404 DPTKYILWYDASGLTSDPDLSDEAV---EAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYA 480 (637) Q Consensus 404 Dp~kYvvw~DaS~Lt~dPD~tdeA~---~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~a 480 (637) ..|.|.||.+.| ..+|..+.+. .+++.|.+|-.-.|..+|++.-.+=| + -+. T Consensus 333 --~~~~i~fd~~~l-l~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD----~------------------~~~ 387 (419) T protein:vir:57 333 --RDFYIEFNVSSL-LRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGGD----K------------------YLT 387 (419) T ss_pred --CCeEEEEechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC----e------------------eee Confidence 369999999998 4455555443 46788999999999999997433212 0 001 Q ss_pred hhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccCCCC Q lcl|NC_021303. 481 PLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEAASLN 533 (637) Q Consensus 481 pLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~a~~~ 533 (637) |+- +...+-+ ..++ ++...+.||.+-. ..+-. T Consensus 388 ~~n---~~~~~~~------~~~~-------~~~~~~~~~~~~~-----~~~~~ 419 (419) T protein:vir:57 388 PLN---MVDSKAL------TGIG-------KATPQQLKDIEAI-----LCTRN 419 (419) T ss_pred ccc---ccccccc------cccc-------CCCcccCcchhhh-----hhccC Confidence 110 0000000 0000 0011111111110 00001 No 38 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=99.10 E-value=1.1e-10 Score=75.17 Aligned_cols=398 Identities=14% Similarity=0.118 Sum_probs=200.8 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccc--cccchhhHHHHHhhhhhhHhhHhhhhhcceeee Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGT--ARNEWQSEAWDFSESIGELSYYISWRANSCSRT 78 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~--~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~ 78 (637) |. ..++.+... -...+ .+....+|. ..-... .|++ ++-+.-.+.-+++++|++ T Consensus 1 m~-----~f~~~~~~~------------~~~~~---~~~~~~~~~~~~~~~~~-~Al~----~~~V~~~i~~Ia~~iA~l 55 (406) T protein:vir:97 1 MS-----FFQPLGTSK------------VSYDD---YISSVLAGDVSQKYLGV-SALK----NSDILTATSIIAGDIARF 55 (406) T ss_pred Cc-----cccccCCCC------------CCcch---HHHHHhcCCCCcccccc-hhhc----cHHHHHHHHHHHHhhhhC Confidence 32 122111111 00111 111111111 000111 1322 233444577899999999 Q ss_pred EEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccc Q lcl|NC_021303. 79 TLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPR 158 (637) Q Consensus 79 rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~ 158 (637) .|+.-.- | |. + +.+ +.+..+.+.-..--+-..++++.++.+|.+-|++|+.+.-...++-+ T Consensus 56 p~~~~~~--~-g~----~-~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~------- 116 (406) T protein:vir:97 56 PLVKKDV--N-GD----I-IHD----EDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQA------- 116 (406) T ss_pred eeEEEec--C-cc----c-ccc----chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeE------- Confidence 9976433 3 22 2 122 34555554444566788899999999999999999986543222111 Q ss_pred ccceeeeHHHhcc---CCCceeE-EecC-CCCcccccCCCceE-EEEecCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_021303. 159 ARWYAVTREEIKS---KAGETAE-ISLP-DGKTHEFNRDLDSL-VRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (637) Q Consensus 159 ~~W~~vt~~Ei~~---k~g~~~~-i~lP-dG~~he~~~~~d~l-~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 232 (637) ..++.+....+.. .++.-.+ +..+ +|...+|.. .|++ ||.. +.+-..--||+.++.+.+.-..-..+... T Consensus 117 ~~L~~i~p~~v~v~~~~~~~~~y~~~~~~~~~~~~~~~-~evih~r~~---~~dg~~G~spi~~~~~~i~~~~a~~~~~~ 192 (406) T protein:vir:97 117 LQFQFYRPSETTVEETDNHEIVYTFTDMLTAKQVKCFA-HDVIHWKFF---SHDTILGRSPLLSLGDEIDLQTGGINTLI 192 (406) T ss_pred EEEEEECCCeeEEEEcCCceEEEEEEecCCceEEEEcc-ccEEEecCC---CCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 1344444443331 2222222 3333 344444433 3332 4322 22223356776655554443333333332 Q ss_pred HHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_021303. 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (637) Q Consensus 233 na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~v 312 (637) +++.||. .|..+..+... .+....+.+++.|.+ .+...++ - -|+|+ T Consensus 193 -----~~f~ng~--~~~~i~~~~~~------------------l~~e~~~~~~~~~~~----~~~g~n~--g-~~~vl-- 238 (406) T protein:vir:97 193 -----KFFKDGF--SSGILTMKGAQ------------------LSGDARQRARQEFEK----MREGSVG--G-SPLVF-- 238 (406) T ss_pred -----HHHhccC--CCceEEecCCC------------------CCHHHHHHHHHHHHH----Hhccccc--C-ceeec-- Confidence 3445553 23322222111 122345556665533 2332221 1 12333 Q ss_pred chHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhH Q lcl|NC_021303. 313 AAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYND 392 (637) Q Consensus 313 P~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~ 392 (637) ++. .+++.|.+..+... -+++|+-.+..+|...-|||.-| |.. +..-+..+...+=++.-|.|++..|+++|+.. T Consensus 239 ~~g--~~~~~l~~~~~d~q-~le~~~~~~~~Ia~afgVPp~~l-g~~-~~~~~~e~~~~~f~~~~l~P~~~~ie~~l~~k 313 (406) T protein:vir:97 239 DST--MEYTPLEIDTNVLQ-LITSNNFSTAQIAKALRVPSYKL-GVN-SPNQSVAQLMEDYVTNDLPFYFDAITSELGLK 313 (406) T ss_pred CCC--ceEEEccCCHHHHH-HHHHHHhhHHHHHHHhCCCHHHc-CCC-CCcchHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 333 56777776654433 36899999999999999999877 542 22224445555555677999999999999998 Q ss_pred HHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCC Q lcl|NC_021303. 393 ILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKN 472 (637) Q Consensus 393 ~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~ 472 (637) +|.+.-. ..|.+-||.+.+.. .+.++...++..|.+|..-.|..+|+..-.+-.-+ +.+ +.. T Consensus 314 ll~~~~~------~~~~i~fd~~~~~~--~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD--~~~-------~~~- 375 (406) T protein:vir:97 314 TLNDKDR------RLYHIEFDTRSVTG--RNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMD--RYQ-------SSL- 375 (406) T ss_pred hcChhhc------cceeEEEecCccch--hhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCC--eEe-------ecc- Confidence 7754322 46888999876532 23455567888999999999999999854431100 000 000 Q ss_pred chhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCC Q lcl|NC_021303. 473 PELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERS 525 (637) Q Consensus 473 P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~ 525 (637) .++| +...++..+....+..|.|-+.+++.+ T Consensus 376 -----n~~~-----------------~~~~~~~~~~~~~~~~gg~~~~~~~~~ 406 (406) T protein:vir:97 376 -----NYVF-----------------LDKKEEYQDKVGIKGKGGEVNAEEDKS 406 (406) T ss_pred -----Cccc-----------------hhcccccccccccccCCCCCCCCCCCC Confidence 0111 111122233333344444444444333 No 39 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=99.09 E-value=4.2e-11 Score=77.46 Aligned_cols=413 Identities=12% Similarity=0.148 Sum_probs=204.7 Q ss_pred CCcceEE---ecCCC-CCcccccchheeh----hccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcc Q lcl|NC_021303. 3 ATSLRVV---RRPKG-SAPAARRRSLTAA----SQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANS 74 (637) Q Consensus 3 ~~~lr~v---rrpk~-~~p~~~r~~ltAA----s~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s 74 (637) -+..|+. -|-|. -.+ +..+.+. .+++....+.+.... ..++ ..=-++-+-..+-+.-.+.-++++ T Consensus 1 ~~~~~~mg~f~r~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~g-~~v~~~~al~~~~V~~~i~~Ia~~ 73 (432) T protein:vir:81 1 MPDEKKLGLFGQLKAMFVP---PDPVDIGGGQTFTPVNATARDLGIII---SDTG-AAVNADAIMRLDAVAACVKLVSQA 73 (432) T ss_pred CCchhhcchhhhhhhhccc---ccccccccccccccCccchhhhcccc---cccC-cccchHhhhccHHHHHHHHHHHHh Confidence 1122211 01010 000 0000000 111111111110000 0000 000011222334556677889999 Q ss_pred eeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccc Q lcl|NC_021303. 75 CSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGL 154 (637) Q Consensus 75 ~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~ 154 (637) +|++.+..-+-+.|. ..++.+ +.+..+.+.=...-+-..++++.++.+|.+-|++|+.+. |.+|. +. T Consensus 74 ia~lp~~~y~~~~~g-----~~~~~~----~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~-~~~g~-~~-- 140 (432) T protein:vir:81 74 IAAMPLTMYMRTPDG-----RKEAVN----HPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKV-VTDGR-IE-- 140 (432) T ss_pred hhhCceeeEEecCCc-----ceeccc----chHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEE-ecCCc-EE-- Confidence 999999887777662 222222 445555554445567888999999999999999998765 44553 11 Q ss_pred ccccccceeeeHHHhcc---CCCceeE-EecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHH Q lcl|NC_021303. 155 AAPRARWYAVTREEIKS---KAGETAE-ISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRK 230 (637) Q Consensus 155 ~~~~~~W~~vt~~Ei~~---k~g~~~~-i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~ 230 (637) ..+.|..+.+.. ..|...+ +...+|...+|.. +-|+++ ...+-....--||+.++.+.+.--.-..+. T Consensus 141 -----~L~~l~~~~v~v~~~~~g~~~y~~~~~~g~~~~~~~--~~iih~-r~~~~dg~~G~spi~~~~~~i~~~~~~~~~ 212 (432) T protein:vir:81 141 -----SLQYLANDRLTITTDPKGNTAYRYRRTDGQMIDIPK--QQIWKI-MGYSLDGENGLSAIRYGAQIFGTAIAAEAQ 212 (432) T ss_pred -----EEEEEcCCceEEEECCCCcEEEEEEecCceEEEEcc--ccEEEe-cCCCCCCcccccHHHHHHHHHHHHHHHHHH Confidence 222333333321 2333333 3455776666644 234444 222323345567877766555544444444 Q ss_pred HHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeE Q lcl|NC_021303. 231 IKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVA 310 (637) Q Consensus 231 I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva 310 (637) ..+..+.-....|||-+|+.++ ....+.+.+-+... .+ +-=++|+ T Consensus 213 ~~~~f~ng~~~~gil~~~~~l~-------------------------~e~~~~~~~~~~~~-----~n-----ag~~~vl 257 (432) T protein:vir:81 213 AARAFRNGQLQSVYYQIDRFLT-------------------------DDQYDSFAKKVSGS-----VE-----AGRAPLL 257 (432) T ss_pred HHHHHhcCCCcceEEecCCCCC-------------------------HHHHHHHHHHHhhh-----hc-----CCCceec Confidence 4433333334447777765332 01233444433211 11 1113333 Q ss_pred eechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCC--cceee--eEEeccCceeEeechhHHHHH Q lcl|NC_021303. 311 SVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSK--GNHWS--AWAIGDEDVQLHIKPVMDLIC 386 (637) Q Consensus 311 ~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~--~NHWs--AW~I~dedVrlHI~P~me~ic 386 (637) ++. .+++-|.+..+-. --+++|+-.+..+|...-|||. |||..+ ++=|. ..|....=++.-|.|.+..|+ T Consensus 258 --~~g--~~~~~l~~~~~d~-q~le~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie 331 (432) T protein:vir:81 258 --EGG--MDVKSLGLNPVDA-QLLQSRQYSVESICRFFGVPPS-MIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIE 331 (432) T ss_pred --CCC--ceEEEccCCHHHH-HHHHHHHHHHHHHHHHhCCCHH-HcCCcCCccccccchHHHHHHHHHHHHHHHHHHHHH Confidence 333 3566666654332 3378999999999999999885 568743 22232 344444556678999999999 Q ss_pred HHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHH---HHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHH Q lcl|NC_021303. 387 QAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAV---EAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCRE 463 (637) Q Consensus 387 ~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~---~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~ 463 (637) ++|+..+|.+.-. .+|.+-||.+.|. .+|..+.|. .++..|.+|-.-.|+.+|++.-.|=+ T Consensus 332 ~~l~~kLl~~~~~------~~~~~~fd~~~ll-r~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g~~--------- 395 (432) T protein:vir:81 332 QSIALNLLSPAER------RRYFADFDTSALL-RADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNA--------- 395 (432) T ss_pred HHHHhhccCcccc------CceEEEeechhhh-ccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCc--------- Confidence 9999988875332 4689999999983 445444432 36678999999999999997533210 Q ss_pred HHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCC Q lcl|NC_021303. 464 FAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDER 524 (637) Q Consensus 464 ~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~ 524 (637) +.+..+-. +.||= .+.. -++|.++ .+..+++.. ++.. T Consensus 396 ---~~~~~~~~----~~pl~--~~~~--~~~~~~~--~~~~n~~~~-----------~~~~ 432 (432) T protein:vir:81 396 ---AVLTVQSA----MVPLD--SIGL--QASPEPA--SGLGNQQQD-----------KVSK 432 (432) T ss_pred ---ceEeecCc----ccchh--hhcc--CCCCCCC--CCCCCcccc-----------cccC Confidence 00100100 01110 0000 0111111 111111111 1110 No 40 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=99.08 E-value=1.2e-10 Score=74.96 Aligned_cols=410 Identities=13% Similarity=0.089 Sum_probs=213.1 Q ss_pred CCCCcceEEecCCCCC--c-c---cccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcc Q lcl|NC_021303. 1 MAATSLRVVRRPKGSA--P-A---ARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANS 74 (637) Q Consensus 1 ma~~~lr~vrrpk~~~--p-~---~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s 74 (637) |--+..-+-=++++.= + . ..++-.+...+-.+.|- .+..+..|. .=+ ...|++ ++-+.-.|.-++++ T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-~v~-~~~al~----~~~v~~cv~~Ia~~ 73 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPV-SAHGHLGDS-SIN-DERILQ----ISTVWRCVSLISTL 73 (424) T ss_pred CCCCcceEeecCCCchHHHHHhhhccccccccccccccccc-ccccccccc-ccc-HHHhhc----cHHHHHHHHHHHHh Confidence 3332222222222220 0 0 00000000000001110 000111110 000 111222 22344567889999 Q ss_pred eeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccc Q lcl|NC_021303. 75 CSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGL 154 (637) Q Consensus 75 ~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~ 154 (637) +|.+.+..=+.+.|+|... +. . .+.+..+++.=-.--+-..++++.++.+|-+-|+.|+.+.-...|. +. T Consensus 74 iA~lp~~~~~~~~~~~~~~--~~-~----~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~-- 143 (424) T protein:vir:18 74 TACLPLDVFETDQNDNRKK--VD-L----SNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGD-VI-- 143 (424) T ss_pred hccCceEEEEeecCCceee--ec-c----ccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EE-- Confidence 9999888878777754331 11 1 2345555554444457778899999999999999999875333332 21 Q ss_pred ccccccceeeeHHHhc--cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_021303. 155 AAPRARWYAVTREEIK--SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (637) Q Consensus 155 ~~~~~~W~~vt~~Ei~--~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 232 (637) .++.+....+. ..++...+-.-.+|...+|....=+-||-.+++. ..--||+.++.+.+.--....+... T Consensus 144 -----~L~pl~~~~V~v~~~~~~~~y~~~~~g~~~~~~~~eIih~r~~~~dg---~~G~spi~~~~~~i~~~~a~~~~~~ 215 (424) T protein:vir:18 144 -----SLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFGFTG---LVGLSPIAFACKSAGVAVAMEDQQR 215 (424) T ss_pred -----EEEEecCcceEEEEcCCeEEEEEEeCCeEEEeccccEEEecCcCCCC---cccccHHHHHHHHHHHHHHHHHHHH Confidence 33344333333 2222222222336666666553222244333332 3456888877666655555555555 Q ss_pred HHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_021303. 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (637) Q Consensus 233 na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~v 312 (637) +..+.-..-.|||-+|+.+. .....+.+.+.+.+.- .-+++- =++|+ T Consensus 216 ~~f~ng~~p~gil~~~~~~l------------------------~~e~~~~~~~~~~~~~----~g~nag---~~~vl-- 262 (424) T protein:vir:18 216 DFFANGAKSPQILSTGEKVL------------------------TEQQRSQVEENFKEIA----GGPVKK---RLWIL-- 262 (424) T ss_pred HHHHccCCcceEEEeCCcCC------------------------CHHHHHHHHHHHHHHh----CCcccC---Cceec-- Confidence 54454455556877776321 1124455666554332 222221 12232 Q ss_pred chHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeee--EEeccCceeEeechhHHHHHHHH Q lcl|NC_021303. 313 AAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSA--WAIGDEDVQLHIKPVMDLICQAI 389 (637) Q Consensus 313 P~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsA--W~I~dedVrlHI~P~me~ic~Ai 389 (637) ++. -+++-|.+..+ +.--+++|+-.+..+|.-.-|||..| |. .+++.|++ .|....-++-.|.|.+..|+++| T Consensus 263 ~~g--~~~~~l~~~~~-d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~eq~~~~f~~~tl~P~~~~ie~~l 338 (424) T protein:vir:18 263 EAG--FSTSAIGVTPQ-DAEMMASRKFQVSELARFFGVPPHLV-GDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSI 338 (424) T ss_pred cCC--ceEEecCCChh-HHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333 35555554432 22337899999999999999987765 66 56666644 55556667778999999999999 Q ss_pred HhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHH Q lcl|NC_021303. 390 YNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAA 466 (637) Q Consensus 390 t~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~ 466 (637) ++.+|.+. +. .+|.+.||.+.| ..+|..+.+ ..++..|.+|-.-.|+.+|++--.|=| +.+ T Consensus 339 ~~~L~~~~----~~--~~~~~~fd~~~l-lr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGD----~~~----- 402 (424) T protein:vir:18 339 QRWLIPAK----DV--GRIHAEHNLDGL-LRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD----VAM----- 402 (424) T ss_pred HhhcCCcc----cc--CCeEEEEechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC----eee----- Confidence 99887652 22 368999999998 455554443 347788999999999999988533211 000 Q ss_pred HHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCC Q lcl|NC_021303. 467 DVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGA 512 (637) Q Consensus 467 d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a 512 (637) +..+ +.|+ + . .++..+...++| T Consensus 403 --~~~n------~~~l--------~------~--~~~~~~p~~~ga 424 (424) T protein:vir:18 403 --RQSQ------YVPI--------T------D--LGTNKEPRNNGA 424 (424) T ss_pred --eccC------ccch--------H------h--hhccCCCccCCC Confidence 0000 0010 0 0 011111111222 No 41 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=99.08 E-value=2.2e-10 Score=73.55 Aligned_cols=408 Identities=13% Similarity=0.102 Sum_probs=211.8 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccch-h----hHHHHHhhhhhhHhhHhhhhhcce Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEW-Q----SEAWDFSESIGELSYYISWRANSC 75 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~W-Q----~eAW~~yd~VgELryyvgWr~~s~ 75 (637) |-= .|...+...... ++ +.- |-...+|...+.+ . ..|. .++-+.-.|.-++++| T Consensus 1 ~~~------~r~~~~~~~~~~--~~--------~~~-~~~~~~g~~~s~~~~~vt~~~al----~~~~v~~~v~~ia~~i 59 (419) T protein:vir:14 1 MFF------SRQLLSNLGQTQ--MS--------AGG-WVSALLGSSRSDSGQVVTPASAL----ALTVLQNCVTLLAESI 59 (419) T ss_pred Ccc------cccccccccccc--cC--------cch-hhHHhhcCCCccCCcccchHHhh----ccHHHHHHHHHHHHhh Confidence 321 111111100000 00 000 1111222111111 1 1122 2334566677899999 Q ss_pred eeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccc Q lcl|NC_021303. 76 SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLA 155 (637) Q Consensus 76 Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~ 155 (637) |.+.|..-+-+.+... ++.+ +.+..+.+.=..--+-..++++.++.+|-+-|+.|+.+.-...|. T Consensus 60 A~lp~~~~~~~~~~~~-----~~~~----~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~------ 124 (419) T protein:vir:14 60 AQLPIELYERSGEDRK-----PATD----HPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGV------ 124 (419) T ss_pred ccCceEEEEecCCccc-----cccc----cHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc------ Confidence 9999988777755322 2222 456666665556668888999999999999999988865333332 Q ss_pred cccccceeeeHHHhcc---CCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_021303. 156 APRARWYAVTREEIKS---KAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (637) Q Consensus 156 ~~~~~W~~vt~~Ei~~---k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 232 (637) +. ..+.+..+-+.. ..+...+ .. .|... | ..+.|+++=.+ +..-..-.||+..+...+.-.....+... T Consensus 125 -~~-~l~pl~~~~v~v~~~~~~~~~y-~~-~~~~~-~--~~~~i~h~~~~-~~dg~~G~s~i~~~~~~i~~~~~~~~~~~ 196 (419) T protein:vir:14 125 -IQ-GLYPLDNEAVTVMRGSDLKPVY-RV-RGSDP-M--PQRLVHHVRWM-SINGYTGLSPVLLHANAIGHAQAIQQYAG 196 (419) T ss_pred -EE-EEEEecCceEEEEECCCceEEE-EE-ccCcc-c--chhheeEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 12 233333333321 1222222 11 11111 1 11334443222 22334566888777776665555556656 Q ss_pred HHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_021303. 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (637) Q Consensus 233 na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~v 312 (637) +..+.-..-.|||-+|+.+.-- ......+.|.+.+.+ .+..-+. +--++++ T Consensus 197 ~~f~ng~~p~gil~~~~~~~~~---------------------~~~~~~~~~~~~~~~----~~~g~~n--ag~~~vl-- 247 (419) T protein:vir:14 197 KSFMNGTALSGVIERPKDAPAL---------------------KDQASVDRITDGWNA----KFGGSGN--AKKVALL-- 247 (419) T ss_pred HHHhccCCccEEEEecCCCCcc---------------------cCHHHHHHHHHHHHH----HhcCccc--cCCceec-- Confidence 6555556667788776632110 112244555555443 2222222 1222222 Q ss_pred chHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhH Q lcl|NC_021303. 313 AAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYND 392 (637) Q Consensus 313 P~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~ 392 (637) ++. -+++-|.+. ..+.--+++|+-.+..+|..+=|||..|-+..+++.-+.-+....-++--|.|.+..|+++|+.. T Consensus 248 ~~g--~~~~~l~~~-~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~f~~~~L~P~~~~ie~~l~~k 324 (419) T protein:vir:14 248 QEG--MTFRPLSMT-NVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAKTRD 324 (419) T ss_pred CCC--ceEEEccCC-hhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 222 355555553 23444578999999999999999887664445566666667776777888999999999999998 Q ss_pred HHHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHh Q lcl|NC_021303. 393 ILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVV 469 (637) Q Consensus 393 ~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v 469 (637) +|.+... ..|.+.||.+.|. ++|..+.+ ..+++.|.+|-.-.|+.+|++.-.|=| T Consensus 325 ll~~~~~------~~~~i~fd~~~l~-r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD--------------- 382 (419) T protein:vir:14 325 LLLPSER------KQYFIEYNLAGLL-RGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGD--------------- 382 (419) T ss_pred ccCcccc------CCeEEEEechhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC--------------- Confidence 8755322 3689999999983 45554433 236789999999999999998544312 Q ss_pred cCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccCCCC Q lcl|NC_021303. 470 TKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEAASLN 533 (637) Q Consensus 470 ~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~a~~~ 533 (637) .-+.|+ .++..+-|.+ .+... .+..+..+++ . .-.++ T Consensus 383 -------~~~~~~---n~~~~~~~~~--------~~~~~-----~~~~~~~~~e--~--~~~l~ 419 (419) T protein:vir:14 383 -------IYLSPM---NMVDASKPQQ--------LPVGK-----SEPTKAAIDE--I--GRILS 419 (419) T ss_pred -------eeeecc---cccccccccc--------ccCCC-----CCCccccccc--h--hcccC Confidence 001110 0111110000 00000 0000000100 0 00111 No 42 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=99.06 E-value=6.2e-11 Score=76.55 Aligned_cols=400 Identities=12% Similarity=0.074 Sum_probs=211.6 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhccccc--ccchhhHHHHHhhhhhhHhhHhhhhhcceeee Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTA--RNEWQSEAWDFSESIGELSYYISWRANSCSRT 78 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~--r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~ 78 (637) |- .|++.+...... + ..+ ..+-.+.+++. ..-+...|. ...-++-.+.-+++.+|++ T Consensus 1 m~------f~~~~~~~~~~~--~-------~~~--~~~~~~~g~~~~~~~v~~~~al----~~~~v~~~i~~ia~~ia~l 59 (409) T protein:vir:10 1 ML------FRKGFKNQSQEI--S-------IDD--KKILEWLGINPSETYVNGKSCL----KQATVFGCIRILSDNISKL 59 (409) T ss_pred Cc------ccccccCcCCCC--C-------CCh--HHHHHHhcCCcCcceechhhhh----ccHHHHHHHHHHHHhhhhC Confidence 43 333333221110 1 111 12222211110 000111222 2234566677789999998 Q ss_pred EEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccc Q lcl|NC_021303. 79 TLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPR 158 (637) Q Consensus 79 rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~ 158 (637) .+..=+-+ |.++ .++ .+.+..+++.--.--+-..++++.++.+|-+-|+.|+.+.-...|.+. +..+-. T Consensus 60 p~~~~~~~-~~~~-----~~~----~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~-~L~~i~ 128 (409) T protein:vir:10 60 PIKIYQKK-DGIK-----RVP----DHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIK-GLYPLK 128 (409) T ss_pred ceEEEEec-CCee-----ecc----CchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEE-EEEEEc Confidence 77553322 3222 122 245556655444555777889999999999999999997655555321 111111 Q ss_pred ccceeeeHHHhc-cCCCce--eEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_021303. 159 ARWYAVTREEIK-SKAGET--AEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (637) Q Consensus 159 ~~W~~vt~~Ei~-~k~g~~--~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 235 (637) ..+..+..++-- .+.... .......|...+|... -||++=++++. ...--||+.++.+.+....-+.+...+.. T Consensus 129 ~~~V~v~~~~~~~~~~~~~~~y~~~~~~g~~~~~~~~--evih~r~~~~d-~~~G~s~i~~~~~~i~~~~~~~~~~~~~f 205 (409) T protein:vir:10 129 SDGMKIFVDDTGLLNSENNVWYLYTDDLGQRHKFMSD--EILHFKGLTAD-GLAGLSVIELLNHLIENGKSSETYLNNFF 205 (409) T ss_pred CCceEEEEcCCccccccceEEEEEEeCCceeEEeccc--cEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 122222222100 011112 2234455666666543 34444233332 23456887777766666555555555555 Q ss_pred HhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_021303. 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (637) Q Consensus 236 ~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~E 315 (637) +.-..-.|||-+|+.++ ....+.+.+.+.+.-. ..++ +--++|+ ++. T Consensus 206 ~ng~~~~gil~~~~~l~-------------------------~e~~~~~~~~~~~~~~-g~~n-----~~~~~vl--~~g 252 (409) T protein:vir:10 206 KNGLQVKGLVQYAGDLN-------------------------PEAEEVFKENFERMSS-GLKN-----AHRIAML--PIG 252 (409) T ss_pred hccCCCcEEEEcCCCCC-------------------------HHHHHHHHHHHHHHhc-cccc-----cCCceec--CCC Confidence 55455567776665332 1134455555543221 1122 1223332 333 Q ss_pred HhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeechhHHHHHHHHHhHHH Q lcl|NC_021303. 316 HLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDIL 394 (637) Q Consensus 316 hi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~L 394 (637) -+++-|.+...-.. -+++|+-.+..+|..+-|||..| |. +++|.-++.+....-++-.|.|.++.|+++|++..| T Consensus 253 --~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~kL~ 328 (409) T protein:vir:10 253 --YKFEPISQKLVDAQ-FLENSQLTIRQIASVFGVKMHQL-NDLDRATHSNITEQNREFYIDTLQSILNMYELEINYKLF 328 (409) T ss_pred --ceEEEccCChhhHH-HHHHHHHHHHHHHHHhCCCHHHc-CCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 35666666533222 37899999999999999999855 65 567777777777777888899999999999987765 Q ss_pred HHHHHHhCCChHHeEEeecCcccccCCCC---CHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcC Q lcl|NC_021303. 395 TPLLAREGIDPTKYILWYDASGLTSDPDL---SDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTK 471 (637) Q Consensus 395 r~~L~~eGiDp~kYvvw~DaS~Lt~dPD~---tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~ 471 (637) -..-. +..|-+-||.+.| ..+|. .+....++..|++|-.-.|+.+|++.-.+-|- . T Consensus 329 ~~~~~-----~~~~~~~fd~~~l-l~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~----~----------- 387 (409) T protein:vir:10 329 LISEI-----KNGFYSKFNVDTI-LRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDV----L----------- 387 (409) T ss_pred Cchhc-----cCCcEEEEechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe----e----------- Confidence 32211 2557788999998 33443 33345688889999999999999975443230 0 Q ss_pred CchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_021303. 472 NPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQRE 517 (637) Q Consensus 472 ~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~E 517 (637) +.| .... |-+...+ +....|++ T Consensus 388 -------~~~--------------~n~~-~~~~~~~--~~~kgGe~ 409 (409) T protein:vir:10 388 -------LIN--------------GNMI-PVKMAGE--QYSKGGEK 409 (409) T ss_pred -------eec--------------cCcc-chhhccc--cccccCCC Confidence 001 0000 1111111 11112222 No 43 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.05 E-value=1.4e-10 Score=74.69 Aligned_cols=413 Identities=13% Similarity=0.113 Sum_probs=188.7 Q ss_pred HHHHhhhhhhHhhHhhhhhcceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccC---------cccHHHH Q lcl|NC_021303. 54 AWDFSESIGELSYYISWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADG---------PLGQAAL 124 (637) Q Consensus 54 AW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG---------~lGqaqL 124 (637) -=++.+.-+-++=.|.=+++.++.+-|.+-.-+...+ .. ..++...+..+.....--. ..-+.++ T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~-~~-----~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~ 74 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAED-PD-----RDGEQYERVWDFWFGDDSNWQVGPMESERATATNV 74 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCccc-cc-----chhhhhhhHHHHhhccCCCccccchhhHhhHHHHH Confidence 1122222344666677788888887765422221111 10 0111111122211111111 1235688 Q ss_pred HHHHHhhhcccccEEEEEEeecCCccccccccc--------------------cccceeeeHHHhccCCCceeEE----- Q lcl|NC_021303. 125 IKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAP--------------------RARWYAVTREEIKSKAGETAEI----- 179 (637) Q Consensus 125 lkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~--------------------~~~W~~vt~~Ei~~k~g~~~~i----- 179 (637) ++.++.++.+-|.+||.+.-...|.+. +...= ...||....+.......+.... T Consensus 75 ~~~~~~~l~l~Gn~~i~~~r~~~G~~~-~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (467) T protein:vir:31 75 LQTAWTDYEAIGWLTIEILTQTDGTPT-GLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVDA 153 (467) T ss_pred HHHHHHHHHhcCCeEEEEEECCCCcEE-EEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceeeeeeee Confidence 999999999999999987755455321 11100 0111111111111100010000 Q ss_pred -ecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhc----CceeeecccCCCC Q lcl|NC_021303. 180 -SLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMN----NGVLFVPAEMSLP 254 (637) Q Consensus 180 -~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~g----nGvlfvPqe~slP 254 (637) ..+.|..+.| + .+=|+++=.+++..-..--||+.+++..+. +........++.+.+ .|||.++..+. T Consensus 154 ~~~~~~~~~~~-~-~~diih~r~~~~~~~~~G~s~~~~~~~~i~----~~~~~~~~~~~~f~ng~~p~gil~~~~~~l-- 225 (467) T protein:vir:31 154 DDGSTGTSVSN-P-ANELIFKRNHSPLYPHYGAPDIIPAVKTIR----GDSAAQDYNIDFFENDGVPRIAIIVKGAEL-- 225 (467) T ss_pred ccccccceeEe-c-cccEEEecCCCCCCCcccccHHHHHHHHHH----HHHHHHHHHHHHHhccCCCceEEEecCcCC-- Confidence 0011222222 1 233456656777777778899998877653 333333333333332 35777764221 Q ss_pred CcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhccc-----CccccccccceeEeechHHhcccceeecC--- Q lcl|NC_021303. 255 AAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAME-----DENSQAAYIPLVASVAAEHLEKVQHIKFG--- 326 (637) Q Consensus 255 ~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~-----De~S~AA~vPiva~vP~Ehi~~ikHlkf~--- 326 (637) .....+.+.+.+..--..+.+ -++...+--++++....+ .+-.-++|. T Consensus 226 ----------------------~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~--~~~~~~~~~~ls 281 (467) T protein:vir:31 226 ----------------------TEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGAD--RSDVEIRLEPLT 281 (467) T ss_pred ----------------------CHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCc--ccccceeEEecc Confidence 011333444444322111111 011112222344444333 111122332 Q ss_pred --cchhHHHHhhHHHHHHHHHhhcCCchhHhhccCC-ccee-eeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhC Q lcl|NC_021303. 327 --NEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSK-GNHW-SAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREG 402 (637) Q Consensus 327 --~dvtevaiktR~daI~RlAmglDv~pErLLGls~-~NHW-sAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eG 402 (637) +..+.--+++|+..+..+|...-|||. |||+++ +|-. ++-+....-++-.|.|.+..|+++|+..++...+ T Consensus 282 ~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~-~lG~~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~---- 356 (467) T protein:vir:31 282 VGIDEEASFLEFRGRNEHDILKVHDVPPV-IAGVVESGAFSTDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGL---- 356 (467) T ss_pred ccChhhHHHHHHHHHHHHHHHHHhCCCHH-HcccCCCCCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhh---- Confidence 333445589999999999999999985 679854 3422 3455566666777999999999999998764432 Q ss_pred CChHHeEEeecCcccccCCCCCHH---HHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHH Q lcl|NC_021303. 403 IDPTKYILWYDASGLTSDPDLSDE---AVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMY 479 (637) Q Consensus 403 iDp~kYvvw~DaS~Lt~dPD~tde---A~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~ 479 (637) +-..|-+-||.+.|. ..|..+. ...+++.|.+|-.-+|+.+|++.-. | + .+.+ . T Consensus 357 -~~~~~~i~f~~~~l~-~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~--d----~--------------~~~~-~ 413 (467) T protein:vir:31 357 -DAPDWTIEFELAKPD-TKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFP--E----E--------------HVYG-G 413 (467) T ss_pred -ccCCceEEEecchhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--c----c--------------cccC-C Confidence 335677889998884 3333222 3356889999999999999986321 0 0 0000 0 Q ss_pred HhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccCCCCc Q lcl|NC_021303. 480 APLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEAASLND 534 (637) Q Consensus 480 apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~a~~~~ 534 (637) .++. +.+.+-..|.......+.+..+++.+.-....+.+.+.+......+-+++ T Consensus 414 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (467) T protein:vir:31 414 ETLV-AEVTGGSGPGGGIGDQIEQLVEDRADEIIDSYQADLETEQLIEIGANADS 467 (467) T ss_pred cccc-cccccccCCCCcccCcCCCCCCCcccchHhhhhhccccchhhhhccccCC Confidence 0000 11111111111100000000011000000000011111111222222222 No 44 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=99.02 E-value=2.5e-11 Score=78.74 Aligned_cols=362 Identities=14% Similarity=0.126 Sum_probs=193.8 Q ss_pred CCCC----------------cceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHH-----HHhh Q lcl|NC_021303. 1 MAAT----------------SLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAW-----DFSE 59 (637) Q Consensus 1 ma~~----------------~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW-----~~yd 59 (637) -+++ ++=-.|+|..++ .++.-.+.+.. . .|.....|...++ +.+- T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-----------~--~~g~~~~~~~~~~~~~t~~~~~ 76 (409) T protein:vir:83 11 PSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEP-EARALPWIRPT-----------A--WSGYPESWATPSWGSAQDKLRT 76 (409) T ss_pred ccCCCcccccccccccCCCCceeeccCCCcch-hhhhccccccc-----------c--cccccccccccCccccchhhHh Confidence 1111 223355555432 22221111110 1 1111123333332 1122 Q ss_pred hhhhHhhHhhhhhcceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEE Q lcl|NC_021303. 60 SIGELSYYISWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVW 139 (637) Q Consensus 60 ~VgELryyvgWr~~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~w 139 (637) .++-+.-.|.=+++++|.+.|+.-+ + |... + ....+.+.=-.--+-..++++.++.+|.+ |..| T Consensus 77 ~~~~v~acV~~Ia~~iA~lpl~~~~---~-~~~~------~-----~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay 140 (409) T protein:vir:83 77 LIDVAWACIDLNASVLSSMPIYRMR---N-GRII------D-----SVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAF 140 (409) T ss_pred hhHHHHHHHHHHHHhhccCceEEee---C-Cccc------c-----chhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcE Confidence 3444555677789999998776543 1 2211 1 01112221112236778899999999998 9999 Q ss_pred EEEEeecCCccccccccccccceeeeHHHhccCCCceeEEecC-CCCc-ccccC--CCceEEEEecCCcccccCCccchh Q lcl|NC_021303. 140 IAVLIRQEKDPVTGLAAPRARWYAVTREEIKSKAGETAEISLP-DGKT-HEFNR--DLDSLVRIWNPRPRKASQATSPVR 215 (637) Q Consensus 140 i~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~~k~g~~~~i~lP-dG~~-he~~~--~~d~l~RvW~P~prra~eaDSPvr 215 (637) +.++.|...+- +. ..+.|..+. +.+++- ||.. |.+.. ..+-||++=..++..-..--||+. T Consensus 141 ~~~i~r~~~G~------~~-~L~pl~p~~--------v~v~~~~~g~~~y~~~~~~~~~eiiHir~~~~~~~~~G~spi~ 205 (409) T protein:vir:83 141 VLPMAHGSDGY------PI-RFRVVPPWL--------VNVELKKGARREYRIGGLNVTDEILHIRYQGNTADAHGHGPLE 205 (409) T ss_pred EEEEEECCCCc------EE-EEEEECCcc--------eEEEEcCCceEEEEEccccCccceEEeCCCCCCCCcccccHHH Confidence 88777654321 11 333343332 222222 2211 12222 235567663333433334557765 Q ss_pred hhhHHHHHHHhhhHHHHHHHHhHhhcCce-----eeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHH Q lcl|NC_021303. 216 ACLETLREIERTTRKIKNAAKSRVMNNGV-----LFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQ 290 (637) Q Consensus 216 a~l~~LrEI~rttk~I~na~~SRL~gnGv-----lfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~ 290 (637) ++. +.+.+....... ..++..||. |-+|+.+ .....+.|.+-+.+ T Consensus 206 ~~~----~~i~~~~a~~~~-~~~~f~nga~p~gil~~~~~l-------------------------s~e~~~~~~~~~~~ 255 (409) T protein:vir:83 206 SAA----PRQVVIGLLQKY-VQNLAETGGVPLYWLGVERRL-------------------------SETEAVDLMDRWIE 255 (409) T ss_pred HHH----HHHHHHHHHHHH-HHHHHhcCCCcceEeecCCCC-------------------------CHHHHHHHHHHHHH Confidence 554 444444443333 345555543 3333311 11234445444432 Q ss_pred HHhhcccCccccccccceeEeechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccC-Ccceee---e Q lcl|NC_021303. 291 ASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMS-KGNHWS---A 366 (637) Q Consensus 291 va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls-~~NHWs---A 366 (637) +..+ .+-=|+|+..-++ .-|-+.+..+ +.--+++|+-.+..+|.-.-||| .|||+. +++.|+ . T Consensus 256 ----~~~~----nag~~~il~~g~~---~~~~~~~s~~-d~q~le~r~~~~~eIa~~fgVPp-~llg~~~~~~~~tysn~ 322 (409) T protein:vir:83 256 ----SRSK----YAGHPALVTGGAT---LNQAKSMSAQ-DLSLMELTQFNEARIAILLGVPP-FLVGLPGATGSLTYSNI 322 (409) T ss_pred ----hhCC----ccCccceecCCcc---cccccCCCHH-HHHHHHHHHhhHHHHHHHhCCCH-HHccCCCCccccccccH Confidence 2222 2223445443222 2233444432 22247889999999999999987 788984 566665 5 Q ss_pred EEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHH---HHHhcCCcCHHHHHH Q lcl|NC_021303. 367 WAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAV---EAHDRGAITSAALRR 443 (637) Q Consensus 367 W~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~---~a~drGaIt~eAlrr 443 (637) -|....-++..|.|.+..|+++|++.+|.. . |-+-||.+.|. .+|..+.+. .+.+.|.+|-.-.|+ T Consensus 323 eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~---------~-~~~~f~~~~ll-r~d~~~r~~~~~~~~~~G~lT~NE~R~ 391 (409) T protein:vir:83 323 EQLFSFHDRSSLRPKATAVMAALDRWALPS---------P-QHLELNRDDYT-RPSLVERATAYKIMIEAGVMEPNEARA 391 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhCCC---------C-cEEEeehhhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHH Confidence 777777788899999999999999977642 2 34689988873 444443332 356679999999999 Q ss_pred HhcCccccCCCCCchHHH Q lcl|NC_021303. 444 LLNVGEDSGYDLTTLDGC 461 (637) Q Consensus 444 ~lgl~~d~~yd~~t~eg~ 461 (637) .+|++-.+|-|=-|.-|+ T Consensus 392 ~~glpp~~ggd~l~~~gv 409 (409) T protein:vir:83 392 MERLHSEAAAVRLSGGGV 409 (409) T ss_pred HhCCCCCCCCcccCCCCC Confidence 999998888665555566 No 45 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=99.00 E-value=2.7e-10 Score=73.02 Aligned_cols=374 Identities=12% Similarity=0.100 Sum_probs=198.9 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.==+.+.. .|.++..... ..++ .+-...+|.....|=+ ++-+-..+-+.-.+.-+++.||.+.+ T Consensus 1 Mg~~~~~~~--~k~~~~~~~~---------~~~~--~~~~~~~~~~~~~~v~--~~~~l~~~~v~~~i~~ia~~ia~~~~ 65 (383) T protein:vir:10 1 MGLLTPKNF--SKRNAKNMVY---------PSNP--AFFTTTVGGMQLSYVS--ALSALQNTNVYSVINRIASDVSSAHF 65 (383) T ss_pred CCccccccc--cccccccccc---------ccch--hhhhhhccCccccccc--hhHhhcchHHHHHHHHHHHhhccCce Confidence 443222111 1222110000 0010 1111111111111111 11112234456667778999999887 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+-..+ .+.+. .---+-..++++.++.+|-+-|++|+.+.-.+-+ T Consensus 66 ~~~~~~~~--------------------~ll~~-PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~~~------------ 112 (383) T protein:vir:10 66 KTENTATL--------------------NRLES-PSSLIGRFSFWQGALMQLCLSGNDYIPLVGQNLE------------ 112 (383) T ss_pred eecccchh--------------------hhhhC-CCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCcee------------ Confidence 54321111 11111 1112566788999999999999999987532111 Q ss_pred ceeeeHHHhc--cCCCce-eEE-ecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHH Q lcl|NC_021303. 161 WYAVTREEIK--SKAGET-AEI-SLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAK 236 (637) Q Consensus 161 W~~vt~~Ei~--~k~g~~-~~i-~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~ 236 (637) .+.++..-|+ ....+. ..+ ...+|...+|....=+-||.++|.......--||+.+|...+.=...+.+...+..+ T Consensus 113 ~~p~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ 192 (383) T protein:vir:10 113 HIPNSDVQINYLPGNMGIVYTVLESNDRPKMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAME 192 (383) T ss_pred EeecCcceEEEEEcCCceEEEEEEcCCceEEEEcccceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 1122221122 112222 222 333555566655333336767776665556679999998888777777777666666 Q ss_pred hHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHH Q lcl|NC_021303. 237 SRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEH 316 (637) Q Consensus 237 SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Eh 316 (637) .-.+-.|||.+|+.++ .....+.+.+.+-+. +.-.++- -|+++ ++. T Consensus 193 ng~~~~~il~~~~~~~------------------------~~e~~~~~~~~~~~~----~~~~n~~---~~~vl--~~g- 238 (383) T protein:vir:10 193 NQINPAGKLTISNYLS------------------------DGKDLESAREEFEKA----NTGDNSG---RLMVL--PDG- 238 (383) T ss_pred ccCCcceEEEeCCCCC------------------------CHHHHHHHHHHHHHH----hCccccC---Ccccc--CCC- Confidence 6667778888876432 111444565555433 1111111 22333 222 Q ss_pred hcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccC---CcceeeeEEeccCceeEeechhHHHHHHHHHhHH Q lcl|NC_021303. 317 LEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMS---KGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDI 393 (637) Q Consensus 317 i~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls---~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~ 393 (637) .+++.|.+.......-.++|+..+..+|..+-|||..| |.+ +.++-++-|+... +.--|.|.+..|+++|++.+ T Consensus 239 -~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~l-g~~~~~~~~~sn~eq~~~~-~~~~l~P~~~~ie~~l~~~l 315 (383) T protein:vir:10 239 -FDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDIL-GGGTSTESQHSNIDQIKAT-YLANLNSYVNPIVDELRLKM 315 (383) T ss_pred -ceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHc-CCccCCCCccccHHHHHHH-HHHHHHHHHHHHHHHHHHhh Confidence 46777766554444445899999999999999988765 553 3445555555433 33359999999999999877 Q ss_pred HHHHHHHhCCChHHeEEeecCcccc-cCCCCCHH-HHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcC Q lcl|NC_021303. 394 LTPLLAREGIDPTKYILWYDASGLT-SDPDLSDE-AVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTK 471 (637) Q Consensus 394 Lr~~L~~eGiDp~kYvvw~DaS~Lt-~dPD~tde-A~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~ 471 (637) |. |-+.||...|. .|+..--+ ...+++.|.+|-.-.|+.+|+.--.+-| T Consensus 316 ~~------------~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d----------------- 366 (383) T protein:vir:10 316 NA------------PDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDN----------------- 366 (383) T ss_pred CC------------ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCc----------------- Confidence 53 23788888763 33332222 4467788999999999999875322111 Q ss_pred CchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCC Q lcl|NC_021303. 472 NPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDE 508 (637) Q Consensus 472 ~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~ 508 (637) .|......++....++| T Consensus 367 --------------------~~~~~~~~~~~~gGd~e 383 (383) T protein:vir:10 367 --------------------LPEFKPLTNETKGGDDK 383 (383) T ss_pred --------------------ccccCCCcccCCCCCCC Confidence 11111111122222222 No 46 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.98 E-value=1.2e-09 Score=69.48 Aligned_cols=477 Identities=14% Similarity=0.130 Sum_probs=226.6 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchh-----hHHH-HHhhhhhhHhhHhhhhhcc Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQ-----SEAW-DFSESIGELSYYISWRANS 74 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ-----~eAW-~~yd~VgELryyvgWr~~s 74 (637) |.-.+|-=.++-|++.. + +.++...+ .+|= -++. ++|...+-++--|.=+++. T Consensus 6 ~~~~~~~~~~~~~~~~~----------~-------~~~~~~~~----~~~~~pp~~~~~La~~~~~n~~v~scI~~ia~~ 64 (540) T protein:vir:41 6 LSIKSLEKYRAIKGDTD----------S-------QALKEDRF----EEYVEPKVHPLVLLSLLQVNPYHASACSIKAND 64 (540) T ss_pred cChhhccchhhhhcccc----------c-------cccccCCC----CccccCCCCHHHHHHHHHhcHHHHHHHHHHHHH Confidence 43333221111111100 1 11111000 1110 0000 3333334445555666777 Q ss_pred eeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccc Q lcl|NC_021303. 75 CSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGL 154 (637) Q Consensus 75 ~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~ 154 (637) ++.+-+..-. +.+ .+. +. +-.-..-..++++.++.++.+-|.+|+.+.-...|.+ T Consensus 65 ia~~~~~i~~---~~~----~~~-----------~~---lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~---- 119 (540) T protein:vir:41 65 ILRTGYLIDG---DDG----GVE-----------EL---LRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEP---- 119 (540) T ss_pred HhcCCceEec---Ccc----chh-----------hh---ccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcE---- Confidence 7777665411 111 111 10 1122244568899999999999999998765544431 Q ss_pred ccccccceeeeHHHhccCCCceeEEecCCCCccccc-------------------CCCceEEEEecCCcccccCCccchh Q lcl|NC_021303. 155 AAPRARWYAVTREEIKSKAGETAEISLPDGKTHEFN-------------------RDLDSLVRIWNPRPRKASQATSPVR 215 (637) Q Consensus 155 ~~~~~~W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~-------------------~~~d~l~RvW~P~prra~eaDSPvr 215 (637) . ..+.|...-++....+...+.+.||....+. -..+-||++=+++|..-..--||+. T Consensus 120 ---~-~L~~i~~~~V~v~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~ 195 (540) T protein:vir:41 120 ---V-RLDYIPAHTVRVHRDGSRYMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGVPRYL 195 (540) T ss_pred ---E-EEEEeCCcceEEeEcCceeEeeecCceeeeeecccccceeeccccccceeecccceEEecCCCCCCCcccccHHH Confidence 2 2222333333321111112222233211110 0123466665666766667889999 Q ss_pred hhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhc Q lcl|NC_021303. 216 ACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAA 295 (637) Q Consensus 216 a~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aa 295 (637) +++..+.-..-..+...+.-+.-..-.|||.+|..++-+..-.. .....+.++|-+.-+.+ T Consensus 196 ~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~-------------------~~~~~~~~~~~~~~~~~ 256 (540) T protein:vir:41 196 SAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGS-------------------DGEPTGRTVLQGLIEDN 256 (540) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccch-------------------HHHHHHHHHHHHHHHHH Confidence 88887766655555554444444455578888876554443311 12233444444444444 Q ss_pred ccCccccccccceeEeechHHhcccceeecCcchhHH-HHhhHHHHHHHHHhhcCCchhHhhccC---CcceeeeEEecc Q lcl|NC_021303. 296 MEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMS---KGNHWSAWAIGD 371 (637) Q Consensus 296 i~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvtev-aiktR~daI~RlAmglDv~pErLLGls---~~NHWsAW~I~d 371 (637) +.. ....+-.|+|+..|+.--+.++...++....+. -+++|+..+..+|...-|||. +||+. +.|.-++-+... T Consensus 257 ~~g-~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~-~lG~~~~~~~n~sn~eq~~~ 334 (540) T protein:vir:41 257 FKY-LKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPY-RLGITDVGPLGGNFAEVARR 334 (540) T ss_pred hcc-ccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHH-HcCcccCCCCCcccHHHHHH Confidence 332 123567889998886444556666665443333 478999999999999999986 56974 355566677766 Q ss_pred CceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHh-cCccc Q lcl|NC_021303. 372 EDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLL-NVGED 450 (637) Q Consensus 372 edVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~l-gl~~d 450 (637) .-++.-|.|.+..|+++|++.++. ..+ ..|-+.||.+.| .++|+......++..|++|-.-.|..| |+.. T Consensus 335 ~f~~~tL~P~~~~ie~~ln~~L~~----~~~---~~~~i~f~~~~l-l~~D~~~~~~~lv~~G~lT~NE~Re~L~g~e~- 405 (540) T protein:vir:41 335 TYYESVVRPQQEIVSSVLTDFIQL----KLD---PGARFVFNEEIL-MESEFVHNYALLVQCGVLTPSEVREKLFGLDG- 405 (540) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhh----ccC---CceEEEecchhh-cchHHHHHHHHHHhCCCCCHHHHHHHhCcCcC- Confidence 667777999999999999985543 222 247789999997 456655444457788999998888654 5542 Q ss_pred cCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCC----------CCC-CCCCCCC-CCCc Q lcl|NC_021303. 451 SGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTRE----------EDD-EDSGARQ-QREP 518 (637) Q Consensus 451 ~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~----------~~d-~~~~a~~-g~EP 518 (637) +-|. -+.+.-.++. .+.+-+...+..+..+++. ++. ++..+++ +.++ T Consensus 406 -gdd~------------------~l~p~n~~~~--~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (540) T protein:vir:41 406 -GPDM------------------FMVPSSIGKS--AMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKKKI 464 (540) T ss_pred -CCcc------------------cccccccccc--cccccccccCCCCccccccccchhcccccCccccccccccccccc Confidence 2110 0101000110 1111111111111111111 000 0111111 2223 Q ss_pred cCCCCCCCcccCCCCcchHHHHHHHHHHHHHHHHhcc--cccC---CCchhhhhHhhcCc-----------hhhhhhhcC Q lcl|NC_021303. 519 QTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGK--RRFK---VNDAALKTKLRDVP-----------AHEYHRVLP 582 (637) Q Consensus 519 dted~~~~~~~a~~~~~a~~~aa~~llV~rALelAGk--Rr~~---~~~~~~~~rlr~ip-----------~h~~h~~~~ 582 (637) |.....-..++. . .+-- .+-|.|+++.--- |-.. ..+..+ .|..++. .|-+++++ T Consensus 465 ~~~~~~~~~~~~--~--~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~- 535 (540) T protein:vir:41 465 DEVLSDFRAEAY--E--NGKK---MLSIAGDMGTMSAINRGVSMIPPKPSNL-EAYEDLLAASVDDIVERIRHYLYKVI- 535 (540) T ss_pred cccccccCCccc--c--chhH---HHHHhhhhhhhhhhhcCceecCCCCcch-HHHHHHHHhhHHHHHHHHHHHHHHHh- Confidence 221110011111 1 1100 1445555554321 1110 001111 1111111 23334433 Q ss_pred CCCHHHHHHHHhccccc Q lcl|NC_021303. 583 PVRSSEIPRLIAGWDTA 599 (637) Q Consensus 583 PV~~~~v~rLi~GWd~~ 599 (637) ||-.. T Consensus 536 ------------~~~~~ 540 (540) T protein:vir:41 536 ------------GWREL 540 (540) T ss_pred ------------hhccC Confidence 45555 No 47 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=98.98 E-value=7.9e-10 Score=70.49 Aligned_cols=408 Identities=12% Similarity=0.109 Sum_probs=211.7 Q ss_pred CCCCc-ceEEecCCCCCcccccchheehhccccchhhhhhhhccc---ccccchhh-HHHHHhhhhhhHhhHhhhhhcce Q lcl|NC_021303. 1 MAATS-LRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMG---TARNEWQS-EAWDFSESIGELSYYISWRANSC 75 (637) Q Consensus 1 ma~~~-lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g---~~r~~WQ~-eAW~~yd~VgELryyvgWr~~s~ 75 (637) |--.. ++-.+ +++++.+ .|.....| +....... -....+ .++-+.-.+.-+++++ T Consensus 1 m~~~~~~~~~~-----------~~~s~~~--------~w~~~~~~~~~~~~~~g~~vt~~~al-~~~~v~~~i~~Ia~~i 60 (421) T protein:vir:10 1 MFIPQMFEGKK-----------RSVSGGG--------FWEAMLGGVRSSHSKAGVMITPETAL-ALSAVRACVTLLAESV 60 (421) T ss_pred CCCcchhcccc-----------cccCcch--------hhHHHhhhhccCcccCCceechHHhh-ccHHHHHHHHHHHHhh Confidence 33221 11111 1121110 01111000 00000000 000111 3455666788899999 Q ss_pred eeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccc Q lcl|NC_021303. 76 SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLA 155 (637) Q Consensus 76 Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~ 155 (637) |.+.+..=+-+.|.+.- .+.+ +.+..+.+.=...-+-..++++.++.+|-+-|+.|+.+.-...|. T Consensus 61 A~lp~~~~~~~~~g~~~----~~~~----~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~------ 126 (421) T protein:vir:10 61 AQLPVELYRRDKNGGRQ----RATD----HPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGY------ 126 (421) T ss_pred ccCceEEEEEcCCCcee----eccc----chHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCc------ Confidence 99999887777663221 1222 345555554455667788899999999999999998876444443 Q ss_pred cccccceeeeHHHhcc--CCCceeEEecC-CCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_021303. 156 APRARWYAVTREEIKS--KAGETAEISLP-DGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (637) Q Consensus 156 ~~~~~W~~vt~~Ei~~--k~g~~~~i~lP-dG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 232 (637) +. ..+.|..+.+.. ...+.+..... .|. +|.. +-|+++=.++ .....--||+..+.+.+.-..-..+... T Consensus 127 -~~-~L~~l~~~~v~v~~~~~g~~~y~~~~~g~--~~~~--~eiih~~~~~-~d~~~G~spi~~~~~~i~~~~~~~~~~~ 199 (421) T protein:vir:10 127 -PK-ELIPINPKKVIVLKGPDGMPYYEIPEIGE--TLPM--RMMHHVKVFS-LDGYIGSSPIQTNADVLGLNLAVEEHAS 199 (421) T ss_pred -EE-EEEEecCceEEEEECCCceEEEEEcCCCc--EEch--hhEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 12 233333333321 11222333332 232 3322 2344442222 2334567887777766655555555555 Q ss_pred HHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_021303. 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (637) Q Consensus 233 na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~v 312 (637) +..+.-..-.|||-.|+.+.=- .+....+.|.+.+.+- +..-+.+-. ++|+ T Consensus 200 ~~f~ng~~~~gil~~~~~~~~~---------------------~~~e~~~~~~~~~~~~----~~g~~n~~~--~~vl-- 250 (421) T protein:vir:10 200 AVFRRGATMSGVIERPKEAPAI---------------------KSQEKIDQLLAKWTDR----YSGINNMFS--VALL-- 250 (421) T ss_pred HHHhcCCCccEEEEecCccCcc---------------------CCHHHHHHHHHHHHHH----hcCccccCc--ceec-- Confidence 5555555556777776532110 1122444455444433 222122222 2222 Q ss_pred chHHhcccceeecCcchhHH-HHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeechhHHHHHHHHH Q lcl|NC_021303. 313 AAEHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIY 390 (637) Q Consensus 313 P~Ehi~~ikHlkf~~dvtev-aiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~P~me~ic~Ait 390 (637) ++. -+++.|.+.. .+. -+++|+-.+..+|...=|||.- ||. +++|.-+.-+....=++--|.|.+..|+++|+ T Consensus 251 ~~g--~~~~~l~~~~--~d~q~~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~t~sn~e~~~~~f~~~tl~P~~~~ie~~ln 325 (421) T protein:vir:10 251 QEG--MSYKQMSQDN--EKAQLLQSRQWGVEEVCRLYKIPPHM-VQMLAKATNNNIEHQGLQFVMYTLLAWLKRHEGALQ 325 (421) T ss_pred CCC--ceEEecCCCh--hHHHHHHHHHHhHHHHHHHhCCCHHH-cCCCcCCccccHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 333 3555555443 333 4789999999999999999865 576 45776666666666778889999999999999 Q ss_pred hHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHH---HHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHH Q lcl|NC_021303. 391 NDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAV---EAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAAD 467 (637) Q Consensus 391 ~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~---~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d 467 (637) ..+|.+- ++ ..|.|-||.+.| ..+|..+.|. .+++.|.+|-.-.|+.+|++.-.+=| +. T Consensus 326 ~kL~~~~---~~---~~~~v~fd~~~l-~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD----~~------- 387 (421) T protein:vir:10 326 RDLLLPS---ER---RDLYIEFNVSGL-LRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGD----KY------- 387 (421) T ss_pred hhccCcc---cc---CCeEEEEechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc----ee------- Confidence 9877542 21 257789999998 3455555443 35678999999999999998544322 00 Q ss_pred HhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcc Q lcl|NC_021303. 468 VVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEE 528 (637) Q Consensus 468 ~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~ 528 (637) +.|+- +...+ ...+++.. +..+++..+|+-.+.. T Consensus 388 -----------~~~~n---~~~~~------~~~~~~~~-------~~~~~~~e~d~~~~~~ 421 (421) T protein:vir:10 388 -----------LTPLN---MVDSA------QIIPGDKK-------PTAQQMAEIDTILSRT 421 (421) T ss_pred -----------eeccc---ccccc------ccccCCCC-------cccccCcccccccccC Confidence 11110 11111 11111111 0111122222211111 No 48 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=98.98 E-value=1.3e-10 Score=74.81 Aligned_cols=338 Identities=12% Similarity=0.118 Sum_probs=186.3 Q ss_pred eeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccc Q lcl|NC_021303. 75 CSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGL 154 (637) Q Consensus 75 ~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~ 154 (637) +|.+-+..-+=+. ...+.+.++.+.=...-+...++++.++.+|-+-|++|+.+.-...|. T Consensus 1 ia~lp~~~~~~~~--------------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~----- 61 (348) T protein:vir:93 1 MASLPLKMYEDYK--------------VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQ----- 61 (348) T ss_pred CcccceEeEecCc--------------CcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc----- Confidence 5555555433111 112446666665456678899999999999999999999875444443 Q ss_pred ccccccceeeeHHHhc---cCCCceeE--EecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhH Q lcl|NC_021303. 155 AAPRARWYAVTREEIK---SKAGETAE--ISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTR 229 (637) Q Consensus 155 ~~~~~~W~~vt~~Ei~---~k~g~~~~--i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk 229 (637) +. ..+.|..+.+. ...++.+. +..++|...+|... -||++=++++..-..--||+..+...+. +.. T Consensus 62 --~~-~L~~l~~~~v~~~~~~~~~~~~y~~~~~~g~~~~~~~~--eiih~r~~~~~~~~~G~s~~~~~~~~i~----~~~ 132 (348) T protein:vir:93 62 --PS-KLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHNM--DMLHFKHIVASNMVQGISPIDVLKNTTD----FDN 132 (348) T ss_pred --EE-EEEEEcCCceEEEEeCCCcEEEEEEEcCCCeEEEEccc--cEEEecCCCCCCceeeccHHHHHHHHHH----HHH Confidence 22 33444444332 13333332 56777877766542 3555556666555555677766654433 222 Q ss_pred HHHHHHHhHh--hcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccc Q lcl|NC_021303. 230 KIKNAAKSRV--MNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIP 307 (637) Q Consensus 230 ~I~na~~SRL--~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vP 307 (637) ...+...+.. .+.|++..++.+ .....+.+.+.|.+. +. ++- . + T Consensus 133 ~~~~~~~~~~~~~~~~i~~~~~~l-------------------------~~e~~~~~~~~~~~~----~~--n~~-~--~ 178 (348) T protein:vir:93 133 AVRTFNLTEMQKPDSFMLKYGSNV-------------------------STEKRQQVLEDFKQY----YE--ENG-G--I 178 (348) T ss_pred HHHHHHHHhcCCCceeEEecCCCC-------------------------CHHHHHHHHHHHHHH----hh--cCC-C--e Confidence 2222211111 112222222211 112345555555443 22 111 1 2 Q ss_pred eeEeechHHhcccceeecCcchhHHH-HhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHH Q lcl|NC_021303. 308 LVASVAAEHLEKVQHIKFGNEVTEVE-IKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLIC 386 (637) Q Consensus 308 iva~vP~Ehi~~ikHlkf~~dvteva-iktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic 386 (637) +|+ ++. -+++.|. ....+.. +++|+-.+..+|..+=|||..|-+.+++|..+..+....-++..|.|.+..|. T Consensus 179 ~vl--~~g--~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~~~~~~~l~P~~~~ie 252 (348) T protein:vir:93 179 LFQ--EPG--VEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYE 252 (348) T ss_pred eec--CCC--ceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHH Confidence 222 322 3444444 4344444 78999999999999999888665556889999888888888889999999999 Q ss_pred HHHHhHHHHHHHHHhCCChHHeEEeecCcccc-cCCCC-CHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHH Q lcl|NC_021303. 387 QAIYNDILTPLLAREGIDPTKYILWYDASGLT-SDPDL-SDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREF 464 (637) Q Consensus 387 ~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt-~dPD~-tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~ 464 (637) ++|++.+|-. .+.+ ..|-+-||.+.|. .|+-. .+-+..+++.|.+|-.-.|..+|+..-.+=| T Consensus 253 ~~l~~~l~~~----~~~~-~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD---------- 317 (348) T protein:vir:93 253 EEFNRKLLTK----TDRE-KNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD---------- 317 (348) T ss_pred HHHHHhhCCc----cccc-CcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcC---------- Confidence 9999887753 1122 2455778999883 33322 2333558888999999999999997433211 Q ss_pred HHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCC Q lcl|NC_021303. 465 AADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDE 523 (637) Q Consensus 465 A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~ 523 (637) . +++...+..++.+. +. +++..|.|..+++. T Consensus 318 ---~------------~~~~~n~~~~~~~~------------~~-~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 318 ---K------------PLISGDLYPIDTPL------------EL-RKSLKGGDKNVNES 348 (348) T ss_pred ---e------------Eeecccccccccch------------hh-cccccCCCCCcCCC Confidence 0 00101111111110 00 01111111111111 No 49 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=98.98 E-value=9.2e-10 Score=70.13 Aligned_cols=413 Identities=13% Similarity=0.168 Sum_probs=196.3 Q ss_pred CCCC-cceEEecCCCC--Cc----ccccchheehhccccchhhhhhhhccccc-ccchhhHHHHHhhhhhhHhhHhhhhh Q lcl|NC_021303. 1 MAAT-SLRVVRRPKGS--AP----AARRRSLTAASQLITDPQKQMKTSLMGTA-RNEWQSEAWDFSESIGELSYYISWRA 72 (637) Q Consensus 1 ma~~-~lr~vrrpk~~--~p----~~~r~~ltAAs~~~~~p~~~~k~~~~g~~-r~~WQ~eAW~~yd~VgELryyvgWr~ 72 (637) |-+- -+=+.=|.|+. +| ...+.+. .++....+.+ ++.. .++ ..=-++.+-..+-+.-.|.-++ T Consensus 1 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~----~~~~~~~g-~~v~~~~a~~~~aV~~~v~~Ia 71 (432) T protein:vir:97 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTF----TPVNATARDL----GIIISDTG-AAVNADAIMRLDAVAACVKLVS 71 (432) T ss_pred CCCcccCchhhhhHhhcCCcccccccccccc----ccCchhhhhh----cccccccC-cccchHhhhcchHHHHHHHHHH Confidence 2211 00011111111 00 0001010 1111111111 1100 000 0001122223455666788899 Q ss_pred cceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccc Q lcl|NC_021303. 73 NSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVT 152 (637) Q Consensus 73 ~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~ 152 (637) +++|++.+..=+-+.| |. ..+.+ +.+..+.+.=-..-+-..++++.++.+|-+-|++|+.+.- .+|. +. T Consensus 72 ~~ia~lp~~~y~~~~~-g~----~~~~~----~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~-~~g~-~~ 140 (432) T protein:vir:97 72 QAVAAMPLMMYMRTPD-GR----KEAVN----HPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVV-TDGR-IE 140 (432) T ss_pred HhhccCceEEEEecCC-Cc----ccccc----cHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEe-cCCc-EE Confidence 9999999887666666 32 22222 3455554443445588889999999999999999987654 3443 11 Q ss_pred ccccccccceeeeHHHhc---cCCCceeE-EecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhh Q lcl|NC_021303. 153 GLAAPRARWYAVTREEIK---SKAGETAE-ISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTT 228 (637) Q Consensus 153 ~~~~~~~~W~~vt~~Ei~---~k~g~~~~-i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rtt 228 (637) + -| .|..+.+. ...|...+ +...+|...+|.. +-|+++=++ +-.-..--||+..+...+.--.... T Consensus 141 ~------L~-~l~p~~v~v~~~~~g~~~y~~~~~~g~~~~~~~--~~iih~r~~-~~dg~~G~spi~~~~~~i~~~~a~~ 210 (432) T protein:vir:97 141 S------LQ-YLANDRLTITTDTKGNTAYRYRRTDGQMIDIPR--QQIWKIMGY-SLDGENGLSAIRYGAQIFGTAIAAE 210 (432) T ss_pred E------EE-EEcCcceEEEEcCCCcEEEEEEecCceEEEEcc--ccEEEecCc-CCCCcccccHHHHHHHHHHHHHHHH Confidence 1 12 23333322 12333333 3444666555543 234555222 2222344566655544433222222 Q ss_pred HHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccce Q lcl|NC_021303. 229 RKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPL 308 (637) Q Consensus 229 k~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPi 308 (637) +...+..+.-....|||-+|+.++ ....+.|.+.+.. ..+.+ =++ T Consensus 211 ~~~~~~f~ng~~~~gil~~~~~l~-------------------------~e~~~~~~~~~~~-----~~nag-----~~~ 255 (432) T protein:vir:97 211 AQAARAFRNGQLQSVYYQIDRFLT-------------------------DDQYDSFSKKVSG-----SVEAG-----RAP 255 (432) T ss_pred HHHHHHHhccCCcceeEecCCCCC-------------------------HHHHHHHHHHHhh-----hhcCC-----Cce Confidence 222222222223346666665332 1133445444321 11111 123 Q ss_pred eEeechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCC-cce-e--eeEEeccCceeEeechhHHH Q lcl|NC_021303. 309 VASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSK-GNH-W--SAWAIGDEDVQLHIKPVMDL 384 (637) Q Consensus 309 va~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~-~NH-W--sAW~I~dedVrlHI~P~me~ 384 (637) |+ ++. -+++-|.+... +.=-+++|+-.+..+|..+-|||. |||..+ ++. | +.-+..-.=++.-|.|.+.. T Consensus 256 vl--~~g--~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~t~~~~s~~e~~~~~f~~~tl~P~~~~ 329 (432) T protein:vir:97 256 LL--EGG--MDVKSLGLNPV-DAQLLQSRQYSVESICRFFGVPPS-MIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRR 329 (432) T ss_pred ec--CCC--ceEEEccCChh-HHHHHHHHHHHHHHHHHHhCCCHH-HcCCcCCcccccchhHHHHHHHHHHHHHHHHHHH Confidence 33 322 34555554332 222378899999999999999885 557642 221 2 12233333455679999999 Q ss_pred HHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHH Q lcl|NC_021303. 385 ICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGC 461 (637) Q Consensus 385 ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~ 461 (637) |+++|++.+|.+.- + .+|.+-||.+.|. .+|..+.| ..++..|.+|-.-.|+.+|++--.|=+ T Consensus 330 ie~~ln~kLl~~~e---~---~~~~~~fd~~~ll-r~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g~~------- 395 (432) T protein:vir:97 330 IEQSIALNLLTPAE---R---RRYFADFDTSALL-RADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNA------- 395 (432) T ss_pred HHHHHhhhccCccc---c---CceEEEeechhhh-ccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCc------- Confidence 99999999887632 2 3688999999983 44444333 347788999999999999997544311 Q ss_pred HHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCC Q lcl|NC_021303. 462 REFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDER 524 (637) Q Consensus 462 r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~ 524 (637) +.+..+-. +.||-... +.++|.++...++ +..-+++. T Consensus 396 -----~~~~~~~~----~~pl~~~~----~~~~~~~~~~~~~-------------~~~~~~~~ 432 (432) T protein:vir:97 396 -----AVLTVQSA----MVPLDSIG----LQASPEPASGLGN-------------QQQDKVSK 432 (432) T ss_pred -----ceEeeccc----ccchhhhc----ccCCCCCCCCCCC-------------cccccccC Confidence 00101111 11211000 0011111110111 11111111 No 50 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=98.97 E-value=1.4e-10 Score=74.61 Aligned_cols=403 Identities=13% Similarity=0.089 Sum_probs=210.8 Q ss_pred CCCCc-ceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeE Q lcl|NC_021303. 1 MAATS-LRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTT 79 (637) Q Consensus 1 ma~~~-lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (637) |.--. =.+++|-|..-. .+ .+..-.....++......+..|. ..+.+-.++-+.-.+.-++++||.+. T Consensus 1 m~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~~~~~~~~~~~~~v--------~~~~a~~~~~v~~~i~~ia~~iA~lp 69 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLI--DN-WIDQSTSKLYDFSPWKNRSFWGV--------INNTLETNETIFSAITKLSNSMASLP 69 (412) T ss_pred CccchhhhhhhhhhhhHh--hh-hhcccccccccccccCCcccccc--------chhhhhccHHHHHHHHHHHHhHhhCc Confidence 43321 144444442210 00 00000111111111101111110 12233455666667888999999998 Q ss_pred EEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccc Q lcl|NC_021303. 80 LIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRA 159 (637) Q Consensus 80 L~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~ 159 (637) +..-+-+.. + .+.+.++.+.=..--+-..++++.++.+|-+-|+.|+.+.-...|. -. T Consensus 70 ~~~~~~~~~-------~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~--------~~ 127 (412) T protein:vir:26 70 LKMYEDYKV-------V-------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQ--------PS 127 (412) T ss_pred eeEeecccc-------c-------cchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCc--------EE Confidence 876542211 1 2345666655455667888999999999999999999876544443 12 Q ss_pred cceeeeHHHhcc---CCCcee--EEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHH Q lcl|NC_021303. 160 RWYAVTREEIKS---KAGETA--EISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNA 234 (637) Q Consensus 160 ~W~~vt~~Ei~~---k~g~~~--~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na 234 (637) .++.|..+.+.. +.++.+ .+...+|...+|.. +=||++=++++..-..--||+.++...+.=...+.+. +. T Consensus 128 ~L~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~--~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~--~~ 203 (412) T protein:vir:26 128 KLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHN--MDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF--NL 203 (412) T ss_pred EEEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEEcc--ccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHH--HH Confidence 455555554432 223333 24566676666654 3456664555555555668876665444322212111 22 Q ss_pred HHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeech Q lcl|NC_021303. 235 AKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAA 314 (637) Q Consensus 235 ~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~ 314 (637) .+....+.||+..++.+ .....+.+.+.+.+. ++..+ - ++|+ ++ T Consensus 204 ~~~~~~~~~i~~~~~~l-------------------------~~e~~~~~~~~~~~~----~~~~g----~-~~vl--~~ 247 (412) T protein:vir:26 204 TEMQKPDSFMLKYGSNV-------------------------GKEKRQQVLEDFKQY----YEENG----G-ILFQ--EP 247 (412) T ss_pred HhcCCCCceEEecCCCC-------------------------CHHHHHHHHHHHHHH----hhcCC----C-eeec--CC Confidence 22222222333222211 112334444444332 22211 1 2222 33 Q ss_pred HHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHH Q lcl|NC_021303. 315 EHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDIL 394 (637) Q Consensus 315 Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~L 394 (637) . -+++.|.+... +.--+++|+-.+..+|-..-|||..|-+.+++|.-++.|....-++..|.|.+..|+++|++.+| T Consensus 248 g--~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kLl 324 (412) T protein:vir:26 248 G--VEIEPLPKKYV-SEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLL 324 (412) T ss_pred C--ceEEEcCCChh-HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 2 35555554432 22348889999999999999988766444678888888888888888999999999999998876 Q ss_pred HHHHHHhCCChHHeEEeecCcccc-cCCC-CCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCC Q lcl|NC_021303. 395 TPLLAREGIDPTKYILWYDASGLT-SDPD-LSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKN 472 (637) Q Consensus 395 r~~L~~eGiDp~kYvvw~DaS~Lt-~dPD-~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~ 472 (637) -.. +.. ..|.+-||.+.|. .|+- +.+....++..|.+|-.-.|+.+|++.-.+=| +.+ T Consensus 325 ~~~----~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD----~~~----------- 384 (412) T protein:vir:26 325 TKT----DRE-KNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD----KPL----------- 384 (412) T ss_pred Ccc----ccc-CcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC----eee----------- Confidence 532 221 3467889999873 3332 23334457788999999999999987544322 000 Q ss_pred chhHHHHHhhhccccccccCCCCcC-CCCCCCCCCCCC Q lcl|NC_021303. 473 PELIAMYAPLLSSQLAGIEFPQPAN-AIESTREEDDED 509 (637) Q Consensus 473 P~Li~~~apLl~~~~~~ie~P~p~~-a~~~~~~~~d~~ 509 (637) +...++.++.|.-.. ...-|+++.++. T Consensus 385 ----------~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 385 ----------ISGDLYPIDTPLELRKSLKGGDKNVNES 412 (412) T ss_pred ----------ecccccccccchhhcccccCCCCCcCCC Confidence 001111111111000 001111111111 No 51 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=98.95 E-value=8.9e-10 Score=70.21 Aligned_cols=393 Identities=17% Similarity=0.160 Sum_probs=191.1 Q ss_pred CCCCcceEEecC-CCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeE Q lcl|NC_021303. 1 MAATSLRVVRRP-KGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTT 79 (637) Q Consensus 1 ma~~~lr~vrrp-k~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (637) |.== ...||. +..++. +.+.... ..+....+.+... .+.+ .+.+.-.+.-+++++|.+. T Consensus 1 Mg~~--~~f~~k~~~~~~~-------~~~~~~~---~~~~~~~~~~~~~-------~~~~-~~~V~~~I~~ia~~iA~~p 60 (403) T protein:vir:80 1 MGLF--NFFRRKTRSEPTN-------AISWFLT---QEAYDTLAIPGYT-------RLSD-NPEVRMAVHKIAELISSMT 60 (403) T ss_pred Cccc--ccccccccccccc-------hhhhhcc---cccccccccchhh-------hhhh-hHHHHHHHHHHHHhhhhCc Confidence 6543 233332 221211 0011100 0000010000000 1111 3556677888999999977 Q ss_pred EEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhccc--ccEEEEEEeecCCccccccccc Q lcl|NC_021303. 80 LIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVV--GEVWIAVLIRQEKDPVTGLAAP 157 (637) Q Consensus 80 L~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVp--GE~wi~il~r~~~~~~~~~~~~ 157 (637) +..-.-.++ |.. .+ .+....+.+.=-..-+-..++++.++.++-.- |.+||.+.-...|. + T Consensus 61 ~~~~~~~~~-g~~----~~-----~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~-------~ 123 (403) T protein:vir:80 61 IHLMQNTDN-GDI----RI-----KNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGL-------I 123 (403) T ss_pred eEEEEecCC-cee----ec-----CChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCc-------E Confidence 765443433 432 21 13345544433445667888999998876554 55666544322232 1 Q ss_pred cccceeeeHHHhcc-CCCceeEEecCCCCcccccCCCceEEEEe-cCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_021303. 158 RARWYAVTREEIKS-KAGETAEISLPDGKTHEFNRDLDSLVRIW-NPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (637) Q Consensus 158 ~~~W~~vt~~Ei~~-k~g~~~~i~lPdG~~he~~~~~d~l~RvW-~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 235 (637) . .++.|...-+.. ...++..+.. +| .+|.. +-||++- ++.|.....--||+.++.+.+.-.....+...+.. T Consensus 124 ~-~L~~l~p~~v~~~~~~~g~~~~y-~~--~~~~~--~eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 197 (403) T protein:vir:80 124 D-ELIPLAPSKVSFVDTDTGYQIWY-QG--KAYNY--DEVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFM 197 (403) T ss_pred E-EEEEEcCCeeEEEEcCCceEEEE-ee--cccch--hhEEEEeccCCCcCccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 233333332221 1111111111 12 12222 3455544 56777777777887766665555554444444444 Q ss_pred HhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_021303. 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (637) Q Consensus 236 ~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~E 315 (637) +.-.+..|||-+|+.++= ...+.+++-+++--..+. -+--++++..++. T Consensus 198 ~ng~~p~~il~~~~~~~~-------------------------~~~~~~~~~~~~~~~~~~------~~g~~~~~~~~~~ 246 (403) T protein:vir:80 198 SGKYMPSLIVKVDAATAE-------------------------LSSEEGRNAVFKKYLEAS------EAGQPWIIPAELL 246 (403) T ss_pred hccCCcceEEEeCCCCCh-------------------------HHHHHHHHHHHHHHhhhh------hcCCeeeeccccc Confidence 444555567766664321 012233333332211111 1223344444444 Q ss_pred HhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCC--cceeeeEEeccCceeEeechhHHHHHHHHHhHH Q lcl|NC_021303. 316 HLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSK--GNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDI 393 (637) Q Consensus 316 hi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~--~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~ 393 (637) ...+++-+.+ .+.--+++|+-.+..+|.-.-||| .+||+++ ...++. =++-.|.|.+..|+++|++.+ T Consensus 247 ~~~~~~~l~~---~d~q~~e~~~~~~~~Ia~~fgVPp-~~lg~~~~~~~~~~~------f~~~~l~P~~~~ie~~l~~kl 316 (403) T protein:vir:80 247 DVEQVKPLSL---KDLAIHETVELDKRTVAGIFGVPA-FLLGVGKYDKDEYNN------FINSTILPIAKGIEQELTRKL 316 (403) T ss_pred ccceeccCCH---HHHHHHHHHHHhHHHHHHHhCCCH-HHcCCCCccHHHHHH------HHHHHHHHHHHHHHHHHHHhc Confidence 3444444433 233447899999999999999987 5557643 222222 344568999999999999877 Q ss_pred HHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhc Q lcl|NC_021303. 394 LTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVT 470 (637) Q Consensus 394 Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~ 470 (637) |.+ ..|-+.||.+.|. ..|..+.+ ..++..|++|-.-.|+.+|+....+=| +.+..+ T Consensus 317 l~~---------~~~~~~f~~~~ll-~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd----~~~~~~------ 376 (403) T protein:vir:80 317 LIS---------PDLYFKFNPRSLY-AYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLS----ELVILE------ 376 (403) T ss_pred cCC---------CCcEEEeechhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----eEeecc------ Confidence 632 3477899999983 55644443 347888999999999999998654312 111110 Q ss_pred CCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_021303. 471 KNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTE 521 (637) Q Consensus 471 ~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdte 521 (637) .+.||- ..++. .....+..++++-.+| T Consensus 377 -------n~~pl~----------------~~~~~-~~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 377 -------NYIPLD----------------KIGDQ-NKLKGGEKGGADGQTD 403 (403) T ss_pred -------cccchh----------------hccch-hhccCCCCCCCCCCCC Confidence 111210 00000 0000000011111111 No 52 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=98.93 E-value=2.7e-09 Score=67.55 Aligned_cols=391 Identities=12% Similarity=0.050 Sum_probs=188.0 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhH-HHHHhhhhhhHhhHhhhhhcceeeeE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSE-AWDFSESIGELSYYISWRANSCSRTT 79 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~e-AW~~yd~VgELryyvgWr~~s~Sr~r 79 (637) |+=.+.= ++|-.|-....+.++.-+... +|... -.+.|-.++.+.--|..+++++|.+- T Consensus 1 mg~~~~~---~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p 60 (403) T protein:vir:10 1 MGFKSWI---TEKLNPGQRIIRDMEPVSHRT-----------------NRKPFTTGQAYSKIEILNRTANMVIDSAAECS 60 (403) T ss_pred Ccchhhh---hhccchhhhhhhccccccccc-----------------CCcccccHHHHHHHHHHHHHHHHHHHHHhhCc Confidence 6544311 111111100111121111110 01100 01222345667777888999999987 Q ss_pred EEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccc Q lcl|NC_021303. 80 LIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRA 159 (637) Q Consensus 80 L~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~ 159 (637) +...+-. +..+++-..+. +....+.+.=-.--+-..++.+.++.++-+-|++||.+. +. . T Consensus 61 ~~v~~~~---~~~~~~~~~~~----~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~----~~---------~ 120 (403) T protein:vir:10 61 YTVGDKY---NIVTYANGVKT----KTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWD----GT---------S 120 (403) T ss_pred eeEeecc---ccccccccccc----chHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe----Cc---------e Confidence 6553321 11111111121 223334333234456778899999999999999998752 21 1 Q ss_pred cceeeeHHHh--ccCCCceeEEecCCCCcccccCCCceEEEE--e--cCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_021303. 160 RWYAVTREEI--KSKAGETAEISLPDGKTHEFNRDLDSLVRI--W--NPRPRKASQATSPVRACLETLREIERTTRKIKN 233 (637) Q Consensus 160 ~W~~vt~~Ei--~~k~g~~~~i~lPdG~~he~~~~~d~l~Rv--W--~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 233 (637) -|+ +-.+.+ ....++.+ ...-.+....|..+ -+|++ . .+++.....--||+.++...+.-.....+...+ T Consensus 121 l~~-l~~~~~~v~~~~~~~~-~~~~~~~~~~~~~~--eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 196 (403) T protein:vir:10 121 LYH-VPAALMQVEADANKFI-KKFIFNNQINYRVD--EIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEK 196 (403) T ss_pred eEe-ecCcceEEEEcCCceE-EEEEecCceeeccc--ceEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 121 111111 11111111 11111222223222 23333 2 234444444557777777666655555544444 Q ss_pred HHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_021303. 234 AAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVA 313 (637) Q Consensus 234 a~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP 313 (637) ..+.-..-.|||-.|+.++ ....+.|++- -+..+...+.+ -=|+|+ + T Consensus 197 ~f~ng~~~~gil~~~~~l~-------------------------~e~~~~~~~~----~~~~~~g~~n~--g~~~vl--~ 243 (403) T protein:vir:10 197 FLDNGTVIGLILETDEILN-------------------------KKLRERKQEE----LQLDYNPSTGQ--SSVLIL--D 243 (403) T ss_pred HHhccCCcceEEEeCCCCC-------------------------HHHHHHHHHH----HHHHhCCcccC--cceeec--C Confidence 4444344446665555332 1123334332 23333221111 123333 2 Q ss_pred hHHhcccceeecCcchhHH-HHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhH Q lcl|NC_021303. 314 AEHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYND 392 (637) Q Consensus 314 ~Ehi~~ikHlkf~~dvtev-aiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~ 392 (637) +. -+++.+.+.....+. -+++|+-.+..+|.-+-|||. |||.+ ++-+.-+....-++.-|.|.+..|+++|+.. T Consensus 244 ~g--~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~--~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~~ 318 (403) T protein:vir:10 244 GG--MKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQV-LLDGG--NNANIRPNIELFYYMTIIPMLNKLTSSLTFF 318 (403) T ss_pred CC--ceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHH-HcCCC--CCcCHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 22 345555554333333 388999999999999999985 55643 2333445555667888999999999999975 Q ss_pred HHHHHHHHhCCChHHeEEeecCcccc-cCCCCCHHH--H-HHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHH Q lcl|NC_021303. 393 ILTPLLAREGIDPTKYILWYDASGLT-SDPDLSDEA--V-EAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADV 468 (637) Q Consensus 393 ~Lr~~L~~eGiDp~kYvvw~DaS~Lt-~dPD~tdeA--~-~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~ 468 (637) + .|-++||.+.+. ..+|....+ . .+++.|++|-...|..+|++.-+. +.+- T Consensus 319 L-------------~~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~-----~~~d------- 373 (403) T protein:vir:10 319 F-------------GYKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDD-----EQMN------- 373 (403) T ss_pred c-------------CceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCc-----cccc------- Confidence 4 256889998774 455554433 2 367889999999999999885321 0000 Q ss_pred hcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_021303. 469 VTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQR 516 (637) Q Consensus 469 v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~ 516 (637) .-+.|+ ..++. .....+.+..++ ..+.+|+ T Consensus 374 --------~~~~p~---n~~~~------~~~~~~~e~~~~-~~~~~g~ 403 (403) T protein:vir:10 374 --------KIRIPA---NVAGS------ATGVSGQEGGRP-KGSTEGD 403 (403) T ss_pred --------cccccc---ccccc------cccCCCCcCCCC-CCCcCCC Confidence 111111 01100 001111111111 1112222 No 53 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=98.92 E-value=2e-10 Score=73.77 Aligned_cols=402 Identities=14% Similarity=0.098 Sum_probs=207.7 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |..+.+ +-|.|.+-- -..+..-....++|.--...+..|- .++-+-..+-+.-.+.-+++++|.+.| T Consensus 1 ~~~~~~--~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~v--------~~~~~~~~~~V~~ci~~Ia~~ia~lp~ 67 (409) T protein:vir:93 1 MAKENI--VTRIKKKLI---DNWIDQSTSKLYDFSPWKNRSFWGV--------INNTLETNETIFSAITKLSNSMASLPL 67 (409) T ss_pred CCccch--hhhhhhhhh---hhhhccccccccccccccCcccccc--------chhhhhccHHHHHHHHHHHHhhhhCce Confidence 665543 333333210 0011111112222221111111110 112233445566677889999999988 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+=+.. + .+.+..+.+.=..--+-..++++.++.+|-+-|+.|+.+.-...|. + .. T Consensus 68 ~~~~~~~~-------~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-------~-~~ 125 (409) T protein:vir:93 68 KMYEDYKV-------V-------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQ-------P-SK 125 (409) T ss_pred eEeecccc-------c-------cchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCc-------E-EE Confidence 77542211 1 1345555544445567788999999999999999998875443442 1 24 Q ss_pred ceeeeHHHhcc---CCCceeE--EecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_021303. 161 WYAVTREEIKS---KAGETAE--ISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (637) Q Consensus 161 W~~vt~~Ei~~---k~g~~~~--i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 235 (637) ++.|..+.+.. +.++.+. +...+|...+|.. +=||++=++.+..-..--||+.++...+.=...+.+. +.. T Consensus 126 L~~l~~~~v~~~~~~~~~~~~y~~~~~~g~~~~~~~--~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~--~~~ 201 (409) T protein:vir:93 126 LFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHN--MDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF--NLT 201 (409) T ss_pred EEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEEcc--ccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHH--HHH Confidence 45554444331 2333332 4555666555543 3456664555555555668876655544322222111 222 Q ss_pred HhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_021303. 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (637) Q Consensus 236 ~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~E 315 (637) +....+.||+..++.+ .....+.+.+.|.+. +.+.+. + +|+ ++. T Consensus 202 ~~~~~~~~i~~~~~~l-------------------------~~e~~~~~~~~~~~~----~~~~g~----~-~vl--~~g 245 (409) T protein:vir:93 202 EMQKPDSFMLKYGSNV-------------------------GKEKRQQVLEDFKQY----YEENGG----I-LFQ--EPG 245 (409) T ss_pred hcCCCCceEEecCCCC-------------------------CHHHHHHHHHHHHHH----hhcCCC----e-eec--CCC Confidence 2222222333333221 112344555555432 222221 2 222 322 Q ss_pred HhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHH Q lcl|NC_021303. 316 HLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILT 395 (637) Q Consensus 316 hi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr 395 (637) -+++.|.+.. .+.--+++|+-.+..+|...-|||..|-+.+++|.-+..|....-++.-|.|.+..|+++|++.+|- T Consensus 246 --~~~~~l~~~~-~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~ 322 (409) T protein:vir:93 246 --VEIEPLPKKY-VSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLT 322 (409) T ss_pred --ceEEEcCCCh-hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 3444444332 2333488999999999999999888664446788888888777778888999999999999988774 Q ss_pred HHHHHhCCChHHeEEeecCcccc-cCCCCC-HHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCc Q lcl|NC_021303. 396 PLLAREGIDPTKYILWYDASGLT-SDPDLS-DEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNP 473 (637) Q Consensus 396 ~~L~~eGiDp~kYvvw~DaS~Lt-~dPD~t-deA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P 473 (637) + .+.+ ..|-+-||.+.|. .|+-.. +....++..|++|-.-.|+.+|++.-.+=| +.+. T Consensus 323 ~----~~~~-~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD----~~~~----------- 382 (409) T protein:vir:93 323 K----TDRE-KNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD----KPLI----------- 382 (409) T ss_pred c----cccc-CcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC----eeee----------- Confidence 3 2332 2466889998873 343222 223447788999999999999997554322 0000 Q ss_pred hhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCC Q lcl|NC_021303. 474 ELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDE 523 (637) Q Consensus 474 ~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~ 523 (637) + ..++.++.+.- .......|++...| . T Consensus 383 -------~---~n~~~~~~~~~------------~~~~~~gG~~n~~e-~ 409 (409) T protein:vir:93 383 -------S---GDLYPIDTPLE------------LRKSLKGGDKNVNE-S 409 (409) T ss_pred -------c---ccccccccchh------------hcccccCCCCCcCC-C Confidence 0 01111111100 00001111111111 0 No 54 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=98.84 E-value=4.9e-10 Score=71.62 Aligned_cols=272 Identities=12% Similarity=0.066 Sum_probs=168.9 Q ss_pred eeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccc Q lcl|NC_021303. 75 CSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGL 154 (637) Q Consensus 75 ~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~ 154 (637) +|++-|.+-+=+.+ .+ +.+..+.+.--.--+...++++.+..+|-+-|++|+.+.-...|. T Consensus 1 ia~l~~~~~~~~~~----------~~----~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~----- 61 (278) T protein:vir:78 1 MASLPLKMYEDYKV----------VN----TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQ----- 61 (278) T ss_pred CccceeEEEecCcc----------cc----cHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCc----- Confidence 77777766542211 11 224444333333457788899999999999999998876544442 Q ss_pred ccccccceeeeHHHhcc---CCCceeE--EecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhH Q lcl|NC_021303. 155 AAPRARWYAVTREEIKS---KAGETAE--ISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTR 229 (637) Q Consensus 155 ~~~~~~W~~vt~~Ei~~---k~g~~~~--i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk 229 (637) ...++.+..+-+.. ..++.+. +...+|...+|.. +-||++=++++.....-.||+.++...+.......+ T Consensus 62 ---~~~l~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~--~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~ 136 (278) T protein:vir:78 62 ---PSKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHN--MDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRT 136 (278) T ss_pred ---EEEEEEECCceeEEEEcCCCceEEEEEEcCCceEEEEcc--ccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHH Confidence 12455554444432 2333333 4555666555543 346666577777777788999999888876554443 Q ss_pred HHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCcccccccccee Q lcl|NC_021303. 230 KIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLV 309 (637) Q Consensus 230 ~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiv 309 (637) . |..+....+.|++..|+.++ ....+.+++.|.+ .+.+.+ . + ++ T Consensus 137 ~--~~~~~~~~~~~i~~~~~~l~-------------------------~e~~~~~~~~~~~----~~~~~g---~-~-~v 180 (278) T protein:vir:78 137 F--NLTEMQKPDSFMLKYGSNVG-------------------------KEKRQQVLEDFKQ----YYEENG---G-I-LF 180 (278) T ss_pred H--HHHHhcCCCcEEEEeCCCCC-------------------------HHHHHHHHHHHHH----HhccCC---C-c-ee Confidence 2 44444445566666665432 1134456665543 222211 1 1 22 Q ss_pred EeechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeechhHHHHHHH Q lcl|NC_021303. 310 ASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQA 388 (637) Q Consensus 310 a~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~P~me~ic~A 388 (637) ++++ .+++.+.+.. .+.--+++|+..+..+|..+-|||+ |+|. .++|+-++-+....=++.-|.|.++.|+++ T Consensus 181 --l~~g--~~~~~l~~~~-~d~~~~e~~~~~~~~Ia~~fgVpp~-~lg~~~~~~~sn~~~~~~~~~~~~l~P~~~~i~~~ 254 (278) T protein:vir:78 181 --QEPG--VEIEPLPKKY-VSEDIVASENLTRERVANVFQLPSV-FLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEE 254 (278) T ss_pred --cCCC--ceEEEccCCh-hHHHHHHHHHHHHHHHHHHhCCCHH-HhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHH Confidence 2333 3566666542 2334488899999999999999865 5566 568888877766666777799999999999 Q ss_pred HHhHHHHHHHHHhCCChHHeEEeecCccc Q lcl|NC_021303. 389 IYNDILTPLLAREGIDPTKYILWYDASGL 417 (637) Q Consensus 389 it~~~Lr~~L~~eGiDp~kYvvw~DaS~L 417 (637) |+..+|.+- .++ ..|-+=||.+.| T Consensus 255 ln~~L~~~~----e~~-~g~~~~f~~~~l 278 (278) T protein:vir:78 255 FNRKLLTKT----DRE-KIGILNLTLNLI 278 (278) T ss_pred HHhhcCChh----Hhc-CCceEEEecccC Confidence 999876532 111 347899999999 No 55 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.82 E-value=4.3e-09 Score=66.45 Aligned_cols=384 Identities=13% Similarity=0.103 Sum_probs=196.3 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |. +.++.|.+. + +....||. |....+++. ....--....+ .++-+.-.|.-+++++|.+.+ T Consensus 1 M~-----~f~~~~~~~---~-------~~~~~~~~--~~~~~~~~~-~~~~v~~~~al-~~~~V~~~v~~ia~~ia~~p~ 61 (397) T protein:vir:38 1 MP-----LLKLNKSHS---Q-------GFSLNDPD--WVNFLTGGE-AQKYVSADTAL-KNSDIFSLIMQLSGDLAMVRY 61 (397) T ss_pred Cc-----chhhhhccc---C-------cccCCchh--hhhhhcCCc-CCceechHHhh-ccHHHHHHHHHHHHHHhhCcc Confidence 43 344444332 1 11112222 222222211 11111111223 356677778888999998887 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) -. ++ .....+.. =..--+...++++.++.+|-+-|++|+.+.-...|. +. . T Consensus 62 ~~----------------~~----~~~~~l~~-~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~-------~~-~ 112 (397) T protein:vir:38 62 TS----------------ES----DRSQSIIS-NPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGV-------DL-S 112 (397) T ss_pred cc----------------cc----cHHHHHHh-cCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCc-------EE-E Confidence 31 11 11222222 122346788999999999999999999876544443 11 3 Q ss_pred ceeeeHHHhc---cCCCceeE--EecC---CCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_021303. 161 WYAVTREEIK---SKAGETAE--ISLP---DGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (637) Q Consensus 161 W~~vt~~Ei~---~k~g~~~~--i~lP---dG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 232 (637) ++.|....+. ...++.+. +... .|...+|.. .+ ||++=.+.+..-..--||+.+++..+.-..-..+... T Consensus 113 l~~l~~~~v~i~~~~~~~~~~y~~~~~~~~~~~~~~~~~-~e-iih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~ 190 (397) T protein:vir:38 113 WEYLRPSQVQPMLLQDGSGLIYNINFDEPAIGYMENVPA-AD-VIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTL 190 (397) T ss_pred EEEEcCceeEEEEcCCCceEEEEEEeccccccceeEecC-cc-EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHH Confidence 3344333332 12222222 2222 233334433 23 5555445544444567888888777766666666666 Q ss_pred HHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_021303. 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (637) Q Consensus 233 na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~v 312 (637) +..+.-.+-.|||-+|+.++- ...+.+.+.+.. ....++.+ -|+|+ T Consensus 191 ~~f~ng~~~~~il~~~~~~~~-------------------------e~~~~~~~~~~~--~~~~~n~~-----~~~vl-- 236 (397) T protein:vir:38 191 KALKQSVTASAVLTIQKGGLL-------------------------DAETRIARSKEI--SKQIHNSD-----GPVVI-- 236 (397) T ss_pred HHHhccCCccEEEEeCCCCCH-------------------------HHHHHHHHHHHH--HhcccccC-----Cceec-- Confidence 666666666778877763210 123334444322 11222222 23443 Q ss_pred chHHhcccceeecCcchhHHH-HhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHh Q lcl|NC_021303. 313 AAEHLEKVQHIKFGNEVTEVE-IKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYN 391 (637) Q Consensus 313 P~Ehi~~ikHlkf~~dvteva-iktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~ 391 (637) + .+++...++....+.. +++|+..+..+|..+-|||..|-|.+++|.+.. |. ...++-.|.|.+..|+++|++ T Consensus 237 ~----~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e-~~-~~~~~~~l~P~~~~ie~~ln~ 310 (397) T protein:vir:38 237 D----ALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSIT-QI-SGQYAKSLNRYVQAIVGELND 310 (397) T ss_pred C----CCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH-HH-HHHHHHHHHHHHHHHHHHHHH Confidence 2 2344444454444444 889999999999999999888766655554333 32 234556799999999999998 Q ss_pred HHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHH Q lcl|NC_021303. 392 DILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADV 468 (637) Q Consensus 392 ~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~ 468 (637) .++.. .+ |-+.+. .++|..+.+ ..+++.|.||-.-.|+.+|+..-.+=| T Consensus 311 ~l~~~----~~-----~~~~~~-----~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d-------------- 362 (397) T protein:vir:38 311 KLHAN----IS-----ANIRFA-----IDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKD-------------- 362 (397) T ss_pred hccCh----hc-----cccccc-----ccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCc-------------- Confidence 87753 22 222222 233333333 457888999999999999986422211 Q ss_pred hcCCchhHHHHHhhhccccccccCCCCcCCC-CCCCCCCCCCCCCCCCCCcc Q lcl|NC_021303. 469 VTKNPELIAMYAPLLSSQLAGIEFPQPANAI-ESTREEDDEDSGARQQREPQ 519 (637) Q Consensus 469 v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~-~~~~~~~d~~~~a~~g~EPd 519 (637) + ..+... +.+.... ...+.+.+..+..+.+++|+ T Consensus 363 ------~-------~~~~~~----~~~~~~~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 363 ------L-------PDPEKE----PQQAIQLIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred ------c-------cccccc----ccccccccccccCCCCCCCCCCCCCCCC Confidence 0 000000 0011111 11111111212222333333 No 56 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=98.82 E-value=1.5e-09 Score=69.02 Aligned_cols=402 Identities=13% Similarity=0.094 Sum_probs=208.2 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |+...+ .-|=|++=. -..+.-......++..-...+..|- .-+.|-..+.+.-.+.-+++++|++.+ T Consensus 1 ~~~~~~--~~~~k~~~~---~~~~~~~~~~~~~~~~~~~~~~~~v--------~~~~a~~~~~v~~~i~~Ia~~ia~lp~ 67 (409) T protein:vir:94 1 MAKENI--VTRIKKKLI---DNWIDQSASKLYDFSPWKNKSFWGV--------INNTLETNETIFSAITKLSNSMASLPL 67 (409) T ss_pred Cccccc--chhhhhHHh---hhhhcCCcccccccccccCcccccc--------chhhhhccHHHHHHHHHHHHhhhhCce Confidence 665532 333333210 0000000111111111000110110 011233456667777889999999988 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+=... . .+.+.++.+.=..--+-..++++.++.+|-+-|++|+.+.-...|. +. . T Consensus 68 ~~~~~~~~----------~----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-------~~-~ 125 (409) T protein:vir:94 68 KMYEDYKV----------V----NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQ-------PS-K 125 (409) T ss_pred eEeecccc----------c----chhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-------EE-E Confidence 66442111 1 1335555444345567888999999999999999998876444443 22 4 Q ss_pred ceeeeHHHhc---cCCCceeE--EecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_021303. 161 WYAVTREEIK---SKAGETAE--ISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (637) Q Consensus 161 W~~vt~~Ei~---~k~g~~~~--i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 235 (637) ++.|..+.+. .+.++.+. +...+|...+|.. +=+|++=++++..-..--||+.++.+.+.-...+.+. +.. T Consensus 126 L~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~--~dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~--~~~ 201 (409) T protein:vir:94 126 LFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHN--MDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF--NLT 201 (409) T ss_pred EEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEEcc--ccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHH--HHH Confidence 5555555443 13334332 5566777666654 3355554555655556678877665554432222211 111 Q ss_pred HhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_021303. 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (637) Q Consensus 236 ~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~E 315 (637) ..-..+.||+..|+.+ .....+.+.+.|.+ .++..+. ++|+ ++. T Consensus 202 ~~~~~~~~i~~~~~~l-------------------------~~e~~~~~~~~~~~----~~~~~g~-----~~vl--~~g 245 (409) T protein:vir:94 202 EMQKPDSFMLKYGSNV-------------------------GKEKRQQVLEDFKQ----YYEENGG-----ILFQ--EPG 245 (409) T ss_pred hcCCCCeeEEecCCCC-------------------------CHHHHHHHHHHHHH----HhhcCCC-----eeec--CCC Confidence 1111222333333211 11233445444443 2222221 2222 222 Q ss_pred HhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHH Q lcl|NC_021303. 316 HLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILT 395 (637) Q Consensus 316 hi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr 395 (637) -+++.|.+... +.--+++|+-.+..+|...-|||..|=|.+++|.-+..|....=++.-|.|.+..|+++|++.+|- T Consensus 246 --~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~ 322 (409) T protein:vir:94 246 --VEIEPLPKKYV-SEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLT 322 (409) T ss_pred --ceEEEcCCChh-HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCC Confidence 34555544332 223378999999999999999988664446788888888777778888999999999999987664 Q ss_pred HHHHHhCCChHHeEEeecCccc-ccCCCCCHHH-HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCc Q lcl|NC_021303. 396 PLLAREGIDPTKYILWYDASGL-TSDPDLSDEA-VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNP 473 (637) Q Consensus 396 ~~L~~eGiDp~kYvvw~DaS~L-t~dPD~tdeA-~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P 473 (637) ..+.+. .|.+-||.+.| ..|+-..-++ ..++..|.+|-.-.|..+|++.-.+=| +.+ T Consensus 323 ----~~~~~~-~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD----~~~------------ 381 (409) T protein:vir:94 323 ----KTDREK-NRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD----KPL------------ 381 (409) T ss_pred ----cccccC-cceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC----eEe------------ Confidence 333333 46677899887 3443333333 347788999999999999997554322 000 Q ss_pred hhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_021303. 474 ELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTED 522 (637) Q Consensus 474 ~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted 522 (637) .+ ..+..++.+... ......|++...|- T Consensus 382 ------~~---~n~~~~~~~~~~------------~~~~kGG~~n~~e~ 409 (409) T protein:vir:94 382 ------IS---GDLYPIDTPLEL------------RKSLKGGDKNVNES 409 (409) T ss_pred ------ec---ccccccccchhh------------cccccCCCCCcCCC Confidence 00 011111111000 00011111111110 No 57 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=98.81 E-value=1.8e-09 Score=68.52 Aligned_cols=383 Identities=11% Similarity=0.066 Sum_probs=182.7 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHH---H-HHhhhhhhHhhHhhhhhccee Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEA---W-DFSESIGELSYYISWRANSCS 76 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eA---W-~~yd~VgELryyvgWr~~s~S 76 (637) |.==+ |+ +++...-+..+..+ ...+|. ..|.... . ..| ..+-++-.+.-+++.|| T Consensus 1 MGl~~-~~--~~~~~~~~~~~~~~---------------~~~~~~--~~~~~~~~vt~~~al-~~~~v~~~i~~Ia~~iA 59 (394) T protein:vir:62 1 MGLRD-RF--SNYLFKKAEKRGYL---------------DNVLGK--SIRYSGVYVTDSNIL-QSSDVYELLQDISNQMV 59 (394) T ss_pred Cchhh-hh--hhhccCCCCchhhh---------------hhhhhc--ccccCccccChhhhh-ccHHHHHHHHHHHHhhc Confidence 33211 11 11110000000000 001111 0000000 0 123 33567778888999999 Q ss_pred eeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccc Q lcl|NC_021303. 77 RTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAA 156 (637) Q Consensus 77 r~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~ 156 (637) .+.+..- +.| |.. +.+ +....+... ..--+-..++++.++.+|.+-|++|+.+--++-+ T Consensus 60 ~lp~~v~--~~~-g~~-----~~~----~~~~~Ll~~-PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~-------- 118 (394) T protein:vir:62 60 LADIVVE--DEF-GNE-----IKD----DIALQILRN-PNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIH-------- 118 (394) T ss_pred ccceEEE--cCC-Ccc-----cch----hhHHHHhcc-CCCCCCHHHHHHHHHHHHHhcCCeEEEEecceee-------- Confidence 9988773 333 432 222 334445443 3344667788999999999999999986322111 Q ss_pred ccccceeeeHHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHH Q lcl|NC_021303. 157 PRARWYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAK 236 (637) Q Consensus 157 ~~~~W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~ 236 (637) .|--++. + ..+.+.. .. ....++|.. +-+|++=.+.+ ....--||+..+. +.+.+........+ T Consensus 119 ---~~~~~~~---~-~~~~~~~-~~-~~~~~~~~~--~eiih~r~~~~-d~~~G~s~~~~~~----~~i~~~~~~~~~~~ 182 (394) T protein:vir:62 119 ---LASNVFT---E-LDDNLVE-HF-NIGGHEIPP--CMIRHVKNIGA-DHLRGKGILDLGR----DTLEGVMSAEKTLT 182 (394) T ss_pred ---ccccceE---E-ECCceEE-EE-eeCCEEech--hheEEecCcCC-CCccccChHHHHH----HHHHHHHHHHHHHH Confidence 1211111 1 1111111 11 112244433 33455423322 2233446655444 44444444433333 Q ss_pred hHhhcC-----ceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_021303. 237 SRVMNN-----GVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVAS 311 (637) Q Consensus 237 SRL~gn-----GvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~ 311 (637) +++.| |||-+|..++. ...+.+.+.+.+. ..+...+. +-=++|+. T Consensus 183 -~~~~ng~~~~~il~~~~~~~~-----------------------~~~~~~~~~~~~~----~~~~g~~n--~g~~~vl~ 232 (394) T protein:vir:62 183 -DKYKKGGLLTFLLNLDAHINP-----------------------QNGAQSKLINAIL----DQLESIDE--ARSVKMIP 232 (394) T ss_pred -HHHHccCCcceEEEeCCCCCc-----------------------CHHHHHHHHHHHH----HHhccccc--cCceeEee Confidence 33333 36555552211 0112233443332 22222221 12223333 Q ss_pred echHHhcccceeecCcchhHH-HHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHH Q lcl|NC_021303. 312 VAAEHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIY 390 (637) Q Consensus 312 vP~Ehi~~ikHlkf~~dvtev-aiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait 390 (637) .. .+++-..+.....+. -+++|+-....+|..+-|||..|=|+.++| .-+....-++.-|.|.+..||++|+ T Consensus 233 ~g----~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn---~e~~~~~~~~~~l~P~~~~ie~~l~ 305 (394) T protein:vir:62 233 LG----KGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELIKED---IEKAMMYIHNKAVRPIMKNFEDHLS 305 (394) T ss_pred CC----CceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcC---HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 33 344544555443333 578999999999999999888664444444 3344445567779999999999999 Q ss_pred hHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHH-HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHh Q lcl|NC_021303. 391 NDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA-VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVV 469 (637) Q Consensus 391 ~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA-~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v 469 (637) ..+|.+ .+| .+|.+.||.+.|. ++|...++ -.+...|.+|-.-.|+.+|++.-.+-+ + T Consensus 306 ~kll~~---~~~---~~~~~~fd~~~~~-~~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~-----g--------- 364 (394) T protein:vir:62 306 LLFYAQ---NSG---KRIKFKINILDFV-TYSNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKE-----S--------- 364 (394) T ss_pred hhhcCc---ccc---CceEEEechhhhc-CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-----C--------- Confidence 888755 333 4799999999883 44433333 346667999999999999998542101 0 Q ss_pred cCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_021303. 470 TKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQT 520 (637) Q Consensus 470 ~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdt 520 (637) + ..+.+ ..++.+. ......++...|++.+- T Consensus 365 ----d--~~~~~---~n~~~~~------------~~~~~~~~~kgge~~en 394 (394) T protein:vir:62 365 ----Q--AIYIS---NDVTEIG------------KKEATDGSLGGGEENEN 394 (394) T ss_pred ----C--eeecc---ccccccc------------ccccccccCCCCCCCCC Confidence 0 00000 0011110 00000111111111100 No 58 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=98.78 E-value=1.1e-08 Score=64.28 Aligned_cols=374 Identities=12% Similarity=0.109 Sum_probs=194.8 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.==+-|. +.++..... . ... ++ ..+... .|.....+ .-.-.++ ..+-++-.+.-++++||++.+ T Consensus 1 Mg~~~~~~-~~~~~~~~~----~-----~~~-~~-~~~~~~-~~~~~~~~-v~~~~al-~~~~v~~~i~~ia~~ia~~p~ 65 (385) T protein:vir:10 1 MGLLTPRN-FNKRKAKNM----V-----YPS-NP-AFFTTT-VGGMQLSY-VSALSAL-QNTNVYSVINRIASDVASAHF 65 (385) T ss_pred Cccccchh-ccccccccc----c-----ccc-ch-hhhhhh-ccccCccc-cCHHHhh-ccHHHHHHHHHHHHHHhhCce Confidence 65332221 111111110 0 000 11 111111 11100100 1111222 345567778889999999877 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) -.-. .+ ...+.+. ..--+-..++++.++.+|.+-|++|+.+.-. ..+ T Consensus 66 ~v~~---~~-----------------~~~ll~~-PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~-~~~----------- 112 (385) T protein:vir:10 66 KTEN---TA-----------------TLNRLES-PSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ-NLE----------- 112 (385) T ss_pred eeec---cc-----------------hhhhhhc-CCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC-cee----------- Confidence 5421 11 1111111 1112456788999999999999999987522 111 Q ss_pred ceeeeHHHhc--cCCCcee-EEecC-CCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHH Q lcl|NC_021303. 161 WYAVTREEIK--SKAGETA-EISLP-DGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAK 236 (637) Q Consensus 161 W~~vt~~Ei~--~k~g~~~-~i~lP-dG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~ 236 (637) -+.+...-|+ ..+++.. .+... +|...+|....=+-||..+|.......--||+..+...+.-.....+...+..+ T Consensus 113 ~~p~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ 192 (385) T protein:vir:10 113 HIPNSDVQINYLPGNMGIVYTVLESNDRPQMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAME 192 (385) T ss_pred EeecCCceEEEEEcCCceEEEEEEcCCceEEEEccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 1111111111 1112222 23233 334445544332336666666555556678999988888777777777777777 Q ss_pred hHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHH Q lcl|NC_021303. 237 SRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEH 316 (637) Q Consensus 237 SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Eh 316 (637) .-..-.|||-+|..++= ....+.+.+.+-+. +...++ --|+++ ++. T Consensus 193 ng~~~~gil~~~~~~~~------------------------~e~~~~~~~~~~~~----~~~~n~---~~~~vl--~~g- 238 (385) T protein:vir:10 193 NQINPAGKLTISNYLSD------------------------GKDLESAREEFEKA----NTGDNS---GRLMVL--PDG- 238 (385) T ss_pred ccCCcceEEEeCCCCCC------------------------HHHHHHHHHHHHHH----hCcccc---CCcccc--CCC- Confidence 77777788888763321 11334454444332 221111 122232 222 Q ss_pred hcccceeecCcchhHHH--HhhHHHHHHHHHhhcCCchhHhhcc--CCcceeeeEEeccCceeEeechhHHHHHHHHHhH Q lcl|NC_021303. 317 LEKVQHIKFGNEVTEVE--IKTRIDAITRLAMGLDVSPERLLGM--SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYND 392 (637) Q Consensus 317 i~~ikHlkf~~dvteva--iktR~daI~RlAmglDv~pErLLGl--s~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~ 392 (637) .+++.|... ..+.. .++|+-.+..+|...-|||.-|=|. +++++-+.-|....=++ -|.|.+..|+++|+.. T Consensus 239 -~~~~~l~~~--~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~~-~l~P~~~~ie~~l~~~ 314 (385) T protein:vir:10 239 -FDYTQLEMK--TDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLA-NLNSYVNPIVDELRLK 314 (385) T ss_pred -ceEEecCCC--hhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHHHH-HHHHHHHHHHHHHHHh Confidence 345555543 33333 4899999999999999988765443 34555555554333333 6899999999999988 Q ss_pred HHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHHH---HHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHh Q lcl|NC_021303. 393 ILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVE---AHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVV 469 (637) Q Consensus 393 ~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~---a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v 469 (637) +|.+ -+.||.+.| ..+|..+.+.. +++.|.+|-.-.|..+|+.- ++.. T Consensus 315 l~~~------------~~~f~~~~l-l~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p---~p~~------------- 365 (385) T protein:vir:10 315 MNAP------------DLELDIKDM-LDVDDSALINQVSNLAKSGVLGAEQAQFILTRSG---FLPD------------- 365 (385) T ss_pred hCCc------------eEEeechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCc---cCCC------------- Confidence 7521 277888887 34455444433 77889999999998887531 1100 Q ss_pred cCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCC Q lcl|NC_021303. 470 TKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDD 507 (637) Q Consensus 470 ~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d 507 (637) .+.++..+.....-|+++++ T Consensus 366 ------------------~~~~~~~~~~~~~~g~~~dn 385 (385) T protein:vir:10 366 ------------------NLPEFKPLTTQVKGGDEGDN 385 (385) T ss_pred ------------------CCccccCcccccCCCCCCCC Confidence 01122222333333333333 No 59 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.78 E-value=1.2e-08 Score=64.07 Aligned_cols=490 Identities=13% Similarity=0.116 Sum_probs=204.3 Q ss_pred CCCC--cceEEecCCCCCcccccchheeh----hccc----------------cchhhhhhhhcccccccchhhHHHHHh Q lcl|NC_021303. 1 MAAT--SLRVVRRPKGSAPAARRRSLTAA----SQLI----------------TDPQKQMKTSLMGTARNEWQSEAWDFS 58 (637) Q Consensus 1 ma~~--~lr~vrrpk~~~p~~~r~~ltAA----s~~~----------------~~p~~~~k~~~~g~~r~~WQ~eAW~~y 58 (637) |--. .=||+---..+ ......++-+. +++. .+|.. ++.......++..-..|.+++ T Consensus 27 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~iv~~~i~~~ 104 (574) T protein:vir:80 27 MHLREIDTNVVNNEPYS-MESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQD-LHKTLKKFGNNIILNAIINTR 104 (574) T ss_pred cccchhhhhhhhccCCC-HHHHHHhHhhhcccccchhhhhccccccccCcCccCCccc-HHHHHHhhccChhHHHHHHHH Confidence 1111 11121100000 00000111110 0000 01111 111111111233333444433 Q ss_pred hhhhhHhhHhhhhhcceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccC--c--ccHHHHHHHHHhhhcc Q lcl|NC_021303. 59 ESIGELSYYISWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADG--P--LGQAALIKRAVECMTV 134 (637) Q Consensus 59 d~VgELryyvgWr~~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG--~--lGqaqLlkr~~~~LtV 134 (637) -. -+-=++..++.+.+.+-+.+-.-|.| +.+++.... ..+++..++...--. | .-..++++.++.++-+ T Consensus 105 ~~--~V~~~~~~i~~~ia~lp~~i~~kd~~-~~~~~~~~~----~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll 177 (574) T protein:vir:80 105 SN--QVSMYCKPARNSETGVGYEIRLKDIE-AEPTSHDIA----NIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYM 177 (574) T ss_pred HH--HHHHHHHHHHhhhccCceEEEEeccC-CCccchhhh----hhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHh Confidence 11 11122334444444444444333433 444432222 234555554332111 2 2345799999999999 Q ss_pred cccEEEEEEeecCCccccccccccccceeeeHHHh-ccCCCceeEEecCCCCcccccCCCceEEEEecCCcc--cccCCc Q lcl|NC_021303. 135 VGEVWIAVLIRQEKDPVTGLAAPRARWYAVTREEI-KSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPR--KASQAT 211 (637) Q Consensus 135 pGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei-~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~pr--ra~eaD 211 (637) -|..|+.+.-..+|. +.+...-...+..+..+.- ....++.++....+|.........++++..-+|.+. .-..-- T Consensus 178 ~Gnayi~i~r~~~G~-~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~~~~~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~ 256 (574) T protein:vir:80 178 YDQVNFEKVFDKDGN-FIKFDTVDPTTIFLATNGEGKLIKNGERFVQVIDNRIVAKFNERELAFAVRNPRADIEVGQYGY 256 (574) T ss_pred cCCeEEEEEECCCCc-EEEEEEEcCceeEEEEcCccccccCceEEEEEeCCceEEEEccccEEEEeccCCCCcccccccc Confidence 999999877655553 2221111112222221111 112233444555555544333445666444454432 222344 Q ss_pred cchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHH Q lcl|NC_021303. 212 SPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQA 291 (637) Q Consensus 212 SPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~v 291 (637) ||+.++...+.=..-+.+...+..+.-..-.|||-++.. +. ......+.|.+.+... T Consensus 257 spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~------~~-----------------ls~e~~~~lk~~~~~~ 313 (574) T protein:vir:80 257 PELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTG------QQ-----------------QSQQALDIFRREWRSS 313 (574) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC------CC-----------------CCHHHHHHHHHHHHHH Confidence 777666666555544455555544444555677776531 10 1112344555554322 Q ss_pred HhhcccCccccccccceeEeechHHhcccceeecCcc-hhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcc------- Q lcl|NC_021303. 292 SVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNE-VTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGN------- 362 (637) Q Consensus 292 a~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~d-vtevaiktR~daI~RlAmglDv~pErLLGl-s~~N------- 362 (637) -....+ +--+||+.. +.++...|... .+.--+++|+..+..+|.-.-|||..| |+ +++. T Consensus 314 -~~G~~n----~g~~~vl~~------~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~l-G~~~~~t~~gs~~~ 381 (574) T protein:vir:80 314 -LAGING----SWQIPVVSA------EDVKFVNMTPSANDMQFEKWLNYLINVISALYGIDPAEI-NFPNNGGATGSKGG 381 (574) T ss_pred -hccccc----cccceeecC------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-cccccccccccccc Confidence 111122 123455532 23444444432 334458999999999999999999754 54 3343 Q ss_pred ---eeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHH Q lcl|NC_021303. 363 ---HWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSA 439 (637) Q Consensus 363 ---HWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~e 439 (637) .-++.+....-++.-|.|.+..|+++|++.+|.. + + .+|.+.||...+.. .+...++.....+|.+|-. T Consensus 382 ~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~-~---~---~~~~~~f~~~d~~~-~~~~~~~~~~~~~G~lT~N 453 (574) T protein:vir:80 382 SLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAE-F---G---EKYQFQFRGGDLSA-QLDKLKIIEQEGKVFRTVN 453 (574) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh-c---C---CceEEEecccchhh-HHHHHHHHHHHhCCccCHH Confidence 3445555566677779999999999999998853 1 2 46899999877642 2222223345668999999 Q ss_pred HHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCC--CCCC-CCCCCCCC Q lcl|NC_021303. 440 ALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTRE--EDDE-DSGARQQR 516 (637) Q Consensus 440 Alrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~--~~d~-~~~a~~g~ 516 (637) -.|+.+|++.-.|-|.- +.|+ .++.++-+.+......... ..++ .++..... T Consensus 454 E~R~~lgl~Pi~gGD~~----------------------~~~~---n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (574) T protein:vir:80 454 EIRHDKGLEPIKGGDVI----------------------LNGV---HIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDV 508 (574) T ss_pred HHHHHhCCCCCCCCCEe----------------------eecc---ceeecccccccccCCccchhccccccccccCCCC Confidence 99999999865543310 1111 1111111111110000000 0000 00000000 Q ss_pred CccCCCCCCCcccCCCCcchHHHHHH-------HHHHHHHHHHhcc---cccCCCc----hhhhhHhhc Q lcl|NC_021303. 517 EPQTEDERSTEEAASLNDRAAYLVAE-------RLLVNRALDLAGK---RRFKVND----AALKTKLRD 571 (637) Q Consensus 517 EPdted~~~~~~~a~~~~~a~~~aa~-------~llV~rALelAGk---Rr~~~~~----~~~~~rlr~ 571 (637) +.+..++|+..+.+... ...-+. ..++.---+.-|| .+-.-+. .-.+.+=.+ T Consensus 509 ~~~~~~~p~~~~~d~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 574 (574) T protein:vir:80 509 EQPEPEEPKDSQNDTDV---SFQDEQQGLNGKSKKVNGKVDDNVGKDGQLKSEENTNSTKHGTDGIKKE 574 (574) T ss_pred CCCCCCCCCCccccccc---hhhhhhhhhccchhhhcCCcccccccccccccccccccccccCccccCC Confidence 11111111111000000 000000 0000000000011 0000000 000001011 No 60 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.77 E-value=2.1e-09 Score=68.13 Aligned_cols=479 Identities=14% Similarity=0.085 Sum_probs=207.2 Q ss_pred CCCC-cceEEecCCCCCcccccc----------------------hheehhccccchhhhhhhhcccccccchhh----- Q lcl|NC_021303. 1 MAAT-SLRVVRRPKGSAPAARRR----------------------SLTAASQLITDPQKQMKTSLMGTARNEWQS----- 52 (637) Q Consensus 1 ma~~-~lr~vrrpk~~~p~~~r~----------------------~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~----- 52 (637) |+-- ++|.+-+++..+-.-.+. --.|=+++.. ++..|... ++. |..+.+ T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~-~~~~~~~~-~~~-r~~~~~~~~l~ 81 (551) T protein:vir:80 5 LGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVI-GSMSANPG-FKT-KPSIRNNQDLH 81 (551) T ss_pred hhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccc-cceecCcc-ccc-CccccChhHHH Confidence 5543 677666666654322221 1112123331 12222111 111 111110 Q ss_pred HHHHHhhhhhhHhhHhhhhhcceeeeEEEEeeeccccCCCCC--------cccCCCCcccchHHHHHHHhcc----Cccc Q lcl|NC_021303. 53 EAWDFSESIGELSYYISWRANSCSRTTLIPSAIDPDTGLPTG--------EVDIEEDPDAQIVADYVKGIAD----GPLG 120 (637) Q Consensus 53 eAW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~DtG~PtG--------~v~~e~~~~~~rv~~iv~~iAg----G~lG 120 (637) .-=+.|-.-+-|+=.|.=+++.++.+=..+ .++.+ |.+.- ....++...-+.+.++.+.-.- .+.- T Consensus 82 ~~~~~~~~npiv~~~I~~ia~~IA~~~~~~-~~~~~-g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s 159 (551) T protein:vir:80 82 GVLKKFGGNIILNAIINTRSNQVSMYCKPA-RHSEK-GVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDS 159 (551) T ss_pred HHHHHhhcCHHHHHHHHHHHHHHhhhhhhh-hhhcC-CCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccch Confidence 001122223555555666666665432221 22211 22210 0000111111123333332221 1124 Q ss_pred HHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeeeHHHhcc---CCC-----ceeEEecCCCC-cccccC Q lcl|NC_021303. 121 QAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIKS---KAG-----ETAEISLPDGK-THEFNR 191 (637) Q Consensus 121 qaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~~---k~g-----~~~~i~lPdG~-~he~~~ 191 (637) ..++++.++.+|-+-|.+|+.+.-...|. + . .++.|...-|+. ..| ...+...-+|. ..+|.. T Consensus 160 ~~~f~~~lv~dlll~Gnay~~i~rd~~G~-~------~-~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~ 231 (551) T protein:vir:80 160 FSSFVKKIVRDTYMYDQVNFEKVFNRNQS-M------V-RFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNA 231 (551) T ss_pred HHHHHHHHHHHHHhcCCEEEEEEECCCCc-E------E-EEEEeCCceeEEEECCccccccCceEEEEEeCCcEEEEEcc Confidence 56799999999999999998766544543 1 1 233333333321 111 11122222333 333433 Q ss_pred CCceE-EEEec-CCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCC Q lcl|NC_021303. 192 DLDSL-VRIWN-PRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPG 269 (637) Q Consensus 192 ~~d~l-~RvW~-P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg 269 (637) .+++ ||.|. +++.....--||+.++++.+.-..-..+...+..+.-..-.|||.+|....+ T Consensus 232 -~eiiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~l---------------- 294 (551) T protein:vir:80 232 -REMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQ---------------- 294 (551) T ss_pred -cceEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCC---------------- Confidence 3444 44442 2344444466888877777665555555544444444445566666542111 Q ss_pred cccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCcc-hhHHHHhhHHHHHHHHHhhc Q lcl|NC_021303. 270 APVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNE-VTEVEIKTRIDAITRLAMGL 348 (637) Q Consensus 270 ~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~d-vtevaiktR~daI~RlAmgl 348 (637) .....+.|.+.+.. .+.-.+ -+--+||+.. +.++...+.-. .+.--+++|+..+..+|.-. T Consensus 295 -------t~e~~~~lk~~~~~----~~~G~~-nag~~~vl~~------~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aF 356 (551) T protein:vir:80 295 -------SQHALEIFKREWKN----SLSGIN-GSWQIPVVSA------EDVKFVNMTPSARDMEFEKWLNYLINVISALY 356 (551) T ss_pred -------CHHHHHHHHHHHHH----HhcCcc-ccCccccccC------CCceEEEccCChhHHHHHHHHHHHHHHHHHHh Confidence 11133444444332 222111 2334566532 12333334322 23335788999999999999 Q ss_pred CCchhHhhcc-CC----------cceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCccc Q lcl|NC_021303. 349 DVSPERLLGM-SK----------GNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGL 417 (637) Q Consensus 349 Dv~pErLLGl-s~----------~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~L 417 (637) -|||. +||+ ++ .|+.++.+....-++-.|.|.+..|+++|++.++.. .| .+|.+.||...+ T Consensus 357 gVPp~-~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L~~~----~~---~~~~f~f~~~~~ 428 (551) T protein:vir:80 357 GIDPA-EINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAE----FG---DKYTFQFVGGDI 428 (551) T ss_pred cCCHH-HcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc----cC---CceEEEeeccCh Confidence 99986 5565 23 255556666666677789999999999999988753 22 468899985543 Q ss_pred ccCCCCCHHHHHHHhcCCcCHHHHHHHhcCcc-ccCCCCC-chHHHHHHHHHHhcCCchhHHHHHhhhccccccccCCCC Q lcl|NC_021303. 418 TSDPDLSDEAVEAHDRGAITSAALRRLLNVGE-DSGYDLT-TLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQP 495 (637) Q Consensus 418 t~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~-d~~yd~~-t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p 495 (637) .+.....+...++.+|.+|-.-.|+.+|+.- ..|=|.- ..-....+-+..-.++++. ..+.-.+.++ T Consensus 429 -~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~ 497 (551) T protein:vir:80 429 -KSELESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEH----------EKQQSNLQML 497 (551) T ss_pred -hhHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCcch----------hhhhhccccc Confidence 2222223444677789999999999999953 2221200 0000000000000011110 0001111110 Q ss_pred c--CCCCCCCCCCCCCCCCCCCCCccCCCCCCCc-ccCCCCcchHHHHHHHHHHHHHHHHhcccccCCCchhhhh Q lcl|NC_021303. 496 A--NAIESTREEDDEDSGARQQREPQTEDERSTE-EAASLNDRAAYLVAERLLVNRALDLAGKRRFKVNDAALKT 567 (637) Q Consensus 496 ~--~a~~~~~~~~d~~~~a~~g~EPdted~~~~~-~~a~~~~~a~~~aa~~llV~rALelAGkRr~~~~~~~~~~ 567 (637) . .+.+.+.++. ++|+..++-+.. ......+..... |.+.. +| +.-+.+.++ T Consensus 498 ~~~~~~~~~~~~~---------~~p~~~~~~~~~~~~~~~~~~~~~~---------~~~~~-~~--~~~~~~~~~ 551 (551) T protein:vir:80 498 QEQTGNRVSTDVE---------DIPDGKDTTGDIGKDGQRKDKDNAN---------AGKQG-MK--GDKPNDWQT 551 (551) T ss_pred cCcCCCCCCCCCC---------CCCCccccCCCccccccccCccccc---------hhhhh-cC--CCCccccCC Confidence 0 0001111111 111111110000 001111111111 11111 12 122222222 No 61 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=98.75 E-value=1.2e-09 Score=69.56 Aligned_cols=402 Identities=12% Similarity=0.102 Sum_probs=188.9 Q ss_pred ceEEecCCCCCcccccchheehhccccchhhhhhhhcc--cccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEEEEe Q lcl|NC_021303. 6 LRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLM--GTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTLIPS 83 (637) Q Consensus 6 lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~--g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~as 83 (637) .-|.+|.+... . ..+..+....++.-.+ |......-.+ ..+ ...-+.-.|.-+++++|++.|..- T Consensus 1 Mg~f~~~~~r~-------~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~al-~~~~v~~cv~~Ia~~iA~~p~~~~ 67 (416) T protein:vir:81 1 MGIFYKNEKRD-------L---QYNEDDLQMMVQTLPGFQGTKLRQYKDI--EAI-RHSDIFTAVMMIASDLARMPIRVT 67 (416) T ss_pred CCccccccccc-------c---cCCCcchhHHHHHhccccccCccccchh--hhh-cchHHHHHHHHHHHhhccCceEEe Confidence 55555443211 0 0111111111111100 0000110000 000 011222346778999999887542 Q ss_pred eeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccccee Q lcl|NC_021303. 84 AIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYA 163 (637) Q Consensus 84 eiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~ 163 (637) .+ |. +. . .+.+..+.+.=..--+-..++++.++.+|-+-|++|+.+.-...|. +. .++. T Consensus 68 ---~~-~~----~~-~----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~-------~~-~L~~ 126 (416) T protein:vir:81 68 ---VN-GQ----IN-Y----SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGE-------PM-NLTF 126 (416) T ss_pred ---cC-cc----cc-c----cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-------EE-EEEE Confidence 23 32 22 1 2345555544444557788999999999999999998865433332 12 2334 Q ss_pred eeHHHhc--cC-CCceeEE-ecCCCCcc----cccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_021303. 164 VTREEIK--SK-AGETAEI-SLPDGKTH----EFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (637) Q Consensus 164 vt~~Ei~--~k-~g~~~~i-~lPdG~~h----e~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 235 (637) |..+.+. .. .|...+. ..-+|..+ +|.. .| ||++=. ++-....--||+..+.+.+.=..-..+...+.. T Consensus 127 i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~-~e-vihir~-~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f 203 (416) T protein:vir:81 127 RKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKF-ED-MLDIKF-YSLDGINGLSLLDTLSRTIESDNNGKDFLNNFL 203 (416) T ss_pred EcCceeEEEECCCccEEEEEEEecCCCceeEEEEcc-cc-EEEecc-CCCCCccccCHHHHHHHHHHHHHHHHHHHHHHH Confidence 4333332 11 2222211 11122222 2222 23 333311 122334556776666665544444444555545 Q ss_pred HhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_021303. 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (637) Q Consensus 236 ~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~E 315 (637) +.-..-.|||-+|+.++= ..+.+.+.+.|.+.-. ..+..+ -|+|+ ++. T Consensus 204 ~ng~~~~gil~~~~~~~~------------------------~~~~~~~~~~~~~~~~-g~~nag-----~~~vl--~~g 251 (416) T protein:vir:81 204 RNGTHAGGILKMKGVLDN------------------------KKARDRAREEFHKSFS-GTKQAG-----KVVVL--DES 251 (416) T ss_pred hccCCCcEEEEeCCCCCC------------------------HHHHHHHHHHHHHHhc-CccccC-----ceeec--CCC Confidence 555555677777663221 0123344444333221 111111 23333 332 Q ss_pred HhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHH Q lcl|NC_021303. 316 HLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILT 395 (637) Q Consensus 316 hi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr 395 (637) -+++.|.+..+.-+ -+++|+.++..+|.-+-|||. |||+.+.+. +..|.+-.=++ -|.|.+..|+++|+..++. T Consensus 252 --~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~-~~~~~~~~~~~-~l~P~~~~ie~~ln~~l~~ 325 (416) T protein:vir:81 252 --MTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLH-KFGIETANM-SITDANLDYLS-TLKPYITCVCAELNFKFND 325 (416) T ss_pred --ceeEeccCCHHHHH-HHHHHHHHHHHHHHHhCCCHH-HcCCCCCCc-cHHHHHHHHHH-HHHHHHHHHHHHHhhhccc Confidence 34555554433222 378899999999999999987 578865432 32333222222 5999999999999987643 Q ss_pred HHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCC Q lcl|NC_021303. 396 PLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKN 472 (637) Q Consensus 396 ~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~ 472 (637) .. ..|-+.||.+.|.. +|..+.+ ..++..|.+|-.-.|+.+|++--.+-|-. .-+ +. T Consensus 326 ~~--------~~~~~~f~~~~l~~-~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~--~~~-------~~-- 385 (416) T protein:vir:81 326 EY--------VNREFKFDTTEIRV-VDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGS--IHR-------VD-- 385 (416) T ss_pred cc--------cCceEEEechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc--eEe-------ec-- Confidence 21 24667999999743 3433333 23678899999999999999754432200 000 00 Q ss_pred chhHHHHHhhhccccccccCC-CCcCCCCCCCCCCCCCCCCCCCCCccCCCC Q lcl|NC_021303. 473 PELIAMYAPLLSSQLAGIEFP-QPANAIESTREEDDEDSGARQQREPQTEDE 523 (637) Q Consensus 473 P~Li~~~apLl~~~~~~ie~P-~p~~a~~~~~~~~d~~~~a~~g~EPdted~ 523 (637) -.++| ++.. +.+.. ..+.. +....|.|-. + T Consensus 386 ----~n~~~--------~~~~~~~~~~--~~~~~----~~~~kgGe~n---~ 416 (416) T protein:vir:81 386 ----LNHVN--------IELVDEYQMN--KSRAT----DKKLKGGEEN---E 416 (416) T ss_pred ----ccccc--------cccccccCcc--ccccc----ccccCCCCCC---C Confidence 01111 1111 00000 00000 0001111100 0 No 62 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=98.75 E-value=1.2e-09 Score=69.56 Aligned_cols=402 Identities=12% Similarity=0.102 Sum_probs=188.9 Q ss_pred ceEEecCCCCCcccccchheehhccccchhhhhhhhcc--cccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEEEEe Q lcl|NC_021303. 6 LRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLM--GTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTLIPS 83 (637) Q Consensus 6 lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~--g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~as 83 (637) .-|.+|.+... . ..+..+....++.-.+ |......-.+ ..+ ...-+.-.|.-+++++|++.|..- T Consensus 1 Mg~f~~~~~r~-------~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~al-~~~~v~~cv~~Ia~~iA~~p~~~~ 67 (416) T protein:vir:45 1 MGIFYKNEKRD-------L---QYNEDDLQMMVQTLPGFQGTKLRQYKDI--EAI-RHSDIFTAVMMIASDLARMPIRVT 67 (416) T ss_pred CCccccccccc-------c---cCCCcchhHHHHHhccccccCccccchh--hhh-cchHHHHHHHHHHHhhccCceEEe Confidence 55555443211 0 0111111111111100 0000110000 000 011222346778999999887542 Q ss_pred eeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccccee Q lcl|NC_021303. 84 AIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYA 163 (637) Q Consensus 84 eiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~ 163 (637) .+ |. +. . .+.+..+.+.=..--+-..++++.++.+|-+-|++|+.+.-...|. +. .++. T Consensus 68 ---~~-~~----~~-~----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~-------~~-~L~~ 126 (416) T protein:vir:45 68 ---VN-GQ----IN-Y----SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGE-------PM-NLTF 126 (416) T ss_pred ---cC-cc----cc-c----cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-------EE-EEEE Confidence 23 32 22 1 2345555544444557788999999999999999998865433332 12 2334 Q ss_pred eeHHHhc--cC-CCceeEE-ecCCCCcc----cccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_021303. 164 VTREEIK--SK-AGETAEI-SLPDGKTH----EFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (637) Q Consensus 164 vt~~Ei~--~k-~g~~~~i-~lPdG~~h----e~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 235 (637) |..+.+. .. .|...+. ..-+|..+ +|.. .| ||++=. ++-....--||+..+.+.+.=..-..+...+.. T Consensus 127 i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~-~e-vihir~-~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f 203 (416) T protein:vir:45 127 RKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKF-ED-MLDIKF-YSLDGINGLSLLDTLSRTIESDNNGKDFLNNFL 203 (416) T ss_pred EcCceeEEEECCCccEEEEEEEecCCCceeEEEEcc-cc-EEEecc-CCCCCccccCHHHHHHHHHHHHHHHHHHHHHHH Confidence 4333332 11 2222211 11122222 2222 23 333311 122334556776666665544444444555545 Q ss_pred HhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_021303. 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (637) Q Consensus 236 ~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~E 315 (637) +.-..-.|||-+|+.++= ..+.+.+.+.|.+.-. ..+..+ -|+|+ ++. T Consensus 204 ~ng~~~~gil~~~~~~~~------------------------~~~~~~~~~~~~~~~~-g~~nag-----~~~vl--~~g 251 (416) T protein:vir:45 204 RNGTHAGGILKMKGVLDN------------------------KKARDRAREEFHKSFS-GTKQAG-----KVVVL--DES 251 (416) T ss_pred hccCCCcEEEEeCCCCCC------------------------HHHHHHHHHHHHHHhc-CccccC-----ceeec--CCC Confidence 555555677777663221 0123344444333221 111111 23333 332 Q ss_pred HhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHH Q lcl|NC_021303. 316 HLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILT 395 (637) Q Consensus 316 hi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr 395 (637) -+++.|.+..+.-+ -+++|+.++..+|.-+-|||. |||+.+.+. +..|.+-.=++ -|.|.+..|+++|+..++. T Consensus 252 --~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~-~~~~~~~~~~~-~l~P~~~~ie~~ln~~l~~ 325 (416) T protein:vir:45 252 --MTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLH-KFGIETANM-SITDANLDYLS-TLKPYITCVCAELNFKFND 325 (416) T ss_pred --ceeEeccCCHHHHH-HHHHHHHHHHHHHHHhCCCHH-HcCCCCCCc-cHHHHHHHHHH-HHHHHHHHHHHHHhhhccc Confidence 34555554433222 378899999999999999987 578865432 32333222222 5999999999999987643 Q ss_pred HHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCC Q lcl|NC_021303. 396 PLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKN 472 (637) Q Consensus 396 ~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~ 472 (637) .. ..|-+.||.+.|.. +|..+.+ ..++..|.+|-.-.|+.+|++--.+-|-. .-+ +. T Consensus 326 ~~--------~~~~~~f~~~~l~~-~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~--~~~-------~~-- 385 (416) T protein:vir:45 326 EY--------VNREFKFDTTEIRV-VDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGS--IHR-------VD-- 385 (416) T ss_pred cc--------cCceEEEechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc--eEe-------ec-- Confidence 21 24667999999743 3433333 23678899999999999999754432200 000 00 Q ss_pred chhHHHHHhhhccccccccCC-CCcCCCCCCCCCCCCCCCCCCCCCccCCCC Q lcl|NC_021303. 473 PELIAMYAPLLSSQLAGIEFP-QPANAIESTREEDDEDSGARQQREPQTEDE 523 (637) Q Consensus 473 P~Li~~~apLl~~~~~~ie~P-~p~~a~~~~~~~~d~~~~a~~g~EPdted~ 523 (637) -.++| ++.. +.+.. ..+.. +....|.|-. + T Consensus 386 ----~n~~~--------~~~~~~~~~~--~~~~~----~~~~kgGe~n---~ 416 (416) T protein:vir:45 386 ----LNHVN--------IELVDEYQMN--KSRAT----DKKLKGGEEN---E 416 (416) T ss_pred ----ccccc--------cccccccCcc--ccccc----ccccCCCCCC---C Confidence 01111 1111 00000 00000 0001111100 0 No 63 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.74 E-value=7e-08 Score=59.80 Aligned_cols=524 Identities=14% Similarity=0.135 Sum_probs=224.0 Q ss_pred CCC----------------------------CcceEEecCCCCCcccccchheehhccccch--------hhhhhhhccc Q lcl|NC_021303. 1 MAA----------------------------TSLRVVRRPKGSAPAARRRSLTAASQLITDP--------QKQMKTSLMG 44 (637) Q Consensus 1 ma~----------------------------~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p--------~~~~k~~~~g 44 (637) ||- -+.|+--.|.+.|+.. .....-..|| ++..++..+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~-----~~~~~~~~d~~~~~~~r~g~~~~~~~~g 75 (648) T protein:vir:79 1 MARKVWGRGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGG-----GGGGSAKRDPKMSLVKRIGLAIMDGGGG 75 (648) T ss_pred CccchhcchhhhhhhhhccCccccccccccccccccCCCccccCCCC-----cccccccccchhHHHHHhHHHHHhhcCC Confidence 111 1233333333332210 0001111122 2333444333 Q ss_pred ccccchh------hHHHHHhhhhhhHhhHhhhhhcceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCc Q lcl|NC_021303. 45 TARNEWQ------SEAWDFSESIGELSYYISWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGP 118 (637) Q Consensus 45 ~~r~~WQ------~eAW~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~ 118 (637) . .+|. ....+.|..-+-++-.|.-++++++++-++..- +.+.++ +. .....+.. -.--. T Consensus 76 ~--~~~~epp~d~~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~---~~~~~~-----~~----~~~~~ll~-rPn~~ 140 (648) T protein:vir:79 76 G--RDFEEPEFDFNEITSAYNTEGYVRQAVDKYIEMMFKADWDFVS---KNPNAV-----EY----IRMRFTLM-AEATQ 140 (648) T ss_pred c--cccccCCcCHHHHHHHHhcChHHHHHHHHHHHHHhhCcceEEe---cCCccc-----hh----hHHHHHhh-ccCCC Confidence 2 2232 123467777788888899999999998776422 222222 11 11112211 12223 Q ss_pred ccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccc-------ccccceeeeHHHhcc---CCCcee-EE-ecCCCCc Q lcl|NC_021303. 119 LGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAA-------PRARWYAVTREEIKS---KAGETA-EI-SLPDGKT 186 (637) Q Consensus 119 lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~-------~~~~W~~vt~~Ei~~---k~g~~~-~i-~lPdG~~ 186 (637) +...++++.+..+|-+=|..|+.++-..+|.++-.... +....+-|..+-++. ..|... +. .-.+|.+ T Consensus 141 ~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~~~Y~y~~~g~~~ 220 (648) T protein:vir:79 141 IPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMIKGWQQEQEGQDK 220 (648) T ss_pred CCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCCceeeeEEEecCCce Confidence 56678999999999999999998776666542211000 001111222222221 111110 11 1112222 Q ss_pred -ccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccc Q lcl|NC_021303. 187 -HEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQA 265 (637) Q Consensus 187 -he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~ 265 (637) ..|. .+-||++...++..-..--||+.+|.+.|.-.....+.-.+--+.=....|||.+|.+ + T Consensus 221 ~~~~~--~~dIIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~------~-------- 284 (648) T protein:vir:79 221 PQKFK--PEDIVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLE------Q-------- 284 (648) T ss_pred eEEec--CccEEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC------c-------- Confidence 1222 2347788666665555677888888777755444444333322222233455554310 0 Q ss_pred cCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCcchh--H-HHHhhHHHHHH Q lcl|NC_021303. 266 QIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVT--E-VEIKTRIDAIT 342 (637) Q Consensus 266 ~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvt--e-vaiktR~daI~ 342 (637) ......+.+++-+.+ ++.. .+ +.. ...+.+-+.|....+ + --+++|+-.+. T Consensus 285 ----------~~~e~~k~~~e~~~~----~~~~-------~~-i~g----g~v~~~~~~i~~~~s~~dlqfle~rk~~~~ 338 (648) T protein:vir:79 285 ----------EGFGAEEGEVDLVRG----EVEN-------MD-VEG----GMVTTERVNISSIASNQIIDAKEYLKHFEQ 338 (648) T ss_pred ----------cchHHHHHHHHHHHH----hccc-------cc-ccc----cccccceeeccccCCHHHHHHHHHHHHHHH Confidence 001122333333322 1111 00 111 111223333332212 1 24678999999 Q ss_pred HHHhhcCCchhHhhccC-CcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCCh---HHeEEeecCcccc Q lcl|NC_021303. 343 RLAMGLDVSPERLLGMS-KGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDP---TKYILWYDASGLT 418 (637) Q Consensus 343 RlAmglDv~pErLLGls-~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp---~kYvvw~DaS~Lt 418 (637) .+|...-|||. |||+. ++|.-++.+. ....+--|.|....++..+...+++..|...++|+ ..|-+.||.+.|- T Consensus 339 eIa~aFgVPP~-lLG~~~~ss~stae~~-~~~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Ll 416 (648) T protein:vir:79 339 RAFTVLGVSEL-MMGRGGTASRSTGDNL-SSDFKDRIKALQKVMATFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEID 416 (648) T ss_pred HHHHHhCCCHh-HcccCCCccchHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccceEEEeecccc Confidence 99999999996 66985 4554444332 33345568899999999999999988888777764 2455688877664 Q ss_pred cC-C-CCCHHHHHHHhcCCcCHHHHHHHhcCcc-ccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhccccccccCCCC Q lcl|NC_021303. 419 SD-P-DLSDEAVEAHDRGAITSAALRRLLNVGE-DSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQP 495 (637) Q Consensus 419 ~d-P-D~tdeA~~a~drGaIt~eAlrr~lgl~~-d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p 495 (637) .. + .+.+.+..+++.|.+|-...|..+|++- .+|.+.. .+..++-.... ...+.. T Consensus 417 r~D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~-----------~l~~~~~~~~~-----------~~~~~~ 474 (648) T protein:vir:79 417 MDSKIKLENQAVFLYEHNAISEDEMRELIGRDPVDDGEGRA-----------KMHLQMVTIAQ-----------ATALAA 474 (648) T ss_pred hhhHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcc-----------ccccccccchh-----------cccccc Confidence 32 1 1223345678899999999999999963 2332211 11112111000 010111 Q ss_pred cCCCCCCCCC-CCCCCCCCCCCCccCCCCCCCcccCCCCcchHHHHHHHHHHHHHHHHhcccccCCCchhhhhHhhcCch Q lcl|NC_021303. 496 ANAIESTREE-DDEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRFKVNDAALKTKLRDVPA 574 (637) Q Consensus 496 ~~a~~~~~~~-~d~~~~a~~g~EPdted~~~~~~~a~~~~~a~~~aa~~llV~rALelAGkRr~~~~~~~~~~rlr~ip~ 574 (637) ++..+++... ....++...+.+.++.++-..+...+.... .-.--++.|-.-.|+-- | -.++++. .. T Consensus 475 ~~~~~~~~~~~~a~~eg~~~e~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~~~~~~-~-----~~~~~~~-----~~ 542 (648) T protein:vir:79 475 LAPTPAGGSSASASGDKKKKATDNKTKPTNQHGTKTSPKKQ-TNGRHVRYMQEMLLEYT-T-----LNEAIKA-----LI 542 (648) T ss_pred CCCCCCCCCCCCccccccccccCCCCCCCCCCCcCCCCccc-cchhhhhhhhhhhhcch-h-----hhHHHhh-----HH Confidence 1111111111 111122222222222221111111111110 00111222222233321 0 0011111 11 Q ss_pred hhhhhhcCCCCHHHHHHHHhcccccccHHHHHHhCCCHHHHHHHHHHHHH-------------------HHHHhhhhccc Q lcl|NC_021303. 575 HEYHRVLPPVRSSEIPRLIAGWDTALEDEVVASLGLDNEKLRNAVLATVR-------------------RQLTQPLIEGE 635 (637) Q Consensus 575 h~~h~~~~PV~~~~v~rLi~GWd~~ld~~~~a~lG~Dp~~lr~~v~~~v~-------------------~~lt~~vvd~~ 635 (637) .+||.+. |...|. .+...+-+-+..++.....|-- +..-.++=-+| T Consensus 543 ~~~~~~~--------------~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 607 (648) T protein:vir:79 543 ERYYQYG--------------SKEHLK-SINGSLMYTEGRLLELTTQYWGEEVTEKVRIPFHRMTENLREEVMSTIDKVE 607 (648) T ss_pred HHHHHHh--------------HHHHHH-hhhhhheeccchhHHHHHHHhhhhhhceeeeeHHHHHHHHHHHHHhhhhhhh Confidence 2333332 222221 2222222222222222221111 11100000011 Q ss_pred cC Q lcl|NC_021303. 636 VV 637 (637) Q Consensus 636 v~ 637 (637) -+ T Consensus 608 ~~ 609 (648) T protein:vir:79 608 GV 609 (648) T ss_pred hh Confidence 11 No 64 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=98.69 E-value=2.1e-09 Score=68.19 Aligned_cols=374 Identities=13% Similarity=0.078 Sum_probs=192.1 Q ss_pred ceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEEEEeee Q lcl|NC_021303. 6 LRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTLIPSAI 85 (637) Q Consensus 6 lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~asei 85 (637) .++.++.|.++....-.. .......++. .+..+..|. .-....++ .++-+.-.+.-++++||.+.+..-+- T Consensus 1 M~~f~~~~~~~~~~~~~~--~~~~~~~~~~-~~~~~~~~~--~v~~~~~~----~~~~v~~~i~~ia~~ia~~p~~~~~~ 71 (386) T protein:vir:48 1 MPIFNITNLATESPPISQ--GGFFDITDPD-FLSTLNGSE--WVSAESAL----RNSDLFSIINQLSNDLATVKLTASRK 71 (386) T ss_pred Cccccccccccccccccc--ccccccccch-hcccccCCc--eechhhhh----cchHHHHHHHHHHHhhccCceeeccc Confidence 566666655432111000 0001111111 011111111 11122222 35667778888999999998865432 Q ss_pred ccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeee Q lcl|NC_021303. 86 DPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVT 165 (637) Q Consensus 86 D~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt 165 (637) ..+ .+ ..-..--+-..++++.++.+|-+-|+.|+.+.-...|. + ..++.|. T Consensus 72 ~~~------~l---------------~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~-~-------~~L~~l~ 122 (386) T protein:vir:48 72 QLQ------GI---------------IDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGR-D-------MKWEYLR 122 (386) T ss_pred hhH------HH---------------hhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCc-E-------EEEEEec Confidence 211 01 11112225677899999999999999998765433332 1 2444444 Q ss_pred HHHhcc---CCCcee-E-EecCC---CCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHh Q lcl|NC_021303. 166 REEIKS---KAGETA-E-ISLPD---GKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKS 237 (637) Q Consensus 166 ~~Ei~~---k~g~~~-~-i~lPd---G~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~S 237 (637) .+.+.. ..++.+ + +...+ |...+|.. +=||++=.+++.....--||+..+...+.-.....+...+..+. T Consensus 123 ~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~--~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~n 200 (386) T protein:vir:48 123 PSQVSFNRLDNKDGIYYNITFDDPRIPPKQHVPQ--GDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKN 200 (386) T ss_pred CceeEEEEcCCCceEEEEEEecCccccceeEecC--ccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 444431 112222 1 22222 22233433 33555556666655556788888888777666666666555555 Q ss_pred HhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_021303. 238 RVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHL 317 (637) Q Consensus 238 RL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi 317 (637) -....|||-+|+.++ ....+.+++.+.+ ...+ +--|+|+. +. T Consensus 201 g~~~~~ii~~~~~~~-------------------------~e~~~~~~~~~~~----~~~n-----~g~~~vl~--~g-- 242 (386) T protein:vir:48 201 ALNANGILKIKGGGL-------------------------LDFKTKLSRSRQA----MKQM-----QGGPLVLD--DL-- 242 (386) T ss_pred cCCcceEEEeCCCCC-------------------------HHHHHHHHHHHHH----hhcC-----CCCceecC--CC-- Confidence 555666776555221 1122334433322 1122 22234433 22 Q ss_pred cccceeecCcchhHH-HHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHH Q lcl|NC_021303. 318 EKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTP 396 (637) Q Consensus 318 ~~ikHlkf~~dvtev-aiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~ 396 (637) -+++.|.. ...+. -+++|+-.+..+|...-|||. |||.+.++ -+.++-..+-++.-|.|.+..||++|++.++.. T Consensus 243 ~~~~~l~~--~~~d~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~-~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~ 318 (386) T protein:vir:48 243 EEFTPLEI--KSNVSQLLKQADWTTGQFAKVYGIPEN-VVGGQGDQ-QSSLEMSLDLYNKAVSRYLRPFLSELSQKLSCD 318 (386) T ss_pred ceEEEcCC--ChhHHHHHHHHHHHHHHHHHHhCCCHH-HhCCCCCc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch Confidence 23444443 33333 389999999999999999987 55763322 266777777888899999999999999888753 Q ss_pred HHHHhCCChHHeEEeecCcccccCCCCC---HHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCc Q lcl|NC_021303. 397 LLAREGIDPTKYILWYDASGLTSDPDLS---DEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNP 473 (637) Q Consensus 397 ~L~~eGiDp~kYvvw~DaS~Lt~dPD~t---deA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P 473 (637) . ++| ... ..++|.. +....++..|.+|-...|+.+|..- +.. .|. +.. .++ T Consensus 319 ~----~~~---------~~~-~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~---~~~--~~~-~~~------~~~ 372 (386) T protein:vir:48 319 V----DAD---------ILP-AVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAE---ILP--KEL-PEG------ENP 372 (386) T ss_pred h----hcc---------hhh-hhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCC---CCC--ccc-hhh------cCC Confidence 2 221 111 1123322 2344667789999999998887532 211 110 000 000 Q ss_pred hhHHHHHhhhcccccccc Q lcl|NC_021303. 474 ELIAMYAPLLSSQLAGIE 491 (637) Q Consensus 474 ~Li~~~apLl~~~~~~ie 491 (637) . ..|+--+.--+-| T Consensus 373 ~----~~~~~gGd~~~~~ 386 (386) T protein:vir:48 373 N----KTTLKGGEINGED 386 (386) T ss_pred C----CCccCCCCCCCCC Confidence 0 0011000111111 No 65 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=98.68 E-value=3.4e-08 Score=61.54 Aligned_cols=487 Identities=14% Similarity=0.109 Sum_probs=219.1 Q ss_pred CCCC--cceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeee Q lcl|NC_021303. 1 MAAT--SLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRT 78 (637) Q Consensus 1 ma~~--~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~ 78 (637) |=-. |.|-+=+||.--- -...++.+.. ..+-.. .+. --+|..=+ ++|.+-+-++--|.=+++.++++ T Consensus 1 ~~~~~~~i~s~~~~~~i~~------~~~~s~~~~~--~~~~~~-~~p-p~~~~~la-~l~~~n~~v~scI~~ia~~IA~l 69 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKR------EEVESQALGE--TRFEEY-VEP-KVNPLVLL-SLLQVNPYHASACSIKANDIIRT 69 (542) T ss_pred Cccccccccccccchhhhh------cccccccccc--ccCCcc-ccC-CCCHHHHH-HHHhhcHHHHHHHHHHHHHHhhC Confidence 4333 3333333332210 0111111100 000000 000 01222111 45556666677778888888888 Q ss_pred EEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccc Q lcl|NC_021303. 79 TLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPR 158 (637) Q Consensus 79 rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~ 158 (637) .+-. +.++.. .+..-+-.--+-..++++.++.+|-+-|.+|+.+.-...|. +.+...-. T Consensus 70 ~~~~---~~~~~~-----------------~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~-~~~L~~l~ 128 (542) T protein:vir:41 70 GYIL---EGDDEG-----------------VVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGD-PIRFEYIP 128 (542) T ss_pred ceee---ecccch-----------------hhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCc-EEEEEEEc Confidence 7754 222100 01111223335567789999999999999999776544443 22221111 Q ss_pred ccceeeeHHHhc----cCCCceeEEecC---------CCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHH Q lcl|NC_021303. 159 ARWYAVTREEIK----SKAGETAEISLP---------DGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIE 225 (637) Q Consensus 159 ~~W~~vt~~Ei~----~k~g~~~~i~lP---------dG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~ 225 (637) ..+..++++.-. ..+.+.+...-- +|......+..| ||++=++++..-..--||+.+++..+.=-. T Consensus 129 ~~~v~v~~d~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~e-IiHir~~~~~~~~~Glspi~~~~~~i~~~~ 207 (542) T protein:vir:41 129 SHTIRVHKDGSRYRQTWDGVNITHFKDYRYEGEINPETGEDQDSVGANE-LVFIHIPSPVCSYYGVPRYVSAAPAILAMQ 207 (542) T ss_pred CcceEEEEcCCeeEeeecCCcceeEEeecccccccccccccccccCccc-EEEecCCCCCCCcccccHHHHHHHHHHHHH Confidence 122222221100 001111111000 011111111123 455545555555566789988887664444 Q ss_pred hhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccc Q lcl|NC_021303. 226 RTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAY 305 (637) Q Consensus 226 rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~ 305 (637) -..+...+..+.-.+..|||.+|..+.=-.... .-....+.+.|.+.+- ..+.. ..-.+- T Consensus 208 ~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~---------------~~~~~e~~~~lk~~~~----~~~~g-~~~n~g 267 (542) T protein:vir:41 208 KIDEYNYAFFDNYTIPSYVITVTGEFEDELEED---------------PDGNPTGRTVIQALIE----DNFKH-LKEAPH 267 (542) T ss_pred HHHHHHHHHHhccCCccEEEEeCCccccccccc---------------cccCHHHHHHHHHHHH----HHHhh-hhcccC Confidence 444444444444455567898887432111000 0011223344433332 22221 112345 Q ss_pred cceeEeechHHhcccceeecCcch-hHHHHhhHHHHHHHHHhhcCCchhHhhccCC---cceeeeEEeccCceeEeechh Q lcl|NC_021303. 306 IPLVASVAAEHLEKVQHIKFGNEV-TEVEIKTRIDAITRLAMGLDVSPERLLGMSK---GNHWSAWAIGDEDVQLHIKPV 381 (637) Q Consensus 306 vPiva~vP~Ehi~~ikHlkf~~dv-tevaiktR~daI~RlAmglDv~pErLLGls~---~NHWsAW~I~dedVrlHI~P~ 381 (637) .|+|+..|++--+.++...+.... +.--++.|+..+..+|...-|||. +||+.+ .|.-++.+....-++.-|.|. T Consensus 268 k~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~-~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~ 346 (542) T protein:vir:41 268 TPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPY-RLGIADTGPLGGNFAEVTRRTYYESVVRPQ 346 (542) T ss_pred ceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHH-HhCcCCCcccccccHHHHHHHHHHHHHHHH Confidence 788888887655556655554333 333478999999999999999998 568842 344456777777778889999 Q ss_pred HHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHh-cCccccCCCCCchHH Q lcl|NC_021303. 382 MDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLL-NVGEDSGYDLTTLDG 460 (637) Q Consensus 382 me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~l-gl~~d~~yd~~t~eg 460 (637) +..|+++|++.++.. .+ ..|.+.||...|. +.|+...+..++..|.+|-.-.|..| |++- + | T Consensus 347 ~~~ie~~ln~~L~~~----~~---~~~~~~f~~~~ll-~~d~~~~~~~~v~~GilT~NE~Re~L~g~~p--g-d------ 409 (542) T protein:vir:41 347 QNIISSILTDFFQVK----FN---PKTRFKFNDETLL-ESDSVRNCALLVQSGVLTPAEARERLFGLDG--G-P------ 409 (542) T ss_pred HHHHHHHHHhhcccc----cC---CceEEEecchhhc-chHHHHHHHHHHhCCCCCHHHHHHhhCCCCC--C-C------ Confidence 999999999765532 22 2477899998885 56777777788889999998888644 5542 2 1 Q ss_pred HHHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCC-CCCCCCCCccCC----CCCCCcccCCCCcc Q lcl|NC_021303. 461 CREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDED-SGARQQREPQTE----DERSTEEAASLNDR 535 (637) Q Consensus 461 ~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~-~~a~~g~EPdte----d~~~~~~~a~~~~~ 535 (637) +|-++ |+- ...+.+. . +....+.+.+.+ +.......|+.. +.-+..+... ... T Consensus 410 -----------d~~l~----p~~-~~~~~~~--~---~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~-~~~ 467 (542) T protein:vir:41 410 -----------DIFMV----PSK-GAAKSVK--R---QERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKKK-KID 467 (542) T ss_pred -----------ccccc----ccc-ccccccc--c---CCcCCCCCchhhhhhcccccCccccccccccccchhhcc-ccc Confidence 11111 110 0000000 0 000000000000 000000001000 0000000000 000 Q ss_pred hHHHHHHHHHHHHHHHHhccccc-------------------CCCchhhhhHhhcCc-----------hhhhhhhcCCCC Q lcl|NC_021303. 536 AAYLVAERLLVNRALDLAGKRRF-------------------KVNDAALKTKLRDVP-----------AHEYHRVLPPVR 585 (637) Q Consensus 536 a~~~aa~~llV~rALelAGkRr~-------------------~~~~~~~~~rlr~ip-----------~h~~h~~~~PV~ 585 (637) .+. .+.+ .-|++ +||.-. ...+-.+ .|...+. .|-+++++ T Consensus 468 ~~~---~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~---- 537 (542) T protein:vir:41 468 ESL---AEFR-AEAYE-AGKKMLIIGGDMGSMSALNQGVSVIPSKPLNL-ERYEELLEASVEDMIGRIRHYLYKVI---- 537 (542) T ss_pred chh---hhhH-HhHHh-cCceEEEeecCchhhhhhhccceeccCCCcCh-HHHHHHHHhhHHHHHHHHHHHHHHHh---- Confidence 000 0000 11111 222100 0000000 1111111 13333333 Q ss_pred HHHHHHHHhccccc Q lcl|NC_021303. 586 SSEIPRLIAGWDTA 599 (637) Q Consensus 586 ~~~v~rLi~GWd~~ 599 (637) ||-.. T Consensus 538 ---------~~~~~ 542 (542) T protein:vir:41 538 ---------GWREL 542 (542) T ss_pred ---------hhccC Confidence 45555 No 66 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=98.62 E-value=3e-08 Score=61.80 Aligned_cols=443 Identities=12% Similarity=0.045 Sum_probs=209.4 Q ss_pred CCCCcc-eEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeE Q lcl|NC_021303. 1 MAATSL-RVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTT 79 (637) Q Consensus 1 ma~~~l-r~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (637) |--+.- .+..++....+ +..++..||+. ++.++.+..-+.+..+....+.+....=.|-.--+|-++++.... T Consensus 1 m~~~~~~~~a~~~~~~~~-~~~~~y~aa~~-----~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 74 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLVP-VGASAYEGASG-----GHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWV 74 (495) T ss_pred CCcccccccccchhhhhH-HHhhhhhcccc-----CcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 333311 11111110011 01112333321 122333222234455554444433333333333445444444322 Q ss_pred EE--EeeeccccCCCCCcccCCCCcccchHHHHHHHhc-----cCcccHHHHHHHHHhhhcccccEEEEEEeecCCcc-- Q lcl|NC_021303. 80 LI--PSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIA-----DGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDP-- 150 (637) Q Consensus 80 L~--aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iA-----gG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~-- 150 (637) =+ -+-|-|.- ..++..-+.++.+.-+.-+ .|.+--.+|.+.+...+-+-||+.+++..++.... T Consensus 75 ~~vVG~Gi~p~~-------~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~ 147 (495) T protein:vir:10 75 AAAVGNGLTPRW-------RMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLS 147 (495) T ss_pred HhhcCCCccccc-------CCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCc Confidence 11 12233220 1111112233333333333 58888888999999999999999999887754321 Q ss_pred -ccccccccccceeeeHHHhccCC-------------CceeEEe----cC-CC----CcccccC-CCceEEEEecCCccc Q lcl|NC_021303. 151 -VTGLAAPRARWYAVTREEIKSKA-------------GETAEIS----LP-DG----KTHEFNR-DLDSLVRIWNPRPRK 206 (637) Q Consensus 151 -~~~~~~~~~~W~~vt~~Ei~~k~-------------g~~~~i~----lP-dG----~~he~~~-~~d~l~RvW~P~prr 206 (637) +...+.-+.+++.....+....+ |..+..- -| |+ ...++.. .-.-|+|+++++|.- T Consensus 148 ~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~~r~gQ 227 (495) T protein:vir:10 148 VPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTVLTVRS 227 (495) T ss_pred cceEEEEechhhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEeccccCCCc Confidence 22333333333332222211111 1111110 01 10 0111111 113456777544432 Q ss_pred ccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCcee-eecccCCCCCcccccccccccCCCcccccCCCchhHHHHH Q lcl|NC_021303. 207 ASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVL-FVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLA 285 (637) Q Consensus 207 a~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvl-fvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~ 285 (637) ..-- |--++| ++ |..+.++..+....-.+.+=+. ||=++ .|+.......+.+...+ . T Consensus 228 ~RGi--s~la~i--~~-l~~l~~y~dael~~a~i~A~~~~fi~~~--~~~~~~~~~~~~~~~~~------~--------- 285 (495) T protein:vir:10 228 DAGA--PWFQLL--LR-LNELDQYEDAELVRKKTAALFAAFIQEA--TADSTGGPTIGQPKRSK------G--------- 285 (495) T ss_pred ccCc--chhHHH--HH-HHHhhHHHHHHHHHHHHhhhheeeeecC--CCccccccccCcccccc------C--------- Confidence 2221 222333 33 6677777777776666666544 44332 12222110000000000 0 Q ss_pred HHHHHHHhhcccCccccccccc-eeEe-echHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhcc-CCcc Q lcl|NC_021303. 286 TMIYQASVAAMEDENSQAAYIP-LVAS-VAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGN 362 (637) Q Consensus 286 ~ml~~va~aai~De~S~AA~vP-iva~-vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGl-s~~N 362 (637) ..+.-.+=| .|.. -|||-|+-++.= .-+.-...+..-.++.+|+|+.||-|.|+|= |++| T Consensus 286 -------------~~~~~~l~pG~i~~L~pGe~i~~~~p~----~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~n 348 (495) T protein:vir:10 286 -------------GKRITGLNPGTLQYLQPGQEVKFSNPA----DVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVN 348 (495) T ss_pred -------------cccceecCCceeeecCCCCeeeeeCCC----CCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccccc Confidence 000011112 1222 244433222211 1233445667788999999999999999987 7999 Q ss_pred eeeeEEeccCceeEee--ch--hHHHHHHHHHhHHHHHHHHHhCCC-------hHHe--EEeecCcccccCCCCCHHH-H Q lcl|NC_021303. 363 HWSAWAIGDEDVQLHI--KP--VMDLICQAIYNDILTPLLAREGID-------PTKY--ILWYDASGLTSDPDLSDEA-V 428 (637) Q Consensus 363 HWsAW~I~dedVrlHI--~P--~me~ic~Ait~~~Lr~~L~~eGiD-------p~kY--vvw~DaS~Lt~dPD~tdeA-~ 428 (637) +.|+-+---|..+..- += .+..+|+-|++.||.-++..-.|+ ++.| +-|.=++-..+||-|--+| + T Consensus 349 YSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~ 428 (495) T protein:vir:10 349 YSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADL 428 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHH Confidence 9999988877766532 21 235689999999999888875553 3455 5688899999999986554 5 Q ss_pred HHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCC Q lcl|NC_021303. 429 EAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDE 508 (637) Q Consensus 429 ~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~ 508 (637) .+.+.|..|-+..-+..|. |+ +|-.+|.|.+.-.-+ . -++.|+..+.+.+......++ T Consensus 429 ~~i~~G~~s~~~~~a~~G~------D~--~~v~~q~a~e~~~~~-----~---------~Gl~~~~~p~~~~~~~~~~~~ 486 (495) T protein:vir:10 429 GDVRAGFAPISDKQAERGY------DM--EELFDMISDANQLID-----E---------YDLRLDSDPRYVNGSGAEQKS 486 (495) T ss_pred HHHHcCCCCHHHHHHHcCC------CH--HHHHHHHHHHHHHHH-----H---------cCCCCCCCCCcCCCccCCCCC Confidence 6788899998876655554 33 356666666652111 0 034444333222211111122 Q ss_pred CCCCCCCCC Q lcl|NC_021303. 509 DSGARQQRE 517 (637) Q Consensus 509 ~~~a~~g~E 517 (637) .+.+..++| T Consensus 487 ~~~~~~~~e 495 (495) T protein:vir:10 487 VMEAALNNE 495 (495) T ss_pred CCCCCCCCC Confidence 221222222 No 67 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=98.59 E-value=2e-08 Score=62.85 Aligned_cols=402 Identities=13% Similarity=0.088 Sum_probs=200.9 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |+-. .|+-|-|++=- ...+.--+....++..-..++..|- +.. .+-.++-+.-.+.-+++++|.+.+ T Consensus 1 ~~~~--~~~~~~k~~~~---~~~~~~~~~~~~~~~~~~~~~~~~v----~~~----~a~~~~~V~~ci~~ia~~ia~lp~ 67 (409) T protein:vir:96 1 MAKE--NIVTRIKKKLI---DNWIDQSASKLYDFSPWKNKSFWGV----INN----TLETNETIFSAITKLSNSMASLPL 67 (409) T ss_pred Cccc--cchhhhhhHHh---hhhhccccccccccccccCcccccc----chh----hHhhhHHHHHHHHHHHHhhhhCce Confidence 5433 33333333210 0001000111112211100111110 111 122445566667889999999887 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+=.. . . .+.+.++.+.=..--+-..++++.++.+|-+-|+.|+.+.-...|. + . . T Consensus 68 ~~~~~~~--------~--~----~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~------~-~ 125 (409) T protein:vir:96 68 KMYEDYK--------V--V----NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQ-P------S-K 125 (409) T ss_pred EEeeccc--------c--c----chhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCc-E------E-E Confidence 6644211 1 1 1234555444344557788999999999999999999876544443 1 2 2 Q ss_pred ceeeeHHHhcc---CCCcee--EEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_021303. 161 WYAVTREEIKS---KAGETA--EISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (637) Q Consensus 161 W~~vt~~Ei~~---k~g~~~--~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 235 (637) .+.+..+.+.. ..++.+ .+...+|..++|... =+|++=.+++.....--||+..+...+.-...+.+. +.. T Consensus 126 L~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~~--evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~--~~~ 201 (409) T protein:vir:96 126 LFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHNM--DMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF--NLT 201 (409) T ss_pred EEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEEccc--cEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHH--HHH Confidence 33343333331 222332 245556776666543 355554445555555668876655444322222211 111 Q ss_pred HhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_021303. 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (637) Q Consensus 236 ~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~E 315 (637) +-...+.+|+-.|+.+ .....+.+.+-|.+ .+++.+. ++++ ++. T Consensus 202 ~~~~~~~~i~~~~~~l-------------------------~~e~~~~~~~~~~~----~~~n~g~-----~~vl--~~g 245 (409) T protein:vir:96 202 EMQKPDSFMLKYGSNV-------------------------STEKRQQVLEDFKQ----YYEENGG-----ILFQ--EPG 245 (409) T ss_pred hcCCCceeEEecCCCC-------------------------CHHHHHHHHHHHHH----HhhcCCC-----eeec--CCC Confidence 1111111222222211 11233444444433 2332221 2222 322 Q ss_pred HhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHH Q lcl|NC_021303. 316 HLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILT 395 (637) Q Consensus 316 hi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr 395 (637) -+++.|.+.. .+.--+++|+..+..+|..+-|||..|=+..++|.-+.-|....=++-.|.|.+..|.++|++.+|- T Consensus 246 --~~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~ 322 (409) T protein:vir:96 246 --VEIEPLPKKY-VSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLT 322 (409) T ss_pred --ceEEEcCCCh-hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 4455555432 2233478899999999999999888664346778777777777778888999999999999987764 Q ss_pred HHHHHhCCChHHeEEeecCcccc-cCCC-CCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCc Q lcl|NC_021303. 396 PLLAREGIDPTKYILWYDASGLT-SDPD-LSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNP 473 (637) Q Consensus 396 ~~L~~eGiDp~kYvvw~DaS~Lt-~dPD-~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P 473 (637) ..+.+. .|-+-||.+.|. .|.- +.+-...++..|++|-.-.|+.+|++.-.+=| +.+. T Consensus 323 ----~~~~~~-g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD----~~~~----------- 382 (409) T protein:vir:96 323 ----KTDREK-NRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD----KPLI----------- 382 (409) T ss_pred ----cccccC-cceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcc----eeee----------- Confidence 223332 366788988873 3432 23333458889999999999999997544312 0000 Q ss_pred hhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_021303. 474 ELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTED 522 (637) Q Consensus 474 ~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted 522 (637) + ..+..++-+ .+.......|++...|- T Consensus 383 -------~---~n~~~~~~~------------~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 383 -------S---GDLYPIDTP------------LELRKSLKGGDKNVNES 409 (409) T ss_pred -------c---ccccccccc------------hhhcccccCCCCCcCCC Confidence 0 011111111 00000011111111110 No 68 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=98.58 E-value=1.9e-08 Score=62.93 Aligned_cols=477 Identities=15% Similarity=0.125 Sum_probs=198.5 Q ss_pred CCCC-cceEEecCCCCCcccccc---------------hheeh-hccccchhhhh-hhhcccccccchhhHHHHHhhhh- Q lcl|NC_021303. 1 MAAT-SLRVVRRPKGSAPAARRR---------------SLTAA-SQLITDPQKQM-KTSLMGTARNEWQSEAWDFSESI- 61 (637) Q Consensus 1 ma~~-~lr~vrrpk~~~p~~~r~---------------~ltAA-s~~~~~p~~~~-k~~~~g~~r~~WQ~eAW~~yd~V- 61 (637) |--- +||-+-++|...-..-+. .=.|+ ..+.+.-+... .-...+..+..|..+ .+|+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~--~~~~l~~ 78 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIR--NNQDLHG 78 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccC--ChhHHHH Confidence 5443 666666665553321111 11111 12221111111 111111111222222 133221 Q ss_pred --------hhHhhHhhhhhcceeeeEEEEeeeccc-c----------CCCCCcccCCCCcccchHHHHHHHhcc----Cc Q lcl|NC_021303. 62 --------GELSYYISWRANSCSRTTLIPSAIDPD-T----------GLPTGEVDIEEDPDAQIVADYVKGIAD----GP 118 (637) Q Consensus 62 --------gELryyvgWr~~s~Sr~rL~aseiD~D-t----------G~PtG~v~~e~~~~~~rv~~iv~~iAg----G~ 118 (637) +-|+=-+.=+++.++..= ...+++.+ - .+.+ . ++......+.++.+...- .+ T Consensus 79 l~~~~~~npiv~~~I~~~a~~ia~~~-~~~~~~~~~~~~~ir~k~~~~~~~---~-~~~~~~~~l~~~l~~pn~~~~p~~ 153 (547) T protein:vir:63 79 VLKKFGGNIILNAIINTRSNQVSMYC-KPARHSEKGVGFEVRLKDLDKKPT---S-HDEATIKRIESFIEKTGVDNDINR 153 (547) T ss_pred HHHHhhcCHHHHHHHHHHHHHHhhhh-hhhhhhccCCCceeEecccccccC---h-hhHHHHHHHHHHHHhhCCCCCCcc Confidence 222222222333333210 11111111 0 1111 0 111111223333332211 11 Q ss_pred ccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeeeHHHhcc---CCC-----ceeEEecCCCCccccc Q lcl|NC_021303. 119 LGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIKS---KAG-----ETAEISLPDGKTHEFN 190 (637) Q Consensus 119 lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~~---k~g-----~~~~i~lPdG~~he~~ 190 (637) .-..++++.++.++-+-|.+|+.+.-...|. + . .++.|...-|+. ..+ +..+...-+|...... T Consensus 154 ~s~~~f~~~lv~d~ll~Gn~~~~i~rd~~G~-~------~-~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~ 225 (547) T protein:vir:63 154 DSFSSFVKKIVRDTYMYDQVNFEKVFNRNQS-M------V-RFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATF 225 (547) T ss_pred chHHHHHHHHHHHHHhhCCEEEEEEECCCCc-E------E-EEEEecCceeEEEECCccccccCceEEEEEcCCcEEEEe Confidence 2346799999999999999998766544443 1 1 233333333321 111 1112222233222222 Q ss_pred CCCceEEEEecCC--cccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCC Q lcl|NC_021303. 191 RDLDSLVRIWNPR--PRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIP 268 (637) Q Consensus 191 ~~~d~l~RvW~P~--prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~p 268 (637) ...|++.-..||. +.....--||+.++...+.-..-..+...+..+.-..-.|||.+|....+ T Consensus 226 ~~~eiih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~l--------------- 290 (547) T protein:vir:63 226 NAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQ--------------- 290 (547) T ss_pred ccccEEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCC--------------- Confidence 2334442224443 33333456887777766665555555444444444444567777653211 Q ss_pred CcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCcchh-HHHHhhHHHHHHHHHhh Q lcl|NC_021303. 269 GAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVT-EVEIKTRIDAITRLAMG 347 (637) Q Consensus 269 g~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvt-evaiktR~daI~RlAmg 347 (637) ...+.+.|.+.+.. .+.-.+ -|--+||+.. +.++...+.-... .--+++|+..+..+|.- T Consensus 291 --------s~e~~~~lk~~~~~----~~~G~~-nagk~~vl~~------~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~a 351 (547) T protein:vir:63 291 --------SQHALEIFKREWKN----SLSGIN-GSWQIPVVSA------EDVKFVNMTPSARDMEFEKWLNYLINVISAL 351 (547) T ss_pred --------CHHHHHHHHHHHHH----HhcCcc-cccccccccC------CCceEEEcCCChhHHHHHHHHHHHHHHHHHH Confidence 11133444444322 222111 2333566532 2344444443333 33578899999999999 Q ss_pred cCCchhHhhcc-CCc----------ceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcc Q lcl|NC_021303. 348 LDVSPERLLGM-SKG----------NHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASG 416 (637) Q Consensus 348 lDv~pErLLGl-s~~----------NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~ 416 (637) .-|||.. ||+ +++ |+.++.+....-++.-|.|.+..|+++|+..+|.. .| ..|.+.||... T Consensus 352 fgVPP~~-lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L~~~----~~---~~~~~~f~~~~ 423 (547) T protein:vir:63 352 YGIDPAE-INIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAE----FG---DKYTFQFVGGD 423 (547) T ss_pred hCCCHHH-cCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----cC---CceEEEeeccc Confidence 9999975 555 332 34444555555677789999999999999988753 22 35888888654 Q ss_pred cccCCCCCHHHHHHHhcCCcCHHHHHHHhcCcc-ccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhc-----cccccc Q lcl|NC_021303. 417 LTSDPDLSDEAVEAHDRGAITSAALRRLLNVGE-DSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLS-----SQLAGI 490 (637) Q Consensus 417 Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~-d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~-----~~~~~i 490 (637) + .+.....++.....+|.+|-.-.|+.+|+.- ..|=|. .+ +|.-|..+..... ...+.- T Consensus 424 ~-~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~----~~----------~~~~~~~~~~~~~~~~~~~~~~~~ 488 (547) T protein:vir:63 424 I-KSELESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDI----PL----------NGVIVQRIGQLMQQEQFEHEKQQS 488 (547) T ss_pred c-ccHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCce----ee----------cccccccccccccccCCccccchh Confidence 3 2333333445667889999999999999953 122120 00 1110000000000 000000 Q ss_pred cCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccCCCCcchHHHHHHHHHHHHHHHHhcccccCCCchhhhh Q lcl|NC_021303. 491 EFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRFKVNDAALKT 567 (637) Q Consensus 491 e~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~a~~~~~a~~~aa~~llV~rALelAGkRr~~~~~~~~~~ 567 (637) .++++..+....+..+++.++++..+..+.+++... .+..... |.+. |++ +.-+.+.++ T Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~------~~~~~~~---------~~~~-~~~--~~~~~~~~~ 547 (547) T protein:vir:63 489 NLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQR------KDKDNAN---------AGKQ-GMK--GDKPNDWQT 547 (547) T ss_pred hccccccccCCCCCCCCCCCCCCcccCCCcCccccc------cCccccc---------hhhh-hcC--CCCccccCC Confidence 011111111000111111111111111111111111 1111101 1111 122 111222222 No 69 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.57 E-value=1.4e-08 Score=63.65 Aligned_cols=484 Identities=12% Similarity=0.103 Sum_probs=201.8 Q ss_pred CCCC-cceEEec-------CCCCCc--ccccc----hheehhccccchhh-----hhhhhcccccccchhhHHHHHhhhh Q lcl|NC_021303. 1 MAAT-SLRVVRR-------PKGSAP--AARRR----SLTAASQLITDPQK-----QMKTSLMGTARNEWQSEAWDFSESI 61 (637) Q Consensus 1 ma~~-~lr~vrr-------pk~~~p--~~~r~----~ltAAs~~~~~p~~-----~~k~~~~g~~r~~WQ~eAW~~yd~V 61 (637) |+-- .-|+.+. .|+..- ++.-. .+.+--...+.|.- ..+.......++. =.+.+.++ T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~np---iv~~~I~~- 102 (576) T protein:vir:96 27 IDDGLQANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNNP---ILNAIILT- 102 (576) T ss_pred cccChhHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhhcCH---HHHHHHHH- Confidence 1000 0000000 011000 00000 00000001111110 0000000000111 13344444 Q ss_pred hhHhhHhhhhhcceee-----------eEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhcc--C--cccHHHHHH Q lcl|NC_021303. 62 GELSYYISWRANSCSR-----------TTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIAD--G--PLGQAALIK 126 (637) Q Consensus 62 gELryyvgWr~~s~Sr-----------~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAg--G--~lGqaqLlk 126 (637) +++++|. +-|.+.-.+.+ +.++ .++-.....+...+....- . +.--.++++ T Consensus 103 ---------ia~~vA~~~~~~~~~~~~~~~~i~lk~~~-~~~~----~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~ 168 (576) T protein:vir:96 103 ---------RSNQVAMYCQPSRYNERGLGFEVRMRDLD-AEPG----KKEKEEIKRIENFILNTGRDKDIDRDSFQSFCR 168 (576) T ss_pred ---------HHHHHHhhhhhhhhccccccceeEEecCc-Cccc----hhhhHhhhhHHhhHhhccCCCCCccccHHHHHH Confidence 4444442 11222223333 3333 1221112233333332222 1 123457999 Q ss_pred HHHhhhcccccEEEEEEeecCCc-cccccccccccceeeeHHHhcc---CCC-----ceeEEecCCCCcccccCCCceEE Q lcl|NC_021303. 127 RAVECMTVVGEVWIAVLIRQEKD-PVTGLAAPRARWYAVTREEIKS---KAG-----ETAEISLPDGKTHEFNRDLDSLV 197 (637) Q Consensus 127 r~~~~LtVpGE~wi~il~r~~~~-~~~~~~~~~~~W~~vt~~Ei~~---k~g-----~~~~i~lPdG~~he~~~~~d~l~ 197 (637) .++.+|-+-|.+|+.++...++. .+. ..+.|...-|.. ..| ........+|.........|+++ T Consensus 169 ~lv~dlll~Gna~~~i~~~rd~~g~~~-------~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~dii~ 241 (576) T protein:vir:96 169 KIVRDTYTYDQVNFEKVFNKKNATTMD-------KFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREMAM 241 (576) T ss_pred HHHHHHHhcCCeEEEEEEecCCCCceE-------EEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccceEE Confidence 99999999999999887644431 111 233333333321 111 11222333444433333457777 Q ss_pred EEecCCcccc--cCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccC Q lcl|NC_021303. 198 RIWNPRPRKA--SQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEV 275 (637) Q Consensus 198 RvW~P~prra--~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~ 275 (637) ++.+|.+... ..--||+.++...+.-..-+.+...+..+.-..-.|||.+|....+- T Consensus 242 ~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls--------------------- 300 (576) T protein:vir:96 242 GIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQS--------------------- 300 (576) T ss_pred EeecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCC--------------------- Confidence 7778776432 34579999888888777777777777666666777888877532211 Q ss_pred CCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCcch-hHHHHhhHHHHHHHHHhhcCCchhH Q lcl|NC_021303. 276 SGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEV-TEVEIKTRIDAITRLAMGLDVSPER 354 (637) Q Consensus 276 ~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dv-tevaiktR~daI~RlAmglDv~pEr 354 (637) ....+.|.+.+. .++...+ -+--+|+|+. +.++...+.... +.--+++|+-.+..+|...=|||.. T Consensus 301 --~e~~~~lr~~~~----~~~~G~~-nag~~p~vl~------~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~ 367 (576) T protein:vir:96 301 --QRALENFKREWK----SSFSGIN-GSWQVPVVMA------DDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAE 367 (576) T ss_pred --HHHHHHHHHHHH----HHhcccc-ccccceeecC------CCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHH Confidence 113344544443 2232211 1223566643 234444444333 3344899999999999999999986 Q ss_pred hhccC-Cc-----------ceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCC Q lcl|NC_021303. 355 LLGMS-KG-----------NHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPD 422 (637) Q Consensus 355 LLGls-~~-----------NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD 422 (637) | |+. ++ |+-++-+....-++--|.|.+..|+++|+..+|.. .| .+|.++||-..+. T Consensus 368 l-G~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~----~~---~~~~~~f~r~d~~---- 435 (576) T protein:vir:96 368 I-GFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISE----YS---DKYVFQFVGGDTK---- 435 (576) T ss_pred c-cccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh----cc---CceEEEeccCCHH---- Confidence 6 773 33 55556666666777779999999999999988754 22 4688888754332 Q ss_pred CCHHHHH---HHhcCCcCHHHHHHHhcCccccCCCC--CchHHHHHHHHHHhcCCchhH--HHHHhhhccccccccCCCC Q lcl|NC_021303. 423 LSDEAVE---AHDRGAITSAALRRLLNVGEDSGYDL--TTLDGCREFAADVVTKNPELI--AMYAPLLSSQLAGIEFPQP 495 (637) Q Consensus 423 ~tdeA~~---a~drGaIt~eAlrr~lgl~~d~~yd~--~t~eg~r~~A~d~v~~~P~Li--~~~apLl~~~~~~ie~P~p 495 (637) ...++.+ ...+|.+|-.-.|+.+|++.-.|=|. ...-.....+.....++..-- ...-++.. .+++- -|.+ T Consensus 436 ~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~-~~~~~-~~~~ 513 (576) T protein:vir:96 436 SELDKIKILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQ-FLNSP-DDEE 513 (576) T ss_pred HHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCcccccccccccc-ccCCC-CCCC Confidence 2223322 34569999999999999975433120 000000000000000000000 00000000 00000 0000 Q ss_pred cCCCCCCCCCCCCCCCCCCC-CCccCCCCCCCcccCCCCcchHHHHHHHHHHHHHHHHhcccccCCCchhhhhHhhcC Q lcl|NC_021303. 496 ANAIESTREEDDEDSGARQQ-REPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRFKVNDAALKTKLRDV 572 (637) Q Consensus 496 ~~a~~~~~~~~d~~~~a~~g-~EPdted~~~~~~~a~~~~~a~~~aa~~llV~rALelAGkRr~~~~~~~~~~rlr~i 572 (637) + ..++.++..+..+..+.+ .+.+.--+..--......+.-. .-.+..+- ||- .-.+.+-+ .- T Consensus 514 ~-~~~s~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~-------~~~--~~~~~~~~----~~ 576 (576) T protein:vir:96 514 P-QQESTEDKVDGRESNDPTKIDSPVGTDGQLKDQDNVKSQEG-SNKGQGTK-------GKG--NEKPSDFK----NN 576 (576) T ss_pred C-CCCCCCCcccccccccCCCCCCccccccccCCCCccccccc-cccccccc-------ccC--CCCccccc----CC Confidence 0 000000000110111111 1110000000000111110000 00111110 110 01111111 00 No 70 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=98.54 E-value=1.2e-07 Score=58.61 Aligned_cols=412 Identities=13% Similarity=0.113 Sum_probs=186.6 Q ss_pred CCC--CcceEEecCCCCCc------------ccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhh-----h Q lcl|NC_021303. 1 MAA--TSLRVVRRPKGSAP------------AARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSES-----I 61 (637) Q Consensus 1 ma~--~~lr~vrrpk~~~p------------~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~-----V 61 (637) |-- |+.-.| .+|+.-- ..-.|++. -+..+. +.|-+...| |+...-..|.. + T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~f~~~e~r~~~---~~~~~~-~~~~~~~~~-----~~~~~~~~~~~~~al~~ 70 (441) T protein:vir:98 1 MHWYNTDCYFV-DFKSRKQSRKELVVVGIFYKNEKRDLQ---YNEDDL-QMMVQTLPG-----FQGTKLRQYKDIEAIRH 70 (441) T ss_pred CceecCcccee-ccccccchhhhhhcccccccccccccc---CCCcch-HHHHHHhhc-----ccccCccccchhhhhcc Confidence 100 000000 0111100 00011111 111111 222111111 11111111111 1 Q ss_pred hhHhhHhhhhhcceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEE Q lcl|NC_021303. 62 GELSYYISWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIA 141 (637) Q Consensus 62 gELryyvgWr~~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~ 141 (637) +-+.-.|.-++++++.+.|..-+ + |. +. . .+.+..+++.=...-+-..++++.++.+|.+-|++|+. T Consensus 71 ~~V~acv~~Ia~~iA~lpl~~~~---~-~~----~~-~----~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~ 137 (441) T protein:vir:98 71 SDIFTAVMMIASDLARMPIRVTV---N-GQ----IN-Y----SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIE 137 (441) T ss_pred HHHHHHHHHHHHhhccCceEEec---C-Cc----cc-c----cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEE Confidence 22333477788999998876632 3 22 22 1 23455666555666788889999999999999999988 Q ss_pred EEeecCCccccccccccccceeeeHHHhc---cCCCceeEE-ecCCC--CcccccCCCceEEEEecCCcccccCCccchh Q lcl|NC_021303. 142 VLIRQEKDPVTGLAAPRARWYAVTREEIK---SKAGETAEI-SLPDG--KTHEFNRDLDSLVRIWNPRPRKASQATSPVR 215 (637) Q Consensus 142 il~r~~~~~~~~~~~~~~~W~~vt~~Ei~---~k~g~~~~i-~lPdG--~~he~~~~~d~l~RvW~P~prra~eaDSPvr 215 (637) |.-...|.+ .+=| .+..+.+. ...|...+. ...+| ...+..-..+-||++=. ++-.-..--||+. T Consensus 138 i~r~~~G~~-------~~L~-~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~dviHir~-~~~dg~~G~spi~ 208 (441) T protein:vir:98 138 ITRDKTGEP-------MNLT-FRKTSEIELKLDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKF-YSLDGINGLSLLD 208 (441) T ss_pred EEEcCCCcE-------EEEE-EEcCceeEEEECCCCcEEEEEEEeccCcceeeEEEccccEEEecc-CCCCCccccCHHH Confidence 654434431 1112 22222221 112222221 11122 21111111223444411 1222233456666 Q ss_pred hhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhc Q lcl|NC_021303. 216 ACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAA 295 (637) Q Consensus 216 a~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aa 295 (637) .+.+.+.--.-..+...+..+.-..-.|||-+|+.++=+ -+.+.+.+-+ +.+ T Consensus 209 ~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~------------------------e~~~~~~~~~----~~~ 260 (441) T protein:vir:98 209 TLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNK------------------------KARDRAREEF----HKS 260 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCH------------------------HHHHHHHHHH----HHH Confidence 555555433333344333333334445677776633210 1222333222 233 Q ss_pred ccCccccccccceeEeechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCcee Q lcl|NC_021303. 296 MEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQ 375 (637) Q Consensus 296 i~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVr 375 (637) +.-.+.+ -=|+|+ ++. -+++.|.+..+..+ -+++|+-.+..+|...-|||.. ||+++.| .+.-|.+-.-++ T Consensus 261 ~~G~~na--g~~~vl--~~g--~~~~~l~~~~~d~q-~~e~r~~~~~~Ia~~fgVPp~~-lg~~~~~-~s~~q~~~~y~~ 331 (441) T protein:vir:98 261 FSGTKQA--GKVVVL--DES--MTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLHK-FGIETAN-MSITDANLDYLS 331 (441) T ss_pred hcCcccc--Ccceec--CCC--ceEEEccCChhHHH-HHHHHHHhHHHHHHHhCCCHHH-cCCCCCC-ccHHHHHHHHHH Confidence 3321111 122333 332 35666666544333 3789999999999999999886 4875443 232333222233 Q ss_pred EeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccC Q lcl|NC_021303. 376 LHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSG 452 (637) Q Consensus 376 lHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~ 452 (637) .|.|.+..|+++|+..++.. . ..|-+.||.+.|. ..|..+.+ ..++..|.+|-.-.|+.+|++.-.| T Consensus 332 -tl~P~~~~ie~~ln~~L~~~----~----~~~~~~fd~~~ll-r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~g 401 (441) T protein:vir:98 332 -TLKPYITCVCAELNFKFNDE----Y----VNREFKFDTTEIR-VVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPG 401 (441) T ss_pred -HHHHHHHHHHHHHHhhcccc----c----cCceEEEechhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 69999999999999876521 1 2455799999984 35554443 3477889999999999999975433 Q ss_pred CCCCchHHHHHHHHHHhcCCchhHHHHHhhhccccccccCC-CCcCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_021303. 453 YDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFP-QPANAIESTREEDDEDSGARQQREPQ 519 (637) Q Consensus 453 yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P-~p~~a~~~~~~~~d~~~~a~~g~EPd 519 (637) =| ... .+.| .....++.. +.+.. ..+..+ ..-..|++-+ T Consensus 402 Gd--~~~------------------~~~~---~n~~~~~~~~~~q~~--~~~~~~---~~~kgGe~ne 441 (441) T protein:vir:98 402 GN--GSI------------------HRVD---LNHVNIELVDEYQMN--KSRATD---KKLKGGEENE 441 (441) T ss_pred CC--cce------------------Eeec---ccccccccccccccc--cccccc---cccCCCCCCC Confidence 22 000 0000 011111111 00000 000000 0001111111 No 71 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=98.54 E-value=4.2e-08 Score=61.02 Aligned_cols=446 Identities=13% Similarity=0.105 Sum_probs=199.9 Q ss_pred CCCCcceEEecCCCCCcccccchheehh--ccccchhhhhhhhcc--cccccchhhH-------HHHHhhhhhhHhhHhh Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAAS--QLITDPQKQMKTSLM--GTARNEWQSE-------AWDFSESIGELSYYIS 69 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs--~~~~~p~~~~k~~~~--g~~r~~WQ~e-------AW~~yd~VgELryyvg 69 (637) |.-+....++.|.+..+ ++.+.+.- +... ++.++++.. ++.++.+... |.+++.--|=.+=+++ T Consensus 1 ~~~p~~~~~~~~~~~~~---~~~~~~y~~~a~~~--~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~ 75 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTS---LREYAGYHGGGSGF--GGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQ 75 (533) T ss_pred CCCchhhhhhcccccch---HHHHHhhhhccCCC--CCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 44444333333333222 22222210 1111 123333322 2233333332 3333333332222222 Q ss_pred -hhhcceeeeEEEEeeeccccCCCC---CcccCCCCcccchHHHHHHHh-------------ccCcccHHHHHHHHHhhh Q lcl|NC_021303. 70 -WRANSCSRTTLIPSAIDPDTGLPT---GEVDIEEDPDAQIVADYVKGI-------------ADGPLGQAALIKRAVECM 132 (637) Q Consensus 70 -Wr~~s~Sr~rL~aseiD~DtG~Pt---G~v~~e~~~~~~rv~~iv~~i-------------AgG~lGqaqLlkr~~~~L 132 (637) |..|-|.- -|-+. -.|. -+++. .....+.+.++.. +.|.+--.+|.+.+...+ T Consensus 76 ~~~~nvVG~------Gi~~~-~~p~~~~lg~~~---~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~ 145 (533) T protein:vir:34 76 LHQDHIVGS------FFRLS-HRPSWRYLGIGE---EEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMH 145 (533) T ss_pred HHHHHhhCC------Cceee-eccchhhcCCCh---hHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHH Confidence 22222221 11111 0110 01111 1123445555444 569999999999999999 Q ss_pred cccccEEEEEEeecCCccccccccccccceeeeHHHhccC----CCceeEEecCCCCcccccC----------------- Q lcl|NC_021303. 133 TVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIKSK----AGETAEISLPDGKTHEFNR----------------- 191 (637) Q Consensus 133 tVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~~k----~g~~~~i~lPdG~~he~~~----------------- 191 (637) -+-||+.+.+..++.++.+.+. +=..|..+-|.+. ++..+- + +-|||. T Consensus 146 ~~dGE~f~~~~~~~~~g~~~~~-----~lq~ie~d~l~~~~~~~~~~~i~----~--GIe~d~~Gr~~aY~i~~~~~~~~ 214 (533) T protein:vir:34 146 AFNGELFVQATWDTSSSRLFRT-----QFRMVSPKRISNPNNTGDSRNCR----A--GVQINDSGAALGYYVSEDGYPGW 214 (533) T ss_pred HhCCceEEEeeeccCCCCccce-----EEEEechhhcCCCCCCCCCCceE----e--eeEECCCCCeEEEEEeecCCCCc Confidence 9999999998887765321111 1123334444321 111110 0 112222 Q ss_pred --------------CCceEEEEecCCcccccC--CccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCcee-eecccCCCC Q lcl|NC_021303. 192 --------------DLDSLVRIWNPRPRKASQ--ATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVL-FVPAEMSLP 254 (637) Q Consensus 192 --------------~~d~l~RvW~P~prra~e--aDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvl-fvPqe~slP 254 (637) ...-|+|+++|. |.-| --|..-++|..|+. +.++..+......+.+-+. ||=+. .| T Consensus 215 ~~~~~~~~~~~~~v~a~~VlH~f~~~--r~gQ~RGis~lapvl~~l~~---l~~y~dael~~a~i~A~~a~fi~~~--~~ 287 (533) T protein:vir:34 215 MPQKWTWIPRELPGGRASFIHVFEPV--EDGQTRGANVFYSVMEQMKM---LDTLQNTQLQSAIVKAMYAATIESE--LD 287 (533) T ss_pred cccccceeeeeeccChhHeeeecccc--CCCcccCCchHHHHHHHHHH---HHHHHHHHHHHHHHhhhheeeeecC--CC Confidence 123456666554 2222 12444444554444 4445555555555555443 33221 11 Q ss_pred CcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccc--eeEeechHHhcccceeecCcchhHH Q lcl|NC_021303. 255 AAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIP--LVASVAAEHLEKVQHIKFGNEVTEV 332 (637) Q Consensus 255 ~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vP--iva~vP~Ehi~~ikHlkf~~dvtev 332 (637) ......+ .....+....+.+....- .... .+.+-.-.+-| |+---|||-|+-++-= .-+.- T Consensus 288 ~~~~~~~----------~~~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~l~pG~i~~L~pGe~i~~~~~~----~p~~~ 350 (533) T protein:vir:34 288 TQSAMDF----------ILGANSQEQRERLTGWIG--EIAA-YYAAAPVRLGGAKVPHLMPGDSLNLQTAQ----DTDNG 350 (533) T ss_pred ccccccc----------ccCCCcccccccccccch--hhhh-ccCcceeeccCceeeecCCCCeeeecCCC----CCCCC Confidence 1110000 000001111111111000 0000 00100001111 1222344422222111 12223 Q ss_pred HHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeec---hhHHHHHHHHHhHHHHHHHHHhCCC---- Q lcl|NC_021303. 333 EIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIK---PVMDLICQAIYNDILTPLLAREGID---- 404 (637) Q Consensus 333 aiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~---P~me~ic~Ait~~~Lr~~L~~eGiD---- 404 (637) ...+.+..++.+|+||.||-|.|+|= |++|+.|+-+-.-|..+..-. =.+..+|+-|++.||.-++..-.|+ T Consensus 351 ~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~ 430 (533) T protein:vir:34 351 YSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSK 430 (533) T ss_pred HHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCc Confidence 34577788999999999999999997 899999987665554332100 1345679999999999888764443 Q ss_pred --------hHHe--EEeecCcccccCCCCCHHH-HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCc Q lcl|NC_021303. 405 --------PTKY--ILWYDASGLTSDPDLSDEA-VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNP 473 (637) Q Consensus 405 --------p~kY--vvw~DaS~Lt~dPD~tdeA-~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P 473 (637) ++.| +.|+=.+-..+||-|--+| +...+.|..|-+...+..|.+ + +|-.+|.|.+.-..+ T Consensus 431 ~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~D------~--~ev~~q~a~e~~~~~- 501 (533) T protein:vir:34 431 ARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGDD------Y--QEIFAQQVRETMERR- 501 (533) T ss_pred cCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCC------H--HHHHHHHHHHHHHHH- Confidence 2455 7999999999999986554 667888999998777666544 3 345555555542210 Q ss_pred hhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccCC Q lcl|NC_021303. 474 ELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEAAS 531 (637) Q Consensus 474 ~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~a~ 531 (637) . .++.+|..+...+......++ ++|.++ . .++ T Consensus 502 ----~---------~gl~~~~~~~~~~~s~~~~~~-------~~~~~~-~-----~~~ 533 (533) T protein:vir:34 502 ----A---------AGLKPPAWAAAAFESGLRQST-------EEEKSD-S-----RAA 533 (533) T ss_pred ----h---------cCCCCCCCCCcCccCCCCCCC-------CCCccc-C-----CCC Confidence 0 033444332211111111111 111111 1 111 No 72 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=98.54 E-value=2.9e-07 Score=56.39 Aligned_cols=463 Identities=14% Similarity=0.063 Sum_probs=211.8 Q ss_pred CCCCcceEEecCCCCCcccccchheehhcccc--chhhhhhhhccc--ccccchhh-------HHHHHhhhhhhHhhHhh Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLIT--DPQKQMKTSLMG--TARNEWQS-------EAWDFSESIGELSYYIS 69 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~--~p~~~~k~~~~g--~~r~~WQ~-------eAW~~yd~VgELryyvg 69 (637) |--+..|.+.......| ++++........- .-++.++++... +.++++.. +|-+++.--|=.+=++. T Consensus 1 m~~~~~r~~~~~a~~~~--~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~ 78 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRP--EQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVG 78 (553) T ss_pred Ccchhhhhhcccccccc--hhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 76666666654444433 2223322211110 012233333222 22223221 22233322222222222 Q ss_pred -hhhcceee-eEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHh-------------ccCcccHHHHHHHHHhhhcc Q lcl|NC_021303. 70 -WRANSCSR-TTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGI-------------ADGPLGQAALIKRAVECMTV 134 (637) Q Consensus 70 -Wr~~s~Sr-~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~i-------------AgG~lGqaqLlkr~~~~LtV 134 (637) +..|-|.- .++- +.+|.. ..+ +++ ......+++.|+.. +.|.+--.+|.+.+...+-+ T Consensus 79 ~~~~nvVG~Gi~~~-~~~~~~-~l~--g~~---~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~ 151 (553) T protein:vir:63 79 YQRDSIVGAQYRLN-SMPDIN-VIP--GAT---EEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVK 151 (553) T ss_pred HHHHhhccCCceee-eccchh-hhc--CCC---HHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHh Confidence 22232221 1111 111110 000 111 11223344444332 45888888999999999999 Q ss_pred cccEEEEEEeecCCccccccccccccceeeeHHHhccCCCceeEEecCCC----CcccccC-CCceEEEEecCCccccc- Q lcl|NC_021303. 135 VGEVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIKSKAGETAEISLPDG----KTHEFNR-DLDSLVRIWNPRPRKAS- 208 (637) Q Consensus 135 pGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~~k~g~~~~i~lPdG----~~he~~~-~~d~l~RvW~P~prra~- 208 (637) -||+.+.+..++.++.+.+. .=..|..+-|.+.... |+| .+-|||. +.-+-.+|++-||.... T Consensus 152 dGE~~~~~~~~~~~~~~~~~-----~lq~ie~drl~~~~~~------~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~ 220 (553) T protein:vir:63 152 TGEVLATAEWDRAANRPYAT-----CFQMVSTDRLSNPYQQ------LDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQ 220 (553) T ss_pred CCceEEEeeeccCCCCcccc-----eEEEechhhcCCCCCC------CCCCeeEeeeEECCCCceEEEEeeccCCCcccc Confidence 99999998877654322111 1134555555421111 122 2224444 23344555555554321 Q ss_pred --------------------------------CC--ccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCcee-eecccCCC Q lcl|NC_021303. 209 --------------------------------QA--TSPVRACLETLREIERTTRKIKNAAKSRVMNNGVL-FVPAEMSL 253 (637) Q Consensus 209 --------------------------------ea--DSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvl-fvPqe~sl 253 (637) |. -|.--++ |.-|..+.++..+......+++-+. ||=+.. T Consensus 221 ~~~~~~~~~r~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapv---l~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~-- 295 (553) T protein:vir:63 221 MAPDMYKWKFVQQSKPWGRRQVIHILEPREPDQSRGIADIVSG---LKDMRMAKRFKEMSLQNAVINASYAAAIESEL-- 295 (553) T ss_pred ccccccceeeeccccccChhHheecccccCCCcccCCchHHHH---HHHHHHHhHHHHHHHHHHHHhhhheeeeecCC-- Confidence 11 1222333 4444455556666666666665554 554322 Q ss_pred CCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCcccc-ccccceeEeechHHhcccceeecCcchhHH Q lcl|NC_021303. 254 PAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQ-AAYIPLVASVAAEHLEKVQHIKFGNEVTEV 332 (637) Q Consensus 254 P~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~-AA~vPiva~vP~Ehi~~ikHlkf~~dvtev 332 (637) |..+.....+.....|.+..-. ....+.+.. ...-....+. ..+ |+---|||-|+-++.=+ -+.- T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~------~~~~~~~~~l~pG~--i~~L~pGe~i~~~~p~~----p~~~ 361 (553) T protein:vir:63 296 PPEFIHSQMSGGSPNADMVGIF--GKYMDALKA------YVGGANNIQIDGAK--IPHLFPGTKLNLKPMGT----PGGV 361 (553) T ss_pred Chhhhhhhcccccccccccccc--ccccccccc------ccccccceeecCce--eeecCCCCeeeecCCCC----CCCC Confidence 2222111111000001111000 000000000 0000000010 011 12223444332222211 1223 Q ss_pred HHhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEee---chhHHHHHHHHHhHHHHHHHHHhCCC---- Q lcl|NC_021303. 333 EIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHI---KPVMDLICQAIYNDILTPLLAREGID---- 404 (637) Q Consensus 333 aiktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI---~P~me~ic~Ait~~~Lr~~L~~eGiD---- 404 (637) -..+-+-.++.+|+||.||-|.|+|= |++|+.|+-+---|.-+..- .=+...+|+-|++.||.-++..--|+ T Consensus 362 ~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~ 441 (553) T protein:vir:63 362 GSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPG 441 (553) T ss_pred HHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCc Confidence 34566888999999999999999997 89999998665554433211 01346789999999999888766554 Q ss_pred -----------hHHe--EEeecCcccccCCCCCHHH-HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhc Q lcl|NC_021303. 405 -----------PTKY--ILWYDASGLTSDPDLSDEA-VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVT 470 (637) Q Consensus 405 -----------p~kY--vvw~DaS~Lt~dPD~tdeA-~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~ 470 (637) +..| +-|+=+.---+||-|--+| +.+.+.|.-|-+...+..|.+- ++-.+|+|.+... T Consensus 442 ~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G~D~--------~~v~~q~a~e~~~ 513 (553) T protein:vir:63 442 QTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLGGDF--------RKSFAQRAREDAL 513 (553) T ss_pred ccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCH--------HHHHHHHHHHHHH Confidence 2234 6799999999999996655 5677889999987776666543 3556677766521 Q ss_pred CCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCC Q lcl|NC_021303. 471 KNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERS 525 (637) Q Consensus 471 ~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~ 525 (637) .+ . .++.|+..+.. +.+...+.+.+.++++..++++++.- T Consensus 514 ~~-----~---------~Gl~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 514 LK-----K---------YGLTFNLSAKR-SLGDGRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred HH-----H---------cCCCCCCCCcc-ccCCCcccCCCCCCCCCCCCcccccC Confidence 11 1 13333332221 11111111111222223333333211 No 73 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.54 E-value=1.4e-07 Score=58.16 Aligned_cols=446 Identities=13% Similarity=0.143 Sum_probs=191.2 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchh---------hhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhh Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQ---------KQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWR 71 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~---------~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr 71 (637) +|.++= ...++-+.....++|+ ..++.-. ++ .-|+--+.=+ T Consensus 55 ~a~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~----~n-------------~i~~~~I~t~ 104 (563) T protein:vir:99 55 QAYAEP-------------FIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFG----NN-------------PILNAIILTR 104 (563) T ss_pred Ccchhh-------------hHhhhcccccccccccCCCCcccHHHHHHHhh----cc-------------hHHHHHHHHH Confidence 222210 1111222222111111 1111100 01 1111111111 Q ss_pred hcce---------------eeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccC----cccHHHHHHHHHhhh Q lcl|NC_021303. 72 ANSC---------------SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADG----PLGQAALIKRAVECM 132 (637) Q Consensus 72 ~~s~---------------Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG----~lGqaqLlkr~~~~L 132 (637) ++.+ -.++|+-...+ ++ .++-...+++..++..+.-- ..--.++++.++.++ T Consensus 105 ~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~-----~~----~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~l 175 (563) T protein:vir:99 105 SNQVAMYCQPARYSEKGLGFEVRLRDLDAE-----PG----RKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDT 175 (563) T ss_pred HHHHHHHhhhhhhhcccccceeEEeecCCC-----cc----hhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHH Confidence 1111 13444432222 21 12211223333333333221 123458999999999 Q ss_pred cccccEEEEEEe-ecCCccccccccccccceeeeHHHhcc-CCCceeEEecCCCCcccccCCCceEEEEecCCcccc--c Q lcl|NC_021303. 133 TVVGEVWIAVLI-RQEKDPVTGLAAPRARWYAVTREEIKS-KAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKA--S 208 (637) Q Consensus 133 tVpGE~wi~il~-r~~~~~~~~~~~~~~~W~~vt~~Ei~~-k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra--~ 208 (637) -+-|.+++.++. |.+.+.+.+...-...+..+..+.-.. ......+....+|.........++|+++.+|.+... . T Consensus 176 ll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~ 255 (563) T protein:vir:99 176 YIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSG 255 (563) T ss_pred HhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCcccCc Confidence 999999886654 543322222211111222222111100 111222334445655443445677778888776432 2 Q ss_pred CCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHH Q lcl|NC_021303. 209 QATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMI 288 (637) Q Consensus 209 eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml 288 (637) .--||+.++...+.=..-+.+...+..+.-..-.|||-+|....+ +....+.|.+.+ T Consensus 256 ~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~l-----------------------s~e~~~~~~~~~ 312 (563) T protein:vir:99 256 YGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQ-----------------------SQHALENFKREW 312 (563) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCC-----------------------CHHHHHHHHHHH Confidence 356888877777765555566665555655666777777653211 111344454444 Q ss_pred HHHHhhcccCccccccccceeEeechHHhcccceeecCcchhHH-HHhhHHHHHHHHHhhcCCchhHhhccC-Ccceeee Q lcl|NC_021303. 289 YQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMS-KGNHWSA 366 (637) Q Consensus 289 ~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvtev-aiktR~daI~RlAmglDv~pErLLGls-~~NHWsA 366 (637) .+ ++... .-+--+|+|+ + +.++...+.....+. -+++|+..+..+|...-|||..| |+. ++++++. T Consensus 313 ~~----~~~G~-~nagk~~~vl--~----~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G~~~~~~~~~~ 380 (563) T protein:vir:99 313 KS----SLSGI-NGSWQIPVVM--A----DDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEI-GFPNRGGATGS 380 (563) T ss_pred HH----Hhccc-cccccceEEc--C----CCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-ccccccccccc Confidence 32 22211 1122355554 2 234444444443343 48999999999999999999765 874 4555432 Q ss_pred -E----------EeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHH---HHHh Q lcl|NC_021303. 367 -W----------AIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAV---EAHD 432 (637) Q Consensus 367 -W----------~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~---~a~d 432 (637) | +....-++.-|.|.+..|+++|+..+|.. .| .+|.+.|+-. |.....++. .+.. T Consensus 381 ~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~----~~---~~~~~~f~r~----D~~~~~e~~~~~~~~~ 449 (563) T protein:vir:99 381 KGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISE----YG---DKYTFQFVGG----DTKSATDKLNILKLET 449 (563) T ss_pred ccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh----cc---cccEEEeccC----CHHHHHHHHHHHHHhc Confidence 2 22233566679999999999999988864 22 4688888544 332233332 2467 Q ss_pred cCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhcccc--------------ccccCCCCc-C Q lcl|NC_021303. 433 RGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQL--------------AGIEFPQPA-N 497 (637) Q Consensus 433 rGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~--------------~~ie~P~p~-~ 497 (637) .|.+|-.-.|+.+|+.--.|=|. . -.|--+..+..+..... +...-|... . T Consensus 450 ~G~lT~NE~R~~~gl~Pi~gGD~-------------~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (563) T protein:vir:99 450 QIFKTVNEAREEQGKKPIEGGDI-------------I-LDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSE 515 (563) T ss_pred CCccCHHHHHHHhCCCCCCCcce-------------e-ecccccccccccccccCCCccccchhhhhcccccCCCCCCCC Confidence 79999999999999975433120 0 00000000000000000 000000000 0 Q ss_pred CCCCCCCCCCCCCCCCCCCCccCCCCCCCcccCCCCcchHHHHHHHHHHHHHHHHhcccccCCCchhhhh Q lcl|NC_021303. 498 AIESTREEDDEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRFKVNDAALKT 567 (637) Q Consensus 498 a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~a~~~~~a~~~aa~~llV~rALelAGkRr~~~~~~~~~~ 567 (637) ..++.+..+++ +++|..-..++++..... ..+... =|.+-.+.++ -+. T Consensus 516 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~--~~~~~~---------------~~~~~~~~~~--~~~ 563 (563) T protein:vir:99 516 EGQSTDSSNDD---KEIGTDAQIKGDDNVYRT--QTSNKG---------------QGRKGEKSSD--FKH 563 (563) T ss_pred CCCCCCCCCCc---cccccccccccccccccc--cCcccc---------------ccccCcCccc--ccC Confidence 00000000000 001100000000000000 000000 0111111111 111 No 74 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.54 E-value=1.4e-07 Score=58.16 Aligned_cols=446 Identities=13% Similarity=0.143 Sum_probs=191.2 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchh---------hhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhh Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQ---------KQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWR 71 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~---------~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr 71 (637) +|.++= ...++-+.....++|+ ..++.-. ++ .-|+--+.=+ T Consensus 55 ~a~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~----~n-------------~i~~~~I~t~ 104 (563) T protein:vir:95 55 QAYAEP-------------FIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFG----NN-------------PILNAIILTR 104 (563) T ss_pred Ccchhh-------------hHhhhcccccccccccCCCCcccHHHHHHHhh----cc-------------hHHHHHHHHH Confidence 222210 1111222222111111 1111100 01 1111111111 Q ss_pred hcce---------------eeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccC----cccHHHHHHHHHhhh Q lcl|NC_021303. 72 ANSC---------------SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADG----PLGQAALIKRAVECM 132 (637) Q Consensus 72 ~~s~---------------Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG----~lGqaqLlkr~~~~L 132 (637) ++.+ -.++|+-...+ ++ .++-...+++..++..+.-- ..--.++++.++.++ T Consensus 105 ~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~-----~~----~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~l 175 (563) T protein:vir:95 105 SNQVAMYCQPARYSEKGLGFEVRLRDLDAE-----PG----RKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDT 175 (563) T ss_pred HHHHHHHhhhhhhhcccccceeEEeecCCC-----cc----hhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHH Confidence 1111 13444432222 21 12211223333333333221 123458999999999 Q ss_pred cccccEEEEEEe-ecCCccccccccccccceeeeHHHhcc-CCCceeEEecCCCCcccccCCCceEEEEecCCcccc--c Q lcl|NC_021303. 133 TVVGEVWIAVLI-RQEKDPVTGLAAPRARWYAVTREEIKS-KAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKA--S 208 (637) Q Consensus 133 tVpGE~wi~il~-r~~~~~~~~~~~~~~~W~~vt~~Ei~~-k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra--~ 208 (637) -+-|.+++.++. |.+.+.+.+...-...+..+..+.-.. ......+....+|.........++|+++.+|.+... . T Consensus 176 ll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~ 255 (563) T protein:vir:95 176 YIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSG 255 (563) T ss_pred HhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCcccCc Confidence 999999886654 543322222211111222222111100 111222334445655443445677778888776432 2 Q ss_pred CCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHH Q lcl|NC_021303. 209 QATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMI 288 (637) Q Consensus 209 eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml 288 (637) .--||+.++...+.=..-+.+...+..+.-..-.|||-+|....+ +....+.|.+.+ T Consensus 256 ~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~l-----------------------s~e~~~~~~~~~ 312 (563) T protein:vir:95 256 YGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQ-----------------------SQHALENFKREW 312 (563) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCC-----------------------CHHHHHHHHHHH Confidence 356888877777765555566665555655666777777653211 111344454444 Q ss_pred HHHHhhcccCccccccccceeEeechHHhcccceeecCcchhHH-HHhhHHHHHHHHHhhcCCchhHhhccC-Ccceeee Q lcl|NC_021303. 289 YQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMS-KGNHWSA 366 (637) Q Consensus 289 ~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvtev-aiktR~daI~RlAmglDv~pErLLGls-~~NHWsA 366 (637) .+ ++... .-+--+|+|+ + +.++...+.....+. -+++|+..+..+|...-|||..| |+. ++++++. T Consensus 313 ~~----~~~G~-~nagk~~~vl--~----~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G~~~~~~~~~~ 380 (563) T protein:vir:95 313 KS----SLSGI-NGSWQIPVVM--A----DDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEI-GFPNRGGATGS 380 (563) T ss_pred HH----Hhccc-cccccceEEc--C----CCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-ccccccccccc Confidence 32 22211 1122355554 2 234444444443343 48999999999999999999765 874 4555432 Q ss_pred -E----------EeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHH---HHHh Q lcl|NC_021303. 367 -W----------AIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAV---EAHD 432 (637) Q Consensus 367 -W----------~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~---~a~d 432 (637) | +....-++.-|.|.+..|+++|+..+|.. .| .+|.+.|+-. |.....++. .+.. T Consensus 381 ~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~----~~---~~~~~~f~r~----D~~~~~e~~~~~~~~~ 449 (563) T protein:vir:95 381 KGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISE----YG---DKYTFQFVGG----DTKSATDKLNILKLET 449 (563) T ss_pred ccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh----cc---cccEEEeccC----CHHHHHHHHHHHHHhc Confidence 2 22233566679999999999999988864 22 4688888544 332233332 2467 Q ss_pred cCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhcccc--------------ccccCCCCc-C Q lcl|NC_021303. 433 RGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQL--------------AGIEFPQPA-N 497 (637) Q Consensus 433 rGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~--------------~~ie~P~p~-~ 497 (637) .|.+|-.-.|+.+|+.--.|=|. . -.|--+..+..+..... +...-|... . T Consensus 450 ~G~lT~NE~R~~~gl~Pi~gGD~-------------~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (563) T protein:vir:95 450 QIFKTVNEAREEQGKKPIEGGDI-------------I-LDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSE 515 (563) T ss_pred CCccCHHHHHHHhCCCCCCCcce-------------e-ecccccccccccccccCCCccccchhhhhcccccCCCCCCCC Confidence 79999999999999975433120 0 00000000000000000 000000000 0 Q ss_pred CCCCCCCCCCCCCCCCCCCCccCCCCCCCcccCCCCcchHHHHHHHHHHHHHHHHhcccccCCCchhhhh Q lcl|NC_021303. 498 AIESTREEDDEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRFKVNDAALKT 567 (637) Q Consensus 498 a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~a~~~~~a~~~aa~~llV~rALelAGkRr~~~~~~~~~~ 567 (637) ..++.+..+++ +++|..-..++++..... ..+... =|.+-.+.++ -+. T Consensus 516 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~--~~~~~~---------------~~~~~~~~~~--~~~ 563 (563) T protein:vir:95 516 EGQSTDSSNDD---KEIGTDAQIKGDDNVYRT--QTSNKG---------------QGRKGEKSSD--FKH 563 (563) T ss_pred CCCCCCCCCCc---cccccccccccccccccc--cCcccc---------------ccccCcCccc--ccC Confidence 00000000000 001100000000000000 000000 0111111111 111 No 75 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=98.50 E-value=1e-07 Score=58.85 Aligned_cols=375 Identities=13% Similarity=0.068 Sum_probs=187.5 Q ss_pred ceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEEEEeee Q lcl|NC_021303. 6 LRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTLIPSAI 85 (637) Q Consensus 6 lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~asei 85 (637) .=+.++-++-. + .-..+++. +-..|+.+.= .|.+ .=++-.|.-+++++|.+.+..=.. T Consensus 1 Mg~f~~~~~~~----~------~~~~~~~~----------~~~~~~~~~~-~~~~-~~v~~~v~~IA~~iA~lp~~~~~~ 58 (378) T protein:vir:94 1 MNLFGKVVSFS----R------GKLNNDTQ----------RVTAWQNEAV-EYTS-AFVTNIHNKIANEITKVEFNHVKY 58 (378) T ss_pred CCccccchhcc----c------ccccCCcc----------eeeeeccchh-HHHH-HHHHHHHHHHHhhhhhCceeeEEE Confidence 11222221100 0 00001111 0112333221 1111 113335788999999988754444 Q ss_pred ccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeee Q lcl|NC_021303. 86 DPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVT 165 (637) Q Consensus 86 D~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt 165 (637) +...|.. +.......+.+..+.+.-.---+-..++++.++.+|-.-|+.||.+..+... T Consensus 59 ~~~~~~~----~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~----------------- 117 (378) T protein:vir:94 59 KKSDVGS----DTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNT----------------- 117 (378) T ss_pred cccCccc----ccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCC----------------- Confidence 4332221 1111223355666665555566888899999999999999999976544221 Q ss_pred HHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCcee Q lcl|NC_021303. 166 REEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVL 245 (637) Q Consensus 166 ~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvl 245 (637) +.+..-.|++.+.+|.. +=|||+=+| -.-..--||...+...+ .++.++-- -+|+| T Consensus 118 ---------g~~~~l~p~~~~~~~~~--~diiH~~~~--~~~~~g~s~l~~~~~~i----------~~~~~~~~-~~gil 173 (378) T protein:vir:94 118 ---------GELLDLLFADDKKEYKP--EELVRLTSP--FYINEDTSILDNALASI----------QTKLEQGK-LRGLL 173 (378) T ss_pred ---------ceEEEEEecCCeeEeee--eeeEEecCc--CCccchhHHHHHHHHHH----------HHHHhccc-cccee Confidence 11112234444444433 235555333 23334456655554433 22222211 24777 Q ss_pred eecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeec Q lcl|NC_021303. 246 FVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKF 325 (637) Q Consensus 246 fvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf 325 (637) -+|..++- .+.+.+.+-|.+--+......+ +.. ++++ ++. .+++.|.+ T Consensus 174 ~~~~~l~~-------------------------~~~~~~~~~~~~~~~~~~~~~~-~g~--~~vl--~~g--~~~~~l~~ 221 (378) T protein:vir:94 174 KINAFLDI-------------------------DNTQEYREKALTTIKNMQEGSS-YNG--LTPV--DNK--TEIVELKK 221 (378) T ss_pred eeCCcCCH-------------------------HHHHHHHHHHHHHHHHhhcccc-ccc--ceec--CCC--ceEEEccC Confidence 66653321 1223333333332222222222 112 2222 332 46676666 Q ss_pred CcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCCh Q lcl|NC_021303. 326 GNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDP 405 (637) Q Consensus 326 ~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp 405 (637) ...... +.+++.....+|.-.-|||..|-|..+...| ..-++-.|.|.+..|+++|++.+|.+-=...|... T Consensus 222 ~~~~~~--~~~~~~~~~~Ia~~fgVP~~~l~~~~se~~~------~~f~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~ 293 (378) T protein:vir:94 222 DYSVLN--KDEIDLIKSELLTGYFMNENILLGTASQEQQ------IYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGN 293 (378) T ss_pred Chhhhh--HHHHHHHHHHHHHHhCCCHHHhcCChHHHHH------HHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhc Confidence 655444 4567778889999999999877554332222 33566789999999999999999876555455433 Q ss_pred HHeE-EeecCccccc-CC-CCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhh Q lcl|NC_021303. 406 TKYI-LWYDASGLTS-DP-DLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPL 482 (637) Q Consensus 406 ~kYv-vw~DaS~Lt~-dP-D~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apL 482 (637) .-|+ +.||.+.|.. |+ ++.+-...++..|.+|..-.|+.+|++.-.+=| +.+ +..| ++|+ T Consensus 294 ~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD----~~~-------~~~n------~~~~ 356 (378) T protein:vir:94 294 LYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD----VYI-------ANLN------AVAV 356 (378) T ss_pred ccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----eee-------eccc------cccc Confidence 3333 6789888753 32 344445668889999999999999998665422 000 0000 1111 Q ss_pred hccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_021303. 483 LSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTE 521 (637) Q Consensus 483 l~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdte 521 (637) +-+ ++......+...++|++-| T Consensus 357 --------~~~---------~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 357 --------KNL---------SDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred --------ccc---------hhhcCCcCCCCCCCCCCCC Confidence 100 0000000011111222222 No 76 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=98.45 E-value=6.2e-08 Score=60.10 Aligned_cols=373 Identities=12% Similarity=0.052 Sum_probs=188.9 Q ss_pred ceEEecCCCCCc---ccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEEEE Q lcl|NC_021303. 6 LRVVRRPKGSAP---AARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTLIP 82 (637) Q Consensus 6 lr~vrrpk~~~p---~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~a 82 (637) .++.++-+.... ..+. ...- ..++... ..+..|. .-....|+. .+-+.-.+.=+++.+|.+.+.. T Consensus 1 M~~f~~~~~~~~~~~~~~~-~~~~----~~~~~~~-~~~~~~~--~v~~~~al~----~~~v~~~i~~ia~~ia~~p~~~ 68 (386) T protein:vir:49 1 MPIFNITNLATESPPINQE-SFFD----IADSDFL-ASLNSSE--WVSAENALK----NSDLFSIISQLSNDLATAKITT 68 (386) T ss_pred CchhhhhccCCCCcccchh-hhhh----hhhcccc-ccccCCc--eechhhhhc----cHHHHHHHHHHHHHhhhCceee Confidence 334443332211 1111 0000 0000000 0000000 001112221 2344555677889999988866 Q ss_pred eeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccce Q lcl|NC_021303. 83 SAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWY 162 (637) Q Consensus 83 seiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~ 162 (637) -+-+.+ .+-.. ..--+-..++++.+..+|-+-|+.|+.+.-...|.+ . .++ T Consensus 69 ~~~~~~------~l~~~---------------PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~-------~-~l~ 119 (386) T protein:vir:49 69 SRKQLQ------GIVDN---------------PSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRD-------M-KWE 119 (386) T ss_pred ccchhh------hhhhc---------------cCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcE-------E-EEE Confidence 543322 01101 122256678999999999999999998765444431 1 333 Q ss_pred eeeHHHhc---cCCCceeE--Eec--CC-CCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHH Q lcl|NC_021303. 163 AVTREEIK---SKAGETAE--ISL--PD-GKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNA 234 (637) Q Consensus 163 ~vt~~Ei~---~k~g~~~~--i~l--Pd-G~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na 234 (637) .|..+.+. ...++... +.. +. |...+|.. +=||++=.+.+.....-.||+.++.+.+.=.....+...+. T Consensus 120 ~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~--~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~ 197 (386) T protein:vir:49 120 YLRPSQVSFNRLDNQNGLYYNITFDDPHIAPKQHVPQ--NDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISA 197 (386) T ss_pred EecCceeEEEEcCCCceEEEEEEEcCccccceeEEcc--ccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 33333332 12222221 222 22 23334433 23556545666555567799999988888777788888888 Q ss_pred HHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeech Q lcl|NC_021303. 235 AKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAA 314 (637) Q Consensus 235 ~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~ 314 (637) .+.-.+..|||.+|+.++-. ....+...+.+ .. ..+--++++ ++ T Consensus 198 ~~ng~~~~~il~~~~~~~~~-------------------------~~~~~~~~~~~----~~-----~n~g~~~vl--~~ 241 (386) T protein:vir:49 198 LKNALNANGILKIKGGGLLD-------------------------FKTKVSRSRQA----MK-----QMQGGPLVL--DD 241 (386) T ss_pred HHccCCccEEEEeCCCCChH-------------------------HHHHHHHHHHH----hc-----cCCCCceec--CC Confidence 88888899999998743211 11122222211 11 122233443 33 Q ss_pred HHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHH Q lcl|NC_021303. 315 EHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDIL 394 (637) Q Consensus 315 Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~L 394 (637) . .+++.|.+.... .--+++|+..+..+|...-|||..| |++..|+-++-++ ++-.+.-|.|.+..|++.|.+.++ T Consensus 242 g--~~~~~l~~~~~d-~~~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~~~~-~~~~~~~i~~~l~~i~~~~~~~l~ 316 (386) T protein:vir:49 242 L--EDFTPLEIKSNV-AQLLSQADWTTGQFAKVYGIPESIV-GGDGDQQSSLEMI-YNIYFKSVSRYLRPFVSEMSKKLS 316 (386) T ss_pred C--ceEEEccCChhH-HHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCccchHHHH-HHHHHHHHHHHHHHHHHHHHHHhc Confidence 3 355655544322 2347889999999999999998765 7654444333344 334456689999999999988775 Q ss_pred HHHHHHhCCChHHeEEeecCcccc-cCC-CCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCC Q lcl|NC_021303. 395 TPLLAREGIDPTKYILWYDASGLT-SDP-DLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKN 472 (637) Q Consensus 395 r~~L~~eGiDp~kYvvw~DaS~Lt-~dP-D~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~ 472 (637) .. +-||...+. .|+ .+......++..|.+|-.-.|+.++- .+|.+... T Consensus 317 ~~-------------~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~---~~~~~~~~-------------- 366 (386) T protein:vir:49 317 CE-------------VDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQ---AEILPKEL-------------- 366 (386) T ss_pred ch-------------hcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhh---CCCCCCcC-------------- Confidence 43 234544432 222 23344456778899999888877652 12221000 Q ss_pred chhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCC Q lcl|NC_021303. 473 PELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTED 522 (637) Q Consensus 473 P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted 522 (637) |. .. ....| +. .|.|.+.+| T Consensus 367 ~~---~~---------~~~~~-------~~-----------~gGd~~~~~ 386 (386) T protein:vir:49 367 PD---GK---------NPNRT-------SL-----------KGGEINEQD 386 (386) T ss_pred cc---hh---------ccCCC-------CC-----------CCCCCCCCC Confidence 00 00 00000 00 011111111 No 77 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=98.45 E-value=1.1e-07 Score=58.77 Aligned_cols=371 Identities=12% Similarity=0.074 Sum_probs=183.2 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |+==+ ++..| +.++ +...+.. .++.-+. +.|-...-+.-.+.-+++.||++.+ T Consensus 1 Mg~f~-~~f~~-~~~~------------~~~~~~~-~~~~~~~------------~~a~~~~~v~~~i~~ia~~ia~~p~ 53 (385) T protein:vir:95 1 MGLFD-SVFKR-HSEL------------SWMYDLE-FLQDKSK------------KAYLKQIALNTVVEMVARTISQSEF 53 (385) T ss_pred Cchhh-hhhcc-Cccc------------ccccchh-hhhccch------------hhhhhhHHHHHHHHHHHHHHcccce Confidence 55310 11111 1111 0001111 1111100 1111223345567889999999988 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+= |.+ ++ +.+..+.+.=-..-+-..++++.++.+|-+-|++||++ .+.++. .+... T Consensus 54 ~~~~~----~~~------~~----~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~-~~~~~~------~~~~~ 112 (385) T protein:vir:95 54 RVMKN----NTK------EK----GTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVK-NDEGHF------FVADD 112 (385) T ss_pred eeeec----Ccc------cc----chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEE-ecCCCe------eeccc Confidence 76541 221 11 23444443323355677899999999999999999754 343321 22223 Q ss_pred ceeeeHHHhccCCCceeEEecCCCCc-ccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHh Q lcl|NC_021303. 161 WYAVTREEIKSKAGETAEISLPDGKT-HEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRV 239 (637) Q Consensus 161 W~~vt~~Ei~~k~g~~~~i~lPdG~~-he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL 239 (637) |..-+...+.... ...+...++.. .+|.. +-||++=.+.+.-...-.||+..+...+.-. ....- T Consensus 113 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~--~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~----------~~~~~ 178 (385) T protein:vir:95 113 FEKEDELGLYSHR--FTNVLVNDFEFKRVFTM--DDVIYLKYNNQKLDAFSLGLFEDYGEIFGRM----------IDLQM 178 (385) T ss_pred ccccccccccccc--ceeeeecccceeeeecc--ccEEEecCCCCCcccccchHHHHHHHHHHHH----------HHHHH Confidence 3333332332211 11222222222 23322 3355554455544455667776655443221 11111 Q ss_pred hcC---ceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHH Q lcl|NC_021303. 240 MNN---GVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEH 316 (637) Q Consensus 240 ~gn---GvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Eh 316 (637) -+| |+|-+|. .. .......+.+++.+-+.-.....+.+ ++++ +++. T Consensus 179 ~~~~~~g~l~~~~------~~-----------------~~~~e~~~~~~~~~~~~~~g~~~~~~------~i~~-l~~g- 227 (385) T protein:vir:95 179 LNNQIRGILKVDA------TK-----------------FYNKEKQKELQAYIDTLFDAFQNNTI------AVVP-LTEG- 227 (385) T ss_pred hcCCCceEEEeCC------cc-----------------CCCHHHHHHHHHHHHHHhhhhhhcCC------ceEE-cCCC- Confidence 233 3333332 11 01122344455554433222222111 2221 2322 Q ss_pred hcccceeecCc-----chhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHh Q lcl|NC_021303. 317 LEKVQHIKFGN-----EVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYN 391 (637) Q Consensus 317 i~~ikHlkf~~-----dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~ 391 (637) .+++-|.+.. .-+.--+++|+.....+|.-.-|||..|-| +.-++.+....-++-.|.|.+..|+++|+. T Consensus 228 -~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~----~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~ 302 (385) T protein:vir:95 228 -LAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLG----EMADLEKTIESYLQFCINPLLRKIEAELNS 302 (385) T ss_pred -ceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcC----CCcCHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 3455554332 223446889999999999999999987732 333455556666777899999999999999 Q ss_pred HHHHHHHHHhCCChHHeEEeecCcccccCCCCCHH---HHHHHhcCCcCHHHHHHHhcCcccc--CCCCCchHHHHHHHH Q lcl|NC_021303. 392 DILTPLLAREGIDPTKYILWYDASGLTSDPDLSDE---AVEAHDRGAITSAALRRLLNVGEDS--GYDLTTLDGCREFAA 466 (637) Q Consensus 392 ~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tde---A~~a~drGaIt~eAlrr~lgl~~d~--~yd~~t~eg~r~~A~ 466 (637) .+|-+--. ..|-+.||.+.| .++|..+. ...+++.|++|-.-.|..+|++.-+ +=| + T Consensus 303 ~L~~~~~~------~~~~~~fd~~~l-~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd----~------- 364 (385) T protein:vir:95 303 KFFYQDEY------LNDDMHIKVVGI-DKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELD----K------- 364 (385) T ss_pred hcCChhhc------ccceEEEechhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCc----e------- Confidence 88765221 244578999887 34443333 3347889999999999999997422 111 0 Q ss_pred HHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCC Q lcl|NC_021303. 467 DVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDD 507 (637) Q Consensus 467 d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d 507 (637) -+.| ..++.++ ....|+.++| T Consensus 365 -----------~~~~---~n~~~~~------~~kgge~~~e 385 (385) T protein:vir:95 365 -----------FIIT---KNLQSAD------AFKGGESNEE 385 (385) T ss_pred -----------eeec---ccceecc------cccCCCCCCC Confidence 0001 0111111 1111221111 No 78 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=98.44 E-value=2.8e-08 Score=62.00 Aligned_cols=369 Identities=13% Similarity=0.066 Sum_probs=193.4 Q ss_pred ceEEe-cCCCC-CcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEEEEe Q lcl|NC_021303. 6 LRVVR-RPKGS-APAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTLIPS 83 (637) Q Consensus 6 lr~vr-rpk~~-~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~as 83 (637) ..+.. ++++. .|...+.++. ...+|.- +..+..|. .- ....|++ .+=+.=.+.-+++.+|.+.+... T Consensus 1 Mglf~~~~~~~~~~~~~~~~~~----~~~~~~~-~~~~~~~~-~v-~~~~al~----~~~V~~~i~~Ia~~ia~l~~~~~ 69 (384) T protein:vir:49 1 MPIFNITNLATESPPSNQDSFF----DITDPEF-LDALNGSE-WV-SAETALK----NSDLFSIISQLSNDLATAKITTS 69 (384) T ss_pred CccccccccCcccccccchhhc----cccchhh-cccccCCc-ee-chhhhhc----cHHHHHHHHHHHHHHhhCceeee Confidence 33432 22322 1111111111 1122221 12121111 11 1222332 23345566778999999988765 Q ss_pred eeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccccee Q lcl|NC_021303. 84 AIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYA 163 (637) Q Consensus 84 eiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~ 163 (637) +-+.+ .+-.. ...-+-..++++.++.+|-+-|+.|+.+.-...|. + . .++. T Consensus 70 ~~~~~------~l~~~---------------PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~-~------~-~L~~ 120 (384) T protein:vir:49 70 RKQLQ------GIVDN---------------PSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGR-D------M-KWEY 120 (384) T ss_pred cchhh------hhhhc---------------cCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCc-E------E-EEEE Confidence 43221 01111 11224556889999999999999999866443342 1 1 3333 Q ss_pred eeHHHhcc---CCCceeE--EecCC---CCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHH Q lcl|NC_021303. 164 VTREEIKS---KAGETAE--ISLPD---GKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (637) Q Consensus 164 vt~~Ei~~---k~g~~~~--i~lPd---G~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~ 235 (637) +..+.+.. ..++.+. +...+ |...+|.. .|+ |++=.+.+.....--||+.++.+.+.-.....+...+.. T Consensus 121 l~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~-~eV-ih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~ 198 (384) T protein:vir:49 121 LRPSQVSFNRLDNQNGLYYNITFDDPRIPPKQHVPQ-GDI-LHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNAL 198 (384) T ss_pred EcCceeEEEEcCCCceEEEEEEecCccccceeEecC-ccE-EEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHH Confidence 33333321 2222221 23222 22334443 333 444344444444567999999888887777777777777 Q ss_pred HhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechH Q lcl|NC_021303. 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (637) Q Consensus 236 ~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~E 315 (637) +.-..-.|||-+|+.++-.+ ..+...+. ....+..+ . |+++ ++. T Consensus 199 ~ng~~~~~il~~~~~~~~~~------------------------~~~~~~~~-----~~~~~n~~---~--~~vl--~~g 242 (384) T protein:vir:49 199 KNALNANGILKIKGGGLLDF------------------------KTKQSRSR-----QAMKQMQG---G--PLVL--DDL 242 (384) T ss_pred hccCCCceEEEeCCCCChHH------------------------HHHHHHHH-----HhcccCCc---c--ceec--CCC Confidence 77777778888776432211 11112111 11112222 2 3333 222 Q ss_pred HhcccceeecCcchhHHH-HhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHH Q lcl|NC_021303. 316 HLEKVQHIKFGNEVTEVE-IKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDIL 394 (637) Q Consensus 316 hi~~ikHlkf~~dvteva-iktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~L 394 (637) -+++.+... ..+.. +++|+..+..+|..+=|||..| |..+.|+-+.=++ .+-.+-+|.|.+.-|+..|...+. T Consensus 243 --~~~~~l~~~--~~d~q~~e~~~~~~~~Ia~~fgVp~~~l-g~~~~~~~~~~~~-~~~~~~~i~~~l~pi~~~i~~~l~ 316 (384) T protein:vir:49 243 --EDFTPLEIK--SNVAQLLSQADWTTGQFAKVYGIPESVV-GGEGDKQSSLEMI-YNIYFKAVSRFLRPFVSELSKKLS 316 (384) T ss_pred --ceEEEccCC--hhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCccccHHHH-HHHHHHHHHHHHHHHHHHHHHHhc Confidence 244444443 33333 7899999999999999999855 5543333332223 233445688888888888888876 Q ss_pred HHHHHHh--CCChHHeEEeecCccccc-CCCCCHHHHHHH-hcCCcCHHHHHHHhcCccccCCCCCchHHH Q lcl|NC_021303. 395 TPLLARE--GIDPTKYILWYDASGLTS-DPDLSDEAVEAH-DRGAITSAALRRLLNVGEDSGYDLTTLDGC 461 (637) Q Consensus 395 r~~L~~e--GiDp~kYvvw~DaS~Lt~-dPD~tdeA~~a~-drGaIt~eAlrr~lgl~~d~~yd~~t~eg~ 461 (637) +.+.-.. -++++.|-++||.+.|.. +.---.||.+.. +.|..+.| +|+..|+.--.| =++.|.| T Consensus 317 ~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~ne-~r~~~~~~p~~g--Gd~~~~~ 384 (384) T protein:vir:49 317 CEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILPKD-LPEGETDSTLKG--GETNEQY 384 (384) T ss_pred hhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCChh-HHHHcCCCCCCC--CCCCCCC Confidence 6553222 235677888998887743 233345555543 44877755 788888854332 2445555 No 79 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=98.42 E-value=3.5e-07 Score=55.98 Aligned_cols=406 Identities=12% Similarity=0.117 Sum_probs=185.9 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcc--cccccchhh-HHHHHhhhhhhHhhHhhhhhcceee Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLM--GTARNEWQS-EAWDFSESIGELSYYISWRANSCSR 77 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~--g~~r~~WQ~-eAW~~yd~VgELryyvgWr~~s~Sr 77 (637) |+--.| .++++. |++. -+..+.+.-++.... |.....+-. .|. ..+-+.-.|.-+++++++ T Consensus 23 ~~~~~l--f~~~e~-------R~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~al----~~~~V~~cv~~Ia~~iA~ 86 (441) T protein:vir:79 23 LVVVGI--FYKNEK-------RDLQ---YNEDDLQMMVQTLPGFQGTKLRQYKDIEAI----RHSDIFTAVMMIASDLAR 86 (441) T ss_pred hhcccc--cccccc-------cccc---CCCcchHHHHHHhcccCcccccccchhhhh----ccHHHHHHHHHHHHhhcc Confidence 221111 011111 1111 111111111111111 110011110 111 122333446678888888 Q ss_pred eEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccc Q lcl|NC_021303. 78 TTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAP 157 (637) Q Consensus 78 ~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~ 157 (637) +-|..-+ + |+ +. . .+.+..+++.=...-+...++++.++.+|-+-|+.|+.+.-...|. + T Consensus 87 lp~~~~~---~-~~----~~-~----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-------~ 146 (441) T protein:vir:79 87 MPIRVTV---N-GQ----IN-Y----SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGE-------P 146 (441) T ss_pred Cceeeec---C-cc----cc-c----cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-------E Confidence 8775422 3 22 22 1 2345556555555668888999999999999999999876443442 1 Q ss_pred cccceeeeHHHhc--c-CCCceeE-EecCCCCcccc--cCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_021303. 158 RARWYAVTREEIK--S-KAGETAE-ISLPDGKTHEF--NRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKI 231 (637) Q Consensus 158 ~~~W~~vt~~Ei~--~-k~g~~~~-i~lPdG~~he~--~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 231 (637) .+=| .|....+. . ..|...+ ....+|..+.+ .-..+-||++=. ++-.-..--||+..+...+.-..-..+.. T Consensus 147 ~~L~-~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~-~~~dg~~G~spl~~~~~~i~~~~~~~~~~ 224 (441) T protein:vir:79 147 MNLT-FRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKF-YSLDGINGLSLLDTLSRTIESDNNGKDFL 224 (441) T ss_pred EEEE-EEcCceeEEEECCCccEEEEEEEeccCCceeEEEEccccEEEecc-CCCCCccccCHHHHHHHHHHHHHHHHHHH Confidence 2122 22222222 1 1222221 12222222211 111223444411 12222334566555555444333333333 Q ss_pred HHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_021303. 232 KNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVAS 311 (637) Q Consensus 232 ~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~ 311 (637) .+..+.-..-.|||-+|+.++= .-+.+.+.+.+ +.++.-.+.+ --|+|+ T Consensus 225 ~~~f~ng~~p~gil~~~~~~~~------------------------~e~~e~~r~~~----~~~~~G~~na--g~~~vl- 273 (441) T protein:vir:79 225 NNFLRNGTHAGGILKMKGVLDN------------------------KKARDRAREEF----HKSFSGTKQA--GKVVVL- 273 (441) T ss_pred HHHHhccCCCcEEEEcCCCCCC------------------------HHHHHHHHHHH----HHHhcCcccc--Ccceec- Confidence 3333333444577777663321 01223343333 2333321111 122332 Q ss_pred echHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHh Q lcl|NC_021303. 312 VAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYN 391 (637) Q Consensus 312 vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~ 391 (637) ++. -+++.|.+..+..+ -+++|+..+..+|.-+-|||. |||+.+.| .+..|.+-.=++ -|.|.+..|+++|+. T Consensus 274 -~~G--~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~-~s~~q~~~~~~~-tl~P~~~~ie~eln~ 346 (441) T protein:vir:79 274 -DES--MTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLH-KFGIETAN-MSITDANLDYLS-TLKPYITCVCAELNF 346 (441) T ss_pred -CCC--ceEEEccCChhHHH-HHHHHHHhHHHHHHHhCCCHH-HcCCCCCC-ccHHHHHHHHHH-HHHHHHHHHHHHHhh Confidence 333 35666665543333 478999999999999999987 56885543 244443333222 599999999999998 Q ss_pred HHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHH Q lcl|NC_021303. 392 DILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADV 468 (637) Q Consensus 392 ~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~ 468 (637) .++.. . ..|-+.||.+.|.. .|..+.+ ..++..|.+|-.-.|+.+|++--.+-|- T Consensus 347 kl~~~----~----~~~~~~fd~~~llr-~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~------------- 404 (441) T protein:vir:79 347 KFNDE----Y----VNREFKFDTTEIRV-VDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNG------------- 404 (441) T ss_pred hcccc----c----cCceEEeechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCc------------- Confidence 76532 1 24557999999743 4444333 3466789999999999999975443220 Q ss_pred hcCCchhHHHHHhhhccccccccCC-CCcCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_021303. 469 VTKNPELIAMYAPLLSSQLAGIEFP-QPANAIESTREEDDEDSGARQQREPQ 519 (637) Q Consensus 469 v~~~P~Li~~~apLl~~~~~~ie~P-~p~~a~~~~~~~~d~~~~a~~g~EPd 519 (637) .+ .+.+ .....++.. +.+ .+.....++.-..|++-+ T Consensus 405 -----~~--~~~~---~n~~~~~~~~~~~-----~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 405 -----SI--HRVD---LNHVNIELVDEYQ-----MNKSRATDKKLKGGEENE 441 (441) T ss_pred -----ce--Eeec---ccccccccccccc-----cccccccccccCCCCCCC Confidence 00 0000 001111110 000 000000000001111111 No 80 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=98.42 E-value=3.5e-07 Score=55.98 Aligned_cols=406 Identities=12% Similarity=0.117 Sum_probs=185.9 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcc--cccccchhh-HHHHHhhhhhhHhhHhhhhhcceee Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLM--GTARNEWQS-EAWDFSESIGELSYYISWRANSCSR 77 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~--g~~r~~WQ~-eAW~~yd~VgELryyvgWr~~s~Sr 77 (637) |+--.| .++++. |++. -+..+.+.-++.... |.....+-. .|. ..+-+.-.|.-+++++++ T Consensus 23 ~~~~~l--f~~~e~-------R~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~al----~~~~V~~cv~~Ia~~iA~ 86 (441) T protein:vir:94 23 LVVVGI--FYKNEK-------RDLQ---YNEDDLQMMVQTLPGFQGTKLRQYKDIEAI----RHSDIFTAVMMIASDLAR 86 (441) T ss_pred hhcccc--cccccc-------cccc---CCCcchHHHHHHhcccCcccccccchhhhh----ccHHHHHHHHHHHHhhcc Confidence 221111 011111 1111 111111111111111 110011110 111 122333446678888888 Q ss_pred eEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccc Q lcl|NC_021303. 78 TTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAP 157 (637) Q Consensus 78 ~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~ 157 (637) +-|..-+ + |+ +. . .+.+..+++.=...-+...++++.++.+|-+-|+.|+.+.-...|. + T Consensus 87 lp~~~~~---~-~~----~~-~----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-------~ 146 (441) T protein:vir:94 87 MPIRVTV---N-GQ----IN-Y----SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGE-------P 146 (441) T ss_pred Cceeeec---C-cc----cc-c----cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-------E Confidence 8775422 3 22 22 1 2345556555555668888999999999999999999876443442 1 Q ss_pred cccceeeeHHHhc--c-CCCceeE-EecCCCCcccc--cCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_021303. 158 RARWYAVTREEIK--S-KAGETAE-ISLPDGKTHEF--NRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKI 231 (637) Q Consensus 158 ~~~W~~vt~~Ei~--~-k~g~~~~-i~lPdG~~he~--~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 231 (637) .+=| .|....+. . ..|...+ ....+|..+.+ .-..+-||++=. ++-.-..--||+..+...+.-..-..+.. T Consensus 147 ~~L~-~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~-~~~dg~~G~spl~~~~~~i~~~~~~~~~~ 224 (441) T protein:vir:94 147 MNLT-FRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKF-YSLDGINGLSLLDTLSRTIESDNNGKDFL 224 (441) T ss_pred EEEE-EEcCceeEEEECCCccEEEEEEEeccCCceeEEEEccccEEEecc-CCCCCccccCHHHHHHHHHHHHHHHHHHH Confidence 2122 22222222 1 1222221 12222222211 111223444411 12222334566555555444333333333 Q ss_pred HHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_021303. 232 KNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVAS 311 (637) Q Consensus 232 ~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~ 311 (637) .+..+.-..-.|||-+|+.++= .-+.+.+.+.+ +.++.-.+.+ --|+|+ T Consensus 225 ~~~f~ng~~p~gil~~~~~~~~------------------------~e~~e~~r~~~----~~~~~G~~na--g~~~vl- 273 (441) T protein:vir:94 225 NNFLRNGTHAGGILKMKGVLDN------------------------KKARDRAREEF----HKSFSGTKQA--GKVVVL- 273 (441) T ss_pred HHHHhccCCCcEEEEcCCCCCC------------------------HHHHHHHHHHH----HHHhcCcccc--Ccceec- Confidence 3333333444577777663321 01223343333 2333321111 122332 Q ss_pred echHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHh Q lcl|NC_021303. 312 VAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYN 391 (637) Q Consensus 312 vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~ 391 (637) ++. -+++.|.+..+..+ -+++|+..+..+|.-+-|||. |||+.+.| .+..|.+-.=++ -|.|.+..|+++|+. T Consensus 274 -~~G--~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~-~s~~q~~~~~~~-tl~P~~~~ie~eln~ 346 (441) T protein:vir:94 274 -DES--MTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLH-KFGIETAN-MSITDANLDYLS-TLKPYITCVCAELNF 346 (441) T ss_pred -CCC--ceEEEccCChhHHH-HHHHHHHhHHHHHHHhCCCHH-HcCCCCCC-ccHHHHHHHHHH-HHHHHHHHHHHHHhh Confidence 333 35666665543333 478999999999999999987 56885543 244443333222 599999999999998 Q ss_pred HHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHH Q lcl|NC_021303. 392 DILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADV 468 (637) Q Consensus 392 ~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~ 468 (637) .++.. . ..|-+.||.+.|.. .|..+.+ ..++..|.+|-.-.|+.+|++--.+-|- T Consensus 347 kl~~~----~----~~~~~~fd~~~llr-~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~------------- 404 (441) T protein:vir:94 347 KFNDE----Y----VNREFKFDTTEIRV-VDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNG------------- 404 (441) T ss_pred hcccc----c----cCceEEeechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCc------------- Confidence 76532 1 24557999999743 4444333 3466789999999999999975443220 Q ss_pred hcCCchhHHHHHhhhccccccccCC-CCcCCCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_021303. 469 VTKNPELIAMYAPLLSSQLAGIEFP-QPANAIESTREEDDEDSGARQQREPQ 519 (637) Q Consensus 469 v~~~P~Li~~~apLl~~~~~~ie~P-~p~~a~~~~~~~~d~~~~a~~g~EPd 519 (637) .+ .+.+ .....++.. +.+ .+.....++.-..|++-+ T Consensus 405 -----~~--~~~~---~n~~~~~~~~~~~-----~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 405 -----SI--HRVD---LNHVNIELVDEYQ-----MNKSRATDKKLKGGEENE 441 (441) T ss_pred -----ce--Eeec---ccccccccccccc-----cccccccccccCCCCCCC Confidence 00 0000 001111110 000 000000000001111111 No 81 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=98.41 E-value=7.1e-08 Score=59.79 Aligned_cols=421 Identities=13% Similarity=0.065 Sum_probs=198.4 Q ss_pred ceEEecCCCCCcccccchheehhcccc--------------chhhhhhhhcccccccchhhHH-----HHHhhhhhhHhh Q lcl|NC_021303. 6 LRVVRRPKGSAPAARRRSLTAASQLIT--------------DPQKQMKTSLMGTARNEWQSEA-----WDFSESIGELSY 66 (637) Q Consensus 6 lr~vrrpk~~~p~~~r~~ltAAs~~~~--------------~p~~~~k~~~~g~~r~~WQ~eA-----W~~yd~VgELry 66 (637) ..+..|=++...++.+.++.-.++... +|.. +....| .+..|-+.+ ++.|-.++-+.- T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g-~~~~~~~~~g~~v~~~~a~~~~~v~~ 77 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRI--QQTLAG-PSTELAPDTFVGLATQAYQANGPVFA 77 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHH--HHhhcc-ccccccCccccccchhhhhccHHHHH Confidence 333333333322221212211111111 1111 111111 112233222 333445677888 Q ss_pred HhhhhhcceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeec Q lcl|NC_021303. 67 YISWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQ 146 (637) Q Consensus 67 yvgWr~~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~ 146 (637) .|.-++++++++-|..-+-+ |.|. +++.+ +....|.+. ----+-..++++.++.+|-+-|+.|+.|.-.. T Consensus 78 ~i~~Ia~~ia~lp~~~~~~~-~~~~----~~~~~----~~~~~L~~~-PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~ 147 (466) T protein:vir:81 78 CMLVRQLVFSSVRFRWQRLR-DGKP----SDTFG----SRDLQILET-PWKGGTTQDMLSRMIQDADLAGNSYWTIVDGE 147 (466) T ss_pred HHHHHHHhhccCceEEEEec-CCce----eeccc----cHHHHHhhC-CCCCCCHHHHHHHHHHHHHhcCCeEEEEEecC Confidence 88999999999988887765 3221 22222 234444332 22335567899999999999999999975432 Q ss_pred CCcc-ccccccccccceeeeHHHhcc---CCCce--eEEecCCC-----CcccccCCCceEEEE-ecCCcccccCCccch Q lcl|NC_021303. 147 EKDP-VTGLAAPRARWYAVTREEIKS---KAGET--AEISLPDG-----KTHEFNRDLDSLVRI-WNPRPRKASQATSPV 214 (637) Q Consensus 147 ~~~~-~~~~~~~~~~W~~vt~~Ei~~---k~g~~--~~i~lPdG-----~~he~~~~~d~l~Rv-W~P~prra~eaDSPv 214 (637) .|.- ++....+. .++.|..+-+.. ..+.. .+..--+| ...+|.. +=|||+ ..++|.....--||+ T Consensus 148 ~g~l~~~~~g~~~-~l~~l~~~~v~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~--~dviHir~~~~~~d~~~G~s~i 224 (466) T protein:vir:81 148 FVRMRPDWVDVVV-EERMVRGGRGELGGGQLGWRKVGYLYTEGGRQSGNESVGFLA--EDVVHFAPIPDPLASYRGMSWL 224 (466) T ss_pred ccccccccCccee-EEEEecCcceEEEEcCCCceEEEEEEEecCcccccceeeecc--ccEEEEcCCCCcccccccccHH Confidence 2210 11112222 455555544432 11111 11110111 2222322 234454 345666666778888 Q ss_pred hhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhh Q lcl|NC_021303. 215 RACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVA 294 (637) Q Consensus 215 ra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~a 294 (637) ..+.+.+.-.....+...+..+.-..-.|||-.|+.++ ....+.|.+.|.+.-. T Consensus 225 ~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~l~-------------------------~e~~~~~~~~~~~~~~- 278 (466) T protein:vir:81 225 TPILREIRADQAMSKHQAKFFDNGATVNLVIKHNPMAD-------------------------PAAVKKWADEVNSKHA- 278 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCC-------------------------HHHHHHHHHHHHHHhc- Confidence 88777665555555555554554455566776665332 1134445554443321 Q ss_pred cccCccccccccceeEeechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccC----CcceeeeEEec Q lcl|NC_021303. 295 AMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMS----KGNHWSAWAIG 370 (637) Q Consensus 295 ai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls----~~NHWsAW~I~ 370 (637) ..+. +--|+|+ ++. -+++.|.+.. .+.--+++|+-.+..+|.-.-|||. |||+. .+++-+..|.. T Consensus 279 g~~n-----~g~~~vl--~~g--~~~~~l~~~~-~d~q~le~~~~~~~~Ia~~fgVPp~-~lG~~~~~~~st~sn~eq~~ 347 (466) T protein:vir:81 279 GVDN-----AWKNLNL--YPG--ADADVVGSNL-QEIDFKNVRGGGETRIAAAAGVPPV-IVGLSEGLAAATYSNYGQAR 347 (466) T ss_pred Cccc-----cccceEc--CCC--ceEEEccCCh-hHHHHHHHHHHHHHHHHHHhCCCHH-HcccccCCCccccccHHHHH Confidence 1111 1223333 222 4566676643 2333478999999999999999976 55653 34444566666 Q ss_pred cCceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHHH----------HHhcCCcCHHH Q lcl|NC_021303. 371 DEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVE----------AHDRGAITSAA 440 (637) Q Consensus 371 dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~----------a~drGaIt~eA 440 (637) ..=++.-|.|.+..|+++|++.++++ .++ .+|-+=||...|... |..+.+.. +...| ||-+- T Consensus 348 ~~f~~~tl~P~~~~ie~~l~~~L~~~---~~~---~~~~~~f~~~~llr~-d~~~r~~~~~~~~~~~~~~~~~g-~t~nE 419 (466) T protein:vir:81 348 RRLADGTAHPLWQNLSGCIGHVMPDM---GPD---VRLWYDADDVPFLRE-DEKDAADIQKVRAETINTLITAG-YEPES 419 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCc---ccC---cceEEEecchhhhcc-CHHHHHHHHHHHHHHHHHHHHcC-CChhh Confidence 66778889999999999999887763 222 235566777766433 32221110 11223 33333 Q ss_pred HHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCC Q lcl|NC_021303. 441 LRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQR 516 (637) Q Consensus 441 lrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~ 516 (637) .|...+.++.--+. |. .+.|+ +. .|+..++....+.+ ....+.++|+ T Consensus 420 ~r~~~~~gd~~~~~------------------~~---~~~~~-----~~--~~~~~~~~~~~~~~-~~~Gg~~ngn 466 (466) T protein:vir:81 420 VVAAVNSGDLRLLK------------------HT---GLTSV-----QL--LPPGVSASASSDTP-TSGGADDNGN 466 (466) T ss_pred ccccccCCcccccc------------------CC---Ccchh-----hh--cccccccccCCCCc-ccCCCCcCCC Confidence 33222211110000 00 00000 00 11111111111110 0001111222 No 82 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=98.41 E-value=3e-07 Score=56.32 Aligned_cols=375 Identities=12% Similarity=0.059 Sum_probs=183.4 Q ss_pred ceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEEEEeee Q lcl|NC_021303. 6 LRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTLIPSAI 85 (637) Q Consensus 6 lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~asei 85 (637) .=+.++-++-. + ++..+....-..||.++=. |.+ .=+.-.|.-++++||.+.|-.-+. T Consensus 1 Mg~f~~~~~~~-------------------~-~~~~~~~~~~~~~~~~~~~-~~~-~~v~~~i~~Ia~~iA~l~~~~~~~ 58 (378) T protein:vir:16 1 MNLFGKVVSFS-------------------R-GKLNNDTQRVTAWQNEAVE-YTS-AFVTNIHNKIANEITKVEFNHVKY 58 (378) T ss_pred Cccchhhhhhh-------------------c-ccccCCcceeeecccchhh-HHH-HHHHHHHHHHHhhhhhCceeEEEE Confidence 11222221100 0 0011111122356655322 111 113334677999999999855444 Q ss_pred ccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeee Q lcl|NC_021303. 86 DPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVT 165 (637) Q Consensus 86 D~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt 165 (637) ..+.+..-...+ ...+.+..+.+-=-.--+...++++.++.+|..-|+.||.+.-. +. T Consensus 59 ~~~~~~~~~~~~----~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d-~~----------------- 116 (378) T protein:vir:16 59 KKSDVGSDTLIS----MAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFD-DN----------------- 116 (378) T ss_pred cccccccccccc----cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEee-cC----------------- Confidence 433121110111 12344555543333345778899999999999999999976432 21 Q ss_pred HHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCcee Q lcl|NC_021303. 166 REEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVL 245 (637) Q Consensus 166 ~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvl 245 (637) . +.+.--.|++.+.+|..+ =+|++=+| -.-..--||...++..+ .++.++-. -+|+| T Consensus 117 -------~-g~~~~l~~~~~~~~~~~~--diih~r~~--~~~~~~~s~l~~~~~~i----------~~~~~~~~-~~g~l 173 (378) T protein:vir:16 117 -------T-GELLDLLFADDKKEYKPE--ELVRLTSP--FYINEDTSILDNALASI----------QTKLEQGK-LRGLL 173 (378) T ss_pred -------C-ceEEEEEecCCeeEeccc--ceEEecCc--cCccchhHHHHHHHHHH----------HHHHhcCc-cceee Confidence 1 111112344444455332 24444222 22233445555544333 23222211 13565 Q ss_pred eecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeec Q lcl|NC_021303. 246 FVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKF 325 (637) Q Consensus 246 fvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf 325 (637) -.|..++ ....+.+.+-|.+.-+......+ +.. ++++ ++. -+++.|.. T Consensus 174 ~~~~~l~-------------------------~~~~~~~~~~~~~~~~~~~~~~~-~g~--~~vl--~~g--~~~~~l~~ 221 (378) T protein:vir:16 174 KINAFLD-------------------------IDNTQEYREKALTTIKNMQEGSS-YNG--LTPV--DNK--TEIVELKK 221 (378) T ss_pred EeCCcCC-------------------------HHHHHHHHHHHHHHHHHhhcccc-ccc--ceEc--CCC--ceEEEccC Confidence 5554322 11233344444443333333222 112 2333 222 34566655 Q ss_pred CcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCCh Q lcl|NC_021303. 326 GNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDP 405 (637) Q Consensus 326 ~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp 405 (637) ...... +..+......+|.-.-|||..|-|..+. +....-++-.|.|.+..|+++|+..+|.+--...|... T Consensus 222 ~~~~~~--~~~~~~~~~~Ia~~fgVPp~~l~g~~~e------~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~ 293 (378) T protein:vir:16 222 DYSVLN--KDEIDLIKSELLTGYFMNENILLGTASQ------EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGN 293 (378) T ss_pred Chhhhh--HHHHHHHHHHHHHHhCCCHHHhcCCchH------HHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhc Confidence 554433 3455567789999999999877554321 33334456779999999999999999877655555544 Q ss_pred HHeE-EeecCcccc-cCC-CCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhh Q lcl|NC_021303. 406 TKYI-LWYDASGLT-SDP-DLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPL 482 (637) Q Consensus 406 ~kYv-vw~DaS~Lt-~dP-D~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apL 482 (637) ..|. +.||.+.|. .|+ ++.+-...++..|.+|-.-.|+.+|++.-.+=| +.+. ..| +.|+ T Consensus 294 ~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD----~~~~-------~~n------~~~~ 356 (378) T protein:vir:16 294 LYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD----VYIA-------NLN------AVAV 356 (378) T ss_pred ccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----eEee-------ccc------cccc Confidence 4444 778888774 343 333444668889999999999999998654312 0000 000 1111 Q ss_pred hccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_021303. 483 LSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTE 521 (637) Q Consensus 483 l~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdte 521 (637) -+ + ........+...++|.+-| T Consensus 357 ~~--------~---------~~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 357 KN--------L---------SDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred cc--------h---------hhhcCccCCCCCCCCCCCC Confidence 00 0 0000000000111111111 No 83 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=98.39 E-value=8.9e-07 Score=53.76 Aligned_cols=463 Identities=14% Similarity=0.118 Sum_probs=193.8 Q ss_pred CCCC-cceEE---ecCCCCCc------------c-------cccchheehhccccchh-hhhhhhcccccccchhhHHHH Q lcl|NC_021303. 1 MAAT-SLRVV---RRPKGSAP------------A-------ARRRSLTAASQLITDPQ-KQMKTSLMGTARNEWQSEAWD 56 (637) Q Consensus 1 ma~~-~lr~v---rrpk~~~p------------~-------~~r~~ltAAs~~~~~p~-~~~k~~~~g~~r~~WQ~eAW~ 56 (637) ||-- +||-. ---|.+.- + +.|....+.=--.++|. ...|.+...+ .+.+ +-=. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~--~~~~-~l~~ 77 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTSYIELGDYDKDIVNKAIRPGRASARDTVDGIDIADGNVAGQYSVASISDV--LSTK-KLLK 77 (535) T ss_pred ChhhHHHHHHHHhhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhhccccccccccCCcccccccCccccc--cCHH-HHHH Confidence 5432 23210 00111110 0 00000101000011121 1111110000 0110 0001 Q ss_pred HhhhhhhHhhHhhhhhcce---------------eeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcc-- Q lcl|NC_021303. 57 FSESIGELSYYISWRANSC---------------SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPL-- 119 (637) Q Consensus 57 ~yd~VgELryyvgWr~~s~---------------Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~l-- 119 (637) +|.-.+-++-.+.=+++.+ ..++||-... .+++ ++-...+.+..++..--.-.+ T Consensus 78 ~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~-----~~~~----~~~~~~~~l~~lL~~~PN~~~~~ 148 (535) T protein:vir:10 78 AYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATK-----VMSK----AQIKRAHEIEDFIYNTGSEYYEW 148 (535) T ss_pred HhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccC-----CCcc----hhhhhhhHHHHHHHhCCCCCCCh Confidence 1111122222221112111 2455552221 2221 111223444444432222222 Q ss_pred --cHHHHHHHHHhh-hcccccEEEEEEeecCCccccccccccccceeeeHHHhccCCCceeEEecCCCCcccccCCCceE Q lcl|NC_021303. 120 --GQAALIKRAVEC-MTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSL 196 (637) Q Consensus 120 --GqaqLlkr~~~~-LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l 196 (637) --.++++.++.+ |..-|..|+.|. |.+++-+.+.-+=...+..+..+.-..+.+........+|...+|.. .+++ T Consensus 149 ~~~~~~~~~~lv~d~l~~~g~ay~~i~-r~~~G~~~~L~~l~p~~V~v~~d~~~~~~~~~~~~~~~~~~~~~~~~-~eii 226 (535) T protein:vir:10 149 RDTFPRLLTKIINDMYVQDQINIERIF-KNDSNELDHFNAVDASKVVISYSPRSKDQPRKFEQFVSETKSVKFSE-RNLT 226 (535) T ss_pred hHHHHHHHHHHHHHHHhhCCceEEEEE-ECCCCcEEEEEEeCCceeEEEEcCccccCceEEEEEecCceeEEECc-ccEE Confidence 224688888887 455556565554 43332122111111112222221111111122222333344444443 3444 Q ss_pred -EEEecC-CcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCccccc Q lcl|NC_021303. 197 -VRIWNP-RPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPE 274 (637) Q Consensus 197 -~RvW~P-~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~ 274 (637) ||-|++ +...-..--||+.++...+.-..-..+...+..+.=..-.|||-+|..+.--. T Consensus 227 h~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~l------------------- 287 (535) T protein:vir:10 227 FINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQA------------------- 287 (535) T ss_pred EEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCccc------------------- Confidence 554432 22333345688888887777777666666665555555668888876322100 Q ss_pred CCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhH Q lcl|NC_021303. 275 VSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPER 354 (637) Q Consensus 275 ~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pEr 354 (637) .....+.|++.+ +..+...+ -+--+||+... + -+.+.+.+... +.--+++|+..+..+|.-.-|||.. T Consensus 288 --s~e~~e~lk~~~----~~~~~G~~-nag~~~vl~~~-g---~~~~~l~~~~~-D~qfle~~~~~~~eIa~afgVPp~~ 355 (535) T protein:vir:10 288 --NQMMLAGIRRQW----TSQGSGLG-GAWKIPILAAK-D---AKFVNMTQNSR-DMEFDKFLNFMIYDTAAIFQMQPEE 355 (535) T ss_pred --CHHHHHHHHHHH----HHHhcCcc-cccccccccCC-C---ceEEecCCChh-HHHHHHHHHHHHHHHHHHhCCCHHH Confidence 111333343332 33333322 23455666431 1 24445555432 2335899999999999999999976 Q ss_pred hhccC-CcceeeeE------------EeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCC Q lcl|NC_021303. 355 LLGMS-KGNHWSAW------------AIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDP 421 (637) Q Consensus 355 LLGls-~~NHWsAW------------~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dP 421 (637) | |+. ++|+-+.. +....-++..|.|.+..|+++|+..+|.. .|. +|.+.||. -++.|. T Consensus 356 l-G~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~----~~~---~~~f~f~~-l~~~d~ 426 (535) T protein:vir:10 356 I-NFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMRY----VDT---DYRFSFTL-GDAQDK 426 (535) T ss_pred h-ccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc----cCC---eEEEEecc-ccccCH Confidence 5 884 45543322 22222345569999999999999988742 232 57888874 566666 Q ss_pred CCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhcc-ccccccCCCCcCCCC Q lcl|NC_021303. 422 DLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSS-QLAGIEFPQPANAIE 500 (637) Q Consensus 422 D~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~-~~~~ie~P~p~~a~~ 500 (637) ....++.+..-+|.+|-.-.|+.+|++--.|=|. -...+..+ -++.+ .......|.+ ..+ T Consensus 427 ~~r~~~~~~~~~g~lT~NE~R~~~gl~piegGD~----~~~~~~~~-------------~~~~~~~~~~~~~p~~--~~~ 487 (535) T protein:vir:10 427 LQEEQVWKLKLANGYFINEYRKDHGLKTVDGLDV----PGFIGSAE-------------NFINATGFGQPNVPDS--SDD 487 (535) T ss_pred HHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCccc----cccccchh-------------hcccccccccccCCCC--CCC Confidence 6556677777789999999999999986543230 00001000 00000 0000111111 111 Q ss_pred CCCCC-----CCC-------CCCCCCCCCccCCCCCCCcccCCCCcchHH Q lcl|NC_021303. 501 STREE-----DDE-------DSGARQQREPQTEDERSTEEAASLNDRAAY 538 (637) Q Consensus 501 ~~~~~-----~d~-------~~~a~~g~EPdted~~~~~~~a~~~~~a~~ 538 (637) .+..- .+. +.+.++++.|.+.+ +.....+....+.. T Consensus 488 ~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~~--~~~~~~~~~~~~~~ 535 (535) T protein:vir:10 488 SGSTLGERERQERIQHSKDYEKGKDDPKSPLPKP--SESDDVSNNEDADT 535 (535) T ss_pred ccccCCccccCcccccccccccCCCCCCCCCCcC--CCCCccccccccCC Confidence 11110 000 01111122221111 11111111111111 No 84 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=98.34 E-value=5.6e-07 Score=54.87 Aligned_cols=386 Identities=11% Similarity=0.104 Sum_probs=177.3 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.==+ |+-+|.+... ..++ + ..+..-..+.|=...-++-.+.-+++.+|++.+ T Consensus 1 Mg~f~-~lf~~~~~~~-------------~~~~----~---------~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~ 53 (395) T protein:vir:10 1 MSILE-KIFKTRKDIT-------------YMLD----L---------DMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHF 53 (395) T ss_pred Cchhh-hhhccCcccc-------------cccc----c---------hhccccchhhhhhhHHHHHHHHHHHHhhcccee Confidence 32210 2222211100 0000 0 000000112222345566678889999999987 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+ .+. + ++ +.+..+.+.=-..-+-..++++.++.+|-.-|+.++++. + +++. .+. . T Consensus 54 ~~~~----~~~----~--~~----~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~-~-~~~~-----~~~-~ 111 (395) T protein:vir:10 54 KVLE----GNR----I--QK----NDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVS-D-SKEL-----LIA-D 111 (395) T ss_pred Eecc----CCc----c--cc----chHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEe-c-CCCe-----Eec-C Confidence 6532 111 1 22 223444333334557778899999999999998876542 2 2210 111 1 Q ss_pred ceeeeHHHhccCCCceeEEecCCCC-cccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHh Q lcl|NC_021303. 161 WYAVTREEIKSKAGETAEISLPDGK-THEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRV 239 (637) Q Consensus 161 W~~vt~~Ei~~k~g~~~~i~lPdG~-~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL 239 (637) ++.++...+ .......+...++. .++|. ..+ ||++=..++.-...-.||+.++...+. +.. ++-+.=. T Consensus 112 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-~~e-vih~~~~~~~~~~~G~spi~~~~~~~~----~~~---~~~~~~~ 180 (395) T protein:vir:10 112 SFYREEYAL--YDDIFKDVTVKDYTYQRTFT-MQE-VIYLKYNNNKVTHFVESLFEDYGKIFG----RMI---GAQLKNY 180 (395) T ss_pred CccceeEee--cCcceeEEEEcCceeeeeec-ccc-EEEEccCCCCcccccchHHHHHHHHHH----HHH---HHHHhcC Confidence 222221111 11111122222222 22332 233 444422333334455677666544432 211 1111111 Q ss_pred hcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcc Q lcl|NC_021303. 240 MNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEK 319 (637) Q Consensus 240 ~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ 319 (637) .-.|+|.+|+.. ....+.+.+...+-+-- .+.. .++ +.++..++. -+ T Consensus 181 ~~~gii~~~~~~------------------------~~~e~~~~~~~~~~~~~-~~~~-~~~-----~~v~~l~~g--~~ 227 (395) T protein:vir:10 181 QIRGILKSASSA------------------------YDEKNIEKLQAFTNKLF-NTFN-KNQ-----LAIAPLIEG--FD 227 (395) T ss_pred CCceEEEeCCCC------------------------CCHHHHHHHHHHHHHHh-cccc-ccC-----cceEEcCCC--ce Confidence 122455554421 11224444555443321 1111 111 122223322 34 Q ss_pred cceeecCc---chhHH-HHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHH Q lcl|NC_021303. 320 VQHIKFGN---EVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILT 395 (637) Q Consensus 320 ikHlkf~~---dvtev-aiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr 395 (637) ++-|.+.. +.+.. -+++|+..+..+|.-.-|||..| |-..+| +-+....=++-.|.|.+..|+++|++.+|. T Consensus 228 ~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l-~~~~sn---~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~ 303 (395) T protein:vir:10 228 YEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLI-YGETAD---LEKNTLVFEKFCLTPLLKKIQNELNAKLIT 303 (395) T ss_pred eeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh-cCcccC---HHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 55555433 33333 38899999999999999998865 422222 334444455667999999999999998876 Q ss_pred HHHHHhCCChHHeEEeecCcccccCCCC---CHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCC Q lcl|NC_021303. 396 PLLAREGIDPTKYILWYDASGLTSDPDL---SDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKN 472 (637) Q Consensus 396 ~~L~~eGiDp~kYvvw~DaS~Lt~dPD~---tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~ 472 (637) +... .+| +.||.+.|. .+|. .+....++..|.+|-.-.|..+|++.-.+-. ..+-+ +.++ T Consensus 304 ~~~~------~~~-~~f~~~~l~-~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~--~d~~~-------~~~n 366 (395) T protein:vir:10 304 QSMY------LKD-TRIEIVGVN-KKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPE--LDEYL-------ITKN 366 (395) T ss_pred hhhh------ccc-ceecchhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cceee-------eccc Confidence 5332 223 368887773 3443 3333447889999999999999998654310 00000 0000 Q ss_pred chhHHHHHhhhccccccccCCCCcCCCCCCCCC-CCCCCCCCCCCCccCCCC Q lcl|NC_021303. 473 PELIAMYAPLLSSQLAGIEFPQPANAIESTREE-DDEDSGARQQREPQTEDE 523 (637) Q Consensus 473 P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~-~d~~~~a~~g~EPdted~ 523 (637) +.| ++ .++.. ....+.+..|+|.+...+ T Consensus 367 ------~~~--------~~---------~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 367 ------YEK--------AN---------SGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred ------ccc--------cc---------ccccccCcccccccCCCCCCCCCC Confidence 011 11 11111 111112222222222212 No 85 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=98.34 E-value=5.6e-07 Score=54.87 Aligned_cols=386 Identities=11% Similarity=0.104 Sum_probs=177.3 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.==+ |+-+|.+... ..++ + ..+..-..+.|=...-++-.+.-+++.+|++.+ T Consensus 1 Mg~f~-~lf~~~~~~~-------------~~~~----~---------~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~ 53 (395) T protein:vir:95 1 MSILE-KIFKTRKDIT-------------YMLD----L---------DMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHF 53 (395) T ss_pred Cchhh-hhhccCcccc-------------cccc----c---------hhccccchhhhhhhHHHHHHHHHHHHhhcccee Confidence 32210 2222211100 0000 0 000000112222345566678889999999987 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+ .+. + ++ +.+..+.+.=-..-+-..++++.++.+|-.-|+.++++. + +++. .+. . T Consensus 54 ~~~~----~~~----~--~~----~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~-~-~~~~-----~~~-~ 111 (395) T protein:vir:95 54 KVLE----GNR----I--QK----NDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVS-D-SKEL-----LIA-D 111 (395) T ss_pred Eecc----CCc----c--cc----chHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEe-c-CCCe-----Eec-C Confidence 6532 111 1 22 223444333334557778899999999999998876542 2 2210 111 1 Q ss_pred ceeeeHHHhccCCCceeEEecCCCC-cccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHh Q lcl|NC_021303. 161 WYAVTREEIKSKAGETAEISLPDGK-THEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRV 239 (637) Q Consensus 161 W~~vt~~Ei~~k~g~~~~i~lPdG~-~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL 239 (637) ++.++...+ .......+...++. .++|. ..+ ||++=..++.-...-.||+.++...+. +.. ++-+.=. T Consensus 112 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-~~e-vih~~~~~~~~~~~G~spi~~~~~~~~----~~~---~~~~~~~ 180 (395) T protein:vir:95 112 SFYREEYAL--YDDIFKDVTVKDYTYQRTFT-MQE-VIYLKYNNNKVTHFVESLFEDYGKIFG----RMI---GAQLKNY 180 (395) T ss_pred CccceeEee--cCcceeEEEEcCceeeeeec-ccc-EEEEccCCCCcccccchHHHHHHHHHH----HHH---HHHHhcC Confidence 222221111 11111122222222 22332 233 444422333334455677666544432 211 1111111 Q ss_pred hcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcc Q lcl|NC_021303. 240 MNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEK 319 (637) Q Consensus 240 ~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ 319 (637) .-.|+|.+|+.. ....+.+.+...+-+-- .+.. .++ +.++..++. -+ T Consensus 181 ~~~gii~~~~~~------------------------~~~e~~~~~~~~~~~~~-~~~~-~~~-----~~v~~l~~g--~~ 227 (395) T protein:vir:95 181 QIRGILKSASSA------------------------YDEKNIEKLQAFTNKLF-NTFN-KNQ-----LAIAPLIEG--FD 227 (395) T ss_pred CCceEEEeCCCC------------------------CCHHHHHHHHHHHHHHh-cccc-ccC-----cceEEcCCC--ce Confidence 122455554421 11224444555443321 1111 111 122223322 34 Q ss_pred cceeecCc---chhHH-HHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHH Q lcl|NC_021303. 320 VQHIKFGN---EVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILT 395 (637) Q Consensus 320 ikHlkf~~---dvtev-aiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr 395 (637) ++-|.+.. +.+.. -+++|+..+..+|.-.-|||..| |-..+| +-+....=++-.|.|.+..|+++|++.+|. T Consensus 228 ~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l-~~~~sn---~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~ 303 (395) T protein:vir:95 228 YEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLI-YGETAD---LEKNTLVFEKFCLTPLLKKIQNELNAKLIT 303 (395) T ss_pred eeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh-cCcccC---HHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 55555433 33333 38899999999999999998865 422222 334444455667999999999999998876 Q ss_pred HHHHHhCCChHHeEEeecCcccccCCCC---CHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCC Q lcl|NC_021303. 396 PLLAREGIDPTKYILWYDASGLTSDPDL---SDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKN 472 (637) Q Consensus 396 ~~L~~eGiDp~kYvvw~DaS~Lt~dPD~---tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~ 472 (637) +... .+| +.||.+.|. .+|. .+....++..|.+|-.-.|..+|++.-.+-. ..+-+ +.++ T Consensus 304 ~~~~------~~~-~~f~~~~l~-~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~--~d~~~-------~~~n 366 (395) T protein:vir:95 304 QSMY------LKD-TRIEIVGVN-KKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPE--LDEYL-------ITKN 366 (395) T ss_pred hhhh------ccc-ceecchhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cceee-------eccc Confidence 5332 223 368887773 3443 3333447889999999999999998654310 00000 0000 Q ss_pred chhHHHHHhhhccccccccCCCCcCCCCCCCCC-CCCCCCCCCCCCccCCCC Q lcl|NC_021303. 473 PELIAMYAPLLSSQLAGIEFPQPANAIESTREE-DDEDSGARQQREPQTEDE 523 (637) Q Consensus 473 P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~-~d~~~~a~~g~EPdted~ 523 (637) +.| ++ .++.. ....+.+..|+|.+...+ T Consensus 367 ------~~~--------~~---------~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:95 367 ------YEK--------AN---------SGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred ------ccc--------cc---------ccccccCcccccccCCCCCCCCCC Confidence 011 11 11111 111112222222222212 No 86 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=98.34 E-value=5.6e-07 Score=54.87 Aligned_cols=386 Identities=11% Similarity=0.104 Sum_probs=177.3 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.==+ |+-+|.+... ..++ + ..+..-..+.|=...-++-.+.-+++.+|++.+ T Consensus 1 Mg~f~-~lf~~~~~~~-------------~~~~----~---------~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~ 53 (395) T protein:vir:10 1 MSILE-KIFKTRKDIT-------------YMLD----L---------DMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHF 53 (395) T ss_pred Cchhh-hhhccCcccc-------------cccc----c---------hhccccchhhhhhhHHHHHHHHHHHHhhcccee Confidence 32210 2222211100 0000 0 000000112222345566678889999999987 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+ .+. + ++ +.+..+.+.=-..-+-..++++.++.+|-.-|+.++++. + +++. .+. . T Consensus 54 ~~~~----~~~----~--~~----~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~-~-~~~~-----~~~-~ 111 (395) T protein:vir:10 54 KVLE----GNR----I--QK----NDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVS-D-SKEL-----LIA-D 111 (395) T ss_pred Eecc----CCc----c--cc----chHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEe-c-CCCe-----Eec-C Confidence 6532 111 1 22 223444333334557778899999999999998876542 2 2210 111 1 Q ss_pred ceeeeHHHhccCCCceeEEecCCCC-cccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHh Q lcl|NC_021303. 161 WYAVTREEIKSKAGETAEISLPDGK-THEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRV 239 (637) Q Consensus 161 W~~vt~~Ei~~k~g~~~~i~lPdG~-~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL 239 (637) ++.++...+ .......+...++. .++|. ..+ ||++=..++.-...-.||+.++...+. +.. ++-+.=. T Consensus 112 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-~~e-vih~~~~~~~~~~~G~spi~~~~~~~~----~~~---~~~~~~~ 180 (395) T protein:vir:10 112 SFYREEYAL--YDDIFKDVTVKDYTYQRTFT-MQE-VIYLKYNNNKVTHFVESLFEDYGKIFG----RMI---GAQLKNY 180 (395) T ss_pred CccceeEee--cCcceeEEEEcCceeeeeec-ccc-EEEEccCCCCcccccchHHHHHHHHHH----HHH---HHHHhcC Confidence 222221111 11111122222222 22332 233 444422333334455677666544432 211 1111111 Q ss_pred hcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcc Q lcl|NC_021303. 240 MNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEK 319 (637) Q Consensus 240 ~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ 319 (637) .-.|+|.+|+.. ....+.+.+...+-+-- .+.. .++ +.++..++. -+ T Consensus 181 ~~~gii~~~~~~------------------------~~~e~~~~~~~~~~~~~-~~~~-~~~-----~~v~~l~~g--~~ 227 (395) T protein:vir:10 181 QIRGILKSASSA------------------------YDEKNIEKLQAFTNKLF-NTFN-KNQ-----LAIAPLIEG--FD 227 (395) T ss_pred CCceEEEeCCCC------------------------CCHHHHHHHHHHHHHHh-cccc-ccC-----cceEEcCCC--ce Confidence 122455554421 11224444555443321 1111 111 122223322 34 Q ss_pred cceeecCc---chhHH-HHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHH Q lcl|NC_021303. 320 VQHIKFGN---EVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILT 395 (637) Q Consensus 320 ikHlkf~~---dvtev-aiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr 395 (637) ++-|.+.. +.+.. -+++|+..+..+|.-.-|||..| |-..+| +-+....=++-.|.|.+..|+++|++.+|. T Consensus 228 ~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l-~~~~sn---~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~ 303 (395) T protein:vir:10 228 YEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLI-YGETAD---LEKNTLVFEKFCLTPLLKKIQNELNAKLIT 303 (395) T ss_pred eeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh-cCcccC---HHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 55555433 33333 38899999999999999998865 422222 334444455667999999999999998876 Q ss_pred HHHHHhCCChHHeEEeecCcccccCCCC---CHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCC Q lcl|NC_021303. 396 PLLAREGIDPTKYILWYDASGLTSDPDL---SDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKN 472 (637) Q Consensus 396 ~~L~~eGiDp~kYvvw~DaS~Lt~dPD~---tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~ 472 (637) +... .+| +.||.+.|. .+|. .+....++..|.+|-.-.|..+|++.-.+-. ..+-+ +.++ T Consensus 304 ~~~~------~~~-~~f~~~~l~-~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~--~d~~~-------~~~n 366 (395) T protein:vir:10 304 QSMY------LKD-TRIEIVGVN-KKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPE--LDEYL-------ITKN 366 (395) T ss_pred hhhh------ccc-ceecchhhh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cceee-------eccc Confidence 5332 223 368887773 3443 3333447889999999999999998654310 00000 0000 Q ss_pred chhHHHHHhhhccccccccCCCCcCCCCCCCCC-CCCCCCCCCCCCccCCCC Q lcl|NC_021303. 473 PELIAMYAPLLSSQLAGIEFPQPANAIESTREE-DDEDSGARQQREPQTEDE 523 (637) Q Consensus 473 P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~-~d~~~~a~~g~EPdted~ 523 (637) +.| ++ .++.. ....+.+..|+|.+...+ T Consensus 367 ------~~~--------~~---------~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 367 ------YEK--------AN---------SGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred ------ccc--------cc---------ccccccCcccccccCCCCCCCCCC Confidence 011 11 11111 111112222222222212 No 87 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=98.29 E-value=7.2e-07 Score=54.25 Aligned_cols=456 Identities=12% Similarity=0.068 Sum_probs=210.2 Q ss_pred CCCC--cceEEecCCCCCcc----cccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcc Q lcl|NC_021303. 1 MAAT--SLRVVRRPKGSAPA----ARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANS 74 (637) Q Consensus 1 ma~~--~lr~vrrpk~~~p~----~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s 74 (637) |--= .++-+ -|+.+..+ ...++..||+.- .-...|. . ..+.++.++.....+....=.|---.+|-++. T Consensus 1 Mn~iDr~i~~~-sP~~a~~R~~ar~~~~~y~aa~~~--r~~~~~~-~-~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~a 75 (548) T protein:vir:95 1 MNLIDRLLEPL-APELVARRLAAREAIQAYEAARPG--RTHKAKR-Q-PLGADTSLQKSAVSMREQCRKLDEDHDLVTGL 75 (548) T ss_pred CchHHhHhhhc-chHHHHHHHHhHHHhccccccCcc--ccccccC-C-CCChHHHHHHHHHHHHHHHHHHHhcChHHHHH Confidence 1100 01111 12221111 111234444321 1112221 1 22344555543333332222232333444444 Q ss_pred eeee-EEEEe----eeccccCCCCCcccCCCCcccchHHHHHHHh---------ccCcccHHHHHHHHHhhhcccccEEE Q lcl|NC_021303. 75 CSRT-TLIPS----AIDPDTGLPTGEVDIEEDPDAQIVADYVKGI---------ADGPLGQAALIKRAVECMTVVGEVWI 140 (637) Q Consensus 75 ~Sr~-rL~as----eiD~DtG~PtG~v~~e~~~~~~rv~~iv~~i---------AgG~lGqaqLlkr~~~~LtVpGE~wi 140 (637) +.+. .-+++ -|.|. |.| .+........+.++.+ +.|.+--.+|.+.+...+-+-||+.+ T Consensus 76 v~~~~~nvVG~~G~~i~p~---~l~----~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~ 148 (548) T protein:vir:95 76 LDRLEERVVGGSGIGVEPL---PLR----LDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLA 148 (548) T ss_pred HHHHHHhccCccccceeee---ecC----CCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEE Confidence 4432 22332 22222 221 1111223444555554 57999999999999999999999999 Q ss_pred EEEeecCCccccccccccccceeeeHHHhcc-CCCceeEEecCCCCcccccC-CCceEEEEecCCcccccCC-------- Q lcl|NC_021303. 141 AVLIRQEKDPVTGLAAPRARWYAVTREEIKS-KAGETAEISLPDGKTHEFNR-DLDSLVRIWNPRPRKASQA-------- 210 (637) Q Consensus 141 ~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~~-k~g~~~~i~lPdG~~he~~~-~~d~l~RvW~P~prra~ea-------- 210 (637) .+..++.++...+...|. .=..|..+-|.+ +++..-.| -+| -|||. +.-+-.+|++.||...... T Consensus 149 ~~~~~~~~~~~~g~~~~~-~lqliepd~l~~~~~~~~~~i--~~G--IE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~r 223 (548) T protein:vir:95 149 QKLMGRVPNYTFATSVPF-ALELLEPDYLPFSYNNLSKGI--VQG--IERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKR 223 (548) T ss_pred EeeecccccccCCcccce-EEEEechhhcCCCCCCCCCce--eee--eEECCCCceEEEEEeecCCCcccccccccceee Confidence 988876653211221211 123445555542 11111111 122 35555 3455566666666542211 Q ss_pred --------------------ccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCce-eeecccCCCCCcccccccccccCCC Q lcl|NC_021303. 211 --------------------TSPVRACLETLREIERTTRKIKNAAKSRVMNNGV-LFVPAEMSLPAAQAPIPAGQAQIPG 269 (637) Q Consensus 211 --------------------DSPvra~l~~LrEI~rttk~I~na~~SRL~gnGv-lfvPqe~slP~~~ap~~a~~~~~pg 269 (637) -|..-++|..|+. +.++..+....-.+.+-+ +||=. ..|......+ + T Consensus 224 vpA~~VlHif~~~r~gQ~RGvs~lapvl~~l~~---l~~y~dael~~aki~A~~a~fi~~--~~~~~~~~~~-------~ 291 (548) T protein:vir:95 224 VEAERIIHIAYRKRIGQNRGVPMLHAVLIRLAD---LKDYEESERVAARISAALAMYIKK--GNPDSYTVEP-------G 291 (548) T ss_pred echhHheecccccCCccccCcchHHHHHHHHHH---HhHHHHHHHHHHHHhhhheeeeec--CCCccccCCC-------C Confidence 1222234444443 444444444444444433 23322 2222221100 0 Q ss_pred cccccCCCchhHHHHHHHHHHHHhhcccCccccccccce--eEe-echHHhcccceeecCc--chhHHHHhhHHHHHHHH Q lcl|NC_021303. 270 APVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPL--VAS-VAAEHLEKVQHIKFGN--EVTEVEIKTRIDAITRL 344 (637) Q Consensus 270 ~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPi--va~-vP~Ehi~~ikHlkf~~--dvtevaiktR~daI~Rl 344 (637) . .+...--.+-|= |-. .|||- |+|-+ .-+.---.+.+-.++-+ T Consensus 292 ~--------------------------~~~~~~~~~~pG~iv~~L~pGe~------i~~~~p~~p~~~~~~f~~~~lr~I 339 (548) T protein:vir:95 292 K--------------------------DRKNRTIPIAPGMVFDDLEPGED------VGMIESNRPNPFLEGFRNGQLRMI 339 (548) T ss_pred c--------------------------ccccccccccCCccccccCCCce------eeecCCCCCCCCHHHHHHHHHHHH Confidence 0 000000011111 111 24442 33322 22334456778889999 Q ss_pred HhhcCCchhHhhccCCcceeeeEEeccCceeEeech----hHHHHHHHHHhHHHHHHHHHhCC------ChHHe--EEee Q lcl|NC_021303. 345 AMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKP----VMDLICQAIYNDILTPLLAREGI------DPTKY--ILWY 412 (637) Q Consensus 345 AmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P----~me~ic~Ait~~~Lr~~L~~eGi------Dp~kY--vvw~ 412 (637) |.||.||-|.|+|=.++|+.|+-+---|..+. +.- +...+|+-|++.||.-++..--| ++..| +-|. T Consensus 340 AaglGipYe~ltgD~s~nYSS~R~~l~e~~r~-~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~ 418 (548) T protein:vir:95 340 GAGTRSTYSSVSRAYDGTYSAQRQELVEGWLG-YDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQ 418 (548) T ss_pred HhhcCCCHHHHhcccchhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeee Confidence 99999999999998667999887765554433 211 35678999999999988876444 34445 7788 Q ss_pred cCcccccCCCCCHHH-HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHh---------cCCchhHHHHHhh Q lcl|NC_021303. 413 DASGLTSDPDLSDEA-VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVV---------TKNPELIAMYAPL 482 (637) Q Consensus 413 DaS~Lt~dPD~tdeA-~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v---------~~~P~Li~~~apL 482 (637) =+.-..+||-|--+| +.+.+.|..|-+..-+..|.+- +|-.+|+|.+.. ..+|..- T Consensus 419 ~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~--------~ev~~q~a~E~~~~~~~GL~~~~~~~~~------ 484 (548) T protein:vir:95 419 GPVMPWINPMHEANAWELLVKAGFADEAEVARARGRDP--------RELKKSRETEIKANRAAGLVFSSDAYHQ------ 484 (548) T ss_pred cCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCH--------HHHHHHHHHHHHHHHHcCCCCCCccccc------ Confidence 888889999886554 6678889999987776666543 344555555542 1222210 Q ss_pred hccccccccCCCCcCCC--------CCCCCCCCCCCCCCCCCC---cc---CCCCCCCcccCCCCcchH Q lcl|NC_021303. 483 LSSQLAGIEFPQPANAI--------ESTREEDDEDSGARQQRE---PQ---TEDERSTEEAASLNDRAA 537 (637) Q Consensus 483 l~~~~~~ie~P~p~~a~--------~~~~~~~d~~~~a~~g~E---Pd---ted~~~~~~~a~~~~~a~ 537 (637) ...+..-|.++++. .+|++.+++=+.-+.|=+ || .+........++-++ + T Consensus 485 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 548 (548) T protein:vir:95 485 ---LVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGPDFPNESNNGGADGQPSNPD--P 548 (548) T ss_pred ---ccccccCCCCchhhhccccccccccchhHHhhccCCCCCcCCCCCCCcccccCCCCCCCCCCC--C Confidence 00111222222211 122332222222222211 22 222222222222222 1 No 88 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=98.25 E-value=9.2e-07 Score=53.68 Aligned_cols=425 Identities=14% Similarity=0.092 Sum_probs=197.0 Q ss_pred CCCCcceEE--ecCCCC-Cc------ccccchheehhccccchhhhhhhh----cccccccchhhH-------HHHHhhh Q lcl|NC_021303. 1 MAATSLRVV--RRPKGS-AP------AARRRSLTAASQLITDPQKQMKTS----LMGTARNEWQSE-------AWDFSES 60 (637) Q Consensus 1 ma~~~lr~v--rrpk~~-~p------~~~r~~ltAAs~~~~~p~~~~k~~----~~g~~r~~WQ~e-------AW~~yd~ 60 (637) |+..-.+-. =|.-+. .+ ....+...||+. ++..+.+ ..++.++.+... |-++|.- T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~-----~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rN 75 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAARR-----DRLGKAWLRRASRLSADEEIYADLASLVQRAREQSIN 75 (505) T ss_pred CCCCccccchhhcccchhhhhhHHHHHHhhhhcccccC-----CCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhc Confidence 543322211 111110 00 000111222211 1111211 112222332222 2233333 Q ss_pred hhhHhhHhh-hhhccee-eeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHh-----------ccCcccHHHHHHH Q lcl|NC_021303. 61 IGELSYYIS-WRANSCS-RTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGI-----------ADGPLGQAALIKR 127 (637) Q Consensus 61 VgELryyvg-Wr~~s~S-r~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~i-----------AgG~lGqaqLlkr 127 (637) -|=.+=++. +..|-|. .=-..-+.++... |+++ + .+++.+... +.|.+--.+|.+. T Consensus 76 n~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~----~~~~-~------~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l 144 (505) T protein:vir:96 76 NPYAKRFYQLLKNNVIGPKGMTFQSRVKRRN----GKPD-D------RANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHL 144 (505) T ss_pred ChHHHHHHHHHHHHhcCCCcceeeecCCccc----cccc-H------HHHHHHHHHHHHhcCCcCcceeccCCHHHHHHH Confidence 332222222 3344442 1111111122111 1121 1 244444333 4588888999999 Q ss_pred HHhhhcccccEEEEEEeecCCccccccccccccceeeeHHHhccCCCceeEEecCCC----CcccccC-CCceEEEEecC Q lcl|NC_021303. 128 AVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIKSKAGETAEISLPDG----KTHEFNR-DLDSLVRIWNP 202 (637) Q Consensus 128 ~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~~k~g~~~~i~lPdG----~~he~~~-~~d~l~RvW~P 202 (637) +...+-+-||+.+.+..++++.-+...+ .|..+-|.+...+. +++| .+-|||. +.-+-.+|++- T Consensus 145 ~~r~~~~dGE~f~~~~~~~~~~~~~~lq-------liepd~l~~~~n~~----~~~~~~i~~GIe~d~~Gr~~aY~i~~~ 213 (505) T protein:vir:96 145 WMETLARDGEVLVREHRGYPNKWGYALQ-------ILECDRLDLNYNAD----LQNGNRIRMSIELDAWERPVAYHLLVN 213 (505) T ss_pred HHHHHhhCCceEEEEeecCCCCcceEEE-------EechhhcCCCCCcc----cCCcCeEEeceEECCCCceEEEEEeec Confidence 9999999999998887765442122222 34444443211000 0111 1224555 34444666666 Q ss_pred CcccccCCc-----cch-------------------h---hhhHHHHHHHhhhHHHHHHHHhHhhcCcee-eecccCCCC Q lcl|NC_021303. 203 RPRKASQAT-----SPV-------------------R---ACLETLREIERTTRKIKNAAKSRVMNNGVL-FVPAEMSLP 254 (637) Q Consensus 203 ~prra~eaD-----SPv-------------------r---a~l~~LrEI~rttk~I~na~~SRL~gnGvl-fvPqe~slP 254 (637) ||....... .++ | -....|.-|..+.++..+....-.+.+=+. ||=+ ..+ T Consensus 214 hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~--~~~ 291 (505) T protein:vir:96 214 HPGDNSYCYHYAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQ--DPE 291 (505) T ss_pred CCCccccccccccccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeec--CCc Confidence 665322110 011 1 012334444555556666665555555443 4422 111 Q ss_pred CcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCcccc-ccccceeEeechHHhcccceeecCcchhHHH Q lcl|NC_021303. 255 AAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQ-AAYIPLVASVAAEHLEKVQHIKFGNEVTEVE 333 (637) Q Consensus 255 ~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~-AA~vPiva~vP~Ehi~~ikHlkf~~dvteva 333 (637) ....+ +++. .+. . . .+. .-+ |.---|||-|+-++-=+= +.-- T Consensus 292 ~~~~~--------~~~~----~~~-~-----------------~-~~l~pG~--i~~L~pGe~i~~~~~~~p----~~~~ 334 (505) T protein:vir:96 292 AYDQP--------PEDD----QGE-I-----------------V-EEVEAGT--YQLLPYGIRFKEHKIDHP----HTNF 334 (505) T ss_pred cCCCc--------cccc----cCc-c-----------------c-cccCCce--eeecCCCCeeeeeCCCCC----CCCH Confidence 11100 0000 000 0 0 011 111 111224443322221111 1223 Q ss_pred HhhHHHHHHHHHhhcCCchhHhhcc-CCcceeeeEEeccCceeEeech----hHHHHHHHHHhHHHHHHHHHhCC----- Q lcl|NC_021303. 334 IKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKP----VMDLICQAIYNDILTPLLAREGI----- 403 (637) Q Consensus 334 iktR~daI~RlAmglDv~pErLLGl-s~~NHWsAW~I~dedVrlHI~P----~me~ic~Ait~~~Lr~~L~~eGi----- 403 (637) ..+.+-.++-+|+||.||-|.|+|= |++|+.|+-+-.-|..+. ++- +...+|+-|++.||.-++..--| T Consensus 335 ~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r~-~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~ 413 (505) T protein:vir:96 335 GAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSGELDERDL-YKLLQFFVVTELLERVAGNLISMSLLTQALPLNMV 413 (505) T ss_pred HHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCc Confidence 4567788888999999999999987 889999988776655443 111 34578999999999988776544 Q ss_pred ChHHe--EEeecCcccccCCCCCHHH-HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHH Q lcl|NC_021303. 404 DPTKY--ILWYDASGLTSDPDLSDEA-VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYA 480 (637) Q Consensus 404 Dp~kY--vvw~DaS~Lt~dPD~tdeA-~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~a 480 (637) +++.| +-|.=+.-..+||-|--+| +.+.+.|..|-+..-+..|.+ + +|-.+|.|.+.-..+ . T Consensus 414 ~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D------~--~~v~~q~a~e~~~~~-----~-- 478 (505) T protein:vir:96 414 DIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDD------P--EDVFDEIAWEEQLMR-----D-- 478 (505) T ss_pred cchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCC------H--HHHHHHHHHHHHHHH-----H-- Confidence 46777 8899999999999987665 567788999998776665544 2 445556665552211 0 Q ss_pred hhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCC Q lcl|NC_021303. 481 PLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQRE 517 (637) Q Consensus 481 pLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~E 517 (637) .++.++.++.....+..++++++ .++| T Consensus 479 -------~Gl~~~~~~~~~~~~~~~~~~~~---~~d~ 505 (505) T protein:vir:96 479 -------KGVNPTPPEQESKDATTDEEDDS---ASDD 505 (505) T ss_pred -------cCCCCCCCCCCCCCCCCCCCCCC---CCCC Confidence 12333322222222211111111 1111 No 89 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=98.24 E-value=1.1e-06 Score=53.35 Aligned_cols=375 Identities=12% Similarity=0.077 Sum_probs=185.0 Q ss_pred ceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEEEEeee Q lcl|NC_021303. 6 LRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTLIPSAI 85 (637) Q Consensus 6 lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~asei 85 (637) .=+.++-++-. .++..+....-..||.+.=. |.+ .=+.-.+.-+++.+|.+.+-.=.- T Consensus 1 Mg~f~~~~~f~--------------------~~~~~~~~~~~~~~~~~~~~-~~~-~~v~~~i~~Ia~~iA~lp~~~~~~ 58 (378) T protein:vir:93 1 MNLFGKVVSFS--------------------RGKLNNDTQRVTAWQNEAVE-YTS-AFVTNIHNKIANEITKVEFNHVKY 58 (378) T ss_pred Cccchhhhhhh--------------------ccccCCCcceeeecccchhH-HHH-HHHHHHHHHHHhhhhhCceeeEEE Confidence 11222221100 00111111112345544311 111 112223577999999998854333 Q ss_pred ccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeee Q lcl|NC_021303. 86 DPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVT 165 (637) Q Consensus 86 D~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt 165 (637) +.+ |.-.... .....+.+..+.+.=-.--+-..++++.++.+|..-|++||.+... ++. ..-|+ T Consensus 59 ~~~-~~~~~~~---~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~-~~~--------g~~~~--- 122 (378) T protein:vir:93 59 KKS-DVGSDTL---ISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFD-DNT--------GELLD--- 122 (378) T ss_pred ccc-ccccccc---cccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEee-cCC--------ceEEE--- Confidence 322 2111111 1122344555544334455777899999999999999999875432 210 11222 Q ss_pred HHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCcee Q lcl|NC_021303. 166 REEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVL 245 (637) Q Consensus 166 ~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvl 245 (637) -.|++...+|.. +=||++-+| -.-...-||...++..+ .++.++-- =+|+| T Consensus 123 --------------l~~~~~~~~~~~--~diih~r~~--~~~~~~~s~l~~~~~~i----------~~~~~~~~-~~g~l 173 (378) T protein:vir:93 123 --------------LLFADDKKEYKT--EELVRLTSP--FYINEDTSILDNALASI----------QTKLEQGK-LRGLL 173 (378) T ss_pred --------------EEecCCeeEecc--ceeEEecCc--cccchhhHHHHHHHHHH----------HHHHhcCc-cccee Confidence 234443444432 336666554 23333455555444332 22222111 13555 Q ss_pred eecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeec Q lcl|NC_021303. 246 FVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKF 325 (637) Q Consensus 246 fvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf 325 (637) =+|..++ ..+.+.+.+-|.+--+......+ ... ++++ ++. .+++.|.+ T Consensus 174 ~~~~~l~-------------------------~~~~~~~~~~~~~~~~~~~~~~~-~~~--~~~l--~~g--~~~~~l~~ 221 (378) T protein:vir:93 174 KINAFLD-------------------------IDNTQEYREKALTTIKNMQEGSS-YNG--LTPV--DNK--TEIVELKK 221 (378) T ss_pred eeCCcCC-------------------------HHHHHHHHHHHHHHHHHhhcccc-ccc--ceEc--CCC--ceEEEccC Confidence 4444221 11233344444443333333222 112 2333 222 45666666 Q ss_pred CcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCCh Q lcl|NC_021303. 326 GNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDP 405 (637) Q Consensus 326 ~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp 405 (637) ...... +..++.....+|.-.-|||..|-|..+. +....-++-.|.|.+..||++|++.+|.+.=...|... T Consensus 222 ~~~~~~--~~~~~~~~~~Ia~~fgVPp~~l~g~~~e------~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~ 293 (378) T protein:vir:93 222 DYSVLN--KDEIDLIKSELLTGYFMNENILLGTATQ------EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGN 293 (378) T ss_pred Chhhhh--HHHHHHHHHHHHHHhCCCHHHhcCCcHH------HHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhc Confidence 554444 4566677889999999999877654321 22334466779999999999999999977655556544 Q ss_pred HHeE-EeecCcccc-cCC-CCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhh Q lcl|NC_021303. 406 TKYI-LWYDASGLT-SDP-DLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPL 482 (637) Q Consensus 406 ~kYv-vw~DaS~Lt-~dP-D~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apL 482 (637) ..++ +.||.+.|. .|+ ++.+-...+++.|++|..-.|+.+|++.-.|=| +.+ +..| ++|+ T Consensus 294 ~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD----~~~-------~~~n------~~~~ 356 (378) T protein:vir:93 294 LYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD----VYI-------ANLN------AVAV 356 (378) T ss_pred ccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----eee-------eccc------cccc Confidence 4433 778988774 343 334445668899999999999999998665422 000 0111 1111 Q ss_pred hccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCC Q lcl|NC_021303. 483 LSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERS 525 (637) Q Consensus 483 l~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~ 525 (637) +-+ ++.++...++.|+.|++-. T Consensus 357 --------~~~-------------~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 357 --------KNL-------------SDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred --------cch-------------hhhcCccCCCCCCCCCCCC Confidence 100 0000111111111111111 No 90 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=98.23 E-value=5.6e-07 Score=54.84 Aligned_cols=375 Identities=13% Similarity=0.112 Sum_probs=175.2 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.==. |++..-+.. ..+++...| .|+.+.=.+ ...-+.=.|.-+++.++.+.| T Consensus 1 M~~f~-k~~~~~~~~--------------~~~~~~~~~----------~~~~~~~~~--~~~~v~~~v~~ia~~iA~lp~ 53 (378) T protein:vir:85 1 MNLFG-KVVSFSRGK--------------LNNDTQRVT----------AWQNEAVEY--TSAFVTNIHNKIANEITKVEF 53 (378) T ss_pred Cchhh-hhhhhhhcc--------------cccCCccee----------eeeccchhh--hhHHHHHHHHHHHHhHhhCce Confidence 32211 111000000 001111111 111111000 001123347788999999998 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..=+.+.+.+.+-...++ ..+.+..+.+.=-.--+...++.+.++.+|-.-|++|+.++.+...+ + T Consensus 54 ~~~~~~~~~~~~~~~~~~----~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g----------~ 119 (378) T protein:vir:85 54 NHVKYKKSDVGSDTLISM----AGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETG----------E 119 (378) T ss_pred eEEEEecccccccccccc----ccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCc----------e Confidence 776666554443222222 22444555443344557888899999999999999998866543221 1 Q ss_pred ceeeeHHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhh Q lcl|NC_021303. 161 WYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVM 240 (637) Q Consensus 161 W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~ 240 (637) ++. +..-+| ..+|... |++ ++=+|- .-....+....++ +.|.++.++--+ T Consensus 120 ~~~---------------~~~~~~-~~~~~~~-dvi-h~~~~~--~~~~~~~~~~~a~----------~~~~~~~~~~~~ 169 (378) T protein:vir:85 120 LLD---------------LLFAND-KKEYKPE-ELV-RLVSPF--YINEDTSILDNAL----------ASIQTKLEQGKL 169 (378) T ss_pred EEE---------------EEecCC-CEEEccc-ceE-EEecCc--CccchhhHHHHHH----------HHHHHHHhcCCc Confidence 111 111111 1122222 222 222221 1122223222222 223333332211 Q ss_pred cCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_021303. 241 NNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKV 320 (637) Q Consensus 241 gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~i 320 (637) +|+|-+|..++- ...+.+.+.+.+.-.....-.++ -=++++. +. .++ T Consensus 170 -~g~l~~~~~l~~-------------------------~~~~~~~~~~~~~~~~~~~~~~~---g~~~vl~--~g--~~~ 216 (378) T protein:vir:85 170 -RGLLKINAFLDI-------------------------DNTQEYREKALATIKNMQEGSSY---NGLTPVD--NK--TEI 216 (378) T ss_pred -ceEEEeCCcCCH-------------------------HHHHHHHHHHHHHHHHhhccccc---ccceecC--CC--ceE Confidence 477666653321 12233444433332222221111 1233332 22 345 Q ss_pred ceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHH Q lcl|NC_021303. 321 QHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAR 400 (637) Q Consensus 321 kHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~ 400 (637) +-|.+..... .+++++.....+|.-+-|||.-|-|-.+...|. .-++..|.|.+..|+++|+..+|.+-=.. T Consensus 217 ~~l~~~~~~~--~~~~~~~~~~~Ia~~fgVPp~~l~~s~~e~~~~------~f~~~tL~P~~~~ie~~l~~kLl~~~er~ 288 (378) T protein:vir:85 217 VELKKDYSVL--NKDEIELIKSELLTGYFMNENILLGTATQEQQI------YFYNSTIIPLLIQLEKELTYKLISTNRRR 288 (378) T ss_pred EeccCChhhh--hHHHHHHHHHHHHHHhCCCHHHhcCCchHHHHH------HHHHHHHHHHHHHHHHHHHhhcCChhhhh Confidence 5555544333 346666666788999999998775543332322 34667899999999999999998776555 Q ss_pred hCCChHHeE-EeecCcccc-cCCC-CCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHH Q lcl|NC_021303. 401 EGIDPTKYI-LWYDASGLT-SDPD-LSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIA 477 (637) Q Consensus 401 eGiDp~kYv-vw~DaS~Lt-~dPD-~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~ 477 (637) .|.....|+ +.||.+.|. .|+- +.+-...++..|.+|-.-.|+.+|++.-.|=| +.+ T Consensus 289 ~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~gGD----~~~---------------- 348 (378) T protein:vir:85 289 VVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD----IYI---------------- 348 (378) T ss_pred hhhhccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----eEe---------------- Confidence 555555444 678887763 4442 33334568889999999999999997544322 000 Q ss_pred HHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_021303. 478 MYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTE 521 (637) Q Consensus 478 ~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdte 521 (637) .|+ .++.++-+... . .++ .+...++|.+-+ T Consensus 349 --~~~---N~~~~~~~~~~--~-~~~------~~~~~~~e~~n~ 378 (378) T protein:vir:85 349 --ANL---NAVAVKNLSDL--Q-GSR------KDVASTDETNNQ 378 (378) T ss_pred --ecc---cccccccchhh--c-Ccc------CCCCCCCCCCCC Confidence 010 01111100000 0 000 000001111111 No 91 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=98.22 E-value=1.4e-06 Score=52.63 Aligned_cols=369 Identities=11% Similarity=0.045 Sum_probs=180.4 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.==+ |+.+|-+.. +...+. ..+..-+ .+-|=...-+.-.+.-++++|+++.+ T Consensus 1 Mg~f~-~l~~~~~~~-------------~~~~~~-~~~~~~~------------~~~~l~~~~v~~~i~~Ia~~ia~~p~ 53 (376) T protein:vir:78 1 MGFFS-ELFKRNKEI-------------EWMWDL-DFLEDKT------------TKVYLKKMALNTCVKHIARTIAKSDF 53 (376) T ss_pred Cchhh-hhhccCCcc-------------ccccch-hhccccc------------hhhhhhhHHHHHHHHHHHHhhcccce Confidence 44221 333321110 000000 0110000 00011122344556778999999998 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+ .+ .. .+ +.+..+.+.=...-+-..++++.++.+|.+-|++|+++.-..+|.. ... T Consensus 54 ~~~~----~~-----~~-~~----~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~-------~~~ 112 (376) T protein:vir:78 54 RLKN----GE-----TS-VR----DKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLI-------ADS 112 (376) T ss_pred eecc----cc-----cc-cc----chHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeee-------ccc Confidence 7642 11 11 11 3345554444555678889999999999999999987654444322 223 Q ss_pred ceeeeHHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhh Q lcl|NC_021303. 161 WYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVM 240 (637) Q Consensus 161 W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~ 240 (637) |..-. ..+... ....+...++......+..|+ |++ + ...+|....+.++.+... ..+.++.++... T Consensus 113 ~~~~~-~~~~~~--~~~~~~~~~~~~~~~~~~~ev-ih~-----~---~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 178 (376) T protein:vir:78 113 YVRKE-FAFFPD--VFEGVTVKDYRYNRNFSMDDV-IFL-----E---YGNERLSAFTDGMFEDYG--ELFGKMIRAQMR 178 (376) T ss_pred eeecc-cceeee--eeeeeeeecceeeeeeccccE-EEe-----c---cCCCCchhhhhHHHHHHH--HHHHHHHHHHHh Confidence 33221 111111 111223323221111122333 332 1 223566666666655443 335555555566 Q ss_pred cCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_021303. 241 NNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKV 320 (637) Q Consensus 241 gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~i 320 (637) +||+-. .+-+... ........+.+.+++.+.-.... ++.-+++| +++. .++ T Consensus 179 ~~~~~~---~~~~~~~-----------------~~~~~e~~~~~~~~~~~~~~g~~---~~~~~v~~----l~~g--~~~ 229 (376) T protein:vir:78 179 NFQIRG---AVNFKMA-----------------GVADKDKQTKLQEYIDKVYASFN---NNEIAIVP----QLEG--FNY 229 (376) T ss_pred cCCCce---eEEEccC-----------------CCCCHHHHHHHHHHHHHHhcccc---ccCcceEE----cCCC--ceE Confidence 665411 0001000 01122344556655544332211 11112222 2332 455 Q ss_pred ceeecCc---chhHH-HHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHH Q lcl|NC_021303. 321 QHIKFGN---EVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTP 396 (637) Q Consensus 321 kHlkf~~---dvtev-aiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~ 396 (637) +-+.+.. +.++. -+++|+..+..+|.-.-|||..| |...+| .-+....-++-.|.|.+..|+++|++.+|-+ T Consensus 230 ~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l-~~~~s~---~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~ 305 (376) T protein:vir:78 230 EEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLL-HGDMAD---LSNNMKAYMEYCIDPLTKKLEDELNAKLFTF 305 (376) T ss_pred EeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCC---HHHHHHHHHHHHHHHHHHHHHHHHHhhhCCc Confidence 5555443 22332 47899999999999999999865 433333 2344445567789999999999999988643 Q ss_pred HHHHhCCChHHeEEeecCcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCc Q lcl|NC_021303. 397 LLAREGIDPTKYILWYDASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNP 473 (637) Q Consensus 397 ~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P 473 (637) .+|.+=||.+.| .+.|..+.+ ..++..|.+|-.-.|+++|++.-.+-..+ T Consensus 306 ---------~~~~~~~~~~~l-l~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~d----------------- 358 (376) T protein:vir:78 306 ---------SEFLAGEHIKII-HKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPELD----------------- 358 (376) T ss_pred ---------ccceecccchhh-cccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCc----------------- Confidence 445555665554 233433333 34778899999999999999864321000 Q ss_pred hhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCC Q lcl|NC_021303. 474 ELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQ 515 (637) Q Consensus 474 ~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g 515 (637) ..-.|.. .. +.++. +++ | T Consensus 359 ---------------~~~~~~n--~~-~~~~~--~e~----g 376 (376) T protein:vir:78 359 ---------------KYLITKN--YQ-SADEG--GED----G 376 (376) T ss_pred ---------------eeeeccC--ce-ehhcc--ccC----C Confidence 0000100 00 11111 111 1 No 92 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=98.19 E-value=2e-07 Score=57.29 Aligned_cols=371 Identities=14% Similarity=0.070 Sum_probs=183.4 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.== +|=...++.... ....+.++.- +..+..|. .-. ...+ -..+-+.-.+.-++++||.+.+ T Consensus 1 Mg~f-----~~~~~~~~~~~~-----~~~~~~~~~~-~~~~~~~~-~v~-~~~~----l~~~~v~~~i~~ia~~ia~~~~ 63 (382) T protein:vir:48 1 MPIF-----NLATESPPDNQG-----GFFDVVDSDF-LASLKGNE-WVS-AETA----LRNSDLFSIINQLSNDLATVKL 63 (382) T ss_pred Cccc-----cccccCCccccc-----ccccchhhhc-cccccCCc-ccc-hHhh----hccHHHHHHHHHHHHhhccCce Confidence 5432 221111111111 0011111110 01111110 000 1112 1234556667789999999988 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) -..+-+.+. +-.. ..--+-..++++.++.+|-+-|+.|+.+.-...|. + .. T Consensus 64 ~~~~~~~~~------L~~~---------------PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~-~-------~~ 114 (382) T protein:vir:48 64 ITSRKKLQG------IVDN---------------PSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGR-D-------MK 114 (382) T ss_pred eeecchhhh------hhhh---------------cCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCc-E-------EE Confidence 766443221 1111 11225667899999999999999999876443342 1 14 Q ss_pred ceeeeHHHhcc---CCCceeE--EecCC---CCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_021303. 161 WYAVTREEIKS---KAGETAE--ISLPD---GKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (637) Q Consensus 161 W~~vt~~Ei~~---k~g~~~~--i~lPd---G~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 232 (637) ++.|..+.+.. ..++... +...+ |....|.. .| ||++=.+++.-...-.||+.++...+.-.....+... T Consensus 115 l~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~-~e-vih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~ 192 (382) T protein:vir:48 115 WEYLRPSQVSFNRLDNKDGIYYNITFDDPRIPPKQHVPQ-ND-VLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTI 192 (382) T ss_pred EEEEcCceeEEEEcCCCCeEEEEEEecCccccceeEEcC-cc-EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHH Confidence 44444444431 2223222 22222 22333433 23 4555456665556778999999988877777777777 Q ss_pred HHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_021303. 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (637) Q Consensus 233 na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~v 312 (637) +..+.-..-.|||-+|+.++-. ....+.+.+.+ ..+..+ =++|+ T Consensus 193 ~~~~ng~~p~~il~~~~~~~~e-------------------------~~~~~~~~~~~----~~~n~g-----~~~vl-- 236 (382) T protein:vir:48 193 NSLKNALNANGILKIKGGGLLD-------------------------FKTKLSRSRQA----MKQMQG-----GPLVL-- 236 (382) T ss_pred HHHhccCCCceEEEeCCCCChH-------------------------HHHHHHHHHHh----hccCCC-----CeeEc-- Confidence 7777777778898887643210 12223333322 122222 23333 Q ss_pred chHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhH Q lcl|NC_021303. 313 AAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYND 392 (637) Q Consensus 313 P~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~ 392 (637) ++. -+++.|.+.... .--+++|+..+..+|..+-|||..| |.+..|.-+. +-...-++.-|.|.+..|+++|+.. T Consensus 237 ~~g--~~~~~l~~~~~d-~q~~e~~~~~~~~Ia~afgVp~~~l-g~~~~~~~~~-~~~~~~~~~~l~p~~~~i~~~l~~~ 311 (382) T protein:vir:48 237 DDL--EDFTPLEIKSNV-SQLLKQADWTTGQFAKVYGIPDNVV-GGQGDQQSSL-EMSSDLYSKAVSRYLRPFLSELSQK 311 (382) T ss_pred CCC--ceEEEccCChhH-HHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222 245555544332 2237899999999999999998655 6644333121 2233456677899999999999998 Q ss_pred HHHHHHHHhCC--ChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhc Q lcl|NC_021303. 393 ILTPLLAREGI--DPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVT 470 (637) Q Consensus 393 ~Lr~~L~~eGi--Dp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~ 470 (637) ++...-..... |++.|-++++.. .++..|.+|-...|..++ ..||-+. |. +.. T Consensus 312 l~~~~~~~~~~~~~~~~~~~~~~~~-------------~l~~~g~~t~~e~r~~l~---~~g~~~~--~~-~~~------ 366 (382) T protein:vir:48 312 LSCDVDADIFPAVDPTGSNYISRIN-------------SLVKTGTLAQNQGLYILQ---QAEILPK--EL-PNG------ 366 (382) T ss_pred hcChhhhhhhhhhccchhHHHHHHH-------------HHhhcCccCHHHHHHHHh---hCCCCCc--ch-hhh------ Confidence 76543211111 334444444332 345556666666665543 1122111 10 000 Q ss_pred CCchhHHHHHhhhcccccc-ccCCCC Q lcl|NC_021303. 471 KNPELIAMYAPLLSSQLAG-IEFPQP 495 (637) Q Consensus 471 ~~P~Li~~~apLl~~~~~~-ie~P~p 495 (637) .++ . +.+++ -+=.+- T Consensus 367 ~~~---------~-~~~~GGd~~~~~ 382 (382) T protein:vir:48 367 ENP---------N-STLKGGEEDGQD 382 (382) T ss_pred hcC---------C-CCCCCCCCCCCC Confidence 000 0 01111 110000 No 93 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=98.19 E-value=1.2e-06 Score=53.14 Aligned_cols=461 Identities=13% Similarity=0.078 Sum_probs=193.9 Q ss_pred CCCCcceEEecCCCCCcccccch-heehhccccchhhhhhhhccc--ccccchhh-------HHHHHhhhhhhHhhHhh- Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRS-LTAASQLITDPQKQMKTSLMG--TARNEWQS-------EAWDFSESIGELSYYIS- 69 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~-ltAAs~~~~~p~~~~k~~~~g--~~r~~WQ~-------eAW~~yd~VgELryyvg- 69 (637) |=-. +++ .|.+.++..+.+. .-+|+. . ++.++++... +.++.+.. +|.+++.--|=.+=+++ T Consensus 1 ~~~~--~~~-~~~~~~~~~~~~~~~~~a~~-~---~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~ 73 (530) T protein:vir:38 1 MKIP--SLV-GPDGKTSLREYAGYHGGGGG-F---GGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQL 73 (530) T ss_pred Cccc--eee-cCccccchHHHhhhhcccCC-C---CCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 2111 000 0111111111111 111111 1 1223332221 22222222 23333333222111111 Q ss_pred hhhcceee-eEEEEeeeccccCCCCCcccCCCCc-ccchHHHHHHHh---------ccCcccHHHHHHHHHhhhcccccE Q lcl|NC_021303. 70 WRANSCSR-TTLIPSAIDPDTGLPTGEVDIEEDP-DAQIVADYVKGI---------ADGPLGQAALIKRAVECMTVVGEV 138 (637) Q Consensus 70 Wr~~s~Sr-~rL~aseiD~DtG~PtG~v~~e~~~-~~~rv~~iv~~i---------AgG~lGqaqLlkr~~~~LtVpGE~ 138 (637) |..|-|.- .++.+ ..|-. .-+++.|.+. -..++.+.-+.= +.|.+-=.+|.+.+...+-+-||+ T Consensus 74 ~~~nvVG~Gi~~~~-~p~~~----~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~ 148 (530) T protein:vir:38 74 HQDHIVGSFFRLSY-RPSWR----YLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGEL 148 (530) T ss_pred HHHHhhCCCceeee-ccchh----hcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCce Confidence 22222221 11111 11100 0011111111 122333333221 348888889999999999999999 Q ss_pred EEEEEeecCCccc--cccccccccceeeeHHH---------hc-cCCCceeE--Eec--CCCCc-ccc------c-CCCc Q lcl|NC_021303. 139 WIAVLIRQEKDPV--TGLAAPRARWYAVTREE---------IK-SKAGETAE--ISL--PDGKT-HEF------N-RDLD 194 (637) Q Consensus 139 wi~il~r~~~~~~--~~~~~~~~~W~~vt~~E---------i~-~k~g~~~~--i~l--PdG~~-he~------~-~~~d 194 (637) .+.+..+++++.+ ...+.=+.+++.-.... |+ .+.|..+. |.. |.|.. ..+ . -..+ T Consensus 149 ~~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~ 228 (530) T protein:vir:38 149 CVQATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRP 228 (530) T ss_pred EEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChh Confidence 9998877665321 12222222222211110 11 11122221 110 11111 010 0 1123 Q ss_pred eEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCcee-eecccCCCCCcccccccccccCCCcccc Q lcl|NC_021303. 195 SLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVL-FVPAEMSLPAAQAPIPAGQAQIPGAPVP 273 (637) Q Consensus 195 ~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvl-fvPqe~slP~~~ap~~a~~~~~pg~~~~ 273 (637) -|+|+++|.----.---|..-++|..|+. +.++..+....-.+.+-+. ||=+...-.... ...+. T Consensus 229 ~vlH~f~~~r~gQ~RGis~lapvl~~l~~---l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~--~~~~~--------- 294 (530) T protein:vir:38 229 SFIHVFEPMEDGQTRGANAFYSVMEQMKM---LDTLQNTQLQSAIVKAMYAATIESELDTQSAM--DFILG--------- 294 (530) T ss_pred HeEeeccccCCCcccCCchHHHHHHHHHH---HhHHHHHHHHHHHHhhhheeeeeccCCccccc--ccccc--------- Confidence 56777766421111122344444444444 4444555555555554443 443322111110 00000 Q ss_pred cCCCchhHHHHHHHHHHHHhhcccCccccccccc--eeEeechHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCc Q lcl|NC_021303. 274 EVSGVPASEQLATMIYQASVAAMEDENSQAAYIP--LVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVS 351 (637) Q Consensus 274 ~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vP--iva~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~ 351 (637) ..+......+...... +... .+...-.+-| |+.--|||-|+-++-=+ -+.-...+.+..++.+|+||.|| T Consensus 295 -~~~~~~~~~~~~~~~~--~~~~-~~~~~~~l~pG~i~~L~pGe~i~~~~p~~----p~~~~~~f~~~~lr~iaaglGi~ 366 (530) T protein:vir:38 295 -ADNKEQQSKLTGWLGE--MAAY-YSAAPVRLGGARVPHLLPGDSLNLQSAQD----TDNGYSTFEQSLLRYIAAGLGVS 366 (530) T ss_pred -CCcccccccccccchh--hhhc-ccccceeccCceeeecCCCCeeeeeCCCC----CCCCHHHHHHHHHHHHHhhcCCC Confidence 0111111111111100 0000 0111011112 11223344222222111 12233456788999999999999 Q ss_pred hhHhhc-cCCcceeeeEEeccCceeEeech----hHHHHHHHHHhHHHHHHHHHhCCC------------hHHe--EEee Q lcl|NC_021303. 352 PERLLG-MSKGNHWSAWAIGDEDVQLHIKP----VMDLICQAIYNDILTPLLAREGID------------PTKY--ILWY 412 (637) Q Consensus 352 pErLLG-ls~~NHWsAW~I~dedVrlHI~P----~me~ic~Ait~~~Lr~~L~~eGiD------------p~kY--vvw~ 412 (637) -|.|+| +|++|+.|+-+---|..+. ++- ++..+|+-|++.||.-++.+--|+ .+.| +.|+ T Consensus 367 ye~lt~D~s~~nYSS~R~~~~e~~r~-~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~ 445 (530) T protein:vir:38 367 YEQLSRNYSQMSYSTARASANESWAY-FMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWI 445 (530) T ss_pred HHHHhcccccccHHHHHHHHHHHHHH-HHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeee Confidence 999999 5899999987766655443 111 344568889999999887765453 2345 7899 Q ss_pred cCcccccCCCCCHH-HHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhcccccccc Q lcl|NC_021303. 413 DASGLTSDPDLSDE-AVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIE 491 (637) Q Consensus 413 DaS~Lt~dPD~tde-A~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie 491 (637) =.+-..+||-|--+ ++...+.|..|-+......|.+- +|-.+|.|.+.-.. .. .++. T Consensus 446 ~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~--------~~v~~q~a~e~~~~-----~~---------~Gl~ 503 (530) T protein:vir:38 446 GSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGDDY--------QEIFAQQVRESMER-----RA---------AGLN 503 (530) T ss_pred cCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCH--------HHHHHHHHHHHHHH-----HH---------cCCC Confidence 99999999998554 46778899999987776665443 35556666554211 11 0333 Q ss_pred CCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCC Q lcl|NC_021303. 492 FPQPANAIESTREEDDEDSGARQQREPQTEDERS 525 (637) Q Consensus 492 ~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~ 525 (637) +|..+.+.+......++ ++|...+..+ T Consensus 504 ~~~~~~~~~~~~~~~~~-------~~~~d~~~~a 530 (530) T protein:vir:38 504 PPAWAAAAFEAGVKKSN-------EEEQDGARAA 530 (530) T ss_pred CCCCcccccCCCCCCCC-------CCCCCCCCCC Confidence 33322222111111110 1111111100 No 94 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=98.09 E-value=4.2e-06 Score=50.07 Aligned_cols=375 Identities=12% Similarity=0.088 Sum_probs=176.3 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |- |..|-++.. +.... .+... -..|+.+.= .|.+ .-+.=.|.-+++.+|.+.+ T Consensus 1 M~-----if~~~~~~~----~~~~~------~~~~~----------~~~~~~~~~-~~~~-~~v~~~v~~Ia~~iA~lp~ 53 (378) T protein:vir:94 1 MN-----LFGKVVSFS----RGKLN------NDTQR----------VTAWQNEAV-EYTS-AFVTNIHNKIANEITKVEF 53 (378) T ss_pred Cc-----hhHHhHhhh----hcccc------cCcce----------eeeeecchh-hhhh-HHHHHHHHHHHHhHhhCce Confidence 21 222222110 00000 00000 011111110 0111 2244456678999999987 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) -.=+.+...|.+-...+. ..+.+..+.+.=-.--+-..++.+.++.+|-.-|+.||..+.+...+ .- T Consensus 54 ~~~~~~~~~~~~~~~~~~----~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g---------~~ 120 (378) T protein:vir:94 54 NHVKYKKSDVGSDTLISM----AGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETG---------EL 120 (378) T ss_pred eeeeeccccccccccccc----ccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCC---------cE Confidence 544555443433211111 22334444443344557788999999999999999999866543321 01 Q ss_pred ceeeeHHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhh Q lcl|NC_021303. 161 WYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVM 240 (637) Q Consensus 161 W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~ 240 (637) |+.+. .. +| .+|. .+-++++=+| -......|+...++..+ .++.++= . T Consensus 121 ~~~~~------~~---------~~--~~~~--~~dvih~~~~--~~~~~~~~~~~~~~~~~----------~~~~~~~-~ 168 (378) T protein:vir:94 121 LDLLF------AN---------DK--KEYK--PEELVRLTSP--FYINEDTSILDNALASI----------QTKLEQG-K 168 (378) T ss_pred EEEEE------ec---------Cc--EEec--hhceeeecCc--CCcccchhHHHHHHHHH----------HHHHhhC-C Confidence 22111 01 11 1221 1234444222 22333444444443322 2222211 2 Q ss_pred cCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhccc Q lcl|NC_021303. 241 NNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKV 320 (637) Q Consensus 241 gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~i 320 (637) -+|+|-.|..++-. +.+.+.+-+.+.-+......++ .. ++++ ++. .++ T Consensus 169 ~~g~l~~~~~l~~~-------------------------~~~~~~e~~~~~~~~~~~~~n~-~~--~~vl--~~g--~~~ 216 (378) T protein:vir:94 169 LRGLLKINAFLDID-------------------------NTQEYREKALATIKNMQEGSSY-NG--LTPV--DNK--TEI 216 (378) T ss_pred cccceeeCCcCCHH-------------------------HHHHHHHHHHHHHHHhhccccc-cc--ceec--cCC--ceE Confidence 35777666533211 1122322222222222222221 11 3333 222 234 Q ss_pred ceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHH Q lcl|NC_021303. 321 QHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAR 400 (637) Q Consensus 321 kHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~ 400 (637) + .+.....+..+..++....++|.-+-|||..|-|-.+.+. ...-++-.|.|.+..|+++|+..+|.+-=.. T Consensus 217 ~--~l~~~~~~~~~~~~~~~~~~Ia~~fgvPp~~l~g~~~e~~------~~~f~~~tl~P~~~~ie~~l~~~Ll~~~e~~ 288 (378) T protein:vir:94 217 V--ELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGTATQEQ------QIYFYNSTIIPLLIQLEKELTYKLISTNRRR 288 (378) T ss_pred E--EccCChHHhhHHHHHHHHHHHHHHhCCCHHHhcCCchHHH------HHHHHHHHHHHHHHHHHHHHHhhcCChhHhh Confidence 4 3444455555677777888999999999988866543222 2234456799999999999999988765444 Q ss_pred hCCChHHe-EEeecCcccc-cCCC-CCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHH Q lcl|NC_021303. 401 EGIDPTKY-ILWYDASGLT-SDPD-LSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIA 477 (637) Q Consensus 401 eGiDp~kY-vvw~DaS~Lt-~dPD-~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~ 477 (637) .|.....| -+.||.+.|. .|+. +.+-...++++|.+|-.-.|+.+|++.-.|-| +. T Consensus 289 ~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~ggd----~~----------------- 347 (378) T protein:vir:94 289 VVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD----VY----------------- 347 (378) T ss_pred hhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----ee----------------- Confidence 45433333 3778877773 3433 33444558999999999999999997655433 00 Q ss_pred HHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCC Q lcl|NC_021303. 478 MYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTE 521 (637) Q Consensus 478 ~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdte 521 (637) +.| ...+.++-+... +...++...++|.+-+ T Consensus 348 -~~~---~n~~~~~~~~~~---------~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 348 -IAN---LNAVAVKNLSDL---------QGNRKDVTSTDETNNQ 378 (378) T ss_pred -eec---ccccchhcchhc---------ccccCCCCCCCCCCCC Confidence 111 011111111000 0000000111111111 No 95 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=98.04 E-value=2e-06 Score=51.83 Aligned_cols=384 Identities=12% Similarity=0.068 Sum_probs=192.0 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.-.-+...|+.+.++ .++ ..........++. +.....|. +++- =-.+..-..+-+.=.|.-+++.+|++.+ T Consensus 1 m~m~~~~~~~~~~~~~-~~~--~~~~~~~~~~~~~--~~~~~~~~--~g~~-v~~~~al~~~~v~~~v~~ia~~ia~lp~ 72 (392) T protein:vir:74 1 MILPILNFINQTNDPP-EAG--SVQSYFPDGNDAQ--IMESLLGD--NNEW-VSARAALRNSDLFSIILQLSSDLAIVKI 72 (392) T ss_pred CcchhhhhhhcccCcc-ccc--ccccccccCchhh--hhhhccCC--CCcc-cchhhhhcchHHHHHHHHHHHhhccCce Confidence 5555555555544332 111 1112111122211 12221111 1110 0011111234455567778888888776 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+=+.+ + + .++ ..--+...++++.++.+|-+-|+.|+.+.-...|. + . . T Consensus 73 ~~~~~~~~-~-----l-~~~--------------PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~-~------~-~ 123 (392) T protein:vir:74 73 NAEKKKNQ-G-----I-IDN--------------PSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGA-D------M-K 123 (392) T ss_pred eeccchhh-h-----h-hhh--------------cCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCc-E------E-E Confidence 54322211 0 1 000 11125677899999999999999998876443332 1 1 3 Q ss_pred ceeeeHHHhc---cCCCceeE--EecCCCCc---ccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_021303. 161 WYAVTREEIK---SKAGETAE--ISLPDGKT---HEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (637) Q Consensus 161 W~~vt~~Ei~---~k~g~~~~--i~lPdG~~---he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 232 (637) .+.|..+.+. .+.++... +...+|.. .+|. ..+ ||++=.+.+.-...--||+.++.+.+.-...+.+... T Consensus 124 L~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~-~~e-vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~ 201 (392) T protein:vir:74 124 WEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAP-QSD-LIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTI 201 (392) T ss_pred EEEEcCceeEEEEcCCCceEEEEEEecCCccceeEEEc-Ccc-EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHH Confidence 3333333332 12223222 22222221 1222 223 4444344444444567999998888877777777777 Q ss_pred HHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_021303. 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (637) Q Consensus 233 na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~v 312 (637) +.-+.-..-.|||-+|+....+. ..++.+.+.-+. +.-+-=|+|+ T Consensus 202 ~~f~ng~~p~~il~~~~~~~~~~---------------------------~~~~~~~~~~~~------~~n~g~~~vl-- 246 (392) T protein:vir:74 202 SSLNSSLNVPGVLTVKGGGLLSD---------------------------KDKASRSRSFMK------RSRSGGPVVL-- 246 (392) T ss_pred HHHhccCCCceEEEeCCCCCchH---------------------------HHHHHHHHHHhc------cccCCCeeec-- Confidence 77777777788888887432221 112222221111 1112234444 Q ss_pred chHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhH Q lcl|NC_021303. 313 AAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYND 392 (637) Q Consensus 313 P~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~ 392 (637) ++. -+++.|.+..+... -+++|+-.+..+|..+-|||..| |..+.+. +..+-...-++-.|.|.+..|+++|+.. T Consensus 247 ~~g--~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~-~~~e~~~~~~~~~l~p~~~~ie~~l~~~ 321 (392) T protein:vir:74 247 DDL--EEFTALEIKSNVAQ-LLSQTDWTSKQYAKVYGLPDSYI-GGQGDQQ-SSIQQISGMYASALNRYLRPAISELEYK 321 (392) T ss_pred CCC--ceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHh-CCCCCcc-cHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 332 46666666544333 38999999999999999998655 6643333 2112123346677999999999999998 Q ss_pred HHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCC Q lcl|NC_021303. 393 ILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKN 472 (637) Q Consensus 393 ~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~ 472 (637) ++.. .++|... +++.+..+ +.+....++..|.+|-.-.|+.++ ..||.+ ..+|.+ T Consensus 322 l~~~----~~~~~~~---~~~~d~~~----~~~~~~~l~~~g~~t~near~~~~---~~g~~p---ne~r~~-------- 376 (392) T protein:vir:74 322 LSDH----ISVNMRP---AIDPLGDN----YLSTISTATRWGALAENQATFVLQ---EAGYIP---KDLPAP-------- 376 (392) T ss_pred ccch----hcccchh---hhcCCHHH----HHHHHHHHHhCCCcCHHHHHHHHH---hCCCCc---cccchh-------- Confidence 7653 2233221 11211111 234556677889999887777652 234432 112211 Q ss_pred chhHHHHHhhhccccccccCCCCcC Q lcl|NC_021303. 473 PELIAMYAPLLSSQLAGIEFPQPAN 497 (637) Q Consensus 473 P~Li~~~apLl~~~~~~ie~P~p~~ 497 (637) -.+.|+-.+ +--+|+| T Consensus 377 ----enl~~~~~G-----d~~~p~p 392 (392) T protein:vir:74 377 ----ENTNKKTTG-----QSNEPVP 392 (392) T ss_pred ----cCCCCCCCC-----CCCCCCC Confidence 011222211 1133333 No 96 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=98.04 E-value=3.1e-06 Score=50.79 Aligned_cols=385 Identities=12% Similarity=0.057 Sum_probs=189.8 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.-.=+...+|.+.++..+...... ....+ ..+.....|..... =+ .+-.-..+-+.-.|.-+++.+|.+.+ T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~---~~~~~--~~~~~~~~~~~~~~-v~--~~~al~~~~v~~~i~~ia~~ia~lp~ 72 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYF---PDGND--AQIMESLLGDNNEW-VS--ARAALRNSDLFSIILQLSSDLAIVKI 72 (392) T ss_pred Ccchhhhhhhccccccccccccccc---ccCch--hhhhhhhcCCCCce-ec--hHHhhccHHHHHHHHHHHHhhccCce Confidence 7777666666655544322111110 01111 11111111110000 01 11112345666778888999998887 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+=+.+ .+- + =..--+...++++.++.+|-+-|+.|+.+.-...|. + . . T Consensus 73 ~~~~~~~~------~l~-~--------------~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~-~------~-~ 123 (392) T protein:vir:39 73 NAEKKKNQ------GII-D--------------NPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGA-D------M-K 123 (392) T ss_pred eeccchhh------hHh-h--------------cCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCc-E------E-E Confidence 55432211 010 1 012236778899999999999999998876443332 1 2 2 Q ss_pred ceeeeHHHhc---cCCCceeEE--ecCCCC--cccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_021303. 161 WYAVTREEIK---SKAGETAEI--SLPDGK--THEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKN 233 (637) Q Consensus 161 W~~vt~~Ei~---~k~g~~~~i--~lPdG~--~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 233 (637) .+.|..+.+. .+.++.... ...++. +....+..| ||++=.+.+.-...--||+.++...+.=...+.+...+ T Consensus 124 L~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~e-iih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 202 (392) T protein:vir:39 124 WEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQSD-LIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTIS 202 (392) T ss_pred EEEEcCceeEEEEcCCCceEEEEEEecCcccceeEEEcccc-EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHH Confidence 2333333322 122232222 222221 121122334 44443444544456678988888877666666666666 Q ss_pred HHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_021303. 234 AAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVA 313 (637) Q Consensus 234 a~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP 313 (637) ..+.-..-.|||-+|+..... +..++.+.+.-..+ ..+-=++|+ | T Consensus 203 ~f~ng~~p~gil~~~~~~~~~---------------------------~~~~~~~~~~~~~~------~~~g~~~vl--~ 247 (392) T protein:vir:39 203 SLNSSLNVPGVLTVKGGGLLS---------------------------DKDKASRSRSFMKR------SRSGGPVVL--D 247 (392) T ss_pred HHhccCCCceEEEeCCCCCch---------------------------HHHHHHHHHHHhcc------ccCCCeeec--C Confidence 666666667888777632111 11122222211111 112223333 3 Q ss_pred hHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHH Q lcl|NC_021303. 314 AEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDI 393 (637) Q Consensus 314 ~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~ 393 (637) +. -+++.|........ -+++|+..+..+|..+-|||..| |.++.+. +..+-...-++-.|.|.+..|+++|+..+ T Consensus 248 ~g--~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVpp~~l-g~~~~~~-~~~~~~~~f~~~~l~P~~~~ie~~l~~~L 322 (392) T protein:vir:39 248 DL--EEFTALEIKSNVAQ-LLSQTDWTSKQYAKVYGLPDSYI-GGQGDQQ-SSIQQISGMYASALNRYLRPAISELEYKL 322 (392) T ss_pred CC--ceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHh-CCCCCcc-cHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 32 35555555433222 37999999999999999998776 5533333 22222233567789999999999999987 Q ss_pred HHHHHHHhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCc Q lcl|NC_021303. 394 LTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNP 473 (637) Q Consensus 394 Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P 473 (637) +.. -++|-.. + ++.+..+ +.+....++..|.+|-...|+.+. ..||.+. | +|. T Consensus 323 ~~~----~~~d~~~-~--~~~d~~~----~~~~~~~l~~~g~~t~nE~r~~l~---~~g~~p~--e-~r~---------- 375 (392) T protein:vir:39 323 SDH----ISVNMRP-A--IDPLGDN----YLSTISTATRWGALAENQATFVLQ---EAGYIPK--D-LPA---------- 375 (392) T ss_pred ccc----ccccchh-h--hccCHHH----HHHHHHHHHhCCCcCHHHHHHHHH---hcCCCcc--c-cch---------- Confidence 643 2333221 1 1111111 123455677889999877776652 2333211 0 110 Q ss_pred hhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_021303. 474 ELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQT 520 (637) Q Consensus 474 ~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdt 520 (637) .+ .+| ..+.|+ ++||-+ T Consensus 376 -------------~e--~l~----~~~~Gd-----------~~~p~p 392 (392) T protein:vir:39 376 -------------PE--NTN----KKTTGQ-----------SNEPVP 392 (392) T ss_pred -------------hc--CCC----CCCCCC-----------CCCCCC Confidence 00 011 111221 122222 No 97 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=98.04 E-value=3.1e-06 Score=50.79 Aligned_cols=385 Identities=12% Similarity=0.057 Sum_probs=189.8 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.-.=+...+|.+.++..+...... ....+ ..+.....|..... =+ .+-.-..+-+.-.|.-+++.+|.+.+ T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~---~~~~~--~~~~~~~~~~~~~~-v~--~~~al~~~~v~~~i~~ia~~ia~lp~ 72 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYF---PDGND--AQIMESLLGDNNEW-VS--ARAALRNSDLFSIILQLSSDLAIVKI 72 (392) T ss_pred Ccchhhhhhhccccccccccccccc---ccCch--hhhhhhhcCCCCce-ec--hHHhhccHHHHHHHHHHHHhhccCce Confidence 7777666666655544322111110 01111 11111111110000 01 11112345666778888999998887 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+=+.+ .+- + =..--+...++++.++.+|-+-|+.|+.+.-...|. + . . T Consensus 73 ~~~~~~~~------~l~-~--------------~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~-~------~-~ 123 (392) T protein:vir:10 73 NAEKKKNQ------GII-D--------------NPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGA-D------M-K 123 (392) T ss_pred eeccchhh------hHh-h--------------cCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCc-E------E-E Confidence 55432211 010 1 012236778899999999999999998876443332 1 2 2 Q ss_pred ceeeeHHHhc---cCCCceeEE--ecCCCC--cccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_021303. 161 WYAVTREEIK---SKAGETAEI--SLPDGK--THEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKN 233 (637) Q Consensus 161 W~~vt~~Ei~---~k~g~~~~i--~lPdG~--~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 233 (637) .+.|..+.+. .+.++.... ...++. +....+..| ||++=.+.+.-...--||+.++...+.=...+.+...+ T Consensus 124 L~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~e-iih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 202 (392) T protein:vir:10 124 WEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQSD-LIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTIS 202 (392) T ss_pred EEEEcCceeEEEEcCCCceEEEEEEecCcccceeEEEcccc-EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHH Confidence 2333333322 122232222 222221 121122334 44443444544456678988888877666666666666 Q ss_pred HHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_021303. 234 AAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVA 313 (637) Q Consensus 234 a~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP 313 (637) ..+.-..-.|||-+|+..... +..++.+.+.-..+ ..+-=++|+ | T Consensus 203 ~f~ng~~p~gil~~~~~~~~~---------------------------~~~~~~~~~~~~~~------~~~g~~~vl--~ 247 (392) T protein:vir:10 203 SLNSSLNVPGVLTVKGGGLLS---------------------------DKDKASRSRSFMKR------SRSGGPVVL--D 247 (392) T ss_pred HHhccCCCceEEEeCCCCCch---------------------------HHHHHHHHHHHhcc------ccCCCeeec--C Confidence 666666667888777632111 11122222211111 112223333 3 Q ss_pred hHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHH Q lcl|NC_021303. 314 AEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDI 393 (637) Q Consensus 314 ~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~ 393 (637) +. -+++.|........ -+++|+..+..+|..+-|||..| |.++.+. +..+-...-++-.|.|.+..|+++|+..+ T Consensus 248 ~g--~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVpp~~l-g~~~~~~-~~~~~~~~f~~~~l~P~~~~ie~~l~~~L 322 (392) T protein:vir:10 248 DL--EEFTALEIKSNVAQ-LLSQTDWTSKQYAKVYGLPDSYI-GGQGDQQ-SSIQQISGMYASALNRYLRPAISELEYKL 322 (392) T ss_pred CC--ceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHh-CCCCCcc-cHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 32 35555555433222 37999999999999999998776 5533333 22222233567789999999999999987 Q ss_pred HHHHHHHhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCc Q lcl|NC_021303. 394 LTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNP 473 (637) Q Consensus 394 Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P 473 (637) +.. -++|-.. + ++.+..+ +.+....++..|.+|-...|+.+. ..||.+. | +|. T Consensus 323 ~~~----~~~d~~~-~--~~~d~~~----~~~~~~~l~~~g~~t~nE~r~~l~---~~g~~p~--e-~r~---------- 375 (392) T protein:vir:10 323 SDH----ISVNMRP-A--IDPLGDN----YLSTISTATRWGALAENQATFVLQ---EAGYIPK--D-LPA---------- 375 (392) T ss_pred ccc----ccccchh-h--hccCHHH----HHHHHHHHHhCCCcCHHHHHHHHH---hcCCCcc--c-cch---------- Confidence 643 2333221 1 1111111 123455677889999877776652 2333211 0 110 Q ss_pred hhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccC Q lcl|NC_021303. 474 ELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQT 520 (637) Q Consensus 474 ~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdt 520 (637) .+ .+| ..+.|+ ++||-+ T Consensus 376 -------------~e--~l~----~~~~Gd-----------~~~p~p 392 (392) T protein:vir:10 376 -------------PE--NTN----KKTTGQ-----------SNEPVP 392 (392) T ss_pred -------------hc--CCC----CCCCCC-----------CCCCCC Confidence 00 011 111221 122222 No 98 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=98.03 E-value=6.5e-06 Score=49.02 Aligned_cols=439 Identities=13% Similarity=0.071 Sum_probs=196.7 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchh-------hHHHHHhhhhhhHhhHhh-hhh Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQ-------SEAWDFSESIGELSYYIS-WRA 72 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ-------~eAW~~yd~VgELryyvg-Wr~ 72 (637) ++| -+-.||.+.- ...++.-||+..-. ..+... .++.++... .+|-++|.--|=.+=++. |.. T Consensus 11 ~sP--~~~~~R~~ar---~~~~~y~aa~~~r~---~~~~~~-~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~ 81 (502) T protein:vir:79 11 FSP--GWKAARLRSR---AVIQAYEAVKTTRT---HKARRE-NRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLEE 81 (502) T ss_pred cCh--HHHHHHHhhH---HHHhhccccCcccc---cCCCCC-CCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH Confidence 221 0111221111 11123334322111 111111 112222222 233344444443332332 455 Q ss_pred cceee--eEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHh-----ccCcccHHHHHHHHHhhhcccccEEEEEEee Q lcl|NC_021303. 73 NSCSR--TTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGI-----ADGPLGQAALIKRAVECMTVVGEVWIAVLIR 145 (637) Q Consensus 73 ~s~Sr--~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~i-----AgG~lGqaqLlkr~~~~LtVpGE~wi~il~r 145 (637) |.|.- .+|-+ ..+-+ .++.. + .-+.++.+.-+.- +.|.+--.+|.+.+...+-+-||+.+.+... T Consensus 82 nvVG~ggi~~~~-~~~~~----~~~~~-~--~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~ 153 (502) T protein:vir:79 82 RVVGKNGIIVEP-HPVLR----NGAIA-R--DLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSG 153 (502) T ss_pred hhccCCceeeee-ccCCC----ChhHH-H--HHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeec Confidence 55531 22221 11111 11111 1 1112222222222 4589999999999999999999999999876 Q ss_pred cCCccccccccccccceeeeHHHhcc--CCCceeE--EecC-CC---------------CcccccC-CCceEEEEecCCc Q lcl|NC_021303. 146 QEKDPVTGLAAPRARWYAVTREEIKS--KAGETAE--ISLP-DG---------------KTHEFNR-DLDSLVRIWNPRP 204 (637) Q Consensus 146 ~~~~~~~~~~~~~~~W~~vt~~Ei~~--k~g~~~~--i~lP-dG---------------~~he~~~-~~d~l~RvW~P~p 204 (637) +.+..-++...|. .=..|..+-|.+ ++++.+. |+.- .| ...++.. .-.-|+|+++|.- T Consensus 154 ~~~~~~~g~~~~l-~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~~~r 232 (502) T protein:vir:79 154 RINSLTPSAGVHF-WLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRR 232 (502) T ss_pred ccCccCCCcccce-EEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhheEEeecccC Confidence 6543212222221 112333333321 1111110 1110 11 1111000 1123566665532 Q ss_pred ccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCce-eeecccCCCCCcccccccccccCCCcccccCCCchhHHH Q lcl|NC_021303. 205 RKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGV-LFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQ 283 (637) Q Consensus 205 rra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGv-lfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~ 283 (637) ---.---|..-++|..|+.|- ++..+....-.+.+-+ +||=.+ .|....+...+. + .+. T Consensus 233 ~gQ~RGis~lapvl~~l~~l~---~~~dael~~a~i~A~~~~fi~~~--~~~~~~~~~~~~------~-----~~~---- 292 (502) T protein:vir:79 233 LHQMRGTSLLSGVLIRLSALK---EYEDSELTAARIAAALGMYIRKG--DGQSYEPDGNGS------K-----ENE---- 292 (502) T ss_pred CccccCCchHHHHHHHHHHHh---HHHHHHHHHHHHhhhheeeeecC--CCcccccccCCC------C-----Ccc---- Confidence 211222344455555555544 4444444444444433 233221 122221111000 0 000 Q ss_pred HHHHHHHHHhhcccCccccccccc--eeEe-echHHhcccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccCC Q lcl|NC_021303. 284 LATMIYQASVAAMEDENSQAAYIP--LVAS-VAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSK 360 (637) Q Consensus 284 L~~ml~~va~aai~De~S~AA~vP--iva~-vP~Ehi~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls~ 360 (637) +. -.+-| ||-. -|||-|+- +.-. .-+.-...+-+-.++-+|+|+.||-|.|+|=.+ T Consensus 293 --------------~~---~~l~pG~i~~~L~pGe~i~~---~~p~-~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s 351 (502) T protein:vir:79 293 --------------RE---LTIQPGIIYDDLKPGEEIGM---VKSD-RPNPNLETFRNGQLRAVAAGSRLSFSSTARNYN 351 (502) T ss_pred --------------cc---ccccCCccccccCCCceeee---eCCC-CCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc Confidence 00 01112 1111 23442222 1111 122233456667788899999999999999866 Q ss_pred cceeeeEEeccCceeEeec---hhHHHHHHHHHhHHHHHHHHHhCC------ChHHe--EEeecCcccccCCCCCHHH-H Q lcl|NC_021303. 361 GNHWSAWAIGDEDVQLHIK---PVMDLICQAIYNDILTPLLAREGI------DPTKY--ILWYDASGLTSDPDLSDEA-V 428 (637) Q Consensus 361 ~NHWsAW~I~dedVrlHI~---P~me~ic~Ait~~~Lr~~L~~eGi------Dp~kY--vvw~DaS~Lt~dPD~tdeA-~ 428 (637) .|+.|+-+---|..+..-. =+...+|+-|++.||.-++..-.| ++..| +-|.=++-.-+||-|--+| + T Consensus 352 ~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~ 431 (502) T protein:vir:79 352 GTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWK 431 (502) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHH Confidence 7999987766555433100 134578999999999988876544 34566 6788899999999986665 6 Q ss_pred HHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCC-CCCCCCC Q lcl|NC_021303. 429 EAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIE-STREEDD 507 (637) Q Consensus 429 ~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~-~~~~~~d 507 (637) .+.+.|..|-+...+..|. |+ ++-.+|.|.+.-..+ . .++.|+.-+...+ .++.+.+ T Consensus 432 ~~i~~Gl~t~~~~~a~~G~------D~--~~v~~q~a~e~~~~~-----~---------~Gl~~~~~~~~~~~~~~~~~~ 489 (502) T protein:vir:79 432 IQIRGGAATESDWVRAGGR------NP--DDVKRRRKAEIDENR-----K---------LDLVFDTDPASDKGGSSAATK 489 (502) T ss_pred HHHHcCCCCHHHHHHHcCC------CH--HHHHHHHHHHHHHHH-----H---------cCCCCCCCCCCCCCCCCCCCC Confidence 7788899999877655554 43 355566666553211 0 1455554332222 2222222 Q ss_pred CCCCCCCCCCccCCC Q lcl|NC_021303. 508 EDSGARQQREPQTED 522 (637) Q Consensus 508 ~~~~a~~g~EPdted 522 (637) +++.++++.+ .|+ T Consensus 490 ~~e~~~~~~~--~e~ 502 (502) T protein:vir:79 490 RQEPQHTDDQ--SEE 502 (502) T ss_pred CCCCCCCCCC--CCC Confidence 2222221111 111 No 99 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=97.95 E-value=2.6e-07 Score=56.70 Aligned_cols=337 Identities=13% Similarity=0.091 Sum_probs=170.2 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhccccc----ccchhhHHHHHhhhhhhHhhHhhhhhccee Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTA----RNEWQSEAWDFSESIGELSYYISWRANSCS 76 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~----r~~WQ~eAW~~yd~VgELryyvgWr~~s~S 76 (637) |.- -|+++...+++.+.+.++.+++.+ ...++.-++|.- ...| ++|-+ |+-|.-+|..--+| T Consensus 1 m~~-----~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~p~~v~~~~~------~~~y~-~~~~~~~~~~pp~~ 66 (350) T protein:vir:11 1 MSK-----RRSHRRQQPVTVQSAQEGEFIPRQ--GGRAEAFTFGDPMPVLDGRG------ILDYL-ECWPNGRWYEPPLS 66 (350) T ss_pred CCc-----cccCCCcCccccCCcchhhhcccc--ccceEEEEeCCceeecCcch------hhHHH-HHhhcCccccCCCC Confidence 321 022222333333333333322222 222223334421 1112 11111 22222234444343 Q ss_pred e---eEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCc-ccHHHHHHHHHhhhcccccEEEEEEeecCCcccc Q lcl|NC_021303. 77 R---TTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGP-LGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVT 152 (637) Q Consensus 77 r---~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~-lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~ 152 (637) + +||+ +..+.-+.+ +.. +...+....--++ +-..++ ++++.++-+=|.+|+.+.-...|++ T Consensus 67 ~~~la~~~--~~~~~h~~~---l~~-------k~n~l~~~~~Pn~~~t~~~f-~~~v~d~ll~Gnay~~~~rn~~G~~-- 131 (350) T protein:vir:11 67 MEGLAKSV--GSSVYLQSG---LKF-------KRNMLAKTFIPHRLLSRATF-EQFSLDWLTFGSAYLEQPRSRLGTR-- 131 (350) T ss_pred HHHHHHHH--hhhhhhccc---hhh-------hhhhhhhcccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEEcCCCCE-- Confidence 3 1221 111221111 111 1111222222233 444444 6678888888999998875444432 Q ss_pred ccccccccceeeeHHHhc-cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_021303. 153 GLAAPRARWYAVTREEIK-SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKI 231 (637) Q Consensus 153 ~~~~~~~~W~~vt~~Ei~-~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 231 (637) .+-|+. ...-++ .+.++......++|..++|..+ -||++=+|+|.....--||..+++.++.--.-.++.- T Consensus 132 -----~~L~~l-~~~~vr~~~~~~~~~~~~~~~~~~~~~~~--eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~ 203 (350) T protein:vir:11 132 -----MPLQAP-LAKYMRRGTDLETFYQVRSWKDEHEFEKG--SVIQLREADINQEIYGVPEWFCALQSALLNESATLFR 203 (350) T ss_pred -----EEEEEe-CCceeEeeecCCeEEEEeeCCeEEEECcc--cEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHH Confidence 222222 222333 3444544555668888888653 3566656777776777889988888776544444443 Q ss_pred HHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_021303. 232 KNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVAS 311 (637) Q Consensus 232 ~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~ 311 (637) ++.-+.-..-.|||.+|.. . .+....+.|.+.|-+ . .+....=.+++. T Consensus 204 ~~~f~NGa~~~gil~~~~~------~------------------ls~e~~~~l~~~~~~-~-------~G~~N~~~~~v~ 251 (350) T protein:vir:11 204 RKYYNNGSHAGFILYMTDA------A------------------QNEEDIDALRTALKT-A-------KGPGNFRNLFVY 251 (350) T ss_pred HHHHhccCCCceEEEecCC------C------------------CCHHHHHHHHHHHHH-h-------cCccccCceeee Confidence 3333333334466666531 0 112245556665533 1 112222344556 Q ss_pred echHHhcccceeecCcchhHH-HHhhHHHHHHHHHhhcCCchhHhhccCC---cceeeeEEeccCceeEeechhHHHHHH Q lcl|NC_021303. 312 VAAEHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSK---GNHWSAWAIGDEDVQLHIKPVMDLICQ 387 (637) Q Consensus 312 vP~Ehi~~ikHlkf~~dvtev-aiktR~daI~RlAmglDv~pErLLGls~---~NHWsAW~I~dedVrlHI~P~me~ic~ 387 (637) .|+..=+.+|-..++..-.+. -+++|+-....+|...-|||. |+|+.+ +++-++.+....=++--|.|.+..|++ T Consensus 252 ~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~-llGi~~~~t~~~sn~e~~~~~f~~~~L~P~~~~ie~ 330 (350) T protein:vir:11 252 APNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQ-LMGVVPQNAGGFGSISDAAAVWASLELAPMQTRLQQ 330 (350) T ss_pred cCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCcCCHHHHHHHHHHHHHHHHHHHHHH Confidence 665333556666665444443 488999999999999999987 888843 455666676666777889999999986 Q ss_pred HHHhHHHHHHHHHhCCChHHeEEeecCccc Q lcl|NC_021303. 388 AIYNDILTPLLAREGIDPTKYILWYDASGL 417 (637) Q Consensus 388 Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~L 417 (637) +++ +| ..+-+ +| .-|+.+.| T Consensus 331 -ln~-~l----~~~~~---~F-~~~~~~~l 350 (350) T protein:vir:11 331 -VNE-MI----GEEVV---RF-AQFDAPGL 350 (350) T ss_pred -HHh-hc----Ccccc---cc-CcccccCC Confidence 443 33 11212 22 24677888 No 100 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=97.91 E-value=1.1e-05 Score=47.71 Aligned_cols=351 Identities=13% Similarity=0.139 Sum_probs=174.6 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.-=. + .+| |+. +..+....+-.+ +|+...+.---+-..+ ..+-+.-.|.=+++++|.+++ T Consensus 1 M~~~~-~-f~~-r~~--------------~~~~~~~~~~~~-~~~~~~~~~v~~~~al-~~~av~~cv~~ia~~ia~~p~ 61 (359) T protein:vir:10 1 MSILN-P-FER-RSS--------------ITPNNYYPFMVQ-NGSIVPNSLVDATEAL-KNSDLYAVTSLISSDIAGTRF 61 (359) T ss_pred Ccccc-h-hhc-ccc--------------CCCCcchhhhhc-cccccCCcccCHHHhh-cchHHHHHHHHHHHhhhcCcc Confidence 43221 0 111 110 001000000000 0100000000000111 122233345556777777766 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) . +++ ....+... -.--+-..++++.++.+|-.-|+.|+.++ |.+++. +. . T Consensus 62 ~------------------~~~---~~~~L~~~-PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~~g~------~~-~ 111 (359) T protein:vir:10 62 I------------------GNQ---VFTSVLNN-PSHLTNAFSFWQTAILNLLLNGNVFLAIL-KGDNSL------MK-E 111 (359) T ss_pred c------------------cch---HHHHHhhc-ccccCCHHHHHHHHHHhccccCceEEEEE-ECCCCe------EE-E Confidence 3 111 11111111 11235667889999999999999998764 434321 12 2 Q ss_pred ceeeeHHHhc--cCCCceeE-Eec-CCCCcccccCCCceE-EEE--ecCCcccccCCccchhhhhHHHHHHHhhhHHHHH Q lcl|NC_021303. 161 WYAVTREEIK--SKAGETAE-ISL-PDGKTHEFNRDLDSL-VRI--WNPRPRKASQATSPVRACLETLREIERTTRKIKN 233 (637) Q Consensus 161 W~~vt~~Ei~--~k~g~~~~-i~l-PdG~~he~~~~~d~l-~Rv--W~P~prra~eaDSPvra~l~~LrEI~rttk~I~n 233 (637) ++.|..+.+. ...++-.+ +.. .+|..++|... |++ ||. +++++-.-..--||+.++...+.-..-..+...+ T Consensus 112 l~~l~~~~v~i~~~~~~~~y~~~~~~~~~~~~~~~~-evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~ 190 (359) T protein:vir:10 112 LRLIPSNAITIDLTDDTLTYEVNQFDDYPSAKYNAS-EMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLS 190 (359) T ss_pred EEEeCCceEEEEEcCCeEEEEEEecCCceEEEEccc-ceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHH Confidence 3333333222 12222221 222 23455555442 332 333 3344445556778888777766665555555555 Q ss_pred HHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeec Q lcl|NC_021303. 234 AAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVA 313 (637) Q Consensus 234 a~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP 313 (637) ..+.=..-.|||-+|+.. ....+.+.|.+.+-. ... -.+ +--|+|+ + T Consensus 191 ~f~ng~~~~gil~~~~~~------------------------l~~e~~~~~~~~~~~-~~~---~~n---~g~~~vl--~ 237 (359) T protein:vir:10 191 TLKGALNPTSVVKVPQGT------------------------LSSEAKDSIRKEFEK-ANG---GNN---SGRVMVL--D 237 (359) T ss_pred HHhccCCcceEEEeCCCC------------------------CCHHHHHHHHHHHHH-HhC---ccc---cCCceec--C Confidence 555544456777776521 011234556665522 221 111 1123333 2 Q ss_pred hHHhcccceeecCcchhHH-HHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhH Q lcl|NC_021303. 314 AEHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYND 392 (637) Q Consensus 314 ~Ehi~~ikHlkf~~dvtev-aiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~ 392 (637) +. -+++-|.+. ..+. -+++|+..+..+|.-.-|||..|-|.++.|.|. =++. +-..-++.|.+.-|++.|+.. T Consensus 238 ~g--~~~~~l~~~--~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~-~~~e-~~~~~~l~~~l~p~~~~l~~~ 311 (359) T protein:vir:10 238 QS--ADFSTVSIN--ADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSL-DQIK-DLYVNALNRFIEPLISELRIK 311 (359) T ss_pred CC--cceeeecCC--HHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccH-HHHH-HHHHHHHHHHHHHHHHHHHHH Confidence 22 344555443 3343 478999999999999999999764445544332 1222 222334777778888888877 Q ss_pred HHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCHHHHHHHhcCcccc Q lcl|NC_021303. 393 ILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDS 451 (637) Q Consensus 393 ~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~ 451 (637) +.+.. ++|.. |.+.||.+.+. .....+++.|.+|-.-.|+.+|+.-== T Consensus 312 l~~~~----~~~~~-~~~~~d~~~~~------~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 312 CDSSI----GVDMS-PITDYSNSVFK------ADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred hhhhh----cccch-hhhhcCHHHHH------HHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 76653 35544 44667766554 235668899999999999999886211 No 101 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=97.87 E-value=4.2e-07 Score=55.53 Aligned_cols=327 Identities=9% Similarity=0.026 Sum_probs=159.5 Q ss_pred EecCCCCCccc----ccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceee---eEEE Q lcl|NC_021303. 9 VRRPKGSAPAA----RRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSR---TTLI 81 (637) Q Consensus 9 vrrpk~~~p~~----~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr---~rL~ 81 (637) .++-+..+-.. ......+=+-+.+.|. . .-.-.+-|+...+++|+- -+++ ++|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~----~--~~~y~~~~~~~~~~~~ep-------------p~~~~~la~l~ 61 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLNEISAS----P--ALDYVGIGFDENYNCYLP-------------PVNRHALAKLP 61 (345) T ss_pred CCCCccccchhhcccCcceeEEeecCCcccc----c--chhhhhhhhcCCccccCC-------------CCCHHHHHHHh Confidence 23222222100 0000011111111111 0 001112344444444431 1111 1111 Q ss_pred EeeeccccCCCCCcccCCCCcccchHHHHHHHhccCc-ccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 82 PSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGP-LGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 82 aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~-lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) +..+.-+.+ +..+ ..-+...+-.++ |...++ ++++.++-+=|.+|+.+.-...|++ .+- T Consensus 62 --~~~~~h~~~---i~~k-------~n~l~~~~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~~~rn~~G~~-------~~L 121 (345) T protein:vir:37 62 --HQNAQHGGI---LHSR-------ANMVSSLYEGGKALSRMDM-RALCLNLIQFGDVGLLKVRNGFGQV-------VRL 121 (345) T ss_pred --hcccccccc---eeee-------chHHHhhccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEEcCCCcE-------EEE Confidence 000000000 1111 111111111222 444444 6778888899999988664444431 112 Q ss_pred ceeeeHHHhc-cCCCcee-EE----ecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHH Q lcl|NC_021303. 161 WYAVTREEIK-SKAGETA-EI----SLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNA 234 (637) Q Consensus 161 W~~vt~~Ei~-~k~g~~~-~i----~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na 234 (637) |+.. ...+. .+.++.. .+ ...+|..++|..+ -||++=+|+|..-..--||..+++.++.-=...++.-++. T Consensus 122 ~pl~-~~~vr~~~d~~~~~~~~~~~~~~~g~~~~~~~~--dVihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~ 198 (345) T protein:vir:37 122 VPLS-SLYLRVRKDGGYSYLMKKSLYDTAQEIYRYDAK--DIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRY 198 (345) T ss_pred EEEc-CceeEEEEeCCeeEEEEEeEecCCceEEEEccc--cEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHH Confidence 2221 12221 1222222 22 1235666777653 3566667777777777889888887654322222222222 Q ss_pred HHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeech Q lcl|NC_021303. 235 AKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAA 314 (637) Q Consensus 235 ~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~ 314 (637) -+.=..-.|||.+|. .. ......+.|.+-| ++-......=.+++..|+ T Consensus 199 f~NG~~p~~Il~~~d------~~------------------l~~e~~~~lk~~~--------~~~~g~~n~~~~~i~~p~ 246 (345) T protein:vir:37 199 FSNGAHMGFILYSTD------PD------------------LTEEMEEEIARKI--------SESKGVGNFRSMFVNIAN 246 (345) T ss_pred HhccCCcceEEEecC------CC------------------CCHHHHHHHHHHH--------HHhcCcccccceEEEcCC Confidence 222222335666653 00 0112344444433 222223344457777776 Q ss_pred HHhcccceeecCcch-hHHHHhhHHHHHHHHHhhcCCchhHhhccCCcce---eeeEEeccCceeEeechhHHHHHHHHH Q lcl|NC_021303. 315 EHLEKVQHIKFGNEV-TEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNH---WSAWAIGDEDVQLHIKPVMDLICQAIY 390 (637) Q Consensus 315 Ehi~~ikHlkf~~dv-tevaiktR~daI~RlAmglDv~pErLLGls~~NH---WsAW~I~dedVrlHI~P~me~ic~Ait 390 (637) ..=+.+|-..++..- +.--+++|+-.+.++|...-|||. |+|+...|. -++.+....=++--|.|.+..|+++|+ T Consensus 247 g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~-llGi~~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln 325 (345) T protein:vir:37 247 GHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAG-LSGIIPTNTGGLGDPLKYREVYHYDEVMPLQEIIAETIN 325 (345) T ss_pred CcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhCccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 544666666665432 333478888999999999999987 779854443 445666666778889999999999998 Q ss_pred hHHHHHHHHHhCCChHHeEEeecCccccc Q lcl|NC_021303. 391 NDILTPLLAREGIDPTKYILWYDASGLTS 419 (637) Q Consensus 391 ~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~ 419 (637) .- ++ -+..|++.||...|.. T Consensus 326 ~~-----~~----~~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 326 QD-----PE----IKNLLKIKFREQNFAK 345 (345) T ss_pred hh-----cc----CCCcceEEecchhhcC Confidence 52 22 2467899999999988 No 102 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=97.78 E-value=1.5e-06 Score=52.52 Aligned_cols=327 Identities=14% Similarity=0.149 Sum_probs=161.9 Q ss_pred EEecCCCCCcccccchheehhccccchhhhhhhhcccc----cccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEEEEe Q lcl|NC_021303. 8 VVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGT----ARNEWQSEAWDFSESIGELSYYISWRANSCSRTTLIPS 83 (637) Q Consensus 8 ~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~----~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL~as 83 (637) ..||-|.. . +.+++.-+........-++|. ....|..+..+++.- | +|| .--+++.-| |- T Consensus 1 m~~~~~~~-~-------~~~~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~-~--~~~----~pp~~~~~l-a~ 64 (344) T protein:vir:60 1 MSKKKGKT-L-------QPAAKKMTASAPKMEAFTFGEPVPVLDRRDILDYVECISN-G--RWY----EPPISFTGL-AK 64 (344) T ss_pred CCcccCCC-C-------CchHHhhcCCcCcEEEEEcCCceeecCCcchhHHHHhhhc-C--ccc----cCCCCHHHH-HH Confidence 22332211 0 011111111111222233442 122233333333311 3 222 111111100 00 Q ss_pred --eeccccCCCCCcccCCCCcccchHHHHHHHhccCc-ccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 84 --AIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGP-LGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 84 --eiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~-lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) +..+.-+-+ |.. +...|...+--++ |-..++ ++++.++.+=|..|+.+.-...|++ .+. T Consensus 65 ~~~a~~~h~~~---i~~-------k~n~l~~~~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~-~~L------ 126 (344) T protein:vir:60 65 SLRAAVHHSSP---IYV-------KRNILASTFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKV-IRL------ 126 (344) T ss_pred HHHhhhhhccc---hhh-------hhhHHHhhccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcE-EEE------ Confidence 000000000 110 1111222212222 444555 7899999999999987654444432 111 Q ss_pred ceeeeHHHhc-cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHh Q lcl|NC_021303. 161 WYAVTREEIK-SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRV 239 (637) Q Consensus 161 W~~vt~~Ei~-~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL 239 (637) |+ |...-++ .+.++.......+|..++|.. +-||++=+|+|..-..-=||..+++.++. +... ...-+.|. T Consensus 127 ~~-l~~~~vr~~~~~~~~~~v~~~~~~~~~~~--~eIiHir~~~~~~~~yGlsp~~~a~~si~----l~~~-a~~~~~~~ 198 (344) T protein:vir:60 127 ET-SPAKYTRRGVEEDVYWWVPSFNEPTAFAP--GSVFHLLEPDINQELYGLPEYLSALNSAW----LNES-ATLFRRKY 198 (344) T ss_pred EE-cCcceEEEeecCCeEEEEccCCeEEEEcC--ccEEEEcCCCCCCCcccccHHHHHHHHHH----HHHH-HHHHHHHH Confidence 21 2222332 234444444555788888876 34677778888877778888888877654 2222 12234455 Q ss_pred hcCc-----eeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeech Q lcl|NC_021303. 240 MNNG-----VLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAA 314 (637) Q Consensus 240 ~gnG-----vlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~ 314 (637) ..|| ||.+|. +. .+....+.|.+.|.+ + ...-+.=++|+..|+ T Consensus 199 f~NG~~pg~il~~~~------~~------------------ls~e~~~~ik~~~~~-~-------~g~~~~r~~~l~~p~ 246 (344) T protein:vir:60 199 YENGAHAGYIMYVTD------AV------------------QDRNDIEMLRENMVK-S-------KGRNNFKNLFLYAPQ 246 (344) T ss_pred HhccCCCceEEEecC------cC------------------CCHHHHHHHHHHHHH-h-------cCCCCCcceEEecCC Confidence 5554 554443 11 111244455544422 1 122345688999998 Q ss_pred HHhcccceeecCcchhHH-HHhhHHHHHHHHHhhcCCchhHhhccCC---cceeeeEEeccCceeEeechhHHHHHHHHH Q lcl|NC_021303. 315 EHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSK---GNHWSAWAIGDEDVQLHIKPVMDLICQAIY 390 (637) Q Consensus 315 Ehi~~ikHlkf~~dvtev-aiktR~daI~RlAmglDv~pErLLGls~---~NHWsAW~I~dedVrlHI~P~me~ic~Ait 390 (637) .--+.+|-..+...-.+- -+++|+-....+|...=|||. |+|+.+ +|+-++-+....=++..|.|.+..|++ |+ T Consensus 247 g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~-llGi~~~~t~~~~n~e~~~~~f~~~~L~Pl~~~~e~-ln 324 (344) T protein:vir:60 247 GKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQ-LMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIRE-IN 324 (344) T ss_pred CCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHH-HhcccCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HH Confidence 655677777765444333 489999999999999999996 889843 356666666666677889999888875 44 Q ss_pred hHHHHHHHHHhCCChHHeEEeecCcccccCCC Q lcl|NC_021303. 391 NDILTPLLAREGIDPTKYILWYDASGLTSDPD 422 (637) Q Consensus 391 ~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD 422 (637) + + |..+ |+-|+.-.|..|=- T Consensus 325 ~-~----lg~~-------~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 325 G-W----LGQE-------VIRFKNYSLDTDNG 344 (344) T ss_pred H-h----cCCc-------ccccCccccCCCCC Confidence 3 2 2222 23344433333211 No 103 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=97.73 E-value=2.4e-05 Score=45.93 Aligned_cols=370 Identities=11% Similarity=0.052 Sum_probs=172.3 Q ss_pred hhhhhhc---ccc---cccchhhHHHHHhh-h-------hhhHhhHhhhhhcceeeeEEEEeeeccccCCCCCcccCCCC Q lcl|NC_021303. 36 KQMKTSL---MGT---ARNEWQSEAWDFSE-S-------IGELSYYISWRANSCSRTTLIPSAIDPDTGLPTGEVDIEED 101 (637) Q Consensus 36 ~~~k~~~---~g~---~r~~WQ~eAW~~yd-~-------VgELryyvgWr~~s~Sr~rL~aseiD~DtG~PtG~v~~e~~ 101 (637) =.|+++. +|. +......-.|.... . ..-+.-.|.-+++.+|++.+..-+ + |.. +. T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~---~-~~~---~~---- 69 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNLTDTVWCSIPSEKLKELSIKKWAIDSCANKIANTLSCAEVLTYE---K-GEE---VR---- 69 (395) T ss_pred CchHHHHHhhhcccccccccccchhhccccccchhhhhhhHHHHHHHHHHHHHHhhCceeecc---C-Ccc---cc---- Confidence 1122222 222 11111222343222 1 223444578889999999887643 2 221 11 Q ss_pred cccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeeeHHHhccCCCceeEEec Q lcl|NC_021303. 102 PDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIKSKAGETAEISL 181 (637) Q Consensus 102 ~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~~k~g~~~~i~l 181 (637) +....+.+.=-..-+-..++++.++.+|.+-|++|+++. +.+...++ .|........ .. ....+.. T Consensus 70 ---~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~-~~~~~~~~-------~~~~~~~~~~-~~--~~~~v~~ 135 (395) T protein:vir:40 70 ---KKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQ-DEYIYVAD-------SFTKNDKSLY-EN--TYTEVTL 135 (395) T ss_pred ---chHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEe-cCceeecC-------Cccccccccc-cc--eeeeeee Confidence 123334333344557778999999999999999998753 22221112 2322111111 01 1111221 Q ss_pred CCCCc--ccccCCCceEEEE-ecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcC---ceeeecccCCCCC Q lcl|NC_021303. 182 PDGKT--HEFNRDLDSLVRI-WNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNN---GVLFVPAEMSLPA 255 (637) Q Consensus 182 PdG~~--he~~~~~d~l~Rv-W~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gn---GvlfvPqe~slP~ 255 (637) +|-. .+|.. +-||++ .++.. -.++....+..+.++.- .+.++-.-+| |++.+.. T Consensus 136 -~~~~~~~~~~~--~evih~r~~~~~-----~~~~~~~l~~~~~~~~~------~~~~~~~~~~~~~~~l~~~~------ 195 (395) T protein:vir:40 136 -KDLTLKKEFKE--SEVLHLTLNNES-----IKSIIDGFYLLYGDLLT------AAVNKYKKLNSRKIIVKLKA------ 195 (395) T ss_pred -cCceeeeeecc--ccEEEeecCCCC-----ccccchhHHHHHHHHHH------HHHHHHHhcCCCCceEEEec------ Confidence 3322 22322 234443 12211 12233333344444321 1111222233 3333211 Q ss_pred cccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCcch-hHHH- Q lcl|NC_021303. 256 AQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEV-TEVE- 333 (637) Q Consensus 256 ~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dv-teva- 333 (637) . ......+.+.+++.+-+.-+....+ +.-++++ ++. -+++-|...... .-.. T Consensus 196 ~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~vl--~~g--~~~~~l~~~~~d~q~~e~ 249 (395) T protein:vir:40 196 M-----------------FGQTPEAEEKLRLMLSERMKKFLAE-----GDSALPV--EDG--MEIDELAGDSKIAESRDI 249 (395) T ss_pred c-----------------cCCCHHHHHHHHHHHHHHHHHhhcc-----CCceeec--CCC--ceEEeccCChhhhhHHHH Confidence 0 0112234556666665443332222 2222332 222 245555443322 2222 Q ss_pred HhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeec Q lcl|NC_021303. 334 IKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYD 413 (637) Q Consensus 334 iktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~D 413 (637) -|+.+++++.+|.-+-|||..| |-+.+| .-|....-++-.|.|.+..||++|+..+|-..-.. ..|-+-|| T Consensus 250 ~~~~~~~~~~Ia~~fgVPp~~l-~~~~sn---~e~~~~~f~~~~L~P~~~~ie~~l~~kLl~~~~~~-----~g~~i~fd 320 (395) T protein:vir:40 250 KKMIDDVFEMVANSFNIPLGLA-KGDTVG---LSEQVNSFLMFSINPIAEMFTDEGNRKFYGRDSVL-----ERTYMKLD 320 (395) T ss_pred HHHHHHHHHHHHHHhCCCHHHh-cCCCcC---HHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhc-----CCceEEEe Confidence 2455778899999999998765 543344 23444556778899999999999999987643321 34666799 Q ss_pred CcccccCCCCCHHH---HHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhccccccc Q lcl|NC_021303. 414 ASGLTSDPDLSDEA---VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGI 490 (637) Q Consensus 414 aS~Lt~dPD~tdeA---~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~i 490 (637) .+.| .++|..+.| ..++..|++|-.-.|+++|++--.+-+-+ + -+.| ..++.+ T Consensus 321 ~~~l-l~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD--~------------------~~~~---~n~~~~ 376 (395) T protein:vir:40 321 TTRI-KVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQ--E------------------RFVT---KNYAPL 376 (395) T ss_pred chhh-hccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCc--e------------------eeec---cccccc Confidence 8887 455554433 34788999999999999999754321100 0 0000 001111 Q ss_pred cCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021303. 491 EFPQPANAIESTREEDDEDSGARQQREPQTEDERST 526 (637) Q Consensus 491 e~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~ 526 (637) + .. .+ ...+|++.+.+.+ . T Consensus 377 ~-----------~~-~~---~~kgge~~~~~~~--~ 395 (395) T protein:vir:40 377 G-----------EN-EE---DLKGGDINENKGD--S 395 (395) T ss_pred c-----------cc-cc---ccCCCCCCCCcCC--C Confidence 1 00 00 0011111111100 0 No 104 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=97.61 E-value=2.5e-06 Score=51.30 Aligned_cols=355 Identities=13% Similarity=0.071 Sum_probs=172.4 Q ss_pred CCCCcceEEecCCCCCccccc---chheehhccc--cchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcce Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARR---RSLTAASQLI--TDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSC 75 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r---~~ltAAs~~~--~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~ 75 (637) |. ||.|......+. ++.++++.+. +.+......-++|. ...|.+..| ++|-+ |+-|.-+|-..-| T Consensus 1 m~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~-p~~~~~~~~-~~~~~-~~~~~~~~~~~pi 70 (368) T protein:vir:79 1 MS-------RNKTRRAARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGD-PVEVLDRRE-LLDYV-ECMRMGQWYEPPM 70 (368) T ss_pred CC-------ccccccchhccCcccccccccCcchhhccccCceEEEEcCC-ceeecchhh-HHHHH-HHHhccchhccCc Confidence 43 333333211111 1111111111 11111122234453 234666554 34433 3323333666666 Q ss_pred ee---eEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccc Q lcl|NC_021303. 76 SR---TTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVT 152 (637) Q Consensus 76 Sr---~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~ 152 (637) +. ++|+-+ .+..|.+ +. .+ ..+..+ ...-..-+...++ ++++.++-+-|..|+.+.-...|.+ T Consensus 71 ~~~~la~~~~~--~~~h~~~---~~-~~----~n~l~l-~~~Pn~~~t~~~f-~~l~~d~ll~Gnay~~~~r~~~G~~-- 136 (368) T protein:vir:79 71 PWDGLARSFRA--AAHHSSA---VY-VK----RNILVS-TFIPHPLLSRATF-ERLVLDWQVFGNAYLERRENVLGGT-- 136 (368) T ss_pred CHHHHHHHHhh--ccccchh---hh-hh----cchhhh-hcCCCcCCCHHHH-HHHHHHHhhcCCeEEEEEEcCCCCE-- Confidence 64 233211 1222211 11 11 011111 1123344666665 6788999999999998876555541 Q ss_pred ccccccccceeeeHHHhc-cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_021303. 153 GLAAPRARWYAVTREEIK-SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKI 231 (637) Q Consensus 153 ~~~~~~~~W~~vt~~Ei~-~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 231 (637) . ..+.+....+. .+.++.......+|..++|..+ -||++=+|+|..-..--||..+++.++.--...++.- T Consensus 137 -----~-~L~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~--dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~ 208 (368) T protein:vir:79 137 -----I-RLDTPLAKYVRRGLDLNTYFFVQNWQQPYTFAAG--SVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFR 208 (368) T ss_pred -----E-EEEEeCcccceeeccCCEEEEEecCCeEEEEccc--cEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHH Confidence 1 22233333333 2444555556668888888763 3677778888887788899999887765433333332 Q ss_pred HHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEe Q lcl|NC_021303. 232 KNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVAS 311 (637) Q Consensus 232 ~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~ 311 (637) ++.-+.=..-.|||.+|.. . .+....+.|.+.|-+ .+ ...+ +--++|+ T Consensus 209 ~~~~~NGa~~~gil~~~~~------~------------------l~~e~~~~lk~~~~~-~~-G~~N-----~g~~~vl- 256 (368) T protein:vir:79 209 RRYYKNGSHAGFILYMTDA------A------------------QKQEDVDTLREAMKS-AK-GPGN-----FRNLFMY- 256 (368) T ss_pred HHHHhccCCCceEEEeCCC------C------------------CCHHHHHHHHHHHHH-hc-CCcc-----cCceeEe- Confidence 2222222233445655531 1 112245556665533 11 1111 2223333 Q ss_pred echHHhcccceeecCcc-hhHHHHhhHHHHHHHHHhhcCCchhHhhccCC---cceeeeEEeccCceeEeechhHHHHHH Q lcl|NC_021303. 312 VAAEHLEKVQHIKFGNE-VTEVEIKTRIDAITRLAMGLDVSPERLLGMSK---GNHWSAWAIGDEDVQLHIKPVMDLICQ 387 (637) Q Consensus 312 vP~Ehi~~ikHlkf~~d-vtevaiktR~daI~RlAmglDv~pErLLGls~---~NHWsAW~I~dedVrlHI~P~me~ic~ 387 (637) .|+..=+.+|-..++.. .+.--+++|+-.+..+|...-||| .|+|+.+ +|+-+.-+....=++.-|.|.+..|++ T Consensus 257 ~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp-~llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~~ie~ 335 (368) T protein:vir:79 257 APNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPP-QLMGIIPNNTGGFGDVEKAAMVFARNEVKPLQDRLLA 335 (368) T ss_pred cCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCH-HHccccCCCCCccccHHHHHHHHHHHHHHHHHHHHHH Confidence 34322244444444332 223347899999999999999999 6779843 345556666666666778999998874 Q ss_pred HHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCH Q lcl|NC_021303. 388 AIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITS 438 (637) Q Consensus 388 Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~ 438 (637) |++ +| . .+++-||...|... |....|. +|.=|+ T Consensus 336 -ln~-~l----~-------~e~~rF~~~~l~~~-D~~a~a~----~~~rsa 368 (368) T protein:vir:79 336 -IND-WI----G-------DEVVRFAPYALGGH-DQPAAAP----GGQRSA 368 (368) T ss_pred -HHh-cc----C-------cceeeechhHhhcc-cccccCC----cccccC Confidence 443 22 2 13556666555321 1111111 111111 No 105 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=97.58 E-value=3.8e-06 Score=50.29 Aligned_cols=333 Identities=11% Similarity=0.097 Sum_probs=159.4 Q ss_pred ccccchheehhccccchhhhhhhhcccc-----cccchhhHHHHHhh-hhhhHhhHhhhhhcceee---eEEEEeeeccc Q lcl|NC_021303. 18 AARRRSLTAASQLITDPQKQMKTSLMGT-----ARNEWQSEAWDFSE-SIGELSYYISWRANSCSR---TTLIPSAIDPD 88 (637) Q Consensus 18 ~~~r~~ltAAs~~~~~p~~~~k~~~~g~-----~r~~WQ~eAW~~yd-~VgELryyvgWr~~s~Sr---~rL~aseiD~D 88 (637) ++-. ...++++.-+.+... .+||+ ....|=.++|+++- -.|+ |-..-+|+ ++|+-+ .+- T Consensus 1 ~~~~-~~~~~~~~~~~~~~~---~~~~~~p~~~~~~~~~~~~~~~~~~~~~~------~~epp~~~~~La~l~~~--n~~ 68 (348) T protein:vir:26 1 MTEQ-LIHSHTTDGTESKSV---YSFDPNPEPVDTNSWMTRYCELFYNDFDD------YWEPPISLKGLAEIANA--NGY 68 (348) T ss_pred CCcc-ccchhhccccCCceE---EEecCCCeeecCcchHHHHHHHHhcCCCc------cccCCCCHHHHHHHHhh--hhh Confidence 1111 111111111111111 23331 12334555555542 1221 11111111 011000 000 Q ss_pred cCCCCCcccCCCCcccchHHHHHHHhccCc-ccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccceeeeHH Q lcl|NC_021303. 89 TGLPTGEVDIEEDPDAQIVADYVKGIADGP-LGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWYAVTRE 167 (637) Q Consensus 89 tG~PtG~v~~e~~~~~~rv~~iv~~iAgG~-lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~ 167 (637) -+.+ |. .+...|...+--++ +-..+ +++++.++-+=|.+|+.+.-...|+ +.+. +.+... T Consensus 69 h~~~---i~-------~k~N~l~~~~~Pn~~~t~~~-f~~~~~d~ll~Gnay~~~~rn~~G~-~~~L-------~~l~~~ 129 (348) T protein:vir:26 69 HGSL---LK-------ARANYVAGRFMNGGGLPMYK-MNSACWDYFGLGMSAFVKIRSYLKN-VIAL-------EPLPMV 129 (348) T ss_pred hhhh---Hh-------hhhhHHhhcccCCCCCCHHH-HHHHHHHHHhcCCeEEEEEEcCCCc-EEEE-------EEecCc Confidence 0000 00 01111111111122 33444 4777888889999999876433443 2121 122222 Q ss_pred Hhcc-CCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceee Q lcl|NC_021303. 168 EIKS-KAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLF 246 (637) Q Consensus 168 Ei~~-k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlf 246 (637) .+.. +.+.. .....+|..++|..+ -||++=.|+|.....--||..+++.++.--.-.++.-++--+.=....|||. T Consensus 130 ~v~~~~d~~~-~~~~~~g~~~~f~~~--dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~ 206 (348) T protein:vir:26 130 HMRKRKNGDF-VQLLRNNEQKVFKAK--DVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFY 206 (348) T ss_pred eeEeeecCcE-EEEEecCeEEEEcCc--cEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 2332 33333 334457888888653 4566667788777777889888887654322222221121222223445565 Q ss_pred ecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecC Q lcl|NC_021303. 247 VPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFG 326 (637) Q Consensus 247 vPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~ 326 (637) ++. +. ......+.|.+.|.+ +. -.+ ..=.+++..|+..=+.+|-..++ T Consensus 207 ~~~------~~------------------ls~e~~~~lk~~~~~-~~----G~~---n~~~~~vl~~~g~~~Gi~~~pis 254 (348) T protein:vir:26 207 ATD------PN------------------LSEADEKALKEKIAS-SK----GIG---NFRSMFVNIPNGKEKGIQLIPVG 254 (348) T ss_pred ecC------CC------------------CCHHHHHHHHHHHHH-hc----Ccc---cccceeEEcCCCCccceeEEEcc Confidence 543 11 112245556555533 11 112 22335566665433445555543 Q ss_pred c-chhHHHHhhHHHHHHHHHhhcCCchhHhhccCC---cceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHHhC Q lcl|NC_021303. 327 N-EVTEVEIKTRIDAITRLAMGLDVSPERLLGMSK---GNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREG 402 (637) Q Consensus 327 ~-dvtevaiktR~daI~RlAmglDv~pErLLGls~---~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eG 402 (637) . ..+.--+++|+-...++|...-|||. |+|+.. +++-++-+....=++--|.|.+..|+++|++..+ T Consensus 255 ~~~~d~qf~e~k~~t~~dIa~af~VPp~-llGi~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~l~-------- 325 (348) T protein:vir:26 255 DIATKDEFERIKNITAQDIFVGHRFPAG-MGGMLPQQGANVPDPLKVSQVYDFYEVIPVCKRFMDAVNNDPE-------- 325 (348) T ss_pred CChhHHHHHHHHHhhHHHHHHHhCCCHH-HccccCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhhC-------- Confidence 2 22334678899999999999999985 889842 4555666666666777799999999999997532 Q ss_pred CChHHeEEeecCcccccCCCCCHHHHHHHhcCCc Q lcl|NC_021303. 403 IDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAI 436 (637) Q Consensus 403 iDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaI 436 (637) + +..+-++||.+. ..++++ +.|| T Consensus 326 ~-~~~~~~~fdl~~---~~e~~~-------~~a~ 348 (348) T protein:vir:26 326 I-PDNLKLKFNLNP---GVESAN-------GSAV 348 (348) T ss_pred C-CCccEEEEecCc---ccccch-------hhcC Confidence 2 245667888532 223333 2333 No 106 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=97.56 E-value=7e-06 Score=48.85 Aligned_cols=319 Identities=14% Similarity=0.134 Sum_probs=162.2 Q ss_pred EEecCCCCCcccccchheehhccccchhhhhhhhcccc----cccchhhHHHHHhhhhhhHhhH----------hhhhhc Q lcl|NC_021303. 8 VVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGT----ARNEWQSEAWDFSESIGELSYY----------ISWRAN 73 (637) Q Consensus 8 ~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~----~r~~WQ~eAW~~yd~VgELryy----------vgWr~~ 73 (637) .. |.|..+|+..-...++.++. ...-+||. ....|-.+..+++. .|+ || =-.++| T Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~-------~~~~~f~~p~~v~~~~~~~~~~~~~~-~~~--~~~pp~~~~~la~~~~a~ 69 (344) T protein:vir:20 1 MS-KKKGKTPQPAAKTMTASGPK-------MEAFTFGEPVPVLDRRDILDYVECIS-NGR--WYEPPVSFTGLAKSLRAA 69 (344) T ss_pred CC-cccCCCCcchhhhhhccCCc-------eEEEEcCCceEecCcchhhhhhhhhh-cCc--eecCCCCHHHHHHHHhhh Confidence 22 22333322221111111111 12223342 22223222222221 132 22 001111 Q ss_pred ceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhc-cCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccc Q lcl|NC_021303. 74 SCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIA-DGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVT 152 (637) Q Consensus 74 s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iA-gG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~ 152 (637) ....-=||+ +...+...+- ..-|...++ ++++.++.+=|..|+.++-...|++ T Consensus 70 ~~h~~~i~~-----------------------k~n~l~~~~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~-- 123 (344) T protein:vir:20 70 VHHSSPIYV-----------------------KRNILASTFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKV-- 123 (344) T ss_pred hhhCcccee-----------------------hhhhHHHhccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcE-- Confidence 111000010 0111111111 122444555 7889999999999998765444432 Q ss_pred ccccccccceeeeHHHhc-cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_021303. 153 GLAAPRARWYAVTREEIK-SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKI 231 (637) Q Consensus 153 ~~~~~~~~W~~vt~~Ei~-~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 231 (637) ..-|+.. ..-+. .+.++.......+|..++|..+ -||++=+|+|..-..--||..+++.++---.-.++ T Consensus 124 -----~~L~pl~-~~~vr~~~~~~~~~~~~~~~~~~~~~~~--eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~-- 193 (344) T protein:vir:20 124 -----IRLETSP-AKYTRRGVEEDVYWWVPSFNEPTAFAPG--SVFHLLEPDINQELYGLPEYLSALNSAWLNESATL-- 193 (344) T ss_pred -----EEEEEcC-CceeEeeecCCEEEEEccCCeEEEEcCc--cEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHH-- Confidence 1122211 11222 2344444445567888888763 45677688887777778898888877653222222 Q ss_pred HHHHHhHhhcC-----ceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCcccccccc Q lcl|NC_021303. 232 KNAAKSRVMNN-----GVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYI 306 (637) Q Consensus 232 ~na~~SRL~gn-----GvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~v 306 (637) -+.|...| |||.+|. +. ......+.|.+.|.+ -...-+.= T Consensus 194 ---~~~~~f~NGa~p~~Il~~~d------~~------------------l~~e~~~~ik~~~~~--------~~g~~n~r 238 (344) T protein:vir:20 194 ---FRRKYYENGAHAGYIMYVTD------AV------------------QDRNDIEMLRENMVK--------SKGRNNFK 238 (344) T ss_pred ---HHHHHHhccCCCceEEEecC------cC------------------CCHHHHHHHHHHHHH--------hcCCCCcc Confidence 23344444 4555553 11 111234445444422 22234567 Q ss_pred ceeEeechHHhcccceeecCcchhHH-HHhhHHHHHHHHHhhcCCchhHhhccCC---cceeeeEEeccCceeEeechhH Q lcl|NC_021303. 307 PLVASVAAEHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSK---GNHWSAWAIGDEDVQLHIKPVM 382 (637) Q Consensus 307 Piva~vP~Ehi~~ikHlkf~~dvtev-aiktR~daI~RlAmglDv~pErLLGls~---~NHWsAW~I~dedVrlHI~P~m 382 (637) ++|+..|+..-+.+|-..++..-.+- -+++|+-....+|...-|||. |+|+.. +++-++.+....=++.-|.|.+ T Consensus 239 ~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~-llGi~~~~t~~~~n~e~~~~~f~~~~l~P~~ 317 (344) T protein:vir:20 239 NLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQ-LMGGKPENVGSLGDIEKVAKVFVRNELIPLQ 317 (344) T ss_pred ceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHH-HhccCCCCCCccccHHHHHHHHHHHHHHHHH Confidence 88999998655677777776544333 489999999999999999996 779843 4455566666666777789998 Q ss_pred HHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCC Q lcl|NC_021303. 383 DLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPD 422 (637) Q Consensus 383 e~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD 422 (637) ..|.+ |++ + | |+ -++-|+-..|..|=+ T Consensus 318 ~~~e~-in~-~----l---g~----~~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 318 DRIRE-ING-W----L---GQ----EVIRFKNYSLDTDND 344 (344) T ss_pred HHHHH-HHH-h----c---CC----cccccCccccccCCC Confidence 88875 433 2 2 22 134465555544433 No 107 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=97.52 E-value=2e-06 Score=51.86 Aligned_cols=315 Identities=9% Similarity=0.082 Sum_probs=154.6 Q ss_pred EecCCCCCcccccchheeh--------hccccchhhhhhhhcccccccchhhHHHHHhhh------hhhHhhHhhhhhcc Q lcl|NC_021303. 9 VRRPKGSAPAARRRSLTAA--------SQLITDPQKQMKTSLMGTARNEWQSEAWDFSES------IGELSYYISWRANS 74 (637) Q Consensus 9 vrrpk~~~p~~~r~~ltAA--------s~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~------VgELryyvgWr~~s 74 (637) .++-|.++- + +.+.+ +-+.+-|...++- ..-|+.+.+++|+- .-+|-..-...++. T Consensus 1 ~~~~~~~~~---~-~~~~~~~~~~~~~~~~~~~~~~~~~y------~~~~~~~~~~~~epp~~~~~la~~~~~~~~h~~~ 70 (345) T protein:vir:37 1 MKTNVKTDN---K-KGIVIAPINDRTFSLSEITASPALDY------VGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGI 70 (345) T ss_pred CCccccccc---h-hhhcCCCceEEEeecCCcccchhhcc------cceeeecCCccccCCCCHHHHHHHhhcchhhcch Confidence 333333321 0 11111 1111111111111 12233333333320 00010000111110 Q ss_pred e-eeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccc Q lcl|NC_021303. 75 C-SRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTG 153 (637) Q Consensus 75 ~-Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~ 153 (637) + .+...+++.+.|. +-+-..++ ++++.++-+=|.+|+.++-...|++ .+ T Consensus 71 i~~k~n~l~~~~~Pn----------------------------~~~t~~~f-~~~v~d~ll~Gnay~~i~rn~~G~~-~~ 120 (345) T protein:vir:37 71 LHSRANMVSATYEGG----------------------------KALSKMEM-RALCLNLIQFGDVGLLKVRNGFGQV-VR 120 (345) T ss_pred hhhhhhHHhhccCCC----------------------------CCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCCE-EE Confidence 0 1111222222211 22334444 6677788888999999875545532 11 Q ss_pred cccccccceeeeHHHhc-cCCCcee-EEe----cCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhh Q lcl|NC_021303. 154 LAAPRARWYAVTREEIK-SKAGETA-EIS----LPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERT 227 (637) Q Consensus 154 ~~~~~~~W~~vt~~Ei~-~k~g~~~-~i~----lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rt 227 (637) -|+. ....+. .+.++.. .+. .-.|..++|..+ -||++=+|+|..-..--||..+++.++- + T Consensus 121 ------L~pl-~~~~vr~~~d~~~~~~~~~~~~~~~g~~~~~~~~--eViHir~~~~~~~~~Gl~~~~~a~~si~----l 187 (345) T protein:vir:37 121 ------LVPL-SSLYLRVHKDGGYSYLMKKSLYDTAQEIYRYDAK--DIIFIKLYDPMQQVYGSPDYVGGIQSAL----L 187 (345) T ss_pred ------EEEe-cCceeEEeecCCeeEEEeeeeeccCceEEEEccc--cEEEEcCCCCCCCcccchHHHHHHHHHH----H Confidence 1221 112222 1222222 221 113566666553 3566656777666666678887776553 2 Q ss_pred hHHHHHHHHhHhhcCc-----eeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCcccc Q lcl|NC_021303. 228 TRKIKNAAKSRVMNNG-----VLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQ 302 (637) Q Consensus 228 tk~I~na~~SRL~gnG-----vlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~ 302 (637) ... ...-+.+...|| ||.++. +. ......+.|.+.|.+ ...- T Consensus 188 ~~~-a~~~~~~~f~NGa~~~~Il~~t~------~~------------------l~~e~~~~lk~~~~~--------~~g~ 234 (345) T protein:vir:37 188 NSD-ATVFRRRYFSNGAHMGFILYSTD------PD------------------LTEEMEEEIARKISE--------SKGV 234 (345) T ss_pred HHH-HHHHHHHHHhccCCcceEEEeCC------CC------------------CCHHHHHHHHHHHHH--------hcCc Confidence 221 122234555554 554443 11 111234445444422 2222 Q ss_pred ccccceeEeechHHhcccceeecCcchhH-HHHhhHHHHHHHHHhhcCCchhHhhccCCc---ceeeeEEeccCceeEee Q lcl|NC_021303. 303 AAYIPLVASVAAEHLEKVQHIKFGNEVTE-VEIKTRIDAITRLAMGLDVSPERLLGMSKG---NHWSAWAIGDEDVQLHI 378 (637) Q Consensus 303 AA~vPiva~vP~Ehi~~ikHlkf~~dvte-vaiktR~daI~RlAmglDv~pErLLGls~~---NHWsAW~I~dedVrlHI 378 (637) .+.-++++..|+..-+.+|-..++..-.+ --+++|+..+..+|...-||| .|+|+... +.-+.-+....=++.-| T Consensus 235 ~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp-~liGi~~~~t~~~s~~e~~~~~f~~~~l 313 (345) T protein:vir:37 235 GNFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPA-GLSGIIPTNTGGLGDPLKYREVYHYDEV 313 (345) T ss_pred cccCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCH-HHhccccCCCCCcccHHHHHHHHHHHHH Confidence 35568888888754355666666554332 357889999999999999999 56698543 33344444444556679 Q ss_pred chhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCccccc Q lcl|NC_021303. 379 KPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTS 419 (637) Q Consensus 379 ~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~ 419 (637) .|.+..|.++|++ .++ -+..|++.||...|.. T Consensus 314 ~P~~~~ie~~ln~-----~~e----~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 314 MPLQEIIAETINQ-----DPE----IKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHhhh-----hhc----cCCcceEEECchhhcC Confidence 9999999999985 222 2357999999999988 No 108 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=97.48 E-value=7.4e-06 Score=48.73 Aligned_cols=322 Identities=13% Similarity=0.159 Sum_probs=159.5 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccc----cccchhhHHHHHhhhhhhHhhHhhhhhccee Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGT----ARNEWQSEAWDFSESIGELSYYISWRANSCS 76 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~----~r~~WQ~eAW~~yd~VgELryyvgWr~~s~S 76 (637) |- +.|..+ ++ +.++.++ .+...| ++|. ....|..+.++++.. |+ |-..-+| T Consensus 1 m~--------~~~~~~-~~---~~~~~~~---~~~~~~---~~~~p~~~~~~~~~~~~~~~~~~-~~------~~~pp~~ 55 (340) T protein:vir:98 1 MS--------KRKPRK-AV---AMTASAP---QKMEAF---TFGEPVPVLDKRDILDYVECISN-GK------WYEPPVS 55 (340) T ss_pred CC--------CCCCCc-cc---cccccCc---cceeEE---EcCCceeecCcchhhhhhhhhhc-Cc------eecCCCC Confidence 32 111111 11 1111111 011222 2331 223344444444422 31 2222222 Q ss_pred eeE---EEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCc-ccHHHHHHHHHhhhcccccEEEEEEeecCCcccc Q lcl|NC_021303. 77 RTT---LIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGP-LGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVT 152 (637) Q Consensus 77 r~r---L~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~-lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~ 152 (637) +.= |+ +..+--+-+ |.. +...|...+-.++ |.+. -+++++.++-+-|..|+.++-...|++ . T Consensus 56 ~~~la~l~--~a~~~h~s~---i~~-------k~n~l~~~~~Pn~~lt~~-~f~~~~~d~ll~Gnay~~~~rn~~G~~-~ 121 (340) T protein:vir:98 56 FSGLAKSL--RSAVHHSSP---IYV-------KRNVLASTYIPHPLLSRQ-DFSRFALDYLVFGNAFLEQRHSVTGQL-I 121 (340) T ss_pred HHHHHHHH--Hhccccchh---hhh-------hhhHHhhccCCCCCCCHH-HHHHHHHHHHhcCCeEEEEEECCCCcE-E Confidence 110 00 000000000 000 1111111112222 3333 357788898899999998774444431 1 Q ss_pred ccccccccceeeeHHHhc-cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHH Q lcl|NC_021303. 153 GLAAPRARWYAVTREEIK-SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKI 231 (637) Q Consensus 153 ~~~~~~~~W~~vt~~Ei~-~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I 231 (637) + =| .+....+. .++++.......+|..++|..+ -||++=+|+|..-..--||..+++.++.-=.- T Consensus 122 ~------L~-pl~~~~vr~~~~~~~~~~~~~~~~~~~~~~~--eViHir~~~~~~~~~Gls~~~~a~~si~l~~a----- 187 (340) T protein:vir:98 122 K------LL-TSPAKYTRRGVDDSVFWFVENFTQPHEFAPD--TVFHLLEPDINQEIYGLPEYLSALNSAWLNES----- 187 (340) T ss_pred E------EE-EeCCceEEEcccCcEEEEEecCCeEEEEccc--cEEEEcCCCCCCCcccccHHHHHHHHHHHHHH----- Confidence 1 12 22223333 2444555555668888888653 36777678887777778898888776532211 Q ss_pred HHHHHhHhhcCc-----eeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCcccccccc Q lcl|NC_021303. 232 KNAAKSRVMNNG-----VLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYI 306 (637) Q Consensus 232 ~na~~SRL~gnG-----vlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~v 306 (637) ...-+.|...|| ||.+|. +. .+....+.|.+.+ ++-......= T Consensus 188 a~~~~~~~f~NGa~pg~il~~~~------~~------------------ls~e~~~~lk~~~--------~~~~G~~n~~ 235 (340) T protein:vir:98 188 ATLFRRKYYQNGAHAGYIMYVTD------PA------------------QSATDVESLRDAM--------RNSKGLGNFK 235 (340) T ss_pred HHHHHHHHHhccCCCceEEEecC------CC------------------CCHHHHHHHHHHH--------HHhcCccccC Confidence 222334555555 666653 11 1112344444433 2212223334 Q ss_pred ceeEeechHHhcccceeecCcchh-HHHHhhHHHHHHHHHhhcCCchhHhhccCC---cceeeeEEeccCceeEeechhH Q lcl|NC_021303. 307 PLVASVAAEHLEKVQHIKFGNEVT-EVEIKTRIDAITRLAMGLDVSPERLLGMSK---GNHWSAWAIGDEDVQLHIKPVM 382 (637) Q Consensus 307 Piva~vP~Ehi~~ikHlkf~~dvt-evaiktR~daI~RlAmglDv~pErLLGls~---~NHWsAW~I~dedVrlHI~P~m 382 (637) .+++..|+..-+.+|-..++..-. .--+++|+-.+..+|...-|||. |+|+.+ +++-+.-+....=++.-|.|.+ T Consensus 236 ~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~-llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~ 314 (340) T protein:vir:98 236 NLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQ-LMGGKPENIGSLGDVEKVAKVFVRNELSPLQ 314 (340) T ss_pred ceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHH-HhcccCCCCCccccHHHHHHHHHHHHHHHHH Confidence 677777765445677776664433 33578999999999999999995 889853 3455667777777778899999 Q ss_pred HHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCC Q lcl|NC_021303. 383 DLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPD 422 (637) Q Consensus 383 e~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD 422 (637) +.|++ |++ +| ..+ |+-||...|. +.| T Consensus 315 ~~iee-~n~-~L----~~e-------~~rF~~~~l~-~~d 340 (340) T protein:vir:98 315 DRFRE-VND-WL----GME-------VIRFKEYTLD-NPE 340 (340) T ss_pred HHHHH-HHh-cc----ccc-------ccccCccccc-cCC Confidence 99986 543 32 222 3455554442 222 No 109 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=97.41 E-value=1.2e-05 Score=47.66 Aligned_cols=329 Identities=12% Similarity=0.111 Sum_probs=160.2 Q ss_pred EEecCCCCCcccccchheehhccccchhhhhhhhcccc----cccchhh---HHHHHhh-------h--hhhHhhHhhhh Q lcl|NC_021303. 8 VVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGT----ARNEWQS---EAWDFSE-------S--IGELSYYISWR 71 (637) Q Consensus 8 ~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~----~r~~WQ~---eAW~~yd-------~--VgELryyvgWr 71 (637) .. |.|..+++..- +....++....|++. ..-++|. ....|.. |||..-+ . .-+|-....+. T Consensus 1 ~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h 77 (351) T protein:vir:78 1 MS-KRRSRAPRTFA-AAPNPSAGSAAPARA-EVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHH 77 (351) T ss_pred CC-CCCCCCCCCCC-CCCchhhhhccccee-EEEEcCCceeecCcchhhhhhhhhccCceecCCCCHHHHHHHHhhhHhh Confidence 22 12221110000 111111111111111 1112221 1112222 3441100 0 01221122222 Q ss_pred hccee-eeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcc Q lcl|NC_021303. 72 ANSCS-RTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDP 150 (637) Q Consensus 72 ~~s~S-r~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~ 150 (637) ++.+. +..++++.+.|. .-+-+.++ ++++..+-+=|.+|+.+.-...|. T Consensus 78 ~~~l~~k~n~l~~~~~Pn----------------------------~~~t~~~f-~~~~~d~ll~Gnay~~~~rn~~G~- 127 (351) T protein:vir:78 78 SSALFFKANVLASTFRPH----------------------------RWLSRHAF-ERWALDFLTFGNGYLERRRNMVGG- 127 (351) T ss_pred hhhhhhhhhHHhhcccCC----------------------------CCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCC- Confidence 22221 112222222222 12344445 667777778899998776554453 Q ss_pred ccccccccccceeeeHHHhc-cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhH Q lcl|NC_021303. 151 VTGLAAPRARWYAVTREEIK-SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTR 229 (637) Q Consensus 151 ~~~~~~~~~~W~~vt~~Ei~-~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk 229 (637) +.+-|+ +...-+. .+.++.......+|..++|..+ -||++=+|+|.....-=||..+++.++..-.-.++ T Consensus 128 ------~~~L~p-l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~--eVihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~ 198 (351) T protein:vir:78 128 ------TLRLEP-ALAKYVRRKADFSGFVYVNGWQERHEFAPD--SVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTL 198 (351) T ss_pred ------EEEEEE-ecCcceEEeeeCCeEEEEecCCeEEEEccc--cEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHH Confidence 122222 2233333 2333444445557888888764 35666678887776667899999888765444444 Q ss_pred HHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCcccccccccee Q lcl|NC_021303. 230 KIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLV 309 (637) Q Consensus 230 ~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiv 309 (637) .-++.-+.-..-.|||.++. +. ......+.|.+.|- +-......=.++ T Consensus 199 ~~~~~f~NGa~pggIl~~~~------~~------------------ls~e~~~~lr~~~~--------~~~G~~N~~~~~ 246 (351) T protein:vir:78 199 FRRKYYENGSHAGFILYMTD------AA------------------QKQDDVDNMRDALK--------NAKGPGNFRNVF 246 (351) T ss_pred HHHHHHhccCCCceEEEecC------CC------------------CCHHHHHHHHHHHH--------HhcCccccccee Confidence 33333333233344555543 00 11124445544442 212233334555 Q ss_pred EeechHHhcccceeecCcchh-HHHHhhHHHHHHHHHhhcCCchhHhhccCC---cceeeeEEeccCceeEeechhHHHH Q lcl|NC_021303. 310 ASVAAEHLEKVQHIKFGNEVT-EVEIKTRIDAITRLAMGLDVSPERLLGMSK---GNHWSAWAIGDEDVQLHIKPVMDLI 385 (637) Q Consensus 310 a~vP~Ehi~~ikHlkf~~dvt-evaiktR~daI~RlAmglDv~pErLLGls~---~NHWsAW~I~dedVrlHI~P~me~i 385 (637) +..|+..-+.+|-..++..-. .--+++|+-....+|...-||| .|+|+.+ +++-++.+....=++..|.|.+..| T Consensus 247 v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp-~llGi~~~~t~~~sn~e~~~~~f~~~~l~P~~~~i 325 (351) T protein:vir:78 247 MYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPP-QLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARF 325 (351) T ss_pred eecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCH-HHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHH Confidence 666654445666666654433 3356899999999999999998 5679853 4455667777777788899999999 Q ss_pred HHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCC Q lcl|NC_021303. 386 CQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLS 424 (637) Q Consensus 386 c~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~t 424 (637) ++ |++ +| | . +++-||..+|-----++ T Consensus 326 ee-~n~-~l-------~---~-~~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 326 AE-LND-WL-------G---D-EVVRFDDYEIPPAPVAA 351 (351) T ss_pred HH-HHh-hc-------C---c-cceecChhhhccccccC Confidence 86 443 22 1 2 24667777664322222 No 110 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=97.38 E-value=2.7e-06 Score=51.14 Aligned_cols=324 Identities=11% Similarity=0.117 Sum_probs=159.0 Q ss_pred EecCCCCCcccccchheehhccccchhhhhhhhcccc----cccchhhHHHHHhh-hhhhHhhHhhhhhcceeeeEEEEe Q lcl|NC_021303. 9 VRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGT----ARNEWQSEAWDFSE-SIGELSYYISWRANSCSRTTLIPS 83 (637) Q Consensus 9 vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~----~r~~WQ~eAW~~yd-~VgELryyvgWr~~s~Sr~rL~as 83 (637) .-++|..+. -+++.+ |...| ++|. ....|-.+..+++. ..|+ |-..-|++.-| + T Consensus 1 m~~~~~~~~------~~~~~~----~~~~~---~~~~p~~~~~~~~~~~~~~~~~~~~~~------~~~pP~~~~~L--a 59 (337) T protein:vir:78 1 MTKRQQQPA------QAAASS----PRPSV---VFSMPEAIDPTAWMTDYTGVFYNPYGE------YYQPPIDRKGL--A 59 (337) T ss_pred CCCcccCcc------cccccC----ceeEE---EecCcccccCcchhHhhhhhhhccCcc------eecCCCCHHHH--H Confidence 223333321 111111 11222 2331 11223333333321 1121 21222222111 0 Q ss_pred eeccccCCCCCcccCCCCcccchH-HHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccccce Q lcl|NC_021303. 84 AIDPDTGLPTGEVDIEEDPDAQIV-ADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRARWY 162 (637) Q Consensus 84 eiD~DtG~PtG~v~~e~~~~~~rv-~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~W~ 162 (637) ++ -..++--.++ .--.+-++....+..+++++++.++-+=|..|+.+.-...|++ .+ -+ T Consensus 60 ~l------------~~~~~~h~~~L~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~-~~-------L~ 119 (337) T protein:vir:78 60 KV------------ARANAHHGAILMARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQV-VG-------LH 119 (337) T ss_pred HH------------hhcchhhhhHHHhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcE-EE-------EE Confidence 00 0000100000 0112223333444457889999999999999988654444432 11 12 Q ss_pred eeeHHHhcc-CCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhc Q lcl|NC_021303. 163 AVTREEIKS-KAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMN 241 (637) Q Consensus 163 ~vt~~Ei~~-k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~g 241 (637) .+....++. +++.... ...+|..++|..+ -||++=+|+|..-..--||+.+++.++--=...++.-++.-+.=..- T Consensus 120 pl~~~~v~~~~d~~~~~-~~~~~~~~~~~~~--eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p 196 (337) T protein:vir:78 120 PLSSVYLRRREDGCFVY-LQQGKPNLIYRPD--DVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHM 196 (337) T ss_pred EeCCceeEeeeCCeEEE-EEcCCceEEECCc--cEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCC Confidence 222233332 3444443 3447777777653 35777677776666667888888876653333333322222222333 Q ss_pred CceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccc Q lcl|NC_021303. 242 NGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQ 321 (637) Q Consensus 242 nGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ik 321 (637) .|||.+|. +. ......+.|.+.+- +-.+....=.+++..|+..=+.+| T Consensus 197 ~~il~~~~------~~------------------l~~e~~~~lk~~~~--------~~~G~~n~~~~~v~~~~g~~~Gi~ 244 (337) T protein:vir:78 197 GFIFYATD------PN------------------MDDDTEEEMKEMIA--------NSKGVGNFRSMFVNIPDGKPDGIK 244 (337) T ss_pred ceeEEcCC------CC------------------CCHHHHHHHHHHHH--------HhcCcccccceEEEcCCCCcccee Confidence 44555553 11 11123445544432 212223344566777765435566 Q ss_pred eeecCcchh-HHHHhhHHHHHHHHHhhcCCchhHhhccCC-cceee---eEEeccCceeEeechhHHHHHHHHHhHHHHH Q lcl|NC_021303. 322 HIKFGNEVT-EVEIKTRIDAITRLAMGLDVSPERLLGMSK-GNHWS---AWAIGDEDVQLHIKPVMDLICQAIYNDILTP 396 (637) Q Consensus 322 Hlkf~~dvt-evaiktR~daI~RlAmglDv~pErLLGls~-~NHWs---AW~I~dedVrlHI~P~me~ic~Ait~~~Lr~ 396 (637) -..++..-. .--+++|+-....+|...=||| .|+|+.+ .+.|+ +-+....=++--|.|.++.|+++++.. T Consensus 245 ~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp-~llGi~~~~~~~~~~n~e~~~~~f~~~~L~P~~~~ie~~~n~~---- 319 (337) T protein:vir:78 245 LIPVGDIATKDEFAAIKGITAQDVLTAHRYPP-ALAGIIPTNGGGGLGDPEKYDATYARNEVLPLCELVQDAINSA---- 319 (337) T ss_pred EEEcCCChhHHHHHHHHHHhHHHHHHHhCCCH-HHcccccCCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHhhh---- Confidence 666654332 2347899999999999999999 6789854 34565 566666667788999999999999753 Q ss_pred HHHHhCCChHHeE-EeecCcccc Q lcl|NC_021303. 397 LLAREGIDPTKYI-LWYDASGLT 418 (637) Q Consensus 397 ~L~~eGiDp~kYv-vw~DaS~Lt 418 (637) ++.+..|+ +=+..+.|- T Consensus 320 -----ll~~~~~~~f~~~~~~~~ 337 (337) T protein:vir:78 320 -----GLPRALWVTFRETIGAAV 337 (337) T ss_pred -----cCChhhceeccccccccC Confidence 22333332 223333333 No 111 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=97.36 E-value=2e-05 Score=46.37 Aligned_cols=320 Identities=14% Similarity=0.157 Sum_probs=162.4 Q ss_pred EEecCCCCCcccccchheehhccccchhhhhhhhcccc----cccchhhHHHHHhhhhhhHhhH---hh-------hhhc Q lcl|NC_021303. 8 VVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGT----ARNEWQSEAWDFSESIGELSYY---IS-------WRAN 73 (637) Q Consensus 8 ~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~----~r~~WQ~eAW~~yd~VgELryy---vg-------Wr~~ 73 (637) ..||-| .+|+......+++++.. ..-++|. ....|-.+..+++. .|+ || +. .++| T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~-------~~~~~~~p~~v~~~~~~~~~~~~~~-~~~--~~~pp~~~~~la~~~~a~ 69 (344) T protein:vir:56 1 MSKKKG-KTPQPAAKTMTASAPKM-------EAFTFGEPVPVLDRRDILDYVECIS-NGR--WYEPPVSFTGLAKSLRAA 69 (344) T ss_pred CCCCCC-CCCchhhHHhhcCCCce-------EEEEcCCceeecCcchhhhHHHhhh-cCc--cccCCCCHHHHHHHHhhh Confidence 223332 22222222222222111 1222332 12223222222221 132 22 00 1111 Q ss_pred ceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccC-cccHHHHHHHHHhhhcccccEEEEEEeecCCcccc Q lcl|NC_021303. 74 SCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADG-PLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVT 152 (637) Q Consensus 74 s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG-~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~ 152 (637) ... +-| +.. +...|...+--+ -+-+.++ ++++.++.+-|.+|+.+.-...|+ +. T Consensus 70 ~~h-------------~s~---i~~-------k~n~l~~~~~Pnp~~t~~~f-~~~~~d~ll~Gnay~~~~rn~~G~-~~ 124 (344) T protein:vir:56 70 VHH-------------SSP---IYV-------KRNILASTFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGK-VI 124 (344) T ss_pred hhh-------------Ccc---cee-------hhhhHHhhcCCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCc-EE Confidence 100 000 000 011111111112 2445555 888999999999999876443443 22 Q ss_pred ccccccccceeeeHHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_021303. 153 GLAAPRARWYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (637) Q Consensus 153 ~~~~~~~~W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 232 (637) +...-...+..+ .+.++.......+|..++|.. +-||++=+|+|..--.--||..+++.++. +..... T Consensus 125 ~L~pl~~~~v~~------~~~~~~~~~~~~~g~~~~~~~--~dIiHir~~~~~~~~~Gls~~~~a~~si~----l~~~a~ 192 (344) T protein:vir:56 125 RLETSPAKYTRR------GVEEDVYWWVPSFNEPTAFAP--GSVFHLLEPDINQELYGLPEYLSALNSAW----LNESAT 192 (344) T ss_pred EEEEeCCceeEE------eecCCEEEEEecCCeEEEEcC--ccEEEECCCCCCCCcccccHHHHHHHHHH----HHHHHH Confidence 221111122221 234455555666888888865 34677778888776677788887776654 222222 Q ss_pred HHHHhHhhcCc-----eeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccc Q lcl|NC_021303. 233 NAAKSRVMNNG-----VLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIP 307 (637) Q Consensus 233 na~~SRL~gnG-----vlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vP 307 (637) ....|...|| ||.+|. +. ......+.|.+.+.+ + ...-+.=+ T Consensus 193 -~~~~~~f~NGa~pg~Il~~~d------~~------------------ls~e~~~~lk~~~~~----~----~g~~~~r~ 239 (344) T protein:vir:56 193 -LFRRKYYENGAHAGYIMYVTD------AV------------------QDRNDIEMLRENMVK----S----KGRNNFKN 239 (344) T ss_pred -HHHHHHHhccCCCceEEEecC------CC------------------CCHHHHHHHHHHHHH----h----cCCCCccc Confidence 2344555554 555553 10 111234455544432 1 12235778 Q ss_pred eeEeechHHhcccceeecCcchhHH-HHhhHHHHHHHHHhhcCCchhHhhccCC---cceeeeEEeccCceeEeechhHH Q lcl|NC_021303. 308 LVASVAAEHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSK---GNHWSAWAIGDEDVQLHIKPVMD 383 (637) Q Consensus 308 iva~vP~Ehi~~ikHlkf~~dvtev-aiktR~daI~RlAmglDv~pErLLGls~---~NHWsAW~I~dedVrlHI~P~me 383 (637) +|+..|+..-+.+|-..+...-.+- -+++|+-.+..+|...=|||. |+|+.. ++.-+..+....=++--|.|.++ T Consensus 240 l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~-llGi~~~~t~~~~n~eq~~~~f~~~tL~Pl~~ 318 (344) T protein:vir:56 240 LFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQ-LMGGKPENVGSLGDIEKVAKVFVRNELIPLQD 318 (344) T ss_pred eEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHH-HhccCCCCCCccccHHHHHHHHHHHHHHHHHH Confidence 8999998655777777776544433 489999999999999999996 889843 34555677777677788899998 Q ss_pred HHHHHHHhHHHHHHHHHhCCChHHeEEeecCc Q lcl|NC_021303. 384 LICQAIYNDILTPLLAREGIDPTKYILWYDAS 415 (637) Q Consensus 384 ~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS 415 (637) .|++ +.+ +| ..+=|....|.+=-|-- T Consensus 319 ~ie~-~n~-~l----~~~~~~F~~y~l~~~~~ 344 (344) T protein:vir:56 319 RIRE-ING-WI----GQEVIRFKNYSLDTDNG 344 (344) T ss_pred HHHH-HHh-hh----ccccccCCCccccccCC Confidence 8876 333 22 22223444444332222 No 112 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=96.73 E-value=7.7e-05 Score=43.15 Aligned_cols=345 Identities=12% Similarity=0.115 Sum_probs=157.1 Q ss_pred CCCC-cceEEecCC-------CCCcccccchheeh--------hccccchhhhhhhhcccc----cccchhhHHHHHhhh Q lcl|NC_021303. 1 MAAT-SLRVVRRPK-------GSAPAARRRSLTAA--------SQLITDPQKQMKTSLMGT----ARNEWQSEAWDFSES 60 (637) Q Consensus 1 ma~~-~lr~vrrpk-------~~~p~~~r~~ltAA--------s~~~~~p~~~~k~~~~g~----~r~~WQ~eAW~~yd~ 60 (637) |-+. .-|--|+-+ |-.|+.+|+--..+ ++....|++. -.-++|. ....|..+..+++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~f~fg~p~~v~~~~~~~~~~~~~~~ 79 (376) T protein:vir:10 1 MPARDRPRAARRRRHSFIFIHGVLRMSKRRSRAPRTFAAAPNPSAGSAAPARA-EVFTFDDPTPVMNRAEILDYVECWSN 79 (376) T ss_pred CCCCccchhhhhhcccchhhcccccchhccCCCcccchhhhhHhhhccCccee-EEEEcCCceeccCcchhhhhhhhhhc Confidence 3322 111111110 01222222111111 1111111111 0111221 112222222222211 Q ss_pred hhhHhhHhhhhhcceeeeEEEEeeeccccCCCCCcccCCCCcccc-----hHHHHHHHhccCc-ccHHHHHHHHHhhhcc Q lcl|NC_021303. 61 IGELSYYISWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQ-----IVADYVKGIADGP-LGQAALIKRAVECMTV 134 (637) Q Consensus 61 VgELryyvgWr~~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~-----rv~~iv~~iAgG~-lGqaqLlkr~~~~LtV 134 (637) | +|-..-+++.-| +--+ .-|+--. +...+...+--++ |-+.+ +++++.++-+ T Consensus 80 -~------~~~~pp~~~~~L-a~~~-------------~~~~~h~s~l~~k~n~l~~~~~Pnp~lT~~~-f~~~v~d~ll 137 (376) T protein:vir:10 80 -G------EWFEPPVSFAGL-AKSF-------------RASTHHSSALFFKANVLASTFRPHRWLSRHA-FERWALDFLT 137 (376) T ss_pred -C------ceecCCCCHHHH-HHHH-------------hhhHHhhhhHHHHhHHHHhccCCCCCCCHHH-HHHHHHHHHh Confidence 1 121111111100 0000 0000000 1111111111122 33333 5678888889 Q ss_pred cccEEEEEEeecCCccccccccccccceeeeHHHhc-cCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccc Q lcl|NC_021303. 135 VGEVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIK-SKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSP 213 (637) Q Consensus 135 pGE~wi~il~r~~~~~~~~~~~~~~~W~~vt~~Ei~-~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSP 213 (637) =|.+|+.+.-...|.+ .+ .+.|...-+. .++++.......+|..++|..+ -||++=+|+|..--.-=|| T Consensus 138 ~Gnay~~~~rn~~G~~-------~~-L~pl~~~~vr~~~d~~~~~~~~~~~~~~~~~~~--eViHir~~~~~~~~yGls~ 207 (376) T protein:vir:10 138 FGNGYLERRRNMVGGT-------LR-LEPALAKYVRRKADFNGFVYVNGWQERHEFEPD--SVFQLVRPDINQEVYGLPE 207 (376) T ss_pred cCCeEEEEEECCCCCE-------EE-EEEeCCcceEEEeeCCeEEEEEcCCeEEEEccc--cEEEecCCCCCCCcccccH Confidence 9999987765444531 22 2222223333 2444444445557777777653 3566656777766666789 Q ss_pred hhhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHh Q lcl|NC_021303. 214 VRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASV 293 (637) Q Consensus 214 vra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~ 293 (637) ..+++.++---...++.-++--+.=..-.|||.+|.. . ......+.|.+.|-+ T Consensus 208 ~~~a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~d~------~------------------l~~e~~~~lr~~~~~--- 260 (376) T protein:vir:10 208 YLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDA------A------------------QKQDDVDNMRDALKN--- 260 (376) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCC------C------------------CCHHHHHHHHHHHHH--- Confidence 9988887654333333333333333334456666540 0 111244556555522 Q ss_pred hcccCccccccccceeEeechHHhcccceeecCcchh-HHHHhhHHHHHHHHHhhcCCchhHhhccCC---cceeeeEEe Q lcl|NC_021303. 294 AAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVT-EVEIKTRIDAITRLAMGLDVSPERLLGMSK---GNHWSAWAI 369 (637) Q Consensus 294 aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvt-evaiktR~daI~RlAmglDv~pErLLGls~---~NHWsAW~I 369 (637) .. .....=.+++..|+..=+.+|-..++..-. .--+++|+-.+..+|...-|||. |+|+.+ ++.-++.+. T Consensus 261 --~~---G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~-llGi~~~~t~~~sn~eq~ 334 (376) T protein:vir:10 261 --AK---GPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQ-LLGIVPSNSGGFGTPDTA 334 (376) T ss_pred --hc---CccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCcccHHHH Confidence 11 122233355566653334555555554322 33588999999999999999995 889853 445666666 Q ss_pred ccCceeEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCC Q lcl|NC_021303. 370 GDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLS 424 (637) Q Consensus 370 ~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~t 424 (637) ...=++.-|.|.+..|.+ |++ +| . + +|+-||..+|-----++ T Consensus 335 ~~~f~~~~L~Pl~~~iee-ln~-~L----~------~-~~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 335 ARVFGRNEIRPLQARFAE-LND-WL----G------E-EVVRFDDYEIPPAPVAA 376 (376) T ss_pred HHHHHHHHHHHHHHHHHH-HHh-hc----c------c-cccccChhHhhcccccC Confidence 666667778999888875 444 33 1 1 24556665553211111 No 113 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=96.63 E-value=0.00046 Score=38.90 Aligned_cols=386 Identities=13% Similarity=0.042 Sum_probs=165.7 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.=-+-- .++|.. .+ ..+.++. .+.. +..+ ..+ ...-+.-.+.-++++||.+.+ T Consensus 1 MGlf~~~--~~~~~~-------~~----~~~~~~~-~~~~---------~~~~--~~~-~~~~v~~~I~~ia~~iA~lp~ 54 (395) T protein:vir:98 1 MGILDFF--SFKKSG-------TL----SDDDSGS-TTSE---------KLTN--VVL-KEDALYKCVNYLARIISKSTF 54 (395) T ss_pred Ccchhhh--cCCCcc-------cc----cccccch-hhhh---------hcch--hhh-hhHHHHHHHHHHHHHHhhCce Confidence 5443211 111110 00 0011111 0010 1111 111 223445557788999999888 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+-+ .+. + .+ +.+..+++.=.---+-..++++.++.+|.+-|++||++.-....-.++ . T Consensus 55 ~~~~~~--~~~----~--~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~~~~~-------~ 115 (395) T protein:vir:98 55 RLKTPE--KLT----E--NQ----KDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGIYVAD-------S 115 (395) T ss_pred eEEecC--Ccc----c--cc----chHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCceecCC-------c Confidence 765432 111 1 11 234445444344557889999999999999999998865432221111 2 Q ss_pred ceeeeHHHhccCCCceeEEecCCCC-cccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHh Q lcl|NC_021303. 161 WYAVTREEIKSKAGETAEISLPDGK-THEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRV 239 (637) Q Consensus 161 W~~vt~~Ei~~k~g~~~~i~lPdG~-~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL 239 (637) |.. .. .+... ....+..-.+. ..+|....=+-||..++..+.. -++++...-..+.......+ .+...|. T Consensus 116 ~~~-~~-~~~~~--~~~~~~~~~~~~~~~~~~~evih~k~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~---~~~~~~~ 186 (395) T protein:vir:98 116 FTQ-DK-KISGS--QFKVSRVQGQTYEKTFTFDQVIYLKNDNSDLMSK--VESLWEEYGELLGHVINNQK---IANQIRF 186 (395) T ss_pred ccc-cc-cccCc--ccceeeecCceeeeEecCccEEEecCCCCCcccc--ccchhhhHHHHHHHHHHHHH---HHHHHHH Confidence 221 11 11100 01111111111 1233222112244343333322 23333333333333222211 1223444 Q ss_pred hcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcc Q lcl|NC_021303. 240 MNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEK 319 (637) Q Consensus 240 ~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ 319 (637) .+|+- -+. ..+.... ...+....+.+++++-+. ..+.. ...-+++ + .++. -+ T Consensus 187 ~~~~~--~~~-~~~~~~~----------------~~~~~~~~~~~~~~~~~~-~~~~~--~~~~~v~--~--l~~g--~~ 238 (395) T protein:vir:98 187 TMIPP--KDK-VRERAQE----------------NSDGGRQSKSDKDFFKRT-VEKIR--TESVVGI--P--VTAN--TN 238 (395) T ss_pred hhccc--ccc-ccccccc----------------cCCcHHHHHHHHHHHHHH-Hhhhh--cCCccee--e--cCCC--ce Confidence 44432 000 0000000 001111222233322222 22211 1111122 1 2222 23 Q ss_pred cceeecC-----cchhHHHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHHHhHHH Q lcl|NC_021303. 320 VQHIKFG-----NEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDIL 394 (637) Q Consensus 320 ikHlkf~-----~dvtevaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~L 394 (637) .+-|++. +.-++=-+++|+..+..+|.-.-|||.-| |-+.+ +.-+....=++..|.|.+..|.++|+..+| T Consensus 239 ~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l-~~~~s---n~e~~~~~f~~~tl~P~~~~ie~~l~~kll 314 (395) T protein:vir:98 239 YEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLL-HGDIA---DNQKNYELLLEGPIESLITNIVDGLEYAIF 314 (395) T ss_pred eEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHh-cCCcc---cHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 4444332 12223457899999999999999999866 42222 222333334455699999999999999988 Q ss_pred HHHHHHhCCChHHeEEeecCcccccCCCCCHH---HHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcC Q lcl|NC_021303. 395 TPLLAREGIDPTKYILWYDASGLTSDPDLSDE---AVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTK 471 (637) Q Consensus 395 r~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tde---A~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~ 471 (637) .+-....|+ | ||...| ..+|..+. ...+++.|.+|..-.|+.+|++--.+-. ..+. T Consensus 315 ~~~~~~~g~----~---f~~~~l-~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~--gD~~----------- 373 (395) T protein:vir:98 315 DKSETLQGS----F---IKVTGL-KNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGL--GKVL----------- 373 (395) T ss_pred ChhhhcCcc----e---eeehhh-hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Ccee----------- Confidence 654433332 2 444443 33443333 3347789999999999999997533200 0000 Q ss_pred CchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021303. 472 NPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERST 526 (637) Q Consensus 472 ~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~ 526 (637) +.+ ..++.++ .. |.|++.+.+ + T Consensus 374 -------~~~---~n~~~~~--------~~-------------gge~~~~~~--~ 395 (395) T protein:vir:98 374 -------YMT---KNYESVL--------ER-------------GGEVDEEVE--T 395 (395) T ss_pred -------eec---ccceecc--------cc-------------cCCCCCCCC--C Confidence 000 0111110 01 111111111 0 No 114 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=96.41 E-value=6.7e-05 Score=43.47 Aligned_cols=204 Identities=13% Similarity=0.188 Sum_probs=113.2 Q ss_pred hc-cCCCceeEEe-----cCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcC Q lcl|NC_021303. 169 IK-SKAGETAEIS-----LPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNN 242 (637) Q Consensus 169 i~-~k~g~~~~i~-----lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gn 242 (637) |+ .++|...++. ...|+.++|.. +=||++=.|+|..-..-=||+.+|+..+.. ...... -+.++..| T Consensus 1 ~r~~~dg~~~y~~~~~~~~~~g~~~~~~~--~eilH~r~~~~~~~~~Glspi~~a~~~i~~----~~aa~~-~~~~~f~N 73 (219) T protein:vir:98 1 MRVCKDGNYKYLMKKSLYDTKSEIYEYNK--NDVIFIKLYDPMQQVYGSPDYVGGITSALL----NSDATI-FRRRYYSN 73 (219) T ss_pred CceeecCeEEEEEecceecCCceeEEecc--ccEEEecCCCCCCCcceecHHHHHHHHHHH----HHHHHH-HHHHHHhc Confidence 33 2333333221 12255555554 346777778887777777898888765542 222211 22345556 Q ss_pred c-----eeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHh Q lcl|NC_021303. 243 G-----VLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHL 317 (637) Q Consensus 243 G-----vlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi 317 (637) | ||.+|. .. ....+.+.|.+-+.+ + . ++..+ =++++..|+..= T Consensus 74 g~~p~gil~~~~------~~------------------l~~e~~~~~~~~~~~----~-~--g~~n~-~~~~l~~~gg~~ 121 (219) T protein:vir:98 74 GAHMGFILYSTD------PD------------------MTEEMEDEIAERIRD----S-K--GVGNF-RSMFVNIAGGHP 121 (219) T ss_pred CCCCceEEEeCC------CC------------------CCHHHHHHHHHHHHH----h-c--Ccccc-cceeEecCCCCc Confidence 5 444442 11 111244555555432 1 1 22222 466777776433 Q ss_pred cccceeecCc-chhHHHHhhHHHHHHHHHhhcCCchhHhhccC---CcceeeeEEeccCceeEeechhHHHHHHHHHhHH Q lcl|NC_021303. 318 EKVQHIKFGN-EVTEVEIKTRIDAITRLAMGLDVSPERLLGMS---KGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDI 393 (637) Q Consensus 318 ~~ikHlkf~~-dvtevaiktR~daI~RlAmglDv~pErLLGls---~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~ 393 (637) +.++...+.- -.+.--+++|+-.+..+|.-.-|||. |+|+. .++.-++-|..-.=++..+.|.+..|.++|++.+ T Consensus 122 ~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp~-~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~ 200 (219) T protein:vir:98 122 DGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPPG-LSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDY 200 (219) T ss_pred cceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHH-HcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 4444444432 22445789999999999999999998 66874 2334455666666688889999999999999765 Q ss_pred HHHHHHHhCCChHHeEEeecCcccccCCCCCHHH Q lcl|NC_021303. 394 LTPLLAREGIDPTKYILWYDASGLTSDPDLSDEA 427 (637) Q Consensus 394 Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA 427 (637) |- ++..+ +-||. ++.+|-= T Consensus 201 ~~--------~~~~~-~~F~~------~~~~d~~ 219 (219) T protein:vir:98 201 EI--------KSALK-VNFKQ------PEKRDKN 219 (219) T ss_pred cC--------CCccE-EeecC------cccccCC Confidence 32 22222 34442 2222211 No 115 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=96.39 E-value=0.00057 Score=38.36 Aligned_cols=334 Identities=14% Similarity=0.150 Sum_probs=154.6 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccc----cccchhhHHHHHhhhhhhHhhHhhhhhccee Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGT----ARNEWQSEAWDFSESIGELSYYISWRANSCS 76 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~----~r~~WQ~eAW~~yd~VgELryyvgWr~~s~S 76 (637) |. ||-+...+ +.....++| ....-++|. .+..|-.+..+++ ..+-+|..--++ T Consensus 1 m~-------~~~~~~~~--~~~~~~~~~--------~~~~~~~~~p~~~~~~~~~~~~~~~~------~~~~~~~~pp~~ 57 (346) T protein:vir:10 1 MK-------KQLRKNLT--QNDRLQPQA--------QTEIFSFGDPIPVLDRADILNYLECS------AMYEKWYNPPMS 57 (346) T ss_pred CC-------cccCCCCC--ccccccccc--------CeEEEecCCcceecCchhHHHHHHHh------hcCCceEecCCC Confidence 32 22221111 111111111 111112221 1111222222211 111123222222 Q ss_pred eeEEE-EeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccc Q lcl|NC_021303. 77 RTTLI-PSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLA 155 (637) Q Consensus 77 r~rL~-aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~ 155 (637) +.-|- .-+..+.-+-+ +.+.++ .+..+.+ +=.+-+-..++ ++++.++-+-|..|+.+.-...|++ T Consensus 58 ~~~la~l~~~~~~h~~~---i~~k~n----~l~~l~~-~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i~r~~~G~~----- 123 (346) T protein:vir:10 58 FDGLAKSLRSSTHHESA---IITKAN----ILLSTCE-VDSRYLSRRDL-SSFVKDYLVFGNAYFEVVRNRLGQV----- 123 (346) T ss_pred HHHHHHHHHhhhhcchh---hhhhhh----hHHHHHh-CCCCCCCHHHH-HHHHHHHHhcCCeEEEEEEcCCCcE----- Confidence 11100 00111110111 111111 1111111 11234445555 5678888899999988764444431 Q ss_pred cccccceeeeHHHhcc--CCCceeE-EecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_021303. 156 APRARWYAVTREEIKS--KAGETAE-ISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (637) Q Consensus 156 ~~~~~W~~vt~~Ei~~--k~g~~~~-i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 232 (637) . ..+.+....+.. ..++... +...+|..++|..+ -||++=+|+|..-..--||..+++.++.--...++.-+ T Consensus 124 --~-~L~pl~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~--dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~ 198 (346) T protein:vir:10 124 --Q-RIESPLAKYVRKGLEAGQFYYVPQRFDHQEHEFAKG--SIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRR 198 (346) T ss_pred --E-EEEEecCCceEEEEcCCeEEEEEEccCCeEEEEecc--cEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHH Confidence 1 222233333331 2233333 34457888888653 35666677777666777888888877655444444433 Q ss_pred HHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_021303. 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (637) Q Consensus 233 na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~v 312 (637) +--+.=..-.|||.+|. +. ......+.|.+.|-+ +. ..+ ..=-+++.. T Consensus 199 ~~~~NG~~~~~il~~~d------~~------------------l~~e~~~~i~~~~~~----~~-g~~---n~~~~~vl~ 246 (346) T protein:vir:10 199 KYFLNGAHAGFVFYMSD------AS------------------QKQEDVENIRQQLKQ----SK-GVG---NFKNLFVHA 246 (346) T ss_pred HHHhccCCCceEEEeCC------CC------------------CCHHHHHHHHHHHHH----hc-Ccc---ccCceeEec Confidence 33333333445566653 11 011234445444432 22 111 111244455 Q ss_pred chHHhcccceeecCcc-hhHHHHhhHHHHHHHHHhhcCCchhHhhccCC---cceeeeEEeccCceeEeechhHHHHHHH Q lcl|NC_021303. 313 AAEHLEKVQHIKFGNE-VTEVEIKTRIDAITRLAMGLDVSPERLLGMSK---GNHWSAWAIGDEDVQLHIKPVMDLICQA 388 (637) Q Consensus 313 P~Ehi~~ikHlkf~~d-vtevaiktR~daI~RlAmglDv~pErLLGls~---~NHWsAW~I~dedVrlHI~P~me~ic~A 388 (637) |+..-+.+|-..+... .+.--+++|+-....+|...-|||. |+|+.+ ++.-++.+....-++.-|.|.++.|++ T Consensus 247 ~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~-llG~~~~~~~~~s~~e~~~~~f~~~~l~P~~~~iee- 324 (346) T protein:vir:10 247 PNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQ-LMGIIPNNTGGFGNVADAAEVFFITEIEPLQERLKE- 324 (346) T ss_pred CCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH- Confidence 5443344554444322 2233467889999999999999997 779843 344455555555666678999999986 Q ss_pred HHhHHHHHHHHHhCCChHHeEEeecCcccccCCC Q lcl|NC_021303. 389 IYNDILTPLLAREGIDPTKYILWYDASGLTSDPD 422 (637) Q Consensus 389 it~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD 422 (637) +++ +| |. +|+-|+...|-.--+ T Consensus 325 ~n~-~L-------~~----e~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 325 FNQ-WL-------GQ----EVIKFKPSKLLQRTQ 346 (346) T ss_pred HHh-hc-------cc----ceeeechhhhcccCC Confidence 332 22 11 367788887764433 No 116 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=96.34 E-value=0.00072 Score=37.80 Aligned_cols=383 Identities=13% Similarity=0.034 Sum_probs=160.0 Q ss_pred CCCCcceEEecC-CCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeE Q lcl|NC_021303. 1 MAATSLRVVRRP-KGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTT 79 (637) Q Consensus 1 ma~~~lr~vrrp-k~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~r 79 (637) |.=-. ++-++. +..+ ...-...++..+. ...+ ...-+.-.+.-+++.+|.+. T Consensus 1 Mgl~d-~~~~~~~~~~~--------------~~~~~~~~~~~~~-----------~~~l-~~~~v~~~i~~Ia~~ia~lp 53 (395) T protein:vir:96 1 MGILD-FFSFKKSGTLS--------------DDDSGSTTSEKLT-----------NVVL-KEDALYKCVNYLARIISKST 53 (395) T ss_pred Ccchh-hhcCCCCcccc--------------ccccccchhhhcc-----------hhhh-hhHHHHHHHHHHHHhhccce Confidence 33221 111110 0000 0000000110000 0011 22344456788999999988 Q ss_pred EEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCccccccccccc Q lcl|NC_021303. 80 LIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRA 159 (637) Q Consensus 80 L~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~ 159 (637) +..-+- |..+ . . .+.+..+.+.=..--+-..++++.++.+|-.-|++|+.+.-..... +.. T Consensus 54 ~~v~~~----~~~~---~-~----~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~-------~~~ 114 (395) T protein:vir:96 54 FRIKAP----EKLT---E-N----QKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGIY-------VAD 114 (395) T ss_pred eEEEeC----Cccc---c-c----cchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCcee-------cCC Confidence 876432 2211 1 1 1345555544344456888999999999999999998865322211 111 Q ss_pred cceeeeHHHhccCCCceeEEecCCCC-cccccCCCceE-EEEecCCcccccCCccchhhhhHHHHHHHhhhHHHHH-HHH Q lcl|NC_021303. 160 RWYAVTREEIKSKAGETAEISLPDGK-THEFNRDLDSL-VRIWNPRPRKASQATSPVRACLETLREIERTTRKIKN-AAK 236 (637) Q Consensus 160 ~W~~vt~~Ei~~k~g~~~~i~lPdG~-~he~~~~~d~l-~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~n-a~~ 236 (637) .|.. . ..+. ......+...++. .++|.. .|++ ||.=++.-+.. -++++. ..+++..+.-.+.. +.. T Consensus 115 ~~~~-~-~~~~--~~~~~~v~~~~~~~~~~~~~-~dvih~k~~~~~~~~~--~~~~~~----~~~~~~~~~i~~~~~~~~ 183 (395) T protein:vir:96 115 AFTQ-D-KKLS--GNKFKVSRVQGQTYEKIFTF-DQVIYLKNDNSDLMLK--VESLWE----EYGELLGHVINNQKIANQ 183 (395) T ss_pred cccc-c-cccc--cceeeeeeeccceeeeEecc-CceEEecccCCccccc--cccccc----hHHHHHHHHHHHHHHHHH Confidence 2211 1 1111 1111112221211 122222 2222 33222222221 233333 33333333222211 122 Q ss_pred hHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHH Q lcl|NC_021303. 237 SRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEH 316 (637) Q Consensus 237 SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Eh 316 (637) +|...++. -+. ..+.+. ....+.-..+.+.+++ +....+... ...-+++ .+ T Consensus 184 ~~~~~~~~--~~~---------~~~~~~--------~~~~~~~~~~~~~~~~-~~~~~~~~~----~~~~v~~--l~--- 234 (395) T protein:vir:96 184 IRFTMTPP--KDK---------VRERAQ--------ENSDGGRQPKSDKDFF-KRTIEKIRT----ESVVGIP--VT--- 234 (395) T ss_pred HHHHhhhc--ccc---------ccccee--------eccCchhhHHHHHHHH-HHHHHHhhc----CCcceEE--cc--- Confidence 34444432 000 000000 0001111112222222 222222221 1222222 22 Q ss_pred hcccceeecCcchhH-------HHHhhHHHHHHHHHhhcCCchhHhhccCCcceeeeEEeccCceeEeechhHHHHHHHH Q lcl|NC_021303. 317 LEKVQHIKFGNEVTE-------VEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAI 389 (637) Q Consensus 317 i~~ikHlkf~~dvte-------vaiktR~daI~RlAmglDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~ic~Ai 389 (637) +..+...+....++ --.+++++.+.-+|.-.-|||.-| |- +.-+..+....=++-.|.|.+..|+++| T Consensus 235 -~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l-~~---~~sn~e~~~~~f~~~~L~P~~~~ie~~l 309 (395) T protein:vir:96 235 -ANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLL-HG---DIADNQKNYELLLEGPIESLITNIVDGL 309 (395) T ss_pred -CCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHh-cC---CCccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22333333322221 123456788889999999988865 42 2223444444556778999999999999 Q ss_pred HhHHHHHHHHHhCCChHHeEEeecCcccccCCC-CCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHH Q lcl|NC_021303. 390 YNDILTPLLAREGIDPTKYILWYDASGLTSDPD-LSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADV 468 (637) Q Consensus 390 t~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD-~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~ 468 (637) ++.+|.+--...| |-+.+|. -|+.|.. +.+-+..+++.|.+|-.-.|+.+|++.-.+-. ..+.+ T Consensus 310 ~~~Ll~~~e~~~~-----~~f~~~~-l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~~~--gD~~~------- 374 (395) T protein:vir:96 310 EYAIFDKSETLEG-----SFIKVTG-LKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELPDGL--GKVLY------- 374 (395) T ss_pred HhhcCChhhhcCc-----eeEeecc-hhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cceee------- Confidence 9998764322222 2244442 2344443 33334457899999999999999998643200 00000 Q ss_pred hcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021303. 469 VTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERST 526 (637) Q Consensus 469 v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~ 526 (637) +..| ++| ++ .. |.|.+.+.+ + T Consensus 375 ~~~N------~~~--------~~--------~~-------------gge~~~~~~--~ 395 (395) T protein:vir:96 375 MTKN------YES--------VL--------ER-------------GGEVDEEVE--T 395 (395) T ss_pred eccc------cee--------ch--------hc-------------cCCCCCCCC--C Confidence 0000 001 00 00 111111111 0 No 117 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=96.21 E-value=0.00053 Score=38.52 Aligned_cols=324 Identities=12% Similarity=0.138 Sum_probs=153.2 Q ss_pred EEecCCCCCcccccchheehhccccchhhhhhhhccccc----ccchhh---HHHHHhh-------h--hhhHhhHhhhh Q lcl|NC_021303. 8 VVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTA----RNEWQS---EAWDFSE-------S--IGELSYYISWR 71 (637) Q Consensus 8 ~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~----r~~WQ~---eAW~~yd-------~--VgELryyvgWr 71 (637) .. |.|..+++..- +....++....|++. ..-++|.. ...|.. |||..-+ . .-+|-....+. T Consensus 1 ~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h 77 (351) T protein:vir:79 1 MS-KRRSRAPRTFA-AAPNPSAGSAAPARA-EVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHH 77 (351) T ss_pred CC-CCCCCCCCCCC-CCCchhhhhccccee-EEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHhh Confidence 22 22221110000 111111111111111 11122221 112222 3441100 0 00111111111 Q ss_pred hccee-eeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcc Q lcl|NC_021303. 72 ANSCS-RTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDP 150 (637) Q Consensus 72 ~~s~S-r~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~ 150 (637) ++.+. +..+.++.+.| ..-+-+.++ ++++.++-+-|.+|+.+.-...|++ T Consensus 78 ~~~l~~k~n~l~~~~~P----------------------------np~~t~~~f-~~~v~d~ll~Gnay~~~~r~~~G~~ 128 (351) T protein:vir:79 78 SSALFFKANVLASTFRP----------------------------HRWLSRHAF-ERWALDFLTFGNGYLERRRNMVGGT 128 (351) T ss_pred hhhhhhhhhHHhhcccC----------------------------CCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCCE Confidence 11110 11111222222 222445555 6788899999999988765544531 Q ss_pred ccccccccccceeeeHHHhcc-CCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhH Q lcl|NC_021303. 151 VTGLAAPRARWYAVTREEIKS-KAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTR 229 (637) Q Consensus 151 ~~~~~~~~~~W~~vt~~Ei~~-k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk 229 (637) . ..+.+...-+.. ++++.......+|..++|..+ -||++=+|+|.....--||..+++.++--=.-.++ T Consensus 129 -------~-~L~~l~~~~v~~~~~~~~~~~~~~~g~~~~~~~~--eIihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~ 198 (351) T protein:vir:79 129 -------L-RLEPALAKYVRRKADFSGFVYVNGWQERHEFEPD--SVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTL 198 (351) T ss_pred -------E-EEEEeCCcceeeeecCCeEEEEecCceEEEEcCc--cEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHH Confidence 1 233333344442 344444455667888888763 35666678887777777898888877653332222 Q ss_pred HHHHHHHhHhhcCc-----eeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCcccccc Q lcl|NC_021303. 230 KIKNAAKSRVMNNG-----VLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAA 304 (637) Q Consensus 230 ~I~na~~SRL~gnG-----vlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA 304 (637) -..|+..|| ||.+|. +. ......+.|.+.|-+ . ... .. T Consensus 199 -----~~~~~f~NGa~pg~il~~~~------~~------------------ls~e~~~~lk~~~~~-~----~G~---~N 241 (351) T protein:vir:79 199 -----FRRKYYENGSHAGFILYMTD------AA------------------QKQDDVDNMRDALKN-A----KGP---GN 241 (351) T ss_pred -----HHHHHHhccCCCceEEEecC------CC------------------CCHHHHHHHHHHHHH-h----cCc---cc Confidence 234455554 454443 11 111244555554432 1 111 12 Q ss_pred ccceeEeechHHhcccceeecCcchh-HHHHhhHHHHHHHHHhhcCCchhHhhccCCcce---eeeEEeccCceeEeech Q lcl|NC_021303. 305 YIPLVASVAAEHLEKVQHIKFGNEVT-EVEIKTRIDAITRLAMGLDVSPERLLGMSKGNH---WSAWAIGDEDVQLHIKP 380 (637) Q Consensus 305 ~vPiva~vP~Ehi~~ikHlkf~~dvt-evaiktR~daI~RlAmglDv~pErLLGls~~NH---WsAW~I~dedVrlHI~P 380 (637) .=.+++..|+..-+.+|-..++..-. .--+++|+-....+|...-|||.- +|+.+.|. -++.+....=++.-|.| T Consensus 242 ~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~l-lGi~~~~t~~~~n~e~~~~~f~~~~l~P 320 (351) T protein:vir:79 242 FRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQL-LGIVPSNSGGFGTPDTAARVFGRNEIRP 320 (351) T ss_pred cCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHH-hcccCCCCCCcccHHHHHHHHHHHHHHH Confidence 23455566654345566666554433 335689999999999999999864 59854443 44555555556667888 Q ss_pred hHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCcccccCCCCCHHHHHHHhcCCcCH Q lcl|NC_021303. 381 VMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITS 438 (637) Q Consensus 381 ~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~dPD~tdeA~~a~drGaIt~ 438 (637) .+..|.+ |++ +| |. +++-||..+|- ||.+++ T Consensus 321 l~~~ie~-ln~-~l-------g~----~~~~F~~~~ll--------------r~d~~a 351 (351) T protein:vir:79 321 LQARFAE-LND-WL-------GD----EVVTFDDYEIP--------------PAPVAA 351 (351) T ss_pred HHHHHHH-HHh-hc-------Cc----ceeeeChhhhc--------------cccccC Confidence 8888865 443 22 22 24566665542 222222 No 118 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=94.04 E-value=0.0056 Score=32.94 Aligned_cols=512 Identities=13% Similarity=0.072 Sum_probs=194.3 Q ss_pred CCCCcceEEecCCCCCc--ccccchheehhccccchhhhhh--hh--cccccccchhhHHHHHhhhhhhHhhHhhh--hh Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAP--AARRRSLTAASQLITDPQKQMK--TS--LMGTARNEWQSEAWDFSESIGELSYYISW--RA 72 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p--~~~r~~ltAAs~~~~~p~~~~k--~~--~~g~~r~~WQ~eAW~~yd~VgELryyvgW--r~ 72 (637) +|-.-++|+-+..-... ...+ ..+|-.....+...+. .+ .++-.-..|-...+.-++++| |.-+ +. T Consensus 75 iag~g~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tG----na~ieiIr 148 (651) T protein:vir:99 75 EVGFGFDLVPAQGVDGDDASDAQ--REVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVG----WLALEMLT 148 (651) T ss_pred hhccCceeeecccCCCCccchHH--HHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHh----hHhhhhhh Confidence 44444555433321111 0000 0011111100000000 00 000001122223332222222 1111 11 Q ss_pred cceee-eEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCc-- Q lcl|NC_021303. 73 NSCSR-TTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKD-- 149 (637) Q Consensus 73 ~s~Sr-~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~-- 149 (637) +..++ ++|| .+++.+.+ +...+.+..+....+...--.+.+.-.. .++++. +..-|..|+.+.-.-.+. T Consensus 149 n~~g~pv~L~--~lp~~~~R----v~~~~~~~~~~~~~ll~~~pn~~~~~~~-~~~~~q-~~~~~~~~~~~~g~~~~~~~ 220 (651) T protein:vir:99 149 DIEGRPVGLA--YVPARTVR----VRRPQNRFDQPRHPEEGRYVDGDVADIA-SRGYVQ-IRNGNRRYFGEAGDRYRGQE 220 (651) T ss_pred cCccchhhhh--hcChhhee----eecccccccchhhhhhhcccccccchhH-HHHHHH-HHhcCcceEEEeecccccee Confidence 22221 1222 23333221 1000111111111111110011111111 111221 222344554332111100 Q ss_pred ---cccccccc----cccceeeeHHHhccCCCceeEEecCCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHH Q lcl|NC_021303. 150 ---PVTGLAAP----RARWYAVTREEIKSKAGETAEISLPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLR 222 (637) Q Consensus 150 ---~~~~~~~~----~~~W~~vt~~Ei~~k~g~~~~i~lPdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~Lr 222 (637) ...+..+. ...|......-+ ...+-.+...++...+..+.. =||++=.+.+..-..--||+..++..+. T Consensus 221 ~~~~~~~~~v~~~~~~d~~~~~~~~~~---~~~~g~~~~~~~~~~~~~~~~-eViHir~~~~~~g~~G~spl~~a~~~i~ 296 (651) T protein:vir:99 221 VVIDESGDEPTIRYREDEESEREPIFV---DRETGDVTTGDANGLENRPAN-ELIFIPNPSILEDDYGVPDWVSAIRTIS 296 (651) T ss_pred eeeccCCcceeEEeccCcceeeeeecc---cceeeeEEEcCCCceeEeccc-ceEEecCCCCCCCcccccHHHHHHHHHH Confidence 00000000 001111110000 011111222333333332323 3455544555555566788888887776 Q ss_pred HHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCcccc Q lcl|NC_021303. 223 EIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQ 302 (637) Q Consensus 223 EI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~ 302 (637) =-.-..+...+..+.-....|||.+|... ......+.|.+.|-+.. .. T Consensus 297 ~a~~a~~~~~~~f~NG~~p~gil~~~~~~------------------------ls~e~~~~lr~~~~~~~----~n---- 344 (651) T protein:vir:99 297 ADEAAKDYNRDFFDNDTIPRMVIKVTGGE------------------------LSEESKRDLRQMLNGLR----EE---- 344 (651) T ss_pred HHHHHHHHHHHHHhccCCCceEEEecCCC------------------------CCHHHHHHHHHHHHHHh----cc---- Confidence 55555555555555555666777776411 12224566777665432 22 Q ss_pred ccccceeEeechHHh-------cccceeecCcchhHHHHhhHHHHHHHHHhhcCCchhHhhccC-CcceeeeEEeccCce Q lcl|NC_021303. 303 AAYIPLVASVAAEHL-------EKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMS-KGNHWSAWAIGDEDV 374 (637) Q Consensus 303 AA~vPiva~vP~Ehi-------~~ikHlkf~~dvtevaiktR~daI~RlAmglDv~pErLLGls-~~NHWsAW~I~dedV 374 (637) +-=++|+..++..- -+++.|.+..--+.--+++|+..+..+|...-||| .+||+. ++|+-++-+....-+ T Consensus 345 -agk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~afgVPp-~~lG~~~~~~~sn~E~~~~~f~ 422 (651) T protein:vir:99 345 -SHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHEIAKVLEVPP-VKIGVTDSANRSNSDQQDKDFA 422 (651) T ss_pred -CCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHHhCCCH-HHhccCCCCCcccHHHHHHHHH Confidence 23455555543211 23444444322233348899999999999999987 566884 688888888888888 Q ss_pred eEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEeecCccc-ccCCCCCHHHHH-HHhcCCcCHHHHHHHhcCccccC Q lcl|NC_021303. 375 QLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGL-TSDPDLSDEAVE-AHDRGAITSAALRRLLNVGEDSG 452 (637) Q Consensus 375 rlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~L-t~dPD~tdeA~~-a~drGaIt~eAlrr~lgl~~d~~ 452 (637) +.-|.|.+..|+++|+..+|.......| .+|-+=||...| ..|+-...+++. ++..|++|-.-.|+++|++.-++ T Consensus 423 ~~tL~P~~~~ie~eln~kLl~~~e~~~~---~~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~ 499 (651) T protein:vir:99 423 LEVIQPEQHTFAEWLYQIIHQQALGVTD---WTIEYELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGE 499 (651) T ss_pred HHHHHHHHHHHHHHHHHhhcCccccccC---ceEEEEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 8889999999999999998877544322 244555666554 455544445444 77789999999999999864321 Q ss_pred CCCCchHHHHHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccCCC Q lcl|NC_021303. 453 YDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEAASL 532 (637) Q Consensus 453 yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~a~~ 532 (637) + |... .+.|+..... .. +...++. .... +|+.+......+..+. T Consensus 500 -----~-----~gd~----------~l~~~~~~~~-----g~---~~~gge~-~~~~-------~~~~~~~~~~~e~~~~ 543 (651) T protein:vir:99 500 -----P-----YGEM----------TLSEFEAEVA-----GD---VAGGGET-EAVH-------EPPEENKIGEREWDTV 543 (651) T ss_pred -----c-----cccc----------cccccccccc-----cc---cccCCCC-cccc-------cCccccccccchhhhh Confidence 0 0000 0111100000 00 0001110 0000 0111100000000000 Q ss_pred Ccc----hHH-HHHHHHHHHHHHHH-hcc----cccC--CCchhhhhHhhcCchhhhhhhcCC--------------CCH Q lcl|NC_021303. 533 NDR----AAY-LVAERLLVNRALDL-AGK----RRFK--VNDAALKTKLRDVPAHEYHRVLPP--------------VRS 586 (637) Q Consensus 533 ~~~----a~~-~aa~~llV~rALel-AGk----Rr~~--~~~~~~~~rlr~ip~h~~h~~~~P--------------V~~ 586 (637) .+. ..+ .--|.--.-++..- ++. .+.. ++.-. -.+..+||+..|--.|.- -+- T Consensus 544 ~~~~~~~e~~~~~~v~ss~~~~~gyd~~~~~l~~~f~~~~~~~~-~y~y~~v~~~~~~~~~~a~s~g~~~~~~i~~~~~~ 622 (651) T protein:vir:99 544 KSELTTKDPIEQMQFSSSNLDEGLYDFGENELYLSFLRDEGQSS-LYAYVDVPASEWSALANAGSHGGYHYDNIRLEYPY 622 (651) T ss_pred hhhhcccchhhhhhHHHHHHHhhcCCCccceEEEEEeecCCCCc-eeeeeCCCHHHHHHHhcCcccceeehhccccccch Confidence 000 000 00000000000000 000 0000 00000 133344444333211110 011 Q ss_pred HHH----HHHHhc-------ccccccHHH Q lcl|NC_021303. 587 SEI----PRLIAG-------WDTALEDEV 604 (637) Q Consensus 587 ~~v----~rLi~G-------Wd~~ld~~~ 604 (637) +.| +||-.| -.+.+.|+| T Consensus 623 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 651 (651) T protein:vir:99 623 LEITNFHDRLPEGPAPDAGDVPDGVPDEI 651 (651) T ss_pred hhhhhhhhhCCCCCCCCcCCCCCCCcccC Confidence 111 222222 122233344 No 119 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=93.41 E-value=0.0076 Score=32.19 Aligned_cols=444 Identities=16% Similarity=0.185 Sum_probs=167.4 Q ss_pred CCCCcceEEecCCCC-----------Ccccccchheehhccccchhh-h------------hhhhcccccccchhhHHH- Q lcl|NC_021303. 1 MAATSLRVVRRPKGS-----------APAARRRSLTAASQLITDPQK-Q------------MKTSLMGTARNEWQSEAW- 55 (637) Q Consensus 1 ma~~~lr~vrrpk~~-----------~p~~~r~~ltAAs~~~~~p~~-~------------~k~~~~g~~r~~WQ~eAW- 55 (637) ||-|.-- -||.-+ .-++.|..+.+|=+-..||.. . +....++.+.+.+- =.| T Consensus 1 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~-~~~~ 77 (532) T protein:vir:94 1 MADTDPT--PRPEITYATLQQAQRVDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNA-LSFV 77 (532) T ss_pred CCCCCCC--CCcceehhhhhhHhhhhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCcccccccc-cccc Confidence 6554210 001000 012223344444221112211 0 00011111111110 012 Q ss_pred -----------HHhhhhhhHhhHhhhhhcceeeeEEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHH Q lcl|NC_021303. 56 -----------DFSESIGELSYYISWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAAL 124 (637) Q Consensus 56 -----------~~yd~VgELryyvgWr~~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqL 124 (637) .+|..-++.|=+|.=.+.-|-|-=.-...-+.+ +++ ...+..|-..+- +|.--+. T Consensus 78 ~~~~~~~~~l~a~Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~--------~~~----~~~~~~i~~~~~--~l~v~~~ 143 (532) T protein:vir:94 78 EATSWPGFPTLALLAQLPEYRTMHETPADECVRAWGKITCSSKD--------ELA----ADKATRITQKLE--QYNVRTL 143 (532) T ss_pred cccccchHHHHHHHHcCchhhhhhccchHHHhhCCceEeeCCcc--------ccc----hHHHHHHHHHHH--hhhHHHH Confidence 233333333333333333343322222211100 011 112222211111 1222233 Q ss_pred HHHHHhhhcccccEEEEEEeecCCcc--ccccccccccceeeeHHHhccCCCceeEE---------------ecCC---- Q lcl|NC_021303. 125 IKRAVECMTVVGEVWIAVLIRQEKDP--VTGLAAPRARWYAVTREEIKSKAGETAEI---------------SLPD---- 183 (637) Q Consensus 125 lkr~~~~LtVpGE~wi~il~r~~~~~--~~~~~~~~~~W~~vt~~Ei~~k~g~~~~i---------------~lPd---- 183 (637) |+.+...=-+=|-.+|+|+.+.+|.. .+.+ +.+..+-|+ .|+-..| ..|. T Consensus 144 l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p-------~~l~~~~I~--~g~~~~l~vld~~~v~p~~~~~~dp~sp~f 214 (532) T protein:vir:94 144 VRTVVIHDQAYGGAHVFPHLKMDGDSVPADAP-------LLLSPSFVQ--RGCLIGFATIEPMWLSPNAYNATDPTLPSF 214 (532) T ss_pred HHHHHHhhhcccceEEEEEeccCCcccccccc-------ccccccccc--cceeeEEEeechheeccccccccccccccc Confidence 33333333588999999998765521 1100 111111121 1111000 0110 Q ss_pred CCccc--ccCC----CceEEEE-ecCCccccc-----CCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccC Q lcl|NC_021303. 184 GKTHE--FNRD----LDSLVRI-WNPRPRKAS-----QATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEM 251 (637) Q Consensus 184 G~~he--~~~~----~d~l~Rv-W~P~prra~-----eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~ 251 (637) |.-.. ...+ -+-||++ =+|-|...+ --.|-.+.+++.|+=..++...+..-..+ ..+.+. .+ T Consensus 215 g~P~~y~v~~g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~----~~~~v~--k~ 288 (532) T protein:vir:94 215 YKPDSWIATSGKKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQ----FSMTNL--AT 288 (532) T ss_pred CCceeEEEccCeeeccceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHh----cCCcee--ee Confidence 10000 0001 1334543 234443322 34677788888887776666555432221 111111 01 Q ss_pred CCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCcchhH Q lcl|NC_021303. 252 SLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTE 331 (637) Q Consensus 252 slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvte 331 (637) .+ ++.. +....+.|++-+-.+.+ .++ ..=.+++....|.++.++ +.|. ++++ T Consensus 289 ~~--a~~l-----------------s~~~~~~~~~r~~~~~~--~~~-----n~g~~~id~~~e~~e~~~-~~ls-gl~~ 340 (532) T protein:vir:94 289 DM--AQLL-----------------APGGAQSLDARLQLFNL--YRD-----NRNIGALDKGTEEIQQTN-TPLS-GLDS 340 (532) T ss_pred ch--HHhh-----------------cchhHHHHHHHHHHHHh--hcC-----CccceEEcCCCceeEEEe-cccC-CHHH Confidence 11 1100 00112333332221111 111 111244444556666666 6676 5777 Q ss_pred HHHhhHHHHHHHHHhhcCCchhHhhccC--CcceeeeEEeccCceeEeechhHHHHHHHHHhHHHHHHHHH--------- Q lcl|NC_021303. 332 VEIKTRIDAITRLAMGLDVSPERLLGMS--KGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAR--------- 400 (637) Q Consensus 332 vaiktR~daI~RlAmglDv~pErLLGls--~~NHWsAW~I~dedVrlHI~P~me~ic~Ait~~~Lr~~L~~--------- 400 (637) +--... ..+|...+||--+|+|.+ +-| .-+++|++- .--.|+++-+++|+|+|+. T Consensus 341 ~l~~~~----~~iAaa~~IP~t~LfG~sp~Gln-----stGe~D~~~-----yyd~I~s~Qe~~l~p~le~l~~~l~~s~ 406 (532) T protein:vir:94 341 LQAQSQ----EQMAAVSHIPLVKLLGITPNGLN-----ASSDGEIRV-----WYDFIAGYQATNLTPLMEWIIDLIQLSE 406 (532) T ss_pred HHHHHH----HHHHhHhCCCeeeeecCCccccc-----ccchHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 654444 469999999999999984 233 124555433 2223344444444444332 Q ss_pred hCC-ChHHeEEeecCccccc------CCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCc Q lcl|NC_021303. 401 EGI-DPTKYILWYDASGLTS------DPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNP 473 (637) Q Consensus 401 eGi-Dp~kYvvw~DaS~Lt~------dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P 473 (637) -|. ||+-++.|-+.-+++- .=.+.+.+..+++.|+|+.+..|+.|+.....+|+....+. +-....+ T Consensus 407 ~g~~~~d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~------~~~~~~~ 480 (532) T protein:vir:94 407 YGQIDPGLAWEWSPLMELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADPTSGYAGALGER------DELDDVE 480 (532) T ss_pred cCCCCCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCCccccccccccc------ccccccc Confidence 233 7787777765433321 11123344678999999999999999998888776432221 0011112 Q ss_pred hhHHHHHhhhccccccccCCCCcCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccCC-CCcchHHHHHHHHHHHHHHHH Q lcl|NC_021303. 474 ELIAMYAPLLSSQLAGIEFPQPANAIESTREEDDEDSGARQQREPQTEDERSTEEAAS-LNDRAAYLVAERLLVNRALDL 552 (637) Q Consensus 474 ~Li~~~apLl~~~~~~ie~P~p~~a~~~~~~~~d~~~~a~~g~EPdted~~~~~~~a~-~~~~a~~~aa~~llV~rALel 552 (637) +......+ ...+.|++. +.++.+..+. ++|+-+..+.+.++. .+.+.+ T Consensus 481 ~~~~~~~~------~~~~~~~~~---~~~~~~~~~~-------~~d~~~~~~~~~~~~~~~~~~~--------------- 529 (532) T protein:vir:94 481 EIAKQLMA------AALNPPATA---PQTPNPQPDS-------EDDQTDNQPDAQADPAQNDQPV--------------- 529 (532) T ss_pred chhhhhcc------cccCCCCCC---CCCCCCCCCC-------CCCCCCCccCCCccccccCCCc--------------- Confidence 21111111 111111111 1111111111 111111111111110 011111 Q ss_pred hccc Q lcl|NC_021303. 553 AGKR 556 (637) Q Consensus 553 AGkR 556 (637) |.| T Consensus 530 -~~~ 532 (532) T protein:vir:94 530 -GNR 532 (532) T ss_pred -CCC Confidence 111 No 120 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=47.32 E-value=0.71 Score=21.41 Aligned_cols=434 Identities=11% Similarity=0.047 Sum_probs=167.3 Q ss_pred CCC----------C-cceEEe------cCCCCCcccccchheeh--hccccchhhhhhhhcccccccchhh--------- Q lcl|NC_021303. 1 MAA----------T-SLRVVR------RPKGSAPAARRRSLTAA--SQLITDPQKQMKTSLMGTARNEWQS--------- 52 (637) Q Consensus 1 ma~----------~-~lr~vr------rpk~~~p~~~r~~ltAA--s~~~~~p~~~~k~~~~g~~r~~WQ~--------- 52 (637) |++ + .++..+ .|...-|. +..++..+ -+-..+...+|....+..+++.+.. T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 103 (537) T protein:vir:10 25 VGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMP-KVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIG 103 (537) T ss_pred cCCCcccchhhHHHHHHhhhhccCCCCCccCcccc-cccccccchhccccccchhhhhhhccccccchhhhhccccCCcc Confidence 221 1 122111 22222111 11122111 2223344555554433333444321 Q ss_pred -HHHHHhhhhhhHhhHhhhhhcceeee-EEEEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHh Q lcl|NC_021303. 53 -EAWDFSESIGELSYYISWRANSCSRT-TLIPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVE 130 (637) Q Consensus 53 -eAW~~yd~VgELryyvgWr~~s~Sr~-rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~ 130 (637) +.=.+|..-++.|=+|.=.+.-|.|. +-+-++ |.+ +.....+..|-+.+- .|.--+.++.+.. T Consensus 104 ~~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~-~~~------------~~~~~~~~~l~~~~~--~l~~~~~l~~a~~ 168 (537) T protein:vir:10 104 HQMCALIATHWLVNKACSQMPRDAMRKGYKIISD-DGN------------ELDPKDAKFIDRYDR--AFNIKKHAIQFVR 168 (537) T ss_pred HHHHHHHHhCchhhhhhhhhhHHhhcCCceeecC-Ccc------------cccHHHHHHHHHHHH--HhhHHHHHHHHHH Confidence 01123333344444444444433222 222111 100 011122333322222 2444455666666 Q ss_pred hhcccccEEEEEEeecCC-ccccccc--------------cccccceeee-HHHhcc-----CCCceeEEecCCCCcccc Q lcl|NC_021303. 131 CMTVVGEVWIAVLIRQEK-DPVTGLA--------------APRARWYAVT-REEIKS-----KAGETAEISLPDGKTHEF 189 (637) Q Consensus 131 ~LtVpGE~wi~il~r~~~-~~~~~~~--------------~~~~~W~~vt-~~Ei~~-----k~g~~~~i~lPdG~~he~ 189 (637) .--+=|-.+|+|+....+ .....+- +....|.... ..++.. .-|......+ .|.+ + T Consensus 169 ~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v-~g~~--i 245 (537) T protein:vir:10 169 KGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLI-NGKK--Y 245 (537) T ss_pred hcccccceEEEEeecCcCCcccccccccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeee-cCeE--e Confidence 656779999998875333 2111111 1111122211 122211 0011111111 1111 1 Q ss_pred cCCCceEEEEe-cC-----CcccccCCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccc Q lcl|NC_021303. 190 NRDLDSLVRIW-NP-----RPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAG 263 (637) Q Consensus 190 ~~~~d~l~RvW-~P-----~prra~eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~ 263 (637) . -+-||++= +| .|....--.|-.+.+++.|.=..+++..+..... ..+-.+| .+.+.. T Consensus 246 H--~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~---~~~~~v~---k~~~~~-------- 309 (537) T protein:vir:10 246 H--RSHLAIYINDEVVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAM---TKRQTVL---KVDAAQ-------- 309 (537) T ss_pred c--ceeEEEecCCCCchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHH---hcCCcee---eechHH-------- Confidence 1 13445431 11 2222233577788888887766665554432221 1111111 011100 Q ss_pred cccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEeechHHhcccceeecCcchhHHHHhhHHHHHHH Q lcl|NC_021303. 264 QAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITR 343 (637) Q Consensus 264 ~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~vP~Ehi~~ikHlkf~~dvtevaiktR~daI~R 343 (637) .+.+ .+.|.+.+.... .++|-. =.+++....|.++.++ +.|+ ++++ +.+.+... T Consensus 310 ----------~l~~---~~~~~~r~~~~~--~~r~n~-----g~~~id~e~e~~e~~~-~~ls-gl~~----~l~~~~~~ 363 (537) T protein:vir:10 310 ----------VLAN---KQQFDETMSWWT--ATRDNY-----QVRVVDKDNEDVVQID-TTLN-DLDK----VIMNQYQL 363 (537) T ss_pred ----------hhcC---HHHHHHHHHHHH--hhcCCc-----ceeEecCCCceeEEEe-ccCC-CHHH----HHHHHHHH Confidence 0011 123433322222 122211 1144544556555555 4454 3554 45666677 Q ss_pred HHhhcCCchhHhhccC--CcceeeeEEeccCce----------eEeechhHHHHHHHHHhHHHHHHHHHhCCChHHeEEe Q lcl|NC_021303. 344 LAMGLDVSPERLLGMS--KGNHWSAWAIGDEDV----------QLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILW 411 (637) Q Consensus 344 lAmglDv~pErLLGls--~~NHWsAW~I~dedV----------rlHI~P~me~ic~Ait~~~Lr~~L~~eGiDp~kYvvw 411 (637) +|...+||--+|+|.+ +.| .-+++|+ |-++.|.|+.|.+-|.+.. -|-+++=.+.| T Consensus 364 iAa~~~IP~t~L~G~sp~Gln-----atGe~D~~~yyd~I~~~Qe~l~p~l~~l~~ll~~~~-------~~~~~~~~i~f 431 (537) T protein:vir:10 364 VCAIARTPAPKMLGTVPTGFN-----STGDYEEASYHEECESTQDDMRPLIDRHHQLVCRSH-------LRKRIRVKVEF 431 (537) T ss_pred HHhhhCCCceeeccCCccccc-----cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------CCCCcceEEEe Confidence 9999999999999985 333 2245454 3345666665544433221 12244433555 Q ss_pred ecCcccc------cCCCCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhcc Q lcl|NC_021303. 412 YDASGLT------SDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSS 485 (637) Q Consensus 412 ~DaS~Lt------~dPD~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~ 485 (637) -+.-+++ +.=.+.+.+..+++.|+|+.+..|..|+-..+.+|+-- .|-++. T Consensus 432 ~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l-----------------------~~~~~~ 488 (537) T protein:vir:10 432 PPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSI-----------------------TPAMRP 488 (537) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccc-----------------------cCCCCh Confidence 4433221 11122345678899999999999999998766665510 000000 Q ss_pred ccccccCCCCcCCCCCCCC---CCCCCCCCCCCCC-ccCCCCCCCcccCCCCcc Q lcl|NC_021303. 486 QLAGIEFPQPANAIESTRE---EDDEDSGARQQRE-PQTEDERSTEEAASLNDR 535 (637) Q Consensus 486 ~~~~ie~P~p~~a~~~~~~---~~d~~~~a~~g~E-Pdted~~~~~~~a~~~~~ 535 (637) ..++...-....++.+. +.++.+.++..++ .++.+. ....++ ... T Consensus 489 --ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~--~~~ 537 (537) T protein:vir:10 489 --TDAEDIDVDDEGKPVRIIEDQPAPSEMFGATSSGESANDP-RDSGAA--FED 537 (537) T ss_pred --hhhhcccCCccCCcCCCCCCCCCccccCCCCccccccCCC-ccCccc--cCC Confidence 01110000000000000 0011111111111 111111 111111 111 No 121 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=37.22 E-value=1.1 Score=20.29 Aligned_cols=451 Identities=12% Similarity=0.089 Sum_probs=178.4 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccc---------cccchhhHHHHHhhhhhhHhhHhhhh Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGT---------ARNEWQSEAWDFSESIGELSYYISWR 71 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~---------~r~~WQ~eAW~~yd~VgELryyvgWr 71 (637) |+=- .||. +.+++....|.+..- +..+-...|. -=-.|..|-++.|..==...+|-++- T Consensus 1 m~~~------~~~~--v~~~h~~y~a~~~~W----~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~ 68 (513) T protein:vir:97 1 MADK------DPKS--PATTSGAYDQMLPRW----HVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMV 68 (513) T ss_pred CCCC------CCCC--CCcCCHHHHHHHHHH----HHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChH Confidence 4432 2232 222232222221111 1111111121 01123334444443322334455555 Q ss_pred hcceeeeEEEEeeeccccCCCCCcccCCCCcccchHHH-HHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcc Q lcl|NC_021303. 72 ANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQIVAD-YVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDP 150 (637) Q Consensus 72 ~~s~Sr~rL~aseiD~DtG~PtG~v~~e~~~~~~rv~~-iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~ 150 (637) +..|...-=.+..-+|. .+.+-+ ....+ +.+.+=+-=..-.++++++....-+-|-+||.+-.-..+.+ T Consensus 69 ~~tl~~l~G~vf~k~p~---------~~~~~p-~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~ 138 (513) T protein:vir:97 69 EQTLDTLSGKPFSEPIK---------LNEDVP-KAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPR 138 (513) T ss_pred HHHHHHHhhhhhhcCcc---------cCcCch-HHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCc Confidence 44443322112222221 111111 12222 33444333345678899998888888888877643222111 Q ss_pred cccc---------ccccccceeeeHHHhcc-----CCCceeE--E------ecCCCCcccccCCCceEEEEecCCccccc Q lcl|NC_021303. 151 VTGL---------AAPRARWYAVTREEIKS-----KAGETAE--I------SLPDGKTHEFNRDLDSLVRIWNPRPRKAS 208 (637) Q Consensus 151 ~~~~---------~~~~~~W~~vt~~Ei~~-----k~g~~~~--i------~lPdG~~he~~~~~d~l~RvW~P~prra~ 208 (637) .++. ...+.-++.++.++|-. -+|.... + +.+|| |..+.---+|||+|.-=+-+ T Consensus 139 ~~~~~~T~Ade~~~~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dg----f~~~~~~q~rvL~~g~~~v~ 214 (513) T protein:vir:97 139 EDGQPRTLADDRREGLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDG----FAEVCKRRIRVLEPGLVQLW 214 (513) T ss_pred cchhHHhHHHHHhhccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeecCC----CcceEEEEEEEEeCceEEEE Confidence 0110 11134588889999864 1232221 2 22454 22222223555554321111 Q ss_pred CCccchhhhhHHHHHHHhhhHHHHHHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCch--------h Q lcl|NC_021303. 209 QATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVP--------A 280 (637) Q Consensus 209 eaDSPvra~l~~LrEI~rttk~I~na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~--------~ 280 (637) .........- .|.+... --++++=+||=-.-...... +...-||+-.-+ . T Consensus 215 r~~~~~~~~~---~e~~~~~----------~g~~~l~~IP~v~~~~~~~~---------~~~~~pPLl~LA~ln~~hy~~ 272 (513) T protein:vir:97 215 EPVKKSNAQK---EEWALAD----------EWATGLNYVPLVTFYADRQG---------FMMGKPPLLDLAHLNVAHWQS 272 (513) T ss_pred EeecCCCccc---cceEEec----------CCCCcCCceeEEEEecCCCC---------CCCCccchHHHHHHHHHHHhh Confidence 1111000000 0110000 00122223332110111111 122223322111 1 Q ss_pred HHHHHHHHHHHH--h---hcccCccccccccceeEe------echHHhcccceeecC-cchh--HHHHhhHHHHHHHHHh Q lcl|NC_021303. 281 SEQLATMIYQAS--V---AAMEDENSQAAYIPLVAS------VAAEHLEKVQHIKFG-NEVT--EVEIKTRIDAITRLAM 346 (637) Q Consensus 281 ~~~L~~ml~~va--~---aai~De~S~AA~vPiva~------vP~Ehi~~ikHlkf~-~dvt--evaiktR~daI~RlAm 346 (637) ..++.++|+.++ + ..+.+++. -||.+. .|+ --.+.+.+.|. +-+. ...+|..++-|+++.. T Consensus 273 ~Sd~~~il~~~~~P~l~~~G~~~~~~----~~i~iG~~~~~~lpe-~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga 347 (513) T protein:vir:97 273 ASDQRHILTVSRFPILACSGASGEDS----DPVVVGPNKVLYNPD-PAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGA 347 (513) T ss_pred hhhHHHHHHhcccceeeeecCCcCCC----CceEeeccccccCCC-CCCcceeeccCchhHHHHHHHHHHHHHHHHHHHH Confidence 123444444443 2 22333321 123221 231 11345666665 4333 4567777777766643 Q ss_pred hcCCchhHhhccCCcceeeeEEeccCceeEeechhHHH---HHHHHHhHHHHHHHHHhCCChHHeEEeecCccccc--CC Q lcl|NC_021303. 347 GLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDL---ICQAIYNDILTPLLAREGIDPTKYILWYDASGLTS--DP 421 (637) Q Consensus 347 glDv~pErLLGls~~NHWsAW~I~dedVrlHI~P~me~---ic~Ait~~~Lr~~L~~eGiDp~kYvvw~DaS~Lt~--dP 421 (637) +||.-+..| =||=+..-+.=+.| =+|.. -|+...+++|+-+-+=+|++++.+.|.+...=... ++ T Consensus 348 -------~ll~~~~~~-~Ta~a~~~~~~~~~--S~L~~~a~~le~al~~~l~~~a~wlg~~~~~~~v~in~dF~~~~~~~ 417 (513) T protein:vir:97 348 -------EFLKRKTGG-QTATARALDSAEAT--SDLSAMTGLFEDALAQALDITADWLRLGPNGGTVELVKDYDLEEMDA 417 (513) T ss_pred -------HhhccCCcc-ccHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhCCCCCccEEEeccccCcccCCH Confidence 444433333 33333332222211 12333 34556678899998889999988888875533222 33 Q ss_pred CCCHHHHHHHhcCCcCHHHHHHHhcCccccCCCCCchHHHHHHHHHHhcCCchhHHHHHhhhccccccccCCCCcCCCCC Q lcl|NC_021303. 422 DLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIES 501 (637) Q Consensus 422 D~tdeA~~a~drGaIt~eAlrr~lgl~~d~~yd~~t~eg~r~~A~d~v~~~P~Li~~~apLl~~~~~~ie~P~p~~a~~~ 501 (637) .-.+.-.+++..|.||.++|+++|--..==..|++..+-|.+.+-+.-.. +.+ ....+....-+++....-. T Consensus 418 ~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~-~~~-------~~~d~~~~~~~~~~~~~~~ 489 (513) T protein:vir:97 418 PGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEA-MGR-------AGLDLDPAQKNPPEGGEGE 489 (513) T ss_pred HHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhc-cCC-------CCccccccCCCCCCCCCCC Confidence 34455578999999999999998854333334566665554444332111 000 0000000010111111111 Q ss_pred CCCCCCCCCCCCCCCC---ccCCC Q lcl|NC_021303. 502 TREEDDEDSGARQQRE---PQTED 522 (637) Q Consensus 502 ~~~~~d~~~~a~~g~E---Pdted 522 (637) ++.+.+..++++.|+. |.-|. T Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 490 GEGEGEGGEGGEGGEGGGNPGGES 513 (513) T ss_pred CCCCCCCCCCCCccccCCCCCCCC Confidence 1112222223332221 22221 No 122 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=26.30 E-value=2 Score=18.98 Aligned_cols=243 Identities=7% Similarity=0.041 Sum_probs=117.6 Q ss_pred CCCCcceEEecCCCCCcccccchheehhccccchhhhhhhhcccccccchhhHHHHHhhhhhhHhhHhhhhhcceeeeEE Q lcl|NC_021303. 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYISWRANSCSRTTL 80 (637) Q Consensus 1 ma~~~lr~vrrpk~~~p~~~r~~ltAAs~~~~~p~~~~k~~~~g~~r~~WQ~eAW~~yd~VgELryyvgWr~~s~Sr~rL 80 (637) |.=-..+- .|....++...- .. ...+-+ ..|.....+ +... +-..+-+.=.+.-++++++++-| T Consensus 1 MglF~~~~-~r~~~~~~~~~~-~~----------~~~~~~-~~~~~~~~v-~~~~--al~~~~v~~~i~~ia~~iA~lp~ 64 (251) T protein:vir:46 1 MGIFYKNE-KRDLQYNEDDLQ-MM----------VQTLPS-FQGTKLRQY-KDIE--AIRHSDIFTAVMMIASDLARMPI 64 (251) T ss_pred CCcccccc-ccccCCCccchh-hh----------hhhhcc-ccCcCccee-chhh--hhccHHHHHHHHHHHHhHhhCce Confidence 65433221 221111111000 00 000000 011111111 0111 01223344456678888888877 Q ss_pred EEeeeccccCCCCCcccCCCCcccchHHHHHHHhccCcccHHHHHHHHHhhhcccccEEEEEEeecCCcccccccccccc Q lcl|NC_021303. 81 IPSAIDPDTGLPTGEVDIEEDPDAQIVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (637) Q Consensus 81 ~aseiD~DtG~PtG~v~~e~~~~~~rv~~iv~~iAgG~lGqaqLlkr~~~~LtVpGE~wi~il~r~~~~~~~~~~~~~~~ 160 (637) ..-+ + ++ . +. .+.+..++..=-.--+...++++.++.+|-+-|+.|+.+.-...|. --. T Consensus 65 ~~~~---~-~~----~-~~----~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~--------~~~ 123 (251) T protein:vir:46 65 RVTV---N-GQ----I-NY----SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGE--------PMN 123 (251) T ss_pred EEee---C-cc----c-cc----cchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc--------EEE Confidence 6643 1 11 1 11 2455566554445567788999999999999999998876544443 124 Q ss_pred ceeeeHHHhcc---CCCceeE-Eec----CCCCcccccCCCceEEEEecCCcccccCCccchhhhhHHHHHHHhhhHHHH Q lcl|NC_021303. 161 WYAVTREEIKS---KAGETAE-ISL----PDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (637) Q Consensus 161 W~~vt~~Ei~~---k~g~~~~-i~l----PdG~~he~~~~~d~l~RvW~P~prra~eaDSPvra~l~~LrEI~rttk~I~ 232 (637) ++.|..+.+.. ..|...+ ... ..|....|....=+-||..+. ....--||+.++.+.|.-..-+.+... T Consensus 124 L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~g~~~~~~~~diiH~r~~~~---dg~~G~spi~~~~~~i~~~~~~~~~~~ 200 (251) T protein:vir:46 124 LTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSL---DGINGLSLLDTLSRTIESDNNGKDFLN 200 (251) T ss_pred EEEECCceEEEEECCCCcEEEEEEEeccCCcceeEEECCccEEEecCcCC---CCeeecCHHHHHHHHHHHHHHHHHHHH Confidence 45554444432 1222222 111 224444554432222554332 235678999999998888888888888 Q ss_pred HHHHhHhhcCceeeecccCCCCCcccccccccccCCCcccccCCCchhHHHHHHHHHHHHhhcccCccccccccceeEee Q lcl|NC_021303. 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (637) Q Consensus 233 na~~SRL~gnGvlfvPqe~slP~~~ap~~a~~~~~pg~~~~~~~~~~~~~~L~~ml~~va~aai~De~S~AA~vPiva~v 312 (637) +..+.-..-.|||-+|+.++=+ .+.+.|++.+.+ .+...+. +..|++ + T Consensus 201 ~~f~ng~~p~gil~~~~~l~~~------------------------e~~~~~~~~~~~----~~~g~~n-~g~~~~---g 248 (251) T protein:vir:46 201 NFLRNGTHAGGILKMKGVLDNK------------------------KARDRAREEFPK----VLVELNK-LGKLSY---S 248 (251) T ss_pred HHHHccCCCcEEEEeCCCCCCH------------------------HHHHHHHHHHHH----HhcCccc-cccccc---c Confidence 8877777778898888743211 122334333332 2222111 112222 2 Q ss_pred chH Q lcl|NC_021303. 313 AAE 315 (637) Q Consensus 313 P~E 315 (637) ..| T Consensus 249 m~~ 251 (251) T protein:vir:46 249 MNQ 251 (251) T ss_pred cCC Confidence 222 Done!