Query lcl|NC_011269.1_cdsid_YP_002224115.1 [gene=87] [protein=gp87] [protein_id=YP_002224115.1] [location=35236..37839] Match_columns 867 No_of_seqs 318 out of 15941 Neff 5.1 Searched_HMMs 1612 Date Thu Nov 7 14:46:54 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_82 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_82_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:105154 Length: 525 100.0 1.3E-37 8E-41 222.8 18.4 462 32-597 1-525 (525) 2 protein:vir:95542 Length: 548 99.4 6.5E-14 4.1E-17 92.9 19.6 502 16-618 1-548 (548) 3 protein:vir:389 Length: 530 # 99.3 1.6E-13 9.8E-17 90.8 13.7 465 24-590 1-530 (530) 4 protein:vir:3420 Length: 533 # 99.2 6.1E-13 3.8E-16 87.6 15.5 465 24-593 1-533 (533) 5 protein:vir:79538 Length: 502 99.2 1.7E-12 1E-15 85.2 17.0 458 16-598 1-502 (502) 6 protein:vir:107880 Length: 491 99.1 6.7E-11 4.2E-14 76.4 19.7 464 32-600 1-491 (491) 7 protein:vir:79063 Length: 491 99.0 3.9E-11 2.4E-14 77.6 17.1 452 56-600 1-491 (491) 8 protein:vir:96738 Length: 505 99.0 7.6E-11 4.7E-14 76.1 18.4 464 32-588 1-505 (505) 9 protein:vir:3153 Length: 467 # 99.0 4.6E-11 2.9E-14 77.3 16.5 415 116-572 1-467 (467) 10 protein:vir:10321 Length: 495 99.0 2.2E-11 1.3E-14 79.1 14.0 451 24-580 1-495 (495) 11 protein:vir:1380 Length: 422 # 99.0 1.1E-10 6.5E-14 75.3 15.9 413 44-565 1-422 (422) 12 protein:vir:93610 Length: 454 98.9 1.3E-10 8.2E-14 74.7 16.1 423 49-565 1-454 (454) 13 protein:vir:8418 Length: 409 # 98.9 1.3E-10 8.3E-14 74.7 14.3 388 58-583 1-409 (409) 14 protein:vir:3989 Length: 392 # 98.9 1.5E-10 9.5E-14 74.4 14.4 382 56-542 1-392 (392) 15 protein:vir:1023 Length: 392 # 98.9 1.5E-10 9.5E-14 74.4 14.4 382 56-542 1-392 (392) 16 protein:vir:6382 Length: 553 # 98.9 4.7E-10 2.9E-13 71.7 16.8 468 44-595 1-553 (553) 17 protein:vir:78641 Length: 278 98.9 1E-10 6.2E-14 75.4 12.7 271 138-473 1-278 (278) 18 protein:vir:4952 Length: 386 # 98.8 1.7E-10 1.1E-13 74.1 12.8 375 58-542 1-386 (386) 19 protein:vir:81152 Length: 411 98.8 5.3E-10 3.3E-13 71.4 14.3 397 58-569 1-411 (411) 20 protein:vir:7407 Length: 392 # 98.8 8.5E-10 5.3E-13 70.3 14.5 378 56-542 1-392 (392) 21 protein:vir:3843 Length: 397 # 98.7 2.9E-10 1.8E-13 72.8 11.4 389 58-571 1-397 (397) 22 protein:vir:102080 Length: 429 98.7 3.5E-10 2.1E-13 72.5 11.5 413 58-574 1-429 (429) 23 protein:vir:4828 Length: 382 # 98.7 9.2E-10 5.7E-13 70.1 13.8 377 58-544 1-382 (382) 24 protein:vir:105002 Length: 432 98.7 4.1E-10 2.5E-13 72.1 11.0 413 58-574 1-432 (432) 25 protein:vir:102855 Length: 432 98.7 4.1E-10 2.5E-13 72.1 11.0 413 58-574 1-432 (432) 26 protein:vir:107605 Length: 432 98.7 4.1E-10 2.5E-13 72.1 11.0 413 58-574 1-432 (432) 27 protein:vir:108215 Length: 469 98.7 3E-09 1.8E-12 67.3 15.7 428 70-594 1-469 (469) 28 protein:vir:80333 Length: 419 98.7 5.1E-09 3.2E-12 66.1 16.9 400 60-558 1-419 (419) 29 protein:vir:101647 Length: 460 98.7 3.7E-09 2.3E-12 66.8 15.9 442 26-575 1-460 (460) 30 protein:vir:77981 Length: 448 98.7 5.2E-09 3.2E-12 66.0 16.2 411 70-566 1-448 (448) 31 protein:vir:100691 Length: 535 98.7 2.4E-08 1.5E-11 62.3 19.6 471 13-607 1-535 (535) 32 protein:vir:102118 Length: 409 98.6 4.5E-09 2.8E-12 66.3 14.2 387 84-565 1-409 (409) 33 protein:vir:4156 Length: 542 # 98.6 1.3E-08 8E-12 63.8 16.4 483 75-636 1-542 (542) 34 protein:vir:79772 Length: 648 98.6 3.9E-09 2.4E-12 66.7 13.5 479 32-596 1-648 (648) 35 protein:vir:1266 Length: 416 # 98.6 5.8E-09 3.6E-12 65.8 14.3 403 65-574 1-416 (416) 36 protein:vir:98816 Length: 446 98.5 8.9E-08 5.5E-11 59.2 19.1 400 78-533 1-446 (446) 37 protein:vir:7853 Length: 518 # 98.5 9.5E-09 5.9E-12 64.6 13.4 486 59-631 1-518 (518) 38 protein:vir:4995 Length: 384 # 98.5 3E-09 1.8E-12 67.3 9.7 372 80-541 1-384 (384) 39 protein:vir:101648 Length: 518 98.4 1.4E-08 8.4E-12 63.7 12.1 479 59-640 1-518 (518) 40 protein:vir:63755 Length: 547 98.4 3.8E-07 2.3E-10 55.8 20.0 498 13-649 1-547 (547) 41 protein:vir:2683 Length: 412 # 98.4 1E-08 6.5E-12 64.3 11.1 397 44-552 1-412 (412) 42 protein:vir:93943 Length: 409 98.4 6.9E-08 4.3E-11 59.9 15.6 401 70-574 1-409 (409) 43 protein:vir:960 Length: 413 # 98.4 1.1E-07 7.1E-11 58.6 16.6 390 41-552 1-413 (413) 44 protein:vir:4854 Length: 386 # 98.4 1.6E-08 1E-11 63.3 11.7 382 50-544 1-386 (386) 45 protein:vir:80644 Length: 551 98.3 6.3E-07 3.9E-10 54.6 19.7 502 9-640 1-551 (551) 46 protein:vir:4194 Length: 540 # 98.3 1E-07 6.3E-11 58.9 15.3 476 77-636 1-540 (540) 47 protein:vir:96579 Length: 576 98.3 1.4E-06 8.7E-10 52.7 21.7 520 8-642 1-576 (576) 48 protein:vir:99853 Length: 488 98.3 4.3E-07 2.7E-10 55.5 17.9 457 44-606 1-488 (488) 49 protein:vir:1986 Length: 512 # 98.3 2.1E-07 1.3E-10 57.2 15.9 469 49-604 1-512 (512) 50 protein:vir:8100 Length: 466 # 98.3 2.6E-07 1.6E-10 56.7 15.9 436 47-580 1-466 (466) 51 protein:vir:94426 Length: 409 98.2 4.7E-07 2.9E-10 55.3 16.3 402 44-574 1-409 (409) 52 protein:vir:1082 Length: 359 # 98.2 3E-08 1.8E-11 61.8 9.5 347 80-525 1-359 (359) 53 protein:vir:9359 Length: 348 # 98.2 2.1E-07 1.3E-10 57.2 13.6 340 138-574 1-348 (348) 54 protein:vir:483 Length: 413 # 98.1 3.5E-07 2.2E-10 56.0 14.6 399 48-582 1-413 (413) 55 protein:vir:99232 Length: 526 98.1 6.8E-07 4.2E-10 54.4 15.7 444 84-602 1-526 (526) 56 protein:vir:95254 Length: 488 98.1 2.5E-06 1.6E-09 51.3 18.8 412 70-587 1-488 (488) 57 protein:vir:4454 Length: 414 # 98.1 7.8E-07 4.9E-10 54.1 15.2 391 82-582 1-414 (414) 58 protein:vir:79984 Length: 441 98.1 7.4E-07 4.6E-10 54.2 14.7 426 44-571 1-441 (441) 59 protein:vir:9408 Length: 441 # 98.1 7.4E-07 4.6E-10 54.2 14.7 426 44-571 1-441 (441) 60 protein:vir:79511 Length: 448 98.0 2.6E-06 1.6E-09 51.2 17.7 413 70-568 1-448 (448) 61 protein:vir:103860 Length: 528 98.0 2.4E-06 1.5E-09 51.4 17.0 449 84-606 1-528 (528) 62 protein:vir:4337 Length: 434 # 98.0 7.2E-07 4.5E-10 54.3 14.2 418 32-580 1-434 (434) 63 protein:vir:81095 Length: 416 98.0 5.2E-07 3.2E-10 55.0 13.1 405 58-571 1-416 (416) 64 protein:vir:4598 Length: 416 # 98.0 5.2E-07 3.2E-10 55.0 13.1 405 58-571 1-416 (416) 65 protein:vir:78161 Length: 355 98.0 5.8E-07 3.6E-10 54.8 13.2 336 219-604 1-355 (355) 66 protein:vir:96980 Length: 409 98.0 1.6E-06 9.9E-10 52.4 15.5 401 70-574 1-409 (409) 67 protein:vir:100249 Length: 431 98.0 6.2E-07 3.8E-10 54.6 12.8 408 44-575 1-431 (431) 68 protein:vir:5737 Length: 419 # 98.0 9.6E-07 5.9E-10 53.6 13.8 399 82-574 1-419 (419) 69 protein:vir:1431 Length: 419 # 97.9 1.9E-06 1.2E-09 51.9 14.9 401 70-586 1-419 (419) 70 protein:vir:1884 Length: 424 # 97.9 1E-06 6.3E-10 53.4 13.3 398 26-563 1-424 (424) 71 protein:vir:100150 Length: 437 97.9 1.1E-06 6.6E-10 53.3 13.1 417 56-581 1-437 (437) 72 protein:vir:100882 Length: 383 97.9 1.9E-06 1.2E-09 52.0 13.7 361 80-538 1-383 (383) 73 protein:vir:3868 Length: 417 # 97.8 5.4E-07 3.4E-10 54.9 10.4 397 80-582 1-417 (417) 74 protein:vir:100328 Length: 346 97.8 8.7E-07 5.4E-10 53.8 11.3 327 56-483 1-346 (346) 75 protein:vir:1326 Length: 457 # 97.8 8.2E-07 5.1E-10 54.0 10.8 439 58-616 1-457 (457) 76 protein:vir:98396 Length: 441 97.8 3.8E-06 2.3E-09 50.3 14.2 426 44-571 1-441 (441) 77 protein:vir:99312 Length: 563 97.7 1.5E-05 9.3E-09 47.0 16.9 501 12-623 1-563 (563) 78 protein:vir:95599 Length: 563 97.7 1.5E-05 9.3E-09 47.0 16.9 501 12-623 1-563 (563) 79 protein:vir:189 Length: 424 # 97.7 3.3E-06 2E-09 50.6 13.3 407 44-563 1-424 (424) 80 protein:vir:6240 Length: 457 # 97.7 5E-06 3.1E-09 49.7 13.3 420 82-587 1-457 (457) 81 protein:vir:81072 Length: 432 97.6 1.1E-05 7E-09 47.7 15.1 405 67-583 1-432 (432) 82 protein:vir:105064 Length: 421 97.5 1.4E-05 8.5E-09 47.2 14.4 402 32-588 1-421 (421) 83 protein:vir:9702 Length: 406 # 97.5 3.5E-05 2.2E-08 45.0 15.9 385 80-580 1-406 (406) 84 protein:vir:3743 Length: 345 # 97.5 2.1E-06 1.3E-09 51.7 8.8 321 60-487 1-345 (345) 85 protein:vir:3780 Length: 345 # 97.4 2E-06 1.2E-09 51.9 8.4 324 60-487 1-345 (345) 86 protein:vir:4509 Length: 424 # 97.4 1.3E-05 8.2E-09 47.3 12.7 408 32-551 1-424 (424) 87 protein:vir:100187 Length: 385 97.4 1.6E-05 9.9E-09 46.9 13.1 375 44-547 1-385 (385) 88 protein:vir:80040 Length: 461 97.3 9.3E-05 5.8E-08 42.7 16.6 425 44-562 1-461 (461) 89 protein:vir:97060 Length: 432 97.3 6.1E-05 3.8E-08 43.7 14.9 409 44-583 1-432 (432) 90 protein:vir:79233 Length: 526 97.2 5.5E-05 3.4E-08 43.9 13.9 472 84-602 1-526 (526) 91 protein:vir:10362 Length: 432 97.0 5.9E-05 3.7E-08 43.8 12.8 405 26-583 1-432 (432) 92 protein:vir:2013 Length: 344 # 96.8 5.9E-05 3.7E-08 43.8 11.1 325 44-478 1-344 (344) 93 protein:vir:99452 Length: 651 96.6 0.00048 3E-07 38.8 17.4 541 1-654 1-651 (651) 94 protein:vir:6058 Length: 344 # 96.5 0.00011 6.7E-08 42.3 10.8 325 49-478 1-344 (344) 95 protein:vir:79150 Length: 368 96.5 0.00024 1.5E-07 40.4 12.3 347 44-514 1-368 (368) 96 protein:vir:81218 Length: 423 96.5 0.00044 2.8E-07 39.0 13.7 397 58-547 1-423 (423) 97 protein:vir:6210 Length: 394 # 96.5 0.00032 2E-07 39.8 12.8 373 58-547 1-394 (394) 98 protein:vir:79207 Length: 351 96.4 0.00024 1.5E-07 40.4 12.1 323 1-493 1-351 (351) 99 protein:vir:78749 Length: 337 96.4 0.00021 1.3E-07 40.7 11.7 310 56-489 1-337 (337) 100 protein:vir:103971 Length: 376 96.3 0.00029 1.8E-07 39.9 11.9 337 1-493 2-376 (376) 101 protein:vir:1150 Length: 350 # 96.3 0.00027 1.6E-07 40.2 11.5 331 44-469 1-350 (350) 102 protein:vir:78191 Length: 351 96.3 0.00026 1.6E-07 40.2 11.4 323 1-493 1-351 (351) 103 protein:vir:80796 Length: 574 95.9 0.0012 7.5E-07 36.6 19.6 518 1-636 1-574 (574) 104 protein:vir:8317 Length: 409 # 95.8 0.001 6.4E-07 37.0 12.5 372 58-543 1-409 (409) 105 protein:vir:98853 Length: 219 95.7 0.00014 8.9E-08 41.6 7.7 204 231-479 1-219 (219) 106 protein:vir:5249 Length: 437 # 95.5 0.002 1.2E-06 35.4 17.9 413 64-580 1-437 (437) 107 protein:vir:267 Length: 348 # 95.4 0.00054 3.4E-07 38.5 9.6 327 49-495 1-348 (348) 108 protein:vir:98567 Length: 340 95.2 0.00083 5.1E-07 37.5 10.1 321 44-482 1-340 (340) 109 protein:vir:104259 Length: 403 95.0 0.003 1.9E-06 34.4 15.6 376 82-580 1-403 (403) 110 protein:vir:5691 Length: 344 # 95.0 0.0014 8.5E-07 36.3 10.6 325 44-478 1-344 (344) 111 protein:vir:572 Length: 506 # 94.8 0.0035 2.2E-06 34.1 16.6 421 98-571 1-506 (506) 112 protein:vir:98643 Length: 395 94.5 0.0041 2.6E-06 33.7 15.5 374 82-547 1-395 (395) 113 protein:vir:95378 Length: 406 94.2 0.0051 3.1E-06 33.2 16.9 385 58-580 1-406 (406) 114 protein:vir:1236 Length: 483 # 92.2 0.006 3.7E-06 32.8 9.2 426 70-590 1-483 (483) 115 protein:vir:733 Length: 453 # 91.0 0.018 1.1E-05 30.2 11.2 414 44-575 1-453 (453) 116 protein:vir:94805 Length: 492 90.4 0.021 1.3E-05 29.7 11.3 426 61-591 1-492 (492) 117 protein:vir:9641 Length: 395 # 89.0 0.029 1.8E-05 29.0 15.3 375 80-543 1-395 (395) 118 protein:vir:102950 Length: 471 88.8 0.03 1.9E-05 28.9 17.1 401 82-576 1-471 (471) 119 protein:vir:95965 Length: 385 88.7 0.031 1.9E-05 28.9 16.3 367 58-546 1-385 (385) 120 protein:vir:78310 Length: 376 88.1 0.035 2.1E-05 28.6 11.5 354 58-535 1-376 (376) 121 protein:vir:79043 Length: 479 88.0 0.035 2.2E-05 28.6 16.9 418 77-573 1-479 (479) 122 protein:vir:9871 Length: 429 # 86.9 0.042 2.6E-05 28.1 11.0 398 44-580 1-429 (429) 123 protein:vir:38 Length: 496 # N 86.9 0.043 2.6E-05 28.1 16.3 421 49-580 1-496 (496) 124 protein:vir:99522 Length: 470 85.5 0.052 3.2E-05 27.6 13.8 429 56-581 1-470 (470) 125 protein:vir:94869 Length: 378 85.3 0.054 3.3E-05 27.5 14.0 354 80-575 1-378 (378) 126 protein:vir:4698 Length: 251 # 84.0 0.03 1.9E-05 28.9 7.3 241 58-388 1-251 (251) 127 protein:vir:100650 Length: 395 83.3 0.07 4.3E-05 26.9 13.8 380 58-575 1-395 (395) 128 protein:vir:9507 Length: 395 # 83.3 0.07 4.3E-05 26.9 13.8 380 58-575 1-395 (395) 129 protein:vir:101289 Length: 395 83.3 0.07 4.3E-05 26.9 13.8 380 58-575 1-395 (395) 130 protein:vir:94546 Length: 506 82.5 0.077 4.8E-05 26.7 9.3 437 44-590 1-506 (506) 131 protein:vir:97171 Length: 512 82.2 0.079 4.9E-05 26.6 10.7 448 49-585 1-512 (512) 132 protein:vir:80134 Length: 403 82.1 0.08 5E-05 26.6 12.7 369 80-580 1-403 (403) 133 protein:vir:5961 Length: 503 # 82.0 0.081 5E-05 26.6 16.1 441 32-607 1-503 (503) 134 protein:vir:3609 Length: 452 # 81.8 0.082 5.1E-05 26.5 11.8 401 68-580 1-452 (452) 135 protein:vir:99781 Length: 511 81.6 0.065 4.1E-05 27.1 8.2 446 38-580 1-511 (511) 136 protein:vir:1661 Length: 378 # 80.7 0.093 5.8E-05 26.2 12.8 351 73-583 1-378 (378) 137 protein:vir:105619 Length: 772 80.5 0.014 8.7E-06 30.7 4.1 593 1-733 45-772 (772) 138 protein:vir:93747 Length: 472 80.2 0.097 6E-05 26.1 10.1 451 5-590 1-472 (472) 139 protein:vir:107742 Length: 537 80.0 0.099 6.2E-05 26.1 14.7 495 15-599 1-537 (537) 140 protein:vir:100312 Length: 152 79.8 0.026 1.6E-05 29.3 5.4 117 275-413 1-152 (152) 141 protein:vir:103951 Length: 511 78.9 0.11 6.8E-05 25.9 10.2 444 38-585 1-511 (511) 142 protein:vir:96240 Length: 511 78.3 0.12 7.2E-05 25.7 10.0 441 38-586 1-511 (511) 143 protein:vir:96179 Length: 468 78.1 0.12 7.3E-05 25.7 13.8 412 77-572 1-468 (468) 144 protein:vir:4089 Length: 395 # 76.8 0.13 8.1E-05 25.4 15.4 376 82-575 1-395 (395) 145 protein:vir:95899 Length: 474 73.7 0.17 0.0001 24.8 14.1 418 77-583 1-474 (474) 146 protein:vir:96266 Length: 474 73.7 0.17 0.0001 24.8 14.1 418 77-583 1-474 (474) 147 protein:vir:94498 Length: 474 73.4 0.17 0.00011 24.8 15.8 434 49-583 1-474 (474) 148 protein:vir:97447 Length: 474 73.4 0.17 0.00011 24.8 15.8 434 49-583 1-474 (474) 149 protein:vir:9922 Length: 489 # 72.9 0.18 0.00011 24.7 13.7 417 70-579 1-489 (489) 150 protein:vir:80959 Length: 499 71.1 0.2 0.00012 24.4 16.8 419 49-580 1-499 (499) 151 protein:vir:9306 Length: 511 # 70.9 0.2 0.00013 24.4 10.7 442 38-585 1-511 (511) 152 protein:vir:106571 Length: 499 70.5 0.21 0.00013 24.3 11.5 431 80-601 1-499 (499) 153 protein:vir:9751 Length: 422 # 68.4 0.24 0.00015 24.0 13.2 407 44-525 1-422 (422) 154 protein:vir:104338 Length: 422 68.1 0.24 0.00015 24.0 16.2 406 64-568 1-422 (422) 155 protein:vir:96366 Length: 511 65.4 0.28 0.00018 23.6 10.5 439 43-585 1-511 (511) 156 protein:vir:78805 Length: 511 65.4 0.28 0.00018 23.6 10.5 439 43-585 1-511 (511) 157 protein:vir:858 Length: 378 # 65.4 0.28 0.00018 23.6 12.2 352 58-575 1-378 (378) 158 protein:vir:3028 Length: 500 # 57.4 0.26 0.00016 23.8 5.7 432 61-546 1-500 (500) 159 protein:vir:9815 Length: 500 # 57.4 0.26 0.00016 23.8 5.7 432 61-546 1-500 (500) 160 protein:vir:95113 Length: 474 55.2 0.49 0.0003 22.3 14.7 431 49-583 1-474 (474) 161 protein:vir:4898 Length: 502 # 53.3 0.53 0.00033 22.1 10.2 471 1-586 2-502 (502) 162 protein:vir:3163 Length: 145 # 51.6 0.34 0.00021 23.2 5.3 115 274-416 1-145 (145) 163 protein:vir:1164 Length: 156 # 51.2 0.48 0.0003 22.3 6.1 119 275-422 1-156 (156) 164 protein:vir:79115 Length: 148 50.5 0.57 0.00036 21.9 6.4 115 256-418 1-148 (148) 165 protein:vir:78907 Length: 518 45.5 0.77 0.00048 21.2 10.6 438 58-580 1-518 (518) 166 protein:vir:3964 Length: 453 # 43.3 0.85 0.00053 21.0 13.1 411 68-580 1-453 (453) 167 protein:vir:1587 Length: 508 # 42.9 0.87 0.00054 20.9 11.4 437 58-583 1-508 (508) 168 protein:vir:93630 Length: 776 39.7 1 0.00062 20.6 18.7 638 1-726 1-776 (776) 169 protein:vir:9706 Length: 100 # 37.6 0.44 0.00027 22.6 3.6 96 99-278 1-100 (100) 170 protein:vir:94002 Length: 378 37.5 1.1 0.00069 20.3 13.7 351 73-575 1-378 (378) 171 protein:vir:93867 Length: 378 36.7 1.2 0.00072 20.2 14.3 354 73-583 1-378 (378) 172 protein:vir:105292 Length: 478 36.0 1.2 0.00074 20.2 16.6 456 21-580 1-478 (478) 173 protein:vir:5703 Length: 150 # 34.0 0.65 0.00041 21.6 4.0 114 275-412 1-150 (150) 174 protein:vir:80680 Length: 441 32.1 1.5 0.0009 19.7 12.9 414 44-566 1-441 (441) 175 protein:vir:106639 Length: 481 31.6 1.5 0.00092 19.6 14.7 432 49-574 1-481 (481) 176 protein:vir:79179 Length: 155 29.8 1.3 0.00084 19.9 5.0 115 274-412 1-155 (155) 177 protein:vir:1838 Length: 149 # 27.8 1.8 0.0011 19.2 6.9 114 256-410 1-149 (149) 178 protein:vir:97336 Length: 492 26.3 2 0.0012 19.0 17.1 430 61-591 1-492 (492) 179 protein:vir:95806 Length: 440 25.2 2.1 0.0013 18.8 13.9 391 98-580 1-440 (440) 180 protein:vir:5839 Length: 533 # 24.0 2.2 0.0014 18.7 21.3 482 26-642 1-533 (533) 181 protein:vir:105889 Length: 474 23.1 2.4 0.0015 18.5 14.8 414 44-580 1-474 (474) 182 protein:vir:94101 Length: 474 23.1 2.4 0.0015 18.5 14.8 414 44-580 1-474 (474) 183 protein:vir:7768 Length: 484 # 22.0 2.5 0.0016 18.4 18.6 456 1-608 1-484 (484) 184 protein:vir:103841 Length: 155 21.4 0.65 0.0004 21.6 1.6 116 275-412 1-155 (155) No 1 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=100.00 E-value=1.3e-37 Score=222.81 Aligned_cols=462 Identities=15% Similarity=0.148 Sum_probs=246.9 Q ss_pred hhHHhhh-hhcccCCchH-HHHHHHHhhhcchhHHHH---HHHHhccccccccee----e--ccchhhhhhhhhHHhhCC Q lcl|NC_011269. 32 MARAQAA-ALQNTVDNKP-LIDYFQGRRRAAEANRQR---LASYRKQGNFGSNMQ----I--AMPKIRQPLGTLADKGIP 100 (867) Q Consensus 32 ~~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~----~--~~~~~~~~~~~~~~~~~~ 100 (867) |.|..-. .|+.|+..-. -++-++++-|-|+..-.- |-+---+ .|-+||- | -+-.--+=.|.|-|+-| T Consensus 1 ~~~~~~~~~~~~t~~k~~~~~e~~~~~~n~~~~~y~ty~~~~~~f~~-gfv~~~~~ng~i~~v~~~~l~~~f~npd~~~- 78 (525) T protein:vir:10 1 MTRTKGSKNKSTTIEKQSLQIEQLQEHINELERQYNTYDDVVDAFID-GFVMDLCNNGKIKTVNLDTLQLWFNNPDKYI- 78 (525) T ss_pred CCCCcCCcccccchhhhhhhHHHHHHHHhhhhhhcchhhhHHHHHHH-HHHHHhhcCCceeeeeHHHHHhhhcChHHHH- Confidence 2221111 1111111111 244555555554422111 0111111 1222221 0 00111122233333322 Q ss_pred CchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcccccceecccchhHHHHHHHHhhcccccHHHHhHH------HHH Q lcl|NC_011269. 101 FNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPD------QFA 174 (867) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~ 174 (867) || |..-...||-..--|-.|.|+.-..|-.+.+|+| |-.+--.-++|.+++.-+. +.- T Consensus 79 ------~~---i~~l~~y~yi~~~~v~ql~~li~~lp~l~y~i~~-------~~~~k~~~~~~s~~n~~l~k~i~hk~lt 142 (525) T protein:vir:10 79 ------NN---IVNLLTYYYIIDGNVFQLYDLIFSLPPLDYQIKV-------LKRDKDYKEDLSTINLYLEKKIQHKQLT 142 (525) T ss_pred ------HH---HHHHHHHhhhhcchHHHHHHHHHhcCCcceeehh-------hhhccchhhHHHHHHHHHHHhHHHHHHH Confidence 12 2223344666655566666666666666655543 1111112223333332221 122 Q ss_pred HHHhh----hhhhcc----------hhhhhhhccceehheecCcceeehh-hhhhhcchHHHHHHHHHHhhccccccccc Q lcl|NC_011269. 175 REYFT----VGEVTS----------LAHFNESLGVWSSEEILNPDMLRVS-RSMFVQRERVQLMVKDLVDHLRQGPTTAG 239 (867) Q Consensus 175 ~~~~~----~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (867) |.+++ .|-.++ |--||+---|+-++. -|=+||-|- -..| ++.-++--|.+.++|.-= T Consensus 143 rdll~q~a~~gtlig~wlg~~~~py~~vf~~~kyvfp~~r-~~g~~v~vid~~~f--~~~~~~~r~~~~~~lsp~----- 214 (525) T protein:vir:10 143 RDLLVQLAHSGTLIGTWLGSKREPYFNVFNNLKYVFPYGR-AKGKMVAVIDLQWF--DEMSELERKLTFENLSPL----- 214 (525) T ss_pred HHHHHHhhccCceeEeeecCCCCcchhhhhhhhhhccccc-cCCceEEEEehHHh--hhhhHHHHHHHHHhhchh----- Confidence 33332 222111 223343332222221 122333221 1111 111111122222222100 Q ss_pred cccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCcccccc-CcchhhHHHHHHHHHHHHH Q lcl|NC_011269. 240 GNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATR-GAPHLLRSFRTLMAEESLN 318 (867) Q Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 318 (867) -...+|....+-.-|--++ -+=|.|+.+.+.|+.|.+..+..| |+|.++++|..+..+++|| T Consensus 215 --------------i~~~~y~~~~~~~~~~~~~---~r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l~dI~hk~klr 277 (525) T protein:vir:10 215 --------------ITENKYKKWKEYNGENEDA---LRYIMLPISKTLVARIHTLSRNQRLGIPYGTQTLFDIQHKQKLR 277 (525) T ss_pred --------------hhhhhhhHHhhcccccchh---heeeecccceeEEeeecccccCcccCcchhhhHHHHHHHHHHHH Confidence 0000222221111111111 245889999999999999999999 9999999999999999999 Q ss_pred HHHHHHHhhhhchhhhhhhcccccCCCCcC-CCC---HHHHHHHHH------HHHHhhhc---chhhhhhhhheeee--- Q lcl|NC_011269. 319 AAQDAVADRLYSPLVLATLGIEDMGDGEPW-IPD---QGELDEVRD------DMQSLLAA---DFRLMVHNFGLKVE--- 382 (867) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~---~~~~~~~~~------~~~~~~~~---~~~~~~~~~~~~~~--- 382 (867) .++++||+||.+|++++|+|+ ++|+.| ||+ |.+|..||| ||+++|++ +|| .++| T Consensus 278 d~EqsIA~kii~a~avLk~gg---~~gn~mk~p~~~kqkil~gVk~aleK~~kdK~Gi~vi~~Pdf-------a~~efp~ 347 (525) T protein:vir:10 278 DLEQSIADKIIKAMAVLKFRG---KDDNDSKVKESAKRKVLAGVKRALEKGVKDKNGIACIAMPDF-------ATFEFPE 347 (525) T ss_pred HHHHHHHHHhhhhheeeeecc---ccCccccCchHHHHHHHHHHHHHHhcccccccCeEEEeccce-------eeccccc Confidence 999999999999999999999 999999 999 999999999 89999999 888 6666 Q ss_pred ----eccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHh Q lcl|NC_011269. 383 ----NVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEA 458 (867) Q Consensus 383 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~ 458 (867) ++|++++ +||.|+.+|++|+||+++|++ |+|++||+|+||||..-|+||+|...||...|++++||-.- T Consensus 348 ik~~~~glDg~------K~d~I~~DI~~A~GlS~sL~n-GdggNyAtaslnld~fykkigVm~e~Iee~y~kL~d~Vl~~ 420 (525) T protein:vir:10 348 IKNGDKTLDPK------KYDSIDNDITNATGISQVLTN-GTKGNYASAKLNLDVFYKKIGVMLEIIEEIYNQLIDIILGE 420 (525) T ss_pred ccCcccCCCch------hhhhhhhhhhhhhccceeeec-CCCCceeeeeeeHHHHHHHHHHHHHHHHHHHHHHHhhhcCc Confidence 7888887 899999999999999999999 99999999999999999999999999999999999999543 Q ss_pred hcccchheehhhccccchhhhhhhhhhh---hhHhhhhhhhhhhhhccc--cccccchhhhhhhhhhhhhce-eeeeccc Q lcl|NC_011269. 459 QGHYDYDLKGGVRVPIYREIVEYDEETG---QEYIRKVPKLLIPEIKFS--TLNLRDEAQERAFIAQLKGMG-VPVSDKT 532 (867) Q Consensus 459 q~~~d~~~~~~~~~~~~rd~~~~k~e~~---k~~~r~~~k~i~~~i~~~--~~~Lr~e~~~~~~v~qL~~~~-~pitd~t 532 (867) ++...+.|| ++.-+.+++||+|. |+++++-.-.-...+.+- +..+-..-.+.+. .+|+.+- -++.+-. T Consensus 421 ----~k~~nyifn-ydkd~pi~~kkk~d~LIkL~d~g~s~k~vldl~gis~e~y~E~s~yEtE~-lkl~EKi~pp~~~~v 494 (525) T protein:vir:10 421 ----EKGCNYIFQ-YNKDTPIEREKKLDTLIKLEAQGYSAKYVLDILGISSEEYFEESIYEIEK-LKLREKIMPPLNTNV 494 (525) T ss_pred ----ccCcceEEe-cCCCchhhhhhhhhhhhhhhccchhhhhhhhhhccCcchHHHHHHHHHHH-HHHhhhcccccccee Confidence 677788899 99999999999764 455555332222222211 0111110111110 1111111 1111111 Q ss_pred cCCCc----ccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccccccCCCCCCC Q lcl|NC_011269. 533 LAVNI----DMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLALRQGKTQ 597 (867) Q Consensus 533 ~p~ti----qme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~a~gpgq 597 (867) ..+.. -.+.+. ... ...+++...... . T Consensus 495 ~SGk~~n~iG~P~~d--d~~--~~dati~s~~~~----------------------------------~ 525 (525) T protein:vir:10 495 LSGKDGNDIGSPKLD--DSD--SSDATIESKERG----------------------------------V 525 (525) T ss_pred eeccccccccCCccC--CCc--chhhhhhhhhcC----------------------------------C Confidence 11100 000000 000 000011000000 0 No 2 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.41 E-value=6.5e-14 Score=92.86 Aligned_cols=502 Identities=17% Similarity=0.140 Sum_probs=241.6 Q ss_pred HHHHHHhcCCCCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhH Q lcl|NC_011269. 16 VNRLRKAGVNMPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLA 95 (867) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 95 (867) .|-|-|+.--+ +|.. .+.+. ..|..+..|.--+. +-+..+.+ .+. .+ T Consensus 1 Mn~iDr~i~~~--sP~~------------------a~~R~------~ar~~~~~y~aa~~--~r~~~~~~---~~~--s~ 47 (548) T protein:vir:95 1 MNLIDRLLEPL--APEL------------------VARRL------AAREAIQAYEAARP--GRTHKAKR---QPL--GA 47 (548) T ss_pred CchHHhHhhhc--chHH------------------HHHHH------HhHHHhccccccCc--cccccccC---CCC--Ch Confidence 23232222111 1111 11111 11121122222221 11111222 122 22 Q ss_pred HhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcccc--cceeccc----ch--------hHHHHHHHHh--- Q lcl|NC_011269. 96 DKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVV--GMEFDSK----DP--------LIKTFYEDLF--- 158 (867) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~----~~--------~~~~~~~~~~--- 158 (867) |-+.+..++.+++=||..|+.++++..+||.+...-|+ ++.+..+ |+ .|+..|.+-+ T Consensus 48 ------~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~ 121 (548) T protein:vir:95 48 ------DTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSP 121 (548) T ss_pred ------HHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCc Confidence 22333456667778899999999999999999999997 3555543 22 2233333322 Q ss_pred -hcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhc-----ccee-hheecCcceeehhhhhhhcchHHHHHHHHHHhhc Q lcl|NC_011269. 159 -FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESL-----GVWS-SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHL 231 (867) Q Consensus 159 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (867) +..+||+..+..- +-|.++.-||+|-..++-+.. .-|- .+.+|+||+|......- + ..+ T Consensus 122 D~~g~~~f~~lq~l-~~R~~~~dGE~f~~~~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~--~-----------~~i 187 (548) T protein:vir:95 122 ETSGELTRPQVERL-MCRTWLRDGEGLAQKLMGRVPNYTFATSVPFALELLEPDYLPFSYNNL--S-----------KGI 187 (548) T ss_pred cccccCCHHHHHHH-HHHHHHhCCceEEEeeecccccccCCcccceEEEEechhhcCCCCCCC--C-----------Cce Confidence 2234566665554 559999999998776664321 1122 56789999987653221 1 134 Q ss_pred cccccccccccccccccchhhhhhhhhHHHHHHhchH-HHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHH Q lcl|NC_011269. 232 RQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPE-IIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRT 310 (867) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (867) ++| +|-... -|-.-|- |++.+|- .........-.+|+-+.|.||...--+-..||.|.|..++.. T Consensus 188 ~~G----------IE~D~~---Grp~aY~-i~~~hPgd~~~~~~~~~~~rvpA~~VlHif~~~r~gQ~RGvs~lapvl~~ 253 (548) T protein:vir:95 188 VQG----------IERDTW---RRKRAYH-LLKDHPGNLQTLGGSLAVKRVEAERIIHIAYRKRIGQNRGVPMLHAVLIR 253 (548) T ss_pred eee----------eEECCC---CceEEEE-EeecCCCcccccccccceeeechhHheecccccCCccccCcchHHHHHHH Confidence 445 222111 1222333 5666776 222334456678999999999999888899999999999999 Q ss_pred HHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhh-h-hhheeeeeccccC Q lcl|NC_011269. 311 LMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMV-H-NFGLKVENVFGRE 388 (867) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~ 388 (867) |-..++|..|+ .++.++.+-|-.+--- . +|+....+...++-...+.-.-=++| | +-|-+|+.+-..- T Consensus 254 l~~l~~y~dae-l~~aki~A~~a~fi~~----~-----~~~~~~~~~~~~~~~~~~~~~pG~iv~~L~pGe~i~~~~p~~ 323 (548) T protein:vir:95 254 LADLKDYEESE-RVAARISAALAMYIKK----G-----NPDSYTVEPGKDRKNRTIPIAPGMVFDDLEPGEDVGMIESNR 323 (548) T ss_pred HHHHhHHHHHH-HHHHHHhhhheeeeec----C-----CCccccCCCCcccccccccccCCccccccCCCceeeecCCCC Confidence 99999999998 6777777776655222 1 11111111112222222222223333 3 4467777766555 Q ss_pred ccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHH-------hhcc Q lcl|NC_011269. 389 SVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAE-------AQGH 461 (867) Q Consensus 389 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e-------~q~~ 461 (867) ---+.++=++.+.+.|..+|||.-.++||-.+.+|+++-.++--.-+.+...|.+| |.+.|++|-+ +.|. T Consensus 324 p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~nYSS~R~~l~e~~r~~~~~q~~~---i~~~~~Pi~~~wle~a~l~G~ 400 (548) T protein:vir:95 324 PNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDGTYSAQRQELVEGWLGYDLLQHEF---IDYWCRPVYRSWLQMYLLARK 400 (548) T ss_pred CCCCHHHHHHHHHHHHHhhcCCCHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHcCC Confidence 56677888888999999999999999996556699999888854444455555554 4455555422 4443 Q ss_pred cch----heehhhcc---ccchhhhhhhhhhhhhH--hhhhhhhhhhhhccccccccch-hhhhhhhhhhhhceeeee-c Q lcl|NC_011269. 462 YDY----DLKGGVRV---PIYREIVEYDEETGQEY--IRKVPKLLIPEIKFSTLNLRDE-AQERAFIAQLKGMGVPVS-D 530 (867) Q Consensus 462 ~d~----~~~~~~~~---~~~rd~~~~k~e~~k~~--~r~~~k~i~~~i~~~~~~Lr~e-~~~~~~v~qL~~~~~pit-d 530 (867) .+- +....+++ .-.|..++..||+.-.. ++..++-.-..|.....|.... .+-.......+..+.++. + T Consensus 401 i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~~~~~ 480 (548) T protein:vir:95 401 ERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAGLVFSSD 480 (548) T ss_pred cCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCc Confidence 320 11112222 12256688888774432 3322222222233222322221 111111222233333332 1 Q ss_pred cccCCCcccccchhhhhhH-HHHHHHHhhcccccccccccccccCCCCCccccccccccccCCCCCCCCCCCCCCCCccC Q lcl|NC_011269. 531 KTLAVNIDMKFDQELERQA-DETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLALRQGKTQTELGEAQAVAGE 609 (867) Q Consensus 531 ~t~p~tiqme~E~e~e~k~-~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~a~gpgq~~~~qa~~~agq 609 (867) +..... ....++.++.++ .-...+.....+....+. ....++|.+.+ .-+.+. ...+. T Consensus 481 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~-----------------~~~~~~-~~~~~ 539 (548) T protein:vir:95 481 AYHQLV-KSGMDPVEAVQKVYLGVGKMLTADEARELVN--RYGAGLPVPGP-----------------DFPNES-NNGGA 539 (548) T ss_pred cccccc-ccccCCCCchhhhccccccccccchhHHhhc--cCCCCCcCCCC-----------------CCCccc-ccCCC Confidence 111111 111111111110 000001111111111111 11112221110 000000 00111 Q ss_pred CCCccCCCC Q lcl|NC_011269. 610 AQAELQTKQ 618 (867) Q Consensus 610 ~~~p~~~~~ 618 (867) .+.+..+-+ T Consensus 540 ~~~~~~~~~ 548 (548) T protein:vir:95 540 DGQPSNPDP 548 (548) T ss_pred CCCCCCCCC Confidence 111000000 No 3 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.27 E-value=1.6e-13 Score=90.76 Aligned_cols=465 Identities=10% Similarity=0.041 Sum_probs=223.3 Q ss_pred CCCCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCch Q lcl|NC_011269. 24 VNMPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNV 103 (867) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 103 (867) .++|- --+++..+....+++...|...+..+++. +.|-...+|..|-. T Consensus 1 ~~~~~------------~~~~~~~~~~~~~~~~~~~a~~~~~~~~~------------------w~~~~~s~~~~i~~-- 48 (530) T protein:vir:38 1 MKIPS------------LVGPDGKTSLREYAGYHGGGGGFGGQLRG------------------WNPPSESADAALLP-- 48 (530) T ss_pred Cccce------------eecCccccchHHHhhhhcccCCCCCcccc------------------cccCCCCHHHHHHH-- Confidence 11111 11223333333333322222111111111 12222334444422 Q ss_pred hhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccc-ceeccc------------ch----hHHHHHHHHh-------- Q lcl|NC_011269. 104 EDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVG-MEFDSK------------DP----LIKTFYEDLF-------- 158 (867) Q Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~------------~~----~~~~~~~~~~-------- 158 (867) .++.+++=||..|+.+|++..+|+.+...-|+. |...++ |. .|+..|.+-+ T Consensus 49 ----~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D 124 (530) T protein:vir:38 49 ----NYSRGNARADDLVRNNGYAANAVQLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGID 124 (530) T ss_pred ----HHHHHHHHHHHHHhcChHHHHHHHHHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEe Confidence 233455558899999999999999999999987 776664 22 2333343211 Q ss_pred hcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccc-e-ehheecCcceeehhhhhhhcchHHHHHHHHHHhhcccccc Q lcl|NC_011269. 159 FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGV-W-SSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPT 236 (867) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (867) +--++|+..+..- +-|.++.-||+|-..++.++.|. | -.+.+|+||+|.-...+-... .+++| T Consensus 125 ~~g~~~f~~~q~l-~~r~~~~dGE~~~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~------------~i~~G-- 189 (530) T protein:vir:38 125 AERKRTFTMMIRE-GVAMHAFNGELCVQATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTR------------NCRAG-- 189 (530) T ss_pred eeccCCHHHHHHH-HHHHHhhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCC------------eeEee-- Confidence 1124566666655 55999999999999988876542 3 257889999987653221111 23334 Q ss_pred ccccccccccccchhhhhhhhhHHHHHHhc-hH--HHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHH Q lcl|NC_011269. 237 TAGGNMSTVEETPSEREQRMREFQDLQRRY-PE--IIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMA 313 (867) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (867) +|-.-.-| -.-|- |++.+ |. .-........++++-+.|.||...--+=.+||.|.|..++..|-. T Consensus 190 --------Ie~d~~Gr---~~aY~-i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~ 257 (530) T protein:vir:38 190 --------VKINDSGA---ALGYY-VSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKM 257 (530) T ss_pred --------eEECCCCc---eEEEE-EeeccCCCccccccceeeeeeccChhHeEeeccccCCCcccCCchHHHHHHHHHH Confidence 22222111 11222 23332 21 001111123467788899999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhhchhhhh---hhcccccCCCCcCC-----CCHHHHHHHHHHHHH-----hhh-cchhhhhhhhhe Q lcl|NC_011269. 314 EESLNAAQDAVADRLYSPLVLA---TLGIEDMGDGEPWI-----PDQGELDEVRDDMQS-----LLA-ADFRLMVHNFGL 379 (867) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~-----~~~-~~~~~~~~~~~~ 379 (867) .++|..|+- ++.++.+=|-.| ..+.+..+ +... ++...+...-.+... .+- -.-.+...+-|- T Consensus 258 l~~y~dael-~~a~i~A~~a~fi~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe 334 (530) T protein:vir:38 258 LDTLQNTQL-QSAIVKAMYAATIESELDTQSAM--DFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGD 334 (530) T ss_pred HhHHHHHHH-HHHHHhhhheeeeeccCCccccc--cccccCCcccccccccccchhhhhcccccceeccCceeeecCCCC Confidence 999999873 223333222221 22211111 1000 011111111110000 000 012223345577 Q ss_pred eeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCC-ccceehhhhhHHHHHHHHHHHHHHHHHH-Hhhhh-HHHH Q lcl|NC_011269. 380 KVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGT-GGAYASSALNREFVTQIMTGFQNALKRH-IRRRC-EVVA 456 (867) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~r~~~-~~i~ 456 (867) +|+.+-..-.--+.++=++.+.+.|-.+|||+-.++||-- +.+|+++-.++--.-+.+...|.+|..+ ++.+. +|+. T Consensus 335 ~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~ 414 (530) T protein:vir:38 335 SLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLE 414 (530) T ss_pred eeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH Confidence 7777777766778888889999999999999999999543 6789999999866666677777777664 33333 4454 Q ss_pred H--hhcccch----hee------hhhcc---ccchhhhhhhhhhhhhH--hhhhhhhhhhhhccccccccchhhhhhh-h Q lcl|NC_011269. 457 E--AQGHYDY----DLK------GGVRV---PIYREIVEYDEETGQEY--IRKVPKLLIPEIKFSTLNLRDEAQERAF-I 518 (867) Q Consensus 457 e--~q~~~d~----~~~------~~~~~---~~~rd~~~~k~e~~k~~--~r~~~k~i~~~i~~~~~~Lr~e~~~~~~-v 518 (867) | +.|..+- .+. -.+++ .-.+..++..||+.-.. ++...+-.-..|.....|......+... . T Consensus 415 ~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~ 494 (530) T protein:vir:38 415 EAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQQVRES 494 (530) T ss_pred HHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHH Confidence 4 3332220 000 01111 12244566666653221 2221111111111111111110000000 0 Q ss_pred hhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcccccccccccc Q lcl|NC_011269. 519 AQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLA 590 (867) Q Consensus 519 ~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~ 590 (867) ...+..+.+ ..+... . ....+....++..+.-. +.+ T Consensus 495 ~~~~~~Gl~--------------------------------~~~~~~-~--~~~~~~~~~~~~~~d~~-~~a 530 (530) T protein:vir:38 495 MERRAAGLN--------------------------------PPAWAA-A--AFEAGVKKSNEEEQDGA-RAA 530 (530) T ss_pred HHHHHcCCC--------------------------------CCCCcc-c--ccCCCCCCCCCCCCCCC-CCC Confidence 000000000 000000 0 00000000000000000 000 No 4 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.24 E-value=6.1e-13 Score=87.56 Aligned_cols=465 Identities=11% Similarity=0.064 Sum_probs=218.7 Q ss_pred CCCCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhccccccc-ceeeccchhhhhhhhhHHhhCCCc Q lcl|NC_011269. 24 VNMPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGS-NMQIAMPKIRQPLGTLADKGIPFN 102 (867) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 102 (867) +.||.-+. +++-. ++.+ ++....|..-+...+ -|.- .+|....+|..| T Consensus 1 ~~~p~~~~---------------------~~~~~-~~~~-~~~~~~y~~~a~~~~~~~~~-----w~p~~~s~~~~~--- 49 (533) T protein:vir:34 1 MKTPTIPT---------------------LLGPD-GMTS-LREYAGYHGGGSGFGGQLRS-----WNPPSESVDAAL--- 49 (533) T ss_pred CCCchhhh---------------------hhccc-ccch-HHHHHhhhhccCCCCCcccc-----cccCCCCHHHHH--- Confidence 33332111 11100 1111 122222321111111 1111 123333344333 Q ss_pred hhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccc-ceeccc------------c----hhHHHHHHHHh------- Q lcl|NC_011269. 103 VEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVG-MEFDSK------------D----PLIKTFYEDLF------- 158 (867) Q Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~------------~----~~~~~~~~~~~------- 158 (867) +..++.+++=||..|+.+|++..+||.+..+-|+. |.+.++ | ..|+..|.+-+ T Consensus 50 ---~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~ 126 (533) T protein:vir:34 50 ---LPNFTRGNARADDLVRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCI 126 (533) T ss_pred ---HHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCcccee Confidence 22344556668899999999999999999999987 877764 2 23333343321 Q ss_pred -hcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhcc-cee-hheecCcceeehhhhhhhcchHHHHHHHHHHhhccccc Q lcl|NC_011269. 159 -FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLG-VWS-SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGP 235 (867) Q Consensus 159 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (867) +--+||+..+..- +-|.+++-||+|-..++.+..| .|. .+.+|+||+|.-....-.. . .+++| T Consensus 127 D~~g~~~f~~~q~l-~~r~~~~dGE~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~-~-----------~i~~G- 192 (533) T protein:vir:34 127 DVERKRTFTMMIRE-GVAMHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDS-R-----------NCRAG- 192 (533) T ss_pred ccccccCHHHHHHH-HHHHHHhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCC-C-----------ceEee- Confidence 2335667776665 4599999999999998887654 232 5688999998765322111 1 12333 Q ss_pred cccccccccccccchhhhhhhhhHHHHHHhchHH--HhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHH Q lcl|NC_011269. 236 TTAGGNMSTVEETPSEREQRMREFQDLQRRYPEI--IQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMA 313 (867) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (867) +|-.-.-|- .-|-=..+.+|.- ........-++++-+.|.||...--+=..||.|.|..++..|-. T Consensus 193 ---------Ie~d~~Gr~---~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~ 260 (533) T protein:vir:34 193 ---------VQINDSGAA---LGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKM 260 (533) T ss_pred ---------eEECCCCCe---EEEEEeecCCCCccccccceeeeeeccChhHeeeeccccCCCcccCCchHHHHHHHHHH Confidence 222111111 1121111222320 01111123456788899999999989999999999999999999 Q ss_pred HHHHHHHHHHHHhhhhchhhhh---hhcccccCCCCcCCCCHHHHHHHHHHHHHh----------hhcchhhhhhhhhee Q lcl|NC_011269. 314 EESLNAAQDAVADRLYSPLVLA---TLGIEDMGDGEPWIPDQGELDEVRDDMQSL----------LAADFRLMVHNFGLK 380 (867) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~ 380 (867) .++|..|+- ++.+..+=|-.+ ..+.++.....--....+.-+.+-+..... |. .-.+....-|-+ T Consensus 261 l~~y~dael-~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-pG~i~~L~pGe~ 338 (533) T protein:vir:34 261 LDTLQNTQL-QSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLG-GAKVPHLMPGDS 338 (533) T ss_pred HHHHHHHHH-HHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcceeecc-CceeeecCCCCe Confidence 999999883 222232222222 222211111000001111111111000000 11 122233455677 Q ss_pred eeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCC-ccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHH---- Q lcl|NC_011269. 381 VENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGT-GGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVV---- 455 (867) Q Consensus 381 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i---- 455 (867) |+.+-..-.--+.++=++.+.+.|-.+|||+-.++||-- +.||+++-.++--.-+.+-..|.+|.. +.|++| T Consensus 339 i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~---~~~~pi~~~w 415 (533) T protein:vir:34 339 LNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVAS---RQASQMFLCW 415 (533) T ss_pred eeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHH Confidence 777666666678888899999999999999999999552 679999988884444445555655544 334433 Q ss_pred -HH--hhcccc--------h--heehhhcc---ccchhhhhhhhhhhhhH--hhhhhhhhhhhhccccccccchhhhhhh Q lcl|NC_011269. 456 -AE--AQGHYD--------Y--DLKGGVRV---PIYREIVEYDEETGQEY--IRKVPKLLIPEIKFSTLNLRDEAQERAF 517 (867) Q Consensus 456 -~e--~q~~~d--------~--~~~~~~~~---~~~rd~~~~k~e~~k~~--~r~~~k~i~~~i~~~~~~Lr~e~~~~~~ 517 (867) .| +.|..+ + ...-.+++ .-.+..++..||+.-.. ++...+-.-..|.....|......+... T Consensus 416 l~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~ev~~q~a~ 495 (533) T protein:vir:34 416 LEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQQVR 495 (533) T ss_pred HHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHH Confidence 22 444332 0 00111121 12255567777663322 2211111111111111111110000000 Q ss_pred -hhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCC-CccccccccccccCCC Q lcl|NC_011269. 518 -IAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPY-PPELAQHLQSTLALRQ 593 (867) Q Consensus 518 -v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~-pp~~aQ~p~~t~~~a~ 593 (867) ....+..+.+. .+.... ....+.+. ..+.... .. .+ T Consensus 496 e~~~~~~~gl~~--------------------------------~~~~~~---~~~s~~~~~~~~~~~~--~~---~~ 533 (533) T protein:vir:34 496 ETMERRAAGLKP--------------------------------PAWAAA---AFESGLRQSTEEEKSD--SR---AA 533 (533) T ss_pred HHHHHHhcCCCC--------------------------------CCCCCc---CccCCCCCCCCCCccc--CC---CC Confidence 00000011000 000000 00000000 0000000 00 00 No 5 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.22 E-value=1.7e-12 Score=85.16 Aligned_cols=458 Identities=15% Similarity=0.088 Sum_probs=225.7 Q ss_pred HHHHHHhcCCCCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhH Q lcl|NC_011269. 16 VNRLRKAGVNMPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLA 95 (867) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 95 (867) +|-|-|+-- -++ |-..+.+. ..|+.+..|.-.+ .+-..-..| ....+ T Consensus 1 mn~~dr~i~------------------~~s--P~~~~~R~------~ar~~~~~y~aa~--~~r~~~~~~-----~~~s~ 47 (502) T protein:vir:79 1 MAILDDVIG------------------VFS--PGWKAARL------RSRAVIQAYEAVK--TTRTHKARR-----ENRTA 47 (502) T ss_pred CchHhhHHh------------------hcC--hHHHHHHH------hhHHHHhhccccC--cccccCCCC-----CCCCh Confidence 111111100 011 12222221 2333333343322 122222222 23334 Q ss_pred HhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcccc--ccee----cccchhH--------HHHHHHHh--- Q lcl|NC_011269. 96 DKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVV--GMEF----DSKDPLI--------KTFYEDLF--- 158 (867) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~----~~~~~~~--------~~~~~~~~--- 158 (867) |..| +..++.+++=||..|+..|++..+|+.+...-|+ ++.+ ...|..+ ++.|++-+ T Consensus 48 ~~~~------~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~ 121 (502) T protein:vir:79 48 DQLS------QYGAVSLREQARYLDNNHDLVIGVFDKLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSP 121 (502) T ss_pred HHHH------HHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCc Confidence 4433 3345567778899999999999999999999997 3543 3334322 23332211 Q ss_pred -hcccccHHHHhHHHHHHHHhhhhhhcchhhhhhh-----cccee-hheecCcceeehhhhhhhcchHHHHHHHHHHhhc Q lcl|NC_011269. 159 -FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNES-----LGVWS-SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHL 231 (867) Q Consensus 159 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (867) +..+||+..+..- +-|.++.-||+|-..++.+. +.-|. .+.+|+||+|..-. .-+ ..+ T Consensus 122 D~~g~~~f~~~q~l-~~r~~~~dGE~f~~~~~~~~~~~~~g~~~~l~lq~iepd~l~~~~---~~~-----------~~i 186 (502) T protein:vir:79 122 EVTGQFTRPMLERL-MLRTWLRDGEVFAQMVSGRINSLTPSAGVHFWLEALEPDFIPMTS---DES-----------NRL 186 (502) T ss_pred CccccCCHHHHHHH-HHHHHHhCCceEEEEeecccCccCCCcccceEEEEecchhcCCCC---CCC-----------Cee Confidence 2235666666555 55999999999988777542 22232 56788888886542 111 123 Q ss_pred cccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHH Q lcl|NC_011269. 232 RQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTL 311 (867) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (867) ++| +|-..-.| -.-| -|++.+|- + ....+-.+|+-+.|.||...--+=..||.|.|..+...| T Consensus 187 ~~G----------Ve~d~~Gr---~~aY-~i~~~hPg--d-~~~~~~~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l 249 (502) T protein:vir:79 187 NQG----------VFVDDWGR---PEKY-LVYKSRPV--S-GRQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRL 249 (502) T ss_pred Eee----------eEECCCCc---eEEE-EEeecCCC--C-CcccceeEechhheEEeecccCCccccCCchHHHHHHHH Confidence 344 22211111 1111 13456665 3 234456789999999999988899999999999999999 Q ss_pred HHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHH-HHHHHHHHHhhhcchhh-hhh-hhheeeeeccccC Q lcl|NC_011269. 312 MAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGEL-DEVRDDMQSLLAADFRL-MVH-NFGLKVENVFGRE 388 (867) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~~ 388 (867) -..++|..|+ -++.++.+-|-.|--- .+|+...+....- +..+. .-|. -=+ +.| +-|-+|+.+-..- T Consensus 250 ~~l~~~~dae-l~~a~i~A~~~~fi~~----~~~~~~~~~~~~~~~~~~~---~~l~--pG~i~~~L~pGe~i~~~~p~~ 319 (502) T protein:vir:79 250 SALKEYEDSE-LTAARIAAALGMYIRK----GDGQSYEPDGNGSKENERE---LTIQ--PGIIYDDLKPGEEIGMVKSDR 319 (502) T ss_pred HHHhHHHHHH-HHHHHHhhhheeeeec----CCCcccccccCCCCCcccc---cccc--CCccccccCCCceeeeeCCCC Confidence 9999999998 5666666665554211 1122111111100 00000 0111 112 233 5588888877776 Q ss_pred ccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHH-----HH--hhcc Q lcl|NC_011269. 389 SVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVV-----AE--AQGH 461 (867) Q Consensus 389 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i-----~e--~q~~ 461 (867) ..-+.++=++.+.+.|..+|||...++||-.+.+|+++-.++--.-+.+-..|.+|. ++.|++| .+ +.|. T Consensus 320 p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~nySs~R~~~~e~~r~~~~~q~~~~---~~~~~pi~~~~l~~a~l~G~ 396 (502) T protein:vir:79 320 PNPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTDGYLILQDWFI---GAVTRPMYRAWLKQAVASGV 396 (502) T ss_pred CCCCHHHHHHHHHHHHHhhcCCCHHHHhccccchHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHcCC Confidence 777888889999999999999999999966567999998888555555666666554 3445444 22 4443 Q ss_pred cch----heehhhcc---ccchhhhhhhhhhhhhH--hhhhhhhhhhhhccccccccchhhhhh-hhhhhhhceeeeecc Q lcl|NC_011269. 462 YDY----DLKGGVRV---PIYREIVEYDEETGQEY--IRKVPKLLIPEIKFSTLNLRDEAQERA-FIAQLKGMGVPVSDK 531 (867) Q Consensus 462 ~d~----~~~~~~~~---~~~rd~~~~k~e~~k~~--~r~~~k~i~~~i~~~~~~Lr~e~~~~~-~v~qL~~~~~pitd~ 531 (867) ++- +....+++ .-.|..++..||+.-.. ++..++-....+.....|......+.. .....+..+.+.. T Consensus 397 i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~-- 474 (502) T protein:vir:79 397 IRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDENRKLDLVFD-- 474 (502) T ss_pred CCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCC-- Confidence 220 11111121 12245566666553221 221111111111111111111000000 0000011111000 Q ss_pred ccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccccccCCCCCCCC Q lcl|NC_011269. 532 TLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLALRQGKTQT 598 (867) Q Consensus 532 t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~a~gpgq~ 598 (867) +.+... ++....+...+.+..+.. ..+. T Consensus 475 ------------------------------~~~~~~-----~~~~~~~~~~~e~~~~~~----~~e~ 502 (502) T protein:vir:79 475 ------------------------------TDPASD-----KGGSSAATKRQEPQHTDD----QSEE 502 (502) T ss_pred ------------------------------CCCCCC-----CCCCCCCCCCCCCCCCCC----CCCC Confidence 000000 000000000000000000 0000 No 6 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.08 E-value=6.7e-11 Score=76.36 Aligned_cols=464 Identities=11% Similarity=0.047 Sum_probs=212.8 Q ss_pred hhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeec-cchhhhhhhhhHHhhCCCchhhhHHHH Q lcl|NC_011269. 32 MARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIA-MPKIRQPLGTLADKGIPFNVEDEEELR 110 (867) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 110 (867) |+.-.. +.+-+|+.- -+-++-+. +.++. +. ++....+.+ .|.-..++...+.. T Consensus 1 m~~~i~-----~~~g~p~~~--~~~~~~~~---~~ia~-~~--~~~~~~~~~~~~~~~~~iLr~~~~------------- 54 (491) T protein:vir:10 1 MSKGLW-----VSPTEFVTF--GEPDKSLS---SQIAT-RA--RSIDFFALGMYLPNPDPVLKALGK------------- 54 (491) T ss_pred CCCcee-----CCCCCccCc--ccCChHHH---HHHHh-hh--cccccccccCCccchHHHHHhcCC------------- Confidence 222111 112222210 00011111 11221 11 111111111 22223333332221 Q ss_pred HHHHHHHHHhhccchHHHHHHhhhhccccc--ceec--ccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcch Q lcl|NC_011269. 111 VIRHWCRLFYATHDLVPLLIDIYSKFPVVG--MEFD--SKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSL 186 (867) Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 186 (867) .++.+-.. ...+-|..||+- ++.+|.+ ++|+ +.|+.+.+|..+.+- ++++.++|.|+. .-.-.|=...= T Consensus 55 ~~~~y~~m--~~D~~i~s~l~~-Rk~av~~~~w~i~~~~~~~~~~e~v~e~l~--~~~~~~~l~~~l--da~~~G~s~~E 127 (491) T protein:vir:10 55 DIRVYREL--RADAHVGGCVRR-RKAAVKALEWGLDRGKAKSRVAKSIADVFA--DLDLSRIVTEML--DAVLYGYQPME 127 (491) T ss_pred CHHHHHHH--hhChHHHHHHHH-HHHHHhCCCcEEecCCCCHHHHHHHHHHHh--cCCHHHHHHHHH--HhhhhcceeEE Confidence 12223222 246677888876 4778876 4443 346668888888776 788889998865 22222322221 Q ss_pred hhhhhhcccee--hheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHH Q lcl|NC_011269. 187 AHFNESLGVWS--SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQR 264 (867) Q Consensus 187 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 264 (867) ..++..+|.|. .++.++|..++.. ...++.+ T Consensus 128 i~w~~~~g~~~~~~l~~r~~~~f~~d-----~~~~l~~------------------------------------------ 160 (491) T protein:vir:10 128 ITWGKVGNYIVPIDVVGKPADWFVYD-----PENQLRF------------------------------------------ 160 (491) T ss_pred EEEeecCCeeEEEEeeeecccceeec-----cCCceEE------------------------------------------ Confidence 22455666665 6667777665543 1111110 Q ss_pred hchHHHhhhccCCCCcccHH-HHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccC Q lcl|NC_011269. 265 RYPEIIQAAMQNDGLDISEA-LISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMG 343 (867) Q Consensus 265 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 343 (867) +..-.+.+|++|+.. .|.|. |+++.=.++|.+++..||...+.|...-+--...+.|+-.|+|+.|.+. + T Consensus 161 -----~~~~~~~~g~~l~~~k~i~~~-~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~---~ 231 (491) T protein:vir:10 161 -----RSKDHWMQGEELPARKFLVPR-QEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPR---S 231 (491) T ss_pred -----ecCCCCCCcceecCCCEEEEE-ecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCC---C Confidence 000012356777665 44444 6666667899999999999999999999999999999999999999863 1 Q ss_pred CCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCc---hhHHHHHHHHHHHHhhccchhhhcCCCc Q lcl|NC_011269. 344 DGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPN---LDADYDRIERKLLQAWGIGEALISGGTG 420 (867) Q Consensus 344 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 420 (867) . +++|.+.+-+-+.+ +..|..++ .--|-+||.+...++--+ -+.=++...++|..++ +|+.|+|+ .| T Consensus 232 a------~~~ek~~l~~al~~-~~~~a~~v-iP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i-LGqtlTt~-~~ 301 (491) T protein:vir:10 232 A------SDGEKNLLLDCLED-MVQDAVAV-VPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIAL-LGQNQTTE-AT 301 (491) T ss_pred C------CHHHHHHHHHHHHH-HhcCcEEE-ecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHH-hhhhcccC-cc Confidence 1 34555555553333 23343322 233566777655443322 2333556678888777 89999995 58 Q ss_pred cceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchh--hhhhhhhhhhhHhhhhhhhhh Q lcl|NC_011269. 421 GAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYRE--IVEYDEETGQEYIRKVPKLLI 498 (867) Q Consensus 421 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd--~~~~k~e~~k~~~r~~~k~i~ 498 (867) ++|+.+.|-.+....+.-.....|..+++++++++.++|.+...-.++.+....-.+ ..+..+++.....+....-+. T Consensus 302 gs~a~~~vh~~v~~di~~~D~~~i~~tln~li~~l~~~N~~~~~~p~f~~~~~~e~~~~~a~~~~~L~~~G~~i~~~~i~ 381 (491) T protein:vir:10 302 STRASAQAGLEVTDDIRDGDKAVVSEAMNMLIRWICDLNFDGADRPVFDMWEQEQVDEIQAGRDQKLTQAGARFTPAYFK 381 (491) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEecCcCchhHHHHHHHHHHHhCCCcCCHHHHH Confidence 899999999988777788888899999999889999999765543444333322111 011111111111111111111 Q ss_pred hhhccccccccchhhhhhhhhhhhhceee-eeccccC-CCcccccchhhhhhHH-----------HHHHHHhhccccccc Q lcl|NC_011269. 499 PEIKFSTLNLRDEAQERAFIAQLKGMGVP-VSDKTLA-VNIDMKFDQELERQAD-----------ETVQKLMATAQAMKK 565 (867) Q Consensus 499 ~~i~~~~~~Lr~e~~~~~~v~qL~~~~~p-itd~t~p-~tiqme~E~e~e~k~~-----------E~l~tL~~taet~kk 565 (867) .+........ .+. +......... ....... ...+...+........ +.+..+...+.+... T Consensus 382 e~~Gip~~~~-----~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~s~~e 455 (491) T protein:vir:10 382 RAYNLQDGDL-----DER-PLPVSAVDTVGAASFAEFEAPDQDALDAALNTLSARDLNADAQALVAPLLKRIANGASADE 455 (491) T ss_pred HHhCCCCCCc-----Ccc-ccccCCCCCcccccccccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHH Confidence 1111111100 000 0000000000 0000000 0000111111111100 111111222222221 Q ss_pred ccccccccCCCC-CccccccccccccCCCCCCCCCC Q lcl|NC_011269. 566 VQDLCDAQNLPY-PPELAQHLQSTLALRQGKTQTEL 600 (867) Q Consensus 566 vq~~~p~~g~P~-pp~~aQ~p~~t~~~a~gpgq~~~ 600 (867) +...-...-... ..++.+........+...+.... T Consensus 456 ~~~~L~~l~~~~d~~~l~~~l~~a~~~A~l~G~~~a 491 (491) T protein:vir:10 456 LLGMLAELYPSLDADALQERLARAIFVANLWGRLHA 491 (491) T ss_pred HHHHHHHHhhcCCHHHHHHHHHHHHHHHHHhhhccC Confidence 111100000000 01111111110000000111111 No 7 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.05 E-value=3.9e-11 Score=77.62 Aligned_cols=452 Identities=11% Similarity=0.065 Sum_probs=205.5 Q ss_pred hhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHH-----hhC-CCch--hhhHH-HHHHHHHHHHH--hhccc Q lcl|NC_011269. 56 RRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLAD-----KGI-PFNV--EDEEE-LRVIRHWCRLF--YATHD 124 (867) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~-~~~~--~~~~~-~~~~~~~~~~~--~~~~~ 124 (867) |..++- -..|++ +..+ ..++++.+... ..+ |+.- ++++. |+..-.=|+.| ....+ T Consensus 1 ~~~~i~---------~~~g~~---~~~~--~~~~~~~~~ia~~~~~~~~~~~~~~~p~~~~il~~~~~~~~~y~~m~~D~ 66 (491) T protein:vir:79 1 MSKGLW---------VSPTEF---VKFG--EPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADA 66 (491) T ss_pred CCCeee---------CCCCCc---cccc--ccchhHHHHHhhhccccccccccccCcchhHHHhhccCCHHHHHHHhhCh Confidence 211110 001111 1111 11223222111 001 1110 11111 11000001111 01245 Q ss_pred hHHHHHHhhhhcccccceec----ccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccce--eh Q lcl|NC_011269. 125 LVPLLIDIYSKFPVVGMEFD----SKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVW--SS 198 (867) Q Consensus 125 ~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 198 (867) -|..||+- ++.+|.+.++. +.|+-+.+|..+.+- ++++.++|.|+.- -.-.|=...=..+...+|.| .+ T Consensus 67 ~i~s~l~~-Rk~av~~~~w~i~~~~~~~~~a~~i~e~l~--~~~~~~~i~~~ld--a~~~G~s~~Ei~w~~~~g~~~~~~ 141 (491) T protein:vir:79 67 HVGGCVRR-RKAAVKALEWGLDRGKAKSRVAKSIADVFA--DLDLSRIATEMLD--AVLYGYQPMEITWGKVGNYIVPID 141 (491) T ss_pred HHHHHHHH-HHHHHhCCCcEEecCCCCHHHHHHHHHHHh--cCCHHHHHHHHHH--hhhhcceeEEEEEeecCCeeeEEe Confidence 56666664 46777764433 455667888888876 7888899988652 11122222112235566766 36 Q ss_pred heecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhh-ccCC Q lcl|NC_011269. 199 EEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAA-MQND 277 (867) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 277 (867) ++..+|..+.+.. ..++. +.+. ...+ T Consensus 142 l~~r~~~~f~~d~-----~~~l~------------------------------------------------l~~~~~~~~ 168 (491) T protein:vir:79 142 VVGKPADWFVYDP-----ENQLR------------------------------------------------FRSKEHWVQ 168 (491) T ss_pred eeeecccceeecc-----CCceE------------------------------------------------EeecCCCCC Confidence 7777776655431 11111 0000 2345 Q ss_pred CCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHH Q lcl|NC_011269. 278 GLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDE 357 (867) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 357 (867) |++|+..-.....|+++.=.++|..++..||...+.++..-+.-...+.|+-.|+|+.|.+. + =+++|.+. T Consensus 169 g~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~---~------a~~~ek~~ 239 (491) T protein:vir:79 169 GEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPR---S------ASDAETNL 239 (491) T ss_pred ceeecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCC---C------CCHHHHHH Confidence 67776554434447777777999999999999999999999999999999999999999863 1 13445555 Q ss_pred HHHHHHHhhhcchhhhhhhhheeeeecccc---CccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHH Q lcl|NC_011269. 358 VRDDMQSLLAADFRLMVHNFGLKVENVFGR---ESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVT 434 (867) Q Consensus 358 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 434 (867) +-+.+.+ +..|.. +|.--|-+||.+-+. ++.-.-+.=++...++|..++ +|+.|+|+ .|++|+.+.|-.+... T Consensus 240 l~~al~~-~~~~a~-~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i-LGqtlTt~-~~gs~a~~~vh~~v~~ 315 (491) T protein:vir:79 240 LLDRLED-MVQDAV-AVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIAL-LGQNQTTE-ATSTRASAQAGLEVTD 315 (491) T ss_pred HHHHHHH-HhcCeE-EEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHH-hhhhhccC-cccchhhHHHHHHHHH Confidence 4443333 233433 223334566665433 222222334555678888888 89999994 6899999999998877 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchh-h-hhhhhhhhhhHhhhhhhhhhhhhccccccccchh Q lcl|NC_011269. 435 QIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYRE-I-VEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEA 512 (867) Q Consensus 435 ~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd-~-~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~ 512 (867) .+.-.-...|..+++++++++.++|.+.....++.+-...-.+ + .+..+++.....+....-+..+...-.... +. T Consensus 316 ~i~~~D~~~i~~tln~li~~l~~~N~~~~~~p~f~~~e~ee~~~~~a~~~~~L~~~G~~i~~~~~~e~~Gip~~~~--~e 393 (491) T protein:vir:79 316 DIRDGDKAIVVEAMNMLIRWICDLNFDGAARPVFDMWEQEQVDEIQAGRDEKLTRAGARFTPAYFKRAYNLQDGDL--DE 393 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEeecCcCchhHHHHHHHHHHHhCCCccCHHHHHHHhCCCCCCC--Cc Confidence 7788888999999999889999999876555444333222111 0 111111111111111111111111111100 00 Q ss_pred hhhhhhhhhhhcee----eeeccccCCCcccccchhh--------hhhHH---HHHHHHhhcccccccccccccccCCCC Q lcl|NC_011269. 513 QERAFIAQLKGMGV----PVSDKTLAVNIDMKFDQEL--------ERQAD---ETVQKLMATAQAMKKVQDLCDAQNLPY 577 (867) Q Consensus 513 ~~~~~v~qL~~~~~----pitd~t~p~tiqme~E~e~--------e~k~~---E~l~tL~~taet~kkvq~~~p~~g~P~ 577 (867) .+........ .......+. +...+... +.... +.+..+...+.+...+...-...-... T Consensus 394 ----~~~~~~~~~~~~~~~~~~~~~~~--~~~~d~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~s~~e~~~~L~~l~~~~ 467 (491) T protein:vir:79 394 ----RPLPVSAVDAVGAASFAEFEAPD--QDALDAALNALSARDLNADAQALVAPLLKRIANGASADELLGMLAELYPSL 467 (491) T ss_pred ----cccCcCcccccccccccccCCCC--CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhhcC Confidence 0000000000 000000000 00011000 00000 111112222222221111100000000 Q ss_pred C-ccccccccccccCCCCCCCCCC Q lcl|NC_011269. 578 P-PELAQHLQSTLALRQGKTQTEL 600 (867) Q Consensus 578 p-p~~aQ~p~~t~~~a~gpgq~~~ 600 (867) + ..+..........+...+.... T Consensus 468 d~~~l~~~l~~a~~~A~l~Gr~~a 491 (491) T protein:vir:79 468 DTDALQERLARAIFVANLWGRLHA 491 (491) T ss_pred CHHHHHHHHHHHHHHHHHhhhccC Confidence 0 1111111100000110111111 No 8 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.04 E-value=7.6e-11 Score=76.06 Aligned_cols=464 Identities=14% Similarity=0.109 Sum_probs=221.1 Q ss_pred hhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHH Q lcl|NC_011269. 32 MARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRV 111 (867) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 111 (867) |.+| -.+.|.++-- +.+....++ +. -+.....|.--+....|... .++|-...+|. +-+..++. T Consensus 1 ~~r~--~~~~~~~dr~--i~~~~~~~~-~~-~~~~~~~y~aa~~~r~~~~w----~~~~~~~s~~~------~i~~~~~~ 64 (505) T protein:vir:96 1 MKRA--EKKPSLAQRM--VNWAWYRYV-EP-QKNAARAFEAARRDRLGKAW----LRRASRLSADE------EIYADLAS 64 (505) T ss_pred CCCC--ccccchhhcc--cchhhhhhH-HH-HHHhhhhcccccCCCccccc----cCCCCCCChHH------HHHHHHHH Confidence 3332 2233322211 111111111 00 01111123222211111110 11232233332 33445666 Q ss_pred HHHHHHHHhhccchHHHHHHhhhhcccc--cceeccc--------chhHHHHHHHHh--hcc--------cccHHHHhHH Q lcl|NC_011269. 112 IRHWCRLFYATHDLVPLLIDIYSKFPVV--GMEFDSK--------DPLIKTFYEDLF--FGE--------DLNYLEFLPD 171 (867) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~--------~~~~~~~~~~~~--~~~--------~~~~~~~~~~ 171 (867) +++=||..|+.++++.-.|+.+...-|+ ||.+.+. |+.+.+-.+.+| |.+ ++|+..+. . T Consensus 65 lr~RaRdL~rNn~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq-~ 143 (505) T protein:vir:96 65 LVQRAREQSINNPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLL-H 143 (505) T ss_pred HHHHHHHHHhcChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHH-H Confidence 7777999999999999999999999996 6887764 444444333332 333 34444444 3 Q ss_pred HHHHHHhhhhhhcchhhhhhhcccee-hheecCcceeehhhhhhhc-chHHHHHHHHHHhhccccccccccccccccccc Q lcl|NC_011269. 172 QFAREYFTVGEVTSLAHFNESLGVWS-SEEILNPDMLRVSRSMFVQ-RERVQLMVKDLVDHLRQGPTTAGGNMSTVEETP 249 (867) Q Consensus 172 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (867) ++-|.++.-||+|-.-+..+ ++.|. .+.+|+||+|......-.+ +.+ +++| +|-.. T Consensus 144 l~~r~~~~dGE~f~~~~~~~-~~~~~~~lqliepd~l~~~~n~~~~~~~~-----------i~~G----------Ie~d~ 201 (505) T protein:vir:96 144 LWMETLARDGEVLVREHRGY-PNKWGYALQILECDRLDLNYNADLQNGNR-----------IRMS----------IELDA 201 (505) T ss_pred HHHHHHhhCCceEEEEeecC-CCCcceEEEEechhhcCCCCCcccCCcCe-----------EEec----------eEECC Confidence 45699999999975443332 23333 5788888888765321111 111 2333 33222 Q ss_pred hhhhhhhhhHHHHHHhchH-H--HhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011269. 250 SEREQRMREFQDLQRRYPE-I--IQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVAD 326 (867) Q Consensus 250 ~~~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (867) ..| -.-|- |++.+|- . ..........+|+-+.|.||...--+=..||.|.|..++..|-..++|..|+-- +. T Consensus 202 ~Gr---~~aY~-i~~~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~-~a 276 (505) T protein:vir:96 202 WER---PVAYH-LLVNHPGDNSYCYHYAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMI-AA 276 (505) T ss_pred CCc---eEEEE-EeecCCCccccccccccccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHH-HH Confidence 211 11221 4556664 1 111233456678889999999988888999999999999999999999998743 33 Q ss_pred hhhchhhhh-hhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhh-hhheeeeeccccCccCchhHHHHHHHHHH Q lcl|NC_011269. 327 RLYSPLVLA-TLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVH-NFGLKVENVFGRESVPNLDADYDRIERKL 404 (867) Q Consensus 327 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 404 (867) ++.+=|-.+ |-.. =+.+.+ ..+.-.+....|+- - +++| +-|-+|+.+-..-.--+.++=++.+-+.| T Consensus 277 ~i~A~~a~fi~~~~--~~~~~~---~~~~~~~~~~~l~p-----G-~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~i 345 (505) T protein:vir:96 277 ELGAKKVGFYEQDP--EAYDQP---PEDDQGEIVEEVEA-----G-TYQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGV 345 (505) T ss_pred HHhhhheeeeecCC--ccCCCc---cccccCccccccCC-----c-eeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHH Confidence 333333222 1111 011211 11110011111211 1 2333 55667776655555667778888889999 Q ss_pred HHhhccchhhhcCCC-ccceehhhhhHHHHHHHHHHHHHHHHH-HHhhhhH-HHHH--hhcccc---hheehhhcc---c Q lcl|NC_011269. 405 LQAWGIGEALISGGT-GGAYASSALNREFVTQIMTGFQNALKR-HIRRRCE-VVAE--AQGHYD---YDLKGGVRV---P 473 (867) Q Consensus 405 ~~~~~~~~~~~~~g~-~~~~~~~~~~~~~~~~~~~~~~~~l~~-~~r~~~~-~i~e--~q~~~d---~~~~~~~~~---~ 473 (867) -.+|||.-..+||-- +.+|+++-.++--.-+.+...|.+|.. +++.+-+ |+.| +.|... ......+++ . T Consensus 346 aaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~ 425 (505) T protein:vir:96 346 AAGMGPAYNRLAHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQP 425 (505) T ss_pred HhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeecc Confidence 999999999999553 568999999886655667777777754 3443443 3333 343221 011111122 1 Q ss_pred cchhhhhhhhhhhhhH--hhhhhhhhhhhhccccccccchhhhhhh-hhhhhhceeeeeccccCCCcccccchhhhhhHH Q lcl|NC_011269. 474 IYREIVEYDEETGQEY--IRKVPKLLIPEIKFSTLNLRDEAQERAF-IAQLKGMGVPVSDKTLAVNIDMKFDQELERQAD 550 (867) Q Consensus 474 ~~rd~~~~k~e~~k~~--~r~~~k~i~~~i~~~~~~Lr~e~~~~~~-v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~ 550 (867) -.|..++..||+.-.. ++...+-....|.....|......+... ....+..+.....+..... T Consensus 426 p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~-------------- 491 (505) T protein:vir:96 426 RGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDPEDVFDEIAWEEQLMRDKGVNPTPPEQESK-------------- 491 (505) T ss_pred CCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCC-------------- Confidence 2255566666653322 2221111111222222222111000000 0111111110000000000 Q ss_pred HHHHHHhhcccccccccccccccCCCCCcccccccccc Q lcl|NC_011269. 551 ETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQST 588 (867) Q Consensus 551 E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t 588 (867) .. + ..+..+....- T Consensus 492 ----------------~~-------~-~~~~~~~~~d~ 505 (505) T protein:vir:96 492 ----------------DA-------T-TDEEDDSASDD 505 (505) T ss_pred ----------------CC-------C-CCCCCCCCCCC Confidence 00 0 00000000000 No 9 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.02 E-value=4.6e-11 Score=77.26 Aligned_cols=415 Identities=13% Similarity=0.129 Sum_probs=185.7 Q ss_pred HHHHhhccchHHHHHHhhhhcccc-cceec--------ccc-hhHHHHHHHHhhcccc------------cHHHHhHHHH Q lcl|NC_011269. 116 CRLFYATHDLVPLLIDIYSKFPVV-GMEFD--------SKD-PLIKTFYEDLFFGEDL------------NYLEFLPDQF 173 (867) Q Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~--------~~~-~~~~~~~~~~~~~~~~------------~~~~~~~~~~ 173 (867) .|.+=...+.|-.||++.++-=.+ ++++. .++ ..++.-+ +.++..+. ....||.-++ T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~-~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~ 79 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVW-DFWFGDDSNWQVGPMESERATATNVLQTAW 79 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHH-HHhhccCCCccccchhhHhhHHHHHHHHHH Confidence 667777899999999998864211 12221 111 2222222 22332332 2345666655 Q ss_pred HHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhh Q lcl|NC_011269. 174 AREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSERE 253 (867) Q Consensus 174 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 253 (867) ..|+.-|.++-+...|..+ ..-++..|+|++|++...- +-.+.+. .+ .......-... T Consensus 80 -~~l~l~Gn~~i~~~r~~~G-~~~~l~~l~~~~v~~~~d~---~~~~~~~---------~~-----~~~~~~~~~~~--- 137 (467) T protein:vir:31 80 -TDYEAIGWLTIEILTQTDG-TPTGLAYVPGHTIRKRMDE---RGFVQLL---------EE-----KEKYFGVAGDR--- 137 (467) T ss_pred -HHHHhcCCeEEEEEECCCC-cEEEEEEeCCceeEeeeec---ceeEeec---------CC-----ceeeEEecccc--- Confidence 6778889999888888775 4567899999999886211 1001000 00 00000000000 Q ss_pred hhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhh Q lcl|NC_011269. 254 QRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLV 333 (867) Q Consensus 254 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (867) ........+...|.+ +.....+....++..-|-|+++-.+....+|.|.+.-+.++|.............-+.-..|-- T Consensus 138 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~g 216 (467) T protein:vir:31 138 YQTNGNGDLDPVFVD-ADDGSTGTSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRI 216 (467) T ss_pred ceeecccceeeeeee-eccccccceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Confidence 000000111222222 1223455667788888999987766777799999998888876554444333333333333433 Q ss_pred hhhh-cccccCCCCcCCCCHHHHHHHHHHHHHhhhcchh--------------hhhhhhheeeeeccccCccC----chh Q lcl|NC_011269. 334 LATL-GIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFR--------------LMVHNFGLKVENVFGRESVP----NLD 394 (867) Q Consensus 334 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~----~~~ 394 (867) ++++ |. +-+++..+.+|+-|++.....+. .+|-.-|+....++-.-..| ..| T Consensus 217 il~~~~~---------~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d 287 (467) T protein:vir:31 217 AIIVKGA---------ELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEE 287 (467) T ss_pred EEEecCc---------CCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhh Confidence 3333 21 45889999999988776543322 12222333333333322222 224 Q ss_pred HHH----HHHHHHHHHhhccchhhhcCCCccceeh-h-hhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheeh Q lcl|NC_011269. 395 ADY----DRIERKLLQAWGIGEALISGGTGGAYAS-S-ALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKG 468 (867) Q Consensus 395 ~~~----~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~-~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~ 468 (867) .+| ++..++|.+++||.-.+|.-.++++|++ + +....|+.+-+.-+...|++++.+.+-. +.....++.|++ T Consensus 288 ~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~--~~~~~~~~~i~f 365 (467) T protein:vir:31 288 ASFLEFRGRNEHDILKVHDVPPVIAGVVESGAFSTDAEEQRKEFAEETIQPKQHDFGELLYELVHK--QGLDAPDWTIEF 365 (467) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--hhhccCCceEEE Confidence 333 3455679999999999985456666643 2 3444555555666666666666655421 222334555666 Q ss_pred hhccccchhhhhhhhhhhhhHhhh---hhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhh Q lcl|NC_011269. 469 GVRVPIYREIVEYDEETGQEYIRK---VPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQEL 545 (867) Q Consensus 469 ~~~~~~~rd~~~~k~e~~k~~~r~---~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~ 545 (867) -+..+...|..+.-+-..+ ..+. ....+.....+ .+ +.++.............+........ . -..++.. T Consensus 366 ~~~~l~~~d~~~~~~~~~~-~~~~G~~T~NE~R~~~Gl-~p-i~d~~~~~~~~~~~~~~~~~~~~~~~--~--~~~~~~~ 438 (467) T protein:vir:31 366 ELAKPDTKLQDVEIASQRV-QAMQGLLTVNELRDEFGF-EP-FPEEHVYGGETLVAEVTGGSGPGGGI--G--DQIEQLV 438 (467) T ss_pred ecchhhccCHHHHHHHHHH-HHhCCCcCHHHHHHHhCC-CC-CCcccccCCcccccccccccCCCCcc--c--CcCCCCC Confidence 6666654444332221211 1111 01111111111 01 11100000000000000000000000 0 0000101 Q ss_pred hhhHHHHHHHHhhcccccc--cccccccc Q lcl|NC_011269. 546 ERQADETVQKLMATAQAMK--KVQDLCDA 572 (867) Q Consensus 546 e~k~~E~l~tL~~taet~k--kvq~~~p~ 572 (867) +....+.+.-+...-++.+ +..+..+. T Consensus 439 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (467) T protein:vir:31 439 EDRADEIIDSYQADLETEQLIEIGANADS 467 (467) T ss_pred CCcccchHhhhhhccccchhhhhccccCC Confidence 1111111111100000000 00000000 No 10 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.01 E-value=2.2e-11 Score=79.06 Aligned_cols=451 Identities=12% Similarity=0.042 Sum_probs=216.0 Q ss_pred CCCCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCch Q lcl|NC_011269. 24 VNMPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNV 103 (867) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 103 (867) +|+=.. .+...++++..-.. ...|.--+.+-.+ + .. + .+.+| - T Consensus 1 m~~~~~----------------------~~~a~~~~~~~~~~-~~~y~aa~~~~~~-~--~~----~-~~s~d------~ 43 (495) T protein:vir:10 1 MNMTPS----------------------GYQSLASGLLVPVG-ASAYEGASGGHRW-Q--DI----G-DYGPD------T 43 (495) T ss_pred CCcccc----------------------cccccchhhhhHHH-hhhhhccccCccc-C--CC----C-CCChh------H Confidence 332211 11111111111000 0123222211111 1 00 0 12222 2 Q ss_pred hhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccc-ceeccc--chhHHHHHHHHh--hcc------cccHHHHhHHH Q lcl|NC_011269. 104 EDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVG-MEFDSK--DPLIKTFYEDLF--FGE------DLNYLEFLPDQ 172 (867) Q Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~--~~~~~~~~~~~~--~~~------~~~~~~~~~~~ 172 (867) +-+-.++.+++=||..|+.++++..+|+.+..+-|+. |...++ |+.+.+-.+.+| |.+ ++|+..+..- T Consensus 44 ~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l- 122 (495) T protein:vir:10 44 AVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQAL- 122 (495) T ss_pred HHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHH- Confidence 2233455666779999999999999999999999887 777664 544544444443 333 4566665554 Q ss_pred HHHHHhhhhhhcchhhhhhh--cccee-hheecCcceeehhhhhhh--cchHHHHHHHHHHhhccccccccccccccccc Q lcl|NC_011269. 173 FAREYFTVGEVTSLAHFNES--LGVWS-SEEILNPDMLRVSRSMFV--QRERVQLMVKDLVDHLRQGPTTAGGNMSTVEE 247 (867) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~--~~~~~-~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (867) +-|.++.-||+|-..++.+. ++-|. .+.+|+||+|.....+-. .+.+ +++| +|- T Consensus 123 ~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~~g~~-----------i~~G----------Ie~ 181 (495) T protein:vir:10 123 VVRTVINSGEAFVIKKPRPLSEGLSVPLQLQIIEPDMLASDIPDETLPSGGY-----------VKGG----------IRF 181 (495) T ss_pred HHHHHHhCCceEEEEeecccCCCCccceEEEEechhhcCCCCCCCCCCCCCE-----------EEec----------eEE Confidence 55999999999987776542 23343 678889999876532211 1111 2333 222 Q ss_pred cchhhhhhhhhHHHHHHhchH-HHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011269. 248 TPSEREQRMREFQDLQRRYPE-IIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVAD 326 (867) Q Consensus 248 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (867) ..- -|-.-|- |++.+|- -.......+-.+|+-+.|.||..+ .+=..||.|.| -+...|-..++|..|+- ++. T Consensus 182 d~~---Gr~vaY~-i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~~-r~gQ~RGis~l-a~i~~l~~l~~y~dael-~~a 254 (495) T protein:vir:10 182 SNG---GKRKAYC-FYRNHPAESSLIGDPVDTVWIKAEHVLHVTVL-TVRSDAGAPWF-QLLLRLNELDQYEDAEL-VRK 254 (495) T ss_pred CCC---CceEEEE-EeecCCCcccccccccceeeechhheEecccc-CCCcccCcchh-HHHHHHHHhhHHHHHHH-HHH Confidence 111 1222232 4566775 111223335578998999999754 67789999965 34444545555554432 566 Q ss_pred hhhchhhhhh---hcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhh-hhheeeeeccccCccCchhHHHHHHHH Q lcl|NC_011269. 327 RLYSPLVLAT---LGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVH-NFGLKVENVFGRESVPNLDADYDRIER 402 (867) Q Consensus 327 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 402 (867) ++.+-|-.+- .+.+.. ++. +.+ .+.+ .+....+...=-+++| +-|-+++.+-....--+.++=++.+.+ T Consensus 255 ~i~A~~~~fi~~~~~~~~~--~~~-~~~-~~~~---~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr 327 (495) T protein:vir:10 255 KTAALFAAFIQEATADSTG--GPT-IGQ-PKRS---KGGKRITGLNPGTLQYLQPGQEVKFSNPADVGTTYEPWLRYQLL 327 (495) T ss_pred HHhhhheeeeecCCCcccc--ccc-cCc-cccc---cCcccceecCCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHH Confidence 6666555442 222111 111 111 1111 1111112122223444 567888887777677788888999999 Q ss_pred HHHHhhccchhhhcCCC--ccceehhhhhHHHHHHHHH-HHHH-HHH-HHHhhhhH-HHHH--hhcccch----h-eehh Q lcl|NC_011269. 403 KLLQAWGIGEALISGGT--GGAYASSALNREFVTQIMT-GFQN-ALK-RHIRRRCE-VVAE--AQGHYDY----D-LKGG 469 (867) Q Consensus 403 ~~~~~~~~~~~~~~~g~--~~~~~~~~~~~~~~~~~~~-~~~~-~l~-~~~r~~~~-~i~e--~q~~~d~----~-~~~~ 469 (867) .|..+|||+-..+| |+ +.||+++-.++--. ++++ ..|. +|. +.+|.+-+ |+.+ +.|...- + ..-. T Consensus 328 ~iaaglGi~Ye~lt-gD~s~~nYSS~R~~~~e~-~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~ 405 (495) T protein:vir:10 328 SIAKGYGITYEMLT-GDLRGVNYSSIRAGLLEF-RRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYY 405 (495) T ss_pred HHHhhcCCCHHHHh-cccccccHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhh Confidence 99999999999999 55 66899998888222 3333 3443 333 23343332 3322 4443220 0 0111 Q ss_pred hcc---ccchhhhhhhhhhhhhH--hhhhhhhhhhhhccccccccchhhhh-hhhhhhhhceeeeec-ccc---CCCccc Q lcl|NC_011269. 470 VRV---PIYREIVEYDEETGQEY--IRKVPKLLIPEIKFSTLNLRDEAQER-AFIAQLKGMGVPVSD-KTL---AVNIDM 539 (867) Q Consensus 470 ~~~---~~~rd~~~~k~e~~k~~--~r~~~k~i~~~i~~~~~~Lr~e~~~~-~~v~qL~~~~~pitd-~t~---p~tiqm 539 (867) +++ .-.+..++..||+.-.. ++...+-.-..|.....|......+. ......+..+.++.. +.. ....+. T Consensus 406 ~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~~~~~~~~~~~ 485 (495) T protein:vir:10 406 NRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDMEELFDMISDANQLIDEYDLRLDSDPRYVNGSGAEQK 485 (495) T ss_pred hccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCcCCCccCCCC Confidence 222 12356677777764332 22222222222222222222210000 011122222222111 000 000000 Q ss_pred ccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 540 KFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 540 e~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) ..+. ... ..+ T Consensus 486 ~~~~-----------------------------~~~--~~e 495 (495) T protein:vir:10 486 SVME-----------------------------AAL--NNE 495 (495) T ss_pred CCCC-----------------------------CCC--CCC Confidence 0000 000 000 No 11 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=98.95 E-value=1.1e-10 Score=75.28 Aligned_cols=413 Identities=9% Similarity=0.060 Sum_probs=209.4 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhcc Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATH 123 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 123 (867) |. +.+.|..+++.-++++.. ......+++....+..-+++.... .+-..+.| .+ T Consensus 1 MG---~f~~lf~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~g~~~~~----~v~~~~al------------~~ 54 (422) T protein:vir:13 1 MG---FLRGLFNKKNNNDEKRSN-------YDEDIGIDISDSNFWEKFGIKLNF----SVRGKRAL------------KE 54 (422) T ss_pred Cc---hhhhhhhccCCccchhhh-------hhhccccccCcchhhhhccccCCc----ccchhhhh------------cc Confidence 21 222222333322222211 001122333333343333322111 11122111 24 Q ss_pred chHHHHHHhhhh-cccccceecccchhHHHHHHHHhhc----ccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceeh Q lcl|NC_011269. 124 DLVPLLIDIYSK-FPVVGMEFDSKDPLIKTFYEDLFFG----EDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSS 198 (867) Q Consensus 124 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (867) +.|-.||++.++ ..-..+++..+...+++-..+-.|- +..+-.+|+..++ ..++.-|.++-+...|.. |.+.+ T Consensus 55 ~~v~~ci~~ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gna~~~i~r~~~-G~~~~ 132 (422) T protein:vir:13 55 NTVYVCTKIRAESIGKLSLKIYKDKEEYKEHELYYLLRYKPNPLMSSINFWKCLE-TQRTLKGNAYAYIERDRK-GKIIG 132 (422) T ss_pred HHHHHHHHHHHHhhhhCceEEEecCcccccchHHHHHhhhcccCCCHHHHHHHHH-HHHhhcCCeEEEEEECCC-CcEEE Confidence 667778877653 1112223222223333322222221 3444558999877 889999999999988875 67999 Q ss_pred heecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCC Q lcl|NC_011269. 199 EEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDG 278 (867) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (867) +..|+|+.|.|. .-.+..+. .+..+.-+ -....|.. T Consensus 133 L~~i~~~~v~~~---~~~~~~~~------------------------------------~~~~~~y~-----~~~~~g~~ 168 (422) T protein:vir:13 133 LYPINSDNVTKI---IDDDNFLS------------------------------------SLSKVWYV-----VTDKNGKE 168 (422) T ss_pred EEEECCcceEEE---EcCCccee------------------------------------ccceEEEE-----EEeCCCeE Confidence 999999999875 11111111 00000000 11123444 Q ss_pred CcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHH Q lcl|NC_011269. 279 LDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEV 358 (867) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 358 (867) ..++..-|-|++...+.=.-.|.+.+..+..+|-...+.......+.+.-.+|--++++.. --+.+..+++ T Consensus 169 ~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---------~l~~e~~~~~ 239 (422) T protein:vir:13 169 HKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVG---------DLDEKAKKIF 239 (422) T ss_pred EEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC---------CCCHHHHHHH Confidence 5677778889987655445579999999999998877777777777777777888887753 2367888999 Q ss_pred HHHHHHhhhc-c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHH Q lcl|NC_011269. 359 RDDMQSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVT 434 (867) Q Consensus 359 ~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~ 434 (867) |+.|+..... + ...+|-..|++++.....-....+-.-.++..++|.+++||.-.++.+.++++|+++. ....|++ T Consensus 240 ~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~f~~ 319 (422) T protein:vir:13 240 KKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKDFYV 319 (422) T ss_pred HHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHH Confidence 9988776643 2 3456667788888777654444444555677889999999999999988889999854 3444555 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhh Q lcl|NC_011269. 435 QIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQE 514 (867) Q Consensus 435 ~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~ 514 (867) .-+.-+...|++.+.+.+=...|+ ..++.|+|-+..+...|..+.-+-+ +..+.. .-+..+.-+.. T Consensus 320 ~~l~P~~~~ie~~l~~~Ll~~~~~--~~g~~i~fd~~~l~r~d~~~~~~~~-~~~~~~-----------G~~T~NE~R~~ 385 (422) T protein:vir:13 320 TTLQSSLTVYEQEIQDKLFSQYET--LQDVKAEFNVDTILRSDIKTRYEAY-RIGIQG-----------GFIEANEARRR 385 (422) T ss_pred HHHHHHHHHHHHHHHHhhCChhhh--cCCceEEeechhhhcCCHHHHHHHH-HHHHhC-----------CCcCHHHHHHH Confidence 556666777777776655333332 2234455444444333333222211 111111 00111110100 Q ss_pred hhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhccccccc Q lcl|NC_011269. 515 RAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKK 565 (867) Q Consensus 515 ~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kk 565 (867) .+ +..++. +..+.-+..-..+++ +... .....+...+ T Consensus 386 ~g-l~p~~g-gD~~~~~~n~~~l~~-~~~~-----------~~~~g~~~g~ 422 (422) T protein:vir:13 386 EN-LPPVEG-GDRLLVNGNMIPIEM-AGEQ-----------YKKGGEKGGK 422 (422) T ss_pred hC-CCCCCC-cCeeeeccCccchhh-cccc-----------cccCCCcCCC Confidence 00 000000 000000000000000 0000 0000011111 No 12 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=98.94 E-value=1.3e-10 Score=74.74 Aligned_cols=423 Identities=10% Similarity=0.064 Sum_probs=205.7 Q ss_pred HHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHH Q lcl|NC_011269. 49 LIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPL 128 (867) Q Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 128 (867) |-+.|+.-+. |..+....+..+ +.-....+.-+++.-...++.-|.+. .-.++-|-. T Consensus 1 ~~~~~~~~~~----~~~~~~~~~~~~-----~~~~~~~~~~~~~g~~~~g~~v~~~~--------------al~~~~V~~ 57 (454) T protein:vir:93 1 MWNLLRRTRK----NQKSGRDVREAG-----WTSLFQAVAEPFAGAWQQGVKADPEA--------------VLSFHAVFA 57 (454) T ss_pred CCCccccCcc----cccccccccchh-----hhhhhhhhhhhhcchhhcCcccChHH--------------hhccHHHHH Confidence 3333222111 111100111111 00001111112211112222222111 112355667 Q ss_pred HHHhhhhcccccc--eec-ccchhH----HHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceeh Q lcl|NC_011269. 129 LIDIYSKFPVVGM--EFD-SKDPLI----KTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSS 198 (867) Q Consensus 129 ~~~~~~~~~~~~~--~~~-~~~~~~----~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (867) ||++-++ -|..+ ++- .+.+.. +.-..+.++ .+..+-.+|+..++ ..++.-|..+-+...|.. |.+.+ T Consensus 58 ~v~~Ia~-~iA~lp~~~~~~~~~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~-~~lll~Gna~~~i~r~~~-G~~~~ 134 (454) T protein:vir:93 58 CISLISQ-DIAKMRLRLMQTDAQGIRRETRRGDIARLCRRPNAQQNRIQFFELWL-NAKLRHGNTVVLKIRNAR-GQIKE 134 (454) T ss_pred HHHHHHH-hhccCceEEEEeccCCccchhhhHHHHHHHhcCCCCCCHHHHHHHHH-HHHhhcCceEEEEEECCC-CcEEE Confidence 8887643 22222 221 111122 222222333 23345568999866 888999999999888876 66889 Q ss_pred heecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCC Q lcl|NC_011269. 199 EEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDG 278 (867) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (867) +.+|+|+.|+|.. ..+-.+- |+-..+ .....+.. T Consensus 135 L~~i~~~~v~v~~---~~~g~~~-----------------------------------------y~~~~~--~~~~~~~~ 168 (454) T protein:vir:93 135 LRILDWNRVEPLV---ADDGEVF-----------------------------------------YRITPD--RNCGITEA 168 (454) T ss_pred EEEEcCcceEEEE---cCCCcEE-----------------------------------------EEEEec--ccccccee Confidence 9999999998851 1110000 111111 12233344 Q ss_pred CcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHH Q lcl|NC_011269. 279 LDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEV 358 (867) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 358 (867) +.++..-|-|+....+.-...|.+.+.-+.++|.......+....+.+.-..|--++++.+ ..++++.+.+ T Consensus 169 ~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---------~l~~e~~~~~ 239 (454) T protein:vir:93 169 VTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPG---------SITEENAKKL 239 (454) T ss_pred EEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC---------CCCHHHHHHH Confidence 5667777889987665555689999999999998888777777777777777776777652 2377889999 Q ss_pred HHHHHHhhhcch--hhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHH Q lcl|NC_011269. 359 RDDMQSLLAADF--RLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQ 435 (867) Q Consensus 359 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~ 435 (867) |+.++..+..++ ..+|---|++++.+...-+...+-.-.++..++|.+++||.-.+|..+++.+|+++. ....|+.. T Consensus 240 ~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~ 319 (454) T protein:vir:93 240 KSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQ 319 (454) T ss_pred HHHHHHHhcccccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHH Confidence 999998887764 356667788888877654433333444567789999999999999888889998854 44456777 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhh-----hhhhh--hhhhccccccc Q lcl|NC_011269. 436 IMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRK-----VPKLL--IPEIKFSTLNL 508 (867) Q Consensus 436 ~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~-----~~k~i--~~~i~~~~~~L 508 (867) -+.=+..+|++++.+.+-.- .+..++|-+..+...|..+.-+...+.-... +...+ .+.+.+..-.+ T Consensus 320 ~l~P~~~~ie~~ln~~L~~~------~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~ 393 (454) T protein:vir:93 320 CLQTLIESIELLLDEALETG------ENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALY 393 (454) T ss_pred HHHHHHHHHHHHHHHhhcCC------CCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeee Confidence 78888888888887766211 1223454444443333322222111111111 10000 11111111000 Q ss_pred cc-hhhhhhhhhhhhhceeeeecccc-CCCc---------ccccchhhhhhHHHHHHHHhhccccccc Q lcl|NC_011269. 509 RD-EAQERAFIAQLKGMGVPVSDKTL-AVNI---------DMKFDQELERQADETVQKLMATAQAMKK 565 (867) Q Consensus 509 r~-e~~~~~~v~qL~~~~~pitd~t~-p~ti---------qme~E~e~e~k~~E~l~tL~~taet~kk 565 (867) .. .-...+.+.+-.....+...... ..++ ....+.+.. ........+ ++| T Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~e~~~d-~~~~~~~~~------~~~ 454 (454) T protein:vir:93 394 LQQQNYSLEALSRRDAREDPFASSGKTASVPQAVAASDGNKAITETEHD-AVKAMFRGI------LKK 454 (454) T ss_pred eccCccchHhhhccCcccCCCCCCccCCCCCCCCCCCCCCCCccCCccc-hhhhhhhhh------hcC Confidence 00 00000000000000000000000 0000 000000000 000000000 000 No 13 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=98.89 E-value=1.3e-10 Score=74.70 Aligned_cols=388 Identities=10% Similarity=0.025 Sum_probs=188.4 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHH---hhCCCchhhhHHHHHHHHHH-------HHHhhccchHH Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLAD---KGIPFNVEDEEELRVIRHWC-------RLFYATHDLVP 127 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~ 127 (867) -||-+++ |+.+.-..+. .+++...+ .|- ..-+-.++.|- T Consensus 1 Mgl~~~~----------------------f~~~~~~~~~~~~~~~~~~~~---------~~~~~g~~v~~~~al~~~~v~ 49 (409) T protein:vir:84 1 MSLFTRI----------------------FSGPSEERTLTKISGIPSPAE---------DWAMHGDRPGANSAMTLGAFY 49 (409) T ss_pred Cchhhhh----------------------hcCCCcccccccccccccccc---------hhhccCcccchhhhhccHHHH Confidence 1111110 1111000000 00110000 010 01123356677 Q ss_pred HHHHhhhh-cccccceecccchhHHHHHH--HHhh----cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehhe Q lcl|NC_011269. 128 LLIDIYSK-FPVVGMEFDSKDPLIKTFYE--DLFF----GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEE 200 (867) Q Consensus 128 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 200 (867) .|||+.++ ..-..|+..-+++..+..-. +-.| .+..+-.+|+..++ ..++.-|+.+-+-......|...++. T Consensus 50 ~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~-~~l~l~Gn~~~~i~~~~~~g~~~~L~ 128 (409) T protein:vir:84 50 ACVTLLADTVASLSIDAYRKKDNVRIPVSPAPKLLESTPYPGLTWFDWLWMLM-ESLAVTGNAFGYISARDEANRPTAIM 128 (409) T ss_pred HHHHHHHHhhhhCceEEEEecCCcccccchHHHHhhccCCCCCCHHHHHHHHH-HHHhhcCCeEEEEEEECCCCceEEEE Confidence 88887753 22222333223333222111 1111 23455668999877 88899999998877777889999999 Q ss_pred ecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCc Q lcl|NC_011269. 201 ILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLD 280 (867) Q Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 280 (867) .|+|+.|.|. ...+.... .+......+|-. T Consensus 129 ~l~p~~v~v~---~~~~~~~~-----------------------------------------------~~~~~~~~~g~~ 158 (409) T protein:vir:84 129 PIHPDCIHVT---DAKDEDGD-----------------------------------------------WIEPVYRIDGKV 158 (409) T ss_pred EEcCceeEEE---EcCCCcce-----------------------------------------------EEEEEecCCceE Confidence 9999999886 22211110 001111233445 Q ss_pred ccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHH Q lcl|NC_011269. 281 ISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRD 360 (867) Q Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 360 (867) ++...|-|+++....--..|.+.+..+-++|-......+....+.+....|--++++.+ .-+.+.++++|+ T Consensus 159 ~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---------~l~~e~~~~~~~ 229 (409) T protein:vir:84 159 VPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDA---------DLTPDQVKQTQK 229 (409) T ss_pred EchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCC---------CCCHHHHHHHHH Confidence 66677889887655433579998888888887777777777777777778877777652 136678899999 Q ss_pred HHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehh---hhhHHHHHHHH Q lcl|NC_011269. 361 DMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASS---ALNREFVTQIM 437 (867) Q Consensus 361 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~---~~~~~~~~~~~ 437 (867) .+.+...-....+|-.-|++++.+...-+...+-+-.++..++|.+++||...++...++++|..+ +....|+..-+ T Consensus 230 ~~~~~~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l 309 (409) T protein:vir:84 230 QWIQSHHNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTL 309 (409) T ss_pred HHHHHhccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHH Confidence 888877555666666778888887654333333344457778999999999999976666666333 33344455555 Q ss_pred HHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhh Q lcl|NC_011269. 438 TGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAF 517 (867) Q Consensus 438 ~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~ 517 (867) +-+...|++++.+++. .++.|+|-+..+.--|..+.-+ ..+..+... -+..+.-+...+ T Consensus 310 ~P~~~~ie~~l~~~L~--------~g~~i~fd~~~l~~~d~~~~~~-~~~~~~~~G-----------~~t~NE~R~~~g- 368 (409) T protein:vir:84 310 LPWLRCIEQALDTFLP--------RGQFVKFNVDGLMRGDVTARFT-AYQMGLQNG-----------IWSVNEVRAWED- 368 (409) T ss_pred HHHHHHHHHHHHHhcc--------CCCeEEEechhhhccCHHHHHH-HHHHHHhCC-----------CcCHHHHHHHhC- Confidence 5566666666665541 1234444333332222211111 111111110 010000000000 Q ss_pred hhhhhhceeeeeccccCCCc-ccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccc Q lcl|NC_011269. 518 IAQLKGMGVPVSDKTLAVNI-DMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQ 583 (867) Q Consensus 518 v~qL~~~~~pitd~t~p~ti-qme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ 583 (867) +..+.. +.....+..-..+ +.+..+. .++. .++... +.-+ T Consensus 369 ~~p~~g-gD~~~~~~n~~~~~~~~~~~~------------------~~~~-----~~~~~~--~gn~ 409 (409) T protein:vir:84 369 APPIPE-GDIHLQPMNFVPLGYVPPEEP------------------AQEP-----QPNSAT--EGNK 409 (409) T ss_pred CCCCCC-cceeeecccccccccCCcccc------------------CcCC-----CCCCcc--CCCC Confidence 000000 0000000000000 0000000 0000 000000 0000 No 14 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=98.88 E-value=1.5e-10 Score=74.40 Aligned_cols=382 Identities=9% Similarity=0.009 Sum_probs=191.1 Q ss_pred hhhcchhHHHHHHHHhcccccccceee-----ccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHH Q lcl|NC_011269. 56 RRRAAEANRQRLASYRKQGNFGSNMQI-----AMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLI 130 (867) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 130 (867) |-.|+-+.+.+.. .+... ++.+. ..+.+...+.... +. .+.++. +-.++.|-.|| T Consensus 1 m~m~~f~~~~~~~--~~~~~--~~~~~~~~~~~~~~~~~~~~~~~--~~--~v~~~~------------al~~~~v~~~i 60 (392) T protein:vir:39 1 MILPILNFINQTN--DPPEV--GSVQSYFPDGNDAQIMESLLGDN--NE--WVSARA------------ALRNSDLFSII 60 (392) T ss_pred Ccchhhhhhhccc--ccccc--cccccccccCchhhhhhhhcCCC--Cc--eechHH------------hhccHHHHHHH Confidence 2222211111100 00000 00000 0000000000000 00 011110 12368889999 Q ss_pred HhhhhcccccceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehh Q lcl|NC_011269. 131 DIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVS 210 (867) Q Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (867) ++.++ .|..+.|...+...+...+. -.+.++-.+|+..++ ..++.-|+++-+.+-++. |...++..|+|+.|.|. T Consensus 61 ~~ia~-~ia~lp~~~~~~~~~~l~~~--PN~~~t~~~f~~~~~-~~lll~Gna~~~i~r~~~-g~~~~L~~l~~~~v~~~ 135 (392) T protein:vir:39 61 LQLSS-DLAIVKINAEKKKNQGIIDN--PSTNANKHGFWQSMF-AQLLLGGEAFAYRWRNAN-GADMKWEYLRPSQVNTY 135 (392) T ss_pred HHHHH-hhccCceeeccchhhhHhhc--CCCCCCHHHHHHHHH-HHhhhcCcEEEEEEECCC-CcEEEEEEEcCceeEEE Confidence 98876 33344555444443322221 234566688999877 888899999999888765 56889999999999877 Q ss_pred hhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhh Q lcl|NC_011269. 211 RSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVV 290 (867) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (867) +.-. |+.+ . |+ |- +.....+.-..++..-|-|++ T Consensus 136 ~~~~------------------------~~~~---------------~----y~-~~--~~~~~~~~~~~~~~~eiih~~ 169 (392) T protein:vir:39 136 YFEY------------------------ENGM---------------Y----YN-IT--FDDPKIEPILQAPQSDLIHMK 169 (392) T ss_pred EcCC------------------------CceE---------------E----EE-EE--ecCcccceeEEEccccEEEec Confidence 3210 0000 0 00 00 001111222345566688988 Q ss_pred hcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch Q lcl|NC_011269. 291 NRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF 370 (867) Q Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 370 (867) +....-...|.+.+.-+.++|-......+......++-..|--++++.+ + +..++++.+..++.++..-.+ . T Consensus 170 ~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~----~---~~~~~~~~~~~~~~~~~~~~~-g 241 (392) T protein:vir:39 170 LLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKG----G---GLLSDKDKASRSRSFMKRSRS-G 241 (392) T ss_pred CCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC----C---CCchHHHHHHHHHHHhccccC-C Confidence 7655444579999999999988877777777777788888877777752 2 234566677777766544333 3 Q ss_pred hhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011269. 371 RLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRR 450 (867) Q Consensus 371 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~ 450 (867) ..+|---|++++.++..-+...+-+-.+...++|.+++||...++. +.+.++++..-...|+..-+.-+..+|++.+.+ T Consensus 242 ~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg-~~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~ 320 (392) T protein:vir:39 242 GPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIG-GQGDQQSSIQQISGMYASALNRYLRPAISELEY 320 (392) T ss_pred CeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhC-CCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4455567889998887666666556677788999999999999995 666666655444444444444444444444444 Q ss_pred hhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhH-----hhhhhhhhhhhhccccccccchhhhhhhhhhhhhce Q lcl|NC_011269. 451 RCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEY-----IRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMG 525 (867) Q Consensus 451 ~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~-----~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~ 525 (867) .+ ..++.+-+..+.-.|...+++-+.++- .+.+...+..+.... ..|..+.+.+. T Consensus 321 ~L----------~~~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~----p~e~r~~e~l~------ 380 (392) T protein:vir:39 321 KL----------SDHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYI----PKDLPAPENTN------ 380 (392) T ss_pred hc----------cccccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCC----ccccchhcCCC------ Confidence 33 333332222222223333333222221 111111121111111 01111111010 Q ss_pred eeeeccccCCCcccccc Q lcl|NC_011269. 526 VPVSDKTLAVNIDMKFD 542 (867) Q Consensus 526 ~pitd~t~p~tiqme~E 542 (867) +..++....+.- T Consensus 381 -----~~~~Gd~~~p~p 392 (392) T protein:vir:39 381 -----KKTTGQSNEPVP 392 (392) T ss_pred -----CCCCCCCCCCCC Confidence 000111111110 No 15 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=98.88 E-value=1.5e-10 Score=74.40 Aligned_cols=382 Identities=9% Similarity=0.009 Sum_probs=191.1 Q ss_pred hhhcchhHHHHHHHHhcccccccceee-----ccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHH Q lcl|NC_011269. 56 RRRAAEANRQRLASYRKQGNFGSNMQI-----AMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLI 130 (867) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 130 (867) |-.|+-+.+.+.. .+... ++.+. ..+.+...+.... +. .+.++. +-.++.|-.|| T Consensus 1 m~m~~f~~~~~~~--~~~~~--~~~~~~~~~~~~~~~~~~~~~~~--~~--~v~~~~------------al~~~~v~~~i 60 (392) T protein:vir:10 1 MILPILNFINQTN--DPPEV--GSVQSYFPDGNDAQIMESLLGDN--NE--WVSARA------------ALRNSDLFSII 60 (392) T ss_pred Ccchhhhhhhccc--ccccc--cccccccccCchhhhhhhhcCCC--Cc--eechHH------------hhccHHHHHHH Confidence 2222211111100 00000 00000 0000000000000 00 011110 12368889999 Q ss_pred HhhhhcccccceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehh Q lcl|NC_011269. 131 DIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVS 210 (867) Q Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (867) ++.++ .|..+.|...+...+...+. -.+.++-.+|+..++ ..++.-|+++-+.+-++. |...++..|+|+.|.|. T Consensus 61 ~~ia~-~ia~lp~~~~~~~~~~l~~~--PN~~~t~~~f~~~~~-~~lll~Gna~~~i~r~~~-g~~~~L~~l~~~~v~~~ 135 (392) T protein:vir:10 61 LQLSS-DLAIVKINAEKKKNQGIIDN--PSTNANKHGFWQSMF-AQLLLGGEAFAYRWRNAN-GADMKWEYLRPSQVNTY 135 (392) T ss_pred HHHHH-hhccCceeeccchhhhHhhc--CCCCCCHHHHHHHHH-HHhhhcCcEEEEEEECCC-CcEEEEEEEcCceeEEE Confidence 98876 33344555444443322221 234566688999877 888899999999888765 56889999999999877 Q ss_pred hhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhh Q lcl|NC_011269. 211 RSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVV 290 (867) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (867) +.-. |+.+ . |+ |- +.....+.-..++..-|-|++ T Consensus 136 ~~~~------------------------~~~~---------------~----y~-~~--~~~~~~~~~~~~~~~eiih~~ 169 (392) T protein:vir:10 136 YFEY------------------------ENGM---------------Y----YN-IT--FDDPKIEPILQAPQSDLIHMK 169 (392) T ss_pred EcCC------------------------CceE---------------E----EE-EE--ecCcccceeEEEccccEEEec Confidence 3210 0000 0 00 00 001111222345566688988 Q ss_pred hcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch Q lcl|NC_011269. 291 NRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF 370 (867) Q Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 370 (867) +....-...|.+.+.-+.++|-......+......++-..|--++++.+ + +..++++.+..++.++..-.+ . T Consensus 170 ~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~----~---~~~~~~~~~~~~~~~~~~~~~-g 241 (392) T protein:vir:10 170 LLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKG----G---GLLSDKDKASRSRSFMKRSRS-G 241 (392) T ss_pred CCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC----C---CCchHHHHHHHHHHHhccccC-C Confidence 7655444579999999999988877777777777788888877777752 2 234566677777766544333 3 Q ss_pred hhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011269. 371 RLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRR 450 (867) Q Consensus 371 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~ 450 (867) ..+|---|++++.++..-+...+-+-.+...++|.+++||...++. +.+.++++..-...|+..-+.-+..+|++.+.+ T Consensus 242 ~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg-~~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~ 320 (392) T protein:vir:10 242 GPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIG-GQGDQQSSIQQISGMYASALNRYLRPAISELEY 320 (392) T ss_pred CeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhC-CCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4455567889998887666666556677788999999999999995 666666655444444444444444444444444 Q ss_pred hhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhH-----hhhhhhhhhhhhccccccccchhhhhhhhhhhhhce Q lcl|NC_011269. 451 RCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEY-----IRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMG 525 (867) Q Consensus 451 ~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~-----~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~ 525 (867) .+ ..++.+-+..+.-.|...+++-+.++- .+.+...+..+.... ..|..+.+.+. T Consensus 321 ~L----------~~~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~----p~e~r~~e~l~------ 380 (392) T protein:vir:10 321 KL----------SDHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYI----PKDLPAPENTN------ 380 (392) T ss_pred hc----------cccccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCC----ccccchhcCCC------ Confidence 33 333332222222223333333222221 111111121111111 01111111010 Q ss_pred eeeeccccCCCcccccc Q lcl|NC_011269. 526 VPVSDKTLAVNIDMKFD 542 (867) Q Consensus 526 ~pitd~t~p~tiqme~E 542 (867) +..++....+.- T Consensus 381 -----~~~~Gd~~~p~p 392 (392) T protein:vir:10 381 -----KKTTGQSNEPVP 392 (392) T ss_pred -----CCCCCCCCCCCC Confidence 000111111110 No 16 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=98.87 E-value=4.7e-10 Score=71.74 Aligned_cols=468 Identities=13% Similarity=0.082 Sum_probs=212.7 Q ss_pred CCchHHHHHHHHhhhcchhHHHHH--HHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhh Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRL--ASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYA 121 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 121 (867) |.++.. .++.....+...-+-+. ..|..-+..+.-++--+| -...+|. +-+..++.+++=||..|+ T Consensus 1 m~~~~~-r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~-----~~~s~~~------~~~~~~~~lr~RaRdL~r 68 (553) T protein:vir:63 1 MTKVTV-RKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNP-----SLRSPDA------LINPLKRIADARGRDMAD 68 (553) T ss_pred Ccchhh-hhhcccccccchhhhhhhcccccccccCCCccccccc-----CCCChHH------HHHHHHHHHHHHHHHHHh Confidence 222221 11122222222111111 012111111111222222 2233333 333456677888999999 Q ss_pred ccchHHHHHHhhhhccccc-ceeccc-------------ch----hHHHHHHHHh--------hcccccHHHHhHHHHHH Q lcl|NC_011269. 122 THDLVPLLIDIYSKFPVVG-MEFDSK-------------DP----LIKTFYEDLF--------FGEDLNYLEFLPDQFAR 175 (867) Q Consensus 122 ~~~~~~~~~~~~~~~~~~~-~~~~~~-------------~~----~~~~~~~~~~--------~~~~~~~~~~~~~~~~~ 175 (867) .++++...|+.+...-|+. |...++ |+ .|+..|.+-+ +--+||+..+..= .-| T Consensus 69 Nn~~a~~av~~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l-~~r 147 (553) T protein:vir:63 69 NDGFTNGAVGYQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRL-GVV 147 (553) T ss_pred cChHHHHHHHHHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHH-HHH Confidence 9999999999999999988 777654 22 2333333221 2234566665544 559 Q ss_pred HHhhhhhhcchhhhhhhc-cce-ehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhh Q lcl|NC_011269. 176 EYFTVGEVTSLAHFNESL-GVW-SSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSERE 253 (867) Q Consensus 176 ~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 253 (867) .++.-||+|-..++.+.. +.| -.+.+|+||+|......-... .+++| +|-...- T Consensus 148 ~~~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~------------~i~~G----------VE~d~~G-- 203 (553) T protein:vir:63 148 GYVKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQQLDTP------------TLRRG----------VQYDKRG-- 203 (553) T ss_pred HHHhCCceEEEeeeccCCCCcccceEEEechhhcCCCCCCCCCC------------eeEee----------eEECCCC-- Confidence 999999998877765543 334 367889999998763221111 13344 2221111 Q ss_pred hhhhhHHHHHHhchH-HHhhh-------ccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011269. 254 QRMREFQDLQRRYPE-IIQAA-------MQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA 325 (867) Q Consensus 254 ~~~~~~~~~~~~~~~-~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (867) |---|- |++.+|- ..... +.....+++-+.|.||...--+=.+||.|.|..+...|-.+++|..|+- ++ T Consensus 204 -r~vaY~-i~~~hPgd~~~~~~~~~~~~r~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL-~~ 280 (553) T protein:vir:63 204 -RPQGYW-IQVAHPGDLYQMAPDMYKWKFVQQSKPWGRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSL-QN 280 (553) T ss_pred -ceEEEE-eeccCCCccccccccccceeeeccccccChhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHH-HH Confidence 122232 5566665 11111 1123457889999999988888889999999999999999999998874 33 Q ss_pred hhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHH------------HHHhhhcc----------hhhhhh-hhheeee Q lcl|NC_011269. 326 DRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDD------------MQSLLAAD----------FRLMVH-NFGLKVE 382 (867) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~----------~~~~~~-~~~~~~~ 382 (867) .++.+=|-+|---. +|++...+.+.++ ....+..+ -=+++| .-|-+++ T Consensus 281 a~i~A~~a~fi~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~ 351 (553) T protein:vir:63 281 AVINASYAAAIESE---------LPPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLN 351 (553) T ss_pred HHHhhhheeeeecC---------CChhhhhhhcccccccccccccccccccccccccccccceeecCceeeecCCCCeee Confidence 33333333221100 1111111111110 00001111 011222 1233444 Q ss_pred eccccCccCchhHHHHHHHHHHHHhhccchhhhcCCC-ccceehhhhhHHHHHHHHHHHHHHHHHHHhh-hh-HHHHH-- Q lcl|NC_011269. 383 NVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGT-GGAYASSALNREFVTQIMTGFQNALKRHIRR-RC-EVVAE-- 457 (867) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~-~~-~~i~e-- 457 (867) .+-..-.--+.++=++.+-+.|-.+|||.-.++||-- +.+|+++-.++--.-+.+...|.+|..+.-+ +- +|+.| T Consensus 352 ~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~ 431 (553) T protein:vir:63 352 LKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAI 431 (553) T ss_pred ecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4333333345556677788888899999999999442 5689999988855555566666666443222 22 23332 Q ss_pred hhcccc------h------hee-hhhcc---ccchhhhhhhhhhhhhH--hhhhhhhhhhhhccccccccchhhhhh-hh Q lcl|NC_011269. 458 AQGHYD------Y------DLK-GGVRV---PIYREIVEYDEETGQEY--IRKVPKLLIPEIKFSTLNLRDEAQERA-FI 518 (867) Q Consensus 458 ~q~~~d------~------~~~-~~~~~---~~~rd~~~~k~e~~k~~--~r~~~k~i~~~i~~~~~~Lr~e~~~~~-~v 518 (867) +.|..+ . ... -.+++ .-.|..++..||+.-.. ++...+-.-..|.....|......+.. .. T Consensus 432 l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~ 511 (553) T protein:vir:63 432 AAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLGGDFRKSFAQRARED 511 (553) T ss_pred HcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHH Confidence 333221 0 000 00111 12255567776663322 222211111111111122111100000 01 Q ss_pred hhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccccccCCCCC Q lcl|NC_011269. 519 AQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLALRQGK 595 (867) Q Consensus 519 ~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~a~gp 595 (867) ..++..+.+.... . ...... +.....+.+..+........+. T Consensus 512 ~~~~~~Gl~~~~~-------~-----------------------~~~~~~-----~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 512 ALLKKYGLTFNLS-------A-----------------------KRSLGD-----GRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred HHHHHcCCCCCCC-------C-----------------------ccccCC-----CcccCCCCCCCCCCCCcccccC Confidence 1111111111000 0 000000 0000000000000000000000 No 17 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=98.86 E-value=1e-10 Score=75.41 Aligned_cols=271 Identities=10% Similarity=0.091 Sum_probs=169.4 Q ss_pred cccceecc--cchhHHHHHHHHh---hcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhh Q lcl|NC_011269. 138 VVGMEFDS--KDPLIKTFYEDLF---FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRS 212 (867) Q Consensus 138 ~~~~~~~~--~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (867) |..+.|.. +++..+.=..+++ -.+.++-.+|+...+ ..++.-|+.+-+..-+.. |...++..|+|+.|+|... T Consensus 1 ia~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~ll~~Gna~~~i~r~~~-G~~~~l~~l~~~~v~v~~~ 78 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIE-TIRNEKGNAYVLIERDIY-HQPSKLFLLNPDVVEMLIE 78 (278) T ss_pred CccceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHH-HHHhhcCCEEEEEEECCC-CcEEEEEEECCceeEEEEc Confidence 44443322 1111111111111 123455668998866 899999999888777755 4578999999999987621 Q ss_pred hhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhc Q lcl|NC_011269. 213 MFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNR 292 (867) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (867) .+ |.. ++.+| ....|..+.++..-|-|+++. T Consensus 79 ---~~---------------------~~~--------------------~~y~~-----~~~~g~~~~~~~~evih~~~~ 109 (278) T protein:vir:78 79 ---NQ---------------------SRE--------------------LYYSI-----HAATGNKLIVHNMDMLHFKHI 109 (278) T ss_pred ---CC---------------------Cce--------------------EEEEE-----EcCCceEEEEccccEEEECCC Confidence 10 000 01111 122344456777778999876 Q ss_pred CccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhh-chhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchh Q lcl|NC_011269. 293 PTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLY-SPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFR 371 (867) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 371 (867) .+.-...|.+.+..+.+++....+ |++.-+.+.. .|--+.+.++ .=+.++.+.+|+.|+..+...-. T Consensus 110 ~~~~~~~G~s~~~~~~~~i~~~~~---~~~~~~~~~~~~~~~i~~~~~---------~l~~e~~~~~~~~~~~~~~~~g~ 177 (278) T protein:vir:78 110 VASNMVQGISPIDVLKNTTDFDNA---VRTFNLTEMQKPDSFMLKYGS---------NVGKEKRQQVLEDFKQYYEENGG 177 (278) T ss_pred CCCCCeeeccHHHHHHHHHHHHHH---HHHHHHHHhcCCCcEEEEeCC---------CCCHHHHHHHHHHHHHHhccCCC Confidence 665556799998888887765333 3332222222 3333333331 12568899999999988876666 Q ss_pred hhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHH-HHHHHHHHHHHHHHHHHhh Q lcl|NC_011269. 372 LMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNRE-FVTQIMTGFQNALKRHIRR 450 (867) Q Consensus 372 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~~r~ 450 (867) .+|---|++++.++..-+...+..-.+...++|..++||...++.+.++++|+++.--+. |...-+--+...|++.+.+ T Consensus 178 ~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~~~~~~~l~P~~~~i~~~ln~ 257 (278) T protein:vir:78 178 ILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNR 257 (278) T ss_pred ceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 777788899998877665555555567788999999999999998888899999765443 4444466677777777776 Q ss_pred hhHHHHHhhcccchheehhhccc Q lcl|NC_011269. 451 RCEVVAEAQGHYDYDLKGGVRVP 473 (867) Q Consensus 451 ~~~~i~e~q~~~d~~~~~~~~~~ 473 (867) .+-.-.|+. ..+.|+|-+..| T Consensus 258 ~L~~~~e~~--~g~~~~f~~~~l 278 (278) T protein:vir:78 258 KLLTKTDRE--KIGILNLTLNLI 278 (278) T ss_pred hcCChhHhc--CCceEEEecccC Confidence 663334432 122355555555 No 18 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=98.82 E-value=1.7e-10 Score=74.13 Aligned_cols=375 Identities=12% Similarity=0.071 Sum_probs=192.5 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhh-CC-----CchhhhHHHHHHHHHHHHHhhccchHHHHHH Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKG-IP-----FNVEDEEELRVIRHWCRLFYATHDLVPLLID 131 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 131 (867) -|+-+.+.+ ++.+ -++.- .......+-+ +. ..+.++. + -+++-|-.||+ T Consensus 1 M~~f~~~~~----~~~~-----~~~~~----~~~~~~~~~~~~~~~~~~~~v~~~~------a------l~~~~v~~~i~ 55 (386) T protein:vir:49 1 MPIFNITNL----ATES-----PPINQ----ESFFDIADSDFLASLNSSEWVSAEN------A------LKNSDLFSIIS 55 (386) T ss_pred Cchhhhhcc----CCCC-----cccch----hhhhhhhhccccccccCCceechhh------h------hccHHHHHHHH Confidence 111111000 0000 00000 0000000000 00 0111111 1 24677888999 Q ss_pred hhhhcccccceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhh Q lcl|NC_011269. 132 IYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSR 211 (867) Q Consensus 132 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (867) +.++ -|..+.|...+...+...+. =.+.++-.+|+..++ ..++.-|+++-+...++. |...++..|+|+.|+|.+ T Consensus 56 ~ia~-~ia~~p~~~~~~~~~~l~~~--PN~~~t~~~f~~~~~-~~lll~Gna~~~i~r~~~-g~~~~l~~i~~~~v~v~~ 130 (386) T protein:vir:49 56 QLSN-DLATAKITTSRKQLQGIVDN--PSNNANRFNFYQSIF-AQMLLGGEAFAYRWRNDN-GRDMKWEYLRPSQVSFNR 130 (386) T ss_pred HHHH-HhhhCceeeccchhhhhhhc--cCCCCCHHHHHHHHH-HHhhhcCCEEEEEEECCC-CcEEEEEEecCceeEEEE Confidence 8887 55555555555443322111 124566788999977 889999999999888765 566789999999998873 Q ss_pred hhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhh Q lcl|NC_011269. 212 SMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVN 291 (867) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 291 (867) .- + ++. +..+|- +.-...+....++..-|-|+.+ T Consensus 131 ~~---~---------------------~~~--------------------~~y~~~--~~~~~~~~~~~~~~~evih~~~ 164 (386) T protein:vir:49 131 LD---N---------------------QNG--------------------LYYNIT--FDDPHIAPKQHVPQNDILHFRL 164 (386) T ss_pred cC---C---------------------Cce--------------------EEEEEE--EcCccccceeEEccccEEEecC Confidence 11 0 000 000010 0111223334566667889987 Q ss_pred cCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchh Q lcl|NC_011269. 292 RPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFR 371 (867) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 371 (867) ..+.=.-.|.|.+..+.++|.......+......+.-..|--++++.+ .. . .++...+++.++..-.-... T Consensus 165 ~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~----~~---~--~~~~~~~~~~~~~~~~n~g~ 235 (386) T protein:vir:49 165 LSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKG----GG---L--LDFKTKVSRSRQAMKQMQGG 235 (386) T ss_pred CCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCC----CC---C--hHHHHHHHHHHHHhccCCCC Confidence 655444589999999999998888777777788888888888888863 11 1 23345556656555444456 Q ss_pred hhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011269. 372 LMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRR 451 (867) Q Consensus 372 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~ 451 (867) .+|-.-|++++..+..-+...+-+-.++..++|.+++||...++. +++.+|+++.... +.| +..|+.+|+.. T Consensus 236 ~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg-~~~~~~~~~~~~~----~~~---~~~i~~~l~~i 307 (386) T protein:vir:49 236 PLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVG-GDGDQQSSLEMIY----NIY---FKSVSRYLRPF 307 (386) T ss_pred ceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhC-CCCCccchHHHHH----HHH---HHHHHHHHHHH Confidence 677778899988876655554445567788899999999999996 7888888765333 222 22233333322 Q ss_pred hHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccc-----cccchhhhhhhhhhhhhcee Q lcl|NC_011269. 452 CEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTL-----NLRDEAQERAFIAQLKGMGV 526 (867) Q Consensus 452 ~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~-----~Lr~e~~~~~~v~qL~~~~~ 526 (867) . .+++-.+-+++.+-+..+.-.|..++++.+.++-.. . +...-+..++ .+-.+.+... . T Consensus 308 ~---~~~~~~l~~~~~~~~~~~~~~d~~~~~~~~~~l~~~-g---~~t~nE~r~~l~~~~~~~~~~~~~~---------~ 371 (386) T protein:vir:49 308 V---SEMSKKLSCEVDVDISPAVDPTGSNYISLINSMVKS-G---TLAQNQGLYILQQAEILPKELPDGK---------N 371 (386) T ss_pred H---HHHHHHhcchhcccchhhhccCHHHHHHHHHHHHhC-C---CcCHHHHHHHHhhCCCCCCcCcchh---------c Confidence 2 334443334444433333333444444433322111 0 1111111100 0000100000 0 Q ss_pred eeeccccCCCcccccc Q lcl|NC_011269. 527 PVSDKTLAVNIDMKFD 542 (867) Q Consensus 527 pitd~t~p~tiqme~E 542 (867) ....+..+++... .+ T Consensus 372 ~~~~~~~gGd~~~-~~ 386 (386) T protein:vir:49 372 PNRTSLKGGEINE-QD 386 (386) T ss_pred cCCCCCCCCCCCC-CC Confidence 0011112222211 11 No 19 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=98.78 E-value=5.3e-10 Score=71.45 Aligned_cols=397 Identities=11% Similarity=0.023 Sum_probs=199.5 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcc Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFP 137 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 137 (867) -|+-+++.++ .++.. .-..+..|.+.+-+ +++. +.. .-.-.++-|-.|||+-++ . T Consensus 1 MG~~~~~~~~--~~~~~---~~~~~~~~~~~~~~------g~~~-~~~------------~~al~~~~V~~~v~~Ia~-~ 55 (411) T protein:vir:81 1 MGWWSRLTRF--FRPRN---ETVDMTNPLLLQWL------GVDP-DTP------------RNQLSEATYFACLKILSE-S 55 (411) T ss_pred CchHHHHHhh--ccCcc---cccccchHHHHHHh------cCcc-cCh------------hhhhccHHHHHHHHHHHH-h Confidence 3333333221 11111 11111122221111 1111 111 112235567778877654 2 Q ss_pred cccceecc----cchhHHH---HHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCccee Q lcl|NC_011269. 138 VVGMEFDS----KDPLIKT---FYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDML 207 (867) Q Consensus 138 ~~~~~~~~----~~~~~~~---~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 207 (867) |..+.|.. +|-.++. -..+++- .+..+-.+|+..++ ..++.-|+.+-+.+-| +|...++..|+|+.| T Consensus 56 iA~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~-~~lll~Gna~~~i~r~--~g~~~~l~~l~~~~v 132 (411) T protein:vir:81 56 LGKLPLKMYQKTERGIVKSDREELYNLLKLRPNPYMTSSVFWSTVE-MNRNHYGNAYVWCQYS--GPQLQALWILPSQYV 132 (411) T ss_pred HhhCceeEEEecCCceeeecccHHHHHHhhccCCCCCHHHHHHHHH-HHHhhcCCeEEEEEec--CCceEEEEEECCceE Confidence 22222211 1111111 1111111 13455678999977 7888899988888766 578889999999999 Q ss_pred ehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHH Q lcl|NC_011269. 208 RVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALIS 287 (867) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (867) .|. .-.+.++. +...++.+| .....|....++..-|- T Consensus 133 ~~~---~~~~~~~~------------------------------------~~~~~~~~~----~~~~~g~~~~~~~~eii 169 (411) T protein:vir:81 133 TIV---VDDRGLLG------------------------------------EKNAIWYRY----NDPYDGKMYVFRNDEIL 169 (411) T ss_pred EEE---EcCccccc------------------------------------ccceEEEEE----EecCCceEEEEccccEE Confidence 886 11111111 000001111 12224566678888899 Q ss_pred HhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhh Q lcl|NC_011269. 288 RVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLA 367 (867) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (867) |+++..+.=--.|.+.+..+-.+|-......++...+.++-..|--++++.. .-+.+..+++|+.|+..+. T Consensus 170 h~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---------~l~~e~~~~~~~~~~~~~~ 240 (411) T protein:vir:81 170 HFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTG---------DLNQEARDRLVKGFEQFAN 240 (411) T ss_pred EEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC---------CCCHHHHHHHHHHHHHHhc Confidence 9986554434579999999999999888888888888888888877777652 2367888999998877664 Q ss_pred c-c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehh-hhhHHHHHHHHHHHHHH Q lcl|NC_011269. 368 A-D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASS-ALNREFVTQIMTGFQNA 443 (867) Q Consensus 368 ~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~~~~~~ 443 (867) . + -..+|-.-|++++.++..-....+-.-.++.+++|.+++||...++...++++|+++ +....|+..-++-+... T Consensus 241 g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ 320 (411) T protein:vir:81 241 GSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNLAFYVDTLLYVLKQ 320 (411) T ss_pred CccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHHHHHHHHHHHHHHH Confidence 3 3 245666778888777643332222234467789999999999999987888999987 34445666667777777 Q ss_pred HHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhh Q lcl|NC_011269. 444 LKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKG 523 (867) Q Consensus 444 l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~ 523 (867) |++.+.+.+-.-.|+. .++.|+|-+..+.-. +.+.+. + ...+.. .-.-+..+.-+...+ +. T Consensus 321 ie~~l~~~ll~~~~~~--~~~~~~fd~~~ll~~---d~~~~~-~-~~~~~~-------~~g~~t~NE~R~~~g-l~---- 381 (411) T protein:vir:81 321 YEEEITYKILSNDLIS--QGHYFKFNVNVILRA---DIKTQM-D-SLSTAV-------QNGIMTPNEARDYLD-MP---- 381 (411) T ss_pred HHHHHHhhcCChhhcC--CCcEEEeechhhhcc---CHHHHH-H-HHHHHH-------hCCCcCHHHHHHHhC-CC---- Confidence 7777776553223322 122233322222111 222211 1 111110 001011111111000 00 Q ss_pred ceeeeeccccCCCcccccchhhhhhHHHHHHHHhhccccccccccc Q lcl|NC_011269. 524 MGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDL 569 (867) Q Consensus 524 ~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~ 569 (867) |+- .+....+... ...+.. +... ..+-.+. T Consensus 382 ---p~~---ggD~~~~~~n---~~pl~~----~~~~---~~kgGd~ 411 (411) T protein:vir:81 382 ---ADD---YGNNLMANGN---YIPLSM----LGAN---YGKGGDS 411 (411) T ss_pred ---CCC---CCCeeeeccC---ccchhh----hhhh---hccCCCC Confidence 000 0111111100 000000 0000 0011111 No 20 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=98.75 E-value=8.5e-10 Score=70.30 Aligned_cols=378 Identities=9% Similarity=-0.005 Sum_probs=188.4 Q ss_pred hhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHH---------HHhhccchH Q lcl|NC_011269. 56 RRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCR---------LFYATHDLV 126 (867) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~ 126 (867) |..|+-+.+.+ .++.- -.++.+...... .+. .+..|.. .=+-+++-| T Consensus 1 m~m~~~~~~~~---~~~~~-~~~~~~~~~~~~----------------~~~----~~~~~~~~~~g~~v~~~~al~~~~v 56 (392) T protein:vir:74 1 MILPILNFINQ---TNDPP-EAGSVQSYFPDG----------------NDA----QIMESLLGDNNEWVSARAALRNSDL 56 (392) T ss_pred Ccchhhhhhhc---ccCcc-cccccccccccC----------------chh----hhhhhccCCCCcccchhhhhcchHH Confidence 44444322111 11100 001111100000 001 1111100 001245778 Q ss_pred HHHHHhhhhcccccceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcce Q lcl|NC_011269. 127 PLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDM 206 (867) Q Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (867) -.|||+.++- |..+.|...+...+.+.+. -.+.++-.+|+..++ ..++.-|+++-+.+.+.. |...++..|+|+. T Consensus 57 ~~~v~~ia~~-ia~lp~~~~~~~~~~l~~~--PN~~~t~~~f~~~~~-~~lll~Gna~~~i~r~~~-G~~~~L~~i~~~~ 131 (392) T protein:vir:74 57 FSIILQLSSD-LAIVKINAEKKKNQGIIDN--PSTNANKHGFWQSMF-AQLLLGGEAFAYRWRNAN-GADMKWEYLRPSQ 131 (392) T ss_pred HHHHHHHHHh-hccCceeeccchhhhhhhh--cCCCCCHHHHHHHHH-HHhhhcCCEEEEEEECCC-CcEEEEEEEcCce Confidence 8899987651 2233333333222222221 123456688999877 888999999999888864 6688999999999 Q ss_pred eehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHH Q lcl|NC_011269. 207 LRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALI 286 (867) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 286 (867) |.|...-.- +.+ . |+ |-. .....+.-..++.+-| T Consensus 132 v~v~~~~~~------------------------~~~---------------~----y~-~~~--~~~~~~~~~~~~~~ev 165 (392) T protein:vir:74 132 VNTYYFEYE------------------------NGM---------------Y----YN-ITF--DDPKIEPILQAPQSDL 165 (392) T ss_pred eEEEEcCCC------------------------ceE---------------E----EE-EEe--cCCccceeEEEcCccE Confidence 987731110 000 0 00 000 0001111234555668 Q ss_pred HHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhh Q lcl|NC_011269. 287 SRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLL 366 (867) Q Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (867) -|+.+-...-.-.|.+.+.-+.++|-......+......+.-..|--++++.. + ...++++.++.++.++..- T Consensus 166 ih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~----~---~~~~~~~~~~~~~~~~~~~ 238 (392) T protein:vir:74 166 IHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKG----G---GLLSDKDKASRSRSFMKRS 238 (392) T ss_pred EEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC----C---CCchHHHHHHHHHHHhccc Confidence 88876544433569999998888887777777767777777777887888752 1 2345667777777666443 Q ss_pred hcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_011269. 367 AADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKR 446 (867) Q Consensus 367 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 446 (867) -+ ...+|-.-|++++.++..-+...+-+-.+...++|.+++||...+|. +.+.++++.+-...|+..-+.-+...|++ T Consensus 239 n~-g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg-~~~~~~~~~e~~~~~~~~~l~p~~~~ie~ 316 (392) T protein:vir:74 239 RS-GGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIG-GQGDQQSSIQQISGMYASALNRYLRPAIS 316 (392) T ss_pred cC-CCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhC-CCCCcccHHHHHHHHHHHHHHHHHHHHHH Confidence 33 34456567899998886655555555567788899999999999995 67777766555554544444444444444 Q ss_pred HHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhH-----hhhhhhhhhhhhccccccccchhhhhhhhhhh Q lcl|NC_011269. 447 HIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEY-----IRKVPKLLIPEIKFSTLNLRDEAQERAFIAQL 521 (867) Q Consensus 447 ~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~-----~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL 521 (867) .+.+.+ ..++++-...+.-.|...+++.+.++- .+.+...+..+.. +.. .|..+.+.+.. T Consensus 317 ~l~~~l----------~~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g---~~p-ne~r~~enl~~- 381 (392) T protein:vir:74 317 ELEYKL----------SDHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAG---YIP-KDLPAPENTNK- 381 (392) T ss_pred HHHHhc----------cchhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCC---CCc-cccchhcCCCC- Confidence 444433 333332222222222222222222211 0111111111111 000 11111111110 Q ss_pred hhceeeeeccccCCCcccccc Q lcl|NC_011269. 522 KGMGVPVSDKTLAVNIDMKFD 542 (867) Q Consensus 522 ~~~~~pitd~t~p~tiqme~E 542 (867) ..+++.+.+.- T Consensus 382 ----------~~~Gd~~~p~p 392 (392) T protein:vir:74 382 ----------KTTGQSNEPVP 392 (392) T ss_pred ----------CCCCCCCCCCC Confidence 00111111110 No 21 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.73 E-value=2.9e-10 Score=72.84 Aligned_cols=389 Identities=10% Similarity=0.049 Sum_probs=188.9 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcc Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFP 137 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 137 (867) -+|-+. -++.. +-.++.-+-+-.-+++. ..+.. +-.+ -+-+++-|-.||++.++ - T Consensus 1 M~~f~~------~~~~~---~~~~~~~~~~~~~~~~~-~~~~~--v~~~------------~al~~~~V~~~v~~ia~-~ 55 (397) T protein:vir:38 1 MPLLKL------NKSHS---QGFSLNDPDWVNFLTGG-EAQKY--VSAD------------TALKNSDIFSLIMQLSG-D 55 (397) T ss_pred Ccchhh------hhccc---CcccCCchhhhhhhcCC-cCCce--echH------------HhhccHHHHHHHHHHHH-H Confidence 111100 00000 00011111111111110 00111 1111 11236778889988763 4 Q ss_pred cccceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcc Q lcl|NC_011269. 138 VVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQR 217 (867) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (867) |..+.|..+|+.+..+... -.+.++-.+|+..++ ..++.-|+++-+.+.++. |.+.++..|+|+.|+|.. ..+ T Consensus 56 ia~~p~~~~~~~~~~l~~~--PN~~~s~~~f~~~~~-~~lll~Gna~~~i~r~~~-g~~~~l~~l~~~~v~i~~---~~~ 128 (397) T protein:vir:38 56 LAMVRYTSESDRSQSIISN--PSVTANGYSFWQGMF-AQLLLDGNCYAYRHKNTN-GVDLSWEYLRPSQVQPML---LQD 128 (397) T ss_pred HhhCcccccccHHHHHHhc--CCCCCCHHHHHHHHH-HHhhhcCCEEEEEEECCC-CcEEEEEEEcCceeEEEE---cCC Confidence 5567777888887655443 235667789999977 899999999988888865 467899999999998761 111 Q ss_pred hHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCcccc Q lcl|NC_011269. 218 ERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWA 297 (867) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (867) |+.+ .| +|- +.....+....++..-|.|+......-- T Consensus 129 ---------------------~~~~---------------~y-----~~~--~~~~~~~~~~~~~~~eiih~~~~~~~~~ 165 (397) T protein:vir:38 129 ---------------------GSGL---------------IY-----NIN--FDEPAIGYMENVPAADVIHIRLLSKNGG 165 (397) T ss_pred ---------------------CceE---------------EE-----EEE--eccccccceeEecCccEEEecCCCCCCc Confidence 1000 00 000 0111222334566667889987665544 Q ss_pred ccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcc--hhhhhh Q lcl|NC_011269. 298 TRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAAD--FRLMVH 375 (867) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 375 (867) .+|.+.+..+.++|.......++.....++-..|--++++-. . + +.++.+.+++.++.....+ ...+|- T Consensus 166 ~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~----~---~--~~e~~~~~~~~~~~~~~~~n~~~~~vl 236 (397) T protein:vir:38 166 KTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQK----G---G--LLDAETRIARSKEISKQIHNSDGPVVI 236 (397) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC----C---C--CHHHHHHHHHHHHHHhcccccCCceec Confidence 589999999999998877777777777888888887887752 1 1 2345566777665555544 345666 Q ss_pred hhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHH Q lcl|NC_011269. 376 NFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVV 455 (867) Q Consensus 376 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i 455 (867) .-|++++.++..-....+.+-.++.+++|.+++||..+++. |...+|++..=..-+..+-+.-+...|++.+.+++ T Consensus 237 ~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg-~~~~~~~~~e~~~~~~~~~l~P~~~~ie~~ln~~l--- 312 (397) T protein:vir:38 237 DALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLN-GQGDQQSSITQISGQYAKSLNRYVQAIVGELNDKL--- 312 (397) T ss_pred CCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhC-CCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhc--- Confidence 78999999987777776667788999999999999999998 44455543321111122222223333333332222 Q ss_pred HHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhh-h--hhhhcc--c-cccccchhhhhhhhhhhhhceeeee Q lcl|NC_011269. 456 AEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKL-L--IPEIKF--S-TLNLRDEAQERAFIAQLKGMGVPVS 529 (867) Q Consensus 456 ~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~-i--~~~i~~--~-~~~Lr~e~~~~~~v~qL~~~~~pit 529 (867) -...+.++.+.++. | .+.+ ++ ...+..+. | ..++.. . .....++........ ....... T Consensus 313 ---~~~~~~~~~~~~~~----d---~~~~-~~-~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~---~~~~~~~ 377 (397) T protein:vir:38 313 ---HANISANIRFAIDA----M---GDQY-AS-TISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKEP---QQAIQLI 377 (397) T ss_pred ---cChhcccccccccC----C---HHHH-HH-HHHHHHhCCCcCHHHHHHHhCCCCCCCCccccccccc---ccccccc Confidence 11112222222221 1 1111 11 11111000 0 111110 0 000111100000000 0000000 Q ss_pred ccccCCCcccccchhhhhhHHHHHHHHhhccccccccccccc Q lcl|NC_011269. 530 DKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCD 571 (867) Q Consensus 530 d~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p 571 (867) ....+.. +-..+ .+... .++ T Consensus 378 ~~~~g~~---~~~~~---------------~e~~~----~~~ 397 (397) T protein:vir:38 378 QQEGGEN---DGNNS---------------DERGS----DPE 397 (397) T ss_pred ccccCCC---CCCCC---------------CCCCC----CCC Confidence 0000000 00000 00000 000 No 22 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.72 E-value=3.5e-10 Score=72.46 Aligned_cols=413 Identities=12% Similarity=0.048 Sum_probs=194.8 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHH---hhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhh Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLAD---KGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYS 134 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 134 (867) -|+ +.+|-+-..-...-.......-.-+..... .++. +-.+ .. -.++.|-.|||+.+ T Consensus 1 M~~------~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~--v~~~-----------~a-l~~~~v~~~i~~ia 60 (429) T protein:vir:10 1 MDS------VKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTIS--VKGK-----------NA-LKVATVFACIKILS 60 (429) T ss_pred Cch------hhhhhcccccCcccccccCCChHHHHHHhcCCCCcce--echh-----------hh-hccHHHHHHHHHHH Confidence 111 111100000000000000000000000000 0010 0011 11 24677888888876 Q ss_pred h-cccccceecc-cchhHHHH----HHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcc Q lcl|NC_011269. 135 K-FPVVGMEFDS-KDPLIKTF----YEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPD 205 (867) Q Consensus 135 ~-~~~~~~~~~~-~~~~~~~~----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 205 (867) + ..-..|+... +|.+.++- ..+++- -+..+-.+|+..++ ..++.-|+++-+...|+.+. ..++.+|+|+ T Consensus 61 ~~ia~l~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gnay~~i~r~~~G~-~~~L~~i~~~ 138 (429) T protein:vir:10 61 ESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLE-AQKNLYGNSYANIEFDRKGK-VQALWPIDAS 138 (429) T ss_pred HhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHH-HHHhhcCCeEEEEEECCCCc-EEEEEEEcCc Confidence 5 2222233221 22222221 112211 12344568888866 88888999999999887654 7789999999 Q ss_pred eeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHH Q lcl|NC_011269. 206 MLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEAL 285 (867) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (867) .|.|... ....+. ....++. ..-.+|....++..- T Consensus 139 ~v~v~~~---~~~~~~------------------------------------~~~~~~~------~~~~~g~~~~~~~~e 173 (429) T protein:vir:10 139 KVTVYID---DVGLLN------------------------------------SKTKMWY------VVNTGGQQRVLKPEE 173 (429) T ss_pred eeEEEEc---Cccccc------------------------------------ccceEEE------EEccCCeEEEEcccc Confidence 9987521 111111 0000000 111234455677778 Q ss_pred HHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHh Q lcl|NC_011269. 286 ISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSL 365 (867) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 365 (867) |-|+++..+.=-..|.+.+..+..+|-......++...+.+.-..|--++++.. .-+.+..+++|+.|+.. T Consensus 174 vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~---------~l~~e~~~~~~~~~~~~ 244 (429) T protein:vir:10 174 ILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG---------DLNEDAKKVFRENFESM 244 (429) T ss_pred EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC---------CCCHHHHHHHHHHHHHH Confidence 899987655444569999999999988887777777777777777777777642 12566788999988876 Q ss_pred hhc-c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHHHHHHHH Q lcl|NC_011269. 366 LAA-D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQIMTGFQ 441 (867) Q Consensus 366 ~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~~~~~ 441 (867) ... + -..+|-.-|++++.++..-....+-+-.++..++|.+++||...++.+.++++|+++. ....|+..-++-+. T Consensus 245 ~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~ 324 (429) T protein:vir:10 245 SSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATL 324 (429) T ss_pred hccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHH Confidence 542 3 2556667899999886544333333445677889999999999999988999999844 33344555566666 Q ss_pred HHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhh Q lcl|NC_011269. 442 NALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQL 521 (867) Q Consensus 442 ~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL 521 (867) ..|++.+.+.+=...+.. ..+.|+|-+..+...|..+.-+...++ ++.. -+..+.-+...+ +..+ T Consensus 325 ~~ie~~ln~kl~~~~~~~--~g~~~~fd~~~ll~~d~~~~~~~~~~~-~~~G-----------~~T~NE~R~~~g-l~p~ 389 (429) T protein:vir:10 325 TMYEQEMTYKLFLDSELD--KGFYSKFNVDAILRADIKTRYEAYRTG-IQGG-----------FLKPNEARSKED-LPPE 389 (429) T ss_pred HHHHHHHHHhhcChhhcC--CCcEEEeechhhhcCCHHHHHHHHHHH-HhCC-----------CcCHHHHHHHhC-CCCC Confidence 666666665543222321 112233333333222322221111111 1110 010000000000 0000 Q ss_pred hhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccC Q lcl|NC_011269. 522 KGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQN 574 (867) Q Consensus 522 ~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g 574 (867) .. +..+..+..-..+++ ++. ...+ ..... ....+....-+ T Consensus 390 ~g-gD~~~~~~n~~~~d~-~~~-~~~k---------~g~~~-~~~~~~~~e~~ 429 (429) T protein:vir:10 390 AG-GDRLLVNGNMLPIDM-AGQ-AYLK---------GGDTN-GEVSKEGNEGN 429 (429) T ss_pred CC-cCeeeecccccchhh-ccc-cccC---------CCCCC-CCCCCCCCCCC Confidence 00 000000000000000 000 0000 00000 00000000000 No 23 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=98.72 E-value=9.2e-10 Score=70.13 Aligned_cols=377 Identities=10% Similarity=0.035 Sum_probs=193.1 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcc Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFP 137 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 137 (867) -|+-+++.. + ++. |. ......+....+.++.-.. . .-.+=|-.++-|-.|||+.++ . T Consensus 1 Mg~f~~~~~--~--~~~----~~-------~~~~~~~~~~~~~~~~~~~-~------v~~~~~l~~~~v~~~i~~ia~-~ 57 (382) T protein:vir:48 1 MPIFNLATE--S--PPD----NQ-------GGFFDVVDSDFLASLKGNE-W------VSAETALRNSDLFSIINQLSN-D 57 (382) T ss_pred Ccccccccc--C--Ccc----cc-------cccccchhhhccccccCCc-c------cchHhhhccHHHHHHHHHHHH-h Confidence 111110000 0 000 00 0000000000011110000 0 000113356778889998876 2 Q ss_pred cccceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcc Q lcl|NC_011269. 138 VVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQR 217 (867) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (867) |..+.|...+..-+...+ . -.+.++-.+|+..++ ..++.-|.++-+...++. |.+.++.+|+|+.|.|.+.-.- T Consensus 58 ia~~~~~~~~~~~~~L~~-~-PN~~~t~~~f~~~l~-~~l~l~Gna~~~i~rd~~-G~~~~l~~i~~~~v~v~~~~~~-- 131 (382) T protein:vir:48 58 LATVKLITSRKKLQGIVD-N-PSNNANRFNFYQSIF-AQMLLGGEAFAYRWRNEN-GRDMKWEYLRPSQVSFNRLDNK-- 131 (382) T ss_pred hccCceeeecchhhhhhh-h-cCCCCCHHHHHHHHH-HHhhhcCCEEEEEEECCC-CcEEEEEEEcCceeEEEEcCCC-- Confidence 333344444333222111 1 124456689999977 888999999998888755 5578999999999998732110 Q ss_pred hHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCcccc Q lcl|NC_011269. 218 ERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWA 297 (867) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (867) ..+ .| +|-. .-...+....++..-|-|+.+....-. T Consensus 132 ~~~-------------------------------------~y-----~~~~--~~~~~~~~~~~~~~evih~~~~~~~~~ 167 (382) T protein:vir:48 132 DGI-------------------------------------YY-----NITF--DDPRIPPKQHVPQNDVLHFRLLSVDGG 167 (382) T ss_pred CeE-------------------------------------EE-----EEEe--cCccccceeEEcCccEEEecCCCCCCc Confidence 000 00 1100 111222334556666888887666555 Q ss_pred ccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhh Q lcl|NC_011269. 298 TRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNF 377 (867) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 377 (867) ..|.+.+..+.++|.......+......++-..|--++++.+ .. +.++...+++.++....-....+|-.- T Consensus 168 ~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~----~~-----~~e~~~~~~~~~~~~~~n~g~~~vl~~ 238 (382) T protein:vir:48 168 MTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKG----GG-----LLDFKTKLSRSRQAMKQMQGGPLVLDD 238 (382) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC----CC-----ChHHHHHHHHHHHhhccCCCCeeEcCC Confidence 789999999999998888888888888888888888888852 21 345666777777777666677788888 Q ss_pred heeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHH Q lcl|NC_011269. 378 GLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAE 457 (867) Q Consensus 378 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e 457 (867) |++++.+...-..+.+-+-.+..+++|.+++||...++. +.+.++.+.+-..+|++.-+.-+...|++.+.+.+-.-.+ T Consensus 239 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg-~~~~~~~~~~~~~~~~~~~l~p~~~~i~~~l~~~l~~~~~ 317 (382) T protein:vir:48 239 LEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVG-GQGDQQSSLEMSSDLYSKAVSRYLRPFLSELSQKLSCDVD 317 (382) T ss_pred CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhC-CCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhh Confidence 999988876555444445557778999999999999995 6666555555556666655666666666666655522111 Q ss_pred hhcccchheehhhccccchhhhhhhhhhhhhH-----hhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccc Q lcl|NC_011269. 458 AQGHYDYDLKGGVRVPIYREIVEYDEETGQEY-----IRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKT 532 (867) Q Consensus 458 ~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~-----~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t 532 (867) ..- +..++ .|-..+++.+.++. .+.+...+..+.. + +..+..... . ...+. T Consensus 318 ~~~---------~~~~~-~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g---~-~~~~~~~~~---~-------~~~~~ 373 (382) T protein:vir:48 318 ADI---------FPAVD-PTGSNYISRINSLVKTGTLAQNQGLYILQQAE---I-LPKELPNGE---N-------PNSTL 373 (382) T ss_pred hhh---------hhhhc-cchhHHHHHHHHHhhcCccCHHHHHHHHhhCC---C-CCcchhhhh---c-------CCCCC Confidence 111 00000 11112222222211 0001000000000 0 001100000 0 00011 Q ss_pred cCCCcccccchh Q lcl|NC_011269. 533 LAVNIDMKFDQE 544 (867) Q Consensus 533 ~p~tiqme~E~e 544 (867) .+++ +-+++ T Consensus 374 ~GGd---~~~~~ 382 (382) T protein:vir:48 374 KGGE---EDGQD 382 (382) T ss_pred CCCC---CCCCC Confidence 1111 11111 No 24 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.69 E-value=4.1e-10 Score=72.06 Aligned_cols=413 Identities=12% Similarity=0.062 Sum_probs=193.9 Q ss_pred hcchhHHHHHHHH---hcccccccceeeccchhhhhhhhhHH---hhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHH Q lcl|NC_011269. 58 RAAEANRQRLASY---RKQGNFGSNMQIAMPKIRQPLGTLAD---KGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLID 131 (867) Q Consensus 58 ~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 131 (867) -|+- .||.++ .|.. +.-.+.....-..+..... .++.-+ + .=+-.++.|-.||+ T Consensus 1 M~~~---~r~~~~~~~~~r~---~~~~~~~~~~~~~~~~~~g~~~~~~~v~--~------------~~al~~~~v~~~i~ 60 (432) T protein:vir:10 1 MKIV---DSVKKFFNFEKRQ---TSQVIELNKDDEKLLEWLGISPSTISVK--G------------KNALKVATVFACIK 60 (432) T ss_pred CChH---HHHHHhcCccccC---cccccccCCchHHHHHHhCCCcCccccc--h------------hhhhccHHHHHHHH Confidence 1121 111111 0000 0000000000000000000 000000 0 01123567778888 Q ss_pred hhhh-cccccceecccc-hhHHH----HHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheec Q lcl|NC_011269. 132 IYSK-FPVVGMEFDSKD-PLIKT----FYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEIL 202 (867) Q Consensus 132 ~~~~-~~~~~~~~~~~~-~~~~~----~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (867) +.++ ..-..+++..++ .+.++ -..+++- .+..+-.+|+..++ ..++.-|+++-+.+.|..+ ...++.+| T Consensus 61 ~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gnay~~i~r~~~G-~~~~L~~i 138 (432) T protein:vir:10 61 ILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLE-AQKNLYGNSYANIEFDRKG-KVQALWPI 138 (432) T ss_pred HHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHH-HHHhhcCCeEEEEEECCCC-cEEEEEEE Confidence 8765 222223332222 22111 1122211 13345578999877 8889999999999888764 47889999 Q ss_pred CcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCccc Q lcl|NC_011269. 203 NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDIS 282 (867) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 282 (867) +|+.|.|... ....+. .. -.++.++ ..+|....++ T Consensus 139 ~~~~v~v~~d---~~~~~~----------------~~--------------------~~~~y~~------~~~g~~~~~~ 173 (432) T protein:vir:10 139 DASKVTVYID---DVGLLN----------------SK--------------------TKMWYVV------NTGGQQRVLK 173 (432) T ss_pred cCceeEEEEc---Cccccc----------------cc--------------------ceEEEEE------ecCCeEEEEc Confidence 9999987521 111111 00 0000011 1234445677 Q ss_pred HHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHH Q lcl|NC_011269. 283 EALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDM 362 (867) Q Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (867) ..-|-|+++..+.=-.+|.+.+..+..+|.......+......+.-..|--++++.+ . .+.+..+.+|+.+ T Consensus 174 ~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~------~---l~~e~~~~~~~~~ 244 (432) T protein:vir:10 174 PEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG------D---LNEDAKKVFRENF 244 (432) T ss_pred cccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC------C---CCHHHHHHHHHHH Confidence 788999987655445569999999999888877777777777777667776666642 1 3556778999988 Q ss_pred HHhhhc-c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHHHHH Q lcl|NC_011269. 363 QSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQIMT 438 (867) Q Consensus 363 ~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~~ 438 (867) +..+.. + ...+|-.-|++++.+...-.-.-+-+-.++..++|.+++||...++...+.++|+++. -...|+..-+. T Consensus 245 ~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~ 324 (432) T protein:vir:10 245 ESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQ 324 (432) T ss_pred HHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHH Confidence 776542 3 3556667799998887554433333445677889999999999999877888998843 33334444455 Q ss_pred HHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhh Q lcl|NC_011269. 439 GFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFI 518 (867) Q Consensus 439 ~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v 518 (867) -+...|++.+.+.+=.-.|... ...|+|-+..+...|..+.-+-. +..++.. -+..+.-+...+ + T Consensus 325 P~~~~ie~~ln~kLl~~~~~~~--g~~~~fd~~~l~~~d~~~~~~~~-~~~~~~G-----------~~t~NE~R~~~g-~ 389 (432) T protein:vir:10 325 ATLTMYEQEMTYKLFLDSELDK--GFYSKFNVDAILRADIKTRYEAY-RTGIQGG-----------FLKPNEARSKED-L 389 (432) T ss_pred HHHHHHHHHHHHhhcChhhcCC--CcEEEeechhhhcCCHHHHHHHH-HHHHhCC-----------CcCHHHHHHHhC-C Confidence 5566666666554422233211 12244433333333332222211 1111110 010100000000 0 Q ss_pred hhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccC Q lcl|NC_011269. 519 AQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQN 574 (867) Q Consensus 519 ~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g 574 (867) ..++. +..+.-+..-..+++-.+ ...+ ... +-.+.......-+ T Consensus 390 ~pi~g-gD~~~~~~n~~~~~~~~~--~~~k---------~~~-~~~~~~~~~~~~~ 432 (432) T protein:vir:10 390 PPEAG-GDRLLVNGNMLPIDMAGQ--AYLK---------GGD-TNGEVSKEGNEGN 432 (432) T ss_pred CCCCC-CCeEeecccccchhhccc--cccC---------CCC-CCCCCCCCCCCCC Confidence 00000 000000000000000000 0000 000 0000000000000 No 25 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.69 E-value=4.1e-10 Score=72.06 Aligned_cols=413 Identities=12% Similarity=0.062 Sum_probs=193.9 Q ss_pred hcchhHHHHHHHH---hcccccccceeeccchhhhhhhhhHH---hhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHH Q lcl|NC_011269. 58 RAAEANRQRLASY---RKQGNFGSNMQIAMPKIRQPLGTLAD---KGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLID 131 (867) Q Consensus 58 ~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 131 (867) -|+- .||.++ .|.. +.-.+.....-..+..... .++.-+ + .=+-.++.|-.||+ T Consensus 1 M~~~---~r~~~~~~~~~r~---~~~~~~~~~~~~~~~~~~g~~~~~~~v~--~------------~~al~~~~v~~~i~ 60 (432) T protein:vir:10 1 MKIV---DSVKKFFNFEKRQ---TSQVIELNKDDEKLLEWLGISPSTISVK--G------------KNALKVATVFACIK 60 (432) T ss_pred CChH---HHHHHhcCccccC---cccccccCCchHHHHHHhCCCcCccccc--h------------hhhhccHHHHHHHH Confidence 1121 111111 0000 0000000000000000000 000000 0 01123567778888 Q ss_pred hhhh-cccccceecccc-hhHHH----HHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheec Q lcl|NC_011269. 132 IYSK-FPVVGMEFDSKD-PLIKT----FYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEIL 202 (867) Q Consensus 132 ~~~~-~~~~~~~~~~~~-~~~~~----~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (867) +.++ ..-..+++..++ .+.++ -..+++- .+..+-.+|+..++ ..++.-|+++-+.+.|..+ ...++.+| T Consensus 61 ~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gnay~~i~r~~~G-~~~~L~~i 138 (432) T protein:vir:10 61 ILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLE-AQKNLYGNSYANIEFDRKG-KVQALWPI 138 (432) T ss_pred HHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHH-HHHhhcCCeEEEEEECCCC-cEEEEEEE Confidence 8765 222223332222 22111 1122211 13345578999877 8889999999999888764 47889999 Q ss_pred CcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCccc Q lcl|NC_011269. 203 NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDIS 282 (867) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 282 (867) +|+.|.|... ....+. .. -.++.++ ..+|....++ T Consensus 139 ~~~~v~v~~d---~~~~~~----------------~~--------------------~~~~y~~------~~~g~~~~~~ 173 (432) T protein:vir:10 139 DASKVTVYID---DVGLLN----------------SK--------------------TKMWYVV------NTGGQQRVLK 173 (432) T ss_pred cCceeEEEEc---Cccccc----------------cc--------------------ceEEEEE------ecCCeEEEEc Confidence 9999987521 111111 00 0000011 1234445677 Q ss_pred HHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHH Q lcl|NC_011269. 283 EALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDM 362 (867) Q Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (867) ..-|-|+++..+.=-.+|.+.+..+..+|.......+......+.-..|--++++.+ . .+.+..+.+|+.+ T Consensus 174 ~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~------~---l~~e~~~~~~~~~ 244 (432) T protein:vir:10 174 PEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG------D---LNEDAKKVFRENF 244 (432) T ss_pred cccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC------C---CCHHHHHHHHHHH Confidence 788999987655445569999999999888877777777777777667776666642 1 3556778999988 Q ss_pred HHhhhc-c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHHHHH Q lcl|NC_011269. 363 QSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQIMT 438 (867) Q Consensus 363 ~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~~ 438 (867) +..+.. + ...+|-.-|++++.+...-.-.-+-+-.++..++|.+++||...++...+.++|+++. -...|+..-+. T Consensus 245 ~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~ 324 (432) T protein:vir:10 245 ESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQ 324 (432) T ss_pred HHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHH Confidence 776542 3 3556667799998887554433333445677889999999999999877888998843 33334444455 Q ss_pred HHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhh Q lcl|NC_011269. 439 GFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFI 518 (867) Q Consensus 439 ~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v 518 (867) -+...|++.+.+.+=.-.|... ...|+|-+..+...|..+.-+-. +..++.. -+..+.-+...+ + T Consensus 325 P~~~~ie~~ln~kLl~~~~~~~--g~~~~fd~~~l~~~d~~~~~~~~-~~~~~~G-----------~~t~NE~R~~~g-~ 389 (432) T protein:vir:10 325 ATLTMYEQEMTYKLFLDSELDK--GFYSKFNVDAILRADIKTRYEAY-RTGIQGG-----------FLKPNEARSKED-L 389 (432) T ss_pred HHHHHHHHHHHHhhcChhhcCC--CcEEEeechhhhcCCHHHHHHHH-HHHHhCC-----------CcCHHHHHHHhC-C Confidence 5566666666554422233211 12244433333333332222211 1111110 010100000000 0 Q ss_pred hhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccC Q lcl|NC_011269. 519 AQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQN 574 (867) Q Consensus 519 ~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g 574 (867) ..++. +..+.-+..-..+++-.+ ...+ ... +-.+.......-+ T Consensus 390 ~pi~g-gD~~~~~~n~~~~~~~~~--~~~k---------~~~-~~~~~~~~~~~~~ 432 (432) T protein:vir:10 390 PPEAG-GDRLLVNGNMLPIDMAGQ--AYLK---------GGD-TNGEVSKEGNEGN 432 (432) T ss_pred CCCCC-CCeEeecccccchhhccc--cccC---------CCC-CCCCCCCCCCCCC Confidence 00000 000000000000000000 0000 000 0000000000000 No 26 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.69 E-value=4.1e-10 Score=72.06 Aligned_cols=413 Identities=12% Similarity=0.062 Sum_probs=193.9 Q ss_pred hcchhHHHHHHHH---hcccccccceeeccchhhhhhhhhHH---hhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHH Q lcl|NC_011269. 58 RAAEANRQRLASY---RKQGNFGSNMQIAMPKIRQPLGTLAD---KGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLID 131 (867) Q Consensus 58 ~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 131 (867) -|+- .||.++ .|.. +.-.+.....-..+..... .++.-+ + .=+-.++.|-.||+ T Consensus 1 M~~~---~r~~~~~~~~~r~---~~~~~~~~~~~~~~~~~~g~~~~~~~v~--~------------~~al~~~~v~~~i~ 60 (432) T protein:vir:10 1 MKIV---DSVKKFFNFEKRQ---TSQVIELNKDDEKLLEWLGISPSTISVK--G------------KNALKVATVFACIK 60 (432) T ss_pred CChH---HHHHHhcCccccC---cccccccCCchHHHHHHhCCCcCccccc--h------------hhhhccHHHHHHHH Confidence 1121 111111 0000 0000000000000000000 000000 0 01123567778888 Q ss_pred hhhh-cccccceecccc-hhHHH----HHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheec Q lcl|NC_011269. 132 IYSK-FPVVGMEFDSKD-PLIKT----FYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEIL 202 (867) Q Consensus 132 ~~~~-~~~~~~~~~~~~-~~~~~----~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (867) +.++ ..-..+++..++ .+.++ -..+++- .+..+-.+|+..++ ..++.-|+++-+.+.|..+ ...++.+| T Consensus 61 ~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gnay~~i~r~~~G-~~~~L~~i 138 (432) T protein:vir:10 61 ILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLE-AQKNLYGNSYANIEFDRKG-KVQALWPI 138 (432) T ss_pred HHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHH-HHHhhcCCeEEEEEECCCC-cEEEEEEE Confidence 8765 222223332222 22111 1122211 13345578999877 8889999999999888764 47889999 Q ss_pred CcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCccc Q lcl|NC_011269. 203 NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDIS 282 (867) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 282 (867) +|+.|.|... ....+. .. -.++.++ ..+|....++ T Consensus 139 ~~~~v~v~~d---~~~~~~----------------~~--------------------~~~~y~~------~~~g~~~~~~ 173 (432) T protein:vir:10 139 DASKVTVYID---DVGLLN----------------SK--------------------TKMWYVV------NTGGQQRVLK 173 (432) T ss_pred cCceeEEEEc---Cccccc----------------cc--------------------ceEEEEE------ecCCeEEEEc Confidence 9999987521 111111 00 0000011 1234445677 Q ss_pred HHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHH Q lcl|NC_011269. 283 EALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDM 362 (867) Q Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (867) ..-|-|+++..+.=-.+|.+.+..+..+|.......+......+.-..|--++++.+ . .+.+..+.+|+.+ T Consensus 174 ~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~------~---l~~e~~~~~~~~~ 244 (432) T protein:vir:10 174 PEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG------D---LNEDAKKVFRENF 244 (432) T ss_pred cccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC------C---CCHHHHHHHHHHH Confidence 788999987655445569999999999888877777777777777667776666642 1 3556778999988 Q ss_pred HHhhhc-c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHHHHH Q lcl|NC_011269. 363 QSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQIMT 438 (867) Q Consensus 363 ~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~~ 438 (867) +..+.. + ...+|-.-|++++.+...-.-.-+-+-.++..++|.+++||...++...+.++|+++. -...|+..-+. T Consensus 245 ~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~ 324 (432) T protein:vir:10 245 ESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQ 324 (432) T ss_pred HHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHH Confidence 776542 3 3556667799998887554433333445677889999999999999877888998843 33334444455 Q ss_pred HHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhh Q lcl|NC_011269. 439 GFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFI 518 (867) Q Consensus 439 ~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v 518 (867) -+...|++.+.+.+=.-.|... ...|+|-+..+...|..+.-+-. +..++.. -+..+.-+...+ + T Consensus 325 P~~~~ie~~ln~kLl~~~~~~~--g~~~~fd~~~l~~~d~~~~~~~~-~~~~~~G-----------~~t~NE~R~~~g-~ 389 (432) T protein:vir:10 325 ATLTMYEQEMTYKLFLDSELDK--GFYSKFNVDAILRADIKTRYEAY-RTGIQGG-----------FLKPNEARSKED-L 389 (432) T ss_pred HHHHHHHHHHHHhhcChhhcCC--CcEEEeechhhhcCCHHHHHHHH-HHHHhCC-----------CcCHHHHHHHhC-C Confidence 5566666666554422233211 12244433333333332222211 1111110 010100000000 0 Q ss_pred hhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccC Q lcl|NC_011269. 519 AQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQN 574 (867) Q Consensus 519 ~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g 574 (867) ..++. +..+.-+..-..+++-.+ ...+ ... +-.+.......-+ T Consensus 390 ~pi~g-gD~~~~~~n~~~~~~~~~--~~~k---------~~~-~~~~~~~~~~~~~ 432 (432) T protein:vir:10 390 PPEAG-GDRLLVNGNMLPIDMAGQ--AYLK---------GGD-TNGEVSKEGNEGN 432 (432) T ss_pred CCCCC-CCeEeecccccchhhccc--cccC---------CCC-CCCCCCCCCCCCC Confidence 00000 000000000000000000 0000 000 0000000000000 No 27 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=98.69 E-value=3e-09 Score=67.33 Aligned_cols=428 Identities=13% Similarity=0.087 Sum_probs=193.5 Q ss_pred Hhcccccccceeeccchhhhh-hhhh--HHhhCCCch---hhhHHHHHHHHHHHHH--h-hccchHHHHHHhhhhccccc Q lcl|NC_011269. 70 YRKQGNFGSNMQIAMPKIRQP-LGTL--ADKGIPFNV---EDEEELRVIRHWCRLF--Y-ATHDLVPLLIDIYSKFPVVG 140 (867) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~-~~~~--~~~~~~~~~---~~~~~~~~~~~~~~~~--~-~~~~~~~~~~~~~~~~~~~~ 140 (867) |- +-.+...|.++-- ++++ ++.+ +-+ |..+||+- .+=|++| . ...+-|..||+--. -+|.+ T Consensus 1 ~~------~~~~~~~p~~~~g~~~~~~~~~~~--~~~~~~e~~~~lr~-~~~~~ly~~m~e~D~~i~s~l~~rk-~av~~ 70 (469) T protein:vir:10 1 MT------ERVKTAAPVSEAGYVFGSGVVDGW--TVWDPFEQTPELQW-PQSVAVYSRMDNEDSRVTSLLEAIS-LPIRS 70 (469) T ss_pred CC------CcccCCCCccchhhhhhcccccch--hhcccccccccccc-ccchHHHHHHHhhChHHHHHHHHHH-HHHhc Confidence 11 1112223322110 1111 1100 112 22234431 1114444 2 24677888888654 55776 Q ss_pred ceec----ccchhHHHHHHHHhh----cc-----------cccHHHHhHHHHHHHH----hhhhhhcchhhhh-hhccce Q lcl|NC_011269. 141 MEFD----SKDPLIKTFYEDLFF----GE-----------DLNYLEFLPDQFAREY----FTVGEVTSLAHFN-ESLGVW 196 (867) Q Consensus 141 ~~~~----~~~~~~~~~~~~~~~----~~-----------~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~-~~~~~~ 196 (867) .++. .+|+.+.+|..+... +. +.+..++|.++. ... |.+.|.. +.+-. ..+|.| T Consensus 71 ~~w~v~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l-~~a~~~G~s~~Eiv-w~~~~~~~dG~~ 148 (469) T protein:vir:10 71 TPWRIRANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVT-SPTLQFGHAVFEQV-YRPRNQSPDGRF 148 (469) T ss_pred CCceEecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHH-HHhhhhCceeeeee-eecccccCCCce Confidence 4433 346666666655432 11 122334444432 111 1122211 00000 112332 Q ss_pred --ehheecCccee-ehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhh Q lcl|NC_011269. 197 --SSEEILNPDML-RVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAA 273 (867) Q Consensus 197 --~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 273 (867) ..++..++..+ ++. |.-+.... .+++.+... .-+.. ... T Consensus 149 ~~~~l~~rp~~~i~~~~---~~~~~~l~--------~~~~~~~~~------------------------~~~~~---~~~ 190 (469) T protein:vir:10 149 WLRKLAPRPQWTISKFN---VAPDGGLE--------SIEQIAPPA------------------------RTRGS---LYV 190 (469) T ss_pred eeeeeeecCcccceeee---eccCCcee--------eeeecCccc------------------------ccccc---ccc Confidence 23333344333 111 11111100 001100000 00000 011 Q ss_pred ccCCCCcccHH-HHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCH Q lcl|NC_011269. 274 MQNDGLDISEA-LISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQ 352 (867) Q Consensus 274 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 352 (867) ....++.|+.. .|.|+ |+++.=.+.|.++|..|+...+.|...-+-....+.|+-.|+++.|... | -+. T Consensus 191 ~~~~~~~lp~~k~i~~~-~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~-----~----a~~ 260 (469) T protein:vir:10 191 ANIAPPEIPVNRLVVYT-RNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASS-----A----TDE 260 (469) T ss_pred CCCCccccccCcEEEEE-ecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCC-----C----CCH Confidence 23345556444 45555 6666666789999999999999999999999999999999999998763 1 346 Q ss_pred HHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHH Q lcl|NC_011269. 353 GELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREF 432 (867) Q Consensus 353 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 432 (867) +|.+.+.+.+++...-..--+|.--|.++|.+.+.++.-.-+.=++...++|..++ +|+-|+|++.|+.|+.+.|-.|. T Consensus 261 ~ek~~l~~a~~~~~~g~~a~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~i-LG~tlTs~~~gGS~a~~~vh~ev 339 (469) T protein:vir:10 261 DEVRKMAALARSVRGGINAGVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIALSG-LAHFLNLDGKGGSYALASVLEDP 339 (469) T ss_pred HHHHHHHHHHHHHhcCCceEEEccCCceEEEeecCCCchHHHHHHHHHHHHHHHHH-hcccccccCccchhhHHHHHHHH Confidence 66666666444443212223445567888888888887666666777888998888 89999998889999999999999 Q ss_pred HHHHHHHHHHHHHHHHhh-hhHHHHHhhcccchh-eehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccc Q lcl|NC_011269. 433 VTQIMTGFQNALKRHIRR-RCEVVAEAQGHYDYD-LKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRD 510 (867) Q Consensus 433 ~~~~~~~~~~~l~~~~r~-~~~~i~e~q~~~d~~-~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~ 510 (867) .....-...+.|..++.+ +++++..+|-..+.- .++.+......+ ++ ..+.+..-+..+.+ ..+ T Consensus 340 ~~d~~~sDa~~i~~tln~~li~~l~~lN~g~~~~~P~~~~~~~e~~~---------~~----~a~~i~~l~~~G~~-~~~ 405 (469) T protein:vir:10 340 FTQAVHAYATSICRIANQHIIEDLVDINFGVDTPAPVLTFDPIGSRQ---------DL----TAAAVKLLYDAGVF-DDD 405 (469) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecCCCCcH---------HH----HHHHHHHHHhcCCc-cCc Confidence 999899999999999964 779999999443311 122221111100 00 01111000011100 000 Q ss_pred hhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhc--ccccccccccccccCCCCCcccccccccc Q lcl|NC_011269. 511 EAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMAT--AQAMKKVQDLCDAQNLPYPPELAQHLQST 588 (867) Q Consensus 511 e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~t--aet~kkvq~~~p~~g~P~pp~~aQ~p~~t 588 (867) +.. +..+.+ ..+.+......+.. ....+. ...... .+....... .....+.+......+ T Consensus 406 ~~~-~~~~~e--~~gip~~~~~~~~~--~~~~~~---------~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~l--- 466 (469) T protein:vir:10 406 PAV-KRAIRQ--RFNLPSELNDTPSA--EPEEPA---------AVPNQSAAPARTRSSGN--ADARARAPKADQGVL--- 466 (469) T ss_pred ccc-HHHHHH--HhCCCCCCCCcccc--cchhcc---------cCCCCCccccccCCCCC--cccccccCCChHHhh--- Confidence 000 000110 01111111000000 000000 000000 000000000 000000000000000 Q ss_pred ccCCCC Q lcl|NC_011269. 589 LALRQG 594 (867) Q Consensus 589 ~~~a~g 594 (867) ... T Consensus 467 ---~da 469 (469) T protein:vir:10 467 ---FDA 469 (469) T ss_pred ---ccC Confidence 000 No 28 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=98.69 E-value=5.1e-09 Score=66.05 Aligned_cols=400 Identities=11% Similarity=0.076 Sum_probs=194.7 Q ss_pred chhHHHHHHHHhcccccccceeeccchhhhh----hhhh-HHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhh Q lcl|NC_011269. 60 AEANRQRLASYRKQGNFGSNMQIAMPKIRQP----LGTL-ADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYS 134 (867) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 134 (867) +--|+. +-++..-..|.-..- +|.. -..+. +|-..+- -.++-|-.||++.+ T Consensus 1 m~~~~~----------~~~~~~~~~~~~~~~~~~~~g~~~s~~~~--~v~~~~a------------l~~~~v~~cv~~ia 56 (419) T protein:vir:80 1 MFFSRQ----------LLSNLGQTQPGSGGWVSALLGSARSEAGQ--VVTPASA------------LSLTVLQNCVTLLA 56 (419) T ss_pred CCcccc----------cccccCcCCCCcchhhHHhhcccccccCc--ccChHHh------------hccHHHHHHHHHHH Confidence 111110 001111001100000 0000 00111 1111111 14567778888876 Q ss_pred hcccccceec--ccc-hhHHH---HHHHHhhc----ccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCc Q lcl|NC_011269. 135 KFPVVGMEFD--SKD-PLIKT---FYEDLFFG----EDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNP 204 (867) Q Consensus 135 ~~~~~~~~~~--~~~-~~~~~---~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 204 (867) + .|..+.|. -++ +..++ -..+-.|- +..+-.+|+..++ ..++.-|+.+-+...+.. |...++..|+| T Consensus 57 ~-~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~l~l~Gna~~~i~r~~~-G~~~~L~~i~~ 133 (419) T protein:vir:80 57 E-SIAQLPVELYERSGDDRKPATDHPLYSILKYEPNPWQTPFEYQEQSQ-VAVGLRGNSYSFIDRDQD-GVIQGLYPLDN 133 (419) T ss_pred H-hhccCceEEEEecCCCcccccccHHHHHHHhhcccCCCHHHHHHHHH-HHHhhcCCeEEEEEECCC-CcEEEEEEecC Confidence 5 22232221 111 11111 11111121 2345668998877 788999999999888874 66889999999 Q ss_pred ceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHH Q lcl|NC_011269. 205 DMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEA 284 (867) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (867) +.|+|... .+-++ .| + ..+. ..++.+ T Consensus 134 ~~v~i~~~---~~~~~-------------------------------------~y----~--------~~~~--~~~~~~ 159 (419) T protein:vir:80 134 EAVTVMKG---PDLKP-------------------------------------MY----R--------VAGA--DPLPQR 159 (419) T ss_pred ceEEEEEC---CCceE-------------------------------------EE----E--------EcCc--cccchh Confidence 99988521 10000 01 0 0111 135667 Q ss_pred HHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHH Q lcl|NC_011269. 285 LISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQS 364 (867) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 364 (867) .|-|+.+.... .-.|.+.+.-+-.+|-...........+.+.-..|--++++. ++... .-+++.++.+|+.++. T Consensus 160 ~i~h~~~~~~d-~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~----~~~~~-~~~~~~~~~~~~~~~~ 233 (419) T protein:vir:80 160 LVHHVRWMSIN-GYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERP----TDAPA-LKDQASVDRITDGWNA 233 (419) T ss_pred heEEecCCCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEec----CCCCc-ccCHHHHHHHHHHHHH Confidence 78888864332 246999988888888777777777777777777776677765 23322 3478999999998887 Q ss_pred hhhcc---hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHHHHHHH Q lcl|NC_011269. 365 LLAAD---FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQIMTGF 440 (867) Q Consensus 365 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~~~~ 440 (867) .+... ...+|-.-|++++.++..-.-..+-+-.++..++|.+++||.-.++..+++++|+++. ....|+..-+.-+ T Consensus 234 ~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~l~P~ 313 (419) T protein:vir:80 234 KFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPW 313 (419) T ss_pred HhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHH Confidence 66542 3456667788888877554444444555777899999999999999888889998864 3444555556666 Q ss_pred HHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhh Q lcl|NC_011269. 441 QNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQ 520 (867) Q Consensus 441 ~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~q 520 (867) ...|++.+.+.+=.-.+.. .+.|+|-+..+.-.|..+.-+-. +..+..... -..++.. .+.+-........... T Consensus 314 ~~~ie~~l~~kll~~~~~~---~~~i~fd~~~l~~~d~~~~~~~~-~~~~~~G~~-T~NE~R~-~~g~~p~~gGD~~~~~ 387 (419) T protein:vir:80 314 VKRHEQAKTRDLLLPSERK---QYFIEYNLAGLLRGDQSSRYAAY-AVGRQWGWL-SINDIRR-LENMPPVKGGDIYLSP 387 (419) T ss_pred HHHHHHHHhhhccCccccC---CeEEEEechhhhccCHHHHHHHH-HHHHhCCCc-CHHHHHH-HhCCCCCCCcceeeec Confidence 6677777666541112211 23344433333222222211111 111111000 0001100 0000000000000111 Q ss_pred hhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhh Q lcl|NC_011269. 521 LKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMA 558 (867) Q Consensus 521 L~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~ 558 (867) +. ++....+.....-.....+...+|. .++.+ T Consensus 388 ~n-----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~ 419 (419) T protein:vir:80 388 MN-----MVDASKPQPIPMGKTEPTKAALDEI-GRILS 419 (419) T ss_pred cc-----cccccccccccCCCCCchhhhHHHH-HhhcC Confidence 10 0111111111111111122222222 23322 No 29 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=98.68 E-value=3.7e-09 Score=66.82 Aligned_cols=442 Identities=10% Similarity=0.028 Sum_probs=200.4 Q ss_pred CCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHH--hhCCCch Q lcl|NC_011269. 26 MPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLAD--KGIPFNV 103 (867) Q Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~ 103 (867) |.|--..+--..++..++.. .....++-..--+.+.|...+ ..+.-+.+|-|+.-|-.+++ -.+|..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~g~~~~~~~~~~~~~---------~~~~a~~~~~v~~~v~~ia~~iA~lp~~v 70 (460) T protein:vir:10 1 MANRIIRALRELTGLDNKFN-DAFIKYIGQTFTKYDNNGKTY---------LEQGYNINPDVYSCISQMAAKTVAVPYTI 70 (460) T ss_pred CchhHHHHHhhhhccCCCch-HHHHHhhccccCCCccchhhh---------hHHHHhcchHHHHHHHHHHHhhhhCceEE Confidence 22211111101111111110 011111111111122222211 01223344555555444442 3455555 Q ss_pred hhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhh Q lcl|NC_011269. 104 EDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGE 182 (867) Q Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 182 (867) -++++-...+.+-+. ..+..++-.+.+.-... .....++..+.......- -+..+-.+|+..++ ..++.-|+ T Consensus 71 ~~~~~~g~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~P-N~~~t~~~f~~~~~-~~lll~Gn 143 (460) T protein:vir:10 71 KVVKDTKAYQQLNNL-----NISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESP-NPTQTWADIYSLYK-TYMRLNGN 143 (460) T ss_pred EeccCCccchhhhhh-----hhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCC-CCCCCHHHHHHHHH-HHHhhcCC Confidence 443322111111000 00000000000000000 001111111111111111 13445668888866 88899999 Q ss_pred hcchhhhhhh---ccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhH Q lcl|NC_011269. 183 VTSLAHFNES---LGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREF 259 (867) Q Consensus 183 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 259 (867) .+-+.+.|.. .|...++..|+|+.|.|.. ..+..+. .++| T Consensus 144 ay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~---~~~~~~~----------------------------------~~~~ 186 (460) T protein:vir:10 144 CYFYLMSPDDGINAGVPSQMYVLPAHLIKIVL---KDDINLL----------------------------------STDS 186 (460) T ss_pred eEEEEEecCCCccCceeEEEEEEcCceEEEEE---cCCCcee----------------------------------eeee Confidence 9998888764 4788899999999999872 1111111 0011 Q ss_pred HHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCcccc-----ccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhh Q lcl|NC_011269. 260 QDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWA-----TRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVL 334 (867) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 334 (867) ....- ....++....++..-|-|+++..+... -+|.+.+..+-++|.......+......+.-..|-.| T Consensus 187 ~~~~~------~~~~~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i 260 (460) T protein:vir:10 187 PIKSY------MLIQGDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFI 260 (460) T ss_pred eeeEE------EEecCceeEEecccceEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccee Confidence 10011 112346667888888999988776654 3688888777777776665555555555555566666 Q ss_pred hhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhc-c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccc Q lcl|NC_011269. 335 ATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIG 411 (867) Q Consensus 335 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 411 (867) .+.+. -.+.++.+++|+.|+..+.. + ...+|-.-|++++..+..-....+-+-.++..++|.+++||. T Consensus 261 ~~~~~---------~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP 331 (460) T protein:vir:10 261 HGGST---------GLTQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYDQKAICNALGWS 331 (460) T ss_pred eecCC---------CCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCC Confidence 55552 24778899999988877642 3 245666778888888766555544455677889999999999 Q ss_pred hhhhcCCCc--cceehhh-hhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhh-hhh Q lcl|NC_011269. 412 EALISGGTG--GAYASSA-LNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEE-TGQ 487 (867) Q Consensus 412 ~~~~~~g~~--~~~~~~~-~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e-~~k 487 (867) ..++..-++ .+|+++. ....|+..-+.-+..+|++.+.+.+-.-.+ .+..+.|++-+..+.. +..+.+ .++ T Consensus 332 p~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~--~~~~~~i~~d~~~l~~---l~~d~~~~~~ 406 (460) T protein:vir:10 332 DKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKRFK--GYENAVIEWDISELPE---MQTDMVAMAS 406 (460) T ss_pred HHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc--ccCCceEEeecchhhh---HHHHHHHHHH Confidence 999975554 3688764 444566666777888888887776632222 2333334443222211 111111 111 Q ss_pred hHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhccccccccc Q lcl|NC_011269. 488 EYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQ 567 (867) Q Consensus 488 ~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq 567 (867) .. + -.-+..+.-+...+ +. ++.++ .+....+..-. ..+....+......+ T Consensus 407 ~~-~-----------~g~~T~NE~R~~~g-~~-------pi~~~-~gD~~~~~~n~---~~~~~~~~~~~~~~~------ 456 (460) T protein:vir:10 407 WL-N-----------TIPVTPNEIRIAMK-YE-------TLNQD-GMDIVFMPSNK---VRIDDVSNNLIDSAF------ 456 (460) T ss_pred HH-h-----------CCCCCHHHHHHHhC-CC-------CCCCC-CCCeeeecccc---cchhhcccccCCCcc------ Confidence 11 1 01111111111111 01 11000 00010110000 000000000000000 Q ss_pred ccccccCC Q lcl|NC_011269. 568 DLCDAQNL 575 (867) Q Consensus 568 ~~~p~~g~ 575 (867) -++. T Consensus 457 ----nq~~ 460 (460) T protein:vir:10 457 ----NQNQ 460 (460) T ss_pred ----cCCC Confidence 0000 No 30 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=98.66 E-value=5.2e-09 Score=66.01 Aligned_cols=411 Identities=10% Similarity=0.065 Sum_probs=186.3 Q ss_pred Hhcccccccce--------eeccch---hhhhhhhhH-HhhCCCchhhhHHHHHHHHHHHHH--hhccchHHHHHHhhhh Q lcl|NC_011269. 70 YRKQGNFGSNM--------QIAMPK---IRQPLGTLA-DKGIPFNVEDEEELRVIRHWCRLF--YATHDLVPLLIDIYSK 135 (867) Q Consensus 70 ~~~~~~~~~~~--------~~~~~~---~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 135 (867) |+|.+|--.++ ..+.++ ++.++.+.. +-.+|.+. ...||..+ =|++| -...+-|..||+- ++ T Consensus 1 m~kk~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~--~~iLr~~~-~~~ly~~m~~D~hi~s~l~~-Rk 76 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREF--DELLQGKD-GLLVYHKMLSDGTVKNALNY-IF 76 (448) T ss_pred CCCCCCCCcccCCcccccchhhhhhhccchhhhcccccccccccch--hHhhcccc-chHHHHHHhhChHHHHHHHH-HH Confidence 44444332211 111111 111111110 01112111 11222211 12222 1124556666665 44 Q ss_pred cccccceecc----c---chhHHHHHHHHhhcc-----cccHHHHhHHHHHHHHhhhhhhcchhh--hh-hhcccee--h Q lcl|NC_011269. 136 FPVVGMEFDS----K---DPLIKTFYEDLFFGE-----DLNYLEFLPDQFAREYFTVGEVTSLAH--FN-ESLGVWS--S 198 (867) Q Consensus 136 ~~~~~~~~~~----~---~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~--~ 198 (867) -+|.+.++.+ + |.-+.+|..+..-+. +++...+|.|+. .-.-.| +|+.. ++ ..+|.|. . T Consensus 77 ~av~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~l--da~~~G--~s~~Eivw~~~~dg~~~~~~ 152 (448) T protein:vir:77 77 GRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYE--NAYIYG--MAAGEIVLTLGADGKLILDK 152 (448) T ss_pred HHHhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHHH--Hhhhhc--ceeEEEEEeecCCCceeecc Confidence 5777744443 2 333556666654432 345666777754 222222 22221 11 2345443 4 Q ss_pred heecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchH-HHhhhccCC Q lcl|NC_011269. 199 EEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPE-IIQAAMQND 277 (867) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 277 (867) +++..+..++ +-.|..+....+ +.... +.......+ T Consensus 153 l~~r~~~~~~--~f~~~~~~~l~~-----------------------------------------~~~~~~~~~~~~~~~ 189 (448) T protein:vir:77 153 IVPIHPFNID--EVLYDEEGGPKA-----------------------------------------LKLSGEVKGGSQFVN 189 (448) T ss_pred ccccCCCccc--eeeeecCCceEE-----------------------------------------EecCCcccccccCCC Confidence 4444443221 212333332220 00000 111122345 Q ss_pred CCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHH Q lcl|NC_011269. 278 GLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDE 357 (867) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 357 (867) ++.|+..-+.|..+ +..=.+.|..++..||...+.+.-.-+-....+.|+-.|+|+.|... |.. -+.++.+. T Consensus 190 ~~~lP~~~~i~~~~-~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~-----ga~--~~~~~~~~ 261 (448) T protein:vir:77 190 GLEIPIWKTVVFLH-NDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPK-----SVR--QGTKQWEA 261 (448) T ss_pred ccccccceEEEEec-CCcCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCC-----CCC--CCHHHHHH Confidence 67777665555544 33346789999999999999999999999999999999999998752 221 24556665 Q ss_pred HHHHHHHhhhcc-hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhH--HHHH Q lcl|NC_011269. 358 VRDDMQSLLAAD-FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNR--EFVT 434 (867) Q Consensus 358 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~--~~~~ 434 (867) +.+-+.+ |.++ .--+|.-.|..+|.+.+.++...-+.-++...++|..++ +|+-|+|+..|+.|+. +++- +... T Consensus 262 l~~av~~-i~~g~~a~~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk~i-LGqtlTs~~~~g~~~~-~~~~~~~v~~ 338 (448) T protein:vir:77 262 AKEIVKN-FVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARAL-GIDFNTVQLNMGVQAV-NIGEFVSLTQ 338 (448) T ss_pred HHHHHHH-HhcCCceEEEecCCceEEEEecCCCccCHHHHHHHHHHHHHHHH-hccccccccccchhhh-hhhhHHHHHH Confidence 5542222 2222 113556677888888888877777777778899999999 9999999665555554 4432 3333 Q ss_pred HHHHHHHHHHHHHHhh-hhHHHHHhhcccchhe-ehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchh Q lcl|NC_011269. 435 QIMTGFQNALKRHIRR-RCEVVAEAQGHYDYDL-KGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEA 512 (867) Q Consensus 435 ~~~~~~~~~l~~~~r~-~~~~i~e~q~~~d~~~-~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~ 512 (867) +..-.-...|..++.+ +++++.++|-.++.-. ++.|.....-|+ ++ .++ .+. .+..++.+.. .|.... T Consensus 339 ~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~~e~eDl---~~-~a~-~~~-~l~~~~~~~~----~ip~~~ 408 (448) T protein:vir:77 339 QTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFEMEERNDF---SA-AAN-LMG-MLINAVKDSE----DIPTEL 408 (448) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCCChhhH---HH-HHH-HhH-HHHHHHHHHh----cCCccC Confidence 3345566778888875 6699999995554321 233332222222 11 100 000 0111100000 000000 Q ss_pred hhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccc Q lcl|NC_011269. 513 QERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKV 566 (867) Q Consensus 513 ~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkv 566 (867) + ..+. ..... ..+..+. .+-+.+.... ....+.. .+.++- T Consensus 409 ~--~~~~---~~~~~-~~~~~~~---~~~~~~~~~~--~~~~~~~---~~r~~~ 448 (448) T protein:vir:77 409 K--ALID---ALPSK-MRRALGV---VDEVREAVRQ--PADSRYL---YTRRRR 448 (448) T ss_pred C--cCCC---CCchh-cccccCC---CCCCCchhhc--chhhHHH---HhhhcC Confidence 0 0000 00000 0000000 0011111000 0000000 000110 No 31 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=98.65 E-value=2.4e-08 Score=62.35 Aligned_cols=471 Identities=13% Similarity=0.069 Sum_probs=203.7 Q ss_pred hHHHHHHHHhcCCCCCCchhhHHhhhhhcccCCchHHHHHH-----HHhh-hcchhHHHHHHHHhcccccccceeeccch Q lcl|NC_011269. 13 SAEVNRLRKAGVNMPNSPTMARAQAAALQNTVDNKPLIDYF-----QGRR-RAAEANRQRLASYRKQGNFGSNMQIAMPK 86 (867) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (867) -|-+..||.| . .|+|+....+. -+++ +-+.-+... +.+..++|...|. ++|- T Consensus 1 ~~~~~~~~~~---------------~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~-~~~~ 58 (535) T protein:vir:10 1 MAILKDLRNA---------------F----SLSNKKSTSYIELGDYDKDIVNKAIRPGRA--SARDTVDGIDIAD-GNVA 58 (535) T ss_pred ChhhHHHHHH---------------H----HhhhhhhhhhHHHhhhhHHHHHhhhhhhhh--hhhcccccccccc-CCcc Confidence 2222333321 1 22333322221 1121 111112222 4566666655555 7776 Q ss_pred hhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhc----c--------cccceeccc-------c Q lcl|NC_011269. 87 IRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKF----P--------VVGMEFDSK-------D 147 (867) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~--------~~~~~~~~~-------~ 147 (867) +|+ .+--+.|....++ |.+.||. .|+|..||++..+- . +.++.+..+ + T Consensus 59 g~~------~~~~~~~~~~~~~------l~~~~~~-~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~ 125 (535) T protein:vir:10 59 GQY------SVASISDVLSTKK------LLKAYAD-NDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSK 125 (535) T ss_pred ccc------ccCccccccCHHH------HHHHhcc-ChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcc Confidence 553 2333444444444 5555664 57777777665532 1 122232211 1 Q ss_pred hhHHH---HHHHHhhc--ccccH----HHHhHHHHHHHHhhhhh-hcchhhhhhhccceehheecCcceeehhhhhhhcc Q lcl|NC_011269. 148 PLIKT---FYEDLFFG--EDLNY----LEFLPDQFAREYFTVGE-VTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQR 217 (867) Q Consensus 148 ~~~~~---~~~~~~~~--~~~~~----~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (867) ..+++ .|.-+... +...- .+|+...+ ..|+..|- .+-.-.-+ ..|...++..|+|+.|+|....+... T Consensus 126 ~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv-~d~l~~~g~ay~~i~r~-~~G~~~~L~~l~p~~V~v~~d~~~~~ 203 (535) T protein:vir:10 126 AQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKII-NDMYVQDQINIERIFKN-DSNELDHFNAVDASKVVISYSPRSKD 203 (535) T ss_pred hhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHH-HHHHhhCCceEEEEEEC-CCCcEEEEEEeCCceeEEEEcCcccc Confidence 22222 12111110 11111 23444433 45555553 33333333 34567789999999999863221110 Q ss_pred hHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCcc-- Q lcl|NC_011269. 218 ERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTA-- 295 (867) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 295 (867) .- +.| . .....+....++..-|-|++..+.. T Consensus 204 ~~-------------------------------------~~~----~------~~~~~~~~~~~~~~eiih~~~~~~~~~ 236 (535) T protein:vir:10 204 QP-------------------------------------RKF----E------QFVSETKSVKFSERNLTFINYWNLSDT 236 (535) T ss_pred Cc-------------------------------------eEE----E------EEecCceeEEECcccEEEEeccCCCCc Confidence 00 000 0 1112233345555567787653322 Q ss_pred -ccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhh-cc--h- Q lcl|NC_011269. 296 -WATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLA-AD--F- 370 (867) Q Consensus 296 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~- 370 (867) ....|.+.+.-+.++|............+.+.-..|--++++-+ .+.+ --+++.++.+|+.++.... ++ . T Consensus 237 ~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~----~~~~-~ls~e~~e~lk~~~~~~~~G~~nag~ 311 (535) T protein:vir:10 237 DRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQ----DGDA-QANQMMLAGIRRQWTSQGSGLGGAWK 311 (535) T ss_pred ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecC----CCCc-ccCHHHHHHHHHHHHHHhcCcccccc Confidence 23358888888888888777666666666777677877777652 2222 1267889999998877654 22 2 Q ss_pred hhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhh-----HH--------HHHHHH Q lcl|NC_011269. 371 RLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALN-----RE--------FVTQIM 437 (867) Q Consensus 371 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-----~~--------~~~~~~ 437 (867) .+||...|++++.+...-.-..+-.-.++..++|.+++||.-.||.=-+.++|+++.-+ .+ |+.+-+ T Consensus 312 ~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L 391 (535) T protein:vir:10 312 IPILAAKDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGL 391 (535) T ss_pred cccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHH Confidence 35666789999988765443333344567789999999999999965567778765321 11 333334 Q ss_pred HHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhh--hhh-----hhhhccccccccc Q lcl|NC_011269. 438 TGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVP--KLL-----IPEIKFSTLNLRD 510 (867) Q Consensus 438 ~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~--k~i-----~~~i~~~~~~Lr~ 510 (867) +-+..+|++.|-+.+= . ..+.++.+.|..+...|..+ +.+.++......+ ..+ ++.+.++...+ T Consensus 392 ~P~l~~ie~~ln~~Ll-----~-~~~~~~~f~f~~l~~~d~~~-r~~~~~~~~~g~lT~NE~R~~~gl~piegGD~~~-- 462 (535) T protein:vir:10 392 TPLLSFIEQVINDKIM-----R-YVDTDYRFSFTLGDAQDKLQ-EEQVWKLKLANGYFINEYRKDHGLKTVDGLDVPG-- 462 (535) T ss_pred HHHHHHHHHHHhhhcc-----c-ccCCeEEEEeccccccCHHH-HHHHHHHHHcCCCCHHHHHHHhCCCCCCCccccc-- Confidence 4455555555544431 1 12335666666666555433 3344444432211 000 11111111000 Q ss_pred hhhhhhhhhhhhh--ceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcccccccccc Q lcl|NC_011269. 511 EAQERAFIAQLKG--MGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQST 588 (867) Q Consensus 511 e~~~~~~v~qL~~--~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t 588 (867) ...++.. ..+....+..+....-......+....+..... .......+.+..+.+++.... .+ T Consensus 463 ------~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~-------~~~~~g~~~~~~~~~~~~~~~--~~ 527 (535) T protein:vir:10 463 ------FIGSAENFINATGFGQPNVPDSSDDSGSTLGERERQERIQHS-------KDYEKGKDDPKSPLPKPSESD--DV 527 (535) T ss_pred ------cccchhhcccccccccccCCCCCCCccccCCccccCcccccc-------cccccCCCCCCCCCCcCCCCC--cc Confidence 0000000 000000111111100000000000000000000 000000000001100000000 00 Q ss_pred ccCCCCCCCCCCCCCCCCc Q lcl|NC_011269. 589 LALRQGKTQTELGEAQAVA 607 (867) Q Consensus 589 ~~~a~gpgq~~~~qa~~~a 607 (867) . .... +.+ T Consensus 528 ~------~~~~-----~~~ 535 (535) T protein:vir:10 528 S------NNED-----ADT 535 (535) T ss_pred c------cccc-----cCC Confidence 0 0000 000 No 32 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=98.60 E-value=4.5e-09 Score=66.34 Aligned_cols=387 Identities=13% Similarity=0.089 Sum_probs=191.7 Q ss_pred cchhhhhhhhhHHhhCCCchhhhHHHHHHHHHH----HHHh------hccchHHHHHHhhhh-cccccceec-ccchhHH Q lcl|NC_011269. 84 MPKIRQPLGTLADKGIPFNVEDEEELRVIRHWC----RLFY------ATHDLVPLLIDIYSK-FPVVGMEFD-SKDPLIK 151 (867) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~------~~~~~~~~~~~~~~~-~~~~~~~~~-~~~~~~~ 151 (867) |- |+.....- .+.+..+.. .+-.|. ...| -.++.|-.|||+-+. ..-..|+.- .+|...+ T Consensus 1 m~-f~~~~~~~-~~~~~~~~~------~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~ 72 (409) T protein:vir:10 1 ML-FRKGFKNQ-SQEISIDDK------KILEWLGINPSETYVNGKSCLKQATVFGCIRILSDNISKLPIKIYQKKDGIKR 72 (409) T ss_pred Cc-ccccccCc-CCCCCCChH------HHHHHhcCCcCcceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCeee Confidence 00 11111000 001111110 011121 1112 256778888887643 111122221 1222211 Q ss_pred HH--HHH-Hh---hcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHH Q lcl|NC_011269. 152 TF--YED-LF---FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVK 225 (867) Q Consensus 152 ~~--~~~-~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (867) .. ..+ ++ -.+..+-.+|+..++ ..++.-|+++-+.+.|.. |.+.++..|+|+.|.|. ...+.++. T Consensus 73 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gna~~~i~r~~~-G~~~~L~~i~~~~V~v~---~~~~~~~~---- 143 (409) T protein:vir:10 73 VPDHYLEYLLKLRPNPYMSSSDFWKCIE-VQRNIYGNAYVALDFKKN-GEIKGLYPLKSDGMKIF---VDDTGLLN---- 143 (409) T ss_pred ccCchHHHHHhhccCCCCCHHHHHHHHH-HHHhhcCCeEEEEEEcCC-CcEEEEEEEcCCceEEE---EcCCcccc---- Confidence 11 111 11 123455678988877 888999999999888865 66889999999999875 21111111 Q ss_pred HHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhh Q lcl|NC_011269. 226 DLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLL 305 (867) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (867) + .+.-.|+ -....|....++..-|-|+++.... .-.|.+.+. T Consensus 144 --------~-----------------------~~~~~y~------~~~~~g~~~~~~~~evih~r~~~~d-~~~G~s~i~ 185 (409) T protein:vir:10 144 --------S-----------------------ENNVWYL------YTDDLGQRHKFMSDEILHFKGLTAD-GLAGLSVIE 185 (409) T ss_pred --------c-----------------------cceEEEE------EEeCCceeEEeccccEEEecCcCCC-CcccccHHH Confidence 0 0000000 1112334455666778888754322 346999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhc-c--hhhhhhhhheeee Q lcl|NC_011269. 306 RSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-D--FRLMVHNFGLKVE 382 (867) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~ 382 (867) .+..+|....+..+....+.+.-.+|--++++.+ --+++..+.+|+-|+..... + ...+|-.-|++++ T Consensus 186 ~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~---------~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~ 256 (409) T protein:vir:10 186 LLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAG---------DLNPEAEEVFKENFERMSSGLKNAHRIAMLPIGYKFE 256 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCC---------CCCHHHHHHHHHHHHHHhccccccCCceecCCCceEE Confidence 9999998888888888888888888877777653 13567788888877765543 2 4567777888999 Q ss_pred eccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcc Q lcl|NC_011269. 383 NVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGH 461 (867) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~ 461 (867) .+...-+...+-+-.+...++|.+++||...++.+.++++|+++. ....|++.-+.-+...|++.+.+.+=...|+. T Consensus 257 ~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~kL~~~~~~~-- 334 (409) T protein:vir:10 257 PISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNREFYIDTLQSILNMYELEINYKLFLISEIK-- 334 (409) T ss_pred EccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCchhcc-- Confidence 887766555555556788899999999999999866778898864 33344444455555555555554442222221 Q ss_pred cchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCccccc Q lcl|NC_011269. 462 YDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKF 541 (867) Q Consensus 462 ~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~ 541 (867) ..+.++|-+..+...|..+.-+...+ .+... -+.. .|.=+...+..++. +..+.-+..-..+++ . T Consensus 335 ~~~~~~fd~~~ll~~d~~~~~~~~~~-~~~~G-----------~~T~-NE~R~~lgl~p~~g-gD~~~~~~n~~~~~~-~ 399 (409) T protein:vir:10 335 NGFYSKFNVDTILRADIKTRYESYKE-AIQNG-----------FKTP-NEIRELEEDEPLEG-GDVLLINGNMIPVKM-A 399 (409) T ss_pred CCcEEEEechhhhccCHHHHHHHHHH-HHhCC-----------CcCH-HHHHHHhCCCCCCC-cCeeeeccCccchhh-c Confidence 12234443333322222221111111 11110 0000 00000000000000 000000000000000 0 Q ss_pred chhhhhhHHHHHHHHhhccccccc Q lcl|NC_011269. 542 DQELERQADETVQKLMATAQAMKK 565 (867) Q Consensus 542 E~e~e~k~~E~l~tL~~taet~kk 565 (867) .... .....| T Consensus 400 ~~~~--------------~kgGe~ 409 (409) T protein:vir:10 400 GEQY--------------SKGGEK 409 (409) T ss_pred cccc--------------cccCCC Confidence 0000 000011 No 33 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=98.59 E-value=1.3e-08 Score=63.83 Aligned_cols=483 Identities=11% Similarity=0.072 Sum_probs=207.5 Q ss_pred ccccceeeccchhhhhhhhhHHh-------------hCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhc-cccc Q lcl|NC_011269. 75 NFGSNMQIAMPKIRQPLGTLADK-------------GIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKF-PVVG 140 (867) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 140 (867) -|-.||+|--=.=+.++.+..-. .=|+|++ .+++ .|.+.++|-.|||+.++- .-.+ T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~~~s~~~~~~~~~~~~~pp~~~~---------~la~-l~~~n~~v~scI~~ia~~IA~l~ 70 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREEVESQALGETRFEEYVEPKVNPL---------VLLS-LLQVNPYHASACSIKANDIIRTG 70 (542) T ss_pred CccccccccccccchhhhhccccccccccccCCccccCCCCHH---------HHHH-HHhhcHHHHHHHHHHHHHHhhCc Confidence 44445555433333333322110 1134433 2444 556678999999999863 2223 Q ss_pred ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHH Q lcl|NC_011269. 141 MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERV 220 (867) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (867) +++..++..+.. ...-.+..+-.+|+..++ ..|+.-|..+-+...|.. |...++.+|+|..|+|.+. .+..+ T Consensus 71 ~~~~~~~~~~l~---~~lpN~~~s~~~f~~~~v-~~lll~Gnayi~i~rd~~-G~~~~L~~l~~~~v~v~~d---~~~~~ 142 (542) T protein:vir:41 71 YILEGDDEGVVD---EFIRACKPSFEYVLLRAL-EDLQVFNYCTLEVVRDDR-GDPIRFEYIPSHTIRVHKD---GSRYR 142 (542) T ss_pred eeeecccchhhh---hhcCCCCCCHHHHHHHHH-HHHhhcCCeEEEEEEcCC-CcEEEEEEEcCcceEEEEc---CCeeE Confidence 555544443322 222456677788888866 889999998887777775 5678899999999998631 11111 Q ss_pred HHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccC Q lcl|NC_011269. 221 QLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRG 300 (867) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 300 (867) + ...+ .... +...|. |...+....+..+.+++..-|-|+.+..+--.-+| T Consensus 143 ~---------~~~~-----~~~~-----------~~~~y~-----~~~~~~~~~g~~~~~~~~~eIiHir~~~~~~~~~G 192 (542) T protein:vir:41 143 Q---------TWDG-----VNIT-----------HFKDYR-----YEGEINPETGEDQDSVGANELVFIHIPSPVCSYYG 192 (542) T ss_pred e---------eecC-----Ccce-----------eEEeec-----ccccccccccccccccCcccEEEecCCCCCCCccc Confidence 1 0000 0000 000111 11122223333445667777889987765555689 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCC-CCcCCCCHHHHHHHHHHHHHhhhc----chhhhhh Q lcl|NC_011269. 301 APHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGD-GEPWIPDQGELDEVRDDMQSLLAA----DFRLMVH 375 (867) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~ 375 (867) .|.+..+..+|.......+....+.+.-..|--++++-+....+ .+.---+++.++.+|+.|+..+.. ....+|- T Consensus 193 lspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL 272 (542) T protein:vir:41 193 VPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVF 272 (542) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEe Confidence 99999999999887766666666666667787777775421111 111134677888888877665432 1222221 Q ss_pred h------hheeeeeccccCccCchhHHH----HHHHHHHHHhhccchhhhcCCCccce--ehh-hhhHHHHHHHHHHHHH Q lcl|NC_011269. 376 N------FGLKVENVFGRESVPNLDADY----DRIERKLLQAWGIGEALISGGTGGAY--ASS-ALNREFVTQIMTGFQN 442 (867) Q Consensus 376 ~------~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~g~~~~~--~~~-~~~~~~~~~~~~~~~~ 442 (867) . =|+++..++.. +.|.+| +...++|..++||.-.++...++++| +++ +....|+.+-+.-++. T Consensus 273 ~~~~~~~~g~~~~pl~~~----~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~ 348 (542) T protein:vir:41 273 SIPGGDTVKVTFTPLNTS----QKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQN 348 (542) T ss_pred eccCCcccceeEEEcCCC----hhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHH Confidence 1 13444444322 234444 55578899999999999965455544 443 3555666677777778 Q ss_pred HHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhh------hhhhhhhhhcccc-ccc------- Q lcl|NC_011269. 443 ALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRK------VPKLLIPEIKFST-LNL------- 508 (867) Q Consensus 443 ~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~------~~k~i~~~i~~~~-~~L------- 508 (867) +|++.+.+.+-...+ .+..++|-...+.-+|. +..+ +..++. +....++.+.... +.+ T Consensus 349 ~ie~~ln~~L~~~~~----~~~~~~f~~~~ll~~d~---~~~~-~~~v~~GilT~NE~Re~L~g~~pgdd~~l~p~~~~~ 420 (542) T protein:vir:41 349 IISSILTDFFQVKFN----PKTRFKFNDETLLESDS---VRNC-ALLVQSGVLTPAEARERLFGLDGGPDIFMVPSKGAA 420 (542) T ss_pred HHHHHHHhhcccccC----CceEEEecchhhcchHH---HHHH-HHHHhCCCCCHHHHHHhhCCCCCCCccccccccccc Confidence 888877776633221 22334443333333331 1111 111111 0000000000000 000 Q ss_pred -----cc---hhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHH-HHHHhhcccccccccccccccCCCCCc Q lcl|NC_011269. 509 -----RD---EAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADET-VQKLMATAQAMKKVQDLCDAQNLPYPP 579 (867) Q Consensus 509 -----r~---e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~-l~tL~~taet~kkvq~~~p~~g~P~pp 579 (867) .+ +.+....+.+...+..+-.++....+ .+.+. ...++.+. ...+....++.++.--..-. T Consensus 421 ~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~--~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 490 (542) T protein:vir:41 421 KSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISSK--LSAEE-KKKKIDESLAEFRAEAYEAGKKMLIIGGD------- 490 (542) T ss_pred cccccCCcCCCCCchhhhhhcccccCcccccccccc--ccchh-hcccccchhhhhHHhHHhcCceEEEeecC------- Confidence 00 00000000110000000000000000 00000 00011011 11122222222221100000 Q ss_pred cccccccccccCCCCCCCCCCCCCCCCccCCCCccCCCCCC-Ccc---CccCcCCCCCCCC Q lcl|NC_011269. 580 ELAQHLQSTLALRQGKTQTELGEAQAVAGEAQAELQTKQIE-MQE---MMMDQQMAGGVMP 636 (867) Q Consensus 580 ~~aQ~p~~t~~~a~gpgq~~~~qa~~~agq~~~p~~~~~~~-~qp---~~~~qg~pG~~gP 636 (867) . .+..+..++-...+....- -..-...-..... .-+ +.. .-.-|-.-- T Consensus 491 -~----~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 542 (542) T protein:vir:41 491 -M----GSMSALNQGVSVIPSKPLN---LERYEELLEASVEDMIGRIRHYL-YKVIGWREL 542 (542) T ss_pred -c----hhhhhhhccceeccCCCcC---hHHHHHHHHhhHHHHHHHHHHHH-HHHhhhccC Confidence 0 0000000000000000000 0000000000000 000 000 000000000 No 34 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.58 E-value=3.9e-09 Score=66.67 Aligned_cols=479 Identities=14% Similarity=0.125 Sum_probs=213.9 Q ss_pred hhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhccc------ccccceeeccch-------------hhhhh- Q lcl|NC_011269. 32 MARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQG------NFGSNMQIAMPK-------------IRQPL- 91 (867) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~-------------~~~~~- 91 (867) |++ .---||.-+|+.. -.|+.+ +.--.|+++-.- ++.|- T Consensus 1 ~~~-------------------~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~ 59 (648) T protein:vir:79 1 MAR-------------------KVWGRGFWSRISL--MWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKM 59 (648) T ss_pred Ccc-------------------chhcchhhhhhhh--hccCccccccccccccccccCCCccccCCCCcccccccccchh Confidence 222 2223466666655 333222 112233332211 12221 Q ss_pred ------hhhHHhh-C-CCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccc--ceecccchhHH-H-HHHHH-- Q lcl|NC_011269. 92 ------GTLADKG-I-PFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVG--MEFDSKDPLIK-T-FYEDL-- 157 (867) Q Consensus 92 ------~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~-~-~~~~~-- 157 (867) |+..-.+ . ..++.+ =..++..++|. |.+.++|-.|||++++= |.. +.+..++..-. + ...+. T Consensus 60 ~~~~r~g~~~~~~~~g~~~~~e--pp~d~~~l~~l-~~~np~V~~aI~iia~~-ia~l~~~i~~~~~~~~~~~~~~~ll~ 135 (648) T protein:vir:79 60 SLVKRIGLAIMDGGGGGRDFEE--PEFDFNEITSA-YNTEGYVRQAVDKYIEM-MFKADWDFVSKNPNAVEYIRMRFTLM 135 (648) T ss_pred HHHHHhHHHHHhhcCCcccccc--CCcCHHHHHHH-HhcChHHHHHHHHHHHH-HhhCcceEEecCCccchhhHHHHHhh Confidence 1111100 0 001111 01123335666 66899999999998652 222 44444433211 1 11122 Q ss_pred hhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhcc--------------ceehheecCcceeehhhhhhhcchHHHHH Q lcl|NC_011269. 158 FFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLG--------------VWSSEEILNPDMLRVSRSMFVQRERVQLM 223 (867) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (867) -.-+..+-.+|+..++ ..++.-|.++-+...++.++ .+..+..|||+.|+|.+.-+ ..+. T Consensus 136 rPn~~~t~~~f~~~l~-~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~---g~~~-- 209 (648) T protein:vir:79 136 AEATQIPTNQLFIEIA-EDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKF---GMIK-- 209 (648) T ss_pred ccCCCCCHHHHHHHHH-HHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCC---Ccee-- Confidence 2223567778888866 88888999998877777653 34566778999888873211 1111 Q ss_pred HHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcch Q lcl|NC_011269. 224 VKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPH 303 (867) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (867) .|. .+...++..+.++..-|-|++...+.....|.|. T Consensus 210 ----------------------------------~Y~---------y~~~g~~~~~~~~~~dIIHik~~~~~d~~~GlSp 246 (648) T protein:vir:79 210 ----------------------------------GWQ---------QEQEGQDKPQKFKPEDIVHIYYKREKGRAFGTPW 246 (648) T ss_pred ----------------------------------eeE---------EEecCCceeEEecCccEEEEccCCCCCCceeccH Confidence 111 1234556667777788999998878888899999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeee Q lcl|NC_011269. 304 LLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVEN 383 (867) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 383 (867) +.-|..+|-...........+-++-..|..+++++. +.+-+-..+++++++++.++.. +++.=++..+. T Consensus 247 i~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~----~~~~~e~~k~~~e~~~~~~~~~-------~i~gg~v~~~~ 315 (648) T protein:vir:79 247 LLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGL----EQEGFGAEEGEVDLVRGEVENM-------DVEGGMVTTER 315 (648) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC----CccchHHHHHHHHHHHHhcccc-------cccccccccce Confidence 999999998777777777778888889999988873 2222245677778888766543 34443344444 Q ss_pred ccccCc----cCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHH-- Q lcl|NC_011269. 384 VFGRES----VPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAE-- 457 (867) Q Consensus 384 ~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e-- 457 (867) ++-.-. .+.+..-.++..++|.+++||.-.||.-.++.+|+++..-..+..+..--++..|++.+..++..... T Consensus 316 ~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~ll~e 395 (648) T protein:vir:79 316 VNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEILME 395 (648) T ss_pred eeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 432211 11112223556688999999999998645567777765544333333344555555555443321111 Q ss_pred --hhc--ccchheehhhccccchhhhhhhhhhhhh---------Hhhhhhhh-hhh------hhcccccc---------- Q lcl|NC_011269. 458 --AQG--HYDYDLKGGVRVPIYREIVEYDEETGQE---------YIRKVPKL-LIP------EIKFSTLN---------- 507 (867) Q Consensus 458 --~q~--~~d~~~~~~~~~~~~rd~~~~k~e~~k~---------~~r~~~k~-i~~------~i~~~~~~---------- 507 (867) +.. ..|+.++|.++.+..+|.....+...++ |.|..... .++ .+...... T Consensus 396 ~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~ 475 (648) T protein:vir:79 396 GGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAVFLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALAAL 475 (648) T ss_pred hhccccccccceEEEeecccchhhHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccccchhccccccC Confidence 111 2345577777777666554433322111 11111100 000 00000000 Q ss_pred ---------------------ccchhhhhhhhhhhhhce-----------------eeeecccc---------------- Q lcl|NC_011269. 508 ---------------------LRDEAQERAFIAQLKGMG-----------------VPVSDKTL---------------- 533 (867) Q Consensus 508 ---------------------Lr~e~~~~~~v~qL~~~~-----------------~pitd~t~---------------- 533 (867) =++++++.+.-.+.+.+. +.+++... T Consensus 476 ~~~~~~~~~~~a~~eg~~~e~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (648) T protein:vir:79 476 APTPAGGSSASASGDKKKKATDNKTKPTNQHGTKTSPKKQTNGRHVRYMQEMLLEYTTLNEAIKALIERYYQYGSKEHLK 555 (648) T ss_pred CCCCCCCCCCCccccccccccCCCCCCCCCCCcCCCCccccchhhhhhhhhhhhcchhhhHHHhhHHHHHHHHhHHHHHH Confidence 000001000000000000 00000000 Q ss_pred --C-----------------------CCcccccchhhhhhHH---HHHHHHhhcccccccccccccccCCCCCccccccc Q lcl|NC_011269. 534 --A-----------------------VNIDMKFDQELERQAD---ETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHL 585 (867) Q Consensus 534 --p-----------------------~tiqme~E~e~e~k~~---E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p 585 (867) . .++.++++.-.|.... .++.+.++.++....++.+-+. -+..-....+.. T Consensus 556 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 634 (648) T protein:vir:79 556 SINGSLMYTEGRLLELTTQYWGEEVTEKVRIPFHRMTENLREEVMSTIDKVEGVAEASDIAQAVFDV-FTDRLGHISNEA 634 (648) T ss_pred hhhhhheeccchhHHHHHHHhhhhhhceeeeeHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHH-HHHhhhhhhhhh Confidence 0 0000000000000000 0000111111111111100000 000000000000 Q ss_pred ---cccccCCCCCC Q lcl|NC_011269. 586 ---QSTLALRQGKT 596 (867) Q Consensus 586 ---~~t~~~a~gpg 596 (867) ....+...+.+ T Consensus 635 ~~~~~~~~~~~~~~ 648 (648) T protein:vir:79 635 FAISESLAEVNGDG 648 (648) T ss_pred HHHhhhHhhhcCCC Confidence 00000000000 No 35 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=98.58 E-value=5.8e-09 Score=65.75 Aligned_cols=403 Identities=10% Similarity=0.074 Sum_probs=197.8 Q ss_pred HHHHHHh-cccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh----cccc Q lcl|NC_011269. 65 QRLASYR-KQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK----FPVV 139 (867) Q Consensus 65 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~ 139 (867) =.+.++- |-+.....-....+.++..++-.. ..-...+-..+ +..++-|-.|||+-++ .|+. T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~~~~------------al~~~~v~~~i~~Ia~~ia~l~~~ 67 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRK-TASGERVSESN------------SLVQPDIFACVNVLSDDIAKLPIH 67 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCcc-cccCceechhh------------hhccHHHHHHHHHHHHhhhhCceE Confidence 0111110 000000111111111222111100 00000111111 1124445566666543 3321 Q ss_pred cceecccchhHHHH----HHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhh Q lcl|NC_011269. 140 GMEFDSKDPLIKTF----YEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRS 212 (867) Q Consensus 140 ~~~~~~~~~~~~~~----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (867) =++ .+|...++- ..+++. .+..+-.+|+..++ ..++.-|+++-+.+.+.. |...++..|+|+.|.|... T Consensus 68 ~~~--~~~~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v-~~lll~Gna~~~i~r~~~-G~~~~L~~l~~~~v~v~~~ 143 (416) T protein:vir:12 68 TYK--RTDGGIERKPEHKSAHAVYARPNPYMTAFTWKKLMM-THVLTWGNAYSYIQFGSH-GYPEALFPLRPDYTNAYVH 143 (416) T ss_pred EEE--ecCCccccccccHHHHHHHhhcccCCCHHHHHHHHH-HHHhhcCCeEEEEEECCC-CcEEEEEEECCcceEEEEe Confidence 111 111221111 111111 13355568888866 889999999998887764 6689999999999987511 Q ss_pred hhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhc Q lcl|NC_011269. 213 MFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNR 292 (867) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (867) - + ++.+ +.+ ....|..+.++...|.|+++. T Consensus 144 ~---~---------------------~~~~--------------------~~~------~~~~g~~~~~~~~eiih~~~~ 173 (416) T protein:vir:12 144 P---T---------------------TGML--------------------WYQ------TVLNGKAIELYDYEVLHFKGL 173 (416) T ss_pred C---C---------------------CcEE--------------------EEE------EecCCeEEEecCccEEEecCc Confidence 0 0 0000 111 112355567888888999865 Q ss_pred CccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhh Q lcl|NC_011269. 293 PTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRL 372 (867) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 372 (867) ... ...|.+.+..+..+|-.....+.......+.-..|--++++.. .-++++.+.+|++++..-.+.- . T Consensus 174 ~~~-~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~---------~~~~e~~~~~~~~~~~~~~~~~-~ 242 (416) T protein:vir:12 174 STD-GIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPA---------FLDEKPKENVRKEWKRVNKVEN-I 242 (416) T ss_pred CCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCC---------CCCHHHHHHHHHHHHHHhcCCC-e Confidence 433 3479999999999998877777777777788888877777752 3488999999999987666654 3 Q ss_pred hhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011269. 373 MVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQIMTGFQNALKRHIRRR 451 (867) Q Consensus 373 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~r~~ 451 (867) +|-.-|++++.+...-+..-+-.-.++..++|.+++||.-.++.+.++++|+++. ....|+..-+.-+...|++++.+. T Consensus 243 ~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~ 322 (416) T protein:vir:12 243 AIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIEYVRNTLQPWIVNFEQELNVK 322 (416) T ss_pred eecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 4446788888887554444344556777899999999999999888889999874 444566666777777777777765 Q ss_pred hHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeecc Q lcl|NC_011269. 452 CEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDK 531 (867) Q Consensus 452 ~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~ 531 (867) +-.-.+. ...+.|+|-+..+...|..+..+ .++..++.. -+..+.-+...+ +..++. +..+..+ T Consensus 323 l~~~~~~--~~g~~i~fd~~~l~~~d~~~~~~-~~~~~~~~G-----------~~T~NE~R~~~g-l~Pi~g-gd~~~~~ 386 (416) T protein:vir:12 323 LFLDHDQ--KSGHYVKFNIDSELRGDSKTQAE-YLKTLHETG-----------VLNKDEIRELLE-RNPIEN-GDKYISS 386 (416) T ss_pred hcCchhh--cCCceEEeechhhhccCHHHHHH-HHHHHHhCC-----------CcCHHHHHHHhC-CCCCCC-cceeeec Confidence 5322222 12233444333332222222111 111111110 011111011000 111100 0001001 Q ss_pred ccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccC Q lcl|NC_011269. 532 TLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQN 574 (867) Q Consensus 532 t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g 574 (867) ..-..++.. .+.+. .. ...+.+. .+ ....+ T Consensus 387 ~n~~~~~~~----~~~~~------~~-~~~~~~g-ge-~~~~g 416 (416) T protein:vir:12 387 LNYVFLDFL----EEYQR------LK-AGGAMKG-GD-NKNEG 416 (416) T ss_pred ccccccccc----chhhc------cc-cccccCC-CC-CcCCC Confidence 100111110 00000 00 0000000 00 00001 No 36 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=98.51 E-value=8.9e-08 Score=59.25 Aligned_cols=400 Identities=12% Similarity=0.085 Sum_probs=200.0 Q ss_pred cceeecc---chhhhhhhhhH---HhhCCCchhhhHHHH----HHHHHHHHH---hhccchHHHHHHhhhhccccc--ce Q lcl|NC_011269. 78 SNMQIAM---PKIRQPLGTLA---DKGIPFNVEDEEELR----VIRHWCRLF---YATHDLVPLLIDIYSKFPVVG--ME 142 (867) Q Consensus 78 ~~~~~~~---~~~~~~~~~~~---~~~~~~~~~~~~~~~----~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~--~~ 142 (867) -||...| |.+++.+.+-+ +..-.|--+|. -|| .|..=|++| ..+.+-|..+++. ++-+|.+ .+ T Consensus 1 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~D~-~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~-Rk~av~~~~w~ 78 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRTIYAMEHLGLATSYLSEDG-GYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDS-IALSVLNKVGP 78 (446) T ss_pred CcccccCCCchhhhhhhhhccccchhhcccCCcch-HhhhcCCChHHHHHHHHHHHhcchHHHHHHHH-HHHHhhcCCce Confidence 4666665 55555554433 11111111111 111 122224444 4556788888886 4456666 55 Q ss_pred ecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecC-ccee-------ehhhhhh Q lcl|NC_011269. 143 FDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILN-PDML-------RVSRSMF 214 (867) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-------~~~~~~~ 214 (867) ++..|..+.+|.++.+- +++...++.|+. .-+. ++|.. .-.+|+.....+ |-.+ +-.+.+| T Consensus 79 V~p~~~~~a~~v~~~l~--~~~~~~~~~~~l--dai~----~G~s~---~Eivw~~~~g~~~p~~~~d~~~~~~~~~~r~ 147 (446) T protein:vir:98 79 YQHGDKRIKKFIDDQLR--NRAKTWISHCVK--SIMT----YGFSL---SEQIYAHGARDNMPATVLDDIVNYHPLQVML 147 (446) T ss_pred ecCccHHHHHHHHHHHh--hcCchhHHHHHH--HHHh----hCcee---eeEEEeecccccccchhhcccccccccccee Confidence 66778888888888776 555444444421 1111 22222 334665432211 1111 0011112 Q ss_pred h--cchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHH--------hchHHHhhhccCCCCcccHH Q lcl|NC_011269. 215 V--QRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQR--------RYPEIIQAAMQNDGLDISEA 284 (867) Q Consensus 215 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~ 284 (867) . ...++ ..| +++....+..... ..+. ......+.++.|+.. T Consensus 148 ~~~~~~~~-----------~~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~g~~~~iP~~ 198 (446) T protein:vir:98 148 IANDNGRI-----------VDG-----------------DTVTASQYKSGYWVPLPPYRIGDPP-KKVDVVGSHVRLPSH 198 (446) T ss_pred eeccCCcc-----------ccc-----------------cccchhhcccccccCcccchhhhhh-hhcccCccccccccc Confidence 1 11111 111 0111111111000 0111 122334556666665 Q ss_pred HHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHH-HHHH Q lcl|NC_011269. 285 LISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVR-DDMQ 363 (867) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 363 (867) =..+..|++..=.+.|.+++-.||..-+.|...-+-....+.|+-.|+|+.|.+.. .-+.+..-|+..+.+... +.+- T Consensus 199 kfi~~~~~~~~~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~g-a~~~~~~~~~~~~~~~~~~~~L~ 277 (446) T protein:vir:98 199 KRLFINYNTKGNNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPG-NTGVVEEAPDGTEITTTIAEQAE 277 (446) T ss_pred ceEEEEecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCC-CCcccccchhHHHHHHHHHHHHH Confidence 55555666666668899999999999999999999999999999999999998631 122334345554444332 1122 Q ss_pred Hhhh---cchhhh----hhhhheeeeeccccCccC-chhHHHHHHHHHHHHhhccchhhhcC---CCccceehhhhhHHH Q lcl|NC_011269. 364 SLLA---ADFRLM----VHNFGLKVENVFGRESVP-NLDADYDRIERKLLQAWGIGEALISG---GTGGAYASSALNREF 432 (867) Q Consensus 364 ~~~~---~~~~~~----~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~---g~~~~~~~~~~~~~~ 432 (867) .++. .|=-.+ +.--|..+|.+.+.++.- .-+.=++...+.|..++ +++-|+.| |.|++|+-+.|--|. T Consensus 278 ~av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~~~~~~~i~~~d~~Iskai-Lg~~Ltl~~~~~~~GS~ala~vh~~V 356 (446) T protein:vir:98 278 DALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNFSDSFERAISLCDNNMLMGM-GIPNLLVQNRETTFGTGRASEIQLEL 356 (446) T ss_pred HHHHhccccceeeeecccCCCCceEEeeccccCChhhHHHHHHHHHHHHHHHH-hcccccccccccccchhhhHHHHHHH Confidence 2222 221111 123466788777765432 23344456678888887 56656643 456778888888877 Q ss_pred HHHHHHHHHHHHHHHHh-hhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccch Q lcl|NC_011269. 433 VTQIMTGFQNALKRHIR-RRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDE 511 (867) Q Consensus 433 ~~~~~~~~~~~l~~~~r-~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e 511 (867) ....+-.-...|..++. ++++++.++|.....- ..++... -+++++.+.+|-+..-..+.++....+.+.+. T Consensus 357 ~~d~~~aDa~~i~~tln~~Li~~l~~lNf~~~~~---~~~~~~~----~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~ 429 (446) T protein:vir:98 357 FDGKINSIFDTVIHAFTEQVIGNLIRLNFDPALY---PLASNTG----YITRLPGRATDLAALVEAIKQMHDMGFLVDGD 429 (446) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccc---ccccccc----cceeccCChhhHHHHHHHHHHHHhCCcccccc Confidence 77778888899999996 4679999999754310 1111111 12333334444333222222222211111111 Q ss_pred hhhhhhhhhhhhceeeeecccc Q lcl|NC_011269. 512 AQERAFIAQLKGMGVPVSDKTL 533 (867) Q Consensus 512 ~~~~~~v~qL~~~~~pitd~t~ 533 (867) +..|.+. .+.|-.++.- T Consensus 430 ---~~~ire~--~giP~~~~~~ 446 (446) T protein:vir:98 430 ---KDHIRSI--TGLPDAISST 446 (446) T ss_pred ---HHHHHHH--hCcCCCCCCC Confidence 1111111 1112211111 No 37 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=98.50 E-value=9.5e-09 Score=64.57 Aligned_cols=486 Identities=13% Similarity=0.074 Sum_probs=202.5 Q ss_pred cchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccc Q lcl|NC_011269. 59 AAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPV 138 (867) Q Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 138 (867) -|-+|=|-+. .+..+- -.|.++-+++.....+++.+..- .|-...|.+++.|-.|||+.++ -| T Consensus 1 ~~~~~~~~~~---~p~~~~-----~~~~~~~~~~~~~~~g~~~~~~~--------~~~~~~~~~~~~V~acV~~IA~-~i 63 (518) T protein:vir:78 1 MLLANGQTLS---APAMAE-----LSPQMQDSYYYAPAVGMQLERQF--------SLYGGIYKNQPWVRTVIAKRAQ-AL 63 (518) T ss_pred CcccCceeec---cchhhh-----hhhhhhhcccccceeceeccccc--------chhhHHhhhhHHHHHHHHHHHH-hh Confidence 2222222211 111000 01233444444444455444321 1333457789999999998765 22 Q ss_pred cc--ceec---------ccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCccee Q lcl|NC_011269. 139 VG--MEFD---------SKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDML 207 (867) Q Consensus 139 ~~--~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 207 (867) .. |++- .+|+.+..+...- ....+-.+|+..++ ..++.-|..+-+...|.. |.+.++..|+|+.| T Consensus 64 A~lp~~l~~~~~~~~~~~~~~~~~~Ll~~P--N~~~t~~~F~~~lv-~~lll~Gnay~~i~r~~~-G~~~~L~~l~p~~V 139 (518) T protein:vir:78 64 ARLPVKCMFTSGDTETEEHDTGYAKLLADP--CEYLDPFAFWEWVA-STLDIYGETYLAIQKNKS-GTPEKLMPMHPSRV 139 (518) T ss_pred ccCceEEEEEcCCccccccchHHHHHHhCC--CCCCCHHHHHHHHH-HHHhhcCCeEEEEEEcCC-CcEEEEEEECCCce Confidence 22 2221 1222222222210 12345668998877 788889999988888776 56788999999999 Q ss_pred ehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHH Q lcl|NC_011269. 208 RVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALIS 287 (867) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (867) .|...-......+. |+ + .....+.-+.++..-|- T Consensus 140 tv~~~~~~~~~~y~-------------------------------------~~-----~----~~~~~~~~~~~~~~eIi 173 (518) T protein:vir:78 140 AIKRNSRTGRYEYY-------------------------------------FQ-----A----GAGVGTQLVSFADDEVV 173 (518) T ss_pred EEEEcCCCCEEEEE-------------------------------------EE-----e----cCCccceeEEecCCcEE Confidence 98732110000000 00 0 00011222344555677 Q ss_pred HhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhh Q lcl|NC_011269. 288 RVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLA 367 (867) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (867) |+.+-...--..|.+.+--+.++|.......+....+.+.-..|--++++.+ .-+.+..+.+|+.|+..+. T Consensus 174 Hir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~---------~ls~e~~~~~k~~~~~~~~ 244 (518) T protein:vir:78 174 PIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK---------RLSPEAQQRLREQFDRAHA 244 (518) T ss_pred EecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCC---------CCCHHHHHHHHHHHHHHhc Confidence 8876544333479999988888888888888788888888777766666542 1266788899998887765 Q ss_pred c-c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhh-hHHHHHHHHHHHHHH Q lcl|NC_011269. 368 A-D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSAL-NREFVTQIMTGFQNA 443 (867) Q Consensus 368 ~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~~~~~~~ 443 (867) - + ...+|-.-|++++..+..-+...+-.-.++..++|.+++||.-.+|...++++|+++.- ...|+..-++-+..+ T Consensus 245 G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~f~~~tL~P~~~~ 324 (518) T protein:vir:78 245 GSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIAR 324 (518) T ss_pred CcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHHHHHH Confidence 3 3 35667777888888765433222223334566899999999999997677889988543 334555556667777 Q ss_pred HHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhh---hhhhhhhhccccccccchhhhhhhhhh Q lcl|NC_011269. 444 LKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKV---PKLLIPEIKFSTLNLRDEAQERAFIAQ 520 (867) Q Consensus 444 l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~---~k~i~~~i~~~~~~Lr~e~~~~~~v~q 520 (867) |++.+.+.+-...+- .+.++|-+..+.-.|..+.-+-+ +..+... ...+-.. +...-+.+.. .....++ T Consensus 325 ie~eln~~L~~~~~~----~~~~~fd~~~Llr~D~~~r~~~~-~~~~~~G~lT~NE~R~~--~gl~pie~~~-gD~~~v~ 396 (518) T protein:vir:78 325 IQSAMDKYVGQYWVR----KNRMKFDIDDVIQPDWEAKSEST-QKMVNSGVATPNEGREI--MGLPRSDDPK-ADELYAN 396 (518) T ss_pred HHHHHHHhhcccccC----cceEEeechhhhccCHHHHHHHH-HHHHhCCCcCHHHHHHH--hCCCCCCCCC-Cceeeec Confidence 777776665322121 12333333333222322221111 1111110 0000000 0000000000 0000000 Q ss_pred hhhceeeeeccccCCCc--ccccchhhhhh-HHHHHHHHhhcccccccccccccc-------cC-C--CCCcccc-cccc Q lcl|NC_011269. 521 LKGMGVPVSDKTLAVNI--DMKFDQELERQ-ADETVQKLMATAQAMKKVQDLCDA-------QN-L--PYPPELA-QHLQ 586 (867) Q Consensus 521 L~~~~~pitd~t~p~ti--qme~E~e~e~k-~~E~l~tL~~taet~kkvq~~~p~-------~g-~--P~pp~~a-Q~p~ 586 (867) .-..++......... +.+..++...+ ..+......+.....++....... .. . |...+.+ .+.. T Consensus 397 --~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 474 (518) T protein:vir:78 397 --SALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLR 474 (518) T ss_pred --ccceecccccccccCCCCCCCCCCCCcccccccccCccccCCCCCcccccccccccccchhcccCCCCcccccchHHH Confidence 000011100000000 00000000000 000000000000000000000000 00 0 0000000 0000 Q ss_pred ccccCCCCCCCCCCCCCCCCccCCCCccCCCCCCCccCccCcCCC Q lcl|NC_011269. 587 STLALRQGKTQTELGEAQAVAGEAQAELQTKQIEMQEMMMDQQMA 631 (867) Q Consensus 587 ~t~~~a~gpgq~~~~qa~~~agq~~~p~~~~~~~~qp~~~~qg~p 631 (867) .. ...-+.+..-.+-+-..+..-+-....--...+-.....-.. T Consensus 475 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (518) T protein:vir:78 475 AV-KGAMGRGKDIKGFALQLAEKYPDDLEDILLAVQLALAERKDN 518 (518) T ss_pred HH-HHHhhcCCcchhhhhhhhhhcchhHHHHHHHHHHhhhhccCC Confidence 00 000000000000000000000000000000000000000000 No 38 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=98.46 E-value=3e-09 Score=67.35 Aligned_cols=372 Identities=11% Similarity=0.040 Sum_probs=174.6 Q ss_pred eeeccchhhh---hhhhhHHhhCCCchhhhHHHHHHHHHHHHH------hhccchHHHHHHhhhhcccccceecccchhH Q lcl|NC_011269. 80 MQIAMPKIRQ---PLGTLADKGIPFNVEDEEELRVIRHWCRLF------YATHDLVPLLIDIYSKFPVVGMEFDSKDPLI 150 (867) Q Consensus 80 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 150 (867) |.+-.-+... |.... +-.+... +. ..+..|-+-- |-.++.|-.|||+-++ .|..+.|...+... T Consensus 1 Mglf~~~~~~~~~~~~~~-~~~~~~~--~~---~~~~~~~~~~~v~~~~al~~~~V~~~i~~Ia~-~ia~l~~~~~~~~~ 73 (384) T protein:vir:49 1 MPIFNITNLATESPPSNQ-DSFFDIT--DP---EFLDALNGSEWVSAETALKNSDLFSIISQLSN-DLATAKITTSRKQL 73 (384) T ss_pred CccccccccCcccccccc-hhhcccc--ch---hhcccccCCceechhhhhccHHHHHHHHHHHH-HHhhCceeeecchh Confidence 1111000000 00000 0000000 00 0000111111 2357889999998876 23344444444332 Q ss_pred HHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhh Q lcl|NC_011269. 151 KTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDH 230 (867) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (867) +...+. =-+..+-.+|+..++ ..++.-|+.+-+.+.++.+ ...++..|+|+.|.|... .+ T Consensus 74 ~~l~~~--PN~~~t~~~f~~~l~-~~lll~Gna~~~i~r~~~g-~~~~L~~l~~~~v~v~~~---~~------------- 133 (384) T protein:vir:49 74 QGIVDN--PSNNANRFNFYQSIF-AQMLLGGEAFAYRWRNENG-RDMKWEYLRPSQVSFNRL---DN------------- 133 (384) T ss_pred hhhhhc--cCCCCCHHHHHHHHH-HHhhhcCCeEEEEEECCCC-cEEEEEEEcCceeEEEEc---CC------------- Confidence 221111 123455678999866 8899999999999888764 578899999999988621 10 Q ss_pred ccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHH Q lcl|NC_011269. 231 LRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRT 310 (867) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (867) ++. +.-+|-. .-...+....++..-|-|+.+..+--.-.|.+.+.-+..+ T Consensus 134 --------~~~--------------------~~y~~~~--~~~~~~~~~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~ 183 (384) T protein:vir:49 134 --------QNG--------------------LYYNITF--DDPRIPPKQHVPQGDILHFRLLSVDGGLTSVSPLMALGRE 183 (384) T ss_pred --------Cce--------------------EEEEEEe--cCccccceeEecCccEEEecCCCCCCceeeccHHHHHHHH Confidence 000 0011110 1112233345666778899876554445799998888888 Q ss_pred HHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCcc Q lcl|NC_011269. 311 LMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESV 390 (867) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 390 (867) |-...........+.+.-..|--++++-+ . +-+++..++.++.... ..-....+|-.-|++++.+...-+. T Consensus 184 i~~~~~~~~~~~~~~~ng~~~~~il~~~~-------~-~~~~~~~~~~~~~~~~-~~n~~~~~vl~~g~~~~~l~~~~~d 254 (384) T protein:vir:49 184 LNIQKASDKLTLNALKNALNANGILKIKG-------G-GLLDFKTKQSRSRQAM-KQMQGGPLVLDDLEDFTPLEIKSNV 254 (384) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCC-------C-CChHHHHHHHHHHHhc-ccCCccceecCCCceEEEccCChhh Confidence 87766666666666677667777777642 1 2233444555554333 2333456666788999888776666 Q ss_pred CchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhh Q lcl|NC_011269. 391 PNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGV 470 (867) Q Consensus 391 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~ 470 (867) ..+-+-.++..++|.+++||...++. +++...++..-.. |.+. ..|+.+++-.. .+|+..+..++..-. T Consensus 255 ~q~~e~~~~~~~~Ia~~fgVp~~~lg-~~~~~~~~~~~~~----~~~~---~~i~~~l~pi~---~~i~~~l~~~l~~~~ 323 (384) T protein:vir:49 255 AQLLSQADWTTGQFAKVYGIPESVVG-GEGDKQSSLEMIY----NIYF---KAVSRFLRPFV---SELSKKLSCEVDADI 323 (384) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhC-CCCCccccHHHHH----HHHH---HHHHHHHHHHH---HHHHHHhchhhhhhh Confidence 66556678888999999999999997 5665555433222 2221 11222222211 122222222221100 Q ss_pred ccccchhhhhhhhhhhhhHhhhhhhh---hhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCccccc Q lcl|NC_011269. 471 RVPIYREIVEYDEETGQEYIRKVPKL---LIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKF 541 (867) Q Consensus 471 ~~~~~rd~~~~k~e~~k~~~r~~~k~---i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~ 541 (867) ....-++...+++-+.++ ++...+. +........+ +..|..+.... .+..+++...+. T Consensus 324 ~~~~~~~~~~~~~~~~~l-~~~~~~t~~e~~~~l~~~g~-~~ne~r~~~~~-----------~p~~gGd~~~~~ 384 (384) T protein:vir:49 324 LPAVDPTGSNYIGLINSM-VKTGTLAQNQGLYVLQQAEI-LPKDLPEGETD-----------STLKGGETNEQY 384 (384) T ss_pred hhhhhccchHHHHHHHHH-hhcCcccHHHHHHHHhhCCC-CChhHHHHcCC-----------CCCCCCCCCCCC Confidence 000001112222212111 1110000 0000000000 11111111000 011112222222 No 39 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=98.40 E-value=1.4e-08 Score=63.72 Aligned_cols=479 Identities=13% Similarity=0.074 Sum_probs=200.7 Q ss_pred cchhHHHHHHHHhcccccccceeeccch-------hhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHH Q lcl|NC_011269. 59 AAEANRQRLASYRKQGNFGSNMQIAMPK-------IRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLID 131 (867) Q Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 131 (867) -|-+|=| ++..|. |+-..+.-...+.+.++. +--|+ .-|..++.|-.||| T Consensus 1 ~~~~~~~---------------~~~~p~~~e~~~~~~~~~~~~~~~~~~~~~~-------~~~~~-~~a~~~~~V~acV~ 57 (518) T protein:vir:10 1 MLLANGQ---------------TLSAPAMAELSPQMQDSYYYAPAVGMQLERQ-------FSLYG-GIYKNQPWVRTVIA 57 (518) T ss_pred CcccCce---------------eecCchhhhhhhhhhcccccccccceecccc-------cchhh-HHHhhhHHHHHHHH Confidence 1111111 122232 122222222223333211 11133 34668899999999 Q ss_pred hhhh----cccccceec------ccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehhee Q lcl|NC_011269. 132 IYSK----FPVVGMEFD------SKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEI 201 (867) Q Consensus 132 ~~~~----~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 201 (867) +.++ .|+.=++.+ .+|+.+..+... - .+..+-.+|+..++ ..++.-|..+-+...|.. |.+.++.+ T Consensus 58 ~IA~~iA~lpl~l~~~~~~~~~~~~~~~~~~Ll~~-P-N~~~t~~~F~~~lv-~~lll~Gnay~~i~r~~~-G~~~~L~~ 133 (518) T protein:vir:10 58 KRAQALARLPVKCMFTSGDTETEESDTGYAKLLAD-P-CEYLDPFAFWEWVA-STLDIYGETYLAIQKNKS-GTPEKLMP 133 (518) T ss_pred HHHHhhccCceEEEEEcCCCceeccchHHHHHHcC-C-CCCCCHHHHHHHHH-HHHhhcCCeEEEEEECCC-CcEEEEEE Confidence 8765 222111111 122222222221 1 12344568888866 788889999999888776 55778999 Q ss_pred cCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcc Q lcl|NC_011269. 202 LNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDI 281 (867) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 281 (867) |+|+.|.|... .+ ++.+.. .|+ + ..-..+.-+.+ T Consensus 134 l~p~~v~v~~~---~~---------------------~~~~~y-------------~~~-----~----~~~~~~~~~~~ 167 (518) T protein:vir:10 134 MHPSRVAIKRN---SR---------------------TGRYEY-------------YFQ-----A----GAGVGTQLVSF 167 (518) T ss_pred ECCCceEEEEc---CC---------------------CCEEEE-------------EEE-----e----cCCccceEEEe Confidence 99999988731 10 000000 000 0 00011222345 Q ss_pred cHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHH Q lcl|NC_011269. 282 SEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDD 361 (867) Q Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (867) +..-|-|+.+-...--..|.+.+--+.++|.......+....+.+.-..|=-++++.. .-+++..+.+|+. T Consensus 168 ~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~---------~ls~e~~~~~k~~ 238 (518) T protein:vir:10 168 ADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK---------RLSEAAQQRLREQ 238 (518) T ss_pred cCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCC---------CCCHHHHHHHHHH Confidence 5566778876554333479999988899998888888888888888887866666652 2367888999998 Q ss_pred HHHhhhc-c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHHHH Q lcl|NC_011269. 362 MQSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQIM 437 (867) Q Consensus 362 ~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~ 437 (867) |+..+.- + -..+|-..|++++.++..-+-.-+-.-.++..++|.+++||.-.+|...++++|+++. ....|++.-+ T Consensus 239 ~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL 318 (518) T protein:vir:10 239 FDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTM 318 (518) T ss_pred HHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHH Confidence 8877753 4 3567777888888887443222222333566789999999999999767888999854 3444555556 Q ss_pred HHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhh---hhhhhhhhccccccccchhhh Q lcl|NC_011269. 438 TGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKV---PKLLIPEIKFSTLNLRDEAQE 514 (867) Q Consensus 438 ~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~---~k~i~~~i~~~~~~Lr~e~~~ 514 (867) .-+..+|++.+.+.+-...+- .+.++|-+..+.-.|..+.-+-..+ .+... ...+ |..+...-+.+.. . T Consensus 319 ~P~l~~ie~~ln~~L~~~~~~----~~~~~fd~~~llr~D~~~r~~~~~~-~~~~G~lT~NE~--R~~~Gl~pie~~~-g 390 (518) T protein:vir:10 319 AIPIARIQSAMDKYVGQYWVR----KNRMKFDIDDVIQPDWEAKSESTQK-MVNSGVATPNEG--REIMGLPRSDDPK-A 390 (518) T ss_pred HHHHHHHHHHHHHhhcccccC----CceEEEechhhhccCHHHHHHHHHH-HHhCCCcCHHHH--HHHhCCCCCCCCC-C Confidence 677777777776665222111 2234443333333333222111111 11110 0000 0000000010000 0 Q ss_pred hhhhhhhhhceeeeeccccCCCc--ccccchhhhhh-HHHHHHHHhhcccccccccccccccCCCCCccccccccccccC Q lcl|NC_011269. 515 RAFIAQLKGMGVPVSDKTLAVNI--DMKFDQELERQ-ADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLAL 591 (867) Q Consensus 515 ~~~v~qL~~~~~pitd~t~p~ti--qme~E~e~e~k-~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~ 591 (867) ....++ .-..++......... +.+..++...+ ..+............++.............++. + T Consensus 391 D~~~~~--~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~------ 459 (518) T protein:vir:10 391 DELYAN--SALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRR---L------ 459 (518) T ss_pred Ceeeec--ccceecccccccccCCCCCCCCCCCCccccccccccccccCCCCCcccccccccccccchhc---c------ Confidence 000000 000011100000000 00000000000 000000000000000000000000000000000 0 Q ss_pred CCCCCCCCCCCC--CCCccCCC----CccCCCCC--CCccCc----cCcCCCCCCCCCCCC Q lcl|NC_011269. 592 RQGKTQTELGEA--QAVAGEAQ----AELQTKQI--EMQEMM----MDQQMAGGVMPGQPM 640 (867) Q Consensus 592 a~gpgq~~~~qa--~~~agq~~----~p~~~~~~--~~qp~~----~~qg~pG~~gPpGP~ 640 (867) ..-+........ +...+.-+ -.+-..+. ...... ..-+.+-. -.--. T Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 518 (518) T protein:vir:10 460 MQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQLAEKYPDDLEDILLAVQLALA--ERKDN 518 (518) T ss_pred ccCCCcccccchHHHHHHHHhhcCccchhHhhhhhhhcchhHHHHHHHHHHhhh--hccCC Confidence 000000000000 00000000 00000000 000000 00000000 00000 No 40 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=98.40 E-value=3.8e-07 Score=55.79 Aligned_cols=498 Identities=14% Similarity=0.101 Sum_probs=206.5 Q ss_pred hHHHHHHHHhcCCCCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhh Q lcl|NC_011269. 13 SAEVNRLRKAGVNMPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLG 92 (867) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 92 (867) -.-..||||++.+.++--.--+ ...-.-+.++.+....+ -|.+-.++.. |..+..+ -|+. .+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~-~k~~~~~~~~----~~~~~~~--~~~~-~~g~~---- 65 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIE---VDDNYSIAIQQREQEQI-SKAMNNKEVA----YSQPVIG--SMSA-NPGFK---- 65 (547) T ss_pred CchhhhhhhhcCCccccccccc---cccccchhhhhhhHHHH-HHhhcccchh----hhchhhh--eeec-ccccc---- Confidence 3345789999876553211000 00001122222211111 1111111111 1111111 0000 01111 Q ss_pred hhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh----cc--------cccceecccc---------h--- Q lcl|NC_011269. 93 TLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK----FP--------VVGMEFDSKD---------P--- 148 (867) Q Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~--------~~~~~~~~~~---------~--- 148 (867) .+.++.+.++. +++.+ -|+..|+|-.||+++.. |. .+++++..+| . T Consensus 66 ---~~~~~~~~~~l---~~l~~----~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~ 135 (547) T protein:vir:63 66 ---TKPSIRNNQDL---HGVLK----KFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATI 135 (547) T ss_pred ---cCCccCChhHH---HHHHH----HhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHH Confidence 23344444443 33332 36678999999988864 32 2233443332 1 Q ss_pred -hHHHHHHHHhhcc---cccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHH Q lcl|NC_011269. 149 -LIKTFYEDLFFGE---DLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMV 224 (867) Q Consensus 149 -~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (867) .|.+|.+...+-- +-.-.+|+.-++ ..|+.-|.++=....|.. |..-++..|+|+.|+|... .+-++- T Consensus 136 ~~l~~~l~~pn~~~~p~~~s~~~f~~~lv-~d~ll~Gn~~~~i~rd~~-G~~~~L~~l~p~~V~~~~~---~~g~~~--- 207 (547) T protein:vir:63 136 KRIESFIEKTGVDNDINRDSFSSFVKKIV-RDTYMYDQVNFEKVFNRN-QSMVRFVAKDPTTIFFATT---ADGKIP--- 207 (547) T ss_pred HHHHHHHHhhCCCCCCccchHHHHHHHHH-HHHHhhCCEEEEEEECCC-CcEEEEEEecCceeEEEEC---Cccccc--- Confidence 2334444333200 012346777766 788888998877777765 5567899999999987611 000000 Q ss_pred HHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCc---cccccCc Q lcl|NC_011269. 225 KDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPT---AWATRGA 301 (867) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~ 301 (867) + +...| .+ ....+....++..-|-|++..+. .....|. T Consensus 208 --------------------------~---~~~~y----~~------~~~~~~~~~~~~~eiih~r~n~~~~~~~~~~G~ 248 (547) T protein:vir:63 208 --------------------------D---NGNRF----VQ------VIDQKIVATFNAREMAFAVRNPRSDIYATGYGY 248 (547) T ss_pred --------------------------c---CceEE----EE------EcCCcEEEEeccccEEEecccCCCCcccccccc Confidence 0 00000 00 11122223566666778764332 2244699 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhh-cch---hhhhhhh Q lcl|NC_011269. 302 PHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLA-ADF---RLMVHNF 377 (867) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~---~~~~~~~ 377 (867) |.|.-+..+|.......+......+.-..|--++++-+ +. ..+++.++.+|+.|+..+. +++ .+||..- T Consensus 249 Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~----~~---~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~~~ 321 (547) T protein:vir:63 249 PELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKA----AQ---QQSQHALEIFKREWKNSLSGINGSWQIPVVSAE 321 (547) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecC----CC---CCCHHHHHHHHHHHHHHhcCcccccccccccCC Confidence 98888888887665555555555555555655555531 11 2588899999998887654 232 2566667 Q ss_pred heeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhc----------CCCccceehhhh-hHHHHHHHHHHHHHHHHH Q lcl|NC_011269. 378 GLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALIS----------GGTGGAYASSAL-NREFVTQIMTGFQNALKR 446 (867) Q Consensus 378 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~g~~~~~~~~~~-~~~~~~~~~~~~~~~l~~ 446 (867) |++++.+...-...-+..-.++..++|.+++||.-.+|. ++.+.+|+++.- ...|+.+-+.-+..+|++ T Consensus 322 g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~ 401 (547) T protein:vir:63 322 DVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIED 401 (547) T ss_pred CceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHH Confidence 888888764433333334456677899999999988883 234456666543 334555567777777777 Q ss_pred HHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhcee Q lcl|NC_011269. 447 HIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGV 526 (867) Q Consensus 447 ~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~ 526 (867) .|.+.+-. +. ...+.+-|..++..+..+. .++.+..... -+.++.-+...+.--.++ -+. T Consensus 402 ~ln~~L~~--~~----~~~~~~~f~~~~~~~~~~~-~~~~~~~~~g------------~lT~NE~R~~~gl~P~~e-gGD 461 (547) T protein:vir:63 402 FINKHIVA--EF----GDKYTFQFVGGDIKSELES-VKILAEKAKV------------AMTVNEVRKELNLPGDVI-GGD 461 (547) T ss_pred HHHhhccc--cc----CCceEEEeeccccccHHHH-HHHHHHHhCC------------CcCHHHHHHHhCCCCCCC-CCc Confidence 77766521 11 1234444555544432221 1121211111 111111111111000000 000 Q ss_pred eeeccccCCCcccccchh---hhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccccccCCCCCCCCCCCCC Q lcl|NC_011269. 527 PVSDKTLAVNIDMKFDQE---LERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLALRQGKTQTELGEA 603 (867) Q Consensus 527 pitd~t~p~tiqme~E~e---~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~a~gpgq~~~~qa 603 (867) .+..+. .........+. ...+..+....+ .+...+.... +.... +..+ .... ..+.......+. T Consensus 462 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~-~~~~~---~~~~----~~~~-~~~~d~~~~~~~ 528 (547) T protein:vir:63 462 IPLNGV-IVQRIGQLMQQEQFEHEKQQSNLQML---QEQTGNRVST-DVEDI---PDGK----DTTG-DIGKDGQRKDKD 528 (547) T ss_pred eeeccc-ccccccccccccCCccccchhhcccc---ccccCCCCCC-CCCCC---CCCc----ccCC-CcCccccccCcc Confidence 000000 00000000000 000000000000 0000000000 00000 0000 0000 000000000000 Q ss_pred CCCccCCCCccCCCCCCCccCccCcCCCCCCCCCCCCCcccccccc Q lcl|NC_011269. 604 QAVAGEAQAELQTKQIEMQEMMMDQQMAGGVMPGQPMLPPGAPGDP 649 (867) Q Consensus 604 ~~~agq~~~p~~~~~~~~qp~~~~qg~pG~~gPpGP~gPpG~pG~p 649 (867) ...+ +.++.-|-..--.- . T Consensus 529 ~~~~------------------~~~~~~~~~~~~~~---------~ 547 (547) T protein:vir:63 529 NANA------------------GKQGMKGDKPNDWQ---------T 547 (547) T ss_pred ccch------------------hhhhcCCCCccccC---------C Confidence 0000 00000000000000 0 No 41 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=98.39 E-value=1e-08 Score=64.33 Aligned_cols=397 Identities=11% Similarity=0.089 Sum_probs=181.8 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhccccccccee-eccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhc Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQ-IAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYAT 122 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 122 (867) |+ =+.|++.+..+.. +....+.++-..+.+.. +...++-.. + .+ .=|-+ T Consensus 1 m~-----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~--v---~~-~~a~~ 50 (412) T protein:vir:26 1 MN-----------------------VIAKENIVTRIKKKLIDNWIDQSTSKLYDFS-PWKNRSFWG--V---IN-NTLET 50 (412) T ss_pred Cc-----------------------cchhhhhhhhhhhhHhhhhhccccccccccc-ccCCccccc--c---ch-hhhhc Confidence 11 0001010000000 00011111111111100 000000000 0 01 11336 Q ss_pred cchHHHHHHhhhh-cccccceecccchhHHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceeh Q lcl|NC_011269. 123 HDLVPLLIDIYSK-FPVVGMEFDSKDPLIKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSS 198 (867) Q Consensus 123 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (867) ++.|-.|||+-++ ..-..|+.--+++..+....+++. .+..+-.+|+..++ ..++.-|+++-+..-|..+ -..+ T Consensus 51 ~~~v~~~i~~ia~~iA~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gnay~~i~r~~~G-~~~~ 128 (412) T protein:vir:26 51 NETIFSAITKLSNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIE-TIRNEKGNAYVLIERDIYH-QPSK 128 (412) T ss_pred cHHHHHHHHHHHHhHhhCceeEeeccccccchHHHHHHhhcccCCCHHHHHHHHH-HHHhhcCceEEEEEECCCC-cEEE Confidence 6778888887653 122223332233334443344433 12355678988877 8888899988877766654 4778 Q ss_pred heecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCC Q lcl|NC_011269. 199 EEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDG 278 (867) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (867) +..|+|+.|.|... .+ ++.+. |+.+ ...|.. T Consensus 129 L~~l~~~~v~v~~~---~~---------------------~~~~~-------------------y~~~------~~~g~~ 159 (412) T protein:vir:26 129 LFLLNPDVVEMLIE---NQ---------------------SRELY-------------------YSIH------AATGNK 159 (412) T ss_pred EEEEcCceeEEEEe---CC---------------------CcEEE-------------------EEEE------cCCceE Confidence 99999999987621 10 00000 0100 112334 Q ss_pred CcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhc-hhhhhhhcccccCCCCcCCCCHHHHHH Q lcl|NC_011269. 279 LDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYS-PLVLATLGIEDMGDGEPWIPDQGELDE 357 (867) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 357 (867) +.++..-|-|+.+-.+.=.-.|.+.+.-+-++|-..... ++.-.+.... +-.+.+.+. --+.+..+. T Consensus 160 ~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~---~~~~~~~~~~~~~~i~~~~~---------~l~~e~~~~ 227 (412) T protein:vir:26 160 LIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAV---RTFNLTEMQKPDSFMLKYGS---------NVGKEKRQQ 227 (412) T ss_pred EEEccccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHH---HHHHHHhcCCCCceEEecCC---------CCCHHHHHH Confidence 456777888997654433456888776665655543332 2222222222 222233331 247788999 Q ss_pred HHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhh-hHHHHHHH Q lcl|NC_011269. 358 VRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSAL-NREFVTQI 436 (867) Q Consensus 358 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~ 436 (867) +|++|+..+...-..+|-.-|++++.....-+...+-.-.+...++|.+++||...++.++++++|+++.- ...|+..- T Consensus 228 ~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~~~ 307 (412) T protein:vir:26 228 VLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHT 307 (412) T ss_pred HHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHH Confidence 99999988876667788888888887754432222222233456889999999999999888889999754 44566666 Q ss_pred HHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhh------hhhhh--hhhhccccccc Q lcl|NC_011269. 437 MTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRK------VPKLL--IPEIKFSTLNL 508 (867) Q Consensus 437 ~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~------~~k~i--~~~i~~~~~~L 508 (867) ++-+...|++++.+.+=...|... .+.++|-+..+.--|..+.-+ .++..+.. +...+ ++.+.+.... T Consensus 308 l~P~~~~ie~~ln~kLl~~~~~~~--~~~~~fd~~~l~~~d~~~~~~-~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~- 383 (412) T protein:vir:26 308 LLPIVKQYEEEFNRKLLTKTDREK--NRYFKFNVKSYLRADSATQAE-VYFKAVRSGYYTINDIREWEDLPPVEGGDKP- 383 (412) T ss_pred HHHHHHHHHHHHHhhcCCcccccC--cceEEeechhhhccCHHHHHH-HHHHHHhCCCcCHHHHHHHhCCCCCCCcCee- Confidence 777777777777776633333221 112332222221112111111 11111111 00000 1122211110 Q ss_pred cchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHH Q lcl|NC_011269. 509 RDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADET 552 (867) Q Consensus 509 r~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~ 552 (867) +..+. ..++.... .....++-..+... |. T Consensus 384 ---------~~~~n--~~~~~~~~-~~~~~~~gG~~n~~---e~ 412 (412) T protein:vir:26 384 ---------LISGD--LYPIDTPL-ELRKSLKGGDKNVN---ES 412 (412) T ss_pred ---------eeccc--ccccccch-hhcccccCCCCCcC---CC Confidence 11100 00110000 00000000000000 00 No 42 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=98.38 E-value=6.9e-08 Score=59.86 Aligned_cols=401 Identities=12% Similarity=0.107 Sum_probs=190.1 Q ss_pred Hhccccccccee-eccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh-cccccceecccc Q lcl|NC_011269. 70 YRKQGNFGSNMQ-IAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK-FPVVGMEFDSKD 147 (867) Q Consensus 70 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 147 (867) |+|.+-+..=.. +-.--+.++...+.+ +..|..+.-.- .-+.=|-.++.|-.|||+-++ ..-..++.--++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~----v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~ 73 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQSTSKLYD---FSPWKNRSFWG----VINNTLETNETIFSAITKLSNSMASLPLKMYEDY 73 (409) T ss_pred CCccchhhhhhhhhhhhhhccccccccc---cccccCccccc----cchhhhhccHHHHHHHHHHHHhhhhCceeEeecc Confidence 766654332111 111112223222221 11111111100 011225578889999997653 122223333344 Q ss_pred hhHHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHH Q lcl|NC_011269. 148 PLIKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMV 224 (867) Q Consensus 148 ~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (867) ..++.-..+++. -+..+-.+|+..++ ..++.-|+.+-+.+-|.. |...++..|+|+.|.|... .+ T Consensus 74 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gnay~~i~r~~~-G~~~~L~~l~~~~v~~~~~---~~------- 141 (409) T protein:vir:93 74 KVVNTEVSDLLTVSPNNSLSSFDFINQIE-TIRNEKGNAYVLIERDIY-HQPSKLFLLNPDVVEMLIE---NQ------- 141 (409) T ss_pred ccccchHHHHHhhhcccCCCHHHHHHHHH-HHHhhcCceEEEEEECCC-CcEEEEEEEcCceeEEEEe---CC------- Confidence 444444444433 23455678888866 788888998888777654 5567899999999987521 10 Q ss_pred HHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchh Q lcl|NC_011269. 225 KDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHL 304 (867) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 304 (867) ++.+ | |+.+ ...|..+.++..-|-|+.+-.+--.-.|.+.+ T Consensus 142 --------------~~~~----------------~---y~~~------~~~g~~~~~~~~eVih~r~~~~~~~~~G~s~i 182 (409) T protein:vir:93 142 --------------SREL----------------Y---YSIH------AATGNKLIVHNMDMLHFKHIVASNMVQGISPI 182 (409) T ss_pred --------------CcEE----------------E---EEEE------cCCceEEEEccccEEEeCCCCCCCccccccHH Confidence 0000 0 1110 11244456777888899865443344688876 Q ss_pred hHHHHHHHHHHHHHHHHHHHHhhhhch-hhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeee Q lcl|NC_011269. 305 LRSFRTLMAEESLNAAQDAVADRLYSP-LVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVEN 383 (867) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 383 (867) .-+-++|-.... +++........+ -.|.+.+. .-+.++.+.+|++|+..+......+|-.-|++++. T Consensus 183 ~~~~~~i~~~~~---~~~~~~~~~~~~~~~i~~~~~---------~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~ 250 (409) T protein:vir:93 183 DVLKNTTDFDNA---VRTFNLTEMQKPDSFMLKYGS---------NVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEP 250 (409) T ss_pred HHHHHHHHHHHH---HHHHHHHhcCCCCceEEecCC---------CCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEE Confidence 655555443332 222222222222 22222221 23788999999999988877777888888999888 Q ss_pred ccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhccc Q lcl|NC_011269. 384 VFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHY 462 (867) Q Consensus 384 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~ 462 (867) ....-+...+-+-.+..+++|.+++||...++.++++++|+++. ....|++.-++-+...|++.+.+.+-...+... T Consensus 251 l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~-- 328 (409) T protein:vir:93 251 LPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREK-- 328 (409) T ss_pred cCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccC-- Confidence 76443333333333456788999999999999988889999974 344566666777777777777765533333211 Q ss_pred chheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhc-eeeeeccccCCCccccc Q lcl|NC_011269. 463 DYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGM-GVPVSDKTLAVNIDMKF 541 (867) Q Consensus 463 d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~-~~pitd~t~p~tiqme~ 541 (867) .+.++|-+..+.--| .+++. +. ..+... . +-+.++.-+...+ ...++.- ...++.... ++ T Consensus 329 ~~~~~fd~~~ll~~d---~~~~~-~~-~~~~~~-----~--G~~T~NE~R~~~g-~~p~~ggD~~~~~~n~~------~~ 389 (409) T protein:vir:93 329 NRYFKFNVKSYLRAD---SATQA-EV-YFKAVR-----S--GYYTINDIREWED-LPPVEGGDKPLISGDLY------PI 389 (409) T ss_pred cceEEeechhhhccC---HHHHH-HH-HHHHHh-----C--CCcCHHHHHHHhC-CCCCCCcCeeeeccccc------cc Confidence 112232222211111 22211 11 111100 0 0011111111100 0000000 000000000 01 Q ss_pred chhhhhhHHHHHHHHhhcccccccccccccccC Q lcl|NC_011269. 542 DQELERQADETVQKLMATAQAMKKVQDLCDAQN 574 (867) Q Consensus 542 E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g 574 (867) +...+.+ ..+ +.-+... ..+ T Consensus 390 ~~~~~~~-----~~~-------~gG~~n~-~e~ 409 (409) T protein:vir:93 390 DTPLELR-----KSL-------KGGDKNV-NES 409 (409) T ss_pred ccchhhc-----ccc-------cCCCCCc-CCC Confidence 0000000 000 0000000 000 No 43 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=98.38 E-value=1.1e-07 Score=58.64 Aligned_cols=390 Identities=11% Similarity=0.066 Sum_probs=180.4 Q ss_pred cccCCchHHH---HHHHHhhhcchhHHHHHHHHhcccccccceeec--cchhhhhhhhhHHhhCCCchhhhHHHHHHHHH Q lcl|NC_011269. 41 QNTVDNKPLI---DYFQGRRRAAEANRQRLASYRKQGNFGSNMQIA--MPKIRQPLGTLADKGIPFNVEDEEELRVIRHW 115 (867) Q Consensus 41 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 115 (867) -.+||.+--+ +.|...++-.+.|.... .. .++..+. .+.++ .+... +... T Consensus 1 ~~~~~~~~~~~~m~~F~~~~~~~~~~~~~~---~~----~~~~~~~~~~~~~~------~~~~~--~~~~---------- 55 (413) T protein:vir:96 1 MPGVSEIRKDKNLKFFNNKRSPTEESKAKD---EI----PKAPQVVMTLPNFF------KELIS--DGYT---------- 55 (413) T ss_pred CCccchhhhhhcCCccccCCCcchhhhhhc---cc----cccccccccchhhH------hhhcc--chhH---------- Confidence 1122222211 22222222222221110 00 0111110 11110 00000 0000 Q ss_pred HHHHhhccchHHHHHHhhhh-cccccceecccc----hhHHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchh Q lcl|NC_011269. 116 CRLFYATHDLVPLLIDIYSK-FPVVGMEFDSKD----PLIKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLA 187 (867) Q Consensus 116 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~----~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (867) -+.+++.|-.|||+.+. +.-..|++..++ ..++....+++. .+.++-.+|+...+ ..++.-|+.+-+. T Consensus 56 ---~~~~~~~v~~cI~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~-~~lll~Gn~~~~i 131 (413) T protein:vir:96 56 ---KLSDSPEVRMAVDCIADLVSNMTIQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLV-RSMLLEGNGNAVV 131 (413) T ss_pred ---HHhhchHHHHHHHHHHHhhccCceEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHH-HHHhhcCCeEEEE Confidence 02457899999998763 333334432222 123333333332 34566789999977 8999999999999 Q ss_pred hhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhch Q lcl|NC_011269. 188 HFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYP 267 (867) Q Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 267 (867) ..+.+++...++.+|+|+.|.|.. ..+. +. | .. T Consensus 132 ~r~~~g~~~~~L~~l~~~~v~~~~---~~~~-~~-------------------------------------y----~~-- 164 (413) T protein:vir:96 132 KPQVSGDKIIGLTPISPYKVTFNV---SDDD-LD-------------------------------------Y----SI-- 164 (413) T ss_pred EEcCCCCceEEEEEecCceeEEEE---cCCe-EE-------------------------------------E----EE-- Confidence 999988877789999999998862 1111 10 0 00 Q ss_pred HHHhhhccCCCCcccHHHHHHhhhcCccccc-cCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCC Q lcl|NC_011269. 268 EIIQAAMQNDGLDISEALISRVVNRPTAWAT-RGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGE 346 (867) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (867) ...+-.++.+-|-|++........ .|.+.+.-+..+|-......+....+.++-..|--++++.+ T Consensus 165 -------~~~~~~~~~~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~------- 230 (413) T protein:vir:96 165 -------TFDNKEYDPSTLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDS------- 230 (413) T ss_pred -------eecCcEEchhhEEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC------- Confidence 001112334557788755443332 59999999999999998899989999999888887888753 Q ss_pred cCCCCHHHHHHHHHHHHHhhhc-ch---hhhhhhhheeeeeccccCccCch-hHHH----HHHHHHHHHhhccchhhhcC Q lcl|NC_011269. 347 PWIPDQGELDEVRDDMQSLLAA-DF---RLMVHNFGLKVENVFGRESVPNL-DADY----DRIERKLLQAWGIGEALISG 417 (867) Q Consensus 347 ~~~~~~~~~~~~~~~~~~~~~~-~~---~~~~~~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~~~~~~~~~~~~~~~ 417 (867) --+++..+++|+.|+..+.. ++ .++|-.-+.++..+. -++. |.++ +...++|.+++||...++.+ T Consensus 231 --~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~----~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~ 304 (413) T protein:vir:96 231 --DSDELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIK----PLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGV 304 (413) T ss_pred --CCCHHHHHHHHHHHHHHhcCccccCceeeecCCcccccccc----cCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCC Confidence 13677889999988877643 32 233333333322211 1233 3333 34457899999999999965 Q ss_pred CCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhh Q lcl|NC_011269. 418 GTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLL 497 (867) Q Consensus 418 g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i 497 (867) |++ +.+-...|+..-++-+...|++.+.+.+- -. +..++|-+..+...|..+.-+-+.+. +.. T Consensus 305 ~~~----~~~~~~~~~~~~l~P~~~~ie~~ln~~ll-----~~--~~~~~fd~~~ll~~d~~~~~~~~~~~-~~~----- 367 (413) T protein:vir:96 305 GTY----NKDEFNNFINTKIMSIAQVIQQTYNKLIV-----EE--DMYFSLNPRSLYNYSLTEMVSAGAQM-TQL----- 367 (413) T ss_pred Ccc----hHHHHHHHHHHHHHHHHHHHHHHHHHhhC-----CC--CcEEEEechhhhccCHHHHHHHHHHH-HhC----- Confidence 543 12222234444444444444444443331 00 22333322222222221111111111 111 Q ss_pred hhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHH Q lcl|NC_011269. 498 IPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADET 552 (867) Q Consensus 498 ~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~ 552 (867) +-+..+.-+...+ ...++. +..+.-+..-..+++-.+ +...+..++ T Consensus 368 ------G~~t~NE~R~~~g-~~p~~~-gd~~~~~~n~~~~~~~~~-~~~~~~~dt 413 (413) T protein:vir:96 368 ------NALRRNEFRNWVG-MPPDAE-MDDLLVLENYLQQKDLVN-QKKLIQDET 413 (413) T ss_pred ------CCcCHHHHHHHhC-CCCCCC-cceeeecccccchhhccc-ccCCCCCCC Confidence 1011111010000 000000 000000000000000000 000000001 No 44 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=98.36 E-value=1.6e-08 Score=63.31 Aligned_cols=382 Identities=11% Similarity=0.072 Sum_probs=185.1 Q ss_pred HHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHH Q lcl|NC_011269. 50 IDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLL 129 (867) Q Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 129 (867) -.-|...++. ++- ...+.+ +.+.+..+-+. . .+.....+ .++. |-+++.|-.| T Consensus 1 M~~f~~~~~~----~~~--~~~~~~---~~~~~~~~~~~-----------~-~~~~~~~v-----~~~~-~~~~~~v~~~ 53 (386) T protein:vir:48 1 MPIFNITNLA----TES--PPISQG---GFFDITDPDFL-----------S-TLNGSEWV-----SAES-ALRNSDLFSI 53 (386) T ss_pred Cccccccccc----ccc--cccccc---cccccccchhc-----------c-cccCCcee-----chhh-hhcchHHHHH Confidence 0000000000 000 000000 00000000000 0 00000000 1111 2367888899 Q ss_pred HHhhhhcccccceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeeh Q lcl|NC_011269. 130 IDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRV 209 (867) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 209 (867) |++.+. -|..+.|...|.....+.+. ..+.++-.+|+..++ ..++.-|+++-+..-+. .|.+.++..|+|+.|+| T Consensus 54 i~~ia~-~ia~~p~~~~~~~~~~l~~~--pN~~~t~~~f~~~~~-~~lll~Gna~~~i~r~~-~g~~~~L~~l~~~~v~v 128 (386) T protein:vir:48 54 INQLSN-DLATVKLTASRKQLQGIIDN--PSNNANRFNFYQSIF-AQMLLGGEAFAYRWRNE-NGRDMKWEYLRPSQVSF 128 (386) T ss_pred HHHHHH-hhccCceeeccchhHHHhhc--CCCCCCHHHHHHHHH-HHhhhcCcEEEEEEECC-CCcEEEEEEecCceeEE Confidence 998876 45556666666665544332 234677789999877 89999999998887775 45688999999999998 Q ss_pred hhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHh Q lcl|NC_011269. 210 SRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRV 289 (867) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (867) .+-- +|..+ -| +|- +.....+....++..-|-|+ T Consensus 129 ~~~~------------------------~~~~~---------------~y-----~~~--~~~~~~~~~~~~~~~evih~ 162 (386) T protein:vir:48 129 NRLD------------------------NKDGI---------------YY-----NIT--FDDPRIPPKQHVPQGDVLHF 162 (386) T ss_pred EEcC------------------------CCceE---------------EE-----EEE--ecCccccceeEecCccEEEe Confidence 7321 00000 00 000 00001112223455567788 Q ss_pred hhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcc Q lcl|NC_011269. 290 VNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAAD 369 (867) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 369 (867) .+-.+.-.-.|.+.+..+.++|-.......+...+.+.-..|--++++.+ .-+.++.+.+++++....... T Consensus 163 ~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~---------~~~~e~~~~~~~~~~~~~~n~ 233 (386) T protein:vir:48 163 KLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKG---------GGLLDFKTKLSRSRQAMKQMQ 233 (386) T ss_pred cCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC---------CCCHHHHHHHHHHHHHhhcCC Confidence 76544333469999888888887666666666666666666776776653 124456677888777776666 Q ss_pred hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehh-hhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011269. 370 FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASS-ALNREFVTQIMTGFQNALKRHI 448 (867) Q Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~ 448 (867) ...+|-.-|++++.++..-+...+-+-.++..++|.+++||...++. +. ++|+++ .-..+|+.--+.-+...|++.+ T Consensus 234 g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg-~~-~~~~~~e~~~~~~~~~~l~P~~~~ie~~l 311 (386) T protein:vir:48 234 GGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVG-GQ-GDQQSSLEMSLDLYNKAVSRYLRPFLSEL 311 (386) T ss_pred CCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhC-CC-CCcccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 67788888999988875544333334457778999999999999995 43 345543 2334444444444444455544 Q ss_pred hhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhh---hhhhhhhhhccccccccchhhhhhhhhhhhhce Q lcl|NC_011269. 449 RRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRK---VPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMG 525 (867) Q Consensus 449 r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~---~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~ 525 (867) .+.+ -+++.+-+....-+|-..++..+.++ .+. ....+... ......+..+..... . T Consensus 312 ~~~l----------~~~~~~~~~~~~~~d~~~~~~~~~~l-~~~g~~t~nE~r~~-lg~~~~~~~~~~~~~---~----- 371 (386) T protein:vir:48 312 SQKL----------SCDVDADILPAVDPTGSNSVSRINSM-VKSGTLAQNQGLYI-LQQAEILPKELPEGE---N----- 371 (386) T ss_pred HHhh----------cchhhcchhhhhccChHHHHHHHHHH-HhCCCcCHHHHHHH-hhcCCCCCccchhhc---C----- Confidence 4443 11221111111112222222222111 110 00000000 000010111111110 0 Q ss_pred eeeeccccCCCcccccchh Q lcl|NC_011269. 526 VPVSDKTLAVNIDMKFDQE 544 (867) Q Consensus 526 ~pitd~t~p~tiqme~E~e 544 (867) ....+..+++..- - + T Consensus 372 -~~~~~~~gGd~~~-~--~ 386 (386) T protein:vir:48 372 -PNKTTLKGGEING-E--D 386 (386) T ss_pred -CCCCccCCCCCCC-C--C Confidence 0011111222111 1 1 No 45 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.33 E-value=6.3e-07 Score=54.57 Aligned_cols=502 Identities=13% Similarity=0.071 Sum_probs=207.4 Q ss_pred ccchhHHHHHHHHhcCCCCCCchhhHHhhhhhcccCCchHH-HHHHHHhhhcchhHHHHHHHHhcccccccceeeccchh Q lcl|NC_011269. 9 GSNWSAEVNRLRKAGVNMPNSPTMARAQAAALQNTVDNKPL-IDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKI 87 (867) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (867) -.|--+-..|||+.++|+-+--...+- . +-.-+.++.+ -+-+ +. .+++...-+ ++..++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~~~~~~~~-----------~k--~~~~~~~a~--~~~~~~~~ 62 (551) T protein:vir:80 1 MKNKLGLFESIRLVGVNKSDAVKHIEV--D-DNYSIAIQQREQEQI-----------SK--AMNNKEVAY--SQPVIGSM 62 (551) T ss_pred CchhhhhHHHhhhccCChhhccccccc--c-cceeeecccccHHHH-----------HH--hhccCccee--ecccccce Confidence 223345567888877765432111100 0 0000111111 1111 11 122111111 11111111 Q ss_pred hhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh----ccc--------ccceecccchh------ Q lcl|NC_011269. 88 RQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK----FPV--------VGMEFDSKDPL------ 149 (867) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~--------~~~~~~~~~~~------ 149 (867) ...=+. -.+.++.+..+. +++. .-|+..|+|-.||++... |+. +++.+..+|.. T Consensus 63 ~~~~~~-~~r~~~~~~~~l---~~~~----~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~ 134 (551) T protein:vir:80 63 SANPGF-KTKPSIRNNQDL---HGVL----KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSH 134 (551) T ss_pred ecCccc-ccCccccChhHH---HHHH----HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChh Confidence 111110 022333333332 2221 146778999999998864 221 23444444422 Q ss_pred -------HHHHHHHHhhcc---cccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchH Q lcl|NC_011269. 150 -------IKTFYEDLFFGE---DLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRER 219 (867) Q Consensus 150 -------~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (867) |.+|.+...+-- +-...+||.-++ ..++.-|.++-+...|+. |...++..|+|+.|+|...- +-. T Consensus 135 ~~~~~~~i~~~l~~pn~~~~p~~~s~~~f~~~lv-~dlll~Gnay~~i~rd~~-G~~~~L~~l~p~~V~v~~~~---~g~ 209 (551) T protein:vir:80 135 DEATIKRIESFIEKTGVDNDINRDSFSSFVKKIV-RDTYMYDQVNFEKVFNRN-QSMVRFVAKDPTTIFFATTA---DGK 209 (551) T ss_pred HHHHHHHHHHHHHhcCCCCCCccchHHHHHHHHH-HHHHhcCCEEEEEEECCC-CcEEEEEEeCCceeEEEECC---ccc Confidence 233333322200 012346777755 788888988877777764 66888999999999875210 000 Q ss_pred HHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCc---cc Q lcl|NC_011269. 220 VQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPT---AW 296 (867) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~ 296 (867) +. + +..- +. ....++....++..-|-|++..+. .+ T Consensus 210 ~~-----------------------------~---~~~~----y~------~~~~g~~~~~~~~~eiiH~~~n~~~~~~~ 247 (551) T protein:vir:80 210 IP-----------------------------D---NGNR----FV------QVIDQKIVATFNAREMAFAVRNPRSDIYA 247 (551) T ss_pred cc-----------------------------c---CceE----EE------EEeCCcEEEEEcccceEEecccCCCCccc Confidence 00 0 0000 00 111122233556666778764332 22 Q ss_pred cccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhh-cch--h-h Q lcl|NC_011269. 297 ATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLA-ADF--R-L 372 (867) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~--~-~ 372 (867) ...|.+.+.-+..+|.......+......+.-..|--|+++-+ +. ..+++.++.+|+.|+..+. +++ + + T Consensus 248 ~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~----~~---~lt~e~~~~lk~~~~~~~~G~~nag~~~ 320 (551) T protein:vir:80 248 TGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKA----AQ---QQSQHALEIFKREWKNSLSGINGSWQIP 320 (551) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcC----CC---CCCHHHHHHHHHHHHHHhcCccccCccc Confidence 4469998888888887665555555555555566766666642 11 2488999999998887653 232 2 4 Q ss_pred hhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhc----------CCCccceehhhhh-HHHHHHHHHHHH Q lcl|NC_011269. 373 MVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALIS----------GGTGGAYASSALN-REFVTQIMTGFQ 441 (867) Q Consensus 373 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~g~~~~~~~~~~~-~~~~~~~~~~~~ 441 (867) ||-.-|++++.....-...-+-.-.++..++|.+++||.-.+|. .+.+.+|+++.-- ..|+.+-++-+. T Consensus 321 vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~ 400 (551) T protein:vir:80 321 VVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLL 400 (551) T ss_pred cccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHH Confidence 55456788877754433333334456677899999999988873 2444567766433 356666688888 Q ss_pred HHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhh Q lcl|NC_011269. 442 NALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQL 521 (867) Q Consensus 442 ~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL 521 (867) .+|++.+.+.+-.. + ...+.+-|..++..+..+.. ++++.... +-+.++.-+...+.--.. T Consensus 401 ~~ie~~ln~~L~~~-----~-~~~~~f~f~~~~~~~~~~~~-~~~~~~~~------------g~lT~NE~R~~~gl~P~~ 461 (551) T protein:vir:80 401 GFIEDFINKHIVAE-----F-GDKYTFQFVGGDIKSELESV-KILAEKAK------------VAMTVNEVRKELNLPGDV 461 (551) T ss_pred HHHHHHHHhhhccc-----c-CCceEEEeeccChhhHHHHH-HHHHHHhc------------CCcCHHHHHHHhCCCCCC Confidence 88888887766321 1 12344445444433321111 11111111 112111111111100000 Q ss_pred hhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccc--cccCCCCCCCCC Q lcl|NC_011269. 522 KGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQS--TLALRQGKTQTE 599 (867) Q Consensus 522 ~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~--t~~~a~gpgq~~ 599 (867) + -+..+..+.. ....+...++...........+. .+ ....+.+..++..+.+.. +.... +..... T Consensus 462 e-gGD~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-------~~---~~~~~~~~~~~~~~~p~~~~~~~~~-~~~~~~ 528 (551) T protein:vir:80 462 I-GGDIPLNGVI-VQRIGQLMQQEQFEHEKQQSNLQ-------ML---QEQTGNRVSTDVEDIPDGKDTTGDI-GKDGQR 528 (551) T ss_pred C-CCceeecccc-cccccccccccCcchhhhhhccc-------cc---cCcCCCCCCCCCCCCCCccccCCCc-cccccc Confidence 0 0000100000 00000000000000000000000 00 000000000000000000 00000 000000 Q ss_pred CCCCCCCccCCCCccCCCCCCCccCccCcCCCCCCCCCCCC Q lcl|NC_011269. 600 LGEAQAVAGEAQAELQTKQIEMQEMMMDQQMAGGVMPGQPM 640 (867) Q Consensus 600 ~~qa~~~agq~~~p~~~~~~~~qp~~~~qg~pG~~gPpGP~ 640 (867) ..+....+ +.++.-|...--.-. T Consensus 529 ~~~~~~~~------------------~~~~~~~~~~~~~~~ 551 (551) T protein:vir:80 529 KDKDNANA------------------GKQGMKGDKPNDWQT 551 (551) T ss_pred cCccccch------------------hhhhcCCCCccccCC Confidence 00000000 000000000000000 No 46 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.33 E-value=1e-07 Score=58.90 Aligned_cols=476 Identities=10% Similarity=0.021 Sum_probs=196.4 Q ss_pred ccceeeccchhhhh--hhhhH-Hhh-----------CCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccc-c Q lcl|NC_011269. 77 GSNMQIAMPKIRQP--LGTLA-DKG-----------IPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVG-M 141 (867) Q Consensus 77 ~~~~~~~~~~~~~~--~~~~~-~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 141 (867) |-|--|..+++-.+ ++.-. .+. -|+|.+ ..|+ .|...++|-.||++.++.-... + T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pp~~~~---------~La~-~~~~n~~v~scI~~ia~~ia~~~~ 70 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGDTDSQALKEDRFEEYVEPKVHPL---------VLLS-LLQVNPYHASACSIKANDILRTGY 70 (540) T ss_pred CCCcccChhhccchhhhhccccccccccCCCCccccCCCCHH---------HHHH-HHHhcHHHHHHHHHHHHHHhcCCc Confidence 22222222222221 11110 011 133322 2233 4556688999999999886555 8 Q ss_pred eecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHH Q lcl|NC_011269. 142 EFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQ 221 (867) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (867) ++..+|..+++|- -.+..+-.+|+..++ ..|+.-|.++-+...|.. |...++..|+|++|+|.+. ....++ T Consensus 71 ~i~~~~~~~~~~l----pN~~~t~~~f~~~~v-~dlll~Gnayv~i~r~~~-G~~~~L~~i~~~~V~v~~~---~~~~~~ 141 (540) T protein:vir:41 71 LIDGDDGGVEELL----RACRPSFEFILLQAL-EDLQVFNYCTLEVVRDDQ-GEPVRLDYIPAHTVRVHRD---GSRYMQ 141 (540) T ss_pred eEecCccchhhhc----cCCCCCHHHHHHHHH-HHHHhcCCeEEEEEECCC-CcEEEEEEeCCcceEEeEc---CceeEe Confidence 8889998887752 345667788888866 889999999998888874 6788899999999998631 111111 Q ss_pred HHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCc Q lcl|NC_011269. 222 LMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGA 301 (867) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (867) ..+|.... |-..+ .|...+..........++..-|-|+++..+--.-.|. T Consensus 142 --------------~~d~~~~~---------------~~~~~-~~~~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~ 191 (540) T protein:vir:41 142 --------------TWDGIHVT---------------YFKDY-RYEGEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGV 191 (540) T ss_pred --------------eecCceee---------------eeecc-cccceeeccccccceeecccceEEecCCCCCCCcccc Confidence 00111000 00001 1222223333444455677778899776655566899 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch--------h-h Q lcl|NC_011269. 302 PHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF--------R-L 372 (867) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~-~ 372 (867) |.+.-+.++|.............-+.-..|=-++++-+. -..+. ..+++.-..+|+.+++.+..-| . | T Consensus 192 Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~--l~~e~-~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~ 268 (540) T protein:vir:41 192 PRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGE--FEDEM-ELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPL 268 (540) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcc--cCchh-ccchHHHHHHHHHHHHHHHHHhccccccccceE Confidence 999999998887765554444444444445444443210 11111 2234444445555554443322 1 2 Q ss_pred hhh-----hhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCc--cceehhh-hhHHHHHHHHHHHHHHH Q lcl|NC_011269. 373 MVH-----NFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTG--GAYASSA-LNREFVTQIMTGFQNAL 444 (867) Q Consensus 373 ~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~--~~~~~~~-~~~~~~~~~~~~~~~~l 444 (867) +++ .-|++++.....-+...+-.-.++..++|.+++||.-.+|.-.++ .+|+++. ....|+.+-+.-++.+| T Consensus 269 vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~i 348 (540) T protein:vir:41 269 VFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQEIV 348 (540) T ss_pred EEecCCCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHHHH Confidence 222 235555555433222222233455667899999999999842333 4566653 44556666677788888 Q ss_pred HHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhh-----hhhhhhhhhccc-cccccchhhhhhhh Q lcl|NC_011269. 445 KRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRK-----VPKLLIPEIKFS-TLNLRDEAQERAFI 518 (867) Q Consensus 445 ~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~-----~~k~i~~~i~~~-~~~Lr~e~~~~~~v 518 (867) ++.+.+.+.. +. ..++.|+|-...+.-.| .++.+.+.-... +....++.+... +.. + T Consensus 349 e~~ln~~L~~--~~--~~~~~i~f~~~~ll~~D---~~~~~~~lv~~G~lT~NE~Re~L~g~e~gdd~~----------l 411 (540) T protein:vir:41 349 SSVLTDFIQL--KL--DPGARFVFNEEILMESE---FVHNYALLVQCGVLTPSEVREKLFGLDGGPDMF----------M 411 (540) T ss_pred HHHHHHhhhh--cc--CCceEEEecchhhcchH---HHHHHHHHHhCCCCCHHHHHHHhCcCcCCCccc----------c Confidence 8877776532 11 11222333322222222 222222211000 100000000000 000 0 Q ss_pred hhhhhceeeeeccccCCCcccccchhhhhhH----HHH-HHHHhhcccccccccccccccCCCCCccccccccccccCCC Q lcl|NC_011269. 519 AQLKGMGVPVSDKTLAVNIDMKFDQELERQA----DET-VQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLALRQ 593 (867) Q Consensus 519 ~qL~~~~~pitd~t~p~tiqme~E~e~e~k~----~E~-l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~a~ 593 (867) ..+ ......+....+.+.+. .++ ..+.....+.........+..+.-.. .......+.+. T Consensus 412 ~p~-----------n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~ 476 (540) T protein:vir:41 412 VPS-----------SIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKKKID----EVLSDFRAEAY 476 (540) T ss_pred ccc-----------ccccccccccccccCCCCccccccccchhcccccCcccccccccccccccc----ccccccCCccc Confidence 000 00000000000000000 000 00000000000000000000000000 00000000000 Q ss_pred CCCCCCCC---CCCCCccC----CCCccCCCCCCCccCccCcC--------------CCCCCCC Q lcl|NC_011269. 594 GKTQTELG---EAQAVAGE----AQAELQTKQIEMQEMMMDQQ--------------MAGGVMP 636 (867) Q Consensus 594 gpgq~~~~---qa~~~agq----~~~p~~~~~~~~qp~~~~qg--------------~pG~~gP 636 (867) ..+..... ..+....- .-.+..+.....-....... .-|-.-- T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 540 (540) T protein:vir:41 477 ENGKKMLSIAGDMGTMSAINRGVSMIPPKPSNLEAYEDLLAASVDDIVERIRHYLYKVIGWREL 540 (540) T ss_pred cchhHHHHHhhhhhhhhhhhcCceecCCCCcchHHHHHHHHhhHHHHHHHHHHHHHHHhhhccC Confidence 00000000 00000000 00000000000000000000 0000000 No 47 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.32 E-value=1.4e-06 Score=52.68 Aligned_cols=520 Identities=13% Similarity=0.075 Sum_probs=203.9 Q ss_pred cccchhHHHHHHHHhcCCCCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHH----Hhcccccccceeec Q lcl|NC_011269. 8 AGSNWSAEVNRLRKAGVNMPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLAS----YRKQGNFGSNMQIA 83 (867) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~ 83 (867) .-....+-..|+|+ |++ |+.+.+--++.+ ++-.++.++.+ +.|+-+..- |.-. T Consensus 1 ~~~~~~~~~~~~~~-~~~--------------~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~-~a~~ 57 (576) T protein:vir:96 1 MVTRLADIFKRLRL-GRD--------------YEDIIDTVPIDD-------GLQANIRNIEEKSKELNKSLYGKQ-QAYA 57 (576) T ss_pred ChhhHHHHHHHHhc-cCc--------------cccchhhhhccc-------ChhHHHHHhhhhhhhhccccCCcc-chhh Confidence 11122333445554 332 222222222221 22222222211 111111110 1111 Q ss_pred cchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhh----hccc--------ccceecccch--- Q lcl|NC_011269. 84 MPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYS----KFPV--------VGMEFDSKDP--- 148 (867) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~--------~~~~~~~~~~--- 148 (867) -|-+...-+...-..-|+..+ -+..++.+.+-|. ..|+|-.||++.+ .|+. +++.+.-+|. T Consensus 58 ~p~~~~~~~~~~~~~~p~~~~---~~~~~~~~l~~~~-~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~ 133 (576) T protein:vir:96 58 EPFLEVMDTNPEFRTKRSYMK---NSDNLHDVLKQFG-NNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAE 133 (576) T ss_pred cceeeeeecCCCccccCcchh---hhhhhHHHHHHhh-cCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCc Confidence 111111001101111122211 1223444544444 4699999999865 3332 2333322222 Q ss_pred ----h------HHHHHHHHhhcccc---cHHHHhHHHHHHHHhhhhhhcchhhhhhh-ccceehheecCcceeehhhhhh Q lcl|NC_011269. 149 ----L------IKTFYEDLFFGEDL---NYLEFLPDQFAREYFTVGEVTSLAHFNES-LGVWSSEEILNPDMLRVSRSMF 214 (867) Q Consensus 149 ----~------~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 214 (867) - +..|.+++.+-... +..+|+.-.+ ..++.-|.++-+..++.. .|.+-++..|+|+.|+|... T Consensus 134 ~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv-~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~-- 210 (576) T protein:vir:96 134 PGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIV-RDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATD-- 210 (576) T ss_pred cchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHH-HHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEEC-- Confidence 2 22344443332222 3467888866 788888999888888765 46788999999999998621 Q ss_pred hcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCccc-HHHHHHhhhcC Q lcl|NC_011269. 215 VQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDIS-EALISRVVNRP 293 (867) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 293 (867) .+..+- .++ .+ +.++ ...+....++ +++|-|+.+-. T Consensus 211 -~dg~~~----------~~~---------------------~~-----~~~~------~~~~~~~~~~~~dii~~~~~~~ 247 (576) T protein:vir:96 211 -KNGKII----------KGG---------------------KR-----FVQV------INKKVVASFTSREMAMGIRNPR 247 (576) T ss_pred -CCCcee----------eee---------------------eE-----EEEe------cCCceEEEecccceEEEeecCC Confidence 110000 000 00 0000 1111112222 23344444432 Q ss_pred c--cccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhc-c- Q lcl|NC_011269. 294 T--AWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-D- 369 (867) Q Consensus 294 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~- 369 (867) + -....|.|.|.-+.++|.......+......+.-.+|--|+++- ++. -.+.+..+.+|+.|+..+.- + T Consensus 248 ~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~----~~~---~ls~e~~~~lr~~~~~~~~G~~n 320 (576) T protein:vir:96 248 TELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIK----SEQ---QQSQRALENFKREWKSSFSGING 320 (576) T ss_pred CCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC----CCC---CCCHHHHHHHHHHHHHHhccccc Confidence 2 12456999888888888776666666666666666776666664 222 24788899999999987752 3 Q ss_pred -hh-hhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcC-----------CCccceehhhh-hHHHHHH Q lcl|NC_011269. 370 -FR-LMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISG-----------GTGGAYASSAL-NREFVTQ 435 (867) Q Consensus 370 -~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------g~~~~~~~~~~-~~~~~~~ 435 (867) .. .+|..-|++++.....-+...+-+-.++..++|.+++||.-.+|.- |.+.+|+++.- ...|+.+ T Consensus 321 ag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~ 400 (576) T protein:vir:96 321 SWQVPVVMADDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNK 400 (576) T ss_pred cccceeecCCCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHH Confidence 33 5788899999998766555555566678889999999999999931 23346666533 3345555 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhh-hhh--hhhhccccccccchh Q lcl|NC_011269. 436 IMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVP-KLL--IPEIKFSTLNLRDEA 512 (867) Q Consensus 436 ~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~-k~i--~~~i~~~~~~Lr~e~ 512 (867) -+.-+..+|++.|.+.+-.. ++ ..+.+-|.-.+ ++.+.....+-+.. .-| ..++.-. +.| . T Consensus 401 tL~P~~~~ie~~ln~~Ll~~-----~~-~~~~~~f~r~d------~~~~~e~~~~~~~~~~G~lT~NE~R~~-~gl---~ 464 (576) T protein:vir:96 401 GLQPLLRFIEDLINTHIISE-----YS-DKYVFQFVGGD------TKSELDKIKILQEEVKTYKTVNEARKE-KGL---K 464 (576) T ss_pred HHHHHHHHHHHHHHhhhchh-----cc-CceEEEeccCC------HHHHHHHHHHHHHHhcCccCHHHHHHH-hCC---C Confidence 56667777777776544211 11 11222222111 11111110000000 000 0111000 001 0 Q ss_pred hhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCC-CCCccccccccccccC Q lcl|NC_011269. 513 QERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNL-PYPPELAQHLQSTLAL 591 (867) Q Consensus 513 ~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~-P~pp~~aQ~p~~t~~~ 591 (867) +.++.-+-+ .++.-........ +...+.+ ..++....+.. ..+ ...+..+. ....+...... .. T Consensus 465 piegGD~~~----~~~~~~~~~~~~~-~~~~e~~-~~~~~~~~~~~---~~~---~~~~~~~~~~s~~~~~~g~~---~~ 529 (576) T protein:vir:96 465 PIEGGDVLL----DGSFIQSMSLNTQ-KEQYEDT-KQKERFDMIQQ---FLN---SPDDEEPQQESTEDKVDGRE---SN 529 (576) T ss_pred CCCCcceec----ccccccccccccc-CCCCCCc-ccccccccccc---ccC---CCCCCCCCCCCCCCcccccc---cc Confidence 100000000 0000000000000 0000000 00000111110 000 00000000 00000000000 00 Q ss_pred CCCCCCCCCCCCCCCccCCCCccCCCCCCCccCccCcCCCCCCCCCCCCCc Q lcl|NC_011269. 592 RQGKTQTELGEAQAVAGEAQAELQTKQIEMQEMMMDQQMAGGVMPGQPMLP 642 (867) Q Consensus 592 a~gpgq~~~~qa~~~agq~~~p~~~~~~~~qp~~~~qg~pG~~gPpGP~gP 642 (867) ..+......+.-+......-.-.+.+....++..+ -|---|.--.-- T Consensus 530 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~ 576 (576) T protein:vir:96 530 DPTKIDSPVGTDGQLKDQDNVKSQEGSNKGQGTKG----KGNEKPSDFKNN 576 (576) T ss_pred cCCCCCCccccccccCCCCcccccccccccccccc----cCCCCcccccCC Confidence 00000000000000000000000000000000000 000000000000 No 48 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=98.29 E-value=4.3e-07 Score=55.48 Aligned_cols=457 Identities=11% Similarity=0.083 Sum_probs=200.4 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhcc Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATH 123 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 123 (867) |++.-+..... .+.++.... +.+.+++.- |. +.+|- .+. -.+++..|.++ .. T Consensus 1 v~~~~l~~e~a--------t~~~~~d~~--~~~~~~l~~--~~-~~il~-~a~------------~g~~~~y~~l~--~D 52 (488) T protein:vir:99 1 MEKPALGREIA--------TSGDGRDIT--RPFISGLQV--PN-DSILQ-RRG------------GNDLRVYEEIL--SD 52 (488) T ss_pred CCccchhHHHH--------HHHhhhhhh--ccccCCCCC--CC-hHHHH-hhc------------cCCHHHHHHHh--hC Confidence 44443333322 222221221 223333322 22 22221 111 11233344443 35 Q ss_pred chHHHHHHhhhhccccc--ceecc-----cchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhh--hhhhcc Q lcl|NC_011269. 124 DLVPLLIDIYSKFPVVG--MEFDS-----KDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAH--FNESLG 194 (867) Q Consensus 124 ~~~~~~~~~~~~~~~~~--~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 194 (867) +-|..||+- ++-+|.+ ++|.. .|..+.+|..+++. +++..++|.++.- ..-.| +|+.. +...+| T Consensus 53 ~~i~s~l~~-rk~av~~~~w~i~p~~~~~~~~~~ae~v~~~l~--~~~~~~~l~~~ld--a~~~G--~s~~Ei~w~~~~g 125 (488) T protein:vir:99 53 AQVKTVWGQ-RQLAVVSREWKVEAGGDRPIDQAAAEHLEQQLQ--RVGWDRVTSKMLF--GVFYG--YAVSELIYGRDDR 125 (488) T ss_pred hHHHHHHHH-HHHHHhcCCceEEcCCCChHHHHHHHHHHHHHh--CCCHHHHHHHHHh--hhhhc--ceeEEEEEeecCC Confidence 668888875 4455655 34432 33456678888776 7888888888652 11122 22211 233344 Q ss_pred cee--hheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhh Q lcl|NC_011269. 195 VWS--SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQA 272 (867) Q Consensus 195 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 272 (867) .|. .++..+|..+. |..+.+..+ +.++ T Consensus 126 ~~~~~~l~~r~~~~f~-----~d~~~~l~~------------------------------------------~~~~---- 154 (488) T protein:vir:99 126 YITLEAIKVRNRRRFR-----YDQDGGLRL------------------------------------------LTPN---- 154 (488) T ss_pred eeeEeeeeeeccccee-----ecCCCceEE------------------------------------------eccC---- Confidence 442 34444443322 212211110 0000 Q ss_pred hccCCCCcccH--HHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCC Q lcl|NC_011269. 273 AMQNDGLDISE--ALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIP 350 (867) Q Consensus 273 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 350 (867) ...+|++|+. ..|.| .|+++.=.++|..++..||...+.|+..-+.-...+.|+-.|+|+.|... .-. T Consensus 155 -~~~~g~~lp~~~~~i~~-~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~--------~~a 224 (488) T protein:vir:99 155 -NMFEGEPCPAPYFWHFS-TGADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDD--------KTA 224 (488) T ss_pred -CCCCccccccCceEEEE-eecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCC--------CCC Confidence 1234666643 45555 34555557889999999999999999999999999999999999999762 113 Q ss_pred CHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCc-hhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhh Q lcl|NC_011269. 351 DQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPN-LDADYDRIERKLLQAWGIGEALISGGTGGAYASSALN 429 (867) Q Consensus 351 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 429 (867) +++|.+.+-+.+..+ ..|-.++ .--+-+||.+...+.--. -+.=++...++|..++ +|+-|+|.+.|+.|+.+.|- T Consensus 225 ~~~ek~~l~~av~~~-~~~~~~v-iP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~i-LGqtlts~~~~Gs~a~~~vh 301 (488) T protein:vir:99 225 TPEDKAKLLAALHAI-QTDSAII-MPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVG-LGQVASTQGTPGRLGNDDLQ 301 (488) T ss_pred CHHHHHHHHHHHHHH-hcCcEEE-ecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHH-hhhhhcccccccchhhHHHH Confidence 556665555543332 3333322 223455666654333222 2344566778887776 88999987778899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhh-hhHHHHHhhcccc--hheehhhcccc-chhhhhhhhhhhhh-Hhhhhhhhhhhhhccc Q lcl|NC_011269. 430 REFVTQIMTGFQNALKRHIRR-RCEVVAEAQGHYD--YDLKGGVRVPI-YREIVEYDEETGQE-YIRKVPKLLIPEIKFS 504 (867) Q Consensus 430 ~~~~~~~~~~~~~~l~~~~r~-~~~~i~e~q~~~d--~~~~~~~~~~~-~rd~~~~k~e~~k~-~~r~~~k~i~~~i~~~ 504 (867) .+....+.-...+.|..++++ .++++..+|.... +.+.+...... -....+..+++.+. ..+....-+..+.... T Consensus 302 ~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~~~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip 381 (488) T protein:vir:99 302 ADVRLDLVKADADLICESFNLGPARWLTEWNFPGAQPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGVE 381 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCCcCCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCC Confidence 998888888889999999965 7799999985322 12222222111 11112222222221 1111111111111111 Q ss_pred cccccchhhhhhhhhhhhhceeeeeccccCCCc-cc---ccchhhhhhHHHH---HHHHhhcccccccccccccccCCCC Q lcl|NC_011269. 505 TLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNI-DM---KFDQELERQADET---VQKLMATAQAMKKVQDLCDAQNLPY 577 (867) Q Consensus 505 ~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~ti-qm---e~E~e~e~k~~E~---l~tL~~taet~kkvq~~~p~~g~P~ 577 (867) .. ...+. +... .....-......... +. .++.+.+...... +..+...+.+...+...-...-... T Consensus 382 ~~-----~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~~ 454 (488) T protein:vir:99 382 VE-----STQAE-ATAP-TPSTEFAEGDQPSDPAAAMAPQLAEAMQPVVGNWTTQLRTLIEQASSLEDLRERLLDLAPQL 454 (488) T ss_pred Cc-----ccccc-cccC-CCcccCCCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhccC Confidence 11 00000 0000 000000000000000 00 0000111111111 1112222222221111100000000 Q ss_pred Cc-cccccccccc--cCCCCCC--CCCCCCCCCC Q lcl|NC_011269. 578 PP-ELAQHLQSTL--ALRQGKT--QTELGEAQAV 606 (867) Q Consensus 578 pp-~~aQ~p~~t~--~~a~gpg--q~~~~qa~~~ 606 (867) +. .+........ +...|.. .......... T Consensus 455 d~~~l~~~l~~a~~~a~l~G~~~~~~e~~~~~~~ 488 (488) T protein:vir:99 455 SLDQYAQAMAEGLEAAHLAGRNDVQEELDGREQI 488 (488) T ss_pred CHHHHHHHHHHHHHHHHHhhhhhHhhhhcccCCC Confidence 00 1111100000 0000000 0000000000 No 49 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=98.27 E-value=2.1e-07 Score=57.18 Aligned_cols=469 Identities=9% Similarity=0.060 Sum_probs=197.1 Q ss_pred HHHHHHHhhhcchhHHHHHHHHhcccccccce-eeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhc---cc Q lcl|NC_011269. 49 LIDYFQGRRRAAEANRQRLASYRKQGNFGSNM-QIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYAT---HD 124 (867) Q Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~ 124 (867) |-..+-+ .| .++.+ ..+.++-. ..+ ++-.+--.||...+--.-+-.-+++. +--++++-|.+||.- .+ T Consensus 1 m~~~~d~--~g--~p~~~-~~~~~~~~--~~~~~~~~~~~~~~~~gltp~~l~~iL~~a-~~gd~~~~~~L~~dm~~~D~ 72 (512) T protein:vir:19 1 MGRILDI--SG--QPFDF-DDEMQSRS--DELAMVMKRTQEHPSSGVTPNRAAQMLRDA-ERGDLTAQADLAFDMEEKDT 72 (512) T ss_pred CcceeCC--CC--Ccccc-cccccccc--chhcccchhhccccccCCCHHHHHHHHHHh-hCCCHHHHHHHHHHHHhhCh Confidence 1000000 00 01110 01110000 000 00001111222221100000001111 112467778888853 55 Q ss_pred hHHHHHHhhhhcccccceecc--------cchhHHHHHHHHhhcccc-cHHHHhHHHHHHHHhhhhhhcchhh--hhhhc Q lcl|NC_011269. 125 LVPLLIDIYSKFPVVGMEFDS--------KDPLIKTFYEDLFFGEDL-NYLEFLPDQFAREYFTVGEVTSLAH--FNESL 193 (867) Q Consensus 125 ~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 193 (867) -|..||+- ++-+|.+.++.. +|..+.+|..+++. ++ ++..+|.|+.- - ++=-+++.. +...+ T Consensus 73 hi~s~l~~-Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~--~~~~f~~~~~~lld--A--~~~G~s~~Ei~w~~~~ 145 (512) T protein:vir:19 73 HLFSELSK-RRLAIQALEWRIAPARDASAQEKKDADMLNEYLH--DAAWFEDALFDAGD--A--ILKGYSMQEIEWGWLG 145 (512) T ss_pred HHHHHHHH-HHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHh--cCCCHHHHHHHHHh--h--hhhcceeeeeEeeeeC Confidence 67777765 456777744322 12355667777776 33 57777777651 1 111122211 22233 Q ss_pred ccee--hheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHh Q lcl|NC_011269. 194 GVWS--SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQ 271 (867) Q Consensus 194 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 271 (867) |.|. .++..+|..+... .+.+..| | .+ T Consensus 146 g~~~~~~~~~r~~~~f~~~-----~~~~~~l----------------------------------r-----~~------- 174 (512) T protein:vir:19 146 KMRVPVALHHRDPALFCAN-----PDNLNEL----------------------------------R-----LR------- 174 (512) T ss_pred Cceeeeeeeeeccccceec-----cCCCcEE----------------------------------E-----ec------- Confidence 3333 3344444332211 1111110 0 00 Q ss_pred hhccCCCCcccHH-HHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCC Q lcl|NC_011269. 272 AAMQNDGLDISEA-LISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIP 350 (867) Q Consensus 272 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 350 (867) -...+|++|+.. .|.| .|++..=.+.|..++..||..-+.|...-+--...+.|+-.|+|+.|.+. +. T Consensus 175 -~~~~~G~~l~~~k~i~~-~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~---~a------ 243 (512) T protein:vir:19 175 -DASYHGLELQPFGWFMH-RAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPT---GS------ 243 (512) T ss_pred -CCCCCceeecCCceEEE-eccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCC---CC------ Confidence 012356777654 3444 34555556789999999999999999999999999999999999999873 22 Q ss_pred CHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCc-hhHHHHHHHHHHHHhhccchhhhcC-CCccceehhhh Q lcl|NC_011269. 351 DQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPN-LDADYDRIERKLLQAWGIGEALISG-GTGGAYASSAL 428 (867) Q Consensus 351 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~-g~~~~~~~~~~ 428 (867) +++|.+.+.+-+.++ ..|-..++ --|-.+|.+.+...--. -+.=++...++|..++ +|+-|+|+ |.|++|+.+.| T Consensus 244 ~~~ek~~L~~al~~~-~~~a~~ii-P~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~i-LGqtlTs~~g~~Gs~a~~~v 320 (512) T protein:vir:19 244 TNREKATLMQAVMDI-GRRAGGII-PMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAI-LGGTLTTEAGDKGARSLGEV 320 (512) T ss_pred CHHHHHHHHHHHHHH-hhCcEEEe-cCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhhhhcccccccchhhHHHH Confidence 345666666544443 33433222 23455666554432222 2344566778888887 89999997 67788999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHh-hhhHHHHHhhcccchh----eehhhccccchhh---hhhhhh------hhhhHhhhhh Q lcl|NC_011269. 429 NREFVTQIMTGFQNALKRHIR-RRCEVVAEAQGHYDYD----LKGGVRVPIYREI---VEYDEE------TGQEYIRKVP 494 (867) Q Consensus 429 ~~~~~~~~~~~~~~~l~~~~r-~~~~~i~e~q~~~d~~----~~~~~~~~~~rd~---~~~k~e------~~k~~~r~~~ 494 (867) -.|......-.....|..++. +.++++.++|...... -.+.|....+-|+ .+..++ +-+.++++.. T Consensus 321 h~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~e~eDl~~~a~~~~~l~~G~~i~~~~i~e~~ 400 (512) T protein:vir:19 321 HDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTSEAGDITALSDAIPKLAAGMRIPVSWIQEKL 400 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCChhhHHHHHHHHHHHhcCCCCCHHHHHHHh Confidence 998888888889999999996 5779999999754421 1222222222221 111111 1112222221 Q ss_pred hhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccch------hhhhhHHHHHHHHhhc--ccccccc Q lcl|NC_011269. 495 KLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQ------ELERQADETVQKLMAT--AQAMKKV 566 (867) Q Consensus 495 k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~------e~e~k~~E~l~tL~~t--aet~kkv 566 (867) ..-.++- .+..+... +....-.. ...........+. +..++. ..+......++.+... +.+...+ T Consensus 401 Gip~~~~--~e~~~~~~-~~~~~~~~--~~~~~~~~~~~~~--~~~~d~~~~~~~~~~~~~~~~~~~i~~~~~~~s~ee~ 473 (512) T protein:vir:19 401 HIPQPVG--DEAVFTIQ-PVVPDNGS--QKEAALSAEDIPQ--EDDIDRMGVSPEDWQRSVDPLLKPVIFSVLKDGPEAA 473 (512) T ss_pred CCCCCCC--ccccccCC-Cccccccc--cccccccccCCCc--hhhHhHHhhhHHHHHHHHHHHHHHHHHHHHhCCHHHH Confidence 1100000 00000000 00000000 0000000000000 000110 0011111111111110 1111111 Q ss_pred cccccccCCCCC-ccccccccccccCCCCCCCCCCCCCC Q lcl|NC_011269. 567 QDLCDAQNLPYP-PELAQHLQSTLALRQGKTQTELGEAQ 604 (867) Q Consensus 567 q~~~p~~g~P~p-p~~aQ~p~~t~~~a~gpgq~~~~qa~ 604 (867) ...-...-.... ..+..........+...+......-. T Consensus 474 ~~~L~~l~~~ld~~~l~~~l~~a~~~A~l~G~~~~~~e~ 512 (512) T protein:vir:19 474 MNKAASLYPQMDDAELIDMLTRAIFVADIWGRLDAAADH 512 (512) T ss_pred HHHHHHHhccCCHHHHHHHHHHHHHHHHHhhhhhhhccC Confidence 110000000000 11111111000000000000000000 No 50 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=98.25 E-value=2.6e-07 Score=56.72 Aligned_cols=436 Identities=13% Similarity=0.061 Sum_probs=206.3 Q ss_pred hHHHHHHHHhhhcchhH-HHHHHHHhccc---ccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhc Q lcl|NC_011269. 47 KPLIDYFQGRRRAAEAN-RQRLASYRKQG---NFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYAT 122 (867) Q Consensus 47 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 122 (867) =-|.+.++....+.+++ .++++++.... ..+.....+.|.+...++.-....-|.+-.-..+ .-|.. T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~---------~~a~~ 71 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLAT---------QAYQA 71 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccch---------hhhhc Confidence 22566676666655544 22333322110 1112233455555555443222211222111110 22445 Q ss_pred cchHHHHHHhhhhc-ccccceeccc-chh---HHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhh-- Q lcl|NC_011269. 123 HDLVPLLIDIYSKF-PVVGMEFDSK-DPL---IKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNES-- 192 (867) Q Consensus 123 ~~~~~~~~~~~~~~-~~~~~~~~~~-~~~---~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 192 (867) .+.|-.||++.++= .-..+++.-+ |.. ++.-..+.++ -+..+-.+|+..++ ..++.-|+.+-+..-|+- T Consensus 72 ~~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~-~~lll~Gnay~~i~r~~~g~ 150 (466) T protein:vir:81 72 NGPVFACMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMI-QDADLAGNSYWTIVDGEFVR 150 (466) T ss_pred cHHHHHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHH-HHHHhcCCeEEEEEecCccc Confidence 67788888887631 1111222111 111 1111122233 12345667888866 788888988777665543 Q ss_pred -----ccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhch Q lcl|NC_011269. 193 -----LGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYP 267 (867) Q Consensus 193 -----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 267 (867) .|.+.++..|+|+.|.|... .+....+ .| .| T Consensus 151 l~~~~~g~~~~l~~l~~~~v~~~~~---~~~~~~~-----------------------------------~y-----~~- 186 (466) T protein:vir:81 151 MRPDWVDVVVEERMVRGGRGELGGG---QLGWRKV-----------------------------------GY-----LY- 186 (466) T ss_pred cccccCcceeEEEEecCcceEEEEc---CCCceEE-----------------------------------EE-----EE- Confidence 36678899999999988632 1111000 00 00 Q ss_pred HHHhhhccCCCCcccHHHHHHhhhcCcccc-ccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCC Q lcl|NC_011269. 268 EIIQAAMQNDGLDISEALISRVVNRPTAWA-TRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGE 346 (867) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (867) .+......++.+.++.+-|-||++-...+. -.|.+.+..+.++|.......+....+.+.-..|=-++++.. T Consensus 187 ~~~~~~~~~~~~~~~~~dviHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~------- 259 (466) T protein:vir:81 187 TEGGRQSGNESVGFLAEDVVHFAPIPDPLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNP------- 259 (466) T ss_pred EecCcccccceeeeccccEEEEcCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC------- Confidence 000011123445567777889876543443 369999999999887766666666666666666655566652 Q ss_pred cCCCCHHHHHHHHHHHHHhhh-cc--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhc---CCCc Q lcl|NC_011269. 347 PWIPDQGELDEVRDDMQSLLA-AD--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALIS---GGTG 420 (867) Q Consensus 347 ~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~g~~ 420 (867) .-+.++.+.+|+.|+..+. ++ ...+|-.-|++++-+...-+..-+-+-.++..++|.+++||...+|. ++.. T Consensus 260 --~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~ 337 (466) T protein:vir:81 260 --MADPAAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAA 337 (466) T ss_pred --CCCHHHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCc Confidence 2367788999998877654 34 34677788899988865433333234456788999999999999984 3455 Q ss_pred cceehh-hhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccc---cchhhhhhhhhhhhhHhhhhhhh Q lcl|NC_011269. 421 GAYASS-ALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVP---IYREIVEYDEETGQEYIRKVPKL 496 (867) Q Consensus 421 ~~~~~~-~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~---~~rd~~~~k~e~~k~~~r~~~k~ 496 (867) ++|+++ +....|++.-+.-+..+|++.+.+.+-...|- ...+++|-...+ +..+..+..+..+ ..+...... T Consensus 338 st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~---~~~~~~f~~~~llr~d~~~r~~~~~~~~-~~~~~~~~~ 413 (466) T protein:vir:81 338 ATYSNYGQARRRLADGTAHPLWQNLSGCIGHVMPDMGPD---VRLWYDADDVPFLREDEKDAADIQKVRA-ETINTLITA 413 (466) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccC---cceEEEecchhhhccCHHHHHHHHHHHH-HHHHHHHHc Confidence 788875 56667777778888888888887776322221 111222211111 1111111100000 001111000 Q ss_pred hhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCC Q lcl|NC_011269. 497 LIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLP 576 (867) Q Consensus 497 i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P 576 (867) . +...+ +|...+ .+.+.-+..+.....+...+... ..........+..+.. + T Consensus 414 g---~t~nE--~r~~~~-~gd~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~Gg~~n----g-- 465 (466) T protein:vir:81 414 G---YEPES--VVAAVN-SGDLRLLKHTGLTSVQLLPPGVS----------------ASASSDTPTSGGADDN----G-- 465 (466) T ss_pred C---CChhh--cccccc-CCccccccCCCcchhhhcccccc----------------cccCCCCcccCCCCcC----C-- Confidence 0 00000 000000 00000000000000000000000 0000000000000000 0 Q ss_pred CCcc Q lcl|NC_011269. 577 YPPE 580 (867) Q Consensus 577 ~pp~ 580 (867) . T Consensus 466 ---n 466 (466) T protein:vir:81 466 ---N 466 (466) T ss_pred ---C Confidence 0 No 51 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=98.20 E-value=4.7e-07 Score=55.25 Aligned_cols=402 Identities=10% Similarity=0.092 Sum_probs=180.0 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhcc Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATH 123 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 123 (867) |+.+-+.+-+. ..+.. ..+.++.....+......+.+-+ .+. .=|-++ T Consensus 1 ~~~~~~~~~~k---~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~----------v~~--------------~~a~~~ 48 (409) T protein:vir:94 1 MAKENIVTRIK---KKLID-----NWIDQSASKLYDFSPWKNKSFWG----------VIN--------------NTLETN 48 (409) T ss_pred Ccccccchhhh---hHHhh-----hhhcCCcccccccccccCccccc----------cch--------------hhhhcc Confidence 33332221111 01111 11112221111111111111111 111 113456 Q ss_pred chHHHHHHhhhh-cccccceecccchhHHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehh Q lcl|NC_011269. 124 DLVPLLIDIYSK-FPVVGMEFDSKDPLIKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSE 199 (867) Q Consensus 124 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 199 (867) +.|-.|||+-++ ..-..|+.--+....+.-..+++. ...++-.+|+..++ ..++.-|+++-+..-|. .|...++ T Consensus 49 ~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gnay~~i~r~~-~G~~~~L 126 (409) T protein:vir:94 49 ETIFSAITKLSNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIE-TIRNEKGNAYVLIERDI-YHQPSKL 126 (409) T ss_pred HHHHHHHHHHHHhhhhCceeEeecccccchhHHHHHhhhcccCCCHHHHHHHHH-HHHhhcCCeEEEEEECC-CCcEEEE Confidence 667777776542 111112332233333333333332 23556678888866 77888899887776654 4557789 Q ss_pred eecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCC Q lcl|NC_011269. 200 EILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGL 279 (867) Q Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 279 (867) ..|+|+.|.|... . +++.+- |+ | ....|..+ T Consensus 127 ~~l~~~~v~v~~~---~---------------------~~~~~~-------------------y~-~-----~~~~g~~~ 157 (409) T protein:vir:94 127 FLLNPDVVEMLIE---N---------------------QSRELY-------------------YS-I-----HAATGNKL 157 (409) T ss_pred EEEcCceeEEEEe---C---------------------CCcEEE-------------------EE-E-----EcCCceEE Confidence 9999999987621 0 000000 00 0 01123345 Q ss_pred cccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhc-hhhhhhhcccccCCCCcCCCCHHHHHHH Q lcl|NC_011269. 280 DISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYS-PLVLATLGIEDMGDGEPWIPDQGELDEV 358 (867) Q Consensus 280 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 358 (867) .++..-|-|+.+..+-=.-.|.+.+..+-++|-..... ++.-...... +-.+.+.+ . --++++.+.+ T Consensus 158 ~~~~~dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~---~~~~~~~~~~~~~~i~~~~-------~--~l~~e~~~~~ 225 (409) T protein:vir:94 158 IVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAV---RTFNLTEMQKPDSFMLKYG-------S--NVGKEKRQQV 225 (409) T ss_pred EEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHH---HHHHHHhcCCCCeeEEecC-------C--CCCHHHHHHH Confidence 56667788887644333345888776555555443322 2222222211 11222222 1 2478999999 Q ss_pred HHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHHHH Q lcl|NC_011269. 359 RDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQIM 437 (867) Q Consensus 359 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~ 437 (867) |++|+......-..+|-.-|++++.....-+...+-.-.+...++|.+++||...++.+++.++|+++. ....|+..-+ T Consensus 226 ~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l 305 (409) T protein:vir:94 226 LEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTL 305 (409) T ss_pred HHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHH Confidence 999998877767788888899998876554433333444556788999999999999888888999863 3445555556 Q ss_pred HHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhh Q lcl|NC_011269. 438 TGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAF 517 (867) Q Consensus 438 ~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~ 517 (867) .=+...|++++.+.+=...|... .+.|+|-+..+.-- +.++++ +. ..+..+ -+-+..+.-+...+ T Consensus 306 ~P~~~~ie~~ln~~Ll~~~~~~~--~~~i~fd~~~ll~~---d~~~~~-~~-~~~~~~-------~G~~T~NE~R~~~g- 370 (409) T protein:vir:94 306 LPIVKQYEEEFNRKLLTKTDREK--NRYFKFNVKSYLRA---DSATQA-EV-YFKAVR-------SGYYTINDIREWED- 370 (409) T ss_pred HHHHHHHHHHHHHhhCCcccccC--cceEEeechhhhcc---CHHHHH-HH-HHHHHh-------CCCcCHHHHHHHhC- Confidence 66666666666665533333211 12233322222111 222211 11 111100 00011111011000 Q ss_pred hhhhhhc-eeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccC Q lcl|NC_011269. 518 IAQLKGM-GVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQN 574 (867) Q Consensus 518 v~qL~~~-~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g 574 (867) ...++.- ...++.-..+ +....+.+ + ..+.-+..... + T Consensus 371 ~~p~~ggD~~~~~~n~~~--~~~~~~~~---~-------------~~kGG~~n~~e-~ 409 (409) T protein:vir:94 371 LPPVEGGDKPLISGDLYP--IDTPLELR---K-------------SLKGGDKNVNE-S 409 (409) T ss_pred CCCCCCcCeEeecccccc--cccchhhc---c-------------cccCCCCCcCC-C Confidence 0000000 0001100000 10100000 0 00000000000 0 No 52 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=98.19 E-value=3e-08 Score=61.84 Aligned_cols=347 Identities=12% Similarity=0.114 Sum_probs=169.7 Q ss_pred eeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhc------cchHHHHHHhhhhcccccceecccchhHHHH Q lcl|NC_011269. 80 MQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYAT------HDLVPLLIDIYSKFPVVGMEFDSKDPLIKTF 153 (867) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (867) |+|-+|-.+ - ...-|-+|... +..-..|-.--|++ ++=|=.|||+-+. -|..+.+. +|+.+..+ T Consensus 1 M~~~~~f~~-r-----~~~~~~~~~~~--~~~~~~~~~~~~v~~~~al~~~av~~cv~~ia~-~ia~~p~~-~~~~~~~L 70 (359) T protein:vir:10 1 MSILNPFER-R-----SSITPNNYYPF--MVQNGSIVPNSLVDATEALKNSDLYAVTSLISS-DIAGTRFI-GNQVFTSV 70 (359) T ss_pred Ccccchhhc-c-----ccCCCCcchhh--hhccccccCCcccCHHHhhcchHHHHHHHHHHH-hhhcCccc-cchHHHHH Confidence 444443111 0 01112222111 00011121111222 2345568887765 44455554 44544443 Q ss_pred HHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccc Q lcl|NC_011269. 154 YEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQ 233 (867) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (867) ...- .+..+=.+|+..++ ..++.-|+.+-+.+-+. .|...++..|+|+.|.|. ...+..+ T Consensus 71 ~~~P--N~~~t~~~f~~~~~-~~lll~Gnay~~i~r~~-~g~~~~l~~l~~~~v~i~---~~~~~~~------------- 130 (359) T protein:vir:10 71 LNNP--SHLTNAFSFWQTAI-LNLLLNGNVFLAILKGD-NSLMKELRLIPSNAITID---LTDDTLT------------- 130 (359) T ss_pred hhcc--cccCCHHHHHHHHH-HhccccCceEEEEEECC-CCeEEEEEEeCCceEEEE---EcCCeEE------------- Confidence 3221 23456678888876 66677788877766664 467788899999999885 2222110 Q ss_pred cccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCcc----ccccCcchhhHHHH Q lcl|NC_011269. 234 GPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTA----WATRGAPHLLRSFR 309 (867) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~ 309 (867) | +| .....+....++..-|-|++..... ..-.|.+-+-.+-+ T Consensus 131 -------------------------y-----~~----~~~~~~~~~~~~~~evih~~~~~~~~~~~dg~~G~spi~~~~~ 176 (359) T protein:vir:10 131 -------------------------Y-----EV----NQFDDYPSAKYNASEMIHVKIMAYGVDTLHNLVGHSPLESLTS 176 (359) T ss_pred -------------------------E-----EE----EecCCceEEEEcccceEEeccCCCCCCccCccccccHHHHHHH Confidence 0 00 0011223344555567788754432 12258887777777 Q ss_pred HHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch--hhhhhhhheeeeecccc Q lcl|NC_011269. 310 TLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF--RLMVHNFGLKVENVFGR 387 (867) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 387 (867) +|......++......+.-..|--++++.. ...+++..+.+|+.++....+++ ..+|-.-|++.+..+.. T Consensus 177 ~i~~~~~~~~~~~~~f~ng~~~~gil~~~~--------~~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~ 248 (359) T protein:vir:10 177 EIGQQKEANRLSLSTLKGALNPTSVVKVPQ--------GTLSSEAKDSIRKEFEKANGGNNSGRVMVLDQSADFSTVSIN 248 (359) T ss_pred HHHHHHHHHHHHHHHHhccCCcceEEEeCC--------CCCCHHHHHHHHHHHHHHhCccccCCceecCCCcceeeecCC Confidence 777766666666666666666666666642 15688999999999988877774 46777788888877654 Q ss_pred CccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchhee Q lcl|NC_011269. 388 ESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLK 467 (867) Q Consensus 388 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~ 467 (867) -....+-+-.+...++|.+++||.-.+|. +.|...++ .+-+-|.+.. .|+.+|+.. +.+|+-.++.++. T Consensus 249 ~~d~q~le~~~~~~~~Ia~~fgVPp~~lg-~~~~~~~~----~~~~e~~~~~---~l~~~l~p~---~~~l~~~l~~~~~ 317 (359) T protein:vir:10 249 ADVANYLNSMNWGRTQIAKAFGVSDSYLN-GTGDQQSS----LDQIKDLYVN---ALNRFIEPL---ISELRIKCDSSIG 317 (359) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCHHHhC-CCCccccc----HHHHHHHHHH---HHHHHHHHH---HHHHHHHhhhhhc Confidence 33333333446677899999999999996 43332222 1112233322 233333322 2233332222221 Q ss_pred hhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhce Q lcl|NC_011269. 468 GGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMG 525 (867) Q Consensus 468 ~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~ 525 (867) ++...+++++.+.- +...+. -+.-+-+..+.-+. +..++.-. T Consensus 318 -----~~~~~~~~~d~~~~----~~~~~~---~~~~G~~t~NE~R~----~l~~~pv~ 359 (359) T protein:vir:10 318 -----VDMSPITDYSNSVF----KADILN---WVKEGIIEPTEAKT----LLESKGII 359 (359) T ss_pred -----ccchhhhhcCHHHH----HHHHHH---HHhCCCcCHHHHHH----HhCCCCCC Confidence 22222333322111 111111 11111111110000 00111111 No 53 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=98.16 E-value=2.1e-07 Score=57.23 Aligned_cols=340 Identities=11% Similarity=0.125 Sum_probs=161.3 Q ss_pred ccc--ceecccchhHHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhh Q lcl|NC_011269. 138 VVG--MEFDSKDPLIKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRS 212 (867) Q Consensus 138 ~~~--~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (867) |.. |++.-+++.++.-..+++- .+.++-.+|+..++ ..++.-|+++-+..-|.. |...++..|+|+.|.|... T Consensus 1 ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~l~l~Gna~~~i~r~~~-G~~~~L~~l~~~~v~~~~~ 78 (348) T protein:vir:93 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIE-TIRNEKGNAYVLIERDIY-HQPSKLFLLNPDVVEMLIE 78 (348) T ss_pred CcccceEeEecCcCcccHHHHHHHhCCCCCCCHHHHHHHHH-HHHhhcCCeEEEEEECCC-CcEEEEEEEcCCceEEEEe Confidence 333 2222233334433334432 23466778887766 777888998888777765 4567899999999987521 Q ss_pred hhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhc Q lcl|NC_011269. 213 MFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNR 292 (867) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (867) .+ ++.+ . |+ | ....|....++..-|-|+.+- T Consensus 79 ---~~---------------------~~~~---------------~----y~-~-----~~~~g~~~~~~~~eiih~r~~ 109 (348) T protein:vir:93 79 ---NQ---------------------SREL---------------Y----YS-I-----HAATGNKLIVHNMDMLHFKHI 109 (348) T ss_pred ---CC---------------------CcEE---------------E----EE-E-----EcCCCeEEEEccccEEEecCC Confidence 10 0000 0 00 0 111234455666778888764 Q ss_pred CccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhch-hhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchh Q lcl|NC_011269. 293 PTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSP-LVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFR 371 (867) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 371 (867) .+.=.-.|.+.+.-+-.++-.. .+|++........+ ..+.+.+. .-+.++.+.+|+.++........ T Consensus 110 ~~~~~~~G~s~~~~~~~~i~~~---~~~~~~~~~~~~~~~~~i~~~~~---------~l~~e~~~~~~~~~~~~~~n~~~ 177 (348) T protein:vir:93 110 VASNMVQGISPIDVLKNTTDFD---NAVRTFNLTEMQKPDSFMLKYGS---------NVSTEKRQQVLEDFKQYYEENGG 177 (348) T ss_pred CCCCceeeccHHHHHHHHHHHH---HHHHHHHHHhcCCCceeEEecCC---------CCCHHHHHHHHHHHHHHhhcCCC Confidence 3322235888765555544332 23333333333332 33334432 24678899999999888776677 Q ss_pred hhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhh-hHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011269. 372 LMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSAL-NREFVTQIMTGFQNALKRHIRR 450 (867) Q Consensus 372 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~r~ 450 (867) .+|---|++++.....-+-.-+-.-.+...++|.+++||.-.++.++++.+|+++.- ...|+..-+.=+...|++++.+ T Consensus 178 ~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~ 257 (348) T protein:vir:93 178 ILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNR 257 (348) T ss_pred eeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 788888999888875443332223344568889999999999998888899998743 3334444455555556655555 Q ss_pred hhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhc-eeeee Q lcl|NC_011269. 451 RCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGM-GVPVS 529 (867) Q Consensus 451 ~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~-~~pit 529 (867) .+-...|... .+.++|-...+...|..+.-+ ..+-.+.. +-+..+.-+...+ ...++.- ...++ T Consensus 258 ~l~~~~~~~~--g~~i~fd~~~l~~~d~~~~a~-~~~~~~~~-----------G~~T~NE~R~~~g-~~p~~ggD~~~~~ 322 (348) T protein:vir:93 258 KLLTKTDREK--NRYFKFNVKSYLRADSATQAE-VYFKAVRS-----------GYYTINDIREWED-LPPVEGGDKPLIS 322 (348) T ss_pred hhCCcccccC--cceEEeechhhhccCHHHHHH-HHHHHHhC-----------CCCCHHHHHHHhC-CCCCCCcCeEeec Confidence 4422222211 122333222222222111111 11111110 0011110011000 1100000 00011 Q ss_pred ccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccC Q lcl|NC_011269. 530 DKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQN 574 (867) Q Consensus 530 d~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g 574 (867) .-..+.+ ...+.+ ...+ -.+.-...+ T Consensus 323 ~n~~~~~--~~~~~~----------------~~~~-gg~~n~~~~ 348 (348) T protein:vir:93 323 GDLYPID--TPLELR----------------KSLK-GGDKNVNES 348 (348) T ss_pred ccccccc--cchhhc----------------cccc-CCCCCcCCC Confidence 1111110 000000 0000 000000000 No 54 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=98.14 E-value=3.5e-07 Score=55.98 Aligned_cols=399 Identities=8% Similarity=0.006 Sum_probs=189.4 Q ss_pred HHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCc-hhhhHHHHHHHHHHHHHhhccchH Q lcl|NC_011269. 48 PLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFN-VEDEEELRVIRHWCRLFYATHDLV 126 (867) Q Consensus 48 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 126 (867) =+-+-+..++. .. ++ .. .-.+.... ++.+. +.-+ ..-..-|..++.| T Consensus 1 ~~f~~~f~r~~--------------~~----~~--~~---~~~~~~~~--~~~~~~~~g~-------~v~~~~~l~~~~v 48 (413) T protein:vir:48 1 MFFSGLFQRKS--------------DA----PV--TT---PAELAEAI--GLSYDTYTGK-------RISSQRAMRLTAV 48 (413) T ss_pred CccchhhccCc--------------cC----Cc--cc---hHHHHHhh--hcCcccccCc-------eechhhhhccHHH Confidence 00000000000 00 00 00 00111111 11110 0000 0011335567788 Q ss_pred HHHHHhhhh-cccccceecccchhHHHHH-----HHHh---hcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhcccee Q lcl|NC_011269. 127 PLLIDIYSK-FPVVGMEFDSKDPLIKTFY-----EDLF---FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWS 197 (867) Q Consensus 127 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-----~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 197 (867) -.|||+.++ +.-+.++...++...++-. .+++ -.+..+-.+|+...+ ..++.-|+.+-+.+-+ .|... T Consensus 49 ~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gn~~~~i~~~--~g~~~ 125 (413) T protein:vir:48 49 YSCVRVLAESVGMLPCSLYKISGTLKTRVVDERLHKLVSAKPNGYMTPQEFWELVI-VCLCLRGNFYAYKVKA--LGEVV 125 (413) T ss_pred HHHHHHHHHhhhhCceEEEEecCCcceeecccHHHHHHHhhccCCCCHHHHHHHHH-HHHhhcCceEEEEEeC--CCcEE Confidence 889988765 2222233333332222211 1111 123455668888866 8888899988777644 57778 Q ss_pred hheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCC Q lcl|NC_011269. 198 SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQND 277 (867) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (867) ++.+|+|+.|.|.. ..+-.+ -| + -....|. T Consensus 126 ~L~~l~~~~v~~~~---~~~~~~-------------------------------------~y----~------~~~~~g~ 155 (413) T protein:vir:48 126 ELLPIDPGCVEPKL---NSQWQP-------------------------------------VY----Q------VTFPDGS 155 (413) T ss_pred EEEEEcCceEEEEE---cCCceE-------------------------------------EE----E------EEecCce Confidence 89999999998751 111000 00 0 0111222 Q ss_pred CCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHH Q lcl|NC_011269. 278 GLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDE 357 (867) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 357 (867) ...++...|-|+.+-.... -.|.+.+..+-++|-...+...+.....++-..|=-++++-+ .-+.++.++ T Consensus 156 ~~~~~~~evih~~~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~---------~~~~e~~~~ 225 (413) T protein:vir:48 156 VDVLTQDEIWHVRTLTLDG-LVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQ---------KLTPDAYER 225 (413) T ss_pred EEEEccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC---------CCCHHHHHH Confidence 3345566777887544322 369999999999888777777777777777777766666642 236788999 Q ss_pred HHHHHHHhhh-cc--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHH Q lcl|NC_011269. 358 VRDDMQSLLA-AD--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFV 433 (867) Q Consensus 358 ~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~ 433 (867) +|+.++.... ++ ...+|-.-|++++.++..-+..-+-+-.+...++|.+++||.-.++.+.++++|+++. ....|+ T Consensus 226 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~ 305 (413) T protein:vir:48 226 LKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFI 305 (413) T ss_pred HHHHHHHHhcCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHH Confidence 9999887764 24 3467777889998887654444444556677889999999999999988889999854 344445 Q ss_pred HHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhh Q lcl|NC_011269. 434 TQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQ 513 (867) Q Consensus 434 ~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~ 513 (867) ..-+.=+...|++.+.+.+-.-.+-. ++.|+|-+..+...|..+.-+ ..+..++.. -+..+.-+. T Consensus 306 ~~~i~P~~~~ie~~l~~~L~~~~~~~---~~~~~fd~~~l~~~d~~~~~~-~~~~~~~~g-----------~~T~NE~R~ 370 (413) T protein:vir:48 306 NYSLVPYLTRIEQRINTGLVRESKQG---KFYAKFNAGALLRGDMKSRFE-AYATGINWG-----------IYSPNDCRD 370 (413) T ss_pred HHHHHHHHHHHHHHHHhhccCccccC---CeEEEEechhhhccCHHHHHH-HHHHHHhCC-----------CcCHHHHHH Confidence 54566666666666665542112211 233443333332222211111 111111110 011110010 Q ss_pred hhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcccc Q lcl|NC_011269. 514 ERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELA 582 (867) Q Consensus 514 ~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~a 582 (867) ..+ +..++. +....-+..-.....-.+.....+ . ....+. ..+ T Consensus 371 ~~g-~~p~~g-gD~~~~~~n~~~~~~~~~~~~~~~----------------~-~~~~~~-------~~~ 413 (413) T protein:vir:48 371 LED-MNPRPG-GDVYLTPMNMTTSPSAGDDNGKKK----------------E-SGDADK-------TAS 413 (413) T ss_pred HhC-CCCCCC-cceeeccccccccccccccCCCCC----------------C-CCCccc-------cCC Confidence 000 000000 000000000000000000000000 0 000000 000 No 55 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=98.12 E-value=6.8e-07 Score=54.39 Aligned_cols=444 Identities=13% Similarity=0.111 Sum_probs=200.4 Q ss_pred cchhhhhhhhhH-----------------H-------hhC-CCchhhhHHHH-----HHHHHHHHH---hhccchHHHHH Q lcl|NC_011269. 84 MPKIRQPLGTLA-----------------D-------KGI-PFNVEDEEELR-----VIRHWCRLF---YATHDLVPLLI 130 (867) Q Consensus 84 ~~~~~~~~~~~~-----------------~-------~~~-~~~~~~~~~~~-----~~~~~~~~~---~~~~~~~~~~~ 130 (867) |++|+-+.|... + .+| |..+ ..-|+ ++++-|.+| ....+-|..|| T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l--~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~l 78 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKL--ARILVEAEQGNLQAQAELFMDMEERDAHLFAEM 78 (526) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCcCCCCHHHH--HHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHH Confidence 333332222110 0 011 1111 11222 345555555 23466777777 Q ss_pred Hhhhhcccccceecc--------cchhHHHHHHHHhhcccc-cHHHHhHHHHHHHHhhhhhhcchhh--hhhhcccee-- Q lcl|NC_011269. 131 DIYSKFPVVGMEFDS--------KDPLIKTFYEDLFFGEDL-NYLEFLPDQFAREYFTVGEVTSLAH--FNESLGVWS-- 197 (867) Q Consensus 131 ~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-- 197 (867) +. ++-+|.+.++.. .|.-+.+|.++++- ++ |+..+|.|+.- -++--+|+.. +..++|.|. T Consensus 79 ~~-Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~--~~~~~~~~i~~~ld----a~~~G~s~~Eivw~~~~g~~~~~ 151 (526) T protein:vir:99 79 SK-RKRAILGLDWAVEPPRNASAAEKADADYLHELLL--DLEGLEDLLLDALD----GIGHGYSCIELEWALQGREWMPL 151 (526) T ss_pred HH-HHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHh--cccCHHHHHHHHHH----hhhhcceeEEEEEeecCCceeEE Confidence 75 455677654332 23356677777775 54 57777777551 1111223222 334455554 Q ss_pred hheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCC Q lcl|NC_011269. 198 SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQND 277 (867) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (867) .++..+|..+.+. ......| | +.+ ...+ T Consensus 152 ~l~~r~~~~f~~~-----~~~~~~l--------------------------------~----------~~~-----~~~~ 179 (526) T protein:vir:99 152 AFHHRPQSWFQLN-----PEDQNEL--------------------------------R----------LRD-----NSPA 179 (526) T ss_pred Eeeeecccceeec-----cCCCcEE--------------------------------E----------ecC-----CCCC Confidence 4455555444322 1111110 0 000 1345 Q ss_pred CCcccHH-HHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHH Q lcl|NC_011269. 278 GLDISEA-LISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELD 356 (867) Q Consensus 278 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 356 (867) |++|+.. .|.|. |+++.=.+.|..++-.|+-..+.|...-+--...+.|+-.|+|+.|.+. |. +++|.+ T Consensus 180 g~~l~~~k~i~~~-~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~-----~a----~~~ek~ 249 (526) T protein:vir:99 180 GEALQPFGWIIHR-PRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPP-----GT----ADEEKA 249 (526) T ss_pred ceeecCCCeEEEe-ecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCC-----CC----CHHHHH Confidence 6666654 56664 6777777899999999999999999888888899999999999999863 21 345666 Q ss_pred HHHHHHHHhhhcchhhhhhhhheeeeeccccCccCc-hhHHHHHHHHHHHHhhccchhhhcC---CCccceehhhhhHHH Q lcl|NC_011269. 357 EVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPN-LDADYDRIERKLLQAWGIGEALISG---GTGGAYASSALNREF 432 (867) Q Consensus 357 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~---g~~~~~~~~~~~~~~ 432 (867) .+-+.+.++ ..|-. +|.--+-.+|.+.+...-.. -+.=++...++|..++ +|+-|+|. |-|++|+-+.|--+. T Consensus 250 ~L~~av~~i-~~d~~-~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGqtlTs~~~~g~~gS~a~g~vh~~v 326 (526) T protein:vir:99 250 TLLRAVTGL-GHAAA-GIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LGGTLTSTTSQSGGGAFALGQVHNEV 326 (526) T ss_pred HHHHHHHHH-hhCcE-EEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhhhhccccccCcchhhhHHHHHHHH Confidence 666554443 22332 22223455666654332222 2344577788898888 89999873 456889988888877 Q ss_pred HHHHHHHHHHHHHHHHhh-hhHHHHHhhcccchhe----ehhhccccchh---hhhhhhhhhhhHhhhhhhhhhhhhccc Q lcl|NC_011269. 433 VTQIMTGFQNALKRHIRR-RCEVVAEAQGHYDYDL----KGGVRVPIYRE---IVEYDEETGQEYIRKVPKLLIPEIKFS 504 (867) Q Consensus 433 ~~~~~~~~~~~l~~~~r~-~~~~i~e~q~~~d~~~----~~~~~~~~~rd---~~~~k~e~~k~~~r~~~k~i~~~i~~~ 504 (867) ...+.-.-...|..++.+ .++++.++|......+ ++.+.....=| ..+..+++.....+....-+..+.... T Consensus 327 ~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~Gip 406 (526) T protein:vir:99 327 RHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGIP 406 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCccCHHHHHHHhCCC Confidence 777777888999999965 7799999998655331 22222222211 233333232222222212222222221 Q ss_pred ccc-----ccchh-hhhhhhhhhhhceeeeeccccCCCcc-cccchhh--------hhhHHHHHH---HHhhcccccccc Q lcl|NC_011269. 505 TLN-----LRDEA-QERAFIAQLKGMGVPVSDKTLAVNID-MKFDQEL--------ERQADETVQ---KLMATAQAMKKV 566 (867) Q Consensus 505 ~~~-----Lr~e~-~~~~~v~qL~~~~~pitd~t~p~tiq-me~E~e~--------e~k~~E~l~---tL~~taet~kkv 566 (867) ... |.... +... -..........+....+.... ...+.-. +......+. .+...+.+...+ T Consensus 407 ~~~~~e~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~~~~~~~~~~~~~l~~i~~~l~~~~s~ee~ 485 (526) T protein:vir:99 407 QPAKNEPVLRSAAQPAIL-SRQHGQRVAALATIVGPRYGDQQALDKALADLPAKDMQNQANDLLAPLLEAVNRGDSETEL 485 (526) T ss_pred CCCCcccccCCCCCCccc-ccccccccccccccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHH Confidence 110 10000 0000 000000000000000000000 0000000 111111111 111122222211 Q ss_pred cccccccCCCCC-ccccccccccccCCC--CC--CCCCCCC Q lcl|NC_011269. 567 QDLCDAQNLPYP-PELAQHLQSTLALRQ--GK--TQTELGE 602 (867) Q Consensus 567 q~~~p~~g~P~p-p~~aQ~p~~t~~~a~--gp--gq~~~~q 602 (867) ...-...-...+ ..+.+........+. |. ......+ T Consensus 486 ~~~L~~l~~~ld~~~l~~~l~~a~~~A~l~Gr~~~~~e~~~ 526 (526) T protein:vir:99 486 LGALAEAFPDMDDSALTDALHRLLFAADTWGRLHGNLDRID 526 (526) T ss_pred HHHHHHHhccCCHHHHHHHHHHHHHHHHHhhhhhhhhcccC Confidence 111100000001 111111110000000 00 0000011 No 56 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=98.11 E-value=2.5e-06 Score=51.27 Aligned_cols=412 Identities=12% Similarity=0.075 Sum_probs=177.2 Q ss_pred Hhcccccccceeeccchh-hhhhhhhHHhhCCCc--hhhhHHHHHHHHH--HHHHhh---ccchHHHHHHhhhhcccccc Q lcl|NC_011269. 70 YRKQGNFGSNMQIAMPKI-RQPLGTLADKGIPFN--VEDEEELRVIRHW--CRLFYA---THDLVPLLIDIYSKFPVVGM 141 (867) Q Consensus 70 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~--~~~~~~~~~~~~~--~~~~~~---~~~~~~~~~~~~~~~~~~~~ 141 (867) |+..-- +.=+++-+ -.=++..++++-+-+ .|...+|| | |-..|. ..+-|..||+- ++.+|.+. T Consensus 1 ~~~~~~----~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr----~~~~~~ly~~m~~D~hi~s~l~~-Rk~av~~~ 71 (488) T protein:vir:95 1 MADITE----TQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALR----FPESIKTFQLMMRDPAVAASVNI-IKMFVRKV 71 (488) T ss_pred CCCccc----cCCCCCHHHHHHHHHHhhccccchhhccchhhhc----ccchHHHHHHHhhChHHHHHHHH-HHHHHhcC Confidence 222110 01111111 112344444433322 23333443 3 112232 36668888875 55667664 Q ss_pred --eecc---cchh-----HHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecC-------- Q lcl|NC_011269. 142 --EFDS---KDPL-----IKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILN-------- 203 (867) Q Consensus 142 --~~~~---~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------- 203 (867) +|.. +++. +-+|+++.+=+-+.++.++|.|+. |.+.++ |.-.--+|..-...+ T Consensus 72 ~w~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~l--------da~~~G-~s~~Eivw~~~~~~~~~~~~~~~ 142 (488) T protein:vir:95 72 NWRFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSVM--------SFCTYG-FCVNEKVYKKRQGKKGKYQSKFD 142 (488) T ss_pred CceEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHHH--------Hhhccc-ceeeeeeeecccccccccccccc Confidence 4431 2222 336676666422344556666644 222222 222334454321111 Q ss_pred -----cceeehh------hhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchH-HHh Q lcl|NC_011269. 204 -----PDMLRVS------RSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPE-IIQ 271 (867) Q Consensus 204 -----~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 271 (867) |+.|.+. +=.|..+....+ +.+++++ +.. T Consensus 143 dg~~~~~~i~~Rpq~~~~~f~~d~d~~l~~---------------------------------------~~~~~~~~~~~ 183 (488) T protein:vir:95 143 DGLIGWAKLPIRNQSTLDKWYFDEDFRRVT---------------------------------------GVRQNLRNVSH 183 (488) T ss_pred CCeeeeeeeeecCcccccceeeccCCCcee---------------------------------------ecccccccccc Confidence 1111111 000211111100 0112211 111 Q ss_pred h------hccCCCCcccHH-HHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhh--ccccc Q lcl|NC_011269. 272 A------AMQNDGLDISEA-LISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATL--GIEDM 342 (867) Q Consensus 272 ~------~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 342 (867) . .....++.|+.. +|.|+ |++..=.+.|..+|..||..-+.|+..-+-....+.|+-.|+.+.+. +. T Consensus 184 ~~~~~~~~~~~~~~~lP~~kfi~~~-~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~--- 259 (488) T protein:vir:95 184 IAGAINLGERPLTRKLPRAKFMLFK-YDDEYGNPEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDY--- 259 (488) T ss_pred cccccccccccccccccccceEEEe-ecCCCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCC--- Confidence 1 123455666544 55555 55655567799999999999999999988899999998888776654 42 Q ss_pred CCCCcCCCCHHHHHHHHHHHHHhh---hcchh-hhhhhh---------heeeeeccccCcc-CchhHHHHHHHHHHHHhh Q lcl|NC_011269. 343 GDGEPWIPDQGELDEVRDDMQSLL---AADFR-LMVHNF---------GLKVENVFGRESV-PNLDADYDRIERKLLQAW 408 (867) Q Consensus 343 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~-~~~~~~---------~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 408 (867) .+++. .++.+.+.+.+..+. +++.. -+|-.. .++++..++.+.- ...+.=++...++|-.++ T Consensus 260 ~~~~~----~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~i 335 (488) T protein:vir:95 260 LDENA----EPEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAF 335 (488) T ss_pred CCCcc----cHHHHHHHHHHHHHHHHhhccchhheeeccccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHH Confidence 22222 223333333222211 11110 011111 2355666666533 223333566777888888 Q ss_pred ccchhhhcC-CCccceehhhhhHHHHHHHHHHHHHHHHHHHhh-hhHHHHHhhcccchh-eehhhccccchhhhhhhhhh Q lcl|NC_011269. 409 GIGEALISG-GTGGAYASSALNREFVTQIMTGFQNALKRHIRR-RCEVVAEAQGHYDYD-LKGGVRVPIYREIVEYDEET 485 (867) Q Consensus 409 ~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~-~~~~i~e~q~~~d~~-~~~~~~~~~~rd~~~~k~e~ 485 (867) +|+-|+++ |.|+.||.+.|-.|......-.-.+.|..++++ +++++..+|...+.. .++.+....+-|+ T Consensus 336 -LGqtLT~~~~~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~~Nfg~~~~~P~~~~~~~e~~Dl------- 407 (488) T protein:vir:95 336 -MSDVLAMGQSKYGSFSLADSKTSLLAMSVDILLKQIKNVINRDLVAQTYALNMWDDEEHVQITYDDIETPDL------- 407 (488) T ss_pred -hccccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecCcChhhH------- Confidence 88888873 457899999999988888888888899999965 679999999543321 1222222222221 Q ss_pred hhhHhhhhhhhhhhhhccccccccchhhhhhhhh------------hhhhceeeeeccccCCCcccccchhhhhhHHHHH Q lcl|NC_011269. 486 GQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIA------------QLKGMGVPVSDKTLAVNIDMKFDQELERQADETV 553 (867) Q Consensus 486 ~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~------------qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l 553 (867) +.....+.+.....+.+.+.. .+..+. .+.....+...+..+.....+ T Consensus 408 ------~~~ae~~~~L~~~G~~i~~~~-~~~~i~e~~gip~~~~~e~~~~~~~~~~~~~~~~~~~~~------------- 467 (488) T protein:vir:95 408 ------EAIGSYIQKTVAVGALEVDKE-LSNKLREHIGLPPADESQPVSEKLSPNSQSRSGDGYKTA------------- 467 (488) T ss_pred ------HHHHHHHHHHHhCCCccccHH-HHHHHHHHhCCCCCCCCccccccCCCCCCCCCCcccCCC------------- Confidence 111111111111111111100 000000 000000000010000000000 Q ss_pred HHHhhcccccccccccccccCCCCCccccccccc Q lcl|NC_011269. 554 QKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQS 587 (867) Q Consensus 554 ~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~ 587 (867) .........+..+..+..... T Consensus 468 -------------~~~~~~~~~~~~~~~a~~~~~ 488 (488) T protein:vir:95 468 -------------GEGTAKTPSAKDPSTANKANK 488 (488) T ss_pred -------------cccCCcccccccchhhhhccC Confidence 000000000000001000000 No 57 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=98.07 E-value=7.8e-07 Score=54.06 Aligned_cols=391 Identities=8% Similarity=0.031 Sum_probs=182.9 Q ss_pred eccch--hhh--------hhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhc-ccccceecccchhH Q lcl|NC_011269. 82 IAMPK--IRQ--------PLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKF-PVVGMEFDSKDPLI 150 (867) Q Consensus 82 ~~~~~--~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 150 (867) |++=+ |+. |....-..+++++...-..+ .+ .=|-.++-|-.||++.++= .-..++..-++... T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v----~~--~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~ 74 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYDTYTGKQI----SS--QRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSL 74 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCccccCCcee----ch--hhhhccHHHHHHHHHHHHHhccCceEEEEecCCc Confidence 22221 111 10000011111111000000 00 1123355677788877541 21222222222222 Q ss_pred HHH-----HHHHh---hcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHH Q lcl|NC_011269. 151 KTF-----YEDLF---FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQL 222 (867) Q Consensus 151 ~~~-----~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (867) |+- ..+++ -.+..+-.+|+..++ ..++.-|+++-+.+ ...|...++..|+|+.|.|...= .++. T Consensus 75 ~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~-~~~ll~Gna~~~i~--~~~g~~~~L~~l~~~~v~~~~~~--~~~~--- 146 (414) T protein:vir:44 75 KQRATGERLHKLISTHPNGYMTPQEFWELVV-TCLCLRGNFYAYKV--KAFGEVAELLPVDPGCVVPKLNS--SWEP--- 146 (414) T ss_pred eeecccchHHHHHHhhcccCCCHHHHHHHHH-HHHhhcCCeEEEEE--eCCCcEEEEEEEcCceEEEEECC--CCcE--- Confidence 211 11111 224456678888877 88888999987765 34688889999999999876210 0000 Q ss_pred HHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcc Q lcl|NC_011269. 223 MVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAP 302 (867) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (867) .| + -....|....++...|-|+.+-... .-.|.+ T Consensus 147 -----------------------------------~y----~------~~~~~g~~~~~~~~evih~~~~~~d-~~~G~s 180 (414) T protein:vir:44 147 -----------------------------------VY----Q------VTFPDGSTDVLSQEDIWHVRTLTLD-GLVGLN 180 (414) T ss_pred -----------------------------------EE----E------EEecCceEEEEccccEEEecCCCCC-Cccccc Confidence 00 0 0112344455777788888754322 136999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhc-c--hhhhhhhhhe Q lcl|NC_011269. 303 HLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-D--FRLMVHNFGL 379 (867) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~ 379 (867) .+..+-.+|-.............++-..|--++++.. .-+.+..+.+|+.++..+.. + ...+|-.-|+ T Consensus 181 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---------~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~ 251 (414) T protein:vir:44 181 PIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQ---------TLSDQAYERLKKDFEERHTGLGNAHRPMILEMGL 251 (414) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC---------CCCHHHHHHHHHHHHHHhcCccccCcceecCCCc Confidence 8888888877666666666666666666766666652 13667788888877766542 3 3355666788 Q ss_pred eeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHHHHHHHHHHHHHHHhhhhHHHHHh Q lcl|NC_011269. 380 KVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQIMTGFQNALKRHIRRRCEVVAEA 458 (867) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~ 458 (867) +++.+...-+..-+-.-.+...++|.+++||...++.++++++|+++. -...|+..-+.=+...|++.+.+.+-.-.|- T Consensus 252 ~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~ 331 (414) T protein:vir:44 252 DWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGLVRKSKQ 331 (414) T ss_pred eEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccc Confidence 888776443322222334566788999999999999988889999964 3344555556666777777776655222221 Q ss_pred hcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcc Q lcl|NC_011269. 459 QGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNID 538 (867) Q Consensus 459 q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiq 538 (867) + .+.|+|-+..+.-.|..+.-+-+.++ +.....- ..++.. .+.+............+..+..+... . T Consensus 332 -~--~~~i~fd~~~ll~~d~~~~~~~~~~~-~~~G~~t-~NE~R~-~~gl~p~~ggD~~~~~~n~~~~~~~~----~--- 398 (414) T protein:vir:44 332 -G--VFYAKFNAGALLRGDMKSRFEAYATG-INWGIYS-PNDCRD-LEDMNPRPGGDVYLTPMNMTTKPSDG----S--- 398 (414) T ss_pred -C--ceEEEEechhhhccCHHHHHHHHHHH-HhCCCcC-HHHHHH-HhCCCCCCCcceecccccccccCCcc----c--- Confidence 1 22344433333222222211111111 1110000 000000 00000000000001111000000000 0 Q ss_pred cccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcccc Q lcl|NC_011269. 539 MKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELA 582 (867) Q Consensus 539 me~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~a 582 (867) +...+. -....+++ .+ T Consensus 399 -~~~~~~--------------------~~~~~d~~-------~~ 414 (414) T protein:vir:44 399 -KAGKQK--------------------DNANADET-------TS 414 (414) T ss_pred -cCCCCC--------------------CCCCCCCC-------CC Confidence 000000 00000000 00 No 58 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=98.05 E-value=7.4e-07 Score=54.20 Aligned_cols=426 Identities=11% Similarity=0.014 Sum_probs=181.9 Q ss_pred CCc-hHHHHHHHHhhhcchhHHHHHHHHhcccccccc--eeeccchhh-hhhhhhHHhhCC---CchhhhHHHHHHHHHH Q lcl|NC_011269. 44 VDN-KPLIDYFQGRRRAAEANRQRLASYRKQGNFGSN--MQIAMPKIR-QPLGTLADKGIP---FNVEDEEELRVIRHWC 116 (867) Q Consensus 44 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 116 (867) |-- |+ --||..-+ .-.+|+.- |.=-|-|-.. =.+..|..- +.+.+...-... .++..+..| T Consensus 1 ~~~~~~-~~~~~~~~-~~~~~~~~---~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al------- 68 (441) T protein:vir:79 1 MHWYNT-DCYFVDFK-SRKQSRKE---LVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAI------- 68 (441) T ss_pred CccccC-cccccccc-ccccchhh---hhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhh------- Confidence 100 00 00111000 11122211 1111111000 000011000 000000000000 011111111 Q ss_pred HHHhhccchHHHHHHhhhh-cccccceecccchhHHHHHHHHhhc----ccccHHHHhHHHHHHHHhhhhhhcchhhhhh Q lcl|NC_011269. 117 RLFYATHDLVPLLIDIYSK-FPVVGMEFDSKDPLIKTFYEDLFFG----EDLNYLEFLPDQFAREYFTVGEVTSLAHFNE 191 (867) Q Consensus 117 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (867) .++-|=.||++-++ ..-..+++..+....++-..+..|- +..+=.+|+..++ ..++.-|+.+-+...|. T Consensus 69 -----~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gnay~~i~r~~ 142 (441) T protein:vir:79 69 -----RHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVF-VSALLTSHGYIEITRDK 142 (441) T ss_pred -----ccHHHHHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHH-HHHhhcCCeEEEEEECC Confidence 12334456655433 1111122222222222222222222 2344568888866 88888999998888876 Q ss_pred hccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHh Q lcl|NC_011269. 192 SLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQ 271 (867) Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 271 (867) . |...++..|+|+.|.|.. -.+.++. +..+- ++ T Consensus 143 ~-G~~~~L~~i~~~~v~v~~---d~~g~~~-----------------------------------------~~~~~--~~ 175 (441) T protein:vir:79 143 T-GEPMNLTFRKTSEIELKS---DARGRLY-----------------------------------------YFHQR--ID 175 (441) T ss_pred C-CcEEEEEEEcCceeEEEE---CCCccEE-----------------------------------------EEEEE--ec Confidence 5 568889999999998862 1111111 00000 00 Q ss_pred hhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCC Q lcl|NC_011269. 272 AAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPD 351 (867) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 351 (867) ....+.-..++..-|-|+++-.. =.-.|.+.+..+.++|.......+......++-.+|--++++-+ -+-+ T Consensus 176 ~~~~~~~~~~~~~dvih~k~~~~-dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--------~~~~ 246 (441) T protein:vir:79 176 SNGNNIERNVKFEDMLDIKFYSL-DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG--------VLDN 246 (441) T ss_pred cCCceeEEEEccccEEEeccCCC-CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCC--------CCCC Confidence 00001112355556778875321 12369999888889988877777777777788888888887753 1456 Q ss_pred HHHHHHHHHHHHHhhhc-c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhh Q lcl|NC_011269. 352 QGELDEVRDDMQSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSAL 428 (867) Q Consensus 352 ~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 428 (867) .+..+.+|+.|+..+.- + ...+|-.-|++++.....-+...+-+-.+...++|.+++||.-.++. .+..+|+..+. T Consensus 247 ~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg-~~~~~~s~~q~ 325 (441) T protein:vir:79 247 KKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFG-IETANMSITDA 325 (441) T ss_pred HHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcC-CCCCCccHHHH Confidence 77788899988777653 3 35677788888887764433222223335567889999999999984 67777776666 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccc Q lcl|NC_011269. 429 NREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNL 508 (867) Q Consensus 429 ~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~L 508 (867) ++.++ +-+.-+..+|++.+.+++-.. ...+.++|-+..+.-.|..+. -+..+..+.....- ..|+.-- +.| T Consensus 326 ~~~~~-~tl~P~~~~ie~eln~kl~~~-----~~~~~~~fd~~~llr~D~~~~-~~~~~~~i~~G~~T-~NE~R~~-~gl 396 (441) T protein:vir:79 326 NLDYL-STLKPYITCVCAELNFKFNDE-----YVNREFKFDTTEIRVVDEKTQ-AEIDKINIDSGKMN-IDEIRQR-DGL 396 (441) T ss_pred HHHHH-HHHHHHHHHHHHHHhhhcccc-----ccCceEEeechhhhccCHHHH-HHHHHHHHhCCCcC-HHHHHHH-hCC Confidence 66544 245555555555555554211 123344443333322222111 11111111111000 1111100 011 Q ss_pred cchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhccccccccccccc Q lcl|NC_011269. 509 RDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCD 571 (867) Q Consensus 509 r~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p 571 (867) - +..+.-.+ ...+.... ..+++. +.....+..+ ....++.-+.. + T Consensus 397 ~---Pi~ggd~~----~~~~~~n~--~~~~~~-~~~~~~~~~~-------~~~~~kgGe~~-e 441 (441) T protein:vir:79 397 A---PIPGGNGS----IHRVDLNH--VNIELV-DEYQMNKSRA-------TDKKLKGGEEN-E 441 (441) T ss_pred C---CCCCCCcc----eEeecccc--cccccc-cccccccccc-------cccccCCCCCC-C Confidence 0 00000000 00111111 111110 0000000000 00011100000 0 No 59 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=98.05 E-value=7.4e-07 Score=54.20 Aligned_cols=426 Identities=11% Similarity=0.014 Sum_probs=181.9 Q ss_pred CCc-hHHHHHHHHhhhcchhHHHHHHHHhcccccccc--eeeccchhh-hhhhhhHHhhCC---CchhhhHHHHHHHHHH Q lcl|NC_011269. 44 VDN-KPLIDYFQGRRRAAEANRQRLASYRKQGNFGSN--MQIAMPKIR-QPLGTLADKGIP---FNVEDEEELRVIRHWC 116 (867) Q Consensus 44 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 116 (867) |-- |+ --||..-+ .-.+|+.- |.=-|-|-.. =.+..|..- +.+.+...-... .++..+..| T Consensus 1 ~~~~~~-~~~~~~~~-~~~~~~~~---~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al------- 68 (441) T protein:vir:94 1 MHWYNT-DCYFVDFK-SRKQSRKE---LVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAI------- 68 (441) T ss_pred CccccC-cccccccc-ccccchhh---hhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhh------- Confidence 100 00 00111000 11122211 1111111000 000011000 000000000000 011111111 Q ss_pred HHHhhccchHHHHHHhhhh-cccccceecccchhHHHHHHHHhhc----ccccHHHHhHHHHHHHHhhhhhhcchhhhhh Q lcl|NC_011269. 117 RLFYATHDLVPLLIDIYSK-FPVVGMEFDSKDPLIKTFYEDLFFG----EDLNYLEFLPDQFAREYFTVGEVTSLAHFNE 191 (867) Q Consensus 117 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (867) .++-|=.||++-++ ..-..+++..+....++-..+..|- +..+=.+|+..++ ..++.-|+.+-+...|. T Consensus 69 -----~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gnay~~i~r~~ 142 (441) T protein:vir:94 69 -----RHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVF-VSALLTSHGYIEITRDK 142 (441) T ss_pred -----ccHHHHHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHH-HHHhhcCCeEEEEEECC Confidence 12334456655433 1111122222222222222222222 2344568888866 88888999998888876 Q ss_pred hccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHh Q lcl|NC_011269. 192 SLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQ 271 (867) Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 271 (867) . |...++..|+|+.|.|.. -.+.++. +..+- ++ T Consensus 143 ~-G~~~~L~~i~~~~v~v~~---d~~g~~~-----------------------------------------~~~~~--~~ 175 (441) T protein:vir:94 143 T-GEPMNLTFRKTSEIELKS---DARGRLY-----------------------------------------YFHQR--ID 175 (441) T ss_pred C-CcEEEEEEEcCceeEEEE---CCCccEE-----------------------------------------EEEEE--ec Confidence 5 568889999999998862 1111111 00000 00 Q ss_pred hhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCC Q lcl|NC_011269. 272 AAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPD 351 (867) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 351 (867) ....+.-..++..-|-|+++-.. =.-.|.+.+..+.++|.......+......++-.+|--++++-+ -+-+ T Consensus 176 ~~~~~~~~~~~~~dvih~k~~~~-dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--------~~~~ 246 (441) T protein:vir:94 176 SNGNNIERNVKFEDMLDIKFYSL-DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG--------VLDN 246 (441) T ss_pred cCCceeEEEEccccEEEeccCCC-CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCC--------CCCC Confidence 00001112355556778875321 12369999888889988877777777777788888888887753 1456 Q ss_pred HHHHHHHHHHHHHhhhc-c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhh Q lcl|NC_011269. 352 QGELDEVRDDMQSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSAL 428 (867) Q Consensus 352 ~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 428 (867) .+..+.+|+.|+..+.- + ...+|-.-|++++.....-+...+-+-.+...++|.+++||.-.++. .+..+|+..+. T Consensus 247 ~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg-~~~~~~s~~q~ 325 (441) T protein:vir:94 247 KKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFG-IETANMSITDA 325 (441) T ss_pred HHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcC-CCCCCccHHHH Confidence 77788899988777653 3 35677788888887764433222223335567889999999999984 67777776666 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccc Q lcl|NC_011269. 429 NREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNL 508 (867) Q Consensus 429 ~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~L 508 (867) ++.++ +-+.-+..+|++.+.+++-.. ...+.++|-+..+.-.|..+. -+..+..+.....- ..|+.-- +.| T Consensus 326 ~~~~~-~tl~P~~~~ie~eln~kl~~~-----~~~~~~~fd~~~llr~D~~~~-~~~~~~~i~~G~~T-~NE~R~~-~gl 396 (441) T protein:vir:94 326 NLDYL-STLKPYITCVCAELNFKFNDE-----YVNREFKFDTTEIRVVDEKTQ-AEIDKINIDSGKMN-IDEIRQR-DGL 396 (441) T ss_pred HHHHH-HHHHHHHHHHHHHHhhhcccc-----ccCceEEeechhhhccCHHHH-HHHHHHHHhCCCcC-HHHHHHH-hCC Confidence 66544 245555555555555554211 123344443333322222111 11111111111000 1111100 011 Q ss_pred cchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhccccccccccccc Q lcl|NC_011269. 509 RDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCD 571 (867) Q Consensus 509 r~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p 571 (867) - +..+.-.+ ...+.... ..+++. +.....+..+ ....++.-+.. + T Consensus 397 ~---Pi~ggd~~----~~~~~~n~--~~~~~~-~~~~~~~~~~-------~~~~~kgGe~~-e 441 (441) T protein:vir:94 397 A---PIPGGNGS----IHRVDLNH--VNIELV-DEYQMNKSRA-------TDKKLKGGEEN-E 441 (441) T ss_pred C---CCCCCCcc----eEeecccc--cccccc-cccccccccc-------cccccCCCCCC-C Confidence 0 00000000 00111111 111110 0000000000 00011100000 0 No 60 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=98.05 E-value=2.6e-06 Score=51.16 Aligned_cols=413 Identities=9% Similarity=0.010 Sum_probs=174.7 Q ss_pred Hhcccccccce--------eeccchh---hhhhhhh-HHhhCCCchhhhHHHHHHHHHHHHH--hhccchHHHHHHhhhh Q lcl|NC_011269. 70 YRKQGNFGSNM--------QIAMPKI---RQPLGTL-ADKGIPFNVEDEEELRVIRHWCRLF--YATHDLVPLLIDIYSK 135 (867) Q Consensus 70 ~~~~~~~~~~~--------~~~~~~~---~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 135 (867) |+|..+-.-.+ .-+-|++ +.+..+. .+-.+|.+. ..+|+.. .=|++| ..+.+-|..||+-- + T Consensus 1 m~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~--~~iLr~~-~~~~ly~~m~~D~hi~s~l~~R-k 76 (448) T protein:vir:79 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREF--DELLQGK-DGLLVYHKMLSDGTVKNALNYI-F 76 (448) T ss_pred CCCCCCCCccccCcccccccccchhhhhhhhhhcccccccccccch--hHhhccc-cchHHHHHHhhChHHHHHHHHH-H Confidence 66655432111 0011111 0110000 011122211 1122211 002222 12356677888654 4 Q ss_pred cccccceecc----cchh---HHHHHHHHhhcc-----cccHHHHhHHHHHHHHhhhhhhcchhhh--h-hhcccee--h Q lcl|NC_011269. 136 FPVVGMEFDS----KDPL---IKTFYEDLFFGE-----DLNYLEFLPDQFAREYFTVGEVTSLAHF--N-ESLGVWS--S 198 (867) Q Consensus 136 ~~~~~~~~~~----~~~~---~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~--~ 198 (867) -+|.+.++.. +++. +-+|..+.+.+. +++..++|.|+.- -++=-+|+... + ..+|.|. . T Consensus 77 ~av~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~~ld----a~~~G~s~~Eivw~~~~~g~~~~~~ 152 (448) T protein:vir:79 77 GRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYEN----AYIYGMAAGEIVLTLGADGKLILDK 152 (448) T ss_pred HHHhcCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHHHHH----hhhhcceeEEEEeeecCCCceeccc Confidence 5777744433 2232 444544443322 3345566665431 11111222111 1 1233332 2 Q ss_pred heecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCC Q lcl|NC_011269. 199 EEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDG 278 (867) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (867) ++...|..+ .+-.|.-+....+. .+.|++ .......++ T Consensus 153 l~~r~~~~~--~~f~~~~d~~l~~~-------~~~~~~---------------------------------~~~~~~~~~ 190 (448) T protein:vir:79 153 IVPIHPFNI--DEVLYDEEGGPKAL-------KLSGEV---------------------------------KGGSQFVSG 190 (448) T ss_pred ccccCCccc--cceeeecCCceEEe-------ecCCcc---------------------------------cccccCCCc Confidence 222222211 11112222211100 000100 011222345 Q ss_pred CcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHH Q lcl|NC_011269. 279 LDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEV 358 (867) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 358 (867) +.|+..-+.|..+..+- .+.|..++..||..-+.+.-.-+-....+.|+-.|+||.|... |.- -+.++++.+ T Consensus 191 ~~lP~~~~i~~~~~~~g-~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~-----ga~--~~~~~~~~l 262 (448) T protein:vir:79 191 LEIPIWKTVVFLHNDDG-SFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPK-----SVR--QGTKQWEAA 262 (448) T ss_pred cccccceEEEEecCccC-CcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCC-----CCC--cCHHHHHHH Confidence 66665555444443333 5779999999999999999999999999999999999998752 211 234455555 Q ss_pred HHHHHHhhhcc-hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhh-HHHHHHH Q lcl|NC_011269. 359 RDDMQSLLAAD-FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALN-REFVTQI 436 (867) Q Consensus 359 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~ 436 (867) .+-++++ .++ .--+|.--|.++|.+.+.++...-+.=++...++|-.++ +|+-|+|++.|+.|+.+.-. -+...+. T Consensus 263 ~~av~~i-~~g~~a~~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk~i-LGqtlTs~~~~g~~~~~~~~~~~v~~~~ 340 (448) T protein:vir:79 263 KEIVKNF-VQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARAL-GIDFNTVQLNMGVQAINIGEFVSLTQQT 340 (448) T ss_pred HHHHHHH-hcCCceEEEecCCceEEEEecCCCcccHHHHHHHHHHHHHHHH-hhhhhccccccchhhhhhhhHHHHHHHH Confidence 4422222 222 122456677888888887776666666677888998888 89999997777666554311 2333233 Q ss_pred HHHHHHHHHHHHhh-hhHHHHHhhcccchhe-ehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhh Q lcl|NC_011269. 437 MTGFQNALKRHIRR-RCEVVAEAQGHYDYDL-KGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQE 514 (867) Q Consensus 437 ~~~~~~~l~~~~r~-~~~~i~e~q~~~d~~~-~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~ 514 (867) .-.-...|..++.+ +++++.++|-..+.-. ++.|... +++|-+..-..+........++ T Consensus 341 ~~aDa~~i~~tln~~li~~l~~lNfg~~~~~P~~~f~~~-------------e~~Dl~~~a~~~~~l~~~~~~~------ 401 (448) T protein:vir:79 341 IISLQREFASAVNLYLIPKLVLPNWPSATRFPRLTFEME-------------ERNDFSAAANLMGMLINAVKDS------ 401 (448) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcCCCcEEEecCC-------------ChHHHHHHHHHhhhhhccchhh------ Confidence 44456778888875 6799999994333211 1111111 1111111111111100000000 Q ss_pred hhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccc Q lcl|NC_011269. 515 RAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQD 568 (867) Q Consensus 515 ~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~ 568 (867) +..+.+ ....+ ++.-+-.......+. ...+.........--..+... T Consensus 402 ~~~~~~--~~~~p--~~~~~~~~~a~~~~~---~~~~~~~~~~~~~~~~~~~~~ 448 (448) T protein:vir:79 402 EDIPTE--LKALI--DALPSKMRRALGVVD---EVREAVRQPADSRYLYTRRRR 448 (448) T ss_pred HHHHHH--hhcCC--CCCCCccccccCCCC---cccccccCCccccchhhcccC Confidence 000000 01111 111000000000000 000000000000000000000 No 61 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=98.02 E-value=2.4e-06 Score=51.43 Aligned_cols=449 Identities=13% Similarity=0.103 Sum_probs=197.8 Q ss_pred cchhhhhhhhhHH------------------------hhC-CCchhhhHHHH-----HHHHHHHHH---hhccchHHHHH Q lcl|NC_011269. 84 MPKIRQPLGTLAD------------------------KGI-PFNVEDEEELR-----VIRHWCRLF---YATHDLVPLLI 130 (867) Q Consensus 84 ~~~~~~~~~~~~~------------------------~~~-~~~~~~~~~~~-----~~~~~~~~~---~~~~~~~~~~~ 130 (867) |++|+-+.|.... .+| |..+. .-|+ ++++-|++| ....+-|..|| T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~--~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~l 78 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLA--HILIEAEQGHLQAQAELFMDMEERDAHLFAEM 78 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHH--HHHHhhhCCCHHHHHHHHHHHHhhChHHHHHH Confidence 3333322221110 011 11111 1122 344445544 12466677777 Q ss_pred Hhhhhcccccce--ec------ccchhHHHHHHHHhhccccc-HHHHhHHHHHHHHhhhhhhcchhh--hhhhcccee-- Q lcl|NC_011269. 131 DIYSKFPVVGME--FD------SKDPLIKTFYEDLFFGEDLN-YLEFLPDQFAREYFTVGEVTSLAH--FNESLGVWS-- 197 (867) Q Consensus 131 ~~~~~~~~~~~~--~~------~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-- 197 (867) +. ++-+|.+.+ |. -.|.-+.+|.++++. +++ +..+|.|+.-- ++--+++.. +..++|.|. T Consensus 79 ~~-Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~--~~~~f~~~i~~~lda----~~~G~s~~Ei~w~~~~g~~~~~ 151 (528) T protein:vir:10 79 SK-RKRAVLGLDWTIEPPRNASAAEKADAEYLHELLL--DLEGIEDLMLDCMDG----VGHGYSAIELDWSLQGREWLPQ 151 (528) T ss_pred HH-HHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHh--CCccHHHHHHHHHhh----hhhcceeEEEEEeecCCceeEE Confidence 75 445666643 32 123456778888776 663 66777765411 111122221 344455554 Q ss_pred hheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCC Q lcl|NC_011269. 198 SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQND 277 (867) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (867) .+...+|..+.+. ......|. ++. ...+ T Consensus 152 ~~~~r~~~~f~~~-----~~~~~~l~-------~~~----------------------------------------~~~~ 179 (528) T protein:vir:10 152 AFDHRPQSWFQLN-----PDDQDELR-------LRD----------------------------------------NSIA 179 (528) T ss_pred Eeeeecccceeec-----cCCCcEEe-------ccC----------------------------------------CCCC Confidence 4455555544332 11110000 000 1123 Q ss_pred CCcccH-HHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHH Q lcl|NC_011269. 278 GLDISE-ALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELD 356 (867) Q Consensus 278 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 356 (867) |+.|+. ..|.|+ |+.+.=.+.|..++..|+...+.|...-+--...+.|+-.|+||.|.+. | =+++|.+ T Consensus 180 g~~l~~~k~iv~~-~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~-----~----a~~~ek~ 249 (528) T protein:vir:10 180 GEVLQPFGWIMHK-PRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPP-----G----TPDEEKV 249 (528) T ss_pred ceeecCCCeEEEe-ecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCC-----C----CCHHHHH Confidence 445543 345554 5666566679999999999999999988889999999999999999873 1 1455666 Q ss_pred HHHHHHHHhhhcchhhhhhhhheeeeeccccCccCc-hhHHHHHHHHHHHHhhccchhhhcC-CC--ccceehhhhhHHH Q lcl|NC_011269. 357 EVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPN-LDADYDRIERKLLQAWGIGEALISG-GT--GGAYASSALNREF 432 (867) Q Consensus 357 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~-g~--~~~~~~~~~~~~~ 432 (867) .+-+.+.++ ..|-.++ .--|-.||.+...+.-.. -+.=++...++|..++ +|+-|+|- |+ |++|+-+.|-.+. T Consensus 250 ~L~~al~~i-~~~~~~i-iP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGqtlTs~~~~g~~gS~Alg~vh~~v 326 (528) T protein:vir:10 250 TLLRAVTGL-GHAAAGI-IPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAI-LGGTLTSQTSESGGGAYALGQVHNEV 326 (528) T ss_pred HHHHHHHHH-hhCcEEE-ecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHH-hhhhhhccccccccchhhhHHHHHHH Confidence 666555443 2233322 223456666654332222 2455777888999998 99999883 33 5889988888887 Q ss_pred HHHHHHHHHHHHHHHHhh-hhHHHHHhhcccchh----eehhhccccchh---hhhhhhhhhhhHhhhhhhhhhhhhccc Q lcl|NC_011269. 433 VTQIMTGFQNALKRHIRR-RCEVVAEAQGHYDYD----LKGGVRVPIYRE---IVEYDEETGQEYIRKVPKLLIPEIKFS 504 (867) Q Consensus 433 ~~~~~~~~~~~l~~~~r~-~~~~i~e~q~~~d~~----~~~~~~~~~~rd---~~~~k~e~~k~~~r~~~k~i~~~i~~~ 504 (867) ...+.-.....|..++++ +++++.++|...... .++.|.....-| ..+..+++.....+....-+..+.... T Consensus 327 ~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gip 406 (528) T protein:vir:10 327 RHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQVPVNWVQEQLGIP 406 (528) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCCCCCHHHHHHHhCCC Confidence 777788888999999965 779999999755321 222333322222 223333332222222211121222111 Q ss_pred -----cccccchhhhhhhh-hhhhhcee-eeeccccCC-Ccccccch--------hhhhhHHHH---HHHHhhccccccc Q lcl|NC_011269. 505 -----TLNLRDEAQERAFI-AQLKGMGV-PVSDKTLAV-NIDMKFDQ--------ELERQADET---VQKLMATAQAMKK 565 (867) Q Consensus 505 -----~~~Lr~e~~~~~~v-~qL~~~~~-pitd~t~p~-tiqme~E~--------e~e~k~~E~---l~tL~~taet~kk 565 (867) +..+.+.......- ........ -.+....+. ..+...+. ..+...... +..+...+.+... T Consensus 407 ~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~l~~i~~~l~~~~s~ee 486 (528) T protein:vir:10 407 LPANGEAVLGDQAGAGIAQLSRRPGPRIAALAQVIGPRYRDQEALDQVLASLPAQDMQNQADSLVAPLLDVISRGGSEAE 486 (528) T ss_pred CCCCCcccccCCCcccccccCcccccccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHH Confidence 11111111000000 00000000 000000000 00001110 001111111 1111222222222 Q ss_pred ccccccccCCCCC-ccccccccccccCCCCCCCCCCCCCCCC Q lcl|NC_011269. 566 VQDLCDAQNLPYP-PELAQHLQSTLALRQGKTQTELGEAQAV 606 (867) Q Consensus 566 vq~~~p~~g~P~p-p~~aQ~p~~t~~~a~gpgq~~~~qa~~~ 606 (867) +...-...-.... ..+.+........+...+.-....-... T Consensus 487 ~~~~L~~l~~~~d~~~l~~~l~~a~~~A~l~G~~~~~~e~~~ 528 (528) T protein:vir:10 487 LLGALAEAFPDMDDSALADALHRLLFVADTWGRLNGTLDRID 528 (528) T ss_pred HHHHHHHHhhcCCHHHHHHHHHHHHHHHHHhhhhhccccccC Confidence 1111110000000 1111111100000000000000000000 No 62 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=98.02 E-value=7.2e-07 Score=54.26 Aligned_cols=418 Identities=13% Similarity=0.126 Sum_probs=192.8 Q ss_pred hhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhh-hhHHhhCCCchhhhHHHH Q lcl|NC_011269. 32 MARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLG-TLADKGIPFNVEDEEELR 110 (867) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 110 (867) |++.+.- .+....+.++|..+ ...+.+ +++.-+.|...++ .....+..-+++ . T Consensus 1 ~~~~l~~--------------~~~~~~~~~~~~~~--~~~~~~-----~~~~~~~~~~~~~g~~~~~g~~v~~~--~--- 54 (434) T protein:vir:43 1 MSKSLGK--------------VLSSATSAPRSSLF--GWGGKT-----IRLTDGAFWSQFLGRESSSGKKVTVD--K--- 54 (434) T ss_pred Cccchhh--------------hhhhcccccchhhh--cccccc-----cccCchHHHHHHhcCCccCCceechh--h--- Confidence 2111100 01111122222111 111111 2222233322221 111112222221 1 Q ss_pred HHHHHHHHHhhccchHHHHHHhhh----hcccccceecccch--hHHHHHH-HHhh---cccccHHHHhHHHHHHHHhhh Q lcl|NC_011269. 111 VIRHWCRLFYATHDLVPLLIDIYS----KFPVVGMEFDSKDP--LIKTFYE-DLFF---GEDLNYLEFLPDQFAREYFTV 180 (867) Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~--~~~~~~~-~~~~---~~~~~~~~~~~~~~~~~~~~~ 180 (867) + -+++-|-.||++-+ +.|+.=++.+.+.. .+++-.. +++. .+..+-.+|+..++ ..++.- T Consensus 55 a---------l~~~~V~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~lll~ 124 (434) T protein:vir:43 55 A---------MKLSAVWACVRLISTSVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMV-ASMLLW 124 (434) T ss_pred h---------hccHHHHHHHHHHHHhhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHH-HHHhhc Confidence 1 22445666777654 33333233332221 1122112 2221 24456678999877 888999 Q ss_pred hhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHH Q lcl|NC_011269. 181 GEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQ 260 (867) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 260 (867) |+.+-+.+ + ..|...++..|+|+.|.|.+. .+..+. T Consensus 125 Gnay~~i~-~-~~G~~~~L~~l~p~~v~~~~~---~~g~~~--------------------------------------- 160 (434) T protein:vir:43 125 GNAYAEIR-R-AAGRPAALDFLLPSRVDLECD---ENGRLK--------------------------------------- 160 (434) T ss_pred CCeEEEEE-e-CCCcEEEEEEEcCcceEEEEc---CCCeEE--------------------------------------- Confidence 99876655 3 357778899999999988631 111100 Q ss_pred HHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhccc Q lcl|NC_011269. 261 DLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIE 340 (867) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 340 (867) |+. ....|....++..-|-|+.+-... -..|.+.+..+..+|-......+....+.++-..|--++++.. T Consensus 161 --y~~------~~~~g~~~~~~~~eVih~~~~~~d-g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~- 230 (434) T protein:vir:43 161 --YFY------TTKKGARREIERTNMLHIPAFTLD-GRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDR- 230 (434) T ss_pred --EEE------EecCceEEEEccccEEEecCcCCC-CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCC- Confidence 111 112344567788888898764322 2359998888888887777666666667777777776677653 Q ss_pred ccCCCCcCCCCHHHHHHHHHHHHHhhhcch--hhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCC Q lcl|NC_011269. 341 DMGDGEPWIPDQGELDEVRDDMQSLLAADF--RLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGG 418 (867) Q Consensus 341 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 418 (867) .-+.+..+.+|+.++...-+++ ..+|-.-|++.+.+...-+..-+-+-.+...++|.+++||.-.+|... T Consensus 231 --------~l~~e~~~~~r~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~ 302 (434) T protein:vir:43 231 --------ILQPAQREEFREYVKSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQT 302 (434) T ss_pred --------CCCHHHHHHHHHHHHHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC Confidence 1356677888888777655553 455556688888876443333333445677889999999999999655 Q ss_pred Cccc--eeh-hhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhh Q lcl|NC_011269. 419 TGGA--YAS-SALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPK 495 (867) Q Consensus 419 ~~~~--~~~-~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k 495 (867) ++++ |++ .+....|+..-+.-+...|++.+.+++-.-.|.. .+.|+|-+..+.-.|..+.-+...+. +.. T Consensus 303 ~~~~~~~s~~e~~~~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~---~~~~~fd~~~llr~d~~~r~~~~~~~-~~~--- 375 (434) T protein:vir:43 303 DKGSNWGTGLEQQMLAFLTFSISSITNQIQQCVNKRLLTAPERI---RYYAEFSLEGFLKADSAGRAAWYSTM-AQN--- 375 (434) T ss_pred cCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhc---CceEEEechhhhccCHHHHHHHHHHH-HhC--- Confidence 5554 554 2445556666666677777777766653323322 23344433333222222211111111 110 Q ss_pred hhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCC Q lcl|NC_011269. 496 LLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNL 575 (867) Q Consensus 496 ~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~ 575 (867) +-+..+.-+...+ +..++. +..+.-+..-..++. .+...+.+. . + .......+. T Consensus 376 --------G~~T~NE~R~~~g-l~p~~g-gD~~~~~~n~~~~~~-~~~~~~~~~---------~----~--~~~~~~~~~ 429 (434) T protein:vir:43 376 --------GFMTRNEGRRKEN-LPELPG-GDILTVQSNLVPIDQ-LGQSNKSQA---------V----R--AALMNWFSQ 429 (434) T ss_pred --------CCcCHHHHHHHhC-CCCCCC-CCeEeeccCccchhh-hhccCCCcc---------h----h--hhhhccCCC Confidence 1111111111111 111100 000000000001100 000000000 0 0 000001111 Q ss_pred CCCcc Q lcl|NC_011269. 576 PYPPE 580 (867) Q Consensus 576 P~pp~ 580 (867) |.+.+ T Consensus 430 ~~~~~ 434 (434) T protein:vir:43 430 PEPQE 434 (434) T ss_pred CCCCC Confidence 11112 No 63 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=98.01 E-value=5.2e-07 Score=55.03 Aligned_cols=405 Identities=10% Similarity=0.037 Sum_probs=177.0 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh-c Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK-F 136 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 136 (867) -|+-..+. -|+...-.++.+. +..-++...... .-.+.++. +++.. -|=.||++.++ . T Consensus 1 Mg~f~~~~----~r~~~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~---al~~~---------~v~~cv~~Ia~~i 59 (416) T protein:vir:81 1 MGIFYKNE----KRDLQYNEDDLQM----MVQTLPGFQGTK-LRQYKDIE---AIRHS---------DIFTAVMMIASDL 59 (416) T ss_pred CCcccccc----cccccCCCcchhH----HHHHhccccccC-ccccchhh---hhcch---------HHHHHHHHHHHhh Confidence 11110000 0000000000000 000000000000 00111221 22222 23346655432 1 Q ss_pred ccccceecccchhHHHHHHHHhhc----ccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhh Q lcl|NC_011269. 137 PVVGMEFDSKDPLIKTFYEDLFFG----EDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRS 212 (867) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (867) .-..+++..+....++...+-.|. +..+-.+|+..++ ..++.-|+.+-+...|.. |...++.+|+|+.|.|.+. T Consensus 60 A~~p~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gna~~~i~r~~~-G~~~~L~~i~~~~v~v~~~ 137 (416) T protein:vir:81 60 ARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVF-VSALLTSHGYIEITRDKT-GEPMNLTFRKTSEIELKSD 137 (416) T ss_pred ccCceEEecCccccccchHHHHHhcccccCCCHHHHHHHHH-HHHhhcCCeEEEEEECCC-CcEEEEEEEcCceeEEEEC Confidence 112233333333333322222232 2344568988866 888889999988888764 6678899999999987521 Q ss_pred hhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhc Q lcl|NC_011269. 213 MFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNR 292 (867) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (867) .+.++. + .| ..++....+--..++..-|-||++- T Consensus 138 ---~~g~~~-----------------------------------------~-~~-~~~~~~~~~~~~~~~~~evihir~~ 171 (416) T protein:vir:81 138 ---ARGRLY-----------------------------------------Y-FH-QRIDSNGNNIERNVKFEDMLDIKFY 171 (416) T ss_pred ---CCccEE-----------------------------------------E-EE-EEecCCCceeEEEEccccEEEeccC Confidence 111111 0 00 0000000000123444556677653 Q ss_pred CccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhc-c-- Q lcl|NC_011269. 293 PTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-D-- 369 (867) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-- 369 (867) . .=.-.|.+.+..+.++|-......+......++-.+|--++++.+ .+-+++..+.+|+.++..+.- + T Consensus 172 ~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~--------~~~~~~~~~~~~~~~~~~~~g~~na 242 (416) T protein:vir:81 172 S-LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG--------VLDNKKARDRAREEFHKSFSGTKQA 242 (416) T ss_pred C-CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCC--------CCCCHHHHHHHHHHHHHHhcCcccc Confidence 2 112369999999999998888888888888888888888888863 145677889999988887764 3 Q ss_pred hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011269. 370 FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIR 449 (867) Q Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r 449 (867) -..+|-.-|++++.....-+...+-.-.++..++|.+++||.-.++. ++..+|+..+.++.++ +-+.-+...|++.+- T Consensus 243 g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg-~~~~~~~~~~~~~~~~-~~l~P~~~~ie~~ln 320 (416) T protein:vir:81 243 GKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFG-IETANMSITDANLDYL-STLKPYITCVCAELN 320 (416) T ss_pred CceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcC-CCCCCccHHHHHHHHH-HHHHHHHHHHHHHHh Confidence 24677778888887764433222223345567899999999999985 6666666555555432 233334444444444 Q ss_pred hhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhc---ee Q lcl|NC_011269. 450 RRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGM---GV 526 (867) Q Consensus 450 ~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~---~~ 526 (867) +++-.. ...+.++|-+..+...|..+.-+-. +..++. +-+..+.-+...+ +..++.- .. T Consensus 321 ~~l~~~-----~~~~~~~f~~~~l~~~D~~~~~~~~-~~~~~~-----------G~~T~NE~R~~~g-l~p~~~gd~~~~ 382 (416) T protein:vir:81 321 FKFNDE-----YVNREFKFDTTEIRVVDEKTQAEID-KINIDS-----------GKMNIDEIRQRDG-LAPIPGGNGSIH 382 (416) T ss_pred hhcccc-----ccCceEEEechhhhccCHHHHHHHH-HHHHhC-----------CCcCHHHHHHHhC-CCCCCCCCcceE Confidence 443111 1134455443333222221111111 111111 0011100000000 0000000 00 Q ss_pred eeeccccCCCcccccchhhhhhHHHHHHHHhhccccccccccccc Q lcl|NC_011269. 527 PVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCD 571 (867) Q Consensus 527 pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p 571 (867) .+.....+.+ + .+.....+..+....+..- +.. + T Consensus 383 ~~~~n~~~~~--~-~~~~~~~~~~~~~~~~kgG-------e~n-~ 416 (416) T protein:vir:81 383 RVDLNHVNIE--L-VDEYQMNKSRATDKKLKGG-------EEN-E 416 (416) T ss_pred eecccccccc--c-ccccCcccccccccccCCC-------CCC-C Confidence 1111111111 1 0000000000000001000 000 0 No 64 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=98.01 E-value=5.2e-07 Score=55.03 Aligned_cols=405 Identities=10% Similarity=0.037 Sum_probs=177.0 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh-c Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK-F 136 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 136 (867) -|+-..+. -|+...-.++.+. +..-++...... .-.+.++. +++.. -|=.||++.++ . T Consensus 1 Mg~f~~~~----~r~~~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~---al~~~---------~v~~cv~~Ia~~i 59 (416) T protein:vir:45 1 MGIFYKNE----KRDLQYNEDDLQM----MVQTLPGFQGTK-LRQYKDIE---AIRHS---------DIFTAVMMIASDL 59 (416) T ss_pred CCcccccc----cccccCCCcchhH----HHHHhccccccC-ccccchhh---hhcch---------HHHHHHHHHHHhh Confidence 11110000 0000000000000 000000000000 00111221 22222 23346655432 1 Q ss_pred ccccceecccchhHHHHHHHHhhc----ccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhh Q lcl|NC_011269. 137 PVVGMEFDSKDPLIKTFYEDLFFG----EDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRS 212 (867) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (867) .-..+++..+....++...+-.|. +..+-.+|+..++ ..++.-|+.+-+...|.. |...++.+|+|+.|.|.+. T Consensus 60 A~~p~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gna~~~i~r~~~-G~~~~L~~i~~~~v~v~~~ 137 (416) T protein:vir:45 60 ARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVF-VSALLTSHGYIEITRDKT-GEPMNLTFRKTSEIELKSD 137 (416) T ss_pred ccCceEEecCccccccchHHHHHhcccccCCCHHHHHHHHH-HHHhhcCCeEEEEEECCC-CcEEEEEEEcCceeEEEEC Confidence 112233333333333322222232 2344568988866 888889999988888764 6678899999999987521 Q ss_pred hhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhc Q lcl|NC_011269. 213 MFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNR 292 (867) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (867) .+.++. + .| ..++....+--..++..-|-||++- T Consensus 138 ---~~g~~~-----------------------------------------~-~~-~~~~~~~~~~~~~~~~~evihir~~ 171 (416) T protein:vir:45 138 ---ARGRLY-----------------------------------------Y-FH-QRIDSNGNNIERNVKFEDMLDIKFY 171 (416) T ss_pred ---CCccEE-----------------------------------------E-EE-EEecCCCceeEEEEccccEEEeccC Confidence 111111 0 00 0000000000123444556677653 Q ss_pred CccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhc-c-- Q lcl|NC_011269. 293 PTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-D-- 369 (867) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-- 369 (867) . .=.-.|.+.+..+.++|-......+......++-.+|--++++.+ .+-+++..+.+|+.++..+.- + T Consensus 172 ~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~--------~~~~~~~~~~~~~~~~~~~~g~~na 242 (416) T protein:vir:45 172 S-LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG--------VLDNKKARDRAREEFHKSFSGTKQA 242 (416) T ss_pred C-CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCC--------CCCCHHHHHHHHHHHHHHhcCcccc Confidence 2 112369999999999998888888888888888888888888863 145677889999988887764 3 Q ss_pred hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011269. 370 FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIR 449 (867) Q Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r 449 (867) -..+|-.-|++++.....-+...+-.-.++..++|.+++||.-.++. ++..+|+..+.++.++ +-+.-+...|++.+- T Consensus 243 g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg-~~~~~~~~~~~~~~~~-~~l~P~~~~ie~~ln 320 (416) T protein:vir:45 243 GKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFG-IETANMSITDANLDYL-STLKPYITCVCAELN 320 (416) T ss_pred CceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcC-CCCCCccHHHHHHHHH-HHHHHHHHHHHHHHh Confidence 24677778888887764433222223345567899999999999985 6666666555555432 233334444444444 Q ss_pred hhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhc---ee Q lcl|NC_011269. 450 RRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGM---GV 526 (867) Q Consensus 450 ~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~---~~ 526 (867) +++-.. ...+.++|-+..+...|..+.-+-. +..++. +-+..+.-+...+ +..++.- .. T Consensus 321 ~~l~~~-----~~~~~~~f~~~~l~~~D~~~~~~~~-~~~~~~-----------G~~T~NE~R~~~g-l~p~~~gd~~~~ 382 (416) T protein:vir:45 321 FKFNDE-----YVNREFKFDTTEIRVVDEKTQAEID-KINIDS-----------GKMNIDEIRQRDG-LAPIPGGNGSIH 382 (416) T ss_pred hhcccc-----ccCceEEEechhhhccCHHHHHHHH-HHHHhC-----------CCcCHHHHHHHhC-CCCCCCCCcceE Confidence 443111 1134455443333222221111111 111111 0011100000000 0000000 00 Q ss_pred eeeccccCCCcccccchhhhhhHHHHHHHHhhccccccccccccc Q lcl|NC_011269. 527 PVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCD 571 (867) Q Consensus 527 pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p 571 (867) .+.....+.+ + .+.....+..+....+..- +.. + T Consensus 383 ~~~~n~~~~~--~-~~~~~~~~~~~~~~~~kgG-------e~n-~ 416 (416) T protein:vir:45 383 RVDLNHVNIE--L-VDEYQMNKSRATDKKLKGG-------EEN-E 416 (416) T ss_pred eecccccccc--c-ccccCcccccccccccCCC-------CCC-C Confidence 1111111111 1 0000000000000001000 000 0 No 65 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=98.00 E-value=5.8e-07 Score=54.77 Aligned_cols=336 Identities=12% Similarity=0.061 Sum_probs=153.2 Q ss_pred HHHHHHHHHHhhcccccccccccc--ccccccchhhhhhhhhHHH--H--HHhchHHHhhhccCCCCcccHH-HHHHhhh Q lcl|NC_011269. 219 RVQLMVKDLVDHLRQGPTTAGGNM--STVEETPSEREQRMREFQD--L--QRRYPEIIQAAMQNDGLDISEA-LISRVVN 291 (867) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~--~--~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 291 (867) +.|+| =... ++.. -..+--|.++++|..-..+ | .++ ....+++++.|+.. .|.|+ | T Consensus 1 v~Eiv-----w~~~------~g~~~~~~l~~r~~~~~~~f~~~~~~~l~~~~~-----~~~~g~~~~~lp~~kfi~~~-~ 63 (355) T protein:vir:78 1 MFEQV-----YRIE------NGRARLGKLAWRPPRTISRFDVAPDGGLVAIEQ-----WGVFGKATVRIPVDRLVVFV-N 63 (355) T ss_pred CeEEE-----EEee------CCeEEEeeeeecCccceeeeeeccCCceeEEEe-----cCCCCCCcceeccCCEEEEE-e Confidence 22322 2111 1111 1122222222222110000 0 011 12456688888664 66666 5 Q ss_pred cCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHH----HH---HHHH Q lcl|NC_011269. 292 RPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEV----RD---DMQS 364 (867) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~---~~~~ 364 (867) ++..=.+.|..++..||...+.|+..-+-....+.|+-+|+.+.+.-. +.+ .+..+..++++. |+ ++-. T Consensus 64 ~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~---~~~-~~~~d~~~~~~~~~~~~~~l~~~~~ 139 (355) T protein:vir:78 64 EREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAP---LPE-AIARDTARAEQWLNDQKEEGLQLAK 139 (355) T ss_pred CCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecC---CCC-cccchhhhHHHHHHHHHHHHHHHHH Confidence 666556889999999999999999999999999999988777766532 111 113443333222 22 1222 Q ss_pred hhhcc-hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCC--CccceehhhhhHHHHHHHHHHHH Q lcl|NC_011269. 365 LLAAD-FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGG--TGGAYASSALNREFVTQIMTGFQ 441 (867) Q Consensus 365 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~~~~~~~~~~~~~~~~~~~~~~ 441 (867) .+..+ .--+|.-.|-++|.+.+.++...-+.=++...+.|..++ +|+-|+|++ .|++|+-+.|-.|...++.-... T Consensus 140 ~i~~g~~a~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~i-LGqtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~ 218 (355) T protein:vir:78 140 EFRAGEAAGGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAV-LAHFLTLGGDKSTGSYALGDTFASFFTGSLNAVM 218 (355) T ss_pred HhhCCcceeEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHH-hhhhhccccCCccchhhHHHHHHHHHHHHHHHHH Confidence 22223 123556667899999888887777777888899999999 999999855 36889999999999889899999 Q ss_pred HHHHHHHhh-hhHHHHHhhcccchh-eehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhh Q lcl|NC_011269. 442 NALKRHIRR-RCEVVAEAQGHYDYD-LKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIA 519 (867) Q Consensus 442 ~~l~~~~r~-~~~~i~e~q~~~d~~-~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~ 519 (867) ..|..++.+ +++++.++|-..+.- ..+.|.... .|- +..-..+.++....+.+.++. .+..+. T Consensus 219 ~~i~~~ln~~li~~l~~lN~~~~~~~P~~~~~~~~-~~~-------------~~~a~~~~~l~~~G~~~~~~~-~~~~~~ 283 (355) T protein:vir:78 219 KHIADVTQQHVVEDLVDQNWGPEEPAPRLVPAQLG-KEQ-------------PVTAEAIRALVECGAFTADPE-LEKDLR 283 (355) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCcC-hhH-------------HHHHHHHHHHHhCCCccccHH-HHHHHH Confidence 999999964 779999999544321 112221111 110 000001111111111111100 000011 Q ss_pred hhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccccccCCCCCCCCC Q lcl|NC_011269. 520 QLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLALRQGKTQTE 599 (867) Q Consensus 520 qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~a~gpgq~~ 599 (867) +. .+.+ .+........... +........... .................++...+ +. ....-.+.... T Consensus 284 e~--~gip--~p~~~~~~~~~~~---~~~~~~~~~~~~---~~~~~~~~~~a~~~~a~~~~~~~-~~--~~~~~~~~~~~ 350 (355) T protein:vir:78 284 AR--YGLP--APAERDDGADAAA---AKAAGRRRAKRL---PGQRQGAALPSRSPRADPPRRRG-PL--RRRPRHPAHRR 350 (355) T ss_pred HH--hCCC--CCCCCCcccCCcc---cccccccccccc---CCccccccccccCCCCCChhhhH-HH--HHHhhccccCC Confidence 00 0000 1000000000000 000000000000 00000000000000000000000 00 00000011111 Q ss_pred CCCCC Q lcl|NC_011269. 600 LGEAQ 604 (867) Q Consensus 600 ~~qa~ 604 (867) .+..+ T Consensus 351 ~~~~~ 355 (355) T protein:vir:78 351 CAPDG 355 (355) T ss_pred CCCCC Confidence 11111 No 66 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=97.99 E-value=1.6e-06 Score=52.36 Aligned_cols=401 Identities=11% Similarity=0.099 Sum_probs=178.6 Q ss_pred Hhcccccccceeecc-chhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh-cccccceecccc Q lcl|NC_011269. 70 YRKQGNFGSNMQIAM-PKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK-FPVVGMEFDSKD 147 (867) Q Consensus 70 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 147 (867) |.|...++-=-+..+ -.+.++.....+ +..|.-+.-. .-++. =|-+++.|-.|||+-++ ..-..|+.--+. T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~~~~~~---~~~~~~~~~~---~v~~~-~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~ 73 (409) T protein:vir:96 1 MAKENIVTRIKKKLIDNWIDQSASKLYD---FSPWKNKSFW---GVINN-TLETNETIFSAITKLSNSMASLPLKMYEDY 73 (409) T ss_pred CccccchhhhhhHHhhhhhccccccccc---cccccCcccc---ccchh-hHhhhHHHHHHHHHHHHhhhhCceEEeecc Confidence 444332221000000 001111111111 0001000000 00111 14467778888887643 111123332233 Q ss_pred hhHHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHH Q lcl|NC_011269. 148 PLIKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMV 224 (867) Q Consensus 148 ~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (867) +..+....+++. ....+-.+|+..++ ..++.-|+++-+..-|. .|...++..|+|+.|.|... .+ T Consensus 74 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gnay~~i~r~~-~G~~~~L~~l~~~~v~v~~~---~~------- 141 (409) T protein:vir:96 74 KVVNTEVSDLLTVSPNNSLSSFDFINQIE-TIRNEKGNAYVLIERDI-YHQPSKLFLLNPDVVEMLIE---NQ------- 141 (409) T ss_pred cccchhHHHHHhhhcccCCCHHHHHHHHH-HHHhhcCceEEEEEECC-CCcEEEEEEEcCceeEEEEe---CC------- Confidence 334443334432 12355678888866 78888899888776665 45577899999999987621 00 Q ss_pred HHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchh Q lcl|NC_011269. 225 KDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHL 304 (867) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 304 (867) ++.+ - |+. ....|....++..-|-|+++..+--.-.|.+.+ T Consensus 142 --------------~~~~---------------~----y~~------~~~~g~~~~~~~~evih~r~~~~~~~~~G~s~l 182 (409) T protein:vir:96 142 --------------SREL---------------Y----YSI------HAATGNKLIVHNMDMLHFKHIVASNMVQGISPI 182 (409) T ss_pred --------------CcEE---------------E----EEE------EcCCceEEEEccccEEEeCCCCCCCccccccHH Confidence 0000 0 000 011233345566678888765443344688877 Q ss_pred hHHHHHHHHHHHHHHHHHHHHhhhhchh-hhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeee Q lcl|NC_011269. 305 LRSFRTLMAEESLNAAQDAVADRLYSPL-VLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVEN 383 (867) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 383 (867) ..+-++|... +++++..++....+- .|.+.+ . .-++++.+.+|++|+.........+|-.-|++++. T Consensus 183 ~~~~~~i~~~---~~~~~~~~~~~~~~~~~i~~~~-------~--~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~ 250 (409) T protein:vir:96 183 DVLKNTTDFD---NAVRTFNLTEMQKPDSFMLKYG-------S--NVSTEKRQQVLEDFKQYYEENGGILFQEPGVEIEP 250 (409) T ss_pred HHHHHHHHHH---HHHHHHHHHhcCCCceeEEecC-------C--CCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEE Confidence 6555554332 334444444433331 122222 1 24788999999999887776667788888888888 Q ss_pred ccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhh-hHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhccc Q lcl|NC_011269. 384 VFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSAL-NREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHY 462 (867) Q Consensus 384 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~ 462 (867) ++..-+...+-.-.+...++|.+++||...++.+++.++|+++.- ...|+..-+.-+...|++.+.+.+=.-.|... T Consensus 251 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~-- 328 (409) T protein:vir:96 251 LPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREK-- 328 (409) T ss_pred cCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccC-- Confidence 865433322223334456889999999999998888889998643 33455555666666666666665532222211 Q ss_pred chheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhh-ceeeeeccccCCCccccc Q lcl|NC_011269. 463 DYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKG-MGVPVSDKTLAVNIDMKF 541 (867) Q Consensus 463 d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~-~~~pitd~t~p~tiqme~ 541 (867) .+.++|-+..+.--| .+.++ +. ..+..+. +-+..+.-+...+ +..++. -...++.-..+.. ... T Consensus 329 g~~i~fd~~~ll~~d---~~~~~-e~-~~~~~~~-------G~~T~NE~R~~~g-~~pi~ggD~~~~~~n~~~~~--~~~ 393 (409) T protein:vir:96 329 NRYFKFNVKSYLRAD---SATQA-EV-YFKAVRS-------GYYTINDIREWED-LPPVEGGDKPLISGDLYPID--TPL 393 (409) T ss_pred cceEEeechhhhccC---HHHHH-HH-HHHHHhC-------CCCCHHHHHHHhC-CCCCCCcceeeecccccccc--cch Confidence 122222221111111 12211 11 1111000 0011110000000 000000 0000110001110 000 Q ss_pred chhhhhhHHHHHHHHhhcccccccccccccccC Q lcl|NC_011269. 542 DQELERQADETVQKLMATAQAMKKVQDLCDAQN 574 (867) Q Consensus 542 E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g 574 (867) +.+ ..+ + -.+.-...+ T Consensus 394 ~~~---------~~~-------~-gG~~n~~e~ 409 (409) T protein:vir:96 394 ELR---------KSL-------K-GGDKNVNES 409 (409) T ss_pred hhc---------ccc-------c-CCCCCcCCC Confidence 000 000 0 000000000 No 67 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=97.96 E-value=6.2e-07 Score=54.62 Aligned_cols=408 Identities=9% Similarity=0.017 Sum_probs=187.8 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeec--------cchhhhhhhhhHHhhCCCchhhhHHHHHHHHH Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIA--------MPKIRQPLGTLADKGIPFNVEDEEELRVIRHW 115 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 115 (867) |. |.+.|+..++...+-.-|++...+-+ ...+... .|.+..=++.-..-++..+ ++. T Consensus 1 Mg---l~d~~r~~~~~~~~~~~~~~~~~~~~--~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~--~~~-------- 65 (431) T protein:vir:10 1 MG---LFDFIRREKQPEAQARPHVEPSFQAS--TPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGR--ETR-------- 65 (431) T ss_pred Cc---chhhhhcCcccccccccccccccccc--cccccccccccccccchHHHHhhccCccCcceec--hhh-------- Confidence 22 33333332221111111111000000 0000000 0000000000000011111 110 Q ss_pred HHHHhhccchHHHHHHhhhh-cccccceecccchhHH----HHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchh Q lcl|NC_011269. 116 CRLFYATHDLVPLLIDIYSK-FPVVGMEFDSKDPLIK----TFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLA 187 (867) Q Consensus 116 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~----~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (867) +-.++.|-.||++-++ +.-..+++.-+|+.-+ .-..+++- -+..+-.+|+..++ ..++.-|+.+-+. T Consensus 66 ----al~~~~V~~ci~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~-~~lll~Gna~~~i 140 (431) T protein:vir:10 66 ----ALRNMAVLRCVTLISGTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQ-LRALLDGESMARI 140 (431) T ss_pred ----hhccHHHHHHHHHHHHhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHH-HHHhhcCCeEEEE Confidence 1135778888887653 1112232222222111 11111211 12445567888855 7888889998887 Q ss_pred hhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhch Q lcl|NC_011269. 188 HFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYP 267 (867) Q Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 267 (867) +-|. |.-.++..|+|+.|.|... .+..+. |+- T Consensus 141 ~r~~--g~~~~L~pl~~~~v~~~~~---~~~~~~-----------------------------------------y~~-- 172 (431) T protein:vir:10 141 VWSG--NRPIRLIPMDRGSAKGRLT---STWQIV-----------------------------------------YDY-- 172 (431) T ss_pred EEcC--CceEEEEEEcCceeEEEEc---CCCeEE-----------------------------------------EEE-- Confidence 7763 4445788899999987621 111110 110 Q ss_pred HHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCc Q lcl|NC_011269. 268 EIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEP 347 (867) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 347 (867) ....|..+.++...|-|+++-... --.|.+.+.-+-++|-.-....+....+.+.-.+|=-++++-+ T Consensus 173 ----~~~~g~~~~~~~~dViHir~~~~d-g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-------- 239 (431) T protein:vir:10 173 ----TTPTGDKIELPAREVFHLRDLSID-GVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPK-------- 239 (431) T ss_pred ----EeCCceEEEEchhhEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC-------- Confidence 012345567888888899764321 2358888888878777666666555556565556655555432 Q ss_pred CCCCHHHHHHHHHHHHHhhhc-ch--hhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCcccee Q lcl|NC_011269. 348 WIPDQGELDEVRDDMQSLLAA-DF--RLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYA 424 (867) Q Consensus 348 ~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 424 (867) .-+.+..+.+|+.|+..+.. ++ ..+|-.-|++++.+.-.-....+-+-.++..++|.+++||...+|.+.++++|+ T Consensus 240 -~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~s 318 (431) T protein:vir:10 240 -ELSDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWGS 318 (431) T ss_pred -CCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCccc Confidence 24678899999999887763 43 456666788888775322222221223445688999999999999988889999 Q ss_pred hh-hhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhh-h--hhh Q lcl|NC_011269. 425 SS-ALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKL-L--IPE 500 (867) Q Consensus 425 ~~-~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~-i--~~~ 500 (867) +. +....|+..-+.-+...|++.+.+.+-.-.+..+ +.|+|-+..+.--|..+..+...+ .+....+. | ..| T Consensus 319 n~eq~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~---~~~~fd~~~llr~d~~~r~~~~~~-~~~~G~~~g~lT~NE 394 (431) T protein:vir:10 319 GIEQLAIFFIQYGLSHWFVSWEQAAARAFLPEKMLGQ---RQFKFNEGALLRGTLNDQAAFFSK-ALGAGGQSPWMKQNE 394 (431) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhcCC---ceEEEechhhhccCHHHHHHHHHH-HHhcccccCccCHHH Confidence 86 4566777777888888888888776632233322 334443333322222221111111 11100000 0 000 Q ss_pred hccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCC Q lcl|NC_011269. 501 IKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNL 575 (867) Q Consensus 501 i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~ 575 (867) +.-- +++.......+.. + -.++.....+..-+ +++.. T Consensus 395 ~R~~-~gl~p~~~~~gD~--~---~~p~n~~~~~~~~~--------------------------------~p~~~ 431 (431) T protein:vir:10 395 VREM-LDLPRADDPVADQ--L---RNPMTQKQKGSGDE--------------------------------PPATT 431 (431) T ss_pred HHHH-hCCCCCCCccccc--e---ecccccccCCCCCC--------------------------------CCCCC Confidence 0000 0010000000000 0 00000000000000 00000 No 68 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=97.96 E-value=9.6e-07 Score=53.58 Aligned_cols=399 Identities=11% Similarity=0.042 Sum_probs=179.3 Q ss_pred eccchhh--hhhhhhH-HhhCCCchhhhH--HHHHHHHHHHHHhhccchHHHHHHhhhhcccccceecc----cc---hh Q lcl|NC_011269. 82 IAMPKIR--QPLGTLA-DKGIPFNVEDEE--ELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDS----KD---PL 149 (867) Q Consensus 82 ~~~~~~~--~~~~~~~-~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~---~~ 149 (867) |.+-++. .|.-.-. ...++..+.--. .-..+..+ =.-.++-|-.|||+.++ .|..+.|.. +| .. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~---~al~~~~v~~~i~~ia~-~ia~lp~~~~~~~~~g~~~~ 76 (419) T protein:vir:57 1 MFIPQFWKGRPSENRVNWQVVPGGMRSSSSQAGVIITPE---TALALSAVRACVTLLAE-SVAQLPCVLYRRTENGGREI 76 (419) T ss_pred CcchhhhccCCccccccccccccccccccccCCceechH---HhhccHHHHHHHHHHHH-hhccCceEEEEEcCCCceec Confidence 3332221 1110000 000000000000 00000000 01124557788887765 333332221 12 22 Q ss_pred HHHHHHHHhh----cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHH Q lcl|NC_011269. 150 IKTFYEDLFF----GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVK 225 (867) Q Consensus 150 ~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (867) +++-...-.| .+..+-.+|+..++ ..++.-|+.+-+.+-|.. |...++..|+|+.|.|... .+.++ T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~l~l~Gna~~~i~r~~~-G~~~~L~pl~~~~v~v~~~---~~g~~----- 146 (419) T protein:vir:57 77 AFDHPLHDLIRYQPNRKDTAFEYHEQTQ-GVLGLEGNSYSLIDRNGR-GDITELIPINPHKVIVLKG---PDGMP----- 146 (419) T ss_pred cccchHHHHHhhccccCCCHHHHHHHHH-HHHhhcCCeEEEEEECCC-CcEEEEEEEcCcceEEEEC---CCceE----- Confidence 3343222222 24556678888866 788889998888776654 5678899999999988521 00000 Q ss_pred HHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhh Q lcl|NC_011269. 226 DLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLL 305 (867) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (867) +.+++ +.+-.++...|-|+.+-... .-.|.+.+- T Consensus 147 -------------------------------------~y~~~--------~~~~~~~~~~vih~r~~~~d-~~~G~s~i~ 180 (419) T protein:vir:57 147 -------------------------------------YYDIP--------SIGEILPMRMVHHIKSFSLD-GYIGTSPIQ 180 (419) T ss_pred -------------------------------------EEEEc--------CCceEEchhhEEEecCcCCC-CcccccHHH Confidence 11111 11223556677788754322 235899888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhc-c--hhhhhhhhheeee Q lcl|NC_011269. 306 RSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-D--FRLMVHNFGLKVE 382 (867) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~ 382 (867) .+-++|-......+....+.+.-..|--++++. ++... .-++++++.+|+.++..+.- + ...+|-.-|++++ T Consensus 181 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~----~~~~~-~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~ 255 (419) T protein:vir:57 181 TNPDVLGLGIAVEQHAAQVFARGTTMSGVIERP----FEAKA-IASQAAVDAILAKWTERYGGVRNAFSVGMLQEGMTYK 255 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHccCCccEEEEec----CcCCc-ccCHHHHHHHHHHHHHHhccccccccceecCCCceEE Confidence 777777776666666666667767776666665 33333 35788999999877665432 2 4566667788888 Q ss_pred eccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhh-hHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcc Q lcl|NC_011269. 383 NVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSAL-NREFVTQIMTGFQNALKRHIRRRCEVVAEAQGH 461 (867) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~ 461 (867) .++..-+...+-+-.++..++|.+++||...+|.+.+.++|+++.- ...|+..-+.=+...|++.+.+.+=.-.+. T Consensus 256 ~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~--- 332 (419) T protein:vir:57 256 QLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGLQYVIYTMLAILKRHESAMMRDLLLPSER--- 332 (419) T ss_pred EcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc--- Confidence 7765433333333346667899999999999999888889988532 222223334444444444443332111111 Q ss_pred cchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCccccc Q lcl|NC_011269. 462 YDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKF 541 (867) Q Consensus 462 ~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~ 541 (867) .++.|+|-+..+...|..+.-+ ..+..++....- ..++.. .+.+.........++.+.............. T Consensus 333 ~~~~i~fd~~~ll~~d~~~~~~-~~~~~~~~G~~T-~NE~R~-~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~------ 403 (419) T protein:vir:57 333 RDFYIEFNVSSLLRGDQKSRYE-SYALGRQWGWLS-VNDIRR-MENLTPIPGGDKYLTPLNMVDSKALTGIGKA------ 403 (419) T ss_pred CCeEEEEechhhhccCHHHHHH-HHHHHHhCCCcC-HHHHHH-HhCCCCCCCcCeeeeccccccccccccccCC------ Confidence 1334444443332222211111 111111110000 000000 0001000000000010000000000000000 Q ss_pred chhhhhhHHHHHHHHhhcccccccccccccccC Q lcl|NC_011269. 542 DQELERQADETVQKLMATAQAMKKVQDLCDAQN 574 (867) Q Consensus 542 E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g 574 (867) +++..+..+......| T Consensus 404 -----------------~~~~~~~~~~~~~~~~ 419 (419) T protein:vir:57 404 -----------------TPQQLKDIEAILCTRN 419 (419) T ss_pred -----------------CcccCcchhhhhhccC Confidence 0000000000000001 No 69 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=97.92 E-value=1.9e-06 Score=51.90 Aligned_cols=401 Identities=10% Similarity=0.047 Sum_probs=175.7 Q ss_pred HhcccccccceeeccchhhhhhhhhHH-----hhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh----ccccc Q lcl|NC_011269. 70 YRKQGNFGSNMQIAMPKIRQPLGTLAD-----KGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK----FPVVG 140 (867) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~ 140 (867) |--.--+.+|..=.-++....+..+.. -+...|. +. +-.++-|-.||++-++ .|+.= T Consensus 1 ~~~~r~~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~vt~--~~------------al~~~~v~~~v~~ia~~iA~lp~~~ 66 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMSAGGWVSALLGSSRSDSGQVVTP--AS------------ALALTVLQNCVTLLAESIAQLPIEL 66 (419) T ss_pred CcccccccccccccccCcchhhHHhhcCCCccCCcccch--HH------------hhccHHHHHHHHHHHHhhccCceEE Confidence 110000001111000000011111110 0111111 11 1134556667776554 23221 Q ss_pred ceecccc-hhHHHHHHHHhh----cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhh Q lcl|NC_011269. 141 MEFDSKD-PLIKTFYEDLFF----GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFV 215 (867) Q Consensus 141 ~~~~~~~-~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (867) ++.+.++ ..+++-..+..| .+..+-.+|+..++ ..++.-|+.+-+.+-|.. |.+..+..|+|+.|.|.+. T Consensus 67 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~l~l~Gna~~~i~r~~~-G~~~~l~pl~~~~v~v~~~--- 141 (419) T protein:vir:14 67 YERSGEDRKPATDHPLYSILKYEPNSWQTPFEYQEQSQ-VAVGLRGNSYSFIDRDSD-GVIQGLYPLDNEAVTVMRG--- 141 (419) T ss_pred EEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHH-HHHhhcCCeEEEEEECCC-CcEEEEEEecCceEEEEEC--- Confidence 1111111 011111111222 13456778988866 888889999998887765 6788999999999988621 Q ss_pred cchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCcc Q lcl|NC_011269. 216 QRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTA 295 (867) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (867) .+ |.- .| + ....++ ++.+.|-|+.+-... T Consensus 142 ~~---------------------~~~----------------~y----~--------~~~~~~--~~~~~i~h~~~~~~d 170 (419) T protein:vir:14 142 SD---------------------LKP----------------VY----R--------VRGSDP--MPQRLVHHVRWMSIN 170 (419) T ss_pred CC---------------------ceE----------------EE----E--------EccCcc--cchhheeEecCcCCC Confidence 11 000 00 0 001111 345667777653321 Q ss_pred ccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhc-c--hhh Q lcl|NC_011269. 296 WATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-D--FRL 372 (867) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~ 372 (867) .-.|.+.+.-+-++|-.............+.-..|=-++++- ++... .-+++..+.+|+.++..+.. + ... T Consensus 171 -g~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~----~~~~~-~~~~~~~~~~~~~~~~~~~g~~nag~~ 244 (419) T protein:vir:14 171 -GYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERP----KDAPA-LKDQASVDRITDGWNAKFGGSGNAKKV 244 (419) T ss_pred -CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEec----CCCCc-ccCHHHHHHHHHHHHHHhcCccccCCc Confidence 236888888777777666665555555556555564444543 12212 34688899999988766543 3 346 Q ss_pred hhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011269. 373 MVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQIMTGFQNALKRHIRRR 451 (867) Q Consensus 373 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~r~~ 451 (867) +|-.-|++++...-.-....+-+-.++..++|.+++||.-.+|..+++++|+++. ....|+..-+.=+..+|++.+.+. T Consensus 245 ~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~f~~~~L~P~~~~ie~~l~~k 324 (419) T protein:vir:14 245 ALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAKTRD 324 (419) T ss_pred eecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 6667788888776433222222334567789999999999999988889998854 333444444555556666666654 Q ss_pred hHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeecc Q lcl|NC_011269. 452 CEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDK 531 (867) Q Consensus 452 ~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~ 531 (867) +=.-.+..+ +.|+|-+..+..-|..+.-+-..++ ++...- -..++..- +.+-........+..+.. ..+.+ T Consensus 325 ll~~~~~~~---~~i~fd~~~l~r~d~~~~~~~~~~~-~~~G~~-T~NE~R~~-~gl~p~~gGD~~~~~~n~--~~~~~- 395 (419) T protein:vir:14 325 LLLPSERKQ---YFIEYNLAGLLRGDQSSRYAAYAVG-RQWGWL-SINDIRRL-ENMPPVKGGDIYLSPMNM--VDASK- 395 (419) T ss_pred ccCccccCC---eEEEEechhhhccCHHHHHHHHHHH-HhCCCc-CHHHHHHH-hCCCCCCCcCeeeecccc--ccccc- Confidence 421122222 2233333222221221111111111 100000 00000000 000000000000000000 00000 Q ss_pred ccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcccccccc Q lcl|NC_011269. 532 TLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQ 586 (867) Q Consensus 532 t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~ 586 (867) +............... .+..+.+. T Consensus 396 --~~~~~~~~~~~~~~~~-----------------------------~e~~~~l~ 419 (419) T protein:vir:14 396 --PQQLPVGKSEPTKAAI-----------------------------DEIGRILS 419 (419) T ss_pred --cccccCCCCCCccccc-----------------------------cchhcccC Confidence 0000000000000000 01111111 No 70 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=97.92 E-value=1e-06 Score=53.44 Aligned_cols=398 Identities=14% Similarity=0.121 Sum_probs=187.2 Q ss_pred CCCCchhhHHhhhhhcccCCc-------hHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhH-Hh Q lcl|NC_011269. 26 MPNSPTMARAQAAALQNTVDN-------KPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLA-DK 97 (867) Q Consensus 26 ~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 97 (867) |- -+ .-||+- +++.+.|...++.-+.. .+...|++... .. T Consensus 1 ~~----------~~-~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~~ 48 (424) T protein:vir:18 1 ME----------EP-KYTIDLRTNNGWWARLQSWFVGGRLVTPNQ---------------------GSQTGPVSAHGHLG 48 (424) T ss_pred CC----------CC-cceEeecCCCchHHHHHhhhcccccccccc---------------------cccccccccccccc Confidence 00 00 011110 11222222221111110 01111221110 01 Q ss_pred hCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhh----hcccccceeccc---------chhHHHHHHHHhhccccc Q lcl|NC_011269. 98 GIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYS----KFPVVGMEFDSK---------DPLIKTFYEDLFFGEDLN 164 (867) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~ 164 (867) +.. +-++.- -.++-|=.||++-+ ..|+.=++.+.+ +++.+-+-+ --.+..+ T Consensus 49 ~~~--v~~~~a------------l~~~~v~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~--~PN~~~t 112 (424) T protein:vir:18 49 DSS--INDERI------------LQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRY--SPNQYMT 112 (424) T ss_pred ccc--ccHHHh------------hccHHHHHHHHHHHHhhccCceEEEEeecCCceeeeccccHHHHHHhh--ccCCCCC Confidence 111 222222 22344555666544 333322222222 222211111 1123455 Q ss_pred HHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhcccccccccccccc Q lcl|NC_011269. 165 YLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMST 244 (867) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (867) -.+|+..++ ..++.-|..+-+.+.+.. |...++..|+|+.|.|.. ..++++- T Consensus 113 ~~~f~~~~~-~~lll~Gnay~~i~r~~~-G~~~~L~pl~~~~V~v~~---~~~~~~y----------------------- 164 (424) T protein:vir:18 113 AQEFREAMT-MQLCFYGNAYALVDRNSA-GDVISLLPLQSANMDVKL---VGKKVVY----------------------- 164 (424) T ss_pred HHHHHHHHH-HHHhhcCCeEEEEEECCC-CcEEEEEEecCcceEEEE---cCCeEEE----------------------- Confidence 778888866 888889999988887765 557889999999998862 2222111 Q ss_pred ccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011269. 245 VEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAV 324 (867) Q Consensus 245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (867) +| -..|....++.+-|-|+.+-... .-.|.+.+.-+-.+|-.-....+..... T Consensus 165 -------------~~-------------~~~g~~~~~~~~eIih~r~~~~d-g~~G~spi~~~~~~i~~~~a~~~~~~~~ 217 (424) T protein:vir:18 165 -------------RY-------------QRDSEYADFSQKEIFHLKGFGFT-GLVGLSPIAFACKSAGVAVAMEDQQRDF 217 (424) T ss_pred -------------EE-------------EeCCeEEEeccccEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHH Confidence 11 01244456667778888754321 2368888877878877766666666667 Q ss_pred HhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch--hhhhhhhheeeeeccccCccCchhHHHHHHHH Q lcl|NC_011269. 325 ADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF--RLMVHNFGLKVENVFGRESVPNLDADYDRIER 402 (867) Q Consensus 325 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 402 (867) .+.-..|--++++.. . +=+.++.+.+|+.++....+++ ..+|---|++++.++..-...-+-+-.++..+ T Consensus 218 f~ng~~p~gil~~~~-------~-~l~~e~~~~~~~~~~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~ 289 (424) T protein:vir:18 218 FANGAKSPQILSTGE-------K-VLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVS 289 (424) T ss_pred HHccCCcceEEEeCC-------c-CCCHHHHHHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHHHH Confidence 777777877777652 1 2367888999998888777653 45666778888888654333222234467788 Q ss_pred HHHHhhccchhhhcCCCccceehh---hhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhh Q lcl|NC_011269. 403 KLLQAWGIGEALISGGTGGAYASS---ALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIV 479 (867) Q Consensus 403 ~~~~~~~~~~~~~~~g~~~~~~~~---~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~ 479 (867) +|.+++||.-.+|..-++++|..+ +....|+.+-+.-+..+|++.+.+.+-.-.|..+ ..|+|-+..+.-. T Consensus 290 ~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~---~~~~fd~~~llr~--- 363 (424) T protein:vir:18 290 ELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGR---IHAEHNLDGLLRG--- 363 (424) T ss_pred HHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccCC---eEEEEechhhhcc--- Confidence 999999999999976677776332 3455566666666666666666665522233322 2233322222111 Q ss_pred hhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhc Q lcl|NC_011269. 480 EYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMAT 559 (867) Q Consensus 480 ~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~t 559 (867) +.+++. +.. .+..+. +-+.. .|.=+...+..++. +..+.-+..-..+.. +....+.+ T Consensus 364 d~~~r~-~~~-~~~~~~-------G~~T~-NE~R~~~gl~pi~g-GD~~~~~~n~~~l~~-~~~~~~p~----------- 420 (424) T protein:vir:18 364 DSASRA-AFM-KAMGEA-------GLRTI-NEMRRTDNLPPLPG-GDVAMRQSQYVPITD-LGTNKEPR----------- 420 (424) T ss_pred CHHHHH-HHH-HHHHhC-------CCcCH-HHHHHHhCCCCCCC-cCeeeeccCccchHh-hhccCCCc----------- Confidence 222211 111 111000 00100 01000000110000 000000000000000 00000000 Q ss_pred cccc Q lcl|NC_011269. 560 AQAM 563 (867) Q Consensus 560 aet~ 563 (867) .+.. T Consensus 421 ~~ga 424 (424) T protein:vir:18 421 NNGA 424 (424) T ss_pred cCCC Confidence 0000 No 71 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=97.90 E-value=1.1e-06 Score=53.35 Aligned_cols=417 Identities=12% Similarity=0.074 Sum_probs=193.4 Q ss_pred hhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh Q lcl|NC_011269. 56 RRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK 135 (867) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (867) ||.+..+=..|+.+-.. +-++.=+++..+.|..-++......- ..+-. .=+-.++-|-.|||+-++ T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~g~~~s~~~~~~~~~~~~~~~~~g-~~v~~------------~~al~~~~v~~ci~~Ia~ 66 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFL-KWLGVPISLTDGSFWSAWGGMGSSSG-ETVTA------------DSALQLSAVWSCVRLIAE 66 (437) T ss_pred CCcchhhhhhhhHHhhh-hhcCCcccCCchhHHHhhcccccCCC-ceech------------HhhhccHHHHHHHHHHHH Confidence 44433322222211100 11223344445555443332221110 01111 112345666677777654 Q ss_pred ----cccccce--------ecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecC Q lcl|NC_011269. 136 ----FPVVGME--------FDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILN 203 (867) Q Consensus 136 ----~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (867) .|+.=++ ...++++.+-+... -.+..+-.+|+...+ ..++.-|+.+-+.+-+ .|...++..|+ T Consensus 67 ~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~--PN~~~t~~~f~~~~~-~~lll~Gnay~~i~r~--~g~~~~L~~l~ 141 (437) T protein:vir:10 67 TIATLPLNLYQTKPDGTRVLAKQHRLYTVIHSQ--PNAENTAAEFWEVIV-ASMLLWGNGYARKLRS--AGVLIGLELML 141 (437) T ss_pred HHhhCceeEEEEcCCCceeeccccHHHHHhhcc--CCcCCCHHHHHHHHH-HHHhhcCCeEEEEEec--CCcEEEEEEEc Confidence 2222111 11222222111111 123456678888866 8888899998887766 37888899999 Q ss_pred cceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccH Q lcl|NC_011269. 204 PDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISE 283 (867) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (867) |+.|.|.+- .+..+ .|. | +.-.|.-..++. T Consensus 142 p~~v~i~~~---~~g~~-------------------------------------~y~-----~-----~~~~g~~~~~~~ 171 (437) T protein:vir:10 142 PQRTTVKRL---TSGAL-------------------------------------QYT-----Y-----RNVDGTVSTLAE 171 (437) T ss_pred CcceEEEEC---CCCeE-------------------------------------EEE-----E-----EecCceEEEEcc Confidence 999988731 11000 010 0 011233345667 Q ss_pred HHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHH Q lcl|NC_011269. 284 ALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQ 363 (867) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 363 (867) +-|-|+.+-... .-.|.+.+..+-.+|-...........+.+.-..|--++++-+ .-+.+..+++|++|+ T Consensus 172 ~dIih~r~~~~d-~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---------~l~~e~~~~~~~~~~ 241 (437) T protein:vir:10 172 DDVFHVRGFSLD-GLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQ---------ILQKEKRAEIRTDLA 241 (437) T ss_pred ccEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC---------CCCHHHHHHHHHHHH Confidence 778888653211 2368888877777776666666666666666666766666542 235688899999887 Q ss_pred Hhhh-cc--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccce--eh-hhhhHHHHHHHH Q lcl|NC_011269. 364 SLLA-AD--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAY--AS-SALNREFVTQIM 437 (867) Q Consensus 364 ~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~--~~-~~~~~~~~~~~~ 437 (867) .... ++ ...+|-.-|++++.+...-....+-+-.++..++|.+++||.-.+|..+++++| ++ .+....|+..-+ T Consensus 242 ~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~tl 321 (437) T protein:vir:10 242 EQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLGFLTFTL 321 (437) T ss_pred HHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHH Confidence 6543 33 457777888888888655444444444556678999999999999977777766 44 355666777777 Q ss_pred HHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhh Q lcl|NC_011269. 438 TGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAF 517 (867) Q Consensus 438 ~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~ 517 (867) .-+...|++.+.+.+=.-.|... ..|+|-+..+..-|..+.-+ ..+..+... | +..+.-+. ... T Consensus 322 ~P~~~~ie~~l~~kll~~~e~~~---~~~~fd~~~ll~~d~~~r~~-~~~~~~~~G---~--------~T~NE~R~-~~g 385 (437) T protein:vir:10 322 RPWLTRIEQAARRSLLRPGERDQ---FYAEFSVEGLLRADSAGRAA-FYSTMTQNG---L--------MTRDECRA-KEN 385 (437) T ss_pred HHHHHHHHHHHHhhccCccccCc---eEEEEechhhhccCHHHHHH-HHHHHHhCC---C--------cCHHHHHH-HhC Confidence 77778888888766522122211 12333222221112111111 111111110 0 00000000 000 Q ss_pred hhhhhhceeee--eccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccc Q lcl|NC_011269. 518 IAQLKGMGVPV--SDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPEL 581 (867) Q Consensus 518 v~qL~~~~~pi--td~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~ 581 (867) +..++.-...+ .....+. .. ...+...+ ..+...+-.+....... ...+. T Consensus 386 l~pi~gg~~~~~~~~~~~~~--~~-~~~~~~~~----------~~~~~~~~~~~~~~~~~-~~~e~ 437 (437) T protein:vir:10 386 LPPMGGNAAVLTVQSALLPI--DK-LGEHTTAT----------AAQDALKAWLYQEEKTR-ATQER 437 (437) T ss_pred CCCCCCCcceEeecCcccch--hh-ccCcCCCc----------chhccccccCCCCCCCC-ccccC Confidence 00000000000 0000000 00 00000000 00000000000000000 00000 No 72 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=97.85 E-value=1.9e-06 Score=52.01 Aligned_cols=361 Identities=12% Similarity=0.089 Sum_probs=168.6 Q ss_pred eeeccchhhhhhhhhHHhhCCCchhhh---HHHHHHHH-----H--HHHHhhccchHHHHHHhhhhcccccceecccchh Q lcl|NC_011269. 80 MQIAMPKIRQPLGTLADKGIPFNVEDE---EELRVIRH-----W--CRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPL 149 (867) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~-----~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (867) |-|-.++.. .+.=+.++... ..+..... | ++. |-.++.|-.||++.++ -|..+.|.+++.. T Consensus 1 Mg~~~~~~~-------~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~-~l~~~~v~~~i~~ia~-~ia~~~~~~~~~~ 71 (383) T protein:vir:10 1 MGLLTPKNF-------SKRNAKNMVYPSNPAFFTTTVGGMQLSYVSALS-ALQNTNVYSVINRIAS-DVSSAHFKTENTA 71 (383) T ss_pred CCccccccc-------ccccccccccccchhhhhhhccCccccccchhH-hhcchHHHHHHHHHHH-hhccCceeecccc Confidence 222211100 00000000000 00000000 0 011 2245778899999887 4444455555544 Q ss_pred HHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHh Q lcl|NC_011269. 150 IKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVD 229 (867) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (867) +..+.+ -. -+..+-.+|+..++ ..++.-|+.+-+..= ...+++.++.++|. +..+. T Consensus 72 ~~~ll~-~P-N~~~t~~~f~~~~~-~~l~l~Gn~~~~i~~-------~~~~~~p~~~~~v~---~~~~~----------- 127 (383) T protein:vir:10 72 TLNRLE-SP-SSLIGRFSFWQGAL-MQLCLSGNDYIPLVG-------QNLEHIPNSDVQIN---YLPGN----------- 127 (383) T ss_pred hhhhhh-CC-CCCCCHHHHHHHHH-HHhhhcCCeEEEEEc-------CceeEeecCcceEE---EEEcC----------- Confidence 433222 11 13456677888765 777778887765421 12234444444433 11000 Q ss_pred hccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCc-ccc-ccCcchhhHH Q lcl|NC_011269. 230 HLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPT-AWA-TRGAPHLLRS 307 (867) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~ 307 (867) .| +...|- ....+....++..-|-|+++-+. .|. ..|.|.+..| T Consensus 128 ---~~---------------------------~~~~~~----~~~~~~~~~~~~~evih~r~~~~~~~~~~~G~s~l~~~ 173 (383) T protein:vir:10 128 ---MG---------------------------IVYTVL----ESNDRPKMVLRQDQMLHFRLMPDPQYRYLIGRSPLESL 173 (383) T ss_pred ---Cc---------------------------eEEEEE----EcCCceEEEEcccceEEeccCCCCcccccccccHHHHH Confidence 00 000000 00112234455566788875433 233 4699999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch--hhhhhhhheeeeecc Q lcl|NC_011269. 308 FRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF--RLMVHNFGLKVENVF 385 (867) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ 385 (867) -++|.......+....+.+.-..|--++++.+ . +=+.+..+.+|+.|+.....++ ..+|..-|.+++-++ T Consensus 174 ~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~------~--~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~ 245 (383) T protein:vir:10 174 QNALNLDDKASKSNMSAMENQINPAGKLTISN------Y--LSDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLE 245 (383) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcceEEEeCC------C--CCCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecC Confidence 99888777766666666666667766666652 1 2356788899999988887764 466777788888876 Q ss_pred ccCccCch-hHHHHHHHHHHHHhhccchhhhcCCCc--cceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhc-c Q lcl|NC_011269. 386 GRESVPNL-DADYDRIERKLLQAWGIGEALISGGTG--GAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQG-H 461 (867) Q Consensus 386 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~-~ 461 (867) ..-..... .+-.+..+++|.+++||...+|.++++ .+|++.. |....+..-|+-+++... .|++- + T Consensus 246 ~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~e-------q~~~~~~~~l~P~~~~ie---~~l~~~l 315 (383) T protein:vir:10 246 MKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNID-------QIKATYLANLNSYVNPIV---DELRLKM 315 (383) T ss_pred CChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHH-------HHHHHHHHHHHHHHHHHH---HHHHHhh Confidence 54443332 223366689999999999999987664 4566533 222223333444443322 22322 2 Q ss_pred cchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeee----eccccCCCc Q lcl|NC_011269. 462 YDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPV----SDKTLAVNI 537 (867) Q Consensus 462 ~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pi----td~t~p~ti 537 (867) +.++++|-+..+...|..+..+.+.+.-... -+..+.-+...+ ...+.....+. ..+..++.- T Consensus 316 ~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G------------~~t~nE~R~~lg-~~p~~~~d~~~~~~~~~~~~gGd~ 382 (383) T protein:vir:10 316 NAPDLELDIKDMLDVDDSILINQVSNLAKSG------------VLGAEQAQFILT-RSGFLPDNLPEFKPLTNETKGGDD 382 (383) T ss_pred CCceEEeechhhhccCHHHHHHHHHHHHhCC------------CcCHHHHHHHhC-CCcccCCcccccCCCcccCCCCCC Confidence 3445555444443333333222221111111 011110000000 00000000000 011111110 Q ss_pred c Q lcl|NC_011269. 538 D 538 (867) Q Consensus 538 q 538 (867) . T Consensus 383 e 383 (383) T protein:vir:10 383 K 383 (383) T ss_pred C Confidence 0 No 73 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=97.83 E-value=5.4e-07 Score=54.94 Aligned_cols=397 Identities=12% Similarity=0.036 Sum_probs=175.0 Q ss_pred eeeccchhhh--------hhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccccee----cccc Q lcl|NC_011269. 80 MQIAMPKIRQ--------PLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEF----DSKD 147 (867) Q Consensus 80 ~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~ 147 (867) |.+ ||- .+..+-+.++..++... .-.+.++ .++=|=.||++-+. -|..+.+ ..+| T Consensus 1 m~~----~~~~~~~~~~~~~~~~~~~~~~~~~~g~--~~~~~Al------~~~~V~~cv~~ia~-~iA~lp~~~~~~~~~ 67 (417) T protein:vir:38 1 MKL----FRGLATEVDPHWADHLLDSGVIPSFRGG--YLGISAL------RNSDVLTAVSIVSG-DVSRFPLVITDSSTD 67 (417) T ss_pred Ccc----ccccccCCCccchhhhcccccccccCCc--eechhhc------ccHHHHHHHHHHHH-hhccCeeEEEEcCCc Confidence 111 110 01111112222222111 0000000 23444567776543 1222222 1223 Q ss_pred hhHHHHHHHHhh----cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHH Q lcl|NC_011269. 148 PLIKTFYEDLFF----GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLM 223 (867) Q Consensus 148 ~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (867) ..+++--.+..+ .+..+-.+|+..++ ..++.-|..+-+-+.+..+|.-..++.|.|+.|.|.+. ....+. T Consensus 68 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~---~~~~~~-- 141 (417) T protein:vir:38 68 EVIDLANIEYLMNTKVNKRLSAYQWKFPMM-VNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTS---DPDNII-- 141 (417) T ss_pred ceeccchHHHHHhcccCcCCCHHHHHHHHH-HHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEc---CCCeEE-- Confidence 333321112222 34455667888866 77888899999988888889889999999999998632 111111 Q ss_pred HHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcch Q lcl|NC_011269. 224 VKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPH 303 (867) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (867) | +|-. ...+....++..-|-|+.+-.... -.|.+. T Consensus 142 -----------------------------------y-----~~~~----~~~~~~~~~~~~dviH~r~~~~d~-~~G~s~ 176 (417) T protein:vir:38 142 -----------------------------------Y-----RFTP----YNSSMQKVCGFEDVIHWKFFSYDT-IMGRSP 176 (417) T ss_pred -----------------------------------E-----EEEE----cCCcEEEEecCcceEEecCCCCCC-ccccCH Confidence 0 0100 011111223344567877543211 359999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch--hhhhhhhheee Q lcl|NC_011269. 304 LLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF--RLMVHNFGLKV 381 (867) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 381 (867) +..+.++|..............+.-..|--|++..+ .-+.++.+.+|+.|+.....++ ..+|-.-|+++ T Consensus 177 l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~---------~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~ 247 (417) T protein:vir:38 177 LLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKES---------RLSAEARQKIREDFERAQAGADAGSPIIVDATMDY 247 (417) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCC---------CCCHHHHHHHHHHHHHHhcccccCCceeccCCceE Confidence 888888887766666666666666667777776653 2467889999999988776653 45555667777 Q ss_pred eeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehh-hhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhc Q lcl|NC_011269. 382 ENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASS-ALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQG 460 (867) Q Consensus 382 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~ 460 (867) +.....-....+-.-.++..++|.+++||.-.+|. ...+|+++ +....|+..-+.-+...|++++.+.+=.-.|. T Consensus 248 ~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg--~~~~~s~~e~~~~~~~~~tl~P~~~~ie~~l~~~Ll~~~~~-- 323 (417) T protein:vir:38 248 QPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLA--QNSPNQSVKQLADDYIRNDLPFYFEPITSEFELKLLDDAQR-- 323 (417) T ss_pred EEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhC--CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhcChhhc-- Confidence 76643322111112233446789999999999994 45577764 33444444445555555555555444111121 Q ss_pred ccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhh-ceeeeeccccCCCccc Q lcl|NC_011269. 461 HYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKG-MGVPVSDKTLAVNIDM 539 (867) Q Consensus 461 ~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~-~~~pitd~t~p~tiqm 539 (867) .++.|+|-+. ++...+ ....+ .-+..+-+..+.-+...+ +..++. .+..+.-+..-..++. T Consensus 324 -~~~~~~fd~~-----~l~~~~--------~~~~~---~~~~~G~~T~NE~R~~~g-l~pi~~g~~d~~~~~~n~~~~d~ 385 (417) T protein:vir:38 324 -HQYCIGFDTK-----SVNGLP--------IADVN---TAVNGGLWTGNEGRAELG-KKPLKDPNMDRIQSTLNTVFLDQ 385 (417) T ss_pred -ccceEEechh-----hhhHHH--------HHHHH---HHHhCCCcCHHHHHHHhC-CCCCCCCCCCeeeeccccccccc Confidence 1122222111 111000 00001 111112122211111111 111100 0001111111111111 Q ss_pred ccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcccc Q lcl|NC_011269. 540 KFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELA 582 (867) Q Consensus 540 e~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~a 582 (867) +.+.+ + .....+.......+ . ..+......-+ T Consensus 386 ~~~~~---~--~~~~~~kgg~~~~~---~---~~~~~~~~~~~ 417 (417) T protein:vir:38 386 KEAYQ---A--EHAAELKGGDTNAK---G---NQNGSGTNANS 417 (417) T ss_pred ccccc---c--ccccccCCCCCCCC---C---CCcCCCCcCCC Confidence 11100 0 00011111100000 0 00000000000 No 74 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=97.81 E-value=8.7e-07 Score=53.82 Aligned_cols=327 Identities=13% Similarity=0.112 Sum_probs=161.6 Q ss_pred hhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhH-HHHHHHHHHHHH---hhccchH-HHHH Q lcl|NC_011269. 56 RRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEE-ELRVIRHWCRLF---YATHDLV-PLLI 130 (867) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~---~~~~~~~-~~~~ 130 (867) |++-...+..+-++..+.+ .++=++++-|. |+- +.. -+.- -||..+ |.+-|+- .-|. T Consensus 1 m~~~~~~~~~~~~~~~~~~-~~~~~~~~~p~---~~~------------~~~~~~~~--~~~~~~~~~~~~pp~~~~~la 62 (346) T protein:vir:10 1 MKKQLRKNLTQNDRLQPQA-QTEIFSFGDPI---PVL------------DRADILNY--LECSAMYEKWYNPPMSFDGLA 62 (346) T ss_pred CCcccCCCCCccccccccc-CeEEEecCCcc---eec------------CchhHHHH--HHHhhcCCceEecCCCHHHHH Confidence 3222111222212222222 12223344332 111 111 1112 255332 2222321 2345 Q ss_pred HhhhhcccccceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehh Q lcl|NC_011269. 131 DIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVS 210 (867) Q Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (867) ++++.=+..+.-+..|.--|...++. =.+.++-.+|.. +-..|+.-|..+-....|.. |-+.++..|.|.+|++. T Consensus 63 ~l~~~~~~h~~~i~~k~n~l~~l~~~--Pn~~~t~~~f~~--~~~d~ll~Gnay~~i~r~~~-G~~~~L~pl~~~~v~~~ 137 (346) T protein:vir:10 63 KSLRSSTHHESAIITKANILLSTCEV--DSRYLSRRDLSS--FVKDYLVFGNAYFEVVRNRL-GQVQRIESPLAKYVRKG 137 (346) T ss_pred HHHHhhhhcchhhhhhhhhHHHHHhC--CCCCCCHHHHHH--HHHHHHhcCCeEEEEEEcCC-CcEEEEEEecCCceEEE Confidence 55555555443333333333222211 123344455532 33667788998877777764 45678999999999885 Q ss_pred hhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhh Q lcl|NC_011269. 211 RSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVV 290 (867) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (867) .. .+..+. + ..-..|....++..-|-||+ T Consensus 138 ~~---~~~~~~-----------------------------------------~-------~~~~~g~~~~~~~~dIih~r 166 (346) T protein:vir:10 138 LE---AGQFYY-----------------------------------------V-------PQRFDHQEHEFAKGSIYHLL 166 (346) T ss_pred Ec---CCeEEE-----------------------------------------E-------EEccCCeEEEEecccEEEec Confidence 21 111100 0 00012333455666788988 Q ss_pred hcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch Q lcl|NC_011269. 291 NRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF 370 (867) Q Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 370 (867) +-...=.-.|.|-++-+.+++...++........-+.-..|=-|+++- +. ..++++.+.+|+-|+.....++ T Consensus 167 ~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~-----d~---~l~~e~~~~i~~~~~~~~g~~n 238 (346) T protein:vir:10 167 EPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMS-----DA---SQKQEDVENIRQQLKQSKGVGN 238 (346) T ss_pred CCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC-----CC---CCCHHHHHHHHHHHHHhcCccc Confidence 765545568999999988888776665555444444444454444442 11 2478899999998877666653 Q ss_pred h--hhhhh-----hheeeeeccccCccCchhHHHHHHH----HHHHHhhccchhhhcC--CCccceehhhhhHHHHHHHH Q lcl|NC_011269. 371 R--LMVHN-----FGLKVENVFGRESVPNLDADYDRIE----RKLLQAWGIGEALISG--GTGGAYASSALNREFVTQIM 437 (867) Q Consensus 371 ~--~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~--g~~~~~~~~~~~~~~~~~~~ 437 (867) + ++|+- =|+++.-++- -..|++|-++| ++|+.++||.-.|+.- +.+++|+++.--+ . T Consensus 239 ~~~~~vl~~~~~~~gi~~~pis~----~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~------~ 308 (346) T protein:vir:10 239 FKNLFVHAPNGKKDGIQIIPIAD----VSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAA------E 308 (346) T ss_pred cCceeEecCCCCccceeEEecCC----ChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHH------H Confidence 2 33331 2444444432 22466665554 5699999999998831 3344577654322 1 Q ss_pred HHHHHHHHHHHhhhhHHHHHhhcccchh-eehhhccccchhhhhhhh Q lcl|NC_011269. 438 TGFQNALKRHIRRRCEVVAEAQGHYDYD-LKGGVRVPIYREIVEYDE 483 (867) Q Consensus 438 ~~~~~~l~~~~r~~~~~i~e~q~~~d~~-~~~~~~~~~~rd~~~~k~ 483 (867) .=. +..|+-.++.|.|++++.-.. |+ .+..+++.+++ T Consensus 309 ~f~----~~~l~P~~~~iee~n~~L~~e~i~-----F~~~~ll~~~~ 346 (346) T protein:vir:10 309 VFF----ITEIEPLQERLKEFNQWLGQEVIK-----FKPSKLLQRTQ 346 (346) T ss_pred HHH----HHHHHHHHHHHHHHHhhcccceee-----echhhhcccCC Confidence 112 223333333344555533221 22 34455566665 No 75 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=97.79 E-value=8.2e-07 Score=53.95 Aligned_cols=439 Identities=11% Similarity=0.026 Sum_probs=179.0 Q ss_pred hcchhHHHH-HHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh- Q lcl|NC_011269. 58 RAAEANRQR-LASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK- 135 (867) Q Consensus 58 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 135 (867) -|+-+|+-. -.+.+......-.+..-.|.+.. ++.....+.+-|.+ + +++ ++-|-.||++-++ T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~g~~V~~~--~---al~---------~~~V~~~v~~Ia~~ 65 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYN-LGAVAASGETVTPH--D---ALQ---------VSAVFASVRLLSET 65 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHh-hcccccCCceechH--H---hhc---------cHHHHHHHHHHHHh Confidence 233222211 00111011001111111112111 11111122222221 1 222 2334456665543 Q ss_pred ---cccccceeccc-chhHHHHHHHHhhcc---cccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceee Q lcl|NC_011269. 136 ---FPVVGMEFDSK-DPLIKTFYEDLFFGE---DLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLR 208 (867) Q Consensus 136 ---~~~~~~~~~~~-~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 208 (867) .|+.=++.+.+ ...++.-....++-. .++-.+|+..++ ..++.-|..+-+.+-+ .|...++..|+|+.|. T Consensus 66 iA~lp~~~~~~~~~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~-~~lll~Gna~~~i~~~--~g~~~~l~~l~p~~v~ 142 (457) T protein:vir:13 66 IATLPLSTYSKRGGSRKEIVTPEWLDYPNAEPGGMGRIDILSQTV-LSLLLQGNAFLAVRWQ--GPNIVGLDVLDPTKIH 142 (457) T ss_pred hccCceEEEEecCCcccccccchHHHhccccCCCCCHHHHHHHHH-HHHhhcCCeEEEEEec--CCcEEEEEEEccCceE Confidence 33322221111 111211111112222 133457888866 7788889988766543 5778889999999998 Q ss_pred hhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCC--CcccHHHH Q lcl|NC_011269. 209 VSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDG--LDISEALI 286 (867) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 286 (867) |.....-... ...|...--...+++. ..++..-| T Consensus 143 v~~~~~~~~~--------------------------------------------~~~~~~y~~~~~~~~~~~~~~~~~di 178 (457) T protein:vir:13 143 VHMVMVDGLR--------------------------------------------RKVFEAYDIDADGNEVLLGWFTPRDV 178 (457) T ss_pred EEEecCCCcc--------------------------------------------ceeEEEEEEecCCceeeEEeeCccce Confidence 8732111000 0000000000011111 12344567 Q ss_pred HHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhh Q lcl|NC_011269. 287 SRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLL 366 (867) Q Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (867) -|+.+-...-.-.|.+.+.-+-++|-.............+.-..|--|+++.+ .-+++.++.+|+.|+..+ T Consensus 179 ih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---------~ls~e~~~~~~~~~~~~~ 249 (457) T protein:vir:13 179 LHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPG---------TMSEEGLARAREAWRAAN 249 (457) T ss_pred EEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCC---------CCCHHHHHHHHHHHHHHh Confidence 77776554433579998888888777666666666666666666766666652 237889999999988876 Q ss_pred h-cch--hhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehh---hhhHHHHHHHHHHH Q lcl|NC_011269. 367 A-ADF--RLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASS---ALNREFVTQIMTGF 440 (867) Q Consensus 367 ~-~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~---~~~~~~~~~~~~~~ 440 (867) . +++ ..+|-.-|++++-+...-....+-.-.+...++|.+++||.-.||...++.+|..+ +....|+..-+.=+ T Consensus 250 ~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~tl~P~ 329 (457) T protein:vir:13 250 SGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSLRPW 329 (457) T ss_pred cCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHHHHH Confidence 5 343 35555667887777544333323233445678899999999999965566666433 34445555556666 Q ss_pred HHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhh Q lcl|NC_011269. 441 QNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQ 520 (867) Q Consensus 441 ~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~q 520 (867) ..+|++.+.+.+-.-.|.++ +.|+|-+..+.-. +.|.++ +.. .+.. .-+-+..+.-+...+ +.. T Consensus 330 ~~~ie~~ln~~L~~~~~~~~---~~i~fd~~~l~~~---D~~~r~-~~~-~~~~-------~~G~~T~NE~R~~~g-l~P 393 (457) T protein:vir:13 330 LERIEAGFNRLLFAETADRF---RFVKFNLDEIKRG---APKERM-ELW-SLGL-------QNGIYSIDEVRAAED-MTP 393 (457) T ss_pred HHHHHHHHHHhhcCccccCc---eeEEeechhhhcc---CHHHHH-HHH-HHHH-------hCCCcCHHHHHHHhC-CCC Confidence 66777766666633333322 2233333222222 222211 111 1100 000011110010000 000 Q ss_pred hhh-ceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccccccCCCCCCCCC Q lcl|NC_011269. 521 LKG-MGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLALRQGKTQTE 599 (867) Q Consensus 521 L~~-~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~a~gpgq~~ 599 (867) ++. .+....-+..-..+....+.+ ......+..+ ...+..+.. ...+. .. T Consensus 394 i~~g~~d~~~~~~n~~~~~~~~~~~-----------~~~~~~~~~~-----------~~~~~~~~~-----~~~g~--~d 444 (457) T protein:vir:13 394 LPDGLGEKYRVPLNLGEVGEEPEPE-----------PAPAPPAIEP-----------PAEEPDEEP-----EPEGK--PD 444 (457) T ss_pred CCCCcccceeecccccccccccccc-----------ccCCCCCCCC-----------CccccCCCC-----CCCCC--Cc Confidence 000 000000000000000000000 0000000000 000000000 00000 00 Q ss_pred CCCCCCCccCCCCccCC Q lcl|NC_011269. 600 LGEAQAVAGEAQAELQT 616 (867) Q Consensus 600 ~~qa~~~agq~~~p~~~ 616 (867) ...+.. ....... T Consensus 445 --~~~~~~--~~~~~~~ 457 (457) T protein:vir:13 445 --DEGATE--EDDEDDA 457 (457) T ss_pred --cccCCC--CcccccC Confidence 000000 0000000 No 76 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=97.77 E-value=3.8e-06 Score=50.32 Aligned_cols=426 Identities=11% Similarity=0.026 Sum_probs=174.8 Q ss_pred CC-chHHHHHHHHhhhcchhHHHHHHHHhccccccccee--eccchhh-hhhhhhHHhhC---CCchhhhHHHHHHHHHH Q lcl|NC_011269. 44 VD-NKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQ--IAMPKIR-QPLGTLADKGI---PFNVEDEEELRVIRHWC 116 (867) Q Consensus 44 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~-~~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 116 (867) |- -|+ --|| -+-.+-.+|+.- |.=-+-|..+-. +..+..- +.+........ -..+.++..| T Consensus 1 ~~~~~~-~~~~-~~~~~~~~~~~~---~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al------- 68 (441) T protein:vir:98 1 MHWYNT-DCYF-VDFKSRKQSRKE---LVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAI------- 68 (441) T ss_pred CceecC-ccce-eccccccchhhh---hhccccccccccccccCCCcchHHHHHHhhcccccCccccchhhhh------- Confidence 10 000 0000 000111122111 100000000000 0000000 00000000000 0011111111 Q ss_pred HHHhhccchHHHHHHhhhh-cccccceecccchhHHHHHH-HHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhh Q lcl|NC_011269. 117 RLFYATHDLVPLLIDIYSK-FPVVGMEFDSKDPLIKTFYE-DLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNE 191 (867) Q Consensus 117 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (867) .++-|=.||++.++ ..-..+++.-+....++--. +++. .+..+-.+|+..++ ..++.-|+.+-+...|. T Consensus 69 -----~~~~V~acv~~Ia~~iA~lpl~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~-~~lll~Gnay~~i~r~~ 142 (441) T protein:vir:98 69 -----RHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVF-VSALLTSHGYIEITRDK 142 (441) T ss_pred -----ccHHHHHHHHHHHHhhccCceEEecCCcccccchHHHHHhcccccCCCHHHHHHHHH-HHHhhcCCeEEEEEEcC Confidence 23445567776543 11111232222222222111 1211 12334458888866 78888899988888876 Q ss_pred hccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHh Q lcl|NC_011269. 192 SLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQ 271 (867) Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 271 (867) . |...++..|+|+.|.|... .+.++- +..+ .++ T Consensus 143 ~-G~~~~L~~i~~~~v~v~~~---~~g~~~-----------------------------------------~~~~--~~~ 175 (441) T protein:vir:98 143 T-GEPMNLTFRKTSEIELKLD---ARGRLY-----------------------------------------YFHQ--RID 175 (441) T ss_pred C-CcEEEEEEEcCceeEEEEC---CCCcEE-----------------------------------------EEEE--Eec Confidence 5 5578899999999988621 111110 0000 000 Q ss_pred hhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCC Q lcl|NC_011269. 272 AAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPD 351 (867) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 351 (867) ....+....++..-|-||++-... .-.|.+.+-.+-++|.......+....+.++-..|--++++-+ .+-+ T Consensus 176 ~~~~~~~~~~~~~dviHir~~~~d-g~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~--------~~~~ 246 (441) T protein:vir:98 176 SNGNNIERNVKFEDMLDIKFYSLD-GINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG--------VLDN 246 (441) T ss_pred cCcceeeEEEccccEEEeccCCCC-CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCC--------CCCC Confidence 011111123555567788753211 1268888888888887766666666666666666666666642 1445 Q ss_pred HHHHHHHHHHHHHhhhc-c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhh Q lcl|NC_011269. 352 QGELDEVRDDMQSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSAL 428 (867) Q Consensus 352 ~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 428 (867) ++..+.+|+.++..+.- + ...+|-.-|++++.....-+...+-+-.++..++|.+++||.-.++. ++..+|+..+. T Consensus 247 ~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg-~~~~~~s~~q~ 325 (441) T protein:vir:98 247 KKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFG-IETANMSITDA 325 (441) T ss_pred HHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcC-CCCCCccHHHH Confidence 77788899988877753 3 34667777888877753322222223346666789999999999995 66666666665 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccc Q lcl|NC_011269. 429 NREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNL 508 (867) Q Consensus 429 ~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~L 508 (867) +..++ +-++-+..+|++.+.+.+-.. ..++.|+|-...+.-.|..+.-+ ..+..+..... -..|+.-- +.| T Consensus 326 ~~~y~-~tl~P~~~~ie~~ln~~L~~~-----~~~~~~~fd~~~llr~d~~~~~~-~~~~~~~~G~~-T~NE~R~~-~gl 396 (441) T protein:vir:98 326 NLDYL-STLKPYITCVCAELNFKFNDE-----YVNREFKFDTTEIRVVDEKTQAE-IDKINIDSGKM-NIDEIRQR-DGL 396 (441) T ss_pred HHHHH-HHHHHHHHHHHHHHHhhcccc-----ccCceEEEechhhhccCHHHHHH-HHHHHHhCCCc-CHHHHHHH-hCC Confidence 55554 245555555555555554221 11233444333332222211111 11111111000 00111100 000 Q ss_pred cchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhccccccccccccc Q lcl|NC_011269. 509 RDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCD 571 (867) Q Consensus 509 r~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p 571 (867) .+..+.-.+ ...+..... .+++ ++.....+. ......++.-+.. + T Consensus 397 ---~pi~gGd~~----~~~~~~n~~--~~~~-~~~~q~~~~-------~~~~~~~kgGe~n-e 441 (441) T protein:vir:98 397 ---APIPGGNGS----IHRVDLNHV--NIEL-VDEYQMNKS-------RATDKKLKGGEEN-E 441 (441) T ss_pred ---CCCCCCCcc----eEeeccccc--cccc-ccccccccc-------cccccccCCCCCC-C Confidence 000000000 000111111 1111 000000000 0000011100000 0 No 77 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=97.73 E-value=1.5e-05 Score=47.02 Aligned_cols=501 Identities=12% Similarity=0.102 Sum_probs=201.3 Q ss_pred hhHHHHHHHH---hc-------CCCCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhccccccccee Q lcl|NC_011269. 12 WSAEVNRLRK---AG-------VNMPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQ 81 (867) Q Consensus 12 ~~~~~~~~~~---~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (867) ...-..++|- .| |.+ ++-.+++-+.|.+.+.+.+ -+..+...-.++..+.+.+ .|-... .+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~--~~~~~~----~~~ 72 (563) T protein:vir:99 1 MADLFKQFRLGKDYGNNSTIAQVPI-DEGLQANIKKIEQDNKEYQ-DLTKSLYGQQQAYAEPFIE--MMDTNP----EFR 72 (563) T ss_pred Chhhhhhhhcccccccccccceeec-cCChhhhHhhhhccchhHH-HHHhhhccCCCcchhhhHh--hhcccc----ccc Confidence 1111223332 33 334 5666677777766555433 1222222222232222222 111111 000 Q ss_pred eccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh----ccc--------ccceecccc-- Q lcl|NC_011269. 82 IAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK----FPV--------VGMEFDSKD-- 147 (867) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~--------~~~~~~~~~-- 147 (867) .-|...-| |+|+++ =+|. |+..|+|-.|||+.+. |.. .++.+..++ T Consensus 73 -~~~~~~~~---------~~~l~~---------~l~~-~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~ 132 (563) T protein:vir:99 73 -DKRSYMKN---------EHNLHD---------VLKK-FGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLD 132 (563) T ss_pred -ccccCCCC---------cccHHH---------HHHH-hhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecC Confidence 00111111 222222 1333 2345777777776653 221 122222111 Q ss_pred -----------hhHHHHHHHHhhccccc---HHHHhHHHHHHHHhhhhhhcchhhhhhh-ccceehheecCcceeehhhh Q lcl|NC_011269. 148 -----------PLIKTFYEDLFFGEDLN---YLEFLPDQFAREYFTVGEVTSLAHFNES-LGVWSSEEILNPDMLRVSRS 212 (867) Q Consensus 148 -----------~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 212 (867) ..|..|..++.+-.+.+ ..+|+..++ ..|+.-|..+-+..++.. .|...++..|+|..|+|... T Consensus 133 ~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv-~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~ 211 (563) T protein:vir:99 133 AEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIV-RDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATD 211 (563) T ss_pred CCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHH-HHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEEC Confidence 22333444433322333 346777755 788888888777665433 47788899999999998621 Q ss_pred hhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcc-cHHHHHHhhh Q lcl|NC_011269. 213 MFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDI-SEALISRVVN 291 (867) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 291 (867) .+..+- .+... +.++ ..++....+ ..++|-|+.+ T Consensus 212 ---~~g~~~--------------------------------~~~~~----y~~~------~~g~~~~~~~~~evI~~~~~ 246 (563) T protein:vir:99 212 ---KKGKII--------------------------------KGGKR----FVQV------VDKRVVASFTSRELAMGIRN 246 (563) T ss_pred ---CCCcee--------------------------------cccee----EEEE------eCCceeEEecCcceEEEecc Confidence 110000 00000 0011 111111122 2334445544 Q ss_pred cCc--cccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhc- Q lcl|NC_011269. 292 RPT--AWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA- 368 (867) Q Consensus 292 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 368 (867) -.+ -....|.|.+.-+..+|..............++-.+|--|+++-+ + .-.+.++++.+|+.|+..+.. T Consensus 247 ~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~----~---~~ls~e~~~~~~~~~~~~~~G~ 319 (563) T protein:vir:99 247 PRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRS----D---QQQSQHALENFKREWKSSLSGI 319 (563) T ss_pred CCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCC----C---CCCCHHHHHHHHHHHHHHhccc Confidence 322 223569999999999998887777777777777778877777642 1 124788999999999987753 Q ss_pred ch--h-hhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCcc-----------ceehhh-hhHHHH Q lcl|NC_011269. 369 DF--R-LMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGG-----------AYASSA-LNREFV 433 (867) Q Consensus 369 ~~--~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-----------~~~~~~-~~~~~~ 433 (867) ++ . ++|-.-|++++.+...-+..-+-+-.++..++|.+++||.-.+|.--+++ +|+++. ....|+ T Consensus 320 ~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~ 399 (563) T protein:vir:99 320 NGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQ 399 (563) T ss_pred cccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHH Confidence 43 2 47778889988886544444444555667889999999999998422333 333322 223355 Q ss_pred HHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhh Q lcl|NC_011269. 434 TQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQ 513 (867) Q Consensus 434 ~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~ 513 (867) .+-++-+..+|++.+.+.+-. +. ...+.+-|...+. +.+.....+-+. +...-+..+.-+. T Consensus 400 ~~tL~P~l~~ie~~ln~~L~~--~~----~~~~~~~f~r~D~------~~~~e~~~~~~~-------~~~G~lT~NE~R~ 460 (563) T protein:vir:99 400 NKGLQPLLRFIEDLVNRHIIS--EY----GDKYTFQFVGGDT------KSATDKLNILKL-------ETQIFKTVNEARE 460 (563) T ss_pred HHHHHHHHHHHHHHHHhhhch--hc----ccccEEEeccCCH------HHHHHHHHHHHH-------hcCCccCHHHHHH Confidence 555666777777777665421 11 1111222221222 111111111000 0001011111011 Q ss_pred hhhhhhhhhhceeeeeccc--cCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccccccC Q lcl|NC_011269. 514 ERAFIAQLKGMGVPVSDKT--LAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLAL 591 (867) Q Consensus 514 ~~~~v~qL~~~~~pitd~t--~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~ 591 (867) ..+ +..++. +..+-.+. .+.............+...-...+....+... +.++... +... ......... T Consensus 461 ~~g-l~Pi~g-GD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~-~~~~---~~~~~~~~~ 531 (563) T protein:vir:99 461 EQG-KKPIEG-GDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDN---DDSEEGQ-STDS---SNDDKEIGT 531 (563) T ss_pred HhC-CCCCCC-cceeecccccccccccccccCCCccccchhhhhcccccCCCC---CCCCCCC-CCCC---CCCcccccc Confidence 110 000000 00000000 00000000000000000000111111100000 0000000 0000 000000000 Q ss_pred CCCC-CCCC-CCCCCCCccCCCCccCCCCCCCcc Q lcl|NC_011269. 592 RQGK-TQTE-LGEAQAVAGEAQAELQTKQIEMQE 623 (867) Q Consensus 592 a~gp-gq~~-~~qa~~~agq~~~p~~~~~~~~qp 623 (867) ...+ .... .....+...+.... ..+..-.. T Consensus 532 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 563 (563) T protein:vir:99 532 DAQIKGDDNVYRTQTSNKGQGRKG--EKSSDFKH 563 (563) T ss_pred ccccccccccccccCccccccccC--cCcccccC Confidence 0000 0000 00000000000000 00000000 No 78 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=97.73 E-value=1.5e-05 Score=47.02 Aligned_cols=501 Identities=12% Similarity=0.102 Sum_probs=201.3 Q ss_pred hhHHHHHHHH---hc-------CCCCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhccccccccee Q lcl|NC_011269. 12 WSAEVNRLRK---AG-------VNMPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQ 81 (867) Q Consensus 12 ~~~~~~~~~~---~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (867) ...-..++|- .| |.+ ++-.+++-+.|.+.+.+.+ -+..+...-.++..+.+.+ .|-... .+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~--~~~~~~----~~~ 72 (563) T protein:vir:95 1 MADLFKQFRLGKDYGNNSTIAQVPI-DEGLQANIKKIEQDNKEYQ-DLTKSLYGQQQAYAEPFIE--MMDTNP----EFR 72 (563) T ss_pred Chhhhhhhhcccccccccccceeec-cCChhhhHhhhhccchhHH-HHHhhhccCCCcchhhhHh--hhcccc----ccc Confidence 1111223332 33 334 5666677777766555433 1222222222232222222 111111 000 Q ss_pred eccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh----ccc--------ccceecccc-- Q lcl|NC_011269. 82 IAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK----FPV--------VGMEFDSKD-- 147 (867) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~--------~~~~~~~~~-- 147 (867) .-|...-| |+|+++ =+|. |+..|+|-.|||+.+. |.. .++.+..++ T Consensus 73 -~~~~~~~~---------~~~l~~---------~l~~-~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~ 132 (563) T protein:vir:95 73 -DKRSYMKN---------EHNLHD---------VLKK-FGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLD 132 (563) T ss_pred -ccccCCCC---------cccHHH---------HHHH-hhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecC Confidence 00111111 222222 1333 2345777777776653 221 122222111 Q ss_pred -----------hhHHHHHHHHhhccccc---HHHHhHHHHHHHHhhhhhhcchhhhhhh-ccceehheecCcceeehhhh Q lcl|NC_011269. 148 -----------PLIKTFYEDLFFGEDLN---YLEFLPDQFAREYFTVGEVTSLAHFNES-LGVWSSEEILNPDMLRVSRS 212 (867) Q Consensus 148 -----------~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 212 (867) ..|..|..++.+-.+.+ ..+|+..++ ..|+.-|..+-+..++.. .|...++..|+|..|+|... T Consensus 133 ~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv-~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~ 211 (563) T protein:vir:95 133 AEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIV-RDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATD 211 (563) T ss_pred CCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHH-HHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEEC Confidence 22333444433322333 346777755 788888888777665433 47788899999999998621 Q ss_pred hhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcc-cHHHHHHhhh Q lcl|NC_011269. 213 MFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDI-SEALISRVVN 291 (867) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 291 (867) .+..+- .+... +.++ ..++....+ ..++|-|+.+ T Consensus 212 ---~~g~~~--------------------------------~~~~~----y~~~------~~g~~~~~~~~~evI~~~~~ 246 (563) T protein:vir:95 212 ---KKGKII--------------------------------KGGKR----FVQV------VDKRVVASFTSRELAMGIRN 246 (563) T ss_pred ---CCCcee--------------------------------cccee----EEEE------eCCceeEEecCcceEEEecc Confidence 110000 00000 0011 111111122 2334445544 Q ss_pred cCc--cccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhc- Q lcl|NC_011269. 292 RPT--AWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA- 368 (867) Q Consensus 292 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 368 (867) -.+ -....|.|.+.-+..+|..............++-.+|--|+++-+ + .-.+.++++.+|+.|+..+.. T Consensus 247 ~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~----~---~~ls~e~~~~~~~~~~~~~~G~ 319 (563) T protein:vir:95 247 PRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRS----D---QQQSQHALENFKREWKSSLSGI 319 (563) T ss_pred CCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCC----C---CCCCHHHHHHHHHHHHHHhccc Confidence 322 223569999999999998887777777777777778877777642 1 124788999999999987753 Q ss_pred ch--h-hhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCcc-----------ceehhh-hhHHHH Q lcl|NC_011269. 369 DF--R-LMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGG-----------AYASSA-LNREFV 433 (867) Q Consensus 369 ~~--~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-----------~~~~~~-~~~~~~ 433 (867) ++ . ++|-.-|++++.+...-+..-+-+-.++..++|.+++||.-.+|.--+++ +|+++. ....|+ T Consensus 320 ~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~ 399 (563) T protein:vir:95 320 NGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQ 399 (563) T ss_pred cccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHH Confidence 43 2 47778889988886544444444555667889999999999998422333 333322 223355 Q ss_pred HHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhh Q lcl|NC_011269. 434 TQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQ 513 (867) Q Consensus 434 ~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~ 513 (867) .+-++-+..+|++.+.+.+-. +. ...+.+-|...+. +.+.....+-+. +...-+..+.-+. T Consensus 400 ~~tL~P~l~~ie~~ln~~L~~--~~----~~~~~~~f~r~D~------~~~~e~~~~~~~-------~~~G~lT~NE~R~ 460 (563) T protein:vir:95 400 NKGLQPLLRFIEDLVNRHIIS--EY----GDKYTFQFVGGDT------KSATDKLNILKL-------ETQIFKTVNEARE 460 (563) T ss_pred HHHHHHHHHHHHHHHHhhhch--hc----ccccEEEeccCCH------HHHHHHHHHHHH-------hcCCccCHHHHHH Confidence 555666777777777665421 11 1111222221222 111111111000 0001011111011 Q ss_pred hhhhhhhhhhceeeeeccc--cCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccccccC Q lcl|NC_011269. 514 ERAFIAQLKGMGVPVSDKT--LAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLAL 591 (867) Q Consensus 514 ~~~~v~qL~~~~~pitd~t--~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~ 591 (867) ..+ +..++. +..+-.+. .+.............+...-...+....+... +.++... +... ......... T Consensus 461 ~~g-l~Pi~g-GD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~-~~~~---~~~~~~~~~ 531 (563) T protein:vir:95 461 EQG-KKPIEG-GDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDN---DDSEEGQ-STDS---SNDDKEIGT 531 (563) T ss_pred HhC-CCCCCC-cceeecccccccccccccccCCCccccchhhhhcccccCCCC---CCCCCCC-CCCC---CCCcccccc Confidence 110 000000 00000000 00000000000000000000111111100000 0000000 0000 000000000 Q ss_pred CCCC-CCCC-CCCCCCCccCCCCccCCCCCCCcc Q lcl|NC_011269. 592 RQGK-TQTE-LGEAQAVAGEAQAELQTKQIEMQE 623 (867) Q Consensus 592 a~gp-gq~~-~~qa~~~agq~~~p~~~~~~~~qp 623 (867) ...+ .... .....+...+.... ..+..-.. T Consensus 532 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 563 (563) T protein:vir:95 532 DAQIKGDDNVYRTQTSNKGQGRKG--EKSSDFKH 563 (563) T ss_pred ccccccccccccccCccccccccC--cCcccccC Confidence 0000 0000 00000000000000 00000000 No 79 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=97.73 E-value=3.3e-06 Score=50.64 Aligned_cols=407 Identities=13% Similarity=0.088 Sum_probs=185.4 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhcc Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATH 123 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 123 (867) |..-.++-- .+-++|.-+++ .++-+.+ .-+...-.....|+.-..-.+ -..|-++. +..+ T Consensus 1 ~~~~~~~~~-~~~~~g~~~~~---~~~f~~~---~~~~~~~~~~~~~~~~~~~~~-~~~v~~~~------------al~~ 60 (424) T protein:vir:18 1 MEEPKYTID-LRTNNGWWARL---KSWFVGG---RLVTPNQGSQTGPVSAHGYLG-DSSINDER------------ILQI 60 (424) T ss_pred CCCCccccc-cCCCCchHHHH---Hhhcccc---ccccccchhhccccccccccc-cccccHHH------------hhcc Confidence 211111111 12234544332 2221111 000000001111221100000 01122221 2334 Q ss_pred chHHHHHHhhhh----cccccceecccchhHHH-H----HHHHh---hcccccHHHHhHHHHHHHHhhhhhhcchhhhhh Q lcl|NC_011269. 124 DLVPLLIDIYSK----FPVVGMEFDSKDPLIKT-F----YEDLF---FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNE 191 (867) Q Consensus 124 ~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~-~----~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (867) +-|-.|||+-++ .|+.=++.+.++ ..++ - ..+++ -.+..+-.+|+..++ ..++.-|+.+-+.+.++ T Consensus 61 ~~v~~cv~~Ia~~iA~lp~~vy~~~~~~-~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gnay~~i~r~~ 138 (424) T protein:vir:18 61 STVWRCVSLISTLTACLPLDVFETDQND-NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMT-MQLCFYGNAYALVDRNS 138 (424) T ss_pred HHHHHHHHHHHHhhccCceEEEEeccCC-ceeeeccccHHHHHHhhccCCCCCHHHHHHHHH-HHHhhcCCeEEEEEECC Confidence 555667766543 333222222221 1111 0 11111 123456668888877 88889999988887776 Q ss_pred hccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHh Q lcl|NC_011269. 192 SLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQ 271 (867) Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 271 (867) . |...++..|+|+.|.|.. ..+.++- .| T Consensus 139 ~-G~~~~L~~l~~~~v~v~~---~~~~~~y------------------------------------~~------------ 166 (424) T protein:vir:18 139 A-GDVISLLPLQSANMDVKL---VGKKVVY------------------------------------RY------------ 166 (424) T ss_pred C-CcEEEEEEecCcceEEEE---cCCeEEE------------------------------------EE------------ Confidence 5 557889999999998862 2222111 01 Q ss_pred hhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCC Q lcl|NC_011269. 272 AAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPD 351 (867) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 351 (867) -..|....++.+-|-|+.+-... .-.|.+.+.-+-.+|..............+.-..|--++++... +-+ T Consensus 167 -~~~g~~~~~~~~eVihir~~~~d-g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~--------~l~ 236 (424) T protein:vir:18 167 -QRDSEYADFSQKEIFHLKGFGFT-GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK--------VLT 236 (424) T ss_pred -EeCCeEEEeccccEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCc--------CCC Confidence 01234456667778888754322 23688888888788877777777777777777788777777531 346 Q ss_pred HHHHHHHHHHHHHhhhcch--hhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehh--- Q lcl|NC_011269. 352 QGELDEVRDDMQSLLAADF--RLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASS--- 426 (867) Q Consensus 352 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~--- 426 (867) .+..+.+|+.|+....+++ ..+|-.-|++++.++..-....+-+-.++..++|.+++||...+|..-++++|..+ T Consensus 237 ~e~~~~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e 316 (424) T protein:vir:18 237 EQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) T ss_pred HHHHHHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHH Confidence 7888999999998887773 45666678888887644333323234467778999999999999965566666332 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccc Q lcl|NC_011269. 427 ALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTL 506 (867) Q Consensus 427 ~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~ 506 (867) +....|+..-+.=+..+|++.+.+.+-.-.+.. .+.|+|-+..+...|. |++. +. ..+..+ -+-+ T Consensus 317 q~~~~f~~~tl~P~~~~ie~~ln~~L~~~~~~~---~~~~~fd~~~llr~d~---~~r~-~~-~~~~~~-------~G~~ 381 (424) T protein:vir:18 317 QQNLGFLQYTLQPYISRWENSIQRWLIPSKDVG---RLHAEHNLDGLLRGDS---ASRA-AF-MKAMGE-------SGLR 381 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCCccccC---CeEEEEechhhhccCH---HHHH-HH-HHHHHh-------CCCc Confidence 233334444444444445555544442222222 1223333333322222 2111 11 111100 0001 Q ss_pred cccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhccccc Q lcl|NC_011269. 507 NLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAM 563 (867) Q Consensus 507 ~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~ 563 (867) ..+.-+...+ +..++. +....-+..-..+.. +....+.+ .+.. T Consensus 382 T~NE~R~~~g-l~pi~g-gD~~~~~~n~~~l~~-~~~~~~~~-----------~n~a 424 (424) T protein:vir:18 382 TINEMRRTDN-MPPLPG-GDVAMRQAQYVPITD-LGTNKEPR-----------NNGA 424 (424) T ss_pred CHHHHHHHhC-CCCCCC-cCeeeeccCccchhh-hhccCCcc-----------ccCC Confidence 1110010000 110000 000000000000000 00000000 0000 No 80 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=97.66 E-value=5e-06 Score=49.66 Aligned_cols=420 Identities=10% Similarity=-0.006 Sum_probs=184.4 Q ss_pred eccchhhhhhhhhHHhhCC----CchhhhHH-HHHHH--HHHHH-----HhhccchHHHHHHhhhhccccc--ceecccc Q lcl|NC_011269. 82 IAMPKIRQPLGTLADKGIP----FNVEDEEE-LRVIR--HWCRL-----FYATHDLVPLLIDIYSKFPVVG--MEFDSKD 147 (867) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~----~~~~~~~~-~~~~~--~~~~~-----~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~ 147 (867) |++=+ .-++......+. ..|...+. +-..- .|-.. =+-.++-|-.||++-++ .|.. |++.-++ T Consensus 1 Mg~~~--~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~-~iA~lp~~~~~~~ 77 (457) T protein:vir:62 1 MGFWS--ALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSE-TIATLPLSTYSKR 77 (457) T ss_pred Cchhh--hhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHH-hHhhCceEEEEec Confidence 33211 100100000000 00110000 00000 00000 01234567788888754 2222 2322222 Q ss_pred hh----HHHHHHHHhhcc---cccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHH Q lcl|NC_011269. 148 PL----IKTFYEDLFFGE---DLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERV 220 (867) Q Consensus 148 ~~----~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (867) .. ++.-....++.. .++-.+|+..++ ..++.-|+.+-+.+-+ .|.+.++..|+|+.|.|.+-. .+.+ T Consensus 78 ~~~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~-~~l~l~Gna~~~i~~~--~g~~~~l~~l~p~~v~v~~~~---~~~~ 151 (457) T protein:vir:62 78 GGTRKEIDTPEWLDFPNAEPGGMGRIDILSQTV-LSLLLQGNAFLAVRWA--GPNIAGLDVLDPTKIHVHMVM---VDGL 151 (457) T ss_pred CCccccccchHHHHhccccCCCCCHHHHHHHHH-HHHhhcCCeEEEEEeC--CCcEEEEEEEcCcceEEEEec---cCCc Confidence 22 222222333322 245677998876 7888899998777544 578889999999999987321 1100 Q ss_pred HHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCC--cccHHHHHHhhhcCccccc Q lcl|NC_011269. 221 QLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGL--DISEALISRVVNRPTAWAT 298 (867) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~ 298 (867) . ...|....-....++.+ .++..-|-|+.+-.+.... T Consensus 152 ~-----------------------------------------~~~~~~y~~~~~g~~~~~~~~~~~eiih~r~~~~~~~~ 190 (457) T protein:vir:62 152 R-----------------------------------------RKVFEAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDF 190 (457) T ss_pred c-----------------------------------------ceeEEEEEEccCCceeEEEeeCccceEEecCCCCCCce Confidence 0 00000000000011111 2334457788776665556 Q ss_pred cCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhh-cc--hhhhhh Q lcl|NC_011269. 299 RGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLA-AD--FRLMVH 375 (867) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~ 375 (867) .|.+.+--+-++|-...........+.+.-..|--++++.+ .-+.+.++.+|+.|+..+. ++ ...+|- T Consensus 191 ~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---------~ls~e~~~~~~~~~~~~~~G~~nag~~~vl 261 (457) T protein:vir:62 191 VGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPG---------TMSEEGLARAREAWRAANSGVDNAHRVALL 261 (457) T ss_pred ecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCC---------CCCHHHHHHHHHHHHHHhcCccccCcceec Confidence 89998877777776666555555556565556665666652 2477889999998888765 33 335565 Q ss_pred hhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehh---hhhHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_011269. 376 NFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASS---ALNREFVTQIMTGFQNALKRHIRRRC 452 (867) Q Consensus 376 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~---~~~~~~~~~~~~~~~~~l~~~~r~~~ 452 (867) .-|++++-.+..-+...+-+-.++..++|.+++||.-.+|.--++++|..+ +....|+..-+.=+..+|++++.+.+ T Consensus 262 ~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~~l~P~~~~ie~~ln~~L 341 (457) T protein:vir:62 262 TEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSLRPWLERIEAGFNRLL 341 (457) T ss_pred CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 667887777543332222233345677899999999999965566666433 45566777777777777887777766 Q ss_pred HHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhh---hhhh-----hhhhccccccccchhhhhhhhhhhhhc Q lcl|NC_011269. 453 EVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKV---PKLL-----IPEIKFSTLNLRDEAQERAFIAQLKGM 524 (867) Q Consensus 453 ~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~---~k~i-----~~~i~~~~~~Lr~e~~~~~~v~qL~~~ 524 (867) -.-.+.++ +.|+|-+..+.-.|..+.-+ ..+-.+... ...+ ++.+....- ....+.+. T Consensus 342 ~~~~~~~~---~~i~fd~~~l~~~d~~~r~~-~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~--------D~~~~~~n-- 407 (457) T protein:vir:62 342 FAETADRF---RFVKFNLDEIKRGAPKERME-LWSLGLQNGIYSIDEVRAAEDMTPLPDGLG--------EKYRVPLN-- 407 (457) T ss_pred cCccccCc---eEEEeechhhhccCHHHHHH-HHHHHHhCCCcCHHHHHHHhCCCCCCCCCc--------ceeeeccc-- Confidence 33333222 23333333332222211111 111111110 0000 111111100 00000000 Q ss_pred eeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccc Q lcl|NC_011269. 525 GVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQS 587 (867) Q Consensus 525 ~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~ 587 (867) ........+....... ..............+ .....+.|...+.-+.-.+ T Consensus 408 -------~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~---~~~~~~~~d~~~~~~~~~~ 457 (457) T protein:vir:62 408 -------LGEIGEEPEPEPAPAP---PAIDPPAEEPADDEE---PDNAEGDPDEGETEDDDDA 457 (457) T ss_pred -------cccccccccccccCCC---ccCCCCccCCCCCCC---CCCCCCCCccccccccccC Confidence 0000000000000000 000000000000000 0001111111111111111 No 81 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=97.65 E-value=1.1e-05 Score=47.70 Aligned_cols=405 Identities=13% Similarity=0.104 Sum_probs=175.3 Q ss_pred HHHHhcccccccceeeccchhhhh---hhhhHHh-hCCCchhhhHHHHHHHHHH-----HHHhhccchHHHHHHhhhh-- Q lcl|NC_011269. 67 LASYRKQGNFGSNMQIAMPKIRQP---LGTLADK-GIPFNVEDEEELRVIRHWC-----RLFYATHDLVPLLIDIYSK-- 135 (867) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~-~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~-- 135 (867) +---|+-|-| +++. .+..+ +...-.. .-|.++.-+ .+-.+..|- -+-+-.++-|-.||++-+. T Consensus 1 ~~~~~~mg~f-~r~~----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~i 74 (432) T protein:vir:81 1 MPDEKKLGLF-GQLK----AMFVPPDPVDIGGGQTFTPVNATAR-DLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAI 74 (432) T ss_pred CCchhhcchh-hhhh----hhcccccccccccccccccCccchh-hhcccccccCcccchHhhhccHHHHHHHHHHHHhh Confidence 1111111211 1110 01000 0000000 001111100 000000000 0123456777788887543 Q ss_pred --cccccceecccchhHHHH----HHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcce Q lcl|NC_011269. 136 --FPVVGMEFDSKDPLIKTF----YEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDM 206 (867) Q Consensus 136 --~~~~~~~~~~~~~~~~~~----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (867) .|+.=++.+ +| +.++- ..+++- .+..+-.+|+..++ ..++.-|+.+-+..-+ +|...++..|+|+. T Consensus 75 a~lp~~~y~~~-~~-g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~-~~lll~Gnayv~i~~~--~g~~~~L~~l~~~~ 149 (432) T protein:vir:81 75 AAMPLTMYMRT-PD-GRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVV-TRLLLDGTAYVRKVVT--DGRIESLQYLANDR 149 (432) T ss_pred hhCceeeEEec-CC-cceecccchHHHHHHhcccccCCHHHHHHHHH-HHHhhcCCeEEEEEec--CCcEEEEEEEcCCc Confidence 333212111 11 22221 111221 12344457888866 7888889987665544 47778889999999 Q ss_pred eehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHH Q lcl|NC_011269. 207 LRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALI 286 (867) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 286 (867) |.|... . .| . + - |+- +...|....++.+.| T Consensus 150 v~v~~~---~----------------~g-----~-~---------------~----y~~------~~~~g~~~~~~~~~i 179 (432) T protein:vir:81 150 LTITTD---P----------------KG-----N-T---------------A----YRY------RRTDGQMIDIPKQQI 179 (432) T ss_pred eEEEEC---C----------------CC-----c-E---------------E----EEE------EecCceEEEEccccE Confidence 988721 0 01 0 0 0 110 011234456777788 Q ss_pred HHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhh Q lcl|NC_011269. 287 SRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLL 366 (867) Q Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (867) -|+.+-... .-.|.+.|.-+-.+|-.-...........+.-..|--+.++-. .-+.+..+.+|+.++... T Consensus 180 ih~r~~~~d-g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~---------~l~~e~~~~~~~~~~~~~ 249 (432) T protein:vir:81 180 WKIMGYSLD-GENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR---------FLTDDQYDSFAKKVSGSV 249 (432) T ss_pred EEecCCCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC---------CCCHHHHHHHHHHHhhhh Confidence 888754321 2368888777666665554444444444444445655555532 236788889999888766 Q ss_pred hcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehh----hhhHHHHHHHHHHHHH Q lcl|NC_011269. 367 AADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASS----ALNREFVTQIMTGFQN 442 (867) Q Consensus 367 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~----~~~~~~~~~~~~~~~~ 442 (867) .+. ..+|-.-|++++.+...-+...+-+-.++..++|.+++||.-.+|...+.++|++. +....|+..-++-+.. T Consensus 250 nag-~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~~~tl~P~~~ 328 (432) T protein:vir:81 250 EAG-RAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLR 328 (432) T ss_pred cCC-CceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHHHHHHHHHHH Confidence 554 44555678888877654443333344567789999999999999976666666432 3445555555666666 Q ss_pred HHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhh--ccccccccchhhhhhhhhh Q lcl|NC_011269. 443 ALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEI--KFSTLNLRDEAQERAFIAQ 520 (867) Q Consensus 443 ~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i--~~~~~~Lr~e~~~~~~v~q 520 (867) .|++.+.+.+=.-.|.. .+.|+|-+..+..-|..+..+ .++-.+.....- ..++ .+...-+.+. +.+.. T Consensus 329 ~ie~~l~~kLl~~~~~~---~~~~~fd~~~llr~d~~~r~~-~~~~~~~~G~~t-~NE~R~~~glpp~~g~----~~~~~ 399 (432) T protein:vir:81 329 RIEQSIALNLLSPAERR---RYFADFDTSALLRADSAARSS-YYSQLVNNGLMT-RDEAREIEGLPKLGGN----AAVLT 399 (432) T ss_pred HHHHHHHhhccCccccC---ceEEEeechhhhccCHHHHHH-HHHHHHhCCCCC-HHHHHHHhCCCCCCCC----cceEe Confidence 67776766552222322 233444333332222211111 111111111000 0000 0000001000 00000 Q ss_pred hhhceeeeeccccCCCcc-cccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccc Q lcl|NC_011269. 521 LKGMGVPVSDKTLAVNID-MKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQ 583 (867) Q Consensus 521 L~~~~~pitd~t~p~tiq-me~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ 583 (867) +.....++.+.......+ ...+.. .+. +. ..+ T Consensus 400 ~~~~~~pl~~~~~~~~~~~~~~~~n--~~~----~~-------------------------~~~ 432 (432) T protein:vir:81 400 VQSAMVPLDSIGLQASPEPASGLGN--QQQ----DK-------------------------VSK 432 (432) T ss_pred ecCcccchhhhccCCCCCCCCCCCC--ccc----cc-------------------------ccC Confidence 000000000000000000 000000 000 00 000 No 82 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=97.55 E-value=1.4e-05 Score=47.23 Aligned_cols=402 Identities=12% Similarity=0.052 Sum_probs=182.1 Q ss_pred hhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCC---chhhhHH Q lcl|NC_011269. 32 MARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPF---NVEDEEE 108 (867) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~ 108 (867) |-...|.. + ..-++..+.+.--++......-.. .|-++. T Consensus 1 m~~~~~~~--------------------------~-----------~~~~~s~~~~w~~~~~~~~~~~~~~g~~vt~~~- 42 (421) T protein:vir:10 1 MFIPQMFE--------------------------G-----------KKRSVSGGGFWEAMLGGVRSSHSKAGVMITPET- 42 (421) T ss_pred CCCcchhc--------------------------c-----------cccccCcchhhHHHhhhhccCcccCCceechHH- Confidence 00000000 0 000011111111111100000000 011111 Q ss_pred HHHHHHHHHHHhhccchHHHHHHhhhhcccccceecc----cch---hHHHHHHHHhh----cccccHHHHhHHHHHHHH Q lcl|NC_011269. 109 LRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDS----KDP---LIKTFYEDLFF----GEDLNYLEFLPDQFAREY 177 (867) Q Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~---~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~ 177 (867) +-.++-|-.||++-++= |..+.|.. +|. .+++-..+-.| .+..+-.+|+..++ ..+ T Consensus 43 -----------al~~~~v~~~i~~Ia~~-iA~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~l 109 (421) T protein:vir:10 43 -----------ALALSAVRACVTLLAES-VAQLPVELYRRDKNGGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQ-GLL 109 (421) T ss_pred -----------hhccHHHHHHHHHHHHh-hccCceEEEEEcCCCceeecccchHHHHHhhcccCCCCHHHHHHHHH-HHH Confidence 12455566777766541 22222211 121 23332222222 23455778888876 788 Q ss_pred hhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhh Q lcl|NC_011269. 178 FTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMR 257 (867) Q Consensus 178 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 257 (867) +.-|+.+-+.+-+.. |...++..|+|+.|.|.+. .+ | .. T Consensus 110 ll~Gna~~~i~r~~~-G~~~~L~~l~~~~v~v~~~---~~----------------g---------------------~~ 148 (421) T protein:vir:10 110 GLEGNCYSIIDRDGK-GYPKELIPINPKKVIVLKG---PD----------------G---------------------MP 148 (421) T ss_pred hhcCCeEEEEEEcCC-CcEEEEEEecCceEEEEEC---CC----------------c---------------------eE Confidence 889999888887754 5566888999999988521 00 0 00 Q ss_pred hHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhh Q lcl|NC_011269. 258 EFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATL 337 (867) Q Consensus 258 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 337 (867) .| ++.+ .|-.++...|-|+.+-... .-.|.+.+--+-.+|-......+....+.+.-..|--++++ T Consensus 149 ~y-----~~~~--------~g~~~~~~eiih~~~~~~d-~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~ 214 (421) T protein:vir:10 149 YY-----EIPE--------IGETLPMRMMHHVKVFSLD-GYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIER 214 (421) T ss_pred EE-----EEcC--------CCcEEchhhEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEe Confidence 11 1111 1113445667777654322 23588887777777776666666666666776677666665 Q ss_pred cccccCCCCcCCCCHHHHHHHHHHHHHhhhc-c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhh Q lcl|NC_011269. 338 GIEDMGDGEPWIPDQGELDEVRDDMQSLLAA-D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEAL 414 (867) Q Consensus 338 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 414 (867) -. +-.. .-++++++.+|+-|+..+.. + ...+|-.-|++++.++..-+..-+-+-.++..++|.+++||.-.+ T Consensus 215 ~~----~~~~-~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~ 289 (421) T protein:vir:10 215 PK----EAPA-IKSQEKIDQLLAKWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHM 289 (421) T ss_pred cC----ccCc-cCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHH Confidence 42 1111 34889999999987776652 3 456677788899888765444443444566888999999999999 Q ss_pred hcCCCccceehhh-hhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhh Q lcl|NC_011269. 415 ISGGTGGAYASSA-LNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKV 493 (867) Q Consensus 415 ~~~g~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~ 493 (867) +...+.++|+++. ....|+..-+.=+...|++.+.+++=.-.|-. ...|+|-+..+.-.|..+.-+ ..+..++.. T Consensus 290 lg~~~~~t~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~---~~~v~fd~~~l~~~d~~~~~~-~~~~~~~~G 365 (421) T protein:vir:10 290 VQMLAKATNNNIEHQGLQFVMYTLLAWLKRHEGALQRDLLLPSERR---DLYIEFNVSGLLRGDQKSRYE-SYALGRQWG 365 (421) T ss_pred cCCCcCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccccC---CeEEEEechhhhccCHHHHHH-HHHHHHhCC Confidence 9888888998853 33344444455555555666655442222211 223444333332222222211 111111110 Q ss_pred hhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchh-hhhhHHHHHHHHhhcccccccccccccc Q lcl|NC_011269. 494 PKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQE-LERQADETVQKLMATAQAMKKVQDLCDA 572 (867) Q Consensus 494 ~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e-~e~k~~E~l~tL~~taet~kkvq~~~p~ 572 (867) .. -..++.. .+.+-........+..+... .+.+. .+.+.+ ...+..|....+.. T Consensus 366 ~~-T~NE~R~-~~gl~p~~ggD~~~~~~n~~--~~~~~-------~~~~~~~~~~~~~e~d~~~~~-------------- 420 (421) T protein:vir:10 366 WL-SVNDIRR-MENLPPIAGGDKYLTPLNMV--DSAQI-------IPGDKKPTAQQMAEIDTILSR-------------- 420 (421) T ss_pred Cc-CHHHHHH-HhCCCCCCCcceeeeccccc--ccccc-------ccCCCCcccccCccccccccc-------------- Confidence 00 0000000 00010000000001110000 00000 000000 00000000000000 Q ss_pred cCCCCCcccccccccc Q lcl|NC_011269. 573 QNLPYPPELAQHLQST 588 (867) Q Consensus 573 ~g~P~pp~~aQ~p~~t 588 (867) | T Consensus 421 ---------------~ 421 (421) T protein:vir:10 421 ---------------T 421 (421) T ss_pred ---------------C Confidence 0 No 83 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=97.49 E-value=3.5e-05 Score=45.00 Aligned_cols=385 Identities=10% Similarity=0.021 Sum_probs=165.3 Q ss_pred eeeccchhh------hhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhh----hcccccceecccchh Q lcl|NC_011269. 80 MQIAMPKIR------QPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYS----KFPVVGMEFDSKDPL 149 (867) Q Consensus 80 ~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~ 149 (867) |+.-.++-- -++.....-.....+-... ++ .++-|=.|||+-+ +.|+ ++-.+|.. T Consensus 1 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------Al------~~~~V~~~i~~Ia~~iA~lp~---~~~~~~g~ 65 (406) T protein:vir:97 1 MSFFQPLGTSKVSYDDYISSVLAGDVSQKYLGVS------AL------KNSDILTATSIIAGDIARFPL---VKKDVNGD 65 (406) T ss_pred CccccccCCCCCCcchHHHHHhcCCCCcccccch------hh------ccHHHHHHHHHHHHhhhhCee---EEEecCcc Confidence 111111000 0000000000000000000 00 2344445666543 3333 33333322 Q ss_pred H-HH-HHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHH Q lcl|NC_011269. 150 I-KT-FYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMV 224 (867) Q Consensus 150 ~-~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (867) + ++ -..+++. .+..+-.+|+..++ ..++.-|..+-+-+.+...|...++..|+|+.|.|.+. .+..+ T Consensus 66 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~-~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~---~~~~~---- 137 (406) T protein:vir:97 66 IIHDEDINYLLNVKSTSNASARTWKFAMA-VNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEET---DNHEI---- 137 (406) T ss_pred ccccchHHHHhhccCCCCCCHHHHHHHHH-HHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEc---CCceE---- Confidence 2 22 1222221 24566678999876 88899999999988887788889999999999988521 11000 Q ss_pred HHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccc-cCcch Q lcl|NC_011269. 225 KDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWAT-RGAPH 303 (867) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 303 (867) .| +|-+ ...+..+.++..-|-|+..-. +.. .|.+. T Consensus 138 ---------------------------------~y-----~~~~----~~~~~~~~~~~~evih~r~~~--~dg~~G~sp 173 (406) T protein:vir:97 138 ---------------------------------VY-----TFTD----MLTAKQVKCFAHDVIHWKFFS--HDTILGRSP 173 (406) T ss_pred ---------------------------------EE-----EEEe----cCCceEEEEccccEEEecCCC--CCCcccccH Confidence 01 1111 123444566677778886432 222 38888 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch--hhhhhhhheee Q lcl|NC_011269. 304 LLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF--RLMVHNFGLKV 381 (867) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 381 (867) +.-+..+|-.....+.....+.+.-..|-.|.+.+. .-+++..+.+|+.|+.....++ ..+|-..|.++ T Consensus 174 i~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~---------~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~ 244 (406) T protein:vir:97 174 LLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGA---------QLSGDARQRARQEFEKMREGSVGGSPLVFDSTMEY 244 (406) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecCC---------CCCHHHHHHHHHHHHHHhcccccCceeecCCCceE Confidence 777777665544444444444444445544444331 3478889999998888876653 45666677777 Q ss_pred eeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceeh-hhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhc Q lcl|NC_011269. 382 ENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYAS-SALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQG 460 (867) Q Consensus 382 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~ 460 (867) +-..-.-....+-+-.++..++|.+++||.-.+|. +.+ .|++ .+...+|++--+.-+...|++.+.+.+=.-.+..+ T Consensus 245 ~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg-~~~-~~~~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~~~ 322 (406) T protein:vir:97 245 TPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLG-VNS-PNQSVAQLMEDYVTNDLPFYFDAITSELGLKTLNDKDRRL 322 (406) T ss_pred EEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcC-CCC-CcchHHHHHHHHHHHHHHHHHHHHHHHHhhhhcChhhccc Confidence 66642221111112234457899999999999995 433 4443 23333444444444555555555444311122211 Q ss_pred ccchheehhhccccchhhhhhh-hhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhh-ceeeeeccccCCCcc Q lcl|NC_011269. 461 HYDYDLKGGVRVPIYREIVEYD-EETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKG-MGVPVSDKTLAVNID 538 (867) Q Consensus 461 ~~d~~~~~~~~~~~~rd~~~~k-~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~-~~~pitd~t~p~tiq 538 (867) +.+++-++ +.+..+ +.+++.. . -+-+..+.-+...+ ...+.. .+....-+..- T Consensus 323 ---~~i~fd~~-----~~~~~~~~~~~~~~-~-----------~g~~T~NE~R~~~g-~~p~~~~~gD~~~~~~n~---- 377 (406) T protein:vir:97 323 ---YHIEFDTR-----SVTGRNVDEIVKLV-N-----------NQILTPNQGLVELG-KQKSTDPNMDRYQSSLNY---- 377 (406) T ss_pred ---eeEEEecC-----ccchhhHHHHHHHH-h-----------CCCcCHHHHHHHhC-CCCCCCCCCCeEeeccCc---- Confidence 22333222 221111 1111110 0 01111111011100 000000 00000000000 Q ss_pred cccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 539 MKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 539 me~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) ++++...+.+. ...... ...+....-+. . T Consensus 378 ~~~~~~~~~~~-~~~~~~-----~gg~~~~~~~~-------~ 406 (406) T protein:vir:97 378 VFLDKKEEYQD-KVGIKG-----KGGEVNAEEDK-------S 406 (406) T ss_pred cchhccccccc-cccccc-----CCCCCCCCCCC-------C Confidence 01110000000 000000 00000000000 0 No 84 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=97.45 E-value=2.1e-06 Score=51.66 Aligned_cols=321 Identities=11% Similarity=0.026 Sum_probs=156.7 Q ss_pred chhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHH--hhccch----HHHH--HH Q lcl|NC_011269. 60 AEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLF--YATHDL----VPLL--ID 131 (867) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~----~~~~--~~ 131 (867) ..+|+++.+.........+-..+ ++-.|+ |.+.-+- ..-|+-.+ |-++|+ ++.| .+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~--------~~~~~~y-----~~~~~~~~~~~~epp~~~~~la~~~~~~ 64 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTF---SLSEIT--------ASPALDY-----VGIGFDENYNCYLPPVNRHALAKLPHQN 64 (345) T ss_pred CCccccccchhhhcCCCceEEEe---ecCCcc--------cchhhcc-----cceeeecCCccccCCCCHHHHHHHhhcc Confidence 34454444322222211111111 222332 2222222 22232122 456665 2222 22 Q ss_pred hhhhcccccceecccchhHHHHHHHHhh--cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeeh Q lcl|NC_011269. 132 IYSKFPVVGMEFDSKDPLIKTFYEDLFF--GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRV 209 (867) Q Consensus 132 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 209 (867) -|..=+ +.+ |.=+.-+-| .+.|.-.+|. - +...|+.-|+.+=+...|.. |.+-++..|.|++|++ T Consensus 65 ~~h~~~---i~~-------k~n~l~~~~~Pn~~~t~~~f~-~-~v~d~ll~Gnay~~i~rn~~-G~~~~L~pl~~~~vr~ 131 (345) T protein:vir:37 65 AQHGGI---LHS-------RANMVSATYEGGKALSKMEMR-A-LCLNLIQFGDVGLLKVRNGF-GQVVRLVPLSSLYLRV 131 (345) T ss_pred hhhcch---hhh-------hhhHHhhccCCCCCCCHHHHH-H-HHHHHHhcCCeEEEEEECCC-CCEEEEEEecCceeEE Confidence 222111 111 111111111 2334445553 2 33677778998888887774 4566888899999986 Q ss_pred hhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHh Q lcl|NC_011269. 210 SRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRV 289 (867) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (867) .. -++..+. ++.+ .....++...++..-|-|| T Consensus 132 ~~---d~~~~~~-----------------------------------------~~~~----~~~~~g~~~~~~~~eViHi 163 (345) T protein:vir:37 132 HK---DGGYSYL-----------------------------------------MKKS----LYDTAQEIYRYDAKDIIFI 163 (345) T ss_pred ee---cCCeeEE-----------------------------------------Eeee----eeccCceEEEEccccEEEE Confidence 41 1111111 0000 0011233445666778899 Q ss_pred hhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcc Q lcl|NC_011269. 290 VNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAAD 369 (867) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 369 (867) .+-.+.-.-.|.|-++-+.+++..-++-.......-+.-..|=-|+++- + + .-++++.|.+|.-|+...... T Consensus 164 r~~~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t-----~--~-~l~~e~~~~lk~~~~~~~g~~ 235 (345) T protein:vir:37 164 KLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYST-----D--P-DLTEEMEEEIARKISESKGVG 235 (345) T ss_pred cCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC-----C--C-CCCHHHHHHHHHHHHHhcCcc Confidence 8755555668999888888887654433322222222222232233322 1 2 237889999999887766555 Q ss_pred hh--hhhh-----hhheeeeeccccCccCchhHHHHHHH----HHHHHhhccchhhhc--CCCccceehhh-hhHHHHHH Q lcl|NC_011269. 370 FR--LMVH-----NFGLKVENVFGRESVPNLDADYDRIE----RKLLQAWGIGEALIS--GGTGGAYASSA-LNREFVTQ 435 (867) Q Consensus 370 ~~--~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~--~g~~~~~~~~~-~~~~~~~~ 435 (867) ++ |+|+ .-|+++.-++.. ..|++|.++| .+|++++||.-.|+. -..+++|+++. ....|+.. T Consensus 236 n~~~~~i~~~~g~~~G~~~~pl~~~----~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~~~~f~~~ 311 (345) T protein:vir:37 236 NFRSMFVNIAGGHPDGLKVIPIGDT----GTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYD 311 (345) T ss_pred ccCceeEecCCCCccceeEEEccCC----hhHHHHHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHHHHHHHHHH Confidence 32 4444 345666666543 3567775554 579999999998872 13345677764 34444555 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhh Q lcl|NC_011269. 436 IMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQ 487 (867) Q Consensus 436 ~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k 487 (867) -++=++.+|++.+-+.++ +.+ +..++|--. +++ | T Consensus 312 ~l~P~~~~ie~~ln~~~e----~~~--~~~i~F~~~-----~l~-------k 345 (345) T protein:vir:37 312 EVMPLQEIIAETINQDPE----IKN--LLKIKFREQ-----NFA-------K 345 (345) T ss_pred HHHHHHHHHHHHhhhhhc----cCC--cceEEECch-----hhc-------C Confidence 566666666666654332 221 111111110 111 1 No 85 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=97.43 E-value=2e-06 Score=51.86 Aligned_cols=324 Identities=11% Similarity=0.061 Sum_probs=158.4 Q ss_pred chhHHHHHHH-HhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHH--hhccchHHH-HHHhhhh Q lcl|NC_011269. 60 AEANRQRLAS-YRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLF--YATHDLVPL-LIDIYSK 135 (867) Q Consensus 60 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-~~~~~~~ 135 (867) ..+|+++-+. .... +..++-.=+|--| +|-++-| -+--||-.+ |-++|+-.. |..+++. T Consensus 1 ~~~~~~~~~~~~~~~----~~~~~~~f~~~~~--------~~~~~~~-----y~~~~~~~~~~~~epp~~~~~la~l~~~ 63 (345) T protein:vir:37 1 MKTNVKTDNKKGIVI----APINDRTFSLNEI--------SASPALD-----YVGIGFDENYNCYLPPVNRHALAKLPHQ 63 (345) T ss_pred CCCCccccchhhccc----CcceeEEeecCCc--------ccccchh-----hhhhhhcCCccccCCCCCHHHHHHHhhc Confidence 2333333111 1111 1111111111112 2222222 222355443 445554432 2233332 Q ss_pred ccccc--ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhh Q lcl|NC_011269. 136 FPVVG--MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSM 213 (867) Q Consensus 136 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (867) =+--. +.+.. ..|-..|. =.+.|+-.+|. - +-..|+.-|..+=+..-|.. |..-++..|.|.+|+|.. T Consensus 64 ~~~h~~~i~~k~--n~l~~~~~---Pn~~lt~~~f~-~-~~~d~ll~Gnay~~~~rn~~-G~~~~L~pl~~~~vr~~~-- 133 (345) T protein:vir:37 64 NAQHGGILHSRA--NMVSSLYE---GGKALSRMDMR-A-LCLNLIQFGDVGLLKVRNGF-GQVVRLVPLSSLYLRVRK-- 133 (345) T ss_pred ccccccceeeec--hHHHhhcc---CCCCCCHHHHH-H-HHHHHHhcCCeEEEEEEcCC-CcEEEEEEEcCceeEEEE-- Confidence 22222 22221 11111111 02224444443 2 33567778988877776654 566779999999998751 Q ss_pred hhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcC Q lcl|NC_011269. 214 FVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRP 293 (867) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (867) -.+..+. + +.| .....|+-+.++..-|-||.+-. T Consensus 134 -d~~~~~~-----------------------~-----------~~~-----------~~~~~g~~~~~~~~dVihir~~~ 167 (345) T protein:vir:37 134 -DGGYSYL-----------------------M-----------KKS-----------LYDTAQEIYRYDAKDIIFIKLYD 167 (345) T ss_pred -eCCeeEE-----------------------E-----------EEe-----------EecCCceEEEEccccEEEecCCC Confidence 1111100 0 000 00123344456666688988655 Q ss_pred ccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchh-- Q lcl|NC_011269. 294 TAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFR-- 371 (867) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 371 (867) +--.-.|.|-++-+.+++...++...-....-+.-..|=-++++- + + ..++++.+.+|+-++..-..+++ T Consensus 168 ~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~-----d--~-~l~~e~~~~lk~~~~~~~g~~n~~~ 239 (345) T protein:vir:37 168 PMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYST-----D--P-DLTEEMEEEIARKISESKGVGNFRS 239 (345) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEec-----C--C-CCCHHHHHHHHHHHHHhcCcccccc Confidence 445668999999999988776554443333333333343333332 1 1 34789999999977765545532 Q ss_pred hhhh-----hhheeeeeccccCccCchhHHHHHHH----HHHHHhhccchhhhc--CCCccceehhh-hhHHHHHHHHHH Q lcl|NC_011269. 372 LMVH-----NFGLKVENVFGRESVPNLDADYDRIE----RKLLQAWGIGEALIS--GGTGGAYASSA-LNREFVTQIMTG 439 (867) Q Consensus 372 ~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~--~g~~~~~~~~~-~~~~~~~~~~~~ 439 (867) |+|+ .-|+++.-++-. ..|++|.++| ++|+.++||.-.|+. ...++.|+++. ....|+..-++= T Consensus 240 ~~i~~p~g~~~G~~~~pls~~----~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~~f~~~~l~P 315 (345) T protein:vir:37 240 MFVNIANGHPDGLKVIPIGDT----GTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDEVMP 315 (345) T ss_pred eEEEcCCCcccceEEEEccCC----hhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHHHHHHHHHHH Confidence 4444 367777776542 3466666654 569999999999872 12345677754 344445455666 Q ss_pred HHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhh-hhhhh Q lcl|NC_011269. 440 FQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYD-EETGQ 487 (867) Q Consensus 440 ~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k-~e~~k 487 (867) ++.+|++.+-+.++ +.++. .+.|+ .++.| T Consensus 316 ~~~~ie~~ln~~~~----~~~~~---------------~i~F~~~~L~~ 345 (345) T protein:vir:37 316 LQEIIAETINQDPE----IKNLL---------------KIKFREQNFAK 345 (345) T ss_pred HHHHHHHHhhhhcc----CCCcc---------------eEEecchhhcC Confidence 66666666654332 11111 11121 11111 No 86 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=97.41 E-value=1.3e-05 Score=47.34 Aligned_cols=408 Identities=12% Similarity=0.023 Sum_probs=187.2 Q ss_pred hhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHH Q lcl|NC_011269. 32 MARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRV 111 (867) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 111 (867) |-|..|+--.-.--..-+.+.|..+ +. ...+.+-.+...+.. +.....+. .|-.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~lf~~-~~----------~~~~~~~~~~~~~~~-------~~~~~~~~--~vs~~~---- 56 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDALFRS-KS----------LENPSTPITGDAVDT-------DGLFRADV--YVSPET---- 56 (424) T ss_pred CeeEeeeceecCcchhHHHHhhccc-cC----------CCCCccccchhhhhh-------hccccCCc--eechHH---- Confidence 4454444322211111122222111 11 111111111111000 00000000 011111 Q ss_pred HHHHHHHHhhccchHHHHHHhhh----hccccc-------ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhh Q lcl|NC_011269. 112 IRHWCRLFYATHDLVPLLIDIYS----KFPVVG-------MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTV 180 (867) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (867) +-.++-|-.|||+-+ ..|+.= .+-..++++.+-+-+. =.+..+-.+|+..++ ..++.- T Consensus 57 --------al~~~~v~~cv~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~l~~lL~~~--PN~~~t~~~f~~~~v-~~lll~ 125 (424) T protein:vir:45 57 --------AMKLAAVYSCIYVLSSSLAQMPLHVMRRHKGKVEPARDHPAFYLVHDE--PNTWQTSYKWRELKQ-RHILGW 125 (424) T ss_pred --------hhccHHHHHHHHHHHHHHhhCceEEEEecCCceeecccchHHHHHHhh--cccCCCHHHHHHHHH-HHHhhc Confidence 112344445555443 233211 1111122221111110 113455567888755 788888 Q ss_pred hhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHH Q lcl|NC_011269. 181 GEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQ 260 (867) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 260 (867) |+.+-+.+-|.. |...++..|+|+.|.|... .++.+ T Consensus 126 Gna~~~i~r~~~-G~~~~L~~l~~~~v~i~~~---~~~~~---------------------------------------- 161 (424) T protein:vir:45 126 GNGYTWVKRNRR-GEVISLDCCMPWETTLMNT---GGRYT---------------------------------------- 161 (424) T ss_pred CCeEEEEEEcCC-CcEEEEEEecCceEEEEEc---CCeEE---------------------------------------- Confidence 999888877765 5567889999999987631 11111 Q ss_pred HHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhccc Q lcl|NC_011269. 261 DLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIE 340 (867) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 340 (867) |+-+ ..+....++.+-|-|+.+...- --.|.+.+.-+-.+|-...........+.++-..|--++++-. T Consensus 162 --y~~~-------~~~~~~~~~~~eVih~r~~~~d-~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~- 230 (424) T protein:vir:45 162 --YGLY-------NEYGAFAISPDDMIHIRALGNN-QKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKS- 230 (424) T ss_pred --EEEE-------ecCceEEECcccEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC- Confidence 0000 0112234566677888754322 2358888887777776666666666666666666665665542 Q ss_pred ccCCCCcCCCCHHHHHHHHHHHHHhhhc--c--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhc Q lcl|NC_011269. 341 DMGDGEPWIPDQGELDEVRDDMQSLLAA--D--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALIS 416 (867) Q Consensus 341 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 416 (867) .-+.+..+.+|+.|+..... + ...+|---|++++.....-...-+-+-.++..++|.+++||.-.++. T Consensus 231 --------~l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg 302 (424) T protein:vir:45 231 --------GLNKESWGWLKDQWQKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMIN 302 (424) T ss_pred --------CCCHHHHHHHHHHHHHHhccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhC Confidence 12678899999988877654 2 35666677888887765443333334556777899999999999998 Q ss_pred CCCccceehh-hhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhh Q lcl|NC_011269. 417 GGTGGAYASS-ALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPK 495 (867) Q Consensus 417 ~g~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k 495 (867) ..++++|+++ +....|++.-++-+...|++.+.+.+-.-.|... .+.|+|-...+.-.|..+.-+ ..+-.+..... T Consensus 303 ~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~kLl~~~e~~~--g~~i~fd~~~llr~d~~~r~~-~~~~~~~~g~~ 379 (424) T protein:vir:45 303 DLEKATFSNISAQAIQFVRYTMMPWVTNWEQELNRRLFTRAELAA--GYYVRFNLTGLLRGTPQERAQ-FYHFAITDGWM 379 (424) T ss_pred CCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhcC--CcEEEeechhhhccCHHHHHH-HHHHHHhCCCc Confidence 8888999885 4556667777777888888887776633333321 123343333332222211111 11111110000 Q ss_pred hhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHH Q lcl|NC_011269. 496 LLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADE 551 (867) Q Consensus 496 ~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E 551 (867) -..|+..- +.+.........+..+... .+ .+.....+.+. .+..+ T Consensus 380 -T~NE~R~~-~gl~pi~ggD~~~~~~n~~-----~~-~~~~~~~~~~~---~~~~~ 424 (424) T protein:vir:45 380 -SRNEARAF-EDMNPVEGLDEMLVSVNAA-----NP-AGDFKPPKNDE---GKTNE 424 (424) T ss_pred -CHHHHHHH-hCCCCCCCcceeeeccccc-----cc-ccccCCCCCCC---CCCCC Confidence 01111100 0010000000000100000 00 00000000000 00000 No 87 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=97.41 E-value=1.6e-05 Score=46.89 Aligned_cols=375 Identities=11% Similarity=0.072 Sum_probs=162.7 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCC-CchhhhHHHHHHHHHHHHHhhc Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIP-FNVEDEEELRVIRHWCRLFYAT 122 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 122 (867) |. +. .+++..++-.+ +. ..-..+.+...++- +.. ..+-.+ . |-. T Consensus 1 Mg-------~~-~~~~~~~~~~~-----~~------~~~~~~~~~~~~~~----~~~~~~v~~~-----------~-al~ 45 (385) T protein:vir:10 1 MG-------LL-TPRNFNKRKAK-----NM------VYPSNPAFFTTTVG----GMQLSYVSAL-----------S-ALQ 45 (385) T ss_pred Cc-------cc-cchhccccccc-----cc------ccccchhhhhhhcc----ccCccccCHH-----------H-hhc Confidence 10 00 00011011000 00 00001111111110 000 111111 1 234 Q ss_pred cchHHHHHHhhhh-cccccceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehhee Q lcl|NC_011269. 123 HDLVPLLIDIYSK-FPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEI 201 (867) Q Consensus 123 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 201 (867) ++.|-.||++.++ +.- +.|..++..++.+... =-+..+-.+|+..++ ..++.-|+.+-+..-+. .| +.. T Consensus 46 ~~~v~~~i~~ia~~ia~--~p~~v~~~~~~~ll~~--PN~~~t~~~f~~~~~-~~l~l~Gn~~~~i~r~~---~~--~~p 115 (385) T protein:vir:10 46 NTNVYSVINRIASDVAS--AHFKTENTATLNRLES--PSSLIGRFSFWQGAL-MQLCLSGNDYIPLVGQN---LE--HIP 115 (385) T ss_pred cHHHHHHHHHHHHHHhh--Cceeeeccchhhhhhc--CCCCCCHHHHHHHHH-HHhhhcCCeEEEEEcCc---ee--Eee Confidence 6778889998876 332 3333444433332221 013345567777766 66777788876654221 11 223 Q ss_pred cCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcc Q lcl|NC_011269. 202 LNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDI 281 (867) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 281 (867) +++..|++.. +. .| .. | +| .....+..+.+ T Consensus 116 ~~~~~v~~~~-----~~--------------~~------~~----------------~-----~~----~~~~~~~~~~~ 145 (385) T protein:vir:10 116 NSDVQINYLP-----GN--------------MG------IV----------------Y-----TV----LESNDRPQMVL 145 (385) T ss_pred cCCceEEEEE-----cC--------------Cc------eE----------------E-----EE----EEcCCceEEEE Confidence 3333333220 00 00 00 0 00 00112233456 Q ss_pred cHHHHHHhhhcCcc-cc-ccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHH Q lcl|NC_011269. 282 SEALISRVVNRPTA-WA-TRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVR 359 (867) Q Consensus 282 ~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 359 (867) +..-|-|++.-+.. |. ..|.|.+..+-++|-......+......+.-..|--++++-+ . +=+.++.+.+| T Consensus 146 ~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~------~--~~~~e~~~~~~ 217 (385) T protein:vir:10 146 RQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISN------Y--LSDGKDLESAR 217 (385) T ss_pred ccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC------C--CCCHHHHHHHH Confidence 66778898865433 43 469999988888887766666555556565566766666642 1 33567899999 Q ss_pred HHHHHhhhcch--hhhhhhhheeeeeccccCccCch-hHHHHHHHHHHHHhhccchhhhcCCCcc--ceehhhhhHHHHH Q lcl|NC_011269. 360 DDMQSLLAADF--RLMVHNFGLKVENVFGRESVPNL-DADYDRIERKLLQAWGIGEALISGGTGG--AYASSALNREFVT 434 (867) Q Consensus 360 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~--~~~~~~~~~~~~~ 434 (867) +-|+..+..++ ..+|-.-|++++-....-..... .+-.+...++|.+++||.-+++.+++++ +|++..--+ T Consensus 218 ~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~---- 293 (385) T protein:vir:10 218 EEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIK---- 293 (385) T ss_pred HHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHH---- Confidence 99999887764 35566678888887665444432 2233666889999999999999976644 556543212 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHhhc-ccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhh Q lcl|NC_011269. 435 QIMTGFQNALKRHIRRRCEVVAEAQG-HYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQ 513 (867) Q Consensus 435 ~~~~~~~~~l~~~~r~~~~~i~e~q~-~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~ 513 (867) ..++..|.-++++... ||+- ++.++++|-+..+..-|..+.-+-. +-.+..... -..|+..- +.+-..-+ T Consensus 294 ---~~~~~~l~P~~~~ie~---~l~~~l~~~~~~f~~~~ll~~d~~~~~~~~-~~~~~~G~~-T~NE~R~~-~g~~p~p~ 364 (385) T protein:vir:10 294 ---ATYLANLNSYVNPIVD---ELRLKMNAPDLELDIKDMLDVDDSALINQV-SNLAKSGVL-GAEQAQFI-LTRSGFLP 364 (385) T ss_pred ---HHHHHHHHHHHHHHHH---HHHHhhCCceEEeechhhhccCHHHHHHHH-HHHHhCCCc-CHHHHHHH-hCCCccCC Confidence 1122234443333222 2222 2334455544333322221111111 111111000 00111100 00000000 Q ss_pred hhhhhhhhhhceeeeeccccCCCcccccchhhhh Q lcl|NC_011269. 514 ERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELER 547 (867) Q Consensus 514 ~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~ 547 (867) ..... +.-+.. .++...+.+. T Consensus 365 ~~~~~---------~~~~~~----~~~~g~~~dn 385 (385) T protein:vir:10 365 DNLPE---------FKPLTT----QVKGGDEGDN 385 (385) T ss_pred CCCcc---------ccCccc----ccCCCCCCCC Confidence 00000 000000 0000000000 No 88 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=97.33 E-value=9.3e-05 Score=42.68 Aligned_cols=425 Identities=13% Similarity=0.137 Sum_probs=175.1 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcc----cccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHH Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQ----GNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLF 119 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 119 (867) |+-. |=+++.+.+.. -||+.+|-. ..-+.++.-.. -+-|..+ +. ..|+.. T Consensus 1 ~~~~---------------~~a~~~~~~~~a~~~~~~~~~~g~--~~~~d~~~~~~-~~~~~~~-~~-------~~l~~l 54 (461) T protein:vir:80 1 MYSI---------------DKAKQAKIDSKIVNRNDFMVGHGK--ANSRDKLTRQT-PGNGQKL-DL-------KACENL 54 (461) T ss_pred Cccc---------------hhhhhhhhhhhhhhhhHHHhhcCC--cchhhhhhccc-cCccccc-CH-------HHHHHH Confidence 1110 00010011100 011111111 11111111000 0111211 11 268899 Q ss_pred hhccchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceeh Q lcl|NC_011269. 120 YATHDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSS 198 (867) Q Consensus 120 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (867) |+.|.|+...||+.-++.+.. ++|.++|+..++.++. +| ++|++-+-|.+.+ +-+..-|...=+-+..++ ..|.. T Consensus 55 Y~~~~l~r~iVd~~a~d~~r~g~~i~~~~~~~~~~~~~-~~-~~l~~~~~l~~~~-~~~rl~G~a~i~i~v~d~-~~~~~ 130 (461) T protein:vir:80 55 YASNSIAMNIVDIISEDMVRAGWSLKTDNKEMKKNIES-KW-RKLKTKDRFQKLY-ADKRLYGDGFLSIGVVSS-NREQA 130 (461) T ss_pred HHhCCccchhhccchHHhhcCCeeeecCCHHHHHHHHH-HH-HHhhHHHHHHHHH-HhhcccccEEEEEEeecC-Ccccc Confidence 999999999999999999876 8999999877776655 45 5899999998876 555555654333322222 12321 Q ss_pred --heecCcceee-hhhhhhhc-chHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHH-----H--Hhch Q lcl|NC_011269. 199 --EEILNPDMLR-VSRSMFVQ-RERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDL-----Q--RRYP 267 (867) Q Consensus 199 --~~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~--~~~~ 267 (867) ..-|||..++ |+ |.- -.+.++.+..+. +.|.+. +|..- . .... T Consensus 131 ~~~~pl~~~~~~~~~---~l~~~~~~~i~~~~~~----~dp~sp-------------------~fg~P~~y~i~~~~~~~ 184 (461) T protein:vir:80 131 DLSTAIDPKTIKSIP---YINTFNTQKVTQLYLN----QDMFSE-------------------HFGEVEFFEVNRVSQLG 184 (461) T ss_pred CccCCccccccccee---EEEeccccccchhhhc----ccCcCc-------------------ccccceEEEEecccccc Confidence 1223443321 11 100 000011111111 111111 11110 0 0001 Q ss_pred H-HHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCC Q lcl|NC_011269. 268 E-IIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGE 346 (867) Q Consensus 268 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (867) + .+.....+....|...=|.|+.+...++..||.|++-+++-.|..-++-. ..+++-++. ..+..+.... T Consensus 185 ~~~~~~~~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~---~~~~~l~~~-~~~~v~k~~~----- 255 (461) T protein:vir:80 185 EEILSGTTASTSEQIHRSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSL---WSVGQILYD-FAFKVYKTDD----- 255 (461) T ss_pred ccccccccCccceEEccccEEEecCCCCCccccCcchHHHHHHHHHHHHHHH---HHHHHHHHH-hCCCceecch----- Confidence 1 11222334445566666778888888999999999999988776554433 334432222 1222222111 Q ss_pred cCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCc-------cCchhHHHHHHHHHHHHhhccchhhhcCCC Q lcl|NC_011269. 347 PWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRES-------VPNLDADYDRIERKLLQAWGIGEALISGGT 419 (867) Q Consensus 347 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 419 (867) |+.+=+|....+.+=+-++--|.|+-+ ++..+. +=.|++=++.....|.-+.+|-...+- |. T Consensus 256 --------l~~~~~~~~~~~~~~~~~~~~~~g~~~--~d~~e~~e~~~~~lsgl~~~l~~~~~~iaa~s~iP~t~L~-G~ 324 (461) T protein:vir:80 256 --------IDALNKDDKANLTAMLDFMFRTEALAI--IKGDEQLTKESTNVSGMKDLLDYGWDYLAGAVRMPKTVLK-GQ 324 (461) T ss_pred --------HHhhhchHHHHHHHHHHHhcCCceEEE--EcCCcceEEEecCcCCHHHHHHHHHHHHhhhhcCCeeeee-cc Confidence 111111111111111111222334433 222222 224567888888899999999998887 44 Q ss_pred ccce-ehhhhhHHHHHHHHHHHH-HHHHHHHhhhhHHHHH-hhc---ccc---hheehhhccccchhh---hhhhhhhhh Q lcl|NC_011269. 420 GGAY-ASSALNREFVTQIMTGFQ-NALKRHIRRRCEVVAE-AQG---HYD---YDLKGGVRVPIYREI---VEYDEETGQ 487 (867) Q Consensus 420 ~~~~-~~~~~~~~~~~~~~~~~~-~~l~~~~r~~~~~i~e-~q~---~~d---~~~~~~~~~~~~rd~---~~~k~e~~k 487 (867) .+.. |++.=-+......+-.+| ..|+.++.++++.|.- ..+ ..| .++++.|+.|..++- .+..+++++ T Consensus 325 s~g~~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~ 404 (461) T protein:vir:80 325 EAGTLTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAE 404 (461) T ss_pred cCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHH Confidence 4332 322212222334455555 3466666666665532 111 222 367777777765443 222222221 Q ss_pred hHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccc Q lcl|NC_011269. 488 EYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQA 562 (867) Q Consensus 488 ~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet 562 (867) - +. + -+.-+.+ ....+...+.+- . -+.....-..+..+.+. .... .+... -.+.... T Consensus 405 a-~~-----~--~~~~g~i---s~~e~r~~l~~~---~-~~~~~~~~~~~~~~~~~-~~~~-~~~~~-~~e~~~g 461 (461) T protein:vir:80 405 A-DQ-----I--YIVNGVL---DPDEVKETRFGR---F-GLENSSKFSGDSAEIDK-LAKL-VYDAY-AKKNADG 461 (461) T ss_pred H-HH-----H--HHhcCCC---CHHHHHHHHHHh---c-CCCCCccCCCCCchhhh-hhhh-ccccc-cccCCCC Confidence 1 00 0 1111111 111122222210 0 00000000111111110 0000 00000 0000000 No 89 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=97.26 E-value=6.1e-05 Score=43.67 Aligned_cols=409 Identities=12% Similarity=0.090 Sum_probs=178.2 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhccc---ccccceeec-cchhhhhhh-hhHHhhCCCchhhhHHHHHHHHHHHH Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQG---NFGSNMQIA-MPKIRQPLG-TLADKGIPFNVEDEEELRVIRHWCRL 118 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 118 (867) |-+... -|+-. |+.++..+. +.+++-++. .....+-++ ...+.+..-|.+ T Consensus 1 ~~~~~~--------~g~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~-------------- 55 (432) T protein:vir:97 1 MPDEKK--------LGLLG---QLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNAD-------------- 55 (432) T ss_pred CCCccc--------Cchhh---hhHhhcCCccccccccccccccCchhhhhhcccccccCcccchH-------------- Confidence 222111 13322 333333221 111221111 111111111 111122222211 Q ss_pred HhhccchHHHHHHhhhhcccccceecc---cchhHHHHH----HHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhh Q lcl|NC_011269. 119 FYATHDLVPLLIDIYSKFPVVGMEFDS---KDPLIKTFY----EDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAH 188 (867) Q Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~----~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (867) =+-.++.|-.||++-+. .|..+.|.. ++++.++-. .+++- .+..+-.+|+.-++ ..++.-|..+-+.+ T Consensus 56 ~a~~~~aV~~~v~~Ia~-~ia~lp~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~-~~lll~Gnay~~~~ 133 (432) T protein:vir:97 56 AIMRLDAVAACVKLVSQ-AVAAMPLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVV-TRLLLDGTAYVRKV 133 (432) T ss_pred hhhcchHHHHHHHHHHH-hhccCceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHH-HHHhhcCCeEEEEE Confidence 12345677778877643 222222211 112222211 11211 12355567888766 77788899887777 Q ss_pred hhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchH Q lcl|NC_011269. 189 FNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPE 268 (867) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 268 (867) -+ +|.-.++..|+|+.|.|... .+.++- |+- T Consensus 134 ~~--~g~~~~L~~l~p~~v~v~~~---~~g~~~-----------------------------------------y~~--- 164 (432) T protein:vir:97 134 VT--DGRIESLQYLANDRLTITTD---TKGNTA-----------------------------------------YRY--- 164 (432) T ss_pred ec--CCcEEEEEEEcCcceEEEEc---CCCcEE-----------------------------------------EEE--- Confidence 65 46677889999999988621 110100 110 Q ss_pred HHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcC Q lcl|NC_011269. 269 IIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPW 348 (867) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 348 (867) .-..|..+.++...|-|+.+-. -=--.|.+.+.-+-++|-.-....+....+.++-..|--++++-. T Consensus 165 ---~~~~g~~~~~~~~~iih~r~~~-~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~--------- 231 (432) T protein:vir:97 165 ---RRTDGQMIDIPRQQIWKIMGYS-LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR--------- 231 (432) T ss_pred ---EecCceEEEEccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCC--------- Confidence 1113445567777788887532 112368888887777775554444444444555556655665542 Q ss_pred CCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceeh--- Q lcl|NC_011269. 349 IPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYAS--- 425 (867) Q Consensus 349 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~--- 425 (867) .-+.+..+.+|+.++...-+. ..+|---|++++.+...-+...+-.-.++..++|.+++||.-.+|...+.++|++ T Consensus 232 ~l~~e~~~~~~~~~~~~~nag-~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~ 310 (432) T protein:vir:97 232 FLTDDQYDSFSKKVSGSVEAG-RAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSG 310 (432) T ss_pred CCCHHHHHHHHHHHhhhhcCC-CceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchh Confidence 236677788888777655554 3455566788877754332222223356778899999999999997556656642 Q ss_pred h-hhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhh-hhhhcc Q lcl|NC_011269. 426 S-ALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLL-IPEIKF 503 (867) Q Consensus 426 ~-~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i-~~~i~~ 503 (867) . +....|+..-++-+..+|++.|.+.+=.-.|-. .+.++|-+..+...|..+.-+ ...-.+.....-+ .-|..+ T Consensus 311 ~e~~~~~f~~~tl~P~~~~ie~~ln~kLl~~~e~~---~~~~~fd~~~llr~d~~~r~~-~~~~~~~~G~~T~NE~R~~~ 386 (432) T protein:vir:97 311 IESQQLGFLTMTLSPWLRRIEQSIALNLLTPAERR---RYFADFDTSALLRADSAARSS-YYSQLVNNGLMTRDEAREIE 386 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhccCccccC---ceEEEeechhhhccCHHHHHH-HHHHHHhCCCCCHHHHHHHh Confidence 2 344455655566666666666665442112211 223444333332222222111 1111111100000 000001 Q ss_pred ccccccchh---hhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 504 STLNLRDEA---QERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 504 ~~~~Lr~e~---~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) ...-+.+.. ......+.++.....-+......+. -+++ +. T Consensus 387 glpp~~g~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~---~~~~-----------------------------~~----- 429 (432) T protein:vir:97 387 GLPKLGGNAAVLTVQSAMVPLDSIGLQASPEPASGLG---NQQQ-----------------------------DK----- 429 (432) T ss_pred CCCCCCCCcceEeecccccchhhhcccCCCCCCCCCC---Cccc-----------------------------cc----- Confidence 101010000 0000000000000000000000000 0000 00 Q ss_pred ccc Q lcl|NC_011269. 581 LAQ 583 (867) Q Consensus 581 ~aQ 583 (867) ..+ T Consensus 430 ~~~ 432 (432) T protein:vir:97 430 VSK 432 (432) T ss_pred ccC Confidence 000 No 90 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=97.19 E-value=5.5e-05 Score=43.95 Aligned_cols=472 Identities=13% Similarity=0.060 Sum_probs=206.6 Q ss_pred cchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHH----HHHHhhhhcccccceecccchhHHHHHHHHhh Q lcl|NC_011269. 84 MPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVP----LLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFF 159 (867) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (867) |++|.-+.|......- ..+.. .....+.|..|..||.-- .+-.|++..--+|+. ..-+-|++|.- T Consensus 1 ~~~~~d~~g~p~~~~~---~~~~~--~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~------~~~~L~edm~e 69 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQ---LREPQ--TSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQ------AQAELFMDMEE 69 (526) T ss_pred CCeeeCCCCCccCccc---cchhh--hhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHH------HHHHHHHHHHh Confidence 7888877766432110 01111 122335555555555432 344566655444321 12355666642 Q ss_pred cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecC--cceeehh---hhhhhcchHHHHHHHHHHhhcccc Q lcl|NC_011269. 160 GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILN--PDMLRVS---RSMFVQRERVQLMVKDLVDHLRQG 234 (867) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 234 (867) .|..|..-|-. |..-+.|- .|.=+---| ++..++. +.+|-..+-.+-++++|.|++-.| T Consensus 70 -~D~~i~s~l~~---Rk~av~~~------------~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~ldA~~~G 133 (526) T protein:vir:79 70 -RDAHLFAEMSK---RKRAILGL------------DWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDALDGIGHG 133 (526) T ss_pred -hChHHHHHHHH---HHHHHhCC------------CceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHHHhhhhhc Confidence 25555555443 22222221 121000001 1111111 122333334777899999988877 Q ss_pred ccccccccccccc---------cchhhhhhhhh-HHHHHHhchHHHhhhccCCCCcccHH-HHHHhhhcCccccccCcch Q lcl|NC_011269. 235 PTTAGGNMSTVEE---------TPSEREQRMRE-FQDLQRRYPEIIQAAMQNDGLDISEA-LISRVVNRPTAWATRGAPH 303 (867) Q Consensus 235 ~~~~~~~~~~~~~---------~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 303 (867) .|-.|- .|.+-..|..+ |+-....--++.-.-...+|++|+.. .|.|+ |+++.=.+.|..+ T Consensus 134 -------~s~~Ei~w~~~~g~~~~~~l~~r~~~~F~~~~~~~~~l~~~~~~~~g~~l~~~k~iv~~-~~~~~g~p~g~gL 205 (526) T protein:vir:79 134 -------YSCIELEWALQGREWMPLAFHHRPQSWFQLNPEDQNELRLRDNSPAGEALQPFGWIIHR-PRARSGYVARSGL 205 (526) T ss_pred -------ceeEEEEEeecCCceeEEEeeeecccceEeccCCCcEEEecCCCCCceeecCCceEEEe-ecCCcCCccccch Confidence 222221 11111111111 00000000011111124567888766 56664 6777777899999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeee Q lcl|NC_011269. 304 LLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVEN 383 (867) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 383 (867) +..|+..-+.|...-+--...+.|+-.|+|+.|.+. |. +++|.+.+-+.+.++ ..|-. +|.--|-.||. T Consensus 206 lr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~-----~a----~~~ek~~L~~av~~i-~~da~-~iiP~~~~ie~ 274 (526) T protein:vir:79 206 FRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPP-----GT----ADEEKATLLRAVTGL-GHAAA-GIIPETMAIDF 274 (526) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCC-----CC----CHHHHHHHHHHHHHH-hcCcE-EEecCCceeEE Confidence 999999999999888888899999999999999863 11 345555555544433 22322 22223455666 Q ss_pred ccccCccCch-hHHHHHHHHHHHHhhccchhhhcC---CCccceehhhhhHHHHHHHHHHHHHHHHHHHhh-hhHHHHHh Q lcl|NC_011269. 384 VFGRESVPNL-DADYDRIERKLLQAWGIGEALISG---GTGGAYASSALNREFVTQIMTGFQNALKRHIRR-RCEVVAEA 458 (867) Q Consensus 384 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~---g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~-~~~~i~e~ 458 (867) +.+...--.+ +.=++...++|..++ +|+-|+|. |.|++|+.+.|--+....+.-.-...|..++++ +++++..+ T Consensus 275 ~ea~~~~~~~f~~li~~~d~~Isk~i-LGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~ 353 (526) T protein:vir:79 275 QQAAQGSSEPFLAMMRQSEDAISKAV-LGGTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQLAATLSRDLLWPLLVL 353 (526) T ss_pred eecCCCCHHHHHHHHHHHHHHHHHHH-hhhhhccccccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 6543322222 344577788898888 99999983 456889999888888777788888999999965 77999999 Q ss_pred hcccchhe----ehhhccccc---hhhhhhhhhhhhhHhhhhhhhhhhhhccccc-----cccchhhhhhhhhhhhhcee Q lcl|NC_011269. 459 QGHYDYDL----KGGVRVPIY---REIVEYDEETGQEYIRKVPKLLIPEIKFSTL-----NLRDEAQERAFIAQLKGMGV 526 (867) Q Consensus 459 q~~~d~~~----~~~~~~~~~---rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~-----~Lr~e~~~~~~v~qL~~~~~ 526 (867) |......+ ++.+..... ....+..+++.....+....-+..+...... .|....+....-........ T Consensus 354 N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gip~~~~~e~~l~~~~~~~~~~~~~~~~~~ 433 (526) T protein:vir:79 354 NRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGIPQPAKNEPVLRPAAQPAILSRQHGQRVA 433 (526) T ss_pred CCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCcCCHHHHHHHhCCCCCCCchhhccccCCccccccccccccc Confidence 98655332 222222222 1122222222222221111112222221111 01000000000000000000 Q ss_pred eeeccccCC-Ccccccchhhhhh--------HH---HHHHHHhhcccccccccccccccCCCCCc-cccccccccccCCC Q lcl|NC_011269. 527 PVSDKTLAV-NIDMKFDQELERQ--------AD---ETVQKLMATAQAMKKVQDLCDAQNLPYPP-ELAQHLQSTLALRQ 593 (867) Q Consensus 527 pitd~t~p~-tiqme~E~e~e~k--------~~---E~l~tL~~taet~kkvq~~~p~~g~P~pp-~~aQ~p~~t~~~a~ 593 (867) ..+...... ..+..++.-+... .. +.+..+...+.+...+...-...-...+. .+..........+. T Consensus 434 ~~~~~~~~~~~~~~~~d~~l~~~~~~~~~~~~~~~~~~i~~~~~~~~s~ee~~~~L~~l~~~ld~~~l~~~l~~a~~~A~ 513 (526) T protein:vir:79 434 ALATIVGPRYGDQQALDKALADLPAKDMQNQANDLLAPLLDAVNRGDSETELLGALAEAFPDMDDSALTDALHRLLFAAD 513 (526) T ss_pred cccccccccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHH Confidence 000000000 0000111000000 00 11111111222222111110000000011 11111110000000 Q ss_pred CCC----CCCCCC Q lcl|NC_011269. 594 GKT----QTELGE 602 (867) Q Consensus 594 gpg----q~~~~q 602 (867) ..+ .....+ T Consensus 514 l~Gr~~~~~e~~~ 526 (526) T protein:vir:79 514 TWGRLHGNLDRID 526 (526) T ss_pred HhhhhhhhhcccC Confidence 000 001111 No 91 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=97.04 E-value=5.9e-05 Score=43.77 Aligned_cols=405 Identities=13% Similarity=0.117 Sum_probs=178.4 Q ss_pred CCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhccc---ccccceee--ccchhhhhhhhhHHhhCC Q lcl|NC_011269. 26 MPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQG---NFGSNMQI--AMPKIRQPLGTLADKGIP 100 (867) Q Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~~~~~~~~~~ 100 (867) ||.++-|..-.. +.++..+. +.++.-++ .....+-.-+...+.+.. T Consensus 1 ~~~~~~~~~~~~-----------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~ 51 (432) T protein:vir:10 1 MPDEKKLGLLGQ-----------------------------LKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAA 51 (432) T ss_pred CCCCcccchhhh-----------------------------hHhhcCCccccccccccccccCcchhhhhcccccccCcc Confidence 554444433211 11111111 00000000 000000000011122222 Q ss_pred CchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcccccceecc---cchhHHHH----HHHHhh---cccccHHHHhH Q lcl|NC_011269. 101 FNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDS---KDPLIKTF----YEDLFF---GEDLNYLEFLP 170 (867) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~----~~~~~~---~~~~~~~~~~~ 170 (867) .|.+. +-.++.|-.||++-++ -|..+.|.. ++++.++- ..+++. .+..+-.+|+. T Consensus 52 v~~~~--------------al~~~~V~~~i~~Ia~-~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~ 116 (432) T protein:vir:10 52 VNADA--------------IMRLDAVAACVKLVSQ-AIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQ 116 (432) T ss_pred cchhh--------------hhcchHHHHHHHHHHH-hhhhCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHH Confidence 22211 2234666777776654 233333221 11122221 111211 12345567887 Q ss_pred HHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccch Q lcl|NC_011269. 171 DQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPS 250 (867) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (867) .++ ..++.-|..+-+.+-| +|...++..|+|+.|.|...- .| .+ T Consensus 117 ~l~-~~lll~Gnay~~~~~~--~g~~~~L~~l~~~~v~v~~~~-------------------~g------~~-------- 160 (432) T protein:vir:10 117 VVV-TRLLLDGTAYVRKVVT--DGRIESLQYLANDRLTITTDT-------------------KG------NT-------- 160 (432) T ss_pred HHH-HHHhhcCCeEEEEEec--CCcEEEEEEEcCCceEEEEcC-------------------CC------cE-------- Confidence 755 7788889988777665 478889999999999886310 01 00 Q ss_pred hhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_011269. 251 EREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYS 330 (867) Q Consensus 251 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (867) . |+- ....|....++.+.|-|+.+-. .-.-.|.+.+.-+-.+|-.-....+....+.++-.. T Consensus 161 -------~----y~~------~~~~g~~~~~~~~~iih~~~~~-~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~ 222 (432) T protein:vir:10 161 -------A----YRY------RRTDGQMIDIPKQQIWKIMGYS-LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQL 222 (432) T ss_pred -------E----EEE------EecCceEEEEcCccEEEecCCC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 0 110 0113445567777888886532 112358888887777766554444444455555556 Q ss_pred hhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhcc Q lcl|NC_011269. 331 PLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGI 410 (867) Q Consensus 331 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 410 (867) |--+.++-. .-+.+..+++++.++...-+. ..+|-.-|++++.+...-+...+-.-.++..++|.+++|| T Consensus 223 ~~gil~~~~---------~l~~e~~~~~~~~~~~~~nag-~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgV 292 (432) T protein:vir:10 223 QSVYYQIDR---------FLTDDQYDSFAKKVSGSVEAG-RAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGV 292 (432) T ss_pred cceEEecCC---------CCCHHHHHHHHHHHhhhhhCC-CceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCC Confidence 766666542 236778889999887765554 4455556778777754333222223346778899999999 Q ss_pred chhhhcCCCcccee---hh-hhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhh Q lcl|NC_011269. 411 GEALISGGTGGAYA---SS-ALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETG 486 (867) Q Consensus 411 ~~~~~~~g~~~~~~---~~-~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~ 486 (867) .-.|+...+.++|+ +. +....|+..-+.-+...|++.|.+.+=.-.|. ..+.|+|-+..+..-|..+.-+-.. T Consensus 293 Pp~~lg~~~~~t~~~~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~---~~~~~~fd~~~ll~~d~~~r~~~~~ 369 (432) T protein:vir:10 293 PPSMIGHSSAGTTSWGSGIESQQLGFLSMTLSPWLRRIEQSIALNLLSPAER---RRYFADFDTSALLRADSAARSSYYS 369 (432) T ss_pred CHHHcCCccCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcccc---CceEEEeechhhhccCHHHHHHHHH Confidence 99999765555553 32 34445555556666666666665544111221 1233444333332222222111111 Q ss_pred hhHhhhh---hhhh-----hhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhh Q lcl|NC_011269. 487 QEYIRKV---PKLL-----IPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMA 558 (867) Q Consensus 487 k~~~r~~---~k~i-----~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~ 558 (867) + .+... ...+ ++.+.+..-.+ ......+.++.....-.......+ +-+ T Consensus 370 ~-~~~~G~~T~NE~R~~~glppi~g~~~~~----~~~~~~~pl~~~~~~~~~~~~~~~---~~~---------------- 425 (432) T protein:vir:10 370 Q-LVNNGLMTRDEAREIEGLPKLGGNAAVL----TVQSAMVPLDSIGLQASPEPASGL---GNQ---------------- 425 (432) T ss_pred H-HHhCCCCCHHHHHHHhCCCCCCCCcceE----eecCcccchhhhcccCCCCCCCCC---CCc---------------- Confidence 1 11110 0000 01111100000 000000000000000000000000 000 Q ss_pred cccccccccccccccCCCCCccccc Q lcl|NC_011269. 559 TAQAMKKVQDLCDAQNLPYPPELAQ 583 (867) Q Consensus 559 taet~kkvq~~~p~~g~P~pp~~aQ 583 (867) ..+. ..+ T Consensus 426 -------------~~~~-----~~~ 432 (432) T protein:vir:10 426 -------------QQDK-----VSK 432 (432) T ss_pred -------------cccc-----ccC Confidence 0000 000 No 92 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=96.81 E-value=5.9e-05 Score=43.77 Aligned_cols=325 Identities=14% Similarity=0.121 Sum_probs=158.2 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhcc Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATH 123 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 123 (867) ||-.- .++....+. ...+...-+.-++.+-| .||.++.+. +..+..|.--.|- | T Consensus 1 ~~~~~-------~~~~~~~~~----~~~~~~~~~~~~~f~~p---~~v~~~~~~-----------~~~~~~~~~~~~~-~ 54 (344) T protein:vir:20 1 MSKKK-------GKTPQPAAK----TMTASGPKMEAFTFGEP---VPVLDRRDI-----------LDYVECISNGRWY-E 54 (344) T ss_pred CCccc-------CCCCcchhh----hhhccCCceEEEEcCCc---eEecCcchh-----------hhhhhhhhcCcee-c Confidence 22110 011222221 33333433556666666 344443321 1222222111121 2 Q ss_pred chHH--HHHHhhhhccccc--ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehh Q lcl|NC_011269. 124 DLVP--LLIDIYSKFPVVG--MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSE 199 (867) Q Consensus 124 ~~~~--~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 199 (867) |-|. -|.++++.=+... +.+.. +-+...| . =.+.|+-.+| . .+...|+.-|+.+=+..-|.. |.+-++ T Consensus 55 pp~~~~~la~~~~a~~~h~~~i~~k~-n~l~~~~-~---Pn~~lt~~~f-~-~~~~d~ll~Gnay~~i~rn~~-G~~~~L 126 (344) T protein:vir:20 55 PPVSFTGLAKSLRAAVHHSSPIYVKR-NILASTF-I---PHPWLSQQDF-S-RFVLDFLVFGNAFLEKRYSTT-GKVIRL 126 (344) T ss_pred CCCCHHHHHHHHhhhhhhCccceehh-hhHHHhc-c---CCCCCCHHHH-H-HHHHHHHhcCCeEEEEEECCC-CcEEEE Confidence 2111 1123222222111 11110 0000100 0 0122333333 2 233567777887776665644 557789 Q ss_pred eecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCC Q lcl|NC_011269. 200 EILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGL 279 (867) Q Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 279 (867) +.|.+.++++... .+-.++ ....++.+ T Consensus 127 ~pl~~~~vr~~~~---~~~~~~--------------------------------------------------~~~~~~~~ 153 (344) T protein:vir:20 127 ETSPAKYTRRGVE---EDVYWW--------------------------------------------------VPSFNEPT 153 (344) T ss_pred EEcCCceeEeeec---CCEEEE--------------------------------------------------EccCCeEE Confidence 9999999988521 110000 01123334 Q ss_pred cccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHH Q lcl|NC_011269. 280 DISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVR 359 (867) Q Consensus 280 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 359 (867) .++..-|-||.+-.+...-.|.|-++-+.+++...++........-+.-..|=-|+++.+ . -.++++.+.+| T Consensus 154 ~~~~~eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d-----~---~l~~e~~~~ik 225 (344) T protein:vir:20 154 AFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD-----A---VQDRNDIEMLR 225 (344) T ss_pred EEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-----c---CCCHHHHHHHH Confidence 555566789887655566789999998888887655544444433333344555555532 1 24788999999 Q ss_pred HHHHHhhhcc-hhh-hhh-----hhheeeeeccccCccCchhHHHHHHH----HHHHHhhccchhhhcC--CCccceehh Q lcl|NC_011269. 360 DDMQSLLAAD-FRL-MVH-----NFGLKVENVFGRESVPNLDADYDRIE----RKLLQAWGIGEALISG--GTGGAYASS 426 (867) Q Consensus 360 ~~~~~~~~~~-~~~-~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~--g~~~~~~~~ 426 (867) +-|+..-... |+- +|+ .=|+++..++-. ..|++|.++| ++|+.++||.-.|+.- ..++.|+++ T Consensus 226 ~~~~~~~g~~n~r~l~l~~p~g~~~gi~~~pis~~----~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~ 301 (344) T protein:vir:20 226 ENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEV----ATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDI 301 (344) T ss_pred HHHHHhcCCCCccceEEecCCCCccceeEEEcCCC----hhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccH Confidence 9777654444 333 333 246777766543 3456665554 5799999999998831 234557764 Q ss_pred hh-hHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhc-ccchheehhhccccchhh Q lcl|NC_011269. 427 AL-NREFVTQIMTGFQNALKRHIRRRCEVVAEAQG-HYDYDLKGGVRVPIYREI 478 (867) Q Consensus 427 ~~-~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~-~~d~~~~~~~~~~~~rd~ 478 (867) .- ...|+..-++=++.+|+ ||++ .++++++|.+.+++..|= T Consensus 302 e~~~~~f~~~~l~P~~~~~e-----------~in~~lg~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 302 EKVAKVFVRNELIPLQDRIR-----------EINGWLGQEVIRFKNYSLDTDND 344 (344) T ss_pred HHHHHHHHHHHHHHHHHHHH-----------HHHHhcCCcccccCccccccCCC Confidence 42 22222222333333332 3443 455556666666655433 No 93 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=96.59 E-value=0.00048 Score=38.76 Aligned_cols=541 Identities=11% Similarity=0.094 Sum_probs=175.0 Q ss_pred CCCcc--------cccccchhHHHHHHHHhcCCCCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhc Q lcl|NC_011269. 1 MSSPI--------YKAGSNWSAEVNRLRKAGVNMPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRK 72 (867) Q Consensus 1 ~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (867) |+.-- .-.+++-.|-+ || -+++-.+.-.++-.--..+.+ +-|...|.+|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~---~~----~~~~~~~~~~~~~~~~~~~~p--------------~~~~~~L~~~~e 59 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADL---AK----SPNSTQIPDHRIQSHNVGVNP--------------PYNPDRLAAFLE 59 (651) T ss_pred CCCccceeeeeEEEeecccccccc---cc----cccccccchhhhcccCCCCCC--------------CCCHHHHHHHHh Confidence 33211 11111111111 00 011111111111000011111 113444444444 Q ss_pred ccccccceeeccchhhhhhhhhHH--hhCCC--------chhhhHHHHHHHHHHHHHh----------------hccchH Q lcl|NC_011269. 73 QGNFGSNMQIAMPKIRQPLGTLAD--KGIPF--------NVEDEEELRVIRHWCRLFY----------------ATHDLV 126 (867) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~--------~~~~~~~~~~~~~~~~~~~----------------~~~~~~ 126 (867) .+ |-+++=+...++ .++.+ +-+++.+.+.=++|-+.+. -.+.++ T Consensus 60 ~~----------~~~~~~i~~~~~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~ 129 (651) T protein:vir:99 60 LN----------ETLATGIRKKSRYEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVK 129 (651) T ss_pred cC----------hHHHHHHHHHhhhhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHH Confidence 32 223333333222 11111 1111111111112211110 000111 Q ss_pred HH-----------HHHhhhh----------cccccceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhh--hhh Q lcl|NC_011269. 127 PL-----------LIDIYSK----------FPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTV--GEV 183 (867) Q Consensus 127 ~~-----------~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 183 (867) .+ ||.+.+. -+...+....++..++.....++- .+.|....-.+ | ++||.. +-. T Consensus 130 ~~~~~Dle~tGna~ieiIrn~~g~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~-~~pn~~~~~~~-~-~~~~q~~~~~~ 206 (651) T protein:vir:99 130 ELARQDYHGVGWLALEMLTDIEGRPVGLAYVPARTVRVRRPQNRFDQPRHPEEG-RYVDGDVADIA-S-RGYVQIRNGNR 206 (651) T ss_pred HHHHHHHHHHhhHhhhhhhcCccchhhhhhcChhheeeecccccccchhhhhhh-cccccccchhH-H-HHHHHHHhcCc Confidence 11 2222221 122223444444444443333322 23332211111 1 222210 000 Q ss_pred cchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHH Q lcl|NC_011269. 184 TSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQ 263 (867) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 263 (867) +-|..|-...+.......++++-+.+. +..+..+.. .+..+. T Consensus 207 ~~~~~~g~~~~~~~~~~~~~~~~v~~~---~~~d~~~~~-----------------------------------~~~~~~ 248 (651) T protein:vir:99 207 RYFGEAGDRYRGQEVVIDESGDEPTIR---YREDEESER-----------------------------------EPIFVD 248 (651) T ss_pred ceEEEeeccccceeeeeccCCcceeEE---eccCcceee-----------------------------------eeeccc Confidence 001111111111111222233222222 111111110 000000 Q ss_pred HhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccC Q lcl|NC_011269. 264 RRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMG 343 (867) Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 343 (867) ..+-++ .....+....++..-|-|+.+..+--.-.|.|.+..+..+|....+.........+.-..|--++++-+ T Consensus 249 ~~~g~~-~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~~~~---- 323 (651) T protein:vir:99 249 RETGDV-TTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDTIPRMVIKVTG---- 323 (651) T ss_pred ceeeeE-EEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC---- Confidence 011110 111223344566667888876544444579999999999998877777777777777667777777631 Q ss_pred CCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhh-----------hheeeeeccccCccCchhHHH----HHHHHHHHHhh Q lcl|NC_011269. 344 DGEPWIPDQGELDEVRDDMQSLLAADFRLMVHN-----------FGLKVENVFGRESVPNLDADY----DRIERKLLQAW 408 (867) Q Consensus 344 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~ 408 (867) -+.+++..+.+|+.|+......++.+|-. .|++.+.... ++.-|.+| ++..++|.+++ T Consensus 324 ----~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~---~~~~D~qfle~r~~~~~eIa~af 396 (651) T protein:vir:99 324 ----GELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQ---GISEEMDFRQFREKNEHEIAKVL 396 (651) T ss_pred ----CCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCc---CchhhHHHHHHHHHHHHHHHHHh Confidence 14699999999999998776665554422 2333333321 11224455 34456799999 Q ss_pred ccchhhhcCCCccceehhh-hhHHHHHHHHHHHHHHHHHHHhhhh-HHHHHhhcccchheehhhcc--ccchhhhhhhhh Q lcl|NC_011269. 409 GIGEALISGGTGGAYASSA-LNREFVTQIMTGFQNALKRHIRRRC-EVVAEAQGHYDYDLKGGVRV--PIYREIVEYDEE 484 (867) Q Consensus 409 ~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~r~~~-~~i~e~q~~~d~~~~~~~~~--~~~rd~~~~k~e 484 (867) ||.-.++.-.+.++|+++. ....|+.+-+.-++.+|+++|.+.+ ..-..+++ ..+.+-|++ +.-.|....-+ T Consensus 397 gVPp~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~---~~i~~ef~~~~llr~D~~~~~e- 472 (651) T protein:vir:99 397 EVPPVKIGVTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVTD---WTIEYELRGADQPKQEAQLAEQ- 472 (651) T ss_pred CCCHHHhccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccC---ceEEEEeccchhhhccHHHHHH- Confidence 9999999766778899865 4445666668888888888887665 22222222 223333433 22223222111 Q ss_pred hhhhHhhh------hhhhh--hhhhcc--ccccccchhhhhhhhhhhhhceeeeeccccCCCcccc---------cchhh Q lcl|NC_011269. 485 TGQEYIRK------VPKLL--IPEIKF--STLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMK---------FDQEL 545 (867) Q Consensus 485 ~~k~~~r~------~~k~i--~~~i~~--~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme---------~E~e~ 545 (867) ..+..++. +...+ ++.+.. ....|.. + ......+...+..-... .+.+. T Consensus 473 ~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~----------~--~~~~~g~~~~gge~~~~~~~~~~~~~~~~e~ 540 (651) T protein:vir:99 473 RVRAMRLAGVGLVDEAREELGLDPLGEPYGEMTLSE----------F--EAEVAGDVAGGGETEAVHEPPEENKIGEREW 540 (651) T ss_pred HHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccc----------c--ccccccccccCCCCcccccCccccccccchh Confidence 11111111 10000 111110 0011100 0 00001111111100000 00000 Q ss_pred hhhHH-----HHHHHHhhcccccccccccccccCCC--C-----CccccccccccccCCCCCCCCCCCCCCCCccCCC-- Q lcl|NC_011269. 546 ERQAD-----ETVQKLMATAQAMKKVQDLCDAQNLP--Y-----PPELAQHLQSTLALRQGKTQTELGEAQAVAGEAQ-- 611 (867) Q Consensus 546 e~k~~-----E~l~tL~~taet~kkvq~~~p~~g~P--~-----pp~~aQ~p~~t~~~a~gpgq~~~~qa~~~agq~~-- 611 (867) +.... +..+.+...+.....+...+....+. + .....+....+...= ...-.+. ..+.-. T Consensus 541 ~~~~~~~~~~e~~~~~~v~ss~~~~~gyd~~~~~l~~~f~~~~~~~~~y~y~~v~~~~~-----~~~~~a~-s~g~~~~~ 614 (651) T protein:vir:99 541 DTVKSELTTKDPIEQMQFSSSNLDEGLYDFGENELYLSFLRDEGQSSLYAYVDVPASEW-----SALANAG-SHGGYHYD 614 (651) T ss_pred hhhhhhhcccchhhhhhHHHHHHHhhcCCCccceEEEEEeecCCCCceeeeeCCCHHHH-----HHHhcCc-ccceeehh Confidence 00000 00000000000001111111110000 0 000001000000000 0000000 000000 Q ss_pred -CccCCCCCCCccCccCcCCCCCCCCCCCCCcccccccccCccC Q lcl|NC_011269. 612 -AELQTKQIEMQEMMMDQQMAGGVMPGQPMLPPGAPGDPAAGGP 654 (867) Q Consensus 612 -~p~~~~~~~~qp~~~~qg~pG~~gPpGP~gPpG~pG~pgP~gP 654 (867) -...=+-..-. ..-+.- |-||.+..|.-+..-|.-- T Consensus 615 ~i~~~~~~~~~~-~~~~~~------~~~~~~~~~~~~~~~~~~~ 651 (651) T protein:vir:99 615 NIRLEYPYLEIT-NFHDRL------PEGPAPDAGDVPDGVPDEI 651 (651) T ss_pred ccccccchhhhh-hhhhhC------CCCCCCCcCCCCCCCcccC Confidence 00000000000 000000 0000000000000000000 No 94 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=96.54 E-value=0.00011 Score=42.32 Aligned_cols=325 Identities=12% Similarity=0.096 Sum_probs=156.7 Q ss_pred HHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccc--hH Q lcl|NC_011269. 49 LIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHD--LV 126 (867) Q Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 126 (867) |...+. ++..+.+ ++-.+...-+.-++.+.|. ||.+..+ .-| .+..|.--.|-+-| +. T Consensus 1 m~~~~~--~~~~~~~----~~~~~~~~~~~~~~f~~p~---~v~~~~~------~~~-----~~~~~~~~~~~~pp~~~~ 60 (344) T protein:vir:60 1 MSKKKG--KTLQPAA----KKMTASAPKMEAFTFGEPV---PVLDRRD------ILD-----YVECISNGRWYEPPISFT 60 (344) T ss_pred CCcccC--CCCCchH----HhhcCCcCcEEEEEcCCce---eecCCcc------hhH-----HHHhhhcCccccCCCCHH Confidence 111111 0111111 1222333334455666664 3322221 111 11111111111111 11 Q ss_pred HHHHHhhhhccccc--ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCc Q lcl|NC_011269. 127 PLLIDIYSKFPVVG--MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNP 204 (867) Q Consensus 127 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 204 (867) . |..+++-=+... +.+. .+-+...| . =.+.|.-.+| . .+...|+.-|..+-+..-|..+. .-++..|.| T Consensus 61 ~-la~~~~a~~~h~~~i~~k-~n~l~~~~-~---Pn~~~t~~~f-~-~~~~d~ll~Gnay~~i~rn~~G~-~~~L~~l~~ 131 (344) T protein:vir:60 61 G-LAKSLRAAVHHSSPIYVK-RNILASTF-I---PHPWLSQQDF-S-RFVLDFLVFGNAFLEKRYSTTGK-VIRLETSPA 131 (344) T ss_pred H-HHHHHHhhhhhccchhhh-hhHHHhhc-c---CCCCCCHHHH-H-HHHHHHHhcCCeEEEEEECCCCc-EEEEEEcCc Confidence 1 112211111111 1110 00010100 0 0223444444 3 24467777798888777776655 456888899 Q ss_pred ceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHH Q lcl|NC_011269. 205 DMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEA 284 (867) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (867) ++|++.+. .+ + .| ..-..|+.+.++.. T Consensus 132 ~~vr~~~~---~~-~--------------------------------------~~-----------~v~~~~~~~~~~~~ 158 (344) T protein:vir:60 132 KYTRRGVE---ED-V--------------------------------------YW-----------WVPSFNEPTAFAPG 158 (344) T ss_pred ceEEEeec---CC-e--------------------------------------EE-----------EEccCCeEEEEcCc Confidence 99988521 00 0 00 00112334455566 Q ss_pred HHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHH Q lcl|NC_011269. 285 LISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQS 364 (867) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 364 (867) -|-||.+-.+...-.|.|-++-+.+++...++........-+.-..|=-++++-+ . --++++.+.+|.-++. T Consensus 159 eIiHir~~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~-----~---~ls~e~~~~ik~~~~~ 230 (344) T protein:vir:60 159 SVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD-----A---VQDRNDIEMLRENMVK 230 (344) T ss_pred cEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-----c---CCCHHHHHHHHHHHHH Confidence 6789987665566689999988888887655544433333333333555555421 1 2477888899987776 Q ss_pred hhhcc-hhh-hhh-----hhheeeeeccccCccCchhHHHHHHH----HHHHHhhccchhhhc--CCCccceehhh-hhH Q lcl|NC_011269. 365 LLAAD-FRL-MVH-----NFGLKVENVFGRESVPNLDADYDRIE----RKLLQAWGIGEALIS--GGTGGAYASSA-LNR 430 (867) Q Consensus 365 ~~~~~-~~~-~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~--~g~~~~~~~~~-~~~ 430 (867) .-... |+- +|+ .=|+++.-++-. ..|++|.++| ++|+.++||.-.|+. ...++.|+|+. ... T Consensus 231 ~~g~~~~r~~~l~~p~g~~~g~~~~pis~~----~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~ 306 (344) T protein:vir:60 231 SKGRNNFKNLFLYAPQGKADGIKIIPLSEV----ATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAK 306 (344) T ss_pred hcCCCCCcceEEecCCCCccceeEEEcCCC----hhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHH Confidence 55444 333 333 235666665443 3355655554 579999999998872 23345687753 333 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhHHHHHhhc-ccchheehhhccccchhh Q lcl|NC_011269. 431 EFVTQIMTGFQNALKRHIRRRCEVVAEAQG-HYDYDLKGGVRVPIYREI 478 (867) Q Consensus 431 ~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~-~~d~~~~~~~~~~~~rd~ 478 (867) .|+..-++=++.+|+ ||++ .++..+.|...+++-.|. T Consensus 307 ~f~~~~L~Pl~~~~e-----------~ln~~lg~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 307 VFVRNELIPLQDRIR-----------EINGWLGQEVIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHHHHHHH-----------HHHHhcCCcccccCccccCCCCC Confidence 333333333333332 3443 355667777777777777 No 95 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=96.47 E-value=0.00024 Score=40.39 Aligned_cols=347 Identities=11% Similarity=0.030 Sum_probs=157.0 Q ss_pred CCchH---HHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHH-HHHHHHHHH Q lcl|NC_011269. 44 VDNKP---LIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELR-VIRHWCRLF 119 (867) Q Consensus 44 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 119 (867) ||-+- .+....++.+.......--++- +...-+.-++.+-|- .|-++.+|- -+.-|---= T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~fg~p~---------------~~~~~~~~~~~~~~~~~~~ 64 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHT-DRAAQAEVFSFGDPV---------------EVLDRRELLDYVECMRMGQ 64 (368) T ss_pred CCccccccchhccCcccccccccCcchhhc-cccCceEEEEcCCce---------------eecchhhHHHHHHHHhccc Confidence 22211 0000000001111111000111 111112234444332 222222111 111121111 Q ss_pred hhccchHHH-HHHhhhhcccccceecccchhHHHHHHHHhh--cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccce Q lcl|NC_011269. 120 YATHDLVPL-LIDIYSKFPVVGMEFDSKDPLIKTFYEDLFF--GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVW 196 (867) Q Consensus 120 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 196 (867) |.+-|+-.. |..+++.=+-.+-.+-.+-+.+. ..| .+.+.-.+|. - +-..|+.-|+.+=....|. .|.+ T Consensus 65 ~~~~pi~~~~la~~~~~~~~h~~~~~~~~n~l~-----l~~~Pn~~~t~~~f~-~-l~~d~ll~Gnay~~~~r~~-~G~~ 136 (368) T protein:vir:79 65 WYEPPMPWDGLARSFRAAAHHSSAVYVKRNILV-----STFIPHPLLSRATFE-R-LVLDWQVFGNAYLERRENV-LGGT 136 (368) T ss_pred hhccCcCHHHHHHHHhhccccchhhhhhcchhh-----hhcCCCcCCCHHHHH-H-HHHHHhhcCCeEEEEEEcC-CCCE Confidence 222222222 23444433322211111222221 111 1224444553 2 3367888899988887776 4667 Q ss_pred ehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccC Q lcl|NC_011269. 197 SSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQN 276 (867) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (867) -+++.|+|++|++... +.+ |. .....+ T Consensus 137 ~~L~~l~~~~v~~~~~----~~~----------------------------------------------~~---~~~~~~ 163 (368) T protein:vir:79 137 IRLDTPLAKYVRRGLD----LNT----------------------------------------------YF---FVQNWQ 163 (368) T ss_pred EEEEEeCcccceeecc----CCE----------------------------------------------EE---EEecCC Confidence 7888999999987411 000 00 011123 Q ss_pred CCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHH Q lcl|NC_011269. 277 DGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELD 356 (867) Q Consensus 277 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 356 (867) ....++..-|-||.+-.+--.-.|.|-++-+.+++...++-.......-+.-..|--++++-+ + ..++++.+ T Consensus 164 ~~~~~~~~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~-------~-~l~~e~~~ 235 (368) T protein:vir:79 164 QPYTFAAGSVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTD-------A-AQKQEDVD 235 (368) T ss_pred eEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-------C-CCCHHHHH Confidence 444566667889986654445589999999888888766655554444444444544444431 1 35888999 Q ss_pred HHHHHHHHhhhcch--hhhhh-----hhheeeeeccccCccCchhHHHHHHH----HHHHHhhccchhhhcC--CCccce Q lcl|NC_011269. 357 EVRDDMQSLLAADF--RLMVH-----NFGLKVENVFGRESVPNLDADYDRIE----RKLLQAWGIGEALISG--GTGGAY 423 (867) Q Consensus 357 ~~~~~~~~~~~~~~--~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~--g~~~~~ 423 (867) .+|..|+..--+++ .++|+ .=|++++-++-. ..|++|.++| ++|+.+.||.-.|+.- +..++| T Consensus 236 ~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~----~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~ 311 (368) T protein:vir:79 236 TLREAMKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEV----AAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGF 311 (368) T ss_pred HHHHHHHHhcCCcccCceeEecCCCCccceeEEEcCCC----HHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCCcc Confidence 99998877554553 35555 457777777543 3466665554 5799999999888832 223447 Q ss_pred ehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccc-hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhc Q lcl|NC_011269. 424 ASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYD-YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIK 502 (867) Q Consensus 424 ~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d-~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~ 502 (867) +++.--+ ..=++..| +-.++.+.||+++.. ..++|..-.| +.. |.+. T Consensus 312 sn~e~~~------~~f~~~~l----~Pl~~~ie~ln~~l~~e~~rF~~~~l-----~~~--------D~~a--------- 359 (368) T protein:vir:79 312 GDVEKAA------MVFARNEV----KPLQDRLLAINDWIGDEVVRFAPYAL-----GGH--------DQPA--------- 359 (368) T ss_pred ccHHHHH------HHHHHHHH----HHHHHHHHHHHhccCcceeeechhHh-----hcc--------cccc--------- Confidence 6654322 11122233 333333345555432 1111111111 000 1111 Q ss_pred cccccccchhhh Q lcl|NC_011269. 503 FSTLNLRDEAQE 514 (867) Q Consensus 503 ~~~~~Lr~e~~~ 514 (867) ...++++.+ T Consensus 360 ---~a~~~~rsa 368 (368) T protein:vir:79 360 ---AAPGGQRSA 368 (368) T ss_pred ---cCCcccccC Confidence 011122222 No 96 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=96.46 E-value=0.00044 Score=38.97 Aligned_cols=397 Identities=10% Similarity=0.048 Sum_probs=178.5 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcc Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFP 137 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 137 (867) -|+-+.+-. +++. -...+.+ .-.+.......-.+. ++.....|| .++-|-.|||+-++ . T Consensus 1 Mg~~~~~~~----~~~~------~~~~~~~-~~~~~~~~~~~~~~~----~~~~~~~~~-----~~~~v~~~i~~ia~-~ 59 (423) T protein:vir:81 1 MGFLQKLGL----APSV------VATPEPI-ELVGPIFESLKLSTK----NMTVEQIWE-----DQPHLRTVTTFIAR-N 59 (423) T ss_pred CchhHhhcc----cccc------ccCcccc-ccccccccccccccc----hhhHHHHHH-----hhhHHHHHHHHHHH-h Confidence 222222200 1100 0011111 001110011111111 111222254 56778899988766 4 Q ss_pred cccceecc----cc---hhHHHHHHHHhhc---ccccHHHHhHHHHHHHHhhhhhhcchhhhhhhcccee--hheecCcc Q lcl|NC_011269. 138 VVGMEFDS----KD---PLIKTFYEDLFFG---EDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWS--SEEILNPD 205 (867) Q Consensus 138 ~~~~~~~~----~~---~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 205 (867) |..+.|.. +| +.++.-....++. +..+-.+|+..++ ..++.-|+.+-+-.-+. ++... .+.-+++. T Consensus 60 ia~lp~~~~~~~~dg~~~~~~~~~~~~ll~~PN~~~t~~~f~~~~~-~~l~l~Gna~~~i~rd~-~~~~~~~~l~p~~~~ 137 (423) T protein:vir:81 60 VASLQLQAFERVEDGGRERVREGHLARVCKLANSDMTMYDLLERTM-FDLCLYDEFFWLLPGDL-GVDTPTLDIRPIPVS 137 (423) T ss_pred HhhCceEEEEEecCCceeeeccchHHHHhhcCCCCCCHHHHHHHHH-HHHhhcCCeEEEEEecC-CcCcceEEEeecccc Confidence 44443321 12 2344433333331 2244678888866 78888899887765442 22222 23334444 Q ss_pred eeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHH Q lcl|NC_011269. 206 MLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEAL 285 (867) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (867) .+.+.. +. +. .| .+ .| +|-. ..-..|.-..++..- T Consensus 138 ~v~~~~--~~-~~--------------~~------~~---------------~Y-----~~~~--~~~~~g~~~~~~~~e 172 (423) T protein:vir:81 138 WVQRRA--YK-DG--------------WG------SL---------------DY-----IIIE--SGDNDGRSVKVPGER 172 (423) T ss_pred eeeeee--cc-CC--------------Cc------ce---------------EE-----EEEE--ecCCCceEEEEcccc Confidence 444321 00 00 00 00 01 0100 111234446677778 Q ss_pred HHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHh Q lcl|NC_011269. 286 ISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSL 365 (867) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 365 (867) |-|+++-...---.|.+.+..+-.+|..............+.-..|--++++-.. +..++ -+++..+.+++.++.. T Consensus 173 vih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~-~~~~~---l~~e~~~~~~~~~~~~ 248 (423) T protein:vir:81 173 VIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPE-SKAGK---WDAESRTRFMANLRAS 248 (423) T ss_pred eEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCc-ccCcc---CCHHHHHHHHHHHHHH Confidence 8898755443334699988888888877766666666667777777656655421 12222 2678888888887766 Q ss_pred hh--cc--hhhhhhhhheeeeeccccCccCchhHHH----HHHHHHHHHhhccchhhhcCCCccceehh-hhhHHHHHHH Q lcl|NC_011269. 366 LA--AD--FRLMVHNFGLKVENVFGRESVPNLDADY----DRIERKLLQAWGIGEALISGGTGGAYASS-ALNREFVTQI 436 (867) Q Consensus 366 ~~--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~ 436 (867) +. ++ -..+|-.-|++++-... -+.|.++ ++..++|.+++||.-.|+.-.++++|+++ +....|+..- T Consensus 249 ~~~~~~n~g~~~vl~~g~~~~~l~~----s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~ 324 (423) T protein:vir:81 249 FSPKSSDVGGTLLLEDGMKAENFHT----TSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREFRKALYGDN 324 (423) T ss_pred hccccccCCcceecCCCceEEeccC----ChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHH Confidence 52 22 24556566777766642 2344333 66778899999999999965567889884 3455566666 Q ss_pred HHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhh--h--hhhhccccccccchh Q lcl|NC_011269. 437 MTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKL--L--IPEIKFSTLNLRDEA 512 (867) Q Consensus 437 ~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~--i--~~~i~~~~~~Lr~e~ 512 (867) ++-+...|++++.+.+-.-.|+. ...+.++| +..+++..+.|..-+...+.+.. | ..|+.-- +.+.... T Consensus 325 L~P~~~~ie~~l~~~L~~~~~~~-~~~~~~~f-----d~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~-~gl~p~~ 397 (423) T protein:vir:81 325 LGSWIRIIQDVMNLFLLPRVGID-NEKFYFEF-----NLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAM-DNLPSID 397 (423) T ss_pred HHHHHHHHHHHHhhhhcCccccc-cCccEEEe-----cchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHH-hCCCCCC Confidence 66677777777766652222211 11122333 22223221111101111111110 1 0111100 0010000 Q ss_pred hhhhhhhhhhhceeeeeccccCCC-cccccchhhhh Q lcl|NC_011269. 513 QERAFIAQLKGMGVPVSDKTLAVN-IDMKFDQELER 547 (867) Q Consensus 513 ~~~~~v~qL~~~~~pitd~t~p~t-iqme~E~e~e~ 547 (867) .....+.. . .+.+.. ...+.| +.++ T Consensus 398 gGD~~~~p-------~--n~~~~~~~~~~~~-~~~t 423 (423) T protein:vir:81 398 GGDDLARP-------L--NTEFGDSEDAPGE-EVET 423 (423) T ss_pred Ccceeecc-------c--ccccCccCCCCCC-CCCC Confidence 00000000 0 000000 000000 0011 No 97 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=96.45 E-value=0.00032 Score=39.78 Aligned_cols=373 Identities=13% Similarity=0.058 Sum_probs=153.5 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHh------hccchHHHHHH Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFY------ATHDLVPLLID 131 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~ 131 (867) -||-.. |+.-++..+. +..| ..+.-....|+.-.+ -.++.|-.||| T Consensus 1 MGl~~~-----------------------~~~~~~~~~~---~~~~--~~~~~~~~~~~~~~~vt~~~al~~~~v~~~i~ 52 (394) T protein:vir:62 1 MGLRDR-----------------------FSNYLFKKAE---KRGY--LDNVLGKSIRYSGVYVTDSNILQSSDVYELLQ 52 (394) T ss_pred Cchhhh-----------------------hhhhccCCCC---chhh--hhhhhhcccccCccccChhhhhccHHHHHHHH Confidence 111100 0000000000 0000 000001111211111 23467788888 Q ss_pred hhhh-cccccceecccc-hhHHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcce Q lcl|NC_011269. 132 IYSK-FPVVGMEFDSKD-PLIKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDM 206 (867) Q Consensus 132 ~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (867) +-++ ..-..+++..+| ..++....+.++ .+..+-.+|+..++ ..|+.-|+++=+.. ....+.|....+ T Consensus 53 ~Ia~~iA~lp~~v~~~~g~~~~~~~~~~Ll~~PN~~~t~~~f~~~~~-~~lll~Gn~~~~i~-~~~~~~~~~~~~----- 125 (394) T protein:vir:62 53 DISNQMVLADIVVEDEFGNEIKDDIALQILRNPNNYLTQSEFIKLMT-NTYLLEGETFPILN-GAQIHLASNVFT----- 125 (394) T ss_pred HHHHhhcccceEEEcCCCcccchhhHHHHhccCCCCCCHHHHHHHHH-HHHHhcCCeEEEEe-cceeeccccceE----- Confidence 7653 333334433332 234444444444 12233446777655 66777777655432 112222322111 Q ss_pred eehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHH Q lcl|NC_011269. 207 LRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALI 286 (867) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 286 (867) .++ +. | . ..| ...+..++...| T Consensus 126 -~~~-------~~--------------~-----~----------------~~~---------------~~~~~~~~~~ei 147 (394) T protein:vir:62 126 -ELD-------DN--------------L-----V----------------EHF---------------NIGGHEIPPCMI 147 (394) T ss_pred -EEC-------Cc--------------e-----E----------------EEE---------------eeCCEEechhhe Confidence 110 00 0 0 000 112344566677 Q ss_pred HHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhh Q lcl|NC_011269. 287 SRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLL 366 (867) Q Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (867) .|+.+-. .=--.|.+.+.-+..+|-...........+.+.-..|=-++++.. . .=++++..+.+|+.|+..+ T Consensus 148 ih~r~~~-~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~----~---~~~~~~~~~~~~~~~~~~~ 219 (394) T protein:vir:62 148 RHVKNIG-ADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDA----H---INPQNGAQSKLINAILDQL 219 (394) T ss_pred EEecCcC-CCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCC----C---CCcCHHHHHHHHHHHHHHh Confidence 8876432 222378898888888887776666666667777667766777652 1 1356777888888777765 Q ss_pred hc-ch--hhh--hhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehh-hhhHHHHHHHHHHH Q lcl|NC_011269. 367 AA-DF--RLM--VHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASS-ALNREFVTQIMTGF 440 (867) Q Consensus 367 ~~-~~--~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~~~ 440 (867) .. ++ ..+ ...-+++++.+...-...-+-.-.++..++|.+++||.-.++.+++. +++ +....|+..-+.=+ T Consensus 220 ~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~---sn~e~~~~~~~~~~l~P~ 296 (394) T protein:vir:62 220 ESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELIK---EDIEKAMMYIHNKAVRPI 296 (394) T ss_pred ccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC---cCHHHHHHHHHHHHHHHH Confidence 43 32 222 23333444433221111111122345678899999999999975444 332 34455555556666 Q ss_pred HHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhh--hhhHhhhhhhhhhhhh--ccccccccchhhhhh Q lcl|NC_011269. 441 QNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEET--GQEYIRKVPKLLIPEI--KFSTLNLRDEAQERA 516 (867) Q Consensus 441 ~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~--~k~~~r~~~k~i~~~i--~~~~~~Lr~e~~~~~ 516 (867) ...|++++.+.+-...+ + ..+.+.|+.. +.++.+.+. ++..+..... -..++ .+...-+.+ ..+ T Consensus 297 ~~~ie~~l~~kll~~~~--~---~~~~~~fd~~---~~~~~~~~~~~~~~~~~~g~~-T~NE~R~~~gl~p~~~---~~g 364 (394) T protein:vir:62 297 MKNFEDHLSLLFYAQNS--G---KRIKFKINIL---DFVTYSNKTNIGYNLVRTAIT-SPDNVADMLGFPKQNT---KES 364 (394) T ss_pred HHHHHHHHhhhhcCccc--c---CceEEEechh---hhcCHHHHHHHHHHHHhCCCc-CHHHHHHHhCCCCCCC---CCC Confidence 66666666654411111 1 1233333333 233333221 1111111000 01111 111111111 111 Q ss_pred hhhhhhhceeeeeccccCCCcccccchhhhh Q lcl|NC_011269. 517 FIAQLKGMGVPVSDKTLAVNIDMKFDQELER 547 (867) Q Consensus 517 ~v~qL~~~~~pitd~t~p~tiqme~E~e~e~ 547 (867) .........+++..... ....+|-..+-+. T Consensus 365 d~~~~~~n~~~~~~~~~-~~~~~kgge~~en 394 (394) T protein:vir:62 365 QAIYISNDVTEIGKKEA-TDGSLGGGEENEN 394 (394) T ss_pred Ceeeccccccccccccc-ccccCCCCCCCCC Confidence 11110011122221111 0111111111111 No 98 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=96.43 E-value=0.00024 Score=40.42 Aligned_cols=323 Identities=14% Similarity=0.118 Sum_probs=146.6 Q ss_pred CCCcccccccchhHHHHHHHHhcCCCCCCchhhHHhhhh---------hccc-CCchHHHHHHHHhhhcc----hhHHHH Q lcl|NC_011269. 1 MSSPIYKAGSNWSAEVNRLRKAGVNMPNSPTMARAQAAA---------LQNT-VDNKPLIDYFQGRRRAA----EANRQR 66 (867) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~-~~~~~~~~~~~~~~~~~----~~~~~~ 66 (867) ||.--+...+.-.+ .+.++-...++ .+-. ++...+.+|+.-..+|- +-++.- T Consensus 1 ~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~ 66 (351) T protein:vir:79 1 MSKRRSRAPRTFAA--------------APNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAG 66 (351) T ss_pred CCCCCCCCCCCCCC--------------CCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHH Confidence 66544432211111 11111111111 1111 22223455554444432 223333 Q ss_pred HHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcccccceeccc Q lcl|NC_011269. 67 LASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSK 146 (867) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (867) |+++-+.. ..-.--|.+.+.+.-.-|.-+| T Consensus 67 la~~~~~~----------------------------~~h~~~l~~k~n~l~~~~~Pnp---------------------- 96 (351) T protein:vir:79 67 LAKSFRAS----------------------------THHSSALFFKANVLASTFRPHR---------------------- 96 (351) T ss_pred HHHHHhhh----------------------------HhhhhhhhhhhhHHhhcccCCC---------------------- Confidence 33332221 1000001110001000011111 Q ss_pred chhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHH Q lcl|NC_011269. 147 DPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKD 226 (867) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (867) .|.-.+| .- +...|+.-|+.+=+...|.. |.+-++..|.|+++++.... .. T Consensus 97 ---------------~~t~~~f-~~-~v~d~ll~Gnay~~~~r~~~-G~~~~L~~l~~~~v~~~~~~----~~------- 147 (351) T protein:vir:79 97 ---------------WLSRHAF-ER-WALDFLTFGNGYLERRRNMV-GGTLRLEPALAKYVRRKADF----SG------- 147 (351) T ss_pred ---------------CCCHHHH-HH-HHHHHHhcCCeEEEEEECCC-CCEEEEEEeCCcceeeeecC----Ce------- Confidence 1222222 11 22455566887777777754 45668888999998874110 00 Q ss_pred HHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhH Q lcl|NC_011269. 227 LVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLR 306 (867) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 306 (867) | +.....++-+.++..-|-|+.+-.+.-.-.|.|-++- T Consensus 148 --------------------------------~----------~~~~~~g~~~~~~~~eIihir~~~~~~~~yGl~~~~~ 185 (351) T protein:vir:79 148 --------------------------------F----------VYVNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLS 185 (351) T ss_pred --------------------------------E----------EEEecCceEEEEcCccEEEeCCCCCCCCcccccHHHH Confidence 0 0001123334455566778876554445679999998 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchh--hhhh-----hhhe Q lcl|NC_011269. 307 SFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFR--LMVH-----NFGL 379 (867) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-----~~~~ 379 (867) +.+++...++........-+.-..|=-++++- + + ..++++.+.+|+-|+..--.+++ ++|+ .=|+ T Consensus 186 a~~si~l~~~a~~~~~~~f~NGa~pg~il~~~-----~--~-~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi 257 (351) T protein:vir:79 186 SLHSAWLNESSTLFRRKYYENGSHAGFILYMT-----D--A-AQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGI 257 (351) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEec-----C--C-CCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccce Confidence 88888765544443333333333443333332 1 1 34888999999987765445533 4444 3456 Q ss_pred eeeeccccCccCchhHHHHHHH----HHHHHhhccchhhhcC--CCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhH Q lcl|NC_011269. 380 KVENVFGRESVPNLDADYDRIE----RKLLQAWGIGEALISG--GTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCE 453 (867) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~--g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~ 453 (867) ++.-++-. ..|++|.++| ++|+.++||.-.|+.= ..++.|+++.--+ ..=+++.|.-+++ T Consensus 258 ~~~pl~~~----~~d~ef~e~k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~~~------~~f~~~~l~Pl~~---- 323 (351) T protein:vir:79 258 QLIPVSEV----AAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAA------RVFGRNEIRPLQA---- 323 (351) T ss_pred EEEEcCCC----hhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHH------HHHHHHHHHHHHH---- Confidence 66655543 3566666665 4699999999999821 2234576654322 2222333333332 Q ss_pred HHHHhhcccchh-eehhhccccchhhhhhhhhhhhhHhhhh Q lcl|NC_011269. 454 VVAEAQGHYDYD-LKGGVRVPIYREIVEYDEETGQEYIRKV 493 (867) Q Consensus 454 ~i~e~q~~~d~~-~~~~~~~~~~rd~~~~k~e~~k~~~r~~ 493 (867) .|.||+++..-. ++|..-++ +..+. ++ T Consensus 324 ~ie~ln~~lg~~~~~F~~~~l-----lr~d~--------~a 351 (351) T protein:vir:79 324 RFAELNDWLGDEVVTFDDYEI-----PPAPV--------AA 351 (351) T ss_pred HHHHHHhhcCcceeeeChhhh-----ccccc--------cC Confidence 233566544322 22211111 11111 11 No 99 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=96.42 E-value=0.00021 Score=40.74 Aligned_cols=310 Identities=12% Similarity=0.087 Sum_probs=144.1 Q ss_pred hhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHH---HHhhccchH-HHHHH Q lcl|NC_011269. 56 RRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCR---LFYATHDLV-PLLID 131 (867) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~ 131 (867) |+. .++++. +.. ...-+.-++++-|. |+.++. +-+..+.-|.- .|| +.|+= .-|.+ T Consensus 1 m~~--~~~~~~--~~~-~~~~~~~~~~~~p~---~~~~~~-----------~~~~~~~~~~~~~~~~~-~pP~~~~~La~ 60 (337) T protein:vir:78 1 MTK--RQQQPA--QAA-ASSPRPSVVFSMPE---AIDPTA-----------WMTDYTGVFYNPYGEYY-QPPIDRKGLAK 60 (337) T ss_pred CCC--cccCcc--ccc-ccCceeEEEecCcc---cccCcc-----------hhHhhhhhhhccCccee-cCCCCHHHHHH Confidence 221 111111 111 11112334444442 222211 11222222321 122 22221 11222 Q ss_pred hhhhcccc----c--cee--cccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecC Q lcl|NC_011269. 132 IYSKFPVV----G--MEF--DSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILN 203 (867) Q Consensus 132 ~~~~~~~~----~--~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (867) +++.=|.- . -|. .+-++. .+|++.++ ..|+.-|..+-+...|. .|.+-++..|+ T Consensus 61 l~~~~~~h~~~L~~k~N~~~~~f~~~-~~~~~~~~----------------~d~ll~GNay~~~~rn~-~G~~~~L~pl~ 122 (337) T protein:vir:78 61 VARANAHHGAILMARRNMVAGRFTNQ-RATITAFV----------------HNYLQFGDGGLLKLRNS-FGQVVGLHPLS 122 (337) T ss_pred HhhcchhhhhHHHhhhccccccCcCc-HHHHHHHH----------------HHHHhhCCeEEEEEECC-CCcEEEEEEeC Confidence 22211110 0 000 000010 12222222 34556688877777776 56788899999 Q ss_pred cceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccH Q lcl|NC_011269. 204 PDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISE 283 (867) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (867) |.+|++... +..+. ....++...++. T Consensus 123 ~~~v~~~~d----~~~~~--------------------------------------------------~~~~~~~~~~~~ 148 (337) T protein:vir:78 123 SVYLRRRED----GCFVY--------------------------------------------------LQQGKPNLIYRP 148 (337) T ss_pred CceeEeeeC----CeEEE--------------------------------------------------EEcCCceEEECC Confidence 999987521 11000 001223344555 Q ss_pred HHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHH Q lcl|NC_011269. 284 ALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQ 363 (867) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 363 (867) .-|-||++-.+.-.-.|.|-++-+.+++...++........-+.-..|=-|+++- +. .-++++.+.+|+.|+ T Consensus 149 ~eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~-----~~---~l~~e~~~~lk~~~~ 220 (337) T protein:vir:78 149 DDVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYAT-----DP---NMDDDTEEEMKEMIA 220 (337) T ss_pred ccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC-----CC---CCCHHHHHHHHHHHH Confidence 6678888655444567999988888888765433332222222212333333322 11 136788888888777 Q ss_pred Hhhhcch-h-hhhh-----hhheeeeeccccCccCchhHHHHHHH----HHHHHhhccchhhh---cCCCccceehhh-h Q lcl|NC_011269. 364 SLLAADF-R-LMVH-----NFGLKVENVFGRESVPNLDADYDRIE----RKLLQAWGIGEALI---SGGTGGAYASSA-L 428 (867) Q Consensus 364 ~~~~~~~-~-~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~---~~g~~~~~~~~~-~ 428 (867) ...-.++ . ++|+ .-|+++.-++.. ..|++|.++| ++|++++||.-.|+ ..+.+++|+|+. . T Consensus 221 ~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~----~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~ 296 (337) T protein:vir:78 221 NSKGVGNFRSMFVNIPDGKPDGIKLIPVGDI----ATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKY 296 (337) T ss_pred HhcCcccccceEEEcCCCCccceeEEEcCCC----hhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHH Confidence 6544443 2 3333 346676666543 4566665554 58999999998887 235677887654 3 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhH Q lcl|NC_011269. 429 NREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEY 489 (867) Q Consensus 429 ~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~ 489 (867) ...|+..-+.=++.+|++.+.+.+-++.+ + ++|++..+++= T Consensus 297 ~~~f~~~~L~P~~~~ie~~~n~~ll~~~~---~-----------------~~f~~~~~~~~ 337 (337) T protein:vir:78 297 DATYARNEVLPLCELVQDAINSAGLPRAL---W-----------------VTFRETIGAAV 337 (337) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCChhh---c-----------------eeccccccccC Confidence 33444444555666666655544322211 1 12222222111 No 100 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=96.31 E-value=0.00029 Score=39.95 Aligned_cols=337 Identities=15% Similarity=0.124 Sum_probs=142.1 Q ss_pred CCCcccccc------cchhHHHHHHHHhcCCCCCCchhhHH----hhh---------hhcccC-CchHHHHHHHHhhhcc Q lcl|NC_011269. 1 MSSPIYKAG------SNWSAEVNRLRKAGVNMPNSPTMARA----QAA---------ALQNTV-DNKPLIDYFQGRRRAA 60 (867) Q Consensus 1 ~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~---------~~~~~~-~~~~~~~~~~~~~~~~ 60 (867) -+++.-+|. --+..+|-++.|+--..+-..+..-+ .++ ..+-.| ++..+.+|..-..+|- T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~fg~p~~v~~~~~~~~~~~~~~~~~ 81 (376) T protein:vir:10 2 PARDRPRAARRRRHSFIFIHGVLRMSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGE 81 (376) T ss_pred CCCccchhhhhhcccchhhcccccchhccCCCcccchhhhhHhhhccCcceeEEEEcCCceeccCcchhhhhhhhhhcCc Confidence 111211111 11122233333322222211111110 010 111122 2222444443333321 Q ss_pred ----hhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhc Q lcl|NC_011269. 61 ----EANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKF 136 (867) Q Consensus 61 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 136 (867) +-++.-|+++-+ .|.. ||--+++....|. ++| T Consensus 82 ~~~pp~~~~~La~~~~----------------------------~~~~---------h~s~l~~k~n~l~-------~~~ 117 (376) T protein:vir:10 82 WFEPPVSFAGLAKSFR----------------------------ASTH---------HSSALFFKANVLA-------STF 117 (376) T ss_pred eecCCCCHHHHHHHHh----------------------------hhHH---------hhhhHHHHhHHHH-------hcc Confidence 112222222111 1111 1111111111110 011 Q ss_pred ccccceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhc Q lcl|NC_011269. 137 PVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQ 216 (867) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (867) - -.+. |.-.+|. ++-..|+.-|+.+-+..-|. .|.+-++..|.|.+|++... T Consensus 118 ~--------Pnp~-------------lT~~~f~--~~v~d~ll~Gnay~~~~rn~-~G~~~~L~pl~~~~vr~~~d---- 169 (376) T protein:vir:10 118 R--------PHRW-------------LSRHAFE--RWALDFLTFGNGYLERRRNM-VGGTLRLEPALAKYVRRKAD---- 169 (376) T ss_pred C--------CCCC-------------CCHHHHH--HHHHHHHhcCCeEEEEEECC-CCCEEEEEEeCCcceEEEee---- Confidence 0 0011 1111111 11133445577766666665 45667788899999887511 Q ss_pred chHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccc Q lcl|NC_011269. 217 RERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAW 296 (867) Q Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (867) .+++- .....++-+.++..-|-||.+-.+.= T Consensus 170 ~~~~~-------------------------------------------------~~~~~~~~~~~~~~eViHir~~~~~~ 200 (376) T protein:vir:10 170 FNGFV-------------------------------------------------YVNGWQERHEFEPDSVFQLVRPDINQ 200 (376) T ss_pred CCeEE-------------------------------------------------EEEcCCeEEEEccccEEEecCCCCCC Confidence 11100 01112333455666688887655444 Q ss_pred cccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch--hhhh Q lcl|NC_011269. 297 ATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF--RLMV 374 (867) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 374 (867) .-.|.|-++-+.+++...++........-+--..|=-|+++-+ + ..++++.|.+|+-|+..--.++ .++| T Consensus 201 ~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~d-------~-~l~~e~~~~lr~~~~~~~G~~N~~~~~v 272 (376) T protein:vir:10 201 EVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTD-------A-AQKQDDVDNMRDALKNAKGPGNFRNVFM 272 (376) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-------C-CCCHHHHHHHHHHHHHhcCccccCceeE Confidence 5679998888888877544433333333332233433333321 1 2488999999997776443443 2444 Q ss_pred h-----hhheeeeeccccCccCchhHHHHHHH----HHHHHhhccchhhhcC--CCccceehhhhhHHHHHHHHHHHHHH Q lcl|NC_011269. 375 H-----NFGLKVENVFGRESVPNLDADYDRIE----RKLLQAWGIGEALISG--GTGGAYASSALNREFVTQIMTGFQNA 443 (867) Q Consensus 375 ~-----~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~--g~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (867) + .-|+++.-++-.. .|++|.++| ++|+.++||.-.|+.- ..+++|+|+.--+ ..=+++. T Consensus 273 l~~~g~~~Gi~~~pls~~~----~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~eq~~------~~f~~~~ 342 (376) T protein:vir:10 273 YAPGGKKDGIQLIPVSEVA----AKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAA------RVFGRNE 342 (376) T ss_pred ecCCCCccceEEEEccCCH----HHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHH------HHHHHHH Confidence 4 3577777776543 456665554 5799999999888731 2235577754322 1112233 Q ss_pred HHHHHhhhhHHHHHhhcccchh-eehhhccccchhhhhhhhhhhhhHhhhh Q lcl|NC_011269. 444 LKRHIRRRCEVVAEAQGHYDYD-LKGGVRVPIYREIVEYDEETGQEYIRKV 493 (867) Q Consensus 444 l~~~~r~~~~~i~e~q~~~d~~-~~~~~~~~~~rd~~~~k~e~~k~~~r~~ 493 (867) |.-+++ .+.||+.+.... |+| +..+++..+.+ + T Consensus 343 L~Pl~~----~ieeln~~L~~~~~~F-----~~~~Llr~d~k--------a 376 (376) T protein:vir:10 343 IRPLQA----RFAELNDWLGEEVVRF-----DDYEIPPAPVA--------A 376 (376) T ss_pred HHHHHH----HHHHHHhhcccccccc-----ChhHhhccccc--------C Confidence 333322 233455433211 221 11122222221 1 No 101 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=96.28 E-value=0.00027 Score=40.19 Aligned_cols=331 Identities=15% Similarity=0.153 Sum_probs=151.8 Q ss_pred CCchHHHHHHHHhhh--cchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhh Q lcl|NC_011269. 44 VDNKPLIDYFQGRRR--AAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYA 121 (867) Q Consensus 44 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 121 (867) ||- .....++ +...+..--++..+...-+.-++.+-|. ||-++.+ -+.-+..|----|. T Consensus 1 m~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~---~v~~~~~-----------~~~y~~~~~~~~~~ 61 (350) T protein:vir:11 1 MSK-----RRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPM---PVLDGRG-----------ILDYLECWPNGRWY 61 (350) T ss_pred CCc-----cccCCCcCccccCCcchhhhccccccceEEEEeCCce---eecCcch-----------hhHHHHHhhcCccc Confidence 221 1000000 0000000001111222223344555442 1211111 01111112111111 Q ss_pred ccchHHH-HHHhhhhccccc--ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceeh Q lcl|NC_011269. 122 THDLVPL-LIDIYSKFPVVG--MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSS 198 (867) Q Consensus 122 ~~~~~~~-~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (867) +-|+-.. |..+++.=+... +.+.. +-+...| -=.+.|.-.+|. ++-..|+.-|..+-+..-|.. |..-+ T Consensus 62 ~pp~~~~~la~~~~~~~~h~~~l~~k~-n~l~~~~----~Pn~~~t~~~f~--~~v~d~ll~Gnay~~~~rn~~-G~~~~ 133 (350) T protein:vir:11 62 EPPLSMEGLAKSVGSSVYLQSGLKFKR-NMLAKTF----IPHRLLSRATFE--QFSLDWLTFGSAYLEQPRSRL-GTRMP 133 (350) T ss_pred cCCCCHHHHHHHHhhhhhhccchhhhh-hhhhhcc----cCCCCCCHHHHH--HHHHHHHhcCCeEEEEEEcCC-CCEEE Confidence 1111111 112221111111 11110 0011110 012234444443 233566777888777776764 56678 Q ss_pred heecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCC Q lcl|NC_011269. 199 EEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDG 278 (867) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (867) +..|+|++|++.. .++.. | ..-..++. T Consensus 134 L~~l~~~~vr~~~----~~~~~--------------------------------------~-----------~~~~~~~~ 160 (350) T protein:vir:11 134 LQAPLAKYMRRGT----DLETF--------------------------------------Y-----------QVRSWKDE 160 (350) T ss_pred EEEeCCceeEeee----cCCeE--------------------------------------E-----------EEeeCCeE Confidence 9999999998752 11100 0 00112334 Q ss_pred CcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHH Q lcl|NC_011269. 279 LDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEV 358 (867) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 358 (867) ..++..-|-|+.+-.+.-.-.|.|-++-+.+++...++........-+.-..|=-|+++-+ + .-++++.|.+ T Consensus 161 ~~~~~~eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~-------~-~ls~e~~~~l 232 (350) T protein:vir:11 161 HEFEKGSVIQLREADINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMTD-------A-AQNEEDIDAL 232 (350) T ss_pred EEECcccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-------C-CCCHHHHHHH Confidence 4566666888886555545679999999988888766544444444444444444444421 1 3588999999 Q ss_pred HHHHHHhhhcch--hhhhh-----hhheeeeeccccCccCchhHHHHHHH----HHHHHhhccchhhhcC--CCccceeh Q lcl|NC_011269. 359 RDDMQSLLAADF--RLMVH-----NFGLKVENVFGRESVPNLDADYDRIE----RKLLQAWGIGEALISG--GTGGAYAS 425 (867) Q Consensus 359 ~~~~~~~~~~~~--~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~--g~~~~~~~ 425 (867) |+.|+..--.++ .++|+ .-|+++.-++-. ..|++|.+++ ++|+.++||.-.|+.- ...++|++ T Consensus 233 ~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~pl~~~----~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn 308 (350) T protein:vir:11 233 RTALKTAKGPGNFRNLFVYAPNGKKEGIQLIPVSEV----AAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGGFGS 308 (350) T ss_pred HHHHHHhcCccccCceeeecCCCCccceEEEEcCCC----hhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcCC Confidence 998876544453 23444 346777766543 3456666555 4699999999988842 22355877 Q ss_pred hh-hhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehh Q lcl|NC_011269. 426 SA-LNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGG 469 (867) Q Consensus 426 ~~-~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~ 469 (867) +. ....|+..-++=++.+|++..+.+.+++.. |-++.+..+ T Consensus 309 ~e~~~~~f~~~~L~P~~~~ie~ln~~l~~~~~~---F~~~~~~~l 350 (350) T protein:vir:11 309 ISDAAAVWASLELAPMQTRLQQVNEMIGEEVVR---FAQFDAPGL 350 (350) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCccccc---cCcccccCC Confidence 53 334444444555555555433332222211 222222222 No 102 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=96.27 E-value=0.00026 Score=40.24 Aligned_cols=323 Identities=15% Similarity=0.122 Sum_probs=145.5 Q ss_pred CCCcccccccchhHHHHHHHHhcCCCCCCchhhHHhhhh---------hccc-CCchHHHHHHHHhhhcc----hhHHHH Q lcl|NC_011269. 1 MSSPIYKAGSNWSAEVNRLRKAGVNMPNSPTMARAQAAA---------LQNT-VDNKPLIDYFQGRRRAA----EANRQR 66 (867) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~-~~~~~~~~~~~~~~~~~----~~~~~~ 66 (867) ||.--+...+.-.+ .+.++-...++ .+-. ++...+.+|+.-..+|- +-++.- T Consensus 1 ~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~ 66 (351) T protein:vir:78 1 MSKRRSRAPRTFAA--------------APNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAG 66 (351) T ss_pred CCCCCCCCCCCCCC--------------CCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecCCCCHHH Confidence 66544432211111 11111111111 1111 12222445554443331 222333 Q ss_pred HHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcccccceeccc Q lcl|NC_011269. 67 LASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSK 146 (867) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (867) |+++-+.. ..-.--|.+.+.+.-.-|.-+| T Consensus 67 la~~~~~~----------------------------~~h~~~l~~k~n~l~~~~~Pn~---------------------- 96 (351) T protein:vir:78 67 LAKSFRAS----------------------------THHSSALFFKANVLASTFRPHR---------------------- 96 (351) T ss_pred HHHHHhhh----------------------------HhhhhhhhhhhhHHhhcccCCC---------------------- Confidence 33322211 1000000000001000011111 Q ss_pred chhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHH Q lcl|NC_011269. 147 DPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKD 226 (867) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (867) .|.-.+|. - +-..|+.-|+.+=+...|. .|..-++..|.|+++++..- .+..+. T Consensus 97 ---------------~~t~~~f~-~-~~~d~ll~Gnay~~~~rn~-~G~~~~L~pl~~~~v~~~~~---~~~~~~----- 150 (351) T protein:vir:78 97 ---------------WLSRHAFE-R-WALDFLTFGNGYLERRRNM-VGGTLRLEPALAKYVRRKAD---FSGFVY----- 150 (351) T ss_pred ---------------CCCHHHHH-H-HHHHHHhcCCeEEEEEECC-CCCEEEEEEecCcceEEeee---CCeEEE----- Confidence 11112221 1 1134555577777777776 45667788999999887510 000000 Q ss_pred HHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhH Q lcl|NC_011269. 227 LVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLR 306 (867) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 306 (867) ....++...++..-|-||.+-.+--.-.|.|-++- T Consensus 151 ---------------------------------------------~~~~~~~~~~~~~eVihir~~~~~~~~yGl~~~~~ 185 (351) T protein:vir:78 151 ---------------------------------------------VNGWQERHEFAPDSVFQLVRPDINQEVYGLPEYLS 185 (351) T ss_pred ---------------------------------------------EecCCeEEEEccccEEEEcCCCCCCCcccccHHHH Confidence 01123334455556778887655556789999998 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchh--hhhh-----hhhe Q lcl|NC_011269. 307 SFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFR--LMVH-----NFGL 379 (867) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-----~~~~ 379 (867) +.+++...++-.......-+.-..|=-|+++- + + ..++++.|.+|+-|+..--.+++ ++|+ .=|+ T Consensus 186 a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~-----~--~-~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~ 257 (351) T protein:vir:78 186 SLHSAWLNESSTLFRRKYYENGSHAGFILYMT-----D--A-AQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGI 257 (351) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEec-----C--C-CCCHHHHHHHHHHHHHhcCcccccceeeecCCCCccce Confidence 88888764443332222222222332223321 1 1 35899999999988775545543 4444 2366 Q ss_pred eeeeccccCccCchhHHHHHHHH----HHHHhhccchhhhcC--CCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhH Q lcl|NC_011269. 380 KVENVFGRESVPNLDADYDRIER----KLLQAWGIGEALISG--GTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCE 453 (867) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~--g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~ 453 (867) ++.-++-. ..|++|.++|+ +|++++||.-.|+.- +.++.|+|+.--+ ..=++..|.-++++ T Consensus 258 k~~pls~~----~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~------~~f~~~~l~P~~~~--- 324 (351) T protein:vir:78 258 QLIPVSEV----AAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAA------RVFGRNEIRPLQAR--- 324 (351) T ss_pred eEEEcCCC----hhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHH------HHHHHHHHHHHHHH--- Confidence 77666533 45677877765 699999999999831 2234576654322 22233444444443 Q ss_pred HHHHhhcccc-hheehhhccccchhhhhhhhhhhhhHhhhh Q lcl|NC_011269. 454 VVAEAQGHYD-YDLKGGVRVPIYREIVEYDEETGQEYIRKV 493 (867) Q Consensus 454 ~i~e~q~~~d-~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~ 493 (867) |.||+++.. ..|+|. ..+++..+. ++ T Consensus 325 -iee~n~~l~~~~~~F~-----~~~Llr~d~--------ka 351 (351) T protein:vir:78 325 -FAELNDWLGDEVVRFD-----DYEIPPAPV--------AA 351 (351) T ss_pred -HHHHHhhcCccceecC-----hhhhccccc--------cC Confidence 234554322 112211 111222222 11 No 103 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=95.95 E-value=0.0012 Score=36.57 Aligned_cols=518 Identities=15% Similarity=0.091 Sum_probs=191.1 Q ss_pred CCCcccccccchhHHHHHHHHhc-CCCCCCchhhHHhhhhhcccCCchH-HHHHHHHhhhcchhHHHHHHHHhccccccc Q lcl|NC_011269. 1 MSSPIYKAGSNWSAEVNRLRKAG-VNMPNSPTMARAQAAALQNTVDNKP-LIDYFQGRRRAAEANRQRLASYRKQGNFGS 78 (867) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (867) |-.-+-||-.--...|...||.= ..|--..++.++.. +-+.+.- +..+... +| .++.+ -+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~------~~----~~~~~--~~~~ 64 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYKMHLREIDTNVVN----NEPYSMESIEKGMNG------KT----TAYMQ--PIIG 64 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhccccchhhhhhhh----ccCCCHHHHHHhHhh------hc----ccccc--hhhh Confidence 11111111100011112222110 00111111111110 0000000 0000000 00 00000 0011 Q ss_pred ceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh----------cccccc--eeccc Q lcl|NC_011269. 79 NMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK----------FPVVGM--EFDSK 146 (867) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~--~~~~~ 146 (867) +++. -+-.. ...++.|.++. +++ .+.| ...++|..||++... -.|.+| ++--+ T Consensus 65 ~~~~-----~~~~~---~~~~~~~~~~~---~~~---l~~~-~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~k 129 (574) T protein:vir:80 65 EMSV-----NPGYK---TKPSIRNSQDL---HKT---LKKF-GNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLK 129 (574) T ss_pred hccc-----ccccc---CcCccCCcccH---HHH---HHhh-ccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEe Confidence 1110 00000 12333333332 111 2333 456888888876542 122222 22112 Q ss_pred c-------------hhHHHHHHHHhhcc---cccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehh Q lcl|NC_011269. 147 D-------------PLIKTFYEDLFFGE---DLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVS 210 (867) Q Consensus 147 ~-------------~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (867) | .-|..|..+.-... +-...+|+.-++ ..++.-|.++=....|.. |.+.++..|+|+.|+|. T Consensus 130 d~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv-~~lll~Gnayi~i~r~~~-G~~~~L~pl~p~~V~v~ 207 (574) T protein:vir:80 130 DIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLV-RATYMYDQVNFEKVFDKD-GNFIKFDTVDPTTIFLA 207 (574) T ss_pred ccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHH-HHHHhcCCeEEEEEECCC-CcEEEEEEEcCceeEEE Confidence 1 11223333322211 123457888766 788888998877777754 67889999999999986 Q ss_pred hhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhh Q lcl|NC_011269. 211 RSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVV 290 (867) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (867) ..... .+. ..+ .++.++ ...+....++..-|-|++ T Consensus 208 ~d~~~---~~~----------------~~~--------------------~~y~~~------~~g~~~~~~~~~eiih~~ 242 (574) T protein:vir:80 208 TNGEG---KLI----------------KNG--------------------ERFVQV------IDNRIVAKFNERELAFAV 242 (574) T ss_pred EcCcc---ccc----------------cCc--------------------eEEEEE------eCCceEEEEccccEEEEe Confidence 32100 000 000 001111 112222334555566765 Q ss_pred hcCcc--c-cccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhh Q lcl|NC_011269. 291 NRPTA--W-ATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLA 367 (867) Q Consensus 291 ~~~~~--~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (867) ..... + ...|.+.+.-+.++|.......+......+.-..|=-++++- ++. -.+++.++.+|+-|+.... T Consensus 243 ~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~----~~~---~ls~e~~~~lk~~~~~~~~ 315 (574) T protein:vir:80 243 RNPRADIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVK----TGQ---QQSQQALDIFRREWRSSLA 315 (574) T ss_pred ccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC----CCC---CCCHHHHHHHHHHHHHHhc Confidence 33221 1 235888888888888766655555555555556666666663 111 2578889999998876653 Q ss_pred -cc--hh-hhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhh----------cCCCccceehhhhh-HHH Q lcl|NC_011269. 368 -AD--FR-LMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALI----------SGGTGGAYASSALN-REF 432 (867) Q Consensus 368 -~~--~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~g~~~~~~~~~~~-~~~ 432 (867) ++ .+ +||-.-|++++.....-.-..+-.-.++..++|.+++||.-.+| +|+++.+|+++.-- ..| T Consensus 316 G~~n~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f 395 (574) T protein:vir:80 316 GINGSWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQAS 395 (574) T ss_pred cccccccceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHH Confidence 23 22 46666788888876543333333445667889999999999887 34455678877543 446 Q ss_pred HHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhh-----hhhhh--hhhhcccc Q lcl|NC_011269. 433 VTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRK-----VPKLL--IPEIKFST 505 (867) Q Consensus 433 ~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~-----~~k~i--~~~i~~~~ 505 (867) +.+-+.-+..+|++.|.+.+-. +... .+.+.|...+.-+..+. .++.+..... +...+ ++.+.+.. T Consensus 396 ~~~tL~P~~~~ie~~ln~~Ll~--~~~~----~~~~~f~~~d~~~~~~~-~~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD 468 (574) T protein:vir:80 396 QNKGLQPLLRFIEDTVNTYIVA--EFGE----KYQFQFRGGDLSAQLDK-LKIIEQEGKVFRTVNEIRHDKGLEPIKGGD 468 (574) T ss_pred HHHHHHHHHHHHHHHHHhhhhh--hcCC----ceEEEecccchhhHHHH-HHHHHHHhCCccCHHHHHHHhCCCCCCCCC Confidence 6666888888888888877622 2211 12223332222111111 1111111000 00000 12222221 Q ss_pred ccccchhhhhhhhhhhhhceeeeeccccCCCcccccch-hhhhhHHHHHHHHhhcccccccccccccccCCCCCcccccc Q lcl|NC_011269. 506 LNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQ-ELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQH 584 (867) Q Consensus 506 ~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~-e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~ 584 (867) ..+.. .-..+......... +.+.........+......-.+.....+........+..+ T Consensus 469 ~~~~~-------------------~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~- 528 (574) T protein:vir:80 469 VILNG-------------------VHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQ- 528 (574) T ss_pred Eeeec-------------------cceeecccccccccCCccchhccccccccccCCCCCCCCCCCCCCccccccchhh- Confidence 11100 00000000000000 0000000000000000000000000000000000000000 Q ss_pred ccccccCCCCCCCCCCCCCCCCccCCCCccCCCCCCCccCccCcCCCCCCCC Q lcl|NC_011269. 585 LQSTLALRQGKTQTELGEAQAVAGEAQAELQTKQIEMQEMMMDQQMAGGVMP 636 (867) Q Consensus 585 p~~t~~~a~gpgq~~~~qa~~~agq~~~p~~~~~~~~qp~~~~qg~pG~~gP 636 (867) ...+...+.......+-...-+..+.. .... ......++..|---- T Consensus 529 --~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~-~~~~~~~~~~~~~~~ 574 (574) T protein:vir:80 529 --DEQQGLNGKSKKVNGKVDDNVGKDGQL---KSEE-NTNSTKHGTDGIKKE 574 (574) T ss_pred --hhhhhhccchhhhcCCccccccccccc---cccc-ccccccccCccccCC Confidence 000000000000000000000000000 0000 000000000000000 No 104 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=95.78 E-value=0.001 Score=36.96 Aligned_cols=372 Identities=12% Similarity=0.059 Sum_probs=157.1 Q ss_pred hcchhHHHHHHHHhcc-------cccccceeeccchhhhhhhhhHHhhC-----------CCchhhhHHHHHHHHHHHHH Q lcl|NC_011269. 58 RAAEANRQRLASYRKQ-------GNFGSNMQIAMPKIRQPLGTLADKGI-----------PFNVEDEEELRVIRHWCRLF 119 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~ 119 (867) -|+-+|+--+++.-.- .-+.+|=+|. +|+.|-.....+.. |..|.-.. .. .-+ +.= T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~--~~t-~~~ 74 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMV--EFRGPEEEPEARALPWIRPTAWSGYPESWATPS-WG--SAQ-DKL 74 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCcee--eccCCCcchhhhhcccccccccccccccccccC-cc--ccc-hhh Confidence 2222222111000000 0001111111 13333322221111 11110000 00 000 011 Q ss_pred hhccchHHHHHHhhhhcccccceeccc--chhHHHHHHHHhhccc----ccHHHHhHHHHHHHHhhhhhhcchhhhhhhc Q lcl|NC_011269. 120 YATHDLVPLLIDIYSKFPVVGMEFDSK--DPLIKTFYEDLFFGED----LNYLEFLPDQFAREYFTVGEVTSLAHFNESL 193 (867) Q Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (867) |...+.|-.|||+-++ .|..+.|... +..+.+.+. + +-.+ .+-.+|+..++ ..++ +|..+-+.--.-.. T Consensus 75 ~~~~~~v~acV~~Ia~-~iA~lpl~~~~~~~~~~~~~~-l-l~~~PN~~~t~~~f~~~l~-~~ll-lGnay~~~i~r~~~ 149 (409) T protein:vir:83 75 RTLIDVAWACIDLNAS-VLSSMPIYRMRNGRIIDSVAW-M-SNPDPEVYTSWQEFAKQLF-WDFQ-LGEAFVLPMAHGSD 149 (409) T ss_pred HhhhHHHHHHHHHHHH-hhccCceEEeeCCccccchhh-h-cccCCCCCCCHHHHHHHHH-HHHh-hCCcEEEEEEECCC Confidence 2234566677777655 2333333221 111122211 1 2222 44567887755 4454 48876553323345 Q ss_pred cceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhh Q lcl|NC_011269. 194 GVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAA 273 (867) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 273 (867) |..-++..|+|+.|.|. +..+.++. |+ | T Consensus 150 G~~~~L~pl~p~~v~v~---~~~~g~~~-------------------------------------y~-----~------- 177 (409) T protein:vir:83 150 GYPIRFRVVPPWLVNVE---LKKGARRE-------------------------------------YR-----I------- 177 (409) T ss_pred CcEEEEEEECCcceEEE---EcCCceEE-------------------------------------EE-----E------- Confidence 66778999999988776 22221111 10 0 Q ss_pred ccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHH Q lcl|NC_011269. 274 MQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQG 353 (867) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 353 (867) . +...+++ |-|+....+.-.-.|.+.+--+-++|-.....++......+.-..|--++++.+ .-+.+ T Consensus 178 -~--~~~~~~e-iiHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~~---------~ls~e 244 (409) T protein:vir:83 178 -G--GLNVTDE-ILHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVER---------RLSET 244 (409) T ss_pred -c--cccCccc-eEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecCC---------CCCHH Confidence 0 0111122 557765554444578888877777776666555555555555556666666542 24778 Q ss_pred HHHHHHHHHHHhhhcc--hhhhhhhhheeeeeccccCccCchh---HHH----HHHHHHHHHhhccchhhhcC---CCcc Q lcl|NC_011269. 354 ELDEVRDDMQSLLAAD--FRLMVHNFGLKVENVFGRESVPNLD---ADY----DRIERKLLQAWGIGEALISG---GTGG 421 (867) Q Consensus 354 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~---~~~----~~~~~~~~~~~~~~~~~~~~---g~~~ 421 (867) +.+.+|+.++....-. -.++ ++.-+.-.+.+.++ .+| ++..++|.+++||.-.||.. +++. T Consensus 245 ~~~~~~~~~~~~~~~nag~~~i-------l~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~ 317 (409) T protein:vir:83 245 EAVDLMDRWIESRSKYAGHPAL-------VTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVGLPGATGSL 317 (409) T ss_pred HHHHHHHHHHHhhCCccCccce-------ecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCcccc Confidence 8899999887654321 1122 22222223334443 333 44567899999999888841 3445 Q ss_pred ceeh-hhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhh Q lcl|NC_011269. 422 AYAS-SALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPE 500 (867) Q Consensus 422 ~~~~-~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~ 500 (867) +|++ .+.+..|+..-++=+..+|++.+.+.+-.- .+.|+|-+..+.--|..+ +.+.++..+.. T Consensus 318 tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~-------~~~~~f~~~~llr~d~~~-r~~~~~~~~~~-------- 381 (409) T protein:vir:83 318 TYSNIEQLFSFHDRSSLRPKATAVMAALDRWALPS-------PQHLELNRDDYTRPSLVE-RATAYKIMIEA-------- 381 (409) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC-------CcEEEeehhhhhccCHHH-HHHHHHHHHhC-------- Confidence 7887 455555666666666666666665544110 122333222221111111 11111111111 Q ss_pred hccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccch Q lcl|NC_011269. 501 IKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQ 543 (867) Q Consensus 501 i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~ 543 (867) +-+..+.-+..++ +- +..+.+--...++ T Consensus 382 ---G~lT~NE~R~~~g-lp-----------p~~ggd~l~~~gv 409 (409) T protein:vir:83 382 ---GVMEPNEARAMER-LH-----------SEAAAVRLSGGGV 409 (409) T ss_pred ---CCcCHHHHHHHhC-CC-----------CCCCCcccCCCCC Confidence 0011111010000 00 0000000000111 No 105 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=95.74 E-value=0.00014 Score=41.65 Aligned_cols=204 Identities=10% Similarity=0.076 Sum_probs=103.3 Q ss_pred ccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccc-cCcchhhHHHH Q lcl|NC_011269. 231 LRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWAT-RGAPHLLRSFR 309 (867) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 309 (867) +|++ .+.+|.=.+ .+-.....++...++.+-|-|+.+-. ++.. +|.|.+..|.. T Consensus 1 ~r~~--------------------~dg~~~y~~----~~~~~~~~g~~~~~~~~eilH~r~~~-~~~~~~Glspi~~a~~ 55 (219) T protein:vir:98 1 MRVC--------------------KDGNYKYLM----KKSLYDTKSEIYEYNKNDVIFIKLYD-PMQQVYGSPDYVGGIT 55 (219) T ss_pred Ccee--------------------ecCeEEEEE----ecceecCCceeEEeccccEEEecCCC-CCCCcceecHHHHHHH Confidence 2332 111221111 11112233556677778888997643 3333 69999999988 Q ss_pred HHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch--hhhhh-----hhheeee Q lcl|NC_011269. 310 TLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF--RLMVH-----NFGLKVE 382 (867) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~-----~~~~~~~ 382 (867) ++....+..+-+...-+.-.+|=-++++.+ + .-+.+..+.+|+.|+..--.++ .++|. .-|++++ T Consensus 56 ~i~~~~aa~~~~~~~f~Ng~~p~gil~~~~-------~-~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~ 127 (219) T protein:vir:98 56 SALLNSDATIFRRRYYSNGAHMGFILYSTD-------P-DMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVI 127 (219) T ss_pred HHHHHHHHHHHHHHHHhcCCCCceEEEeCC-------C-CCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEE Confidence 887765555555555555555654444431 1 1266777888886665433332 23333 2355555 Q ss_pred eccccCccCchhHHH----HHHHHHHHHhhccchhhh--cCCCccceehhh-hhHHHHHHHHHHHHHHHHHHHhhhhHHH Q lcl|NC_011269. 383 NVFGRESVPNLDADY----DRIERKLLQAWGIGEALI--SGGTGGAYASSA-LNREFVTQIMTGFQNALKRHIRRRCEVV 455 (867) Q Consensus 383 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~--~~g~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~r~~~~~i 455 (867) .+.-. +.|.+| ++-..+|.+++||.-.++ +...+.+|+++. .+..|+..-++=++.+|++.|.+.+ T Consensus 128 ~~~~~----~~d~qfle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~--- 200 (219) T protein:vir:98 128 PIGDT----GQKDEFANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDY--- 200 (219) T ss_pred EccCC----HHHHHHHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhh--- Confidence 44422 234443 334567999999999986 434456787743 3333444444444444444443211 Q ss_pred HHhhcccchheehhhccccchhhh Q lcl|NC_011269. 456 AEAQGHYDYDLKGGVRVPIYREIV 479 (867) Q Consensus 456 ~e~q~~~d~~~~~~~~~~~~rd~~ 479 (867) .+...+..-|...+.+|+= T Consensus 201 -----~~~~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 201 -----EIKSALKVNFKQPEKRDKN 219 (219) T ss_pred -----cCCCccEEeecCcccccCC Confidence 1111222223333333322 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=95.48 E-value=0.002 Score=35.40 Aligned_cols=413 Identities=11% Similarity=0.028 Sum_probs=160.8 Q ss_pred HHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccc-ce Q lcl|NC_011269. 64 RQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVG-ME 142 (867) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 142 (867) .-++.+|.+...+++ -..++..+= -+.+.+. ...+ -| ..|+.|.|+...||+.-+..+.. ++ T Consensus 1 ~~~~D~~~~~~~~~g---~~~~~~~~~------~~~~~~~-~~~~------l~-a~Y~~~~l~~~~vd~~a~d~~r~~~~ 63 (437) T protein:vir:52 1 MKFFDGIKSLALKLG---SKQEQTYYS------PSLSLTD-DLVQ------LE-ALWRDNWIANKVCIKRPEDMVRNWRE 63 (437) T ss_pred CchhhhhHhHHhcCC---Cccccceee------cCccccc-cHHH------HH-HHHHhCchhhHHhhcchHHhhcCCce Confidence 222223332221111 111111110 1111111 1222 34 45999999999999988877766 88 Q ss_pred ecccc---hhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehh----------eecCcceeeh Q lcl|NC_011269. 143 FDSKD---PLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSE----------EILNPDMLRV 209 (867) Q Consensus 143 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~ 209 (867) |.++| ..+|.|- -.| ++|++-+-|.|.+ +-.=-.|...=+ ...+ +..|+.- .++++..|.. T Consensus 64 i~~~d~~~~~~~~~~--~~~-~~l~~~~~l~~a~-~~~rl~G~a~i~-i~~d-~~~~~~pl~~~~~~~~~~v~~~~~v~~ 137 (437) T protein:vir:52 64 IYSNDLNSKQLDLFT--KFE-RSLKLRETLTKAL-QWSSLYGSVGLL-VVTD-SQNTSAPLKPTERLKRLIILPKWKISP 137 (437) T ss_pred EecCCCCHHHHHHHH--HHH-HhhcHHHHHHHHH-HhcccccceEEE-EEec-CCCcccccccCCceeEEEEechhhccc Confidence 87754 3344433 235 7899999998866 211122332222 1111 2234322 2222222111 Q ss_pred hhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHh Q lcl|NC_011269. 210 SRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRV 289 (867) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (867) . ... -+-|.+. +|.. |+.-+-...++.+.|...=|.|+ T Consensus 138 ~-------------~~~-----~~dp~s~-------------------~fg~-----p~~y~v~~~~~~~~iH~SRii~~ 175 (437) T protein:vir:52 138 T-------------GTK-----DDDVLSP-------------------NFGR-----YSEYSILGGSQSITVHHSRLIIL 175 (437) T ss_pred c-------------ccc-----ccccccc-------------------ccCc-----ceEEEEecCCcceeEccceeEEe Confidence 0 000 0011111 0100 11000011223334444334444 Q ss_pred hhcC---ccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhc--ccccCCCCcCCCCHHHHHHHHHHHHH Q lcl|NC_011269. 290 VNRP---TAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLG--IEDMGDGEPWIPDQGELDEVRDDMQS 364 (867) Q Consensus 290 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 364 (867) .+.. +.+.-||.|++-+++-.|..-++-...=..+..+.... .+|+- .+.++.|- .+.+.+...-+.. T Consensus 176 ~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~l~~~~-----~~~~~~~~~~~~~ 248 (437) T protein:vir:52 176 NANDAPLSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKID--IFKIAGLSDKIAAGM-----ENEVASVISAVQE 248 (437) T ss_pred cCccCCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCC--ceecchHHHHhcCCc-----HHHHHHHHHHHHH Confidence 3332 56778999999999988876655554444445454333 33332 12333331 2333333332222 Q ss_pred hhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHH-HH Q lcl|NC_011269. 365 LLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQ-NA 443 (867) Q Consensus 365 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~-~~ 443 (867) +--. ..++|-.=+-+++.+-++ +=.|++=++..+..|.-+.||-...+.|=.-+.+++..=-+.-..+.+-.+| .. T Consensus 249 ~~~~-~~~~~~d~~~~~e~~~~~--~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~yyd~i~~~Qe~~ 325 (437) T protein:vir:52 249 IKSA-TNSLLLDAENEYDRKELT--FTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQNYHEAIRRLQETR 325 (437) T ss_pred hcCC-CceEEEcCCcceEEEecC--cCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHHHHHHHHHHHHHH Confidence 1111 222221112222222221 2257788899999999999999999984333344432211111122233344 23 Q ss_pred HHHHHhhhhHHHH-Hhhcccchheehhhccccchh---hhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhh Q lcl|NC_011269. 444 LKRHIRRRCEVVA-EAQGHYDYDLKGGVRVPIYRE---IVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIA 519 (867) Q Consensus 444 l~~~~r~~~~~i~-e~q~~~d~~~~~~~~~~~~rd---~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~ 519 (867) |+-.+.+.++-|. +.-+..+.++++.|+-|...+ ..+-.+++++- +.+. +.-+ +.....+.+.+. T Consensus 326 l~p~le~l~~~i~~~~~g~~~~~~~~~f~pL~~~s~kekae~~~~~a~a-~~~~-------~~~g---~i~~~e~r~~L~ 394 (437) T protein:vir:52 326 LRPIFEIIDPLICNELFGGLPADWWFEFVPLTTVKQEQQINMLNTFATA-ANTL-------IQNG---VLNEYQIANELR 394 (437) T ss_pred HHHHHHHHHHHHHHHhcCCCCCcceEEeCCcCCcCHHHHHHHHHHHHHH-HHHH-------HhcC---CCCHHHHHHHHH Confidence 4555555554442 233345567777777654432 22222222111 1110 1111 111122222222 Q ss_pred hhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 520 QLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 520 qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) . .+. .+ .+..+ +.+..... ..+....+...+.....+ ..+. ++ T Consensus 395 ~---~g~------~~-~i~~~-~~~~~~~~----~~~~~~~~~~~~~~~~~~--~~~~-~~ 437 (437) T protein:vir:52 395 E---SGL------FA-NISAE-HIEELKNA----DEFAGNFEEPEKMEGAQV--QNSE-DQ 437 (437) T ss_pred h---cCC------CC-CCCcc-ccccccCC----CCCCCccCCCCCCCCCCC--CCCC-CC Confidence 1 100 00 00000 00000000 000000000001111111 1111 11 No 107 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=95.37 E-value=0.00054 Score=38.48 Aligned_cols=327 Identities=12% Similarity=0.086 Sum_probs=143.0 Q ss_pred HHHHHHHhhhcchhHHHHHHHHhcccccccceeec-cchhhhhhhhhHHhhCCCchhhhHHHHHHHHHH---HHHhhccc Q lcl|NC_011269. 49 LIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIA-MPKIRQPLGTLADKGIPFNVEDEEELRVIRHWC---RLFYATHD 124 (867) Q Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 124 (867) |++-.. .-++..+.. -+.-++++ -|. ||-++. +-+..+--|. ..|| +.| T Consensus 1 ~~~~~~-----------~~~~~~~~~-~~~~~~~~~~p~---~~~~~~-----------~~~~~~~~~~~~~~~~~-epp 53 (348) T protein:vir:26 1 MTEQLI-----------HSHTTDGTE-SKSVYSFDPNPE---PVDTNS-----------WMTRYCELFYNDFDDYW-EPP 53 (348) T ss_pred CCcccc-----------chhhccccC-CceEEEecCCCe---eecCcc-----------hHHHHHHHHhcCCCccc-cCC Confidence 111100 000000110 11223333 121 111111 1111222231 1111 112 Q ss_pred h-HHHHHHhhhhccccc--ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehhee Q lcl|NC_011269. 125 L-VPLLIDIYSKFPVVG--MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEI 201 (867) Q Consensus 125 ~-~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 201 (867) + ..-|..+++.=+... +.+-. +-+.+.| -=.+.|.-.+|. ++-..|+.-|+.+=+..-|.. |.+-++.. T Consensus 54 ~~~~~La~l~~~n~~h~~~i~~k~-N~l~~~~----~Pn~~~t~~~f~--~~~~d~ll~Gnay~~~~rn~~-G~~~~L~~ 125 (348) T protein:vir:26 54 ISLKGLAEIANANGYHGSLLKARA-NYVAGRF----MNGGGLPMYKMN--SACWDYFGLGMSAFVKIRSYL-KNVIALEP 125 (348) T ss_pred CCHHHHHHHHhhhhhhhhhHhhhh-hHHhhcc----cCCCCCCHHHHH--HHHHHHHhcCCeEEEEEEcCC-CcEEEEEE Confidence 1 111111111111110 00000 0000000 001122223331 122456666888777666654 45668889 Q ss_pred cCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcc Q lcl|NC_011269. 202 LNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDI 281 (867) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 281 (867) |.|.+|++... ++.+. ....|+...+ T Consensus 126 l~~~~v~~~~d----~~~~~--------------------------------------------------~~~~g~~~~f 151 (348) T protein:vir:26 126 LPMVHMRKRKN----GDFVQ--------------------------------------------------LLRNNEQKVF 151 (348) T ss_pred ecCceeEeeec----CcEEE--------------------------------------------------EEecCeEEEE Confidence 99999887521 11110 0112333455 Q ss_pred cHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHH Q lcl|NC_011269. 282 SEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDD 361 (867) Q Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (867) +..-|-|+.+-.+--.-.|.|-++-+.+++...++-.......-+.-..|=-|+++- + + ..++++.+.+|+- T Consensus 152 ~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~-----~--~-~ls~e~~~~lk~~ 223 (348) T protein:vir:26 152 KAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYAT-----D--P-NLSEADEKALKEK 223 (348) T ss_pred cCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec-----C--C-CCCHHHHHHHHHH Confidence 566778887644444557999999999888765544333333333333343333322 1 1 3488899999998 Q ss_pred HHHhhhcchh--hhhh-----hhheeeeeccccCccCchhHHHHHHHH----HHHHhhccchhhhc--CCCccceehhh- Q lcl|NC_011269. 362 MQSLLAADFR--LMVH-----NFGLKVENVFGRESVPNLDADYDRIER----KLLQAWGIGEALIS--GGTGGAYASSA- 427 (867) Q Consensus 362 ~~~~~~~~~~--~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~--~g~~~~~~~~~- 427 (867) |+..--.+++ ++|+ .-|+++.-++- -..|++|.++|+ +|++++||.-.|+. -..+++|+++. T Consensus 224 ~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~----~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~ 299 (348) T protein:vir:26 224 IASSKGIGNFRSMFVNIPNGKEKGIQLIPVGD----IATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLK 299 (348) T ss_pred HHHhcCcccccceeEEcCCCCccceeEEEccC----ChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHH Confidence 8776545543 4454 44666666653 235778877766 49999999998863 13446787753 Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhh Q lcl|NC_011269. 428 LNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPK 495 (867) Q Consensus 428 ~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k 495 (867) ....|+..-++=++.+|++.|-+.+.. -. +..++| +...+.+....+.. T Consensus 300 ~~~~f~~~~l~P~~~~ie~~ln~~l~~----~~--~~~~~f-------------dl~~~~e~~~~~a~ 348 (348) T protein:vir:26 300 VSQVYDFYEVIPVCKRFMDAVNNDPEI----PD--NLKLKF-------------NLNPGVESANGSAV 348 (348) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhCC----CC--ccEEEE-------------ecCcccccchhhcC Confidence 233333333444555555444433210 00 111222 21111111111111 No 108 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=95.20 E-value=0.00083 Score=37.49 Aligned_cols=321 Identities=12% Similarity=0.088 Sum_probs=144.1 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHH--hh Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLF--YA 121 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 121 (867) |+-+ ++++..+.-.+...-+.-++.+-|. ||-+.++ -+..+ ||-.. |- T Consensus 1 m~~~--------------~~~~~~~~~~~~~~~~~~~~~~~p~---~~~~~~~-----------~~~~~--~~~~~~~~~ 50 (340) T protein:vir:98 1 MSKR--------------KPRKAVAMTASAPQKMEAFTFGEPV---PVLDKRD-----------ILDYV--ECISNGKWY 50 (340) T ss_pred CCCC--------------CCCccccccccCccceeEEEcCCce---eecCcch-----------hhhhh--hhhhcCcee Confidence 2211 1111111111111111112222221 1111111 11122 33221 22 Q ss_pred ccchH-HHHHHhhhhccccc--ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceeh Q lcl|NC_011269. 122 THDLV-PLLIDIYSKFPVVG--MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSS 198 (867) Q Consensus 122 ~~~~~-~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (867) +-|+- .-|..+++.=+... +.+.. +.+.+.| -=.+.|.-.+|- ++...|+.-|+.+-+..-|.. |..-+ T Consensus 51 ~pp~~~~~la~l~~a~~~h~s~i~~k~-n~l~~~~----~Pn~~lt~~~f~--~~~~d~ll~Gnay~~~~rn~~-G~~~~ 122 (340) T protein:vir:98 51 EPPVSFSGLAKSLRSAVHHSSPIYVKR-NVLASTY----IPHPLLSRQDFS--RFALDYLVFGNAFLEQRHSVT-GQLIK 122 (340) T ss_pred cCCCCHHHHHHHHHhccccchhhhhhh-hHHhhcc----CCCCCCCHHHHH--HHHHHHHhcCCeEEEEEECCC-CcEEE Confidence 33331 11223322222111 11100 0000000 001122223331 233556667888777766654 45667 Q ss_pred heecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCC Q lcl|NC_011269. 199 EEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDG 278 (867) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (867) ++-|.+.+|++... +++. | ++ -..++- T Consensus 123 L~pl~~~~vr~~~~----~~~~--------------------------------------~-----~~------~~~~~~ 149 (340) T protein:vir:98 123 LLTSPAKYTRRGVD----DSVF--------------------------------------W-----FV------ENFTQP 149 (340) T ss_pred EEEeCCceEEEccc----CcEE--------------------------------------E-----EE------ecCCeE Confidence 88888888886410 0000 0 00 012233 Q ss_pred CcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHH Q lcl|NC_011269. 279 LDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEV 358 (867) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 358 (867) +.++..-|-||++-.+...-.|.|-++-+.+++...++-.......-+.-..|=-++++-+ + ..++++.|.+ T Consensus 150 ~~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~~-------~-~ls~e~~~~l 221 (340) T protein:vir:98 150 HEFAPDTVFHLLEPDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVTD-------P-AQSATDVESL 221 (340) T ss_pred EEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-------C-CCCHHHHHHH Confidence 4566667889987555566789999999988887655544433333333333433333321 2 3588899999 Q ss_pred HHHHHHhhhcch--hhhhh-----hhheeeeeccccCccCchhHHHHHHH----HHHHHhhccchhhhc--CCCccceeh Q lcl|NC_011269. 359 RDDMQSLLAADF--RLMVH-----NFGLKVENVFGRESVPNLDADYDRIE----RKLLQAWGIGEALIS--GGTGGAYAS 425 (867) Q Consensus 359 ~~~~~~~~~~~~--~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~--~g~~~~~~~ 425 (867) |+-|+..--.++ .++|+ .-|+++.-++-.- .|++|.++| .+|+.++||.-.|+. ...+++|++ T Consensus 222 k~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~----~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn 297 (340) T protein:vir:98 222 RDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEVA----TKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGD 297 (340) T ss_pred HHHHHHhcCccccCceeEecCCCCccceEEEEcCCCh----hHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCcccc Confidence 997776433443 35555 4577777665432 456666555 479999999999873 133456776 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccchh-eehhhccccchhhhhhh Q lcl|NC_011269. 426 SALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDYD-LKGGVRVPIYREIVEYD 482 (867) Q Consensus 426 ~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~~-~~~~~~~~~~rd~~~~k 482 (867) +.--+ ..=.++.|.-+ ++.|.||+++.... ++|. ..++++.+ T Consensus 298 ~e~~~------~~f~~~~l~Pl----~~~iee~n~~L~~e~~rF~-----~~~l~~~d 340 (340) T protein:vir:98 298 VEKVA------KVFVRNELSPL----QDRFREVNDWLGMEVIRFK-----EYTLDNPE 340 (340) T ss_pred HHHHH------HHHHHHHHHHH----HHHHHHHHhcccccccccC-----ccccccCC Confidence 54322 12222333332 22233566543222 2221 11222222 No 109 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=94.99 E-value=0.003 Score=34.41 Aligned_cols=376 Identities=10% Similarity=0.115 Sum_probs=149.7 Q ss_pred eccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHH----HHHhhccchHHHHHHhhhhcccccceecc--------c-ch Q lcl|NC_011269. 82 IAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWC----RLFYATHDLVPLLIDIYSKFPVVGMEFDS--------K-DP 148 (867) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~-~~ 148 (867) ||+=+.-......+ +.| .++.+++.....|- ..=|..++.|-.||++-++ .|..+.+.. + +. T Consensus 1 mg~~~~~~~~~~~~-~~~---~~~~~~~~~~~~~~~~~t~~~~~~~~~v~~cv~~Ia~-~ia~~p~~v~~~~~~~~~~~~ 75 (403) T protein:vir:10 1 MGFKSWITEKLNPG-QRI---IRDMEPVSHRTNRKPFTTGQAYSKIEILNRTANMVID-SAAECSYTVGDKYNIVTYANG 75 (403) T ss_pred Ccchhhhhhccchh-hhh---hhcccccccccCCcccccHHHHHHHHHHHHHHHHHHH-HHhhCceeEeecccccccccc Confidence 33322111111101 111 11111111110000 0112346778888877665 333333221 1 11 Q ss_pred hHHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHH Q lcl|NC_011269. 149 LIKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVK 225 (867) Q Consensus 149 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (867) .++.=..+++- .+..+-.+|+..++ ..++.-|+++-+.. +. ++..|.|+.+.|... .+..+- T Consensus 76 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~~ll~Gnayi~~~-----~~--~l~~l~~~~~~v~~~---~~~~~~---- 140 (403) T protein:vir:10 76 VKTKTLDTLLNVRPNPFMDISTFRRLVV-TDLLFEGCAYIYWD-----GT--SLYHVPAALMQVEAD---ANKFIK---- 140 (403) T ss_pred cccchHHHHHhhCCCCCCCHHHHHHHHH-HHHhhcCCeEEEEe-----Cc--eeEeecCcceEEEEc---CCceEE---- Confidence 22222223322 12344467887766 67777888764431 11 245677777666410 000000 Q ss_pred HHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcC----ccccccCc Q lcl|NC_011269. 226 DLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRP----TAWATRGA 301 (867) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~ 301 (867) .| -.+.++++...-|-|+.... +----.|. T Consensus 141 ---------------------------------------~~-------~~~~~~~~~~~eiih~~~~~~~~~~~~~~~G~ 174 (403) T protein:vir:10 141 ---------------------------------------KF-------IFNNQINYRVDEIIFIKDNSYVCGTNSQISGQ 174 (403) T ss_pred ---------------------------------------EE-------EecCceeecccceEEecccccccCCCCCcccc Confidence 00 00112222233345554211 11223588 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhh-cc--hhhhhhhhh Q lcl|NC_011269. 302 PHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLA-AD--FRLMVHNFG 378 (867) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~~~~ 378 (867) +.+.-+.++|-.-....+......+.-..|--++++.. .-+.++.+.+|+.|+.... ++ ...+|-.-| T Consensus 175 s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~---------~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g 245 (403) T protein:vir:10 175 SRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDE---------ILNKKLRERKQEELQLDYNPSTGQSSVLILDGG 245 (403) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC---------CCCHHHHHHHHHHHHHHhCCcccCcceeecCCC Confidence 88777777777666666655666666666765666542 2377888899998877664 33 234444556 Q ss_pred eeeeeccccCc--cCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHHH Q lcl|NC_011269. 379 LKVENVFGRES--VPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVA 456 (867) Q Consensus 379 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~ 456 (867) ++++.++-.-+ -.-+-+-.++..++|.+++||...++.+|+++++... ...|+..-+.-+..+|++.+.+.+ T Consensus 246 ~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e~~--~~~f~~~tl~P~~~~ie~~l~~~L---- 319 (403) T protein:vir:10 246 MKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGNNANIRPN--IELFYYMTIIPMLNKLTSSLTFFF---- 319 (403) T ss_pred ceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCHHHH--HHHHHHHHHHHHHHHHHHHHHHhc---- Confidence 66665542111 1111133355668899999999999975554433322 233444444445555555554443 Q ss_pred Hhhcccchheehhhccccchhhh--hhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccC Q lcl|NC_011269. 457 EAQGHYDYDLKGGVRVPIYREIV--EYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLA 534 (867) Q Consensus 457 e~q~~~d~~~~~~~~~~~~rd~~--~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p 534 (867) .+++.+-++.+ +++ +.|.++ + ...+.. .-.-+..+.-+...+ +. |+-++. . T Consensus 320 ------~~~~~~d~~~~---~~l~~D~~~~~-~-~~~~~~-------~~G~lT~NE~R~~~g-l~-------pi~~~~-~ 372 (403) T protein:vir:10 320 ------GYKITPNTKEV---AALTPDKEAEA-K-HLTSLV-------NNGIITGNEARSELN-LE-------PLDDEQ-M 372 (403) T ss_pred ------Cceeeeccchh---hhcccCHHHHH-H-HHHHHH-------hCCCcCHHHHHHHhC-CC-------CCCccc-c Confidence 23343333222 122 122211 1 111110 001011110000000 00 000000 0 Q ss_pred CCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 535 VNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 535 ~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) ....+++.... ........+... +..+.. .+ T Consensus 373 d~~~~p~n~~~-------~~~~~~~~e~~~------~~~~~~--g~ 403 (403) T protein:vir:10 373 NKIRIPANVAG-------SATGVSGQEGGR------PKGSTE--GD 403 (403) T ss_pred ccccccccccc-------ccccCCCCcCCC------CCCCcC--CC Confidence 00001100000 000000000000 000000 00 No 110 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=94.96 E-value=0.0014 Score=36.29 Aligned_cols=325 Identities=13% Similarity=0.110 Sum_probs=152.9 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhcc Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATH 123 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 123 (867) ||-+- ..-.++. .+...+...-+.-++.+-|. ||-++.+ -+..+..|.--.|- | T Consensus 1 ~~~~~---------~~~~~~~--~~~~~~~~~~~~~~~~~~p~---~v~~~~~-----------~~~~~~~~~~~~~~-~ 54 (344) T protein:vir:56 1 MSKKK---------GKTPQPA--AKTMTASAPKMEAFTFGEPV---PVLDRRD-----------ILDYVECISNGRWY-E 54 (344) T ss_pred CCCCC---------CCCCchh--hHHhhcCCCceEEEEcCCce---eecCcch-----------hhhHHHhhhcCccc-c Confidence 21110 0001111 22333333334666666663 3322221 11222112111121 2 Q ss_pred chHH--HHHHhhhhccccc--ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehh Q lcl|NC_011269. 124 DLVP--LLIDIYSKFPVVG--MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSE 199 (867) Q Consensus 124 ~~~~--~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 199 (867) |-|. -|..+++.=+... +.+.. +-+...| . =.+.|.-.+| . ++...|+.-|+.+-+..-|.. |.+-++ T Consensus 55 pp~~~~~la~~~~a~~~h~s~i~~k~-n~l~~~~-~---Pnp~~t~~~f-~-~~~~d~ll~Gnay~~~~rn~~-G~~~~L 126 (344) T protein:vir:56 55 PPVSFTGLAKSLRAAVHHSSPIYVKR-NILASTF-I---PHPWLSQQDF-S-RFVLDFLVFGNAFLEKRYSTT-GKVIRL 126 (344) T ss_pred CCCCHHHHHHHHhhhhhhCccceehh-hhHHhhc-C---CCCCCCHHHH-H-HHHHHHHhcCCeEEEEEECCC-CcEEEE Confidence 2111 1222222222111 11110 0000100 0 0122333344 2 233567777888877776755 456788 Q ss_pred eecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCC Q lcl|NC_011269. 200 EILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGL 279 (867) Q Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 279 (867) ..|.|.+|++... +++ +.+ ....|+.+ T Consensus 127 ~pl~~~~v~~~~~----~~~-------------------------------------------~~~------~~~~g~~~ 153 (344) T protein:vir:56 127 ETSPAKYTRRGVE----EDV-------------------------------------------YWW------VPSFNEPT 153 (344) T ss_pred EEeCCceeEEeec----CCE-------------------------------------------EEE------EecCCeEE Confidence 9999999987621 000 000 01124445 Q ss_pred cccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHH Q lcl|NC_011269. 280 DISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVR 359 (867) Q Consensus 280 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 359 (867) .++..-|-||.+-.+.-.-.|.|-++-+.+++...++........-+.-..|=-|+++-+ . .-++++.|.+| T Consensus 154 ~~~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d-----~---~ls~e~~~~lk 225 (344) T protein:vir:56 154 AFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD-----A---VQDRNDIEMLR 225 (344) T ss_pred EEcCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-----C---CCCHHHHHHHH Confidence 566667888887554445579999988888887655544444444444344555555431 1 24788899999 Q ss_pred HHHHHhhhcc-hhhhhhh------hheeeeeccccCccCchhHHHHHHH----HHHHHhhccchhhhcC--CCccceehh Q lcl|NC_011269. 360 DDMQSLLAAD-FRLMVHN------FGLKVENVFGRESVPNLDADYDRIE----RKLLQAWGIGEALISG--GTGGAYASS 426 (867) Q Consensus 360 ~~~~~~~~~~-~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~--g~~~~~~~~ 426 (867) +-|+..-..+ |+-||-+ =|+++.-++-. ..|++|.++| .+|++++||.-.|+.- ..++.|+++ T Consensus 226 ~~~~~~~g~~~~r~l~l~~p~g~~~G~~~~pis~~----~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~ 301 (344) T protein:vir:56 226 ENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEV----ATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDI 301 (344) T ss_pred HHHHHhcCCCCccceEEecCCCCccceeEEEcCCC----hHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccH Confidence 9777654444 4444332 47777766543 3556666655 4799999999998842 234567765 Q ss_pred h-hhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhccc-chheehhhccccchhh Q lcl|NC_011269. 427 A-LNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHY-DYDLKGGVRVPIYREI 478 (867) Q Consensus 427 ~-~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~-d~~~~~~~~~~~~rd~ 478 (867) . ....|+..-++=++.+| .|++.+. +..++|.-=+|+-=|+ T Consensus 302 eq~~~~f~~~tL~Pl~~~i-----------e~~n~~l~~~~~~F~~y~l~~~~~ 344 (344) T protein:vir:56 302 EKVAKVFVRNELIPLQDRI-----------REINGWIGQEVIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHHHHHHHHHHH-----------HHHHhhhccccccCCCccccccCC Confidence 3 22222322233333333 3344422 2222222222221122 No 111 >protein:vir:572 Length: 506 # NCBI annotation: unknown # Family: family:all:6660 # MgeID: mge:13 # MgeName: SPBc2 # Cross-refs: genbank:acc:NP_046607;genbank:gi:9630180;genbank:GeneID:1261432 Probab=94.79 E-value=0.0035 Score=34.05 Aligned_cols=421 Identities=17% Similarity=0.252 Sum_probs=155.7 Q ss_pred hCCCchh--hhHHHHHHHHHHHHHhhccchHHHHHHhhhhcccccce-----ecccchhHHHHHHHHhhcccccHHHHhH Q lcl|NC_011269. 98 GIPFNVE--DEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGME-----FDSKDPLIKTFYEDLFFGEDLNYLEFLP 170 (867) Q Consensus 98 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 (867) |+--+-- +-+|++. -..+ |.|- |-+.+|-| |++|- -+..-+-+|-+|. ..=.|.+-|- T Consensus 1 mvTl~K~~i~~E~~~~---~lN~-Y~TY--~~~F~~GF----i~~~~~NG~v~~i~~~~L~~~F~-----NPD~~~~~I~ 65 (506) T protein:vir:57 1 MVTLNKVDIESEEYKQ---MLND-YSTY--TSTFASGF----ISNMFSNGIVTEIEAEQLKNYFS-----NPDEFQEEIE 65 (506) T ss_pred CceeechhccHHHHHH---HHhh-hhHH--HHHHHHHH----HHHhhcCCceeeeeHHHHHhhhc-----ChHHHHHHHH Confidence 2111111 1122211 1111 2222 22222211 11111 1111222332221 1111222222 Q ss_pred HHHHHHHhhhhhhcchhhhhhhccce--------------ehheecCcceeehhhhhhhcchHHHHH------------- Q lcl|NC_011269. 171 DQFAREYFTVGEVTSLAHFNESLGVW--------------SSEEILNPDMLRVSRSMFVQRERVQLM------------- 223 (867) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~------------- 223 (867) |..---|..-|+++-|-.+-+++--. +....||=-.-+|+..-+..+=++|+. T Consensus 66 ~L~~Y~YI~~~~i~QL~~LI~aLP~L~Y~I~~~~k~K~~~~~iS~lN~~L~Kv~HK~LTRDLL~Q~A~aGTLvG~WLG~~ 145 (506) T protein:vir:57 66 DLAQYFYISTAEIHQLFELIEALPTLNYKIDSFNKVKSSDKHISLLNKSLHKVKHKRLTRDLLKQVATAGTLVGIWLGDA 145 (506) T ss_pred HHHHHhhhhcchHHHHHHHHHhcCCcceeehhhhhccchhhHHHHHHHHHHHHHHHHHHHHHHHHhhccCceeEeeecCC Confidence 31111233445554443333322110 001111111111111111111111110 Q ss_pred ----------HHHHHhhccccccccccccccccc----cchhhhhhhhhHHHH--------HHhchHHHhhhccCCCCcc Q lcl|NC_011269. 224 ----------VKDLVDHLRQGPTTAGGNMSTVEE----TPSEREQRMREFQDL--------QRRYPEIIQAAMQNDGLDI 281 (867) Q Consensus 224 ----------~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~ 281 (867) +|-..-.+|+ -|.|-.|-. |--.-+||+.-|.+| |++|=|--++.+ ---|++ T Consensus 146 k~PY~~iF~~iKYVFP~~R~-----~G~~V~VvD~~~F~~~~~~~R~~~~~~LSP~I~~~~Y~~~~~~~~~~R-~~~LP~ 219 (506) T protein:vir:57 146 KSPYPFIFDEIKYVFPSFRR-----NGDWVCVVDMELFTKYKDDQRNELLKSLSPYIKQSDYENFMKDREKYR-FKELPQ 219 (506) T ss_pred CCcchhhhhhhhhhcccccc-----CCceEEEEehHHhhhhhHHHHHHHHHhhhhhhhhhhhhhHhhhHHhhh-hhhccc Confidence 0111111111 111111100 111123444433333 222332111111 123566 Q ss_pred cHHHHHHhh--hcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCc--C-CCCHHHHH Q lcl|NC_011269. 282 SEALISRVV--NRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEP--W-IPDQGELD 356 (867) Q Consensus 282 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~ 356 (867) +.-++-|+- .|+. .-|+|.+-...-.+.-+.+|+.-++.||+....-+-++|+|+.+ +.||- | +|.--- . T Consensus 220 ~rT~~~R~~TL~RNQ---~LG~~~~T~~L~Dv~HK~KLkD~E~SIA~KII~A~AVL~~~~~~-~Ngeyt~~K~~~a~K-~ 294 (506) T protein:vir:57 220 ERTFPLRTGTLKRNQ---GLGTSWVTPGLYDVLHKKKLKDVERSIANKIINAVAVLTIGTDK-GNGEYTNMKLPKAVK-Q 294 (506) T ss_pred ccchhheeeeecccc---cccccccchhHHHHHHHHHHHHHHHHHHHHHhhhheeeeeeccc-CCcccccccchHHHH-H Confidence 666665542 2322 22444444444567789999999999999999999999999743 44442 2 332100 0 Q ss_pred HHHHHHHHhhhc---chh--hhhhhhheeeeeccccCccCchh-HHHHHHHHHHHHhhccchhhhcCCCccceehhhhhH Q lcl|NC_011269. 357 EVRDDMQSLLAA---DFR--LMVHNFGLKVENVFGRESVPNLD-ADYDRIERKLLQAWGIGEALISGGTGGAYASSALNR 430 (867) Q Consensus 357 ~~~~~~~~~~~~---~~~--~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 430 (867) .|---.+++|.- |=. +-+-.|+ +.-|-+-+--.|| ++||-|-.+|-+|.||+++|++ |+|++||+|.+|. T Consensus 295 Ki~~GVK~ALEK~~KDGv~~vs~PDFA---~~~FP~vK~~~LD~~K~D~I~~DI~~A~GlS~~L~N-G~~GNYAts~LNL 370 (506) T protein:vir:57 295 KIHGGVKTALEKNQKDGVTVVSIPDFA---DINFPDVKADGLDGAKFDHINSDIQSAYGLSGSLLN-GDGGNYATSSLNL 370 (506) T ss_pred HHHHHHHHHHhcccccCeEEEeccccc---ccccccccccCCCchhhcccchhhhhhhccchheec-CCCcceeeeechH Confidence 111133333321 100 0000110 1112222334567 8999999999999999999999 9999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhh-HHH-HHhhc-----ccchheehhhccccchhhhhhhhhhhh---hHhhhh-hhhhhh Q lcl|NC_011269. 431 EFVTQIMTGFQNALKRHIRRRC-EVV-AEAQG-----HYDYDLKGGVRVPIYREIVEYDEETGQ---EYIRKV-PKLLIP 499 (867) Q Consensus 431 ~~~~~~~~~~~~~l~~~~r~~~-~~i-~e~q~-----~~d~~~~~~~~~~~~rd~~~~k~e~~k---~~~r~~-~k~i~~ 499 (867) |..--++=++-.-||+.+-..+ ..| -+.|+ .||++ .-++.||+|.- +++..- .+-.+. T Consensus 371 D~FYKrIGV~~E~IEqEvY~~L~~lvL~~~~~~NY~~~Y~KD-----------~Pl~~~~K~D~LIKL~~~G~S~K~V~D 439 (506) T protein:vir:57 371 DTFYKRIGVLMEDIEQEVYQKLFNLVLPAAQKDNYYMNYDKD-----------KPLTLKEKMDILIKLNDKGWSIKHVVD 439 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCceeEeeCCC-----------CccchhhhhchheeecccCccHHHHHH Confidence 9988885555556676665555 333 33443 12211 11334444321 121110 000000 Q ss_pred hhccccccccchhhhhhh-----hhhhhhceee-eeccccCC-CcccccchhhhhhHHHHHHHHhhccccccccccccc Q lcl|NC_011269. 500 EIKFSTLNLRDEAQERAF-----IAQLKGMGVP-VSDKTLAV-NIDMKFDQELERQADETVQKLMATAQAMKKVQDLCD 571 (867) Q Consensus 500 ~i~~~~~~Lr~e~~~~~~-----v~qL~~~~~p-itd~t~p~-tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p 571 (867) .+.+ +.-|..-++. -.+|+.+-.| .++-+..+ .+--+.| ..+ .-.+++........ +.+- T Consensus 440 nl~G----vS~E~Y~E~tlYE~E~LKL~EKI~P~~~s~~~tGN~vG~P~~---~~~--~~D~Tv~Satsngn---dnpi 506 (506) T protein:vir:57 440 NLAG----VSWESYLEQTLYETEELKLQEKIRPYQTSYTFTGNEVGRPNE---GNK--NNDNTVKSATSNGN---DNPI 506 (506) T ss_pred hhhc----cchHHHHHHHHHHHHHhhHHhhcCcccccceecccccCCCCC---CCC--cccchhhhcccCCC---CCCC Confidence 0110 0111111111 1122222211 11111111 1111111 111 11112221111100 0010 No 112 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=94.54 E-value=0.0041 Score=33.65 Aligned_cols=374 Identities=6% Similarity=-0.004 Sum_probs=152.1 Q ss_pred eccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh-cccccceeccc-chhHHHHHHHHhh Q lcl|NC_011269. 82 IAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK-FPVVGMEFDSK-DPLIKTFYEDLFF 159 (867) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~~~~ 159 (867) |++=.+ ++.--+..+.... ....+..++...|-.++.|-.||++.++ ..-..++.-.+ ++.+++-=.+..+ T Consensus 1 MGlf~~---~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~~~~~~~~lL 73 (395) T protein:vir:98 1 MGILDF---FSFKKSGTLSDDD----SGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLTENQKDWLYWI 73 (395) T ss_pred Ccchhh---hcCCCcccccccc----cchhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCcccccchHHHHH Confidence 211100 0000011222211 1224555777777778999999998865 44444443222 2333321111122 Q ss_pred ----cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccc Q lcl|NC_011269. 160 ----GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGP 235 (867) Q Consensus 160 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (867) -+..+-.+|+..++ ..++.-|+++-+.+.+... .+.+.++.+ ..+. +... ..- T Consensus 74 ~~~PN~~~t~~~f~~~~~-~~lll~Gnayi~~~~~~~~-------~~~~~~~~~--------~~~~---~~~~----~~~ 130 (395) T protein:vir:98 74 NTKANPNQSASQFWVEVI-QKLLVDGETLIFVIPGKGI-------YVADSFTQD--------KKIS---GSQF----KVS 130 (395) T ss_pred hhcCCCCCCHHHHHHHHH-HHHhhcCceEEEEEeCCce-------ecCCccccc--------cccc---Cccc----cee Confidence 23455678888866 7888889987665544321 111111111 0000 0000 000 Q ss_pred cccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHH Q lcl|NC_011269. 236 TTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEE 315 (867) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (867) +.+ .|. -...++..-|-|+++....+...+.++ +.....++.. T Consensus 131 ~~~-------------------~~~----------------~~~~~~~~evih~k~~~~~~~~~~~~~-~~~~~~~~~~- 173 (395) T protein:vir:98 131 RVQ-------------------GQT----------------YEKTFTFDQVIYLKNDNSDLMSKVESL-WEEYGELLGH- 173 (395) T ss_pred eec-------------------Cce----------------eeeEecCccEEEecCCCCCccccccch-hhhHHHHHHH- Confidence 000 000 012234445778887666665555554 3344444321 Q ss_pred HHHHHHHHHHhhhhchhhhhhhcccccCCCCcC-CCCHH---HHHHHHHHHHHhhhcc-hhhhhhhhheeeeeccccCc- Q lcl|NC_011269. 316 SLNAAQDAVADRLYSPLVLATLGIEDMGDGEPW-IPDQG---ELDEVRDDMQSLLAAD-FRLMVHNFGLKVENVFGRES- 389 (867) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~---~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~- 389 (867) .+..+..+-+.|+.. -.++-|+ ..+.+. +++.+ .+++..+++.....+. ...++-..|++.+-+...-. T Consensus 174 ~~~~~~~~~~~~~~~--~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~ 248 (395) T protein:vir:98 174 VINNQKIANQIRFTM--IPPKDKV---RERAQENSDGGRQSKSDKDFFKRTVEKIRTESVVGIPVTANTNYEEYGSKNTG 248 (395) T ss_pred HHHHHHHHHHHHHhh--ccccccc---cccccccCCcHHHHHHHHHHHHHHHhhhhcCCcceeecCCCceeEeccccccc Confidence 122222222333321 1111121 111222 23322 2222333344443333 44555567777777754433 Q ss_pred cCchh-HHHHHHH----HHHHHhhccchhhhcCCCccceeh-hhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccc Q lcl|NC_011269. 390 VPNLD-ADYDRIE----RKLLQAWGIGEALISGGTGGAYAS-SALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYD 463 (867) Q Consensus 390 ~~~~~-~~~~~~~----~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d 463 (867) .++++ .+|.+++ ++|.+++||...+++ | +|++ .+....|++.-++-+...|++++.+.+=...|..... T Consensus 249 ~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~-~---~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~kll~~~~~~~g~- 323 (395) T protein:vir:98 249 AVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH-G---DIADNQKNYELLLEGPIESLITNIVDGLEYAIFDKSETLQGS- 323 (395) T ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhc-C---CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhcCcc- Confidence 23443 3555554 579999999999995 3 4664 4556677777788888888888887663333432221 Q ss_pred hheehhhccccchhhhhhhhhhhhhHhhhh---hhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccc Q lcl|NC_011269. 464 YDLKGGVRVPIYREIVEYDEETGQEYIRKV---PKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMK 540 (867) Q Consensus 464 ~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~---~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme 540 (867) .+-+..+...|..+.-+.+ +-.++.. ...+-. .+...-+.+ ..+...-+..-..++.+ .++..+.+ T Consensus 324 ---~f~~~~l~~~d~~~~~~~~-~~~~~~G~~T~NE~R~--~~g~~Pi~~---~~gD~~~~~~n~~~~~~--~gge~~~~ 392 (395) T protein:vir:98 324 ---FIKVTGLKNYDLFSISNQA-DKLISSGFVFIDEVRE--EIGLPELPD---GLGKVLYMTKNYESVLE--RGGEVDEE 392 (395) T ss_pred ---eeeehhhhccCHHHHHHHH-HHHHhCCCcCHHHHHH--HhCCCCCCC---CCCceeeecccceeccc--ccCCCCCC Confidence 1122222222222111111 1111111 000100 000000100 00000000000011111 01111111 Q ss_pred cchhhhh Q lcl|NC_011269. 541 FDQELER 547 (867) Q Consensus 541 ~E~e~e~ 547 (867) -+ . T Consensus 393 ~~----~ 395 (395) T protein:vir:98 393 VE----T 395 (395) T ss_pred CC----C Confidence 00 0 No 113 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=94.21 E-value=0.0051 Score=33.17 Aligned_cols=385 Identities=11% Similarity=0.068 Sum_probs=148.0 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh-c Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK-F 136 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 136 (867) -|+-.+.-+ -.+.. -....+....-.++.... ...++-. .=+-..+.|-.|||+.++ + T Consensus 1 Mg~f~~~~~---~~~~~-----~~~~~~~~~~~~~~~~~~-~~~~~~~------------~~~~~~~~v~~~i~~ia~~i 59 (406) T protein:vir:95 1 MGLFDRWRR---TKRKS-----KIRADTGYVGLFMSGEDV-SFLVPGY------------VRLSDNPEVRMAVHKIADLI 59 (406) T ss_pred Ccchhhhcc---ccccc-----cccccchhhhhhccCccc-CccccCH------------HHHhhcHHHHHHHHHHHHhh Confidence 111100000 00000 000000000000000000 0011100 002356888889998764 3 Q ss_pred ccccceec-ccchhHHHHHHHHhh--c----ccccHHHHhHHHHHHHHhhhhhhcch--hhhhhhccceehheecCccee Q lcl|NC_011269. 137 PVVGMEFD-SKDPLIKTFYEDLFF--G----EDLNYLEFLPDQFAREYFTVGEVTSL--AHFNESLGVWSSEEILNPDML 207 (867) Q Consensus 137 ~~~~~~~~-~~~~~~~~~~~~~~~--~----~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 207 (867) .-.++++. .+|...++-...+++ - ...+-.+|+..++ ..++.-|+...+ ...|.. |...++..|+|+.| T Consensus 60 a~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~-~~~ll~g~g~a~~~~~~~~~-g~~~~l~~i~~~~v 137 (406) T protein:vir:95 60 SSMTIYLMQNTEDGDIRIRNELSRKIDITPYSLMTRKSWMYNIV-YTMLLDGEGNSVVFPKYTAD-GLIDELVPLTPSKV 137 (406) T ss_pred ccCceEEEEecCCcceeecchHHHHHhhccCCCCCHHHHHHHHH-HHHHhcCCceEEEEEEECCC-CcEEEEEEEcCcee Confidence 32333331 122222211111111 1 1234457888866 777777775444 344444 56788889999999 Q ss_pred ehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHH Q lcl|NC_011269. 208 RVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALIS 287 (867) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (867) +|... .+ | ..-+ | ++-.++..-|- T Consensus 138 ~~~~~---~~----------------~------~~~~--------------~-----------------~~~~~~~~evi 161 (406) T protein:vir:95 138 NFLDT---PD----------------G------YQVL--------------Y-----------------GGQTFNYDEVL 161 (406) T ss_pred EEEEc---CC----------------e------EEEE--------------e-----------------ccEEEchhHEE Confidence 88621 11 0 0000 0 01124445577 Q ss_pred HhhhcCcccc-ccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhh Q lcl|NC_011269. 288 RVVNRPTAWA-TRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLL 366 (867) Q Consensus 288 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (867) |++.....+. -.|.+.+-.+-.+|.......+......+.-..|--+.++-. --+.++.+++|+.++..+ T Consensus 162 h~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~---------~l~~e~~~~~~~~~~~~~ 232 (406) T protein:vir:95 162 HFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDA---------ATAELSSEEGRNAVFKKY 232 (406) T ss_pred EeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC---------CCCHHHHHHHHHHHHHHh Confidence 8875444443 358898888888777766666666555666556655555431 125566777777666554 Q ss_pred h-cc---hhhhhhhhheeeeeccccCccCc-hhH----HHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHH Q lcl|NC_011269. 367 A-AD---FRLMVHNFGLKVENVFGRESVPN-LDA----DYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIM 437 (867) Q Consensus 367 ~-~~---~~~~~~~~~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 437 (867) . ++ -.+++..=+..++.+. .++ .|. -.+...++|.+++||...++..+++ . .+-...|+..-+ T Consensus 233 ~g~~n~~~~~v~~~~~~~~~~~~----~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~---~-~~~~~~~~~~~l 304 (406) T protein:vir:95 233 LQATEAGQPWIIPAELLEVEQVK----PLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIGEF---N-RDEYNNFINSTI 304 (406) T ss_pred ccccccCCceeecCCCccccccc----cCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCc---h-HHHHHHHHHHHH Confidence 3 23 2233322222222111 112 233 3366679999999999999953432 1 222222333334 Q ss_pred HHHHHHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhh Q lcl|NC_011269. 438 TGFQNALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAF 517 (867) Q Consensus 438 ~~~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~ 517 (867) .-+...|++.+.+.+- ...+..|++-+..+...|..+.-+ .++..+.. +-+..+.-+...+ T Consensus 305 ~P~~~~ie~~l~~~l~------~~~~~~~~fd~~~l~~~d~~~~~~-~~~~l~~~-----------G~~t~NE~R~~~g- 365 (406) T protein:vir:95 305 LPIAKGIEQELTRKLL------ISPDLYFKFNPRSLYAYDLKELAE-VGSNMYVR-----------GIMEGNEVRDWLG- 365 (406) T ss_pred HHHHHHHHHHHHHhcC------CCCCcEEEeechhhhcCCHHHHHH-HHHHHHhC-----------CCcCHHHHHHHhC- Confidence 4444444444433321 011233444333332222222111 11111111 0011100000000 Q ss_pred hhhhhhceeeeeccccCCCcccccc-hhhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 518 IAQLKGMGVPVSDKTLAVNIDMKFD-QELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 518 v~qL~~~~~pitd~t~p~tiqme~E-~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) +..++. +....+... +.++.. . .....+.... ...+-. .+ T Consensus 366 l~p~~~----------gd~~~~~~n~~~~~~~-----~----~~~~~k~g~~--~~~~~~--~~ 406 (406) T protein:vir:95 366 LSPKEG----------LSELVILENYIPLDKI-----G----DQSKLKGGDN--SGADGQ--TD 406 (406) T ss_pred CCCCCC----------cceeeeccCccchhhc-----c----cccccCCCCC--CCCCCC--CC Confidence 000000 000000000 000000 0 0000000000 000000 00 No 114 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=92.16 E-value=0.006 Score=32.77 Aligned_cols=426 Identities=13% Similarity=0.012 Sum_probs=162.2 Q ss_pred HhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHH------------HHHHHHHHHHHhhc--------------- Q lcl|NC_011269. 70 YRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEE------------LRVIRHWCRLFYAT--------------- 122 (867) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~--------------- 122 (867) |+..-.-|+|++--.=..-+.++ |.-+-++++.|.-.+ ++.++.+- .||.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~-~YY~g~~~i~~~~~~~~~~~ 78 (483) T protein:vir:12 1 MAQALIKGGNILYPSQPTQTEIF-DAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQ-EYYEQRPDIVKEPKPVDATG 78 (483) T ss_pred CccchhcCCceeecCcchhhhhh-hcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHH-HHhccccccccccccccccc Confidence 55544555777654333333221 111222333322211 11222222 35522 Q ss_pred ------------cchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhh Q lcl|NC_011269. 123 ------------HDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHF 189 (867) Q Consensus 123 ------------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (867) +++.+.++|.+..|=++. +.++.+|+...++..+. |..+ +...+.+ ++++....|.++-+--. T Consensus 79 ~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~~d~~~~~~l~~~-~~n~--~~~~~~~-~~~~~~~~G~~y~~v~~ 154 (483) T protein:vir:12 79 AVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEV-LGNR--FDDKLHS-VLTGASNKGIEWLHPYL 154 (483) T ss_pred cccccccccccccchHHHHHHHHhhhhcccCceeccCChHHHHHHHHH-Hhcc--HHHHHHH-HHHHHhhCCeEEEEEEE Confidence 466778899999887776 88999999988887765 5454 4445555 55888888888766655 Q ss_pred hhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccc--cccccccccchhhhhhhh--hHHHHHHh Q lcl|NC_011269. 190 NESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAG--GNMSTVEETPSEREQRMR--EFQDLQRR 265 (867) Q Consensus 190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~--~~~~~~~~ 265 (867) ++.+.. +..+++|.-+-+--+=-+-++.+ -++|.=-..+. -.+.|..++-.-+..-.. ..-+.... T Consensus 155 d~d~~~--~i~~~~p~~~~~v~d~~~~~~~~--------~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~ 224 (483) T protein:vir:12 155 DEEGEF--KLFRVPAEQGIPIWTDKEHEELE--------AFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLE 224 (483) T ss_pred cCCCce--EEEEEcccceEEEEcCCCCCceE--------EEEEEEEeecceEEEEEecCeEEEEEEeCCeeeeccccccc Confidence 665443 36667887654431100011111 11111000000 000000000000000000 00000011 Q ss_pred chHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHH---hhhhchhhhhhhccccc Q lcl|NC_011269. 266 YPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA---DRLYSPLVLATLGIEDM 342 (867) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 342 (867) ++++-...-.=..|+ |+.+.| ..+|.+.+-. ..+|+ +.|+.+..-++ +-+.-|.++++ |. T Consensus 225 ~~~~~~~~~~~g~vP-----vv~~~n-----n~~g~sd~e~-v~~li--Da~d~~~S~~~~~~~~~~~~~lv~~-g~--- 287 (483) T protein:vir:12 225 NSKTHFSTGSWGKIP-----FIPFKN-----NDLEISDIFM-YKTLI--DAYNRRLSDLSNTFKDSNELTYVLT-NY--- 287 (483) T ss_pred ccccccccCCCCccc-----eEEecC-----CCCCCCchhh-HHHHH--HHHHHHHHHHHHHHHHhcCceeeee-cC--- Confidence 222111000000111 222222 3467776543 44554 33443333333 22455655443 21 Q ss_pred CCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccc Q lcl|NC_011269. 343 GDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGA 422 (867) Q Consensus 343 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 422 (867) +.+++++.+.+++.. .++...=+-+++++=-.-..=.+...++.+++.|..--++-..- .+..|++ T Consensus 288 --------~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~-~~~~~~n 353 (483) T protein:vir:12 288 --------DDQELPEFKRLLRYY-----GAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFS-SDKFGSA 353 (483) T ss_pred --------CcccchhHHHhhhhc-----cccccCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCC-ccccccC Confidence 123334444443321 11111112233332111111123356677777776665443211 1111221 Q ss_pred eehhhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHhhcccc--hheehhhccccchhhhhhhhhhhhhHhhhhhhh Q lcl|NC_011269. 423 YASSALNREF----VTQIMTGFQNALKRHIRRRCEVVAEAQGHYD--YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKL 496 (867) Q Consensus 423 ~~~~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d--~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~ 496 (867) .+.+.+.+ +-++....++.++..++++++-|.++-+.-+ .+++..|.-..+++. T Consensus 354 --~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~------------------ 413 (483) T protein:vir:12 354 --PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANT------------------ 413 (483) T ss_pred --cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccceeeEEeCCCCCCCH------------------ Confidence 23344432 2233456677788888888888877654211 223333333333322 Q ss_pred hhhhhccccccccchhhhhhhhhhhhhceeeeeccc----cCCCcccccchhhhhhHHHHHHHHhhcccccccccccccc Q lcl|NC_011269. 497 LIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKT----LAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDA 572 (867) Q Consensus 497 i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t----~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~ 572 (867) ..+-+.+.++.. .++..+ ++...+ .+.+.++...|.......... .......... T Consensus 414 ---------------~~~a~~~~kl~G---iiS~et~~~~~~~v~d--~~~E~~ri~~E~~~~~~~~~~-~~~~~~d~~~ 472 (483) T protein:vir:12 414 ---------------ELQVQTAQQSMG---IVSHETVLENHPFVED--LQAELERIEQEQMEYNKQLPN-LDDGGADGAQ 472 (483) T ss_pred ---------------HHHHHHHHHHhc---cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccc-ccccccCCcc Confidence 111111222210 111111 111111 122333333333222211110 0000000000 Q ss_pred cCCCCCcccccccccccc Q lcl|NC_011269. 573 QNLPYPPELAQHLQSTLA 590 (867) Q Consensus 573 ~g~P~pp~~aQ~p~~t~~ 590 (867) .. .+....... T Consensus 473 ~~-------~~~~~~e~e 483 (483) T protein:vir:12 473 QQ-------ERSNNKESE 483 (483) T ss_pred cC-------CCCCcccCC Confidence 00 000000000 No 115 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=91.02 E-value=0.018 Score=30.17 Aligned_cols=414 Identities=13% Similarity=0.095 Sum_probs=147.4 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhc- Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYAT- 122 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 122 (867) |+- --|--|-+|+-.+.......+.|=...++ +..++.+-+.|... T Consensus 1 ~~~------------------------------~~~~~~~~~~~~~~~~~~i~~~i~~~~~~---~~r~~~~~~yy~g~~ 47 (453) T protein:vir:73 1 MNL------------------------------KPIKLMTYSRDEEITDKVVNDFMKKHQEE---VERYEYLGNMYKGIM 47 (453) T ss_pred Ccc------------------------------ccceeeeccccccCCHHHHHHHHHHHHHH---HHHHHHHHHHhcccc Confidence 111 11222333332222222222222222222 22333332323323 Q ss_pred ------------------cchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhh Q lcl|NC_011269. 123 ------------------HDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEV 183 (867) Q Consensus 123 ------------------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (867) +++.+.++|.+.-|=++. +.++++|+...++..+.+= +-|+...+.+ ++++...-|.+ T Consensus 48 ~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~--~n~~~~~~~~-~~~~~~~~G~~ 124 (453) T protein:vir:73 48 EISSQKAKDSWKPDNRLTNNFAKYIVDTFVGYFNGIPIKKTHDDKSVLEAMQLFDN--LNDMEDEESE-LAKIACVYGRA 124 (453) T ss_pred chhcCCCCCccCccceeecchHHHHHHHhhhhhcccCceeecCChHHHHHHHHHHH--hcChhHHHHH-HHHHHHhcCeE Confidence 346678899998887776 8899999888887776643 4456666666 55888888887 Q ss_pred cchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHH Q lcl|NC_011269. 184 TSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQ 263 (867) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 263 (867) +-+--.++.+.. +..+++|+-+-+- | ++.+.-. ++-+++-.=..+| ...-.=-|+.+ ..+ |. T Consensus 125 ~~~v~~d~~~~~--~i~~~~p~~~~~v---~--dd~~~~~---~~~~i~~~~~~~~-~~~~~vyt~~~-i~~---~~--- 186 (453) T protein:vir:73 125 YELMYQNESTES--EVIYCSPLNVFMV---Y--DDSIKQK---PLFAVYYGFDEEG-NLSGTVYTLLE-TIS---IT--- 186 (453) T ss_pred EEEEEeCCCCce--EEEEEcccceEEE---E--eCCCCce---eEEEEEEEEecCc-eEEEEEEeCCe-EEE---EE--- Confidence 655544554333 2455676554221 1 1111100 0000110000011 00000001110 000 00 Q ss_pred HhchH--HHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHH-Hhhhhchhhhhhhccc Q lcl|NC_011269. 264 RRYPE--IIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAV-ADRLYSPLVLATLGIE 340 (867) Q Consensus 264 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 340 (867) ..--+ +++....+-| .+| |+++.| .++|.+.+-+ .+.|+-.-...-.+-+. .+.+..|.++++ |. T Consensus 187 ~~~~~~~~~~~~~~~~g-~vP---vv~~~n-----~~~g~s~~~~-v~~liDa~~~~~S~~~~~~~~~~~~~l~~~-g~- 254 (453) T protein:vir:73 187 GKAGEVKFGESTYNVYS-DLP---IVEYNF-----NEERQSIFEP-VHSLINSYNKVTSEKANDVEYFSDQYLVFL-GA- 254 (453) T ss_pred ecCCceEEccceeccCC-cee---EEEecC-----CCCCCcchhh-HHHHHHHHHHHHHHHHHHHHHhccceeeee-cC- Confidence 00000 0000000000 011 222322 2467776643 34444222221122221 247777877775 31 Q ss_pred ccCCCCcCCCCHHHHHHHHHHHHHhhhcc-hhhhhhhhhe-----eeeeccccCccCchhHHHHHHHHHHHHhhccchhh Q lcl|NC_011269. 341 DMGDGEPWIPDQGELDEVRDDMQSLLAAD-FRLMVHNFGL-----KVENVFGRESVPNLDADYDRIERKLLQAWGIGEAL 414 (867) Q Consensus 341 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 414 (867) .++.+++..++. ....- +-...+..+. +++++=-....=.+...++.+++.|...-++-. + T Consensus 255 --------~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~-~ 321 (453) T protein:vir:73 255 --------EVDEEDAKNIKD----NRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAAN-I 321 (453) T ss_pred --------CCCchhhhcccc----cccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcc-c Confidence 112233333222 11000 0000111111 111110000011122445566666655433322 1 Q ss_pred hcCCCccceehhhhhHHHHH----HHHHHHHHHHHHHHhhhhHHHHHhhcccc-----hheehhhccccchhhhhhhhhh Q lcl|NC_011269. 415 ISGGTGGAYASSALNREFVT----QIMTGFQNALKRHIRRRCEVVAEAQGHYD-----YDLKGGVRVPIYREIVEYDEET 485 (867) Q Consensus 415 ~~~g~~~~~~~~~~~~~~~~----~~~~~~~~~l~~~~r~~~~~i~e~q~~~d-----~~~~~~~~~~~~rd~~~~k~e~ 485 (867) -.+..|. .+.+.+.+.- ++.-..+..++..++++++-|.++.+... .++++.|+...+++..+.-.=+ T Consensus 322 ~~~~~gn---~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~ 398 (453) T protein:vir:73 322 SDENFGN---SSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAWKDIEYTFTRNEPKDIKEQAETA 398 (453) T ss_pred CcccccC---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHH Confidence 1111121 2334444333 33455667777888888888877644221 1223333333333321111100 Q ss_pred hhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhccc-ccc Q lcl|NC_011269. 486 GQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQ-AMK 564 (867) Q Consensus 486 ~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~tae-t~k 564 (867) .|+. + + + +.+..+.. ++...+. +.|.++...|+...+..... .+. T Consensus 399 ~k~~-------------g--i-i----s~et~~~~------------~~~~~d~--~~E~~ri~~E~~~~~~~~~~~~~~ 444 (453) T protein:vir:73 399 NILK-------------G--I-T----SEETALSV------------ISVIPDV--QAEMEKIKKKKLLQLSLTRTSNLV 444 (453) T ss_pred HHHh-------------c--c-C----cHHHHHHh------------CCCCCCH--HHHHHHHHHHHHHHHHHHHhccCC Confidence 0000 0 0 0 01111111 1111111 11222222222222211111 111 Q ss_pred cccccccccCC Q lcl|NC_011269. 565 KVQDLCDAQNL 575 (867) Q Consensus 565 kvq~~~p~~g~ 575 (867) . ......++ T Consensus 445 ~--~~~~~~~~ 453 (453) T protein:vir:73 445 R--MKQMRGNL 453 (453) T ss_pred c--chhhhcCC Confidence 1 11111122 No 116 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=90.35 E-value=0.021 Score=29.75 Aligned_cols=426 Identities=13% Similarity=0.031 Sum_probs=159.0 Q ss_pred hhHHHHHHHHhcccccccceeeccchhhhhhhh--------------hHHhhCCCchhhhHHHHHHHHHHHHHhhc---- Q lcl|NC_011269. 61 EANRQRLASYRKQGNFGSNMQIAMPKIRQPLGT--------------LADKGIPFNVEDEEELRVIRHWCRLFYAT---- 122 (867) Q Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 122 (867) -.=||++...+..-.-|+|++--.=+--+-.+. ...+.|.... ++++.++.+- .||.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~---~~~~r~~~l~-~YY~g~~~I 76 (492) T protein:vir:94 1 MQFIQLISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHL---EKLPEISIGQ-EYYEQRPDI 76 (492) T ss_pred ChHHHHHHHHHHHHhcCCceeecCccchhhhhhcccccCCchhhHHHHHHHHHHHHH---HHHHHHHHHH-HHhcccccc Confidence 222333333333333456776543333222221 1111111111 1122223322 35532 Q ss_pred -----------------------cchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHh Q lcl|NC_011269. 123 -----------------------HDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYF 178 (867) Q Consensus 123 -----------------------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (867) +++.+.++|.+..|=++. +.++.+|+...++.+++ |..++ ...+.+ ++++.. T Consensus 77 ~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~~~d~~~~~~l~~~-~~n~~--~~~~~~-~~~~a~ 152 (492) T protein:vir:94 77 VKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEV-LGNRF--DDKLHS-VLTGAS 152 (492) T ss_pred ccccccccccccccccccccccccchHHHHHHHHHhhhcccCceeccCchHHHHHHHHH-HhccH--HHHHHH-HHHHHh Confidence 467778899999887766 88999999888888776 54544 455555 558888 Q ss_pred hhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhcccccccccc--ccccccccchhhhhhh Q lcl|NC_011269. 179 TVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGG--NMSTVEETPSEREQRM 256 (867) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 256 (867) ..|.++-+--.++.+.. +..+++|.-+-+--+--.-++.+ -++|.=-..... .+-|..++-.-+.... T Consensus 153 ~~G~a~~~v~~d~dg~~--~~~~~~p~~~~~v~d~~~~~~~~--------a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~ 222 (492) T protein:vir:94 153 NKGIEWLHPYLDEEGEF--KLFRVPAEQGIPIWTDKEHEELE--------AFIRMYKLENETKVEYWDKVTVNYYVYENG 222 (492) T ss_pred hCCeEEEEEEecCCCce--EEEEEcccceEEEEcCCCCCceE--------EEEEEEeeccceeEEEEecCeEEEEEEecC Confidence 88888766555554332 35667776543320000001100 011100000000 0000000000000000 Q ss_pred hhHHH--HHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHH---hhhhch Q lcl|NC_011269. 257 REFQD--LQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA---DRLYSP 331 (867) Q Consensus 257 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~ 331 (867) ....+ -...+.++-...-.=..|+ |+...| .++|.+.+= ....|+ +.|+.+..-.| +-+.-| T Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~g~vP-----vv~~~n-----n~~~~sd~e-~v~~li--Da~d~~~S~~~~~~~~~~~p 289 (492) T protein:vir:94 223 SLIPDYSNNLENSKTHFSTGSWGKIP-----FIPFKN-----NDLEISDIF-MYKTLI--DAYNRRLSDLSNTFKDSNEL 289 (492) T ss_pred eeeeccccccccccccccccCCCccc-----eEEecC-----CCCCCCchH-HHHHHH--HHHHHHHHHHHHHHHHhcCc Confidence 00000 0000111100000000011 122222 245776553 234443 34444333333 335556 Q ss_pred hhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccC--c-----hhHHHHHHHHHH Q lcl|NC_011269. 332 LVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVP--N-----LDADYDRIERKL 404 (867) Q Consensus 332 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-----~~~~~~~~~~~~ 404 (867) .++++ |. +.+++.+.+++++..- .++++.-|.. +.+ + +...++++++.| T Consensus 290 ~lv~~-g~-----------~~~~~~~~~~~~~~~~-----------~~~~~~~~~~-~~l~~~~~~~~~~~~~~~l~~~I 345 (492) T protein:vir:94 290 TYVLK-NY-----------DDQELPEFKRLLRYYG-----------AIKVSDNGGV-DTIQVEVPVENSKKYLDELYQKI 345 (492) T ss_pred eeeee-cC-----------CcccchhhHHHHhhcc-----------ceecCCCCcc-eeEeccCCHHHHHHHHHHHHHHH Confidence 55543 31 1233344444433211 1222221111 111 2 225567777776 Q ss_pred HHhhccchhhhcCCCccceehhhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHhhccc--chheehhhccccchhh Q lcl|NC_011269. 405 LQAWGIGEALISGGTGGAYASSALNREF----VTQIMTGFQNALKRHIRRRCEVVAEAQGHY--DYDLKGGVRVPIYREI 478 (867) Q Consensus 405 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~--d~~~~~~~~~~~~rd~ 478 (867) ..--++-.. ..+..|++ .|.+.+.+ +-++....++.++..++++++.|.++-+.- ..+++..|+-..+++. T Consensus 346 ~~~s~~p~~-~~~~~~~n--~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~ 422 (492) T protein:vir:94 346 MLFGQAVDF-SSDKFGSA--PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANT 422 (492) T ss_pred HHHhCCcCC-CccccccC--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeEEecCCCCCCH Confidence 665544321 11111222 23344433 334456677788888888888887765421 1233333333333322 Q ss_pred hhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccc----cCCCcccccchhhhhhHHHHHH Q lcl|NC_011269. 479 VEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKT----LAVNIDMKFDQELERQADETVQ 554 (867) Q Consensus 479 ~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t----~p~tiqme~E~e~e~k~~E~l~ 554 (867) ...-+.+.++.. .++..+ ++...+ .+.|.++...|+.. T Consensus 423 ---------------------------------~e~~~~~~kl~g---iiS~et~~~~l~~v~d--~~~E~eri~~E~~~ 464 (492) T protein:vir:94 423 ---------------------------------ELQVQTAQQSMG---IVSHETVLENHPFVED--LQAELERIEQEQME 464 (492) T ss_pred ---------------------------------HHHHHHHHHHhc---cCchHHHHHhCCCCCC--HHHHHHHHHHHHHH Confidence 111111222110 111111 111111 11222333333222 Q ss_pred HHhhcccccccccccccccCCCCCccccccccccccC Q lcl|NC_011269. 555 KLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLAL 591 (867) Q Consensus 555 tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~ 591 (867) ...... ....... .........+ ..... T Consensus 465 ~~~~~~-~~~~~~~--~~~~~~~~~~------~~e~e 492 (492) T protein:vir:94 465 YNKQLP-NLDDGGA--DSAQQQERSN------NKESE 492 (492) T ss_pred HHhhcc-ccccccC--CCCccccCCc------cccCC Confidence 211110 0000000 0000000000 00000 No 117 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=89.03 E-value=0.029 Score=29.03 Aligned_cols=375 Identities=7% Similarity=-0.006 Sum_probs=139.7 Q ss_pred eeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhc-ccccceecccc-hhHHHHHHHH Q lcl|NC_011269. 80 MQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKF-PVVGMEFDSKD-PLIKTFYEDL 157 (867) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~ 157 (867) |.|-.- ++.--+.-++.++ -...+...+..-|-.++.|-.||++-++- .-..+++..++ +..++-..+. T Consensus 1 Mgl~d~-----~~~~~~~~~~~~~----~~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~~~~~~~~ 71 (395) T protein:vir:96 1 MGILDF-----FSFKKSGTLSDDD----SGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTENQKDWLY 71 (395) T ss_pred Ccchhh-----hcCCCCccccccc----cccchhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCccccccchHHH Confidence 222211 1111111222221 11234445666677788999999987653 22333332222 1222211111 Q ss_pred hh----cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccc Q lcl|NC_011269. 158 FF----GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQ 233 (867) Q Consensus 158 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (867) .+ .+..+-.+|+..++ ..++.-|+++-+.+-+..+ .+.+..+.+. .+ ++.... T Consensus 72 lL~~~PN~~~t~~~f~~~l~-~~lll~Gna~~~~~~~~~~-------~~~~~~~~~~-------~~--------~~~~~~ 128 (395) T protein:vir:96 72 WINTKANPNQSASQFWVEVV-QKLLVDGETLIFVIPGKGI-------YVADAFTQDK-------KL--------SGNKFK 128 (395) T ss_pred HHhhcCCCCCCHHHHHHHHH-HHHhhcCceEEEEEcCCce-------ecCCcccccc-------cc--------ccceee Confidence 22 24456678888866 7777788887665544321 0111111000 00 000000 Q ss_pred cccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHH Q lcl|NC_011269. 234 GPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMA 313 (867) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (867) .-+.. .|. +. ..++..-|-|++.....+...+.++ +.+...++. T Consensus 129 ~v~~~-------------------~~~-~~---------------~~~~~~dvih~k~~~~~~~~~~~~~-~~~~~~~~~ 172 (395) T protein:vir:96 129 VSRVQ-------------------GQT-YE---------------KIFTFDQVIYLKNDNSDLMLKVESL-WEEYGELLG 172 (395) T ss_pred eeeec-------------------cce-ee---------------eEeccCceEEecccCCccccccccc-cchHHHHHH Confidence 00000 000 00 1133344667765554444333332 224444433 Q ss_pred HHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcC-CCCHHHHHHHHHHHHHh---hhc-chhhhhhhhheeeeeccccC Q lcl|NC_011269. 314 EESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPW-IPDQGELDEVRDDMQSL---LAA-DFRLMVHNFGLKVENVFGRE 388 (867) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~---~~~-~~~~~~~~~~~~~~~~~~~~ 388 (867) . .+..+.++=|.|+. +..++-|+ ..+-.+ +.+...-+..++.++.. ..+ +...++-..|++.+.....- T Consensus 173 ~-~i~~~~~~~~~~~~--~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~ 246 (395) T protein:vir:96 173 H-VINNQKIANQIRFT--MTPPKDKV---RERAQENSDGGRQPKSDKDFFKRTIEKIRTESVVGIPVTANTNYEEYGSKN 246 (395) T ss_pred H-HHHHHHHHHHHHHH--hhhccccc---ccceeeccCchhhHHHHHHHHHHHHHHhhcCCcceEEccCCceeEecccCh Confidence 2 12222222233332 22222222 222222 22222222233322333 222 24445556777776665544 Q ss_pred ccCchh--HHHHHH----HHHHHHhhccchhhhcCCCccceehh-hhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcc Q lcl|NC_011269. 389 SVPNLD--ADYDRI----ERKLLQAWGIGEALISGGTGGAYASS-ALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGH 461 (867) Q Consensus 389 ~~~~~~--~~~~~~----~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~ 461 (867) ....+. .+|.++ .++|.+++||...++. | +|+++ +....|++.-+.-+...|++++.+.+=.-.|... T Consensus 247 ~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~-~---~~sn~e~~~~~f~~~~L~P~~~~ie~~l~~~Ll~~~e~~~- 321 (395) T protein:vir:96 247 TGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH-G---DIADNQKNYELLLEGPIESLITNIVDGLEYAIFDKSETLE- 321 (395) T ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc-C---CCccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcC- Confidence 433322 344443 4689999999999995 3 46553 3444555555666666666666654422122111 Q ss_pred cchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhh--ccccccccchhhhhhhhhhhhhceeeeeccccCCCccc Q lcl|NC_011269. 462 YDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEI--KFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDM 539 (867) Q Consensus 462 ~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i--~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqm 539 (867) .+.+-+..+...|..+. -+.++..+.....- ..++ .+...-+.+ ..+...-+..-..++.+ .++..+. T Consensus 322 ---~~~f~~~~l~~~d~~~~-~~~~~~~~~~G~~T-~NE~R~~~gl~pi~~---~~gD~~~~~~N~~~~~~--~gge~~~ 391 (395) T protein:vir:96 322 ---GSFIKVTGLKNYDLFSI-SSQADKLISSGFVF-IDEVREEIGLPELPD---GLGKVLYMTKNYESVLE--RGGEVDE 391 (395) T ss_pred ---ceeEeecchhccCHHHH-HHHHHHHHhCCCcC-HHHHHHHhCCCCCCC---CCCceeeecccceechh--ccCCCCC Confidence 11122222322332211 11222111111000 0000 000000000 00000000000011111 1111111 Q ss_pred ccch Q lcl|NC_011269. 540 KFDQ 543 (867) Q Consensus 540 e~E~ 543 (867) +.+. T Consensus 392 ~~~~ 395 (395) T protein:vir:96 392 EVET 395 (395) T ss_pred CCCC Confidence 1110 No 118 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=88.78 E-value=0.03 Score=28.91 Aligned_cols=401 Identities=13% Similarity=0.165 Sum_probs=151.3 Q ss_pred eccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhh-----------------------------------ccchH Q lcl|NC_011269. 82 IAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYA-----------------------------------THDLV 126 (867) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------------------~~~~~ 126 (867) |-+..+..=|-.+.++. .+.+.. +... +.||. .+++. T Consensus 1 ~~~e~~~~~i~~~~~~~----~~~~~~---~~~~-~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~ 72 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKH----GKFVSQ---AAEA-EKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWH 72 (471) T ss_pred CCHHHHHHHHHHHHHHH----HHHHHH---HHHH-HHHhccccccccccchhhhhcccccccccccccccccceeccchh Confidence 33333322222222221 111211 2222 12332 24578 Q ss_pred HHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcc Q lcl|NC_011269. 127 PLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPD 205 (867) Q Consensus 127 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 205 (867) +.++|....|=++. +.++++|+...++.+++ |..+++ ..+.. +++.....|+++-+-..++.+|.. ..++++|. T Consensus 73 ~~Ivd~~~~yl~G~p~~~~~~~~~~~~~l~~~-~~n~~~--~~~~~-~~~~~~~~G~~~~~v~~d~~~g~~-~~~~~~p~ 147 (471) T protein:vir:10 73 QLLLDQKKAYALTYPPTFDVDDKKVNDMIVDV-LGDDYE--RISKQ-LCVNAGNAGIAWLHVWKDASDNSF-RYACVDSK 147 (471) T ss_pred HHHHHhhhhhhcccCceeccCChHHHHHHHHH-HhcCHH--HHHHH-HHHHHhhCCeEEEEEEeeCCCCee-EEEEEccc Confidence 88999999988877 89999999988887766 544443 44444 568888899888777777665543 46777887 Q ss_pred eeehhhhhhhcchHHHHHHHHHHhhcccccc---cccccccccc-----ccchhhhhhhhhHHHHHHhchH--HHhhhcc Q lcl|NC_011269. 206 MLRVSRSMFVQRERVQLMVKDLVDHLRQGPT---TAGGNMSTVE-----ETPSEREQRMREFQDLQRRYPE--IIQAAMQ 275 (867) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~ 275 (867) -+-+= |.-. +. ++++-.+|.--+ .++....-.| +.-.-+.........+. .+.. ++... . T Consensus 148 ~~~~i---~d~~--~~---~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~-~~~~~~~~~~~-~ 217 (471) T protein:vir:10 148 EVIPI---YSKS--LD---KKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELE-TFQAISLIDTM-N 217 (471) T ss_pred ceEEE---EcCC--CC---CceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCccccccc-ccccccccccc-c Confidence 65321 2111 00 001111111100 0000000000 00000000000000000 0000 00000 0 Q ss_pred CCCC-------cccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHH---hhhhchhhhhhhcccccCCC Q lcl|NC_011269. 276 NDGL-------DISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA---DRLYSPLVLATLGIEDMGDG 345 (867) Q Consensus 276 ~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 345 (867) ++-. .+..==|+++.| ..+|++.+- ..+.|+.. |+.+.--+| +.+..|+.+++ | .+| T Consensus 218 ~~~~~~~~~~~~~g~iPvv~~~n-----~~~~~sd~e-~v~~liDa--~d~~~S~~~~~~~~~~~~~lv~~-g----~~~ 284 (471) T protein:vir:10 218 GDRSSDNSFKHDFGLVPFIPFKN-----NEIETNDLK-PIKDLVDV--YDKVFSGFVNDTDDVQEVIFVLT-N----YGG 284 (471) T ss_pred ccccccccccCCCCceeEEEecc-----CCCCCCchH-HHHHHHHH--HHHHHHHHHHHHHHhhCceeeee-c----CCc Confidence 0000 000001233333 346777653 34555432 332222222 34556655544 2 112 Q ss_pred CcCCCCHHHHHHHHHHHHHhhhcchhhhh-----hhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCc Q lcl|NC_011269. 346 EPWIPDQGELDEVRDDMQSLLAADFRLMV-----HNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTG 420 (867) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 420 (867) +. +++.+.+++. ++++. +.-+.+++++=-....=.+...++++++.|...-+.-.. ..++.| T Consensus 285 ~~-------~~~~~~~~~~-----~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~-~~~~~g 351 (471) T protein:vir:10 285 QD-------KQEFLEDLKR-----YKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNP-ETDKLG 351 (471) T ss_pred cc-------cchhHHHhhc-----CCeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCC-Cccccc Confidence 21 1122222211 11100 001112222211111122336667777777665533221 111222 Q ss_pred cceehhhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHhhcccc-hheehhhccccchhhhhhhhhhhhhHhhhhhh Q lcl|NC_011269. 421 GAYASSALNREF----VTQIMTGFQNALKRHIRRRCEVVAEAQGHYD-YDLKGGVRVPIYREIVEYDEETGQEYIRKVPK 495 (867) Q Consensus 421 ~~~~~~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d-~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k 495 (867) + .+.+.+.+ +.++....++.++..++++++.|.++-+..| .++...|+...+++.. T Consensus 352 -n--~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~d~~~i~i~f~~~~p~n~~---------------- 412 (471) T protein:vir:10 352 -N--SSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGLSDKLKIKQTWTRNSINNDT---------------- 412 (471) T ss_pred -C--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEEeCCCCCCCHH---------------- Confidence 1 12223322 2344566788888899999999987655332 3344444444444322 Q ss_pred hhhhhhccccccccchhhhhhhhhhhhhceeeeeccc----cCCCcccccchhhhhhHHHHHHHHhhccccccccccccc Q lcl|NC_011269. 496 LLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKT----LAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCD 571 (867) Q Consensus 496 ~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t----~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p 571 (867) ..-+.+.++.. .++..+ ++..-+ .+.|.++...|...... ............+ T Consensus 413 -----------------e~~~~~~kl~g---~iS~et~~~~~p~v~D--~~~E~eri~~E~~~~~~-~~~~~~~~~~~~e 469 (471) T protein:vir:10 413 -----------------EMAQVVSTLAT---ITSRENVAKSNPIVED--WQDELRLQKAEQEGRSE-KLYDMEEVEHESE 469 (471) T ss_pred -----------------HHHHHHHHHhc---cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHh-cccccCCCCCccc Confidence 11111111110 111111 111111 11222222222222111 1111111111100 Q ss_pred ccCCC Q lcl|NC_011269. 572 AQNLP 576 (867) Q Consensus 572 ~~g~P 576 (867) - . T Consensus 470 ~---~ 471 (471) T protein:vir:10 470 V---E 471 (471) T ss_pred c---C Confidence 0 0 No 119 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=88.68 E-value=0.031 Score=28.87 Aligned_cols=367 Identities=10% Similarity=0.053 Sum_probs=147.2 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcc Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFP 137 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 137 (867) -|+-+|+- +++-+.....+.+.-+..- -..|-.++.|-.|||+-+. - T Consensus 1 Mg~f~~~f--------------------------~~~~~~~~~~~~~~~~~~~------~~~a~~~~~v~~~i~~ia~-~ 47 (385) T protein:vir:95 1 MGLFDSVF--------------------------KRHSELSWMYDLEFLQDKS------KKAYLKQIALNTVVEMVAR-T 47 (385) T ss_pred Cchhhhhh--------------------------ccCcccccccchhhhhccc------hhhhhhhHHHHHHHHHHHH-H Confidence 11111110 0111111111111111100 0223356777888887765 4 Q ss_pred cccceecc--cchhHHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhh Q lcl|NC_011269. 138 VVGMEFDS--KDPLIKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRS 212 (867) Q Consensus 138 ~~~~~~~~--~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (867) |..+.|.. ++...+.-..+++. -+..+-.+|+..++ ..++.-|+++-+. +.+++.|..+..+.+.-+.+... T Consensus 48 ia~~p~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~l~l~Gna~i~~--~~~~~~~~~~~~~~~~~~~~~~~ 124 (385) T protein:vir:95 48 ISQSEFRVMKNNTKEKGTLYYLLNVRPNRNQNAVDFWQKFI-FKLIMDNEVLVVK--NDEGHFFVADDFEKEDELGLYSH 124 (385) T ss_pred HcccceeeeecCccccchHHHHHhcccCcCCCHHHHHHHHH-HHHhhcCceEEEE--ecCCCeeeccccccccccccccc Confidence 44433322 22222221222221 13445578888866 8888889877544 33444443333333322222211 Q ss_pred hhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhc Q lcl|NC_011269. 213 MFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNR 292 (867) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (867) .|.+... . +| .+.+ .++..-|-|+.+. T Consensus 125 ~~~~~~~------------------~-------------------~~-~~~~---------------~~~~~eiih~~~~ 151 (385) T protein:vir:95 125 RFTNVLV------------------N-------------------DF-EFKR---------------VFTMDDVIYLKYN 151 (385) T ss_pred cceeeee------------------c-------------------cc-ceee---------------eeccccEEEecCC Confidence 1110000 0 00 0011 1333446677765 Q ss_pred CccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhh----c Q lcl|NC_011269. 293 PTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLA----A 368 (867) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~ 368 (867) ...-...|.+.+--+-.++- .+.++. .+--.|=-++++. + ...-+++..+.+++-|+..+. . T Consensus 152 ~~~~~~~G~s~~~~~~~~i~------~~~~~~-~~~~~~~g~l~~~------~-~~~~~~e~~~~~~~~~~~~~~g~~~~ 217 (385) T protein:vir:95 152 NQKLDAFSLGLFEDYGEIFG------RMIDLQ-MLNNQIRGILKVD------A-TKFYNKEKQKELQAYIDTLFDAFQNN 217 (385) T ss_pred CCCcccccchHHHHHHHHHH------HHHHHH-HhcCCCceEEEeC------C-ccCCCHHHHHHHHHHHHHHhhhhhhc Confidence 55444567776654433331 111111 1111222222332 1 113467777888886666554 2 Q ss_pred chhhhhhhhheeeeeccccCccC-c-hhHHH----HHHHHHHHHhhccchhhhcCCCccceehhh-hhHHHHHHHHHHHH Q lcl|NC_011269. 369 DFRLMVHNFGLKVENVFGRESVP-N-LDADY----DRIERKLLQAWGIGEALISGGTGGAYASSA-LNREFVTQIMTGFQ 441 (867) Q Consensus 369 ~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~----~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~~~~~~~~ 441 (867) ...++|..-|++++.......+. + -|.+| ++..++|.+++||...+++ | +|+++. -...|++.-+.-+. T Consensus 218 ~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~-~---~~sn~e~~~~~~~~~~l~P~~ 293 (385) T protein:vir:95 218 TIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVL-G---EMADLEKTIESYLQFCINPLL 293 (385) T ss_pred CCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhc-C---CCcCHHHHHHHHHHHHHHHHH Confidence 35567788888888775544432 2 23444 4455669999999999996 3 566643 34555555566667 Q ss_pred HHHHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhh--hhhhhhhhccccccccchhhhhhhhh Q lcl|NC_011269. 442 NALKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKV--PKLLIPEIKFSTLNLRDEAQERAFIA 519 (867) Q Consensus 442 ~~l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~--~k~i~~~i~~~~~~Lr~e~~~~~~v~ 519 (867) ..|++.+.+.+-.-.+.- ...|++-+..+.-.|..+.-+-..+.-.... ...+ |..+...-+.++. .....+ T Consensus 294 ~~ie~~l~~~L~~~~~~~---~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~--R~~~g~~p~~~~~-gd~~~~ 367 (385) T protein:vir:95 294 RKIEAELNSKFFYQDEYL---NDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQV--RIMTGEEPADDPE-LDKFII 367 (385) T ss_pred HHHHHHHHhhcCChhhcc---cceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHH--HHHhCCCCCCCCC-Cceeee Confidence 777777765552222211 1123433333322222222111111111100 0000 0000000000000 000000 Q ss_pred hhhhceeeeeccccCCCcccccchhhh Q lcl|NC_011269. 520 QLKGMGVPVSDKTLAVNIDMKFDQELE 546 (867) Q Consensus 520 qL~~~~~pitd~t~p~tiqme~E~e~e 546 (867) .+ -..++ +...++. ...| T Consensus 368 ~~--n~~~~-~~~kgge------~~~e 385 (385) T protein:vir:95 368 TK--NLQSA-DAFKGGE------SNEE 385 (385) T ss_pred cc--cceec-ccccCCC------CCCC Confidence 00 00000 0000000 0000 No 120 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=88.09 E-value=0.035 Score=28.60 Aligned_cols=354 Identities=8% Similarity=0.047 Sum_probs=137.5 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh-c Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK-F 136 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 136 (867) -|+-+++.+ -++...++.++.+ + -++-.+ =|-+.+.|-.|||+-++ + T Consensus 1 Mg~f~~l~~---~~~~~~~~~~~~~---------~--------~~~~~~------------~~l~~~~v~~~i~~Ia~~i 48 (376) T protein:vir:78 1 MGFFSELFK---RNKEIEWMWDLDF---------L--------EDKTTK------------VYLKKMALNTCVKHIARTI 48 (376) T ss_pred Cchhhhhhc---cCCccccccchhh---------c--------cccchh------------hhhhhHHHHHHHHHHHHhh Confidence 122221111 0111111111000 0 001111 12245678889888774 3 Q ss_pred ccccceecccchhHHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhh Q lcl|NC_011269. 137 PVVGMEFDSKDPLIKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSM 213 (867) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (867) .-+.|++-.++..++....+++. -+..+-.+|+..++ ..|+.-|+.+-+..-+..+ ...+...+++..+.-. . T Consensus 49 a~~p~~~~~~~~~~~~~l~~ll~~~PN~~~t~~~f~~~~~-~~lll~Gn~~~~~~r~~~~-~~~~~~~~~~~~~~~~--~ 124 (376) T protein:vir:78 49 AKSDFRLKNGETSVRDKLYYKLNIRPNTDMSSSSFWEKVI-YKLIYDNECLIVLSDTDDF-LIADSYVRKEFAFFPD--V 124 (376) T ss_pred cccceeeccccccccchHHHHHhhccccCCCHHHHHHHHH-HHHhHcCcEEEEEEeCCCe-eeccceeecccceeee--e Confidence 33334443333333333333332 13455678888766 7788888876554333221 1111122222211100 0 Q ss_pred hhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcC Q lcl|NC_011269. 214 FVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRP 293 (867) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (867) |. +-+ ..+|.. .-.++..-|-|++... T Consensus 125 ~~------------------~~~-------------------~~~~~~----------------~~~~~~~evih~~~~~ 151 (376) T protein:vir:78 125 FE------------------GVT-------------------VKDYRY----------------NRNFSMDDVIFLEYGN 151 (376) T ss_pred ee------------------eee-------------------eeccee----------------eeeeccccEEEeccCC Confidence 00 000 001100 0113334455665443 Q ss_pred ccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcc---- Q lcl|NC_011269. 294 TAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAAD---- 369 (867) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 369 (867) ......+-+++. ... .-+..+.++-..-- ..--++++ ..+-.-+++..+.+|+.++..+... T Consensus 152 ~~~~~~~~~~~~-~~~-----~~~~~~~~~~~~~~-~~~~~~~~-------~~~~~~~~e~~~~~~~~~~~~~~g~~~~~ 217 (376) T protein:vir:78 152 ERLSAFTDGMFE-DYG-----ELFGKMIRAQMRNF-QIRGAVNF-------KMAGVADKDKQTKLQEYIDKVYASFNNNE 217 (376) T ss_pred CCchhhhhHHHH-HHH-----HHHHHHHHHHHhcC-CCceeEEE-------ccCCCCCHHHHHHHHHHHHHHhccccccC Confidence 332222211111 111 11112222111100 00001111 1122457788889999888877653 Q ss_pred hhhhhhhhheeeeeccccCccCchh-HHHH----HHHHHHHHhhccchhhhcCCCccceehh-hhhHHHHHHHHHHHHHH Q lcl|NC_011269. 370 FRLMVHNFGLKVENVFGRESVPNLD-ADYD----RIERKLLQAWGIGEALISGGTGGAYASS-ALNREFVTQIMTGFQNA 443 (867) Q Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~~~~~~ 443 (867) ..++|-..|++++-+...-..++++ .+|. +..++|.+++||...+++ | +|+++ +....|++.-+.-+... T Consensus 218 ~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~-~---~~s~~e~~~~~f~~~~l~P~~~~ 293 (376) T protein:vir:78 218 IAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLH-G---DMADLSNNMKAYMEYCIDPLTKK 293 (376) T ss_pred cceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhC-C---CCCCHHHHHHHHHHHHHHHHHHH Confidence 4466677888888877666666655 3554 446779999999999996 4 45542 33344555556666666 Q ss_pred HHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhh---hhhh-----hhhhccccccccchhhhh Q lcl|NC_011269. 444 LKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKV---PKLL-----IPEIKFSTLNLRDEAQER 515 (867) Q Consensus 444 l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~---~k~i-----~~~i~~~~~~Lr~e~~~~ 515 (867) |++++.+++=...+. +...++.+..+. |..+. -|..+-..... .+.+ .+.+....- . T Consensus 294 ie~~l~~kll~~~~~--~~~~~~~~ll~~----d~~~~-~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~--------d 358 (376) T protein:vir:78 294 LEDELNAKLFTFSEF--LAGEHIKIIHKK----DIIEN-AEAVDKLVASGSFNRNEVRELLGAERVDNPEL--------D 358 (376) T ss_pred HHHHHHhhhCCcccc--eecccchhhccc----CHHHH-HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC--------c Confidence 666666555111121 122223222221 11111 11111111110 0111 111111100 0 Q ss_pred hhhhhhhhceeeeeccccCC Q lcl|NC_011269. 516 AFIAQLKGMGVPVSDKTLAV 535 (867) Q Consensus 516 ~~v~qL~~~~~pitd~t~p~ 535 (867) ...+.+ -..++..-...+ T Consensus 359 ~~~~~~--n~~~~~~~~e~g 376 (376) T protein:vir:78 359 KYLITK--NYQSADEGGEDG 376 (376) T ss_pred eeeecc--CceehhccccCC Confidence 000000 001111000011 No 121 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=88.04 E-value=0.035 Score=28.58 Aligned_cols=418 Identities=14% Similarity=0.129 Sum_probs=160.7 Q ss_pred ccceeeccchhh-hhhhhhHHhhCCCchhh---hHHHHHHHHHHHHHhhcc----------------------------c Q lcl|NC_011269. 77 GSNMQIAMPKIR-QPLGTLADKGIPFNVED---EEELRVIRHWCRLFYATH----------------------------D 124 (867) Q Consensus 77 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~----------------------------~ 124 (867) |-|++|--..+. .++-.+...-|..-+++ ..++..+++|-+.|.-.| + T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~ 80 (479) T protein:vir:79 1 MLNIYISETDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINN 80 (479) T ss_pred CCCceecccceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecc Confidence 778777544432 23333322222222222 123444566655443334 4 Q ss_pred hHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecC Q lcl|NC_011269. 125 LVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILN 203 (867) Q Consensus 125 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (867) +.++++|.+.-|=++. +.++++|+.++++.+++ +.. |+...+.+ +++.-...|+++-+-..++.+.. ++.+++ T Consensus 81 ~~~~Ivd~~~~~l~g~p~~~~~~~~~~~~~~~~~-~~n--~~~~~~~~-~~~~~~~~G~~~~~v~~d~~~~~--~i~~~~ 154 (479) T protein:vir:79 81 YHKLLVDQKVGYSVGNPIVFNADDDNLTKLLNDL-LGE--EFDDTITE-LYLNASNKGVEWLHPYINRKGEF--KYVIIP 154 (479) T ss_pred hHHHHHHHHHhhhhcCCceeccCCHHHHHHHHHH-Hhc--CHHHHHHH-HHHHHHhcCeEEEEEEeCCCCce--EEEEEc Confidence 4778899998887777 89999999999887665 434 45555566 45788888988776666665432 356677 Q ss_pred cceeehhhhhhhcchHHHHHHHHHHhhcccccc--ccccccccccccchhhhhhh-hhHHHHHHhchHHHhhhccCCCC- Q lcl|NC_011269. 204 PDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPT--TAGGNMSTVEETPSEREQRM-REFQDLQRRYPEIIQAAMQNDGL- 279 (867) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~- 279 (867) |+.+-.- |. +.+. ++++.++|.=.+ .++...+.+|---.++..+. .+...+.....+ .....-.++ T Consensus 155 p~~~~~v---~d--~~~~---~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~--~~~~~~~~~~ 224 (479) T protein:vir:79 155 AEEAIPI---WD--SKRQ---RELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLY--DEYGKMTDIQ 224 (479) T ss_pred cceeEEE---Ee--CCCC---CceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccc--cccccccccc Confidence 7765433 11 1000 001111110000 00111111110000000000 000000000000 000000000 Q ss_pred -----------cccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHH---HhhhhchhhhhhhcccccCCC Q lcl|NC_011269. 280 -----------DISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAV---ADRLYSPLVLATLGIEDMGDG 345 (867) Q Consensus 280 -----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 345 (867) .+..-=|+++.| ..+|.+.+-. -..|+ +.|+.+..-+ -+.+..|+.+++-. +| T Consensus 225 ~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~sd~~~-v~~li--Da~d~~~S~~~~~~~~~~~~~~v~~g~-----~~ 291 (479) T protein:vir:79 225 EGHFRINNKEQGWGKVPFIPFKN-----NEKCVSDLTF-YKSLI--DIYDNNISTLADNLDEIQEVIYVLKEY-----PG 291 (479) T ss_pred cccccccccccCCCcccEEEecC-----CCCCCcchhh-hHHHH--HHHHHHHHHHHHHHHHhhCceeeeecC-----Cc Confidence 011111223322 3457776532 33443 2333322111 24556777665421 11 Q ss_pred CcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceeh Q lcl|NC_011269. 346 EPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYAS 425 (867) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 425 (867) +. +.+..+.++. .+++..+=+-+++++=.....=.+...++.+++.|...-++-.. ..++.|. . T Consensus 292 ~~---~~~~~~~~~~---------~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~-~~~~~gn---~ 355 (479) T protein:vir:79 292 TS---LQEFIDNIRY---------YKSIKVDGGGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNP-ESQNTGD---K 355 (479) T ss_pred cc---cccchhhhhh---------ccceecCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccc-ccccccc---h Confidence 11 1122222211 22222222233333321211112235667777777665544332 2223332 2 Q ss_pred hhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHh---hcccc---hheehhhccccchhhhhhhhhhhhhHhhhhhh Q lcl|NC_011269. 426 SALNREF----VTQIMTGFQNALKRHIRRRCEVVAEA---QGHYD---YDLKGGVRVPIYREIVEYDEETGQEYIRKVPK 495 (867) Q Consensus 426 ~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~---q~~~d---~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k 495 (867) +.+.+.+ +-++-...+..++..++++++.|.++ .+..+ .+++..|+...+++..+.-.-+.++ T Consensus 356 Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl------- 428 (479) T protein:vir:79 356 SGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYDYKTVQITFNHSMIINEAEKIDMAAKS------- 428 (479) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHH------- Confidence 3334433 23344556667777777777777653 22111 2334444433443321111100010 Q ss_pred hhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhccccccccccccccc Q lcl|NC_011269. 496 LLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQ 573 (867) Q Consensus 496 ~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~ 573 (867) .+ + .+.+..+.. ++...+ .+.|.++...|+..................+.. T Consensus 429 ------~g--~-----iS~et~l~~------------l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 429 ------TG--I-----VSDETIVSN------------HPWVED--VNDELERLKKQEDTQKEYDDLIPNNQDGVIDET 479 (479) T ss_pred ------hc--c-----CcHHHHHHh------------CCCCCC--HHHHHHHHHHHHHHHHHHHhccCcccCCCcCcC Confidence 00 0 011111211 111111 122223333333222211111111112222221 No 122 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=86.90 E-value=0.042 Score=28.11 Aligned_cols=398 Identities=12% Similarity=0.147 Sum_probs=162.8 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhcc Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATH 123 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 123 (867) |+.+-+.+.+.... .++.|+.++.+--.+-.+| +.++ .++ + .-......+ T Consensus 1 l~~~~l~~~i~~~~----~~~~r~~~l~~yy~g~~~i------l~~~------------~~~--~------~~~~~ki~~ 50 (429) T protein:vir:98 1 MTKDLLSELIQKHR----SFNLSYSAYKQLYEGDHAI------LQQK------------QKE--Q------YKPDNRLVV 50 (429) T ss_pred CCHHHHHHHHHHHH----HHHHHHHHHHHHhcccccc------cccc------------ccc--c------CCCcceeec Confidence 77776666655543 3445555544332221111 0000 011 1 011224467 Q ss_pred chHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheec Q lcl|NC_011269. 124 DLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEIL 202 (867) Q Consensus 124 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (867) ++.+.++|.+..|=++. +.++.+|+...++..++ | .+.|+..++.+ +.++...-|+++=+--..+.+.. +..++ T Consensus 51 n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~l~~~-~-~~n~~~~~~~~-~~~~~~~~G~~~~~v~~d~~g~~--~~~~~ 125 (429) T protein:vir:98 51 NFAKYIVDTFNGYFIGVPVQTSHENKQVSNYLELL-D-GYNDQDDNNAE-LSKICSIYGHGYELVFNDENAEA--GITYL 125 (429) T ss_pred chHHHHHHHHhhhhcccCceeecCChHHHHHHHHH-H-hhcCHhHHHHH-HHHHHhhcCeEEEEEEecCCCcE--EEEEE Confidence 88889999999988876 89999999888887776 4 35556666666 56888888987655444454333 24567 Q ss_pred CcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCccc Q lcl|NC_011269. 203 NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDIS 282 (867) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 282 (867) +|..+-+--+-...++ ++-.+|--- ..+... ..+-+..++....+ ..+.++.+. T Consensus 126 ~p~~~~~v~dd~~~~~--------~~~~i~~~~-~~~~~~-~~~~~~~~~~~~~~----------------~~~~~~~~~ 179 (429) T protein:vir:98 126 TPLEAFIVYDDSIRQK--------PLFAVRYFY-NKGGVL-EGSYSDASNITYFK----------------DGEKGIEIG 179 (429) T ss_pred cccceEEEEeCCCCCc--------eEEEEEEEE-ecCceE-EEEEEeCceEEEEE----------------ecCCceEec Confidence 7765533210000111 111111110 001110 11111111111110 011111111 Q ss_pred H--------HHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHH---HhhhhchhhhhhhcccccCCCCcCCCC Q lcl|NC_011269. 283 E--------ALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAV---ADRLYSPLVLATLGIEDMGDGEPWIPD 351 (867) Q Consensus 283 ~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~ 351 (867) + ==|+++.| ..+|.+.+-+ .+.|+ +.|+.+..-. -+.+..|.++++ | ..+. T Consensus 180 ~~~~~~~g~vPvv~~~n-----~~~g~sd~e~-v~~li--D~~d~~~s~~~~~~~~~~~p~~~i~-g----~~~~----- 241 (429) T protein:vir:98 180 ESEPHPFDGVPMIEYVE-----NEERQSLLAS-VVTLI--NAFNKAISEKANDVEYFADAYLKIL-G----AELD----- 241 (429) T ss_pred ccccccCCccceEEecC-----CCCCCCcHHH-HHHHH--HHHHHHHHHHHHHHHHhcCceeeee-c----CCCC----- Confidence 1 11233333 3468877654 33443 3344433332 256778888876 3 1111 Q ss_pred HHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccC---c----cCchh---HHHHHHHHHHHHhhccchhhhcCCCcc Q lcl|NC_011269. 352 QGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRE---S----VPNLD---ADYDRIERKLLQAWGIGEALISGGTGG 421 (867) Q Consensus 352 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~----~~~~~---~~~~~~~~~~~~~~~~~~~~~~~g~~~ 421 (867) .++ +++ |+. +++ ++++.-++.. + -.+++ ..++.+++.|...-++-..... +.|. T Consensus 242 ~~~---~~~-~~~-----~~~------~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~gn 305 (429) T protein:vir:98 242 DET---LKS-LRD-----TRI------INLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVANISDE-SFGT 305 (429) T ss_pred cch---hhh-Hhh-----Cce------eeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcc-cccc Confidence 112 222 211 122 3333222111 1 11222 3456666666555544332221 2222 Q ss_pred ceehhhhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhhccc--c---hheehhhccccchhhhhhhhhhhhhHhhh Q lcl|NC_011269. 422 AYASSALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQGHY--D---YDLKGGVRVPIYREIVEYDEETGQEYIRK 492 (867) Q Consensus 422 ~~~~~~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q~~~--d---~~~~~~~~~~~~rd~~~~k~e~~k~~~r~ 492 (867) .+.+.+.+. -++....+..++..++++++-|..+-... + .+++..|....+++..+.-.-+.|+ T Consensus 306 ---~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~v~f~~~~p~~~~~~a~~~~kl---- 378 (429) T protein:vir:98 306 ---ASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIKYKFTRNLPANLLEESQIAGNL---- 378 (429) T ss_pred ---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHH---- Confidence 233444332 24455667778888888888887753211 1 1233333333333321111100000 Q ss_pred hhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccc Q lcl|NC_011269. 493 VPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDA 572 (867) Q Consensus 493 ~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~ 572 (867) . .+ + +.+..+.. ++...+ .+.+.++...|....+......+ .. T Consensus 379 ---------~--g~-i----s~et~~~~------------l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~-------~~ 421 (429) T protein:vir:98 379 ---------A--GI-V----SEETQVGV------------LSIVEN--PQKEIERKNSDKSTLISRQAGGL-------NG 421 (429) T ss_pred ---------h--cc-C----chHHHHHh------------CCCCCC--HHHHHHHHHHHHHHHHHHHHhhh-------cC Confidence 0 00 0 11111211 111111 11223333333322221111111 11 Q ss_pred cCCCCCcc Q lcl|NC_011269. 573 QNLPYPPE 580 (867) Q Consensus 573 ~g~P~pp~ 580 (867) .......+ T Consensus 422 ~~~~~~~~ 429 (429) T protein:vir:98 422 QNTTTILE 429 (429) T ss_pred CCCCCCCC Confidence 11111111 No 123 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=86.85 E-value=0.043 Score=28.09 Aligned_cols=421 Identities=11% Similarity=0.093 Sum_probs=157.5 Q ss_pred HHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhcc----- Q lcl|NC_011269. 49 LIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATH----- 123 (867) Q Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 123 (867) |.+-+...-+ .+-+...+-++ +. ...- .+.|+.+-+ ++..|+.|++.|.-.| T Consensus 1 m~~~~~~~~~----------~~~~~~~~~~~-------~~-~~~~--~~~~~~~~~---~~~~i~~~~~yy~g~~~~~~~ 57 (496) T protein:vir:38 1 MINQIIAGVK----------GVMRRMGLLKA-------LK-DVKD--HKKVNANDE---DYKYIDMWKRLYQGHYAEWHN 57 (496) T ss_pred ChhHHHHHHH----------HHHHHhccchh-------hH-HHHh--cCCCcCCHH---HHHHHHHHHHHhcCCCchhhc Confidence 3333222211 11111100000 11 1111 123443332 3345777876654434 Q ss_pred -----------------chHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcc Q lcl|NC_011269. 124 -----------------DLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTS 185 (867) Q Consensus 124 -----------------~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 185 (867) .+-++++|-+..|=++. +.|+++|+...++.++++= +-++..-+.+ ...+-+..|.+.= T Consensus 58 ~~~~~~~~~~~~~~~~~n~~k~i~~~~a~~l~~~p~~i~~~d~~~~e~l~~~~~--~n~f~~~~~~-~~~~a~~~G~~~~ 134 (496) T protein:vir:38 58 LNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFNEKVKINIDDKAAEEFVLNVLK--TNGFTKNMER-YIEYGEAMGGFVI 134 (496) T ss_pred chhccCCCccccceeecchHHHHHHHHhhhhhCCcceEeeCChHHHHHHHHHHh--ccCHHHHHHH-HHHHHhhhCcEEE Confidence 44467888899987776 7899999888887766542 3445555555 3367777888776 Q ss_pred hhhhhhhccceehheecCcceee-hhhhhhhcchHHHHHHHHHHhhccccccccccccccccccch--hh-hhhhhhHHH Q lcl|NC_011269. 186 LAHFNESLGVWSSEEILNPDMLR-VSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPS--ER-EQRMREFQD 261 (867) Q Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~ 261 (867) .--+++.++.+- .+++|+.+- |. + ....|..+| +++.... +|-..-.+|-.-. .+ .-+++-|.+ T Consensus 135 ~~~~D~~~~~~i--~~v~~~~~~P~~---~-~~~~~~~~~--f~~~~~~----~~~~y~~le~h~~~~~~~~I~~~~y~~ 202 (496) T protein:vir:38 135 KVYHDGNKNVKV--SFATADCMYPLS---N-DSENVDECV--IANSFHK----NNKYYTLLEWNEWQGDVYTVTTELYQS 202 (496) T ss_pred EEEEcCCCcEEE--EEEcccceEEEE---e-cCCcEEEEE--EEEEEEe----CCeEEEEEEEEEEeCceEEEEEEEEec Confidence 666777666653 455665432 11 0 111111111 0111111 0111111110000 00 001111111 Q ss_pred -----HHHhc-----hHHHhhhccCCCCcccHHHHHHh----hhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011269. 262 -----LQRRY-----PEIIQAAMQNDGLDISEALISRV----VNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADR 327 (867) Q Consensus 262 -----~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (867) |-... +|-+.-...-.| ++.=++.++ .|...--++.|.+.+-.+- . ..+.|+.+...+++- T Consensus 203 ~~~~~~g~~v~~~~~~~~~~~~~~~~~--~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~-~--lid~ld~~~s~~~~~ 277 (496) T protein:vir:38 203 DDPNELGTKVSLTLLFDDIEPVVPLPD--FTRPTFIYIKPNIANNKNLTSPLGISVYANAL-D--TLKTLDLMFDSYYQE 277 (496) T ss_pred CCccccCccccccccccccccceeecC--CCcceEEEecCCcccccccCCcCCCchHhhHH-H--HHHHHHHHHHHHHHH Confidence 11111 110000000011 122233332 2333334456777765443 2 334555555554432 Q ss_pred h-------hchhhhhh-hcccccCCCCcC---CCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCcc--Cc-- Q lcl|NC_011269. 328 L-------YSPLVLAT-LGIEDMGDGEPW---IPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESV--PN-- 392 (867) Q Consensus 328 ~-------~~~~~~~~-~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-- 392 (867) + +.|--+++ ++. +.|+.. +.+.+ .| +.++.+..+....+ ++ T Consensus 278 ~~~~~~~i~v~~~~l~~~~~---~~g~~~~~~~~~~~---------------~~------~~~~~~~~~~~~~i~~~~~~ 333 (496) T protein:vir:38 278 FKLGKKKVLVPSSFVKTAVN---LDGSTTQYFDSTDE---------------AF------FLYQGDQDDNGKAIKDISVE 333 (496) T ss_pred HhhcccceecchHHhhccCC---CCCccccCCCCccc---------------eE------EEeecCCCcccccceeeccc Confidence 2 22322222 221 233331 11100 01 11223333222111 11 Q ss_pred h--h---HHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHH----HHHHHHHHHhhhhHHHHHhhc--- Q lcl|NC_011269. 393 L--D---ADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTG----FQNALKRHIRRRCEVVAEAQG--- 460 (867) Q Consensus 393 ~--~---~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~----~~~~l~~~~r~~~~~i~e~q~--- 460 (867) + + +.++.+.+.|....|++...++...++. +| +..+-..-|.|+. .++.++..|+++++.|-++-. T Consensus 334 i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~-~t-Atei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~ 411 (496) T protein:vir:38 334 IRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGL-KT-ATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIE 411 (496) T ss_pred cCHHHHHHHHHHHHHHHHHhhCCChhhcCCCcccc-ch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1 4566777888888899988877433322 22 2233233445555 555566677777666655311 Q ss_pred ------ccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhcccc-ccccchhhhhhhhhhhhhceeeeecccc Q lcl|NC_011269. 461 ------HYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFST-LNLRDEAQERAFIAQLKGMGVPVSDKTL 533 (867) Q Consensus 461 ------~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~-~~Lr~e~~~~~~v~qL~~~~~pitd~t~ 533 (867) ..+..+++.|+...+.|.-+....+.++. ..-.+-.+..... +.+. +.+++..+..++.... T Consensus 412 ~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~---~~GiiS~et~l~~~~~~~-d~ea~~el~ri~~E~~------- 480 (496) T protein:vir:38 412 AYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAK---NQGMIPLKIALQRAWNIT-EAEADEWAEMLAKEKQ------- 480 (496) T ss_pred hhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHH---hcCCCCHHHHHHhcCCCC-hHHHHHHHHHHHHhhh------- Confidence 12223444444433333221111111110 0000111111110 1111 1111122222221110 Q ss_pred CCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 534 AVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 534 p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) .. ++ . .+.-+.-...+ T Consensus 481 --------------------~~-------~~-~---~d~~~~~~~~e 496 (496) T protein:vir:38 481 --------------------AE-------MP-N---NDMNGIFGEEE 496 (496) T ss_pred --------------------cc-------Cc-c---ccccCCCCCCC Confidence 00 00 0 00000000011 No 124 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=85.55 E-value=0.052 Score=27.62 Aligned_cols=429 Identities=10% Similarity=0.067 Sum_probs=152.7 Q ss_pred hhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCch-hhhHHHHHHHHHHHHHhhc-c---------- Q lcl|NC_011269. 56 RRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNV-EDEEELRVIRHWCRLFYAT-H---------- 123 (867) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~---------- 123 (867) |+ -|-.|| ..--.|..+-||.=-........+.|-... +.+..++ .++.||.. | T Consensus 1 ~~-~~~~~~---------~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~----~l~~Yy~g~~~i~~~~~~~~ 66 (470) T protein:vir:99 1 MK-DINYGR---------DKVTGNSSFIFPKGEKLTSNELLGFIAYNETVLKPRYR----ENMKLYLGKHKILTAPEKET 66 (470) T ss_pred Cc-cccCCc---------ccccCCceEEeCCCCCcCHHHHHHHHHHHHHhhHHHHH----HHHHHhccccccccCccccc Confidence 00 000000 111235555555333333332223332211 1111121 23456653 2 Q ss_pred --------chHHHHHHhhhhccccc-ceecccchh-HHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhc Q lcl|NC_011269. 124 --------DLVPLLIDIYSKFPVVG-MEFDSKDPL-IKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESL 193 (867) Q Consensus 124 --------~~~~~~~~~~~~~~~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (867) ++.+.++|.+..|=++. +.|+++|+. ..+...+++. +-|+-.++.+ +++.....|.++-+--.++.+ T Consensus 67 ~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~~d~~~~~~l~~~~~--~n~~~~~~~~-~~~~~~~~G~~~~~v~~d~dg 143 (470) T protein:vir:99 67 GADNRIVVNSAKYVVDVYNGYFCGIEPKLALLNDSSKIDEIARWNR--QENFFDTINE-ISKQCDIFGRSIASIYQGEDA 143 (470) T ss_pred CCcceeecchHHHHHHHHhhhhccCCeeEeeCCchhHHHHHHHHHH--hcCHhHHHHH-HHHHHHhcCeeEEEEEeCCCC Confidence 57889999999998776 888776543 3344555544 4455556666 447788888765544334533 Q ss_pred cceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccc---cccccchhhhhhhhhHHHHHHhchHHH Q lcl|NC_011269. 194 GVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMS---TVEETPSEREQRMREFQDLQRRYPEII 270 (867) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~ 270 (867) .. +..+++|+.+-+--+-...++. +-.+|- -...+.... .+..|+... -+.+.. ++...+.+ + T Consensus 144 ~~--~i~~~~p~~~~~i~d~~~~~~~--------~~~vr~-~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~-~ 209 (470) T protein:vir:99 144 RP--HLMYSSPNHAFIIYDDTVQRQP--------LAFVHY-QIDNSNNWTDAYGVIQYADKF-YKFKGY-DIEEDTNA-A 209 (470) T ss_pred eE--EEEEEccceeEEEEcCCCCcce--------EEEEEE-EEEecCCeeEEEEEEEecCeE-EEEEec-cccccccc-c Confidence 32 3566788766443111111111 101110 000111110 111122111 100000 00001111 0 Q ss_pred hhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHH-hhhhchhhhhhhcccccCCCCcCC Q lcl|NC_011269. 271 QAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA-DRLYSPLVLATLGIEDMGDGEPWI 349 (867) Q Consensus 271 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 349 (867) +... . .+..=-|+++.| ..+|.+.+-+ .+.|+..-...-.+-+.+ +-+..|.++++-.. + T Consensus 210 ~~~~--~--~~g~vPvv~~~n-----~~~g~sd~e~-v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~---------~ 270 (470) T protein:vir:99 210 GYAI--N--PYGLVPAVEFFE-----NEERQGIFDS-IKTLINALDKVISQKANQVEYFDNAYMYMIGFK---------L 270 (470) T ss_pred cccc--c--CCCccceEeecC-----CCCCCcchHh-HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC---------c Confidence 0000 0 011111223333 3467776644 444443222222222222 34556666554221 1 Q ss_pred CCHHHHHHHHHHHHHhhhcchhhhhh-----hhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCcccee Q lcl|NC_011269. 350 PDQGELDEVRDDMQSLLAADFRLMVH-----NFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYA 424 (867) Q Consensus 350 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 424 (867) ++-++-+.++ +++. .+++.. .-+-+++++=-.-..-.+...++.+++.|...-++-. +..++.+++= T Consensus 271 ~~~~~g~~~~-~~~~-----~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~-~~~~~~~~n~- 342 (470) T protein:vir:99 271 PEDDEGNPKF-DFKN-----NRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPN-IQDKNFAGNS- 342 (470) T ss_pred ccccccchhh-hhhh-----cceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCcc-ccccccccCc- Confidence 1111111121 2221 121111 1111122110000111122456777777766666553 2233333332 Q ss_pred hhhhhHH----HHHHHHHHHHHHHHHHHhhhhHHHHHhhcc---cc---hheehhhccccchhhhhhhhhhhhhHhhhhh Q lcl|NC_011269. 425 SSALNRE----FVTQIMTGFQNALKRHIRRRCEVVAEAQGH---YD---YDLKGGVRVPIYREIVEYDEETGQEYIRKVP 494 (867) Q Consensus 425 ~~~~~~~----~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~---~d---~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~ 494 (867) +.+.+. .+-++....++.++..|+++++-|.++-+. .+ ..++..|+...+++..+.-.-+.++ T Consensus 343 -Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl------ 415 (470) T protein:vir:99 343 -SGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNA------ 415 (470) T ss_pred -hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHH------ Confidence 333333 334455667788888888888888775331 11 1233333333333321111101000 Q ss_pred hhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccC Q lcl|NC_011269. 495 KLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQN 574 (867) Q Consensus 495 k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g 574 (867) . .+ + +.+..+.. ++. +..+.|.++...|..................... T Consensus 416 -------~--gi-i----s~et~l~~------------l~~---vd~~~E~eri~~E~~~~~~~~~~~~~~~d~~~~d-- 464 (470) T protein:vir:99 416 -------E--GI-V----SKKTQLGM------------IPD---IEPDAEMKQIAKEKADAIKQTQQLSMPIDILKRD-- 464 (470) T ss_pred -------h--cc-C----CHHHHHHh------------CCC---CCHHHHHHHHHHHHHHHHHHHHhhcCCCCcCCCC-- Confidence 0 00 1 11111111 111 1111222222222222221111111111100000 Q ss_pred CCCCccc Q lcl|NC_011269. 575 LPYPPEL 581 (867) Q Consensus 575 ~P~pp~~ 581 (867) +...+. T Consensus 465 -~~~ee~ 470 (470) T protein:vir:99 465 -NNAEEE 470 (470) T ss_pred -CCccCC Confidence 000011 No 125 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=85.29 E-value=0.054 Score=27.54 Aligned_cols=354 Identities=11% Similarity=0.070 Sum_probs=129.7 Q ss_pred eeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh----cccccceecccc---hhHHH Q lcl|NC_011269. 80 MQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK----FPVVGMEFDSKD---PLIKT 152 (867) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~---~~~~~ 152 (867) |.|-. ||+- ..-.+.+-.+-. ..-.|-+......+.|-.|||+-+. -|+--++-+.+| ..+.+ T Consensus 1 M~if~-~~~~---~~~~~~~~~~~~------~~~~~~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~ 70 (378) T protein:vir:94 1 MNLFG-KVVS---FSRGKLNNDTQR------VTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLIS 70 (378) T ss_pred CchhH-HhHh---hhhcccccCcce------eeeeecchhhhhhHHHHHHHHHHHHhHhhCceeeeeecccccccccccc Confidence 33333 2221 000111100000 0000111222344567778877654 332212111111 11111 Q ss_pred HHH----HHh---hcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHH Q lcl|NC_011269. 153 FYE----DLF---FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVK 225 (867) Q Consensus 153 ~~~----~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (867) ..+ +++ =.+.++=.+|+..++ ..++.-|+++=+.-+....|.+ ++ T Consensus 71 ~~~~~l~~lLn~~PN~~~t~~~f~~~~~-~~lll~Gnayi~~i~~~~~g~~-----------------------~~---- 122 (378) T protein:vir:94 71 MAGSDLDEVLNWSSKGERNSMEFWQKVI-KKLLTTRYIDLYPIFDSETGEL-----------------------LD---- 122 (378) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHH-HHHhhcCCeEEEEEeeCCCCcE-----------------------EE---- Confidence 111 111 123444557888877 7788888876321111111111 00 Q ss_pred HHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhh Q lcl|NC_011269. 226 DLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLL 305 (867) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (867) .|+ -+++..+...-|-|+.+.. +.-.++..+= T Consensus 123 ---------------------------------------~~~-------~~~~~~~~~~dvih~~~~~--~~~~~~~~~~ 154 (378) T protein:vir:94 123 ---------------------------------------LLF-------ANDKKEYKPEELVRLTSPF--YINEDTSILD 154 (378) T ss_pred ---------------------------------------EEE-------ecCcEEechhceeeecCcC--CcccchhHHH Confidence 000 0122333444455664322 1112222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHH----HHHHhhhcch--hhhhhhhhe Q lcl|NC_011269. 306 RSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRD----DMQSLLAADF--RLMVHNFGL 379 (867) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~--~~~~~~~~~ 379 (867) .+. ++.+.+++. .++=-++++.+ ++ +++..+++|+ .++....++. .++|-.-|+ T Consensus 155 ~~~---------~~~~~~~~~--~~~~g~l~~~~--------~l-~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~g~ 214 (378) T protein:vir:94 155 NAL---------ASIQTKLEQ--GKLRGLLKINA--------FL-DIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKT 214 (378) T ss_pred HHH---------HHHHHHHhh--CCcccceeeCC--------cC-CHHHHHHHHHHHHHHHHHhhcccccccceeccCCc Confidence 111 111222221 11111223321 12 2333344444 4443333332 467778899 Q ss_pred eeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHh- Q lcl|NC_011269. 380 KVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEA- 458 (867) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~- 458 (867) +++-+...-+...+ .++++++++|.+++||...+++ |+ |+. +-...|+.+-+.-+..+|++++.+.+=...|. T Consensus 215 ~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgvPp~~l~-g~---~~e-~~~~~f~~~tl~P~~~~ie~~l~~~Ll~~~e~~ 288 (378) T protein:vir:94 215 EIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILL-GT---ATQ-EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRR 288 (378) T ss_pred eEEEccCChHHhhH-HHHHHHHHHHHHHhCCCHHHhc-CC---chH-HHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhh Confidence 99988877777776 5679999999999999999997 43 332 33445666667777777888877766433332 Q ss_pred hcccc---hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCC Q lcl|NC_011269. 459 QGHYD---YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAV 535 (867) Q Consensus 459 q~~~d---~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~ 535 (867) ++++. .++.+-+..+.--|..+ +-|.++..+.....- ..++.-- +.+ .+.++. .. ..++.-.. T Consensus 289 ~g~~~~~~~~~~f~~~~l~~~d~~~-~~e~~~~~~~~G~~t-~NE~R~~-~g~---~p~~gg-d~-----~~~~~n~~-- 354 (378) T protein:vir:94 289 VVKGNLYYERIIVDNQLFKFATLKE-LIDLYHENINGPIFT-QNQLLVK-MGE---QPIEGG-DV-----YIANLNAV-- 354 (378) T ss_pred hhhhhcccceeEeecchhhhcCHHH-HHHHHHHHHhCCCcC-HHHHHHH-hCC---CCCCCC-Ce-----eeeccccc-- Confidence 22222 11222222221111111 111111111111000 0011000 000 000000 00 00110000 Q ss_pred CcccccchhhhhhHHHHHHHHhhcccccccccccccccCC Q lcl|NC_011269. 536 NIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNL 575 (867) Q Consensus 536 tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~ 575 (867) .+....+.+.. .....+ .-+..+. T Consensus 355 ~~~~~~~~~~~-------------~~~~~~---~~e~~n~ 378 (378) T protein:vir:94 355 AVKNLSDLQGN-------------RKDVTS---TDETNNQ 378 (378) T ss_pred chhcchhcccc-------------cCCCCC---CCCCCCC Confidence 00000000000 000000 0000000 No 126 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=83.97 E-value=0.03 Score=28.93 Aligned_cols=241 Identities=10% Similarity=-0.004 Sum_probs=124.5 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhh---hhHH-hhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhh Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLG---TLAD-KGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIY 133 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (867) -|+-+.+.. |+.. +|. ..+...+. .... .+.. +.+. -+-.++-|-.|||+. T Consensus 1 MglF~~~~~----r~~~---~~~----~~~~~~~~~~~~~~~~~~~~--v~~~------------~al~~~~v~~~i~~i 55 (251) T protein:vir:46 1 MGIFYKNEK----RDLQ---YNE----DDLQMMVQTLPSFQGTKLRQ--YKDI------------EAIRHSDIFTAVMMI 55 (251) T ss_pred CCccccccc----cccC---CCc----cchhhhhhhhccccCcCcce--echh------------hhhccHHHHHHHHHH Confidence 111100000 0000 000 00000000 0000 0011 1111 122345566777766 Q ss_pred hh-cccccceecccchhHHHHHH-HHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceee Q lcl|NC_011269. 134 SK-FPVVGMEFDSKDPLIKTFYE-DLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLR 208 (867) Q Consensus 134 ~~-~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 208 (867) ++ ..-..++..-+...+++--. +++. .+.++-.+|+..++ ..++.-|+.+-+..-|.. |-..+++.|+|+.|. T Consensus 56 a~~iA~lp~~~~~~~~~~~~~~~~~ll~~~Pn~~~t~~~f~~~l~-~~lll~Gnay~~i~r~~~-G~~~~L~~i~~~~v~ 133 (251) T protein:vir:46 56 ASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVF-VSALLTSHGYIEITRDKT-GEPMNLTFRKTSEIE 133 (251) T ss_pred HHhHhhCceEEeeCccccccchHHHHHhccCCCCCCHHHHHHHHH-HHHhhcCCeEEEEEECCC-CcEEEEEEECCceEE Confidence 54 11111222222222222111 1111 24566678999877 889999999888877766 457889999999999 Q ss_pred hhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHH Q lcl|NC_011269. 209 VSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISR 288 (867) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 288 (867) |.+. .+..+. ++.+. +..-..+....++..-|-| T Consensus 134 v~~~---~~g~~~-----------------------------------------~~~~~--~~~~~~g~~~~~~~~diiH 167 (251) T protein:vir:46 134 LKSD---ARGRLY-----------------------------------------YFHQR--IDSNGNNIERNVKFEDMLD 167 (251) T ss_pred EEEC---CCCcEE-----------------------------------------EEEEE--eccCCcceeEEECCccEEE Confidence 8722 121111 00000 0111122334566667888 Q ss_pred hhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhc Q lcl|NC_011269. 289 VVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAA 368 (867) Q Consensus 289 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 368 (867) ++.-... --.|.+.+--+..+|-......+......++-..|--++++-+ .|-+++..+.+|++++..... T Consensus 168 ~r~~~~d-g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--------~l~~~e~~~~~~~~~~~~~~g 238 (251) T protein:vir:46 168 IKFYSLD-GINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG--------VLDNKKARDRAREEFPKVLVE 238 (251) T ss_pred ecCcCCC-CeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCC--------CCCCHHHHHHHHHHHHHHhcC Confidence 8754321 2379999999999998888888888888888888888888742 255667788899887776543 Q ss_pred -chhhhhhhhheeeeeccccC Q lcl|NC_011269. 369 -DFRLMVHNFGLKVENVFGRE 388 (867) Q Consensus 369 -~~~~~~~~~~~~~~~~~~~~ 388 (867) ++- | +|. +|-.| T Consensus 239 ~~n~------g-~~~-~gm~~ 251 (251) T protein:vir:46 239 LNKL------G-KLS-YSMNQ 251 (251) T ss_pred cccc------c-ccc-cccCC Confidence 222 1 111 12222 No 127 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=83.29 E-value=0.07 Score=26.93 Aligned_cols=380 Identities=10% Similarity=-0.010 Sum_probs=139.2 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhh-hc Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYS-KF 136 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 136 (867) -|+-+++- +=++....+.++. +-.++- + ..|-.++.|-.||++-+ .+ T Consensus 1 Mg~f~~lf---~~~~~~~~~~~~~-----------------~~~~v~-----------~-~~~~~~~~v~~~i~~Ia~~i 48 (395) T protein:vir:10 1 MSILEKIF---KTRKDITYMLDLD-----------------MIEDLS-----------Q-QAYVKRLAIDSCIEFVARAV 48 (395) T ss_pred Cchhhhhh---ccCccccccccch-----------------hccccc-----------h-hhhhhhHHHHHHHHHHHHhh Confidence 11111100 0011111111110 001111 1 23445788888888765 34 Q ss_pred ccccceecccchhHHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhh Q lcl|NC_011269. 137 PVVGMEFDSKDPLIKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSM 213 (867) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (867) .-+.|+...++.-++.=..+++. -+..+-.+|+..++ ..|+.-|.++=+.+-+. + .+++++..+.+.+ T Consensus 49 A~~p~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~-~~lll~g~~~~~~~~~~--~----~~~~~~~~~~~~~-- 119 (395) T protein:vir:10 49 AQSHFKVLEGNRIQKNDVYYKLNIKPNTDLSSDSFWQQVI-YKLIYDNEVLIVVSDSK--E----LLIADSFYREEYA-- 119 (395) T ss_pred ccceeEeccCCccccchHHHHHHhccCcCCCHHHHHHHHH-HHHhhCCceEEEEecCC--C----eEecCCccceeEe-- Confidence 43334433333222222222222 13445567888866 67888887765544332 2 2333333333221 Q ss_pred hhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcC Q lcl|NC_011269. 214 FVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRP 293 (867) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (867) +.+.+.. +-+. .+| +..-.++..-|-|+..-. T Consensus 120 -~~~~~~~------------~~~~-------------------~~~----------------~~~~~~~~~evih~~~~~ 151 (395) T protein:vir:10 120 -LYDDIFK------------DVTV-------------------KDY----------------TYQRTFTMQEVIYLKYNN 151 (395) T ss_pred -ecCccee------------EEEE-------------------cCc----------------eeeeeeccccEEEEccCC Confidence 1010000 0000 000 000123444466776544 Q ss_pred ccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch--- Q lcl|NC_011269. 294 TAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF--- 370 (867) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 370 (867) +.-...|.+.+--+-. .+..+.++. .+-.++--+.++.. .+.+++..+.+|..++....++. T Consensus 152 ~~~~~~G~spi~~~~~------~~~~~~~~~-~~~~~~~gii~~~~--------~~~~~e~~~~~~~~~~~~~~~~~~~~ 216 (395) T protein:vir:10 152 NKVTHFVESLFEDYGK------IFGRMIGAQ-LKNYQIRGILKSAS--------SAYDEKNIEKLQAFTNKLFNTFNKNQ 216 (395) T ss_pred CCcccccchHHHHHHH------HHHHHHHHH-HhcCCCceEEEeCC--------CCCCHHHHHHHHHHHHHHhccccccC Confidence 4444456554432221 122233332 22333333333321 14578888888888877766642 Q ss_pred -hhhhhhhheeeeeccccCccCchh-HHH----HHHHHHHHHhhccchhhhcCCCccceeh-hhhhHHHHHHHHHHHHHH Q lcl|NC_011269. 371 -RLMVHNFGLKVENVFGRESVPNLD-ADY----DRIERKLLQAWGIGEALISGGTGGAYAS-SALNREFVTQIMTGFQNA 443 (867) Q Consensus 371 -~~~~~~~~~~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~~~~~~~~~~~ 443 (867) .+++-.-|++++.+.-.-+..+++ .+| ++..++|.+++||..++|. | +|++ .+....|++.-+.-+... T Consensus 217 ~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~-~---~~sn~e~~~~~~~~~~l~P~~~~ 292 (395) T protein:vir:10 217 LAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY-G---ETADLEKNTLVFEKFCLTPLLKK 292 (395) T ss_pred cceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc-C---cccCHHHHHHHHHHHHHHHHHHH Confidence 233334456666555444444443 233 4556889999999999995 4 4554 233344444445555555 Q ss_pred HHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhh Q lcl|NC_011269. 444 LKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKG 523 (867) Q Consensus 444 l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~ 523 (867) |++++.+.+=.-.+. ...++|-+..+...|..+.-+ ..+..++.. | +..+.-+...+ +..++. T Consensus 293 ie~~l~~kL~~~~~~----~~~~~f~~~~l~~~D~~~~~~-~~~~~~~~G---~--------lt~NE~R~~~g-~~p~~~ 355 (395) T protein:vir:10 293 IQNELNAKLITQSMY----LKDTRIEIVGVNKKDPLQYAE-AIDKLVSSG---S--------FTRNEVRIMLG-EEPSDN 355 (395) T ss_pred HHHHHHHhhcChhhh----cccceecchhhhccCHHHHHH-HHHHHHhCC---C--------cCHHHHHHHhC-CCCCCC Confidence 555555444111111 111222222222222211111 111111110 0 11100000000 000000 Q ss_pred c-eeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCC Q lcl|NC_011269. 524 M-GVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNL 575 (867) Q Consensus 524 ~-~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~ 575 (867) - +....-+..-..++...+.+ ....+ ..+ +..+. ...+- T Consensus 356 g~~d~~~~~~n~~~~~~~~~~~--~~~~~--~~~--------kgg~~-~~~g~ 395 (395) T protein:vir:10 356 PELDEYLITKNYEKANSGENDE--KEKDE--NTL--------KGGDE-DESGD 395 (395) T ss_pred CCCceeeecccccccccccccc--Ccccc--ccc--------CCCCC-CCCCC Confidence 0 00000000000111100000 00000 000 00000 00000 No 128 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=83.29 E-value=0.07 Score=26.93 Aligned_cols=380 Identities=10% Similarity=-0.010 Sum_probs=139.2 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhh-hc Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYS-KF 136 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 136 (867) -|+-+++- +=++....+.++. +-.++- + ..|-.++.|-.||++-+ .+ T Consensus 1 Mg~f~~lf---~~~~~~~~~~~~~-----------------~~~~v~-----------~-~~~~~~~~v~~~i~~Ia~~i 48 (395) T protein:vir:95 1 MSILEKIF---KTRKDITYMLDLD-----------------MIEDLS-----------Q-QAYVKRLAIDSCIEFVARAV 48 (395) T ss_pred Cchhhhhh---ccCccccccccch-----------------hccccc-----------h-hhhhhhHHHHHHHHHHHHhh Confidence 11111100 0011111111110 001111 1 23445788888888765 34 Q ss_pred ccccceecccchhHHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhh Q lcl|NC_011269. 137 PVVGMEFDSKDPLIKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSM 213 (867) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (867) .-+.|+...++.-++.=..+++. -+..+-.+|+..++ ..|+.-|.++=+.+-+. + .+++++..+.+.+ T Consensus 49 A~~p~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~-~~lll~g~~~~~~~~~~--~----~~~~~~~~~~~~~-- 119 (395) T protein:vir:95 49 AQSHFKVLEGNRIQKNDVYYKLNIKPNTDLSSDSFWQQVI-YKLIYDNEVLIVVSDSK--E----LLIADSFYREEYA-- 119 (395) T ss_pred ccceeEeccCCccccchHHHHHHhccCcCCCHHHHHHHHH-HHHhhCCceEEEEecCC--C----eEecCCccceeEe-- Confidence 43334433333222222222222 13445567888866 67888887765544332 2 2333333333221 Q ss_pred hhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcC Q lcl|NC_011269. 214 FVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRP 293 (867) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (867) +.+.+.. +-+. .+| +..-.++..-|-|+..-. T Consensus 120 -~~~~~~~------------~~~~-------------------~~~----------------~~~~~~~~~evih~~~~~ 151 (395) T protein:vir:95 120 -LYDDIFK------------DVTV-------------------KDY----------------TYQRTFTMQEVIYLKYNN 151 (395) T ss_pred -ecCccee------------EEEE-------------------cCc----------------eeeeeeccccEEEEccCC Confidence 1010000 0000 000 000123444466776544 Q ss_pred ccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch--- Q lcl|NC_011269. 294 TAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF--- 370 (867) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 370 (867) +.-...|.+.+--+-. .+..+.++. .+-.++--+.++.. .+.+++..+.+|..++....++. T Consensus 152 ~~~~~~G~spi~~~~~------~~~~~~~~~-~~~~~~~gii~~~~--------~~~~~e~~~~~~~~~~~~~~~~~~~~ 216 (395) T protein:vir:95 152 NKVTHFVESLFEDYGK------IFGRMIGAQ-LKNYQIRGILKSAS--------SAYDEKNIEKLQAFTNKLFNTFNKNQ 216 (395) T ss_pred CCcccccchHHHHHHH------HHHHHHHHH-HhcCCCceEEEeCC--------CCCCHHHHHHHHHHHHHHhccccccC Confidence 4444456554432221 122233332 22333333333321 14578888888888877766642 Q ss_pred -hhhhhhhheeeeeccccCccCchh-HHH----HHHHHHHHHhhccchhhhcCCCccceeh-hhhhHHHHHHHHHHHHHH Q lcl|NC_011269. 371 -RLMVHNFGLKVENVFGRESVPNLD-ADY----DRIERKLLQAWGIGEALISGGTGGAYAS-SALNREFVTQIMTGFQNA 443 (867) Q Consensus 371 -~~~~~~~~~~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~~~~~~~~~~~ 443 (867) .+++-.-|++++.+.-.-+..+++ .+| ++..++|.+++||..++|. | +|++ .+....|++.-+.-+... T Consensus 217 ~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~-~---~~sn~e~~~~~~~~~~l~P~~~~ 292 (395) T protein:vir:95 217 LAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY-G---ETADLEKNTLVFEKFCLTPLLKK 292 (395) T ss_pred cceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc-C---cccCHHHHHHHHHHHHHHHHHHH Confidence 233334456666555444444443 233 4556889999999999995 4 4554 233344444445555555 Q ss_pred HHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhh Q lcl|NC_011269. 444 LKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKG 523 (867) Q Consensus 444 l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~ 523 (867) |++++.+.+=.-.+. ...++|-+..+...|..+.-+ ..+..++.. | +..+.-+...+ +..++. T Consensus 293 ie~~l~~kL~~~~~~----~~~~~f~~~~l~~~D~~~~~~-~~~~~~~~G---~--------lt~NE~R~~~g-~~p~~~ 355 (395) T protein:vir:95 293 IQNELNAKLITQSMY----LKDTRIEIVGVNKKDPLQYAE-AIDKLVSSG---S--------FTRNEVRIMLG-EEPSDN 355 (395) T ss_pred HHHHHHHhhcChhhh----cccceecchhhhccCHHHHHH-HHHHHHhCC---C--------cCHHHHHHHhC-CCCCCC Confidence 555555444111111 111222222222222211111 111111110 0 11100000000 000000 Q ss_pred c-eeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCC Q lcl|NC_011269. 524 M-GVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNL 575 (867) Q Consensus 524 ~-~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~ 575 (867) - +....-+..-..++...+.+ ....+ ..+ +..+. ...+- T Consensus 356 g~~d~~~~~~n~~~~~~~~~~~--~~~~~--~~~--------kgg~~-~~~g~ 395 (395) T protein:vir:95 356 PELDEYLITKNYEKANSGENDE--KEKDE--NTL--------KGGDE-DESGD 395 (395) T ss_pred CCCceeeecccccccccccccc--Ccccc--ccc--------CCCCC-CCCCC Confidence 0 00000000000111100000 00000 000 00000 00000 No 129 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=83.29 E-value=0.07 Score=26.93 Aligned_cols=380 Identities=10% Similarity=-0.010 Sum_probs=139.2 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhh-hc Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYS-KF 136 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 136 (867) -|+-+++- +=++....+.++. +-.++- + ..|-.++.|-.||++-+ .+ T Consensus 1 Mg~f~~lf---~~~~~~~~~~~~~-----------------~~~~v~-----------~-~~~~~~~~v~~~i~~Ia~~i 48 (395) T protein:vir:10 1 MSILEKIF---KTRKDITYMLDLD-----------------MIEDLS-----------Q-QAYVKRLAIDSCIEFVARAV 48 (395) T ss_pred Cchhhhhh---ccCccccccccch-----------------hccccc-----------h-hhhhhhHHHHHHHHHHHHhh Confidence 11111100 0011111111110 001111 1 23445788888888765 34 Q ss_pred ccccceecccchhHHHHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhh Q lcl|NC_011269. 137 PVVGMEFDSKDPLIKTFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSM 213 (867) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (867) .-+.|+...++.-++.=..+++. -+..+-.+|+..++ ..|+.-|.++=+.+-+. + .+++++..+.+.+ T Consensus 49 A~~p~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~-~~lll~g~~~~~~~~~~--~----~~~~~~~~~~~~~-- 119 (395) T protein:vir:10 49 AQSHFKVLEGNRIQKNDVYYKLNIKPNTDLSSDSFWQQVI-YKLIYDNEVLIVVSDSK--E----LLIADSFYREEYA-- 119 (395) T ss_pred ccceeEeccCCccccchHHHHHHhccCcCCCHHHHHHHHH-HHHhhCCceEEEEecCC--C----eEecCCccceeEe-- Confidence 43334433333222222222222 13445567888866 67888887765544332 2 2333333333221 Q ss_pred hhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcC Q lcl|NC_011269. 214 FVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRP 293 (867) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (867) +.+.+.. +-+. .+| +..-.++..-|-|+..-. T Consensus 120 -~~~~~~~------------~~~~-------------------~~~----------------~~~~~~~~~evih~~~~~ 151 (395) T protein:vir:10 120 -LYDDIFK------------DVTV-------------------KDY----------------TYQRTFTMQEVIYLKYNN 151 (395) T ss_pred -ecCccee------------EEEE-------------------cCc----------------eeeeeeccccEEEEccCC Confidence 1010000 0000 000 000123444466776544 Q ss_pred ccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch--- Q lcl|NC_011269. 294 TAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF--- 370 (867) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 370 (867) +.-...|.+.+--+-. .+..+.++. .+-.++--+.++.. .+.+++..+.+|..++....++. T Consensus 152 ~~~~~~G~spi~~~~~------~~~~~~~~~-~~~~~~~gii~~~~--------~~~~~e~~~~~~~~~~~~~~~~~~~~ 216 (395) T protein:vir:10 152 NKVTHFVESLFEDYGK------IFGRMIGAQ-LKNYQIRGILKSAS--------SAYDEKNIEKLQAFTNKLFNTFNKNQ 216 (395) T ss_pred CCcccccchHHHHHHH------HHHHHHHHH-HhcCCCceEEEeCC--------CCCCHHHHHHHHHHHHHHhccccccC Confidence 4444456554432221 122233332 22333333333321 14578888888888877766642 Q ss_pred -hhhhhhhheeeeeccccCccCchh-HHH----HHHHHHHHHhhccchhhhcCCCccceeh-hhhhHHHHHHHHHHHHHH Q lcl|NC_011269. 371 -RLMVHNFGLKVENVFGRESVPNLD-ADY----DRIERKLLQAWGIGEALISGGTGGAYAS-SALNREFVTQIMTGFQNA 443 (867) Q Consensus 371 -~~~~~~~~~~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~~~~~~~~~~~ 443 (867) .+++-.-|++++.+.-.-+..+++ .+| ++..++|.+++||..++|. | +|++ .+....|++.-+.-+... T Consensus 217 ~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~-~---~~sn~e~~~~~~~~~~l~P~~~~ 292 (395) T protein:vir:10 217 LAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY-G---ETADLEKNTLVFEKFCLTPLLKK 292 (395) T ss_pred cceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc-C---cccCHHHHHHHHHHHHHHHHHHH Confidence 233334456666555444444443 233 4556889999999999995 4 4554 233344444445555555 Q ss_pred HHHHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhh Q lcl|NC_011269. 444 LKRHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKG 523 (867) Q Consensus 444 l~~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~ 523 (867) |++++.+.+=.-.+. ...++|-+..+...|..+.-+ ..+..++.. | +..+.-+...+ +..++. T Consensus 293 ie~~l~~kL~~~~~~----~~~~~f~~~~l~~~D~~~~~~-~~~~~~~~G---~--------lt~NE~R~~~g-~~p~~~ 355 (395) T protein:vir:10 293 IQNELNAKLITQSMY----LKDTRIEIVGVNKKDPLQYAE-AIDKLVSSG---S--------FTRNEVRIMLG-EEPSDN 355 (395) T ss_pred HHHHHHHhhcChhhh----cccceecchhhhccCHHHHHH-HHHHHHhCC---C--------cCHHHHHHHhC-CCCCCC Confidence 555555444111111 111222222222222211111 111111110 0 11100000000 000000 Q ss_pred c-eeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCC Q lcl|NC_011269. 524 M-GVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNL 575 (867) Q Consensus 524 ~-~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~ 575 (867) - +....-+..-..++...+.+ ....+ ..+ +..+. ...+- T Consensus 356 g~~d~~~~~~n~~~~~~~~~~~--~~~~~--~~~--------kgg~~-~~~g~ 395 (395) T protein:vir:10 356 PELDEYLITKNYEKANSGENDE--KEKDE--NTL--------KGGDE-DESGD 395 (395) T ss_pred CCCceeeecccccccccccccc--Ccccc--ccc--------CCCCC-CCCCC Confidence 0 00000000000111100000 00000 000 00000 00000 No 130 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=82.46 E-value=0.077 Score=26.70 Aligned_cols=437 Identities=10% Similarity=0.081 Sum_probs=140.6 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhh-- Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYA-- 121 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 121 (867) |.-| ++. .+++.....+.+.++-.. -|..+.++.+- ..+..++++-|.|.. T Consensus 1 ~~~~-~~~----~~~~~~~~~~~~~~l~~~----------------~i~~li~~~~~------~~~~r~~~l~~YY~g~~ 53 (506) T protein:vir:94 1 MDYD-LTE----HKQANLIYQESLENLTPN----------------KIMKFITHHFN------YQRPRLEMLDDYYQGYN 53 (506) T ss_pred CCcc-hhh----hhcceeecccchhcCCHH----------------HHHHHHHHHHH------HHHHHHHHHHHHhcCCC Confidence 3333 222 222211111111111000 00011111100 001112222222222 Q ss_pred --------------------ccchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhh Q lcl|NC_011269. 122 --------------------THDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTV 180 (867) Q Consensus 122 --------------------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (867) .+++.+.++|....|=++. +.++++|+..+++..+.+= +-|+...+.+ ++++.... T Consensus 54 ~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~~~d~~~~~~l~~~~~--~N~~~~~~~~-~~~~~~~~ 130 (506) T protein:vir:94 54 LKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVGNPINVKLPDDGSNSGFDTFNK--ANDVDAENYD-LFLDMSRY 130 (506) T ss_pred ccccccccccccccCCcceeecchHHHHHHHhhhhhcccCceeecCcchHHHHHHHHHh--ccCHhHHHHH-HHHHHHhc Confidence 3457788999998887776 8888888877777655443 3344445555 56888899 Q ss_pred hhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccc--cccccccccccccccchhhhhhhhh Q lcl|NC_011269. 181 GEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQ--GPTTAGGNMSTVEETPSEREQRMRE 258 (867) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 258 (867) |.++=+--.+|.+.. ...+++|.-+-+- | ++.+.- +++-.++. -...++....+.. .+..- T Consensus 131 G~a~~~v~~ded~~~--~i~~~~p~~~~~v---~--dd~~~~---~~~~~v~~~~~~~~~~~~~~~~~-------~~~~~ 193 (506) T protein:vir:94 131 GRAYEYVYRGEDNEE--HLAKLDPLDTFVI---Y--STDVDP---KPIMAVRYHQIELVDDNQVSTIN-------YVPET 193 (506) T ss_pred CeEEEEEEecCCCee--EEEEEcccceEEE---e--cCCCCC---ceEEEEEEEeeeeccCCceeEEE-------EEEEE Confidence 998866666664432 2455666544322 1 111110 00000100 0000010000000 00000 Q ss_pred HHHHHHhchHHHhh-hccCCCCcccHHHHHHhhhcCccc----cccCcchhhHHHHHHHHHHHHHHHHHHHHh---hhhc Q lcl|NC_011269. 259 FQDLQRRYPEIIQA-AMQNDGLDISEALISRVVNRPTAW----ATRGAPHLLRSFRTLMAEESLNAAQDAVAD---RLYS 330 (867) Q Consensus 259 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 330 (867) |- ++-+.. ...+.+..+-+. ..|...+-+-+ .+.|.+.+-.. +.|+ +.|+.+.--.|+ -+.. T Consensus 194 yt------~~~~~~~~~~~~~~~~~~~-~~~~~g~vPvv~~~n~~~~~sd~e~~-~~li--Da~d~~~S~~~~~~~~~~~ 263 (506) T protein:vir:94 194 WT------ADTYTLYNPTPIMGKMQVD-TTKPITTFPVVEFKNSNFRLGDFENV-LPLI--DLYDAAQSDTANYMTDLNE 263 (506) T ss_pred Ee------CceEEEeccccCccceecc-ccccCCccceEEecCCCCCCCchhhh-HHHH--HHHHHHHHHHHHHHHHhhh Confidence 00 000000 011111111111 11222122211 13344433221 2222 333333322222 2445 Q ss_pred hhhhhhhcccccCC--------------CCcCCCCH--HHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCcc--C- Q lcl|NC_011269. 331 PLVLATLGIEDMGD--------------GEPWIPDQ--GELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESV--P- 391 (867) Q Consensus 331 ~~~~~~~~~~~~~~--------------~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~- 391 (867) |+.+++=-...... |..++... +.+...++ .+++...-+..+...++...+ | T Consensus 264 ~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~d~~~l~ 334 (506) T protein:vir:94 264 AMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKD---------ANMLLLKSGMTVNGTQTSVDAKYIN 334 (506) T ss_pred HHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhh---------cCeeeecccccccCccccccceeee Confidence 55554322111111 11111110 11111111 222222222222111111111 2 Q ss_pred ---ch---hHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhhc- Q lcl|NC_011269. 392 ---NL---DADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQG- 460 (867) Q Consensus 392 ---~~---~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q~- 460 (867) ++ ...++.+++.|...-++-.... ++.+++ .+.+.+.+. -++-...+..++..++++++.|.++-. T Consensus 335 ~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~~~n--~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~ 411 (506) T protein:vir:94 335 KTYDVVGSEAYKKRVAGDIHKFSHTPDLTD-ENFASN--SSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENS 411 (506) T ss_pred ecCCHHHHHHHHHHHHHHHHHHhCcccccc-cccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 12 2456666666665554443211 111222 233333332 234455667777888888888877422 Q ss_pred ---ccch---heehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccC Q lcl|NC_011269. 461 ---HYDY---DLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLA 534 (867) Q Consensus 461 ---~~d~---~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p 534 (867) ..+. .++..|+...+++..+.=.=+.|+. -.+-.+.....+ + T Consensus 412 ~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~-----g~iS~et~~~~l---------------------------p 459 (506) T protein:vir:94 412 IHGDWTFDPQELTFTFRDNLPADNISQIKALVQAG-----ATLPQKYLYQQL---------------------------P 459 (506) T ss_pred cCCccccccccceEEeCCCCCcCHHHHHHHHHHHh-----ccCChHHHHHhC---------------------------C Confidence 1221 2344444444544322221111111 001111111111 1 Q ss_pred CCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcccccccccccc Q lcl|NC_011269. 535 VNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLA 590 (867) Q Consensus 535 ~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~ 590 (867) ...+.+ .|.++...|.............. ...........+.-+. .. T Consensus 460 ~v~d~~--~E~~ri~~E~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~e----~~ 506 (506) T protein:vir:94 460 GVTNPQ--DIVDMMKEQSANGDYSFDQNGVI---SNDGQTNTTATQTDEE----VR 506 (506) T ss_pred CCCCHH--HHHHHHHHHHHHHhhcchhhcCC---CcccCccccccccccC----CC Confidence 100000 11111111111100000000000 0000000000000000 00 No 131 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=82.23 E-value=0.079 Score=26.64 Aligned_cols=448 Identities=11% Similarity=0.091 Sum_probs=147.5 Q ss_pred HHHHHH-HhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhh---HHhhCCCchhhhHHHHHHHHHHHHHhhcc- Q lcl|NC_011269. 49 LIDYFQ-GRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTL---ADKGIPFNVEDEEELRVIRHWCRLFYATH- 123 (867) Q Consensus 49 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 123 (867) |..... +.+--+..|++- .+.+-. |+..-+..+-.=+..+ ..+.|......+ +..++++-+.|...| T Consensus 1 ~~~~~~~~~~~~~~~~~~~--~~~~~~----~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~--~~r~~~l~~YY~g~~~ 72 (512) T protein:vir:97 1 MLKANEFETDTDLRENRNY--LFNDEA----NVVYTYDGTESDLLQNINEVSKYIEHHMDYQ--RPRLKVLSDYYEGKTK 72 (512) T ss_pred CccceeccCceeeeeCcee--eecccc----ccccccCchhhhhhhhHHHHHHHHHHHHHhh--HHHHHHHHHHhcccCc Confidence 110000 000122233322 122211 2211111100000000 011111111110 112222222222233 Q ss_pred --------------------chHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhh Q lcl|NC_011269. 124 --------------------DLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGE 182 (867) Q Consensus 124 --------------------~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 182 (867) ++.+.++|.+..|=++. +.++++|+...++..+++= +-|+...+.+ +++....-|. T Consensus 73 i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~~~--~n~~~~~~~~-~~~~~~i~G~ 149 (512) T protein:vir:97 73 NLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFND--LNDVESHNRS-LGLDLSIYGK 149 (512) T ss_pred cccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeccCChHHHHHHHHHHh--hcCHHHHHHH-HHHHHHhcCe Confidence 45577899988887776 8899999988888777643 4455555555 5577888888 Q ss_pred hcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccc--cccccccccccccchhhhhhhhhHH Q lcl|NC_011269. 183 VTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGP--TTAGGNMSTVEETPSEREQRMREFQ 260 (867) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 260 (867) ++-+--.+|.+.. +..+++|.-+-+--+-...++.+- ++|-=- ..++....++ ++. T Consensus 150 ay~~vy~ded~~~--~i~~~~p~~~~~iyd~~~~~~~~~--------~vr~~~~~~~~~~~~~~~--------~~~---- 207 (512) T protein:vir:97 150 AYELMIRNQDDET--RLYKSDAMSTFVIYDNTIERNSIA--------GVRYLRTKPIDKTDEDEV--------FTV---- 207 (512) T ss_pred EEEEEEeCCCCce--EEEEEcccceEEEEcCCCCCceEE--------EEEEEEeeeccccccceE--------EEE---- Confidence 7766555554432 245667755433211111111111 111000 0000000000 000 Q ss_pred HHHHhchH-HHhhhc-cCCCCcccHHHHH---HhhhcCccc----cccCcchhhHHHHHHHHHHHHHHHHHHHH---hhh Q lcl|NC_011269. 261 DLQRRYPE-IIQAAM-QNDGLDISEALIS---RVVNRPTAW----ATRGAPHLLRSFRTLMAEESLNAAQDAVA---DRL 328 (867) Q Consensus 261 ~~~~~~~~-~~~~~~-~~~~~~~~~~~~~---~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~ 328 (867) +++ .++ |..-.. .+.++.+....+. |--..-+-. ..+|.+.+-. .+.|+ +.|+.+..-.| +.+ T Consensus 208 ~vy--t~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~e~-v~~li--Da~d~~~S~~~~~~~~~ 282 (512) T protein:vir:97 208 DLF--TSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEK-VITLI--DLYDNAESDTANYMSDL 282 (512) T ss_pred EEE--eCCcEEEEEecCCCcccccccccccccccCcccceEeecCCCCCCCchhh-hHHHH--HHHHHHHHHHHHHHHHh Confidence 000 000 111011 1112222211111 111111111 2466765543 33333 44554443333 344 Q ss_pred hchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeecccc-CccC----ch---hHHHHHH Q lcl|NC_011269. 329 YSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGR-ESVP----NL---DADYDRI 400 (867) Q Consensus 329 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~----~~---~~~~~~~ 400 (867) ..|.++++ |- .++ +-+++...+++....+... ..++-+..+..-++. -+.| ++ +..++++ T Consensus 283 ~~~~lv~~-G~----~~~----~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L 350 (512) T protein:vir:97 283 NDAMLLIK-GN----LNL----DPVEVRKQKEANVLFLEPT---VYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRL 350 (512) T ss_pred cCceeeee-cC----ccC----Cchhhhhhhhccccccccc---chhhcccccCCCCCcceEEEeecCCHHHHHHHHHHH Confidence 45554432 20 111 1122222222111111111 011112222221111 1111 22 2445666 Q ss_pred HHHHHHhhccchhhhcCCCccceehhhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHhh---cccc-----hheeh Q lcl|NC_011269. 401 ERKLLQAWGIGEALISGGTGGAYASSALNREF----VTQIMTGFQNALKRHIRRRCEVVAEAQ---GHYD-----YDLKG 468 (867) Q Consensus 401 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~q---~~~d-----~~~~~ 468 (867) ++.|..--++-... .+..+++ .|.+.+.. +-++-...++.++..++++++.|.++- +..+ .+++. T Consensus 351 ~~~I~~~s~~p~~~-~~~~~gn--~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~ 427 (512) T protein:vir:97 351 NSDIHMFTNTPNMK-DDNFSGT--QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRY 427 (512) T ss_pred HHHHHHHhCCcccC-ccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceE Confidence 66665444333211 1122222 23444433 234455667778888888888887742 2221 13444 Q ss_pred hhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhh Q lcl|NC_011269. 469 GVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQ 548 (867) Q Consensus 469 ~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k 548 (867) .|....+++..+.-.-+.++. -.+-. +..+..+ +...+. +.+.++. T Consensus 428 ~f~~~~p~~~~e~~~~~~kl~-----giiS~---------------et~~~~l------------~~v~d~--~~E~eri 473 (512) T protein:vir:97 428 VYNRNLPKSLIEELKAYIDSG-----GKISQ---------------TTLMSLF------------SFFQDP--ELEVKKI 473 (512) T ss_pred EeCCCCCcCHHHHHHHHHHHh-----ccCch---------------HHHHHhC------------CCCCCH--HHHHHHH Confidence 454444554322221111111 00000 1111111 000000 0111111 Q ss_pred HHHHHHHHhhccc-ccc---cccccccccCCCCCccccccc Q lcl|NC_011269. 549 ADETVQKLMATAQ-AMK---KVQDLCDAQNLPYPPELAQHL 585 (867) Q Consensus 549 ~~E~l~tL~~tae-t~k---kvq~~~p~~g~P~pp~~aQ~p 585 (867) ..|.-..+..... ... ............ ...-+.- T Consensus 474 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~ 512 (512) T protein:vir:97 474 EEDEKESIKKAQKGIYKDPRDINDDEQDDDTK--DTVDKKE 512 (512) T ss_pred HHHHHHHHHHHhhcccCCCCCCCCCCCCCCcc--ccccccC Confidence 1111111100000 000 000000000000 0000000 No 132 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=82.08 E-value=0.08 Score=26.60 Aligned_cols=369 Identities=12% Similarity=0.117 Sum_probs=141.3 Q ss_pred eeeccchhhh------------hhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh-cccccceec-c Q lcl|NC_011269. 80 MQIAMPKIRQ------------PLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK-FPVVGMEFD-S 145 (867) Q Consensus 80 ~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-~ 145 (867) |.|-++ |+. .+.......+..+. +.. ..+++.|-.|||+-++ ..=..++.- . T Consensus 1 Mg~~~~-f~~k~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~-~~~~~~V~~~I~~ia~~iA~~p~~~~~~ 66 (403) T protein:vir:80 1 MGLFNF-FRRKTRSEPTNAISWFLTQEAYDTLAIPG------------YTR-LSDNPEVRMAVHKIAELISSMTIHLMQN 66 (403) T ss_pred Cccccc-ccccccccccchhhhhcccccccccccch------------hhh-hhhhHHHHHHHHHHHHhhhhCceEEEEe Confidence 222211 111 01111111111111 111 2345677788887643 111112221 1 Q ss_pred cchhHHH---HHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcch--hhhhhhccceehheecCcceeehhhhhhhcc Q lcl|NC_011269. 146 KDPLIKT---FYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSL--AHFNESLGVWSSEEILNPDMLRVSRSMFVQR 217 (867) Q Consensus 146 ~~~~~~~---~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (867) ++.+.++ -..+++. -+..+-.+|+..++ .+++..|+-..| ...+. .|...++..|+|+.|.|... .+ T Consensus 67 ~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v-~~~ll~~~Gna~i~~~~~~-~g~~~~L~~l~p~~v~~~~~---~~ 141 (403) T protein:vir:80 67 TDNGDIRIKNELSRKIDINPYSLMTRKAWMYNIV-YTMLLDGEGNSVVFPKYTT-SGLIDELIPLAPSKVSFVDT---DT 141 (403) T ss_pred cCCceeecCChHHHHHhccCCcCCCHHHHHHHHH-HHHhhcCCccEEEEEEEcC-CCcEEEEEEEcCCeeEEEEc---CC Confidence 1222221 1222222 12334457887766 677776553333 34443 35678889999999987511 11 Q ss_pred hHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCcccc Q lcl|NC_011269. 218 ERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWA 297 (867) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (867) |..+ -|+. ..| +.+-|-|+......+. T Consensus 142 ---------------------g~~~---------------~y~~--~~~---------------~~~eiih~~~~~~~~~ 168 (403) T protein:vir:80 142 ---------------------GYQI---------------WYQG--KAY---------------NYDEVLHFIVNPDPEK 168 (403) T ss_pred ---------------------ceEE---------------EEee--ccc---------------chhhEEEEeccCCCcC Confidence 0000 0110 112 2334556664444444 Q ss_pred c-cCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHH-hhhcch--hhh Q lcl|NC_011269. 298 T-RGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQS-LLAADF--RLM 373 (867) Q Consensus 298 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~--~~~ 373 (867) . .|.+.+--.-.+|-.....++......+.-..|--+.++.. -.+.+..++.|+.+.. ...+++ ..+ T Consensus 169 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 239 (403) T protein:vir:80 169 PYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDA---------ATAELSSEEGRNAVFKKYLEASEAGQPW 239 (403) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC---------CCChHHHHHHHHHHHHHHhhhhhcCCee Confidence 3 47777665666666655555655666666666665555542 1234445566664432 222221 111 Q ss_pred hhhhheeeeecccc-Ccc--Cc-hh----HHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHH Q lcl|NC_011269. 374 VHNFGLKVENVFGR-ESV--PN-LD----ADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALK 445 (867) Q Consensus 374 ~~~~~~~~~~~~~~-~~~--~~-~~----~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~ 445 (867) | ++..++. +++ ++ .| +..++..++|.+++||...++..|+ +.++. ...|+..-++-+...|+ T Consensus 240 ~------~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~---~~~~~-~~~f~~~~l~P~~~~ie 309 (403) T protein:vir:80 240 I------IPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVGK---YDKDE-YNNFINSTILPIAKGIE 309 (403) T ss_pred e------ecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC---ccHHH-HHHHHHHHHHHHHHHHH Confidence 1 1111111 111 22 22 3446677889999999999985333 22222 22366666666777777 Q ss_pred HHHhhhhHHHHHhhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhce Q lcl|NC_011269. 446 RHIRRRCEVVAEAQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMG 525 (867) Q Consensus 446 ~~~r~~~~~i~e~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~ 525 (867) +.+.+.+-.- .++.++|-...+...|..+.-+ ..+-.++.. | +..+.-+...+ +..++. + T Consensus 310 ~~l~~kll~~------~~~~~~f~~~~ll~~d~~~~~~-~~~~~~~~G---i--------~t~NE~R~~~g-l~p~~g-g 369 (403) T protein:vir:80 310 QELTRKLLIS------PDLYFKFNPRSLYAYDLKELAE-VGSNMYVRG---L--------MEGNEVRDWLG-LSPKEG-L 369 (403) T ss_pred HHHHHhccCC------CCcEEEeechhhhccCHHHHHH-HHHHHHhCC---C--------cCHHHHHHHhC-CCCCCC-C Confidence 7666544110 1122333222232222211111 111111111 0 11111011000 000000 0 Q ss_pred eeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 526 VPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 526 ~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) ..+..+.. .++++.-.+. .++...... ..... .+ T Consensus 370 d~~~~~~n----~~pl~~~~~~------~~~k~ge~~------~~~~~-----~~ 403 (403) T protein:vir:80 370 SELVILEN----YIPLDKIGDQ------NKLKGGEKG------GADGQ-----TD 403 (403) T ss_pred CeEeeccc----ccchhhccch------hhccCCCCC------CCCCC-----CC Confidence 00000000 0000000000 000000000 00000 00 No 133 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=81.97 E-value=0.081 Score=26.57 Aligned_cols=441 Identities=14% Similarity=0.122 Sum_probs=148.4 Q ss_pred hhHHhhhhhcccCCchHHHHHHHHhhhcc-----hhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhh Q lcl|NC_011269. 32 MARAQAAALQNTVDNKPLIDYFQGRRRAA-----EANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDE 106 (867) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 106 (867) |+ .---.+.+ .++.+....+-. ..++..|.++-+.- T Consensus 1 ~~------~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~-------------------------------- 41 (503) T protein:vir:59 1 MA------DIYPLGKT-HTEELNEIIVESAKEIAEPDTTMIQKLIDEH-------------------------------- 41 (503) T ss_pred Cc------ccccCChh-hHHhHHHhhhhhhhhccchhHHHHHHHHHhh-------------------------------- Confidence 00 00000001 111111111111 11122222221110 Q ss_pred HHHHHHHHHHHHHhh------------------------------ccchHHHHHHhhhhccccc-ceecccchhHHHHHH Q lcl|NC_011269. 107 EELRVIRHWCRLFYA------------------------------THDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYE 155 (867) Q Consensus 107 ~~~~~~~~~~~~~~~------------------------------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 155 (867) ..+.++++-| ||. .+++.+.++|....|=++. +.++++|+..+++.+ T Consensus 42 -~~~~~~~~~~-YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~yl~g~~~~~~~~d~~~~~~l~ 119 (503) T protein:vir:59 42 -NPEPLLKGVR-YYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHKLFVDQKTQYLVGEPVTFTSDNKTLLEYVN 119 (503) T ss_pred -cHHHHHHHHH-HhccccchhhccchhcccccccccccccccceeecchHHHHHHHHHhhhhcCCeeeccCcHHHHHHHH Confidence 0111222222 232 2466788999999998887 999999999998877 Q ss_pred HHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccc Q lcl|NC_011269. 156 DLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGP 235 (867) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (867) ++ |.. |+...+.. +++.....|.++-+--.++.+.. ++.+++|..+-.- |.- .+. ++++-.+|.=. T Consensus 120 ~~-~~n--~~~~~~~~-~~~~~~~~G~~~~~v~~d~dg~~--~i~~~~p~~~~~i---~d~--~~~---~~~~~~ir~~~ 185 (503) T protein:vir:59 120 EL-ADD--DFDDILNE-TVKNMSNKGIEYWHPFVDEEGEF--DYVIFPAEEMIVV---YKD--NTR---RDILFALRYYS 185 (503) T ss_pred HH-Hhc--CHHHHHHH-HHHHHhhCCeEEEEEeecCCCce--EEEEEccceeEEE---EeC--CCC---CceEEEEEEEE Confidence 65 434 45555555 56888888988776666664332 3666777654332 111 000 00000010000 Q ss_pred c--ccccccccccccchhhhhhhh---hHHHHHHhchHHHhhh---ccCCCCcccHHHHHHhhhcCccccccCcchhhHH Q lcl|NC_011269. 236 T--TAGGNMSTVEETPSEREQRMR---EFQDLQRRYPEIIQAA---MQNDGLDISEALISRVVNRPTAWATRGAPHLLRS 307 (867) Q Consensus 236 ~--~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (867) + .++.....+|---.++..+.+ .-..+-..+.+..... ....-..+..-=|++. ++ ..+|.+.+-+ T Consensus 186 ~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~--~n---n~~~~sd~~~- 259 (503) T protein:vir:59 186 YKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPF--KN---NEEMVSDLKF- 259 (503) T ss_pred EecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEe--cC---CCCCCcchhh- Confidence 0 000000001100001110000 0000000011100000 0000000000001111 11 2457776654 Q ss_pred HHHHHHHHHHHHHHHHH---HhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeec Q lcl|NC_011269. 308 FRTLMAEESLNAAQDAV---ADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENV 384 (867) Q Consensus 308 ~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 384 (867) .++|+ +.|+.+..-. .+.+..|+++++ | .+|+. ..+.+.+++ +.+++.++=+-+++++ T Consensus 260 ~~~li--Da~d~~~s~~~~~~~~~~~~~~v~~-g----~~~~~-------~~~~~~~~~-----~~~~~~~~~~~~~~~l 320 (503) T protein:vir:59 260 YKDLI--DNYDSITSSTMDSFSDFQQIVYVLK-N----YDGEN-------PKEFTANLR-----YHSVIKVSGDGGVDTL 320 (503) T ss_pred hHHHH--HHHHHHHHHHHHHHHHhcCCeeEee-c----CCccc-------cchhhhhhh-----cccceeccCCCcceeE Confidence 34444 3344333222 244555655443 2 22222 111122221 1222222222222222 Q ss_pred cccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhh- Q lcl|NC_011269. 385 FGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQ- 459 (867) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q- 459 (867) -..-..=.+...++.+++.|...-.+...-.- .-+++ .+.+.+.+. -++....++.++..|+++++.|.++- T Consensus 321 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~~--~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~ 397 (503) T protein:vir:59 321 RAEIPVDSAAKELERIQDELYKSAQAVDNSPE-TIGGG--ATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLR 397 (503) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhcccCCCcc-ccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111112235667777777666544321110 11121 223333332 23345566677777888777776532 Q ss_pred --cccc----hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccc- Q lcl|NC_011269. 460 --GHYD----YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKT- 532 (867) Q Consensus 460 --~~~d----~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t- 532 (867) +..+ .+|+..|+...+++.. ..-+.+.++...+. ++..+ T Consensus 398 ~~~~~~~~~~~~i~i~f~~~~p~d~~---------------------------------~~~~~~~kl~~~Gi-iS~et~ 443 (503) T protein:vir:59 398 NTGKGDFNPDKELTMTFTRTRIQNDS---------------------------------EIVQSLVQGVTGGI-MSKETA 443 (503) T ss_pred hccCcccccccceeEEeCCCCCCCHH---------------------------------HHHHHHHHHHhCCC-CchHHH Confidence 2221 2344444444444322 11111222211110 11111 Q ss_pred ---cCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccccccCCCCCCCCCCCCCCCCc Q lcl|NC_011269. 533 ---LAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLALRQGKTQTELGEAQAVA 607 (867) Q Consensus 533 ---~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~a~gpgq~~~~qa~~~a 607 (867) ++...+. +.+.++...|+............ ........+ ............+.++ .+ T Consensus 444 l~~l~~v~d~--~~E~~ri~~E~~~~~~~~~~~~~-------~~~~~~~~~-~~~~~~~~~~~~~~g~--------~~ 503 (503) T protein:vir:59 444 VARNPFVQDP--EEELARIEEEMNQYAEMQGNLLD-------DEGGDDDLE-EDDPNAGAAESGGAGQ--------VS 503 (503) T ss_pred HHhCCCCCCH--HHHHHHHHHHHHHHHhhhccccC-------ccCCCCCCC-cCCCCCCcccCCCCCC--------cC Confidence 1111111 12222222222222111111000 000000000 0000000000000000 00 No 134 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=81.85 E-value=0.082 Score=26.54 Aligned_cols=401 Identities=12% Similarity=0.113 Sum_probs=149.9 Q ss_pred HHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhc--------------------cchHH Q lcl|NC_011269. 68 ASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYAT--------------------HDLVP 127 (867) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------~~~~~ 127 (867) -+|.+++.| -+|+=-..-.....+.|-....+...++.++ .||.. +++.+ T Consensus 1 ~~~~~~~~~------~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~----~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~ 70 (452) T protein:vir:36 1 MKYKPPKLM------TFSKDEPITVEVVTKFMEKHKLEVARYEYLK----NMYLGIMAIDDEPAKDSWKPDNRLAVNFTK 70 (452) T ss_pred CcccCceeE------EcCCccCCCHHHHHHHHHHHHHHHHHHHHHH----HHhccccccccCccccccCccceeecchHH Confidence 233333311 1111111111122222222222222222222 33332 25778 Q ss_pred HHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcce Q lcl|NC_011269. 128 LLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDM 206 (867) Q Consensus 128 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (867) .++|.+.-|=++. +.++++|+...++..+.+= +-|+...+.. +++.....|.++=+--.++.+.. +..+++|.- T Consensus 71 ~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~--~n~~~~~~~~-~~~~~~~~G~~~~~v~~d~~g~~--~i~~~~p~~ 145 (452) T protein:vir:36 71 YIVDTFTGYFNGIPVKKSHSDKEILTKLQEFDN--LNDMEDEESE-LAKMACIYGRAFEFLYQDEDTQT--NVVYNSPEN 145 (452) T ss_pred HHHHHHhhhhcccCceeecCChhHHHHHHHHHh--hcChhHHHHH-HHHHHHhcCeEEEEEEecCCCee--EEEEEcccc Confidence 8999999988877 8999999888888776643 4466666666 55888889987765555555433 245666655 Q ss_pred eehhhhhhhcchHHHHHHHHHHhhcccccccccccccccc-ccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHH- Q lcl|NC_011269. 207 LRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVE-ETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEA- 284 (867) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 284 (867) +-.- | ++.+. ++++-++|.-=..++. ..+| -|+ .+..+ -...+.++.+.+. T Consensus 146 ~~~v---~--d~~~~---~~~~~~i~~~~~~~~~--~~~~vyt~-~~i~~----------------~~~~~~~~~~~~~~ 198 (452) T protein:vir:36 146 MFMV---Y--DDTVK---QEPLFAVRYGVDEDKK--LQGEVYTL-LETIK----------------ISGENDEISFGEGT 198 (452) T ss_pred eEEE---E--cCCCC---CceEEEEEEEEecCce--EEEEEEec-CeEEE----------------EEEcCCceEEecce Confidence 4221 1 11110 0011111110000000 0001 011 11111 1111222222111 Q ss_pred -------HHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHH-HhhhhchhhhhhhcccccCCCCcCCCCHHHHH Q lcl|NC_011269. 285 -------LISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAV-ADRLYSPLVLATLGIEDMGDGEPWIPDQGELD 356 (867) Q Consensus 285 -------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 356 (867) =|+++.| ...|.+.+-. -+.|+-.-...-.+-+. .+.+..|.++++ |. .. +++++. T Consensus 199 ~~~~g~iPvv~~~n-----~~~g~sd~e~-v~~liDa~d~~~s~~~~~~~~~~~p~~~~~-g~----~~-----~~~~~~ 262 (452) T protein:vir:36 199 YNPYPDLPVVEFYF-----NEERMSIFES-VISLVNAFNKAISEKANDVDYFSDQYLTFL-GA----AV-----EEEDLK 262 (452) T ss_pred eccCCcccEEEecC-----CCCCCcchHH-HHHHHHHHHHHHHHHHHHHHHhcCceeEee-cC----Cc-----Cchhhh Confidence 0222222 2356665532 33443222222222222 246777877765 31 11 122322 Q ss_pred HHHHHHHHhhhcchhhhhhhhheeeeeccccC----ccC--ch-----hHHHHHHHHHHHHhhccchhhhcCCCccceeh Q lcl|NC_011269. 357 EVRDDMQSLLAADFRLMVHNFGLKVENVFGRE----SVP--NL-----DADYDRIERKLLQAWGIGEALISGGTGGAYAS 425 (867) Q Consensus 357 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~--~~-----~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 425 (867) .++. +++ +++...|... +.+ ++ ...++.+++.|...-++-. +..+..|.. T Consensus 263 ~~~~---------~~~------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~-~~~~~~gn~--- 323 (452) T protein:vir:36 263 NIRS---------NRV------INYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN-ISDESFGSS--- 323 (452) T ss_pred hhhh---------cce------EEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccc-cCcccccCC--- Confidence 2222 121 2333332211 111 12 2455666666655444322 112222221 Q ss_pred hhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHhhcccc-----hheehhhccccchhhhhhhhhhhhhHhhhhhhh Q lcl|NC_011269. 426 SALNREF----VTQIMTGFQNALKRHIRRRCEVVAEAQGHYD-----YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKL 496 (867) Q Consensus 426 ~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d-----~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~ 496 (867) +.+.+.+ +-++....+..++..+++++|.|.++..... .+++..|+...+++..+.-.-+.|+. -. T Consensus 324 Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~~-----g~ 398 (452) T protein:vir:36 324 SGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWKDIEYTFTRNEPKDIKEQAETANILM-----GI 398 (452) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHHh-----cc Confidence 2233332 2345666778888889999998887554221 12344444444443222211111110 00 Q ss_pred hhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCC Q lcl|NC_011269. 497 LIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLP 576 (867) Q Consensus 497 i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P 576 (867) + +.+..+..+ +...+. +.+.++...|................. ....... T Consensus 399 i---------------S~et~~~~~------------~~~~d~--~~E~~ri~~E~~~~~~~~~~~~~~~~~-~~~~~~~ 448 (452) T protein:vir:36 399 T---------------SQETALSVI------------SVIPDV--QAEMEKIKKEEASTAIFDKDKQPSEKG-TDTVVSE 448 (452) T ss_pred C---------------ChHHHHHhC------------CCCCCH--HHHHHHHHHHHHHHHHHHhhccCCCCc-ccccCcc Confidence 0 111111111 111000 111111111111111000000000000 0000001 Q ss_pred CCcc Q lcl|NC_011269. 577 YPPE 580 (867) Q Consensus 577 ~pp~ 580 (867) ...+ T Consensus 449 ~~~e 452 (452) T protein:vir:36 449 TNEE 452 (452) T ss_pred ccCC Confidence 1111 No 135 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=81.55 E-value=0.065 Score=27.08 Aligned_cols=446 Identities=12% Similarity=0.067 Sum_probs=151.6 Q ss_pred hhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHH--------- Q lcl|NC_011269. 38 AALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEE--------- 108 (867) Q Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------- 108 (867) .-|-|.. .+.++ +--|++. ...+.. |+..-+++. ++-..-|.++..+ T Consensus 1 ~~~~~~~--~~~~~--------~~~~~~~--~~~~~~----n~~~~~~~~--------e~~~~~~~~~i~~~i~~~~~~~ 56 (511) T protein:vir:99 1 MLKVNEF--ETDTD--------LRGNINY--LFNDEA----NVVYTYDGT--------ESDLLQNVNEVSKYIEHHMDYQ 56 (511) T ss_pred Cccccch--hhhhh--------hhhhhhh--hhhhhh----CCccccchh--------hhhhhccHHHHHHHHHHHHHhh Confidence 0111100 01111 1134443 222222 222222211 1111111111111 Q ss_pred HHHHHHHHHHHhh-ccc---------------------hHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccH Q lcl|NC_011269. 109 LRVIRHWCRLFYA-THD---------------------LVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNY 165 (867) Q Consensus 109 ~~~~~~~~~~~~~-~~~---------------------~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (867) +..++++ +.||. .|+ +.+.++|.+..|=++. +.++++|+.+.++..+.+= +-|+ T Consensus 57 ~~r~~~l-~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~--~n~~ 133 (511) T protein:vir:99 57 RPRLKVL-SDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFND--LNDV 133 (511) T ss_pred HHHHHHH-HHHhcccCccccccCcccccccCcceeecchHHHHHHHHHhhhcccCceeecCchHHHHHHHHHHh--hcCH Confidence 1122222 23442 444 4468899999988776 8899999888877766543 4456 Q ss_pred HHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccc--cccccccc Q lcl|NC_011269. 166 LEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGP--TTAGGNMS 243 (867) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 243 (867) ..++.+ +.+...+.|.++-+--.+|.+.. +..+++|.-+-+-.+-...++.+- ++|.=. ..++.... T Consensus 134 ~~~~~~-~~~~~~i~G~a~~~vy~ded~~~--~i~~~~p~~~~~vyd~~~~~~~~~--------~vr~~~~~~~~~~~~~ 202 (511) T protein:vir:99 134 ESHNRS-LGLDLSIYGKAYELMIRNQDDET--RLYKSDAMSTFVIYDNTIERNSIA--------GVRYLRTKPIDKTDED 202 (511) T ss_pred hHHHHH-HHHHHHhcCeeEEEEEeCCCCce--EEEEEccceeEEEEcCCCCCceEE--------EEEEEEeeecccCccc Confidence 666666 55888888887766555665543 245566655433211111111111 000000 00000000 Q ss_pred c---ccc-cchhhhhhhhh--HHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHH Q lcl|NC_011269. 244 T---VEE-TPSEREQRMRE--FQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESL 317 (867) Q Consensus 244 ~---~~~-~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (867) + .|- |+. +..+.+. ...+.-.+-++... .-.+..==|+++.| ...|.+.+=. ..+|+ +.| T Consensus 203 ~~~~~~vyt~~-~i~~~~~~~~~~~~~~~~~~~~~-----~~~~g~vPvv~~~n-----n~~g~sd~e~-v~~li--Da~ 268 (511) T protein:vir:99 203 EVFTVDLFTSH-GVYRYLTSRTNGLKLTPRENGFE-----SHSFERMPITEFSN-----NERRKGDYEK-VITLI--DLY 268 (511) T ss_pred eEEEEEEEeCC-cEEEEEecCCccccccccccccc-----cCCCCccceEEecC-----CCCCCCchhh-hHHHH--HHH Confidence 0 000 000 0000000 00000000000000 00000001223322 2456665543 23333 334 Q ss_pred HHHHHHHH---hhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHh-hhcchhhhhhhhhee------eeecccc Q lcl|NC_011269. 318 NAAQDAVA---DRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSL-LAADFRLMVHNFGLK------VENVFGR 387 (867) Q Consensus 318 ~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~------~~~~~~~ 387 (867) +.+..-+| +.+..|+++++ |.. +. +-+++...++ .. +..+-.......++. ++++=.. T Consensus 269 d~~~S~~~~~~~~~~~~~lv~~-G~~----~~----~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~ 336 (511) T protein:vir:99 269 DNAESDTANYMSDLNDAMLLIK-GNL----NL----DPVEVRKQKE---ANVLFLEPTVYADSEGRETEGSVDGGYIYKQ 336 (511) T ss_pred HHHHHHHHHHHHHhhchhhhhc-cCc----cc----Cchhhccccc---ccceecccccccccccccCCCCcceeEEeec Confidence 44333333 34566665554 310 10 1112222222 11 111111111112221 1111111 Q ss_pred CccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhh---c Q lcl|NC_011269. 388 ESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQ---G 460 (867) Q Consensus 388 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q---~ 460 (867) -..=.++..++.+++.|...-++-..... +-+++ .|.+.+.+. -++-...+..++..++++++.|.++- + T Consensus 337 ~~~~~~e~~~~~L~~~I~~~s~~P~~~~~-~~~gn--~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~ 413 (511) T protein:vir:99 337 YDVQGTEAYKDRLNSDIHMFTNTPNMKDD-NFSGT--QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTR 413 (511) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCcccccc-ccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 11112235566666666655444332111 12222 233334332 23345566777888888888887753 2 Q ss_pred ccc-----hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccc-cccchhhhhhhhhhhhhceeeeeccc-c Q lcl|NC_011269. 461 HYD-----YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTL-NLRDEAQERAFIAQLKGMGVPVSDKT-L 533 (867) Q Consensus 461 ~~d-----~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~-~Lr~e~~~~~~v~qL~~~~~pitd~t-~ 533 (867) .++ ..++..|....+++..+.-.-+.|+.- .+-.+.....+ .+.+.+.+...|.+-+.......... . T Consensus 414 ~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~G-----iiS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~ 488 (511) T protein:vir:99 414 SIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSGG-----KISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKNMY 488 (511) T ss_pred CcccccccccceEEeCCCCCcCHHHHHHHHHHHhc-----cCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccc Confidence 222 135556655556554332222222210 11111111111 11111111111111100000000000 0 Q ss_pred CCCcccccchhhhhhHHHHHHHHhhcccccccccccccc-cCCCCCcc Q lcl|NC_011269. 534 AVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDA-QNLPYPPE 580 (867) Q Consensus 534 p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~-~g~P~pp~ 580 (867) .....+.- ....+. .......+ T Consensus 489 ~~~~~~~~-------------------------~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 489 QDPRNIND-------------------------DEQDDSTKDSIDKKE 511 (511) T ss_pred ccCCCCCC-------------------------CCCCCCCcCcccccC Confidence 00000000 000000 00000001 No 136 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=80.67 E-value=0.093 Score=26.25 Aligned_cols=351 Identities=12% Similarity=0.092 Sum_probs=129.0 Q ss_pred ccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHH-HhhccchHHHHHHhhhh----cccccceecccc Q lcl|NC_011269. 73 QGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRL-FYATHDLVPLLIDIYSK----FPVVGMEFDSKD 147 (867) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~ 147 (867) -|.|+..- .......+.+.+ .+..|--+ ....++.|-.||++-+. .|+.=++-+.+| T Consensus 1 Mg~f~~~~------------~~~~~~~~~~~~------~~~~~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~ 62 (378) T protein:vir:16 1 MNLFGKVV------------SFSRGKLNNDTQ------RVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSD 62 (378) T ss_pred Cccchhhh------------hhhcccccCCcc------eeeecccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccc Confidence 11111111 111111111111 01112111 11255667777776543 333222222222 Q ss_pred hhHH-------HHHHHHh---hcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcc Q lcl|NC_011269. 148 PLIK-------TFYEDLF---FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQR 217 (867) Q Consensus 148 ~~~~-------~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (867) -..+ .-..++. =.+..+-.+|+..++ ..++.-|+++-+...+...|. .+-|.|+.+. T Consensus 63 ~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gna~i~~~~d~~~g~---~~~l~~~~~~--------- 129 (378) T protein:vir:16 63 VGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVI-KKLLRAPYVDLYAVFDDNTGE---LLDLLFADDK--------- 129 (378) T ss_pred cccccccccccchHHHHHhhcCCCCCCHHHHHHHHH-HHHhhcCceEEEEEeecCCce---EEEEEecCCe--------- Confidence 1111 1111111 124455568888866 788888998866554433221 1112222111 Q ss_pred hHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCcccc Q lcl|NC_011269. 218 ERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWA 297 (867) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (867) ..++..-|-|+.+. -+- T Consensus 130 -------------------------------------------------------------~~~~~~diih~r~~--~~~ 146 (378) T protein:vir:16 130 -------------------------------------------------------------KEYKPEELVRLTSP--FYI 146 (378) T ss_pred -------------------------------------------------------------eEecccceEEecCc--cCc Confidence 11112234454421 111 Q ss_pred ccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhch--hhhhhhcccccCCCCcCCCC--HHHHHHHHHHHHHhhhcch--h Q lcl|NC_011269. 298 TRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSP--LVLATLGIEDMGDGEPWIPD--QGELDEVRDDMQSLLAADF--R 371 (867) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~--~ 371 (867) -.|...+ ..|.++|+..+.+= =-+++..+ .+-++ ++..+.+|+.++....++. . T Consensus 147 ~~~~s~l-------------~~~~~~i~~~~~~~~~~g~l~~~~-------~l~~~~~~~~~~~~~~~~~~~~~~~~~g~ 206 (378) T protein:vir:16 147 NEDTSIL-------------DNALASIQTKLEQGKLRGLLKINA-------FLDIDNTQEYREKALTTIKNMQEGSSYNG 206 (378) T ss_pred cchhHHH-------------HHHHHHHHHHHhcCccceeeEeCC-------cCCHHHHHHHHHHHHHHHHHhhccccccc Confidence 2232222 12222222222211 01122221 11111 2233444444444444443 5 Q ss_pred hhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011269. 372 LMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRR 451 (867) Q Consensus 372 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~ 451 (867) ++|-.-|++++-...+-..+.++ +.++++++|.+++||...+++ | +|+. +-...|++.-+.-+...|++++.+. T Consensus 207 ~~vl~~g~~~~~l~~~~~~~~~~-~~~~~~~~Ia~~fgVPp~~l~-g---~~~e-~~~~~f~~~tl~P~~~~ie~~l~~k 280 (378) T protein:vir:16 207 LTPVDNKTEIVELKKDYSVLNKD-EIDLIKSELLTGYFMNENILL-G---TASQ-EQQIYFYNSTIIPLLIQLEKELTYK 280 (378) T ss_pred ceEcCCCceEEEccCChhhhhHH-HHHHHHHHHHHHhCCCHHHhc-C---CchH-HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 67777889999888877777774 678999999999999999997 4 3333 3334455555666666666666655 Q ss_pred hHHHHH-hhccc---chheehhhccccchhhhhhhhhhhh--hHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhce Q lcl|NC_011269. 452 CEVVAE-AQGHY---DYDLKGGVRVPIYREIVEYDEETGQ--EYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMG 525 (867) Q Consensus 452 ~~~i~e-~q~~~---d~~~~~~~~~~~~rd~~~~k~e~~k--~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~ 525 (867) +=.-.| -++++ ..++.|-+..+. ..++++++.- ..+... -+.. .|.=+...+..++. + T Consensus 281 Ll~~~e~~~~~~~~~~~~~~f~~~~l~---~~d~~~~~~~~~~~~~~G-----------~~T~-NE~R~~~g~~p~~g-g 344 (378) T protein:vir:16 281 LISTNRRRVVKGNLYYERIIVDNQLFK---FATLKELIDLYHENINGP-----------IFTQ-NQLLVKMGEQPIEG-G 344 (378) T ss_pred cCChhhhhhhhhcccccceeeccchhh---hcCHHHHHHHHHHHHhCC-----------CcCH-HHHHHHhCCCCCCC-C Confidence 522222 12211 112222222222 2222322211 111110 0000 00000000000000 0 Q ss_pred eeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccc Q lcl|NC_011269. 526 VPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQ 583 (867) Q Consensus 526 ~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ 583 (867) ..+--+..-.. ++.. ..........++.+.. ..+ T Consensus 345 D~~~~~~n~~~----~~~~---------~~~~~~~~~~~~~~e~-----------~ne 378 (378) T protein:vir:16 345 DVYIANLNAVA----VKNL---------SDLQGSRKDVTSTDET-----------NNQ 378 (378) T ss_pred CeEeecccccc----ccch---------hhhcCccCCCCCCCCC-----------CCC Confidence 00000000000 0000 0000000000000000 000 No 137 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=80.48 E-value=0.014 Score=30.74 Aligned_cols=593 Identities=13% Similarity=0.118 Sum_probs=149.7 Q ss_pred CCCcccccccchhHHHHH-HHHhcCCCCCCchhhHHhhhhhcccCCch---HHHHHHHHhhhcchhHHHHHHHHhccccc Q lcl|NC_011269. 1 MSSPIYKAGSNWSAEVNR-LRKAGVNMPNSPTMARAQAAALQNTVDNK---PLIDYFQGRRRAAEANRQRLASYRKQGNF 76 (867) Q Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (867) .-.=-|..|.-|+.|+.. |+..|. |+ +..| +.++...+..+ .||.. T Consensus 45 ~~d~~fy~G~QW~~~~~~~l~~~g~-----p~------------~~~N~i~~~v~~v~g~~~---~nr~d---------- 94 (772) T protein:vir:10 45 DKEMDYADGNQLDTELLRRQQALGI-----PP------------AVEDLIGPALLSLQGYEA---VTRTD---------- 94 (772) T ss_pred HHHHHhhcCCCCCHHHHHHHHhcCC-----Cc------------EEEcchHHHHHHHHHHHH---hcCcc---------- Confidence 011113345666655533 222221 11 1111 12222222111 11111 Q ss_pred ccceeeccchhhhhhhhhHHhhCCC-chhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcccccceec---ccchhHHH Q lcl|NC_011269. 77 GSNMQIAMPKIRQPLGTLADKGIPF-NVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFD---SKDPLIKT 152 (867) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 152 (867) . +-+|+ +..|++--+++-.=|+++......=..|-|.|...-++||-.. .+++..++ T Consensus 95 --------~-----------~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~~~ 155 (772) T protein:vir:10 95 --------W-----------RVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDPFKF 155 (772) T ss_pred --------e-----------EEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCCCCC Confidence 0 12454 3445444445555555555555555667787777777775432 22222111 Q ss_pred HHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhcccee-hheecCcceeehhhhhhhcc-hHHHHHHHHHHh- Q lcl|NC_011269. 153 FYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWS-SEEILNPDMLRVSRSMFVQR-ERVQLMVKDLVD- 229 (867) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~- 229 (867) +... .++|-.+|+.|...+ . +-|+-.|- +...++.|.+ .+||-.+ ++|....++..+ T Consensus 156 ---~i~i-~~v~p~~v~~Dp~a~------------~-D~sDar~~~~~~~~~~d~~---~~~fp~~a~~~~~~~~~~~~~ 215 (772) T protein:vir:10 156 ---PYRC-RPIRRDEIHWDMKCG------------D-DWEACRFLRRQRWLSPDRI---ALVFPEHAELIGMVGKYGSTW 215 (772) T ss_pred ---CeEE-EeeCcccceecCCCC------------C-CHHHhhhhhhhccCCHHHH---HHhCCCchhHHHhhhhhcccc Confidence 1111 234555555442210 0 11111111 0111111111 1223321 111111110000 Q ss_pred ------hccccccccccccccccc------------cchhhhhhhhhHHHHHHhchHHH--hhhccCCCCcccHHHHH-- Q lcl|NC_011269. 230 ------HLRQGPTTAGGNMSTVEE------------TPSEREQRMREFQDLQRRYPEII--QAAMQNDGLDISEALIS-- 287 (867) Q Consensus 230 ------~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~-- 287 (867) ..-.|-.++|- .+..++ +++.+.=|.-|| .+++ |+.. =....|+++.++..-+. T Consensus 216 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~--w~r~-~~~~~~~~~~~g~~~~~~~~~~~~~ 291 (772) T protein:vir:10 216 WGQPDLGMMEGGTSTGL-HNAWNEARAWTVQEDHWYNPTSKEICLVEL--WYRR-WVQVHVLKSPDGRVVEYDPNNLAHN 291 (772) T ss_pred cCccccccccccccccc-ccccchhhccccccccccccCCceEEEEEE--eeee-eeeeeeeccCCCceEeeCcccHHHH Confidence 00111111110 000000 111122223333 1222 2210 11233444444332221 Q ss_pred ----------------Hh----------hh-cCccccccCcchh-hHHHHHHH------HHHHHHHHHHHHHhhhh---c Q lcl|NC_011269. 288 ----------------RV----------VN-RPTAWATRGAPHL-LRSFRTLM------AEESLNAAQDAVADRLY---S 330 (867) Q Consensus 288 ----------------~~----------~~-~~~~~~~~~~~~~-~~~~~~~~------~~~~~~~~~~~~~~~~~---~ 330 (867) || .+ ..++|.-..-|++ .-+|+.-. .-+.++-+|+-+=+|.- - T Consensus 292 ~~l~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~~~ 371 (772) T protein:vir:10 292 IALASGRISPKKVTVSRVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSGVSKLRW 371 (772) T ss_pred HHHhhcccchheeeeeEEEEEEEecceeeccCCCCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHHHHHHHH Confidence 11 11 3455544334433 11111000 01222333332222111 0 Q ss_pred hhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeecccc---CccCchh----HHHHHHHHH Q lcl|NC_011269. 331 PLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGR---ESVPNLD----ADYDRIERK 403 (867) Q Consensus 331 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~----~~~~~~~~~ 403 (867) =|-..+++. -.|- +|.+.+.+..-++=+.-+++||.+--.. .|.. ..-..|. +-+..-.+. T Consensus 372 ~l~~~~~~~---~~ga--------v~~~d~~~~e~~arp~~vi~~~~~~~~~-~~~~~~~~~~~~~~~~~~~llq~~~~~ 439 (772) T protein:vir:10 372 GMSVARVER---TKGA--------VAMTDAQFRRQIARPDADIVLDENHMAK-PGARFDVKRDYTLTDQHFQMLQDNRAT 439 (772) T ss_pred HHhcccccc---cCCC--------ccchhHHHHHhccCCCCeEEeCCccccC-CCCCccccCCccccHHHHHHHHHHHHH Confidence 011111111 1111 1111111222333334556666531110 0111 1112122 334444455 Q ss_pred HHHhhccchhhhcCCCccceehhhhh--HHHHHHHHHHHHHHHHHHHhhhhHHHHH-hhcccchheeh-hhcc--ccch- Q lcl|NC_011269. 404 LLQAWGIGEALISGGTGGAYASSALN--REFVTQIMTGFQNALKRHIRRRCEVVAE-AQGHYDYDLKG-GVRV--PIYR- 476 (867) Q Consensus 404 ~~~~~~~~~~~~~~g~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~~r~~~~~i~e-~q~~~d~~~~~-~~~~--~~~r- 476 (867) |-..-||+.+++- -.|.+.+.+++. .+...+.++.|...|+.++|++.+.+-+ |+.+|+....+ ++.. ...+ T Consensus 440 i~~vsGv~~~~lG-~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~ 518 (772) T protein:vir:10 440 IERVSNITAGFQG-RKGTATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADR 518 (772) T ss_pred HHHHhCCCHHHcC-CCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCc Confidence 6666699999864 677666665544 2333344788899999999999876644 56666543222 2110 0001 Q ss_pred --------------------hh--h-------------hhhhhhhhhHhhhhhhhhhhhhccccc----cccchhhhhhh Q lcl|NC_011269. 477 --------------------EI--V-------------EYDEETGQEYIRKVPKLLIPEIKFSTL----NLRDEAQERAF 517 (867) Q Consensus 477 --------------------d~--~-------------~~k~e~~k~~~r~~~k~i~~~i~~~~~----~Lr~e~~~~~~ 517 (867) |+ . .++++..+ .....++.+.+.+....+ ++.+-...+.. T Consensus 519 ~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~-~m~ql~~~~~P~~~~~~~~~~le~~D~p~~~ei 597 (772) T protein:vir:10 519 VVVLNEPQRDPQTGAAYLSNDLLRTRIKVALEDVPSTNSYRGQQLN-AMSEAVKSMPPQYQAAVLPFLVSLMDVPFKRDV 597 (772) T ss_pred eEEeccceecccccccceeccceeeeEEEEeeccccchHHHHHHHH-HHHHHHhccChhHHHHHHHHHHhhcCCCChHHH Confidence 11 0 01111111 000000000111111100 11111001111 Q ss_pred hhhhhhceeeeeccccCCCcccccch--hhhhhHHH-----HHHHHhhcc----cccccccccccccCCCCCcccccccc Q lcl|NC_011269. 518 IAQLKGMGVPVSDKTLAVNIDMKFDQ--ELERQADE-----TVQKLMATA----QAMKKVQDLCDAQNLPYPPELAQHLQ 586 (867) Q Consensus 518 v~qL~~~~~pitd~t~p~tiqme~E~--e~e~k~~E-----~l~tL~~ta----et~kkvq~~~p~~g~P~pp~~aQ~p~ 586 (867) +..++.. ...-+ + .++.... ........ ...++.... ....+........... .... ... T Consensus 598 ~~~ir~~-~~~~~---p--eq~~~~~~q~~qq~~~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~--a~~~-a~~ 668 (772) T protein:vir:10 598 VEAIRAV-DQQQT---P--EQIQQQIDQAVQDALAKAGNDIKLRELEIKERKADSEISGLNAKAVQIGVQ--AAFS-AMQ 668 (772) T ss_pred HHHHHHH-hccCC---h--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH-Hhh Confidence 1111100 00000 0 0000000 00000000 000000000 0000000000000000 0000 000 Q ss_pred ccccCCCCCCCCCCCCCCCCccCCCCccCCCCCCCccCccCcCCCCCCCCCCCCCcccccccccCccCCcCCCCCCCCCc Q lcl|NC_011269. 587 STLALRQGKTQTELGEAQAVAGEAQAELQTKQIEMQEMMMDQQMAGGVMPGQPMLPPGAPGDPAAGGPPPPAGGPMGGPP 666 (867) Q Consensus 587 ~t~~~a~gpgq~~~~qa~~~agq~~~p~~~~~~~~qp~~~~qg~pG~~gPpGP~gPpG~pG~pgP~gPpPp~~gP~G~Pp 666 (867) ......+.+ ..+.....-.. ..+...+.... ..+..|.....++....+.+ .+...++ T Consensus 669 aa~~~~q~~---------q~a~~ad~~l~-----~~g~~~~~~~~--~~~~~p~~~~~a~~~~~~~~------~~~~~~~ 726 (772) T protein:vir:10 669 AGAQIAQMP---------MIAPIADAVMQ-----SAGYQRPNPAG--DDPNYPIADQTAAMNIRSPY------IQGQGPA 726 (772) T ss_pred hhhhHHhhh---------hhhHHHHHHHH-----hcccccccccc--cCCCCCCCCCccCCCCCccC------CCCCCCC Confidence 000000000 00000000000 00000000000 00000000000000000000 0000000 Q ss_pred CCCCccccc-ccccccccccccchhcccccccccccccccccccccccccccccCCCCCCcccccccc Q lcl|NC_011269. 667 VAPAPGVAG-PGNAPASFYAASLRTADAINGPTGTGPSADGPLGPTGPELPPGVPEPTEVPRNRQRPA 733 (867) Q Consensus 667 ~~p~PGaPG-P~g~Pg~~gppg~pG~~g~~GP~G~gPgapGP~GP~GP~gpPG~PgP~gPpg~~g~Pg 733 (867) ..+.+++.+ +.+.++..+. .+..+ +.+..|...|+.+---.-..| T Consensus 727 ~~~~~~~~~~~~~~~p~~p~------~~q~~----------------~~~~~g~~~~~~~~~~~~~~~ 772 (772) T protein:vir:10 727 AEAEAESVSVRRNTSPTYPP------VPEEA----------------PTGLRGIETPSTADNLSVRGG 772 (772) T ss_pred CccccCCCCCccCCCCCCCC------CCccc----------------CCCCCCCCCCCCCccceecCC Confidence 000000000 0000000000 00000 000011111110000000000 No 138 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=80.25 E-value=0.097 Score=26.15 Aligned_cols=451 Identities=13% Similarity=0.028 Sum_probs=164.4 Q ss_pred ccccccchhHHHHHHHHhcCCCCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeecc Q lcl|NC_011269. 5 IYKAGSNWSAEVNRLRKAGVNMPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAM 84 (867) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (867) .|--+..=. .+.+.-+.++++. +-..+.+....+....++.|+.++.+--.+ +-.| T Consensus 1 ~~~~~~~~~----~~~~~~~~~~~~~----------------~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g--~~~i-- 56 (472) T protein:vir:93 1 MYPSQPTQT----EIFDAIVRTNNKP----------------ETLEEMIVRYIKQHLEKLPEISIGQEYYEQ--RPDI-- 56 (472) T ss_pred CCCCCCcch----hhhhceeeecCch----------------hhHHHHHHHHHHHHHHHHHHHHHHHHHhcc--cccc-- Confidence 121111111 1112222222221 112222333333334455555554442211 1111 Q ss_pred chhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccc Q lcl|NC_011269. 85 PKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDL 163 (867) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 163 (867) +.++.....+. .++ ..|. .+-..+++.+.++|.+..|=++. +.++.+|+...++.+++ |.. T Consensus 57 --~~~~~~~~~~~-----~~~-----~~~~---~~ri~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~-~~n-- 118 (472) T protein:vir:93 57 --VKEPKPVDATG-----AVD-----PLKP---DDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEV-LGN-- 118 (472) T ss_pred --ccccchhhccc-----ccc-----cccc---ccccccchHHHHHHHHhhhhcccCeeeccCChHHHHHHHHH-Hhc-- Confidence 11111111000 000 0111 11224688889999999998877 89999999988888765 544 Q ss_pred cHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccc Q lcl|NC_011269. 164 NYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMS 243 (867) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (867) |+...+.+ +++.....|.++-+--.++.+.. +..+++|..+-+--+--.-++.+- .+|.=-..+.. T Consensus 119 ~~~~~~~~-~~~~~~~~G~~~~~v~~d~d~~~--~i~~~~p~~~~~i~d~~~~~~~~~--------~ir~~~~~~~~--- 184 (472) T protein:vir:93 119 RFDDKLHS-VLTGASNKGIEWLHPYLDEEGEF--KLFRVPAEQGIPIWTDKEHEELEA--------FIRMYKLENET--- 184 (472) T ss_pred cHHHHHHH-HHHHHhhcCeEEEEEEECCCCce--EEEEEcccceEEEEcCCCCCceEE--------EEEEEEeecce--- Confidence 44466666 45888888988766655555433 355677765544311000111111 11100000000 Q ss_pred cccccchhhhhhhhhH-------HHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHH Q lcl|NC_011269. 244 TVEETPSEREQRMREF-------QDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEES 316 (867) Q Consensus 244 ~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (867) .+|---..+..+.+-+ ..-...++++-... .+ +..==|+++.| .++|.+.+=. .+.|+ +. T Consensus 185 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~--~~~vPvv~~~n-----n~~g~s~~e~-v~~li--Da 251 (472) T protein:vir:93 185 KVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFST---GS--WGKIPFIPFKN-----NDLEISDIFM-YKTLI--DA 251 (472) T ss_pred eEEEEecCeEEEEEEecCeeeeccccccccccccccc---CC--CCCcceEEecC-----CCCCCCchhh-hHHHH--HH Confidence 0000000000000000 00000111110000 00 11111223332 3467777653 44554 33 Q ss_pred HHHHHHHHH--hh-hhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCch Q lcl|NC_011269. 317 LNAAQDAVA--DR-LYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNL 393 (867) Q Consensus 317 ~~~~~~~~~--~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (867) |+.+..-.+ -+ +..|.++++ |. +.+++++.+++++.. .++...=+-+++++=..-..=.+ T Consensus 252 ~~~~~s~~~~~~~~~~~~~~~~~-g~-----------~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~l~~~~~~~~~ 314 (472) T protein:vir:93 252 YNRRLSDLSNTFKDSNELTYVLT-NY-----------DDQELPEFKRLLRYY-----GAIKVSDNGGVDTIQVEVPVENS 314 (472) T ss_pred HHHHHHHHHHHHHHhcCceeEee-cC-----------CcccchhhHHHHhhc-----cccccCCCCcceeEeecCCHHHH Confidence 333322222 22 444544443 21 112333444433321 11111111222222111111223 Q ss_pred hHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHhhcccc--hhee Q lcl|NC_011269. 394 DADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREF----VTQIMTGFQNALKRHIRRRCEVVAEAQGHYD--YDLK 467 (867) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d--~~~~ 467 (867) ...++.+++.|..--++-..-. +..|++ .+.+.+.+ +-++.-..+..++..++++++.|.++.+.-+ .++. T Consensus 315 ~~~~~~l~~~i~~~s~~p~~~~-~~~~~n--~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~i~ 391 (472) T protein:vir:93 315 KKYLDELYQKIMLFGQAVDFSS-DKFGSA--PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVD 391 (472) T ss_pred HHHHHHHHHHHHHHhCCCCCCc-cccccC--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceee Confidence 3567777777766655443211 122222 23444433 2334556677888888999988888765322 2233 Q ss_pred hhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccc----cCCCcccccch Q lcl|NC_011269. 468 GGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKT----LAVNIDMKFDQ 543 (867) Q Consensus 468 ~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t----~p~tiqme~E~ 543 (867) ..|.-..+++.. ..-..+.++.. .++..+ ++...+ .+. T Consensus 392 v~f~~~~p~~~~---------------------------------~~~~~~~k~~g---iis~et~l~~l~~~~d--~~~ 433 (472) T protein:vir:93 392 ISFNYNKVANTE---------------------------------LQVQTAQQSMG---IVSHETVLENHPFVED--LQA 433 (472) T ss_pred EEeCCCCCCCHH---------------------------------HHHHHHHHHhc---cCchHHHHHhCCCCCC--HHH Confidence 333333333211 11111111100 011111 111111 112 Q ss_pred hhhhhHHHHHHHHhhcccccccccccccccCCCCCcccccccccccc Q lcl|NC_011269. 544 ELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLA 590 (867) Q Consensus 544 e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~ 590 (867) +.++...|......... .............. .......+ T Consensus 434 E~~ri~~E~~~~~~~~~-~~~~~~~d~~~~~~-------~~~~~~~e 472 (472) T protein:vir:93 434 ELERIEQEQMEYNKQLP-NLDDGGADGAQQQE-------RSNNKESE 472 (472) T ss_pred HHHHHHHHHHHHHHhcc-CcCcccCCCCCCCC-------CCCcccCC Confidence 22222222222111111 01110000000000 00011111 No 139 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=79.95 E-value=0.099 Score=26.08 Aligned_cols=495 Identities=10% Similarity=0.012 Sum_probs=158.7 Q ss_pred HHHHHHHhcCCCCCCch---hhHHhhhhhcccCCchHHHHHHHHhhhcch---hHHHHHH-------HHhcc--cccccc Q lcl|NC_011269. 15 EVNRLRKAGVNMPNSPT---MARAQAAALQNTVDNKPLIDYFQGRRRAAE---ANRQRLA-------SYRKQ--GNFGSN 79 (867) Q Consensus 15 ~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~-------~~~~~--~~~~~~ 79 (867) -.+-.||.-+. |--+. .++-.+...+.+-.-.....+.+.-..++. +|.-++. .|+-. .+...| T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~ 79 (537) T protein:vir:10 1 MFKFWRKKTVE-AVQSSIAERIEPRVGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGT 79 (537) T ss_pred CCCcccccccc-ccccccccccccccCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhh Confidence 01112222211 00000 000000000000000011111111110000 0000000 00000 000011 Q ss_pred ee----eccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccc-ceecccc------h Q lcl|NC_011269. 80 MQ----IAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVG-MEFDSKD------P 148 (867) Q Consensus 80 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~------~ 148 (867) +. ...++-. ..-..+.++... + -|- .|+.|.|+...||+.-+..+.- +++.+.| + T Consensus 80 ~~~~~~~~~~~~~------~~~~~~~~~~~~-~------l~a-~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~ 145 (537) T protein:vir:10 80 FSAYANPNLSEGL------VLWYAQQAFIGH-Q------MCA-LIATHWLVNKACSQMPRDAMRKGYKIISDDGNELDPK 145 (537) T ss_pred hhhhccccccchh------hhhccccCCccH-H------HHH-HHHhCchhhhhhhhhhHHhhcCCceeecCCcccccHH Confidence 11 1111111 111112222222 2 353 4899999999999998887665 7777643 3 Q ss_pred hHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhc-cceehheecCcceee---hhhhhhhcchHHH--- Q lcl|NC_011269. 149 LIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESL-GVWSSEEILNPDMLR---VSRSMFVQRERVQ--- 221 (867) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~---~~~~~~~~~~~~~--- 221 (867) .+ +.++... ++|++-.-|.|.+-+-- .-|...-+-....++ ..|++- ||++.|. ++ . |..=++.. T Consensus 146 ~~-~~l~~~~--~~l~~~~~l~~a~~~~r-lyG~~~i~i~v~~~D~~~~~~P--l~~~~i~kg~~k-~-l~vidp~~~~~ 217 (537) T protein:vir:10 146 DA-KFIDRYD--RAFNIKKHAIQFVRKGR-IFGIRIALFKVDSPDPYYYEKP--FNIDGVMPGAYK-G-IVQIDPYWCAP 217 (537) T ss_pred HH-HHHHHHH--HHhhHHHHHHHHHHhcc-cccceEEEEeecCcCCcccccc--ccccccccccee-E-EEEechhhccc Confidence 44 4455544 48999999998763322 235432222222222 224322 2343321 11 0 11111111 Q ss_pred HHHHHHHhhccccccccccccccccccchhhhhhhhhHHHH-HHhchH-HHhhhccCCCCcccHHHHHHhhhcCcccccc Q lcl|NC_011269. 222 LMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDL-QRRYPE-IIQAAMQNDGLDISEALISRVVNRPTAWATR 299 (867) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (867) +++..+.++ |.+..- .+-..|+-- ++-+|. ||.. .|-++++.+ .+.|.-| T Consensus 218 ~~~~~~~~d----p~sp~f-------------g~P~~y~v~g~~iH~SRli~f----~g~~~p~~~-------~~~~~~~ 269 (537) T protein:vir:10 218 LLDAQASSN----PVSMHF-------------YEPTYWLINGKKYHRSHLAIY----INDEVVDFL-------KPSYIYG 269 (537) T ss_pred ccchhhhcc----CCcccc-------------CCceeeeecCeEecceeEEEe----cCCCCchhh-------hcccCcc Confidence 011111111 111100 000000000 011222 2111 122222221 2345568 Q ss_pred CcchhhHHHHHHHHHHHHHHHHHHHHhhhhch-hhhhhh-cccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhh Q lcl|NC_011269. 300 GAPHLLRSFRTLMAEESLNAAQDAVADRLYSP-LVLATL-GIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNF 377 (867) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 377 (867) |.|++-+++-.|..-++ +...+++=++.. ++++|+ |...| -+.+.|....+=+....-....|+|-.= T Consensus 270 G~Svlq~~~~~l~~~~~---t~~~~~~l~~~~~~~v~k~~~~~~l-------~~~~~~~~r~~~~~~~r~n~g~~~id~e 339 (537) T protein:vir:10 270 GVPLPQQIMERVYAAER---TANEGPMLAMTKRQTVLKVDAAQVL-------ANKQQFDETMSWWTATRDNYQVRVVDKD 339 (537) T ss_pred cccHHHHHHHHHHHHHH---HHHHHHHHHHhcCCceeeechHHhh-------cCHHHHHHHHHHHHhhcCCcceeEecCC Confidence 99999999888765444 334444433322 222233 22223 2334443332211111111111111111 Q ss_pred heeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccc-eeh-hhhhHHHHHHHHHHHHHHHHHHHhhhhHHH Q lcl|NC_011269. 378 GLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGA-YAS-SALNREFVTQIMTGFQNALKRHIRRRCEVV 455 (867) Q Consensus 378 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~-~~~-~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i 455 (867) +-.++.+=++ +=.|++=++..+..|--+.||--..+. |..+. |.+ ..=-+.-....+-.+|.+|+-.+.++.+.| T Consensus 340 ~e~~e~~~~~--lsgl~~~l~~~~~~iAa~~~IP~t~L~-G~sp~GlnatGe~D~~~yyd~I~~~Qe~l~p~l~~l~~ll 416 (537) T protein:vir:10 340 NEDVVQIDTT--LNDLDKVIMNQYQLVCAIARTPAPKML-GTVPTGFNSTGDYEEASYHEECESTQDDMRPLIDRHHQLV 416 (537) T ss_pred CceeEEEecc--CCCHHHHHHHHHHHHHhhhCCCceeec-cCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1222222121 123667788888888888899888777 55432 321 111111122333444555666666655555 Q ss_pred HHhhcccchheehhhccccchhh---hhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccc Q lcl|NC_011269. 456 AEAQGHYDYDLKGGVRVPIYREI---VEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKT 532 (867) Q Consensus 456 ~e~q~~~d~~~~~~~~~~~~rd~---~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t 532 (867) ...-...+.++++.|+.|...+- .+-.+++++-. .. + +.-+ +.+...+...+.+- .....++ T Consensus 417 ~~~~~~~~~~~~i~f~pL~~~s~kEkAei~~~~a~a~-~~----~---~~~G---~i~~~Evr~~L~~~--~~~g~~~-- 481 (537) T protein:vir:10 417 CRSHLRKRIRVKVEFPPMDAPKESERADTFLKKMQAA-KL----A---FEMG---AVDGVDVNEYLRMD--PTLGFTS-- 481 (537) T ss_pred HHhcCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHH-HH----H---HHcC---CCCHHHHHHHHhcc--Ccccccc-- Confidence 44333335567777776644322 22212121110 00 0 1111 11112222222210 0000000 Q ss_pred cCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccccccCCCCCCCCC Q lcl|NC_011269. 533 LAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLALRQGKTQTE 599 (867) Q Consensus 533 ~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~a~gpgq~~ 599 (867) +...+..+- ..+....+....-..........+.. ..... ..+...... .+..... T Consensus 482 l~~~~~~ed--~e~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~-~~~~~~~~~------~~a~~~~ 537 (537) T protein:vir:10 482 ITPAMRPTD--AEDIDVDDEGKPVRIIEDQPAPSEMF--GATSS-GESANDPRD------SGAAFED 537 (537) T ss_pred ccCCCChhh--hhcccCCccCCcCCCCCCCCCccccC--CCCcc-ccccCCCcc------CccccCC Confidence 001111100 00000000000000000000000000 00000 000000000 1111111 No 140 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=79.80 E-value=0.026 Score=29.27 Aligned_cols=117 Identities=14% Similarity=0.231 Sum_probs=53.3 Q ss_pred cCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHH- Q lcl|NC_011269. 275 QNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQG- 353 (867) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 353 (867) |.++|.--+..+.++..+-++=..+. |||. |.+.=++..++.|.. ..+ -||+||-|... T Consensus 1 M~~~~~~~~~~L~~ll~~L~~~~r~~---l~~~----Ig~~l~~~t~~Rf~~------------q~~-PDG~pW~p~k~~ 60 (152) T protein:vir:10 1 MSEPIEQVKTAFDSLLNNISKPRRRL---MYQQ----IGRELARSQRRRIKA------------QQN-PDGSAYEPRKKP 60 (152) T ss_pred CchHHHHHHHHHHHHHHhcCcchHHH---HHHH----HHHHHHHHHHHHHHh------------ccC-CCCCCCchhhhh Confidence 55554444444555555443322211 3332 222233333333321 111 58999988653 Q ss_pred ------------HHHHHHHH--HHHhhhcc-----------hhhhhhhhheeeeeccccC-c-------cCch-hHHHHH Q lcl|NC_011269. 354 ------------ELDEVRDD--MQSLLAAD-----------FRLMVHNFGLKVENVFGRE-S-------VPNL-DADYDR 399 (867) Q Consensus 354 ------------~~~~~~~~--~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~-~-------~~~~-~~~~~~ 399 (867) |+..+|.. |+.-.-+| .|--|||||..-...-+.. + -|-| ++|.++ T Consensus 61 ~~~~k~~~~~~~m~~~L~~a~~l~~~a~~~~~~Vg~~Gt~~~yAaiHQfG~~~r~~~~~~~~v~iPaRp~LG~s~~d~~~ 140 (152) T protein:vir:10 61 KKGVKSKIKSGKMFDKITQPRFMRLRLESEGVSLGYEGGDAVIARIHQQGLIGRVRKDWDLKVKYASRELLGFTDDDLQM 140 (152) T ss_pred hhhhcccccchhHHHhhhhcceeeeeecCcEEEEEecCCchhhhhhhccCccccccCCCCcceeccccccCCCCHHHHHH Confidence 44444431 11001112 3456799997543221111 1 1222 377888 Q ss_pred HHHHHHHhhccchh Q lcl|NC_011269. 400 IERKLLQAWGIGEA 413 (867) Q Consensus 400 ~~~~~~~~~~~~~~ 413 (867) |..-|+.-| ..| T Consensus 141 I~~~i~~~l--~~a 152 (152) T protein:vir:10 141 IEDYMINIL--AGS 152 (152) T ss_pred HHHHHHHHH--hcC Confidence 888777776 222 No 141 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=78.93 E-value=0.11 Score=25.85 Aligned_cols=444 Identities=12% Similarity=0.104 Sum_probs=148.4 Q ss_pred hhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHH-------- Q lcl|NC_011269. 38 AALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEEL-------- 109 (867) Q Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------- 109 (867) .-|-|.. .+..+.+ -|++. .+.+.. |+..-++ ...+-..-++++..++ T Consensus 1 ~~~~~~~--~~~~~~~--------~~~~~--~~~~~~----n~~~~~~--------~~~~~~~~~~~~i~~~i~~~~~~~ 56 (511) T protein:vir:10 1 MLKVNEF--ETDTDLR--------GNINY--LFNDEA----NVVYTYD--------GTESDLLQNVNEVSKCIEHHMDYQ 56 (511) T ss_pred Cccccch--hhhhhhh--------hhhhh--hhhhhh----cCCccCc--------hhhhhcccCHHHHHHHHHHHHHhh Confidence 1111110 0112221 34433 222222 2211111 1111112222111111 Q ss_pred -HHHHHHHHHHhh-c---------------------cchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccH Q lcl|NC_011269. 110 -RVIRHWCRLFYA-T---------------------HDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNY 165 (867) Q Consensus 110 -~~~~~~~~~~~~-~---------------------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (867) ..++++ +.||. . +++.+.++|.+..|=++. +.++++|+.+.++..+++= +-|+ T Consensus 57 ~~r~~~l-~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~--~n~~ 133 (511) T protein:vir:10 57 RPRLKVL-SDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFND--LNDV 133 (511) T ss_pred HHHHHHH-HHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCchHHHHHHHHHHh--hcCH Confidence 111111 12342 2 345678899998887766 8899999888887776643 4455 Q ss_pred HHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccc--cccccccc Q lcl|NC_011269. 166 LEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGP--TTAGGNMS 243 (867) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 243 (867) ...+.+ +++...+-|.++-+--.+|.+.. +..+++|.-+-+-.+-...++.+- .+|-=- ..++.... T Consensus 134 ~~~~~~-~~~~~~i~G~ay~~vy~dedg~~--~i~~~~p~~~~~vydd~~~~~~~~--------~vr~~~~~~~d~~~~~ 202 (511) T protein:vir:10 134 ESHNRS-LGLDLSIYGKAYEIMIRNQDDET--RLYKSDAMSTFVIYDNTIERNSIA--------GVRYLRTKPIDKTDED 202 (511) T ss_pred HHHHHH-HHHHHHhcCeeEEEEEeCCCCce--EEEEEccceeEEEEcCCCCCceEE--------EEEEEEeeecccCccc Confidence 555555 55788888887766555554433 234566654433211111111111 010000 00000000 Q ss_pred cccccchhhhhhhhhHHHHHHhchH-HHhhhc-cCCCCcccHHHH---HHhhhcCccc----cccCcchhhHHHHHHHHH Q lcl|NC_011269. 244 TVEETPSEREQRMREFQDLQRRYPE-IIQAAM-QNDGLDISEALI---SRVVNRPTAW----ATRGAPHLLRSFRTLMAE 314 (867) Q Consensus 244 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~---~~~~~~~~~~----~~~~~~~~~~~~~~~~~~ 314 (867) + .++. ++| .++ |..-.. .+..+.+....+ .|-..+-+-. ..+|.+.+=.. ..|+ T Consensus 203 ~--------~~~~----~iy--t~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~~g~gd~e~v-~~li-- 265 (511) T protein:vir:10 203 E--------VFTV----DLF--TSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKV-ITLI-- 265 (511) T ss_pred e--------EEEE----EEE--eCCcEEEEEecCCCcccccccccccccccCcceeEEEecCCCCCCCchhhh-HHHH-- Confidence 0 0000 000 011 111000 111222222211 1211111111 24566654432 3333 Q ss_pred HHHHHHHHHHH---hhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccC--- Q lcl|NC_011269. 315 ESLNAAQDAVA---DRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRE--- 388 (867) Q Consensus 315 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 388 (867) +.|+.+..-.| +.+..|+.+++ |. . .-+-+++...++. ..+..+-...++..++..+.-+..+ T Consensus 266 Da~d~~~S~~~~~~~~~~~~~lv~~-g~----~----~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~d~~~l~ 334 (511) T protein:vir:10 266 DLYDNAESDTANYMSDLNDAMLLIK-GN----L----NLDPVEVRKQKEA--NVLFLEPTVYADSEGRETEGSVDGGYIY 334 (511) T ss_pred HHHHHHHHHHHHHHHHhhCceeeee-cc----c----cCCchhhccchhc--cceecccccccccccccCCCCcceeEEe Confidence 33444333322 23344444332 10 0 0011122222220 1111111112222222211111110 Q ss_pred ---ccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhhc- Q lcl|NC_011269. 389 ---SVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQG- 460 (867) Q Consensus 389 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q~- 460 (867) ..=.+...++++++.|..--++-... .++.+++ .|.+.+.+. -++....+..++..++++++.|.++-+ T Consensus 335 ~~~~~~~~e~~~~~L~~~I~~~s~~P~~~-~~~~~~n--~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~ 411 (511) T protein:vir:10 335 KQYDVQGTEAYKDRLNSDIHMFTNTPNMK-DDNFSGT--QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 411 (511) T ss_pred ecCCHHHHHHHHHHHHHHHHHHhCCcccc-ccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 11112355666777766554433311 1122222 233444332 334556677778888888888877533 Q ss_pred --ccc-----hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhcccccc-ccchhhhhhhhhhhhhceeeeeccc Q lcl|NC_011269. 461 --HYD-----YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLN-LRDEAQERAFIAQLKGMGVPVSDKT 532 (867) Q Consensus 461 --~~d-----~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~-Lr~e~~~~~~v~qL~~~~~pitd~t 532 (867) ..+ .++++.|....+++..+.-.-+.++. . .+-.+.....+- +.+.+ T Consensus 412 ~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~---G--~iS~et~~~~l~~v~d~~-------------------- 466 (511) T protein:vir:10 412 TRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG---G--KISQTTLMSLFSFFQDPE-------------------- 466 (511) T ss_pred hCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHh---c--cCcHHHHHHhCCCCCCHH-------------------- Confidence 211 13555555555655433222222221 0 011111111100 01101 Q ss_pred cCCCcccccchhhhhhHHHHHHHHhhccc-ccccccccccccCCCC-Cccccccc Q lcl|NC_011269. 533 LAVNIDMKFDQELERQADETVQKLMATAQ-AMKKVQDLCDAQNLPY-PPELAQHL 585 (867) Q Consensus 533 ~p~tiqme~E~e~e~k~~E~l~tL~~tae-t~kkvq~~~p~~g~P~-pp~~aQ~p 585 (867) .|.++...|....+..... ................ ....-+.- T Consensus 467 ----------~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 467 ----------LEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred ----------HHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcccCcccccC Confidence 1111111111111110000 0000000000000000 00000000 No 142 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=78.32 E-value=0.12 Score=25.72 Aligned_cols=441 Identities=12% Similarity=0.096 Sum_probs=148.0 Q ss_pred hhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHH--------- Q lcl|NC_011269. 38 AALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEE--------- 108 (867) Q Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------- 108 (867) .-|-|.. .+..+.+ -|++. .+.+.. |+..-+++ .++-...|.++..+ T Consensus 1 ~~~~~~~--~~~~~~~--------~~~~~--~~~~~~----n~~~~~~~--------~e~~~~~~~~~i~~~i~~~~~~~ 56 (511) T protein:vir:96 1 MLKVNEF--ETDTDLR--------GNINY--LFNDEA----NVVYTYDG--------TESDLLQNVNEVSKYIEHHMDYQ 56 (511) T ss_pred Cccccch--hhhhhhh--------hhhhh--hhhhhh----CCccccch--------hhhhhhccHHHHHHHHHHHHHhh Confidence 1111100 0112222 23333 222222 22221211 11111112211111 Q ss_pred HHHHHHHHHHHhh-cc---------------------chHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccH Q lcl|NC_011269. 109 LRVIRHWCRLFYA-TH---------------------DLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNY 165 (867) Q Consensus 109 ~~~~~~~~~~~~~-~~---------------------~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (867) +..++++ +.||. .| .+.+.++|.+..|=++. +.++.+|+.+.++..+++= +-|+ T Consensus 57 ~~r~~~l-~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~--~n~~ 133 (511) T protein:vir:96 57 RPRLKVL-SDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFND--LNDV 133 (511) T ss_pred HHHHHHH-HHHhcccCccccccCcCcccccCcceeecchHHHHHHHHHhhhccCCceeecCchHHHHHHHHHHh--hcCH Confidence 1112222 23442 23 45678899998888776 8898898888888766643 4456 Q ss_pred HHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccc--cccccc-- Q lcl|NC_011269. 166 LEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGP--TTAGGN-- 241 (867) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-- 241 (867) ...+.+ +++.....|.++-+--.+|.+.. +..+++|.-+-+-.+-...++.+- ++|.=- ..++.. T Consensus 134 ~~~~~~-~~~~~~i~G~a~~~vy~ded~~~--~i~~~~p~~~~~vydd~~~~~~~~--------~vr~~~~~~~d~~~~~ 202 (511) T protein:vir:96 134 ESHNRS-LGLDLSIYGKAYELMIRNQDDET--RLYKSDAMSTFVIYDNTIERNSIA--------GVRYLRTKPIDKTDED 202 (511) T ss_pred HHHHHH-HHHHHHhcCeeEEEEEeCCCCce--EEEEEccceeEEEEcCCCCCceEE--------EEEEEEeeeccccccc Confidence 666666 55888888887766555654432 245566665443211111111111 111000 000000 Q ss_pred -cccccc-cchhhhhhhhhHHHHHHhchHHHhhh-ccCCCCcccHHHHH---HhhhcCccc----cccCcchhhHHHHHH Q lcl|NC_011269. 242 -MSTVEE-TPSEREQRMREFQDLQRRYPEIIQAA-MQNDGLDISEALIS---RVVNRPTAW----ATRGAPHLLRSFRTL 311 (867) Q Consensus 242 -~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~---~~~~~~~~~----~~~~~~~~~~~~~~~ 311 (867) ....|- |+. + |..-. ..+..+.+....+. |-...-+-+ ..+|.+.+=.. +.| T Consensus 203 ~~~~~~iyt~~-~----------------i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~gd~e~v-~~l 264 (511) T protein:vir:96 203 EVFTVDLFTSH-G----------------VYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKV-ITL 264 (511) T ss_pred eEEEEEEEeCC-c----------------EEEEEecCCCcccccccccccccccCCceeeEEecCCCCCCCchhhh-HHH Confidence 000100 100 0 00000 11122222222221 111111111 24566654432 333 Q ss_pred HHHHHHHHHHHHHH---hhhhchhhhhhh-cccccCCCCcCCCCHHHHHHHHHHHHHh-hhcchhhhhhhhheeeeeccc Q lcl|NC_011269. 312 MAEESLNAAQDAVA---DRLYSPLVLATL-GIEDMGDGEPWIPDQGELDEVRDDMQSL-LAADFRLMVHNFGLKVENVFG 386 (867) Q Consensus 312 ~~~~~~~~~~~~~~---~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 386 (867) + +.|+.+..-.| +.+..|+.+++= +..+ .++++.+.+.. +..+-.+..+..++..+.-+. T Consensus 265 i--Da~d~~~S~~~~~~~~~~~~~lv~~g~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (511) T protein:vir:96 265 I--DLYDNAESDTANYMSDLNDAMLLIKGNLNLD-------------PVEVRKQKEANVLFLEPTVYADSEGRETEGSVD 329 (511) T ss_pred H--HHHHHHHHHHHHHHHHhhCceeeeecCccCC-------------chhhcccccccceecccccccccccccCCCCcc Confidence 3 23333333222 223444444331 1111 11111111111 111111111111111111111 Q ss_pred c------CccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHH Q lcl|NC_011269. 387 R------ESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREF----VTQIMTGFQNALKRHIRRRCEVVA 456 (867) Q Consensus 387 ~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~ 456 (867) . ...=.++..++++++.|..--++..... ++.+++ .|.+.+.+ +-++....+..++..++++++.|. T Consensus 330 ~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~-~~~~~n--~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~ 406 (511) T protein:vir:96 330 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD-DNFSGT--QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLE 406 (511) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc-cccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1111233556666776666554433211 122222 23344432 233455566777788888888887 Q ss_pred Hhh---cccc-----hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeee Q lcl|NC_011269. 457 EAQ---GHYD-----YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPV 528 (867) Q Consensus 457 e~q---~~~d-----~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pi 528 (867) .+- +..+ .+++..|....+++..+.-.-+.++. -.+ +.+..+..+ T Consensus 407 ~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~-----G~i---------------S~et~l~~l------- 459 (511) T protein:vir:96 407 TILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG-----GKI---------------SQTTLMSLF------- 459 (511) T ss_pred HHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHHh-----ccC---------------ChHHHHHhC------- Confidence 642 2221 13455555555554322211111111 000 011111111 Q ss_pred eccccCCCcccccchhhhhhHHHHHHHHhhccc-ccccccccccccCCCCCcccccccc Q lcl|NC_011269. 529 SDKTLAVNIDMKFDQELERQADETVQKLMATAQ-AMKKVQDLCDAQNLPYPPELAQHLQ 586 (867) Q Consensus 529 td~t~p~tiqme~E~e~e~k~~E~l~tL~~tae-t~kkvq~~~p~~g~P~pp~~aQ~p~ 586 (867) +...+. +.+.++...|....+..... ........-.........+....-. T Consensus 460 -----~~v~D~--~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 460 -----SFFQDP--ELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred -----CCCCCH--HHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCcccccccccC Confidence 000000 11111111111111111000 0000000000000000000000000 No 143 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=78.13 E-value=0.12 Score=25.68 Aligned_cols=412 Identities=11% Similarity=0.074 Sum_probs=156.0 Q ss_pred ccceee-ccchhhhhhhhhH-----------HhhCCCchhhhHHHHHHHHHHHHHhhc---------------------- Q lcl|NC_011269. 77 GSNMQI-AMPKIRQPLGTLA-----------DKGIPFNVEDEEELRVIRHWCRLFYAT---------------------- 122 (867) Q Consensus 77 ~~~~~~-~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------------- 122 (867) |-|+.. ..|++++=++.-. .+.|-. .++.++.++.|-+.|..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~---~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~ 77 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITK---HKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPD 77 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHH---HHHHHHHHHHHHHHhcCCCccccccccccccccccccccc Confidence 333311 1233333332211 122211 122344556665444444 Q ss_pred ----cchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhcccee Q lcl|NC_011269. 123 ----HDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWS 197 (867) Q Consensus 123 ----~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 197 (867) +++.+.++|.+..|=++. +.++++|+...++..++ |.. |+...+.+ ++++...-|+++-+--.++.+.. T Consensus 78 ~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~-~~n--~~~~~~~~-~~~~~~~~G~~~~~v~~d~~~~~-- 151 (468) T protein:vir:96 78 WRMYTNYHQNLVDQKVAYAVANPVTYGTEDEKSLKTIQEV-LNH--KWDDKLVD-ILTAASNKGVEWIQPYVDEQGEF-- 151 (468) T ss_pred cccccchHHHHHHHHHhhhccCCceeccCChHHHHHHHHH-Hhc--CHHHHHHH-HHHHHhhcCeEEEEEEEcCCCce-- Confidence 445667888888887766 88999999988887776 444 44455555 56888888888766666665433 Q ss_pred hheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhH-HHHHHhchHHHhhhccC Q lcl|NC_011269. 198 SEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREF-QDLQRRYPEIIQAAMQN 276 (867) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 276 (867) +..+++|+.+-.- |..+..-++ +-.+|.= ..++. ..+|.--..+..+.+.+ ..+...+.+-......+ T Consensus 152 ~i~~~~p~~~~~v---~~~~~~~~~-----~~~ir~~-~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (468) T protein:vir:96 152 KTFRVPAEQAIPI---WTNKERDEL-----KAFIRLY-ELDGG--ERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAH 220 (468) T ss_pred EEEEEcccceEEE---EcCCCCCce-----EEEEEEE-EecCc--eEEEEEeCCeEEEEEEcCCceeecccccccccccc Confidence 3566777665322 211110010 0000000 00000 00111001111111100 00011111100000000 Q ss_pred -----CCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHH---HHhhhhchhhhhhhcccccCCCCcC Q lcl|NC_011269. 277 -----DGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDA---VADRLYSPLVLATLGIEDMGDGEPW 348 (867) Q Consensus 277 -----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~ 348 (867) .-..+..==|+++.| .+.|.+.+=. -+.|+.. |+.+..- ..+.+..|+++++ | ..++ T Consensus 221 ~~~~~~~~~~~~iPvv~~~n-----~~~g~sd~e~-v~~liDa--~d~~~S~~~~~~~~~~~p~lv~~-g----~~~~-- 285 (468) T protein:vir:96 221 YYVGNKSMSWNRVPFIPFKN-----NPQEVSDLFM-YKTIIDA--MDKRLSDTQNTFDEATELIYVLK-G----YEGE-- 285 (468) T ss_pred eeeccccccCCcccEEEecC-----CCCCCCchHH-HHHHHHH--HHHHHHHHHHHHHHhcCceeeee-c----CCcc-- Confidence 000111111222222 3457776432 3333322 3322222 1244667766655 2 1111 Q ss_pred CCCHHHHHHHHHHHHHhhhcchhhhh-h-hhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehh Q lcl|NC_011269. 349 IPDQGELDEVRDDMQSLLAADFRLMV-H-NFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASS 426 (867) Q Consensus 349 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 426 (867) +......+|+. ++++. . .=+-+++++=-....=.+...++.+++.|...-++-.. ..++.|++ .+ T Consensus 286 -----~~~~~~~~~~~-----~~~i~~~~d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~-~~~~~~~n--~S 352 (468) T protein:vir:96 286 -----DLEEFMYNLKY-----YKAINVDGDGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDF-QQDKFGNS--PS 352 (468) T ss_pred -----ccchhhhhhhc-----CceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCcccc-cccccccc--hH Confidence 11222222221 11110 0 00001111100111112235677777777766554321 11122222 23 Q ss_pred hhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhhcccc--hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhh Q lcl|NC_011269. 427 ALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQGHYD--YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPE 500 (867) Q Consensus 427 ~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d--~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~ 500 (867) .+.+.+. -++....+..++..++++++.|..+.++-. ..+...|.-..+++. .++++- ++ T Consensus 353 g~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~i~f~~~~p~d~----~e~a~~-----~~----- 418 (468) T protein:vir:96 353 GIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLSIKVQDVEITFNFNVMVNE----LEQSQI-----GV----- 418 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCCcCH----HHHHHH-----HH----- Confidence 3333322 233456677888888898888888765322 123333333333332 112110 00 Q ss_pred hccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccc Q lcl|NC_011269. 501 IKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDA 572 (867) Q Consensus 501 i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~ 572 (867) ..+ + + ..+..+.. ++..-+ .+.|.++...|........ ..........+. T Consensus 419 -~~g-~-i----S~et~i~~------------l~~v~D--~~~E~~ri~~E~~~~~~~~-~~~~~~~~~~~~ 468 (468) T protein:vir:96 419 -NSQ-Y-L----SKETVVTN------------HPWVDD--PVAEMERIDQEELALPSIE-EGLNGKENNEPT 468 (468) T ss_pred -hcC-C-C----chHHHHHh------------CCCCCC--HHHHHHHHHHHHHHHHHHh-hccCCCCCCCCC Confidence 000 0 0 11111211 111111 1233333333433322222 122222211111 No 144 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=76.84 E-value=0.13 Score=25.42 Aligned_cols=376 Identities=7% Similarity=0.003 Sum_probs=126.3 Q ss_pred eccchhhhhhhhhHHhh---C-CCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccc--ceecccchhHHHHHH Q lcl|NC_011269. 82 IAMPKIRQPLGTLADKG---I-PFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVG--MEFDSKDPLIKTFYE 155 (867) Q Consensus 82 ~~~~~~~~~~~~~~~~~---~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ 155 (867) |++ .-.++...++. + ..++-+-.. ...=|..-|-.++.|-.||++-++ .|.. ++..-+|..++.-.. T Consensus 1 Mg~---~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~l~~~~v~~~v~~Ia~-~ia~~p~~~~~~~~~~~~~~~ 73 (395) T protein:vir:40 1 MGF---KSWVSGFFNEEQRTLNLTDTVWCSI---PSEKLKELSIKKWAIDSCANKIAN-TLSCAEVLTYEKGEEVRKKNW 73 (395) T ss_pred Cch---HHHHHhhhcccccccccccchhhcc---ccccchhhhhhhHHHHHHHHHHHH-HHhhCceeeccCCccccchHH Confidence 332 22222222211 0 011110000 000111223346678889988765 4443 333334444443222 Q ss_pred HHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhcc Q lcl|NC_011269. 156 DLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLR 232 (867) Q Consensus 156 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (867) +++- .+.++-.+|+..++ ..++.-|+++-+.+-++ ..|. ..+.++. . .+.+..- T Consensus 74 ~lL~~~PN~~~t~~~f~~~~~-~~lll~Gnay~~~~~~~-------~~~~-~~~~~~~---~-----------~~~~~~~ 130 (395) T protein:vir:40 74 YMFNVEANQNQNATEFWKKAI-YKLVYDNEALIFMQDEY-------IYVA-DSFTKND---K-----------SLYENTY 130 (395) T ss_pred HHHHhcCCCCCCHHHHHHHHH-HHHhhcCceEEEEecCc-------eeec-CCccccc---c-----------cccccee Confidence 2211 22344478888866 78888898875443221 1111 1111111 0 0000000 Q ss_pred ccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHH Q lcl|NC_011269. 233 QGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLM 312 (867) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (867) ..-+.+ .|. +.+.|| ..-|-|+.-....-...+.+ ++.....++ T Consensus 131 ~~v~~~-------------------~~~-~~~~~~---------------~~evih~r~~~~~~~~~~~~-l~~~~~~~~ 174 (395) T protein:vir:40 131 TEVTLK-------------------DLT-LKKEFK---------------ESEVLHLTLNNESIKSIIDG-FYLLYGDLL 174 (395) T ss_pred eeeeec-------------------Cce-eeeeec---------------cccEEEeecCCCCccccchh-HHHHHHHHH Confidence 000000 010 011122 22333443111111111111 111222211 Q ss_pred HHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhh----cchhhhhhhhheeeeeccccC Q lcl|NC_011269. 313 AEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLA----ADFRLMVHNFGLKVENVFGRE 388 (867) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~ 388 (867) . ..+ ++. .+--.+--++++. .+..++.+.-+++|+.++..+. .....+|-.-|.+.+.+...- T Consensus 175 ~-~~~----~~~-~~~~~~~~~l~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~ 241 (395) T protein:vir:40 175 T-AAV----NKY-KKLNSRKIIVKLK-------AMFGQTPEAEEKLRLMLSERMKKFLAEGDSALPVEDGMEIDELAGDS 241 (395) T ss_pred H-HHH----HHH-HhcCCCCceEEEe-------cccCCCHHHHHHHHHHHHHHHHHhhccCCceeecCCCceEEeccCCh Confidence 1 111 111 1111111122222 1223566666778887776553 345567778888887776554 Q ss_pred ccCchhH--HHH-HHHHHHHHhhccchhhhcCCCccceehh-hhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcccch Q lcl|NC_011269. 389 SVPNLDA--DYD-RIERKLLQAWGIGEALISGGTGGAYASS-ALNREFVTQIMTGFQNALKRHIRRRCEVVAEAQGHYDY 464 (867) Q Consensus 389 ~~~~~~~--~~~-~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d~ 464 (867) ....+.+ +|. .+.++|.+++||...++. |+ |+++ +....|+..-+.-+...|++++.+.+=...|. ...+ T Consensus 242 ~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~-~~---~sn~e~~~~~f~~~~L~P~~~~ie~~l~~kLl~~~~~--~~g~ 315 (395) T protein:vir:40 242 KIAESRDIKKMIDDVFEMVANSFNIPLGLAK-GD---TVGLSEQVNSFLMFSINPIAEMFTDEGNRKFYGRDSV--LERT 315 (395) T ss_pred hhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc-CC---CcCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhh--cCCc Confidence 4444432 221 234689999999999995 44 5542 22333333334444444444444433111111 1111 Q ss_pred heehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhh-ceeeeeccccCCCccc-ccc Q lcl|NC_011269. 465 DLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKG-MGVPVSDKTLAVNIDM-KFD 542 (867) Q Consensus 465 ~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~-~~~pitd~t~p~tiqm-e~E 542 (867) .|++-+..+.--| .++ .++. ..+..+ -+-+..+.-+...+ +..++. .+....-+..-..+.. +-. T Consensus 316 ~i~fd~~~ll~~d---~~~-~~~~-~~~~~~-------~G~~t~NE~R~~~g-~~pi~~~~gD~~~~~~n~~~~~~~~~~ 382 (395) T protein:vir:40 316 YMKLDTTRIKVQD---IQE-IASS-MDVLFH-------IGVNTIDDNLRMIG-REPVMSPETQERFVTKNYAPLGENEED 382 (395) T ss_pred eEEEechhhhccC---HHH-HHHH-HHHHHh-------CCCCCHHHHHHHhC-CCCCCCCCCceeeeccccccccccccc Confidence 2222222221111 111 1111 111100 00011110000000 000000 0000000000000000 000 Q ss_pred hhhhhhHHHHHHHHhhcccccccccccccccCC Q lcl|NC_011269. 543 QELERQADETVQKLMATAQAMKKVQDLCDAQNL 575 (867) Q Consensus 543 ~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~ 575 (867) ++..... ....+. T Consensus 383 ~kgge~~--------------------~~~~~~ 395 (395) T protein:vir:40 383 LKGGDIN--------------------ENKGDS 395 (395) T ss_pred cCCCCCC--------------------CCcCCC Confidence 0000000 000000 No 145 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=73.71 E-value=0.17 Score=24.84 Aligned_cols=418 Identities=12% Similarity=0.078 Sum_probs=158.0 Q ss_pred ccceeeccchhhhhhhhhHHhh--------CCCchhhh--------HHHHHHHHHHHHHhh-c----------------- Q lcl|NC_011269. 77 GSNMQIAMPKIRQPLGTLADKG--------IPFNVEDE--------EELRVIRHWCRLFYA-T----------------- 122 (867) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~--------~~~~~~~~~~~~~~~-~----------------- 122 (867) |- -+|+.|+..+.+.- .+-+.+.. +++..++..-+ ||. . T Consensus 1 ~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~-Yy~g~~~i~~~~~~~~~~~~~~ 73 (474) T protein:vir:95 1 MI------NIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQK-YYDKDNDINYQAYKQDLHGNID 73 (474) T ss_pred Cc------ccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHH-HhcccCccccccchhhhccccc Confidence 22 24566655443322 22221111 11222233222 333 2 Q ss_pred ---------cchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhh Q lcl|NC_011269. 123 ---------HDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNES 192 (867) Q Consensus 123 ---------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 192 (867) +.+.+.++|.+..|=++. +.++..|+...++..++ |. =|+...+.+ +++.....|.++-+--.++. T Consensus 74 ~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~-~~--n~~~~~~~~-l~~~~~~~G~~~~~~~~d~~ 149 (474) T protein:vir:95 74 YTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQV-LD--TRWDNKLID-ILTAASNKGIDWLQVYINED 149 (474) T ss_pred ccccccccccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHH-Hh--ccHHHHHHH-HHHHHhhCCeEEEEeeeCCC Confidence 466688999999998777 89999999988888776 43 355555666 56888889998877666665 Q ss_pred ccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHH-- Q lcl|NC_011269. 193 LGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEII-- 270 (867) Q Consensus 193 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 270 (867) +.. ...+++|+.+-+- |.....-+ ++-.+|.- +..+. +.+|---..+.. .|..--..+.... T Consensus 150 ~~~--~i~~~~p~~~~~v---~d~~~~~~-----~~a~ir~~-~~~~~--~~~~vy~~~~i~---~~~~~~~~~~~~~~~ 213 (474) T protein:vir:95 150 GEL--KLFRVPAEQAIPI---WTDKEREQ-----LNAFIRIF-TFNGE--TKVEYWTAETVT---YYVYENGGLIPDFYY 213 (474) T ss_pred Cce--EEEEEcccceEEE---EcCCCCCc-----eEEEEEEE-eecCe--eEEEEEeCCeEE---EEEEcCCceeecccc Confidence 433 3556777655432 21111111 11111110 00000 000000000000 0000000000000 Q ss_pred -hhhccCC--CCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHH-HHhhhhchhhhhhhcccccCCCC Q lcl|NC_011269. 271 -QAAMQND--GLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDA-VADRLYSPLVLATLGIEDMGDGE 346 (867) Q Consensus 271 -~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 346 (867) +.....+ ...+..==|+++.| ...|.+.+-. .+.|+-.-...-.+-+ ..+.+..|+.+++ | ..| T Consensus 214 ~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~-v~~liDa~d~~~S~~~~~~~~~~~p~lv~~-g----~~~- 281 (474) T protein:vir:95 214 GDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSDIWM-YKSFVDAIDKRLSDVQNMFDESVELIYILR-G----YEG- 281 (474) T ss_pred ccccccCcccccCCCccceEEecC-----CCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhhc-C----CCc- Confidence 0000000 00001001122222 2456665533 4444432221111111 1245566655443 3 111 Q ss_pred cCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehh Q lcl|NC_011269. 347 PWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASS 426 (867) Q Consensus 347 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 426 (867) +++.+...+++ .++++...=+-.++++=..-..=.+...++.+++.|...-++-..-. .+.|++ .+ T Consensus 282 ------~~~~~~~~~~~-----~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~~~n--~S 347 (474) T protein:vir:95 282 ------EDLSEFMEGLK-----YYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQT-DKFGSA--TS 347 (474) T ss_pred ------ccccchhhhhh-----ccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccc-cccccc--cH Confidence 12222233222 12222222222222221111111233667777777777665543221 122332 34 Q ss_pred hhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhhcc-c-chheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhh Q lcl|NC_011269. 427 ALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQGH-Y-DYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPE 500 (867) Q Consensus 427 ~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q~~-~-d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~ 500 (867) .+.+.+. -++-...++.++..++++++.|.++.+. + ..+++..|+-..+.+..+ .++.. +. T Consensus 348 g~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e----~a~~~-----~~---- 414 (474) T protein:vir:95 348 GIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLE----QSQIG-----AQ---- 414 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHH----HHHHH-----HH---- Confidence 4444433 2344557778888889999988887552 1 123444444444443322 11110 00 Q ss_pred hccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 501 IKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 501 i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) +.+ + ..+..+..+ +...+. +.|.++...|...... ...................... T Consensus 415 ---~gi-i----S~et~~~~l------------p~v~D~--~~E~eri~~E~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:95 415 ---SQY-L----SKETLVRHH------------PWVDDP--KAELERLDEEQLELNK-QLPNLDDGGADGAQQQQQSENN 471 (474) T ss_pred ---cCC-C----ChHHHHHhC------------CCCCCH--HHHHHHHHHHHHHHHh-hccccccccCCCCCCcCCCCcc Confidence 000 0 111112111 111111 1122222222111111 0111111000000000000000 Q ss_pred ccc Q lcl|NC_011269. 581 LAQ 583 (867) Q Consensus 581 ~aQ 583 (867) ... T Consensus 472 e~~ 474 (474) T protein:vir:95 472 QSK 474 (474) T ss_pred ccC Confidence 111 No 146 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=73.71 E-value=0.17 Score=24.84 Aligned_cols=418 Identities=12% Similarity=0.078 Sum_probs=158.0 Q ss_pred ccceeeccchhhhhhhhhHHhh--------CCCchhhh--------HHHHHHHHHHHHHhh-c----------------- Q lcl|NC_011269. 77 GSNMQIAMPKIRQPLGTLADKG--------IPFNVEDE--------EELRVIRHWCRLFYA-T----------------- 122 (867) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~--------~~~~~~~~~~~~~~~-~----------------- 122 (867) |- -+|+.|+..+.+.- .+-+.+.. +++..++..-+ ||. . T Consensus 1 ~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~-Yy~g~~~i~~~~~~~~~~~~~~ 73 (474) T protein:vir:96 1 MI------NIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQK-YYDKDNDINYQAYKQDLHGNID 73 (474) T ss_pred Cc------ccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHH-HhcccCccccccchhhhccccc Confidence 22 24566655443322 22221111 11222233222 333 2 Q ss_pred ---------cchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhh Q lcl|NC_011269. 123 ---------HDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNES 192 (867) Q Consensus 123 ---------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 192 (867) +.+.+.++|.+..|=++. +.++..|+...++..++ |. =|+...+.+ +++.....|.++-+--.++. T Consensus 74 ~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~-~~--n~~~~~~~~-l~~~~~~~G~~~~~~~~d~~ 149 (474) T protein:vir:96 74 YTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQV-LD--TRWDNKLID-ILTAASNKGIDWLQVYINED 149 (474) T ss_pred ccccccccccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHH-Hh--ccHHHHHHH-HHHHHhhCCeEEEEeeeCCC Confidence 466688999999998777 89999999988888776 43 355555666 56888889998877666665 Q ss_pred ccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHH-- Q lcl|NC_011269. 193 LGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEII-- 270 (867) Q Consensus 193 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 270 (867) +.. ...+++|+.+-+- |.....-+ ++-.+|.- +..+. +.+|---..+.. .|..--..+.... T Consensus 150 ~~~--~i~~~~p~~~~~v---~d~~~~~~-----~~a~ir~~-~~~~~--~~~~vy~~~~i~---~~~~~~~~~~~~~~~ 213 (474) T protein:vir:96 150 GEL--KLFRVPAEQAIPI---WTDKEREQ-----LNAFIRIF-TFNGE--TKVEYWTAETVT---YYVYENGGLIPDFYY 213 (474) T ss_pred Cce--EEEEEcccceEEE---EcCCCCCc-----eEEEEEEE-eecCe--eEEEEEeCCeEE---EEEEcCCceeecccc Confidence 433 3556777655432 21111111 11111110 00000 000000000000 0000000000000 Q ss_pred -hhhccCC--CCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHH-HHhhhhchhhhhhhcccccCCCC Q lcl|NC_011269. 271 -QAAMQND--GLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDA-VADRLYSPLVLATLGIEDMGDGE 346 (867) Q Consensus 271 -~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 346 (867) +.....+ ...+..==|+++.| ...|.+.+-. .+.|+-.-...-.+-+ ..+.+..|+.+++ | ..| T Consensus 214 ~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~-v~~liDa~d~~~S~~~~~~~~~~~p~lv~~-g----~~~- 281 (474) T protein:vir:96 214 GDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSDIWM-YKSFVDAIDKRLSDVQNMFDESVELIYILR-G----YEG- 281 (474) T ss_pred ccccccCcccccCCCccceEEecC-----CCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhhc-C----CCc- Confidence 0000000 00001001122222 2456665533 4444432221111111 1245566655443 3 111 Q ss_pred cCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehh Q lcl|NC_011269. 347 PWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASS 426 (867) Q Consensus 347 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 426 (867) +++.+...+++ .++++...=+-.++++=..-..=.+...++.+++.|...-++-..-. .+.|++ .+ T Consensus 282 ------~~~~~~~~~~~-----~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~~~n--~S 347 (474) T protein:vir:96 282 ------EDLSEFMEGLK-----YYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQT-DKFGSA--TS 347 (474) T ss_pred ------ccccchhhhhh-----ccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccc-cccccc--cH Confidence 12222233222 12222222222222221111111233667777777777665543221 122332 34 Q ss_pred hhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhhcc-c-chheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhh Q lcl|NC_011269. 427 ALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQGH-Y-DYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPE 500 (867) Q Consensus 427 ~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q~~-~-d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~ 500 (867) .+.+.+. -++-...++.++..++++++.|.++.+. + ..+++..|+-..+.+..+ .++.. +. T Consensus 348 g~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e----~a~~~-----~~---- 414 (474) T protein:vir:96 348 GIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLE----QSQIG-----AQ---- 414 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHH----HHHHH-----HH---- Confidence 4444433 2344557778888889999988887552 1 123444444444443322 11110 00 Q ss_pred hccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 501 IKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 501 i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) +.+ + ..+..+..+ +...+. +.|.++...|...... ...................... T Consensus 415 ---~gi-i----S~et~~~~l------------p~v~D~--~~E~eri~~E~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:96 415 ---SQY-L----SKETLVRHH------------PWVDDP--KAELERLDEEQLELNK-QLPNLDDGGADGAQQQQQSENN 471 (474) T ss_pred ---cCC-C----ChHHHHHhC------------CCCCCH--HHHHHHHHHHHHHHHh-hccccccccCCCCCCcCCCCcc Confidence 000 0 111112111 111111 1122222222111111 0111111000000000000000 Q ss_pred ccc Q lcl|NC_011269. 581 LAQ 583 (867) Q Consensus 581 ~aQ 583 (867) ... T Consensus 472 e~~ 474 (474) T protein:vir:96 472 QSK 474 (474) T ss_pred ccC Confidence 111 No 147 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=73.36 E-value=0.17 Score=24.79 Aligned_cols=434 Identities=13% Similarity=0.093 Sum_probs=156.3 Q ss_pred HHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhc------ Q lcl|NC_011269. 49 LIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYAT------ 122 (867) Q Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 122 (867) |+.-| .+.+..+-|-..+.+|-... . -......+.|-... +++..+++|-+.|--. T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~-------------~-~~~~~i~~~i~~~~---~~~~~~~~~~~YY~g~~~i~~~ 62 (474) T protein:vir:94 1 MFNII-RMPWDKPYGEEVVEQLKPQF-------------E-TQEEMIVRLIDDHR---KQLDKITVGQRYYDKDNDIVKQ 62 (474) T ss_pred Ccccc-cccCCCchhhHHHHhhhhcc-------------c-CHHHHHHHHHHHHH---HHHHHHHHHHHHhccccchhcc Confidence 11000 00111111111111111000 0 01112222222221 1233444554433222 Q ss_pred --------------------cchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhh Q lcl|NC_011269. 123 --------------------HDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVG 181 (867) Q Consensus 123 --------------------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (867) +++.+.++|.+..|=++. +.+..+|+...++.++. | +-|+...+.+ +++.....| T Consensus 63 ~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~-~--~n~~~~~~~e-~~~~~~~~G 138 (474) T protein:vir:94 63 MKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDV-L--DTRWDNKLID-ILTATSNKG 138 (474) T ss_pred cchhccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHH-H--hccHHHHHHH-HHHHHhhcC Confidence 456788999999888877 99999999998887764 5 3456666666 568888889 Q ss_pred hhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhcccccccccccccccc-ccchhhhhhhhhHH Q lcl|NC_011269. 182 EVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVE-ETPSEREQRMREFQ 260 (867) Q Consensus 182 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 260 (867) .++-+--.++.+.. ...+++|+.+-+- |.....-+ ++-.+|.= ...+- ..+| -|+.+-.+...+-. T Consensus 139 ~~~~~~~~d~~~~~--~i~~~~p~~~~~v---~d~~~~~~-----~~~~ir~~-~~~~~--~~~~~yt~~~~~~y~~~~~ 205 (474) T protein:vir:94 139 IDWLQVYINENGEM--KLFRVPAEQAIPI---WVDKEREE-----LKSFIRYY-KFNNE--EKVEFWTDTTVTYYVLENG 205 (474) T ss_pred ceEEEEEecCCCee--EEEEEcccceEEE---EcCCCCCc-----eEEEEEEE-EecCe--EEEEEEeCCeEEEEEEcCC Confidence 87766545554332 3566777765433 22111111 00011100 00000 0000 01111000000000 Q ss_pred --HHHHhchH-HHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHH---hhhhchhhh Q lcl|NC_011269. 261 --DLQRRYPE-IIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA---DRLYSPLVL 334 (867) Q Consensus 261 --~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~ 334 (867) .....+.+ .++.-. .--.+..==|++++| .+.|.+.+-+ .+.|+ +.|+.+..-.| +.+..|.++ T Consensus 206 ~~~~~~~~~~~~~~~~~--~~~~~g~vPvv~~~n-----n~~g~sd~e~-v~~li--Da~n~~~s~~~~~~~~~~~~~lv 275 (474) T protein:vir:94 206 GLIPDYYYGANHVQSHF--SNGNWGRVPFIAFKN-----NPEEVSDIWM-YKSII--DAIDKRLSDAQNMFDESVELIYI 275 (474) T ss_pred ccccccccCcCcccccc--cccCCCccceEEecC-----CcCCCCcHHH-HHHHH--HHHHHHHHHHHHHHHHhcCceee Confidence 00000000 000000 000000001222332 2457776543 44444 33444333322 335556554 Q ss_pred hhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhh Q lcl|NC_011269. 335 ATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEAL 414 (867) Q Consensus 335 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 414 (867) ++ | .+|+. +++.++.++ .++++..+=+-.++++=..-..=.+...++.+++.|...-++-..- T Consensus 276 ~~-g----~~~~~-------~~~~~~~~~-----~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~ 338 (474) T protein:vir:94 276 LK-G----YEGED-------LEEFMRGLK-----YYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQ 338 (474) T ss_pred ee-c----CCccc-------chhhhhhhh-----ccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccC Confidence 43 3 22221 222233222 2333332222222222111111122256677777777766544311 Q ss_pred hcCCCccceehhhhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhhcccc--hheehhhccccchhhhhhhhhhhhh Q lcl|NC_011269. 415 ISGGTGGAYASSALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQGHYD--YDLKGGVRVPIYREIVEYDEETGQE 488 (867) Q Consensus 415 ~~~g~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d--~~~~~~~~~~~~rd~~~~k~e~~k~ 488 (867) . +..+++ .+.+.+.+. -++-...+..++..++++++.|.++.+.-. .+++..|.-..+.+ +.++++. T Consensus 339 ~-~~~~~n--~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~----~~e~a~~ 411 (474) T protein:vir:94 339 T-DKFGSA--PSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMN----DAEQSQI 411 (474) T ss_pred c-cccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccC----HHHHHHH Confidence 1 111222 234444332 233455667788888999988888765322 23343343333332 1122211 Q ss_pred HhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccc Q lcl|NC_011269. 489 YIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQD 568 (867) Q Consensus 489 ~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~ 568 (867) +. . .+ + + +.+..+..+ +...+ .+.+.++...|+.+...... ...+... T Consensus 412 --------~~-~--~g-~-i----S~et~l~~l------------~~v~D--~~~E~eri~~E~~~~~~~~~-~~~~~~~ 459 (474) T protein:vir:94 412 --------IA-Q--SQ-Y-L----SRETLVKSS------------PLVDD--YKAELERIEQEQMEYNKQLP-NLDDGGA 459 (474) T ss_pred --------HH-H--cC-C-C----CHHHHHHhC------------CCCCC--HHHHHHHHHHHHHHHHhhcc-ccCCCCC Confidence 00 0 00 0 0 111112211 11111 11222222222222111111 1111000 Q ss_pred cccccCCCCCccccc Q lcl|NC_011269. 569 LCDAQNLPYPPELAQ 583 (867) Q Consensus 569 ~~p~~g~P~pp~~aQ 583 (867) ......-........ T Consensus 460 ~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 460 DGAQQQEGSNNKESE 474 (474) T ss_pred CCcccCCCCcccccC Confidence 000000000000001 No 148 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=73.36 E-value=0.17 Score=24.79 Aligned_cols=434 Identities=13% Similarity=0.093 Sum_probs=156.3 Q ss_pred HHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhc------ Q lcl|NC_011269. 49 LIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYAT------ 122 (867) Q Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 122 (867) |+.-| .+.+..+-|-..+.+|-... . -......+.|-... +++..+++|-+.|--. T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~-------------~-~~~~~i~~~i~~~~---~~~~~~~~~~~YY~g~~~i~~~ 62 (474) T protein:vir:97 1 MFNII-RMPWDKPYGEEVVEQLKPQF-------------E-TQEEMIVRLIDDHR---KQLDKITVGQRYYDKDNDIVKQ 62 (474) T ss_pred Ccccc-cccCCCchhhHHHHhhhhcc-------------c-CHHHHHHHHHHHHH---HHHHHHHHHHHHhccccchhcc Confidence 11000 00111111111111111000 0 01112222222221 1233444554433222 Q ss_pred --------------------cchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhh Q lcl|NC_011269. 123 --------------------HDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVG 181 (867) Q Consensus 123 --------------------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (867) +++.+.++|.+..|=++. +.+..+|+...++.++. | +-|+...+.+ +++.....| T Consensus 63 ~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~-~--~n~~~~~~~e-~~~~~~~~G 138 (474) T protein:vir:97 63 MKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDV-L--DTRWDNKLID-ILTATSNKG 138 (474) T ss_pred cchhccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHH-H--hccHHHHHHH-HHHHHhhcC Confidence 456788999999888877 99999999998887764 5 3456666666 568888889 Q ss_pred hhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhcccccccccccccccc-ccchhhhhhhhhHH Q lcl|NC_011269. 182 EVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVE-ETPSEREQRMREFQ 260 (867) Q Consensus 182 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 260 (867) .++-+--.++.+.. ...+++|+.+-+- |.....-+ ++-.+|.= ...+- ..+| -|+.+-.+...+-. T Consensus 139 ~~~~~~~~d~~~~~--~i~~~~p~~~~~v---~d~~~~~~-----~~~~ir~~-~~~~~--~~~~~yt~~~~~~y~~~~~ 205 (474) T protein:vir:97 139 IDWLQVYINENGEM--KLFRVPAEQAIPI---WVDKEREE-----LKSFIRYY-KFNNE--EKVEFWTDTTVTYYVLENG 205 (474) T ss_pred ceEEEEEecCCCee--EEEEEcccceEEE---EcCCCCCc-----eEEEEEEE-EecCe--EEEEEEeCCeEEEEEEcCC Confidence 87766545554332 3566777765433 22111111 00011100 00000 0000 01111000000000 Q ss_pred --HHHHhchH-HHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHH---hhhhchhhh Q lcl|NC_011269. 261 --DLQRRYPE-IIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA---DRLYSPLVL 334 (867) Q Consensus 261 --~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~ 334 (867) .....+.+ .++.-. .--.+..==|++++| .+.|.+.+-+ .+.|+ +.|+.+..-.| +.+..|.++ T Consensus 206 ~~~~~~~~~~~~~~~~~--~~~~~g~vPvv~~~n-----n~~g~sd~e~-v~~li--Da~n~~~s~~~~~~~~~~~~~lv 275 (474) T protein:vir:97 206 GLIPDYYYGANHVQSHF--SNGNWGRVPFIAFKN-----NPEEVSDIWM-YKSII--DAIDKRLSDAQNMFDESVELIYI 275 (474) T ss_pred ccccccccCcCcccccc--cccCCCccceEEecC-----CcCCCCcHHH-HHHHH--HHHHHHHHHHHHHHHHhcCceee Confidence 00000000 000000 000000001222332 2457776543 44444 33444333322 335556554 Q ss_pred hhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhh Q lcl|NC_011269. 335 ATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEAL 414 (867) Q Consensus 335 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 414 (867) ++ | .+|+. +++.++.++ .++++..+=+-.++++=..-..=.+...++.+++.|...-++-..- T Consensus 276 ~~-g----~~~~~-------~~~~~~~~~-----~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~ 338 (474) T protein:vir:97 276 LK-G----YEGED-------LEEFMRGLK-----YYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQ 338 (474) T ss_pred ee-c----CCccc-------chhhhhhhh-----ccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccC Confidence 43 3 22221 222233222 2333332222222222111111122256677777777766544311 Q ss_pred hcCCCccceehhhhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhhcccc--hheehhhccccchhhhhhhhhhhhh Q lcl|NC_011269. 415 ISGGTGGAYASSALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQGHYD--YDLKGGVRVPIYREIVEYDEETGQE 488 (867) Q Consensus 415 ~~~g~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d--~~~~~~~~~~~~rd~~~~k~e~~k~ 488 (867) . +..+++ .+.+.+.+. -++-...+..++..++++++.|.++.+.-. .+++..|.-..+.+ +.++++. T Consensus 339 ~-~~~~~n--~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~----~~e~a~~ 411 (474) T protein:vir:97 339 T-DKFGSA--PSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMN----DAEQSQI 411 (474) T ss_pred c-cccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccC----HHHHHHH Confidence 1 111222 234444332 233455667788888999988888765322 23343343333332 1122211 Q ss_pred HhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccc Q lcl|NC_011269. 489 YIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQD 568 (867) Q Consensus 489 ~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~ 568 (867) +. . .+ + + +.+..+..+ +...+ .+.+.++...|+.+...... ...+... T Consensus 412 --------~~-~--~g-~-i----S~et~l~~l------------~~v~D--~~~E~eri~~E~~~~~~~~~-~~~~~~~ 459 (474) T protein:vir:97 412 --------IA-Q--SQ-Y-L----SRETLVKSS------------PLVDD--YKAELERIEQEQMEYNKQLP-NLDDGGA 459 (474) T ss_pred --------HH-H--cC-C-C----CHHHHHHhC------------CCCCC--HHHHHHHHHHHHHHHHhhcc-ccCCCCC Confidence 00 0 00 0 0 111112211 11111 11222222222222111111 1111000 Q ss_pred cccccCCCCCccccc Q lcl|NC_011269. 569 LCDAQNLPYPPELAQ 583 (867) Q Consensus 569 ~~p~~g~P~pp~~aQ 583 (867) ......-........ T Consensus 460 ~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 460 DGAQQQEGSNNKESE 474 (474) T ss_pred CCcccCCCCcccccC Confidence 000000000000001 No 149 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=72.93 E-value=0.18 Score=24.71 Aligned_cols=417 Identities=12% Similarity=0.065 Sum_probs=142.8 Q ss_pred Hhcccccccceeeccch----hhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhc--------------------cch Q lcl|NC_011269. 70 YRKQGNFGSNMQIAMPK----IRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYAT--------------------HDL 125 (867) Q Consensus 70 ~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------~~~ 125 (867) |.+ .|+.+ +++ -..=|-.+.++.+ .++++.++++-+.|.-. +++ T Consensus 1 ~~~-----~~~~~-~~~~~~~~~~~~~~~i~~~~------~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~ 68 (489) T protein:vir:99 1 MLQ-----EDFEA-IDYESKLWIDQLKNYISRFK------AEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDF 68 (489) T ss_pred CCc-----cceee-eCCCCCCCHHHHHHHHHHHH------HHHHHHHHHHHHHhcccCccccccccccccCCcceeecch Confidence 111 11111 111 0000111111110 11122334443323222 346 Q ss_pred HHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccce--ehheec Q lcl|NC_011269. 126 VPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVW--SSEEIL 202 (867) Q Consensus 126 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 202 (867) .+.++|.+..|=++. +.++.+|+.+.++..+++- +-|+..+..+ ++++...-|.++-+-...+...-. -...++ T Consensus 69 ~~~iv~~~~~~l~g~~~~~~~~d~~~~~~l~~~~~--~n~~~~~~~~-~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~~ 145 (489) T protein:vir:99 69 AKYITVFEQGYMLGVPVEYKNENKDLQAAIDLMSV--RNNEDYHNVK-IKTDLSIYGRAYELLTVEKIDDKKTEVKLYQL 145 (489) T ss_pred HHHHHHHHhhhhccCCceeecCChhHHHHHHHHHh--hcChhHHHHH-HHHHHhhCCeEEEEEeeccCcCCCcceEEEEE Confidence 788899998888776 8999999999888777655 3344455555 558888888776544322211101 124456 Q ss_pred CcceeehhhhhhhcchHHHHHHHHHHhhccccccccc--cccccccc-cchhhhhhhhhHH------HHHHhchHHHhhh Q lcl|NC_011269. 203 NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAG--GNMSTVEE-TPSEREQRMREFQ------DLQRRYPEIIQAA 273 (867) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~-~~~~~~~~~~~~~------~~~~~~~~~~~~~ 273 (867) +|+-+-+-..-...++.+- +++.=-...+ ......|- |+.+ ..+.+.+. .+...+|.. T Consensus 146 ~p~~~~~v~dd~~~~~~~~--------~i~~~~~~~~~~~~~~~~~~y~~~~-i~~~~~~~~~~~~~~~~~~~~~~---- 212 (489) T protein:vir:99 146 PAEQTFVIYDDTYQRNSLM--------AVHFYDIDYGSGKRKQIIKAYTSDT-IYTYEDYNLETKGMRLKDYEGHF---- 212 (489) T ss_pred cccceEEEEcCCCCCceEE--------EEEEEEEecCCCceEEEEEEEeCCc-EEEEEecCCCcccceeccccccc---- Confidence 6665433211111111111 1110000000 00000000 1110 00000000 000000000 Q ss_pred ccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHH---hhhhchhhhhhhcccccCCCCcCCC Q lcl|NC_011269. 274 MQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA---DRLYSPLVLATLGIEDMGDGEPWIP 350 (867) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~ 350 (867) =..|+ |+++.| ...|++.+=. -..| -+.|+.+..-.+ +.+..|.++++ |. . ++ T Consensus 213 --~g~vP-----vv~~~n-----~~~~~s~~~~-v~~l--iDa~d~~~s~~~~~~~~~~~~~l~i~-g~-------~-~~ 268 (489) T protein:vir:99 213 --FKGVP-----VNEYAN-----NEERTGAYES-VLDN--IDAYDLSQSELANFQQDSVNALLVIA-GN-------A-YT 268 (489) T ss_pred --CCcee-----EEEeec-----CCCCCCchhh-hHHH--HHHHHHHHHHHHHHHHHhhhhhhhhc-cC-------C-cc Confidence 00011 233333 2356665422 1222 233433332222 23555655543 31 1 22 Q ss_pred C---HHHHHHHHHHHHHhhhcchhhhhhhhheeeeecccc---------CccCch-------hHHHHHHHHHHHHhhccc Q lcl|NC_011269. 351 D---QGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGR---------ESVPNL-------DADYDRIERKLLQAWGIG 411 (867) Q Consensus 351 ~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~-------~~~~~~~~~~~~~~~~~~ 411 (867) + ++..+..+-+....+.. .++.-+.++-.+... -+-|+. ...++.+++.|..--++- T Consensus 269 ~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 344 (489) T protein:vir:99 269 GADENDYLDDGRLNPNGRLAI----SIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTP 344 (489) T ss_pred cccchhhhhhccccccccccc----ccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCc Confidence 2 12222222111111111 111111121111111 111221 234556666665544433 Q ss_pred hhhhcCCCccceehhhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHhhcccc---------hheehhhccccchhh Q lcl|NC_011269. 412 EALISGGTGGAYASSALNREF----VTQIMTGFQNALKRHIRRRCEVVAEAQGHYD---------YDLKGGVRVPIYREI 478 (867) Q Consensus 412 ~~~~~~g~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d---------~~~~~~~~~~~~rd~ 478 (867) ..... .-+++ .+.+.+.+ +-++....+..++..++++++-|.++-+... .+++..|+...+++. T Consensus 345 ~~~~~-~~~~n--~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~ 421 (489) T protein:vir:99 345 DTQDM-KFSGV--QSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQND 421 (489) T ss_pred ccccc-ccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCH Confidence 21111 11112 13333332 3445667778889999999998887644211 134444444444443 Q ss_pred hhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhh Q lcl|NC_011269. 479 VEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMA 558 (867) Q Consensus 479 ~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~ 558 (867) .+.-.-+.|+. .+ + +.+..+..+ +.-.....+.+.++...|....... T Consensus 422 ~~~~~~~~kl~---------------gi-i----s~et~~~~l------------~~v~~~d~~~E~~ri~~E~~~~~~~ 469 (489) T protein:vir:99 422 NEIVTAAQNLY---------------GI-V----SDQTIFEIL------------NTVTGVDAEAELKRLKEEADKKQSL 469 (489) T ss_pred HHHHHHHHHHh---------------cc-C----CHHHHHHhc------------CCCCchhHHHHHHHHHHHHHHHhcc Confidence 22111111110 00 0 111111111 1100001112222222222211111 Q ss_pred ccccccccccc-ccccCCCCCc Q lcl|NC_011269. 559 TAQAMKKVQDL-CDAQNLPYPP 579 (867) Q Consensus 559 taet~kkvq~~-~p~~g~P~pp 579 (867) .. .....+. -+....+..| T Consensus 470 ~~--~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 470 PE--PRLVGDASGQEEPTAEKP 489 (489) T ss_pred cc--ccccCCCCCCcCCCCCCC Confidence 10 0111000 0011111111 No 150 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=71.10 E-value=0.2 Score=24.41 Aligned_cols=419 Identities=11% Similarity=0.083 Sum_probs=151.2 Q ss_pred HHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhc------ Q lcl|NC_011269. 49 LIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYAT------ 122 (867) Q Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 122 (867) |.+-+.. ++..+-+. |+. =+--++.....++.++.. +...|+.|.+.|.-. T Consensus 1 m~~~~~~----------~~~~~~~~---~~~-----~~~~~~~~~~~~i~~~~~-----~~~~i~~~~~~Y~g~~~~~~~ 57 (499) T protein:vir:80 1 MINQIIA----------GVKGVMRR---MGL-----LKSLKDVTDHKKVNANDE-----DYKYIDMWKRLYQGNYAEWHN 57 (499) T ss_pred ChhHHHH----------HHHHHHHH---hcc-----ccchhhhhcCCCCcCCHH-----HHHHHHHHHHHhcCCcchhhc Confidence 3332221 12222111 111 111223333444444432 224577786655433 Q ss_pred ----------------cchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcc Q lcl|NC_011269. 123 ----------------HDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTS 185 (867) Q Consensus 123 ----------------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 185 (867) ..+-++++|-+..|=++. +.|+++|....++.++++= +-++..-+.+ ...+=+.+|.+.= T Consensus 58 ~~~~~~~~~~~~~~~s~n~~~~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~--~n~f~~~~~~-~~~~a~~~G~~~~ 134 (499) T protein:vir:80 58 LNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFNEKVKINIDDETAEEFVLNVLK--TNGFTKNMER-YIEYGEAMGGFVI 134 (499) T ss_pred cccccCCCccccceeecchHHHHHHHHHHhhhCCcceEeeCCHHHHHHHHHHHh--hccHHHHHHH-HHHHHhhcCcEEE Confidence 245577888888887777 8999999888887776542 2233333433 2244455677776 Q ss_pred hhhhhhhccceehheecCcceee-hhhhhhhcchHHHHHHHHHHhhccccccccccccccccc----------------- Q lcl|NC_011269. 186 LAHFNESLGVWSSEEILNPDMLR-VSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEE----------------- 247 (867) Q Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------- 247 (867) .--+++++..| ..+++|+.+- |. +- ..+|..++ +++....+ +..-|.-| T Consensus 135 ~~~~D~~~~~~--i~~v~a~~~~Pi~---~d-~~~~~~~~--f~~~~~~~-----~~~y~~lE~h~~~~~~~~~y~I~n~ 201 (499) T protein:vir:80 135 KVYHDGNKNVK--VSFATADCMYPLS---ND-SENVDECL--IANSFHKN-----NKYYKLLEWNEWKGEKEEVYTVTTE 201 (499) T ss_pred EEEECCCCcEE--EEEEcCCceEEEE---ec-CCCeEEEE--EEEEEeec-----CeEEEEEEEEEecccceeeEEEEEE Confidence 66667666555 3556666532 22 11 11221111 01111111 11111000 Q ss_pred ----cchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhh----hcCccccccCcchhhHHHHHHHHHHHHHH Q lcl|NC_011269. 248 ----TPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVV----NRPTAWATRGAPHLLRSFRTLMAEESLNA 319 (867) Q Consensus 248 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (867) +-.+.+-+.-.+.+++...++ . ..-.| ++.-++++++ |....=++.|.+.+=.+- .| .+.|+. T Consensus 202 ~~~~~~~~~lG~~v~l~~~~~~~~~---~-~~~~~--~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~-~l--id~lD~ 272 (499) T protein:vir:80 202 LYQSDDPNELGGKVSLKLLFNDIEP---V-VPLPS--LTRPTFIYIKPNIANNKNLTSPLGISVYANAL-DT--LKTLDL 272 (499) T ss_pred EEeccCccccCcccchhhhccCcCC---c-eeecC--CCccceEeecCCccccccCCCccCCchHhhHH-HH--HHHHHH Confidence 000111111111122111111 0 00011 1222333332 222222345777665543 22 233444 Q ss_pred HHHHHH-------hhhhchhhhhhhcccccCCCCcC-CCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCc-- Q lcl|NC_011269. 320 AQDAVA-------DRLYSPLVLATLGIEDMGDGEPW-IPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRES-- 389 (867) Q Consensus 320 ~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 389 (867) +-..++ .|++.|--+++.... +.|+.. .++. .+.. |+ .++++.-+.... T Consensus 273 ~~s~~~~e~~~~~~~i~v~~~~l~~~~~--~~g~~~~~~~~-----~~~~--------~~------~~~~~~~~~~~~i~ 331 (499) T protein:vir:80 273 MFDSYYQEFKLGKKKVLVPSSFVKTAVN--LDGSTTQYFDS-----TDEA--------FF------LYQGEQDDNGKAIK 331 (499) T ss_pred HHHHHHHHHHhcccceecchhhhhccCC--CCCCcccCCCc-----ccce--------ee------EeeccCCCCcCcee Confidence 443332 233444444443210 334432 1111 0110 11 011111111101 Q ss_pred c----Cchh---HHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHH----hhhhHHHHHh Q lcl|NC_011269. 390 V----PNLD---ADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHI----RRRCEVVAEA 458 (867) Q Consensus 390 ~----~~~~---~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~----r~~~~~i~e~ 458 (867) + +.++ +.++.+.+.|..+.|++...++.+.++. . .+...-..-|.++..++.+++.+ +++++-|.++ T Consensus 332 ~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~-~-TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~ 409 (499) T protein:vir:80 332 DISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGL-K-TATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEV 409 (499) T ss_pred EecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccc-h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1122 5677788899999999988877433322 1 22233234455655555555444 4444444432 Q ss_pred -h--ccc------chheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccc-cccchhhhhhhhhhhhhceeee Q lcl|NC_011269. 459 -Q--GHY------DYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTL-NLRDEAQERAFIAQLKGMGVPV 528 (867) Q Consensus 459 -q--~~~------d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~-~Lr~e~~~~~~v~qL~~~~~pi 528 (867) + +.. +..+.+.|...++.|. ++++....-.+..-.+-.+...... .+ ++.+++..+..++... T Consensus 410 ~~~~~~~~~~~~~~~~v~v~f~d~i~~d~---~~~~~~~~~~~~~Gi~S~et~l~~~~~~-~d~ea~~el~~i~~E~--- 482 (499) T protein:vir:80 410 GKLIKAYDGDTVELDTITVDFDDSIAQDE---DTTINRYTTAKNQGMIPLKIALQRAWNI-TEAEADEWAEMLAKEK--- 482 (499) T ss_pred HHHhccccCCCCCccceEEEeCCCCCCCH---HHHHHHHHHHHHcCCCCHHHHHhhcCCC-ChHHHHHHHHHHHHHh--- Confidence 1 111 1123333332222221 1111111000000001111111100 01 1111111122211111 Q ss_pred eccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 529 SDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 529 td~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) ... ... .+..+.-...+ T Consensus 483 ------------------------~~~-------~~~----~d~~g~~ge~e 499 (499) T protein:vir:80 483 ------------------------QAE-------IPN----NDMTGIFGEEE 499 (499) T ss_pred ------------------------hcC-------CCC----CCccccCCCCC Confidence 000 000 00000000011 No 151 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=70.91 E-value=0.2 Score=24.38 Aligned_cols=442 Identities=11% Similarity=0.087 Sum_probs=151.2 Q ss_pred hhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHH--------- Q lcl|NC_011269. 38 AALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEE--------- 108 (867) Q Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------- 108 (867) .-|-|.. .+..+.+ -|++. .+.+..| ...-++ ..++-...|.++..+ T Consensus 1 ~~~~~~~--~~~~~~~--------~~~~~--~~~~~~n----~~~~~~--------~~e~~~~~~~~~i~~~i~~~~~~~ 56 (511) T protein:vir:93 1 MLKVNEF--ETDTDLR--------GNINY--LFNDEAN----VVYTYD--------GTESDLLQNVNEVSKYIEHHMDYQ 56 (511) T ss_pred Cccccch--hhhhhhh--------hhhhh--hhhhhhC----Cccccc--------chhhhhhccHHHHHHHHHHHHHhh Confidence 1111100 0112222 23333 2222222 111111 111122222222111 Q ss_pred HHHHHHHHHHHhh-cc---------------------chHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccH Q lcl|NC_011269. 109 LRVIRHWCRLFYA-TH---------------------DLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNY 165 (867) Q Consensus 109 ~~~~~~~~~~~~~-~~---------------------~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (867) +..++++- .||. .| ++.+.++|.+..|=++. +.++.+|+...++..+.+= +-|+ T Consensus 57 ~~r~~~l~-~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~--~n~~ 133 (511) T protein:vir:93 57 RPRLKVLS-DYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEVIEAFND--LNDV 133 (511) T ss_pred HHHHHHHH-HHhcccCccccccCcCcccccCcceeecchHHHHHHHHhhhhcccCeeeccCChHHHHHHHHHHh--hcCH Confidence 11112222 2442 33 45588999999988776 8999999988888776643 4456 Q ss_pred HHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccc--cccccccc Q lcl|NC_011269. 166 LEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGP--TTAGGNMS 243 (867) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 243 (867) ..++.+ +++...+-|.++-+--.+|.+... ..+++|.-+-+--+-...++.+- ++|-=- ..++.... T Consensus 134 ~~~~~~-~~~~~~~~G~ay~~vy~de~~~~~--i~~~~p~~~~~vydd~~~~~~~~--------~vr~~~~~~~~~~~~~ 202 (511) T protein:vir:93 134 ESHNRS-LGLDLSIYGKAYELMIRNQDDETR--LYKSDAMSTFVIYDNTIERNSIA--------GVRYLRTKPIDKTDED 202 (511) T ss_pred hHHHHH-HHHHHHhcCeeEEEEEeCCCCceE--EEEEccceeEEEEcCCCCCceEE--------EEEEEEeeeccccccc Confidence 666666 558888888887766556654432 45667765433211111122111 110000 00000000 Q ss_pred cccccchhhhhhhhhHHHHHHhchH-HHhhh-ccCCCCcccHHHH---HHhhhcCccc----cccCcchhhHHHHHHHHH Q lcl|NC_011269. 244 TVEETPSEREQRMREFQDLQRRYPE-IIQAA-MQNDGLDISEALI---SRVVNRPTAW----ATRGAPHLLRSFRTLMAE 314 (867) Q Consensus 244 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~~~~~---~~~~~~~~~~----~~~~~~~~~~~~~~~~~~ 314 (867) ++ .+. +++ .++ |..-. ..+.++.+....+ .|-...-+-+ ..+|.+.+=.. ..|+ T Consensus 203 ~~--------~~~----~iy--t~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v-~~li-- 265 (511) T protein:vir:93 203 EV--------FTV----DLF--TSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKV-ITLI-- 265 (511) T ss_pred eE--------EEE----EEE--eCCcEEEEEecCCCccccccccccccccCCCccceEEecCCCCCCCchhhH-HHHH-- Confidence 00 000 000 111 11100 1111222222111 1221121211 24666655432 3333 Q ss_pred HHHHHHHHHHH---hhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhh-hcchhhhhhhhheeeeeccccC-- Q lcl|NC_011269. 315 ESLNAAQDAVA---DRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLL-AADFRLMVHNFGLKVENVFGRE-- 388 (867) Q Consensus 315 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~-- 388 (867) +.|+.+..-.| +.+..|.++++ |- .+. +-+++...++ ..+ ..+-...+...++..+.-+..+ T Consensus 266 Da~d~~~S~~~~~~~~~~~~~lv~~-G~----~~~----~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~l 333 (511) T protein:vir:93 266 DLYDNAESDTANYMSDLNDAMLLIK-GN----LNL----DPVEVRKQKE---ANVLFLEPTVYADSEGRETEGSVDGGYI 333 (511) T ss_pred HHHHHHHHHHHHHHHHhhCcceeee-cC----ccc----Cchhhccccc---ccceecccccccccccccCCCCcceeEE Confidence 33333333322 23445555444 31 111 1111111111 111 0111111112222221111110 Q ss_pred -ccCc---hhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHh-- Q lcl|NC_011269. 389 -SVPN---LDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREF----VTQIMTGFQNALKRHIRRRCEVVAEA-- 458 (867) Q Consensus 389 -~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~-- 458 (867) +-++ +...++++++.|..--++-.... ++.+++ .|.+.+.+ +-++....+..++..++++++.|..+ T Consensus 334 ~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~-~~~~~n--~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~ 410 (511) T protein:vir:93 334 YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD-DNFSGT--QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILK 410 (511) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCccccc-cccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1112 22556666666665554443211 122222 23333333 23445566777888888888888664 Q ss_pred -hcccc-----hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccc Q lcl|NC_011269. 459 -QGHYD-----YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKT 532 (867) Q Consensus 459 -q~~~d-----~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t 532 (867) ++..+ ..++..|.-..+++..+.-.-+.++.- .+- .+..+..+ T Consensus 411 ~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~g-----~iS---------------~et~~~~l----------- 459 (511) T protein:vir:93 411 NTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGG-----KIS---------------QTTLMSLF----------- 459 (511) T ss_pred hccCcccccccccceEEeCCCCCCCHHHHHHHHHHHhc-----cCc---------------hHHHHHhC----------- Confidence 22222 134555555455443222211111110 000 01111111 Q ss_pred cCCCcccccchhhhhhHHHHHHHHhhcc-cc---cccccccccccCCCCCccccccc Q lcl|NC_011269. 533 LAVNIDMKFDQELERQADETVQKLMATA-QA---MKKVQDLCDAQNLPYPPELAQHL 585 (867) Q Consensus 533 ~p~tiqme~E~e~e~k~~E~l~tL~~ta-et---~kkvq~~~p~~g~P~pp~~aQ~p 585 (867) +...+. +.+.++...|.-..+.... .. .................. +.- T Consensus 460 -~~v~d~--~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 511 (511) T protein:vir:93 460 -SFFQDP--ELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVD--KKE 511 (511) T ss_pred -CCCCCH--HHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCccccccc--ccC Confidence 000000 0111111111111111000 00 000000000000000000 000 No 152 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=70.51 E-value=0.21 Score=24.32 Aligned_cols=431 Identities=10% Similarity=0.086 Sum_probs=145.7 Q ss_pred eeeccchhhhhh-----hhhHHhhCCCchhhhHHHHHHHHHHHHHhh-------------------ccchHHHHHHhhhh Q lcl|NC_011269. 80 MQIAMPKIRQPL-----GTLADKGIPFNVEDEEELRVIRHWCRLFYA-------------------THDLVPLLIDIYSK 135 (867) Q Consensus 80 ~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~ 135 (867) |-|.+..-.-=. .....+.|-...+... .++++-+.|.. .+.+.+.++|.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~---~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~ 77 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQNRKK---RLDKLSDYYNGKQEIEKHEFDNATVEAANVMVNHAKYITDMNVG 77 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHHHHH---HHHHHHHHhccccchhcCCcCcCCCCcceeecchHHHHHHHHhh Confidence 222222111000 1111222211111111 22222222222 34567889999999 Q ss_pred ccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceeh---------------h Q lcl|NC_011269. 136 FPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSS---------------E 199 (867) Q Consensus 136 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~ 199 (867) |=++. +.++++|+...+.+.+. | .+-|+..++.+ +++.-..-|.++-+-..++.+..|.. . T Consensus 78 ~l~g~p~~~~~~~~~~~~~l~~~-~-~~n~~~~~~~~-~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~ 154 (499) T protein:vir:10 78 FMTGNPVKYVAEKGKNIDDILEV-F-NQIDIHKHDIE-LEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKI 154 (499) T ss_pred hhcccCceeecCChhHHHHHHHH-H-hhcCHhHHHHH-HHHHHHhcCceEEEEEecccccccccccccccccccccceEE Confidence 88876 88988888777766665 4 34455566776 55888888888776666665554432 3 Q ss_pred eecCcceeehhhhhhhcchHHHHHHHHHHhhccccccc--c-ccccccccc-cchhhhhhhhh--HHHHHHhchHHHhhh Q lcl|NC_011269. 200 EILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTT--A-GGNMSTVEE-TPSEREQRMRE--FQDLQRRYPEIIQAA 273 (867) Q Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~~-~~~~~~~~~~~--~~~~~~~~~~~~~~~ 273 (867) .+|+|.-+-+ +-++.+. +.++-.++.--+. + +.....+|- |+ ++..+.+. -..+...++++-... T Consensus 155 ~~v~p~~~~~-----v~~d~~~---~~~~~~i~~~~~~~~~~~~~~~~~~iyt~-~~i~~~~~~~~~~~~~~~~~~~~~~ 225 (499) T protein:vir:10 155 EVIDPRATVV-----VCDDTVE---HDPLFAVFTQEKKDLEGNTNGYSITVYMP-QRIVEYRTKTTMEVSANDPIVYDGE 225 (499) T ss_pred EEEcccceEE-----EecCCCC---cceEEEEEEEEEeecCCCceEEEEEEEeC-CeEEEEEecCCccccCcceeccccc Confidence 4455532211 0000000 0011111110000 0 000000010 00 01000000 000011111111100 Q ss_pred ccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHH---HhhhhchhhhhhhcccccCCCCcCCC Q lcl|NC_011269. 274 MQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAV---ADRLYSPLVLATLGIEDMGDGEPWIP 350 (867) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 350 (867) -.=.-|+ |++..| ..+|.+.+-. .+.|+ +.|+.+..-+ .+.+..|.++++ | .. +. T Consensus 226 ~~~g~vP-----vv~~~n-----~~~~~~d~e~-v~~li--D~~~~~~S~~~~~~~~~~~~~lv~~-G----~~----~~ 283 (499) T protein:vir:10 226 NLFGAVP-----IIEFRN-----NEERQGDFEQ-LISLI--DAYNLLQTDRISDKEAFVDALLVTF-G----FG----LG 283 (499) T ss_pred CCCCccc-----eEEecC-----CCCCCCchHh-HHHHH--HHHHHHHHHHHHHHHHhcCceeeee-c----Cc----cc Confidence 0001122 223333 2446665543 33333 2233332222 235666776665 2 11 11 Q ss_pred CHH-HHHHHHHHHHHhhhcchhhhhh-hhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCC-Cccceehhh Q lcl|NC_011269. 351 DQG-ELDEVRDDMQSLLAADFRLMVH-NFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGG-TGGAYASSA 427 (867) Q Consensus 351 ~~~-~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~~~~~~~~ 427 (867) +.. ....++. -..+++. .=+.+++++=.....=.+...++.+++.|..--++.. ++.+ .+++ .|. T Consensus 284 ~~~~~~~~~~~--------~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~~gn--~Sg 351 (499) T protein:vir:10 284 DDKDDIQRLKR--------GAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPN--MNDEKFMGN--VSG 351 (499) T ss_pred cccchhhhhhh--------cceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCccc--CCchhhccc--chH Confidence 111 1111111 0000000 0011111110000111123556677776665443322 1111 1122 133 Q ss_pred hhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHhhcc--cch---heehhhccccchhhhhhhhhhhhhHhhhhhhhhh Q lcl|NC_011269. 428 LNREF----VTQIMTGFQNALKRHIRRRCEVVAEAQGH--YDY---DLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLI 498 (867) Q Consensus 428 ~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~q~~--~d~---~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~ 498 (867) +.+.+ +-++.-..+..++..++++++-|.++-+. .+. .++..|+...+++..+.-.-+.|+. T Consensus 352 ~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~--------- 422 (499) T protein:vir:10 352 EAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGANDDASGCKISLVANIPSNLSDVVNNVKNAD--------- 422 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHHh--------- Confidence 33322 23555667788888899999888775321 111 2344444444443322111111110 Q ss_pred hhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccc-------ccccccc Q lcl|NC_011269. 499 PEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMK-------KVQDLCD 571 (867) Q Consensus 499 ~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~k-------kvq~~~p 571 (867) .+ .+.+..+.. ++...+.. .+.++...|....+....+... ......+ T Consensus 423 ------g~-----iS~et~~~~------------l~~v~d~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (499) T protein:vir:10 423 ------GI-----IPRKYTYSW------------LPDVDNPQ--DVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQD 477 (499) T ss_pred ------cc-----CChHHHHHh------------CCCCCCHH--HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCc Confidence 00 011111111 11111111 1111111111111111000000 0111111 Q ss_pred ccCCCCCccccccccccccCCCCCCCCCCC Q lcl|NC_011269. 572 AQNLPYPPELAQHLQSTLALRQGKTQTELG 601 (867) Q Consensus 572 ~~g~P~pp~~aQ~p~~t~~~a~gpgq~~~~ 601 (867) ....+.+... .....+++..+. T Consensus 478 ~~~~~~~~~~--------~~~~~~~~~~~~ 499 (499) T protein:vir:10 478 DSSENDKEAG--------SNHNQSHRTRAV 499 (499) T ss_pred ccCCCCCCCc--------cccccCCCCCCC Confidence 1111111111 111122222222 No 153 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=68.36 E-value=0.24 Score=24.00 Aligned_cols=407 Identities=11% Similarity=0.059 Sum_probs=151.3 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhcc Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATH 123 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 123 (867) |+..-+...+....+-...|+.+- +|-+-. |++- ...+.+|....... +.+--||+. T Consensus 1 m~~~~i~~L~~~~~~~~~r~~~~~-~yy~g~--------------~~~~-~~~~~~p~~~~~~~--~~v~nw~~~----- 57 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGVDKRY-RYYAMD--------------DRDD-TRSIVMPNNVREMY--RSVLEWTAK----- 57 (422) T ss_pred CChHHHHHHHHHHHHHHHHHHHHH-HHHhcC--------------CChh-hcCccccHHHHHHH--HhhcchhHH----- Confidence 665555444444445555555442 332222 1211 11122332222221 222357764 Q ss_pred chHHHHHHhhhhcccccceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecC Q lcl|NC_011269. 124 DLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILN 203 (867) Q Consensus 124 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (867) +||.+...=+++ -|++.|..+.+-+.+--| +..+.+ .-+.-++-|..|-+--=++.+|.- +..+++ T Consensus 58 -----~Vd~~a~rl~~~-Gf~~~d~~l~~~w~~N~l--d~~~~~-----~~~~al~~G~sf~~v~~~~~~~~p-~i~~~s 123 (422) T protein:vir:97 58 -----GVDSLADRIIFR-EFTNDDFNAWEIFKANNP--DIFFDT-----AIQSALIASCCFVYIMPGAEDGLP-KMQVIE 123 (422) T ss_pred -----HHHHHHhccccc-eeeCCchhHHHHHHhcCh--HHHHHH-----HHHHHHHhcceeEEEeeCCCCCee-EEEEec Confidence 466665422222 246677767666554222 111222 223333444433332222222210 122233 Q ss_pred cceeehhhhhhhcchHHHHHHHH--HHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcc Q lcl|NC_011269. 204 PDMLRVSRSMFVQRERVQLMVKD--LVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDI 281 (867) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 281 (867) |..+-+- | +++...+..+ +.+.-..| ....+...+.++..+.++-. .++-+ - -.+ T Consensus 124 p~~~~~i---~--D~~~~~~~~a~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~-----~~~~~---~----~~~ 180 (422) T protein:vir:97 124 ASKATGI---L--DPTTFLLTEGYAILESDSNG------NPTLEAYFTDKDIWYYPKKG-----KPYNI---K----NPT 180 (422) T ss_pred hhhEEEE---E--eCCCCcceeeEEEEEecCCC------cEEEEEEEcCceEEEEcCCC-----ccccc---c----CCC Confidence 3322211 1 1111111000 01111111 11111111222111111100 01100 0 112 Q ss_pred cHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhh-chhhhhhhcccccCCCCcCCCCHHHHHHHHH Q lcl|NC_011269. 282 SEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLY-SPLVLATLGIEDMGDGEPWIPDQGELDEVRD 360 (867) Q Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 360 (867) ..-.|++++|++..-...|.+-+-+...+|...-.---.+-+++..++ .|.|.+ +|-.. +|.+ .+..+. T Consensus 181 g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i-~G~d~--d~~~-------~~~~~~ 250 (422) T protein:vir:97 181 GHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYV-LGMDP--DAKP-------MEKWRA 250 (422) T ss_pred CCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhh-cccCc--cccc-------Cchhhh Confidence 233578999999888889988665555555443333333434444444 455544 45322 2333 122222 Q ss_pred HHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehh---hhhHHHHHHHH Q lcl|NC_011269. 361 DMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASS---ALNREFVTQIM 437 (867) Q Consensus 361 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~---~~~~~~~~~~~ 437 (867) .+-.+++.+-- =...+++|-.. .....=|+-+-++.+-+.|.-.=+|....+. |.+-|-+|+ ...+.-+.++- T Consensus 251 ~~~~i~~~~~d--e~~~~~~v~q~-~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg-~~~~NpsSa~Ai~a~~~~L~~ka 326 (422) T protein:vir:97 251 TVSTLLEISKD--EDGDKPTVGQF-TTASMAPFMEHLKMYASLFAGGSGLTLDDLG-FPSDNPSSVESIKAAHENLRAAG 326 (422) T ss_pred hhhhhhccCCC--CCCCcceeeec-CCCChhHHHHHHHHHHHHHhcccCCCHHHhc-cccCchhHHHHHHHHHHHHHHHH Confidence 22234333200 00011222111 1111112223333333333333366655554 555432221 23344445555 Q ss_pred HHHHHHHHHHHhhhhHHHHHhhcccch------heehhhccccchhhhhhhhhh---hhhHhhhhhhhhhhhhccccccc Q lcl|NC_011269. 438 TGFQNALKRHIRRRCEVVAEAQGHYDY------DLKGGVRVPIYREIVEYDEET---GQEYIRKVPKLLIPEIKFSTLNL 508 (867) Q Consensus 438 ~~~~~~l~~~~r~~~~~i~e~q~~~d~------~~~~~~~~~~~rd~~~~k~e~---~k~~~r~~~k~i~~~i~~~~~~L 508 (867) -..++..+..++++.|-+.++.+..+. ++...++-..+.+..+.-... -|+.-- ..-...-++....+-+ T Consensus 327 ~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a-~~~~~~~~~~~~~lg~ 405 (422) T protein:vir:97 327 RKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQA-IPGFMDADVIRDLTGV 405 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCcccchhhccceEEEccCCCCChHHHHHHHHHHHHHHhh-ccccccHHHHHHHcCC Confidence 666777788888888888888775432 122223211122221111100 010000 0000111222222222 Q ss_pred cchhhhhhhhhhhhhce Q lcl|NC_011269. 509 RDEAQERAFIAQLKGMG 525 (867) Q Consensus 509 r~e~~~~~~v~qL~~~~ 525 (867) .+...+...+.+++.-+ T Consensus 406 ~~~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 406 KGADKPIPAITEVTTDG 422 (422) T ss_pred CchhHHHHHHHhhhccC Confidence 33333333444443333 No 154 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=68.07 E-value=0.24 Score=23.96 Aligned_cols=406 Identities=14% Similarity=0.083 Sum_probs=147.1 Q ss_pred HHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccc-ce Q lcl|NC_011269. 64 RQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVG-ME 142 (867) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 142 (867) +.++.+|+.--.++++-...... +.+ ....+ -| ..|+.|.|+...||+.-+..+.. ++ T Consensus 1 ~~~~D~~~n~~~gg~~~~~~~~~-------------~~~-~~~~~------l~-a~Y~~~~l~~~~Vd~~aed~~r~g~~ 59 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGSEIYGS-------------LQN-QAPTI------LA-SLYADNALVRRIIDTIPETALAAGFH 59 (422) T ss_pred CccchhhHHHHcCCCCCccccCc-------------ccc-cCHHH------HH-HHHHhChhhHHHHhhhhHHHhcCCcc Confidence 33333333321111110000000 100 11111 34 45999999999999999888875 88 Q ss_pred ecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHH Q lcl|NC_011269. 143 FDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQL 222 (867) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (867) |.++|+.- + ++ ..| ++|++-+-|.|.+-.. -.-|...=+-+-. ..+.|+.- ||++- .|+. |..=++.++ T Consensus 60 i~~~~~~~-~-~~-~~~-~~l~~~~~l~~a~~~~-rl~G~a~i~i~v~-d~~~~~~P--l~~~g-~~~~--l~v~d~~~i 128 (422) T protein:vir:10 60 IDGIDDEP-A-FW-SRW-DDLEMTQNINDAWSWA-RLFGGAAIVAIVK-DNRALTSP--VREGA-ELET--VRVYDRTQV 128 (422) T ss_pred ccCCCHHH-H-HH-HHH-HHhhHHHHHHHHHHhh-ccccceEEEEEec-CCCCcccc--ccccC-ceee--EEeeccccc Confidence 88887642 3 33 345 7999999998866322 2223332222222 23445533 22220 1110 111122222 Q ss_pred HHHHHHhhcccccccccccc---ccccccchhhhhhhhhHHHHHHhchH-HHhhhccCCCCcccHHHHHHhhhcCccccc Q lcl|NC_011269. 223 MVKDLVDHLRQGPTTAGGNM---STVEETPSEREQRMREFQDLQRRYPE-IIQAAMQNDGLDISEALISRVVNRPTAWAT 298 (867) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 298 (867) .+..+ -+.|++..--. -++...-..+- +.-+|. +|. . +|-++++.+ ...+.. T Consensus 129 ~~~~~----~~dp~s~~fg~P~~y~v~~~~~~~~---------~~iH~SRli~---~-~g~~~p~~~-------~~~~~~ 184 (422) T protein:vir:10 129 KVQTR----EENPRNARFGEPLTYRITTNESDMF---------YDVHYSRIHI---I-DGERIPNVM-------RRQNDG 184 (422) T ss_pred cchhc----ccCccccccCcceEEEEecCCCCcc---------eeeccceeEE---e-CCCCchhhh-------cccCCc Confidence 22111 11222221000 00000000000 111222 111 0 222233221 245667 Q ss_pred cCcchhhH-HHHHHHHHHHHHHHHHHHHhhhhchhhhhhhc--ccccCCCCcCCCCHHHHHHHHHHHHHhhhc---chhh Q lcl|NC_011269. 299 RGAPHLLR-SFRTLMAEESLNAAQDAVADRLYSPLVLATLG--IEDMGDGEPWIPDQGELDEVRDDMQSLLAA---DFRL 372 (867) Q Consensus 299 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 372 (867) ||.+.|.| ||-.|..-++-...=..+..|-. ++.+|+- .+-+++|.. ..+++.-+...... ..-| T Consensus 185 ~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~--~~v~~~~~l~~~~~~~~~-------~~~~~~r~~~~~~~~~~~~~~ 255 (422) T protein:vir:10 185 WGRSVLSSDILDSIKDYTNCERLATQLLKRKQ--QAVWKAKGLAELCDDSEG-------FGAARLRLAQVDNNSGVGQAI 255 (422) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhc--cccccchhHHHhcCCccc-------hHHHHHHHHHHHHhcCCccce Confidence 99999997 78877766655554444554543 2222221 111222222 22333311111111 1111 Q ss_pred hhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhh-HHHHHHHHHHHH-HHHHHHHhh Q lcl|NC_011269. 373 MVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALN-REFVTQIMTGFQ-NALKRHIRR 450 (867) Q Consensus 373 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~~~~~-~~l~~~~r~ 450 (867) +|-.=+=++|.+-++ +=.|++=++..+..|--+.||--..+-|=.-..+.++.=+ +.--...+-.+| ++|+-.+.+ T Consensus 256 ~l~~~~e~~e~~~~~--lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~i~~~Qe~~l~p~l~~ 333 (422) T protein:vir:10 256 GIDAESEEYSVLNSD--IGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALETFHKLVDRKRNAELLPILEF 333 (422) T ss_pred eEecCCcceEEEecc--cCChHHHHHHHHHHHHhhhCCCeeeeccCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111112222222 1246788888899999999998888863333333221100 000111122223 123333333 Q ss_pred hhHHHHHhhcccchheehhhccccch---hhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceee Q lcl|NC_011269. 451 RCEVVAEAQGHYDYDLKGGVRVPIYR---EIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVP 527 (867) Q Consensus 451 ~~~~i~e~q~~~d~~~~~~~~~~~~r---d~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~p 527 (867) +.+.|.- . .++++.|+-|... +..+..+++++-. ..-+.-+ +.+...+...+.+- ....- T Consensus 334 l~~~i~~-s----~~~~~~f~pL~~~sekekaei~~~~a~a~--------~~~~~~g---~i~~~e~r~~L~~~-~~~~~ 396 (422) T protein:vir:10 334 LIPFIVN-A----EEWSVEFNPLAQESSKDKAEILEKNVNSI--------AALIAAG---AMDIDEARDTLRTI-APEVK 396 (422) T ss_pred HHHHhcc-c----CCcEEEeCCCCCCCHHHHHHHHHHHHHHH--------HHHHhcC---CCCHHHHHHHhhhh-ccccc Confidence 3333321 0 1344455544332 2222222222110 0001111 11222222222211 00001 Q ss_pred eeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccc Q lcl|NC_011269. 528 VSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQD 568 (867) Q Consensus 528 itd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~ 568 (867) +.+...... ..+.+.. ..+.. .+-++ T Consensus 397 ~~~~~~~~~-----~~~~~~~---------~~~~~-~~~~d 422 (422) T protein:vir:10 397 INDGSVETE-----VTISETS---------NDPLE-VPTDD 422 (422) T ss_pred CCCCCCccc-----cchhhcC---------CCCCC-CCCCC Confidence 111110000 0000000 11110 11111 No 155 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=65.45 E-value=0.28 Score=23.59 Aligned_cols=439 Identities=12% Similarity=0.127 Sum_probs=148.3 Q ss_pred cCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHH---------HHHH Q lcl|NC_011269. 43 TVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEEL---------RVIR 113 (867) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~ 113 (867) -.--|..+ ..-.+-.||+- .+.+-. |+..-+++ ..+-...+.++..++ ..++ T Consensus 1 ~~~~~~~~-----~~~~~~~~~~~--~~~~~~----n~~~~~~~--------~e~~~~~~~~~i~~~i~~~~~~~~~r~~ 61 (511) T protein:vir:96 1 MLKVNEFE-----TDTDLRGNINY--LFNDEA----NVVYTYDG--------TESDLLQNVNEVSKYIEHHMDYQRPRLK 61 (511) T ss_pred Cccccchh-----hhhhhhhhhhh--hhhhhh----CCcccccc--------hhhhhhcCHHHHHHHHHHHHHhhhHHHH Confidence 00111110 01133355554 233332 22111111 111122222221111 1122 Q ss_pred HHHHHHhh-ccc---------------------hHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhH Q lcl|NC_011269. 114 HWCRLFYA-THD---------------------LVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLP 170 (867) Q Consensus 114 ~~~~~~~~-~~~---------------------~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 (867) ++ +.||. .|+ +.+.++|.+..|=++. +.++++|+...++..+++= +-|+..++. T Consensus 62 ~l-~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~--~n~~~~~~~ 138 (511) T protein:vir:96 62 VL-SDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFND--LNDVESHNR 138 (511) T ss_pred HH-HHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCchHHHHHHHHHHh--hcChhHHHH Confidence 22 23453 444 4468899998887665 8888888888887766543 345555555 Q ss_pred HHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccc---cccccc Q lcl|NC_011269. 171 DQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGN---MSTVEE 247 (867) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~ 247 (867) + +++...+-|.++-+--.++.+.. +..+++|.-+-+-.+-...++.+-.+ ..-.... .++.. ....|- T Consensus 139 ~-~~~~~~~~G~a~~~vy~d~dg~~--~i~~~~p~~~~~v~dd~~~~~~~~~v-----r~~~~~~-~~~~~~~~~~~~~v 209 (511) T protein:vir:96 139 S-LGLDLSIYGKAYELMIRNQDDET--RLYKSDAMSTFIIYDNTVERNSIAGV-----RYLRTKP-IDKTDEDEVFTVDL 209 (511) T ss_pred H-HHHHHHhcCeeEEEEEeCCCCce--EEEEEcccceEEEEcCCCCCceEEEE-----EEEEeee-ccccccceEEEEEE Confidence 5 55777888887665555554332 34556665543321111111111100 0000000 00000 000000 Q ss_pred -cchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHH------------HHHhhhcCccccccCcchhhHHHHHHHHH Q lcl|NC_011269. 248 -TPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEAL------------ISRVVNRPTAWATRGAPHLLRSFRTLMAE 314 (867) Q Consensus 248 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (867) |+. +..+ | .-..+.++.+.... |+++.| ..+|.+.+=. ...|+ T Consensus 210 yt~~-~i~~----------~-----~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e~-v~~li-- 265 (511) T protein:vir:96 210 FTSH-GVYR----------Y-----LTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYEK-VITLI-- 265 (511) T ss_pred EeCC-cEEE----------E-----EecCCCcccccccccccccCcCcccceEEecC-----CCCCCCchhh-hHHHH-- Confidence 110 0000 0 00011111111111 122222 2356665443 23333 Q ss_pred HHHHHHHHHHHhhh--h-chhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHh-hhcchhhhhhhhheeeeeccccC-- Q lcl|NC_011269. 315 ESLNAAQDAVADRL--Y-SPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSL-LAADFRLMVHNFGLKVENVFGRE-- 388 (867) Q Consensus 315 ~~~~~~~~~~~~~~--~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-- 388 (867) +.|+.+..-.++-. + .|+++++ |- .+.. -++++..++ .. +..+-.++++..+...+.-+..+ T Consensus 266 Da~~~~~S~~~~~~~~~~~~~lv~~-G~----~~~~----~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~l 333 (511) T protein:vir:96 266 DLYDNAESDTANYMSDLNDAMLLIK-GN----LNLD----PVEVRKQKE---ANVLFLEPTVYVDAEGRETEGSVDGGYI 333 (511) T ss_pred HHHHHHHHHHHHHHHHhhcchhhee-cC----ccCC----chhhccccc---ccceeccccceeccccccCCCCcceeEE Confidence 34554444444322 2 2332222 21 1111 112222211 11 11112222222222222211111 Q ss_pred ----ccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHhhc Q lcl|NC_011269. 389 ----SVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREF----VTQIMTGFQNALKRHIRRRCEVVAEAQG 460 (867) Q Consensus 389 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~q~ 460 (867) ..=.++..++++++.|..--++-.. ..+..+++ .|.+.+.+ +-++....+..++..++++++.|.++-+ T Consensus 334 ~~~~~~~~~e~~~~~L~~~I~~~s~~P~~-~~~~~~~n--~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~ 410 (511) T protein:vir:96 334 YKQYDVQGTEAYKDRLNSDIHMFTNTPNM-KDDNFSGT--QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILK 410 (511) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCccc-cccccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1112235566666666655443321 11112222 23344433 2333455667778888888888877533 Q ss_pred ---ccc-----hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccc Q lcl|NC_011269. 461 ---HYD-----YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKT 532 (867) Q Consensus 461 ---~~d-----~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t 532 (867) ..+ .+++..|....+++..+.-.-+.++. -.+-. +..+..+ T Consensus 411 ~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~-----G~iS~---------------et~l~~l----------- 459 (511) T protein:vir:96 411 NTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG-----GKISQ---------------TTLMSLF----------- 459 (511) T ss_pred hcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh-----ccCCh---------------HHHHHhC----------- Confidence 211 23555555555555332211111111 00101 1111110 Q ss_pred cCCCcccccchhhhhhHHHHHHHHhhccc-ccccccccccccCCCC-Cccccccc Q lcl|NC_011269. 533 LAVNIDMKFDQELERQADETVQKLMATAQ-AMKKVQDLCDAQNLPY-PPELAQHL 585 (867) Q Consensus 533 ~p~tiqme~E~e~e~k~~E~l~tL~~tae-t~kkvq~~~p~~g~P~-pp~~aQ~p 585 (867) +...+ .+.+.++...|.-..+..... ........-....... .....+.- T Consensus 460 -~~v~d--~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 460 -SFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred -CCCCC--HHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 00000 011111111111111110000 0000000000000000 00000000 No 156 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=65.45 E-value=0.28 Score=23.59 Aligned_cols=439 Identities=12% Similarity=0.127 Sum_probs=148.3 Q ss_pred cCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHH---------HHHH Q lcl|NC_011269. 43 TVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEEL---------RVIR 113 (867) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~ 113 (867) -.--|..+ ..-.+-.||+- .+.+-. |+..-+++ ..+-...+.++..++ ..++ T Consensus 1 ~~~~~~~~-----~~~~~~~~~~~--~~~~~~----n~~~~~~~--------~e~~~~~~~~~i~~~i~~~~~~~~~r~~ 61 (511) T protein:vir:78 1 MLKVNEFE-----TDTDLRGNINY--LFNDEA----NVVYTYDG--------TESDLLQNVNEVSKYIEHHMDYQRPRLK 61 (511) T ss_pred Cccccchh-----hhhhhhhhhhh--hhhhhh----CCcccccc--------hhhhhhcCHHHHHHHHHHHHHhhhHHHH Confidence 00111110 01133355554 233332 22111111 111122222221111 1122 Q ss_pred HHHHHHhh-ccc---------------------hHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhH Q lcl|NC_011269. 114 HWCRLFYA-THD---------------------LVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLP 170 (867) Q Consensus 114 ~~~~~~~~-~~~---------------------~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 (867) ++ +.||. .|+ +.+.++|.+..|=++. +.++++|+...++..+++= +-|+..++. T Consensus 62 ~l-~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~--~n~~~~~~~ 138 (511) T protein:vir:78 62 VL-SDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFND--LNDVESHNR 138 (511) T ss_pred HH-HHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCchHHHHHHHHHHh--hcChhHHHH Confidence 22 23453 444 4468899998887665 8888888888887766543 345555555 Q ss_pred HHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccc---cccccc Q lcl|NC_011269. 171 DQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGN---MSTVEE 247 (867) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~ 247 (867) + +++...+-|.++-+--.++.+.. +..+++|.-+-+-.+-...++.+-.+ ..-.... .++.. ....|- T Consensus 139 ~-~~~~~~~~G~a~~~vy~d~dg~~--~i~~~~p~~~~~v~dd~~~~~~~~~v-----r~~~~~~-~~~~~~~~~~~~~v 209 (511) T protein:vir:78 139 S-LGLDLSIYGKAYELMIRNQDDET--RLYKSDAMSTFIIYDNTVERNSIAGV-----RYLRTKP-IDKTDEDEVFTVDL 209 (511) T ss_pred H-HHHHHHhcCeeEEEEEeCCCCce--EEEEEcccceEEEEcCCCCCceEEEE-----EEEEeee-ccccccceEEEEEE Confidence 5 55777888887665555554332 34556665543321111111111100 0000000 00000 000000 Q ss_pred -cchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHH------------HHHhhhcCccccccCcchhhHHHHHHHHH Q lcl|NC_011269. 248 -TPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEAL------------ISRVVNRPTAWATRGAPHLLRSFRTLMAE 314 (867) Q Consensus 248 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (867) |+. +..+ | .-..+.++.+.... |+++.| ..+|.+.+=. ...|+ T Consensus 210 yt~~-~i~~----------~-----~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e~-v~~li-- 265 (511) T protein:vir:78 210 FTSH-GVYR----------Y-----LTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYEK-VITLI-- 265 (511) T ss_pred EeCC-cEEE----------E-----EecCCCcccccccccccccCcCcccceEEecC-----CCCCCCchhh-hHHHH-- Confidence 110 0000 0 00011111111111 122222 2356665443 23333 Q ss_pred HHHHHHHHHHHhhh--h-chhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHh-hhcchhhhhhhhheeeeeccccC-- Q lcl|NC_011269. 315 ESLNAAQDAVADRL--Y-SPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSL-LAADFRLMVHNFGLKVENVFGRE-- 388 (867) Q Consensus 315 ~~~~~~~~~~~~~~--~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-- 388 (867) +.|+.+..-.++-. + .|+++++ |- .+.. -++++..++ .. +..+-.++++..+...+.-+..+ T Consensus 266 Da~~~~~S~~~~~~~~~~~~~lv~~-G~----~~~~----~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~l 333 (511) T protein:vir:78 266 DLYDNAESDTANYMSDLNDAMLLIK-GN----LNLD----PVEVRKQKE---ANVLFLEPTVYVDAEGRETEGSVDGGYI 333 (511) T ss_pred HHHHHHHHHHHHHHHHhhcchhhee-cC----ccCC----chhhccccc---ccceeccccceeccccccCCCCcceeEE Confidence 34554444444322 2 2332222 21 1111 112222211 11 11112222222222222211111 Q ss_pred ----ccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHhhc Q lcl|NC_011269. 389 ----SVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREF----VTQIMTGFQNALKRHIRRRCEVVAEAQG 460 (867) Q Consensus 389 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~q~ 460 (867) ..=.++..++++++.|..--++-.. ..+..+++ .|.+.+.+ +-++....+..++..++++++.|.++-+ T Consensus 334 ~~~~~~~~~e~~~~~L~~~I~~~s~~P~~-~~~~~~~n--~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~ 410 (511) T protein:vir:78 334 YKQYDVQGTEAYKDRLNSDIHMFTNTPNM-KDDNFSGT--QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILK 410 (511) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCccc-cccccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1112235566666666655443321 11112222 23344433 2333455667778888888888877533 Q ss_pred ---ccc-----hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccc Q lcl|NC_011269. 461 ---HYD-----YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKT 532 (867) Q Consensus 461 ---~~d-----~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t 532 (867) ..+ .+++..|....+++..+.-.-+.++. -.+-. +..+..+ T Consensus 411 ~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~-----G~iS~---------------et~l~~l----------- 459 (511) T protein:vir:78 411 NTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG-----GKISQ---------------TTLMSLF----------- 459 (511) T ss_pred hcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh-----ccCCh---------------HHHHHhC----------- Confidence 211 23555555555555332211111111 00101 1111110 Q ss_pred cCCCcccccchhhhhhHHHHHHHHhhccc-ccccccccccccCCCC-Cccccccc Q lcl|NC_011269. 533 LAVNIDMKFDQELERQADETVQKLMATAQ-AMKKVQDLCDAQNLPY-PPELAQHL 585 (867) Q Consensus 533 ~p~tiqme~E~e~e~k~~E~l~tL~~tae-t~kkvq~~~p~~g~P~-pp~~aQ~p 585 (867) +...+ .+.+.++...|.-..+..... ........-....... .....+.- T Consensus 460 -~~v~d--~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 460 -SFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred -CCCCC--HHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 00000 011111111111111110000 0000000000000000 00000000 No 157 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=65.35 E-value=0.28 Score=23.57 Aligned_cols=352 Identities=13% Similarity=0.106 Sum_probs=116.9 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhh-c Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSK-F 136 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 136 (867) -|+- .++.+..+..... -+.+.++. |.+-.+..++.|-.||++-++ . T Consensus 1 M~~f---~k~~~~~~~~~~~---------~~~~~~~~--------------------~~~~~~~~~~~v~~~v~~ia~~i 48 (378) T protein:vir:85 1 MNLF---GKVVSFSRGKLNN---------DTQRVTAW--------------------QNEAVEYTSAFVTNIHNKIANEI 48 (378) T ss_pred Cchh---hhhhhhhhccccc---------CCcceeee--------------------eccchhhhhHHHHHHHHHHHHhH Confidence 0110 0110100000000 00111111 122233345566667776543 2 Q ss_pred cccccee---cccc----hh--HHH-HHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecC Q lcl|NC_011269. 137 PVVGMEF---DSKD----PL--IKT-FYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILN 203 (867) Q Consensus 137 ~~~~~~~---~~~~----~~--~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (867) .-+.|+. +.++ .. ++. -..+++- .+..+=.+|+..++ ..++.-|+++=+.-.....|.+.... T Consensus 49 A~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gnayi~~i~~~~~g~~~~~~--- 124 (378) T protein:vir:85 49 TKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSYKGEHNSMEFWQKVI-KKLLCTRYVDLYPIFDSETGELLDLL--- 124 (378) T ss_pred hhCceeEEEEeccccccccccccccchHHHHHhccCCCCCCHHHHHHHHH-HHHhhcCCeEEEEeecCCCceEEEEE--- Confidence 2222222 1111 10 111 1111111 12334446777766 66777787652211111111100000 Q ss_pred cceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccH Q lcl|NC_011269. 204 PDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISE 283 (867) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (867) + .+.+..... T Consensus 125 ---------------------------------------------------------------~-------~~~~~~~~~ 134 (378) T protein:vir:85 125 ---------------------------------------------------------------F-------ANDKKEYKP 134 (378) T ss_pred ---------------------------------------------------------------e-------cCCCEEEcc Confidence 0 001111111 Q ss_pred HHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhh--hchhhhhhhcccccCCCCcCCCCHHHHHHHHHH Q lcl|NC_011269. 284 ALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRL--YSPLVLATLGIEDMGDGEPWIPDQGELDEVRDD 361 (867) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (867) .-|.|+. .++...|.-..| ..|.++|+..+ .+|=-++|+.+ .-+.+..+++|+. T Consensus 135 ~dvih~~---~~~~~~~~~~~~------------~~a~~~~~~~~~~~~~~g~l~~~~---------~l~~~~~~~~~~~ 190 (378) T protein:vir:85 135 EELVRLV---SPFYINEDTSIL------------DNALASIQTKLEQGKLRGLLKINA---------FLDIDNTQEYREK 190 (378) T ss_pred cceEEEe---cCcCccchhhHH------------HHHHHHHHHHHhcCCcceEEEeCC---------cCCHHHHHHHHHH Confidence 1122332 123333322221 12222222221 12212223321 1123344555555 Q ss_pred HHHhh----hcc--hhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHH Q lcl|NC_011269. 362 MQSLL----AAD--FRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQ 435 (867) Q Consensus 362 ~~~~~----~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 435 (867) ++..+ .++ ..++|-.-|++++-+..+-+.+.+ ...++++++|.+++||...+++ | +|+. +-...|+.. T Consensus 191 ~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~-~---s~~e-~~~~~f~~~ 264 (378) T protein:vir:85 191 ALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIELIKSELLTGYFMNENILL-G---TATQ-EQQIYFYNS 264 (378) T ss_pred HHHHHHHhhcccccccceecCCCceEEeccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhc-C---CchH-HHHHHHHHH Confidence 44443 333 256777888888888766666665 4678999999999999999997 3 3333 233445555 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHh-hc-ccchheehhhccccchhhhhhhhhh--hhhHhhhhhhhhhhhhccccccccch Q lcl|NC_011269. 436 IMTGFQNALKRHIRRRCEVVAEA-QG-HYDYDLKGGVRVPIYREIVEYDEET--GQEYIRKVPKLLIPEIKFSTLNLRDE 511 (867) Q Consensus 436 ~~~~~~~~l~~~~r~~~~~i~e~-q~-~~d~~~~~~~~~~~~rd~~~~k~e~--~k~~~r~~~k~i~~~i~~~~~~Lr~e 511 (867) -+.-+..+|++.+.+.+=.-.|. ++ .....++..|++-.. ...+.|+++ ++-.+... + +.. .| T Consensus 265 tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l-~~~d~~~~~~~~~~~~~~G---~--------~T~-NE 331 (378) T protein:vir:85 265 TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLF-KFATLKELIDLYHENINGP---I--------FTQ-NQ 331 (378) T ss_pred HHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhh-hhcCHHHHHHHHHHHHhCC---C--------cCH-HH Confidence 56666666666666555211111 11 111111122221100 001122211 11111110 0 000 00 Q ss_pred hhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCC Q lcl|NC_011269. 512 AQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNL 575 (867) Q Consensus 512 ~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~ 575 (867) .=+...+..++. +..+--+..-. +++...+.+. .....++- -+..+. T Consensus 332 ~R~~lgl~p~~g-GD~~~~~~N~~----~~~~~~~~~~---------~~~~~~~~---~e~~n~ 378 (378) T protein:vir:85 332 LLVKMGEQPIEG-GDIYIANLNAV----AVKNLSDLQG---------SRKDVAST---DETNNQ 378 (378) T ss_pred HHHHhCCCCCCC-CCeEeeccccc----ccccchhhcC---------ccCCCCCC---CCCCCC Confidence 000000000000 00000000000 0000000000 00000000 000000 No 158 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=57.44 E-value=0.26 Score=23.77 Aligned_cols=432 Identities=10% Similarity=0.052 Sum_probs=150.7 Q ss_pred hhHHHHHHHH-hcccccccceeeccchhhhhhhhhHH-hhCCCchhhhHHHHHHHHHHHHHhhc---------------- Q lcl|NC_011269. 61 EANRQRLASY-RKQGNFGSNMQIAMPKIRQPLGTLAD-KGIPFNVEDEEELRVIRHWCRLFYAT---------------- 122 (867) Q Consensus 61 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~---------------- 122 (867) -+=++||... ||.+.-+.+. -|....+ ..|.-+- +++..|+.|.+.| .- T Consensus 1 m~~~~~~k~~~~~~~~~~~~~---------~~~~~~~~~~i~~~~---~~~~~i~~~~~~Y-~g~~~~~~~~~~~~~~~~ 67 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQ---------SLTNITDHPKIAISK---LEYDRITTNLKYY-KSDWDSVLYLNTDGETKK 67 (500) T ss_pred CchHHHHHHHHHHHHHHhhcc---------hhhhhhccccccCCH---HHHHHHHHHHHHh-cCCCCCcccccCCCCccc Confidence 0111222221 2222111111 1222221 2232222 3566799997744 21 Q ss_pred -----cchHHHHHHhhhhcccccc-eecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccce Q lcl|NC_011269. 123 -----HDLVPLLIDIYSKFPVVGM-EFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVW 196 (867) Q Consensus 123 -----~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 196 (867) -.|-+.+++-+..+=...+ .|+++|+...+|+++++= +-++...+.+.+ ..-+.+|++. |+.+-..+.+| T Consensus 68 ~~~~slnl~~~i~~~~A~lv~~e~~~i~~~d~~~~~~l~~il~--~n~f~~~~~~~~-e~a~a~G~~~-~k~~~d~~~~~ 143 (500) T protein:vir:30 68 RDLNHLPIARTAAKKIASLVFNEQAEIKVDDDAANEFISETLK--NDRFNKNFERYL-ESCLALGGLA-MRPYVDGDKVR 143 (500) T ss_pred CceeecchHHHHHHHHhhhhcCCcceEecCChHHHHHHHHHHh--hccHHHHHHHHH-HHHhhcCCEE-EEEEEeCCceE Confidence 1455677777777756664 789999999988887653 333334343322 3334455544 33332334443 Q ss_pred ehheecCcceeehhhhhhhcchHHHHHHH-HHHhhcccccccccccc----ccccccchh-hhhhhhhHH-----HHHHh Q lcl|NC_011269. 197 SSEEILNPDMLRVSRSMFVQRERVQLMVK-DLVDHLRQGPTTAGGNM----STVEETPSE-REQRMREFQ-----DLQRR 265 (867) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~----~~~~~~~~~-~~~~~~~~~-----~~~~~ 265 (867) ...+++|++-.-+ +--+.+++..|. +.+.-. .| +... ..-|-+.++ -.-+++-|. +|-+. T Consensus 144 --I~~v~ad~~~P~~--~d~~~~~~~a~~~~~~~~~-~~----~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~ 214 (500) T protein:vir:30 144 --VAFVQAPVFLPLQ--SNTQDVSSAAVVIKSVKTI-NG----KEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSR 214 (500) T ss_pred --EEEEcCCeeEEEE--EcCCCeEEEEEEEEEeeee-cC----CceEEEEEEEEEEeCCceeEEEEEEEecccccccCcc Confidence 3445666542100 111111110000 000000 00 0000 000000000 000111111 11111 Q ss_pred c-----hH-HHhhhccCCCCcccHHHHHH----hhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHh-------hh Q lcl|NC_011269. 266 Y-----PE-IIQAAMQNDGLDISEALISR----VVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVAD-------RL 328 (867) Q Consensus 266 ~-----~~-~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~ 328 (867) . +| +-+.+ .=.|+ +.=+.++ +.|...--++.|-+++=.+--+| +.|+.+-..++. |. T Consensus 215 v~l~~~~~~l~~~~-~~~~~--~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~li---d~lD~~~s~~~~e~~~g~~~i 288 (500) T protein:vir:30 215 VPLSEVYKDLKDEA-KVTDV--TRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTI---DFINTTYDEFMWEVKMGQRRV 288 (500) T ss_pred cccccccCCcCcce-EeccC--CCccEEEecCCccccccCCCccCCchhhhhHHHH---HHHHHHHHHHHHHHHhCccee Confidence 1 11 10000 00111 1122333 33444444666877766554322 223332222221 23 Q ss_pred hchhhhhhhcccccCCCCcCCCCHHHHHHHHH-HHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHh Q lcl|NC_011269. 329 YSPLVLATLGIEDMGDGEPWIPDQGELDEVRD-DMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQA 407 (867) Q Consensus 329 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 407 (867) +-|--+++... +-..|+...+---++++-.- -|.....+.-. +++.+ .+=++=.+-+-++.+-+.|..+ T Consensus 289 ~v~~~~l~~~~-~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~-------i~~~~--~~ir~e~~~~~l~~~l~~i~~~ 358 (500) T protein:vir:30 289 AVPESLTALTV-RTTDGDVVPRPRFESDQNVYIRMGGRDLDSSA-------IQDLT--TPIRADDYIKAINEGLSLFEMQ 358 (500) T ss_pred eechHHhcccC-CCCCccccCCcccCCCcceEEEcCCCCCcCcc-------eeEec--cccChHHHHHHHHHHHHHHHHH Confidence 33433333210 11233332111111111000 00000000000 11100 0000011224566777888888 Q ss_pred hccchhhhcCCCccc-eehhhhhHHHHHHHHH----HHHHHHHHHHhhhhHHHHHhhc---c------cchheehhhccc Q lcl|NC_011269. 408 WGIGEALISGGTGGA-YASSALNREFVTQIMT----GFQNALKRHIRRRCEVVAEAQG---H------YDYDLKGGVRVP 473 (867) Q Consensus 408 ~~~~~~~~~~g~~~~-~~~~~~~~~~~~~~~~----~~~~~l~~~~r~~~~~i~e~q~---~------~d~~~~~~~~~~ 473 (867) .|++..-++...++. =|+++.. .-|.++ .+++.++..|+++++-|.++-. + .++.|++-|... T Consensus 359 ~gls~~~~~~~~~g~~TAtei~s---~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~ 435 (500) T protein:vir:30 359 IGVSAGLFSFDGKSMKTATEIVS---ENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDG 435 (500) T ss_pred hCCCccccccCcCccccHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCC Confidence 998888776322222 1333322 223444 4555566666666666655422 1 233444444443 Q ss_pred cchhhhhhhhhhhhhHhhhhhhhhhhhhccccc-cccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhh Q lcl|NC_011269. 474 IYREIVEYDEETGQEYIRKVPKLLIPEIKFSTL-NLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELE 546 (867) Q Consensus 474 ~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~-~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e 546 (867) +..|- ++|+.....-+..-.+-.+...... .+ ++..++..+.+.+....+..+... ...+.--+ T Consensus 436 i~~d~---~~~~~~~~~~v~aGi~s~~~~i~~~~g~-~eeea~~~l~~i~~E~~~~~~~~~-----~~~~~~g~ 500 (500) T protein:vir:30 436 VFTDR---DAELDYWIKVVNAGFGTREMAIQKVLNV-TEEKAQEIAAEINTGIVDEINQQR-----TDTHLYGE 500 (500) T ss_pred CCCCH---HHHHHHHHHHHHcCCCCHHHHHHhcCCC-CHHHHHHHHHHHHHhccccCCCCC-----ccccccCC Confidence 33332 2222111111111111111211111 12 233344445555444332221110 00110000 No 159 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=57.44 E-value=0.26 Score=23.77 Aligned_cols=432 Identities=10% Similarity=0.052 Sum_probs=150.7 Q ss_pred hhHHHHHHHH-hcccccccceeeccchhhhhhhhhHH-hhCCCchhhhHHHHHHHHHHHHHhhc---------------- Q lcl|NC_011269. 61 EANRQRLASY-RKQGNFGSNMQIAMPKIRQPLGTLAD-KGIPFNVEDEEELRVIRHWCRLFYAT---------------- 122 (867) Q Consensus 61 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~---------------- 122 (867) -+=++||... ||.+.-+.+. -|....+ ..|.-+- +++..|+.|.+.| .- T Consensus 1 m~~~~~~k~~~~~~~~~~~~~---------~~~~~~~~~~i~~~~---~~~~~i~~~~~~Y-~g~~~~~~~~~~~~~~~~ 67 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQ---------SLTNITDHPKIAISK---LEYDRITTNLKYY-KSDWDSVLYLNTDGETKK 67 (500) T ss_pred CchHHHHHHHHHHHHHHhhcc---------hhhhhhccccccCCH---HHHHHHHHHHHHh-cCCCCCcccccCCCCccc Confidence 0111222221 2222111111 1222221 2232222 3566799997744 21 Q ss_pred -----cchHHHHHHhhhhcccccc-eecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccce Q lcl|NC_011269. 123 -----HDLVPLLIDIYSKFPVVGM-EFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVW 196 (867) Q Consensus 123 -----~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 196 (867) -.|-+.+++-+..+=...+ .|+++|+...+|+++++= +-++...+.+.+ ..-+.+|++. |+.+-..+.+| T Consensus 68 ~~~~slnl~~~i~~~~A~lv~~e~~~i~~~d~~~~~~l~~il~--~n~f~~~~~~~~-e~a~a~G~~~-~k~~~d~~~~~ 143 (500) T protein:vir:98 68 RDLNHLPIARTAAKKIASLVFNEQAEIKVDDDAANEFISETLK--NDRFNKNFERYL-ESCLALGGLA-MRPYVDGDKVR 143 (500) T ss_pred CceeecchHHHHHHHHhhhhcCCcceEecCChHHHHHHHHHHh--hccHHHHHHHHH-HHHhhcCCEE-EEEEEeCCceE Confidence 1455677777777756664 789999999988887653 333334343322 3334455544 33332334443 Q ss_pred ehheecCcceeehhhhhhhcchHHHHHHH-HHHhhcccccccccccc----ccccccchh-hhhhhhhHH-----HHHHh Q lcl|NC_011269. 197 SSEEILNPDMLRVSRSMFVQRERVQLMVK-DLVDHLRQGPTTAGGNM----STVEETPSE-REQRMREFQ-----DLQRR 265 (867) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~----~~~~~~~~~-~~~~~~~~~-----~~~~~ 265 (867) ...+++|++-.-+ +--+.+++..|. +.+.-. .| +... ..-|-+.++ -.-+++-|. +|-+. T Consensus 144 --I~~v~ad~~~P~~--~d~~~~~~~a~~~~~~~~~-~~----~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~ 214 (500) T protein:vir:98 144 --VAFVQAPVFLPLQ--SNTQDVSSAAVVIKSVKTI-NG----KEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSR 214 (500) T ss_pred --EEEEcCCeeEEEE--EcCCCeEEEEEEEEEeeee-cC----CceEEEEEEEEEEeCCceeEEEEEEEecccccccCcc Confidence 3445666542100 111111110000 000000 00 0000 000000000 000111111 11111 Q ss_pred c-----hH-HHhhhccCCCCcccHHHHHH----hhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHh-------hh Q lcl|NC_011269. 266 Y-----PE-IIQAAMQNDGLDISEALISR----VVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVAD-------RL 328 (867) Q Consensus 266 ~-----~~-~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~ 328 (867) . +| +-+.+ .=.|+ +.=+.++ +.|...--++.|-+++=.+--+| +.|+.+-..++. |. T Consensus 215 v~l~~~~~~l~~~~-~~~~~--~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~li---d~lD~~~s~~~~e~~~g~~~i 288 (500) T protein:vir:98 215 VPLSEVYKDLKDEA-KVTDV--TRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTI---DFINTTYDEFMWEVKMGQRRV 288 (500) T ss_pred cccccccCCcCcce-EeccC--CCccEEEecCCccccccCCCccCCchhhhhHHHH---HHHHHHHHHHHHHHHhCccee Confidence 1 11 10000 00111 1122333 33444444666877766554322 223332222221 23 Q ss_pred hchhhhhhhcccccCCCCcCCCCHHHHHHHHH-HHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHh Q lcl|NC_011269. 329 YSPLVLATLGIEDMGDGEPWIPDQGELDEVRD-DMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQA 407 (867) Q Consensus 329 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 407 (867) +-|--+++... +-..|+...+---++++-.- -|.....+.-. +++.+ .+=++=.+-+-++.+-+.|..+ T Consensus 289 ~v~~~~l~~~~-~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~-------i~~~~--~~ir~e~~~~~l~~~l~~i~~~ 358 (500) T protein:vir:98 289 AVPESLTALTV-RTTDGDVVPRPRFESDQNVYIRMGGRDLDSSA-------IQDLT--TPIRADDYIKAINEGLSLFEMQ 358 (500) T ss_pred eechHHhcccC-CCCCccccCCcccCCCcceEEEcCCCCCcCcc-------eeEec--cccChHHHHHHHHHHHHHHHHH Confidence 33433333210 11233332111111111000 00000000000 11100 0000011224566777888888 Q ss_pred hccchhhhcCCCccc-eehhhhhHHHHHHHHH----HHHHHHHHHHhhhhHHHHHhhc---c------cchheehhhccc Q lcl|NC_011269. 408 WGIGEALISGGTGGA-YASSALNREFVTQIMT----GFQNALKRHIRRRCEVVAEAQG---H------YDYDLKGGVRVP 473 (867) Q Consensus 408 ~~~~~~~~~~g~~~~-~~~~~~~~~~~~~~~~----~~~~~l~~~~r~~~~~i~e~q~---~------~d~~~~~~~~~~ 473 (867) .|++..-++...++. =|+++.. .-|.++ .+++.++..|+++++-|.++-. + .++.|++-|... T Consensus 359 ~gls~~~~~~~~~g~~TAtei~s---~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~ 435 (500) T protein:vir:98 359 IGVSAGLFSFDGKSMKTATEIVS---ENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDG 435 (500) T ss_pred hCCCccccccCcCccccHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCC Confidence 998888776322222 1333322 223444 4555566666666666655422 1 233444444443 Q ss_pred cchhhhhhhhhhhhhHhhhhhhhhhhhhccccc-cccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhh Q lcl|NC_011269. 474 IYREIVEYDEETGQEYIRKVPKLLIPEIKFSTL-NLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELE 546 (867) Q Consensus 474 ~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~-~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e 546 (867) +..|- ++|+.....-+..-.+-.+...... .+ ++..++..+.+.+....+..+... ...+.--+ T Consensus 436 i~~d~---~~~~~~~~~~v~aGi~s~~~~i~~~~g~-~eeea~~~l~~i~~E~~~~~~~~~-----~~~~~~g~ 500 (500) T protein:vir:98 436 VFTDR---DAELDYWIKVVNAGFGTREMAIQKVLNV-TEEKAQEIAAEINTGIVDEINQQR-----TDTHLYGE 500 (500) T ss_pred CCCCH---HHHHHHHHHHHHcCCCCHHHHHHhcCCC-CHHHHHHHHHHHHHhccccCCCCC-----ccccccCC Confidence 33332 2222111111111111111211111 12 233344445555444332221110 00110000 No 160 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=55.16 E-value=0.49 Score=22.30 Aligned_cols=431 Identities=12% Similarity=0.094 Sum_probs=154.9 Q ss_pred HHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhh------- Q lcl|NC_011269. 49 LIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYA------- 121 (867) Q Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 121 (867) |+.-| .|-..-+.+.+.|..+..... ..-....+.|-... ..++.++.|-+ ||. T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~--------------~~~~~i~~~i~~~~---~~~~~~~~~~~-Yy~g~~~i~~ 61 (474) T protein:vir:95 1 MFNII-RMPWDKPYGEEVVEQLKPQFE--------------TQEEMIIRLIDDHR---KQLDKITVGQR-YYDKDNDIVK 61 (474) T ss_pred Cccee-ecCCCCchhhHHHHhhhhccC--------------ChHHHHHHHHHHHH---HHHHHHHHHHH-HhcccCchhc Confidence 22211 122234444444444433320 00011111121111 12223334422 332 Q ss_pred --------------------ccchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhh Q lcl|NC_011269. 122 --------------------THDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTV 180 (867) Q Consensus 122 --------------------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (867) .+++.+.++|.+..|=++. +.++++|+...++..++ |.. |+...+.+ +++..... T Consensus 62 r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~-~~n--~~~~~~~e-~~~~~~~~ 137 (474) T protein:vir:95 62 QMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDESVLKIIHDV-LDT--RWDNKLID-ILTATSNK 137 (474) T ss_pred cccccccccccccccccceeccchHHHHHHHHHhhhccCCceeccCchHHHHHHHHH-Hhc--cHHHHHHH-HHHHHhhc Confidence 2577788899999888777 89999999988888776 433 44455555 55888888 Q ss_pred hhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHH Q lcl|NC_011269. 181 GEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQ 260 (867) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 260 (867) |.++-+--.++.+.. +..+++|+.+-.- |..+..-++ +-.++.= ...+.. ...--|+.+-.+ |. T Consensus 138 G~~~~~v~~d~~~~~--~i~~~~p~~~~~v---~d~~~~~~~-----~~~i~~~-~~~~~~-~~~~y~~~~~~~----~~ 201 (474) T protein:vir:95 138 GIDWLQVYINENGEM--KLFRVPAEQAIPI---WVDKEREEL-----KSFIRYY-KFNNEE-KVEFWTDTTVTY----YV 201 (474) T ss_pred CcEEEEEEecCCCce--EEEEEcccceEEE---EcCCCCCce-----EEEEEEE-EEcCee-EEEEEeCCeEEE----EE Confidence 887665544554332 3566777655322 211111111 1111100 000000 000001110000 00 Q ss_pred HHHHhc-hHHHhhhc--cC--CCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHH---hhhhchh Q lcl|NC_011269. 261 DLQRRY-PEIIQAAM--QN--DGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA---DRLYSPL 332 (867) Q Consensus 261 ~~~~~~-~~~~~~~~--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~ 332 (867) .....+ ++...... .. .-..+..==|.+++| .+.|.+.+-+ .+.|+ +.|+.+..-.+ +.+..|. T Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~e~-v~~li--Da~d~~~S~~~~~~~~~~~p~ 273 (474) T protein:vir:95 202 LENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWM-YKSLI--DAIDKRLSDAQNMFDESVELI 273 (474) T ss_pred EcCCccccccccCcccccccccccCCCccceEeecC-----CCCCCCcHHH-HHHHH--HHHHHHHHHHHHHHHHhcCce Confidence 000000 00000000 00 000000000222222 2446665533 23333 23333322221 3455666 Q ss_pred hhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccch Q lcl|NC_011269. 333 VLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGE 412 (867) Q Consensus 333 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 412 (867) ++++ | .+|+ +.+..+++++. ++++..+=+-.++++=..-..=.+...++.+++.|...-++-. T Consensus 274 lv~~-g----~~~~-------~~~~~~~~~~~-----~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 336 (474) T protein:vir:95 274 YILK-G----YEGQ-------DLEEFMRGLKY-----YKAINVDGDGGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVD 336 (474) T ss_pred eeee-c----CCcc-------cchhhhhhhhc-----cceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc Confidence 5554 3 1222 12233333322 2222111111111111111111122566777777766554432 Q ss_pred hhhcCC-CccceehhhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHhhccc--chheehhhccccchhhhhhhhhh Q lcl|NC_011269. 413 ALISGG-TGGAYASSALNREF----VTQIMTGFQNALKRHIRRRCEVVAEAQGHY--DYDLKGGVRVPIYREIVEYDEET 485 (867) Q Consensus 413 ~~~~~g-~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~--d~~~~~~~~~~~~rd~~~~k~e~ 485 (867) ++.+ -+++ .+.+.+.+ +-++-...+..++..++++++.|.++.+.- ..++...|+-..+.+. .++ T Consensus 337 --~~~~~~~~n--~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~v~f~~~~p~d~----~e~ 408 (474) T protein:vir:95 337 --FQTDKFGSA--PSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLKMDVKDIEISFNFNRMMND----AEQ 408 (474) T ss_pred --ccccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCCCcCH----HHH Confidence 2211 1122 23333433 223345567778888899999998876531 1223333333333332 122 Q ss_pred hhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhccccccc Q lcl|NC_011269. 486 GQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKK 565 (867) Q Consensus 486 ~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kk 565 (867) ++... +. -.+ +.+..+.. ++...+. +.+.++...|+.+....... ... T Consensus 409 a~~~~-~~-g~i---------------S~et~i~~------------l~~v~d~--~~E~~ri~~E~~~~~~~~~~-~~~ 456 (474) T protein:vir:95 409 SQIIA-QS-QYL---------------SRETLVKS------------SPLVDDY--KAELERIEQEQMEYNKQLPN-LDD 456 (474) T ss_pred HHHHH-hc-CCC---------------chHHHHHh------------CCCCCCH--HHHHHHHHHHHHHHHhcccc-ccc Confidence 21100 00 000 11111111 1111111 11222222222211111110 011 Q ss_pred ccccccccCCCCCccccc Q lcl|NC_011269. 566 VQDLCDAQNLPYPPELAQ 583 (867) Q Consensus 566 vq~~~p~~g~P~pp~~aQ 583 (867) .................. T Consensus 457 ~~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 457 GGADGAQQQERSNDKESE 474 (474) T ss_pred ccCCCCcCCCCCccCCCC Confidence 000000001011000011 No 161 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=53.32 E-value=0.53 Score=22.09 Aligned_cols=471 Identities=12% Similarity=0.095 Sum_probs=143.9 Q ss_pred CCCcccccccchhHHHHHHH-HhcCCCCCCchhhHHhhhhhcccCCchH--HHHHHHHhhhcchhHHHHHHHHhcccccc Q lcl|NC_011269. 1 MSSPIYKAGSNWSAEVNRLR-KAGVNMPNSPTMARAQAAALQNTVDNKP--LIDYFQGRRRAAEANRQRLASYRKQGNFG 77 (867) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (867) |-.-.|.-+ +-...+-.+| -...| ..|-.-. ..+...++. +.......+.-....++++..|-.- T Consensus 2 ~~~~~~~~~-~~~~~~~~~~~~~~~~------~~~~~~~-~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g---- 69 (502) T protein:vir:48 2 MEQTLFTDS-TGQDLVLNLRFHRESR------IRYRADN-LEELMVNNWELLKNFINHHKLRQAPRIQELLDYARG---- 69 (502) T ss_pred ceeEEEEec-chhHHHhhcccChhHH------hhhcccc-hhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcC---- Confidence 111111111 1111110000 00000 0111100 111111110 1111111111111122222222110 Q ss_pred cceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccc-ceecccchh----HHH Q lcl|NC_011269. 78 SNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVG-MEFDSKDPL----IKT 152 (867) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~----~~~ 152 (867) .|-.| +.. ...++ . +....-..+.+.+.++|.+.-|=++. +.++..|+. +.+ T Consensus 70 ~~~~i-----~~~----------~~~~~--~------~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~~~~ 126 (502) T protein:vir:48 70 ENHDV-----LKS----------GRRKD--N------EMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNEDNSQNDD 126 (502) T ss_pred CCccc-----ccc----------ccccc--c------ccccceeecchHHHHHHHHhhhhcccCeeEecCCccchhHHHH Confidence 11111 100 00000 0 00011244677788999999988776 777776543 445 Q ss_pred HHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhcc Q lcl|NC_011269. 153 FYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLR 232 (867) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (867) ++.+. | .+-|+-.++.+ +.+.....|.++=+--.+|.+.. ...+++|..+-+- |. +.+. ++++-++| T Consensus 127 ~l~~~-~-~~N~~~~~~~~-~~~~~~~~G~a~~~v~~dedg~~--~i~~~~p~~~~~v---yd--d~~~---~~~~~~ir 193 (502) T protein:vir:48 127 AIKRI-G-RINDIDTHNRN-LIRDLSQTGRAYEVIYRSEYDET--RIKRLSPLETFVI---YD--NSLE---DNSIAAVR 193 (502) T ss_pred HHHHH-H-hhcCHhHHHHH-HHHHHhhcCeEEEEEEeCCCCce--EEEEEcccceEEE---Ec--CCCC---CceEEEEE Confidence 55443 3 34455666666 55888888887755555665543 2456677655332 11 1110 11111121 Q ss_pred cccccccc-ccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHH Q lcl|NC_011269. 233 QGPTTAGG-NMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTL 311 (867) Q Consensus 233 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (867) -=...... ...-+|---..+..+... ...+.++....-.=..|+ |++..| ...|.+.+-. ...| T Consensus 194 ~~~~~~~~~~~~~~~iyt~~~i~~~~~----~~~~~~~~~~~~~~g~vP-----vv~~~n-----n~~g~sd~e~-v~~l 258 (502) T protein:vir:48 194 YYNRGTLQNAKDVVEIYTNQHIYTLDA----SDSFNEISVTPHAFGTVP-----ITEFLN-----NADGIGDYET-ELYL 258 (502) T ss_pred EEEEeecCCcEEEEEEEeCCeEEEEEe----CCceeeccceecCCCccc-----eEEecC-----CCCCCCchhh-hHHH Confidence 11100000 001111100011111100 001111111000000111 222222 2356665543 3333 Q ss_pred HHHHHHHHHHHHHH---hhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchh--hhhhhhheeeeeccc Q lcl|NC_011269. 312 MAEESLNAAQDAVA---DRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFR--LMVHNFGLKVENVFG 386 (867) Q Consensus 312 ~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ 386 (867) + +.|+.+..-++ +.+..|.++++ |.....++ +....+++ .. .+..+.. .--..-+.+++++=- T Consensus 259 i--Da~d~~~S~~~~~~~~~~~~~lv~~-g~~~~~~~-------~~~~~~~~-~~-~~~~~~~~~~~~~~~~~d~~~l~~ 326 (502) T protein:vir:48 259 I--DLYDSAESDTANHMSDMADAILAIY-GDLALPQG-------MQASDMKR-TR-LMQLKPPKSADGKEGTVKAEYLTK 326 (502) T ss_pred H--HHHHHHHHHHHHHHHHhcCceeeee-cCcccccc-------cchhhhhh-cc-eeeccccccccccccCcceeEeee Confidence 3 23333333222 23455555543 31111111 11111221 00 0100000 000000111211110 Q ss_pred cCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHH----HHHHHHHHHHHHHHHHhhhhHHHHHhhc-- Q lcl|NC_011269. 387 RESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREF----VTQIMTGFQNALKRHIRRRCEVVAEAQG-- 460 (867) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~r~~~~~i~e~q~-- 460 (867) .-..=.+...++++++.|..-=++... ..+..+++ .|.+.+.+ +-|+....+..++..++++++-|..+-+ T Consensus 327 ~~~~~~~~~~~~~L~~~I~~~s~~p~~-~~~~~~~n--~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~ 403 (502) T protein:vir:48 327 SYDVSGAEAYKTRLNKDIHVFTNTPDM-SDNHFSGN--ASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLV 403 (502) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCCCc-CccccccC--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 000111223455555555544333321 11122222 23333432 2344566777888888888888876532 Q ss_pred -cc-c---hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCC Q lcl|NC_011269. 461 -HY-D---YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAV 535 (867) Q Consensus 461 -~~-d---~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~ 535 (867) .. + ..++..|+...+++..+.-.-+.|+. -.+-. +..+..+ +. T Consensus 404 ~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~-----g~iS~---------------et~l~~l------------~~ 451 (502) T protein:vir:48 404 NEFKDFDESRLKITFTPNLPKSLYEQVSILNDLG-----GQVSQ---------------ETALSLS------------GL 451 (502) T ss_pred ccccccccccceEEeCCCCCcCHHHHHHHHHHHh-----ccCcH---------------HHHHHhC------------CC Confidence 11 1 12444454444444322111111111 00101 1111111 00 Q ss_pred CcccccchhhhhhHHHHHH-HHh----hcccccccccccccccCCCCCcccccccc Q lcl|NC_011269. 536 NIDMKFDQELERQADETVQ-KLM----ATAQAMKKVQDLCDAQNLPYPPELAQHLQ 586 (867) Q Consensus 536 tiqme~E~e~e~k~~E~l~-tL~----~taet~kkvq~~~p~~g~P~pp~~aQ~p~ 586 (867) .-+. +.|.++...|... ... ..........+...... ..+....+. T Consensus 452 v~D~--~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~---~~~~~~~~~ 502 (502) T protein:vir:48 452 VENP--TEELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKETH---TDDFERVYE 502 (502) T ss_pred CCCH--HHHHHHHHHHHHhhhhhcccccccccccccCCCccCCC---CcCcCCCCC Confidence 0000 1111111111110 000 00000011111000000 000000111 No 162 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=51.58 E-value=0.34 Score=23.17 Aligned_cols=115 Identities=17% Similarity=0.144 Sum_probs=41.3 Q ss_pred ccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHH Q lcl|NC_011269. 274 MQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQG 353 (867) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 353 (867) |-.+-..|..++- ++.... ++.|.+.-..+ ++.++ +|+. -+--.+|+||-|-.. T Consensus 1 ~i~~~~~i~~~l~-~l~~~~-------~~~l~~i~~~~-----~~~~~----~rf~---------~~~~p~G~~W~pLs~ 54 (145) T protein:vir:31 1 MVEDENNIPEARE-AIQDGL-------TDGLERLHTIT-----LRELI----TNMS---------DGQDALGNPWEPLKE 54 (145) T ss_pred CcccHHHHHHHHH-HHHHHH-------HHHHHHHHHHH-----HHHHH----HHHH---------hcCCCCCCCCcccCh Confidence 3333323332221 111100 11111111110 11111 2211 112257889987211 Q ss_pred ----------HHH---HHHHHHHHhhhc------------chhhhhhhhheeeeeccccCccCch-----hHHHHHHHHH Q lcl|NC_011269. 354 ----------ELD---EVRDDMQSLLAA------------DFRLMVHNFGLKVENVFGRESVPNL-----DADYDRIERK 403 (867) Q Consensus 354 ----------~~~---~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 403 (867) -|- .+|+.|++-+.. ..|-.|||||-+--+.=-| -.|-+ ++++++|=++ T Consensus 55 st~a~k~~~~~L~~tG~L~~Si~~~~~~~~~~~~a~vGtn~~YA~~hqfG~~~~~IPaR-PfLG~~~~~~~~~~~~ii~~ 133 (145) T protein:vir:31 55 STIRAKGSDTPLIDNSRLLTDINAASMMDRANRMAVIGTNLDYAEHHEFGAPEAGIPAR-PIFGPAGAYASQQAPDVIGD 133 (145) T ss_pred HHHHHhcCCCCCccCHHHHHHHHHHhhhcccCceeEecCCchhhhhhccCCcccccCCC-CccCCCccchHHHHHHHHHH Confidence 121 355556544321 2688899999642111011 11222 2445553333 Q ss_pred HHHhhccchhhhc Q lcl|NC_011269. 404 LLQAWGIGEALIS 416 (867) Q Consensus 404 ~~~~~~~~~~~~~ 416 (867) ++..| |.-++|- T Consensus 134 ~i~~~-L~~~~~~ 145 (145) T protein:vir:31 134 EIDTN-LEGAVID 145 (145) T ss_pred HHHHH-hhhhccC Confidence 33333 2333333 No 163 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=51.16 E-value=0.48 Score=22.33 Aligned_cols=119 Identities=18% Similarity=0.233 Sum_probs=51.1 Q ss_pred cCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCH-- Q lcl|NC_011269. 275 QNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQ-- 352 (867) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 352 (867) |.+.+.-=++.+.++..+-++-..+ -|||.. .+.-++..++.|..- .. -||+||-|.. T Consensus 1 m~~~~~~l~~~L~~ll~~L~~~~~~---~l~r~I----g~~l~~~t~~Rf~~q---------~~----PdG~~W~p~~~~ 60 (156) T protein:vir:11 1 MADSLEALEDWAGPILRALEPGPRA---ALARSL----ARDLRRSQQKRVMAQ---------RN----PDGSAYEPRKKR 60 (156) T ss_pred CchhHHHHHHHHHHHHHhcCCcchH---HHHHHH----HHHHHHHHHHHHHhh---------cC----CCCCCCcccchH Confidence 3333322233334444433221111 133322 222233333333221 11 4788888754 Q ss_pred -------------HHHHHHHHH--HHHhhhcc-----------hhhhhhhhheeeeeccccCcc--------Cchh-HHH Q lcl|NC_011269. 353 -------------GELDEVRDD--MQSLLAAD-----------FRLMVHNFGLKVENVFGRESV--------PNLD-ADY 397 (867) Q Consensus 353 -------------~~~~~~~~~--~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~--------~~~~-~~~ 397 (867) .++..+|+- +++-..+| .|--|||||.+. .+...+.. |-|. ++. T Consensus 61 ~~~~~~~~~~~~~~m~~~l~~~~~l~~~~~~~~a~vg~~Gs~~~yA~iHQfG~~~-~~~~~~~~v~iPaRp~LG~s~~d~ 139 (156) T protein:vir:11 61 ELRGKQGRIRRKIKMFQKLRTVRYLRAKGDAQAITVSFAGRIARIARVHQYGLRD-RAEPGAPEVSYAQRLLLGFDSSDM 139 (156) T ss_pred HHhhhccccccchhhhhhhhhhheeeeeecCcEEEEEecCCchhhhhhhcccccc-cccCCCCcccccccccCCCCHHHH Confidence 233333331 11101111 356789999763 33333332 2333 677 Q ss_pred HHHHHHHHHhhccchhhhcCCCccc Q lcl|NC_011269. 398 DRIERKLLQAWGIGEALISGGTGGA 422 (867) Q Consensus 398 ~~~~~~~~~~~~~~~~~~~~g~~~~ 422 (867) ++|..-|+..| + +..|. T Consensus 140 ~~i~~~i~~~l-------~-~~~~~ 156 (156) T protein:vir:11 140 ETIQNGILAHI-------D-ANSPI 156 (156) T ss_pred HHHHHHHHHHH-------h-hcCCC Confidence 77877777666 3 44444 No 164 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=50.55 E-value=0.57 Score=21.90 Aligned_cols=115 Identities=24% Similarity=0.312 Sum_probs=46.2 Q ss_pred hhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhh Q lcl|NC_011269. 256 MREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLA 335 (867) Q Consensus 256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (867) |-|+++|.+.. .++..+-++=..+ -|||. |.+.-++..++.|.. T Consensus 1 m~~~~~l~~~L--------------------~~ll~~l~~~~~~---~l~r~----Ig~~l~~st~~Rf~~--------- 44 (148) T protein:vir:79 1 MSESRELEAWL--------------------AGMLTKLDAPARR---MLARA----VAAELRRRQAARIAE--------- 44 (148) T ss_pred CccHHHHHHHH--------------------HHHHHhcCChhHH---HHHHH----HHHHHHHHHHHHHHh--------- Confidence 33443333222 2222221110000 02221 222222223332221 Q ss_pred hhcccccCCCCcCCCCH------------HHHHHHHH--HHHHhhhcc-----------hhhhhhhhheeeeeccccCc- Q lcl|NC_011269. 336 TLGIEDMGDGEPWIPDQ------------GELDEVRD--DMQSLLAAD-----------FRLMVHNFGLKVENVFGRES- 389 (867) Q Consensus 336 ~~~~~~~~~~~~~~~~~------------~~~~~~~~--~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~- 389 (867) =.. -||+||-|-. .+++.+|. .|++-..+| .|--|||||.+..-.+..-+ T Consensus 45 -q~~---PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~~~~l~~~~~~~~~~v~~~Gt~~~yAaiHQfG~~~r~~~~~~~v 120 (148) T protein:vir:79 45 -QRN---PDGSPYVPRKPQLRHRAGRIRRAMFMRLRLARYMKTQADANTAVVTFAGNAQRIATVHQFGLRDRVNKAGLTA 120 (148) T ss_pred -hcC---CCCCcCcccchHHHhhcccccccccchhhhhhheeeeeeCCeeeEEeeccchhhhhhhhcCccccccCCCCcc Confidence 111 4788887732 12333332 233332333 35568999976542222111 Q ss_pred ------cCchh-HHHHHHHHHHHHhhccchhhhcCC Q lcl|NC_011269. 390 ------VPNLD-ADYDRIERKLLQAWGIGEALISGG 418 (867) Q Consensus 390 ------~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g 418 (867) .|-|. ++.++|..-|+..| + | T Consensus 121 ~iPaRp~LG~s~~d~~~i~~~i~~~l-------~-~ 148 (148) T protein:vir:79 121 QYPARELLGMDGVDMEHITNLLLLHL-------G-A 148 (148) T ss_pred ccCcccccCCCHHHHHHHHHHHHHHh-------c-C Confidence 12233 67777887777766 3 2 No 165 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=45.46 E-value=0.77 Score=21.21 Aligned_cols=438 Identities=9% Similarity=0.011 Sum_probs=132.3 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHH-HHHHHHHHHhhc-------cchHHHH Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELR-VIRHWCRLFYAT-------HDLVPLL 129 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-------~~~~~~~ 129 (867) -|.-.|+.+.-+---.|.--+ -...+-..+.++. .-|.++-.-.. ..+-||+-++.+ .+|-..+ T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~-~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i 72 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNG-SEPELIPKYLPLV-------PDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEI 72 (518) T ss_pred CcchhhHHHHHHHhhcCCCCc-cchhccHHHhhhc-------ccchhhhhhhhhhhhhcccCCCCccccccccCChHHHH Confidence 333333333111000000000 0011111111111 11111100000 012344433322 2233445 Q ss_pred HHhhhhccccc-ceecc------cchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheec Q lcl|NC_011269. 130 IDIYSKFPVVG-MEFDS------KDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEIL 202 (867) Q Consensus 130 ~~~~~~~~~~~-~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (867) ++-+..+=... .+|++ +|+.+.+|+++++- +.++..-|.+.+ ..-+.+|++.==--++ ++.. +...+ T Consensus 73 ~~~~A~ll~~e~~~i~v~~~~~~d~e~~~~~l~~il~--~n~f~~~~~~~~-e~a~a~G~~~~k~~~d-~~~~--~i~~v 146 (518) T protein:vir:78 73 VVVAAEYISGKPLSIDVTGVNGSKDENLTKQLKEALR--IDNFDSKSVKIV-ELAGGSGVSAVKINIL-NGRP--SISVH 146 (518) T ss_pred HHHHHHhhcCCCceEEecCccccCcHHHHHHHHHHHH--hccHHHHHHHHH-HHhhccCceEEEEEEE-CCee--EEEEE Confidence 55555554444 23433 57778888887664 333333333322 2233445433100111 1122 13334 Q ss_pred CcceeehhhhhhhcchHHHHHHHHHHhhcccccccccccccccccc----c--------hhhhhhhhhHHH-H-----HH Q lcl|NC_011269. 203 NPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEET----P--------SEREQRMREFQD-L-----QR 264 (867) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~--------~~~~~~~~~~~~-~-----~~ 264 (867) ++|++-.- +.-+++++.+. ++.+..+ +....-|.-|- - ..-.=+++-|.+ + .. T Consensus 147 ~ad~~~P~---~~~g~~~~~~f---~~~~~~~---~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~ 217 (518) T protein:vir:78 147 SSSQFWID---FKNNEPFRFNF---FEEIPTS---NKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPIS 217 (518) T ss_pred cCCeeEEE---eecCcEEEEEE---EEEeecC---CcceeEEEEEeeccccccceeecccceeEEEEEeeecCccccccc Confidence 45444221 11111111000 0000000 00000000000 0 000000111110 0 00 Q ss_pred hchHH--HhhhccCCCCc--------ccHHHHHHhhhc--Cccc--cccCcchhhHHHHHHHHHHHHHHHHHHHHh---- Q lcl|NC_011269. 265 RYPEI--IQAAMQNDGLD--------ISEALISRVVNR--PTAW--ATRGAPHLLRSFRTLMAEESLNAAQDAVAD---- 326 (867) Q Consensus 265 ~~~~~--~~~~~~~~~~~--------~~~~~~~~~~~~--~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 326 (867) ..|+- +..+..-+++. ...=++.++.|. +..| +++|-+.+=.+-- ..+.|+.+-+.++. T Consensus 218 ~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~---~id~lD~~~s~~~~e~~~ 294 (518) T protein:vir:78 218 AERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTN---YLFAVDYFFTVYMREGEK 294 (518) T ss_pred ccccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhH---HHHHHHHHHHHHHHHHHh Confidence 11110 00000011111 011123333332 2333 4557777655442 23445555555544 Q ss_pred ---hhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeee--ccccC---cc----Cch- Q lcl|NC_011269. 327 ---RLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVEN--VFGRE---SV----PNL- 393 (867) Q Consensus 327 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~---~~----~~~- 393 (867) |.+-|--+++......+.+.+|..+.+ -+.|. +++... .+..- ++ +.+ T Consensus 295 g~~~i~v~~~~l~~~~~~~~~~~~~~fd~~--------------~~~y~-----~i~~~~~~~~~~~~~i~~~~~~Ir~e 355 (518) T protein:vir:78 295 TKTKIAASERMFRKKVNKSTDKEEWSMNVD--------------EDYFM-----QFKGTLDAGAKLNDMIQFMQGDFRDG 355 (518) T ss_pred CCceeeechhHhccCCCCCCCccccccCCC--------------CceEE-----EecCcCCCCCccccceeeeecccChH Confidence 555555555444322234444432221 01121 111110 00000 00 112 Q ss_pred --hHHHHHHHHHHHHhhccchhhhcCCCccceehhhhh-HHHHHHHHHHHHHHHHHHHhhhhHHHHHh-hccc------- Q lcl|NC_011269. 394 --DADYDRIERKLLQAWGIGEALISGGTGGAYASSALN-REFVTQIMTGFQNALKRHIRRRCEVVAEA-QGHY------- 462 (867) Q Consensus 394 --~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~r~~~~~i~e~-q~~~------- 462 (867) -+-++.+-++|+.+.|++...++-+.|-.=|+.+.. ..-+-|..=..++.|+..|+++++.|.+| +.++ T Consensus 356 ~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~ 435 (518) T protein:vir:78 356 SYRETMEYFAQKAVSKSGYNPATFNLGNREVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAI 435 (518) T ss_pred HHHHHHHHHHHHHHHhhCCChhhcCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccc Confidence 245777889999999999887753322211121111 11122333445556666666666666663 2221 Q ss_pred ---chheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhcccc--ccccchhhhhhhhhhhhhceeeeeccccCCCc Q lcl|NC_011269. 463 ---DYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFST--LNLRDEAQERAFIAQLKGMGVPVSDKTLAVNI 537 (867) Q Consensus 463 ---d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~--~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~ti 537 (867) +..+++-|...++.|-.+..+.+.+ -+..-.+-.+.+... ..+ ++..++..+..++..... T Consensus 436 ~~~~~~v~i~f~D~i~~D~~~~~~~~~~---~v~aGimS~e~~i~~~~~~~-~deea~~e~~ri~~E~~~---------- 501 (518) T protein:vir:78 436 MRDEIRVIIEFPDPMSVNLNELSSTLNN---MNSALAMSVEEKVKLIHPKW-EDEEIQAEVKRIYLENAI---------- 501 (518) T ss_pred CCCceeEEEEeCCCCCCCHHHHHHHHHH---HHhcCCCCHHHHHHHhCCCC-CHHHHHHHHHHHHHHhcc---------- Confidence 1123333333322221111110000 000000000000000 011 111111122222111111 Q ss_pred ccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 538 DMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 538 qme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) ... +...+..+...... T Consensus 502 ------------------------~~~--~~p~~~~g~~~~~g 518 (518) T protein:vir:78 502 ------------------------GEV--PDPEAIGGMETKGG 518 (518) T ss_pred ------------------------cCC--CCCccccCCCCCCC Confidence 000 00000111111111 No 166 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=43.27 E-value=0.85 Score=20.97 Aligned_cols=411 Identities=11% Similarity=0.097 Sum_probs=149.2 Q ss_pred HHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhc--------------------cchHH Q lcl|NC_011269. 68 ASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYAT--------------------HDLVP 127 (867) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------~~~~~ 127 (867) -|+-|.+ -|-+|+=..-......+.|=.... ++..++.+-| ||.. .++.+ T Consensus 1 ~~~~~~~------~~~~p~d~~~~~~~l~~~i~~~~~---~~~r~~~~~~-yy~g~~~i~~~~~~~~~~~~~ki~~n~~~ 70 (453) T protein:vir:39 1 MKYKPPK------LMTFPKDEPITNEVVTKFMEKHRL---EVARYEYLKN-MYRGIMAIDAEPTKDLWKPDNRLTVNFTK 70 (453) T ss_pred CeecCCc------ceEcCCCCCCCHHHHHHHHHHHHH---HHHHHHHHHH-HhhccCchhcCCCccccCccceeecchHH Confidence 2333333 223333221111111222111111 1222233222 4432 24677 Q ss_pred HHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcce Q lcl|NC_011269. 128 LLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDM 206 (867) Q Consensus 128 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (867) .++|.+.-|=++. +.++++|+...++..+.+= +-|+...+.+ +++.....|.++-+--.++.+.. +..+++|+. T Consensus 71 ~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~i~~--~N~~~~~~~~-~~~~~~~~G~~~~~v~~d~~g~~--~i~~~~p~~ 145 (453) T protein:vir:39 71 YIVDTFTGYFNGIPVKKSHSDKETLSKLQEFDN--LNDMEDEESE-LAKMACIYGRAFELLYQNEETQT--NVIYNTPEN 145 (453) T ss_pred HHHHHHhhhhcccCceeccCChHHHHHHHHHHH--hcChhHHHHH-HHHHHhhcCeEEEEEEecCCCce--EEEEEcccc Confidence 8899988887766 8899898888777766654 3455555666 66888999988766555554432 345667765 Q ss_pred eehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHH-- Q lcl|NC_011269. 207 LRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEA-- 284 (867) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 284 (867) +-+- | ++.+. ++++-++|.- ........+|---..|..+ -...+.++.+-+. T Consensus 146 ~~~v---~--d~~~~---~~~~~~ir~~--~~~~~~~~~~~yt~~~i~~----------------~~~~~~~~~~~~~~~ 199 (453) T protein:vir:39 146 MFMV---Y--DDTIK---QEPLFAVRYG--YDDDYKLYGEVYTKETTYA----------------LNGTMGFYNMTEQAP 199 (453) T ss_pred eEEE---e--cCCCC---CeEEEEEEEE--EeCCeEEEEEEEeCCeEEE----------------EEecCCceeeecccc Confidence 5332 1 11111 0011111110 0000000011000011111 1111111111110 Q ss_pred ------HHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHH-HHhhhhchhhhhhhcccccCCCCcCCCCHHHHHH Q lcl|NC_011269. 285 ------LISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDA-VADRLYSPLVLATLGIEDMGDGEPWIPDQGELDE 357 (867) Q Consensus 285 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 357 (867) -|+++.| .++|.+.+-+ -++|+-.-...-.+-+ ..+-+..|.++++-.. +++ +++.. T Consensus 200 ~~~g~vPvv~~~n-----~~~g~sd~e~-v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~---------~~~-~~~~~ 263 (453) T protein:vir:39 200 NPFDDLPVVEFYF-----NEERMSIFES-VISLVNAFNKAISEKANDVDYFSDQYLTFLGAA---------VEE-EDLKN 263 (453) T ss_pred cCCCceeEEEecC-----CCCCCcchhh-hHHHHHHHHHHHHHHHHHHHHhhCceeeeecCC---------CCc-hhhhh Confidence 1122222 3567776643 3444422222222222 2345667777765221 222 22333 Q ss_pred HHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHH----H Q lcl|NC_011269. 358 VRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREF----V 433 (867) Q Consensus 358 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~----~ 433 (867) ++.. ..+.++.-. .-.-+-+++++=-....=.+...++.+++.|...-+.-. +..++.|. .+.+.+.+ + T Consensus 264 ~~~~--~~~~~~~~~-~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~-~~~~~~gn---~Sg~Al~~~~~~l 336 (453) T protein:vir:39 264 IRSN--RVINYYGES-SEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN-ISDESFGS---SSGVSLAYKLQAM 336 (453) T ss_pred hhhc--ceeeecCCC-CCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc-cccccccC---ChHHHHHHHHHHH Confidence 3331 111111000 000011111110000011112445666665544332211 11112222 12233332 2 Q ss_pred HHHHHHHHHHHHHHHhhhhHHHHHhhcccc-----hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccc Q lcl|NC_011269. 434 TQIMTGFQNALKRHIRRRCEVVAEAQGHYD-----YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNL 508 (867) Q Consensus 434 ~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d-----~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~L 508 (867) -++.-..+..++..++++++.|.++..... .+++..|+...+++..+.-.=+.|+ .+ + T Consensus 337 ~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl-------------~g--~-- 399 (453) T protein:vir:39 337 SNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAWKDIEYTFTRNEPKDIKEQAETANIL-------------MG--I-- 399 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHH-------------hc--c-- Confidence 344566778888889999998887654221 1233334333433321111101111 00 0 Q ss_pred cchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHH---hhcccccccccccccccCCCCCcc Q lcl|NC_011269. 509 RDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKL---MATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 509 r~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL---~~taet~kkvq~~~p~~g~P~pp~ 580 (867) .+.+..+..+ +...+. +.+.++...|.-... ..........++..+ +...+ T Consensus 400 ---is~et~l~~l------------~~v~D~--~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~----~~~~e 453 (453) T protein:vir:39 400 ---TSQETALSVI------------SVIPDV--QAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVP----ETNEE 453 (453) T ss_pred ---CChHHHHHhC------------CCCCCH--HHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCC----CcCCC Confidence 0111112111 111000 111122222211111 111111111111111 11122 No 167 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=42.91 E-value=0.87 Score=20.93 Aligned_cols=437 Identities=11% Similarity=0.093 Sum_probs=135.3 Q ss_pred hcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHH-hhCCCchhhhHHHHHHHHHHHHHhhccchH---------- Q lcl|NC_011269. 58 RAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLAD-KGIPFNVEDEEELRVIRHWCRLFYATHDLV---------- 126 (867) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------- 126 (867) -|+-..+.+ -+||-.+-+. + ...|.+..+ +.|-- +-+++..|+.|++.|=-.|+.+ T Consensus 1 m~~~~~~k~--~~~~~~~~~~-~-------~~~~~~~~~~~~i~~---~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~ 67 (508) T protein:vir:15 1 MGLIQRIKD--LFWKGAAATG-V-------TGSLSKITDDPRISI---DPDEYVRIQTDLDYYSDKLQYIHYQASDGIKK 67 (508) T ss_pred CChHHHHHH--HHHHHHHHhc-c-------ccchHHhhccccccc---CHHHHHHHHHHHHHhcCCCcccccccCCCCcc Confidence 222111111 1122111110 0 012333332 22211 2345667999988776555543 Q ss_pred ----------HHHHHhhhhcccccc-eecccchh-HHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhcc Q lcl|NC_011269. 127 ----------PLLIDIYSKFPVVGM-EFDSKDPL-IKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLG 194 (867) Q Consensus 127 ----------~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 194 (867) ..+++-+..+=...+ +|+.+|+. .-+|.++++= +-++..-+.+.+ ..-+.+|++.= +-+-..++ T Consensus 68 ~~~~~sln~~~~i~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~--~n~f~~~~~~~~-e~a~a~G~~~~-k~~~d~~~ 143 (508) T protein:vir:15 68 KRLKNTINMAKTAARRIASVVFNEKAEIHVKDNNEADKFLNDVLE--DNDFKNKFEEAL-EKGVALGGFAM-RPYIDGNH 143 (508) T ss_pred ccceeecchHHHHHHHHHhhhhCCCceEEeCCchHHHHHHHHHHH--hccHHHHHHHHH-HHHhhcCceEE-EEEEeCCe Confidence 455555566655553 56654332 2233332221 111111111111 12223333221 11111122 Q ss_pred ceehheecCcceee-hhhhhhhcchHHHHHHHHHHhhcccccccccccccc-cccc---chh-hhhhhhhHHH-----HH Q lcl|NC_011269. 195 VWSSEEILNPDMLR-VSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMST-VEET---PSE-REQRMREFQD-----LQ 263 (867) Q Consensus 195 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~-~~~~~~~~~~-----~~ 263 (867) +| ...++||.+- +. |--+.+.+..+ +.-..++-. .++-.-| .|-. ... -.-+++-|.+ |- T Consensus 144 ~~--i~~v~ad~~~P~~---~d~~~~~~~af---~~~~~~~~~-~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG 214 (508) T protein:vir:15 144 IK--IAWVRADQFYPLQ---SNTNDISEAAI---ASRTQRTES-NQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVG 214 (508) T ss_pred eE--EEEEcCCeeEEEE---EcCCCeEEEEE---EEEEEeecC-CCceEEEEEEEEEEecCcceEEEEEEEecCCchhcC Confidence 22 3334555421 11 11111111000 000000000 0000000 0000 000 0001111111 00 Q ss_pred -----H---hchHHHhhhccCCCCcccHHHHHH----hhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHh----- Q lcl|NC_011269. 264 -----R---RYPEIIQAAMQNDGLDISEALISR----VVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVAD----- 326 (867) Q Consensus 264 -----~---~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 326 (867) . .|.++.+.+- -.|++-+ +..+ +.|....-++.|-+.+=.+--+ .+.|+.+-+.++. T Consensus 215 ~~v~l~~~~e~~~l~~~~~-~~g~~~p--~f~y~~~~~~N~~~~~splG~S~~~~~~~l---id~lD~~~s~~~~e~~~~ 288 (508) T protein:vir:15 215 NQVPLSTLPVYKELAPQVT-ISGLQRP--LFAYFKTPGANNINIESPLGLGVVDNAKHV---LDDINDTHDQFIWEIRLG 288 (508) T ss_pred cccchhhcccccCCCcceE-ecCCCcc--eeEEecCCccccccCCCCcCCchHhhhHHH---HHHHHHHHHHHHHHHHhc Confidence 0 0111111000 0112111 2222 3455555677888887655422 2344555554554 Q ss_pred --hhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccc-cCccC-------chhHH Q lcl|NC_011269. 327 --RLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFG-RESVP-------NLDAD 396 (867) Q Consensus 327 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-------~~~~~ 396 (867) |.+-|-.+++.+ ++|.+ +++.++ + .| + +++.+.-.+ -=+++ ++.+- T Consensus 289 ~~~i~v~~~~l~~d----~~~~~-~~~~~~-----~---------~~---~--~~~~~~~~~~~i~~~~~~ir~e~~~~~ 344 (508) T protein:vir:15 289 QKHIAVQPGMLRFD----DEHKP-TFDTEQ-----N---------VY---V--GVLSDDNNGLGVKDMTTPIRTVQYKDA 344 (508) T ss_pred ccceeechHHhcCC----CCCcc-ccCCCC-----e---------eE---E--eccCCCCCCCceeEeecccChHHHHHH Confidence 556666666655 45554 333210 0 11 0 111111110 00111 12356 Q ss_pred HHHHHHHHHHhhccchhhhcC-CCccceehhhhhHHHHHHHHHH----HHHHHHHHHhhhhHHHHHhhcccchheehh-h Q lcl|NC_011269. 397 YDRIERKLLQAWGIGEALISG-GTGGAYASSALNREFVTQIMTG----FQNALKRHIRRRCEVVAEAQGHYDYDLKGG-V 470 (867) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~~~----~~~~l~~~~r~~~~~i~e~q~~~d~~~~~~-~ 470 (867) ++.+-+.|..+.|++..-++- |.|..=|+.+.+ .-|.++. +++.++..|+++++-|-++-..+...-.+. + T Consensus 345 ~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s---~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~ 421 (508) T protein:vir:15 345 IDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVS---NNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPL 421 (508) T ss_pred HHHHHHHHHHHhCCCchhcccccCccccHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc Confidence 778888899999998776651 222222333332 2345555 555666666666666655422111000000 0 Q ss_pred ccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccc----cCCCcccccchhhh Q lcl|NC_011269. 471 RVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKT----LAVNIDMKFDQELE 546 (867) Q Consensus 471 ~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t----~p~tiqme~E~e~e 546 (867) ... +....+ ....|.|..-.+.+...+...+.++...+ .++..+ +..-...+++.+.+ T Consensus 422 ~~~---~~~~~~--------------~~v~v~f~D~i~~d~~~~~~~~~~~v~aG-i~s~e~~i~~~~g~~deea~~el~ 483 (508) T protein:vir:15 422 FTL---DSASQP--------------LDIECHFDDGVFVNKDKQLEEDAKVLAIG-ALSKQTFLQRNYGMTDEQAAEELA 483 (508) T ss_pred ccc---ccccCC--------------cceEEEeCCCCCCCHHHHHHHHHHHHhcC-CCCHHHHHHhcCCCChHHHHHHHH Confidence 000 000000 00011111111111111111122211111 011100 00000000111111 Q ss_pred hhHHHHHHHHhhcccccccccccccccCCCCCccccc Q lcl|NC_011269. 547 RQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQ 583 (867) Q Consensus 547 ~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ 583 (867) +...|. .+...... ...+....... T Consensus 484 ri~~E~--------~~~~~~~~----~~~~~~g~~ge 508 (508) T protein:vir:15 484 KIQSEA--------PTDTFEGG----RSAILNGGDGE 508 (508) T ss_pred HHHHhc--------cccCcccc----ccccCCCCCCC Confidence 111111 00000000 00000000000 No 168 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=39.71 E-value=1 Score=20.57 Aligned_cols=638 Identities=13% Similarity=0.123 Sum_probs=155.2 Q ss_pred CCCcccccccchhHHHHHHHHhcCCCCCCchhhHHhhhhhcccCCchH----HHHHHHHhhhcchhHHHHH--------- Q lcl|NC_011269. 1 MSSPIYKAGSNWSAEVNRLRKAGVNMPNSPTMARAQAAALQNTVDNKP----LIDYFQGRRRAAEANRQRL--------- 67 (867) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~--------- 67 (867) |-.---+-|+|---+ -+-.+=-|+|...-....-+++.++.+..- +.+.++.+...-..||... T Consensus 1 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G 77 (776) T protein:vir:93 1 MFDLNDKDSTQLVPA---RTDEGELSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDN 77 (776) T ss_pred CCCcccccccccccc---ccccccCCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCC Confidence 222222223322111 011122234444333333333333333222 2223333333444444211 Q ss_pred --------HHHhccc--ccccceeeccchhhhhhhhhHH-----hhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHh Q lcl|NC_011269. 68 --------ASYRKQG--NFGSNMQIAMPKIRQPLGTLAD-----KGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDI 132 (867) Q Consensus 68 --------~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 132 (867) +.+...| ...-| +-.|.|..=||.... +-+|++-.|++--+++..-|++++.....=..+-|. T Consensus 78 ~Qw~~~~~~~l~~~g~p~~~~N--~i~~~i~~v~g~~~~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~a 155 (776) T protein:vir:93 78 IQWSQDEIDELKERGQAPTVYN--VISQSVNWIIGSEKRGRSDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMA 155 (776) T ss_pred CCCCHHHHHHHHhcCCceEEec--chHHHHHHHHHHHHhCCcceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHH Confidence 1111111 11112 224555555555443 346877777766667777777777776777777777 Q ss_pred hhhccccccee---cccchhH-----------HHHHHHHhh-cccccHHHHh-------HHHHHHHHh--------hhhh Q lcl|NC_011269. 133 YSKFPVVGMEF---DSKDPLI-----------KTFYEDLFF-GEDLNYLEFL-------PDQFAREYF--------TVGE 182 (867) Q Consensus 133 ~~~~~~~~~~~---~~~~~~~-----------~~~~~~~~~-~~~~~~~~~~-------~~~~~~~~~--------~~~~ 182 (867) |..--+.||-+ ..+++.. -+||-|..- -.||.=..|+ +|.+.+.|= ..++ T Consensus 156 f~d~~~~G~G~~~v~~d~~~~~~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 235 (776) T protein:vir:93 156 FEETTKAGIGWLESQVQDENDGEPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVD 235 (776) T ss_pred HHHhhhcCcceEEEEeeccCCCCceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhh Confidence 76666655322 1122111 122222211 0011111122 111111110 0000 Q ss_pred hcc---------h------hhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccc Q lcl|NC_011269. 183 VTS---------L------AHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEE 247 (867) Q Consensus 183 ~~~---------~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (867) -.. . ..++......+....-+-++|+|-.. |.-.++. .-|-.+..| ++ .-++. T Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~-~~r~~~~-----~~~~~~~~~---~~---~~~~~ 303 (776) T protein:vir:93 236 NFETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEA-WFRMPVR-----VQRLKGRNS---DF---RGEVF 303 (776) T ss_pred cccccchhcccccccccccccccccccccccccccCCCeEEEEEE-EEeeeee-----hhhcccccc---cc---cceee Confidence 000 0 00000000011111111223333211 1111100 000000001 00 00000 Q ss_pred cchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHh----------h-hcCccccccCcchh-hHHHHH----- Q lcl|NC_011269. 248 TPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRV----------V-NRPTAWATRGAPHL-LRSFRT----- 310 (867) Q Consensus 248 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~-~~~~~~~~~~~~~~-~~~~~~----- 310 (867) .+.. +..+..+..| .+.+..+.+++| . ...++|...--|++ +.+|+. T Consensus 304 d~~~---------------~~~~~~~~~g-~~~~~~~~~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~ 367 (776) T protein:vir:93 304 DPND---------------ERHVLEVESG-RAVLAVSPMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGM 367 (776) T ss_pred cccc---------------hHHHHHhhcC-ceeehheeeeeeEEEEEecchhhhccCCCCCCCccceEEecCceeccccc Confidence 0000 0011112211 222222222211 1 12233333223333 222221 Q ss_pred -HHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCH---HHHHHHHHHHHHhhhcchhhhhhhhh---eeeee Q lcl|NC_011269. 311 -LMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQ---GELDEVRDDMQSLLAADFRLMVHNFG---LKVEN 383 (867) Q Consensus 311 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 383 (867) .=.-+.+.-+|+.+=+|.-.=+-+ |. - .+|+=+- ++.|++++ ...-.+-.+.|.+=+ ++++. T Consensus 368 ~~G~v~~~~d~Q~~~N~~~s~~~~~--l~-----~-~~~~~~~gav~~~d~~~~---~~~rp~~vi~~~~~~~~~~~~~~ 436 (776) T protein:vir:93 368 PYGVIRFMRGMQDDVNKRLSKALYI--LS-----T-NKVLMEEGAVDDIDEFRR---EAARPDAVMTVKNGKLGAVKMDV 436 (776) T ss_pred ccchHHhhhHHHHHHHHHHHHHHHh--hc-----C-CceeeccccccchHHHHH---hcccCCceeeeCCcccccccccc Confidence 001222333333222221111111 11 1 1231111 12344444 111111111111100 11111 Q ss_pred ccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhH--HHHHHHHHHHHHHHHHHHhhhhHHHHH-hhc Q lcl|NC_011269. 384 VFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNR--EFVTQIMTGFQNALKRHIRRRCEVVAE-AQG 460 (867) Q Consensus 384 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~~r~~~~~i~e-~q~ 460 (867) . ..--..+-.-+......|-..-||+.++.- ..+.+-+.++++. +.....+..|...|+..+|++.+.+-+ |+- T Consensus 437 ~--~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G-~~~n~~Sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~ 513 (776) T protein:vir:93 437 D--RDLAPAHLELASRSIQMIQQVGGVTDEMLG-RTTNAVSGVAIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQ 513 (776) T ss_pred C--cCccHHHHHHHHHHHHHHHHhhCcChHHhC-CCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 000011224455666777777799998775 5555444444332 333334777778888888887766654 455 Q ss_pred ccchheehhh-ccccchhhhhhh--------------hhhh--------hhHhhhhh----hhhhhhhcccc----cccc Q lcl|NC_011269. 461 HYDYDLKGGV-RVPIYREIVEYD--------------EETG--------QEYIRKVP----KLLIPEIKFST----LNLR 509 (867) Q Consensus 461 ~~d~~~~~~~-~~~~~rd~~~~k--------------~e~~--------k~~~r~~~----k~i~~~i~~~~----~~Lr 509 (867) |++....+-. ..-.-+++|... ..+. +++-...+ +.+.+.+.... +.+. T Consensus 514 ~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~~e~~ 593 (776) T protein:vir:93 514 YMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAELMEVIGKMPPEIALTMLDLLVENM 593 (776) T ss_pred hcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHHHHHHHHHHHHhhcChhhHHHHHHHHHHhc Confidence 5554322211 100001111110 0000 00000000 00000000000 0001 Q ss_pred chhhhhhhhhhhhhceeeeeccccCCCcccccc-hhhhhhHHHHHHHHhh------cccccccccccccccCCCCCcccc Q lcl|NC_011269. 510 DEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFD-QELERQADETVQKLMA------TAQAMKKVQDLCDAQNLPYPPELA 582 (867) Q Consensus 510 ~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E-~e~e~k~~E~l~tL~~------taet~kkvq~~~p~~g~P~pp~~a 582 (867) +..........++... ...++...-....... ++.+.........+.. ...+. +................. T Consensus 594 d~p~~~e~~~~l~~~~-~~~~p~q~~~~~e~~~~qq~q~~~~q~q~~~~~a~~~~~qa~a~-~~~aea~~~~aqa~~~~~ 671 (776) T protein:vir:93 594 DIPNRDELVKRIRAVN-GQKDPDQDEPTPEEIAREQAQQQQQQYNDALAIATLEEQQAKAR-KAAAEAQVAEAKAKHISR 671 (776) T ss_pred CccchHHHHHHHHHhh-cccccchhhcchhHHHHHHHhhHHHHHHHHHhhhhhhHhhHHHH-HHHHHHHHHhhhhhhhhh Confidence 1011111111111000 0000000000000000 0000000000000000 00000 000000000000000000 Q ss_pred ccccccccCCCCCCCCCCCCCCCCccCCCCccCCCCCCCccCccCcCCCCCCCCCCCCCcccccccccCccCCcCCCCCC Q lcl|NC_011269. 583 QHLQSTLALRQGKTQTELGEAQAVAGEAQAELQTKQIEMQEMMMDQQMAGGVMPGQPMLPPGAPGDPAAGGPPPPAGGPM 662 (867) Q Consensus 583 Q~p~~t~~~a~gpgq~~~~qa~~~agq~~~p~~~~~~~~qp~~~~qg~pG~~gPpGP~gPpG~pG~pgP~gPpPp~~gP~ 662 (867) +........ .+... ++.... ........ ..... ....++..|..|..++.. .+. ++.+.+. T Consensus 672 ~a~~~~~~a----~q~a~-qa~~~~---~~~~~~a~-~a~~~---~~~a~~~~p~~p~~~~~~------~~~-~~~~~~p 732 (776) T protein:vir:93 672 MAIREGVGA----VKDAT-DAATAI---AFMPELAG-LSDGI---LRESGWDDPNTPQPASAA------SGM-PPAPAQP 732 (776) T ss_pred cchhhhhhh----hhhhh-hhhhhh---hhhhhhhh-hhhhh---hccccccccccccccccc------cCC-CCCCCCC Confidence 000000000 00000 000000 00000000 00000 000000000000000000 000 0000000 Q ss_pred CCCcCCCCcccccccccccccccccchhcccccccccccccccccccccccccccccCCCCCCc Q lcl|NC_011269. 663 GGPPVAPAPGVAGPGNAPASFYAASLRTADAINGPTGTGPSADGPLGPTGPELPPGVPEPTEVP 726 (867) Q Consensus 663 G~Pp~~p~PGaPGP~g~Pg~~gppg~pG~~g~~GP~G~gPgapGP~GP~GP~gpPG~PgP~gPp 726 (867) ..|..+..|..+.++.++..+.. |.. .+.. +..++..+.|..|- T Consensus 733 ~~p~~p~~p~~p~~~~~~~~p~~----------------p~~-~p~~---p~~~~~~~~pqqP~ 776 (776) T protein:vir:93 733 AQPANPAQPPAPGQAASEAQPAL----------------PAN-PPQP---PGVVPDGAAPQQPM 776 (776) T ss_pred CCCCCcCCCCCCCCCCCCCCCcc----------------cCC-CCCC---CCCCCCCCCCCCCC Confidence 00000000000000000000000 000 0000 00001111111111 No 169 >protein:vir:9706 Length: 100 # NCBI annotation: hypothetical protein # Family: family:all:316 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795468;genbank:gi:28876223;genbank:GeneID:1257767 Probab=37.57 E-value=0.44 Score=22.56 Aligned_cols=96 Identities=20% Similarity=0.278 Sum_probs=43.3 Q ss_pred CCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcccccceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHh Q lcl|NC_011269. 99 IPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYF 178 (867) Q Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (867) .+-+ .++|+.+|.||| |.++.+|+.|++++.-+. ++|+.-|+ T Consensus 1 m~~t---~e~L~~lK~~lR-----------------------ID~d~DD~li~~~i~~Ae--------~~I~~AV~---- 42 (100) T protein:vir:97 1 MAVS---KELLNSVKLYCK-----------------------IDFDFENDIIKEMIESAQ--------EQICFAID---- 42 (100) T ss_pred Cccc---HHHHHHHHHHcC-----------------------CCCCcchHHHHHHHHHHH--------HHHhhhcc---- Confidence 2222 356778899998 456778888888865432 24443321 Q ss_pred hhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhc--cccccccccccccccccchhhhhhh Q lcl|NC_011269. 179 TVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHL--RQGPTTAGGNMSTVEETPSEREQRM 256 (867) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 256 (867) ...+++.|.+.++..+-|+-+|.|- -||.+++-. ..+.|+ T Consensus 43 ----------------------------~~~t~~~~~~~~rF~~Av~~Lv~~~Y~nR~~t~d~~----~~~ip~------ 84 (100) T protein:vir:97 43 ----------------------------DGSTPEMFEGHAKFALAVKKQVKEEYDHRGLSADSF----RYPLAN------ 84 (100) T ss_pred ----------------------------CCCCcchhhccchHHHHHHHHHHHHHHhccccchhh----cchhhh------ Confidence 1122334555666666666655542 334322110 001111 Q ss_pred hhHHHHHHhchHHHhhh--ccCCC Q lcl|NC_011269. 257 REFQDLQRRYPEIIQAA--MQNDG 278 (867) Q Consensus 257 ~~~~~~~~~~~~~~~~~--~~~~~ 278 (867) -.=-||.+. ++.|+ T Consensus 85 --------gv~~lI~QLR~~~~~~ 100 (100) T protein:vir:97 85 --------GVLNIIHQLRLRGDDS 100 (100) T ss_pred --------hHHHHHHHHHHhhcCC Confidence 000022211 12222 No 170 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=37.53 E-value=1.1 Score=20.33 Aligned_cols=351 Identities=12% Similarity=0.097 Sum_probs=131.6 Q ss_pred ccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHH-HHhhccchHHHHHHhhhh----cccccceecccc Q lcl|NC_011269. 73 QGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCR-LFYATHDLVPLLIDIYSK----FPVVGMEFDSKD 147 (867) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~ 147 (867) -|-|+.-.+ ...+-.+.+.+ .+..|.. ..+...++|-.||++-+. .|+.=++.+.+| T Consensus 1 Mg~f~~~~~------------~~~~~~~~~~~------~~~~~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~ 62 (378) T protein:vir:94 1 MNLFGKVVS------------FSRGKLNNDTQ------RVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSD 62 (378) T ss_pred CCccccchh------------cccccccCCcc------eeeeeccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccC Confidence 111111111 01111111111 1112321 123355677778876654 333222222222 Q ss_pred hhH------H-HHHHHHhh---cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcc Q lcl|NC_011269. 148 PLI------K-TFYEDLFF---GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQR 217 (867) Q Consensus 148 ~~~------~-~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (867) ... + .-..+++- -+.++-.+|+..++ ..++.-|+++=+..++...|. .+.+.|+-.. T Consensus 63 ~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gna~i~~~~~~~~g~---~~~l~p~~~~--------- 129 (378) T protein:vir:94 63 VGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVI-KKLLSAPYVDLYAVFDDNTGE---LLDLLFADDK--------- 129 (378) T ss_pred cccccccccccchHHHHHhhcCCCCCCHHHHHHHHH-HHHhhcCceEEEEEeeCCCce---EEEEEecCCe--------- Confidence 111 1 11111111 13455568888766 788888987755444332211 1112222111 Q ss_pred hHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCcccc Q lcl|NC_011269. 218 ERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWA 297 (867) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (867) ..+...-|-|+.+. -+- T Consensus 130 -------------------------------------------------------------~~~~~~diiH~~~~--~~~ 146 (378) T protein:vir:94 130 -------------------------------------------------------------KEYKPEELVRLTSP--FYI 146 (378) T ss_pred -------------------------------------------------------------eEeeeeeeEEecCc--CCc Confidence 11222234455421 112 Q ss_pred ccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhc--hhhhhhhcccccCCCCcCCCC--HHHHHHHHHHHHHhhhcch--h Q lcl|NC_011269. 298 TRGAPHLLRSFRTLMAEESLNAAQDAVADRLYS--PLVLATLGIEDMGDGEPWIPD--QGELDEVRDDMQSLLAADF--R 371 (867) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~--~ 371 (867) -.|...|= .|.++|...+.. +==++++. + .+-++ ++..+.+|+.++.....+. . T Consensus 147 ~~g~s~l~-------------~~~~~i~~~~~~~~~~gil~~~------~-~l~~~~~~~~~~~~~~~~~~~~~~~~~g~ 206 (378) T protein:vir:94 147 NEDTSILD-------------NALASIQTKLEQGKLRGLLKIN------A-FLDIDNTQEYREKALTTIKNMQEGSSYNG 206 (378) T ss_pred cchhHHHH-------------HHHHHHHHHHhcccccceeeeC------C-cCCHHHHHHHHHHHHHHHHHhhccccccc Confidence 23433321 222222221111 10011111 1 11122 2334566666665555553 4 Q ss_pred hhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011269. 372 LMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRR 451 (867) Q Consensus 372 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~ 451 (867) ++|-.-|++++-...+-+...+ .+.++++++|.+++||...++. | +|+. +-...|+..-+.-+...|++++.+. T Consensus 207 ~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVP~~~l~-~---~~se-~~~~~f~~~tL~P~~~~ie~~l~~~ 280 (378) T protein:vir:94 207 LTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILL-G---TASQ-EQQIYFYNSTIIPLLIQLEKELTYK 280 (378) T ss_pred ceecCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhc-C---ChHH-HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 6777789999988877777777 4678999999999999999997 4 3443 3344556555666666666666665 Q ss_pred hHHHHHh-hcccc-h--heehhhccccchhhhhhhhhh--hhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhce Q lcl|NC_011269. 452 CEVVAEA-QGHYD-Y--DLKGGVRVPIYREIVEYDEET--GQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMG 525 (867) Q Consensus 452 ~~~i~e~-q~~~d-~--~~~~~~~~~~~rd~~~~k~e~--~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~ 525 (867) +=.-.|. ++++. . ++.|-+..+.- .++|+++ ++-.+... -+..+.-+... .+..++. + T Consensus 281 Ll~~~er~~g~~~~~~~~~~f~~~~l~~---~d~~~~~~~~~~~~~~G-----------~~T~NE~R~~~-gl~p~~g-G 344 (378) T protein:vir:94 281 LISTNRRRVVKGNLYYERIIVDNQLFKF---ATLKELIDLYHENINGP-----------IFTQNQLLVKM-GEQPIEG-G 344 (378) T ss_pred cCChhHhhhhhhcccccceeecchhhhh---cCHHHHHHHHHHHHhCC-----------CcCHHHHHHHh-CCCCCCC-C Confidence 5322232 22211 1 12222221211 1222221 11111110 00000000000 0110000 0 Q ss_pred eeeeccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCC Q lcl|NC_011269. 526 VPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNL 575 (867) Q Consensus 526 ~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~ 575 (867) ..+--+..-..+....+. +. ......+. -+..+. T Consensus 345 D~~~~~~n~~~~~~~~~~----~~---------~~~~~~~~---~e~~n~ 378 (378) T protein:vir:94 345 DVYIANLNAVAVKNLSDL----QG---------SRKDVTST---DETNNQ 378 (378) T ss_pred Ceeeecccccccccchhh----cC---------CcCCCCCC---CCCCCC Confidence 000000000000000000 00 00000000 000000 No 171 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=36.66 E-value=1.2 Score=20.23 Aligned_cols=354 Identities=12% Similarity=0.088 Sum_probs=128.2 Q ss_pred ccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHH-HhhccchHHHHHHhhh----hcccccceecccc Q lcl|NC_011269. 73 QGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRL-FYATHDLVPLLIDIYS----KFPVVGMEFDSKD 147 (867) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~ 147 (867) -|.|+..- .......+.+.+ .+..|--+ -+..++.|-.|||+-+ .-|+.=++-+.++ T Consensus 1 Mg~f~~~~------------~f~~~~~~~~~~------~~~~~~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~ 62 (378) T protein:vir:93 1 MNLFGKVV------------SFSRGKLNNDTQ------RVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSD 62 (378) T ss_pred Cccchhhh------------hhhccccCCCcc------eeeecccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccc Confidence 11111111 111111111111 11123211 1225566777776543 2333222222222 Q ss_pred hhHH---H----HHHHHh---hcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcc Q lcl|NC_011269. 148 PLIK---T----FYEDLF---FGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQR 217 (867) Q Consensus 148 ~~~~---~----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (867) -..+ + -..+++ =.+.++-.+|+..++ ..++.-|+++-+..++...| +.+.|-|+ T Consensus 63 ~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~-~~lll~Gn~~i~~~~~~~~g---~~~~l~~~------------ 126 (378) T protein:vir:93 63 VGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVI-KKLLRAPYVDLYAVFDDNTG---ELLDLLFA------------ 126 (378) T ss_pred cccccccccccchHHHHHhhcCCCCCCHHHHHHHHH-HHHhhcCceEEEEEeecCCc---eEEEEEec------------ Confidence 1111 1 111111 123455567888866 77888888776554433222 11111111 Q ss_pred hHHHHHHHHHHhhccccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCcccc Q lcl|NC_011269. 218 ERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWA 297 (867) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (867) +.+.+++..-|-|+.+ +-+- T Consensus 127 ----------------------------------------------------------~~~~~~~~~diih~r~--~~~~ 146 (378) T protein:vir:93 127 ----------------------------------------------------------DDKKEYKTEELVRLTS--PFYI 146 (378) T ss_pred ----------------------------------------------------------CCeeEeccceeEEecC--cccc Confidence 1122333344556543 2222 Q ss_pred ccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCC--HHHHHHHHHHHHHhhhcch--hhh Q lcl|NC_011269. 298 TRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPD--QGELDEVRDDMQSLLAADF--RLM 373 (867) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~--~~~ 373 (867) -.|+..|--+.++ .+.+.+.-. ... ++++.+ -+-++ ++..+.+++.++.....+. ..+ T Consensus 147 ~~~~s~l~~~~~~---------i~~~~~~~~-~~g-~l~~~~-------~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 208 (378) T protein:vir:93 147 NEDTSILDNALAS---------IQTKLEQGK-LRG-LLKINA-------FLDIDNTQEYREKALTTIKNMQEGSSYNGLT 208 (378) T ss_pred chhhHHHHHHHHH---------HHHHHhcCc-ccc-eeeeCC-------cCCHHHHHHHHHHHHHHHHHhhcccccccce Confidence 2344333221111 122222111 111 122221 11122 2233344444444444442 466 Q ss_pred hhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhH Q lcl|NC_011269. 374 VHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCE 453 (867) Q Consensus 374 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~ 453 (867) |-.-|++++....+-....+ .+.++++++|.+++||...+++ |+ |+. +-...|+..-++-+...|++++.+.+= T Consensus 209 ~l~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~-g~---~~e-~~~~~f~~~tl~P~~~~ie~~l~~kLl 282 (378) T protein:vir:93 209 PVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILL-GT---ATQ-EQQIYFYNSTIIPLLIQLEKELTYKLI 282 (378) T ss_pred EcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhc-CC---cHH-HHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 66778999888877777777 5678999999999999999997 43 332 233344444455566666666665552 Q ss_pred HHHHh-hccc---chheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhh-ceeee Q lcl|NC_011269. 454 VVAEA-QGHY---DYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKG-MGVPV 528 (867) Q Consensus 454 ~i~e~-q~~~---d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~-~~~pi 528 (867) .-.|. ++++ ..++.|-+..+.--|.-+.-+ .++..+... | +..+.-+...+ +..++. ....+ T Consensus 283 ~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~-~~~~~~~~G---~--------~t~NE~R~~~g-l~p~~ggD~~~~ 349 (378) T protein:vir:93 283 STNRRRVVKGNLYYERIIVDNQLFKFATLKELID-LYHENINGP---I--------FTQNQLLVKMG-EQPIEGGDVYIA 349 (378) T ss_pred ChhHhhhhhhcccccceeeccchhhhcCHHHHHH-HHHHHHhCC---C--------cCHHHHHHHhC-CCCCCCCCeeee Confidence 22221 1211 112333333332222211111 111111110 0 00000000000 000000 00000 Q ss_pred eccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccc Q lcl|NC_011269. 529 SDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQ 583 (867) Q Consensus 529 td~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ 583 (867) +.-..+.. ... +.+..+ .. ..+.+ ....+ T Consensus 350 ~~n~~~~~--~~~----~~~~~~-------~~--~~~~~-----------e~~n~ 378 (378) T protein:vir:93 350 NLNAVAVK--NLS----DLQGSR-------KD--VTSTD-----------ETNNQ 378 (378) T ss_pred cccccccc--chh----hhcCcc-------CC--CCCCC-----------CCCCC Confidence 00000000 000 000000 00 00000 00000 No 172 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=35.97 E-value=1.2 Score=20.15 Aligned_cols=456 Identities=11% Similarity=0.079 Sum_probs=169.5 Q ss_pred HhcCCCCCCchhhHHh--hhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhh Q lcl|NC_011269. 21 KAGVNMPNSPTMARAQ--AAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKG 98 (867) Q Consensus 21 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 98 (867) -+-||-|++.+-..-. .+-..+..+..-+...+..++.- .+++.++.+|-..- ..+ +.+.+.. ++. T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~-~~~~~~~~~yY~g~---~~i---~~~~~~~-----~~~ 68 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKEN-IDNITMGERYYNHH---PDI---LDAPPKR-----DVN 68 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHH-HHHHHHHHHHhcCC---Cch---hcccccc-----ccc Confidence 4556667777655432 22222223333233333333322 23344444443321 111 0011100 000 Q ss_pred CCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHH Q lcl|NC_011269. 99 IPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREY 177 (867) Q Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (867) .. ++ ..+.+- ...+.+.+.++|.+..|=++. +.++.+|+...++..++ |. -|+...+.+ +++.. T Consensus 69 ~~-----~~---~~~~~~---ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~~-~~--n~~~~~~~~-~~~~~ 133 (478) T protein:vir:10 69 GD-----YD---ETKPDW---RMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHT-LN--HKWDDKLVD-ILTAA 133 (478) T ss_pred cc-----cc---cccccc---eeccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHHH-Hh--cCHHHHHHH-HHHHH Confidence 00 00 011121 234577888999998887766 88888898888887775 53 466667766 56888 Q ss_pred hhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccc---cccccccccchhhhh Q lcl|NC_011269. 178 FTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAG---GNMSTVEETPSEREQ 254 (867) Q Consensus 178 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~ 254 (867) ...|.++-+--.++.+.. +..+++|+.+-+- |.-...-++ +-+++.= ...+ ..+-|..+.-.-+.. T Consensus 134 ~~~G~~~~~~~~d~~g~~--~~~~~~p~~~~~i---~d~~~~~~~-----~~~v~~~-~~~~~~~~~~y~~~~i~~~~~~ 202 (478) T protein:vir:10 134 SNKGIEWVQPYVDEEGEF--KTFRVPAEQAVPI---WTNKERDEL-----QAFIRVY-ELDGAERVEYWTKDDVTYYELK 202 (478) T ss_pred HhcCeEEEEEEecCCCee--EEEEEcccceEEE---EcCCCCCce-----EEEEEEE-EecCceEEEEEeCCeEEEEEEc Confidence 888988766555555443 3556777665432 221111110 0001000 0000 000010000000000 Q ss_pred hhhhHHHHHHhchHHHhh-hccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHH---Hhhhhc Q lcl|NC_011269. 255 RMREFQDLQRRYPEIIQA-AMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAV---ADRLYS 330 (867) Q Consensus 255 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~ 330 (867) +...+..+......+... .....-..+..-=|+++.| .++|.+.+-+ .+.|+ +.|+.+..-+ .+.+.. T Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----~~~g~sd~~~-v~~li--Da~~~~~S~~~~~~~~~~~ 274 (478) T protein:vir:10 203 EGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFM-YKTII--DALDKRLSDTQNTFDESVE 274 (478) T ss_pred CCeeeccccccccccccceecccccccCCccceEEecc-----CCCCCCcHHH-HHHHH--HHHHHHHHHHHHHHHHhhC Confidence 100000000000000000 0000011111111334433 3567776554 44444 2333222221 144667 Q ss_pred hhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhh--hheeeeeccccCccCchhHHHHHHHHHHHHhh Q lcl|NC_011269. 331 PLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHN--FGLKVENVFGRESVPNLDADYDRIERKLLQAW 408 (867) Q Consensus 331 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 408 (867) |.++++ | .+++. ..+.+.+++. +.++.-+ =|-.++++--....=.+...++.+++.|..-- T Consensus 275 p~~~~~-g----~~~~~-------~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s 337 (478) T protein:vir:10 275 LIYILK-G----YEGED-------MKDFMHNLKY-----YKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFG 337 (478) T ss_pred ceeeee-c----CCccc-------cchhhhhhhh-----cceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHh Confidence 765543 2 11211 1112222221 2221111 11222222212222233466777777777665 Q ss_pred ccchhhhcCCCccceehhhhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhhcccc--hheehhhccccchhhhhhh Q lcl|NC_011269. 409 GIGEALISGGTGGAYASSALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQGHYD--YDLKGGVRVPIYREIVEYD 482 (867) Q Consensus 409 ~~~~~~~~~g~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d--~~~~~~~~~~~~rd~~~~k 482 (867) ++-..-. ++.|++ .+.+.+.+. -++--..+..++..++++++.|.++.+.-. .+++..|....+++. T Consensus 338 ~~p~~~~-~~~~~n--~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~~~~~~~i~i~f~~~~p~d~---- 410 (478) T protein:vir:10 338 QGVDFQQ-DKFGNS--PSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVKVQDIEITFNFNVMVNE---- 410 (478) T ss_pred CccccCc-cccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEecCCCCCCH---- Confidence 5433211 122222 233333332 233455667778888888888877664211 112222222222221 Q ss_pred hhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccc----cCCCcccccchhhhhhHHHHHHHHhh Q lcl|NC_011269. 483 EETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKT----LAVNIDMKFDQELERQADETVQKLMA 558 (867) Q Consensus 483 ~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t----~p~tiqme~E~e~e~k~~E~l~tL~~ 558 (867) ...-+.+.++.. .++..+ ++...+ .+.+.++...|....... T Consensus 411 -----------------------------~e~a~~~~kl~g---~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~~~ 456 (478) T protein:vir:10 411 -----------------------------LENSQIAMNSTG---LLSKETILSNHAWVED--PVAEMERIEQENIELNQQ 456 (478) T ss_pred -----------------------------HHHHHHHHHHhC---CCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhh Confidence 111111111100 111111 111111 112223332232222111 Q ss_pred cccccccccccccccCCCCCcc Q lcl|NC_011269. 559 TAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 559 taet~kkvq~~~p~~g~P~pp~ 580 (867) .....+.+.+......-...++ T Consensus 457 ~~~~~~~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 457 LPDIEEGLNGEQQRQSENNQPE 478 (478) T ss_pred ccccccccCCCCCCCCCCCCCC Confidence 1111111111111111001111 No 173 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=34.04 E-value=0.65 Score=21.59 Aligned_cols=114 Identities=19% Similarity=0.237 Sum_probs=40.8 Q ss_pred cCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHHH Q lcl|NC_011269. 275 QNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQGE 354 (867) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 354 (867) |++==.|..+|- ++..+-++-.. =-||| -|.+.=.+..++.| .+=.. -||+||-|-..- T Consensus 1 m~~~~~l~~~L~-~~l~~L~~~~~---~~l~~----~Ig~~l~~~~~~rf----------~~q~~---PdG~~W~p~k~~ 59 (150) T protein:vir:57 1 MNEFKRFEDRLT-GLIESLSPSGR---RRLSA----ELAKRLRQSQQRRV----------MAQKA---PDGTPYAPRQQQ 59 (150) T ss_pred CchHHHHHHHHH-HHHHhcCChhH---HHHHH----HHHHHHHHHHHHHH----------HhhcC---CCCCCCcccChH Confidence 332222222222 22222111110 00111 11222222222222 11111 589999886542 Q ss_pred -------------HHHHHHHHHHh---hhcc------------hhhhhhhhheeeeeccccCc-------cCchh-HHHH Q lcl|NC_011269. 355 -------------LDEVRDDMQSL---LAAD------------FRLMVHNFGLKVENVFGRES-------VPNLD-ADYD 398 (867) Q Consensus 355 -------------~~~~~~~~~~~---~~~~------------~~~~~~~~~~~~~~~~~~~~-------~~~~~-~~~~ 398 (867) +...+. ..++ .-+| .|--|||||....-.+..-+ .|-|+ ++.. T Consensus 60 ~~~~k~~~~~~~l~~~~~l-~~sl~~~~~~~~a~vg~~~G~~~~yAaiHQfG~~~r~~~~~~~~~iPaRp~LG~s~~d~~ 138 (150) T protein:vir:57 60 SARKKTGRVKRKMFAKLIT-SRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEETRKDGKKIDYPARPLLGFTGEDVQ 138 (150) T ss_pred HHHHhccCCCcccchhhhh-ccceeeeeeCcEEEEEeecCCchhhhhhhhccccccccCCCceeecCCcccCCCCHHHHH Confidence 111110 1110 1112 24458999986532222111 12333 5556 Q ss_pred HHHHHHHHhhccch Q lcl|NC_011269. 399 RIERKLLQAWGIGE 412 (867) Q Consensus 399 ~~~~~~~~~~~~~~ 412 (867) +|...|+..|- + T Consensus 139 ~i~~~i~~~l~--r 150 (150) T protein:vir:57 139 MIEEIILAHLD--R 150 (150) T ss_pred HHHHHHHHHHh--C Confidence 66665555552 1 No 174 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=32.06 E-value=1.5 Score=19.70 Aligned_cols=414 Identities=13% Similarity=0.036 Sum_probs=142.2 Q ss_pred CCchH--HHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhh-HHHHHHHHHHHHHh Q lcl|NC_011269. 44 VDNKP--LIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDE-EELRVIRHWCRLFY 120 (867) Q Consensus 44 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 120 (867) |++.. ++..|.... .+.+.|++.+.+--. ++ |.+ ..++....++ ..+|.+-.||+ T Consensus 1 ~~~~~~~~i~~l~~~~---~~~~~r~~~l~~Yy~--G~---------~~i-----~~~~~~~~~~~~~~k~~~n~~~--- 58 (441) T protein:vir:80 1 MNSDELALIEGMYDRI---QRLSSWHCCIEGYYE--GS---------NRV-----RDLGVAIPPELQRVQTVVSWPG--- 58 (441) T ss_pred CCccHHHHHHHHHHHH---HHHHHHHHHHHHHHh--cC---------Ccc-----hhcCcccchhhhhhhhhcchHH--- Confidence 66655 333333322 223333333322111 11 111 1122222221 11233445555 Q ss_pred hccchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehh Q lcl|NC_011269. 121 ATHDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSE 199 (867) Q Consensus 121 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 199 (867) .++|.+..|-+.. |. +.+|+.|++++++ -|+..++.+ ..++...-|+.+-+--.++. |.- .. T Consensus 59 -------~ivd~~~~~l~~~g~~-~~d~~~l~~i~~~------n~~~~~~~~-~~~~~~~~G~a~~~v~~d~~-g~~-~i 121 (441) T protein:vir:80 59 -------IAVDALEERLDWLGWT-NGDGYGLDGVYAA------NRLATASCD-VHLDALIFGLSFVAIIPHGD-GTV-SV 121 (441) T ss_pred -------HHHHHHHhhhcccccc-CCChHHHHHHHHh------cCHHHHHHH-HHHHHhhcCeeEEEEEeCCC-Cce-EE Confidence 4566655544322 43 2345677777664 244444444 33666666665443323332 211 34 Q ss_pred eecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhh----hhhhHHHHHHhchHHHhhhcc Q lcl|NC_011269. 200 EILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQ----RMREFQDLQRRYPEIIQAAMQ 275 (867) Q Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~ 275 (867) .+++|.-+-+- | ++.+.. ++.+++.==...+....-+.-++.+... -...+. ..+.+|. T Consensus 122 ~~~~p~~~~~i---~--d~~~~~----~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~-~~~~~~~------- 184 (441) T protein:vir:80 122 RPQSPKNCTGK---F--SADGSR----LDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWV-EVDRIPN------- 184 (441) T ss_pred EEEccceEEEE---E--eCCCCc----eeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCccee-ecccccc------- Confidence 55666544321 1 111100 0000000000000000000011111100 000000 0111111 Q ss_pred CCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHH-HHhhhhchhhhhhhcccccCCCCcCCCCHHH Q lcl|NC_011269. 276 NDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDA-VADRLYSPLVLATLGIEDMGDGEPWIPDQGE 354 (867) Q Consensus 276 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 354 (867) .+..--|++++|+...=..+|.+-+-+..++|+-.=...-.+-+ +.+-+..|.+.++ |. -+++.. T Consensus 185 ----~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~-G~---------~~~~~~ 250 (441) T protein:vir:80 185 ----VLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT-GV---------SADEFS 250 (441) T ss_pred ----CCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeee-cC---------Cccccc Confidence 22333466777877666667888766666666643222222222 3344666766554 42 112211 Q ss_pred HHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHH---hhccchhhhcCCCccceehhhhhHH Q lcl|NC_011269. 355 LDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQ---AWGIGEALISGGTGGAYASSALNRE 431 (867) Q Consensus 355 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~g~~~~~~~~~~~~~ 431 (867) .+.-+-.+-.+++.+- ..-|-.++ ++.- ..-+++.-++.|+..|-+ .-+|...-+ |+.+-+ .+|.+.+. T Consensus 251 ~~~~~~~~~~i~~~~~----~~~~~~~~-~~~~-~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~-g~~~~~-~~Sg~Al~ 322 (441) T protein:vir:80 251 QPGWVLSMASVWAVDK----DDDGDTPN-VGSF-PVNSPTPYSDQMRLLAQLTAGEAAVPERYF-GFITSN-PPSGEALA 322 (441) T ss_pred cchhhhcccccccCCC----CCCCCcce-eEec-CccchHHHHHHHHHHHHHHhcccCCCHHHh-ccCCCc-chHHHHHH Confidence 1111112222222210 00000011 0100 011234444555544433 233432222 233322 22333332 Q ss_pred ----HHHHHHHHHHHHHHHHHhhhhHHHHHhhcccc------hheehhhccccchhhhhhhhhhhhhHhh----hhhhhh Q lcl|NC_011269. 432 ----FVTQIMTGFQNALKRHIRRRCEVVAEAQGHYD------YDLKGGVRVPIYREIVEYDEETGQEYIR----KVPKLL 497 (867) Q Consensus 432 ----~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d------~~~~~~~~~~~~rd~~~~k~e~~k~~~r----~~~k~i 497 (867) -+-++....++.++..++++++-+..+.+... +.+++.|+...+++..+.-.-+-|+.-- .+...+ T Consensus 323 ~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~ 402 (441) T protein:vir:80 323 AEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTV 402 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHH Confidence 23445666677788888888888877654211 2344445544444432211101111000 000001 Q ss_pred hhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHHHHHhhccc-ccccc Q lcl|NC_011269. 498 IPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETVQKLMATAQ-AMKKV 566 (867) Q Consensus 498 ~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l~tL~~tae-t~kkv 566 (867) ...+.+. .++...+ +.+ +.+..+....+..... ....+ T Consensus 403 ~~~l~~~-------~~e~~~~---------------------~~e---~~e~~~~~~~~~~~~~~~~~~~ 441 (441) T protein:vir:80 403 LEMLGLD-------DVQVEAV---------------------MRH---RAESSDPLAVLAGAISRQTNEV 441 (441) T ss_pred HHhCCCC-------HHHHHHH---------------------HHH---HHHHHHHHHHHhhhhhcccccC Confidence 1111000 0000000 000 0000111111111110 00111 No 175 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=31.64 E-value=1.5 Score=19.65 Aligned_cols=432 Identities=12% Similarity=0.062 Sum_probs=152.4 Q ss_pred HHHHHHHhhhcchhHHHHHHHHhccccc-cc--ceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhcc-- Q lcl|NC_011269. 49 LIDYFQGRRRAAEANRQRLASYRKQGNF-GS--NMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATH-- 123 (867) Q Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 123 (867) |+-|.. .|+...-.+..-..| +. |+-|-.-.|+.-|-.. .-+.++.++++-+.|+..| T Consensus 1 ~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~----------~~~~~~~~~~~~~yY~g~~~~ 63 (481) T protein:vir:10 1 MTVYTI-------NNINTKFSPLANDDFVVSDLAELLKEENLRNFISRH----------QTEQVPRLEMLESYYLNRNTD 63 (481) T ss_pred CeeEee-------ehhchhcccccCceeeeecchhhcCHHHHHHHHHHH----------HHHHHHHHHHHHHHhcCCCcc Confidence 221111 111110011111111 11 1111111111111110 0112223333433333333 Q ss_pred ---------------------chHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHhhhh Q lcl|NC_011269. 124 ---------------------DLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYFTVG 181 (867) Q Consensus 124 ---------------------~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (867) ++.+.++|.+..|=++. +.++++|+...++..+++- +-++..++.+ +.+...+-| T Consensus 64 i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~--~n~~~~~~~~-~~~~~~~~G 140 (481) T protein:vir:10 64 ILAGERRLQKYGDKADHRAVHNYAKYVSRFIVGYLTGNPITITHQDNQTNDKIIELND--LNDADEVNSD-LALNLSIYG 140 (481) T ss_pred cccCccccccccccccceeecchHHHHHHHHHhhhccCCceEecCChhHHHHHHHHHH--hcChhHHHHH-HHHHHHhcC Confidence 45678889988887766 8899999888887776543 5566677777 558888888 Q ss_pred hhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccc--cccccccchhhhhhhhhH Q lcl|NC_011269. 182 EVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGN--MSTVEETPSEREQRMREF 259 (867) Q Consensus 182 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 259 (867) .++-+--.++.+.. +..+++|+-+-+- |-.+..-+ ++-+++.=-..+..+ .-..|---.++..+.+.- T Consensus 141 ~~~~~~~~d~dg~~--~i~~~~p~~~~~v---~d~~~~~~-----~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~ 210 (481) T protein:vir:10 141 RAYEIVYRDFEDRD--TFKVLDPKSTFVV---YDQTLDKK-----VVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIK 210 (481) T ss_pred eEEEEEEeCCCCeE--EEEEEcccceEEE---EcCCCCCc-----eEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEec Confidence 77654444454332 3566788665332 11110001 111111000000000 000000000111110000 Q ss_pred H---HHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHH--HHHHHHHHHHHHhhhhchhhh Q lcl|NC_011269. 260 Q---DLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMA--EESLNAAQDAVADRLYSPLVL 334 (867) Q Consensus 260 ~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 334 (867) . .+...+|.-+ ..|+ |+++.| ..+|.+.+-. .+.|+- .+-+-...++ .+.+..|.++ T Consensus 211 ~~~~~~~~~~~~~~------g~vP-----vv~~~n-----~~~g~~~~~~-v~~lida~~~~~s~~~~~-~~~~~~~~~~ 272 (481) T protein:vir:10 211 GGTYHRVEEVEHYY------NDVP-----IIEYLN-----DQFKQGDFEN-VIALIDLYDSAQSDTANY-MTDLNDAMLA 272 (481) T ss_pred CCceeecccccccC------Ccee-----EEEeec-----CCCCCCchhh-HHHHHHHHHHHHHHHHHH-HHHhcCceeE Confidence 0 0000111100 0112 223333 3567776543 333332 1122222222 2345666665 Q ss_pred hhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhhccchhh Q lcl|NC_011269. 335 ATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAWGIGEAL 414 (867) Q Consensus 335 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 414 (867) ++-.. ..+-++++.++++-.-.+....--.-+.=+-.++++=-.-..=.+...++.+++.|..--++-..- T Consensus 273 ~~g~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 343 (481) T protein:vir:10 273 IIGNV---------DLDSEDAKAFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLN 343 (481) T ss_pred eecCc---------CCCccchhhhhhccceeccccccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccc Confidence 43110 223445666666322111100000000001111111000011112345566666665554444321 Q ss_pred hcCCCccceehhhhhHH----HHHHHHHHHHHHHHHHHhhhhHHHHHhhccc---c---hheehhhccccchhhhhhhhh Q lcl|NC_011269. 415 ISGGTGGAYASSALNRE----FVTQIMTGFQNALKRHIRRRCEVVAEAQGHY---D---YDLKGGVRVPIYREIVEYDEE 484 (867) Q Consensus 415 ~~~g~~~~~~~~~~~~~----~~~~~~~~~~~~l~~~~r~~~~~i~e~q~~~---d---~~~~~~~~~~~~rd~~~~k~e 484 (867) .|..+++ .+.+.+. .+-++.-..+..++..++++++.|..+-+.. + .+++..|+...+++..+.-.- T Consensus 344 -~~~~~~n--~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~ 420 (481) T protein:vir:10 344 -DEQFSGV--QSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFTPNLPKSMMESINA 420 (481) T ss_pred -ccccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeCCCCCcCHHHHHHH Confidence 1112222 1222222 3334455567788888888888887653321 1 244555555555443222211 Q ss_pred hhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccchhhhhhHHHHH-----HHHhhc Q lcl|NC_011269. 485 TGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQELERQADETV-----QKLMAT 559 (867) Q Consensus 485 ~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~e~e~k~~E~l-----~tL~~t 559 (867) +.++. . .+ +.+..+..+ +...+.+ .+.++...|.. ...... T Consensus 421 ~~kl~---g--~i---------------s~et~~~~l------------~~i~d~~--~E~~ri~~E~~~~~~~~~~~~~ 466 (481) T protein:vir:10 421 FNALS---G--GV---------------SESTRLSLL------------DFIDNPK--EELEKMQEEEAQREKQADKRGY 466 (481) T ss_pred HHHHh---c--cC---------------ChHHHHHhC------------CCCCCHH--HHHHHHHHHHHHHHhhhhhccC Confidence 11110 0 01 111111111 1000000 11111111111 111111 Q ss_pred ccccccccccccccC Q lcl|NC_011269. 560 AQAMKKVQDLCDAQN 574 (867) Q Consensus 560 aet~kkvq~~~p~~g 574 (867) ......-...-+..+ T Consensus 467 ~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 467 GEAFENHLNVDDSNG 481 (481) T ss_pred CccCCCCCCCCCCCC Confidence 111111111122223 No 176 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=29.81 E-value=1.3 Score=19.87 Aligned_cols=115 Identities=21% Similarity=0.228 Sum_probs=41.4 Q ss_pred ccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCCHH Q lcl|NC_011269. 274 MQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPDQG 353 (867) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 353 (867) |+++-..|. ..+.++..+-++-..+ -|||..= +.=++..++.|.. =.. -||+||-|--. T Consensus 1 m~~~~~~l~-~~l~~ll~~l~~~~~~---~l~r~Ig----~~l~~~t~~Rf~~----------q~~---PDG~~W~prk~ 59 (155) T protein:vir:79 1 MTDDLQALE-RWAGGLLAKLSPAARR---QLLRELG----RDLRRAQQSRVAA----------QRN---PDGSAYEPRKV 59 (155) T ss_pred CchHHHHHH-HHHHHHHHhcCChhHH---HHHHHHH----HHHHHHHHHHHHh----------hcC---CCCCCCcccch Confidence 222222222 2223333332221111 1333322 2222223333222 111 57888877321 Q ss_pred ------------------HHHHHHH--HHHHhhhcch-----------hhhhhhhheeeeeccccCcc--------Cchh Q lcl|NC_011269. 354 ------------------ELDEVRD--DMQSLLAADF-----------RLMVHNFGLKVENVFGRESV--------PNLD 394 (867) Q Consensus 354 ------------------~~~~~~~--~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~--------~~~~ 394 (867) +++.+|. =+++-.-+|. |--||+||.+.. |...... |-|. T Consensus 60 ~~~~~~~~~~~g~~~~~~m~~~l~~a~~l~~~~~~d~a~Vg~~Gs~~~yAaiHQfG~~~r-~~~~~~~v~iPaRp~LGls 138 (155) T protein:vir:79 60 KAGGKRLREKAGRVKREAMFRKLRTARYLRIDVDSTGLAIGFDERLSRIARVHQEGQKAP-VEPGGPLAQYPVRVVLGFS 138 (155) T ss_pred hhhhhhhhcccCcccchhhhhhhhhhheeeeeecCcEEEEEecCcchhhhhhhhcCCccc-CCCCCcccccccccccCCC Confidence 1112211 0011111221 345899997643 2222221 2222 Q ss_pred -HHHHHHHHHHHHhhccch Q lcl|NC_011269. 395 -ADYDRIERKLLQAWGIGE 412 (867) Q Consensus 395 -~~~~~~~~~~~~~~~~~~ 412 (867) ++.++|...|+..| ++ T Consensus 139 ~~d~~~I~~~i~~~l--~r 155 (155) T protein:vir:79 139 DADRELVRDRLLREL--TR 155 (155) T ss_pred HHHHHHHHHHHHHHh--hC Confidence 56666666665555 22 No 177 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=27.81 E-value=1.8 Score=19.17 Aligned_cols=114 Identities=18% Similarity=0.247 Sum_probs=45.7 Q ss_pred hhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhh Q lcl|NC_011269. 256 MREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLA 335 (867) Q Consensus 256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (867) |-|++.+. ..+.++.++-++-..+ -|||..=. .=.+..++.+.. T Consensus 1 m~~~~~~~--------------------~~l~~ll~~L~~~~~~---~l~r~Ig~----~l~~~t~~rf~~--------- 44 (149) T protein:vir:18 1 MSELTALQ--------------------ERLAGLIASLSPAARR---KMAAEIAK----KLRTSQQQRIKR--------- 44 (149) T ss_pred CchHHHHH--------------------HHHHHHHHhcCCchHH---HHHHHHHH----HHHHHHHHHHHh--------- Confidence 33332222 2222333332221111 13333222 222222332221 Q ss_pred hhcccccCCCCcCCCCHH-------------HHHHHHH--HHHHhhhcc-----------hhhhhhhhheeeeeccccCc Q lcl|NC_011269. 336 TLGIEDMGDGEPWIPDQG-------------ELDEVRD--DMQSLLAAD-----------FRLMVHNFGLKVENVFGRES 389 (867) Q Consensus 336 ~~~~~~~~~~~~~~~~~~-------------~~~~~~~--~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~ 389 (867) =.. -||+||-|... ++..+|. =|++-.-+| .|--|||||.+..-. .... T Consensus 45 -q~~---PdG~~W~p~~~~~~~~~~g~~~~~~~~~l~~~~~l~~~~~~~~~~v~~~Gtn~~yAaiHQfG~~~r~~-~~~~ 119 (149) T protein:vir:18 45 -QQA---PDGTPYAARKRQPVRSKKGRIKREMFAKLRTSRFMKAKGSDSAAVVEFTGKVQRMARVHQYGLKDRPN-RNSR 119 (149) T ss_pred -hcC---CCCCCCcccchhhhhhccCcccchhhhhhhhhhhhheeecCceeEEEecccchhhhhhhhcccccccc-CCCc Confidence 111 57899988653 3333332 011111111 345688999764322 2221 Q ss_pred --------cCch-hHHHHHHHHHHHHhhcc Q lcl|NC_011269. 390 --------VPNL-DADYDRIERKLLQAWGI 410 (867) Q Consensus 390 --------~~~~-~~~~~~~~~~~~~~~~~ 410 (867) .|-| +++-.+|+.-|...|+= T Consensus 120 ~v~iPaRp~LG~s~~d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 120 DVQYEARPLLGFTRDDEQMIEDVIISHLGK 149 (149) T ss_pred cccccccccCCCCHHHHHHHHHHHHHHHhC Confidence 1233 36667777777766622 No 178 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=26.29 E-value=2 Score=18.98 Aligned_cols=430 Identities=13% Similarity=0.024 Sum_probs=157.4 Q ss_pred hhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhh--------------HHHHHHHHHHHHHhhc---- Q lcl|NC_011269. 61 EANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDE--------------EELRVIRHWCRLFYAT---- 122 (867) Q Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~---- 122 (867) -.=||++...+..-.-|+|++--.-+--+-.+. ..+-.+.+.+ +++..++.+ ..||.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l-~~YY~g~~~i 76 (492) T protein:vir:97 1 MQFIQLISQVAQALIKGGNILYPSQPTQTEIFD---AIVRTNNKPETLEEMIVRYIKQHLEKLPEISIG-QEYYEQRPDI 76 (492) T ss_pred ChHHHHHHHHHHHHhcCCceeeccchhhhhHhh---hcccCCCchhhHHHHHHHHHHHHHHHHHHHHHH-HHHhcccCcc Confidence 122333333333333346665432221111111 1111111110 111112222 245522 Q ss_pred -----------------------cchHHHHHHhhhhccccc-ceecccchhHHHHHHHHhhcccccHHHHhHHHHHHHHh Q lcl|NC_011269. 123 -----------------------HDLVPLLIDIYSKFPVVG-MEFDSKDPLIKTFYEDLFFGEDLNYLEFLPDQFAREYF 178 (867) Q Consensus 123 -----------------------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (867) +.+.+.++|.+..|=++. +.++.+|+...++.+++ |..+ +...+.+ ++++.. T Consensus 77 ~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~-~~n~--~~~~~~~-~~~~~~ 152 (492) T protein:vir:97 77 VKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEV-LGNR--FDDKLHS-VLTGAS 152 (492) T ss_pred ccccccccccccccccccccccccchHHHHHHHHhhhhcccCceeccCchHHHHHHHHH-Hhcc--HHHHHHH-HHHHHh Confidence 467778899998887776 88999999999988876 5444 4455566 568888 Q ss_pred hhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhhhhh- Q lcl|NC_011269. 179 TVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQRMR- 257 (867) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 257 (867) .-|.++-+--.++.+.. +..+++|..+-+--+=-.-++. +-++|.=-..+.. .+|---..+..+.+ T Consensus 153 ~~G~a~~~v~~d~dg~~--~~~~~~p~~~~~i~d~~~~~~~--------~~~vr~~~~~~~~---~~~~y~~~~v~~~~~ 219 (492) T protein:vir:97 153 NKGIEWLHPYLDEEGEF--KLFRVPAEQGIPIWTDKEHEEL--------EAFIRMYKLENET---KVEYWDKVTVNYYVY 219 (492) T ss_pred hcCeEEEEEEecCCCce--EEEEEcccceEEEEcCCCCCce--------EEEEEEEeeccce---eEEEEecCeEEEEEE Confidence 89988766655554432 4566777655332100000111 1111110000000 00000000000000 Q ss_pred ---hHHHH---HHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHH---hhh Q lcl|NC_011269. 258 ---EFQDL---QRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVA---DRL 328 (867) Q Consensus 258 ---~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~ 328 (867) ..... ...++++-...-.=..++ |+++.| ..+|.+.+-. ...|+ +.|+.+..-.| +-+ T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~g~vP-----vv~~~n-----n~~g~sd~e~-v~~li--Da~d~~~S~~~~~~~~~ 286 (492) T protein:vir:97 220 ENGSLIPDYSNNLENSKTHFSTGSWGKIP-----FIPFKN-----NDLEISDIFM-YKTLI--DAYNRRLSDLSNTFKDS 286 (492) T ss_pred ecCeeeecccccccccccccccCCCCCcc-----eEEecC-----CCCCCCchHh-HHHHH--HHHHHHHHHHHHHHHHh Confidence 00000 001111100000000011 222222 2457776543 44444 33333333322 223 Q ss_pred hchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhHHHHHHHHHHHHhh Q lcl|NC_011269. 329 YSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDADYDRIERKLLQAW 408 (867) Q Consensus 329 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 408 (867) .-|.++++ | . +.+++.+.+++++.. .++...=+-+++++=-.-..=.+...++++++.|..-- T Consensus 287 ~~~~l~~~-g----~-------~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s 349 (492) T protein:vir:97 287 NELTYVLK-N----Y-------DDQELPEFKRLLRYY-----GAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFG 349 (492) T ss_pred ccceeeee-c----C-------CcccchhHHHHHhhc-----cceecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHh Confidence 34444332 2 1 123344444433322 11111222223322111111123355677777666655 Q ss_pred ccchhhhcCCCccceehhhhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhhcccc--hheehhhccccchhhhhhh Q lcl|NC_011269. 409 GIGEALISGGTGGAYASSALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQGHYD--YDLKGGVRVPIYREIVEYD 482 (867) Q Consensus 409 ~~~~~~~~~g~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q~~~d--~~~~~~~~~~~~rd~~~~k 482 (867) ++-..-. +..|++ .|.+.+.+. -++-...++.++..++++++-|.++-+.-+ .+++..|.-..+++.. T Consensus 350 ~~p~~~~-~~~~~n--~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~--- 423 (492) T protein:vir:97 350 QAVDFSS-DKFGSA--PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTE--- 423 (492) T ss_pred CCCCCCc-cccccC--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeEEecCCCCCCHH--- Confidence 4432111 111221 344444333 233456677788888888888877644211 2233333333333221 Q ss_pred hhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccc----cCCCcccccchhhhhhHHHHHHHHhh Q lcl|NC_011269. 483 EETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKT----LAVNIDMKFDQELERQADETVQKLMA 558 (867) Q Consensus 483 ~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t----~p~tiqme~E~e~e~k~~E~l~tL~~ 558 (867) ..-+.+.++.. .++..+ ++...+ .+.|.++...|+...... T Consensus 424 ------------------------------e~a~~~~kl~G---~iS~et~l~~l~~v~d--~~~Eleri~~E~~~~~~~ 468 (492) T protein:vir:97 424 ------------------------------LQVQTAQQSMG---IVSHETVLENHPFVED--LQAELERIEQEQTEYNKQ 468 (492) T ss_pred ------------------------------HHHHHHHHHhc---cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHh Confidence 11111222110 111111 111111 112233333333221111 Q ss_pred cccccccccccccccCCCCCccccccccccccC Q lcl|NC_011269. 559 TAQAMKKVQDLCDAQNLPYPPELAQHLQSTLAL 591 (867) Q Consensus 559 taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~ 591 (867) .. ........... + .......... T Consensus 469 ~~-~~~~~~~~~~~-------~-~~~~~~~~~e 492 (492) T protein:vir:97 469 LP-NLDDGGADSAQ-------Q-QERSNNKESE 492 (492) T ss_pred hh-ccccCCCCCCc-------c-cccccccccC Confidence 10 00110000000 0 0000000001 No 179 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=25.20 E-value=2.1 Score=18.83 Aligned_cols=391 Identities=10% Similarity=0.075 Sum_probs=133.4 Q ss_pred hCCCchhhhHHHHHHHHHHHHHh----------------------hccchHHHHHHhhhhccccc-ceecccchhHHH-- Q lcl|NC_011269. 98 GIPFNVEDEEELRVIRHWCRLFY----------------------ATHDLVPLLIDIYSKFPVVG-MEFDSKDPLIKT-- 152 (867) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-- 152 (867) +|=.-.+++ ++.+++.- .|| ..+.+.+++||.+..|=++. +.++..|+.-++ T Consensus 1 ~~~~~~~~~--~~r~~~l~-~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~~ 77 (440) T protein:vir:95 1 MLAAFLGSQ--KQRLAILA-SYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVMEGGSADQL 77 (440) T ss_pred ChhhHHHHH--HHHHHHHH-HHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEeeCCCccHHHH Confidence 111111111 11122221 122 23456678899998887775 777665554333 Q ss_pred -HHHHHhhcccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhc Q lcl|NC_011269. 153 -FYEDLFFGEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHL 231 (867) Q Consensus 153 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (867) +..+++...+++ ..+.. +++.-.+.|.++-+---++.+.. ...+++|.-+-+-..-.+.++ ++-++ T Consensus 78 ~~l~~~~~~n~~~--~~~~~-~~~~~~~~G~a~~~~~~d~~~~~--~i~~~~p~~~~~~~d~~~~~~--------~~~~i 144 (440) T protein:vir:95 78 STIKDIEWQNDIN--ALNSD-LAFDASVYGRAYEYHFRDKDKVD--RVVLISPLEMFVIRDLTVEQN--------IIAAV 144 (440) T ss_pred HHHHHHHHhcCHh--HHHHH-HHHHHhhcCeEEEEEEecCCCce--EEEEEcccceEEEEcCCCCCc--------eEEEE Confidence 455555533444 44444 44777777876544333443222 245667765544311111111 11111 Q ss_pred cccccccccccccccccchhhhhhhhhHHHHHHhchHHHhhhccCCC-CcccHHHHHHhhhcCccccccCcchhhHHHHH Q lcl|NC_011269. 232 RQGPTTAGGNMSTVEETPSEREQRMREFQDLQRRYPEIIQAAMQNDG-LDISEALISRVVNRPTAWATRGAPHLLRSFRT 310 (867) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (867) +.--..+...+. --|+. +..+.+.+..--..+ +.++...-+=| |+ |+++.| ..+|.+.+=+ .+. T Consensus 145 ~~~~~~~~~~~~--vyt~~-~~~~~~~~~~~~~~~-~~~~~~~~~~g~vP-----vv~~~n-----~~~g~sd~e~-v~~ 209 (440) T protein:vir:95 145 HLPIYADKVNMT--VYTKD-KVITYKPYSNNSVRL-VVDDVKKHSYNDVP-----VVEWWN-----NRFRMGDYES-EIS 209 (440) T ss_pred EEEEecCceEEE--EEeCC-eEEEEEEecCCccce-eecceeeccCceee-----EEEeeC-----CCCCCCchhh-hHH Confidence 110000000000 00110 000000000000000 00000000000 11 223333 2346665533 333 Q ss_pred HHHHHHHHHH--HHHHH-hhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeecccc Q lcl|NC_011269. 311 LMAEESLNAA--QDAVA-DRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGR 387 (867) Q Consensus 311 ~~~~~~~~~~--~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 387 (867) |+ +.|+.+ +-+.+ +-+..|.++++-- ....-.+-+++..+++. ..+..... ......+.. T Consensus 210 li--da~~~~~s~~~~~~~~~~~~~~v~~g~------~~~~~~~~e~~~~~~~~--~~~~~~~~-------~~~~~~~~~ 272 (440) T protein:vir:95 210 LI--DAYDAGQSDTANYMSDLNDAMLLVKGD------LDGIKLSPEDAAKMKDA--NMLFLKTG-------ISTTGQQTT 272 (440) T ss_pred HH--HHHHHHHHHHHHHHHHhhcceeeeecc------cccCCCCccchhhhhhc--cceecccc-------cccccCCCC Confidence 33 223322 22221 2345676665521 11112233444445441 01111000 000011111 Q ss_pred Ccc------Cch---hHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHH----HHHHHHHHHHHHHHHhhhhHH Q lcl|NC_011269. 388 ESV------PNL---DADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFV----TQIMTGFQNALKRHIRRRCEV 454 (867) Q Consensus 388 ~~~------~~~---~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~~r~~~~~ 454 (867) +.+ .++ ...++.+++.|..--++...... ..+++ .|.+.+.+. -++--..+..++..++++++. T Consensus 273 ~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n--~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~l 349 (440) T protein:vir:95 273 ADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDD-RFNST--SSGIALLYKMIGLEQVRKDKETYFTKALRRRYEL 349 (440) T ss_pred cceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc-ccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 222 25677777777766655432211 22222 233333322 223444566677778888888 Q ss_pred HHHhhccc------chheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeee Q lcl|NC_011269. 455 VAEAQGHY------DYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPV 528 (867) Q Consensus 455 i~e~q~~~------d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pi 528 (867) |.++=+.. ..+++..|+...+++..+.-.=+.|+. .+ + +.+..+.. T Consensus 350 i~~~~~~~~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl~---------------g~-i----S~et~~~~-------- 401 (440) T protein:vir:95 350 ISNIHKAINGPVIEANKLTFTFHPNIPQDVWTEIKAYIEAG---------------GE-I----SQETLMEN-------- 401 (440) T ss_pred HHHHHhhcCCcccccccceEEeCCCCCCCHHHHHHHHHHHh---------------cc-C----cHHHHHHh-------- Confidence 77642211 123444444444444222111111110 00 0 11111111 Q ss_pred eccccCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 529 SDKTLAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 529 td~t~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) ++.. + .+.|.++...|............... .+.....+ T Consensus 402 ----l~~~-d--~~~E~~ri~~E~~~~~~~~~~~~~~~------~~~~~~~e 440 (440) T protein:vir:95 402 ----ASFT-D--YKTEHSRILKQGGSSDLEIGQIVGDA------DVGQADTE 440 (440) T ss_pred ----CCCC-C--cHHHHHHHHHHHHHhhhhHHhhccCC------CCCCcCCC Confidence 1111 1 11122222222211111111000111 11111111 No 180 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=24.00 E-value=2.2 Score=18.67 Aligned_cols=482 Identities=13% Similarity=0.104 Sum_probs=189.0 Q ss_pred CCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcc-cccccceeeccchhhhhhhhhHHhhCCCchh Q lcl|NC_011269. 26 MPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQ-GNFGSNMQIAMPKIRQPLGTLADKGIPFNVE 104 (867) Q Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 104 (867) || |..-|+.+-..+ +...... .+.+|..+ +..|+ ++--++..+|..++..-.-=|--. T Consensus 1 ~~-----~~~~w~~~de~~-------~~~~~~~-------~~~~~~~p~~~dG~--s~i~~~~~~~~~~~~~~~~~~gg~ 59 (533) T protein:vir:58 1 MP-----SLEKYKKLNEAV-------NFTNFLS-------PMYGMGAPHGAGGS--SMIPINMYHPFATAGYASRFYGGI 59 (533) T ss_pred CC-----CcchhhhhhHHH-------HHHHhhc-------hhhcccCccCCCCC--ccccCCCCcchhhhhhhhhhhccc Confidence 22 111122111100 0000000 01122111 11222 222234455544432110001112 Q ss_pred hhHHHHHHHHHHHHHhhccchHHHHHHhhhhcccccceec------c----cchhHHHHHHHHhhcccccHHHHhHHHHH Q lcl|NC_011269. 105 DEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVVGMEFD------S----KDPLIKTFYEDLFFGEDLNYLEFLPDQFA 174 (867) Q Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 174 (867) .+++-..|+++ |.-+-+||.|-+.||.--...|+--+++ - =+..||+.-.+ =||.-+-.-+.| T Consensus 60 ~~n~~eLI~~Y-R~ma~~~pEVd~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~-----lldf~~~~~~~f- 132 (533) T protein:vir:58 60 EFNRFFLYDMY-DRMDYTDPLISTVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDY-----VINIEKNAYPII- 132 (533) T ss_pred cccHHHHHHHH-HHhhccCcchhhHHHhhhceeeEecCCCceeEeecccccccHHHHHHHHH-----HhcchhhhhHHH- Confidence 34555567776 5556689999999998877776643221 1 23455654332 345444445555 Q ss_pred HHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccccccccccchhhhh Q lcl|NC_011269. 175 REYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGNMSTVEETPSEREQ 254 (867) Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (867) |.+++-|..+=-+--+-+.+-+.++.-|||-.|+..+.+.-.. |-. T Consensus 133 R~WYVDGriy~Hkiik~~k~GI~elr~lDPr~i~~vr~~~t~~---------------------------------eyy- 178 (533) T protein:vir:58 133 RNMIKYGDMFLHILEKGSDGTIEKFQVVSPYIFSKRYNPETDT---------------------------------WYY- 178 (533) T ss_pred HhhhhcceeEEEeccCCcccchhhheecCCeeeEEEEeeccce---------------------------------EEE- Confidence 5555666553322222234567789999999888764332110 000 Q ss_pred hhhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhc-CccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhch-- Q lcl|NC_011269. 255 RMREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNR-PTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSP-- 331 (867) Q Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 331 (867) -|..-+.. ....+.+++|+..-|.|+.+. -......+-++|=++.+++=++.-+..| -|.=|+.-. T Consensus 179 ---vy~~~~~~------~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~NQLkmiEDA--lVIYRisRAPe 247 (533) T protein:vir:58 179 ---VITDVYRN------VVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWNQLRLMEDA--LMLYRVVRSVD 247 (533) T ss_pred ---eecccccc------cccCccccccchhheeeeeeccccCCCCceehhhhHHHHHHHHHHHHHHH--HHHHhhcCChh Confidence 01111111 235677899999999988877 4556777889999998888776655433 244454433 Q ss_pred hhh--hhhcccccCCCCcCCCCHHHHHHHHH---HHHHhhhcc----------hhh----hhhhh---------heeeee Q lcl|NC_011269. 332 LVL--ATLGIEDMGDGEPWIPDQGELDEVRD---DMQSLLAAD----------FRL----MVHNF---------GLKVEN 383 (867) Q Consensus 332 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~----------~~~----~~~~~---------~~~~~~ 383 (867) =|+ ..||. ||...-=+-+|| -++.-|+.| .+| |+-.| |-.|++ T Consensus 248 RRvFYIDVGN---------lpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~T 318 (533) T protein:vir:58 248 RRVFYVDVGN---------VPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDI 318 (533) T ss_pred heEEEEeecC---------CCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeee Confidence 123 45674 777665566666 223333333 332 44444 356777 Q ss_pred ccccCccCchhHHHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHHh-hccc Q lcl|NC_011269. 384 VFGRESVPNLDADYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFVTQIMTGFQNALKRHIRRRCEVVAEA-QGHY 462 (867) Q Consensus 384 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~i~e~-q~~~ 462 (867) .-+ ++ |..=+|++-.+|+|.++|++--+-+. -+++-=.++.+.+|=+- ...|+.+|... .++| ...+ T Consensus 319 LpG-g~-lgemeDV~YF~kkLy~ALnVP~sRl~-~e~~fgr~~eItRDEiK--F~KFI~rLR~r-------F~~ll~~qL 386 (533) T protein:vir:58 319 LQG-SK-VDLAEDVEYMLNRLISALKVPKAFIG-YEGDVNAKNTLATQDIK--FNNTIKRIQGF-------FVEELERMV 386 (533) T ss_pred cCC-CC-CCcHHHHHHHHHHHHHHhCCCeeecC-CCCCCccchhhhHHHHH--HHHHHHHHHHH-------HHHHHhccc Confidence 775 34 66669999999999999999888876 44432233334443332 33444444332 2221 1100 Q ss_pred chheehhhcccc----c-hhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCc Q lcl|NC_011269. 463 DYDLKGGVRVPI----Y-REIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNI 537 (867) Q Consensus 463 d~~~~~~~~~~~----~-rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~ti 537 (867) . ++++|..-+ + +|- -| .|+.+.++-.....++..+. .+.... -+-+..+-.+. T Consensus 387 i--lk~iit~eew~~~f~~Dn-~f-~ElKe~Eil~~Ri~~l~~~d-------pyvgk~-----------yi~k~ILr~td 444 (533) T protein:vir:58 387 R--MNKEFADQDFRLVMNRSN-SI-VEGERFAVIEQRIGIAERLK-------GWVRED-----------WIYSNILQIPY 444 (533) T ss_pred c--cccCcchhheeeeeeccc-hH-HHHHHHHHHHHHHHHHHHhc-------chhhHH-----------HHHHHHhcCCh Confidence 0 111111100 0 000 00 11222222221111111111 111110 01111111111 Q ss_pred ccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccccccCCCCCCCCCCCCCCCCccCCCCccCCC Q lcl|NC_011269. 538 DMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLALRQGKTQTELGEAQAVAGEAQAELQTK 617 (867) Q Consensus 538 qme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~a~gpgq~~~~qa~~~agq~~~p~~~~ 617 (867) +.+. + .+....|....+-.... ...+.......++..... ....++.....+ .. +++...+ T Consensus 445 ei~~-q-~e~ie~E~~~~~~~~~~------~~~e~~~~~~~~~~~~p~----~~~~~~~~~~~~----~~---~~~~~~~ 505 (533) T protein:vir:58 445 DLKP-Q-EEVAEAAGGGGLFDTGG------FGEETTPADFLGERGSPI----ESPRGRTEFDFG----TE---GGEELGG 505 (533) T ss_pred hhhH-H-HHHHHHhhcCCCCCCCC------cccccCCcccCccccCcc----cCCCChhhHhcc----cC---Ccccccc Confidence 0100 0 01111111110100000 000000001111110000 000000000000 00 0000000 Q ss_pred CCCCccCc---cCcCCCCCCCCCCCCCc Q lcl|NC_011269. 618 QIEMQEMM---MDQQMAGGVMPGQPMLP 642 (867) Q Consensus 618 ~~~~qp~~---~~qg~pG~~gPpGP~gP 642 (867) ...-.+.. ......|...|+-|--. T Consensus 506 ~~~~~~a~~~~~~~~g~~~~~~~~p~~~ 533 (533) T protein:vir:58 506 ELNLGGAFEEFEEETGGGEEELPFPEEE 533 (533) T ss_pred cccccccchhhhhhcCCcccCCCCCCCC Confidence 00000000 00000000000000000 No 181 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=23.06 E-value=2.4 Score=18.54 Aligned_cols=414 Identities=12% Similarity=0.058 Sum_probs=146.2 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHH----- Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRL----- 118 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 118 (867) |. |-.++-..+.. --.+. ...+.|-..-.+++++....+|-+. T Consensus 1 ~~---~~~~~~~~~~~----------------------~~~~e-------~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~ 48 (474) T protein:vir:10 1 MT---LYKLIDDIEAQ----------------------GILPK-------HIEALIESHKDDRERMVNLYNRYKTHIDYV 48 (474) T ss_pred Cc---hHHHHhhcccc----------------------CCCHH-------HHHHHHHHhhhhhHHHHHHHHHHhhhcchh Confidence 11 11111111110 00000 0111111111222222211111110 Q ss_pred -------------Hh---------------hccchHHHHHHhhhhccccc-ceecc-----cchhHHHHHHHHhhccccc Q lcl|NC_011269. 119 -------------FY---------------ATHDLVPLLIDIYSKFPVVG-MEFDS-----KDPLIKTFYEDLFFGEDLN 164 (867) Q Consensus 119 -------------~~---------------~~~~~~~~~~~~~~~~~~~~-~~~~~-----~~~~~~~~~~~~~~~~~~~ 164 (867) || ..+++.+.++|.+..|=++. +.++. +|+.+++++.++.= +-| T Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~--~n~ 126 (474) T protein:vir:10 49 PIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAI--RNS 126 (474) T ss_pred hhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHh--hcC Confidence 00 23788899999999998765 66654 46777887777643 334 Q ss_pred HHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccc--- Q lcl|NC_011269. 165 YLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGN--- 241 (867) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 241 (867) +...+.+ +++....-|.++-+-..++.+.. +..+++|.-+-+- | ++.++. +-+++.--..++.. T Consensus 127 ~~~~~~~-~~~~~~~~G~a~~~~~~d~~~~~--~~~~i~p~~~~~v---~--d~~~~~-----~~~i~~~~~~~~~~~~~ 193 (474) T protein:vir:10 127 VDDEDSE-IGKMAAICGYGARLAYIDTNGDI--RIKNIDPYNVIFV---G--DNILEP-----TYSLRYFYEKDDDNGTD 193 (474) T ss_pred HhHHHHH-HHHHHhhcCeEEEEEEeCCCCee--EEEEEcccceEEE---E--cCCCce-----EEEEEEEEEeeCCCceE Confidence 5555555 45778888887665544554433 3456667644221 1 222221 11122111111100 Q ss_pred cccccccchhhhhhhhhH--HH-HHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHH Q lcl|NC_011269. 242 MSTVEETPSEREQRMREF--QD-LQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLN 318 (867) Q Consensus 242 ~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (867) ...+|- ..+.+-| .. -...+-+ ++..-- + +..=-|+++.| ..+|.+.+-. .+.|+- .|+ T Consensus 194 ~~~~~~-----y~~~~~~~~~~~~~~~~~~-~~~~~~--~--~g~vPvv~~~n-----~~~g~sd~e~-v~~liD--a~d 255 (474) T protein:vir:10 194 YVYAEF-----YDNAYYYVFRGEGIDALQE-VGRYEH--L--FDYNPLFGVPN-----NKEMIGDAEK-VIHLID--AYD 255 (474) T ss_pred EEEEEE-----EcCceEEEEeecCCCcccc-cccccC--C--CCccceEEecC-----CCCCCCchHH-HHHHHH--HHH Confidence 001110 0000000 00 0000000 000000 0 00001223333 3457776654 444433 233 Q ss_pred HHHHHHH---hhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhH Q lcl|NC_011269. 319 AAQDAVA---DRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDA 395 (867) Q Consensus 319 ~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 395 (867) .+..-.+ +.+..|+++++ | ..+. + +++..++. .-.++++.=+.+++++=-.-..=.+.. T Consensus 256 ~~~S~~~~~~~~~~~~~l~i~-g----~~~~----~-~~~~~~~~--------~~~i~~~~~~~~~~~l~~~~~~~~~~~ 317 (474) T protein:vir:10 256 LTMSDASSEISQTRLAYLVLR-G----MGMS----E-EMIQETQK--------SGAFELFDKDMDVKYLTKDVNDTMIEN 317 (474) T ss_pred HHHHHHHHHHHHhhcchhhhc-c----CCCC----c-hhhhhhhh--------cceeEecCCCCceeEEeccCCHHHHHH Confidence 3222222 34667776664 3 1111 1 12222221 112222222223332211111112235 Q ss_pred HHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhhcc---c----c- Q lcl|NC_011269. 396 DYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQGH---Y----D- 463 (867) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q~~---~----d- 463 (867) .++.+++.|..--++-..- .++-+++ .+.+.+.+. -++....+..++..++++++.|.++-.+ . + T Consensus 318 ~~~~l~~~I~~~s~~p~~~-~~~~~~n--~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~ 394 (474) T protein:vir:10 318 HLDRIEKNIMRFAKSVNFN-SDEFNGN--VPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSY 394 (474) T ss_pred HHHHHHHHHHHHhCCcccc-ccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcccc Confidence 5677777776654443211 1112222 233344332 2334556678888888888888774221 1 1 Q ss_pred hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccch Q lcl|NC_011269. 464 YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQ 543 (867) Q Consensus 464 ~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~ 543 (867) .+++..|....+++..+.-.-+.++. -.+-. +..+..+ +...+ .+. T Consensus 395 ~~i~~~f~~~~p~d~~e~a~~~~kl~-----g~iS~---------------et~~~~l------------~~v~d--~~~ 440 (474) T protein:vir:10 395 LNLIFKFTRNIPVNKLEESQVLINLK-----GQVSE---------------RTRLGQS------------QLVDD--VDY 440 (474) T ss_pred ccceEEeCCCCCCCHHHHHHHHHHHh-----ccCch---------------HHHHHhC------------CCCCC--HHH Confidence 23444555445554322222111111 00101 1111111 10000 111 Q ss_pred hhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 544 ELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 544 e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) +.++...|............+.-.+.....+- .+ T Consensus 441 E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~~---s~ 474 (474) T protein:vir:10 441 ELDEMEKESLEFNDKLPDIDEGDANDKSQNNQ---SE 474 (474) T ss_pred HHHHHHHHHHHHHhhcccccCCCcCCCCcccc---CC Confidence 11222222211111111111100000000000 01 No 182 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=23.06 E-value=2.4 Score=18.54 Aligned_cols=414 Identities=12% Similarity=0.058 Sum_probs=146.2 Q ss_pred CCchHHHHHHHHhhhcchhHHHHHHHHhcccccccceeeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHH----- Q lcl|NC_011269. 44 VDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNMQIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRL----- 118 (867) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 118 (867) |. |-.++-..+.. --.+. ...+.|-..-.+++++....+|-+. T Consensus 1 ~~---~~~~~~~~~~~----------------------~~~~e-------~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~ 48 (474) T protein:vir:94 1 MT---LYKLIDDIEAQ----------------------GILPK-------HIEALIESHKDDRERMVNLYNRYKTHIDYV 48 (474) T ss_pred Cc---hHHHHhhcccc----------------------CCCHH-------HHHHHHHHhhhhhHHHHHHHHHHhhhcchh Confidence 11 11111111110 00000 0111111111222222211111110 Q ss_pred -------------Hh---------------hccchHHHHHHhhhhccccc-ceecc-----cchhHHHHHHHHhhccccc Q lcl|NC_011269. 119 -------------FY---------------ATHDLVPLLIDIYSKFPVVG-MEFDS-----KDPLIKTFYEDLFFGEDLN 164 (867) Q Consensus 119 -------------~~---------------~~~~~~~~~~~~~~~~~~~~-~~~~~-----~~~~~~~~~~~~~~~~~~~ 164 (867) || ..+++.+.++|.+..|=++. +.++. +|+.+++++.++.= +-| T Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~--~n~ 126 (474) T protein:vir:94 49 PIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAI--RNS 126 (474) T ss_pred hhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHh--hcC Confidence 00 23788899999999998765 66654 46777887777643 334 Q ss_pred HHHHhHHHHHHHHhhhhhhcchhhhhhhccceehheecCcceeehhhhhhhcchHHHHHHHHHHhhccccccccccc--- Q lcl|NC_011269. 165 YLEFLPDQFAREYFTVGEVTSLAHFNESLGVWSSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQGPTTAGGN--- 241 (867) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 241 (867) +...+.+ +++....-|.++-+-..++.+.. +..+++|.-+-+- | ++.++. +-+++.--..++.. T Consensus 127 ~~~~~~~-~~~~~~~~G~a~~~~~~d~~~~~--~~~~i~p~~~~~v---~--d~~~~~-----~~~i~~~~~~~~~~~~~ 193 (474) T protein:vir:94 127 VDDEDSE-IGKMAAICGYGARLAYIDTNGDI--RIKNIDPYNVIFV---G--DNILEP-----TYSLRYFYEKDDDNGTD 193 (474) T ss_pred HhHHHHH-HHHHHhhcCeEEEEEEeCCCCee--EEEEEcccceEEE---E--cCCCce-----EEEEEEEEEeeCCCceE Confidence 5555555 45778888887665544554433 3456667644221 1 222221 11122111111100 Q ss_pred cccccccchhhhhhhhhH--HH-HHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHH Q lcl|NC_011269. 242 MSTVEETPSEREQRMREF--QD-LQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLN 318 (867) Q Consensus 242 ~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (867) ...+|- ..+.+-| .. -...+-+ ++..-- + +..=-|+++.| ..+|.+.+-. .+.|+- .|+ T Consensus 194 ~~~~~~-----y~~~~~~~~~~~~~~~~~~-~~~~~~--~--~g~vPvv~~~n-----~~~g~sd~e~-v~~liD--a~d 255 (474) T protein:vir:94 194 YVYAEF-----YDNAYYYVFRGEGIDALQE-VGRYEH--L--FDYNPLFGVPN-----NKEMIGDAEK-VIHLID--AYD 255 (474) T ss_pred EEEEEE-----EcCceEEEEeecCCCcccc-cccccC--C--CCccceEEecC-----CCCCCCchHH-HHHHHH--HHH Confidence 001110 0000000 00 0000000 000000 0 00001223333 3457776654 444433 233 Q ss_pred HHHHHHH---hhhhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcchhhhhhhhheeeeeccccCccCchhH Q lcl|NC_011269. 319 AAQDAVA---DRLYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADFRLMVHNFGLKVENVFGRESVPNLDA 395 (867) Q Consensus 319 ~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 395 (867) .+..-.+ +.+..|+++++ | ..+. + +++..++. .-.++++.=+.+++++=-.-..=.+.. T Consensus 256 ~~~S~~~~~~~~~~~~~l~i~-g----~~~~----~-~~~~~~~~--------~~~i~~~~~~~~~~~l~~~~~~~~~~~ 317 (474) T protein:vir:94 256 LTMSDASSEISQTRLAYLVLR-G----MGMS----E-EMIQETQK--------SGAFELFDKDMDVKYLTKDVNDTMIEN 317 (474) T ss_pred HHHHHHHHHHHHhhcchhhhc-c----CCCC----c-hhhhhhhh--------cceeEecCCCCceeEEeccCCHHHHHH Confidence 3222222 34667776664 3 1111 1 12222221 112222222223332211111112235 Q ss_pred HHHHHHHHHHHhhccchhhhcCCCccceehhhhhHHHH----HHHHHHHHHHHHHHHhhhhHHHHHhhcc---c----c- Q lcl|NC_011269. 396 DYDRIERKLLQAWGIGEALISGGTGGAYASSALNREFV----TQIMTGFQNALKRHIRRRCEVVAEAQGH---Y----D- 463 (867) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~~r~~~~~i~e~q~~---~----d- 463 (867) .++.+++.|..--++-..- .++-+++ .+.+.+.+. -++....+..++..++++++.|.++-.+ . + T Consensus 318 ~~~~l~~~I~~~s~~p~~~-~~~~~~n--~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~ 394 (474) T protein:vir:94 318 HLDRIEKNIMRFAKSVNFN-SDEFNGN--VPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSY 394 (474) T ss_pred HHHHHHHHHHHHhCCcccc-ccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcccc Confidence 5677777776654443211 1112222 233344332 2334556678888888888888774221 1 1 Q ss_pred hheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhceeeeeccccCCCcccccch Q lcl|NC_011269. 464 YDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGVPVSDKTLAVNIDMKFDQ 543 (867) Q Consensus 464 ~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~pitd~t~p~tiqme~E~ 543 (867) .+++..|....+++..+.-.-+.++. -.+-. +..+..+ +...+ .+. T Consensus 395 ~~i~~~f~~~~p~d~~e~a~~~~kl~-----g~iS~---------------et~~~~l------------~~v~d--~~~ 440 (474) T protein:vir:94 395 LNLIFKFTRNIPVNKLEESQVLINLK-----GQVSE---------------RTRLGQS------------QLVDD--VDY 440 (474) T ss_pred ccceEEeCCCCCCCHHHHHHHHHHHh-----ccCch---------------HHHHHhC------------CCCCC--HHH Confidence 23444555445554322222111111 00101 1111111 10000 111 Q ss_pred hhhhhHHHHHHHHhhcccccccccccccccCCCCCcc Q lcl|NC_011269. 544 ELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPE 580 (867) Q Consensus 544 e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~ 580 (867) +.++...|............+.-.+.....+- .+ T Consensus 441 E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~~---s~ 474 (474) T protein:vir:94 441 ELDEMEKESLEFNDKLPDIDEGDANDKSQNNQ---SE 474 (474) T ss_pred HHHHHHHHHHHHHhhcccccCCCcCCCCcccc---CC Confidence 11222222211111111111100000000000 01 No 183 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=21.97 E-value=2.5 Score=18.38 Aligned_cols=456 Identities=13% Similarity=0.115 Sum_probs=145.4 Q ss_pred CCCcccccccchhHHHHHHHHhcCCCCCCchhhHHhhhhhcccCCchHHHHHHHHhhhcchhHHHHHHHHhcccccccce Q lcl|NC_011269. 1 MSSPIYKAGSNWSAEVNRLRKAGVNMPNSPTMARAQAAALQNTVDNKPLIDYFQGRRRAAEANRQRLASYRKQGNFGSNM 80 (867) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (867) ||+|+... . .+++..+++.+...-+....-.+++..|-.- .| T Consensus 1 ~~~~~~~~-~--------------------------------~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G----~~- 42 (484) T protein:vir:77 1 MTSPLQKQ-E--------------------------------NVDPEKAREEMLNLFTERTQDLGDNTAYYES----ER- 42 (484) T ss_pred CCCccccc-C--------------------------------CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhc----cc- Confidence 55554433 1 2222222222222211111111233344221 11 Q ss_pred eeccchhhhhhhhhHHhhCCCchhhhHHHHHHHHHHHHHhhccchHHHHHHhhhhcccc-cceecccchhHHHHHHHHhh Q lcl|NC_011269. 81 QIAMPKIRQPLGTLADKGIPFNVEDEEELRVIRHWCRLFYATHDLVPLLIDIYSKFPVV-GMEFDSKDPLIKTFYEDLFF 159 (867) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 159 (867) .+- -++...+ +++ |.+...+.+.+.++|.+..+=++ ||.+..+|..-+. +.+ +| T Consensus 43 ---------~i~-----~~~~~~~--~~~-------~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~~~~~~-l~~-i~ 97 (484) T protein:vir:77 43 ---------RPD-----AVGVTVP--QQM-------QKLLAHVGYPRLYIDAIAARQELEGFRLGGADKADEQ-LWD-WW 97 (484) T ss_pred ---------cch-----hcccccc--hhH-------HhhhhhcCcHHHHHHHHHhhhccCceecCCcchhHHH-HHH-HH Confidence 111 1122222 222 22334556667888888775443 4766555443332 222 23 Q ss_pred cccccHHHHhHHHHHHHHhhhhhhcchhhhhhhccc----e--ehheecCcceeehhhhhhhcchHHHHHHHHHHhhccc Q lcl|NC_011269. 160 GEDLNYLEFLPDQFAREYFTVGEVTSLAHFNESLGV----W--SSEEILNPDMLRVSRSMFVQRERVQLMVKDLVDHLRQ 233 (867) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (867) .+-|+...... +.++.++-|.++=+--.++.+.. + -...+++|..+-+- | ++.. ++++.+++. T Consensus 98 -~~N~~d~~~~~-~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~---~--D~~~----~~~~~a~~~ 166 (484) T protein:vir:77 98 -QANDLDIESTL-GHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYAQ---I--DPRT----RQVMRAIRA 166 (484) T ss_pred -HhcCHhHHHHH-HHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEEE---e--cCCC----CceEEEEEE Confidence 22333333344 44667777776544333333221 0 12344555443211 1 1110 111111111 Q ss_pred cccccccccc-cccccchhhhhh---hhhHHHHHHhchHHHhhhccCCCCcccHHHHHHhhhcCccccccCcchhhHHHH Q lcl|NC_011269. 234 GPTTAGGNMS-TVEETPSEREQR---MREFQDLQRRYPEIIQAAMQNDGLDISEALISRVVNRPTAWATRGAPHLLRSFR 309 (867) Q Consensus 234 ~~~~~~~~~~-~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (867) ==..+++... -+.-|+.+.... ...|.. .+.+|. .+..=-|++++|+...-..+|.+-+-+... T Consensus 167 ~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~-~~~~~~-----------~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~ 234 (484) T protein:vir:77 167 IEDEEGNEVIGATLYLPNNTVIWNREDGQWVQ-VANVAH-----------NLEMVPVIPIPNRTRLSDLYGTTEITPELR 234 (484) T ss_pred EEeecCCcEEEEEEEecCeEEEEEecCCceEe-eccccC-----------CCCCcceEEeccccccCccCCcccchHHHH Confidence 0000111000 001112111110 001110 111111 111223577888877767778876666555 Q ss_pred HHHHHHHHHHH--HHHHHhh-hhchhhhhhhcccccCCCCcCCCCHHHHHHHHHHHHHhhhcch-hhhh-hhhheeeeec Q lcl|NC_011269. 310 TLMAEESLNAA--QDAVADR-LYSPLVLATLGIEDMGDGEPWIPDQGELDEVRDDMQSLLAADF-RLMV-HNFGLKVENV 384 (867) Q Consensus 310 ~~~~~~~~~~~--~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~ 384 (867) +|+ +.|+++ +-+++.+ +..|.|.++ |. +... .+ .+ .+..+..+.+.+ ++++ -+=+.++-.. T Consensus 235 ~L~--Da~~~~~s~~~~~~~~~a~p~~~i~-G~----~~~~-~~----~~--~~~~~~~~~~~~~~~~~~~~~~~~~~q~ 300 (484) T protein:vir:77 235 SVT--DAAARTLMLMQATAELMGVPQRLLF-GV----KGEE-LG----VD--PETGQTLFDAYLARILAFEDHESKAQQF 300 (484) T ss_pred HHH--HHHHHHHHHHHHHHHhhhhhHHHHh-CC----Ccch-hc----cc--ccccchhhhhhhhhhcccCCCCceeEee Confidence 665 333333 3333333 455666554 41 1110 00 00 001111111111 1111 1111111000 Q ss_pred cccCccCchhHHHHHHHHHHHHh---hccchhhhcCCCccceehhhhh----HHHHHHHHHHHHHHHHHHHhhhhHHHHH Q lcl|NC_011269. 385 FGRESVPNLDADYDRIERKLLQA---WGIGEALISGGTGGAYASSALN----REFVTQIMTGFQNALKRHIRRRCEVVAE 457 (867) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~g~~~~~~~~~~~----~~~~~~~~~~~~~~l~~~~r~~~~~i~e 457 (867) ..-.++--+++++..|-+- -+|...-+. |.+.+-+ |.+. +.-+.++--..++.++..++++++-+.+ T Consensus 301 ----~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg-~~~~n~~-Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~ 374 (484) T protein:vir:77 301 ----SAAELRNFVDALDALDRKAAAYTGLPPYYLS-FSSENPA-SAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYK 374 (484) T ss_pred ----cCCChHHHHHHHHHHHHHHhcccCCCHHHhc-cccCcch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0112333344555444333 345554453 4443322 2233 2233344455566677778888887777 Q ss_pred hhcccchheehhhccccchhhhhhhhhhhhhHhhhhhhhhhhhhccccccccchhhhhhhhhhhhhcee-eeeccc---- Q lcl|NC_011269. 458 AQGHYDYDLKGGVRVPIYREIVEYDEETGQEYIRKVPKLLIPEIKFSTLNLRDEAQERAFIAQLKGMGV-PVSDKT---- 532 (867) Q Consensus 458 ~q~~~d~~~~~~~~~~~~rd~~~~k~e~~k~~~r~~~k~i~~~i~~~~~~Lr~e~~~~~~v~qL~~~~~-pitd~t---- 532 (867) +.+..+.... . ...++.+...........-..+.+|...+. .+++.+ T Consensus 375 ~~~~~~~~~~-------~---------------------~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~ 426 (484) T protein:vir:77 375 VMNGGDIPPE-------Y---------------------YRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARID 426 (484) T ss_pred HhCCCCcccc-------c---------------------ccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhc Confidence 6553221000 0 000122222222222222223333322211 112111 Q ss_pred cCCCcccccchhhhhhHHHHHHHHhhcccccccccccccccCCCCCccccccccccccCCCCCCCCCCCCCCCCcc Q lcl|NC_011269. 533 LAVNIDMKFDQELERQADETVQKLMATAQAMKKVQDLCDAQNLPYPPELAQHLQSTLALRQGKTQTELGEAQAVAG 608 (867) Q Consensus 533 ~p~tiqme~E~e~e~k~~E~l~tL~~taet~kkvq~~~p~~g~P~pp~~aQ~p~~t~~~a~gpgq~~~~qa~~~ag 608 (867) ++..... .+ +.+....+.-.. ....+....... . +....+.... +....+......++ T Consensus 427 l~~~~~~-~~-e~~~~~~ee~~~------~~~~~~~~~~~~-----~---~~~~~~~~~~--~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 427 MGYSITE-RE-EMRKWDEEEQAQ------GLGLMGTMFGTD-----P---SGGGNPDNPE--TPEPQPNPAEEAAA 484 (484) T ss_pred CCCChhH-HH-HHHHHHHHHHHH------HHHHHhhhcccc-----c---cCCCCCCCCC--cccccCCCccccCC Confidence 1111110 00 000000010000 000000000000 0 0000000000 00000000000000 No 184 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=21.44 E-value=0.65 Score=21.61 Aligned_cols=116 Identities=11% Similarity=0.095 Sum_probs=39.2 Q ss_pred cCCCCccc--HHHHHHhhhcCccccccCcchhhHHHHHHHHHHHHHHHHHHHHhhhhchhhhhhhcccccCCCCcCCCC- Q lcl|NC_011269. 275 QNDGLDIS--EALISRVVNRPTAWATRGAPHLLRSFRTLMAEESLNAAQDAVADRLYSPLVLATLGIEDMGDGEPWIPD- 351 (867) Q Consensus 275 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 351 (867) |...|+|. +..|....++-..- +...+.||.+-- ...+..+-+|+. -+|+||-|- T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~--------~~~~~~l~~~ig-~~l~~~~~~rF~-------------p~G~~W~pls 58 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAA--------VTDTLPLMRGIA-AELLAETEFAFM-------------DEGPGWPQLS 58 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHH--------hhhHHHHHHHHH-HHHHHHHHHHHh-------------hcCCCCCCCC Confidence 66666654 22221111111000 000111111100 111122223331 356777541 Q ss_pred -----HHH---------HH---HHHHHHHHhhhcc--------hhhhhhhhheeeeeccc----cCccCchh--HH---- Q lcl|NC_011269. 352 -----QGE---------LD---EVRDDMQSLLAAD--------FRLMVHNFGLKVENVFG----RESVPNLD--AD---- 396 (867) Q Consensus 352 -----~~~---------~~---~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~----~~~~~~~~--~~---- 396 (867) .+. |- .+++-+++-.-.| .|--|||||-+.--.+. .-.-|-++ +| T Consensus 59 p~t~~~r~k~g~~~~~~L~~tG~L~~Si~~~~~~~~v~vGtn~~YA~iHqfGg~~~~~~~~~iPARPfLG~s~~~e~~~e 138 (155) T protein:vir:10 59 PVTVAARAAKGRGAHPILQVTNALARSITTRADRDQAQIGSNLSYAAIQQLGGQAGRGRKVTIPARPYLPVLRNGQLKPS 138 (155) T ss_pred ccchHHHHhccCCCCCccccchhhhhhhhceecCCEEEEecCcchhhhhhcccccCCCCccccCCccccCCCccccchHH Confidence 111 11 1333222221222 67789999965421111 01113342 33 Q ss_pred -HHHHHHHHHHhhccch Q lcl|NC_011269. 397 -YDRIERKLLQAWGIGE 412 (867) Q Consensus 397 -~~~~~~~~~~~~~~~~ 412 (867) .+.|.+.|...|-=++ T Consensus 139 i~~~I~~~i~~~l~~~r 155 (155) T protein:vir:10 139 ARDAVLDVLLAALSQGR 155 (155) T ss_pred HHHHHHHHHHHHHhhcC Confidence 2344444444443333 Done!