Query lcl|NC_020866.1_cdsid_YP_007676413.1 [gene=RHVG_00034] [protein=hypothetical protein] [protein_id=YP_007676413.1] [location=17973..19616]
Match_columns 547
No_of_seqs 167 out of 383
Neff 8.2
Searched_HMMs 1612
Date Thu Nov 7 17:25:43 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_34 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_34_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:79233 Length: 526 100.0 6E-158 4E-161 882.6 59.8 514 1-547 1-523 (526)
2 protein:vir:99232 Length: 526 100.0 2E-157 1E-160 879.2 59.9 514 1-547 1-523 (526)
3 protein:vir:103860 Length: 528 100.0 5E-156 3E-159 872.2 59.7 516 1-547 1-525 (528)
4 protein:vir:1986 Length: 512 # 100.0 3E-153 2E-156 856.5 57.5 505 1-547 1-511 (512)
5 protein:vir:107880 Length: 491 100.0 7E-142 5E-145 794.4 56.4 490 1-544 1-491 (491)
6 protein:vir:79063 Length: 491 100.0 1E-139 8E-143 782.1 56.1 490 1-544 1-491 (491)
7 protein:vir:99853 Length: 488 100.0 5E-138 3E-141 773.5 55.7 478 15-547 1-481 (488)
8 protein:vir:95254 Length: 488 100.0 7E-112 4E-115 630.0 40.8 418 24-480 1-488 (488)
9 protein:vir:108215 Length: 469 100.0 4E-110 2E-113 620.5 42.3 423 20-480 1-469 (469)
10 protein:vir:79511 Length: 448 100.0 2E-109 1E-112 616.9 42.2 425 1-468 1-448 (448)
11 protein:vir:77981 Length: 448 100.0 1E-107 8E-111 606.5 42.2 425 1-460 1-448 (448)
12 protein:vir:98816 Length: 446 100.0 6E-106 4E-109 597.4 37.0 384 1-410 3-446 (446)
13 protein:vir:78161 Length: 355 100.0 2.8E-84 1.8E-87 478.6 34.8 328 138-504 1-355 (355)
14 protein:vir:7853 Length: 518 # 100.0 1.5E-28 9.3E-32 173.1 37.7 493 7-541 1-518 (518)
15 protein:vir:101648 Length: 518 100.0 2E-28 1.3E-31 172.4 38.1 494 7-541 1-518 (518)
16 protein:vir:8418 Length: 409 # 99.9 1E-23 6.3E-27 146.6 35.8 393 1-444 1-409 (409)
17 protein:vir:1380 Length: 422 # 99.9 9.9E-24 6.1E-27 146.7 34.4 402 1-438 1-422 (422)
18
protein:vir:102727 Length: 945 99.9 1.3E-22 8.2E-26 140.5 39.7 483 1-547 71-596 (945) 19 protein:vir:1266 Length: 416 # 99.9 2.2E-23 1.3E-26 144.8 35.2 397 6-447 1-416 (416) 20 protein:vir:93610 Length: 454 99.9 7.7E-23 4.8E-26 141.8 37.8 429 7-495 1-454 (454) 21 protein:vir:4454 Length: 414 # 99.9 2.3E-23 1.4E-26 144.7 34.8 394 29-442 1-414 (414) 22 protein:vir:79772 Length: 648 99.9 8.1E-21 5.1E-24 130.7 45.7 530 1-547 1-612 (648) 23 protein:vir:105064 Length: 421 99.9 4.9E-23 3E-26 142.9 33.3 395 1-465 1-421 (421) 24 protein:vir:81152 Length: 411 99.9 5.7E-23 3.6E-26 142.5 33.4 389 1-435 1-411 (411) 25 protein:vir:102080 Length: 429 99.9 1.6E-22 1E-25 140.1 35.1 408 1-448 1-429 (429) 26 protein:vir:3843 Length: 397 # 99.9 1.3E-22 7.9E-26 140.6 34.2 380 1-445 1-397 (397) 27 protein:vir:10362 Length: 432 99.9 2E-22 1.2E-25 139.6 34.4 415 1-458 1-432 (432) 28 protein:vir:189 Length: 424 # 99.9 6.9E-23 4.3E-26 142.1 31.6 401 15-438 1-424 (424) 29 protein:vir:4509 Length: 424 # 99.9 1.9E-22 1.2E-25 139.6 33.4 397 1-448 1-424 (424) 30 protein:vir:483 Length: 413 # 99.9 2.8E-22 1.8E-25 138.7 34.2 394 15-447 1-413 (413) 31 protein:vir:100249 Length: 431 99.9 1.6E-22 1E-25 140.0 32.2 403 1-440 1-431 (431) 32 protein:vir:102855 Length: 432 99.9 4.3E-22 2.6E-25 137.7 34.0 410 1-448 1-432 (432) 33 protein:vir:107605 Length: 432 99.9 4.3E-22 2.6E-25 137.7 34.0 410 1-448 1-432 (432) 34 protein:vir:105002 Length: 432 99.9 4.3E-22 2.6E-25 137.7 34.0 410 1-448 1-432 (432) 35 protein:vir:102118 Length: 409 99.9 5.2E-22 3.2E-25 137.3 34.2 388 1-435 1-409 (409) 36 protein:vir:6240 Length: 457 # 99.9 9.9E-22 6.1E-25 135.7 35.4 429 1-473 1-457 (457) 37 protein:vir:1431 Length: 419 # 99.9 5.5E-22 3.4E-25 137.1 33.4 396 1-457 1-419 (419) 38 protein:vir:1326 Length: 457 # 99.9 1.4E-21 8.5E-25 135.0 35.4 427 1-473 1-457 (457) 39 protein:vir:3153 Length: 467 # 99.9 3.6E-21 2.2E-24 132.7 37.7 393 68-505 1-467 (467) 40 protein:vir:4337 Length: 434 # 99.9 6.1E-22 3.8E-25 136.9 33.0 413 1-472 
1-434 (434) 41 protein:vir:97060 Length: 432 99.9 1.2E-21 7.6E-25 135.2 34.3 415 1-458 1-432 (432) 42 protein:vir:5737 Length: 419 # 99.9 1.3E-21 7.8E-25 135.2 34.0 400 1-454 1-419 (419) 43 protein:vir:1884 Length: 424 # 99.9 4.9E-22 3E-25 137.4 31.5 401 15-441 1-424 (424) 44 protein:vir:81072 Length: 432 99.9 1.9E-21 1.2E-24 134.2 34.2 412 1-458 1-432 (432) 45 protein:vir:98396 Length: 441 99.9 3.6E-21 2.2E-24 132.7 34.2 413 1-447 1-441 (441) 46 protein:vir:100150 Length: 437 99.9 5.9E-21 3.7E-24 131.5 34.6 416 1-473 1-437 (437) 47 protein:vir:96980 Length: 409 99.9 9.2E-21 5.7E-24 130.4 35.1 394 1-440 1-409 (409) 48 protein:vir:9408 Length: 441 # 99.9 7.5E-21 4.7E-24 130.9 34.5 413 1-448 1-441 (441) 49 protein:vir:79984 Length: 441 99.9 7.5E-21 4.7E-24 130.9 34.5 413 1-448 1-441 (441) 50 protein:vir:81095 Length: 416 99.9 7.8E-21 4.8E-24 130.8 34.4 394 1-449 1-416 (416) 51 protein:vir:4598 Length: 416 # 99.9 7.8E-21 4.8E-24 130.8 34.4 394 1-449 1-416 (416) 52 protein:vir:63755 Length: 547 99.9 5.9E-20 3.6E-23 126.0 39.2 445 1-487 1-547 (547) 53 protein:vir:80333 Length: 419 99.9 3.8E-21 2.4E-24 132.5 31.8 397 1-483 1-419 (419) 54 protein:vir:960 Length: 413 # 99.9 2.2E-20 1.3E-23 128.4 35.5 395 1-435 4-413 (413) 55 protein:vir:94666 Length: 723 99.9 9.7E-20 6E-23 124.8 38.4 481 29-547 1-572 (723) 56 protein:vir:80644 Length: 551 99.9 5.5E-20 3.4E-23 126.2 37.0 446 1-487 5-551 (551) 57 protein:vir:101647 Length: 460 99.9 1E-19 6.2E-23 124.8 38.0 400 1-449 1-460 (460) 58 protein:vir:94426 Length: 409 99.8 1E-19 6.4E-23 124.7 34.9 394 1-440 1-409 (409) 59 protein:vir:93943 Length: 409 99.8 2E-19 1.2E-22 123.1 35.1 395 1-440 1-409 (409) 60 protein:vir:2683 Length: 412 # 99.8 2.4E-19 1.5E-22 122.7 35.0 396 1-440 1-412 (412) 61 protein:vir:9359 Length: 348 # 99.8 8.3E-20 5.1E-23 125.2 31.3 331 89-440 1-348 (348) 62 protein:vir:81218 Length: 423 99.8 1.6E-19 9.9E-23 123.6 32.8 393 1-436 1-423 (423) 63 protein:vir:95378 Length: 406 99.8 7.4E-19 4.6E-22 120.0 36.0 390 
1-450 1-406 (406) 64 protein:vir:99312 Length: 563 99.8 1.4E-18 8.9E-22 118.4 36.5 452 1-508 1-563 (563) 65 protein:vir:95599 Length: 563 99.8 1.4E-18 8.9E-22 118.4 36.5 452 1-508 1-563 (563) 66 protein:vir:9702 Length: 406 # 99.8 6.5E-19 4E-22 120.3 34.1 390 1-452 1-406 (406) 67 protein:vir:3868 Length: 417 # 99.8 4.7E-19 2.9E-22 121.1 32.8 393 1-455 1-417 (417) 68 protein:vir:99452 Length: 651 99.8 4.5E-18 2.8E-21 115.7 37.0 448 1-547 1-586 (651) 69 protein:vir:80796 Length: 574 99.8 1.3E-17 8.2E-21 113.1 38.7 474 1-505 27-574 (574) 70 protein:vir:4952 Length: 386 # 99.8 1.6E-18 9.9E-22 118.2 32.3 372 1-450 1-386 (386) 71 protein:vir:100187 Length: 385 99.8 1.1E-17 6.9E-21 113.5 35.5 366 1-436 1-385 (385) 72 protein:vir:7407 Length: 392 # 99.8 3.9E-18 2.4E-21 116.0 32.1 374 1-439 1-392 (392) 73 protein:vir:80134 Length: 403 99.8 1.4E-17 8.8E-21 112.9 34.5 383 1-450 1-403 (403) 74 protein:vir:4854 Length: 386 # 99.8 2.9E-18 1.8E-21 116.7 30.5 376 1-439 1-386 (386) 75 protein:vir:3989 Length: 392 # 99.8 6.2E-18 3.9E-21 114.9 32.1 374 1-439 1-392 (392) 76 protein:vir:1023 Length: 392 # 99.8 6.2E-18 3.9E-21 114.9 32.1 374 1-439 1-392 (392) 77 protein:vir:96579 Length: 576 99.8 2.3E-17 1.4E-20 111.9 34.7 471 1-512 27-576 (576) 78 protein:vir:100691 Length: 535 99.8 2E-16 1.2E-19 106.7 39.6 452 1-476 1-535 (535) 79 protein:vir:6210 Length: 394 # 99.8 4.1E-17 2.5E-20 110.4 34.3 376 1-450 1-394 (394) 80 protein:vir:8317 Length: 409 # 99.7 2.6E-18 1.6E-21 117.0 26.6 376 1-422 1-409 (409) 81 protein:vir:100882 Length: 383 99.7 6.9E-17 4.3E-20 109.2 34.3 364 1-436 1-383 (383) 82 protein:vir:4194 Length: 540 # 99.7 4.8E-16 3E-19 104.5 38.8 475 1-544 1-540 (540) 83 protein:vir:4156 Length: 542 # 99.7 8.1E-16 5E-19 103.3 39.2 481 1-544 1-542 (542) 84 protein:vir:104259 Length: 403 99.7 1.5E-16 9.3E-20 107.3 35.2 374 1-443 1-403 (403) 85 protein:vir:8100 Length: 466 # 99.7 1.1E-16 7E-20 108.0 32.3 424 1-448 1-466 (466) 86 protein:vir:9507 Length: 395 # 99.7 8.4E-16 5.2E-19 
103.2 33.5 380 29-450 1-395 (395) 87 protein:vir:101289 Length: 395 99.7 8.4E-16 5.2E-19 103.2 33.5 380 29-450 1-395 (395) 88 protein:vir:100650 Length: 395 99.7 8.4E-16 5.2E-19 103.2 33.5 380 29-450 1-395 (395) 89 protein:vir:4995 Length: 384 # 99.7 6.2E-17 3.9E-20 109.4 26.5 364 1-412 1-384 (384) 90 protein:vir:94049 Length: 532 99.7 4.7E-15 2.9E-18 99.1 36.7 447 1-472 1-532 (532) 91 protein:vir:5249 Length: 437 # 99.7 7.6E-15 4.7E-18 98.0 37.7 393 1-443 1-437 (437) 92 protein:vir:4089 Length: 395 # 99.7 1.4E-15 8.5E-19 102.1 32.4 374 1-448 1-395 (395) 93 protein:vir:4828 Length: 382 # 99.7 7.6E-16 4.7E-19 103.5 29.9 365 1-437 1-382 (382) 94 protein:vir:107742 Length: 537 99.7 2.5E-14 1.6E-17 95.1 36.8 444 1-472 25-537 (537) 95 protein:vir:78310 Length: 376 99.6 6.6E-15 4.1E-18 98.3 32.0 361 8-433 1-376 (376) 96 protein:vir:95965 Length: 385 99.6 1.1E-14 6.7E-18 97.2 32.6 362 29-436 1-385 (385) 97 protein:vir:1082 Length: 359 # 99.6 6.1E-15 3.8E-18 98.5 30.9 338 1-413 1-359 (359) 98 protein:vir:9641 Length: 395 # 99.6 2.8E-14 1.8E-17 94.9 30.1 373 29-435 1-395 (395) 99 protein:vir:80040 Length: 461 99.5 3.3E-12 2E-15 83.6 35.0 398 1-433 1-461 (461) 100 protein:vir:389 Length: 530 # 99.5 3.8E-13 2.3E-16 88.7 29.4 432 2-452 1-530 (530) 101 protein:vir:98643 Length: 395 99.5 5.1E-13 3.2E-16 88.0 30.1 372 29-448 1-395 (395) 102 protein:vir:78641 Length: 278 99.5 2.3E-13 1.4E-16 89.8 27.3 268 89-372 1-278 (278) 103 protein:vir:96068 Length: 765 99.5 1E-11 6.5E-15 80.8 34.0 512 1-547 37-618 (765) 104 protein:vir:94002 Length: 378 99.4 6.4E-13 3.9E-16 87.4 27.1 355 8-444 1-378 (378) 105 protein:vir:93867 Length: 378 99.4 2.9E-12 1.8E-15 83.8 29.5 355 29-444 1-378 (378) 106 protein:vir:6382 Length: 553 # 99.4 4.4E-12 2.7E-15 82.9 29.9 432 7-472 1-553 (553) 107 protein:vir:1661 Length: 378 # 99.4 3.7E-12 2.3E-15 83.3 28.7 355 8-444 1-378 (378) 108 protein:vir:3420 Length: 533 # 99.4 4.9E-12 3E-15 82.6 29.2 431 1-457 1-533 (533) 109 protein:vir:79538 Length: 502 99.4 6E-12 
3.7E-15 82.1 28.1 430 1-472 1-502 (502) 110 protein:vir:858 Length: 378 # 99.4 1.4E-11 8.9E-15 80.0 29.6 355 8-444 1-378 (378) 111 protein:vir:99563 Length: 862 99.4 6.7E-11 4.2E-14 76.4 32.5 515 1-547 39-668 (862) 112 protein:vir:104338 Length: 422 99.3 1.6E-10 1E-13 74.3 29.7 377 7-447 1-422 (422) 113 protein:vir:79647 Length: 435 99.2 2.9E-10 1.8E-13 72.9 30.0 385 1-435 5-435 (435) 114 protein:vir:94869 Length: 378 99.2 1.6E-10 1E-13 74.3 27.9 355 29-444 1-378 (378) 115 protein:vir:94101 Length: 474 99.2 5.3E-10 3.3E-13 71.4 33.6 405 1-446 1-474 (474) 116 protein:vir:105889 Length: 474 99.2 5.3E-10 3.3E-13 71.4 33.6 405 1-446 1-474 (474) 117 protein:vir:107662 Length: 427 99.2 4.9E-10 3E-13 71.6 29.6 379 1-450 1-427 (427) 118 protein:vir:93747 Length: 472 99.2 6.1E-10 3.8E-13 71.1 32.5 409 1-447 1-472 (472) 119 protein:vir:4698 Length: 251 # 99.2 1.4E-11 8.5E-15 80.1 20.7 240 1-274 1-251 (251) 120 protein:vir:1150 Length: 350 # 99.2 3.8E-10 2.3E-13 72.3 28.1 331 1-375 1-350 (350) 121 protein:vir:1236 Length: 483 # 99.2 7.7E-10 4.8E-13 70.5 34.9 408 1-472 1-483 (483) 122 protein:vir:79207 Length: 351 99.2 2.9E-10 1.8E-13 72.9 27.0 333 1-377 1-351 (351) 123 protein:vir:96738 Length: 505 99.2 5.2E-10 3.2E-13 71.5 28.2 432 1-451 1-505 (505) 124 protein:vir:103971 Length: 376 99.2 2.2E-10 1.4E-13 73.5 26.0 339 1-377 1-376 (376) 125 protein:vir:95542 Length: 548 99.1 6.3E-10 3.9E-13 71.0 26.9 450 1-473 1-548 (548) 126 protein:vir:5691 Length: 344 # 99.1 1.6E-10 1E-13 74.2 23.5 328 1-375 1-344 (344) 127 protein:vir:10321 Length: 495 99.1 1.6E-09 1E-12 68.8 30.6 423 1-449 1-495 (495) 128 protein:vir:102602 Length: 456 99.1 1.1E-09 6.9E-13 69.7 27.6 377 41-440 1-456 (456) 129 protein:vir:105819 Length: 456 99.1 1.1E-09 6.9E-13 69.7 27.6 377 41-440 1-456 (456) 130 protein:vir:78191 Length: 351 99.1 1.5E-09 9.5E-13 68.9 26.2 335 1-379 1-351 (351) 131 protein:vir:98567 Length: 340 99.1 1.6E-09 9.9E-13 68.8 26.0 324 1-374 1-340 (340) 132 protein:vir:6058 Length: 344 # 99.0 
3.4E-09 2.1E-12 67.0 27.6 325 1-375 1-344 (344) 133 protein:vir:2013 Length: 344 # 99.0 1.1E-09 6.6E-13 69.8 24.4 328 1-375 1-344 (344) 134 protein:vir:267 Length: 348 # 99.0 4.8E-09 3E-12 66.2 28.8 329 1-382 1-348 (348) 135 protein:vir:99072 Length: 479 99.0 2.6E-09 1.6E-12 67.7 25.8 418 1-457 1-479 (479) 136 protein:vir:4898 Length: 502 # 99.0 6.7E-09 4.2E-12 65.4 30.5 424 1-453 1-502 (502) 137 protein:vir:78749 Length: 337 99.0 4.1E-09 2.5E-12 66.6 26.1 318 1-375 1-337 (337) 138 protein:vir:99916 Length: 504 99.0 6.9E-09 4.3E-12 65.3 29.6 414 1-470 1-504 (504) 139 protein:vir:7987 Length: 456 # 99.0 5.2E-09 3.2E-12 66.0 25.7 393 15-440 1-456 (456) 140 protein:vir:100328 Length: 346 99.0 9.4E-09 5.9E-12 64.6 27.5 328 1-377 1-346 (346) 141 protein:vir:79150 Length: 368 98.9 1.1E-08 6.6E-12 64.3 26.6 346 1-386 1-368 (368) 142 protein:vir:97336 Length: 492 98.9 1.4E-08 8.6E-12 63.7 34.9 412 1-472 1-492 (492) 143 protein:vir:3743 Length: 345 # 98.9 1.2E-08 7.5E-12 64.0 25.9 317 1-377 1-345 (345) 144 protein:vir:98444 Length: 434 98.9 1.5E-08 9.3E-12 63.5 25.7 365 39-447 1-434 (434) 145 protein:vir:5961 Length: 503 # 98.9 1.9E-08 1.2E-11 62.9 33.2 421 1-459 1-503 (503) 146 protein:vir:95113 Length: 474 98.9 2.3E-08 1.4E-11 62.5 35.6 415 1-447 1-474 (474) 147 protein:vir:3780 Length: 345 # 98.8 1.3E-08 8.1E-12 63.8 23.7 325 1-377 1-345 (345) 148 protein:vir:97447 Length: 474 98.8 3E-08 1.9E-11 61.8 35.5 413 1-446 1-474 (474) 149 protein:vir:94498 Length: 474 98.8 3E-08 1.9E-11 61.8 35.5 413 1-446 1-474 (474) 150 protein:vir:2732 Length: 501 # 98.8 4.2E-08 2.6E-11 61.0 31.0 429 1-475 1-501 (501) 151 protein:vir:99522 Length: 470 98.8 4.2E-08 2.6E-11 61.0 31.6 402 1-472 1-470 (470) 152 protein:vir:96366 Length: 511 98.8 4.7E-08 2.9E-11 60.8 31.7 427 1-450 1-511 (511) 153 protein:vir:78805 Length: 511 98.8 4.7E-08 2.9E-11 60.8 31.7 427 1-450 1-511 (511) 154 protein:vir:94805 Length: 492 98.8 4.8E-08 3E-11 60.7 34.5 404 1-448 10-492 (492) 155 protein:vir:95806 Length: 440 
98.8 5.1E-08 3.2E-11 60.5 30.3 393 19-441 1-440 (440) 156 protein:vir:96266 Length: 474 98.8 5.4E-08 3.3E-11 60.4 31.0 412 1-446 1-474 (474) 157 protein:vir:95899 Length: 474 98.8 5.4E-08 3.3E-11 60.4 31.0 412 1-446 1-474 (474) 158 protein:vir:105292 Length: 478 98.8 5.6E-08 3.5E-11 60.3 33.7 409 1-450 1-478 (478) 159 protein:vir:106639 Length: 481 98.8 6E-08 3.7E-11 60.2 33.5 417 1-454 6-481 (481) 160 protein:vir:105782 Length: 449 98.7 8.4E-08 5.2E-11 59.4 34.1 395 1-438 1-449 (449) 161 protein:vir:9871 Length: 429 # 98.7 8.4E-08 5.2E-11 59.4 30.6 382 15-472 1-429 (429) 162 protein:vir:99781 Length: 511 98.7 8.5E-08 5.3E-11 59.4 30.6 420 1-450 1-511 (511) 163 protein:vir:103951 Length: 511 98.7 9.2E-08 5.7E-11 59.2 32.8 422 1-450 1-511 (511) 164 protein:vir:94956 Length: 452 98.7 1.1E-07 6.6E-11 58.8 30.6 381 1-441 1-452 (452) 165 protein:vir:9306 Length: 511 # 98.6 1.5E-07 9.5E-11 58.0 32.0 420 1-450 1-511 (511) 166 protein:vir:79043 Length: 479 98.6 1.8E-07 1.1E-10 57.6 33.0 403 1-440 1-479 (479) 167 protein:vir:80959 Length: 499 98.6 1.8E-07 1.1E-10 57.5 28.9 398 1-438 1-499 (499) 168 protein:vir:96240 Length: 511 98.6 2.1E-07 1.3E-10 57.2 32.7 422 1-450 1-511 (511) 169 protein:vir:97171 Length: 512 98.6 2.6E-07 1.6E-10 56.7 30.2 427 1-450 1-512 (512) 170 protein:vir:2341 Length: 488 # 98.6 2.7E-07 1.7E-10 56.6 29.4 419 1-449 1-488 (488) 171 protein:vir:78227 Length: 480 98.6 2.8E-07 1.7E-10 56.5 29.4 410 20-460 1-480 (480) 172 protein:vir:38 Length: 496 # N 98.6 3E-07 1.9E-10 56.4 30.1 401 1-439 1-496 (496) 173 protein:vir:106571 Length: 499 98.5 3.5E-07 2.2E-10 56.0 31.8 422 1-470 1-499 (499) 174 protein:vir:7768 Length: 484 # 98.5 4E-07 2.5E-10 55.6 25.7 413 1-457 1-484 (484) 175 protein:vir:104082 Length: 485 98.5 4.1E-07 2.6E-10 55.6 25.4 417 1-457 1-485 (485) 176 protein:vir:3964 Length: 453 # 98.5 5.5E-07 3.4E-10 54.9 31.1 401 1-444 1-453 (453) 177 protein:vir:102950 Length: 471 98.4 6.6E-07 4.1E-10 54.5 35.1 385 15-441 1-471 (471) 178 
protein:vir:107112 Length: 478 98.4 6.8E-07 4.2E-10 54.4 34.4 414 1-446 1-478 (478) 179 protein:vir:96494 Length: 501 98.4 7.5E-07 4.7E-10 54.2 31.4 431 1-475 1-501 (501) 180 protein:vir:2500 Length: 501 # 98.4 8.3E-07 5.1E-10 53.9 25.1 418 1-457 1-501 (501) 181 protein:vir:98853 Length: 219 98.3 4.6E-07 2.9E-10 55.3 19.3 201 163-376 1-219 (219) 182 protein:vir:78083 Length: 537 98.3 1.2E-06 7.3E-10 53.1 37.9 423 15-469 1-537 (537) 183 protein:vir:2427 Length: 485 # 98.3 1.2E-06 7.8E-10 52.9 26.8 419 1-452 1-485 (485) 184 protein:vir:96839 Length: 474 98.3 1.3E-06 7.9E-10 52.9 33.8 405 1-447 1-474 (474) 185 protein:vir:733 Length: 453 # 98.3 1.7E-06 1E-09 52.2 29.6 404 1-477 3-453 (453) 186 protein:vir:96179 Length: 468 98.2 2.2E-06 1.4E-09 51.6 35.5 401 1-437 1-468 (468) 187 protein:vir:78537 Length: 480 98.2 2.3E-06 1.4E-09 51.5 30.3 401 20-458 1-480 (480) 188 protein:vir:3609 Length: 452 # 98.2 2.6E-06 1.6E-09 51.2 33.2 400 1-473 1-452 (452) 189 protein:vir:9922 Length: 489 # 98.2 2.7E-06 1.7E-09 51.1 28.5 410 1-444 1-489 (489) 190 protein:vir:102330 Length: 451 98.2 3.3E-06 2E-09 50.6 31.8 362 44-471 1-451 (451) 191 protein:vir:4223 Length: 486 # 98.0 6.4E-06 4E-09 49.1 23.3 415 1-470 1-486 (486) 192 protein:vir:80680 Length: 441 98.0 7.8E-06 4.9E-09 48.6 25.9 391 8-473 1-441 (441) 193 protein:vir:95149 Length: 501 98.0 7.9E-06 4.9E-09 48.6 31.3 398 1-461 1-501 (501) 194 protein:vir:94546 Length: 506 98.0 8E-06 4.9E-09 48.5 30.4 415 1-475 1-506 (506) 195 protein:vir:79703 Length: 505 98.0 9.4E-06 5.8E-09 48.1 28.6 399 1-433 1-505 (505) 196 protein:vir:94742 Length: 409 97.9 1E-05 6.2E-09 48.0 30.6 354 15-407 1-409 (409) 197 protein:vir:9568 Length: 410 # 97.9 1E-05 6.4E-09 47.9 26.2 354 20-421 1-410 (410) 198 protein:vir:105461 Length: 470 97.9 1.1E-05 7.1E-09 47.7 32.9 381 15-472 1-470 (470) 199 protein:vir:102239 Length: 527 97.9 1.2E-05 7.7E-09 47.5 21.7 419 1-474 1-527 (527) 200 protein:vir:101494 Length: 527 97.9 1.3E-05 7.8E-09 47.5 21.7 419 1-474 1-527 
(527) 201 protein:vir:1587 Length: 508 # 97.8 2E-05 1.2E-08 46.4 29.3 404 1-437 1-508 (508) 202 protein:vir:1634 Length: 409 # 97.7 2.4E-05 1.5E-08 45.9 32.4 355 15-407 1-409 (409) 203 protein:vir:97265 Length: 513 97.6 3.3E-05 2.1E-08 45.1 27.1 421 1-473 1-513 (513) 204 protein:vir:8184 Length: 474 # 97.6 3.4E-05 2.1E-08 45.1 28.9 411 1-439 1-474 (474) 205 protein:vir:105154 Length: 525 97.6 3.7E-05 2.3E-08 44.9 24.5 433 1-469 1-525 (525) 206 protein:vir:80453 Length: 535 97.6 4E-05 2.5E-08 44.7 26.7 425 1-451 1-535 (535) 207 protein:vir:4782 Length: 522 # 97.5 5.7E-05 3.5E-08 43.9 29.5 419 5-445 1-522 (522) 208 protein:vir:5839 Length: 533 # 97.4 6.8E-05 4.2E-08 43.4 24.4 438 1-473 20-533 (533) 209 protein:vir:78907 Length: 518 97.3 0.0001 6.2E-08 42.5 28.8 396 22-437 1-518 (518) 210 protein:vir:9751 Length: 422 # 97.3 0.0001 6.3E-08 42.5 28.8 365 15-420 1-422 (422) 211 protein:vir:95014 Length: 491 97.0 0.00023 1.4E-07 40.5 27.0 405 7-445 1-491 (491) 212 protein:vir:9815 Length: 500 # 96.6 0.00047 2.9E-07 38.8 27.7 402 1-443 1-500 (500) 213 protein:vir:3028 Length: 500 # 96.6 0.00047 2.9E-07 38.8 27.7 402 1-443 1-500 (500) 214 protein:vir:78393 Length: 489 96.5 0.00059 3.7E-07 38.3 27.7 406 7-437 1-489 (489) 215 protein:vir:98883 Length: 517 96.2 0.00093 5.7E-07 37.2 30.9 409 1-450 1-517 (517) 216 protein:vir:7430 Length: 563 # 96.2 0.00094 5.8E-07 37.2 25.5 438 1-473 1-563 (563) 217 protein:vir:104500 Length: 537 95.4 0.0022 1.3E-06 35.2 25.6 433 1-474 1-537 (537) 218 protein:vir:80165 Length: 651 94.9 0.0032 2E-06 34.2 39.6 462 1-527 3-651 (651) 219 protein:vir:103219 Length: 201 94.8 0.0036 2.2E-06 34.0 14.8 182 238-443 1-201 (201) 220 protein:vir:101541 Length: 694 94.1 0.0053 3.3E-06 33.1 26.0 514 1-547 1-655 (694) 221 protein:vir:78589 Length: 695 94.1 0.0053 3.3E-06 33.1 27.2 515 1-542 46-695 (695) 222 protein:vir:106282 Length: 521 93.9 0.0059 3.7E-06 32.8 18.9 419 1-451 1-521 (521) 223 protein:vir:103177 Length: 533 92.4 0.012 7.1E-06 31.2 24.6 433 
1-454 1-533 (533) 224 protein:vir:106716 Length: 698 92.3 0.012 7.6E-06 31.1 27.7 515 1-545 46-698 (698) 225 protein:vir:572 Length: 506 # 89.7 0.025 1.5E-05 29.4 15.1 425 7-474 1-506 (506) 226 protein:vir:1538 Length: 535 # 86.8 0.043 2.6E-05 28.1 25.3 419 1-486 1-535 (535) 227 protein:vir:94572 Length: 535 84.2 0.062 3.8E-05 27.2 22.9 413 1-492 1-535 (535) 228 protein:vir:3648 Length: 695 # 82.1 0.079 4.9E-05 26.6 31.9 516 1-542 46-695 (695) 229 protein:vir:96783 Length: 488 80.7 0.092 5.7E-05 26.3 25.0 395 1-428 1-488 (488) 230 protein:vir:3361 Length: 535 # 78.7 0.11 6.9E-05 25.8 25.2 421 1-486 1-535 (535) 231 protein:vir:107517 Length: 639 75.2 0.15 9.2E-05 25.1 25.4 506 1-547 1-636 (639) 232 protein:vir:97900 Length: 639 75.2 0.15 9.2E-05 25.1 25.4 506 1-547 1-636 (639) 233 protein:vir:101418 Length: 569 68.1 0.24 0.00015 24.0 23.4 412 1-470 33-569 (569) 234 protein:vir:8883 Length: 543 # 67.6 0.25 0.00015 23.9 24.3 425 1-468 1-543 (543) 235 protein:vir:106999 Length: 564 66.0 0.27 0.00017 23.7 24.0 440 1-475 1-564 (564) 236 protein:vir:100039 Length: 522 63.2 0.32 0.0002 23.3 22.6 411 15-487 1-522 (522) 237 protein:vir:6896 Length: 523 # 46.6 0.73 0.00045 21.3 22.5 421 1-451 1-523 (523) 238 protein:vir:99672 Length: 532 42.9 0.87 0.00054 20.9 22.1 414 1-472 1-532 (532) 239 protein:vir:93630 Length: 776 42.8 0.87 0.00054 20.9 30.9 465 1-547 22-677 (776) 240 protein:vir:94709 Length: 522 40.7 0.96 0.0006 20.7 33.0 413 1-512 1-522 (522) 241 protein:vir:108049 Length: 524 40.7 0.96 0.0006 20.7 21.0 424 1-451 1-524 (524) 242 protein:vir:102668 Length: 547 38.1 1.1 0.00067 20.4 23.5 414 15-455 1-547 (547) 243 protein:vir:94599 Length: 641 37.4 1.1 0.00069 20.3 35.0 450 1-525 5-641 (641) 244 protein:vir:101189 Length: 516 27.6 1.8 0.0011 19.1 21.5 408 1-451 3-516 (516) 245 protein:vir:101806 Length: 516 27.6 1.8 0.0011 19.1 21.5 408 1-451 3-516 (516) 246 protein:vir:103765 Length: 549 25.7 2 0.0013 18.9 22.3 413 1-468 1-549 (549) 247 protein:vir:104892 Length: 558 
23.5 2.3 0.0014 18.6 24.6 453 1-496 1-558 (558) No 1 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=100.00 E-value=5.7e-158 Score=882.65 Aligned_cols=514 Identities=31% Similarity=0.476 Sum_probs=467.6 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |++ |+|++||||+...|+++ +++++++++++++.||++||||+||++|||+|++||+.+|++||++|+++|+||++ T Consensus 1 ~~~---~~d~~g~p~~~~~~~~~-~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s 76 (526) T protein:vir:79 1 MAQ---IVDVYGNPIRPQQLREP-QTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFA 76 (526) T ss_pred CCe---eeCCCCCccCccccchh-hhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHH Confidence 886 99999999999999776 67899999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCceEecCC-CCHHHHHHHHHHHHHHhCC-CHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeEee Q lcl|NC_020866. 
81 VIGTRKRSVSQLDITVEPAS-DDPQDVKIADMVRDWLKRE-ELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLIWR 158 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~-~~~~~~~~ae~v~~~l~~~-~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~ 158 (547) +|++|+++|++++|+|+|++ ++++++++|++|+++|+++ +|++++++|++|++|||||+|++|..++|.|.|+++.+| T Consensus 77 ~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~ldA~~~G~s~~Ei~w~~~~g~~~~~~l~~r 156 (526) T protein:vir:79 77 EMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDALDGIGHGYSCIELEWALQGREWMPLAFHHR 156 (526) T ss_pred HHHHHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHHHhhhhhcceeEEEEEeecCCceeEEEeeee Confidence 99999999999999999985 5688999999999999885 799999999999999999999999999999999999999 Q ss_pred ccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEE Q lcl|NC_020866. 159 DPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRI 238 (547) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~v 238 (547) ||+||.|+.+++.+++++++...|+++|++|||+|+|+.++++|||.||+|.|||+|+||++++++|+.|+|+||+|++| T Consensus 157 ~~~~F~~~~~~~~~l~~~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~i 236 (526) T protein:vir:79 157 PQSWFQLNPEDQNELRLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRL 236 (526) T ss_pred cccceEeccCCCcEEEecCCCCCceeecCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEE Confidence 99999999999888888888899999999999999999999999999999999999999999999999999999999999 Q ss_pred EeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccC---ccc Q lcl|NC_020866. 
239 GKWQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTD---AVA 315 (547) Q Consensus 239 gk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~---~~g 315 (547) ||||++++++++++|+++|++|++++++|||.|++|||+++++ ++.++|++|++|||++|||+||||||||+ +++ T Consensus 237 gky~~~a~~~ek~~L~~av~~i~~da~~iiP~~~~ie~~ea~~--~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~ 314 (526) T protein:vir:79 237 GKYPPGTADEEKATLLRAVTGLGHAAAGIIPETMAIDFQQAAQ--GSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGG 314 (526) T ss_pred EecCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEeecCC--CCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcc Confidence 9999999999999999999999999999999999999999764 34568999999999999999999999985 356 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---ccccEEEecCccccCHHHHHHHHHHHHhCCCCC Q lcl|NC_020866. 316 GGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQ---KRYPRIKIARPKTEDLTKLASSLDTLVRLGMEI 392 (547) Q Consensus 316 gs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~---~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v 392 (547) ||+|+|+||++|+++++++|+++|+++||++||++++.+||+.. .++|+|+|+..+++|++++++++++|+++|++| T Consensus 315 gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i 394 (526) T protein:vir:79 315 GAFALGQVHNEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEI 394 (526) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCcC Confidence 89999999999999999999999999999999999999999843 468999999999999999999999999999999 Q ss_pred CHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchh Q lcl|NC_020866. 393 EENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGA 472 (547) Q Consensus 393 ~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (547) +.+|++++||||+|+++|+++.+..++++++..++......... 
.......++ T Consensus 395 ~~~~i~e~~gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~---------------------------~~~~~~~~~ 447 (526) T protein:vir:79 395 PSAWVYDKLGIPQPAKNEPVLRPAAQPAILSRQHGQRVAALATI---------------------------VGPRYGDQQ 447 (526) T ss_pred CHHHHHHHhCCCCCCCchhhccccCCcccccccccccccccccc---------------------------ccccCchhh Confidence 99999999999999999999877665554433322211111000 001112345 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHhHHhhccC Q lcl|NC_020866. 473 EIAALT-DQLELEGQPEIVAMIEEIRTMLEAASSLGEFSDMLDARYGEIGASQLASRISAAMLASEFAGREGALDG 547 (547) Q Consensus 473 ~~~~l~-~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~~~~~~l~~~l~~al~~A~~~G~~~~~~g 547 (547) .++.+. +.....+++.+++++++|+++|++|+||+|++++|.++||+|+.++|+++|++||++|+++||+++++= T Consensus 448 ~~d~~l~~~~~~~~~~~~~~~~~~i~~~~~~~~s~ee~~~~L~~l~~~ld~~~l~~~l~~a~~~A~l~Gr~~~~~e 523 (526) T protein:vir:79 448 ALDKALADLPAKDMQNQANDLLAPLLDAVNRGDSETELLGALAEAFPDMDDSALTDALHRLLFAADTWGRLHGNLD 523 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHhhhhhhhhc Confidence 565544 445667899999999999999999999999999999999999999999999999999999999999776 No 2 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=100.00 E-value=2.4e-157 Score=879.22 Aligned_cols=514 Identities=30% Similarity=0.474 Sum_probs=467.6 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |++ |+|++||||+++.|+++++ ++++++++++++||+++|||+||++|||+|++||+.+|.+||++|+++|+||++ T Consensus 1 ~~~---~~d~~g~p~~~~~~~~~~~-~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s 76 (526) T protein:vir:99 1 MAQ---IVDVYGNPIRTQQLREPQT-SRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFA 76 (526) T ss_pred CCe---eECCCCCccccccccchhh-hhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHH Confidence 876 9999999999999998875 789999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCceEecCC-CCHHHHHHHHHHHHHHhCC-CHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeEee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPAS-DDPQDVKIADMVRDWLKRE-ELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLIWR 158 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~-~~~~~~~~ae~v~~~l~~~-~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~ 158 (547) ++++|+++|++++|.|+|++ ++++++++|++|+++|+++ +|++++++|++|++|||||+|++|..++|.|.|+++.+| T Consensus 77 ~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~lda~~~G~s~~Eivw~~~~g~~~~~~l~~r 156 (526) T protein:vir:99 77 EMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDALDGIGHGYSCIELEWALQGREWMPLAFHHR 156 (526) T ss_pred HHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHHHhhhhcceeEEEEEeecCCceeEEEeeee Confidence 99999999999999999985 5688999999999999885 799999999999999999999999999999999999999 Q ss_pred ccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEE Q lcl|NC_020866. 
159 DPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRI 238 (547) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~v 238 (547) ||+||.|+.+++.+++++++...|+++|++|||+|+|+.++++|||.||++.|||+|+||++++++|+.|+|+||+|++| T Consensus 157 ~~~~f~~~~~~~~~l~~~~~~~~g~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~i 236 (526) T protein:vir:99 157 PQSWFQLNPEDQNELRLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRL 236 (526) T ss_pred cccceeeccCCCcEEEecCCCCCceeecCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEE Confidence 99999999999988888888899999999999999999999999999999999999999999999999999999999999 Q ss_pred EeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccC---ccc Q lcl|NC_020866. 239 GKWQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTD---AVA 315 (547) Q Consensus 239 gk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~---~~g 315 (547) |||+++++++++++|+++|++|++++++|||.|++|||+++++ ++.++|++|++|||++|||+||||||||+ +++ T Consensus 237 gky~~~a~~~ek~~L~~av~~i~~d~~~iiP~~~~ie~~ea~~--~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~ 314 (526) T protein:vir:99 237 GKYPPGTADEEKATLLRAVTGLGHAAAGIIPETMAIDFQQAAQ--GSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGG 314 (526) T ss_pred EecCCCCCHHHHHHHHHHHHHHhhCcEEEecCCceeEEeecCC--CCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcc Confidence 9999999999999999999999999999999999999999764 44568999999999999999999999985 356 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---ccccEEEecCccccCHHHHHHHHHHHHhCCCCC Q lcl|NC_020866. 316 GGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQ---KRYPRIKIARPKTEDLTKLASSLDTLVRLGMEI 392 (547) Q Consensus 316 gs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~---~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v 392 (547) ||+|+|+||++|+++++++|+++|+++||++||++++.+||+.. 
..+|+|+|+..+++|++++++++++|+++|++| T Consensus 315 gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i 394 (526) T protein:vir:99 315 GAFALGQVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEI 394 (526) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCcc Confidence 89999999999999999999999999999999999999999843 468999999999999999999999999999999 Q ss_pred CHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchh Q lcl|NC_020866. 393 EENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGA 472 (547) Q Consensus 393 ~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (547) +.+|++++||||+|+++|+++.+..+++.+...++......... .......++ T Consensus 395 ~~~~i~e~~Gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~---------------------------~~~~~~~~~ 447 (526) T protein:vir:99 395 PSAWVYDKLGIPQPAKNEPVLRSAAQPAILSRQHGQRVAALATI---------------------------VGPRYGDQQ 447 (526) T ss_pred CHHHHHHHhCCCCCCCcccccCCCCCCccccccccccccccccc---------------------------ccccCcchh Confidence 99999999999999999999887665544433222111111000 000111233 Q ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHhHHhhccC Q lcl|NC_020866. 473 EIAA-LTDQLELEGQPEIVAMIEEIRTMLEAASSLGEFSDMLDARYGEIGASQLASRISAAMLASEFAGREGALDG 547 (547) Q Consensus 473 ~~~~-l~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~~~~~~l~~~l~~al~~A~~~G~~~~~~g 547 (547) .++. 
+.+.+...+++++++++++|+++|++|+||+|++++|.++||+|+.++|+++|++||++|+++||+++++= T Consensus 448 ~~d~~l~~~~~~~~~~~~~~~l~~i~~~l~~~~s~ee~~~~L~~l~~~ld~~~l~~~l~~a~~~A~l~Gr~~~~~e 523 (526) T protein:vir:99 448 ALDKALADLPAKDMQNQANDLLAPLLEAVNRGDSETELLGALAEAFPDMDDSALTDALHRLLFAADTWGRLHGNLD 523 (526) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHhhhhhhhhc Confidence 4443 55566778999999999999999999999999999999999999999999999999999999999988777 No 3 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=100.00 E-value=4.5e-156 Score=872.24 Aligned_cols=516 Identities=28% Similarity=0.464 Sum_probs=464.5 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |++ |+|++|||++++.|+++++ ++++++++++++||++||||.||++|||.|++||+.+|.+||++|+++|+||++ T Consensus 1 ~~~---~~d~~g~p~~~~~~~~~~~-~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s 76 (528) T protein:vir:10 1 MAA---IVDIYGNPLRTQQLRKQQT-AHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFA 76 (528) T ss_pred CCe---eECCCCCccccccccchhh-hhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHH Confidence 876 9999999999999999965 689999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCceEecCC-CCHHHHHHHHHHHHHHhCC-CHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeEee Q lcl|NC_020866. 
81 VIGTRKRSVSQLDITVEPAS-DDPQDVKIADMVRDWLKRE-ELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLIWR 158 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~-~~~~~~~~ae~v~~~l~~~-~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~ 158 (547) ++++|+++|++++|+|+|++ ++++++++|++|+++|+++ +|++++++|++|++|||||+|++|..++|.|.|+++.+| T Consensus 77 ~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~~lda~~~G~s~~Ei~w~~~~g~~~~~~~~~r 156 (528) T protein:vir:10 77 EMSKRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLDCMDGVGHGYSAIELDWSLQGREWLPQAFDHR 156 (528) T ss_pred HHHHHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHHHHhhhhhcceeEEEEEeecCCceeEEEeeee Confidence 99999999999999999986 4688899999999999886 699999999999999999999999999999999999999 Q ss_pred ccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEE Q lcl|NC_020866. 159 DPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRI 238 (547) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~v 238 (547) ||+||.|+.+++.+++++++...|+++|++|||+|+|+.++++|||.||++.|||+|+||++++++|+.|+|+||+|++| T Consensus 157 ~~~~f~~~~~~~~~l~~~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~i 236 (528) T protein:vir:10 157 PQSWFQLNPDDQDELRLRDNSIAGEVLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRL 236 (528) T ss_pred cccceeeccCCCcEEeccCCCCCceeecCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEE Confidence 99999999999888888888889999999999999999999999999999999999999999999999999999999999 Q ss_pred EeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCc---cc Q lcl|NC_020866. 
239 GKWQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDA---VA 315 (547) Q Consensus 239 gk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~---~g 315 (547) |||+++++++++++|+++|++|++++++|||.|++|||+++++ ++.++|++|++|||++|||+||||||||++ ++ T Consensus 237 gky~~~a~~~ek~~L~~al~~i~~~~~~iiP~~~~ie~~ea~~--~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~ 314 (528) T protein:vir:10 237 GKYPPGTPDEEKVTLLRAVTGLGHAAAGIIPESMSIDFQEASK--GSAEPFMAMMRWCDDSMSKAILGGTLTSQTSESGG 314 (528) T ss_pred EecCCCCCHHHHHHHHHHHHHHhhCcEEEecCCceeEEeecCC--CChhHHHHHHHHHHHHHHHHHhhhhhhcccccccc Confidence 9999999999999999999999999999999999999999864 445689999999999999999999999863 46 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---ccccEEEecCccccCHHHHHHHHHHHHhCCCCC Q lcl|NC_020866. 316 GGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQ---KRYPRIKIARPKTEDLTKLASSLDTLVRLGMEI 392 (547) Q Consensus 316 gs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~---~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v 392 (547) ||+|+|+||++|+++++++|+++|+++||++||++|+.+||++. ..+|+|+|+..+++|++++++++++|+++|++| T Consensus 315 gS~Alg~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i 394 (528) T protein:vir:10 315 GAYALGQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQV 394 (528) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCCCC Confidence 89999999999999999999999999999999999999999864 467999999999999999999999999999999 Q ss_pred CHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchh Q lcl|NC_020866. 393 EENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGA 472 (547) Q Consensus 393 ~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (547) +++|++++||||.|+++|+++.+...+++....+...+.. ...+. 
........++ T Consensus 395 ~~~~i~e~~gip~p~~~e~~~~~~~~~~~~~~~~~~~~~~--~~~~~-----------------------~~~~~~~~~~ 449 (528) T protein:vir:10 395 PVNWVQEQLGIPLPANGEAVLGDQAGAGIAQLSRRPGPRI--AALAQ-----------------------VIGPRYRDQE 449 (528) T ss_pred CHHHHHHHhCCCCCCCCcccccCCCcccccccCccccccc--ccccc-----------------------cccccccccc Confidence 9999999999999999998876544332221111111100 00000 0001112234 Q ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHhHHhhccC Q lcl|NC_020866. 473 EIAA-LTDQLELEGQPEIVAMIEEIRTMLEAASSLGEFSDMLDARYGEIGASQLASRISAAMLASEFAGREGALDG 547 (547) Q Consensus 473 ~~~~-l~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~~~~~~l~~~l~~al~~A~~~G~~~~~~g 547 (547) .++. +...+.+++++++++++++|+++|++|+||+|++++|.++||+|++++|+++|++||++|+++||+++++= T Consensus 450 ~~d~~~~~~~~~~~~~~~~~~l~~i~~~l~~~~s~ee~~~~L~~l~~~~d~~~l~~~l~~a~~~A~l~G~~~~~~e 525 (528) T protein:vir:10 450 ALDQVLASLPAQDMQNQADSLVAPLLDVISRGGSEAELLGALAEAFPDMDDSALADALHRLLFVADTWGRLNGTLD 525 (528) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 4444 44556678999999999999999999999999999999999999999999999999999999999998877 No 4 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=100.00 E-value=3.4e-153 Score=856.46 Aligned_cols=505 Identities=27% Similarity=0.424 Sum_probs=460.3 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |++ |+|++||||+++.+.++. 
+++++++++.|.+||++||||+||+.|||.|++||+.+|.+||++|+++|+||++ T Consensus 1 m~~---~~d~~g~p~~~~~~~~~~-~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L~~dm~~~D~hi~s 76 (512) T protein:vir:19 1 MGR---ILDISGQPFDFDDEMQSR-SDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADLAFDMEEKDTHLFS 76 (512) T ss_pred Ccc---eeCCCCCccccccccccc-cchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHH Confidence 887 999999999999999885 6789999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCceEecCCC-CHHHHHHHHHHHHHHhCC-CHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeEee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASD-DPQDVKIADMVRDWLKRE-ELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLIWR 158 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~-~~~~~~~ae~v~~~l~~~-~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~ 158 (547) +|++||++|++++|+|+|+++ +++++++|++|+++|.+. +|++++++|++|++|||||+|++|.+++|.|.|.++.+| T Consensus 77 ~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~lldA~~~G~s~~Ei~w~~~~g~~~~~~~~~r 156 (512) T protein:vir:19 77 ELSKRRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAAWFEDALFDAGDAILKGYSMQEIEWGWLGKMRVPVALHHR 156 (512) T ss_pred HHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCCCHHHHHHHHHhhhhhcceeeeeEeeeeCCceeeeeeeee Confidence 999999999999999999865 889999999999999875 799999999999999999999999999999999999999 Q ss_pred ccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEE Q lcl|NC_020866. 
159 DPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRI 238 (547) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~v 238 (547) ||+||.|+.+++.++++++++..|+++|++|||+|+|++++++|||.||++.|||+|+||++++++|+.|+|+||+|++| T Consensus 157 ~~~~f~~~~~~~~~lr~~~~~~~G~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~i 236 (512) T protein:vir:19 157 DPALFCANPDNLNELRLRDASYHGLELQPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRV 236 (512) T ss_pred ccccceeccCCCcEEEecCCCCCceeecCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeE Confidence 99999999999989998888889999999999999999999999999999999999999999999999999999999999 Q ss_pred EeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCc-cccc Q lcl|NC_020866. 239 GKWQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDA-VAGG 317 (547) Q Consensus 239 gk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~-~ggs 317 (547) |||+++++++++++|+++|++|++++++|||.|++|||+++++ ++.++|+++++|||++|||+||||||||++ ++|| T Consensus 237 gky~~~a~~~ek~~L~~al~~~~~~a~~iiP~~~~ie~~ea~~--~~~~~y~~li~~~d~~Isk~iLGqtlTs~~g~~Gs 314 (512) T protein:vir:19 237 GKYPTGSTNREKATLMQAVMDIGRRAGGIIPMGMTLDFQSAAD--GQSDPFMAMIGWAEKAISKAILGGTLTTEAGDKGA 314 (512) T ss_pred EecCCCCCHHHHHHHHHHHHHHhhCcEEEecCCceEEEeecCC--CCHHHHHHHHHHHHHHHHHHHhhhhhcccccccch Confidence 9999999999999999999999999999999999999999864 455789999999999999999999999984 6789 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---ccccEEEecCccccCHHHHHHHHHHHHhCCCCCCH Q lcl|NC_020866. 318 HAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQ---KRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEE 394 (547) Q Consensus 318 ~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~---~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~ 394 (547) +|+|+||++|+++++++|+++|+++||++||++|+.+||++. 
.++|+|+|+..+++|+++.++.+++|+ +|++|++ T Consensus 315 ~a~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~e~eDl~~~a~~~~~l~-~G~~i~~ 393 (512) T protein:vir:19 315 RSLGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTSEAGDITALSDAIPKLA-AGMRIPV 393 (512) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCChhhHHHHHHHHHHHh-cCCCCCH Confidence 999999999999999999999999999999999999999865 358999999999999999999999997 8999999 Q ss_pred HHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchhHH Q lcl|NC_020866. 395 NEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEI 474 (547) Q Consensus 395 ~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 474 (547) +|++++||||.|+++|+++.......+...... +. ........++.+ T Consensus 394 ~~i~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~----------~~-----------------------~~~~~~~~~~~~ 440 (512) T protein:vir:19 394 SWIQEKLHIPQPVGDEAVFTIQPVVPDNGSQKE----------AA-----------------------LSAEDIPQEDDI 440 (512) T ss_pred HHHHHHhCCCCCCCccccccCCCcccccccccc----------cc-----------------------ccccCCCchhhH Confidence 999999999999998887655332211110000 00 000111234556 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHhHHhhccC Q lcl|NC_020866. 475 AALTDQLELEGQPEIVAMIEEIRTMLEAASSLGEFSDMLDARYGEIGASQLASRISAAMLASEFAGREGALDG 547 (547) Q Consensus 475 ~~l~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~~~~~~l~~~l~~al~~A~~~G~~~~~~g 547 (547) +++... 
..+++.++++++++|+++++ +.|++|++++|.++||+|++++|+++|++||++|+++||+++++- T Consensus 441 d~~~~~-~~~~~~~~~~~~~~i~~~~~-~~s~ee~~~~L~~l~~~ld~~~l~~~l~~a~~~A~l~G~~~~~~e 511 (512) T protein:vir:19 441 DRMGVS-PEDWQRSVDPLLKPVIFSVL-KDGPEAAMNKAASLYPQMDDAELIDMLTRAIFVADIWGRLDAAAD 511 (512) T ss_pred hHHhhh-HHHHHHHHHHHHHHHHHHHH-hCCHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHhhhhhhhcc Confidence 666665 35688999999999999996 469999999999999999999999999999999999999999999 No 5 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=100.00 E-value=7.3e-142 Score=794.36 Aligned_cols=490 Identities=21% Similarity=0.277 Sum_probs=432.4 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |+ |+|||++|+|++...++++++.. + ..++.+..+|+.++||.++++|||.++ ++ ..+|++|+ +|+||++ T Consensus 1 m~--~~i~~~~g~p~~~~~~~~~~~~~-i-a~~~~~~~~~~~~~~~~~~~~iLr~~~-~~----~~~y~~m~-~D~~i~s 70 (491) T protein:vir:10 1 MS--KGLWVSPTEFVTFGEPDKSLSSQ-I-ATRARSIDFFALGMYLPNPDPVLKALG-KD----IRVYRELR-ADAHVGG 70 (491) T ss_pred CC--CceeCCCCCccCcccCChHHHHH-H-HhhhcccccccccCCccchHHHHHhcC-CC----HHHHHHHh-hChHHHH Confidence 87 89999999999988877776532 2 235567889999999999999999876 44 46888887 7999999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeEeecc Q lcl|NC_020866. 
81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLIWRDP 160 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~ 160 (547) ++++|+++|++++|+|+|++++ .+.+++|+++|++++|++++++|++|++|||||+|++|.+++|.|.|.++.+||| T Consensus 71 ~l~~Rk~av~~~~w~i~~~~~~---~~~~e~v~e~l~~~~~~~~l~~~lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~ 147 (491) T protein:vir:10 71 CVRRRKAAVKALEWGLDRGKAK---SRVAKSIADVFADLDLSRIVTEMLDAVLYGYQPMEITWGKVGNYIVPIDVVGKPA 147 (491) T ss_pred HHHHHHHHHhCCCcEEecCCCC---HHHHHHHHHHHhcCCHHHHHHHHHHhhhhcceeEEEEEeecCCeeEEEEeeeecc Confidence 9999999999999999998764 3568899999999999999999999999999999999999999999999999999 Q ss_pred ceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe Q lcl|NC_020866. 161 RHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGK 240 (547) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk 240 (547) +||.|+.+++.+++..+++..|+++|++|||+|+|+.+++||||.||++.|||+|+||++++++|+.|+|+||+|++||| T Consensus 148 ~~f~~d~~~~l~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igk 227 (491) T protein:vir:10 148 DWFVYDPENQLRFRSKDHWMQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGK 227 (491) T ss_pred cceeeccCCceEEecCCCCCCcceecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe Confidence 99999999988888888888999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccch Q lcl|NC_020866. 
241 WQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAV 320 (547) Q Consensus 241 ~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~ 320 (547) ||++++++++++|++++++|++++++|||.|++|||+++++.+++.++|+++|+|||++|||+||||||||++ |||+|+ T Consensus 228 y~~~a~~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~-~gs~a~ 306 (491) T protein:vir:10 228 HPRSASDGEKNLLLDCLEDMVQDAVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQTTEA-TSTRAS 306 (491) T ss_pred cCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhcccCc-ccchhH Confidence 9999999999999999999999999999999999999998777888899999999999999999999999995 789999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHH Q lcl|NC_020866. 321 SQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSR 400 (547) Q Consensus 321 ~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~ 400 (547) |+||++|+++++++|+++|+++|| +||++++.+||++ ..+|+|+|.+.+ ++.+.+++++++|+++|++++++|++++ T Consensus 307 ~~vh~~v~~di~~~D~~~i~~tln-~li~~l~~~N~~~-~~~p~f~~~~~~-e~~~~~a~~~~~L~~~G~~i~~~~i~e~ 383 (491) T protein:vir:10 307 AQAGLEVTDDIRDGDKAVVSEAMN-MLIRWICDLNFDG-ADRPVFDMWEQE-QVDEIQAGRDQKLTQAGARFTPAYFKRA 383 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCC-CCcceEEecCcC-chhHHHHHHHHHHHhCCCcCCHHHHHHH Confidence 999999999999999999999999 5999999999974 457999998765 4447899999999999999999999999 Q ss_pred hCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchhHHHH-HHH Q lcl|NC_020866. 401 FGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIAA-LTD 479 (547) Q Consensus 401 ~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~ 479 (547) ||||.|.++++.+....+...++.. .+ .. ....++.++. 
+.+ T Consensus 384 ~Gip~~~~~~~~~~~~~~~~~~~~~----~~-------~~--------------------------~~~~~~~~d~~~~~ 426 (491) T protein:vir:10 384 YNLQDGDLDERPLPVSAVDTVGAAS----FA-------EF--------------------------EAPDQDALDAALNT 426 (491) T ss_pred hCCCCCCcCccccccCCCCCccccc----cc-------cc--------------------------CCCCCCchHHHHHH Confidence 9999998877654322211111000 00 00 0001112222 223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHhHHhh Q lcl|NC_020866. 480 QLELEGQPEIVAMIEEIRTMLEAASSLGEFSDMLDARYGEIGASQLASRISAAMLASEFAGREGA 544 (547) Q Consensus 480 ~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~~~~~~l~~~l~~al~~A~~~G~~~~ 544 (547) .....+++++++++++|+++|++|+||+|++++|.++||+|++++|+++|++||++|+++||+++ T Consensus 427 ~~~~~~~~~~~~~~~~i~~~l~~~~s~~e~~~~L~~l~~~~d~~~l~~~l~~a~~~A~l~G~~~a 491 (491) T protein:vir:10 427 LSARDLNADAQALVAPLLKRIANGASADELLGMLAELYPSLDADALQERLARAIFVANLWGRLHA 491 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHhhhccC Confidence 33457888999999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=100.00 E-value=1.3e-139 Score=782.09 Aligned_cols=490 Identities=20% Similarity=0.273 Sum_probs=432.2 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |+ |+|+|++|+|++++..++++.. 
.+...++.+..|+..+++|.+ +++|+.++ ++ .++|++|+ +|+||++ T Consensus 1 ~~--~~i~~~~g~~~~~~~~~~~~~~-~ia~~~~~~~~~~~~~~~p~~-~~il~~~~-~~----~~~y~~m~-~D~~i~s 70 (491) T protein:vir:79 1 MS--KGLWVSPTEFVKFGEPDKSLSS-QIATRARSIDFFALGMYLPNP-DPVLKALG-KD----IRVYRELR-ADAHVGG 70 (491) T ss_pred CC--CeeeCCCCCcccccccchhHHH-HHhhhccccccccccccCcch-hHHHhhcc-CC----HHHHHHHh-hChHHHH Confidence 87 7999999999999887777643 555677788889999999988 77998654 44 46889997 7999999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeEeecc Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLIWRDP 160 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~ 160 (547) ++++|+++|++++|+|+|++++. +++++|+++|++++|++++++|++|++|||||+|++|..++|.|.|.++.+||| T Consensus 71 ~l~~Rk~av~~~~w~i~~~~~~~---~~a~~i~e~l~~~~~~~~i~~~lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~ 147 (491) T protein:vir:79 71 CVRRRKAAVKALEWGLDRGKAKS---RVAKSIADVFADLDLSRIATEMLDAVLYGYQPMEITWGKVGNYIVPIDVVGKPA 147 (491) T ss_pred HHHHHHHHHhCCCcEEecCCCCH---HHHHHHHHHHhcCCHHHHHHHHHHhhhhcceeEEEEEeecCCeeeEEeeeeecc Confidence 99999999999999999987653 468999999999999999999999999999999999999999999999999999 Q ss_pred ceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe Q lcl|NC_020866. 
161 RHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGK 240 (547) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk 240 (547) +||.|+.+++.+++..++...|+++|++|||+|+|+.++++|||.||++.|||+|+||++++++|+.|+|+||+|++||| T Consensus 148 ~~f~~d~~~~l~l~~~~~~~~g~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igk 227 (491) T protein:vir:79 148 DWFVYDPENQLRFRSKEHWVQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGK 227 (491) T ss_pred cceeeccCCceEEeecCCCCCceeecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEe Confidence 99999999988888888888999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccch Q lcl|NC_020866. 241 WQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAV 320 (547) Q Consensus 241 ~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~ 320 (547) ||++++++++++|+++|++|++++++|||.|++|||+++++.+++.++|+++++|||++|||+||||||||++ |||+|+ T Consensus 228 y~~~a~~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~-~gs~a~ 306 (491) T protein:vir:79 228 HPRSASDAETNLLLDRLEDMVQDAVAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQNQTTEA-TSTRAS 306 (491) T ss_pred cCCCCCHHHHHHHHHHHHHHhcCeEEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhhhhccCc-ccchhh Confidence 9999999999999999999999999999999999999998877888899999999999999999999999985 789999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHH Q lcl|NC_020866. 
321 SQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSR 400 (547) Q Consensus 321 ~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~ 400 (547) |+||++|+++++++|+++|+++||+ ||++++.+||++ ...|+|.+.+.++ +.+.+++++++|+++|++++++|++++ T Consensus 307 ~~vh~~v~~~i~~~D~~~i~~tln~-li~~l~~~N~~~-~~~p~f~~~e~ee-~~~~~a~~~~~L~~~G~~i~~~~~~e~ 383 (491) T protein:vir:79 307 AQAGLEVTDDIRDGDKAIVVEAMNM-LIRWICDLNFDG-AARPVFDMWEQEQ-VDEIQAGRDEKLTRAGARFTPAYFKRA 383 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCC-CCcceEeecCcCc-hhHHHHHHHHHHHhCCCccCHHHHHHH Confidence 9999999999999999999999995 999999999964 4567887765443 336789999999999999999999999 Q ss_pred hCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchhHHHHHHHH Q lcl|NC_020866. 401 FGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIAALTDQ 480 (547) Q Consensus 401 ~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 480 (547) ||||+|+++++++....+.+.+..+. +. .....++.++.+.+. T Consensus 384 ~Gip~~~~~e~~~~~~~~~~~~~~~~-----------~~--------------------------~~~~~~~~~d~~~~~ 426 (491) T protein:vir:79 384 YNLQDGDLDERPLPVSAVDAVGAASF-----------AE--------------------------FEAPDQDALDAALNA 426 (491) T ss_pred hCCCCCCCCccccCcCcccccccccc-----------cc--------------------------cCCCCCcchHHHHHH Confidence 99999988877654332211111000 00 000112334444444 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHhHHhh Q lcl|NC_020866. 
481 L-ELEGQPEIVAMIEEIRTMLEAASSLGEFSDMLDARYGEIGASQLASRISAAMLASEFAGREGA 544 (547) Q Consensus 481 ~-~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~~~~~~l~~~l~~al~~A~~~G~~~~ 544 (547) + ...+++++++++++|+++|++|+|++|++++|.++||+|++++|+++|++||++|+++||+++ T Consensus 427 ~~~~~~~~~~~~~~~~i~~~l~~~~s~~e~~~~L~~l~~~~d~~~l~~~l~~a~~~A~l~Gr~~a 491 (491) T protein:vir:79 427 LSARDLNADAQALVAPLLKRIANGASADELLGMLAELYPSLDTDALQERLARAIFVANLWGRLHA 491 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHhhhccC Confidence 3 346888999999999999999999999999999999999999999999999999999999999 No 7 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=100.00 E-value=4.5e-138 Score=773.55 Aligned_cols=478 Identities=23% Similarity=0.315 Sum_probs=433.1 Q ss_pred ccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCc Q lcl|NC_020866. 15 VQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDI 94 (547) Q Consensus 15 ~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w 94 (547) |+++.|++++|| ..++++++..|++.+.+| + +.+|+.+++|++. +|++|+ +|+||++++++|+++|++++| T Consensus 1 v~~~~l~~e~at--~~~~~d~~~~~~~~l~~~-~-~~il~~a~~g~~~----~y~~l~-~D~~i~s~l~~rk~av~~~~w 71 (488) T protein:vir:99 1 MEKPALGREIAT--SGDGRDITRPFISGLQVP-N-DSILQRRGGNDLR----VYEEIL-SDAQVKTVWGQRQLAVVSREW 71 (488) T ss_pred CCccchhHHHHH--HHhhhhhhccccCCCCCC-C-hHHHHhhccCCHH----HHHHHh-hChHHHHHHHHHHHHHhcCCc Confidence 999999999985 667788888888765545 4 6999999999975 556665 599999999999999999999 Q ss_pred eEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCccee Q lcl|NC_020866. 
95 TVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATPL 174 (547) Q Consensus 95 ~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~~ 174 (547) +|+|+++++++++++++|+++|++++|++++++||+|++|||||+|++|.+++|.|.|.++.+|||+||.|+.+++.+++ T Consensus 72 ~i~p~~~~~~~~~~ae~v~~~l~~~~~~~~l~~~lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~~~~l~~~ 151 (488) T protein:vir:99 72 KVEAGGDRPIDQAAAEHLEQQLQRVGWDRVTSKMLFGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRFRYDQDGGLRLL 151 (488) T ss_pred eEEcCCCChHHHHHHHHHHHHHhCCCHHHHHHHHHhhhhhcceeEEEEEeecCCeeeEeeeeeecccceeecCCCceEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999988888 Q ss_pred eccCCCCceeec-cCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCC-CCCHHHHHH Q lcl|NC_020866. 175 LLDDNGAEVPLP-AFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQP-GASEQDKDT 252 (547) Q Consensus 175 ~~~~~~~g~~l~-~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~-~~~~~~~~~ 252 (547) ..++...|+++| +++|++|+|+.++|||||.||++.|||+|+||++++++|+.|+|+||+|++|||||+ +++++++++ T Consensus 152 ~~~~~~~g~~lp~~~~~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~ 231 (488) T protein:vir:99 152 TPNNMFEGEPCPAPYFWHFSTGADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAK 231 (488) T ss_pred ccCCCCCccccccCceEEEEeecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHH Confidence 888888899996 679999999999999999999999999999999999999999999999999999996 788999999 Q ss_pred HHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHHHHHHH Q lcl|NC_020866. 
253 LFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLVQEDIE 332 (547) Q Consensus 253 l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v~~~~~ 332 (547) |+++++++++++++|||.|++|||+++++ ++.++|+++++|||++|||+||||||||++++||+|+|+||++|+++++ T Consensus 232 l~~av~~~~~~~~~viP~~~~ie~~ea~~--~~~~~~~~li~~~d~~Isk~iLGqtlts~~~~Gs~a~~~vh~~v~~d~~ 309 (488) T protein:vir:99 232 LLAALHAIQTDSAIIMPAGMQAELLEAGR--SGTADYKTLHDTMDATIAKVGLGQVASTQGTPGRLGNDDLQADVRLDLV 309 (488) T ss_pred HHHHHHHHhcCcEEEecCCceeEEeecCC--CChHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHHHHHHH Confidence 99999999999999999999999999864 3446899999999999999999999999988899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhC-CCCCCHHHHHHHhCCCCCCCcch Q lcl|NC_020866. 333 TSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRL-GMEIEENEVRSRFGFSQPKPGAK 411 (547) Q Consensus 333 ~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~-G~~v~~~~~~e~~Glp~p~~~e~ 411 (547) ++|+++|+++||+|||++|+.+||+ ...+|+|+|+..+++|++++++++++|+++ |++++++|++++||||+|+++++ T Consensus 310 ~aDa~~i~~tln~~li~~l~~~N~~-~~~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip~~~~~~~ 388 (488) T protein:vir:99 310 KADADLICESFNLGPARWLTEWNFP-GAQPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGVEVESTQAE 388 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCcC-CcCCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCCCcccccc Confidence 9999999999999999999999995 567899999999999999999999999997 99999999999999999988776 Q ss_pred hcccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 412 LLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIAALTDQLELEGQPEIVA 491 (547) Q Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 491 (547) +..+.+.... +.. 
....+..+.+.+++..++++++.+ T Consensus 389 ~~~~~~~~~~---------~~~----------------------------------~~~~~~~~~~~~~~~~~~~~~~~~ 425 (488) T protein:vir:99 389 ATAPTPSTEF---------AEG----------------------------------DQPSDPAAAMAPQLAEAMQPVVGN 425 (488) T ss_pred cccCCCcccC---------CCC----------------------------------CCCCCchHHHHHHHHHHHHHHHHH Confidence 5432211000 000 001122345667777788999999 Q ss_pred HHHHHHHHHHccCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHhHHhhccC Q lcl|NC_020866. 492 MIEEIRTMLEAASSLGEFSDMLDARYGEIGASQLASRISAAMLASEFAGREGALDG 547 (547) Q Consensus 492 ~~~~i~~~l~~a~s~ee~~~~L~~l~~~~~~~~l~~~l~~al~~A~~~G~~~~~~g 547 (547) ++++|+++|++|+||+|++++|.++||+|++++|+++|++||++|+++||+++.|- T Consensus 426 ~~~~i~~~l~~a~s~ee~~~~L~~l~~~~d~~~l~~~l~~a~~~a~l~G~~~~~~e 481 (488) T protein:vir:99 426 WTTQLRTLIEQASSLEDLRERLLDLAPQLSLDQYAQAMAEGLEAAHLAGRNDVQEE 481 (488) T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHhhhhhHhhh Confidence 99999999999999999999999999999999999999999999999999999876 No 8 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=100.00 E-value=6.8e-112 Score=630.04 Aligned_cols=418 Identities=15% Similarity=0.158 Sum_probs=342.5 Q ss_pred hhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHH-------------HHHHHHHHHhhhHHHHHHHHHHHHHHh Q lcl|NC_020866. 24 VAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVR-------------YLDLAEVIEERDPHYLGVIGTRKRSVS 90 (547) Q Consensus 24 ~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~-------------~~~L~~~m~~~D~~v~s~l~~Rk~~v~ 90 (547) .| -...+.+||||.||+.|++.+..|+... 
+.+||++|+ +|+||+++|++|+++|+ T Consensus 1 ~~----------~~~~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr~~~~~~ly~~m~-~D~hi~s~l~~Rk~av~ 69 (488) T protein:vir:95 1 MA----------DITETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALRFPESIKTFQLMM-RDPAVAASVNIIKMFVR 69 (488) T ss_pred CC----------CccccCCCCCHHHHHHHHHHhhccccchhhccchhhhcccchHHHHHHHh-hChHHHHHHHHHHHHHh Confidence 11 1234678999999999999999877655 456999998 59999999999999999 Q ss_pred cCCceEecCCCCHH---HHHHHHHHHHHHhCC--CHHHHHHHHHHhhhhcCeeeeeEEeec------------CCeeeEE Q lcl|NC_020866. 91 QLDITVEPASDDPQ---DVKIADMVRDWLKRE--ELQQEVFHILDCISKGYSFTEIVWNTS------------EGQYFPD 153 (547) Q Consensus 91 ~~~w~v~p~~~~~~---~~~~ae~v~~~l~~~--~~~~~i~~~l~a~~~G~sv~Ei~w~~~------------~g~~~~~ 153 (547) +++|+|+|++++++ +++.|++|+++++++ +|+++|++||+|++|||||+|++|.+. +|.|.|+ T Consensus 70 ~~~w~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~lda~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~ 149 (488) T protein:vir:95 70 KVNWRFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSVMSFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWA 149 (488) T ss_pred cCCceEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHHHHhhcccceeeeeeeeccccccccccccccCCeeeee Confidence 99999999875544 445789999999886 589999999999999999999999763 7899999 Q ss_pred eeEeecc---ceeEecccCCcceeeccC---------------CCCceeeccCceEEEeecCCcCccccchhHHHHHHHH Q lcl|NC_020866. 154 KLIWRDP---RHFRFQKSDLATPLLLDD---------------NGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGW 215 (547) Q Consensus 154 ~l~~~~~---~~~~~~~~~~~~~~~~~~---------------~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~ 215 (547) +|.+||| +||.|+.+++..++..+. 
...++.+|++|||+|+|+++++||||.||++.|||+| T Consensus 150 ~i~~Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~ 229 (488) T protein:vir:95 150 KLPIRNQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPW 229 (488) T ss_pred eeeecCcccccceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecCCCCccchhhHHHHHHHHH Confidence 9999999 699999988776665443 2356789999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHcCCCeEEEeCCC----CCCHHHHHHHHHHHHHHhcC------eeEEecCCceEEEEe------- Q lcl|NC_020866. 216 MFKAFTARDWAIFTQTYGQPLRIGKWQP----GASEQDKDTLFRAVANIAGD------CAAIIPETMSIDFVE------- 278 (547) Q Consensus 216 ~~K~~~~~~w~~f~E~~G~P~~vgk~~~----~~~~~~~~~l~~al~~~~~~------~~~vip~~~~ie~~e------- 278 (547) +||++++++|+.|+||||+|++++++++ ++++++++++++++.+++++ +++|||.++++++.+ T Consensus 230 ~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l 309 (488) T protein:vir:95 230 KYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSL 309 (488) T ss_pred HHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeeccccccccchhhhhhhc Confidence 9999999999999999988888877754 46788899999999988754 678999999887643 Q ss_pred cCCCCCchHHHHHHHHHHHHHHHHHHhcchhccC-ccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_020866. 
279 SKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTD-AVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYG 357 (547) Q Consensus 279 ~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~-~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg 357 (547) .+.++++..+|.++|+|||++|||+|||||||++ ++|||+|+|+||++|+++++++|+++|+++||+|||+||+.+||| T Consensus 310 ~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~~Nfg 389 (488) T protein:vir:95 310 VSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFSLADSKTSLLAMSVDILLKQIKNVINRDLVAQTYALNMW 389 (488) T ss_pred cccccCCchhHHHHHHHHHHHHHHHHhccccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 2334456667999999999999999999999974 567999999999999999999999999999999999999999999 Q ss_pred CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCC----HHHHHHHhCCCCCCCcchhccccccccccccccccccccc Q lcl|NC_020866. 358 PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIE----ENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAG 433 (547) Q Consensus 358 ~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~----~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~ 433 (547) +...+|+|+|+..+++|++++++++++|+++|+.++ ++|++++||||.|.++++++.+..+.+.+..+....... T Consensus 390 ~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~- 468 (488) T protein:vir:95 390 DDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADESQPVSEKLSPNSQSRSGDGYKTAG- 468 (488) T ss_pred CCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCccccccCCCCCCCCCCcccCCCc- Confidence 999999999999999999999999999999999998 689999999999998888876654433332222211111 Q ss_pred CCccccccccccccchhhhhhcccchhhhcccccccchhHHHHHHHH Q lcl|NC_020866. 434 DLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIAALTDQ 480 (547) Q Consensus 434 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 480 (547) ......+++ +....+.. 
+++ T Consensus 469 ~~~~~~~~~-~~~~~a~~--------------------------~~~ 488 (488) T protein:vir:95 469 EGTAKTPSA-KDPSTANK--------------------------ANK 488 (488) T ss_pred ccCCccccc-ccchhhhh--------------------------ccC Confidence 111111110 00000000 000 No 9 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=100.00 E-value=3.8e-110 Score=620.48 Aligned_cols=423 Identities=13% Similarity=0.164 Sum_probs=329.6 Q ss_pred hhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHH---HHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceE Q lcl|NC_020866. 20 LTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPV---RYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITV 96 (547) Q Consensus 20 l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~---~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v 96 (547) +++..+.++.. .....-.++++++. ..+|++++.++.. ...+||++|+++|+||+++|++|+++|++++|+| T Consensus 1 ~~~~~~~~~p~---~~~g~~~~~~~~~~--~~~~~~~e~~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~w~v 75 (469) T protein:vir:10 1 MTERVKTAAPV---SEAGYVFGSGVVDG--WTVWDPFEQTPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTPWRI 75 (469) T ss_pred CCCcccCCCCc---cchhhhhhcccccc--hhhccccccccccccccchHHHHHHHhhChHHHHHHHHHHHHHhcCCceE Confidence 22222221111 01122223455543 5788888776553 5688999999999999999999999999999999 Q ss_pred ecCCCCHHHHHHHHHHHHHHhC-----------------CCHHHHHHHHHH-hhhhcCeeeeeEEeec----CCeeeEEe Q lcl|NC_020866. 97 EPASDDPQDVKIADMVRDWLKR-----------------EELQQEVFHILD-CISKGYSFTEIVWNTS----EGQYFPDK 154 (547) Q Consensus 97 ~p~~~~~~~~~~ae~v~~~l~~-----------------~~~~~~i~~~l~-a~~~G~sv~Ei~w~~~----~g~~~~~~ 154 (547) +|++++. ++++++.++|.. .+|.++|.++|+ |++|||||+|++|.+. 
+|.|.|.+ T Consensus 76 ~p~~~~~---e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~~ 152 (469) T protein:vir:10 76 RANGASD---EVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLRK 152 (469) T ss_pred ecCCCCH---HHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeeee Confidence 9987643 344555554432 247777777775 8999999999999865 58899999 Q ss_pred eEeeccc---eeEecccCCcceeecc------------CCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHH Q lcl|NC_020866. 155 LIWRDPR---HFRFQKSDLATPLLLD------------DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKA 219 (547) Q Consensus 155 l~~~~~~---~~~~~~~~~~~~~~~~------------~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~ 219 (547) |.+|||+ +|.++.+++...+... .+..++++|++|||+|+|+++++||||.||++.|||+|+||+ T Consensus 153 l~~rp~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~ 232 (469) T protein:vir:10 153 LAPRPQWTISKFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKHWLLKD 232 (469) T ss_pred eeecCcccceeeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecCCCCCcccchhHHHHHHHHHHHH Confidence 9999997 4556666544433321 234578999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHH--hcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHH Q lcl|NC_020866. 
220 FTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANI--AGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLD 297 (547) Q Consensus 220 ~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~--~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d 297 (547) +++++|+.|+|+||+|++||||+++++++++++|+++++++ ++++++|||.|++|||+++++ +..+|+++|+||| T Consensus 233 ~~~~~w~~f~EryG~P~~vgky~~~a~~~ek~~l~~a~~~~~~g~~a~~iip~~~~ie~~ea~g---~~~~~~~li~~~d 309 (469) T protein:vir:10 233 KLLRIEAATAERNGMGIPVGTASSATDEDEVRKMAALARSVRGGINAGVGLAQGQILELLGVSG---NLPDIRRAIEGHD 309 (469) T ss_pred HHHHHHHHHHHHcCCcceEEecCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEeecCC---CchHHHHHHHHHH Confidence 99999999999999999999999999999999999999998 568899999999999999753 2347999999999 Q ss_pred HHHHHHHhcchhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHH Q lcl|NC_020866. 298 QQISKAVLGQTTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTK 377 (547) Q Consensus 298 ~~Isk~ilGqtlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~ 377 (547) ++|||+|||||||++++|||+|+|+||++|+++++++|+++|+++||++||++|+.+||++...+|+|+|+..+ ++.+. T Consensus 310 ~~Isk~iLG~tlTs~~~gGS~a~~~vh~ev~~d~~~sDa~~i~~tln~~li~~l~~lN~g~~~~~P~~~~~~~e-~~~~~ 388 (469) T protein:vir:10 310 RSIALSGLAHFLNLDGKGGSYALASVLEDPFTQAVHAYATSICRIANQHIIEDLVDINFGVDTPAPVLTFDPIG-SRQDL 388 (469) T ss_pred HHHHHHHhcccccccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecCCC-CcHHH Confidence 99999999999999988999999999999999999999999999999999999999999999999999998776 55678 Q ss_pred HHHHHHHHHhCCC----CCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhh Q lcl|NC_020866. 378 LASSLDTLVRLGM----EIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSR 453 (547) Q Consensus 378 ~a~~~~~l~~~G~----~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (547) .++++++|+++|+ +++++|++++||||+|.++++++.+..+.+.+.+.+...........+..+..+.. 
T Consensus 389 ~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 461 (469) T protein:vir:10 389 TAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPSAEPEEPAAVPNQSAAPARTRSSGNADARARAPKA------- 461 (469) T ss_pred HHHHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCcccccchhcccCCCCCccccccCCCCCcccccccCCC------- Confidence 8999999999999 88899999999999999998887665443322222111111111111111110100 Q ss_pred hcccchhhhcccccccchhHHHHHHHH Q lcl|NC_020866. 454 GDLSLNSEMPSTGQKTGGAEIAALTDQ 480 (547) Q Consensus 454 ~~~~~~~~~~~~~~~~~~~~~~~l~~~ 480 (547) .. +.|.+. T Consensus 462 ----------------~~---~~l~da 469 (469) T protein:vir:10 462 ----------------DQ---GVLFDA 469 (469) T ss_pred ----------------hH---HhhccC Confidence 00 011110 No 10 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=100.00 E-value=1.7e-109 Score=616.85 Aligned_cols=425 Identities=16% Similarity=0.140 Sum_probs=352.0 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) ||+..+- -.-+.|+. ...++.+++++.+++..+..+|..|++|.++++|||..+ ..+||++|++ |+||++ T Consensus 1 m~k~~~k-~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~------~~~ly~~m~~-D~hi~s 70 (448) T protein:vir:79 1 MAKRGRK-PKELVPGP--GSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKD------GLLVYHKMLS-DGTVKN 70 (448) T ss_pred CCCCCCC-CccccCcc--cccccccchhhhhhhhhhcccccccccccchhHhhcccc------chHHHHHHhh-ChHHHH Confidence 9986653 22334444 335556778899999999999999999999999998643 2589999986 999999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhC-------CCHHHHHHHHHHhhhhcCeeeeeEEee-cCCeeeE Q lcl|NC_020866. 
81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKR-------EELQQEVFHILDCISKGYSFTEIVWNT-SEGQYFP 152 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~-------~~~~~~i~~~l~a~~~G~sv~Ei~w~~-~~g~~~~ 152 (547) +|++|+++|++++|+|+|++++++++++|++|+++|.. .+|++++.+||+|++|||||+|++|.+ .+|.|.+ T Consensus 71 ~l~~Rk~av~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~~lda~~~G~s~~Eivw~~~~~g~~~~ 150 (448) T protein:vir:79 71 ALNYIFGRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYENAYIYGMAAGEIVLTLGADGKLIL 150 (448) T ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHHHHHhhhhcceeEEEEeeecCCCceec Confidence 99999999999999999999999999999999999864 479999999999999999999999986 5899999 Q ss_pred EeeEeeccc---eeEecccCCcceeeccCC-------CCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHH Q lcl|NC_020866. 153 DKLIWRDPR---HFRFQKSDLATPLLLDDN-------GAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTA 222 (547) Q Consensus 153 ~~l~~~~~~---~~~~~~~~~~~~~~~~~~-------~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~ 222 (547) .++.+|||+ ||.|+.+++..++.+++. ..++++|.++|++| +++++|||||.||++.|||+|+||++++ T Consensus 151 ~~l~~r~~~~~~~f~~~~d~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~-~~~~~g~p~g~gLlr~~~w~~~fK~~~~ 229 (448) T protein:vir:79 151 DKIVPIHPFNIDEVLYDEEGGPKALKLSGEVKGGSQFVSGLEIPIWKTVVF-LHNDDGSFTGQSALRAAVPHWLAKRALI 229 (448) T ss_pred ccccccCCccccceeeecCCceEEeecCCcccccccCCCccccccceEEEE-ecCccCCcccchhHHHHHHHHHHHHHHH Confidence 999999997 899999888877776543 24567788887766 5679999999999999999999999999 Q ss_pred HHHHHHHHHcCCCeEEEeCCCCCC--HHHHHHHHHHHHHH--hcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHH Q lcl|NC_020866. 223 RDWAIFTQTYGQPLRIGKWQPGAS--EQDKDTLFRAVANI--AGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQ 298 (547) Q Consensus 223 ~~w~~f~E~~G~P~~vgk~~~~~~--~~~~~~l~~al~~~--~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~ 298 (547) ++|+.|+|+||+|++|||||++++ ++++++|+++++++ ++++++|||.|++|||+++++++ . 
.|.++|+|||+ T Consensus 230 ~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~~~~ie~~ea~~~~--~-~~~~~i~~~d~ 306 (448) T protein:vir:79 230 LLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAM--P-DAIPYLTYHDA 306 (448) T ss_pred HHHHHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCc--c-cHHHHHHHHHH Confidence 999999999999999999998876 57888999999987 57899999999999999987533 2 46789999999 Q ss_pred HHHHHHhcchhccCccccccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHH Q lcl|NC_020866. 299 QISKAVLGQTTTTDAVAGGHAVS-QEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTK 377 (547) Q Consensus 299 ~Isk~ilGqtlts~~~ggs~a~~-~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~ 377 (547) +|||+|||||||+++++||++.+ .+|.+++++++++|+++|+++||++||++|+++|||+..++|+|+|+..+++|+++ T Consensus 307 ~Isk~iLGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lNfg~~~~~P~~~f~~~e~~Dl~~ 386 (448) T protein:vir:79 307 GIARALGIDFNTVQLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPSATRFPRLTFEMEERNDFSA 386 (448) T ss_pred HHHHHHhhhhhccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCCcEEEecCCChHHHHH Confidence 99999999999998776665444 47889999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhccc Q lcl|NC_020866. 378 LASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLS 457 (547) Q Consensus 378 ~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (547) +++++++|+.++ +..++|+++++|+|+|.++++.......+..... .. .+. + T Consensus 387 ~a~~~~~l~~~~-~~~~~~~~~~~~~p~~~~~~~~~a~~~~~~~~~~------~~----------~~~-~---------- 438 (448) T protein:vir:79 387 AANLMGMLINAV-KDSEDIPTELKALIDALPSKMRRALGVVDEVREA------VR----------QPA-D---------- 438 (448) T ss_pred HHHHhhhhhccc-hhhHHHHHHhhcCCCCCCCccccccCCCCccccc------cc----------CCc-c---------- Confidence 999999999885 6778999999999998877643221111110000 00 000 0 Q ss_pred chhhhcccccc Q lcl|NC_020866. 
458 LNSEMPSTGQK 468 (547) Q Consensus 458 ~~~~~~~~~~~ 468 (547) ++-+...... T Consensus 439 -~~~~~~~~~~ 448 (448) T protein:vir:79 439 -SRYLYTRRRR 448 (448) T ss_pred -ccchhhcccC Confidence 0000111111 No 11 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=100.00 E-value=1.4e-107 Score=606.48 Aligned_cols=425 Identities=16% Similarity=0.159 Sum_probs=339.4 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) ||++.+- -.-. +..+.-..+.+.+++.+++..+..||..|++|.++++|||..+ ..+||++|++ |+||++ T Consensus 1 m~kk~~k-~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~------~~~ly~~m~~-D~hi~s 70 (448) T protein:vir:77 1 MAKRGRK-PKEL--VPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKD------GLLVYHKMLS-DGTVKN 70 (448) T ss_pred CCCCCCC-Cccc--CCcccccchhhhhhhccchhhhcccccccccccchhHhhcccc------chHHHHHHhh-ChHHHH Confidence 9975431 0011 2222323455667888999999999999999999999998532 3689999986 999999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhC-------CCHHHHHHHHHHhhhhcCeeeeeEEee-cCCeeeE Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKR-------EELQQEVFHILDCISKGYSFTEIVWNT-SEGQYFP 152 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~-------~~~~~~i~~~l~a~~~G~sv~Ei~w~~-~~g~~~~ 152 (547) +|++||++|++++|+|+|++++++++++|++|+++|.. 
.+|++++++||+|++|||||+|++|.+ .+|.|.+ T Consensus 71 ~l~~Rk~av~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~lda~~~G~s~~Eivw~~~~dg~~~~ 150 (448) T protein:vir:77 71 ALNYIFGRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYENAYIYGMAAGEIVLTLGADGKLIL 150 (448) T ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHHHHhhhhcceeEEEEEeecCCCceee Confidence 99999999999999999999999999999999999864 489999999999999999999999986 5899999 Q ss_pred EeeEeecc---ceeEecccCCcceeeccCCC-------CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHH Q lcl|NC_020866. 153 DKLIWRDP---RHFRFQKSDLATPLLLDDNG-------AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTA 222 (547) Q Consensus 153 ~~l~~~~~---~~~~~~~~~~~~~~~~~~~~-------~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~ 222 (547) .++.+||| +||.|+.+++.+++++.+.. .++++|..+|++| +++++|||||.||++.|||+|+||++++ T Consensus 151 ~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~-~~~~~g~p~g~gLlr~~~w~~~fK~~~~ 229 (448) T protein:vir:77 151 DKIVPIHPFNIDEVLYDEEGGPKALKLSGEVKGGSQFVNGLEIPIWKTVVF-LHNDDGSFTGQSALRAAVPHWLAKRALI 229 (448) T ss_pred ccccccCCCccceeeeecCCceEEEecCCcccccccCCCccccccceEEEE-ecCCcCCcccchHHHHHHHHHHHHHhhH Confidence 99999998 58999999888887766532 3566777777655 5688999999999999999999999999 Q ss_pred HHHHHHHHHcCCCeEEEeCCCCCC--HHHHHHHHHHHHHH--hcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHH Q lcl|NC_020866. 
223 RDWAIFTQTYGQPLRIGKWQPGAS--EQDKDTLFRAVANI--AGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQ 298 (547) Q Consensus 223 ~~w~~f~E~~G~P~~vgk~~~~~~--~~~~~~l~~al~~~--~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~ 298 (547) ++|+.|+|+||+|++||||+++++ ++++++|++++.++ ++++++|||.|++|||++++++++ .|.++|+|||+ T Consensus 230 ~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~g~~ie~~ea~~~~~---~~~~~i~~~d~ 306 (448) T protein:vir:77 230 LLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMP---DAIPYLTYHDA 306 (448) T ss_pred HHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCcc---CHHHHHHHHHH Confidence 999999999999999999998775 46889999999987 678999999999999999875332 37789999999 Q ss_pred HHHHHHhcchhccCccccccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHH Q lcl|NC_020866. 299 QISKAVLGQTTTTDAVAGGHAVS-QEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTK 377 (547) Q Consensus 299 ~Isk~ilGqtlts~~~ggs~a~~-~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~ 377 (547) +|||+||||||||++++|+++.+ ..|.+++.+++++|+++|+++||++||++|+.+|||+..++|+|+|+..+++|+++ T Consensus 307 ~Isk~iLGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~~e~eDl~~ 386 (448) T protein:vir:77 307 GIARALGIDFNTVQLNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFEMEERNDFSA 386 (448) T ss_pred HHHHHHhccccccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCCChhhHHH Confidence 99999999999998776655544 35668999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhccc Q lcl|NC_020866. 378 LASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLS 457 (547) Q Consensus 378 ~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (547) +++++++|+ +++++++|||++.++.. .+.+..+.++++. .+...++....++...+.. 
+ T Consensus 387 ~a~~~~~l~--------~~~~~~~~ip~~~~~~~----~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~---r 445 (448) T protein:vir:77 387 AANLMGMLI--------NAVKDSEDIPTELKALI----DALPSKMRRALGV------VDEVREAVRQPADSRYLYT---R 445 (448) T ss_pred HHHHhHHHH--------HHHHHHhcCCccCCcCC----CCCchhcccccCC------CCCCCchhhcchhhHHHHh---h Confidence 999999986 46899999998764321 1111111111111 0111111111111111111 1 Q ss_pred chh Q lcl|NC_020866. 458 LNS 460 (547) Q Consensus 458 ~~~ 460 (547) ..+ T Consensus 446 ~~~ 448 (448) T protein:vir:77 446 RRR 448 (448) T ss_pred hcC Confidence 111 No 12 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=100.00 E-value=6.1e-106 Score=597.39 Aligned_cols=384 Identities=14% Similarity=0.165 Sum_probs=321.5 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |-. ++ -.+++|.++++++.. ...+..++. +| +.+||+++ |++..+.+||++|+++|+||++ T Consensus 3 ~~~--------~~-~p~~~~~~~~~~~~~--~~~~~~g~~----~~---D~~lr~~g-g~~~~~~~l~~~m~e~D~~v~s 63 (446) T protein:vir:98 3 MEV--------RN-APTPAIRRRTIYAME--HLGLATSYL----SE---DGGYKRAG-KPTYQQLSAWDEAAQTEPIIAQ 63 (446) T ss_pred ccc--------cC-CCchhhhhhhhhccc--cchhhcccC----Cc---chHhhhcC-CChHHHHHHHHHHHhcchHHHH Confidence 221 11 146788888886432 234445443 44 78999884 7899999999999999999999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeE---- Q lcl|NC_020866. 
81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLI---- 156 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~---- 156 (547) +|++||++|++++|+|+|+ ++++|++|+++|+++.|+.++.+|++|++|||||+|++|++.+|.+.|.++. T Consensus 64 ~l~~Rk~av~~~~w~V~p~-----~~~~a~~v~~~l~~~~~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~ 138 (446) T protein:vir:98 64 GLDSIALSVLNKVGPYQHG-----DKRIKKFIDDQLRNRAKTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIV 138 (446) T ss_pred HHHHHHHHhhcCCceecCc-----cHHHHHHHHHHHhhcCchhHHHHHHHHHhhCceeeeEEEeecccccccchhhcccc Confidence 9999999999999999985 3568999999999999999999999999999999999999998888876542 Q ss_pred eeccc--eeEecccCCcc--------------------------eeeccCCCCceeeccCceEEEeecCCcCccccchhH Q lcl|NC_020866. 157 WRDPR--HFRFQKSDLAT--------------------------PLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLA 208 (547) Q Consensus 157 ~~~~~--~~~~~~~~~~~--------------------------~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll 208 (547) .+.|. +|.++.+++.. .+.....+.++++|+.||++|+|+++++||||.||+ T Consensus 139 ~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~~~~~~~p~G~gLl 218 (446) T protein:vir:98 139 NYHPLQVMLIANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINYNTKGNNPWGTSCL 218 (446) T ss_pred ccccccceeeeccCCccccccccchhhcccccccCcccchhhhhhhhcccCcccccccccceEEEEecCCCCCccccchH Confidence 22222 23444433221 011123345678999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHH-------------HHHHHHHHHHhcCeeEEe-----cC Q lcl|NC_020866. 
209 RVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDK-------------DTLFRAVANIAGDCAAII-----PE 270 (547) Q Consensus 209 ~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~-------------~~l~~al~~~~~~~~~vi-----p~ 270 (547) |.|||+|+||++++++|+.|+|+||+|+++||||++++++++ ++|+++++++++++++|+ |+ T Consensus 219 r~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~ 298 (446) T protein:vir:98 219 TSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQ 298 (446) T ss_pred HHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCC Confidence 999999999999999999999999999999999999887666 359999999999999998 99 Q ss_pred CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhcc---CccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 271 TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTT---DAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDL 347 (547) Q Consensus 271 ~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts---~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~l 347 (547) |++|||+++++ ++..+|+++|+|||++|||+|||||||+ ++++||+|+|+||++|+.+++++|+++||++||+|| T Consensus 299 g~eie~~ea~~--~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~aDa~~i~~tln~~L 376 (446) T protein:vir:98 299 PVQVGALTTGN--NFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGKINSIFDTVIHAFTEQV 376 (446) T ss_pred CceEEeecccc--CChhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999875 3334699999999999999999998864 245799999999999999999999999999999999 Q ss_pred HHHHHHhcCCCccccc-----EEEecCccccCHHHHHHHHHHHHhCCCCC--CHHHHHHHhCCCCCCCcc Q lcl|NC_020866. 
348 IRPWIQLEYGPQKRYP-----RIKIARPKTEDLTKLASSLDTLVRLGMEI--EENEVRSRFGFSQPKPGA 410 (547) Q Consensus 348 i~~l~~~Nfg~~~~~p-----~~~~~~~~~ed~~~~a~~~~~l~~~G~~v--~~~~~~e~~Glp~p~~~e 410 (547) |+|||.+||++...+| +++|+..+++|++++++++++|+++|+.+ +++|++++||||++.++- T Consensus 377 i~~l~~lNf~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~~~ 446 (446) T protein:vir:98 377 IGNLIRLNFDPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAISST 446 (446) T ss_pred HHHHHHhCCCccccccccccccceeccCChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCCCC Confidence 9999999998765543 46788889999999999999999999854 588999999998766543 No 13 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=100.00 E-value=2.8e-84 Score=478.62 Aligned_cols=328 Identities=17% Similarity=0.192 Sum_probs=247.2 Q ss_pred eeeeEEeecCCeeeEEeeEeeccc---eeEecccCCcceeecc--CCCCceeeccCceEEEeecCCcCccccchhHHHHH Q lcl|NC_020866. 138 FTEIVWNTSEGQYFPDKLIWRDPR---HFRFQKSDLATPLLLD--DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAA 212 (547) Q Consensus 138 v~Ei~w~~~~g~~~~~~l~~~~~~---~~~~~~~~~~~~~~~~--~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~ 212 (547) |+||+|++++|.|.|.+|.+|||+ ||.++.+++...+... .+.+++++|++|||+|+|+++++||||.||++.|| T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~g~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~ 80 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLVAIEQWGVFGKATVRIPVDRLVVFVNEREGANWLGQSLLRQAY 80 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCceeEEEecCCCCCCcceeccCCEEEEEeCCCCCCccchhhHHHHH Confidence 999999999999999999999998 5556665554333322 33467899999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCC--C-----------CHHHHHHHHHHHHHHhc--CeeEEecCCceEEEE Q lcl|NC_020866. 
213 WGWMFKAFTARDWAIFTQTYGQPLRIGKWQPG--A-----------SEQDKDTLFRAVANIAG--DCAAIIPETMSIDFV 277 (547) Q Consensus 213 ~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~--~-----------~~~~~~~l~~al~~~~~--~~~~vip~~~~ie~~ 277 (547) |+|+||++++++|+.|+||||+|+++++++.+ + +.++++.+.++++++.. ++++|||.|++|||+ T Consensus 81 w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~ie~~ 160 (355) T protein:vir:78 81 KNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANFTLT 160 (355) T ss_pred HHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecCCceEEEe Confidence 99999999999999999999776666665543 2 34567788888888754 589999999999999 Q ss_pred ecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCc--cccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 278 ESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDA--VAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLE 355 (547) Q Consensus 278 e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~--~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~N 355 (547) ++++++ . .|.++|+|||++|||+|||||||+++ +|||+|+|++|++|+++++++|+++|+++||++||++|+.+| T Consensus 161 ea~g~~--~-~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN 237 (355) T protein:vir:78 161 GVQGKL--P-EMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHIADVTQQHVVEDLVDQN 237 (355) T ss_pred ecCCCc--c-cHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 976432 2 48889999999999999999999864 569999999999999999999999999999999999999999 Q ss_pred CCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCC----HHHHHHHhCCCCCCCcchhccccccccccccccccccc Q lcl|NC_020866. 
356 YGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIE----ENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPG 431 (547) Q Consensus 356 fg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~----~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~ 431 (547) |++...+|+|+|+..++ +.+++++++++|+++|+.++ ++|++++||||+|.++++.+.+..++.++.......++ T Consensus 238 ~~~~~~~P~~~~~~~~~-~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (355) T protein:vir:78 238 WGPEEPAPRLVPAQLGK-EQPVTAEAIRALVECGAFTADPELEKDLRARYGLPAPAERDDGADAAAAKAAGRRRAKRLPG 316 (355) T ss_pred CCCCCCCCEEEecCcCh-hHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCCcccCCccccccccccccccCC Confidence 99989999999987664 55678999999999999887 46999999999999888776655444433333222222 Q ss_pred ccCCccccccc-cccccchhhhhhcccchhhhcccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_020866. 432 AGDLKTAERKL-KPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIAALTDQLELEGQPEIVAMIEEIRTMLEAAS 504 (547) Q Consensus 432 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~l~~a~ 504 (547) .......+... ................+ ....+. ..++ T Consensus 317 ~~~~~~~~a~~~~a~~~~~~~~~~~~~~~-~~~~~~----------------------------------~~~~ 355 (355) T protein:vir:78 317 QRQGAALPSRSPRADPPRRRGPLRRRPRH-PAHRRC----------------------------------APDG 355 (355) T ss_pred ccccccccccCCCCCChhhhHHHHHHhhc-cccCCC----------------------------------CCCC Confidence 21111111110 01111111111100111 111111 1111 No 14 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=99.96 E-value=1.5e-28 Score=173.12 Aligned_cols=493 Identities=12% Similarity=0.099 Sum_probs=290.1 Q ss_pred eeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_020866. 
7 LLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRK 86 (547) Q Consensus 7 ~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk 86 (547) ++=..|+.+..++..+-. ..++..+.+.|..+..+. ....++..+..+-+.|.+|++.+- T Consensus 1 ~~~~~~~~~~~p~~~~~~-----~~~~~~~~~~~~~g~~~~---------------~~~~~~~~~~~~~~~V~acV~~IA 60 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELS-----PQMQDSYYYAPAVGMQLE---------------RQFSLYGGIYKNQPWVRTVIAKRA 60 (518) T ss_pred CcccCceeeccchhhhhh-----hhhhhcccccceeceecc---------------cccchhhHHhhhhHHHHHHHHHHH Confidence 455667766655432221 123333333343333221 112222222236799999999999 Q ss_pred HHHhcCCceEecCCCCHHHHHHHHHHHHHHhCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccc Q lcl|NC_020866. 87 RSVSQLDITVEPASDDPQDVKIADMVRDWLKRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPR 161 (547) Q Consensus 87 ~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~ 161 (547) ..|.+++|++.-.+++...++.-..+..++.++ ...++++.++ +.+.+|.+++++++...+ .+..|.+++|. T Consensus 61 ~~iA~lp~~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G---~~~~L~~l~p~ 137 (518) T protein:vir:78 61 QALARLPVKCMFTSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG---TPEKLMPMHPS 137 (518) T ss_pred HhhccCceEEEEEcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCC---cEEEEEEECCC Confidence 999999999964332221111112234455555 3556666666 455799999999875433 36789999999 Q ss_pred eeEecccCC-c--ceeeccCCC---CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_020866. 162 HFRFQKSDL-A--TPLLLDDNG---AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQP 235 (547) Q Consensus 162 ~~~~~~~~~-~--~~~~~~~~~---~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P 235 (547) ++++..+.. . 
.+......+ ..+.+++...|++++...++..+|.+.+..+....-.-....++...|...-|.| T Consensus 138 ~Vtv~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p 217 (518) T protein:vir:78 138 RVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRP 217 (518) T ss_pred ceEEEEcCCCCEEEEEEEecCCccceeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Confidence 887654432 2 222222222 2356888888777776666777899999999988888888888999999999999 Q ss_pred eEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc--h Q lcl|NC_020866. 236 LRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ--T 308 (547) Q Consensus 236 ~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq--t 308 (547) --|-+++...++++++++.+.+.+.-+ +..+|++.|++++-+..+.. ...|.+..++...+|+++. |- . T Consensus 218 ~gvl~~~~~ls~e~~~~~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~---d~q~le~r~~~~~eIa~af-gVPp~ 293 (518) T protein:vir:78 218 NLVLRHEKRLSPEAQQRLREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAV---EMQFIEARQLNREEVCGVY-DIAPP 293 (518) T ss_pred cEEEecCCCCCHHHHHHHHHHHHHHhcCcccCCceeEcCCCceEEeccCChh---HHHHHHHHHHHHHHHHHHh-CCCHH Confidence 888899888899999988888876532 24688899988876654321 2247777788889999873 32 2 Q ss_pred hccCccccccchhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEec--CccccCHHHHHHHHHHH Q lcl|NC_020866. 309 TTTDAVAGGHAVSQEHRL-VQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIA--RPKTEDLTKLASSLDTL 385 (547) Q Consensus 309 lts~~~ggs~a~~~vh~~-v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~--~~~~ed~~~~a~~~~~l 385 (547) +....++++++-.+.+.. ....-+...++.|+..||+.|+..+ . ... 
+|+|+ .....|.+.+++.+.++ T Consensus 294 ~lg~~~~st~sn~e~~~~~f~~~tL~P~~~~ie~eln~~L~~~~-----~-~~~--~~~fd~~~Llr~D~~~r~~~~~~~ 365 (518) T protein:vir:78 294 IVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYW-----V-RKN--RMKFDIDDVIQPDWEAKSESTQKM 365 (518) T ss_pred HhccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----c-Ccc--eEEeechhhhccCHHHHHHHHHHH Confidence 333333456654444443 3455688889999999998775432 1 112 45554 44468999999999999 Q ss_pred HhCCCCCCHHHHHHHhCCCCCC--Ccchhcccccc-cccccccccccccccCCccccccccccccchhhhhhcccchhhh Q lcl|NC_020866. 386 VRLGMEIEENEVRSRFGFSQPK--PGAKLLRPLEN-PATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEM 462 (547) Q Consensus 386 ~~~G~~v~~~~~~e~~Glp~p~--~~e~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (547) ++.|+ ++.+++|+.+|+|+-+ .+++++..... +-.............+.+..+..........+............ T Consensus 366 ~~~G~-lT~NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (518) T protein:vir:78 366 VNSGV-ATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTN 444 (518) T ss_pred HhCCC-cCHHHHHHHhCCCCCCCCCCceeeecccceecccccccccCCCCCCCCCCCCcccccccccCccccCCCCCccc Confidence 99998 9999999999998644 34444322111 10000000000000000000000000000111111111000000 Q ss_pred cccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHhcCCCHHHHHH-HHHHHHHHHHHHhH Q lcl|NC_020866. 463 PSTGQKTGGAEIAALTDQLELEGQPEIVAMIEEIRTMLEAASSLGEFSDMLDARYGEIGASQLAS-RISAAMLASEFAGR 541 (547) Q Consensus 463 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~~~~~~l~~-~l~~al~~A~~~G~ 541 (547) ........++.-..++..- -......+.++.|...+..+++++.|.-.|.+.||+ .|++ +|+--+.+|+-.-. 
T Consensus 445 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~ 518 (518) T protein:vir:78 445 SDRSTDSGKTEPRRLMQKP--PPKESSPKHLRAVKGAMGRGKDIKGFALQLAEKYPD----DLEDILLAVQLALAERKDN 518 (518) T ss_pred ccccccccccchhcccCCC--CcccccchHHHHHHHHhhcCCcchhhhhhhhhhcch----hHHHHHHHHHHhhhhccCC Confidence 0011111111111111110 112234567788899999999999999999999985 5555 44555555443322 No 15 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=99.96 E-value=2e-28 Score=172.41 Aligned_cols=494 Identities=12% Similarity=0.095 Sum_probs=288.5 Q ss_pred eeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_020866. 7 LLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRK 86 (547) Q Consensus 7 ~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk 86 (547) ++=..|+.+.+++..+... .++..+.+.+..+..+ .....++..+..+-+.|.+|++.+- T Consensus 1 ~~~~~~~~~~~p~~~e~~~-----~~~~~~~~~~~~~~~~---------------~~~~~~~~~~a~~~~~V~acV~~IA 60 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSP-----QMQDSYYYAPAVGMQL---------------ERQFSLYGGIYKNQPWVRTVIAKRA 60 (518) T ss_pred CcccCceeecCchhhhhhh-----hhhcccccccccceec---------------ccccchhhHHHhhhHHHHHHHHHHH Confidence 4445677666654332211 2223333333222221 1112222222336789999999999 Q ss_pred HHHhcCCceEecCCCCHHHHHHHHHHHHHHhCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccc Q lcl|NC_020866. 87 RSVSQLDITVEPASDDPQDVKIADMVRDWLKRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPR 161 (547) Q Consensus 87 ~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~ 161 (547) ..|.+++|.+.-..++...+..-..+..++.++ ...++++.++ +.+.+|.+++++++..+| .+..|.++++. 
T Consensus 61 ~~iA~lpl~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G---~~~~L~~l~p~ 137 (518) T protein:vir:10 61 QALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG---TPEKLMPMHPS 137 (518) T ss_pred HhhccCceEEEEEcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC---cEEEEEEECCC Confidence 999999999854322111111112234455554 3566777777 466899999999875433 46789999999 Q ss_pred eeEecccCC---cceeeccCCC---CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_020866. 162 HFRFQKSDL---ATPLLLDDNG---AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQP 235 (547) Q Consensus 162 ~~~~~~~~~---~~~~~~~~~~---~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P 235 (547) ++++..+.. ..+......+ ..+.+++...|++++...++..+|.+.+..+....-.-....++...|.+.-|.| T Consensus 138 ~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p 217 (518) T protein:vir:10 138 RVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRP 217 (518) T ss_pred ceEEEEcCCCCEEEEEEEecCCccceEEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Confidence 887654432 2222222222 2356788888877777667777999999999998888888899999999999999 Q ss_pred eEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-chh Q lcl|NC_020866. 236 LRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-QTT 309 (547) Q Consensus 236 ~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-qtl 309 (547) --+.+++...++++++++.+.+.+.-+ +..+|++.|++++-+..+.. 
...|.+..++...+|+++.-- ..+ T Consensus 218 ~gil~~~~~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~---D~q~le~r~~~~~eIa~afgVPp~~ 294 (518) T protein:vir:10 218 NLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAV---EMQFIEARQLNREEVCGVYDIAPPI 294 (518) T ss_pred cEEEecCCCCCHHHHHHHHHHHHHHhcCccccCcceEcCCCceEEEccCChh---HHHHHHHHHHHHHHHHHHhCCCHHH Confidence 888899988899999888888876533 23688999988876653321 123777778888999987421 223 Q ss_pred ccCccccccchhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEec--CccccCHHHHHHHHHHHH Q lcl|NC_020866. 310 TTDAVAGGHAVSQEHRL-VQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIA--RPKTEDLTKLASSLDTLV 386 (547) Q Consensus 310 ts~~~ggs~a~~~vh~~-v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~--~~~~ed~~~~a~~~~~l~ 386 (547) ....++++++-.+.+.. ....-+...++.|+..||+.|+... . ... +|+|+ .....|.+.+++.+.+++ T Consensus 295 lg~~~~~t~sn~eq~~~~f~~~tL~P~l~~ie~~ln~~L~~~~-----~-~~~--~~~fd~~~llr~D~~~r~~~~~~~~ 366 (518) T protein:vir:10 295 VHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYW-----V-RKN--RMKFDIDDVIQPDWEAKSESTQKMV 366 (518) T ss_pred hccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----c-CCc--eEEEechhhhccCHHHHHHHHHHHH Confidence 33333455654444433 3455688889999999998775442 1 122 45554 444689999999999999 Q ss_pred hCCCCCCHHHHHHHhCCCCCC--Ccchhcccccc-cccccccccccccccCCccccccccccccchhhhhhcccchhhhc Q lcl|NC_020866. 387 RLGMEIEENEVRSRFGFSQPK--PGAKLLRPLEN-PATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMP 463 (547) Q Consensus 387 ~~G~~v~~~~~~e~~Glp~p~--~~e~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (547) +.|+ ++.+++|+.+|+|+-+ .+++++..... +-.............+.+..+........................ 
T Consensus 367 ~~G~-lT~NE~R~~~Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (518) T protein:vir:10 367 NSGV-ATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNS 445 (518) T ss_pred hCCC-cCHHHHHHHhCCCCCCCCCCCeeeecccceecccccccccCCCCCCCCCCCCccccccccccccccCCCCCcccc Confidence 9998 9999999999998643 34444322111 000000000000000000000000000000000010000000000 Q ss_pred ccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHhcCCCHHHHHH-HHHHHHHHHHHHhH Q lcl|NC_020866. 464 STGQKTGGAEIAALTDQLELEGQPEIVAMIEEIRTMLEAASSLGEFSDMLDARYGEIGASQLAS-RISAAMLASEFAGR 541 (547) Q Consensus 464 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~~~~~~l~~-~l~~al~~A~~~G~ 541 (547) .......++.-...+. ..-......+.++.|...+..+++++.|.-.|.+.||+ .|++ +|+--+.+|+-.-. T Consensus 446 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~ 518 (518) T protein:vir:10 446 DRSTDSGKTEPRRLMQ--KPPPKESSPKHLRAVKGAMGRGKDIKGFALQLAEKYPD----DLEDILLAVQLALAERKDN 518 (518) T ss_pred cccccccccchhcccc--CCCcccccchHHHHHHHHhhcCccchhHhhhhhhhcch----hHHHHHHHHHHhhhhccCC Confidence 0000000110000110 01122344567788899999999999999999999985 4555 44555555443322 No 16 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=99.91 E-value=1e-23 Score=146.64 Aligned_cols=393 Identities=13% Similarity=0.087 Sum_probs=244.2 Q ss_pred CCCCCeeeCC-CCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHH Q lcl|NC_020866. 1 MAQKPVLLDR-WGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYL 79 (547) Q Consensus 1 ma~~~~~~~~-~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~ 79 (547) |. |++. ++++......+.... .+ ++ ...+ ...|..... +..+ +=+.|. 
T Consensus 1 Mg----l~~~~f~~~~~~~~~~~~~~-------------~~----~~---~~~~--~~~g~~v~~----~~al-~~~~v~ 49 (409) T protein:vir:84 1 MS----LFTRIFSGPSEERTLTKISG-------------IP----SP---AEDW--AMHGDRPGA----NSAM-TLGAFY 49 (409) T ss_pred Cc----hhhhhhcCCCcccccccccc-------------cc----cc---cchh--hccCcccch----hhhh-ccHHHH Confidence 88 6774 444432222211110 00 00 0000 001111110 0111 347799 Q ss_pred HHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-C----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEE Q lcl|NC_020866. 80 GVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-R----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPD 153 (547) Q Consensus 80 s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~ 153 (547) +|++.+...+.+++|++.-.++....+. .-+..+|. + +.+.++++.++ +.+.+|-+++++.+...+| .+. T Consensus 50 ~~v~~ia~~iA~lp~~~~~~~~~~~~~~--~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g--~~~ 125 (409) T protein:vir:84 50 ACVTLLADTVASLSIDAYRKKDNVRIPV--SPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEAN--RPT 125 (409) T ss_pred HHHHHHHHhhhhCceEEEEecCCccccc--chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCC--ceE Confidence 9999999999999999975444322211 12333442 3 35777888777 5778999998887754455 467 Q ss_pred eeEeeccceeEec--ccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 154 KLIWRDPRHFRFQ--KSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQT 231 (547) Q Consensus 154 ~l~~~~~~~~~~~--~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~ 231 (547) .|.+++|.++.+. .+....++.......|..+++..+|++++....+..+|.+.+..+....-.-....++...|... 
T Consensus 126 ~L~~l~p~~v~v~~~~~~~~~~~~~~~~~~g~~~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~n 205 (409) T protein:vir:84 126 AIMPIHPDCIHVTDAKDEDGDWIEPVYRIDGKVVPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRD 205 (409) T ss_pred EEEEEcCceeEEEEcCCCcceEEEEEecCCceEEchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 8999999887654 33334444433344567889888888888888888899999999998888888888899999999 Q ss_pred cCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCe--eEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc-- Q lcl|NC_020866. 232 YGQPLRIGKWQPGASEQDKDTLFRAVANIAGDC--AAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ-- 307 (547) Q Consensus 232 ~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~--~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq-- 307 (547) -|.|--+-+++...++++++++.+.+.+...++ .+|++.|++++-+..+. ....|.+..++..++|+++ +|- T Consensus 206 g~~p~gil~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~~~~~~---~d~q~~e~~~~~~~~Ia~~-fgVPp 281 (409) T protein:vir:84 206 SANPSGILSSDADLTPDQVKQTQKQWIQSHHNRRLPAVMSAGIKWQSVSITP---NESQFLETRSFQRSEIAMW-FRIPP 281 (409) T ss_pred CCCccEEEecCCCCCHHHHHHHHHHHHHHhccCCCeeecCCCceEEEccCCh---hHHHHHHHHHHHHHHHHHH-hCCCH Confidence 999988889988889999988888777765543 68889988877654332 1224667777888889887 332 Q ss_pred hhccCccccccch---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHH Q lcl|NC_020866. 308 TTTTDAVAGGHAV---SQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDT 384 (547) Q Consensus 308 tlts~~~ggs~a~---~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~ 384 (547) .+.+...+++... .+........-+.--++.|+..||+.|. 
...+-+|.+......|.+.+++++.+ T Consensus 282 ~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~L~----------~g~~i~fd~~~l~~~d~~~~~~~~~~ 351 (409) T protein:vir:84 282 HMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRCIEQALDTFLP----------RGQFVKFNVDGLMRGDVTARFTAYQM 351 (409) T ss_pred HHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------CCCeEEEechhhhccCHHHHHHHHHH Confidence 2333222233211 2222334456677888889999987641 12233455555566899999999999 Q ss_pred HHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccc Q lcl|NC_020866. 385 LVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKP 444 (547) Q Consensus 385 l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (547) +++.|+ ++.+++|+.+|+|+-+.+++.+.+.........++ ..++....+......+. T Consensus 352 ~~~~G~-~t~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~-~~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 352 GLQNGI-WSVNEVRAWEDAPPIPEGDIHLQPMNFVPLGYVPP-EEPAQEPQPNSATEGNK 409 (409) T ss_pred HHhCCC-cCHHHHHHHhCCCCCCCcceeeecccccccccCCc-cccCcCCCCCCccCCCC Confidence 999997 99999999999997766666554432211111111 11111111111100100 No 17 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=99.91 E-value=9.9e-24 Score=146.71 Aligned_cols=402 Identities=10% Similarity=0.019 Sum_probs=248.3 Q ss_pred CCCCCeeeCC-CCCcccchhhhhh-hhhhhhhh-hHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHH Q lcl|NC_020866. 1 MAQKPVLLDR-WGNPVQRAQLTEE-VAAPTVTG-VRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPH 77 (547) Q Consensus 1 ma~~~~~~~~-~g~~~~~~~l~~~-~a~~~~~~-~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~ 77 (547) |. |+|. +++.-.....+.. .....+.. .-..+..+ +..+ .. . +...-..+-+. 
T Consensus 1 MG----~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---g~~~----------~~-~------v~~~~al~~~~ 56 (422) T protein:vir:13 1 MG----FLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKF---GIKL----------NF-S------VRGKRALKENT 56 (422) T ss_pred Cc----hhhhhhhccCCccchhhhhhhccccccCcchhhhhc---cccC----------Cc-c------cchhhhhccHH Confidence 88 6765 3332222111110 00000000 00111111 1111 00 0 00000013467 Q ss_pred HHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHH-HhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 78 YLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDW-LKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 78 v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~-l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) |.+|++.+-..|.++++.|....+...+..+...+... -..+.+.++++.++ +.+.+|-+.+++++... | .+..| T Consensus 57 v~~ci~~ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-G--~~~~L 133 (422) T protein:vir:13 57 VYVCTKIRAESIGKLSLKIYKDKEEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRK-G--KIIGL 133 (422) T ss_pred HHHHHHHHHHhhhhCceEEEecCcccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEE Confidence 99999999999999999997544333332333333211 11234667777777 46779999999987654 3 36789 Q ss_pred EeeccceeEecccCCc--------ceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 156 IWRDPRHFRFQKSDLA--------TPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAI 227 (547) Q Consensus 156 ~~~~~~~~~~~~~~~~--------~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 227 (547) .+++|.++.+..+... .+.....++....++++..|++++....+..+|.|.+..+....-.-....++... 
T Consensus 134 ~~i~~~~v~~~~~~~~~~~~~~~~~y~~~~~~g~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 213 (422) T protein:vir:13 134 YPINSDNVTKIIDDDNFLSSLSKVWYVVTDKNGKEHKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINK 213 (422) T ss_pred EEECCcceEEEEcCCcceeccceEEEEEEeCCCeEEEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 9999998876544332 22333445555678899888888776677789999999999988888888888888 Q ss_pred HHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHH Q lcl|NC_020866. 228 FTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISK 302 (547) Q Consensus 228 f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk 302 (547) |...-+.|--+.+++...++++++++.+.+.+..+ +..+|++.|++++-+..+. ....|.+..++...+|++ T Consensus 214 ~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~---~d~q~le~~~~~~~~Ia~ 290 (422) T protein:vir:13 214 FFKNGLSIKGIVQYVGDLDEKAKKIFKKEFESMSNGLENAHSISLLPFGYQFQPISLSM---ADAQFLENSKLTKRELAA 290 (422) T ss_pred HHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceeeeccCCh---hHHHHHHHHHHHHHHHHH Confidence 99998999999999988899999999988887643 3468899999887654332 222467777888899998 Q ss_pred HHhc-chhccCccccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHH Q lcl|NC_020866. 303 AVLG-QTTTTDAVAGGHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLAS 380 (547) Q Consensus 303 ~ilG-qtlts~~~ggs~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~ 380 (547) +.-- ..+.+..++++++..+.+. .....-+...++.|+..||+.|+...-... 
....+|.+......|.+.+++ T Consensus 291 ~fgVpp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~----g~~i~fd~~~l~r~d~~~~~~ 366 (422) T protein:vir:13 291 TFGMKSYHLNDLERATFNNLTEQQKDFYVTTLQSSLTVYEQEIQDKLFSQYETLQ----DVKAEFNVDTILRSDIKTRYE 366 (422) T ss_pred HhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCChhhhcC----CceEEeechhhhcCCHHHHHH Confidence 8421 2333443445555444333 445667888899999999998876542211 112234334445568999999 Q ss_pred HHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccc Q lcl|NC_020866. 381 SLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTA 438 (547) Q Consensus 381 ~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (547) .++++++.|+ ++.+++|+.+|+|+-+.++..+.+.... +....+...+..+..... T Consensus 367 ~~~~~~~~G~-~T~NE~R~~~gl~p~~ggD~~~~~~n~~-~l~~~~~~~~~~g~~~g~ 422 (422) T protein:vir:13 367 AYRIGIQGGF-IEANEARRRENLPPVEGGDRLLVNGNMI-PIEMAGEQYKKGGEKGGK 422 (422) T ss_pred HHHHHHhCCC-cCHHHHHHHhCCCCCCCcCeeeeccCcc-chhhcccccccCCCcCCC Confidence 9999999998 9999999999999776666655432211 111100000000000000 No 18 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=99.91 E-value=1.3e-22 Score=140.52 Aligned_cols=483 Identities=8% Similarity=-0.026 Sum_probs=252.7 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhh---ccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYP---GDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPH 77 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~---~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~ 77 (547) |. +.+|+-++++..... ..+.+-..+ ....++ ..++.-....+...+..+.+. 
T Consensus 71 ~k-k~~i~~pfkkk~~~~-------------~~d~f~~s~es~s~vtsl----------s~pdaf~~vnVs~~~Alknsa 126 (945) T protein:vir:10 71 LK-KEKIIVPYNHQEPPF-------------KFNLFEYSPESLMYLPSI----------SDPDAFFLINLFRKYRFNNDS 126 (945) T ss_pred HH-hhcccccccccccch-------------hhhhhhccCccceecccc----------cCccceeeehhhhhhhhccHH Confidence 33 445554444422211 111111111 000000 000000011122333336789 Q ss_pred HHHHHHHHHHHHhcCCceEecCC--CC-H-HHHH--HHHHHHHHHhCCC--------HHHHHHHHH-HhhhhcCeeeeeE Q lcl|NC_020866. 78 YLGVIGTRKRSVSQLDITVEPAS--DD-P-QDVK--IADMVRDWLKREE--------LQQEVFHIL-DCISKGYSFTEIV 142 (547) Q Consensus 78 v~s~l~~Rk~~v~~~~w~v~p~~--~~-~-~~~~--~ae~v~~~l~~~~--------~~~~i~~~l-~a~~~G~sv~Ei~ 142 (547) |.+|++.+...|.++++++.-.. +. . ..++ ....+..+|++++ |..+++.++ +.+.+|-+.++++ T Consensus 127 V~scI~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIi 206 (945) T protein:vir:10 127 KLIKVSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKI 206 (945) T ss_pred HHHHHHHHHhhhccCceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEE Confidence 99999999999999999984321 11 0 0011 1223455665543 445666665 6788999999998 Q ss_pred EeecCCeeeEEeeEeeccceeEeccc--CCcceee--ccCCCCceeeccCceEEEeecCCcC---ccccchhHHHHHHHH Q lcl|NC_020866. 143 WNTSEGQYFPDKLIWRDPRHFRFQKS--DLATPLL--LDDNGAEVPLPAFRFIFADIPAKSG---ILLRSGLARVAAWGW 215 (547) Q Consensus 143 w~~~~g~~~~~~l~~~~~~~~~~~~~--~~~~~~~--~~~~~~g~~l~~~k~i~~~~~~~~~---~p~G~gll~~~~~~~ 215 (547) +..+| .+..|.+++|.++++..+ ++..+.. ..++.....+++...|+|.+....+ .+||.+.+..+.... 
T Consensus 207 Rd~~G---~ii~L~pLdPs~Vti~~ddDG~~~y~Yv~~idG~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI 283 (945) T protein:vir:10 207 RDEQG---NLVAITPVDGTTIKPILSEDTGIVVGYVQEVDGAIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVI 283 (945) T ss_pred ECCCC---cEEEEEEECCcceEEEEcCCCcEEEEEEEecCCceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHH Confidence 75443 356899999998876443 3222222 2233344577888888887665433 346888899998888 Q ss_pred HHHHHHHHHHHHHHHHcC-CCeEEEe----------CCCCCCHHHHHHHHHHHHHHhcC----eeEEecCCceEEEEecC Q lcl|NC_020866. 216 MFKAFTARDWAIFTQTYG-QPLRIGK----------WQPGASEQDKDTLFRAVANIAGD----CAAIIPETMSIDFVESK 280 (547) Q Consensus 216 ~~K~~~~~~w~~f~E~~G-~P~~vgk----------~~~~~~~~~~~~l~~al~~~~~~----~~~vip~~~~ie~~e~~ 280 (547) -......++-+.+..+.| +|--+-+ ++...++++++++.+.+.+..++ ..+|++.|++++=+..+ T Consensus 284 ~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVLdeGmef~pLs~s 363 (945) T protein:vir:10 284 LSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILSGGKFTWIDFKGK 363 (945) T ss_pred HHHHHHHHHHHHHHHhCCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCcccccceecCCCceEEEccCC Confidence 777777777777776555 6743433 34456788888888888876432 23577888877655432 Q ss_pred CCCCchHHHHHHHHHHHHHHHHHHhc-chhccCccccccchhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_020866. 281 SIGSSVDLYEKRINHLDQQISKAVLG-QTTTTDAVAGGHAVSQEHRLVQ-EDIETSDAMALAAILNRDLIRPWIQLEYGP 358 (547) Q Consensus 281 ~~~~~~~~f~~li~~~d~~Isk~ilG-qtlts~~~ggs~a~~~vh~~v~-~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~ 358 (547) . . ...|.+..++..++|+++.-- ..+.....+++++..+.+...+ ..-+...++.++..||+.|.+.. 
T Consensus 364 ~--~-DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~st~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~------- 433 (945) T protein:vir:10 364 R--R-DMQFKELAEFVARKICAVYQVSPQDVGILEGSNKATAEVMASLTKAKGLEPLMATISKGFDEVVSEFR------- 433 (945) T ss_pred h--h-HHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc------- Confidence 2 2 224677778888889988421 2233333344555555555544 56789999999999998654321 Q ss_pred cccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccc Q lcl|NC_020866. 359 QKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTA 438 (547) Q Consensus 359 ~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (547) ....-+|.|+.....|.+.+++.++++.+.|+ ++.+++|+.+|+|+-+.|+..+.+...-.+.........+..+...+ T Consensus 434 eg~~i~fdFd~ldl~D~ksraEal~kli~sGi-LTiNEvRe~lGLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~a 512 (945) T protein:vir:10 434 NEKDIKLWFKEDDLEKERDWWNIIQGQLNTGF-RSINEARMEKGLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLA 512 (945) T ss_pred cCceeEEEecchhccCHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCcceeeeccccccccccccccccCCCCcccc Confidence 12233778887778888999999999999998 99999999999998776776654322111111111111111100000 Q ss_pred cc-cccccccchhhhhhcccchhhhcccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHh Q lcl|NC_020866. 439 ER-KLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIAALTDQLELEGQPEIVAMIEEIRTMLEAASSLGEFSDMLDARY 517 (547) Q Consensus 439 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~ 517 (547) .. ...+.. ......+....... .++....+...+.+.+++.. .++| +.|.++- T Consensus 513 q~~~dqp~~--------kGGe~dEns~~psE-~kda~~e~~~~l~~~~~~~a---~e~i--------------~~~~e~~ 566 (945) T protein:vir:10 513 QAMADQPSQ--------QGGGVDENSSVPSE-QKNAGLEVLRNLFKSLDANA---SENL--------------KQVIELT 566 (945) T ss_pred cCCCCCCCC--------CCCCCCCCCCCCCc-ccchHHHHHHHHHHHHHHHH---HHHH--------------HHHHhhc Confidence 00 000000 00000000000000 11111111111111111111 1122 2222333 Q ss_pred cCCCHHHHHHHHHHHHHHHHHHhHHhhccC Q lcl|NC_020866. 
518 GEIGASQLASRISAAMLASEFAGREGALDG 547 (547) Q Consensus 518 ~~~~~~~l~~~l~~al~~A~~~G~~~~~~g 547 (547) .+-...+.+++|.+.+..--|-|-..-.+. T Consensus 567 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 596 (945) T protein:vir:10 567 NDDNYLKEKELLTRVLKSVGLDSVSEFIEN 596 (945) T ss_pred CCCchhHHHHHHHHHHHHhhhHHHHHHHhc Confidence 333344455555555433222221111111 No 19 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=99.91 E-value=2.2e-23 Score=144.85 Aligned_cols=397 Identities=13% Similarity=0.080 Sum_probs=243.8 Q ss_pred eeeCC-CCCcccchhhhhhhhhhhhhhhHHHHhhhhcc---CCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHH Q lcl|NC_020866. 6 VLLDR-WGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGD---GLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGV 81 (547) Q Consensus 6 ~~~~~-~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~---~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~ 81 (547) .|++. +++.-.. +.... .........+.+.+.. .+++ +..+ +-+.|.+| T Consensus 1 m~~~~~f~~~~~~---~~~~~-~~~~~~~~~~~~~~~~~~~~v~~----------------------~~al-~~~~v~~~ 53 (416) T protein:vir:12 1 MLLERMFEKRSGS---SDHED-GFNNILLNMFGGRKTASGERVSE----------------------SNSL-VQPDIFAC 53 (416) T ss_pred CccchhcccccCc---cccCc-cchhHHHHhhcCcccccCceech----------------------hhhh-ccHHHHHH Confidence 34544 3332211 11111 1111111222221111 0111 0111 35779999 Q ss_pred HHHHHHHHhcCCceEecCCCC--HHHHHHHHHHHHHHhC----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEe Q lcl|NC_020866. 82 IGTRKRSVSQLDITVEPASDD--PQDVKIADMVRDWLKR----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDK 154 (547) Q Consensus 82 l~~Rk~~v~~~~w~v~p~~~~--~~~~~~ae~v~~~l~~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~ 154 (547) ++.+...|.+++|++.-..++ ...+++ ....-++.+ +...++++.++ +.+.+|-+++++++... | .+.. 
T Consensus 54 i~~Ia~~ia~l~~~~~~~~~~~~~~~~~~-~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~-G--~~~~ 129 (416) T protein:vir:12 54 VNVLSDDIAKLPIHTYKRTDGGIERKPEH-KSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSH-G--YPEA 129 (416) T ss_pred HHHHHHhhhhCceEEEEecCCcccccccc-HHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEE Confidence 999999999999998543221 111111 111122233 35677777777 45779999999987543 2 3778 Q ss_pred eEeeccceeEecc--cCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020866. 155 LIWRDPRHFRFQK--SDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTY 232 (547) Q Consensus 155 l~~~~~~~~~~~~--~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~ 232 (547) |.+++|.++.+.. +++...+....++..+.+++..++++++.. .+.++|.|.+..++...-.-....++...|.+.- T Consensus 130 L~~l~~~~v~v~~~~~~~~~~~~~~~~g~~~~~~~~eiih~~~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng 208 (416) T protein:vir:12 130 LFPLRPDYTNAYVHPTTGMLWYQTVLNGKAIELYDYEVLHFKGLS-TDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNE 208 (416) T ss_pred EEEECCcceEEEEeCCCcEEEEEEecCCeEEEecCccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 9999998887443 333333333444555788888887777654 4558999999999999888888888889999999 Q ss_pred CCCeEEEeCCCCCCHHHHHHHHHHHHHHh-cCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-chhc Q lcl|NC_020866. 233 GQPLRIGKWQPGASEQDKDTLFRAVANIA-GDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-QTTT 310 (547) Q Consensus 233 G~P~~vgk~~~~~~~~~~~~l~~al~~~~-~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-qtlt 310 (547) +.|--|.+++...++++++++.+.+..+. ++..+|+|.|++++-+..+. ....|.+..++..++|+++.-- ..+. 
T Consensus 209 ~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~~~Ia~~fgVPp~~l 285 (416) T protein:vir:12 209 ATPRGILKVPAFLDEKPKENVRKEWKRVNKVENIAIIDYGLEYQSISMPL---QEAQFVESMKFNKAQISMIYKVPLHKL 285 (416) T ss_pred CCCceEEecCCCCCHHHHHHHHHHHHHHhcCCCeeecCCCceEEEccCCh---hhHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 99999999998889999999999888765 45678899999887765332 2224777788889999988422 2333 Q ss_pred cCccccccchhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEec--CccccCHHHHHHHHHHHHh Q lcl|NC_020866. 311 TDAVAGGHAVSQEHRL-VQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIA--RPKTEDLTKLASSLDTLVR 387 (547) Q Consensus 311 s~~~ggs~a~~~vh~~-v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~--~~~~ed~~~~a~~~~~l~~ 387 (547) .....++++..+.... ....-+...++.|+..||+.|+...-.. ...+|+|+ .....|.+.+++++.++++ T Consensus 286 g~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~l~~~~~~~------~g~~i~fd~~~l~~~d~~~~~~~~~~~~~ 359 (416) T protein:vir:12 286 NELDKATFSNIEHQSIEYVRNTLQPWIVNFEQELNVKLFLDHDQK------SGHYVKFNIDSELRGDSKTQAEYLKTLHE 359 (416) T ss_pred CCccCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCchhhc------CCceEEeechhhhccCHHHHHHHHHHHHh Confidence 4444566665554443 4467788999999999999876433211 11245554 4446789999999999999 Q ss_pred CCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccc-cccCCcccccccccccc Q lcl|NC_020866. 388 LGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDP-GAGDLKTAERKLKPLSG 447 (547) Q Consensus 388 ~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 447 (547) .|+ ++.+++|+.+|+|+-+.++..+.+............... ........+.... 
+ T Consensus 360 ~G~-~T~NE~R~~~gl~Pi~ggd~~~~~~n~~~~~~~~~~~~~~~~~~~~gge~~~~---g 416 (416) T protein:vir:12 360 TGV-LNKDEIRELLERNPIENGDKYISSLNYVFLDFLEEYQRLKAGGAMKGGDNKNE---G 416 (416) T ss_pred CCC-cCHHHHHHHhCCCCCCCcceeeeccccccccccchhhccccccccCCCCCcCC---C Confidence 998 999999999999987667665433211100000000000 0000000000000 0 No 20 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=99.91 E-value=7.7e-23 Score=141.81 Aligned_cols=429 Identities=11% Similarity=0.059 Sum_probs=248.3 Q ss_pred eeCCCCCcccchhhhhhhhhhhhhh----hHHHHhhhhccC--CCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 7 LLDRWGNPVQRAQLTEEVAAPTVTG----VRSPLSGYPGDG--LNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 7 ~~~~~g~~~~~~~l~~~~a~~~~~~----~~~~~~~~~~~~--l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |.|.+.+.-+...-.++........ +...+.+.-..+ +|+ ..+ | +=+.|.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~---~~a------------------l--~~~~V~~ 57 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADP---EAV------------------L--SFHAVFA 57 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccCh---HHh------------------h--ccHHHHH Confidence 6777666333332222332111111 111111110011 111 111 1 3466999 Q ss_pred HHHHHHHHHhcCCceEecCCC--CHHHHHHHHHHHHHHhCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEE Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASD--DPQDVKIADMVRDWLKRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPD 153 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~--~~~~~~~ae~v~~~l~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~ 153 (547) |++.+...|.+++|+|.-.+. ..+.. ....+..++.++ ...++++.++ +.+.+|-+++++++... | .+. 
T Consensus 58 ~v~~Ia~~iA~lp~~~~~~~~~g~~~~~-~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~-G--~~~ 133 (454) T protein:vir:93 58 CISLISQDIAKMRLRLMQTDAQGIRRET-RRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNAR-G--QIK 133 (454) T ss_pred HHHHHHHhhccCceEEEEeccCCccchh-hhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCC-C--cEE Confidence 999999999999999964322 21111 112344455554 4667777777 56789999999997544 3 367 Q ss_pred eeEeeccceeEecccCC--cceeeccCC----CCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 154 KLIWRDPRHFRFQKSDL--ATPLLLDDN----GAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAI 227 (547) Q Consensus 154 ~l~~~~~~~~~~~~~~~--~~~~~~~~~----~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 227 (547) .|.+++|.++.+..++. ..+...... +....++....|++++....+..+|.+.+..+....-.-....++... T Consensus 134 ~L~~i~~~~v~v~~~~~g~~~y~~~~~~~~~~~~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~ 213 (454) T protein:vir:93 134 ELRILDWNRVEPLVADDGEVFYRITPDRNCGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTS 213 (454) T ss_pred EEEEEcCcceEEEEcCCCcEEEEEEeccccccceeEEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHH Confidence 89999999887654433 333332221 234568888887777767777889999999999998888888888889 Q ss_pred HHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 228 FTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKA 303 (547) Q Consensus 228 f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ 303 (547) |...-+.|--+.+++...++++++++.+.+.++.+ +..+|++.|++++-+..+.. 
...|.+..++...+|+++ T Consensus 214 ~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~---d~q~le~~~~~~~~Ia~~ 290 (454) T protein:vir:93 214 FFRNGGRPSGVIEIPGSITEENAKKLKSNWDSGYTGENAGKTAILSNGAKYNPTTFSPV---DSQTVEQLKMTAEIVCSV 290 (454) T ss_pred HHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcccccCCceeccCCceEEEcccChh---HHHHHHHHHHHHHHHHHH Confidence 99888999888999888899999999888887743 34688999998877654322 123667777888899988 Q ss_pred Hhc-chhccCccccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHH Q lcl|NC_020866. 304 VLG-QTTTTDAVAGGHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASS 381 (547) Q Consensus 304 ilG-qtlts~~~ggs~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~ 381 (547) .-- ..+....++++++..+.+. .....-+.-.++.|+..||+.|+.. ...+.+|.++.....|.+.+++. T Consensus 291 fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~L~~~--------~~~~~~f~~~~ll~~D~~~r~~~ 362 (454) T protein:vir:93 291 FRVPAYKIGVGQPPSSDNVEALEQQYYSQCLQTLIESIELLLDEALETG--------ENESTEFDVTTLLRMDSERRMKT 362 (454) T ss_pred hCCCHHHcCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC--------CCcEEEeechhhhccCHHHHHHH Confidence 321 1233333344555444333 3456678888999999999876431 12233444444456789999999 Q ss_pred HHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccchhh Q lcl|NC_020866. 382 LDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSE 461 (547) Q Consensus 382 ~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (547) +.++.+.|+ ++.+++|+.+|+|+-+.+++.+.+.....-..................+...+....... .+.. 
T Consensus 363 ~~~~~~~G~-~T~NE~R~~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d------~~~~ 435 (454) T protein:vir:93 363 LGDAVKNTL-LTPNEARKRENLPPLAGGDALYLQQQNYSLEALSRRDAREDPFASSGKTASVPQAVAASD------GNKA 435 (454) T ss_pred HHHHHhCCC-cCHHHHHHHhCCCCCCCCCeeeeccCccchHhhhccCcccCCCCCCccCCCCCCCCCCCC------CCCC Confidence 999999997 999999999999987666665433221100000000000000000000000000000000 0001 Q ss_pred hcccccccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 462 MPSTGQKTGGAEIAALTDQLELEGQPEIVAMIEE 495 (547) Q Consensus 462 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 495 (547) ..+... +..++.... .+++ T Consensus 436 ~~e~~~----d~~~~~~~~-----------~~~~ 454 (454) T protein:vir:93 436 ITETEH----DAVKAMFRG-----------ILKK 454 (454) T ss_pred ccCCcc----chhhhhhhh-----------hhcC Confidence 111111 111111111 1111 No 21 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=99.91 E-value=2.3e-23 Score=144.68 Aligned_cols=394 Identities=11% Similarity=0.019 Sum_probs=239.4 Q ss_pred hhhhHHHHhh-hhccCCCHHHHHHHHHh---hcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHH Q lcl|NC_020866. 29 VTGVRSPLSG-YPGDGLNPLRLAQILRA---ADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQ 104 (547) Q Consensus 29 ~~~~~~~~~~-~~~~~l~p~~~~~ilr~---a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~ 104 (547) +.-+.+++.. .+....+|..+...+-. ...|. .+..+...+-+.|.+|++.+...|.+++|.|...+++.. 
T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~~~~g~-----~v~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~ 75 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYDTYTGK-----QISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLK 75 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCccccCCc-----eechhhhhccHHHHHHHHHHHHHhccCceEEEEecCCce Confidence 2222122211 11111222222222110 01111 011111125678999999999999999999975443221 Q ss_pred HHHHHHHHHHHHh-C----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcc--eeec Q lcl|NC_020866. 105 DVKIADMVRDWLK-R----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLAT--PLLL 176 (547) Q Consensus 105 ~~~~ae~v~~~l~-~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~--~~~~ 176 (547) ....-.-+..+|. + +...++++.+. +.+.+|-+++.++. ++| .+..|.+++|.++.+..++... +... T Consensus 76 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~--~~g--~~~~L~~l~~~~v~~~~~~~~~~~y~~~ 151 (414) T protein:vir:44 76 QRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVK--AFG--EVAELLPVDPGCVVPKLNSSWEPVYQVT 151 (414) T ss_pred eecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEe--CCC--cEEEEEEEcCceEEEEECCCCcEEEEEE Confidence 1111112223332 2 34666777777 46679999988753 234 3668999999987754433333 3344 Q ss_pred cCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHH Q lcl|NC_020866. 177 DDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRA 256 (547) Q Consensus 177 ~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~a 256 (547) ...+....+++...|++++.. .+.++|.+.+..+....-.-....++...|....+.|--+.+++...++++++++.+. 
T Consensus 152 ~~~g~~~~~~~~evih~~~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~ 230 (414) T protein:vir:44 152 FPDGSTDVLSQEDIWHVRTLT-LDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKD 230 (414) T ss_pred ecCceEEEEccccEEEecCCC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHH Confidence 445556778888888777654 4558999999999988877777888888899999999888899888899999888888 Q ss_pred HHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc--hhccCccccccchhHHHHH-HH Q lcl|NC_020866. 257 VANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ--TTTTDAVAGGHAVSQEHRL-VQ 328 (547) Q Consensus 257 l~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq--tlts~~~ggs~a~~~vh~~-v~ 328 (547) +.+..+ +..+++|.|++++-+..+. .-..|.+..++...+|+++. |- .+....++++++..+.+.. .. T Consensus 231 ~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~---~d~~~~e~~~~~~~~Ia~~f-gVpp~~l~~~~~~t~~n~e~~~~~~~ 306 (414) T protein:vir:44 231 FEERHTGLGNAHRPMILEMGLDWKSMALNA---EDSQFLETRKFQLEEICRLF-RVPLHMVQNTDRATFNNIEELGLGFI 306 (414) T ss_pred HHHHhcCccccCcceecCCCceEEEccCCh---HHHHHHHHHHHHHHHHHHHh-CCCHHHhCCCCCCCcccHHHHHHHHH Confidence 876533 2367889998877664332 22247777888888998883 32 2333333456665554443 44 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCC Q lcl|NC_020866. 329 EDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKP 408 (547) Q Consensus 329 ~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~ 408 (547) ..-+...++.|++.||+.|++.-- ....+.+|.++.....|.+.+++.++++++.|+ ++.+++|+.+|+|+-+. 
T Consensus 307 ~~~l~P~~~~ie~~ln~~L~~~~~-----~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~-~t~NE~R~~~gl~p~~g 380 (414) T protein:vir:44 307 NYSLVPYLTRIEQRINTGLVRKSK-----QGVFYAKFNAGALLRGDMKSRFEAYATGINWGI-YSPNDCRDLEDMNPRPG 380 (414) T ss_pred HHHHHHHHHHHHHHHHhhcCCccc-----cCceEEEEechhhhccCHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCC Confidence 667888899999999987755321 111223444444446789999999999999998 99999999999987666 Q ss_pred cchhcccccccccccccccccccccCCccccccc Q lcl|NC_020866. 409 GAKLLRPLENPATQAAPPGMDPGAGDLKTAERKL 442 (547) Q Consensus 409 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (547) ++..+.+......+.................+++ T Consensus 381 gD~~~~~~n~~~~~~~~~~~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 381 GDVYLTPMNMTTKPSDGSKAGKQKDNANADETTS 414 (414) T ss_pred cceecccccccccCCccccCCCCCCCCCCCCCCC Confidence 6665544332221111111111111111111111 No 22 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=99.90 E-value=8.1e-21 Score=130.72 Aligned_cols=530 Identities=11% Similarity=0.041 Sum_probs=253.7 Q ss_pred CCCCC-------eee------CCCCCccc----chhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCC--- Q lcl|NC_020866. 
1 MAQKP-------VLL------DRWGNPVQ----RAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGD--- 60 (547) Q Consensus 1 ma~~~-------~~~------~~~g~~~~----~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~--- 60 (547) ||++- +|- |-+..|.- .+-.+.|.+.+...+.......-| ......|+..+...+..|- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~-~~~~~~r~g~~~~~~~~g~~~~ 79 (648) T protein:vir:79 1 MARKVWGRGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDP-KMSLVKRIGLAIMDGGGGGRDF 79 (648) T ss_pred CccchhcchhhhhhhhhccCccccccccccccccccCCCccccCCCCcccccccccc-hhHHHHHhHHHHHhhcCCcccc Confidence 66421 000 00111110 001111112111111100000000 0112223233333332221 Q ss_pred ---HHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhC----CCHHHHHHHHH-Hhh Q lcl|NC_020866. 61 ---PVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKR----EELQQEVFHIL-DCI 132 (547) Q Consensus 61 ---~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~----~~~~~~i~~~l-~a~ 132 (547) ...+..|.+. ....++|++|+..+...|.+++|.|.+.++.....-. .+..+.+ +...+++..++ +.+ T Consensus 80 ~epp~d~~~l~~l-~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~---~~~ll~rPn~~~t~~~f~~~l~~~ll 155 (648) T protein:vir:79 80 EEPEFDFNEITSA-YNTEGYVRQAVDKYIEMMFKADWDFVSKNPNAVEYIR---MRFTLMAEATQIPTNQLFIEIAEDLV 155 (648) T ss_pred ccCCcCHHHHHHH-HhcChHHHHHHHHHHHHHhhCcceEEecCCccchhhH---HHHHhhccCCCCCHHHHHHHHHHHHH Confidence 1122333222 2369999999999999999999999987664433211 1122222 35566777666 466 Q ss_pred hhcCeeeeeEEeecCCe------------eeEEeeEeeccceeEecccCCcce--eecc--CCCCceeeccCceEEEeec Q lcl|NC_020866. 133 SKGYSFTEIVWNTSEGQ------------YFPDKLIWRDPRHFRFQKSDLATP--LLLD--DNGAEVPLPAFRFIFADIP 196 (547) Q Consensus 133 ~~G~sv~Ei~w~~~~g~------------~~~~~l~~~~~~~~~~~~~~~~~~--~~~~--~~~~g~~l~~~k~i~~~~~ 196 (547) .||.+.+|++...+++. .....+.+++|.++.+..+..+.. +.+. +++..+.+++...|++++. 
T Consensus 156 l~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~~~Y~y~~~g~~~~~~~~~~dIIHik~~ 235 (648) T protein:vir:79 156 KYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMIKGWQQEQEGQDKPQKFKPEDIVHIYYK 235 (648) T ss_pred hcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCCceeeeEEEecCCceeEEecCccEEEEccC Confidence 89999999998665532 123456778887776654433332 2222 2334467888888777777 Q ss_pred CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCC-CHHHHHHHHHHHHHHhcC----eeEEecCC Q lcl|NC_020866. 197 AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGA-SEQDKDTLFRAVANIAGD----CAAIIPET 271 (547) Q Consensus 197 ~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~-~~~~~~~l~~al~~~~~~----~~~vip~~ 271 (547) ...+.+||.|.+..+....-.-....++...|....+.|..+.+++.+. ..+..+++.+.+....++ ++.+.++. T Consensus 236 ~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~~~e~~k~~~e~~~~~~~~~~i~gg~v~~~~ 315 (648) T protein:vir:79 236 REKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQEGFGAEEGEVDLVRGEVENMDVEGGMVTTER 315 (648) T ss_pred CCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCccchHHHHHHHHHHHHhcccccccccccccce Confidence 7778899999999999998888889999999999999998888875332 223334444444433222 22333333 Q ss_pred ceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc--hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 272 MSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ--TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIR 349 (547) Q Consensus 272 ~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq--tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~ 349 (547) +.++... ....-.|.+..++..++|+.+. 
|- .+.+..++++++.++.....+...+..-++.+...++..+.+ T Consensus 316 ~~i~~~~----s~~dlqfle~rk~~~~eIa~aF-gVPP~lLG~~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~ 390 (648) T protein:vir:79 316 VNISSIA----SNQIIDAKEYLKHFEQRAFTVL-GVSELMMGRGGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVK 390 (648) T ss_pred eeccccC----CHHHHHHHHHHHHHHHHHHHHh-CCCHhHcccCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3333221 1111136667778888999884 32 333444455666666666666666777777777666665554 Q ss_pred HHHH-hcC---CCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcch--hcccccccc--- Q lcl|NC_020866. 350 PWIQ-LEY---GPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAK--LLRPLENPA--- 420 (547) Q Consensus 350 ~l~~-~Nf---g~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~--~~~~~~~~~--- 420 (547) .+.. ..| .......+|.|......|.+..++.+.++.+.|+ ++.+++|+..|+|+-.+++. .+.....+. T Consensus 391 ~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~~~Gi-lT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~ 469 (648) T protein:vir:79 391 EILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAVFLYEHNA-ISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQA 469 (648) T ss_pred HHhhhhhccccccccceEEEeecccchhhHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCCccccccccccchhc Confidence 4322 111 1122234677777667788888999999999998 99999999999976443332 121111110 Q ss_pred -cccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchhHHHHH-------------HHHHHHHH- Q lcl|NC_020866. 421 -TQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIAAL-------------TDQLELEG- 485 (547) Q Consensus 421 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-------------~~~~~~~~- 485 (547) ....+....++.. ...+...........+...........-+. .......+..+ ...+.+.. 
T Consensus 470 ~~~~~~~~~~~~~~-~~~a~~eg~~~e~~~~~~~~~~~g~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (648) T protein:vir:79 470 TALAALAPTPAGGS-SASASGDKKKKATDNKTKPTNQHGTKTSPK--KQTNGRHVRYMQEMLLEYTTLNEAIKALIERYY 546 (648) T ss_pred cccccCCCCCCCCC-CCCccccccccccCCCCCCCCCCCcCCCCc--cccchhhhhhhhhhhhcchhhhHHHhhHHHHHH Confidence 0000000000000 000000000000000000000000000000 00000001100 11111110 Q ss_pred HHHHHHHHHHHHHHHHccCCHHHHHHHHHHHhcC-------CCHHHHHHHHHHHHHHHHHHhHHhhccC Q lcl|NC_020866. 486 QPEIVAMIEEIRTMLEAASSLGEFSDMLDARYGE-------IGASQLASRISAAMLASEFAGREGALDG 547 (547) Q Consensus 486 ~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~-------~~~~~l~~~l~~al~~A~~~G~~~~~~g 547 (547) +......+..|-+.|.-. ..-+++.+...+.+ |+..+.++.|..+...---.|. .+.+| T Consensus 547 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 612 (648) T protein:vir:79 547 QYGSKEHLKSINGSLMYT--EGRLLELTTQYWGEEVTEKVRIPFHRMTENLREEVMSTIDKVE-GVAEA 612 (648) T ss_pred HHhHHHHHHhhhhhheec--cchhHHHHHHHhhhhhhceeeeeHHHHHHHHHHHHHhhhhhhh-hhHHH Confidence 111112222232222221 12233333333332 5555566666655543222221 12222 No 23 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=99.90 E-value=4.9e-23 Score=142.89 Aligned_cols=395 Identities=13% Similarity=0.061 Sum_probs=233.7 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhc----cC--CCHHHHHHHHHhhcCCCHHHHHHHHHHHHhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPG----DG--LNPLRLAQILRAADHGDPVRYLDLAEVIEER 74 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~----~~--l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~ 74 (547) |- +-+..++ +. +.++.. ..+...+.+..+ .| +|+. 
...+ T Consensus 1 m~----~~~~~~~----~~--~~~s~~--~~w~~~~~~~~~~~~~~g~~vt~~-----------------------~al~ 45 (421) T protein:vir:10 1 MF----IPQMFEG----KK--RSVSGG--GFWEAMLGGVRSSHSKAGVMITPE-----------------------TALA 45 (421) T ss_pred CC----Ccchhcc----cc--cccCcc--hhhHHHhhhhccCcccCCceechH-----------------------Hhhc Confidence 54 1111111 11 112111 111112221111 11 1221 1114 Q ss_pred hHHHHHHHHHHHHHHhcCCceEecC--CCCHH-HHHHHHHHHHHH-hCC----CHHHHHHHHH-HhhhhcCeeeeeEEee Q lcl|NC_020866. 75 DPHYLGVIGTRKRSVSQLDITVEPA--SDDPQ-DVKIADMVRDWL-KRE----ELQQEVFHIL-DCISKGYSFTEIVWNT 145 (547) Q Consensus 75 D~~v~s~l~~Rk~~v~~~~w~v~p~--~~~~~-~~~~ae~v~~~l-~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~ 145 (547) -+.|.+|++.+...|.+++|+|.-. ++..+ ...+ -+..+| .++ ...++++.++ +.+.+|-+++++++.. T Consensus 46 ~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~~~~~~~~--~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~ 123 (421) T protein:vir:10 46 LSAVRACVTLLAESVAQLPVELYRRDKNGGRQRATDH--PIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDG 123 (421) T ss_pred cHHHHHHHHHHHHhhccCceEEEEEcCCCceeecccc--hHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC Confidence 6789999999999999999998532 22211 1111 122333 233 4666777766 5778999999998753 Q ss_pred cCCeeeEEeeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 146 SEGQYFPDKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDW 225 (547) Q Consensus 146 ~~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 225 (547) + ..|..|.+++|.++.+..+....+..+- ...|..+|....++.++.. .+..+|.|.+..+....-.-....++. 
T Consensus 124 ~---G~~~~L~~l~~~~v~v~~~~~g~~~y~~-~~~g~~~~~~eiih~~~~~-~d~~~G~spi~~~~~~i~~~~~~~~~~ 198 (421) T protein:vir:10 124 K---GYPKELIPINPKKVIVLKGPDGMPYYEI-PEIGETLPMRMMHHVKVFS-LDGYIGSSPIQTNADVLGLNLAVEEHA 198 (421) T ss_pred C---CcEEEEEEecCceEEEEECCCceEEEEE-cCCCcEEchhhEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHH Confidence 3 3477899999998887554444333322 2334567777666666554 455899999999998888888888888 Q ss_pred HHHHHHcCCCeEEEeCCCCC----CHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHH Q lcl|NC_020866. 226 AIFTQTYGQPLRIGKWQPGA----SEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHL 296 (547) Q Consensus 226 ~~f~E~~G~P~~vgk~~~~~----~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~ 296 (547) ..+...-|.|--+-+++... ++++++++.+.+.+.-+ +..+|++.|++++-+..+. ....|.+..++. T Consensus 199 ~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~ 275 (421) T protein:vir:10 199 SAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQEGMSYKQMSQDN---EKAQLLQSRQWG 275 (421) T ss_pred HHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEecCCCh---hHHHHHHHHHHh Confidence 88999989997777776543 77888888887777532 3468899998887665332 222477777788 Q ss_pred HHHHHHHHhc-chhccCccccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccC Q lcl|NC_020866. 297 DQQISKAVLG-QTTTTDAVAGGHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTED 374 (547) Q Consensus 297 d~~Isk~ilG-qtlts~~~ggs~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed 374 (547) .++|+++.-- ..+....+.++++..+.+. .....-+...++.++..||+.|+.+--. 
...+.+|..+.....| T Consensus 276 ~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~-----~~~~v~fd~~~l~~~d 350 (421) T protein:vir:10 276 VEEVCRLYKIPPHMVQMLAKATNNNIEHQGLQFVMYTLLAWLKRHEGALQRDLLLPSER-----RDLYIEFNVSGLLRGD 350 (421) T ss_pred HHHHHHHhCCCHHHcCCCcCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcccc-----CCeEEEEechhhhccC Confidence 8999988422 2333443445555444333 4455678888899999999876543211 1122234334444678 Q ss_pred HHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhh Q lcl|NC_020866. 375 LTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRG 454 (547) Q Consensus 375 ~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (547) ++.+++.++++++.|+ ++.+++|+.+|+|+-+.++.++.+..........++... +......+. +.. T Consensus 351 ~~~~~~~~~~~~~~G~-~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~------~~~~~~~e~-d~~----- 417 (421) T protein:vir:10 351 QKSRYESYALGRQWGW-LSVNDIRRMENLPPIAGGDKYLTPLNMVDSAQIIPGDKK------PTAQQMAEI-DTI----- 417 (421) T ss_pred HHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCcceeeeccccccccccccCCCC------cccccCccc-ccc----- Confidence 9999999999999997 999999999999977777766544322111111111000 000000000 000 Q ss_pred cccchhhhccc Q lcl|NC_020866. 455 DLSLNSEMPST 465 (547) Q Consensus 455 ~~~~~~~~~~~ 465 (547) ...+ T Consensus 418 -------~~~~ 421 (421) T protein:vir:10 418 -------LSRT 421 (421) T ss_pred -------cccC Confidence 0000 No 24 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=99.90 E-value=5.7e-23 Score=142.52 Aligned_cols=389 Identities=12% Similarity=0.042 Sum_probs=239.2 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |- |.+...+.+....-+...+. | .+..--+++.... +..+ +-+.|.+ T Consensus 1 MG----~~~~~~~~~~~~~~~~~~~~-------------------~-----~~~~~~g~~~~~~----~~al-~~~~V~~ 47 (411) T protein:vir:81 1 MG----WWSRLTRFFRPRNETVDMTN-------------------P-----LLLQWLGVDPDTP----RNQL-SEATYFA 47 (411) T ss_pred Cc----hHHHHHhhccCcccccccch-------------------H-----HHHHHhcCcccCh----hhhh-ccHHHHH Confidence 55 33321111111100000000 0 0000001111000 0111 4678999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-C----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEe Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-R----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDK 154 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~ 154 (547) |++.+-..|.+++|++...+++...+..-.-+..+|+ + +...++++.++ +.+.+|-+.+++++. +|. +.. T Consensus 48 ~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~--~g~--~~~ 123 (411) T protein:vir:81 48 CLKILSESLGKLPLKMYQKTERGIVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS--GPQ--LQA 123 (411) T ss_pred HHHHHHHhHhhCceeEEEecCCceeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec--CCc--eEE Confidence 9999999999999999643322111110112333343 3 34677777777 467899999998864 443 567 Q ss_pred eEeeccceeEecccCCcce-------eecc--CCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 155 LIWRDPRHFRFQKSDLATP-------LLLD--DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDW 225 (547) Q Consensus 155 l~~~~~~~~~~~~~~~~~~-------~~~~--~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 225 (547) |.+++|..+.+..++...+ +... ..+....+++...|++++....+..+|.+.+..+....-.-....++. 
T Consensus 124 l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 203 (411) T protein:vir:81 124 LWILPSQYVTIVVDDRGLLGEKNAIWYRYNDPYDGKMYVFRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFM 203 (411) T ss_pred EEEECCceEEEEEcCcccccccceEEEEEEecCCceEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHH Confidence 8999999887765443321 1121 234456788888887776666777899999999999988889999999 Q ss_pred HHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcC-----eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHH Q lcl|NC_020866. 226 AIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGD-----CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQI 300 (547) Q Consensus 226 ~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~-----~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~I 300 (547) ..+...-+.|--+.+++...++++++++.+.+.+..++ ..+|++.|++++-+..+. ....|.+..++..++| T Consensus 204 ~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~~~I 280 (411) T protein:vir:81 204 NNLYKTGLTGKAVLEYTGDLNQEARDRLVKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKL---TDSQFFELKKYTALQI 280 (411) T ss_pred HHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCCH---HHHHHHHHHHHHHHHH Confidence 99999999998888998888999999999888876432 358889998877664322 1224667778888999 Q ss_pred HHHHhc-chhccCccccccchhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHH Q lcl|NC_020866. 301 SKAVLG-QTTTTDAVAGGHAVSQEHRL-VQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKL 378 (547) Q Consensus 301 sk~ilG-qtlts~~~ggs~a~~~vh~~-v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~ 378 (547) +++.-- ..+.+..++++++..+.+.. ....-+...++.|++.||+.|+..-.. 
....+-+|.++.....|.+.+ T Consensus 281 a~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~----~~~~~~~fd~~~ll~~d~~~~ 356 (411) T protein:vir:81 281 AAAFGIKPNQINDYEKSSYASAEAQNLAFYVDTLLYVLKQYEEEITYKILSNDLI----SQGHYFKFNVNVILRADIKTQ 356 (411) T ss_pred HHHhCCCHHHhCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhc----CCCcEEEeechhhhccCHHHH Confidence 988422 23444445567766655443 445567888899999999887643211 111222333334446789999 Q ss_pred HHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCC Q lcl|NC_020866. 379 ASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDL 435 (547) Q Consensus 379 a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (547) ++.++++++.|+ ++.+++|+.+|+|+-+.++..+.+... .+...........+.. T Consensus 357 ~~~~~~~~~~g~-~t~NE~R~~~gl~p~~ggD~~~~~~n~-~pl~~~~~~~~kgGd~ 411 (411) T protein:vir:81 357 MDSLSTAVQNGI-MTPNEARDYLDMPADDYGNNLMANGNY-IPLSMLGANYGKGGDS 411 (411) T ss_pred HHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCeeeeccCc-cchhhhhhhhccCCCC Confidence 999999999998 999999999999865555544433221 1111100000000000 No 25 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=99.89 E-value=1.6e-22 Score=140.07 Aligned_cols=408 Identities=11% Similarity=0.032 Sum_probs=244.9 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |.--.++.+..+|+- ....+++.. ....... .+....+.+. +. 
+..+ +-+.|.+ T Consensus 1 M~~~~~~f~~~~r~~---~~~~~~~~~-~~~~~~~-~g~~~~~~~v-------------~~-------~~al-~~~~v~~ 54 (429) T protein:vir:10 1 MDSVKKFFNFEKRQT---SQVIELNKD-DEKLLEW-LGISPSTISV-------------KG-------KNAL-KVATVFA 54 (429) T ss_pred CchhhhhhcccccCc---ccccccCCC-hHHHHHH-hcCCCCccee-------------ch-------hhhh-ccHHHHH Confidence 884444444333321 111111110 0000011 1100011110 00 0011 4678999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-CC----CHHHHHHHHHH-hhhhcCeeeeeEEeecCCeeeEEe Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-RE----ELQQEVFHILD-CISKGYSFTEIVWNTSEGQYFPDK 154 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l~-a~~~G~sv~Ei~w~~~~g~~~~~~ 154 (547) |+..+...|.+++|+|.-..++...+..-.-+..+|+ ++ ...++++.++. .+.+|-+++++++...| .+.. T Consensus 55 ~i~~ia~~ia~l~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G---~~~~ 131 (429) T protein:vir:10 55 CIKILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKG---KVQA 131 (429) T ss_pred HHHHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC---cEEE Confidence 9999999999999998643222111111112334443 33 45667777664 57799999999875433 3678 Q ss_pred eEeeccceeEecccCCc-------ceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 155 LIWRDPRHFRFQKSDLA-------TPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAI 227 (547) Q Consensus 155 l~~~~~~~~~~~~~~~~-------~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 227 (547) |.++++.++.+..++.. ..+....++....+++...|++++....+..+|.|.+..+....-.-....++... 
T Consensus 132 L~~i~~~~v~v~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 211 (429) T protein:vir:10 132 LWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINN 211 (429) T ss_pred EEEEcCceeEEEEcCcccccccceEEEEEccCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 99999988876443321 12334445556778999888777766777789999999999988888888899999 Q ss_pred HHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHh-----cCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHH Q lcl|NC_020866. 228 FTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIA-----GDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISK 302 (547) Q Consensus 228 f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~-----~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk 302 (547) +.+.-+.|--+.+++...+++.++++.+.+...- ++..+|+|.|++++-+..+. .. ..|.+..++..++|++ T Consensus 212 ~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~--~d-~q~~e~~~~~~~~Ia~ 288 (429) T protein:vir:10 212 FYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNM--SD-AQFLENTELTIRQIAT 288 (429) T ss_pred HHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhccccccCceeecCCCceEEEccCCh--hH-HHHHHHHHHHHHHHHH Confidence 9999899988889988889998888888887653 23578999999887765332 22 2356667788888988 Q ss_pred HHhc-chhccCccccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHH Q lcl|NC_020866. 303 AVLG-QTTTTDAVAGGHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLAS 380 (547) Q Consensus 303 ~ilG-qtlts~~~ggs~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~ 380 (547) +.-- ..+.....+|+++-.+-+. .....-+...++.|+..||+.|+..--.. 
...+.+|.++.....|.+++++ T Consensus 289 ~fgVP~~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~----~g~~~~fd~~~ll~~d~~~~~~ 364 (429) T protein:vir:10 289 AFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSELD----KGFYSKFNVDAILRADIKTRYE 364 (429) T ss_pred HhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhcC----CCcEEEeechhhhcCCHHHHHH Confidence 8421 2233333345565444443 34566788899999999998875433111 1122344444445679999999 Q ss_pred HHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccc-cccccccCCccccccccccccc Q lcl|NC_020866. 381 SLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPP-GMDPGAGDLKTAERKLKPLSGQ 448 (547) Q Consensus 381 ~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 448 (547) .++++++.|+ ++.+++|+.+|+|+-+.++..+.+... .+....+ ....+............+ ++ T Consensus 365 ~~~~~~~~G~-~T~NE~R~~~gl~p~~ggD~~~~~~n~-~~~d~~~~~~~k~g~~~~~~~~~~~e--~~ 429 (429) T protein:vir:10 365 AYRTGIQGGF-LKPNEARSKEDLPPEAGGDRLLVNGNM-LPIDMAGQAYLKGGDTNGEVSKEGNE--GN 429 (429) T ss_pred HHHHHHhCCC-cCHHHHHHHhCCCCCCCcCeeeecccc-cchhhccccccCCCCCCCCCCCCCCC--CC Confidence 9999999998 999999999999876555655544322 1111111 111111111100000000 01 No 26 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=99.89 E-value=1.3e-22 Score=140.63 Aligned_cols=380 Identities=14% Similarity=0.050 Sum_probs=226.7 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |. +.+..++- ++.. .++...|...+.....|..... +.. 
.+-+.|.+ T Consensus 1 M~----~f~~~~~~------~~~~------------------~~~~~~~~~~~~~~~~~~~v~~----~~a-l~~~~V~~ 47 (397) T protein:vir:38 1 MP----LLKLNKSH------SQGF------------------SLNDPDWVNFLTGGEAQKYVSA----DTA-LKNSDIFS 47 (397) T ss_pred Cc----chhhhhcc------cCcc------------------cCCchhhhhhhcCCcCCceech----HHh-hccHHHHH Confidence 65 43322110 0111 0111112222221111111100 011 14678999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhC----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKR----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) |++.+...|.+++|++.. + .+..++.+ +.+.++++.+. +.+.+|.+++++++...+ .+..| T Consensus 48 ~v~~ia~~ia~~p~~~~~----~-------~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g---~~~~l 113 (397) T protein:vir:38 48 LIMQLSGDLAMVRYTSES----D-------RSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNG---VDLSW 113 (397) T ss_pred HHHHHHHHHhhCcccccc----c-------HHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCC---cEEEE Confidence 999999999999998752 1 12334444 35777777777 467799999999986433 36789 Q ss_pred EeeccceeEeccc--CCcceee--ccC--CCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 156 IWRDPRHFRFQKS--DLATPLL--LDD--NGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFT 229 (547) Q Consensus 156 ~~~~~~~~~~~~~--~~~~~~~--~~~--~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 229 (547) .+++|.++.+..+ ++...+. ... .+....+++...|++++....+..||.+.+..+....-......++...+. 
T Consensus 114 ~~l~~~~v~i~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f 193 (397) T protein:vir:38 114 EYLRPSQVQPMLLQDGSGLIYNINFDEPAIGYMENVPAADVIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKAL 193 (397) T ss_pred EEEcCceeEEEEcCCCceEEEEEEeccccccceeEecCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999998765443 2222221 122 233357888888888888877778999999999999999899999999999 Q ss_pred HHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHh Q lcl|NC_020866. 230 QTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVL 305 (547) Q Consensus 230 E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~il 305 (547) ...|+|--+.+++...++++++++.+.+....+ +..+|++.|++++=+..+ .....|.+..++..++|+++.- T Consensus 194 ~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~---~~d~~~~e~~~~~~~~Ia~afg 270 (397) T protein:vir:38 194 KQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVVIDALEDYKPLEVK---GNIASLLNQVDWTRDQIAKVYG 270 (397) T ss_pred hccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecCCCceEEecCCC---hhHHHHHHHHHHHHHHHHHHhC Confidence 999999999999988888888888777765432 345788888877755432 2223477888889999998831 Q ss_pred c-chhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHH Q lcl|NC_020866. 306 G-QTTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDT 384 (547) Q Consensus 306 G-qtlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~ 384 (547) - ..+.++..+ +++..+-.......-+...+..++..||+.|+... 
.+ .+.+ .-..|.+.+++.+++ T Consensus 271 Vp~~~lg~~~~-~~~~~e~~~~~~~~~l~P~~~~ie~~ln~~l~~~~-~~---------~~~~--~~~~d~~~~~~~~~~ 337 (397) T protein:vir:38 271 VPDSYLNGQGD-QQSSITQISGQYAKSLNRYVQAIVGELNDKLHANI-SA---------NIRF--AIDAMGDQYASTISS 337 (397) T ss_pred CCHHHhCCCCC-cccHHHHHHHHHHHHHHHHHHHHHHHHHHhccChh-cc---------cccc--cccCCHHHHHHHHHH Confidence 1 222222212 22222222333445777888899999998776532 11 1122 223578899999999 Q ss_pred HHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccc-cCCcccccccccc Q lcl|NC_020866. 385 LVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGA-GDLKTAERKLKPL 445 (547) Q Consensus 385 l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 445 (547) +++.|+ ++.+++|+.+|+|+-..++ +..................+. ......+....+. T Consensus 338 ~~~~G~-~t~nE~R~~lg~~p~~~~d-~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 338 SVKGGT-IAGNQARFILQNSGYLAKD-LPDPEKEPQQAIQLIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred HHhCCC-cCHHHHHHHhCCCCCCCCc-cccccccccccccccccccCCCCCCCCCCCCCCCC Confidence 999997 9999999999997644443 221111111111100000000 0000000000000 No 27 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=99.89 E-value=2e-22 Score=139.58 Aligned_cols=415 Identities=11% Similarity=0.038 Sum_probs=236.2 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |-- -++++-.|+. +..++.+-+. ...+..+ ..|... +...+.-.+..++. .... +.. 
.+-+.|.+ T Consensus 1 ~~~-~~~~~~~~~~--~~~~~~~~~~-~~~~~~~---~~~~~~-~~~~~~~~~s~~g~-~v~~-----~~a-l~~~~V~~ 65 (432) T protein:vir:10 1 MPD-EKKLGLLGQL--KAMFVPPDPV-DIGGGQT---FTPVNA-TARDLGIIISDTGA-AVNA-----DAI-MRLDAVAA 65 (432) T ss_pred CCC-Ccccchhhhh--HhhcCCcccc-ccccccc---cccCcc-hhhhhcccccccCc-ccch-----hhh-hcchHHHH Confidence 331 1222222321 1111111110 0000000 001000 00000001111111 1111 111 14688999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHH-hC----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEe Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWL-KR----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDK 154 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l-~~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~ 154 (547) |++.+-..|.+++|+|.-.+.+...+..-.-+..+| .+ +...++++.++ +.+.+|.+++++++. +| .+.. T Consensus 66 ~i~~Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~--~g--~~~~ 141 (432) T protein:vir:10 66 CVKLVSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT--DG--RIES 141 (432) T ss_pred HHHHHHHhhhhCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec--CC--cEEE Confidence 999999999999999853222111111111233344 23 34566666666 567899999999874 44 3678 Q ss_pred eEeeccceeEecccCCc--ceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020866. 
155 LIWRDPRHFRFQKSDLA--TPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTY 232 (547) Q Consensus 155 l~~~~~~~~~~~~~~~~--~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~ 232 (547) |.++++.++++..+..+ .+......+..+.++.+.++++++...+ ..+|.+.+..+....-.-....++...|...- T Consensus 142 L~~l~~~~v~v~~~~~g~~~y~~~~~~g~~~~~~~~~iih~~~~~~d-g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng 220 (432) T protein:vir:10 142 LQYLANDRLTITTDTKGNTAYRYRRTDGQMIDIPKQQIWKIMGYSLD-GENGLSAIRYGAQIFGTAIAAEAQAARAFRNG 220 (432) T ss_pred EEEEcCCceEEEEcCCCcEEEEEEecCceEEEEcCccEEEecCCCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 99999998876554333 3344445556678888888776665444 47899999999988887778888888888888 Q ss_pred CCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc--chh Q lcl|NC_020866. 233 GQPLRIGKWQPGASEQDKDTLFRAVANIAG-DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG--QTT 309 (547) Q Consensus 233 G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG--qtl 309 (547) +.|--|-+++...++++++++.+.+....+ +..+|++.|++++-+..+. . ...|.+..++...+|+++. | ..+ T Consensus 221 ~~~~gil~~~~~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~--~-d~q~le~~~~~~~~Ia~af-gVPp~~ 296 (432) T protein:vir:10 221 QLQSVYYQIDRFLTDDQYDSFAKKVSGSVEAGRAPLLEGGMDVKSLGLNP--V-DAQLLQSRQYSVESICRFF-GVPPSM 296 (432) T ss_pred CCcceEEecCCCCCHHHHHHHHHHHhhhhhCCCceecCCCceEEEccCCh--H-HHHHHHHHHHHHHHHHHHh-CCCHHH Confidence 999999999988899999999988876644 4578999999887665432 2 2236777788899999873 3 233 Q ss_pred ccCccccccchhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecC--ccccCHHHHHHHHH Q lcl|NC_020866. 310 TTDAVAGGHAVSQ----EHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIAR--PKTEDLTKLASSLD 383 (547) Q Consensus 310 ts~~~ggs~a~~~----vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~--~~~ed~~~~a~~~~ 383 (547) ......|+++.+. ........-+...++.|+..||+.|+.+-- . .. -+|+|+. .-..|.+++++.+. 
T Consensus 297 lg~~~~~t~~~~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~----~-~~--~~~~fd~~~ll~~d~~~r~~~~~ 369 (432) T protein:vir:10 297 IGHSSAGTTSWGSGIESQQLGFLSMTLSPWLRRIEQSIALNLLSPAE----R-RR--YFADFDTSALLRADSAARSSYYS 369 (432) T ss_pred cCCccCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCccc----c-Cc--eEEEeechhhhccCHHHHHHHHH Confidence 3333333333222 222334456777888888888887755421 1 11 2455543 34578999999999 Q ss_pred HHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccc Q lcl|NC_020866. 384 TLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSL 458 (547) Q Consensus 384 ~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (547) ++++.|+ ++.+++|+.+|+|+-+.++..+.......+.... +..+...+. ....+. . +...+. T Consensus 370 ~~~~~G~-~T~NE~R~~~glppi~g~~~~~~~~~~~~pl~~~-~~~~~~~~~---~~~~~~--~-----~~~~~~ 432 (432) T protein:vir:10 370 QLVNNGL-MTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSI-GLQASPEPA---SGLGNQ--Q-----QDKVSK 432 (432) T ss_pred HHHhCCC-CCHHHHHHHhCCCCCCCCcceEeecCcccchhhh-cccCCCCCC---CCCCCc--c-----cccccC Confidence 9999998 9999999999998876555444222111111110 000000000 000000 0 000000 No 28 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=99.89 E-value=6.9e-23 Score=142.07 Aligned_cols=401 Identities=10% Similarity=0.006 Sum_probs=245.2 Q ss_pred ccchhhhhhhhhhhhhhhHHHHhhhhc--cCCCHHHHH---HH-HHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHH Q lcl|NC_020866. 15 VQRAQLTEEVAAPTVTGVRSPLSGYPG--DGLNPLRLA---QI-LRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRS 88 (547) Q Consensus 15 ~~~~~l~~~~a~~~~~~~~~~~~~~~~--~~l~p~~~~---~i-lr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~ 88 (547) +.-++.|.++.+. .++++.+..+.. +..+|..-. ++ ......|.. +-..-..+-+.|.+|+..+... 
T Consensus 1 ~~~~~~~~~~~~~--~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~-----v~~~~al~~~~v~~cv~~Ia~~ 73 (424) T protein:vir:18 1 MEEPKYTIDLRTN--NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSS-----INDERILQISTVWRCVSLISTL 73 (424) T ss_pred CCCCccccccCCC--CchHHHHHhhccccccccccchhhcccccccccccccc-----ccHHHhhccHHHHHHHHHHHHh Confidence 5566666666542 233344433321 122332100 00 000001111 1111112467899999999999 Q ss_pred HhcCCceEecCC--CCHHHHHHHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeecc Q lcl|NC_020866. 89 VSQLDITVEPAS--DDPQDVKIADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDP 160 (547) Q Consensus 89 v~~~~w~v~p~~--~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~ 160 (547) |.+++|.|.-.. +.......-.-+..+|+ ++ ...++++.++ +.+.+|-+++++++... | .+..|.+++| T Consensus 74 iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-G--~~~~L~~l~~ 150 (424) T protein:vir:18 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-G--DVISLLPLQS 150 (424) T ss_pred hccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEecC Confidence 999999985322 11111111112333342 33 4555666666 56789999999987543 3 3678999999 Q ss_pred ceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe Q lcl|NC_020866. 161 RHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGK 240 (547) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk 240 (547) .++.+..+++...+.....+....++++..|++++.. 
.+..+|.|.+..+....-.-....++...|...-+.|--+.+ T Consensus 151 ~~v~v~~~~~~~~y~~~~~g~~~~~~~~eVihir~~~-~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~ 229 (424) T protein:vir:18 151 ANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) T ss_pred cceEEEEcCCeEEEEEEeCCeEEEeccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEE Confidence 9988776655444444445566789998887777655 445899999999988887888888888899999899988888 Q ss_pred CCCC-CCHHHHHHHHHHHHHHhc----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-chhccCcc Q lcl|NC_020866. 241 WQPG-ASEQDKDTLFRAVANIAG----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-QTTTTDAV 314 (547) Q Consensus 241 ~~~~-~~~~~~~~l~~al~~~~~----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-qtlts~~~ 314 (547) ++.+ .++++++.+.+.+.+..+ +..+|++.|++++-+..+. ....|.+..++...+|+++.-- ..+..+.. T Consensus 230 ~~~~~l~~e~~~~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 306 (424) T protein:vir:18 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTP---QDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) T ss_pred eCCcCCCHHHHHHHHHHHHHHhCCcccCCceeccCCceEEecCCCh---hHHHHHHHHHHhHHHHHHHhCCCHHHhCCCC Confidence 8765 578888888888876533 2468999999887664332 2224777778888999988421 23333333 Q ss_pred cccc--c-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCC Q lcl|NC_020866. 
315 AGGH--A-VSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGME 391 (547) Q Consensus 315 ggs~--a-~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~ 391 (547) ++++ + ..+........-+...++.|+..||+.|+..- .....+.+|.++..-..|.+.+++.+.++.+.|+ T Consensus 307 ~~t~~~sn~eq~~~~f~~~tl~P~~~~ie~~ln~~L~~~~-----~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~- 380 (424) T protein:vir:18 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPSK-----DVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGL- 380 (424) T ss_pred CcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc-----ccCCeEEEEechhhhccCHHHHHHHHHHHHhCCC- Confidence 3333 2 22333455567788889999999999875532 1112233444444456789999999999999998 Q ss_pred CCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccc Q lcl|NC_020866. 392 IEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTA 438 (547) Q Consensus 392 v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (547) ++.+++|+.+|+|+-+.+++.+.+.....-........+.. ..+ T Consensus 381 ~T~NE~R~~~gl~pi~ggD~~~~~~n~~~l~~~~~~~~~~~---n~a 424 (424) T protein:vir:18 381 RTINEMRRTDNMPPLPGGDVAMRQAQYVPITDLGTNKEPRN---NGA 424 (424) T ss_pred cCHHHHHHHhCCCCCCCcCeeeeccCccchhhhhccCCccc---cCC Confidence 99999999999997666665543332111000000000000 000 No 29 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=99.89 E-value=1.9e-22 Score=139.64 Aligned_cols=397 Identities=11% Similarity=-0.026 Sum_probs=236.4 Q ss_pred CCC---CC--------eeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHH Q lcl|NC_020866. 1 MAQ---KP--------VLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAE 69 (547) Q Consensus 1 ma~---~~--------~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~ 69 (547) |.- +. .|++...+ ++.++.+-. +.. + ..+... +++ .+ |.... . 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~lf~---~~~~~~~~~-~~~-~--~~~~~~---~~~---------~~--~~~vs-----~ 54 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDALFR---SKSLENPST-PIT-G--DAVDTD---GLF---------RA--DVYVS-----P 54 (424) T ss_pred CeeEeeeceecCcchhHHHHhhcc---ccCCCCCcc-ccc-h--hhhhhh---ccc---------cC--Cceec-----h Confidence 110 00 02333222 122222211 100 0 011000 000 00 00000 0 Q ss_pred HHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEE Q lcl|NC_020866. 70 VIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVW 143 (547) Q Consensus 70 ~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w 143 (547) +...+-+.|.+|++.+...|.+++|+|.-.+++...+....-+.++|. ++ ...++.+.++ +.+.+|-+++++++ T Consensus 55 ~~al~~~~v~~cv~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r 134 (424) T protein:vir:45 55 ETAMKLAAVYSCIYVLSSSLAQMPLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKR 134 (424) T ss_pred HHhhccHHHHHHHHHHHHHHhhCceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEE Confidence 000145779999999999999999999643221111110112334442 33 3555666555 56789999999987 Q ss_pred eecCCeeeEEeeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHH Q lcl|NC_020866. 144 NTSEGQYFPDKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTAR 223 (547) Q Consensus 144 ~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~ 223 (547) ...| .+..|.+++|..+.+..++....+..........+++...|++++.. 
.+..+|.+.+..+....-.-....+ T Consensus 135 ~~~G---~~~~L~~l~~~~v~i~~~~~~~~y~~~~~~~~~~~~~~eVih~r~~~-~d~~~G~spi~~~~~~i~~~~~~~~ 210 (424) T protein:vir:45 135 NRRG---EVISLDCCMPWETTLMNTGGRYTYGLYNEYGAFAISPDDMIHIRALG-NNQKMGLSPIMQHAETIGMGMSGQK 210 (424) T ss_pred cCCC---cEEEEEEecCceEEEEEcCCeEEEEEEecCceEEECcccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHH Confidence 5433 35689999999888876655444444444445678888877666654 4568999999999988888888888 Q ss_pred HHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc------CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHH Q lcl|NC_020866. 224 DWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG------DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLD 297 (547) Q Consensus 224 ~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~------~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d 297 (547) +...|...-+.|--|.+++...++++++++.+.+.+... +..+|++.|++++-++.+. .-.-|.+..++.. T Consensus 211 ~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~ 287 (424) T protein:vir:45 211 YTESFFSGNARPAGIVSVKSGLNKESWGWLKDQWQKASQALRRQENKTMLLPADLDYKALTVSP---VDAQIIDMMKLNR 287 (424) T ss_pred HHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeEcCCCceEEEccCCh---hHHHHHHHHHHHH Confidence 888899888999999999988899988888777765321 3467899998887664332 1123667777888 Q ss_pred HHHHHHHhc-chhccCccccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEec--Ccccc Q lcl|NC_020866. 298 QQISKAVLG-QTTTTDAVAGGHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIA--RPKTE 373 (547) Q Consensus 298 ~~Isk~ilG-qtlts~~~ggs~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~--~~~~e 373 (547) .+|+++.-- ..+....++++++-.+-.. .....-+...++.|++.||+.|+..--.. ... +|+|+ ..-.. 
T Consensus 288 ~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~kLl~~~e~~----~g~--~i~fd~~~llr~ 361 (424) T protein:vir:45 288 SMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMPWVTNWEQELNRRLFTRAELA----AGY--YVRFNLTGLLRG 361 (424) T ss_pred HHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhc----CCc--EEEeechhhhcc Confidence 999988421 2333333345555444333 34456688889999999998876532111 111 35554 33457 Q ss_pred CHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccc Q lcl|NC_020866. 374 DLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQ 448 (547) Q Consensus 374 d~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (547) |.+.+++.+.++++.|+ ++.+++|+.+|+|+-+.+++.+.+..... +.......+.....+ +. T Consensus 362 d~~~r~~~~~~~~~~g~-~T~NE~R~~~gl~pi~ggD~~~~~~n~~~-~~~~~~~~~~~~~~~----------~~ 424 (424) T protein:vir:45 362 TPQERAQFYHFAITDGW-MSRNEARAFEDMNPVEGLDEMLVSVNAAN-PAGDFKPPKNDEGKT----------NE 424 (424) T ss_pred CHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCcceeeecccccc-cccccCCCCCCCCCC----------CC Confidence 89999999999999997 99999999999997666676655432211 110000000000000 00 No 30 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=99.89 E-value=2.8e-22 Score=138.72 Aligned_cols=394 Identities=12% Similarity=0.030 Sum_probs=237.1 Q ss_pred ccchhh-hhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHh---hcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHh Q lcl|NC_020866. 15 VQRAQL-TEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRA---ADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVS 90 (547) Q Consensus 15 ~~~~~l-~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~---a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~ 90 (547) |-...+ .+.-..+ -.++..+...+-. ...|... -.+...+=+.|.+|+..+-..|. 
T Consensus 1 ~~f~~~f~r~~~~~---------------~~~~~~~~~~~~~~~~~~~g~~v-----~~~~~l~~~~v~~~i~~Ia~~iA 60 (413) T protein:vir:48 1 MFFSGLFQRKSDAP---------------VTTPAELAEAIGLSYDTYTGKRI-----SSQRAMRLTAVYSCVRVLAESVG 60 (413) T ss_pred CccchhhccCccCC---------------ccchHHHHHhhhcCcccccCcee-----chhhhhccHHHHHHHHHHHHhhh Confidence 221111 0000000 0111111111100 1111110 01111145789999999999999 Q ss_pred cCCceEecCCCCHHHHHHHHHHHHHHh-C----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeE Q lcl|NC_020866. 91 QLDITVEPASDDPQDVKIADMVRDWLK-R----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFR 164 (547) Q Consensus 91 ~~~w~v~p~~~~~~~~~~ae~v~~~l~-~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~ 164 (547) ++++++...+++-.....-.-+..+|+ + +...++++.++ +.+.+|-+++++++. .| .|..|.++++.++. T Consensus 61 ~~p~~~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~--~g--~~~~L~~l~~~~v~ 136 (413) T protein:vir:48 61 MLPCSLYKISGTLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA--LG--EVVELLPIDPGCVE 136 (413) T ss_pred hCceEEEEecCCcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC--CC--cEEEEEEEcCceEE Confidence 999999754332111110111333342 2 34666777766 567899999988753 34 36789999999887 Q ss_pred ecccCCccee--eccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCC Q lcl|NC_020866. 165 FQKSDLATPL--LLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQ 242 (547) Q Consensus 165 ~~~~~~~~~~--~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~ 242 (547) +..+....+. .....+....+++..+|++++.. 
.+.++|.|.+..+....-.-....++...+...-|.|--+.+++ T Consensus 137 ~~~~~~~~~~y~~~~~~g~~~~~~~~evih~~~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~ 215 (413) T protein:vir:48 137 PKLNSQWQPVYQVTFPDGSVDVLTQDEIWHVRTLT-LDGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTE 215 (413) T ss_pred EEEcCCceEEEEEEecCceEEEEccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC Confidence 6554443333 33334445678888887776665 45589999999999888888888888888988889998889998 Q ss_pred CCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-chhccCcccc Q lcl|NC_020866. 243 PGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-QTTTTDAVAG 316 (547) Q Consensus 243 ~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-qtlts~~~gg 316 (547) ...++++++++.+.+.+.-+ +..+|++.|++++-+..+. ....|.+..++...+|+.+.-- ..+....+.+ T Consensus 216 ~~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 292 (413) T protein:vir:48 216 QKLTPDAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNA---EDSQFLETRKFQLEEICRLFRVPLHMVQNTDRA 292 (413) T ss_pred CCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEeccCCh---hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCC Confidence 88899999999988876532 2358889998877664322 2224677788888999888422 2233333345 Q ss_pred ccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHH Q lcl|NC_020866. 317 GHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEEN 395 (547) Q Consensus 317 s~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~ 395 (547) +++..+-+. .....-+.-.++.|++.||+.|+.+--. 
...+-+|.++.....|.+.++++++++++.|+ ++.+ T Consensus 293 t~~n~e~~~~~f~~~~i~P~~~~ie~~l~~~L~~~~~~-----~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~-~T~N 366 (413) T protein:vir:48 293 TFNNIEELGLGFINYSLVPYLTRIEQRINTGLVRESKQ-----GKFYAKFNAGALLRGDMKSRFEAYATGINWGI-YSPN 366 (413) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc-----CCeEEEEechhhhccCHHHHHHHHHHHHhCCC-cCHH Confidence 666544343 4445678888999999999887654321 11222333344445688999999999999998 9999 Q ss_pred HHHHHhCCCCCCCcchhcccccccccccccccccccccCCcccccccccccc Q lcl|NC_020866. 396 EVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSG 447 (547) Q Consensus 396 ~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (547) ++|+.+|+|+-+.++..+.+...................+. ....+. T Consensus 367 E~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~-----~~~~~~ 413 (413) T protein:vir:48 367 DCRDLEDMNPRPGGDVYLTPMNMTTSPSAGDDNGKKKESGD-----ADKTAS 413 (413) T ss_pred HHHHHhCCCCCCCcceeeccccccccccccccCCCCCCCCC-----ccccCC Confidence 99999999876666665544322111111100000000000 000000 No 31 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=99.89 E-value=1.6e-22 Score=140.04 Aligned_cols=403 Identities=14% Similarity=0.090 Sum_probs=241.4 Q ss_pred CCCCCeeeCCCCC---cccchhhh----hhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhc-CCCHHHHHHHHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGN---PVQRAQLT----EEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAAD-HGDPVRYLDLAEVIE 72 (547) Q Consensus 1 ma~~~~~~~~~g~---~~~~~~l~----~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~-~g~~~~~~~L~~~m~ 72 (547) |- |+|..-+ |....... .+.+++......+. ..+++-..+...++... .|. . ... 
+..+ T Consensus 1 Mg----l~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-----~~~~~~~~~~~~~~~~~~~g~--~-v~~-~~al 67 (431) T protein:vir:10 1 MG----LFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGET-----FEGLDDPRLKEYIRRGELNGG--T-GRE-TRAL 67 (431) T ss_pred Cc----chhhhhcCcccccccccccccccccccccccccccc-----cccccchHHHHhhccCccCcc--e-ech-hhhh Confidence 88 7777433 11111110 11111111111111 11122112222222111 110 0 000 1111 Q ss_pred hhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeec Q lcl|NC_020866. 73 ERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTS 146 (547) Q Consensus 73 ~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~ 146 (547) +-+.|.+|++.+...|.+++++|.-.++..+ +..-.-+..+|+ ++ ...++.+.++ +.+.+|-+++++++. T Consensus 68 -~~~~V~~ci~~Ia~~iA~lp~~v~~~~~~~~-~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-- 143 (431) T protein:vir:10 68 -RNMAVLRCVTLISGTIGMLPMNLISSDDSKQ-VLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWS-- 143 (431) T ss_pred -ccHHHHHHHHHHHHhhccCceEEEEecCcee-eeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc-- Confidence 4678999999999999999999965433211 111122334443 33 3556666654 567899999999875 Q ss_pred CCeeeEEeeEeeccceeEeccc--CCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 147 EGQYFPDKLIWRDPRHFRFQKS--DLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARD 224 (547) Q Consensus 147 ~g~~~~~~l~~~~~~~~~~~~~--~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~ 224 (547) +|. +..|.++++.++.+..+ +...+......+..+.++....|++++.. 
.+..+|.|.+..+....-.-....++ T Consensus 144 ~g~--~~~L~pl~~~~v~~~~~~~~~~~y~~~~~~g~~~~~~~~dViHir~~~-~dg~~G~spi~~~~~~i~~~~~~~~~ 220 (431) T protein:vir:10 144 GNR--PIRLIPMDRGSAKGRLTSTWQIVYDYTTPTGDKIELPAREVFHLRDLS-IDGVSGVSRVKLSGNALELAEQAERA 220 (431) T ss_pred CCc--eEEEEEEcCceeEEEEcCCCeEEEEEEeCCceEEEEchhhEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHH Confidence 343 56899999998775433 33333344445566778888877666654 45589999999999888888888899 Q ss_pred HHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHH Q lcl|NC_020866. 225 WAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQ 299 (547) Q Consensus 225 w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~ 299 (547) ...|...-+.|--|.+++...++++++++.+.+.+.-+ +..+|++.|++++=+..+ .....|.+..++..++ T Consensus 221 ~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~---~~d~q~le~r~~~~~~ 297 (431) T protein:vir:10 221 ASRTFRTGVMAGGAIEVPKELSDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFSNT---AASAQQIENRNHQIEE 297 (431) T ss_pred HHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCC---hhHHHHHHHHHHhHHH Confidence 99999999999888999988899999999988877533 245899999887755432 2222466667777888 Q ss_pred HHHHHhc-chhccCccccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHH Q lcl|NC_020866. 300 ISKAVLG-QTTTTDAVAGGHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTK 377 (547) Q Consensus 300 Isk~ilG-qtlts~~~ggs~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~ 377 (547) |+++.-- ..+.....+++++..+-+. .....-+...++.|++.||+.|+..-..- ..+.+|.++..-..|.+. 
T Consensus 298 Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~-----~~~~~fd~~~llr~d~~~ 372 (431) T protein:vir:10 298 VARMYGVPRPLLMMDDTSWGSGIEQLAIFFIQYGLSHWFVSWEQAAARAFLPEKMLG-----QRQFKFNEGALLRGTLND 372 (431) T ss_pred HHHHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhcC-----CceEEEechhhhccCHHH Confidence 9887321 2233333344555444333 34455688888999999998876532111 122344444445679999 Q ss_pred HHHHHHHHHhCCCC---CCHHHHHHHhCCCCCCC--cchhcccccccccccccccccccccCCccccc Q lcl|NC_020866. 378 LASSLDTLVRLGME---IEENEVRSRFGFSQPKP--GAKLLRPLENPATQAAPPGMDPGAGDLKTAER 440 (547) Q Consensus 378 ~a~~~~~l~~~G~~---v~~~~~~e~~Glp~p~~--~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (547) +++.+++++..|+. ++.+++|+.+|+|+-++ ++.+..+.... +... ...+++.. T Consensus 373 r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~gD~~~~p~n~~-~~~~--------~~~~p~~~ 431 (431) T protein:vir:10 373 QAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPVADQLRNPMTQK-QKGS--------GDEPPATT 431 (431) T ss_pred HHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCccccceecccccc-cCCC--------CCCCCCCC Confidence 99999999999875 89999999999987544 55544332211 1100 00000000 No 32 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=99.88 E-value=4.3e-22 Score=137.74 Aligned_cols=410 Identities=12% Similarity=0.060 Sum_probs=244.1 Q ss_pred CCCCCeeeCCCCCcc--cchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPV--QRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~~~g~~~--~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |. |.|...+-+ .++.-.+...... .. ..+..+. +.++. |-. 
+...-..+.+.| T Consensus 1 M~----~~~r~~~~~~~~~r~~~~~~~~~~--~~-~~~~~~~--g~~~~-----------~~~-----v~~~~al~~~~v 55 (432) T protein:vir:10 1 MK----IVDSVKKFFNFEKRQTSQVIELNK--DD-EKLLEWL--GISPS-----------TIS-----VKGKNALKVATV 55 (432) T ss_pred CC----hHHHHHHhcCccccCcccccccCC--ch-HHHHHHh--CCCcC-----------ccc-----cchhhhhccHHH Confidence 88 555421111 1111111111000 00 0111111 11110 000 000001146889 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-C----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeE Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-R----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFP 152 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~ 152 (547) .+|++.+-..|.+++|.|.-.+++...+..-..+..+|+ + +...++++.++ +.+.+|-+++++++...| .+ T Consensus 56 ~~~i~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G---~~ 132 (432) T protein:vir:10 56 FACIKILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKG---KV 132 (432) T ss_pred HHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC---cE Confidence 999999999999999998533222111111112334443 3 34677777777 457799999999875443 36 Q ss_pred EeeEeeccceeEecccCCc-------ceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 153 DKLIWRDPRHFRFQKSDLA-------TPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDW 225 (547) Q Consensus 153 ~~l~~~~~~~~~~~~~~~~-------~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 225 (547) ..|.+++|.++.+..+... ..+....++....+++...|++++....+..+|.|.+..+....-.-....++. 
T Consensus 133 ~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 212 (432) T protein:vir:10 133 QALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFI 212 (432) T ss_pred EEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHH Confidence 7899999988775443221 123334445557789888877776666677899999999999888888888889 Q ss_pred HHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHH Q lcl|NC_020866. 226 AIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQI 300 (547) Q Consensus 226 ~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~I 300 (547) ..+...-|.|--+.+++...++++++++.+.+.+.-+ +..+|+|.|++++-+..+. . ...|.+..++..++| T Consensus 213 ~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~--~-d~q~~e~~~~~~~~I 289 (432) T protein:vir:10 213 NNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNM--S-DAQFLENTELTIRQI 289 (432) T ss_pred HHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCCh--h-HHHHHHHHHHHHHHH Confidence 9999998999888899888899988888888877532 3568999999887765432 2 224667777888999 Q ss_pred HHHHhc-chhccCccccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHH Q lcl|NC_020866. 301 SKAVLG-QTTTTDAVAGGHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKL 378 (547) Q Consensus 301 sk~ilG-qtlts~~~ggs~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~ 378 (547) +++.-- ..+.+....|+++..+-+. 
.....-+...++.|+..||+.|+..-- + ....+.+|.+......|.+.+ T Consensus 290 a~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~---~-~~g~~~~fd~~~l~~~d~~~~ 365 (432) T protein:vir:10 290 ATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSE---L-DKGFYSKFNVDAILRADIKTR 365 (432) T ss_pred HHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhh---c-CCCcEEEeechhhhcCCHHHH Confidence 888421 2233333445665544443 345667888999999999988764321 1 111223444445556799999 Q ss_pred HHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccc Q lcl|NC_020866. 379 ASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQ 448 (547) Q Consensus 379 a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (547) ++.++++++.|+ ++.+++|+.+|+|+-+.++..+.+... .+...........+. ...+.......++ T Consensus 366 ~~~~~~~~~~G~-~t~NE~R~~~g~~pi~ggD~~~~~~n~-~~~~~~~~~~~k~~~-~~~~~~~~~~~~~ 432 (432) T protein:vir:10 366 YEAYRTGIQGGF-LKPNEARSKEDLPPEAGGDRLLVNGNM-LPIDMAGQAYLKGGD-TNGEVSKEGNEGN 432 (432) T ss_pred HHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCeEeecccc-cchhhccccccCCCC-CCCCCCCCCCCCC Confidence 999999999998 999999999999876556655444322 111111110000000 0000000010011 No 33 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=99.88 E-value=4.3e-22 Score=137.74 Aligned_cols=410 Identities=12% Similarity=0.060 Sum_probs=244.1 Q ss_pred CCCCCeeeCCCCCcc--cchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPV--QRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~~~g~~~--~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |. |.|...+-+ .++.-.+...... .. ..+..+. +.++. |-. 
+...-..+.+.| T Consensus 1 M~----~~~r~~~~~~~~~r~~~~~~~~~~--~~-~~~~~~~--g~~~~-----------~~~-----v~~~~al~~~~v 55 (432) T protein:vir:10 1 MK----IVDSVKKFFNFEKRQTSQVIELNK--DD-EKLLEWL--GISPS-----------TIS-----VKGKNALKVATV 55 (432) T ss_pred CC----hHHHHHHhcCccccCcccccccCC--ch-HHHHHHh--CCCcC-----------ccc-----cchhhhhccHHH Confidence 88 555421111 1111111111000 00 0111111 11110 000 000001146889 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-C----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeE Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-R----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFP 152 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~ 152 (547) .+|++.+-..|.+++|.|.-.+++...+..-..+..+|+ + +...++++.++ +.+.+|-+++++++...| .+ T Consensus 56 ~~~i~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G---~~ 132 (432) T protein:vir:10 56 FACIKILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKG---KV 132 (432) T ss_pred HHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC---cE Confidence 999999999999999998533222111111112334443 3 34677777777 457799999999875443 36 Q ss_pred EeeEeeccceeEecccCCc-------ceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 153 DKLIWRDPRHFRFQKSDLA-------TPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDW 225 (547) Q Consensus 153 ~~l~~~~~~~~~~~~~~~~-------~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 225 (547) ..|.+++|.++.+..+... ..+....++....+++...|++++....+..+|.|.+..+....-.-....++. 
T Consensus 133 ~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 212 (432) T protein:vir:10 133 QALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFI 212 (432) T ss_pred EEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHH Confidence 7899999988775443221 123334445557789888877776666677899999999999888888888889 Q ss_pred HHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHH Q lcl|NC_020866. 226 AIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQI 300 (547) Q Consensus 226 ~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~I 300 (547) ..+...-|.|--+.+++...++++++++.+.+.+.-+ +..+|+|.|++++-+..+. . ...|.+..++..++| T Consensus 213 ~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~--~-d~q~~e~~~~~~~~I 289 (432) T protein:vir:10 213 NNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNM--S-DAQFLENTELTIRQI 289 (432) T ss_pred HHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCCh--h-HHHHHHHHHHHHHHH Confidence 9999998999888899888899988888888877532 3568999999887765432 2 224667777888999 Q ss_pred HHHHhc-chhccCccccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHH Q lcl|NC_020866. 301 SKAVLG-QTTTTDAVAGGHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKL 378 (547) Q Consensus 301 sk~ilG-qtlts~~~ggs~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~ 378 (547) +++.-- ..+.+....|+++..+-+. 
.....-+...++.|+..||+.|+..-- + ....+.+|.+......|.+.+ T Consensus 290 a~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~---~-~~g~~~~fd~~~l~~~d~~~~ 365 (432) T protein:vir:10 290 ATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSE---L-DKGFYSKFNVDAILRADIKTR 365 (432) T ss_pred HHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhh---c-CCCcEEEeechhhhcCCHHHH Confidence 888421 2233333445665544443 345667888999999999988764321 1 111223444445556799999 Q ss_pred HHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccc Q lcl|NC_020866. 379 ASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQ 448 (547) Q Consensus 379 a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (547) ++.++++++.|+ ++.+++|+.+|+|+-+.++..+.+... .+...........+. ...+.......++ T Consensus 366 ~~~~~~~~~~G~-~t~NE~R~~~g~~pi~ggD~~~~~~n~-~~~~~~~~~~~k~~~-~~~~~~~~~~~~~ 432 (432) T protein:vir:10 366 YEAYRTGIQGGF-LKPNEARSKEDLPPEAGGDRLLVNGNM-LPIDMAGQAYLKGGD-TNGEVSKEGNEGN 432 (432) T ss_pred HHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCeEeecccc-cchhhccccccCCCC-CCCCCCCCCCCCC Confidence 999999999998 999999999999876556655444322 111111110000000 0000000010011 No 34 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=99.88 E-value=4.3e-22 Score=137.74 Aligned_cols=410 Identities=12% Similarity=0.060 Sum_probs=244.1 Q ss_pred CCCCCeeeCCCCCcc--cchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPV--QRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~~~g~~~--~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |. |.|...+-+ .++.-.+...... .. ..+..+. +.++. |-. 
+...-..+.+.| T Consensus 1 M~----~~~r~~~~~~~~~r~~~~~~~~~~--~~-~~~~~~~--g~~~~-----------~~~-----v~~~~al~~~~v 55 (432) T protein:vir:10 1 MK----IVDSVKKFFNFEKRQTSQVIELNK--DD-EKLLEWL--GISPS-----------TIS-----VKGKNALKVATV 55 (432) T ss_pred CC----hHHHHHHhcCccccCcccccccCC--ch-HHHHHHh--CCCcC-----------ccc-----cchhhhhccHHH Confidence 88 555421111 1111111111000 00 0111111 11110 000 000001146889 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-C----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeE Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-R----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFP 152 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~ 152 (547) .+|++.+-..|.+++|.|.-.+++...+..-..+..+|+ + +...++++.++ +.+.+|-+++++++...| .+ T Consensus 56 ~~~i~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G---~~ 132 (432) T protein:vir:10 56 FACIKILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKG---KV 132 (432) T ss_pred HHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC---cE Confidence 999999999999999998533222111111112334443 3 34677777777 457799999999875443 36 Q ss_pred EeeEeeccceeEecccCCc-------ceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 153 DKLIWRDPRHFRFQKSDLA-------TPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDW 225 (547) Q Consensus 153 ~~l~~~~~~~~~~~~~~~~-------~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 225 (547) ..|.+++|.++.+..+... ..+....++....+++...|++++....+..+|.|.+..+....-.-....++. 
T Consensus 133 ~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 212 (432) T protein:vir:10 133 QALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFI 212 (432) T ss_pred EEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHH Confidence 7899999988775443221 123334445557789888877776666677899999999999888888888889 Q ss_pred HHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHH Q lcl|NC_020866. 226 AIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQI 300 (547) Q Consensus 226 ~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~I 300 (547) ..+...-|.|--+.+++...++++++++.+.+.+.-+ +..+|+|.|++++-+..+. . ...|.+..++..++| T Consensus 213 ~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~--~-d~q~~e~~~~~~~~I 289 (432) T protein:vir:10 213 NNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNM--S-DAQFLENTELTIRQI 289 (432) T ss_pred HHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCCh--h-HHHHHHHHHHHHHHH Confidence 9999998999888899888899988888888877532 3568999999887765432 2 224667777888999 Q ss_pred HHHHhc-chhccCccccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHH Q lcl|NC_020866. 301 SKAVLG-QTTTTDAVAGGHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKL 378 (547) Q Consensus 301 sk~ilG-qtlts~~~ggs~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~ 378 (547) +++.-- ..+.+....|+++..+-+. 
.....-+...++.|+..||+.|+..-- + ....+.+|.+......|.+.+ T Consensus 290 a~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~---~-~~g~~~~fd~~~l~~~d~~~~ 365 (432) T protein:vir:10 290 ATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSE---L-DKGFYSKFNVDAILRADIKTR 365 (432) T ss_pred HHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhh---c-CCCcEEEeechhhhcCCHHHH Confidence 888421 2233333445665544443 345667888999999999988764321 1 111223444445556799999 Q ss_pred HHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccc Q lcl|NC_020866. 379 ASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQ 448 (547) Q Consensus 379 a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (547) ++.++++++.|+ ++.+++|+.+|+|+-+.++..+.+... .+...........+. ...+.......++ T Consensus 366 ~~~~~~~~~~G~-~t~NE~R~~~g~~pi~ggD~~~~~~n~-~~~~~~~~~~~k~~~-~~~~~~~~~~~~~ 432 (432) T protein:vir:10 366 YEAYRTGIQGGF-LKPNEARSKEDLPPEAGGDRLLVNGNM-LPIDMAGQAYLKGGD-TNGEVSKEGNEGN 432 (432) T ss_pred HHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCeEeecccc-cchhhccccccCCCC-CCCCCCCCCCCCC Confidence 999999999998 999999999999876556655444322 111111110000000 0000000010011 No 35 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=99.88 E-value=5.2e-22 Score=137.25 Aligned_cols=388 Identities=13% Similarity=0.060 Sum_probs=235.7 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |- . .++..++. +... ..+. .+..+.+ .++ .|..... 
+..+ +-+.|.+ T Consensus 1 m~-----f---~~~~~~~~--~~~~---~~~~--~~~~~~g--~~~-----------~~~~v~~----~~al-~~~~v~~ 47 (409) T protein:vir:10 1 ML-----F---RKGFKNQS--QEIS---IDDK--KILEWLG--INP-----------SETYVNG----KSCL-KQATVFG 47 (409) T ss_pred Cc-----c---cccccCcC--CCCC---CChH--HHHHHhc--CCc-----------Ccceech----hhhh-ccHHHHH Confidence 44 1 22222211 1111 0000 0111110 011 0000000 0111 3567999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEe Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDK 154 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~ 154 (547) |+..+...|.+++|+|.-..+.....+ -.-+..+|. ++ ...++++.++ +.+.+|-+++++++...|. +.. T Consensus 48 ~i~~ia~~ia~lp~~~~~~~~~~~~~~-~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~---~~~ 123 (409) T protein:vir:10 48 CIRILSDNISKLPIKIYQKKDGIKRVP-DHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGE---IKG 123 (409) T ss_pred HHHHHHHhhhhCceEEEEecCCeeecc-CchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCc---EEE Confidence 999999999999999953322111110 012233343 33 4667777666 4678999999998865433 668 Q ss_pred eEeeccceeEecccCCc--------ceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 155 LIWRDPRHFRFQKSDLA--------TPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWA 226 (547) Q Consensus 155 l~~~~~~~~~~~~~~~~--------~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~ 226 (547) |.++++.++++..+... .+......+....+++...|++++.. .+.++|.|.+..+....-.-....++.. 
T Consensus 124 L~~i~~~~V~v~~~~~~~~~~~~~~~y~~~~~~g~~~~~~~~evih~r~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~ 202 (409) T protein:vir:10 124 LYPLKSDGMKIFVDDTGLLNSENNVWYLYTDDLGQRHKFMSDEILHFKGLT-ADGLAGLSVIELLNHLIENGKSSETYLN 202 (409) T ss_pred EEEEcCCceEEEEcCCccccccceEEEEEEeCCceeEEeccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 99999988765443221 23334455566788888887766554 4558999999999999888888888899 Q ss_pred HHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHH Q lcl|NC_020866. 227 IFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQIS 301 (547) Q Consensus 227 ~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Is 301 (547) .+...-+.|--|.+++...++++++++.+.+.+... +..+|++.|++++=+..+. ....|.+..++..++|+ T Consensus 203 ~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~~~Ia 279 (409) T protein:vir:10 203 NFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSSGLKNAHRIAMLPIGYKFEPISQKL---VDAQFLENSQLTIRQIA 279 (409) T ss_pred HHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhccccccCCceecCCCceEEEccCCh---hhHHHHHHHHHHHHHHH Confidence 999999999888899888889999988888887643 3468899998877654332 22246777888889999 Q ss_pred HHHhc-chhccCccccccchhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHH Q lcl|NC_020866. 302 KAVLG-QTTTTDAVAGGHAVSQEH-RLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLA 379 (547) Q Consensus 302 k~ilG-qtlts~~~ggs~a~~~vh-~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a 379 (547) ++.-- ..+.+..+.++++..+.. 
......-+...++.|+..||+.|+..-- + ....+-+|.++.....|.+.++ T Consensus 280 ~~fgVPp~~lg~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~kL~~~~~---~-~~~~~~~fd~~~ll~~d~~~~~ 355 (409) T protein:vir:10 280 SVFGVKMHQLNDLDRATHSNITEQNREFYIDTLQSILNMYELEINYKLFLISE---I-KNGFYSKFNVDTILRADIKTRY 355 (409) T ss_pred HHhCCCHHHcCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCchh---c-cCCcEEEEechhhhccCHHHHH Confidence 88422 233333334555544433 3445566788888999999887642211 1 1111223333444467999999 Q ss_pred HHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCC Q lcl|NC_020866. 380 SSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDL 435 (547) Q Consensus 380 ~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (547) +.+.++++.|+ ++.+++|+.+|+|+-+.++..+.+.... +....+......+.. T Consensus 356 ~~~~~~~~~G~-~T~NE~R~~lgl~p~~ggD~~~~~~n~~-~~~~~~~~~~kgGe~ 409 (409) T protein:vir:10 356 ESYKEAIQNGF-KTPNEIRELEEDEPLEGGDVLLINGNMI-PVKMAGEQYSKGGEK 409 (409) T ss_pred HHHHHHHhCCC-cCHHHHHHHhCCCCCCCcCeeeeccCcc-chhhccccccccCCC Confidence 99999999998 9999999999998766666554433211 111100000000000 No 36 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=99.88 E-value=9.9e-22 Score=135.73 Aligned_cols=429 Identities=12% Similarity=0.056 Sum_probs=238.3 Q ss_pred CCCCCeeeCC-CCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHH Q lcl|NC_020866. 1 MAQKPVLLDR-WGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYL 79 (547) Q Consensus 1 ma~~~~~~~~-~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~ 79 (547) |. |++. .|++-.. .+.... ..........+.. .......|....- +.. .+=+.|. 
T Consensus 1 Mg----~~~~l~~~~~~~-~~~~~~-~~~~~~~~~~~~~-------------~~~~~~~g~~v~~----~~a-l~~~~v~ 56 (457) T protein:vir:62 1 MG----FWSALFGRGHSP-ALDAAE-GRAWEPYDPSIYN-------------LGATASSGERVTP----HDA-LQVSAVF 56 (457) T ss_pred Cc----hhhhhhcccccc-cccccc-ccccccchhhhhh-------------ccccccCCceech----HHh-hccHHHH Confidence 88 5554 2332111 110000 0000000000000 0000001110000 111 1357799 Q ss_pred HHHHHHHHHHhcCCceEecCCCC-HHHHHHHHHHHHHHhC----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEE Q lcl|NC_020866. 80 GVIGTRKRSVSQLDITVEPASDD-PQDVKIADMVRDWLKR----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPD 153 (547) Q Consensus 80 s~l~~Rk~~v~~~~w~v~p~~~~-~~~~~~ae~v~~~l~~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~ 153 (547) +|++.+-..|.+++++|.-..++ .+..+ ...+..++.. +.+.++++.++ +.+.+|.+++++.+. +|. +. T Consensus 57 ~~i~~ia~~iA~lp~~~~~~~~~~~~~~~-~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~--~g~--~~ 131 (457) T protein:vir:62 57 ASVRLLSETIATLPLSTYSKRGGTRKEID-TPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA--GPN--IA 131 (457) T ss_pred HHHHHHHHhHhhCceEEEEecCCcccccc-chHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC--CCc--EE Confidence 99999999999999999754332 11111 1123333433 34677777777 467899999998764 343 56 Q ss_pred eeEeeccceeEeccc--CCcc---e--eeccCCCCc---eeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHH Q lcl|NC_020866. 154 KLIWRDPRHFRFQKS--DLAT---P--LLLDDNGAE---VPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTAR 223 (547) Q Consensus 154 ~l~~~~~~~~~~~~~--~~~~---~--~~~~~~~~g---~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~ 223 (547) .|.+++|.++.+..+ +... . +.....+.. 
..++++..|++++....+..+|.+.+..+....-.-....+ T Consensus 132 ~l~~l~p~~v~v~~~~~~~~~~~~~~~y~~~~~g~~~~~~~~~~~eiih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~ 211 (457) T protein:vir:62 132 GLDVLDPTKIHVHMVMVDGLRRKVFEAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQK 211 (457) T ss_pred EEEEEcCcceEEEEeccCCccceeEEEEEEccCCceeEEEeeCccceEEecCCCCCCceecccHHHHHHHHHHHHHHHHH Confidence 788999888765432 1111 1 111112211 34678888877777777778999999999999888888999 Q ss_pred HHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHH Q lcl|NC_020866. 224 DWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQ 298 (547) Q Consensus 224 ~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~ 298 (547) +...|...-+.|--|.+++...++++++++.+.+.+..+ +..+|++.|++++-+..+. .-..|.+..++... T Consensus 212 ~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~~ 288 (457) T protein:vir:62 212 YGAHFFRNGAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSP---DEAQFLQTRQFQVP 288 (457) T ss_pred HHHHHHhccCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCCh---hHHHHHHHHHHHHH Confidence 999999999999999999988899999999998887643 3368899998887664332 12246777778888 Q ss_pred HHHHHHhc-chhccCccccccchh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccC Q lcl|NC_020866. 
299 QISKAVLG-QTTTTDAVAGGHAVS---QEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTED 374 (547) Q Consensus 299 ~Isk~ilG-qtlts~~~ggs~a~~---~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed 374 (547) +|+++.-- ..+.++..++++..+ +........-+..-++.++..||+.|+...- ....+.+|.++.....| T Consensus 289 ~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~~l~P~~~~ie~~ln~~L~~~~~-----~~~~~i~fd~~~l~~~d 363 (457) T protein:vir:62 289 EIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSLRPWLERIEAGFNRLLFAETA-----DRFRFVKFNLDEIKRGA 363 (457) T ss_pred HHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCccc-----cCceEEEeechhhhccC Confidence 89888421 233444333443222 2333344556788888899999987765431 11122344444555679 Q ss_pred HHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCc--chhcccccccccccccccccccccCCccccccccccccchhhh Q lcl|NC_020866. 375 LTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPG--AKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLS 452 (547) Q Consensus 375 ~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~--e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (547) .+.+++.+.++++.|+ ++.+++|+.+|+|+-+++ ++.+.+..... ....+...+........++...+. ...... T Consensus 364 ~~~r~~~~~~~~~~G~-~T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 440 (457) T protein:vir:62 364 PKERMELWSLGLQNGI-YSIDEVRAAEDMTPLPDGLGEKYRVPLNLGE-IGEEPEPEPAPAPPAIDPPAEEPA-DDEEPD 440 (457) T ss_pred HHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCcceeeecccccc-ccccccccccCCCccCCCCccCCC-CCCCCC Confidence 9999999999999997 999999999999876655 44443321111 111000000000000000000000 000000 Q ss_pred hhcccchhhhcccccccchhH Q lcl|NC_020866. 453 RGDLSLNSEMPSTGQKTGGAE 473 (547) Q Consensus 453 ~~~~~~~~~~~~~~~~~~~~~ 473 (547) ......... -....++. 
T Consensus 441 ~~~~~~d~~----~~~~~~~~ 457 (457) T protein:vir:62 441 NAEGDPDEG----ETEDDDDA 457 (457) T ss_pred CCCCCCccc----cccccccC Confidence 000000000 00011111 No 37 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=99.88 E-value=5.5e-22 Score=137.12 Aligned_cols=396 Identities=13% Similarity=0.019 Sum_probs=229.7 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhc-cC--CCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPG-DG--LNPLRLAQILRAADHGDPVRYLDLAEVIEERDPH 77 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~-~~--l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~ 77 (547) |- +.|...+..-.+..+. ...+...+...++ .+ +|+ ..+ | +-+. T Consensus 1 ~~--------~~r~~~~~~~~~~~~~--~~~~~~~~g~~~s~~~~~vt~---~~a------------------l--~~~~ 47 (419) T protein:vir:14 1 MF--------FSRQLLSNLGQTQMSA--GGWVSALLGSSRSDSGQVVTP---ASA------------------L--ALTV 47 (419) T ss_pred Cc--------ccccccccccccccCc--chhhHHhhcCCCccCCcccch---HHh------------------h--ccHH Confidence 43 1222222111111111 1111122222111 11 222 111 1 3567 Q ss_pred HHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeee Q lcl|NC_020866. 78 YLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYF 151 (547) Q Consensus 78 v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~ 151 (547) |.+|++.+...|.+++|+|.-.+++...+..-.-+..+|. ++ ...++++.++ +.+.+|.+++++++..+ | . 
T Consensus 48 v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~-G--~ 124 (419) T protein:vir:14 48 LQNCVTLLAESIAQLPIELYERSGEDRKPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSD-G--V 124 (419) T ss_pred HHHHHHHHHHhhccCceEEEEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--c Confidence 9999999999999999999643322111110112333332 33 4666777755 56789999999987543 3 3 Q ss_pred EEeeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 152 PDKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQT 231 (547) Q Consensus 152 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~ 231 (547) |..|.+++|.++.+..++...++.+-....+ ++.+..++.++.. .+..+|.|.+..+....-.-....++...+... T Consensus 125 ~~~l~pl~~~~v~v~~~~~~~~~y~~~~~~~--~~~~~i~h~~~~~-~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~n 201 (419) T protein:vir:14 125 IQGLYPLDNEAVTVMRGSDLKPVYRVRGSDP--MPQRLVHHVRWMS-INGYTGLSPVLLHANAIGHAQAIQQYAGKSFMN 201 (419) T ss_pred EEEEEEecCceEEEEECCCceEEEEEccCcc--cchhheeEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 6789999999988766555555544333332 4444444444444 455899999999999888888888888899999 Q ss_pred cCCCeEEEeCCCCC----CHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHH Q lcl|NC_020866. 232 YGQPLRIGKWQPGA----SEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISK 302 (547) Q Consensus 232 ~G~P~~vgk~~~~~----~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk 302 (547) -+.|--+.+++... ++++++++.+.+.+..+ +..++++.|++++=+..+. . 
...|.+..++...+|++ T Consensus 202 g~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~--~-d~q~~e~~~~~~~~Ia~ 278 (419) T protein:vir:14 202 GTALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMTFRPLSMTN--V-DAALIDALRLSALDIAR 278 (419) T ss_pred cCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCCh--h-hHHHHHHHHHHHHHHHH Confidence 89997777776433 46777888877776533 2358888998777554322 1 12366777788899998 Q ss_pred HHhcc--hhccCccccccchhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEec--CccccCHHH Q lcl|NC_020866. 303 AVLGQ--TTTTDAVAGGHAVSQEHRL-VQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIA--RPKTEDLTK 377 (547) Q Consensus 303 ~ilGq--tlts~~~ggs~a~~~vh~~-v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~--~~~~ed~~~ 377 (547) +. |- .+....++++++..+.+.. ....-+...++.|++.||+.|+.+--.. .-+++|+ .....|.+. T Consensus 279 ~f-gVpp~~lg~~~~~t~s~~E~~~~~f~~~~L~P~~~~ie~~l~~kll~~~~~~-------~~~i~fd~~~l~r~d~~~ 350 (419) T protein:vir:14 279 IY-KIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDLLLPSERK-------QYFIEYNLAGLLRGDQSS 350 (419) T ss_pred Hh-CCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccccC-------CeEEEEechhhhccCHHH Confidence 84 32 2333334456655444433 3456788888899999998765332111 1135554 334678999 Q ss_pred HHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhccc Q lcl|NC_020866. 378 LASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLS 457 (547) Q Consensus 378 ~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (547) +++.+.++++.|+ ++.+++|+.+|+|+-+.++..+.+... 
.+...+ ...+...+.+++ ...+-....++ T Consensus 351 ~~~~~~~~~~~G~-~T~NE~R~~~gl~p~~gGD~~~~~~n~-~~~~~~-~~~~~~~~~~~~--------~~~~e~~~~l~ 419 (419) T protein:vir:14 351 RYAAYAVGRQWGW-LSINDIRRLENMPPVKGGDIYLSPMNM-VDASKP-QQLPVGKSEPTK--------AAIDEIGRILS 419 (419) T ss_pred HHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCcCeeeecccc-cccccc-ccccCCCCCCcc--------ccccchhcccC Confidence 9999999999998 999999999999977666665544221 111111 111111111000 00000011111 No 38 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=99.88 E-value=1.4e-21 Score=134.97 Aligned_cols=427 Identities=12% Similarity=0.039 Sum_probs=237.8 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |. |++...+-...+.+...... .... .++.- -........|....- +..+ +=+.|.+ T Consensus 1 Mg----~~~~l~~r~~~~~~~~~~~~-~~~~------------~~~~~-~~~~~~~~~g~~V~~----~~al-~~~~V~~ 57 (457) T protein:vir:13 1 MG----FWSALFGRGHSPALDGIEAR-AWEP------------YDPSI-YNLGAVAASGETVTP----HDAL-QVSAVFA 57 (457) T ss_pred Cc----hhhhhhcccccccccccccc-cccc------------cchHH-HhhcccccCCceech----HHhh-ccHHHHH Confidence 88 44443222222211111100 0000 00100 000000001110000 0111 3477999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhC----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKR----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) |++.+-..|.+++++|.-..++.........+..+++. +.+.++++.++ +.+.+|.+++++.+. 
+| .+..| T Consensus 58 ~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~--~g--~~~~l 133 (457) T protein:vir:13 58 SVRLLSETIATLPLSTYSKRGGSRKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ--GP--NIVGL 133 (457) T ss_pred HHHHHHHhhccCceEEEEecCCcccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec--CC--cEEEE Confidence 99999999999999997543322111112233344443 34567777776 577899999999764 44 35678 Q ss_pred EeeccceeEeccc--CCc---ce--eeccCCCCc---eeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 156 IWRDPRHFRFQKS--DLA---TP--LLLDDNGAE---VPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDW 225 (547) Q Consensus 156 ~~~~~~~~~~~~~--~~~---~~--~~~~~~~~g---~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 225 (547) .+++|.++.+..+ +.. .. +.....+.. ..+++..+|++++....+..+|.|.+..+....-.-....++. T Consensus 134 ~~l~p~~v~v~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~ 213 (457) T protein:vir:13 134 DVLDPTKIHVHMVMVDGLRRKVFEAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYG 213 (457) T ss_pred EEEccCceEEEEecCCCccceeEEEEEEecCCceeeEEeeCccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHH Confidence 8999887765332 111 11 111222211 3467888877777777777899999999999988888889999 Q ss_pred HHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHH Q lcl|NC_020866. 226 AIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQI 300 (547) Q Consensus 226 ~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~I 300 (547) ..|....+.|--|.+++...++++++++.+.+.+..+ +..+|++.|++++-+..+. . 
...|.+..++...+| T Consensus 214 ~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~--~-d~q~~e~~~~~~~~I 290 (457) T protein:vir:13 214 SKFFANGAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSP--D-EAQFLQTRQFQVPEI 290 (457) T ss_pred HHHHhcCCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCCh--h-HHHHHHHHHHHHHHH Confidence 9999999999999999988899999999998887643 3468999998887665332 1 123666677888889 Q ss_pred HHHHhc-chhccCccccccc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHH Q lcl|NC_020866. 301 SKAVLG-QTTTTDAVAGGHA---VSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLT 376 (547) Q Consensus 301 sk~ilG-qtlts~~~ggs~a---~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~ 376 (547) +++.-- ..+.+...++++. ..+........-+...++.|+..||+.|+...- ....+.+|.+......|.+ T Consensus 291 a~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~ln~~L~~~~~-----~~~~~i~fd~~~l~~~D~~ 365 (457) T protein:vir:13 291 ARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSLRPWLERIEAGFNRLLFAETA-----DRFRFVKFNLDEIKRGAPK 365 (457) T ss_pred HHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc-----cCceeEEeechhhhccCHH Confidence 888421 2333433333332 223333344556788888899999887755431 1112234444455567999 Q ss_pred HHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCc--chhcccccccccccccccccc----cccCCccccccccccccchh Q lcl|NC_020866. 377 KLASSLDTLVRLGMEIEENEVRSRFGFSQPKPG--AKLLRPLENPATQAAPPGMDP----GAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 377 ~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~--e~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~ 450 (547) .+++.+.++++.|+ ++.+++|+.+|+|+-+++ +..+.+.... +....+...+ .....+...+...+...... 
T Consensus 366 ~r~~~~~~~~~~G~-~T~NE~R~~~gl~Pi~~g~~d~~~~~~n~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 443 (457) T protein:vir:13 366 ERMELWSLGLQNGI-YSIDEVRAAEDMTPLPDGLGEKYRVPLNLG-EVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKP 443 (457) T ss_pred HHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCcccceeeccccc-cccccccccccCCCCCCCCCccccCCCCCCCCCC Confidence 99999999999997 999999999999865554 4444332111 1000000000 00000000000000000000 Q ss_pred hhhhcccchhhhcccccccchhH Q lcl|NC_020866. 451 LSRGDLSLNSEMPSTGQKTGGAE 473 (547) Q Consensus 451 ~~~~~~~~~~~~~~~~~~~~~~~ 473 (547) ... ........+|. T Consensus 444 d~~---------~~~~~~~~~~~ 457 (457) T protein:vir:13 444 DDE---------GATEEDDEDDA 457 (457) T ss_pred ccc---------cCCCCcccccC Confidence 000 00000000111 No 39 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.88 E-value=3.6e-21 Score=132.69 Aligned_cols=393 Identities=14% Similarity=0.098 Sum_probs=227.5 Q ss_pred HHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCC---CHHHHHHHHHHHHHHhCC--------------CHHHHHHHHH- Q lcl|NC_020866. 68 AEVIEERDPHYLGVIGTRKRSVSQLDITVEPASD---DPQDVKIADMVRDWLKRE--------------ELQQEVFHIL- 129 (547) Q Consensus 68 ~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~---~~~~~~~ae~v~~~l~~~--------------~~~~~i~~~l- 129 (547) .+++.+..+.|.+|++.+...|.+++|.|.+..+ ........+.+...+... .+.++++.++ T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 3455567899999999999999999999976432 112222223333333211 2345555544 Q ss_pred HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcce---------------------------ee----ccC Q lcl|NC_020866. 
130 DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATP---------------------------LL----LDD 178 (547) Q Consensus 130 ~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~---------------------------~~----~~~ 178 (547) +...+|.+++++++...| .+..|.++++.+++...+..... .. ... T Consensus 81 ~l~l~Gn~~i~~~r~~~G---~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDG---TPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVDADDGS 157 (467) T ss_pred HHHhcCCeEEEEEECCCC---cEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceeeeeeeecccc Confidence 577899999999986544 35678888887765433321100 00 011 Q ss_pred CCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeC-CCCCCHHHHHHHHHHH Q lcl|NC_020866. 179 NGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKW-QPGASEQDKDTLFRAV 257 (547) Q Consensus 179 ~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~-~~~~~~~~~~~l~~al 257 (547) .+....+|....|+++.....+..||.+.+..+......-....++-..|...-+.|--+.++ +...++++++.+.+.+ T Consensus 158 ~~~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~ 237 (467) T protein:vir:31 158 TGTSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLI 237 (467) T ss_pred ccceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHH Confidence 233456788888877766667788999999999888777777777777788888888766665 4457888888888887 Q ss_pred HHHhc----------------CeeEEecCCce-----EEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-chhccCccc Q lcl|NC_020866. 258 ANIAG----------------DCAAIIPETMS-----IDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-QTTTTDAVA 315 (547) Q Consensus 258 ~~~~~----------------~~~~vip~~~~-----ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-qtlts~~~g 315 (547) .+... 
+..++++.|++ ++|...+......-.|.+..++..++|+++.-- ..+.+...+ T Consensus 238 ~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~ 317 (467) T protein:vir:31 238 EDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVES 317 (467) T ss_pred HhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCC Confidence 75433 23456776665 444433221121224778888888899987311 233333333 Q ss_pred cccch-h-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCC Q lcl|NC_020866. 316 GGHAV-S-QEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIE 393 (547) Q Consensus 316 gs~a~-~-~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~ 393 (547) ++++. . +........-+...++.|++.||+.|++..... ...+.+|.+......|.+..++.+..+++.|+ +| T Consensus 318 ~~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~----~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~-~T 392 (467) T protein:vir:31 318 GAFSTDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDA----PDWTIEFELAKPDTKLQDVEIASQRVQAMQGL-LT 392 (467) T ss_pred CCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhcc----CCceEEEecchhhccCHHHHHHHHHHHHhCCC-cC Confidence 44432 2 223334456688889999999999876543222 11223555556667899999999999999998 99 Q ss_pred HHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchhH Q lcl|NC_020866. 394 ENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAE 473 (547) Q Consensus 394 ~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 473 (547) .+++|+.+|+|+- +++..+......................+...... ..+.. 
T Consensus 393 ~NE~R~~~Gl~pi-~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------------~~~~~ 445 (467) T protein:vir:31 393 VNELRDEFGFEPF-PEEHVYGGETLVAEVTGGSGPGGGIGDQIEQLVED--------------------------RADEI 445 (467) T ss_pred HHHHHHHhCCCCC-CcccccCCcccccccccccCCCCcccCcCCCCCCC--------------------------cccch Confidence 9999999999754 33333221111111111000000000000000000 00111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHccCC Q lcl|NC_020866. 474 IAALTDQLELEGQPEIVAMIEEIRTMLEAASS 505 (547) Q Consensus 474 ~~~l~~~~~~~~~~~~~~~~~~i~~~l~~a~s 505 (547) ++.+...+. .++..+.-++|.| T Consensus 446 ~~~~~~~~~----------~~~~~~~~~~~~~ 467 (467) T protein:vir:31 446 IDSYQADLE----------TEQLIEIGANADS 467 (467) T ss_pred Hhhhhhccc----------cchhhhhccccCC Confidence 222211111 1122233334444 No 40 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=99.88 E-value=6.1e-22 Score=136.89 Aligned_cols=413 Identities=12% Similarity=0.028 Sum_probs=237.9 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhh---hccCCCHHHHHHHHHhhcCCCHHHHHHH-HHHHHhhhH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGY---PGDGLNPLRLAQILRAADHGDPVRYLDL-AEVIEERDP 76 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~---~~~~l~p~~~~~ilr~a~~g~~~~~~~L-~~~m~~~D~ 76 (547) |++ .+ |+.+.+ +.. ..++.+.+. ..+.-++.-|...+- +.......+ .+.. .+-+ T Consensus 1 ~~~---~l---~~~~~~-~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~g----~~~~~g~~v~~~~a-l~~~ 59 (434) T protein:vir:43 1 MSK---SL---GKVLSS-ATS---------APRSSLFGWGGKTIRLTDGAFWSQFLG----RESSSGKKVTVDKA-MKLS 59 (434) T ss_pred Ccc---ch---hhhhhh-ccc---------ccchhhhcccccccccCchHHHHHHhc----CCccCCceechhhh-hccH Confidence 764 22 221111 111 111111111 111112322222221 110000001 0111 1457 Q ss_pred HHHHHHHHHHHHHhcCCceEecCC-CCHHHHHHHHHHHHHH-hCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCe Q lcl|NC_020866. 
77 HYLGVIGTRKRSVSQLDITVEPAS-DDPQDVKIADMVRDWL-KRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQ 149 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~v~p~~-~~~~~~~~ae~v~~~l-~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~ 149 (547) .|.+|+..+-..|.+++|++.-.+ +....+...-.+..+| .++ ...++++.++ +.+.+|-+++.+.+. +|. T Consensus 60 ~V~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~--~G~ 137 (434) T protein:vir:43 60 AVWACVRLISTSVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA--AGR 137 (434) T ss_pred HHHHHHHHHHHhhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC--CCc Confidence 799999999999999999984322 1111111111233334 233 3556666666 567899998887643 454 Q ss_pred eeEEeeEeeccceeEecccCCcc--eeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 150 YFPDKLIWRDPRHFRFQKSDLAT--PLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAI 227 (547) Q Consensus 150 ~~~~~l~~~~~~~~~~~~~~~~~--~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 227 (547) +..|.+++|.++.+..+..+. +......+..+.+++..+|++++... +..+|.+.+..+....-......++... T Consensus 138 --~~~L~~l~p~~v~~~~~~~g~~~y~~~~~~g~~~~~~~~eVih~~~~~~-dg~~G~spi~~~~~~i~~~~~~~~~~~~ 214 (434) T protein:vir:43 138 --PAALDFLLPSRVDLECDENGRLKYFYTTKKGARREIERTNMLHIPAFTL-DGRIGLSAIRYGVDVFGSVMSAEDAANG 214 (434) T ss_pred --EEEEEEEcCcceEEEEcCCCeEEEEEEecCceEEEEccccEEEecCcCC-CCccccCHHHHHHHHHHHHHHHHHHHHH Confidence 668999999988765544433 33444445557899998887776644 4589999999999988888888888889 Q ss_pred HHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 228 FTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKA 303 (547) Q Consensus 228 f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ 303 (547) |...-+.|--+.+++...++++++++.+.+.+... +..+|++.|++++=++.+. . 
...|.+..++..++|+++ T Consensus 215 ~f~ng~~~~gil~~~~~l~~e~~~~~r~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~--~-d~q~~e~~~~~~~~Ia~~ 291 (434) T protein:vir:43 215 TFKNGLLPTVAFKVDRILQPAQREEFREYVKSVSGAMNSGRSPVLEQGITPETIGINP--V-DAQLLETREHGVIEICRW 291 (434) T ss_pred HHhccCCcceEEecCCCCCHHHHHHHHHHHHHhcCccccCCccccCCCceEEEccCCh--h-HHHHHHHHHHHHHHHHHH Confidence 99888999999999988899888888888876532 3468899998877654322 2 224777788888999988 Q ss_pred Hhc-chhccCcccccc--c-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHH Q lcl|NC_020866. 304 VLG-QTTTTDAVAGGH--A-VSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLA 379 (547) Q Consensus 304 ilG-qtlts~~~ggs~--a-~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a 379 (547) .-- ..+....++++. + ..+........-+...++.|+..||+.|+..--..+ .+.+|.+......|.+.++ T Consensus 292 fgVPp~~lg~~~~~~~~~s~~e~~~~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~-----~~~~fd~~~llr~d~~~r~ 366 (434) T protein:vir:43 292 FGVPPWMIGQTDKGSNWGTGLEQQMLAFLTFSISSITNQIQQCVNKRLLTAPERIR-----YYAEFSLEGFLKADSAGRA 366 (434) T ss_pred hCCCHHHhCCCcCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcC-----ceEEEechhhhccCHHHHH Confidence 421 233333333332 2 123333445566888899999999987754422111 1223333344457999999 Q ss_pred HHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccch Q lcl|NC_020866. 380 SSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLN 459 (547) Q Consensus 380 ~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (547) +.+.++.+.|+ ++.+++|+.+|+|+-+.+++.+.+... .+............. ..+.....+. 
T Consensus 367 ~~~~~~~~~G~-~T~NE~R~~~gl~p~~ggD~~~~~~n~-~~~~~~~~~~~~~~~-~~~~~~~~~~-------------- 429 (434) T protein:vir:43 367 AWYSTMAQNGF-MTRNEGRRKENLPELPGGDILTVQSNL-VPIDQLGQSNKSQAV-RAALMNWFSQ-------------- 429 (434) T ss_pred HHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCeEeeccCc-cchhhhhccCCCcch-hhhhhccCCC-------------- Confidence 99999999998 999999999999976666655433221 111110000000000 0000000000 Q ss_pred hhhcccccccchh Q lcl|NC_020866. 460 SEMPSTGQKTGGA 472 (547) Q Consensus 460 ~~~~~~~~~~~~~ 472 (547) ..++. T Consensus 430 --------~~~~~ 434 (434) T protein:vir:43 430 --------PEPQE 434 (434) T ss_pred --------CCCCC Confidence 00000 No 41 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=99.88 E-value=1.2e-21 Score=135.22 Aligned_cols=415 Identities=11% Similarity=0.024 Sum_probs=235.8 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |-- -++++-+|+. +..++.+-+. ...+..+. .|... +...+.-.+..++. .. -.+-..+-+.|.+ T Consensus 1 ~~~-~~~~g~~~~~--~~~~~~~~~~-~~~~~~~~---~~~~~-~~~~~~~~~~~~g~--~v-----~~~~a~~~~aV~~ 65 (432) T protein:vir:97 1 MPD-EKKLGLLGQL--KAMFVPPDPV-DIGGGQTF---TPVNA-TARDLGIIISDTGA--AV-----NADAIMRLDAVAA 65 (432) T ss_pred CCC-cccCchhhhh--HhhcCCcccc-cccccccc---ccCch-hhhhhcccccccCc--cc-----chHhhhcchHHHH Confidence 431 2223333332 1111111110 00000000 00000 00000000111111 11 1111124688999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHH-hC----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEe Q lcl|NC_020866. 
81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWL-KR----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDK 154 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l-~~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~ 154 (547) |++.+-..|.+++|.|.-.+.+...+..-.-+..+| .+ +...++++.++ +.+.+|.+.+++++. +|. +.. T Consensus 66 ~v~~Ia~~ia~lp~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~--~g~--~~~ 141 (432) T protein:vir:97 66 CVKLVSQAVAAMPLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT--DGR--IES 141 (432) T ss_pred HHHHHHHhhccCceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec--CCc--EEE Confidence 999999999999999853322111111111123333 23 34566666666 567899999999874 453 678 Q ss_pred eEeeccceeEeccc--CCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020866. 155 LIWRDPRHFRFQKS--DLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTY 232 (547) Q Consensus 155 l~~~~~~~~~~~~~--~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~ 232 (547) |.+++|.++.+..+ ++..+.....++....++.+.+|+.++...+ ..+|.|.+..+...+-.-....++...|...- T Consensus 142 L~~l~p~~v~v~~~~~g~~~y~~~~~~g~~~~~~~~~iih~r~~~~d-g~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng 220 (432) T protein:vir:97 142 LQYLANDRLTITTDTKGNTAYRYRRTDGQMIDIPRQQIWKIMGYSLD-GENGLSAIRYGAQIFGTAIAAEAQAARAFRNG 220 (432) T ss_pred EEEEcCcceEEEEcCCCcEEEEEEecCceEEEEccccEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 99999998876543 3333344444556678888888776665544 47999999999988877777888888888888 Q ss_pred CCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc--chh Q lcl|NC_020866. 233 GQPLRIGKWQPGASEQDKDTLFRAVANIAG-DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG--QTT 309 (547) Q Consensus 233 G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG--qtl 309 (547) +.|--|-+++...+++.++++.+.+....+ +..+|++.|++++-+..+. . ...|.+..++...+|+++. 
| ..+ T Consensus 221 ~~~~gil~~~~~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~--~-d~q~~e~~~~~~~~Ia~~f-gVPp~~ 296 (432) T protein:vir:97 221 QLQSVYYQIDRFLTDDQYDSFSKKVSGSVEAGRAPLLEGGMDVKSLGLNP--V-DAQLLQSRQYSVESICRFF-GVPPSM 296 (432) T ss_pred CCcceeEecCCCCCHHHHHHHHHHHhhhhcCCCceecCCCceEEEccCCh--h-HHHHHHHHHHHHHHHHHHh-CCCHHH Confidence 999989999988899999999888876533 4578999999887664332 2 2246777888889999873 3 233 Q ss_pred ccCccccccchh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecC--ccccCHHHHHHHHH Q lcl|NC_020866. 310 TTDAVAGGHAVS----QEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIAR--PKTEDLTKLASSLD 383 (547) Q Consensus 310 ts~~~ggs~a~~----~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~--~~~ed~~~~a~~~~ 383 (547) ......|+++.+ +........-+...++.|+..||+.|+.+--. .. -.|+|+. .-..|.+.+++.+. T Consensus 297 lg~~~~~t~~~~s~~e~~~~~f~~~tl~P~~~~ie~~ln~kLl~~~e~-----~~--~~~~fd~~~llr~d~~~r~~~~~ 369 (432) T protein:vir:97 297 IGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSIALNLLTPAER-----RR--YFADFDTSALLRADSAARSSYYS 369 (432) T ss_pred cCCcCCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcccc-----Cc--eEEEeechhhhccCHHHHHHHHH Confidence 333333333322 22223344567777888888888877543211 11 2455544 34578999999999 Q ss_pred HHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccc Q lcl|NC_020866. 384 TLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSL 458 (547) Q Consensus 384 ~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (547) ++++.|+ ++.+++|+.+|+|+.+.++..+.......+...... .+...+ +. +.+. ..+...+. 
T Consensus 370 ~~~~~G~-~T~NE~R~~~glpp~~g~~~~~~~~~~~~pl~~~~~-~~~~~~---~~--~~~~-----~~~~~~~~ 432 (432) T protein:vir:97 370 QLVNNGL-MTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIGL-QASPEP---AS--GLGN-----QQQDKVSK 432 (432) T ss_pred HHHhCCC-CCHHHHHHHhCCCCCCCCcceEeecccccchhhhcc-cCCCCC---CC--CCCC-----cccccccC Confidence 9999998 999999999999876555444432222111111000 000000 00 0000 00000000 No 42 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=99.88 E-value=1.3e-21 Score=135.16 Aligned_cols=400 Identities=13% Similarity=0.032 Sum_probs=234.0 Q ss_pred CCCCCeeeCCC-CCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHH Q lcl|NC_020866. 1 MAQKPVLLDRW-GNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYL 79 (547) Q Consensus 1 ma~~~~~~~~~-g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~ 79 (547) |- |.+.+ ++|-....-... +.+.........+..+++. . .| +-+.|. T Consensus 1 m~----~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~g~~v~~~---~------------------al--~~~~v~ 48 (419) T protein:vir:57 1 MF----IPQFWKGRPSENRVNWQV-----VPGGMRSSSSQAGVIITPE---T------------------AL--ALSAVR 48 (419) T ss_pred Cc----chhhhccCCccccccccc-----cccccccccccCCceechH---H------------------hh--ccHHHH Confidence 55 44443 332221100000 0000000000000111110 0 11 357799 Q ss_pred HHHHHHHHHHhcCCceEecCCCCH-HHHHHHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeE Q lcl|NC_020866. 80 GVIGTRKRSVSQLDITVEPASDDP-QDVKIADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFP 152 (547) Q Consensus 80 s~l~~Rk~~v~~~~w~v~p~~~~~-~~~~~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~ 152 (547) +|++.+...|.+++|.+.-..++. .....-.-+..+|. ++ ...++++.+. 
+.+.+|-+++++++..+| .| T Consensus 49 ~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G---~~ 125 (419) T protein:vir:57 49 ACVTLLAESVAQLPCVLYRRTENGGREIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRG---DI 125 (419) T ss_pred HHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC---cE Confidence 999999999999999985322111 11100112334442 33 4666777766 567899999999875433 36 Q ss_pred EeeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020866. 153 DKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTY 232 (547) Q Consensus 153 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~ 232 (547) ..|.+++|.++.+..+....+... ....+..++....++.++.. .+..||.|.+..+....-.-....++...|...- T Consensus 126 ~~L~pl~~~~v~v~~~~~g~~~y~-~~~~~~~~~~~~vih~r~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng 203 (419) T protein:vir:57 126 TELIPINPHKVIVLKGPDGMPYYD-IPSIGEILPMRMVHHIKSFS-LDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARG 203 (419) T ss_pred EEEEEEcCcceEEEECCCceEEEE-EcCCceEEchhhEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHcc Confidence 789999999998866555444332 23345567777666665554 4458999999999988888888888888999999 Q ss_pred CCCeEEEeCC----CCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 233 GQPLRIGKWQ----PGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKA 303 (547) Q Consensus 233 G~P~~vgk~~----~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ 303 (547) |.|--+-+++ ...++++++++.+.+.+..+ +..+|++.|++++=+..+. 
.-..|.+..++..++|+++ T Consensus 204 ~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~~~Ia~~ 280 (419) T protein:vir:57 204 TTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQEGMTYKQLSQDN---EKAQLLQSRQYTVNEVCRL 280 (419) T ss_pred CCccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccccceecCCCceEEEcCCCh---hhHHHHHHHHHHHHHHHHH Confidence 9997676664 34567777888877776422 3468889998876554322 2224777788888899988 Q ss_pred Hhc-chhccCccccccchhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHH Q lcl|NC_020866. 304 VLG-QTTTTDAVAGGHAVSQEHRL-VQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASS 381 (547) Q Consensus 304 ilG-qtlts~~~ggs~a~~~vh~~-v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~ 381 (547) .-- ..+....++++++..+.+.. ....-+...++.++..||+.|+.+--. ...+.+|.++.....|++.+++. T Consensus 281 fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~-----~~~~i~fd~~~ll~~d~~~~~~~ 355 (419) T protein:vir:57 281 YKVPPHMIQDLQKSTNNNIEHQGLQYVIYTMLAILKRHESAMMRDLLLPSER-----RDFYIEFNVSSLLRGDQKSRYES 355 (419) T ss_pred hCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc-----CCeEEEEechhhhccCHHHHHHH Confidence 321 22333333455554444443 345668888889999998877543211 11223343444456789999999 Q ss_pred HHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhh Q lcl|NC_020866. 382 LDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRG 454 (547) Q Consensus 382 ~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (547) ++++++.|+ ++.+++|+.+|+|+-+.+++++.+..............+.+.. .+.......... 
T Consensus 356 ~~~~~~~G~-~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~ 419 (419) T protein:vir:57 356 YALGRQWGW-LSVNDIRRMENLTPIPGGDKYLTPLNMVDSKALTGIGKATPQQ--------LKDIEAILCTRN 419 (419) T ss_pred HHHHHhCCC-cCHHHHHHHhCCCCCCCcCeeeeccccccccccccccCCCccc--------CcchhhhhhccC Confidence 999999998 9999999999998766667665543321111111100011111 111111111111 No 43 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=99.88 E-value=4.9e-22 Score=137.43 Aligned_cols=401 Identities=9% Similarity=-0.016 Sum_probs=241.1 Q ss_pred ccchhhhhhhhhhhhhhhHHHHhhhhc--cCCCHHHH---HHHHHh-hcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHH Q lcl|NC_020866. 15 VQRAQLTEEVAAPTVTGVRSPLSGYPG--DGLNPLRL---AQILRA-ADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRS 88 (547) Q Consensus 15 ~~~~~l~~~~a~~~~~~~~~~~~~~~~--~~l~p~~~---~~ilr~-a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~ 88 (547) +.-+.-+-++.+ -.++++++..... +..+|..- .++.-. ...|.. +-.+-..+-+.|.+|+..+-.. T Consensus 1 ~~~~~~~~~~~~--~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----v~~~~al~~~~v~~cv~~Ia~~ 73 (424) T protein:vir:18 1 MEEPKYTIDLRT--NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSS-----INDERILQISTVWRCVSLISTL 73 (424) T ss_pred CCCCcceEeecC--CCchHHHHHhhhccccccccccccccccccccccccccc-----ccHHHhhccHHHHHHHHHHHHh Confidence 333344444432 2233333332211 11122110 000000 001110 1111112568899999999999 Q ss_pred HhcCCceEecCC--CCHHHHHHHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeecc Q lcl|NC_020866. 
89 VSQLDITVEPAS--DDPQDVKIADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDP 160 (547) Q Consensus 89 v~~~~w~v~p~~--~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~ 160 (547) |.+++|++.-.+ +.......-.-+..+|+ ++ ...++++.++ +.+.+|-+++++++...| .+..|.+++| T Consensus 74 iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G---~~~~L~pl~~ 150 (424) T protein:vir:18 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG---DVISLLPLQS 150 (424) T ss_pred hccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC---cEEEEEEecC Confidence 999999985322 11111000111333332 33 3556666666 567899999999875443 2678999999 Q ss_pred ceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe Q lcl|NC_020866. 161 RHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGK 240 (547) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk 240 (547) .++.+..+++...+.....+....+++...|++++.. .+..+|.|.+..+....-.-....++...|...-+.|--+-+ T Consensus 151 ~~V~v~~~~~~~~y~~~~~g~~~~~~~~eIih~r~~~-~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~ 229 (424) T protein:vir:18 151 ANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) T ss_pred cceEEEEcCCeEEEEEEeCCeEEEeccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEE Confidence 9988876665544444455566789988887766654 455899999999998888878888888889988899988888 Q ss_pred CCCC-CCHHHHHHHHHHHHHHhc----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-chhccCcc Q lcl|NC_020866. 241 WQPG-ASEQDKDTLFRAVANIAG----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-QTTTTDAV 314 (547) Q Consensus 241 ~~~~-~~~~~~~~l~~al~~~~~----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-qtlts~~~ 314 (547) ++.+ .++++++++.+.+.++.+ +..+|++.|++++-+..+. ....|.+..++..++|+++.-- ..+..+.. 
T Consensus 230 ~~~~~l~~e~~~~~~~~~~~~~~g~nag~~~vl~~g~~~~~l~~~~---~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~ 306 (424) T protein:vir:18 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTP---QDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) T ss_pred eCCcCCCHHHHHHHHHHHHHHhCCcccCCceeccCCceEEecCCCh---hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Confidence 8765 588888888888876542 2468999999887765432 2224677778888999988421 23333333 Q ss_pred cccc--c-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCC Q lcl|NC_020866. 315 AGGH--A-VSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGME 391 (547) Q Consensus 315 ggs~--a-~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~ 391 (547) ++++ + ..+........-+...++.|+..||+.|++.. .....+.+|.++..-..|.+++++.+.++.+.|+ T Consensus 307 ~~t~~~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~-----~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~- 380 (424) T protein:vir:18 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-----DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGL- 380 (424) T ss_pred CcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc-----ccCCeEEEEechhhhccCHHHHHHHHHHHHhCCC- Confidence 3333 2 12233344566788889999999998876542 1112223333344446789999999999999998 Q ss_pred CCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCcccccc Q lcl|NC_020866. 392 IEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERK 441 (547) Q Consensus 392 v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (547) ++.+++|+.+|+|+-+.+++.+.+.... +..... 
....+....+ T Consensus 381 ~T~NE~R~~~gl~pi~gGD~~~~~~n~~-~l~~~~-----~~~~p~~~ga 424 (424) T protein:vir:18 381 RTINEMRRTDNLPPLPGGDVAMRQSQYV-PITDLG-----TNKEPRNNGA 424 (424) T ss_pred cCHHHHHHHhCCCCCCCcCeeeeccCcc-chHhhh-----ccCCCccCCC Confidence 9999999999999766666554333211 111000 0000000000 No 44 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=99.87 E-value=1.9e-21 Score=134.24 Aligned_cols=412 Identities=11% Similarity=0.028 Sum_probs=237.1 Q ss_pred CCCCCeeeCCCCCcccchhhh--hhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLT--EEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~--~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |.. -++.+-++|.-. .+. .+.+ ..+..+ ..|... +. +...+.....+..... +.. .+-+.| T Consensus 1 ~~~-~~~mg~f~r~~~--~~~~~~~~~---~~~~~~---~~~~~~-~~-~~~~~~~~~~g~~v~~-----~~a-l~~~~V 63 (432) T protein:vir:81 1 MPD-EKKLGLFGQLKA--MFVPPDPVD---IGGGQT---FTPVNA-TA-RDLGIIISDTGAAVNA-----DAI-MRLDAV 63 (432) T ss_pred CCc-hhhcchhhhhhh--hcccccccc---cccccc---cccCcc-ch-hhhcccccccCcccch-----Hhh-hccHHH Confidence 442 233333333100 000 0110 000000 001000 00 0001111111111111 111 146889 Q ss_pred HHHHHHHHHHHhcCCceEecC--CCCHHHHHHHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCee Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPA--SDDPQDVKIADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQY 150 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~--~~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~ 150 (547) .+|+..+-..|.++++.|.-. ++..+... .-+..+|. ++ ...++++.+. +.+.+|-+.+++++. +|. 
T Consensus 64 ~~~i~~Ia~~ia~lp~~~y~~~~~g~~~~~~--~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~--~g~- 138 (432) T protein:vir:81 64 AACVKLVSQAIAAMPLTMYMRTPDGRKEAVN--HPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT--DGR- 138 (432) T ss_pred HHHHHHHHHhhhhCceeeEEecCCcceeccc--chHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec--CCc- Confidence 999999999999999998432 22111111 11333342 33 3556666666 567899999998764 443 Q ss_pred eEEeeEeeccceeEecccCCc--ceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 151 FPDKLIWRDPRHFRFQKSDLA--TPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIF 228 (547) Q Consensus 151 ~~~~l~~~~~~~~~~~~~~~~--~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f 228 (547) +..|.++++.++.+..+..+ .+......+....++.+..+++++...++ .+|.|.+..+....-.-....++...| T Consensus 139 -~~~L~~l~~~~v~v~~~~~g~~~y~~~~~~g~~~~~~~~~iih~r~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~ 216 (432) T protein:vir:81 139 -IESLQYLANDRLTITTDPKGNTAYRYRRTDGQMIDIPKQQIWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARA 216 (432) T ss_pred -EEEEEEEcCCceEEEECCCCcEEEEEEecCceEEEEccccEEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHH Confidence 67899999998877654433 33344445566788888887777665555 799999999998888888888888889 Q ss_pred HHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc- Q lcl|NC_020866. 229 TQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG-DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG- 306 (547) Q Consensus 229 ~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG- 306 (547) ....+.|--+-+++...++++++++.+.+....+ +..+++|.|++++-+..+. 
....|.+..++..++|+++.-- T Consensus 217 f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~---~d~q~le~~~~~~~~Ia~~fgVP 293 (432) T protein:vir:81 217 FRNGQLQSVYYQIDRFLTDDQYDSFAKKVSGSVEAGRAPLLEGGMDVKSLGLNP---VDAQLLQSRQYSVESICRFFGVP 293 (432) T ss_pred HhcCCCcceEEecCCCCCHHHHHHHHHHHhhhhcCCCceecCCCceEEEccCCH---HHHHHHHHHHHHHHHHHHHhCCC Confidence 8888999888899988999999999888876544 4578999999887664332 1224667778888999988421 Q ss_pred chhccCccccccchh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEec--CccccCHHHHHH Q lcl|NC_020866. 307 QTTTTDAVAGGHAVS----QEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIA--RPKTEDLTKLAS 380 (547) Q Consensus 307 qtlts~~~ggs~a~~----~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~--~~~~ed~~~~a~ 380 (547) ..+....+.|+++.+ +........-+..-++.|+..||+.|+.+--. .. -+|+|+ ..-..|.+.+++ T Consensus 294 p~~lg~~~~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~l~~kLl~~~~~-----~~--~~~~fd~~~llr~d~~~r~~ 366 (432) T protein:vir:81 294 PSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSIALNLLSPAER-----RR--YFADFDTSALLRADSAARSS 366 (432) T ss_pred HHHcCCcCCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc-----Cc--eEEEeechhhhccCHHHHHH Confidence 233344333444322 22223344567778888888888877554211 11 245554 334678999999 Q ss_pred HHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccc Q lcl|NC_020866. 381 SLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSL 458 (547) Q Consensus 381 ~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (547) .+.++++.|+ ++.+++|+.+|+|+-+.+++.+.......+...... .+...+ +....... . ...+. 
T Consensus 367 ~~~~~~~~G~-~t~NE~R~~~glpp~~g~~~~~~~~~~~~pl~~~~~-~~~~~~---~~~~~n~~--~-----~~~~~ 432 (432) T protein:vir:81 367 YYSQLVNNGL-MTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIGL-QASPEP---ASGLGNQQ--Q-----DKVSK 432 (432) T ss_pred HHHHHHhCCC-CCHHHHHHHhCCCCCCCCcceEeecCcccchhhhcc-CCCCCC---CCCCCCcc--c-----ccccC Confidence 9999999998 999999999999876655544432222111111000 000000 00000000 0 00000 No 45 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=99.87 E-value=3.6e-21 Score=132.65 Aligned_cols=413 Identities=11% Similarity=0.033 Sum_probs=225.7 Q ss_pred CCCCC---eeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhh-----hccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHH Q lcl|NC_020866. 1 MAQKP---VLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGY-----PGDGLNPLRLAQILRAADHGDPVRYLDLAEVIE 72 (547) Q Consensus 1 ma~~~---~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~-----~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~ 72 (547) |-=-+ -.+|..||.=.++.+. +..+|... ....-++..+...+-.....+...+ ...-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~a 67 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRKELV----------VVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQY---KDIEA 67 (441) T ss_pred CceecCccceeccccccchhhhhh----------ccccccccccccccCCCcchHHHHHHhhcccccCcccc---chhhh Confidence 21000 0223333221111000 00000000 0001111111222211111111111 00001 Q ss_pred hhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHH-hCC----CHHHHHHHHH-HhhhhcCeeeeeEEeec Q lcl|NC_020866. 73 ERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWL-KRE----ELQQEVFHIL-DCISKGYSFTEIVWNTS 146 (547) Q Consensus 73 ~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l-~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~ 146 (547) .+-+.|.+|++.+-..|.++++++...+....... +..+| .++ ...++++.+. 
+.+.+|.+++++++..+ T Consensus 68 l~~~~V~acv~~Ia~~iA~lpl~~~~~~~~~~~~~----~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~ 143 (441) T protein:vir:98 68 IRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDR----IVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT 143 (441) T ss_pred hccHHHHHHHHHHHHhhccCceEEecCCcccccch----HHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC Confidence 14677999999999999999999975432211111 22333 233 3455666555 46789999999987543 Q ss_pred CCeeeEEeeEeeccceeEecccCCcceeec--c--CC--CCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHH Q lcl|NC_020866. 147 EGQYFPDKLIWRDPRHFRFQKSDLATPLLL--D--DN--GAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAF 220 (547) Q Consensus 147 ~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~--~~--~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~ 220 (547) | .|..|.+++|.++.+..+..+.+... . .. +....+++...|++++...+ ..+|.|.+..+....-.-.. T Consensus 144 G---~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~dviHir~~~~d-g~~G~spi~~~~~~i~~~~a 219 (441) T protein:vir:98 144 G---EPMNLTFRKTSEIELKLDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLD-GINGLSLLDTLSRTIESDNN 219 (441) T ss_pred C---cEEEEEEEcCceeEEEECCCCcEEEEEEEeccCcceeeEEEccccEEEeccCCCC-CccccCHHHHHHHHHHHHHH Confidence 3 46789999999887655444433322 1 11 22346788888877765544 47999999999988888888 Q ss_pred HHHHHHHHHHHcCCCeEEEeCCCCC-CHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHH Q lcl|NC_020866. 221 TARDWAIFTQTYGQPLRIGKWQPGA-SEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRIN 294 (547) Q Consensus 221 ~~~~w~~f~E~~G~P~~vgk~~~~~-~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~ 294 (547) ..++...|.+.-+.|--|.+++... ++++++++.+.+.+.-+ +..+|++.|++++-+..+. 
..-.|.+..+ T Consensus 220 ~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~---~d~q~~e~r~ 296 (441) T protein:vir:98 220 GKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDT---EVLKLIRENK 296 (441) T ss_pred HHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCCh---hHHHHHHHHH Confidence 8888899999999999888988765 45667777777766433 2368999999877664322 1113667777 Q ss_pred HHHHHHHHHHhcc-hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCcccc Q lcl|NC_020866. 295 HLDQQISKAVLGQ-TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTE 373 (547) Q Consensus 295 ~~d~~Isk~ilGq-tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~e 373 (547) +..++|+++.--. .+.... +++.+..+.. ..+..-+...++.|+..||+.|.... ...+.+|..+..... T Consensus 297 ~~~~~Ia~~fgVPp~~lg~~-~~~~s~~q~~-~~y~~tl~P~~~~ie~~ln~~L~~~~-------~~~~~~fd~~~llr~ 367 (441) T protein:vir:98 297 SSTREIAGVFGIPLHKFGIE-TANMSITDAN-LDYLSTLKPYITCVCAELNFKFNDEY-------VNREFKFDTTEIRVV 367 (441) T ss_pred HhHHHHHHHhCCCHHHcCCC-CCCccHHHHH-HHHHHHHHHHHHHHHHHHHhhccccc-------cCceEEEechhhhcc Confidence 8888999884211 222221 1222222222 22334677788888888888764321 112223444444567 Q ss_pred CHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcch-hcccccccccccccccccccccCCcccccccccccc Q lcl|NC_020866. 374 DLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAK-LLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSG 447 (547) Q Consensus 374 d~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (547) |.+.+++.++++++.|+ ++.+++|+.+|+|+-+.++. +........+....+..............++..... 
T Consensus 368 d~~~~~~~~~~~~~~G~-~T~NE~R~~~gl~pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~~~~~~kgGe~ne 441 (441) T protein:vir:98 368 DEKTQAEIDKINIDSGK-MNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred CHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCCCCCC Confidence 89999999999999997 99999999999986555553 222222221111111000000000000000000000 No 46 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=99.86 E-value=5.9e-21 Score=131.47 Aligned_cols=416 Identities=9% Similarity=0.004 Sum_probs=233.1 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHh--hcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRA--ADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~--a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |=+ + ..+.+++.-++ +.+ |.+.|.+.-++.-|. .+-. ...|.... .+..+ +=+.| T Consensus 1 ~~~----------~-~~~~~~~~~~~-----~~~-~~g~~~s~~~~~~~~-~~~~~~~~~g~~v~----~~~al-~~~~v 57 (437) T protein:vir:10 1 MKQ----------G-KQRALGRIKSS-----FLK-WLGVPISLTDGSFWS-AWGGMGSSSGETVT----ADSAL-QLSAV 57 (437) T ss_pred CCc----------c-hhhhhhhhHHh-----hhh-hcCCcccCCchhHHH-hhcccccCCCceec----hHhhh-ccHHH Confidence 321 0 00111111100 000 111122111111111 1100 00011000 01111 45779 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCH-HHHHHHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeee Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPASDDP-QDVKIADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYF 151 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~-~~~~~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~ 151 (547) .+|++.+...|.+++|++...+.+. .....-.-+..+|. ++ ...++++.++ +.+.+|-+++++++. +| . 
T Consensus 58 ~~ci~~Ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~--~g--~ 133 (437) T protein:vir:10 58 WSCVRLIAETIATLPLNLYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS--AG--V 133 (437) T ss_pred HHHHHHHHHHHhhCceeEEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec--CC--c Confidence 9999999999999999985432211 11100111223332 33 4667777777 457899999998864 34 3 Q ss_pred EEeeEeeccceeEecccCCc--ceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 152 PDKLIWRDPRHFRFQKSDLA--TPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFT 229 (547) Q Consensus 152 ~~~l~~~~~~~~~~~~~~~~--~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 229 (547) +..|.+++|.++.+..+..+ .+.....++....+++...|++++.. .+.++|.|.+..+....-.-....++...|. T Consensus 134 ~~~L~~l~p~~v~i~~~~~g~~~y~~~~~~g~~~~~~~~dIih~r~~~-~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f 212 (437) T protein:vir:10 134 LIGLELMLPQRTTVKRLTSGALQYTYRNVDGTVSTLAEDDVFHVRGFS-LDGLMGLTPIQYAREVLGNSTAANKTSASVF 212 (437) T ss_pred EEEEEEEcCcceEEEECCCCeEEEEEEecCceEEEEccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 66899999998877554333 33334445555678888877666554 4558999999999988888888888999999 Q ss_pred HHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 230 QTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAV 304 (547) Q Consensus 230 E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i 304 (547) +.-+.|--+.+++...++++++++.+.+.+.-+ +..+|++.|++++=+..+. .. -.|.+..++..++|+++. 
T Consensus 213 ~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~--~d-~q~~e~~~~~~~~Ia~~f 289 (437) T protein:vir:10 213 RNGLRPSGVLSTDQILQKEKRAEIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNP--GD-VQLLETRAFNIEEICRWY 289 (437) T ss_pred hccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceeccCCceEEeccCCh--hh-HHHHHHHHHHHHHHHHHh Confidence 998999888899888899998888888876532 2468899998776554332 22 236777778888999884 Q ss_pred hc-chhccCcccccc--c-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHH Q lcl|NC_020866. 305 LG-QTTTTDAVAGGH--A-VSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLAS 380 (547) Q Consensus 305 lG-qtlts~~~ggs~--a-~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~ 380 (547) -- ..+.....++++ + ..+........-+...++.|+..||+.|+..--. ...+.+|.++..-..|.+.+++ T Consensus 290 gVPp~~lg~~~~~t~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~kll~~~e~-----~~~~~~fd~~~ll~~d~~~r~~ 364 (437) T protein:vir:10 290 RVPPFMVGHSEKSTSWGTGIEQQTLGFLTFTLRPWLTRIEQAARRSLLRPGER-----DQFYAEFSVEGLLRADSAGRAA 364 (437) T ss_pred CCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc-----CceEEEEechhhhccCHHHHHH Confidence 21 223333333333 2 2233334456667888888999999877543211 1122233333444678999999 Q ss_pred HHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccc-cCCccccccccccccchhhhhhcccch Q lcl|NC_020866. 381 SLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGA-GDLKTAERKLKPLSGQADLSRGDLSLN 459 (547) Q Consensus 381 ~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (547) ++.++++.|+ ++.+++|+.+|+|+-+.+++.+.......+........+.. ...........+.... T Consensus 365 ~~~~~~~~G~-~T~NE~R~~~gl~pi~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------- 432 (437) T protein:vir:10 365 FYSTMTQNGL-MTRDECRAKENLPPMGGNAAVLTVQSALLPIDKLGEHTTATAAQDALKAWLYQEEKTR----------- 432 (437) T ss_pred HHHHHHhCCC-cCHHHHHHHhCCCCCCCCcceEeecCcccchhhccCcCCCcchhccccccCCCCCCCC----------- Confidence 9999999998 99999999999987555555443222211111100000000 0000000000000000 Q ss_pred hhhcccccccchhH Q lcl|NC_020866. 
460 SEMPSTGQKTGGAE 473 (547) Q Consensus 460 ~~~~~~~~~~~~~~ 473 (547) .+++. T Consensus 433 ---------~~~e~ 437 (437) T protein:vir:10 433 ---------ATQER 437 (437) T ss_pred ---------ccccC Confidence 00000 No 47 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=99.86 E-value=9.2e-21 Score=130.43 Aligned_cols=394 Identities=12% Similarity=0.043 Sum_probs=232.1 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhh--ccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYP--GDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~--~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |+ +..|+.+-++-+-...+.++.. +..++..... ..++++ ....+-+.| T Consensus 1 ~~-~~~~~~~~k~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~v~~-----------------------~~a~~~~~V 51 (409) T protein:vir:96 1 MA-KENIVTRIKKKLIDNWIDQSAS-----KLYDFSPWKNKSFWGVIN-----------------------NTLETNETI 51 (409) T ss_pred Cc-cccchhhhhhHHhhhhhccccc-----cccccccccCccccccch-----------------------hhHhhhHHH Confidence 88 4666665555443333333332 1111110000 011111 001145779 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHH-hCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeE Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWL-KREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLI 156 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l-~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~ 156 (547) .+|++.+...|.+++|.+.-.... .+..+.+.+...= ..+...++++.++ +.+.+|-+++++++...| .+..|. 
T Consensus 52 ~~ci~~ia~~ia~lp~~~~~~~~~-~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G---~~~~L~ 127 (409) T protein:vir:96 52 FSAITKLSNSMASLPLKMYEDYKV-VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH---QPSKLF 127 (409) T ss_pred HHHHHHHHHhhhhCceEEeecccc-cchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCC---cEEEEE Confidence 999999999999999999643322 2222333332211 1234566666666 567899999999875433 367899 Q ss_pred eeccceeEecccCCcc---eeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_020866. 157 WRDPRHFRFQKSDLAT---PLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYG 233 (547) Q Consensus 157 ~~~~~~~~~~~~~~~~---~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G 233 (547) +++|.++.+..++... +.....++....+++...|++++....+..+|.|.+..+....-......+ +. |-.... T Consensus 128 ~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~~evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~-~~-~~~~~~ 205 (409) T protein:vir:96 128 LLNPDVVEMLIENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRT-FN-LTEMQK 205 (409) T ss_pred EEcCceeEEEEeCCCcEEEEEEEcCCceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHH-HH-HHhcCC Confidence 9999988765433222 223344455667888888777766566778899999887655444433333 32 322223 Q ss_pred CCeEEEeCCCCCCHHHHHHHHHHHHHHhcC--eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-chhc Q lcl|NC_020866. 234 QPLRIGKWQPGASEQDKDTLFRAVANIAGD--CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-QTTT 310 (547) Q Consensus 234 ~P~~vgk~~~~~~~~~~~~l~~al~~~~~~--~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-qtlt 310 (547) .|..+.+++...++++++++.+.+.+..++ ..++++.|++++-+..+. ....|.+..++..++|+++.-- ..+. 
T Consensus 206 ~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~~~Ia~~fgVPp~~l 282 (409) T protein:vir:96 206 PDSFMLKYGSNVSTEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKY---VSEDIVASENLTRERVANVFQLPSIFL 282 (409) T ss_pred CceeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCCh---hHHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 345677788888999999999888876554 467888888887664322 2224677777888899888421 2233 Q ss_pred cCccccccchhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEec--CccccCHHHHHHHHHHHHh Q lcl|NC_020866. 311 TDAVAGGHAVSQEHRL-VQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIA--RPKTEDLTKLASSLDTLVR 387 (547) Q Consensus 311 s~~~ggs~a~~~vh~~-v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~--~~~~ed~~~~a~~~~~l~~ 387 (547) ....+++++..+.+.. ....-+...++.|++.||+.|+...- . . ..-+|+|+ .....|.+.++++++++++ T Consensus 283 g~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~---~-~--~g~~i~fd~~~ll~~d~~~~~e~~~~~~~ 356 (409) T protein:vir:96 283 NARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTD---R-E--KNRYFKFNVKSYLRADSATQAEVYFKAVR 356 (409) T ss_pred CCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccc---c-c--CcceEEeechhhhccCHHHHHHHHHHHHh Confidence 3333456665554443 44566888889999999988765321 1 1 11245554 3345689999999999999 Q ss_pred CCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccc-cc-cccccCCccccc Q lcl|NC_020866. 388 LGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPP-GM-DPGAGDLKTAER 440 (547) Q Consensus 388 ~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~ 440 (547) .|+ ++.+++|+.+|+|+-+.++..+..... .+...+. .. ....+..+..+. 
T Consensus 357 ~G~-~T~NE~R~~~g~~pi~ggD~~~~~~n~-~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 357 SGY-YTINDIREWEDLPPVEGGDKPLISGDL-YPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred CCC-CCHHHHHHHhCCCCCCCcceeeecccc-cccccchhhcccccCCCCCcCCC Confidence 997 999999999999876666665543221 1111110 00 001110110000 No 48 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=99.86 E-value=7.5e-21 Score=130.90 Aligned_cols=413 Identities=13% Similarity=0.054 Sum_probs=225.3 Q ss_pred CCCCC---eeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhcc-----CCCHHHHHHHHHhhcCCCHHHHHHHHHHHH Q lcl|NC_020866. 1 MAQKP---VLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGD-----GLNPLRLAQILRAADHGDPVRYLDLAEVIE 72 (547) Q Consensus 1 ma~~~---~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~-----~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~ 72 (547) |-=-+ -.+|+.+|.=.++.+ . +...|.....| ..+...+...+-.....+...+.. -..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~----~------~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~al- 68 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRKQSRKEL----V------VVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKD-IEAI- 68 (441) T ss_pred CccccCccccccccccccchhhh----h------ccccccccccccccCCCcchHHHHHHhcccCcccccccch-hhhh- Confidence 21000 012222221111100 0 00000000000 011111111211111111111100 0111 Q ss_pred hhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHH-hCC----CHHHHHHHHH-HhhhhcCeeeeeEEeec Q lcl|NC_020866. 73 ERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWL-KRE----ELQQEVFHIL-DCISKGYSFTEIVWNTS 146 (547) Q Consensus 73 ~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l-~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~ 146 (547) +-+.|.+|+..+-..|.++++++...+..... +-+..+| .++ ...++++.+. +.+.+|.+++++++... 
T Consensus 69 -~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~ 143 (441) T protein:vir:94 69 -RHSDIFTAVMMIASDLARMPIRVTVNGQINYS----DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT 143 (441) T ss_pred -ccHHHHHHHHHHHHhhccCceeeecCcccccc----chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCC Confidence 46779999999999999999999753321111 1222333 234 3455666655 46789999999987543 Q ss_pred CCeeeEEeeEeeccceeEecccCCcceeecc----CC--CCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHH Q lcl|NC_020866. 147 EGQYFPDKLIWRDPRHFRFQKSDLATPLLLD----DN--GAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAF 220 (547) Q Consensus 147 ~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----~~--~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~ 220 (547) | .|..|.+++|..+.+..++.+.+.... .. +....+++...|++++...+ ..+|.|.+..+....-.-.. T Consensus 144 -G--~~~~L~~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~~d-g~~G~spl~~~~~~i~~~~~ 219 (441) T protein:vir:94 144 -G--EPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLD-GINGLSLLDTLSRTIESDNN 219 (441) T ss_pred -C--cEEEEEEEcCceeEEEECCCccEEEEEEEeccCCceeEEEEccccEEEeccCCCC-CccccCHHHHHHHHHHHHHH Confidence 3 367899999998877655444433221 11 22246788888777765544 47999999999988888888 Q ss_pred HHHHHHHHHHHcCCCeEEEeCCCCC-CHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHH Q lcl|NC_020866. 221 TARDWAIFTQTYGQPLRIGKWQPGA-SEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRIN 294 (547) Q Consensus 221 ~~~~w~~f~E~~G~P~~vgk~~~~~-~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~ 294 (547) ..++...|...-+.|--|.+++... +++.++++.+.+.+.-+ +..+|+|.|++++=+..+. 
..-.|.+..+ T Consensus 220 ~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~---~d~q~~e~~~ 296 (441) T protein:vir:94 220 GKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDT---EVLKLIRENK 296 (441) T ss_pred HHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCCh---hHHHHHHHHH Confidence 8888889999999999988988765 45667777777766532 2358999999877664322 1123777778 Q ss_pred HHHHHHHHHHhcc-hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCcccc Q lcl|NC_020866. 295 HLDQQISKAVLGQ-TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTE 373 (547) Q Consensus 295 ~~d~~Isk~ilGq-tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~e 373 (547) +..++|+.+.--. .+.... +++.+..+.. ..+..-+...++.|+..||+.|.... ...+.+|.++..... T Consensus 297 ~~~~~Ia~~fgVPp~~lg~~-~~~~s~~q~~-~~~~~tl~P~~~~ie~eln~kl~~~~-------~~~~~~fd~~~llr~ 367 (441) T protein:vir:94 297 SSTREIAGVFGIPLHKFGIE-TANMSITDAN-LDYLSTLKPYITCVCAELNFKFNDEY-------VNREFKFDTTEIRVV 367 (441) T ss_pred HhHHHHHHHhCCCHHHcCCC-CCCccHHHHH-HHHHHHHHHHHHHHHHHHhhhccccc-------cCceEEeechhhhcc Confidence 8889999884211 222221 1222222222 22334677888889998988764321 112224444444567 Q ss_pred CHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcch-hcccccccccccccccccccccCCccccccccccccc Q lcl|NC_020866. 374 DLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAK-LLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQ 448 (547) Q Consensus 374 d~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (547) |.+.+++.++++++.|+ ++.+++|+.+|+|+-+.++. ++.......+....+...............+.... . 
T Consensus 368 D~~~~~~~~~~~i~~G~-~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~~-e 441 (441) T protein:vir:94 368 DEKTQAEIDKINIDSGK-MNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEEN-E 441 (441) T ss_pred CHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCCCCC-C Confidence 89999999999999998 99999999999987555553 22222222111111100000000000000000000 0 No 49 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=99.86 E-value=7.5e-21 Score=130.90 Aligned_cols=413 Identities=13% Similarity=0.054 Sum_probs=225.3 Q ss_pred CCCCC---eeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhcc-----CCCHHHHHHHHHhhcCCCHHHHHHHHHHHH Q lcl|NC_020866. 1 MAQKP---VLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGD-----GLNPLRLAQILRAADHGDPVRYLDLAEVIE 72 (547) Q Consensus 1 ma~~~---~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~-----~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~ 72 (547) |-=-+ -.+|+.+|.=.++.+ . +...|.....| ..+...+...+-.....+...+.. -..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~----~------~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~al- 68 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRKQSRKEL----V------VVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKD-IEAI- 68 (441) T ss_pred CccccCccccccccccccchhhh----h------ccccccccccccccCCCcchHHHHHHhcccCcccccccch-hhhh- Confidence 21000 012222221111100 0 00000000000 011111111211111111111100 0111 Q ss_pred hhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHH-hCC----CHHHHHHHHH-HhhhhcCeeeeeEEeec Q lcl|NC_020866. 73 ERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWL-KRE----ELQQEVFHIL-DCISKGYSFTEIVWNTS 146 (547) Q Consensus 73 ~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l-~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~ 146 (547) +-+.|.+|+..+-..|.++++++...+..... +-+..+| .++ ...++++.+. +.+.+|.+++++++... 
T Consensus 69 -~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~ 143 (441) T protein:vir:79 69 -RHSDIFTAVMMIASDLARMPIRVTVNGQINYS----DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT 143 (441) T ss_pred -ccHHHHHHHHHHHHhhccCceeeecCcccccc----chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCC Confidence 46779999999999999999999753321111 1222333 234 3455666655 46789999999987543 Q ss_pred CCeeeEEeeEeeccceeEecccCCcceeecc----CC--CCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHH Q lcl|NC_020866. 147 EGQYFPDKLIWRDPRHFRFQKSDLATPLLLD----DN--GAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAF 220 (547) Q Consensus 147 ~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----~~--~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~ 220 (547) | .|..|.+++|..+.+..++.+.+.... .. +....+++...|++++...+ ..+|.|.+..+....-.-.. T Consensus 144 -G--~~~~L~~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~~d-g~~G~spl~~~~~~i~~~~~ 219 (441) T protein:vir:79 144 -G--EPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLD-GINGLSLLDTLSRTIESDNN 219 (441) T ss_pred -C--cEEEEEEEcCceeEEEECCCccEEEEEEEeccCCceeEEEEccccEEEeccCCCC-CccccCHHHHHHHHHHHHHH Confidence 3 367899999998877655444433221 11 22246788888777765544 47999999999988888888 Q ss_pred HHHHHHHHHHHcCCCeEEEeCCCCC-CHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHH Q lcl|NC_020866. 221 TARDWAIFTQTYGQPLRIGKWQPGA-SEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRIN 294 (547) Q Consensus 221 ~~~~w~~f~E~~G~P~~vgk~~~~~-~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~ 294 (547) ..++...|...-+.|--|.+++... +++.++++.+.+.+.-+ +..+|+|.|++++=+..+. 
..-.|.+..+ T Consensus 220 ~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~---~d~q~~e~~~ 296 (441) T protein:vir:79 220 GKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDT---EVLKLIRENK 296 (441) T ss_pred HHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCCh---hHHHHHHHHH Confidence 8888889999999999988988765 45667777777766532 2358999999877664322 1123777778 Q ss_pred HHHHHHHHHHhcc-hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCcccc Q lcl|NC_020866. 295 HLDQQISKAVLGQ-TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTE 373 (547) Q Consensus 295 ~~d~~Isk~ilGq-tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~e 373 (547) +..++|+.+.--. .+.... +++.+..+.. ..+..-+...++.|+..||+.|.... ...+.+|.++..... T Consensus 297 ~~~~~Ia~~fgVPp~~lg~~-~~~~s~~q~~-~~~~~tl~P~~~~ie~eln~kl~~~~-------~~~~~~fd~~~llr~ 367 (441) T protein:vir:79 297 SSTREIAGVFGIPLHKFGIE-TANMSITDAN-LDYLSTLKPYITCVCAELNFKFNDEY-------VNREFKFDTTEIRVV 367 (441) T ss_pred HhHHHHHHHhCCCHHHcCCC-CCCccHHHHH-HHHHHHHHHHHHHHHHHHhhhccccc-------cCceEEeechhhhcc Confidence 8889999884211 222221 1222222222 22334677888889998988764321 112224444444567 Q ss_pred CHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcch-hcccccccccccccccccccccCCccccccccccccc Q lcl|NC_020866. 374 DLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAK-LLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQ 448 (547) Q Consensus 374 d~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (547) |.+.+++.++++++.|+ ++.+++|+.+|+|+-+.++. ++.......+....+...............+.... . 
T Consensus 368 D~~~~~~~~~~~i~~G~-~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~~-e 441 (441) T protein:vir:79 368 DEKTQAEIDKINIDSGK-MNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEEN-E 441 (441) T ss_pred CHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCCCCC-C Confidence 89999999999999998 99999999999987555553 22222222111111100000000000000000000 0 No 50 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=99.86 E-value=7.8e-21 Score=130.84 Aligned_cols=394 Identities=13% Similarity=0.085 Sum_probs=224.7 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhcc-CCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGD-GLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYL 79 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~-~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~ 79 (547) |. |. .+.-++ ..+.... .....+...++. +.. +. .+.. -..| +-+.|. T Consensus 1 Mg----~f---~~~~~r-----~~~~~~~-~~~~~~~~~~~~~~~~-------------~~--~~~~-~~al--~~~~v~ 49 (416) T protein:vir:81 1 MG----IF---YKNEKR-----DLQYNED-DLQMMVQTLPGFQGTK-------------LR--QYKD-IEAI--RHSDIF 49 (416) T ss_pred CC----cc---cccccc-----cccCCCc-chhHHHHHhccccccC-------------cc--ccch-hhhh--cchHHH Confidence 65 43 222111 1111111 111111111111 000 00 0000 0011 456799 Q ss_pred HHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-CC----CHHHHHHHHHH-hhhhcCeeeeeEEeecCCeeeEE Q lcl|NC_020866. 80 GVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-RE----ELQQEVFHILD-CISKGYSFTEIVWNTSEGQYFPD 153 (547) Q Consensus 80 s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l~-a~~~G~sv~Ei~w~~~~g~~~~~ 153 (547) +|++.+-..+.+++|++...+....+ ..+..+|. ++ ...++++.+.. .+.+|.+++++++... | .+. 
T Consensus 50 ~cv~~Ia~~iA~~p~~~~~~~~~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-G--~~~ 122 (416) T protein:vir:81 50 TAVMMIASDLARMPIRVTVNGQINYS----DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT-G--EPM 122 (416) T ss_pred HHHHHHHHhhccCceEEecCcccccc----chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEE Confidence 99999999999999999754322211 22333343 34 35566666664 5689999999987533 3 367 Q ss_pred eeEeeccceeEecccCCcceeecc----CCC--CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 154 KLIWRDPRHFRFQKSDLATPLLLD----DNG--AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAI 227 (547) Q Consensus 154 ~l~~~~~~~~~~~~~~~~~~~~~~----~~~--~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 227 (547) .|.+++|.++.+..++.++++... ..+ ....+++...|++++.. .+..+|.|.+..+....-.-....++... T Consensus 123 ~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~evihir~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~ 201 (416) T protein:vir:81 123 NLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYS-LDGINGLSLLDTLSRTIESDNNGKDFLNN 201 (416) T ss_pred EEEEEcCceeEEEECCCccEEEEEEEecCCCceeEEEEccccEEEeccCC-CCCccccCHHHHHHHHHHHHHHHHHHHHH Confidence 899999999877655444433221 122 22468888887777654 44589999999999988888888888899 Q ss_pred HHHHcCCCeEEEeCCCCC-CHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHH Q lcl|NC_020866. 228 FTQTYGQPLRIGKWQPGA-SEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQIS 301 (547) Q Consensus 228 f~E~~G~P~~vgk~~~~~-~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Is 301 (547) |....+.|--|.+++... +++.++++.+.+.+..+ +..+|++.|++++-+..+. 
..-.|.+..++..++|+ T Consensus 202 ~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~~~Ia 278 (416) T protein:vir:81 202 FLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDT---EVLKLIRENKSSTREIA 278 (416) T ss_pred HHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCH---HHHHHHHHHHHHHHHHH Confidence 999999999999998665 45567777776665432 2368899998877654322 12236777788888999 Q ss_pred HHHhcc--hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHH Q lcl|NC_020866. 302 KAVLGQ--TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLA 379 (547) Q Consensus 302 k~ilGq--tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a 379 (547) .+. |- .+.... +++.+..+... .+..-+...++.|+..||+.|.... ....-+|.++.....|.+.++ T Consensus 279 ~~f-gVPp~~lg~~-~~~~~~~~~~~-~~~~~l~P~~~~ie~~ln~~l~~~~-------~~~~~~f~~~~l~~~D~~~~~ 348 (416) T protein:vir:81 279 GVF-GIPLHKFGIE-TANMSITDANL-DYLSTLKPYITCVCAELNFKFNDEY-------VNREFKFDTTEIRVVDEKTQA 348 (416) T ss_pred HHh-CCCHHHcCCC-CCCccHHHHHH-HHHHHHHHHHHHHHHHHhhhccccc-------cCceEEEechhhhccCHHHHH Confidence 884 32 222222 12222222222 2334677888889998988764321 112224444444567899999 Q ss_pred HHHHHHHhCCCCCCHHHHHHHhCCCCCCCcch-hcccccccccccccccccccccCCccccccccccccch Q lcl|NC_020866. 380 SSLDTLVRLGMEIEENEVRSRFGFSQPKPGAK-LLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQA 449 (547) Q Consensus 380 ~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (547) +.++++++.|+ ++.+++|+.+|+|+-+.++. ++.......+....+..............++... +. 
T Consensus 349 ~~~~~~~~~G~-~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~--n~ 416 (416) T protein:vir:81 349 EIDKINIDSGK-MNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEE--NE 416 (416) T ss_pred HHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCcceEeecccccccccccccCcccccccccccCCCCC--CC Confidence 99999999997 99999999999987555543 2222211111111111000000000000000000 00 No 51 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=99.86 E-value=7.8e-21 Score=130.84 Aligned_cols=394 Identities=13% Similarity=0.085 Sum_probs=224.7 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhcc-CCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGD-GLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYL 79 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~-~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~ 79 (547) |. |. .+.-++ ..+.... .....+...++. +.. +. .+.. -..| +-+.|. T Consensus 1 Mg----~f---~~~~~r-----~~~~~~~-~~~~~~~~~~~~~~~~-------------~~--~~~~-~~al--~~~~v~ 49 (416) T protein:vir:45 1 MG----IF---YKNEKR-----DLQYNED-DLQMMVQTLPGFQGTK-------------LR--QYKD-IEAI--RHSDIF 49 (416) T ss_pred CC----cc---cccccc-----cccCCCc-chhHHHHHhccccccC-------------cc--ccch-hhhh--cchHHH Confidence 65 43 222111 1111111 111111111111 000 00 0000 0011 456799 Q ss_pred HHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-CC----CHHHHHHHHHH-hhhhcCeeeeeEEeecCCeeeEE Q lcl|NC_020866. 80 GVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-RE----ELQQEVFHILD-CISKGYSFTEIVWNTSEGQYFPD 153 (547) Q Consensus 80 s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l~-a~~~G~sv~Ei~w~~~~g~~~~~ 153 (547) +|++.+-..+.+++|++...+....+ ..+..+|. ++ ...++++.+.. .+.+|.+++++++... | .+. 
T Consensus 50 ~cv~~Ia~~iA~~p~~~~~~~~~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-G--~~~ 122 (416) T protein:vir:45 50 TAVMMIASDLARMPIRVTVNGQINYS----DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT-G--EPM 122 (416) T ss_pred HHHHHHHHhhccCceEEecCcccccc----chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEE Confidence 99999999999999999754322211 22333343 34 35566666664 5689999999987533 3 367 Q ss_pred eeEeeccceeEecccCCcceeecc----CCC--CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 154 KLIWRDPRHFRFQKSDLATPLLLD----DNG--AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAI 227 (547) Q Consensus 154 ~l~~~~~~~~~~~~~~~~~~~~~~----~~~--~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 227 (547) .|.+++|.++.+..++.++++... ..+ ....+++...|++++.. .+..+|.|.+..+....-.-....++... T Consensus 123 ~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~evihir~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~ 201 (416) T protein:vir:45 123 NLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYS-LDGINGLSLLDTLSRTIESDNNGKDFLNN 201 (416) T ss_pred EEEEEcCceeEEEECCCccEEEEEEEecCCCceeEEEEccccEEEeccCC-CCCccccCHHHHHHHHHHHHHHHHHHHHH Confidence 899999999877655444433221 122 22468888887777654 44589999999999988888888888899 Q ss_pred HHHHcCCCeEEEeCCCCC-CHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHH Q lcl|NC_020866. 228 FTQTYGQPLRIGKWQPGA-SEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQIS 301 (547) Q Consensus 228 f~E~~G~P~~vgk~~~~~-~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Is 301 (547) |....+.|--|.+++... +++.++++.+.+.+..+ +..+|++.|++++-+..+. 
..-.|.+..++..++|+ T Consensus 202 ~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~~~Ia 278 (416) T protein:vir:45 202 FLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDT---EVLKLIRENKSSTREIA 278 (416) T ss_pred HHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCH---HHHHHHHHHHHHHHHHH Confidence 999999999999998665 45567777776665432 2368899998877654322 12236777788888999 Q ss_pred HHHhcc--hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHH Q lcl|NC_020866. 302 KAVLGQ--TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLA 379 (547) Q Consensus 302 k~ilGq--tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a 379 (547) .+. |- .+.... +++.+..+... .+..-+...++.|+..||+.|.... ....-+|.++.....|.+.++ T Consensus 279 ~~f-gVPp~~lg~~-~~~~~~~~~~~-~~~~~l~P~~~~ie~~ln~~l~~~~-------~~~~~~f~~~~l~~~D~~~~~ 348 (416) T protein:vir:45 279 GVF-GIPLHKFGIE-TANMSITDANL-DYLSTLKPYITCVCAELNFKFNDEY-------VNREFKFDTTEIRVVDEKTQA 348 (416) T ss_pred HHh-CCCHHHcCCC-CCCccHHHHHH-HHHHHHHHHHHHHHHHHhhhccccc-------cCceEEEechhhhccCHHHHH Confidence 884 32 222222 12222222222 2334677888889998988764321 112224444444567899999 Q ss_pred HHHHHHHhCCCCCCHHHHHHHhCCCCCCCcch-hcccccccccccccccccccccCCccccccccccccch Q lcl|NC_020866. 380 SSLDTLVRLGMEIEENEVRSRFGFSQPKPGAK-LLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQA 449 (547) Q Consensus 380 ~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (547) +.++++++.|+ ++.+++|+.+|+|+-+.++. ++.......+....+..............++... +. 
T Consensus 349 ~~~~~~~~~G~-~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~--n~ 416 (416) T protein:vir:45 349 EIDKINIDSGK-MNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEE--NE 416 (416) T ss_pred HHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCcceEeecccccccccccccCcccccccccccCCCCC--CC Confidence 99999999997 99999999999987555543 2222211111111111000000000000000000 00 No 52 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.86 E-value=5.9e-20 Score=126.02 Aligned_cols=445 Identities=15% Similarity=0.111 Sum_probs=227.1 Q ss_pred CC----------CCCeeeCCCCC---------cccchhhhh-----hhh--hhhhhhhHHHHhhhhcc--CCCHHHHHHH Q lcl|NC_020866. 1 MA----------QKPVLLDRWGN---------PVQRAQLTE-----EVA--APTVTGVRSPLSGYPGD--GLNPLRLAQI 52 (547) Q Consensus 1 ma----------~~~~~~~~~g~---------~~~~~~l~~-----~~a--~~~~~~~~~~~~~~~~~--~l~p~~~~~i 52 (547) |- -.+.+++..+. ++....+++ +.| ++.+.. -.+-.++-.+ ..++.++..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~-~~~~~g~~~~~~~~~~~~l~~l 79 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGS-MSANPGFKTKPSIRNNQDLHGV 79 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhhe-eecccccccCCccCChhHHHHH Confidence 22 22222222222 111111111 111 000100 0000111000 0123222222 Q ss_pred HHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhc-----------CCceEecCC----CCHHHHHHHHHHHHHHh Q lcl|NC_020866. 53 LRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQ-----------LDITVEPAS----DDPQDVKIADMVRDWLK 117 (547) Q Consensus 53 lr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~-----------~~w~v~p~~----~~~~~~~~ae~v~~~l~ 117 (547) ++ .+. .-+.|.+|++.++..|.+ ..|.|...+ ....+....+.++++|. 
T Consensus 80 ~~---------------~~~-~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~ 143 (547) T protein:vir:63 80 LK---------------KFG-GNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIE 143 (547) T ss_pred HH---------------Hhh-cCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHH Confidence 22 222 348889999999988875 345554432 23444444556666665 Q ss_pred CCC---------HHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCc-------ceeeccCCC Q lcl|NC_020866. 118 REE---------LQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLA-------TPLLLDDNG 180 (547) Q Consensus 118 ~~~---------~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~-------~~~~~~~~~ 180 (547) +++ |.++++.++ +.+.+|.+++|+++...| .|..|.+++|.++++..+... +++...+++ T Consensus 144 ~pn~~~~p~~~s~~~f~~~lv~d~ll~Gn~~~~i~rd~~G---~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~ 220 (547) T protein:vir:63 144 KTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQ---SMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQK 220 (547) T ss_pred hhCCCCCCccchHHHHHHHHHHHHHhhCCEEEEEEECCCC---cEEEEEEecCceeEEEECCccccccCceEEEEEcCCc Confidence 542 557777777 578899999999986543 367899999998876533222 233444455 Q ss_pred CceeeccCceEEEeecCCcC---ccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe--CCCCCCHHHHHHHHH Q lcl|NC_020866. 
181 AEVPLPAFRFIFADIPAKSG---ILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGK--WQPGASEQDKDTLFR 255 (547) Q Consensus 181 ~g~~l~~~k~i~~~~~~~~~---~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk--~~~~~~~~~~~~l~~ 255 (547) ....+++...|++++...++ .+||.+.+..+......-....++...|.+..+.|--+-+ .+...++++++.+.+ T Consensus 221 ~~~~~~~~eiih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~ 300 (547) T protein:vir:63 221 IVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKR 300 (547) T ss_pred EEEEeccccEEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHH Confidence 55678888888777665443 5789999999999998889999999999999888864444 344568888888888 Q ss_pred HHHHHhc---Cee--EEe-cCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH------hcchh---ccCccccc--c Q lcl|NC_020866. 256 AVANIAG---DCA--AII-PETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAV------LGQTT---TTDAVAGG--H 318 (547) Q Consensus 256 al~~~~~---~~~--~vi-p~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i------lGqtl---ts~~~ggs--~ 318 (547) .+.+.-+ +++ .|+ ..|++ +...+.. ..-..|.+..++..++|+.+. +|..- ++.+.++| + T Consensus 301 ~~~~~~~G~~nagk~~vl~~~g~~--~~~l~~~-~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~ 377 (547) T protein:vir:63 301 EWKNSLSGINGSWQIPVVSAEDVK--FVNMTPS-ARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNE 377 (547) T ss_pred HHHHHhcCcccccccccccCCCce--EEEcCCC-hhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccch Confidence 8876522 222 344 34544 4443322 222247777788889999884 33211 11112222 2 Q ss_pred chhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHH Q lcl|NC_020866. 319 AVSQEH-RLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEV 397 (547) Q Consensus 319 a~~~vh-~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~ 397 (547) +-.+.. ......-+...++.|+..||+.|++. |+. .-+|.|+.....+....++. 
.++...|+ ++.+++ T Consensus 378 sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L~~~-----~~~---~~~~~f~~~~~~~~~~~~~~-~~~~~~g~-lT~NE~ 447 (547) T protein:vir:63 378 GNSAEKNQASKNKGLQPLLGFIEDFINKHIVAE-----FGD---KYTFQFVGGDIKSELESVKI-LAEKAKVA-MTVNEV 447 (547) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----cCC---ceEEEeeccccccHHHHHHH-HHHHhCCC-cCHHHH Confidence 222222 23455678889999999999987653 221 23678877777776665554 45667787 899999 Q ss_pred HHHhCCCCC-CCcchhccccccccc-----ccccccc---------cccccCCccccccccccccchhhhhhcccchhhh Q lcl|NC_020866. 398 RSRFGFSQP-KPGAKLLRPLENPAT-----QAAPPGM---------DPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEM 462 (547) Q Consensus 398 ~e~~Glp~p-~~~e~~~~~~~~~~~-----~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (547) |+.+|+|+. +.++.++.+...... ...+... ....+.....++...+.... .+... T Consensus 448 R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~ 518 (547) T protein:vir:63 448 RKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKD---------TTGDI 518 (547) T ss_pred HHHhCCCCCCCCCceeecccccccccccccccCCccccchhhccccccccCCCCCCCCCCCCCCcc---------cCCCc Confidence 999999763 344444322111000 0000000 00000000000000000000 00000 Q ss_pred cccccccchhH---HHHHHHH-HHHHHHH Q lcl|NC_020866. 463 PSTGQKTGGAE---IAALTDQ-LELEGQP 487 (547) Q Consensus 463 ~~~~~~~~~~~---~~~l~~~-~~~~~~~ 487 (547) .+......++- -...... -.++|.. 
T Consensus 519 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 547 (547) T protein:vir:63 519 GKDGQRKDKDNANAGKQGMKGDKPNDWQT 547 (547) T ss_pred CccccccCccccchhhhhcCCCCccccCC Confidence 00000000000 0000000 0111111 No 53 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=99.86 E-value=3.8e-21 Score=132.52 Aligned_cols=397 Identities=12% Similarity=0.023 Sum_probs=231.7 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhc---cCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPG---DGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPH 77 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~---~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~ 77 (547) |- .. ++ ..+.+..... .....+...+...++ ..+|+ ..+| +-+. T Consensus 1 m~-----~~---~~-~~~~~~~~~~-~~~~~~~~~~g~~~s~~~~~v~~---~~al--------------------~~~~ 47 (419) T protein:vir:80 1 MF-----FS---RQ-LLSNLGQTQP-GSGGWVSALLGSARSEAGQVVTP---ASAL--------------------SLTV 47 (419) T ss_pred CC-----cc---cc-cccccCcCCC-CcchhhHHhhcccccccCcccCh---HHhh--------------------ccHH Confidence 44 11 11 1111111111 011111122221111 11222 1111 3467 Q ss_pred HHHHHHHHHHHHhcCCceEecCCCC--HHHHHHHHHHHHHHh-C----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCe Q lcl|NC_020866. 78 YLGVIGTRKRSVSQLDITVEPASDD--PQDVKIADMVRDWLK-R----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQ 149 (547) Q Consensus 78 v~s~l~~Rk~~v~~~~w~v~p~~~~--~~~~~~ae~v~~~l~-~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~ 149 (547) |.+|++.+-..|.+++++|.-.+++ .+... .-+..+|. + +...++++.+. +.+.+|-+++++++... 
| T Consensus 48 v~~cv~~ia~~ia~lp~~~~~~~~~~~~~~~~--~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~-G- 123 (419) T protein:vir:80 48 LQNCVTLLAESIAQLPVELYERSGDDRKPATD--HPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQD-G- 123 (419) T ss_pred HHHHHHHHHHhhccCceEEEEecCCCcccccc--cHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C- Confidence 9999999999999999998643221 11111 11233332 2 34667777777 56789999999987543 3 Q ss_pred eeEEeeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 150 YFPDKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFT 229 (547) Q Consensus 150 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 229 (547) .+..|.+++|.++.+..++...++.+..+.. .++....++.++.. .+.+||.|.+..+....-.-....++...|. T Consensus 124 -~~~~L~~i~~~~v~i~~~~~~~~~y~~~~~~--~~~~~~i~h~~~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f 199 (419) T protein:vir:80 124 -VIQGLYPLDNEAVTVMKGPDLKPMYRVAGAD--PLPQRLVHHVRWMS-INGYTGLSPVLLHANAIGHAQAIQQYAGKSF 199 (419) T ss_pred -cEEEEEEecCceEEEEECCCceEEEEEcCcc--ccchhheEEecCCC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 3678999999998876666555554433322 35555554445544 4458999999999988888888888888999 Q ss_pred HHcCCCeEEEeCCCC----CCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHH Q lcl|NC_020866. 230 QTYGQPLRIGKWQPG----ASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQI 300 (547) Q Consensus 230 E~~G~P~~vgk~~~~----~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~I 300 (547) +.-|.|--+.+++.. .+++..+++.+.+.+..+ +..++++.|++++=+..+. 
....|.+..++..++| T Consensus 200 ~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~---~d~q~~e~~~~~~~~I 276 (419) T protein:vir:80 200 MNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMKFKPLSMTN---VDAALIDALRLSALDI 276 (419) T ss_pred hcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEeccCCh---hhHHHHHHHHHHHHHH Confidence 999999777776532 356667777777776533 2368899988877554322 2224777778888999 Q ss_pred HHHHhc-chhccCccccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHH Q lcl|NC_020866. 301 SKAVLG-QTTTTDAVAGGHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKL 378 (547) Q Consensus 301 sk~ilG-qtlts~~~ggs~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~ 378 (547) +++.-- ..+....++++++..+.+. .....-+.-.++.|+..||+.|+.+--.. ..+.+|..+.....|.+.+ T Consensus 277 a~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~~-----~~~i~fd~~~l~~~d~~~~ 351 (419) T protein:vir:80 277 ARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDLLLPSERK-----QYFIEYNLAGLLRGDQSSR 351 (419) T ss_pred HHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccccC-----CeEEEEechhhhccCHHHH Confidence 988421 2233343445565444443 34456688899999999998876442111 1122333334445789999 Q ss_pred HHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccc Q lcl|NC_020866. 379 ASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSL 458 (547) Q Consensus 379 a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (547) ++.+.++++.|+ ++.+++|+.+|+|+-+.+++.+.+... .....+. ..+...+.+ . 
T Consensus 352 ~~~~~~~~~~G~-~T~NE~R~~~g~~p~~gGD~~~~~~n~-~~~~~~~-~~~~~~~~~--------~------------- 407 (419) T protein:vir:80 352 YAAYAVGRQWGW-LSINDIRRLENMPPVKGGDIYLSPMNM-VDASKPQ-PIPMGKTEP--------T------------- 407 (419) T ss_pred HHHHHHHHhCCC-cCHHHHHHHhCCCCCCCcceeeecccc-ccccccc-cccCCCCCc--------h------------- Confidence 999999999998 999999999999977667766544221 1111110 001100000 0 Q ss_pred hhhhcccccccchhHHHHHHHHHHH Q lcl|NC_020866. 459 NSEMPSTGQKTGGAEIAALTDQLEL 483 (547) Q Consensus 459 ~~~~~~~~~~~~~~~~~~l~~~~~~ 483 (547) ...++.+ +++++ T Consensus 408 ------------~~~~~~~-~~~l~ 419 (419) T protein:vir:80 408 ------------KAALDEI-GRILS 419 (419) T ss_pred ------------hhhHHHH-HhhcC Confidence 1111221 12211 No 54 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=99.86 E-value=2.2e-20 Score=128.39 Aligned_cols=395 Identities=11% Similarity=0.001 Sum_probs=236.7 Q ss_pred CCCCCeeeC--CCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKPVLLD--RWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~--~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |+...+|.+ .++++-... .... ........|....++.++..-+ ..+... .++ +-+.| T Consensus 4 ~~~~~~~~~m~~F~~~~~~~--~~~~-------~~~~~~~~~~~~~~~~~~~~~~---~~~~~~-------~~~-~~~~v 63 (413) T protein:vir:96 4 VSEIRKDKNLKFFNNKRSPT--EESK-------AKDEIPKAPQVVMTLPNFFKEL---ISDGYT-------KLS-DSPEV 63 (413) T ss_pred cchhhhhhcCCccccCCCcc--hhhh-------hhccccccccccccchhhHhhh---ccchhH-------HHh-hchHH Confidence 777666643 344422111 1100 0011111122222332222111 111111 112 46889 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHH-hC----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeE Q lcl|NC_020866. 
79 LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWL-KR----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFP 152 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l-~~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~ 152 (547) .+|+..+...|.+++|.+...+++... +....+..+| .+ +.+.++++.+. +.+.+|.++++++...+++. + T Consensus 64 ~~cI~~ia~~ia~~~~~~~~~~~~~~~-~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~--~ 140 (413) T protein:vir:96 64 RMAVDCIADLVSNMTIQLMQNGETGDK-RIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDK--I 140 (413) T ss_pred HHHHHHHHHhhccCceEEEEecCCCcc-ccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCc--e Confidence 999999999999999999654332111 1112233333 23 34667777777 46689999999987655543 4 Q ss_pred EeeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCc-CccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 153 DKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKS-GILLRSGLARVAAWGWMFKAFTARDWAIFTQT 231 (547) Q Consensus 153 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~-~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~ 231 (547) ..|.++++.++++..+++...+... ..+..+++...|++++.... +..+|.|.+..+....-.-....++...|... T Consensus 141 ~~L~~l~~~~v~~~~~~~~~~y~~~--~~~~~~~~~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 218 (413) T protein:vir:96 141 IGLTPISPYKVTFNVSDDDLDYSIT--FDNKEYDPSTLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMAS 218 (413) T ss_pred EEEEEecCceeEEEEcCCeEEEEEe--ecCcEEchhhEEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 6788999998887665544333222 23346778888777765443 44579999999999988888899999999999 Q ss_pred cCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcC-----eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 232 YGQPLRIGKWQPGASEQDKDTLFRAVANIAGD-----CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG 306 (547) Q Consensus 232 ~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~-----~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG 306 (547) .|.|--+.+++...++++++++.+.+.+.-++ ..+|++.|... 
+.+....+.....|.+..++..++|+++.-- T Consensus 219 g~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~-~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgV 297 (413) T protein:vir:96 219 EYMPNLIVSVDSDSDELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVN-VQQIKPLTLNDLAINDAVTLDKKTVAGIFGV 297 (413) T ss_pred cCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCceeeecCCccc-ccccccCChhHHHHHHHHHHHHHHHHHHhCC Confidence 99998888998888999999999888875332 33677777542 2221111111223666667888889988421 Q ss_pred -chhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHH Q lcl|NC_020866. 307 -QTTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTL 385 (547) Q Consensus 307 -qtlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l 385 (547) ..+.+.++ ++ .+........-+...++.|++.||+.|++ ...+.+|.+......|.+.++++++++ T Consensus 298 P~~~lg~~~-~~---~~~~~~~~~~~l~P~~~~ie~~ln~~ll~---------~~~~~~fd~~~ll~~d~~~~~~~~~~~ 364 (413) T protein:vir:96 298 PAFLLGVGT-YN---KDEFNNFINTKIMSIAQVIQQTYNKLIVE---------EDMYFSLNPRSLYNYSLTEMVSAGAQM 364 (413) T ss_pred CHHHcCCCc-ch---HHHHHHHHHHHHHHHHHHHHHHHHHhhCC---------CCcEEEEechhhhccCHHHHHHHHHHH Confidence 12222211 22 22233455567888889999999987643 223334444455577999999999999 Q ss_pred HhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCC Q lcl|NC_020866. 
386 VRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDL 435 (547) Q Consensus 386 ~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (547) ++.|+ ++.+++|+.+|+|+-+.++..+.+.................+.+ T Consensus 365 ~~~G~-~t~NE~R~~~g~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 365 TQLNA-LRRNEFRNWVGMPPDAEMDDLLVLENYLQQKDLVNQKKLIQDET 413 (413) T ss_pred HhCCC-cCHHHHHHHhCCCCCCCcceeeecccccchhhcccccCCCCCCC Confidence 99998 99999999999986555555443322111111111111100000 No 55 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=99.85 E-value=9.7e-20 Score=124.83 Aligned_cols=481 Identities=12% Similarity=0.064 Sum_probs=235.0 Q ss_pred hhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHH Q lcl|NC_020866. 29 VTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKI 108 (547) Q Consensus 29 ~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ 108 (547) ++.+-+...+...+..+ ....+ .. .- . .+-+.|.+|+...-..|.+++|.+...++... ++ T Consensus 1 ~~~~~~~~g~~~~~~~~--~~~~~---------~~--~~---~-~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~-~~- 61 (723) T protein:vir:94 1 MTTFPSGAGGWNAWSAD--SVFGN---------GA--KG---W-SNSAVAYRCISMLANNAASVDLVVRGPDGELD-EL- 61 (723) T ss_pred CcccccCCCcccccccc--ccccc---------cH--HH---H-hhhHHHHHHHHHHHHhhccceeEEEcCCCccc-hh- Confidence 11111111111111100 00000 00 00 1 15688999999999999999999975433221 11 Q ss_pred HHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCc---------ce Q lcl|NC_020866. 109 ADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLA---------TP 173 (547) Q Consensus 109 ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~---------~~ 173 (547) .-+..+|+ ++ ...++.+.++ +.+.+|-+.+++++.-++....|..+.+++++...+...... 
.+ T Consensus 62 -~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y 140 (723) T protein:vir:94 62 -HPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGY 140 (723) T ss_pred -hHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEE Confidence 12333443 33 4566777666 577899999999875333334577888888876554322111 11 Q ss_pred eeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHH Q lcl|NC_020866. 174 LLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTL 253 (547) Q Consensus 174 ~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l 253 (547) ..+...+.-++++....|++++....+..||.|.+..+....-.-....++...|...-+.|--|-+++ ..++++++++ T Consensus 141 ~~~~~~G~~~~~~~~dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~-~l~~e~~~~~ 219 (723) T protein:vir:94 141 VIERTDGVRVPVLADEMLWLRFSDPYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLG-DMDEQTFTKT 219 (723) T ss_pred EEEecCceeEEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-CCCHHHHHHH Confidence 222233444678888887777666667789999999999888888888888888988888897777876 5788888888 Q ss_pred HHHHHHHhc-----CeeEEec----------CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc-hhccCccccc Q lcl|NC_020866. 254 FRAVANIAG-----DCAAIIP----------ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ-TTTTDAVAGG 317 (547) Q Consensus 254 ~~al~~~~~-----~~~~vip----------~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq-tlts~~~ggs 317 (547) .+.+.+.-+ +-.+|++ +|++++-+..+. . --.|.+..++..++|+++.--. 
.+..+ +++ T Consensus 220 ~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~--~-D~q~le~r~~~~~eIa~afgVPp~~i~~--~st 294 (723) T protein:vir:94 220 VAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSP--A-EMDYINSRMHSAEEVMLAFGIRKDALLG--GST 294 (723) T ss_pred HHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCH--H-HHHHHHHHHHhHHHHHHHhCCChhHcCC--CCC Confidence 888876532 2235554 465555443221 1 1136677778888999874222 22222 222 Q ss_pred cch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCcc--ccCHHHHHHHHHHHHhCCCCCCH Q lcl|NC_020866. 318 HAV-SQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPK--TEDLTKLASSLDTLVRLGMEIEE 394 (547) Q Consensus 318 ~a~-~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~--~ed~~~~a~~~~~l~~~G~~v~~ 394 (547) ++- .+........-+...++.|+..||+.|++.. +.. -+|.|+... ..|.+.+++.+.++++.|+ ++. T Consensus 295 ~sN~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~-----g~~---~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~-~T~ 365 (723) T protein:vir:94 295 YENQAEAKAAVWTETLIPQMEVMASITDLQLLPDI-----GWT---VEWDFNSVPALQEDLEAQAGRNQGYLVNDV-LMV 365 (723) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhHhhcccc-----cCc---eEEeecchhhhhcCHHHHHHHHHHHHhCCC-cCH Confidence 222 2222334466788899999999999886532 211 246665433 4789999999999999998 999 Q ss_pred HHHHHHhCCCCCCCcchh--cccccccccccccccccccccCCccccccc-cccccchhhhhhcccchhhh--ccccccc Q lcl|NC_020866. 395 NEVRSRFGFSQPKPGAKL--LRPLENPATQAAPPGMDPGAGDLKTAERKL-KPLSGQADLSRGDLSLNSEM--PSTGQKT 469 (547) Q Consensus 395 ~~~~e~~Glp~p~~~e~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~~~ 469 (547) +++|+.+|+|+-+.++.. +.+...+-.+. +...|+.........+. ........+.. ........ ...+... 
T Consensus 366 NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~--~~~~p~~~e~~~~~~~~~~~~~~~~p~~~-~~~~~~~~~~~~~~~~~ 442 (723) T protein:vir:94 366 DEVRATIGLDPLPGGIGQMTLTPYRAQFAPA--PAPAPAVEEGAARMLALLERVAADRPLPE-LPVRATTVLHHDPGPDP 442 (723) T ss_pred HHHHHHhCCCCCCCCcccceeccccccccCC--CCCCccchhhhHhhhhhccccccccCcCC-CCCCCCCCCCCCcccCC Confidence 999999999865544422 22211111100 01111111000000000 00000000000 00000000 0000000 Q ss_pred chh---HHHHHH--------HHHHH----------HHHHHHHHHHHHHHHHHHccCCH---------HH----------- Q lcl|NC_020866. 470 GGA---EIAALT--------DQLEL----------EGQPEIVAMIEEIRTMLEAASSL---------GE----------- 508 (547) Q Consensus 470 ~~~---~~~~l~--------~~~~~----------~~~~~~~~~~~~i~~~l~~a~s~---------ee----------- 508 (547) .+. .+.++. .+.+. ....+...++..+++.+.+.-.- ++ T Consensus 443 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~ 522 (723) T protein:vir:94 443 QQTLYERLEALLQPLLVELGRRQAAVTLREFDLLMRGERAAALWLADVRAVASEAYERGALLAPPDAEEVPPARLTRLDL 522 (723) T ss_pred chhHHHHHHHHHhhhHHHHHHHHHHHHHHhhchhhcchHHHHHHHHHHHHHHHhccccceeccccccchhhHHHHHHHHH Confidence 011 111111 11110 01223333344444443321000 00 Q ss_pred ------HHHHHHHHhc-CCCHHHHHHHHHHHHHHHHHHhHH----hhccC Q lcl|NC_020866. 509 ------FSDMLDARYG-EIGASQLASRISAAMLASEFAGRE----GALDG 547 (547) Q Consensus 509 ------~~~~L~~l~~-~~~~~~l~~~l~~al~~A~~~G~~----~~~~g 547 (547) ++..+.+.+. .+....+.+.+...+..+.-.+.. 
...+| T Consensus 523 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~l~~~ 572 (723) T protein:vir:94 523 APEELAVRINVKRIFNARKWVARTKDTLRGWYETAWRTGGDHVAAQLGDG 572 (723) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 0000000000 000001111111100000000000 01111 No 56 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.85 E-value=5.5e-20 Score=126.16 Aligned_cols=446 Identities=15% Similarity=0.116 Sum_probs=229.2 Q ss_pred CCCCCeeeCCC---CC--------------------cccchhhh-----hhhh--hhhhhhhHHHHhhhhcc--CCCHHH Q lcl|NC_020866. 1 MAQKPVLLDRW---GN--------------------PVQRAQLT-----EEVA--APTVTGVRSPLSGYPGD--GLNPLR 48 (547) Q Consensus 1 ma~~~~~~~~~---g~--------------------~~~~~~l~-----~~~a--~~~~~~~~~~~~~~~~~--~l~p~~ 48 (547) |+ |++.. |+ ++....++ +..| .+.+... ++-.+|-.+ -.++.+ T Consensus 5 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~-~~~~~~~~r~~~~~~~~ 79 (551) T protein:vir:80 5 LG----LFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSM-SANPGFKTKPSIRNNQD 79 (551) T ss_pred hh----hHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccce-ecCcccccCccccChhH Confidence 33 22211 11 11111111 1111 1111000 000011000 112322 Q ss_pred HHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhc-----------CCceEecCC----CCHHHHHHHHHHH Q lcl|NC_020866. 49 LAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQ-----------LDITVEPAS----DDPQDVKIADMVR 113 (547) Q Consensus 49 ~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~-----------~~w~v~p~~----~~~~~~~~ae~v~ 113 (547) +..+++ .+. 
+-+.|.+|++.|...|.+ .+|.|.+.+ ....+.+..+.++ T Consensus 80 l~~~~~---------------~~~-~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~ 143 (551) T protein:vir:80 80 LHGVLK---------------KFG-GNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIE 143 (551) T ss_pred HHHHHH---------------Hhh-cCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHH Confidence 222222 222 358899999999999986 567776543 2344444455566 Q ss_pred HHHhCCC---------HHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCc-------ceeec Q lcl|NC_020866. 114 DWLKREE---------LQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLA-------TPLLL 176 (547) Q Consensus 114 ~~l~~~~---------~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~-------~~~~~ 176 (547) ++|.+++ |.++++.++ +.+.+|.+.+|+++...| .|..|.+++|.++++..+... +++.. T Consensus 144 ~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd~~G---~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~ 220 (551) T protein:vir:80 144 SFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQ---SMVRFVAKDPTTIFFATTADGKIPDNGNRFVQV 220 (551) T ss_pred HHHHhcCCCCCCccchHHHHHHHHHHHHHhcCCEEEEEEECCCC---cEEEEEEeCCceeEEEECCccccccCceEEEEE Confidence 6665543 456777777 567899999999986543 377899999998876443222 23344 Q ss_pred cCCCCceeeccCceEEEeecCCcC---ccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeC--CCCCCHHHHH Q lcl|NC_020866. 
177 DDNGAEVPLPAFRFIFADIPAKSG---ILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKW--QPGASEQDKD 251 (547) Q Consensus 177 ~~~~~g~~l~~~k~i~~~~~~~~~---~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~--~~~~~~~~~~ 251 (547) .+++....+++...|++++...++ .+||.+.+..+......-....++...|...-+.|--+-++ +...++++++ T Consensus 221 ~~g~~~~~~~~~eiiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~ 300 (551) T protein:vir:80 221 IDQKIVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALE 300 (551) T ss_pred eCCcEEEEEcccceEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHH Confidence 444555678888888777655433 57899999999999888888888999999998888654443 4457888888 Q ss_pred HHHHHHHHHhc---Cee--EEe-cCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH------hcchh---ccCcccc Q lcl|NC_020866. 252 TLFRAVANIAG---DCA--AII-PETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAV------LGQTT---TTDAVAG 316 (547) Q Consensus 252 ~l~~al~~~~~---~~~--~vi-p~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i------lGqtl---ts~~~gg 316 (547) ++.+.+.+.-+ +++ .|+ +.|++++=+. .. ..-..|.+..++..++|+++. +|... ++...++ T Consensus 301 ~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~--~~-~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~ 377 (551) T protein:vir:80 301 IFKREWKNSLSGINGSWQIPVVSAEDVKFVNMT--PS-ARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGG 377 (551) T ss_pred HHHHHHHHHhcCccccCccccccCCCceEEEcc--CC-hhHHHHHHHHHHHHHHHHHHhcCCHHHcCccccccccccccc Confidence 88888876522 222 344 4566555443 22 222247777888889999884 33211 1111222 Q ss_pred c--cchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCC Q lcl|NC_020866. 317 G--HAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIE 393 (547) Q Consensus 317 s--~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~ 393 (547) | ++-.+... .....-+...++.|+..||+.|++. |+. .-+|.|+.....+....++.. 
++...|+ ++ T Consensus 378 s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L~~~-----~~~---~~~f~f~~~~~~~~~~~~~~~-~~~~~g~-lT 447 (551) T protein:vir:80 378 SLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAE-----FGD---KYTFQFVGGDIKSELESVKIL-AEKAKVA-MT 447 (551) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc-----cCC---ceEEEeeccChhhHHHHHHHH-HHHhcCC-cC Confidence 2 23222222 3455678889999999999987653 221 236788777667766665544 4666787 89 Q ss_pred HHHHHHHhCCCCC-CCcchhcccccccc-----ccccccccccccc----CCccccccccccccchhhhhhcccchhhhc Q lcl|NC_020866. 394 ENEVRSRFGFSQP-KPGAKLLRPLENPA-----TQAAPPGMDPGAG----DLKTAERKLKPLSGQADLSRGDLSLNSEMP 463 (547) Q Consensus 394 ~~~~~e~~Glp~p-~~~e~~~~~~~~~~-----~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (547) .+++|+.+|+|+. +.++.++.+..... +............ ....+.....+....+..+. .+.... T Consensus 448 ~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~----~~~~~~ 523 (551) T protein:vir:80 448 VNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKD----TTGDIG 523 (551) T ss_pred HHHHHHHhCCCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccc----cCCCcc Confidence 9999999999763 44444432211110 0000000000000 00000000000000000000 000000 Q ss_pred ccccccchh---HHHHHHHH-HHHHHHH Q lcl|NC_020866. 464 STGQKTGGA---EIAALTDQ-LELEGQP 487 (547) Q Consensus 464 ~~~~~~~~~---~~~~l~~~-~~~~~~~ 487 (547) +......++ .-...... -.++|.. T Consensus 524 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (551) T protein:vir:80 524 KDGQRKDKDNANAGKQGMKGDKPNDWQT 551 (551) T ss_pred ccccccCccccchhhhhcCCCCccccCC Confidence 000000000 00000000 0111111 No 57 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=99.85 E-value=1e-19 Score=124.75 Aligned_cols=400 Identities=11% Similarity=0.025 Sum_probs=232.8 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCH--HHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNP--LRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p--~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) ||. ++....+ ..+..... ..+.|..+.+..+++ .....+. .....+-+.| T Consensus 1 ~~~---~~~~~~~---------~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~~~~---------------~~~a~~~~~v 52 (460) T protein:vir:10 1 MAN---RIIRALR---------ELTGLDNK-FNDAFIKYIGQTFTKYDNNGKTYL---------------EQGYNINPDV 52 (460) T ss_pred Cch---hHHHHHh---------hhhccCCC-chHHHHHhhccccCCCccchhhhh---------------HHHHhcchHH Confidence 874 3332221 11111111 111222222111111 1111111 1111246899 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHH----------------------------HHHHHHHHHhCC----CHHHHHH Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPASDDPQDVK----------------------------IADMVRDWLKRE----ELQQEVF 126 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~----------------------------~ae~v~~~l~~~----~~~~~i~ 126 (547) .+|++.....|.+++|.|.....+....+ .......++.++ ...++++ T Consensus 53 ~~~v~~ia~~iA~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~ 132 (460) T protein:vir:10 53 YSCISQMAAKTVAVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYS 132 (460) T ss_pred HHHHHHHHHhhhhCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHH Confidence 99999999999999999965432211000 001122334444 4667777 Q ss_pred HHH-HhhhhcCeeeeeEEeecC-CeeeEEeeEeeccceeEecccCCccee---------eccCCCCceeeccCceEEEee Q lcl|NC_020866. 
127 HIL-DCISKGYSFTEIVWNTSE-GQYFPDKLIWRDPRHFRFQKSDLATPL---------LLDDNGAEVPLPAFRFIFADI 195 (547) Q Consensus 127 ~~l-~a~~~G~sv~Ei~w~~~~-g~~~~~~l~~~~~~~~~~~~~~~~~~~---------~~~~~~~g~~l~~~k~i~~~~ 195 (547) .++ +.+.+|.+.+++++...+ +.-.+..|.+++|.++.+..+.....+ ....++....++++..|++++ T Consensus 133 ~~~~~lll~Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~ 212 (460) T protein:vir:10 133 LYKTYMRLNGNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKY 212 (460) T ss_pred HHHHHHhhcCCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEec Confidence 777 677899999999875542 334577899999999987665443222 222345557888998887776 Q ss_pred cCCc-----CccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-----Cee Q lcl|NC_020866. 196 PAKS-----GILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG-----DCA 265 (547) Q Consensus 196 ~~~~-----~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~ 265 (547) .... +..+|.+.+..+...........++...|...-+.|-.+.+++...++++++++.+.+.+.-+ +.. T Consensus 213 ~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~ 292 (460) T protein:vir:10 213 ANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGSTGLTQPQADSLKQRLTEMDKSPDRLSQI 292 (460) T ss_pred CCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecCCCCCHHHHHHHHHHHHHHhcCccccCCc Confidence 5443 456899999999988888888888888888888888888888888899999999988887633 235 Q ss_pred EEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHh-cchhccCc--cccccchhHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_020866. 266 AIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVL-GQTTTTDA--VAGGHAVSQEH-RLVQEDIETSDAMALAA 341 (547) Q Consensus 266 ~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~il-Gqtlts~~--~ggs~a~~~vh-~~v~~~~~~ad~~~i~~ 341 (547) ++++.|++++-+..+. . ...|.+..++...+|+++.- -..+.+.. 
.+.+++-.+.+ ......-+...++.|++ T Consensus 293 ~vl~~g~~~~~l~~~~--~-d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~ 369 (460) T protein:vir:10 293 AGASGEIAFTKISLNT--D-ELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQ 369 (460) T ss_pred eecCCCceEEEccCCh--h-HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHH Confidence 7888888777654332 2 22467777888899998841 12233322 22345444433 34455678889999999 Q ss_pred HHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCC-CCcchhcccccccc Q lcl|NC_020866. 342 ILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQP-KPGAKLLRPLENPA 420 (547) Q Consensus 342 ~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p-~~~e~~~~~~~~~~ 420 (547) .||+.|++..-. ....+|+|+...-..+....+....+.+.|+ +|.+++|+.+|+|+- +++.+.+-....-. T Consensus 370 ~ln~kl~~~~~~------~~~~~i~~d~~~l~~l~~d~~~~~~~~~~g~-~T~NE~R~~~g~~pi~~~~gD~~~~~~n~~ 442 (460) T protein:vir:10 370 AFDKKFIKRFKG------YENAVIEWDISELPEMQTDMVAMASWLNTIP-VTPNEIRIAMKYETLNQDGMDIVFMPSNKV 442 (460) T ss_pred HHHHhhcCcccc------cCCceEEeecchhhhHHHHHHHHHHHHhCCC-CCHHHHHHHhCCCCCCCCCCCeeeeccccc Confidence 999988664311 1122456654433223333344445677887 999999999999864 33333332222211 Q ss_pred cccccccccccccCCccccccccccccch Q lcl|NC_020866. 421 TQAAPPGMDPGAGDLKTAERKLKPLSGQA 449 (547) Q Consensus 421 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (547) +........ ... ...... 
T Consensus 443 ~~~~~~~~~--------~~~---~~nq~~ 460 (460) T protein:vir:10 443 RIDDVSNNL--------IDS---AFNQNQ 460 (460) T ss_pred chhhccccc--------CCC---cccCCC Confidence 111100000 000 000000 No 58 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=99.84 E-value=1e-19 Score=124.68 Aligned_cols=394 Identities=12% Similarity=0.058 Sum_probs=227.4 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhh--ccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYP--GDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~--~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |++ ..|+.+-++-+ +.+.+... .....++..... ..+++. . .+ .+-+.| T Consensus 1 ~~~-~~~~~~~k~~~----~~~~~~~~-~~~~~~~~~~~~~~~~~v~~---------------~----~a----~~~~~v 51 (409) T protein:vir:94 1 MAK-ENIVTRIKKKL----IDNWIDQS-ASKLYDFSPWKNKSFWGVIN---------------N----TL----ETNETI 51 (409) T ss_pred Ccc-cccchhhhhHH----hhhhhcCC-cccccccccccCccccccch---------------h----hh----hccHHH Confidence 984 55555544433 22332211 111111100000 001111 0 01 145779 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHH-HhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeE Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDW-LKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLI 156 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~-l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~ 156 (547) .+|++.+...|.+++|++.-..+. .+..+...+... -..+...++++.++ +.+.+|-+.+++++...| .|..|. 
T Consensus 52 ~~~i~~Ia~~ia~lp~~~~~~~~~-~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G---~~~~L~ 127 (409) T protein:vir:94 52 FSAITKLSNSMASLPLKMYEDYKV-VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH---QPSKLF 127 (409) T ss_pred HHHHHHHHHhhhhCceeEeecccc-cchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC---cEEEEE Confidence 999999999999999999654332 222233322211 11234666666655 567899999998765333 367899 Q ss_pred eeccceeEecccCC-cce--eeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_020866. 157 WRDPRHFRFQKSDL-ATP--LLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYG 233 (547) Q Consensus 157 ~~~~~~~~~~~~~~-~~~--~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G 233 (547) +++|.++.+..+.. ..+ .....++..+.+++...|++++....+..+|.+.+..+....-..... ..|. +..... T Consensus 128 ~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~~dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~-~~~~-~~~~~~ 205 (409) T protein:vir:94 128 LLNPDVVEMLIENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAV-RTFN-LTEMQK 205 (409) T ss_pred EEcCceeEEEEeCCCcEEEEEEEcCCceEEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHH-HHHH-HHhcCC Confidence 99999877543322 222 233344555678888877777665667788999998887665555443 3343 222223 Q ss_pred CCeEEEeCCCCCCHHHHHHHHHHHHHHhcC--eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc-hhc Q lcl|NC_020866. 234 QPLRIGKWQPGASEQDKDTLFRAVANIAGD--CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ-TTT 310 (547) Q Consensus 234 ~P~~vgk~~~~~~~~~~~~l~~al~~~~~~--~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq-tlt 310 (547) .|-.+-+++...++++++++.+.+.+..++ ..+|++.|++++-+..+ .....|.+..++..++|+++.--. .+. 
T Consensus 206 ~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~---~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 282 (409) T protein:vir:94 206 PDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKK---YVSEDIVASENLTRERVANVFQLPSVFL 282 (409) T ss_pred CCeeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCC---hhHHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 345667788888999999998888876554 35788888887755432 222246777778888999884212 233 Q ss_pred cCccccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecC--ccccCHHHHHHHHHHHHh Q lcl|NC_020866. 311 TDAVAGGHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIAR--PKTEDLTKLASSLDTLVR 387 (547) Q Consensus 311 s~~~ggs~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~--~~~ed~~~~a~~~~~l~~ 387 (547) .....++++..+-+. .....-+.--++.|++.||+.|+... .. .. ...|+|+. .-..|.+.+++++.++++ T Consensus 283 g~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~---~~-~~--~~~i~fd~~~ll~~d~~~~~~~~~~~~~ 356 (409) T protein:vir:94 283 NARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKT---DR-EK--NRYFKFNVKSYLRADSATQAEVYFKAVR 356 (409) T ss_pred CCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcc---cc-cC--cceEEeechhhhccCHHHHHHHHHHHHh Confidence 333345555544444 34455688888899999998775532 11 11 12455543 345789999999999999 Q ss_pred CCCCCCHHHHHHHhCCCCCCCcchhccccccccccccccc-cc-ccccCCccccc Q lcl|NC_020866. 388 LGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPG-MD-PGAGDLKTAER 440 (547) Q Consensus 388 ~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~ 440 (547) .|+ ++.+++|+.+|+|+-+.++..+..... .+...+.. .. ...+.....+. 
T Consensus 357 ~G~-~T~NE~R~~~g~~p~~ggD~~~~~~n~-~~~~~~~~~~~~~kGG~~n~~e~ 409 (409) T protein:vir:94 357 SGY-YTINDIREWEDLPPVEGGDKPLISGDL-YPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred CCC-cCHHHHHHHhCCCCCCCcCeEeecccc-cccccchhhcccccCCCCCcCCC Confidence 997 999999999999876666655433221 11111100 00 00000000000 No 59 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=99.83 E-value=2e-19 Score=123.09 Aligned_cols=395 Identities=11% Similarity=0.046 Sum_probs=226.4 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |.+ ..|+.+-..++.+..+.++.. +.. ++.-|. .+.....+.. .. .+-+.|.+ T Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~~~-----~~~-----------~~~~~~--~~~~~~v~~~-------~~-~~~~~V~~ 53 (409) T protein:vir:93 1 MAK-ENIVTRIKKKLIDNWIDQSTS-----KLY-----------DFSPWK--NRSFWGVINN-------TL-ETNETIFS 53 (409) T ss_pred CCc-cchhhhhhhhhhhhhhccccc-----ccc-----------cccccc--Cccccccchh-------hh-hccHHHHH Confidence 774 333333333333222222211 100 010000 0011110111 11 14577999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHH-HhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDW-LKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWR 158 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~-l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~ 158 (547) |++.+-..|.+++|.+.-..+. .+..+...+... 
-..+...++++.++ +.+.+|-+.+++++...| .+..|.++ T Consensus 54 ci~~Ia~~ia~lp~~~~~~~~~-~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G---~~~~L~~l 129 (409) T protein:vir:93 54 AITKLSNSMASLPLKMYEDYKV-VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH---QPSKLFLL 129 (409) T ss_pred HHHHHHHhhhhCceeEeecccc-ccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCC---cEEEEEEE Confidence 9999999999999999754322 222233332211 11234666666666 467799999998765332 36789999 Q ss_pred ccceeEeccc--CCc-ceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCC- Q lcl|NC_020866. 159 DPRHFRFQKS--DLA-TPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQ- 234 (547) Q Consensus 159 ~~~~~~~~~~--~~~-~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~- 234 (547) +|.++.+..+ ++. .+......+..+.+++...|++++....+..||.|.+..+....-...... .+. + ..++. T Consensus 130 ~~~~v~~~~~~~~~~~~y~~~~~~g~~~~~~~~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~-~~~-~-~~~~~~ 206 (409) T protein:vir:93 130 NPDVVEMLIENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVR-TFN-L-TEMQKP 206 (409) T ss_pred cCceeEEEEeCCCcEEEEEEEcCCceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHH-HHH-H-HhcCCC Confidence 9988776433 222 222333445556788888877776656677889999888776655544433 332 2 33443 Q ss_pred CeEEEeCCCCCCHHHHHHHHHHHHHHhcC--eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc-hhcc Q lcl|NC_020866. 235 PLRIGKWQPGASEQDKDTLFRAVANIAGD--CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ-TTTT 311 (547) Q Consensus 235 P~~vgk~~~~~~~~~~~~l~~al~~~~~~--~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq-tlts 311 (547) |-.+.+.+...++++++++.+.+.+...+ ..+|++.|++++=+..+. ....|.+..++...+|+++.--. .+.. 
T Consensus 207 ~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~---~d~q~~e~r~~~~~~Ia~~fgVPp~~lg 283 (409) T protein:vir:93 207 DSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKY---VSEDIVASENLTRERVANVFQLPSVFLN 283 (409) T ss_pred CceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCCh---hHHHHHHHHHHHHHHHHHHhCCCHHHhC Confidence 45566788888999998888888765443 467788888877554321 22246777778888898884222 2333 Q ss_pred CccccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecC--ccccCHHHHHHHHHHHHhC Q lcl|NC_020866. 312 DAVAGGHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIAR--PKTEDLTKLASSLDTLVRL 388 (547) Q Consensus 312 ~~~ggs~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~--~~~ed~~~~a~~~~~l~~~ 388 (547) ..++++++..+.+. .....-+...++.|++.||+.|+...- . ... .+|+|+. ....|.+.+++.++++++. T Consensus 284 ~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~---~-~~~--~~~~fd~~~ll~~d~~~~~~~~~~~~~~ 357 (409) T protein:vir:93 284 ARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTD---R-EKN--RYFKFNVKSYLRADSATQAEVYFKAVRS 357 (409) T ss_pred CCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccc---c-cCc--ceEEeechhhhccCHHHHHHHHHHHHhC Confidence 33445666555544 344567888889999999988765331 1 111 2455543 3456899999999999999 Q ss_pred CCCCCHHHHHHHhCCCCCCCcchhccccccccccccccc--ccccccCCccccc Q lcl|NC_020866. 389 GMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPG--MDPGAGDLKTAER 440 (547) Q Consensus 389 G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 440 (547) |+ ++.+++|+.+|+|+-+.++..+..... .+...+.. .....+..+..+. 
T Consensus 358 G~-~T~NE~R~~~g~~p~~ggD~~~~~~n~-~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 358 GY-YTINDIREWEDLPPVEGGDKPLISGDL-YPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred CC-cCHHHHHHHhCCCCCCCcCeeeecccc-cccccchhhcccccCCCCCcCCC Confidence 98 999999999999876556655433221 11111100 0001111111100 No 60 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=99.83 E-value=2.4e-19 Score=122.71 Aligned_cols=396 Identities=12% Similarity=0.056 Sum_probs=225.8 Q ss_pred CCCC--CeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCC-HHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHH Q lcl|NC_020866. 1 MAQK--PVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLN-PLRLAQILRAADHGDPVRYLDLAEVIEERDPH 77 (547) Q Consensus 1 ma~~--~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~-p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~ 77 (547) |.=- +.++....+++..+-.. .+.++++ +.-|. .....+ +...-..+-+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~~---~~~~~~-------v~~~~a~~~~~ 53 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWID-----------------QSTSKLYDFSPWK---NRSFWG-------VINNTLETNET 53 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhc-----------------ccccccccccccC---Cccccc-------cchhhhhccHH Confidence 4411 11222222222111111 1111111 10000 001111 00111124678 Q ss_pred HHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHH-HHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 78 YLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRD-WLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 78 v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~-~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) |.+|++.+...|.+++|++.-..+. .+..+...+.. 
--..+...++++.++ +.+.+|-++++++....| .+..| T Consensus 54 v~~~i~~ia~~iA~lp~~~~~~~~~-~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G---~~~~L 129 (412) T protein:vir:26 54 IFSAITKLSNSMASLPLKMYEDYKV-VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH---QPSKL 129 (412) T ss_pred HHHHHHHHHHhHhhCceeEeecccc-ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCC---cEEEE Confidence 9999999999999999998643322 22223332221 111234666666666 567899999998764333 36689 Q ss_pred EeeccceeEecccCCc-c--eeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020866. 156 IWRDPRHFRFQKSDLA-T--PLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTY 232 (547) Q Consensus 156 ~~~~~~~~~~~~~~~~-~--~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~ 232 (547) .+++|.++.+..+... . +......+....+++...|++++....+..||.|.+..+....-..... ..|. +.... T Consensus 130 ~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~-~~~~-~~~~~ 207 (412) T protein:vir:26 130 FLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAV-RTFN-LTEMQ 207 (412) T ss_pred EEEcCceeEEEEeCCCcEEEEEEEcCCceEEEEccccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHH-HHHH-HHhcC Confidence 9999988776443322 2 2233344555678888888777766667789999998887655554443 3343 32222 Q ss_pred CCCeEEEeCCCCCCHHHHHHHHHHHHHHhcC--eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc-hh Q lcl|NC_020866. 233 GQPLRIGKWQPGASEQDKDTLFRAVANIAGD--CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ-TT 309 (547) Q Consensus 233 G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~--~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq-tl 309 (547) ..|-.+.+++...++++++++.+.+.+...+ ..+|++.|++++-+..+ .....|.+..++..++|+++.--. 
.+ T Consensus 208 ~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~---~~d~q~~e~~~~~~~~Ia~afgVPp~~ 284 (412) T protein:vir:26 208 KPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKK---YVSEDIVASENLTRERVANVFQLPSVF 284 (412) T ss_pred CCCceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCC---hhHHHHHHHHHHHHHHHHHHhCCCHHH Confidence 3345667788888999999888888775544 45678888887755422 222246777778888999884212 23 Q ss_pred ccCccccccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecC--ccccCHHHHHHHHHHHH Q lcl|NC_020866. 310 TTDAVAGGHAVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIAR--PKTEDLTKLASSLDTLV 386 (547) Q Consensus 310 ts~~~ggs~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~--~~~ed~~~~a~~~~~l~ 386 (547) .....+++++..+.+... ...-+...++.|++.||+.|+... .. ... .+|+|+. ....|.+++++.+++++ T Consensus 285 lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kLl~~~---~~-~~~--~~~~fd~~~l~~~d~~~~~~~~~~~~ 358 (412) T protein:vir:26 285 LNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKT---DR-EKN--RYFKFNVKSYLRADSATQAEVYFKAV 358 (412) T ss_pred hCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc---cc-cCc--ceEEeechhhhccCHHHHHHHHHHHH Confidence 333334566666655544 455688899999999998875432 11 111 2455544 34578999999999999 Q ss_pred hCCCCCCHHHHHHHhCCCCCCCcchhccccccccccccccc--ccccccCCccccc Q lcl|NC_020866. 387 RLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPG--MDPGAGDLKTAER 440 (547) Q Consensus 387 ~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 440 (547) +.|+ ++.+++|+.+|+|+-+.+++.+..... .+...+.. .....+.....+. 
T Consensus 359 ~~G~-~t~NE~R~~~gl~p~~ggD~~~~~~n~-~~~~~~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 359 RSGY-YTINDIREWEDLPPVEGGDKPLISGDL-YPIDTPLELRKSLKGGDKNVNES 412 (412) T ss_pred hCCC-cCHHHHHHHhCCCCCCCcCeeeecccc-cccccchhhcccccCCCCCcCCC Confidence 9998 999999999999876666665543221 11111100 0000000000000 No 61 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=99.82 E-value=8.3e-20 Score=125.19 Aligned_cols=331 Identities=13% Similarity=0.076 Sum_probs=207.6 Q ss_pred HhcCCceEecCCCCHHHHHHHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccce Q lcl|NC_020866. 89 VSQLDITVEPASDDPQDVKIADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRH 162 (547) Q Consensus 89 v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~ 162 (547) |.++||+|.-.+. ..+.. +..+|. ++ ...++++.++ +.+.+|.+++.+++...| .|..|.++++.+ T Consensus 1 ia~lp~~~~~~~~-~~~~~----l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G---~~~~L~~l~~~~ 72 (348) T protein:vir:93 1 MASLPLKMYEDYK-VVNTE----VSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH---QPSKLFLLNPDV 72 (348) T ss_pred CcccceEeEecCc-CcccH----HHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC---cEEEEEEEcCCc Confidence 9999999964332 22222 334443 33 4666777776 567899999998865433 367899999987 Q ss_pred eEecccCCc-c--eeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEE Q lcl|NC_020866. 163 FRFQKSDLA-T--PLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIG 239 (547) Q Consensus 163 ~~~~~~~~~-~--~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vg 239 (547) +++..+... . +......+....++++..|++++....+..+|.|.+..+...+-.-.. ...|. +......|..+. 
T Consensus 73 v~~~~~~~~~~~~y~~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~-~~~~~-~~~~~~~~~~i~ 150 (348) T protein:vir:93 73 VEMLIENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNA-VRTFN-LTEMQKPDSFML 150 (348) T ss_pred eEEEEeCCCcEEEEEEEcCCCeEEEEccccEEEecCCCCCCceeeccHHHHHHHHHHHHHH-HHHHH-HHhcCCCceeEE Confidence 775433322 2 223344456677888888777776666778999998888765544333 34443 333333456777 Q ss_pred eCCCCCCHHHHHHHHHHHHHHhcC--eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-chhccCcccc Q lcl|NC_020866. 240 KWQPGASEQDKDTLFRAVANIAGD--CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-QTTTTDAVAG 316 (547) Q Consensus 240 k~~~~~~~~~~~~l~~al~~~~~~--~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-qtlts~~~gg 316 (547) +++...++++++++.+.+.+...+ ..+|++.|++++-+..+. ....|.+..++..++|+++.-- ..+....+++ T Consensus 151 ~~~~~l~~e~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~ 227 (348) T protein:vir:93 151 KYGSNVSTEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKY---VSEDIVASENLTRERVANVFQLPSIFLNARSNT 227 (348) T ss_pred ecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCCh---hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC Confidence 888888999999988888876554 357888888877654322 2224777788889999988421 2333333445 Q ss_pred ccchhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEec--CccccCHHHHHHHHHHHHhCCCCCC Q lcl|NC_020866. 317 GHAVSQEHRL-VQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIA--RPKTEDLTKLASSLDTLVRLGMEIE 393 (547) Q Consensus 317 s~a~~~vh~~-v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~--~~~~ed~~~~a~~~~~l~~~G~~v~ 393 (547) +++..+.+.. ....-+.-.++.|++.||+.|++.. ... 
...+|+|+ .....|.+.+++.+.++++.|+ ++ T Consensus 228 ~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~-~~~-----~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~-~T 300 (348) T protein:vir:93 228 NFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKT-DRE-----KNRYFKFNVKSYLRADSATQAEVYFKAVRSGY-YT 300 (348) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcc-ccc-----CcceEEeechhhhccCHHHHHHHHHHHHhCCC-CC Confidence 6776655544 4466788889999999998876532 111 11235554 3345689999999999999998 99 Q ss_pred HHHHHHHhCCCCCCCcchhccccccccccccc-ccccc-cccCCccccc Q lcl|NC_020866. 394 ENEVRSRFGFSQPKPGAKLLRPLENPATQAAP-PGMDP-GAGDLKTAER 440 (547) Q Consensus 394 ~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~~ 440 (547) .+++|+.+|+|+-+.++..+....-. +...+ ..... ..+..+..+. T Consensus 301 ~NE~R~~~g~~p~~ggD~~~~~~n~~-~~~~~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 301 INDIREWEDLPPVEGGDKPLISGDLY-PIDTPLELRKSLKGGDKNVNES 348 (348) T ss_pred HHHHHHHhCCCCCCCcCeEeeccccc-ccccchhhcccccCCCCCcCCC Confidence 99999999998766666655332211 11111 00000 0000000000 No 62 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=99.82 E-value=1.6e-19 Score=123.64 Aligned_cols=393 Identities=14% Similarity=0.055 Sum_probs=229.8 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |. |++..|+..... +++....+...+.. +..+ +. .... 
.+.+.++.+.|.+ T Consensus 1 Mg----~~~~~~~~~~~~------~~~~~~~~~~~~~~-~~~~------------~~--~~~~----~~~~~~~~~~v~~ 51 (423) T protein:vir:81 1 MG----FLQKLGLAPSVV------ATPEPIELVGPIFE-SLKL------------ST--KNMT----VEQIWEDQPHLRT 51 (423) T ss_pred Cc----hhHhhccccccc------cCcccccccccccc-cccc------------cc--chhh----HHHHHHhhhHHHH Confidence 88 777765422211 11111111000000 0000 00 0011 1222246899999 Q ss_pred HHHHHHHHHhcCCceEec--CCCC-HHHHHHHHHHHHHHhCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeE Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEP--ASDD-PQDVKIADMVRDWLKRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFP 152 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p--~~~~-~~~~~~ae~v~~~l~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~ 152 (547) |++.+...|.+++|+|.- +++. ..... ..+..+|.++ ...++++.++ +.+.+|-+++.+... .++...+ T Consensus 52 ~i~~ia~~ia~lp~~~~~~~~dg~~~~~~~--~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd-~~~~~~~ 128 (423) T protein:vir:81 52 VTTFIARNVASLQLQAFERVEDGGRERVRE--GHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGD-LGVDTPT 128 (423) T ss_pred HHHHHHHhHhhCceEEEEEecCCceeeecc--chHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcCcce Confidence 999999999999999842 2222 11111 1233455554 3666766666 566799877766532 2333334 Q ss_pred EeeEeeccceeEec--ccC--Ccceeecc---CCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 153 DKLIWRDPRHFRFQ--KSD--LATPLLLD---DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDW 225 (547) Q Consensus 153 ~~l~~~~~~~~~~~--~~~--~~~~~~~~---~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 225 (547) ..+.+.++..+... .++ ...+.... ..+.-+.+++...|+++.....+..+|.|.+..+....-......++. 
T Consensus 129 ~~l~p~~~~~v~~~~~~~~~~~~~Y~~~~~~~~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~ 208 (423) T protein:vir:81 129 LDIRPIPVSWVQRRAYKDGWGSLDYIIIESGDNDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFR 208 (423) T ss_pred EEEeecccceeeeeeccCCCcceEEEEEEecCCCceEEEEcccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHH Confidence 45666666665432 222 22233222 233446788888776665556666789999999998888888888888 Q ss_pred HHHHHHcCCCeEEEeCCC-----CCCHHHHHHHHHHHHHHhc------CeeEEecCCceEEEEecCCCCCchHHHHHHHH Q lcl|NC_020866. 226 AIFTQTYGQPLRIGKWQP-----GASEQDKDTLFRAVANIAG------DCAAIIPETMSIDFVESKSIGSSVDLYEKRIN 294 (547) Q Consensus 226 ~~f~E~~G~P~~vgk~~~-----~~~~~~~~~l~~al~~~~~------~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~ 294 (547) ..|...-+.|--|.+++. ..++++++++.+.+...-+ +..+|++.|++++-+..+. .-..|.+..+ T Consensus 209 ~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~---~d~q~~e~~~ 285 (423) T protein:vir:81 209 AQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTS---KDEQTVETTK 285 (423) T ss_pred HHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceecCCCceEEeccCCh---hhHHHHHHHH Confidence 889888888977766543 3577888888877776421 2467899998877654322 1223666677 Q ss_pred HHHHHHHHHHhcc--hhccCccccccchhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCcc Q lcl|NC_020866. 295 HLDQQISKAVLGQ--TTTTDAVAGGHAVSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPK 371 (547) Q Consensus 295 ~~d~~Isk~ilGq--tlts~~~ggs~a~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~ 371 (547) +...+|+++. |- .|..+.++++++-.+... .....-+...++.+++.||+.|+++.-... ...+.+|.++... 
T Consensus 286 ~~~~eIa~~f-gVPp~~lg~~~~~t~sn~e~~~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~~~---~~~~~~fd~~~ll 361 (423) T protein:vir:81 286 LSLQTVAQVY-GINPTMVGQLDNANYSNVREFRKALYGDNLGSWIRIIQDVMNLFLLPRVGIDN---EKFYFEFNLEEKL 361 (423) T ss_pred hhHHHHHHHh-CCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCcccccc---CccEEEecchhhh Confidence 8888999873 32 233333345555444433 344556888899999999998866542111 1122233334445 Q ss_pred ccCHHHHHHHHHHHH-hCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCc Q lcl|NC_020866. 372 TEDLTKLASSLDTLV-RLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLK 436 (547) Q Consensus 372 ~ed~~~~a~~~~~l~-~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (547) ..|++.+++.+.+++ +.|+ ++.+++|+.+|+|+-+.+++++.+.... +...+ ..++....+ T Consensus 362 r~d~~~r~~~~~~~l~~~G~-~T~NE~R~~~gl~p~~gGD~~~~p~n~~-~~~~~--~~~~~~~~t 423 (423) T protein:vir:81 362 RASFEEAAEIKRAAVGNVAW-MTINEVRAMDNLPSIDGGDDLARPLNTE-FGDSE--DAPGEEVET 423 (423) T ss_pred ccCHHHHHHHHHHHHhCCCC-cCHHHHHHHhCCCCCCCcceeecccccc-cCccC--CCCCCCCCC Confidence 678999999998866 5676 8999999999998877777666543221 11100 000000000 No 63 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=99.82 E-value=7.4e-19 Score=120.00 Aligned_cols=390 Identities=16% Similarity=0.103 Sum_probs=229.9 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHH-hhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPL-SGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYL 79 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~-~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~ 79 (547) |- |.|.+.+.-.+...... ......+ .+....+.++ .. ..++ +-+.|. 
T Consensus 1 Mg----~f~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~----------------~~----~~~~-~~~~v~ 49 (406) T protein:vir:95 1 MG----LFDRWRRTKRKSKIRAD------TGYVGLFMSGEDVSFLVP----------------GY----VRLS-DNPEVR 49 (406) T ss_pred Cc----chhhhcccccccccccc------chhhhhhccCcccCcccc----------------CH----HHHh-hcHHHH Confidence 88 88877654433221111 1111111 1111111111 00 1111 468999 Q ss_pred HHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCC----CHHHHHHHHHH-hhhh--cCeeeeeEEeecCCeeeE Q lcl|NC_020866. 80 GVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKRE----ELQQEVFHILD-CISK--GYSFTEIVWNTSEGQYFP 152 (547) Q Consensus 80 s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~----~~~~~i~~~l~-a~~~--G~sv~Ei~w~~~~g~~~~ 152 (547) +|+..+...+.+++|.|....++.....--.....++.++ .+.++++.++. .+.+ ||+++++++... ..+ T Consensus 50 ~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~---g~~ 126 (406) T protein:vir:95 50 MAVHKIADLISSMTIYLMQNTEDGDIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTAD---GLI 126 (406) T ss_pred HHHHHHHHhhccCceEEEEecCCcceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCC---CcE Confidence 9999999999999999964332211111111222333333 47777777774 3444 677777776432 347 Q ss_pred EeeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCc-CccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 153 DKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKS-GILLRSGLARVAAWGWMFKAFTARDWAIFTQT 231 (547) Q Consensus 153 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~-~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~ 231 (547) ..|.+++|.++++..+.....+.. .+..++++.+|++++...+ +..+|.+.+..+....-.-....++...+... 
T Consensus 127 ~~l~~i~~~~v~~~~~~~~~~~~~----~~~~~~~~evih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~n 202 (406) T protein:vir:95 127 DELVPLTPSKVNFLDTPDGYQVLY----GGQTFNYDEVLHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSG 202 (406) T ss_pred EEEEEEcCceeEEEEcCCeEEEEe----ccEEEchhHEEEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 889999999988776654322222 2457888888877765444 44679999999999988888889999999999 Q ss_pred cCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCc-eEEEEecCCCCCchHHHHHHHHHHHHHHHHHHh Q lcl|NC_020866. 232 YGQPLRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETM-SIDFVESKSIGSSVDLYEKRINHLDQQISKAVL 305 (547) Q Consensus 232 ~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~-~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~il 305 (547) .+.|--+.+++...++++++++.+.+.+.-. +..+|++.+. +++-+ ...+.....|.+..++.-++|+++.- T Consensus 203 g~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~v~~~~~~~~~~~--~~~~~~d~q~~e~~~~~~~~Ia~~fg 280 (406) T protein:vir:95 203 KYMPSLIVKVDAATAELSSEEGRNAVFKKYLQATEAGQPWIIPAELLEVEQV--KPLSLKDIAINEAVELDKRTVAGMFG 280 (406) T ss_pred cCCcceEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeecCCCcccccc--ccCChhHHHHHHHHHHHHHHHHHHhC Confidence 9999999999888899988888777766432 2245676554 44321 11112222466777888889988832 Q ss_pred c-chhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHH Q lcl|NC_020866. 306 G-QTTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDT 384 (547) Q Consensus 306 G-qtlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~ 384 (547) - ..+.+.+ +.. .+........-+...++.|++.||+.++. 
+...+.+|.+......|.+.+++.+.+ T Consensus 281 Vp~~~lg~~---~~~-~~~~~~~~~~~l~P~~~~ie~~l~~~l~~--------~~~~~~~fd~~~l~~~d~~~~~~~~~~ 348 (406) T protein:vir:95 281 VPAFLLGIG---EFN-RDEYNNFINSTILPIAKGIEQELTRKLLI--------SPDLYFKFNPRSLYAYDLKELAEVGSN 348 (406) T ss_pred CCHHHcCCC---Cch-HHHHHHHHHHHHHHHHHHHHHHHHHhcCC--------CCCcEEEeechhhhcCCHHHHHHHHHH Confidence 1 2222211 221 23344556667778888888888876642 222333444444456789999999999 Q ss_pred HHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchh Q lcl|NC_020866. 385 LVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 385 l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) +++.|+ ++.+++|+.+|+|+-+.++..+.+.... +...........+..... ..+..+ T Consensus 349 l~~~G~-~t~NE~R~~~gl~p~~~gd~~~~~~n~~-~~~~~~~~~~~k~g~~~~------~~~~~~ 406 (406) T protein:vir:95 349 MYVRGI-MEGNEVRDWLGLSPKEGLSELVILENYI-PLDKIGDQSKLKGGDNSG------ADGQTD 406 (406) T ss_pred HHhCCC-cCHHHHHHHhCCCCCCCcceeeeccCcc-chhhcccccccCCCCCCC------CCCCCC Confidence 999998 8999999999998766566555332221 111100000000000000 000000 No 64 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=99.82 E-value=1.4e-18 Score=118.42 Aligned_cols=452 Identities=14% Similarity=0.093 Sum_probs=224.3 Q ss_pred CCCC---CeeeCCCCC-------cc----------------cchhhhhhhhh---hhhhhhHHH------HhhhhccCCC Q lcl|NC_020866. 1 MAQK---PVLLDRWGN-------PV----------------QRAQLTEEVAA---PTVTGVRSP------LSGYPGDGLN 45 (547) Q Consensus 1 ma~~---~~~~~~~g~-------~~----------------~~~~l~~~~a~---~~~~~~~~~------~~~~~~~~l~ 45 (547) ||.. -++=--+|. || ....+++-+-. +-..++... 
+..-|...-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 80 (563) T protein:vir:99 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKN 80 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCC Confidence 3321 000000111 11 11111111100 001111111 1111211223 Q ss_pred HHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhc-----------CCceEec--CCCCH--HHHHHHH Q lcl|NC_020866. 46 PLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQ-----------LDITVEP--ASDDP--QDVKIAD 110 (547) Q Consensus 46 p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~-----------~~w~v~p--~~~~~--~~~~~ae 110 (547) |.++..+|+.... -+.|.+|++.|...|.. ..|.|.. .+.+. .+.+... T Consensus 81 ~~~l~~~l~~~~~----------------n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~ 144 (563) T protein:vir:99 81 EHNLHDVLKKFGN----------------NPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMK 144 (563) T ss_pred cccHHHHHHHhhc----------------chHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhH Confidence 4444555554332 24455556655555553 2344433 22222 2222222 Q ss_pred HHHHHHhC------C---CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcce------- Q lcl|NC_020866. 111 MVRDWLKR------E---ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATP------- 173 (547) Q Consensus 111 ~v~~~l~~------~---~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~------- 173 (547) .+..+|.. + .|.++++.++ +.+.+|.+.+|+++.+++ ...|..|.+++|.++++..+....+ T Consensus 145 ~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~-~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y 223 (563) T protein:vir:99 145 RIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNN-KTKLEKFIAVDPSTIFYATDKKGKIIKGGKRF 223 (563) T ss_pred HHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecC-CCceEEEEEeCCceeEEEECCCCceeccceeE Confidence 33333321 1 3667887777 578899999999887754 3346789999999988754443322 Q ss_pred eeccCCCCceeeccCceEEEeecCCcC---ccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCC--CCCHH Q lcl|NC_020866. 
174 LLLDDNGAEVPLPAFRFIFADIPAKSG---ILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQP--GASEQ 248 (547) Q Consensus 174 ~~~~~~~~g~~l~~~k~i~~~~~~~~~---~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~--~~~~~ 248 (547) +...++.....+++...|+|+.....+ .+||.+.+..+......-....++...|....|.|--+-+++. ..+++ T Consensus 224 ~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e 303 (563) T protein:vir:99 224 VQVVDKRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQH 303 (563) T ss_pred EEEeCCceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHH Confidence 222333344567888888888776544 6889999999999999999999999999999999976666543 45888 Q ss_pred HHHHHHHHHHHHhc---Ce---eEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH------hcchh----ccC Q lcl|NC_020866. 249 DKDTLFRAVANIAG---DC---AAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAV------LGQTT----TTD 312 (547) Q Consensus 249 ~~~~l~~al~~~~~---~~---~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i------lGqtl----ts~ 312 (547) +++++.+.+.+.-+ ++ .+|++.|++++=+..+. ....|.+..++..++|+++. +|-.- +++ T Consensus 304 ~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~---~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~ 380 (563) T protein:vir:99 304 ALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTA---NDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGS 380 (563) T ss_pred HHHHHHHHHHHHhccccccccceEEcCCCceEEeccCCh---hHHHHHHHHHHHHHHHHHHhCCCHHHcccccccccccc Confidence 89998888887533 22 36789998777654332 12247777888889999883 33221 122 Q ss_pred ccccccchhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHH--HHHHh Q lcl|NC_020866. 313 AVAGGHAVSQE---HRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSL--DTLVR 387 (547) Q Consensus 313 ~~ggs~a~~~v---h~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~--~~l~~ 387 (547) +.+++...+.+ .......-+..-++.|+..||+.|++.. +. ..+|.|... |.+..++.. 
.++++ T Consensus 381 ~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~-----~~---~~~~~f~r~---D~~~~~e~~~~~~~~~ 449 (563) T protein:vir:99 381 KGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEY-----GD---KYTFQFVGG---DTKSATDKLNILKLET 449 (563) T ss_pred ccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhc-----cc---ccEEEeccC---CHHHHHHHHHHHHHhc Confidence 22333222222 2234556678888999999999887642 21 124555443 444455443 34567 Q ss_pred CCCCCCHHHHHHHhCCCCCCCcchhcccccccccc-----ccccccc--------ccccCCccccccccccccchhhhhh Q lcl|NC_020866. 388 LGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQ-----AAPPGMD--------PGAGDLKTAERKLKPLSGQADLSRG 454 (547) Q Consensus 388 ~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~-----~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (547) .|+ ++.+++|+.+|+|+-+.++..+.+....... ....... .+....+...++..+.......... T Consensus 450 ~G~-lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (563) T protein:vir:99 450 QIF-KTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKE 528 (563) T ss_pred CCc-cCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCccc Confidence 787 8999999999998776666554332111000 0000000 0000000000000000000000000 Q ss_pred ----cccchhhhcccc-cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCHHH Q lcl|NC_020866. 455 ----DLSLNSEMPSTG-QKTGGAEIAALTDQLELEGQPEIVAMIEEIRTMLEAASSLGE 508 (547) Q Consensus 455 ----~~~~~~~~~~~~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee 508 (547) ............ .....+- +.. +...++.. 
T Consensus 529 ~~~~~~~~~~~~~~~~~~~~~~~~-------~~~-----------------~~~~~~~~ 563 (563) T protein:vir:99 529 IGTDAQIKGDDNVYRTQTSNKGQG-------RKG-----------------EKSSDFKH 563 (563) T ss_pred cccccccccccccccccCcccccc-------ccC-----------------cCcccccC Confidence 000000000000 0000000 000 00001110 No 65 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=99.82 E-value=1.4e-18 Score=118.42 Aligned_cols=452 Identities=14% Similarity=0.093 Sum_probs=224.3 Q ss_pred CCCC---CeeeCCCCC-------cc----------------cchhhhhhhhh---hhhhhhHHH------HhhhhccCCC Q lcl|NC_020866. 1 MAQK---PVLLDRWGN-------PV----------------QRAQLTEEVAA---PTVTGVRSP------LSGYPGDGLN 45 (547) Q Consensus 1 ma~~---~~~~~~~g~-------~~----------------~~~~l~~~~a~---~~~~~~~~~------~~~~~~~~l~ 45 (547) ||.. -++=--+|. || ....+++-+-. +-..++... +..-|...-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 80 (563) T protein:vir:95 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKN 80 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCC Confidence 3321 000000111 11 11111111100 001111111 1111211223 Q ss_pred HHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhc-----------CCceEec--CCCCH--HHHHHHH Q lcl|NC_020866. 46 PLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQ-----------LDITVEP--ASDDP--QDVKIAD 110 (547) Q Consensus 46 p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~-----------~~w~v~p--~~~~~--~~~~~ae 110 (547) |.++..+|+.... -+.|.+|++.|...|.. ..|.|.. .+.+. .+.+... 
T Consensus 81 ~~~l~~~l~~~~~----------------n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~ 144 (563) T protein:vir:95 81 EHNLHDVLKKFGN----------------NPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMK 144 (563) T ss_pred cccHHHHHHHhhc----------------chHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhH Confidence 4444555554332 24455556655555553 2344433 22222 2222222 Q ss_pred HHHHHHhC------C---CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcce------- Q lcl|NC_020866. 111 MVRDWLKR------E---ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATP------- 173 (547) Q Consensus 111 ~v~~~l~~------~---~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~------- 173 (547) .+..+|.. + .|.++++.++ +.+.+|.+.+|+++.+++ ...|..|.+++|.++++..+....+ T Consensus 145 ~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~-~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y 223 (563) T protein:vir:95 145 RIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNN-KTKLEKFIAVDPSTIFYATDKKGKIIKGGKRF 223 (563) T ss_pred HHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecC-CCceEEEEEeCCceeEEEECCCCceeccceeE Confidence 33333321 1 3667887777 578899999999887754 3346789999999988754443322 Q ss_pred eeccCCCCceeeccCceEEEeecCCcC---ccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCC--CCCHH Q lcl|NC_020866. 174 LLLDDNGAEVPLPAFRFIFADIPAKSG---ILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQP--GASEQ 248 (547) Q Consensus 174 ~~~~~~~~g~~l~~~k~i~~~~~~~~~---~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~--~~~~~ 248 (547) +...++.....+++...|+|+.....+ .+||.+.+..+......-....++...|....|.|--+-+++. 
..+++ T Consensus 224 ~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e 303 (563) T protein:vir:95 224 VQVVDKRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQH 303 (563) T ss_pred EEEeCCceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHH Confidence 222333344567888888888776544 6889999999999999999999999999999999976666543 45888 Q ss_pred HHHHHHHHHHHHhc---Ce---eEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH------hcchh----ccC Q lcl|NC_020866. 249 DKDTLFRAVANIAG---DC---AAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAV------LGQTT----TTD 312 (547) Q Consensus 249 ~~~~l~~al~~~~~---~~---~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i------lGqtl----ts~ 312 (547) +++++.+.+.+.-+ ++ .+|++.|++++=+..+. ....|.+..++..++|+++. +|-.- +++ T Consensus 304 ~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~---~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~ 380 (563) T protein:vir:95 304 ALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTA---NDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGS 380 (563) T ss_pred HHHHHHHHHHHHhccccccccceEEcCCCceEEeccCCh---hHHHHHHHHHHHHHHHHHHhCCCHHHcccccccccccc Confidence 89998888887533 22 36789998777654332 12247777888889999883 33221 122 Q ss_pred ccccccchhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHH--HHHHh Q lcl|NC_020866. 313 AVAGGHAVSQE---HRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSL--DTLVR 387 (547) Q Consensus 313 ~~ggs~a~~~v---h~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~--~~l~~ 387 (547) +.+++...+.+ .......-+..-++.|+..||+.|++.. +. ..+|.|... |.+..++.. 
.++++ T Consensus 381 ~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~-----~~---~~~~~f~r~---D~~~~~e~~~~~~~~~ 449 (563) T protein:vir:95 381 KGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEY-----GD---KYTFQFVGG---DTKSATDKLNILKLET 449 (563) T ss_pred ccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhc-----cc---ccEEEeccC---CHHHHHHHHHHHHHhc Confidence 22333222222 2234556678888999999999887642 21 124555443 444455443 34567 Q ss_pred CCCCCCHHHHHHHhCCCCCCCcchhcccccccccc-----ccccccc--------ccccCCccccccccccccchhhhhh Q lcl|NC_020866. 388 LGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQ-----AAPPGMD--------PGAGDLKTAERKLKPLSGQADLSRG 454 (547) Q Consensus 388 ~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~-----~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (547) .|+ ++.+++|+.+|+|+-+.++..+.+....... ....... .+....+...++..+.......... T Consensus 450 ~G~-lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (563) T protein:vir:95 450 QIF-KTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKE 528 (563) T ss_pred CCc-cCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCccc Confidence 787 8999999999998776666554332111000 0000000 0000000000000000000000000 Q ss_pred ----cccchhhhcccc-cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCHHH Q lcl|NC_020866. 455 ----DLSLNSEMPSTG-QKTGGAEIAALTDQLELEGQPEIVAMIEEIRTMLEAASSLGE 508 (547) Q Consensus 455 ----~~~~~~~~~~~~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee 508 (547) ............ .....+- +.. +...++.. 
T Consensus 529 ~~~~~~~~~~~~~~~~~~~~~~~~-------~~~-----------------~~~~~~~~ 563 (563) T protein:vir:95 529 IGTDAQIKGDDNVYRTQTSNKGQG-------RKG-----------------EKSSDFKH 563 (563) T ss_pred cccccccccccccccccCcccccc-------ccC-----------------cCcccccC Confidence 000000000000 0000000 000 00001110 No 66 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=99.81 E-value=6.5e-19 Score=120.28 Aligned_cols=390 Identities=12% Similarity=0.059 Sum_probs=220.9 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |. +. . +..+. ..+. ...+..++.+.++ ..+.... .| +-+.|.+ T Consensus 1 m~----~f---~-~~~~~----~~~~--~~~~~~~~~~~~~--------------------~~~~~~~-Al--~~~~V~~ 43 (406) T protein:vir:97 1 MS----FF---Q-PLGTS----KVSY--DDYISSVLAGDVS--------------------QKYLGVS-AL--KNSDILT 43 (406) T ss_pred Cc----cc---c-ccCCC----CCCc--chHHHHHhcCCCC--------------------cccccch-hh--ccHHHHH Confidence 33 11 1 11110 0000 0001011111110 0111110 11 3467999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-C----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEe Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-R----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDK 154 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~ 154 (547) |++.+...|.++++.+.-.++... .. .-+..+|+ + +.+.++++.++ +.+.+|.+.++++....+| .+.. 
T Consensus 44 ~i~~Ia~~iA~lp~~~~~~~g~~~-~~--~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g--~~~~ 118 (406) T protein:vir:97 44 ATSIIAGDIARFPLVKKDVNGDII-HD--EDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTN--QALQ 118 (406) T ss_pred HHHHHHHhhhhCeeEEEecCcccc-cc--chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCC--eEEE Confidence 999999999999998875443211 11 12344453 3 34666777666 4667999999998654444 3568 Q ss_pred eEeeccceeEecccCCcc--eeec-cCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 155 LIWRDPRHFRFQKSDLAT--PLLL-DDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQT 231 (547) Q Consensus 155 l~~~~~~~~~~~~~~~~~--~~~~-~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~ 231 (547) |.+++|..+++..++... +... ..++....+++...|++++... +..+|.|.+..+....-.-....++...|.+. T Consensus 119 L~~i~p~~v~v~~~~~~~~~y~~~~~~~~~~~~~~~~evih~r~~~~-dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~n 197 (406) T protein:vir:97 119 FQFYRPSETTVEETDNHEIVYTFTDMLTAKQVKCFAHDVIHWKFFSH-DTILGRSPLLSLGDEIDLQTGGINTLIKFFKD 197 (406) T ss_pred EEEECCCeeEEEEcCCceEEEEEEecCCceEEEEccccEEEecCCCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 999999988775443333 2222 2334456788888887776553 34779999999888877777888888888877 Q ss_pred cCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcC----eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_020866. 232 YGQPLRIGKWQPGASEQDKDTLFRAVANIAGD----CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ 307 (547) Q Consensus 232 ~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~----~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq 307 (547) -+.|-.+-+.+...++++++++.+.+.+..++ ..+|++.|++++-++.+. ... -|.+..++..++|+++.--. 
T Consensus 198 g~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~--~d~-q~le~~~~~~~~Ia~afgVP 274 (406) T protein:vir:97 198 GFSSGILTMKGAQLSGDARQRARQEFEKMREGSVGGSPLVFDSTMEYTPLEIDT--NVL-QLITSNNFSTAQIAKALRVP 274 (406) T ss_pred cCCCceEEecCCCCCHHHHHHHHHHHHHHhcccccCceeecCCCceEEEccCCH--HHH-HHHHHHHhhHHHHHHHhCCC Confidence 77777777777778999999998888877542 357889999887765332 111 25566677788898873111 Q ss_pred -hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHH Q lcl|NC_020866. 308 -TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLV 386 (547) Q Consensus 308 -tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~ 386 (547) .+......++ +..+........-+...++.|++.||+.|+..--. ..-+++|+. ..+++..++.+.+++ T Consensus 275 p~~lg~~~~~~-~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~-------~~~~i~fd~--~~~~~~~~~~~~~~~ 344 (406) T protein:vir:97 275 SYKLGVNSPNQ-SVAQLMEDYVTNDLPFYFDAITSELGLKTLNDKDR-------RLYHIEFDT--RSVTGRNVDEIVKLV 344 (406) T ss_pred HHHcCCCCCcc-hHHHHHHHHHHHHHHHHHHHHHHHHhhhhcChhhc-------cceeEEEec--CccchhhHHHHHHHH Confidence 1111111122 22334444455667888888999998877543211 112456654 346677788888999 Q ss_pred hCCCCCCHHHHHHHhCCCCCCC--cchhcccccccccccccccccccccCCccccccccccccchhhh Q lcl|NC_020866. 387 RLGMEIEENEVRSRFGFSQPKP--GAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLS 452 (547) Q Consensus 387 ~~G~~v~~~~~~e~~Glp~p~~--~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (547) +.|+ ++.+++|+.+|+|+-.+ ++..+.+.................. 
.....+.....+.+ T Consensus 345 ~~g~-~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~-----~~~gg~~~~~~~~~ 406 (406) T protein:vir:97 345 NNQI-LTPNQGLVELGKQKSTDPNMDRYQSSLNYVFLDKKEEYQDKVGI-----KGKGGEVNAEEDKS 406 (406) T ss_pred hCCC-cCHHHHHHHhCCCCCCCCCCCeEeeccCccchhccccccccccc-----ccCCCCCCCCCCCC Confidence 9998 99999999999986433 3444333211000000000000000 00000000000000 No 67 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=99.81 E-value=4.7e-19 Score=121.07 Aligned_cols=393 Identities=14% Similarity=0.077 Sum_probs=222.9 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhh-ccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYP-GDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYL 79 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~-~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~ 79 (547) |- +. | .+..+..+. |..+. ..++-| ...++.. ... .| +=+.|. T Consensus 1 m~----~~----~-----~~~~~~~~~--------~~~~~~~~~~~~--------~~~g~~~-~~~----Al--~~~~V~ 44 (417) T protein:vir:38 1 MK----LF----R-----GLATEVDPH--------WADHLLDSGVIP--------SFRGGYL-GIS----AL--RNSDVL 44 (417) T ss_pred Cc----cc----c-----ccccCCCcc--------chhhhccccccc--------ccCCcee-chh----hc--ccHHHH Confidence 22 11 1 111111110 11110 111111 1111111 100 12 457799 Q ss_pred HHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHH-hCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEE Q lcl|NC_020866. 80 GVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWL-KRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPD 153 (547) Q Consensus 80 s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l-~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~ 153 (547) +|++.+...|.+++|++.-.+.+..... .-+..+| .++ ...++++.++ +.+.+|.++++++....++ .|. 
T Consensus 45 ~cv~~ia~~iA~lp~~~~~~~~~~~~~~--~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~--~~~ 120 (417) T protein:vir:38 45 TAVSIVSGDVSRFPLVITDSSTDEVIDL--ANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITN--EPA 120 (417) T ss_pred HHHHHHHHhhccCeeEEEEcCCcceecc--chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCC--EEE Confidence 9999999999999999965443222111 1122333 233 4566766666 4678999999998654444 367 Q ss_pred eeEeeccceeEecccCCcce--eecc-CCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 154 KLIWRDPRHFRFQKSDLATP--LLLD-DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQ 230 (547) Q Consensus 154 ~l~~~~~~~~~~~~~~~~~~--~~~~-~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E 230 (547) .+.+++|.++.+.......+ .... +++....+++..+|++++.. .+..+|.|.+..+....-......++...|.. T Consensus 121 ~l~~l~p~~v~v~~~~~~~~~y~~~~~~~~~~~~~~~~dviH~r~~~-~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ 199 (417) T protein:vir:38 121 MFEFYAPSQTQVDTSDPDNIIYRFTPYNSSMQKVCGFEDVIHWKFFS-YDTIMGRSPLLSLGDEIGLQESGVSTLQKFFK 199 (417) T ss_pred EEEEeCCceEEEEEcCCCeEEEEEEEcCCcEEEEecCcceEEecCCC-CCCccccCHHHHHHHHHHHHHHHHHHHHHHHh Confidence 78999999887654433332 2222 22334457777777777654 34478999999999888888888899999998 Q ss_pred HcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 231 TYGQPLRIGKWQPGASEQDKDTLFRAVANIAG----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG 306 (547) Q Consensus 231 ~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG 306 (547) .-+.|-.|.+++...++++++++.+.+.+..+ +..+|++.|++++-++.+. . 
.-.|.+..++..++|+++ +| T Consensus 200 ng~~p~~il~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~--~-d~q~le~~~~~~~~Ia~~-fg 275 (417) T protein:vir:38 200 SGLKGSIIKAKESRLSAEARQKIREDFERAQAGADAGSPIIVDATMDYQPLEVDT--N-VLNLINSNNYSTAQIAKA-LR 275 (417) T ss_pred ccCCCcEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceeccCCceEEEccCCH--H-HHHHHHHHHhhHHHHHHH-hC Confidence 88999888899888899999999888877543 3467889999877654332 1 123677777888999986 44 Q ss_pred ch--hccCccccccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHH Q lcl|NC_020866. 307 QT--TTTDAVAGGHAVS-QEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLD 383 (547) Q Consensus 307 qt--lts~~~ggs~a~~-~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~ 383 (547) -. +.++ .++++.. +........-+...++.|++.||+.|+.+....+ ..|+|+... .+ ......++ T Consensus 276 VPp~~lg~--~~~~s~~e~~~~~~~~~tl~P~~~~ie~~l~~~Ll~~~~~~~-------~~~~fd~~~-l~-~~~~~~~~ 344 (417) T protein:vir:38 276 VPAYRLAQ--NSPNQSVKQLADDYIRNDLPFYFEPITSEFELKLLDDAQRHQ-------YCIGFDTKS-VN-GLPIADVN 344 (417) T ss_pred CCHHHhCC--CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhcChhhccc-------ceEEechhh-hh-HHHHHHHH Confidence 21 2222 2233322 2333445567888899999999988865432211 257776432 22 23345577 Q ss_pred HHHhCCCCCCHHHHHHHhCCCCCCCc--chhccccccccccc-----ccccccccccCCccccccccccccchhhhhhc Q lcl|NC_020866. 384 TLVRLGMEIEENEVRSRFGFSQPKPG--AKLLRPLENPATQA-----APPGMDPGAGDLKTAERKLKPLSGQADLSRGD 455 (547) Q Consensus 384 ~l~~~G~~v~~~~~~e~~Glp~p~~~--e~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (547) ++++.|+ ++.+++|+.+|+|+-+.+ +.+..+.. ..+.. +.+......+..+..+...+....... 
+ T Consensus 345 ~~~~~G~-~T~NE~R~~~gl~pi~~g~~d~~~~~~n-~~~~d~~~~~~~~~~~~~kgg~~~~~~~~~~~~~~~~----~ 417 (417) T protein:vir:38 345 TAVNGGL-WTGNEGRAELGKKPLKDPNMDRIQSTLN-TVFLDQKEAYQAEHAAELKGGDTNAKGNQNGSGTNAN----S 417 (417) T ss_pred HHHhCCC-cCHHHHHHHhCCCCCCCCCCCeeeeccc-ccccccccccccccccccCCCCCCCCCCCcCCCCcCC----C Confidence 8889997 999999999999864444 22222111 00000 000000000000001100000000000 0 No 68 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=99.80 E-value=4.5e-18 Score=115.70 Aligned_cols=448 Identities=13% Similarity=0.071 Sum_probs=230.2 Q ss_pred CCCCCe-----ee---CCCCCcccchhhhhhhhhhhhhhhHHHHhhh--hccCCCHHHHHHHHHhhcCCCHHHHHHHHHH Q lcl|NC_020866. 1 MAQKPV-----LL---DRWGNPVQRAQLTEEVAAPTVTGVRSPLSGY--PGDGLNPLRLAQILRAADHGDPVRYLDLAEV 70 (547) Q Consensus 1 ma~~~~-----~~---~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~--~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~ 70 (547) |..+-. .+ ++-|+...+ +...+.++.. -++.... .-.-.||..|+ . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~-~~~~~~~~~~~p~~~~~~L~-------------------~ 56 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLA----KSPNSTQIPD-HRIQSHNVGVNPPYNPDRLA-------------------A 56 (651) T ss_pred CCCccceeeeeEEEeeccccccccc----ccccccccch-hhhcccCCCCCCCCCHHHHH-------------------H Confidence 332210 00 111111111 1111111111 0111111 11122443322 2 Q ss_pred HHhhhHHHHHHHHHHHHHHhcCCceEecCC---CCHHHHHHHHHHHHHHhC---------------CCHHHHHHHHH-Hh Q lcl|NC_020866. 71 IEERDPHYLGVIGTRKRSVSQLDITVEPAS---DDPQDVKIADMVRDWLKR---------------EELQQEVFHIL-DC 131 (547) Q Consensus 71 m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~---~~~~~~~~ae~v~~~l~~---------------~~~~~~i~~~l-~a 131 (547) |.+.-+.+.+|++..+..|.++.|.|+|.. ++.++.+..+.++.++.. 
..+..++..++ |- T Consensus 57 ~~e~~~~~~~~i~~~~~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dl 136 (651) T protein:vir:99 57 FLELNETLATGIRKKSRYEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDY 136 (651) T ss_pred HHhcChHHHHHHHHHhhhhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHH Confidence 334578899999999999999999999843 233343334444444322 24566666555 56 Q ss_pred hhhcCeeeeeEEeecCCeeeEEeeEeecccee--------------------------------------------Eecc Q lcl|NC_020866. 132 ISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHF--------------------------------------------RFQK 167 (547) Q Consensus 132 ~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~--------------------------------------------~~~~ 167 (547) ..+||+|+|++-+ ..+. |..+...|+..+ .... T Consensus 137 e~tGna~ieiIrn-~~g~--pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g 213 (651) T protein:vir:99 137 HGVGWLALEMLTD-IEGR--PVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAG 213 (651) T ss_pred HHHhhHhhhhhhc-Cccc--hhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEee Confidence 7789999998522 1111 111111111111 0000 Q ss_pred cCC--cc-----------e---------------------eeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHH Q lcl|NC_020866. 168 SDL--AT-----------P---------------------LLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAW 213 (547) Q Consensus 168 ~~~--~~-----------~---------------------~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~ 213 (547) +.. .. + +...+......+++...|++++....+.+||.|.+..+.. T Consensus 214 ~~~~~~~~~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~ 293 (651) T protein:vir:99 214 DRYRGQEVVIDESGDEPTIRYREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIR 293 (651) T ss_pred ccccceeeeeccCCcceeEEeccCcceeeeeecccceeeeEEEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHH Confidence 000 00 0 0001111223456666766666665677899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHcCCCeEEEeCCC-CCCHHHHHHHHHHHHHHhcCe--eEEecC---------CceEEEEecCC Q lcl|NC_020866. 
214 GWMFKAFTARDWAIFTQTYGQPLRIGKWQP-GASEQDKDTLFRAVANIAGDC--AAIIPE---------TMSIDFVESKS 281 (547) Q Consensus 214 ~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~-~~~~~~~~~l~~al~~~~~~~--~~vip~---------~~~ie~~e~~~ 281 (547) ....-....++...|....+.|--|-+++. ..++++++++.+.+.+...++ .+|++. +..+++...+. T Consensus 294 ~i~~a~~a~~~~~~~f~NG~~p~gil~~~~~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~ 373 (651) T protein:vir:99 294 TISADEAAKDYNRDFFDNDTIPRMVIKVTGGELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQ 373 (651) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCc Confidence 999889999999999999999988888754 478999999998888766553 455554 22444444332 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHhc-chhccCccccccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_020866. 282 IGSSVDLYEKRINHLDQQISKAVLG-QTTTTDAVAGGHAVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLEYGPQ 359 (547) Q Consensus 282 ~~~~~~~f~~li~~~d~~Isk~ilG-qtlts~~~ggs~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~ 359 (547) .......|.+..++...+|+++.-- ..+....+.++++..+.+... ...-+...++.++..||+.|+....... T Consensus 374 ~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~---- 449 (651) T protein:vir:99 374 GISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVT---- 449 (651) T ss_pred CchhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccc---- Confidence 2211224788888889999988422 233444445667766666554 4567889999999999998876543322 Q ss_pred ccccEEEecC--ccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCC--cchhcccccccccccccccccccccCC Q lcl|NC_020866. 360 KRYPRIKIAR--PKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKP--GAKLLRPLENPATQAAPPGMDPGAGDL 435 (547) Q Consensus 360 ~~~p~~~~~~--~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~--~e~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (547) ...-+|+|+. ....|.+.+++.+.++++.|+ ++.+++|+.+|+|+-.+ ++..+.+........ ....+.. 
T Consensus 450 ~~~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~-~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~-----~~~gge~ 523 (651) T protein:vir:99 450 DWTIEYELRGADQPKQEAQLAEQRVRAMRLAGV-GLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGD-----VAGGGET 523 (651) T ss_pred CceEEEEeccchhhhccHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCccccccccccccccccc-----cccCCCC Confidence 1112455544 345788999999999999997 99999999999986432 233332211111000 0000000 Q ss_pred ccccccccccccchhhhhhcccchhhhcccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHH Q lcl|NC_020866. 436 KTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIAALTDQLELEGQPEIVAMIEEIRTMLEAASSLGEFSDMLDA 515 (547) Q Consensus 436 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~ 515 (547) +.....+ .. ...... ++ +.+.+.+. -.+.++++ T Consensus 524 ~~~~~~~-~~---------------------~~~~~~-----------e~--------~~~~~~~~----~~e~~~~~-- 556 (651) T protein:vir:99 524 EAVHEPP-EE---------------------NKIGER-----------EW--------DTVKSELT----TKDPIEQM-- 556 (651) T ss_pred cccccCc-cc---------------------cccccc-----------hh--------hhhhhhhc----ccchhhhh-- Confidence 0000000 00 000000 00 00111110 01111111 Q ss_pred HhcCCCHHHHHHHHHHHHHHHHHHhHHhhcc-------------C Q lcl|NC_020866. 516 RYGEIGASQLASRISAAMLASEFAGREGALD-------------G 547 (547) Q Consensus 516 l~~~~~~~~l~~~l~~al~~A~~~G~~~~~~-------------g 547 (547) .-.-..-+-.|+ ++.. 
| T Consensus 557 --------------~v~ss~~~~~gy-d~~~~~l~~~f~~~~~~~ 586 (651) T protein:vir:99 557 --------------QFSSSNLDEGLY-DFGENELYLSFLRDEGQS 586 (651) T ss_pred --------------hHHHHHHHhhcC-CCccceEEEEEeecCCCC Confidence 111111122332 1111 1 No 69 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=99.80 E-value=1.3e-17 Score=113.11 Aligned_cols=474 Identities=14% Similarity=0.052 Sum_probs=236.3 Q ss_pred CCCCCeeeCC-CCCcccchhhhhh-----hhhh-hhhhhHHHHhhh---hccCCCHHHHHHHHHhhcCCCHHHHHHHHHH Q lcl|NC_020866. 1 MAQKPVLLDR-WGNPVQRAQLTEE-----VAAP-TVTGVRSPLSGY---PGDGLNPLRLAQILRAADHGDPVRYLDLAEV 70 (547) Q Consensus 1 ma~~~~~~~~-~g~~~~~~~l~~~-----~a~~-~~~~~~~~~~~~---~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~ 70 (547) |-..+.=-.. .-++.....+..- +|.+ ...++-++-.++ |. -.+|.++..+|+....+.... .. T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~~~~~iv~-----~~ 100 (574) T protein:vir:80 27 MHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPS-IRNSQDLHKTLKKFGNNIILN-----AI 100 (574) T ss_pred cccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCc-cCCcccHHHHHHhhccChhHH-----HH Confidence 3311100000 0011111111111 1110 010111111111 21 135667777777666543221 11 Q ss_pred HHhhhHHHHHHHHHHHHHHhcCCceEecCCCC----HHHHHHHHHHHHHHhCC---------CHHHHHHHHH-HhhhhcC Q lcl|NC_020866. 71 IEERDPHYLGVIGTRKRSVSQLDITVEPASDD----PQDVKIADMVRDWLKRE---------ELQQEVFHIL-DCISKGY 136 (547) Q Consensus 71 m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~----~~~~~~ae~v~~~l~~~---------~~~~~i~~~l-~a~~~G~ 136 (547) |..+=.+|.+|+......+.+++|.|...+.+ ....+....+..+|+.. .|.++++.++ +.+.+|. 
T Consensus 101 i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gn 180 (574) T protein:vir:80 101 INTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQ 180 (574) T ss_pred HHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCC Confidence 22234567888877777888999999754322 11122222344444321 3567887777 4678999 Q ss_pred eeeeeEEeecCCeeeEEeeEeeccceeEecccCCc-------ceeeccCCCCceeeccCceEEEeecCCcC---ccccch Q lcl|NC_020866. 137 SFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLA-------TPLLLDDNGAEVPLPAFRFIFADIPAKSG---ILLRSG 206 (547) Q Consensus 137 sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~-------~~~~~~~~~~g~~l~~~k~i~~~~~~~~~---~p~G~g 206 (547) +++|+++...| .|..|.+++|.++++..+... +++...+++....+++..+|+++++..++ .+||.+ T Consensus 181 ayi~i~r~~~G---~~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~~~~~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~s 257 (574) T protein:vir:80 181 VNFEKVFDKDG---NFIKFDTVDPTTIFLATNGEGKLIKNGERFVQVIDNRIVAKFNERELAFAVRNPRADIEVGQYGYP 257 (574) T ss_pred eEEEEEECCCC---cEEEEEEEcCceeEEEEcCccccccCceEEEEEeCCceEEEEccccEEEEeccCCCCccccccccc Confidence 99999986543 367899999998877544322 23444445555678888888887766543 568999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeC--CCCCCHHHHHHHHHHHHHHhc---Cee---EEecCCceEEEEe Q lcl|NC_020866. 207 LARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKW--QPGASEQDKDTLFRAVANIAG---DCA---AIIPETMSIDFVE 278 (547) Q Consensus 207 ll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~--~~~~~~~~~~~l~~al~~~~~---~~~---~vip~~~~ie~~e 278 (547) .+..+....-.-....++...|...-+.|--|.++ +...++++++++.+.+.+.-+ +++ ++.+.|++++=+. 
T Consensus 258 pi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~ 337 (574) T protein:vir:80 258 ELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMT 337 (574) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEcc Confidence 99999999888888899999999998998755554 344688888888888877532 222 3446676655443 Q ss_pred cCCCCCchHHHHHHHHHHHHHHHHHH------hcchh--cc---CccccccchhHHHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 279 SKSIGSSVDLYEKRINHLDQQISKAV------LGQTT--TT---DAVAGGHAVSQEHRL-VQEDIETSDAMALAAILNRD 346 (547) Q Consensus 279 ~~~~~~~~~~f~~li~~~d~~Isk~i------lGqtl--ts---~~~ggs~a~~~vh~~-v~~~~~~ad~~~i~~~ln~~ 346 (547) .+ ..-..|.+..++..++|+.+. +|.+- |+ .++..+++..+.... ....-+...++.|+..||+. T Consensus 338 ~s---~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~ 414 (574) T protein:vir:80 338 PS---ANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTY 414 (574) T ss_pred CC---hhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 22 222247777788889999884 33321 11 111123454444443 44556889999999999998 Q ss_pred HHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccc Q lcl|NC_020866. 347 LIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPP 426 (547) Q Consensus 347 li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~ 426 (547) |++.. +. . -+|+|...+-.+.....+ +.+++..|+ ++.+++|+.+|+|+-+.++.++.+........... 
T Consensus 415 Ll~~~-----~~--~-~~~~f~~~d~~~~~~~~~-~~~~~~~G~-lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~ 484 (574) T protein:vir:80 415 IVAEF-----GE--K-YQFQFRGGDLSAQLDKLK-IIEQEGKVF-RTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQ 484 (574) T ss_pred hhhhc-----CC--c-eEEEecccchhhHHHHHH-HHHHHhCCc-cCHHHHHHHhCCCCCCCCCEeeeccceeecccccc Confidence 87642 21 1 256666544333332222 234566786 99999999999987766665543321110000000 Q ss_pred cc--------------cccccCCccccccccccccchhhhhhccc----chhhhcccccccchhHHHHHHHHHHHHHH-- Q lcl|NC_020866. 427 GM--------------DPGAGDLKTAERKLKPLSGQADLSRGDLS----LNSEMPSTGQKTGGAEIAALTDQLELEGQ-- 486 (547) Q Consensus 427 ~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-- 486 (547) .. ....+..+..+....+.....+....... .....+....... +.+ ..+.+.+.=. T Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~-~~~--~~~~~~~~~~~~ 561 (574) T protein:vir:80 485 EEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQDEQQGLNGKSKKVNGKVD-DNV--GKDGQLKSEENT 561 (574) T ss_pred cccCCccchhccccccccccCCCCCCCCCCCCCCccccccchhhhhhhhhccchhhhcCCcc-ccc--cccccccccccc Confidence 00 00000001111111111111100000000 0000000000000 000 0000000000 Q ss_pred HHHHHHHHHHHHHHHccCC Q lcl|NC_020866. 487 PEIVAMIEEIRTMLEAASS 505 (547) Q Consensus 487 ~~~~~~~~~i~~~l~~a~s 505 (547) -+...-..... +. T Consensus 562 ~~~~~~~~~~~------~~ 574 (574) T protein:vir:80 562 NSTKHGTDGIK------KE 574 (574) T ss_pred ccccccCcccc------CC Confidence 00000000000 00 No 70 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=99.79 E-value=1.6e-18 Score=118.15 Aligned_cols=372 Identities=11% Similarity=0.057 Sum_probs=218.3 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |. |.+..-+.... ..... ...-.... .. ...+ . ..|.. +......+-+.|.+ T Consensus 1 M~----~f~~~~~~~~~----~~~~~---~~~~~~~~-~~--~~~~------~---~~~~~-----v~~~~al~~~~v~~ 52 (386) T protein:vir:49 1 MP----IFNITNLATES----PPINQ---ESFFDIAD-SD--FLAS------L---NSSEW-----VSAENALKNSDLFS 52 (386) T ss_pred Cc----hhhhhccCCCC----cccch---hhhhhhhh-cc--cccc------c---cCCce-----echhhhhccHHHHH Confidence 77 54332221111 11110 00000000 00 0000 0 00000 00000114678999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhC----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKR----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) |++.+...|.++++++..... + .++.+ +...++++.++ +.+.+|-+++++++..++ .+..| T Consensus 53 ~i~~ia~~ia~~p~~~~~~~~--------~---~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g---~~~~l 118 (386) T protein:vir:49 53 IISQLSNDLATAKITTSRKQL--------Q---GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNG---RDMKW 118 (386) T ss_pred HHHHHHHHhhhCceeeccchh--------h---hhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCC---cEEEE Confidence 999999999999999974221 1 23333 35677777777 456799999999986443 36689 Q ss_pred EeeccceeEecccCCc-ce--eec-c--CCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 156 IWRDPRHFRFQKSDLA-TP--LLL-D--DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFT 229 (547) Q Consensus 156 ~~~~~~~~~~~~~~~~-~~--~~~-~--~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 229 (547) .+++|.++++..+... .+ ... . ..+....++....|++++....+..+|.|.+..+....-.-....++...+. 
T Consensus 119 ~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~ 198 (386) T protein:vir:49 119 EYLRPSQVSFNRLDNQNGLYYNITFDDPHIAPKQHVPQNDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISAL 198 (386) T ss_pred EEecCceeEEEEcCCCceEEEEEEEcCccccceeEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999988876543322 22 121 1 2234467888888877777777778999999999999988888899999999 Q ss_pred HHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcC--eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc- Q lcl|NC_020866. 230 QTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGD--CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG- 306 (547) Q Consensus 230 E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~--~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG- 306 (547) ..-++|-.+.+++...+++++.++.+...+...+ ..+|++.|++++-+..+. ....|.+..++...+|+++.-- T Consensus 199 ~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~---~d~~~~e~~~~~~~~Ia~~fgVP 275 (386) T protein:vir:49 199 KNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKS---NVAQLLSQADWTTGQFAKVYGIP 275 (386) T ss_pred HccCCccEEEEeCCCCChHHHHHHHHHHHHhccCCCCceecCCCceEEEccCCh---hHHHHHHHHHHHHHHHHHHhCCC Confidence 9999999999999888888888888877777665 457788888776554322 2224777788888999988311 Q ss_pred chhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHH Q lcl|NC_020866. 307 QTTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLV 386 (547) Q Consensus 307 qtlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~ 386 (547) ..+.+. .+.+++.++..+......+..-++.++..||+.|...+ .|........|...++..+.+|. 
T Consensus 276 p~~lg~-~~~~~~~~~~~~~~~~~~i~~~l~~i~~~~~~~l~~~~------------~~~~~~~~~~d~~~~~~~~~~l~ 342 (386) T protein:vir:49 276 ESIVGG-DGDQQSSLEMIYNIYFKSVSRYLRPFVSEMSKKLSCEV------------DVDISPAVDPTGSNYISLINSMV 342 (386) T ss_pred HHHhCC-CCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcchh------------cccchhhhccCHHHHHHHHHHHH Confidence 122221 22334434444444455666666777777776553221 12222233457778889999999 Q ss_pred hCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchh Q lcl|NC_020866. 387 RLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 387 ~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) ..|+ ++.+++|+.++-..-...+ +-... .+.. +..+.+. .. +.+ T Consensus 343 ~~g~-~t~nE~r~~l~~~~~~~~~-~~~~~-~~~~----~~~~gGd--~~-----------~~~ 386 (386) T protein:vir:49 343 KSGT-LAQNQGLYILQQAEILPKE-LPDGK-NPNR----TSLKGGE--IN-----------EQD 386 (386) T ss_pred hCCC-cCHHHHHHHHhhCCCCCCc-Ccchh-ccCC----CCCCCCC--CC-----------CCC Confidence 9998 8999999988632111111 00000 0000 0000000 00 000 No 71 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=99.78 E-value=1.1e-17 Score=113.53 Aligned_cols=366 Identities=10% Similarity=0.041 Sum_probs=218.8 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |. |.+..+..... -+..............+.+.....+++ . .. 
.+.+.|.+ T Consensus 1 Mg----~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~v~~---------------~-------~a-l~~~~v~~ 51 (385) T protein:vir:10 1 MG----LLTPRNFNKRK--AKNMVYPSNPAFFTTTVGGMQLSYVSA---------------L-------SA-LQNTNVYS 51 (385) T ss_pred Cc----cccchhccccc--ccccccccchhhhhhhccccCccccCH---------------H-------Hh-hccHHHHH Confidence 88 77664432111 111111101111101111111111111 0 01 14678999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) |++.....+.+++|+|.... ...+|+++ ...++++.+. +.+.+|-++++++.. . ..+ T Consensus 52 ~i~~ia~~ia~~p~~v~~~~-----------~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~--~-----~~~ 113 (385) T protein:vir:10 52 VINRIASDVASAHFKTENTA-----------TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ--N-----LEH 113 (385) T ss_pred HHHHHHHHHhhCceeeeccc-----------hhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC--c-----eeE Confidence 99999999999999996321 22345554 4666777666 455799999998642 1 234 Q ss_pred EeeccceeEecccCCcceeec--cCCCCceeeccCceEEEeecCC--cCccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 156 IWRDPRHFRFQKSDLATPLLL--DDNGAEVPLPAFRFIFADIPAK--SGILLRSGLARVAAWGWMFKAFTARDWAIFTQT 231 (547) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~l~~~k~i~~~~~~~--~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~ 231 (547) .+.++.++....++....+.. ..++....++++..|++++... .+..+|.|.+..+....-......++...+... 
T Consensus 114 ~p~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~n 193 (385) T protein:vir:10 114 IPNSDVQINYLPGNMGIVYTVLESNDRPQMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMEN 193 (385) T ss_pred eecCCceEEEEEcCCceEEEEEEcCCceEEEEccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 455566665554444443333 3344556789888877765433 345789999999999998888899999999999 Q ss_pred cCCCeEEEeCCCCC-CHHHHHHHHHHHHHHhc----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 232 YGQPLRIGKWQPGA-SEQDKDTLFRAVANIAG----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG 306 (547) Q Consensus 232 ~G~P~~vgk~~~~~-~~~~~~~l~~al~~~~~----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG 306 (547) -+.|--+.+++... ++++++.+.+.+.++.+ +..++++.|++++-+..+. ...+...+..++..++|+++.-- T Consensus 194 g~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~--~d~~~l~e~~~~~~~~Ia~~fgV 271 (385) T protein:vir:10 194 QINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKT--DVFKALADNSAYSADQISKAFGV 271 (385) T ss_pred cCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCCh--hHHHHHHHHHHHHHHHHHHHhCC Confidence 89998888887555 46678888888887643 2468899998887665332 22222245667778889888421 Q ss_pred c-hhcc--CccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHH Q lcl|NC_020866. 307 Q-TTTT--DAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLD 383 (547) Q Consensus 307 q-tlts--~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~ 383 (547) . .+.. 
+.++++++..+.....+..-+...++.|++.||+.|+ ++ ..+|.+...-..|.+.+++.++ T Consensus 272 p~~~lg~~~~~~~~~sn~eq~~~~~~~~l~P~~~~ie~~l~~~l~--------~~---~~~f~~~~ll~~d~~~~~~~~~ 340 (385) T protein:vir:10 272 PSDILGGGTSTESQHSNIDQIKATYLANLNSYVNPIVDELRLKMN--------AP---DLELDIKDMLDVDDSALINQVS 340 (385) T ss_pred CHHHcCCccCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHhhC--------Cc---eEEeechhhhccCHHHHHHHHH Confidence 1 2222 2222334444444444555677888889999987652 11 1234444444679999999999 Q ss_pred HHHhCCCCCCHHHHHHHhCCCCCC-Cc-chhcccccccccccccccccccccCCc Q lcl|NC_020866. 384 TLVRLGMEIEENEVRSRFGFSQPK-PG-AKLLRPLENPATQAAPPGMDPGAGDLK 436 (547) Q Consensus 384 ~l~~~G~~v~~~~~~e~~Glp~p~-~~-e~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (547) ++++.|+ ++.+++|+.+|++.-. ++ +++..+... ...+..... T Consensus 341 ~~~~~G~-~T~NE~R~~~g~~p~p~~~~~~~~~~~~~---------~~~g~~~dn 385 (385) T protein:vir:10 341 NLAKSGV-LGAEQAQFILTRSGFLPDNLPEFKPLTTQ---------VKGGDEGDN 385 (385) T ss_pred HHHhCCC-cCHHHHHHHhCCCccCCCCCccccCcccc---------cCCCCCCCC Confidence 9999998 9999999999975422 12 221111110 011111100 No 72 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=99.78 E-value=3.9e-18 Score=116.04 Aligned_cols=374 Identities=12% Similarity=0.058 Sum_probs=216.5 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |. -.|.+..-+.-. +..+..+. .....+-++ .+...+.... |.. 
+......+-+.|.+ T Consensus 1 m~--m~~~~~~~~~~~------~~~~~~~~-------~~~~~~~~~-~~~~~~~~~~-g~~-----v~~~~al~~~~v~~ 58 (392) T protein:vir:74 1 MI--LPILNFINQTND------PPEAGSVQ-------SYFPDGNDA-QIMESLLGDN-NEW-----VSARAALRNSDLFS 58 (392) T ss_pred Cc--chhhhhhhcccC------cccccccc-------cccccCchh-hhhhhccCCC-Ccc-----cchhhhhcchHHHH Confidence 33 013333222111 11111111 011111111 1122222211 111 11111125688999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) |++.+-..|.++++++..... ..+++++ ...++++.++ +.+.+|.+++++++..+| .+..| T Consensus 59 ~v~~ia~~ia~lp~~~~~~~~-----------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G---~~~~L 124 (392) T protein:vir:74 59 IILQLSSDLAIVKINAEKKKN-----------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANG---ADMKW 124 (392) T ss_pred HHHHHHHhhccCceeeccchh-----------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCC---cEEEE Confidence 999999999999999974321 1233333 4667777766 677899999999876443 36789 Q ss_pred EeeccceeEeccc--CCcce-eeccCC-C--CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 156 IWRDPRHFRFQKS--DLATP-LLLDDN-G--AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFT 229 (547) Q Consensus 156 ~~~~~~~~~~~~~--~~~~~-~~~~~~-~--~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 229 (547) .+++|.++.+..+ ++... ...... . .-..+++...|++++...++..||.|.+..+....-.-....++...+. 
T Consensus 125 ~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f 204 (392) T protein:vir:74 125 EYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSL 204 (392) T ss_pred EEEcCceeEEEEcCCCceEEEEEEecCCccceeEEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999998876543 22222 222221 1 2246788888777777777778999999999999999999999999999 Q ss_pred HHcCCCeEEEeCCCCC--CHHHHHHHHHHHHHHhcC-eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 230 QTYGQPLRIGKWQPGA--SEQDKDTLFRAVANIAGD-CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG 306 (547) Q Consensus 230 E~~G~P~~vgk~~~~~--~~~~~~~l~~al~~~~~~-~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG 306 (547) ...+.|--+.+++... ++++++.+.+......+. ..+|++.|++++-+..+. ....|.+..++..++|+++.-- T Consensus 205 ~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~~~Ia~~fgV 281 (392) T protein:vir:74 205 NSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKS---NVAQLLSQTDWTSKQYAKVYGL 281 (392) T ss_pred hccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCCh---hHHHHHHHHHHHHHHHHHHhCC Confidence 9999999888886543 345566666666555443 458899999888765432 2224777788888999988421 Q ss_pred -chhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHH Q lcl|NC_020866. 307 -QTTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTL 385 (547) Q Consensus 307 -qtlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l 385 (547) ..+.++. 
+.+++..+........-+..-++.|++.+|+.|++.+ .+| +...-..|...+++.+.++ T Consensus 282 Pp~~lg~~-~~~~~~~e~~~~~~~~~l~p~~~~ie~~l~~~l~~~~-~~~-----------~~~~~~~d~~~~~~~~~~l 348 (392) T protein:vir:74 282 PDSYIGGQ-GDQQSSIQQISGMYASALNRYLRPAISELEYKLSDHI-SVN-----------MRPAIDPLGDNYLSTISTA 348 (392) T ss_pred CHHHhCCC-CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhccchh-ccc-----------chhhhcCCHHHHHHHHHHH Confidence 2222221 1122222323334556677788888888888775432 222 1122234677888999999 Q ss_pred HhCCCCCCHHHHHHHh---CCCCCCCcchhcccccccccccccccccccccCCcccc Q lcl|NC_020866. 386 VRLGMEIEENEVRSRF---GFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAE 439 (547) Q Consensus 386 ~~~G~~v~~~~~~e~~---Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (547) ++.|+ ++.+++|+.+ |+. |.+--..... + + .+.+.. ..+.+ T Consensus 349 ~~~g~-~t~near~~~~~~g~~-pne~r~~enl---~---~----~~~Gd~-~~p~p 392 (392) T protein:vir:74 349 TRWGA-LAENQATFVLQEAGYI-PKDLPAPENT---N---K----KTTGQS-NEPVP 392 (392) T ss_pred HhCCC-cCHHHHHHHHHhCCCC-ccccchhcCC---C---C----CCCCCC-CCCCC Confidence 99997 8988888876 654 3211110000 0 0 000000 00000 No 73 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=99.77 E-value=1.4e-17 Score=112.95 Aligned_cols=383 Identities=16% Similarity=0.093 Sum_probs=215.0 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |. |.+.+++.-. +.+... ... +........+.. . .+..+ . 
+.+.|.+ T Consensus 1 Mg----~~~~f~~k~~----~~~~~~--~~~---~~~~~~~~~~~~-----------~----~~~~~----~-~~~~V~~ 47 (403) T protein:vir:80 1 MG----LFNFFRRKTR----SEPTNA--ISW---FLTQEAYDTLAI-----------P----GYTRL----S-DNPEVRM 47 (403) T ss_pred Cc----cccccccccc----ccccch--hhh---hccccccccccc-----------c----hhhhh----h-hhHHHHH Confidence 88 7777666321 121111 110 111111111110 0 01111 1 3577999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-CC----CHHHHHHHHHH-hhh--hcCeeeeeEEeecCCeeeE Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-RE----ELQQEVFHILD-CIS--KGYSFTEIVWNTSEGQYFP 152 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l~-a~~--~G~sv~Ei~w~~~~g~~~~ 152 (547) |+..+...|.++++++.-..++-. ++...-+..+|. ++ ...++.+.++. .+. +|+++++++|... -.+ T Consensus 48 ~I~~ia~~iA~~p~~~~~~~~~g~-~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~---g~~ 123 (403) T protein:vir:80 48 AVHKIAELISSMTIHLMQNTDNGD-IRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTS---GLI 123 (403) T ss_pred HHHHHHHhhhhCceEEEEecCCce-eecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCC---CcE Confidence 999999999999999853322111 111112333343 44 34556665553 333 6889999887432 246 Q ss_pred EeeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCc-cccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 153 DKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGI-LLRSGLARVAAWGWMFKAFTARDWAIFTQT 231 (547) Q Consensus 153 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~-p~G~gll~~~~~~~~~K~~~~~~w~~f~E~ 231 (547) ..|.+++|..+.+..++....+... +..+++...|+++.+..+.+ .+|.|.+..+....-.-....++...|... 
T Consensus 124 ~~L~~l~p~~v~~~~~~~g~~~~y~----~~~~~~~eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 199 (403) T protein:vir:80 124 DELIPLAPSKVSFVDTDTGYQIWYQ----GKAYNYDEVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSG 199 (403) T ss_pred EEEEEEcCCeeEEEEcCCceEEEEe----ecccchhhEEEEeccCCCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 7899999998876554443333222 34577777776665444444 569999998888877777778888889988 Q ss_pred cCCCeEEEeCCCCCCHHHHHHHHHHHHHH-----hcCeeEEecCCce-E-EEEecCCCCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 232 YGQPLRIGKWQPGASEQDKDTLFRAVANI-----AGDCAAIIPETMS-I-DFVESKSIGSSVDLYEKRINHLDQQISKAV 304 (547) Q Consensus 232 ~G~P~~vgk~~~~~~~~~~~~l~~al~~~-----~~~~~~vip~~~~-i-e~~e~~~~~~~~~~f~~li~~~d~~Isk~i 304 (547) -+.|-.|.+++...+++..+++.+.+.+. .++..+++|.+.. . ++...+ ...-.|.+..++...+|+++. T Consensus 200 g~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~---~~d~q~~e~~~~~~~~Ia~~f 276 (403) T protein:vir:80 200 KYMPSLIVKVDAATAELSSEEGRNAVFKKYLEASEAGQPWIIPAELLDVEQVKPLS---LKDLAIHETVELDKRTVAGIF 276 (403) T ss_pred cCCcceEEEeCCCCChHHHHHHHHHHHHHHhhhhhcCCeeeecccccccceeccCC---HHHHHHHHHHHHhHHHHHHHh Confidence 89999888888777776666655554432 2345577776642 2 232221 111135677778888899873 Q ss_pred hcc-hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecC--ccccCHHHHHHH Q lcl|NC_020866. 305 LGQ-TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIAR--PKTEDLTKLASS 381 (547) Q Consensus 305 lGq-tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~--~~~ed~~~~a~~ 381 (547) --. .+.+.+ ++.. +........-+...++.|+..||+.|+. +... +|+|+. ....|.+.+++. 
T Consensus 277 gVPp~~lg~~---~~~~-~~~~~f~~~~l~P~~~~ie~~l~~kll~--------~~~~--~~~f~~~~ll~~d~~~~~~~ 342 (403) T protein:vir:80 277 GVPAFLLGVG---KYDK-DEYNNFINSTILPIAKGIEQELTRKLLI--------SPDL--YFKFNPRSLYAYDLKELAEV 342 (403) T ss_pred CCCHHHcCCC---CccH-HHHHHHHHHHHHHHHHHHHHHHHHhccC--------CCCc--EEEeechhhhccCHHHHHHH Confidence 211 222211 2221 1223455566778888888888876642 2222 455543 345689999999 Q ss_pred HHHHHhCCCCCCHHHHHHHhCCCCCCCcchhccccccccccccccc-ccccccCCccccccccccccchh Q lcl|NC_020866. 382 LDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPG-MDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 382 ~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) +.++++.|+ ++.+++|+.+|+|+-+.+++.+.... -.+....+. .....+... ...+..+ T Consensus 343 ~~~~~~~Gi-~t~NE~R~~~gl~p~~ggd~~~~~~n-~~pl~~~~~~~~~k~ge~~-------~~~~~~~ 403 (403) T protein:vir:80 343 GSNMYVRGL-MEGNEVRDWLGLSPKEGLSELVILEN-YIPLDKIGDQNKLKGGEKG-------GADGQTD 403 (403) T ss_pred HHHHHhCCC-cCHHHHHHHhCCCCCCCCCeEeeccc-ccchhhccchhhccCCCCC-------CCCCCCC Confidence 999999998 99999999999986555555443221 111100000 000000000 0000000 No 74 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=99.77 E-value=2.9e-18 Score=116.71 Aligned_cols=376 Identities=11% Similarity=0.034 Sum_probs=222.1 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |- |.+...+.-. .+.++ ........ .+ .+...+.. |... 
......+-+.|.+ T Consensus 1 M~----~f~~~~~~~~-----~~~~~--~~~~~~~~--------~~-~~~~~~~~---~~~v-----~~~~~~~~~~v~~ 52 (386) T protein:vir:48 1 MP----IFNITNLATE-----SPPIS--QGGFFDIT--------DP-DFLSTLNG---SEWV-----SAESALRNSDLFS 52 (386) T ss_pred Cc----cccccccccc-----ccccc--cccccccc--------cc-hhcccccC---Ccee-----chhhhhcchHHHH Confidence 66 5444322111 11111 00000000 00 11111111 1110 0111125788999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeec Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRD 159 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~ 159 (547) |+..+...+.++++++..... -......-..+...++++.++ +.+.+|-+++++++...+ .+..|.+++ T Consensus 53 ~i~~ia~~ia~~p~~~~~~~~-------~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g---~~~~L~~l~ 122 (386) T protein:vir:48 53 IINQLSNDLATVKLTASRKQL-------QGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENG---RDMKWEYLR 122 (386) T ss_pred HHHHHHHhhccCceeeccchh-------HHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCC---cEEEEEEec Confidence 999999999999999873211 111111112235677777777 567899999999875433 467899999 Q ss_pred cceeEecccCCc--ceee-cc-CC--CCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_020866. 160 PRHFRFQKSDLA--TPLL-LD-DN--GAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYG 233 (547) Q Consensus 160 ~~~~~~~~~~~~--~~~~-~~-~~--~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G 233 (547) +.++++..+... ..+. .. +. 
+....+++..+|++++....+..||.|.+..+....-.-....++...+.+.-| T Consensus 123 ~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~ 202 (386) T protein:vir:48 123 PSQVSFNRLDNKDGIYYNITFDDPRIPPKQHVPQGDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNAL 202 (386) T ss_pred CceeEEEEcCCCceEEEEEEecCccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 999886554322 2221 11 11 233568888888777777777789999999999888888888888999999999 Q ss_pred CCeEEEeCCCCCCHHHHHHHHHHHHHHhcCe--eEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-chhc Q lcl|NC_020866. 234 QPLRIGKWQPGASEQDKDTLFRAVANIAGDC--AAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-QTTT 310 (547) Q Consensus 234 ~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~--~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-qtlt 310 (547) .|--+.+++...+++++.++.+.......++ .+|++.|++++-+..+. ....|.+..++..++|+++.-- ..+. T Consensus 203 ~~~~ii~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~---~d~q~~e~~~~~~~~Ia~~fgVPp~~l 279 (386) T protein:vir:48 203 NANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVLDDLEEFTPLEIKS---NVSQLLKQADWTTGQFAKVYGIPENVV 279 (386) T ss_pred CcceEEEeCCCCCHHHHHHHHHHHHHhhcCCCCceecCCCceEEEcCCCh---hHHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 9999999999999999888888777766654 46778887766553221 1224777778888899988311 2232 Q ss_pred cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCC Q lcl|NC_020866. 311 TDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGM 390 (547) Q Consensus 311 s~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~ 390 (547) .+..+++. ..+........-+.--++.|+..||+.|++.+ .++. 
......|...++..+.+|+..|+ T Consensus 280 g~~~~~~~-~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~-~~~~-----------~~~~~~d~~~~~~~~~~l~~~g~ 346 (386) T protein:vir:48 280 GGQGDQQS-SLEMSLDLYNKAVSRYLRPFLSELSQKLSCDV-DADI-----------LPAVDPTGSNSVSRINSMVKSGT 346 (386) T ss_pred CCCCCccc-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcchh-hcch-----------hhhhccChHHHHHHHHHHHhCCC Confidence 22212222 22333445556677788889999998876532 1111 11122455667788889999998 Q ss_pred CCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCcccc Q lcl|NC_020866. 391 EIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAE 439 (547) Q Consensus 391 ~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (547) .+.+++|+.+|.+.-.++|.-...... .++...+ .....+ T Consensus 347 -~t~nE~r~~lg~~~~~~~~~~~~~~~~--~~~~~gG------d~~~~~ 386 (386) T protein:vir:48 347 -LAQNQGLYILQQAEILPKELPEGENPN--KTTLKGG------EINGED 386 (386) T ss_pred -cCHHHHHHHhhcCCCCCccchhhcCCC--CCccCCC------CCCCCC Confidence 999999999997643333311110000 0000000 000000 No 75 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=99.77 E-value=6.2e-18 Score=114.92 Aligned_cols=374 Identities=12% Similarity=0.060 Sum_probs=215.7 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |. -.|.+...+.-+....+.... .+. .+ +...+...+.... |.... 
.....+-+.|.+ T Consensus 1 m~--m~~f~~~~~~~~~~~~~~~~~---------~~~----~~-~~~~~~~~~~~~~-~~~v~-----~~~al~~~~v~~ 58 (392) T protein:vir:39 1 MI--LPILNFINQTNDPPEVGSVQS---------YFP----DG-NDAQIMESLLGDN-NEWVS-----ARAALRNSDLFS 58 (392) T ss_pred Cc--chhhhhhhccccccccccccc---------ccc----cC-chhhhhhhhcCCC-Cceec-----hHHhhccHHHHH Confidence 33 014444333221111111110 000 00 1111111222111 11110 011115688999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) |+..+-..|.++++++..... ..+++++ ...++++.+. +.+.+|.+++++++...+ .+..| T Consensus 59 ~i~~ia~~ia~lp~~~~~~~~-----------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g---~~~~L 124 (392) T protein:vir:39 59 IILQLSSDLAIVKINAEKKKN-----------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANG---ADMKW 124 (392) T ss_pred HHHHHHHhhccCceeeccchh-----------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCC---cEEEE Confidence 999999999999999873221 1234443 4567777777 577899999999875433 36789 Q ss_pred EeeccceeEeccc--CCccee-eccCCC---CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 156 IWRDPRHFRFQKS--DLATPL-LLDDNG---AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFT 229 (547) Q Consensus 156 ~~~~~~~~~~~~~--~~~~~~-~~~~~~---~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 229 (547) .++++.++.+..+ ++...+ .....+ .-..+++...|++++...++..||.|.+..+....-.-....++...+. 
T Consensus 125 ~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f 204 (392) T protein:vir:39 125 EYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSL 204 (392) T ss_pred EEEcCceeEEEEcCCCceEEEEEEecCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999998765543 222222 222211 1246788888777777777778999999999999988899999999999 Q ss_pred HHcCCCeEEEeCCCC--CCHHHHHHHHHHHHHHhcC-eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 230 QTYGQPLRIGKWQPG--ASEQDKDTLFRAVANIAGD-CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG 306 (547) Q Consensus 230 E~~G~P~~vgk~~~~--~~~~~~~~l~~al~~~~~~-~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG 306 (547) ...+.|--+.+++.. .++++++.+.+......+. ..+|+|.|++++-+..+. ....|.+..++..++|+++.-- T Consensus 205 ~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~---~d~~~~e~~~~~~~~Ia~~fgV 281 (392) T protein:vir:39 205 NSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKS---NVAQLLSQTDWTSKQYAKVYGL 281 (392) T ss_pred hccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCCh---hHHHHHHHHHHHHHHHHHHhCC Confidence 999999888888654 3445566666655544433 458899998888765332 2224777788888999988421 Q ss_pred -chhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHH Q lcl|NC_020866. 307 -QTTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTL 385 (547) Q Consensus 307 -qtlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l 385 (547) ..+.++. 
+.+.+..+........-++..++.+++.+|+.|++.+ .++ ....-..|.+.+++.+.++ T Consensus 282 pp~~lg~~-~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~L~~~~-~~d-----------~~~~~~~d~~~~~~~~~~l 348 (392) T protein:vir:39 282 PDSYIGGQ-GDQQSSIQQISGMYASALNRYLRPAISELEYKLSDHI-SVN-----------MRPAIDPLGDNYLSTISTA 348 (392) T ss_pred CHHHhCCC-CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-ccc-----------chhhhccCHHHHHHHHHHH Confidence 2222221 1122222333445566778888889999988775432 111 1112234667888899999 Q ss_pred HhCCCCCCHHHHHHHh---CCCCCCCcchhcccccccccccccccccccccCCcccc Q lcl|NC_020866. 386 VRLGMEIEENEVRSRF---GFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAE 439 (547) Q Consensus 386 ~~~G~~v~~~~~~e~~---Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (547) .+.|+ .+.+++|+.+ |+. |.+--..... .+ .+.+.+. .+.+ T Consensus 349 ~~~g~-~t~nE~r~~l~~~g~~-p~e~r~~e~l----~~------~~~Gd~~-~p~p 392 (392) T protein:vir:39 349 TRWGA-LAENQATFVLQEAGYI-PKDLPAPENT----NK------KTTGQSN-EPVP 392 (392) T ss_pred HhCCC-cCHHHHHHHHHhcCCC-ccccchhcCC----CC------CCCCCCC-CCCC Confidence 99997 8888888877 664 3211110000 00 0000000 0000 No 76 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=99.77 E-value=6.2e-18 Score=114.92 Aligned_cols=374 Identities=12% Similarity=0.060 Sum_probs=215.7 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |. -.|.+...+.-+....+.... .+. .+ +...+...+.... |.... 
.....+-+.|.+ T Consensus 1 m~--m~~f~~~~~~~~~~~~~~~~~---------~~~----~~-~~~~~~~~~~~~~-~~~v~-----~~~al~~~~v~~ 58 (392) T protein:vir:10 1 MI--LPILNFINQTNDPPEVGSVQS---------YFP----DG-NDAQIMESLLGDN-NEWVS-----ARAALRNSDLFS 58 (392) T ss_pred Cc--chhhhhhhccccccccccccc---------ccc----cC-chhhhhhhhcCCC-Cceec-----hHHhhccHHHHH Confidence 33 014444333221111111110 000 00 1111111222111 11110 011115688999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) |+..+-..|.++++++..... ..+++++ ...++++.+. +.+.+|.+++++++...+ .+..| T Consensus 59 ~i~~ia~~ia~lp~~~~~~~~-----------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g---~~~~L 124 (392) T protein:vir:10 59 IILQLSSDLAIVKINAEKKKN-----------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANG---ADMKW 124 (392) T ss_pred HHHHHHHhhccCceeeccchh-----------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCC---cEEEE Confidence 999999999999999873221 1234443 4567777777 577899999999875433 36789 Q ss_pred EeeccceeEeccc--CCccee-eccCCC---CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 156 IWRDPRHFRFQKS--DLATPL-LLDDNG---AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFT 229 (547) Q Consensus 156 ~~~~~~~~~~~~~--~~~~~~-~~~~~~---~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 229 (547) .++++.++.+..+ ++...+ .....+ .-..+++...|++++...++..||.|.+..+....-.-....++...+. 
T Consensus 125 ~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f 204 (392) T protein:vir:10 125 EYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSL 204 (392) T ss_pred EEEcCceeEEEEcCCCceEEEEEEecCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999998765543 222222 222211 1246788888777777777778999999999999988899999999999 Q ss_pred HHcCCCeEEEeCCCC--CCHHHHHHHHHHHHHHhcC-eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 230 QTYGQPLRIGKWQPG--ASEQDKDTLFRAVANIAGD-CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG 306 (547) Q Consensus 230 E~~G~P~~vgk~~~~--~~~~~~~~l~~al~~~~~~-~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG 306 (547) ...+.|--+.+++.. .++++++.+.+......+. ..+|+|.|++++-+..+. ....|.+..++..++|+++.-- T Consensus 205 ~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~---~d~~~~e~~~~~~~~Ia~~fgV 281 (392) T protein:vir:10 205 NSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKS---NVAQLLSQTDWTSKQYAKVYGL 281 (392) T ss_pred hccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCCh---hHHHHHHHHHHHHHHHHHHhCC Confidence 999999888888654 3445566666655544433 458899998888765332 2224777788888999988421 Q ss_pred -chhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHH Q lcl|NC_020866. 307 -QTTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTL 385 (547) Q Consensus 307 -qtlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l 385 (547) ..+.++. 
+.+.+..+........-++..++.+++.+|+.|++.+ .++ ....-..|.+.+++.+.++ T Consensus 282 pp~~lg~~-~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~L~~~~-~~d-----------~~~~~~~d~~~~~~~~~~l 348 (392) T protein:vir:10 282 PDSYIGGQ-GDQQSSIQQISGMYASALNRYLRPAISELEYKLSDHI-SVN-----------MRPAIDPLGDNYLSTISTA 348 (392) T ss_pred CHHHhCCC-CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-ccc-----------chhhhccCHHHHHHHHHHH Confidence 2222221 1122222333445566778888889999988775432 111 1112234667888899999 Q ss_pred HhCCCCCCHHHHHHHh---CCCCCCCcchhcccccccccccccccccccccCCcccc Q lcl|NC_020866. 386 VRLGMEIEENEVRSRF---GFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAE 439 (547) Q Consensus 386 ~~~G~~v~~~~~~e~~---Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (547) .+.|+ .+.+++|+.+ |+. |.+--..... .+ .+.+.+. .+.+ T Consensus 349 ~~~g~-~t~nE~r~~l~~~g~~-p~e~r~~e~l----~~------~~~Gd~~-~p~p 392 (392) T protein:vir:10 349 TRWGA-LAENQATFVLQEAGYI-PKDLPAPENT----NK------KTTGQSN-EPVP 392 (392) T ss_pred HhCCC-cCHHHHHHHHHhcCCC-ccccchhcCC----CC------CCCCCCC-CCCC Confidence 99997 8888888877 664 3211110000 00 0000000 0000 No 77 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=99.77 E-value=2.3e-17 Score=111.85 Aligned_cols=471 Identities=11% Similarity=0.072 Sum_probs=230.2 Q ss_pred CCCCC----eeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhH Q lcl|NC_020866. 1 MAQKP----VLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma~~~----~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~ 76 (547) |+... +.++.+-..++.+.-+++.|.. .++.-.+.+.+...-.|.. ...+ .++.. +...+. 
.-+ T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~--~p~~~~~~~~~~~~~~p~~-~~~~-----~~~~~---~l~~~~-~np 94 (576) T protein:vir:96 27 IDDGLQANIRNIEEKSKELNKSLYGKQQAYA--EPFLEVMDTNPEFRTKRSY-MKNS-----DNLHD---VLKQFG-NNP 94 (576) T ss_pred cccChhHHHHHhhhhhhhhccccCCccchhh--cceeeeeecCCCccccCcc-hhhh-----hhhHH---HHHHhh-cCH Confidence 22100 0122222233333323333321 1100001111111101100 0000 01111 122222 247 Q ss_pred HHHHHHHHHHHHHhc-----------CCceEecCCC--CHHHHHH------HHHHHHHHhC-----CCHHHHHHHHH-Hh Q lcl|NC_020866. 77 HYLGVIGTRKRSVSQ-----------LDITVEPASD--DPQDVKI------ADMVRDWLKR-----EELQQEVFHIL-DC 131 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~-----------~~w~v~p~~~--~~~~~~~------ae~v~~~l~~-----~~~~~~i~~~l-~a 131 (547) .|.+|++.+...|.. ..|.|..... .+..... ..++..++.. ..|.++++.++ +. T Consensus 95 iv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dl 174 (576) T protein:vir:96 95 ILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDT 174 (576) T ss_pred HHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHH Confidence 788999999888775 4666655322 2111111 2233333322 24677888877 57 Q ss_pred hhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCccee-------eccCCCCceeeccCceEEEeecCCcC---c Q lcl|NC_020866. 132 ISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATPL-------LLDDNGAEVPLPAFRFIFADIPAKSG---I 201 (547) Q Consensus 132 ~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~~-------~~~~~~~g~~l~~~k~i~~~~~~~~~---~ 201 (547) +.+|.+.+|++|.++++ -.+..|.+++|.++++..+....++ ...++.....+++...|+|++....+ . 
T Consensus 175 ll~Gna~~~i~~~rd~~-g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~dii~~~~~~~~d~~~~ 253 (576) T protein:vir:96 175 YTYDQVNFEKVFNKKNA-TTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREMAMGIRNPRTELSSS 253 (576) T ss_pred HhcCCeEEEEEEecCCC-CceEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccceEEEeecCCCCcccC Confidence 88999999999877543 2366899999998887655443322 22233344567888888888776554 6 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCC--CCCHHHHHHHHHHHHHHhc---Ce---eEEecCCce Q lcl|NC_020866. 202 LLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQP--GASEQDKDTLFRAVANIAG---DC---AAIIPETMS 273 (547) Q Consensus 202 p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~--~~~~~~~~~l~~al~~~~~---~~---~~vip~~~~ 273 (547) .||.+.+..+....-.-....++...|....+.|--|-+++. ..++++++++.+.+.+.-+ ++ .+|++.|++ T Consensus 254 ~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~ 333 (576) T protein:vir:96 254 GYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIK 333 (576) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCce Confidence 789999999999999899999999999999999976666543 4578888888888887532 22 367888887 Q ss_pred EEEEecCCCCCchHHHHHHHHHHHHHHHHHH------hcchh----ccCcccc--ccchhH-HHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 274 IDFVESKSIGSSVDLYEKRINHLDQQISKAV------LGQTT----TTDAVAG--GHAVSQ-EHRLVQEDIETSDAMALA 340 (547) Q Consensus 274 ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i------lGqtl----ts~~~gg--s~a~~~-vh~~v~~~~~~ad~~~i~ 340 (547) ++=+..+. ....|.+..++..++|+++. 
+|.+- ++...+| +++-.+ ........-+...++.|+ T Consensus 334 ~~~ls~~~---~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie 410 (576) T protein:vir:96 334 FVNMTPTA---NDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIE 410 (576) T ss_pred EEeccCCh---hhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 76654322 22247888889999999884 33221 1111122 333222 333445567888899999 Q ss_pred HHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHH--HhCCCCCCHHHHHHHhCCCCCCCcchhcccccc Q lcl|NC_020866. 341 AILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTL--VRLGMEIEENEVRSRFGFSQPKPGAKLLRPLEN 418 (547) Q Consensus 341 ~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l--~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~ 418 (547) ..||+.|++. |+. . -+|+|.. .|.+..++....+ ...|+ ++.+++|+.+|+|+-+.++..+.+... T Consensus 411 ~~ln~~Ll~~-----~~~--~-~~~~f~r---~d~~~~~e~~~~~~~~~~G~-lT~NE~R~~~gl~piegGD~~~~~~~~ 478 (576) T protein:vir:96 411 DLINTHIISE-----YSD--K-YVFQFVG---GDTKSELDKIKILQEEVKTY-KTVNEARKEKGLKPIEGGDVLLDGSFI 478 (576) T ss_pred HHHHhhhchh-----ccC--c-eEEEecc---CCHHHHHHHHHHHHHHhcCc-cCHHHHHHHhCCCCCCCcceecccccc Confidence 9999988754 221 1 2455543 4555555555433 44576 899999999999876656654432211 Q ss_pred ccccccc--ccccccccCC----------ccccccccccccchhhhhhccc----chhhhcccccccchhHHHHHHHHHH Q lcl|NC_020866. 419 PATQAAP--PGMDPGAGDL----------KTAERKLKPLSGQADLSRGDLS----LNSEMPSTGQKTGGAEIAALTDQLE 482 (547) Q Consensus 419 ~~~~~~~--~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~l~~~~~ 482 (547) ....... .......... ...+..+.+.+........... ........++....+-.-. 
T Consensus 479 ~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 552 (576) T protein:vir:96 479 QSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGRESNDPTKIDSPVGTDGQLKDQDNVKS------ 552 (576) T ss_pred ccccccccCCCCCCccccccccccccccCCCCCCCCCCCCCCCcccccccccCCCCCCccccccccCCCCcccc------ Confidence 1100000 0000000000 0000000000000000000000 0000000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHH-ccCCHHHHHHH Q lcl|NC_020866. 483 LEGQPEIVAMIEEIRTMLE-AASSLGEFSDM 512 (547) Q Consensus 483 ~~~~~~~~~~~~~i~~~l~-~a~s~ee~~~~ 512 (547) .........+.- .-.-+.++... T Consensus 553 -------~~~~~~~~~~~~~~~~~~~~~~~~ 576 (576) T protein:vir:96 553 -------QEGSNKGQGTKGKGNEKPSDFKNN 576 (576) T ss_pred -------cccccccccccccCCCCcccccCC Confidence 000000000000 00001111111 No 78 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=99.77 E-value=2e-16 Score=106.71 Aligned_cols=452 Identities=12% Similarity=0.032 Sum_probs=221.8 Q ss_pred CCCCCeeeCCCCCcc-----------cchhhhhhhhhhhhhhhHHHHhhh----hccCCCH------------HHHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPV-----------QRAQLTEEVAAPTVTGVRSPLSGY----PGDGLNP------------LRLAQIL 53 (547) Q Consensus 1 ma~~~~~~~~~g~~~-----------~~~~l~~~~a~~~~~~~~~~~~~~----~~~~l~p------------~~~~~il 53 (547) || |+.-.-+.+ ..+.-.+.+-...+...|.-.+.. ....-+| ..+..++ T Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~ 76 (535) T protein:vir:10 1 MA----ILKDLRNAFSLSNKKSTSYIELGDYDKDIVNKAIRPGRASARDTVDGIDIADGNVAGQYSVASISDVLSTKKLL 76 (535) T ss_pred Ch----hhHHHHHHHHhhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhhccccccccccCCcccccccCccccccCHHHHH Confidence 77 443222221 111111222111111111111111 0100111 1233444 Q ss_pred HhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCC--CHHHHHHHHHHHHHHh-CCC--------HH Q lcl|NC_020866. 
54 RAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASD--DPQDVKIADMVRDWLK-REE--------LQ 122 (547) Q Consensus 54 r~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~--~~~~~~~ae~v~~~l~-~~~--------~~ 122 (547) +.+......+. ..+.+ .++..+.+|+......+.++++.+.-.+. +...++-...+..+|. +++ |. T Consensus 77 ~~~~~~~~~~~--~i~t~-~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~ 153 (535) T protein:vir:10 77 KAYADNDIVQA--IIRTR-TNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFP 153 (535) T ss_pred HHhccChhHHH--HHHHH-HHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHH Confidence 43332111111 11111 23455556666655566666666653332 2222222223444453 221 44 Q ss_pred HHHHHHH-Hhhhhc-CeeeeeEEeecCCeeeEEeeEeeccceeEecccCC-----cceeeccCCCCceeeccCceEEEee Q lcl|NC_020866. 123 QEVFHIL-DCISKG-YSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDL-----ATPLLLDDNGAEVPLPAFRFIFADI 195 (547) Q Consensus 123 ~~i~~~l-~a~~~G-~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~-----~~~~~~~~~~~g~~l~~~k~i~~~~ 195 (547) .++..++ +.+.+| .++.++++.. .| .|..|.+++|.++++..+.. ..++...+++.+..+++..+|++++ T Consensus 154 ~~~~~lv~d~l~~~g~ay~~i~r~~-~G--~~~~L~~l~p~~V~v~~d~~~~~~~~~~~~~~~~~~~~~~~~~eiih~~~ 230 (535) T protein:vir:10 154 RLLTKIINDMYVQDQINIERIFKND-SN--ELDHFNAVDASKVVISYSPRSKDQPRKFEQFVSETKSVKFSERNLTFINY 230 (535) T ss_pred HHHHHHHHHHHhhCCceEEEEEECC-CC--cEEEEEEeCCceeEEEEcCccccCceEEEEEecCceeEEECcccEEEEec Confidence 5777666 345555 5566665433 33 36789999999887654321 2333344555667899999888876 Q ss_pred cCCc---CccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCC----CCCHHHHHHHHHHHHHHhcC----e Q lcl|NC_020866. 196 PAKS---GILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQP----GASEQDKDTLFRAVANIAGD----C 264 (547) Q Consensus 196 ~~~~---~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~----~~~~~~~~~l~~al~~~~~~----~ 264 (547) .... +.+||.+.+..+....-.-....++...|...-+.|--|-+++. 
..++++++++.+.+.+..++ + T Consensus 231 ~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag 310 (535) T protein:vir:10 231 WNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAW 310 (535) T ss_pred cCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccc Confidence 5543 35789999999999988888999999999999999977766653 36778888888888775432 2 Q ss_pred eEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-chhccCccccccch------------hHHHHH-HHHH Q lcl|NC_020866. 265 AAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-QTTTTDAVAGGHAV------------SQEHRL-VQED 330 (547) Q Consensus 265 ~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-qtlts~~~ggs~a~------------~~vh~~-v~~~ 330 (547) ...+..+..+++...+...... .|.+..++..++|+++.-- ..+.+..+.++|+- .+.... .... T Consensus 311 ~~~vl~~~g~~~~~l~~~~~D~-qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~ 389 (535) T protein:vir:10 311 KIPILAAKDAKFVNMTQNSRDM-EFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDK 389 (535) T ss_pred ccccccCCCceEEecCCChhHH-HHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHH Confidence 2223333344554444322222 3777778888999988421 12222222222221 122222 2234 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcc Q lcl|NC_020866. 331 IETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGA 410 (547) Q Consensus 331 ~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e 410 (547) -+...++.++..||+.|++.. +. .-+|.|+.....|.+..++..+.. ..|. 
++.+++|+.+|+|+-+.|+ T Consensus 390 ~L~P~l~~ie~~ln~~Ll~~~-----~~---~~~f~f~~l~~~d~~~r~~~~~~~-~~g~-lT~NE~R~~~gl~piegGD 459 (535) T protein:vir:10 390 GLTPLLSFIEQVINDKIMRYV-----DT---DYRFSFTLGDAQDKLQEEQVWKLK-LANG-YFINEYRKDHGLKTVDGLD 459 (535) T ss_pred HHHHHHHHHHHHHhhhccccc-----CC---eEEEEeccccccCHHHHHHHHHHH-HcCC-CCHHHHHHHhCCCCCCCcc Confidence 577888899999998886432 21 237888888888888877776644 4565 7999999999998766555 Q ss_pred hhcccc--c------ccccccccccccccccC-Cccc--cccccc--cccchhhhhhcccchhhhcccccccchhHHHH Q lcl|NC_020866. 411 KLLRPL--E------NPATQAAPPGMDPGAGD-LKTA--ERKLKP--LSGQADLSRGDLSLNSEMPSTGQKTGGAEIAA 476 (547) Q Consensus 411 ~~~~~~--~------~~~~~~~~~~~~~~~~~-~~~~--~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (547) ..+... . ....+..+....+.... .+.. +..... .....+.+...... -.+..+...++..+- T Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~---~~~~~~~~~~~~~~~ 535 (535) T protein:vir:10 460 VPGFIGSAENFINATGFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPLPK---PSESDDVSNNEDADT 535 (535) T ss_pred ccccccchhhcccccccccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCCCc---CCCCCccccccccCC Confidence 322110 0 00000000000000000 0000 000000 00000000000000 000011111111111 No 79 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=99.76 E-value=4.1e-17 Score=110.43 Aligned_cols=376 Identities=10% Similarity=0.037 Sum_probs=214.6 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhcc---CCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGD---GLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPH 77 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~---~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~ 77 (547) |- |+|.+-+.+.+..-.+. .....+...+.. .+++. . .+ +-+. 
T Consensus 1 MG----l~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~vt~~---~---------------al-----~~~~ 46 (394) T protein:vir:62 1 MG----LRDRFSNYLFKKAEKRG-------YLDNVLGKSIRYSGVYVTDS---N---------------IL-----QSSD 46 (394) T ss_pred Cc----hhhhhhhhccCCCCchh-------hhhhhhhcccccCccccChh---h---------------hh-----ccHH Confidence 87 66665433221110000 111111111110 01110 1 11 3577 Q ss_pred HHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeE Q lcl|NC_020866. 78 YLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFP 152 (547) Q Consensus 78 v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~ 152 (547) |.+|++.+...|.+++|+|...++.... -..+..++.++ ...++++.+. +.+.+|.+.+.+. + .....+ T Consensus 47 v~~~i~~Ia~~iA~lp~~v~~~~g~~~~---~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~--~-~~~~~~ 120 (394) T protein:vir:62 47 VYELLQDISNQMVLADIVVEDEFGNEIK---DDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILN--G-AQIHLA 120 (394) T ss_pred HHHHHHHHHHhhcccceEEEcCCCcccc---hhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEe--c-ceeecc Confidence 9999999999999999999865443221 12234555554 3555665555 5677999888763 1 111111 Q ss_pred EeeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020866. 153 DKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTY 232 (547) Q Consensus 153 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~ 232 (547) . +-.+..+ +...+.. . ..+..++++.+|+.++.. 
.+..+|.|.+..+....-.-....++...+...- T Consensus 121 ~------~~~~~~~--~~~~~~~-~--~~~~~~~~~eiih~r~~~-~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 188 (394) T protein:vir:62 121 S------NVFTELD--DNLVEHF-N--IGGHEIPPCMIRHVKNIG-ADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKG 188 (394) T ss_pred c------cceEEEC--CceEEEE-e--eCCEEechhheEEecCcC-CCCccccChHHHHHHHHHHHHHHHHHHHHHHHcc Confidence 1 1111222 2222222 1 235678888877666655 4457999999999988877888888888999998 Q ss_pred CCCeEEEeCCCCCC--HHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHh Q lcl|NC_020866. 233 GQPLRIGKWQPGAS--EQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVL 305 (547) Q Consensus 233 G~P~~vgk~~~~~~--~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~il 305 (547) |.|--+.+++...+ ++.++++.+++.+.-+ +..+|+|.|.+++++..+..... ..|.+..++..++|+++.- T Consensus 189 ~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d-~q~~e~~~~~~~~Ia~~fg 267 (394) T protein:vir:62 189 GLLTFLLNLDAHINPQNGAQSKLINAILDQLESIDEARSVKMIPLGKGYSIDTLKSPLDD-EKTLAYLNVYKKDLGKFLG 267 (394) T ss_pred CCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccccccCceeEeeCCCceeEEecCCCcch-HHHHHHHHHHHHHHHHHhC Confidence 99988888865543 4456666666655432 34578999998888765432222 2366677788899998842 Q ss_pred c-chhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHH Q lcl|NC_020866. 306 G-QTTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDT 384 (547) Q Consensus 306 G-qtlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~ 384 (547) - ..+..+.++ |. 
..+........-+...++.|+..||+.|+.+- ....-+|+|+...-.+....++.+.+ T Consensus 268 VPp~~lg~~~~-sn-~e~~~~~~~~~~l~P~~~~ie~~l~~kll~~~-------~~~~~~~~fd~~~~~~~~~~~~~~~~ 338 (394) T protein:vir:62 268 INVDTYTELIK-ED-IEKAMMYIHNKAVRPIMKNFEDHLSLLFYAQN-------SGKRIKFKINILDFVTYSNKTNIGYN 338 (394) T ss_pred CCHHHcCCCCC-cC-HHHHHHHHHHHHHHHHHHHHHHHHhhhhcCcc-------ccCceEEEechhhhcCHHHHHHHHHH Confidence 1 223322222 22 22333444566788888999999987764431 11224678887777777888999999 Q ss_pred HHhCCCCCCHHHHHHHhCCCCC--CCcchhcccccccccccccccccccccCCccccccccccccchh Q lcl|NC_020866. 385 LVRLGMEIEENEVRSRFGFSQP--KPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 385 l~~~G~~v~~~~~~e~~Glp~p--~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) +++.|+ ++.+++|+.+|+|+- +.++.+.... +-.+...... ...+.+.... ..+ T Consensus 339 ~~~~g~-~T~NE~R~~~gl~p~~~~~gd~~~~~~-n~~~~~~~~~--------~~~~~kgge~--~en 394 (394) T protein:vir:62 339 LVRTAI-TSPDNVADMLGFPKQNTKESQAIYISN-DVTEIGKKEA--------TDGSLGGGEE--NEN 394 (394) T ss_pred HHhCCC-cCHHHHHHHhCCCCCCCCCCCeeeccc-cccccccccc--------ccccCCCCCC--CCC Confidence 999997 999999999999875 2333333221 1111100000 0000000000 000 No 80 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=99.75 E-value=2.6e-18 Score=117.02 Aligned_cols=376 Identities=15% Similarity=0.071 Sum_probs=219.7 Q ss_pred CC---------CCCee------eCC-CCCc--ccchhhhhhhhhhhhhhhH-HHHhhhhccCCCHHHHHHHHHhhcCCCH Q lcl|NC_020866. 1 MA---------QKPVL------LDR-WGNP--VQRAQLTEEVAAPTVTGVR-SPLSGYPGDGLNPLRLAQILRAADHGDP 61 (547) Q Consensus 1 ma---------~~~~~------~~~-~g~~--~~~~~l~~~~a~~~~~~~~-~~~~~~~~~~l~p~~~~~ilr~a~~g~~ 61 (547) |. +.|.. +|- .|+| ++.+..+++.+.......+ ..+.+++. .|. ....... 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~------~~~----~~~~~~~ 70 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPE------SWA----TPSWGSA 70 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccc------ccc----ccCcccc Confidence 33 22222 111 1111 1111122222111111100 01111211 010 0111111 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-CC----CHHHHHHHHHHhhhhcC Q lcl|NC_020866. 62 VRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-RE----ELQQEVFHILDCISKGY 136 (547) Q Consensus 62 ~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l~a~~~G~ 136 (547) .... + .+-+.|.+|++.+...|.++++.+.-.+. ..+ . +...++ ++ .+.++++.++..+..|- T Consensus 71 t~~~-----~-~~~~~v~acV~~Ia~~iA~lpl~~~~~~~-~~~-~----~~~ll~~~PN~~~t~~~f~~~l~~~lllGn 138 (409) T protein:vir:83 71 QDKL-----R-TLIDVAWACIDLNASVLSSMPIYRMRNGR-IID-S----VAWMSNPDPEVYTSWQEFAKQLFWDFQLGE 138 (409) T ss_pred chhh-----H-hhhHHHHHHHHHHHHhhccCceEEeeCCc-ccc-c----hhhhcccCCCCCCCHHHHHHHHHHHHhhCC Confidence 1111 1 14578999999999999999998874322 111 1 112222 32 46666666665555688 Q ss_pred eeeeeEEeecCCeeeEEeeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHH Q lcl|NC_020866. 137 SFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWM 216 (547) Q Consensus 137 sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~ 216 (547) +.++++-...+| .|..|.+++|..+++..+....+.++-+... 
.++..|++++....+..||.|.+..+....- T Consensus 139 ay~~~i~r~~~G--~~~~L~pl~p~~v~v~~~~~g~~~y~~~~~~----~~~eiiHir~~~~~~~~~G~spi~~~~~~i~ 212 (409) T protein:vir:83 139 AFVLPMAHGSDG--YPIRFRVVPPWLVNVELKKGARREYRIGGLN----VTDEILHIRYQGNTADAHGHGPLESAAPRQV 212 (409) T ss_pred cEEEEEEECCCC--cEEEEEEECCcceEEEEcCCceEEEEEcccc----CccceEEeCCCCCCCCcccccHHHHHHHHHH Confidence 877765322233 4678999999987765554444333222211 1234444456556677899999999999998 Q ss_pred HHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcC---eeEEecCCceEE-EEecCCCCCchHHHHHH Q lcl|NC_020866. 217 FKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGD---CAAIIPETMSID-FVESKSIGSSVDLYEKR 292 (547) Q Consensus 217 ~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~---~~~vip~~~~ie-~~e~~~~~~~~~~f~~l 292 (547) ......++...|...-+.|--+-+++...++++++++.+.+....++ ..+++..|+++. .+..+ ...-.|.+. T Consensus 213 ~~~a~~~~~~~~f~nga~p~gil~~~~~ls~e~~~~~~~~~~~~~~~nag~~~il~~g~~~~~~~~~s---~~d~q~le~ 289 (409) T protein:vir:83 213 VIGLLQKYVQNLAETGGVPLYWLGVERRLSETEAVDLMDRWIESRSKYAGHPALVTGGATLNQAKSMS---AQDLSLMEL 289 (409) T ss_pred HHHHHHHHHHHHHhcCCCcceEeecCCCCCHHHHHHHHHHHHHhhCCccCccceecCCcccccccCCC---HHHHHHHHH Confidence 88888888888988889999999999888999999988888766543 236777887763 33322 111136666 Q ss_pred HHHHHHHHHHHHhc-chhcc---Cccccccch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEe Q lcl|NC_020866. 293 INHLDQQISKAVLG-QTTTT---DAVAGGHAV-SQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKI 367 (547) Q Consensus 293 i~~~d~~Isk~ilG-qtlts---~~~ggs~a~-~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~ 367 (547) -++..++|+++.-- ..|.. ++...+|+- .+........-+...++.|+..||+.|++. 
..+-+|.+ T Consensus 290 r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~---------~~~~~f~~ 360 (409) T protein:vir:83 290 TQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRWALPS---------PQHLELNR 360 (409) T ss_pred HHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC---------CcEEEeeh Confidence 67888899988422 22322 222223443 333333445577788899999999877532 12234444 Q ss_pred cCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccc Q lcl|NC_020866. 368 ARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQ 422 (547) Q Consensus 368 ~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~ 422 (547) +.....|++++++.++++.+.|+ ++.+++|+..|+|+-+.++++-.. .. T Consensus 361 ~~llr~d~~~r~~~~~~~~~~G~-lT~NE~R~~~glpp~~ggd~l~~~-----gv 409 (409) T protein:vir:83 361 DDYTRPSLVERATAYKIMIEAGV-MEPNEARAMERLHSEAAAVRLSGG-----GV 409 (409) T ss_pred hhhhccCHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCcccCCC-----CC Confidence 44456789999999999999998 999999999999865544433111 00 No 81 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=99.75 E-value=6.9e-17 Score=109.17 Aligned_cols=364 Identities=11% Similarity=0.033 Sum_probs=216.2 Q ss_pred CCCCCeeeCCCCC-cccchhhhhhhhhhhhh-hhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGN-PVQRAQLTEEVAAPTVT-GVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~~~g~-~~~~~~l~~~~a~~~~~-~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |- |.++... +-..+ ....+... 
...-.+.+.....+++ +..+ +-+.| T Consensus 1 Mg----~~~~~~~~k~~~~----~~~~~~~~~~~~~~~~~~~~~~v~~----------------------~~~l-~~~~v 49 (383) T protein:vir:10 1 MG----LLTPKNFSKRNAK----NMVYPSNPAFFTTTVGGMQLSYVSA----------------------LSAL-QNTNV 49 (383) T ss_pred CC----ccccccccccccc----ccccccchhhhhhhccCccccccch----------------------hHhh-cchHH Confidence 77 6665321 11111 11110000 0000000000001111 0011 34679 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEE Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPD 153 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~ 153 (547) .+|++.+...+.+++|+|.-.. ...+|+++ ...++++.++ +.+.+|-++++++- +. . T Consensus 50 ~~~i~~ia~~ia~~~~~~~~~~-----------~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~--~~-----~ 111 (383) T protein:vir:10 50 YSVINRIASDVSSAHFKTENTA-----------TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVG--QN-----L 111 (383) T ss_pred HHHHHHHHHhhccCceeecccc-----------hhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEc--Cc-----e Confidence 9999999999999999996321 12344444 4666666666 45578999998752 11 2 Q ss_pred eeEeeccceeEecccCCcceee--ccCCCCceeeccCceEEEeecCC--cCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 154 KLIWRDPRHFRFQKSDLATPLL--LDDNGAEVPLPAFRFIFADIPAK--SGILLRSGLARVAAWGWMFKAFTARDWAIFT 229 (547) Q Consensus 154 ~l~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~l~~~k~i~~~~~~~--~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 229 (547) .+.+.++-++.+..+....++. ....+....+++...|++++... .+..||.+.+..+....-.-....++...|. 
T Consensus 112 ~~~p~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f 191 (383) T protein:vir:10 112 EHIPNSDVQINYLPGNMGIVYTVLESNDRPKMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAM 191 (383) T ss_pred eEeecCcceEEEEEcCCceEEEEEEcCCceEEEEcccceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 3444555555554444333332 23344557788888877664432 3346899999999999888888999999999 Q ss_pred HHcCCCeEEEeCCCCC-CHHHHHHHHHHHHHHhc----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 230 QTYGQPLRIGKWQPGA-SEQDKDTLFRAVANIAG----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAV 304 (547) Q Consensus 230 E~~G~P~~vgk~~~~~-~~~~~~~l~~al~~~~~----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i 304 (547) ...+.|--+-+++... ++++++.+.+.+.+..+ +..++++.|++++-+..+. ...+.+.++.++..++|+.+. T Consensus 192 ~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~--~d~~~l~e~~~~~~~~Ia~af 269 (383) T protein:vir:10 192 ENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKT--DVFKALADNSAYSADQISKAF 269 (383) T ss_pred hccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCCh--hHHHHHHHHHHHHHHHHHHHh Confidence 9999998888887655 46777888888877643 2468899999888765332 222223567778889999884 Q ss_pred hc-chhcc--CccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHH Q lcl|NC_020866. 
305 LG-QTTTT--DAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASS 381 (547) Q Consensus 305 lG-qtlts--~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~ 381 (547) -- ..+.+ ++++.+++..+.....+..-+...++.|+..||+.|+ ++ ..+|.+......|.+.++++ T Consensus 270 gVPp~~lg~~~~~~~~~sn~eq~~~~~~~~l~P~~~~ie~~l~~~l~--------~~---~~~f~~~~l~~~d~~~~~~~ 338 (383) T protein:vir:10 270 GVPSDILGGGTSTESQHSNIDQIKATYLANLNSYVNPIVDELRLKMN--------AP---DLELDIKDMLDVDDSILINQ 338 (383) T ss_pred CCCHHHcCCccCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhC--------Cc---eEEeechhhhccCHHHHHHH Confidence 21 12222 2222234434444445556777888888888887652 21 12444455556899999999 Q ss_pred HHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCc Q lcl|NC_020866. 382 LDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLK 436 (547) Q Consensus 382 ~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (547) +.++++.|+ ++.+++|+.+|+++-..++... ...+.. +...+ +.. T Consensus 339 ~~~~~~~G~-~t~nE~R~~lg~~p~~~~d~~~-~~~~~~--~~~gG------d~e 383 (383) T protein:vir:10 339 VSNLAKSGV-LGAEQAQFILTRSGFLPDNLPE-FKPLTN--ETKGG------DDK 383 (383) T ss_pred HHHHHhCCC-cCHHHHHHHhCCCcccCCcccc-cCCCcc--cCCCC------CCC Confidence 999999998 9999999999997644443211 111100 00000 000 No 82 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=99.75 E-value=4.8e-16 Score=104.54 Aligned_cols=475 Identities=13% Similarity=0.148 Sum_probs=227.5 Q ss_pred CCCCCeeeCCCCCcc--cchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPV--QRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~~~g~~~--~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |=....-+...+|+- .+..-++.+.. . + +.++...-+++.. +.. 
+.+.-++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~---~-~~~~~~pp~~~~~---------------La~----~~~~n~~v 54 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGDTDSQALKE---D---R-FEEYVEPKVHPLV---------------LLS----LLQVNPYH 54 (540) T ss_pred CCCcccChhhccchhhhhcccccccccc---C---C-CCccccCCCCHHH---------------HHH----HHHhcHHH Confidence 322221133334442 12111111110 0 1 1111111123321 112 22357899 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhC--CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKR--EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~--~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) .+|+..+...|.+++|.|+...... .+++-+ +.+.+++..++ +.+.+|.+++++++...| .+..| T Consensus 55 ~scI~~ia~~ia~~~~~i~~~~~~~---------~~~lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G---~~~~L 122 (540) T protein:vir:41 55 ASACSIKANDILRTGYLIDGDDGGV---------EELLRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQG---EPVRL 122 (540) T ss_pred HHHHHHHHHHHhcCCceEecCccch---------hhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCC---cEEEE Confidence 9999999999999999998654321 122222 45777888877 577899999999875433 36688 Q ss_pred EeeccceeEecccCCcceeec-------------------cCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHH Q lcl|NC_020866. 156 IWRDPRHFRFQKSDLATPLLL-------------------DDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWM 216 (547) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~ 216 (547) .++++.++++..++...+... .++.....+++...|+++.....+..||.+.+..+..... 
T Consensus 123 ~~i~~~~V~v~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~ 202 (540) T protein:vir:41 123 DYIPAHTVRVHRDGSRYMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSIL 202 (540) T ss_pred EEeCCcceEEeEcCceeEeeecCceeeeeecccccceeeccccccceeecccceEEecCCCCCCCcccccHHHHHHHHHH Confidence 899988876644332211111 1122234567777776666666778899999999999999 Q ss_pred HHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHH------HH----HHHHHHHHH----HhcC--eeEEecC----CceEEE Q lcl|NC_020866. 217 FKAFTARDWAIFTQTYGQPLRIGKWQPGASEQ------DK----DTLFRAVAN----IAGD--CAAIIPE----TMSIDF 276 (547) Q Consensus 217 ~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~------~~----~~l~~al~~----~~~~--~~~vip~----~~~ie~ 276 (547) .-....++...|....+.|--|-+++...+++ .. +.+.+...+ ...+ ..+|++. +..+++ T Consensus 203 ~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~ 282 (540) T protein:vir:41 203 AMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTF 282 (540) T ss_pred HHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeE Confidence 99999999999999999998887776443322 12 223333322 1222 2345531 223455 Q ss_pred EecCCCCCchHHHHHHHHHHHHHHHHHHhc-chhccCcc--ccccchhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 277 VESKSIGSSVDLYEKRINHLDQQISKAVLG-QTTTTDAV--AGGHAVSQEH-RLVQEDIETSDAMALAAILNRDLIRPWI 352 (547) Q Consensus 277 ~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-qtlts~~~--ggs~a~~~vh-~~v~~~~~~ad~~~i~~~ln~~li~~l~ 352 (547) ...+.. ..-..|.+..++...+|+++.-- ..+.+... +++++-.+.. .......+...++.|+..||+.|++. 
T Consensus 283 ~pl~~~-~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~-- 359 (540) T protein:vir:41 283 TPLNTS-QKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQEIVSSVLTDFIQLK-- 359 (540) T ss_pred Eecccc-hhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc-- Confidence 443332 22224788888888999988421 23333222 2233333333 34456778899999999999876542 Q ss_pred HhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHh-CCCCCCCcchhccccccccccccccccc-c Q lcl|NC_020866. 353 QLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRF-GFSQPKPGAKLLRPLENPATQAAPPGMD-P 430 (547) Q Consensus 353 ~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~-Glp~p~~~e~~~~~~~~~~~~~~~~~~~-~ 430 (547) ++. .-+|+|+...-.. ...++.+.++++.|+ +|.+++|+.+ |++. .++..+.+.............. . T Consensus 360 ---~~~---~~~i~f~~~~ll~-~D~~~~~~~lv~~G~-lT~NE~Re~L~g~e~--gdd~~l~p~n~~~~~~~~~~~~~~ 429 (540) T protein:vir:41 360 ---LDP---GARFVFNEEILME-SEFVHNYALLVQCGV-LTPSEVREKLFGLDG--GPDMFMVPSSIGKSAMKRQKRNYE 429 (540) T ss_pred ---cCC---ceEEEecchhhcc-hHHHHHHHHHHhCCC-CCHHHHHHHhCcCcC--CCcccccccccccccccccccccC Confidence 222 1256777654433 245666788999998 9999999865 6653 2233332211111000000000 0 Q ss_pred ccc--CCccccccccccccchhhhhhcccchhhhcccccccchhHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHcc Q lcl|NC_020866. 431 GAG--DLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIAALTDQLELEGQPEIVAMIEEIR-----TMLEAA 503 (547) Q Consensus 431 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~-----~~l~~a 503 (547) ... .......+..+. .... .+...+ .......++.......+++++....++.-+. 
.++..+ T Consensus 430 ~~~~~~~~k~~~~~~~~-~~~~-------~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (540) T protein:vir:41 430 KNQINEIKRTYAKYKPR-IQEI-------ISSESP---LEDKKKKIDEVLSDFRAEAYENGKKMLSIAGDMGTMSAINRG 498 (540) T ss_pred CCCccccccccchhccc-ccCc-------cccccc---cccccccccccccccCCccccchhHHHHHhhhhhhhhhhhcC Confidence 000 000000000000 0000 000000 0000000100000000111111111111110 011111 Q ss_pred --------CCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHhHHhh Q lcl|NC_020866. 504 --------SSLGEFSDMLDARYGEIGASQLASRISAAMLASEFAGREGA 544 (547) Q Consensus 504 --------~s~ee~~~~L~~l~~~~~~~~l~~~l~~al~~A~~~G~~~~ 544 (547) ..+|...+.|.+ +.+++.+.+.--|. .++|--+. T Consensus 499 ~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~--~~~~~~~~ 540 (540) T protein:vir:41 499 VSMIPPKPSNLEAYEDLLAA-----SVDDIVERIRHYLY--KVIGWREL 540 (540) T ss_pred ceecCCCCcchHHHHHHHHh-----hHHHHHHHHHHHHH--HHhhhccC Confidence 112222222222 22333333333333 23332222 No 83 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=99.74 E-value=8.1e-16 Score=103.32 Aligned_cols=481 Identities=13% Similarity=0.128 Sum_probs=224.3 Q ss_pred CCCCC-eeeCC-CCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKP-VLLDR-WGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~-~~~~~-~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |=... .|.+- .=+++.++ ...+..+.. ..+..+....+++ ..+.. 
+.+..+.| T Consensus 1 ~~~~~~~i~s~~~~~~i~~~----~~~s~~~~~--~~~~~~~~pp~~~---------------~~la~----l~~~n~~v 55 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKRE----EVESQALGE--TRFEEYVEPKVNP---------------LVLLS----LLQVNPYH 55 (542) T ss_pred Cccccccccccccchhhhhc----ccccccccc--ccCCccccCCCCH---------------HHHHH----HHhhcHHH Confidence 21000 13221 11122211 111111100 0011111111222 22222 23468999 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhC--CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKR--EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~--~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) .+|++.+...|.+++|.+.+.... .+...+-+ +.+.+++..++ +.+.+|.+.+++++...| .+..| T Consensus 56 ~scI~~ia~~IA~l~~~~~~~~~~--------~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G---~~~~L 124 (542) T protein:vir:41 56 ASACSIKANDIIRTGYILEGDDEG--------VVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRG---DPIRF 124 (542) T ss_pred HHHHHHHHHHHhhCceeeecccch--------hhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCC---cEEEE Confidence 999999999999999999754322 12233323 45777887777 578899999999875543 36788 Q ss_pred EeeccceeEecccCCcceeecc-------------------CCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHH Q lcl|NC_020866. 156 IWRDPRHFRFQKSDLATPLLLD-------------------DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWM 216 (547) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~-------------------~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~ 216 (547) .++++.++++..++........ .+..+..+++...|++++....+.+||.+.+..+..... 
T Consensus 125 ~~l~~~~v~v~~d~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~~~i~ 204 (542) T protein:vir:41 125 EYIPSHTIRVHKDGSRYRQTWDGVNITHFKDYRYEGEINPETGEDQDSVGANELVFIHIPSPVCSYYGVPRYVSAAPAIL 204 (542) T ss_pred EEEcCcceEEEEcCCeeEeeecCCcceeEEeecccccccccccccccccCcccEEEecCCCCCCCcccccHHHHHHHHHH Confidence 9999988877654433221111 111233466667776676666778999999999999988 Q ss_pred HHHHHHHHHHHHHHHcCCCeEEEeCCC----------CCCHHHHHHHHHHHHHH----hcC--eeEEec--C--CceEEE Q lcl|NC_020866. 217 FKAFTARDWAIFTQTYGQPLRIGKWQP----------GASEQDKDTLFRAVANI----AGD--CAAIIP--E--TMSIDF 276 (547) Q Consensus 217 ~K~~~~~~w~~f~E~~G~P~~vgk~~~----------~~~~~~~~~l~~al~~~----~~~--~~~vip--~--~~~ie~ 276 (547) .-....++...|...-+.|--|-+++. ..+++.++.+.+.+.+. ..+ ..+|++ . ...+++ T Consensus 205 ~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~ 284 (542) T protein:vir:41 205 AMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTF 284 (542) T ss_pred HHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeE Confidence 888889999999999999977766542 34566666666665543 222 245554 1 234555 Q ss_pred EecCCCCCchHHHHHHHHHHHHHHHHHHhc-chhccCcccccc--chhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 277 VESKSIGSSVDLYEKRINHLDQQISKAVLG-QTTTTDAVAGGH--AVSQEH-RLVQEDIETSDAMALAAILNRDLIRPWI 352 (547) Q Consensus 277 ~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-qtlts~~~ggs~--a~~~vh-~~v~~~~~~ad~~~i~~~ln~~li~~l~ 352 (547) ...+.. ..-..|.+..++..++|+++.-- ..+.+...++++ +-.+.. .......+...++.|+..||+.|++. 
T Consensus 285 ~pl~~~-~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~-- 361 (542) T protein:vir:41 285 TPLNTS-QKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILTDFFQVK-- 361 (542) T ss_pred EEcCCC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-- Confidence 544332 22234777778888999988421 223332222322 333333 33456778899999999999866543 Q ss_pred HhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHh-CCCCCCCcchhccccccccccccccccccc Q lcl|NC_020866. 353 QLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRF-GFSQPKPGAKLLRPLENPATQAAPPGMDPG 431 (547) Q Consensus 353 ~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~-Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~ 431 (547) ++. .-+|+|+...-.. ....+.++.+++.|+ ++.+++|+.+ |++. - ++..+.+.......... ..... T Consensus 362 ---~~~---~~~~~f~~~~ll~-~d~~~~~~~~v~~Gi-lT~NE~Re~L~g~~p-g-dd~~l~p~~~~~~~~~~-~~~n~ 430 (542) T protein:vir:41 362 ---FNP---KTRFKFNDETLLE-SDSVRNCALLVQSGV-LTPAEARERLFGLDG-G-PDIFMVPSKGAAKSVKR-QERNY 430 (542) T ss_pred ---cCC---ceEEEecchhhcc-hHHHHHHHHHHhCCC-CCHHHHHHhhCCCCC-C-Ccccccccccccccccc-CCcCC Confidence 221 1256665433222 223455677889997 8999999864 7753 2 22222221111110000 00000 Q ss_pred ccCCccccccccccccchhhhhhcccchhhhcccccccchhHHHHHHHHHHHH--------HHHHHHHHHHHHHHHH-H- Q lcl|NC_020866. 432 AGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIAALTDQLELE--------GQPEIVAMIEEIRTML-E- 501 (547) Q Consensus 432 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~--------~~~~~~~~~~~i~~~l-~- 501 (547) . .....+....-....+.+....-..... ........+....-+.+.+... .-..|..+..-+ ..+ . 
T Consensus 431 ~-~~~~~~~~k~~~k~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 507 (542) T protein:vir:41 431 E-KNQIREIRKIYAKYRPRFNEIISSKLSA-EEKKKKIDESLAEFRAEAYEAGKKMLIIGGDMGSMSALNQGV-SVIPSK 507 (542) T ss_pred C-CCchhhhhhcccccCccccccccccccc-hhhcccccchhhhhHHhHHhcCceEEEeecCchhhhhhhccc-eeccCC Confidence 0 0000000000000000000000000000 0000000011111111111000 000000000000 000 0 Q ss_pred --ccCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHhHHhh Q lcl|NC_020866. 502 --AASSLGEFSDMLDARYGEIGASQLASRISAAMLASEFAGREGA 544 (547) Q Consensus 502 --~a~s~ee~~~~L~~l~~~~~~~~l~~~l~~al~~A~~~G~~~~ 544 (547) +-+-|||+++ + +.+++-..+.--+. .++|--+. T Consensus 508 ~~~~~~~~~~~~---~-----~~~~~~~~~~~~~~--~~~~~~~~ 542 (542) T protein:vir:41 508 PLNLERYEELLE---A-----SVEDMIGRIRHYLY--KVIGWREL 542 (542) T ss_pred CcChHHHHHHHH---h-----hHHHHHHHHHHHHH--HHhhhccC Confidence 0011222211 1 11111111111111 22222222 No 84 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=99.74 E-value=1.5e-16 Score=107.33 Aligned_cols=374 Identities=12% Similarity=0.020 Sum_probs=213.4 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcC----CCHHHHHHHHHHHHhhhH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADH----GDPVRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~----g~~~~~~~L~~~m~~~D~ 76 (547) |. |.++....+ +|.+ .+++..+. .+.... ..-+.+. 
+-+ T Consensus 1 mg----~~~~~~~~~-----------------------------~~~~--~~~~~~~~~~~~~~~~~~-~t~~~~~-~~~ 43 (403) T protein:vir:10 1 MG----FKSWITEKL-----------------------------NPGQ--RIIRDMEPVSHRTNRKPF-TTGQAYS-KIE 43 (403) T ss_pred Cc----chhhhhhcc-----------------------------chhh--hhhhcccccccccCCccc-ccHHHHH-HHH Confidence 55 333332222 2211 11111100 000000 0001111 467 Q ss_pred HHHHHHHHHHHHHhcCCceEecCCCCHHHH--HHHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCC Q lcl|NC_020866. 77 HYLGVIGTRKRSVSQLDITVEPASDDPQDV--KIADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEG 148 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~--~~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g 148 (547) .|.+|+......|.+++|+|.......... ....-+..+|. ++ ...++.+.+. +.+++|-+.+.+. +. T Consensus 44 ~v~~cv~~Ia~~ia~~p~~v~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~----~~ 119 (403) T protein:vir:10 44 ILNRTANMVIDSAAECSYTVGDKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWD----GT 119 (403) T ss_pred HHHHHHHHHHHHHhhCceeEeecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe----Cc Confidence 799999999999999999996432211111 01112233343 33 3566666666 5678998876542 22 Q ss_pred eeeEEeeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecC----CcCccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 149 QYFPDKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPA----KSGILLRSGLARVAAWGWMFKAFTARD 224 (547) Q Consensus 149 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~----~~~~p~G~gll~~~~~~~~~K~~~~~~ 224 (547) .+.++|+..+.+..+....+... ....+..+++...+.++... 
..+.++|.+.+..+....-.-....++ T Consensus 120 -----~l~~l~~~~~~v~~~~~~~~~~~-~~~~~~~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~ 193 (403) T protein:vir:10 120 -----SLYHVPAALMQVEADANKFIKKF-IFNNQINYRVDEIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNF 193 (403) T ss_pred -----eeEeecCcceEEEEcCCceEEEE-EecCceeecccceEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHH Confidence 35566777666554443333222 22234556666655544222 246689999999999888888888888 Q ss_pred HHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHH Q lcl|NC_020866. 225 WAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQ 299 (547) Q Consensus 225 w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~ 299 (547) -..|.+.-+.|--|-+++...++++++++.+.+.+.-+ +..+|++.|++++-+........ -.|.+..++..++ T Consensus 194 ~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d-~q~~e~~~~~~~~ 272 (403) T protein:vir:10 194 KEKFLDNGTVIGLILETDEILNKKLRERKQEELQLDYNPSTGQSSVLILDGGMKAKPYSQISSFKD-LDFKEDIEGFNKS 272 (403) T ss_pred HHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhCCcccCcceeecCCCceeEEecccCCHHH-HHHHHHHHHHHHH Confidence 88888888889888899988999999999998887633 23689999999887653221111 2367778888999 Q ss_pred HHHHHhcc--hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCc----ccc Q lcl|NC_020866. 300 ISKAVLGQ--TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARP----KTE 373 (547) Q Consensus 300 Isk~ilGq--tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~----~~e 373 (547) |+++ +|- .+..++.+++ ..+........-+...++.|++.||+.|. ++|.|+.. -.. 
T Consensus 273 Ia~~-fgVPp~~lg~~~~sn--~e~~~~~f~~~tl~P~~~~ie~~l~~~L~--------------~~~~~d~~~~~~l~~ 335 (403) T protein:vir:10 273 ICLA-FGVPQVLLDGGNNAN--IRPNIELFYYMTIIPMLNKLTSSLTFFFG--------------YKITPNTKEVAALTP 335 (403) T ss_pred HHHH-hCCCHHHcCCCCCcC--HHHHHHHHHHHHHHHHHHHHHHHHHHhcC--------------ceeeeccchhhhccc Confidence 9987 442 2222222222 22333445566677888889999887551 23444432 245 Q ss_pred CHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCC--cchhcccccccccccccccccccccCCcccccccc Q lcl|NC_020866. 374 DLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKP--GAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLK 443 (547) Q Consensus 374 d~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~--~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (547) |.+.+++++.++++.|+ ++.+++|+.+|+|+-++ ++..+-+ .+......+.+ .+.+..+.+....+ T Consensus 336 D~~~~~~~~~~~~~~G~-lT~NE~R~~~gl~pi~~~~~d~~~~p-~n~~~~~~~~~--~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 336 DKEAEAKHLTSLVNNGI-ITGNEARSELNLEPLDDEQMNKIRIP-ANVAGSATGVS--GQEGGRPKGSTEGD 403 (403) T ss_pred CHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCcccccccccc-cccccccccCC--CCcCCCCCCCcCCC Confidence 88899999999999998 99999999999986432 2222222 21111111111 11111111111111 No 85 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=99.73 E-value=1.1e-16 Score=108.00 Aligned_cols=424 Identities=15% Similarity=0.047 Sum_probs=227.9 Q ss_pred CCCCCeeeCCCCCcccch-hhhh-hhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCC-CHHHHHHHHHHHHhhhHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRA-QLTE-EVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHG-DPVRYLDLAEVIEERDPH 77 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~-~l~~-~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g-~~~~~~~L~~~m~~~D~~ 77 (547) |- |+|....-...+ +.+- +.+.+...... ...++. .+.+-.++...|...... ....+..+-..-..+-+. 
T Consensus 1 M~----~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~~~ 74 (466) T protein:vir:81 1 MR----LIDRLLSTRGAAPRMSIDDYAQMLNEFAF-NGIGYG-FGGGVPRIQQTLAGPSTELAPDTFVGLATQAYQANGP 74 (466) T ss_pred Cc----hhHHHhhccCcccccchhhhhhhhhhhhc-cccccc-cccccHHHHHhhccccccccCccccccchhhhhccHH Confidence 77 555533322211 0101 11111111000 011111 122222333333321100 000111111111125788 Q ss_pred HHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecC----- Q lcl|NC_020866. 78 YLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSE----- 147 (547) Q Consensus 78 v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~----- 147 (547) |.+|++.+...|.+++|.|.-..++...+.....+..++.++ ...++++.++ +.+.+|.+.+++++...+ T Consensus 75 v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~ 154 (466) T protein:vir:81 75 VFACMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPD 154 (466) T ss_pred HHHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccc Confidence 999999999999999999975443221111112334455554 3666777766 577899999999874322 Q ss_pred CeeeEEeeEeeccceeEecccCCc--ce--eeccC----CCCceeeccCceEEEeecC-CcCccccchhHHHHHHHHHHH Q lcl|NC_020866. 148 GQYFPDKLIWRDPRHFRFQKSDLA--TP--LLLDD----NGAEVPLPAFRFIFADIPA-KSGILLRSGLARVAAWGWMFK 218 (547) Q Consensus 148 g~~~~~~l~~~~~~~~~~~~~~~~--~~--~~~~~----~~~g~~l~~~k~i~~~~~~-~~~~p~G~gll~~~~~~~~~K 218 (547) ..-.+..+.++++..+.+..+... .+ ....+ ......+++...|++++.. 
..+..||.+.+..+....-.- T Consensus 155 ~~g~~~~l~~l~~~~v~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~dviHir~~~~~~d~~~G~s~i~~~~~~i~~~ 234 (466) T protein:vir:81 155 WVDVVVEERMVRGGRGELGGGQLGWRKVGYLYTEGGRQSGNESVGFLAEDVVHFAPIPDPLASYRGMSWLTPILREIRAD 234 (466) T ss_pred cCcceeEEEEecCcceEEEEcCCCceEEEEEEEecCcccccceeeeccccEEEEcCCCCcccccccccHHHHHHHHHHHH Confidence 112367899999988776543332 22 11111 1234568888887766543 345578999999999988888 Q ss_pred HHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHH Q lcl|NC_020866. 219 AFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRI 293 (547) Q Consensus 219 ~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li 293 (547) ....++...+....+.|--|-+++...++++++++.+.+.+.-+ +..+|++.|++++-+..+. ....|.+.. T Consensus 235 ~a~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~---~d~q~le~~ 311 (466) T protein:vir:81 235 QAMSKHQAKFFDNGATVNLVIKHNPMADPAAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNL---QEIDFKNVR 311 (466) T ss_pred HHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCCh---hHHHHHHHH Confidence 88889999999999999888899988899999999999887633 2357899998888765332 222477788 Q ss_pred HHHHHHHHHHHhc-chhc---cCccccccchhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEec Q lcl|NC_020866. 294 NHLDQQISKAVLG-QTTT---TDAVAGGHAVSQEH-RLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIA 368 (547) Q Consensus 294 ~~~d~~Isk~ilG-qtlt---s~~~ggs~a~~~vh-~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~ 368 (547) ++..++|+++.-- ..+. .+...++++-.+-. ......-+...++.|+..||+.|+.. ..... 
-+|+|+ T Consensus 312 ~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~------~~~~~-~~~~f~ 384 (466) T protein:vir:81 312 GGGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCIGHVMPDM------GPDVR-LWYDAD 384 (466) T ss_pred HHHHHHHHHHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc------ccCcc-eEEEec Confidence 8899999988411 1222 12223445433333 34456677888999999999876442 11111 244554 Q ss_pred Cc--cccCHHHHHHH-------HHHHHhCCCCCCHHHHHHHhCCCCCCCcch-hcccccccccccccccccccccCCccc Q lcl|NC_020866. 369 RP--KTEDLTKLASS-------LDTLVRLGMEIEENEVRSRFGFSQPKPGAK-LLRPLENPATQAAPPGMDPGAGDLKTA 438 (547) Q Consensus 369 ~~--~~ed~~~~a~~-------~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (547) .. -..|.+.++++ ++.+++.|+ +.+++|+.... ++. .+...........+++.... ...+.. T Consensus 385 ~~~llr~d~~~r~~~~~~~~~~~~~~~~~g~--t~nE~r~~~~~-----gd~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 456 (466) T protein:vir:81 385 DVPFLREDEKDAADIQKVRAETINTLITAGY--EPESVVAAVNS-----GDLRLLKHTGLTSVQLLPPGVSAS-ASSDTP 456 (466) T ss_pred chhhhccCHHHHHHHHHHHHHHHHHHHHcCC--ChhhccccccC-----CccccccCCCcchhhhcccccccc-cCCCCc Confidence 33 34566665544 667888884 77888854321 111 11111000000000000000 000000 Q ss_pred cccccccccc Q lcl|NC_020866. 439 ERKLKPLSGQ 448 (547) Q Consensus 439 ~~~~~~~~~~ 448 (547) ........++ T Consensus 457 ~~~Gg~~ngn 466 (466) T protein:vir:81 457 TSGGADDNGN 466 (466) T ss_pred ccCCCCcCCC Confidence 0000000000 No 86 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=99.70 E-value=8.4e-16 Score=103.23 Aligned_cols=380 Identities=12% Similarity=0.030 Sum_probs=193.1 Q ss_pred hhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHH Q lcl|NC_020866. 
29 VTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKI 108 (547) Q Consensus 29 ~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ 108 (547) +.-+.+++.. .-.+..+.. ......+...-..+-+.|.+|++.....|.+++|.+.-.+. ..+... T Consensus 1 Mg~f~~lf~~----~~~~~~~~~---------~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~-~~~~~~ 66 (395) T protein:vir:95 1 MSILEKIFKT----RKDITYMLD---------LDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR-IQKNDV 66 (395) T ss_pred Cchhhhhhcc----Ccccccccc---------chhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc-cccchH Confidence 2111111111 101100000 00001111111114688999999999999999999875432 222222 Q ss_pred HHHHHHH-HhCCCHHHHHHHHHH-hhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcceeeccCCCCceeec Q lcl|NC_020866. 109 ADMVRDW-LKREELQQEVFHILD-CISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLP 186 (547) Q Consensus 109 ae~v~~~-l~~~~~~~~i~~~l~-a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~ 186 (547) +..+... -..+...++++.++. .+..|.++..+ .+++.+.+.....+.+.. ........+.....+....++ T Consensus 67 ~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~---~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 140 (395) T protein:vir:95 67 YYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVV---SDSKELLIADSFYREEYA---LYDDIFKDVTVKDYTYQRTFT 140 (395) T ss_pred HHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEE---ecCCCeEecCCccceeEe---ecCcceeEEEEcCceeeeeec Confidence 3322211 111245556665553 34456554322 223333322222221211 111111222233333345788 Q ss_pred cCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCC-CCHHHHHHHHHHHHHHhc--- Q lcl|NC_020866. 187 AFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPG-ASEQDKDTLFRAVANIAG--- 262 (547) Q Consensus 187 ~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~-~~~~~~~~l~~al~~~~~--- 262 (547) +..+|++++....+..||.|.+..+.... .....+..+.|.|--+.+++.. 
.+++.++++.+.+.++.+ T Consensus 141 ~~evih~~~~~~~~~~~G~spi~~~~~~~-------~~~~~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~ 213 (395) T protein:vir:95 141 MQEVIYLKYNNNKVTHFVESLFEDYGKIF-------GRMIGAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFN 213 (395) T ss_pred cccEEEEccCCCCcccccchHHHHHHHHH-------HHHHHHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhcccc Confidence 89988888888888899999988775443 2222333455665444455443 577788888777765432 Q ss_pred -Ce--eEEecCCceEEEEecCCCCCch--HHHHHHHHHHHHHHHHHHhc-chhccCccccccc-hhHHHHHHHHHHHHHH Q lcl|NC_020866. 263 -DC--AAIIPETMSIDFVESKSIGSSV--DLYEKRINHLDQQISKAVLG-QTTTTDAVAGGHA-VSQEHRLVQEDIETSD 335 (547) Q Consensus 263 -~~--~~vip~~~~ie~~e~~~~~~~~--~~f~~li~~~d~~Isk~ilG-qtlts~~~ggs~a-~~~vh~~v~~~~~~ad 335 (547) +. +++++.|++++-++.+...... ..|.+..++..++|+++.-- ..+. +|+++ ..+........-+..- T Consensus 214 ~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l----~~~~sn~e~~~~~~~~~~l~P~ 289 (395) T protein:vir:95 214 KNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLI----YGETADLEKNTLVFEKFCLTPL 289 (395) T ss_pred ccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh----cCcccCHHHHHHHHHHHHHHHH Confidence 23 3345778887766543322211 12556666778889988421 1222 12333 3444455556678888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCc--chhc Q lcl|NC_020866. 336 AMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPG--AKLL 413 (547) Q Consensus 336 ~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~--e~~~ 413 (547) +..|+..||+.|+.+--.. 
..-+|.+......|.+.+++++.++++.|+ ++.+++|+.+|+|+-+++ +..+ T Consensus 290 ~~~ie~~l~~kL~~~~~~~------~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~-lt~NE~R~~~g~~p~~~g~~d~~~ 362 (395) T protein:vir:95 290 LKKIQNELNAKLITQSMYL------KDTRIEIVGVNKKDPLQYAEAIDKLVSSGS-FTRNEVRIMLGEEPSDNPELDEYL 362 (395) T ss_pred HHHHHHHHHHhhcChhhhc------ccceecchhhhccCHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCCceee Confidence 9999999998876542111 112566665667889999999999999998 899999999999865544 3333 Q ss_pred ccccccccccccccccccccCCccccccccccccchh Q lcl|NC_020866. 414 RPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 414 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) .+... .+...+.......... ..+.......-+ T Consensus 363 ~~~n~-~~~~~~~~~~~~~~~~---~~kgg~~~~~g~ 395 (395) T protein:vir:95 363 ITKNY-EKANSGENDEKEKDEN---TLKGGDEDESGD 395 (395) T ss_pred ecccc-ccccccccccCccccc---ccCCCCCCCCCC Confidence 22211 1111110000000000 000000000000 No 87 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=99.70 E-value=8.4e-16 Score=103.23 Aligned_cols=380 Identities=12% Similarity=0.030 Sum_probs=193.1 Q ss_pred hhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHH Q lcl|NC_020866. 29 VTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKI 108 (547) Q Consensus 29 ~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ 108 (547) +.-+.+++.. .-.+..+.. ......+...-..+-+.|.+|++.....|.+++|.+.-.+. ..+... 
T Consensus 1 Mg~f~~lf~~----~~~~~~~~~---------~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~-~~~~~~ 66 (395) T protein:vir:10 1 MSILEKIFKT----RKDITYMLD---------LDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR-IQKNDV 66 (395) T ss_pred Cchhhhhhcc----Ccccccccc---------chhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc-cccchH Confidence 2111111111 101100000 00001111111114688999999999999999999875432 222222 Q ss_pred HHHHHHH-HhCCCHHHHHHHHHH-hhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcceeeccCCCCceeec Q lcl|NC_020866. 109 ADMVRDW-LKREELQQEVFHILD-CISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLP 186 (547) Q Consensus 109 ae~v~~~-l~~~~~~~~i~~~l~-a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~ 186 (547) +..+... -..+...++++.++. .+..|.++..+ .+++.+.+.....+.+.. ........+.....+....++ T Consensus 67 ~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~---~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 140 (395) T protein:vir:10 67 YYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVV---SDSKELLIADSFYREEYA---LYDDIFKDVTVKDYTYQRTFT 140 (395) T ss_pred HHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEE---ecCCCeEecCCccceeEe---ecCcceeEEEEcCceeeeeec Confidence 3322211 111245556665553 34456554322 223333322222221211 111111222233333345788 Q ss_pred cCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCC-CCHHHHHHHHHHHHHHhc--- Q lcl|NC_020866. 187 AFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPG-ASEQDKDTLFRAVANIAG--- 262 (547) Q Consensus 187 ~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~-~~~~~~~~l~~al~~~~~--- 262 (547) +..+|++++....+..||.|.+..+.... .....+..+.|.|--+.+++.. 
.+++.++++.+.+.++.+ T Consensus 141 ~~evih~~~~~~~~~~~G~spi~~~~~~~-------~~~~~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~ 213 (395) T protein:vir:10 141 MQEVIYLKYNNNKVTHFVESLFEDYGKIF-------GRMIGAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFN 213 (395) T ss_pred cccEEEEccCCCCcccccchHHHHHHHHH-------HHHHHHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhcccc Confidence 89988888888888899999988775443 2222333455665444455443 577788888777765432 Q ss_pred -Ce--eEEecCCceEEEEecCCCCCch--HHHHHHHHHHHHHHHHHHhc-chhccCccccccc-hhHHHHHHHHHHHHHH Q lcl|NC_020866. 263 -DC--AAIIPETMSIDFVESKSIGSSV--DLYEKRINHLDQQISKAVLG-QTTTTDAVAGGHA-VSQEHRLVQEDIETSD 335 (547) Q Consensus 263 -~~--~~vip~~~~ie~~e~~~~~~~~--~~f~~li~~~d~~Isk~ilG-qtlts~~~ggs~a-~~~vh~~v~~~~~~ad 335 (547) +. +++++.|++++-++.+...... ..|.+..++..++|+++.-- ..+. +|+++ ..+........-+..- T Consensus 214 ~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l----~~~~sn~e~~~~~~~~~~l~P~ 289 (395) T protein:vir:10 214 KNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLI----YGETADLEKNTLVFEKFCLTPL 289 (395) T ss_pred ccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh----cCcccCHHHHHHHHHHHHHHHH Confidence 23 3345778887766543322211 12556666778889988421 1222 12333 3444455556678888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCc--chhc Q lcl|NC_020866. 336 AMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPG--AKLL 413 (547) Q Consensus 336 ~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~--e~~~ 413 (547) +..|+..||+.|+.+--.. 
..-+|.+......|.+.+++++.++++.|+ ++.+++|+.+|+|+-+++ +..+ T Consensus 290 ~~~ie~~l~~kL~~~~~~~------~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~-lt~NE~R~~~g~~p~~~g~~d~~~ 362 (395) T protein:vir:10 290 LKKIQNELNAKLITQSMYL------KDTRIEIVGVNKKDPLQYAEAIDKLVSSGS-FTRNEVRIMLGEEPSDNPELDEYL 362 (395) T ss_pred HHHHHHHHHHhhcChhhhc------ccceecchhhhccCHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCCceee Confidence 9999999998876542111 112566665667889999999999999998 899999999999865544 3333 Q ss_pred ccccccccccccccccccccCCccccccccccccchh Q lcl|NC_020866. 414 RPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 414 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) .+... .+...+.......... ..+.......-+ T Consensus 363 ~~~n~-~~~~~~~~~~~~~~~~---~~kgg~~~~~g~ 395 (395) T protein:vir:10 363 ITKNY-EKANSGENDEKEKDEN---TLKGGDEDESGD 395 (395) T ss_pred ecccc-ccccccccccCccccc---ccCCCCCCCCCC Confidence 22211 1111110000000000 000000000000 No 88 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=99.70 E-value=8.4e-16 Score=103.23 Aligned_cols=380 Identities=12% Similarity=0.030 Sum_probs=193.1 Q ss_pred hhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHH Q lcl|NC_020866. 29 VTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKI 108 (547) Q Consensus 29 ~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ 108 (547) +.-+.+++.. .-.+..+.. ......+...-..+-+.|.+|++.....|.+++|.+.-.+. ..+... 
T Consensus 1 Mg~f~~lf~~----~~~~~~~~~---------~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~-~~~~~~ 66 (395) T protein:vir:10 1 MSILEKIFKT----RKDITYMLD---------LDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR-IQKNDV 66 (395) T ss_pred Cchhhhhhcc----Ccccccccc---------chhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc-cccchH Confidence 2111111111 101100000 00001111111114688999999999999999999875432 222222 Q ss_pred HHHHHHH-HhCCCHHHHHHHHHH-hhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcceeeccCCCCceeec Q lcl|NC_020866. 109 ADMVRDW-LKREELQQEVFHILD-CISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLP 186 (547) Q Consensus 109 ae~v~~~-l~~~~~~~~i~~~l~-a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~ 186 (547) +..+... -..+...++++.++. .+..|.++..+ .+++.+.+.....+.+.. ........+.....+....++ T Consensus 67 ~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~---~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 140 (395) T protein:vir:10 67 YYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVV---SDSKELLIADSFYREEYA---LYDDIFKDVTVKDYTYQRTFT 140 (395) T ss_pred HHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEE---ecCCCeEecCCccceeEe---ecCcceeEEEEcCceeeeeec Confidence 3322211 111245556665553 34456554322 223333322222221211 111111222233333345788 Q ss_pred cCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCC-CCHHHHHHHHHHHHHHhc--- Q lcl|NC_020866. 187 AFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPG-ASEQDKDTLFRAVANIAG--- 262 (547) Q Consensus 187 ~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~-~~~~~~~~l~~al~~~~~--- 262 (547) +..+|++++....+..||.|.+..+.... .....+..+.|.|--+.+++.. 
.+++.++++.+.+.++.+ T Consensus 141 ~~evih~~~~~~~~~~~G~spi~~~~~~~-------~~~~~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~ 213 (395) T protein:vir:10 141 MQEVIYLKYNNNKVTHFVESLFEDYGKIF-------GRMIGAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFN 213 (395) T ss_pred cccEEEEccCCCCcccccchHHHHHHHHH-------HHHHHHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhcccc Confidence 89988888888888899999988775443 2222333455665444455443 577788888777765432 Q ss_pred -Ce--eEEecCCceEEEEecCCCCCch--HHHHHHHHHHHHHHHHHHhc-chhccCccccccc-hhHHHHHHHHHHHHHH Q lcl|NC_020866. 263 -DC--AAIIPETMSIDFVESKSIGSSV--DLYEKRINHLDQQISKAVLG-QTTTTDAVAGGHA-VSQEHRLVQEDIETSD 335 (547) Q Consensus 263 -~~--~~vip~~~~ie~~e~~~~~~~~--~~f~~li~~~d~~Isk~ilG-qtlts~~~ggs~a-~~~vh~~v~~~~~~ad 335 (547) +. +++++.|++++-++.+...... ..|.+..++..++|+++.-- ..+. +|+++ ..+........-+..- T Consensus 214 ~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l----~~~~sn~e~~~~~~~~~~l~P~ 289 (395) T protein:vir:10 214 KNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLI----YGETADLEKNTLVFEKFCLTPL 289 (395) T ss_pred ccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh----cCcccCHHHHHHHHHHHHHHHH Confidence 23 3345778887766543322211 12556666778889988421 1222 12333 3444455556678888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCc--chhc Q lcl|NC_020866. 336 AMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPG--AKLL 413 (547) Q Consensus 336 ~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~--e~~~ 413 (547) +..|+..||+.|+.+--.. 
..-+|.+......|.+.+++++.++++.|+ ++.+++|+.+|+|+-+++ +..+ T Consensus 290 ~~~ie~~l~~kL~~~~~~~------~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~-lt~NE~R~~~g~~p~~~g~~d~~~ 362 (395) T protein:vir:10 290 LKKIQNELNAKLITQSMYL------KDTRIEIVGVNKKDPLQYAEAIDKLVSSGS-FTRNEVRIMLGEEPSDNPELDEYL 362 (395) T ss_pred HHHHHHHHHHhhcChhhhc------ccceecchhhhccCHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCCCCCceee Confidence 9999999998876542111 112566665667889999999999999998 899999999999865544 3333 Q ss_pred ccccccccccccccccccccCCccccccccccccchh Q lcl|NC_020866. 414 RPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 414 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) .+... .+...+.......... ..+.......-+ T Consensus 363 ~~~n~-~~~~~~~~~~~~~~~~---~~kgg~~~~~g~ 395 (395) T protein:vir:10 363 ITKNY-EKANSGENDEKEKDEN---TLKGGDEDESGD 395 (395) T ss_pred ecccc-ccccccccccCccccc---ccCCCCCCCCCC Confidence 22211 1111110000000000 000000000000 No 89 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=99.69 E-value=6.2e-17 Score=109.43 Aligned_cols=364 Identities=11% Similarity=0.001 Sum_probs=212.4 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |- |.+...+.-..+.... .. +...+ ++. .+..-..|....... 
.+ +-+.|.+ T Consensus 1 Mg----lf~~~~~~~~~~~~~~-------~~----~~~~~----~~~----~~~~~~~~~~v~~~~----al-~~~~V~~ 52 (384) T protein:vir:49 1 MP----IFNITNLATESPPSNQ-------DS----FFDIT----DPE----FLDALNGSEWVSAET----AL-KNSDLFS 52 (384) T ss_pred Cc----cccccccCcccccccc-------hh----hcccc----chh----hcccccCCceechhh----hh-ccHHHHH Confidence 76 5554222111110000 00 00000 110 000001111110001 11 3577999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) |++.+...|.++++++.-... ..++.++ ...++++.++ +.+.+|-+++++++...+ .+..| T Consensus 53 ~i~~Ia~~ia~l~~~~~~~~~-----------~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g---~~~~L 118 (384) T protein:vir:49 53 IISQLSNDLATAKITTSRKQL-----------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENG---RDMKW 118 (384) T ss_pred HHHHHHHHHhhCceeeecchh-----------hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCC---cEEEE Confidence 999999999999999973221 1233333 4677777777 567799999999985433 36789 Q ss_pred EeeccceeEecccC-Ccce--eecc-C--CCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 156 IWRDPRHFRFQKSD-LATP--LLLD-D--NGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFT 229 (547) Q Consensus 156 ~~~~~~~~~~~~~~-~~~~--~~~~-~--~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 229 (547) .+++|.++++..+. ...+ .... + .+....+++..+|++++....+..+|.|.+..+....-.-....++...+. 
T Consensus 119 ~~l~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~ 198 (384) T protein:vir:49 119 EYLRPSQVSFNRLDNQNGLYYNITFDDPRIPPKQHVPQGDILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNAL 198 (384) T ss_pred EEEcCceeEEEEcCCCceEEEEEEecCccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999988764432 2222 2221 1 233467888888877777777778999999999999888888889999999 Q ss_pred HHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc--CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc- Q lcl|NC_020866. 230 QTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG--DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG- 306 (547) Q Consensus 230 E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~--~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG- 306 (547) ..-+.|--+.+++...+++++.+..++-..... +..++++.|++++-+..+ .. ...|.+..++..++|+.+.-- T Consensus 199 ~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~--~~-d~q~~e~~~~~~~~Ia~~fgVp 275 (384) T protein:vir:49 199 KNALNANGILKIKGGGLLDFKTKQSRSRQAMKQMQGGPLVLDDLEDFTPLEIK--SN-VAQLLSQADWTTGQFAKVYGIP 275 (384) T ss_pred hccCCCceEEEeCCCCChHHHHHHHHHHHhcccCCccceecCCCceEEEccCC--hh-hHHHHHHHHHHHHHHHHHhCCC Confidence 999999999999877776666554444333333 345778888876654322 22 224677778888999988421 Q ss_pred chhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hcCCCcccccEEEecCccccCHHHHHHHHH Q lcl|NC_020866. 307 QTTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQ---LEYGPQKRYPRIKIARPKTEDLTKLASSLD 383 (547) Q Consensus 307 qtlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~---~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~ 383 (547) ..+.+.. +++.+..+.-++.....++.-++-+.+.+++.+.+.+.. ....+.....+|.++..-..++....++.. 
T Consensus 276 ~~~lg~~-~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~ 354 (384) T protein:vir:49 276 ESVVGGE-GDKQSSLEMIYNIYFKAVSRFLRPFVSELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLY 354 (384) T ss_pred HHHhCCC-CCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHH Confidence 1222221 112222233344445555666666666666655544311 111111222233333333456777888888 Q ss_pred HHHhCCCCCCHHHHHHHhCCCCCCCcc---hh Q lcl|NC_020866. 384 TLVRLGMEIEENEVRSRFGFSQPKPGA---KL 412 (547) Q Consensus 384 ~l~~~G~~v~~~~~~e~~Glp~p~~~e---~~ 412 (547) ++...|+. + +++|+..|+|+-+.|+ +. T Consensus 355 ~l~~~g~~-~-ne~r~~~~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 355 VLQQAEIL-P-KDLPEGETDSTLKGGETNEQY 384 (384) T ss_pred HHhhCCCC-C-hhHHHHcCCCCCCCCCCCCCC Confidence 88888874 4 7799999987644332 22 No 90 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.69 E-value=4.7e-15 Score=99.13 Aligned_cols=447 Identities=13% Similarity=0.010 Sum_probs=221.2 Q ss_pred CCCCCeeeCCCCCcc-cchhhhhhh------hhhhhhhhHHHHhhhhccCCCHHHHHH--------HHH----------- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPV-QRAQLTEEV------AAPTVTGVRSPLSGYPGDGLNPLRLAQ--------ILR----------- 54 (547) Q Consensus 1 ma~~~~~~~~~g~~~-~~~~l~~~~------a~~~~~~~~~~~~~~~~~~l~p~~~~~--------ilr----------- 54 (547) ||. -|++-||- ....|.+.. ++++.-... +.| -++|..+.. .|- T Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~---~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~ 70 (532) T protein:vir:94 1 MAD----TDPTPRPEITYATLQQAQRVDAKRATHTSLGLA---TAH---EIDPTAYSPYERNAAQNAMAMDYGLQTGRNG 70 (532) T ss_pred CCC----CCCCCCcceehhhhhhHhhhhhhhhhhhhhhhh---hhh---hhcccccccccccccccccccccccCccccc Confidence 885 45655552 222222211 111100000 111 111111110 000 Q ss_pred ----hhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH- Q lcl|NC_020866. 
55 ----AADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL- 129 (547) Q Consensus 55 ----~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l- 129 (547) ..........++|+..+ ++.+-++.++++.-.-.++..|+|...+++..+.+....+++.++++...+.+.+++ T Consensus 71 ~~~~~~~~~~~~~~~~l~a~Y-~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l~v~~~l~~a~~ 149 (532) T protein:vir:94 71 RNALSFVEATSWPGFPTLALL-AQLPEYRTMHETPADECVRAWGKITCSSKDELAADKATRITQKLEQYNVRTLVRTVVI 149 (532) T ss_pred ccccccccccccchHHHHHHH-HcCchhhhhhccchHHHhhCCceEeeCCccccchHHHHHHHHHHHhhhHHHHHHHHHH Confidence 00001111223444333 468999999999999999999999876655545556667777777777666666555 Q ss_pred HhhhhcCeeeeeEEeecC----------------CeeeEEeeEeeccceeEecccCCcc----------eeeccCCCCce Q lcl|NC_020866. 130 DCISKGYSFTEIVWNTSE----------------GQYFPDKLIWRDPRHFRFQKSDLAT----------PLLLDDNGAEV 183 (547) Q Consensus 130 ~a~~~G~sv~Ei~w~~~~----------------g~~~~~~l~~~~~~~~~~~~~~~~~----------~~~~~~~~~g~ 183 (547) .+.+||.+++=+.=.-++ +...++.|...+|.|+......... .+.. ..+. T Consensus 150 ~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v---~~g~ 226 (532) T protein:vir:94 150 HDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIA---TSGK 226 (532) T ss_pred hhhcccceEEEEEeccCCccccccccccccccccccceeeEEEeechheecccccccccccccccCCceeEEE---ccCe Confidence 688999886533211111 1112356777777776543221110 1111 1245 Q ss_pred eeccCceEEEeecC------CcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHH Q lcl|NC_020866. 184 PLPAFRFIFADIPA------KSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAV 257 (547) Q Consensus 184 ~l~~~k~i~~~~~~------~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al 257 (547) .+.+.++|++.+.. 
...+.+|.+++..+|....--......-+..+.++.+.+..-..-...+.+..+.+.+.+ T Consensus 227 ~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~a~~ls~~~~~~~~~r~ 306 (532) T protein:vir:94 227 KIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMTNLATDMAQLLAPGGAQSLDARL 306 (532) T ss_pred eeccceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcchhHHHHHHHH Confidence 67777777765442 234557999999999877665566666677788888776532211111222334443333 Q ss_pred ---HHHh-cCeeEEecC-CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhc---cCccccccchhHHHHHHHH Q lcl|NC_020866. 258 ---ANIA-GDCAAIIPE-TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTT---TDAVAGGHAVSQEHRLVQE 329 (547) Q Consensus 258 ---~~~~-~~~~~vip~-~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlt---s~~~ggs~a~~~vh~~v~~ 329 (547) .... +.+.++++. +.+++.+...- ++ -+.+++..-++||-+ +|-.+| +.+.+|-.+.|+.-...+- T Consensus 307 ~~~~~~~~n~g~~~id~~~e~~e~~~~~l-sg----l~~~l~~~~~~iAaa-~~IP~t~LfG~sp~GlnstGe~D~~~yy 380 (532) T protein:vir:94 307 QLFNLYRDNRNIGALDKGTEEIQQTNTPL-SG----LDSLQAQSQEQMAAV-SHIPLVKLLGITPNGLNASSDGEIRVWY 380 (532) T ss_pred HHHHhhcCCccceEEcCCCceeEEEeccc-CC----HHHHHHHHHHHHHhH-hCCCeeeeecCCcccccccchHHHHHHH Confidence 3332 346677876 47888886443 33 345566666677755 554433 3334455566777777888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHH-------HHHHHHHHHHhCCCCCCHHHHHHHhC Q lcl|NC_020866. 
330 DIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLT-------KLASSLDTLVRLGMEIEENEVRSRFG 402 (547) Q Consensus 330 ~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~-------~~a~~~~~l~~~G~~v~~~~~~e~~G 402 (547) +.+++-......-+.+.|++.|+...||....--.|+|......+-+ +.+++++++.+.|+ ++.+++|+.++ T Consensus 381 d~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~~d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~Gv-i~~~Evr~~l~ 459 (532) T protein:vir:94 381 DFIAGYQATNLTPLMEWIIDLIQLSEYGQIDPGLAWEWSPLMELDDKELAEVRQLNASTDSTLMELGV-IDAKMVQQRLA 459 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCC-CCHHHHHHHHh Confidence 88888876654455556777777666764333346667644444333 34566788888887 99999999998 Q ss_pred CCCCCCcchhc-------ccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchh Q lcl|NC_020866. 403 FSQPKPGAKLL-------RPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGA 472 (547) Q Consensus 403 lp~p~~~e~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (547) ...-...+... ................+..++.. ..++.....+.++ .............+.+... T Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~d~~~---~~~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:94 460 ADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQT-PNPQPDSEDDQTD---NQPDAQADPAQNDQPVGNR 532 (532) T ss_pred cCCccccccccccccccccccchhhhhcccccCCCCCCCCC-CCCCCCCCCCCCC---CccCCCccccccCCCcCCC Confidence 64321111110 00000000000000000000000 0000000000000 0001111111111111111 No 91 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.69 E-value=7.6e-15 Score=98.00 Aligned_cols=393 Identities=11% Similarity=0.018 Sum_probs=216.9 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCC-CHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHH Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGL-NPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYL 79 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l-~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~ 79 (547) |- ..++..+... ++ |.. +...-..+.+....+..|+..+ ++.+-++ T Consensus 1 ~~-------------------------~~D~~~~~~~-----~~g~~~--~~~~~~~~~~~~~~~~~l~a~Y-~~~~l~~ 47 (437) T protein:vir:52 1 MK-------------------------FFDGIKSLAL-----KLGSKQ--EQTYYSPSLSLTDDLVQLEALW-RDNWIAN 47 (437) T ss_pred Cc-------------------------hhhhhHhHHh-----cCCCcc--ccceeecCccccccHHHHHHHH-HhCchhh Confidence 11 1222222211 11 110 0111112222223344444333 4799999 Q ss_pred HHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCC--------ee Q lcl|NC_020866. 80 GVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEG--------QY 150 (547) Q Consensus 80 s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g--------~~ 150 (547) .++++.-.-.++..|.|...+.++ +..+.+++.++++.+.+.+.+++ .+.+||-+++=++ .++. .- T Consensus 48 ~~vd~~a~d~~r~~~~i~~~d~~~---~~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~--~d~~~~~~pl~~~~ 122 (437) T protein:vir:52 48 KVCIKRPEDMVRNWREIYSNDLNS---KQLDLFTKFERSLKLRETLTKALQWSSLYGSVGLLVV--TDSQNTSAPLKPTE 122 (437) T ss_pred HHhhcchHHhhcCCceEecCCCCH---HHHHHHHHHHHhhcHHHHHHHHHHhcccccceEEEEE--ecCCCcccccccCC Confidence 999999999999999997643333 23345677777777777777666 4889997665443 2221 12 Q ss_pred eEEeeEeeccceeEeccc----------CCcceeeccCCCCceeeccCceEEEeec---CCcCccccchhHHHHHHHHHH Q lcl|NC_020866. 151 FPDKLIWRDPRHFRFQKS----------DLATPLLLDDNGAEVPLPAFRFIFADIP---AKSGILLRSGLARVAAWGWMF 217 (547) Q Consensus 151 ~~~~l~~~~~~~~~~~~~----------~~~~~~~~~~~~~g~~l~~~k~i~~~~~---~~~~~p~G~gll~~~~~~~~~ 217 (547) .++.|...++.++..... +....+...+.+.+..+.+.++|++.+. 
....+.+|.+++..+|....- T Consensus 123 ~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~p~~y~v~~~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~ 202 (437) T protein:vir:52 123 RLKRLIILPKWKISPTGTKDDDVLSPNFGRYSEYSILGGSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKR 202 (437) T ss_pred ceeEEEEechhhccccccccccccccccCcceEEEEecCCcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHH Confidence 356677777766542111 1122233333444567888888877533 345678899999999988776 Q ss_pred HHHHHHHHHHHHHHcCCCeEEEe-----CCCCCCHHHHHHHHHHHHHHhc-CeeEEecCCceEEEEecCCCCCchHHHHH Q lcl|NC_020866. 218 KAFTARDWAIFTQTYGQPLRIGK-----WQPGASEQDKDTLFRAVANIAG-DCAAIIPETMSIDFVESKSIGSSVDLYEK 291 (547) Q Consensus 218 K~~~~~~w~~f~E~~G~P~~vgk-----~~~~~~~~~~~~l~~al~~~~~-~~~~vip~~~~ie~~e~~~~~~~~~~f~~ 291 (547) -......-+..+.++.+++.--+ ... ..++...+..+.+..+.+ .+.+++..+.+++.+...- ++ -.. T Consensus 203 ~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~-~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~e~~~~~~-sg----l~~ 276 (437) T protein:vir:52 203 FDSASVNVGDLIFESKIDIFKIAGLSDKIAA-GMENEVASVISAVQEIKSATNSLLLDAENEYDRKELTF-TG----LKD 276 (437) T ss_pred HHHHHHHHHHHHHHcCCCceecchHHHHhcC-CcHHHHHHHHHHHHHhcCCCceEEEcCCcceEEEecCc-CC----HHH Confidence 66666666777888877654322 111 123344455555555544 5788999999999886543 33 345 Q ss_pred HHHHHHHHHHHHHhcchhc--cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecC Q lcl|NC_020866. 292 RINHLDQQISKAVLGQTTT--TDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIAR 369 (547) Q Consensus 292 li~~~d~~Isk~ilGqtlt--s~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~ 369 (547) +++..-.+||.+ .+-.+| .+...++.|.|+-....+-+.+++.......-+.+.|++.++.-.||+...--+|+|.. 
T Consensus 277 ~l~~~~~~iaaa-~~iP~t~L~G~s~~Glasge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~~~~~~~f~p 355 (437) T protein:vir:52 277 LLTEFRNAVAGA-ADMPVTILFGQSVSGLASGDEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGLPADWWFEFVP 355 (437) T ss_pred HHHHHHHHHHHH-hcCchhhhcCcCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCC Confidence 556666778866 443332 12223445777777788888899988665555556688888777777533334566654 Q ss_pred ccccC-------HHHHHHHHHHHHhCCCCCCHHHHHHHhC----CCC-CCCcchhcccccccccc-cccccccccccCCc Q lcl|NC_020866. 370 PKTED-------LTKLASSLDTLVRLGMEIEENEVRSRFG----FSQ-PKPGAKLLRPLENPATQ-AAPPGMDPGAGDLK 436 (547) Q Consensus 370 ~~~ed-------~~~~a~~~~~l~~~G~~v~~~~~~e~~G----lp~-p~~~e~~~~~~~~~~~~-~~~~~~~~~~~~~~ 436 (547) ....+ .++.+++++++++.|+ ++.+++++++. ++. +.++.+-.....++... .++++..+ .+ T Consensus 356 L~~~s~kekae~~~~~a~a~~~~~~~g~-i~~~e~r~~L~~~g~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 430 (437) T protein:vir:52 356 LTTVKQEQQINMLNTFATAANTLIQNGV-LNEYQIANELRESGLFANISAEHIEELKNADEFAGNFEEPEKMEG----AQ 430 (437) T ss_pred cCCcCHHHHHHHHHHHHHHHHHHHhcCC-CCHHHHHHHHHhcCCCCCCCccccccccCCCCCCCccCCCCCCCC----CC Confidence 43333 2456677888889997 88888888762 111 11110001111110000 01111000 00 Q ss_pred ccccccc Q lcl|NC_020866. 437 TAERKLK 443 (547) Q Consensus 437 ~~~~~~~ 443 (547) +..+++. T Consensus 431 ~~~~~~~ 437 (437) T protein:vir:52 431 VQNSEDQ 437 (437) T ss_pred CCCCCCC Confidence 0000000 No 92 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=99.68 E-value=1.4e-15 Score=102.06 Aligned_cols=374 Identities=12% Similarity=0.068 Sum_probs=189.8 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |.=.++|.+..|+...+..+.. +..|.. ... ...+.+ .+-+.|.+ T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~----~~~----~~~~~~----l~~~~v~~ 45 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNLTD-----------------------TVWCSI----PSE----KLKELS----IKKWAIDS 45 (395) T ss_pred CchHHHHHhhhccccccccccc-----------------------chhhcc----ccc----cchhhh----hhhHHHHH Confidence 7744444444443222211110 000000 000 000111 14577999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEe Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDK 154 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~ 154 (547) |++.+...|.+++|++...++ ..... +..+|+ ++ ...++++.+. +.+.+|.+.+.+. .++.+.+.. T Consensus 46 ~v~~Ia~~ia~~p~~~~~~~~-~~~~~----~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~---~~~~~~~~~ 117 (395) T protein:vir:40 46 CANKIANTLSCAEVLTYEKGE-EVRKK----NWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQ---DEYIYVADS 117 (395) T ss_pred HHHHHHHHHhhCceeeccCCc-cccch----HHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEe---cCceeecCC Confidence 999999999999999975432 22222 223332 33 3555666544 4667898886553 233332222 Q ss_pred eEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_020866. 155 LIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQ 234 (547) Q Consensus 155 l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~ 234 (547) +...... .. . ....-+...+......+++...+++++....+.+++.++...+.-. +... ...+....+. 
T Consensus 118 ~~~~~~~--~~-~-~~~~~v~~~~~~~~~~~~~~evih~r~~~~~~~~~~~~l~~~~~~~--~~~~----~~~~~~~~~~ 187 (395) T protein:vir:40 118 FTKNDKS--LY-E-NTYTEVTLKDLTLKKEFKESEVLHLTLNNESIKSIIDGFYLLYGDL--LTAA----VNKYKKLNSR 187 (395) T ss_pred ccccccc--cc-c-ceeeeeeecCceeeeeeccccEEEeecCCCCccccchhHHHHHHHH--HHHH----HHHHHhcCCC Confidence 2111000 00 0 0000011122222345788888877877777788888776544321 1111 1122222333 Q ss_pred -CeEEEeCCCCCCHHHHHHHHHHHHHH----hcC--eeEEecCCceEEEEecCCCCCchHHHHH---HHHHHHHHHHHHH Q lcl|NC_020866. 235 -PLRIGKWQPGASEQDKDTLFRAVANI----AGD--CAAIIPETMSIDFVESKSIGSSVDLYEK---RINHLDQQISKAV 304 (547) Q Consensus 235 -P~~vgk~~~~~~~~~~~~l~~al~~~----~~~--~~~vip~~~~ie~~e~~~~~~~~~~f~~---li~~~d~~Isk~i 304 (547) |..+.+++...+++.++++.+.+.+. .++ .+++++.|++++-+..+.... .|.+ +.+.+-++|+++ T Consensus 188 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~---q~~e~~~~~~~~~~~Ia~~- 263 (395) T protein:vir:40 188 KIIVKLKAMFGQTPEAEEKLRLMLSERMKKFLAEGDSALPVEDGMEIDELAGDSKIA---ESRDIKKMIDDVFEMVANS- 263 (395) T ss_pred CceEEEecccCCCHHHHHHHHHHHHHHHHHhhccCCceeecCCCceEEeccCChhhh---hHHHHHHHHHHHHHHHHHH- Confidence 56667777777777776666655543 223 357788888877665432211 2433 333344789888 Q ss_pred hcc--hhccCccccccch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHH Q lcl|NC_020866. 305 LGQ--TTTTDAVAGGHAV-SQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASS 381 (547) Q Consensus 305 lGq--tlts~~~ggs~a~-~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~ 381 (547) +|- .+. +|+++- .+........-+...++.|++.||+.|++.--.. ...+.+|.++.....|.+++++. 
T Consensus 264 fgVPp~~l----~~~~sn~e~~~~~f~~~~L~P~~~~ie~~l~~kLl~~~~~~----~g~~i~fd~~~ll~~d~~~~~~~ 335 (395) T protein:vir:40 264 FNIPLGLA----KGDTVGLSEQVNSFLMFSINPIAEMFTDEGNRKFYGRDSVL----ERTYMKLDTTRIKVQDIQEIASS 335 (395) T ss_pred hCCCHHHh----cCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhc----CCceEEEechhhhccCHHHHHHH Confidence 342 222 133332 2333444556778888899999998875533211 11223454455557799999999 Q ss_pred HHHHHhCCCCCCHHHHHHHhCCCCCCC--cchhcccccccccccccccccccccCCccccccccccccc Q lcl|NC_020866. 382 LDTLVRLGMEIEENEVRSRFGFSQPKP--GAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQ 448 (547) Q Consensus 382 ~~~l~~~G~~v~~~~~~e~~Glp~p~~--~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (547) +.++++.|+ ++.+++|+.+|+|+-+. ++....+...............+....... . . T Consensus 336 ~~~~~~~G~-~t~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~~kgge~~~~~~---~-----~ 395 (395) T protein:vir:40 336 MDVLFHIGV-NTIDDNLRMIGREPVMSPETQERFVTKNYAPLGENEEDLKGGDINENKG---D-----S 395 (395) T ss_pred HHHHHhCCC-CCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCCCCCCCCcC---C-----C Confidence 999999997 99999999999986533 443333322111111111111111000000 0 0 No 93 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=99.67 E-value=7.6e-16 Score=103.48 Aligned_cols=365 Identities=12% Similarity=0.059 Sum_probs=216.0 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |. |.+..-++... -+... .......+..+... |.. 
+......+-+.|.+ T Consensus 1 Mg----~f~~~~~~~~~--~~~~~----~~~~~~~~~~~~~~----------------~~~-----v~~~~~l~~~~v~~ 49 (382) T protein:vir:48 1 MP----IFNLATESPPD--NQGGF----FDVVDSDFLASLKG----------------NEW-----VSAETALRNSDLFS 49 (382) T ss_pred Cc----cccccccCCcc--ccccc----ccchhhhccccccC----------------Ccc-----cchHhhhccHHHHH Confidence 77 55442211100 00000 00000011100000 000 00000014578999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhC----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKR----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) |++.+...|.+++|++.-... + .++.+ +.+.++++.+. +.+.+|-++++++.... | .+..| T Consensus 50 ~i~~ia~~ia~~~~~~~~~~~--------~---~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~-G--~~~~l 115 (382) T protein:vir:48 50 IINQLSNDLATVKLITSRKKL--------Q---GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNEN-G--RDMKW 115 (382) T ss_pred HHHHHHHhhccCceeeecchh--------h---hhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC-C--cEEEE Confidence 999999999999999974321 1 12333 35777888887 57789999999976432 3 35689 Q ss_pred EeeccceeEecccC--Cccee-ecc-C--CCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 156 IWRDPRHFRFQKSD--LATPL-LLD-D--NGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFT 229 (547) Q Consensus 156 ~~~~~~~~~~~~~~--~~~~~-~~~-~--~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 229 (547) .++++.++++..+. ....+ ... + .+....+++..+|++++....+..+|.|.+..+....-.-....++...+. 
T Consensus 116 ~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~ 195 (382) T protein:vir:48 116 EYLRPSQVSFNRLDNKDGIYYNITFDDPRIPPKQHVPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSL 195 (382) T ss_pred EEEcCceeEEEEcCCCCeEEEEEEecCccccceeEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999998765432 22222 221 2 133457888888888877777889999999999998888888888999999 Q ss_pred HHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcC--eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc- Q lcl|NC_020866. 230 QTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGD--CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG- 306 (547) Q Consensus 230 E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~--~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG- 306 (547) ..-+.|--+.+++...+++++.++.++..+...+ ..+|++.|++++=+..+. . ...|.+..++..++|+++.-- T Consensus 196 ~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~--~-d~q~~e~~~~~~~~Ia~afgVp 272 (382) T protein:vir:48 196 KNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVLDDLEDFTPLEIKS--N-VSQLLKQADWTTGQFAKVYGIP 272 (382) T ss_pred hccCCCceEEEeCCCCChHHHHHHHHHHHhhccCCCCeeEcCCCceEEEccCCh--h-HHHHHHHHHHHHHHHHHHhCCC Confidence 9999999999998888888888888887776644 457888888776654332 2 224777788888999988411 Q ss_pred chhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHH Q lcl|NC_020866. 307 QTTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLV 386 (547) Q Consensus 307 qtlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~ 386 (547) ..+.+....++ ...+........-+..-++.+++.||+.|.+++- .+ ..+.+. .+.......+.+|. 
T Consensus 273 ~~~lg~~~~~~-~~~~~~~~~~~~~l~p~~~~i~~~l~~~l~~~~~-~~-----~~~~~~------~~~~~~~~~~~~l~ 339 (382) T protein:vir:48 273 DNVVGGQGDQQ-SSLEMSSDLYSKAVSRYLRPFLSELSQKLSCDVD-AD-----IFPAVD------PTGSNYISRINSLV 339 (382) T ss_pred HHHhCCCCCcc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhh-hh-----hhhhhc------cchhHHHHHHHHHh Confidence 22222222222 2333444556677778888899999887755431 11 111111 22344556677888 Q ss_pred hCCCCCCHHHHHHHhC---CCCCCCcchhcccccccccccccccccccccCCcc Q lcl|NC_020866. 387 RLGMEIEENEVRSRFG---FSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKT 437 (547) Q Consensus 387 ~~G~~v~~~~~~e~~G---lp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (547) ..|+ .+.+++|+.++ +- |.+ +-... .+. +....+...... T Consensus 340 ~~g~-~t~~e~r~~l~~~g~~-~~~---~~~~~-~~~-----~~~~GGd~~~~~ 382 (382) T protein:vir:48 340 KTGT-LAQNQGLYILQQAEIL-PKE---LPNGE-NPN-----STLKGGEEDGQD 382 (382) T ss_pred hcCc-cCHHHHHHHHhhCCCC-Ccc---hhhhh-cCC-----CCCCCCCCCCCC Confidence 8887 88999999874 31 111 00000 000 000011000000 No 94 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.66 E-value=2.5e-14 Score=95.13 Aligned_cols=444 Identities=9% Similarity=-0.034 Sum_probs=216.6 Q ss_pred CCCCCe-------------------eeCCCCCcc-cchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHH-hhcCC Q lcl|NC_020866. 1 MAQKPV-------------------LLDRWGNPV-QRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILR-AADHG 59 (547) Q Consensus 1 ma~~~~-------------------~~~~~g~~~-~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr-~a~~g 59 (547) |+-.+. |..+...-+ ....+...+|. ++..+-...+.+.+-+. + ...+- ..... 
T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~---d~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~ 99 (537) T protein:vir:10 25 VGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAM---DGLDVEGGTFSAYANPN-L-SEGLVLWYAQQ 99 (537) T ss_pred cCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhc---cccccchhhhhhhcccc-c-cchhhhhcccc Confidence 110000 111111111 11111111110 00000000000000000 0 01111 11111 Q ss_pred CHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCee Q lcl|NC_020866. 60 DPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSF 138 (547) Q Consensus 60 ~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv 138 (547) .+.. ++|+..+ ++.+-++.++++.-.-.++..|.|...+++..+.+..+.+.+.++++.+...+.+++ .+.+||.++ T Consensus 100 ~~~~-~~l~a~Y-~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~ 177 (537) T protein:vir:10 100 AFIG-HQMCALI-ATHWLVNKACSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRI 177 (537) T ss_pred CCcc-HHHHHHH-HhCchhhhhhhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceE Confidence 1221 3444333 478999999999999999999999876665555556677778888887666666666 578899776 Q ss_pred eeeEEe-ecCC------------eeeEEeeEeeccceeEecc-----cC---CcceeeccCCCCceeeccCceEEEeecC Q lcl|NC_020866. 139 TEIVWN-TSEG------------QYFPDKLIWRDPRHFRFQK-----SD---LATPLLLDDNGAEVPLPAFRFIFADIPA 197 (547) Q Consensus 139 ~Ei~w~-~~~g------------~~~~~~l~~~~~~~~~~~~-----~~---~~~~~~~~~~~~g~~l~~~k~i~~~~~~ 197 (547) +=+.=. .++. ...++.|..++|.|..... .+ ..-.....-...+..+-+.++|++.... 
T Consensus 178 i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v~g~~iH~SRli~f~g~~ 257 (537) T protein:vir:10 178 ALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLINGKKYHRSHLAIYINDE 257 (537) T ss_pred EEEeecCcCCcccccccccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeeecCeEecceeEEEecCCC Confidence 543311 1111 1123456666776654211 11 0000000001234466667777654332 Q ss_pred ------CcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCC-CCCHHHHHHHHHHHHHHhc-CeeEEec Q lcl|NC_020866. 198 ------KSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQP-GASEQDKDTLFRAVANIAG-DCAAIIP 269 (547) Q Consensus 198 ------~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~-~~~~~~~~~l~~al~~~~~-~~~~vip 269 (547) ...+.+|.+++..+|....--......-+..+.++.+++...+... -.+++...+-.+.+....+ .+.++++ T Consensus 258 ~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~n~g~~~id 337 (537) T protein:vir:10 258 VVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQFDETMSWWTATRDNYQVRVVD 337 (537) T ss_pred CchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHHHHHHHHHHHhhcCCcceeEec Confidence 2345679999999998876655566666777888887755333211 1233334443444444444 4667777 Q ss_pred C-CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhc---cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 270 E-TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTT---TDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNR 345 (547) Q Consensus 270 ~-~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlt---s~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~ 345 (547) . 
+.+++.+...- ++ -..+++..-.+||-+ .|-.+| +.+.+|..|.|+--...+-+.+++....+...+++ T Consensus 338 ~e~e~~e~~~~~l-sg----l~~~l~~~~~~iAa~-~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I~~~Qe~l~p~l~~ 411 (537) T protein:vir:10 338 KDNEDVVQIDTTL-ND----LDKVIMNQYQLVCAI-ARTPAPKMLGTVPTGFNSTGDYEEASYHEECESTQDDMRPLIDR 411 (537) T ss_pred CCCceeEEEeccC-CC----HHHHHHHHHHHHHhh-hCCCceeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 7 48888876543 33 345666666667766 454433 33445667778888888889999888778777754 Q ss_pred HHHHHHHHhcCCCcccccEEEecCccccCHHHHH-------HHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccc Q lcl|NC_020866. 346 DLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLA-------SSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLEN 418 (547) Q Consensus 346 ~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a-------~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~ 418 (547) +++.++...|++. ..-.|+|......+-+++| ++++++.+.|+ ++.+++|+.++-.. ..+..-+.+..+ T Consensus 412 -l~~ll~~~~~~~~-~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~-i~~~Evr~~L~~~~-~~g~~~l~~~~~ 487 (537) T protein:vir:10 412 -HHQLVCRSHLRKR-IRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGA-VDGVDVNEYLRMDP-TLGFTSITPAMR 487 (537) T ss_pred -HHHHHHHhcCCCC-cceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCC-CCHHHHHHHHhccC-ccccccccCCCC Confidence 7787777777642 2345666655555545444 56888888886 99999999976421 111111110000 Q ss_pred c-------ccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchh Q lcl|NC_020866. 419 P-------ATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGA 472 (547) Q Consensus 419 ~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (547) . .....++......++.+............+...+...+.. 
++ T Consensus 488 ~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~-----------~~ 537 (537) T protein:vir:10 488 PTDAEDIDVDDEGKPVRIIEDQPAPSEMFGATSSGESANDPRDSGAAF-----------ED 537 (537) T ss_pred hhhhhcccCCccCCcCCCCCCCCCccccCCCCccccccCCCccCcccc-----------CC Confidence 0 0000000000000000000000000000000000111110 11 No 95 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=99.64 E-value=6.6e-15 Score=98.31 Aligned_cols=361 Identities=13% Similarity=0.039 Sum_probs=192.7 Q ss_pred eCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_020866. 8 LDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKR 87 (547) Q Consensus 8 ~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~ 87 (547) .+-+.+.+.+.. + +....+. . ........ ..+ +-+.|.+|++.... T Consensus 1 Mg~f~~l~~~~~-----~--------------~~~~~~~---~----~~~~~~~~-------~~l-~~~~v~~~i~~Ia~ 46 (376) T protein:vir:78 1 MGFFSELFKRNK-----E--------------IEWMWDL---D----FLEDKTTK-------VYL-KKMALNTCVKHIAR 46 (376) T ss_pred CchhhhhhccCC-----c--------------cccccch---h----hccccchh-------hhh-hhHHHHHHHHHHHH Confidence 222211111100 0 0000000 0 00000000 111 35679999999999 Q ss_pred HHhcCCceEecCCCCHHHHHHHHHHHHH-HhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEe Q lcl|NC_020866. 88 SVSQLDITVEPASDDPQDVKIADMVRDW-LKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRF 165 (547) Q Consensus 88 ~v~~~~w~v~p~~~~~~~~~~ae~v~~~-l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~ 165 (547) .+.+++|++...+ ...+..++..+... -..+...++++.++ +.+.+|.+.+.+++ ++...+..+.++.+..+.. 
T Consensus 47 ~ia~~p~~~~~~~-~~~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r---~~~~~~~~~~~~~~~~~~~ 122 (376) T protein:vir:78 47 TIAKSDFRLKNGE-TSVRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSD---TDDFLIADSYVRKEFAFFP 122 (376) T ss_pred hhcccceeecccc-ccccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEe---CCCeeeccceeecccceee Confidence 9999999997433 22222222222210 11224555666555 45678998876654 2333455566666654433 Q ss_pred cccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCC-eEEEeCCCC Q lcl|NC_020866. 166 QKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQP-LRIGKWQPG 244 (547) Q Consensus 166 ~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P-~~vgk~~~~ 244 (547) +.... +...+.+....++....+++++....+.+++.++...+.-. .+ ..+..+....|.+ ..+.+++.. T Consensus 123 ~~~~~---~~~~~~~~~~~~~~~evih~~~~~~~~~~~~~~~~~~~~~~--~~----~~~~~~~~~~~~~~~~~~~~~~~ 193 (376) T protein:vir:78 123 DVFEG---VTVKDYRYNRNFSMDDVIFLEYGNERLSAFTDGMFEDYGEL--FG----KMIRAQMRNFQIRGAVNFKMAGV 193 (376) T ss_pred eeeee---eeeecceeeeeeccccEEEeccCCCCchhhhhHHHHHHHHH--HH----HHHHHHHhcCCCceeEEEccCCC Confidence 22211 11122222345777778777777777777666665443211 11 1222233344544 444556666 Q ss_pred CCHHHHHHHHHHHHHHh----cCe--eEEecCCceEEEEecCCCCCc--hHHHHHHHHHHHHHHHHHHhc-chhccCccc Q lcl|NC_020866. 245 ASEQDKDTLFRAVANIA----GDC--AAIIPETMSIDFVESKSIGSS--VDLYEKRINHLDQQISKAVLG-QTTTTDAVA 315 (547) Q Consensus 245 ~~~~~~~~l~~al~~~~----~~~--~~vip~~~~ie~~e~~~~~~~--~~~f~~li~~~d~~Isk~ilG-qtlts~~~g 315 (547) .+++.++++.+.+.+.. +++ .++++.|++++-++.+..... 
...|.+..++..++|+++.-- ..+.+ T Consensus 194 ~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~---- 269 (376) T protein:vir:78 194 ADKDKQTKLQEYIDKVYASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLH---- 269 (376) T ss_pred CCHHHHHHHHHHHHHHhccccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhC---- Confidence 78888888888777653 233 345788988877654432221 124677778888899988421 22221 Q ss_pred cccch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCH Q lcl|NC_020866. 316 GGHAV-SQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEE 394 (547) Q Consensus 316 gs~a~-~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~ 394 (547) |+++. .+........-+...++.|++.+|+.|+.+ .....++.+...-..|.++++++++++++.|+ ++. T Consensus 270 ~~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~--------~~~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~-~t~ 340 (376) T protein:vir:78 270 GDMADLSNNMKAYMEYCIDPLTKKLEDELNAKLFTF--------SEFLAGEHIKIIHKKDIIENAEAVDKLVASGS-FNR 340 (376) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhhhCCc--------ccceecccchhhcccCHHHHHHHHHHHHhCCC-cCH Confidence 23332 222334455667888899999999877443 22222344444456789999999999999997 899 Q ss_pred HHHHHHhCCCCCCCc--chhccccccccccccccccccccc Q lcl|NC_020866. 395 NEVRSRFGFSQPKPG--AKLLRPLENPATQAAPPGMDPGAG 433 (547) Q Consensus 395 ~~~~e~~Glp~p~~~--e~~~~~~~~~~~~~~~~~~~~~~~ 433 (547) +++|+.+|+|+-+++ +..+.+. .-.+... .+. 
.+ T Consensus 341 NE~R~~lg~~p~~~g~~d~~~~~~-n~~~~~~-~~e---~g 376 (376) T protein:vir:78 341 NEVRELLGAERVDNPELDKYLITK-NYQSADE-GGE---DG 376 (376) T ss_pred HHHHHHhCCCCCCCCCCceeeecc-Cceehhc-ccc---CC Confidence 999999999875554 3333222 1111100 000 00 No 96 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=99.63 E-value=1.1e-14 Score=97.17 Aligned_cols=362 Identities=13% Similarity=0.039 Sum_probs=196.7 Q ss_pred hhhhHHHHhhh--hccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHH Q lcl|NC_020866. 29 VTGVRSPLSGY--PGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDV 106 (547) Q Consensus 29 ~~~~~~~~~~~--~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~ 106 (547) +.-+.+++... +....+.. .+. .++.....+.+.|.+|++.+...|.+++|++...+. .... T Consensus 1 Mg~f~~~f~~~~~~~~~~~~~----~~~-----------~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~-~~~~ 64 (385) T protein:vir:95 1 MGLFDSVFKRHSELSWMYDLE----FLQ-----------DKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNT-KEKG 64 (385) T ss_pred CchhhhhhccCcccccccchh----hhh-----------ccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCc-cccc Confidence 22222222111 11111111 111 111111125788999999999999999999975432 2222 Q ss_pred HHHHHHHHHHh-C----CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcceeeccCCC Q lcl|NC_020866. 107 KIADMVRDWLK-R----EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATPLLLDDNG 180 (547) Q Consensus 107 ~~ae~v~~~l~-~----~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (547) . +..+|. + +...++++.++ +.+.+|.+.+.+. ++++ +.+... 
+..+.......+.- ......+.+ T Consensus 65 ~----l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~--~~~~-~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~ 135 (385) T protein:vir:95 65 T----LYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKN--DEGH-FFVADD-FEKEDELGLYSHRF-TNVLVNDFE 135 (385) T ss_pred h----HHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEe--cCCC-eeeccc-cccccccccccccc-eeeeecccc Confidence 2 333442 3 34666766666 4667899886543 3333 333222 22222222211111 111122223 Q ss_pred CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeC--CCCCCHHHHHHHHHHHH Q lcl|NC_020866. 181 AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKW--QPGASEQDKDTLFRAVA 258 (547) Q Consensus 181 ~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~--~~~~~~~~~~~l~~al~ 258 (547) ....+++...|++++....+..+|.|++..+.-..-. .+..+ ..-|.|--+.+. ....+++.++++.+.+. T Consensus 136 ~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~------~~~~~-~~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~ 208 (385) T protein:vir:95 136 FKRVFTMDDVIYLKYNNQKLDAFSLGLFEDYGEIFGR------MIDLQ-MLNNQIRGILKVDATKFYNKEKQKELQAYID 208 (385) T ss_pred eeeeeccccEEEecCCCCCcccccchHHHHHHHHHHH------HHHHH-HhcCCCceEEEeCCccCCCHHHHHHHHHHHH Confidence 3356788888887887777788999999887654321 12222 222334333333 34457777777777666 Q ss_pred HHh----cCe--eEEecCCceEEEEecCCCC-Cc--hHHHHHHHHHHHHHHHHHHhc-chhccCccccccchhH-HHHHH Q lcl|NC_020866. 259 NIA----GDC--AAIIPETMSIDFVESKSIG-SS--VDLYEKRINHLDQQISKAVLG-QTTTTDAVAGGHAVSQ-EHRLV 327 (547) Q Consensus 259 ~~~----~~~--~~vip~~~~ie~~e~~~~~-~~--~~~f~~li~~~d~~Isk~ilG-qtlts~~~ggs~a~~~-vh~~v 327 (547) +.- +++ .++++.|++++-++..... .+ ...|.+..++...+|+++.-- ..+. +|+++..+ ..... 
T Consensus 209 ~~~~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l----~~~~sn~e~~~~~~ 284 (385) T protein:vir:95 209 TLFDAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLV----LGEMADLEKTIESY 284 (385) T ss_pred HHhhhhhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHh----cCCCcCHHHHHHHH Confidence 542 223 4668888888766533221 11 124778888899999998422 1222 23444333 33455 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCC- Q lcl|NC_020866. 328 QEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQP- 406 (547) Q Consensus 328 ~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p- 406 (547) ...-+...++.|++.||+.|+.+--..++ +.+|.++.....|.+.++++++++++.|+ ++.+++|+.+|+|+- T Consensus 285 ~~~~l~P~~~~ie~~l~~~L~~~~~~~~~-----~~~fd~~~l~~~D~~~~~~~~~~~~~~g~-lt~NE~R~~~g~~p~~ 358 (385) T protein:vir:95 285 LQFCINPLLRKIEAELNSKFFYQDEYLND-----DMHIKVVGIDKRDPLKLSEAIDKLVASGT-FTRNQVRIMTGEEPAD 358 (385) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhhcccc-----eEEEechhhhccCHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCC Confidence 56678889999999999988765422221 23454455567799999999999999997 999999999999863 Q ss_pred -CCcchhcccccccccccccccccccccCCc Q lcl|NC_020866. 407 -KPGAKLLRPLENPATQAAPPGMDPGAGDLK 436 (547) Q Consensus 407 -~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (547) +.++..+.+.. -.+... ..++..... 
T Consensus 359 ~~~gd~~~~~~n-~~~~~~---~kgge~~~e 385 (385) T protein:vir:95 359 DPELDKFIITKN-LQSADA---FKGGESNEE 385 (385) T ss_pred CCCCceeeeccc-ceeccc---ccCCCCCCC Confidence 23343333221 111110 001100000 No 97 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=99.63 E-value=6.1e-15 Score=98.52 Aligned_cols=338 Identities=12% Similarity=0.063 Sum_probs=201.5 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHH-hhh---hccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPL-SGY---PGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~-~~~---~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~ 76 (547) |. |++++.+ +....+. .....+ .+. ....+|+. .+| +-+ T Consensus 1 M~----~~~~f~~----r~~~~~~------~~~~~~~~~~~~~~~~~v~~~---~al--------------------~~~ 43 (359) T protein:vir:10 1 MS----ILNPFER----RSSITPN------NYYPFMVQNGSIVPNSLVDAT---EAL--------------------KNS 43 (359) T ss_pred Cc----ccchhhc----cccCCCC------cchhhhhccccccCCcccCHH---Hhh--------------------cch Confidence 87 7665443 2111111 000000 000 01111221 111 346 Q ss_pred HHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeee Q lcl|NC_020866. 77 HYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKRE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYF 151 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~ 151 (547) .|.+|++.....|.++++.- ++ ....++.++ ...++++.+. +.+.+|-+++++++... | . 
T Consensus 44 av~~cv~~ia~~ia~~p~~~-----~~-------~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-g--~ 108 (359) T protein:vir:10 44 DLYAVTSLISSDIAGTRFIG-----NQ-------VFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDN-S--L 108 (359) T ss_pred HHHHHHHHHHHhhhcCcccc-----ch-------HHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCC-C--e Confidence 78999999999999998741 11 123345554 3455566655 45679999999886432 2 4 Q ss_pred EEeeEeeccceeEecccCCcceeec--cCCCCceeeccCceEEEeecCC----cCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 152 PDKLIWRDPRHFRFQKSDLATPLLL--DDNGAEVPLPAFRFIFADIPAK----SGILLRSGLARVAAWGWMFKAFTARDW 225 (547) Q Consensus 152 ~~~l~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~l~~~k~i~~~~~~~----~~~p~G~gll~~~~~~~~~K~~~~~~w 225 (547) +..+.++++.++.+..+++...+.. ..++....+++...+++++... .+..+|.|.+..+....-......++. T Consensus 109 ~~~l~~l~~~~v~i~~~~~~~~y~~~~~~~~~~~~~~~~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~ 188 (359) T protein:vir:10 109 MKELRLIPSNAITIDLTDDTLTYEVNQFDDYPSAKYNASEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLS 188 (359) T ss_pred EEEEEEeCCceEEEEEcCCeEEEEEEecCCceEEEEcccceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHH Confidence 7789999999888765544332222 2345567888888877765543 244679999999999888888889999 Q ss_pred HHHHHHcCCCeEEEeCCC-CCCHHHHHHHHHHHHHHhcC----eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHH Q lcl|NC_020866. 226 AIFTQTYGQPLRIGKWQP-GASEQDKDTLFRAVANIAGD----CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQI 300 (547) Q Consensus 226 ~~f~E~~G~P~~vgk~~~-~~~~~~~~~l~~al~~~~~~----~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~I 300 (547) ..+...-+.|--+-+++. ..++++++++.+.+.+..+. ..+|++.|++++-++.+. 
....|.+..++..++| T Consensus 189 ~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~---~d~q~le~~~~~~~~I 265 (359) T protein:vir:10 189 LSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFEKANGGNNSGRVMVLDQSADFSTVSINA---DVANYLNSMNWGRTQI 265 (359) T ss_pred HHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCceecCCCcceeeecCCH---HHHHHHHHHHHHHHHH Confidence 999988888988889875 46888888888888766432 358999999887665332 1113677778888999 Q ss_pred HHHHhcc-hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHH Q lcl|NC_020866. 301 SKAVLGQ-TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLA 379 (547) Q Consensus 301 sk~ilGq-tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a 379 (547) +++.--. .+..+.+..+...+++ ++.....+.--..-+++.|+..+.+.+ .++.+ . .+.+ |.+.+. T Consensus 266 a~~fgVPp~~lg~~~~~~~~~~~~-e~~~~~~l~~~l~p~~~~l~~~l~~~~-~~~~~---~--~~~~------d~~~~~ 332 (359) T protein:vir:10 266 AKAFGVSDSYLNGTGDQQSSLDQI-KDLYVNALNRFIEPLISELRIKCDSSI-GVDMS---P--ITDY------SNSVFK 332 (359) T ss_pred HHHhCCCHHHhCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhh-cccch---h--hhhc------CHHHHH Confidence 9884211 1222211111112222 222223333334444455554443322 22211 0 1222 234455 Q ss_pred HHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhc Q lcl|NC_020866. 
380 SSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLL 413 (547) Q Consensus 380 ~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~ 413 (547) ..+.++++.|+ ++.+++|+.+|+|+ ++ T Consensus 333 ~~~~~~~~~G~-~t~NE~R~~l~~~p------v~ 359 (359) T protein:vir:10 333 ADILNWVKEGI-IEPTEAKTLLESKG------II 359 (359) T ss_pred HHHHHHHhCCC-cCHHHHHHHhCCCC------CC Confidence 66778899998 89999999999964 23 No 98 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=99.58 E-value=2.8e-14 Score=94.86 Aligned_cols=373 Identities=13% Similarity=0.068 Sum_probs=192.1 Q ss_pred hhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHH Q lcl|NC_020866. 29 VTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKI 108 (547) Q Consensus 29 ~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ 108 (547) +. +-+++.......++.. ..++ ........+..+-+.|.+|+..+...|.+++|+|...+.... . T Consensus 1 Mg-l~d~~~~~~~~~~~~~---------~~~~--~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~-~-- 65 (395) T protein:vir:96 1 MG-ILDFFSFKKSGTLSDD---------DSGS--TTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTE-N-- 65 (395) T ss_pred Cc-chhhhcCCCCcccccc---------cccc--chhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCcccc-c-- Confidence 21 1122211111111111 1111 111122222224578999999999999999999975432211 1 Q ss_pred HHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcceeeccCCCCc Q lcl|NC_020866. 109 ADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATPLLLDDNGAE 182 (547) Q Consensus 109 ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g 182 (547) ..-+..+|+ ++ ...++++.++ +.+.+|.+.+.+.+. +...+.....+... ... ..-..+...+.... 
T Consensus 66 ~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~---~~~~~~~~~~~~~~---~~~-~~~~~v~~~~~~~~ 138 (395) T protein:vir:96 66 QKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPG---KGIYVADAFTQDKK---LSG-NKFKVSRVQGQTYE 138 (395) T ss_pred cchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcC---CceecCCccccccc---ccc-ceeeeeeeccceee Confidence 112334443 33 3455555554 455689988776542 22222211111110 000 00001111222223 Q ss_pred eeeccCceEEEeecCCcCccccchhHHHHH------HHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHH Q lcl|NC_020866. 183 VPLPAFRFIFADIPAKSGILLRSGLARVAA------WGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRA 256 (547) Q Consensus 183 ~~l~~~k~i~~~~~~~~~~p~G~gll~~~~------~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~a 256 (547) ..+++...+++++....+.+++.|+..... .....+.+..++...+....|.|.-+-+.+....++.+++..+. T Consensus 139 ~~~~~~dvih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (395) T protein:vir:96 139 KIFTFDQVIYLKNDNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGGRQPKSDKDFFKR 218 (395) T ss_pred eEeccCceEEecccCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCchhhHHHHHHHHHH Confidence 457777777777776666677766644322 22223344556666666667777777676555555555555554 Q ss_pred HH-HHhcCee--EEecCCceEEEEecCCCCCc---hHHHHHHHHHHHHHHHHHHhc-chhccCccccccch-hHHHHHHH Q lcl|NC_020866. 257 VA-NIAGDCA--AIIPETMSIDFVESKSIGSS---VDLYEKRINHLDQQISKAVLG-QTTTTDAVAGGHAV-SQEHRLVQ 328 (547) Q Consensus 257 l~-~~~~~~~--~vip~~~~ie~~e~~~~~~~---~~~f~~li~~~d~~Isk~ilG-qtlts~~~ggs~a~-~~vh~~v~ 328 (547) .. ...++++ ++++.|++++-+..+..... ...|.++...+-++|+++.-- ..+. +|+++. .+...... 
T Consensus 219 ~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l----~~~~sn~e~~~~~f~ 294 (395) T protein:vir:96 219 TIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLL----HGDIADNQKNYELLL 294 (395) T ss_pred HHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHh----cCCCccHHHHHHHHH Confidence 44 3334443 45677777665443322111 112334444556789988321 1222 133332 23334455 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCC Q lcl|NC_020866. 329 EDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKP 408 (547) Q Consensus 329 ~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~ 408 (547) ..-+...++.|+..||+.|++.--.. ..-+|.++.....|.+++++.++++++.|+ ++.+++|+.+|+|+-++ T Consensus 295 ~~~L~P~~~~ie~~l~~~Ll~~~e~~------~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~-~T~NE~R~~~gl~pi~~ 367 (395) T protein:vir:96 295 EGPIESLITNIVDGLEYAIFDKSETL------EGSFIKVTGLKNYDLFSISSQADKLISSGF-VFIDEVREEIGLPELPD 367 (395) T ss_pred HHHHHHHHHHHHHHHHhhcCChhhhc------CceeEeecchhccCHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCCC Confidence 56788888999999998876532111 112467777777899999999999999997 89999999999987544 Q ss_pred --cchhcccccccccccccccccccccCC Q lcl|NC_020866. 409 --GAKLLRPLENPATQAAPPGMDPGAGDL 435 (547) Q Consensus 409 --~e~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (547) +++++.+.. -.+....++........ T Consensus 368 ~~gD~~~~~~N-~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:96 368 GLGKVLYMTKN-YESVLERGGEVDEEVET 395 (395) T ss_pred CCCceeeeccc-ceechhccCCCCCCCCC Confidence 444443322 11111111110000000 No 99 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.50 E-value=3.3e-12 Score=83.56 Aligned_cols=398 Identities=10% Similarity=0.016 Sum_probs=204.2 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHH-HhhcCCCHHHHHHHHHHHHhhhHHHH Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQIL-RAADHGDPVRYLDLAEVIEERDPHYL 79 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~il-r~a~~g~~~~~~~L~~~m~~~D~~v~ 79 (547) |++.+. .-+ .. + .....+..+++.+. |..-.+ +... ...+......+. ++..+.++.+-++ T Consensus 1 ~~~~~~----a~~-------~~-~-~~~a~~~~~~~~~~---g~~~~~-d~~~~~~~~~~~~~~~~-~l~~lY~~~~l~r 62 (461) T protein:vir:80 1 MYSIDK----AKQ-------AK-I-DSKIVNRNDFMVGH---GKANSR-DKLTRQTPGNGQKLDLK-ACENLYASNSIAM 62 (461) T ss_pred Cccchh----hhh-------hh-h-hhhhhhhhHHHhhc---CCcchh-hhhhccccCcccccCHH-HHHHHHHhCCccc Confidence 764111 111 01 1 11111222222221 111000 1111 111221111222 2223335788899 Q ss_pred HHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHH-HHHHhhhhcCeeeeeEEeecCCee-------- Q lcl|NC_020866. 80 GVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVF-HILDCISKGYSFTEIVWNTSEGQY-------- 150 (547) Q Consensus 80 s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~-~~l~a~~~G~sv~Ei~w~~~~g~~-------- 150 (547) .++++-..-.++..|.|...+ + +..+.++++++++.....+. .+-.+..||.+.+=+.-. +++.+ T Consensus 63 ~iVd~~a~d~~r~g~~i~~~~--~---~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~-d~~~~~~~~~~pl 136 (461) T protein:vir:80 63 NIVDIISEDMVRAGWSLKTDN--K---EMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVV-SSNREQADLSTAI 136 (461) T ss_pred hhhccchHHhhcCCeeeecCC--H---HHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEee-cCCccccCccCCc Confidence 999999999999999887532 2 22344556666665444444 444799999987655421 11100 Q ss_pred ------eEEeeEeeccceeEe---ccc------CCcceeecc-------------CCCCceeeccCceEEEeecCCcCcc Q lcl|NC_020866. 151 ------FPDKLIWRDPRHFRF---QKS------DLATPLLLD-------------DNGAEVPLPAFRFIFADIPAKSGIL 202 (547) Q Consensus 151 ------~~~~l~~~~~~~~~~---~~~------~~~~~~~~~-------------~~~~g~~l~~~k~i~~~~~~~~~~p 202 (547) .+..|.+..+..+.. ..+ +....+... ....+..+-+.++|++.+..-.+.. 
T Consensus 137 ~~~~~~~~~~l~~~~~~~i~~~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~~~~ 216 (461) T protein:vir:80 137 DPKTIKSIPYINTFNTQKVTQLYLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFEGET 216 (461) T ss_pred ccccccceeEEEeccccccchhhhcccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecCCCCCccc Confidence 111222222221110 000 111111111 1122356777888888777777888 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHh-cCeeEEecCCceEEEEecCC Q lcl|NC_020866. 203 LRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIA-GDCAAIIPETMSIDFVESKS 281 (547) Q Consensus 203 ~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~-~~~~~vip~~~~ie~~e~~~ 281 (547) +|.+++..+|....--......-+..+.++.+++..-+-......++...+.+.+..+. +.+.+++..+.+++.+...- T Consensus 217 ~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~~~~~~~~g~~~~d~~e~~e~~~~~l 296 (461) T protein:vir:80 217 KGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKANLTAMLDFMFRTEALAIIKGDEQLTKESTNV 296 (461) T ss_pred cCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHHHHHHHHHHhcCCceEEEEcCCcceEEEecCc Confidence 99999999998887777777777788888888865433211112233334444444433 45678899999999887543 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHhcchhc---cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC- Q lcl|NC_020866. 
282 IGSSVDLYEKRINHLDQQISKAVLGQTTT---TDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYG- 357 (547) Q Consensus 282 ~~~~~~~f~~li~~~d~~Isk~ilGqtlt---s~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg- 357 (547) ++ -+.+++..-.+||-+ .+-.+| +.+ .|+.|.|+-.....-+.+++.......-+.+.|++.++.-.++ T Consensus 297 -sg----l~~~l~~~~~~iaa~-s~iP~t~L~G~s-~g~~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~ 369 (461) T protein:vir:80 297 -SG----MKDLLDYGWDYLAGA-VRMPKTVLKGQE-AGTLTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDC 369 (461) T ss_pred -CC----HHHHHHHHHHHHhhh-hcCCeeeeeccc-CCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 33 345566555677765 333332 233 3567778878888899999988765555556678777654333 Q ss_pred -Ccc----cccEEEecCccccCHHH-------HHHHHHHHHhCCCCCCHHHHHH----HhCCCCCCC----cchhccccc Q lcl|NC_020866. 358 -PQK----RYPRIKIARPKTEDLTK-------LASSLDTLVRLGMEIEENEVRS----RFGFSQPKP----GAKLLRPLE 417 (547) Q Consensus 358 -~~~----~~p~~~~~~~~~ed~~~-------~a~~~~~l~~~G~~v~~~~~~e----~~Glp~p~~----~e~~~~~~~ 417 (547) +.. .--.|+|...-+.+-++ .+++++++++.|+ ++.+++|+ ++|+.++.. +.+...... T Consensus 370 ~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~-is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~ 448 (461) T protein:vir:80 370 GPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGV-LDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAK 448 (461) T ss_pred ccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCC-CCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhh Confidence 211 12356676554444444 4566888888897 88888876 445543210 000000000 Q ss_pred cccccccccccccccc Q lcl|NC_020866. 418 NPATQAAPPGMDPGAG 433 (547) Q Consensus 418 ~~~~~~~~~~~~~~~~ 433 (547) . 
....+....+.+ T Consensus 449 ~---~~~~~~~e~~~g 461 (461) T protein:vir:80 449 L---VYDAYAKKNADG 461 (461) T ss_pred h---ccccccccCCCC Confidence 0 000000000000 No 100 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.50 E-value=3.8e-13 Score=88.68 Aligned_cols=432 Identities=10% Similarity=0.020 Sum_probs=221.3 Q ss_pred CCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHH Q lcl|NC_020866. 2 AQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGV 81 (547) Q Consensus 2 a~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~ 81 (547) -|+|.|+++.+++.......-..+++.. .+.+.++.....++ ...++. +...+..-.+++...+++++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~---~~~~~~w~~~~~s~---~~~i~~----~~~~lr~RaRdl~rNn~~a~~a 70 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGF---GGQLRGWNPPSESA---DAALLP----NYSRGNARADDLVRNNGYAANA 70 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCC---CCcccccccCCCCH---HHHHHH----HHHHHHHHHHHHHhcChHHHHH Confidence 3589999999987754433322222111 12333332223333 233332 2344555566777799999999 Q ss_pred HHHHHHHHhcCCceEecCC-------CCHHHHHHHHHHHHHHhC--------------CCHHHHHHHHHH-hhhhcCeee Q lcl|NC_020866. 82 IGTRKRSVSQLDITVEPAS-------DDPQDVKIADMVRDWLKR--------------EELQQEVFHILD-CISKGYSFT 139 (547) Q Consensus 82 l~~Rk~~v~~~~w~v~p~~-------~~~~~~~~ae~v~~~l~~--------------~~~~~~i~~~l~-a~~~G~sv~ 139 (547) ++.....|.+..+.+.+.. +..++++..+.|++.|.. .+|..+...++. 
.+.-|=.++ T Consensus 71 v~~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~ 150 (530) T protein:vir:38 71 VQLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCV 150 (530) T ss_pred HHHHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEE Confidence 9999999999999887642 344566677777766631 258888877775 466888888 Q ss_pred eeEEeecCCeeeEEeeEeeccceeEec----------------ccCCcce-eec-c-CCCC----ceeeccC-----ceE Q lcl|NC_020866. 140 EIVWNTSEGQYFPDKLIWRDPRHFRFQ----------------KSDLATP-LLL-D-DNGA----EVPLPAF-----RFI 191 (547) Q Consensus 140 Ei~w~~~~g~~~~~~l~~~~~~~~~~~----------------~~~~~~~-~~~-~-~~~~----g~~l~~~-----k~i 191 (547) -+.|..++|.-++-+|..+++.++..+ ..++..- .+. . ..+. ...+|.+ .-| T Consensus 151 ~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~v 230 (530) T protein:vir:38 151 QATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSF 230 (530) T ss_pred EeeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhHe Confidence 888988777666667888877655432 2222111 111 1 1111 1122222 245 Q ss_pred EEeecC-CcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCC----------C-CHHHHH-------- Q lcl|NC_020866. 192 FADIPA-KSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPG----------A-SEQDKD-------- 251 (547) Q Consensus 192 ~~~~~~-~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~----------~-~~~~~~-------- 251 (547) +|.+.. +.+..-|.+.|.++.....--......-+.-...-..=..+.|.+.+ . .+++.. 
T Consensus 231 lH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (530) T protein:vir:38 231 IHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGE 310 (530) T ss_pred EeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchh Confidence 665554 46888899999887655433222222222111111111111111110 0 000000 Q ss_pred ---HHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch---hccCccccccchhHHHH Q lcl|NC_020866. 252 ---TLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT---TTTDAVAGGHAVSQEHR 325 (547) Q Consensus 252 ---~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt---lts~~~ggs~a~~~vh~ 325 (547) .-......+..+....++.|.+|++.++...+++ |..|.+.+-+.|+.. +|-+ ||.+-.+.||+++-.-. T Consensus 311 ~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~---~~~f~~~~lr~iaag-lGi~ye~lt~D~s~~nYSS~R~~~ 386 (530) T protein:vir:38 311 MAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQDTDNG---YSTFEQSLLRYIAAG-LGVSYEQLSRNYSQMSYSTARASA 386 (530) T ss_pred hhhcccccceeccCceeeecCCCCeeeeeCCCCCCCC---HHHHHHHHHHHHHhh-cCCCHHHHhcccccccHHHHHHHH Confidence 0011122467777888999999999987765444 788888888999988 5654 56664455787776666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--C--C--cc----------cccEEEe--cCccccCHHHHHHHHHHHHh Q lcl|NC_020866. 326 LVQEDIETSDAMALAAILNRDLIRPWIQLEY--G--P--QK----------RYPRIKI--ARPKTEDLTKLASSLDTLVR 387 (547) Q Consensus 326 ~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nf--g--~--~~----------~~p~~~~--~~~~~ed~~~~a~~~~~l~~ 387 (547) .-+...++.....+...+-+-+...+++.-+ | + .. 
.+....| ..-...|..+.+++....++ T Consensus 387 ~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~ 466 (530) T protein:vir:38 387 NESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIE 466 (530) T ss_pred HHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHH Confidence 5556666666665555444444444443211 1 1 00 0111222 33345688888889999999 Q ss_pred CCCCCCHHHHHHHhCCCCCCCcchhcccccccc----cccccccccccccCCccccccccccccchhhh Q lcl|NC_020866. 388 LGMEIEENEVRSRFGFSQPKPGAKLLRPLENPA----TQAAPPGMDPGAGDLKTAERKLKPLSGQADLS 452 (547) Q Consensus 388 ~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (547) .|+ -|.+.+....|....+-.+++........ ..+..+...++.+..... ..+..+. .-+ T Consensus 467 ~G~-~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~---~~~~d~~-~~a 530 (530) T protein:vir:38 467 AGL-STYEKECAKRGDDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGVKKSN---EEEQDGA-RAA 530 (530) T ss_pred cCC-CCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCCCCCC---CCCCCCC-CCC Confidence 998 44444444456543221111110000000 000000000000000000 0000000 000 No 101 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=99.50 E-value=5.1e-13 Score=87.98 Aligned_cols=372 Identities=14% Similarity=0.085 Sum_probs=185.5 Q ss_pred hhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCC-CHHHHH Q lcl|NC_020866. 29 VTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASD-DPQDVK 107 (547) Q Consensus 29 ~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~-~~~~~~ 107 (547) +.-+ +.+..--...++.. . +.............+-+.|.+|++..-..|.+++|++...+. ...+.. 
T Consensus 1 MGlf-~~~~~~~~~~~~~~---------~--~~~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~~~~~ 68 (395) T protein:vir:98 1 MGIL-DFFSFKKSGTLSDD---------D--SGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLTENQKD 68 (395) T ss_pred Ccch-hhhcCCCccccccc---------c--cchhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCcccccch Confidence 2221 21110000011110 0 111111122222224678999999999999999999975432 222212 Q ss_pred HHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcceeeccCCCC Q lcl|NC_020866. 108 IADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATPLLLDDNGA 181 (547) Q Consensus 108 ~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (547) +..+|+ ++ ...++++.+. +.+.+|.+.+.++.. ++.+.+..+ .+... ... ............. T Consensus 69 ----~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~--~~~~~~~~~-~~~~~---~~~-~~~~~~~~~~~~~ 137 (395) T protein:vir:98 69 ----WLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPG--KGIYVADSF-TQDKK---ISG-SQFKVSRVQGQTY 137 (395) T ss_pred ----HHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeC--CceecCCcc-ccccc---ccC-cccceeeecCcee Confidence 233343 33 3445555544 455689998777642 222221111 11100 000 0000111112222 Q ss_pred ceeeccCceEEEeecCCcCccccchhHHHHHHHH--HHHHHHHHHHHHHHHHcCCCeEEEeCCCCC-CHHHHH----HHH Q lcl|NC_020866. 182 EVPLPAFRFIFADIPAKSGILLRSGLARVAAWGW--MFKAFTARDWAIFTQTYGQPLRIGKWQPGA-SEQDKD----TLF 254 (547) Q Consensus 182 g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~--~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~-~~~~~~----~l~ 254 (547) ...+++...|++++....+.+++.|+........ .........-..+...++.+..+.++.... +++..+ .+. 
T Consensus 138 ~~~~~~~evih~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (395) T protein:vir:98 138 EKTFTFDQVIYLKNDNSDLMSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFK 217 (395) T ss_pred eeEecCccEEEecCCCCCccccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccccccCCcHHHHHHHHHHHH Confidence 3567778888777777777788888766433221 111111122223444444444333333222 222222 233 Q ss_pred HHHHHHhcCe--eEEecCCceEEEEecCCCC---CchHHHHHHHHHHHHHHHHHHhc-chhccCccccccchhH-HHHHH Q lcl|NC_020866. 255 RAVANIAGDC--AAIIPETMSIDFVESKSIG---SSVDLYEKRINHLDQQISKAVLG-QTTTTDAVAGGHAVSQ-EHRLV 327 (547) Q Consensus 255 ~al~~~~~~~--~~vip~~~~ie~~e~~~~~---~~~~~f~~li~~~d~~Isk~ilG-qtlts~~~ggs~a~~~-vh~~v 327 (547) +......+++ +++++.|++++=+..+... .....|.+..++.-++|+++.-- ..+. +|+++..+ ..... T Consensus 218 ~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l----~~~~sn~e~~~~~f 293 (395) T protein:vir:98 218 RTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLL----HGDIADNQKNYELL 293 (395) T ss_pred HHHhhhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHh----cCCcccHHHHHHHH Confidence 3333333344 3456778877655433211 11124666777778889988421 1222 23444333 33345 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCC Q lcl|NC_020866. 328 QEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPK 407 (547) Q Consensus 328 ~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~ 407 (547) ...-+...++.+++.||+.|+++-... 
..-+|.++.....|.+++++.++++++.|+ ++.+++|+.+|+|+-+ T Consensus 294 ~~~tl~P~~~~ie~~l~~kll~~~~~~------~g~~f~~~~l~~~d~~~~~~~~~~~~~~G~-~T~NE~R~~~g~~Pi~ 366 (395) T protein:vir:98 294 LEGPIESLITNIVDGLEYAIFDKSETL------QGSFIKVTGLKNYDLFSISNQADKLISSGF-VFIDEVREEIGLPELP 366 (395) T ss_pred HHHHHHHHHHHHHHHHHHhcCChhhhc------CcceeeehhhhccCHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCCCC Confidence 567788899999999999886543211 112577777778899999999999999997 9999999999998654 Q ss_pred C--cchhcccccccccccccccccccccCCccccccccccccc Q lcl|NC_020866. 408 P--GAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQ 448 (547) Q Consensus 408 ~--~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (547) + ++..+.+.. -.+.....+....... . T Consensus 367 ~~~gD~~~~~~n-~~~~~~~gge~~~~~~-------------~ 395 (395) T protein:vir:98 367 DGLGKVLYMTKN-YESVLERGGEVDEEVE-------------T 395 (395) T ss_pred CCCCceeeeccc-ceecccccCCCCCCCC-------------C Confidence 4 444443322 1111111111000000 0 No 102 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=99.48 E-value=2.3e-13 Score=89.83 Aligned_cols=268 Identities=9% Similarity=0.033 Sum_probs=172.0 Q ss_pred HhcCCceEecCCCCHHHHHHHHHHHHH-HhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEec Q lcl|NC_020866. 89 VSQLDITVEPASDDPQDVKIADMVRDW-LKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQ 166 (547) Q Consensus 89 v~~~~w~v~p~~~~~~~~~~ae~v~~~-l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~ 166 (547) |.+++|.+.-.+... +..++..+... -..+.+.++++.++ +.+.+|.++++++...+| .+..|.++++.++.+. 
T Consensus 1 ia~l~~~~~~~~~~~-~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G---~~~~l~~l~~~~v~v~ 76 (278) T protein:vir:78 1 MASLPLKMYEDYKVV-NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH---QPSKLFLLNPDVVEML 76 (278) T ss_pred CccceeEEEecCccc-ccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCC---cEEEEEEECCceeEEE Confidence 999999997544322 22222222211 01234777888887 578899999999865333 3678999999988876 Q ss_pred ccCCcc---eeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcC-CCeEEEeCC Q lcl|NC_020866. 167 KSDLAT---PLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYG-QPLRIGKWQ 242 (547) Q Consensus 167 ~~~~~~---~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G-~P~~vgk~~ 242 (547) .+.... +.....++....+++...|++++....+.++|.|.+..+....-...... .|.. .+++ .|..+.+.+ T Consensus 77 ~~~~~~~~~y~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~-~~~~--~~~~~~~~~i~~~~ 153 (278) T protein:vir:78 77 IENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVR-TFNL--TEMQKPDSFMLKYG 153 (278) T ss_pred EcCCCceEEEEEEcCCceEEEEccccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHH-HHHH--HHhcCCCcEEEEeC Confidence 543322 22333444557788888777776666777899999999987766555443 3432 2332 366777788 Q ss_pred CCCCHHHHHHHHHHHHHHhcCe--eEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-chhccCccccccc Q lcl|NC_020866. 
243 PGASEQDKDTLFRAVANIAGDC--AAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-QTTTTDAVAGGHA 319 (547) Q Consensus 243 ~~~~~~~~~~l~~al~~~~~~~--~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-qtlts~~~ggs~a 319 (547) ...++++++++.+.+.+...++ .+++|.|++++-+..+ .....|.+..++..++|+++.-- ..+.+..++++++ T Consensus 154 ~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~---~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~s 230 (278) T protein:vir:78 154 SNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKK---YVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFA 230 (278) T ss_pred CCCCHHHHHHHHHHHHHHhccCCCceecCCCceEEEccCC---hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc Confidence 8889999999988888766554 5777888877655432 22335778888999999988421 2344444455666 Q ss_pred hhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccc Q lcl|NC_020866. 320 VSQEHR-LVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKT 372 (547) Q Consensus 320 ~~~vh~-~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ 372 (547) ..+.+. .....-+...++.|++.||+.|+++--.. ..-+|+|+...= T Consensus 231 n~~~~~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~------~g~~~~f~~~~l 278 (278) T protein:vir:78 231 KNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDRE------KIGILNLTLNLI 278 (278) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhc------CCceEEEecccC Confidence 555554 45566789999999999999886542111 111355553221 No 103 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.45 E-value=1e-11 Score=80.79 Aligned_cols=512 Identities=10% Similarity=-0.015 Sum_probs=219.7 Q ss_pred CCCCCeeeCCCCCcccch------hhhhhhhhhhhhhhH-----HHHhhhhccCCCHHHHHHHHHhh-cCCCHHHHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRA------QLTEEVAAPTVTGVR-----SPLSGYPGDGLNPLRLAQILRAA-DHGDPVRYLDLA 68 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~------~l~~~~a~~~~~~~~-----~~~~~~~~~~l~p~~~~~ilr~a-~~g~~~~~~~L~ 68 (547) |++..++.+..-.+.+.+ ....+.+.-..+..- ..+. ...-++++.+....+-.. ..+.+. 
.++|+ T Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~f~-gyql~ 114 (765) T protein:vir:96 37 MIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAK-AAAGGQNPYVVPTMLQDWYNSQGFI-GYQAC 114 (765) T ss_pred chhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHH-HhhhccCccchhhHHHhhhcccCCc-cHHHH Confidence 666555555332222111 111111110011000 0000 000122222211221111 111111 13455 Q ss_pred HHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEe-ec Q lcl|NC_020866. 69 EVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWN-TS 146 (547) Q Consensus 69 ~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~-~~ 146 (547) . +.++.+-++.++++...-.++..|+|...+++. +.+..+.+++.++++.+.+.|.+.+ .+.+||-+++=+.=. .+ T Consensus 115 a-lY~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e~-~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D 192 (765) T protein:vir:96 115 A-IISQHWLVDKACSMSGEDAARNGWELKSDGRKL-SDEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDD 192 (765) T ss_pred H-HHHhCchhhhhhhcchHHhhcCCceeecCcccc-CHHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccC Confidence 4 345799999999999888899999998654333 2334456777777777666666666 588998665433221 11 Q ss_pred CCe------------eeEEeeEeeccceeEeccc-----CCcc---eeeccCCCCceeeccCceEEEeecC------CcC Q lcl|NC_020866. 147 EGQ------------YFPDKLIWRDPRHFRFQKS-----DLAT---PLLLDDNGAEVPLPAFRFIFADIPA------KSG 200 (547) Q Consensus 147 ~g~------------~~~~~l~~~~~~~~~~~~~-----~~~~---~~~~~~~~~g~~l~~~k~i~~~~~~------~~~ 200 (547) +.. ..++.|..++|.|..-... +... .........+..+-+.++|++.... ... 
T Consensus 193 ~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i~g~~IH~SRli~~~g~~lpd~lk~~~ 272 (765) T protein:vir:96 193 PDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWIISGKKYHRSHLVVVRGPQPPDILKPTY 272 (765) T ss_pred cchhhccccccccccceeeEEEEechhhcccccchhccccccccccCcceeeeecCceeccceEEEecCCCchhhhcccc Confidence 111 1233455555544332110 0000 0000001123345566666554332 345 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCC-CCHHHHHHHHHHHHH-HhcCeeEEecCCceEEEEe Q lcl|NC_020866. 201 ILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPG-ASEQDKDTLFRAVAN-IAGDCAAIIPETMSIDFVE 278 (547) Q Consensus 201 ~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~-~~~~~~~~l~~al~~-~~~~~~~vip~~~~ie~~e 278 (547) +.||.++++.+|....--......-+..+.++.+.+.--+.... .+++....-.+.+.. ..+.+.+++..+.+++.+. T Consensus 273 ~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~r~~~~~~~r~n~g~~~id~ee~~e~~s 352 (765) T protein:vir:96 273 IFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIANEDAFNARLAFWIANRDNHGVKVIGIDETMEQFD 352 (765) T ss_pred CccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeeechHhhhccHHHHHHHHHHHHHhcCCceeEEecCCcceeEEe Confidence 56799999999988766666666777788888876543332111 223333222233333 3456788999999999887 Q ss_pred cCCCCCchHHHHHHHHHHHHHHHHHHhcchhc---cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 279 SKSIGSSVDLYEKRINHLDQQISKAVLGQTTT---TDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLE 355 (547) Q Consensus 279 ~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlt---s~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~N 355 (547) ..- ++ -..+++..-++||-+ .+-.+| +.+..|-.|.|+--...+-+.+++.......-+.+.|++.++... 
T Consensus 353 ~~l-sg----l~d~l~~~~~~iAaa-s~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~s~ 426 (765) T protein:vir:96 353 TNL-SD----FDSVIMNQYQLVAAI-AKTPATKLLGTSPKGFNATGEHETISYHEELESIQEHIFDPLLERHYLLLAKSE 426 (765) T ss_pred ccc-CC----HHHHHHHHHHHHHhh-hCCCeeeeccCCcccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 553 34 344555555667765 444332 233356667777777888888888886654444456787776553 Q ss_pred CCCcccccEEEecCccccCH-------HHHHHHHHHHHhCCCCCCHHHHHHHhCCC------CCCCcchhcccccccccc Q lcl|NC_020866. 356 YGPQKRYPRIKIARPKTEDL-------TKLASSLDTLVRLGMEIEENEVRSRFGFS------QPKPGAKLLRPLENPATQ 422 (547) Q Consensus 356 fg~~~~~p~~~~~~~~~ed~-------~~~a~~~~~l~~~G~~v~~~~~~e~~Glp------~p~~~e~~~~~~~~~~~~ 422 (547) ..++ --.|+|......+- +..+++++++++.|+ ++.+++|++++-. ...+++....+...+... T Consensus 427 ~i~~--d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gv-is~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~ 503 (765) T protein:vir:96 427 SIDV--QLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGV-VSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPENL 503 (765) T ss_pred CCCC--cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCC-CCHHHHHHHHhccccCCCCCCCccccccccCCCcccc Confidence 2221 23556654443333 345567888889997 9999999987532 111111100000000000 Q ss_pred cccc------cccccccCCccccccccccc-----cchhhhhhcccchhhhcccccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 423 AAPP------GMDPGAGDLKTAERKLKPLS-----GQADLSRGDLSLNSEMPSTGQKTGGAEIAALTDQLELEGQPEIVA 491 (547) Q Consensus 423 ~~~~------~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 491 (547) ..-+ ...+......++++...... ..+..........++....+..... ...........+.. 
T Consensus 504 ~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~~p~------~~~p~~~~~~~~~~ 577 (765) T protein:vir:96 504 AELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAATPPS------RPNPRAELRNLLSD 577 (765) T ss_pred ccccCCCcccccccCccccccCCCCccCCCCcccccCCcccCCccccccccCccccCccc------cccccccchhcccc Confidence 0000 00000000000000000000 0000000000000000000000000 00000000000000 Q ss_pred HHHHHHHHHHccCCHHHHHHHHHHHhcCCCHHHH-HHHHHHHHHHHHHHhHHhhccC Q lcl|NC_020866. 492 MIEEIRTMLEAASSLGEFSDMLDARYGEIGASQL-ASRISAAMLASEFAGREGALDG 547 (547) Q Consensus 492 ~~~~i~~~l~~a~s~ee~~~~L~~l~~~~~~~~l-~~~l~~al~~A~~~G~~~~~~g 547 (547) ..+.... +... +..+..+..+. +..+..+..-+ +..++.. T Consensus 578 ~~~~~~~----------~~~~--a~~~g~~v~~~~~~a~~~a~~ps----~a~~~~~ 618 (765) T protein:vir:96 578 LLSKLEA----------LDDA--QAPDGVDIEQDDAPGLKRTSKPS----VSGMEPS 618 (765) T ss_pred hhhhhhc----------cccc--cccCCCCCCCCccchhhhhhccc----cCCCCCc Confidence 0000000 0000 01111111111 11222222211 1122222 No 104 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=99.45 E-value=6.4e-13 Score=87.45 Aligned_cols=355 Identities=10% Similarity=0.052 Sum_probs=183.7 Q ss_pred eCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_020866. 8 LDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKR 87 (547) Q Consensus 8 ~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~ 87 (547) .+-+|+.... ++...+.+..... +..+. .....-+.|.+|++.+.. 
T Consensus 1 Mg~f~~~~~~-----------------------~~~~~~~~~~~~~--~~~~~---------~~~~~~~~v~~~v~~IA~ 46 (378) T protein:vir:94 1 MNLFGKVVSF-----------------------SRGKLNNDTQRVT--AWQNE---------AVEYTSAFVTNIHNKIAN 46 (378) T ss_pred CCccccchhc-----------------------ccccccCCcceee--eeccc---------hhHHHHHHHHHHHHHHHh Confidence 3333332211 1111010000000 00000 011123569999999999 Q ss_pred HHhcCCceEecCC--CCHHHH--H-HHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeE Q lcl|NC_020866. 88 SVSQLDITVEPAS--DDPQDV--K-IADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLI 156 (547) Q Consensus 88 ~v~~~~w~v~p~~--~~~~~~--~-~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~ 156 (547) .|.+++|.+.-.. +...+. . .-.-+..+|+ ++ ...++++.++ +.+.+|.+.+.++|....+.+.. +. T Consensus 47 ~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~--l~ 124 (378) T protein:vir:94 47 EITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLD--LL 124 (378) T ss_pred hhhhCceeeEEEcccCcccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEE--EE Confidence 9999999875322 111111 0 0112344554 33 3556666665 56789999988877644443221 11 Q ss_pred eeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe Q lcl|NC_020866. 157 WRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL 236 (547) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~ 236 (547) | ...+..+++...|++++ . .+...|.|.+..+.-.. ..+. +.|.|- T Consensus 125 ---p------------------~~~~~~~~~~diiH~~~-~-~~~~~g~s~l~~~~~~i----------~~~~-~~~~~~ 170 (378) T protein:vir:94 125 ---F------------------ADDKKEYKPEELVRLTS-P-FYINEDTSILDNALASI----------QTKL-EQGKLR 170 (378) T ss_pred ---e------------------cCCeeEeeeeeeEEecC-c-CCccchhHHHHHHHHHH----------HHHH-hccccc Confidence 1 11123456665554442 2 22234667766554321 1122 235555 Q ss_pred EEEeCCCCCCHHHHH----HHHHHHHHHh----cCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc- Q lcl|NC_020866. 
237 RIGKWQPGASEQDKD----TLFRAVANIA----GDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ- 307 (547) Q Consensus 237 ~vgk~~~~~~~~~~~----~l~~al~~~~----~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq- 307 (547) -+.+++...+++..+ ++.+.+.+.. ++..++++.|++++=+..+. ... .+.. .++..++|+++. |- T Consensus 171 gil~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~--~~~-~~~~-~~~~~~~Ia~~f-gVP 245 (378) T protein:vir:94 171 GLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDY--SVL-NKDE-IDLIKSELLTGY-FMN 245 (378) T ss_pred ceeeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEEccCCh--hhh-hHHH-HHHHHHHHHHHh-CCC Confidence 556777665655444 4444444432 22458888888876554322 222 2443 467788999873 32 Q ss_pred -hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CC-cccccEEEecCccccCHHHHHHHHHH Q lcl|NC_020866. 308 -TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEY-GP-QKRYPRIKIARPKTEDLTKLASSLDT 384 (547) Q Consensus 308 -tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nf-g~-~~~~p~~~~~~~~~ed~~~~a~~~~~ 384 (547) .+. +|+++ .+........-+...++.|+..||+.|+.+--...+ +. ....++|.++.....|.+.+++.+.+ T Consensus 246 ~~~l----~~~~s-e~~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~ 320 (378) T protein:vir:94 246 ENIL----LGTAS-QEQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHE 320 (378) T ss_pred HHHh----cCChH-HHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHH Confidence 222 23333 233445566778888999999999988765433221 11 11224566666677899999999999 Q ss_pred HHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccc Q lcl|NC_020866. 385 LVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKP 444 (547) Q Consensus 385 l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (547) +.+.|+ ++.+++|+.+|+|+-+.++..+.+.... 
+...+....++.....++....++ T Consensus 321 ~~~~G~-~T~NE~R~~~gl~p~~gGD~~~~~~n~~-~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 321 NINGPI-FTQNQLLVKMGEQPIEGGDVYIANLNAV-AVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred HHhCCC-cCHHHHHHHhCCCCCCCCCeeeeccccc-ccccchhhcCCcCCCCCCCCCCCC Confidence 999998 9999999999999877777655443221 111111111111111111000000 No 105 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=99.43 E-value=2.9e-12 Score=83.83 Aligned_cols=355 Identities=9% Similarity=0.034 Sum_probs=183.3 Q ss_pred hhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCC--CHHHH Q lcl|NC_020866. 29 VTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASD--DPQDV 106 (547) Q Consensus 29 ~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~--~~~~~ 106 (547) +.-+++... ..+...+...... . +..+.. + ...-+.|.+|++.+...|.+++|.+..... ...+. T Consensus 1 Mg~f~~~~~--f~~~~~~~~~~~~-~-~~~~~~-----~----~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:93 1 MNLFGKVVS--FSRGKLNNDTQRV-T-AWQNEA-----V----EYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred Cccchhhhh--hhccccCCCccee-e-ecccch-----h----HHHHHHHHHHHHHHHhhhhhCceeeEEEccccccccc Confidence 444433322 1111111110001 0 111111 1 113457999999999999999998854321 11110 Q ss_pred --H-HHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcceeecc Q lcl|NC_020866. 107 --K-IADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATPLLLD 177 (547) Q Consensus 107 --~-~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 177 (547) . .-.-+..+|+ ++ ...++++.++ +.+.+|.+.+.+++....|... .+. 
| T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~--~l~---~----------------- 125 (378) T protein:vir:93 68 LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELL--DLL---F----------------- 125 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEE--EEE---e----------------- Confidence 0 1122444454 33 3556666555 5667999988776643333221 111 1 Q ss_pred CCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHH----H Q lcl|NC_020866. 178 DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDT----L 253 (547) Q Consensus 178 ~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~----l 253 (547) ...+..++++..|++++ . .+...|.+++..+.-. ...+.. .|.|--+.+++...+++..++ + T Consensus 126 -~~~~~~~~~~diih~r~-~-~~~~~~~s~l~~~~~~----------i~~~~~-~~~~~g~l~~~~~l~~~~~~~~~~~~ 191 (378) T protein:vir:93 126 -ADDKKEYKTEELVRLTS-P-FYINEDTSILDNALAS----------IQTKLE-QGKLRGLLKINAFLDIDNTQEYREKA 191 (378) T ss_pred -cCCeeEeccceeEEecC-c-cccchhhHHHHHHHHH----------HHHHHh-cCcccceeeeCCcCCHHHHHHHHHHH Confidence 11234566666654442 2 2223366666655422 222333 355545556665555554444 4 Q ss_pred HHHHHHHh----cCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch--hccCccccccchhHHHHHH Q lcl|NC_020866. 254 FRAVANIA----GDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT--TTTDAVAGGHAVSQEHRLV 327 (547) Q Consensus 254 ~~al~~~~----~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt--lts~~~ggs~a~~~vh~~v 327 (547) .+.+++.. ++..++++.|++++-+..+. ... .++ ..++..++|+++ +|-. +. +|+++ .+..... 
T Consensus 192 ~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~--~~~-~~~-~~~~~~~~Ia~~-fgVPp~~l----~g~~~-e~~~~~f 261 (378) T protein:vir:93 192 LTTIKNMQEGSSYNGLTPVDNKTEIVELKKDY--SVL-NKD-EIDLIKSELLTG-YFMNENIL----LGTAT-QEQQIYF 261 (378) T ss_pred HHHHHHhhcccccccceEcCCCceEEEccCCh--hhh-hHH-HHHHHHHHHHHH-hCCCHHHh----cCCcH-HHHHHHH Confidence 44444332 22457888888877554332 222 233 346788899987 3422 22 23333 2333445 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCC-cccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCC Q lcl|NC_020866. 328 QEDIETSDAMALAAILNRDLIRPWIQL-EYGP-QKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQ 405 (547) Q Consensus 328 ~~~~~~ad~~~i~~~ln~~li~~l~~~-Nfg~-~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~ 405 (547) ...-+...++.|++.||+.|+..--.. .++. .....+|.++.....|++++++++.++++.|+ ++.+++|+.+|+|+ T Consensus 262 ~~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~-~t~NE~R~~~gl~p 340 (378) T protein:vir:93 262 YNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPI-FTQNQLLVKMGEQP 340 (378) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCC Confidence 567788999999999999887543211 1111 11224566666778899999999999999997 99999999999997 Q ss_pred CCCcchhcccccccccccccccccccccCCccccccccc Q lcl|NC_020866. 406 PKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKP 444 (547) Q Consensus 406 p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (547) -+.++..+.+... .+........++....+++...... 
T Consensus 341 ~~ggD~~~~~~n~-~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 341 IEGGDVYIANLNA-VAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred CCCCCeeeecccc-ccccchhhhcCccCCCCCCCCCCCC Confidence 7666655543321 1111111111111111111000000 No 106 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.42 E-value=4.4e-12 Score=82.87 Aligned_cols=432 Identities=9% Similarity=0.008 Sum_probs=213.8 Q ss_pred eeCCCCCcccchhhhhhhhhhhhhh--------hHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 7 LLDRWGNPVQRAQLTEEVAAPTVTG--------VRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 7 ~~~~~g~~~~~~~l~~~~a~~~~~~--------~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |.+..+|++......++...+.... ..+.+.++-....++ +..++ .+...+..-.+++...++++ T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~---~~~~~----~~~~~lr~RaRdL~rNn~~a 73 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSP---DALIN----PLKRIADARGRDMADNDGFT 73 (553) T ss_pred CcchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCCh---HHHHH----HHHHHHHHHHHHHHhcChHH Confidence 5566677776544444432211110 011122221111122 11222 23445556666777899999 Q ss_pred HHHHHHHHHHHhcCCceEecCC--------CCHHHHHHHHHHHHHHhC--------------CCHHHHHHHHHH-hhhhc Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPAS--------DDPQDVKIADMVRDWLKR--------------EELQQEVFHILD-CISKG 135 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~--------~~~~~~~~ae~v~~~l~~--------------~~~~~~i~~~l~-a~~~G 135 (547) +++++.....|.+..+.+.+.. +...+++..+.|++.|.. .+|..+...++. 
.+.-| T Consensus 74 ~~av~~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dG 153 (553) T protein:vir:63 74 NGAVGYQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTG 153 (553) T ss_pred HHHHHHHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCC Confidence 9999999999999998887642 334556666666655431 258888888874 56689 Q ss_pred CeeeeeEEeecCCeeeEEeeEeeccceeEecccC----------------Cccee--eccCCCCc--------------- Q lcl|NC_020866. 136 YSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSD----------------LATPL--LLDDNGAE--------------- 182 (547) Q Consensus 136 ~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~----------------~~~~~--~~~~~~~g--------------- 182 (547) =.++-+.|.+..+..++-+|..+++.++..+.+. +..-+ ....++.. T Consensus 154 E~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~ 233 (553) T protein:vir:63 154 EVLATAEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQ 233 (553) T ss_pred ceEEEeeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccceeeecc Confidence 8888888987766556667888888766443321 11111 11111111 Q ss_pred -eeeccCceEEEeec-CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHH-------- Q lcl|NC_020866. 183 -VPLPAFRFIFADIP-AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDT-------- 252 (547) Q Consensus 183 -~~l~~~k~i~~~~~-~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~-------- 252 (547) ..+|.. -|+|.+. .+.+..-|.+.|.++.....-.......-+.-...-.+=..+.|.+.. ++..-+. T Consensus 234 ~~~v~a~-~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~-~~~~~~~~~~~~~~~ 311 (553) T protein:vir:63 234 SKPWGRR-QVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELP-PEFIHSQMSGGSPNA 311 (553) T ss_pred ccccChh-HheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCC-hhhhhhhcccccccc Confidence 123333 3455444 457778899988877655433222222222222222221222332211 0000000 Q ss_pred -----------------HHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc---hhccC Q lcl|NC_020866. 
253 -----------------LFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ---TTTTD 312 (547) Q Consensus 253 -----------------l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq---tlts~ 312 (547) --.....+..+....++.|.+|++.++...+++ |..|.+.+-+.|+.. +|- .||.+ T Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~---~~~F~~~~lr~iaag-lGi~Ye~lt~D 387 (553) T protein:vir:63 312 DMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGV---GSEFEASLNRHLASA-FGMSYEEFTRD 387 (553) T ss_pred cccccccccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCC---HHHHHHHHHHHHHhh-cCCCHHHHhhh Confidence 011223466677888999999999987754443 888888888999988 554 36667 Q ss_pred ccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC------Ccc-------------cccEEEec--Ccc Q lcl|NC_020866. 313 AVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYG------PQK-------------RYPRIKIA--RPK 371 (547) Q Consensus 313 ~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg------~~~-------------~~p~~~~~--~~~ 371 (547) -.+.||+++-.-..-+....+.....+...|-+-+..++++.-+. |.. .+-...|. .-. T Consensus 388 ~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~ 467 (553) T protein:vir:63 388 FSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQG 467 (553) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCcc Confidence 545678776655555555555665555555544444444432211 110 00011222 223 Q ss_pred ccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccc------cccccccccccccccccCCcccccccccc Q lcl|NC_020866. 372 TEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPL------ENPATQAAPPGMDPGAGDLKTAERKLKPL 445 (547) Q Consensus 372 ~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (547) ..|..+-+++....++.|+ -|.+.+....|....+-.+++.... .-+.. ..+... .+.+.......+..+. 
T Consensus 468 ~iDP~Ke~~A~~~~i~~G~-~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~-~~~~~~-~~~~~~~~~~~~~~~~ 544 (553) T protein:vir:63 468 QIDQLKETQAAVMRIDAGL-STYEREIARLGGDFRKSFAQRAREDALLKKYGLTFN-LSAKRS-LGDGRDAATGIAEDPA 544 (553) T ss_pred ccChHHHHHHHHHHHHcCC-CCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCC-CCCccc-cCCCcccCCCCCCCCC Confidence 4688888889999999998 4443333334654322111111000 00000 000000 0000000000000000 Q ss_pred ccchhhhhhcccchhhhcccccccchh Q lcl|NC_020866. 446 SGQADLSRGDLSLNSEMPSTGQKTGGA 472 (547) Q Consensus 446 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (547) ..+... .+. T Consensus 545 ~~~~~~------------------~~e 553 (553) T protein:vir:63 545 AAQTSQ------------------QGE 553 (553) T ss_pred CCCccc------------------ccC Confidence 000000 000 No 107 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=99.41 E-value=3.7e-12 Score=83.25 Aligned_cols=355 Identities=10% Similarity=0.064 Sum_probs=179.8 Q ss_pred eCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_020866. 8 LDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKR 87 (547) Q Consensus 8 ~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~ 87 (547) .+-+|+ ... ..++..+...... . +..+.. ....-+.|.+|+..+.. T Consensus 1 Mg~f~~---------------------~~~--~~~~~~~~~~~~~-~-~~~~~~---------~~~~~~~v~~~i~~Ia~ 46 (378) T protein:vir:16 1 MNLFGK---------------------VVS--FSRGKLNNDTQRV-T-AWQNEA---------VEYTSAFVTNIHNKIAN 46 (378) T ss_pred Cccchh---------------------hhh--hhcccccCCccee-e-ecccch---------hhHHHHHHHHHHHHHHh Confidence 222222 211 0111000000000 0 000000 11134679999999999 Q ss_pred HHhcCCceEecCCC-C-HHH---HHHHHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeE Q lcl|NC_020866. 
88 SVSQLDITVEPASD-D-PQD---VKIADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLI 156 (547) Q Consensus 88 ~v~~~~w~v~p~~~-~-~~~---~~~ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~ 156 (547) .|.+++|.+.-... + ..+ ...-.-+..+|+ ++ ...++++.++ +.+.+|.+.+.++|.-..+.++ .+. T Consensus 47 ~iA~l~~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~--~l~ 124 (378) T protein:vir:16 47 EITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELL--DLL 124 (378) T ss_pred hhhhCceeEEEEcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEE--EEE Confidence 99999998753211 1 000 001123444454 23 4555666555 4667999999888764333322 111 Q ss_pred eeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe Q lcl|NC_020866. 157 WRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL 236 (547) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~ 236 (547) +- ..+..+++...|++++ . ...-.|.+++..+.-. ...+.. .|.|- T Consensus 125 ~~---------------------~~~~~~~~~diih~r~-~-~~~~~~~s~l~~~~~~----------i~~~~~-~~~~~ 170 (378) T protein:vir:16 125 FA---------------------DDKKEYKPEELVRLTS-P-FYINEDTSILDNALAS----------IQTKLE-QGKLR 170 (378) T ss_pred ec---------------------CCeeEecccceEEecC-c-cCccchhHHHHHHHHH----------HHHHHh-cCccc Confidence 10 1123455555554442 2 2222345555544322 222332 34554 Q ss_pred EEEeCCCCCCHHHH----HHHHHHHHHHhc----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc- Q lcl|NC_020866. 237 RIGKWQPGASEQDK----DTLFRAVANIAG----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ- 307 (547) Q Consensus 237 ~vgk~~~~~~~~~~----~~l~~al~~~~~----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq- 307 (547) -+-+++...+++.. +.+.+.+.+... +..++++.|++++-+..+. ... .+.. .++..++|+++. 
|- T Consensus 171 g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~--~~~-~~~~-~~~~~~~Ia~~f-gVP 245 (378) T protein:vir:16 171 GLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDY--SVL-NKDE-IDLIKSELLTGY-FMN 245 (378) T ss_pred eeeEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCCh--hhh-hHHH-HHHHHHHHHHHh-CCC Confidence 45566655555544 444444444322 2457888888877554332 222 2443 477888999883 32 Q ss_pred -hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CC-cccccEEEecCccccCHHHHHHHHHH Q lcl|NC_020866. 308 -TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEY-GP-QKRYPRIKIARPKTEDLTKLASSLDT 384 (547) Q Consensus 308 -tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nf-g~-~~~~p~~~~~~~~~ed~~~~a~~~~~ 384 (547) .+. +|+++. +........-+...++.|++.||+.|+..--...+ +. .....+|.++.....|++++++.+.+ T Consensus 246 p~~l----~g~~~e-~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~ 320 (378) T protein:vir:16 246 ENIL----LGTASQ-EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHE 320 (378) T ss_pred HHHh----cCCchH-HHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHH Confidence 222 233332 23334455678888999999999988765322211 11 11224566677778899999999999 Q ss_pred HHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccc Q lcl|NC_020866. 385 LVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKP 444 (547) Q Consensus 385 l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (547) +++.|+ ++.+++|+.+|+|+-+.++.++.+.... 
+........++....+++....++ T Consensus 321 ~~~~G~-~T~NE~R~~~g~~p~~ggD~~~~~~n~~-~~~~~~~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 321 NINGPI-FTQNQLLVKMGEQPIEGGDVYIANLNAV-AVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred HHhCCC-cCHHHHHHHhCCCCCCCCCeEeeccccc-cccchhhhcCccCCCCCCCCCCCC Confidence 999997 9999999999999877666655443221 111111111111111111000000 No 108 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.40 E-value=4.9e-12 Score=82.58 Aligned_cols=431 Identities=12% Similarity=0.042 Sum_probs=208.2 Q ss_pred CC--CCCeeeCCCCCcccchhhhhhhhhhhh--hhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhH Q lcl|NC_020866. 1 MA--QKPVLLDRWGNPVQRAQLTEEVAAPTV--TGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma--~~~~~~~~~g~~~~~~~l~~~~a~~~~--~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~ 76 (547) |. ..+.++++.+.. . .+..++..- ....+.+.++-....++ +..++. +...+..-.+++...++ T Consensus 1 ~~~p~~~~~~~~~~~~----~-~~~~~~y~~~a~~~~~~~~~w~p~~~s~---~~~~~~----~~~~lr~RaRdl~rNn~ 68 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMT----S-LREYAGYHGGGSGFGGQLRSWNPPSESV---DAALLP----NFTRGNARADDLVRNNG 68 (533) T ss_pred CCCchhhhhhcccccc----h-HHHHHhhhhccCCCCCcccccccCCCCH---HHHHHH----HHHHHHHHHHHHHhcCh Confidence 32 111222222210 0 011111000 01111222221112223 222221 33455555667778999 Q ss_pred HHHHHHHHHHHHHhcCCceEecCC-------CCHHHHHHHHHHHHHHhC--------------CCHHHHHHHHHHh-hhh Q lcl|NC_020866. 77 HYLGVIGTRKRSVSQLDITVEPAS-------DDPQDVKIADMVRDWLKR--------------EELQQEVFHILDC-ISK 134 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~v~p~~-------~~~~~~~~ae~v~~~l~~--------------~~~~~~i~~~l~a-~~~ 134 (547) ++.++++.....|.+..+.+.+.. +..++++..+.|++.|.. 
.+|..+...++.+ +.- T Consensus 69 ~a~~av~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~d 148 (533) T protein:vir:34 69 YAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFN 148 (533) T ss_pred HHHHHHHHHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhC Confidence 999999999999999999888743 345566777777665521 2588888777754 668 Q ss_pred cCeeeeeEEeecCCeeeEEeeEeeccceeEeccc----------------CCcce-eec-cC-CCC----ceeec----- Q lcl|NC_020866. 135 GYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKS----------------DLATP-LLL-DD-NGA----EVPLP----- 186 (547) Q Consensus 135 G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~----------------~~~~~-~~~-~~-~~~----g~~l~----- 186 (547) |=.++-+.|.+..|..++-+|..+++.++..+.+ ++..- .+. .. .+. ..-++ T Consensus 149 GE~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v 228 (533) T protein:vir:34 149 GELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPG 228 (533) T ss_pred CceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeecc Confidence 9899988998876665666788888766544322 11111 111 11 110 00111 Q ss_pred cCceEEEeecC-CcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCC-----------CCHHHHHHHH Q lcl|NC_020866. 187 AFRFIFADIPA-KSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPG-----------ASEQDKDTLF 254 (547) Q Consensus 187 ~~k~i~~~~~~-~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~-----------~~~~~~~~l~ 254 (547) +..-|+|.+.. +.+..-|.+.|.++.....-.......-+.-...-..=..+.|.+.+ ..+.....+. 
T Consensus 229 ~a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (533) T protein:vir:34 229 GRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLT 308 (533) T ss_pred ChhHeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCccccccccc Confidence 11245565554 47888899998877654332222122221111111111111121110 0011111111 Q ss_pred H-----------HHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch---hccCccccccch Q lcl|NC_020866. 255 R-----------AVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT---TTTDAVAGGHAV 320 (547) Q Consensus 255 ~-----------al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt---lts~~~ggs~a~ 320 (547) . .-..++.+....++.|.+|++.++...+++ |..|.+.+-+.|+.. +|-+ ||.+-.+.||++ T Consensus 309 ~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~---~~~f~~~~lr~iAag-lGi~ye~lt~D~s~~nYSS 384 (533) T protein:vir:34 309 GWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNG---YSVFEQSLLRYIAAG-LGVSYEQLSRNYAQMSYST 384 (533) T ss_pred ccchhhhhccCcceeeccCceeeecCCCCeeeecCCCCCCCC---HHHHHHHHHHHHHhh-cCCCHHHHhhhcccccHHH Confidence 0 111366777788999999999987765544 788888888999988 5654 666754568877 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---c-CCC--cc----------cccEEEe--cCccccCHHHHHHHH Q lcl|NC_020866. 321 SQEHRLVQEDIETSDAMALAAILNRDLIRPWIQL---E-YGP--QK----------RYPRIKI--ARPKTEDLTKLASSL 382 (547) Q Consensus 321 ~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~---N-fg~--~~----------~~p~~~~--~~~~~ed~~~~a~~~ 382 (547) +-.-..-+...++.....+...+-+-+...+++. + ..+ .. .+.+..| ..-...|..+.+++. 
T Consensus 385 ~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~ 464 (533) T protein:vir:34 385 ARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEA 464 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHH Confidence 6665555555566555555555544444444432 2 111 00 0011222 233456888888899 Q ss_pred HHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccc----cccccccccccccCCccccccccccccchhhhhhccc Q lcl|NC_020866. 383 DTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPA----TQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLS 457 (547) Q Consensus 383 ~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (547) ...++.|+ .|.+.+....|....+-.+++........ ..+..+......+..+ ....+.++... + T Consensus 465 ~~~i~~G~-~s~~~~~a~~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~---~~~~~~~~~~~------~ 533 (533) T protein:vir:34 465 VMLIEAGL-STYEKECAKRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQ---STEEEKSDSRA------A 533 (533) T ss_pred HHHHHcCC-CCHHHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCC---CCCCCcccCCC------C Confidence 99999998 55544444556643221111111000000 0000000000000000 00000000000 0 No 109 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.38 E-value=6e-12 Score=82.12 Aligned_cols=430 Identities=12% Similarity=0.022 Sum_probs=215.7 Q ss_pred CCCCCeeeCCCCCccc-chhhhhhhhhhhhh-----hhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQ-RAQLTEEVAAPTVT-----GVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEER 74 (547) Q Consensus 1 ma~~~~~~~~~g~~~~-~~~l~~~~a~~~~~-----~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~ 74 (547) |- |+|..--++. ...+.+..+..... +..+....++.. -++ +..+. .+...+..-.+++... 
T Consensus 1 mn----~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~-~s~---~~~~~----~~~~~lr~RaRdl~rN 68 (502) T protein:vir:79 1 MA----ILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARREN-RTA---DQLSQ----YGAVSLREQARYLDNN 68 (502) T ss_pred Cc----hHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCC-CCh---HHHHH----HHHHHHHHHHHHHHhc Confidence 77 7776555553 22333333221110 001111112211 112 12221 1334444555666779 Q ss_pred hHHHHHHHHHHHHHHhcC-CceEecCC---CCHHHHHHHHHHHHHHh----------CCCHHHHHHHHHH-hhhhcCeee Q lcl|NC_020866. 75 DPHYLGVIGTRKRSVSQL-DITVEPAS---DDPQDVKIADMVRDWLK----------REELQQEVFHILD-CISKGYSFT 139 (547) Q Consensus 75 D~~v~s~l~~Rk~~v~~~-~w~v~p~~---~~~~~~~~ae~v~~~l~----------~~~~~~~i~~~l~-a~~~G~sv~ 139 (547) ++++.+++++....|.+. .+.+.|.. +...+++.++.|++.|+ ..+|..+...++. .+.-|=.++ T Consensus 69 n~~a~~av~~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~ 148 (502) T protein:vir:79 69 HDLVIGVFDKLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFA 148 (502) T ss_pred ChHHHHHHHHHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEE Confidence 999999999999999987 57776542 22345566777776664 2468888887774 456898889 Q ss_pred eeEEeec----CCeeeEEeeEeeccceeEeccc--------------CCccee--eccCCCC-----ceeeccCceEEEe Q lcl|NC_020866. 140 EIVWNTS----EGQYFPDKLIWRDPRHFRFQKS--------------DLATPL--LLDDNGA-----EVPLPAFRFIFAD 194 (547) Q Consensus 140 Ei~w~~~----~g~~~~~~l~~~~~~~~~~~~~--------------~~~~~~--~~~~~~~-----g~~l~~~k~i~~~ 194 (547) -++|.+. +|.-++-+|..++|.++..+.+ ++..-+ .....+. .+.+|... |+|. 
T Consensus 149 ~~~~~~~~~~~~g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~-vlH~ 227 (502) T protein:vir:79 149 QMVSGRINSLTPSAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAER-MLHL 227 (502) T ss_pred EEeecccCccCCCcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhh-eEEe Confidence 8888653 3455566788888876643322 222111 1111111 13456554 4554 Q ss_pred ec-CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHH---HHHHHHHHHHHhcCeeE-Eec Q lcl|NC_020866. 195 IP-AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQD---KDTLFRAVANIAGDCAA-IIP 269 (547) Q Consensus 195 ~~-~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~---~~~l~~al~~~~~~~~~-vip 269 (547) +. .+.+..-|.+.|.++.....-.......-+.-.....+-..+.|.+.+.+... -..-......+..++.+ .++ T Consensus 228 f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~ 307 (502) T protein:vir:79 228 KFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKENERELTIQPGIIYDDLK 307 (502) T ss_pred ecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCCCCCccccccccCCccccccC Confidence 44 56888889999998876654333323333333333333334444322110000 00000111234445544 478 Q ss_pred CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc---hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 270 ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ---TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRD 346 (547) Q Consensus 270 ~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq---tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~ 346 (547) .|.+|++.++...++ .|..|.+...++|+.. +|- .||.+-. 
+||+.+-.-..-+...++....++...|-+- T Consensus 308 pGe~i~~~~p~~p~~---~~~~f~~~~lr~iaag-lGi~ye~lt~D~s-~nySs~R~~~~e~~r~~~~~q~~~~~~~~~p 382 (502) T protein:vir:79 308 PGEEIGMVKSDRPNP---NLETFRNGQLRAVAAG-SRLSFSSTARNYN-GTYSAQRQELVESTDGYLILQDWFIGAVTRP 382 (502) T ss_pred CCceeeeeCCCCCCC---CHHHHHHHHHHHHHhh-cCCCHHHHhcccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999998765444 3888888889999988 454 4677753 4787766665555556666666555555444 Q ss_pred HHHHHHHhcC--C--------CcccccEEEec--CccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcc Q lcl|NC_020866. 347 LIRPWIQLEY--G--------PQKRYPRIKIA--RPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLR 414 (547) Q Consensus 347 li~~l~~~Nf--g--------~~~~~p~~~~~--~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~ 414 (547) +-..++..-+ | ....+....|. .-...|..+.+++....++.|+ .|.+.+....|....+-.++... T Consensus 383 i~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl-~t~~~~~a~~G~D~~~v~~q~a~ 461 (502) T protein:vir:79 383 MYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGA-ATESDWVRAGGRNPDDVKRRRKA 461 (502) T ss_pred HHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCC-CCHHHHHHHcCCCHHHHHHHHHH Confidence 4444333211 1 11112233332 2234688888888889999998 55555555557643221111111 Q ss_pred cc------cccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchh Q lcl|NC_020866. 415 PL------ENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGA 472 (547) Q Consensus 415 ~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (547) .. .-+... .+...++..+.+.....+.+..+.. 
++ T Consensus 462 e~~~~~~~Gl~~~~--~~~~~~~~~~~~~~~~e~~~~~~~~---------------------e~ 502 (502) T protein:vir:79 462 EIDENRKLDLVFDT--DPASDKGGSSAATKRQEPQHTDDQS---------------------EE 502 (502) T ss_pred HHHHHHHcCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCC---------------------CC Confidence 00 000000 0000000000000000000000000 00 No 110 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=99.37 E-value=1.4e-11 Score=80.04 Aligned_cols=355 Identities=9% Similarity=0.048 Sum_probs=174.8 Q ss_pred eCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_020866. 8 LDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKR 87 (547) Q Consensus 8 ~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~ 87 (547) .+-+||... ++......+-++ .....+ . .....-+.|.+|+..+-. T Consensus 1 M~~f~k~~~------------------~~~~~~~~~~~~-----~~~~~~-~----------~~~~~~~~v~~~v~~ia~ 46 (378) T protein:vir:85 1 MNLFGKVVS------------------FSRGKLNNDTQR-----VTAWQN-E----------AVEYTSAFVTNIHNKIAN 46 (378) T ss_pred Cchhhhhhh------------------hhhcccccCCcc-----eeeeec-c----------chhhhhHHHHHHHHHHHH Confidence 222222111 111000001111 000000 0 111134569999999999 Q ss_pred HHhcCCceEecCCCCH--HH--HHHH-HHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeE Q lcl|NC_020866. 88 SVSQLDITVEPASDDP--QD--VKIA-DMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLI 156 (547) Q Consensus 88 ~v~~~~w~v~p~~~~~--~~--~~~a-e~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~ 156 (547) .|.++++.+.....+. .+ +++. .-+..+|+ ++ ...++...++ +.+.+|-+.+.+++....|.+... . 
T Consensus 47 ~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~--~ 124 (378) T protein:vir:85 47 EITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDL--L 124 (378) T ss_pred hHhhCceeEEEEeccccccccccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEE--E Confidence 9999999986432110 00 0111 12334453 23 3444555444 455789998877776555543211 0 Q ss_pred eeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe Q lcl|NC_020866. 157 WRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL 236 (547) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~ 236 (547) . ...+..+.++..|+|+..-..+.+ .+.+..+.- .-..+. +-|.|- T Consensus 125 -------------------~--~~~~~~~~~~dvih~~~~~~~~~~--~~~~~~a~~----------~~~~~~-~~~~~~ 170 (378) T protein:vir:85 125 -------------------F--ANDKKEYKPEELVRLVSPFYINED--TSILDNALA----------SIQTKL-EQGKLR 170 (378) T ss_pred -------------------e--cCCCEEEcccceEEEecCcCccch--hhHHHHHHH----------HHHHHH-hcCCcc Confidence 0 012334555666665532221111 222322211 111222 346665 Q ss_pred EEEeCCCCCCHHHHHHHHHHHH----HHh----cCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch Q lcl|NC_020866. 237 RIGKWQPGASEQDKDTLFRAVA----NIA----GDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT 308 (547) Q Consensus 237 ~vgk~~~~~~~~~~~~l~~al~----~~~----~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt 308 (547) -+.+++...+++..+++.+.++ +.. ++..++++.|++++-++.+. ... .++ ..++..++|+++ +|-. 
T Consensus 171 g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~--~~~-~~~-~~~~~~~~Ia~~-fgVP 245 (378) T protein:vir:85 171 GLLKINAFLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDY--SVL-NKD-EIELIKSELLTG-YFMN 245 (378) T ss_pred eEEEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEeccCCh--hhh-hHH-HHHHHHHHHHHH-hCCC Confidence 6667776666665544444443 322 22467888888887654332 222 243 457778899987 4432 Q ss_pred --hccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hcCCC-cccccEEEecCccccCHHHHHHHHHH Q lcl|NC_020866. 309 --TTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQ-LEYGP-QKRYPRIKIARPKTEDLTKLASSLDT 384 (547) Q Consensus 309 --lts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~-~Nfg~-~~~~p~~~~~~~~~ed~~~~a~~~~~ 384 (547) +. +|+++. +........-+...++.|+..||+.|+.+--. ..++. ...-..|..+.....|++++++.+.+ T Consensus 246 p~~l----~~s~~e-~~~~~f~~~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~ 320 (378) T protein:vir:85 246 ENIL----LGTATQ-EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHE 320 (378) T ss_pred HHHh----cCCchH-HHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHHHHHHHH Confidence 22 234432 22233555667788888888888888653211 11111 11113445555567899999999999 Q ss_pred HHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccc Q lcl|NC_020866. 385 LVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKP 444 (547) Q Consensus 385 l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (547) +++.|+ ++.+++|+++|+|+-+.++..+.+... .+........++....+......+. 
T Consensus 321 ~~~~G~-~T~NE~R~~lgl~p~~gGD~~~~~~N~-~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:85 321 NINGPI-FTQNQLLVKMGEQPIEGGDIYIANLNA-VAVKNLSDLQGSRKDVASTDETNNQ 378 (378) T ss_pred HHhCCC-cCHHHHHHHhCCCCCCCCCeEeecccc-cccccchhhcCccCCCCCCCCCCCC Confidence 999998 999999999999877666655433221 1111111111111111101000000 No 111 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.35 E-value=6.7e-11 Score=76.35 Aligned_cols=515 Identities=11% Similarity=0.012 Sum_probs=206.7 Q ss_pred CCCCCeeeCCCCCcccchhhhhh----------------------hhh-hhhhhhHH---HHhhhhccCCCHHHHHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEE----------------------VAA-PTVTGVRS---PLSGYPGDGLNPLRLAQILR 54 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~----------------------~a~-~~~~~~~~---~~~~~~~~~l~p~~~~~ilr 54 (547) |+++-+ |.|++....... ++. +....++. .+.++...++. .+...+- T Consensus 39 ~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~--n~~~~lG 111 (862) T protein:vir:99 39 LARTRQ-----NWPVQKEKPNPIIRSVKDFPFVEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGG--GAPVPIG 111 (862) T ss_pred HHhhcc-----cCCcccccCCCCCCcccccccccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcch--hhhhhcc Confidence 665433 555543322110 000 00000000 00000001110 0000110 Q ss_pred h----------------hcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCC-CHHHHHHHHHHHHHHh Q lcl|NC_020866. 55 A----------------ADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASD-DPQDVKIADMVRDWLK 117 (547) Q Consensus 55 ~----------------a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~-~~~~~~~ae~v~~~l~ 117 (547) . ...+.+..| .|+.. 
.++.+-++.++++...-.++..|+|...++ +..+.+..+.+++.++ T Consensus 112 ~~~~~s~y~~~~~~~~~~~~~~f~gy-ql~al-Y~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~ 189 (862) T protein:vir:99 112 AEGKQSSYAVPEALQDWYLSQGFIGH-QACAL-IAQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDV 189 (862) T ss_pred ccccccccccchhccccccccCcccH-HHHHH-HHhCchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHH Confidence 0 011112222 34443 357999999999999999999999986543 2333455667777788 Q ss_pred CCCHHHHHHHHH-HhhhhcCee-eeeEEeecCCee------------eEEeeEeeccceeEecc-----cCCcc---eee Q lcl|NC_020866. 118 REELQQEVFHIL-DCISKGYSF-TEIVWNTSEGQY------------FPDKLIWRDPRHFRFQK-----SDLAT---PLL 175 (547) Q Consensus 118 ~~~~~~~i~~~l-~a~~~G~sv-~Ei~w~~~~g~~------------~~~~l~~~~~~~~~~~~-----~~~~~---~~~ 175 (547) ++...+.+.+++ .+.+||-++ +-++=..++..| .++.|..++|.|..-.. .+... ... T Consensus 190 rL~v~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP 269 (862) T protein:vir:99 190 EFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEP 269 (862) T ss_pred HhhHHHHHHHHHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhcccccccccccccccccCCc Confidence 877666666666 488898554 333321222111 23456666665543211 01000 000 Q ss_pred ccCCCCceeeccCceEEEeecC------CcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCC---C Q lcl|NC_020866. 176 LDDNGAEVPLPAFRFIFADIPA------KSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGA---S 246 (547) Q Consensus 176 ~~~~~~g~~l~~~k~i~~~~~~------~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~---~ 246 (547) ......+..+-+.++|++.... ...+++|.+++..+|....--......-...+.++.+.+. |++.-. 
+ T Consensus 270 ~~y~I~g~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~--ktd~l~~l~~ 347 (862) T protein:vir:99 270 EFWIISGQKYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTTAI--HTDTAKAIAN 347 (862) T ss_pred eeeeecCeeeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhcccee--echhHhhhcc Confidence 0001123345556666554432 3455789999999998776655566666777788876543 443211 1 Q ss_pred HHHHHHHHHHHHHHhc-CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhc---cCccccccchhH Q lcl|NC_020866. 247 EQDKDTLFRAVANIAG-DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTT---TDAVAGGHAVSQ 322 (547) Q Consensus 247 ~~~~~~l~~al~~~~~-~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlt---s~~~ggs~a~~~ 322 (547) ++...+=.+.+....+ .+.+++..+.+++.+...- ++ -..+++..-.+||-+ .+-.+| +.+..|-.|.|+ T Consensus 348 ed~l~~r~~~~~~~rdN~Gi~liD~eEe~e~ls~sl-SG----L~dll~~~~q~IAaa-s~IP~tiLfGqspaGlnATGE 421 (862) T protein:vir:99 348 EDKFIQRLMFWVRYRDNHAVKVLGTDETMEQFDTSL-AD----FDAVIMGQYQLVASI-AKTPATKLLGTAPKGFNSTGE 421 (862) T ss_pred HHHHHHHHHHHHhccCcceeEEecCCCceeEEeccc-CC----hHHHHHHHHHHHHhh-hCCCceeecccCcccccCchH Confidence 1222111222333333 4678899999999887553 33 344555555677766 343332 223345666667 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHH-------HHHHHHHHHHhCCCCCCH Q lcl|NC_020866. 323 EHRLVQEDIETSDAM-ALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLT-------KLASSLDTLVRLGMEIEE 394 (547) Q Consensus 323 vh~~v~~~~~~ad~~-~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~-------~~a~~~~~l~~~G~~v~~ 394 (547) --...+-+.+++... .+...|++ |+. ++..-++.. .--.|+|......+-+ ..+++++++++.|+ ++. 
T Consensus 422 ~D~~nYyD~I~s~QE~~L~P~Ler-L~~-li~~~lg~~-~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGv-isp 497 (862) T protein:vir:99 422 FETISYHEELESIQEHVYMPFLQR-HYL-ISRLSLGIQ-HEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGV-ISP 497 (862) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHH-HHHHhcCCC-CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCC-CCH Confidence 666777787777653 35555543 443 333333321 1235566544433333 34566788888996 999 Q ss_pred HHHHHHh------CCCCCCCcchhcccccccc------cccccccccccccCCccccccccccccc-hhhhhhcccchh- Q lcl|NC_020866. 395 NEVRSRF------GFSQPKPGAKLLRPLENPA------TQAAPPGMDPGAGDLKTAERKLKPLSGQ-ADLSRGDLSLNS- 460 (547) Q Consensus 395 ~~~~e~~------Glp~p~~~e~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~- 460 (547) +++|+++ |++.-.+++....+...+. ....+....+.......+.......... ..... ..+... T Consensus 498 dEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~de~~aga~~~~~e~d~~~~p~~~-~~~~g~~ 576 (862) T protein:vir:99 498 DEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETASAKETQAGAAVTTAEGDQPNVQMVP-SMKPGQM 576 (862) T ss_pred HHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccCCcccccccccccccccCCccccCCcccccccC-CCCCCCc Confidence 9999875 3331111110000000000 0000000000000000000000000000 00000 000000 Q ss_pred hhcccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHhcCCCHHHHHHHHH--------HH Q lcl|NC_020866. 461 EMPSTGQKTGGAEIAALTDQLELEGQPEIVAMIEEIRTMLEAASSLGEFSDMLDARYGEIGASQLASRIS--------AA 532 (547) Q Consensus 461 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~~~~~~l~~~l~--------~a 532 (547) .....+........+. .....|....+-.-.++.+.+.-...+-+..+++... .-.+... ++ T Consensus 577 ~~~t~~~~a~~p~~~~---~~~~~~~~~~e~~~~~~~~~~~v~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~ 646 (862) T protein:vir:99 577 VGPEVGITAPMPEDDA---PVAGVVAKLAELQQAQMGAVTGVLARLVEQLDRMHDR-------TIAEGADIGQYDASGRT 646 (862) T ss_pred cccccccccCCCcccc---ccCcccccchhhhcCcchhhcchhhhhHHHHHhhhhh-------hhhhhcchhhhcccccc Confidence 0000000000000000 0000000000000011111111100001111111111 0000000 00 Q ss_pred HHHHH-------HHhHHhhccC Q lcl|NC_020866. 
533 MLASE-------FAGREGALDG 547 (547) Q Consensus 533 l~~A~-------~~G~~~~~~g 547 (547) ..--. ..+.+-+.-| T Consensus 647 ~~~~~~~~~~~~~~~~~~~~~~ 668 (862) T protein:vir:99 647 VKPGTIATIRPSVSGNHVGEQP 668 (862) T ss_pred ccccccCCCCCcccccccccCC Confidence 00000 0000000111 No 112 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.26 E-value=1.6e-10 Score=74.26 Aligned_cols=377 Identities=12% Similarity=0.021 Sum_probs=185.1 Q ss_pred eeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_020866. 7 LLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRK 86 (547) Q Consensus 7 ~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk 86 (547) |.-. ++..+.+.+ +-++... .+......+..|+.. .++.+-++.++++-- T Consensus 1 ~~~~-------------------D~~~n~~~g----g~~~~~~------~~~~~~~~~~~l~a~-Y~~~~l~~~~Vd~~a 50 (422) T protein:vir:10 1 MVKT-------------------DSYANIFLG----GSDGSEI------YGSLQNQAPTILASL-YADNALVRRIIDTIP 50 (422) T ss_pred Cccc-------------------hhhHHHHcC----CCCCccc------cCcccccCHHHHHHH-HHhChhhHHHHhhhh Confidence 1111 122222111 0011000 000011112333332 357899999999999 Q ss_pred HHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCC--------eeeEEeeEe Q lcl|NC_020866. 87 RSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEG--------QYFPDKLIW 157 (547) Q Consensus 87 ~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g--------~~~~~~l~~ 157 (547) .-.++..|+|+.. +.+. .+...++++.+.+.+.+++ .+.+||++++=+.=. ++. .-.++.|.. 
T Consensus 51 ed~~r~g~~i~~~--~~~~-----~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~-d~~~~~~Pl~~~g~~~~l~v 122 (422) T protein:vir:10 51 ETALAAGFHIDGI--DDEP-----AFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRV 122 (422) T ss_pred HHHhcCCccccCC--CHHH-----HHHHHHHHhhHHHHHHHHHHhhccccceEEEEEec-CCCCccccccccCceeeEEe Confidence 9999999999632 2221 1334455666555666555 689999997655421 211 112446667 Q ss_pred eccceeEeccc---------CCcceeecc--CCCCceeeccCceEEEeec------CCcCccccchhHHH-HHHHHHHHH Q lcl|NC_020866. 158 RDPRHFRFQKS---------DLATPLLLD--DNGAEVPLPAFRFIFADIP------AKSGILLRSGLARV-AAWGWMFKA 219 (547) Q Consensus 158 ~~~~~~~~~~~---------~~~~~~~~~--~~~~g~~l~~~k~i~~~~~------~~~~~p~G~gll~~-~~~~~~~K~ 219 (547) +++.++..... +....+... +.+.+..+-+.++|++.+. ....++||.+++.. ||....--. T Consensus 123 ~d~~~i~~~~~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~ 202 (422) T protein:vir:10 123 YDRTQVKVQTREENPRNARFGEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYT 202 (422) T ss_pred eccccccchhcccCccccccCcceEEEEecCCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHH Confidence 77766543211 111122121 2223456777777766433 24667789997764 777665555 Q ss_pred HHHHHHHHHHHHcCCCeEEEeC-----CCC-CCHHHHHHHHHHHHHHhcC-eeEEecCCceEEEEecCCCCCchHHHHHH Q lcl|NC_020866. 220 FTARDWAIFTQTYGQPLRIGKW-----QPG-ASEQDKDTLFRAVANIAGD-CAAIIPETMSIDFVESKSIGSSVDLYEKR 292 (547) Q Consensus 220 ~~~~~w~~f~E~~G~P~~vgk~-----~~~-~~~~~~~~l~~al~~~~~~-~~~vip~~~~ie~~e~~~~~~~~~~f~~l 292 (547) .....-+..+.++.+.+..-+- +.+ .....+.++.......+.. 
+.+++..+.+++.+...- ++ -..+ T Consensus 203 ~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~l-sg----l~~~ 277 (422) T protein:vir:10 203 NCERLATQLLKRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDI-GG----IDAF 277 (422) T ss_pred HHHHHHHHHHHHhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEeccc-CC----hHHH Confidence 5666667777787766543221 111 1111223333333333333 445667788999887553 33 3455 Q ss_pred HHHHHHHHHHHHhcchhc---cCccccccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCcccccEEEec Q lcl|NC_020866. 293 INHLDQQISKAVLGQTTT---TDAVAGGHAVSQEHRLVQEDIETSDAMA-LAAILNRDLIRPWIQLEYGPQKRYPRIKIA 368 (547) Q Consensus 293 i~~~d~~Isk~ilGqtlt---s~~~ggs~a~~~vh~~v~~~~~~ad~~~-i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~ 368 (547) ++..-.+||-+ .|-.+| +.+.+|-.|.|+--...+-+.+++.... +...+ +.|++.++. +. -..|+|. T Consensus 278 ~~~~~~~iaaa-~~IP~t~L~G~s~~Glnatgd~d~~~yyd~i~~~Qe~~l~p~l-~~l~~~i~~----s~--~~~~~f~ 349 (422) T protein:vir:10 278 LDKKFDRIVAL-SGIHEIILKNKNVGGVSSSQNTALETFHKLVDRKRNAELLPIL-EFLIPFIVN----AE--EWSVEFN 349 (422) T ss_pred HHHHHHHHHhh-hCCCeeeeccCCcccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhcc----cC--CcEEEeC Confidence 66666677766 444332 2333444456666677788888887754 33343 345554432 11 1244443 Q ss_pred Cccc-------cCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCcccccc Q lcl|NC_020866. 369 RPKT-------EDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERK 441 (547) Q Consensus 369 ~~~~-------ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (547) .... +-.++.+++++++++.|+ ++.+++|+.+.=..+ ...+.... .+..... ......+.. 
T Consensus 350 pL~~~sekekaei~~~~a~a~~~~~~~g~-i~~~e~r~~L~~~~~--~~~~~~~~-~~~~~~~-----~~~~~~~~~--- 417 (422) T protein:vir:10 350 PLAQESSKDKAEILEKNVNSIAALIAAGA-MDIDEARDTLRTIAP--EVKINDGS-VETEVTI-----SETSNDPLE--- 417 (422) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhcCC-CCHHHHHHHhhhhcc--cccCCCCC-Cccccch-----hhcCCCCCC--- Confidence 2222 222566788889999997 999999987731111 00000000 0000000 000000000 Q ss_pred cccccc Q lcl|NC_020866. 442 LKPLSG 447 (547) Q Consensus 442 ~~~~~~ 447 (547) .+..+ T Consensus 418 -~~~~d 422 (422) T protein:vir:10 418 -VPTDD 422 (422) T ss_pred -CCCCC Confidence 00000 No 113 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.24 E-value=2.9e-10 Score=72.87 Aligned_cols=385 Identities=10% Similarity=-0.007 Sum_probs=189.4 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |+++- +|+. +.++..+.+.+.- +.--. ....+....+..|+. |.++.+-++. T Consensus 5 m~~~~-------~~~~-----------~~D~~~~~~~~~~--g~~~~-------~~~~~~~~~~~~l~~-~Y~~~~l~~~ 56 (435) T protein:vir:79 5 MSDKV-------KAIT-----------KEDGYNEIFGSKD--GTFRP-------NAFYMQRAAFKALSQ-FYEEDGMARR 56 (435) T ss_pred ccccc-------ccch-----------hhcchhhhhcccc--ccccc-------CcccCCcCCHHHHHH-HHhcCchhhh Confidence 77541 1221 1223333221110 10000 001111112233332 2247899999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCC--------eee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEG--------QYF 151 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g--------~~~ 151 (547) ++++--.-.++..|+|+..++ . 
+.+++.++++...+.+.+++ .+.+||++.+=+.= .++. .-. T Consensus 57 ~Vd~~aed~~r~g~~i~g~~~---~----~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~-~d~~~~~~Pl~~~g~ 128 (435) T protein:vir:79 57 IVDVIPEEMVTPGFKVDGVKN---E----KSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVV-ADNKMLKSPVKPGAQ 128 (435) T ss_pred hhccchHHhhcCCceecCCCh---H----HHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEe-cCCCCcccccccCCc Confidence 999999999999999964321 1 12444555555555555554 78999998665442 1211 112 Q ss_pred EEeeEeeccceeEeccc---------CCcceeecc--CCCCceeeccCceEEEee------cCCcCccccchhH-HHHHH Q lcl|NC_020866. 152 PDKLIWRDPRHFRFQKS---------DLATPLLLD--DNGAEVPLPAFRFIFADI------PAKSGILLRSGLA-RVAAW 213 (547) Q Consensus 152 ~~~l~~~~~~~~~~~~~---------~~~~~~~~~--~~~~g~~l~~~k~i~~~~------~~~~~~p~G~gll-~~~~~ 213 (547) ++.|...++.++..... +....+... ++..+..+-+.++|++.+ ....+++||.+.| +.+|. T Consensus 129 i~~i~v~d~~~i~~~~~~~dp~sp~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~ 208 (435) T protein:vir:79 129 LEDIRVYDRYQITIHERETNARSVRYGEPKLYKISPGGDIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIE 208 (435) T ss_pred eeeEEeechhhccchhhccCCcccccCcceEEEEecCCCCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHHHH Confidence 34666777766543221 111112221 222356677777776643 2346788999965 78988 Q ss_pred HHHHHHHHHHHHHHHHHHcCCCeEEEeC-----CCC-CCHHHHHHHHHHHHHHhcC-eeEEecCCceEEEEecCCCCCch Q lcl|NC_020866. 
214 GWMFKAFTARDWAIFTQTYGQPLRIGKW-----QPG-ASEQDKDTLFRAVANIAGD-CAAIIPETMSIDFVESKSIGSSV 286 (547) Q Consensus 214 ~~~~K~~~~~~w~~f~E~~G~P~~vgk~-----~~~-~~~~~~~~l~~al~~~~~~-~~~vip~~~~ie~~e~~~~~~~~ 286 (547) ...--......-+..+.++.+.+...+- ..+ ...+.+.++.........+ +.+++..+.+++.+...- ++ T Consensus 209 ~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~~~l-sg-- 285 (435) T protein:vir:79 209 AIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLNSDV-SG-- 285 (435) T ss_pred HHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEeccc-CC-- Confidence 7766666667777778888766543321 111 1112222332232333333 445556667899887543 33 Q ss_pred HHHHHHHHHHHHHHHHHHhcchhc---cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc Q lcl|NC_020866. 287 DLYEKRINHLDQQISKAVLGQTTT---TDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYP 363 (547) Q Consensus 287 ~~f~~li~~~d~~Isk~ilGqtlt---s~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p 363 (547) -..+++..-.+||.+ .|-.+| +.+.+|-.|.|+--...+-+.+++.......-+.+.|++.++ ... -- T Consensus 286 --l~~~~~~~~~~iaaa-~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li~----~s~--d~ 356 (435) T protein:vir:79 286 --VPEFLQEKIDRIVAL-TGIHEIIIKNKNTGGVSASQNTALETFYKLIDRKRVEDYKPILEFLLPFMI----SET--EW 356 (435) T ss_pred --HHHHHHHHHHHHHhh-hCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----cCC--CC Confidence 355566666778766 444332 333344446666666777788887664433333333444332 221 12 Q ss_pred EEEecCccccCH-------HHHHHHHHHHHhCCCCCCHHHHHHHh-C-CCCCCCcchhcccccccccccccccccccccC Q lcl|NC_020866. 364 RIKIARPKTEDL-------TKLASSLDTLVRLGMEIEENEVRSRF-G-FSQPKPGAKLLRPLENPATQAAPPGMDPGAGD 434 (547) Q Consensus 364 ~~~~~~~~~ed~-------~~~a~~~~~l~~~G~~v~~~~~~e~~-G-lp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 434 (547) .|+|......+- +..+++++++.+.|+ ++.+++|+.+ . .+.-.-.++.......+. ...++...++.+. 
T Consensus 357 ~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~-i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~e~g~~ 434 (435) T protein:vir:79 357 SIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQA-INLKETRDTLRSICPDLKIMDNDNIELPEPE-DLDPEPGQEGGLN 434 (435) T ss_pred eEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCC-CCHHHHHHHHHHhccccCCCCcccccCCccc-cCCCCCCCCCCCC Confidence 455543333222 556778888888997 8999998876 1 221111111111100000 0000000000000 Q ss_pred C Q lcl|NC_020866. 435 L 435 (547) Q Consensus 435 ~ 435 (547) . T Consensus 435 ~ 435 (435) T protein:vir:79 435 K 435 (435) T ss_pred C Confidence 0 No 114 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=99.22 E-value=1.6e-10 Score=74.27 Aligned_cols=355 Identities=9% Similarity=0.041 Sum_probs=172.3 Q ss_pred hhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCC---CHH- Q lcl|NC_020866. 29 VTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASD---DPQ- 104 (547) Q Consensus 29 ~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~---~~~- 104 (547) +.-+.++....-...++.. ... ...+-+.+...=+.|.+|+......|.++++.+.-... ... T Consensus 1 M~if~~~~~~~~~~~~~~~--~~~-----------~~~~~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDT--QRV-----------TAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CchhHHhHhhhhcccccCc--cee-----------eeeecchhhhhhHHHHHHHHHHHHhHhhCceeeeeeccccccccc Confidence 2222222111100000000 000 00011111112357999999999999999998743211 100 Q ss_pred HHHH-HHHHHHHHh-CC----CHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcceeecc Q lcl|NC_020866. 
105 DVKI-ADMVRDWLK-RE----ELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATPLLLD 177 (547) Q Consensus 105 ~~~~-ae~v~~~l~-~~----~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 177 (547) ...+ -.-+..+|+ ++ ...++...++ +.+..|.+.+-.+|...+|.+.. +.+. T Consensus 68 ~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~--~~~~------------------- 126 (378) T protein:vir:94 68 LISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLD--LLFA------------------- 126 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEE--EEEe------------------- Confidence 0011 112334453 23 3445555554 45578988876666554444321 1110 Q ss_pred CCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHH----HH Q lcl|NC_020866. 178 DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKD----TL 253 (547) Q Consensus 178 ~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~----~l 253 (547) ..+..++++..+++++.. +.+ -+.+++..+.-. ...+. +-|.|--+.+++...+++.++ .+ T Consensus 127 --~~~~~~~~~dvih~~~~~-~~~-~~~~~~~~~~~~----------~~~~~-~~~~~~g~l~~~~~l~~~~~~~~~e~~ 191 (378) T protein:vir:94 127 --NDKKEYKPEELVRLTSPF-YIN-EDTSILDNALAS----------IQTKL-EQGKLRGLLKINAFLDIDNTQEYREKA 191 (378) T ss_pred --cCcEEechhceeeecCcC-Ccc-cchhHHHHHHHH----------HHHHH-hhCCcccceeeCCcCCHHHHHHHHHHH Confidence 112345656555444222 111 133455544321 11222 224443345666555655444 44 Q ss_pred HHHHHHHhc----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch--hccCccccccchhHHHHHH Q lcl|NC_020866. 254 FRAVANIAG----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT--TTTDAVAGGHAVSQEHRLV 327 (547) Q Consensus 254 ~~al~~~~~----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt--lts~~~ggs~a~~~vh~~v 327 (547) .+.+.+... +..++++.|++++-+..+. ... .++. .++..++|+++ +|-. +. +|+++. +..... 
T Consensus 192 ~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~--~~~-~~~~-~~~~~~~Ia~~-fgvPp~~l----~g~~~e-~~~~~f 261 (378) T protein:vir:94 192 LATIKNMQEGSSYNGLTPVDNKTEIVELKKDY--SVL-NKDE-IDLIKSELLTG-YFMNENIL----LGTATQ-EQQIYF 261 (378) T ss_pred HHHHHHhhcccccccceeccCCceEEEccCCh--HHh-hHHH-HHHHHHHHHHH-hCCCHHHh----cCCchH-HHHHHH Confidence 444443321 2357888888877554322 222 2333 47778899987 4432 22 133331 112234 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCC-cccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCC Q lcl|NC_020866. 328 QEDIETSDAMALAAILNRDLIRPWIQLE-YGP-QKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQ 405 (547) Q Consensus 328 ~~~~~~ad~~~i~~~ln~~li~~l~~~N-fg~-~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~ 405 (547) ...-+...++.++..||+.|+..--..- ++. ......|.++.....|.+.+++++.++.+.|+ ++.+++|+.+|+|+ T Consensus 262 ~~~tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~-~t~NE~R~~~g~~p 340 (378) T protein:vir:94 262 YNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPI-FTQNQLLVKMGEQP 340 (378) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCC-cCHHHHHHHhCCCC Confidence 4556778888899999988865422211 111 11223455556667899999999999999997 99999999999987 Q ss_pred CCCcchhcccccccccccccccccccccCCccccccccc Q lcl|NC_020866. 406 PKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKP 444 (547) Q Consensus 406 p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (547) -+.++..+.+... .+........++....+......+. 
T Consensus 341 ~~ggd~~~~~~n~-~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 341 IEGGDVYIANLNA-VAVKNLSDLQGNRKDVTSTDETNNQ 378 (378) T ss_pred CCCCCeeeecccc-cchhcchhcccccCCCCCCCCCCCC Confidence 7666665544222 1111111111111111100000000 No 115 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.21 E-value=5.3e-10 Score=71.42 Aligned_cols=405 Identities=10% Similarity=0.046 Sum_probs=192.4 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcC----------CCHHHHHHHHH- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADH----------GDPVRYLDLAE- 69 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~----------g~~~~~~~L~~- 69 (547) |- +.++=.-++-..|+.+ .+.+.+..|.. ...|+....+-..+ .....+...+. T Consensus 1 ~~-----~~~~~~~~~~~~~~~e-------~i~~~i~~~~~---~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~ 65 (474) T protein:vir:94 1 MT-----LYKLIDDIEAQGILPK-------HIEALIESHKD---DRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETG 65 (474) T ss_pred Cc-----hHHHHhhccccCCCHH-------HHHHHHHHhhh---hhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhc Confidence 32 1111111111111111 01111111110 01111111111100 00000000000 Q ss_pred ---HH-------HhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCee Q lcl|NC_020866. 70 ---VI-------EERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSF 138 (547) Q Consensus 70 ---~m-------~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv 138 (547) .. .-..+...-.+.+....+.+.+..+...++...++++.+++.++++.-.|+.....+. 
++..||.+ T Consensus 66 ~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a- 144 (474) T protein:vir:94 66 GNVRRLDVSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYG- 144 (474) T ss_pred ccccccccCcccccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeE- Confidence 00 0014555666777778888889888876666667777788989888778888887766 68889974 Q ss_pred eeeEEeecCCeeeEEeeEeeccceeE--ecccCCc---------------------------ceee-ccCCCCc-ee--- Q lcl|NC_020866. 139 TEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLA---------------------------TPLL-LDDNGAE-VP--- 184 (547) Q Consensus 139 ~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~---------------------------~~~~-~~~~~~g-~~--- 184 (547) .+++|...+|.+. +..++|+... ++..... .... ...+..+ .. T Consensus 145 ~~~~~~d~~~~~~---~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~ 221 (474) T protein:vir:94 145 ARLAYIDTNGDIR---IKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGR 221 (474) T ss_pred EEEEEeCCCCeeE---EEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCccccccc Confidence 5777865566543 3344444321 1111000 0000 0011000 00 Q ss_pred e-ccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc- Q lcl|NC_020866. 185 L-PAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG- 262 (547) Q Consensus 185 l-~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~- 262 (547) . .++..|-+.+ -.++++|.|.+..+....--=...+...+..++.|..|+++.+= .+.+++.... +.. 
T Consensus 222 ~~~~~g~vPvv~--~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g-~~~~~~~~~~-------~~~~ 291 (474) T protein:vir:94 222 YEHLFDYNPLFG--VPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRG-MGMSEEMIQE-------TQKS 291 (474) T ss_pred ccCCCCccceEE--ecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcc-CCCCchhhhh-------hhhc Confidence 0 1112221111 13467899999987665555555688888888999999887652 2223332222 222 Q ss_pred CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHH---HHHHHHHHHHHHH Q lcl|NC_020866. 263 DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRL---VQEDIETSDAMAL 339 (547) Q Consensus 263 ~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~---v~~~~~~ad~~~i 339 (547) ....+.+.+.+++++..... ...++..++.+.+.|...--+..++.++.+ +..+|..-.. -....+..-.+.+ T Consensus 292 ~~i~~~~~~~~~~~l~~~~~---~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~ 367 (474) T protein:vir:94 292 GAFELFDKDMDVKYLTKDVN---DTMIENHLDRIEKNIMRFAKSVNFNSDEFN-GNVPIIGMKLKLMALENKCMTFERKM 367 (474) T ss_pred ceeEecCCCCceeEEeccCC---HHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHH Confidence 33445588899999875532 345888899998888775222223333322 2333332211 1122233334556 Q ss_pred HHHHHHHHHHHHHHh-cC---C---CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcch Q lcl|NC_020866. 340 AAILNRDLIRPWIQL-EY---G---PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAK 411 (547) Q Consensus 340 ~~~ln~~li~~l~~~-Nf---g---~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~ 411 (547) ...+. ++++.++.+ +. + ......++.|...-+.+..+.++.+.++. |+ ++.+.+.+.++. 
+.++..-+ T Consensus 368 ~~~l~-~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~-iS~et~~~~l~~v~d~~~E~e 443 (474) T protein:vir:94 368 TAMLR-YQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQ-VSERTRLGQSQLVDDVDYELD 443 (474) T ss_pred HHHHH-HHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh--cc-CchHHHHHhCCCCCCHHHHHH Confidence 66663 355555543 21 1 11233578888888999999999999985 66 899999999874 43321111 Q ss_pred hcccccccccccccccccccccCCccccccccccc Q lcl|NC_020866. 412 LLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLS 446 (547) Q Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (547) -+...........+... .+..... .....+. T Consensus 444 ri~~E~~e~~~~~~~~~-~~~~~~~---~~~~~s~ 474 (474) T protein:vir:94 444 EMEKESLEFNDKLPDID-EGDANDK---SQNNQSE 474 (474) T ss_pred HHHHHHHHHHhhccccc-CCCcCCC---CccccCC Confidence 11111100000000000 0000000 0000000 No 116 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.21 E-value=5.3e-10 Score=71.42 Aligned_cols=405 Identities=10% Similarity=0.046 Sum_probs=192.4 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcC----------CCHHHHHHHHH- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADH----------GDPVRYLDLAE- 69 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~----------g~~~~~~~L~~- 69 (547) |- +.++=.-++-..|+.+ .+.+.+..|.. ...|+....+-..+ .....+...+. 
T Consensus 1 ~~-----~~~~~~~~~~~~~~~e-------~i~~~i~~~~~---~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~ 65 (474) T protein:vir:10 1 MT-----LYKLIDDIEAQGILPK-------HIEALIESHKD---DRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETG 65 (474) T ss_pred Cc-----hHHHHhhccccCCCHH-------HHHHHHHHhhh---hhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhc Confidence 32 1111111111111111 01111111110 01111111111100 00000000000 Q ss_pred ---HH-------HhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCee Q lcl|NC_020866. 70 ---VI-------EERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSF 138 (547) Q Consensus 70 ---~m-------~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv 138 (547) .. .-..+...-.+.+....+.+.+..+...++...++++.+++.++++.-.|+.....+. ++..||.+ T Consensus 66 ~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a- 144 (474) T protein:vir:10 66 GNVRRLDVSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYG- 144 (474) T ss_pred ccccccccCcccccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeE- Confidence 00 0014555666777778888889888876666667777788989888778888887766 68889974 Q ss_pred eeeEEeecCCeeeEEeeEeeccceeE--ecccCCc---------------------------ceee-ccCCCCc-ee--- Q lcl|NC_020866. 139 TEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLA---------------------------TPLL-LDDNGAE-VP--- 184 (547) Q Consensus 139 ~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~---------------------------~~~~-~~~~~~g-~~--- 184 (547) .+++|...+|.+. +..++|+... ++..... .... ...+..+ .. T Consensus 145 ~~~~~~d~~~~~~---~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~ 221 (474) T protein:vir:10 145 ARLAYIDTNGDIR---IKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGR 221 (474) T ss_pred EEEEEeCCCCeeE---EEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCccccccc Confidence 5777865566543 3344444321 1111000 0000 0011000 00 Q ss_pred e-ccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc- Q lcl|NC_020866. 
185 L-PAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG- 262 (547) Q Consensus 185 l-~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~- 262 (547) . .++..|-+.+ -.++++|.|.+..+....--=...+...+..++.|..|+++.+= .+.+++.... +.. T Consensus 222 ~~~~~g~vPvv~--~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g-~~~~~~~~~~-------~~~~ 291 (474) T protein:vir:10 222 YEHLFDYNPLFG--VPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRG-MGMSEEMIQE-------TQKS 291 (474) T ss_pred ccCCCCccceEE--ecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcc-CCCCchhhhh-------hhhc Confidence 0 1112221111 13467899999987665555555688888888999999887652 2223332222 222 Q ss_pred CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHH---HHHHHHHHHHHHH Q lcl|NC_020866. 263 DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRL---VQEDIETSDAMAL 339 (547) Q Consensus 263 ~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~---v~~~~~~ad~~~i 339 (547) ....+.+.+.+++++..... ...++..++.+.+.|...--+..++.++.+ +..+|..-.. -....+..-.+.+ T Consensus 292 ~~i~~~~~~~~~~~l~~~~~---~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~ 367 (474) T protein:vir:10 292 GAFELFDKDMDVKYLTKDVN---DTMIENHLDRIEKNIMRFAKSVNFNSDEFN-GNVPIIGMKLKLMALENKCMTFERKM 367 (474) T ss_pred ceeEecCCCCceeEEeccCC---HHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHH Confidence 33445588899999875532 345888899998888775222223333322 2333332211 1122233334556 Q ss_pred HHHHHHHHHHHHHHh-cC---C---CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcch Q lcl|NC_020866. 340 AAILNRDLIRPWIQL-EY---G---PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAK 411 (547) Q Consensus 340 ~~~ln~~li~~l~~~-Nf---g---~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~ 411 (547) ...+. ++++.++.+ +. + ......++.|...-+.+..+.++.+.++. |+ ++.+.+.+.++. 
+.++..-+ T Consensus 368 ~~~l~-~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~-iS~et~~~~l~~v~d~~~E~e 443 (474) T protein:vir:10 368 TAMLR-YQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQ-VSERTRLGQSQLVDDVDYELD 443 (474) T ss_pred HHHHH-HHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh--cc-CchHHHHHhCCCCCCHHHHHH Confidence 66663 355555543 21 1 11233578888888999999999999985 66 899999999874 43321111 Q ss_pred hcccccccccccccccccccccCCccccccccccc Q lcl|NC_020866. 412 LLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLS 446 (547) Q Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (547) -+...........+... .+..... .....+. T Consensus 444 ri~~E~~e~~~~~~~~~-~~~~~~~---~~~~~s~ 474 (474) T protein:vir:10 444 EMEKESLEFNDKLPDID-EGDANDK---SQNNQSE 474 (474) T ss_pred HHHHHHHHHHhhccccc-CCCcCCC---CccccCC Confidence 11111100000000000 0000000 0000000 No 117 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.20 E-value=4.9e-10 Score=71.62 Aligned_cols=379 Identities=12% Similarity=0.025 Sum_probs=184.7 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhh-hccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGY-PGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYL 79 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~-~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~ 79 (547) |= ++ +.++..+++.+. ..+ . ..-... ...++|+.. .++.+-++ T Consensus 1 ~~----~~-------------------~~d~~~~~~~~~~~~~---~---~~~~~~------~~~~~l~a~-Y~~~~l~~ 44 (427) T protein:vir:10 1 MK----IV-------------------KHDGYNDIFNGGADGS---P---KPFFMS------DASYHVGSF-YNDNATAK 44 (427) T ss_pred CC----cc-------------------ccchHHHHhhcCCCCc---c---cCcccc------CchHHHHHH-HHcCchhh Confidence 10 11 122222222211 000 0 000001 111244433 34789999 Q ss_pred HHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeec-------CCeee Q lcl|NC_020866. 
80 GVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTS-------EGQYF 151 (547) Q Consensus 80 s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~-------~g~~~ 151 (547) .++++--.-.++..|+|+..+ .. +.++..++++.+.+.+.+++ .+.+||++++=+.=.-+ .+.-. T Consensus 45 ~~Vd~~aed~~r~g~~i~g~~---~~----~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~ 117 (427) T protein:vir:10 45 RIVDVIPEEMVTAGFKMSGVK---DE----KEFKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAK 117 (427) T ss_pred hhhccchHHhhcCCccccCcc---HH----HHHHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcc Confidence 999999888889999997422 11 23455566666666666666 69999999875532100 11223 Q ss_pred EEeeEeeccceeEeccc---------CCcceeecc--CCCCceeeccCceEEEeec------CCcCccccchhHH-HHHH Q lcl|NC_020866. 152 PDKLIWRDPRHFRFQKS---------DLATPLLLD--DNGAEVPLPAFRFIFADIP------AKSGILLRSGLAR-VAAW 213 (547) Q Consensus 152 ~~~l~~~~~~~~~~~~~---------~~~~~~~~~--~~~~g~~l~~~k~i~~~~~------~~~~~p~G~gll~-~~~~ 213 (547) ++.|.+.++.++..... +....+... +...+..+-+.++|++.+. ....+.||.+++. .+|. T Consensus 118 l~~l~v~d~~~~~~~~~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~ 197 (427) T protein:vir:10 118 LEGVRVYDRFAITVEKRVTNARSPRYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLID 197 (427) T ss_pred eeEEEEechhcccccccccCccccccCcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHH Confidence 45677777665533211 111111111 1223356777777766433 2467788999775 5565 Q ss_pred HHHHHHHHHHHHHHHHHHcCCCeEEEeC------CCCCCHHHHHHHHHHHHHHhcC-eeEEecCCceEEEEecCCCCCch Q lcl|NC_020866. 214 GWMFKAFTARDWAIFTQTYGQPLRIGKW------QPGASEQDKDTLFRAVANIAGD-CAAIIPETMSIDFVESKSIGSSV 286 (547) Q Consensus 214 ~~~~K~~~~~~w~~f~E~~G~P~~vgk~------~~~~~~~~~~~l~~al~~~~~~-~~~vip~~~~ie~~e~~~~~~~~ 286 (547) ...--......-+..+.++.+.+.--+- ........+.++....+..+.. 
+.+++.++.+++.+...- ++ T Consensus 198 ~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~l-sg-- 274 (427) T protein:vir:10 198 AICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNSDI-SG-- 274 (427) T ss_pred HHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEeccc-CC-- Confidence 5444444555556667787766542220 0111122233333333334433 445556678899886543 33 Q ss_pred HHHHHHHHHHHHHHHHHHhcchhc---cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc Q lcl|NC_020866. 287 DLYEKRINHLDQQISKAVLGQTTT---TDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYP 363 (547) Q Consensus 287 ~~f~~li~~~d~~Isk~ilGqtlt---s~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p 363 (547) -..+++..-.+||-+ .+-.+| +.+.+|-.|.|+--...+-+.+++.......-+.+.|++.++ .+ . -- T Consensus 275 --l~~~~~~~~~~iaaa-~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~-~s---~--~~ 345 (427) T protein:vir:10 275 --VPEFLSSKMDRIVSL-SGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV-DE---E--EW 345 (427) T ss_pred --hHHHHHHHHHHHHhh-hCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cC---C--Cc Confidence 344566666677766 444332 333344446666667777788887764433333334555442 21 1 12 Q ss_pred EEEecCccccC-------HHHHHHHHHHHHhCCCCCCHHHHHHHhC----CCCCCCcchhcccccccccccccccccccc Q lcl|NC_020866. 364 RIKIARPKTED-------LTKLASSLDTLVRLGMEIEENEVRSRFG----FSQPKPGAKLLRPLENPATQAAPPGMDPGA 432 (547) Q Consensus 364 ~~~~~~~~~ed-------~~~~a~~~~~l~~~G~~v~~~~~~e~~G----lp~p~~~e~~~~~~~~~~~~~~~~~~~~~~ 432 (547) .++|...-..+ .++.+++++++++.|+ ++.+++++.+- ...-.+..+.-... ......++|+. 
T Consensus 346 ~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gv-i~~~e~r~~L~~~~~~~~~~~~~~~~~e~-~~~~~e~~p~~---- 419 (427) T protein:vir:10 346 SIEFEPLSVPSKKEESEITKNNVESVTKAITEQI-IDLEEARDTLRSIAPEFKLKDGNNINIRE-PEETTEPEPGL---- 419 (427) T ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCC-CCHHHHHHHHHhhhccccCCCCccccccc-cchhcCCCCCC---- Confidence 44453222222 2466788889999997 88888887662 21111111110000 00000000000 Q ss_pred cCCccccccccccccchh Q lcl|NC_020866. 433 GDLKTAERKLKPLSGQAD 450 (547) Q Consensus 433 ~~~~~~~~~~~~~~~~~~ 450 (547) .+...+.+ T Consensus 420 ----------~e~~~d~~ 427 (427) T protein:vir:10 420 ----------GEKLEDEN 427 (427) T ss_pred ----------CCCCCCCC Confidence 00000000 No 118 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.20 E-value=6.1e-10 Score=71.09 Aligned_cols=409 Identities=10% Similarity=-0.009 Sum_probs=185.1 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhh-----hhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHH---HHHHH---- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAP-----TVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVR---YLDLA---- 68 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~-----~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~---~~~L~---- 68 (547) |- |.+ + ..+..++.-+... +..-+...+..|..+ ..++....+ .-.|.... ..... T Consensus 1 ~~--~~~-----~-~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~---~~~~~~~~~-YY~g~~~i~~~~~~~~~~~~ 68 (472) T protein:vir:93 1 MY--PSQ-----P-TQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEK---LPEISIGQE-YYEQRPDIVKEPKPVDATGA 68 (472) T ss_pred CC--CCC-----C-cchhhhhceeeecCchhhHHHHHHHHHHHHHHH---HHHHHHHHH-Hhccccccccccchhhcccc Confidence 21 111 0 0000000000000 000011111111111 011111111 11111000 00000 Q ss_pred HHHHh-----hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeE Q lcl|NC_020866. 
69 EVIEE-----RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIV 142 (547) Q Consensus 69 ~~m~~-----~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~ 142 (547) ....+ ..+...-.+++....+.+.+.++...+ .+..++++.++++ +|++.+.++. ++.-||.+ ++++ T Consensus 69 ~~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~~~~d-----~~~~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~-~~~v 141 (472) T protein:vir:93 69 VDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD-----DEVVKRIDEVLGN-RFDDKLHSVLTGASNKGIE-WLHP 141 (472) T ss_pred ccccccccccccchHHHHHHHHhhhhcccCeeeccCC-----hHHHHHHHHHHhc-cHHHHHHHHHHHHhhcCeE-EEEE Confidence 00000 135555667777777788887776432 2345677887765 7778877775 68889985 5677 Q ss_pred EeecCCeeeEEeeEeeccceeEe--cccCCccee----e-ccCCCCceee-c---------------------------- Q lcl|NC_020866. 143 WNTSEGQYFPDKLIWRDPRHFRF--QKSDLATPL----L-LDDNGAEVPL-P---------------------------- 186 (547) Q Consensus 143 w~~~~g~~~~~~l~~~~~~~~~~--~~~~~~~~~----~-~~~~~~g~~l-~---------------------------- 186 (547) |...+|.. ++..++|+.+.. +......+. . .......+.+ . T Consensus 142 ~~d~d~~~---~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (472) T protein:vir:93 142 YLDEEGEF---KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTH 218 (472) T ss_pred EECCCCce---EEEEEcccceEEEEcCCCCCceEEEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccccc Confidence 76555554 344455554322 111111100 0 0000000000 0 Q ss_pred ----cCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_020866. 187 ----AFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAG 262 (547) Q Consensus 187 ----~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~ 262 (547) ++..|-+. .-.++++|.|.+..+....---...+..++.-++.|..|+++++-. +.++..... ..+.. 
T Consensus 219 ~~~~~~~~vPvv--~~~nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~---~~~~~~~~~---~~~~~ 290 (472) T protein:vir:93 219 FSTGSWGKIPFI--PFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY---DDQELPEFK---RLLRY 290 (472) T ss_pred cccCCCCCcceE--EecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecC---CcccchhhH---HHHhh Confidence 01111000 0123678999999865555455557788888889999999988732 222222222 23344 Q ss_pred CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHH---HHHHHHHHHHHHHH Q lcl|NC_020866. 263 DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHR---LVQEDIETSDAMAL 339 (547) Q Consensus 263 ~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~---~v~~~~~~ad~~~i 339 (547) ...+.++.+.+++++.... ....++.+++.+.+.|...--...++.++.+| ..+|..-. .-....+..-.+.+ T Consensus 291 ~~~~~~~~~~~~~~l~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Al~~~~~~l~~ka~~~~~~~ 366 (472) T protein:vir:93 291 YGAIKVSDNGGVDTIQVEV---PVENSKKYLDELYQKIMLFGQAVDFSSDKFGS-APSGVALEFLYTNLNLKADKLARKA 366 (472) T ss_pred ccccccCCCCcceeEeecC---CHHHHHHHHHHHHHHHHHHhCCCCCCcccccc-CchHHHHHHHHHHHHHHHHHHHHHH Confidence 5667789999999986443 23358888888888777662222233333222 22333211 11222344445666 Q ss_pred HHHHHHHHHHHHHHhc-CCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcchhccccc Q lcl|NC_020866. 340 AAILNRDLIRPWIQLE-YGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAKLLRPLE 417 (547) Q Consensus 340 ~~~ln~~li~~l~~~N-fg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~~~~~~~ 417 (547) ...+. ++++.++.+. ......-..+.|....+.+..+.++.+.+++ |+ ++.+.+.+.++. +.|+..=+-+.... 
T Consensus 367 ~~~l~-~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~~~~~~k~~--gi-is~et~l~~l~~~~d~~~E~~ri~~E~ 442 (472) T protein:vir:93 367 KVAIQ-ELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSM--GI-VSHETVLENHPFVEDLQAELERIEQEQ 442 (472) T ss_pred HHHHH-HHHHHHHHHhCCCcccceeeEEeCCCCCCCHHHHHHHHHHHh--cc-CchHHHHHhCCCCCCHHHHHHHHHHHH Confidence 66663 4666666654 2222233467788888899999999998874 76 888888888864 43321101111110 Q ss_pred ccccccccccccccccCCcccccccccccc Q lcl|NC_020866. 418 NPATQAAPPGMDPGAGDLKTAERKLKPLSG 447 (547) Q Consensus 418 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (547) ..................+....+...... T Consensus 443 ~~~~~~~~~~~~~~~d~~~~~~~~~~~~~e 472 (472) T protein:vir:93 443 MEYNKQLPNLDDGGADGAQQQERSNNKESE 472 (472) T ss_pred HHHHHhccCcCcccCCCCCCCCCCCcccCC Confidence 000000000000000000000000000000 No 119 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=99.19 E-value=1.4e-11 Score=80.13 Aligned_cols=240 Identities=10% Similarity=0.038 Sum_probs=147.5 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |. | +.++..+.....+.. ....-..+..+.+.... ...... 
.| +=+.|.+ T Consensus 1 Mg----l---F~~~~~r~~~~~~~~---~~~~~~~~~~~~~~~~~--------------~v~~~~----al--~~~~v~~ 50 (251) T protein:vir:46 1 MG----I---FYKNEKRDLQYNEDD---LQMMVQTLPSFQGTKLR--------------QYKDIE----AI--RHSDIFT 50 (251) T ss_pred CC----c---cccccccccCCCccc---hhhhhhhhccccCcCcc--------------eechhh----hh--ccHHHHH Confidence 65 3 334333221111111 00000011111111000 000100 11 3577999 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHH-HHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEee Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPASDDPQDVKIADMVRD-WLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWR 158 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~-~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~ 158 (547) |++.+...|.+++|++........+..+...+.. --..+...++++.+. +.+.+|-+.++++...+| .+..|.++ T Consensus 51 ~i~~ia~~iA~lp~~~~~~~~~~~~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G---~~~~L~~i 127 (251) T protein:vir:46 51 AVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTG---EPMNLTFR 127 (251) T ss_pred HHHHHHHhHhhCceEEeeCccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC---cEEEEEEE Confidence 9999999999999999865433322223333321 112234667777777 468899999999865433 47789999 Q ss_pred ccceeEecccCCcceee--c--c--CCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020866. 159 DPRHFRFQKSDLATPLL--L--D--DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTY 232 (547) Q Consensus 159 ~~~~~~~~~~~~~~~~~--~--~--~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~ 232 (547) +|.++++..++...+.. . . ..+....++++.+|++++... 
+..+|.|.+..+....-......++...+...- T Consensus 128 ~~~~v~v~~~~~g~~~~~~~~~~~~~~g~~~~~~~~diiH~r~~~~-dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng 206 (251) T protein:vir:46 128 KTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSL-DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNG 206 (251) T ss_pred CCceEEEEECCCCcEEEEEEEeccCCcceeEEECCccEEEecCcCC-CCeeecCHHHHHHHHHHHHHHHHHHHHHHHHcc Confidence 99998876654443332 1 1 123346788898888777654 457999999999999999999999999999999 Q ss_pred CCCeEEEeCCCCC-CHHHHHHHHHHHHHHhc--CeeEEecCCceE Q lcl|NC_020866. 233 GQPLRIGKWQPGA-SEQDKDTLFRAVANIAG--DCAAIIPETMSI 274 (547) Q Consensus 233 G~P~~vgk~~~~~-~~~~~~~l~~al~~~~~--~~~~vip~~~~i 274 (547) +.|--+-+++... +++.++++.+.+.+.-+ +.++.++.||+= T Consensus 207 ~~p~gil~~~~~l~~~e~~~~~~~~~~~~~~g~~n~g~~~~gm~~ 251 (251) T protein:vir:46 207 THAGGILKMKGVLDNKKARDRAREEFPKVLVELNKLGKLSYSMNQ 251 (251) T ss_pred CCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCcccccccccccCC Confidence 9999999998665 45557778777777644 344556666654 No 120 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=99.18 E-value=3.8e-10 Score=72.25 Aligned_cols=331 Identities=12% Similarity=0.006 Sum_probs=180.3 Q ss_pred CCCCCeeeCCCCCcc---cchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCH----HHHHHHHHHHHh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPV---QRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDP----VRYLDLAEVIEE 73 (547) Q Consensus 1 ma~~~~~~~~~g~~~---~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~----~~~~~L~~~m~~ 73 (547) |.++-+ .++-. .........++.+...+..+.-+-|..-++..++.+.+.-...|+. .....|.+.. 
+ T Consensus 1 m~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~la~~~-~ 75 (350) T protein:vir:11 1 MSKRRS----HRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSMEGLAKSV-G 75 (350) T ss_pred CCcccc----CCCcCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHHHHHHHH-h Confidence 876322 11111 0111122222222222212222334333444433444444444542 1122232222 3 Q ss_pred hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEE Q lcl|NC_020866. 74 RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPD 153 (547) Q Consensus 74 ~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~ 153 (547) .-+|-.++|..++.-+.+. +.|...-. ...|..++. +.+.+|.+.+|++....| .+. T Consensus 76 ~~~~h~~~l~~k~n~l~~~---~~Pn~~~t--------------~~~f~~~v~---d~ll~Gnay~~~~rn~~G---~~~ 132 (350) T protein:vir:11 76 SSVYLQSGLKFKRNMLAKT---FIPHRLLS--------------RATFEQFSL---DWLTFGSAYLEQPRSRLG---TRM 132 (350) T ss_pred hhhhhccchhhhhhhhhhc---ccCCCCCC--------------HHHHHHHHH---HHHhcCCeEEEEEEcCCC---CEE Confidence 5677788887766655542 34543211 112444443 455789999999865433 356 Q ss_pred eeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_020866. 154 KLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYG 233 (547) Q Consensus 154 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G 233 (547) .|.++++.+.+...+++..+. 
....+....+++...|+++.....+..||.+.+..+.-....-.....+-..|....+ T Consensus 133 ~L~~l~~~~vr~~~~~~~~~~-~~~~~~~~~~~~~eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa 211 (350) T protein:vir:11 133 PLQAPLAKYMRRGTDLETFYQ-VRSWKDEHEFEKGSVIQLREADINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGS 211 (350) T ss_pred EEEEeCCceeEeeecCCeEEE-EeeCCeEEEECcccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 899999998888776654333 3445566788888877666555566789999999998888888887888888888888 Q ss_pred CCeEEEeC-CCCCCHHHHHHHHHHHHHHhcC---e-eEE-ecCC--ceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHh Q lcl|NC_020866. 234 QPLRIGKW-QPGASEQDKDTLFRAVANIAGD---C-AAI-IPET--MSIDFVESKSIGSSVDLYEKRINHLDQQISKAVL 305 (547) Q Consensus 234 ~P~~vgk~-~~~~~~~~~~~l~~al~~~~~~---~-~~v-ip~~--~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~il 305 (547) .|--|-++ ++..++++++++.+++++.... . .+| .|.| ..++++..+.. ..-..|.+.-++-..+|+.+.- T Consensus 212 ~~~gil~~~~~~ls~e~~~~l~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~pl~~~-~~d~qf~e~k~~~~~eIa~a~~ 290 (350) T protein:vir:11 212 HAGFILYMTDAAQNEEDIDALRTALKTAKGPGNFRNLFVYAPNGKKEGIQLIPVSEV-AAKDEFGSIKNISRDDQLAGLR 290 (350) T ss_pred CCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeeecCCCCccceEEEEcCCC-hhHHHHHHHHHHhHHHHHHHhC Confidence 88666665 4568999999999999875321 1 133 3332 23555544432 3333588888888888988842 Q ss_pred c-chhcc--CccccccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCH Q lcl|NC_020866. 
306 G-QTTTT--DAVAGGHAVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDL 375 (547) Q Consensus 306 G-qtlts--~~~ggs~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~ 375 (547) - ..|.+ +..+|+++-.+....+ ...-+.--++.+++ +|+.|.+.+++ |....-..+ T Consensus 291 VPp~llGi~~~~t~~~sn~e~~~~~f~~~~L~P~~~~ie~-ln~~l~~~~~~-------------F~~~~~~~l 350 (350) T protein:vir:11 291 VYPQLMGVVPQNAGGFGSISDAAAVWASLELAPMQTRLQQ-VNEMIGEEVVR-------------FAQFDAPGL 350 (350) T ss_pred CCHHHhcccCCCCCCcCCHHHHHHHHHHHHHHHHHHHHHH-HHhhcCccccc-------------cCcccccCC Confidence 1 22322 2223445443333333 33445555666653 55544322222 222222222 No 121 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.18 E-value=7.7e-10 Score=70.54 Aligned_cols=408 Identities=11% Similarity=0.016 Sum_probs=190.3 Q ss_pred CCCCCeeeCCCCCcccch--hhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHh----- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRA--QLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEE----- 73 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~--~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~----- 73 (547) |||. +=.-|+.+-.. +.++..- ..+..-...-++...+...++... ....++..+++.+.- T Consensus 1 ~~~~---~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~e~~~~~i~~~i~~~~-~~~~r~~~l~~YY~g~~~i~ 68 (483) T protein:vir:12 1 MAQA---LIKGGNILYPSQPTQTEIFD--------AIVRTNNKPETLEEMIVRYIKQHL-EKLPEISIGQEYYEQRPDIV 68 (483) T ss_pred Cccc---hhcCCceeecCcchhhhhhh--------cccccCCchhhHHHHHHHHHHHHH-HHHHHHHHHHHHhccccccc Confidence 9962 22334444211 1111100 000000000111111111111111 112222222221100 Q ss_pred ---------------------hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-Hh Q lcl|NC_020866. 
74 ---------------------RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DC 131 (547) Q Consensus 74 ---------------------~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a 131 (547) ......-.+++....+.+.+.++...+ .+..+++++++++ ++++.+.++. ++ T Consensus 69 ~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~~d-----~~~~~~l~~~~~n-~~~~~~~~~~~~~ 142 (483) T protein:vir:12 69 KEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD-----DEVVKRIDEVLGN-RFDDKLHSVLTGA 142 (483) T ss_pred cccccccccccccccccccccccchHHHHHHHHhhhhcccCceeccCC-----hHHHHHHHHHHhc-cHHHHHHHHHHHH Confidence 134455566677777778887775432 2345677777765 6777777764 68 Q ss_pred hhhcCeeeeeEEeecCCeeeEEeeEeeccceeE--ecccCCcce------eeccCCCCceeec----------------- Q lcl|NC_020866. 132 ISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATP------LLLDDNGAEVPLP----------------- 186 (547) Q Consensus 132 ~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~------~~~~~~~~g~~l~----------------- 186 (547) ..||.+ .+++|.-.+|... +..++|+.+. ++......+ ....+. ..+.+. T Consensus 143 ~~~G~~-y~~v~~d~d~~~~---i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~-~~~~~y~~~~v~~~~~~~~~~~~ 217 (483) T protein:vir:12 143 SNKGIE-WLHPYLDEEGEFK---LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE-TKVEYWDKVTVNYYVYENGSLIP 217 (483) T ss_pred hhCCeE-EEEEEEcCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEEeecc-eEEEEEecCeEEEEEEeCCeeee Confidence 889985 5566754555543 4445555432 111110100 000000 000000 Q ss_pred ----------------cCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHH Q lcl|NC_020866. 187 ----------------AFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDK 250 (547) Q Consensus 187 ----------------~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~ 250 (547) ++..|-+. .-.+++.|.|.+..+....---...+.+++..++.|..|+++.+-... ++. 
T Consensus 218 ~~~~~~~~~~~~~~~~~~g~vPvv--~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~---~~~ 292 (483) T protein:vir:12 218 DYSNNLENSKTHFSTGSWGKIPFI--PFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDD---QEL 292 (483) T ss_pred cccccccccccccccCCCCccceE--EecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc---ccc Confidence 01111000 012367889999886666555566788899999999999998873222 221 Q ss_pred HHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHH---HH Q lcl|NC_020866. 251 DTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHR---LV 327 (547) Q Consensus 251 ~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~---~v 327 (547) ... ...+.....+.++.+.+++++.... ....++.+++.+.+.|...--...++.++-+| ..+|..-. .- T Consensus 293 ~~~---~~~~~~~~~~~~~~~~~~~~l~~~~---~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~-n~Sg~Al~~~~~~ 365 (483) T protein:vir:12 293 PEF---KRLLRYYGAIKVSDNGGVDTIQVEV---PVENSKKYLDELYQKIMLFGQAVDFSSDKFGS-APSGVALEFLYTN 365 (483) T ss_pred hhH---HHhhhhccccccCCCCcceEEeecC---CHHHHHHHHHHHHHHHHHHhCCCCCCcccccc-CcHHHHHHHHHHH Confidence 111 1233445566778999999987543 23358888888888877663222233343222 22333221 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CC Q lcl|NC_020866. 328 QEDIETSDAMALAAILNRDLIRPWIQLE-YGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQ 405 (547) Q Consensus 328 ~~~~~~ad~~~i~~~ln~~li~~l~~~N-fg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~ 405 (547) ....+..-.+.+...+. ++++.++.+. .........+.|....+.+..+.++.+.++. |+ ++.+.+.+.++. +. 
T Consensus 366 l~~k~~~~~~~f~~~l~-~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~--Gi-iS~et~~~~~~~v~d 441 (483) T protein:vir:12 366 LNLKADKLARKAKVAIQ-ELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSM--GI-VSHETVLENHPFVED 441 (483) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCccceeeEEeCCCCCCCHHHHHHHHHHHh--cc-CchHHHHHhCCCCCC Confidence 22233444555666664 3566555553 2222234578888889999999999999984 66 888888888864 43 Q ss_pred CCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchh Q lcl|NC_020866. 406 PKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGA 472 (547) Q Consensus 406 p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (547) |+..=+-+.................+....+....+..... + T Consensus 442 ~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~e~-------------------------e 483 (483) T protein:vir:12 442 LQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKES-------------------------E 483 (483) T ss_pred HHHHHHHHHHHHHHHHhhcccccccccCCcccCCCCCcccC-------------------------C Confidence 32111111111000000000000000000000000000000 0 No 122 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=99.17 E-value=2.9e-10 Score=72.91 Aligned_cols=333 Identities=11% Similarity=0.029 Sum_probs=180.7 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCC----HHHHHHHHHHHHhhhH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGD----PVRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~----~~~~~~L~~~m~~~D~ 76 (547) |.++-+. .+.....+.....++.+......+.-+-|-.-++..++-..++-...|+ +..+..|.+.. 
+..+ T Consensus 1 ~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~-~~~~ 75 (351) T protein:vir:79 1 MSKRRSR----APRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSF-RAST 75 (351) T ss_pred CCCCCCC----CCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHH-hhhH Confidence 7753222 1111111111111111111111111122322344444444555555565 22233343332 4689 Q ss_pred HHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeE Q lcl|NC_020866. 77 HYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLI 156 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~ 156 (547) |..++|..++.-+.+. +.|...-.. ..|..++. +.+.+|.+.+|++....| .+..|. T Consensus 76 ~h~~~l~~k~n~l~~~---~~Pnp~~t~--------------~~f~~~v~---d~ll~Gnay~~~~r~~~G---~~~~L~ 132 (351) T protein:vir:79 76 HHSSALFFKANVLAST---FRPHRWLSR--------------HAFERWAL---DFLTFGNGYLERRRNMVG---GTLRLE 132 (351) T ss_pred hhhhhhhhhhhHHhhc---ccCCCCCCH--------------HHHHHHHH---HHHhcCCeEEEEEECCCC---CEEEEE Confidence 9999998877777663 445433111 11444443 455789999999875433 366899 Q ss_pred eeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe Q lcl|NC_020866. 157 WRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL 236 (547) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~ 236 (547) ++++.+.+...+.+. 
++....++....+++...|+++.....+..||.+.+..+......-.....+-..|.+..+.|- T Consensus 133 ~l~~~~v~~~~~~~~-~~~~~~~g~~~~~~~~eIihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg 211 (351) T protein:vir:79 133 PALAKYVRRKADFSG-FVYVNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAG 211 (351) T ss_pred EeCCcceeeeecCCe-EEEEecCceEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Confidence 999998887665543 4444445556778888877666555567789999999999888887777777788888888886 Q ss_pred EEEeC-CCCCCHHHHHHHHHHHHHHhc---Ce-eEE-ecCC--ceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-c Q lcl|NC_020866. 237 RIGKW-QPGASEQDKDTLFRAVANIAG---DC-AAI-IPET--MSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-Q 307 (547) Q Consensus 237 ~vgk~-~~~~~~~~~~~l~~al~~~~~---~~-~~v-ip~~--~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-q 307 (547) -|-++ ++..++++++++.+++++... .. .+| +|.+ ..++++..+.. ..-..|.+.-++-..+|+.+.-- . T Consensus 212 ~il~~~~~~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~-~~d~ef~e~k~~s~~eI~~a~~VPp 290 (351) T protein:vir:79 212 FILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEV-AAKDEFFNIKNVTRDDLLAAHRVPP 290 (351) T ss_pred eEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCC-hhHHHHHHHHHHhHHHHHHHhCCCH Confidence 65554 556799999999999987532 11 233 3433 23566554432 23335888888888889888421 1 Q ss_pred hhcc--CccccccchhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCcc--ccCHHH Q lcl|NC_020866. 308 TTTT--DAVAGGHAVSQEHRLVQE-DIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPK--TEDLTK 377 (547) Q Consensus 308 tlts--~~~ggs~a~~~vh~~v~~-~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~--~ed~~~ 377 (547) .|.+ +..+++++-.+....++. .-+.--++.+++ +|..| ... .++|+..+ ..|.+. 
T Consensus 291 ~llGi~~~~t~~~~n~e~~~~~f~~~~l~Pl~~~ie~-ln~~l-----------g~~--~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 291 QLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWL-----------GDE--VVTFDDYEIPPAPVAA 351 (351) T ss_pred HHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhc-----------Ccc--eeeeChhhhccccccC Confidence 2222 222344444333333322 334444444432 33321 112 23443322 122111 No 123 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.17 E-value=5.2e-10 Score=71.49 Aligned_cols=432 Identities=9% Similarity=0.028 Sum_probs=203.9 Q ss_pred CCCCCe---eeCCCCCcccchhhhhh-hhhhh--hhhhHHHHhhhhc--cCCCHHHHHHHHHhhcCCCHHHHHHHHHHHH Q lcl|NC_020866. 1 MAQKPV---LLDRWGNPVQRAQLTEE-VAAPT--VTGVRSPLSGYPG--DGLNPLRLAQILRAADHGDPVRYLDLAEVIE 72 (547) Q Consensus 1 ma~~~~---~~~~~g~~~~~~~l~~~-~a~~~--~~~~~~~~~~~~~--~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~ 72 (547) |++.++ ++|..-.|.-....... .+... -....+...++.. ...++ ...++ ++...+..-.+++. T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~---~~~i~----~~~~~lr~RaRdL~ 73 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSA---DEEIY----ADLASLVQRAREQS 73 (505) T ss_pred CCCCccccchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCCh---HHHHH----HHHHHHHHHHHHHH Confidence 765433 44444333311111000 00000 0000011111110 01122 12111 23445555566777 Q ss_pred hhhHHHHHHHHHHHHHHhc-CCceEecCCC---CHHHHHHHHHHHHHHhC------------CCHHHHHHHHHH-hhhhc Q lcl|NC_020866. 73 ERDPHYLGVIGTRKRSVSQ-LDITVEPASD---DPQDVKIADMVRDWLKR------------EELQQEVFHILD-CISKG 135 (547) Q Consensus 73 ~~D~~v~s~l~~Rk~~v~~-~~w~v~p~~~---~~~~~~~ae~v~~~l~~------------~~~~~~i~~~l~-a~~~G 135 (547) ..+++++++++.....|.+ ..+.+++... ...+++.++.|+..++. .+|..+...++. 
.+.-| T Consensus 74 rNn~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dG 153 (505) T protein:vir:96 74 INNPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDG 153 (505) T ss_pred hcChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCC Confidence 7999999999999999998 4788876432 23345566666655432 248777777774 35567 Q ss_pred CeeeeeEEeecCCeeeEEeeEeeccceeEeccc------------------CCccee-e-ccCCCC-----------cee Q lcl|NC_020866. 136 YSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKS------------------DLATPL-L-LDDNGA-----------EVP 184 (547) Q Consensus 136 ~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~------------------~~~~~~-~-~~~~~~-----------g~~ 184 (547) =.++-+.|.. ++. .+-+|..++|.++..+.+ ++..-+ + ...++. ... T Consensus 154 E~f~~~~~~~-~~~-~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~r 231 (505) T protein:vir:96 154 EVLVREHRGY-PNK-WGYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYER 231 (505) T ss_pred ceEEEEeecC-CCC-cceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccccc Confidence 7766555532 333 444677777776644322 111111 1 111111 112 Q ss_pred eccCceEEEeec-CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCC-CHHHHHHHHHHHHHHhc Q lcl|NC_020866. 185 LPAFRFIFADIP-AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGA-SEQDKDTLFRAVANIAG 262 (547) Q Consensus 185 l~~~k~i~~~~~-~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~-~~~~~~~l~~al~~~~~ 262 (547) +|.. -|+|.+. .+.+..-|.+.|.++.....-.......-+.-.....+=..+.|.+.+. .....+.-......+.. 
T Consensus 232 vpa~-~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~l~p 310 (505) T protein:vir:96 232 VPAD-EIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDDQGEIVEEVEA 310 (505) T ss_pred cCHh-HhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCccccccCccccccCC Confidence 3333 3444443 4577778888888876554433332333333333322222333433221 11111111233456777 Q ss_pred CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc---hhccCccccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 263 DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ---TTTTDAVAGGHAVSQEHRLVQEDIETSDAMAL 339 (547) Q Consensus 263 ~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq---tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i 339 (547) +....++.|.+|++.++...++ .|..|.+..-++|+.. +|- .||.+-.+.||+++-.-..-+....+.....+ T Consensus 311 G~i~~L~pGe~i~~~~~~~p~~---~~~~f~~~~lr~iaag-lgi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~ 386 (505) T protein:vir:96 311 GTYQLLPYGIRFKEHKIDHPHT---NFGAFVKSSLRGVAAG-MGPAYNRLAHDLEGVNFSSLRSGELDERDLYKLLQFFV 386 (505) T ss_pred ceeeecCCCCeeeeeCCCCCCC---CHHHHHHHHHHHHHhh-cCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 8888899999999998775444 3888888888999988 564 36666444577665554444444444444444 Q ss_pred HHHHHHHHHHHHHHh----cCCC-----cccccEEEecCc--cccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCC Q lcl|NC_020866. 340 AAILNRDLIRPWIQL----EYGP-----QKRYPRIKIARP--KTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKP 408 (547) Q Consensus 340 ~~~ln~~li~~l~~~----Nfg~-----~~~~p~~~~~~~--~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~ 408 (547) ...+-+-+...+++. +.-+ ...+....|... 
.-.|..+.+++....++.|+ -|.+.+....|....+- T Consensus 387 ~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~-~t~~~~~a~~G~D~~~v 465 (505) T protein:vir:96 387 VTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRT-RSRSSIIRAAGDDPEDV 465 (505) T ss_pred HHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCC-CCHHHHHHHcCCCHHHH Confidence 444433333333332 2111 111223333332 33588888899999999998 44444444467754322 Q ss_pred cchhcccccccccccccccccccccCCcccc-ccccccccchhh Q lcl|NC_020866. 409 GAKLLRPLENPATQAAPPGMDPGAGDLKTAE-RKLKPLSGQADL 451 (547) Q Consensus 409 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 451 (547) .++........ ..-+..+......... +...+...+.+. T Consensus 466 ~~q~a~e~~~~----~~~Gl~~~~~~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 466 FDEIAWEEQLM----RDKGVNPTPPEQESKDATTDEEDDSASDD 505 (505) T ss_pred HHHHHHHHHHH----HHcCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 22211110000 0000000000000000 000000000000 No 124 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=99.17 E-value=2.2e-10 Score=73.49 Aligned_cols=339 Identities=11% Similarity=0.021 Sum_probs=182.2 Q ss_pred CC-----------CCCeeeCCCCCcccchhhh---------hhhh-hhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCC Q lcl|NC_020866. 
1 MA-----------QKPVLLDRWGNPVQRAQLT---------EEVA-APTVTGVRSPLSGYPGDGLNPLRLAQILRAADHG 59 (547) Q Consensus 1 ma-----------~~~~~~~~~g~~~~~~~l~---------~~~a-~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g 59 (547) |- +.+.|+..---||+++.-+ ...+ +........+.-+-|-.-++..++-..++-...| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~fg~p~~v~~~~~~~~~~~~~~~~ 80 (376) T protein:vir:10 1 MPARDRPRAARRRRHSFIFIHGVLRMSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNG 80 (376) T ss_pred CCCCccchhhhhhcccchhhcccccchhccCCCcccchhhhhHhhhccCcceeEEEEcCCceeccCcchhhhhhhhhhcC Confidence 10 1112222222222222111 1110 1000011011112233335554445555555556 Q ss_pred C----HHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhc Q lcl|NC_020866. 60 D----PVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKG 135 (547) Q Consensus 60 ~----~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G 135 (547) + +.....|.+. .+..+|..++|..++.-+.+. +.|+..-. ...|..++. +.+.+| T Consensus 81 ~~~~pp~~~~~La~~-~~~~~~h~s~l~~k~n~l~~~---~~Pnp~lT--------------~~~f~~~v~---d~ll~G 139 (376) T protein:vir:10 81 EWFEPPVSFAGLAKS-FRASTHHSSALFFKANVLAST---FRPHRWLS--------------RHAFERWAL---DFLTFG 139 (376) T ss_pred ceecCCCCHHHHHHH-HhhhHHhhhhHHHHhHHHHhc---cCCCCCCC--------------HHHHHHHHH---HHHhcC Confidence 4 2333344333 346899999999887776652 44543211 112444444 455689 Q ss_pred CeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHH Q lcl|NC_020866. 136 YSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGW 215 (547) Q Consensus 136 ~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~ 215 (547) .+.+|++....| .+..|.++++.+.+...+.. ++......+....+++...|.++.....+..||.+.+..+.... 
T Consensus 140 nay~~~~rn~~G---~~~~L~pl~~~~vr~~~d~~-~~~~~~~~~~~~~~~~~eViHir~~~~~~~~yGls~~~~a~~si 215 (376) T protein:vir:10 140 NGYLERRRNMVG---GTLRLEPALAKYVRRKADFN-GFVYVNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSA 215 (376) T ss_pred CeEEEEEECCCC---CEEEEEEeCCcceEEEeeCC-eEEEEEcCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHH Confidence 999999875434 46789999999888766554 33444445556778888887776666567789999999998888 Q ss_pred HHHHHHHHHHHHHHHHcCCCeEEEeC-CCCCCHHHHHHHHHHHHHHhcC---e-eEE-ecCC--ceEEEEecCCCCCchH Q lcl|NC_020866. 216 MFKAFTARDWAIFTQTYGQPLRIGKW-QPGASEQDKDTLFRAVANIAGD---C-AAI-IPET--MSIDFVESKSIGSSVD 287 (547) Q Consensus 216 ~~K~~~~~~w~~f~E~~G~P~~vgk~-~~~~~~~~~~~l~~al~~~~~~---~-~~v-ip~~--~~ie~~e~~~~~~~~~ 287 (547) ..-....++-..|.+.-+.|--|-++ ++..++++++++.+++++.... . .+| .|.| ..++++..+.. ..-. T Consensus 216 ~l~~aa~~f~~~~f~NGa~pggIl~~~d~~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~-~~d~ 294 (376) T protein:vir:10 216 WLNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEV-AAKD 294 (376) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEccCC-HHHH Confidence 87777777777888888888666654 4567999999999999875321 1 233 3332 23555544432 3334 Q ss_pred HHHHHHHHHHHHHHHHHhc-chhcc--CccccccchhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc Q lcl|NC_020866. 288 LYEKRINHLDQQISKAVLG-QTTTT--DAVAGGHAVSQEHRLVQ-EDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYP 363 (547) Q Consensus 288 ~f~~li~~~d~~Isk~ilG-qtlts--~~~ggs~a~~~vh~~v~-~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p 363 (547) .|.+..++-..+|+.+.-- ..|.+ +..+++++-.+....++ ..-+.--++.+++ +|+.|... +. T Consensus 295 qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~eq~~~~f~~~~L~Pl~~~iee-ln~~L~~~-----------~~ 362 (376) T protein:vir:10 295 EFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGEE-----------VV 362 (376) T ss_pred HHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcccc-----------cc Confidence 5888888888889887421 22332 22234455444333333 2344455555543 44333211 22 Q ss_pred EEEecCccccCHHH Q lcl|NC_020866. 
364 RIKIARPKTEDLTK 377 (547) Q Consensus 364 ~~~~~~~~~ed~~~ 377 (547) +|........|.+. T Consensus 363 ~F~~~~Llr~d~ka 376 (376) T protein:vir:10 363 RFDDYEIPPAPVAA 376 (376) T ss_pred ccChhHhhcccccC Confidence 22211111222221 No 125 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.13 E-value=6.3e-10 Score=71.03 Aligned_cols=450 Identities=13% Similarity=0.047 Sum_probs=214.7 Q ss_pred CCCCCeeeCCCCCccc-chhhhhhhhhhhhh-----hhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQ-RAQLTEEVAAPTVT-----GVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEER 74 (547) Q Consensus 1 ma~~~~~~~~~g~~~~-~~~l~~~~a~~~~~-----~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~ 74 (547) |- |+|..+.|+. ...+.+..+..... ...+....++.. .++ +..++ .+......-.+++... T Consensus 1 Mn----~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~-~s~---~~~i~----~~~~~lr~RaRdL~rN 68 (548) T protein:vir:95 1 MN----LIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQP-LGA---DTSLQ----KSAVSMREQCRKLDED 68 (548) T ss_pred Cc----hHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCC-CCh---HHHHH----HHHHHHHHHHHHHHhc Confidence 88 9999999985 22333333221110 011111222211 122 22222 2344556666777789 Q ss_pred hHHHHHHHHHHHHHHhcC-CceEecCC---CCHHHHHHHHHHHHHHhC----------CCHHHHHHHHHH-hhhhcCeee Q lcl|NC_020866. 75 DPHYLGVIGTRKRSVSQL-DITVEPAS---DDPQDVKIADMVRDWLKR----------EELQQEVFHILD-CISKGYSFT 139 (547) Q Consensus 75 D~~v~s~l~~Rk~~v~~~-~w~v~p~~---~~~~~~~~ae~v~~~l~~----------~~~~~~i~~~l~-a~~~G~sv~ 139 (547) +++++++++.....|.+. .+.|+|.. +...+++.++.|++.++. .+|..+...++. 
.+.-|=.++ T Consensus 69 n~~a~~av~~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~ 148 (548) T protein:vir:95 69 HDLVTGLLDRLEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLA 148 (548) T ss_pred ChHHHHHHHHHHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEE Confidence 999999999999999974 56676632 334455666666665532 358888887774 566898898 Q ss_pred eeEEeecC----CeeeEEeeEeeccceeEeccc---------------CCccee--eccCCC---------CceeeccCc Q lcl|NC_020866. 140 EIVWNTSE----GQYFPDKLIWRDPRHFRFQKS---------------DLATPL--LLDDNG---------AEVPLPAFR 189 (547) Q Consensus 140 Ei~w~~~~----g~~~~~~l~~~~~~~~~~~~~---------------~~~~~~--~~~~~~---------~g~~l~~~k 189 (547) -+.|.+.. +..++-+|..++|.++..+.+ ++..-+ ....++ .-..+|... T Consensus 149 ~~~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA~~ 228 (548) T protein:vir:95 149 QKLMGRVPNYTFATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEAER 228 (548) T ss_pred EeeecccccccCCcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccceeeechhH Confidence 88897643 344455677777766543222 111111 111111 113456555 Q ss_pred eEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHH-HH-HHHHHHHHHHhcCeeE- Q lcl|NC_020866. 190 FIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQ-DK-DTLFRAVANIAGDCAA- 266 (547) Q Consensus 190 ~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~-~~-~~l~~al~~~~~~~~~- 266 (547) +|++-...+.+..-|.++|.++.....--......-+.-.....+=..+.|.+...... +. 
..-......+..++.+ T Consensus 229 VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~pG~iv~ 308 (548) T protein:vir:95 229 IIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTVEPGKDRKNRTIPIAPGMVFD 308 (548) T ss_pred heecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCCCCcccccccccccCCcccc Confidence 44434445677888999998887654433332333333333333333333432111000 00 0000111224444443 Q ss_pred EecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc---hhccCccccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 267 IIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ---TTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAIL 343 (547) Q Consensus 267 vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq---tlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~l 343 (547) .++.|.+|++.+++..++ .|..|.+.+-+.|+.. +|- .||.+-. +||+++-.-..-+...++....++...| T Consensus 309 ~L~pGe~i~~~~p~~p~~---~~~~f~~~~lr~IAag-lGipYe~ltgD~s-~nYSS~R~~l~e~~r~~~~~q~~~i~~~ 383 (548) T protein:vir:95 309 DLEPGEDVGMIESNRPNP---FLEGFRNGQLRMIGAG-TRSTYSSVSRAYD-GTYSAQRQELVEGWLGYDLLQHEFIDYW 383 (548) T ss_pred ccCCCceeeecCCCCCCC---CHHHHHHHHHHHHHhh-cCCCHHHHhcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 478899999998765444 3888888888999988 464 4677753 5787665555444445555544444444 Q ss_pred HHHHHHHHHHhcC--C--------CcccccEEEecC--ccccCHHHHHHHHHHHHhCCCCCCHH---------------- Q lcl|NC_020866. 344 NRDLIRPWIQLEY--G--------PQKRYPRIKIAR--PKTEDLTKLASSLDTLVRLGMEIEEN---------------- 395 (547) Q Consensus 344 n~~li~~l~~~Nf--g--------~~~~~p~~~~~~--~~~ed~~~~a~~~~~l~~~G~~v~~~---------------- 395 (547) -+-+-.++++.-+ | ....+....|.. 
-...|..+.+++....++.|+ .|.+ T Consensus 384 ~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl-~T~~~~~a~~G~D~~ev~~q 462 (548) T protein:vir:95 384 CRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGF-ADEAEVARARGRDPRELKKS 462 (548) T ss_pred HHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCC-CCHHHHHHHhCCCHHHHHHH Confidence 4433333333221 1 111122333322 223577888888888888887 3322 Q ss_pred -----HHHHHhCCCCCCCcc-hhccccccccccc---ccccccccccCCccccc----cccccccchhhhhhcccchhhh Q lcl|NC_020866. 396 -----EVRSRFGFSQPKPGA-KLLRPLENPATQA---APPGMDPGAGDLKTAER----KLKPLSGQADLSRGDLSLNSEM 462 (547) Q Consensus 396 -----~~~e~~Glp~p~~~e-~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~ 462 (547) ...+++||+.+.+.. ........+..++ ...+..+.++......- +..+ -..++++.... + -- T Consensus 463 ~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~--~-~~ 538 (548) T protein:vir:95 463 RETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLP-VPGPDFPNESN--N-GG 538 (548) T ss_pred HHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhccccccccccchhHHhhccCCCCCc-CCCCCCCcccc--c-CC Confidence 223455555332111 1111111111110 00000000100000000 0000 00111111111 0 00 Q ss_pred cccccccchhH Q lcl|NC_020866. 463 PSTGQKTGGAE 473 (547) Q Consensus 463 ~~~~~~~~~~~ 473 (547) ++.+... .+. T Consensus 539 ~~~~~~~-~~~ 548 (548) T protein:vir:95 539 ADGQPSN-PDP 548 (548) T ss_pred CCCCCCC-CCC Confidence 1111111 111 No 126 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=99.12 E-value=1.6e-10 Score=74.23 Aligned_cols=328 Identities=12% Similarity=0.017 Sum_probs=178.1 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCH----HHHHHHHHHHHhhhH Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDP----VRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~----~~~~~L~~~m~~~D~ 76 (547) |.++-+- +|-. ......+ +......+.-+-|-.-++..++-.-+.-...|+. .....|.+.+ +..+ T Consensus 1 ~~~~~~~-----~~~~--~~~~~~~--~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~-~a~~ 70 (344) T protein:vir:56 1 MSKKKGK-----TPQP--AAKTMTA--SAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSL-RAAV 70 (344) T ss_pred CCCCCCC-----CCch--hhHHhhc--CCCceEEEEcCCceeecCcchhhhHHHhhhcCccccCCCCHHHHHHHH-hhhh Confidence 7753331 1110 0111011 1111111111223223333333333333444441 1122233322 3588 Q ss_pred HHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeE Q lcl|NC_020866. 77 HYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLI 156 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~ 156 (547) |..++|.-++.-+.+. +.|...-. ...|..++. +-+.+|.+.+|++....| .+..|. T Consensus 71 ~h~s~i~~k~n~l~~~---~~Pnp~~t--------------~~~f~~~~~---d~ll~Gnay~~~~rn~~G---~~~~L~ 127 (344) T protein:vir:56 71 HHSSPIYVKRNILAST---FIPHPWLS--------------QQDFSRFVL---DFLVFGNAFLEKRYSTTG---KVIRLE 127 (344) T ss_pred hhCccceehhhhHHhh---cCCCCCCC--------------HHHHHHHHH---HHHhcCCeEEEEEECCCC---cEEEEE Confidence 8888888777766652 34543211 111444443 455789999999875433 467899 Q ss_pred eeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe Q lcl|NC_020866. 157 WRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL 236 (547) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~ 236 (547) ++++.+.+...++. 
.++.....+....+++...|+++.....+..||.+.+..+......-.....+-..|.+.-+.|- T Consensus 128 pl~~~~v~~~~~~~-~~~~~~~~g~~~~~~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg 206 (344) T protein:vir:56 128 TSPAKYTRRGVEED-VYWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAG 206 (344) T ss_pred EeCCceeEEeecCC-EEEEEecCCeEEEEcCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Confidence 99999888866554 33444445566778888876666555456689999999999988888888888888888888887 Q ss_pred EEEeC-CCCCCHHHHHHHHHHHHHHhcC---eeEEe--cCC--ceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-c Q lcl|NC_020866. 237 RIGKW-QPGASEQDKDTLFRAVANIAGD---CAAII--PET--MSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-Q 307 (547) Q Consensus 237 ~vgk~-~~~~~~~~~~~l~~al~~~~~~---~~~vi--p~~--~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-q 307 (547) -|-++ ++..++++++++.+++++.... ..+++ |.| ..++++..+.. ..-..|.+.-++-..+|+.+.-- . T Consensus 207 ~Il~~~d~~ls~e~~~~lk~~~~~~~g~~~~r~l~l~~p~g~~~G~~~~pis~~-~~d~qf~e~k~~s~~eIa~afrVPp 285 (344) T protein:vir:56 207 YIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEV-ATKDDFFNIKKASAADLLDAHRIPF 285 (344) T ss_pred eEEEecCCCCCHHHHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCC-hHHHHHHHHHHhhHHHHHHHhCCCH Confidence 66654 4568999999999999875322 23444 332 23555544432 23334777777888888887421 1 Q ss_pred hhcc--CccccccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCH Q lcl|NC_020866. 
308 TTTT--DAVAGGHAVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDL 375 (547) Q Consensus 308 tlts--~~~ggs~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~ 375 (547) .|.+ +..+|+++-.+....+ ...-+.--++.+++ +|+.|...+++++ -...+.+|- T Consensus 286 ~llGi~~~~t~~~~n~eq~~~~f~~~tL~Pl~~~ie~-~n~~l~~~~~~F~-----------~y~l~~~~~ 344 (344) T protein:vir:56 286 QLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIRE-INGWIGQEVIRFK-----------NYSLDTDNG 344 (344) T ss_pred HHhccCCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHhhhccccccCC-----------CccccccCC Confidence 2322 2234455544333333 33344455555553 5554433332222 111112221 No 127 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.12 E-value=1.6e-09 Score=68.76 Aligned_cols=423 Identities=10% Similarity=0.038 Sum_probs=199.6 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhh--hhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVT--GVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~--~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |- +.|. |-.-..+......+...-. +..+.++..+.+ .+ +..++. +...+..-.+++...++++ T Consensus 1 m~----~~~~-~~~a~~~~~~~~~~~~~y~aa~~~~~~~~~~~~--s~---d~~~~~----~~~~lr~RaRdl~rNn~~a 66 (495) T protein:vir:10 1 MN----MTPS-GYQSLASGLLVPVGASAYEGASGGHRWQDIGDY--GP---DTAVAS----GIQTLRARSHHNVRNNPWA 66 (495) T ss_pred CC----cccc-cccccchhhhhHHHhhhhhccccCcccCCCCCC--Ch---hHHHHH----HHHHHHHHHHHHHhcChHH Confidence 55 3333 2211111111121111100 111122222222 22 122221 3344555566777799999 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHH----hC------CCHHHHHHHHHH-hhhhcCeeeeeEEeec- Q lcl|NC_020866. 
79 LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWL----KR------EELQQEVFHILD-CISKGYSFTEIVWNTS- 146 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l----~~------~~~~~~i~~~l~-a~~~G~sv~Ei~w~~~- 146 (547) +++++.....|.+..+...+..++ ++.++.|++.+ ++ .+|..+...++. .+.-|=.++-+.|... T Consensus 67 ~~av~~~~~~vVG~Gi~p~~~~~~---~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~ 143 (495) T protein:vir:10 67 TNAVATWVAAAVGNGLTPRWRMKE---QELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLS 143 (495) T ss_pred HHHHHHHHHhhcCCCcccccCCch---HHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccC Confidence 999999999999998877765443 33444454444 22 368888877774 4557877777777543 Q ss_pred CCeeeEEeeEeeccceeEeccc-------------------CCcce--eeccCCCC---------ceeeccCceEEEeec Q lcl|NC_020866. 147 EGQYFPDKLIWRDPRHFRFQKS-------------------DLATP--LLLDDNGA---------EVPLPAFRFIFADIP 196 (547) Q Consensus 147 ~g~~~~~~l~~~~~~~~~~~~~-------------------~~~~~--~~~~~~~~---------g~~l~~~k~i~~~~~ 196 (547) +|.-++-+|..++|.++..+.+ ++..- +.....+. .+.+|.. .|+|.+. T Consensus 144 ~g~~~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~-~vlH~f~ 222 (495) T protein:vir:10 144 EGLSVPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAE-HVLHVTV 222 (495) T ss_pred CCCccceEEEEechhhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechh-heEeccc Confidence 4444555777777776643221 11110 11112221 1335554 4567777 Q ss_pred CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCC--CCCC-----HHHHHHHHHHHHHHhcCeeEEec Q lcl|NC_020866. 197 AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQ--PGAS-----EQDKDTLFRAVANIAGDCAAIIP 269 (547) Q Consensus 197 ~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~--~~~~-----~~~~~~l~~al~~~~~~~~~vip 269 (547) .+.+..-|.++|..+...-.+..+ ...-+.-.....+-..+.|.+ .... 
.+..+.-......+..+....++ T Consensus 223 ~r~gQ~RGis~la~i~~l~~l~~y-~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~ 301 (495) T protein:vir:10 223 LTVRSDAGAPWFQLLLRLNELDQY-EDAELVRKKTAALFAAFIQEATADSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQ 301 (495) T ss_pred cCCCcccCcchhHHHHHHHHhhHH-HHHHHHHHHHhhhheeeeecCCCccccccccCccccccCcccceecCCceeeecC Confidence 788888888877654322111111 111111111112222222211 1100 00111111223456777888899 Q ss_pred CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc---hhccCccccccchhHHHHHHHHHHHHHHHH-HHHHHHHH Q lcl|NC_020866. 270 ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ---TTTTDAVAGGHAVSQEHRLVQEDIETSDAM-ALAAILNR 345 (547) Q Consensus 270 ~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq---tlts~~~ggs~a~~~vh~~v~~~~~~ad~~-~i~~~ln~ 345 (547) .|.+|++.++...+++ |..|.+..-+.|+.. +|- .||.|-.+.||+++-....-+....+.... ++...|.+ T Consensus 302 pGe~i~~~~p~~p~~~---~~~f~~~~lr~iaag-lGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~ 377 (495) T protein:vir:10 302 PGQEVKFSNPADVGTT---YEPWLRYQLLSIAKG-YGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCR 377 (495) T ss_pred CCCeeeeeCCCCCCCC---HHHHHHHHHHHHHhh-cCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999987755443 788888888999988 554 466675556787666555445555555443 34444444 Q ss_pred HHHHHHHHhcCCCc-----------ccccEEEec--CccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchh Q lcl|NC_020866. 346 DLIRPWIQLEYGPQ-----------KRYPRIKIA--RPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKL 412 (547) Q Consensus 346 ~li~~l~~~Nfg~~-----------~~~p~~~~~--~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~ 412 (547) -+..++++.-+..+ ..+....|. .-...|..+.+++....++.|+ .|.+.+....|....+-.++. 
T Consensus 378 pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~-~s~~~~~a~~G~D~~~v~~q~ 456 (495) T protein:vir:10 378 PVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGF-APISDKQAERGYDMEELFDMI 456 (495) T ss_pred HHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCC-CCHHHHHHHcCCCHHHHHHHH Confidence 44444444221110 011122232 2234688888889999999998 444444444566432211111 Q ss_pred cccccccc----cccccccccccccCCccccccccccccch Q lcl|NC_020866. 413 LRPLENPA----TQAAPPGMDPGAGDLKTAERKLKPLSGQA 449 (547) Q Consensus 413 ~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (547) ........ .-...+....+.+..+.... ....... T Consensus 457 a~e~~~~~~~Gl~~~~~p~~~~~~~~~~~~~~--~~~~~~e 495 (495) T protein:vir:10 457 SDANQLIDEYDLRLDSDPRYVNGSGAEQKSVM--EAALNNE 495 (495) T ss_pred HHHHHHHHHcCCCCCCCCCcCCCccCCCCCCC--CCCCCCC Confidence 10000000 00000000000000000000 0000000 No 128 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.11 E-value=1.1e-09 Score=69.66 Aligned_cols=377 Identities=11% Similarity=-0.040 Sum_probs=172.9 Q ss_pred ccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHH----------------------hhhHHHHHHHHHHHHHHhcCCceEec Q lcl|NC_020866. 41 GDGLNPLRLAQILRAADHGDPVRYLDLAEVIE----------------------ERDPHYLGVIGTRKRSVSQLDITVEP 98 (547) Q Consensus 41 ~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~----------------------~~D~~v~s~l~~Rk~~v~~~~w~v~p 98 (547) ...+||.++..-|...-.....++..|.+-+. -.-....-+++.....+.+.++.+.. 
T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 11222222211111111111111111111000 01233444555666666677777754 Q ss_pred CCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeE--ecccCCcce-- Q lcl|NC_020866. 99 ASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATP-- 173 (547) Q Consensus 99 ~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~-- 173 (547) .++...+ +.+.++++.-+++....+++ ++..||.+ ++++|...+|... +..++|+... ++...+..+ T Consensus 81 ~~d~~~~----~~~~~i~~~N~~d~~~~~~~~~a~i~G~a-y~~v~~d~~g~~~---i~~~~p~~~~~i~d~~~~~~~~~ 152 (456) T protein:vir:10 81 SADSDLA----LRARRIWRDNRMDSVCKQWVKYGLDFGES-YLTCWRRDDGTAT---ITADSPETMVVSVDPLQPWRIRA 152 (456) T ss_pred CCCcchH----HHHHHHHHhcChhhHHHHHHHHHhhcCee-EEEEeeCCCCceE---EEEEccceeEEEEcCCCCcceEE Confidence 3333322 23445555556788887775 78889997 5788976666543 3334444321 111110000 Q ss_pred --eec-----------------------------cCCCCceeeccC------------ceEEEeecCCcCccccchhHHH Q lcl|NC_020866. 174 --LLL-----------------------------DDNGAEVPLPAF------------RFIFADIPAKSGILLRSGLARV 210 (547) Q Consensus 174 --~~~-----------------------------~~~~~g~~l~~~------------k~i~~~~~~~~~~p~G~gll~~ 210 (547) +.. ............ .++.. ....|+.|.|.+.. T Consensus 153 ~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pv---v~~~N~~g~gd~e~ 229 (456) T protein:vir:10 153 AMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPV---VVYQNPDGMGEVEP 229 (456) T ss_pred EEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeE---EEecCCCCCchhhh Confidence 000 000000000001 11111 12457889999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCeEEEeC--CCCC-CHHHHHHHHHHHH--HHhcCeeEEecCCceEEEEecCCCCCc Q lcl|NC_020866. 
211 AAWGWMFKAFTARDWAIFTQTYGQPLRIGKW--QPGA-SEQDKDTLFRAVA--NIAGDCAAIIPETMSIDFVESKSIGSS 285 (547) Q Consensus 211 ~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~--~~~~-~~~~~~~l~~al~--~~~~~~~~vip~~~~ie~~e~~~~~~~ 285 (547) +....---...+.+.+...+-|..|.++.+- +... .++.-..+ +... ....+....+|.+.++..+. ..+ T Consensus 230 vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~-~~~~~~~~~~~~~~~~~~~~~~~q~~----~~~ 304 (456) T protein:vir:10 230 HIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAI-DYASIFEAAPGALWELPPGVDIWESQ----AND 304 (456) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccccc-chhhhhhhhccccccCCCCcceEEec----ccC Confidence 7655444444566667777888888766542 1110 11111111 1111 12223344567777764432 233 Q ss_pred hHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_020866. 286 VDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQE---HRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRY 362 (547) Q Consensus 286 ~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~v---h~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~ 362 (547) ...|...++.+-.+|+.. .+-+...-+..++..+|+. ...-....+..-.+.+...+. ++++.++.+.-.+.... T Consensus 305 ~~~~~~~l~~~i~~~~~~-s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~-~~~rl~~~~~g~~~~~~ 382 (456) T protein:vir:10 305 FTPMLSAIKEHIRQLSSA-TKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLE-AILVKALQIEGESVEDT 382 (456) T ss_pred hhHHHHHHHHHHHHHHhc-cCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCcccc Confidence 345777777777776654 1211111111112222322 222233444455566667774 46676666653233334 Q ss_pred cEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCc-chhcccccccccccccccccccccCCccccc Q lcl|NC_020866. 363 PRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPG-AKLLRPLENPATQAAPPGMDPGAGDLKTAER 440 (547) Q Consensus 363 p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~-e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (547) .++.|....+.++.+.++++.+|+..|+ ++.+-+++.+|+..++-. ++.-..........+ .+...+.+.++. 
T Consensus 383 ~~v~w~~~~~~~~~~~ada~~kl~~~gi-~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~----~~~~~~~~~~~~ 456 (456) T protein:vir:10 383 VDVSFESPDRVTLGEKYSAASLAKAAGE-SWASIRRNILNYNADQIKQDDLDRAREQITLFAG----NPVQRPQEDGSR 456 (456) T ss_pred eeEEecCCCCcCHHHHHHHHHHHHHcCC-ChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhh----hhhhcCCCCCCC Confidence 5788888888999999999999999998 777777888898643211 111111000000000 000000000000 No 129 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.11 E-value=1.1e-09 Score=69.66 Aligned_cols=377 Identities=11% Similarity=-0.040 Sum_probs=172.9 Q ss_pred ccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHH----------------------hhhHHHHHHHHHHHHHHhcCCceEec Q lcl|NC_020866. 41 GDGLNPLRLAQILRAADHGDPVRYLDLAEVIE----------------------ERDPHYLGVIGTRKRSVSQLDITVEP 98 (547) Q Consensus 41 ~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~----------------------~~D~~v~s~l~~Rk~~v~~~~w~v~p 98 (547) ...+||.++..-|...-.....++..|.+-+. -.-....-+++.....+.+.++.+.. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 11222222211111111111111111111000 01233444555666666677777754 Q ss_pred CCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeE--ecccCCcce-- Q lcl|NC_020866. 99 ASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATP-- 173 (547) Q Consensus 99 ~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~-- 173 (547) .++...+ +.+.++++.-+++....+++ ++..||.+ ++++|...+|... +..++|+... 
++...+..+ T Consensus 81 ~~d~~~~----~~~~~i~~~N~~d~~~~~~~~~a~i~G~a-y~~v~~d~~g~~~---i~~~~p~~~~~i~d~~~~~~~~~ 152 (456) T protein:vir:10 81 SADSDLA----LRARRIWRDNRMDSVCKQWVKYGLDFGES-YLTCWRRDDGTAT---ITADSPETMVVSVDPLQPWRIRA 152 (456) T ss_pred CCCcchH----HHHHHHHHhcChhhHHHHHHHHHhhcCee-EEEEeeCCCCceE---EEEEccceeEEEEcCCCCcceEE Confidence 3333322 23445555556788887775 78889997 5788976666543 3334444321 111110000 Q ss_pred --eec-----------------------------cCCCCceeeccC------------ceEEEeecCCcCccccchhHHH Q lcl|NC_020866. 174 --LLL-----------------------------DDNGAEVPLPAF------------RFIFADIPAKSGILLRSGLARV 210 (547) Q Consensus 174 --~~~-----------------------------~~~~~g~~l~~~------------k~i~~~~~~~~~~p~G~gll~~ 210 (547) +.. ............ .++.. ....|+.|.|.+.. T Consensus 153 ~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pv---v~~~N~~g~gd~e~ 229 (456) T protein:vir:10 153 AMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPV---VVYQNPDGMGEVEP 229 (456) T ss_pred EEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeE---EEecCCCCCchhhh Confidence 000 000000000001 11111 12457889999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCeEEEeC--CCCC-CHHHHHHHHHHHH--HHhcCeeEEecCCceEEEEecCCCCCc Q lcl|NC_020866. 211 AAWGWMFKAFTARDWAIFTQTYGQPLRIGKW--QPGA-SEQDKDTLFRAVA--NIAGDCAAIIPETMSIDFVESKSIGSS 285 (547) Q Consensus 211 ~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~--~~~~-~~~~~~~l~~al~--~~~~~~~~vip~~~~ie~~e~~~~~~~ 285 (547) +....---...+.+.+...+-|..|.++.+- +... .++.-..+ +... ....+....+|.+.++..+. ..+ T Consensus 230 vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~-~~~~~~~~~~~~~~~~~~~~~~~q~~----~~~ 304 (456) T protein:vir:10 230 HIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAI-DYASIFEAAPGALWELPPGVDIWESQ----AND 304 (456) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccccc-chhhhhhhhccccccCCCCcceEEec----ccC Confidence 7655444444566667777888888766542 1110 11111111 1111 12223344567777764432 233 Q ss_pred hHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_020866. 
286 VDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQE---HRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRY 362 (547) Q Consensus 286 ~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~v---h~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~ 362 (547) ...|...++.+-.+|+.. .+-+...-+..++..+|+. ...-....+..-.+.+...+. ++++.++.+.-.+.... T Consensus 305 ~~~~~~~l~~~i~~~~~~-s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~-~~~rl~~~~~g~~~~~~ 382 (456) T protein:vir:10 305 FTPMLSAIKEHIRQLSSA-TKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLE-AILVKALQIEGESVEDT 382 (456) T ss_pred hhHHHHHHHHHHHHHHhc-cCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCcccc Confidence 345777777777776654 1211111111112222322 222233444455566667774 46676666653233334 Q ss_pred cEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCc-chhcccccccccccccccccccccCCccccc Q lcl|NC_020866. 363 PRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPG-AKLLRPLENPATQAAPPGMDPGAGDLKTAER 440 (547) Q Consensus 363 p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~-e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (547) .++.|....+.++.+.++++.+|+..|+ ++.+-+++.+|+..++-. ++.-..........+ .+...+.+.++. T Consensus 383 ~~v~w~~~~~~~~~~~ada~~kl~~~gi-~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~----~~~~~~~~~~~~ 456 (456) T protein:vir:10 383 VDVSFESPDRVTLGEKYSAASLAKAAGE-SWASIRRNILNYNADQIKQDDLDRAREQITLFAG----NPVQRPQEDGSR 456 (456) T ss_pred eeEEecCCCCcCHHHHHHHHHHHHHcCC-ChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhh----hhhhcCCCCCCC Confidence 5788888888999999999999999998 777777888898643211 111111000000000 000000000000 No 130 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=99.06 E-value=1.5e-09 Score=68.90 Aligned_cols=335 Identities=12% Similarity=0.034 Sum_probs=178.8 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCC----HHHHHHHHHHHHhhhH Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGD----PVRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~----~~~~~~L~~~m~~~D~ 76 (547) |.++-+. .+.....+.....++.+......+.-+-|-.-++..++-..++-...|+ +..+..|.+.. +..+ T Consensus 1 ~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~-~~~~ 75 (351) T protein:vir:78 1 MSKRRSR----APRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSF-RAST 75 (351) T ss_pred CCCCCCC----CCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecCCCCHHHHHHHH-hhhH Confidence 7753222 1111111111111111111111111122322344444444555555565 12223333332 3689 Q ss_pred HHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeE Q lcl|NC_020866. 77 HYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLI 156 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~ 156 (547) |..++|..++.-+.+. +.|...-. ...|..++ ++.+.+|-+.+|++-+..| .+..|. T Consensus 76 ~h~~~l~~k~n~l~~~---~~Pn~~~t--------------~~~f~~~~---~d~ll~Gnay~~~~rn~~G---~~~~L~ 132 (351) T protein:vir:78 76 HHSSALFFKANVLAST---FRPHRWLS--------------RHAFERWA---LDFLTFGNGYLERRRNMVG---GTLRLE 132 (351) T ss_pred hhhhhhhhhhhHHhhc---ccCCCCCC--------------HHHHHHHH---HHHHhcCCeEEEEEECCCC---CEEEEE Confidence 9999998877777653 44543211 11133444 3556789999999864333 467899 Q ss_pred eeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe Q lcl|NC_020866. 157 WRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL 236 (547) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~ 236 (547) ++++.+.+...+... 
+......+....+++...|.++........||.+.+..+......-.....+-..|....+.|- T Consensus 133 pl~~~~v~~~~~~~~-~~~~~~~~~~~~~~~~eVihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg 211 (351) T protein:vir:78 133 PALAKYVRRKADFSG-FVYVNGWQERHEFAPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAG 211 (351) T ss_pred EecCcceEEeeeCCe-EEEEecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Confidence 999998887665543 3334445556778888877666555567789999999999888887777777788888888886 Q ss_pred EEEeC-CCCCCHHHHHHHHHHHHHHhc---Ce-eEE-ecCC--ceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-c Q lcl|NC_020866. 237 RIGKW-QPGASEQDKDTLFRAVANIAG---DC-AAI-IPET--MSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-Q 307 (547) Q Consensus 237 ~vgk~-~~~~~~~~~~~l~~al~~~~~---~~-~~v-ip~~--~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-q 307 (547) -|-++ ++..++++++++.+++++... .. .+| .|.| ..++++..+.. ..-..|.+.-++-..+|+.+.-- . T Consensus 212 gIl~~~~~~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~-~~d~qf~e~k~~~~~eIa~a~~VPp 290 (351) T protein:vir:78 212 FILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEV-AAKDEFFNIKNVTRDDLLAAHRVPP 290 (351) T ss_pred eEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeeecCCCCccceeEEEcCCC-hhHHHHHHHHHHhHHHHHHHhCCCH Confidence 65554 456799999999999887532 12 223 3433 34566654432 23335778778888889887421 2 Q ss_pred hhcc--CccccccchhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHH Q lcl|NC_020866. 308 TTTT--DAVAGGHAVSQEHRLVQ-EDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLA 379 (547) Q Consensus 308 tlts--~~~ggs~a~~~vh~~v~-~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a 379 (547) .|.+ +..+++++-.+....++ ..-+.--++.+++ +|+.+ ... 
.|+|+..+-......| T Consensus 291 ~llGi~~~~t~~~sn~e~~~~~f~~~~l~P~~~~iee-~n~~l-----------~~~--~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 291 QLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWL-----------GDE--VVRFDDYEIPPAPVAA 351 (351) T ss_pred HHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhc-----------Ccc--ceecChhhhccccccC Confidence 2322 22234444333333332 2344444555543 33222 111 2444432222111111 No 131 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=99.05 E-value=1.6e-09 Score=68.81 Aligned_cols=324 Identities=14% Similarity=0.057 Sum_probs=171.0 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCC----HHHHHHHHHHHHhhhH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGD----PVRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~----~~~~~~L~~~m~~~D~ 76 (547) |-++-+ + + ...+..+++ +....+.-+-|-.-++-.++-..+.-...|+ +..+..|.+.+ +..+ T Consensus 1 m~~~~~------~--~--~~~~~~~~~--~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~-~a~~ 67 (340) T protein:vir:98 1 MSKRKP------R--K--AVAMTASAP--QKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSL-RSAV 67 (340) T ss_pred CCCCCC------C--c--cccccccCc--cceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHH-Hhcc Confidence 653211 1 1 111111111 1111111122322233222233333344443 11222233222 3588 Q ss_pred HHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeE Q lcl|NC_020866. 77 HYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLI 156 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~ 156 (547) |..++|.-++.-+.+. +.|...-. ...|..++. +-+.+|.+.+|++++..| .+..|. 
T Consensus 68 ~h~s~i~~k~n~l~~~---~~Pn~~lt--------------~~~f~~~~~---d~ll~Gnay~~~~rn~~G---~~~~L~ 124 (340) T protein:vir:98 68 HHSSPIYVKRNVLAST---YIPHPLLS--------------RQDFSRFAL---DYLVFGNAFLEQRHSVTG---QLIKLL 124 (340) T ss_pred ccchhhhhhhhHHhhc---cCCCCCCC--------------HHHHHHHHH---HHHhcCCeEEEEEECCCC---cEEEEE Confidence 9999998877776653 35543211 111344443 445689999999875443 356899 Q ss_pred eeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe Q lcl|NC_020866. 157 WRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL 236 (547) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~ 236 (547) ++++.+++...+++. .......+....+++...++++........||.+.+..+.-....-.....+-..|.+.-+.|- T Consensus 125 pl~~~~vr~~~~~~~-~~~~~~~~~~~~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg 203 (340) T protein:vir:98 125 TSPAKYTRRGVDDSV-FWFVENFTQPHEFAPDTVFHLLEPDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAG 203 (340) T ss_pred EeCCceEEEcccCcE-EEEEecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Confidence 999988887665543 3333445555678888877666544456789999999988888887777777788888888886 Q ss_pred EEEeC-CCCCCHHHHHHHHHHHHHHhcC----eeEEe-cCC--ceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-c Q lcl|NC_020866. 237 RIGKW-QPGASEQDKDTLFRAVANIAGD----CAAII-PET--MSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-Q 307 (547) Q Consensus 237 ~vgk~-~~~~~~~~~~~l~~al~~~~~~----~~~vi-p~~--~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-q 307 (547) -|-.+ ++..++++++++.+++++.... ..+|+ |.| ..++++..+.. ..-..|.+.-++-..+|+.+.-- . 
T Consensus 204 ~il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~-~~d~qf~e~k~~~~~eIa~a~~VPp 282 (340) T protein:vir:98 204 YIMYVTDPAQSATDVESLRDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEV-ATKDDFFNIKKASAADLMDAHRVPF 282 (340) T ss_pred eEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCC-hhHHHHHHHHHhhHHHHHHHhCCCH Confidence 55544 4568999999999999875321 12333 332 23555544332 23335777777788888887321 2 Q ss_pred hhcc--CccccccchhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccC Q lcl|NC_020866. 308 TTTT--DAVAGGHAVSQEHRLVQ-EDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTED 374 (547) Q Consensus 308 tlts--~~~ggs~a~~~vh~~v~-~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed 374 (547) .|.+ +..+|+++-.+....++ ..-+.--++.+++ +|+.|.. ...+|.-......| T Consensus 283 ~llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~~iee-~n~~L~~-----------e~~rF~~~~l~~~d 340 (340) T protein:vir:98 283 QLMGGKPENIGSLGDVEKVAKVFVRNELSPLQDRFRE-VNDWLGM-----------EVIRFKEYTLDNPE 340 (340) T ss_pred HHhcccCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHhcccc-----------cccccCccccccCC Confidence 2322 22234454333333322 2334444444443 4432211 12233222222222 No 132 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=99.04 E-value=3.4e-09 Score=67.00 Aligned_cols=325 Identities=12% Similarity=0.020 Sum_probs=176.1 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhc---cCCCHHHHHHHHHhhcCCCH----HHHHHHHHHHHh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPG---DGLNPLRLAQILRAADHGDP----VRYLDLAEVIEE 73 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~---~~l~p~~~~~ilr~a~~g~~----~~~~~L~~~m~~ 73 (547) |.++-+ ++..-..- .++.+......+.-+-|- .+-++-++..+. ..|+. .....|.+. 
.+ T Consensus 1 m~~~~~------~~~~~~~~---~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~---~~~~~~~pp~~~~~la~~-~~ 67 (344) T protein:vir:60 1 MSKKKG------KTLQPAAK---KMTASAPKMEAFTFGEPVPVLDRRDILDYVECI---SNGRWYEPPISFTGLAKS-LR 67 (344) T ss_pred CCcccC------CCCCchHH---hhcCCcCcEEEEEcCCceeecCCcchhHHHHhh---hcCccccCCCCHHHHHHH-HH Confidence 765322 11111000 001011011011111221 122343333333 33431 112223322 23 Q ss_pred hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEE Q lcl|NC_020866. 74 RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPD 153 (547) Q Consensus 74 ~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~ 153 (547) .-+|..++|.-++.-+.+. +.|+..-.. ..|..++. +-+.+|.+.+|++....| .+. T Consensus 68 a~~~h~~~i~~k~n~l~~~---~~Pn~~~t~--------------~~f~~~~~---d~ll~Gnay~~i~rn~~G---~~~ 124 (344) T protein:vir:60 68 AAVHHSSPIYVKRNILAST---FIPHPWLSQ--------------QDFSRFVL---DFLVFGNAFLEKRYSTTG---KVI 124 (344) T ss_pred hhhhhccchhhhhhHHHhh---ccCCCCCCH--------------HHHHHHHH---HHHhcCCeEEEEEECCCC---cEE Confidence 5788888888777766652 455432111 11444444 445689999999875433 467 Q ss_pred eeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_020866. 154 KLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYG 233 (547) Q Consensus 154 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G 233 (547) .|.++|+.+.++..+++. 
++.....+....+++...|+.+.....+..||.+.+..+.-....-.....+-..|.+.-+ T Consensus 125 ~L~~l~~~~vr~~~~~~~-~~~v~~~~~~~~~~~~eIiHir~~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~ 203 (344) T protein:vir:60 125 RLETSPAKYTRRGVEEDV-YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGA 203 (344) T ss_pred EEEEcCcceEEEeecCCe-EEEEccCCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 899999998887766543 3334445556788888877666554456779999999998888887777788888888888 Q ss_pred CCeEEEeC-CCCCCHHHHHHHHHHHHHHhcC---eeEEe--cCCc--eEEEEecCCCCCchHHHHHHHHHHHHHHHHHHh Q lcl|NC_020866. 234 QPLRIGKW-QPGASEQDKDTLFRAVANIAGD---CAAII--PETM--SIDFVESKSIGSSVDLYEKRINHLDQQISKAVL 305 (547) Q Consensus 234 ~P~~vgk~-~~~~~~~~~~~l~~al~~~~~~---~~~vi--p~~~--~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~il 305 (547) .|--|-++ ++..++++++++.+++++.... ..+++ |.|. .++++..+.. ..-..|.+.-++-..+|+.+.- T Consensus 204 ~pg~il~~~~~~ls~e~~~~ik~~~~~~~g~~~~r~~~l~~p~g~~~g~~~~pis~~-~~d~qf~e~k~~~~~eIa~af~ 282 (344) T protein:vir:60 204 HAGYIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEV-ATKDDFFNIKKASAADLLDAHR 282 (344) T ss_pred CCceEEEecCcCCCHHHHHHHHHHHHHhcCCCCCcceEEecCCCCccceeEEEcCCC-hhHHHHHHHHHhhHHHHHHHhC Confidence 88666654 5678999999999999875322 23344 3322 3555544332 2233477877888888988842 Q ss_pred c-chhcc--CccccccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCH Q lcl|NC_020866. 
306 G-QTTTT--DAVAGGHAVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDL 375 (547) Q Consensus 306 G-qtlts--~~~ggs~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~ 375 (547) - ..|.+ +..+|+++-.+-...+ ...-+.-.++.+++ ||+ || .....+|.....+..|- T Consensus 283 VPp~llGi~~~~t~~~~n~e~~~~~f~~~~L~Pl~~~~e~-ln~----~l-------g~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 283 IPFQLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIRE-ING----WL-------GQEVIRFKNYSLDTDNG 344 (344) T ss_pred CCHHHhcccCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHH----hc-------CCcccccCccccCCCCC Confidence 1 22322 2234455544333332 23334444444442 333 22 12223455444444443 No 133 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=99.04 E-value=1.1e-09 Score=69.79 Aligned_cols=328 Identities=13% Similarity=0.031 Sum_probs=178.1 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCH----HHHHHHHHHHHhhhH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDP----VRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~----~~~~~L~~~m~~~D~ 76 (547) |.++-.- +|-... ..++.+......+.-+-|-.-++..++-..+.-...|+. .....|.+.+ +..+ T Consensus 1 ~~~~~~~-----~~~~~~----~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~-~a~~ 70 (344) T protein:vir:20 1 MSKKKGK-----TPQPAA----KTMTASGPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSL-RAAV 70 (344) T ss_pred CCcccCC-----CCcchh----hhhhccCCceEEEEcCCceEecCcchhhhhhhhhhcCceecCCCCHHHHHHHH-hhhh Confidence 7753331 111111 010111111111111233333444433444444445541 1223333322 4588 Q ss_pred HHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeE Q lcl|NC_020866. 
77 HYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLI 156 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~ 156 (547) |..++|.-++.-+.+. +.|+..-. ...|..++. +-+.+|.+.+|++.+..| .+..|. T Consensus 71 ~h~~~i~~k~n~l~~~---~~Pn~~lt--------------~~~f~~~~~---d~ll~Gnay~~i~rn~~G---~~~~L~ 127 (344) T protein:vir:20 71 HHSSPIYVKRNILAST---FIPHPWLS--------------QQDFSRFVL---DFLVFGNAFLEKRYSTTG---KVIRLE 127 (344) T ss_pred hhCccceehhhhHHHh---ccCCCCCC--------------HHHHHHHHH---HHHhcCCeEEEEEECCCC---cEEEEE Confidence 8888888777666552 34543211 111444443 445689999999864333 477899 Q ss_pred eeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe Q lcl|NC_020866. 157 WRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL 236 (547) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~ 236 (547) ++++.+.++..+++. ++.....+....+++...|+.+.....+..||.+.+..+.-....-.....+-..|.+.-+.|- T Consensus 128 pl~~~~vr~~~~~~~-~~~~~~~~~~~~~~~~eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~ 206 (344) T protein:vir:20 128 TSPAKYTRRGVEEDV-YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAG 206 (344) T ss_pred EcCCceeEeeecCCE-EEEEccCCeEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Confidence 999998888765543 3334445556788888877666555456789999999988888887777888888888888886 Q ss_pred EEEeC-CCCCCHHHHHHHHHHHHHHhcC---eeEEe--cCCc--eEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-c Q lcl|NC_020866. 237 RIGKW-QPGASEQDKDTLFRAVANIAGD---CAAII--PETM--SIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-Q 307 (547) Q Consensus 237 ~vgk~-~~~~~~~~~~~l~~al~~~~~~---~~~vi--p~~~--~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-q 307 (547) -|-++ ++..++++++++.+.+++.... ..+++ |.|. .++++..+.. ..-..|.+.-++-..+|+.+.-- . 
T Consensus 207 ~Il~~~d~~l~~e~~~~ik~~~~~~~g~~n~r~l~l~~p~g~~~gi~~~pis~~-~~d~qf~e~k~~s~~eIa~af~VPp 285 (344) T protein:vir:20 207 YIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEV-ATKDDFFNIKKASAADLLDAHRIPF 285 (344) T ss_pred eEEEecCcCCCHHHHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCC-hhHHHHHHHHHhhHHHHHHHhCCCH Confidence 66654 5678999999999999875322 12343 3322 3555544432 22334778788888889888421 2 Q ss_pred hhcc--CccccccchhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCH Q lcl|NC_020866. 308 TTTT--DAVAGGHAVSQEHRLVQ-EDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDL 375 (547) Q Consensus 308 tlts--~~~ggs~a~~~vh~~v~-~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~ 375 (547) .|.+ +..+|+++-.+....++ ..-+.--++.+++ +|+.| | ...-+|.+...+..|- T Consensus 286 ~llGi~~~~t~~~~n~e~~~~~f~~~~l~P~~~~~e~-in~~l---------g--~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 286 QLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIRE-INGWL---------G--QEVIRFKNYSLDTDND 344 (344) T ss_pred HHhccCCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHHhc---------C--CcccccCccccccCCC Confidence 2332 22344555444333332 2334444444442 33322 1 1112333333322221 No 134 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=99.03 E-value=4.8e-09 Score=66.19 Aligned_cols=329 Identities=13% Similarity=0.001 Sum_probs=176.3 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHH-HhhhhccCC---CHHHHHHHHHhhcCCC---HHHHHHHHHHHHh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSP-LSGYPGDGL---NPLRLAQILRAADHGD---PVRYLDLAEVIEE 73 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~-~~~~~~~~l---~p~~~~~ilr~a~~g~---~~~~~~L~~~m~~ 73 (547) |-+ +. ....++.......-+ |..-|-.-+ +.-++..+...-.... 
+.....|.+ +.+ T Consensus 1 ~~~--------------~~-~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~epp~~~~~La~-l~~ 64 (348) T protein:vir:26 1 MTE--------------QL-IHSHTTDGTESKSVYSFDPNPEPVDTNSWMTRYCELFYNDFDDYWEPPISLKGLAE-IAN 64 (348) T ss_pred CCc--------------cc-cchhhccccCCceEEEecCCCeeecCcchHHHHHHHHhcCCCccccCCCCHHHHHH-HHh Confidence 321 00 000000000000000 110122222 2233333333211111 011222322 224 Q ss_pred hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEE Q lcl|NC_020866. 74 RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPD 153 (547) Q Consensus 74 ~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~ 153 (547) ..+|..++|..++.-+.+. +.|...-. ...|..++ ++.+.+|.+.+|++....| .+. T Consensus 65 ~n~~h~~~i~~k~N~l~~~---~~Pn~~~t--------------~~~f~~~~---~d~ll~Gnay~~~~rn~~G---~~~ 121 (348) T protein:vir:26 65 ANGYHGSLLKARANYVAGR---FMNGGGLP--------------MYKMNSAC---WDYFGLGMSAFVKIRSYLK---NVI 121 (348) T ss_pred hhhhhhhhHhhhhhHHhhc---ccCCCCCC--------------HHHHHHHH---HHHHhcCCeEEEEEEcCCC---cEE Confidence 6899999999888877663 45544211 11133344 3555789999999865433 466 Q ss_pred eeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_020866. 154 KLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYG 233 (547) Q Consensus 154 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G 233 (547) .|.++|+.+.+...++. 
++.....+....+++...+.++.....+..||.+.+..+......-.....+-..|.+.-+ T Consensus 122 ~L~~l~~~~v~~~~d~~--~~~~~~~g~~~~f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa 199 (348) T protein:vir:26 122 ALEPLPMVHMRKRKNGD--FVQLLRNNEQKVFKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGA 199 (348) T ss_pred EEEEecCceeEeeecCc--EEEEEecCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 89999998888776543 3333334556788888887666555456789999999988888877777777778887778 Q ss_pred CCeEEEe-CCCCCCHHHHHHHHHHHHHHhcCe----eEEe-cCCc--eEEEEecCCCCCchHHHHHHHHHHHHHHHHHHh Q lcl|NC_020866. 234 QPLRIGK-WQPGASEQDKDTLFRAVANIAGDC----AAII-PETM--SIDFVESKSIGSSVDLYEKRINHLDQQISKAVL 305 (547) Q Consensus 234 ~P~~vgk-~~~~~~~~~~~~l~~al~~~~~~~----~~vi-p~~~--~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~il 305 (547) .|--|-+ .++..++++++++.+++++..... .+|+ |.|. .++++..+.. ..-+.|.+.-++-..+|+.+.- T Consensus 200 ~pg~Il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~~-~~d~qf~e~k~~t~~dIa~af~ 278 (348) T protein:vir:26 200 HMGFIFYATDPNLSEADEKALKEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVGDI-ATKDEFERIKNITAQDIFVGHR 278 (348) T ss_pred CCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeEEcCCCCccceeEEEccCC-hhHHHHHHHHHhhHHHHHHHhC Confidence 7865554 556789999999999998753211 2333 3332 3555554432 2333477777777777887731 Q ss_pred c-chhcc--CccccccchhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHH Q lcl|NC_020866. 306 G-QTTTT--DAVAGGHAVSQEHRL-VQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASS 381 (547) Q Consensus 306 G-qtlts--~~~ggs~a~~~vh~~-v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~ 381 (547) - ..|.+ +..+++++-.+.... ....-+...++.+++.||+.+ .++ ... +|+|+.....+... 
+.+ T Consensus 279 VPp~llGi~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~l-------~~~-~~~--~~~fdl~~~~e~~~-~~a 347 (348) T protein:vir:26 279 FPAGMGGMLPQQGANVPDPLKVSQVYDFYEVIPVCKRFMDAVNNDP-------EIP-DNL--KLKFNLNPGVESAN-GSA 347 (348) T ss_pred CCHHHccccCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhh-------CCC-Ccc--EEEEecCcccccch-hhc Confidence 1 22332 222344444333333 334566777788888888743 122 222 45554322222111 112 Q ss_pred H Q lcl|NC_020866. 382 L 382 (547) Q Consensus 382 ~ 382 (547) + T Consensus 348 ~ 348 (348) T protein:vir:26 348 V 348 (348) T ss_pred C Confidence 2 No 135 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.02 E-value=2.6e-09 Score=67.66 Aligned_cols=418 Identities=10% Similarity=0.028 Sum_probs=162.7 Q ss_pred CCCCCeeeCCCCCcccch---hhhhhhhh--hhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRA---QLTEEVAA--PTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERD 75 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~---~l~~~~a~--~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D 75 (547) |-..|. -+.+..-+.-. .|-+.... ++..-.++.+.+.. .+ +..... ...........+. .= T Consensus 1 ~~~~p~-~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~----~i------~~~~~~-~~~~~~~~~~~~~-~~ 67 (479) T protein:vir:99 1 MIDLPD-EDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQ----EV------PDLATR-HKNKEREVLQQLS-RK 67 (479) T ss_pred CccCCc-ccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCC----cc------cccccc-cCChhHHHHHHHh-hc Confidence 554332 11121111100 00000000 01111111111110 00 000000 0000000000000 00 Q ss_pred HHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEee---cCCeee Q lcl|NC_020866. 76 PHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNT---SEGQYF 151 (547) Q Consensus 76 ~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~---~~g~~~ 151 (547) .+.+-++++....+.--.+. .++++ . .+.+.++++.-+|+.....++ ++..||. 
.++++|.. .++.-. T Consensus 68 n~~~~iVd~~~~~l~~~gf~--~~d~~-~----~~~~~~i~~~N~~d~~~~~~~~~a~~~G~-af~~v~~~~~~~d~~g~ 139 (479) T protein:vir:99 68 PWMGLMVNSFAQQLIVDGYR--KTGTN-E----NAKGWDTWRLNQMDKQQFWLNRAVLTFGY-AFIKVTSGISPLDGTTV 139 (479) T ss_pred CcHHHHHHHHHhhccccccc--CCCch-h----hHHHHHHHHhcChhHHHHHHHHHHhhcCc-eEEEEecCCCCcCCCCc Confidence 11122222222222112222 22222 1 233556666557888888865 7889998 46788842 111111 Q ss_pred EEeeEeeccceeEe--cccC-Ccc-eee-------------------ccCCCCce----e----eccCceEEEeecCCcC Q lcl|NC_020866. 152 PDKLIWRDPRHFRF--QKSD-LAT-PLL-------------------LDDNGAEV----P----LPAFRFIFADIPAKSG 200 (547) Q Consensus 152 ~~~l~~~~~~~~~~--~~~~-~~~-~~~-------------------~~~~~~g~----~----l~~~k~i~~~~~~~~~ 200 (547) + .+..++|+.... +... +.. ++. ......+. . +..--++.|.++.+. T Consensus 140 ~-~i~~~~p~~~~~iydd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~- 217 (479) T protein:vir:99 140 A-RIKCIDPRDAFAIWEDPYWDEWPKYLLERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDL- 217 (479) T ss_pred e-EEEEechhheEEEecCCcccceeeEEEeecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeecCCCc- Confidence 1 244445544321 1100 000 000 00000000 0 111124445555544 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe--CCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEe Q lcl|NC_020866. 201 ILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGK--WQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVE 278 (547) Q Consensus 201 ~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk--~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e 278 (547) .++|.|.+..+....=-=...+.+.....+.|+.|.++.+ ............ ..+. 
...++...+.+.++.+ T Consensus 218 ~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~-----~~~~-~~~i~~~~~~~~~~~q 291 (479) T protein:vir:99 218 RGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLPEGANADQEK-----MRFA-QESMLISQNEKASFGA 291 (479) T ss_pred CcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcccccccchhc-----cccc-cccceeecCCCceEEE Confidence 4689999887654433334456777778899999976644 221111111111 1222 2233445566777776 Q ss_pred cCCCCCchHHHHHHHHHHHHHHHHHHhcchhccC-ccc-cccchhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 279 SKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTD-AVA-GGHAVSQEH---RLVQEDIETSDAMALAAILNRDLIRPWIQ 353 (547) Q Consensus 279 ~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~-~~g-gs~a~~~vh---~~v~~~~~~ad~~~i~~~ln~~li~~l~~ 353 (547) ... .+...|...++.+-.+|+.. +-+.. .-| .+.++|..- ..-....++.-.+.+...+.+ +++.++. T Consensus 292 ~~~--~~~~~~~~~l~~~i~~i~~~----t~~p~~~~g~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~-~~~l~~~ 364 (479) T protein:vir:99 292 IPA--APLDGLLNAYKESLLEFLAL----AQLPPHIAGQIVNVAADALAAGTRQTMQKLFEKQATWKASHNQ-TMRLVNK 364 (479) T ss_pred ecc--cchHHHHHHHHHHHHHHhcc----CCCCHHHcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Confidence 542 33445666666555555532 22211 111 122333222 222233344445566666744 5665555 Q ss_pred hcCCCcc---cccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHh-CCCCCCCcchhccc--cccc-------- Q lcl|NC_020866. 354 LEYGPQK---RYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRF-GFSQPKPGAKLLRP--LENP-------- 419 (547) Q Consensus 354 ~Nfg~~~---~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~-Glp~p~~~e~~~~~--~~~~-------- 419 (547) +.-.... .-..+.|......+..+.++.+.+|++.|. ++.+.+.+.+ |+..++-. .+... .... 
T Consensus 365 ~~~~~~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~-is~et~l~~l~gv~~~~~e-~~~~~~~~~~~~~~~~~~~ 442 (479) T protein:vir:99 365 IEGRTEEATDLDFTITWQDVTIQSLAQFADAWAKMVESLK-IPAEGVWDMIPNLDQSTVN-GWKEIYDREGDFGKYMRKL 442 (479) T ss_pred HcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC-CCHHHHHHhcCCCCHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 5422111 123567777777888999999999999986 8888888877 88754311 11000 0000 Q ss_pred ccccccccccccccCCccccccccccccchhhhhhccc Q lcl|NC_020866. 420 ATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLS 457 (547) Q Consensus 420 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (547) .....+.....+......+..+.+.+..++.+ +.+.+ T Consensus 443 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 479 (479) T protein:vir:99 443 QNGPDPAEQRGGPNGATNMQQANNKTGEPASL-NKSGA 479 (479) T ss_pred hcccCcccccCCCCCCCCCCCCCCCCcchhcc-CCCCC Confidence 00000000011111111111111111111111 00000 No 136 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.00 E-value=6.7e-09 Score=65.38 Aligned_cols=424 Identities=11% Similarity=0.043 Sum_probs=197.3 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCC---CHHHHHHHHHhhcCCCHHHHHHHHHHHHh---- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGL---NPLRLAQILRAADHGDPVRYLDLAEVIEE---- 73 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l---~p~~~~~ilr~a~~g~~~~~~~L~~~m~~---- 73 (547) |+.+....+.+|.-..+...-...|.. 
.+.......+ +...+...+.........++..|.+.+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~ 73 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVLNLRFHRESRI-------RYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHD 73 (502) T ss_pred CceeEEEEecchhHHHhhcccChhHHh-------hhcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc Confidence 888888888888754332222222210 0000000000 11111111111111111111111111100 Q ss_pred -----------------hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-Hhhhhc Q lcl|NC_020866. 74 -----------------RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKG 135 (547) Q Consensus 74 -----------------~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G 135 (547) ...+-.-.+......+.+.+..+...++ +....+.+++.+++..-+|+....+++ ++..|| T Consensus 74 i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~-~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G 152 (502) T protein:vir:48 74 VLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDN-EDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTG 152 (502) T ss_pred ccccccccccccccceeecchHHHHHHHHhhhhcccCeeEecCCc-cchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcC Confidence 1244455666667777888888876543 334456677888888778998888887 688899 Q ss_pred CeeeeeEEeecCCeeeEEeeEeeccceeE--ecccCCcce------eeccC---------------------CCCceee- Q lcl|NC_020866. 136 YSFTEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATP------LLLDD---------------------NGAEVPL- 185 (547) Q Consensus 136 ~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~------~~~~~---------------------~~~g~~l- 185 (547) .+ ++++|...+|... +..++|+... ++......+ +.... 
.+....+ T Consensus 153 ~a-~~~v~~dedg~~~---i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~ 228 (502) T protein:vir:48 153 RA-YEVIYRSEYDETR---IKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYTLDASDSFNEIS 228 (502) T ss_pred eE-EEEEEeCCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEEEeecCCcEEEEEEEeCCeEEEEEeCCceeecc Confidence 75 4777765556543 3444554332 111100110 00000 0000000 Q ss_pred ---ccCceE-EEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHH-HHHHHHHHHHH Q lcl|NC_020866. 186 ---PAFRFI-FADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQD-KDTLFRAVANI 260 (547) Q Consensus 186 ---~~~k~i-~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~-~~~l~~al~~~ 260 (547) .++..| ++. -.+++.|.|.+..+....=--...+..++..++.|..|+++.+-.....+++ ...+.+. ..+ T Consensus 229 ~~~~~~g~vPvv~---~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~-~~~ 304 (502) T protein:vir:48 229 VTPHAFGTVPITE---FLNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRT-RLM 304 (502) T ss_pred ceecCCCccceEE---ecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhc-cee Confidence 011111 111 1346789999998766655566678888999999999998876432222111 1111110 000 Q ss_pred h---cCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHH-HH--HHHHHHH Q lcl|NC_020866. 261 A---GDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHR-LV--QEDIETS 334 (547) Q Consensus 261 ~---~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~-~v--~~~~~~a 334 (547) . ....-..+.+.+++++..... ...++..++.+.+.|.+.--...++.++.+ +..+|..-. .. ....+.. 
T Consensus 305 ~~~~~~~~~~~~~~~d~~~l~~~~~---~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~-~n~Sg~Alk~~~~~l~~k~~~ 380 (502) T protein:vir:48 305 QLKPPKSADGKEGTVKAEYLTKSYD---VSGAEAYKTRLNKDIHVFTNTPDMSDNHFS-GNASGEALKYKLFGLDQDRVD 380 (502) T ss_pred eccccccccccccCcceeEeeecCC---HHHHHHHHHHHHHHHHHHhCCCCcCccccc-cCchHHHHHHHHHHHHHHHHH Confidence 0 011111234567888754432 235778888888888765322223333322 233333222 11 2223334 Q ss_pred HHHHHHHHHHHHHHHHHHHh-c---CCC--cccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCC Q lcl|NC_020866. 335 DAMALAAILNRDLIRPWIQL-E---YGP--QKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPK 407 (547) Q Consensus 335 d~~~i~~~ln~~li~~l~~~-N---fg~--~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~ 407 (547) -.+.+...+.+ +++.++.+ + .+. ......+.|....+.+..+.++.+.++. |+ ++.+.+.+.++. +.|+ T Consensus 381 ~~~~~~~~l~~-~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~-iS~et~l~~l~~v~D~~ 456 (502) T protein:vir:48 381 TQSQFTQGLKR-RYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDLG--GQ-VSQETALSLSGLVENPT 456 (502) T ss_pred HHHHHHHHHHH-HHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CcHHHHHHhCCCCCCHH Confidence 44566666643 55555443 1 111 2233578898888999999999999984 65 899999999875 3222 Q ss_pred Ccchhccccccccc---cccc--ccccccccCCccccccccccccchhhhh Q lcl|NC_020866. 408 PGAKLLRPLENPAT---QAAP--PGMDPGAGDLKTAERKLKPLSGQADLSR 453 (547) Q Consensus 408 ~~e~~~~~~~~~~~---~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (547) +++-........ ...+ .....+.+.....+. +.....+... 
T Consensus 457 --~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~---~~~~~~~~~~ 502 (502) T protein:vir:48 457 --EELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKET---HTDDFERVYE 502 (502) T ss_pred --HHHHHHHHHHHhhhhhcccccccccccccCCCccCC---CCcCcCCCCC Confidence 222211111000 0000 000010110000000 0000000000 No 137 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=98.99 E-value=4.1e-09 Score=66.57 Aligned_cols=318 Identities=11% Similarity=0.005 Sum_probs=174.4 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcC--CCH----HHHHHHHHHHHhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADH--GDP----VRYLDLAEVIEER 74 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~--g~~----~~~~~L~~~m~~~ 74 (547) |.++- .+|-.+. +. +....+.-+-|-.-++-..+-..+.-+-. |+. ..+..|.+.. +. T Consensus 1 m~~~~------~~~~~~~----~~-----~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~pP~~~~~La~l~-~~ 64 (337) T protein:vir:78 1 MTKRQ------QQPAQAA----AS-----SPRPSVVFSMPEAIDPTAWMTDYTGVFYNPYGEYYQPPIDRKGLAKVA-RA 64 (337) T ss_pred CCCcc------cCccccc----cc-----CceeEEEecCcccccCcchhHhhhhhhhccCcceecCCCCHHHHHHHh-hc Confidence 55311 1111111 11 01111111112222221112222222211 221 1122222222 35 Q ss_pred hHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEe Q lcl|NC_020866. 75 DPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDK 154 (547) Q Consensus 75 D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~ 154 (547) .+|-.++|..|...+.+. ..+. .+ .|..++. +-+.+|.+.+|++++..| .+.. 
T Consensus 65 ~~~h~~~L~~k~N~~~~~---f~~~---------~~---------~~~~~~~---d~ll~GNay~~~~rn~~G---~~~~ 117 (337) T protein:vir:78 65 NAHHGAILMARRNMVAGR---FTNQ---------RA---------TITAFVH---NYLQFGDGGLLKLRNSFG---QVVG 117 (337) T ss_pred chhhhhHHHhhhcccccc---CcCc---------HH---------HHHHHHH---HHHhhCCeEEEEEECCCC---cEEE Confidence 788888888887765542 1111 01 1344444 445689999999886433 4678 Q ss_pred eEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_020866. 155 LIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQ 234 (547) Q Consensus 155 l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~ 234 (547) |.++++.+.+...++...++ ...+....+++...+..+.....+..||.+.+..+......-.....+-..|...-+. T Consensus 118 L~pl~~~~v~~~~d~~~~~~--~~~~~~~~~~~~eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~ 195 (337) T protein:vir:78 118 LHPLSSVYLRRREDGCFVYL--QQGKPNLIYRPDDVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAH 195 (337) T ss_pred EEEeCCceeEeeeCCeEEEE--EcCCceEEECCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 99999998888766544333 3345567788888766665554567899999999988888888888888888888888 Q ss_pred CeEEEeC-CCCCCHHHHHHHHHHHHHHhcC---ee--EEecCC--ceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 235 PLRIGKW-QPGASEQDKDTLFRAVANIAGD---CA--AIIPET--MSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG 306 (547) Q Consensus 235 P~~vgk~-~~~~~~~~~~~l~~al~~~~~~---~~--~vip~~--~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG 306 (547) |--|-+. ++..++++++++.+++++.... .. +..|.| ..++++..+.. 
..-..|.+.-++-..+|+.+.-- T Consensus 196 p~~il~~~~~~l~~e~~~~lk~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~-~~d~qfle~k~~s~~eIa~a~~V 274 (337) T protein:vir:78 196 MGFIFYATDPNMDDDTEEEMKEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDI-ATKDEFAAIKGITAQDVLTAHRY 274 (337) T ss_pred CceeEEcCCCCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCCccceeEEEcCCC-hhHHHHHHHHHHhHHHHHHHhCC Confidence 8666554 4567899999999998875321 12 333443 23566654443 33335777777777888877321 Q ss_pred -chhcc---CccccccchhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCH Q lcl|NC_020866. 307 -QTTTT---DAVAGGHAVSQEHRL-VQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDL 375 (547) Q Consensus 307 -qtlts---~~~ggs~a~~~vh~~-v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~ 375 (547) ..|.+ ++.+|+++-.+.... ....-+.-.++.|++.+|+.+++..... .|+.....-+ T Consensus 275 Pp~llGi~~~~~~~~~~n~e~~~~~f~~~~L~P~~~~ie~~~n~~ll~~~~~~-----------~f~~~~~~~~ 337 (337) T protein:vir:78 275 PPALAGIIPTNGGGGLGDPEKYDATYARNEVLPLCELVQDAINSAGLPRALWV-----------TFRETIGAAV 337 (337) T ss_pred CHHHcccccCCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhcCChhhce-----------eccccccccC Confidence 12222 223455554444443 3446677788888888887554432222 2222222211 No 138 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.99 E-value=6.9e-09 Score=65.33 Aligned_cols=414 Identities=11% Similarity=-0.013 Sum_probs=166.0 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhc--cCCCHHH------HHHHHHhhcCCCHHHHHHHHHH-- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPG--DGLNPLR------LAQILRAADHGDPVRYLDLAEV-- 70 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~--~~l~p~~------~~~ilr~a~~g~~~~~~~L~~~-- 70 (547) |- ||.+.. ..++. .+|++.. |...+.. 
..+.+..+..+|+- T Consensus 1 ~~-----------~~~~~~-----------------~~~~~~~~~l~~~e~~~i~~L~~~~~~-~~~r~~~l~~YY~G~~ 51 (504) T protein:vir:99 1 MT-----------EETTSA-----------------SKFTFRIPELNDDVVDKVNGLYQQLVD-RTPRNLLRASFYDGKY 51 (504) T ss_pred CC-----------ccCCcc-----------------cccccccCCCCHHHHHHHHHHHHHHHH-HhHHHHHHHHHHhccc Confidence 21 111110 00111 1222222 1111111 11111222222210 Q ss_pred ------------HHh---hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-Hhhhh Q lcl|NC_020866. 71 ------------IEE---RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISK 134 (547) Q Consensus 71 ------------m~~---~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~ 134 (547) |.. .-.+.+-++++....+.--.|.+ ++++..+ +.+.++++.-+|+....+++ +|+.| T Consensus 52 ~i~~~~~~~p~~~~~~~~v~n~~~~iVd~~a~rl~~~Gf~~--~d~~~~~----~~l~~i~~~N~ld~~~~~~~~~a~iy 125 (504) T protein:vir:99 52 AIRQIGNLIPPEYLRTATVLGWSAKAVDTLARRCNLESFVW--PDGDYGS----IGGPDVWDENFFATKANNAMVSSLIH 125 (504) T ss_pred cchhccccccHHHHHHhhccCcHHHHHHHHHhhhccceeeC--CCCChhh----HHHHHHHHhcChhhHHHHHHHHHHhh Confidence 000 00000111111111111112322 2222222 23556666667777777766 78899 Q ss_pred cCeeeeeEEeecCCeeeEEeeEeeccceeE--ecc----------------cCCcce---------ee--ccCCCCc-ee Q lcl|NC_020866. 135 GYSFTEIVWNTSEGQYFPDKLIWRDPRHFR--FQK----------------SDLATP---------LL--LDDNGAE-VP 184 (547) Q Consensus 135 G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~----------------~~~~~~---------~~--~~~~~~g-~~ 184 (547) |.|. +++|.-.++.-.+ .|..++|++.. ++. ++.... +. ..+.+.. .. T Consensus 126 G~af-~~v~~~~d~~~~~-~I~~~sP~~~~~iyD~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~ 203 (504) T protein:vir:99 126 GPAF-LINTEGGAGEPDS-LIHVKSAMQATGEWNSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHAD 203 (504) T ss_pred Ccee-EEEecCCCCCcee-EEEEeccceeEEEEeCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeeec Confidence 9975 6778655443222 24444554321 111 100000 00 0000000 00 Q ss_pred ecc----CceEEEeecCCcCccccchhHH-HHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe--CCCCCCHH------HHH Q lcl|NC_020866. 
185 LPA----FRFIFADIPAKSGILLRSGLAR-VAAWGWMFKAFTARDWAIFTQTYGQPLRIGK--WQPGASEQ------DKD 251 (547) Q Consensus 185 l~~----~k~i~~~~~~~~~~p~G~gll~-~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk--~~~~~~~~------~~~ 251 (547) ..+ .-++.+.++.+.+.|+|.+-+. .+....=-=+..+..-+.-.|.|..|.++.. .+....++ -.+ T Consensus 204 ~~~~~~gvPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~ 283 (504) T protein:vir:99 204 VRTHKLGVPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQ 283 (504) T ss_pred cccCCCCcceEEecccccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCccccccccccccchhh Confidence 000 0144556666677788887543 3333222222234444556688999976653 22111111 112 Q ss_pred HHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc---hh--ccCccccccchhHHHHH Q lcl|NC_020866. 252 TLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ---TT--TTDAVAGGHAVSQEHRL 326 (547) Q Consensus 252 ~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq---tl--ts~~~ggs~a~~~vh~~ 326 (547) .....+-.+..+.-+.++.+...++.+... ++...|...++.+-.+||.. .+- .| +++...+|-..-..... T Consensus 284 ~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~--~~l~~~~~~l~~~i~~~a~~-t~~P~~~lG~~~~~n~sSa~Ai~~~~~ 360 (504) T protein:vir:99 284 IALARVFALPDDEDEPDAARARADVKQFPA--SSPQPHIEMLEQIAMMFSGE-TSIPVESLGFSNRANPTSADAYIASRE 360 (504) T ss_pred hhhhhhhcCCCccccccccCccceeeecCC--CChHHHHHHHHHHHHHHHhh-hCCCHHHhcccccccccHHHHHHHHHH Confidence 222233333444445555566677765443 34446766666666666543 111 11 11111112111222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCC--C-cccccEEEecCccccCHHHHHHHHHHHHhCCC--CCCHHHHHH Q lcl|NC_020866. 327 VQEDIETSDAMALAAILNRDLIRPWIQL--EYG--P-QKRYPRIKIARPKTEDLTKLASSLDTLVRLGM--EIEENEVRS 399 (547) Q Consensus 327 v~~~~~~ad~~~i~~~ln~~li~~l~~~--Nfg--~-~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~--~v~~~~~~e 399 (547) -....++.-.+.+...+.+ +++..+.+ |.+ + ...-.+++|......++.+.++++.||++.|. 
.+..+.+.+ T Consensus 361 ~L~~ka~~k~~~f~~~l~~-~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~ 439 (504) T protein:vir:99 361 DLIAEAEGATDDWSPAFRR-SMIRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLE 439 (504) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCccccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHh Confidence 3444555666777778854 66655444 332 1 11335778888888899999999999999985 345678889 Q ss_pred HhCCCCCCCcchhcccccccc---------cccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccc Q lcl|NC_020866. 400 RFGFSQPKPGAKLLRPLENPA---------TQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTG 470 (547) Q Consensus 400 ~~Glp~p~~~e~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (547) .+|+++.+-. .+........ .....++........+..+++....+.....+. +.+ T Consensus 440 ~lg~~~~ei~-r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~~~~~~~~~~p~--------------~~~ 504 (504) T protein:vir:99 440 LLGLTPQQAK-RALAERRRASSVSIIEALNRRQQEAATAGEDQDQGAGEPPANEPPAALGRPT--------------LVG 504 (504) T ss_pred hcCCCHHHHH-HHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCCCCCCCCCccCCCcc--------------cCC Confidence 9999744211 1110000000 000000000000011111111100000000000 000 No 139 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.97 E-value=5.2e-09 Score=66.01 Aligned_cols=393 Identities=11% Similarity=-0.027 Sum_probs=171.3 Q ss_pred ccchhhhh-------hhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_020866. 15 VQRAQLTE-------EVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKR 87 (547) Q Consensus 15 ~~~~~l~~-------~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~ 87 (547) |...+..+ .... ...-.+.....|-+.. ++ +.....-...+..+-... .....+-++++... 
T Consensus 1 ~~~~t~~~~~~~l~~~~~~-~~~r~~~l~~Yy~g~~-~i-------~~~~~~~~~~~~~~~~~~--~~n~~~~ivd~~~~ 69 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDD-GMSRVRLLARYSNGDA-PL-------PELTRNTSAAWRSFQREA--RTNWGLMVRDSVAD 69 (456) T ss_pred CCCCCHHHHHHHHHHHHHH-HHHHHHHHHHHHhccC-Ch-------hhcCcccChhhchhhhhh--hcchHHHHHHHHHh Confidence 22111111 1000 0000000101111100 00 000000000000000000 12344556666666 Q ss_pred HHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEe- Q lcl|NC_020866. 88 SVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRF- 165 (547) Q Consensus 88 ~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~- 165 (547) .+.+.++.+...++.... +.+.++++.-+|+....+++ ++.-||.+ ++++|...+|... +..++|+.... T Consensus 70 ~l~~~g~~~~~~~d~~~~----~~~~~~~~~n~~d~~~~~~~~~a~~~G~a-~~~~~~~edg~~~---i~~~~p~~~~~i 141 (456) T protein:vir:79 70 RIIPNGITVGGSADSDLA----LRARRIWRDNRMDSVCKQWVKYGLDFGES-YLTCWRRDDGTAT---ITADSPETMVVS 141 (456) T ss_pred hhccCCeecCCCCCccHH----HHHHHHHHhcChhHHHHHHHHHHhhcCee-EEEEeeCCCCceE---EEEeccceeEEE Confidence 677778776543333322 33555666667888888777 78889985 6788876666543 44444543221 Q ss_pred -cccCCcce----eeccC-C----------------------------------CCc--ee---ec-cCceEEEeecCCc Q lcl|NC_020866. 166 -QKSDLATP----LLLDD-N----------------------------------GAE--VP---LP-AFRFIFADIPAKS 199 (547) Q Consensus 166 -~~~~~~~~----~~~~~-~----------------------------------~~g--~~---l~-~~k~i~~~~~~~~ 199 (547) +......+ +.... . ..+ .. .+ .+..+.+. .. T Consensus 142 ~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv---~~ 218 (456) T protein:vir:79 142 VDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVV---VY 218 (456) T ss_pred EcCCCCCceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEE---Ee Confidence 11110000 00000 0 000 00 00 11111111 23 Q ss_pred CccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCC--CCCCHHHHHHHHHHHHH--HhcCeeEEecCCceEE Q lcl|NC_020866. 
200 GILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQ--PGASEQDKDTLFRAVAN--IAGDCAAIIPETMSID 275 (547) Q Consensus 200 ~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~--~~~~~~~~~~l~~al~~--~~~~~~~vip~~~~ie 275 (547) .|+.|.|.+..+....---...+..-....+.|..|.++..-. .....++.......... ...+....+|.+.++. T Consensus 219 ~N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~~~~~~~ 298 (456) T protein:vir:79 219 QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAAPGALWELPPGVDIW 298 (456) T ss_pred cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhhccccccCCCCccee Confidence 5788999988865433222333445555667888887665311 00000011111111121 2223445567777664 Q ss_pred EEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 276 FVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQ---EHRLVQEDIETSDAMALAAILNRDLIRPWI 352 (547) Q Consensus 276 ~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~---vh~~v~~~~~~ad~~~i~~~ln~~li~~l~ 352 (547) .+. ..+...|...++.+-++|+.. .+-+...-+..++..+|+ ....-....++.-.+.+...|++ +++.++ T Consensus 299 q~~----~~~~~~~~~~l~~~i~~i~~~-t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~-~~~l~~ 372 (456) T protein:vir:79 299 ESQ----TNDFTPMLSAIKEHIRQLSSA-TKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEA-ILVKAL 372 (456) T ss_pred eec----ccChHHHHHHHHHHHHHHHhh-cCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH Confidence 332 223345777777766676654 221111111111222232 22222334445555677777754 777776 Q ss_pred HhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCc-chhccccccccccccccccccc Q lcl|NC_020866. 353 QLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPG-AKLLRPLENPATQAAPPGMDPG 431 (547) Q Consensus 353 ~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~-e~~~~~~~~~~~~~~~~~~~~~ 431 (547) .+.-.+.....++.|....+.+..+.|+++.+|+..|+ ++.+-+++.+|+.+++-. .+.-..........+ .+. 
T Consensus 373 ~~~g~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~-~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~~~~~----~~~ 447 (456) T protein:vir:79 373 QIEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGE-SWASIRRNILNYNADQIKQDDLDRAREQITLFAG----NPV 447 (456) T ss_pred HhcCCCccccceEEeCCCCCcCHHHHHHHHHHHHhcCC-ChHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhh----hHh Confidence 66633333345778888888888999999999999998 667777888898653211 111111110000000 000 Q ss_pred ccCCccccc Q lcl|NC_020866. 432 AGDLKTAER 440 (547) Q Consensus 432 ~~~~~~~~~ 440 (547) ..+.+.++. T Consensus 448 ~~~~~~~~~ 456 (456) T protein:vir:79 448 QRPQEDGSR 456 (456) T ss_pred hcCCCCCCC Confidence 000000000 No 140 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=98.96 E-value=9.4e-09 Score=64.58 Aligned_cols=328 Identities=12% Similarity=-0.033 Sum_probs=169.3 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHh-hcCCCHH----HHHHHHHHHHhhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRA-ADHGDPV----RYLDLAEVIEERD 75 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~-a~~g~~~----~~~~L~~~m~~~D 75 (547) |.++-+ + ..+.. .-..+.+...+ +.-+-|-.-++=.++-.-+.- +..|+.. ....|.+.+ +.. T Consensus 1 m~~~~~---~-~~~~~--~~~~~~~~~~~-----~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~-~~~ 68 (346) T protein:vir:10 1 MKKQLR---K-NLTQN--DRLQPQAQTEI-----FSFGDPIPVLDRADILNYLECSAMYEKWYNPPMSFDGLAKSL-RSS 68 (346) T ss_pred CCcccC---C-CCCcc--cccccccCeEE-----EecCCcceecCchhHHHHHHHhhcCCceEecCCCHHHHHHHH-Hhh Confidence 776422 1 11111 00011111000 111112111111112222222 2233221 122233222 356 Q ss_pred HHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 
76 PHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 76 ~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) +|-.+++..++..+..+- . .|... +...++.+-+++-+.+|.+++|+++...| .+..| T Consensus 69 ~~h~~~i~~k~n~l~~l~-~-~Pn~~-----------------~t~~~f~~~~~d~ll~Gnay~~i~r~~~G---~~~~L 126 (346) T protein:vir:10 69 THHESAIITKANILLSTC-E-VDSRY-----------------LSRRDLSSFVKDYLVFGNAYFEVVRNRLG---QVQRI 126 (346) T ss_pred hhcchhhhhhhhhHHHHH-h-CCCCC-----------------CCHHHHHHHHHHHHhcCCeEEEEEEcCCC---cEEEE Confidence 777777776665554420 0 12211 11223333334556799999999875433 35689 Q ss_pred EeeccceeEecccCCcc-eeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_020866. 156 IWRDPRHFRFQKSDLAT-PLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQ 234 (547) Q Consensus 156 ~~~~~~~~~~~~~~~~~-~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~ 234 (547) .++++.+.+...+++.. +.....++....+++...|+.+.....+..||.+.+..+......-.....+...|...-+. T Consensus 127 ~pl~~~~v~~~~~~~~~~~~~~~~~g~~~~~~~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~ 206 (346) T protein:vir:10 127 ESPLAKYVRKGLEAGQFYYVPQRFDHQEHEFAKGSIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAH 206 (346) T ss_pred EEecCCceEEEEcCCeEEEEEEccCCeEEEEecccEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 99999988776544433 33333445567888888776666655567899999999998888888888888899888888 Q ss_pred CeEEEeC-CCCCCHHHHHHHHHHHHHHhcC----eeEEecCCc---eEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 235 PLRIGKW-QPGASEQDKDTLFRAVANIAGD----CAAIIPETM---SIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG 306 (547) Q Consensus 235 P~~vgk~-~~~~~~~~~~~l~~al~~~~~~----~~~vip~~~---~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG 306 (547) |--|.++ ++..++++++++.+++++.... ..+|++.|. .+++...+.. 
..-..|.+.-++-..+|+.+.-- T Consensus 207 ~~~il~~~d~~l~~e~~~~i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~-~~d~qf~e~k~~~~~~I~~af~V 285 (346) T protein:vir:10 207 AGFVFYMSDASQKQEDVENIRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADV-SAKDEFFNIKNVSRDDVLAAHRV 285 (346) T ss_pred CceEEEeCCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEecCCC-hhHHHHHHHHHHhHHHHHHHhCC Confidence 8666554 5567899999999888865321 224444332 3556554432 23334777777888888888421 Q ss_pred -chhcc--CccccccchhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHH Q lcl|NC_020866. 307 -QTTTT--DAVAGGHAVSQEHRLVQ-EDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTK 377 (547) Q Consensus 307 -qtlts--~~~ggs~a~~~vh~~v~-~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~ 377 (547) ..|.+ +..+|+++-.+....++ ..-+.-.++.|++ +|+.|... .++|...+-...++ T Consensus 286 Pp~llG~~~~~~~~~s~~e~~~~~f~~~~l~P~~~~iee-~n~~L~~e-------------~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 286 PPQLMGIIPNNTGGFGNVADAAEVFFITEIEPLQERLKE-FNQWLGQE-------------VIKFKPSKLLQRTQ 346 (346) T ss_pred CHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcccc-------------eeeechhhhcccCC Confidence 22322 22234455444333333 2334555555543 33322221 23333211111111 No 141 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=98.94 E-value=1.1e-08 Score=64.29 Aligned_cols=346 Identities=13% Similarity=0.010 Sum_probs=167.3 Q ss_pred CCCCCeeeCCCCCcccchh----hhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCH----HHHHHHHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQ----LTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDP----VRYLDLAEVIE 72 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~----l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~----~~~~~L~~~m~ 72 (547) |.++-+=--+.-++-..+. ...+..+.....+..+.-+-|..-++..++-.-+.-+..|++ ..+.-|.+ +. 
T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~-~~ 79 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLAR-SF 79 (368) T ss_pred CCccccccchhccCcccccccccCcchhhccccCceEEEEcCCceeecchhhHHHHHHHHhccchhccCcCHHHHHH-HH Confidence 8864432111111110000 111111111111111111223222222212222222222221 11111221 12 Q ss_pred hhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeE Q lcl|NC_020866. 73 ERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFP 152 (547) Q Consensus 73 ~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~ 152 (547) +..+|-.+++..+...+. +. ..|+..- ....|..++ .+-+.+|.+++|++....| .+ T Consensus 80 ~~~~~h~~~~~~~~n~l~-l~--~~Pn~~~--------------t~~~f~~l~---~d~ll~Gnay~~~~r~~~G---~~ 136 (368) T protein:vir:79 80 RAAAHHSSAVYVKRNILV-ST--FIPHPLL--------------SRATFERLV---LDWQVFGNAYLERRENVLG---GT 136 (368) T ss_pred hhccccchhhhhhcchhh-hh--cCCCcCC--------------CHHHHHHHH---HHHhhcCCeEEEEEEcCCC---CE Confidence 234444454444333222 11 1232211 111133343 3556799999999865433 35 Q ss_pred EeeEeeccceeEecccCCcceeeccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020866. 153 DKLIWRDPRHFRFQKSDLATPLLLDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTY 232 (547) Q Consensus 153 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~ 232 (547) ..|.++++.+++...+++ .+......+....+++...|..+.....+..||.+.+..+......-.....+-..|.+.. 
T Consensus 137 ~~L~~l~~~~v~~~~~~~-~~~~~~~~~~~~~~~~~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NG 215 (368) T protein:vir:79 137 IRLDTPLAKYVRRGLDLN-TYFFVQNWQQPYTFAAGSVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNG 215 (368) T ss_pred EEEEEeCcccceeeccCC-EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 689999999888766554 3444445566678888887666655546678999999999988888777777778888888 Q ss_pred CCCeEEEeC-CCCCCHHHHHHHHHHHHHHhcC----eeEEe-cCC--ceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 233 GQPLRIGKW-QPGASEQDKDTLFRAVANIAGD----CAAII-PET--MSIDFVESKSIGSSVDLYEKRINHLDQQISKAV 304 (547) Q Consensus 233 G~P~~vgk~-~~~~~~~~~~~l~~al~~~~~~----~~~vi-p~~--~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i 304 (547) +.|--|-++ ++..++++++++.+.+++.... ..+|+ |.| ..++++..+.. ..-..|.+.-++-.++|+.+. T Consensus 216 a~~~gil~~~~~~l~~e~~~~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~-~~d~qf~e~k~~~~~eIa~af 294 (368) T protein:vir:79 216 SHAGFILYMTDAAQKQEDVDTLREAMKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEV-AAKDEFWNIKNVTRDDQLAAH 294 (368) T ss_pred CCCceEEEeCCCCCCHHHHHHHHHHHHHhcCCcccCceeEecCCCCccceeEEEcCCC-HHHHHHHHHHHHhHHHHHHHh Confidence 888666555 4567899999999988874321 23444 332 23555554432 333458888888889999884 Q ss_pred hc-chhcc--CccccccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCc--cccCHHHH Q lcl|NC_020866. 305 LG-QTTTT--DAVAGGHAVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARP--KTEDLTKL 378 (547) Q Consensus 305 lG-qtlts--~~~ggs~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~--~~ed~~~~ 378 (547) -- ..|.+ +..+++++-.+....+ ...-+.--++.+++ +|..|.. . .++|+.. ...|.+.. T Consensus 295 ~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~~ie~-ln~~l~~-----------e--~~rF~~~~l~~~D~~a~ 360 (368) T protein:vir:79 295 RVPPQLMGIIPNNTGGFGDVEKAAMVFARNEVKPLQDRLLA-INDWIGD-----------E--VVRFAPYALGGHDQPAA 360 (368) T ss_pred CCCHHHccccCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHhccCc-----------c--eeeechhHhhccccccc Confidence 21 12222 2222334433333222 23334444555542 3332211 1 2333321 12233333 Q ss_pred HHHHHHHH Q lcl|NC_020866. 
379 ASSLDTLV 386 (547) Q Consensus 379 a~~~~~l~ 386 (547) ++.-++-. T Consensus 361 a~~~~rsa 368 (368) T protein:vir:79 361 APGGQRSA 368 (368) T ss_pred CCcccccC Confidence 32222111 No 142 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=98.92 E-value=1.4e-08 Score=63.66 Aligned_cols=412 Identities=11% Similarity=0.009 Sum_probs=187.0 Q ss_pred CC------CCCeeeCCCCCccc--chhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHH Q lcl|NC_020866. 1 MA------QKPVLLDRWGNPVQ--RAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIE 72 (547) Q Consensus 1 ma------~~~~~~~~~g~~~~--~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~ 72 (547) |- |...-+=.-|+.+. .++.++.. .+.+..-.-..++...+...+.... ....++..|++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~r~~~l~~YY~ 71 (492) T protein:vir:97 1 MQFIQLISQVAQALIKGGNILYPSQPTQTEIF--------DAIVRTNNKPETLEEMIVRYIKQHL-EKLPEISIGQEYYE 71 (492) T ss_pred ChHHHHHHHHHHHHhcCCceeeccchhhhhHh--------hhcccCCCchhhHHHHHHHHHHHHH-HHHHHHHHHHHHhc Confidence 10 00000111222221 11111111 0000000000111111111111111 11222222222110 Q ss_pred h--------------------------hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHH Q lcl|NC_020866. 73 E--------------------------RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVF 126 (547) Q Consensus 73 ~--------------------------~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~ 126 (547) - ......-++++....+.+.+.++.+.+ .+..+++.+++++ ++++.+. 
T Consensus 72 g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~d-----~~~~~~l~~~~~n-~~~~~~~ 145 (492) T protein:vir:97 72 QRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD-----DEVVKRIDEVLGN-RFDDKLH 145 (492) T ss_pred ccCccccccccccccccccccccccccccchHHHHHHHHhhhhcccCceeccCc-----hHHHHHHHHHHhc-cHHHHHH Confidence 0 023344455666666777777775432 2345677777765 6777777 Q ss_pred HHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEe--cccCCccee--e--c-cCCCCceee-ccCceEEEeec- Q lcl|NC_020866. 127 HIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRF--QKSDLATPL--L--L-DDNGAEVPL-PAFRFIFADIP- 196 (547) Q Consensus 127 ~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~--~~~~~~~~~--~--~-~~~~~g~~l-~~~k~i~~~~~- 196 (547) ++. ++..||. +++++|...+|... +..++|+.... +......++ . . ......+.+ .+.+...+.+. T Consensus 146 ~~~~~~~~~G~-a~~~v~~d~dg~~~---~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~ 221 (492) T protein:vir:97 146 SVLTGASNKGI-EWLHPYLDEEGEFK---LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYEN 221 (492) T ss_pred HHHHHHhhcCe-EEEEEEecCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEec Confidence 765 6888997 46677765566543 44455554321 111111100 0 0 000000000 01100000000 Q ss_pred -----------------------------CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCH Q lcl|NC_020866. 197 -----------------------------AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASE 247 (547) Q Consensus 197 -----------------------------~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~ 247 (547) .-.+++.|.|.+..+....---...+..++..++.|..|+++++-... T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~--- 298 (492) T protein:vir:97 222 GSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDD--- 298 (492) T ss_pred CeeeecccccccccccccccCCCCCcceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCc--- Confidence 012356789999886665555556788888889999999999873222 Q ss_pred HHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHH-- Q lcl|NC_020866. 
248 QDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHR-- 325 (547) Q Consensus 248 ~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~-- 325 (547) ++... ....+.....+.++.+.+++++.... ....++.+++.+.+.|.+.--...++.++.+| ..+|..-. T Consensus 299 ~~~~~---~~~~~~~~~~~~~~~~~~~~~l~~~~---~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~-n~Sg~Al~~~ 371 (492) T protein:vir:97 299 QELPE---FKRLLRYYGAIKVSDNGGVDTIQVEV---PVENSKKYLDELYQKIMLFGQAVDFSSDKFGS-APSGVALEFL 371 (492) T ss_pred ccchh---HHHHHhhccceecCCCCcceeEeccC---CHHHHHHHHHHHHHHHHHHhCCCCCCcccccc-CcHHHHHHHH Confidence 22111 22334455677789999999986543 23458888888888887763333333343232 22333221 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC Q lcl|NC_020866. 326 -LVQEDIETSDAMALAAILNRDLIRPWIQLE-YGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF 403 (547) Q Consensus 326 -~v~~~~~~ad~~~i~~~ln~~li~~l~~~N-fg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl 403 (547) .-....+..-.+.+...+. ++++.++.+. .......-.+.|....+.+..+.++.+.++. |+ ++.+.+.+.++. T Consensus 372 ~~~l~~ka~~~~~~f~~~l~-~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~--G~-iS~et~l~~l~~ 447 (492) T protein:vir:97 372 YTNLNLKADKLARKAKVAIQ-ELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSM--GI-VSHETVLENHPF 447 (492) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCcccceeeEEecCCCCCCHHHHHHHHHHHh--cc-CchHHHHHhCCC Confidence 1122233444555666664 3566555553 2222233467888888899999999999984 66 899988888874 Q ss_pred -CCCCCcchhcc--cccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchh Q lcl|NC_020866. 404 -SQPKPGAKLLR--PLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGA 472 (547) Q Consensus 404 -p~p~~~e~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (547) +.|+ +++-. ..........+.....+.......+.+.... . 
+ T Consensus 448 v~d~~--~Eleri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-----------------------e 492 (492) T protein:vir:97 448 VEDLQ--AELERIEQEQTEYNKQLPNLDDGGADSAQQQERSNNKE--S-----------------------E 492 (492) T ss_pred CCCHH--HHHHHHHHHHHHHHHhhhccccCCCCCCcccccccccc--c-----------------------C Confidence 3332 22211 1110000000000000000000000000000 0 0 No 143 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=98.92 E-value=1.2e-08 Score=63.99 Aligned_cols=317 Identities=14% Similarity=0.032 Sum_probs=173.6 Q ss_pred CCCCCeeeC---CCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhc-----CCCHH----HHHHHH Q lcl|NC_020866. 1 MAQKPVLLD---RWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAAD-----HGDPV----RYLDLA 68 (547) Q Consensus 1 ma~~~~~~~---~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~-----~g~~~----~~~~L~ 68 (547) |-++.+=.. ..+.|.+...++-+ .| .| ..+++-.+ .|+.- .+..|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~----~~---~~~~~y~~~~~~~~~~~~epp~~~~~la 58 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLS---------------EI----TA---SPALDYVGIGFDENYNCYLPPVNRHALA 58 (345) T ss_pred CCccccccchhhhcCCCceEEEeecC---------------Cc----cc---chhhcccceeeecCCccccCCCCHHHHH Confidence 543222111 11222211111111 01 11 11222111 11110 111132 Q ss_pred HHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCC Q lcl|NC_020866. 69 EVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEG 148 (547) Q Consensus 69 ~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g 148 (547) +.. +..+|-.++|.-++.-+.+ .+.|+..-.. ..|..++. 
+.+.+|.+.+|++....| T Consensus 59 ~~~-~~~~~h~~~i~~k~n~l~~---~~~Pn~~~t~--------------~~f~~~v~---d~ll~Gnay~~i~rn~~G- 116 (345) T protein:vir:37 59 KLP-HQNAQHGGILHSRANMVSA---TYEGGKALSK--------------MEMRALCL---NLIQFGDVGLLKVRNGFG- 116 (345) T ss_pred HHh-hcchhhcchhhhhhhHHhh---ccCCCCCCCH--------------HHHHHHHH---HHHhcCCeEEEEEECCCC- Confidence 222 3688889999887777765 2355443211 11334443 455789999999875544 Q ss_pred eeeEEeeEeeccceeEecccCCcceeecc----CCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 149 QYFPDKLIWRDPRHFRFQKSDLATPLLLD----DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARD 224 (547) Q Consensus 149 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~ 224 (547) .+..|.++++.+.+...++...+.... ..+....+++...|+++.....+..||.+-+..+......-.....+ T Consensus 117 --~~~~L~pl~~~~vr~~~d~~~~~~~~~~~~~~~g~~~~~~~~eViHir~~~~~~~~~Gl~~~~~a~~si~l~~~a~~~ 194 (345) T protein:vir:37 117 --QVVRLVPLSSLYLRVHKDGGYSYLMKKSLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVF 194 (345) T ss_pred --CEEEEEEecCceeEEeecCCeeEEEeeeeeccCceEEEEccccEEEEcCCCCCCCcccchHHHHHHHHHHHHHHHHHH Confidence 356899999998887766654443322 23444578888877666555556789999888888877777777777 Q ss_pred HHHHHHHcCCCeEEEe-CCCCCCHHHHHHHHHHHHHHhcC---e--eEEecCCc--eEEEEecCCCCCchHHHHHHHHHH Q lcl|NC_020866. 225 WAIFTQTYGQPLRIGK-WQPGASEQDKDTLFRAVANIAGD---C--AAIIPETM--SIDFVESKSIGSSVDLYEKRINHL 296 (547) Q Consensus 225 w~~f~E~~G~P~~vgk-~~~~~~~~~~~~l~~al~~~~~~---~--~~vip~~~--~ie~~e~~~~~~~~~~f~~li~~~ 296 (547) -..|.+.-+.|--|-+ .++..++++.+++.+++++.... . .+.+|.|. .++++..+.. 
..-..|.+..++- T Consensus 195 ~~~~f~NGa~~~~Il~~t~~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~~~g~~~G~~~~pl~~~-~~d~qf~e~k~~~ 273 (345) T protein:vir:37 195 RRRYFSNGAHMGFILYSTDPDLTEEMEEEIARKISESKGVGNFRSMFVNIAGGHPDGLKVIPIGDT-GTKDEFANIKNIS 273 (345) T ss_pred HHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEEccCC-hhHHHHHHHHHHh Confidence 7788877778866554 45578999999999999876432 1 13344442 3566554443 2333477777777 Q ss_pred HHHHHHHHhc-chhcc--CccccccchhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccc Q lcl|NC_020866. 297 DQQISKAVLG-QTTTT--DAVAGGHAVSQEHRLVQ-EDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKT 372 (547) Q Consensus 297 d~~Isk~ilG-qtlts--~~~ggs~a~~~vh~~v~-~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ 372 (547) ..+|+.+.-- ..|.+ +..+|+++-.+....++ ..-+.--++.+++.+|+. .. ++ . ...++|+.. T Consensus 274 ~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~-----~e--~~-~--~~~i~F~~~-- 341 (345) T protein:vir:37 274 AQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDEVMPLQEIIAETINQD-----PE--IK-N--LLKIKFREQ-- 341 (345) T ss_pred HHHHHHHhCCCHHHhccccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHhhhh-----hc--cC-C--cceEEECch-- Confidence 7888887321 22332 22344555444444433 344667777788777751 11 11 1 124555532 Q ss_pred cCHHH Q lcl|NC_020866. 373 EDLTK 377 (547) Q Consensus 373 ed~~~ 377 (547) ++.+ T Consensus 342 -~l~k 345 (345) T protein:vir:37 342 -NFAK 345 (345) T ss_pred -hhcC Confidence 2222 No 144 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.89 E-value=1.5e-08 Score=63.48 Aligned_cols=365 Identities=11% Similarity=0.037 Sum_probs=164.5 Q ss_pred hhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhC Q lcl|NC_020866. 
39 YPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKR 118 (547) Q Consensus 39 ~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~ 118 (547) |...... +.+....|.+-. .+.+-+++.....+.-..|++ ++.+ .. +.+.++++. T Consensus 1 ~l~~~~~-~~~~~~~~~~v~-----------------n~~~~ivd~~~~~l~~~gf~~--~d~~-~~----~~~~~i~~~ 55 (434) T protein:vir:98 1 MLPKNAE-QAFLDFQRKART-----------------NFCGLIANASVHRLLALGVTG--PDGE-PD----TRASRWWQA 55 (434) T ss_pred CCCCCcc-HHHHHhhhhhhc-----------------cchHHHHHHHHhhhccCceec--CCCc-hH----HHHHHHHHh Confidence 2212211 222222222111 111222222222222222332 2222 22 234556666 Q ss_pred CCHHHHHHHHH-HhhhhcCeeeeeEEeecCCee-----eEEeeEeeccceeEecccCC-cce----ee------------ Q lcl|NC_020866. 119 EELQQEVFHIL-DCISKGYSFTEIVWNTSEGQY-----FPDKLIWRDPRHFRFQKSDL-ATP----LL------------ 175 (547) Q Consensus 119 ~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~-----~~~~l~~~~~~~~~~~~~~~-~~~----~~------------ 175 (547) -+|+.....++ +|..||.| ++++|...++.. .+ .|..++|++.....+.. .++ .. T Consensus 56 N~~d~~~~~~~~~a~i~G~a-y~~v~~~~~~~~~~~~~~~-~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~~ 133 (434) T protein:vir:98 56 NRLDSRQKLVWRMAMAQSAG-YMLVGAHPTRTEDNGRPSP-LITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYAR 133 (434) T ss_pred cChhHHHHHHHHHHhhcCce-EEEEecCCCcccccCCcee-EEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEEE Confidence 67888888865 78999976 456674332211 11 25555665432211110 000 00 Q ss_pred ------------ccCCCC-ce----------eec-----cCc---eEEEeecCCcCccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 176 ------------LDDNGA-EV----------PLP-----AFR---FIFADIPAKSGILLRSGLARVAAWGWMFKAFTARD 224 (547) Q Consensus 176 ------------~~~~~~-g~----------~l~-----~~k---~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~ 224 (547) ...... .. ..+ ++. ++.+.++...+ ..|.|-++.+....--=...+.. 
T Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~-~~g~sd~e~vi~liDa~~~~~s~ 212 (434) T protein:vir:98 134 VFFDDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLG-EDPEPEFAGVLDIQDRVNLGILN 212 (434) T ss_pred EEEeCcEEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCCcC-cCCcchhhhHHHHHHHHHHHHHH Confidence 000000 00 000 111 22233333332 25888888876665555666778 Q ss_pred HHHHHHHcCCCeEEEe--CCCCCCHHHH-HHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHH Q lcl|NC_020866. 225 WAIFTQTYGQPLRIGK--WQPGASEQDK-DTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQIS 301 (547) Q Consensus 225 w~~f~E~~G~P~~vgk--~~~~~~~~~~-~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Is 301 (547) .+...|.|+.|.++.+ .+....++.. ..-...+-... .+.+....+.+.++.+... .+...|...++.+-.+|+ T Consensus 213 ~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~-~~~i~~~~~~~~~~~q~~~--~~~~~~~~~l~~~i~~~~ 289 (434) T protein:vir:98 213 RMAASRFSGFRQKWIKGHKFAKRTDPATGMTVVDQPFVPS-PSAVWASEGENTQFGQLDA--TDLSGFLKEHASDVRDML 289 (434) T ss_pred HHHHHHHhcchhhhhcCCCcccccccccccchhhhhhhcc-ccccccCCCCCceEEEecC--cchHHHHHHHHHHHHHHh Confidence 8888899999987765 1111111111 11111111122 2233444556777776543 334457776766666665 Q ss_pred HHHhc--chhccCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-cccccEEEecCccccCHHHH Q lcl|NC_020866. 302 KAVLG--QTTTTDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGP-QKRYPRIKIARPKTEDLTKL 378 (547) Q Consensus 302 k~ilG--qtlts~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~-~~~~p~~~~~~~~~ed~~~~ 378 (547) ..--- ..+.+...+.|-..-.....-....++.-.+.+...+. ++++.++.++-.+ .....++.|....+.++.+. 
T Consensus 290 ~~~~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~-~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ 368 (434) T protein:vir:98 290 TISQTPTYLYATDLVNISADTIGALDILHVAKVREHIASFSEGLE-SVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVK 368 (434) T ss_pred cccCCCHHHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCChhheeeeEEecCCCCCCHHHH Confidence 44100 01111101111111222233334444555566777774 4667666665322 22335788888899999999 Q ss_pred HHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccc---------cccccccccccccccCCcccccccccccc Q lcl|NC_020866. 379 ASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLEN---------PATQAAPPGMDPGAGDLKTAERKLKPLSG 447 (547) Q Consensus 379 a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (547) ++++.+|+..| ++.+.+++.+|+++.+ -+.+...... ++...++++..+..+. ...+ T Consensus 369 ada~~kl~~~g--~~~e~~~~~lg~~~~e-~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~~~~~~---------~~dg 434 (434) T protein:vir:98 369 ADAATKLKSIG--YPLDVIAEELDESPAR-VRRIVAGAASQALLAASLLPAPGAPSAGNVPDSGG---------AVDG 434 (434) T ss_pred HHHHHHHHhcC--CcHHHHHHhCCCCHHH-HHHHHHHHHHHHHHHHhhhccCCCCCCCCCCcccC---------CCCC Confidence 99999999888 4778899999986421 1111111100 0000000000000000 0000 No 145 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.89 E-value=1.9e-08 Score=62.86 Aligned_cols=421 Identities=12% Similarity=0.091 Sum_probs=193.1 Q ss_pred CCCCCeeeCCCCCccc-chhhh----hhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHH---HHHHHHH-- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQ-RAQLT----EEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVR---YLDLAEV-- 70 (547) Q Consensus 1 ma~~~~~~~~~g~~~~-~~~l~----~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~---~~~L~~~-- 70 (547) ||. |+..+-.-+. .-..- .+++.....-+.+.+..| ...++....+ .-.|.... ....+.. 
T Consensus 1 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~-----~~~~~~~~~~-YY~g~~~i~~~~~~~~~~~~ 71 (503) T protein:vir:59 1 MAD---IYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEH-----NPEPLLKGVR-YYMCENDIEKKRRTYYDAAG 71 (503) T ss_pred Ccc---cccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhh-----cHHHHHHHHH-Hhccccchhhccchhccccc Confidence 885 4433222221 11111 122222222222222211 1222222222 11111100 0000000 Q ss_pred --H--------HhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeee Q lcl|NC_020866. 71 --I--------EERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFT 139 (547) Q Consensus 71 --m--------~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~ 139 (547) . .-..+...-.+.+....+.+.+.++...+ .+..++++.++++ +|......+. ++..||.++ T Consensus 72 ~~~~~~~~~~~ri~~n~~~~ivd~~~~yl~g~~~~~~~~d-----~~~~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~- 144 (503) T protein:vir:59 72 QQLVDDTKTNNRTSHAWHKLFVDQKTQYLVGEPVTFTSDN-----KTLLEYVNELADD-DFDDILNETVKNMSNKGIEY- 144 (503) T ss_pred ccccccccccceeecchHHHHHHHHHhhhhcCCeeeccCc-----HHHHHHHHHHHhc-CHHHHHHHHHHHHhhCCeEE- Confidence 0 00123445566777778888887775432 2344567777654 6777766665 688899986 Q ss_pred eeEEeecCCeeeEEeeEeeccceeEe--cccCCcce------eecc-CCCCc---e-eeccCceEEEee----------- Q lcl|NC_020866. 140 EIVWNTSEGQYFPDKLIWRDPRHFRF--QKSDLATP------LLLD-DNGAE---V-PLPAFRFIFADI----------- 195 (547) Q Consensus 140 Ei~w~~~~g~~~~~~l~~~~~~~~~~--~~~~~~~~------~~~~-~~~~g---~-~l~~~k~i~~~~----------- 195 (547) +++|...+|.+. +..++|+.+.. +......+ .... ..+.. + -+.+.....+.. 
T Consensus 145 ~~v~~d~dg~~~---i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~ 221 (503) T protein:vir:59 145 WHPFVDEEGEFD---YVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSY 221 (503) T ss_pred EEEeecCCCceE---EEEEccceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccc Confidence 455544455543 44555544321 11000100 0000 00000 0 000000000000 Q ss_pred -----------------------cCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHH Q lcl|NC_020866. 196 -----------------------PAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDT 252 (547) Q Consensus 196 -----------------------~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~ 252 (547) -.-.+|+.|.|.+..+....-.=...+..++..++.|+.|+++.+--.+...+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~---- 297 (503) T protein:vir:59 222 GENNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPK---- 297 (503) T ss_pred cccccccceeecceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccc---- Confidence 01134678999998866555444556778888889999999887732222211 Q ss_pred HHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHH---HHHH Q lcl|NC_020866. 253 LFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHR---LVQE 329 (547) Q Consensus 253 l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~---~v~~ 329 (547) +...++.....+.+|.+.+++++.... ....++..++.+.+.|.+.--+..++.+.. ++..+|..-. .-.. 
T Consensus 298 --~~~~~~~~~~~~~~~~~~~~~~l~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~~Sg~Ai~~~~~~l~ 371 (503) T protein:vir:59 298 --EFTANLRYHSVIKVSGDGGVDTLRAEI---PVDSAAKELERIQDELYKSAQAVDNSPETI-GGGATGPALENLYALLD 371 (503) T ss_pred --hhhhhhhcccceeccCCCcceeEeccC---CHHHHHHHHHHHHHHHHHHhcccCCCcccc-cccccHHHHHHHHHHHH Confidence 222345566777889999999987543 234578888888888776643333333322 2222333221 1122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh---cCCC---cccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC Q lcl|NC_020866. 330 DIETSDAMALAAILNRDLIRPWIQL---EYGP---QKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF 403 (547) Q Consensus 330 ~~~~ad~~~i~~~ln~~li~~l~~~---Nfg~---~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl 403 (547) ..+..-.+.+...+. ++++.++.+ ..+. ......+.|....+.+..+.++.+.+|++.|+ ++.+.+.+.++. T Consensus 372 ~k~~~~~~~~~~~l~-~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~Gi-iS~et~l~~l~~ 449 (503) T protein:vir:59 372 LKANMAERKIRAGLR-LFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGI-MSKETAVARNPF 449 (503) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHhccCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCC-CchHHHHHhCCC Confidence 233444555666664 355554443 2111 11225788888999999999999999999997 888888888864 Q ss_pred -CCCCCcchhcccccccc----cccccccccccccCCccccccccccccchhhhhhcccch Q lcl|NC_020866. 404 -SQPKPGAKLLRPLENPA----TQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLN 459 (547) Q Consensus 404 -p~p~~~e~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (547) +.|+ +++........ .........++..... 
...+.+ ..+..+.+...+ T Consensus 450 v~d~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~~~~~~~~g~~~ 503 (503) T protein:vir:59 450 VQDPE--EELARIEEEMNQYAEMQGNLLDDEGGDDDLE--EDDPNA---GAAESGGAGQVS 503 (503) T ss_pred CCCHH--HHHHHHHHHHHHHHhhhccccCccCCCCCCC--cCCCCC---CcccCCCCCCcC Confidence 2222 22211111000 0000000001100000 000000 000000000000 No 146 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=98.87 E-value=2.3e-08 Score=62.46 Aligned_cols=415 Identities=10% Similarity=0.020 Sum_probs=187.7 Q ss_pred CCCCCeeeCCCCCcccch---hhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHH-------HH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRA---QLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLA-------EV 70 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~---~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~-------~~ 70 (547) |- .++-=|+++|.... .+..+. ..+..-+...+..|-.+ ..++... +..-.|.-..+.... .+ T Consensus 1 ~~--~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~i~~~i~~~~~~---~~~~~~~-~~Yy~g~~~i~~r~~~~~~~~~~~ 73 (474) T protein:vir:95 1 MF--NIIRMPWDKPYGEEVVEQLKPQF-ETQEEMIIRLIDDHRKQ---LDKITVG-QRYYDKDNDIVKQMKKVDVYGNID 73 (474) T ss_pred Cc--ceeecCCCCchhhHHHHhhhhcc-CChHHHHHHHHHHHHHH---HHHHHHH-HHHhcccCchhccccccccccccc Confidence 43 44434445544421 111111 11111222222222111 1122111 111111100000000 00 Q ss_pred HHh-----hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEe Q lcl|NC_020866. 71 IEE-----RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWN 144 (547) Q Consensus 71 m~~-----~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~ 144 (547) ..+ ......-.+.+....+.+.+..+...+ ++..++++.++++ +|+..+..+. ++.-||.+ .+++|. 
T Consensus 74 ~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d-----~~~~~~l~~~~~n-~~~~~~~e~~~~~~~~G~~-~~~v~~ 146 (474) T protein:vir:95 74 YDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCED-----ESVLKIIHDVLDT-RWDNKLIDILTATSNKGID-WLQVYI 146 (474) T ss_pred cccccceeccchHHHHHHHHHhhhccCCceeccCc-----hHHHHHHHHHHhc-cHHHHHHHHHHHHhhcCcE-EEEEEe Confidence 000 124444456666777888888876433 2344567777764 5776666655 68889975 467776 Q ss_pred ecCCeeeEEeeEeeccceeE--ecccCCcce----eec-cCCCCcee-eccCceEEEeec-------------------- Q lcl|NC_020866. 145 TSEGQYFPDKLIWRDPRHFR--FQKSDLATP----LLL-DDNGAEVP-LPAFRFIFADIP-------------------- 196 (547) Q Consensus 145 ~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~----~~~-~~~~~g~~-l~~~k~i~~~~~-------------------- 196 (547) ..+|.+. +..++|+.+. ++......+ +.. ........ +.+..+..++.. T Consensus 147 d~~~~~~---i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (474) T protein:vir:95 147 NENGEMK---LFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFS 223 (474) T ss_pred cCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCccccccccCccccccccc Confidence 5556543 3344444322 111100110 000 00000000 001111111100 Q ss_pred ----------CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCeeE Q lcl|NC_020866. 
197 ----------AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCAA 266 (547) Q Consensus 197 ----------~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~ 266 (547) .-.+|+.|.|.+..+....---...+.+++..++.|..|+++.+--.+.+.+ .....+.....+ T Consensus 224 ~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~------~~~~~~~~~~~i 297 (474) T protein:vir:95 224 NGNWGRVPFIAFKNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLE------EFMRGLKYYKAI 297 (474) T ss_pred ccCCCccceEeecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccch------hhhhhhhcccee Confidence 0124688999998876655555667888998999999998887632222211 122344455667 Q ss_pred EecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHH---HHHHHHHHHHHHHHHH Q lcl|NC_020866. 267 IIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLV---QEDIETSDAMALAAIL 343 (547) Q Consensus 267 vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v---~~~~~~ad~~~i~~~l 343 (547) .++.+.+++++.... ....++..++.+.+.|...--+..++.++.+| ..+|..-.-. ....+..-.+.+...+ T Consensus 298 ~~~~~~~~~~l~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l 373 (474) T protein:vir:95 298 NVDGDGGVETIQVEV---PVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGS-APSGIALKFLYGNLDLKANKLKNKATVAI 373 (474) T ss_pred eccCCCceeEEeecC---CHHHHHHHHHHHHHHHHHHhCCcccccccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 789999999986442 33468888999988887763333344443322 2223221111 2222334445666666 Q ss_pred HHHHHHHHHHhcCC-CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhC-CCCCCCcchhccccccccc Q lcl|NC_020866. 344 NRDLIRPWIQLEYG-PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFG-FSQPKPGAKLLRPLENPAT 421 (547) Q Consensus 344 n~~li~~l~~~Nfg-~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~G-lp~p~~~e~~~~~~~~~~~ 421 (547) +++++.++.+.-. .+.....+.|....+.+..+.++. ++++|+ ++.+.+.+.++ ++.|+..-+-+........ 
T Consensus 374 -~~~~~li~~~~g~~~d~~~i~v~f~~~~p~d~~e~a~~---~~~~g~-iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~ 448 (474) T protein:vir:95 374 -QELIGFIIDFNNLKMDVKDIEISFNFNRMMNDAEQSQI---IAQSQY-LSRETLVKSSPLVDDYKAELERIEQEQMEYN 448 (474) T ss_pred -HHHHHHHHHHhCCCcccceeeEEeccCCCcCHHHHHHH---HHhcCC-CchHHHHHhCCCCCCHHHHHHHHHHHHHHHH Confidence 3466666666421 222335677888788887666665 455687 88888888886 4433211111111110000 Q ss_pred ccccccccccccCCcccccccccccc Q lcl|NC_020866. 422 QAAPPGMDPGAGDLKTAERKLKPLSG 447 (547) Q Consensus 422 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (547) .........+.........+....+. T Consensus 449 ~~~~~~~~~~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 449 KQLPNLDDGGADGAQQQERSNDKESE 474 (474) T ss_pred hcccccccccCCCCcCCCCCccCCCC Confidence 00000000000000000000000000 No 147 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=98.84 E-value=1.3e-08 Score=63.82 Aligned_cols=325 Identities=15% Similarity=0.036 Sum_probs=171.1 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCH----HHHHHHHHHHHhhhH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDP----VRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~----~~~~~L~~~m~~~D~ 76 (547) |=++.+ ++... .+++.......+.-+-|... +.-++..+.-. ..|+. .....|.+.. +..+ T Consensus 1 ~~~~~~------~~~~~------~~~~~~~~~~~f~~~~~~~~-~~~~y~~~~~~-~~~~~~epp~~~~~la~l~-~~~~ 65 (345) T protein:vir:37 1 MKTNVK------TDNKK------GIVIAPINDRTFSLNEISAS-PALDYVGIGFD-ENYNCYLPPVNRHALAKLP-HQNA 65 (345) T ss_pred CCCCcc------ccchh------hcccCcceeEEeecCCcccc-cchhhhhhhhc-CCccccCCCCCHHHHHHHh-hccc Confidence 332111 11000 00000000000000001100 11122333210 11110 1111222222 3578 Q ss_pred HHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCeeeEEeeE Q lcl|NC_020866. 
77 HYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQYFPDKLI 156 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~~~~~l~ 156 (547) |-.++|..++.-+.+. +.|+..-. .-.|..++ ++.+.+|.+++|++....| .+..|. T Consensus 66 ~h~~~i~~k~n~l~~~---~~Pn~~lt--------------~~~f~~~~---~d~ll~Gnay~~~~rn~~G---~~~~L~ 122 (345) T protein:vir:37 66 QHGGILHSRANMVSSL---YEGGKALS--------------RMDMRALC---LNLIQFGDVGLLKVRNGFG---QVVRLV 122 (345) T ss_pred ccccceeeechHHHhh---ccCCCCCC--------------HHHHHHHH---HHHHhcCCeEEEEEEcCCC---cEEEEE Confidence 8888888766666542 34543211 11133344 3555789999999875433 467899 Q ss_pred eeccceeEecccCCcceeecc----CCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020866. 157 WRDPRHFRFQKSDLATPLLLD----DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTY 232 (547) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~----~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~ 232 (547) ++++.+.+...++...+.... ..+....+++...|+.+.....+..||.+.+..+......-....++-..|.+.- T Consensus 123 pl~~~~vr~~~d~~~~~~~~~~~~~~~g~~~~~~~~dVihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG 202 (345) T protein:vir:37 123 PLSSLYLRVRKDGGYSYLMKKSLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNG 202 (345) T ss_pred EEcCceeEEEEeCCeeEEEEEeEecCCceEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 999988877666554443322 2344467888887766655556677999999999888888777777778888887 Q ss_pred CCCeEEEeC-CCCCCHHHHHHHHHHHHHHhcC---ee-E-EecCC--ceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 233 GQPLRIGKW-QPGASEQDKDTLFRAVANIAGD---CA-A-IIPET--MSIDFVESKSIGSSVDLYEKRINHLDQQISKAV 304 (547) Q Consensus 233 G~P~~vgk~-~~~~~~~~~~~l~~al~~~~~~---~~-~-vip~~--~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i 304 (547) +.|--|-++ ++..++++++++.+++++.... .. + ..|.| ..++++..+.. ..-..|.+.-++...+|+.+. 
T Consensus 203 ~~p~~Il~~~d~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~p~g~~~G~~~~pls~~-~~d~qf~e~k~~~~~dIa~a~ 281 (345) T protein:vir:37 203 AHMGFILYSTDPDLTEEMEEEIARKISESKGVGNFRSMFVNIANGHPDGLKVIPIGDT-GTKDEFANIKNISAQDVLTAH 281 (345) T ss_pred CCcceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCcccceEEEEccCC-hhHHHHHHHHHHhHHHHHHHh Confidence 888666654 4667899999999998874321 22 2 33432 23455444332 233357777778888888874 Q ss_pred hc-chhcc--CccccccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHH Q lcl|NC_020866. 305 LG-QTTTT--DAVAGGHAVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTK 377 (547) Q Consensus 305 lG-qtlts--~~~ggs~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~ 377 (547) -- ..|.+ +..+++++-.+....+ ...-+.-.++.|++.+|+. . -++ .. ..+.|+.. ++.+ T Consensus 282 ~VPp~llGi~~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~~-----~--~~~-~~--~~i~F~~~---~L~~ 345 (345) T protein:vir:37 282 RFPAGLSGIIPTNTGGLGDPLKYREVYHYDEVMPLQEIIAETINQD-----P--EIK-NL--LKIKFREQ---NFAK 345 (345) T ss_pred CCCHHHhCccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHhhhh-----c--cCC-Cc--ceEEecch---hhcC Confidence 21 12322 2223455544444333 3455677777888888752 1 122 11 24555432 2322 No 148 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=98.84 E-value=3e-08 Score=61.81 Aligned_cols=413 Identities=10% Similarity=-0.003 Sum_probs=187.1 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhh--hhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHH-H------HHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAA--PTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDL-A------EVI 71 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~--~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L-~------~~m 71 (547) |-+. |-=+..+|+.......-.-. -+...+.+.+..|..+ ..++....+ .-.|........ + .+. 
T Consensus 1 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~---~~~~~~~~~-YY~g~~~i~~~~~~~~~~~~~~~ 74 (474) T protein:vir:97 1 MFNI--IRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQ---LDKITVGQR-YYDKDNDIVKQMKKVDVHGNIDY 74 (474) T ss_pred Cccc--ccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHH---HHHHHHHHH-Hhccccchhcccchhcccccccc Confidence 5531 11234555532222111111 1111222222222211 112211111 111110000000 0 000 Q ss_pred Hh-----hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEee Q lcl|NC_020866. 72 EE-----RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNT 145 (547) Q Consensus 72 ~~-----~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~ 145 (547) .. ......-.+.+....+.+.+..+...+ ....++++.++++ +|...+..+. ++.-||.+ .+++|.. T Consensus 75 ~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d-----~~~~~~l~~~~~n-~~~~~~~e~~~~~~~~G~~-~~~~~~d 147 (474) T protein:vir:97 75 DKPDWRITTNFHQNLVDQKVSYVASKPVTYSCED-----ENVLKVIHDVLDT-RWDNKLIDILTATSNKGID-WLQVYIN 147 (474) T ss_pred ccCcceeecchHHHHHHHHHhhhhcCCceeccCc-----HHHHHHHHHHHhc-cHHHHHHHHHHHHhhcCce-EEEEEec Confidence 00 123444556666777778887776432 2344567777764 6777777665 68889974 5677765 Q ss_pred cCCeeeEEeeEeeccceeEecccC--Ccce-------------------------eeccCCCCce-------------ee Q lcl|NC_020866. 146 SEGQYFPDKLIWRDPRHFRFQKSD--LATP-------------------------LLLDDNGAEV-------------PL 185 (547) Q Consensus 146 ~~g~~~~~~l~~~~~~~~~~~~~~--~~~~-------------------------~~~~~~~~g~-------------~l 185 (547) .+|... +..++|+.+....++ ...+ +...+++... .. T Consensus 148 ~~~~~~---i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~ 224 (474) T protein:vir:97 148 ENGEMK---LFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSN 224 (474) T ss_pred CCCeeE---EEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccc Confidence 566543 444455433211110 0110 0000000000 00 Q ss_pred ccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCee Q lcl|NC_020866. 
186 PAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCA 265 (547) Q Consensus 186 ~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~ 265 (547) .++..+-+. .-.++++|.|.+..+....---...+.+++..++.|+.|+++.+--.+.+.+ ....++..... T Consensus 225 ~~~g~vPvv--~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~------~~~~~~~~~~~ 296 (474) T protein:vir:97 225 GNWGRVPFI--AFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLE------EFMRGLKYYKA 296 (474) T ss_pred cCCCccceE--EecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccch------hhhhhhhccce Confidence 001111000 1134688999999866655555667888999999999999987742222111 12334555667 Q ss_pred EEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHH-H--HHHHHHHHHHHHHHHH Q lcl|NC_020866. 266 AIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHR-L--VQEDIETSDAMALAAI 342 (547) Q Consensus 266 ~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~-~--v~~~~~~ad~~~i~~~ 342 (547) +.++.+.+++++.... ....++..++.+.+.|...--+..++.++.+| ..+|..-. . -....+..-.+.+... T Consensus 297 i~~~~~~~~~~l~~~~---~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~-n~Sg~Al~~~~~~l~~k~~~k~~~~~~~ 372 (474) T protein:vir:97 297 INVDGDGGVETIQVEV---PVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGS-APSGIALKFLYGNLDLKANKLKNKATVA 372 (474) T ss_pred eeccCCCceeEEeecC---CHHHHHHHHHHHHHHHHHHhCccccCcccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7889999999987543 23457888888888776663222233343222 22332221 1 1122233334556666 Q ss_pred HHHHHHHHHHHhcCC-CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcchhcccccccc Q lcl|NC_020866. 343 LNRDLIRPWIQLEYG-PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAKLLRPLENPA 420 (547) Q Consensus 343 ln~~li~~l~~~Nfg-~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~~~~~~~~~~ 420 (547) +. ++++.++.+.-. .+.....+.|....+.+..+.++.+ ++.|+ ++.+.+.+.++. +.|+..-+-+....... 
T Consensus 373 l~-~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~---~~~g~-iS~et~l~~l~~v~D~~~E~eri~~E~~~~ 447 (474) T protein:vir:97 373 IQ-ELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQII---AQSQY-LSRETLVKSSPLVDDYKAELERIEQEQMEY 447 (474) T ss_pred HH-HHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHH---HHcCC-CCHHHHHHhCCCCCCHHHHHHHHHHHHHHH Confidence 64 466666665421 1222246677777777776666654 45676 899988888864 43322111111111110 Q ss_pred cccccccccccccCCc-cccccccccc Q lcl|NC_020866. 421 TQAAPPGMDPGAGDLK-TAERKLKPLS 446 (547) Q Consensus 421 ~~~~~~~~~~~~~~~~-~~~~~~~~~~ 446 (547) ....+.....+..... ...+...... T Consensus 448 ~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 448 NKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred HhhccccCCCCCCCcccCCCCcccccC Confidence 0111000000000000 0000000000 No 149 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=98.84 E-value=3e-08 Score=61.81 Aligned_cols=413 Identities=10% Similarity=-0.003 Sum_probs=187.1 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhh--hhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHH-H------HHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAA--PTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDL-A------EVI 71 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~--~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L-~------~~m 71 (547) |-+. |-=+..+|+.......-.-. -+...+.+.+..|..+ ..++....+ .-.|........ + .+. T Consensus 1 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~---~~~~~~~~~-YY~g~~~i~~~~~~~~~~~~~~~ 74 (474) T protein:vir:94 1 MFNI--IRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQ---LDKITVGQR-YYDKDNDIVKQMKKVDVHGNIDY 74 (474) T ss_pred Cccc--ccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHH---HHHHHHHHH-Hhccccchhcccchhcccccccc Confidence 5531 11234555532222111111 1111222222222211 112211111 111110000000 0 000 Q ss_pred Hh-----hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEee Q lcl|NC_020866. 
72 EE-----RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNT 145 (547) Q Consensus 72 ~~-----~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~ 145 (547) .. ......-.+.+....+.+.+..+...+ ....++++.++++ +|...+..+. ++.-||.+ .+++|.. T Consensus 75 ~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d-----~~~~~~l~~~~~n-~~~~~~~e~~~~~~~~G~~-~~~~~~d 147 (474) T protein:vir:94 75 DKPDWRITTNFHQNLVDQKVSYVASKPVTYSCED-----ENVLKVIHDVLDT-RWDNKLIDILTATSNKGID-WLQVYIN 147 (474) T ss_pred ccCcceeecchHHHHHHHHHhhhhcCCceeccCc-----HHHHHHHHHHHhc-cHHHHHHHHHHHHhhcCce-EEEEEec Confidence 00 123444556666777778887776432 2344567777764 6777777665 68889974 5677765 Q ss_pred cCCeeeEEeeEeeccceeEecccC--Ccce-------------------------eeccCCCCce-------------ee Q lcl|NC_020866. 146 SEGQYFPDKLIWRDPRHFRFQKSD--LATP-------------------------LLLDDNGAEV-------------PL 185 (547) Q Consensus 146 ~~g~~~~~~l~~~~~~~~~~~~~~--~~~~-------------------------~~~~~~~~g~-------------~l 185 (547) .+|... +..++|+.+....++ ...+ +...+++... .. T Consensus 148 ~~~~~~---i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~ 224 (474) T protein:vir:94 148 ENGEMK---LFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSN 224 (474) T ss_pred CCCeeE---EEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccc Confidence 566543 444455433211110 0110 0000000000 00 Q ss_pred ccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCee Q lcl|NC_020866. 186 PAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCA 265 (547) Q Consensus 186 ~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~ 265 (547) .++..+-+. .-.++++|.|.+..+....---...+.+++..++.|+.|+++.+--.+.+.+ ....++..... 
T Consensus 225 ~~~g~vPvv--~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~------~~~~~~~~~~~ 296 (474) T protein:vir:94 225 GNWGRVPFI--AFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLE------EFMRGLKYYKA 296 (474) T ss_pred cCCCccceE--EecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccch------hhhhhhhccce Confidence 001111000 1134688999999866655555667888999999999999987742222111 12334555667 Q ss_pred EEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHH-H--HHHHHHHHHHHHHHHH Q lcl|NC_020866. 266 AIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHR-L--VQEDIETSDAMALAAI 342 (547) Q Consensus 266 ~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~-~--v~~~~~~ad~~~i~~~ 342 (547) +.++.+.+++++.... ....++..++.+.+.|...--+..++.++.+| ..+|..-. . -....+..-.+.+... T Consensus 297 i~~~~~~~~~~l~~~~---~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~-n~Sg~Al~~~~~~l~~k~~~k~~~~~~~ 372 (474) T protein:vir:94 297 INVDGDGGVETIQVEV---PVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGS-APSGIALKFLYGNLDLKANKLKNKATVA 372 (474) T ss_pred eeccCCCceeEEeecC---CHHHHHHHHHHHHHHHHHHhCccccCcccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7889999999987543 23457888888888776663222233343222 22332221 1 1122233334556666 Q ss_pred HHHHHHHHHHHhcCC-CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcchhcccccccc Q lcl|NC_020866. 343 LNRDLIRPWIQLEYG-PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAKLLRPLENPA 420 (547) Q Consensus 343 ln~~li~~l~~~Nfg-~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~~~~~~~~~~ 420 (547) +. ++++.++.+.-. .+.....+.|....+.+..+.++.+ ++.|+ ++.+.+.+.++. +.|+..-+-+....... 
T Consensus 373 l~-~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~---~~~g~-iS~et~l~~l~~v~D~~~E~eri~~E~~~~ 447 (474) T protein:vir:94 373 IQ-ELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQII---AQSQY-LSRETLVKSSPLVDDYKAELERIEQEQMEY 447 (474) T ss_pred HH-HHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHH---HHcCC-CCHHHHHHhCCCCCCHHHHHHHHHHHHHHH Confidence 64 466666665421 1222246677777777776666654 45676 899988888864 43322111111111110 Q ss_pred cccccccccccccCCc-cccccccccc Q lcl|NC_020866. 421 TQAAPPGMDPGAGDLK-TAERKLKPLS 446 (547) Q Consensus 421 ~~~~~~~~~~~~~~~~-~~~~~~~~~~ 446 (547) ....+.....+..... ...+...... T Consensus 448 ~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 448 NKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred HhhccccCCCCCCCcccCCCCcccccC Confidence 0111000000000000 0000000000 No 150 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=98.80 E-value=4.2e-08 Score=61.00 Aligned_cols=429 Identities=11% Similarity=0.024 Sum_probs=195.9 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCC---CHHHHHHHHHhhcCCCHHHHHHHHHHHH----- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGL---NPLRLAQILRAADHGDPVRYLDLAEVIE----- 72 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l---~p~~~~~ilr~a~~g~~~~~~~L~~~m~----- 72 (547) |-|+-. ++.+|+-..+...-...|. ..+.......+ ++..+...+.....-...++..|.+.+. T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~ 72 (501) T protein:vir:27 1 MEQTLF-TDSTGQDLVLNLRFHRESR-------IRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHD 72 (501) T ss_pred CCceeE-EeccchhhhhhcccChhHH-------HhhccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc Confidence 887766 5666765443322222221 00100001111 1111222222111111112211111110 Q ss_pred ----------------hhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-Hhhhhc Q lcl|NC_020866. 
73 ----------------ERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKG 135 (547) Q Consensus 73 ----------------~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G 135 (547) -...+..-.+++....+.+.+.++...++. ..+...+++.+++..-+|+.....+. ++..|| T Consensus 73 i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~-~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G 151 (501) T protein:vir:27 73 VLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDND-NNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTG 151 (501) T ss_pred ccccCccCccccccceeccchHHHHHHHHhhhhcccCeeEecCCcc-chHHHHHHHHHHHHhcChhHHHHHHHHHHhhCC Confidence 012444456666677777888777654432 33445677888888778999998887 688899 Q ss_pred CeeeeeEEeecCCeeeEEeeEeeccceeEe--cccCCcce------eec---------------------cCCCCcee-- Q lcl|NC_020866. 136 YSFTEIVWNTSEGQYFPDKLIWRDPRHFRF--QKSDLATP------LLL---------------------DDNGAEVP-- 184 (547) Q Consensus 136 ~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~--~~~~~~~~------~~~---------------------~~~~~g~~-- 184 (547) .+ ++++|...+|... +..++|+.+.. +......+ +.. ...+.... T Consensus 152 ~a-~~~vy~ded~~~~---i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~~~~~~~~~~~~ 227 (501) T protein:vir:27 152 RA-YEVIYRNEYDETR---IKRLNPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYTLDASDDFNEIS 227 (501) T ss_pred eE-EEEEEeCCCCceE---EEEEccceeEEEecCCCCCceEEEEEEEEeeecCCcEEEEEEEeCCeEEEEEeCCceeecc Confidence 86 5677765556543 44455544321 11100100 000 00000000 Q ss_pred -ec-cCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCC-CHHHHHHHHHH-HHHH Q lcl|NC_020866. 185 -LP-AFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGA-SEQDKDTLFRA-VANI 260 (547) Q Consensus 185 -l~-~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~-~~~~~~~l~~a-l~~~ 260 (547) .+ ++..|-..+ -.+++.|.|.+..+....---...+..++..++.|..|+++.+-.... ..+....+.+. 
+..+ T Consensus 228 ~~~~~~g~vPvv~--~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~ 305 (501) T protein:vir:27 228 VTTHAFGTVPITE--FLNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQL 305 (501) T ss_pred ccccCCCcccEEE--ecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhcCceee Confidence 00 111111111 134678999999876666666667888888899999999987632221 12222222110 0000 Q ss_pred -hcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHH-H--HHHHHHHHHH Q lcl|NC_020866. 261 -AGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHR-L--VQEDIETSDA 336 (547) Q Consensus 261 -~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~-~--v~~~~~~ad~ 336 (547) ..+++...+.+.+++++..... ...++.+++.+.+.|...--...++.++.+| ..+|..-. . -....+..-. T Consensus 306 ~~~~~~~~~~~~~~~~~l~~~~~---~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~-n~Sg~Al~~~~~~l~~ka~~~~ 381 (501) T protein:vir:27 306 KPPKSADGKEGTVKAEYLTKSYD---VSGAEAYKTRLNRDIHIFTNIPDMSDTNFSG-NTSGEALKYKLFGLDQDRVDTQ 381 (501) T ss_pred cccccccCCCCCcceeeeeccCC---HHHHHHHHHHHHHHHHHHhCCcccCcccccc-CchHHHHHHHHHHHHHHHHHHH Confidence 0111122334567888754432 2357888888888887763323334343222 22232221 1 1223344445 Q ss_pred HHHHHHHHHHHHHHHHHh-cC---C--CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCc Q lcl|NC_020866. 337 MALAAILNRDLIRPWIQL-EY---G--PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPG 409 (547) Q Consensus 337 ~~i~~~ln~~li~~l~~~-Nf---g--~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~ 409 (547) +.+...+.+ +++-++.+ +. + .+.....+.|....+.+..+.++.+.++. |+ ++.+.+.+.++. +.|+.. 
T Consensus 382 ~~~~~~l~~-~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl~--g~-iS~et~l~~l~~v~D~~~E 457 (501) T protein:vir:27 382 SQFTQGLKR-RYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLG--GQ-VSQETALSLSGLVESPNEE 457 (501) T ss_pred HHHHHHHHH-HHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CcHHHHHHhCCCCCCHHHH Confidence 666666643 55555543 11 1 11233578888889999999999999885 65 888888888754 333222 Q ss_pred chhccccccc-ccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchhHHH Q lcl|NC_020866. 410 AKLLRPLENP-ATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIA 475 (547) Q Consensus 410 e~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 475 (547) -+-+...... ......+......+ ...............+. .+ T Consensus 458 ~eri~~E~~e~~~~~~~~~~~~~~~-~~~d~~~~~~~d~~e~~----------------------~~ 501 (501) T protein:vir:27 458 LDKINKEVSEIDFKGYSNDFNEHVG-KYTDEVKETHTDDFERA----------------------YE 501 (501) T ss_pred HHHHHHHHHhhhHhhhcCccccccc-cccCCCCCCcccccccc----------------------CC Confidence 1111111100 00000000000000 00000000000000000 00 No 151 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=98.80 E-value=4.2e-08 Score=61.00 Aligned_cols=402 Identities=12% Similarity=0.072 Sum_probs=185.1 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHH-------- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIE-------- 72 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~-------- 72 (547) |+. + .|+-....+- ..+..-....+++..+...++....-...++..|.+.+. 
T Consensus 1 ~~~----~--~~~~~~~~~~-------------~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~ 61 (470) T protein:vir:99 1 MKD----I--NYGRDKVTGN-------------SSFIFPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTA 61 (470) T ss_pred Ccc----c--cCCcccccCC-------------ceEEeCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccC Confidence 652 2 1221110000 000000001222222222222211111111111111000 Q ss_pred ----------hhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeee Q lcl|NC_020866. 73 ----------ERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEI 141 (547) Q Consensus 73 ----------~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei 141 (547) -......-.++.....+.+.+.++...+++.. .+.+.+++..-+|+..+.++. ++..||.+ +++ T Consensus 62 ~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~~d~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~-~~~ 136 (470) T protein:vir:99 62 PEKETGADNRIVVNSAKYVVDVYNGYFCGIEPKLALLNDSSK----IDEIARWNRQENFFDTINEISKQCDIFGRS-IAS 136 (470) T ss_pred cccccCCcceeecchHHHHHHHHhhhhccCCeeEeeCCchhH----HHHHHHHHHhcCHhHHHHHHHHHHHhcCee-EEE Confidence 01245555666777778888877766544322 244666676777888877777 68889975 667 Q ss_pred EEeecCCeeeEEeeEeeccceeE--ecccCCccee----ec--cC-----------------------CCCc-----eee Q lcl|NC_020866. 142 VWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATPL----LL--DD-----------------------NGAE-----VPL 185 (547) Q Consensus 142 ~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~~----~~--~~-----------------------~~~g-----~~l 185 (547) +|...+|... +..++|+.+. ++......++ .. .+ .... ... T Consensus 137 v~~d~dg~~~---i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (470) T protein:vir:99 137 IYQGEDARPH---LMYSSPNHAFIIYDDTVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAI 213 (470) T ss_pred EEeCCCCeEE---EEEEccceeEEEEcCCCCcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccc Confidence 7765566543 4445555432 1111110000 00 00 0000 000 Q ss_pred ccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCee Q lcl|NC_020866. 
186 PAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCA 265 (547) Q Consensus 186 ~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~ 265 (547) .+...|-+.+ -.++++|.|.+..+....--=...+.+++..++.|..|+++.+--....++. -+.+..+..... T Consensus 214 ~~~g~vPvv~--~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~----g~~~~~~~~~~~ 287 (470) T protein:vir:99 214 NPYGLVPAVE--FFENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDE----GNPKFDFKNNRV 287 (470) T ss_pred cCCCccceEe--ecCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccc----cchhhhhhhcce Confidence 1111111111 1346789999988665544445577888888899999998876321111111 123344555555 Q ss_pred EEec-----CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHH---HHHHHHHHHHH Q lcl|NC_020866. 266 AIIP-----ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRL---VQEDIETSDAM 337 (547) Q Consensus 266 ~vip-----~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~---v~~~~~~ad~~ 337 (547) +.+| .+.+++++.... ....++..++.+.+.|...--...++.++.+| ..+|..-.. -....+..-.+ T Consensus 288 ~~~~~~~~~~~~~~~~l~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~ 363 (470) T protein:vir:99 288 LYVSQLDPDTNPQIGFIAKPD---ADQMQENLIQHLTDFIFMMAMVPNIQDKNFAG-NSSGVALQYKLFAMKNKADSKER 363 (470) T ss_pred eeecCCCCCCCCcceEEeecC---ChHHHHHHHHHHHHHHHHHhCCcccccccccc-CchHHHHHHHHHHHHHHHHHHHH Confidence 5554 355678876432 23357888888888887663332333343223 223332221 12223344455 Q ss_pred HHHHHHHHHHHHHHHHh-c--CCC--cccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchh Q lcl|NC_020866. 338 ALAAILNRDLIRPWIQL-E--YGP--QKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKL 412 (547) Q Consensus 338 ~i~~~ln~~li~~l~~~-N--fg~--~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~ 412 (547) .+...+. ++++.++.+ + ... 
......+.|....+.+..+.++.+.+++ |+ ++.+.+.+.++.-.|+ +++ T Consensus 364 ~~~~~l~-~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~--gi-is~et~l~~l~~vd~~--~E~ 437 (470) T protein:vir:99 364 KFDKSLM-QLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNAE--GI-VSKKTQLGMIPDIEPD--AEM 437 (470) T ss_pred HHHHHHH-HHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CCHHHHHHhCCCCCHH--HHH Confidence 6666663 355655443 1 111 1234588888888999999999999986 66 8888888888764332 222 Q ss_pred cccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchh Q lcl|NC_020866. 413 LRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGA 472 (547) Q Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (547) ............ ...............+... ++ T Consensus 438 eri~~E~~~~~~--~~~~~~~~~d~~~~d~~~e-------------------------e~ 470 (470) T protein:vir:99 438 KQIAKEKADAIK--QTQQLSMPIDILKRDNNAE-------------------------EE 470 (470) T ss_pred HHHHHHHHHHHH--HHHhhcCCCCcCCCCCCcc-------------------------CC Confidence 111111000000 0000000000000000000 00 No 152 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=98.79 E-value=4.7e-08 Score=60.78 Aligned_cols=427 Identities=11% Similarity=-0.013 Sum_probs=193.5 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHh------- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEE------- 73 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~------- 73 (547) |.+.+.+-.-.+.--+....=...|. 
.....+..-.....++.++...++........++..|++.+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n----~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~ 76 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYLFNDEAN----VVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVE 76 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhC----CcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccc Confidence 77666553222211111000000010 0000000001123355555566554433333333333322110 Q ss_pred ----h----------hHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCee Q lcl|NC_020866. 74 ----R----------DPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSF 138 (547) Q Consensus 74 ----~----------D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv 138 (547) . ..+..-.+.+....+.+.+..+.+.++ +..+.+..+++.-+++.....+. ++..||. + T Consensus 77 ~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~-----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~-a 150 (511) T protein:vir:96 77 LTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK-----DVLEAIEAFNDLNDVESHNRSLGLDLSIYGK-A 150 (511) T ss_pred cCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCch-----HHHHHHHHHHhhcChhHHHHHHHHHHHhcCe-e Confidence 0 133345666667777788877765432 23467888887777888888777 5888997 4 Q ss_pred eeeEEeecCCeeeEEeeEeeccceeE--ecccCCccee------ec---cCCC-C---ce-eeccCceEEEeec------ Q lcl|NC_020866. 139 TEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATPL------LL---DDNG-A---EV-PLPAFRFIFADIP------ 196 (547) Q Consensus 139 ~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~~------~~---~~~~-~---g~-~l~~~k~i~~~~~------ 196 (547) .+++|...+|... +..++|+... ++......++ .. ++.. . -. -+.+.+...+... 
T Consensus 151 ~~~vy~d~dg~~~---i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~ 227 (511) T protein:vir:96 151 YELMIRNQDDETR---LYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLK 227 (511) T ss_pred EEEEEeCCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccc Confidence 6677765556543 3444444332 1211111100 00 0000 0 00 0111111111110 Q ss_pred -------------------CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHH Q lcl|NC_020866. 197 -------------------AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAV 257 (547) Q Consensus 197 -------------------~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al 257 (547) .-.++++|.|.+..+....=--...+..++..++.|+.|+++.+-....+.++.......- T Consensus 228 ~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~ 307 (511) T protein:vir:96 228 LTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEAN 307 (511) T ss_pred ccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhccccccc Confidence 0123567899999876655555567888888899999998876643333333322211110 Q ss_pred HHHhc------CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHH-HHH--H Q lcl|NC_020866. 258 ANIAG------DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEH-RLV--Q 328 (547) Q Consensus 258 ~~~~~------~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh-~~v--~ 328 (547) .-... ..+.-...+.+++++.... ....++.+++.+.+.|...--...++.++.+| ..+|..- ... . 
T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~---~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~-n~Sg~Al~~~~~~l 383 (511) T protein:vir:96 308 VLFLEPTVYVDAEGRETEGSVDGGYIYKQY---DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQSGEAMKYKLFGL 383 (511) T ss_pred ceeccccceeccccccCCCCcceeEEeecC---CHHHHHHHHHHHHHHHHHHhCCcccccccccc-ccHHHHHHHHHHHH Confidence 00000 0111123456788876443 23357888888888887663333333333222 2222222 111 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh---cCC----CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHh Q lcl|NC_020866. 329 EDIETSDAMALAAILNRDLIRPWIQL---EYG----PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRF 401 (547) Q Consensus 329 ~~~~~ad~~~i~~~ln~~li~~l~~~---Nfg----~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~ 401 (547) ...+..-.+.+...+++ +++.++.+ ..+ ....-.++.|....+.+..+.++.+.+++ |+ ++.+.+.+.+ T Consensus 384 ~~ka~~~~~~f~~~l~~-~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~-iS~et~l~~l 459 (511) T protein:vir:96 384 EQRTKTKEGLFTKGLRR-RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-ISQTTLMSLF 459 (511) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CChHHHHHhC Confidence 22233444556666643 55555443 111 11123588898888999999999999985 66 8888888888 Q ss_pred CC-CCCCCcchhccccccccc--c--cccccccccccCCccccccccccccchh Q lcl|NC_020866. 402 GF-SQPKPGAKLLRPLENPAT--Q--AAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 402 Gl-p~p~~~e~~~~~~~~~~~--~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) +. +.|+ +++-........ . .......+...............+.... 
T Consensus 460 ~~v~d~~--~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 460 SFFQDPE--LEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCHH--HHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 64 3222 222111111000 0 0000000100000000011111100000 No 153 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=98.79 E-value=4.7e-08 Score=60.78 Aligned_cols=427 Identities=11% Similarity=-0.013 Sum_probs=193.5 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHh------- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEE------- 73 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~------- 73 (547) |.+.+.+-.-.+.--+....=...|. .....+..-.....++.++...++........++..|++.+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n----~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~ 76 (511) T protein:vir:78 1 MLKVNEFETDTDLRGNINYLFNDEAN----VVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVE 76 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhC----CcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccc Confidence 77666553222211111000000010 0000000001123355555566554433333333333322110 Q ss_pred ----h----------hHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCee Q lcl|NC_020866. 74 ----R----------DPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSF 138 (547) Q Consensus 74 ----~----------D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv 138 (547) . ..+..-.+.+....+.+.+..+.+.++ +..+.+..+++.-+++.....+. ++..||. 
+ T Consensus 77 ~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~-----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~-a 150 (511) T protein:vir:78 77 LTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK-----DVLEAIEAFNDLNDVESHNRSLGLDLSIYGK-A 150 (511) T ss_pred cCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCch-----HHHHHHHHHHhhcChhHHHHHHHHHHHhcCe-e Confidence 0 133345666667777788877765432 23467888887777888888777 5888997 4 Q ss_pred eeeEEeecCCeeeEEeeEeeccceeE--ecccCCccee------ec---cCCC-C---ce-eeccCceEEEeec------ Q lcl|NC_020866. 139 TEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATPL------LL---DDNG-A---EV-PLPAFRFIFADIP------ 196 (547) Q Consensus 139 ~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~~------~~---~~~~-~---g~-~l~~~k~i~~~~~------ 196 (547) .+++|...+|... +..++|+... ++......++ .. ++.. . -. -+.+.+...+... T Consensus 151 ~~~vy~d~dg~~~---i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~ 227 (511) T protein:vir:78 151 YELMIRNQDDETR---LYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLK 227 (511) T ss_pred EEEEEeCCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccc Confidence 6677765556543 3444444332 1211111100 00 0000 0 00 0111111111110 Q ss_pred -------------------CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHH Q lcl|NC_020866. 
197 -------------------AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAV 257 (547) Q Consensus 197 -------------------~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al 257 (547) .-.++++|.|.+..+....=--...+..++..++.|+.|+++.+-....+.++.......- T Consensus 228 ~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~ 307 (511) T protein:vir:78 228 LTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEAN 307 (511) T ss_pred ccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhccccccc Confidence 0123567899999876655555567888888899999998876643333333322211110 Q ss_pred HHHhc------CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHH-HHH--H Q lcl|NC_020866. 258 ANIAG------DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEH-RLV--Q 328 (547) Q Consensus 258 ~~~~~------~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh-~~v--~ 328 (547) .-... ..+.-...+.+++++.... ....++.+++.+.+.|...--...++.++.+| ..+|..- ... . T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~---~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~-n~Sg~Al~~~~~~l 383 (511) T protein:vir:78 308 VLFLEPTVYVDAEGRETEGSVDGGYIYKQY---DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQSGEAMKYKLFGL 383 (511) T ss_pred ceeccccceeccccccCCCCcceeEEeecC---CHHHHHHHHHHHHHHHHHHhCCcccccccccc-ccHHHHHHHHHHHH Confidence 00000 0111123456788876443 23357888888888887663333333333222 2222222 111 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh---cCC----CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHh Q lcl|NC_020866. 
329 EDIETSDAMALAAILNRDLIRPWIQL---EYG----PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRF 401 (547) Q Consensus 329 ~~~~~ad~~~i~~~ln~~li~~l~~~---Nfg----~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~ 401 (547) ...+..-.+.+...+++ +++.++.+ ..+ ....-.++.|....+.+..+.++.+.+++ |+ ++.+.+.+.+ T Consensus 384 ~~ka~~~~~~f~~~l~~-~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~-iS~et~l~~l 459 (511) T protein:vir:78 384 EQRTKTKEGLFTKGLRR-RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-ISQTTLMSLF 459 (511) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CChHHHHHhC Confidence 22233444556666643 55555443 111 11123588898888999999999999985 66 8888888888 Q ss_pred CC-CCCCCcchhccccccccc--c--cccccccccccCCccccccccccccchh Q lcl|NC_020866. 402 GF-SQPKPGAKLLRPLENPAT--Q--AAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 402 Gl-p~p~~~e~~~~~~~~~~~--~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) +. +.|+ +++-........ . .......+...............+.... T Consensus 460 ~~v~d~~--~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 460 SFFQDPE--LEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCHH--HHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 64 3222 222111111000 0 0000000100000000011111100000 No 154 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=98.79 E-value=4.8e-08 Score=60.72 Aligned_cols=404 Identities=11% Similarity=0.024 Sum_probs=185.7 Q ss_pred CC----CCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHH---- Q lcl|NC_020866. 1 MA----QKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIE---- 72 (547) Q Consensus 1 ma----~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~---- 72 (547) .| +..-|+=+.|+- -++... +.+..---..++...+...++... ....++..+++-+. 
T Consensus 10 ~~~~~~~~~~~~~~~~~~-----~~~~~~--------~~~~~~~~~~~~~~~i~~~i~~~~-~~~~r~~~l~~YY~g~~~ 75 (492) T protein:vir:94 10 VAQALIKGGNILYPSQPT-----QTEIFD--------AIVRTNNKPETLEEMIVRYIKQHL-EKLPEISIGQEYYEQRPD 75 (492) T ss_pred HHHHHhcCCceeecCccc-----hhhhhh--------cccccCCchhhHHHHHHHHHHHHH-HHHHHHHHHHHHhccccc Confidence 11 111122222221 001000 000000000011111111111111 01112222111110 Q ss_pred -----------------h-----hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH- Q lcl|NC_020866. 73 -----------------E-----RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL- 129 (547) Q Consensus 73 -----------------~-----~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l- 129 (547) + ..+...-.+++....+.+.+..+...+ .+..+.+..++++ ++++.+.++. T Consensus 76 I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~~~d-----~~~~~~l~~~~~n-~~~~~~~~~~~ 149 (492) T protein:vir:94 76 IVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD-----DEVVKRIDEVLGN-RFDDKLHSVLT 149 (492) T ss_pred cccccccccccccccccccccccccchHHHHHHHHHhhhcccCceeccCc-----hHHHHHHHHHHhc-cHHHHHHHHHH Confidence 0 134455566677777778877775432 2345667777764 5777777666 Q ss_pred HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeE--ecccCCcce------eeccCCCCceeec--------------- Q lcl|NC_020866. 130 DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATP------LLLDDNGAEVPLP--------------- 186 (547) Q Consensus 130 ~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~------~~~~~~~~g~~l~--------------- 186 (547) ++..||.+. +++|...+|... +..++|+... ++......+ ....+.....-+. T Consensus 150 ~a~~~G~a~-~~v~~d~dg~~~---~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~ 225 (492) T protein:vir:94 150 GASNKGIEW-LHPYLDEEGEFK---LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLI 225 (492) T ss_pred HHhhCCeEE-EEEEecCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeee Confidence 688899875 566654455543 4444554321 111110000 0000000000000 Q ss_pred -----------------cCceE-EEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHH Q lcl|NC_020866. 
187 -----------------AFRFI-FADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQ 248 (547) Q Consensus 187 -----------------~~k~i-~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~ 248 (547) ++..| ++. -.+|+.|.|.+..+....---...+..++..++.|..|+++.+--. .+ T Consensus 226 ~~~~~~~~~~~~~~~~~~~g~vPvv~---~~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~---~~ 299 (492) T protein:vir:94 226 PDYSNNLENSKTHFSTGSWGKIPFIP---FKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYD---DQ 299 (492) T ss_pred eccccccccccccccccCCCccceEE---ecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC---cc Confidence 11111 111 1235678999988766555555578888889999999998876322 22 Q ss_pred HHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHH--- Q lcl|NC_020866. 249 DKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHR--- 325 (547) Q Consensus 249 ~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~--- 325 (547) +.... ...+....++.++.+.+++++..... ...++..++.+.+.|.+.--...++.++-+| ..+|..-. T Consensus 300 ~~~~~---~~~~~~~~~~~~~~~~~~~~l~~~~~---~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~-n~Sg~Al~~~~ 372 (492) T protein:vir:94 300 ELPEF---KRLLRYYGAIKVSDNGGVDTIQVEVP---VENSKKYLDELYQKIMLFGQAVDFSSDKFGS-APSGVALEFLY 372 (492) T ss_pred cchhh---HHHHhhccceecCCCCcceeEeccCC---HHHHHHHHHHHHHHHHHHhCCcCCCcccccc-CchHHHHHHHH Confidence 21111 22344456677899999999864432 2357888888888877663323333343222 23333221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC- Q lcl|NC_020866. 326 LVQEDIETSDAMALAAILNRDLIRPWIQLE-YGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF- 403 (547) Q Consensus 326 ~v~~~~~~ad~~~i~~~ln~~li~~l~~~N-fg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl- 403 (547) .-....+..-.+.+...+. ++++.++.+. .........+.|....+.+..+.++.+.+++ |+ ++.+.+.+.++. 
T Consensus 373 ~~l~~k~~~k~~~f~~~l~-~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~~~~~~kl~--gi-iS~et~~~~l~~v 448 (492) T protein:vir:94 373 TNLNLKADKLARKAKVAIQ-ELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSM--GI-VSHETVLENHPFV 448 (492) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCcccceeeEEecCCCCCCHHHHHHHHHHHh--cc-CchHHHHHhCCCC Confidence 1122334445566666664 3666666554 1122233467888888999999999999885 76 899999998875 Q ss_pred CCCCCcchhcccccc--cccccccccccccccCCccccccccccccc Q lcl|NC_020866. 404 SQPKPGAKLLRPLEN--PATQAAPPGMDPGAGDLKTAERKLKPLSGQ 448 (547) Q Consensus 404 p~p~~~e~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (547) +.|+ +++-..... ......+..........+....+..... . T Consensus 449 ~d~~--~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~-e 492 (492) T protein:vir:94 449 EDLQ--AELERIEQEQMEYNKQLPNLDDGGADSAQQQERSNNKES-E 492 (492) T ss_pred CCHH--HHHHHHHHHHHHHHhhccccccccCCCCccccCCccccC-C Confidence 3332 222111110 0000000000000000000000000000 0 No 155 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.78 E-value=5.1e-08 Score=60.55 Aligned_cols=393 Identities=10% Similarity=0.007 Sum_probs=183.7 Q ss_pred hhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEec Q lcl|NC_020866. 19 QLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEP 98 (547) Q Consensus 19 ~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p 98 (547) -|+.-+.. ...-..+....|-+..-.+.+- ...+.....+.. + .-....-.+.+....+.+.+.++.. 
T Consensus 1 ~~~~~~~~-~~~r~~~l~~yy~g~~~~~~~~-~~~~~~~~~~~k----i------~~n~~~~ivd~~~~~l~g~~~~~~~ 68 (440) T protein:vir:95 1 MLAAFLGS-QKQRLAILASYAQGDNFSILSG-HRRLDDEKADYR----V------RHKWGGYISSFATGYVIGNPVSIGV 68 (440) T ss_pred ChhhHHHH-HHHHHHHHHHHhccCCcccccc-cccccccCCcce----e------ecchHHHHHHhhhhheeccCceEee Confidence 11111111 1111111112221111000000 000000000000 1 1345556677777888888888765 Q ss_pred CCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEe--cccCCccee- Q lcl|NC_020866. 99 ASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRF--QKSDLATPL- 174 (547) Q Consensus 99 ~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~--~~~~~~~~~- 174 (547) .++ .+.+..+++.+++..-+++.....+. ++.-||.+. +++|...+|... +..++|+.... +......++ T Consensus 69 ~~~--~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~-~~~~~d~~~~~~---i~~~~p~~~~~~~d~~~~~~~~~ 142 (440) T protein:vir:95 69 MEG--GSADQLSTIKDIEWQNDINALNSDLAFDASVYGRAY-EYHFRDKDKVDR---VVLISPLEMFVIRDLTVEQNIIA 142 (440) T ss_pred CCC--ccHHHHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEE-EEEEecCCCceE---EEEEcccceEEEEcCCCCCceEE Confidence 443 23445567788887778888877776 688899975 455654455432 44445543322 111111100 Q ss_pred -------------------------eccCCCCce----ee-ccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 175 -------------------------LLDDNGAEV----PL-PAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARD 224 (547) Q Consensus 175 -------------------------~~~~~~~g~----~l-~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~ 224 (547) .......+. .. 
.++..|-+.+ -.++.+|.|.+..+....--=...+.+ T Consensus 143 ~i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~--~~n~~~g~sd~e~v~~lida~~~~~s~ 220 (440) T protein:vir:95 143 AVHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVE--WWNNRFRMGDYESEISLIDAYDAGQSD 220 (440) T ss_pred EEEEEEecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEE--eeCCCCCCCchhhhHHHHHHHHHHHHH Confidence 000000000 00 1112221111 123567899999877766556667888 Q ss_pred HHHHHHHcCCCeEEEeCC---CCCCHHHHHHHHHHHHHHhcC--eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHH Q lcl|NC_020866. 225 WAIFTQTYGQPLRIGKWQ---PGASEQDKDTLFRAVANIAGD--CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQ 299 (547) Q Consensus 225 w~~f~E~~G~P~~vgk~~---~~~~~~~~~~l~~al~~~~~~--~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~ 299 (547) ++..++.|+.|+++.+-. ...++++...+.+.-.-.... .......+.+++++.... ....++..++.+.+. T Consensus 221 ~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~---~~~~~~~~~~~l~~~ 297 (440) T protein:vir:95 221 TANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQY---DVNGTEAYKNRLAND 297 (440) T ss_pred HHHHHHHhhcceeeeecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecC---CHHHHHHHHHHHHHH Confidence 999999999998876642 112333333332221111111 111224456788886543 233588888888888 Q ss_pred HHHHHhcchhccCccccccchhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh---cCCC--cccccEEEecCcc Q lcl|NC_020866. 300 ISKAVLGQTTTTDAVAGGHAVSQEHRLV---QEDIETSDAMALAAILNRDLIRPWIQL---EYGP--QKRYPRIKIARPK 371 (547) Q Consensus 300 Isk~ilGqtlts~~~ggs~a~~~vh~~v---~~~~~~ad~~~i~~~ln~~li~~l~~~---Nfg~--~~~~p~~~~~~~~ 371 (547) |...--...++.++.+| ..+|..-... ....+..-.+.+...+. ++++.++.+ ..+. +.....+.|.... 
T Consensus 298 i~~~s~~p~~~~~~~~~-n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~v~i~f~~~~ 375 (440) T protein:vir:95 298 IHRFSRIPNLDDDRFNS-TSSGIALLYKMIGLEQVRKDKETYFTKALR-RRYELISNIHKAINGPVIEANKLTFTFHPNI 375 (440) T ss_pred HHHHhCCcccccccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhcCCcccccccceEEeCCCC Confidence 87663222333333222 2233322111 11223333455566663 355554433 2222 2234688899999 Q ss_pred ccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCcccccc Q lcl|NC_020866. 372 TEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERK 441 (547) Q Consensus 372 ~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (547) +.+..+.++.+.++. |+ ++.+.+.++++.-.+....+-+...............+.. .......+ T Consensus 376 p~~~~~~ad~~~kl~--g~-iS~et~~~~l~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~--~~~~~~~e 440 (440) T protein:vir:95 376 PQDVWTEIKAYIEAG--GE-ISQETLMENASFTDYKTEHSRILKQGGSSDLEIGQIVGDA--DVGQADTE 440 (440) T ss_pred CCCHHHHHHHHHHHh--cc-CcHHHHHHhCCCCCcHHHHHHHHHHHHHhhhhHHhhccCC--CCCCcCCC Confidence 999999999999984 66 8888888888764332211111111111110000000000 00000000 No 156 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=98.77 E-value=5.4e-08 Score=60.45 Aligned_cols=412 Identities=9% Similarity=0.005 Sum_probs=184.1 Q ss_pred CCCCCeeeCCCCCcccchhh--hhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHH-HHHHHH------H Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQL--TEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRY-LDLAEV------I 71 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l--~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~-~~L~~~------m 71 (547) |- .+|-=|++.|-.-.-. -...+.....-+...+..|-.+ ..++.. |+..-.|.-... ..-+.. 
- T Consensus 1 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~---~~~~~~-l~~Yy~g~~~i~~~~~~~~~~~~~~~ 74 (474) T protein:vir:96 1 MI--NIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQK---LKDINV-GQKYYDKDNDINYQAYKQDLHGNIDY 74 (474) T ss_pred Cc--ccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHH---HHHHHH-HHHHhcccCccccccchhhhcccccc Confidence 43 3332222322211100 0011111111122222222111 112111 111111110000 000000 0 Q ss_pred Hh-----hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEee Q lcl|NC_020866. 72 EE-----RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNT 145 (547) Q Consensus 72 ~~-----~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~ 145 (547) .+ ......-.+.+....+.+.+..+...++ +..+.+.+++++ +|.+.+..+. ++.-||.+ ++++|.. T Consensus 75 ~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~-----~~~~~l~~~~~n-~~~~~~~~l~~~~~~~G~~-~~~~~~d 147 (474) T protein:vir:96 75 TKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDD-----KVLDVIHQVLDT-RWDNKLIDILTAASNKGID-WLQVYIN 147 (474) T ss_pred cccccccccchHHHHHHhhhhhhcccCceeccCCh-----HHHHHHHHHHhc-cHHHHHHHHHHHHhhCCeE-EEEeeeC Confidence 00 1233345566677778888888765432 233566666654 5777777666 68889995 5677765 Q ss_pred cCCeeeEEeeEeeccceeEe--cccCCccee----e-ccCCCCceee-ccCceEEEeec--------------------- Q lcl|NC_020866. 146 SEGQYFPDKLIWRDPRHFRF--QKSDLATPL----L-LDDNGAEVPL-PAFRFIFADIP--------------------- 196 (547) Q Consensus 146 ~~g~~~~~~l~~~~~~~~~~--~~~~~~~~~----~-~~~~~~g~~l-~~~k~i~~~~~--------------------- 196 (547) .+|.+. +..++|+.+.. +......++ . ..+....+.+ .+.++..++.. 
T Consensus 148 ~~~~~~---i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (474) T protein:vir:96 148 EDGELK---LFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFST 224 (474) T ss_pred CCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccc Confidence 556543 44455544321 111111100 0 0000000111 11111111100 Q ss_pred ---------CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCeeEE Q lcl|NC_020866. 197 ---------AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCAAI 267 (547) Q Consensus 197 ---------~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~v 267 (547) .-.+++.|.|.+..+....=--...+..++..++.|..|+++.+- .+.++.. .....+.....+. T Consensus 225 ~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g---~~~~~~~---~~~~~~~~~~~i~ 298 (474) T protein:vir:96 225 GSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRG---YEGEDLS---EFMEGLKYYKAIN 298 (474) T ss_pred cCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcC---CCccccc---chhhhhhccceee Confidence 012457789999886555554556788999999999999887653 2222211 1223444556677 Q ss_pred ecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHH---HHHHHHHHHHHHHHHHH Q lcl|NC_020866. 268 IPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLV---QEDIETSDAMALAAILN 344 (547) Q Consensus 268 ip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v---~~~~~~ad~~~i~~~ln 344 (547) ++.+.+++++..... ...++.+++.+.+.|...--...++.++.+ +..+|..-... ....+..-.+.+...+. 
T Consensus 299 ~~~~~~~~~l~~~~~---~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~ 374 (474) T protein:vir:96 299 VSSDGGVETIQVEVP---VASTKEYLDMMRAYIVEFGQGVDFQTDKFG-SATSGIALKFLYTNLNLKANKLKNKANVALQ 374 (474) T ss_pred ccCCCceeEEeccCC---HHHHHHHHHHHHHHHHHHhCCcCccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 899999999875532 235788888888887665332233334333 33333322221 11222333445666663 Q ss_pred HHHHHHHHHhcCC-CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcchhcc--ccc-cc Q lcl|NC_020866. 345 RDLIRPWIQLEYG-PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAKLLR--PLE-NP 419 (547) Q Consensus 345 ~~li~~l~~~Nfg-~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~~~~--~~~-~~ 419 (547) ++++.++.+.-. .+.....+.|....+.+..+.++.+ ++.|+ ++.+.+.+.++. +.|+ +++-. ... .. T Consensus 375 -~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~---~~~gi-iS~et~~~~lp~v~D~~--~E~eri~~E~~~~ 447 (474) T protein:vir:96 375 -ELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIG---AQSQY-LSKETLVRHHPWVDDPK--AELERLDEEQLEL 447 (474) T ss_pred -HHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHHH---HHcCC-CChHHHHHhCCCCCCHH--HHHHHHHHHHHHH Confidence 466666665311 2223346778777788877766654 45687 899888888875 3332 22211 110 00 Q ss_pred ccccc-cccccccccCCccccccccccc Q lcl|NC_020866. 420 ATQAA-PPGMDPGAGDLKTAERKLKPLS 446 (547) Q Consensus 420 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 446 (547) ....+ ..+..+.. ..+...+...... T Consensus 448 ~~~~~~~~~~~~~~-~~~~~~~~~~e~~ 474 (474) T protein:vir:96 448 NKQLPNLDDGGADG-AQQQQQSENNQSK 474 (474) T ss_pred HhhccccccccCCC-CCCcCCCCccccC Confidence 00000 00000000 0000000000000 No 157 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=98.77 E-value=5.4e-08 Score=60.45 Aligned_cols=412 Identities=9% Similarity=0.005 Sum_probs=184.1 Q ss_pred CCCCCeeeCCCCCcccchhh--hhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHH-HHHHHH------H Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQL--TEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRY-LDLAEV------I 71 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l--~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~-~~L~~~------m 71 (547) |- .+|-=|++.|-.-.-. -...+.....-+...+..|-.+ ..++.. |+..-.|.-... ..-+.. - T Consensus 1 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~---~~~~~~-l~~Yy~g~~~i~~~~~~~~~~~~~~~ 74 (474) T protein:vir:95 1 MI--NIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQK---LKDINV-GQKYYDKDNDINYQAYKQDLHGNIDY 74 (474) T ss_pred Cc--ccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHH---HHHHHH-HHHHhcccCccccccchhhhcccccc Confidence 43 3332222322211100 0011111111122222222111 112111 111111110000 000000 0 Q ss_pred Hh-----hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEee Q lcl|NC_020866. 72 EE-----RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNT 145 (547) Q Consensus 72 ~~-----~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~ 145 (547) .+ ......-.+.+....+.+.+..+...++ +..+.+.+++++ +|.+.+..+. ++.-||.+ ++++|.. T Consensus 75 ~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~-----~~~~~l~~~~~n-~~~~~~~~l~~~~~~~G~~-~~~~~~d 147 (474) T protein:vir:95 75 TKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDD-----KVLDVIHQVLDT-RWDNKLIDILTAASNKGID-WLQVYIN 147 (474) T ss_pred cccccccccchHHHHHHhhhhhhcccCceeccCCh-----HHHHHHHHHHhc-cHHHHHHHHHHHHhhCCeE-EEEeeeC Confidence 00 1233345566677778888888765432 233566666654 5777777666 68889995 5677765 Q ss_pred cCCeeeEEeeEeeccceeEe--cccCCccee----e-ccCCCCceee-ccCceEEEeec--------------------- Q lcl|NC_020866. 146 SEGQYFPDKLIWRDPRHFRF--QKSDLATPL----L-LDDNGAEVPL-PAFRFIFADIP--------------------- 196 (547) Q Consensus 146 ~~g~~~~~~l~~~~~~~~~~--~~~~~~~~~----~-~~~~~~g~~l-~~~k~i~~~~~--------------------- 196 (547) .+|.+. +..++|+.+.. +......++ . ..+....+.+ .+.++..++.. 
T Consensus 148 ~~~~~~---i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (474) T protein:vir:95 148 EDGELK---LFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFST 224 (474) T ss_pred CCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccc Confidence 556543 44455544321 111111100 0 0000000111 11111111100 Q ss_pred ---------CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCeeEE Q lcl|NC_020866. 197 ---------AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCAAI 267 (547) Q Consensus 197 ---------~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~v 267 (547) .-.+++.|.|.+..+....=--...+..++..++.|..|+++.+- .+.++.. .....+.....+. T Consensus 225 ~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g---~~~~~~~---~~~~~~~~~~~i~ 298 (474) T protein:vir:95 225 GSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRG---YEGEDLS---EFMEGLKYYKAIN 298 (474) T ss_pred cCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcC---CCccccc---chhhhhhccceee Confidence 012457789999886555554556788999999999999887653 2222211 1223444556677 Q ss_pred ecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHH---HHHHHHHHHHHHHHHHH Q lcl|NC_020866. 268 IPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLV---QEDIETSDAMALAAILN 344 (547) Q Consensus 268 ip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v---~~~~~~ad~~~i~~~ln 344 (547) ++.+.+++++..... ...++.+++.+.+.|...--...++.++.+ +..+|..-... ....+..-.+.+...+. 
T Consensus 299 ~~~~~~~~~l~~~~~---~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~ 374 (474) T protein:vir:95 299 VSSDGGVETIQVEVP---VASTKEYLDMMRAYIVEFGQGVDFQTDKFG-SATSGIALKFLYTNLNLKANKLKNKANVALQ 374 (474) T ss_pred ccCCCceeEEeccCC---HHHHHHHHHHHHHHHHHHhCCcCccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 899999999875532 235788888888887665332233334333 33333322221 11222333445666663 Q ss_pred HHHHHHHHHhcCC-CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcchhcc--ccc-cc Q lcl|NC_020866. 345 RDLIRPWIQLEYG-PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAKLLR--PLE-NP 419 (547) Q Consensus 345 ~~li~~l~~~Nfg-~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~~~~--~~~-~~ 419 (547) ++++.++.+.-. .+.....+.|....+.+..+.++.+ ++.|+ ++.+.+.+.++. +.|+ +++-. ... .. T Consensus 375 -~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~---~~~gi-iS~et~~~~lp~v~D~~--~E~eri~~E~~~~ 447 (474) T protein:vir:95 375 -ELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIG---AQSQY-LSKETLVRHHPWVDDPK--AELERLDEEQLEL 447 (474) T ss_pred -HHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHHH---HHcCC-CChHHHHHhCCCCCCHH--HHHHHHHHHHHHH Confidence 466666665311 2223346778777788877766654 45687 899888888875 3332 22211 110 00 Q ss_pred ccccc-cccccccccCCccccccccccc Q lcl|NC_020866. 420 ATQAA-PPGMDPGAGDLKTAERKLKPLS 446 (547) Q Consensus 420 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 446 (547) ....+ ..+..+.. ..+...+...... T Consensus 448 ~~~~~~~~~~~~~~-~~~~~~~~~~e~~ 474 (474) T protein:vir:95 448 NKQLPNLDDGGADG-AQQQQQSENNQSK 474 (474) T ss_pred HhhccccccccCCC-CCCcCCCCccccC Confidence 00000 00000000 0000000000000 No 158 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=98.77 E-value=5.6e-08 Score=60.32 Aligned_cols=409 Identities=11% Similarity=0.019 Sum_probs=189.9 Q ss_pred CCCCCeeeCCCCCcccchhhhh--hhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHH---HHH--- Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTE--EVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAE---VIE--- 72 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~--~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~---~m~--- 72 (547) |+..+. |.++|+...-+.. +.+......+.+.+..|-.+ ..++...-+ .-.|.......-.. ... T Consensus 1 ~~~~~~---~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~---~~~~~~~~~-yY~g~~~i~~~~~~~~~~~~~~~ 73 (478) T protein:vir:10 1 MISINW---PWDKPYHEQVVEQIKPKYETQEEMILRLVREHKEN---IDNITMGER-YYNHHPDILDAPPKRDVNGDYDE 73 (478) T ss_pred CccccC---CCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHH---HHHHHHHHH-HhcCCCchhcccccccccccccc Confidence 887333 3444443322111 11111122222333322211 122222211 11111100000000 000 Q ss_pred ------hhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEee Q lcl|NC_020866. 73 ------ERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNT 145 (547) Q Consensus 73 ------~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~ 145 (547) -.-+...-.+.+....+.+.+..+...++ +..+.+.+++++ +|.+.+..+. ++..||.+. +++|.- T Consensus 74 ~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d-----~~~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~-~~~~~d 146 (478) T protein:vir:10 74 TKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDND-----KALKQIQHTLNH-KWDDKLVDILTAASNKGIEW-VQPYVD 146 (478) T ss_pred ccccceeccchHHHHHHHHHhhhccCCeeeecCCh-----HHHHHHHHHHhc-CHHHHHHHHHHHHHhcCeEE-EEEEec Confidence 01244455666666777788877765432 234566677764 6777777666 688899875 566654 Q ss_pred cCCeeeEEeeEeeccceeEe--cccCCccee-------------------------eccCCC------------------ Q lcl|NC_020866. 146 SEGQYFPDKLIWRDPRHFRF--QKSDLATPL-------------------------LLDDNG------------------ 180 (547) Q Consensus 146 ~~g~~~~~~l~~~~~~~~~~--~~~~~~~~~-------------------------~~~~~~------------------ 180 (547) .+|.+. +..++|+.... +......+. ...+.. 
T Consensus 147 ~~g~~~---~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (478) T protein:vir:10 147 EEGEFK---TFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQ 223 (478) T ss_pred CCCeeE---EEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCeeeccccccccccccceec Confidence 455543 33344433221 111001000 000000 Q ss_pred CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHH Q lcl|NC_020866. 181 AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANI 260 (547) Q Consensus 181 ~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~ 260 (547) ...+.+..++=++.+ .++++|.|.+..+....---...+..++..++.|..|+++.+--...+.. +...++ T Consensus 224 ~~~~~~~~~vPvv~~---~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~------~~~~~~ 294 (478) T protein:vir:10 224 GNKLMSWGRVPFIPF---KNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMK------DFMHNL 294 (478) T ss_pred ccccccCCccceEEe---ccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccc------hhhhhh Confidence 000000000001111 24788999999866665556667888999999999998876532111111 122233 Q ss_pred hcCeeEEec--CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHH---HHHHHHHH Q lcl|NC_020866. 261 AGDCAAIIP--ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLV---QEDIETSD 335 (547) Q Consensus 261 ~~~~~~vip--~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v---~~~~~~ad 335 (547) .....+.++ .|.+++++.... ....++..++.+.+.|.+.--+..++.++.+ +..+|..-... 
....+..- T Consensus 295 ~~~~~~~~~~~~~~~~~~l~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~ 370 (478) T protein:vir:10 295 KYYKAISVAGESGSGVDTIKVEV---PIDSVKEYTKMLRDYIIEFGQGVDFQQDKFG-NSPSGIALKFMYSNLDLKANKL 370 (478) T ss_pred hhcceEEecCCCCCcceEEeecC---ChHHHHHHHHHHHHHHHHHhCccccCccccc-cccHHHHHHHHHHHHHHHHHHH Confidence 344455555 567888875432 2345888888888888776433333333322 23233322111 22223344 Q ss_pred HHHHHHHHHHHHHHHHHHhcCC-CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcchhc Q lcl|NC_020866. 336 AMALAAILNRDLIRPWIQLEYG-PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAKLL 413 (547) Q Consensus 336 ~~~i~~~ln~~li~~l~~~Nfg-~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~~~ 413 (547) .+.+...+.+ +++.++.+.-. .+...+.+.|....+.+..+.++.+.++ .|+ ++.+.+.+.++. +.|+ +++- T Consensus 371 ~~~~~~~l~~-~~~li~~~~g~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~-iS~et~~~~l~~v~D~~--~E~~ 444 (478) T protein:vir:10 371 KNKTLTALQE-LLQYIIDFYRLDVKVQDIEITFNFNVMVNELENSQIAMNS--TGL-LSKETILSNHAWVEDPV--AEME 444 (478) T ss_pred HHHHHHHHHH-HHHHHHHHhCCCcccccceEEecCCCCCCHHHHHHHHHHH--hCC-CChHHHHHhCCCCCCHH--HHHH Confidence 5566666643 66666666421 2233467888888899999999999887 465 899999999975 3322 2222 Q ss_pred ccccccc--cccccccccccccCCccccccccccccchh Q lcl|NC_020866. 414 RPLENPA--TQAAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 414 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) ....... .+...... .+. . .........+... 
T Consensus 445 ri~~E~~~~~~~~~~~~-~~~---~-~~~~~~~~~~~~~ 478 (478) T protein:vir:10 445 RIEQENIELNQQLPDIE-EGL---N-GEQQRQSENNQPE 478 (478) T ss_pred HHHHHHHHHHhhccccc-ccc---C-CCCCCCCCCCCCC Confidence 1111111 00000000 000 0 0000000000000 No 159 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=98.76 E-value=6e-08 Score=60.16 Aligned_cols=417 Identities=12% Similarity=0.021 Sum_probs=184.2 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhh--hhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHH---HHHHHHH---- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAA--PTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRY---LDLAEVI---- 71 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~--~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~---~~L~~~m---- 71 (547) |- .++.-+++.....+-.+--. -+...+.+.+..|-.. ...+++.+.+-..+.+...+ ....... T Consensus 6 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~--~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~ 79 (481) T protein:vir:10 6 IN----NINTKFSPLANDDFVVSDLAELLKEENLRNFISRHQTE--QVPRLEMLESYYLNRNTDILAGERRLQKYGDKAD 79 (481) T ss_pred ee----hhchhcccccCceeeeecchhhcCHHHHHHHHHHHHHH--HHHHHHHHHHHhcCCCcccccCcccccccccccc Confidence 44 34555555554433211100 0011122222222100 11222222221111100000 0000000 Q ss_pred -HhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCe Q lcl|NC_020866. 72 -EERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQ 149 (547) Q Consensus 72 -~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~ 149 (547) .-..++..-.+.+....+.+.+..+.+.++. ..+.+.++++.-+|+..+..+. ++..||.+. +++|...+|. 
T Consensus 80 ~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~-----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~-~~~~~d~dg~ 153 (481) T protein:vir:10 80 HRAVHNYAKYVSRFIVGYLTGNPITITHQDNQ-----TNDKIIELNDLNDADEVNSDLALNLSIYGRAY-EIVYRDFEDR 153 (481) T ss_pred ceeecchHHHHHHHHHhhhccCCceEecCChh-----HHHHHHHHHHhcChhHHHHHHHHHHHhcCeEE-EEEEeCCCCe Confidence 0013455566667777788888888764332 2346777787777888888777 688899665 4556545555 Q ss_pred eeEEeeEeeccceeE--ecccCCcce----------------------------eeccCCCCce----eec-cCceEEEe Q lcl|NC_020866. 150 YFPDKLIWRDPRHFR--FQKSDLATP----------------------------LLLDDNGAEV----PLP-AFRFIFAD 194 (547) Q Consensus 150 ~~~~~l~~~~~~~~~--~~~~~~~~~----------------------------~~~~~~~~g~----~l~-~~k~i~~~ 194 (547) .. +..++|+... ++......+ ......+.+. +.| ++..|-.. T Consensus 154 ~~---i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv 230 (481) T protein:vir:10 154 DT---FKVLDPKSTFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPII 230 (481) T ss_pred EE---EEEEcccceEEEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccccCCceeEE Confidence 43 4445554432 111111110 0000000000 111 11111111 Q ss_pred ecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCeeEE--ecCCc Q lcl|NC_020866. 195 IPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCAAI--IPETM 272 (547) Q Consensus 195 ~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~v--ip~~~ 272 (547) + -.++++|.|.+..+....--=...+..++..++.|+.|+++.+-....++++...+...-.-........ ...+. 
T Consensus 231 ~--~~n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (481) T protein:vir:10 231 E--YLNDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANMIHLEPGTNANGSEGKA 308 (481) T ss_pred E--eecCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccceeccccccccCCCCCc Confidence 1 1235678898886544333333356777778899999998876432333333222222100000111111 12345 Q ss_pred eEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHH---HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 273 SIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRL---VQEDIETSDAMALAAILNRDLIR 349 (547) Q Consensus 273 ~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~---v~~~~~~ad~~~i~~~ln~~li~ 349 (547) +++++..... ...++..++.+.+.|...--...++.++.+ +...|..-.. -....+..-.+.+...+.+ +++ T Consensus 309 ~~~~l~~~~~---~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~-~~~ 383 (481) T protein:vir:10 309 EVKYVYKQYD---VAGVEAYKKRLQNDIHKYTNTPDLNDEQFS-GVQSGESMKYKLFGLEQVRAIKERLFKKGLMK-RYK 383 (481) T ss_pred ceeEEeecCC---HHHHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH Confidence 7788764432 235777788887777665222223333222 2233332222 2223344445666677743 566 Q ss_pred HHHHh-cC--CC--cccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcchhccccccc--cc Q lcl|NC_020866. 350 PWIQL-EY--GP--QKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAKLLRPLENP--AT 421 (547) Q Consensus 350 ~l~~~-Nf--g~--~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~~~~~~~~~--~~ 421 (547) .++.+ |. +. ...-.++.|....+.+..+.++.+.+++ |+ ++.+.+.+.++. +.|+ +++-...... .. 
T Consensus 384 li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~--g~-is~et~~~~l~~i~d~~--~E~~ri~~E~~~~~ 458 (481) T protein:vir:10 384 LLLNNVNLTGLKQHNYAELTITFTPNLPKSMMESINAFNALS--GG-VSESTRLSLLDFIDNPK--EELEKMQEEEAQRE 458 (481) T ss_pred HHHHHHhccCCCccccceeeEEeCCCCCcCHHHHHHHHHHHh--cc-CChHHHHHhCCCCCCHH--HHHHHHHHHHHHHH Confidence 55554 22 11 1223578888888999999999999885 65 888888888875 2222 2221111100 00 Q ss_pred ccccccccccccCCccccccccccccchhhhhh Q lcl|NC_020866. 422 QAAPPGMDPGAGDLKTAERKLKPLSGQADLSRG 454 (547) Q Consensus 422 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (547) +.......+...... ...+..+. T Consensus 459 ~~~~~~~~~~~~~~~----------~~~dd~~g 481 (481) T protein:vir:10 459 KQADKRGYGEAFENH----------LNVDDSNG 481 (481) T ss_pred hhhhhccCCccCCCC----------CCCCCCCC Confidence 000000000000000 00000000 No 160 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=98.72 E-value=8.4e-08 Score=59.37 Aligned_cols=395 Identities=11% Similarity=0.053 Sum_probs=175.3 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |.++-. .-++. . +-.++++..|+-+. .+..++.-.| +....+.+.+....+.+|+..+ +++.-.+. T Consensus 1 ~~~~~~------~~~~~-~----~~~~~~~~~rd~l~-~~~~glg~~r-~~~~~~~g~~~~~~~~~l~~~Y-r~~~ia~~ 66 (449) T protein:vir:10 1 MTDKLT------LAVNH-A----LNDARMARARMGLM-VPTMGLDNKR-HSAWCEYGFPELVTYENLYSLY-RRGGIAHG 66 (449) T ss_pred CchhhH------HHHhh-h----cchhHHHHHHHHHH-HHHhcCCccc-chhhhhcCCcccCCHHHHHHHH-hcCchhHH Confidence 443210 00000 0 11112222332222 2344554444 4566667766666666666544 35777777 Q ss_pred HHHHHHHHHhcCCceEecCCCCHHHHH---HHHHHHHHHhCCCHHHHHHHHHHhhhhcCeeeeeEEeecCCee------- Q lcl|NC_020866. 
81 VIGTRKRSVSQLDITVEPASDDPQDVK---IADMVRDWLKREELQQEVFHILDCISKGYSFTEIVWNTSEGQY------- 150 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~~~~~~~~~---~ae~v~~~l~~~~~~~~i~~~l~a~~~G~sv~Ei~w~~~~g~~------- 150 (547) ++++-.......-..|..+.+...... .-..+++++..--|..+....-.+.+||++++=+.- .++..+ T Consensus 67 iVd~~~d~~~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v-~d~~~l~~Pl~~~ 145 (449) T protein:vir:10 67 AVEKLVGKCWQTNPEIIEGDDADDSEDETSWEKKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHI-RDEKDWNLPATKG 145 (449) T ss_pred HHHhhhhhhhhcCcccccCccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEe-cCCCCCCcccccC Confidence 887665544333233433332222211 111222222211144455455567789999874332 122111 Q ss_pred -eEEeeEeeccceeEec---cc------CCcceeecc-----CCCCceeeccCceEEEeecCCcCccccchhHHHHHHHH Q lcl|NC_020866. 151 -FPDKLIWRDPRHFRFQ---KS------DLATPLLLD-----DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGW 215 (547) Q Consensus 151 -~~~~l~~~~~~~~~~~---~~------~~~~~~~~~-----~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~ 215 (547) .+++|.+.....+... .+ +....+... ....+..+-+.+++.+.... ..|.++++.+|-.. T Consensus 146 ~~i~~i~v~~~~~i~~~~~~~dp~sp~yg~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~~~----~~g~~~L~~~yn~l 221 (449) T protein:vir:10 146 RGLQKVSVSWAGSLKVAEWDTGINSKTYGQPKLWKYTERLPNGSSRRVDIHPDRVFILGDYS----EDAIGFLEPAYNAF 221 (449) T ss_pred cceeeEEeeccccCChhhhhcCCCCCCCCCceEEEEeeeccCCCccceeeccceeEeecCCC----CCChhHHHHHHHHh Confidence 2334444433222211 10 111111111 11223345555554443222 22778899887532 Q ss_pred H-HHHH---H----HHHH-----HHHHHH---cCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcC-eeEEecCCceEEEEe Q lcl|NC_020866. 216 M-FKAF---T----ARDW-----AIFTQT---YGQPLRIGKWQPGASEQDKDTLFRAVANIAGD-CAAIIPETMSIDFVE 278 (547) Q Consensus 216 ~-~K~~---~----~~~w-----~~f~E~---~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~-~~~vip~~~~ie~~e 278 (547) + +-+. . ++.. ..|..+ .|+.-.. . ...++-.+.+...++.+.++ ..+++-++.+++.+. 
T Consensus 222 ~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~---~-~~~e~~~~~~~~~~~~~~~~~~~~~i~~~~d~~~~~ 297 (449) T protein:vir:10 222 VSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLY---G-VSIDELQDKFNEVAGEINRGNDVLMTTQGATVTPLV 297 (449) T ss_pred hhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHh---h-CCchHHHHHHHHHHHHHhccchheeecCCcceEEEe Confidence 1 1010 0 0111 111111 1222111 1 12233344454455544332 345667777888886 Q ss_pred cCCCCCchHHHHHHHHHHHHHHHHHHhcchhc---cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 279 SKSIGSSVDLYEKRINHLDQQISKAVLGQTTT---TDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLE 355 (547) Q Consensus 279 ~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlt---s~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~N 355 (547) .+- +|-.+. ++..-.++|-+ .|-.+| +.+.+| .+.++ -...+.+.+.+-...+.-.|. .|+..|+... T Consensus 298 ~~~-sgl~d~----l~~~~q~iaaa-~~IP~t~L~Gqsp~g-lnst~-D~~nyyd~i~~~Q~~l~p~le-~l~~~l~~s~ 368 (449) T protein:vir:10 298 TSV-ADPTAT----YNVNLQTAAAG-VDIPTRILIGNQQAE-RSSTE-DQKYFNARCQSRRVDLSFEIE-DFCDKLIELK 368 (449) T ss_pred ccc-CChhHH----HHHHHHHHHHH-hCCCeeeeeccCccc-cccch-hHHHHHHHHHHHHHhhhHHHH-HHHHHHHHhh Confidence 654 443333 33333445544 343332 222233 33222 235577777776666777775 4888888888 Q ss_pred CCCcccccEEEecCccccCHHH-------HHHHHHHHHhCCC--CCCHHHHHHHhCCCCCCCcchhcccccccccccccc Q lcl|NC_020866. 356 YGPQKRYPRIKIARPKTEDLTK-------LASSLDTLVRLGM--EIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPP 426 (547) Q Consensus 356 fg~~~~~p~~~~~~~~~ed~~~-------~a~~~~~l~~~G~--~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~ 426 (547) |+....--.|+|......+-++ .|++++++++.|. .++.+++|+..|.+.+.... ..... .++..+ T Consensus 369 ~g~~~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~~~~EiR~~~~~~~~~~~~-~~~e~----~de~~~ 443 (449) T protein:vir:10 369 IIDAVAKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAFSREEIRTAAGYDNDDEEP-LGEED----GDEEDK 443 (449) T ss_pred cCCCCCceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCcCHHHHHHHhcccCCCCCC-CCCCC----Cccccc Confidence 8754333466665554444444 4667777887772 47999999999996543221 10000 000000 Q ss_pred cccccccCCccc Q lcl|NC_020866. 
427 GMDPGAGDLKTA 438 (547) Q Consensus 427 ~~~~~~~~~~~~ 438 (547) +..++ + T Consensus 444 ~~d~~------a 449 (449) T protein:vir:10 444 ATDSA------A 449 (449) T ss_pred cCCcC------C Confidence 00000 0 No 161 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=98.72 E-value=8.4e-08 Score=59.36 Aligned_cols=382 Identities=12% Similarity=0.116 Sum_probs=185.1 Q ss_pred ccchhhhhhhhhhhh--hhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcC Q lcl|NC_020866. 15 VQRAQLTEEVAAPTV--TGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQL 92 (547) Q Consensus 15 ~~~~~l~~~~a~~~~--~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~ 92 (547) |+...|.+-+..... .-.......|-+.. .||..........-..+ ..+...-.+.+....+.+. T Consensus 1 l~~~~l~~~i~~~~~~~~r~~~l~~yy~g~~-------~il~~~~~~~~~~~~ki------~~n~~~~ivd~~~~~l~g~ 67 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFNLSYSAYKQLYEGDH-------AILQQKQKEQYKPDNRL------VVNFAKYIVDTFNGYFIGV 67 (429) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------ccccccccccCCCccee------ecchHHHHHHHHhhhhccc Confidence 444444443322110 00111111111110 11111100000000001 2455666777777888888 Q ss_pred CceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeE--ecccC Q lcl|NC_020866. 93 DITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSD 169 (547) Q Consensus 93 ~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~ 169 (547) +..+.+.++ ..-+.+.++++.-+|+.....+. ++..||.+ ++++|...+|... +..++|+... ++... 
T Consensus 68 ~~~~~~~~~-----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~-~~~v~~d~~g~~~---~~~~~p~~~~~v~dd~~ 138 (429) T protein:vir:98 68 PVQTSHENK-----QVSNYLELLDGYNDQDDNNAELSKICSIYGHG-YELVFNDENAEAG---ITYLTPLEAFIVYDDSI 138 (429) T ss_pred CceeecCCh-----HHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeE-EEEEEecCCCcEE---EEEEcccceEEEEeCCC Confidence 888775432 23345777777767888777776 68889975 5667765556543 4445554432 11111 Q ss_pred Cccee----e--------------------ccCCCCceee----c-cCceE-EEeecCCcCccccchhHHHHHHHHHHHH Q lcl|NC_020866. 170 LATPL----L--------------------LDDNGAEVPL----P-AFRFI-FADIPAKSGILLRSGLARVAAWGWMFKA 219 (547) Q Consensus 170 ~~~~~----~--------------------~~~~~~g~~l----~-~~k~i-~~~~~~~~~~p~G~gll~~~~~~~~~K~ 219 (547) ...++ . ......+..+ | +...| ++. -.++++|.|.+..+....---. T Consensus 139 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~---~~n~~~g~sd~e~v~~liD~~d 215 (429) T protein:vir:98 139 RQKPLFAVRYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIE---YVENEERQSLLASVVTLINAFN 215 (429) T ss_pred CCceEEEEEEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCCccceEE---ecCCCCCCCcHHHHHHHHHHHH Confidence 11000 0 0000111111 0 11111 111 1346789999998776666666 Q ss_pred HHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCC----ceEEEEecCCCCCchHHHHHHHHH Q lcl|NC_020866. 220 FTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCAAIIPET----MSIDFVESKSIGSSVDLYEKRINH 295 (547) Q Consensus 220 ~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~----~~ie~~e~~~~~~~~~~f~~li~~ 295 (547) ..+..++...+.|..|+++.+-- ..+++.. .++.....+.+|.+ .+++++.... ....++..++. 
T Consensus 216 ~~~s~~~~~~~~~~~p~~~i~g~-~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~l~~~~---~~~~~~~~~~~ 284 (429) T protein:vir:98 216 KAISEKANDVEYFADAYLKILGA-ELDDETL-------KSLRDTRIINLKDTDAQQLTVEFLQKPD---ADATQEHLLDR 284 (429) T ss_pred HHHHHHHHHHHHhcCceeeeecC-CCCcchh-------hhHhhCceeeccCCCCCCcceeEEeecC---CHHHHHHHHHH Confidence 67888999999999999887632 2232222 23333455555543 4678875442 23457888888 Q ss_pred HHHHHHHHHhcchhccCccccccchhHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHh-c--CCC-cccccEEEec Q lcl|NC_020866. 296 LDQQISKAVLGQTTTTDAVAGGHAVSQEHRL---VQEDIETSDAMALAAILNRDLIRPWIQL-E--YGP-QKRYPRIKIA 368 (547) Q Consensus 296 ~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~---v~~~~~~ad~~~i~~~ln~~li~~l~~~-N--fg~-~~~~p~~~~~ 368 (547) +.+.|.+.--+..++.++. | .++|..-.. -....+..-.+.+...+. ++++.++.+ + ... +.....+.|. T Consensus 285 l~~~i~~~s~~p~~~~~~~-g-n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~d~~~i~v~f~ 361 (429) T protein:vir:98 285 LENLIFRTAMVANISDESF-G-TASGIALRYRLQAMDNLAKTKERKFMSGMN-RRYKLIASYPTSKIGPKDWIGIKYKFT 361 (429) T ss_pred HHHHHHHHhCccccCcccc-c-cchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccCCCccccccceEEeC Confidence 8888876643333333322 2 222322221 122234444566666664 355655554 2 111 1122478888 Q ss_pred CccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcchhcccccccccccccccccccccCCcccccccccccc Q lcl|NC_020866. 369 RPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSG 447 (547) Q Consensus 369 ~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (547) ...+.+..+.++.+.++. |+ ++.+.+.+.+|. +.|+ +++-......................+... 
T Consensus 362 ~~~p~~~~~~a~~~~kl~--g~-is~et~~~~l~~v~d~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~-------- 428 (429) T protein:vir:98 362 RNLPANLLEESQIAGNLA--GI-VSEETQVGVLSIVENPQ--KEIERKNSDKSTLISRQAGGLNGQNTTTIL-------- 428 (429) T ss_pred CCCCcCHHHHHHHHHHHh--cc-CchHHHHHhCCCCCCHH--HHHHHHHHHHHHHHHHHHhhhcCCCCCCCC-------- Confidence 888999999999999884 65 899999999974 3332 222111111110000000000000000000 Q ss_pred chhhhhhcccchhhhcccccccchh Q lcl|NC_020866. 448 QADLSRGDLSLNSEMPSTGQKTGGA 472 (547) Q Consensus 448 ~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (547) + T Consensus 429 ------------------------~ 429 (429) T protein:vir:98 429 ------------------------E 429 (429) T ss_pred ------------------------C Confidence 0 No 162 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=98.72 E-value=8.5e-08 Score=59.36 Aligned_cols=420 Identities=11% Similarity=0.023 Sum_probs=195.7 Q ss_pred CCCCCee---eCCCCCcccchhhhhhhhhhhhhhhHHHHhhhh----ccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHh Q lcl|NC_020866. 1 MAQKPVL---LDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYP----GDGLNPLRLAQILRAADHGDPVRYLDLAEVIEE 73 (547) Q Consensus 1 ma~~~~~---~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~----~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~ 73 (547) |.+.+.| +|..|+--.+ .. ..+ +...-++ ....++.++..+++........++..|++.+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~-~~~--------n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g 69 (511) T protein:vir:99 1 MLKVNEFETDTDLRGNINYL--FN-DEA--------NVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEG 69 (511) T ss_pred Cccccchhhhhhhhhhhhhh--hh-hhh--------CCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcc Confidence 7665554 2333331100 00 000 0000111 123356666666665443333444333332210 Q ss_pred -----------h----------hHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-Hh Q lcl|NC_020866. 
74 -----------R----------DPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DC 131 (547) Q Consensus 74 -----------~----------D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a 131 (547) . .....-.+.+....+.+.+..+.+.++ +..+++..+++.-+|+.....+. ++ T Consensus 70 ~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~-----~~~~~l~~~~~~n~~~~~~~~~~~~~ 144 (511) T protein:vir:99 70 KTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK-----DVLEAIEAFNDLNDVESHNRSLGLDL 144 (511) T ss_pred cCccccccCcccccccCcceeecchHHHHHHHHHhhhcccCceeecCch-----HHHHHHHHHHhhcCHhHHHHHHHHHH Confidence 0 133345666677777788888765432 23467888887778888888877 68 Q ss_pred hhhcCeeeeeEEeecCCeeeEEeeEeeccceeE--ecccCCcce----ee--cc--C-CCCc----e-eeccCceEEEee Q lcl|NC_020866. 132 ISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATP----LL--LD--D-NGAE----V-PLPAFRFIFADI 195 (547) Q Consensus 132 ~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~----~~--~~--~-~~~g----~-~l~~~k~i~~~~ 195 (547) ..||.+ .+++|...+|... +..++|+.+. ++......+ +. .. + .... . -+.+.+...++. T Consensus 145 ~i~G~a-~~~vy~ded~~~~---i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~ 220 (511) T protein:vir:99 145 SIYGKA-YELMIRNQDDETR---LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLT 220 (511) T ss_pred HhcCee-EEEEEeCCCCceE---EEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEe Confidence 889964 5677765555543 3444454332 111110110 00 00 0 0000 0 001111111100 Q ss_pred c-------------------------CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHH Q lcl|NC_020866. 196 P-------------------------AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDK 250 (547) Q Consensus 196 ~-------------------------~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~ 250 (547) . .-.+++.|.|.+..+....---...+..++..++.|+.|+++.+-....++.+. 
T Consensus 221 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~ 300 (511) T protein:vir:99 221 SRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEV 300 (511) T ss_pred cCCccccccccccccccCCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchhh Confidence 0 012357889999987666655666788899999999999887653222333322 Q ss_pred HHHHHH----HHHH--hcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHH Q lcl|NC_020866. 251 DTLFRA----VANI--AGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEH 324 (547) Q Consensus 251 ~~l~~a----l~~~--~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh 324 (547) ....+. +... ....+.-...+.+++++.... ....++..++.+.+.|...--...++.++.+| ..+|..- T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~---~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~g-n~Sg~Al 376 (511) T protein:vir:99 301 RKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQY---DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQSGEAM 376 (511) T ss_pred cccccccceecccccccccccccCCCCcceeEEeecC---CHHHHHHHHHHHHHHHHHHhCCcccccccccc-cchHHHH Confidence 211110 0000 111112234567888886543 23458888999888887653333333333222 2222221 Q ss_pred -HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHh---cCC----CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCH Q lcl|NC_020866. 325 -RLV--QEDIETSDAMALAAILNRDLIRPWIQL---EYG----PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEE 394 (547) Q Consensus 325 -~~v--~~~~~~ad~~~i~~~ln~~li~~l~~~---Nfg----~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~ 394 (547) ... ....+..-.+.+...+++ +++.++.+ +.. ....-.++.|....+.+..+.++.+.++. |+ ++. 
T Consensus 377 k~~~~~l~~ka~~k~~~~~~~l~~-~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~--Gi-iS~ 452 (511) T protein:vir:99 377 KYKLFGLEQRTKTKEGLFTKGLRR-RAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-ISQ 452 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CCH Confidence 111 222333445566666644 55555443 211 11223578888888999999999998885 65 899 Q ss_pred HHHHHHhCC-CCCCCcchhccccccccc----ccccccccccccCCccccccccccccchh Q lcl|NC_020866. 395 NEVRSRFGF-SQPKPGAKLLRPLENPAT----QAAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 395 ~~~~e~~Gl-p~p~~~e~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) +.+.+.++. +.|+ +++-........ ........+........+.......+... T Consensus 453 et~l~~l~~v~D~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 453 TTLMSLFSFFQDPE--LEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDSTKDSIDKKE 511 (511) T ss_pred HHHHHhCCCCCCHH--HHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCCCcCcccccC Confidence 888888854 3222 222111111000 00000000000000000000000000000 No 163 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=98.71 E-value=9.2e-08 Score=59.17 Aligned_cols=422 Identities=12% Similarity=0.016 Sum_probs=194.2 Q ss_pred CCCCCee---eCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHh---- Q lcl|NC_020866. 1 MAQKPVL---LDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEE---- 73 (547) Q Consensus 1 ma~~~~~---~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~---- 73 (547) |.+.+.+ .|..|+--. .=...|. 
........--....++.++..++.........++..|.+.+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~---~~~~~~n----~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i 73 (511) T protein:vir:10 1 MLKVNEFETDTDLRGNINY---LFNDEAN----VVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKN 73 (511) T ss_pred Cccccchhhhhhhhhhhhh---hhhhhhc----CCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcc Confidence 7765554 122222000 0000000 0000000001123355666666655444344444443332210 Q ss_pred -----------------hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-Hhhhhc Q lcl|NC_020866. 74 -----------------RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKG 135 (547) Q Consensus 74 -----------------~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G 135 (547) ......-.+......+.+.+..+.+.++ +..+.+.++++.-+|+.....+. ++..|| T Consensus 74 ~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~-----~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G 148 (511) T protein:vir:10 74 LVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK-----DVLEAIEAFNDLNDVESHNRSLGLDLSIYG 148 (511) T ss_pred ccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCch-----HHHHHHHHHHhhcCHHHHHHHHHHHHHhcC Confidence 0123345566667777888888865433 23467888887778888888777 688899 Q ss_pred CeeeeeEEeecCCeeeEEeeEeeccceeE--ecccCCcce------ee--------------------------ccCCCC Q lcl|NC_020866. 136 YSFTEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATP------LL--------------------------LDDNGA 181 (547) Q Consensus 136 ~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~------~~--------------------------~~~~~~ 181 (547) . +.+++|...+|... +..++|+... ++......+ .. ....+. 
T Consensus 149 ~-ay~~vy~dedg~~~---i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~ 224 (511) T protein:vir:10 149 K-AYEIMIRNQDDETR---LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTN 224 (511) T ss_pred e-eEEEEEeCCCCceE---EEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCC Confidence 7 45777765556543 3344444332 111110000 00 000000 Q ss_pred ceee---------ccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHH Q lcl|NC_020866. 182 EVPL---------PAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDT 252 (547) Q Consensus 182 g~~l---------~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~ 252 (547) +... -++..|-..+ -.++..|.|.+..+....---...+..++..++.|+.|+++.+-....+.++... T Consensus 225 ~~~~~~~~~~~~~~~~~~vPvv~--f~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~ 302 (511) T protein:vir:10 225 GLKLTPRENGFESHSFERMPITE--FSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRK 302 (511) T ss_pred cccccccccccccccCcceeEEE--ecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhcc Confidence 0000 0111111111 1235678999998876665556678888889999999988866322233333221 Q ss_pred HHHH-HHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHH Q lcl|NC_020866. 253 LFRA-VANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRL 326 (547) Q Consensus 253 l~~a-l~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~ 326 (547) ..+. +-.+.. ..+.-...+.+++++.... 
....++..++.+.+.|...--...++.++.+| ..+|..-.- T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~---~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~-n~Sg~Al~~ 378 (511) T protein:vir:10 303 QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQY---DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQSGEAMKY 378 (511) T ss_pred chhccceecccccccccccccCCCCcceeEEeecC---CHHHHHHHHHHHHHHHHHHhCCcccccccccc-cchHHHHHH Confidence 1110 000100 0111123467788876432 23357888888888887653222333333222 222322111 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHHh---cCC----CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHH Q lcl|NC_020866. 327 ---VQEDIETSDAMALAAILNRDLIRPWIQL---EYG----PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENE 396 (547) Q Consensus 327 ---v~~~~~~ad~~~i~~~ln~~li~~l~~~---Nfg----~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~ 396 (547) -....+..-.+.+...+.+ +++.++.+ .-+ ....-.++.|....+.+..+.++.+.+|+ |+ ++.+. T Consensus 379 ~~~~l~~k~~~k~~~f~~~l~~-~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~--G~-iS~et 454 (511) T protein:vir:10 379 KLFGLEQRTKTKEGLFTKGLRR-RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-ISQTT 454 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHh--cc-CcHHH Confidence 1222233445555666643 44554443 111 11223578888888999999999999985 66 88888 Q ss_pred HHHHhCC-CCCCCcchhccccccccc----ccccccccccccCCccccccccccccchh Q lcl|NC_020866. 397 VRSRFGF-SQPKPGAKLLRPLENPAT----QAAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 397 ~~e~~Gl-p~p~~~e~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) +.+.++. +.|+ +++-........ ........+........+....+.+.... 
T Consensus 455 ~~~~l~~v~d~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 455 LMSLFSFFQDPE--LEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHhCCCCCCHH--HHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcccCcccccC Confidence 8888864 3332 222111111000 00000000000000001111111111100 No 164 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=98.69 E-value=1.1e-07 Score=58.83 Aligned_cols=381 Identities=11% Similarity=0.031 Sum_probs=176.0 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhh--hhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHH----HHHHHHHHh- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVA--APTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRY----LDLAEVIEE- 73 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a--~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~----~~L~~~m~~- 73 (547) |. |+++ ..+.. .+.-..+++.+.+ ...+|.++.-.++.. .+-|+.... T Consensus 1 m~------------V~~~--hp~y~a~~~~W~~~rd~~~G-----------~~~~r~~g~~YLpk~~~E~~~~Y~~rl~r 55 (452) T protein:vir:94 1 MP------------IETK--HPEYLAYENDWIDCRVASLG-----------QREVKKKGVRFLPKLSGQTDDMYNAYKQR 55 (452) T ss_pred CC------------CCCc--CHHHHHHHHHHHHHHHHhcC-----------hHHHHcCCcccCCCCCCCCHHHHHHHHhh Confidence 22 1211 11111 1111122222221 123333322111100 111222221 Q ss_pred --hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCee Q lcl|NC_020866. 74 --RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQY 150 (547) Q Consensus 74 --~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~ 150 (547) --++++..++.....|.+.++.++-++ .+.+. 
..-..+.+++.+++.++ .++.||.+.+=+-|-..+++= T Consensus 56 A~~~n~~~~t~~~~~G~vf~k~p~~~~p~------~l~~~-~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rP 128 (452) T protein:vir:94 56 ALFYSITSKTLSALSGMVLDQPPVITHPD------AMSKY-FEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDP 128 (452) T ss_pred ccCCchHHHHHHHHhchhhcCCceecccH------HHHHH-HhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCce Confidence 257888888888888888888875321 12222 22245678999999998 688899887666664443321 Q ss_pred eEEeeEeeccc---eeEecccC------------------------------------Ccceee-ccCCCC----ceeec Q lcl|NC_020866. 151 FPDKLIWRDPR---HFRFQKSD------------------------------------LATPLL-LDDNGA----EVPLP 186 (547) Q Consensus 151 ~~~~l~~~~~~---~~~~~~~~------------------------------------~~~~~~-~~~~~~----g~~l~ 186 (547) ++. .+++. .|.++..+ ..+... ....+. +...+ T Consensus 129 y~~---~~~~~~Ii~W~~~~~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~ 205 (452) T protein:vir:94 129 YIS---VYTTENILNWEEDEDGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTST 205 (452) T ss_pred EEE---EechhhhcCccccccCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeecccee Confidence 111 11111 00011100 000000 000100 00000 Q ss_pred ------cCc---eEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHH Q lcl|NC_020866. 187 ------AFR---FIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAV 257 (547) Q Consensus 187 ------~~k---~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al 257 (547) +.. |+++.. ...+-..+.+-|..++..-+--.....+.-.-+..-|.|+++..-..... . 
T Consensus 206 ~~~~~~~l~~IP~v~~~~-~~~~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~-----~----- 274 (452) T protein:vir:94 206 IQNVGVTMDYIPFFCITP-SGLSMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQS-----T----- 274 (452) T ss_pred ecCCCcccceeEEEEEcC-CCCCCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCC-----c----- Confidence 011 222222 22223345555666665544333334445555566688877765321111 1 Q ss_pred HHHhcCeeEEecC-CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchh-ccCccccccchhH-HHHHHHHHHHHH Q lcl|NC_020866. 258 ANIAGDCAAIIPE-TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTT-TTDAVAGGHAVSQ-EHRLVQEDIETS 334 (547) Q Consensus 258 ~~~~~~~~~vip~-~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtl-ts~~~ggs~a~~~-vh~~v~~~~~~a 334 (547) -.+|+.++..+|+ |.++.+++.++ ++....++.++-+.++|.. +|.-+ ...+.+..-+.+. ....-....+.. T Consensus 275 i~iG~~~~~~lpe~~~~~~yie~~g--~~i~~~~~~l~~le~~m~~--~Ga~ll~~~~~~~~s~ea~~~~~~~~~s~L~~ 350 (452) T protein:vir:94 275 MHIGSTKAWVIPEVAAKVGFLEFTG--QGLQSLEKALSEKQAQLAS--LSARLIDNSTRGSEATETVKLRYMSETASLKS 350 (452) T ss_pred eEecccccccCCCCCCcceEEccCc--hhHHHHHHHHHHHHHHHHH--HHHHhhccCCCcchHHHHHHHHHHHhhHHHHH Confidence 2578899999996 99999999764 4444566666666666633 34333 3222111111111 222223467778 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcccccEEEec--CccccCHHHHHHHHHHHHhCCCCCCHHHHHHHh---CCCCCCCc Q lcl|NC_020866. 335 DAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIA--RPKTEDLTKLASSLDTLVRLGMEIEENEVRSRF---GFSQPKPG 409 (547) Q Consensus 335 d~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~--~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~---Glp~p~~~ 409 (547) -+..+++.++ +++++++.+- |.+. ..+|++. ..........++++.++...|. 
++.+.+++.+ |+..++.+ T Consensus 351 ~a~~~e~al~-~~l~~~a~w~-g~~~-~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~-is~~t~~~~L~~~gvl~~~~e 426 (452) T protein:vir:94 351 VTRAVEALLN-KAYSCIMDME-SMGG-TLNIKLNSAFLDSKLTAAELKAWVEAYLSGG-ISKEIYIHALKVGKVLPPPGE 426 (452) T ss_pred HHHHHHHHHH-HHHHHHHHHc-CCCC-ceEEEeccccccccCCHHHHHHHHHHHhcCC-CcHHHHHHHHHhCCCCCCccC Confidence 8888999996 5888888875 3322 2345433 2222222345666777888886 8876665555 88877655 Q ss_pred chhcccccccccccccccccccccCCcccccc Q lcl|NC_020866. 410 AKLLRPLENPATQAAPPGMDPGAGDLKTAERK 441 (547) Q Consensus 410 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (547) ++.+....+.+. +.+.. .|..+ ...+ T Consensus 427 ~~~i~~E~~~~~-~~~~~-~~~~~----~~~~ 452 (452) T protein:vir:94 427 SMGVIPDPPAPE-PSPSN-TPPNP----SSKA 452 (452) T ss_pred HHHHHHHhhccC-cccCC-CCCCC----ccCC Confidence 443332211111 11111 11110 0000 No 165 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=98.64 E-value=1.5e-07 Score=57.95 Aligned_cols=420 Identities=12% Similarity=0.040 Sum_probs=195.4 Q ss_pred CCCCCee---eCCCCCcccchhhhhhhhhhhhhhhHHHHhhhh----ccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHh Q lcl|NC_020866. 1 MAQKPVL---LDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYP----GDGLNPLRLAQILRAADHGDPVRYLDLAEVIEE 73 (547) Q Consensus 1 ma~~~~~---~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~----~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~ 73 (547) |.+.+.+ .|..|+--.+ .. 
..| +...-++ ....++.++..+++........++..|++.+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~-~~~--------n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g 69 (511) T protein:vir:93 1 MLKVNEFETDTDLRGNINYL--FN-DEA--------NVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEG 69 (511) T ss_pred Cccccchhhhhhhhhhhhhh--hh-hhh--------CCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcc Confidence 7765554 1222221000 00 000 0000011 122355666666665444444444443332210 Q ss_pred -----------h----------hHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-Hh Q lcl|NC_020866. 74 -----------R----------DPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DC 131 (547) Q Consensus 74 -----------~----------D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a 131 (547) . -++..-.+......+.+.+.++.+.++ +..+.+.++++.-+|+.....+. ++ T Consensus 70 ~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~-----~~~~~l~~~~~~n~~~~~~~~~~~~~ 144 (511) T protein:vir:93 70 KTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK-----DVLEVIEAFNDLNDVESHNRSLGLDL 144 (511) T ss_pred cCccccccCcCcccccCcceeecchHHHHHHHHhhhhcccCeeeccCCh-----HHHHHHHHHHhhcCHhHHHHHHHHHH Confidence 0 133455666677777788877765433 23456788887778888888887 68 Q ss_pred hhhcCeeeeeEEeecCCeeeEEeeEeeccceeE--ecccCCccee----ec-----cCCCCc----ee-eccCceEEEee Q lcl|NC_020866. 132 ISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATPL----LL-----DDNGAE----VP-LPAFRFIFADI 195 (547) Q Consensus 132 ~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~~----~~-----~~~~~g----~~-l~~~k~i~~~~ 195 (547) ..||.+ .+++|...+|... +..++|+... ++......++ .. ++.... .. +.+.+...+.. 
T Consensus 145 ~~~G~a-y~~vy~de~~~~~---i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~ 220 (511) T protein:vir:93 145 SIYGKA-YELMIRNQDDETR---LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLT 220 (511) T ss_pred HhcCee-EEEEEeCCCCceE---EEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEe Confidence 889964 5677755556543 4445555432 2211111100 00 000000 00 01111111100 Q ss_pred c-------------------------CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHH Q lcl|NC_020866. 196 P-------------------------AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDK 250 (547) Q Consensus 196 ~-------------------------~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~ 250 (547) . .-.+++.|.|.+..+....--=...+..++..++.|..|+++.+-....+..+. T Consensus 221 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~ 300 (511) T protein:vir:93 221 SRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEV 300 (511) T ss_pred cCCCccccccccccccccCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhh Confidence 0 012356789999988666555556788899999999999887663333333322 Q ss_pred HHHHHH-HHHHhc-----CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHH Q lcl|NC_020866. 251 DTLFRA-VANIAG-----DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEH 324 (547) Q Consensus 251 ~~l~~a-l~~~~~-----~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh 324 (547) ....+. +..+.. ....-...+.+++++.... 
....++.+++.+.+.|.+.--...++.++.+| ..+|..- T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~---~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~-n~Sg~Al 376 (511) T protein:vir:93 301 RKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQY---DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQSGEAM 376 (511) T ss_pred cccccccceecccccccccccccCCCCcceeEEeecC---CHHHHHHHHHHHHHHHHHHhCCcccccccccc-cchHHHH Confidence 211110 000000 0111223567788886443 23457888888888887663333344443222 2223322 Q ss_pred H-HH--HHHHHHHHHHHHHHHHHHHHHHHHHHh---cCCC----cccccEEEecCccccCHHHHHHHHHHHHhCCCCCCH Q lcl|NC_020866. 325 R-LV--QEDIETSDAMALAAILNRDLIRPWIQL---EYGP----QKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEE 394 (547) Q Consensus 325 ~-~v--~~~~~~ad~~~i~~~ln~~li~~l~~~---Nfg~----~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~ 394 (547) . .. ....+..-.+.+...+.+ +++.++.+ +... .....++.|....+.+..+.++.+.++. |+ ++. T Consensus 377 ~~~~~~l~~k~~~k~~~f~~~l~~-~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~--g~-iS~ 452 (511) T protein:vir:93 377 KYKLFGLEQRTKTKEGLFTKGLRR-RAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-ISQ 452 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHHh--cc-Cch Confidence 1 11 122333344556666643 55555543 2111 1123578888888999999999998884 65 898 Q ss_pred HHHHHHhCC-CCCCCcchhcccccccccc----cccccccccccCCccccccccccccchh Q lcl|NC_020866. 395 NEVRSRFGF-SQPKPGAKLLRPLENPATQ----AAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 395 ~~~~e~~Gl-p~p~~~e~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) +.+.+.++. +.|+ +++-......... .......+...............+.... 
T Consensus 453 et~~~~l~~v~d~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 453 TTLMSLFSFFQDPE--LEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHhCCCCCCHH--HHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcccccccccC Confidence 888888864 3222 2221111110000 0000000000000000000000000000 No 166 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=98.62 E-value=1.8e-07 Score=57.59 Aligned_cols=403 Identities=10% Similarity=0.048 Sum_probs=183.3 Q ss_pred CC-----CCCeeeCCCCCcccchhhhhhhhhh----hhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHH-- Q lcl|NC_020866. 1 MA-----QKPVLLDRWGNPVQRAQLTEEVAAP----TVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAE-- 69 (547) Q Consensus 1 ma-----~~~~~~~~~g~~~~~~~l~~~~a~~----~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~-- 69 (547) |. .+-.|-.. ........+.+.+... ...-.++....|.+..-.+.+.... .++......... T Consensus 1 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~-----~~~~~~~~~~~~~~ 74 (479) T protein:vir:79 1 MLNIYISETDLIKVQ-LKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYY-----LLDGAKVDDFTKVN 74 (479) T ss_pred CCCceecccceEeec-cccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCccccccccc-----ccccccccccccCc Confidence 22 11111111 1111111222222111 0111111111111110000000000 000000000000 Q ss_pred -HHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecC Q lcl|NC_020866. 70 -VIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSE 147 (547) Q Consensus 70 -~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~ 147 (547) .+ ..+...-.+.+....+.+.+..+.+.++ +..++++.++++ +|++....+. ++..||.+. 
+++|...+ T Consensus 75 ~ki--~~~~~~~Ivd~~~~~l~g~p~~~~~~~~-----~~~~~~~~~~~n-~~~~~~~~~~~~~~~~G~~~-~~v~~d~~ 145 (479) T protein:vir:79 75 NKA--INNYHKLLVDQKVGYSVGNPIVFNADDD-----NLTKLLNDLLGE-EFDDTITELYLNASNKGVEW-LHPYINRK 145 (479) T ss_pred cee--ecchHHHHHHHHHhhhhcCCceeccCCH-----HHHHHHHHHHhc-CHHHHHHHHHHHHHhcCeEE-EEEEeCCC Confidence 00 1233445666777778888888765332 233456666654 7888777776 688899774 56675555 Q ss_pred CeeeEEeeEeeccceeE--ecccCCcce------eecc-CCCCc---ee-ec---------------------------- Q lcl|NC_020866. 148 GQYFPDKLIWRDPRHFR--FQKSDLATP------LLLD-DNGAE---VP-LP---------------------------- 186 (547) Q Consensus 148 g~~~~~~l~~~~~~~~~--~~~~~~~~~------~~~~-~~~~g---~~-l~---------------------------- 186 (547) |... +..++|+.+. ++......+ .... ..+.. .. .. T Consensus 146 ~~~~---i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 222 (479) T protein:vir:79 146 GEFK---YVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTD 222 (479) T ss_pred CceE---EEEEccceeEEEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccc Confidence 5543 3444444332 111110000 0000 00000 00 00 Q ss_pred -------------cCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHH Q lcl|NC_020866. 187 -------------AFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTL 253 (547) Q Consensus 187 -------------~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l 253 (547) ++..|-+.+ -.++++|.|.+..+....--=...+.+++..++.|..|+++.+--.+...++ T Consensus 223 ~~~~~~~~~~~~~~~~~vPvv~--~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~---- 296 (479) T protein:vir:79 223 IQEGHFRINNKEQGWGKVPFIP--FKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQE---- 296 (479) T ss_pred cccccccccccccCCCcccEEE--ecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCcccccc---- Confidence 011110000 1246788999987666554555577889999999999998876322222111 Q ss_pred HHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHH---HHHH Q lcl|NC_020866. 
254 FRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRL---VQED 330 (547) Q Consensus 254 ~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~---v~~~ 330 (547) ...++.....+.++.+.++++++... ....++..++.+.+.|...--+..++.++. |+ ++|..-.. -... T Consensus 297 --~~~~~~~~~~i~~~~~~~~~~l~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-gn-~Sg~Ai~~~~~~l~~ 369 (479) T protein:vir:79 297 --FIDNIRYYKSIKVDGGGGVDKLEINI---PVEAKKELLDRLEKNIIIFGQGVNPESQNT-GD-KSGVALKFLYSLLDL 369 (479) T ss_pred --chhhhhhccceecCCCCcceEEeccC---CHHHHHHHHHHHHHHHHHHhCccccccccc-cc-hhHHHHHHHHHHHHH Confidence 12334455667789999999987553 234688888888888877644444443332 22 22322211 1222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh-cC--C--CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-C Q lcl|NC_020866. 331 IETSDAMALAAILNRDLIRPWIQL-EY--G--PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-S 404 (547) Q Consensus 331 ~~~ad~~~i~~~ln~~li~~l~~~-Nf--g--~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p 404 (547) .+..-.+.+...+. ++++.++.+ +. + .+...+.+.|....+.+.++.++.+.+++ |+ ++.+.+.+.++. + T Consensus 370 k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl~--g~-iS~et~l~~l~~v~ 445 (479) T protein:vir:79 370 KCSKTEKKFKKAIR-ELLWFVCEYLKISGNKSYDYKTVQITFNHSMIINEAEKIDMAAKST--GI-VSDETIVSNHPWVE 445 (479) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CcHHHHHHhCCCCC Confidence 33334445555553 355555543 21 1 12234688888888999999999998885 66 899988888874 3 Q ss_pred CCCCcchhcccccccccccccccccccccCCccccc Q lcl|NC_020866. 405 QPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAER 440 (547) Q Consensus 405 ~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (547) .++ +++-................++.......+. 
T Consensus 446 d~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 446 DVN--DELERLKKQEDTQKEYDDLIPNNQDGVIDET 479 (479) T ss_pred CHH--HHHHHHHHHHHHHHHHHhccCcccCCCcCcC Confidence 322 2221111111100000000000000000000 No 167 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.62 E-value=1.8e-07 Score=57.54 Aligned_cols=398 Identities=9% Similarity=0.011 Sum_probs=165.2 Q ss_pred CCCCCeeeCCCCCccc----chhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHh--hcCCCHHHHHHHH------ Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQ----RAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRA--ADHGDPVRYLDLA------ 68 (547) Q Consensus 1 ma~~~~~~~~~g~~~~----~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~--a~~g~~~~~~~L~------ 68 (547) |=. +|.......++ ...+...+- ++.-.+++.....|.+- -=.|+...+.... T Consensus 1 m~~--~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~ 65 (499) T protein:vir:80 1 MIN--QIIAGVKGVMRRMGLLKSLKDVTD-------------HKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGN 65 (499) T ss_pred Chh--HHHHHHHHHHHHhccccchhhhhc-------------CCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCC Confidence 210 00000000000 000000000 00000111111111110 0001100000000 Q ss_pred ---HHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEe Q lcl|NC_020866. 69 ---EVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWN 144 (547) Q Consensus 69 ---~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~ 144 (547) +... 
.-..-....++-..-+++.+-.|...+ ++..+++.++++.-.|...+..++ .|..+|-+++-+.|+ T Consensus 66 ~~~~~~~-s~n~~~~iv~~~a~~l~~ep~~i~~~d-----~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D 139 (499) T protein:vir:80 66 PVNRRQL-SMNLPKVTAKYMSKLLFNEKVKINIDD-----ETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHD 139 (499) T ss_pred cccccee-ecchHHHHHHHHHHhhhCCcceEeeCC-----HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEEC Confidence 0000 001112334455566667766665432 355678888888767888888877 599999999988886 Q ss_pred ecCCeeeEEeeEeeccceeEe--cccCCc------------------------------cee-----eccCCC--Cceee Q lcl|NC_020866. 145 TSEGQYFPDKLIWRDPRHFRF--QKSDLA------------------------------TPL-----LLDDNG--AEVPL 185 (547) Q Consensus 145 ~~~g~~~~~~l~~~~~~~~~~--~~~~~~------------------------------~~~-----~~~~~~--~g~~l 185 (547) .++ .+ ++.++++..|.. ...++. .+. ...... -|.++ T Consensus 140 ~~~-~~---~i~~v~a~~~~Pi~~d~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v 215 (499) T protein:vir:80 140 GNK-NV---KVSFATADCMYPLSNDSENVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKV 215 (499) T ss_pred CCC-cE---EEEEEcCCceEEEEecCCCeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCccc Confidence 543 22 233333332210 000000 000 000010 01111 Q ss_pred ---------ccC---------ceEEEeec----CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEE----- Q lcl|NC_020866. 186 ---------PAF---------RFIFADIP----AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRI----- 238 (547) Q Consensus 186 ---------~~~---------k~i~~~~~----~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~v----- 238 (547) ++. 
-|++++.+ ...++|+|.|.+..+....--=...+..|+.-++.....+.| T Consensus 216 ~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l 295 (499) T protein:vir:80 216 SLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFV 295 (499) T ss_pred chhhhccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhh Confidence 111 14444332 245779999999999876666566666676666653333332 Q ss_pred --EeCCCCCCHHHHHHHHHHHHHHhcCeeEEec---C--CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH-hcc-hh Q lcl|NC_020866. 239 --GKWQPGASEQDKDTLFRAVANIAGDCAAIIP---E--TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAV-LGQ-TT 309 (547) Q Consensus 239 --gk~~~~~~~~~~~~l~~al~~~~~~~~~vip---~--~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i-lGq-tl 309 (547) .+-..+..... .+. ....+..++ . +..|+.....- -.+.|...++.+-++|+..+ +++ ++ T Consensus 296 ~~~~~~~g~~~~~----~~~----~~~~~~~~~~~~~~~~~~i~~~~~~i---r~e~~~~~l~~~l~~i~~~~g~s~~~f 364 (499) T protein:vir:80 296 KTAVNLDGSTTQY----FDS----TDEAFFLYQGEQDDNGKAIKDISVEI---RSTEFIESINAMLRIYAMQVGLSAGTF 364 (499) T ss_pred hccCCCCCCcccC----CCc----ccceeeEeeccCCCCcCceeEecCcC---ChHHHHHHHHHHHHHHHHhcCCChhhc Confidence 11001111000 000 000011111 1 11233332211 11235455555555665443 332 22 Q ss_pred ccCccccccchhHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHh-------cCC-CcccccEEEecCccccCHHHHH Q lcl|NC_020866. 310 TTDAVAGGHAVSQEHRLVQ--EDIETSDAMALAAILNRDLIRPWIQL-------EYG-PQKRYPRIKIARPKTEDLTKLA 379 (547) Q Consensus 310 ts~~~ggs~a~~~vh~~v~--~~~~~ad~~~i~~~ln~~li~~l~~~-------Nfg-~~~~~p~~~~~~~~~ed~~~~a 379 (547) +.++ +|...+-++...-. ...+..-.+.+...|. +|++.++.+ +.. 
.....+.+.|+..-..|.++.+ T Consensus 365 g~~~-~g~~TAtei~s~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~ 442 (499) T protein:vir:80 365 TFDE-NGLKTATEVVSEKSETYQTKNSHSQLIEQGIK-EMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTI 442 (499) T ss_pred CCCc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHH Confidence 2222 23222233322221 2233444566666774 466665543 111 1234578889988899999999 Q ss_pred HHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccc Q lcl|NC_020866. 380 SSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTA 438 (547) Q Consensus 380 ~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (547) +...+++..|+--.+.++.+.+|+++++-.+++-......... .+...++....... T Consensus 443 ~~~~~~~~~Gi~S~et~l~~~~~~~d~ea~~el~~i~~E~~~~--~~~~d~~g~~ge~e 499 (499) T protein:vir:80 443 NRYTTAKNQGMIPLKIALQRAWNITEAEADEWAEMLAKEKQAE--IPNNDMTGIFGEEE 499 (499) T ss_pred HHHHHHHHcCCCCHHHHHhhcCCCChHHHHHHHHHHHHHhhcC--CCCCCccccCCCCC Confidence 9999999999845556778888987644333332222111110 01001100000000 No 168 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=98.60 E-value=2.1e-07 Score=57.20 Aligned_cols=422 Identities=11% Similarity=0.010 Sum_probs=197.5 Q ss_pred CCCCCee---eCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHh---- Q lcl|NC_020866. 1 MAQKPVL---LDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEE---- 73 (547) Q Consensus 1 ma~~~~~---~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~---- 73 (547) |.+.+.+ .|..|+--.+ .. ..+. 
........--....++.++...++........++..|.+.+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~-~~~n----~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i 73 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYL--FN-DEAN----VVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN 73 (511) T ss_pred Cccccchhhhhhhhhhhhhh--hh-hhhC----CccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcc Confidence 7765554 1223221000 00 0000 0000000001123356666666665443333344333332210 Q ss_pred -----------------hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-Hhhhhc Q lcl|NC_020866. 74 -----------------RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKG 135 (547) Q Consensus 74 -----------------~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G 135 (547) ...+..-.+.+....+.+.+..+.+.++ +..+++.++++.-.|+.....+. ++..|| T Consensus 74 ~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~-----~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G 148 (511) T protein:vir:96 74 LVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK-----DVLEAIEAFNDLNDVESHNRSLGLDLSIYG 148 (511) T ss_pred ccccCcCcccccCcceeecchHHHHHHHHHhhhccCCceeecCch-----HHHHHHHHHHhhcCHHHHHHHHHHHHHhcC Confidence 0133455666677777888888875443 23467888888778888888877 688899 Q ss_pred CeeeeeEEeecCCeeeEEeeEeeccceeEe--cccCCccee----e----------------------------ccCCCC Q lcl|NC_020866. 136 YSFTEIVWNTSEGQYFPDKLIWRDPRHFRF--QKSDLATPL----L----------------------------LDDNGA 181 (547) Q Consensus 136 ~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~--~~~~~~~~~----~----------------------------~~~~~~ 181 (547) . +.+++|...+|... +..++|+.+.. +......++ . ....+. 
T Consensus 149 ~-a~~~vy~ded~~~~---i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~ 224 (511) T protein:vir:96 149 K-AYELMIRNQDDETR---LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTN 224 (511) T ss_pred e-eEEEEEeCCCCceE---EEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCC Confidence 7 56777765556543 44445544321 111101100 0 000000 Q ss_pred ceee---------ccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHH Q lcl|NC_020866. 182 EVPL---------PAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDT 252 (547) Q Consensus 182 g~~l---------~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~ 252 (547) +... .++..|-+.. -.++..|.|.+..+....---...+..++..++.|+.|+++.+-....+..+... T Consensus 225 ~~~~~~~~~~~~~~~~~~vPvv~--~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~ 302 (511) T protein:vir:96 225 GLKLTPRENGFESHSFERMPITE--FSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRK 302 (511) T ss_pred cccccccccccccccCCceeeEE--ecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcc Confidence 0000 0011111111 1235678999998877666666788899999999999988866422233222211 Q ss_pred HHH-H---HHHH--hcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHH-H Q lcl|NC_020866. 253 LFR-A---VANI--AGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEH-R 325 (547) Q Consensus 253 l~~-a---l~~~--~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh-~ 325 (547) ..+ . +... ....+.-...+.+++++.... ....++..++.+.+.|...--...++.++.+| ..+|..- . 
T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~---~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~-n~Sg~Al~~ 378 (511) T protein:vir:96 303 QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQY---DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQSGEAMKY 378 (511) T ss_pred cccccceecccccccccccccCCCCcceeEEeecC---CHHHHHHHHHHHHHHHHHHhCCcccccccccc-cchHHHHHH Confidence 111 0 0000 011111233467788886443 23357888888888886653333344443222 2222221 1 Q ss_pred H--HHHHHHHHHHHHHHHHHHHHHHHHHHHh---cCC----CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHH Q lcl|NC_020866. 326 L--VQEDIETSDAMALAAILNRDLIRPWIQL---EYG----PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENE 396 (547) Q Consensus 326 ~--v~~~~~~ad~~~i~~~ln~~li~~l~~~---Nfg----~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~ 396 (547) . -....+..-.+.+...+++ +++.++.+ +.. .+..-.++.|....+.+..+.++.+.++ .|+ ++.+. T Consensus 379 ~~~~l~~k~~~k~~~~~~~l~~-~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~G~-iS~et 454 (511) T protein:vir:96 379 KLFGLEQRTKTKEGLFTKGLRR-RAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGK-ISQTT 454 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hcc-CChHH Confidence 1 1222333445566666654 45555443 211 1223468888888899999999999887 466 88888 Q ss_pred HHHHhCC-CCCCCcchhccccccccc---c-cccccccccccCCccccccccccccchh Q lcl|NC_020866. 397 VRSRFGF-SQPKPGAKLLRPLENPAT---Q-AAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 397 ~~e~~Gl-p~p~~~e~~~~~~~~~~~---~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) +.+.++. +.|+ +++-........ . .......+.............+.+.... 
T Consensus 455 ~l~~l~~v~D~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 455 LMSLFSFFQDPE--LEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHhCCCCCCHH--HHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCcccccccccC Confidence 8888874 3322 222111111000 0 0000000111000001111111100000 No 169 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=98.57 E-value=2.6e-07 Score=56.68 Aligned_cols=427 Identities=11% Similarity=0.025 Sum_probs=184.1 Q ss_pred CCCCCee---eCCCC----------Ccc-cchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHH Q lcl|NC_020866. 1 MAQKPVL---LDRWG----------NPV-QRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLD 66 (547) Q Consensus 1 ma~~~~~---~~~~g----------~~~-~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~ 66 (547) |-+..-. .|-.+ +-. ....+...+ .-....+.+.+..|.... -.++... +..-.|.-..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~-~~~~~~i~~~i~~~~~~~--~~r~~~l-~~YY~g~~~i~~~ 76 (512) T protein:vir:97 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDL-LQNINEVSKYIEHHMDYQ--RPRLKVL-SDYYEGKTKNLVE 76 (512) T ss_pred CccceeccCceeeeeCceeeeccccccccccCchhhhh-hhhHHHHHHHHHHHHHhh--HHHHHHH-HHHhcccCccccc Confidence 6654321 11111 000 000000000 001111222222221100 0011111 1010010000000 Q ss_pred ---HHHHH----HhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCee Q lcl|NC_020866. 67 ---LAEVI----EERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSF 138 (547) Q Consensus 67 ---L~~~m----~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv 138 (547) .-+.. .-......-.++.....+.+.+..+.+.++ +..+.+.++++.-+|+....++. 
++..||.+ T Consensus 77 ~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~-----~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a- 150 (512) T protein:vir:97 77 LTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDK-----DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKA- 150 (512) T ss_pred cCcccccccCcceeecchHHHHHHHHhhhhcccCceeccCCh-----HHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeE- Confidence 00000 001233445666666777888888875433 23356888887778888888777 68889975 Q ss_pred eeeEEeecCCeeeEEeeEeeccceeE--ecccCCccee----e--cc--CCC-----Cceee-ccCceEEEeec------ Q lcl|NC_020866. 139 TEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATPL----L--LD--DNG-----AEVPL-PAFRFIFADIP------ 196 (547) Q Consensus 139 ~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~~----~--~~--~~~-----~g~~l-~~~k~i~~~~~------ 196 (547) .+++|.-.+|.+. +..++|+... ++......++ . .. +.. ....+ .+.+...+... T Consensus 151 y~~vy~ded~~~~---i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~ 227 (512) T protein:vir:97 151 YELMIRNQDDETR---LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK 227 (512) T ss_pred EEEEEeCCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccc Confidence 5677755555543 3444454332 1111100000 0 00 000 00000 11111111000 Q ss_pred -------------------CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHH- Q lcl|NC_020866. 197 -------------------AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRA- 256 (547) Q Consensus 197 -------------------~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~a- 256 (547) .-.+++.|.|.+..+....=--...+..++..++.|..|+++.+-....+..+....... 
T Consensus 228 ~~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~ 307 (512) T protein:vir:97 228 LTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEAN 307 (512) T ss_pred ccccccccccccCcccceEeecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcc Confidence 012357789999887766655566788899999999999888663222232222211110 Q ss_pred HH------HHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHH-H--H Q lcl|NC_020866. 257 VA------NIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHR-L--V 327 (547) Q Consensus 257 l~------~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~-~--v 327 (547) +. ........-...+.+++++.... ....++.+++.+.+.|.+.--...++.++.+| ..+|..-. . - T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~---~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~g-n~Sg~Al~~~~~~ 383 (512) T protein:vir:97 308 VLFLEPTVYENRDTGIETEGSVDGGYIYKQY---DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQSGEAMKYKLFG 383 (512) T ss_pred cccccccchhhcccccCCCCCcceEEEeecC---CHHHHHHHHHHHHHHHHHHhCCcccCcccccc-cchHHHHHHHHHH Confidence 00 01111111234567788886442 23358888888888887653222333333222 22333221 1 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh-c-CC-----CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHH Q lcl|NC_020866. 328 QEDIETSDAMALAAILNRDLIRPWIQL-E-YG-----PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSR 400 (547) Q Consensus 328 ~~~~~~ad~~~i~~~ln~~li~~l~~~-N-fg-----~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~ 400 (547) ....+..-.+.+...+++ +++.++.+ + .+ ....-.++.|....+.+..+.++.+.+|+ |+ ++.+.+.+. 
T Consensus 384 l~~ka~~k~~~f~~~l~~-~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~--gi-iS~et~~~~ 459 (512) T protein:vir:97 384 LEQRTKTKEGLFTKGLRR-RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-ISQTTLMSL 459 (512) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CchHHHHHh Confidence 222334445566666643 55555443 1 11 11223578888888999999999999885 66 899988888 Q ss_pred hCC-CCCCCcchhcccccccc----cccccccccccccCCccccccccccccchh Q lcl|NC_020866. 401 FGF-SQPKPGAKLLRPLENPA----TQAAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 401 ~Gl-p~p~~~e~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) ++. +.|+ +++-....... .........+........+......+.... T Consensus 460 l~~v~d~~--~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 460 FSFFQDPE--LEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred CCCCCCHH--HHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCccccccccC Confidence 875 3332 22211111000 000001111111111111111111100000 No 170 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.57 E-value=2.7e-07 Score=56.63 Aligned_cols=419 Identities=10% Similarity=0.020 Sum_probs=172.8 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLG 80 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s 80 (547) |+-...| |.. . -...|.+..... ..-.......|-+.. 
+ ++.....-......+ .+ ......- T Consensus 1 ~~~~~~~-d~~--~-~i~~L~~~~~~~-~~r~~~~~~Yy~g~~-~-------i~~~~~~~~~~~~~~--~~--~~n~~~~ 63 (488) T protein:vir:23 1 MAETESI-DPE--K-LRDQLLDAFENK-QNELKSSKAYYDAER-R-------PDAIGLAVPLDMRKY--LA--HVGYPRT 63 (488) T ss_pred CCcccCC-CHH--H-HHHHHHHHHHHH-HHHHHHHHHHHhccc-c-------hhhcCcccchhhhhh--hh--hcchHHH Confidence 5432221 100 0 001111111110 000111111111100 0 000000000000000 00 1222233 Q ss_pred HHHHHHHHHhcCCceEecC----CCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecC------Ce Q lcl|NC_020866. 81 VIGTRKRSVSQLDITVEPA----SDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSE------GQ 149 (547) Q Consensus 81 ~l~~Rk~~v~~~~w~v~p~----~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~------g~ 149 (547) ++++....+.-..|.+-.. .....+.+..+.+.++++.-+|+.....+. ++..||.| ++++|...+ +. T Consensus 64 ivd~~a~~l~~~Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a-~~~v~~~~~~~~~~~~~ 142 (488) T protein:vir:23 64 YVDAIAERQELEGFRIPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTA-YITISMPDPEVDFDVDP 142 (488) T ss_pred HHHHHHHhhhccceeccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCce-EEEEecCCcccccCCCC Confidence 3333332222223333211 112233444566777787778888888766 68889997 556654211 10 Q ss_pred eeEEeeEeeccceeEe--c---------------ccCC-cc----------eeeccCCCCc-------eeeccCceEEEe Q lcl|NC_020866. 150 YFPDKLIWRDPRHFRF--Q---------------KSDL-AT----------PLLLDDNGAE-------VPLPAFRFIFAD 194 (547) Q Consensus 150 ~~~~~l~~~~~~~~~~--~---------------~~~~-~~----------~~~~~~~~~g-------~~l~~~k~i~~~ 194 (547) ..+ .|..++|+.+.. + .++. .. .......+.. -++..--++.+. 
T Consensus 143 ~~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~ 221 (488) T protein:vir:23 143 EVP-LIRVEPPTALYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPVIPIS 221 (488) T ss_pred Ccc-eEEEeccceeEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcceEEec Confidence 001 122333332110 0 0000 00 0000011110 011111245566 Q ss_pred ecCCcCccccchhHHHHHHHHH-HHHHHHHHHHHHHHHcCCCeEEEe--CCCCCCHHHHHHHHHHHHHHhcCeeEEecCC Q lcl|NC_020866. 195 IPAKSGILLRSGLARVAAWGWM-FKAFTARDWAIFTQTYGQPLRIGK--WQPGASEQDKDTLFRAVANIAGDCAAIIPET 271 (547) Q Consensus 195 ~~~~~~~p~G~gll~~~~~~~~-~K~~~~~~w~~f~E~~G~P~~vgk--~~~~~~~~~~~~l~~al~~~~~~~~~vip~~ 271 (547) ++.+.+.++|.|-+........ -=...+..++...+-|..|.++.+ .+......+.. ...+-.++.+...++++| T Consensus 222 n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~--~~~~~~~~~~~v~~~~~g 299 (488) T protein:vir:23 222 NRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAET--GQRMFDAYMARILAFEGG 299 (488) T ss_pred cccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCcccccccccc--cchhhhhhhhhhccCCCC Confidence 7777788899887754322221 223356667777788999977654 11111111000 011233444566778888 Q ss_pred ceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccC----ccccccchhHH---HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 272 MSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTD----AVAGGHAVSQE---HRLVQEDIETSDAMALAAILN 344 (547) Q Consensus 272 ~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~----~~ggs~a~~~v---h~~v~~~~~~ad~~~i~~~ln 344 (547) .+.++.+... .+...|...++-+-.+|+.. +-+.. +.+-+.++|.. ...-....++.-.+.+...+. 
T Consensus 300 ~~~~~~q~~~--~~~~~~~~~l~~~i~~~~~~----~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~ 373 (488) T protein:vir:23 300 EGAHAEQFSA--AELRNFVDALDALDRKAASY----SGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWE 373 (488) T ss_pred CCceeEecCC--CChHHHHHHHHHHHHHHhcc----cCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8888877543 33445655555444454432 22221 11111122322 222233344455566677774 Q ss_pred HHHHHHHHHhcCCCc----ccccEEEecCccccCHHHHHHHHHHHHhCCC-CCCHHHHHHHhCCCCCCCcchhcc---cc Q lcl|NC_020866. 345 RDLIRPWIQLEYGPQ----KRYPRIKIARPKTEDLTKLASSLDTLVRLGM-EIEENEVRSRFGFSQPKPGAKLLR---PL 416 (547) Q Consensus 345 ~~li~~l~~~Nfg~~----~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~-~v~~~~~~e~~Glp~p~~~e~~~~---~~ 416 (547) ++++.++.+.-+.. ..--++.|....+.++.+.++.+.+|++.|. .++.+.+.+.+|+-.... +++-. .. T Consensus 374 -~~~~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~-~~~~~~~~~~ 451 (488) T protein:vir:23 374 -QAMRLAYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVER-EQMRQWLEQD 451 (488) T ss_pred -HHHHHHHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHH-HHHHHHHHHH Confidence 36666655532211 1234678888888899999999999999884 489999999998743221 11110 00 Q ss_pred ccc-cc----ccccccccccccCCccccccccccccch Q lcl|NC_020866. 417 ENP-AT----QAAPPGMDPGAGDLKTAERKLKPLSGQA 449 (547) Q Consensus 417 ~~~-~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (547) ... .. 
...........+..++...+..+.+ .+ T Consensus 452 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~-~a 488 (488) T protein:vir:23 452 QKQGLGLIGSLYGASTPEGKPGEAPVGEPPAPEPD-AA 488 (488) T ss_pred HHHHHHHHHHHhccCCCcccCCCCCCCCCCCCCCC-CC Confidence 000 00 0000000011111111111111110 00 No 171 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.56 E-value=2.8e-07 Score=56.52 Aligned_cols=410 Identities=12% Similarity=0.062 Sum_probs=164.1 Q ss_pred hhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHH-HH--HHHHHHHH---hhhHHHHHHHHHHHHHHhcCC Q lcl|NC_020866. 20 LTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPV-RY--LDLAEVIE---ERDPHYLGVIGTRKRSVSQLD 93 (547) Q Consensus 20 l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~-~~--~~L~~~m~---~~D~~v~s~l~~Rk~~v~~~~ 93 (547) ++++.- .+. +.+..|-.+.....++..-.+ |... .+ ...-+.+. -......-+++.....+.... T Consensus 1 ~~t~~~--~i~---~L~~~~~~~~~r~~~l~~Yy~----G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g 71 (480) T protein:vir:78 1 MTTYHE--HVE---RLQGLLARDLPNLLEAEAYRN----GTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEG 71 (480) T ss_pred CCCHHH--HHH---HHHHHHHHHHHHHHHHHHHHh----ccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCc Confidence 222211 011 111111111101111111111 1000 00 00000000 011222222333333333333 Q ss_pred ceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeec------CCeeeEEeeEeeccceeEe- Q lcl|NC_020866. 94 ITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTS------EGQYFPDKLIWRDPRHFRF- 165 (547) Q Consensus 94 w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~------~g~~~~~~l~~~~~~~~~~- 165 (547) +.+ ++++ +..+.+.++++.-+|+.....++ +|..||.| ++++|... +|... +..++|+.... 
T Consensus 72 ~~~--~~d~----~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a-y~~v~~~~~~~~d~~g~~~---i~~~~p~~~~~~ 141 (480) T protein:vir:78 72 FRI--SEDS----EGLEELWNWWQANDLDEESVLGHDDSLTFGRS-YITVSHPDVESGDPAGIPL---IRVESPLYMYAE 141 (480) T ss_pred eec--CCCc----hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCce-EEEEecCccccCCCCCeeE---EEEEcccceEEE Confidence 432 1222 22345666676667888888876 79999986 56777421 22222 33333332211 Q ss_pred -ccc----------------CCc------------ceeeccCCC---C----cee----eccCceEEEeecCCcCccccc Q lcl|NC_020866. 166 -QKS----------------DLA------------TPLLLDDNG---A----EVP----LPAFRFIFADIPAKSGILLRS 205 (547) Q Consensus 166 -~~~----------------~~~------------~~~~~~~~~---~----g~~----l~~~k~i~~~~~~~~~~p~G~ 205 (547) +.. +.. .......++ . +.. ++.--++.|.++.+.+.++|. T Consensus 142 ~D~~~~~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~ 221 (480) T protein:vir:78 142 LDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGR 221 (480) T ss_pred EcCCCccceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCc Confidence 100 000 000000000 0 011 122234556677778889999 Q ss_pred hhHHH-HHHHHHHHHHHHHHHHHHHHHcCCCeEEEeC--CCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCC Q lcl|NC_020866. 206 GLARV-AAWGWMFKAFTARDWAIFTQTYGQPLRIGKW--QPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSI 282 (547) Q Consensus 206 gll~~-~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~--~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~ 282 (547) |-+.. +....=-=...+..++...+.|+.|.++.+- .....++..... + ....+.... ..|.+.++.+... 
T Consensus 222 s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~---~-~~~~~~~~~-~~~~~~~~~~~~~- 295 (480) T protein:vir:78 222 SEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT---L-DIYYGRILT-LASEAAKISEFKA- 295 (480) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccccch---h-hhhhhhhcc-CCCCCceEEecCc- Confidence 88764 3222222244567778888999999776641 111111111111 1 111122222 3455677776543 Q ss_pred CCchHHHHHHHHHHHHHHHHHHhcchhcc-CccccccchhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_020866. 283 GSSVDLYEKRINHLDQQISKAVLGQTTTT-DAVAGGHAVSQEH---RLVQEDIETSDAMALAAILNRDLIRPWIQLEYGP 358 (547) Q Consensus 283 ~~~~~~f~~li~~~d~~Isk~ilGqtlts-~~~ggs~a~~~vh---~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~ 358 (547) .+.+.|...++.+-.+|+.. .+-+... .+.+.+.++|..- ..-....++.-.+.+...|.+ +++.++.+.-.. T Consensus 296 -~~~~~~~~~l~~~i~~~~~~-~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~-~~~l~~~~~g~~ 372 (480) T protein:vir:78 296 -AELRNFAEEMEVFRKEAASI-TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER-AMRIAMQIMGRE 372 (480) T ss_pred -cCHHHHHHHHHHHHHHHhcc-cCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHcCCC Confidence 33445666666666666533 1111111 1111122233222 222233344445556666744 566666665211 Q ss_pred -cc--cccEEEecCccccCHHHHHHHHHHHHhCC-CCCCHHHHHHHhCCCCCCCcchhccccccccc-----cccccccc Q lcl|NC_020866. 359 -QK--RYPRIKIARPKTEDLTKLASSLDTLVRLG-MEIEENEVRSRFGFSQPKPGAKLLRPLENPAT-----QAAPPGMD 429 (547) Q Consensus 359 -~~--~~p~~~~~~~~~ed~~~~a~~~~~l~~~G-~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~-----~~~~~~~~ 429 (547) .. ..-.++|......++.+.++.+.+|+..| ..++.+.+.+.+|+.+.+ -+++......... ........ 
T Consensus 373 ~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~-~~~~~~~~~e~~~~~~~~~~~~~~~~ 451 (480) T protein:vir:78 373 VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ-REQMRDWDKQETEDMIDTLYSTTKAQ 451 (480) T ss_pred ccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhH-HHHHHHHHHHHHHHHHHHhhcccccc Confidence 11 12356777777788899999999999887 357899999999986432 2221111000000 00000000 Q ss_pred ccccCCccccccccccccchhhhhhcccchh Q lcl|NC_020866. 430 PGAGDLKTAERKLKPLSGQADLSRGDLSLNS 460 (547) Q Consensus 430 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (547) +...+.+.+.....+....+.-..++ ..+ T Consensus 452 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 480 (480) T protein:vir:78 452 ADATPKPTVTETKTETQTSPSGFNRT--KTR 480 (480) T ss_pred CCCCCCCCCCCCCCccccccCCCCcc--cCC Confidence 00000000000000000000000000 000 No 172 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.55 E-value=3e-07 Score=56.35 Aligned_cols=401 Identities=10% Similarity=0.037 Sum_probs=161.4 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhh--cCCCHHHHHHHHHHHHh---hh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAA--DHGDPVRYLDLAEVIEE---RD 75 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a--~~g~~~~~~~L~~~m~~---~D 75 (547) |-. +|......-+.+--....+ .. +..++.-.+++.++..+.+-. =.|+...+..-...... 
.+ T Consensus 1 m~~--~~~~~~~~~~~~~~~~~~~--------~~-~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~ 69 (496) T protein:vir:38 1 MIN--QIIAGVKGVMRRMGLLKAL--------KD-VKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNR 69 (496) T ss_pred Chh--HHHHHHHHHHHHhccchhh--------HH-HHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCcccc Confidence 210 0110000000000000000 00 000111111222222211100 00100000000000000 00 Q ss_pred -----HHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCe Q lcl|NC_020866. 76 -----PHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQ 149 (547) Q Consensus 76 -----~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~ 149 (547) ..-...+.+-...+++.+-.|... +++..+++.++++.-.|.+.+..++ .|..+|-+++-+.|..++ . T Consensus 70 ~~~~~n~~k~i~~~~a~~l~~~p~~i~~~-----d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~-~ 143 (496) T protein:vir:38 70 RQLSMNLPKVTAKYMSKLLFNEKVKINID-----DKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNK-N 143 (496) T ss_pred ceeecchHHHHHHHHhhhhhCCcceEeeC-----ChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCC-c Confidence 011123344455566666666543 2345678888888778888888777 688999887777665433 3 Q ss_pred eeEEeeEeeccceeEe--cccCCc---------------------------ce-----eeccCCC--Cceee-------- Q lcl|NC_020866. 150 YFPDKLIWRDPRHFRF--QKSDLA---------------------------TP-----LLLDDNG--AEVPL-------- 185 (547) Q Consensus 150 ~~~~~l~~~~~~~~~~--~~~~~~---------------------------~~-----~~~~~~~--~g~~l-------- 185 (547) . ++..+++..|.. ...++. .+ +.....+ .|.++ T Consensus 144 ~---~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~ 220 (496) T protein:vir:38 144 V---KVSFATADCMYPLSNDSENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDD 220 (496) T ss_pred E---EEEEEcccceEEEEecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccc Confidence 2 233444433221 000100 00 0000000 01111 Q ss_pred -cc-------Cc--eEEEe----ecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe-------CCCC Q lcl|NC_020866. 
186 -PA-------FR--FIFAD----IPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGK-------WQPG 244 (547) Q Consensus 186 -~~-------~k--~i~~~----~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk-------~~~~ 244 (547) ++ .+ |++++ .+...++|+|.|.+..+....-.=...+..|+.-++. |.+.+++. .+.+ T Consensus 221 ~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~-~~~~i~v~~~~l~~~~~~~ 299 (496) T protein:vir:38 221 IEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAVNLD 299 (496) T ss_pred cccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh-cccceecchHHhhccCCCC Confidence 10 01 22222 2235678999999999876655555556666665554 44433331 1100 Q ss_pred CCHHHHHHHHHHHHHHhcCe-eEEec-----CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH-hcc-hhccCcccc Q lcl|NC_020866. 245 ASEQDKDTLFRAVANIAGDC-AAIIP-----ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAV-LGQ-TTTTDAVAG 316 (547) Q Consensus 245 ~~~~~~~~l~~al~~~~~~~-~~vip-----~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i-lGq-tlts~~~gg 316 (547) ... ... . ..... ..++. .+..|+.....- ..+.|...++.+-++|+..+ +++ +++.++ +| T Consensus 300 -g~~-~~~-~-----~~~~~~~~~~~~~~~~~~~~i~~~~~~i---~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~-~g 367 (496) T protein:vir:38 300 -GST-TQY-F-----DSTDEAFFLYQGDQDDNGKAIKDISVEI---RSTEFIESINAMLRIYAMQVGLSAGTFTFDE-NG 367 (496) T ss_pred -Ccc-ccC-C-----CCccceEEEeecCCCcccccceeecccc---CHHHHHHHHHHHHHHHHHhhCCChhhcCCCc-cc Confidence 000 000 0 00000 00111 111233332211 11235555555555665443 222 222232 23 Q ss_pred ccchhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHh-------cC-CCcccccEEEecCccccCHHHHHHHHHHHH Q lcl|NC_020866. 317 GHAVSQEHRLV--QEDIETSDAMALAAILNRDLIRPWIQL-------EY-GPQKRYPRIKIARPKTEDLTKLASSLDTLV 386 (547) Q Consensus 317 s~a~~~vh~~v--~~~~~~ad~~~i~~~ln~~li~~l~~~-------Nf-g~~~~~p~~~~~~~~~ed~~~~a~~~~~l~ 386 (547) ...+.++.... ....+..-.+.+...|. ++++.++.+ +. 
.....-+.+.|...-+.|.++.++.+.+++ T Consensus 368 ~~tAtei~~~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~ 446 (496) T protein:vir:38 368 LKTATEVVSEKSETYQTKNSHSQLIEQGIK-EMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAK 446 (496) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHH Confidence 22233343222 22223444555667774 466666533 21 122334688899888999999999999999 Q ss_pred hCCCCCCH-HHHHHHhCCCCCCCcchhcccccccccccccccccccccCCcccc Q lcl|NC_020866. 387 RLGMEIEE-NEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAE 439 (547) Q Consensus 387 ~~G~~v~~-~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (547) .+|+ ++. .++.+.+|+++++-.+++-......+.. .+....+. .....+ T Consensus 447 ~~Gi-iS~et~l~~~~~~~d~ea~~el~ri~~E~~~~--~~~~d~~~-~~~~~e 496 (496) T protein:vir:38 447 NQGM-IPLKIALQRAWNITEAEADEWAEMLAKEKQAE--MPNNDMNG-IFGEEE 496 (496) T ss_pred hcCC-CCHHHHHHhcCCCChHHHHHHHHHHHHhhhcc--CccccccC-CCCCCC Confidence 9998 554 5667777987644333332222111111 01111000 000000 No 173 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=98.53 E-value=3.5e-07 Score=55.96 Aligned_cols=422 Identities=11% Similarity=0.085 Sum_probs=190.4 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHH-----HHHHhhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLA-----EVIEERD 75 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~-----~~m~~~D 75 (547) || |+ +.. .+.+.+...+...+...+..|-.+ ..++..+ ...-.|.-.....-. ....-.. 
T Consensus 1 ~~----~~------~~~-~~~~~~~~~~~~~i~~~i~~~~~~---~~~~~~l-~~Yy~g~~~i~~~~~~~~~~~~~ki~~ 65 (499) T protein:vir:10 1 MA----VV------IDK-DLLDDVNEPNIEAINYAIRELQNR---KKRLDKL-SDYYNGKQEIEKHEFDNATVEAANVMV 65 (499) T ss_pred Cc----cc------hhh-hHHhhhhcCCHHHHHHHHHHHHHH---HHHHHHH-HHHhccccchhcCCcCcCCCCcceeec Confidence 77 32 122 222222222333333333333211 1233221 222222111000000 0000024 Q ss_pred HHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeE-- Q lcl|NC_020866. 76 PHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFP-- 152 (547) Q Consensus 76 ~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~-- 152 (547) .+..-.+.+....+.+.+..+...++ +..+.+.++++.-+|+..+..+. ++..||.+ .+++|...+|...+ T Consensus 66 n~~~~Iv~~~~~~l~g~p~~~~~~~~-----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~-~~~v~~~~~g~~~~~~ 139 (499) T protein:vir:10 66 NHAKYITDMNVGFMTGNPVKYVAEKG-----KNIDDILEVFNQIDIHKHDIELEKDLSVFGYG-YELLYLKKTDPISVRD 139 (499) T ss_pred chHHHHHHHHhhhhcccCceeecCCh-----hHHHHHHHHHhhcCHhHHHHHHHHHHHhcCce-EEEEEecccccccccc Confidence 56667777888888888888875432 22345677777777887777766 68889974 56666554442211 Q ss_pred ------------EeeEeecccee-EecccCCc-c------eeeccCC--CC---cee-ecc------------------- Q lcl|NC_020866. 153 ------------DKLIWRDPRHF-RFQKSDLA-T------PLLLDDN--GA---EVP-LPA------------------- 187 (547) Q Consensus 153 ------------~~l~~~~~~~~-~~~~~~~~-~------~~~~~~~--~~---g~~-l~~------------------- 187 (547) .++..++|+.. .+..+... . .....+. .. ... +.+ T Consensus 140 ~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~ 219 (499) T protein:vir:10 140 ELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDP 219 (499) T ss_pred cccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcce Confidence 12444444432 11111000 0 0000000 00 000 001 Q ss_pred --------CceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHH Q lcl|NC_020866. 
188 --------FRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVAN 259 (547) Q Consensus 188 --------~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~ 259 (547) +..|-+.+ -.++++|.|.+..+....--=...+..++..++.|+.|+++.+-.....+.+. ... T Consensus 220 ~~~~~~~~~g~vPvv~--~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~------~~~ 291 (499) T protein:vir:10 220 IVYDGENLFGAVPIIE--FRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDD------IQR 291 (499) T ss_pred ecccccCCCCccceEE--ecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccch------hhh Confidence 11111110 12356788988887666555555778999999999999998873221111111 112 Q ss_pred HhcCeeEEe--cCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHH---HHHHHHH Q lcl|NC_020866. 260 IAGDCAAII--PETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLV---QEDIETS 334 (547) Q Consensus 260 ~~~~~~~vi--p~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v---~~~~~~a 334 (547) +.......+ +.+.++++++... ....++.+++.+.+.|...--...++.+..+ +..+|..-.-. ....+.. T Consensus 292 ~~~~~~~~~~~~~~~d~~~l~~~~---~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-gn~Sg~Al~~~~~~l~~k~~~ 367 (499) T protein:vir:10 292 LKRGAIEAPPREEGADIEWLTKSF---DETQVNLLSQSIENDIHKISYVPNMNDEKFM-GNVSGEAMKFKLFGLENLLSI 367 (499) T ss_pred hhhcceeccCCCCCCcceEEeccC---CHHHHHHHHHHHHHHHHHHhCcccCCchhhc-ccchHHHHHHHHHHHHHHHHH Confidence 222333333 3566788886443 2345888999999988774222223333222 22223222111 2223444 Q ss_pred HHHHHHHHHHHHHHHHHHHh-cC-C--CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCc Q lcl|NC_020866. 335 DAMALAAILNRDLIRPWIQL-EY-G--PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPG 409 (547) Q Consensus 335 d~~~i~~~ln~~li~~l~~~-Nf-g--~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~ 409 (547) -.+.+...++ ++++.++.+ |. + .+.....+.|....+.+..+.++.+.+| .|+ ++.+.+.+.++. +.+++. 
T Consensus 368 k~~~~~~~l~-~~~~li~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~-iS~et~~~~l~~v~d~~~E 443 (499) T protein:vir:10 368 KQRYFFDGLR-RRLKLIQTIVNIKGANDDASGCKISLVANIPSNLSDVVNNVKNA--DGI-IPRKYTYSWLPDVDNPQDV 443 (499) T ss_pred HHHHHHHHHH-HHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHH--hcc-CChHHHHHhCCCCCCHHHH Confidence 4566777774 466666664 21 1 1223458889888899999999999998 465 898888888864 323211 Q ss_pred chhccccccc---ccccccccccccccCCccc---cccccccccchhhhhhcccchhhhcccccccc Q lcl|NC_020866. 410 AKLLRPLENP---ATQAAPPGMDPGAGDLKTA---ERKLKPLSGQADLSRGDLSLNSEMPSTGQKTG 470 (547) Q Consensus 410 e~~~~~~~~~---~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (547) -+-+...... .......+..+........ +.+............ .+.... T Consensus 444 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~ 499 (499) T protein:vir:10 444 IDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSSENDKEAGSNHNQS-----------HRTRAV 499 (499) T ss_pred HHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccCCCCCCCccccccC-----------CCCCCC Confidence 0111111000 0000000111111000000 000000000000000 001000 No 174 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.51 E-value=4e-07 Score=55.65 Aligned_cols=413 Identities=11% Similarity=0.033 Sum_probs=163.0 Q ss_pred CCCCCeeeCCCCCcccch-hhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCH-HHH--HHHHHHHHh--- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRA-QLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDP-VRY--LDLAEVIEE--- 73 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~-~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~-~~~--~~L~~~m~~--- 73 (547) |+. |+... .++.+. ....+.+.+..+. .|+... +..-.|.- ..+ ..+-..+.. 
T Consensus 1 ~~~----------~~~~~~~~~~~~---~~~~l~~~~~~~~------~rl~~l-~~Yy~G~~~i~~~~~~~~~~~~~~~~ 60 (484) T protein:vir:77 1 MTS----------PLQKQENVDPEK---AREEMLNLFTERT------QDLGDN-TAYYESERRPDAVGVTVPQQMQKLLA 60 (484) T ss_pred CCC----------cccccCCCCHHH---HHHHHHHHHHHHH------HHHHHH-HHHHhccccchhcccccchhHHhhhh Confidence 331 11100 000000 0001111111111 111111 11111110 000 000001100 Q ss_pred hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeee- Q lcl|NC_020866. 74 RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYF- 151 (547) Q Consensus 74 ~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~- 151 (547) .-...+-++++....+....|.+ ++++.. .+.+.++++.-+|+....+++ ++..||.| ++++|...++... T Consensus 61 ~~n~~~~ivd~~~~~l~~~g~~~--~~~~~~----~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a-~~~v~~~~~~~~~~ 133 (484) T protein:vir:77 61 HVGYPRLYIDAIAARQELEGFRL--GGADKA----DEQLWDWWQANDLDIESTLGHTDSLVHGRS-YITISKPDPNIDPG 133 (484) T ss_pred hcCcHHHHHHHHHhhhccCceec--CCcchh----HHHHHHHHHhcCHhHHHHHHHHHHhhcCce-EEEEecCCCCcccc Confidence 01111223333333333334443 222222 234566666667888888877 68899996 5666654443210 Q ss_pred ----EEeeEeeccceeE--ecc---------------cCCc-c----------eeeccCCCCce-------eeccCceEE Q lcl|NC_020866. 152 ----PDKLIWRDPRHFR--FQK---------------SDLA-T----------PLLLDDNGAEV-------PLPAFRFIF 192 (547) Q Consensus 152 ----~~~l~~~~~~~~~--~~~---------------~~~~-~----------~~~~~~~~~g~-------~l~~~k~i~ 192 (547) ...|..++|+... ++. ++.. . .......+... ++..--+|. T Consensus 134 ~~~~~~~i~~~~p~~~~~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 213 (484) T protein:vir:77 134 VDPEVPIIRVEPPTNLYAQIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANVAHNLEMVPVIP 213 (484) T ss_pred cccccceEEEeccceeEEEecCCCCceEEEEEEEEeecCCcEEEEEEEecCeEEEEEecCCceEeeccccCCCCCcceEE Confidence 0013333333221 110 0000 0 00000111111 111112455 Q ss_pred EeecCCcCccccchhHHHHHHHH-HHHHHHHHHHHHHHHHcCCCeEEEeC--CCCCCHHHHHHHHHHHHHHhcCeeEEec Q lcl|NC_020866. 
193 ADIPAKSGILLRSGLARVAAWGW-MFKAFTARDWAIFTQTYGQPLRIGKW--QPGASEQDKDTLFRAVANIAGDCAAIIP 269 (547) Q Consensus 193 ~~~~~~~~~p~G~gll~~~~~~~-~~K~~~~~~w~~f~E~~G~P~~vgk~--~~~~~~~~~~~l~~al~~~~~~~~~vip 269 (547) |.++.+.+.|+|.|-+....... =-=...+.+++..++-|..|.++.+- +..... +..+-...+ ....+...++| T Consensus 214 f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~-~~~~~~~~~-~~~~~~~~~~~ 291 (484) T protein:vir:77 214 IPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGV-DPETGQTLF-DAYLARILAFE 291 (484) T ss_pred eccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcc-cccccchhh-hhhhhhhcccC Confidence 67777888899998876433222 22234567788888999999877552 111100 011101111 12223334444 Q ss_pred CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccC----ccccccchhHHH---HHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 270 ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTD----AVAGGHAVSQEH---RLVQEDIETSDAMALAAI 342 (547) Q Consensus 270 ~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~----~~ggs~a~~~vh---~~v~~~~~~ad~~~i~~~ 342 (547) . .+.++.+.. .++.+.|...++.+-.+||.. +-+++ +.+.+.++|+.- ..-....++.-.+.+... T Consensus 292 ~-~~~~~~q~~--~~~~e~~~~~l~~~i~~~s~~----~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~ 364 (484) T protein:vir:77 292 D-HESKAQQFS--AAELRNFVDALDALDRKAAAY----TGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGA 364 (484) T ss_pred C-CCceeEeec--CCChHHHHHHHHHHHHHHhcc----cCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3 356666544 233445655555555555433 11111 111111223222 222233344445556666 Q ss_pred HHHHHHHHHHHhcCCCc----ccccEEEecCccccCHHHHHHHHHHHHhCCC-CCCHHHHHHHhCCCCCCCcchh--ccc Q lcl|NC_020866. 343 LNRDLIRPWIQLEYGPQ----KRYPRIKIARPKTEDLTKLASSLDTLVRLGM-EIEENEVRSRFGFSQPKPGAKL--LRP 415 (547) Q Consensus 343 ln~~li~~l~~~Nfg~~----~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~-~v~~~~~~e~~Glp~p~~~e~~--~~~ 415 (547) +.+ +++.++.+--+.. ...-++.|......++.+.++.+.+|++.|. .++.+.+.+.+|+.+.... ++ +.. 
T Consensus 365 l~~-~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~-e~~~~~~ 442 (484) T protein:vir:77 365 WEQ-AMRVAYKVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITERE-EMRKWDE 442 (484) T ss_pred HHH-HHHHHHHHhCCCCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHH-HHHHHHH Confidence 643 5555544421211 1224677888888899999999999999874 3788999999998543222 11 111 Q ss_pred ccc-ccc-----ccccccccccccCCccccccccccccchhhhhhccc Q lcl|NC_020866. 416 LEN-PAT-----QAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLS 457 (547) Q Consensus 416 ~~~-~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (547) ... ... .....+..++.+..+ .+... .++......+ T Consensus 443 ee~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-----~~~~~~~~~~ 484 (484) T protein:vir:77 443 EEQAQGLGLMGTMFGTDPSGGGNPDNP-ETPEP-----QPNPAEEAAA 484 (484) T ss_pred HHHHHHHHHHhhhccccccCCCCCCCC-Ccccc-----cCCCccccCC Confidence 000 000 000000000000000 00000 0000000000 No 175 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.51 E-value=4.1e-07 Score=55.58 Aligned_cols=417 Identities=12% Similarity=0.047 Sum_probs=163.6 Q ss_pred CC-CCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHH Q lcl|NC_020866. 1 MA-QKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYL 79 (547) Q Consensus 1 ma-~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~ 79 (547) |- ..|-+.+..--..-...|....... ..-.+.....|-+.. 
.+ +.....-...+..+ ...-...+ T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~-~~r~~~~~~Yy~G~~-~i-------~~~~~~~~~~~~~~----~~~~n~~~ 67 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDS-TQNLKTNTSYYEAER-RP-------EAIGVTVPIQMQSL----LAHVGYPR 67 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCC-cc-------hhcCCCCChhhhhh----hhhcCcHH Confidence 32 0000111111111111111111110 000111111111100 00 00000000000000 00111222 Q ss_pred HHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCee------eE Q lcl|NC_020866. 80 GVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQY------FP 152 (547) Q Consensus 80 s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~------~~ 152 (547) -++++....+....|. .++++..+ +.+.+++..-+|+.....++ +|..||.| ++++|.-.++.. .+ T Consensus 68 ~ivd~~~~~l~~~g~~--~~~~~~~~----~~~~~i~~~N~~d~~~~~~~~~a~i~G~a-y~~v~~~e~~~~~~~~~~~~ 140 (485) T protein:vir:10 68 LYVDSIAERQAVEGFR--FGDADEAD----EELWQWWQANNLDIEAPLGYTDAYVHGRS-YITISRPDPQIDLGWDPNTP 140 (485) T ss_pred HHHHHHHhhhccccee--cCCCchhH----HHHHHHHHhcCHhHHHHHHHHHHhhcCce-EEEEeeCCcccccccCCCee Confidence 2333332222222333 23333322 34566666667888888776 68899988 556664322110 01 Q ss_pred EeeEeeccceeE--ecccCCcc--------------------------eeeccCCCCc-----e--eeccCceEEEeecC Q lcl|NC_020866. 153 DKLIWRDPRHFR--FQKSDLAT--------------------------PLLLDDNGAE-----V--PLPAFRFIFADIPA 197 (547) Q Consensus 153 ~~l~~~~~~~~~--~~~~~~~~--------------------------~~~~~~~~~g-----~--~l~~~k~i~~~~~~ 197 (547) .+..++|+.+. ++...+.. +......+.. . +++.--++.|.++. 
T Consensus 141 -~i~~~~p~~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 219 (485) T protein:vir:10 141 -IIRVEPPTRMYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRT 219 (485) T ss_pred -EEEEEccceeEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccEEEecccc Confidence 23344443321 11100000 0000001110 0 11122345566777 Q ss_pred CcCccccchhHHH-HHHHHHHHHHHHHHHHHHHHHcCCCeEEEe--CCCCCC-HHHHHHHHHHHHHHhcCeeEEecCCce Q lcl|NC_020866. 198 KSGILLRSGLARV-AAWGWMFKAFTARDWAIFTQTYGQPLRIGK--WQPGAS-EQDKDTLFRAVANIAGDCAAIIPETMS 273 (547) Q Consensus 198 ~~~~p~G~gll~~-~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk--~~~~~~-~~~~~~l~~al~~~~~~~~~vip~~~~ 273 (547) +.+.++|.|-+.. +....=--...+.+.....+.|..|.++.+ .+.... +++.. ..+-....+....+ .+.+ T Consensus 220 ~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~---~~~~~~~~~~i~~~-~~~d 295 (485) T protein:vir:10 220 RLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETG---QTLFDAYLARILAF-EDAE 295 (485) T ss_pred ccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCccccccccccc---chhhhhcccceecc-CCCC Confidence 7778899887754 222211223346677778899999987654 111111 01110 11112223333333 4456 Q ss_pred EEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccC----ccccccchhHH---HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 274 IDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTD----AVAGGHAVSQE---HRLVQEDIETSDAMALAAILNRD 346 (547) Q Consensus 274 ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~----~~ggs~a~~~v---h~~v~~~~~~ad~~~i~~~ln~~ 346 (547) .+|.+... .+...|...++-+-.+|+.. +-+.. +.+.+.++|.. 
...-....++.-.+.+...+++ T Consensus 296 ~k~~q~~~--~~~~~~~~~l~~~i~~~~~~----~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~- 368 (485) T protein:vir:10 296 GKIQQFSA--AELANFTNALDQIAKQVAAY----TGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEE- 368 (485) T ss_pred ceEEeecc--cchHHHHHHHHHHHHHHhcc----cCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 77766543 33445655555444444433 22111 11111222222 2222333344555566667744 Q ss_pred HHHHHHHhcCCCc----ccccEEEecCccccCHHHHHHHHHHHHhCCC-CCCHHHHHHHhCCCCCCCcchhcc--ccccc Q lcl|NC_020866. 347 LIRPWIQLEYGPQ----KRYPRIKIARPKTEDLTKLASSLDTLVRLGM-EIEENEVRSRFGFSQPKPGAKLLR--PLENP 419 (547) Q Consensus 347 li~~l~~~Nfg~~----~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~-~v~~~~~~e~~Glp~p~~~e~~~~--~~~~~ 419 (547) +++.++.+.-+.. ...-.+.|....+.++.+.++++.+|++.|. .++.+.+.+.+|+.+... +++.. ..... T Consensus 369 ~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~-~~~~~~~ee~~~ 447 (485) T protein:vir:10 369 AMRLAYRMMKGGDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAER-EEMRRWDEEEAA 447 (485) T ss_pred HHHHHHHHhCCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHH-HHHHHHHHHHHH Confidence 5565555432221 1234678888888999999999999999883 478999999999865422 22111 10000 Q ss_pred cc------ccccccccccc-cCCccccccccccccchhhhhhccc Q lcl|NC_020866. 420 AT------QAAPPGMDPGA-GDLKTAERKLKPLSGQADLSRGDLS 457 (547) Q Consensus 420 ~~------~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (547) .. ...+....++. ...+.+.++.... 
....+ T Consensus 448 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~ 485 (485) T protein:vir:10 448 MGLGLIGTMVDPNPTVPGSPSPAPAPKPAALES-------GGDAA 485 (485) T ss_pred HHHHHHHHhhccCCCCCCCCCccccccCcCCCC-------CCCCC Confidence 00 00000000000 0000000000000 00000 No 176 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=98.46 E-value=5.5e-07 Score=54.91 Aligned_cols=401 Identities=11% Similarity=0.063 Sum_probs=188.2 Q ss_pred CC--CCCeeeCCCCCcccchhhhhhhhhhhhh--hhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhH Q lcl|NC_020866. 1 MA--QKPVLLDRWGNPVQRAQLTEEVAAPTVT--GVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma--~~~~~~~~~g~~~~~~~l~~~~a~~~~~--~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~ 76 (547) |- ++.+++=|.-.++....|.+-+...... -..+....|-+.. .++..-.......-..+ ..+ T Consensus 1 ~~~~~~~~~~~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~-------~i~~~~~~~~~~~~~ki------~~n 67 (453) T protein:vir:39 1 MKYKPPKLMTFPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIM-------AIDAEPTKDLWKPDNRL------TVN 67 (453) T ss_pred CeecCCcceEcCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccC-------chhcCCCccccCcccee------ecc Confidence 32 1222333333444444444444332110 1111111111110 01111000000000001 235 Q ss_pred HHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEee Q lcl|NC_020866. 77 HYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKL 155 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l 155 (547) ...-.+.+....+.+.+..+.+.+++ ..+.+.+++..-+|+..+.++. ++..||.+ ++++|...+|... 
+ T Consensus 68 ~~~~ivd~~~~~l~g~~~~~~~~d~~-----~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~-~~~v~~d~~g~~~---i 138 (453) T protein:vir:39 68 FTKYIVDTFTGYFNGIPVKKSHSDKE-----TLSKLQEFDNLNDMEDEESELAKMACIYGRA-FELLYQNEETQTN---V 138 (453) T ss_pred hHHHHHHHHhhhhcccCceeccCChH-----HHHHHHHHHHhcChhHHHHHHHHHHhhcCeE-EEEEEecCCCceE---E Confidence 55666677777788888888754332 3356778787778888777776 68889975 4666654455433 3 Q ss_pred EeeccceeE--ecccCCcce------------------------eeccCCCCcee----ec-cCceEEEeecCCcCcccc Q lcl|NC_020866. 156 IWRDPRHFR--FQKSDLATP------------------------LLLDDNGAEVP----LP-AFRFIFADIPAKSGILLR 204 (547) Q Consensus 156 ~~~~~~~~~--~~~~~~~~~------------------------~~~~~~~~g~~----l~-~~k~i~~~~~~~~~~p~G 204 (547) ...+|+... ++......+ ......+.+.. .+ ++..|-+.+ -.+++.| T Consensus 139 ~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~--~~n~~~g 216 (453) T protein:vir:39 139 IYNTPENMFMVYDDTIKQEPLFAVRYGYDDDYKLYGEVYTKETTYALNGTMGFYNMTEQAPNPFDDLPVVE--FYFNEER 216 (453) T ss_pred EEEcccceEEEecCCCCCeEEEEEEEEEeCCeEEEEEEEeCCeEEEEEecCCceeeecccccCCCceeEEE--ecCCCCC Confidence 334444322 111000000 00000100100 01 111111111 1236789 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCeeEEe------cCCceEEEEe Q lcl|NC_020866. 205 SGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCAAII------PETMSIDFVE 278 (547) Q Consensus 205 ~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~vi------p~~~~ie~~e 278 (547) .|.+..+....---...+.+++..++.|..|+++.+- .+.++++...+ ..+..+.+ +.+.+++++. 
T Consensus 217 ~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g-~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~lt 288 (453) T protein:vir:39 217 MSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG-AAVEEEDLKNI-------RSNRVINYYGESSEAKNVDVKFLE 288 (453) T ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeec-CCCCchhhhhh-------hhcceeeecCCCCCCCCCceeEEe Confidence 9999887666656667788999999999999887662 22233333222 22222222 3566788886 Q ss_pred cCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 279 SKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLV---QEDIETSDAMALAAILNRDLIRPWIQLE 355 (547) Q Consensus 279 ~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v---~~~~~~ad~~~i~~~ln~~li~~l~~~N 355 (547) ... ....++..++.+.+.|...--...++.++. | .++|..-... ....+..-.+.+...+. ++++.++.+. T Consensus 289 ~~~---~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-g-n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~-~~~~li~~~~ 362 (453) T protein:vir:39 289 KPD---SDSQTENLLDRLTKLIFQTTMVANISDESF-G-SSSGVSLAYKLQAMSNLALSFQRKFQSSLN-SRYKLYCELS 362 (453) T ss_pred ecC---CHHHHHHHHHHHHHHHHHHhCCcccccccc-c-CChHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Confidence 442 234688888888888876522222333322 2 2223322221 22333444555666664 3566555543 Q ss_pred --CC--CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcchhcccccccc--cccccccc Q lcl|NC_020866. 356 --YG--PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAKLLRPLENPA--TQAAPPGM 428 (547) Q Consensus 356 --fg--~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~~~~~~~~~~--~~~~~~~~ 428 (547) .+ .+..-..+.|....+.+..+.++.+.++. |+ ++.+.+.+.++. +.|+ +++-....... ........ 
T Consensus 363 ~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~--g~-is~et~l~~l~~v~D~~--~E~~ri~~E~~~~~~~~~~~~ 437 (453) T protein:vir:39 363 TNVSNKEAWKDIEYTFTRNEPKDIKEQAETANILM--GI-TSQETALSVISVIPDVQ--AEMEKIKKEEASTAIFDKDKQ 437 (453) T ss_pred hccCCccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CChHHHHHhCCCCCCHH--HHHHHHHHHHHHHHHHHHhcc Confidence 11 11223478888888999999999999884 65 899999999974 3332 22221111111 00000111 Q ss_pred cccccCCccccccccc Q lcl|NC_020866. 429 DPGAGDLKTAERKLKP 444 (547) Q Consensus 429 ~~~~~~~~~~~~~~~~ 444 (547) ....+.....+....+ T Consensus 438 ~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 438 PSEKGTDTVVPETNEE 453 (453) T ss_pred CCCCCCCCCCCCcCCC Confidence 1111110000000000 No 177 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=98.44 E-value=6.6e-07 Score=54.46 Aligned_cols=385 Identities=10% Similarity=0.019 Sum_probs=179.9 Q ss_pred ccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHH-----HHHH-----HHH----------Hhh Q lcl|NC_020866. 15 VQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRY-----LDLA-----EVI----------EER 74 (547) Q Consensus 15 ~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~-----~~L~-----~~m----------~~~ 74 (547) |+...+.+-+.. .+..|..+ ..++.. ++..-.|.-... .+.. ... .-. T Consensus 1 ~~~e~~~~~i~~--------~~~~~~~~---~~~~~~-~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 68 (471) T protein:vir:10 1 MEIEVIKKIISS--------QMVKHGKF---VSQAAE-AEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRIS 68 (471) T ss_pred CCHHHHHHHHHH--------HHHHHHHH---HHHHHH-HHHHhccccccccccchhhhhcccccccccccccccccceec Confidence 444433333321 11111111 011111 111111110000 0000 000 001 Q ss_pred hHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEe-ecCCeeeE Q lcl|NC_020866. 
75 DPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWN-TSEGQYFP 152 (547) Q Consensus 75 D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~-~~~g~~~~ 152 (547) .+...-.+.+....+.+.+..+.+.++ +..+.+..++++ +|+.....+. ++..||.+. +++|. ..+|.+. T Consensus 69 ~n~~~~Ivd~~~~yl~G~p~~~~~~~~-----~~~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~-~~v~~d~~~g~~~- 140 (471) T protein:vir:10 69 HNWHQLLLDQKKAYALTYPPTFDVDDK-----KVNDMIVDVLGD-DYERISKQLCVNAGNAGIAW-LHVWKDASDNSFR- 140 (471) T ss_pred cchhHHHHHhhhhhhcccCceeccCCh-----HHHHHHHHHHhc-CHHHHHHHHHHHHhhCCeEE-EEEEeeCCCCeeE- Confidence 234455666777777888877765332 234556666664 6888877766 588899766 45553 3456543 Q ss_pred EeeEeeccceeE--ecccCCcce------eecc--CCCCce---ee-ccCceEEEeecC--------------------- Q lcl|NC_020866. 153 DKLIWRDPRHFR--FQKSDLATP------LLLD--DNGAEV---PL-PAFRFIFADIPA--------------------- 197 (547) Q Consensus 153 ~~l~~~~~~~~~--~~~~~~~~~------~~~~--~~~~g~---~l-~~~k~i~~~~~~--------------------- 197 (547) +..++|+... ++......+ .... ...... .+ .+.....+++.. T Consensus 141 --~~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (471) T protein:vir:10 141 --YACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNG 218 (471) T ss_pred --EEEEcccceEEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccc Confidence 4444554432 111111000 0000 000000 00 000111111000 Q ss_pred -------------------CcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHH Q lcl|NC_020866. 198 -------------------KSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVA 258 (547) Q Consensus 198 -------------------~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~ 258 (547) -.++..|.|.+..+-...--=...+.+++..++.|..|+++.+-..+... .+.+. 
T Consensus 219 ~~~~~~~~~~~~g~iPvv~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~------~~~~~ 292 (471) T protein:vir:10 219 DRSSDNSFKHDFGLVPFIPFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDK------QEFLE 292 (471) T ss_pred cccccccccCCCCceeEEEeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCcccc------chhHH Confidence 01245678888876655555555788899999999999888774222221 12233 Q ss_pred HHhcCeeEEec-----CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHHH---HH Q lcl|NC_020866. 259 NIAGDCAAIIP-----ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLVQ---ED 330 (547) Q Consensus 259 ~~~~~~~~vip-----~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v~---~~ 330 (547) .+.....+.++ .+.+++++..... ...++..++.+.+.|...--+..++.++. | .++|..-.... .. T Consensus 293 ~~~~~~~i~~~~~~~~~~~~~~~l~~~~~---~~~~~~~~~~l~~~I~~~s~tp~~~~~~~-g-n~Sg~Alk~~~~~l~~ 367 (471) T protein:vir:10 293 DLKRYKMIKMDNDGMGDQSGVTTIAIDIP---TEARNLILERTKKQIFISGQGVNPETDKL-G-NSSGVALKFLYSLLEL 367 (471) T ss_pred HhhcCCeEEecCCCCccCccceEEeecCC---hHHHHHHHHHHHHHHHHHhCCcCCCcccc-c-CccHHHHHHHHHHHHH Confidence 44445555554 3347888875532 23588888888888876532223333322 2 33333222221 22 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCc Q lcl|NC_020866. 331 IETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPG 409 (547) Q Consensus 331 ~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~ 409 (547) .+..-.+.+...+ +++++.++.+.-..+.....+.|....+.+..+.++.+.++. |+ ++.+.+.+.++. 
+.|+ T Consensus 368 k~~~~~~~~~~~l-~~~~~li~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~--g~-iS~et~~~~~p~v~D~~-- 441 (471) T protein:vir:10 368 KAGNMETQFRSGY-ATLVKMILKHLGLSDKLKIKQTWTRNSINNDTEMAQVVSTLA--TI-TSRENVAKSNPIVEDWQ-- 441 (471) T ss_pred HHHHHHHHHHHHH-HHHHHHHHHHhccCCCceeEEEeCCCCCCCHHHHHHHHHHHh--cc-CchHHHHHhCCCCCCHH-- Confidence 3444455666666 346666666542233344578888888999999999998874 66 999999988854 2222 Q ss_pred chhccccccccc-ccccccccccccCCcccccc Q lcl|NC_020866. 410 AKLLRPLENPAT-QAAPPGMDPGAGDLKTAERK 441 (547) Q Consensus 410 e~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 441 (547) +++-........ ....+....+... .+.. T Consensus 442 ~E~eri~~E~~~~~~~~~~~~~~~~~---~e~~ 471 (471) T protein:vir:10 442 DELRLQKAEQEGRSEKLYDMEEVEHE---SEVE 471 (471) T ss_pred HHHHHHHHHHHHHHhcccccCCCCCc---cccC Confidence 222111111000 0000000000000 0000 No 178 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=98.43 E-value=6.8e-07 Score=54.38 Aligned_cols=414 Identities=9% Similarity=-0.011 Sum_probs=188.8 Q ss_pred CCCCCeeeCCCCCcccchhhh--hhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHH---H----HHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLT--EEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYL---D----LAEVI 71 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~--~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~---~----L~~~m 71 (547) |+..++ +...|....-+. ++.+......+.+.+..|-.+ ..|+.. ++..-.|...... . ..... 
T Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~---~~r~~~-~~~Yy~g~~~i~~~~~~~~~~~~~~~ 73 (478) T protein:vir:10 1 MISINW---PWDKPYHEQVVEQIKPKYETQEEMILRLVREHKEN---IDNITM-GERYYNHHPDILDAPFKRDVNGDYDE 73 (478) T ss_pred Cccccc---cCCchhhhHHHHHhhhccCChHHHHHHHHHHHHHH---HHHHHH-HHHHhcccccccccchhhhccccccc Confidence 887433 223333221111 122111222233333333211 223322 2222222211000 0 00000 Q ss_pred Hh-----hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEee Q lcl|NC_020866. 72 EE-----RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNT 145 (547) Q Consensus 72 ~~-----~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~ 145 (547) .+ .-++..-.+.+....+.+.+..+...++ +..+.+..++++ +|++....+. ++.-||.+. +++|.- T Consensus 74 ~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~-----~~~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~-~~v~~d 146 (478) T protein:vir:10 74 TKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDND-----KALKQIQHTLNH-KWDDKLVDILTAASNKGIEW-VQPYVD 146 (478) T ss_pred ccccceeccchHHHHHHHHhhhhcccCceeecCCh-----HHHHHHHHHHhc-cHHHHHHHHHHHHhhCCeEE-EEEEec Confidence 00 1355566777777888888888765432 233556666654 6777776665 788899864 667755 Q ss_pred cCCeeeEEeeEeeccceeE--ecccCCccee----e-ccCCCCcee-eccCceEEEeec--------------------- Q lcl|NC_020866. 146 SEGQYFPDKLIWRDPRHFR--FQKSDLATPL----L-LDDNGAEVP-LPAFRFIFADIP--------------------- 196 (547) Q Consensus 146 ~~g~~~~~~l~~~~~~~~~--~~~~~~~~~~----~-~~~~~~g~~-l~~~k~i~~~~~--------------------- 196 (547) .++.+. +..++|+.+. ++......+. . .......+. ..+.++.+++.. 
T Consensus 147 ~~~~~~---~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (478) T protein:vir:10 147 EEGEFK---TFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQ 223 (478) T ss_pred CCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEeeeCceEEEEEeCCcEEEEEecCCeeeccccccccccccceec Confidence 555543 4445554432 2111111110 0 000000111 111111111110 Q ss_pred -------------CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcC Q lcl|NC_020866. 197 -------------AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGD 263 (547) Q Consensus 197 -------------~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~ 263 (547) .-.+++.|.|.+..+....--=...+..++..++.|..|+++.+-- +.++.. +...++... T Consensus 224 ~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~---~~~~~~---~~~~~~~~~ 297 (478) T protein:vir:10 224 GNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGY---EGEDMK---DFMHNLKYY 297 (478) T ss_pred ccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecC---Cccccc---chhhhhhhC Confidence 0123567899988865554444557788888889999998776531 211111 122333334 Q ss_pred eeEEec--CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHH---HHHHHHHHHHHH Q lcl|NC_020866. 264 CAAIIP--ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRL---VQEDIETSDAMA 338 (547) Q Consensus 264 ~~~vip--~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~---v~~~~~~ad~~~ 338 (547) .++.++ .|.+++++.... ....++..++.+.+.|...--...++.++.+| ..+|..-.- -....+..-.+. 
T Consensus 298 ~~~~~~~~~~~~~~~l~~~~---~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~~ 373 (478) T protein:vir:10 298 KAISVAGESGSGVDTIKVEV---PIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGN-SPSGIALKFMYSNLDLKANKLKNK 373 (478) T ss_pred ceeEecCCCCCcceEEeecC---CHHHHHHHHHHHHHHHHHHhCCcCcCcccccc-chHHHHHHHHHHHHHHHHHHHHHH Confidence 455554 567888886543 23457888888888777663222334343322 223322211 122233444556 Q ss_pred HHHHHHHHHHHHHHHhcCC-CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcchhcccc Q lcl|NC_020866. 339 LAAILNRDLIRPWIQLEYG-PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAKLLRPL 416 (547) Q Consensus 339 i~~~ln~~li~~l~~~Nfg-~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~~~~~~ 416 (547) +...+. ++++-++.+.-. .+..-..+.|....+.+..+.++.+.++ .|+ ++.+.+.+.++. +.|...=+-+... T Consensus 374 ~~~~l~-~~~~li~~~~~~~~d~~~i~i~f~~~~p~~~~e~~~~~~~~--~g~-iS~et~i~~~~~v~d~~~E~~ri~~E 449 (478) T protein:vir:10 374 TLTALQ-ELLQYIIDFYRLDVRVQDIEITFNFNVMVNELENSQIAMNS--TGL-LSKETILGNHSWVQDPVAEMERIEQE 449 (478) T ss_pred HHHHHH-HHHHHHHHHhCCCcccccceEEeCCCCCCCHHHHHHHHHHH--hCC-CChHHHHHhCCCCCCHHHHHHHHHHH Confidence 666664 466656555421 2223357888888888999999988877 465 888888888864 3332110111111 Q ss_pred cccccccccccccccccCCccccccccccc Q lcl|NC_020866. 417 ENPATQAAPPGMDPGAGDLKTAERKLKPLS 446 (547) Q Consensus 417 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (547) ........+....+... .......+.... 
T Consensus 450 ~~~~~~~~~~~~~~~~d-~~~~~~~d~~~e 478 (478) T protein:vir:10 450 NIELNQQLPDIEEGLND-EQQRQSEDNQSE 478 (478) T ss_pred HHHHHHhccccCCCCcc-cccccCcCCCCC Confidence 11000000000000000 000000000000 No 179 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=98.42 E-value=7.5e-07 Score=54.16 Aligned_cols=431 Identities=12% Similarity=0.040 Sum_probs=186.0 Q ss_pred CCCCCeeeCCCCCccc------chhhhhhhh-------hhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCC-HHHH-- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQ------RAQLTEEVA-------APTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGD-PVRY-- 64 (547) Q Consensus 1 ma~~~~~~~~~g~~~~------~~~l~~~~a-------~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~-~~~~-- 64 (547) |-|+-. .+..++--. ......-+- ......+.+.+..|-... ..++....+=. .|+ ...+ T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~--~~r~~~~~~yY-~g~~~~i~~~ 76 (501) T protein:vir:96 1 MEQTLF-TDSTGQERVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQ--APRIQELLDYA-RGENHDVLKS 76 (501) T ss_pred Cceeee-eecccceeccccccchhHHhhhcccccccccCChHHHHHHHHHHHHHHH--HHHHHHHHHHh-cCCCCcccCc Confidence 554322 232222111 111100000 000001111111110000 00111111101 010 0000 Q ss_pred ---HHHHH-HHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeee Q lcl|NC_020866. 65 ---LDLAE-VIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFT 139 (547) Q Consensus 65 ---~~L~~-~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~ 139 (547) ..-.. .-.-..++..-.+++....+.+.+.++...++ ...+++.+++.++++.-+|+.....+. ++..||.+. 
T Consensus 77 ~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~-~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~- 154 (501) T protein:vir:96 77 GRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDN-DDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAY- 154 (501) T ss_pred cccCccccccceeecchHHHHHHHHhhhhcccCeeEeeCCc-cchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEE- Confidence 00000 00002445555677777778888888876543 234456677888888778888888887 688899765 Q ss_pred eeEEeecCCeeeEEeeEeeccceeE--ecccCCcce------ee---------------------ccCCCCce---eec- Q lcl|NC_020866. 140 EIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATP------LL---------------------LDDNGAEV---PLP- 186 (547) Q Consensus 140 Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~------~~---------------------~~~~~~g~---~l~- 186 (547) +++|...+|... +..++|+.+. ++......+ .. ....+... ..+ T Consensus 155 ~~v~~dedg~~~---i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~ 231 (501) T protein:vir:96 155 EVIYRSEYDETR---IKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDASDDFNEISVTTH 231 (501) T ss_pred EEEEEcCCCceE---EEEEccceeEEEEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCCcEEEEeeCCCceecccccc Confidence 666655555543 4445555432 111100110 00 00000000 000 Q ss_pred cCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCC-HHHHHHHHHH-HHHHhc-C Q lcl|NC_020866. 187 AFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGAS-EQDKDTLFRA-VANIAG-D 263 (547) Q Consensus 187 ~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~-~~~~~~l~~a-l~~~~~-~ 263 (547) ++..|-+.+ -.+++.|.|.+..+....-.-...+..++..++.|..|+++.+-....+ .+....+... +-.+.. 
+ T Consensus 232 ~~g~vPvv~--~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (501) T protein:vir:96 232 AFGTVPITE--YLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPK 309 (501) T ss_pred CCCccceEE--ecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcCeeeecccc Confidence 111111111 1346889999998766555556678888889999999988876322111 1111111110 000111 1 Q ss_pred eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHH-H--HHHHHHHHHHHHHH Q lcl|NC_020866. 264 CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHR-L--VQEDIETSDAMALA 340 (547) Q Consensus 264 ~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~-~--v~~~~~~ad~~~i~ 340 (547) +.-....+.+++++..... ...++.+++.+.+.|...--...++.++.+| ..+|..-. . -....+..-.+.+. T Consensus 310 ~~~~~~~~~~~~~l~~~~~---~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~-n~Sg~Al~~~~~~l~~ka~~~~~~~~ 385 (501) T protein:vir:96 310 SADGKEGTVKAEYLTKSYD---VSGAEAYKTRLNRDIHIFTNTPDMSDTNFSG-NTSGEALKYKLFGLDQDRVDTQSQFT 385 (501) T ss_pred cccccccCcceeeEeccCC---HHHHHHHHHHHHHHHHHHhCCcccCcccccc-cchHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111234557778754432 2347888888887776652222233332222 22233221 1 12223344455666 Q ss_pred HHHHHHHHHHHHHh-c---CCC--cccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcchh- Q lcl|NC_020866. 341 AILNRDLIRPWIQL-E---YGP--QKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAKL- 412 (547) Q Consensus 341 ~~ln~~li~~l~~~-N---fg~--~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~~- 412 (547) ..+. ++++.++.+ + .+. +....++.|....+.+..+.++.+.+++ |+ ++.+.+.+.++. 
+.|+ +++ T Consensus 386 ~~l~-~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~--g~-iS~et~~~~l~~v~D~~--~E~~ 459 (501) T protein:vir:96 386 KGLK-RRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLG--GQ-VSQETALSLSGLVESPN--EELD 459 (501) T ss_pred HHHH-HHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CchHHHHHhCCCCCCHH--HHHH Confidence 6664 355555443 1 111 1233578899999999999999999986 65 888888888864 3322 222 Q ss_pred -cccccccccc-cccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchhHHH Q lcl|NC_020866. 413 -LRPLENPATQ-AAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIA 475 (547) Q Consensus 413 -~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 475 (547) +......... .......+..+......... . ....++..+ T Consensus 460 ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~-~----------------------~d~~e~~~~ 501 (501) T protein:vir:96 460 KINKEMSEIDFKGYSNDFNEHVGKYTDEVKET-H----------------------TDDFEREYE 501 (501) T ss_pred HHHHHHHHhhccccccchhhcccccCCcCCCC-C----------------------CCccccccC Confidence 1111110000 00000000000000000000 0 000000000 No 180 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.40 E-value=8.3e-07 Score=53.93 Aligned_cols=418 Identities=12% Similarity=0.049 Sum_probs=164.0 Q ss_pred CCC-CCeeeCCCCCcccch--hhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCC-H--------HHHHHHH Q lcl|NC_020866. 1 MAQ-KPVLLDRWGNPVQRA--QLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGD-P--------VRYLDLA 68 (547) Q Consensus 1 ma~-~~~~~~~~g~~~~~~--~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~-~--------~~~~~L~ 68 (547) |-. ...|.+.--.=|+.+ .++.+..... ++..+..|-.+. .|++..-+=..+-+ . ..+..+. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~l---~~~l~~~~~~~~---~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~ 74 (501) T protein:vir:25 1 MTVPVDVIADAPAADVEFPEDSMSREQLGAL---VADMWRLHISER---QWLDRIYEYTKGLRGRPEVPEGASDEVKELA 74 (501) T ss_pred CcccchhhhccCcccccCCcccCChHHHHHH---HHHHHHHHHHHH---HHHHHHHHHHhcCCCchhccccCChhhhhhH Confidence 321 011111111111111 1111111111 111111111100 11111111100000 0 0000000 Q ss_pred HHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecC Q lcl|NC_020866. 69 EVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSE 147 (547) Q Consensus 69 ~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~ 147 (547) +.+ .-..-+-++++....+.--.|.+ ++++.. +.+.++++.-+|+....++. ++..||.|. +++|..++ T Consensus 75 ~~~--v~n~~~~ivd~~a~~l~~~gf~~--~d~~~~-----~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay-~~v~~de~ 144 (501) T protein:vir:25 75 KLS--VKNVLSLVRDSFAQNLSVVGYRN--ALAKEN-----DPAWEMWQRNRMDARQAEVHRPALTYGASY-VTVTPTDE 144 (501) T ss_pred hhh--hcChHHHHHHHHHhhhcccceec--CCccch-----HHHHHHHHhcChhHHHHHHHHHHhhcCceE-EEEecCCC Confidence 000 00011111111111111122322 222221 23455666666888877754 788999975 78886555 Q ss_pred CeeeEEeeEeeccceeE--e-cccCCcce---ee-c---cCC-CCc-eee------------------------------ Q lcl|NC_020866. 148 GQYFPDKLIWRDPRHFR--F-QKSDLATP---LL-L---DDN-GAE-VPL------------------------------ 185 (547) Q Consensus 148 g~~~~~~l~~~~~~~~~--~-~~~~~~~~---~~-~---~~~-~~g-~~l------------------------------ 185 (547) +. .+..++|+... + +...+.+. +. . .+. ... ..+ T Consensus 145 ~~----~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (501) T protein:vir:25 145 GP----VFRTRSPRQILAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVN 220 (501) T ss_pred CC----eEEEeccccEEEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeeecccccccccccc Confidence 42 24455665442 1 11111100 00 0 000 000 000 Q ss_pred --------------ccCceE--EEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHH Q lcl|NC_020866. 
186 --------------PAFRFI--FADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQD 249 (547) Q Consensus 186 --------------~~~k~i--~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~ 249 (547) .++..| ++-.+....+++|.|-+..+....=-=...+...+...|.|..|.++.+- .+.++ T Consensus 221 ~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G---~~~~~ 297 (501) T protein:vir:25 221 VREVTDVIEHGATFEGKPVCPVVRFVNGRDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISG---WTGSK 297 (501) T ss_pred ccccccccccccccCCccceeeEeccCccccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhC---CCCCc Confidence 011111 11112223367788887765443333344566677788999999776541 22111 Q ss_pred HHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHH---HH Q lcl|NC_020866. 250 KDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEH---RL 326 (547) Q Consensus 250 ~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh---~~ 326 (547) .+. -.+..+...++| |.+.++.+.. ..+.+.|...++.+-++|+..- +-+....+..++..+|+.- .. T Consensus 298 ~~~-----~~~~~~~i~~~~-~~~~~~~q~~--~~~~~~~~~~l~~~i~~i~~~s-~~P~~~~~~~~~N~Sg~Al~~~~~ 368 (501) T protein:vir:25 298 AEV-----LKASALRVWTFE-DPEVKAQAFP--PASVEPYNLILEEMLQHVAMVA-QISPAQVTGKMINVSAEALAAAEA 368 (501) T ss_pred cch-----hhhcccceeccC-CCCceEEEec--ccChHHHHHHHHHHHHHHHhhc-CCChhhhccccCChHHHHHHHHHH Confidence 111 122233344444 4456666543 3344567777777777776652 1111111111222233322 22 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc---cccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHH-HhC Q lcl|NC_020866. 327 VQEDIETSDAMALAAILNRDLIRPWIQLEYGPQK---RYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRS-RFG 402 (547) Q Consensus 327 v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~---~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e-~~G 402 (547) -....++.-.+.+...+. ++++.++.+..+... 
.-.++.|....+.++.+.++++.+|+.+|+ +.+.+.+ ..| T Consensus 369 ~l~~ka~~k~~~f~~~l~-~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gi--s~et~~~~~~g 445 (501) T protein:vir:25 369 NQQRKLAAKRESFGESWE-QLLRLAAEMDDDPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGI--PIEHLLSMVPG 445 (501) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC--CHHHHHHHcCC Confidence 333445556677777785 466766666643321 235778888889999999999999998885 5555554 569 Q ss_pred CCCCCCcchhcccccccccc---ccccccccccc-CCcccc-ccccccccchhhhhhccc Q lcl|NC_020866. 403 FSQPKPGAKLLRPLENPATQ---AAPPGMDPGAG-DLKTAE-RKLKPLSGQADLSRGDLS 457 (547) Q Consensus 403 lp~p~~~e~~~~~~~~~~~~---~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~ 457 (547) +.+++- +.+.......... .+.....+... +.+... +........ .....+ T Consensus 446 ~~~~~i-e~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~g~ 501 (501) T protein:vir:25 446 MTQQTI-QAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGV---NGNGGA 501 (501) T ss_pred CCHHHH-HHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCccccccccC---CCCCCC Confidence 975431 1111110000000 00000000000 000000 000000000 000000 No 181 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=98.35 E-value=4.6e-07 Score=55.33 Aligned_cols=201 Identities=11% Similarity=0.032 Sum_probs=118.2 Q ss_pred eEecccCCcceeecc----CCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEE Q lcl|NC_020866. 163 FRFQKSDLATPLLLD----DNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRI 238 (547) Q Consensus 163 ~~~~~~~~~~~~~~~----~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~v 238 (547) ++...++..+++... 
..+....++++..++++.....+..||.+.+..+......-....++-..|....++|--| T Consensus 1 ~r~~~dg~~~y~~~~~~~~~~g~~~~~~~~eilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng~~p~gi 80 (219) T protein:vir:98 1 MRVCKDGNYKYLMKKSLYDTKSEIYEYNKNDVIFIKLYDPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNGAHMGFI 80 (219) T ss_pred CceeecCeEEEEEecceecCCceeEEeccccEEEecCCCCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceE Confidence 333334433333322 2244567888887666654445667899999988877776666666667788999999877 Q ss_pred EeCC-CCCCHHHHHHHHHHHHHHhcC---eeEEe------cCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhc-c Q lcl|NC_020866. 239 GKWQ-PGASEQDKDTLFRAVANIAGD---CAAII------PETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLG-Q 307 (547) Q Consensus 239 gk~~-~~~~~~~~~~l~~al~~~~~~---~~~vi------p~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilG-q 307 (547) -+++ +..+++.++++.+.+++.... ..+++ +.|++++-+..+. .. .-|.+.-++-..+|+.+.-- . T Consensus 81 l~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~--~d-~qfle~rk~~~~eIa~~fgVPp 157 (219) T protein:vir:98 81 LYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTG--QK-DEFANIKNISAQDVLTSHRFPP 157 (219) T ss_pred EEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCH--HH-HHHHHHHHhhHHHHHHHhCCCH Confidence 7654 457889999999888864221 12344 2345554443222 22 23666667777888887311 2 Q ss_pred hhcc--Cccccccch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHH Q lcl|NC_020866. 308 TTTT--DAVAGGHAV-SQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLT 376 (547) Q Consensus 308 tlts--~~~ggs~a~-~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~ 376 (547) .+.+ +..+++++. .+.....-..-+..-++.|+..||+++ + ++ ...++.|+.....|+. 
T Consensus 158 ~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~---~--~~-----~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 158 GLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDY---E--IK-----SALKVNFKQPEKRDKN 219 (219) T ss_pred HHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhh---c--CC-----CccEEeecCcccccCC Confidence 2322 112233432 233334445566677777888887642 1 11 1136778877777766 No 182 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=98.34 E-value=1.2e-06 Score=53.08 Aligned_cols=423 Identities=9% Similarity=0.032 Sum_probs=188.0 Q ss_pred ccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHH--HHHhhcCCCHHHH--------------HHHHH---HHHhhh Q lcl|NC_020866. 15 VQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQ--ILRAADHGDPVRY--------------LDLAE---VIEERD 75 (547) Q Consensus 15 ~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~--ilr~a~~g~~~~~--------------~~L~~---~m~~~D 75 (547) |....|++++-.- -.-+...+..| ....+... -++..-.|.-..+ .+.+. .+ .. T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~i~~~----~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki--~~ 73 (537) T protein:vir:78 1 MTSPLLNKPIDQL-GGLLNTEITTY----MASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKI--SH 73 (537) T ss_pred CCcccccccHHHH-HHHHHHHHHHH----HHHHHHHHHHHHHHHhcccchhhhccccccccccccccccccccccc--cc Confidence 4444444444210 00011111111 11111111 1222222211000 00000 00 13 Q ss_pred HHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEe Q lcl|NC_020866. 76 PHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDK 154 (547) Q Consensus 76 ~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~ 154 (547) ....-.+.+....+.+.+..+.+.+ ..+++..+.++++++ -.|.+...++. ++..||.+ .|++|...++.+. 
T Consensus 74 nf~k~Ivd~~~~yl~G~Pv~~~~~d--~~~~e~~~~l~~~~~-~~~~~~~~el~~~~s~~G~a-y~~~y~de~~~~~--- 146 (537) T protein:vir:78 74 GFFTELVDQLAQYLLSNGVEVKVKD--EDNTQLDEILQEYFD-EDFQATIDTLVTNASKKGFE-GIFARTTSEGKLK--- 146 (537) T ss_pred chHHHHHHHHhhhhcccCceeecCc--chhHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCee-EEEeeecCCCceE--- Confidence 4455567777888889998887543 344556677777765 35666665554 68889986 5667765566554 Q ss_pred eEeecccee--EecccCCcceeec------cCC--CCc-----e-eeccCceEEEeec---------------------- Q lcl|NC_020866. 155 LIWRDPRHF--RFQKSDLATPLLL------DDN--GAE-----V-PLPAFRFIFADIP---------------------- 196 (547) Q Consensus 155 l~~~~~~~~--~~~~~~~~~~~~~------~~~--~~g-----~-~l~~~k~i~~~~~---------------------- 196 (547) +..++|+.+ .|+..+......+ ... ... + -+.+.+...+... T Consensus 147 ~~~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~ 226 (537) T protein:vir:78 147 FQTVDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHV 226 (537) T ss_pred EEEEccceeEEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccee Confidence 334444432 1221111110000 000 000 0 0001111111000 Q ss_pred ------------------------------CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCC Q lcl|NC_020866. 197 ------------------------------AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGAS 246 (547) Q Consensus 197 ------------------------------~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~ 246 (547) .-.+|.+|.|.+..+....=-=...+.+.+..++.|.-|+++.+=-.+.+ T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~ 306 (537) T protein:vir:78 227 LAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDS 306 (537) T ss_pred eeccccccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCcc Confidence 01235678899988777666666678888999999999988877322222 Q ss_pred HHHHHHHHHHHHHHhcCeeEEec-CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHH Q lcl|NC_020866. 
247 EQDKDTLFRAVANIAGDCAAIIP-ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHR 325 (547) Q Consensus 247 ~~~~~~l~~al~~~~~~~~~vip-~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~ 325 (547) ..+ ...++....++-++ .+.+++++.... ....++.+++++.+.|-+. +++..++..++|.++|. .. T Consensus 307 ~~~------~~~~l~~~~~i~v~~d~~~v~~l~~~~---~~~~~e~~ld~L~~~I~~~--s~~~~~~~~~~gn~SGv-Al 374 (537) T protein:vir:78 307 TDK------LRQNIKAKKMIGVNGDNAGMEIQTVSI---PYEARKAKMDIDVENIYRS--GMGFNSTAVGDGNVTNV-VI 374 (537) T ss_pred chh------HHHHHhhcCceeecCCCCceeEEEecC---CHHHHHHHHHHHHHHHHHh--cCCCCCccccccCCcHH-HH Confidence 111 22234444444444 678899986553 2345788888888888654 33332222233444443 32 Q ss_pred HHHHH----HHHHHHHHHHHHHHHHHHHHHHHh---cCC--CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHH Q lcl|NC_020866. 326 LVQED----IETSDAMALAAILNRDLIRPWIQL---EYG--PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENE 396 (547) Q Consensus 326 ~v~~~----~~~ad~~~i~~~ln~~li~~l~~~---Nfg--~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~ 396 (547) ..... .+..-.+.+...|.+ +++.++.+ ... .+.....+.|...-+.+..+.++.+.+|.+.|+ ++.+. T Consensus 375 k~~~~~l~~ka~~ke~~f~~~l~~-~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~gi-iS~eT 452 (537) T protein:vir:78 375 KSRYTLLAMKARKMETSLRKVLRW-CADMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTRKTEAETEA-LKIGN 452 (537) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHHHHHHhcCc-chHHH Confidence 22222 223334455555543 44444333 211 122446888888899999999999999999997 88887 Q ss_pred HHHHhCCCCCCCcchh---------------ccccccccccccccccccccc-CCccccccccccccchhhhhhcccchh Q lcl|NC_020866. 397 VRSRFGFSQPKPGAKL---------------LRPLENPATQAAPPGMDPGAG-DLKTAERKLKPLSGQADLSRGDLSLNS 460 (547) Q Consensus 397 ~~e~~Glp~p~~~e~~---------------~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (547) +.+.++.-...+.+.. +.....+.....+.......+ .....+++..+.. ....++..++.. 
T Consensus 453 ~l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~-~~~~~~~~~~~~- 530 (537) T protein:vir:78 453 IMTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQ-PVADPNVVPPTD- 530 (537) T ss_pred HHHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccC-CCCCCCCCCCCC- Confidence 7777764221100000 000000000000000000000 0000000000000 000000000000 Q ss_pred hhccccccc Q lcl|NC_020866. 461 EMPSTGQKT 469 (547) Q Consensus 461 ~~~~~~~~~ 469 (547) +..++.. T Consensus 531 --~~~~~~~ 537 (537) T protein:vir:78 531 --PNAVPQT 537 (537) T ss_pred --CccCCCC Confidence 0000000 No 183 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=98.33 E-value=1.2e-06 Score=52.95 Aligned_cols=419 Identities=13% Similarity=0.057 Sum_probs=163.6 Q ss_pred CCCCCeeeCCCCCcccc---hhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQR---AQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPH 77 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~---~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~ 77 (547) |- .-|-+..=.-++. ..|-+.... +..-.+.....|-+.. ++ +.....-...+..+ + ..-.. T Consensus 1 ~~--~~i~~~~~~~~~~~~~~~L~~~~~~-~~~r~~~~~~YY~G~~-~i-------~~~~~~~~~~~~~~-~---~~~n~ 65 (485) T protein:vir:24 1 MT--APLPGQEEIADPAIARDEMVSAFED-QNQNLRSNTSYYEAER-RP-------EAIGVTVPVQMQSL-L---AHVGY 65 (485) T ss_pred CC--CCCCCCCcccchHHHHHHHHHHHHH-HHHHHHHHHHHHhccC-ch-------hhcCcccchhhhhh-h---hccch Confidence 11 1111111111111 111111110 0001111111111110 00 00000000000000 0 00122 Q ss_pred HHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeE---- Q lcl|NC_020866. 
78 YLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFP---- 152 (547) Q Consensus 78 v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~---- 152 (547) ..-++++....+....|.+ ++++..+ +.+.++++.-+|+....+++ ++..||.| ++++|.-.++.... T Consensus 66 ~~~ivd~~~~~l~~~g~~~--~~~~~~~----~~l~~i~~~N~~d~~~~~~~~~a~i~G~a-y~~v~~~~~~~~~~~~~~ 138 (485) T protein:vir:24 66 PRLYVDSIAERQAVEGFRL--GDADEAD----EELWQWWQANNLDIEAPLGYTDAYVHGRS-YITISRPDPQIDLGWDPN 138 (485) T ss_pred HHHHHHHHhhhhccCceec--CCCchhH----HHHHHHHHhcChhHHHHHHHHHHhhcCce-EEEEecCCcccccccCCC Confidence 2223333333333344443 2323222 23456666557888777777 68899997 66667543221100 Q ss_pred -EeeEeeccceeE--eccc---------------CC-c----------ceeeccCCCCce-------eeccCceEEEeec Q lcl|NC_020866. 153 -DKLIWRDPRHFR--FQKS---------------DL-A----------TPLLLDDNGAEV-------PLPAFRFIFADIP 196 (547) Q Consensus 153 -~~l~~~~~~~~~--~~~~---------------~~-~----------~~~~~~~~~~g~-------~l~~~k~i~~~~~ 196 (547) ..+..++|+... ++.. +. . .+......+... +++.--++.|.++ T Consensus 139 ~~~i~~~~p~~~~~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~ 218 (485) T protein:vir:24 139 VPLIRVEPPTRMYAEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNR 218 (485) T ss_pred cceEEEeccceeEEEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEEeccC Confidence 013333333221 1100 00 0 000111111110 1112224556667 Q ss_pred CCcCccccchhHHHHHHH-HHHHHHHHHHHHHHHHHcCCCeEEEe--CCCCCCHHHHHHHHHHHHHHhcCeeEEecCCce Q lcl|NC_020866. 197 AKSGILLRSGLARVAAWG-WMFKAFTARDWAIFTQTYGQPLRIGK--WQPGASEQDKDTLFRAVANIAGDCAAIIPETMS 273 (547) Q Consensus 197 ~~~~~p~G~gll~~~~~~-~~~K~~~~~~w~~f~E~~G~P~~vgk--~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ 273 (547) .+.+.++|.|-+...... .=--...+.+.+...+.|+.|.++.+ .+.....++. 
.- ..+-....+....+| +.+ T Consensus 219 ~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-~~-~~~~~~~~~~i~~~~-~~~ 295 (485) T protein:vir:24 219 TRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPE-TG-QTLFDAYLARILAFE-DAE 295 (485) T ss_pred cccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccc-cc-cchhhhcccceeccC-CCC Confidence 777888999887643222 22223456677778899999987654 1111100000 00 111122223333444 446 Q ss_pred EEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccC-cccc---ccchhHH---HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 274 IDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTD-AVAG---GHAVSQE---HRLVQEDIETSDAMALAAILNRD 346 (547) Q Consensus 274 ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~-~~gg---s~a~~~v---h~~v~~~~~~ad~~~i~~~ln~~ 346 (547) .++.+... ++...|...++.+-.++|.. +-++. .-|| +.++|.. ...-....++.-.+.+...+++ T Consensus 296 ~~~~q~~~--~~~e~~~~~l~~~i~~~s~~----~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~- 368 (485) T protein:vir:24 296 GKIQQFSA--AELANFTNALDQIAKQVAAY----TGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEE- 368 (485) T ss_pred ceEEeecc--cchHHHHHHHHHHHHHHhcc----cCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 66765442 33334544444333333322 22221 1111 1122322 2223334445556666777754 Q ss_pred HHHHHHHh-cCC---CcccccEEEecCccccCHHHHHHHHHHHHhCCC-CCCHHHHHHHhCCCCCCCcchhccccccccc Q lcl|NC_020866. 347 LIRPWIQL-EYG---PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGM-EIEENEVRSRFGFSQPKPGAKLLRPLENPAT 421 (547) Q Consensus 347 li~~l~~~-Nfg---~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~-~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~ 421 (547) +++.++.+ |.. .+....++.|....+.++.+.++.+.+|+..|. .++.+.+.+.+|+..... +++......... 
T Consensus 369 ~~~l~~~~~~~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~-~e~~~~~ee~~~ 447 (485) T protein:vir:24 369 AMRLAYRLMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAER-EEMRRWDEEEAA 447 (485) T ss_pred HHHHHHHHhcCCCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHH-HHHHHHHHHHhh Confidence 56655555 322 122335778888888899999999999998873 489999999999864321 222111000000 Q ss_pred --c---cccccccccc--cCCccccccccccccchhhh Q lcl|NC_020866. 422 --Q---AAPPGMDPGA--GDLKTAERKLKPLSGQADLS 452 (547) Q Consensus 422 --~---~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 452 (547) . ..-....+.. .+.++......+.....+.+ T Consensus 448 ~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 448 MGLGLLGTMVDADPTVPGSPNPTPAPKPQPAIEGGDSA 485 (485) T ss_pred hhhhHHHhhcccCCCCCCCCCCCCCCCCccCCCCCCCC Confidence 0 0000000000 00000111111111100000 No 184 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=98.33 E-value=1.3e-06 Score=52.91 Aligned_cols=405 Identities=11% Similarity=0.006 Sum_probs=185.2 Q ss_pred CCCCCeeeCCCCCcccchhhhhhh------hhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHH---HHHHHH-- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEV------AAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVR---YLDLAE-- 69 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~------a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~---~~~L~~-- 69 (547) ||. | + .|++...-.+.+ ......-+...+..|-.+ ..++....+ .-.|+... ...... 
T Consensus 1 ~~~---~-~---~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~---~~~~~~~~~-Yy~g~~~i~~~~~~~~~~~ 69 (474) T protein:vir:96 1 MIV---I-F---WPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPK---IDDITVGER-YYNHDPDVLRLAPKLDNKG 69 (474) T ss_pred Cee---e-c---cCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHH---HHHHHHHHH-HhccCCcchhccchhcccc Confidence 885 3 2 222221111111 111111222222222111 122222221 11111000 000000 Q ss_pred --HHHh-----hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeee Q lcl|NC_020866. 70 --VIEE-----RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEI 141 (547) Q Consensus 70 --~m~~-----~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei 141 (547) ...+ .-+...-.+.+....+.+.+..+.+.++ +..+.+.+++++ ++.+....+. ++..||.+. ++ T Consensus 70 ~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~-----~~~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~-~~ 142 (474) T protein:vir:96 70 EIDPLKPDWRMFTNYHQNLVDQKVAYAVANPVTFSSDDD-----KSLKTIQEVLNH-KWDDKLVDILTAASNKGIEW-LQ 142 (474) T ss_pred cccccccchhcccchHHHHHHhhhhhhcccCceeecCch-----HHHHHHHHHHhc-CHHHHHHHHHHHHHhcCeeE-EE Confidence 0000 1244455666777788888888865432 234556677665 5555555554 688899964 66 Q ss_pred EEeecCCeeeEEeeEeeccceeE--ecccCCccee--e---ccCCCCceee----------------------------- Q lcl|NC_020866. 142 VWNTSEGQYFPDKLIWRDPRHFR--FQKSDLATPL--L---LDDNGAEVPL----------------------------- 185 (547) Q Consensus 142 ~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~~--~---~~~~~~g~~l----------------------------- 185 (547) +|.-.+|.+. +..++|+.+. ++......++ . .........+ T Consensus 143 ~y~d~~~~~~---i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~ 219 (474) T protein:vir:96 143 PYIDENGEFK---TFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQS 219 (474) T ss_pred EEecCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeeccccccccccc Confidence 6755556543 4444554332 1111011100 0 0000000000 Q ss_pred --------ccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHH Q lcl|NC_020866. 
186 --------PAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAV 257 (547) Q Consensus 186 --------~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al 257 (547) .++..|-+.+ -.++++|.|.+..+....=-=...+.+++..++.|+.|+++.+--.+.+ . .... T Consensus 220 ~~~~~~~~~~~g~iPvv~--~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~---~---~~~~ 291 (474) T protein:vir:96 220 HYYVGNKRVSWGRVPFIP--FKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQD---L---DEFM 291 (474) T ss_pred cccccccccCCCceeEEE--eccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCccc---c---cchh Confidence 0111111110 1236788999988665555555578899999999999988876322111 1 1233 Q ss_pred HHHhcCeeEEec-CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHH---HHHHHH Q lcl|NC_020866. 258 ANIAGDCAAIIP-ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLV---QEDIET 333 (547) Q Consensus 258 ~~~~~~~~~vip-~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v---~~~~~~ 333 (547) .++.....+.++ +|.+++++.... ....++..++.+.+.|...--+..++.++. |+..+|..-.-. ....+. T Consensus 292 ~~~~~~~~i~~~~~~~~~~~l~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~ 367 (474) T protein:vir:96 292 RNLKYYKAINVDGDGSGVDTIQIEV---PVQSSKEYLDMLRDYVIEFGQGVDFQQDKF-GNSPSGIALKFMYSNLDLKAN 367 (474) T ss_pred hhhhcCceEEecCCCCceeEEeecC---ChHHHHHHHHHHHHHHHHHhCCcccccccc-ccccHHHHHHHHHHHHHHHHH Confidence 455566666666 577899986543 223578888888888877643333443432 333334332222 122333 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCC-CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcch Q lcl|NC_020866. 334 SDAMALAAILNRDLIRPWIQLEYG-PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAK 411 (547) Q Consensus 334 ad~~~i~~~ln~~li~~l~~~Nfg-~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~ 411 (547) .-.+.+...+. ++++.++.+.-. .+..-..+.|....+.+..+.++ .+++.|+ ++.+.+.+.++. 
+.|+ ++ T Consensus 368 ~k~~~~~~~l~-~~~~~i~~~~~~~~~~~~i~i~f~~~~p~~~~e~~~---~~~~ag~-iS~et~~~~~~~v~d~~--~E 440 (474) T protein:vir:96 368 KLKNKTLTALQ-ELLQYIIDFYKLNIKVQDVEITFNFNVMVNELEQSQ---IGVQSQY-LSKETVVTNHPWVDDPV--AE 440 (474) T ss_pred HHHHHHHHHHH-HHHHHHHHHhCCCcccceeeEEeccCCCcCHHHHHH---HHHhcCC-CchHHHHHhCCCCCCHH--HH Confidence 44556667774 466666666421 12223467788778888766655 4566786 888888888764 3332 22 Q ss_pred hcccccccccccccccccccccCCcccccccccccc Q lcl|NC_020866. 412 LLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSG 447 (547) Q Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (547) +............ ...+..+.............+ T Consensus 441 ~~ri~~E~~e~~~--~~~~~~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 441 LERIEQDNIDFNK--QLPPLEGDANGRAQDNESETN 474 (474) T ss_pred HHHHHHHHHHHHh--cccccccccccccCCCcccCC Confidence 2211111000000 000000000000000000000 No 185 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=98.28 E-value=1.7e-06 Score=52.23 Aligned_cols=404 Identities=9% Similarity=0.034 Sum_probs=190.2 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhh--hhhHHHHhhhhccCCCHHHHHHHHHhh-cCCCHHHHHHHHHHHHhhhHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTV--TGVRSPLSGYPGDGLNPLRLAQILRAA-DHGDPVRYLDLAEVIEERDPH 77 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~--~~~~~~~~~~~~~~l~p~~~~~ilr~a-~~g~~~~~~~L~~~m~~~D~~ 77 (547) |-.+..|.-+...-+....+.+-+..... .-.......|.+.. .|+... ...... -..+ ..+. 
T Consensus 3 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~-------~i~~~~~~~~~~~-~~ki------~~n~ 68 (453) T protein:vir:73 3 LKPIKLMTYSRDEEITDKVVNDFMKKHQEEVERYEYLGNMYKGIM-------EISSQKAKDSWKP-DNRL------TNNF 68 (453) T ss_pred cccceeeeccccccCCHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------chhcCCCCCccCc-ccee------ecch Confidence 44322232333344444444333322111 00111111111110 111111 000000 0001 2455 Q ss_pred HHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeE Q lcl|NC_020866. 78 YLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLI 156 (547) Q Consensus 78 v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~ 156 (547) ..-.+.+....+.+.+..+.+.++ +..+++.++++.-+|+.....+. ++..||.+ ++++|...+|... +. T Consensus 69 ~~~ivd~~~~~l~g~~~~~~~~d~-----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~-~~~v~~d~~~~~~---i~ 139 (453) T protein:vir:73 69 AKYIVDTFVGYFNGIPIKKTHDDK-----SVLEAMQLFDNLNDMEDEESELAKIACVYGRA-YELMYQNESTESE---VI 139 (453) T ss_pred HHHHHHHhhhhhcccCceeecCCh-----HHHHHHHHHHHhcChhHHHHHHHHHHHhcCeE-EEEEEeCCCCceE---EE Confidence 666667777778888888765432 23356788887777888888777 68889986 4566654555543 33 Q ss_pred eeccceeE--ecc----------------cCCcc--------e-eeccCCCCce---eec-cCceEEEeecCCcCccccc Q lcl|NC_020866. 157 WRDPRHFR--FQK----------------SDLAT--------P-LLLDDNGAEV---PLP-AFRFIFADIPAKSGILLRS 205 (547) Q Consensus 157 ~~~~~~~~--~~~----------------~~~~~--------~-~~~~~~~~g~---~l~-~~k~i~~~~~~~~~~p~G~ 205 (547) ..+|+... ++. ++... . ......+... ..+ ++..|-... -.+++.|. 
T Consensus 140 ~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~--~~n~~~g~ 217 (453) T protein:vir:73 140 YCSPLNVFMVYDDSIKQKPLFAVYYGFDEEGNLSGTVYTLLETISITGKAGEVKFGESTYNVYSDLPIVE--YNFNEERQ 217 (453) T ss_pred EEcccceEEEEeCCCCceeEEEEEEEEecCceEEEEEEeCCeEEEEEecCCceEEccceeccCCceeEEE--ecCCCCCC Confidence 33443221 111 00000 0 0000000000 000 111111111 13467888 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHH----HHHHhcCeeEEecCCceEEEEecCC Q lcl|NC_020866. 206 GLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRA----VANIAGDCAAIIPETMSIDFVESKS 281 (547) Q Consensus 206 gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~a----l~~~~~~~~~vip~~~~ie~~e~~~ 281 (547) |.+..+....=-=...+..++..++.|..|.++.+-- ..++++...+... +.....++....+.+.+++++.... T Consensus 218 s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~ 296 (453) T protein:vir:73 218 SIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGA-EVDEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPD 296 (453) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecC-CCCchhhhcccccccccccccccccccccccCceeEEeeecC Confidence 9998765555445557888889999999998877521 2223333322211 1222334455667788899886443 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh-cCC Q lcl|NC_020866. 282 IGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLV---QEDIETSDAMALAAILNRDLIRPWIQL-EYG 357 (547) Q Consensus 282 ~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v---~~~~~~ad~~~i~~~ln~~li~~l~~~-Nfg 357 (547) . ...++..++.+.+.|...--+..++.++. | .++|..-... ....++.-.+.+...+. ++++.++.+ +.. 
T Consensus 297 ~---~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-g-n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~-~~~~li~~~~~~~ 370 (453) T protein:vir:73 297 S---DVQTENLLNRLERSIFQFTMAANISDENF-G-NSSGVALAYKLQAMSNLALSFQRKFQSALN-RRYSLWSSLSTNA 370 (453) T ss_pred C---HHHHHHHHHHHHHHHHHHhCCcccCcccc-c-CccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcc Confidence 2 23578889999888876532333333332 2 2233222221 12233344455666664 355555554 211 Q ss_pred ---CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcchhccccccccccccccccccccc Q lcl|NC_020866. 358 ---PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAKLLRPLENPATQAAPPGMDPGAG 433 (547) Q Consensus 358 ---~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~~~~~~~~~~~~~~~~~~~~~~~ 433 (547) ......++.|....+.+..+.++.+.+++ |+ ++.+.+.+.++. +.|+ +++-.......... .....+.. T Consensus 371 ~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~--gi-is~et~~~~~~~~~d~~--~E~~ri~~E~~~~~--~~~~~~~~ 443 (453) T protein:vir:73 371 SNKDAWKDIEYTFTRNEPKDIKEQAETANILK--GI-TSEETALSVISVIPDVQ--AEMEKIKKKKLLQL--SLTRTSNL 443 (453) T ss_pred CCccccccceEEeCCCCCCCHHHHHHHHHHHh--cc-CcHHHHHHhCCCCCCHH--HHHHHHHHHHHHHH--HHHHhccC Confidence 12234578898888999999999999986 66 888888888865 3332 22111111000000 00000000 Q ss_pred CCccccccccccccchhhhhhcccchhhhcccccccchhHHHHH Q lcl|NC_020866. 
434 DLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIAAL 477 (547) Q Consensus 434 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 477 (547) ...++..+.| T Consensus 444 ----------------------------------~~~~~~~~~~ 453 (453) T protein:vir:73 444 ----------------------------------VRMKQMRGNL 453 (453) T ss_pred ----------------------------------CcchhhhcCC Confidence 0000001111 No 186 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=98.24 E-value=2.2e-06 Score=51.57 Aligned_cols=401 Identities=9% Similarity=0.017 Sum_probs=182.5 Q ss_pred CCCCCeeeCCCCCcccchhhhh--hhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHH---HHH-------- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTE--EVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRY---LDL-------- 67 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~--~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~---~~L-------- 67 (547) |++ |.=+..++.-...... ..+..+...+.+.+..|-.+ ..++.. ++..-.|..... ... T Consensus 1 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~---~~~~~~-~~~yY~g~~~i~~~~~~~~~~~~~~~ 73 (468) T protein:vir:96 1 MID---IFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKEN---VEDITV-GERYYNHQPDVLFNAPKRNVKGEIDP 73 (468) T ss_pred Ccc---ccCCcCceeehheeecccccccCcHHHHHHHHHHHHHH---HHHHHH-HHHHhcCCCccccccccccccccccc Confidence 887 3212222221111110 00011111122222222111 112211 111112210000 000 Q ss_pred HH---HHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEE Q lcl|NC_020866. 68 AE---VIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVW 143 (547) Q Consensus 68 ~~---~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w 143 (547) +. .+ ..+...-.+.+....+.+.+..+...++ +..+.+.+++++ +|.+.+..+. 
++..||.++ +++| T Consensus 74 ~~~~~ki--~~n~~~~Iv~~~~~~l~g~p~~~~~~d~-----~~~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~-~~v~ 144 (468) T protein:vir:96 74 FKPDWRM--YTNYHQNLVDQKVAYAVANPVTYGTEDE-----KSLKTIQEVLNH-KWDDKLVDILTAASNKGVEW-IQPY 144 (468) T ss_pred ccccccc--ccchHHHHHHHHHhhhccCCceeccCCh-----HHHHHHHHHHhc-CHHHHHHHHHHHHhhcCeEE-EEEE Confidence 00 01 1455566667777788888888765432 234566677754 5666666554 788899975 4566 Q ss_pred eecCCeeeEEeeEeeccceeE--ecccCCccee--e---ccCCCCceee-ccCceEEEeec------------------- Q lcl|NC_020866. 144 NTSEGQYFPDKLIWRDPRHFR--FQKSDLATPL--L---LDDNGAEVPL-PAFRFIFADIP------------------- 196 (547) Q Consensus 144 ~~~~g~~~~~~l~~~~~~~~~--~~~~~~~~~~--~---~~~~~~g~~l-~~~k~i~~~~~------------------- 196 (547) ...+|.+. +..++|+.+. ++......+. . ..+......+ .+.+..+++.. T Consensus 145 ~d~~~~~~---i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (468) T protein:vir:96 145 VDEQGEFK---TFRVPAEQAIPIWTNKERDELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHY 221 (468) T ss_pred EcCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeecccccccccccce Confidence 54455443 3344444322 1111001100 0 0000000000 00000000000 Q ss_pred ---------------CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_020866. 197 ---------------AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIA 261 (547) Q Consensus 197 ---------------~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~ 261 (547) .-.+++.|.|.+..+....---...+..++..++.|..|+++.+--.. ++.. .....+. 
T Consensus 222 ~~~~~~~~~~~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~---~~~~---~~~~~~~ 295 (468) T protein:vir:96 222 YVGNKSMSWNRVPFIPFKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEG---EDLE---EFMYNLK 295 (468) T ss_pred eeccccccCCcccEEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc---cccc---hhhhhhh Confidence 012367899999886655555556788899999999999988773221 1111 1223344 Q ss_pred cCeeEEecC--CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHH---HHHHHHHHH Q lcl|NC_020866. 262 GDCAAIIPE--TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLV---QEDIETSDA 336 (547) Q Consensus 262 ~~~~~vip~--~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v---~~~~~~ad~ 336 (547) ....+.++. +.+++++..... ...++..++.+.+.|...--+..++.++.+ +..+|..-... ....+..-. T Consensus 296 ~~~~i~~~~d~~~~~~~l~~~~~---~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Alk~~~~~l~~k~~~k~ 371 (468) T protein:vir:96 296 YYKAINVDGDGSGGVDTIQIDVP---VQSAKEYLDMLRDYVIEFGQGVDFQQDKFG-NSPSGIALKFMYSNLDLKANKLK 371 (468) T ss_pred cCceEEecCCCCCcceEEeecCC---hHHHHHHHHHHHHHHHHHhCcccccccccc-cchHHHHHHHHHHHHHHHHHHHH Confidence 455555653 456888865432 345888888888888776333334434322 33334332211 222234445 Q ss_pred HHHHHHHHHHHHHHHHHhcCC-CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCcchhcc Q lcl|NC_020866. 337 MALAAILNRDLIRPWIQLEYG-PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPGAKLLR 414 (547) Q Consensus 337 ~~i~~~ln~~li~~l~~~Nfg-~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~e~~~~ 414 (547) +.+...+. ++++.++.+.-. .+.....+.|....+.+..+.++. +++.|+ ++.+.+.+.++. ..| .+++.. 
T Consensus 372 ~~~~~~l~-~~~~li~~~~g~~~d~~~i~i~f~~~~p~d~~e~a~~---~~~~g~-iS~et~i~~l~~v~D~--~~E~~r 444 (468) T protein:vir:96 372 NKTLTALQ-ELLQYIIDFYKLSIKVQDVEITFNFNVMVNELEQSQI---GVNSQY-LSKETVVTNHPWVDDP--VAEMER 444 (468) T ss_pred HHHHHHHH-HHHHHHHHHhCCCcccceeeEEecCCCCcCHHHHHHH---HHhcCC-CchHHHHHhCCCCCCH--HHHHHH Confidence 56666774 466666666411 122345778888888887766654 456786 888888888744 332 222222 Q ss_pred cccccccccc-cccccccccCCcc Q lcl|NC_020866. 415 PLENPATQAA-PPGMDPGAGDLKT 437 (547) Q Consensus 415 ~~~~~~~~~~-~~~~~~~~~~~~~ 437 (547) .......... ......+....|+ T Consensus 445 i~~E~~~~~~~~~~~~~~~~~~~~ 468 (468) T protein:vir:96 445 IDQEELALPSIEEGLNGKENNEPT 468 (468) T ss_pred HHHHHHHHHHHhhccCCCCCCCCC Confidence 1111110000 0111111111111 No 187 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=98.23 E-value=2.3e-06 Score=51.50 Aligned_cols=401 Identities=12% Similarity=0.081 Sum_probs=165.6 Q ss_pred hhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHH------hhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCC Q lcl|NC_020866. 20 LTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILR------AADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLD 93 (547) Q Consensus 20 l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr------~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~ 93 (547) ++++... +....+.+..+-.+ ..++..-.+ .....-...+. ..........-++++....+.... T Consensus 1 ~~t~~d~--i~~L~~~~~~~~~r---~~~~~~Yy~G~~~i~~~~~~~~~~~~----~~~~~~n~~~~ivd~~~~~l~~~g 71 (480) T protein:vir:78 1 MTTYHEH--VERLQGLLARDLPN---LLEAEAYRNGTRRLKTIGIGAPPELA----YLDVQPGWVATYLRTLSDRLDIEG 71 (480) T ss_pred CCCHHHH--HHHHHHHHHHHHHH---HHHHHHHHhccccchhcccccchhhh----hhhhhcchHHHHHHHHHhhhccCc Confidence 3333221 11111111111000 111111100 00000000000 000012222333333333333334 Q ss_pred ceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEee------cCCeeeEEeeEeeccceeEe- Q lcl|NC_020866. 
94 ITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNT------SEGQYFPDKLIWRDPRHFRF- 165 (547) Q Consensus 94 w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~------~~g~~~~~~l~~~~~~~~~~- 165 (547) |.+ ++++. ..+.+.++++.-+|+.....++ ++..||.| ++++|.- .++.. .+..++|+.... T Consensus 72 ~~~--~~d~~----~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~a-y~~v~~~~~~~~d~~~~~---~i~~~~p~~~~~i 141 (480) T protein:vir:78 72 FRI--SEDSE----GLEELWNWWQANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIP---LIRVESPLYMYAE 141 (480) T ss_pred eec--CCCch----hHHHHHHHHHhcCHHHHHHHHHHHHhhcCce-EEEeecCccccCCCCCee---EEEEEcccceEEE Confidence 433 12222 2344666676667888888876 79999997 4667742 12322 233444433211 Q ss_pred -ccc----------------CCcc------------eeeccCCC-------Ccee----eccCceEEEeecCCcCccccc Q lcl|NC_020866. 166 -QKS----------------DLAT------------PLLLDDNG-------AEVP----LPAFRFIFADIPAKSGILLRS 205 (547) Q Consensus 166 -~~~----------------~~~~------------~~~~~~~~-------~g~~----l~~~k~i~~~~~~~~~~p~G~ 205 (547) +.. +... ......++ .+.. ++.--++.+.++.+.+.++|. T Consensus 142 ~D~~~~~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~ 221 (480) T protein:vir:78 142 LDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGR 221 (480) T ss_pred EcCCCccceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCc Confidence 110 0000 00000000 0111 111234456677777889999 Q ss_pred hhHHH-HHHHHHHHHHHHHHHHHHHHHcCCCeEEEe--CCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCC Q lcl|NC_020866. 206 GLARV-AAWGWMFKAFTARDWAIFTQTYGQPLRIGK--WQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSI 282 (547) Q Consensus 206 gll~~-~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk--~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~ 282 (547) |-+.. +....=-=...+...+...+.|..|.++.+ ......++...... ....+.. ....|.+.++.+... 
T Consensus 222 sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~----~~~~~~~-~~~~~~~~~~~~~~~- 295 (480) T protein:vir:78 222 SEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTL----DIYYGRI-LTLASEAAKISEFKA- 295 (480) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCCccccccccccchh----hhhhhhh-ccCCCCCceEEecCc- Confidence 87764 322222223345566777899999977654 11111111111111 1111222 233455677776443 Q ss_pred CCchHHHHHHHHHHHHHHHHHHhcchhccC----ccccccchhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 283 GSSVDLYEKRINHLDQQISKAVLGQTTTTD----AVAGGHAVSQE---HRLVQEDIETSDAMALAAILNRDLIRPWIQLE 355 (547) Q Consensus 283 ~~~~~~f~~li~~~d~~Isk~ilGqtlts~----~~ggs~a~~~v---h~~v~~~~~~ad~~~i~~~ln~~li~~l~~~N 355 (547) .+.+.|...++.+-.+|+.. +-+.. +.+.+.++|.. ...-....++.-.+.+...|.+ +++.++.++ T Consensus 296 -~~~~~~~~~l~~~i~~~~~~----~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~-~~rl~~~~~ 369 (480) T protein:vir:78 296 -AELRNFAEEMEVFRKEAASI----TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER-AMRIAMQIM 369 (480) T ss_pred -cCHHHHHHHHHHHHHHHhcc----cCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHc Confidence 33445666555555555433 21111 11111122322 2222334455556666667754 666666665 Q ss_pred CCC---cccccEEEecCccccCHHHHHHHHHHHHhCCC-CCCHHHHHHHhCCCCCCCcchhccc-----------ccccc Q lcl|NC_020866. 356 YGP---QKRYPRIKIARPKTEDLTKLASSLDTLVRLGM-EIEENEVRSRFGFSQPKPGAKLLRP-----------LENPA 420 (547) Q Consensus 356 fg~---~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~-~v~~~~~~e~~Glp~p~~~e~~~~~-----------~~~~~ 420 (547) -.. ......+.|......++.+.++.+.+|+..|. .++.+.+++.+|+.+... +++... ...+. 
T Consensus 370 ~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~-~e~~~~~~~~~~~~~~~~~~~~ 448 (480) T protein:vir:78 370 GREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR-EQMRDWDKQETEDMIDTLYSTT 448 (480) T ss_pred CCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHH-HHHHHHHHHHHHHHHHHhhccc Confidence 321 11234677877778888999999999998874 578899999999864321 111100 00000 Q ss_pred cccccccccccccCCccccccccccccchhhhhhcccc Q lcl|NC_020866. 421 TQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSL 458 (547) Q Consensus 421 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (547) ...+.....+..+.. ++ +...++.-..+.... T Consensus 449 ~~~~~~~~~~~~~~~--~~----~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 449 KAQADATPKPTVTET--KT----ETQTSPSGFNRTKTR 480 (480) T ss_pred cCCCccccCCCCCCC--CC----ccCCCcccCCCcCCC Confidence 000000000000000 00 000000000000000 No 188 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=98.20 E-value=2.6e-06 Score=51.17 Aligned_cols=400 Identities=11% Similarity=0.096 Sum_probs=188.2 Q ss_pred CCCCC-eee-CCCCCcccchhhhhhhhhhhh--hhhHHHHhhhhccCCCHHHHHHHHHhh-cCCCHHHHHHHHHHHHhhh Q lcl|NC_020866. 1 MAQKP-VLL-DRWGNPVQRAQLTEEVAAPTV--TGVRSPLSGYPGDGLNPLRLAQILRAA-DHGDPVRYLDLAEVIEERD 75 (547) Q Consensus 1 ma~~~-~~~-~~~g~~~~~~~l~~~~a~~~~--~~~~~~~~~~~~~~l~p~~~~~ilr~a-~~g~~~~~~~L~~~m~~~D 75 (547) |--.| .+. -+...++....+.+-+..... .-.......|.+.. .|+... ...... -.. + .. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~-------~i~~~~~~~~~~~-~~k----i--~~ 66 (452) T protein:vir:36 1 MKYKPPKLMTFSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIM-------AIDDEPAKDSWKP-DNR----L--AV 66 (452) T ss_pred CcccCceeEEcCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------ccccCccccccCc-cce----e--ec Confidence 54222 222 223445544444443332111 11111112222211 011110 000000 000 1 24 Q ss_pred HHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEe Q lcl|NC_020866. 76 PHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDK 154 (547) Q Consensus 76 ~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~ 154 (547) +...-.+.+....+.+.+..+.+.++. .-+++.++++.-+|+..+..+. ++..||.+ ++++|.-.+|... T Consensus 67 n~~~~ivd~~~~~l~g~~~~~~~~d~~-----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~-~~~v~~d~~g~~~--- 137 (452) T protein:vir:36 67 NFTKYIVDTFTGYFNGIPVKKSHSDKE-----ILTKLQEFDNLNDMEDEESELAKMACIYGRA-FEFLYQDEDTQTN--- 137 (452) T ss_pred chHHHHHHHHhhhhcccCceeecCChh-----HHHHHHHHHhhcChhHHHHHHHHHHHhcCeE-EEEEEecCCCeeE--- Confidence 566667778888888999888765332 2356777777778888888777 68889975 4666654455443 Q ss_pred eEeeccceeE--ecccCCcce------------------------eeccCCCCceee-----ccCceE-EEeecCCcCcc Q lcl|NC_020866. 155 LIWRDPRHFR--FQKSDLATP------------------------LLLDDNGAEVPL-----PAFRFI-FADIPAKSGIL 202 (547) Q Consensus 155 l~~~~~~~~~--~~~~~~~~~------------------------~~~~~~~~g~~l-----~~~k~i-~~~~~~~~~~p 202 (547) +..++|+... ++......+ ........+..+ .++..| ++. 
-.+++ T Consensus 138 i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~iPvv~---~~n~~ 214 (452) T protein:vir:36 138 VVYNSPENMFMVYDDTVKQEPLFAVRYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYNPYPDLPVVE---FYFNE 214 (452) T ss_pred EEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEEecCeEEEEEEcCCceEEecceeccCCcccEEE---ecCCC Confidence 3334444321 111100000 000000000000 011111 111 13356 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCC-----ceEEEE Q lcl|NC_020866. 203 LRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCAAIIPET-----MSIDFV 277 (547) Q Consensus 203 ~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~-----~~ie~~ 277 (547) .|.|.+..+....=--...+..++..++.|..|+++.+= ...++++.. ++.....+.++.+ .+++++ T Consensus 215 ~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g-~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~l 286 (452) T protein:vir:36 215 ERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG-AAVEEEDLK-------NIRSNRVINYYADGEGKNVDVKFL 286 (452) T ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeec-CCcCchhhh-------hhhhcceEEecCCCCccCCcceeE Confidence 788999876665555556788899999999999888762 222332222 2233344555543 357777 Q ss_pred ecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020866. 278 ESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRL---VQEDIETSDAMALAAILNRDLIRPWIQL 354 (547) Q Consensus 278 e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~---v~~~~~~ad~~~i~~~ln~~li~~l~~~ 354 (547) .... ....++..++.+.+.|...--+..++.++. | .++|..-.. -....+..-.+.+...+. 
++++.++.+ T Consensus 287 ~~~~---~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-g-n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~ 360 (452) T protein:vir:36 287 EKPD---SDSQTENLLDRLTKLIFQTTMVANISDESF-G-SSSGVSLAYKLQAMSNLALSFQRKFQSSLN-SRYKLFCEL 360 (452) T ss_pred eecC---CHHHHHHHHHHHHHHHHHHhCccccCcccc-c-CCcHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 6443 234578888888888866532223333322 2 222322211 122233344455666664 366655554 Q ss_pred c--CC--CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccc--cccccc Q lcl|NC_020866. 355 E--YG--PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQ--AAPPGM 428 (547) Q Consensus 355 N--fg--~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~--~~~~~~ 428 (547) . .+ .+.....+.|....+.+..+.++.+.+++ |+ ++.+.+.+.++.-. +..+++-......... ...... T Consensus 361 ~~~~~~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~~--g~-iS~et~~~~~~~~~-d~~~E~~ri~~E~~~~~~~~~~~~ 436 (452) T protein:vir:36 361 STNVSNKDSWKDIEYTFTRNEPKDIKEQAETANILM--GI-TSQETALSVISVIP-DVQAEMEKIKKEEASTAIFDKDKQ 436 (452) T ss_pred HhccCCccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CChHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHHHhhcc Confidence 3 11 12233578888888999999999998874 65 89999999887531 1122222111111000 000000 Q ss_pred cccccCCccccccccccccchhhhhhcccchhhhcccccccchhH Q lcl|NC_020866. 429 DPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAE 473 (547) Q Consensus 429 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 473 (547) .+..+........ ++. 
T Consensus 437 ~~~~~~~~~~~~~-----------------------------~~e 452 (452) T protein:vir:36 437 PSEKGTDTVVSET-----------------------------NEE 452 (452) T ss_pred CCCCcccccCccc-----------------------------cCC Confidence 0000000000000 000 No 189 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=98.20 E-value=2.7e-06 Score=51.12 Aligned_cols=410 Identities=12% Similarity=0.053 Sum_probs=177.8 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhh---hhhHHHHhhhhccCCCHHHHHHHHHhhcCCC-HHHHHHHHHHHHhhhH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTV---TGVRSPLSGYPGDGLNPLRLAQILRAADHGD-PVRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~---~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~-~~~~~~L~~~m~~~D~ 76 (547) |.++-...=+.+..+....+..-+..... .-.......|-+.. + +++.....+ ...-..+ ..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~-~------i~~~~~~~~~~~~~~ki------~~n 67 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDN-N------IKYRPAKTDKYAADNRI------ASD 67 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC-c------cccccccccccCCccee------ecc Confidence 66666555455555544444444332211 11111111121111 0 111000000 0000001 245 Q ss_pred HHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEee---cCCeeeE Q lcl|NC_020866. 77 HYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNT---SEGQYFP 152 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~---~~g~~~~ 152 (547) ...-.+.+....+.+.+..+.+.++. .-+++..+++.-+|+.....+. ++..||.+..=+.+.. .++.. 
T Consensus 68 ~~~~iv~~~~~~l~g~~~~~~~~d~~-----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~-- 140 (489) T protein:vir:99 68 FAKYITVFEQGYMLGVPVEYKNENKD-----LQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEV-- 140 (489) T ss_pred hHHHHHHHHhhhhccCCceeecCChh-----HHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcce-- Confidence 55566667777788888887764332 3356777777667877766655 6888997754332221 12222 Q ss_pred EeeEeeccceeEeccc--CCcce-----------------------------eec-cC-CCCceee----c-cCceE-EE Q lcl|NC_020866. 153 DKLIWRDPRHFRFQKS--DLATP-----------------------------LLL-DD-NGAEVPL----P-AFRFI-FA 193 (547) Q Consensus 153 ~~l~~~~~~~~~~~~~--~~~~~-----------------------------~~~-~~-~~~g~~l----~-~~k~i-~~ 193 (547) .+..++|+.+....+ ....+ ... .. ...+..+ | +...| +. T Consensus 141 -~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 219 (489) T protein:vir:99 141 -KLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPVN 219 (489) T ss_pred -EEEEEcccceEEEEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeEE Confidence 244444443211110 00000 000 00 0011000 0 11111 11 Q ss_pred eecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCC--CCCCHHHHHHHHHH--------HHHHhcC Q lcl|NC_020866. 194 DIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQ--PGASEQDKDTLFRA--------VANIAGD 263 (547) Q Consensus 194 ~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~--~~~~~~~~~~l~~a--------l~~~~~~ 263 (547) .+ .+++.|.|.+..+....--=...+..++..++.|..|+++.+-. ......+.....+. ....... T Consensus 220 ~~---~n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (489) T protein:vir:99 220 EY---ANNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAISIGFKKA 296 (489) T ss_pred Ee---ecCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccccccccccc Confidence 11 13467888888765544444556788888889999998776521 11111111111110 0011111 Q ss_pred eeEEec-------CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHH---HHHHHH Q lcl|NC_020866. 
264 CAAIIP-------ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLV---QEDIET 333 (547) Q Consensus 264 ~~~vip-------~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v---~~~~~~ 333 (547) ..+.++ .+.+++++.... ....++..++.+.+.|.+.--+..++.++.+ +..+|..-... ....+. T Consensus 297 ~~~~~~~~~~~~~~~~~~~~l~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~ 372 (489) T protein:vir:99 297 QVLILDDNPNPNGVKPQAYFLKKEY---DTAGSEAYKNRLVADILRFTFTPDTQDMKFS-GVQSGESMKYKLMASDNYRE 372 (489) T ss_pred eeeeeccccCccccccceeeeeecC---ChHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHH Confidence 112222 234566665332 2235777888888888765222223333322 33334332221 122233 Q ss_pred HHHHHHHHHHHHHHHHHHHHhc---CCCc-----ccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhC-CC Q lcl|NC_020866. 334 SDAMALAAILNRDLIRPWIQLE---YGPQ-----KRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFG-FS 404 (547) Q Consensus 334 ad~~~i~~~ln~~li~~l~~~N---fg~~-----~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~G-lp 404 (547) .-.+.+...+. ++++-++.+- .+.. .....+.|....+.+..+.++++.+++ |+ ++.+.+.+.++ +. T Consensus 373 ~k~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~--gi-is~et~~~~l~~v~ 448 (489) T protein:vir:99 373 KQERLFKKGLM-RRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY--GI-VSDQTIFEILNTVT 448 (489) T ss_pred HHHHHHHHHHH-HHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CCHHHHHHhcCCCC Confidence 33455556663 3555554431 1111 123578888888999999999999885 66 88888877764 43 Q ss_pred CCCCcchhcccccc---cccccccccccccccCCccccccccc Q lcl|NC_020866. 405 QPKPGAKLLRPLEN---PATQAAPPGMDPGAGDLKTAERKLKP 444 (547) Q Consensus 405 ~p~~~e~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (547) .++..+++-..... .......... 
+.......+.+..| T Consensus 449 ~~d~~~E~~ri~~E~~~~~~~~~~~~~--~~~~~~~~~~~~~p 489 (489) T protein:vir:99 449 GVDAEAELKRLKEEADKKQSLPEPRLV--GDASGQEEPTAEKP 489 (489) T ss_pred chhHHHHHHHHHHHHHHHhcccccccc--CCCCCCcCCCCCCC Confidence 22222222111111 1111111100 00000000001111 No 190 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=98.16 E-value=3.3e-06 Score=50.64 Aligned_cols=362 Identities=9% Similarity=0.042 Sum_probs=179.3 Q ss_pred CCHHHHHHHHHhhcCCCHHHHHHHHHHHHh--------------------------hhHHHHHHHHHHHHHHhcCCceEe Q lcl|NC_020866. 44 LNPLRLAQILRAADHGDPVRYLDLAEVIEE--------------------------RDPHYLGVIGTRKRSVSQLDITVE 97 (547) Q Consensus 44 l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~--------------------------~D~~v~s~l~~Rk~~v~~~~w~v~ 97 (547) ||++.+...++.... ...++..+.+.+.- ......-.+.+....+.+.+..+. T Consensus 1 l~~~~i~~~i~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~ 79 (451) T protein:vir:10 1 MELEKIRAIISADAA-RRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFD 79 (451) T ss_pred CCHHHHHHHHHHHHH-HHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceee Confidence 333333332222111 11111111111100 134444556666777778877766 Q ss_pred cCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecC--------CeeeEEeeEeeccceeE--ec Q lcl|NC_020866. 98 PASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSE--------GQYFPDKLIWRDPRHFR--FQ 166 (547) Q Consensus 98 p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~--------g~~~~~~l~~~~~~~~~--~~ 166 (547) ..++ ++..+.++.++++ +|+.....+. ++.-||.+. +++|...+ |.+ ++..++|+.+. 
++ T Consensus 80 ~~~~----~~~~~~~~~~~~n-~~~~~~~~~~~~~~~~G~a~-~~~y~de~~~~~~~~~~~~---~~~~i~p~~~~~vyd 150 (451) T protein:vir:10 80 IDNN----KELNEKVTDVLGN-EFTRKAKNLAIEASNCGSAW-LHYWIDEEYSGEQVTNQTF---KYGVVNTEEIIPIYR 150 (451) T ss_pred cCCc----HHHHHHHHHHhcc-CHHHHHHHHHHHHhhcCeEE-EEEeecCCcccccccccce---eEEEEcccceEEEEc Confidence 4432 2233556666653 6788877765 688899776 45553321 222 24444444332 11 Q ss_pred ccCCcc------eeeccC--CC----------------------------Cc-----eeec-cCceEEEeecCCcCcccc Q lcl|NC_020866. 167 KSDLAT------PLLLDD--NG----------------------------AE-----VPLP-AFRFIFADIPAKSGILLR 204 (547) Q Consensus 167 ~~~~~~------~~~~~~--~~----------------------------~g-----~~l~-~~k~i~~~~~~~~~~p~G 204 (547) ...... ...... .+ .+ ...+ .+..|-+.. -.+|..| T Consensus 151 d~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~--~~nn~~~ 228 (451) T protein:vir:10 151 NGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVE--FSNNIKK 228 (451) T ss_pred CCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEE--eccCCCC Confidence 110000 000000 00 00 0000 111111111 1235678 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeC-CCCCCHHHHHHHHHHHHHHhcCeeEEecC-----CceEEEEe Q lcl|NC_020866. 205 SGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKW-QPGASEQDKDTLFRAVANIAGDCAAIIPE-----TMSIDFVE 278 (547) Q Consensus 205 ~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~-~~~~~~~~~~~l~~al~~~~~~~~~vip~-----~~~ie~~e 278 (547) .|.+..+....---...+...+..++.|.-|+++.+- ....+.+ .+.++.....++++. +.+++++. T Consensus 229 ~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~-------~~~~~~~~~~i~~~~~~~~~~~~~~~l~ 301 (451) T protein:vir:10 229 QSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSE-------FLKELKRYKTIKTETDSEGDSGGLKTMQ 301 (451) T ss_pred CCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchh-------hHHHHhhCCeEEecCcCCccCCcceEEe Confidence 8999887666655666788889999999999988772 2111222 223455556666653 45688886 Q ss_pred cCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 
279 SKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLVQ---EDIETSDAMALAAILNRDLIRPWIQLE 355 (547) Q Consensus 279 ~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v~---~~~~~ad~~~i~~~ln~~li~~l~~~N 355 (547) ... ....++..++++.+.|.+.--+..++.++. | .++|..-.-.. ...+..-.+.+...+. ++++.++.+. T Consensus 302 ~~~---~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-g-n~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~-~~~~li~~~~ 375 (451) T protein:vir:10 302 IEI---PTEARKIILEILKKQIYESGQGLQQDTENF-G-NASGVALKFFYRKLELKSGLLETEFRTSFD-KLIKAILYFL 375 (451) T ss_pred ecC---CHHHHHHHHHHHHHHHHHHhCccccccccc-c-cccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHh Confidence 543 234588889999888887733333433332 2 33343332222 2233344555666663 4666666654 Q ss_pred CCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCC-CCCcchhcccccccccccccccccccccC Q lcl|NC_020866. 356 YGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQ-PKPGAKLLRPLENPATQAAPPGMDPGAGD 434 (547) Q Consensus 356 fg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~-p~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 434 (547) -..+.....+.|....+.+..+.++.+.+++ |+ ++.+.+.+.++.-. |++.++................. + T Consensus 376 ~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~--g~-iS~et~~~~~p~v~d~~~e~~~~~ee~~~~~~~~~~~~-~---- 447 (451) T protein:vir:10 376 GVTDYKKIQQTYTRNMMSNDLEDADIATKSV--GI-IPTKIILRHHPWVDDVEEAEKLYLEEKKIQASKVSDDY-N---- 447 (451) T ss_pred CCCCccceeEEecCCCCCCHHHHHHHHHHHh--cc-CchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhc-C---- Confidence 2223344578888889999999999999985 65 99998988887633 32222211111100000000000 0 Q ss_pred Cccccccccccccchhhhhhcccchhhhcccccccch Q lcl|NC_020866. 435 LKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGG 471 (547) Q Consensus 435 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (547) .+. 
+ T Consensus 448 ----------------------~~~-----------~ 451 (451) T protein:vir:10 448 ----------------------NFT-----------E 451 (451) T ss_pred ----------------------CCC-----------C Confidence 000 0 No 191 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.03 E-value=6.4e-06 Score=49.05 Aligned_cols=415 Identities=12% Similarity=0.034 Sum_probs=159.5 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHH-HH------HHHHHHHHh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPV-RY------LDLAEVIEE 73 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~-~~------~~L~~~m~~ 73 (547) |- + ++.-.-.+-.-..+-+.+. +.+..+. .|+... ...-.|+.. .+ .++ ..+.- T Consensus 1 ~~--~-~~~~~~e~~~~~~~~~~l~--------~~~~~~~------~r~~~l-~~YY~G~~~i~~~~~~~~~~~-~~~~~ 61 (486) T protein:vir:42 1 MT--A-PLPGMEEIEDPAVVREEMI--------SAFEDAS------KDLASN-TSYYDAERRPEAIGVTVPREM-QQLLA 61 (486) T ss_pred CC--C-CCCCCCCcccHHHHHHHHH--------HHHHHHH------HHHHHH-HHHhcccCcchhcccccchhH-hhhhh Confidence 22 1 1221111111111111111 1111110 011111 001111100 00 000 00000 Q ss_pred hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeee- Q lcl|NC_020866. 74 RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYF- 151 (547) Q Consensus 74 ~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~- 151 (547) .-...+-+++.....+.-..|.+ ++++..+ +.+.++++.-+|+....+++ ++..||.| ++++|...++... 
T Consensus 62 v~n~~~~iVd~~~~~l~~~g~~~--~~~~~~~----~~~~~i~~~N~~d~~~~~~~~~a~~~G~a-y~~v~~~e~~~~~~ 134 (486) T protein:vir:42 62 HVGYPRLYVDSVAERQAVEGFRL--GDADEAD----EELWQWWQANNLDIEAPLGYTDAYVHGRS-FITISKPDPQLDLG 134 (486) T ss_pred ccchHHHHHHHHHhhhcccceec--CCCchhH----HHHHHHHHhcChhHHHHHHHHHHhhcCce-EEEEecCCcccccc Confidence 11122222232222222233433 2222222 23455666557888777766 68889997 6677753222110 Q ss_pred ----EEeeEeeccceeE--ecc---------------cC-Ccc----------eeeccCCCCc-----e--eeccCceEE Q lcl|NC_020866. 152 ----PDKLIWRDPRHFR--FQK---------------SD-LAT----------PLLLDDNGAE-----V--PLPAFRFIF 192 (547) Q Consensus 152 ----~~~l~~~~~~~~~--~~~---------------~~-~~~----------~~~~~~~~~g-----~--~l~~~k~i~ 192 (547) ...+..++|+... ++. ++ ... +......+.. + .++.--++. T Consensus 135 ~~~~~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~ 214 (486) T protein:vir:42 135 WDQNVPIIRVEPPTRMHAEIDPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPHGLGVVPVVP 214 (486) T ss_pred cCCCeeEEEEecccceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEEeecceecCCCCceEEE Confidence 0123333333221 110 00 000 0001111110 0 111113455 Q ss_pred EeecCCcCccccchhHHHHHHH-HHHHHHHHHHHHHHHHHcCCCeEEEeC--CCCCC-HHHHHHHHHHHHHHhcCeeEEe Q lcl|NC_020866. 193 ADIPAKSGILLRSGLARVAAWG-WMFKAFTARDWAIFTQTYGQPLRIGKW--QPGAS-EQDKDTLFRAVANIAGDCAAII 268 (547) Q Consensus 193 ~~~~~~~~~p~G~gll~~~~~~-~~~K~~~~~~w~~f~E~~G~P~~vgk~--~~~~~-~~~~~~l~~al~~~~~~~~~vi 268 (547) +.++.+.+.++|.|-+..-..+ .=-=...+.+.+...|.|..|.++.+- +.... +.++.. 
.+-....+...++ T Consensus 215 ~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~---~~~~~~~~~~~~~ 291 (486) T protein:vir:42 215 LPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQ---TLFDAYLARILAF 291 (486) T ss_pred eccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCcccccccccccc---chhhhhhchhccc Confidence 6677777888999887642222 212223455677778999999877652 11111 111110 0011112222333 Q ss_pred cCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccC-cccc---ccchhHHH---HHHHHHHHHHHHHHHHH Q lcl|NC_020866. 269 PETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTD-AVAG---GHAVSQEH---RLVQEDIETSDAMALAA 341 (547) Q Consensus 269 p~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~-~~gg---s~a~~~vh---~~v~~~~~~ad~~~i~~ 341 (547) | +.+.+|.+... .+.+.| +++++.-|.+. .+.+-+++ .-|| +.++|..- ..-....++.-.+.+.. T Consensus 292 ~-~~~~~~~q~~~--~~~e~~---~~~l~~~i~~~-s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~ 364 (486) T protein:vir:42 292 E-DAEGKIQQFSA--AELANF---TNALDQIAKQV-AAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGG 364 (486) T ss_pred C-CCCceEEeecc--cCHHHH---HHHHHHHHHHH-hcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3 34677766542 333344 44444444333 22222221 1111 11223221 22223334445566666 Q ss_pred HHHHHHHHHHHHhcCCC----cccccEEEecCccccCHHHHHHHHHHHHhCCC-CCCHHHHHHHhCCCCCCCcchhcc-- Q lcl|NC_020866. 342 ILNRDLIRPWIQLEYGP----QKRYPRIKIARPKTEDLTKLASSLDTLVRLGM-EIEENEVRSRFGFSQPKPGAKLLR-- 414 (547) Q Consensus 342 ~ln~~li~~l~~~Nfg~----~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~-~v~~~~~~e~~Glp~p~~~e~~~~-- 414 (547) .+.+ +++.++.+-.+. +..--++.|....+.++.+.++++.+|++.|. .++.+.+++.+|+.+... +++.. 
T Consensus 365 ~l~~-~~~l~~~~~~~~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~-~e~~~~~ 442 (486) T protein:vir:42 365 AWEE-AMRIAYRIMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKER-EEMRRWD 442 (486) T ss_pred HHHH-HHHHHHHHhcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHH-HHHHHHH Confidence 7744 555554443221 11224678888888999999999999998763 488899999999854322 12111 Q ss_pred c-ccc-cccc-cc--cccccccccCCccccccccccccchhhhhhcccchhhhcccccccc Q lcl|NC_020866. 415 P-LEN-PATQ-AA--PPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTG 470 (547) Q Consensus 415 ~-~~~-~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (547) . ... +... .. .....+..+..+..++.+.+.... +... . T Consensus 443 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~--------------~ 486 (486) T protein:vir:42 443 EEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIES---SGGD--------------A 486 (486) T ss_pred HHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCC---CCCC--------------C Confidence 0 000 0000 00 000000000111111111000000 0000 0 No 192 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=97.99 E-value=7.8e-06 Score=48.57 Aligned_cols=391 Identities=10% Similarity=-0.021 Sum_probs=156.6 Q ss_pred eCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_020866. 8 LDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKR 87 (547) Q Consensus 8 ~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~ 87 (547) ++.+-+.+ ...|-+..... ..-.+..-..|-+.. .++.....-...+..+ . + .-....-++++... 
T Consensus 1 ~~~~~~~~-i~~l~~~~~~~-~~r~~~l~~Yy~G~~--------~i~~~~~~~~~~~~~~-k-~--~~n~~~~ivd~~~~ 66 (441) T protein:vir:80 1 MNSDELAL-IEGMYDRIQRL-SSWHCCIEGYYEGSN--------RVRDLGVAIPPELQRV-Q-T--VVSWPGIAVDALEE 66 (441) T ss_pred CCccHHHH-HHHHHHHHHHH-HHHHHHHHHHHhcCC--------cchhcCcccchhhhhh-h-h--hcchHHHHHHHHHh Confidence 11111100 01111111100 000000001111100 0000000000000000 0 0 01111112222222 Q ss_pred HHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeE-- Q lcl|NC_020866. 88 SVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFR-- 164 (547) Q Consensus 88 ~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~-- 164 (547) .+.--.|.+ ++. +.+.++++.-+|+..+.++. ++..||.| ++++|.-.+|... +..++|+.+. T Consensus 67 ~l~~~g~~~----~d~------~~l~~i~~~n~~~~~~~~~~~~~~~~G~a-~~~v~~d~~g~~~---i~~~~p~~~~~i 132 (441) T protein:vir:80 67 RLDWLGWTN----GDG------YGLDGVYAANRLATASCDVHLDALIFGLS-FVAIIPHGDGTVS---VRPQSPKNCTGK 132 (441) T ss_pred hhccccccC----CCh------HHHHHHHHhcCHHHHHHHHHHHHhhcCee-EEEEEeCCCCceE---EEEEccceEEEE Confidence 121112211 111 23556666667888888876 68899987 5678865556543 3444554332 Q ss_pred ecccCCcc-------------------------eeeccCCCCc------e--eeccCceEEEeecCCcCccccchhHHH- Q lcl|NC_020866. 165 FQKSDLAT-------------------------PLLLDDNGAE------V--PLPAFRFIFADIPAKSGILLRSGLARV- 210 (547) Q Consensus 165 ~~~~~~~~-------------------------~~~~~~~~~g------~--~l~~~k~i~~~~~~~~~~p~G~gll~~- 210 (547) ++...+.. .........+ . ++...-++.+.++.+.+.++|.|-+.. T Consensus 133 ~d~~~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~ 212 (441) T protein:vir:80 133 FSADGSRLDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRS 212 (441) T ss_pred EeCCCCceeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccchhh Confidence 12111100 0000000000 0 111112344556677788889886543 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCCce---EEEEecCCCCCchH Q lcl|NC_020866. 
211 AAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCAAIIPETMS---IDFVESKSIGSSVD 287 (547) Q Consensus 211 ~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~---ie~~e~~~~~~~~~ 287 (547) +....=--...+.+++...+.|+.|.++.+- ...++..... .++..+....+|.+.+ +++.+.. .++.+ T Consensus 213 v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G-~~~~~~~~~~-----~~~~~~~i~~~~~~~~~~~~~~~~~~--~~~~~ 284 (441) T protein:vir:80 213 IRAYTDEAVRTLLGQSVNRDFYAYPQRWVTG-VSADEFSQPG-----WVLSMASVWAVDKDDDGDTPNVGSFP--VNSPT 284 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCceeeeec-CCccccccch-----hhhcccccccCCCCCCCCcceeEecC--ccchH Confidence 3222222344566777888999999887651 1111111111 1223344555565433 4555433 23334 Q ss_pred HHHHHHHHHHHHHHHHHhcchhccCcccc-ccchhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCc-- Q lcl|NC_020866. 288 LYEKRINHLDQQISKAVLGQTTTTDAVAG-GHAVSQEHR---LVQEDIETSDAMALAAILNRDLIRPWIQLE--YGPQ-- 359 (547) Q Consensus 288 ~f~~li~~~d~~Isk~ilGqtlts~~~gg-s~a~~~vh~---~v~~~~~~ad~~~i~~~ln~~li~~l~~~N--fg~~-- 359 (547) .|...++-+-.+|+.. .+-+...-+..+ +.++|..-. .-....+..-.+.+...|.+ +++.++.+- .+.. T Consensus 285 ~~~~~l~~~i~~~~~~-~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~-~~~l~~~~~~~~~~~~~ 362 (441) T protein:vir:80 285 PYSDQMRLLAQLTAGE-AAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLS-VGFLAAKALDSRVDEAD 362 (441) T ss_pred HHHHHHHHHHHHHhcc-cCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCcccc Confidence 4555444444444422 111110001111 112232221 12223333444555556643 455454442 2111 Q ss_pred -ccccEEEecCccccCHHHHHHHHHHHHhCCCC-CCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCcc Q lcl|NC_020866. 360 -KRYPRIKIARPKTEDLTKLASSLDTLVRLGME-IEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKT 437 (547) Q Consensus 360 -~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~-v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (547) ..-.++.|....+.++.+.++.+.+|+..|.. ++.+.+.+.+|+.+. +-+++......... .-+...+. 
T Consensus 363 ~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~-e~~~~~~e~~e~~~---~~~~~~~~----- 433 (441) T protein:vir:80 363 FFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDV-QVEAVMRHRAESSD---PLAVLAGA----- 433 (441) T ss_pred cceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHH-HHHHHHHHHHHHHH---HHHHHhhh----- Confidence 12347788888999999999999999999974 577778899988642 22222211111000 00000000 Q ss_pred ccccccccccchhhhhhcccchhhhcccccccchhH Q lcl|NC_020866. 438 AERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAE 473 (547) Q Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 473 (547) ...++ ++. T Consensus 434 -------------------------~~~~~---~~~ 441 (441) T protein:vir:80 434 -------------------------ISRQT---NEV 441 (441) T ss_pred -------------------------hhccc---ccC Confidence 00000 000 No 193 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=97.99 E-value=7.9e-06 Score=48.56 Aligned_cols=398 Identities=12% Similarity=0.059 Sum_probs=168.3 Q ss_pred CCCCCeeeCCCCCcccchhhhhhh-h--hhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHH---------HHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEV-A--APTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVR---------YLDLA 68 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~-a--~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~---------~~~L~ 68 (547) |.. |+++ .+. . .+.-.-+++.+.+ ...+|.++.-.++. ...-| T Consensus 1 m~~-----------V~~~---hp~y~~~~~~W~~ird~~~G-----------~~~~r~~g~~YLP~~~~e~~~~e~~~~Y 55 (501) T protein:vir:95 1 MPN-----------VSFI---RPELGKLLPLYYLIRDAIAG-----------EPTVKGARTTYLPMPNAEDQSKENKARY 55 (501) T ss_pred CCC-----------CCCC---CHHHHHHHHHHHHHHHHhcC-----------hHHHHhcccccCcCCCCCCCcccchHHH Confidence 221 1111 111 0 1111122222221 12334333222211 01223 Q ss_pred HHHHhh---hHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHH-HhCCCHHHHHHHHHH-hhhhcCeeeeeEE Q lcl|NC_020866. 
69 EVIEER---DPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDW-LKREELQQEVFHILD-CISKGYSFTEIVW 143 (547) Q Consensus 69 ~~m~~~---D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~-l~~~~~~~~i~~~l~-a~~~G~sv~Ei~w 143 (547) +....+ -.+++..++.....+.+.+-.++ .+. ++..+++.+ ..+.+++.+++.++. ++.||.+.+=+-+ T Consensus 56 ~~rl~rA~~~n~~~~t~~~l~G~vf~k~p~~~----~p~--~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~ 129 (501) T protein:vir:95 56 EAYLKRAVFYNVARRTLFGLVGQVFMRDPVVK----VPA--LLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDY 129 (501) T ss_pred HHHhhccccCchHHHHHHHHhhhhhcCCccee----CcH--HHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEee Confidence 333222 46666666666667776665553 121 111222111 123478999988884 6688887554433 Q ss_pred ee------------------------------------cCCeeeEEeeEeecc-----cee-----------EecccCCc Q lcl|NC_020866. 144 NT------------------------------------SEGQYFPDKLIWRDP-----RHF-----------RFQKSDLA 171 (547) Q Consensus 144 ~~------------------------------------~~g~~~~~~l~~~~~-----~~~-----------~~~~~~~~ 171 (547) -. .+|...+..+..+.. ..| ..+.++.- T Consensus 130 P~~~~~~~~t~a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~ 209 (501) T protein:vir:95 130 PTTEAEGGASIADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYY 209 (501) T ss_pred cCCCCcccccHHHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceE Confidence 11 122222222221110 000 00000000 Q ss_pred ceeec-c-CCC-----------------------CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHH-HHH Q lcl|NC_020866. 172 TPLLL-D-DNG-----------------------AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTA-RDW 225 (547) Q Consensus 172 ~~~~~-~-~~~-----------------------~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~-~~w 225 (547) +.... . ..+ .+..+..=-|+++... ..+--.+...|..++.. -.++|-. .+. 
T Consensus 210 ~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~-~~~~~~~~pPLl~lA~l-ni~hy~~ssd~ 287 (501) T protein:vir:95 210 VHEIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSE-NNDSNPDNPNFYDLASL-NMAHYRNSADY 287 (501) T ss_pred EEEEEEecCCcccCcceecCCcccccceeeeeccCCCcCCeeeEEEEecC-CCCCCCCccchHHHHHH-HHHHHhhhhHH Confidence 00000 0 000 0001110113332222 22223344445555544 3344433 344 Q ss_pred HHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHh Q lcl|NC_020866. 226 AIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVL 305 (547) Q Consensus 226 ~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~il 305 (547) -.-+..-+.|+++.+- .+++..+.....--.+|+.++..+|+|.++.++|.++ .+ ..++.++....+|.. + T Consensus 288 ~~~l~~~~~P~l~i~G---~~~~~~~~~~~~~i~~G~~~~~~lP~~~~~~~ie~~~--~~--i~~~~l~~l~~~m~~--~ 358 (501) T protein:vir:95 288 EESCYIVGQPTPVLIG---LTEEWVTNVLKGSVNFGSRGGIPLPVGADAKLLQASE--NT--MLKEAMDTKERQMVA--L 358 (501) T ss_pred HHHHHHcccceeeeeC---CcccccccCCCCceeecccccccCCCCCceeEEecCh--hh--HHHHHHHHHHHHHHH--H Confidence 4455666888887652 1222111111112457888999999999999999763 22 345556666666644 3 Q ss_pred cchhccCcccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEe--cCccccCHHHHHHHH Q lcl|NC_020866. 306 GQTTTTDAVAG-GHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKI--ARPKTEDLTKLASSL 382 (547) Q Consensus 306 Gqtlts~~~gg-s~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~--~~~~~ed~~~~a~~~ 382 (547) |..+...+.+. 
|--...........++..-+..+++.++ +++++++.+---.+ .-++|++ +..........++.+ T Consensus 359 Ga~ll~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~-~~l~~~a~w~g~~~-~~~~v~i~~df~~~~~~~~~~~al 436 (501) T protein:vir:95 359 GAKLVEQKEVQRTATEAELEAASEGSTLSSATKNVSAAFE-WALKWAARWVGQAD-SGVKFELNTDFDIARMTPDERRSL 436 (501) T ss_pred HHhhccCCccchhHHHHHHHHHHHhHHHHHHHHHHHHHHH-HHHHHHHHHcCCCC-CceEEEEecccccccCCHHHHHHH Confidence 54443322111 1111222223335567788888999996 48899988863222 2234433 322222224456777 Q ss_pred HHHHhCCCCCCHHHHH---HHhCCCCCCCcchhc--ccccccccccccccccccccCCccccccccccccchhhhhhccc Q lcl|NC_020866. 383 DTLVRLGMEIEENEVR---SRFGFSQPKPGAKLL--RPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLS 457 (547) Q Consensus 383 ~~l~~~G~~v~~~~~~---e~~Glp~p~~~e~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (547) .++.+.|. ++.+.++ ++.|++.+...++.. .....+......+...+........ +.+. T Consensus 437 ~~~~~~G~-is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~------------~~~~--- 500 (501) T protein:vir:95 437 VEEWQKGA-ITFEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGDN------------VGNS--- 500 (501) T ss_pred HHHHhCCC-CcHHHHHHHHHhCCCCChhHHHHHHHHHhhhcCcccccccCCCCCCCccccc------------ccCC--- Confidence 88888887 7766553 345887765433221 1111111111111111111100000 0000 Q ss_pred chhh Q lcl|NC_020866. 458 LNSE 461 (547) Q Consensus 458 ~~~~ 461 (547) + T Consensus 501 ---~ 501 (501) T protein:vir:95 501 ---E 501 (501) T ss_pred ---C Confidence 0 No 194 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=97.99 E-value=8e-06 Score=48.54 Aligned_cols=415 Identities=11% Similarity=0.052 Sum_probs=177.1 Q ss_pred CCCCCeeeCC----CCC---cccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHH---HH----HH Q lcl|NC_020866. 
1 MAQKPVLLDR----WGN---PVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPV---RY----LD 66 (547) Q Consensus 1 ma~~~~~~~~----~g~---~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~---~~----~~ 66 (547) |- +|- .++ |=+...|+.+ .+...+..|-... -.++....+ .-.|... .. .. T Consensus 1 ~~-----~~~~~~~~~~~~~~~~~~~l~~~-------~i~~li~~~~~~~--~~r~~~l~~-YY~g~~~~i~~~~~~~~~ 65 (506) T protein:vir:94 1 MD-----YDLTEHKQANLIYQESLENLTPN-------KIMKFITHHFNYQ--RPRLEMLDD-YYQGYNLKILDKQSRRHE 65 (506) T ss_pred CC-----cchhhhhcceeecccchhcCCHH-------HHHHHHHHHHHHH--HHHHHHHHH-HhcCCCcccccccccccc Confidence 33 221 111 1011112111 1112211111100 001111111 1111100 00 00 Q ss_pred HH-HHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEe Q lcl|NC_020866. 67 LA-EVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWN 144 (547) Q Consensus 67 L~-~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~ 144 (547) .. ....-..+...-.+.+....+.+.+..+.+.++.. .+.+..+++.-+|+.....+. ++..+|.+. +++|. T Consensus 66 ~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~~~d~~~-----~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~-~~v~~ 139 (506) T protein:vir:94 66 DGKADHRATHSFAKYIADFQTSYSVGNPINVKLPDDGS-----NSGFDTFNKANDVDAENYDLFLDMSRYGRAY-EYVYR 139 (506) T ss_pred ccCCcceeecchHHHHHHHhhhhhcccCceeecCcchH-----HHHHHHHHhccCHhHHHHHHHHHHHhcCeEE-EEEEe Confidence 00 00000245666777788888888888887654432 356778887777888777766 688899854 56665 Q ss_pred ecCCeeeEEeeEeeccceeEe--cccCCccee--e--c-----cCCCCc-e-----eeccCceEEEeec----------- Q lcl|NC_020866. 145 TSEGQYFPDKLIWRDPRHFRF--QKSDLATPL--L--L-----DDNGAE-V-----PLPAFRFIFADIP----------- 196 (547) Q Consensus 145 ~~~g~~~~~~l~~~~~~~~~~--~~~~~~~~~--~--~-----~~~~~g-~-----~l~~~k~i~~~~~----------- 196 (547) ..+|... +..++|+.... +......++ . . .+.... . -..+..+..+... 
T Consensus 140 ded~~~~---i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~ 216 (506) T protein:vir:94 140 GEDNEEH---LAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVDTT 216 (506) T ss_pred cCCCeeE---EEEEcccceEEEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCccceecccc Confidence 4455543 44455554322 111100000 0 0 000000 0 0000000000000 Q ss_pred ---------CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHH------------------H Q lcl|NC_020866. 197 ---------AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQ------------------D 249 (547) Q Consensus 197 ---------~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~------------------~ 249 (547) .-.++..|.|.+..+....=-=...+.+++..++.|..|+++.+-....... . T Consensus 217 ~~~g~vPvv~~~n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (506) T protein:vir:94 217 KPITTFPVVEFKNSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKL 296 (506) T ss_pred ccCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhcccccccccccccccc Confidence 0122455778777766554444556777777777766665443311000000 0 Q ss_pred HHHHHHHHHHHhcCeeEEecC---------CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccch Q lcl|NC_020866. 250 KDTLFRAVANIAGDCAAIIPE---------TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAV 320 (547) Q Consensus 250 ~~~l~~al~~~~~~~~~vip~---------~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~ 320 (547) .......+..+.....+.++. +.+++++.... ....++..++.+.+.|...--...++.++.+ +..+ T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~---~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~S 372 (506) T protein:vir:94 297 AKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTY---DVVGSEAYKKRVAGDIHKFSHTPDLTDENFA-SNSS 372 (506) T ss_pred ccchhHHHhhhhhcCeeeecccccccCccccccceeeeecC---CHHHHHHHHHHHHHHHHHHhCcccccccccc-ccch Confidence 111123333444444444443 34566665432 2335788888888888776433334444322 2223 Q ss_pred hHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh----cCC--CcccccEEEecCccccCHHHHHHHHHHHHhCCCC Q lcl|NC_020866. 
321 SQEHRLV---QEDIETSDAMALAAILNRDLIRPWIQL----EYG--PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGME 391 (547) Q Consensus 321 ~~vh~~v---~~~~~~ad~~~i~~~ln~~li~~l~~~----Nfg--~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~ 391 (547) |..-... ....+..-.+.+...+. ++++.++.+ +.+ .+....++.|....+.+..+.++.+.++. |+ T Consensus 373 g~Aik~~~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~- 448 (506) T protein:vir:94 373 GVAMQYKVLGTVELASTKRRMFERGLY-ARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQAG--AT- 448 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc- Confidence 3322111 11223333445555553 355554443 211 12234678899999999999999999884 65 Q ss_pred CCHHHHHHHhCC-CCCCCcchhccccccc-ccccccccccccccCCccccccccccccchhhhhhcccchhhhccccccc Q lcl|NC_020866. 392 IEENEVRSRFGF-SQPKPGAKLLRPLENP-ATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKT 469 (547) Q Consensus 392 v~~~~~~e~~Gl-p~p~~~e~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (547) ++.+.+.+.++. +.|+.+-+-+...... .......+.....+.....+... . T Consensus 449 iS~et~~~~lp~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~--------------------- 502 (506) T protein:vir:94 449 LPQKYLYQQLPGVTNPQDIVDMMKEQSANGDYSFDQNGVISNDGQTNTTATQT-----D--------------------- 502 (506) T ss_pred CChHHHHHhCCCCCCHHHHHHHHHHHHHHHhhcchhhcCCCcccCcccccccc-----c--------------------- Confidence 999999998854 3333211111111110 00000000000000000000000 0 Q ss_pred chhHHH Q lcl|NC_020866. 470 GGAEIA 475 (547) Q Consensus 470 ~~~~~~ 475 (547) ..+. 
T Consensus 503 --~e~~ 506 (506) T protein:vir:94 503 --EEVR 506 (506) T ss_pred --cCCC Confidence 0010 No 195 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=97.95 E-value=9.4e-06 Score=48.14 Aligned_cols=399 Identities=13% Similarity=0.078 Sum_probs=169.4 Q ss_pred CCCCCeeeCCCCCcccc----hhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHh--hcCCCHHHHHHH--HHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQR----AQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRA--ADHGDPVRYLDL--AEVIE 72 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~----~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~--a~~g~~~~~~~L--~~~m~ 72 (547) |. |.+.-.+.|.+ -.+.+.+. .+..+|.-.+++..+..|-+- -=.|+...+... ...-. T Consensus 1 m~----~~~~ik~~~~~~~~~~~~~~~~~---------~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~ 67 (505) T protein:vir:79 1 MA----FWDTLKNLFRKGSAAVGMTKSLG---------QIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQ 67 (505) T ss_pred Cc----hHHHHHHHHHHhhhhhcchhhhh---------hhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCcc Confidence 55 22222222211 00111111 011122222233222222110 001111100000 00000 Q ss_pred h----hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecC Q lcl|NC_020866. 73 E----RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSE 147 (547) Q Consensus 73 ~----~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~ 147 (547) . +=..=+....+-..-|++-+-.|... +.+..+++.++++.-.|...+...+ .|..+|=.++-+.|. 
+ T Consensus 68 ~~~~~slnl~~~i~~~~A~ll~~e~~~i~~~-----d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D--~ 140 (505) T protein:vir:79 68 KHELQSVNVTKLASAKLASLIFNEQCQVTVS-----DETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVD--S 140 (505) T ss_pred ccceeecchHHHHHHHHHhhhcCCCceeecC-----ChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEe--C Confidence 0 00011233344455566665555542 2345678888888777888877766 688899888887775 2 Q ss_pred CeeeEEeeEeeccceeEe---ccc----------------C----------------Ccce---eeccCCC--Cce--e- Q lcl|NC_020866. 148 GQYFPDKLIWRDPRHFRF---QKS----------------D----------------LATP---LLLDDNG--AEV--P- 184 (547) Q Consensus 148 g~~~~~~l~~~~~~~~~~---~~~----------------~----------------~~~~---~~~~~~~--~g~--~- 184 (547) +.. ++.++++..|.. +.+ + ..++ +..+.+. -|. + T Consensus 141 ~~~---~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l 217 (505) T protein:vir:79 141 GKI---KLAWATADQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPL 217 (505) T ss_pred Cce---EEEEEcCCeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccch Confidence 322 122233322110 000 0 0000 0111110 011 1 Q ss_pred --ec------c---------CceEEEee----cCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEE----E Q lcl|NC_020866. 185 --LP------A---------FRFIFADI----PAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRI----G 239 (547) Q Consensus 185 --l~------~---------~k~i~~~~----~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~v----g 239 (547) +| + --|.+++. +...++|+|.|.+..+.-..-.=...+..|+.-++.-...+.| - T Consensus 218 ~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l 297 (505) T protein:vir:79 218 NSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWL 297 (505) T ss_pred hhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHh Confidence 11 1 11333322 2345679999999999866655555666666655543333333 1 Q ss_pred eC-CCCCCH--HHHHHH----HHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH-hcc-hhc Q lcl|NC_020866. 
240 KW-QPGASE--QDKDTL----FRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAV-LGQ-TTT 310 (547) Q Consensus 240 k~-~~~~~~--~~~~~l----~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i-lGq-tlt 310 (547) +. +.+..+ .....+ .+....+..+++ +..|+.....- -.+.|...++.+-++|+..+ +++ +++ T Consensus 298 ~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~~~~-----~~~i~~~~~~i---r~e~~~~~l~~~l~~i~~~~g~s~~~~~ 369 (505) T protein:vir:79 298 KTGSSYGGQASETHPPMFDPDETVYQAMYGDAS-----EVGFHDATSPI---RVADYQATMDFFLREFENQTGLSQGTFT 369 (505) T ss_pred cccCCCCcccccccccCCCccceeeeeccCCCC-----CCceEEecccC---CHHHHHHHHHHHHHHHHHHhCCChhhcC Confidence 11 111110 000000 011111111211 22344443221 11235554555555555433 343 344 Q ss_pred cCccccccchhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----C-------CC--cccccEEEecCccccC Q lcl|NC_020866. 311 TDAVAGGHAVSQEHRL--VQEDIETSDAMALAAILNRDLIRPWIQLE-----Y-------GP--QKRYPRIKIARPKTED 374 (547) Q Consensus 311 s~~~ggs~a~~~vh~~--v~~~~~~ad~~~i~~~ln~~li~~l~~~N-----f-------g~--~~~~p~~~~~~~~~ed 374 (547) .++ .|...+-++..+ -....+..-.+.+...| ++|++.++.+. | +. ...-+.|.|+..-..| T Consensus 370 ~~~-~~~~TAtei~s~~~~l~~t~~~~~~~~~~al-~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d 447 (505) T protein:vir:79 370 TSP-SGIQTATEVVTNNSQTYQTRSSYITQVEKTI-KALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVD 447 (505) T ss_pred CCc-cccchHHHHHHHHhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCC Confidence 333 233323333322 23334555566677777 45777776542 1 11 1123568888888899 Q ss_pred HHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhccccccccccccccccccccc Q lcl|NC_020866. 375 LTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAG 433 (547) Q Consensus 375 ~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~ 433 (547) .++.++...+++..|+--.+.++.+.+|+++.+..+++-.......... |.....+.. 
T Consensus 448 ~~~~~~~~~~~v~~Gi~s~e~~l~~~~~~~eeea~~el~ri~~E~~~~~-p~~~~~gg~ 505 (505) T protein:vir:79 448 QESKRAADLQAVQAQVMPKKQFLMRNYGLDEEEADEWLAQIDAENSTAE-PEFNQFGGD 505 (505) T ss_pred HHHHHHHHHHHHHcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhccccC-CCchhccCC Confidence 9999999999999998444567788889876443333332222211111 111111100 No 196 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=97.94 E-value=1e-05 Score=47.99 Aligned_cols=354 Identities=9% Similarity=-0.013 Sum_probs=153.3 Q ss_pred ccchhh---hhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhc Q lcl|NC_020866. 15 VQRAQL---TEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQ 91 (547) Q Consensus 15 ~~~~~l---~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~ 91 (547) |+...+ .+.+.. +....+.....|-+.- .+ +..+..-...+...++.+ =....-+++.....+.= T Consensus 1 ~~~~~i~~L~~~~~~-~~~r~~~~~~yY~g~~-~~-------~~~~~~~p~~~~~~~~~v---~nw~~~iVds~a~rl~~ 68 (409) T protein:vir:94 1 MTEKGIGYLRFKLSV-HKRRAEMRYDQYAMKY-VD-------RFKGITIPQALSQQYRSI---LGWCAKGVDSLADRLVF 68 (409) T ss_pred CCHHHHHHHHHHHHH-HhHHHHHHHHHhcccC-ch-------hhcChhhhHHHHHHHhhh---cchhHHHHHHhHhhccc Confidence 443322 222221 1111111111111110 00 000000000011111100 01111111111110000 Q ss_pred CCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCC Q lcl|NC_020866. 92 LDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDL 170 (547) Q Consensus 92 ~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~ 170 (547) ..|+ .++ . .+.++++.-+|+....+++ +|+.||.|. ..+|...+|.. .|..++|++.....+.. 
T Consensus 69 ~Gf~----~~d---~----~l~~i~~~N~ld~~~~~~~~~aliyG~sf-~~v~~~~dg~~---~i~~~sp~~~~~i~D~~ 133 (409) T protein:vir:94 69 REFE----NDD---F----TVNEIFEENNPDIFFDSAVLSSLIASCSF-TYISKGENDAV---RLQVIEAVNATGIIDPI 133 (409) T ss_pred Cccc----CCc---h----HHHHHHHhcChhHHHHHHHHHHHHhccee-EEEecCCCCce---EEEEeccceEEEEEecC Confidence 1111 111 1 2456666667777777766 699999964 46676555543 34455554322111100 Q ss_pred ----------------c-c----ee-------eccCCCCcee----eccCceEEEeecCCcCccccchhH-HHHHHHHHH Q lcl|NC_020866. 171 ----------------A-T----PL-------LLDDNGAEVP----LPAFRFIFADIPAKSGILLRSGLA-RVAAWGWMF 217 (547) Q Consensus 171 ----------------~-~----~~-------~~~~~~~g~~----l~~~k~i~~~~~~~~~~p~G~gll-~~~~~~~~~ 217 (547) . . ++ ...+++.... +..--++.+.++.+-+.|+|.|-+ +.+....-- T Consensus 134 ~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da 213 (409) T protein:vir:94 134 TGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSN 213 (409) T ss_pred CCceeeeEEEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHH Confidence 0 0 00 0011111111 111124555666677889998865 444333222 Q ss_pred HHHHHHHHHHHHHHcCCCeEEEe--CCCCCCHHHHHHHHHHHHHHhcCeeEEecCC---ceEEEEecCCCCCchHHHHHH Q lcl|NC_020866. 218 KAFTARDWAIFTQTYGQPLRIGK--WQPGASEQDKDTLFRAVANIAGDCAAIIPET---MSIDFVESKSIGSSVDLYEKR 292 (547) Q Consensus 218 K~~~~~~w~~f~E~~G~P~~vgk--~~~~~~~~~~~~l~~al~~~~~~~~~vip~~---~~ie~~e~~~~~~~~~~f~~l 292 (547) -+..+..-..-.|-|+.|.++.+ .+.+.. .+.+.. ..+....+|++ ..+++.+.. +++...|... 
T Consensus 214 ~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~---~~~~~~-----~~~~i~~~~~d~dg~~~~v~q~~--~~~l~~~~~~ 283 (409) T protein:vir:94 214 AKRTLERADVTAEFYSFPQKYVTGLSDDAEP---METWKA-----TVSSMLQFTKDEDGDKPTLGQFT--QPSMSPFTEQ 283 (409) T ss_pred HHHHHHHHHHHHHHhcChhheeEecCCCCcc---cchhhh-----hHHHhhcCCCCCCCCCceEEecC--CCChhHHHHH Confidence 33344555566688999976654 221111 122211 11234555644 346665543 3444567666 Q ss_pred HHHHHHHHHHHHhcchhccCc-cccccchhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-c-CCC--cc-ccc Q lcl|NC_020866. 293 INHLDQQISKAVLGQTTTTDA-VAGGHAVSQEH---RLVQEDIETSDAMALAAILNRDLIRPWIQL-E-YGP--QK-RYP 363 (547) Q Consensus 293 i~~~d~~Isk~ilGqtlts~~-~ggs~a~~~vh---~~v~~~~~~ad~~~i~~~ln~~li~~l~~~-N-fg~--~~-~~p 363 (547) ++.+-+++|-. .+-+...=+ .+.+.++++.- ..-.....+.-.+.+...+. ++++..+.+ + ++. .. .-. T Consensus 284 l~~~~~~~a~~-t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~-~~~rla~~i~~~~~~~~~~~~~~ 361 (409) T protein:vir:94 284 LRTAAAGFAGE-TGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLL-NVAYLAACLRDDAPYLREQFRKT 361 (409) T ss_pred HHHHHHHHhhh-cCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCCccccccccc Confidence 66666666532 221111101 11111222222 22222334445566777774 466665544 2 211 11 224 Q ss_pred EEEecCc---cccCHHHHHHHHHHHHhCCCCC-CHHHHHHHhCCCCCC Q lcl|NC_020866. 364 RIKIARP---KTEDLTKLASSLDTLVRLGMEI-EENEVRSRFGFSQPK 407 (547) Q Consensus 364 ~~~~~~~---~~ed~~~~a~~~~~l~~~G~~v-~~~~~~e~~Glp~p~ 407 (547) +++|... 
+...+.+.|+++.||++.|..+ +.+.+++.+|+..++ T Consensus 362 ~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 362 KPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred eEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 6677633 3333566789999999999644 668999999998765 No 197 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=97.93 E-value=1e-05 Score=47.93 Aligned_cols=354 Identities=10% Similarity=-0.010 Sum_probs=156.3 Q ss_pred hhhhhhhhhhhhhHHHHhhh-hccCCC---HHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCce Q lcl|NC_020866. 20 LTEEVAAPTVTGVRSPLSGY-PGDGLN---PLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDIT 95 (547) Q Consensus 20 l~~~~a~~~~~~~~~~~~~~-~~~~l~---p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~ 95 (547) |.--. +...-..+.+.+. +...+. |+.++..++ .-. ....-+++.....+.=-.|+ T Consensus 1 l~~~~--~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~-~v~-----------------nw~~~~Vds~a~rl~~~Gf~ 60 (410) T protein:vir:95 1 MNLYQ--SRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQ-AVL-----------------GWAAKGVDSLADRLIFRAFA 60 (410) T ss_pred CCcch--hhHHHHHHHhcCCCCccccchhccHHHHhHHH-hhc-----------------chhHHHHHHhHhhhcccccc Confidence 11111 1121111222111 111111 111111111 000 11111111111111111122 Q ss_pred EecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeE--ecccCCc- Q lcl|NC_020866. 96 VEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFR--FQKSDLA- 171 (547) Q Consensus 96 v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~--~~~~~~~- 171 (547) . ++. .+.++++.-+|+....+++ +|+.||.|.. ++|.-.++.- .|..++|++.. ++...+. 
T Consensus 61 ~----~d~-------~l~~i~~~N~ld~~~~~~~~~al~~G~sf~-~v~~~~d~~~---~i~~~sP~~~~~i~Dp~~~~~ 125 (410) T protein:vir:95 61 N----DDF-------NVTEIFDRNNPDIFFDSAILSALIGSCSFV-YISKGEDDEV---RLQVIESSNATGVIDPITGLL 125 (410) T ss_pred C----CCc-------hHHHHHhhcChHHHHHHHHHHHHHhCceeE-EEecCCCCce---EEEEEcccceEEEEeCCCCce Confidence 1 111 1456666667887777766 7999999654 5676444432 34455554322 1111110 Q ss_pred -------------c-----e-------eeccCCCCc-e--eeccCceEEEeecCCcCccccchhH-HHHHHHHHHHHHHH Q lcl|NC_020866. 172 -------------T-----P-------LLLDDNGAE-V--PLPAFRFIFADIPAKSGILLRSGLA-RVAAWGWMFKAFTA 222 (547) Q Consensus 172 -------------~-----~-------~~~~~~~~g-~--~l~~~k~i~~~~~~~~~~p~G~gll-~~~~~~~~~K~~~~ 222 (547) . + ....+++.. + +++.--++.+.++.+.+.|+|.+-+ +.+....--=+..+ T Consensus 126 ~~al~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~ 205 (410) T protein:vir:95 126 VEGYAVLARDDYNRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTL 205 (410) T ss_pred EEEEEEEEecCCCeEEEEEEEeCCcEEEEeeCCccccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHH Confidence 0 0 001111110 1 1111234556667777888988844 55544333333345 Q ss_pred HHHHHHHHHcCCCeEEEe--CCCCCCHHHHHHHHHHHHHHhcCeeEEecCCc---eEEEEecCCCCCchHHHHHHHHHHH Q lcl|NC_020866. 223 RDWAIFTQTYGQPLRIGK--WQPGASEQDKDTLFRAVANIAGDCAAIIPETM---SIDFVESKSIGSSVDLYEKRINHLD 297 (547) Q Consensus 223 ~~w~~f~E~~G~P~~vgk--~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~---~ie~~e~~~~~~~~~~f~~li~~~d 297 (547) ..-..-.|-|+.|.++.+ .+.+...+ .+ .+..+....+|++. .++|.+.. 
+++...|...++.+- T Consensus 206 ~~~~~~~e~~a~pqr~i~G~d~d~~~~~---~~-----~~~~~~i~~~~~~~~~~~~~v~q~~--~~~l~~~~~~l~~l~ 275 (410) T protein:vir:95 206 ERADITAEFYSWPQKYILGLDPDAEPME---KW-----KATVSSLLTISSSDKGVKPSVGQFT--TASMSPFTEQLRTAA 275 (410) T ss_pred HHHHHHHHHhcchhheeeccCCCCCcCc---hh-----hhhhhhheeccCCCCCCcceEEecC--CCChHHHHHHHHHHH Confidence 555566688999966543 22221111 11 11223456677653 36665543 344456766666565 Q ss_pred HHHHHHHhcchh---ccCcc--ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCC---cccccEEEe Q lcl|NC_020866. 298 QQISKAVLGQTT---TTDAV--AGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLE--YGP---QKRYPRIKI 367 (547) Q Consensus 298 ~~Isk~ilGqtl---ts~~~--ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~N--fg~---~~~~p~~~~ 367 (547) ++||-. .+-+. ...+. .+.-|+...+ +-....++.-.+.+...+. ++.+..+.+- ++. ...-.+++| T Consensus 276 ~~~a~~-s~lP~~~lg~~~~NpsSa~Al~a~~-~~L~~ka~~k~~~fg~~l~-~~~rla~~i~~~~~~~~~~~~~~~v~W 352 (410) T protein:vir:95 276 AGFAGE-MGLTLDDLGFVSDNPSSVEAIKASH-ENLRLAGRKAQRSLGAGLL-NVAYVAACLRDEFRYTRSQFVRTAVKW 352 (410) T ss_pred HHHhhh-cCCCHHHhccccCchhHHHHHHHHH-HHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCCcccccceeeEEe Confidence 555543 22111 11111 1112222222 2233344455666778884 4666655552 321 111235556 Q ss_pred c---CccccCHHHHHHHHHHHHhCC-CCCCHHHHHHHhCCCCCCCcchhccccccccc Q lcl|NC_020866. 368 A---RPKTEDLTKLASSLDTLVRLG-MEIEENEVRSRFGFSQPKPGAKLLRPLENPAT 421 (547) Q Consensus 368 ~---~~~~ed~~~~a~~~~~l~~~G-~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~ 421 (547) . ..+...+.+.++++.||.++| ..++.+-+++.+|+.+++.-............ 
T Consensus 353 ~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 353 EPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLTGIAGDMSAKPVVSEGGSNGE 410 (410) T ss_pred eecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhcCCChHHHHHHHHHHHHhCCC Confidence 5 444456788899999999985 23778889999999754321111111111000 No 198 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=97.91 E-value=1.1e-05 Score=47.67 Aligned_cols=381 Identities=11% Similarity=0.016 Sum_probs=179.9 Q ss_pred ccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHH------HHHH-----HHHHh-----hhHHH Q lcl|NC_020866. 15 VQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRY------LDLA-----EVIEE-----RDPHY 78 (547) Q Consensus 15 ~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~------~~L~-----~~m~~-----~D~~v 78 (547) |....+.+-+..-. ...... ..++... +..-.|..... .... ..... ..... T Consensus 1 ~~~~~~~~~i~~~~-~~~~~~----------~~~~~~~-~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~ 68 (470) T protein:vir:10 1 MELDALKKLIQNTS-TSRNDL----------INNYKQA-VNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFY 68 (470) T ss_pred CchHHHHHHHHHHH-HHHHHH----------HHHHHHH-HHHhccccchhccccchhcccccccccccccCCcccccchH Confidence 44444444332110 000000 1111111 11111110000 0000 00000 13334 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEe Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIW 157 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~ 157 (547) .-.+.+...-+.+.+..+...++. ..+.+.++++. +|.+.+..+. ++.-||.+..- +|...+|.+. +.. 
T Consensus 69 k~Iv~~~~~yl~G~p~~~~~~d~~-----~~~~l~~~~~~-~~~~~~~~l~~~~~~~G~a~~~-~y~d~~~~~~---~~~ 138 (470) T protein:vir:10 69 QLLVDQEAGYVASVFPDIDVGKDA-----DNKKIIDVLGD-DRALTLNGLLVDSSNAGRAWLH-YWIDEDGNFR---YGI 138 (470) T ss_pred HHHHHhhhhheeccceeeecCchH-----HHHHHHHHHhh-hHHHHHHHHHHHHhhcCeeEEE-EEecCCCceE---EEE Confidence 455666777888888887754432 23456666654 5666666665 67789988754 4543445433 333 Q ss_pred eccceeEe--cccCCcce------eeccC-CCC----ceee--------------------------------------- Q lcl|NC_020866. 158 RDPRHFRF--QKSDLATP------LLLDD-NGA----EVPL--------------------------------------- 185 (547) Q Consensus 158 ~~~~~~~~--~~~~~~~~------~~~~~-~~~----g~~l--------------------------------------- 185 (547) ++|..... +......+ ....+ .+. -..+ T Consensus 139 ~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (470) T protein:vir:10 139 IQPDQITPIYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQS 218 (470) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccc Confidence 44432211 11100000 00000 000 0000 Q ss_pred ----ccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_020866. 186 ----PAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIA 261 (547) Q Consensus 186 ----~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~ 261 (547) ..+..|-+.+ -.+|.+|.|.+..+....--=...+.+++..++.|+-|+++.+--...+.+ +...++. T Consensus 219 ~~~~~~~g~vPvv~--~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~------~~~~~~~ 290 (470) T protein:vir:10 219 NTLKHNFGRVPFIE--FSKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLH------QFMNDLR 290 (470) T ss_pred cccccCCCeeeEEE--eecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccc------hhhhhhh Confidence 0011111111 123567899999877766666778899999999999998887742222111 1223444 Q ss_pred cCeeEEecC-----CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHHHH----HHH Q lcl|NC_020866. 
262 GDCAAIIPE-----TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLVQE----DIE 332 (547) Q Consensus 262 ~~~~~vip~-----~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v~~----~~~ 332 (547) ....+.++. +..++++..... ...++..++.+.+.|.+.--+..++.++ . |.++|.. ..... ..+ T Consensus 291 ~~~~i~~~~~~~~~~~~~~~lt~~~~---~~~~~~~~~~L~~~I~~~s~~p~~~~~~-~-gn~Sg~A-lk~~~~~l~~k~ 364 (470) T protein:vir:10 291 KYKSIKINNTGNGDNSGVDKLQIDIP---VEARDDALKITRKNIFLFGQGIDPANFE-S-SNASGVA-IKMLYSHLELKA 364 (470) T ss_pred hcCeEeccCCCCCcCceeEEEeecCC---hHHHHHHHHHHHHHHHHHhCCCCCCccc-c-ccchHHH-HHHHHHHHHHHH Confidence 555666654 456888875532 3468888999999987764444444333 2 2333322 22222 223 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-hcCC-CcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCc Q lcl|NC_020866. 333 TSDAMALAAILNRDLIRPWIQ-LEYG-PQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRFGF-SQPKPG 409 (547) Q Consensus 333 ~ad~~~i~~~ln~~li~~l~~-~Nfg-~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Gl-p~p~~~ 409 (547) ..-.+.+...|. ++++.++. +|.. .+.....+.|....+.+..+.++.+.++. |+ ++.+.+.+.++. +.|+ T Consensus 365 ~~~~~~~~~~l~-~~~~~i~~~l~~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~~--g~-iS~et~l~~~p~v~D~~-- 438 (470) T protein:vir:10 365 AKTQTYFEHAIN-ELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTVA--NY-SSKEAVAKANPIVDDWQ-- 438 (470) T ss_pred HHHHHHHHHHHH-HHHHHHHHHhcccCcccceeeEEeccCCCCCHHHHHHHHHHHh--cc-CcHHHHHHhCCCCCCHH-- Confidence 333455556663 35555554 3322 12344678899999999999999998874 65 899999998874 3332 Q ss_pred chhccccccc--ccc--cccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchh Q lcl|NC_020866. 410 AKLLRPLENP--ATQ--AAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGA 472 (547) Q Consensus 410 e~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (547) +++-...... ..+ ...+...++.. .. 
++ T Consensus 439 ~E~eri~~E~~e~~~~~~~~~~~~~~~~--dd---------------------------------e~ 470 (470) T protein:vir:10 439 QELKDLAKDKEENDPYSNQADELNGKGV--ND---------------------------------EQ 470 (470) T ss_pred HHHHHHHHHHHHHHHhhccccccCCCCC--CC---------------------------------CC Confidence 2221111110 000 00000000000 00 00 No 199 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=97.89 E-value=1.2e-05 Score=47.49 Aligned_cols=419 Identities=14% Similarity=0.071 Sum_probs=193.2 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhH-HHHhhhhcc-CCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVR-SPLSGYPGD-GLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~-~~~~~~~~~-~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |+++.+=+++. +|+......-|-+-+...-.| +.+..+-.- .-+-+.|..+||.-+.++ +..++ |+-- T Consensus 1 ~~~~~~~~~~~-~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~---~r~~~------~ps~ 70 (527) T protein:vir:10 1 MGQDKRQYGST-QQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGD---QRPIY------VPNG 70 (527) T ss_pred CCccccccCCC-cCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccc---cceee------ehhh Confidence 99887766653 344333333221111111100 011111110 112234455555444332 22222 2222 Q ss_pred HHHHHHHHHHHhcCCceEecCC----CCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeec---CCee Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPAS----DDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTS---EGQY 150 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~----~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~---~g~~ 150 (547) ..+.+..-+|...+ .+..++++.+.++.+++.-+|+....+.- ++..-|=.|+-+.|+.+ +++. 
T Consensus 71 --------~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~ 142 (527) T protein:vir:10 71 --------EKLIEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRL 142 (527) T ss_pred --------HHhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCc Confidence 23344443333221 13344556677788887766776665555 79999999999999742 2333 Q ss_pred eEEeeEeeccce---------------------------------------eEecccCCc------ceeeccC------C Q lcl|NC_020866. 151 FPDKLIWRDPRH---------------------------------------FRFQKSDLA------TPLLLDD------N 179 (547) Q Consensus 151 ~~~~l~~~~~~~---------------------------------------~~~~~~~~~------~~~~~~~------~ 179 (547) ... .++|+. |++..++.+ +...... . T Consensus 143 ~v~---~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w 219 (527) T protein:vir:10 143 SLH---EVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKW 219 (527) T ss_pred eEe---ecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecccc Confidence 221 222111 110000000 0000000 0 Q ss_pred ----------------CCcee---e-ccCce---EEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe Q lcl|NC_020866. 180 ----------------GAEVP---L-PAFRF---IFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL 236 (547) Q Consensus 180 ----------------~~g~~---l-~~~k~---i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~ 236 (547) ..++. + .+..| +.+...+..+..||.|-|..+.-..---+....+....++-.|.|+ T Consensus 220 ~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi 299 (527) T protein:vir:10 220 DDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGF 299 (527) T ss_pred ccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCce Confidence 00110 1 12233 3345556788899999999888877777778888888889999999 Q ss_pred EEEeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhcc-Cccc Q lcl|NC_020866. 
237 RIGKWQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTT-DAVA 315 (547) Q Consensus 237 ~vgk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts-~~~g 315 (547) ++.+--+.. +.+.+... -+++..+..=+|++.++.++.... ....|+..++++.++|+..- +-.... +.-. T Consensus 300 ~~~tg~~~v--d~~G~~~~--~~VgPG~iweL~e~ak~~~v~~~~---~la~~~~h~~~L~~~l~~vA-~~PavA~G~vD 371 (527) T protein:vir:10 300 YATDSAPPR--DSRGNMVP--WTISPLGMVEHGQNNKIYRVNGVA---SLEPSQTHMNKAEEAMQQTK-GIPDIAVGVVD 371 (527) T ss_pred eeecccccc--cccCCcCc--cccCCceeEecCCCcceeeccchh---hhHHHHHHHHHHHHHHHHhh-cCCeeeecccc Confidence 887532111 11111111 123333444488899999886432 33458999999998877552 211111 1011 Q ss_pred cccchhHHHHHHH----HHHHHHHHHHHHHHH----HHHHHHHHHHhc-CCCcc----cccEEEecCccccCHHHHHHHH Q lcl|NC_020866. 316 GGHAVSQEHRLVQ----EDIETSDAMALAAIL----NRDLIRPWIQLE-YGPQK----RYPRIKIARPKTEDLTKLASSL 382 (547) Q Consensus 316 gs~a~~~vh~~v~----~~~~~ad~~~i~~~l----n~~li~~l~~~N-fg~~~----~~p~~~~~~~~~ed~~~~a~~~ 382 (547) .+.+.+.+-.++. ..........+..++ +.-++.||-.+- ++... ..-++.|...-+.|.++..+.+ T Consensus 372 ~s~~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v 451 (527) T protein:vir:10 372 AAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFAQL 451 (527) T ss_pred CCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHH Confidence 1222222222221 112222222222222 111223332211 22111 1238899999999999999999 Q ss_pred HHHHhCCCCCCHHHHHHHh----CCCCCCCc-chhcccccccccc----ccc-ccccccccCCccccccccccccchhhh Q lcl|NC_020866. 383 DTLVRLGMEIEENEVRSRF----GFSQPKPG-AKLLRPLENPATQ----AAP-PGMDPGAGDLKTAERKLKPLSGQADLS 452 (547) Q Consensus 383 ~~l~~~G~~v~~~~~~e~~----Glp~p~~~-e~~~~~~~~~~~~----~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (547) .+|++.|+ ++.+...+++ |+..|+.+ +++....+..+.. ..+ .....+.+..+..+.. +.... 
T Consensus 452 ~tL~~aGi-iS~etAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d------~~~~~ 524 (527) T protein:vir:10 452 LELWEAGL-IPAKKLTEELSKIMGFELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDD------QALNG 524 (527) T ss_pred HHHHHcCc-hhHHHHHHHHHhccCCCchHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcc------cccCC Confidence 99999998 7877777776 76555432 1222221111110 000 0000111111100000 00000 Q ss_pred hhcccchhhhcccccccchhHH Q lcl|NC_020866. 453 RGDLSLNSEMPSTGQKTGGAEI 474 (547) Q Consensus 453 ~~~~~~~~~~~~~~~~~~~~~~ 474 (547) .. + T Consensus 525 ~~-------------------~ 527 (527) T protein:vir:10 525 QP-------------------L 527 (527) T ss_pred CC-------------------C Confidence 00 0 No 200 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=97.89 E-value=1.3e-05 Score=47.46 Aligned_cols=419 Identities=13% Similarity=0.067 Sum_probs=193.1 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhH-HHHhhhhcc-CCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVR-SPLSGYPGD-GLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHY 78 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~-~~~~~~~~~-~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v 78 (547) |+++.+=+++. +|+......-|-+-+...-.| +.+..+-.- .-+-+.|..+||.-+.++ +..++ |+-- T Consensus 1 ~~~~~~~~~~~-~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~---~r~~~------~ps~ 70 (527) T protein:vir:10 1 MGQDKRQYGST-QQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGD---QRPIY------VPNG 70 (527) T ss_pred CCccccccCCC-cCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccc---cceee------ehhh Confidence 99887766653 344333333221111111100 011111110 112234455555444332 22222 2222 Q ss_pred HHHHHHHHHHHhcCCceEecCC----CCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeec---CCee Q lcl|NC_020866. 
79 LGVIGTRKRSVSQLDITVEPAS----DDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTS---EGQY 150 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~----~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~---~g~~ 150 (547) ..+.+..-+|...+ .+..++++.+.++.+++.-+|+....+.- ++..-|=.|+-+.|+.+ +++. T Consensus 71 --------~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~ 142 (527) T protein:vir:10 71 --------EKLIEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRL 142 (527) T ss_pred --------HHhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCc Confidence 23344443333221 13344556677788887766776665555 79999999999999742 2333 Q ss_pred eEEeeEeeccce---------------------------------------eEecccCCc------ceeeccC------C Q lcl|NC_020866. 151 FPDKLIWRDPRH---------------------------------------FRFQKSDLA------TPLLLDD------N 179 (547) Q Consensus 151 ~~~~l~~~~~~~---------------------------------------~~~~~~~~~------~~~~~~~------~ 179 (547) ... .++|+. |++..++.+ +...... . T Consensus 143 ~v~---~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w 219 (527) T protein:vir:10 143 SLH---EVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKW 219 (527) T ss_pred eEe---ecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecccc Confidence 221 222111 110000000 0000000 0 Q ss_pred ----------------CCcee---e-ccCce---EEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe Q lcl|NC_020866. 180 ----------------GAEVP---L-PAFRF---IFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL 236 (547) Q Consensus 180 ----------------~~g~~---l-~~~k~---i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~ 236 (547) ..++. 
+ .+..| +.+...+..+..||.|-|..+.-..---+....+....++-.|.|+ T Consensus 220 ~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi 299 (527) T protein:vir:10 220 DDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGF 299 (527) T ss_pred ccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCce Confidence 00110 1 12233 3345556788899999999888877777778888888889999999 Q ss_pred EEEeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhcc-Cccc Q lcl|NC_020866. 237 RIGKWQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTT-DAVA 315 (547) Q Consensus 237 ~vgk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts-~~~g 315 (547) ++.+--+.. +.+.+... -+++..+..=+|++.++.++.... ....|+..++++.++|+..- +-.... +.-. T Consensus 300 ~~~tg~~~v--d~~G~~~~--~~VgPG~iweL~e~ak~~~v~~~~---~la~~~~h~~~L~~~l~~vA-~~PavA~G~vD 371 (527) T protein:vir:10 300 YATDSAPPR--DSRGNMVP--WTISPLGMVEHGQNNKIYRVNGVA---SLEPSQTHMTKAEEAMQQTK-GIPDIAVGVVD 371 (527) T ss_pred eeecccccc--cccCCcCc--cccCCceeEecCCCcceeeccchh---hhHHHHHHHHHHHHHHHHhh-cCCeeeecccc Confidence 887532111 11111111 123333444488899999886432 33458999999998887552 211111 1011 Q ss_pred cccchhHHHHHHH----HHHHHHHHHHHHHHH----HHHHHHHHHHhc-CCCcc----cccEEEecCccccCHHHHHHHH Q lcl|NC_020866. 316 GGHAVSQEHRLVQ----EDIETSDAMALAAIL----NRDLIRPWIQLE-YGPQK----RYPRIKIARPKTEDLTKLASSL 382 (547) Q Consensus 316 gs~a~~~vh~~v~----~~~~~ad~~~i~~~l----n~~li~~l~~~N-fg~~~----~~p~~~~~~~~~ed~~~~a~~~ 382 (547) .+.+.+.+-.++. ..........+..++ +.-++.||-.+- ++... 
..-++.|...-+.|.++..+.+ T Consensus 372 ~s~~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v 451 (527) T protein:vir:10 372 AAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFNQL 451 (527) T ss_pred CCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHH Confidence 1222222222221 112222222222222 111223332211 22111 1238899999999999999999 Q ss_pred HHHHhCCCCCCHHHHHHHh----CCCCCCCc-chhcccccccccc----ccc-ccccccccCCccccccccccccchhhh Q lcl|NC_020866. 383 DTLVRLGMEIEENEVRSRF----GFSQPKPG-AKLLRPLENPATQ----AAP-PGMDPGAGDLKTAERKLKPLSGQADLS 452 (547) Q Consensus 383 ~~l~~~G~~v~~~~~~e~~----Glp~p~~~-e~~~~~~~~~~~~----~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (547) .+|++.|+ ++.+...+++ |+..|+.+ +++....+..+.. ..+ .....+.+..+..+.. +.... T Consensus 452 ~tL~~aGi-~S~~tAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d------~~~~~ 524 (527) T protein:vir:10 452 LQLWEAGL-IPAKKLTEELSKIMGFELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDD------QALNG 524 (527) T ss_pred HHHHHcCc-hhHHHHHHHHHhccCCCChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcc------cccCC Confidence 99999998 7877777776 76555432 1222221111110 000 0000111111100000 00000 Q ss_pred hhcccchhhhcccccccchhHH Q lcl|NC_020866. 453 RGDLSLNSEMPSTGQKTGGAEI 474 (547) Q Consensus 453 ~~~~~~~~~~~~~~~~~~~~~~ 474 (547) .. + T Consensus 525 ~~-------------------~ 527 (527) T protein:vir:10 525 QP-------------------L 527 (527) T ss_pred CC-------------------C Confidence 00 0 No 201 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=97.78 E-value=2e-05 Score=46.35 Aligned_cols=404 Identities=12% Similarity=0.103 Sum_probs=168.0 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhh--cCCCHHHHHH-HHHHHHhhh-- Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAA--DHGDPVRYLD-LAEVIEERD-- 75 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a--~~g~~~~~~~-L~~~m~~~D-- 75 (547) |. |.+.-.+-|.+..-..-+. .+ +.+ +..++.-.+++..+.++-+.. -.|+...+.. ......... T Consensus 1 m~----~~~~~k~~~~~~~~~~~~~-~~---~~~-~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~ 71 (508) T protein:vir:15 1 MG----LIQRIKDLFWKGAAATGVT-GS---LSK-ITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLK 71 (508) T ss_pred CC----hHHHHHHHHHHHHHHhccc-cc---hHH-hhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccce Confidence 54 2222222211100000000 00 000 111221122333322221110 0111100000 000000000 Q ss_pred ---HHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeee Q lcl|NC_020866. 76 ---PHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYF 151 (547) Q Consensus 76 ---~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~ 151 (547) .--+.+..+--.-|++-+-.|...+++.. -+++.++++.-.|...+...+ ++..+|-.++-+.|.. +.. T Consensus 72 ~sln~~~~i~~~~A~lv~~e~~~i~v~~~~~~----~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~--~~~- 144 (508) T protein:vir:15 72 NTINMAKTAARRIASVVFNEKAEIHVKDNNEA----DKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDG--NHI- 144 (508) T ss_pred eecchHHHHHHHHHhhhhCCCceEEeCCchHH----HHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeC--Cee- Confidence 00122333444555555556665443322 246778887767777776655 7888998888777752 221 Q ss_pred EEeeEeeccceeE---ec---------------------------------ccCCcce---eeccCC----CCceee--- Q lcl|NC_020866. 152 PDKLIWRDPRHFR---FQ---------------------------------KSDLATP---LLLDDN----GAEVPL--- 185 (547) Q Consensus 152 ~~~l~~~~~~~~~---~~---------------------------------~~~~~~~---~~~~~~----~~g~~l--- 185 (547) +|.++++..|. 
++ .++..++ +..+.+ |..+++ T Consensus 145 --~i~~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~ 222 (508) T protein:vir:15 145 --KIAWVRADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTL 222 (508) T ss_pred --EEEEEcCCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhc Confidence 12222222111 00 0001111 111111 111111 Q ss_pred ------ccC---------ceEEEee----cCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe-----C Q lcl|NC_020866. 186 ------PAF---------RFIFADI----PAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGK-----W 241 (547) Q Consensus 186 ------~~~---------k~i~~~~----~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk-----~ 241 (547) .+. -|++++. +...++|+|.|.+..+....-.=...+..|+.-+ +.|-+-+++. + T Consensus 223 ~e~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~-~~~~~~i~v~~~~l~~ 301 (508) T protein:vir:15 223 PVYKELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEI-RLGQKHIAVQPGMLRF 301 (508) T ss_pred ccccCCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHH-HhcccceeechHHhcC Confidence 111 1333322 2244689999999999876666666666666655 4555544441 2 Q ss_pred CCCCCHHHHHHHHHHHHHHhcCeeEEec----CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH-hcc-hhccCccc Q lcl|NC_020866. 242 QPGASEQDKDTLFRAVANIAGDCAAIIP----ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAV-LGQ-TTTTDAVA 315 (547) Q Consensus 242 ~~~~~~~~~~~l~~al~~~~~~~~~vip----~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i-lGq-tlts~~~g 315 (547) +.+.. . ..+ ....-+..++ .+..|+.+...-. .+.|...++.+-+.|...+ ++. 
+++.++ + T Consensus 302 d~~~~-~----~~~----~~~~~~~~~~~~~~~~~~i~~~~~~ir---~e~~~~~~~~~l~~~~~~~gls~~~f~~~~-~ 368 (508) T protein:vir:15 302 DDEHK-P----TFD----TEQNVYVGVLSDDNNGLGVKDMTTPIR---TVQYKDAIDHFIKEFEVQIGLSTGTFSYSN-D 368 (508) T ss_pred CCCCc-c----ccC----CCCeeEEeccCCCCCCCceeEeecccC---hHHHHHHHHHHHHHHHHHhCCCchhccccc-C Confidence 21110 0 000 0111122222 1222444332211 1125555555555555443 332 333332 2 Q ss_pred cccchhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cC----C---------CcccccEEEecCccccCHHH Q lcl|NC_020866. 316 GGHAVSQEHR--LVQEDIETSDAMALAAILNRDLIRPWIQL---EY----G---------PQKRYPRIKIARPKTEDLTK 377 (547) Q Consensus 316 gs~a~~~vh~--~v~~~~~~ad~~~i~~~ln~~li~~l~~~---Nf----g---------~~~~~p~~~~~~~~~ed~~~ 377 (547) |...+-++.. .-....+..-.+.+...|. +|++.++.+ +. | ....-+.|.|+..-.+|.++ T Consensus 369 ~~~TAtei~s~~~~~~~t~~~~~~~~~~al~-~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~ 447 (508) T protein:vir:15 369 GVKTATEVVSNNSMTYQTRSSYLTMVEKAID-ELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDK 447 (508) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHH Confidence 2222223322 2233344566677777774 476665543 21 1 11122578888888999999 Q ss_pred HHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccc-cccccccccccCCcc Q lcl|NC_020866. 378 LASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQ-AAPPGMDPGAGDLKT 437 (547) Q Consensus 378 ~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 437 (547) .++...+++.+|+--.+.++.+.||+++.+..+++-......+.. +..+...+..+.... 
T Consensus 448 ~~~~~~~~v~aGi~s~e~~i~~~~g~~deea~~el~ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 448 QLEEDAKVLAIGALSKQTFLQRNYGMTDEQAAEELAKIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred HHHHHHHHHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhccccCccccccccCCCCCCC Confidence 999999999999855566778888987654333333222211111 111111111111100 No 202 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=97.73 E-value=2.4e-05 Score=45.88 Aligned_cols=355 Identities=9% Similarity=-0.018 Sum_probs=155.3 Q ss_pred ccchhhhhhhhh--hhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcC Q lcl|NC_020866. 15 VQRAQLTEEVAA--PTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQL 92 (547) Q Consensus 15 ~~~~~l~~~~a~--~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~ 92 (547) |+...+.+=... ......++....|-+.. -++..+..-...+...++.+ =....-+++.....+.=. T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~--------~~~~~~~~~p~~~~~~~~~v---~nw~~~iVds~a~rl~~~ 69 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRAEMRYEQYAMKH--------VDRFKGITIPQALSQQYRSI---LGWCAKGVDSLADRLVFR 69 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHHhccC--------chhhcchhhhHHHHHHHhhh---cChhHHHHHHhHhhcccc Confidence 554433222211 01111111111111110 00000000000011111100 011111111111100001 Q ss_pred CceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEeccc--- Q lcl|NC_020866. 93 DITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKS--- 168 (547) Q Consensus 93 ~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~--- 168 (547) .|+ .++ . .+.++++.-+|+....+++ +|+.||.|.. .+|.-.+|.. 
.|..++|++.....+ T Consensus 70 Gf~----~~d---~----~l~~i~~~N~ld~~~~~~~~~al~yG~sf~-~v~~~~dg~~---~i~~~sP~~~~~i~D~~~ 134 (409) T protein:vir:16 70 EFE----NDD---F----TVNEIFEENNPDIFFDSTVLSALIASCSFT-YISKGENDAV---RLQVIEATNATGIIDPIT 134 (409) T ss_pred ccc----Ccc---h----HHHHHHHhcChhHHHHHHHHHHHHhCceeE-EEecCCCCce---EEEEEcccceEEEeeccc Confidence 111 111 1 2456666667887777766 7999999755 6786555542 344444433221110 Q ss_pred ---------------CCc-c--eee-------ccCCCCce----eeccCceEEEeecCCcCccccchhH-HHHHHHHHHH Q lcl|NC_020866. 169 ---------------DLA-T--PLL-------LDDNGAEV----PLPAFRFIFADIPAKSGILLRSGLA-RVAAWGWMFK 218 (547) Q Consensus 169 ---------------~~~-~--~~~-------~~~~~~g~----~l~~~k~i~~~~~~~~~~p~G~gll-~~~~~~~~~K 218 (547) +.. . ++. ..+++.+. ++..--++.+.++.+-+.|+|.|-+ +.+....--= T Consensus 135 ~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~ 214 (409) T protein:vir:16 135 GLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNA 214 (409) T ss_pred ccceeeeEEEEecCCCceEEEEEEecCcEEEEEecCccccceecCCCCcceEEecccccccccCCccccchhHHHHHHHH Confidence 000 0 000 11111111 1111125556666677788988854 4443333323 Q ss_pred HHHHHHHHHHHHHcCCCeEEEe--CCCCCCHHHHHHHHHHHHHHhcCeeEEecC---CceEEEEecCCCCCchHHHHHHH Q lcl|NC_020866. 219 AFTARDWAIFTQTYGQPLRIGK--WQPGASEQDKDTLFRAVANIAGDCAAIIPE---TMSIDFVESKSIGSSVDLYEKRI 293 (547) Q Consensus 219 ~~~~~~w~~f~E~~G~P~~vgk--~~~~~~~~~~~~l~~al~~~~~~~~~vip~---~~~ie~~e~~~~~~~~~~f~~li 293 (547) +..+..-..-.|-|+.|.++.+ .+.+.. .+.+ .+..+....+|+ |..+++.+.. 
+++...|...+ T Consensus 215 ~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~---~~~~-----~~~~~~i~~~~~d~~g~~~~v~q~~--~~~l~~~~~~l 284 (409) T protein:vir:16 215 KRTLERADVTAEFYSFPQKYVTGLSDDAEP---METW-----KATVSSMLQFTKDEDGDKPTLGQFT--QPSMSPFTEQL 284 (409) T ss_pred HHHHHHHHHHHHHhcChhheeEecCCCCCc---cchh-----hhhhhHhhccCCCCCCCCceEEecC--CCChhHHHHHH Confidence 3344455566688999977654 221111 1111 111233455664 3446665543 34445676666 Q ss_pred HHHHHHHHHHHhcchhccCcccccc-chhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-----cccE Q lcl|NC_020866. 294 NHLDQQISKAVLGQTTTTDAVAGGH-AVSQ---EHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQK-----RYPR 364 (547) Q Consensus 294 ~~~d~~Isk~ilGqtlts~~~ggs~-a~~~---vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~-----~~p~ 364 (547) +.+-+++|-. .+-+...=+..+.+ ++++ ..+.-.....+.-.+.+...+. ++.+..+.+-.+.+. .--+ T Consensus 285 ~~~~~~~a~~-s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~-~~~rla~~~~~~~~~~~~~~~~~~ 362 (409) T protein:vir:16 285 RTAAAGFAGE-TGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLL-NVAYLAACLRDDVPYLREQFSKTK 362 (409) T ss_pred HHHHHHHhhh-cCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCccchhhccce Confidence 6666666543 22121110101111 2222 1222233334444566777774 466665555222111 1135 Q ss_pred EEecCcc---ccCHHHHHHHHHHHHhCCCCC-CHHHHHHHhCCCCCC Q lcl|NC_020866. 365 IKIARPK---TEDLTKLASSLDTLVRLGMEI-EENEVRSRFGFSQPK 407 (547) Q Consensus 365 ~~~~~~~---~ed~~~~a~~~~~l~~~G~~v-~~~~~~e~~Glp~p~ 407 (547) ++|.... 
...+.+.|+++.||++.|..+ +.+.+++.+|+..++ T Consensus 363 v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 363 PKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred EEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 6676444 334678899999999999655 567889999998765 No 203 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=97.64 E-value=3.3e-05 Score=45.13 Aligned_cols=421 Identities=13% Similarity=0.052 Sum_probs=174.2 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhh---hhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHH----HHHHHHHHh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVA---APTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRY----LDLAEVIEE 73 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a---~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~----~~L~~~m~~ 73 (547) |+.. .++.-....+.- .+.-.-+++.+.+ ...+|.++.-.++.+ .+=|+.... T Consensus 1 m~~~---------~~~~v~~~h~~y~a~~~~W~~ird~~~G-----------~~~~r~~g~~YLPk~~~E~~~~Y~~rl~ 60 (513) T protein:vir:97 1 MADK---------DPKSPATTSGAYDQMLPRWHVIETLLGG-----------TEAMREAGETYLPRHQEETDKGYQERLA 60 (513) T ss_pred CCCC---------CCCCCCcCCHHHHHHHHHHHHHHHHhcC-----------hHHHHhhcccCCCCCCCCCHHHHHHHHh Confidence 7742 222222222211 1111122222211 133444432221111 011222111 Q ss_pred ---hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHH--hCCCHHHHHHHHHH-hhhhcCeeeeeEEee-- Q lcl|NC_020866. 74 ---RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWL--KREELQQEVFHILD-CISKGYSFTEIVWNT-- 145 (547) Q Consensus 74 ---~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l--~~~~~~~~i~~~l~-a~~~G~sv~Ei~w~~-- 145 (547) --++++..++.....|.+.+-++.. +.+.. ..+.+.+.. .+.+++.+++.++. ++.||.+.+=+-+-. 
T Consensus 61 rA~~~n~~~~tl~~l~G~vf~k~p~~~~--~~p~~--~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~ 136 (513) T protein:vir:97 61 SAVLLNMVEQTLDTLSGKPFSEPIKLNE--DVPKA--IEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPA 136 (513) T ss_pred cccCCChHHHHHHHHhhhhhhcCcccCc--CchHH--HHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCC Confidence 2567777777777777777655532 22221 222221112 33478999998885 888997754443311 Q ss_pred -------------------------------------cCCeeeEEeeEeecc------------ceeEecccCCcceeec Q lcl|NC_020866. 146 -------------------------------------SEGQYFPDKLIWRDP------------RHFRFQKSDLATPLLL 176 (547) Q Consensus 146 -------------------------------------~~g~~~~~~l~~~~~------------~~~~~~~~~~~~~~~~ 176 (547) .+|...+..+..+.. .+++.-..+..++... T Consensus 137 ~~~~~~~~T~Ade~~~~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~ 216 (513) T protein:vir:97 137 PREDGQPRTLADDRREGLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEP 216 (513) T ss_pred CccchhHHhHHHHHhhccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEe Confidence 122222222211110 0000001111111101 Q ss_pred cCCC------------CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCC Q lcl|NC_020866. 177 DDNG------------AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPG 244 (547) Q Consensus 177 ~~~~------------~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~ 244 (547) ...+ .+..+..=-|+++... +.+-..+.+.|..++..-+--.....+.-..+..-+.|+++..- T Consensus 217 ~~~~~~~~~e~~~~~~g~~~l~~IP~v~~~~~-~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G--- 292 (513) T protein:vir:97 217 VKKSNAQKEEWALADEWATGLNYVPLVTFYAD-RQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSG--- 292 (513) T ss_pred ecCCCccccceEEecCCCCcCCceeEEEEecC-CCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeec--- Confidence 0000 0011111113333222 33444566666666655543334455555556666888877752 Q ss_pred CCHHHHHHHHHHHHHHhcCeeEEecC-CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccc-cccchhH Q lcl|NC_020866. 
245 ASEQDKDTLFRAVANIAGDCAAIIPE-TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVA-GGHAVSQ 322 (547) Q Consensus 245 ~~~~~~~~l~~al~~~~~~~~~vip~-~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~g-gs~a~~~ 322 (547) .+++..+. ..+|+++++.+|. +.++.++|.++ .+.......++.+..+|. .+|..+.....+ -|--... T Consensus 293 ~~~~~~~~-----i~iG~~~~~~lpe~~~~~~yie~~g--~~i~~~~~~l~~le~qm~--~~Ga~ll~~~~~~~Ta~a~~ 363 (513) T protein:vir:97 293 ASGEDSDP-----VVVGPNKVLYNPDPAGRFYYVEHTG--QAIAAGRTDLKDLEEQMA--GYGAEFLKRKTGGQTATARA 363 (513) T ss_pred CCcCCCCc-----eEeeccccccCCCCCCcceeeccCc--hhHHHHHHHHHHHHHHHH--HHHHHhhccCCccccHHHHH Confidence 12221111 2478888899996 88999999874 343446666667777773 344444332211 1111222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecC-ccccCH-HHHHHHHHHHHhCCCCCCHHHHHHH Q lcl|NC_020866. 323 EHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIAR-PKTEDL-TKLASSLDTLVRLGMEIEENEVRSR 400 (547) Q Consensus 323 vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~-~~~ed~-~~~a~~~~~l~~~G~~v~~~~~~e~ 400 (547) ........++..-+..+++.++ ++++++..+-- .+...++|.+.. ....++ ...++.+.++.+.|. ++.+.+++. T Consensus 364 ~~~~~~~S~L~~~a~~le~al~-~~l~~~a~wlg-~~~~~~~v~in~dF~~~~~~~~~~~al~~a~~~G~-is~~t~~~~ 440 (513) T protein:vir:97 364 LDSAEATSDLSAMTGLFEDALA-QALDITADWLR-LGPNGGTVELVKDYDLEEMDAPGLQALQVAREKRD-ISRKTYLNG 440 (513) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhC-CCCCccEEEeccccCcccCCHHHHHHHHHHHhCCC-CCHHHHHHH Confidence 3333445567778888999996 58899988863 222224454422 112222 344667777888887 776444332 Q ss_pred ---hCCCCCCCc-----chhccccccccc----ccccccccccccCCccccccccccccchhhhhhcccchhhhcccccc Q lcl|NC_020866. 401 ---FGFSQPKPG-----AKLLRPLENPAT----QAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQK 468 (547) Q Consensus 401 ---~Glp~p~~~-----e~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (547) .|+=.|+.+ |++......... ...+....++.+.............+. +....+.. 
T Consensus 441 L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~ 508 (513) T protein:vir:97 441 LRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEGG------------EGGEGGGN 508 (513) T ss_pred HHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCCC------------CccccCCC Confidence 355222212 222211100000 000000011101000000000000000 00000000 Q ss_pred cchhH Q lcl|NC_020866. 469 TGGAE 473 (547) Q Consensus 469 ~~~~~ 473 (547) ..... T Consensus 509 ~~~~~ 513 (513) T protein:vir:97 509 PGGES 513 (513) T ss_pred CCCCC Confidence 00000 No 204 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=97.64 E-value=3.4e-05 Score=45.11 Aligned_cols=411 Identities=9% Similarity=0.015 Sum_probs=162.6 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHH---HHHHhhcCCCHHHHHHHHHHHHhhhHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLA---QILRAADHGDPVRYLDLAEVIEERDPH 77 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~---~ilr~a~~g~~~~~~~L~~~m~~~D~~ 77 (547) |-++ -+-+|. .|+.+. ...+....+.+..+-.+.-...++- +-++..+..-...+..+ +. .=.. T Consensus 1 ~~~~------~~~~~~--gl~~~~-~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~-~~---v~nw 67 (474) T protein:vir:81 1 MIQQ------QTVRIP--SLSNDE-NALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNL-GL---VLGW 67 (474) T ss_pred CcCC------CcCcCC--CCChhH-HHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHH-Hh---hcCh Confidence 2211 011110 111111 0011111111111111100000000 00000000000111110 00 0112 Q ss_pred HHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeE Q lcl|NC_020866. 78 YLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLI 156 (547) Q Consensus 78 v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~ 156 (547) .+-+++.....+.-..|.+ + +++..+. 
.+.+++..-+++.....++ +|+.||.|. .+||.-+++.-.+ .|. T Consensus 68 ~~~~Vd~~a~rl~~~Gf~~-~-d~~~~~~----~l~~iw~~N~ld~~~~~~~~~al~~G~sf-~~V~~~~d~~~~~-~i~ 139 (474) T protein:vir:81 68 TGKAVDALARRCNLEGFVW-P-DGDLDSL----GGTEVVDDNHLLSEIDSAIVAAMQHGPAF-LINTVGEDDEPEA-LIH 139 (474) T ss_pred HHHHHHHHHhhhcccceEC-C-CCCccch----HHHHHHHhcChhHHHHHHHHHHHhhCcee-EEEecCCCCCcee-EEE Confidence 2222222222222234443 2 2222222 2456666667887777755 799999995 5667544443222 244 Q ss_pred eeccceeE--ec----------------ccCCcc---e-------eeccCCCCc--------eeeccCceEEEeecCCcC Q lcl|NC_020866. 157 WRDPRHFR--FQ----------------KSDLAT---P-------LLLDDNGAE--------VPLPAFRFIFADIPAKSG 200 (547) Q Consensus 157 ~~~~~~~~--~~----------------~~~~~~---~-------~~~~~~~~g--------~~l~~~k~i~~~~~~~~~ 200 (547) .++|++.. ++ .++... + ....+.+.+ -++. .-++.+.++.+.+ T Consensus 140 ~~sp~~~~~~~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~g-vPvV~~~n~~~~~ 218 (474) T protein:vir:81 140 VKDASEATGEWNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYG-VPAQVLPYKPAPK 218 (474) T ss_pred EeccceEEEEEeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCC-cceEEeccccccc Confidence 44444322 11 111100 0 000001110 0111 1145567777777 Q ss_pred ccccchhH-HHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe--CCCCCCHHH------HHHHHHHHHHHhcCeeEEecCC Q lcl|NC_020866. 201 ILLRSGLA-RVAAWGWMFKAFTARDWAIFTQTYGQPLRIGK--WQPGASEQD------KDTLFRAVANIAGDCAAIIPET 271 (547) Q Consensus 201 ~p~G~gll-~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk--~~~~~~~~~------~~~l~~al~~~~~~~~~vip~~ 271 (547) .|+|.+-+ +.+....--=+..+..-..-.|-|+.|.++.. .+...++++ .+.....+-.+..+.-+-+|.. 
T Consensus 219 ~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~ 298 (474) T protein:vir:81 219 RPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQL 298 (474) T ss_pred CcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHhcCCCccccccccc Confidence 88887744 44433322222334444556689999977654 222111111 1111222222333333333443 Q ss_pred ceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchh-----cc-CccccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 272 MSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTT-----TT-DAVAGGHAVSQEHRLVQEDIETSDAMALAAILNR 345 (547) Q Consensus 272 ~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtl-----ts-~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~ 345 (547) ...+|-+.. ..+...|...++.+-++||-. .+-+. ++ ++-.+.-|+... ++-....++.-.+.+...+. T Consensus 299 ~~~~~~q~~--~a~l~~~~~~l~~~~~~~a~~-t~iP~~~lG~~~~~np~SaeAi~a~-~~~l~~kae~k~~~fg~~l~- 373 (474) T protein:vir:81 299 ARADVKQFP--AASPDAHWSDINGLAKLFARE-ASLPDTAVAISGLSNPTSAESYDAS-QYELIAEAEGAVDDFTPALR- 373 (474) T ss_pred ccccccccC--CCChhHHHHHHHHHHHHHHhh-hCCCHHHhcccccccccHHHHHHHH-HHHHHHHHHHHHHHHHHHHH- Confidence 445555433 344445655555554444432 22111 11 111111222222 22233345555667888884 Q ss_pred HHHHHHHHhc--CCCcc-----cccEEEecCccccCHHHHHHHHHHHHhCCCC-CCHHHHHHHhCCCCCCCcchhccccc Q lcl|NC_020866. 346 DLIRPWIQLE--YGPQK-----RYPRIKIARPKTEDLTKLASSLDTLVRLGME-IEENEVRSRFGFSQPKPGAKLLRPLE 417 (547) Q Consensus 346 ~li~~l~~~N--fg~~~-----~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~-v~~~~~~e~~Glp~p~~~e~~~~~~~ 417 (547) ++++..+.+. +.... .--+++|.......+.+.+++..||+++|.. ++.+-+++.+|+.+.+- +.+..... 
T Consensus 374 ~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i-~~~~~~~~ 452 (474) T protein:vir:81 374 KAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQA-RRAMADKR 452 (474) T ss_pred HHHHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHH-HHHHHHHH Confidence 4777666664 22111 1236678888888889999999999999854 55667788899964321 11111100 Q ss_pred ccccccccccccccccCCcccc Q lcl|NC_020866. 418 NPATQAAPPGMDPGAGDLKTAE 439 (547) Q Consensus 418 ~~~~~~~~~~~~~~~~~~~~~~ 439 (547) .......-.+.........+++ T Consensus 453 ~~~~~~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 453 RVQGRGTLQALIDRSNNGATAQ 474 (474) T ss_pred HHhHHHHHHHHHhcCCCCCCCC Confidence 0000000000000000000000 No 205 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=97.61 E-value=3.7e-05 Score=44.86 Aligned_cols=433 Identities=16% Similarity=0.147 Sum_probs=209.2 Q ss_pred CCCCCeeeCCCCCcccchh-----hhhhhhhhhhhh-----------hHHHHhhhhccCCCHHHH----HHHHHhhcCCC Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQ-----LTEEVAAPTVTG-----------VRSPLSGYPGDGLNPLRL----AQILRAADHGD 60 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~-----l~~~~a~~~~~~-----------~~~~~~~~~~~~l~p~~~----~~ilr~a~~g~ 60 (547) |.++- |..-.+.+ +.-++-.-.+.. +..+..++.+..++-... ..-|+.-.+ + T Consensus 1 ~~~~~------~~~~~~~t~~k~~~~~e~~~~~~n~~~~~y~ty~~~~~~f~~gfv~~~~~ng~i~~v~~~~l~~~f~-n 73 (525) T protein:vir:10 1 MTRTK------GSKNKSTTIEKQSLQIEQLQEHINELERQYNTYDDVVDAFIDGFVMDLCNNGKIKTVNLDTLQLWFN-N 73 (525) T ss_pred CCCCc------CCcccccchhhhhhhHHHHHHHHhhhhhhcchhhhHHHHHHHHHHHHhhcCCceeeeeHHHHHhhhc-C Confidence 66432 22222222 211111111111 112333443332211110 111222221 1 Q ss_pred HHHHH----HHHHHHHhhhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhC-CCHHHHHHHHHHhh--- Q lcl|NC_020866. 
61 PVRYL----DLAEVIEERDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKR-EELQQEVFHILDCI--- 132 (547) Q Consensus 61 ~~~~~----~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~-~~~~~~i~~~l~a~--- 132 (547) +..+. +|..-+.-.|+.|....+ ...++-.+++.|..-+-+....+..-.+...|+. +...++.+++|.-+ T Consensus 74 pd~~~~~i~~l~~y~yi~~~~v~ql~~-li~~lp~l~y~i~~~~~~k~~~~~~s~~n~~l~k~i~hk~ltrdll~q~a~~ 152 (525) T protein:vir:10 74 PDKYINNIVNLLTYYYIIDGNVFQLYD-LIFSLPPLDYQIKVLKRDKDYKEDLSTINLYLEKKIQHKQLTRDLLVQLAHS 152 (525) T ss_pred hHHHHHHHHHHHHHhhhhcchHHHHHH-HHHhcCCcceeehhhhhccchhhHHHHHHHHHHHhHHHHHHHHHHHHHhhcc Confidence 22221 122212225777776544 4456667788886544333344455556666655 55566666666322 Q ss_pred ------hhc-------CeeeeeEE----eecCCeeeEEeeEeeccceeEecc-cCCcc------eeecc--------CC- Q lcl|NC_020866. 133 ------SKG-------YSFTEIVW----NTSEGQYFPDKLIWRDPRHFRFQK-SDLAT------PLLLD--------DN- 179 (547) Q Consensus 133 ------~~G-------~sv~Ei~w----~~~~g~~~~~~l~~~~~~~~~~~~-~~~~~------~~~~~--------~~- 179 (547) |.| |-+-|+-| .+.+|.|+. .++-.||.--. ..+.. .+..+ .+ T Consensus 153 gtlig~wlg~~~~py~~vf~~~kyvfp~~r~~g~~v~----vid~~~f~~~~~~~r~~~~~~lsp~i~~~~y~~~~~~~~ 228 (525) T protein:vir:10 153 GTLIGTWLGSKREPYFNVFNNLKYVFPYGRAKGKMVA----VIDLQWFDEMSELERKLTFENLSPLITENKYKKWKEYNG 228 (525) T ss_pred CceeEeeecCCCCcchhhhhhhhhhccccccCCceEE----EEehHHhhhhhHHHHHHHHHhhchhhhhhhhhHHhhccc Confidence 222 11222222 223444432 23334443111 11111 01100 01 Q ss_pred -----CCceeeccCceEEEeecCCcCccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCC------CCH Q lcl|NC_020866. 180 -----GAEVPLPAFRFIFADIPAKSGILL-RSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPG------ASE 247 (547) Q Consensus 180 -----~~g~~l~~~k~i~~~~~~~~~~p~-G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~------~~~ 247 (547) -.-+++|-.+.++-+...-.-||- |.++.-+.+..+..|+-.....-..+.+.-.|+++-|+... 
..+ T Consensus 229 ~~~~~~r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l~dI~hk~klrd~EqsIA~kii~a~avLk~gg~~gn~mk~p~ 308 (525) T protein:vir:10 229 ENEDALRYIMLPISKTLVARIHTLSRNQRLGIPYGTQTLFDIQHKQKLRDLEQSIADKIIKAMAVLKFRGKDDNDSKVKE 308 (525) T ss_pred ccchhheeeecccceeEEeeecccccCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHhhhhheeeeeccccCccccCch Confidence 123578888887776665555554 88888888888888887666666677888889888776431 234 Q ss_pred HHHHHHHHHHHHHhcC--------eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH-hcchhccCcccccc Q lcl|NC_020866. 248 QDKDTLFRAVANIAGD--------CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAV-LGQTTTTDAVAGGH 318 (547) Q Consensus 248 ~~~~~l~~al~~~~~~--------~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i-lGqtlts~~~ggs~ 318 (547) ..+++++..|+.+-+- +++.+|.-.+|+|-+...+..+.+. .-.+..+.-|..+. +.+.|+++ +||+| T Consensus 309 ~~kqkil~gVk~aleK~~kdK~Gi~vi~~Pdfa~~efp~ik~~~~glDg--~K~d~I~~DI~~A~GlS~sL~nG-dggNy 385 (525) T protein:vir:10 309 SAKRKVLAGVKRALEKGVKDKNGIACIAMPDFATFEFPEIKNGDKTLDP--KKYDSIDNDITNATGISQVLTNG-TKGNY 385 (525) T ss_pred HHHHHHHHHHHHHHhcccccccCeEEEeccceeecccccccCcccCCCc--hhhhhhhhhhhhhhccceeeecC-CCCce Confidence 4566677666655332 2345699999998776653222222 23456777887763 45666555 68899 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-cccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHH Q lcl|NC_020866. 319 AVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQK-RYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEV 397 (547) Q Consensus 319 a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~-~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~ 397 (547) |.+.+..++|-..+---.+.|.++-| +|+.|+ |+... 
---.|.++...+.++++..+.+-+|...|+ +..++ T Consensus 386 Ataslnld~fykkigVm~e~Iee~y~-kL~d~V----l~~~k~~nyifnydkd~pi~~kkk~d~LIkL~d~g~--s~k~v 458 (525) T protein:vir:10 386 ASAKLNLDVFYKKIGVMLEIIEEIYN-QLIDII----LGEEKGCNYIFQYNKDTPIEREKKLDTLIKLEAQGY--SAKYV 458 (525) T ss_pred eeeeeeHHHHHHHHHHHHHHHHHHHH-HHHhhh----cCcccCcceEEecCCCchhhhhhhhhhhhhhhccch--hhhhh Confidence 99999999988877777777776664 466655 44322 112556677778888998888888888885 33444 Q ss_pred HHHhCCCCCCCcchhcccc--------c-ccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccc Q lcl|NC_020866. 398 RSRFGFSQPKPGAKLLRPL--------E-NPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQK 468 (547) Q Consensus 398 ~e~~Glp~p~~~e~~~~~~--------~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (547) -+..|+.....-|+.+... . +|.....-.+ ..+.. -..+...+.+.++++-- +. +++ T Consensus 459 ldl~gis~e~y~E~s~yEtE~lkl~EKi~pp~~~~v~SG-k~~n~-iG~P~~dd~~~~dati~-----s~-----~~~-- 524 (525) T protein:vir:10 459 LDILGISSEEYFEESIYEIEKLKLREKIMPPLNTNVLSG-KDGND-IGSPKLDDSDSSDATIE-----SK-----ERG-- 524 (525) T ss_pred hhhhccCcchHHHHHHHHHHHHHHhhhccccccceeeec-ccccc-ccCCccCCCcchhhhhh-----hh-----hcC-- Confidence 5555665433333221110 0 0000000000 00000 00000001111000000 00 000 Q ss_pred c Q lcl|NC_020866. 469 T 469 (547) Q Consensus 469 ~ 469 (547) + T Consensus 525 ~ 525 (525) T protein:vir:10 525 V 525 (525) T ss_pred C Confidence 0 No 206 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=97.59 E-value=4e-05 Score=44.67 Aligned_cols=425 Identities=13% Similarity=0.051 Sum_probs=177.1 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhh------hhhHHHHhhhhccCCCHHHHH------HHHHhhcCCCHHHH---- Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTV------TGVRSPLSGYPGDGLNPLRLA------QILRAADHGDPVRY---- 64 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~------~~~~~~~~~~~~~~l~p~~~~------~ilr~a~~g~~~~~---- 64 (547) ||++-. 
.--|-++++.+.-|++-|+. ..+.+.-..|.....-...++ ..+|.++.-.++.. T Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~ 77 (535) T protein:vir:80 1 MARKRT---TIRRDVQSKVLIPPQAPPTSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDS 77 (535) T ss_pred CCcchh---hhhhhhhhhcccCCCCcCCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCccc Confidence 886432 22344455555556554321 111111011100000011111 22333322111110 Q ss_pred -----HHHHHHHHh---hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHh-----CCCHHHHHHHHHH- Q lcl|NC_020866. 65 -----LDLAEVIEE---RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLK-----REELQQEVFHILD- 130 (547) Q Consensus 65 -----~~L~~~m~~---~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~-----~~~~~~~i~~~l~- 130 (547) ..-|+.... .-++++..++.....|.+.+..++- + +.++.+++ +.+++.+++.++. T Consensus 78 ~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~~----p------~~l~~l~~d~D~~G~~L~~f~~~~~~~ 147 (535) T protein:vir:80 78 RDEEQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPIRQL----P------PALEAIVEDIDGEGVSLDQQAKKALGY 147 (535) T ss_pred CCcCCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCcceec----c------HHHHHHHhccCCCCCCHHHHHHHHHHH Confidence 112333322 2566777777777777777665531 1 12333333 3478999988884 Q ss_pred hhhhcCeeeeeEEeec----------------------------------CCeeeEEeeEeeccc-------------ee Q lcl|NC_020866. 131 CISKGYSFTEIVWNTS----------------------------------EGQYFPDKLIWRDPR-------------HF 163 (547) Q Consensus 131 a~~~G~sv~Ei~w~~~----------------------------------~g~~~~~~l~~~~~~-------------~~ 163 (547) ++.||.+.+=+-|-.. +|...+..+..+... .+ T Consensus 148 ~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~ 227 (535) T protein:vir:80 148 TMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQW 227 (535) T ss_pred HHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEEE Confidence 6688887554443211 222112212111110 00 Q ss_pred ---EecccCCcceee-ccCC------CCceeec------cCc---eEEEeecCCcCccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 
164 ---RFQKSDLATPLL-LDDN------GAEVPLP------AFR---FIFADIPAKSGILLRSGLARVAAWGWMFKAFTARD 224 (547) Q Consensus 164 ---~~~~~~~~~~~~-~~~~------~~g~~l~------~~k---~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~ 224 (547) ..+.++.-+... +... ..+...+ ... |+++ +....+-..+..-|..++..-+--.....+ T Consensus 228 RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~-~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd 306 (535) T protein:vir:80 228 RVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFI-GPLDNNADIDHPPLLDLCEVNIGHYRNSAD 306 (535) T ss_pred EEEEecCCceEEEEEEEeecCCccccccceeecccCCCcccCeeEEEEe-ecCCCCCCCCccchHHHHHHHHHHhhchhH Confidence 000001000000 0000 0011111 112 3322 222334445555666666664432233334 Q ss_pred HHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 225 WAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAV 304 (547) Q Consensus 225 w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i 304 (547) .-.-+..-|.|+++..-......+ +.....--.+|+.+++.+|++.+..+++.+..+- ....++-...+|.. T Consensus 307 ~~~il~~~~~P~l~i~G~~~~~~~--~~~~~~~i~iG~~~~~~lP~~~~~~~~e~~~~~~----a~~~l~~~e~qM~~-- 378 (535) T protein:vir:80 307 YEEMAFVAGQPTAFFTGLTKDWVE--DVFKDFKVHLGSRAIIPLPQGATAGILQITPNSV----PFEAMTHKESQMIA-- 378 (535) T ss_pred HHHHHHHhcCceeeeecCchhhhh--cCCCCcceEecCcccccCCCCCCcceeeeccchh----HHHHHHHHHHHHHH-- Confidence 444555667777766521111101 0001111346888899999999999998765332 22334444455444 Q ss_pred hcchhccCccccccchhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CcccccEEEec--Ccc-ccCHHHH Q lcl|NC_020866. 305 LGQTTTTDAVAGGHAVSQ--EHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYG-PQKRYPRIKIA--RPK-TEDLTKL 378 (547) Q Consensus 305 lGqtlts~~~ggs~a~~~--vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg-~~~~~p~~~~~--~~~-~ed~~~~ 378 (547) +|..+.... ++.....+ ........++..-+..+++.++ +++++++.+--. .+..-+.|++. ... ..+. .. 
T Consensus 379 lGa~ll~~~-~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al~-~aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~ld~-~~ 455 (535) T protein:vir:80 379 MGANLLVKS-GGNRTFGEAQQEEASEQSILSACTKNVSMAFR-KALRWANQFQTGIVNDETVEYNLNTDFPAARLTP-NE 455 (535) T ss_pred HHHHhhccC-cccccHHHHHHHHHHHhHHHHHHHHHHHHHHH-HHHHHHHHHcCCccCCCceEEEeccccccccCCH-HH Confidence 343332211 11121111 2222234567788888999996 488988888521 12223444432 222 2233 34 Q ss_pred HHHHHHHHhCCCCCCHHHHHH---HhCCCCCCC--cchhc--ccccccccccccccccccccCCcccccccc-ccccchh Q lcl|NC_020866. 379 ASSLDTLVRLGMEIEENEVRS---RFGFSQPKP--GAKLL--RPLENPATQAAPPGMDPGAGDLKTAERKLK-PLSGQAD 450 (547) Q Consensus 379 a~~~~~l~~~G~~v~~~~~~e---~~Glp~p~~--~e~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 450 (547) ++.+.++.+.|. |+.+.+++ +.|+..|.. .|+.. .......+..++.....+.+..+.+..... ..++.+. T Consensus 456 ~~all~~~~~G~-Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~~~~~~~~~~ 534 (535) T protein:vir:80 456 RAELILEWQQGA-ITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNNGNGGGNQAG 534 (535) T ss_pred HHHHHHHHhcCC-CCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCcccCCccccccCC Confidence 566667788886 77665544 347755432 22221 111111111111111222222222221111 1100100 Q ss_pred h Q lcl|NC_020866. 451 L 451 (547) Q Consensus 451 ~ 451 (547) . T Consensus 535 ~ 535 (535) T protein:vir:80 535 N 535 (535) T ss_pred C Confidence 0 No 207 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=97.49 E-value=5.7e-05 Score=43.86 Aligned_cols=419 Identities=11% Similarity=0.039 Sum_probs=168.0 Q ss_pred CeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhc--CCCHHHHH--HHHHHHHhhhH---- Q lcl|NC_020866. 
5 PVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAAD--HGDPVRYL--DLAEVIEERDP---- 76 (547) Q Consensus 5 ~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~--~g~~~~~~--~L~~~m~~~D~---- 76 (547) =.|.+. .+.+=+..........-..+..|+.-.+++.++.+|-+... .|+..... ....... .+. T Consensus 1 m~~~~~------~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~-~~~~~sl 73 (522) T protein:vir:47 1 MSLFQK------VKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIK-SRPMNHL 73 (522) T ss_pred CchHHH------HHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchh-cccceec Confidence 000000 00000000000000000111223333345555555433211 11111000 0000000 000 Q ss_pred HH-HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeec-------- Q lcl|NC_020866. 77 HY-LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTS-------- 146 (547) Q Consensus 77 ~v-~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~-------- 146 (547) -+ .....+--.-|++-+-.|..+ +.+..+++.++++.-.|...+...+ .|...|=.++-+.|..+ T Consensus 74 nl~~~i~~~~A~lv~~e~~~i~v~-----d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~~~i~~v~ 148 (522) T protein:vir:47 74 PIARTASKKIASLVYNEQATITTK-----NEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYIDGDKVRVAFIQ 148 (522) T ss_pred chHHHHHHHHhhhhcCCcceeecC-----ChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcCCceEEEEEc Confidence 11 122333344444544455432 2345678888888777888777776 57778877777777521 Q ss_pred CCeeeEEee----------Eee-----ccc-----eeEecc--------------cCCcce---eeccCC----CCce-- Q lcl|NC_020866. 147 EGQYFPDKL----------IWR-----DPR-----HFRFQK--------------SDLATP---LLLDDN----GAEV-- 183 (547) Q Consensus 147 ~g~~~~~~l----------~~~-----~~~-----~~~~~~--------------~~~~~~---~~~~~~----~~g~-- 183 (547) .+.+.|... ..+ ... ++.+.. .+...+ +..+.. 
|..+ T Consensus 149 ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l 228 (522) T protein:vir:47 149 APVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNL 228 (522) T ss_pred CCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccccc Confidence 111122100 000 000 000000 000010 011100 1111 Q ss_pred -------eeccC---------ceEEEee----cCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEE----E Q lcl|NC_020866. 184 -------PLPAF---------RFIFADI----PAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRI----G 239 (547) Q Consensus 184 -------~l~~~---------k~i~~~~----~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~v----g 239 (547) .+++. -|++++. +...++|+|.|.+..+....-.-...+..|..=++.-...+.| - T Consensus 229 ~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l 308 (522) T protein:vir:47 229 SELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLT 308 (522) T ss_pred cccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHh Confidence 11111 1333322 2245789999999999866655555555555544433333222 1 Q ss_pred eCC-CCCCHH-HH----HHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHH-Hhcc-hhcc Q lcl|NC_020866. 240 KWQ-PGASEQ-DK----DTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKA-VLGQ-TTTT 311 (547) Q Consensus 240 k~~-~~~~~~-~~----~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~-ilGq-tlts 311 (547) +.. ...+.+ .. +.-.+....+..+. +.+..|+.....-.. ..|...++.+-+.|+.. .++. |++. 
T Consensus 309 ~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~----~~~~~i~~~~~~ir~---e~~~~~~~~~l~~i~~~~gls~~tf~~ 381 (522) T protein:vir:47 309 QRQYQRPDGTIDFRPRFDVEQNVYMQIGGSS----MDAGGITDLTSPIRA---NDYILAISEGLKLFEMQIGVSSGMFTF 381 (522) T ss_pred ccCCCCCCcccccccccCcccceEeecCCCC----CCCCcceeeccccCh---HHHHHHHHHHHHHHHHHhCCCccccCc Confidence 211 111100 00 00001111111110 122224433322111 13555555555555443 3553 4444 Q ss_pred CccccccchhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------CCC-cccccEEEecCccccCHHHHHHH Q lcl|NC_020866. 312 DAVAGGHAVSQEHRL--VQEDIETSDAMALAAILNRDLIRPWIQLE-------YGP-QKRYPRIKIARPKTEDLTKLASS 381 (547) Q Consensus 312 ~~~ggs~a~~~vh~~--v~~~~~~ad~~~i~~~ln~~li~~l~~~N-------fg~-~~~~p~~~~~~~~~ed~~~~a~~ 381 (547) +++ |...+.++..+ -.-.-+..-.+.+..+| ++|+..++.+- ..+ ....+.+.|+..-.+|.++.++. T Consensus 382 ~~~-~~kTAtEi~s~~~~~~~t~~~~~~~~~~al-~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~ 459 (522) T protein:vir:47 382 DGQ-GMKTATEIVSENSDTYQMRSSIVALVEQSI-KELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDY 459 (522) T ss_pred ccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHH Confidence 433 22223344322 22233556667777777 45777777442 111 22336788998889999999999 Q ss_pred HHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCcccccccccc Q lcl|NC_020866. 382 LDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPL 445 (547) Q Consensus 382 ~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (547) +.+++..|+--.+.++.+.||+++.+..+++-.......+ ..++.....+...+.....+... 
T Consensus 460 ~~~~v~aG~~s~e~~i~~~~g~~eeea~~el~ri~~E~~~-~~~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 460 WAKMVAAGFSTKKRAIGKTLNISGVEAEKELNAINSELLP-MNDAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred HHHHHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhhcc-CCCCCCCCCCCCCcccccCCCCC Confidence 9999999985556678888998765433333322221111 11111111111111000000000 No 208 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=97.43 E-value=6.8e-05 Score=43.42 Aligned_cols=438 Identities=11% Similarity=0.051 Sum_probs=182.4 Q ss_pred CCCCCee---eCCCCC-cccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhH Q lcl|NC_020866. 1 MAQKPVL---LDRWGN-PVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma~~~~~---~~~~g~-~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~ 76 (547) |++.-.+ .+.+|. +|.. .+..+.++....+ .++. ...+ +...+..-|+.|...++ T Consensus 20 ~~~~~~~~~p~~~dG~s~i~~-~~~~~~~~~~~~~--~~~g-------------g~~~-----n~~eLI~~YR~ma~~~p 78 (533) T protein:vir:58 20 LSPMYGMGAPHGAGGSSMIPI-NMYHPFATAGYAS--RFYG-------------GIEF-----NRFFLYDMYDRMDYTDP 78 (533) T ss_pred hchhhcccCccCCCCCccccC-CCCcchhhhhhhh--hhhc-------------cccc-----cHHHHHHHHHHhhccCc Confidence 2221111 123442 3332 2222222211111 1111 0111 33556777888865789 Q ss_pred HHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhC-CCHHHHHHHHHH-hhhhcCeeeeeEEeecCCeeeEEe Q lcl|NC_020866. 77 HYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKR-EELQQEVFHILD-CISKGYSFTEIVWNTSEGQYFPDK 154 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~-~~~~~~i~~~l~-a~~~G~sv~Ei~w~~~~g~~~~~~ 154 (547) .|.++++.....+.-.+-.-.|-.-+-...++.+.|++.+.. ++|+.-..++.. 
-..+|--.+.++- ++..-.+.+ T Consensus 79 EVd~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~lldf~~~~~~~fR~WYVDGriy~Hkii--k~~k~GI~e 156 (533) T protein:vir:58 79 LISTVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVINIEKNAYPIIRNMIKYGDMFLHILE--KGSDGTIEK 156 (533) T ss_pred chhhHHHhhhceeeEecCCCceeEeecccccccHHHHHHHHHHhcchhhhhHHHHhhhhcceeEEEecc--CCcccchhh Confidence 999999988776554321111111111122233334333322 234444444332 2335555555532 233445667 Q ss_pred eEeeccceeEeccc--CCcceeecc-------CCCCceeeccCceEEEeec-CCcCccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 155 LIWRDPRHFRFQKS--DLATPLLLD-------DNGAEVPLPAFRFIFADIP-AKSGILLRSGLARVAAWGWMFKAFTARD 224 (547) Q Consensus 155 l~~~~~~~~~~~~~--~~~~~~~~~-------~~~~g~~l~~~k~i~~~~~-~~~~~p~G~gll~~~~~~~~~K~~~~~~ 224 (547) |..++|+.+++..+ ....+.... .+..++.+|++..+++.+. .....+++.|.|+.+..++---+...-. T Consensus 157 lr~lDPr~i~~vr~~~t~~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~NQLkmiEDA 236 (533) T protein:vir:58 157 FQVVSPYIFSKRYNPETDTWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWNQLRLMEDA 236 (533) T ss_pred heecCCeeeEEEEeeccceEEEeecccccccccCccccccchhheeeeeeccccCCCCceehhhhHHHHHHHHHHHHHHH Confidence 88889988754332 222222222 2334578888877766665 4556799999999999887655554433 Q ss_pred HHHHHHHcCCC------eEEEeCCCCCCHHHHHHHHHHHHH-H--hcCeeEE-----------------ec-----CCce Q lcl|NC_020866. 
225 WAIFTQTYGQP------LRIGKWQPGASEQDKDTLFRAVAN-I--AGDCAAI-----------------IP-----ETMS 273 (547) Q Consensus 225 w~~f~E~~G~P------~~vgk~~~~~~~~~~~~l~~al~~-~--~~~~~~v-----------------ip-----~~~~ 273 (547) .+.|- .-=-| +=||..|..-.++=...+....++ + .+.++-| +| .|++ T Consensus 237 lVIYR-isRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTE 315 (533) T protein:vir:58 237 LMLYR-VVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVE 315 (533) T ss_pred HHHHh-hcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccce Confidence 33332 22223 233444433222222222222221 0 0011111 22 5789 Q ss_pred EEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch--hccCccccccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 274 IDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT--TTTDAVAGGHAVSQEHRLV-QEDIETSDAMALAAILNRDLIRP 350 (547) Q Consensus 274 ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt--lts~~~ggs~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~ 350 (547) |+.+.. ++-| -..-|+|+.+++-+++--.. |..++ +.+++..-+..++ |...+......+...|.+||+ T Consensus 316 I~TLpG-g~lg----emeDV~YF~kkLy~ALnVP~sRl~~e~-~fgr~~eItRDEiKF~KFI~rLR~rF~~ll~~qLi-- 387 (533) T protein:vir:58 316 IDILQG-SKVD----LAEDVEYMLNRLISALKVPKAFIGYEG-DVNAKNTLATQDIKFNNTIKRIQGFFVEELERMVR-- 387 (533) T ss_pred eeecCC-CCCC----cHHHHHHHHHHHHHHhCCCeeecCCCC-CCccchhhhHHHHHHHHHHHHHHHHHHHHHhcccc-- Confidence 999863 2222 23558999999999953322 22332 2234433334444 556677777777777777653 Q ss_pred HHHhcCCCcccccEEEecCc----cccCHHHHHHHHHHHHhCCCCCCHHHHHHH-hCCCCCCCcc-hhcccccccc---c Q lcl|NC_020866. 351 WIQLEYGPQKRYPRIKIARP----KTEDLTKLASSLDTLVRLGMEIEENEVRSR-FGFSQPKPGA-KLLRPLENPA---T 421 (547) Q Consensus 351 l~~~Nfg~~~~~p~~~~~~~----~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~-~Glp~p~~~e-~~~~~~~~~~---~ 421 (547) ++---...-.++.|... +-.+.+-+.+++..|..+-=-|.+.|+++. +.++.....+ +.+....... . 
T Consensus 388 ---lk~iit~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~tdei~~q~e~ie~E~~~~~~~~ 464 (533) T protein:vir:58 388 ---MNKEFADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIPYDLKPQEEVAEAAGGGGLFDT 464 (533) T ss_pred ---cccCcchhheeeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCChhhhHHHHHHHHhhcCCCCCC Confidence 33111111113444322 223444555666666554222677887664 3555311111 1111000000 0 Q ss_pred ccccccccccccCCccccccccccc---------cchhhh--hhcccchhh------hcccccccchhH Q lcl|NC_020866. 422 QAAPPGMDPGAGDLKTAERKLKPLS---------GQADLS--RGDLSLNSE------MPSTGQKTGGAE 473 (547) Q Consensus 422 ~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~--~~~~~~~~~------~~~~~~~~~~~~ 473 (547) +.......++........+...+.. ..+.+- .....+.+. .......-+... T Consensus 465 ~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~~~~~~~~p~~~ 533 (533) T protein:vir:58 465 GGFGEETTPADFLGERGSPIESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEETGGGEEELPFPEEE 533 (533) T ss_pred CCcccccCCcccCccccCcccCCCChhhHhcccCCcccccccccccccchhhhhhcCCcccCCCCCCCC Confidence 0000000000000000000000000 000000 000000000 000000000000 No 209 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=97.30 E-value=0.0001 Score=42.51 Aligned_cols=396 Identities=11% Similarity=0.089 Sum_probs=169.4 Q ss_pred hhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHH------HHHHHHHh---hhH--H--H-HHHHHHHHH Q lcl|NC_020866. 22 EEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYL------DLAEVIEE---RDP--H--Y-LGVIGTRKR 87 (547) Q Consensus 22 ~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~------~L~~~m~~---~D~--~--v-~s~l~~Rk~ 87 (547) .-|=+.-..-+...+.+-|. +....+....++.... +...+. .++..+.. 
.+- | + +.+..+..+ T Consensus 1 ~~~~~~~~~~i~~w~~~~~~-~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~ 78 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPN-GSEPELIPKYLPLVPD-NQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAE 78 (518) T ss_pred CcchhhHHHHHHHhhcCCCC-ccchhccHHHhhhccc-chhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHHHH Confidence 11100000011111222221 1122344444443332 221110 11111100 001 1 1 123445555 Q ss_pred HHhcCCceEec-CCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCCeeeEEeeEeeccceeEe Q lcl|NC_020866. 88 SVSQLDITVEP-ASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRF 165 (547) Q Consensus 88 ~v~~~~w~v~p-~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~ 165 (547) -|.+-+-+|.- +.+...++...+++.+.++.-.|...+...+ .+...|=.++-+.|. ++.. ++.++++..|.. T Consensus 79 ll~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d--~~~~---~i~~v~ad~~~P 153 (518) T protein:vir:78 79 YISGKPLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL--NGRP---SISVHSSSQFWI 153 (518) T ss_pred hhcCCCceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE--CCee---EEEEEcCCeeEE Confidence 56666656643 2232233445677888888777777776665 688888887777664 2322 122233222211 Q ss_pred ------------------------------cc----------cCCcce---eeccCCCCcee------------------ Q lcl|NC_020866. 166 ------------------------------QK----------SDLATP---LLLDDNGAEVP------------------ 184 (547) Q Consensus 166 ------------------------------~~----------~~~~~~---~~~~~~~~g~~------------------ 184 (547) .. ++...+ ++.++.+.+++ T Consensus 154 ~~~~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~ 233 (518) T protein:vir:78 154 DFKNNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTND 233 (518) T ss_pred EeecCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCccccccccccccccccccccccc Confidence 00 000000 11111111110 Q ss_pred ------ec---cCceEEEeec-----CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe-----CC-CC Q lcl|NC_020866. 
185 ------LP---AFRFIFADIP-----AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGK-----WQ-PG 244 (547) Q Consensus 185 ------l~---~~k~i~~~~~-----~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk-----~~-~~ 244 (547) ++ +.-|++|..+ ...++|+|.|.+..+.-..-.=...+..|..-++. |-+-+++. .. .+ T Consensus 234 ~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~ 312 (518) T protein:vir:78 234 IQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKVNK 312 (518) T ss_pred CccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCCCC Confidence 00 1124444332 23578999999999987766666666777776665 66555442 11 11 Q ss_pred CCHHHHHHHHHHHHHHhcCeeEEecCC----c----eEEEEecCCCCCchHHHHHHHHHHHHHHHHHH-hc-chhccCcc Q lcl|NC_020866. 245 ASEQDKDTLFRAVANIAGDCAAIIPET----M----SIDFVESKSIGSSVDLYEKRINHLDQQISKAV-LG-QTTTTDAV 314 (547) Q Consensus 245 ~~~~~~~~l~~al~~~~~~~~~vip~~----~----~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i-lG-qtlts~~~ 314 (547) .....+.. -....+.+..++.. . .|+.+...-. .+.|...++.+-++|...+ ++ ++++.+ T Consensus 313 ~~~~~~~~-----fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir---~e~~~~~~~~~l~~~~~~~G~s~~tfg~~-- 382 (518) T protein:vir:78 313 STDKEEWS-----MNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFR---DGSYRETMEYFAQKAVSKSGYNPATFNLG-- 382 (518) T ss_pred CCCccccc-----cCCCCceEEEecCcCCCCCccccceeeeecccC---hHHHHHHHHHHHHHHHHhhCCChhhcCcc-- Confidence 11000000 01111222222211 1 1333332211 1235555555555554443 22 344322 Q ss_pred ccccchhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhc---CC------C-cccccEEEecCccccCHHHHHHHH Q lcl|NC_020866. 315 AGGHAVSQEHRLVQE--DIETSDAMALAAILNRDLIRPWIQLE---YG------P-QKRYPRIKIARPKTEDLTKLASSL 382 (547) Q Consensus 315 ggs~a~~~vh~~v~~--~~~~ad~~~i~~~ln~~li~~l~~~N---fg------~-~~~~p~~~~~~~~~ed~~~~a~~~ 382 (547) +|.....++..+-.. ..+..-.+.+...+ ++|+..++.+- ++ + ...-+.|.|+..-.+|.++.++.. 
T Consensus 383 ~~~~TATei~s~~~~~~~t~~~~~~~~e~al-~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~ 461 (518) T protein:vir:78 383 NREVKATEIWSLQDATVRKIEKKKRLIQNVY-EQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTL 461 (518) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHH Confidence 222333444433222 23445555666666 35666665431 11 1 122378899999999999999999 Q ss_pred HHHHhCCCCCCHH-HHHHHh-CCCCCCCcchhccccccccccccc-ccccccccCCcc Q lcl|NC_020866. 383 DTLVRLGMEIEEN-EVRSRF-GFSQPKPGAKLLRPLENPATQAAP-PGMDPGAGDLKT 437 (547) Q Consensus 383 ~~l~~~G~~v~~~-~~~e~~-Glp~p~~~e~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 437 (547) ++++..|+ ++.+ ++++.+ |..+.+..+++.......+....+ |....+-.+... T Consensus 462 ~~~v~aGi-mS~e~~i~~~~~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 462 NNMNSALA-MSVEEKVKLIHPKWEDEEIQAEVKRIYLENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred HHHHhcCC-CCHHHHHHHhCCCCCHHHHHHHHHHHHHHhcccCCCCCccccCCCCCCC Confidence 99999998 6655 466653 665433333332222221111111 110011000000 No 210 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=97.30 E-value=0.0001 Score=42.48 Aligned_cols=365 Identities=9% Similarity=-0.015 Sum_probs=157.2 Q ss_pred ccch---hhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHH--HHH Q lcl|NC_020866. 15 VQRA---QLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRK--RSV 89 (547) Q Consensus 15 ~~~~---~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk--~~v 89 (547) |+.. .|.+.+... ....++....|-+.. .+ +.....-...+..+++.+ =....-+++... 
..+ T Consensus 1 m~~~~i~~L~~~~~~~-~~r~~~~~~yy~g~~-~~-------~~~~~~~p~~~~~~~~~v---~nw~~~~Vd~~a~rl~~ 68 (422) T protein:vir:97 1 MNYMGMGYLRRKLALF-KTGVDKRYRYYAMDD-RD-------DTRSIVMPNNVREMYRSV---LEWTAKGVDSLADRIIF 68 (422) T ss_pred CChHHHHHHHHHHHHH-HHHHHHHHHHHhcCC-Ch-------hhcCccccHHHHHHHHhh---cchhHHHHHHHHhcccc Confidence 3332 233333221 111222222221110 00 000000001111121100 011111222211 111 Q ss_pred hcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEee-cCCeeeEEeeEeeccceeEe-- Q lcl|NC_020866. 90 SQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNT-SEGQYFPDKLIWRDPRHFRF-- 165 (547) Q Consensus 90 ~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~-~~g~~~~~~l~~~~~~~~~~-- 165 (547) .+ |.+ + +. .+.++++.-+++.....++ +|+.||.|.. ++|.- ++|. | .+..++|++... T Consensus 69 ~G--f~~--~--d~-------~l~~~w~~N~ld~~~~~~~~~al~~G~sf~-~v~~~~~~~~--p-~i~~~sp~~~~~i~ 131 (422) T protein:vir:97 69 RE--FTN--D--DF-------NAWEIFKANNPDIFFDTAIQSALIASCCFV-YIMPGAEDGL--P-KMQVIEASKATGIL 131 (422) T ss_pred ce--eeC--C--ch-------hHHHHHHhcChHHHHHHHHHHHHHhcceeE-EEeeCCCCCe--e-EEEEechhhEEEEE Confidence 22 221 1 11 1445666566777777766 7899999765 45532 3343 2 244555543321 Q ss_pred cc----------------cCCcce--ee-------ccCCCCce----eeccCceEEEeecCCcCccccchhH-HHHHHHH Q lcl|NC_020866. 166 QK----------------SDLATP--LL-------LDDNGAEV----PLPAFRFIFADIPAKSGILLRSGLA-RVAAWGW 215 (547) Q Consensus 166 ~~----------------~~~~~~--~~-------~~~~~~g~----~l~~~k~i~~~~~~~~~~p~G~gll-~~~~~~~ 215 (547) +. ++.... +. ....+... +++.--++.+.++.+.+.|+|.|-+ +.+.... 
T Consensus 132 D~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~ 211 (422) T protein:vir:97 132 DPTTFLLTEGYAILESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQ 211 (422) T ss_pred eCCCCcceeeEEEEEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCCcceEEecccCCCccccCccccchhHHHHH Confidence 11 111100 00 00111111 1111135556677788889998865 4443333 Q ss_pred HHHHHHHHHHHHHHHHcCCCeEEEe--CCCCCCHHHHHHHHHHHHHHhcCeeEEecCCc---eEEEEecCCCCCchHHHH Q lcl|NC_020866. 216 MFKAFTARDWAIFTQTYGQPLRIGK--WQPGASEQDKDTLFRAVANIAGDCAAIIPETM---SIDFVESKSIGSSVDLYE 290 (547) Q Consensus 216 ~~K~~~~~~w~~f~E~~G~P~~vgk--~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~---~ie~~e~~~~~~~~~~f~ 290 (547) --=+..+..-..-.|-|+.|.++.+ .+.+...+ .+. +.-+....+|.+. .+++-+.. +++...|. T Consensus 212 da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~---~~~-----~~~~~i~~~~~de~~~~~~v~q~~--~~~l~~~~ 281 (422) T protein:vir:97 212 KAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKPME---KWR-----ATVSTLLEISKDEDGDKPTVGQFT--TASMAPFM 281 (422) T ss_pred HHHHHHHHHHHHHHHHhcchhhhhcccCcccccCc---hhh-----hhhhhhhccCCCCCCCcceeeecC--CCChhHHH Confidence 3333345555566788999976643 22221111 111 1112345566543 36665443 34444566 Q ss_pred HHHHHHHHHHHHHHhcchhccCc-cccccchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-----c Q lcl|NC_020866. 291 KRINHLDQQISKAVLGQTTTTDA-VAGGHAVSQ---EHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQK-----R 361 (547) Q Consensus 291 ~li~~~d~~Isk~ilGqtlts~~-~ggs~a~~~---vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~-----~ 361 (547) ..++.+-++||-. .+-+...=+ .+.+.++++ ....-....++.-.+.+...+. ++.+.++.+.-+... . 
T Consensus 282 ~~l~~~~~~~a~~-s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~-~~~rla~~~~~~~~~~~~~~~ 359 (422) T protein:vir:97 282 EHLKMYASLFAGG-SGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFL-NVAYIAVCLRDEFPYLRNQFM 359 (422) T ss_pred HHHHHHHHHHhcc-cCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCcccchhhc Confidence 6665555555433 111111101 111112222 2333344455566777888885 477766666422111 1 Q ss_pred ccEEEecCccccC---HHHHHHHHHHHHhCC-CCCCHHHHHHHhCCCCCCCcchhcccccccc Q lcl|NC_020866. 362 YPRIKIARPKTED---LTKLASSLDTLVRLG-MEIEENEVRSRFGFSQPKPGAKLLRPLENPA 420 (547) Q Consensus 362 ~p~~~~~~~~~ed---~~~~a~~~~~l~~~G-~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~ 420 (547) -..+.|....+.+ +.+.|+++.||+++| ...+.+.+++.+|+..+.+.-.-+....... T Consensus 360 ~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 360 DTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGADKPIPAITEVTTDG 422 (422) T ss_pred cceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCCCchhHHHHHHHhhhccC Confidence 1356666333333 667788899999986 3477889999999965433222111111000 No 211 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=96.97 E-value=0.00023 Score=40.52 Aligned_cols=405 Identities=13% Similarity=0.058 Sum_probs=175.2 Q ss_pred eeCCCCCcccchhhhhhhhh--hhhhhhHHHHhhhhccCCCHHHHHHHHHhhcC------CCHHHHHHHHHHHHh---hh Q lcl|NC_020866. 7 LLDRWGNPVQRAQLTEEVAA--PTVTGVRSPLSGYPGDGLNPLRLAQILRAADH------GDPVRYLDLAEVIEE---RD 75 (547) Q Consensus 7 ~~~~~g~~~~~~~l~~~~a~--~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~------g~~~~~~~L~~~m~~---~D 75 (547) ++...|+-.+-..-..+... +.-.-+++.+.+-- ....+++.. .+..+ |+.... 
-- T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~---------~~~~r~~yl~~~~~~~~e~~----Y~~rl~rA~~~ 67 (491) T protein:vir:95 1 MLTANGQGSGVKTKHREWLHYAPKWQKVRHALAGDL---------VGYLRNVGLNEPDKAYGEAR----QAEYEAGGIVY 67 (491) T ss_pred CcccCCccCCCCccCHHHHHHHHHHHHHHHHhcCcc---------hhhcccCCCcCCCCCCCHHH----HHHHHhcccCC Confidence 56666775554433333321 11222223322200 001111111 11122 332222 25 Q ss_pred HHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHH-HhCCCHHHHHHHHH-HhhhhcCeeeeeEEee-------- Q lcl|NC_020866. 76 PHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDW-LKREELQQEVFHIL-DCISKGYSFTEIVWNT-------- 145 (547) Q Consensus 76 ~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~-l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~-------- 145 (547) ++++..++.....|.+.+..++- +.. +..+++.+ ..+.+++.+++.++ .++.||.+.+=+-+-. T Consensus 68 n~~~~tl~~l~G~vfrk~p~~~~----p~~--l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ad 141 (491) T protein:vir:95 68 NFTRRTLSGMVGSVMRKEPEINI----PKE--LEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAE 141 (491) T ss_pred ChHHHHHHHHhchhhcCCceeec----cHH--HHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHH Confidence 77778888888888888877742 111 11222111 12346888888888 4777887755444411 Q ss_pred -------------------------cCCeeeEEeeEeeccc-------eeE-----------ecccCCcceee--ccCCC Q lcl|NC_020866. 146 -------------------------SEGQYFPDKLIWRDPR-------HFR-----------FQKSDLATPLL--LDDNG 180 (547) Q Consensus 146 -------------------------~~g~~~~~~l~~~~~~-------~~~-----------~~~~~~~~~~~--~~~~~ 180 (547) .+|...+..+..+... .|. .+.++.-+... ....+ T Consensus 142 e~~~~~rPy~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g 221 (491) T protein:vir:95 142 QNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEG 221 (491) T ss_pred HHHhcCCcEEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCC Confidence 1222222222211110 000 00011100000 00000 Q ss_pred C-------------ceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHH-HHHHHHHHHHHcCCCeEEEeCCCCCC Q lcl|NC_020866. 
181 A-------------EVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAF-TARDWAIFTQTYGQPLRIGKWQPGAS 246 (547) Q Consensus 181 ~-------------g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~-~~~~w~~f~E~~G~P~~vgk~~~~~~ 246 (547) . +..++.=-|+++.. ...+-..+.+.|..++..-+ ++| ...+.-.-+..-+.|+++..-..+.+ T Consensus 222 ~~~~~~~~~~~~~g~~~l~~IPfv~~~~-~~~~~~~~~pPLl~LA~lni-~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~ 299 (491) T protein:vir:95 222 GAQEEVVEIYPDLGESLRGVIPFTFIGA-TNNDATIDDAPLLPLAELNI-GHYRNSADNEESSFVVGQPTLFIYPGDNLT 299 (491) T ss_pred cceeeeeeeeecCCCcccCeeEEEEEec-CCCCCCCCcCchHHHHHHHH-HHhhhhhHHHHHHHHcccceeeeecCcccC Confidence 0 00111111332222 23344445655666666544 333 33344444556688887765333333 Q ss_pred HHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHH Q lcl|NC_020866. 247 EQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRL 326 (547) Q Consensus 247 ~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~ 326 (547) ++-.......--.++..++..+|.+.+..+++++..+ . ..+. ++-...+| ..+|-.|...++.-|--....... T Consensus 300 ~~~~~~~~~~~i~~g~~~~~~lP~~~~~~~ie~~~~~--~-~~~~-l~~~e~qm--~~~Ga~l~~~~~~~Ta~~~~~~~~ 373 (491) T protein:vir:95 300 PQSFKEANPNGIKFGSRCGHNLGYGGSAQLIQAGENN--L-ARQN-MLDKEQQA--IQIGAQLITPSQQITAESARIQRG 373 (491) T ss_pred cchhhccCcceeEecCcCCcCCCCCCccceeecCcch--H-HHHH-HHHHHHHH--HHHHHHhccCCcchhHHHHHHHHH Confidence 3322222222235677888899999999999986432 1 2333 33233333 334444433211111111222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEe--cCcc-ccCHHHHHHHHHHHHhCCCCCCHHHHHH---H Q lcl|NC_020866. 327 VQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKI--ARPK-TEDLTKLASSLDTLVRLGMEIEENEVRS---R 400 (547) Q Consensus 327 v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~--~~~~-~ed~~~~a~~~~~l~~~G~~v~~~~~~e---~ 400 (547) ....++.+-+..+++.+++ ++++++.+---....-+.|.. +... ..+ ....+.+.++.+.|. 
++.+.+++ + T Consensus 374 ~~~S~L~~~a~~~e~al~~-~l~~~a~w~G~~~~~~v~i~~n~dF~~~~~~-~~~~~all~~~~~G~-is~~t~~~~L~~ 450 (491) T protein:vir:95 374 ADTSVMATIARNVSQAYTD-ALRWVAMMLGKPEDSEVEFQLNMDFFLQPMT-AQDRAAWMADINAGL-LPATAYYAALRK 450 (491) T ss_pred HhhHHHHHHHHHHHHHHHH-HHHHHHHHcCCCCCCceEEEeecccccccCC-HHHHHHHHHHHhcCC-CCHHHHHHHHHh Confidence 3456777888899999964 889999985212222233322 2222 223 334566667777886 77654433 4 Q ss_pred hCCCCCCCcchhcccccccccccccccccccccCCcccccccccc Q lcl|NC_020866. 401 FGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPL 445 (547) Q Consensus 401 ~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (547) .|++.+..+++.......+ .+.+ ...+..+..+.+.- .... T Consensus 451 ~~vl~~~~e~~~~~ie~~~-~~~~--~~~~~~~~~~~~~~-~~~~ 491 (491) T protein:vir:95 451 AGVTDWTDEDILNAIEDAP-LPSG--AVTQVAGEIPQAAQ-QQQE 491 (491) T ss_pred CCCCCccHHHHHHHHHhcC-CCCC--ccccccccchhhhh-hccC Confidence 5787665433333222211 1101 11111111000000 0000 No 212 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=96.61 E-value=0.00047 Score=38.85 Aligned_cols=402 Identities=13% Similarity=0.077 Sum_probs=170.3 Q ss_pred CCCCCeeeCCCCCcccch---hhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhh--cCCCHHHH--HHHHHHHHh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRA---QLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAA--DHGDPVRY--LDLAEVIEE 73 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~---~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a--~~g~~~~~--~~L~~~m~~ 73 (547) |. |.+.-.+-|... ..++.+. .+..++.-.+++.++..+-+.. =.|+.... ........ 
T Consensus 1 m~----~~~~~k~~~~~~~~~~~~~~~~---------~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~- 66 (500) T protein:vir:98 1 MG----VIQKIKNLVTRSKYVMTTQSLT---------NITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETK- 66 (500) T ss_pred Cc----hHHHHHHHHHHHHHHhhcchhh---------hhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcc- Confidence 55 333322222211 0111111 1112222233443333322110 01111100 00000000 Q ss_pred hhH----HH-HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecC Q lcl|NC_020866. 74 RDP----HY-LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSE 147 (547) Q Consensus 74 ~D~----~v-~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~ 147 (547) .+. -+ .....+--..|++-+-.|... +.+..+++.++++.-.|...+...+ .|..+|=.++-+.|+. T Consensus 67 ~~~~~slnl~~~i~~~~A~lv~~e~~~i~~~-----d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-- 139 (500) T protein:vir:98 67 KRDLNHLPIARTAAKKIASLVFNEQAEIKVD-----DDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-- 139 (500) T ss_pred cCceeecchHHHHHHHHhhhhcCCcceEecC-----ChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-- Confidence 000 01 122334444555555555443 2345678889888877888777776 5888898888877753 Q ss_pred CeeeEEeeEeeccceeE---------------------------------ecc--cC-Ccce---eeccCC----CCcee Q lcl|NC_020866. 148 GQYFPDKLIWRDPRHFR---------------------------------FQK--SD-LATP---LLLDDN----GAEVP 184 (547) Q Consensus 148 g~~~~~~l~~~~~~~~~---------------------------------~~~--~~-~~~~---~~~~~~----~~g~~ 184 (547) +.. +|.++++..|. +.. ++ ..++ +..+.+ |..++ T Consensus 140 ~~~---~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~ 216 (500) T protein:vir:98 140 DKV---RVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVP 216 (500) T ss_pred Cce---EEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccc Confidence 221 12222222110 000 00 0011 111111 11111 Q ss_pred -------eccC---------ceEEEe----ecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEE----- Q lcl|NC_020866. 
185 -------LPAF---------RFIFAD----IPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIG----- 239 (547) Q Consensus 185 -------l~~~---------k~i~~~----~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vg----- 239 (547) +++. -|.+++ .+...++|+|.|.+..+....-.=...+..|+.-++. |-+-+++ T Consensus 217 l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l 295 (500) T protein:vir:98 217 LSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLT 295 (500) T ss_pred cccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-CcceeeechHHh Confidence 1111 133322 2234578999999999988777767777777776665 4332222 Q ss_pred eCCC-CCCHHHHHHHHHHHHHHhcCeeEEecC----CceEEEEecCCCCCchHHHHHHHHHHHHHHHHH-Hhcc-hhccC Q lcl|NC_020866. 240 KWQP-GASEQDKDTLFRAVANIAGDCAAIIPE----TMSIDFVESKSIGSSVDLYEKRINHLDQQISKA-VLGQ-TTTTD 312 (547) Q Consensus 240 k~~~-~~~~~~~~~l~~al~~~~~~~~~vip~----~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~-ilGq-tlts~ 312 (547) +... +.+.+. +....-.....-+..++. +..|+.....- -.+.|...++.+-++|+.. .+++ +++.+ T Consensus 296 ~~~~~~~~g~~---~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~i---r~e~~~~~l~~~l~~i~~~~gls~~~~~~~ 369 (500) T protein:vir:98 296 ALTVRTTDGDV---VPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPI---RADDYIKAINEGLSLFEMQIGVSAGLFSFD 369 (500) T ss_pred cccCCCCCccc---cCCcccCCCcceEEEcCCCCCcCcceeEecccc---ChHHHHHHHHHHHHHHHHHhCCCccccccC Confidence 1111 111000 000000111111222221 12244332221 1123555555555555433 2443 44434 Q ss_pred ccccccchhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cC-C---CcccccEEEecCccccCHHHHHHHH Q lcl|NC_020866. 313 AVAGGHAVSQEHR--LVQEDIETSDAMALAAILNRDLIRPWIQL----EY-G---PQKRYPRIKIARPKTEDLTKLASSL 382 (547) Q Consensus 313 ~~ggs~a~~~vh~--~v~~~~~~ad~~~i~~~ln~~li~~l~~~----Nf-g---~~~~~p~~~~~~~~~ed~~~~a~~~ 382 (547) ++ |...+-++.. .-....+..-.+.+..+|. 
+|++.++.+ ++ + +...-+.+.|+..-.+|.++.++.+ T Consensus 370 ~~-g~~TAtei~s~~~~~~~t~~~~~~~~~~al~-~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~ 447 (500) T protein:vir:98 370 GK-SMKTATEIVSENSDTYQMRNSIVALVEQSLK-ELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYW 447 (500) T ss_pred cC-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHH Confidence 33 3222233322 2223345556677777774 577777644 22 1 1122357888888889999999999 Q ss_pred HHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCcccccccc Q lcl|NC_020866. 383 DTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLK 443 (547) Q Consensus 383 ~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (547) .+++..|+--.+.++.+.||+++.+..+++-.......+.... +. +....-+. T Consensus 448 ~~~v~aGi~s~~~~i~~~~g~~eeea~~~l~~i~~E~~~~~~~----~~----~~~~~~g~ 500 (500) T protein:vir:98 448 IKVVNAGFGTREMAIQKVLNVTEEKAQEIAAEINTGIVDEINQ----QR----TDTHLYGE 500 (500) T ss_pred HHHHHcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhccccCCC----CC----ccccccCC Confidence 9999999844456678888986543222222111110000000 00 00000000 No 213 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=96.61 E-value=0.00047 Score=38.85 Aligned_cols=402 Identities=13% Similarity=0.077 Sum_probs=170.3 Q ss_pred CCCCCeeeCCCCCcccch---hhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhh--cCCCHHHH--HHHHHHHHh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRA---QLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAA--DHGDPVRY--LDLAEVIEE 73 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~---~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a--~~g~~~~~--~~L~~~m~~ 73 (547) |. |.+.-.+-|... ..++.+. .+..++.-.+++.++..+-+.. =.|+.... ........ 
T Consensus 1 m~----~~~~~k~~~~~~~~~~~~~~~~---------~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~- 66 (500) T protein:vir:30 1 MG----VIQKIKNLVTRSKYVMTTQSLT---------NITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETK- 66 (500) T ss_pred Cc----hHHHHHHHHHHHHHHhhcchhh---------hhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcc- Confidence 55 333322222211 0111111 1112222233443333322110 01111100 00000000 Q ss_pred hhH----HH-HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecC Q lcl|NC_020866. 74 RDP----HY-LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSE 147 (547) Q Consensus 74 ~D~----~v-~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~ 147 (547) .+. -+ .....+--..|++-+-.|... +.+..+++.++++.-.|...+...+ .|..+|=.++-+.|+. T Consensus 67 ~~~~~slnl~~~i~~~~A~lv~~e~~~i~~~-----d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-- 139 (500) T protein:vir:30 67 KRDLNHLPIARTAAKKIASLVFNEQAEIKVD-----DDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-- 139 (500) T ss_pred cCceeecchHHHHHHHHhhhhcCCcceEecC-----ChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-- Confidence 000 01 122334444555555555443 2345678889888877888777776 5888898888877753 Q ss_pred CeeeEEeeEeeccceeE---------------------------------ecc--cC-Ccce---eeccCC----CCcee Q lcl|NC_020866. 148 GQYFPDKLIWRDPRHFR---------------------------------FQK--SD-LATP---LLLDDN----GAEVP 184 (547) Q Consensus 148 g~~~~~~l~~~~~~~~~---------------------------------~~~--~~-~~~~---~~~~~~----~~g~~ 184 (547) +.. +|.++++..|. +.. ++ ..++ +..+.+ |..++ T Consensus 140 ~~~---~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~ 216 (500) T protein:vir:30 140 DKV---RVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVP 216 (500) T ss_pred Cce---EEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccc Confidence 221 12222222110 000 00 0011 111111 11111 Q ss_pred -------eccC---------ceEEEe----ecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEE----- Q lcl|NC_020866. 
185 -------LPAF---------RFIFAD----IPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIG----- 239 (547) Q Consensus 185 -------l~~~---------k~i~~~----~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vg----- 239 (547) +++. -|.+++ .+...++|+|.|.+..+....-.=...+..|+.-++. |-+-+++ T Consensus 217 l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l 295 (500) T protein:vir:30 217 LSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLT 295 (500) T ss_pred cccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-CcceeeechHHh Confidence 1111 133322 2234578999999999988777767777777776665 4332222 Q ss_pred eCCC-CCCHHHHHHHHHHHHHHhcCeeEEecC----CceEEEEecCCCCCchHHHHHHHHHHHHHHHHH-Hhcc-hhccC Q lcl|NC_020866. 240 KWQP-GASEQDKDTLFRAVANIAGDCAAIIPE----TMSIDFVESKSIGSSVDLYEKRINHLDQQISKA-VLGQ-TTTTD 312 (547) Q Consensus 240 k~~~-~~~~~~~~~l~~al~~~~~~~~~vip~----~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~-ilGq-tlts~ 312 (547) +... +.+.+. +....-.....-+..++. +..|+.....- -.+.|...++.+-++|+.. .+++ +++.+ T Consensus 296 ~~~~~~~~g~~---~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~i---r~e~~~~~l~~~l~~i~~~~gls~~~~~~~ 369 (500) T protein:vir:30 296 ALTVRTTDGDV---VPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPI---RADDYIKAINEGLSLFEMQIGVSAGLFSFD 369 (500) T ss_pred cccCCCCCccc---cCCcccCCCcceEEEcCCCCCcCcceeEecccc---ChHHHHHHHHHHHHHHHHHhCCCccccccC Confidence 1111 111000 000000111111222221 12244332221 1123555555555555433 2443 44434 Q ss_pred ccccccchhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cC-C---CcccccEEEecCccccCHHHHHHHH Q lcl|NC_020866. 313 AVAGGHAVSQEHR--LVQEDIETSDAMALAAILNRDLIRPWIQL----EY-G---PQKRYPRIKIARPKTEDLTKLASSL 382 (547) Q Consensus 313 ~~ggs~a~~~vh~--~v~~~~~~ad~~~i~~~ln~~li~~l~~~----Nf-g---~~~~~p~~~~~~~~~ed~~~~a~~~ 382 (547) ++ |...+-++.. .-....+..-.+.+..+|. 
+|++.++.+ ++ + +...-+.+.|+..-.+|.++.++.+ T Consensus 370 ~~-g~~TAtei~s~~~~~~~t~~~~~~~~~~al~-~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~ 447 (500) T protein:vir:30 370 GK-SMKTATEIVSENSDTYQMRNSIVALVEQSLK-ELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYW 447 (500) T ss_pred cC-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHH Confidence 33 3222233322 2223345556677777774 577777644 22 1 1122357888888889999999999 Q ss_pred HHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCcccccccc Q lcl|NC_020866. 383 DTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLK 443 (547) Q Consensus 383 ~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (547) .+++..|+--.+.++.+.||+++.+..+++-.......+.... +. +....-+. T Consensus 448 ~~~v~aGi~s~~~~i~~~~g~~eeea~~~l~~i~~E~~~~~~~----~~----~~~~~~g~ 500 (500) T protein:vir:30 448 IKVVNAGFGTREMAIQKVLNVTEEKAQEIAAEINTGIVDEINQ----QR----TDTHLYGE 500 (500) T ss_pred HHHHHcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhccccCCC----CC----ccccccCC Confidence 9999999844456678888986543222222111110000000 00 00000000 No 214 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=96.47 E-value=0.00059 Score=38.28 Aligned_cols=406 Identities=12% Similarity=0.024 Sum_probs=167.4 Q ss_pred eeCCCCCcccchhhhhhhhh--hhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHH--HHHHHHHHhh---hHHHH Q lcl|NC_020866. 7 LLDRWGNPVQRAQLTEEVAA--PTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRY--LDLAEVIEER---DPHYL 79 (547) Q Consensus 7 ~~~~~g~~~~~~~l~~~~a~--~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~--~~L~~~m~~~---D~~v~ 79 (547) ++...|+-.+-..-..+... +.-.-+++.+.+--. ...+.+....+.+. 
..-|+....+ -++++ T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~---------~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~ 71 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLHYAPKWQKVRHALAGELV---------SYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTR 71 (489) T ss_pred CccCCCccCCCCccCHHHHHHHHHHHHHHHHhcCccc---------ccccCCCCCCCCCCCChHHHHHHHhccccCChHH Confidence 55666775554433333321 122223333332100 00111111100000 0113322221 46666 Q ss_pred HHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHH-HhCCCHHHHHHHHH-HhhhhcCeeeeeEEee------------ Q lcl|NC_020866. 80 GVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDW-LKREELQQEVFHIL-DCISKGYSFTEIVWNT------------ 145 (547) Q Consensus 80 s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~-l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~------------ 145 (547) ..++.....|.+.+-.++ .+. ++..+++.+ ..+.+++.+++.++ .++.||.+.+=+-+-. T Consensus 72 ~tl~~l~G~vfrk~p~~~----~p~--~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~ 145 (489) T protein:vir:78 72 RTLSGMVGSVMRKEPEIN----IPK--ELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAG 145 (489) T ss_pred HHHHHHhchhhcCCccee----ccH--HHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHh Confidence 677777777777765553 111 122222211 12346888988888 4778888755444411 Q ss_pred ---------------------cCCeeeEEeeEeeccc-------eeEe-----------cccCCccee-ec-c-CCCC-- Q lcl|NC_020866. 146 ---------------------SEGQYFPDKLIWRDPR-------HFRF-----------QKSDLATPL-LL-D-DNGA-- 181 (547) Q Consensus 146 ---------------------~~g~~~~~~l~~~~~~-------~~~~-----------~~~~~~~~~-~~-~-~~~~-- 181 (547) .+|...+..+..+... .|.. +.++.-+.. .+ . ++.. T Consensus 146 ~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~ 225 (489) T protein:vir:78 146 LLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQE 225 (489) T ss_pred cCCcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccc Confidence 1232222222211110 0000 000000000 00 0 0000 Q ss_pred ----------ceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHH Q lcl|NC_020866. 
182 ----------EVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKD 251 (547) Q Consensus 182 ----------g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~ 251 (547) +..+..=-|+++.. ...+-..+.+.|..++..-+--.....+.-.-+..-+.|+++..-....+++... T Consensus 226 ~~~~~~~~~g~~~l~~IPfv~~~~-~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~ 304 (489) T protein:vir:78 226 DVVEIYPDLGESLRGVIPFTFIGA-TNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENLTPQAFK 304 (489) T ss_pred eeeEEeccCCCCccCeeeEEEEec-CCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccCCccccc Confidence 00111111222222 2233344555566666554422223344444555667887775422223322222 Q ss_pred HHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHHHHHH Q lcl|NC_020866. 252 TLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLVQEDI 331 (547) Q Consensus 252 ~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v~~~~ 331 (547) .....--.+++.++..+|.+.+..+++++..+ . .. +.++-...+|. .+|-.|.+.+..-|--...........+ T Consensus 305 ~~~~~~i~~g~~~~~~lp~~~~~~~ie~~~~~--~-~r-~~l~~le~qm~--~lGa~l~~~~~~~Ta~~~~~~~~~~~S~ 378 (489) T protein:vir:78 305 EANPNGIKFGSRRGHNLGYGGSAQLIQAGENN--L-AR-QNMLDKEQQAI--QIGAQLITPTQQITAQSARIQRGADTSV 378 (489) T ss_pred ccCccceeeCCcccccCCCCCCcceeccCcch--H-HH-HHHHHHHHHHH--HHhhhhccCCcchhHHHHHHHHHHhhHH Confidence 22222234678888999999999999987422 1 22 22322333333 3454333221111111112223334567 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEe--cCc-cccCHHHHHHHHHHHHhCCCCCCHHHHHH---HhCCCC Q lcl|NC_020866. 332 ETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKI--ARP-KTEDLTKLASSLDTLVRLGMEIEENEVRS---RFGFSQ 405 (547) Q Consensus 332 ~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~--~~~-~~ed~~~~a~~~~~l~~~G~~v~~~~~~e---~~Glp~ 405 (547) +.+-+..+++.++ +++++++.+---.+..-+.|.. +.. ...| ....+.+.++.+.|. ++.+.+++ +-|+.. 
T Consensus 379 L~~~a~~~e~al~-~~l~~~a~w~G~~~~~~~~i~~n~dF~~~~~d-~~~~~al~~~~~~G~-is~~t~~~~L~~~gv~d 455 (489) T protein:vir:78 379 MATIARNVSQAYT-DALRWVAVMLGKPEDTEVEFRLNMDFFLEPMT-AQDRAAWMADINAGL-LPATAYYAALRKAGVTD 455 (489) T ss_pred HHHHHHHHHHHHH-HHHHHHHHHcCCCCCCceEEEeecccCcccCC-HHHHHHHHHHHhcCC-CCHHHHHHHHHhCCCCC Confidence 7788889999996 5899999984212222233322 222 1223 234556667778886 77654433 446765 Q ss_pred CCCcchhcccccccc--cccccccccccccCCcc Q lcl|NC_020866. 406 PKPGAKLLRPLENPA--TQAAPPGMDPGAGDLKT 437 (547) Q Consensus 406 p~~~e~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 437 (547) +.+.+........+. +...++..+.+.+.... T Consensus 456 ~~~e~~~~ei~~~~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 456 WTDADIKDAVADQPLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred ccHHHHHHHHhhcCCCcccCCcccCCCCcccccC Confidence 433222211111110 00000000110000000 No 215 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=96.16 E-value=0.00093 Score=37.22 Aligned_cols=409 Identities=12% Similarity=0.096 Sum_probs=163.0 Q ss_pred CCCCCeeeCCCCCc---ccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHH--hhcCCCHHHHHHHHHH---HH Q lcl|NC_020866. 1 MAQKPVLLDRWGNP---VQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILR--AADHGDPVRYLDLAEV---IE 72 (547) Q Consensus 1 ma~~~~~~~~~g~~---~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr--~a~~g~~~~~~~L~~~---m~ 72 (547) |.=..+|.+...+- +..+.+.+.. .++.-.+++....+|-+ .--.|+...+....-+ .. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~-------------~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~ 67 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSIN-------------DHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQE 67 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhh-------------cCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccc Confidence 33111121111100 0000111000 00000111111111110 0001111111000000 00 Q ss_pred hh--hHHHH-HHHHHHHHHHhcCCceEecCCC------CHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeE Q lcl|NC_020866. 
73 ER--DPHYL-GVIGTRKRSVSQLDITVEPASD------DPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIV 142 (547) Q Consensus 73 ~~--D~~v~-s~l~~Rk~~v~~~~w~v~p~~~------~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~ 142 (547) ++ ---+. .+..+....|.+-.-.|...+. +...+...+++.++++.-+|...+...+ ++...|=.++-+. T Consensus 68 ~~~~sl~~~~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~ 147 (517) T protein:vir:98 68 RDYMTLNLRKLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPY 147 (517) T ss_pred cceeecCcHHHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEE Confidence 00 00011 1223444445554444443221 2223446688999888877877777666 6888888888777 Q ss_pred EeecCCeeeEEeeEeeccceeE---e----------------cccCC-------------------cce-----eeccCC Q lcl|NC_020866. 143 WNTSEGQYFPDKLIWRDPRHFR---F----------------QKSDL-------------------ATP-----LLLDDN 179 (547) Q Consensus 143 w~~~~g~~~~~~l~~~~~~~~~---~----------------~~~~~-------------------~~~-----~~~~~~ 179 (547) |.. +.. +|.++++..|. + ..++. +.+ +..+.. T Consensus 148 ~d~--~~~---~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~ 222 (517) T protein:vir:98 148 VDN--GEI---EFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDN 222 (517) T ss_pred EeC--Cee---EEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCC Confidence 753 211 12222221110 0 00000 001 011111 Q ss_pred ----CCcee-------eccCc---------eEEEee----cCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_020866. 180 ----GAEVP-------LPAFR---------FIFADI----PAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQP 235 (547) Q Consensus 180 ----~~g~~-------l~~~k---------~i~~~~----~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P 235 (547) |..++ +++.. |.+++. +...++|+|.|.+..+....-.-...+..|..=++.-... 
T Consensus 223 ~~~lG~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~ 302 (517) T protein:vir:98 223 EGEIGKRIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRT 302 (517) T ss_pred CccccccccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcc Confidence 11111 11111 222222 1233679999999999877666666666666655553222 Q ss_pred eEE----EeCCCCCCH-------HHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHH- Q lcl|NC_020866. 236 LRI----GKWQPGASE-------QDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKA- 303 (547) Q Consensus 236 ~~v----gk~~~~~~~-------~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~- 303 (547) +.| .+..+.... +..+.+...+. .+ +.+.-|+.....- -.+.|.+-++.+=++|+.. T Consensus 303 i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~---~~-----~~~~~i~~~~~~i---R~e~~~~~~~~~L~~i~~~~ 371 (517) T protein:vir:98 303 VFVSDVMLRTVPDESGMPPPQVFDPDVNVYKSIR---MG-----TDEEFVKDVTHDI---RTEQYKEAINQALRTLEMEL 371 (517) T ss_pred eecChhhhccccCCCCcccCCCCCcccceeeecc---CC-----CCCCceeeecccc---chHHHHHHHHHHHHHHHHHh Confidence 222 111111000 00011111110 00 0011122211110 0112444455555555433 Q ss_pred Hhc-chhccCccccccchhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----CCC---cccccEEEecCccc Q lcl|NC_020866. 304 VLG-QTTTTDAVAGGHAVSQEHRL--VQEDIETSDAMALAAILNRDLIRPWIQLE-----YGP---QKRYPRIKIARPKT 372 (547) Q Consensus 304 ilG-qtlts~~~ggs~a~~~vh~~--v~~~~~~ad~~~i~~~ln~~li~~l~~~N-----fg~---~~~~p~~~~~~~~~ 372 (547) .++ ++++-+++ |...+.++..+ -.-.-+.+-.+.+..+| ++|++.++.+. ++. ....+.+.|+..-. 
T Consensus 372 Gls~~t~~~~~~-~~kTATEi~s~~~~~~~t~~~~~~~~~~aL-~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~ 449 (517) T protein:vir:98 372 KLSVGTFSFDGR-SMKTATEIVSENDLTYRTRNDHVYEVEQFI-KGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVF 449 (517) T ss_pred CCCccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCC Confidence 344 45554443 33323344332 22234555666677777 45777665431 322 22346889999999 Q ss_pred cCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchh Q lcl|NC_020866. 373 EDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQAD 450 (547) Q Consensus 373 ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (547) +|.++.++...+++..|+--.+.++.+.||+.+.+..+++-......... .+.. ..........+... T Consensus 450 ~D~~~~~~~~~~~v~aG~ms~~~~i~~~~g~~eeeA~~e~~~i~~E~~~~-~~~~---------~~~~~~~~~~gd~e 517 (517) T protein:vir:98 450 QDRSALLRFYGQAKTFGFIPTVEAIQRIFKVPKKTAEQWLEEIRKDQIEL-DPVT---------ISQRAQKRMFGDEE 517 (517) T ss_pred CCHHHHHHHHHHHHhcCCCCHHHHHHHhCCCChHHHHHHHHHHHHhcccc-CCCC---------ccccccCCCCCCCC Confidence 99999999999999999844567889999987544333332221111110 0000 00000000000000 No 216 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=96.15 E-value=0.00094 Score=37.18 Aligned_cols=438 Identities=13% Similarity=0.049 Sum_probs=193.3 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhh---hccCC-CHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGY---PGDGL-NPLRLAQILRAADHGDPVRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~---~~~~l-~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~ 76 (547) |++..+=+++.-+-+.-..-+ -+..... +++..| -.--. +-..+..|++.. 
+ ...++ |+ T Consensus 1 m~~~~~q~~p~~~~fp~~~a~-wV~~~D~----~RlaaY~ly~d~y~n~~~el~~il~G~---d---r~~~~------~p 63 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDN-IVDENDK----NRVRAYDLYENIYLNSAETLKLVLRGD---D---SVPIL------MP 63 (563) T ss_pred CCccccccCCCcccccccccc-cCCHHHH----HHHHHHHHHHHhhcCchhhhhhhcCCC---c---eeeec------cc Confidence 998887766655433221111 1111111 111111 11000 111223333322 1 11221 33 Q ss_pred HHHHHHHHHHHHHhcCCce--EecCCCCH-HHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEee---cCCe Q lcl|NC_020866. 77 HYLGVIGTRKRSVSQLDIT--VEPASDDP-QDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNT---SEGQ 149 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~~w~--v~p~~~~~-~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~---~~g~ 149 (547) --+-.+++ ...+++.+.+ |++...++ ..+.+-..++.+.+.-+|.....+.- +|..-|=.|+-+.|+. .+++ T Consensus 64 s~r~~V~~-~~~~Lg~~~~~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R 142 (563) T protein:vir:74 64 SGRKIVEA-VHRFLGVGFDYLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGER 142 (563) T ss_pred hHHHHHHH-HHHhcCCCcEEecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCC Confidence 33334445 4455566544 45555333 33334456677777666666655554 7999999999999974 3344 Q ss_pred eeEEeeEeeccceeE-------------------------------------ecccCCcc----eeeccC---------- Q lcl|NC_020866. 150 YFPDKLIWRDPRHFR-------------------------------------FQKSDLAT----PLLLDD---------- 178 (547) Q Consensus 150 ~~~~~l~~~~~~~~~-------------------------------------~~~~~~~~----~~~~~~---------- 178 (547) .... .++|+++. +-.++... ++..-+ T Consensus 143 ~rv~---~vDP~~~fp~~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r 219 (563) T protein:vir:74 143 ISVD---EVDPRQIFLIEDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDR 219 (563) T ss_pred ceEe---ecCCceeeeccCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhcccccccc Confidence 4433 22222111 10000000 000000 Q ss_pred C-------------------CCceeec-cCceE---EEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_020866. 
179 N-------------------GAEVPLP-AFRFI---FADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQP 235 (547) Q Consensus 179 ~-------------------~~g~~l~-~~k~i---~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P 235 (547) + ++...+| +..|| ++......+..||.|-|..+.-...--+....+-...++..|.| T Consensus 220 ~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~p 299 (563) T protein:vir:74 220 GAISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLG 299 (563) T ss_pred CccchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCC Confidence 0 0000111 22333 34455667889999999998888777777888888888899999 Q ss_pred eEEEeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCCce---EEEEecCCCCCchHHHHHHHHHHHH-HHHHHHhcch--- Q lcl|NC_020866. 236 LRIGKWQPGASEQDKDTLFRAVANIAGDCAAIIPETMS---IDFVESKSIGSSVDLYEKRINHLDQ-QISKAVLGQT--- 308 (547) Q Consensus 236 ~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~---ie~~e~~~~~~~~~~f~~li~~~d~-~Isk~ilGqt--- 308 (547) +.+-.-... .+.-+..+.. =+++.++..=+|...+ ++.+. +..+...++..+++++. .|+.. -+-+ T Consensus 300 i~vl~~~~p-~d~~~g~~~~--w~vgpG~i~El~~~~~~g~l~~v~---g~~~l~~~q~Hm~~l~eral~~~-s~tPavA 372 (563) T protein:vir:74 300 MYVTNASAP-VDPNTGELTD--WNIGPMQIVEIAGNRNDNYFERVS---GVQDVSPFQDHMKWIDEKGIAEG-SGTPEVA 372 (563) T ss_pred eEEeccccc-cccccccccc--cccCCceeEeccCCccccceeeec---chhhhHHHHHHHHHHHHHHHHhh-ccCccee Confidence 876542111 1111111111 2345555555665433 44432 11222234444444443 33321 1111 Q ss_pred h-ccCc--cccccchhH----HHH-HHHHHH-HHHHHHHHHHHHHHHHHHHHHHhc--------CCCcc----cccEEEe Q lcl|NC_020866. 309 T-TTDA--VAGGHAVSQ----EHR-LVQEDI-ETSDAMALAAILNRDLIRPWIQLE--------YGPQK----RYPRIKI 367 (547) Q Consensus 309 l-ts~~--~ggs~a~~~----vh~-~v~~~~-~~ad~~~i~~~ln~~li~~l~~~N--------fg~~~----~~p~~~~ 367 (547) + |-+. .-+++|+-- .-. --++.+ +.+-++++..-..+.|++.+-.+- ||... 
....+.| T Consensus 373 ~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf 452 (563) T protein:vir:74 373 IGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIF 452 (563) T ss_pred ecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEe Confidence 1 1121 113444321 111 112222 555566666666666666655531 22111 1124568 Q ss_pred cCccccCHHHHHHHHHHHHhCCCCCCHHHHHHHh---CCCCCCCcc--------hhccccccccccccccccc-ccccCC Q lcl|NC_020866. 368 ARPKTEDLTKLASSLDTLVRLGMEIEENEVRSRF---GFSQPKPGA--------KLLRPLENPATQAAPPGMD-PGAGDL 435 (547) Q Consensus 368 ~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e~~---Glp~p~~~e--------~~~~~~~~~~~~~~~~~~~-~~~~~~ 435 (547) ...-+.|.++..+-...|++.|+ ++.+-..+++ |.|.|+-+. .+.....+.+....+-+.. .+.+.. T Consensus 453 ~p~~P~d~~~vv~~~~tl~~aGi-iSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~ 531 (563) T protein:vir:74 453 ADPMPVNKTQVTQDTLLLQQAHL-ILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGA 531 (563) T ss_pred CCCCCccHHHHHHHHHHHHHcCc-hhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCC Confidence 99999999998888889999998 7766665555 777665222 1211111111111111111 111111 Q ss_pred ccccccccccccchhhhhhcccchhhhcccccccchhH Q lcl|NC_020866. 436 KTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAE 473 (547) Q Consensus 436 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 473 (547) +. ...++...+-..+....+.++....++... 
T Consensus 532 ~~------~~~dd~g~p~~~~~~~~~~~~~~~~~~~~~ 563 (563) T protein:vir:74 532 GE------QQFDDQGNPIDQFGNPVEIPPDVTQVPLSP 563 (563) T ss_pred Cc------ccccccCCchhHcCCcccCCccccccCCCC Confidence 10 011111111111222222222222222111 No 217 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=95.39 E-value=0.0022 Score=35.19 Aligned_cols=433 Identities=12% Similarity=0.098 Sum_probs=181.2 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhh----hhhHHH-HhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTV----TGVRSP-LSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERD 75 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~----~~~~~~-~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D 75 (547) |||.- +|--|+...+....++... .+.-.+ .+++-..-.+. .. .--+-..+..-|+.|. .+ T Consensus 1 ~~~~l-----fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~---e~-----~~~~~~eLI~~YR~ma-~~ 66 (537) T protein:vir:10 1 MAQQL-----FGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYSVDF---DG-----TIRNDHELITRYREMV-LN 66 (537) T ss_pred Ccccc-----ccceeecccccccCCcccCCCcccccceeeccccccccccc---cc-----ccchHHHHHHHHHHHh-hc Confidence 99643 3666655433322222110 000000 01111111111 01 1112345566777776 59 Q ss_pred HHHHHHHHHHHHHHhcCCc-----eEecCC---CCHHHHHHHHHHHHHHhCCCHHHHHHHHHH-hhhhcCeeeeeEEeec Q lcl|NC_020866. 76 PHYLGVIGTRKRSVSQLDI-----TVEPAS---DDPQDVKIADMVRDWLKREELQQEVFHILD-CISKGYSFTEIVWNTS 146 (547) Q Consensus 76 ~~v~s~l~~Rk~~v~~~~w-----~v~p~~---~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~-a~~~G~sv~Ei~w~~~ 146 (547) |.|-++++....-+.-.+- .|.-.+ .....+++.++.+..++-++|+.-..++.. -...|--.+.++-... 
T Consensus 67 pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k 146 (537) T protein:vir:10 67 PECDSAVDDVVNETICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPK 146 (537) T ss_pred cchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCC Confidence 9999999988877665432 222211 111223455555555555566555555542 2224444555554444 Q ss_pred CCeeeEEeeEeeccceeEeccc-----C-Ccc--------------eeecc------CCCCceeeccCceEEEee--cCC Q lcl|NC_020866. 147 EGQYFPDKLIWRDPRHFRFQKS-----D-LAT--------------PLLLD------DNGAEVPLPAFRFIFADI--PAK 198 (547) Q Consensus 147 ~g~~~~~~l~~~~~~~~~~~~~-----~-~~~--------------~~~~~------~~~~g~~l~~~k~i~~~~--~~~ 198 (547) +..-.+.++..++||.+++... . ... +.... .+..++.+|.+ -|+|.| -.. T Consensus 147 ~pk~GI~ELr~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~d-AI~y~hSGl~d 225 (537) T protein:vir:10 147 KPRQGLVELRYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAPD-SIAYCHSGIQD 225 (537) T ss_pred CccccceeeeeeCCccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceeccHh-heeeeccccee Confidence 4344456777788887643221 0 110 01111 13345667764 555555 345 Q ss_pred cCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe-EEEeCCCCCCHHHH--HHHHHHHHHHhcC------eeE--- Q lcl|NC_020866. 
199 SGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL-RIGKWQPGASEQDK--DTLFRAVANIAGD------CAA--- 266 (547) Q Consensus 199 ~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~-~vgk~~~~~~~~~~--~~l~~al~~~~~~------~~~--- 266 (547) .++++..|.|+.+..++-=-+...-..+.| -.-=-|- ||.-.+.+.-+..+ +=|.+.+....+- ++- T Consensus 226 ~n~~~i~syLhkAiKp~NQLkm~EDAlVIY-RitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~d 304 (537) T protein:vir:10 226 LNKNMVLSHLHKAIKAVNQLRMIEDSLVIY-RLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKD 304 (537) T ss_pred CCCCeeeeeehhhhHHHHhhHHHHhhHHHH-hhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecc Confidence 566888999999988875444433333332 2222342 22222333222222 1122222222211 111 Q ss_pred -----------Eec-----CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch--hccCcccc--ccchhHHHHH Q lcl|NC_020866. 267 -----------IIP-----ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT--TTTDAVAG--GHAVSQEHRL 326 (547) Q Consensus 267 -----------vip-----~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt--lts~~~gg--s~a~~~vh~~ 326 (547) -+| .|++|..+..+.+-|. ..=++|..+.+-+++--.. |.+++ +. +++..=+..+ T Consensus 305 drk~msMlEDyWLPRReGgrgTEItTLpGgqnlge----m~DV~YF~kKLy~aLnVP~SRl~~e~-~f~~Gr~~EItRDE 379 (537) T protein:vir:10 305 DKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGE----LEDVKYFQKKLYKALNVPSSRLETET-TFNIGRAAEITRDE 379 (537) T ss_pred cchhhhhhhhhcccccCCCcccceeeccccCCcCh----HHHHHHHHHHHHHHhCCCccccCCCC-cccccccchhhHHH Confidence 123 5789998865443332 3347899999998853322 22232 21 2332233444 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCCcccccEEEecCc----cccCHHHHHHHHHHHHhC----CCC Q lcl|NC_020866. 327 V-QEDIETSDAMALAAILNRDLIRPWIQLE------YGPQKRYPRIKIARP----KTEDLTKLASSLDTLVRL----GME 391 (547) Q Consensus 327 v-~~~~~~ad~~~i~~~ln~~li~~l~~~N------fg~~~~~p~~~~~~~----~~ed~~~~a~~~~~l~~~----G~~ 391 (547) + |...+......++..|..-|-..|+--| |..-...-+|.|... 
+-.+.+-+.+++..|..+ |-- T Consensus 380 iKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky 459 (537) T protein:vir:10 380 VKFQKFIARLRKRFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKY 459 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcc Confidence 4 4455666666666666654444444333 211112223333222 122333444555555443 434 Q ss_pred CCHHHHHHHh-CCCCCC-----------Ccchhcccccccccccccccc---cccccCCccccccccccccchhhhhhcc Q lcl|NC_020866. 392 IEENEVRSRF-GFSQPK-----------PGAKLLRPLENPATQAAPPGM---DPGAGDLKTAERKLKPLSGQADLSRGDL 456 (547) Q Consensus 392 v~~~~~~e~~-Glp~p~-----------~~e~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (547) ++.+|+++.. .+.+.+ ..+.+...+..........+. -+..+..+.+.++... T Consensus 460 ~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------ 527 (537) T protein:vir:10 460 FSANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNSAV------------ 527 (537) T ss_pred cchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccCCccCC------------ Confidence 7888887643 433211 000011100000000000000 0000000000000000 Q ss_pred cchhhhcccccccchhHH Q lcl|NC_020866. 457 SLNSEMPSTGQKTGGAEI 474 (547) Q Consensus 457 ~~~~~~~~~~~~~~~~~~ 474 (547) .++.+..+- + T Consensus 528 --~~~~~~~~~------~ 537 (537) T protein:vir:10 528 --SPADQKRGE------L 537 (537) T ss_pred --CCCCccCCC------C Confidence 000000000 0 No 218 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=94.90 E-value=0.0032 Score=34.24 Aligned_cols=462 Identities=11% Similarity=0.094 Sum_probs=195.6 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhc-cCCCHHHHHHHHHhhcCCCHHHHHHHHH-HHH------ Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPG-DGLNPLRLAQILRAADHGDPVRYLDLAE-VIE------ 72 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~-~~l~p~~~~~ilr~a~~g~~~~~~~L~~-~m~------ 72 (547) || +..|-+- .++..=++.++. .+...|..+.. |..=..+|....+-.-.. ....+.|. ... T Consensus 3 ~~-~~~~~~~----~~~~~~~~~~~~----~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~--~~~~~y~~~~~~~~~~~~ 71 (651) T protein:vir:80 3 LA-TTTTDKN----RQTYDETHDVSS----YVKKEYKRFCDARQVCEETWLEAWGMYLST--PEAQDYLRDQVLRSVGDV 71 (651) T ss_pred cc-ccccchh----hhhhhhhHHHHH----HHHHHHHHHHHHhhhhhhhHHHHHHhhccc--HHHHHhhccccccccCCC Confidence 66 2222111 111111222221 12222222221 111112333322211110 01111110 000 Q ss_pred -------hhhHHHHHHHHHHHHHHhcC-----C-ceEecCCCCHHHHHHHHHHHHHHh----CCCHHHHHHHH-HHhhhh Q lcl|NC_020866. 73 -------ERDPHYLGVIGTRKRSVSQL-----D-ITVEPASDDPQDVKIADMVRDWLK----REELQQEVFHI-LDCISK 134 (547) Q Consensus 73 -------~~D~~v~s~l~~Rk~~v~~~-----~-w~v~p~~~~~~~~~~ae~v~~~l~----~~~~~~~i~~~-l~a~~~ 134 (547) -.++.|+.+++.+...+... + +.|+|..+...++..++.|..++. ..+|...+..+ .+++.+ T Consensus 72 ~~~~rs~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~ 151 (651) T protein:vir:80 72 NADWRHKITTGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLIT 151 (651) T ss_pred CCCCCccccChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhccc Confidence 02567888888777777663 2 667777666656666677776654 56898888776 589999 Q ss_pred cCeeeeeEEeecC---------------C--eee----------EEeeEeeccceeEecccCCc---------------- Q lcl|NC_020866. 135 GYSFTEIVWNTSE---------------G--QYF----------PDKLIWRDPRHFRFQKSDLA---------------- 171 (547) Q Consensus 135 G~sv~Ei~w~~~~---------------g--~~~----------~~~l~~~~~~~~~~~~~~~~---------------- 171 (547) |.+|+-+.|++.- + .+. --.+..+||..|.++..-.. 
T Consensus 152 G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~~ 231 (651) T protein:vir:80 152 GNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKAD 231 (651) T ss_pred CceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHHH Confidence 9999988886320 0 000 00234444443333321000 Q ss_pred ---------------------------------------------------cee----------------eccCCCCce- Q lcl|NC_020866. 172 ---------------------------------------------------TPL----------------LLDDNGAEV- 183 (547) Q Consensus 172 ---------------------------------------------------~~~----------------~~~~~~~g~- 183 (547) .++ .....+..+ T Consensus 232 l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~il 311 (651) T protein:vir:80 232 ILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEVL 311 (651) T ss_pred HHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEEe Confidence 000 000000000 Q ss_pred ---eec---cCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHH Q lcl|NC_020866. 184 ---PLP---AFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAV 257 (547) Q Consensus 184 ---~l~---~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al 257 (547) ..+ ..-|++++.....+..||.|..+.+.......+...+.....+.+...|.....-+.....++ + T Consensus 312 ~~~~~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~-------l 384 (651) T protein:vir:80 312 RFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPED-------V 384 (651) T ss_pred cccccCCCCCCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHH-------h Confidence 111 124777777888899999999999999999999999999999999988876554332222111 1 Q ss_pred HHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhc-cC-ccc-cccchhHHHHH--HHHHHH Q lcl|NC_020866. 
258 ANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTT-TD-AVA-GGHAVSQEHRL--VQEDIE 332 (547) Q Consensus 258 ~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlt-s~-~~g-gs~a~~~vh~~--v~~~~~ 332 (547) . .+.++.+......++..+..... .......++++++..|.....-..+. +. ..+ +...+++++.. .....+ T Consensus 385 ~-~~pg~vi~~~~~~~~~~l~~~~~--~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l 461 (651) T protein:vir:80 385 Y-TEPGKVFLVSDHGDLQPLANQSS--NFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRL 461 (651) T ss_pred h-cCCCceEEecCCCCceeeccCcc--cchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHH Confidence 1 23455555566666666654321 22234567888888887773221221 11 111 22233455433 344556 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhc--CCCcccccEEE--------ecCccccC------------------HHHHHHHHHH Q lcl|NC_020866. 333 TSDAMALAAILNRDLIRPWIQLE--YGPQKRYPRIK--------IARPKTED------------------LTKLASSLDT 384 (547) Q Consensus 333 ~ad~~~i~~~ln~~li~~l~~~N--fg~~~~~p~~~--------~~~~~~ed------------------~~~~a~~~~~ 384 (547) ..-.+.+..++-+.|++.++.++ |+.....|++. +.....+| .....+.. . T Consensus 462 ~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~-~ 540 (651) T protein:vir:80 462 SGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRL-T 540 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHH-H Confidence 66677777766666777777766 44333333220 00000011 11112222 2 Q ss_pred HHhCCCCCC-----------HHHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhh Q lcl|NC_020866. 385 LVRLGMEIE-----------ENEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSR 453 (547) Q Consensus 385 l~~~G~~v~-----------~~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (547) +.+.+...+ ...+.+..|++.+. ..+....+.+++.+. ++ ..... 
T Consensus 541 ~~q~~~~~p~~~~~~~~~~~~~~l~~~~g~~~~~---~~l~~~~q~~~~~~~----~~-----------------~~~q~ 596 (651) T protein:vir:80 541 FIQAVAQVPEMGQLVDYKRILVDLLQHWGFEEPE---AYLKQQDQQAPANPQ----EA-----------------LLSQA 596 (651) T ss_pred HHHhhccCCccchhhhHHHHHHHHHHHcCCCCcH---HhcCCCccchhhhhh----HH-----------------HHhhH Confidence 222222111 13456677886543 233222111110000 00 00000 Q ss_pred hcccchhhhcccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHhcCCCHHHHHH Q lcl|NC_020866. 454 GDLSLNSEMPSTGQKTGGAEIAALTDQLELEGQPEIVAMIEEIRTMLEAASSLGEFSDMLDARYGEIGASQLAS 527 (547) Q Consensus 454 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~~~~~~l~~ 527 (547) .... .+.....+..++....+..+..-.....+.++ +...+..-=..|--.+|+. T Consensus 597 ~~~~------------~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~l~~~~~~~ 651 (651) T protein:vir:80 597 KDVG------------GQAMSNMLQNQLQADGGTQMMSEMYGTPNADQ-------MQQELMATTPNVSEQQLTQ 651 (651) T ss_pred HHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccC Confidence 0000 00000000011000000000000000000000 0000100000111112222 No 219 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=94.75 E-value=0.0036 Score=33.99 Aligned_cols=182 Identities=14% Similarity=0.065 Sum_probs=84.7 Q ss_pred EEeCCC-----CCCHHHHHHHHHHHHHHhc--CeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhc Q lcl|NC_020866. 238 IGKWQP-----GASEQDKDTLFRAVANIAG--DCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTT 310 (547) Q Consensus 238 vgk~~~-----~~~~~~~~~l~~al~~~~~--~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlt 310 (547) |.|... 
..++.+..+-...+.+..+ ++.++...+.+++.+...- +| ...++...-.+||-+ .|..+| T Consensus 1 V~k~~~l~~~~~~~~~~~~~r~~~~~~~~~~~~~~~ld~~~e~~e~~~~~l-sG----l~d~l~~~~~~iaa~-s~iP~t 74 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAARLRLAQVDNNSGVGQAIGIDADSEEYNVLNSDI-GG----IDTFLSQKFDRIVAL-SGIHEI 74 (201) T ss_pred CccchHHHHHhcCChHHHHHHHHHHHHhhhhhhhheeecCCcceeeeecCc-CC----hHHHHHHHHHHHHhH-hcCchh Confidence 333210 1122333333444444443 3444555668899887654 34 345555555666544 454443 Q ss_pred ---cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCcccc-------CHHHHHH Q lcl|NC_020866. 311 ---TDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTE-------DLTKLAS 380 (547) Q Consensus 311 ---s~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~e-------d~~~~a~ 380 (547) +.+.+|=.|.|+.-...+-+.+++.+.....-+.+.|++ +...+. --.|+|...... -.++.++ T Consensus 75 ~LfG~sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~----~~~~~~--~~~~~f~pL~~~s~kekAei~~~~a~ 148 (201) T protein:vir:10 75 ILKGKNVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLP----FIVTEQ--EWSVEFNPLSQVSDKDKSEILEKNVN 148 (201) T ss_pred hhcCCCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHH----hhcCCC--CceEeeCCCCCCCHHHHHHHHHHHHH Confidence 233445556677777777888888775443333333444 321121 124444433322 2355678 Q ss_pred HHHHHHhCCCCCCHHHHHHHhCCCCCC--CcchhcccccccccccccccccccccCCcccccccc Q lcl|NC_020866. 381 SLDTLVRLGMEIEENEVRSRFGFSQPK--PGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLK 443 (547) Q Consensus 381 ~~~~l~~~G~~v~~~~~~e~~Glp~p~--~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (547) +++++++.|+ ++.+++++++--.... -++.........+ ....|...+... 
T Consensus 149 a~~~~~~~g~-i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~-----------e~~dp~~~~~~~ 201 (201) T protein:vir:10 149 SVAALIAAGI-IDADEARDTLRAISTEVKIGEGSIQTEVVIN-----------ESEDPLDVSANN 201 (201) T ss_pred HHHHHHHcCC-CCHHHHHHHHHhcCCcCCCCCCCCCcccccc-----------ccCCCCCCCCCC Confidence 8888899997 9999999887421100 0000000000000 000000000000 No 220 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=94.15 E-value=0.0053 Score=33.08 Aligned_cols=514 Identities=11% Similarity=-0.003 Sum_probs=183.4 Q ss_pred CCCCCe----eeCCCCCcccch---hh--hhhhhhhhhhh------------------------hHHHHhhhhc--cCCC Q lcl|NC_020866. 1 MAQKPV----LLDRWGNPVQRA---QL--TEEVAAPTVTG------------------------VRSPLSGYPG--DGLN 45 (547) Q Consensus 1 ma~~~~----~~~~~g~~~~~~---~l--~~~~a~~~~~~------------------------~~~~~~~~~~--~~l~ 45 (547) |.+.+- -+-.+||..... .| ...|+++...+ -.+....++. .+-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (694) T protein:vir:10 1 MSRRNAKKRTQLARTGRRPEVAKAAALAAAATIATAAAQPVPADFARRGALNALDAAPVAEPSPSLRLARQFEVDVSNYT 80 (694) T ss_pred CCccchhhHHHHhhcCCCcchhhhhhhhhhhhhhhcCCCcccCCccccccchhhcccccCCCCcchhhhhhccccccCCC Confidence 322211 122233322111 00 00111100000 0000000100 0011 Q ss_pred HHHHHHHHHhhcCC--------------CHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEec-CCC--------- Q lcl|NC_020866. 46 PLRLAQILRAADHG--------------DPVRYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEP-ASD--------- 101 (547) Q Consensus 46 p~~~~~ilr~a~~g--------------~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p-~~~--------- 101 (547) |.. +++..-+.+. .+.-|--| ..|. .-+.++++....-...++. |.-.- +.. 
T Consensus 81 ~~~-~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~l-a~la-Q~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~ 156 (694) T protein:vir:10 81 PRE-RRAASYALDFNGTSMDALSFVTSSGFPGFPTL-VLLA-QLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLA 156 (694) T ss_pred ccc-cchhhhhhccCcccccchhhhhccCcchHHHH-HHHh-hccchhhHHHHHHHHhhcc-cceeccccchhhhhhccc Confidence 111 0011111100 11111111 1221 3455666666666655544 62211 111 Q ss_pred ------CHHHHHHHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeecCC--------------eeeEEeeEeecc Q lcl|NC_020866. 102 ------DPQDVKIADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTSEG--------------QYFPDKLIWRDP 160 (547) Q Consensus 102 ------~~~~~~~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~~g--------------~~~~~~l~~~~~ 160 (547) +..+.+..+.+...++++...+.+++.+ .+-+||-+++=+.=.-++. .-.++.|..++| T Consensus 157 ~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp 236 (694) T protein:vir:10 157 AGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEP 236 (694) T ss_pred ccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeecCccccccccccccccccCcceeeeEeecc Confidence 1111244566777888887666666666 6999999874433222111 112334666677 Q ss_pred ceeEecccCCcceeecc------CCCCceeeccCceEEEeecC------CcCccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 161 RHFRFQKSDLATPLLLD------DNGAEVPLPAFRFIFADIPA------KSGILLRSGLARVAAWGWMFKAFTARDWAIF 228 (547) Q Consensus 161 ~~~~~~~~~~~~~~~~~------~~~~g~~l~~~k~i~~~~~~------~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f 228 (547) .|+.....+....+-.+ -...|..+-..+++.+.... ..-+.+|.++...++..+.-=.-....-... 
T Consensus 237 ~~vtP~~~n~~dP~spdfgkP~~y~V~G~~IH~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~L 316 (694) T protein:vir:10 237 YWVTPNNYNSINPVADDFYKPSTWWMIGTEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDI 316 (694) T ss_pred cccccchhhhccchhhccCCCceEEEeceEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHH Confidence 66654322111100000 00011222223333332221 2335678888888775533211111112222 Q ss_pred HHHcCCCeEEEeCC-----CCCCHHHHHHHHHHHHHHhcC-eeEEecC-CceEEEEecCCCCCchHHHHHHHHHHHHHHH Q lcl|NC_020866. 229 TQTYGQPLRIGKWQ-----PGASEQDKDTLFRAVANIAGD-CAAIIPE-TMSIDFVESKSIGSSVDLYEKRINHLDQQIS 301 (547) Q Consensus 229 ~E~~G~P~~vgk~~-----~~~~~~~~~~l~~al~~~~~~-~~~vip~-~~~ie~~e~~~~~~~~~~f~~li~~~d~~Is 301 (547) +..+.+..+ |.+ .+....+...-.+.+..+.++ +..++.+ +.+++.+..+ -+| ...++...-.+|| T Consensus 317 i~~~~v~~l--k~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~sts-lSG----LddVi~qf~q~VA 389 (694) T protein:vir:10 317 VKQFSVSGI--LMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTP-LSG----LDALQAQAQEQMS 389 (694) T ss_pred HHhhhhHHH--HHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEecCCcceEEEecc-cCC----HHHHHHHHHHHHH Confidence 233332221 111 111222333334555566555 4556774 6788877643 344 4555655556666 Q ss_pred HHHhcchhc---cCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCH--- Q lcl|NC_020866. 
302 KAVLGQTTT---TDAVAGGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDL--- 375 (547) Q Consensus 302 k~ilGqtlt---s~~~ggs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~--- 375 (547) -+ .+-.+| +.+-.|=+|.|+.-..+.-+.+++.....-..+-+.++.-+..--||....-..|+|....+-+- T Consensus 390 ga-a~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp~i~~~fnPL~qmtd~Ek 468 (694) T protein:vir:10 390 AV-SHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEV 468 (694) T ss_pred hh-hcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHH Confidence 44 444332 22334556777777777777777765543333334455544444466432223444443222222 Q ss_pred ----HHHHHHHHHHHhCCCCCCHHHHHHHhCCCCC----------------CCcc-hhc---cccccccccccccccccc Q lcl|NC_020866. 376 ----TKLASSLDTLVRLGMEIEENEVRSRFGFSQP----------------KPGA-KLL---RPLENPATQAAPPGMDPG 431 (547) Q Consensus 376 ----~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p----------------~~~e-~~~---~~~~~~~~~~~~~~~~~~ 431 (547) ++.|+.++.+++.|+ |+.++++.++.-++. .+++ +.. .......+..++++...+ T Consensus 469 AeI~~k~A~~d~~~~~~gv-I~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 547 (694) T protein:vir:10 469 AESRYKQAQSDVLYVQEQV-IRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARA 547 (694) T ss_pred HHHHhhhhHHHHHHHHhcC-CCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCcccc Confidence 445677888889997 999999999654211 1110 000 000001111111111111 Q ss_pred ccCCccccccccc-----cccchhhhhhcccchhhhcccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCH Q lcl|NC_020866. 432 AGDLKTAERKLKP-----LSGQADLSRGDLSLNSEMPSTGQKTGGAEIAALTDQLELEGQPEIVAMIEEIRTMLEAASSL 506 (547) Q Consensus 432 ~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ 506 (547) ....++....-.. .++..+..-.... ...-. 
....+ |..+-...|.- =-..|+.++++ T Consensus 548 g~~~~~~v~~~~~~~~~~~ag~~~~~~~~ag--~v~~~-----~g~vL--l~kr~~g~W~l--------PgG~vE~gEt~ 610 (694) T protein:vir:10 548 GATAPPTVANVNANVNPREAGAQDAAMRAAG--AVYVV-----DGKVL--LMKRPAGDWGL--------PAGKVEGNETP 610 (694) T ss_pred cccCCCcccccccccCccccCCCCccceeeE--EEEEe-----CCEEE--EEEecCCCccC--------CccccCCCCCH Confidence 1111111000000 0000000000000 00000 00000 00000001110 01123334444 Q ss_pred HHHHHHHHHHhcC--CCHH-HHHHHH-HHHH---HHHHHHhHHhhccC Q lcl|NC_020866. 507 GEFSDMLDARYGE--IGAS-QLASRI-SAAM---LASEFAGREGALDG 547 (547) Q Consensus 507 ee~~~~L~~l~~~--~~~~-~l~~~l-~~al---~~A~~~G~~~~~~g 547 (547) ++...| ++..+ ++.+ ++...- -..+ |.+++.+. +++-. T Consensus 611 ~~a~~R--E~~EEtGl~~~~el~~~~~~~~~~h~f~~~~e~~-~v~l~ 655 (694) T protein:vir:10 611 EEAARR--ETREETGYDHDGELVPLGKFDGFFHAFVAHLEPF-DVELN 655 (694) T ss_pred HHHHHH--HHHHHhCCccccceeeeeeecceEEEEEEeeccc-CcccC Confidence 433221 11000 1100 000000 0000 11111000 00000 No 221 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=94.14 E-value=0.0053 Score=33.07 Aligned_cols=515 Identities=11% Similarity=0.024 Sum_probs=194.9 Q ss_pred CCCCCeeeCCCCCccc----chhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcC--------------CCHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQ----RAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADH--------------GDPV 62 (547) Q Consensus 1 ma~~~~~~~~~g~~~~----~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~--------------g~~~ 62 (547) |-+..-+---+--|+- +-.|++..-. . ..+-+|.. +++..-+.+ -.+. 
T Consensus 46 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~------------~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~F~ 111 (695) T protein:vir:78 46 MGRRGALNALDAAPVAEPSPSLRLARQFEV-D------------VSNYTPRE-RRAASYALDFNGTSMDALSFVTSSGFP 111 (695) T ss_pred hcccccccccccccccCCCcccccceecee-c------------cccCCccc-cchhhhhhcccccccccchhhhccCcc Confidence 4433322111222221 1112111100 0 00111111 111111111 1112 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEec-CC---------------CCHHHHHHHHHHHHHHhCCCHHHHHH Q lcl|NC_020866. 63 RYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEP-AS---------------DDPQDVKIADMVRDWLKREELQQEVF 126 (547) Q Consensus 63 ~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p-~~---------------~~~~~~~~ae~v~~~l~~~~~~~~i~ 126 (547) -|--| ..|. .-+.++++....-...++. |.-.- +. .+..+.+..+.+...++++...+.++ T Consensus 112 Gy~~l-a~la-Q~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~ 188 (695) T protein:vir:78 112 GFPTL-VLLA-QLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVR 188 (695) T ss_pred hHHHH-HHHh-hccchhhHHHHHHHHhhcc-cceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 22222 2222 4566777777666666544 62211 11 11111244566777888887666666 Q ss_pred HHH-HhhhhcCeeeeeEEeecCC--------------eeeEEeeEeeccceeEecccCCcceeecc------CCCCceee Q lcl|NC_020866. 127 HIL-DCISKGYSFTEIVWNTSEG--------------QYFPDKLIWRDPRHFRFQKSDLATPLLLD------DNGAEVPL 185 (547) Q Consensus 127 ~~l-~a~~~G~sv~Ei~w~~~~g--------------~~~~~~l~~~~~~~~~~~~~~~~~~~~~~------~~~~g~~l 185 (547) +.+ .+-+||-+++=+.=.-++. 
.-.++.|..++|.|+.....+....+-.+ -...|..+ T Consensus 189 eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G~kI 268 (695) T protein:vir:78 189 TTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIGTEV 268 (695) T ss_pred HHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEeceEE Confidence 666 6999999874333221111 11233466667766654322111110000 00011222 Q ss_pred ccCceEEEeecC------CcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCC-----CCCCHHHHHHHH Q lcl|NC_020866. 186 PAFRFIFADIPA------KSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQ-----PGASEQDKDTLF 254 (547) Q Consensus 186 ~~~k~i~~~~~~------~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~-----~~~~~~~~~~l~ 254 (547) -..+++.+.... ..-+.+|.++...++..+.-=.-....-...+..+.+..+ |.+ .+....+...-. T Consensus 269 H~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~l--k~dla~~L~~g~~~~l~~R~ 346 (695) T protein:vir:78 269 HATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGI--LMDLAQALMPGANVDLSMRA 346 (695) T ss_pred eeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHH--HHHHHHhhcChhHHHHHHHH Confidence 223333332221 2335678888888875533211111122222233332221 111 111222333334 Q ss_pred HHHHHHhcC-eeEEecC-CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhc---cCccccccchhHHHHHHHH Q lcl|NC_020866. 
255 RAVANIAGD-CAAIIPE-TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTT---TDAVAGGHAVSQEHRLVQE 329 (547) Q Consensus 255 ~al~~~~~~-~~~vip~-~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlt---s~~~ggs~a~~~vh~~v~~ 329 (547) +.+..+.++ +..++.+ +.+++.+..+ -+| ...++...-.+||-+ .+-.+| +.+-.|=+|.|+.-..+.- T Consensus 347 eli~~~Rsn~G~~llDk~~Eefeq~sts-lSG----LddVi~qf~q~VAga-a~IPltkLfGqSPkGlNATGE~D~rnYY 420 (695) T protein:vir:78 347 ELINRYRDNRNILFLDKATEEFFQFNTP-LSG----LDALQAQAQEQMSAV-SHIPLIKLLGITPTGLNASSEGEIRVWY 420 (695) T ss_pred HHHHHhcCccceEEEecCCcceEEEecc-cCC----HHHHHHHHHHHHHhh-hcCchhhhhccCCccccccchhhHHHHH Confidence 555566555 4556774 6788877643 344 455565555666644 444332 2233455677777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCH-------HHHHHHHHHHHhCCCCCCHHHHHHHhC Q lcl|NC_020866. 330 DIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDL-------TKLASSLDTLVRLGMEIEENEVRSRFG 402 (547) Q Consensus 330 ~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~-------~~~a~~~~~l~~~G~~v~~~~~~e~~G 402 (547) +.+++.....-..+-+.++.-|..--||....-..|+|....+-+- ++.|+.++.+++.|+ |+.++++.++. T Consensus 421 D~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gv-I~~~evr~rL~ 499 (695) T protein:vir:78 421 DYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQV-IRPDQVAARLN 499 (695) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcC-CCHHHHHHHHh Confidence 7777765543333334455544444466432223444433222222 445677888889997 99999999964 Q ss_pred CCC----------------CCCcc-hhc---ccccccccccccccccccccCCccccccc--cc---cccchhhhhhccc Q lcl|NC_020866. 403 FSQ----------------PKPGA-KLL---RPLENPATQAAPPGMDPGAGDLKTAERKL--KP---LSGQADLSRGDLS 457 (547) Q Consensus 403 lp~----------------p~~~e-~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~---~~~~~~~~~~~~~ 457 (547) -++ |.+++ +.. .......+..++++...+....++....- .. .++..+..-.... 
T Consensus 500 ~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ag~~~~~~~aag 579 (695) T protein:vir:78 500 TEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVKPREAGAQDAAMRAAG 579 (695) T ss_pred cCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCCCCCCCCCCCceeeeeccccccccCCCCcccceeE Confidence 321 11111 000 00111111111111111111111111100 00 0111111100000 Q ss_pred -----------chh-----hhcccccccchhHHHHHHHHHHHHHHHHHH----H--HHHHHH-HHHHccCCHH-HHHHHH Q lcl|NC_020866. 458 -----------LNS-----EMPSTGQKTGGAEIAALTDQLELEGQPEIV----A--MIEEIR-TMLEAASSLG-EFSDML 513 (547) Q Consensus 458 -----------~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~----~--~~~~i~-~~l~~a~s~e-e~~~~L 513 (547) ..+ .++...-...+...+++...+.++..=... . .+..+. .++......+ ++.+.- T Consensus 580 ~v~~~~g~vLl~kr~~g~W~lPgG~vE~gEt~~~aa~RE~~EEtGl~~~~el~~~g~~~~~~~~f~~~~e~~~~~l~dEh 659 (695) T protein:vir:78 580 AVYVVDGKVLLMKRPAGDWGLPAGKVEGNETPEEAARRETREETGYDHDGELVPLGKFDGFFHAFVAHLEPFDVELNDEH 659 (695) T ss_pred EEEEeCCEEEEEEecCCCccCCccccCCCCCHHHHHHHHHHHHhCCccccceeeeeeecceEEEEEEeecccCcccCchh Confidence 000 011111112222233333332222110000 0 000000 0000001000 000000 Q ss_pred HHHhcCCCHHH--------HHHHHHHHHHHHHHHhHH Q lcl|NC_020866. 514 DARYGEIGASQ--------LASRISAAMLASEFAGRE 542 (547) Q Consensus 514 ~~l~~~~~~~~--------l~~~l~~al~~A~~~G~~ 542 (547) . .|.=+++++ ..+.|..|..+=+-+++. 
T Consensus 660 ~-~~~Wf~pdeLP~pL~~~~aavl~aA~k~ld~~~~~ 695 (695) T protein:vir:78 660 T-AFDWFNPDELPHPLHRDTAAIVDAACKALDRLERA 695 (695) T ss_pred h-hcccCChhhcCcccchHHHHHHHHHHHHHHhhcCC Confidence 0 011122222 334554454444444544 No 222 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=93.93 E-value=0.0059 Score=32.80 Aligned_cols=419 Identities=9% Similarity=0.058 Sum_probs=178.3 Q ss_pred CCCCCeeeCCCCCcccchhh--hh-------hhhhhhhhhhHHHHhh-----hhccCCCHHHHHHHHHhhc-CCCHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQL--TE-------EVAAPTVTGVRSPLSG-----YPGDGLNPLRLAQILRAAD-HGDPVRYL 65 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l--~~-------~~a~~~~~~~~~~~~~-----~~~~~l~p~~~~~ilr~a~-~g~~~~~~ 65 (547) |+= | ++.-.|.-++..+. .+ .++.+...+....+.. .+..+.-. +.+..-+ --+...+. T Consensus 1 m~~-~-~l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~----~~~~~~~~~~n~~eLI 74 (521) T protein:vir:10 1 MNP-I-FLKLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQ----SVLGYAPKIQNTKDLI 74 (521) T ss_pred CCc-c-hhHHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhh----hhhccccccchHHHHH Confidence 551 2 23333333322211 11 1112222211111100 01111111 1111111 11345567 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHhcCC-----ceEecCC---CCHHHHHHHHHHHHHHhCCCHHHHHHHHHH-hhhhcC Q lcl|NC_020866. 66 DLAEVIEERDPHYLGVIGTRKRSVSQLD-----ITVEPAS---DDPQDVKIADMVRDWLKREELQQEVFHILD-CISKGY 136 (547) Q Consensus 66 ~L~~~m~~~D~~v~s~l~~Rk~~v~~~~-----w~v~p~~---~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~-a~~~G~ 136 (547) .-|+.|. .+|.|-++++....-+.-.+ -.|.-.+ .+....++.++.+..++-++|+.-..++.. 
-...|- T Consensus 75 ~~YR~ma-~~pEvd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgR 153 (521) T protein:vir:10 75 NQYRSLS-KYHEVDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSR 153 (521) T ss_pred HHHHHHh-hccchhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeee Confidence 7788885 69999999998887766443 2232211 122223455555555555566555555442 222344 Q ss_pred eeeeeEEeecCCeeeEEeeEeeccceeEeccc------CCc-------ceeecc-----------CCCCceeeccCceEE Q lcl|NC_020866. 137 SFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKS------DLA-------TPLLLD-----------DNGAEVPLPAFRFIF 192 (547) Q Consensus 137 sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~------~~~-------~~~~~~-----------~~~~g~~l~~~k~i~ 192 (547) -.+.++-...+..-.+.++..++||.+.+... +.. ...+.. ....++.+|. .-|+ T Consensus 154 i~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~-daI~ 232 (521) T protein:vir:10 154 IYFHKMIDPARPKDGIKELRLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGATEDNRYNISGNSNNLVQIPI-DAIV 232 (521) T ss_pred EEEEEEeeCCCccccceeeeeeCCcceeeeeeecCCCCCcchhhccceeeeeeccCCCceecCCCCCCcceeech-hhee Confidence 44454444444444456777888887654221 000 011111 1123355666 4566 Q ss_pred Eeec--CCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe-EEEeCCCCCCHHHH--HHHHHHHHHHhcC---- Q lcl|NC_020866. 193 ADIP--AKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL-RIGKWQPGASEQDK--DTLFRAVANIAGD---- 263 (547) Q Consensus 193 ~~~~--~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~-~vgk~~~~~~~~~~--~~l~~al~~~~~~---- 263 (547) |.|. 
...+.++..|.|+.+..++-=-+...-..+.| -.-=-|- ||.-.+.|.-+..+ +=|.+.+....+- T Consensus 233 y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIY-RitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYD 311 (521) T protein:vir:10 233 YSHSGKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIY-RITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYD 311 (521) T ss_pred eecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHH-hhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEe Confidence 6663 34456889999999998875444433333332 2222342 22222322222222 1122222222111 Q ss_pred --eeE--------------Eec-----CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch--hccCcccc--cc Q lcl|NC_020866. 264 --CAA--------------IIP-----ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT--TTTDAVAG--GH 318 (547) Q Consensus 264 --~~~--------------vip-----~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt--lts~~~gg--s~ 318 (547) ++- -+| .|++|..+..+.+-|. ..=++|..+.+-+++--.. |..++.|- ++ T Consensus 312 a~TGev~ddrk~msMlEDyWLpRReGgrgTEI~TLpggqnlge----m~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr 387 (521) T protein:vir:10 312 SSTGKVKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGE----MDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGA 387 (521) T ss_pred ccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcCh----HHHHHHHHHHHHHHhCCCccccCCCCCceeccc Confidence 111 123 5789999865443332 3347899999998853322 22332221 22 Q ss_pred chhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCCcccccEEEecCc----cccCHHHHHHHHHHHHh Q lcl|NC_020866. 319 AVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLE------YGPQKRYPRIKIARP----KTEDLTKLASSLDTLVR 387 (547) Q Consensus 319 a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~N------fg~~~~~p~~~~~~~----~~ed~~~~a~~~~~l~~ 387 (547) +..=+..++ |...+......++..|..-|-..|+--| |..-...-++.|... +-.+.+-+.+++..|.. 
T Consensus 388 ~~EItRDEikF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~ 467 (521) T protein:vir:10 388 GNDITRDELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQT 467 (521) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHh Confidence 222233343 4455666666666666654444444333 211112223333221 12233333444444443 Q ss_pred ------CCCCCCHHHHHHH-hCCCCCCCc--chhcccccccccccccccccccccCCccccccccccccchhh Q lcl|NC_020866. 388 ------LGMEIEENEVRSR-FGFSQPKPG--AKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADL 451 (547) Q Consensus 388 ------~G~~v~~~~~~e~-~Glp~p~~~--e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (547) .|--++.+|+++. +.+++.+-. +..+.......--..| .+...++ T Consensus 468 ~dp~~yvGky~s~dyi~k~ILr~tDeeik~~~k~I~~E~~~~~~~~p-------------------~~e~~df 521 (521) T protein:vir:10 468 LASAEVTGKYLSHEYVMKNILRMSDEDIKTEREKIDGELKDSVYKNP-------------------EDPMEEF 521 (521) T ss_pred hcCccccccccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCCCCCCC-------------------cchhhcC Confidence 3435888999775 466532111 1111111000000000 0001111 No 223 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=92.41 E-value=0.012 Score=31.22 Aligned_cols=433 Identities=12% Similarity=0.094 Sum_probs=174.8 Q ss_pred CCCCCeeeCCCCCcccchhhhh---hhhhhhhhhhHHHH--hhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTE---EVAAPTVTGVRSPL--SGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERD 75 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~---~~a~~~~~~~~~~~--~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D 75 (547) |++ |. |--|+...-+. ..+.+........+ +++-..-.+. ..-+ -+-..+..-|+.|. 
.+ T Consensus 1 m~~---lf---g~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~---e~~~-----~~~~eLI~~YR~ma-~~ 65 (533) T protein:vir:10 1 MSQ---LF---GFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDF---DGQV-----RNEYQLISRYREMV-LQ 65 (533) T ss_pred Ccc---cc---ccccccccccccCCCCCCCCcccccceeecccccceeeec---cccc-----chHHHHHHHHHHHh-hc Confidence 876 43 33332211110 01111110000011 1111111111 1111 12345566777776 59 Q ss_pred HHHHHHHHHHHHHHhcCC-----ceEecCCC---CHHHHHHHHHHHHHHhCCCHHHHHHHHHH-hhhhcCeeeeeEEeec Q lcl|NC_020866. 76 PHYLGVIGTRKRSVSQLD-----ITVEPASD---DPQDVKIADMVRDWLKREELQQEVFHILD-CISKGYSFTEIVWNTS 146 (547) Q Consensus 76 ~~v~s~l~~Rk~~v~~~~-----w~v~p~~~---~~~~~~~ae~v~~~l~~~~~~~~i~~~l~-a~~~G~sv~Ei~w~~~ 146 (547) +.|-++++....-+.-.+ -.|.-.+- ....+++.++.+..++-++|+.-..++.. -...|--.+.++-..+ T Consensus 66 pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~ 145 (533) T protein:vir:10 66 PECDSAVDDIVNETICGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPD 145 (533) T ss_pred cchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCC Confidence 999999998887766443 22322221 11222355555555555566655555542 2234544555555444 Q ss_pred CCeeeEEeeEeeccceeEec------ccCCcc--------------eeecc------CCCCceeeccCceEEEeecC--C Q lcl|NC_020866. 147 EGQYFPDKLIWRDPRHFRFQ------KSDLAT--------------PLLLD------DNGAEVPLPAFRFIFADIPA--K 198 (547) Q Consensus 147 ~g~~~~~~l~~~~~~~~~~~------~~~~~~--------------~~~~~------~~~~g~~l~~~k~i~~~~~~--~ 198 (547) +..-.+.++..++||.+.+- ..+..+ +.+.. .+..++.+|.+ -|+|.|.. . 
T Consensus 146 ~pk~GI~ELr~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~vkI~~d-AI~y~hSGl~d 224 (533) T protein:vir:10 146 NPQGGLIELRYIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGLKIAPD-SICYVHSGIMD 224 (533) T ss_pred CccccceeeeeccccceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccccCCCceecchh-heeeeecccee Confidence 44445567778888876651 111111 11111 12345667764 45555432 2 Q ss_pred cCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe-EEEeCCCCCCHHHH--HHHHHHHHHHhcC------eeE--- Q lcl|NC_020866. 199 SGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL-RIGKWQPGASEQDK--DTLFRAVANIAGD------CAA--- 266 (547) Q Consensus 199 ~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~-~vgk~~~~~~~~~~--~~l~~al~~~~~~------~~~--- 266 (547) .++..=.|.|+.+..++-=-+...-..+.| -.-=-|- ||.-.+.|.-+..+ +=|.+.+....+- ++- T Consensus 225 ~~~~~i~syLhkAiKp~NQLkm~EDAlVIY-RitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~d 303 (533) T protein:vir:10 225 LNKNMTLSHLHKAIKAVNQLRMIEDSLVIY-RLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKD 303 (533) T ss_pred CCCCceeccchHhHHHHHhhHHHHhhHHHH-hhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecc Confidence 223333477888877765444333222222 2222342 22222323222222 1122222222211 111 Q ss_pred -----------Eec-----CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch--hccCccc-cccchhHHHHHH Q lcl|NC_020866. 267 -----------IIP-----ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT--TTTDAVA-GGHAVSQEHRLV 327 (547) Q Consensus 267 -----------vip-----~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt--lts~~~g-gs~a~~~vh~~v 327 (547) -+| .|++|..+..+.+-|. ..=++|..+.+-+++--.. 
|.++++- -+++..=+..++ T Consensus 304 drk~msMlEDyWLPRReGgrgTEItTLpGgqnLge----m~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEi 379 (533) T protein:vir:10 304 DKKFMSMLEDFWLPRREGGRGTEITTLPGGQNLGE----LEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEV 379 (533) T ss_pred cchhhhhHhhhcccccCCCCccceeeccccCCcCh----HHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHH Confidence 123 5789998865443332 3347899999998853222 2223211 123322334444 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCCcccccEEEecCc----cccCHHHHHHHHHHHHhC----CCCC Q lcl|NC_020866. 328 -QEDIETSDAMALAAILNRDLIRPWIQLE------YGPQKRYPRIKIARP----KTEDLTKLASSLDTLVRL----GMEI 392 (547) Q Consensus 328 -~~~~~~ad~~~i~~~ln~~li~~l~~~N------fg~~~~~p~~~~~~~----~~ed~~~~a~~~~~l~~~----G~~v 392 (547) |...+......++..|..-|-..|+--| |..-...-++.|... +-.+.+-+.+++..|..+ |--+ T Consensus 380 KF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~ 459 (533) T protein:vir:10 380 KFQKFVARLRKRFSELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYF 459 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 4455666666666666654444444333 211112223333222 223334445556555554 4347 Q ss_pred CHHHHHHHh-CCCCCC-----------CcchhcccccccccccccccccccccCCccccccccccccchhhhhh Q lcl|NC_020866. 393 EENEVRSRF-GFSQPK-----------PGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRG 454 (547) Q Consensus 393 ~~~~~~e~~-Glp~p~-----------~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (547) +.+|+++.. .+.+.+ ..+.+...+.....+...+..+...+......++..+.+.....++. 
T Consensus 460 S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:10 460 SVEYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGAPAEEVAPEGPDPSDERKAEF 533 (533) T ss_pred chHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCcccccCCCCCCCcchhhccCC Confidence 888887643 443211 00011111100000000000000000000000000010000000000 No 224 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=92.25 E-value=0.012 Score=31.08 Aligned_cols=515 Identities=12% Similarity=0.035 Sum_probs=186.2 Q ss_pred CCCCCeeeCCCCCccc----chhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcC--------------CCHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQ----RAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADH--------------GDPV 62 (547) Q Consensus 1 ma~~~~~~~~~g~~~~----~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~--------------g~~~ 62 (547) |-+..-+---+--|+- +-.|++..-. . ..+-+|.. +++..-+.+ -.+. T Consensus 46 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~------------~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~F~ 111 (698) T protein:vir:10 46 MGRRGALNALDAAPVAEPSPSLRLARQFEV-D------------VSNYTPRE-RRAASYALDFNGTSMDALSFVTSSGFP 111 (698) T ss_pred hcccccccccccccccCCCcccccccccee-c------------cccCCccc-cchhhhhhcccccccccchhhhccCcc Confidence 4433322111222221 1112211100 0 00111111 111111111 0112 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEec-CC---------------CCHHHHHHHHHHHHHHhCCCHHHHHH Q lcl|NC_020866. 63 RYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEP-AS---------------DDPQDVKIADMVRDWLKREELQQEVF 126 (547) Q Consensus 63 ~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p-~~---------------~~~~~~~~ae~v~~~l~~~~~~~~i~ 126 (547) -|--| ..|. .-+.++++....-...++. |.-.- +. 
.+..+.+..+.|...++++...+.++ T Consensus 112 Gy~~l-a~la-Q~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~ 188 (698) T protein:vir:10 112 GFPTL-VLLA-QLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVR 188 (698) T ss_pred hHHHH-HHHh-hccchhhHHHHHHHHhhcc-cceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 22222 1222 4566777777666666544 62211 11 11111244566777888887666666 Q ss_pred HHH-HhhhhcCeeeeeEEeecCC--------------eeeEEeeEeeccceeEecccCCcceeecc------CCCCceee Q lcl|NC_020866. 127 HIL-DCISKGYSFTEIVWNTSEG--------------QYFPDKLIWRDPRHFRFQKSDLATPLLLD------DNGAEVPL 185 (547) Q Consensus 127 ~~l-~a~~~G~sv~Ei~w~~~~g--------------~~~~~~l~~~~~~~~~~~~~~~~~~~~~~------~~~~g~~l 185 (547) +.+ .+-+||-+++=+.=.-++. .-.++.|..++|.|+.....+....+-.+ -...|..+ T Consensus 189 eai~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G~~I 268 (698) T protein:vir:10 189 TTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIGSEV 268 (698) T ss_pred HHHHhcccccceEEEEEeecCccccccccccccccccCccceeeeeecccccccchhhhccchhhccCCCceEEEeccee Confidence 666 6999999874443222111 11123366667766654322111100000 00011223 Q ss_pred ccCceEEEeecC------CcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCC------CCCCHHHHHHH Q lcl|NC_020866. 186 PAFRFIFADIPA------KSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQ------PGASEQDKDTL 253 (547) Q Consensus 186 ~~~k~i~~~~~~------~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~------~~~~~~~~~~l 253 (547) -..+++.+.... 
..-+.+|.++...++..+.-=.-....-...+.++.+..+ +.+ .+ ...+...- T Consensus 269 H~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~~~l--~~dla~aL~~g-~~~~l~~R 345 (698) T protein:vir:10 269 HATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGI--LMDLAQALTPG-ANVDLSMR 345 (698) T ss_pred cceeEEEecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHH--HHHHHHhcCCh-hhHHHHHH Confidence 333333333221 2335678898888876643211111111122222222211 111 11 12222222 Q ss_pred HHHHHHHhcC-eeEEecC-CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhc---cCccccccchhHHHHHHH Q lcl|NC_020866. 254 FRAVANIAGD-CAAIIPE-TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTT---TDAVAGGHAVSQEHRLVQ 328 (547) Q Consensus 254 ~~al~~~~~~-~~~vip~-~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlt---s~~~ggs~a~~~vh~~v~ 328 (547) .+.+..+.++ +..++.+ +.+++.+..+ .+| ...++...-.+||-+ .+-.+| +.+-.|=+|.|+.-..+. T Consensus 346 ~eli~~~Rsn~G~~llDk~~Eefeq~st~-lSG----LddVi~qf~q~VAga-a~IPltkLfGqSPkGlNATGE~D~rnY 419 (698) T protein:vir:10 346 AELINRYRDNRNILFLDKATEEFFQFNTP-LSG----LDALQAQAQEQMSAV-SHIPLIKLLGITPTGLNASSEGEIRVW 419 (698) T ss_pred HHHHHHhcCccceEEEecCCcceEEEecC-cCC----HHHHHHHHHHHHHhh-hcCchhhhhccCCcccCccchhhHHHH Confidence 3455555554 4456774 6778877643 344 455566555666644 333332 223345556777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCH-------HHHHHHHHHHHhCCCCCCHHHHHHHh Q lcl|NC_020866. 
329 EDIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDL-------TKLASSLDTLVRLGMEIEENEVRSRF 401 (547) Q Consensus 329 ~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~-------~~~a~~~~~l~~~G~~v~~~~~~e~~ 401 (547) -+.+++....-...+-+.++.-|..--||....-..|+|....+-+- ++.|++++.+.+.|+ |+.++++.++ T Consensus 420 YD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gv-I~~~evr~rL 498 (698) T protein:vir:10 420 YDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQV-IRPDQVAARL 498 (698) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcC-CCHHHHHHHH Confidence 77777765433333223355544444465422223344433222222 445677888889987 9999999987 Q ss_pred CCCC----------------CCCcc--hhccc--ccccccccccccccccccCCccccccccccccchh--hhh-h---- Q lcl|NC_020866. 402 GFSQ----------------PKPGA--KLLRP--LENPATQAAPPGMDPGAGDLKTAERKLKPLSGQAD--LSR-G---- 454 (547) Q Consensus 402 Glp~----------------p~~~e--~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~-~---- 454 (547) .-.+ |++++ .++.. .....+....+..+.+.....+.+++........+ .+. . T Consensus 499 ~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 578 (698) T protein:vir:10 499 NTEPDGPYAGKLDANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGARAGATAPPAAANVNANANPREAGAQDAAM 578 (698) T ss_pred hccCCCccccccCCcccCCCCCCCcchHHHhhhcCCcCCCCcccccccccccCCCCCCcccccccCCCCccccCccccee Confidence 4210 01110 11100 00000001111111111111111111000000000 000 0 Q ss_pred -----cccch-------h-----hhcccccccchhHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHccCCHHHHH Q lcl|NC_020866. 455 -----DLSLN-------S-----EMPSTGQKTGGAEIAALTDQLELEGQPEIVAMIEEIR-------TMLEAASSLGEFS 510 (547) Q Consensus 455 -----~~~~~-------~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~-------~~l~~a~s~ee~~ 510 (547) .+... + .++.......+...+++...+.++..-...-.+.++- .++...... 
++ T Consensus 579 ~a~giv~~~g~~vLL~~r~~g~W~lPgG~ie~GEt~~~aa~RE~~EEtG~~~~~~l~~~g~~de~~~~f~ad~~p~-~~- 656 (698) T protein:vir:10 579 RAAGIVFRAGDKVLLMKRPAGDWGLPAGKVEDGETPEEAARRETLEETGHAGDYVLAPLGKYDEFFHAFVADVNPF-DV- 656 (698) T ss_pred eEEEEEEEcCCeEEEEEecCCCcccCccccCCCCCHHHHHHHHHHhhcccccchhhhcccccceEEEEEEEEecCc-ce- Confidence 00000 0 0111111111222222222222211100000011000 000011000 00 Q ss_pred HHHHH---HhcCCCHHHHHHHHHHHHH-----HHHHHhHHhhc Q lcl|NC_020866. 511 DMLDA---RYGEIGASQLASRISAAML-----ASEFAGREGAL 545 (547) Q Consensus 511 ~~L~~---l~~~~~~~~l~~~l~~al~-----~A~~~G~~~~~ 545 (547) .|.. .|.=+++++|-+-|.+... +...+-+.+.. T Consensus 657 -~l~dEh~~~~WfdpdeLP~pL~~~~~~ii~~A~kald~~~~~ 698 (698) T protein:vir:10 657 -ELNDEHTAFDWFDPDELPHPLHRDTAAIVDAACKALDRLERA 698 (698) T ss_pred -eeccccccccccChHhcccccCHHHHHHHHHHHHHHhhhcCC Confidence 0000 0000233343332222210 11111111111 No 225 >protein:vir:572 Length: 506 # NCBI annotation: unknown # Family: family:all:6660 # MgeID: mge:13 # MgeName: SPBc2 # Cross-refs: genbank:acc:NP_046607;genbank:gi:9630180;genbank:GeneID:1261432 Probab=89.68 E-value=0.025 Score=29.37 Aligned_cols=425 Identities=14% Similarity=0.084 Sum_probs=187.8 Q ss_pred eeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHH----HHHHHhhcCCCHHHHHH----HHHHHHhhhHHH Q lcl|NC_020866. 7 LLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRL----AQILRAADHGDPVRYLD----LAEVIEERDPHY 78 (547) Q Consensus 7 ~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~----~~ilr~a~~g~~~~~~~----L~~~m~~~D~~v 78 (547) ++.....-++...+.+-+-.- .+-+..+..++.+...+-... ..-|+.-.. 
++..+.+ |..-+.-.|+.| T Consensus 1 mvTl~K~~i~~E~~~~~lN~Y-~TY~~~F~~GFi~~~~~NG~v~~i~~~~L~~~F~-NPD~~~~~I~~L~~Y~YI~~~~i 78 (506) T protein:vir:57 1 MVTLNKVDIESEEYKQMLNDY-STYTSTFASGFISNMFSNGIVTEIEAEQLKNYFS-NPDEFQEEIEDLAQYFYISTAEI 78 (506) T ss_pred CceeechhccHHHHHHHHhhh-hHHHHHHHHHHHHHhhcCCceeeeeHHHHHhhhc-ChHHHHHHHHHHHHHhhhhcchH Confidence 334444556666665554321 111223444444332211110 112222221 1222221 222122257777 Q ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhhh---------hc-------CeeeeeE Q lcl|NC_020866. 79 LGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDWLKREELQQEVFHILDCIS---------KG-------YSFTEIV 142 (547) Q Consensus 79 ~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~i~~~l~a~~---------~G-------~sv~Ei~ 142 (547) ....+ ...++-.+++.|..-+-+....+..-.+...|+.+...++.+++|.-+- .| |-+-|+- T Consensus 79 ~QL~~-LI~aLP~L~Y~I~~~~k~K~~~~~iS~lN~~L~Kv~HK~LTRDLL~Q~A~aGTLvG~WLG~~k~PY~~iF~~iK 157 (506) T protein:vir:57 79 HQLFE-LIEALPTLNYKIDSFNKVKSSDKHISLLNKSLHKVKHKRLTRDLLKQVATAGTLVGIWLGDAKSPYPFIFDEIK 157 (506) T ss_pred HHHHH-HHHhcCCcceeehhhhhccchhhHHHHHHHHHHHHHHHHHHHHHHHHhhccCceeEeeecCCCCcchhhhhhhh Confidence 76554 4456667788886544444445556667777777777777777774222 22 1122222 Q ss_pred E----eecCCeeeEEeeEeeccceeEecc-cCCcce------eeccC----------CCCceeeccCceEEEe-e----c Q lcl|NC_020866. 143 W----NTSEGQYFPDKLIWRDPRHFRFQK-SDLATP------LLLDD----------NGAEVPLPAFRFIFAD-I----P 196 (547) Q Consensus 143 w----~~~~g~~~~~~l~~~~~~~~~~~~-~~~~~~------~~~~~----------~~~g~~l~~~k~i~~~-~----~ 196 (547) | .+.+|.|+. .++-.||.--. +.+... +..+. 
.-.-..+|..+.++.| | + T Consensus 158 YVFP~~R~~G~~V~----VvD~~~F~~~~~~~R~~~~~~LSP~I~~~~Y~~~~~~~~~~R~~~LP~~rT~~~R~~TL~RN 233 (506) T protein:vir:57 158 YVFPSFRRNGDWVC----VVDMELFTKYKDDQRNELLKSLSPYIKQSDYENFMKDREKYRFKELPQERTFPLRTGTLKRN 233 (506) T ss_pred hhccccccCCceEE----EEehHHhhhhhHHHHHHHHHhhhhhhhhhhhhhHhhhHHhhhhhhcccccchhheeeeeccc Confidence 2 223444432 23444443211 111111 11110 0111345555544322 2 2 Q ss_pred CCcCccccchhHHHHHHHHHHHHHHHHHHHH-HHHH--cCCCeEEEeCCCCC--------CHHHHHHHH----HHHHHHh Q lcl|NC_020866. 197 AKSGILLRSGLARVAAWGWMFKAFTARDWAI-FTQT--YGQPLRIGKWQPGA--------SEQDKDTLF----RAVANIA 261 (547) Q Consensus 197 ~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~-f~E~--~G~P~~vgk~~~~~--------~~~~~~~l~----~al~~~~ 261 (547) -+-|.|||..-|--+ .-|+- +++.-+ .+.+ ..|-++....+.+. .+..++++. +||..-. T Consensus 234 Q~LG~~~~T~~L~Dv----~HK~K-LkD~E~SIA~KII~A~AVL~~~~~~~Ngeyt~~K~~~a~K~Ki~~GVK~ALEK~~ 308 (506) T protein:vir:57 234 QGLGTSWVTPGLYDV----LHKKK-LKDVERSIANKIINAVAVLTIGTDKGNGEYTNMKLPKAVKQKIHGGVKTALEKNQ 308 (506) T ss_pred ccccccccchhHHHH----HHHHH-HHHHHHHHHHHHhhhheeeeeecccCCcccccccchHHHHHHHHHHHHHHHhccc Confidence 345566766433322 22322 222221 2222 23333322222221 223344433 3444334 Q ss_pred cC--eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHH-hcchhccCccccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 262 GD--CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAV-LGQTTTTDAVAGGHAVSQEHRLVQEDIETSDAMA 338 (547) Q Consensus 262 ~~--~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~i-lGqtlts~~~ggs~a~~~vh~~v~~~~~~ad~~~ 338 (547) .| +++.+|.-.+|+|-+.+..+-.+.-| +..|..|..+- +.|.|+. +++|.||.+....++|-..+.--+.. 
T Consensus 309 KDGv~~vs~PDFA~~~FP~vK~~~LD~~K~----D~I~~DI~~A~GlS~~L~N-G~~GNYAts~LNLD~FYKrIGV~~E~ 383 (506) T protein:vir:57 309 KDGVTVVSIPDFADINFPDVKADGLDGAKF----DHINSDIQSAYGLSGSLLN-GDGGNYATSSLNLDTFYKRIGVLMED 383 (506) T ss_pred ccCeEEEecccccccccccccccCCCchhh----cccchhhhhhhccchheec-CCCcceeeeechHHHHHHHHHHHHHH Confidence 44 45678999999997776654433334 56777786662 4566644 46889999999999998888777777 Q ss_pred HHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCCCHHHHHH-HhCCCCCCCcchhc---- Q lcl|NC_020866. 339 LAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEIEENEVRS-RFGFSQPKPGAKLL---- 413 (547) Q Consensus 339 i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v~~~~~~e-~~Glp~p~~~e~~~---- 413 (547) |+.-+-++|+..++--- ..-.-.|.++...+..+++..+.+-+|-..|+ +..++-+ ..|+....--|+.+ T Consensus 384 IEqEvY~~L~~lvL~~~---~~~NY~~~Y~KD~Pl~~~~K~D~LIKL~~~G~--S~K~V~Dnl~GvS~E~Y~E~tlYE~E 458 (506) T protein:vir:57 384 IEQEVYQKLFNLVLPAA---QKDNYYMNYDKDKPLTLKEKMDILIKLNDKGW--SIKHVVDNLAGVSWESYLEQTLYETE 458 (506) T ss_pred HHHHHHHHHHHhhcccc---cCCceeEeeCCCCccchhhhhchheeecccCc--cHHHHHHhhhccchHHHHHHHHHHHH Confidence 77444445665442211 11223677788888888888888888888886 3333333 33443322111111 Q ss_pred --------ccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccchhHH Q lcl|NC_020866. 414 --------RPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEI 474 (547) Q Consensus 414 --------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 474 (547) .+....-.-.+..-..|.. +....++.--+..+. 
..+..+ T Consensus 459 ~LKL~EKI~P~~~s~~~tGN~vG~P~~---------~~~~~D~Tv~Satsn------------gndnpi 506 (506) T protein:vir:57 459 ELKLQEKIRPYQTSYTFTGNEVGRPNE---------GNKNNDNTVKSATSN------------GNDNPI 506 (506) T ss_pred HhhHHhhcCcccccceecccccCCCCC---------CCCcccchhhhcccC------------CCCCCC Confidence 1111100000000000000 000000000000000 000001 No 226 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=86.85 E-value=0.043 Score=28.09 Aligned_cols=419 Identities=12% Similarity=0.101 Sum_probs=173.4 Q ss_pred CCCCCeeeCCCCCcccch-----hhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRA-----QLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERD 75 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~-----~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D 75 (547) ||.+.+ .|..=+.. .|...-. ..-..++-+..|.. |. .+.....+.......+| | T Consensus 1 m~~~~~----~~~~~~~~k~r~~~l~~~R~--~~e~~w~e~~~~~l----P~----~~~~~~~~~~~~~~~~~------d 60 (535) T protein:vir:15 1 MADSKR----TGLGEDGAKATYDRLTNDRR--AYETRAENCAQYTI----PS----LFPKESDNESTDYTTPW------Q 60 (535) T ss_pred CCccch----hccchHHHHHHHHHHHHHhh--HHHHHHHHHHHHhc----cc----ccCCCCCcccccccccc------c Confidence 996431 12111110 1111100 01111122233321 11 11111122222222333 6 Q ss_pred HHHHHHHHHHHHHHhcC-----Cc-eEecCC--------CCHHHHHHH-------HHHHHHHhCCCHHHHHHHHH-Hhhh Q lcl|NC_020866. 76 PHYLGVIGTRKRSVSQL-----DI-TVEPAS--------DDPQDVKIA-------DMVRDWLKREELQQEVFHIL-DCIS 133 (547) Q Consensus 76 ~~v~s~l~~Rk~~v~~~-----~w-~v~p~~--------~~~~~~~~a-------e~v~~~l~~~~~~~~i~~~l-~a~~ 133 (547) ++-.-++++....+.+. +| ++.+.+ +..+..++. +.+...|+..+|...+.+++ +-+. 
T Consensus 61 st~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~ 140 (535) T protein:vir:15 61 AVGARGLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIV 140 (535) T ss_pred ccHHHHHHHHHHHHHHhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHh Confidence 66666666666665543 33 222221 111111122 23334466678877777776 4456 Q ss_pred hcCeeeeeEEeecCCeeeEEeeEeeccceeEecc---------------------------------------------- Q lcl|NC_020866. 134 KGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQK---------------------------------------------- 167 (547) Q Consensus 134 ~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~---------------------------------------------- 167 (547) +|-++.=+.+.. ++... +..+|=+.|.+.. T Consensus 141 ~G~a~l~~~~~~-~~~~~---f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~ 216 (535) T protein:vir:15 141 AGNALLYLPEPE-GSYNP---MKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTH 216 (535) T ss_pred hCceeEEeecCC-CCcee---eEEEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEE Confidence 777654332221 11110 0001101111100 Q ss_pred ----cCCcce-eeccCCC-------CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_020866. 168 ----SDLATP-LLLDDNG-------AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQP 235 (547) Q Consensus 168 ----~~~~~~-~~~~~~~-------~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P 235 (547) .++..+ ....-.+ .+.++..+-|++.++...+|+.||.|....++.-..--+...+.-+..+++---| T Consensus 217 v~~~~~~~~~~~~~e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p 296 (535) T protein:vir:15 217 VYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKV 296 (535) T ss_pred EEEecCCCcEEEEEEeeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 010011 1100000 1112223458899999999999999999999999888888888888999887777 Q ss_pred eEEEeCCCCCCHHHHHHHHHHHHHHhc-C-eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCc Q lcl|NC_020866. 
236 LRIGKWQPGASEQDKDTLFRAVANIAG-D-CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDA 313 (547) Q Consensus 236 ~~vgk~~~~~~~~~~~~l~~al~~~~~-~-~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~ 313 (547) ..++.-+ +.... .++.. . +.++-....++..++... ++....-...|+.+...|+.+.+...++.. T Consensus 297 ~~lv~~~-g~~~~---------~~l~~~~~g~~v~g~~~~v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af~~~~~~~~- 364 (535) T protein:vir:15 297 IGLVNPA-GITQP---------RRLTKAQTGDFVPGRREDIDFLQLEK-QADFTVAKAVSDQIEARLSYAFMLNSAVQR- 364 (535) T ss_pred ceeeccc-ccccc---------hhcccCCceeeecCCcccceeeeccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccC- Confidence 6554322 11111 12211 2 233333445667765443 345556678899999999999866544332 Q ss_pred cccccchhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhcCC--------C--cccccEEEecCc-----cccCHH Q lcl|NC_020866. 314 VAGGHAVSQEHRLVQE--DIETSDAMALAAILNRDLIRPWIQLEYG--------P--QKRYPRIKIARP-----KTEDLT 376 (547) Q Consensus 314 ~ggs~a~~~vh~~v~~--~~~~ad~~~i~~~ln~~li~~l~~~Nfg--------~--~~~~p~~~~~~~-----~~ed~~ 376 (547) +++..-..||+....+ ..+-.-. +.|+..++.||+..-|. | +...-++++... ...++. T Consensus 365 ~~~r~TAtEV~~r~~E~~~~LG~v~----~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~ 440 (535) T protein:vir:15 365 TGERVTAEEIRYVASELEDTLGGVY----SILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLD 440 (535) T ss_pred CCccccHHHHHHHHHHHHHHHhHHH----HHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHH Confidence 2333445666654332 2222322 23333344444433221 1 111113333221 122334 Q ss_pred HHHHHHHHHHhCCC----C-CCH----HHHHHHhCCCCC---CCcchhcccccccccccccccccccccCCccccccccc Q lcl|NC_020866. 377 KLASSLDTLVRLGM----E-IEE----NEVRSRFGFSQP---KPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKP 444 (547) Q Consensus 377 ~~a~~~~~l~~~G~----~-v~~----~~~~e~~Glp~p---~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (547) .+...++.+..++= + |+. +++.+.+|+|.. +..|++.....+.....+........+....+.+.. 
T Consensus 441 ~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~~~~~~a~~~g~~~~~~~~~-- 518 (535) T protein:vir:15 441 KLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGIENAAATGGAGVGALATS-- 518 (535) T ss_pred HHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhcc-- Confidence 44445555555431 1 333 467788899842 333433222111111100000000000000000000 Q ss_pred cccchhhhhhcccchhhhcccccccchhHHHHHHHHHHHHHH Q lcl|NC_020866. 445 LSGQADLSRGDLSLNSEMPSTGQKTGGAEIAALTDQLELEGQ 486 (547) Q Consensus 445 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 486 (547) ..+....++++.-.++. T Consensus 519 -------------------------~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 519 -------------------------SPEAMQGAAAQAGLDAT 535 (535) T ss_pred -------------------------ChHHHHHHHhccCCCCC Confidence 01111122221110000 No 227 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=84.24 E-value=0.062 Score=27.20 Aligned_cols=413 Identities=12% Similarity=0.127 Sum_probs=164.5 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhh--------------hHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTG--------------VRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLD 66 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~--------------~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~ 66 (547) || .+++++.+|..+... .++-+..|. .|. .....+.+....... T Consensus 1 ~~--------------~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~----lP~----~~~~~~~~~~~~~~~ 58 (535) T protein:vir:94 1 MA--------------SSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYT----IPS----LFPKDSDNASTDYTT 58 (535) T ss_pred CC--------------chhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHh----ccc----cCCCCCCccccccCC Confidence 55 333444333322111 111112221 110 111112222222222 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHhcC-----Cc-eEecCCC-------C-HHHHHHHHH-------HHHHHhCCCHHHHH Q lcl|NC_020866. 
67 LAEVIEERDPHYLGVIGTRKRSVSQL-----DI-TVEPASD-------D-PQDVKIADM-------VRDWLKREELQQEV 125 (547) Q Consensus 67 L~~~m~~~D~~v~s~l~~Rk~~v~~~-----~w-~v~p~~~-------~-~~~~~~ae~-------v~~~l~~~~~~~~i 125 (547) +| |++-.-++++....+.+. +| ++.+.+. + .+..++.++ +...|++.+|..-+ T Consensus 59 ~~------dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~ 132 (535) T protein:vir:94 59 PW------QAVGARGLNNLASKLMLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTL 132 (535) T ss_pred cc------cccHHHHHHHHHHHHHhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHH Confidence 32 666666777666666543 45 2222210 1 111112222 23335566787777 Q ss_pred HHHH-HhhhhcCeeeeeEE------------------eecC-CeeeEEeeEee--------cc---------------ce Q lcl|NC_020866. 126 FHIL-DCISKGYSFTEIVW------------------NTSE-GQYFPDKLIWR--------DP---------------RH 162 (547) Q Consensus 126 ~~~l-~a~~~G~sv~Ei~w------------------~~~~-g~~~~~~l~~~--------~~---------------~~ 162 (547) .+++ +-+.+|-+++=+.+ ..+. |.+ ..+..+ ++ .+ T Consensus 133 ~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~y~v~~d~~G~v--d~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 210 (535) T protein:vir:94 133 FETLKQLVVAGNALLYIPEPEGTYNPMKLYRLSSYVVQRDAFGTV--LQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEM 210 (535) T ss_pred HHHHHHHHhhCcEeEeeccCcCcccceEEEEcCeEEEeeCCCCCe--EEEEeeeeccHHHhhHHHHHHHHhccccCCCce Confidence 6665 44456766443222 1111 111 000000 00 00 Q ss_pred eE-----ecccCCc-ceeeccCCCCce-------eeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 163 FR-----FQKSDLA-TPLLLDDNGAEV-------PLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFT 229 (547) Q Consensus 163 ~~-----~~~~~~~-~~~~~~~~~~g~-------~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 229 (547) +. +...+.. ......-.+..+ .+..+-|++.|+...+|+.||.|....++.-..--+...+.-+... 
T Consensus 211 v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~ 290 (535) T protein:vir:94 211 IDVYTHIYLDEESGEYLKYEEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMS 290 (535) T ss_pred eEEEEEEEeeCCCCcEEEEEEecCeeeccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 00 0000100 111111011101 2223468999999999999999999999988777776777777777 Q ss_pred HHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHh-cCeeEEec-CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_020866. 230 QTYGQPLRIGKWQPGASEQDKDTLFRAVANIA-GDCAAIIP-ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQ 307 (547) Q Consensus 230 E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~-~~~~~vip-~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGq 307 (547) ++---|+.+..- .+.... .++. ...+.++| ...+|.+++... ++....=...|+.+...|..+.+-. T Consensus 291 ~~a~~~~~lv~p-~g~~~~---------~~~~~~~~g~~v~g~~~~v~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~ 359 (535) T protein:vir:94 291 MISAKVIGLVNP-AGITQV---------RRLTKAQTGDFVSGRPEDISFLQLEK-AADFSVARAVSEQIEGRLSYAFMLN 359 (535) T ss_pred HHhccCCccccc-ccccch---------hhcccCCCceeecCCcccceeeeccc-ccchhHHHHHHHHHHHHHHHHHhHh Confidence 765444433332 121111 1221 12233444 345566766443 3444445678899999999998644 Q ss_pred hhccCccccccchhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhcCC--------C--cccccEEEecCc----- Q lcl|NC_020866. 308 TTTTDAVAGGHAVSQEHRLVQED--IETSDAMALAAILNRDLIRPWIQLEYG--------P--QKRYPRIKIARP----- 370 (547) Q Consensus 308 tlts~~~ggs~a~~~vh~~v~~~--~~~ad~~~i~~~ln~~li~~l~~~Nfg--------~--~~~~p~~~~~~~----- 370 (547) .++.. ++...-..||+....+. .+-. +-+.|+..++-||+..-|. | +...-+..+... 
T Consensus 360 ~~~~~-d~~rvTAtEV~~r~~E~~~~LGp----v~~rl~~ElL~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~la~l~ 434 (535) T protein:vir:94 360 SAVQR-TGERVTAEEIRYVASELEDTLGG----VYSILSQELQLPMVRVLLKQLQATNQIPELPKEAVEPTISTGMEALG 434 (535) T ss_pred hhccC-CCCCccHHHHHHHHHHHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEeehHHHHH Confidence 33322 33344557776544332 2222 2222333344444333221 1 111112222111 Q ss_pred cccCHHHHHHHHHHHHhCC---C-C-CCH----HHHHHHhCCCCC---CCcchhcccccccccccccccccccccCCccc Q lcl|NC_020866. 371 KTEDLTKLASSLDTLVRLG---M-E-IEE----NEVRSRFGFSQP---KPGAKLLRPLENPATQAAPPGMDPGAGDLKTA 438 (547) Q Consensus 371 ~~ed~~~~a~~~~~l~~~G---~-~-v~~----~~~~e~~Glp~p---~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (547) ...++..+...++.+.++| + + |+. +++.+.+|+|.+ +..+++-..........+........+..... T Consensus 435 r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~ 514 (535) T protein:vir:94 435 RGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGT 514 (535) T ss_pred HHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 1123344444555555554 1 1 333 567778899843 34444332221111111000000000000000 Q ss_pred cccccccccchhhhhhcccchhhhcccccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 439 ERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIAALTDQLELEGQPEIVAM 492 (547) Q Consensus 439 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 492 (547) .....+.. -....++ ..+.+. 
T Consensus 515 ~~~~~~~~---------------------------~~~~~~~------~g~~~~ 535 (535) T protein:vir:94 515 MATASPEN---------------------------MKAAAAQ------AGMAPN 535 (535) T ss_pred ccccChHH---------------------------HHHHHHH------hccCCC Confidence 00000000 0000000 001111 No 228 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=82.14 E-value=0.079 Score=26.61 Aligned_cols=516 Identities=11% Similarity=-0.001 Sum_probs=197.1 Q ss_pred CCCCCeeeCCCCCccc----chhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcC--------------CCHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQ----RAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADH--------------GDPV 62 (547) Q Consensus 1 ma~~~~~~~~~g~~~~----~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~--------------g~~~ 62 (547) .|+..-+---+--|+. +-.|++..-. . ..+-+|.. +++..-+.+ -.+. T Consensus 46 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~------------~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~F~ 111 (695) T protein:vir:36 46 FARRGALNALDAAPVVEPSPSLRLARQFEV-D------------VSNYTPRE-RRAASYALDFNGTSMDALSFVTSSGFP 111 (695) T ss_pred hhhcccccccccccccCCCcccccceecee-c------------ccccCccc-cchhhhhhcccccccccchhhhccCcc Confidence 1111111111111111 1111111100 0 00111111 011111111 0111 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHhcCCceEec-CCC---------------CHHHHHHHHHHHHHHhCCCHHHHHH Q lcl|NC_020866. 63 RYLDLAEVIEERDPHYLGVIGTRKRSVSQLDITVEP-ASD---------------DPQDVKIADMVRDWLKREELQQEVF 126 (547) Q Consensus 63 ~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~w~v~p-~~~---------------~~~~~~~ae~v~~~l~~~~~~~~i~ 126 (547) -|--| ..|. .-+.++++....-...++. |.-.- +.. 
+..+.+..+.+...++++...+.++ T Consensus 112 Gy~~l-a~la-Q~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~ 188 (695) T protein:vir:36 112 GFPTL-VLLA-QLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVR 188 (695) T ss_pred hHHHH-HHHh-hccchhhHHHHHHHHhhcc-cceecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHH Confidence 22222 1222 4566777776666665544 62211 111 1111144566777788887666666 Q ss_pred HHH-HhhhhcCeeeeeEEeecCC--------------eeeEEeeEeeccceeEecccCCcceeecc------CCCCceee Q lcl|NC_020866. 127 HIL-DCISKGYSFTEIVWNTSEG--------------QYFPDKLIWRDPRHFRFQKSDLATPLLLD------DNGAEVPL 185 (547) Q Consensus 127 ~~l-~a~~~G~sv~Ei~w~~~~g--------------~~~~~~l~~~~~~~~~~~~~~~~~~~~~~------~~~~g~~l 185 (547) +.+ .+-+||-+++=+.=.-++. .-.++.|..++|.|+.....+....+-.+ -...|..+ T Consensus 189 eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G~kI 268 (695) T protein:vir:36 189 TTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIGTEV 268 (695) T ss_pred HHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEeceEE Confidence 666 6999999874333221111 11233466667766654322111100000 00011222 Q ss_pred ccCceEEEeecC------CcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCC-----CCCCHHHHHHHH Q lcl|NC_020866. 186 PAFRFIFADIPA------KSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQ-----PGASEQDKDTLF 254 (547) Q Consensus 186 ~~~k~i~~~~~~------~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~-----~~~~~~~~~~l~ 254 (547) -..+++.+.... ..-+.+|.++...++..+.-=.-....-...+.++.+..+ |.+ .+..+.+...-. 
T Consensus 269 H~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~l--k~dla~aL~~g~~~~l~~R~ 346 (695) T protein:vir:36 269 HATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGI--LMDLAQALMPGANVDLSMRA 346 (695) T ss_pred eeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhHHHH--HHHHHHhhcChhHHHHHHHH Confidence 223333332221 2335678888888775533211111111122222222211 111 011222333334 Q ss_pred HHHHHHhcC-eeEEecC-CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhc---cCccccccchhHHHHHHHH Q lcl|NC_020866. 255 RAVANIAGD-CAAIIPE-TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTT---TDAVAGGHAVSQEHRLVQE 329 (547) Q Consensus 255 ~al~~~~~~-~~~vip~-~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlt---s~~~ggs~a~~~vh~~v~~ 329 (547) +.+..+.++ +..++.+ +.+++.+..+ -+| ...++...-.+||-+ .+-.+| +.+-.|=+|.|+.-..+.- T Consensus 347 eli~~~Rsn~G~~llDk~~Eefeq~sts-lSG----LddVi~qf~q~VAga-a~IPltkLfGqSPkGlNATGE~D~rnYY 420 (695) T protein:vir:36 347 ELINRYRDNRNILFLDKATEEFFQFNTP-LSG----LDALQAQAQEQMSAV-SHIPLIKLLGITPTGLNASSEGEIRVWY 420 (695) T ss_pred HHHHHhcCccceEEEecCCcceEEEecc-cCC----HHHHHHHHHHHHHhh-hcCchhhhhccCcccccccchhhHHHHH Confidence 555566555 4556774 6788877643 344 455565555666644 444332 2233455677777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccccCH-------HHHHHHHHHHHhCCCCCCHHHHHHHhC Q lcl|NC_020866. 330 DIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKTEDL-------TKLASSLDTLVRLGMEIEENEVRSRFG 402 (547) Q Consensus 330 ~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~ed~-------~~~a~~~~~l~~~G~~v~~~~~~e~~G 402 (547) +.+++.....-..+-+.++.-|..--||....-..|+|....+-+- ++.++.++.+++.|+ |+.++++.++. 
T Consensus 421 D~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gv-I~~~evr~rL~ 499 (695) T protein:vir:36 421 DYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQV-IRPDQVAARLN 499 (695) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcC-CCHHHHHHHHh Confidence 7777765543333334455544444466432223444443222222 445677888889997 99999999964 Q ss_pred CCC----------------CCCcc-hhc---ccccccccccccccccccccCCccccccccc-----cccchhhhhhccc Q lcl|NC_020866. 403 FSQ----------------PKPGA-KLL---RPLENPATQAAPPGMDPGAGDLKTAERKLKP-----LSGQADLSRGDLS 457 (547) Q Consensus 403 lp~----------------p~~~e-~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~ 457 (547) -++ |.+++ +.. .......+..++++...+....++....-.. .++..+..-.... T Consensus 500 ~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~~~~~~aag 579 (695) T protein:vir:36 500 TEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVNPREAGAQDAAMRAAG 579 (695) T ss_pred cCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCcccccccCCCcccccccccCccccCCCCccceeeE Confidence 321 11111 000 0011111111111111111111111110000 0111111100000 Q ss_pred -----------chh-----hhcccccccchhHHHHHHHHHHHHHHHHHH----H--HHHHHH-HHHHccC------CHHH Q lcl|NC_020866. 458 -----------LNS-----EMPSTGQKTGGAEIAALTDQLELEGQPEIV----A--MIEEIR-TMLEAAS------SLGE 508 (547) Q Consensus 458 -----------~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~----~--~~~~i~-~~l~~a~------s~ee 508 (547) ..+ .++...-...+...+++...+.++..=... . .+..+. .++.... +-|- T Consensus 580 ~v~~~~g~vLl~kr~~g~W~lPgG~vE~gEt~~~aa~RE~~EEtGl~~~~el~~~g~~~~~~~~f~~~~e~~~~~l~dEh 659 (695) T protein:vir:36 580 AVYVVDGKVLLMKRPAGDWGLPAGKVEGNETPEEAARRETREETGYDHDGELVPLGKFDGFFHAFVAHLEPFDVELNDEH 659 (695) T ss_pred EEEEeCCEEEEEEecCCCccCCccccCCCCCHHHHHHHHHHHHhCCccccceeeeeeecceEEEEEEeecccCcccCchh Confidence 000 011111122223333333333222110000 0 000100 0110001 1111 Q ss_pred HHHHHH--HHhcCCCHHHHHHHHHHHHHHHHHHhHH Q lcl|NC_020866. 
509 FSDMLD--ARYGEIGASQLASRISAAMLASEFAGRE 542 (547) Q Consensus 509 ~~~~L~--~l~~~~~~~~l~~~l~~al~~A~~~G~~ 542 (547) ..-+.. +-+|.-=....++.|..|..+=+-.++. T Consensus 660 ~~~~Wf~pdeLP~pL~~~~a~vl~aa~k~l~~~~~~ 695 (695) T protein:vir:36 660 TAFDWFNPDELPHPLHRDTAAIVDAACKALDRLERA 695 (695) T ss_pred hhcccCChhhcCccccHHHHHHHHHHHHHHHhhccC Confidence 111111 1123222334556776666665666665 No 229 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=80.74 E-value=0.092 Score=26.26 Aligned_cols=395 Identities=9% Similarity=-0.035 Sum_probs=155.2 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHH----------------H Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVR----------------Y 64 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~----------------~ 64 (547) |.+---|.- .|--|..+.-..+-.. ...-+..+.. -....+|.++.-.++. + T Consensus 1 ~~~~~~~~~-~~~~m~V~~~hp~y~a--~~~~W~~~~d---------~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~ 68 (488) T protein:vir:96 1 MLKCLYIKH-RGFFMLTPIYHPDYLV--NAPQWLRNLD---------CVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTAL 68 (488) T ss_pred CceeEEEee-cceeecccccCHHHHH--HhhhhhHhhh---------hhhHHHHHhhhhcCCCCCCccccccCcchhhhh Confidence 665333322 2222211111111111 1111111110 0112233322211111 0 Q ss_pred ----HHHHHHHHh----hhHHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHH-HhCCCHHHHHHHHH-Hhhhh Q lcl|NC_020866. 65 ----LDLAEVIEE----RDPHYLGVIGTRKRSVSQLDITVEPASDDPQDVKIADMVRDW-LKREELQQEVFHIL-DCISK 134 (547) Q Consensus 65 ----~~L~~~m~~----~D~~v~s~l~~Rk~~v~~~~w~v~p~~~~~~~~~~ae~v~~~-l~~~~~~~~i~~~l-~a~~~ 134 (547) .+-|+.... --++++-.++.....|.+.++.++-. 
+.+ ++..+++.+ ..+.+++.+++.++ .++.| T Consensus 69 ~~~~~~~y~~~~~~rA~~~n~~~~tl~~l~G~vfrk~p~~~~~-~~~---~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~ 144 (488) T protein:vir:96 69 AAKIEKDWEDLTWRLANYVNIVNPTMNAITGAVMRREPEFDTM-DNP---VLIGLRDNIDGKGNGIDQECKQALNALQWG 144 (488) T ss_pred hccchhhhHhhhhhccccCchhHHHHHHhcchhhccCceeccC-CcH---HHHHHHhccCCCCCCHHHHHHHHHHHHHhc Confidence 111111100 13666667777777777777766521 111 122222211 12346888888888 47778 Q ss_pred cCeeeeeEEe--------------------------------ecCCeeeEEeeEee------cccee------Ee--ccc Q lcl|NC_020866. 135 GYSFTEIVWN--------------------------------TSEGQYFPDKLIWR------DPRHF------RF--QKS 168 (547) Q Consensus 135 G~sv~Ei~w~--------------------------------~~~g~~~~~~l~~~------~~~~~------~~--~~~ 168 (547) |.+.+=+-+- ..+|...+..+..+ ++-.+ ++ -.+ T Consensus 145 G~~~ilVD~P~~~~T~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~ 224 (488) T protein:vir:96 145 SRCGWLVRSHPESATMADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVD 224 (488) T ss_pred CeEEEEEecCCCcCCHHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEEC Confidence 8775443331 12332222222221 11000 00 001 Q ss_pred CCcceeeccCCCCce----------eeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEE Q lcl|NC_020866. 169 DLATPLLLDDNGAEV----------PLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRI 238 (547) Q Consensus 169 ~~~~~~~~~~~~~g~----------~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~v 238 (547) +..+.........+. .++.=-|+++.. 
...+-..+.+.|..++..-+--.....+.-..+..-+.|+.+ T Consensus 225 g~~~v~~~~~~~~~~e~~~~~~g~~~l~~IP~v~~~~-~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv 303 (488) T protein:vir:96 225 GLCEFQEVTDDEYSDEWTPVLINSKQSDTIPFFLASS-QSNEWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWM 303 (488) T ss_pred cEEEEEEEecCCcccceEeecCCCcccCeeEEEEEec-CCCCCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceee Confidence 111111111111111 111111333322 223334455555566555432222333333344555777666 Q ss_pred EeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCcccccc Q lcl|NC_020866. 239 GKWQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGH 318 (547) Q Consensus 239 gk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~ 318 (547) -..+ +.+++........=..++.......|.| ++++++++. .+ ..++-++.+..+|.. +|.-+.+.+.+-|- T Consensus 304 ~~~~-~~~~~~~~~~~~~g~~~~~~~~~~~~~g-~~~~~e~~~--~~--l~~~~l~~l~~qm~~--~Ga~l~~~~~~~Ta 375 (488) T protein:vir:96 304 VDMG-DMNKTMASEMNPLGFTLAGRMPYYVKNG-DVKVIQAQF--SP--ETENKVEKLFEQAVK--VGASLFTQQSNETA 375 (488) T ss_pred eccC-CCCcccccccccceeeecccccccccCC-ceeecCCch--hH--HHHHHHHHHHHHHHH--HhHhhccCCCcchH Confidence 4432 2233322222111112222333333555 688887653 22 234445555566532 44333222111111 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC----cccccEEEecCc--c-ccCHHHHHHHHHHHHhCCCC Q lcl|NC_020866. 319 AVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLEYGP----QKRYPRIKIARP--K-TEDLTKLASSLDTLVRLGME 391 (547) Q Consensus 319 a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~Nfg~----~~~~p~~~~~~~--~-~ed~~~~a~~~~~l~~~G~~ 391 (547) -............+.+-+..+++.++ ++++++..+--.. ...-++|.+... . ..| ....+.+.++.+.|. 
T Consensus 376 ~~~~~~~~~~~S~L~~~a~~le~al~-~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~~~ld-~~~~~al~~~~~~G~- 452 (488) T protein:vir:96 376 TGAAIRSGSSTASMATLGNNVEDTVR-NMLRFIMRYFEGTNLYVNPDELVFKLNRDYFDVEVN-PQMLQVAYAAMMEGN- 452 (488) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHH-HHHHHHHHHcCCCCCCcCccceEEEeccCCCCccCC-HHHHHHHHHHHhcCC- Confidence 11122223345677788888999996 4889888875211 122245554421 1 223 344666777778886 Q ss_pred CCHHHHHH---HhCCCCCC-Ccchhcccccccccccccccc Q lcl|NC_020866. 392 IEENEVRS---RFGFSQPK-PGAKLLRPLENPATQAAPPGM 428 (547) Q Consensus 392 v~~~~~~e---~~Glp~p~-~~e~~~~~~~~~~~~~~~~~~ 428 (547) |+.+.+.+ +-|+=.|+ +.|+....... .+.+. T Consensus 453 Is~~t~~~~L~~~gvl~~d~~~e~~~~~ie~-----~g~~~ 488 (488) T protein:vir:96 453 LPQVSWFELLKRARVVRGDMSKEEFDEHIAE-----LGFGM 488 (488) T ss_pred CCHHHHHHHHHhCCcCCccCCHHHHHHHHhh-----cCCCC Confidence 77654433 34553222 11222211111 00000 No 230 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=78.68 E-value=0.11 Score=25.80 Aligned_cols=421 Identities=11% Similarity=0.093 Sum_probs=172.4 Q ss_pred CCCCCeeeCCCCCcccchh-hhhhhhhh--hhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQ-LTEEVAAP--TVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPH 77 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~-l~~~~a~~--~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~ 77 (547) ||.+- +.|..=+..+ .=+...+. ..-..++-+..|. .|. 
.+...+.+.......+| |++ T Consensus 1 m~~~~----~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~----lP~----~~~~~~~~~~~~~~~~~------dst 62 (535) T protein:vir:33 1 MADSK----RTGLGEDGAKATYDRLTNDRRAYETRAENCAQYT----IPS----LFPKESDNESTDYTTPW------QAV 62 (535) T ss_pred CChhh----hhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHh----ccc----ccCCCCCcccccccccc------ccc Confidence 88522 2222111110 00001000 0111112222231 121 11111222222222233 666 Q ss_pred HHHHHHHHHHHHhcC-----Cc-eEecCCC--------CHHHHHHH-------HHHHHHHhCCCHHHHHHHHH-Hhhhhc Q lcl|NC_020866. 78 YLGVIGTRKRSVSQL-----DI-TVEPASD--------DPQDVKIA-------DMVRDWLKREELQQEVFHIL-DCISKG 135 (547) Q Consensus 78 v~s~l~~Rk~~v~~~-----~w-~v~p~~~--------~~~~~~~a-------e~v~~~l~~~~~~~~i~~~l-~a~~~G 135 (547) -.-++++....+.+. +| ++.+.+. ..+..++. +.+...|+..+|...+.+++ +-+.+| T Consensus 63 ~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G 142 (535) T protein:vir:33 63 GARGLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAG 142 (535) T ss_pred HHHHHHHHHHHHHHhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhC Confidence 666666666665543 33 2222210 01111122 23334466678877776666 445677 Q ss_pred CeeeeeEEeecCCeeeEEeeEeeccceeEeccc----------------------------------------------- Q lcl|NC_020866. 136 YSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKS----------------------------------------------- 168 (547) Q Consensus 136 ~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~----------------------------------------------- 168 (547) -++.=+.+.. ++.+. +..+|=+.|.+..+ T Consensus 143 ~a~l~~~~~~-~~~~~---f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~ 218 (535) T protein:vir:33 143 NALLYLPEPE-GSYNP---MKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVY 218 (535) T ss_pred ceeEEeecCC-CCcee---eEEEEcCeeEEeeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEE Confidence 7655433321 11111 11111111111111 Q ss_pred ----CCcceeecc--CC-----CCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeE Q lcl|NC_020866. 
169 ----DLATPLLLD--DN-----GAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLR 237 (547) Q Consensus 169 ----~~~~~~~~~--~~-----~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~ 237 (547) +........ +. ..+.++..+-|++.++...+|+.||.|....++.-..--+...+.-+..+++---|.. T Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~ 298 (535) T protein:vir:33 219 LDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIG 298 (535) T ss_pred eeCCCCcEEEEEEEeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 000000000 00 0011222345889999999999999999999999988888888888899988777765 Q ss_pred EEeCCCCCCHHHHHHHHHHHHHHhc-C-eeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccc Q lcl|NC_020866. 238 IGKWQPGASEQDKDTLFRAVANIAG-D-CAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVA 315 (547) Q Consensus 238 vgk~~~~~~~~~~~~l~~al~~~~~-~-~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~g 315 (547) ++.-+ +... ..++.. . +.++-....++..++... ++....-...|+.+...|+.+.+...++.. ++ T Consensus 299 lv~~~-g~~~---------~~~~~~~~~g~~v~g~~~~v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af~~~~~~~~-~~ 366 (535) T protein:vir:33 299 LVNPA-GITQ---------PRRLTKAQTGDFVPGRREDIDFLQLEK-QADFTVAKAVSDQIEARLSYAFMLNSAVQR-TG 366 (535) T ss_pred eeccc-cccc---------hhhcccCCceeeecCCcccceeeeccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccC-CC Confidence 54322 1111 112211 2 233333445667765443 345556678899999999999876544332 23 Q ss_pred cccchhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhcCC--------C--cccccEEEecCc-----cccCHHHH Q lcl|NC_020866. 316 GGHAVSQEHRLVQE--DIETSDAMALAAILNRDLIRPWIQLEYG--------P--QKRYPRIKIARP-----KTEDLTKL 378 (547) Q Consensus 316 gs~a~~~vh~~v~~--~~~~ad~~~i~~~ln~~li~~l~~~Nfg--------~--~~~~p~~~~~~~-----~~ed~~~~ 378 (547) +..-..||+....+ ..+-.-. +.|+..++.||+..-|. | +...-++++... 
...++..+ T Consensus 367 ~r~TAtEV~~r~~E~~~~LG~v~----~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l 442 (535) T protein:vir:33 367 ERVTAEEIRYVASELEDTLGGVY----SILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKL 442 (535) T ss_pred ccccHHHHHHHHHHHHHHHhHHH----HHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHH Confidence 33445666654332 2222322 23333344444433211 1 111113333221 12233444 Q ss_pred HHHHHHHHhCCC----C-CCH----HHHHHHhCCCCC---CCcchhcccccccccccccccccccccCCccccccccccc Q lcl|NC_020866. 379 ASSLDTLVRLGM----E-IEE----NEVRSRFGFSQP---KPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLS 446 (547) Q Consensus 379 a~~~~~l~~~G~----~-v~~----~~~~e~~Glp~p---~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (547) ...++.+..++= + |+. +++.+.+|+|.. +..|++.....+.....+........+..-.+.+... T Consensus 443 ~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~--- 519 (535) T protein:vir:33 443 ERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVENAAAAGGAGVGALATSS--- 519 (535) T ss_pred HHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhhcC--- Confidence 445555554431 1 333 467778899842 3333332221111100000000000000000000000 Q ss_pred cchhhhhhcccchhhhcccccccchhHHHHHHHHHHHHHH Q lcl|NC_020866. 447 GQADLSRGDLSLNSEMPSTGQKTGGAEIAALTDQLELEGQ 486 (547) Q Consensus 447 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 486 (547) .+....+++..--+.. 
T Consensus 520 ------------------------~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 520 ------------------------PEAMQGAAAKAGLNAT 535 (535) T ss_pred ------------------------ChhHHHHHHhccCCCC Confidence 0001111111100000 No 231 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=75.24 E-value=0.15 Score=25.12 Aligned_cols=506 Identities=13% Similarity=0.110 Sum_probs=187.2 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhh-hhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCC-----HHHHHHHHHHHHhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAP-TVTGVRSPLSGYPGDGLNPLRLAQILRAADHGD-----PVRYLDLAEVIEER 74 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~-~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~-----~~~~~~L~~~m~~~ 74 (547) ||-++- |.+.+++-..+.+.+ .++....+ . .+| ...++++..|+ -.+-.++|+.+ T Consensus 1 ma~~~l------r~~rrpk~~p~~~rr~~ltaAsq~-~------~~p---~~~~kt~~~~~ar~~WQ~eAW~~~d~v--- 61 (639) T protein:vir:10 1 MAATSL------RVVRRPKGSAPAARRRSLTAASQL-I------TDP---QKQMKTSLMGTARNEWQSEAWDFSESI--- 61 (639) T ss_pred CCccce------eeeecCCCCCcchhhHHHhhhhhc-c------CCc---ccchhhhccccchhhhhhhhhhhhhhh--- Confidence 885532 222332222222211 12221111 1 233 23444553333 12334555544 Q ss_pred hHHHHHHHHHHHHHHhcCCceEe---cC----------CCCHHHHHHHHHHHHHHhC-CCHHHHHHHHHHhhh-hcCeee Q lcl|NC_020866. 75 DPHYLGVIGTRKRSVSQLDITVE---PA----------SDDPQDVKIADMVRDWLKR-EELQQEVFHILDCIS-KGYSFT 139 (547) Q Consensus 75 D~~v~s~l~~Rk~~v~~~~w~v~---p~----------~~~~~~~~~ae~v~~~l~~-~~~~~~i~~~l~a~~-~G~sv~ 139 (547) +.++.-..-|-.++.++..... |. ++++...++++.++.+... 
++-.++++.+...+- -|..-+ T Consensus 62 -~Elry~vgW~~~s~sr~rL~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi 140 (639) T protein:vir:10 62 -GELSYYVSWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWI 140 (639) T ss_pred -hhHHHHhhhhhhhhceeeeEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEE Confidence 6677778888888887765442 21 1234445677777766554 456666666553222 222222 Q ss_pred eeEEeecCCeeeEEeeEeeccceeEecc------cCCcceeeccCCCCceeeccC-ceEEEeecCCc-CccccchhHHHH Q lcl|NC_020866. 140 EIVWNTSEGQYFPDKLIWRDPRHFRFQK------SDLATPLLLDDNGAEVPLPAF-RFIFADIPAKS-GILLRSGLARVA 211 (547) Q Consensus 140 Ei~w~~~~g~~~~~~l~~~~~~~~~~~~------~~~~~~~~~~~~~~g~~l~~~-k~i~~~~~~~~-~~p~G~gll~~~ 211 (547) -++-....+.. .....--++|+.+.. .+....+...+ +..-.+... ..++-..+.+. .+.+-.|..+.| T Consensus 141 ~~l~r~~k~~~--~~~~~~~~~W~vvs~~Ei~~~~~~~~~i~lPd-G~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~ 217 (639) T protein:vir:10 141 AVLIRQEKDPV--TGLAAPRARWYAVTREEIKSKAGETAEISLPD-GKTHEFNRDLDSLVRIWNPRPRKASQATSPVRAC 217 (639) T ss_pred EEEEecCcccc--CcccccccceeeeeHHHhcccCCCeeEeecCC-CCCccccCCCceEEEEeCCCcccccCCcchhHHH Confidence 22221111100 000000123333321 11111111111 111111111 22222222221 112223444444 Q ss_pred HHHHHHHHHHHHHHHHHH--H--HcCC---------CeEEEeCCCC---------------CCHHHHHH-HHHHHHHHhc Q lcl|NC_020866. 212 AWGWMFKAFTARDWAIFT--Q--TYGQ---------PLRIGKWQPG---------------ASEQDKDT-LFRAVANIAG 262 (547) Q Consensus 212 ~~~~~~K~~~~~~w~~f~--E--~~G~---------P~~vgk~~~~---------------~~~~~~~~-l~~al~~~~~ 262 (547) ......-...-+--...+ . -.|+ |..-+.++++ .+.++... |.++...... 
T Consensus 218 l~~l~Ei~~~t~~i~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~ 297 (639) T protein:vir:10 218 LETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAME 297 (639) T ss_pred HHHHHHHHHhhhHHHHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhc Confidence 333221111111111111 1 1222 2222222111 11222222 3333332221 Q ss_pred ---CeeEEecC-----CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhc-----cCccccccchhHHHHHHHH Q lcl|NC_020866. 263 ---DCAAIIPE-----TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTT-----TDAVAGGHAVSQEHRLVQE 329 (547) Q Consensus 263 ---~~~~vip~-----~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlt-----s~~~ggs~a~~~vh~~v~~ 329 (547) ..+++||- +.-++-+..-...+ +.-+.-|+.-+..|...-.|-.++ +-+++.-.+.=++..+-.. T Consensus 298 De~S~aA~vPiia~~p~E~l~~ikhl~f~~--ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvr 375 (639) T protein:vir:10 298 DENSQAAYIPLVASVAAEHLEKVQHIKFGN--EVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQ 375 (639) T ss_pred CCCCccceeeeeEeechHHhcCeeeeeecC--chhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEeccccee Confidence 23344452 21122111111222 234555777777777775554433 3222211111222233333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccc-cCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCC Q lcl|NC_020866. 330 DIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKT-EDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKP 408 (547) Q Consensus 330 ~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~-ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~ 408 (547) .-|+--..+||+.++++++++.++--.-....| .|+|+...= .|.....++++ +.+.|. 
|+.+.+|+.+|+-+..- T Consensus 376 lHI~P~l~~icdAlT~~~Lrp~Le~eGvDp~kY-vvW~DaS~Lt~dPd~~deA~q-a~drGA-It~eAlR~~lG~~edd~ 452 (639) T protein:vir:10 376 LHIKPVMDLICQAIYNDILTPLLAREGIDPTKY-ILWYDASGLTSDPDLSDEAVE-AHDRGA-ITSAALRRLLNVGEDSG 452 (639) T ss_pred eecchhHHHHHHHHHhhHHHHHHHHhCCCHHHh-EeeecCcccccCCCCcHHHHH-HHHcCC-ccHHHHHHHhccccccC Confidence 446778889999999999999877653233444 677775431 23333334444 567787 99999999999865320 Q ss_pred cc--------------------------hhccccccccccc-ccccccccccCCcccccccc---ccccchhhhhhcccc Q lcl|NC_020866. 409 GA--------------------------KLLRPLENPATQA-APPGMDPGAGDLKTAERKLK---PLSGQADLSRGDLSL 458 (547) Q Consensus 409 ~e--------------------------~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~ 458 (547) -+ ..+.+..+...-+ ++....++.+.....+.+.. +..+..+.. ..... T Consensus 453 yd~~t~e~~~~~A~~~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga~~~~ePdte~~~-~~~~a 531 (639) T protein:vir:10 453 YDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGARQQREPQTEDER-STEEA 531 (639) T ss_pred CCCCCcHHHHHHHHHHhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCCCCCcCCCccccc-CCccc Confidence 00 0000000000000 00000001000000000000 000000000 00000 Q ss_pred hhhhcccccccchhHH---HHHHHHHHHHHHH------------------------HHHHH-HHHHHHHHHccCCHHHHH Q lcl|NC_020866. 459 NSEMPSTGQKTGGAEI---AALTDQLELEGQP------------------------EIVAM-IEEIRTMLEAASSLGEFS 510 (547) Q Consensus 459 ~~~~~~~~~~~~~~~~---~~l~~~~~~~~~~------------------------~~~~~-~~~i~~~l~~a~s~ee~~ 510 (547) . ........+ +-+..+.+..+.+ .+.+. -+.+..++.. ++..+ T Consensus 532 ~------~~~~~a~~v~a~~llv~RALelAGkRr~~~~~r~~~a~~r~vp~he~H~~l~Pv~~~~~~rli~g---wd~~l 602 (639) T protein:vir:10 532 A------SLNDRAAYLVAERLLVNRALDLAGKRRFKVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAG---WDTAL 602 (639) T ss_pred c------CcCchhHHHHHHHHHHHHHHHhhcccccCCCChhhHHHhhcCChhHceeecCCCChHHHHHHHHH---HHhHH Confidence 0 000000011 1122211111100 00000 0112233322 34444 Q ss_pred HHHHHHhcCCCHHHHHHHHHHHHHHHHHHhHHhhccC Q lcl|NC_020866. 
511 DMLDARYGEIGASQLASRISAAMLASEFAGREGALDG 547 (547) Q Consensus 511 ~~L~~l~~~~~~~~l~~~l~~al~~A~~~G~~~~~~g 547 (547) +.-..+-=.||.+.++..+..... +++-+ .+.|| T Consensus 603 d~~~~a~lg~D~~~lr~~v~~~v~--~~lt~-~~i~~ 636 (639) T protein:vir:10 603 EDEVVASLGLDNEKLRNAVLATVR--RQLTQ-PLIEG 636 (639) T ss_pred HHHHHHHhCCCHHHHHHHHHHHHH--HHHhh-hhhcc Confidence 322222226888888887776644 22221 35677 No 232 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=75.24 E-value=0.15 Score=25.12 Aligned_cols=506 Identities=13% Similarity=0.110 Sum_probs=187.2 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhh-hhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCC-----HHHHHHHHHHHHhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAP-TVTGVRSPLSGYPGDGLNPLRLAQILRAADHGD-----PVRYLDLAEVIEER 74 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~-~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~-----~~~~~~L~~~m~~~ 74 (547) ||-++- |.+.+++-..+.+.+ .++....+ . .+| ...++++..|+ -.+-.++|+.+ T Consensus 1 ma~~~l------r~~rrpk~~p~~~rr~~ltaAsq~-~------~~p---~~~~kt~~~~~ar~~WQ~eAW~~~d~v--- 61 (639) T protein:vir:97 1 MAATSL------RVVRRPKGSAPAARRRSLTAASQL-I------TDP---QKQMKTSLMGTARNEWQSEAWDFSESI--- 61 (639) T ss_pred CCccce------eeeecCCCCCcchhhHHHhhhhhc-c------CCc---ccchhhhccccchhhhhhhhhhhhhhh--- Confidence 885532 222332222222211 12221111 1 233 23444553333 12334555544 Q ss_pred hHHHHHHHHHHHHHHhcCCceEe---cC----------CCCHHHHHHHHHHHHHHhC-CCHHHHHHHHHHhhh-hcCeee Q lcl|NC_020866. 75 DPHYLGVIGTRKRSVSQLDITVE---PA----------SDDPQDVKIADMVRDWLKR-EELQQEVFHILDCIS-KGYSFT 139 (547) Q Consensus 75 D~~v~s~l~~Rk~~v~~~~w~v~---p~----------~~~~~~~~~ae~v~~~l~~-~~~~~~i~~~l~a~~-~G~sv~ 139 (547) +.++.-..-|-.++.++..... |. ++++...++++.++.+... 
++-.++++.+...+- -|..-+ T Consensus 62 -~Elry~vgW~~~s~sr~rL~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi 140 (639) T protein:vir:97 62 -GELSYYVSWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWI 140 (639) T ss_pred -hhHHHHhhhhhhhhceeeeEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEE Confidence 6677778888888887765442 21 1234445677777766554 456666666553222 222222 Q ss_pred eeEEeecCCeeeEEeeEeeccceeEecc------cCCcceeeccCCCCceeeccC-ceEEEeecCCc-CccccchhHHHH Q lcl|NC_020866. 140 EIVWNTSEGQYFPDKLIWRDPRHFRFQK------SDLATPLLLDDNGAEVPLPAF-RFIFADIPAKS-GILLRSGLARVA 211 (547) Q Consensus 140 Ei~w~~~~g~~~~~~l~~~~~~~~~~~~------~~~~~~~~~~~~~~g~~l~~~-k~i~~~~~~~~-~~p~G~gll~~~ 211 (547) -++-....+.. .....--++|+.+.. .+....+...+ +..-.+... ..++-..+.+. .+.+-.|..+.| T Consensus 141 ~~l~r~~k~~~--~~~~~~~~~W~vvs~~Ei~~~~~~~~~i~lPd-G~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~ 217 (639) T protein:vir:97 141 AVLIRQEKDPV--TGLAAPRARWYAVTREEIKSKAGETAEISLPD-GKTHEFNRDLDSLVRIWNPRPRKASQATSPVRAC 217 (639) T ss_pred EEEEecCcccc--CcccccccceeeeeHHHhcccCCCeeEeecCC-CCCccccCCCceEEEEeCCCcccccCCcchhHHH Confidence 22221111100 000000123333321 11111111111 111111111 22222222221 112223444444 Q ss_pred HHHHHHHHHHHHHHHHHH--H--HcCC---------CeEEEeCCCC---------------CCHHHHHH-HHHHHHHHhc Q lcl|NC_020866. 212 AWGWMFKAFTARDWAIFT--Q--TYGQ---------PLRIGKWQPG---------------ASEQDKDT-LFRAVANIAG 262 (547) Q Consensus 212 ~~~~~~K~~~~~~w~~f~--E--~~G~---------P~~vgk~~~~---------------~~~~~~~~-l~~al~~~~~ 262 (547) ......-...-+--...+ . -.|+ |..-+.++++ .+.++... |.++...... 
T Consensus 218 l~~l~Ei~~~t~~i~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~ 297 (639) T protein:vir:97 218 LETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAME 297 (639) T ss_pred HHHHHHHHHhhhHHHHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhc Confidence 333221111111111111 1 1222 2222222111 11222222 3333332221 Q ss_pred ---CeeEEecC-----CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhc-----cCccccccchhHHHHHHHH Q lcl|NC_020866. 263 ---DCAAIIPE-----TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTT-----TDAVAGGHAVSQEHRLVQE 329 (547) Q Consensus 263 ---~~~~vip~-----~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlt-----s~~~ggs~a~~~vh~~v~~ 329 (547) ..+++||- +.-++-+..-...+ +.-+.-|+.-+..|...-.|-.++ +-+++.-.+.=++..+-.. T Consensus 298 De~S~aA~vPiia~~p~E~l~~ikhl~f~~--ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvr 375 (639) T protein:vir:97 298 DENSQAAYIPLVASVAAEHLEKVQHIKFGN--EVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQ 375 (639) T ss_pred CCCCccceeeeeEeechHHhcCeeeeeecC--chhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEeccccee Confidence 23344452 21122111111222 234555777777777775554433 3222211111222233333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccEEEecCccc-cCHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCC Q lcl|NC_020866. 330 DIETSDAMALAAILNRDLIRPWIQLEYGPQKRYPRIKIARPKT-EDLTKLASSLDTLVRLGMEIEENEVRSRFGFSQPKP 408 (547) Q Consensus 330 ~~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~~~p~~~~~~~~~-ed~~~~a~~~~~l~~~G~~v~~~~~~e~~Glp~p~~ 408 (547) .-|+--..+||+.++++++++.++--.-....| .|+|+...= .|.....++++ +.+.|. 
|+.+.+|+.+|+-+..- T Consensus 376 lHI~P~l~~icdAlT~~~Lrp~Le~eGvDp~kY-vvW~DaS~Lt~dPd~~deA~q-a~drGA-It~eAlR~~lG~~edd~ 452 (639) T protein:vir:97 376 LHIKPVMDLICQAIYNDILTPLLAREGIDPTKY-ILWYDASGLTSDPDLSDEAVE-AHDRGA-ITSAALRRLLNVGEDSG 452 (639) T ss_pred eecchhHHHHHHHHHhhHHHHHHHHhCCCHHHh-EeeecCcccccCCCCcHHHHH-HHHcCC-ccHHHHHHHhccccccC Confidence 446778889999999999999877653233444 677775431 23333334444 567787 99999999999865320 Q ss_pred cc--------------------------hhccccccccccc-ccccccccccCCcccccccc---ccccchhhhhhcccc Q lcl|NC_020866. 409 GA--------------------------KLLRPLENPATQA-APPGMDPGAGDLKTAERKLK---PLSGQADLSRGDLSL 458 (547) Q Consensus 409 ~e--------------------------~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~ 458 (547) -+ ..+.+..+...-+ ++....++.+.....+.+.. +..+..+.. ..... T Consensus 453 yd~~t~e~~~~~A~~~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga~~~~ePdte~~~-~~~~a 531 (639) T protein:vir:97 453 YDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGARQQREPQTEDER-STEEA 531 (639) T ss_pred CCCCCcHHHHHHHHHHhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCCCCCcCCCccccc-CCccc Confidence 00 0000000000000 00000001000000000000 000000000 00000 Q ss_pred hhhhcccccccchhHH---HHHHHHHHHHHHH------------------------HHHHH-HHHHHHHHHccCCHHHHH Q lcl|NC_020866. 459 NSEMPSTGQKTGGAEI---AALTDQLELEGQP------------------------EIVAM-IEEIRTMLEAASSLGEFS 510 (547) Q Consensus 459 ~~~~~~~~~~~~~~~~---~~l~~~~~~~~~~------------------------~~~~~-~~~i~~~l~~a~s~ee~~ 510 (547) . ........+ +-+..+.+..+.+ .+.+. -+.+..++.. ++..+ T Consensus 532 ~------~~~~~a~~v~a~~llv~RALelAGkRr~~~~~r~~~a~~r~vp~he~H~~l~Pv~~~~~~rli~g---wd~~l 602 (639) T protein:vir:97 532 A------SLNDRAAYLVAERLLVNRALDLAGKRRFKVNDAALKTKLRDVPAHEYHRVLPPVRSSEIPRLIAG---WDTAL 602 (639) T ss_pred c------CcCchhHHHHHHHHHHHHHHHhhcccccCCCChhhHHHhhcCChhHceeecCCCChHHHHHHHHH---HHhHH Confidence 0 000000011 1122211111100 00000 0112233322 34444 Q ss_pred HHHHHHhcCCCHHHHHHHHHHHHHHHHHHhHHhhccC Q lcl|NC_020866. 
511 DMLDARYGEIGASQLASRISAAMLASEFAGREGALDG 547 (547) Q Consensus 511 ~~L~~l~~~~~~~~l~~~l~~al~~A~~~G~~~~~~g 547 (547) +.-..+-=.||.+.++..+..... +++-+ .+.|| T Consensus 603 d~~~~a~lg~D~~~lr~~v~~~v~--~~lt~-~~i~~ 636 (639) T protein:vir:97 603 EDEVVASLGLDNEKLRNAVLATVR--RQLTQ-PLIEG 636 (639) T ss_pred HHHHHHHhCCCHHHHHHHHHHHHH--HHHhh-hhhcc Confidence 322222226888888887776644 22221 35677 No 233 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=68.07 E-value=0.24 Score=23.95 Aligned_cols=412 Identities=12% Similarity=0.064 Sum_probs=159.2 Q ss_pred CC---CCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhhccCCCHHHHHHHHHhhcCC-CHHHHHHHHHHHHhhhH Q lcl|NC_020866. 1 MA---QKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYPGDGLNPLRLAQILRAADHG-DPVRYLDLAEVIEERDP 76 (547) Q Consensus 1 ma---~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g-~~~~~~~L~~~m~~~D~ 76 (547) .| -...|+-..|-|+....--.-++. ..+...+.+... . ..++..-+-+ +..+.+.+|++|. .|| T Consensus 33 ~~~~~~~~~~~s~~g~p~~~~~~~~~~~~-~~t~~~D~~~~g------~---~~~~~~~~~pr~R~qiY~~~eeM~-~~p 101 (569) T protein:vir:10 33 LAVHGGSGYLFSRAGAPVQLSGFLGGKPG-DSGMAGDGLVDG------S---RFIFDEVQLPEDRLQRYPLLEEMA-VYS 101 (569) T ss_pred heeecCcceEEeecCcchhhhhhhccCcc-ccchhhhhHHHH------H---HHHhhhccCchhHHHHHHHHHHHh-cCc Confidence 11 123455666777655433222221 111111111111 0 2223222222 3445567788886 699 Q ss_pred HHHHHHHHHHHHHhcC------CceEecCC-CC----HHHHHHHHHHHHHHhCCCHHHHHHHH-HHhhhhcCeeeeeEEe Q lcl|NC_020866. 77 HYLGVIGTRKRSVSQL------DITVEPAS-DD----PQDVKIADMVRDWLKREELQQEVFHI-LDCISKGYSFTEIVWN 144 (547) Q Consensus 77 ~v~s~l~~Rk~~v~~~------~w~v~p~~-~~----~~~~~~ae~v~~~l~~~~~~~~i~~~-l~a~~~G~sv~Ei~w~ 144 (547) .|+.+|+....+-++- -+.|+|.. .. +..+++++.+..-|..+ +...+..+ .+++.||-|..-+ |. 
T Consensus 102 ~Ia~AlniHVtaALggde~TGd~vfI~p~~~~~~a~~daakai~~el~~dl~~~-iNr~~~~lA~~~~aFGdsYaRi-Y~ 179 (569) T protein:vir:10 102 TIATALNIHITHALSFDKKTGQTFSIVPVHNGNDSDYDAAQALCGELMNDIGRT-INKEVAGWAFIMSVFGVAYVRP-YA 179 (569) T ss_pred hhhhhhhhhhheeecccccccceEEEEeecCCCCCcchHHHHHHHHHHHHHHHH-HHHHhhHHHHHHHhhhhhheee-ec Confidence 9999999887766543 46777642 22 33334555544422222 22222222 3678899998877 43 Q ss_pred e-cCCeeeEEeeEeeccceeEecccCCcc--------------eeeccC-------CCCceeecc----CceEE---Eee Q lcl|NC_020866. 145 T-SEGQYFPDKLIWRDPRHFRFQKSDLAT--------------PLLLDD-------NGAEVPLPA----FRFIF---ADI 195 (547) Q Consensus 145 ~-~~g~~~~~~l~~~~~~~~~~~~~~~~~--------------~~~~~~-------~~~g~~l~~----~k~i~---~~~ 195 (547) . ..|-..+..-.+..|..|+.-+.++.. +..... .+.-.++|- ++++. -.. T Consensus 180 ~~~~GV~dl~~s~yt~PsfIqpFE~g~~tvGF~~~~~~~~~~ti~~l~p~qm~rmKmPrm~~i~q~~~v~~g~~~~~L~~ 259 (569) T protein:vir:10 180 KEGIGITSFECSYYTLPSFIKEFEVSGNLAGFSGDYLKDASGKMVFADPWAIIPMKIPYWRPKSNLMPVHTGHKAYSLLD 259 (569) T ss_pred cCCceeEEEEecccccccccchhhhcCceEEeecccCCccccceeeechhhhhhhcccceeeccccchhhhhhhheeecc Confidence 3 334433433334444443321111111 110000 000011111 11111 112 Q ss_pred cCCcCcc-----ccchhHHHHHHHHHHHHHHH------HHHHHHHHHc------CCCeEEEe-CCCCCCHHHHHHHHHHH Q lcl|NC_020866. 
196 PAKSGIL-----LRSGLARVAAWGWMFKAFTA------RDWAIFTQTY------GQPLRIGK-WQPGASEQDKDTLFRAV 257 (547) Q Consensus 196 ~~~~~~p-----~G~gll~~~~~~~~~K~~~~------~~w~~f~E~~------G~P~~vgk-~~~~~~~~~~~~l~~al 257 (547) +..+.+| ||.|+|..++.+|.--...+ +.-..-.|++ |||-...- |-.+ -..-.++-.+++ T Consensus 260 d~~~~~Pi~psn~GgSFL~~ae~pf~~l~~Al~sL~~qri~dSv~~~~Itlnm~gM~p~qr~~y~r~-lt~~LKr~~d~i 338 (569) T protein:vir:10 260 NPEERTPIETQNYGTSLLEYAYEPYMNLRSAIRSLKATRFNASKIDRIIGLAMNSLDPVKAADYSRT-ITQTLKRAADLM 338 (569) T ss_pred cccccccccchhhhhHHHHHHHhHHHHHHHHHHhccchhhHHHHHhHHhhccccCCCHHHHhHHHHH-HHHHHHHHHHHH Confidence 3334444 59999999999998655533 2333333332 34322111 1110 011112222233 Q ss_pred HHHhcC-------eeEEecC------CceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchh--------ccC--cc Q lcl|NC_020866. 258 ANIAGD-------CAAIIPE------TMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTT--------TTD--AV 314 (547) Q Consensus 258 ~~~~~~-------~~~vip~------~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtl--------ts~--~~ 314 (547) +....+ ..-++|. ...|+...-.....+ -+.+.-|+ ++++-+ ||-.+ .++ ++ T Consensus 339 e~a~~gg~~~~~~~~H~LPv~gekq~~~tvDt~~~~A~~~g---IEdvM~~~-R~Laga-LGlD~SMlGwAD~LsGGLGe 413 (569) T protein:vir:10 339 ERRARGANNMPTVTNTLLPIMGDGKGQMTIDTQTIQADING---IEDILTYM-RQLAAA-LGLDYTLLGWADQMSGGLGE 413 (569) T ss_pred HHHhccCccccccceeeeeeecCccccccccccccccCccc---HHHHHHHH-HHHHhh-hccchhHhhHHHHhcccccc Confidence 322211 2235663 233443321111111 23333333 445444 44332 222 22 Q ss_pred ccccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHH--HHHHHhc--CCCcccccEEEecCcc-------ccCHHHH---- Q lcl|NC_020866. 315 AGGHAVSQEHRLVQEDIETSDAMA-LAAILNRDLI--RPWIQLE--YGPQKRYPRIKIARPK-------TEDLTKL---- 378 (547) Q Consensus 315 ggs~a~~~vh~~v~~~~~~ad~~~-i~~~ln~~li--~~l~~~N--fg~~~~~p~~~~~~~~-------~ed~~~~---- 378 (547) || -.|........-.+.|+ +.+.+|+ +| .|-..+| |.++.++-.+.|.... ....... 
T Consensus 414 GG-----~frtSaQaa~RS~~iRqa~~e~in~-iidiH~~fKYgevf~~~drP~~V~F~s~~tAl~~E~~~n~~~raN~a 487 (569) T protein:vir:10 414 GG-----FLRTAIQAAMRASWIQQGVEEFIQR-AIDIHLAFKYGKVYPEGDRPYKIEFHSVNTALQQEHNDNRDSQANYA 487 (569) T ss_pred cH-----HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhhhhcCcccCCCCcceEEEeccchHHHHHHHHhHHHHHHHHH Confidence 22 22333322222222222 3444443 43 3444555 5566666677875332 1111111 Q ss_pred ---HHHHHHHHh---CCCCCCHHH----HHHHhCCC-------------CCCCcchhcccccccccccccccccccccCC Q lcl|NC_020866. 379 ---ASSLDTLVR---LGMEIEENE----VRSRFGFS-------------QPKPGAKLLRPLENPATQAAPPGMDPGAGDL 435 (547) Q Consensus 379 ---a~~~~~l~~---~G~~v~~~~----~~e~~Glp-------------~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (547) .+.+-.|.+ +| ++++. +.+.+|+. .|+|+|-+....-..+ T Consensus 488 ~i~~Q~la~l~e~n~Lg--~de~~m~y~l~d~~~~De~~~e~l~ae~~akp~DEe~~~~~~~~~~--------------- 550 (569) T protein:vir:10 488 TIVTQILDAVSNNSVLA--NSDAFKRYLFSDVLEIDEKISEALVNELKAKSEDDDHLMDSIIKTP--------------- 550 (569) T ss_pred HHHHHHHHHhhhccccc--ccHHHHHHHHHHHhhcchhHHHHHHhhcCCCcchhHHHHHHHhcCC--------------- Confidence 233334444 33 34432 33445653 3333332221111000 Q ss_pred ccccccccccccchhhhhhcccchhhhcccccccc Q lcl|NC_020866. 436 KTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTG 470 (547) Q Consensus 436 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (547) +..+++..-+.. .+.... + T Consensus 551 ------------~~~~~~~~~~~~---~~~~~~-~ 569 (569) T protein:vir:10 551 ------------PQELAQILESVF---KEGNDN-D 569 (569) T ss_pred ------------hHHHHHHHHHHh---hccCCC-C Confidence 000000000000 000000 0 No 234 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=67.60 E-value=0.25 Score=23.89 Aligned_cols=425 Identities=11% Similarity=0.124 Sum_probs=170.8 Q ss_pred CCCCCeeeCCCCCcccchhhhhhh---hhh--hhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhh Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTEEV---AAP--TVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERD 75 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~---a~~--~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D 75 (547) ||++- ..|. ..+.+.+.. .+. ..-..++-+..|.. |. .+.....+.......+| | T Consensus 1 ~~~~~----~~~~--~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~l----P~----~~~~~~~~~~~~~~~~~------d 60 (543) T protein:vir:88 1 MAETK----REGL--AEEGAKAVYERLKNDRVPYETRAENCAKVTI----PS----LFPKDSDNSSTDYTTPW------Q 60 (543) T ss_pred Ccccc----cCcc--hHHHHHHHHHHHHHHHhHHHHHHHHHHHHhc----cc----cCCCCCCcccccccccc------c Confidence 99632 2232 222222211 110 01111222233321 21 11112222222222233 6 Q ss_pred HHHHHHHHHHHHHHhcC-----Cc-eEecCCC-------CHHH-HHH-------HHHHHHHHhCCCHHHHHHHHH-Hhhh Q lcl|NC_020866. 76 PHYLGVIGTRKRSVSQL-----DI-TVEPASD-------DPQD-VKI-------ADMVRDWLKREELQQEVFHIL-DCIS 133 (547) Q Consensus 76 ~~v~s~l~~Rk~~v~~~-----~w-~v~p~~~-------~~~~-~~~-------ae~v~~~l~~~~~~~~i~~~l-~a~~ 133 (547) ++-.-++++....+.+. +| ++.+.+. ++.+ .++ .+.+...|+..+|..-+.+++ +-+. T Consensus 61 st~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~ 140 (543) T protein:vir:88 61 AVGARGLNNLSAKVMLALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLAL 140 (543) T ss_pred chHHHHHHHHHHHHHHhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHh Confidence 66666777666666543 33 2222210 1101 111 223334455667777776665 3345 Q ss_pred hcCeee---------------------eeEEeecC-Ceee-E-EeeEe------------------ecc-ceeE-----e Q lcl|NC_020866. 134 KGYSFT---------------------EIVWNTSE-GQYF-P-DKLIW------------------RDP-RHFR-----F 165 (547) Q Consensus 134 ~G~sv~---------------------Ei~w~~~~-g~~~-~-~~l~~------------------~~~-~~~~-----~ 165 (547) +|-+++ +.+..++. |.+. + .+... ..| .+|. 
+ T Consensus 141 ~G~a~ly~~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~V~ 220 (543) T protein:vir:88 141 AGTALIYLPPPDASSNSYNPMKLYTLHNHVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTHIY 220 (543) T ss_pred hCceeeeeccCccccceecceEEeEcceEEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEEEE Confidence 665542 22222221 1110 0 00000 000 0111 1 Q ss_pred cccCCcceee--------ccCCCCceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeE Q lcl|NC_020866. 166 QKSDLATPLL--------LDDNGAEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLR 237 (547) Q Consensus 166 ~~~~~~~~~~--------~~~~~~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~ 237 (547) ...++..... ........++...-|++.++...+|+.||.|....++.-..--+...+.-+..+++---|.. T Consensus 221 pr~~~~~~~~~~~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~ 300 (543) T protein:vir:88 221 IDDESGDFLSYQEIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVG 300 (543) T ss_pred eecCCCcccccccccCeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 1111111110 00011112333456899999999999999999999999988888888888899988877766 Q ss_pred EEeCCCCCCHHHHHHHHHHHHHHhc-CeeEEec-CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccc Q lcl|NC_020866. 238 IGKWQPGASEQDKDTLFRAVANIAG-DCAAIIP-ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVA 315 (547) Q Consensus 238 vgk~~~~~~~~~~~~l~~al~~~~~-~~~~vip-~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~g 315 (547) ++.-+.... ..++.. +.+.++| ...++..++... ++....-...|+.+...|..+++...++.. 
++ T Consensus 301 ~v~~~g~~~----------~~~~~~~~~g~~v~g~~~~v~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~-~~ 368 (543) T protein:vir:88 301 LVNPNGITQ----------VRRLVKAQTGDFVAGRKADIEFLQLEK-TADFTVAKSVADAIEARLSYVFMLNSAVQR-SG 368 (543) T ss_pred eeccccccc----------hhhcccCCCceeecCCCCcceeeeccc-ccchhHHHHHHHHHHHHHHHHHhhhhhccC-CC Confidence 554321111 112211 2223333 345677766543 345556678899999999999876544322 23 Q ss_pred cccchhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhcC------CCccccc----EEEecCc-----cccCHHHH Q lcl|NC_020866. 316 GGHAVSQEHRLVQE--DIETSDAMALAAILNRDLIRPWIQLEY------GPQKRYP----RIKIARP-----KTEDLTKL 378 (547) Q Consensus 316 gs~a~~~vh~~v~~--~~~~ad~~~i~~~ln~~li~~l~~~Nf------g~~~~~p----~~~~~~~-----~~ed~~~~ 378 (547) +..-..||+....+ ..+-.- -+.|+..++.||+..-| |--..+| ++.+... ...++..+ T Consensus 369 ~r~TAtEV~~r~~E~~~~LG~v----~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l 444 (543) T protein:vir:88 369 ERVTAEEIRYVASELEDTLGGV----YSILSQELQLPIVRVLLNQLQATQQIPNLPQEAVEPTVTTGAEALGRGQDLDKL 444 (543) T ss_pred CcccHHHHHHHHHHHHHHHhHH----HHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeEEecHHHHHHHHHHHHH Confidence 34455666654332 222232 22333334444443322 1000111 2222211 11122233 Q ss_pred HHHHHHHHhCC---C--CCCH----HHHHHHhCCCCC---CCcchhccccccccc----ccccccccccccCCccccccc Q lcl|NC_020866. 379 ASSLDTLVRLG---M--EIEE----NEVRSRFGFSQP---KPGAKLLRPLENPAT----QAAPPGMDPGAGDLKTAERKL 442 (547) Q Consensus 379 a~~~~~l~~~G---~--~v~~----~~~~e~~Glp~p---~~~e~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~ 442 (547) ...++.+..++ + .++. +++.+.+|+|.+ +..+++-....+... ..+......+. ...... T Consensus 445 ~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~----~~~~~~ 520 (543) T protein:vir:88 445 TQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGV----AAQATA 520 (543) T ss_pred HHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhch----hhhhcc Confidence 33333333332 1 1333 466677799543 333333222111100 01111100000 000000 Q ss_pred cccccchhhhhhcccchhhhcccccc Q lcl|NC_020866. 
443 KPLSGQADLSRGDLSLNSEMPSTGQK 468 (547) Q Consensus 443 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (547) .+.... .....+.-+..+...+. T Consensus 521 ~~~~~~---~~~~~~~~~~~p~~~~~ 543 (543) T protein:vir:88 521 SPEAME---SAMDTAGVQPGPIATQV 543 (543) T ss_pred ChHHHH---HHhhhcCCCCCCCCCCC Confidence 000000 00000000011111111 No 235 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=66.03 E-value=0.27 Score=23.67 Aligned_cols=440 Identities=13% Similarity=0.111 Sum_probs=170.0 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhh--hhh-hhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVA--APT-VTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPH 77 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a--~~~-~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~ 77 (547) |++ + +|-.|+......-.+ .+. ..+..-+.+++-+.-.+..- ..-. -+-..+..-|+.|. .++. T Consensus 1 m~~---l---fgf~i~~~~~~~~~S~vpp~~~~~~~~i~~g~~g~~v~~~g-~~~~-----~n~~eLI~~YR~ma-~~pE 67 (564) T protein:vir:10 1 MSQ---L---FGFLINEKEGQKGQSPVPPNDEASVSTVAGGYFGTYVDTSG-GQNS-----RNEYELIRRYRDMS-LHPE 67 (564) T ss_pred Ccc---h---hcceeeeeccCCCCCcccCCcCCChhhhhccccceeeeccc-ccch-----hhHHHHHHHHHHHh-hccc Confidence 875 3 355554332211110 111 11111111222111111000 0000 12345666777885 6999 Q ss_pred HHHHHHHHHHHHhcCC-----ceEecCCCCHHH----HHHHHHHHHHHhCCCHHHHHHHHHH-hhhhcCeeeeeEEeecC Q lcl|NC_020866. 78 YLGVIGTRKRSVSQLD-----ITVEPASDDPQD----VKIADMVRDWLKREELQQEVFHILD-CISKGYSFTEIVWNTSE 147 (547) Q Consensus 78 v~s~l~~Rk~~v~~~~-----w~v~p~~~~~~~----~~~ae~v~~~l~~~~~~~~i~~~l~-a~~~G~sv~Ei~w~~~~ 147 (547) |-++++....-+.-.+ ..|.-. +.+-+ +++.++.+..++-++|+.-..++.. 
-...|--.+.++-..++ T Consensus 68 Vd~Av~eIVneaIv~d~~~~pV~vdL~-~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~ 146 (564) T protein:vir:10 68 VDSAIDEIVNEFVVNDGDDKPVEVDLQ-NLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDN 146 (564) T ss_pred hhhHHHHhhcceeEecCCCceEEEEec-ccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCC Confidence 9999998877654332 222221 11122 2355555555555566555554442 22345445555544444 Q ss_pred CeeeEEeeEeeccceeEecc------c-CCc----------------ceeeccC-----------------CCCceeecc Q lcl|NC_020866. 148 GQYFPDKLIWRDPRHFRFQK------S-DLA----------------TPLLLDD-----------------NGAEVPLPA 187 (547) Q Consensus 148 g~~~~~~l~~~~~~~~~~~~------~-~~~----------------~~~~~~~-----------------~~~g~~l~~ 187 (547) ..-.+.+|..++||.+++-. + .+. .+..... ++.++.++. T Consensus 147 pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~ 226 (564) T protein:vir:10 147 PKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIAS 226 (564) T ss_pred hhhhhhhhhhhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeech Confidence 33335667777887655322 0 000 1111110 012344554 Q ss_pred CceEEEeecC--CcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe-EEEeCCCCCCHHHH--HHHHHHHHHHhc Q lcl|NC_020866. 188 FRFIFADIPA--KSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL-RIGKWQPGASEQDK--DTLFRAVANIAG 262 (547) Q Consensus 188 ~k~i~~~~~~--~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~-~vgk~~~~~~~~~~--~~l~~al~~~~~ 262 (547) + .|+|.|.. 
..++..=.|.|+.+..++-=-+...-..+.| ..-=-|- ||.-.+.+.-+..+ +=|.+.+....+ T Consensus 227 d-aI~y~hSGL~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIY-RitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KN 304 (564) T protein:vir:10 227 D-AIAQSTSGLMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIY-RLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRN 304 (564) T ss_pred h-hcceecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHH-hhhccccceEEEEecCCCCchhHHHHHHHHHHhcCc Confidence 4 33343331 1233344567777777764434322222222 1222232 22222333222222 112222222221 Q ss_pred C------eeE--------------Eec-----CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch--hccCccc Q lcl|NC_020866. 263 D------CAA--------------IIP-----ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT--TTTDAVA 315 (547) Q Consensus 263 ~------~~~--------------vip-----~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt--lts~~~g 315 (547) - ++- -+| .|++|..+..+.+-|. ..=++|..+.+-+++--.. |..++.| T Consensus 305 klVYDa~TGevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLge----m~DV~YF~kKLY~aLnVP~SRl~~e~~~ 380 (564) T protein:vir:10 305 KLVYDGQTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE----LKDVEYFKKKLYNSLNLPPSRLTDDNKA 380 (564) T ss_pred eEEEeccCceecccchhhhhHhhhcccccCCCcccceeeccccCCcch----HHHHHHHHHHHHHHhCCCcccccCCCce Confidence 1 111 123 5789998864443332 2347899999999853322 2233222 Q ss_pred c--ccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCCcccccEEEecCc----cccCHHHHHHHH Q lcl|NC_020866. 316 G--GHAVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLE------YGPQKRYPRIKIARP----KTEDLTKLASSL 382 (547) Q Consensus 316 g--s~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~N------fg~~~~~p~~~~~~~----~~ed~~~~a~~~ 382 (547) . +++..-+..++ |...+......++..|..-|-..|+--| |..-...-++.|... 
+-.+.+-+.+++ T Consensus 381 f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl 460 (564) T protein:vir:10 381 FNLGKSTEILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRV 460 (564) T ss_pred eecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHH Confidence 2 23322233444 4456666666666666654444444333 211112223333222 222334445555 Q ss_pred HHHHhC----CCCCCHHHHHHHh-CCC---------------------CCCCcchhcccccccccccccccccccccCCc Q lcl|NC_020866. 383 DTLVRL----GMEIEENEVRSRF-GFS---------------------QPKPGAKLLRPLENPATQAAPPGMDPGAGDLK 436 (547) Q Consensus 383 ~~l~~~----G~~v~~~~~~e~~-Glp---------------------~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (547) ..|..+ |--++.+|+++.+ .+. +|.+.+ .+... +..+.+-.|....+.+..+ T Consensus 461 ~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~-~~~~~-~~~~~~~~p~~~~~~~~~~ 538 (564) T protein:vir:10 461 NLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVN-MLDDM-EKQNQAFAPELQAAQDDLA 538 (564) T ss_pred HHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhh-cCCCc-cCCCCcCCcchhhhccccc Confidence 555544 4347888887643 332 111111 11110 0000000111111101000 Q ss_pred cccccccccccchhhhhhcccchhhhcccccccchhHHH Q lcl|NC_020866. 437 TAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGGAEIA 475 (547) Q Consensus 437 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 475 (547) . .+.......++..+.. +....+.-. 
T Consensus 539 ~-~~~~~~~~~a~~~~~~------------~~~~~~~~~ 564 (564) T protein:vir:10 539 A-EREIKKLNSAPKPPPS------------QQSKSQSNK 564 (564) T ss_pred c-ccChhhhccCCCCCCC------------CCCcCcCCC Confidence 0 0000000000000000 000000000 No 236 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=63.21 E-value=0.32 Score=23.29 Aligned_cols=411 Identities=12% Similarity=0.082 Sum_probs=164.4 Q ss_pred ccchhhhhhhhhhh--hhhhHHHHhhh--hccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHHh Q lcl|NC_020866. 15 VQRAQLTEEVAAPT--VTGVRSPLSGY--PGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSVS 90 (547) Q Consensus 15 ~~~~~l~~~~a~~~--~~~~~~~~~~~--~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~ 90 (547) |+.+..=+...+.. .-..++-+..| |.++... ....+...+...+| |++-.-++++....+. T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~--------~~~~~~~~~~~~~~------dstg~~a~~~LAa~l~ 66 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLIDDD--------ISSRPNHKSLTVPW------QSVGAKCCVTLAAKLM 66 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCC--------CCCCcccccccccc------cchHHHHHHHHHHHHH Confidence 44443333332211 11222333333 3333211 01111112222343 5666666666555554 Q ss_pred cC------Cc-eEecCCC------CHH----HHH----HHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeee--------- Q lcl|NC_020866. 91 QL------DI-TVEPASD------DPQ----DVK----IADMVRDWLKREELQQEVFHIL-DCISKGYSFT--------- 139 (547) Q Consensus 91 ~~------~w-~v~p~~~------~~~----~~~----~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~--------- 139 (547) +. +| ++.+.+. 
+++ .++ +.+.+...|+..+|..-+.+++ +-+.+|-.++ T Consensus 67 ~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~ 146 (522) T protein:vir:10 67 LAVLPPQTSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGKDGLKT 146 (522) T ss_pred HhhcCCCCccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcCCCceE Confidence 33 44 3332221 111 111 1122334456667777776666 4556676542 Q ss_pred ----eeEEeecC-Ceee--EEeeEe----------------------eccce-----eEecccCCcceeeccCCCCc--- Q lcl|NC_020866. 140 ----EIVWNTSE-GQYF--PDKLIW----------------------RDPRH-----FRFQKSDLATPLLLDDNGAE--- 182 (547) Q Consensus 140 ----Ei~w~~~~-g~~~--~~~l~~----------------------~~~~~-----~~~~~~~~~~~~~~~~~~~g--- 182 (547) +.++..++ |.+. ..++.. .+..+ ..++..+......... ..+ T Consensus 147 ~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~-~~~~~~ 225 (522) T protein:vir:10 147 FPLTRYVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSGRWVWHQE-AFDKII 225 (522) T ss_pred EEcceEEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCCceEEEEc-cCCccc Confidence 33332221 1111 000000 00000 1111111111111110 111 Q ss_pred ------eeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHH Q lcl|NC_020866. 183 ------VPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRA 256 (547) Q Consensus 183 ------~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~a 256 (547) ..+...-|++.|+...+|+.||.|....++.-..--+...+.-+..+++---|..++.-+. .... 
..+ T Consensus 226 ~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~-~~~~--~~l--- 299 (522) T protein:vir:10 226 PDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSS-TTKP--ATI--- 299 (522) T ss_pred cccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeecccc-cccc--ccc--- Confidence 1222335889999999999999999999999888878888888888888766665543221 1110 111 Q ss_pred HHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCccccccchhHHHHHHHHH--HHHH Q lcl|NC_020866. 257 VANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGHAVSQEHRLVQED--IETS 334 (547) Q Consensus 257 l~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~a~~~vh~~v~~~--~~~a 334 (547) .. +..++++-....++..++.. .++....-...|+-+.+.|+.+.+ +++.-+++..-..||+....+. .+-. T Consensus 300 -~~-~~~~~~v~g~~~~v~~~~~~-~~~d~~~~~~~i~~~~~ri~~aFl---~~~~~d~~rvTAtEV~~r~~E~~~~LGp 373 (522) T protein:vir:10 300 -AK-AGNGAIVQGRPEDVAVIQVG-KTADFSTAANMATAIEKRLLEAFL---VMNVRNAERVTAEEVRLTQLELEQQLGG 373 (522) T ss_pred -cC-CCCcceecCCCccceeeccc-ccccchHHHHHHHHHHHHHHHHHh---hccCCCCCCCCHHHHHHHHHHHHHHhhH Confidence 11 12233343444556666543 244444567888899999998864 2233334455667777554332 2222 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCC----Ccc--cccE-E----EecCcc----ccCHHHHHHHHHHHHh-CCCC-----CC Q lcl|NC_020866. 335 DAMALAAILNRDLIRPWIQLEYG----PQK--RYPR-I----KIARPK----TEDLTKLASSLDTLVR-LGME-----IE 393 (547) Q Consensus 335 d~~~i~~~ln~~li~~l~~~Nfg----~~~--~~p~-~----~~~~~~----~ed~~~~a~~~~~l~~-~G~~-----v~ 393 (547) -.. .|+..++-|++..-|. .+. .+|. + ...... 
..++..+...++.+.+ +|-+ |+ T Consensus 374 v~~----rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~~v~~is~Laraq~~~~l~~~~~~i~~~~~p~~~~~~id 449 (522) T protein:vir:10 374 IFS----LLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPTIVAGVNALGRGQDRESLTAFVGTIAQTLGPEALMQYLN 449 (522) T ss_pred HHH----HHHHHHHHHHHHHHHHHHHhcCCCCCCCccccccccccchhHHHHHHHHHHHHHHHHHHHHhhCchhhhhcCC Confidence 222 2222233333222211 110 1111 1 011111 1122333334444432 2311 33 Q ss_pred H----HHHHHHhCCCCC---CCcchhccccccccccccccccccccc---CCccccccccccccchhhhhhcccchhhhc Q lcl|NC_020866. 394 E----NEVRSRFGFSQP---KPGAKLLRPLENPATQAAPPGMDPGAG---DLKTAERKLKPLSGQADLSRGDLSLNSEMP 463 (547) Q Consensus 394 ~----~~~~e~~Glp~p---~~~e~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (547) . +++.+.+|+|.+ +..|++...........+........+ ..+.+.+..++ T Consensus 450 ~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~------------------- 510 (522) T protein:vir:10 450 PLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTKNP------------------- 510 (522) T ss_pred HHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccccH------------------- Confidence 2 567778899843 233333222211111111000000000 00000000000 Q ss_pred ccccccchhHHHHHHHHHHHHHHH Q lcl|NC_020866. 464 STGQKTGGAEIAALTDQLELEGQP 487 (547) Q Consensus 464 ~~~~~~~~~~~~~l~~~~~~~~~~ 487 (547) +..++|..-. .. T Consensus 511 --------~~~~~~~~~~----~~ 522 (522) T protein:vir:10 511 --------QLMDEEQPPM----EE 522 (522) T ss_pred --------HHHHHhCCCC----CC Confidence 0000000000 00 No 237 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=46.58 E-value=0.73 Score=21.33 Aligned_cols=421 Identities=10% Similarity=0.086 Sum_probs=172.5 Q ss_pred CCCCCeeeCCCCCcccchh--hhhh-------hhhhhhhhhHHHHhhhhccCCCHH-HHHHHHHhh---cCC---CHHHH Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQ--LTEE-------VAAPTVTGVRSPLSGYPGDGLNPL-RLAQILRAA---DHG---DPVRY 64 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~--l~~~-------~a~~~~~~~~~~~~~~~~~~l~p~-~~~~ilr~a---~~g---~~~~~ 64 (547) |+= -++.-+|.-++..+ .+++ .+.|...+... ..+..+..+. .+..+-.+. -.+ +-..+ T Consensus 1 m~f--~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~---~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eL 75 (523) T protein:vir:68 1 MKF--NILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAK---EYEVSENEAQQTYNAMFQRMFGSQEPGLKSTREL 75 (523) T ss_pred CCC--chhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcce---eeeccccccccccchhhhhhhhccccccchHHHH Confidence 551 13333343333211 1111 12222222100 0111111111 001111111 011 33455 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHHHhcCC-----ceEecCCC---CHHHHHHHHHHHHHHhCCCHHHHHHHHHH-hhhhc Q lcl|NC_020866. 65 LDLAEVIEERDPHYLGVIGTRKRSVSQLD-----ITVEPASD---DPQDVKIADMVRDWLKREELQQEVFHILD-CISKG 135 (547) Q Consensus 65 ~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~-----w~v~p~~~---~~~~~~~ae~v~~~l~~~~~~~~i~~~l~-a~~~G 135 (547) ..-|+.|. .+|.|-++++....-+.-.+ -.|.-.+. .....++.++.+..++-++|+.-..++.. -...| T Consensus 76 I~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDg 154 (523) T protein:vir:68 76 IDTYRNLM-TNYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDS 154 (523) T ss_pred HHHHHHHh-hccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeee Confidence 66777785 69999999998887765443 23332221 11223455555555555566655555542 22244 Q ss_pred CeeeeeEEeecCCeeeEEeeEeeccceeEe-----cccCCcc--------eeeccC-------------CCCceeeccCc Q lcl|NC_020866. 136 YSFTEIVWNTSEGQYFPDKLIWRDPRHFRF-----QKSDLAT--------PLLLDD-------------NGAEVPLPAFR 189 (547) Q Consensus 136 ~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~-----~~~~~~~--------~~~~~~-------------~~~g~~l~~~k 189 (547) --.+.++-...+..-.+.++..++||.+++ +....+. ..+... .+.++.+|.. 
T Consensus 155 Ri~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~d- 233 (523) T protein:vir:68 155 RIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKA- 233 (523) T ss_pred EEEEEEEeeCCCccccceeeeeeCCcceeEEEeecCCCCcchhhhhhhhhheeeccccccccccccccCCCcceecchh- Confidence 445555544444444566788888887765 2111111 111111 1233445544 Q ss_pred eEEEeecCC--cCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe-EEEeCCCCCCHHHHH--HHHHHHHHHhcC- Q lcl|NC_020866. 190 FIFADIPAK--SGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL-RIGKWQPGASEQDKD--TLFRAVANIAGD- 263 (547) Q Consensus 190 ~i~~~~~~~--~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~-~vgk~~~~~~~~~~~--~l~~al~~~~~~- 263 (547) -|+|.|..- .+.-.=.|.|+.+..++-=-+...-..+.| -.-=-|- ||.-.+.|.-+..+. =|.+.+..+.+- T Consensus 234 AI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIY-RitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKl 312 (523) T protein:vir:68 234 AIVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIY-RITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRI 312 (523) T ss_pred heeeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHH-hhhccccceEEEEecCCCCchhHHHHHHHHHHhhccee Confidence 455555321 111122377777777764333322222222 1222342 222223232222221 122222222211 Q ss_pred -----eeE--------------Eec-----CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch--hccCcccc- Q lcl|NC_020866. 264 -----CAA--------------IIP-----ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT--TTTDAVAG- 316 (547) Q Consensus 264 -----~~~--------------vip-----~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt--lts~~~gg- 316 (547) ++- -+| .|++|..+..+.+-|. ..=++|..+.+-+++--.. 
|.+++++- T Consensus 313 vYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlge----m~DV~YF~kkLy~aLnVP~sRl~~~~~~f~ 388 (523) T protein:vir:68 313 AYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGN----MEDVRWFRNALYMALRIPITRIPSDQGGIQ 388 (523) T ss_pred EEeccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcCh----HHHHHHHHHHHHHHhCCcceeecCCCccee Confidence 111 133 5789999865443333 3347899999999853322 22232111 Q ss_pred -ccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCCcccccEEEecCc----cccCHHHHHHHHHH Q lcl|NC_020866. 317 -GHAVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLE------YGPQKRYPRIKIARP----KTEDLTKLASSLDT 384 (547) Q Consensus 317 -s~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~N------fg~~~~~p~~~~~~~----~~ed~~~~a~~~~~ 384 (547) +++..=+..++ |...+......++..|..-|-..|+--| |..-...-++.|... +-.+.+-+.+++.. T Consensus 389 ~Gr~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~ 468 (523) T protein:vir:68 389 FDAGTSITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINM 468 (523) T ss_pred cccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHH Confidence 13322233444 4455666666666666654444444333 111112223333221 12233334445554 Q ss_pred HHhC----CCCCCHHHHHHHh-CCCCCCCcchhccccccc-ccccccccccccccCCccccccccccccchhh Q lcl|NC_020866. 385 LVRL----GMEIEENEVRSRF-GFSQPKPGAKLLRPLENP-ATQAAPPGMDPGAGDLKTAERKLKPLSGQADL 451 (547) Q Consensus 385 l~~~----G~~v~~~~~~e~~-Glp~p~~~e~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (547) |..+ |--++.+|+++.. .+.+ +++.....+- .....+-...+. 
....++ T Consensus 469 l~~~dpyvGky~s~~yi~k~ILr~tD----eei~~~~kqI~~E~k~~~~~~p~--------------~e~~~f 523 (523) T protein:vir:68 469 LQMAEPFIGKYISHRTAMKDILQMSD----EEIEQEAKQIEEESKEARFQDPD--------------QEQEDF 523 (523) T ss_pred HHHhhhhhcccchhHHHHHHHhccCH----HHHHHHHHHHHHHhhcCCCCCCc--------------hhhhcC Confidence 4433 4457889987754 5542 2111110000 000000000000 011111 No 238 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=42.85 E-value=0.87 Score=20.92 Aligned_cols=414 Identities=9% Similarity=0.096 Sum_probs=164.5 Q ss_pred CCCCCeeeCCCCCcccchhhhhhh---hhhh--hhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEV---AAPT--VTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERD 75 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~---a~~~--~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D 75 (547) ||.+-+ .|..-+ .+.+.. .+.. .-..++-+..|. .|.+. -..... .-.+...+| | T Consensus 1 m~~~~~----~~~~~~--~~~~r~~~l~~~R~~~e~~w~e~~~~~----lP~~~---~~~~~~-~~~~~~~~~------d 60 (532) T protein:vir:99 1 MAEVEK----TGFAAD--GAAAAYNRLKNDRGAYETRAEDCATYT----IPSVF---PSATAD-GSTSYTTPW------Q 60 (532) T ss_pred Ccchhh----ccccHH--HHHHHHHHHHHHhhHHHHHHHHHHHHh----hhccc---CCCCCc-chhhccccc------c Confidence 985332 233221 121111 1100 111112222221 12110 011111 112222333 7 Q ss_pred HHHHHHHHHHHHHHhcC------Cc-eEecCCC-------CHH-HHH-------HHHHHHHHHhCCCHHHHHHHHH-Hhh Q lcl|NC_020866. 76 PHYLGVIGTRKRSVSQL------DI-TVEPASD-------DPQ-DVK-------IADMVRDWLKREELQQEVFHIL-DCI 132 (547) Q Consensus 76 ~~v~s~l~~Rk~~v~~~------~w-~v~p~~~-------~~~-~~~-------~ae~v~~~l~~~~~~~~i~~~l-~a~ 132 (547) ++-.-++++....+.+. +| ++.+.+. ++. 
..+ +.+.+...|++.+|..-+.+++ +-+ T Consensus 61 st~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~ 140 (532) T protein:vir:99 61 SIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLL 140 (532) T ss_pred chHHHHHHHHHHHHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHH Confidence 77777777777666643 45 3333211 000 011 1123334455667777776655 445 Q ss_pred hhcCeee---------------------eeEEeecC-CeeeEEeeEee------------------------ccceeEe- Q lcl|NC_020866. 133 SKGYSFT---------------------EIVWNTSE-GQYFPDKLIWR------------------------DPRHFRF- 165 (547) Q Consensus 133 ~~G~sv~---------------------Ei~w~~~~-g~~~~~~l~~~------------------------~~~~~~~- 165 (547) .+|-+++ +.++..+. |.+ ..+..+ |...|.+ T Consensus 141 ~~G~a~l~~~~~~~~~~~~~~f~~~pl~~y~v~~d~~G~v--~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~ 218 (532) T protein:vir:99 141 VAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNV--LQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIY 218 (532) T ss_pred hHCcEeEEecccccccCcccceEEEEcCeEEEeeCCCCCe--eeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEE Confidence 5665543 22222221 211 011100 0001111 Q ss_pred ----cccCCcce-eeccCCCC-------ceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_020866. 166 ----QKSDLATP-LLLDDNGA-------EVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYG 233 (547) Q Consensus 166 ----~~~~~~~~-~~~~~~~~-------g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G 233 (547) ...++..+ ......+. +.++..+-|++.|+...+|..||.|....++.-..--+...+.-+...++-- T Consensus 219 ~~v~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~ 298 (532) T protein:vir:99 219 THVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISS 298 (532) T ss_pred EEEEecCCCCeeEEEEeecCceecccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 00111101 11110111 1123334689999999999999999999999988877777788888887765 Q ss_pred CCeEEEeCCCCCCHHHHHHHHHHHHHH-hcCeeEEecCC-ceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhcc Q lcl|NC_020866. 
234 QPLRIGKWQPGASEQDKDTLFRAVANI-AGDCAAIIPET-MSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTT 311 (547) Q Consensus 234 ~P~~vgk~~~~~~~~~~~~l~~al~~~-~~~~~~vip~~-~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts 311 (547) -|+.+..-+ +.... .++ ....+.++|.. .++..++... ++....=...|+.+...|+++.+...++. T Consensus 299 ~~~~lv~p~-g~~~~---------~~~~~~~~g~~v~g~~~~i~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~ 367 (532) T protein:vir:99 299 KVLFFVNPN-GVTQI---------RRVAKANTGDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNSAVQ 367 (532) T ss_pred CCCceeccc-cccch---------hhhccCCCcceecCCcccceeeeccc-ccchhHHHHHHHHHHHHHHHHHhhhhccc Confidence 555444322 21111 122 11233445533 3466665442 34444457789999999999986654433 Q ss_pred CccccccchhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhcC---------C--Ccc-cccEEEecCcc----cc Q lcl|NC_020866. 312 DAVAGGHAVSQEHRLVQED--IETSDAMALAAILNRDLIRPWIQLEY---------G--PQK-RYPRIKIARPK----TE 373 (547) Q Consensus 312 ~~~ggs~a~~~vh~~v~~~--~~~ad~~~i~~~ln~~li~~l~~~Nf---------g--~~~-~~p~~~~~~~~----~e 373 (547) . +++..-..||+....+. .+-.. -+.|+..++-||+..-| + |+. .-|.+.- ... .. T Consensus 368 ~-d~~r~TAtEV~~r~~E~~~~LGpv----~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~iv~-~is~Laraq 441 (532) T protein:vir:99 368 R-GGDRVTAEEIRYVAGELEDTLGGV----YSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIAT-GLEALGRGH 441 (532) T ss_pred C-CCCcccHHHHHHHHHHHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhcccceee-cchHHHHHH Confidence 2 23334556666543332 22222 22233334444433322 1 111 1122211 111 11 Q ss_pred CHHHHHHHHHHHHhC-CC---CCCH----HHHHHHhCCCCC---CCcchhcccccccccccccccccccccCCccccccc Q lcl|NC_020866. 374 DLTKLASSLDTLVRL-GM---EIEE----NEVRSRFGFSQP---KPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKL 442 (547) Q Consensus 374 d~~~~a~~~~~l~~~-G~---~v~~----~~~~e~~Glp~p---~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (547) ++..++..++.|.+. |- .++. +++.+.+|+|.+ ...|++.....+.+...+ ................. 
T Consensus 442 ~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~~~-~~~a~~~~~~~~~~~~~ 520 (532) T protein:vir:99 442 DLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAG-MVTAGQQMGAAGGQAAA 520 (532) T ss_pred HHHHHHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcc Confidence 223333344444333 21 1322 567778899643 233333222111110000 00000000000000000 Q ss_pred cccccchhhhhhcccchhhhcccccccchh Q lcl|NC_020866. 443 KPLSGQADLSRGDLSLNSEMPSTGQKTGGA 472 (547) Q Consensus 443 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (547) .+ ... .++. +.+ T Consensus 521 ~~--------------~~~--~~~~--~~~ 532 (532) T protein:vir:99 521 AM--------------MQQ--QAGM--PTQ 532 (532) T ss_pred hh--------------HHh--hcCC--CCC Confidence 00 000 0000 000 No 239 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=42.83 E-value=0.87 Score=20.92 Aligned_cols=465 Identities=14% Similarity=0.066 Sum_probs=143.5 Q ss_pred CCCCCeee--CCCCCcccchhhhhhhhhhhhhhhHHHHhhhh--------------ccCCCHHHHHHHHHhhcCCCHHHH Q lcl|NC_020866. 1 MAQKPVLL--DRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYP--------------GDGLNPLRLAQILRAADHGDPVRY 64 (547) Q Consensus 1 ma~~~~~~--~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~--------------~~~l~p~~~~~ilr~a~~g~~~~~ 64 (547) |+-+.-.. +.-|.|.+.....+.. ......+++.+..+. +.-.+.. ...+|+..+.+- . T Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~-~~~~l~~~g~p~--~- 96 (776) T protein:vir:93 22 LSPGEDAAQREKPANPLDSEQAVELH-SRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWSQD-EIDELKERGQAP--T- 96 (776) T ss_pred CCCCCcccchhcccCCCCCHHHHHHH-HHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHH-HHHHHHhcCCce--E- Confidence 32211111 1123333332222211 111111111111111 1111111 111121111110 0 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHHH--hcCCceEecCCCCHHHHHHHHHHHHHHhC----CCHHHHHHHHH-HhhhhcCe Q lcl|NC_020866. 
65 LDLAEVIEERDPHYLGVIGTRKRSV--SQLDITVEPASDDPQDVKIADMVRDWLKR----EELQQEVFHIL-DCISKGYS 137 (547) Q Consensus 65 ~~L~~~m~~~D~~v~s~l~~Rk~~v--~~~~w~v~p~~~~~~~~~~ae~v~~~l~~----~~~~~~i~~~l-~a~~~G~s 137 (547) .--.|+..++.....- .+.+|.|.|. ++.+.+.|+.+...++. -+++.....+. +++..|+. T Consensus 97 ---------~~N~i~~~i~~v~g~~~~nr~~~~~~p~--~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G 165 (776) T protein:vir:93 97 ---------VYNVISQSVNWIIGSEKRGRSDFKVLPR--RKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIG 165 (776) T ss_pred ---------EecchHHHHHHHHHHHHhCCcceEEecC--ChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcc Confidence 0111222222222111 2347999985 34567778877766654 34555555555 78999999 Q ss_pred eeeeEEeecCCeeeEEeeEeeccceeEecccCCc---------------------------------------------- Q lcl|NC_020866. 138 FTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLA---------------------------------------------- 171 (547) Q Consensus 138 v~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~---------------------------------------------- 171 (547) ++|+.|+++.... +.....++|+.|.+|..... T Consensus 166 ~~~v~~d~~~~~~-~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~ 244 (776) T protein:vir:93 166 WLESQVQDENDGE-PIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDD 244 (776) T ss_pred eEEEEeeccCCCC-ceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhc Confidence 9999998763321 22334445444444321110 Q ss_pred -----------------------------ceeec--------------cCCC--C-----------------c------- Q lcl|NC_020866. 172 -----------------------------TPLLL--------------DDNG--A-----------------E------- 182 (547) Q Consensus 172 -----------------------------~~~~~--------------~~~~--~-----------------g------- 182 (547) +++.. ..++ . 
| T Consensus 245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~ 324 (776) T protein:vir:93 245 IDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVS 324 (776) T ss_pred ccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehhe Confidence 00000 0000 0 0 Q ss_pred --------------------eeec--cCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe Q lcl|NC_020866. 183 --------------------VPLP--AFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGK 240 (547) Q Consensus 183 --------------------~~l~--~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk 240 (547) .+++ .+-|+++.+.......+|.|+.+.+..+--+.+...+.....+-+ .++.+ T Consensus 325 ~~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~~--~~~~~-- 400 (776) T protein:vir:93 325 PMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYILST--NKVLM-- 400 (776) T ss_pred eeeeeEEEEEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhhcC--Cceee-- Confidence 0111 123666666666667789999999998887777655544333322 23322 Q ss_pred CCCCCCHHHHHHHHHHHHHHhcCeeEEecCCc--eEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCcc-c-- Q lcl|NC_020866. 241 WQPGASEQDKDTLFRAVANIAGDCAAIIPETM--SIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAV-A-- 315 (547) Q Consensus 241 ~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~--~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~-g-- 315 (547) +.++-+. .+.+.+.. ...++++.+-.|. .+.+...... ...+-.++.+.+..|-. +.|-+-.+.|. + T Consensus 401 -~~gav~~-~d~~~~~~--~rp~~vi~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~i~~-~tGi~~~~~G~~~n~ 472 (776) T protein:vir:93 401 -EEGAVDD-IDEFRREA--ARPDAVMTVKNGKLGAVKMDVDRDL---APAHLELASRSIQMIQQ-VGGVTDEMLGRTTNA 472 (776) T ss_pred -ccccccc-hHHHHHhc--ccCCceeeeCCccccccccccCcCc---cHHHHHHHHHHHHHHHH-hhCcChHHhCCCcch Confidence 1222211 12222211 1223333333333 3444433222 12356667777666544 45543211111 1 Q ss_pred -cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCcccccEEEecCccccCHHHHHHHHHHHHhCCCCC Q lcl|NC_020866. 
316 -GGHAVSQEHRLVQEDIETSDAMALAAILNRDLIRPWIQLE--YGPQKRYPRIKIARPKTEDLTKLASSLDTLVRLGMEI 392 (547) Q Consensus 316 -gs~a~~~vh~~v~~~~~~ad~~~i~~~ln~~li~~l~~~N--fg~~~~~p~~~~~~~~~ed~~~~a~~~~~l~~~G~~v 392 (547) ++.|.... .+--...+....+.+...+ +.+.+.++.++ |-...+.-++.-..... ..... +-|+ T Consensus 473 ~Sg~ai~~~-~~~~~~~~~~~~dn~~~~~-~~~~~~~l~li~~~~~~~r~~ri~~~~~~~----~~v~i-----n~~~-- 539 (776) T protein:vir:93 473 VSGVAIQAR-QEQGSVATNKLFDNLRLAF-QQHGEKELSLIEQYMTEEKQFRITNSRGNP----EYVTV-----NDGL-- 539 (776) T ss_pred hhHHHHHHH-HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcCcceEEEEeecCCCc----ceEEe-----cccc-- Confidence 11111111 1111112222333333333 22444444443 22333332222111110 00000 0111 Q ss_pred CHHHHH-HHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcccccccch Q lcl|NC_020866. 393 EENEVR-SRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQKTGG 471 (547) Q Consensus 393 ~~~~~~-e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (547) ..+.++ -+|.+- +.... + .+......+... ........ . T Consensus 540 ~~nd~~~~~~dv~--------v~~~~-------------~-----------~~s~r~~~~~~l-~ql~~~~~-------p 579 (776) T protein:vir:93 540 PENDITRTKADFI--------IDEAE-------------W-----------RATMRQAAVAEL-MEVIGKMP-------P 579 (776) T ss_pred hhhhhccceeeEE--------Eeecc-------------c-----------chhHHHHHHHHH-HHHHhhcC-------h Confidence 111111 112210 00000 0 000000000000 00000000 0 Q ss_pred hHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcc------CCHHH--------HHHHHHHHhcCCCHHHHHHHHHHHHHH- Q lcl|NC_020866. 472 AEIAALTDQLELEG-QPEIVAMIEEIRTMLEAA------SSLGE--------FSDMLDARYGEIGASQLASRISAAMLA- 535 (547) Q Consensus 472 ~~~~~l~~~~~~~~-~~~~~~~~~~i~~~l~~a------~s~ee--------~~~~L~~l~~~~~~~~l~~~l~~al~~- 535 (547) +....+...+.... -+....+.+++....... 
...++ ....+...-..+.....+.-..++-.- T Consensus 580 ~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea 659 (776) T protein:vir:93 580 EIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQYNDALAIATLEEQQAKARKAAAEA 659 (776) T ss_pred hhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHHHHHHhhhhhhHhhHHHHHHHHHH Confidence 00000001000000 011112222222211000 00000 000000000000000000000000000 Q ss_pred ------HHHHhHHhhccC Q lcl|NC_020866. 536 ------SEFAGREGALDG 547 (547) Q Consensus 536 ------A~~~G~~~~~~g 547 (547) |..........+ T Consensus 660 ~~~~aqa~~~~~~a~~~~ 677 (776) T protein:vir:93 660 QVAEAKAKHISRMAIREG 677 (776) T ss_pred HHHhhhhhhhhhcchhhh Confidence 011111011111 No 240 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=40.70 E-value=0.96 Score=20.68 Aligned_cols=413 Identities=10% Similarity=0.099 Sum_probs=161.4 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhh---hh--hhhhhHHHHhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVA---AP--TVTGVRSPLSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEERD 75 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a---~~--~~~~~~~~~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~D 75 (547) ||. .+|-.- ..+..... +. ..-..++-+..|. .|. .+...+.+.......+| | T Consensus 1 ~~~------~~~~~~--~~~~~r~~~l~~~R~~~e~~w~e~~~y~----lP~----~~~~~~~~~~~~~~~~~------d 58 (522) T protein:vir:94 1 MAE------REGFAA--EGAKAVYDRLKNGRQPYETRAQNCAAVT----IPS----LFPKESDNSSTEYTTPW------Q 58 (522) T ss_pred Ccc------cchhhH--HHHHHHHHHHHHHhhHHHHHHHHHHHHh----ccc----ccCCCCCcccccccccc------c Confidence 884 122111 11111110 10 0111112223331 121 11111222212222233 6 Q ss_pred HHHHHHHHHHHHHHhcC-----Cc-eEecCC--------CC---HHHHH----HHHHHHHHHhCCCHHHHHHHHH-Hhhh Q lcl|NC_020866. 
76 PHYLGVIGTRKRSVSQL-----DI-TVEPAS--------DD---PQDVK----IADMVRDWLKREELQQEVFHIL-DCIS 133 (547) Q Consensus 76 ~~v~s~l~~Rk~~v~~~-----~w-~v~p~~--------~~---~~~~~----~ae~v~~~l~~~~~~~~i~~~l-~a~~ 133 (547) ++-.-++++....+.+. +| ++.+.+ +. .+.++ +.+.+...|+..+|..-+.+++ +-+. T Consensus 59 st~~~a~~~Las~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~ 138 (522) T protein:vir:94 59 AVGARCLNNLAAKLMLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIV 138 (522) T ss_pred ccHHHHHHHHHHHHHhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHh Confidence 77777777776666654 44 221111 00 11111 1223334456667877776666 4456 Q ss_pred hcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcc-----------------------------------e----- Q lcl|NC_020866. 134 KGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLAT-----------------------------------P----- 173 (547) Q Consensus 134 ~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~-----------------------------------~----- 173 (547) +|-+++=+. ....+.. ..+..+|=..|.+..+..++ + T Consensus 139 ~G~a~l~~~-~~~~~~~--~~~~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~ 215 (522) T protein:vir:94 139 SGNCLLYIP-EPEQGTY--SPMRMYRLVSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIY 215 (522) T ss_pred hCcEeEeee-ccCCCce--eeEEEEEcceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEE Confidence 776553111 1111110 00111111111111110000 0 Q ss_pred -------eeccCCC-------CceeeccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEE Q lcl|NC_020866. 
174 -------LLLDDNG-------AEVPLPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIG 239 (547) Q Consensus 174 -------~~~~~~~-------~g~~l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vg 239 (547) ...+-.+ ...++..+-|++.++...+|+.||.|....+..-..--+...+.-+..+++---|..++ T Consensus 216 ~~~~~~~~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v 295 (522) T protein:vir:94 216 RQDDEYLRYEEVEGIEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLV 295 (522) T ss_pred eeCCceeEEeeccCceecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceee Confidence 0000000 01123344689999999999999999999999998888888889999999887777655 Q ss_pred eCCCCCCHHHHHHHHHHHHHHhcCeeEEec-CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCcccccc Q lcl|NC_020866. 240 KWQPGASEQDKDTLFRAVANIAGDCAAIIP-ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAVAGGH 318 (547) Q Consensus 240 k~~~~~~~~~~~~l~~al~~~~~~~~~vip-~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ggs~ 318 (547) .-+. .... ..+ ... ..+.++| ...+|..++... ++....=...|+.+...|..+.+...++.. +++.. T Consensus 296 ~~~g-~~~~--~~~----~~~--~~g~~v~g~~~~v~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~-~~~r~ 364 (522) T protein:vir:94 296 NPNG-ITQP--RRL----NKA--ATGEFVAGRVEDINFLQLTK-GQDFTIAKSVADAIEQRLGWAFLLNSAVQR-NAERV 364 (522) T ss_pred cccc-cccc--hhe----ecc--CCceeecCCcccceeeeccc-ccchhHHHHHHHHHHHHHHHHHhhhhhccC-CCccc Confidence 4322 1111 111 111 2233444 445577766443 334444567889999999999876544433 23345 Q ss_pred chhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhcCC----Ccc--ccc----EEEecCc-----cccCHHHHHHH Q lcl|NC_020866. 319 AVSQEHRLVQE--DIETSDAMALAAILNRDLIRPWIQLEYG----PQK--RYP----RIKIARP-----KTEDLTKLASS 381 (547) Q Consensus 319 a~~~vh~~v~~--~~~~ad~~~i~~~ln~~li~~l~~~Nfg----~~~--~~p----~~~~~~~-----~~ed~~~~a~~ 381 (547) -..||+....+ ..+-... +.|+..++.||+..-|. .+. .+| ++++... ...++..+... 
T Consensus 365 TAtEV~~r~~E~~~~LG~v~----~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~~l~~~ 440 (522) T protein:vir:94 365 TAEEIRYVAGELEATLGGVY----SVQSQELQLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLEKLTQA 440 (522) T ss_pred cHHHHHHHHHHHHHHHhHHH----HHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHHHHHHH Confidence 55677654433 2222222 22333344444332221 111 111 2222111 01112222222 Q ss_pred HHHHHhCCC-----CCCH----HHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhh Q lcl|NC_020866. 382 LDTLVRLGM-----EIEE----NEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLS 452 (547) Q Consensus 382 ~~~l~~~G~-----~v~~----~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (547) ++.+.+++= .|+. +++.+.+|+|.+. T Consensus 441 ~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~--------------------------------------------- 475 (522) T protein:vir:94 441 VNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAG--------------------------------------------- 475 (522) T ss_pred HHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhh--------------------------------------------- Confidence 222222220 0111 2233333332110 Q ss_pred hhcccchhhhcccccccchhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHccCCHHHHHHH Q lcl|NC_020866. 453 RGDLSLNSEMPSTGQKTGGAEIAALTDQLEL-EGQPEIVAMIEEIRTMLEAASSLGEFSDM 512 (547) Q Consensus 453 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~~~~i~~~l~~a~s~ee~~~~ 512 (547) -...++.+.++.++... +...+......+-.++....++.++.... 
T Consensus 476 --------------ivr~~ee~~~~~~q~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 522 (522) T protein:vir:94 476 --------------LLLTQDEKIQRMAEQSSQQAVVQGASAAGANMGAAVGQGAGEDMAQA 522 (522) T ss_pred --------------ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccchhhhcC Confidence 00012222222222110 00000000000001111111112221111 No 241 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=40.66 E-value=0.96 Score=20.68 Aligned_cols=424 Identities=11% Similarity=0.076 Sum_probs=174.2 Q ss_pred CCCCCeeeCCCCCcccchhh---------hhhhhhhhhhhhHHHH----hhhhccCCCHHHHHHHHHhhcCCCHHHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQL---------TEEVAAPTVTGVRSPL----SGYPGDGLNPLRLAQILRAADHGDPVRYLDL 67 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l---------~~~~a~~~~~~~~~~~----~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L 67 (547) ||.-.-++.-+|.-++..+. .+.++.+...+....+ ...+..++..+-+..+. ..--+-..+..- T Consensus 1 ~~~~~~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e--~~~~~~~eLI~~ 78 (524) T protein:vir:10 1 MANFNTILSFLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNE--PEVKNTRELIDT 78 (524) T ss_pred CCchhhHHHHhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhccc--chhhhHHHHHHH Confidence 66544444333333222111 1112222222111110 00111111111111110 000123455667 Q ss_pred HHHHHhhhHHHHHHHHHHHHHHhcCC-----ceEecCCC---CHHHHHHHHHHHHHHhCCCHHHHHHHHHH-hhhhcCee Q lcl|NC_020866. 68 AEVIEERDPHYLGVIGTRKRSVSQLD-----ITVEPASD---DPQDVKIADMVRDWLKREELQQEVFHILD-CISKGYSF 138 (547) Q Consensus 68 ~~~m~~~D~~v~s~l~~Rk~~v~~~~-----w~v~p~~~---~~~~~~~ae~v~~~l~~~~~~~~i~~~l~-a~~~G~sv 138 (547) |+.|. .+|.|-++++....-+.-.+ ..|.-.+- ...-.++.++.+..++-++|+.-..++.. -...|--. 
T Consensus 79 YR~ma-~~pEvd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~ 157 (524) T protein:vir:10 79 YRNLM-NNYEVDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIF 157 (524) T ss_pred HHHHh-hccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEE Confidence 77775 69999999998887665442 22222111 11122345555555555555555444442 22234444 Q ss_pred eeeEEeecCCeeeEEeeEeeccceeEecc-----c-CCc-------ceeecc-------------CCCCceeeccCceEE Q lcl|NC_020866. 139 TEIVWNTSEGQYFPDKLIWRDPRHFRFQK-----S-DLA-------TPLLLD-------------DNGAEVPLPAFRFIF 192 (547) Q Consensus 139 ~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~-----~-~~~-------~~~~~~-------------~~~~g~~l~~~k~i~ 192 (547) +.++-...+..-.+.++..++||.+.+.. . +.. ...+.. .+..++.+|..- |+ T Consensus 158 fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~dA-Iv 236 (524) T protein:vir:10 158 FHKIINPKKMKDGVQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRAA-VV 236 (524) T ss_pred EEEEeeCCCccccceeeeeeCCccceeeeeecccCcccchhhcchhhheeecCCCcccccCcceecCCcceecchhh-ee Confidence 44444444444445677888888775511 1 111 111111 122334556554 44 Q ss_pred EeecCC--cCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe-EEEeCCCCCCHHHH--HHHHHHHHHHhcC---- Q lcl|NC_020866. 
193 ADIPAK--SGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL-RIGKWQPGASEQDK--DTLFRAVANIAGD---- 263 (547) Q Consensus 193 ~~~~~~--~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~-~vgk~~~~~~~~~~--~~l~~al~~~~~~---- 263 (547) |.|..- .+.-.=.|.|+.+..++-=-+...-..+.| -.-=-|- ||.-.+.|.-+..+ +=|.+.+..+.+- T Consensus 237 y~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIY-RitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYD 315 (524) T protein:vir:10 237 YAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIY-RITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYD 315 (524) T ss_pred eeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHH-hhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEe Confidence 443321 111233477887777765444333222222 1222342 22222322222222 1122222222111 Q ss_pred --eeE--------------Eec-----CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch--hccCccc---cc Q lcl|NC_020866. 264 --CAA--------------IIP-----ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT--TTTDAVA---GG 317 (547) Q Consensus 264 --~~~--------------vip-----~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt--lts~~~g---gs 317 (547) ++- -+| .|++|..+..+.+-|. ..=++|..+.+-+++--.. |-+++.+ ++ T Consensus 316 a~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlge----m~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~g 391 (524) T protein:vir:10 316 ASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSD----MDDVLYFRTALYRALRIPESRIPSESNSGVMFD 391 (524) T ss_pred ccCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcCh----HHHHHHHHHHHHHHhCCCchhccCCCCcccccc Confidence 111 133 5789999865443333 3347899999998853322 2233321 33 Q ss_pred cchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCCcccccEEEecCc----cccCHHHHHHHHHHHH Q lcl|NC_020866. 318 HAVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLE------YGPQKRYPRIKIARP----KTEDLTKLASSLDTLV 386 (547) Q Consensus 318 ~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~N------fg~~~~~p~~~~~~~----~~ed~~~~a~~~~~l~ 386 (547) ++..=+..++ |...+......++..|..-|-..|+--| |..-...-++.|... +-.+.+-+.+++..|. 
T Consensus 392 r~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~ 471 (524) T protein:vir:10 392 AGTAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLT 471 (524) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHH Confidence 4444344554 4456666666677666654444444433 211112223333221 1223333444555444 Q ss_pred hC----CCCCCHHHHHHHh-CCCCCCCcchhcccccccc-cccccccccccccCCccccccccccccchhh Q lcl|NC_020866. 387 RL----GMEIEENEVRSRF-GFSQPKPGAKLLRPLENPA-TQAAPPGMDPGAGDLKTAERKLKPLSGQADL 451 (547) Q Consensus 387 ~~----G~~v~~~~~~e~~-Glp~p~~~e~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (547) .+ |--++.+|+++.. .+.+ +++.....+-. ....+-...+. ....++ T Consensus 472 ~~dpyvGky~s~~yi~k~ILr~tD----eei~~~~k~I~~E~k~~~~~~~~--------------~~~~~f 524 (524) T protein:vir:10 472 MAEPFIGKYISHQTAMKDFLQMTD----EEINQEAKQIEEESKEARFQNPD--------------EEEEDF 524 (524) T ss_pred HhhhhhcccchhHHHHHHHhccCH----HHHHHHHHHHHHHhhcCCCCCCC--------------hhhhcC Confidence 43 4457899997754 5542 22111100000 00000000000 001111 No 242 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=38.11 E-value=1.1 Score=20.39 Aligned_cols=414 Identities=11% Similarity=0.042 Sum_probs=163.2 Q ss_pred ccchhhhhhhhhhh--hhhhHHHHhhhhccCCCHHHHHHHHHh---hcCCCHHHHHHHHHHHHhhhHHHHHHHHHHHHHH Q lcl|NC_020866. 15 VQRAQLTEEVAAPT--VTGVRSPLSGYPGDGLNPLRLAQILRA---ADHGDPVRYLDLAEVIEERDPHYLGVIGTRKRSV 89 (547) Q Consensus 15 ~~~~~l~~~~a~~~--~~~~~~~~~~~~~~~l~p~~~~~ilr~---a~~g~~~~~~~L~~~m~~~D~~v~s~l~~Rk~~v 89 (547) |+.+.|.+...... -..+...|... ....-|.+ ...... 
.+.....+...+| |++-.-++++.-..+ T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~-~~~~lP~~-~~~~~~~~~~~~~~~~~~~~i~------dst~~~a~~~Las~L 72 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCI-RKYIMPMR-SDFFSDLRSEGSINWNQNREVF------DSTAGDGLETLSSSL 72 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHH-HHHhcccc-cccccCCCCCcccccccccccc------cchHHHHHHHHHHHH Confidence 77777666543211 11112222211 00111111 111111 1111112233333 666666666666555 Q ss_pred hcC------Cc-eEecCCCC-HHHHH-------HHHHHHHHHhCCCHHHHHHHHH-HhhhhcCeeeeeEEeec-CCeeeE Q lcl|NC_020866. 90 SQL------DI-TVEPASDD-PQDVK-------IADMVRDWLKREELQQEVFHIL-DCISKGYSFTEIVWNTS-EGQYFP 152 (547) Q Consensus 90 ~~~------~w-~v~p~~~~-~~~~~-------~ae~v~~~l~~~~~~~~i~~~l-~a~~~G~sv~Ei~w~~~-~g~~~~ 152 (547) .+. +| ++.+.+.+ .+..+ +.+.+...|++.+|...+.+++ +-+.+|-+++=+....+ .+.+ T Consensus 73 ~~~ltPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~-- 150 (547) T protein:vir:10 73 HGSLTSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSV-- 150 (547) T ss_pred HHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCce-- Confidence 533 43 34443321 11112 2233345566678887766555 45556766432211110 0100 Q ss_pred EeeEeeccceeEecccCCcce----------------------------------------------------------- Q lcl|NC_020866. 153 DKLIWRDPRHFRFQKSDLATP----------------------------------------------------------- 173 (547) Q Consensus 153 ~~l~~~~~~~~~~~~~~~~~~----------------------------------------------------------- 173 (547) .+..+|-..|.+..+..+++ T Consensus 151 -r~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~ 229 (547) T protein:vir:10 151 -VFQSSPIQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRN 229 (547) T ss_pred -eEEEeecceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCcc Confidence 01111111111110000000 Q ss_pred --------------eeccCCCCcee-----eccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_020866. 
174 --------------LLLDDNGAEVP-----LPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQ 234 (547) Q Consensus 174 --------------~~~~~~~~g~~-----l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~ 234 (547) +.....+.... +..+-|++.++...+|+.||.|....+..-..--+...+.-+..+++--- T Consensus 230 ~~~~~~~~~~p~~s~~~e~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~ 309 (547) T protein:vir:10 230 AGTVLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVID 309 (547) T ss_pred ccceeeccccceeEEEEEecCceeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 00000000011 23345888999999999999999999999988777778888888888776 Q ss_pred CeEEEeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhccCcc Q lcl|NC_020866. 235 PLRIGKWQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTTDAV 314 (547) Q Consensus 235 P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts~~~ 314 (547) |.....- .+...+ .+...++..+......+.-++. ++....=...|+.+...|..+.+..-+... + T Consensus 310 pp~~v~~-~g~~~~---------~~~~pgg~~~~~~~~~v~pl~~---~~~~~~~~~~i~~~~~rI~~af~~d~~~~~-~ 375 (547) T protein:vir:10 310 PAIMVTE-RGLISD---------IDLGASGLTVVRDMESMKPFES---RARFDVSSIQLTDLRSAVRRIYYVDQLQMK-D 375 (547) T ss_pred Cceeccc-cccccc---------ceecCCeeeecCCcccceeeec---ccchHHHHHHHHHHHHHHHHHhhhhhhhcC-C Confidence 7654432 122211 1233444444333444553432 233334467788999999999876544322 2 Q ss_pred ccccchhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhcCCCcc--ccc---------EEEecCcccc-------C Q lcl|NC_020866. 315 AGGHAVSQEHRLVQED--IETSDAMALAAILNRDLIRPWIQLEYGPQK--RYP---------RIKIARPKTE-------D 374 (547) Q Consensus 315 ggs~a~~~vh~~v~~~--~~~ad~~~i~~~ln~~li~~l~~~Nfg~~~--~~p---------~~~~~~~~~e-------d 374 (547) +...-..||+....+. .+-.....+...|-.-+|...+.+-+..+. .+| .+.+....+. 
+ T Consensus 376 ~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~ 455 (547) T protein:vir:10 376 SPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQ 455 (547) T ss_pred CccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHH Confidence 3445567777554332 222222222222222222222222211110 111 1222222221 1 Q ss_pred HHHHHHHHHH---HHhCCC----CCCH----HHHHHHhCCCCC--CCcchhcccccccccccccccccccccCCcccccc Q lcl|NC_020866. 375 LTKLASSLDT---LVRLGM----EIEE----NEVRSRFGFSQP--KPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERK 441 (547) Q Consensus 375 ~~~~a~~~~~---l~~~G~----~v~~----~~~~e~~Glp~p--~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (547) +..+...++. |.+.+= .++. +++.+.+|+|.. ..++++-....+.+...+.....+..... .... T Consensus 456 ~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~qaa~~~~~--g~~m 533 (547) T protein:vir:10 456 AASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAAIAEAE--GNAM 533 (547) T ss_pred HHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH Confidence 1111122222 222210 1333 577788899842 22333221111110000000000000000 0000 Q ss_pred ccccccchhhhhhc Q lcl|NC_020866. 442 LKPLSGQADLSRGD 455 (547) Q Consensus 442 ~~~~~~~~~~~~~~ 455 (547) .....+.+.+.++. T Consensus 534 ~~~~~~~a~~~~~~ 547 (547) T protein:vir:10 534 EAQGKGQAALKENQ 547 (547) T ss_pred HhhcCcccchhccC Confidence 00000000000000 No 243 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=37.41 E-value=1.1 Score=20.31 Aligned_cols=450 Identities=10% Similarity=0.064 Sum_probs=175.8 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhhhhhhhHHHHhhhh-ccCCCHHHHHHHHH------------------hhcCCCH Q lcl|NC_020866. 
1 MAQKPVLLDRWGNPVQRAQLTEEVAAPTVTGVRSPLSGYP-GDGLNPLRLAQILR------------------AADHGDP 61 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~~~~~~~~~~~~~~-~~~l~p~~~~~ilr------------------~a~~g~~ 61 (547) |- +|.|-|.. +=++...+..+++ .+. +.|+..- .|..--.+|....+ ..+.... T Consensus 5 ~~-~~~~~~~~--~~~~~~~~~~~~~-~l~---~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (641) T protein:vir:94 5 MP-TPIIEDKE--SAKRKLSTDRIGG-VVI---SKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDA 77 (641) T ss_pred CC-cccccCCc--chhhcCCchhHHH-HHH---HHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchh Confidence 43 45554331 1122111122221 111 1111110 11111112222111 0010000 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHhc-----CCc-eEecCCCCHHHHHHHHHHHH----HHhCCCHHHHHHHHH-H Q lcl|NC_020866. 62 VRYLDLAEVIEERDPHYLGVIGTRKRSVSQ-----LDI-TVEPASDDPQDVKIADMVRD----WLKREELQQEVFHIL-D 130 (547) Q Consensus 62 ~~~~~L~~~m~~~D~~v~s~l~~Rk~~v~~-----~~w-~v~p~~~~~~~~~~ae~v~~----~l~~~~~~~~i~~~l-~ 130 (547) .....+ .|+++..+++.....+.+ .+| +++|.+.+ +.+.|+.+.+ .+...+|.+.+..++ + T Consensus 78 ~~r~ki------~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~e--d~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d 149 (641) T protein:vir:94 78 DWRHRI------NTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPE--LADAARVVKQLTKTKLEAASIRDIFETYVRN 149 (641) T ss_pred cccccc------cchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCC--hHHHHHHHHHHHHHHHhhcchHHHHHHHHHH Confidence 001111 377777777776666655 245 66664322 3334444443 344456677666666 7 Q ss_pred hhhhcCeeeeeEEeec-----------CCee-------------eEEeeEeeccceeEecccCC---------------- Q lcl|NC_020866. 131 CISKGYSFTEIVWNTS-----------EGQY-------------FPDKLIWRDPRHFRFQKSDL---------------- 170 (547) Q Consensus 131 a~~~G~sv~Ei~w~~~-----------~g~~-------------~~~~l~~~~~~~~~~~~~~~---------------- 170 (547) ++.+|-+++.+-|... 
++.+ .-.++.+++|..|.++...+ T Consensus 150 ~~~~g~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~ 229 (641) T protein:vir:94 150 LVLYGVSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREEL 229 (641) T ss_pred HhhcCceEEEeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhhHHHH Confidence 8889988777666421 1110 00011222221111110000 Q ss_pred ----------------------------------------cceee-c-----cC----------CC------Cceee-cc Q lcl|NC_020866. 171 ----------------------------------------ATPLL-L-----DD----------NG------AEVPL-PA 187 (547) Q Consensus 171 ----------------------------------------~~~~~-~-----~~----------~~------~g~~l-~~ 187 (547) .+.+. . ++ .+ .+.+. .. T Consensus 230 ~~l~~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~ 309 (641) T protein:vir:94 230 HELVTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCG 309 (641) T ss_pred HHHHhcCCCChhhcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCc Confidence 00000 0 00 00 00000 01 Q ss_pred CceEEEeecCCcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCeeEE Q lcl|NC_020866. 188 FRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCAAI 267 (547) Q Consensus 188 ~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~v 267 (547) .-|++.++....+..||.|....+..-....+...+.-+..+++..-|..++.-+......+. ..+.++.+. T Consensus 310 ~Pf~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l--------~~~PG~ii~ 381 (641) T protein:vir:94 310 SPFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDV--------KAKPGAVFK 381 (641) T ss_pred CCeEEecceecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeecccccccccee--------eccCCccee Confidence 147788888889999999999999999999999999999999888777766554322222111 223334444 Q ss_pred ecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcchhcc--Cccccc-cchhHHHHH--HHHHHHHHHHHHHHHH Q lcl|NC_020866. 
268 IPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQTTTT--DAVAGG-HAVSQEHRL--VQEDIETSDAMALAAI 342 (547) Q Consensus 268 ip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqtlts--~~~ggs-~a~~~vh~~--v~~~~~~ad~~~i~~~ 342 (547) ......+..+.... .........+++++..|.++.....+.. +...|. .-+.+|... .....+..-.+.+++. T Consensus 382 ~~~~~~v~pl~~~~--~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e 459 (641) T protein:vir:94 382 VAQHGSLQPIDMGR--QDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDS 459 (641) T ss_pred eCCCCcceeecCCc--cccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44444566553322 1222345678888888888754333322 112222 223444432 2223444555556655 Q ss_pred HHHHHHHHHHHhc--CCCc-----------------cccc-EE--EecC--ccccCHHHHHHHHHHHHh----CCC---- Q lcl|NC_020866. 343 LNRDLIRPWIQLE--YGPQ-----------------KRYP-RI--KIAR--PKTEDLTKLASSLDTLVR----LGM---- 390 (547) Q Consensus 343 ln~~li~~l~~~N--fg~~-----------------~~~p-~~--~~~~--~~~ed~~~~a~~~~~l~~----~G~---- 390 (547) +-..|+.+++.++ +... ...| .+ .++. .........+..++.|.. .|. T Consensus 460 ~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v 539 (641) T protein:vir:94 460 STLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQLLDISGRVPQI 539 (641) T ss_pred HHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeEeecchhHHHHHHHHHHHHHHHHHHhhcChhh Confidence 5555666655544 1100 0111 11 1111 111111122222332221 221 Q ss_pred --CCCH----HHHHHHhCCCCCCCcchhcccccccccccccccccccccCCccccccccccccchhhhhhcccchhhhcc Q lcl|NC_020866. 391 --EIEE----NEVRSRFGFSQPKPGAKLLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPS 464 (547) Q Consensus 391 --~v~~----~~~~e~~Glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (547) .++. ..+.+..|++.|.. .+.....+. ++.. 
T Consensus 540 ~d~~d~~~~~~~~~~~~g~~~p~~---~ir~~~~~~---~~~~------------------------------------- 576 (641) T protein:vir:94 540 GQSLDYALILEDLLRQMRFTDPMR---YIKKAEAPP---AAPP------------------------------------- 576 (641) T ss_pred hhcCCHHHHHHHHHHHhCCCCchh---hccCccCch---hHHH------------------------------------- Confidence 1222 34455667765532 222111000 0000 Q ss_pred cccccchhHHHHHHHHHHHHHHH--------HHHHH-HHHHHHHHHccC--CHHHHHHHHHHHhcCCCHHHH Q lcl|NC_020866. 465 TGQKTGGAEIAALTDQLELEGQP--------EIVAM-IEEIRTMLEAAS--SLGEFSDMLDARYGEIGASQL 525 (547) Q Consensus 465 ~~~~~~~~~~~~l~~~~~~~~~~--------~~~~~-~~~i~~~l~~a~--s~ee~~~~L~~l~~~~~~~~l 525 (547) ..-..+...+...++. ++..+ .+++.+.....+ .-+-+...+.+.-|+.+...| T Consensus 577 -------~~~~~~q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 641 (641) T protein:vir:94 577 -------IAPAEPGALPPEMMNSVGGGLNDQAIAGMTPEDVSDLASRIGIDTSDVAPEAMAAATQQITSGAL 641 (641) T ss_pred -------HHHHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHhhcCCchhhhHHHHhcccccccccCC Confidence 0000000000111111 11110 111111111100 001111222222233333333 No 244 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=27.60 E-value=1.8 Score=19.15 Aligned_cols=408 Identities=10% Similarity=0.137 Sum_probs=173.0 Q ss_pred CCCCCeeeCCCCCcccchhhhhhh-------hhhhhhhhHHHH---------hhhhccCCCHHHHHHHHHhhcCCCHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEV-------AAPTVTGVRSPL---------SGYPGDGLNPLRLAQILRAADHGDPVRY 64 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~-------a~~~~~~~~~~~---------~~~~~~~l~p~~~~~ilr~a~~g~~~~~ 64 (547) |++ +...+-+ .+....++.+ +.|...+....+ .++-..-++. 
...+ -+...+ T Consensus 3 ~~~---lf~f~~~-~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~---~~~~-----~~~~eL 70 (516) T protein:vir:10 3 FLD---LFKFWDR-VDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGI---DNNI-----SGTKDL 70 (516) T ss_pred chH---hcccccc-hhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeecc---cccc-----chHHHH Confidence 443 3333221 1111111111 122211111100 1111000111 0011 133455 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHHHhcCC-----ceEecCC-C--CHHHHHHHHHHHHHHhCCCHHHHHHHHHH-hhhhc Q lcl|NC_020866. 65 LDLAEVIEERDPHYLGVIGTRKRSVSQLD-----ITVEPAS-D--DPQDVKIADMVRDWLKREELQQEVFHILD-CISKG 135 (547) Q Consensus 65 ~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~-----w~v~p~~-~--~~~~~~~ae~v~~~l~~~~~~~~i~~~l~-a~~~G 135 (547) ..-|+.|. .+|.|.++++....-+.-.+ ..|.-.+ + ...-.++.++.+..++-++|+.-..++.. -...| T Consensus 71 I~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDg 149 (516) T protein:vir:10 71 INTYRQLI-NNPEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDS 149 (516) T ss_pred HHHHHHHh-hccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcc Confidence 66777885 69999999998887665442 2222211 1 12223455666666666666665555552 22244 Q ss_pred CeeeeeEEeecCCeeeEEeeEeeccceeEeccc----CC--c-------ceeecc-------------CCCCceeeccCc Q lcl|NC_020866. 136 YSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKS----DL--A-------TPLLLD-------------DNGAEVPLPAFR 189 (547) Q Consensus 136 ~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~----~~--~-------~~~~~~-------------~~~~g~~l~~~k 189 (547) --.+.++-. +..-.+.++..++||.+.+... +. . ....+. 
..+.++.+|.+ T Consensus 150 Ri~fhKiid--~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~d- 226 (516) T protein:vir:10 150 RIFFHKIMP--NPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRS- 226 (516) T ss_pred eEEEEEEec--CccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechh- Confidence 444454322 3333456777788876654221 00 0 111111 01123344544 Q ss_pred eEEEeecC---CcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe-EEEeCCCCCCHHHH--HHHHHHHHHHhcC Q lcl|NC_020866. 190 FIFADIPA---KSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL-RIGKWQPGASEQDK--DTLFRAVANIAGD 263 (547) Q Consensus 190 ~i~~~~~~---~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~-~vgk~~~~~~~~~~--~~l~~al~~~~~~ 263 (547) -|+|.|.. ..++.+ .|.|+.+..++-=-+...-..+.| -.-=-|- ||.-.+.|.-+..+ +=|.+.+....+. T Consensus 227 AI~y~hSGL~d~~~~~i-~syLhkAiKp~NQLkm~EDAlVIY-RitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNk 304 (516) T protein:vir:10 227 AVVYASSGLMDCSDRGI-IGYLHNAVKPANQLKLLEDAMVIY-RITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNR 304 (516) T ss_pred heeeecccceeCCCCce-eeeehhhhHhHHhhHHHHhhHHHH-hhhccccceEEEEecCCCCchhHHHHHHHHHHhcCce Confidence 46666632 234444 788888888765444433333322 2222342 22222322222222 1122222222211 Q ss_pred ------eeE--------------Eec-----CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch--hccCccc- Q lcl|NC_020866. 264 ------CAA--------------IIP-----ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT--TTTDAVA- 315 (547) Q Consensus 264 ------~~~--------------vip-----~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt--lts~~~g- 315 (547) ++- -+| .|++|..+..+.+-|. ..=++|..+.+-+++--.. 
|.+++++ T Consensus 305 lvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlge----m~DV~YF~kkLy~aLnVP~sRl~~e~~~~ 380 (516) T protein:vir:10 305 VVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGD----MDDVRWFNKKLYEALRIPLSRIPRDDGGM 380 (516) T ss_pred eEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcCh----HHHHHHHHHHHHHHhCCCcccccCCCCce Confidence 111 123 5789999865443333 3347899999999853322 2233321 Q ss_pred --cccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCCcccccEEEecCc----cccCHHHHHHHH Q lcl|NC_020866. 316 --GGHAVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLE------YGPQKRYPRIKIARP----KTEDLTKLASSL 382 (547) Q Consensus 316 --gs~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~N------fg~~~~~p~~~~~~~----~~ed~~~~a~~~ 382 (547) ++++..-+..++ |...+......++..|..-|-..|+--| |..-...-++.|... +-.+.+-+.+++ T Consensus 381 ~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~ 460 (516) T protein:vir:10 381 VIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRV 460 (516) T ss_pred eeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHH Confidence 145444445554 4456666666677766654444444433 211112223333221 122333344455 Q ss_pred HHHHhC----CCCCCHHHHHHH-hCCCCCCCcch---hcccccccccccccccccccccCCccccccccccccchhh Q lcl|NC_020866. 383 DTLVRL----GMEIEENEVRSR-FGFSQPKPGAK---LLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADL 451 (547) Q Consensus 383 ~~l~~~----G~~v~~~~~~e~-~Glp~p~~~e~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (547) ..|..+ |--++.+|+++. +.++..+-.++ +-.....+-- . 
.| .+ ..++ T Consensus 461 ~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~E~~~~~~--~----~p-------~~--------~~~f 516 (516) T protein:vir:10 461 DALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQEAGIKRF--Q----NP-------EN--------EDDF 516 (516) T ss_pred HHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHHHHHhhhCCCC--C----CC-------Cc--------cccC Confidence 444433 545899999775 46653221111 1111100000 0 00 00 0011 No 245 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=27.60 E-value=1.8 Score=19.15 Aligned_cols=408 Identities=10% Similarity=0.137 Sum_probs=173.0 Q ss_pred CCCCCeeeCCCCCcccchhhhhhh-------hhhhhhhhHHHH---------hhhhccCCCHHHHHHHHHhhcCCCHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEV-------AAPTVTGVRSPL---------SGYPGDGLNPLRLAQILRAADHGDPVRY 64 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~-------a~~~~~~~~~~~---------~~~~~~~l~p~~~~~ilr~a~~g~~~~~ 64 (547) |++ +...+-+ .+....++.+ +.|...+....+ .++-..-++. ...+ -+...+ T Consensus 3 ~~~---lf~f~~~-~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~---~~~~-----~~~~eL 70 (516) T protein:vir:10 3 FLD---LFKFWDR-VDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGI---DNNI-----SGTKDL 70 (516) T ss_pred chH---hcccccc-hhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeecc---cccc-----chHHHH Confidence 443 3333221 1111111111 122211111100 1111000111 0011 133455 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHHHhcCC-----ceEecCC-C--CHHHHHHHHHHHHHHhCCCHHHHHHHHHH-hhhhc Q lcl|NC_020866. 65 LDLAEVIEERDPHYLGVIGTRKRSVSQLD-----ITVEPAS-D--DPQDVKIADMVRDWLKREELQQEVFHILD-CISKG 135 (547) Q Consensus 65 ~~L~~~m~~~D~~v~s~l~~Rk~~v~~~~-----w~v~p~~-~--~~~~~~~ae~v~~~l~~~~~~~~i~~~l~-a~~~G 135 (547) ..-|+.|. .+|.|.++++....-+.-.+ ..|.-.+ + ...-.++.++.+..++-++|+.-..++.. 
-...| T Consensus 71 I~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDg 149 (516) T protein:vir:10 71 INTYRQLI-NNPEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDS 149 (516) T ss_pred HHHHHHHh-hccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcc Confidence 66777885 69999999998887665442 2222211 1 12223455666666666666665555552 22244 Q ss_pred CeeeeeEEeecCCeeeEEeeEeeccceeEeccc----CC--c-------ceeecc-------------CCCCceeeccCc Q lcl|NC_020866. 136 YSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKS----DL--A-------TPLLLD-------------DNGAEVPLPAFR 189 (547) Q Consensus 136 ~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~----~~--~-------~~~~~~-------------~~~~g~~l~~~k 189 (547) --.+.++-. +..-.+.++..++||.+.+... +. . ....+. ..+.++.+|.+ T Consensus 150 Ri~fhKiid--~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~d- 226 (516) T protein:vir:10 150 RIFFHKIMP--NPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRS- 226 (516) T ss_pred eEEEEEEec--CccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechh- Confidence 444454322 3333456777788876654221 00 0 111111 01123344544 Q ss_pred eEEEeecC---CcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe-EEEeCCCCCCHHHH--HHHHHHHHHHhcC Q lcl|NC_020866. 190 FIFADIPA---KSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL-RIGKWQPGASEQDK--DTLFRAVANIAGD 263 (547) Q Consensus 190 ~i~~~~~~---~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~-~vgk~~~~~~~~~~--~~l~~al~~~~~~ 263 (547) -|+|.|.. ..++.+ .|.|+.+..++-=-+...-..+.| -.-=-|- ||.-.+.|.-+..+ +=|.+.+....+. 
T Consensus 227 AI~y~hSGL~d~~~~~i-~syLhkAiKp~NQLkm~EDAlVIY-RitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNk 304 (516) T protein:vir:10 227 AVVYASSGLMDCSDRGI-IGYLHNAVKPANQLKLLEDAMVIY-RITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNR 304 (516) T ss_pred heeeecccceeCCCCce-eeeehhhhHhHHhhHHHHhhHHHH-hhhccccceEEEEecCCCCchhHHHHHHHHHHhcCce Confidence 46666632 234444 788888888765444433333322 2222342 22222322222222 1122222222211 Q ss_pred ------eeE--------------Eec-----CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch--hccCccc- Q lcl|NC_020866. 264 ------CAA--------------IIP-----ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT--TTTDAVA- 315 (547) Q Consensus 264 ------~~~--------------vip-----~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt--lts~~~g- 315 (547) ++- -+| .|++|..+..+.+-|. ..=++|..+.+-+++--.. |.+++++ T Consensus 305 lvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlge----m~DV~YF~kkLy~aLnVP~sRl~~e~~~~ 380 (516) T protein:vir:10 305 VVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGD----MDDVRWFNKKLYEALRIPLSRIPRDDGGM 380 (516) T ss_pred eEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcCh----HHHHHHHHHHHHHHhCCCcccccCCCCce Confidence 111 123 5789999865443333 3347899999999853322 2233321 Q ss_pred --cccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCCcccccEEEecCc----cccCHHHHHHHH Q lcl|NC_020866. 316 --GGHAVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLE------YGPQKRYPRIKIARP----KTEDLTKLASSL 382 (547) Q Consensus 316 --gs~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~N------fg~~~~~p~~~~~~~----~~ed~~~~a~~~ 382 (547) ++++..-+..++ |...+......++..|..-|-..|+--| |..-...-++.|... 
+-.+.+-+.+++ T Consensus 381 ~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~ 460 (516) T protein:vir:10 381 VIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRV 460 (516) T ss_pred eeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHH Confidence 145444445554 4456666666677766654444444433 211112223333221 122333344455 Q ss_pred HHHHhC----CCCCCHHHHHHH-hCCCCCCCcch---hcccccccccccccccccccccCCccccccccccccchhh Q lcl|NC_020866. 383 DTLVRL----GMEIEENEVRSR-FGFSQPKPGAK---LLRPLENPATQAAPPGMDPGAGDLKTAERKLKPLSGQADL 451 (547) Q Consensus 383 ~~l~~~----G~~v~~~~~~e~-~Glp~p~~~e~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (547) ..|..+ |--++.+|+++. +.++..+-.++ +-.....+-- . .| .+ ..++ T Consensus 461 ~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~E~~~~~~--~----~p-------~~--------~~~f 516 (516) T protein:vir:10 461 DALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQEAGIKRF--Q----NP-------EN--------EDDF 516 (516) T ss_pred HHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHHHHHhhhCCCC--C----CC-------Cc--------cccC Confidence 444433 545899999775 46653221111 1111100000 0 00 00 0011 No 246 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=25.65 E-value=2 Score=18.89 Aligned_cols=413 Identities=12% Similarity=0.030 Sum_probs=158.8 Q ss_pred CCCCCeeeCCCCCcccchhhhhhhhhh--hhhhhHH----HHhhhhccCCCHHHHHHH---HHhhcCCCHHHHHHHHHHH Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRAQLTEEVAAP--TVTGVRS----PLSGYPGDGLNPLRLAQI---LRAADHGDPVRYLDLAEVI 71 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~~l~~~~a~~--~~~~~~~----~~~~~~~~~l~p~~~~~i---lr~a~~g~~~~~~~L~~~m 71 (547) |.+ +++++-+.+... ....-|+ .|+.. .....| +.... 
-+........+...+| T Consensus 1 m~~------------d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~-~~~~lP-~~~~~~~~~~~~~~~~~~~~~~~~--- 63 (549) T protein:vir:10 1 MTN------------DDAKILQALNADHGRMKEKRQSYEAVWNDV-IDYLMP-RLDKFGQLPRPDSEKGRERSQKMF--- 63 (549) T ss_pred CCc------------chHHHHHHHHHHHHHHHHHhhhHHHHHHHH-HHHhcc-ccccccccCCCCCCcccccccccc--- Confidence 442 223333332210 1111111 11111 111111 00000 0001111111222233 Q ss_pred HhhhHHHHHHHHHHHHHHhcC------Cc-eEecCCCC-HHHHHHHHHH---H----HHH--hCCCHHHHHHHHH-Hhhh Q lcl|NC_020866. 72 EERDPHYLGVIGTRKRSVSQL------DI-TVEPASDD-PQDVKIADMV---R----DWL--KREELQQEVFHIL-DCIS 133 (547) Q Consensus 72 ~~~D~~v~s~l~~Rk~~v~~~------~w-~v~p~~~~-~~~~~~ae~v---~----~~l--~~~~~~~~i~~~l-~a~~ 133 (547) |++-.-++++.-..+.+. +| ++..+++. .+...+.+++ + ..+ +..+|...+.+++ +-+. T Consensus 64 ---dstg~~a~~~LAs~l~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~ 140 (549) T protein:vir:10 64 ---DSTAPLALRNFVAAMDSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGL 140 (549) T ss_pred ---cchHHHHHHHHHHHHHhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHh Confidence 666666666666655533 44 34443322 1112222222 2 211 2457777766555 4456 Q ss_pred hcCeeeeeEEeecCCeeeEEeeEeeccceeEecccCCcce---------------------------------------- Q lcl|NC_020866. 134 KGYSFTEIVWNTSEGQYFPDKLIWRDPRHFRFQKSDLATP---------------------------------------- 173 (547) Q Consensus 134 ~G~sv~Ei~w~~~~g~~~~~~l~~~~~~~~~~~~~~~~~~---------------------------------------- 173 (547) +|-+++=+......+ + .+..+|-..|.+..+..+++ T Consensus 141 ~Gta~l~~~~~~~~~-~---~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~ 216 (549) T protein:vir:10 141 FGPGALMIEHDVGKG-I---VYRNVPMQRLWFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAI 216 (549) T ss_pred hcceeeEEeecCCCe-e---EEEEEEcCeEEEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEE Confidence 776654332221111 0 11111111111111100000 Q ss_pred -------------------------eeccCCCCcee-----eccCceEEEeecCCcCccccchhHHHHHHHHHHHHHHHH Q lcl|NC_020866. 
174 -------------------------LLLDDNGAEVP-----LPAFRFIFADIPAKSGILLRSGLARVAAWGWMFKAFTAR 223 (547) Q Consensus 174 -------------------------~~~~~~~~g~~-----l~~~k~i~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~ 223 (547) +..+.. .... +..+-|++.++...+|..||.|....++.-..--+...+ T Consensus 217 v~~~V~pr~~~~~~~~~~~~~pf~sv~~e~~-~~~il~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~ 295 (549) T protein:vir:10 217 FYHAVEPRADRDPRKLDGRNMQFASYWLDEG-RDRIVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAK 295 (549) T ss_pred EEEEeecCCCCCccccccccCceEEEEEEec-CCEeeccCCcccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHH Confidence 000000 0111 123458888999999999999999999999888888888 Q ss_pred HHHHHHHHcCCCeEEEeCCCCCCHHHHHHHHHHHHHHhcCeeEEecCCceEEEEecCCCCCchHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 224 DWAIFTQTYGQPLRIGKWQPGASEQDKDTLFRAVANIAGDCAAIIPETMSIDFVESKSIGSSVDLYEKRINHLDQQISKA 303 (547) Q Consensus 224 ~w~~f~E~~G~P~~vgk~~~~~~~~~~~~l~~al~~~~~~~~~vip~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ 303 (547) .-+..+++---|..++.-+ +.... . . +.. +.-.+++...+....+.... .++....-...|+.+...|+.+ T Consensus 296 ~~l~~~~~~~~p~~~v~~~-g~~~~-~-~----l~p-gg~~~~~~~~~~~~~~~pl~-~~~~~~~~~~~i~~~~~rI~~a 366 (549) T protein:vir:10 296 TNIRGAQKLVDPPLLANED-GVLDG-F-D----LRS-GALNWGGLNDKGEEMVKPLL-TGKQAQIGIEFAQDTRQTINQW 366 (549) T ss_pred HHHHHHHHHhcCceeeccc-ccccc-c-e----ecc-CCccccccCCCCccceeeec-cccchhHHHHHHHHHHHHHHHH Confidence 8889999887776664322 11111 0 0 111 11112222222233343332 2344445567799999999999 Q ss_pred HhcchhccCccccccchhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhcC---------CC--cccc-c--EEEe Q lcl|NC_020866. 304 VLGQTTTTDAVAGGHAVSQEHRLVQED--IETSDAMALAAILNRDLIRPWIQLEY---------GP--QKRY-P--RIKI 367 (547) Q Consensus 304 ilGqtlts~~~ggs~a~~~vh~~v~~~--~~~ad~~~i~~~ln~~li~~l~~~Nf---------g~--~~~~-p--~~~~ 367 (547) .+..-+....++...-..||+....+. .+-.. -+.|...++-||+..-| ++ .... 
| .+.+ T Consensus 367 f~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LGpv----~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i 442 (549) T protein:vir:10 367 FYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPT----LGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDV 442 (549) T ss_pred HhhhhhhhhcCCCCccHHHHHHHHHHHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEE Confidence 876543222233445566777554332 22222 22222334444432221 11 1100 1 1222 Q ss_pred cCcc-------ccCHHHHHHHHH---HHHhCCC----CCCH----HHHHHHhCCCCC--CCcchhccccc---ccccccc Q lcl|NC_020866. 368 ARPK-------TEDLTKLASSLD---TLVRLGM----EIEE----NEVRSRFGFSQP--KPGAKLLRPLE---NPATQAA 424 (547) Q Consensus 368 ~~~~-------~ed~~~~a~~~~---~l~~~G~----~v~~----~~~~e~~Glp~p--~~~e~~~~~~~---~~~~~~~ 424 (547) .... ..++..+...++ .|.++|= .|+. +++.+.+|+|.. ..++++..... ..+...+ T Consensus 443 ~yis~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~ 522 (549) T protein:vir:10 443 EYDSPLNKAMRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQ 522 (549) T ss_pred EeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHH Confidence 2111 122222222222 2222321 1332 577788899852 23333322111 1110000 Q ss_pred cccccccccCCccccccccccccchhhhhhcccchhhhcccccc Q lcl|NC_020866. 425 PPGMDPGAGDLKTAERKLKPLSGQADLSRGDLSLNSEMPSTGQK 468 (547) Q Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (547) ..+..+..+. .....+.....+.+... 
T Consensus 523 ~~~~a~~a~~-----------------~a~~~~~~~ta~~~~~~ 549 (549) T protein:vir:10 523 MLAAAPVAAG-----------------AIKDLSDAQTAAQTARV 549 (549) T ss_pred HHHHHHHHHH-----------------HHHhhhhhcCCCcccCC Confidence 0000000000 00000000000000010 No 247 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=23.47 E-value=2.3 Score=18.60 Aligned_cols=453 Identities=12% Similarity=0.094 Sum_probs=171.6 Q ss_pred CCCCCeeeCCCCCcccch-hhhhhhhh---hhhhhhHHH--HhhhhccCCCHHHHHHHHHhhcCCCHHHHHHHHHHHHhh Q lcl|NC_020866. 1 MAQKPVLLDRWGNPVQRA-QLTEEVAA---PTVTGVRSP--LSGYPGDGLNPLRLAQILRAADHGDPVRYLDLAEVIEER 74 (547) Q Consensus 1 ma~~~~~~~~~g~~~~~~-~l~~~~a~---~~~~~~~~~--~~~~~~~~l~p~~~~~ilr~a~~g~~~~~~~L~~~m~~~ 74 (547) |++ + +|--++.. .+.....+ +...+.-.. ..++-.+-.+. ....+ +-..+..-|+.|. . T Consensus 1 m~~---l---fgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~---~~~~~-----~~~eLI~~YR~ma-~ 65 (558) T protein:vir:10 1 MAK---L---FGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDI---EGAYR-----SEYDLIRRYREMA-L 65 (558) T ss_pred Ccc---h---hcchhhhhhhhccCCccccCCCccccccceeccceeeeeecc---cchhh-----hHHHHHHHHHHHh-h Confidence 775 2 34444211 22222111 111111000 11111111111 11222 2245566777785 6 Q ss_pred hHHHHHHHHHHHHHHhcCC-----ceEecCCC---CHHHHHHHHHHHHHHhCCCHHHHHHHHHH-hhhhcCeeeeeEEee Q lcl|NC_020866. 75 DPHYLGVIGTRKRSVSQLD-----ITVEPASD---DPQDVKIADMVRDWLKREELQQEVFHILD-CISKGYSFTEIVWNT 145 (547) Q Consensus 75 D~~v~s~l~~Rk~~v~~~~-----w~v~p~~~---~~~~~~~ae~v~~~l~~~~~~~~i~~~l~-a~~~G~sv~Ei~w~~ 145 (547) +|.|-++++....-+.-.+ -.|.-.+- +...+++.++.+..++-++|+.-..++.. -...|--.+.++-.. 
T Consensus 66 ~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~ 145 (558) T protein:vir:10 66 HPEADGAIEDVVNEAIVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDT 145 (558) T ss_pred ccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeC Confidence 9999999998887765442 22322211 11233455555566665666665555553 223455555555544 Q ss_pred cCCeeeEEeeEeeccceeEecccC------C----------c--------ceeeccC-------------CCCceeeccC Q lcl|NC_020866. 146 SEGQYFPDKLIWRDPRHFRFQKSD------L----------A--------TPLLLDD-------------NGAEVPLPAF 188 (547) Q Consensus 146 ~~g~~~~~~l~~~~~~~~~~~~~~------~----------~--------~~~~~~~-------------~~~g~~l~~~ 188 (547) .+..-.+.++..++||.+.+-+.- . . .+.++.. .+.++.++.. T Consensus 146 k~pk~GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~d 225 (558) T protein:vir:10 146 KNPQEGIQDLRYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIAKD 225 (558) T ss_pred CCccccceeeeeeCcccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeechh Confidence 444444567777888876442210 0 0 0011110 1112333333 Q ss_pred ceEEEeecC--CcCccccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCe-EEEeCCCCCCHHHH--HHHHHHHHHHhcC Q lcl|NC_020866. 189 RFIFADIPA--KSGILLRSGLARVAAWGWMFKAFTARDWAIFTQTYGQPL-RIGKWQPGASEQDK--DTLFRAVANIAGD 263 (547) Q Consensus 189 k~i~~~~~~--~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E~~G~P~-~vgk~~~~~~~~~~--~~l~~al~~~~~~ 263 (547) -|+|.|.. 
....-.=.|.|+.+..++-=-+...-..+.| ..-=-|- ||.-.+.|.-+..+ +=|.+.+....+- T Consensus 226 -AI~y~hSGL~d~~~~~i~syLhkAIKp~NQLkmlEDAlVIY-RitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNk 303 (558) T protein:vir:10 226 -SITMCTSGLVDRNKNRVLSYLHKAIKALNQLRMIEDSLVIY-RLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNK 303 (558) T ss_pred -heeeecccceecCCCeeeecchHhhHhHHhhHHHHhhHHHH-hhhccccceEEEEecCCCCchhHHHHHHHHHHhccce Confidence 44444441 1111122467777777654333322222222 1212232 22222322222222 1122222222211 Q ss_pred ------eeE--------------Eec-----CCceEEEEecCCCCCchHHHHHHHHHHHHHHHHHHhcch--hccCccc- Q lcl|NC_020866. 264 ------CAA--------------IIP-----ETMSIDFVESKSIGSSVDLYEKRINHLDQQISKAVLGQT--TTTDAVA- 315 (547) Q Consensus 264 ------~~~--------------vip-----~~~~ie~~e~~~~~~~~~~f~~li~~~d~~Isk~ilGqt--lts~~~g- 315 (547) ++- -+| .|++|..+..+.+-|. ..=++|..+.+-+++--.. |..+++- T Consensus 304 lVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLge----m~DV~YF~kKLy~aLnVP~SRl~~e~~f~ 379 (558) T protein:vir:10 304 LVYDANTGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGE----LSDVDYFQKKLYRALGVPESRIAAEGGFN 379 (558) T ss_pred EEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcch----HHHHHHHHHHHHHHhCCCccccCCCCccc Confidence 111 123 5789998864433332 2347899999988853222 2233211 Q ss_pred cccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCCcccccEEEecCc----cccCHHHHHHHHHH Q lcl|NC_020866. 316 GGHAVSQEHRLV-QEDIETSDAMALAAILNRDLIRPWIQLE------YGPQKRYPRIKIARP----KTEDLTKLASSLDT 384 (547) Q Consensus 316 gs~a~~~vh~~v-~~~~~~ad~~~i~~~ln~~li~~l~~~N------fg~~~~~p~~~~~~~----~~ed~~~~a~~~~~ 384 (547) -+++..-+..++ |...+......++..|..-|-..|+--| |..-...-++.|... +-.+.+-+.+++.. 
T Consensus 380 ~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~ 459 (558) T protein:vir:10 380 LGRSSEILRDELKFAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGM 459 (558) T ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHH Confidence 123322334444 4456666666666666654444444333 211112223333222 12233334445554 Q ss_pred HHhC----CCCCCHHHHHHHh-CCCCCCCcc--hhcccccccccccccccccccccCCcc--ccccccccccchhhhhhc Q lcl|NC_020866. 385 LVRL----GMEIEENEVRSRF-GFSQPKPGA--KLLRPLENPATQAAPPGMDPGAGDLKT--AERKLKPLSGQADLSRGD 455 (547) Q Consensus 385 l~~~----G~~v~~~~~~e~~-Glp~p~~~e--~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 455 (547) |..+ |--++.+|+++.+ .+.+.+-.+ ..+.......--..|....+-.+..-+ ..+........+..+.. T Consensus 460 l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 538 (558) T protein:vir:10 460 LATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDL- 538 (558) T ss_pred HHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCchhccCCCCCccccc- Confidence 4443 4447888887653 443211000 000000000000000000000000000 00000000000000000 Q ss_pred ccchhhhcccccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020866. 456 LSLNSEMPSTGQKTGGAEIAALTDQLELEGQPEIVAMIEEI 496 (547) Q Consensus 456 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i 496 (547) ...++..++.+++-.+.+. + T Consensus 539 ------------~~~~~~~~~~~~~~~~~~~---------~ 558 (558) T protein:vir:10 539 ------------EAQAQAVDAQYSKDTKKAE---------L 558 (558) T ss_pred ------------ccchhhhhhhhhhhhhhhc---------C Confidence 0001111111111111000 0 Done!