Query         lcl|NC_021537.1_cdsid_YP_008126541.1 [gene=HALG_00006] [protein=hypothetical protein] [protein_id=YP_008126541.1] [location=complement(4128..5936)]
Match_columns 602
No_of_seqs    283 out of 1266
Neff          9.1
Searched_HMMs 1612
Date          Thu Nov 7 17:18:12 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_6 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_6_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols  Query HMM Template HMM
  1 protein:vir:99452 Length: 651  100.0  3E-143  2E-146  801.8  43.6  589  1-602     24-651 (651)
  2 protein:vir:3153 Length: 467 # 100.0 8.2E-86 5.1E-89  487.0  40.2  452  32-523    1-467 (467)
  3 protein:vir:1884 Length: 424 # 100.0 5.7E-84 3.5E-87  477.0  40.1  395  1-488     24-424 (424)
  4 protein:vir:7853 Length: 518 # 100.0 1.1E-83 6.7E-87  475.4  40.2  491  1-602     8-505 (518)
  5 protein:vir:4509 Length: 424 # 100.0 1.9E-83 1.2E-86  474.0  40.8  394  1-479     24-424 (424)
  6 protein:vir:101648 Length: 518 100.0 1.8E-83 1.1E-86  474.2  40.1  489  1-602     15-505 (518)
  7 protein:vir:189 Length: 424 #  100.0 5.1E-83 3.2E-86  471.7  39.8  396  1-488     24-424 (424)
  8 protein:vir:93610 Length: 454  100.0   1E-82 6.3E-86  470.1  40.7  432  1-532     8-454 (454)
  9 protein:vir:102855 Length: 432 100.0 6.6E-83 4.1E-86  471.1  39.6  411  1-490     13-432 (432)
 10 protein:vir:105002 Length: 432 100.0 6.6E-83 4.1E-86  471.1  39.6  411  1-490     13-432 (432)
 11 protein:vir:107605 Length: 432 100.0 6.6E-83 4.1E-86  471.1  39.6  411  1-490     13-432 (432)
 12 protein:vir:100249 Length: 431 100.0 2.5E-82 1.6E-85  467.9  40.8  391  1-477     17-431 (431)
 13 protein:vir:100150 Length: 437 100.0 2.4E-82 1.5E-85  468.0  40.0  412  1-492     1-437 (437)
 14 protein:vir:102080 Length: 429 100.0 4.5E-82 2.8E-85  466.6  40.6  412  1-488     10-429 (429)
 15 protein:vir:4337 Length: 434 # 100.0 3.9E-82 2.4E-85  466.9  39.0  408  1-491     9-434 (434)
 16 protein:vir:81152 Length: 411  100.0 9.2E-82 5.7E-85  464.9  38.5  399  1-477     11-411 (411)
 17 protein:vir:105064 Length: 421 100.0 1.7E-81
1E-84 463.5 39.5 405 1-497 6-421 (421) 18 protein:vir:102118 Length: 409 100.0 1.9E-81 1.2E-84 463.1 39.5 400 1-476 7-409 (409) 19 protein:vir:10362 Length: 432 100.0 1.9E-81 1.2E-84 463.1 39.2 403 1-491 17-432 (432) 20 protein:vir:5737 Length: 419 # 100.0 3E-81 1.9E-84 462.0 40.2 403 1-487 7-419 (419) 21 protein:vir:1380 Length: 422 # 100.0 2.8E-81 1.7E-84 462.2 39.9 401 1-486 11-422 (422) 22 protein:vir:4454 Length: 414 # 100.0 4.3E-81 2.7E-84 461.2 40.5 401 1-485 8-414 (414) 23 protein:vir:6240 Length: 457 # 100.0 8.2E-81 5.1E-84 459.7 41.0 434 1-521 8-457 (457) 24 protein:vir:4194 Length: 540 # 100.0 6.9E-81 4.3E-84 460.0 40.5 495 1-583 17-540 (540) 25 protein:vir:94666 Length: 723 100.0 3.3E-82 2.1E-85 467.3 32.9 501 12-602 1-520 (723) 26 protein:vir:81218 Length: 423 100.0 1.7E-80 1E-83 458.0 40.9 404 1-475 7-423 (423) 27 protein:vir:1431 Length: 419 # 100.0 1.2E-80 7.3E-84 458.8 40.1 404 1-494 8-419 (419) 28 protein:vir:97060 Length: 432 100.0 1.2E-80 7.4E-84 458.8 39.7 403 1-491 18-432 (432) 29 protein:vir:81072 Length: 432 100.0 1.1E-80 7.1E-84 458.9 39.3 403 1-491 17-432 (432) 30 protein:vir:483 Length: 413 # 100.0 7.3E-80 4.6E-83 454.4 40.6 401 1-495 7-413 (413) 31 protein:vir:1266 Length: 416 # 100.0 7.9E-80 4.9E-83 454.2 39.4 403 1-487 7-416 (416) 32 protein:vir:1326 Length: 457 # 100.0 1.7E-79 1E-82 452.5 41.1 432 1-511 7-457 (457) 33 protein:vir:80333 Length: 419 100.0 2E-79 1.2E-82 452.1 39.9 404 1-495 3-419 (419) 34 protein:vir:101647 Length: 460 100.0 7.3E-79 4.5E-82 449.0 40.7 422 1-488 9-460 (460) 35 protein:vir:96980 Length: 409 100.0 1.2E-78 7.7E-82 447.7 39.1 396 1-481 1-409 (409) 36 protein:vir:93943 Length: 409 100.0 1.5E-78 9.5E-82 447.2 38.8 397 1-487 1-409 (409) 37 protein:vir:80644 Length: 551 100.0 5.8E-78 3.6E-81 444.0 41.9 458 1-539 43-551 (551) 38 protein:vir:4156 Length: 542 # 100.0 9E-78 5.6E-81 443.0 42.0 496 1-583 17-542 (542) 39 protein:vir:2683 Length: 412 # 100.0 2.3E-78 1.4E-81 446.2 38.3 397 1-481 4-412 (412) 40 
protein:vir:63755 Length: 547 100.0 6.2E-78 3.9E-81 443.9 40.5 454 1-539 39-547 (547) 41 protein:vir:94426 Length: 409 100.0 4E-78 2.5E-81 444.9 39.1 397 1-487 1-409 (409) 42 protein:vir:98396 Length: 441 100.0 1.9E-77 1.2E-80 441.2 39.4 401 1-491 29-441 (441) 43 protein:vir:3868 Length: 417 # 100.0 2.3E-77 1.5E-80 440.7 38.7 410 1-499 3-417 (417) 44 protein:vir:79984 Length: 441 100.0 4.3E-77 2.7E-80 439.3 39.2 403 1-491 14-441 (441) 45 protein:vir:9408 Length: 441 # 100.0 4.3E-77 2.7E-80 439.3 39.2 403 1-491 14-441 (441) 46 protein:vir:8418 Length: 409 # 100.0 9.2E-77 5.7E-80 437.4 40.1 397 1-491 8-409 (409) 47 protein:vir:80796 Length: 574 100.0 1.1E-75 7.1E-79 431.4 42.8 469 1-541 54-574 (574) 48 protein:vir:9702 Length: 406 # 100.0 9.7E-76 6E-79 431.8 39.7 400 1-492 4-406 (406) 49 protein:vir:4598 Length: 416 # 100.0 1.8E-75 1.1E-78 430.3 39.2 401 1-491 7-416 (416) 50 protein:vir:81095 Length: 416 100.0 1.8E-75 1.1E-78 430.3 39.2 401 1-491 7-416 (416) 51 protein:vir:99312 Length: 563 100.0 6.5E-75 4E-78 427.3 41.9 461 1-544 46-563 (563) 52 protein:vir:95599 Length: 563 100.0 6.5E-75 4E-78 427.3 41.9 461 1-544 46-563 (563) 53 protein:vir:102727 Length: 945 100.0 6.4E-76 4E-79 432.8 35.5 513 1-602 71-630 (945) 54 protein:vir:8317 Length: 409 # 100.0 1.2E-75 7.6E-79 431.3 36.8 365 1-461 33-409 (409) 55 protein:vir:96579 Length: 576 100.0 8.8E-73 5.5E-76 415.6 42.5 465 1-545 45-576 (576) 56 protein:vir:960 Length: 413 # 100.0 2E-73 1.2E-76 419.2 38.1 386 1-477 18-413 (413) 57 protein:vir:8100 Length: 466 # 100.0 4.6E-73 2.9E-76 417.1 39.2 413 1-490 10-466 (466) 58 protein:vir:9359 Length: 348 # 100.0 2.2E-73 1.4E-76 418.9 36.4 347 53-487 1-348 (348) 59 protein:vir:100691 Length: 535 100.0 3.9E-72 2.4E-75 412.0 41.2 444 1-523 32-535 (535) 60 protein:vir:95378 Length: 406 100.0 1.3E-71 8.1E-75 409.2 40.6 392 1-491 10-406 (406) 61 protein:vir:3843 Length: 397 # 100.0 7.7E-72 4.8E-75 410.4 38.1 388 1-484 7-397 (397) 62 protein:vir:104259 Length: 403 100.0 8.6E-72 
5.3E-75 410.2 37.6 388 1-482 8-403 (403) 63 protein:vir:80134 Length: 403 100.0 1.6E-71 1E-74 408.7 38.2 391 1-491 8-403 (403) 64 protein:vir:100187 Length: 385 100.0 9.6E-71 6E-74 404.4 38.0 369 1-475 9-385 (385) 65 protein:vir:100882 Length: 383 100.0 1.6E-69 9.7E-73 397.8 37.6 367 1-473 9-383 (383) 66 protein:vir:79772 Length: 648 100.0 8.6E-70 5.3E-73 399.2 34.6 513 1-602 52-615 (648) 67 protein:vir:6210 Length: 394 # 100.0 2.5E-69 1.5E-72 396.7 37.1 381 1-490 11-394 (394) 68 protein:vir:4854 Length: 386 # 100.0 8.7E-69 5.4E-72 393.7 36.5 378 1-476 7-386 (386) 69 protein:vir:7407 Length: 392 # 100.0 1.7E-68 1.1E-71 392.1 36.0 359 1-457 9-392 (392) 70 protein:vir:1023 Length: 392 # 100.0 2.2E-67 1.4E-70 386.0 37.3 370 1-483 9-392 (392) 71 protein:vir:3989 Length: 392 # 100.0 2.2E-67 1.4E-70 386.0 37.3 370 1-483 9-392 (392) 72 protein:vir:1082 Length: 359 # 100.0 1.2E-67 7.2E-71 387.5 35.7 346 1-448 7-359 (359) 73 protein:vir:4995 Length: 384 # 100.0 2.3E-67 1.4E-70 386.0 31.6 367 1-456 7-384 (384) 74 protein:vir:101289 Length: 395 100.0 4.8E-66 3E-69 378.7 35.4 384 1-489 8-395 (395) 75 protein:vir:100650 Length: 395 100.0 4.8E-66 3E-69 378.7 35.4 384 1-489 8-395 (395) 76 protein:vir:9507 Length: 395 # 100.0 4.8E-66 3E-69 378.7 35.4 384 1-489 8-395 (395) 77 protein:vir:95965 Length: 385 100.0 1.9E-65 1.2E-68 375.4 36.8 370 1-475 7-385 (385) 78 protein:vir:4952 Length: 386 # 100.0 8.9E-65 5.5E-68 371.7 36.2 374 1-476 7-386 (386) 79 protein:vir:4828 Length: 382 # 100.0 2.1E-63 1.3E-66 364.2 36.3 370 1-476 4-382 (382) 80 protein:vir:4089 Length: 395 # 100.0 2.5E-63 1.6E-66 363.7 33.7 378 1-487 11-395 (395) 81 protein:vir:78310 Length: 376 100.0 8.1E-63 5E-66 361.0 34.4 366 1-470 7-376 (376) 82 protein:vir:267 Length: 348 # 100.0 1.1E-62 6.8E-66 360.3 32.5 319 1-424 1-348 (348) 83 protein:vir:103971 Length: 376 100.0 1.2E-62 7.3E-66 360.1 31.6 314 1-419 26-376 (376) 84 protein:vir:100328 Length: 346 100.0 1.3E-62 7.8E-66 359.9 30.6 315 1-415 1-346 (346) 85 
protein:vir:94002 Length: 378 100.0 2.8E-62 1.8E-65 358.0 32.4 362 1-491 4-378 (378) 86 protein:vir:98643 Length: 395 100.0 9.8E-62 6.1E-65 355.1 35.3 382 1-487 8-395 (395) 87 protein:vir:1661 Length: 378 # 100.0 7.3E-62 4.5E-65 355.8 33.8 362 1-491 4-378 (378) 88 protein:vir:93867 Length: 378 100.0 3.2E-62 2E-65 357.7 31.8 362 1-491 4-378 (378) 89 protein:vir:79207 Length: 351 100.0 4.5E-62 2.8E-65 356.9 31.9 314 1-419 1-351 (351) 90 protein:vir:98567 Length: 340 100.0 2.8E-62 1.8E-65 358.0 30.8 311 1-414 1-340 (340) 91 protein:vir:78191 Length: 351 100.0 5.4E-62 3.3E-65 356.5 31.5 314 1-419 1-351 (351) 92 protein:vir:9641 Length: 395 # 100.0 1.1E-61 6.9E-65 354.8 32.8 373 1-479 7-395 (395) 93 protein:vir:3743 Length: 345 # 100.0 2.5E-61 1.5E-64 352.9 32.3 318 1-412 1-345 (345) 94 protein:vir:79150 Length: 368 100.0 1.1E-61 6.9E-65 354.8 28.0 323 1-430 27-368 (368) 95 protein:vir:1150 Length: 350 # 100.0 3.5E-61 2.2E-64 352.0 30.4 310 1-410 1-350 (350) 96 protein:vir:3780 Length: 345 # 100.0 3.5E-61 2.2E-64 352.0 30.0 318 1-412 1-345 (345) 97 protein:vir:5691 Length: 344 # 100.0 3.8E-61 2.4E-64 351.8 29.6 312 1-417 1-344 (344) 98 protein:vir:78749 Length: 337 100.0 1.1E-60 6.9E-64 349.3 31.1 308 1-411 1-337 (337) 99 protein:vir:6058 Length: 344 # 100.0 1.6E-60 1E-63 348.4 30.7 312 1-417 1-344 (344) 100 protein:vir:2013 Length: 344 # 100.0 6.4E-60 4E-63 345.1 30.1 312 1-415 1-344 (344) 101 protein:vir:858 Length: 378 # 100.0 1.1E-58 7.1E-62 338.2 32.5 363 1-491 4-378 (378) 102 protein:vir:94869 Length: 378 100.0 3.2E-58 2E-61 335.8 32.7 363 1-491 4-378 (378) 103 protein:vir:78641 Length: 278 100.0 4.4E-58 2.7E-61 335.1 30.5 277 53-410 1-278 (278) 104 protein:vir:98853 Length: 219 100.0 2.5E-46 1.5E-49 270.6 22.5 217 159-414 1-219 (219) 105 protein:vir:4698 Length: 251 # 100.0 2.9E-41 1.8E-44 242.8 23.5 238 1-304 7-251 (251) 106 protein:vir:5249 Length: 437 # 100.0 1.6E-27 9.8E-31 167.5 34.9 400 1-490 1-437 (437) 107 protein:vir:79538 Length: 502 99.9 3.3E-28 
2.1E-31 171.2 29.5 440 1-492 11-502 (502) 108 protein:vir:389 Length: 530 # 99.9 2.3E-28 1.4E-31 172.1 27.0 461 1-490 1-530 (530) 109 protein:vir:3420 Length: 533 # 99.9 5.2E-28 3.2E-31 170.2 27.0 461 1-496 9-533 (533) 110 protein:vir:96738 Length: 505 99.9 2.8E-28 1.7E-31 171.6 23.8 445 1-487 1-505 (505) 111 protein:vir:6382 Length: 553 # 99.9 4.4E-27 2.7E-30 165.1 29.5 465 1-491 18-553 (553) 112 protein:vir:95542 Length: 548 99.9 8.4E-27 5.2E-30 163.5 30.2 476 1-526 11-548 (548) 113 protein:vir:10321 Length: 495 99.9 2.3E-26 1.4E-29 161.1 24.0 441 1-488 9-495 (495) 114 protein:vir:107742 Length: 537 99.9 6E-23 3.7E-26 142.4 35.1 421 1-502 58-537 (537) 115 protein:vir:94049 Length: 532 99.9 7.8E-23 4.9E-26 141.8 33.1 435 1-512 35-532 (532) 116 protein:vir:108215 Length: 469 99.9 1.4E-21 8.7E-25 134.9 33.3 433 1-521 2-469 (469) 117 protein:vir:80040 Length: 461 99.8 2.4E-20 1.5E-23 128.1 33.2 406 1-490 1-461 (461) 118 protein:vir:79647 Length: 435 99.8 1.2E-20 7.2E-24 129.9 30.5 386 1-491 5-435 (435) 119 protein:vir:104338 Length: 422 99.8 2E-20 1.2E-23 128.6 31.0 386 1-486 1-422 (422) 120 protein:vir:96068 Length: 765 99.8 4.2E-19 2.6E-22 121.3 35.8 506 1-602 59-631 (765) 121 protein:vir:99563 Length: 862 99.8 5.2E-19 3.2E-22 120.8 35.5 504 1-602 88-660 (862) 122 protein:vir:107662 Length: 427 99.8 9.8E-20 6.1E-23 124.8 29.1 387 2-490 1-427 (427) 123 protein:vir:99232 Length: 526 99.8 1E-18 6.5E-22 119.2 31.0 472 1-585 12-526 (526) 124 protein:vir:103860 Length: 528 99.8 2.7E-18 1.7E-21 116.9 31.6 474 1-585 12-528 (528) 125 protein:vir:79233 Length: 526 99.8 1.7E-18 1.1E-21 118.0 30.1 472 1-585 12-526 (526) 126 protein:vir:95254 Length: 488 99.8 1.3E-17 8.3E-21 113.1 30.9 439 1-499 1-488 (488) 127 protein:vir:77981 Length: 448 99.7 3.9E-17 2.4E-20 110.6 32.9 409 1-506 1-448 (448) 128 protein:vir:79063 Length: 491 99.7 4.1E-17 2.5E-20 110.4 31.5 453 1-573 13-491 (491) 129 protein:vir:99853 Length: 488 99.7 7.1E-18 4.4E-21 114.6 27.2 455 1-576 1-488 (488) 130 
protein:vir:107880 Length: 491 99.7 7.3E-17 4.6E-20 109.0 32.5 454 1-573 13-491 (491) 131 protein:vir:1986 Length: 512 # 99.7 3.1E-16 1.9E-19 105.6 33.9 451 1-549 17-512 (512) 132 protein:vir:79511 Length: 448 99.7 1.2E-15 7.3E-19 102.4 33.9 409 1-495 1-448 (448) 133 protein:vir:98816 Length: 446 99.7 4.8E-16 3E-19 104.5 29.1 389 1-452 5-446 (446) 134 protein:vir:105782 Length: 449 99.5 3E-13 1.8E-16 89.3 27.4 388 1-481 9-449 (449) 135 protein:vir:78161 Length: 355 99.4 1.4E-12 8.9E-16 85.5 25.0 325 120-518 1-355 (355) 136 protein:vir:106716 Length: 698 99.2 3.5E-10 2.2E-13 72.4 32.2 509 1-602 83-677 (698) 137 protein:vir:3648 Length: 695 # 99.2 4.3E-10 2.7E-13 71.9 31.8 512 1-602 61-674 (695) 138 protein:vir:7768 Length: 484 # 99.2 4.9E-11 3.1E-14 77.1 25.2 426 1-496 5-484 (484) 139 protein:vir:104082 Length: 485 99.2 1.8E-11 1.1E-14 79.5 22.5 429 1-499 21-485 (485) 140 protein:vir:78589 Length: 695 99.2 6.9E-10 4.3E-13 70.8 32.4 509 1-602 83-674 (695) 141 protein:vir:101541 Length: 694 99.2 7.7E-10 4.8E-13 70.6 31.8 511 1-602 60-673 (694) 142 protein:vir:99916 Length: 504 99.2 1.4E-10 8.8E-14 74.6 25.3 438 1-503 18-504 (504) 143 protein:vir:94742 Length: 409 99.2 2.9E-10 1.8E-13 72.9 26.7 377 1-448 9-409 (409) 144 protein:vir:98444 Length: 434 99.2 8.6E-11 5.3E-14 75.8 23.1 405 23-496 1-434 (434) 145 protein:vir:2427 Length: 485 # 99.1 2.7E-10 1.6E-13 73.1 23.9 426 1-499 26-485 (485) 146 protein:vir:8184 Length: 474 # 99.1 3.5E-10 2.2E-13 72.4 23.0 419 1-478 12-474 (474) 147 protein:vir:2341 Length: 488 # 99.1 4.1E-10 2.5E-13 72.1 22.9 433 1-495 1-488 (488) 148 protein:vir:4223 Length: 486 # 99.1 1.1E-09 7E-13 69.6 25.2 428 1-496 21-486 (486) 149 protein:vir:5839 Length: 533 # 99.0 1.1E-09 6.5E-13 69.8 24.5 446 1-508 38-533 (533) 150 protein:vir:99072 Length: 479 99.0 2.6E-09 1.6E-12 67.7 24.4 419 1-518 9-479 (479) 151 protein:vir:1634 Length: 409 # 98.9 1.2E-08 7.5E-12 64.0 26.3 373 1-448 9-409 (409) 152 protein:vir:5961 Length: 503 # 98.9 1.4E-08 8.6E-12 63.7 
30.4 427 1-495 34-503 (503) 153 protein:vir:80680 Length: 441 98.9 2.6E-09 1.6E-12 67.7 21.5 408 1-497 12-441 (441) 154 protein:vir:4898 Length: 502 # 98.9 5.4E-09 3.4E-12 65.9 23.0 426 1-501 45-502 (502) 155 protein:vir:7987 Length: 456 # 98.9 9.9E-10 6.1E-13 70.0 18.5 406 1-486 13-456 (456) 156 protein:vir:105819 Length: 456 98.9 4.6E-09 2.8E-12 66.3 21.7 412 1-486 1-456 (456) 157 protein:vir:102602 Length: 456 98.9 4.6E-09 2.8E-12 66.3 21.7 412 1-486 1-456 (456) 158 protein:vir:96494 Length: 501 98.9 7.5E-09 4.7E-12 65.1 22.7 429 1-493 44-501 (501) 159 protein:vir:78537 Length: 480 98.8 2.7E-08 1.7E-11 62.1 25.3 431 1-498 11-480 (480) 160 protein:vir:9751 Length: 422 # 98.8 1.2E-08 7.7E-12 63.9 22.7 390 1-470 1-422 (422) 161 protein:vir:2732 Length: 501 # 98.8 1.8E-08 1.1E-11 63.1 23.5 432 1-501 44-501 (501) 162 protein:vir:78227 Length: 480 98.8 5.9E-08 3.6E-11 60.2 25.1 432 1-498 16-480 (480) 163 protein:vir:102426 Length: 631 98.7 1.7E-08 1.1E-11 63.2 21.2 490 1-602 22-599 (631) 164 protein:vir:38 Length: 496 # N 98.7 7.2E-08 4.5E-11 59.7 30.4 430 1-493 28-496 (496) 165 protein:vir:105889 Length: 474 98.7 7.8E-08 4.8E-11 59.6 27.7 410 1-491 24-474 (474) 166 protein:vir:94101 Length: 474 98.7 7.8E-08 4.8E-11 59.6 27.7 410 1-491 24-474 (474) 167 protein:vir:2500 Length: 501 # 98.7 2.5E-08 1.6E-11 62.3 21.7 425 1-495 23-501 (501) 168 protein:vir:8654 Length: 629 # 98.7 5.6E-09 3.5E-12 65.8 17.6 501 1-602 9-597 (629) 169 protein:vir:99088 Length: 629 98.7 5.8E-09 3.6E-12 65.7 17.3 498 1-602 9-608 (629) 170 protein:vir:106491 Length: 646 98.6 8.1E-08 5E-11 59.5 22.4 516 1-602 4-601 (646) 171 protein:vir:107517 Length: 639 98.6 6.7E-09 4.2E-12 65.4 15.4 514 1-602 9-625 (639) 172 protein:vir:97900 Length: 639 98.6 6.7E-09 4.2E-12 65.4 15.4 514 1-602 9-625 (639) 173 protein:vir:93747 Length: 472 98.6 2.8E-07 1.7E-10 56.5 24.1 406 1-491 1-472 (472) 174 protein:vir:9568 Length: 410 # 98.6 3E-07 1.8E-10 56.4 24.6 390 1-471 1-410 (410) 175 protein:vir:99522 Length: 470 
98.6 3E-07 1.8E-10 56.4 23.7 413 1-488 30-470 (470) 176 protein:vir:106639 Length: 481 98.5 3.2E-07 2E-10 56.2 27.8 417 1-489 39-481 (481) 177 protein:vir:95113 Length: 474 98.5 3.5E-07 2.2E-10 56.0 28.4 410 1-491 32-474 (474) 178 protein:vir:9871 Length: 429 # 98.5 4.7E-07 2.9E-10 55.3 26.9 401 1-482 9-429 (429) 179 protein:vir:3964 Length: 453 # 98.5 5E-07 3.1E-10 55.1 27.8 408 1-491 26-453 (453) 180 protein:vir:95806 Length: 440 98.5 5E-07 3.1E-10 55.1 24.1 416 1-491 2-440 (440) 181 protein:vir:80959 Length: 499 98.4 6.6E-07 4.1E-10 54.5 31.3 430 1-493 22-499 (499) 182 protein:vir:3609 Length: 452 # 98.4 8E-07 4.9E-10 54.0 28.0 407 1-492 11-452 (452) 183 protein:vir:1236 Length: 483 # 98.4 8.8E-07 5.4E-10 53.8 28.1 403 1-491 44-483 (483) 184 protein:vir:733 Length: 453 # 98.4 9.1E-07 5.6E-10 53.7 25.9 407 1-489 26-453 (453) 185 protein:vir:97447 Length: 474 98.4 1.1E-06 6.5E-10 53.4 29.4 411 1-491 32-474 (474) 186 protein:vir:94498 Length: 474 98.4 1.1E-06 6.5E-10 53.4 29.4 411 1-491 32-474 (474) 187 protein:vir:96266 Length: 474 98.4 1.1E-06 6.7E-10 53.3 23.0 412 1-493 32-474 (474) 188 protein:vir:95899 Length: 474 98.4 1.1E-06 6.7E-10 53.3 23.0 412 1-493 32-474 (474) 189 protein:vir:105292 Length: 478 98.3 1.3E-06 8.2E-10 52.8 27.8 414 1-491 20-478 (478) 190 protein:vir:94805 Length: 492 98.3 1.5E-06 9E-10 52.6 27.7 405 1-491 53-492 (492) 191 protein:vir:106027 Length: 629 98.3 4.6E-07 2.8E-10 55.3 18.3 511 1-602 11-615 (629) 192 protein:vir:96839 Length: 474 98.3 1.5E-06 9.6E-10 52.4 28.3 406 1-495 35-474 (474) 193 protein:vir:97336 Length: 492 98.3 1.6E-06 9.7E-10 52.4 27.8 406 1-491 53-492 (492) 194 protein:vir:79043 Length: 479 98.3 1.7E-06 1.1E-09 52.2 26.7 411 1-483 28-479 (479) 195 protein:vir:4782 Length: 522 # 98.2 2.9E-06 1.8E-09 50.9 31.3 432 1-492 14-522 (522) 196 protein:vir:1587 Length: 508 # 98.2 3.4E-06 2.1E-09 50.6 27.8 429 1-491 20-508 (508) 197 protein:vir:103219 Length: 201 98.1 2.5E-07 1.5E-10 56.8 13.6 186 263-487 1-201 (201) 198 
protein:vir:9922 Length: 489 # 98.1 4E-06 2.5E-09 50.2 23.0 423 1-483 24-489 (489) 199 protein:vir:9306 Length: 511 # 97.9 1E-05 6.4E-09 47.9 27.1 424 1-492 45-511 (511) 200 protein:vir:107112 Length: 478 97.9 1E-05 6.4E-09 47.9 28.5 415 1-491 20-478 (478) 201 protein:vir:106571 Length: 499 97.9 1.1E-05 6.7E-09 47.8 29.1 422 1-509 25-499 (499) 202 protein:vir:97171 Length: 512 97.9 1.3E-05 7.9E-09 47.4 26.9 424 1-492 45-512 (512) 203 protein:vir:99781 Length: 511 97.8 1.7E-05 1E-08 46.8 25.9 429 1-492 45-511 (511) 204 protein:vir:94546 Length: 506 97.8 1.8E-05 1.1E-08 46.6 27.1 427 1-505 31-506 (506) 205 protein:vir:96179 Length: 468 97.8 1.9E-05 1.2E-08 46.4 26.8 405 1-484 17-468 (468) 206 protein:vir:79703 Length: 505 97.8 2.1E-05 1.3E-08 46.3 27.9 430 1-491 20-505 (505) 207 protein:vir:98883 Length: 517 97.8 2.2E-05 1.4E-08 46.1 28.7 439 1-491 23-517 (517) 208 protein:vir:102950 Length: 471 97.7 2.7E-05 1.7E-08 45.6 27.9 410 1-493 15-471 (471) 209 protein:vir:96240 Length: 511 97.6 3.6E-05 2.3E-08 44.9 27.8 426 1-493 45-511 (511) 210 protein:vir:103951 Length: 511 97.6 3.7E-05 2.3E-08 44.9 26.2 422 1-492 49-511 (511) 211 protein:vir:78805 Length: 511 97.6 4.4E-05 2.7E-08 44.5 26.4 423 1-492 45-511 (511) 212 protein:vir:96366 Length: 511 97.6 4.4E-05 2.7E-08 44.5 26.4 423 1-492 45-511 (511) 213 protein:vir:105461 Length: 470 97.5 4.7E-05 2.9E-08 44.3 27.6 406 1-491 10-470 (470) 214 protein:vir:104892 Length: 558 97.5 4.8E-05 3E-08 44.3 27.6 461 1-528 16-558 (558) 215 protein:vir:78907 Length: 518 97.5 6E-05 3.7E-08 43.7 28.9 435 1-480 15-518 (518) 216 protein:vir:9815 Length: 500 # 97.5 6.1E-05 3.8E-08 43.7 27.6 432 1-486 11-500 (500) 217 protein:vir:3028 Length: 500 # 97.5 6.1E-05 3.8E-08 43.7 27.6 432 1-486 11-500 (500) 218 protein:vir:104500 Length: 537 97.3 0.00011 6.8E-08 42.3 27.8 448 1-492 16-537 (537) 219 protein:vir:5665 Length: 511 # 96.9 0.00028 1.7E-07 40.0 26.6 431 1-489 16-511 (511) 220 protein:vir:78083 Length: 537 96.6 0.00046 2.8E-07 38.9 32.7 
440 1-510 21-537 (537) 221 protein:vir:102330 Length: 451 96.1 0.00098 6.1E-07 37.1 24.0 391 1-491 6-451 (451) 222 protein:vir:4995 Length: 384 # 96.1 0.00098 6.1E-07 37.1 15.0 346 56-476 1-384 (384) 223 protein:vir:106999 Length: 564 96.1 0.001 6.2E-07 37.0 26.5 471 1-523 14-564 (564) 224 protein:vir:94709 Length: 522 96.1 0.0011 6.6E-07 36.9 14.8 452 1-512 16-522 (522) 225 protein:vir:103177 Length: 533 95.4 0.0021 1.3E-06 35.3 27.4 441 1-492 9-533 (533) 226 protein:vir:101806 Length: 516 95.4 0.0021 1.3E-06 35.3 27.7 431 1-489 22-516 (516) 227 protein:vir:101189 Length: 516 95.4 0.0021 1.3E-06 35.3 27.7 431 1-489 22-516 (516) 228 protein:vir:4073 Length: 279 # 95.4 0.00033 2E-07 39.7 8.3 273 100-450 1-279 (279) 229 protein:vir:81017 Length: 521 95.3 0.0022 1.4E-06 35.1 24.7 433 1-509 24-521 (521) 230 protein:vir:106282 Length: 521 95.2 0.0025 1.5E-06 34.9 27.7 431 1-489 25-521 (521) 231 protein:vir:98265 Length: 524 94.9 0.0031 1.9E-06 34.3 28.8 432 1-489 28-524 (524) 232 protein:vir:101494 Length: 527 94.9 0.0033 2.1E-06 34.2 26.2 438 3-490 1-527 (527) 233 protein:vir:102239 Length: 527 94.7 0.0036 2.2E-06 34.0 26.2 438 3-490 1-527 (527) 234 protein:vir:108049 Length: 524 94.4 0.0044 2.7E-06 33.5 27.0 433 1-489 26-524 (524) 235 protein:vir:7208 Length: 524 # 93.4 0.0075 4.7E-06 32.2 26.8 432 1-489 29-524 (524) 236 protein:vir:103458 Length: 524 93.2 0.0083 5.2E-06 32.0 26.9 432 1-489 29-524 (524) 237 protein:vir:6896 Length: 523 # 92.9 0.0094 5.8E-06 31.7 25.5 433 1-489 24-523 (523) 238 protein:vir:1785 Length: 555 # 92.5 0.011 6.9E-06 31.3 18.7 463 1-518 1-555 (555) 239 protein:vir:7321 Length: 556 # 92.2 0.013 7.8E-06 31.0 19.3 451 1-524 30-556 (556) 240 protein:vir:105154 Length: 525 92.0 0.013 8.3E-06 30.8 15.6 427 1-504 31-525 (525) 241 protein:vir:1538 Length: 535 # 91.5 0.015 9.6E-06 30.5 16.3 446 1-522 18-535 (535) 242 protein:vir:6596 Length: 521 # 91.2 0.017 1E-05 30.3 28.3 433 1-489 23-521 (521) 243 protein:vir:98506 Length: 555 90.6 0.02 1.2E-05 
29.9 19.3 457 1-520 31-555 (555) 244 protein:vir:107822 Length: 555 90.6 0.02 1.2E-05 29.9 19.3 457 1-520 31-555 (555) 245 protein:vir:107404 Length: 555 90.6 0.02 1.2E-05 29.9 19.3 457 1-520 31-555 (555) 246 protein:vir:3361 Length: 535 # 90.6 0.02 1.3E-05 29.9 20.4 455 1-522 18-535 (535) 247 protein:vir:78942 Length: 510 90.5 0.02 1.3E-05 29.9 18.1 450 1-504 1-510 (510) 248 protein:vir:6322 Length: 510 # 90.4 0.021 1.3E-05 29.8 16.4 448 1-504 1-510 (510) 249 protein:vir:96988 Length: 516 85.5 0.053 3.3E-05 27.6 17.9 456 1-512 20-516 (516) 250 protein:vir:100598 Length: 516 82.8 0.074 4.6E-05 26.8 26.8 432 1-489 22-516 (516) 251 protein:vir:103330 Length: 517 82.5 0.076 4.7E-05 26.7 15.3 452 1-524 16-517 (517) 252 protein:vir:100039 Length: 522 80.4 0.096 5.9E-05 26.2 20.9 452 1-520 1-522 (522) 253 protein:vir:105641 Length: 516 79.4 0.1 6.5E-05 26.0 17.2 453 1-512 20-516 (516) 254 protein:vir:95315 Length: 559 78.8 0.11 6.8E-05 25.8 20.7 451 1-524 34-559 (559) 255 protein:vir:7017 Length: 515 # 77.3 0.13 7.8E-05 25.5 19.5 451 1-507 19-515 (515) 256 protein:vir:2198 Length: 536 # 76.1 0.14 8.7E-05 25.3 21.0 474 1-523 17-536 (536) 257 protein:vir:102668 Length: 547 75.3 0.15 9.2E-05 25.1 21.6 454 1-524 27-547 (547) 258 protein:vir:10447 Length: 536 73.9 0.16 0.0001 24.9 21.0 474 1-523 17-536 (536) 259 protein:vir:99672 Length: 532 64.5 0.3 0.00019 23.5 16.6 453 1-524 18-532 (532) 260 protein:vir:101418 Length: 569 59.4 0.39 0.00024 22.8 19.1 448 1-522 57-569 (569) 261 protein:vir:80165 Length: 651 58.6 0.41 0.00025 22.7 22.3 464 1-528 40-651 (651) 262 protein:vir:103765 Length: 549 54.5 0.5 0.00031 22.2 21.5 438 1-525 33-549 (549) 263 protein:vir:78696 Length: 542 52.0 0.57 0.00035 21.9 20.6 435 1-526 20-542 (542) 264 protein:vir:95149 Length: 501 50.9 0.6 0.00037 21.8 26.7 417 1-491 11-501 (501) 265 protein:vir:80453 Length: 535 44.8 0.79 0.00049 21.1 24.6 427 1-494 42-535 (535) 266 protein:vir:7430 Length: 563 # 42.4 0.89 0.00055 20.9 29.1 456 1-511 23-563 (563) 
267 protein:vir:94956 Length: 452 37.5 1.1 0.00069 20.3 26.6 399 1-487 1-452 (452) 268 protein:vir:94572 Length: 535 37.0 1.1 0.00071 20.3 16.5 454 1-522 19-535 (535) 269 protein:vir:8883 Length: 543 # 33.1 1.4 0.00086 19.8 23.1 462 1-514 18-543 (543) 270 protein:vir:96783 Length: 488 27.8 1.8 0.0011 19.2 23.1 408 1-474 49-488 (488) 271 protein:vir:78393 Length: 489 23.6 2.3 0.0014 18.6 22.1 412 1-491 31-489 (489) No 1 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=100.00 E-value=3.2e-143 Score=801.76 Aligned_cols=589 Identities=54% Similarity=0.909 Sum_probs=485.7 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) +.|+++|+|+...++... .++|+|||||..|+.++++|++|++||++++++||++||+|+++.+.+.+++..++++.+. T Consensus 24 ~~~~~~~~~~~~~~~~~~-~~~~~p~~~~~~L~~~~e~~~~~~~~i~~~~~~iag~g~~~~~~~~~~~~~~~~~~~~~~~ 102 (651) T protein:vir:99 24 LAKSPNSTQIPDHRIQSH-NVGVNPPYNPDRLAAFLELNETLATGIRKKSRYEVGFGFDLVPAQGVDGDDASDAQREVAR 102 (651) T ss_pred ccccccccccchhhhccc-CCCCCCCCCHHHHHHHHhcChHHHHHHHHHhhhhhccCceeeecccCCCCccchHHHHHHH Confidence 778899999999888654 5799999999999999999999999999999999999999999998888888889999999 Q ss_pred Hhhhccchhhhhhcc-CCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhh Q lcl|NC_021537. 81 DFWYGSDSRWQIGPE-GTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEV 159 (602) Q Consensus 81 ~~~~~~~~~~~l~~~-pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~ 159 (602) .+++.+++.+..... +|..+|+.+|++.++.|++.+||+|++++++..|++++|+++|+..+|+..+............ 
T Consensus 103 ~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll 182 (651) T protein:vir:99 103 NFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGRPVGLAYVPARTVRVRRPQNRFDQPRHPEE 182 (651) T ss_pred HHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccchhhhhhcChhheeeecccccccchhhhhh Confidence 999999988876654 6888999999999999999999999999999999999999999999998765543322111111 Q ss_pred hhcc-------cCceeEEEEcCCcceeeccccccc------------------------ccceeeecccceEEec-Ccee Q lcl|NC_021537. 160 ENIE-------SGHGYVQVRQGRRRYFGEAGDRYG------------------------DDKRFVDKETGEVASD-AGEL 207 (602) Q Consensus 160 ~~~~-------~~~~~~qi~~~~~~~~~~~~~~~~------------------------~~~~~~~~~~g~~~~~-~~~~ 207 (602) ...+ ....|+|++.....|+..++..+. ......+..+|.+... .+.. T Consensus 183 ~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~ 262 (651) T protein:vir:99 183 GRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIRYREDEESEREPIFVDRETGDVTTGDANGL 262 (651) T ss_pred hcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEEeccCcceeeeeecccceeeeEEEcCCCce Confidence 1110 011233333333334433333211 1122334556655543 4567 Q ss_pred EEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhh Q lcl|NC_021537. 208 KNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLK 287 (602) Q Consensus 208 ~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~ 287 (602) .+++++||||||.+++.+++||+||+..+..+|..+.++++++.++|+||++|++||+++++.+++++++++++.|++.. 
T Consensus 263 ~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~~~~~~ls~e~~~~lr~~~~~~~ 342 (651) T protein:vir:99 263 ENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDTIPRMVIKVTGGELSEESKRDLRQMLNGLR 342 (651) T ss_pred eEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHHh Confidence 88999999999999989999999999999999999999999999999999999999999988899999999999999865 Q ss_pred cccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH Q lcl|NC_021537. 288 GSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ 367 (602) Q Consensus 288 g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~ 367 (602) + |+|++++++.+.... ..+.+.+++|+||+.++++|+||+|++++++.+||++|||||.+||+.+++|+||+|++ T Consensus 343 ~--nagk~~vL~~~~~~~---~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~ 417 (651) T protein:vir:99 343 E--ESHRAVVLEVEKFQS---QLDEDVEIELEPMGQGISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQ 417 (651) T ss_pred c--cCCceEEeecccccc---cccccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHH Confidence 4 788999887654322 33457899999999888889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCCccccccce--EEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCC Q lcl|NC_021537. 368 TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEW--TIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLA 445 (602) Q Consensus 368 ~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~--~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~ 445 (602) .+.|+++||+||+++||++||++|++..+...++ +++|+..++++. 
|.+.+++++.+++++|+||+||+|+++||| T Consensus 418 ~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~llr~--D~~~~~e~~~~~i~~G~~T~NE~R~~lglp 495 (651) T protein:vir:99 418 DKDFALEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGADQPKQ--EAQLAEQRVRAMRLAGVGLVDEAREELGLD 495 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCccccccCceEEEEeccchhhhc--cHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 9999999999999999999999999998877765 455666666655 888889999999999999999999999999 Q ss_pred CCCCCcccccccccccc-ccccccCCCcCccccccccccccccccccccccccc---ccccchhhhhcchhhhhhheecc Q lcl|NC_021537. 446 PFEDDRGDMTLSEFEAE-FGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVD---VSKDPIEQTTFSSSNLDEGLYDF 521 (602) Q Consensus 446 p~~~g~~d~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~m~~~~v~ss~~~~~~yd~ 521 (602) |+++++++.++.+.... .+...++++. ....+++.+++..++...... ..++.|+|++|+||+|+|+|||+ T Consensus 496 pi~~~~gd~~l~~~~~~~~g~~~~gge~-----~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~v~ss~~~~~gyd~ 570 (651) T protein:vir:99 496 PLGEPYGEMTLSEFEAEVAGDVAGGGET-----EAVHEPPEENKIGEREWDTVKSELTTKDPIEQMQFSSSNLDEGLYDF 570 (651) T ss_pred CCCCccccccccccccccccccccCCCC-----cccccCccccccccchhhhhhhhhcccchhhhhhHHHHHHHhhcCCC Confidence 99987777665443322 2322222222 222223333333333332221 24579999999999999999999 Q ss_pred cccEEEEEEecccCCcceeeeccCCHHHHHHHhCCCccchhhhhhhcccccccccccchhcccCCCCCChhhcCCccccc Q lcl|NC_021537. 
522 GERELYLSFKRESGQNSLYVYVDVPAAVWSALVSAPSAGSYHYSEIRLQYGYLEVTNNHERLPEGPTPDPGEAPEDVPSD 601 (602) Q Consensus 522 ~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~~~~~a~s~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 601 (602) ++++|+|+|+.+.++|+||+|++||+++|++|++|+|+|+|||++||++|+|+||++.|+|||+++.||++|||++||++ T Consensus 571 ~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~~~~~a~s~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 650 (651) T protein:vir:99 571 GENELYLSFLRDEGQSSLYAYVDVPASEWSALANAGSHGGYHYDNIRLEYPYLEITNFHDRLPEGPAPDAGDVPDGVPDE 650 (651) T ss_pred ccceEEEEEeecCCCCceeeeeCCCHHHHHHHhcCcccceeehhccccccchhhhhhhhhhCCCCCCCCcCCCCCCCccc Confidence 99999999997777899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred C Q lcl|NC_021537. 602 I 602 (602) Q Consensus 602 ~ 602 (602) | T Consensus 651 ~ 651 (651) T protein:vir:99 651 I 651 (651) T ss_pred C Confidence 9 No 2 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=100.00 E-value=8.2e-86 Score=487.04 Aligned_cols=452 Identities=37% Similarity=0.649 Sum_probs=341.2 Q ss_pred HHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHH Q lcl|NC_021537. 32 LAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQ 111 (602) Q Consensus 32 l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~ 111 (602) ||+++++|++|++||++||++||++||+|+++.+.+.........+.+..++..+.+.......++.++|+.+||+.++. 
T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 99999999999999999999999999999988765554444455555555555554444444455566788999999999 Q ss_pred HHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccce Q lcl|NC_021537. 112 DYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKR 191 (602) Q Consensus 112 d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~ 191 (602) +++++||||++++|+..|++++|+||||.+|++..+ +..|++...+...|+..++..+..... T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 143 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMD-----------------ERGFVQLLEEKEKYFGVAGDRYQTNGN 143 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeee-----------------cceeEeecCCceeeEEeccccceeecc Confidence 999999999999999999999999999999997654 345677777777777776654433221 Q ss_pred e-eecccce-EEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc Q lcl|NC_021537. 192 F-VDKETGE-VASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG 269 (602) Q Consensus 192 ~-~~~~~g~-~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 269 (602) . .....+. 
.....+....+++++|||+|.+++.+++||+||+.+++.++..+.++++++.++|+||++|+|+|+++++ T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 223 (467) T protein:vir:31 144 GDLDPVFVDADDGSTGTSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGA 223 (467) T ss_pred cceeeeeeeeccccccceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCc Confidence 1 1111111 1123456789999999999999999999999999999999999999999999999999999999999888 Q ss_pred cCCHHHHHHHHHHHHHhh------------cccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHh Q lcl|NC_021537. 270 TLSEDSKEDLRNLMDNLK------------GSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRER 337 (602) Q Consensus 270 ~~~~~~~~~l~~~~~~~~------------g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~ 337 (602) .+++++.+++++.|++.. |..|+++++++..|.++. +.+++++||+.++++|+||++++++ T Consensus 224 ~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~-------~~~~~~~~ls~~~~~d~qf~e~~~~ 296 (467) T protein:vir:31 224 ELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRS-------DVEIRLEPLTVGIDEEASFLEFRGR 296 (467) T ss_pred CCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCccc-------ccceeEEeccccChhhHHHHHHHHH Confidence 899999999999997644 456788888877765544 5678999999999999999999999 Q ss_pred hHHHHHHHhcCChHHhhccccCCc-cCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHH Q lcl|NC_021537. 338 NEHEIAKVHGVPPVLINVTSTSNR-ANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQD 416 (602) Q Consensus 338 ~~~~Ia~~fgVPp~~lg~~~~~~~-sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d 416 (602) ++++||++|||||.+||+.+++++ +|++++.+.|+++||+|+++.|+++||.+|++..+...+++++|+++.+++. 
| T Consensus 297 ~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~--d 374 (467) T protein:vir:31 297 NEHDILKVHDVPPVIAGVVESGAFSTDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTK--L 374 (467) T ss_pred HHHHHHHHhCCCHHHcccCCCCCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhcc--C Confidence 999999999999999999887776 6899999999999999999999999999999988888899999999999877 6 Q ss_pred HHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCccccccccccccccccccccccc Q lcl|NC_021537. 417 AKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVD 496 (602) Q Consensus 417 ~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (602) .+.++++++.++++|++|+||+|+++||+|+++++ .+ ..... . ....++..+....+...+++.+++..+.. T Consensus 375 ~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~--~~-~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 446 (467) T protein:vir:31 375 QDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEEH--VY-GGETL-V-AEVTGGSGPGGGIGDQIEQLVEDRADEII--- 446 (467) T ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCccc--cc-CCccc-c-cccccccCCCCcccCcCCCCCCCcccchH--- Confidence 67778889999999999999999999999996543 21 11100 0 01111111111111111111111111100 Q ss_pred ccccccchhhhhcchhhhhhheecccc Q lcl|NC_021537. 497 VDVSKDPIEQTTFSSSNLDEGLYDFGE 523 (602) Q Consensus 497 ~~~~~~~m~~~~v~ss~~~~~~yd~~~ 523 (602) ... 
...+++.....+|-.+++ T Consensus 447 -~~~-----~~~~~~~~~~~~~~~~~~ 467 (467) T protein:vir:31 447 -DSY-----QADLETEQLIEIGANADS 467 (467) T ss_pred -hhh-----hhccccchhhhhccccCC Confidence 000 011122233333433333 No 3 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=100.00 E-value=5.7e-84 Score=476.96 Aligned_cols=395 Identities=16% Similarity=0.152 Sum_probs=308.3 Q ss_pred CCC---CcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHH Q lcl|NC_021537. 1 MSK---AEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQ 77 (602) Q Consensus 1 ~~k---~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~ 77 (602) ..+ ...++.+..-.++.. +..-...+++. .+..+++|++||++||++||++||+++++.+++..... T Consensus 24 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~v~~~----~al~~~~v~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~----- 93 (424) T protein:vir:18 24 FVGGRLVTPNQGSQTGPVSAH-GHLGDSSINDE----RILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV----- 93 (424) T ss_pred hcccccccccccccccccccc-cccccccccHH----HhhccHHHHHHHHHHHHhhccCceEEEEeecCCceeee----- Confidence 101 011111111111110 00001123332 23446889999999999999999999987665543211 Q ss_pred HHHHhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccc Q lcl|NC_021537. 78 TVRDFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDG 156 (602) Q Consensus 78 ~~~~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~ 156 (602) ...|+++.++. +||+.||+.+||+.++.+++++||+|++++|+..|++++|+||+|.+|++..+.. 
T Consensus 94 ------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V~v~~~~~------- 160 (424) T protein:vir:18 94 ------DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK------- 160 (424) T ss_pred ------ccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC------- Confidence 12366776664 7999999999999999999999999999999999999999999999998643211 Q ss_pred hhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHH Q lcl|NC_021537. 157 EEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAA 236 (602) Q Consensus 157 ~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~ 236 (602) ..+ +.+..+|...+|+++||||+|+++ .++++|+||+..+ T Consensus 161 ---------~~~------------------------------y~~~~~g~~~~~~~~eIih~r~~~-~dg~~G~spi~~~ 200 (424) T protein:vir:18 161 ---------KVV------------------------------YRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFA 200 (424) T ss_pred ---------eEE------------------------------EEEEeCCeEEEeccccEEEecCcC-CCCcccccHHHHH Confidence 001 111234566789999999999886 5789999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccccccccc Q lcl|NC_021537. 237 MQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNI 316 (602) Q Consensus 237 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~ 316 (602) ..+|..+.++++++.++|+||++|++||++++..+++++++++++.|++..++.|+|+++++++|++ T Consensus 201 ~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~g~nag~~~vl~~g~~------------- 267 (424) T protein:vir:18 201 CKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFS------------- 267 (424) T ss_pred HHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccCCceeccCCce------------- Confidence 9999999999999999999999999999998878899999999999999889899999999877654 Q ss_pred ccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCc--cCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc Q lcl|NC_021537. 
317 ELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNR--ANSKEQTREFAKGIIEPEQAKFSARLYKIIHQD 394 (602) Q Consensus 317 ~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~ 394 (602) |++++ ++++|+||+|++++++++||++|||||++||+.+.+++ ||+|++.+.|+++||+||++.||++||++|+++ T Consensus 268 -~~~l~-~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~ 345 (424) T protein:vir:18 268 -TSAIG-VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPA 345 (424) T ss_pred -EEecC-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Confidence 55554 45689999999999999999999999999999887765 899999999999999999999999999999998 Q ss_pred cccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCc Q lcl|NC_021537. 395 ALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEA 474 (602) Q Consensus 395 ~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~ 474 (602) .+. .+++++||++++++. |.+.+++++.+++++|+||+||+|+++||||+|||+ ..+++.+++++.......+ T Consensus 346 ~~~-~~~~~~fd~~~llr~--d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGD-~~~~~~n~~~l~~~~~~~~--- 418 (424) T protein:vir:18 346 KDV-GRIHAEHNLDGLLRG--DSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD-VAMRQSQYVPITDLGTNKE--- 418 (424) T ss_pred ccc-CCeEEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-eeeeccCccchHhhhccCC--- Confidence 775 578999999999876 778889999999999999999999999999998864 4455666665542111110 Q ss_pred cccccccccccccc Q lcl|NC_021537. 
475 MLTRSKAAPPLENK 488 (602) Q Consensus 475 ~~~~~~~~~~~~~~ 488 (602) |..+.+ T Consensus 419 --------p~~~ga 424 (424) T protein:vir:18 419 --------PRNNGA 424 (424) T ss_pred --------CccCCC Confidence 000111 No 4 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=100.00 E-value=1.1e-83 Score=475.45 Aligned_cols=491 Identities=18% Similarity=0.204 Sum_probs=331.4 Q ss_pred CCCCcc-c---ccccchhh-hcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhh Q lcl|NC_021537. 1 MSKAEE-T---TQLDERHI-ATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGES 75 (602) Q Consensus 1 ~~k~~~-~---~~~~~~~~-~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~ 75 (602) +.-+.. . ..+.+..+ ++..+.-+.. +.......+..+++|++||++||++||++||+++.+.+...... T Consensus 8 ~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~--~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~~~~~~~~~---- 81 (518) T protein:vir:78 8 TLSAPAMAELSPQMQDSYYYAPAVGMQLER--QFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETEE---- 81 (518) T ss_pred eeccchhhhhhhhhhhcccccceeceeccc--ccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEEEcCCccccc---- Confidence 111111 0 11111011 1111111222 23334466778999999999999999999999998765443221 Q ss_pred HHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccccccc Q lcl|NC_021537. 76 YQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERED 155 (602) Q Consensus 76 ~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~ 155 (602) .+++.+.|+.+||++||+.+||+.++.+++++||+|++++|+..|++++|+||+|.+|++..+... 
T Consensus 82 ---------~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~----- 147 (518) T protein:vir:78 82 ---------HDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT----- 147 (518) T ss_pred ---------cchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCC----- Confidence 235566788899999999999999999999999999999999999999999999999987554211 Q ss_pred chhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHH Q lcl|NC_021537. 156 GEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVA 235 (602) Q Consensus 156 ~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~ 235 (602) +..++++... ...++..++|+++||||+|++++.+..+|+||+.. T Consensus 148 ---------~~~~y~~~~~--------------------------~~~~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~ 192 (518) T protein:vir:78 148 ---------GRYEYYFQAG--------------------------AGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMES 192 (518) T ss_pred ---------CEEEEEEEec--------------------------CCccceeEEecCCcEEEecCCCCCcccccccHHHH Confidence 1111111100 01233556899999999999987666799999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-hhcccccCcceeccCCccceecccccccc Q lcl|NC_021537. 236 AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-LKGSRYRTAILEVEEFVDDHGLGDGGSDV 314 (602) Q Consensus 236 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~g~~~~~~~~~~~~~ 314 (602) +...|....++++++.++|+||++|+++|++++ .+++++.+++++.|++ +.|..|+|++++++.|+ T Consensus 193 ~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~-~ls~e~~~~~k~~~~~~~~G~~nag~~~vL~~G~------------ 259 (518) T protein:vir:78 193 LKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK-RLSPEAQQRLREQFDRAHAGSSNTGKTMVVEEGM------------ 259 (518) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccEEEecCC-CCCHHHHHHHHHHHHHHhcCcccCCceeEcCCCc------------ Confidence 999999999999999999999999999999875 5899999999999976 45668999999987765 Q ss_pred ccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc Q lcl|NC_021537. 
315 NIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQD 394 (602) Q Consensus 315 ~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~ 394 (602) +|++++ ++++|+||+|++++++++||++|||||++||+.+++|++|+|++++.|+++||+||+.+||++||++|++. T Consensus 260 --~~~~l~-~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~f~~~tL~P~~~~ie~eln~~L~~~ 336 (518) T protein:vir:78 260 --EPIPLQ-LTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQY 336 (518) T ss_pred --eEEecc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 455665 45689999999999999999999999999999999999999999999999999999999999999999987 Q ss_pred cccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccc-cccccccccccccCCCcC Q lcl|NC_021537. 395 ALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMT-LSEFEAEFGADASDGDAE 473 (602) Q Consensus 395 ~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~-~~~~~~~~~~~~~~~~~~ 473 (602) .+. +++++|+.+.++++ |.+.+++++.+++++|+||+||+|+++||+|++++.+|.+ ++.++.+++....+.. . T Consensus 337 ~~~--~~~~~fd~~~Llr~--D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~-~ 411 (518) T protein:vir:78 337 WVR--KNRMKFDIDDVIQP--DWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAV-E 411 (518) T ss_pred ccC--cceEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceeccccccccc-C Confidence 653 67899999999877 7778889999999999999999999999999997776764 5556665543322211 1 Q ss_pred cccccccccccccccccccccccccccccchhhhhcchhhhhhheecccccEEEEEEecccCCcceeeeccCCHHHHHHH Q lcl|NC_021537. 474 AMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFGERELYLSFKRESGQNSLYVYVDVPAAVWSAL 553 (602) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~~~ 553 (602) +...+...++.... 
....+.......+...+.++.....-|-+...+ + T Consensus 412 g~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------------------~ 459 (518) T protein:vir:78 412 GEEAPAPKRPASTP----VASLDQSPPASVPGLSPTNSDRSTDSGKTEPRR----------------------------L 459 (518) T ss_pred CCCCCCCCCCCccc----ccccccCccccCCCCCcccccccccccccchhc----------------------------c Confidence 11111111111100 001111112222333333444443333332222 1 Q ss_pred hCCCccchhhhhhhcccccccccccchhcccCCCCCChhhcCCcccccC Q lcl|NC_021537. 554 VSAPSAGSYHYSEIRLQYGYLEVTNNHERLPEGPTPDPGEAPEDVPSDI 602 (602) Q Consensus 554 ~~a~s~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 602 (602) |.-+|---|...|.|-.-.+--++...+-+++- -++.-|+++-+-+ T Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~ 505 (518) T protein:vir:78 460 MQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQ---LAEKYPDDLEDIL 505 (518) T ss_pred cCCCCcccccchHHHHHHHHhhcCCcchhhhhh---hhhhcchhHHHHH Confidence 111222222222222111111111110000000 0111222222111 No 5 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=100.00 E-value=1.9e-83 Score=474.04 Aligned_cols=394 Identities=15% Similarity=0.126 Sum_probs=303.8 Q ss_pred CCCCcc--cccccchhhhcccCccccC--CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhH Q lcl|NC_021537. 1 MSKAEE--TTQLDERHIATDVGRGIQP--PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESY 76 (602) Q Consensus 1 ~~k~~~--~~~~~~~~~~~~~~~~i~p--~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~ 76 (602) =+|+-+ ++......+. ..+.... .+++ +.+..+++|++||++||++||++||+++.+.+.+... 
T Consensus 24 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~vs~----~~al~~~~v~~cv~~Ia~~iA~lp~~v~~~~~~~~~~------ 91 (424) T protein:vir:45 24 RSKSLENPSTPITGDAVD--TDGLFRADVYVSP----ETAMKLAAVYSCIYVLSSSLAQMPLHVMRRHKGKVEP------ 91 (424) T ss_pred cccCCCCCccccchhhhh--hhccccCCceech----HHhhccHHHHHHHHHHHHHHhhCceEEEEecCCceee------ Confidence 112111 1111111111 1111111 1222 2244568899999999999999999998765433221 Q ss_pred HHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccccccc Q lcl|NC_021537. 77 QTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERED 155 (602) Q Consensus 77 ~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~ 155 (602) ...|+++.++ .+||+.||+.+||+.++.+++++||+|++++|+..|++++|+||+|..|++..+.. T Consensus 92 -------~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~~~~------ 158 (424) T protein:vir:45 92 -------ARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNTGG------ 158 (424) T ss_pred -------cccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEEcCC------ Confidence 1236667666 48999999999999999999999999999999999999999999999987643211 Q ss_pred chhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHH Q lcl|NC_021537. 156 GEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVA 235 (602) Q Consensus 156 ~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~ 235 (602) ..++ .+ ...+....++++||||+|.++ .++++|+||+.. T Consensus 159 ----------~~~y-----------------------------~~-~~~~~~~~~~~~eVih~r~~~-~d~~~G~spi~~ 197 (424) T protein:vir:45 159 ----------RYTY-----------------------------GL-YNEYGAFAISPDDMIHIRALG-NNQKMGLSPIMQ 197 (424) T ss_pred ----------eEEE-----------------------------EE-EecCceEEECcccEEEecCcC-CCCcccccHHHH Confidence 0001 11 112334679999999999886 578999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhh-cc-cccCcceeccCCccceeccccccc Q lcl|NC_021537. 
236 AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLK-GS-RYRTAILEVEEFVDDHGLGDGGSD 313 (602) Q Consensus 236 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~-g~-~nag~~~~~~~g~~~~~~~~~~~~ 313 (602) +..+|..+.++++++.++|+||++|++||++++ .+++++.+++++.|++.. |. +|+|+++++++|+ T Consensus 198 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~~g~----------- 265 (424) T protein:vir:45 198 HAETIGMGMSGQKYTESFFSGNARPAGIVSVKS-GLNKESWGWLKDQWQKASQALRRQENKTMLLPADL----------- 265 (424) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-CCCHHHHHHHHHHHHHHhccccccCCceeEcCCCc----------- Confidence 999999999999999999999999999999986 489999999999997654 53 6899999987665 Q ss_pred cccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC Q lcl|NC_021537. 314 VNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQ 393 (602) Q Consensus 314 ~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~ 393 (602) +|++++ ++++|+||+|++++++++||++|||||++||+.++++++|+|++.+.|+++||+||++.||++||.+|++ T Consensus 266 ---~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~kLl~ 341 (424) T protein:vir:45 266 ---DYKALT-VSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMPWVTNWEQELNRRLFT 341 (424) T ss_pred ---eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 455565 4568999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcC Q lcl|NC_021537. 394 DALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAE 473 (602) Q Consensus 394 ~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~ 473 (602) ..+...+++++||.+.+++. |.+.+++++++++++|+||+||+|+++|+||++||+ ..+.+.|+.....+.+..... 
T Consensus 342 ~~e~~~g~~i~fd~~~llr~--d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~ggD-~~~~~~n~~~~~~~~~~~~~~ 418 (424) T protein:vir:45 342 RAELAAGYYVRFNLTGLLRG--TPQERAQFYHFAITDGWMSRNEARAFEDMNPVEGLD-EMLVSVNAANPAGDFKPPKND 418 (424) T ss_pred hhhhcCCcEEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-eeeecccccccccccCCCCCC Confidence 98888889999999999877 778888999999999999999999999999999864 334444444322211111111 Q ss_pred cccccc Q lcl|NC_021537. 474 AMLTRS 479 (602) Q Consensus 474 ~~~~~~ 479 (602) +..++. T Consensus 419 ~~~~~~ 424 (424) T protein:vir:45 419 EGKTNE 424 (424) T ss_pred CCCCCC Confidence 111000 No 6 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=100.00 E-value=1.8e-83 Score=474.22 Aligned_cols=489 Identities=17% Similarity=0.190 Sum_probs=331.5 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) .+|+.....++. .++..+.-+. ........++..+++|++||++||++||++||+++.+.+.+.... T Consensus 15 ~e~~~~~~~~~~--~~~~~~~~~~--~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~~~~~~~~~--------- 81 (518) T protein:vir:10 15 AELSPQMQDSYY--YAPAVGMQLE--RQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETEE--------- 81 (518) T ss_pred hhhhhhhhcccc--cccccceecc--cccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEEEcCCCceec--------- Confidence 111111111111 1111111122 122334456778899999999999999999999998766543221 Q ss_pred HhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhhh Q lcl|NC_021537. 
81 DFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEVE 160 (602) Q Consensus 81 ~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~ 160 (602) ..++.+.++.+||++||+.+||+.++.+++++||+|++++|+.+|++++|+||+|+.|++..+... T Consensus 82 ----~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~~---------- 147 (518) T protein:vir:10 82 ----SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT---------- 147 (518) T ss_pred ----cchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCCC---------- Confidence 235567788899999999999999999999999999999999999999999999999987654211 Q ss_pred hcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHHH Q lcl|NC_021537. 161 NIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTM 240 (602) Q Consensus 161 ~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i 240 (602) +..++.+..+. ..++...+|+++||||+|++++.+..+|+||+..+..+| T Consensus 148 ----~~~~y~~~~~~--------------------------~~~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i 197 (518) T protein:vir:10 148 ----GRYEYYFQAGA--------------------------GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTI 197 (518) T ss_pred ----CEEEEEEEecC--------------------------CccceEEEecCCcEEEecCCCCCcccccccHHHHHHHHH Confidence 11111111000 112345689999999999998776679999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-hcccccCcceeccCCccceeccccccccccccc Q lcl|NC_021537. 241 GADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-KGSRYRTAILEVEEFVDDHGLGDGGSDVNIELE 319 (602) Q Consensus 241 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~ 319 (602) ....++++++.++|+||++|+|||++++ .+++++.+++++.|++. 
.|..|+|++++++.|+ +|+ T Consensus 198 ~~~~a~~~~~~~~f~ng~~p~gil~~~~-~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~G~--------------~~~ 262 (518) T protein:vir:10 198 FSEDSSRNATAAMWKNAGRPNLVLRHEK-RLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGM--------------EPI 262 (518) T ss_pred HHHHHHHHHHHHHHhcCCCccEEEecCC-CCCHHHHHHHHHHHHHHhcCccccCcceEcCCCc--------------eEE Confidence 9999999999999999999999999876 48999999999999764 5668999999987765 455 Q ss_pred cccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccccc Q lcl|NC_021537. 320 PIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVD 399 (602) Q Consensus 320 pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~ 399 (602) +++ ++++|+||+|++++++++||++|||||++||+.+.+|++|+|++.+.|+++||+||+..||++||++|++..+. T Consensus 263 ~l~-~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~-- 339 (518) T protein:vir:10 263 PLQ-LTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVR-- 339 (518) T ss_pred Ecc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccC-- Confidence 665 45689999999999999999999999999999999999999999999999999999999999999999987653 Q ss_pred ceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccc-cccccccccccccCCCcCccccc Q lcl|NC_021537. 400 EWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMT-LSEFEAEFGADASDGDAEAMLTR 478 (602) Q Consensus 400 ~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~-~~~~~~~~~~~~~~~~~~~~~~~ 478 (602) +++++|+.+.+++. |.+.+++++.+++++|+||+||+|+++||+|++++++|.+ ++.++.+++....+... 
+...+ T Consensus 340 ~~~~~fd~~~llr~--D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~-g~~~~ 416 (518) T protein:vir:10 340 KNRMKFDIDDVIQP--DWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVE-GEEAP 416 (518) T ss_pred CceEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccceecccccccccC-CCCCC Confidence 67899999999877 7788889999999999999999999999999987666654 55566655433222211 11111 Q ss_pred ccccccccccccccccccccccccchhhhhcchhhhhhheecccccEEEEEEecccCCcceeeeccCCHHHHHHHhCCCc Q lcl|NC_021537. 479 SKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFGERELYLSFKRESGQNSLYVYVDVPAAVWSALVSAPS 558 (602) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~~~~~a~s 558 (602) ...++.... ....+.......+...+.++......|-+... . +|.-+| T Consensus 417 ~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------------~----~~~~~~ 464 (518) T protein:vir:10 417 APKRPASTP----VASLDQSPPTSVPGLSPTNSDRSTDSGKTEPR------------------------R----LMQKPP 464 (518) T ss_pred CCCCCCccc----cccccccccccCCCCCcccccccccccccchh------------------------c----cccCCC Confidence 111111100 00011111122222222233222222222111 1 223333 Q ss_pred cchhhhhhhcccccccccccchhcccCCCCCChhhcCCcccccC Q lcl|NC_021537. 
559 AGSYHYSEIRLQYGYLEVTNNHERLPEGPTPDPGEAPEDVPSDI 602 (602) Q Consensus 559 ~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 602 (602) --.|...|.|-.-.+--++...+-++.- -++.-|+++-+-+ T Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~ 505 (518) T protein:vir:10 465 PKESSPKHLRAVKGAMGRGKDIKGFALQ---LAEKYPDDLEDIL 505 (518) T ss_pred cccccchHHHHHHHHhhcCccchhHhhh---hhhhcchhHHHHH Confidence 3333333333221111111111111100 0112222222111 No 7 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=100.00 E-value=5.1e-83 Score=471.72 Aligned_cols=396 Identities=15% Similarity=0.149 Sum_probs=307.5 Q ss_pred CCCCcccccccchhhhc-ccCcccc-CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIAT-DVGRGIQ-PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQT 78 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~-~~~~~i~-p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~ 78 (602) ..+.............. ...++.. ..++.. -+..+++|++||++||++||++||+++.+..++.... T Consensus 24 f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~----~al~~~~v~~cv~~Ia~~iA~lp~~vy~~~~~~~~~~------- 92 (424) T protein:vir:18 24 FVGGRLVTPNQGSQTGPVSAHGYLGDSSINDE----RILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK------- 92 (424) T ss_pred ccccccccccchhhccccccccccccccccHH----HhhccHHHHHHHHHHHHhhccCceEEEEeccCCceee------- Confidence 11111010000000000 0001111 112332 2344678999999999999999999988766543221 Q ss_pred HHHhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccch Q lcl|NC_021537. 79 VRDFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGE 157 (602) Q Consensus 79 ~~~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~ 157 (602) +...|+++.++. +||+.||+.+||+.++.+++++||||++++|+..|++++|+||+|.+|++..+.. 
T Consensus 93 ----~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~-------- 160 (424) T protein:vir:18 93 ----VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-------- 160 (424) T ss_pred ----eccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC-------- Confidence 112366666664 7999999999999999999999999999999999999999999999998643211 Q ss_pred hhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHH Q lcl|NC_021537. 158 EVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAM 237 (602) Q Consensus 158 ~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~ 237 (602) ..++ .+..+|...+|+++||||+|+++ .++++|+||+..+. T Consensus 161 --------~~~y------------------------------~~~~~g~~~~~~~~eVihir~~~-~dg~~G~spi~~~~ 201 (424) T protein:vir:18 161 --------KVVY------------------------------RYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFAC 201 (424) T ss_pred --------eEEE------------------------------EEEeCCeEEEeccccEEEecCcC-CCCcccccHHHHHH Confidence 1111 11234566789999999999886 67899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccc Q lcl|NC_021537. 238 QTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIE 317 (602) Q Consensus 238 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~ 317 (602) .+|..+.++++++.++|+||++|+++|++++..+++++++++++.|++..|+.|+|+++++++|++ T Consensus 202 ~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~nag~~~vl~~g~~-------------- 267 (424) T protein:vir:18 202 KSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFS-------------- 267 (424) T ss_pred HHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccCCceeccCCce-------------- Confidence 999999999999999999999999999998877899999999999999999999999999877654 Q ss_pred cccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCc--cCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcc Q lcl|NC_021537. 
318 LEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNR--ANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDA 395 (602) Q Consensus 318 ~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~ 395 (602) |++++ +++.|+||+|++++++++||++|||||++||+.+.+++ ||+|++.+.|+++||.|+++.||++||++|+++. T Consensus 268 ~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P~~~~ie~~ln~~L~~~~ 346 (424) T protein:vir:18 268 TSAIG-VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPSK 346 (424) T ss_pred EEecC-CChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc Confidence 55554 45689999999999999999999999999999887765 8899999999999999999999999999999987 Q ss_pred ccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCcc Q lcl|NC_021537. 396 LDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAM 475 (602) Q Consensus 396 ~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~ 475 (602) +. .+++++||++++++. |.+.+++++.+++++|+||+||+|+++|+||+|||+ ..+++.+++++........ T Consensus 347 ~~-~~~~~~fd~~~llr~--d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD-~~~~~~n~~~l~~~~~~~~---- 418 (424) T protein:vir:18 347 DV-GRLHAEHNLDGLLRG--DSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGD-VAMRQAQYVPITDLGTNKE---- 418 (424) T ss_pred cc-CCeEEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-eeeeccCccchhhhhccCC---- Confidence 76 579999999999877 778889999999999999999999999999998864 4455666665532111100 Q ss_pred ccccccccccccc Q lcl|NC_021537. 
476 LTRSKAAPPLENK 488 (602) Q Consensus 476 ~~~~~~~~~~~~~ 488 (602) +..+.+ T Consensus 419 -------~~~n~a 424 (424) T protein:vir:18 419 -------PRNNGA 424 (424) T ss_pred -------ccccCC Confidence 000011 No 8 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=100.00 E-value=1e-82 Score=470.08 Aligned_cols=432 Identities=14% Similarity=0.110 Sum_probs=310.6 Q ss_pred CCCCcccc-cccch----hhhc---ccCcc--ccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcc Q lcl|NC_021537. 1 MSKAEETT-QLDER----HIAT---DVGRG--IQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPD 70 (602) Q Consensus 1 ~~k~~~~~-~~~~~----~~~~---~~~~~--i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~ 70 (602) -+|.+... ..... .|.+ ..++. ..-.+++.. +..++.|++||++||++||++||+++.+..++... T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~----al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~g~~~ 83 (454) T protein:vir:93 8 TRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEA----VLSFHAVFACISLISQDIAKMRLRLMQTDAQGIRR 83 (454) T ss_pred CcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHH----hhccHHHHHHHHHHHHhhccCceEEEEeccCCccc Confidence 22222111 11110 0110 00110 001234432 33468899999999999999999999876554322 Q ss_pred cchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 71 EGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) +. ..++.+.|+.+||+.||+.+||+.++.+++++||+|++++|+.+|++.+|+||+|++|++..+.. 
T Consensus 84 ~~------------~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~- 150 (454) T protein:vir:93 84 ET------------RRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADD- 150 (454) T ss_pred hh------------hhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCC- Confidence 11 23556778889999999999999999999999999999999999999999999999998754321 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGV 230 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~ 230 (602) |..++.+..... ...+...+++++||||+|...+.++++|+ T Consensus 151 --------------g~~~y~~~~~~~-------------------------~~~~~~~~~~~~eViH~k~~~~~~~~~G~ 191 (454) T protein:vir:93 151 --------------GEVFYRITPDRN-------------------------CGITEAVTVPAREVIHDRFNCFFHPLIGL 191 (454) T ss_pred --------------CcEEEEEEeccc-------------------------cccceeEEecCcceEEeccCCCCCCceec Confidence 122222111100 01133567999999999987778999999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccc Q lcl|NC_021537. 231 PDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDG 310 (602) Q Consensus 231 spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~ 310 (602) ||+..+...+....++++++.++|+||++|+++|++++ .+++++.+++++.|++..++.|+|+++++++|+ T Consensus 192 sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~-------- 262 (454) T protein:vir:93 192 PPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPG-SITEENAKKLKSNWDSGYTGENAGKTAILSNGA-------- 262 (454) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC-CCCHHHHHHHHHHHHHHhcccccCCceeccCCc-------- Confidence 99999999999999999999999999999999999986 589999999999999888889999999987765 Q ss_pred ccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021537. 
311 GSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKI 390 (602) Q Consensus 311 ~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~ 390 (602) +|++++ ++++|+||+|++++++++||++|||||++||+.++++++|+|++.+.|+++||.||++.||++||.+ T Consensus 263 ------~~~~l~-~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~ 335 (454) T protein:vir:93 263 ------KYNPTT-FSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQCLQTLIESIELLLDEA 335 (454) T ss_pred ------eEEEcc-cChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 455555 4568999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCC Q lcl|NC_021537. 391 IHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDG 470 (602) Q Consensus 391 Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~ 470 (602) |++.. +++++|+++.+++. |.+.+++++.+++++|+||+||+|+++||+|++||+ ..+++.++.......+.. T Consensus 336 L~~~~----~~~~~f~~~~ll~~--D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD-~~~~~~~~~~~~~~~~~~ 408 (454) T protein:vir:93 336 LETGE----NESTEFDVTTLLRM--DSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGD-ALYLQQQNYSLEALSRRD 408 (454) T ss_pred hcCCC----CcEEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eeeeccCccchHhhhccC Confidence 98753 46899999999876 777888999999999999999999999999999874 344555554443222111 Q ss_pred CcCcccc---ccccc--ccccccccccccccccccccchhhhhcchhhhhhheecccccEEEEEEec Q lcl|NC_021537. 471 DAEAMLT---RSKAA--PPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFGERELYLSFKR 532 (602) Q Consensus 471 ~~~~~~~---~~~~~--~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~~~l~~~f~~ 532 (602) +...+.. ....+ +......++... 
....|+....+.=.|++ T Consensus 409 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~~---------------------e~~~d~~~~~~~~~~~~ 454 (454) T protein:vir:93 409 AREDPFASSGKTASVPQAVAASDGNKAIT---------------------ETEHDAVKAMFRGILKK 454 (454) T ss_pred cccCCCCCCccCCCCCCCCCCCCCCCCcc---------------------CCccchhhhhhhhhhcC Confidence 1111100 00000 000001011000 00011111111111211 No 9 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=100.00 E-value=6.6e-83 Score=471.12 Aligned_cols=411 Identities=15% Similarity=0.121 Sum_probs=314.8 Q ss_pred CCCCcccccccc----hhhhcccCccccC---CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccch Q lcl|NC_021537. 1 MSKAEETTQLDE----RHIATDVGRGIQP---PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGG 73 (602) Q Consensus 1 ~~k~~~~~~~~~----~~~~~~~~~~i~p---~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~ 73 (602) ..|.+.+..... ..+..-.+ ..+ .++. ..+..+++|++||++||++||++||+++.+.+++.... T Consensus 13 ~~~r~~~~~~~~~~~~~~~~~~~g--~~~~~~~v~~----~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~~-- 84 (432) T protein:vir:10 13 FEKRQTSQVIELNKDDEKLLEWLG--ISPSTISVKG----KNALKVATVFACIKILSESVSKLPLKIYQEDEYGIQRG-- 84 (432) T ss_pred ccccCcccccccCCchHHHHHHhC--CCcCccccch----hhhhccHHHHHHHHHHHHhhccCceEEEEecCCceeec-- Confidence 222222211111 11111111 111 1222 22345789999999999999999999988765442211 Q ss_pred hhHHHHHHhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccc Q lcl|NC_021537. 74 ESYQTVRDFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIE 152 (602) Q Consensus 74 ~~~~~~~~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~ 152 (602) ..|++..++. +||+.||+.+|++.++.+++++||+|++++|+..|++++|+||+|++|++..+..... 
T Consensus 85 -----------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~ 153 (432) T protein:vir:10 85 -----------TKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLL 153 (432) T ss_pred -----------cccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccc Confidence 1356666664 8999999999999999999999999999999999999999999999998765433211 Q ss_pred cccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccH Q lcl|NC_021537. 153 REDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPD 232 (602) Q Consensus 153 ~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~sp 232 (602) ... ...++++..+|..+.++++||||+|.+.+.++++|+|| T Consensus 154 ~~~---------------------------------------~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~ 194 (432) T protein:vir:10 154 NSK---------------------------------------TKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPT 194 (432) T ss_pred ccc---------------------------------------ceEEEEEecCCeEEEEccccEEEecCCCCCCCcccccH Confidence 100 01122334456678899999999998888999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-hcccccCcceeccCCccceeccccc Q lcl|NC_021537. 233 WVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-KGSRYRTAILEVEEFVDDHGLGDGG 311 (602) Q Consensus 233 l~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-~g~~nag~~~~~~~g~~~~~~~~~~ 311 (602) +..+..+|....++++++.++|+||+.|+++|++++ .+++++.+++++.|++. .|..|+++++++++|+ T Consensus 195 ~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~--------- 264 (432) T protein:vir:10 195 MEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG-DLNEDAKKVFRENFESMSSGLQNSHRIALMPVGY--------- 264 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CCCHHHHHHHHHHHHHHhcccccCCcceecCCCc--------- Confidence 999999999999999999999999999999999876 58999999999999864 5668999999987654 Q ss_pred cccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_021537. 
312 SDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKII 391 (602) Q Consensus 312 ~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~L 391 (602) ++++++ ++++|+||++++++++++||++|||||++||..++++++|+|++.++|+++||+|+++.||++||++| T Consensus 265 -----~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kL 338 (432) T protein:vir:10 265 -----QFQPIS-LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKL 338 (432) T ss_pred -----eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 556665 45689999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCC Q lcl|NC_021537. 392 HQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGD 471 (602) Q Consensus 392 l~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~ 471 (602) ++..+...+++++|+.+.+++. |.+.+++++++++++|++|+||+|+++||+|+|||+ ..+++.++.++....+... T Consensus 339 l~~~~~~~g~~~~fd~~~l~~~--d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD-~~~~~~n~~~~~~~~~~~~ 415 (432) T protein:vir:10 339 FLDSELDKGFYSKFNVDAILRA--DIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD-RLLVNGNMLPIDMAGQAYL 415 (432) T ss_pred cChhhcCCCcEEEeechhhhcC--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eEeecccccchhhcccccc Confidence 9998888899999999999876 777888999999999999999999999999999865 4555666655532222111 Q ss_pred cCccccccccccccccccc Q lcl|NC_021537. 472 AEAMLTRSKAAPPLENKIG 490 (602) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~ 490 (602) .++....... 
...++.+ T Consensus 416 k~~~~~~~~~--~~~~~~~ 432 (432) T protein:vir:10 416 KGGDTNGEVS--KEGNEGN 432 (432) T ss_pred CCCCCCCCCC--CCCCCCC Confidence 1111111000 0011111 No 10 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=100.00 E-value=6.6e-83 Score=471.12 Aligned_cols=411 Identities=15% Similarity=0.121 Sum_probs=314.8 Q ss_pred CCCCcccccccc----hhhhcccCccccC---CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccch Q lcl|NC_021537. 1 MSKAEETTQLDE----RHIATDVGRGIQP---PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGG 73 (602) Q Consensus 1 ~~k~~~~~~~~~----~~~~~~~~~~i~p---~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~ 73 (602) ..|.+.+..... ..+..-.+ ..+ .++. ..+..+++|++||++||++||++||+++.+.+++.... T Consensus 13 ~~~r~~~~~~~~~~~~~~~~~~~g--~~~~~~~v~~----~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~~-- 84 (432) T protein:vir:10 13 FEKRQTSQVIELNKDDEKLLEWLG--ISPSTISVKG----KNALKVATVFACIKILSESVSKLPLKIYQEDEYGIQRG-- 84 (432) T ss_pred ccccCcccccccCCchHHHHHHhC--CCcCccccch----hhhhccHHHHHHHHHHHHhhccCceEEEEecCCceeec-- Confidence 222222211111 11111111 111 1222 22345789999999999999999999988765442211 Q ss_pred hhHHHHHHhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccc Q lcl|NC_021537. 74 ESYQTVRDFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIE 152 (602) Q Consensus 74 ~~~~~~~~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~ 152 (602) ..|++..++. +||+.||+.+|++.++.+++++||+|++++|+..|++++|+||+|++|++..+..... 
T Consensus 85 -----------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~ 153 (432) T protein:vir:10 85 -----------TKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLL 153 (432) T ss_pred -----------cccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccc Confidence 1356666664 8999999999999999999999999999999999999999999999998765433211 Q ss_pred cccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccH Q lcl|NC_021537. 153 REDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPD 232 (602) Q Consensus 153 ~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~sp 232 (602) ... ...++++..+|..+.++++||||+|.+.+.++++|+|| T Consensus 154 ~~~---------------------------------------~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~ 194 (432) T protein:vir:10 154 NSK---------------------------------------TKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPT 194 (432) T ss_pred ccc---------------------------------------ceEEEEEecCCeEEEEccccEEEecCCCCCCCcccccH Confidence 100 01122334456678899999999998888999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-hcccccCcceeccCCccceeccccc Q lcl|NC_021537. 233 WVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-KGSRYRTAILEVEEFVDDHGLGDGG 311 (602) Q Consensus 233 l~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-~g~~nag~~~~~~~g~~~~~~~~~~ 311 (602) +..+..+|....++++++.++|+||+.|+++|++++ .+++++.+++++.|++. .|..|+++++++++|+ T Consensus 195 ~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~--------- 264 (432) T protein:vir:10 195 MEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG-DLNEDAKKVFRENFESMSSGLQNSHRIALMPVGY--------- 264 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CCCHHHHHHHHHHHHHHhcccccCCcceecCCCc--------- Confidence 999999999999999999999999999999999876 58999999999999864 5668999999987654 Q ss_pred cccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_021537. 
312 SDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKII 391 (602) Q Consensus 312 ~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~L 391 (602) ++++++ ++++|+||++++++++++||++|||||++||..++++++|+|++.++|+++||+|+++.||++||++| T Consensus 265 -----~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kL 338 (432) T protein:vir:10 265 -----QFQPIS-LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKL 338 (432) T ss_pred -----eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 556665 45689999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCC Q lcl|NC_021537. 392 HQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGD 471 (602) Q Consensus 392 l~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~ 471 (602) ++..+...+++++|+.+.+++. |.+.+++++++++++|++|+||+|+++||+|+|||+ ..+++.++.++....+... T Consensus 339 l~~~~~~~g~~~~fd~~~l~~~--d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD-~~~~~~n~~~~~~~~~~~~ 415 (432) T protein:vir:10 339 FLDSELDKGFYSKFNVDAILRA--DIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD-RLLVNGNMLPIDMAGQAYL 415 (432) T ss_pred cChhhcCCCcEEEeechhhhcC--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eEeecccccchhhcccccc Confidence 9998888899999999999876 777888999999999999999999999999999865 4555666655532222111 Q ss_pred cCccccccccccccccccc Q lcl|NC_021537. 472 AEAMLTRSKAAPPLENKIG 490 (602) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~ 490 (602) .++....... 
...++.+ T Consensus 416 k~~~~~~~~~--~~~~~~~ 432 (432) T protein:vir:10 416 KGGDTNGEVS--KEGNEGN 432 (432) T ss_pred CCCCCCCCCC--CCCCCCC Confidence 1111111000 0011111 No 11 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=100.00 E-value=6.6e-83 Score=471.12 Aligned_cols=411 Identities=15% Similarity=0.121 Sum_probs=314.8 Q ss_pred CCCCcccccccc----hhhhcccCccccC---CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccch Q lcl|NC_021537. 1 MSKAEETTQLDE----RHIATDVGRGIQP---PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGG 73 (602) Q Consensus 1 ~~k~~~~~~~~~----~~~~~~~~~~i~p---~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~ 73 (602) ..|.+.+..... ..+..-.+ ..+ .++. ..+..+++|++||++||++||++||+++.+.+++.... T Consensus 13 ~~~r~~~~~~~~~~~~~~~~~~~g--~~~~~~~v~~----~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~~-- 84 (432) T protein:vir:10 13 FEKRQTSQVIELNKDDEKLLEWLG--ISPSTISVKG----KNALKVATVFACIKILSESVSKLPLKIYQEDEYGIQRG-- 84 (432) T ss_pred ccccCcccccccCCchHHHHHHhC--CCcCccccch----hhhhccHHHHHHHHHHHHhhccCceEEEEecCCceeec-- Confidence 222222211111 11111111 111 1222 22345789999999999999999999988765442211 Q ss_pred hhHHHHHHhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccc Q lcl|NC_021537. 74 ESYQTVRDFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIE 152 (602) Q Consensus 74 ~~~~~~~~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~ 152 (602) ..|++..++. +||+.||+.+|++.++.+++++||+|++++|+..|++++|+||+|++|++..+..... 
T Consensus 85 -----------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~ 153 (432) T protein:vir:10 85 -----------TKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLL 153 (432) T ss_pred -----------cccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccc Confidence 1356666664 8999999999999999999999999999999999999999999999998765433211 Q ss_pred cccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccH Q lcl|NC_021537. 153 REDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPD 232 (602) Q Consensus 153 ~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~sp 232 (602) ... ...++++..+|..+.++++||||+|.+.+.++++|+|| T Consensus 154 ~~~---------------------------------------~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~ 194 (432) T protein:vir:10 154 NSK---------------------------------------TKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPT 194 (432) T ss_pred ccc---------------------------------------ceEEEEEecCCeEEEEccccEEEecCCCCCCCcccccH Confidence 100 01122334456678899999999998888999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-hcccccCcceeccCCccceeccccc Q lcl|NC_021537. 233 WVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-KGSRYRTAILEVEEFVDDHGLGDGG 311 (602) Q Consensus 233 l~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-~g~~nag~~~~~~~g~~~~~~~~~~ 311 (602) +..+..+|....++++++.++|+||+.|+++|++++ .+++++.+++++.|++. .|..|+++++++++|+ T Consensus 195 ~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~--------- 264 (432) T protein:vir:10 195 MEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG-DLNEDAKKVFRENFESMSSGLQNSHRIALMPVGY--------- 264 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CCCHHHHHHHHHHHHHHhcccccCCcceecCCCc--------- Confidence 999999999999999999999999999999999876 58999999999999864 5668999999987654 Q ss_pred cccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_021537. 
312 SDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKII 391 (602) Q Consensus 312 ~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~L 391 (602) ++++++ ++++|+||++++++++++||++|||||++||..++++++|+|++.++|+++||+|+++.||++||++| T Consensus 265 -----~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kL 338 (432) T protein:vir:10 265 -----QFQPIS-LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKL 338 (432) T ss_pred -----eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 556665 45689999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCC Q lcl|NC_021537. 392 HQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGD 471 (602) Q Consensus 392 l~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~ 471 (602) ++..+...+++++|+.+.+++. |.+.+++++++++++|++|+||+|+++||+|+|||+ ..+++.++.++....+... T Consensus 339 l~~~~~~~g~~~~fd~~~l~~~--d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD-~~~~~~n~~~~~~~~~~~~ 415 (432) T protein:vir:10 339 FLDSELDKGFYSKFNVDAILRA--DIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD-RLLVNGNMLPIDMAGQAYL 415 (432) T ss_pred cChhhcCCCcEEEeechhhhcC--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eEeecccccchhhcccccc Confidence 9998888899999999999876 777888999999999999999999999999999865 4555666655532222111 Q ss_pred cCccccccccccccccccc Q lcl|NC_021537. 472 AEAMLTRSKAAPPLENKIG 490 (602) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~ 490 (602) .++....... 
...++.+ T Consensus 416 k~~~~~~~~~--~~~~~~~ 432 (432) T protein:vir:10 416 KGGDTNGEVS--KEGNEGN 432 (432) T ss_pred CCCCCCCCCC--CCCCCCC Confidence 1111111000 0011111 No 12 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=100.00 E-value=2.5e-82 Score=467.91 Aligned_cols=391 Identities=15% Similarity=0.103 Sum_probs=301.1 Q ss_pred CCCCcccccccchhhhcccCccccC------------------CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQP------------------PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVA 62 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p------------------~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~ 62 (602) ++...+++..-...+.+..|..+.. .++. .-+..+++|++||++||++||++||+++. T Consensus 17 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~----~~al~~~~V~~ci~~Ia~~iA~lp~~v~~ 92 (431) T protein:vir:10 17 ARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRE----TRALRNMAVLRCVTLISGTIGMLPMNLIS 92 (431) T ss_pred cccccccccccccccccccccccccccchHHHHhhccCccCcceech----hhhhccHHHHHHHHHHHHhhccCceEEEE Confidence 0000000000000011101111110 1222 12234789999999999999999999987 Q ss_pred ecCCCCcccchhhHHHHHHhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccc Q lcl|NC_021537. 63 HPSADEPDEGGESYQTVRDFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAAT 141 (602) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~ 141 (602) +++... ....|+.+.++. +||++||+.+||+.++.+++++||+|++++|+. 
|.+++|+|++|.+ T Consensus 93 ~~~~~~--------------~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~L~pl~~~~ 157 (431) T protein:vir:10 93 SDDSKQ--------------VLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG-NRPIRLIPMDRGS 157 (431) T ss_pred ecCcee--------------eeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CceEEEEEEcCce Confidence 643221 112366666664 899999999999999999999999999999985 8999999999999 Q ss_pred ccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCC Q lcl|NC_021537. 142 VRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNP 221 (602) Q Consensus 142 v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~ 221 (602) |++..+.. +..+ +++...+|....++++||||+|++ T Consensus 158 v~~~~~~~---------------~~~~-----------------------------y~~~~~~g~~~~~~~~dViHir~~ 193 (431) T protein:vir:10 158 AKGRLTST---------------WQIV-----------------------------YDYTTPTGDKIELPAREVFHLRDL 193 (431) T ss_pred eEEEEcCC---------------CeEE-----------------------------EEEEeCCceEEEEchhhEEEecCc Confidence 98643221 1111 122334566778999999999987 Q ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-hhcccccCcceeccC Q lcl|NC_021537. 
222 SPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-LKGSRYRTAILEVEE 300 (602) Q Consensus 222 ~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~ 300 (602) + .++++|+||+..+..+|..+.++++++.++|+||++|++||++++ .+++++.+++++.|++ +.|.+|+|+++++++ T Consensus 194 ~-~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~ 271 (431) T protein:vir:10 194 S-IDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPK-ELSDNAYGRMKASVQENHTGSENAGSWMLLEE 271 (431) T ss_pred C-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC-CCCHHHHHHHHHHHHHHhcCccccCCceecCC Confidence 6 678999999999999999999999999999999999999999986 5899999999999976 566789999999877 Q ss_pred CccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 301 FVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQ 380 (602) Q Consensus 301 g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~ 380 (602) |+ +|++++ ++++|+||+|++++++++||++|||||++||+.+++++||+|++.+.|+++||.||+ T Consensus 272 g~--------------~~~~l~-~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~ 336 (431) T protein:vir:10 272 GA--------------TAKQFS-NTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWGSGIEQLAIFFIQYGLSHWF 336 (431) T ss_pred Cc--------------eEEEcc-CChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHH Confidence 65 455565 456899999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCC----cccHHHHHHHhCCCCCCCCcccccc Q lcl|NC_021537. 381 AKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAG----VGTVNEAREELDLAPFEDDRGDMTL 456 (602) Q Consensus 381 ~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G----~~T~NE~R~~~Gl~p~~~g~~d~~~ 456 (602) +.||++||++|+++.+. .+++|+||++.+++. |.+.+++++++++.+| +||+||+|+++||||++++++|++. 
T Consensus 337 ~~ie~~ln~~Ll~~~~~-~~~~~~fd~~~llr~--d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~gD~~~ 413 (431) T protein:vir:10 337 VSWEQAAARAFLPEKML-GQRQFKFNEGALLRG--TLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPVADQLR 413 (431) T ss_pred HHHHHHHHhhccChhhc-CCceEEEechhhhcc--CHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCcccccee Confidence 99999999999987655 578999999999876 7788888888888655 5999999999999999998888876 Q ss_pred ccccccccccccCCCcCcccc Q lcl|NC_021537. 457 SEFEAEFGADASDGDAEAMLT 477 (602) Q Consensus 457 ~~~~~~~~~~~~~~~~~~~~~ 477 (602) .+.+.....+... +...+ T Consensus 414 ~p~n~~~~~~~~~---~p~~~ 431 (431) T protein:vir:10 414 NPMTQKQKGSGDE---PPATT 431 (431) T ss_pred cccccccCCCCCC---CCCCC Confidence 6554332211111 11111 No 13 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=100.00 E-value=2.4e-82 Score=468.04 Aligned_cols=412 Identities=16% Similarity=0.144 Sum_probs=307.2 Q ss_pred CCCCcccccccchhhhcccCccccCCCCH---HH---------------HHHHHhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNP---ET---------------LAAFQELNETHQACIRKKSRYEAGYGFEIVA 62 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~---~~---------------l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~ 62 (602) |||..+..-. ++......|+.+|+++ .. -.+-+..+++|++||++||++||++||+++. T Consensus 1 ~~~~~~~~~~---~~~~~~~~~~g~~~s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~~~~~ 77 (437) T protein:vir:10 1 MKQGKQRALG---RIKSSFLKWLGVPISLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPLNLYQ 77 (437) T ss_pred CCcchhhhhh---hhHHhhhhhcCCcccCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCceeEEE Confidence 6544432111 1111112222222221 00 0123445789999999999999999999998 Q ss_pred ecCCCCcccchhhHHHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccc Q lcl|NC_021537. 
63 HPSADEPDEGGESYQTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAAT 141 (602) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~ 141 (602) +.+++.... ...|+++.++ .+||+.||+.+||+.++.+++++||+|++++|+. |++++|+||+|.. T Consensus 78 ~~~~g~~~~------------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-g~~~~L~~l~p~~ 144 (437) T protein:vir:10 78 TKPDGTRVL------------AKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA-GVLIGLELMLPQR 144 (437) T ss_pred EcCCCceee------------ccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEEEEEEEcCcc Confidence 765543221 1235555555 5899999999999999999999999999999994 9999999999999 Q ss_pred ccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCC Q lcl|NC_021537. 142 VRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNP 221 (602) Q Consensus 142 v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~ 221 (602) |++..+..+ ..+ +++...+|....++++||||+|++ T Consensus 145 v~i~~~~~g---------------~~~-----------------------------y~~~~~~g~~~~~~~~dIih~r~~ 180 (437) T protein:vir:10 145 TTVKRLTSG---------------ALQ-----------------------------YTYRNVDGTVSTLAEDDVFHVRGF 180 (437) T ss_pred eEEEECCCC---------------eEE-----------------------------EEEEecCceEEEEccccEEEecCc Confidence 986543211 111 112233456678999999999988 Q ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-hhcccccCcceeccC Q lcl|NC_021537. 
222 SPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-LKGSRYRTAILEVEE 300 (602) Q Consensus 222 ~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~ 300 (602) + .++++|+||+..+..+|....++++++.++|+||++|++||++++ .+++++.+++++.|++ +.|..|+|+++++++ T Consensus 181 ~-~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~nag~~~vl~~ 258 (437) T protein:vir:10 181 S-LDGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQ-ILQKEKRAEIRTDLAEQFGGAMQAGKTMVLEA 258 (437) T ss_pred C-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CCCHHHHHHHHHHHHHHhcCccccCcceeccC Confidence 6 688999999999999999999999999999999999999999875 5899999999999976 567789999999877 Q ss_pred CccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCc--cCHHHHHHHHHHHHHHH Q lcl|NC_021537. 301 FVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNR--ANSKEQTREFAKGIIEP 378 (602) Q Consensus 301 g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~~~f~~~~l~P 378 (602) |++ |++++ +++.|+||+|++++++++||++|||||++||+.+.+++ +|++++.+.|+++||+| T Consensus 259 g~~--------------~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~tl~P 323 (437) T protein:vir:10 259 GMK--------------YQAIT-MNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLGFLTFTLRP 323 (437) T ss_pred Cce--------------EEecc-CChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHH Confidence 654 55554 45689999999999999999999999999999877654 89999999999999999 Q ss_pred HHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccccc Q lcl|NC_021537. 379 EQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSE 458 (602) Q Consensus 379 ~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~ 458 (602) |+..||++||++|+++.+. .+++|+||++.+++. |.+.+++++++++++|+||+||+|+++||+|++||+....++. 
T Consensus 324 ~~~~ie~~l~~kll~~~e~-~~~~~~fd~~~ll~~--d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~~~~~~~~ 400 (437) T protein:vir:10 324 WLTRIEQAARRSLLRPGER-DQFYAEFSVEGLLRA--DSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNAAVLTVQS 400 (437) T ss_pred HHHHHHHHHHhhccCcccc-CceEEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcceEeecC Confidence 9999999999999988766 457899999999876 7788889999999999999999999999999998764344455 Q ss_pred ccccccccccCCCcC---ccccccccccccccccccc Q lcl|NC_021537. 459 FEAEFGADASDGDAE---AMLTRSKAAPPLENKIGER 492 (602) Q Consensus 459 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~ 492 (602) ++.++....+..... ..........+......|+ T Consensus 401 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 401 ALLPIDKLGEHTTATAAQDALKAWLYQEEKTRATQER 437 (437) T ss_pred cccchhhccCcCCCcchhccccccCCCCCCCCccccC Confidence 554432111110000 0000011111111111111 No 14 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=100.00 E-value=4.5e-82 Score=466.56 Aligned_cols=412 Identities=15% Similarity=0.114 Sum_probs=313.6 Q ss_pred CCCCccc--ccc--cchhhhcccCccccC--CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEET--TQL--DERHIATDVGRGIQP--PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~--~~~--~~~~~~~~~~~~i~p--~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) ..|...+ .-+ ....+..-.| .-.+ .++. ..+..+++|++||++||++||++||+++.+.+.+... 
T Consensus 10 ~~~r~~~~~~~~~~~~~~~~~~~g-~~~~~~~v~~----~~al~~~~v~~~i~~ia~~ia~l~~~~~~~~~~~~~~---- 80 (429) T protein:vir:10 10 FEKRQTSQVIELNKDDEKLLEWLG-ISPSTISVKG----KNALKVATVFACIKILSESVSKLPLKIYQEDEYGIQR---- 80 (429) T ss_pred ccccCcccccccCCChHHHHHHhc-CCCCcceech----hhhhccHHHHHHHHHHHHhhccCceEEEEecCCceee---- Confidence 1121111 110 1111111111 1111 1222 2234578999999999999999999998875544221 Q ss_pred hHHHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccccc Q lcl|NC_021537. 75 SYQTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIER 153 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~ 153 (602) ...|++..++ .+||+.||+.+||+.++.+++++||+|++++|+..|++++|+||+|++|++..+...... T Consensus 81 ---------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~ 151 (429) T protein:vir:10 81 ---------GTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLN 151 (429) T ss_pred ---------ccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccc Confidence 1135566655 489999999999999999999999999999999999999999999999987554332111 Q ss_pred ccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHH Q lcl|NC_021537. 154 EDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDW 233 (602) Q Consensus 154 ~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl 233 (602) .. ...++++..+|..+.++++||||+|...+.++++|+||+ T Consensus 152 ~~---------------------------------------~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i 192 (429) T protein:vir:10 152 SK---------------------------------------TKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTM 192 (429) T ss_pred cc---------------------------------------ceEEEEEccCCeEEEEccccEEEecCCCCCCCcccccHH Confidence 00 011223344566788999999999998889999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-hcccccCcceeccCCccceecccccc Q lcl|NC_021537. 
234 VAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-KGSRYRTAILEVEEFVDDHGLGDGGS 312 (602) Q Consensus 234 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-~g~~nag~~~~~~~g~~~~~~~~~~~ 312 (602) ..+..++..+.++++++.++|+||++|+++|++++ .+++++.+++++.|++. .|..|+++++++++|+ T Consensus 193 ~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~-~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~---------- 261 (429) T protein:vir:10 193 EYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG-DLNEDAKKVFRENFESMSSGLQNSHRIALMPVGY---------- 261 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CCCHHHHHHHHHHHHHHhccccccCceeecCCCc---------- Confidence 99999999999999999999999999999999876 58999999999999764 5668999999987665 Q ss_pred ccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021537. 313 DVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIH 392 (602) Q Consensus 313 ~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll 392 (602) ++++++ +++.|+||+|++++++++||++|||||.+||..++++++|++++.+.|++.||+|+++.|+++||++|+ T Consensus 262 ----~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~ 336 (429) T protein:vir:10 262 ----QFQPIS-LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLF 336 (429) T ss_pred ----eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 455565 456899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCc Q lcl|NC_021537. 393 QDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDA 472 (602) Q Consensus 393 ~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~ 472 (602) +..+...+++++|+.+.+++. |.+.+++++++++++|+||+||+|+++||||+|+|+ ..+++.++.++....+.... 
T Consensus 337 ~~~~~~~g~~~~fd~~~ll~~--d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD-~~~~~~n~~~~d~~~~~~~k 413 (429) T protein:vir:10 337 LDSELDKGFYSKFNVDAILRA--DIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD-RLLVNGNMLPIDMAGQAYLK 413 (429) T ss_pred ChhhcCCCcEEEeechhhhcC--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-eeeecccccchhhccccccC Confidence 998888899999999999876 777788999999999999999999999999998754 44556666554322111111 Q ss_pred Cccccccccccccccc Q lcl|NC_021537. 473 EAMLTRSKAAPPLENK 488 (602) Q Consensus 473 ~~~~~~~~~~~~~~~~ 488 (602) ++........+..++. T Consensus 414 ~g~~~~~~~~~~~e~~ 429 (429) T protein:vir:10 414 GGDTNGEVSKEGNEGN 429 (429) T ss_pred CCCCCCCCCCCCCCCC Confidence 1111111111111111 No 15 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=100.00 E-value=3.9e-82 Score=466.91 Aligned_cols=408 Identities=17% Similarity=0.147 Sum_probs=306.4 Q ss_pred CCCCcc------------cccccchhhhcccCcc---ccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021537. 1 MSKAEE------------TTQLDERHIATDVGRG---IQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPS 65 (602) Q Consensus 1 ~~k~~~------------~~~~~~~~~~~~~~~~---i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~ 65 (602) ++++-. +..+.+..+-....+. ....+++. -+-.++.|++||++||++||++||+++.+.. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~----~al~~~~V~~~i~~ia~~ia~lp~~~~~~~~ 84 (434) T protein:vir:43 9 LSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRESSSGKKVTVD----KAMKLSAVWACVRLISTSVAGLPLGVYERKA 84 (434) T ss_pred hhhcccccchhhhcccccccccCchHHHHHHhcCCccCCceechh----hhhccHHHHHHHHHHHHhhhhCceEEEEEcC Confidence 221111 1111111111101000 00112222 2334688999999999999999999998766 Q ss_pred CCCcccchhhHHHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccc Q lcl|NC_021537. 
66 ADEPDEGGESYQTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRV 144 (602) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~ 144 (602) ++..... ..|+++.++ .+||++||+.+||+.++.+++++||+|+++.++ .|++++|+||+|.+|++ T Consensus 85 ~g~~~~~------------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p~~v~~ 151 (434) T protein:vir:43 85 DGSRVDA------------RSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPAALDFLLPSRVDL 151 (434) T ss_pred CCccccc------------cccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEE Confidence 5432211 235666666 579999999999999999999999999998877 69999999999999986 Q ss_pred cccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCC Q lcl|NC_021537. 145 RKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPL 224 (602) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~ 224 (602) ..+..+ ..+ +++...+|..+.++++||||+|++ +. T Consensus 152 ~~~~~g---------------~~~-----------------------------y~~~~~~g~~~~~~~~eVih~~~~-~~ 186 (434) T protein:vir:43 152 ECDENG---------------RLK-----------------------------YFYTTKKGARREIERTNMLHIPAF-TL 186 (434) T ss_pred EEcCCC---------------eEE-----------------------------EEEEecCceEEEEccccEEEecCc-CC Confidence 543211 111 122334566788999999999987 47 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccc Q lcl|NC_021537. 
225 ALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDD 304 (602) Q Consensus 225 ~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~ 304 (602) ++++|+||+..+..+|....++++++.++|+||++|+++|++++ .+++++.++++++|+++.|+.|+|+++++++|+ T Consensus 187 dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~r~~~~~~~g~~nag~~~vl~~g~-- 263 (434) T protein:vir:43 187 DGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDR-ILQPAQREEFREYVKSVSGAMNSGRSPVLEQGI-- 263 (434) T ss_pred CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCC-CCCHHHHHHHHHHHHHhcCccccCCccccCCCc-- Confidence 88999999999999999999999999999999999999999976 589999999999999999999999999987665 Q ss_pred eeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCC--ccCHHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 305 HGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSN--RANSKEQTREFAKGIIEPEQAK 382 (602) Q Consensus 305 ~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~sn~e~~~~~f~~~~l~P~~~~ 382 (602) +|++++ ++++|+||+|++++++++||++|||||++||+.+.++ +||++++...|+++||.||+.+ T Consensus 264 ------------~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~f~~~~L~P~~~~ 330 (434) T protein:vir:43 264 ------------TPETIG-INPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLAFLTFSISSITNQ 330 (434) T ss_pred ------------eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHHHHHHHHHHHHHH Confidence 455554 4568999999999999999999999999999877554 8999999999999999999999 Q ss_pred HHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccccccccc Q lcl|NC_021537. 383 FSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAE 462 (602) Q Consensus 383 ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~ 462 (602) ||++||++|++..+. .+++++||++++++. 
|.+.+++++.+++++|+||+||+|+++||+|+|||+ ..+++.++++ T Consensus 331 ie~~ln~kL~~~~~~-~~~~~~fd~~~llr~--d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD-~~~~~~n~~~ 406 (434) T protein:vir:43 331 IQQCVNKRLLTAPER-IRYYAEFSLEGFLKA--DSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPGGD-ILTVQSNLVP 406 (434) T ss_pred HHHHHHhhcCChhhh-cCceEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eEeeccCccc Confidence 999999999998764 578999999999877 778889999999999999999999999999998864 4455666655 Q ss_pred ccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 463 FGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) +....+....... .........+....| T Consensus 407 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 407 IDQLGQSNKSQAV-RAALMNWFSQPEPQE 434 (434) T ss_pred hhhhhccCCCcch-hhhhhccCCCCCCCC Confidence 4322111111110 000000000011111 No 16 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=100.00 E-value=9.2e-82 Score=464.85 Aligned_cols=399 Identities=14% Similarity=0.135 Sum_probs=312.2 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) ..+.+.+..+.+..+..-.++ +.+++. ... .++.|++||++||++||++||+++++.+++.... 
T Consensus 11 ~~~~~~~~~~~~~~~~~~~g~---~~~~~~---~al-~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~--------- 74 (411) T protein:vir:81 11 FRPRNETVDMTNPLLLQWLGV---DPDTPR---NQL-SEATYFACLKILSESLGKLPLKMYQKTERGIVKS--------- 74 (411) T ss_pred ccCcccccccchHHHHHHhcC---cccChh---hhh-ccHHHHHHHHHHHHhHhhCceeEEEecCCceeee--------- Confidence 333333333333232221111 123322 222 3688999999999999999999998766543211 Q ss_pred Hhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhh Q lcl|NC_021537. 81 DFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEV 159 (602) Q Consensus 81 ~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~ 159 (602) ..|+++.++. +||+.||+.+||+.++.+++++||||++++|+ .|++.+|+||+|+.|++..+........ T Consensus 75 ----~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~---- 145 (411) T protein:vir:81 75 ----DREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-GPQLQALWILPSQYVTIVVDDRGLLGEK---- 145 (411) T ss_pred ----cccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCceEEEEEECCceEEEEEcCccccccc---- Confidence 1355665554 79999999999999999999999999999998 5999999999999998765433211100 Q ss_pred hhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHH Q lcl|NC_021537. 160 ENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQT 239 (602) Q Consensus 160 ~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~ 239 (602) ...+++ +....+|....++++||||+|.+.+.++++|+||+..+..+ T Consensus 146 -----~~~~~~----------------------------~~~~~~g~~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~~~ 192 (411) T protein:vir:81 146 -----NAIWYR----------------------------YNDPYDGKMYVFRNDEILHFKTSVTFDGITGLSVRDVLKHT 192 (411) T ss_pred -----ceEEEE----------------------------EEecCCceEEEEccccEEEEcCCCCCCCcccccHHHHHHHH Confidence 000111 01123456678999999999988888999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-hcccccCcceeccCCccceecccccccccccc Q lcl|NC_021537. 
240 MGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-KGSRYRTAILEVEEFVDDHGLGDGGSDVNIEL 318 (602) Q Consensus 240 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~ 318 (602) +....++++++.++|+||++|+++|++++ .+++++.+++++.|++. .|.+|+|+++++++|+ +| T Consensus 193 i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~--------------~~ 257 (411) T protein:vir:81 193 VDGALESQKFMNNLYKTGLTGKAVLEYTG-DLNQEARDRLVKGFEQFANGSKNAGKIIPVPLGM--------------KL 257 (411) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCC-CCCHHHHHHHHHHHHHHhcCccccCCceecCCCc--------------eE Confidence 99999999999999999999999999875 58999999999999875 5668999999987665 45 Q ss_pred ccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccc Q lcl|NC_021537. 319 EPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDV 398 (602) Q Consensus 319 ~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~ 398 (602) ++++ ++++|+||+|++++++++||++|||||++||+.++++++|+|++...|+++||.|+++.||++||++|++..+.. T Consensus 258 ~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~ 336 (411) T protein:vir:81 258 VPLD-IKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNLAFYVDTLLYVLKQYEEEITYKILSNDLIS 336 (411) T ss_pred EEcc-CCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhcC Confidence 5664 456899999999999999999999999999999999999999999999999999999999999999999998888 Q ss_pred cceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCcccc Q lcl|NC_021537. 399 DEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLT 477 (602) Q Consensus 399 ~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~ 477 (602) .+++++||.+.+++. |.+.+++++++++++|+||+||+|+++|+||+|+|+ ..+++.+++++....++...++. 
+ T Consensus 337 ~~~~~~fd~~~ll~~--d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~ggD-~~~~~~n~~pl~~~~~~~~kgGd-~ 411 (411) T protein:vir:81 337 QGHYFKFNVNVILRA--DIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYGN-NLMANGNYIPLSMLGANYGKGGD-S 411 (411) T ss_pred CCcEEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eeeeccCccchhhhhhhhccCCC-C Confidence 899999999999876 777888999999999999999999999999998764 34456666655322111111110 0 No 17 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=100.00 E-value=1.7e-81 Score=463.46 Aligned_cols=405 Identities=14% Similarity=0.123 Sum_probs=306.3 Q ss_pred CCCCcccccccchhhhcccCcc-ccC-----CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRG-IQP-----PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~-i~p-----~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) +-|+...+-+.......-.++. ..+ .+++. .+..++.|++||++||++||++||+++.+.+++.... T Consensus 6 ~~~~~~~~~s~~~~w~~~~~~~~~~~~~~g~~vt~~----~al~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~~~~--- 78 (421) T protein:vir:10 6 MFEGKKRSVSGGGFWEAMLGGVRSSHSKAGVMITPE----TALALSAVRACVTLLAESVAQLPVELYRRDKNGGRQR--- 78 (421) T ss_pred hhcccccccCcchhhHHHhhhhccCcccCCceechH----HhhccHHHHHHHHHHHHhhccCceEEEEEcCCCceee--- Confidence 3333322222222211111111 111 13333 2445788999999999999999999998766554321 Q ss_pred hHHHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccccc Q lcl|NC_021537. 75 SYQTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIER 153 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~ 153 (602) ...|+++.++ .+||++||+.+||+.++.+++++||||++++|+.+|+|.+||||+|+.|++..+.. 
T Consensus 79 ---------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~---- 145 (421) T protein:vir:10 79 ---------ATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPD---- 145 (421) T ss_pred ---------cccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCC---- Confidence 1135555555 57999999999999999999999999999999999999999999999998754321 Q ss_pred ccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHH Q lcl|NC_021537. 154 EDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDW 233 (602) Q Consensus 154 ~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl 233 (602) +..|+++. ..| .++++++|||+++++ .++++|+||+ T Consensus 146 -----------g~~~y~~~------------------------------~~g--~~~~~~eiih~~~~~-~d~~~G~spi 181 (421) T protein:vir:10 146 -----------GMPYYEIP------------------------------EIG--ETLPMRMMHHVKVFS-LDGYIGSSPI 181 (421) T ss_pred -----------ceEEEEEc------------------------------CCC--cEEchhhEEEecCcC-CCCcccccHH Confidence 22222221 111 257899999999876 6889999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc---cCCHHHHHHHHHHHHHh-hcccccCcceeccCCccceeccc Q lcl|NC_021537. 234 VAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG---TLSEDSKEDLRNLMDNL-KGSRYRTAILEVEEFVDDHGLGD 309 (602) Q Consensus 234 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~---~~~~~~~~~l~~~~~~~-~g~~nag~~~~~~~g~~~~~~~~ 309 (602) ..+..+|....++++++.++|+||++|+|+|++++. .+++++.+++++.|++. .|..|+++++++++|++ T Consensus 182 ~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~------ 255 (421) T protein:vir:10 182 QTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQEGMS------ 255 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecCCCce------ Confidence 999999999999999999999999999999998753 35899999999999765 56689999999877654 Q ss_pred cccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021537. 
310 GGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYK 389 (602) Q Consensus 310 ~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~ 389 (602) +++++ ++++|+||+|++++++++||++|||||++||+.+.+|++|+|++.+.|+++||+|++++||++||+ T Consensus 256 --------~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~tl~P~~~~ie~~ln~ 326 (421) T protein:vir:10 256 --------YKQMS-QDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEHQGLQFVMYTLLAWLKRHEGALQR 326 (421) T ss_pred --------EEecC-CChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 55555 456899999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccC Q lcl|NC_021537. 390 IIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASD 469 (602) Q Consensus 390 ~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~ 469 (602) +|+++.+. .+++++|+.+.+++. |.+.+++++++++++|+||+||+|+++|+||++||+ ..+++.+++..+....+ T Consensus 327 kL~~~~~~-~~~~v~fd~~~l~~~--d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD-~~~~~~n~~~~~~~~~~ 402 (421) T protein:vir:10 327 DLLLPSER-RDLYIEFNVSGLLRG--DQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGD-KYLTPLNMVDSAQIIPG 402 (421) T ss_pred hccCcccc-CCeEEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-eeeeccccccccccccC Confidence 99998775 588999999999876 778888999999999999999999999999998764 33445555444322221 Q ss_pred CCcCcccccccccccccccccccccccc Q lcl|NC_021537. 470 GDAEAMLTRSKAAPPLENKIGERDSVDV 497 (602) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (602) +..+. ...+++.++.. 
.++ T Consensus 403 ~~~~~-----~~~~~e~d~~~----~~~ 421 (421) T protein:vir:10 403 DKKPT-----AQQMAEIDTIL----SRT 421 (421) T ss_pred CCCcc-----cccCccccccc----ccC Confidence 11110 00111111111 111 No 18 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=100.00 E-value=1.9e-81 Score=463.12 Aligned_cols=400 Identities=14% Similarity=0.112 Sum_probs=309.8 Q ss_pred CCCCcccccccchhhhcccCc-cccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGR-GIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTV 79 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~-~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~ 79 (602) -++..++.....-.+..-.+. .....++.. -+..++.|++||++||++||++||+++.+.+... . T Consensus 7 ~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~----~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~-~--------- 72 (409) T protein:vir:10 7 FKNQSQEISIDDKKILEWLGINPSETYVNGK----SCLKQATVFGCIRILSDNISKLPIKIYQKKDGIK-R--------- 72 (409) T ss_pred ccCcCCCCCCChHHHHHHhcCCcCcceechh----hhhccHHHHHHHHHHHHhhhhCceEEEEecCCee-e--------- Confidence 122121211122222211111 111123332 2345788999999999999999999987533211 1 Q ss_pred HHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchh Q lcl|NC_021537. 80 RDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEE 158 (602) Q Consensus 80 ~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~ 158 (602) ...|++..++ .+||+.||+.+||+.++.+++++||||++++|+..|++++|+||+|++|++..+..+..... 
T Consensus 73 ----~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~--- 145 (409) T protein:vir:10 73 ----VPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSE--- 145 (409) T ss_pred ----ccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCcccccc--- Confidence 1125555555 58999999999999999999999999999999999999999999999998765433211100 Q ss_pred hhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHH Q lcl|NC_021537. 159 VENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQ 238 (602) Q Consensus 159 ~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~ 238 (602) ....+.+....|....++++||||+|.++ .++++|+||+..+.. T Consensus 146 -----------------------------------~~~~y~~~~~~g~~~~~~~~evih~r~~~-~d~~~G~s~i~~~~~ 189 (409) T protein:vir:10 146 -----------------------------------NNVWYLYTDDLGQRHKFMSDEILHFKGLT-ADGLAGLSVIELLNH 189 (409) T ss_pred -----------------------------------ceEEEEEEeCCceeEEeccccEEEecCcC-CCCcccccHHHHHHH Confidence 00112233445667789999999999887 567999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-hcccccCcceeccCCccceeccccccccccc Q lcl|NC_021537. 239 TMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-KGSRYRTAILEVEEFVDDHGLGDGGSDVNIE 317 (602) Q Consensus 239 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~ 317 (602) ++....++++++.++|+||++|++||++++ .+++++.+++++.|++. .|..|+|+++++++|+ + T Consensus 190 ~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~--------------~ 254 (409) T protein:vir:10 190 LIENGKSSETYLNNFFKNGLQVKGLVQYAG-DLNPEAEEVFKENFERMSSGLKNAHRIAMLPIGY--------------K 254 (409) T ss_pred HHHHHHHHHHHHHHHHhccCCCcEEEEcCC-CCCHHHHHHHHHHHHHHhccccccCCceecCCCc--------------e Confidence 999999999999999999999999999876 58999999999999875 4668899999987665 4 Q ss_pred cccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccc Q lcl|NC_021537. 
318 LEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALD 397 (602) Q Consensus 318 ~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~ 397 (602) +++++ +++.|+||+|++++++++||++|||||.+||..++++++|++++.+.|+++||+|+++.||++||++|++..+. T Consensus 255 ~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~kL~~~~~~ 333 (409) T protein:vir:10 255 FEPIS-QKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNREFYIDTLQSILNMYELEINYKLFLISEI 333 (409) T ss_pred EEEcc-CChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCchhc Confidence 55554 45689999999999999999999999999999999999999999999999999999999999999999998887 Q ss_pred ccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCccc Q lcl|NC_021537. 398 VDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAML 476 (602) Q Consensus 398 ~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~ 476 (602) ..+++++|+.+++++. |.+.+++++.+++++|+||+||+|+++|+||+|||+ ..+++.+++++....+....++.. T Consensus 334 ~~~~~~~fd~~~ll~~--d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD-~~~~~~n~~~~~~~~~~~~kgGe~ 409 (409) T protein:vir:10 334 KNGFYSKFNVDTILRA--DIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGD-VLLINGNMIPVKMAGEQYSKGGEK 409 (409) T ss_pred cCCcEEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-eeeeccCccchhhccccccccCCC Confidence 7889999999999876 777888999999999999999999999999999874 445566665543222111111110 No 19 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=100.00 E-value=1.9e-81 Score=463.09 Aligned_cols=403 Identities=15% Similarity=0.151 Sum_probs=298.6 Q ss_pred CCCCc----ccccccc--hhhhcccCcccc---CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCccc Q lcl|NC_021537. 
1 MSKAE----ETTQLDE--RHIATDVGRGIQ---PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDE 71 (602) Q Consensus 1 ~~k~~----~~~~~~~--~~~~~~~~~~i~---p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~ 71 (602) ..+.+ ...++.. .......+.... ..++. ..+..+++|++||++||++||++||+++.+.+++... T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~----~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~~g~~~- 91 (432) T protein:vir:10 17 FVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNA----DAIMRLDAVAACVKLVSQAIAAMPLTMYMRTPDGRKE- 91 (432) T ss_pred cCCccccccccccccccCcchhhhhcccccccCcccch----hhhhcchHHHHHHHHHHHhhhhCceeEEEecCCCccc- Confidence 11111 1111100 001011111000 11222 2244568999999999999999999998876544321 Q ss_pred chhhHHHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 72 GGESYQTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) ...|+++.++ .+||++||+.+||+.++.+++++||||++++|+ +|++.+|+||+|+.|++..+..+ T Consensus 92 ------------~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~v~v~~~~~g 158 (432) T protein:vir:10 92 ------------AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKG 158 (432) T ss_pred ------------ccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEEcCCC Confidence 1136666665 589999999999999999999999999999997 59999999999999987543211 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGV 230 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~ 230 (602) .. 
.|++...+|...++++++|||+|+++ .++++|+ T Consensus 159 ---------------~~-----------------------------~y~~~~~~g~~~~~~~~~iih~~~~~-~dg~~G~ 193 (432) T protein:vir:10 159 ---------------NT-----------------------------AYRYRRTDGQMIDIPKQQIWKIMGYS-LDGENGL 193 (432) T ss_pred ---------------cE-----------------------------EEEEEecCceEEEEcCccEEEecCCC-CCCcccc Confidence 11 11223345667789999999999775 6889999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccc Q lcl|NC_021537. 231 PDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDG 310 (602) Q Consensus 231 spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~ 310 (602) ||+..+..+|..+.++++++.++|+||++|++||++++ .+++++.+++++.|. |..|+|+++++++|++ T Consensus 194 spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~~~~~~---~~~nag~~~vl~~g~~------- 262 (432) T protein:vir:10 194 SAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR-FLTDDQYDSFAKKVS---GSVEAGRAPLLEGGMD------- 262 (432) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC-CCCHHHHHHHHHHHh---hhhhCCCceecCCCce------- Confidence 99999999999999999999999999999999999876 589999888877664 5678899999877654 Q ss_pred ccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCC---ccCHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 
311 GSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSN---RANSKEQTREFAKGIIEPEQAKFSARL 387 (602) Q Consensus 311 ~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~---~sn~e~~~~~f~~~~l~P~~~~ie~~l 387 (602) |++++ ++++|+||+|++++++++||++|||||++||+.+.++ ++|+|++.+.|+++||.||++.||++| T Consensus 263 -------~~~l~-~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~f~~~tl~P~~~~ie~~l 334 (432) T protein:vir:10 263 -------VKSLG-LNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLSMTLSPWLRRIEQSI 334 (432) T ss_pred -------EEEcc-CChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55554 4568999999999999999999999999999887655 478999999999999999999999999 Q ss_pred hhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccc Q lcl|NC_021537. 388 YKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADA 467 (602) Q Consensus 388 n~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~ 467 (602) |++|+++.++ .+++++||.+.+++. |.+.+++++++++++|+||+||+|+++||||++|++...+++.+..++.... T Consensus 335 n~kL~~~~~~-~~~~~~fd~~~ll~~--d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g~~~~~~~~~~~~pl~~~~ 411 (432) T protein:vir:10 335 ALNLLSPAER-RRYFADFDTSALLRA--DSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIG 411 (432) T ss_pred HhhhcCcccc-CceEEEeechhhhcc--CHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceEeecCcccchhhhc Confidence 9999998765 578999999999877 7788889999999999999999999999999987653344455554443211 Q ss_pred cCCCcCcccccccccccccccccc Q lcl|NC_021537. 468 SDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) +............ 
+.++...+ T Consensus 412 ~~~~~~~~~~~~~---~~~~~~~~ 432 (432) T protein:vir:10 412 LQASPEPASGLGN---QQQDKVSK 432 (432) T ss_pred ccCCCCCCCCCCC---cccccccC Confidence 1111111100000 00111111 No 20 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=100.00 E-value=3e-81 Score=462.02 Aligned_cols=403 Identities=14% Similarity=0.083 Sum_probs=305.5 Q ss_pred CCCCcccccccchhhhccc--Ccc-ccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDV--GRG-IQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQ 77 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~--~~~-i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~ 77 (602) .+|......+......... +.. --..+++.. +..++.|++||++||++||++||+++++.+++.... T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~----al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~~------ 76 (419) T protein:vir:57 7 WKGRPSENRVNWQVVPGGMRSSSSQAGVIITPET----ALALSAVRACVTLLAESVAQLPCVLYRRTENGGREI------ 76 (419) T ss_pred hccCCccccccccccccccccccccCCceechHH----hhccHHHHHHHHHHHHhhccCceEEEEEcCCCceec------ Confidence 4444332222221110000 000 001123322 234678999999999999999999998776654321 Q ss_pred HHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccc Q lcl|NC_021537. 78 TVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDG 156 (602) Q Consensus 78 ~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~ 156 (602) ...|+++.++ .+||++||+.+||+.++.+++++||+|++|+|+.+|++++|+||+|++|++..+.. 
T Consensus 77 ------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~------- 143 (419) T protein:vir:57 77 ------AFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPD------- 143 (419) T ss_pred ------cccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCC------- Confidence 1235666666 58999999999999999999999999999999999999999999999998754322 Q ss_pred hhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHH Q lcl|NC_021537. 157 EEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAA 236 (602) Q Consensus 157 ~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~ 236 (602) +..|+++ . +....+++++|||+|+++ .++++|+||+..+ T Consensus 144 --------g~~~y~~------------------------------~--~~~~~~~~~~vih~r~~~-~d~~~G~s~i~~~ 182 (419) T protein:vir:57 144 --------GMPYYDI------------------------------P--SIGEILPMRMVHHIKSFS-LDGYIGTSPIQTN 182 (419) T ss_pred --------ceEEEEE------------------------------c--CCceEEchhhEEEecCcC-CCCcccccHHHHH Confidence 1112221 0 112357899999999875 6789999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCceEEEecc---ccCCHHHHHHHHHHHHH-hhcccccCcceeccCCccceecccccc Q lcl|NC_021537. 237 MQTMGADQAAKEWNHDVFDNLGIPHYAVKVTG---GTLSEDSKEDLRNLMDN-LKGSRYRTAILEVEEFVDDHGLGDGGS 312 (602) Q Consensus 237 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~g~~~~~~~~~~~ 312 (602) ..+|....++++++.++|+||++|+++|++++ ..+++++.+++++.|.+ +.|..|+|+++++++|+ T Consensus 183 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~---------- 252 (419) T protein:vir:57 183 PDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQEGM---------- 252 (419) T ss_pred HHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccccceecCCCc---------- Confidence 99999999999999999999999999999864 45789999999999976 55668999999987664 Q ss_pred ccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021537. 
313 DVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIH 392 (602) Q Consensus 313 ~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll 392 (602) +|++++ ++++|+||+|++++++++||++|||||.+||..+.++++|+|++.+.|+++||+|+++.||++||++|+ T Consensus 253 ----~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~ll 327 (419) T protein:vir:57 253 ----TYKQLS-QDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGLQYVIYTMLAILKRHESAMMRDLL 327 (419) T ss_pred ----eEEEcC-CChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 556665 466899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCc Q lcl|NC_021537. 393 QDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDA 472 (602) Q Consensus 393 ~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~ 472 (602) ++.+. .+++++||++++++. |.+.+++++++++++|+||+||+|+++|+||+|||+ ..+++.+.+... ....... T Consensus 328 ~~~~~-~~~~i~fd~~~ll~~--d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD-~~~~~~n~~~~~-~~~~~~~ 402 (419) T protein:vir:57 328 LPSER-RDFYIEFNVSSLLRG--DQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGGD-KYLTPLNMVDSK-ALTGIGK 402 (419) T ss_pred Ccccc-CCeEEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-eeeecccccccc-ccccccC Confidence 88665 589999999999876 778888999999999999999999999999998763 333444544332 2221111 Q ss_pred C--cccccccccccccc Q lcl|NC_021537. 
473 E--AMLTRSKAAPPLEN 487 (602) Q Consensus 473 ~--~~~~~~~~~~~~~~ 487 (602) + .+..+..+....++ T Consensus 403 ~~~~~~~~~~~~~~~~~ 419 (419) T protein:vir:57 403 ATPQQLKDIEAILCTRN 419 (419) T ss_pred CCcccCcchhhhhhccC Confidence 1 11111111111111 No 21 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=100.00 E-value=2.8e-81 Score=462.22 Aligned_cols=401 Identities=14% Similarity=0.123 Sum_probs=306.8 Q ss_pred CCCCccccc---------ccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCccc Q lcl|NC_021537. 1 MSKAEETTQ---------LDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDE 71 (602) Q Consensus 1 ~~k~~~~~~---------~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~ 71 (602) .++....+. .....+....+ +.+... .-..-+..+++|++||++||++||++|++++++... . T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~~~~~--v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~~~~-~--- 82 (422) T protein:vir:13 11 KNNNDEKRSNYDEDIGIDISDSNFWEKFG--IKLNFS--VRGKRALKENTVYVCTKIRAESIGKLSLKIYKDKEE-Y--- 82 (422) T ss_pred cCCccchhhhhhhccccccCcchhhhhcc--ccCCcc--cchhhhhccHHHHHHHHHHHHhhhhCceEEEecCcc-c--- Confidence 111111111 00001111111 111111 111122346889999999999999999999864211 1 Q ss_pred chhhHHHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 
72 GGESYQTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) ..|++..++ .+||++||+.+||+.++.+++++||||++++|+..|++++|+||+|++|++..+..+ T Consensus 83 -------------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~ 149 (422) T protein:vir:13 83 -------------KEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDN 149 (422) T ss_pred -------------ccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCc Confidence 123444445 589999999999999999999999999999999999999999999999987655432 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGV 230 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~ 230 (602) ..... ....+.+...+|...+++++||||++.+.+.++++|+ T Consensus 150 ~~~~~--------------------------------------~~~~y~~~~~~g~~~~~~~~eiih~~~~~~~~~~~G~ 191 (422) T protein:vir:13 150 FLSSL--------------------------------------SKVWYVVTDKNGKEHKLLPDEMLHFIGDITLDGLIGI 191 (422) T ss_pred ceecc--------------------------------------ceEEEEEEeCCCeEEEEcccceEEEcCCCCCCCcccc Confidence 21100 0011123334567788999999999988888999999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-hcccccCcceeccCCccceeccc Q lcl|NC_021537. 231 PDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-KGSRYRTAILEVEEFVDDHGLGD 309 (602) Q Consensus 231 spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-~g~~nag~~~~~~~g~~~~~~~~ 309 (602) ||+..+..+|....++++++.++|+||++|+|+|++++ .+++++.+++++.|++. 
.|..|+++++++++|++ T Consensus 192 s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~------ 264 (422) T protein:vir:13 192 KPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVG-DLDEKAKKIFKKEFESMSNGLENAHSISLLPFGYQ------ 264 (422) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-CCCHHHHHHHHHHHHHHhcCccccCCceecCCCce------ Confidence 99999999999999999999999999999999999976 58999999999999875 56688999999877654 Q ss_pred cccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021537. 310 GGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYK 389 (602) Q Consensus 310 ~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~ 389 (602) |++++ +++.|+||+|++++++++||++|||||++||..++++++|++++.+.|+++||+|++++||++||. T Consensus 265 --------~~~l~-~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~ 335 (422) T protein:vir:13 265 --------FQPIS-LSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKDFYVTTLQSSLTVYEQEIQD 335 (422) T ss_pred --------eeecc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55554 456799999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccC Q lcl|NC_021537. 390 IIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASD 469 (602) Q Consensus 390 ~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~ 469 (602) +|+++.+...+++++|+.+.+++. |.+.+++++++++++|+||+||+|+++|++|+|||+ ..+++.+++++....+. 
T Consensus 336 ~Ll~~~~~~~g~~i~fd~~~l~r~--d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD-~~~~~~n~~~l~~~~~~ 412 (422) T protein:vir:13 336 KLFSQYETLQDVKAEFNVDTILRS--DIKTRYEAYRIGIQGGFIEANEARRRENLPPVEGGD-RLLVNGNMIPIEMAGEQ 412 (422) T ss_pred hhCChhhhcCCceEEeechhhhcC--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-eeeeccCccchhhcccc Confidence 999998887889999999999876 777788999999999999999999999999998864 44556666554321111 Q ss_pred CCcCccccccccccccc Q lcl|NC_021537. 470 GDAEAMLTRSKAAPPLE 486 (602) Q Consensus 470 ~~~~~~~~~~~~~~~~~ 486 (602) ...++.. ..+ T Consensus 413 ~~~~g~~-------~g~ 422 (422) T protein:vir:13 413 YKKGGEK-------GGK 422 (422) T ss_pred cccCCCc-------CCC Confidence 1100000 000 No 22 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=100.00 E-value=4.3e-81 Score=461.19 Aligned_cols=401 Identities=13% Similarity=0.089 Sum_probs=303.5 Q ss_pred CCCCcccccccchhhhcccC----ccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVG----RGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESY 76 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~----~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~ 76 (602) ..|..++............+ ......++++ .+..+++|++||++||++||++||+++++.+..... T Consensus 8 f~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~----~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~~------ 77 (414) T protein:vir:44 8 FQRKSDAPVTTPAELADAIGLSYDTYTGKQISSQ----RAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLKQR------ 77 (414) T ss_pred hccCccCcccchhhHhHhhccCccccCCceechh----hhhccHHHHHHHHHHHHHhccCceEEEEecCCceee------ Confidence 22211121111111111111 0111112332 234578999999999999999999999876543221 Q ss_pred HHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccccccc Q lcl|NC_021537. 
77 QTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERED 155 (602) Q Consensus 77 ~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~ 155 (602) ...|+++.++ .+||++||+.+||+.++.+++++||||++++++ .|++.+|+||+|..|.+..+... T Consensus 78 -------~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~----- 144 (414) T protein:vir:44 78 -------ATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSSW----- 144 (414) T ss_pred -------cccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCCC----- Confidence 1134555555 589999999999999999999999999999887 59999999999999976432110 Q ss_pred chhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHH Q lcl|NC_021537. 156 GEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVA 235 (602) Q Consensus 156 ~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~ 235 (602) .. .+++...+|....+++++|||+|+++ .++++|+||+.. T Consensus 145 ----------~~-----------------------------~y~~~~~~g~~~~~~~~evih~~~~~-~d~~~G~s~i~~ 184 (414) T protein:vir:44 145 ----------EP-----------------------------VYQVTFPDGSTDVLSQEDIWHVRTLT-LDGLVGLNPIAY 184 (414) T ss_pred ----------cE-----------------------------EEEEEecCceEEEEccccEEEecCCC-CCCcccccHHHH Confidence 00 12233345667789999999999874 688999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-hhcccccCcceeccCCccceecccccccc Q lcl|NC_021537. 
236 AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-LKGSRYRTAILEVEEFVDDHGLGDGGSDV 314 (602) Q Consensus 236 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~g~~~~~~~~~~~~~ 314 (602) +..+|....++++++.++|+||++|+++|++++ .+++++.+++++.|++ +.|..|+|+++++++|++ T Consensus 185 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~----------- 252 (414) T protein:vir:44 185 AREAISLAAATEEHGARLFSNGAVTSGVLRTEQ-TLSDQAYERLKKDFEERHTGLGNAHRPMILEMGLD----------- 252 (414) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-CCCHHHHHHHHHHHHHHhcCccccCcceecCCCce----------- Confidence 999999999999999999999999999999876 5899999999999975 556689999999877654 Q ss_pred ccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc Q lcl|NC_021537. 315 NIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQD 394 (602) Q Consensus 315 ~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~ 394 (602) +++++ ++++|+||+|++++++++||++|||||++||..++++++|+|++.+.|+++||+|+++.||++||++|+++ T Consensus 253 ---~~~l~-~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~ 328 (414) T protein:vir:44 253 ---WKSMA-LNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGLVRK 328 (414) T ss_pred ---EEEcc-CChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Confidence 55554 45689999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred cccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCc Q lcl|NC_021537. 395 ALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEA 474 (602) Q Consensus 395 ~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~ 474 (602) .+. .+++++|+.+.+++. 
|.+.+++++++++++|+||+||+|+++|+||+|||+ ..+++.+++....+......++ T Consensus 329 ~~~-~~~~i~fd~~~ll~~--d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD-~~~~~~n~~~~~~~~~~~~~~~ 404 (414) T protein:vir:44 329 SKQ-GVFYAKFNAGALLRG--DMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGD-VYLTPMNMTTKPSDGSKAGKQK 404 (414) T ss_pred ccc-CceEEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-eecccccccccCCccccCCCCC Confidence 875 578999999999876 777888999999999999999999999999999874 3334444443322211111111 Q ss_pred ccccccccccc Q lcl|NC_021537. 475 MLTRSKAAPPL 485 (602) Q Consensus 475 ~~~~~~~~~~~ 485 (602) . ....+++.. T Consensus 405 ~-~~~~d~~~~ 414 (414) T protein:vir:44 405 D-NANADETTS 414 (414) T ss_pred C-CCCCCCCCC Confidence 0 000110110 No 23 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=100.00 E-value=8.2e-81 Score=459.66 Aligned_cols=434 Identities=17% Similarity=0.161 Sum_probs=309.3 Q ss_pred CCCCcccc--cccchhhh------cccCcc--ccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcc Q lcl|NC_021537. 1 MSKAEETT--QLDERHIA------TDVGRG--IQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPD 70 (602) Q Consensus 1 ~~k~~~~~--~~~~~~~~------~~~~~~--i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~ 70 (602) .++..... ....+.+. ...++. -..++++.. +..+++|++||++||++||++||+++++.+..... 
T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~----al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~~~~ 83 (457) T protein:vir:62 8 FGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHD----ALQVSAVFASVRLLSETIATLPLSTYSKRGGTRKE 83 (457) T ss_pred hccccccccccccccccccchhhhhhccccccCCceechHH----hhccHHHHHHHHHHHHhHhhCceEEEEecCCcccc Confidence 11111100 00000000 001110 011233332 33468899999999999999999998765432211 Q ss_pred cchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 71 EGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) ...+....++.+||+.||+.+||+.++.+++++||||+++.++ .|++.+|+||+|.+|++..+... T Consensus 84 -------------~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~ 149 (457) T protein:vir:62 84 -------------IDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVD 149 (457) T ss_pred -------------ccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccC Confidence 1234566788899999999999999999999999999998665 69999999999999987543221 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecC--ceeEEechhHEEEecCCCCCCCcc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDA--GELKNGPANELIFLPNPSPLALYY 228 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~--~~~~~~~~~eviH~r~~~~~~~~~ 228 (602) ... ...|.. +.+...+ .....|+++||||||.+++.+.++ T Consensus 150 ~~~-----------~~~~~~---------------------------y~~~~~g~~~~~~~~~~~eiih~r~~~~~~~~~ 191 (457) T protein:vir:62 150 GLR-----------RKVFEA---------------------------YDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFV 191 (457) T ss_pred Ccc-----------ceeEEE---------------------------EEEccCCceeEEEeeCccceEEecCCCCCCcee Confidence 000 000000 0000011 123578999999999999888899 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-hcccccCcceeccCCccceec Q lcl|NC_021537. 
229 GVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-KGSRYRTAILEVEEFVDDHGL 307 (602) Q Consensus 229 G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-~g~~nag~~~~~~~g~~~~~~ 307 (602) |+||+..+..+|....++++++.++|+||++|++||++++ .+++++.+++++.|++. .|..|+|+++++++|+ T Consensus 192 G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~ls~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~----- 265 (457) T protein:vir:62 192 GCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPG-TMSEEGLARAREAWRAANSGVDNAHRVALLTEGA----- 265 (457) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCC-CCCHHHHHHHHHHHHHHhcCccccCcceecCCCc----- Confidence 9999999999999999999999999999999999999986 58999999999999875 5668999999987665 Q ss_pred cccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCc--cCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 308 GDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNR--ANSKEQTREFAKGIIEPEQAKFSA 385 (602) Q Consensus 308 ~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~~~f~~~~l~P~~~~ie~ 385 (602) +|++++ ++++|+||+|++++++++||++|||||++||+.+++++ ||+|++.+.|+++||+||++.||+ T Consensus 266 ---------~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~~l~P~~~~ie~ 335 (457) T protein:vir:62 266 ---------KFSKVA-MSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSLRPWLERIEA 335 (457) T ss_pred ---------eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 455665 46689999999999999999999999999999888775 889999999999999999999999 Q ss_pred HHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccc-ccccccc Q lcl|NC_021537. 386 RLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLS-EFEAEFG 464 (602) Q Consensus 386 ~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~-~~~~~~~ 464 (602) +||++|+++.+. .+++++|+++.+++. |.+.+++++.+++++|+||+||+|+++||||++||.+|.++. 
.++...+ T Consensus 336 ~ln~~L~~~~~~-~~~~i~fd~~~l~~~--d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~~~ 412 (457) T protein:vir:62 336 GFNRLLFAETAD-RFRFVKFNLDEIKRG--APKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEIG 412 (457) T ss_pred HHHhhhcCcccc-CceEEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccc Confidence 999999998775 578899999999876 778888999999999999999999999999999998776654 4555555 Q ss_pred ccccCCCcCcccccccccccccccccccccccccccccchhhhhcchhhhhhheecc Q lcl|NC_021537. 465 ADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDF 521 (602) Q Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~ 521 (602) ...+..+.+.+.+... ++.++............+ +..-...+.|+ T Consensus 413 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~----------d~~~~~~~~~~ 457 (457) T protein:vir:62 413 EEPEPEPAPAPPAIDP--PAEEPADDEEPDNAEGDP----------DEGETEDDDDA 457 (457) T ss_pred ccccccccCCCccCCC--CccCCCCCCCCCCCCCCC----------ccccccccccC Confidence 4433222222111111 111111011000000000 00011112222 No 24 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=100.00 E-value=6.9e-81 Score=460.04 Aligned_cols=495 Identities=22% Similarity=0.335 Sum_probs=332.3 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) ++++.+++++.... .+++++||+|+..|+++++.|++|++||++||++|+++||+++.+... 
T Consensus 17 ~~~~~~~~~~~~~~----~~~~~~pp~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~~i~~~~~~-------------- 78 (540) T protein:vir:41 17 IKGDTDSQALKEDR----FEEYVEPKVHPLVLLSLLQVNPYHASACSIKANDILRTGYLIDGDDGG-------------- 78 (540) T ss_pred hhccccccccccCC----CCccccCCCCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCceEecCccc-------------- Confidence 44555566654433 357899999999999999999999999999999999999998643211 Q ss_pred HhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhhh Q lcl|NC_021537. 81 DFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEVE 160 (602) Q Consensus 81 ~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~ 160 (602) ...+ .||+.+|+.+||++++.|++++||||++++|+..|++++|+||+|.+|++..+ T Consensus 79 --------~~~~--lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~------------- 135 (540) T protein:vir:41 79 --------VEEL--LRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRD------------- 135 (540) T ss_pred --------hhhh--ccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEc------------- Confidence 1112 27889999999999999999999999999999999999999999999998654 Q ss_pred hcccCceeEEEEcCCccee-ecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHH Q lcl|NC_021537. 161 NIESGHGYVQVRQGRRRYF-GEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQT 239 (602) Q Consensus 161 ~~~~~~~~~qi~~~~~~~~-~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~ 239 (602) +..|+|+.++....+ ..++.. ..++. 
..+.....++++||||+|.+++.+++||+||+.++..+ T Consensus 136 ----~~~~~~~~d~~~~~~~~~~~~~-----~~~~~------~~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~ 200 (540) T protein:vir:41 136 ----GSRYMQTWDGIHVTYFKDYRYE-----GEVNP------DNGEDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPS 200 (540) T ss_pred ----CceeEeeecCceeeeeeccccc-----ceeec------cccccceeecccceEEecCCCCCCCcccccHHHHHHHH Confidence 345667666654322 222110 11111 12334568999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHH---------HHHHHHHHHHH-hhc-ccccCcceeccCCccceecc Q lcl|NC_021537. 240 MGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSED---------SKEDLRNLMDN-LKG-SRYRTAILEVEEFVDDHGLG 308 (602) Q Consensus 240 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~---------~~~~l~~~~~~-~~g-~~nag~~~~~~~g~~~~~~~ 308 (602) |..+.++++++.++|+||++|++||++++...++. ..+.+++.|++ +.| ..|+|++++++. T Consensus 201 i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~-------- 272 (540) T protein:vir:41 201 ILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSI-------- 272 (540) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEec-------- Confidence 99999999999999999999999999986543332 23556666654 344 368999988752 Q ss_pred ccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccCHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 309 DGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRANSKEQTREFAKGIIEPEQAKFSAR 386 (602) Q Consensus 309 ~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~f~~~~l~P~~~~ie~~ 386 (602) +++.+.+++|+||+. +++|+||+|++++++++||++|||||++||+.+. 
.|+||++++.+.|+++||+|++++||++ T Consensus 273 ~~~~~~g~~~~pl~~-~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~ie~~ 351 (540) T protein:vir:41 273 PGGDTVEVTFTPLNT-SQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQEIVSSV 351 (540) T ss_pred CCCcccceeEEeccc-chhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHH Confidence 223456889999986 5689999999999999999999999999998754 5689999999999999999999999999 Q ss_pred HhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHh-CCCCCCCCcccccccc-cccc-- Q lcl|NC_021537. 387 LYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREEL-DLAPFEDDRGDMTLSE-FEAE-- 462 (602) Q Consensus 387 ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~-Gl~p~~~g~~d~~~~~-~~~~-- 462 (602) ||++|++..+ .+++|+|+..++++. |.+ ..+.+++++|++|+||+|+.+ |++|.+ +.++.+ +... T Consensus 352 ln~~L~~~~~--~~~~i~f~~~~ll~~--D~~---~~~~~lv~~G~lT~NE~Re~L~g~e~gd----d~~l~p~n~~~~~ 420 (540) T protein:vir:41 352 LTDFIQLKLD--PGARFVFNEEILMES--EFV---HNYALLVQCGVLTPSEVREKLFGLDGGP----DMFMVPSSIGKSA 420 (540) T ss_pred HHHhhhhccC--CceEEEecchhhcch--HHH---HHHHHHHhCCCCCHHHHHHHhCcCcCCC----ccccccccccccc Confidence 9999987654 478999999999876 433 346678999999999999854 666533 323222 2221 Q ss_pred ccccccCCCcCcccc----cccccccccccccccccccccccccchhhhhcchhhhhhheecccccEEEEEEecccCC-c Q lcl|NC_021537. 463 FGADASDGDAEAMLT----RSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFGERELYLSFKRESGQ-N 537 (602) Q Consensus 463 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~~~l~~~f~~~~~~-~ 537 (602) ...+......+.... ....++..+...++...... ..+.++. ....+..-.+..+.+.| .|.++.|+ . 
T Consensus 421 ~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~--~~~~~~~---~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 493 (540) T protein:vir:41 421 MKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLED--KKKKIDE---VLSDFRAEAYENGKKML--SIAGDMGTMS 493 (540) T ss_pred ccccccccCCCCccccccccchhcccccCcccccccccc--ccccccc---cccccCCccccchhHHH--HHhhhhhhhh Confidence 111111111110000 00011111111111000000 0000000 00111111222344554 34555444 3 Q ss_pred ceeeeccCC------HHHHHHHhCCCccchhhhhhhcccccccccccchhcc Q lcl|NC_021537. 538 SLYVYVDVP------AAVWSALVSAPSAGSYHYSEIRLQYGYLEVTNNHERL 583 (602) Q Consensus 538 ~~y~y~~v~------~~~~~~~~~a~s~g~~~~~~i~~~~~~~~~~~~~~~~ 583 (602) ++-+-..|- .+-|++|+.|+--- -. ..|| +|-|.-|+=.. + T Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~-~~~~~~~~~~~--~ 540 (540) T protein:vir:41 494 AINRGVSMIPPKPSNLEAYEDLLAASVDD-IV-ERIR-HYLYKVIGWRE--L 540 (540) T ss_pred hhhcCceecCCCCcchHHHHHHHHhhHHH-HH-HHHH-HHHHHHhhhcc--C Confidence 344433332 46799999885311 00 0111 23333332111 0 No 25 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=100.00 E-value=3.3e-82 Score=467.28 Aligned_cols=501 Identities=15% Similarity=0.114 Sum_probs=339.1 Q ss_pred chhhhcccCccccCCC-CHHHH-HHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHHHhhhccchh Q lcl|NC_021537. 12 ERHIATDVGRGIQPPY-NPETL-AAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSR 89 (602) Q Consensus 12 ~~~~~~~~~~~i~p~~-~~~~l-~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (602) -..+.+..|++..|.. +...+ ...+.++++|++||++||++||++||+++..++. 
....|++ T Consensus 1 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~----------------~~~~~~l 64 (723) T protein:vir:94 1 MTTFPSGAGGWNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDGE----------------LDELHPL 64 (723) T ss_pred CcccccCCCccccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCc----------------cchhhHH Confidence 2334444455444422 22222 3455678999999999999999999998743211 0112556 Q ss_pred hhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC---CCceEEEEEeCcccccccccccccccccchhhhhcccC Q lcl|NC_021537. 90 WQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEG---DGTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESG 165 (602) Q Consensus 90 ~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~---~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~ 165 (602) +.++. +||++||+.+||+.++.+++++||+|++++|++ .|.|.+|++|+++.+.+....... T Consensus 65 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~-------------- 130 (723) T protein:vir:94 65 SQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAAD-------------- 130 (723) T ss_pred HHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCc-------------- Confidence 66665 799999999999999999999999999999765 488999999998766543221110 Q ss_pred ceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHH Q lcl|NC_021537. 166 HGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQA 245 (602) Q Consensus 166 ~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~ 245 (602) .+... 
....+.+...+|....++++||||||.+++.++++|+||+..+..+|....+ T Consensus 131 -----------~~~~~------------~~~~y~~~~~~G~~~~~~~~dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~a 187 (723) T protein:vir:94 131 -----------AVPQA------------QIIGYVIERTDGVRVPVLADEMLWLRFSDPYDPLAVMAPWKAARAAVDADFY 187 (723) T ss_pred -----------cceee------------eeeEEEEEecCceeEEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHH Confidence 00000 0011223334567788999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-hhcccccCcceeccCCccceecccccccccccccccccc Q lcl|NC_021537. 246 AKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-LKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAR 324 (602) Q Consensus 246 ~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~ 324 (602) +++++.++|+||++|+|||+.+ .+++++.+++++.|++ +.|..|+|+++++++... .+.+++.+++|++++ + T Consensus 188 a~~~~~~~f~NG~~p~giL~~~--~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~----~~~vl~~G~~~~~l~-~ 260 (723) T protein:vir:94 188 AATWQRQSFKNGARPGGVVNLG--DMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGS----DGGAAGKGATFTSLS-M 260 (723) T ss_pred HHHHHHHHHhcCCCcceEEEcC--CCCHHHHHHHHHHHHHHhhchhhcCcceeeccccc----ccccccCCceEEEcc-C Confidence 9999999999999999999975 3899999999999976 567799999999975432 344567789999998 5 Q ss_pred chHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEE Q lcl|NC_021537. 325 EDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTID 404 (602) Q Consensus 325 ~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~ 404 (602) +++|+||+|++++++++||++|||||++|+. .++++|++++.+.|+++||+||++.||++||.+|++..+. 
.++++ T Consensus 261 s~~D~q~le~r~~~~~eIa~afgVPp~~i~~--~st~sN~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~g~--~~~~~ 336 (723) T protein:vir:94 261 SPAEMDYINSRMHSAEEVMLAFGIRKDALLG--GSTYENQAEAKAAVWTETLIPQMEVMASITDLQLLPDIGW--TVEWD 336 (723) T ss_pred CHHHHHHHHHHHHhHHHHHHHhCCChhHcCC--CCCcccHHHHHHHHHHHHHHHHHHHHHHHHhHhhcccccC--ceEEe Confidence 6789999999999999999999999999964 5689999999999999999999999999999999976532 46677 Q ss_pred eccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCccccccccccc Q lcl|NC_021537. 405 FELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPP 484 (602) Q Consensus 405 f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (602) |+...+++. |.+.+++++++++++|+||+||+|+++||||+|||+++.++.+.....+......+..+... . T Consensus 337 f~~~~lLr~--D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~~~~~p~~~e~~------~ 408 (723) T protein:vir:94 337 FNSVPALQE--DLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQMTLTPYRAQFAPAPAPAPAVEEGA------A 408 (723) T ss_pred ecchhhhhc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeccccccccCCCCCCccchhhh------H Confidence 777777755 88888899999999999999999999999999999988776655443332211111100000 0 Q ss_pred ccccccccccccccccccchhhhhcchhhh--hhheecccccE----------EEEEEecccCCcceeeeccCCHHHHHH Q lcl|NC_021537. 485 LENKIGERDSVDVDVSKDPIEQTTFSSSNL--DEGLYDFGERE----------LYLSFKRESGQNSLYVYVDVPAAVWSA 552 (602) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~--~~~~yd~~~~~----------l~~~f~~~~~~~~~y~y~~v~~~~~~~ 552 (602) .-.+..++..++...+ ..++..+.. ..-|-|++... |.++.. -+-..+-...+.. 
T Consensus 409 ~~~~~~~~~~~~~p~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~ 475 (723) T protein:vir:94 409 RMLALLERVAADRPLP-----ELPVRATTVLHHDPGPDPQQTLYERLEALLQPLLVELG--------RRQAAVTLREFDL 475 (723) T ss_pred hhhhhccccccccCcC-----CCCCCCCCCCCCCcccCCchhHHHHHHHHHhhhHHHHH--------HHHHHHHHHhhch Confidence 0001111111111111 111211111 11111111100 000000 0001111222222 Q ss_pred HhCCCccchhhhhhhcccccccccccchhcccCCCCCChhhcCCcccccC Q lcl|NC_021537. 553 LVSAPSAGSYHYSEIRLQYGYLEVTNNHERLPEGPTPDPGEAPEDVPSDI 602 (602) Q Consensus 553 ~~~a~s~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 602 (602) ++.-.-...-+-...+.. +++.+.|.....+|++.+|+...=.++ T Consensus 476 ~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~v~~~~~~~~ 520 (723) T protein:vir:94 476 LMRGERAAALWLADVRAV-----ASEAYERGALLAPPDAEEVPPARLTRL 520 (723) T ss_pred hhcchHHHHHHHHHHHHH-----HHhccccceeccccccchhhHHHHHHH Confidence 333333333333333322 234455666677888777765433333 No 26 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=100.00 E-value=1.7e-80 Score=457.98 Aligned_cols=404 Identities=13% Similarity=0.072 Sum_probs=305.5 Q ss_pred CCCCcccccccchhhhcccCccccCC---CCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPP---YNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQ 77 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~---~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~ 77 (602) +.+...... ....+ +-.+....+. -.-..+.+.+..+++|++||++||++||++||+++++..++..+.. 
T Consensus 7 ~~~~~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~~~~----- 79 (423) T protein:vir:81 7 LGLAPSVVA-TPEPI-ELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGGRERV----- 79 (423) T ss_pred hcccccccc-Ccccc-ccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCceeee----- Confidence 211111100 00000 0011111111 1123567778889999999999999999999999887655433211 Q ss_pred HHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC--CceEEEEEeCccccccccccccccccc Q lcl|NC_021537. 78 TVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD--GTPVGLAHVPAATVRVRKTTTTIERED 155 (602) Q Consensus 78 ~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~--G~~~~L~~l~p~~v~~~~~~~~~~~~~ 155 (602) ..|+.+.++.+||++||+.+||+.++.+++++||+|+++.|+.. +.+..|+|+++..|++....... T Consensus 80 -------~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~---- 148 (423) T protein:vir:81 80 -------REGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGW---- 148 (423) T ss_pred -------ccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCC---- Confidence 23667778889999999999999999999999999999999863 45677888888777654321110 Q ss_pred chhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHH Q lcl|NC_021537. 156 GEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVA 235 (602) Q Consensus 156 ~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~ 235 (602) +..++++ ......+|...+++++||||+|.+++.+.++|+||+.. T Consensus 149 ---------~~~~Y~~--------------------------~~~~~~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~ 193 (423) T protein:vir:81 149 ---------GSLDYII--------------------------IESGDNDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQS 193 (423) T ss_pred ---------cceEEEE--------------------------EEecCCCceEEEEcccceEEecCCCCCCccccccHHHH Confidence 1111111 01112356677899999999999998888899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEecc----ccCCHHHHHHHHHHHHHhh--cccccCcceeccCCccceeccc Q lcl|NC_021537. 
236 AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTG----GTLSEDSKEDLRNLMDNLK--GSRYRTAILEVEEFVDDHGLGD 309 (602) Q Consensus 236 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~----~~~~~~~~~~l~~~~~~~~--g~~nag~~~~~~~g~~~~~~~~ 309 (602) +..+|....++++++.++|+||+.|+++|+++. +.+++++.+++++.|++.. +..|+|+++++++|+ T Consensus 194 ~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~------- 266 (423) T protein:vir:81 194 LRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGM------- 266 (423) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceecCCCc------- Confidence 999999999999999999999999999998753 3479999999999998753 568899999987765 Q ss_pred cccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021537. 310 GGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYK 389 (602) Q Consensus 310 ~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~ 389 (602) +|++++ ++++|+||+|++++++++||++|||||++||+.++++|+|+|++.+.|+++||+|+++.||++||+ T Consensus 267 -------~~~~l~-~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~L~P~~~~ie~~l~~ 338 (423) T protein:vir:81 267 -------KAENFH-TTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREFRKALYGDNLGSWIRIIQDVMNL 338 (423) T ss_pred -------eEEecc-CChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 455565 456899999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCCcccc-ccceEEEeccchhcchhHHHHHHHHHHHHHH-hCCcccHHHHHHHhCCCCCCCCccccccccccccccccc Q lcl|NC_021537. 390 IIHQDALD-VDEWTIDFELRGAEQPEQDAKMAEQRVRAMR-LAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADA 467 (602) Q Consensus 390 ~Ll~~~~~-~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~-~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~ 467 (602) +|+++.+. ..+++++||.+++++. |.+.+++++++++ ++|+||+||+|+++||+|+|||| ..+++.|+...+... 
T Consensus 339 ~L~~~~~~~~~~~~~~fd~~~llr~--d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~~gGD-~~~~p~n~~~~~~~~ 415 (423) T protein:vir:81 339 FLLPRVGIDNEKFYFEFNLEEKLRA--SFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSIDGGD-DLARPLNTEFGDSED 415 (423) T ss_pred hhcCccccccCccEEEecchhhhcc--CHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCCCCcc-eeecccccccCccCC Confidence 99998664 4678999999999877 7777888888887 56999999999999999999875 334444443322211 Q ss_pred cCCCcCcc Q lcl|NC_021537. 468 SDGDAEAM 475 (602) Q Consensus 468 ~~~~~~~~ 475 (602) ..++..++ T Consensus 416 ~~~~~~~t 423 (423) T protein:vir:81 416 APGEEVET 423 (423) T ss_pred CCCCCCCC Confidence 11111111 No 27 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=100.00 E-value=1.2e-80 Score=458.77 Aligned_cols=404 Identities=15% Similarity=0.125 Sum_probs=304.2 Q ss_pred CCCCcccccccchhhhcccCc---cccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGR---GIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQ 77 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~---~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~ 77 (602) .+++.++.-.....+..-.++ .....++... +..+++|++||++||++||++||+++.+.+.+... T Consensus 8 ~~~~~~~~~~~~~~~~~~~g~~~s~~~~~vt~~~----al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~------- 76 (419) T protein:vir:14 8 LSNLGQTQMSAGGWVSALLGSSRSDSGQVVTPAS----ALALTVLQNCVTLLAESIAQLPIELYERSGEDRKP------- 76 (419) T ss_pred cccccccccCcchhhHHhhcCCCccCCcccchHH----hhccHHHHHHHHHHHHhhccCceEEEEecCCcccc------- Confidence 344444433333222211111 1111233332 34568899999999999999999998876543221 Q ss_pred HHHHhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccc Q lcl|NC_021537. 
78 TVRDFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDG 156 (602) Q Consensus 78 ~~~~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~ 156 (602) ...|+++.++. +||++||+.+||+.++.+++++||+|++++|+.+|++++|+||+|++|++..+..+ T Consensus 77 ------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~------ 144 (419) T protein:vir:14 77 ------ATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDL------ 144 (419) T ss_pred ------ccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCc------ Confidence 12356666554 79999999999999999999999999999999999999999999999986543211 Q ss_pred hhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHH Q lcl|NC_021537. 157 EEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAA 236 (602) Q Consensus 157 ~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~ 236 (602) ..++++. ....+++++|||+++++ .++++|+||+..+ T Consensus 145 ---------~~~y~~~---------------------------------~~~~~~~~~i~h~~~~~-~dg~~G~s~i~~~ 181 (419) T protein:vir:14 145 ---------KPVYRVR---------------------------------GSDPMPQRLVHHVRWMS-INGYTGLSPVLLH 181 (419) T ss_pred ---------eEEEEEc---------------------------------cCcccchhheeEecCcC-CCCcccccHHHHH Confidence 1111110 11236789999999875 6889999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc---cCCHHHHHHHHHHHHH-hhcccccCcceeccCCccceecccccc Q lcl|NC_021537. 237 MQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG---TLSEDSKEDLRNLMDN-LKGSRYRTAILEVEEFVDDHGLGDGGS 312 (602) Q Consensus 237 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~---~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~g~~~~~~~~~~~ 312 (602) ..+|....++++++.++|+||++|+|+|++++. 
..++++.+++++.|++ +.|..|+|+++++++|+ T Consensus 182 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~---------- 251 (419) T protein:vir:14 182 ANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGM---------- 251 (419) T ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCc---------- Confidence 999999999999999999999999999998753 3468999999999976 55678999999987765 Q ss_pred ccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021537. 313 DVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIH 392 (602) Q Consensus 313 ~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll 392 (602) +|++++ ++++|+||+|++++++++||++|||||++||..++++++|+|++.+.|+++||.|++++||++||++|+ T Consensus 252 ----~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~f~~~~L~P~~~~ie~~l~~kll 326 (419) T protein:vir:14 252 ----TFRPLS-MTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDLL 326 (419) T ss_pred ----eEEEcc-CChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 455555 346799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCc Q lcl|NC_021537. 393 QDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDA 472 (602) Q Consensus 393 ~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~ 472 (602) ++.+. .+++++|+.+++++. |.+.+++++++++++|++|+||+|+++|++|+|||+ ..+.+.+++..+. +...+. 
T Consensus 327 ~~~~~-~~~~i~fd~~~l~r~--d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD-~~~~~~n~~~~~~-~~~~~~ 401 (419) T protein:vir:14 327 LPSER-KQYFIEYNLAGLLRG--DQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGD-IYLSPMNMVDASK-PQQLPV 401 (419) T ss_pred Ccccc-CCeEEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-eeeeccccccccc-cccccC Confidence 88775 589999999999876 778888999999999999999999999999999874 3344445444332 111111 Q ss_pred Cccccccccccccccccccccc Q lcl|NC_021537. 473 EAMLTRSKAAPPLENKIGERDS 494 (602) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~ 494 (602) +..... ++......+.-+ T Consensus 402 ~~~~~~----~~~~~e~~~~l~ 419 (419) T protein:vir:14 402 GKSEPT----KAAIDEIGRILS 419 (419) T ss_pred CCCCCc----cccccchhcccC Confidence 111000 000011111111 No 28 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=100.00 E-value=1.2e-80 Score=458.77 Aligned_cols=403 Identities=15% Similarity=0.149 Sum_probs=298.6 Q ss_pred CCCCc---ccccccc--hhhhcccCcccc---CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccc Q lcl|NC_021537. 1 MSKAE---ETTQLDE--RHIATDVGRGIQ---PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEG 72 (602) Q Consensus 1 ~~k~~---~~~~~~~--~~~~~~~~~~i~---p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~ 72 (602) ...++ ...++.. .......+.... ..++.. .+..++.|++||++||++||++||+++.+.+++... T Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~----~a~~~~aV~~~v~~Ia~~ia~lp~~~y~~~~~g~~~-- 91 (432) T protein:vir:97 18 VPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNAD----AIMRLDAVAACVKLVSQAVAAMPLMMYMRTPDGRKE-- 91 (432) T ss_pred CCccccccccccccccCchhhhhhcccccccCcccchH----hhhcchHHHHHHHHHHHhhccCceEEEEecCCCccc-- Confidence 11111 0111100 001111111100 112222 244568999999999999999999998876543221 Q ss_pred hhhHHHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccc Q lcl|NC_021537. 
73 GESYQTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTI 151 (602) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~ 151 (602) ...|+++.++ .+||++||+.+||+.++.+++++||||++++|+ +|++.+|+||+|+.|++..+..+ T Consensus 92 -----------~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~v~v~~~~~g- 158 (432) T protein:vir:97 92 -----------AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKG- 158 (432) T ss_pred -----------ccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEEcCCC- Confidence 1236666666 589999999999999999999999999999997 59999999999999987543211 Q ss_pred ccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCccccc Q lcl|NC_021537. 152 EREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVP 231 (602) Q Consensus 152 ~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~s 231 (602) ..+ |++...+|...++++++|||+|+++ .++++|+| T Consensus 159 --------------~~~-----------------------------y~~~~~~g~~~~~~~~~iih~r~~~-~dg~~G~s 194 (432) T protein:vir:97 159 --------------NTA-----------------------------YRYRRTDGQMIDIPRQQIWKIMGYS-LDGENGLS 194 (432) T ss_pred --------------cEE-----------------------------EEEEecCceEEEEccccEEEecCcC-CCCccccc Confidence 111 1222345566789999999999875 68899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccc Q lcl|NC_021537. 
232 DWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGG 311 (602) Q Consensus 232 pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~ 311 (602) |+..+...|..+.++++++.++|+||++|++||++++ .+++++++++++.| .+..|+|+++++++|++ T Consensus 195 pi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~~~~~---~~~~nag~~~vl~~g~~-------- 262 (432) T protein:vir:97 195 AIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR-FLTDDQYDSFSKKV---SGSVEAGRAPLLEGGMD-------- 262 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCC-CCCHHHHHHHHHHH---hhhhcCCCceecCCCce-------- Confidence 9999999999999999999999999999999999876 48999988877655 46678899999877654 Q ss_pred cccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCc---cCHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021537. 312 SDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNR---ANSKEQTREFAKGIIEPEQAKFSARLY 388 (602) Q Consensus 312 ~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~---sn~e~~~~~f~~~~l~P~~~~ie~~ln 388 (602) |++++ ++++|+||+|++++++++||++|||||++||+.+.+++ +|+|++.+.|+++||.||++.||++|| T Consensus 263 ------~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~f~~~tl~P~~~~ie~~ln 335 (432) T protein:vir:97 263 ------VKSLG-LNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSIA 335 (432) T ss_pred ------EEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 45554 45689999999999999999999999999998876654 789999999999999999999999999 Q ss_pred hhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccccccccccccccc Q lcl|NC_021537. 389 KIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADAS 468 (602) Q Consensus 389 ~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~ 468 (602) ++|+++.++ .+++++||.+.+++. 
|.+.+++++.+++++|+||+||+|+++||||++|++...+++.+++++....+ T Consensus 336 ~kLl~~~e~-~~~~~~fd~~~llr~--d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g~~~~~~~~~~~~pl~~~~~ 412 (432) T protein:vir:97 336 LNLLTPAER-RRYFADFDTSALLRA--DSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIGL 412 (432) T ss_pred hhccCcccc-CceEEEeechhhhcc--CHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceEeecccccchhhhcc Confidence 999998775 578999999999877 77888999999999999999999999999999876544445555554432211 Q ss_pred CCCcCcccccccccccccccccc Q lcl|NC_021537. 469 DGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) .....+....... .++...+ T Consensus 413 ~~~~~~~~~~~~~---~~~~~~~ 432 (432) T protein:vir:97 413 QASPEPASGLGNQ---QQDKVSK 432 (432) T ss_pred cCCCCCCCCCCCc---ccccccC Confidence 1111111000000 0111111 No 29 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=100.00 E-value=1.1e-80 Score=458.86 Aligned_cols=403 Identities=15% Similarity=0.152 Sum_probs=298.9 Q ss_pred CCCCcccccccchhh------hcccCccc---cCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCccc Q lcl|NC_021537. 1 MSKAEETTQLDERHI------ATDVGRGI---QPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDE 71 (602) Q Consensus 1 ~~k~~~~~~~~~~~~------~~~~~~~i---~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~ 71 (602) ..|+..........+ ....+... ...++. ..+..+++|++||++||++||++||+++.+.+++.... 
T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~----~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~~g~~~~ 92 (432) T protein:vir:81 17 FVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNA----DAIMRLDAVAACVKLVSQAIAAMPLTMYMRTPDGRKEA 92 (432) T ss_pred cccccccccccccccccCccchhhhcccccccCcccch----HhhhccHHHHHHHHHHHHhhhhCceeeEEecCCcceec Confidence 111111000000000 00011000 011222 22445688999999999999999999988765443211 Q ss_pred chhhHHHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 72 GGESYQTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) ..|+++.++ .+||++||+.+||+.++.+++++||||++++++ +|++++|+||+|+.|++..+..+ T Consensus 93 -------------~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v~~~~~g 158 (432) T protein:vir:81 93 -------------VNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDPKG 158 (432) T ss_pred -------------ccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEECCCC Confidence 135666665 589999999999999999999999999999997 59999999999999987543211 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGV 230 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~ 230 (602) .. .+.+...+|....+++++|||+|+++ .++++|+ T Consensus 159 ---------------~~-----------------------------~y~~~~~~g~~~~~~~~~iih~r~~~-~dg~~G~ 193 (432) T protein:vir:81 159 ---------------NT-----------------------------AYRYRRTDGQMIDIPKQQIWKIMGYS-LDGENGL 193 (432) T ss_pred ---------------cE-----------------------------EEEEEecCceEEEEccccEEEecCCC-CCCcccc Confidence 11 11233345667789999999999775 6789999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccc Q lcl|NC_021537. 
231 PDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDG 310 (602) Q Consensus 231 spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~ 310 (602) ||+..+..+|..+.++++++.++|+||++|+++|++++ .+++++.+++++.+ .|..|+|+++++++|++ T Consensus 194 spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~~~~~---~~~~nag~~~vl~~g~~------- 262 (432) T protein:vir:81 194 SAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR-FLTDDQYDSFAKKV---SGSVEAGRAPLLEGGMD------- 262 (432) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC-CCCHHHHHHHHHHH---hhhhcCCCceecCCCce------- Confidence 99999999999999999999999999999999999875 58999988887765 46678899999877654 Q ss_pred ccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCc---cCHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 311 GSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNR---ANSKEQTREFAKGIIEPEQAKFSARL 387 (602) Q Consensus 311 ~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~---sn~e~~~~~f~~~~l~P~~~~ie~~l 387 (602) |++++ ++++|+||+|++++++++||++|||||++||+.+.+++ +|+|++.+.|+++||.||++.||++| T Consensus 263 -------~~~l~-~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~l 334 (432) T protein:vir:81 263 -------VKSLG-LNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSI 334 (432) T ss_pred -------EEEcc-CCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 45554 45689999999999999999999999999998876654 78999999999999999999999999 Q ss_pred hhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccc Q lcl|NC_021537. 388 YKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADA 467 (602) Q Consensus 388 n~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~ 467 (602) |++|+++.+. .+++++||++.+++. |.+.+++++.+++++|+||+||+|+++|+||++|++....++.++.++.... 
T Consensus 335 ~~kLl~~~~~-~~~~~~fd~~~llr~--d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g~~~~~~~~~~~~pl~~~~ 411 (432) T protein:vir:81 335 ALNLLSPAER-RRYFADFDTSALLRA--DSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIG 411 (432) T ss_pred HhhccCcccc-CceEEEeechhhhcc--CHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceEeecCcccchhhhc Confidence 9999998775 578999999999877 7788899999999999999999999999999987654344555555443221 Q ss_pred cCCCcCcccccccccccccccccc Q lcl|NC_021537. 468 SDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) +.....+...... +.++...+ T Consensus 412 ~~~~~~~~~~~~n---~~~~~~~~ 432 (432) T protein:vir:81 412 LQASPEPASGLGN---QQQDKVSK 432 (432) T ss_pred cCCCCCCCCCCCC---cccccccC Confidence 1111111000000 00111111 No 30 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=100.00 E-value=7.3e-80 Score=454.43 Aligned_cols=401 Identities=13% Similarity=0.089 Sum_probs=303.8 Q ss_pred CCC-Ccccccccc---hhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhH Q lcl|NC_021537. 1 MSK-AEETTQLDE---RHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESY 76 (602) Q Consensus 1 ~~k-~~~~~~~~~---~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~ 76 (602) .+| +........ ..++.....+-...++. +.+..+++|++||++||++||++|++++++.+...... 
T Consensus 7 f~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~----~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~----- 77 (413) T protein:vir:48 7 FQRKSDAPVTTPAELAEAIGLSYDTYTGKRISS----QRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTLKTRV----- 77 (413) T ss_pred hccCccCCccchHHHHHhhhcCcccccCceech----hhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcceee----- Confidence 222 221111110 11111000000111222 22345789999999999999999999988755432211 Q ss_pred HHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccccccc Q lcl|NC_021537. 77 QTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERED 155 (602) Q Consensus 77 ~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~ 155 (602) ..|++..++ .+||+.||+.+||+.++.+++++||||++++|+ .|++.+|+||+|++|++..+... T Consensus 78 --------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~----- 143 (413) T protein:vir:48 78 --------VDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQW----- 143 (413) T ss_pred --------cccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCCc----- Confidence 235556566 479999999999999999999999999999987 58999999999999986543211 Q ss_pred chhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHH Q lcl|NC_021537. 156 GEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVA 235 (602) Q Consensus 156 ~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~ 235 (602) ..+ +.+...+|....++++||||+|.++ .++++|+||+.. T Consensus 144 ----------~~~-----------------------------y~~~~~~g~~~~~~~~evih~~~~~-~d~~~G~s~i~~ 183 (413) T protein:vir:48 144 ----------QPV-----------------------------YQVTFPDGSVDVLTQDEIWHVRTLT-LDGLVGLNPIAY 183 (413) T ss_pred ----------eEE-----------------------------EEEEecCceEEEEccccEEEecCcC-CCCcccccHHHH Confidence 001 1223345666789999999999886 578999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-hcccccCcceeccCCccceecccccccc Q lcl|NC_021537. 
236 AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-KGSRYRTAILEVEEFVDDHGLGDGGSDV 314 (602) Q Consensus 236 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-~g~~nag~~~~~~~g~~~~~~~~~~~~~ 314 (602) +..+|....++++++.++|+||++|++||++++ .+++++.+++++.|++. .|..|+|+++++++|+ T Consensus 184 ~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~------------ 250 (413) T protein:vir:48 184 AREAISLAAATEEHGARLFGNGAVTSGVLRTEQ-KLTPDAYERLKKDFEERHTGLGNAHRPMILEMGL------------ 250 (413) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHHHHHHHHHHhcCccccCcceecCCCc------------ Confidence 999999999999999999999999999999986 47999999999999865 5668999999987765 Q ss_pred ccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc Q lcl|NC_021537. 315 NIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQD 394 (602) Q Consensus 315 ~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~ 394 (602) +|++++ ++++|+||+|++++++++||++|||||++||..++++++|++++.+.|++.||+|+++.||++||++|+++ T Consensus 251 --~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~P~~~~ie~~l~~~L~~~ 327 (413) T protein:vir:48 251 --DWKSMA-LNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGLVRE 327 (413) T ss_pred --eEEecc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhccCc Confidence 455555 45689999999999999999999999999999989999999999999999999999999999999999988 Q ss_pred cccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCc Q lcl|NC_021537. 395 ALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEA 474 (602) Q Consensus 395 ~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~ 474 (602) .+. .+++++|+++.+++. |.+.+++++++++++|+||+||+|+++|+||+|||+ ..+++.+++........ ..+. 
T Consensus 328 ~~~-~~~~~~fd~~~l~~~--d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~ggD-~~~~~~n~~~~~~~~~~-~~~~ 402 (413) T protein:vir:48 328 SKQ-GKFYAKFNAGALLRG--DMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGD-VYLTPMNMTTSPSAGDD-NGKK 402 (413) T ss_pred ccc-CCeEEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-eeecccccccccccccc-CCCC Confidence 775 588999999999876 778888999999999999999999999999999875 34444454433221111 1111 Q ss_pred ccccccccccccccccccccc Q lcl|NC_021537. 475 MLTRSKAAPPLENKIGERDSV 495 (602) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~ 495 (602) .... ..++.++ T Consensus 403 ~~~~----------~~~~~~~ 413 (413) T protein:vir:48 403 KESG----------DADKTAS 413 (413) T ss_pred CCCC----------CccccCC Confidence 1100 1111111 No 31 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=100.00 E-value=7.9e-80 Score=454.25 Aligned_cols=403 Identities=16% Similarity=0.124 Sum_probs=303.0 Q ss_pred CCCCcccccccc---hhhhcccCcc-cc--CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEETTQLDE---RHIATDVGRG-IQ--PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~~~~~~---~~~~~~~~~~-i~--p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) .+|...++.... .....-.++. .. ..++... +-.+++|++||++||++||++||+++.+.+.+.... T Consensus 7 f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~----al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~~~~~~--- 79 (416) T protein:vir:12 7 FEKRSGSSDHEDGFNNILLNMFGGRKTASGERVSESN----SLVQPDIFACVNVLSDDIAKLPIHTYKRTDGGIERK--- 79 (416) T ss_pred cccccCccccCccchhHHHHhhcCcccccCceechhh----hhccHHHHHHHHHHHHhhhhCceEEEEecCCccccc--- Confidence 223222221111 0011101111 11 1122222 224688999999999999999999987654432211 Q ss_pred hHHHHHHhhhccchhhh-hhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccccc Q lcl|NC_021537. 
75 SYQTVRDFWYGSDSRWQ-IGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIER 153 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~-l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~ 153 (602) ..|+++. ++.+||+.||+.+||+.++.+++++||||+++.|+..|++.+|+||+|.+|++..+... T Consensus 80 ----------~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~--- 146 (416) T protein:vir:12 80 ----------PEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPTT--- 146 (416) T ss_pred ----------cccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCCC--- Confidence 1244444 45689999999999999999999999999999999999999999999999986533211 Q ss_pred ccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHH Q lcl|NC_021537. 154 EDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDW 233 (602) Q Consensus 154 ~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl 233 (602) +..| +.+..+|..+++++++|||+|+++ .++++|+||+ T Consensus 147 -----------~~~~------------------------------~~~~~~g~~~~~~~~eiih~~~~~-~~~~~G~s~i 184 (416) T protein:vir:12 147 -----------GMLW------------------------------YQTVLNGKAIELYDYEVLHFKGLS-TDGIHGKSPI 184 (416) T ss_pred -----------cEEE------------------------------EEEecCCeEEEecCccEEEecCcC-CCCcccccHH Confidence 1111 112234566789999999999876 5789999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccc Q lcl|NC_021537. 234 VAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSD 313 (602) Q Consensus 234 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~ 313 (602) .++..++....++++++.++|+||+.|++||++++ .+++++.+++++.|+... 
++++++++++|+ T Consensus 185 ~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~e~~~~~~~~~~~~~---~~~~~~vl~~g~----------- 249 (416) T protein:vir:12 185 GVVREHIGAQAAATKYNAKLYKNEATPRGILKVPA-FLDEKPKENVRKEWKRVN---KVENIAIIDYGL----------- 249 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCC-CCCHHHHHHHHHHHHHHh---cCCCeeecCCCc----------- Confidence 99999999999999999999999999999999976 589999999999998654 457778776654 Q ss_pred cccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC Q lcl|NC_021537. 314 VNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQ 393 (602) Q Consensus 314 ~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~ 393 (602) ++++++ ++++|+||+|+++++.++||++|||||.+||..++++++|++++.+.|+++||.|+++.||++||++|++ T Consensus 250 ---~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~l~~ 325 (416) T protein:vir:12 250 ---EYQSIS-MPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIEYVRNTLQPWIVNFEQELNVKLFL 325 (416) T ss_pred ---eEEEcc-CChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 455665 4568999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcC Q lcl|NC_021537. 394 DALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAE 473 (602) Q Consensus 394 ~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~ 473 (602) +.+...+++|+|+++++++. |.+.+++++.+++++|+||+||+|+++|+||+|||+ ..+++.+++......+..... 
T Consensus 326 ~~~~~~g~~i~fd~~~l~~~--d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd-~~~~~~n~~~~~~~~~~~~~~ 402 (416) T protein:vir:12 326 DHDQKSGHYVKFNIDSELRG--DSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGD-KYISSLNYVFLDFLEEYQRLK 402 (416) T ss_pred chhhcCCceEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-eeeeccccccccccchhhccc Confidence 98888899999999999776 777888999999999999999999999999998864 344555555543211111110 Q ss_pred cccccccccccccc Q lcl|NC_021537. 474 AMLTRSKAAPPLEN 487 (602) Q Consensus 474 ~~~~~~~~~~~~~~ 487 (602) ...+....++..+. T Consensus 403 ~~~~~~gge~~~~g 416 (416) T protein:vir:12 403 AGGAMKGGDNKNEG 416 (416) T ss_pred cccccCCCCCcCCC Confidence 00000011111111 No 32 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=100.00 E-value=1.7e-79 Score=452.49 Aligned_cols=432 Identities=16% Similarity=0.171 Sum_probs=307.3 Q ss_pred CCCCccccc---ccchh------hhcccCc--cccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCc Q lcl|NC_021537. 1 MSKAEETTQ---LDERH------IATDVGR--GIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEP 69 (602) Q Consensus 1 ~~k~~~~~~---~~~~~------~~~~~~~--~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~ 69 (602) +.+...+.. ...+. .....++ .-..++++.. +-.++.|++||++||++||++||+++++.+.+.+ T Consensus 7 l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~----al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~ 82 (457) T protein:vir:13 7 LFGRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPHD----ALQVSAVFASVRLLSETIATLPLSTYSKRGGSRK 82 (457) T ss_pred hhcccccccccccccccccccchHHHhhcccccCCceechHH----hhccHHHHHHHHHHHHhhccCceEEEEecCCccc Confidence 211111111 00000 0010111 0112233322 3346789999999999999999999876543322 Q ss_pred ccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccc Q lcl|NC_021537. 
70 DEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~ 149 (602) .. ..++++.++..++..||+.+||+.++.+++++||+|++|.++ .|++++|+||+|..|++..+.. T Consensus 83 ~~-------------~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~ 148 (457) T protein:vir:13 83 EI-------------VTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMV 148 (457) T ss_pred cc-------------ccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecC Confidence 11 134566677777778999999999999999999999999776 5999999999999998754322 Q ss_pred ccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecC--ceeEEechhHEEEecCCCCCCCc Q lcl|NC_021537. 150 TIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDA--GELKNGPANELIFLPNPSPLALY 227 (602) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~--~~~~~~~~~eviH~r~~~~~~~~ 227 (602) .... ...| +. +.+...+ .....|++++|||++.+++.+.+ T Consensus 149 ~~~~-----------~~~~-----------~~----------------y~~~~~~~~~~~~~~~~~diih~~~~~~~~~~ 190 (457) T protein:vir:13 149 DGLR-----------RKVF-----------EA----------------YDIDADGNEVLLGWFTPRDVLHIPGMMLPGDF 190 (457) T ss_pred CCcc-----------ceeE-----------EE----------------EEEecCCceeeEEeeCccceEEecCCCCCCcc Confidence 1000 0000 00 0000111 12356899999999999988889 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-hcccccCcceeccCCcccee Q lcl|NC_021537. 228 YGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-KGSRYRTAILEVEEFVDDHG 306 (602) Q Consensus 228 ~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-~g~~nag~~~~~~~g~~~~~ 306 (602) +|+||+..+..+|....++++++.++|+||++|++||++++ .+++++.+++++.|++. 
.|..|+|+++++++|+ T Consensus 191 ~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~ls~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~---- 265 (457) T protein:vir:13 191 VGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPG-TMSEEGLARAREAWRAANSGVDNAHRVALLTEGA---- 265 (457) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCC-CCCHHHHHHHHHHHHHHhcCccccCcceecCCCc---- Confidence 99999999999999999999999999999999999999976 58999999999999865 5678999999987765 Q ss_pred ccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCc--cCHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 307 LGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNR--ANSKEQTREFAKGIIEPEQAKFS 384 (602) Q Consensus 307 ~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~~~f~~~~l~P~~~~ie 384 (602) +|++++ ++++|+||+|++++++++||++|||||++||+.+++++ ||++++.+.|+++||.||++.|| T Consensus 266 ----------~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie 334 (457) T protein:vir:13 266 ----------KFSKVA-MSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSLRPWLERIE 334 (457) T ss_pred ----------eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHHHHHHHHHH Confidence 455565 45689999999999999999999999999999887765 88999999999999999999999 Q ss_pred HHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccc-ccccccc Q lcl|NC_021537. 385 ARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTL-SEFEAEF 463 (602) Q Consensus 385 ~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~-~~~~~~~ 463 (602) ++||++|+++.+. .+++++|+++.+++. |.+.+++++.+++++|+||+||+|+++||+|+++|.+|.+. +.++... 
T Consensus 335 ~~ln~~L~~~~~~-~~~~i~fd~~~l~~~--D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~d~~~~~~n~~~~ 411 (457) T protein:vir:13 335 AGFNRLLFAETAD-RFRFVKFNLDEIKRG--APKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEV 411 (457) T ss_pred HHHHHhhcCcccc-CceeEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeeccccccc Confidence 9999999998775 578899999999876 77888899999999999999999999999999999877664 4455555 Q ss_pred cccccCCCcCcccccccc--cccccccccccccccccccccchhhhhcch Q lcl|NC_021537. 464 GADASDGDAEAMLTRSKA--APPLENKIGERDSVDVDVSKDPIEQTTFSS 511 (602) Q Consensus 464 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~m~~~~v~s 511 (602) +........+.+.+...+ ++..+.............+..+.+ +| T Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~----~~ 457 (457) T protein:vir:13 412 GEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATEEDDED----DA 457 (457) T ss_pred cccccccccCCCCCCCCCccccCCCCCCCCCCccccCCCCcccc----cC Confidence 443332222111111111 111111111111111100000000 01 No 33 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=100.00 E-value=2e-79 Score=452.07 Aligned_cols=404 Identities=15% Similarity=0.108 Sum_probs=300.7 Q ss_pred CCC---C--cccccccchhhhcccCc---cccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccc Q lcl|NC_021537. 1 MSK---A--EETTQLDERHIATDVGR---GIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEG 72 (602) Q Consensus 1 ~~k---~--~~~~~~~~~~~~~~~~~---~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~ 72 (602) .+| + .++.......+..-.++ .....+++.. +..+++|++||++||++||++||+++.+.+++... 
T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~v~~~~----al~~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~~~-- 76 (419) T protein:vir:80 3 FSRQLLSNLGQTQPGSGGWVSALLGSARSEAGQVVTPAS----ALSLTVLQNCVTLLAESIAQLPVELYERSGDDRKP-- 76 (419) T ss_pred cccccccccCcCCCCcchhhHHhhcccccccCcccChHH----hhccHHHHHHHHHHHHhhccCceEEEEecCCCccc-- Confidence 111 1 11111111112111111 1111233332 33578999999999999999999999876544221 Q ss_pred hhhHHHHHHhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccc Q lcl|NC_021537. 73 GESYQTVRDFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTI 151 (602) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~ 151 (602) ...|+++.++. +||+.||+.+||+.++.+++++||||++++|+.+|++.+|+||+|++|++..+... T Consensus 77 -----------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~- 144 (419) T protein:vir:80 77 -----------ATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDL- 144 (419) T ss_pred -----------ccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCc- Confidence 12355666555 89999999999999999999999999999999999999999999999986533211 Q ss_pred ccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCccccc Q lcl|NC_021537. 152 EREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVP 231 (602) Q Consensus 152 ~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~s 231 (602) ..++++ .....+++++|+|++.++ .++++|+| T Consensus 145 --------------~~~y~~---------------------------------~~~~~~~~~~i~h~~~~~-~d~~~G~s 176 (419) T protein:vir:80 145 --------------KPMYRV---------------------------------AGADPLPQRLVHHVRWMS-INGYTGLS 176 (419) T ss_pred --------------eEEEEE---------------------------------cCccccchhheEEecCCC-CCCccccc Confidence 111111 011247889999999875 68899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecc---ccCCHHHHHHHHHHHHH-hhcccccCcceeccCCccceec Q lcl|NC_021537. 
232 DWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTG---GTLSEDSKEDLRNLMDN-LKGSRYRTAILEVEEFVDDHGL 307 (602) Q Consensus 232 pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~g~~~~~~ 307 (602) |+..+..+|....++++++.++|+||++|+++|++++ ...++++.+++++.|++ +.|..|+|+++++++|. T Consensus 177 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~----- 251 (419) T protein:vir:80 177 PVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGM----- 251 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCc----- Confidence 9999999999999999999999999999999999874 33578899999999976 45668999999987665 Q ss_pred cccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 308 GDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARL 387 (602) Q Consensus 308 ~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l 387 (602) +|++++ +++.|+||+|++++++++||++|||||.+||+.++++++|+|++.+.|+++||.|+++.||++| T Consensus 252 ---------~~~~l~-~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l 321 (419) T protein:vir:80 252 ---------KFKPLS-MTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAK 321 (419) T ss_pred ---------eEEecc-CChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455565 4568999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccc Q lcl|NC_021537. 388 YKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADA 467 (602) Q Consensus 388 n~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~ 467 (602) |++|+++.++ .+++++||++.+++. |.+.+++++++++++|+||+||+|+++|+||+|||+ ..+++.+++..+. 
+ T Consensus 322 ~~kll~~~~~-~~~~i~fd~~~l~~~--d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gGD-~~~~~~n~~~~~~-~ 396 (419) T protein:vir:80 322 TRDLLLPSER-KQYFIEYNLAGLLRG--DQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGD-IYLSPMNMVDASK-P 396 (419) T ss_pred hhhccCcccc-CCeEEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-eeeeccccccccc-c Confidence 9999988765 578999999999876 778888999999999999999999999999999864 3344445443321 1 Q ss_pred cCCCcCcccccccccccccccccccccc Q lcl|NC_021537. 468 SDGDAEAMLTRSKAAPPLENKIGERDSV 495 (602) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (602) ...+.++.. +.. + .....++..+ T Consensus 397 ~~~~~~~~~-~~~--~--~~~~~~~~l~ 419 (419) T protein:vir:80 397 QPIPMGKTE-PTK--A--ALDEIGRILS 419 (419) T ss_pred ccccCCCCC-chh--h--hHHHHHhhcC Confidence 111100000 000 0 0000011111 No 34 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=100.00 E-value=7.3e-79 Score=448.97 Aligned_cols=422 Identities=11% Similarity=0.121 Sum_probs=312.5 Q ss_pred CCCCcccccccchhhhcccCccccC-CCCHH-HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhh--- Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQP-PYNPE-TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGES--- 75 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p-~~~~~-~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~--- 75 (602) ++|++...-.+...+....|..+++ +.+.. .....+..+++|++||++||++||++||+++.+...+........ T Consensus 9 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g~~~~~~~~~~~ 88 (460) T protein:vir:10 9 LRELTGLDNKFNDAFIKYIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKDTKAYQQLNNLNIS 88 (460) T ss_pred HhhhhccCCCchHHHHHhhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccCCccchhhhhhhhh Confidence 4444433333333332222333332 34433 345567788999999999999999999999987665432211100 Q ss_pred --------HHH----HHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC----CCceEEEEEeCc Q lcl|NC_021537. 
76 --------YQT----VRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG----DGTPVGLAHVPA 139 (602) Q Consensus 76 --------~~~----~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~----~G~~~~L~~l~p 139 (602) ... +.......+....|+.+||++||+.+||+.++.+++++||||++++|+. .|.+.+|+||+| T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~~G~~~~L~~l~~ 168 (460) T protein:vir:10 89 TKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGINAGVPSQMYVLPA 168 (460) T ss_pred hhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCccCceeEEEEEEcC Confidence 000 0111112234456888999999999999999999999999999999964 478999999999 Q ss_pred ccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEec Q lcl|NC_021537. 140 ATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLP 219 (602) Q Consensus 140 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r 219 (602) ++|++..+..+.... + .....++.+..++....++++|||||| T Consensus 169 ~~v~v~~~~~~~~~~---------------~----------------------~~~~~~~~~~~~g~~~~~~~~evih~r 211 (460) T protein:vir:10 169 HLIKIVLKDDINLLS---------------T----------------------DSPIKSYMLIQGDQFIEFNEDEVIHTK 211 (460) T ss_pred ceEEEEEcCCCceee---------------e----------------------eeeeeEEEEecCceeEEecccceEEEe Confidence 999976543321000 0 001112333456777899999999999 Q ss_pred CCCCC-----CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-hcccccC Q lcl|NC_021537. 220 NPSPL-----ALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-KGSRYRT 293 (602) Q Consensus 220 ~~~~~-----~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-~g~~nag 293 (602) .+++. ++++|+||+..+..+|....++++++.++|+||+.|+++++.+ ..+++++.+++++.|++. 
.|..|+| T Consensus 212 ~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~-~~l~~e~~~~~~~~~~~~~~g~~n~g 290 (460) T protein:vir:10 212 YANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGS-TGLTQPQADSLKQRLTEMDKSPDRLS 290 (460) T ss_pred cCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecC-CCCCHHHHHHHHHHHHHHhcCccccC Confidence 87765 4689999999999999999999999999999999999988765 568999999999999875 5668999 Q ss_pred cceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccCHHHHHHHH Q lcl|NC_021537. 294 AILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRANSKEQTREF 371 (602) Q Consensus 294 ~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~f 371 (602) +++++++|. +|++++ ++++|+||+|++++++++||++|||||++||+.+. +++||+|++.+.| T Consensus 291 ~~~vl~~g~--------------~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f 355 (460) T protein:vir:10 291 QIAGASGEI--------------AFTKIS-LNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRV 355 (460) T ss_pred CceecCCCc--------------eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHH Confidence 999887665 455554 45679999999999999999999999999998754 4699999999999 Q ss_pred HHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCc Q lcl|NC_021537. 372 AKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDR 451 (602) Q Consensus 372 ~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~ 451 (602) +++||.|+++.||++||++|+++.+...+++++|+++.+...+.|.+.++ .++++|+||+||+|+++|+||++++. 
T Consensus 356 ~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~d~~~~~----~~~~~g~~T~NE~R~~~g~~pi~~~~ 431 (460) T protein:vir:10 356 VTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQTDMVAMA----SWLNTIPVTPNEIRIAMKYETLNQDG 431 (460) T ss_pred HHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHHHHHHHH----HHHhCCCCCHHHHHHHhCCCCCCCCC Confidence 99999999999999999999999888889999999998866666666544 46789999999999999999998766 Q ss_pred cccccc-cccccccccccCCCcCccccccccccccccc Q lcl|NC_021537. 452 GDMTLS-EFEAEFGADASDGDAEAMLTRSKAAPPLENK 488 (602) Q Consensus 452 ~d~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (602) +|.+.. .+++++.. ......++... +++ T Consensus 432 gD~~~~~~n~~~~~~-~~~~~~~~~~n--------q~~ 460 (460) T protein:vir:10 432 MDIVFMPSNKVRIDD-VSNNLIDSAFN--------QNQ 460 (460) T ss_pred CCeeeecccccchhh-cccccCCCccc--------CCC Confidence 676544 45544331 11111111110 011 No 35 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=100.00 E-value=1.2e-78 Score=447.69 Aligned_cols=396 Identities=15% Similarity=0.125 Sum_probs=297.5 Q ss_pred CCCCcccccccc----hhhhcccCccccC------CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcc Q lcl|NC_021537. 1 MSKAEETTQLDE----RHIATDVGRGIQP------PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPD 70 (602) Q Consensus 1 ~~k~~~~~~~~~----~~~~~~~~~~i~p------~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~ 70 (602) |+|..--+.+-. .......++...| .+.. .-...+..+++|++||++||++||++||+++++.+.. 
T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~-v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~~~~--- 76 (409) T protein:vir:96 1 MAKENIVTRIKKKLIDNWIDQSASKLYDFSPWKNKSFWG-VINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVV--- 76 (409) T ss_pred CccccchhhhhhHHhhhhhccccccccccccccCccccc-cchhhHhhhHHHHHHHHHHHHhhhhCceEEeeccccc--- Confidence 655544222211 1111111111111 1111 1123455689999999999999999999998643211 Q ss_pred cchhhHHHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccc Q lcl|NC_021537. 71 EGGESYQTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~ 149 (602) .|+++.++ .+||+.||+.+||+.++.+++++||||++++|+..|++++|+||+|+.|++..+.. T Consensus 77 ---------------~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~ 141 (409) T protein:vir:96 77 ---------------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 141 (409) T ss_pred ---------------chhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCC Confidence 13444445 57999999999999999999999999999999999999999999999998754321 Q ss_pred ccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCccc Q lcl|NC_021537. 150 TIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYG 229 (602) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G 229 (602) . +..+ +.+....|....++++||||+|.+++.++++| T Consensus 142 ~--------------~~~~-----------------------------y~~~~~~g~~~~~~~~evih~r~~~~~~~~~G 178 (409) T protein:vir:96 142 S--------------RELY-----------------------------YSIHAATGNKLIVHNMDMLHFKHIVASNMVQG 178 (409) T ss_pred C--------------cEEE-----------------------------EEEEcCCceEEEEccccEEEeCCCCCCCcccc Confidence 1 0001 12233445667899999999999888999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccc Q lcl|NC_021537. 
230 VPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGD 309 (602) Q Consensus 230 ~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~ 309 (602) +||+..+..++....+++++. ++.++..++++++ .+..+++++.+++++.|++.. +|+++++++++|+ T Consensus 179 ~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~-~~~~l~~e~~~~~~~~~~~~~--~n~g~~~vl~~g~------- 246 (409) T protein:vir:96 179 ISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLK-YGSNVSTEKRQQVLEDFKQYY--EENGGILFQEPGV------- 246 (409) T ss_pred ccHHHHHHHHHHHHHHHHHHH--HHhcCCCceeEEe-cCCCCCHHHHHHHHHHHHHHh--hcCCCeeecCCCc------- Confidence 999999999999999988874 4444444444554 456799999999999998765 3677888877665 Q ss_pred cccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021537. 310 GGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYK 389 (602) Q Consensus 310 ~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~ 389 (602) ++++++ ++++|+||+|++++++++||++|||||++||..++++++|+|++.+.|+++||+|+++.||++||+ T Consensus 247 -------~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~ 318 (409) T protein:vir:96 247 -------EIEPLP-KKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNR 318 (409) T ss_pred -------eEEEcC-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 455665 456899999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccC Q lcl|NC_021537. 390 IIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASD 469 (602) Q Consensus 390 ~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~ 469 (602) +|+++.+...+++|+||.+++++. |.+.+++++++++++|++|+||+|+++|+||+|||+ ..+++.+++++...... 
T Consensus 319 ~Ll~~~~~~~g~~i~fd~~~ll~~--d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD-~~~~~~n~~~~~~~~~~ 395 (409) T protein:vir:96 319 KLLTKTDREKNRYFKFNVKSYLRA--DSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD-KPLISGDLYPIDTPLEL 395 (409) T ss_pred hcCCcccccCcceEEeechhhhcc--CHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcc-eeeecccccccccchhh Confidence 999998888899999999999876 778889999999999999999999999999998763 33455566555322111 Q ss_pred C--CcCcccccccc Q lcl|NC_021537. 470 G--DAEAMLTRSKA 481 (602) Q Consensus 470 ~--~~~~~~~~~~~ 481 (602) . ..++..+.... T Consensus 396 ~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 396 RKSLKGGDKNVNES 409 (409) T ss_pred cccccCCCCCcCCC Confidence 1 11111000000 No 36 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=100.00 E-value=1.5e-78 Score=447.20 Aligned_cols=397 Identities=15% Similarity=0.121 Sum_probs=297.2 Q ss_pred CCCCccccccc----chhhhcccCccccCC----CCHH-HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCccc Q lcl|NC_021537. 1 MSKAEETTQLD----ERHIATDVGRGIQPP----YNPE-TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDE 71 (602) Q Consensus 1 ~~k~~~~~~~~----~~~~~~~~~~~i~p~----~~~~-~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~ 71 (602) |+|.--.+..- ...+.....+...|. .+.. .-...+..+++|++||++||++||++||+++++.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~~~~---- 76 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVV---- 76 (409) T ss_pred CCccchhhhhhhhhhhhhhccccccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeeccccc---- Confidence 44433222110 011111111111110 0110 1123355678999999999999999999998654221 Q ss_pred chhhHHHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 
72 GGESYQTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) .|++..++ .+||+.||+.+||+.++.+++++||+|++++|+..|++.+|+||+|++|++..+... T Consensus 77 --------------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~ 142 (409) T protein:vir:93 77 --------------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS 142 (409) T ss_pred --------------cchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCC Confidence 13334444 579999999999999999999999999999999999999999999999986543211 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGV 230 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~ 230 (602) +..+ |.+...+|....++++||||+|++++.++++|+ T Consensus 143 --------------~~~~-----------------------------y~~~~~~g~~~~~~~~eVih~r~~~~~~~~~G~ 179 (409) T protein:vir:93 143 --------------RELY-----------------------------YSIHAATGNKLIVHNMDMLHFKHIVASNMVQGI 179 (409) T ss_pred --------------cEEE-----------------------------EEEEcCCceEEEEccccEEEeCCCCCCCccccc Confidence 0001 122334456678999999999998889999999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccc Q lcl|NC_021537. 231 PDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDG 310 (602) Q Consensus 231 spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~ 310 (602) ||+.++..++....++++++ ++.++..++++++. 
+..+++++.+++++.|++..+ |+++++++++|+ T Consensus 180 s~i~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~-~~~l~~e~~~~~~~~~~~~~~--~~g~~~vl~~g~-------- 246 (409) T protein:vir:93 180 SPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKY-GSNVGKEKRQQVLEDFKQYYE--ENGGILFQEPGV-------- 246 (409) T ss_pred cHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEec-CCCCCHHHHHHHHHHHHHHhh--cCCCeeecCCCc-------- Confidence 99999999999999998874 55555555566654 567899999999999987553 567788776654 Q ss_pred ccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021537. 311 GSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKI 390 (602) Q Consensus 311 ~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~ 390 (602) ++++++ ++++|+||+|++++++++||++|||||++||..++++++|+|++.+.|++.||+|+++.||++||++ T Consensus 247 ------~~~~l~-~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~ 319 (409) T protein:vir:93 247 ------EIEPLP-KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRK 319 (409) T ss_pred ------eEEEcC-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 566665 4568999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCC Q lcl|NC_021537. 391 IHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDG 470 (602) Q Consensus 391 Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~ 470 (602) |+++.+...+++|+||.+++++. |.+.+++++++++++|++|+||+|+++|+||+|||+ ..+++.+++++....... 
T Consensus 320 Ll~~~~~~~~~~~~fd~~~ll~~--d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD-~~~~~~n~~~~~~~~~~~ 396 (409) T protein:vir:93 320 LLTKTDREKNRYFKFNVKSYLRA--DSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD-KPLISGDLYPIDTPLELR 396 (409) T ss_pred cCCcccccCcceEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-eeeecccccccccchhhc Confidence 99998887889999999999877 778888999999999999999999999999998864 345566666554322111 Q ss_pred C--cCcccccccccccccc Q lcl|NC_021537. 471 D--AEAMLTRSKAAPPLEN 487 (602) Q Consensus 471 ~--~~~~~~~~~~~~~~~~ 487 (602) . .++...... . T Consensus 397 ~~~~gG~~n~~e------~ 409 (409) T protein:vir:93 397 KSLKGGDKNVNE------S 409 (409) T ss_pred ccccCCCCCcCC------C Confidence 1 111000000 0 No 37 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=100.00 E-value=5.8e-78 Score=444.03 Aligned_cols=458 Identities=17% Similarity=0.144 Sum_probs=297.8 Q ss_pred CCCCccccc-------ccchhhhcccC--ccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhcc-----------CceEE Q lcl|NC_021537. 1 MSKAEETTQ-------LDERHIATDVG--RGIQPPYNPETLAAFQELNETHQACIRKKSRYEAG-----------YGFEI 60 (602) Q Consensus 1 ~~k~~~~~~-------~~~~~~~~~~~--~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~-----------~~~~i 60 (602) ++|+...+. .....+...+. ..+.|+.++..+.+.+..||+|++||++|+++||+ ++|.+ T Consensus 43 ~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i 122 (551) T protein:vir:80 43 ISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEV 122 (551) T ss_pred HHHhhccCcceeecccccceecCcccccCccccChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceE Confidence 334443221 11122322222 13445566666666666689999999999999997 56776 Q ss_pred EEecCCC-CcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCc Q lcl|NC_021537. 
61 VAHPSAD-EPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPA 139 (602) Q Consensus 61 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p 139 (602) ..++... ...+.....+.+..+++++++. .++..+|+.+|+++++.|++++||+|++++|+..|+|++|+||+| T Consensus 123 ~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~-----~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p 197 (551) T protein:vir:80 123 RLKDLDKKPTSHDEATIKRIESFIEKTGVD-----NDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDP 197 (551) T ss_pred EecccCcccChhHHHHHHHHHHHHHhcCCC-----CCCccchHHHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCC Confidence 6554332 2233344445556666665542 223346999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEec Q lcl|NC_021537. 140 ATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLP 219 (602) Q Consensus 140 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r 219 (602) .+|++..+..+...+ ....|+|+ ..++....|+++||||++ T Consensus 198 ~~V~v~~~~~g~~~~---------~~~~y~~~------------------------------~~g~~~~~~~~~eiiH~~ 238 (551) T protein:vir:80 198 TTIFFATTADGKIPD---------NGNRFVQV------------------------------IDQKIVATFNAREMAFAV 238 (551) T ss_pred ceeEEEECCcccccc---------CceEEEEE------------------------------eCCcEEEEEcccceEEec Confidence 999986544321110 11123332 223445679999999999 Q ss_pred CCC---CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecc-ccCCHHHHHHHHHHHHH-hhcccccCc Q lcl|NC_021537. 
220 NPS---PLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTG-GTLSEDSKEDLRNLMDN-LKGSRYRTA 294 (602) Q Consensus 220 ~~~---~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~~~~~~~~~~l~~~~~~-~~g~~nag~ 294 (602) .++ +.++.||+||+.++..+|..+.++++++.++|+||++|+|||++++ ..+++++.+++++.|++ +.|..|+|+ T Consensus 239 ~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~ 318 (551) T protein:vir:80 239 RNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQ 318 (551) T ss_pred ccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCc Confidence 754 3446799999999999999999999999999999999999999875 45899999999999976 567789999 Q ss_pred ceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc----------CCccCH Q lcl|NC_021537. 295 ILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST----------SNRANS 364 (602) Q Consensus 295 ~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~----------~~~sn~ 364 (602) ++++.+ .+++|++++ ++++|+||+|++++++++||++|||||++||+.+. .|+||+ T Consensus 319 ~~vl~~-------------~g~~~~~l~-~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~ 384 (551) T protein:vir:80 319 IPVVSA-------------EDVKFVNMT-PSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNS 384 (551) T ss_pred cccccC-------------CCceEEEcc-CChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhH Confidence 866532 135677776 56789999999999999999999999999997544 378999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCC Q lcl|NC_021537. 365 KEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDL 444 (602) Q Consensus 365 e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl 444 (602) +++...|+++||+||+++||++||++|++..+ ..++|+|+..+.. 
+...++++ .+++.+|+||+||+|+++|| T Consensus 385 e~~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~--~~~~f~f~~~~~~----~~~~~~~~-~~~~~~g~lT~NE~R~~~gl 457 (551) T protein:vir:80 385 AEKNQASKNKGLQPLLGFIEDFINKHIVAEFG--DKYTFQFVGGDIK----SELESVKI-LAEKAKVAMTVNEVRKELNL 457 (551) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhccccC--CceEEEeeccChh----hHHHHHHH-HHHHhcCCcCHHHHHHHhCC Confidence 99999999999999999999999999998754 3566777644432 22333343 35777899999999999999 Q ss_pred CC-CCCCcccccccccc-ccccccccCCCcCc-------------ccccccccccccccccccccccccccccchhhhhc Q lcl|NC_021537. 445 AP-FEDDRGDMTLSEFE-AEFGADASDGDAEA-------------MLTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTF 509 (602) Q Consensus 445 ~p-~~~g~~d~~~~~~~-~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v 509 (602) +| +++| |.++.+.. ...+...+....+. .......+++.+....+... ........... T Consensus 458 ~P~~egG--D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~-~~~~~~~~~~~--- 531 (551) T protein:vir:80 458 PGDVIGG--DIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTG-DIGKDGQRKDK--- 531 (551) T ss_pred CCCCCCC--ceeecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccccCC-CccccccccCc--- Confidence 98 6766 44443332 22221111110000 00000000000000000000 00000000000 Q ss_pred chhhhhhheecccccEEEEEEecccCCcce Q lcl|NC_021537. 510 SSSNLDEGLYDFGERELYLSFKRESGQNSL 539 (602) Q Consensus 510 ~ss~~~~~~yd~~~~~l~~~f~~~~~~~~~ 539 (602) +.++-..-|+. =.|.++.. . 
T Consensus 532 ~~~~~~~~~~~-------~~~~~~~~---~ 551 (551) T protein:vir:80 532 DNANAGKQGMK-------GDKPNDWQ---T 551 (551) T ss_pred cccchhhhhcC-------CCCccccC---C Confidence 00111111111 01211100 0 No 38 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=100.00 E-value=9e-78 Score=442.97 Aligned_cols=496 Identities=21% Similarity=0.312 Sum_probs=335.9 Q ss_pred CC-CCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHH Q lcl|NC_021537. 1 MS-KAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTV 79 (602) Q Consensus 1 ~~-k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~ 79 (602) ++ ++..+.++.. ...+++++||+|+..|+++++.|++|++||++||++||++||++..... +. T Consensus 17 i~~~~~~s~~~~~----~~~~~~~~pp~~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~~~~~~~-----------~~- 80 (542) T protein:vir:41 17 IKREEVESQALGE----TRFEEYVEPKVNPLVLLSLLQVNPYHASACSIKANDIIRTGYILEGDDE-----------GV- 80 (542) T ss_pred hhhcccccccccc----ccCCccccCCCCHHHHHHHHhhcHHHHHHHHHHHHHHhhCceeeecccc-----------hh- Confidence 22 2222222222 2335789999999999999999999999999999999999999853211 00 Q ss_pred HHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhh Q lcl|NC_021537. 80 RDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEV 159 (602) Q Consensus 80 ~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~ 159 (602) +.. ..||+.+|+.+|++.++.+++++||||++++|+..|++.+|+||||++|++..+.. 
T Consensus 81 ---------l~~--~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~---------- 139 (542) T protein:vir:41 81 ---------VDE--FIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGS---------- 139 (542) T ss_pred ---------hhh--hcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCC---------- Confidence 111 13788999999999999999999999999999999999999999999999866533 Q ss_pred hhcccCceeEEEEcCCc-ceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHH Q lcl|NC_021537. 160 ENIESGHGYVQVRQGRR-RYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQ 238 (602) Q Consensus 160 ~~~~~~~~~~qi~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~ 238 (602) .|.+...+.. .++..++..+ .++..+ +.....++++||||+|.+++.+++||+||+..+.. T Consensus 140 -------~~~~~~~~~~~~~~~~y~~~~-----~~~~~~------g~~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~~ 201 (542) T protein:vir:41 140 -------RYRQTWDGVNITHFKDYRYEG-----EINPET------GEDQDSVGANELVFIHIPSPVCSYYGVPRYVSAAP 201 (542) T ss_pred -------eeEeeecCCcceeEEeecccc-----cccccc------cccccccCcccEEEecCCCCCCCcccccHHHHHHH Confidence 2333333322 1222222111 111111 22345789999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCceEEEeccc---------cCCHHHHHHHHHHHHH-hhcc-cccCcceeccCCccceec Q lcl|NC_021537. 239 TMGADQAAKEWNHDVFDNLGIPHYAVKVTGG---------TLSEDSKEDLRNLMDN-LKGS-RYRTAILEVEEFVDDHGL 307 (602) Q Consensus 239 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~---------~~~~~~~~~l~~~~~~-~~g~-~nag~~~~~~~g~~~~~~ 307 (602) ++....++++++.++|+||++|++||++++. .+++++.+++++.|++ +.|. .|+|++++++.. 
T Consensus 202 ~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~------ 275 (542) T protein:vir:41 202 AILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIP------ 275 (542) T ss_pred HHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeecc------ Confidence 9999999999999999999999999998753 4678999999999976 4554 688888887521 Q ss_pred cccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccC--CccCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 308 GDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTS--NRANSKEQTREFAKGIIEPEQAKFSA 385 (602) Q Consensus 308 ~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~~sn~e~~~~~f~~~~l~P~~~~ie~ 385 (602) ++.+.+++|+|++. +++|+||++++++++++||++|||||.+||+.+.+ +++|+|++++.|+++||+|++++|++ T Consensus 276 --~~~~~g~~~~pl~~-~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~ 352 (542) T protein:vir:41 276 --GGDTVKVTFTPLNT-SQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISS 352 (542) T ss_pred --CCcccceeEEEcCC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 23456789999975 57899999999999999999999999999998665 55899999999999999999999999 Q ss_pred HHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHh-CCCCCCCCcccccccccccccc Q lcl|NC_021537. 386 RLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREEL-DLAPFEDDRGDMTLSEFEAEFG 464 (602) Q Consensus 386 ~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~-Gl~p~~~g~~d~~~~~~~~~~~ 464 (602) +||++|++..+ .+++++|+..++++. |.. ..++.++++|++|+||+|+.+ |++|.+ +.++.+.... . 
T Consensus 353 ~ln~~L~~~~~--~~~~~~f~~~~ll~~--d~~---~~~~~~v~~GilT~NE~Re~L~g~~pgd----d~~l~p~~~~-~ 420 (542) T protein:vir:41 353 ILTDFFQVKFN--PKTRFKFNDETLLES--DSV---RNCALLVQSGVLTPAEARERLFGLDGGP----DIFMVPSKGA-A 420 (542) T ss_pred HHHhhcccccC--CceEEEecchhhcch--HHH---HHHHHHHhCCCCCHHHHHHhhCCCCCCC----cccccccccc-c Confidence 99999988765 368899999988765 322 346779999999999999853 666543 2232222221 1 Q ss_pred ccccCCCcCcccccccc----cccccccccccccccccccccchhhhhcchhhhhhheecccccEEEEEEecccCC-cce Q lcl|NC_021537. 465 ADASDGDAEAMLTRSKA----APPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFGERELYLSFKRESGQ-NSL 539 (602) Q Consensus 465 ~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~~~l~~~f~~~~~~-~~~ 539 (602) .....++.+.+..+... ........++......+.....-. ..-......+-.|..+.+.|-|- .+.|| .++ T Consensus 421 ~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~ 497 (542) T protein:vir:41 421 KSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKKKK-IDESLAEFRAEAYEAGKKMLIIG--GDMGSMSAL 497 (542) T ss_pred cccccCCcCCCCCchhhhhhcccccCccccccccccccchhhccc-ccchhhhhHHhHHhcCceEEEee--cCchhhhhh Confidence 11221111111111000 001111111111111111111000 00011344455577778888772 22233 223 Q ss_pred eeeccC------CHHHHHHHhCCCc---cchhhhhhhcccccccccccchhcc Q lcl|NC_021537. 540 YVYVDV------PAAVWSALVSAPS---AGSYHYSEIRLQYGYLEVTNNHERL 583 (602) Q Consensus 540 y~y~~v------~~~~~~~~~~a~s---~g~~~~~~i~~~~~~~~~~~~~~~~ 583 (602) -+-..| -.+-|++|+.|.- +||- | +|-|.-|+=.. 
+ T Consensus 498 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~-~~~~~~~~~~~--~ 542 (542) T protein:vir:41 498 NQGVSVIPSKPLNLERYEELLEASVEDMIGRI-----R-HYLYKVIGWRE--L 542 (542) T ss_pred hccceeccCCCcChHHHHHHHHhhHHHHHHHH-----H-HHHHHHhhhcc--C Confidence 333333 2467999999853 2331 1 23333332111 0 No 39 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=100.00 E-value=2.3e-78 Score=446.21 Aligned_cols=397 Identities=15% Similarity=0.122 Sum_probs=296.4 Q ss_pred CCCCccccc----ccchhhhcccCccccC-C---CCHH-HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCccc Q lcl|NC_021537. 1 MSKAEETTQ----LDERHIATDVGRGIQP-P---YNPE-TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDE 71 (602) Q Consensus 1 ~~k~~~~~~----~~~~~~~~~~~~~i~p-~---~~~~-~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~ 71 (602) +.|+...+. +..+.+.....+...+ + .+.. .-...+..+++|++||++||++||++||+++++.+.. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~---- 79 (412) T protein:vir:26 4 IAKENIVTRIKKKLIDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVV---- 79 (412) T ss_pred chhhhhhhhhhhhHhhhhhcccccccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeeccccc---- Confidence 222111111 1111111111111111 0 0111 1134455689999999999999999999998653221 Q ss_pred chhhHHHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 72 GGESYQTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) .|+.+.++ .+||+.||+.+||+.++.+++++||+|++++|+..|++.+|+||+|++|++..+... 
T Consensus 80 --------------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~ 145 (412) T protein:vir:26 80 --------------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS 145 (412) T ss_pred --------------cchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCC Confidence 13344444 579999999999999999999999999999999999999999999999987543221 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGV 230 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~ 230 (602) . ..+ |.+...+|....++++||||||++++.++++|+ T Consensus 146 ~--------------~~~-----------------------------y~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~ 182 (412) T protein:vir:26 146 R--------------ELY-----------------------------YSIHAATGNKLIVHNMDMLHFKHIVASNMVQGI 182 (412) T ss_pred c--------------EEE-----------------------------EEEEcCCceEEEEccccEEEeCCCCCCCCcccc Confidence 0 001 122233456678999999999998889999999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccc Q lcl|NC_021537. 231 PDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDG 310 (602) Q Consensus 231 spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~ 310 (602) ||+..+..++..+.++++++ ++.++..++++++. +..+++++.+++++.|++..+ |+|+++++++|+ T Consensus 183 s~i~~~~~~i~~~~a~~~~~--~~~~~~~~~~i~~~-~~~l~~e~~~~~~~~~~~~~~--~~g~~~vl~~g~-------- 249 (412) T protein:vir:26 183 SPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKY-GSNVGKEKRQQVLEDFKQYYE--ENGGILFQEPGV-------- 249 (412) T ss_pred cHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEec-CCCCCHHHHHHHHHHHHHHhh--cCCCeeecCCCc-------- Confidence 99999999999999998884 45555556666665 456899999999999987654 567788876654 Q ss_pred ccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021537. 
311 GSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKI 390 (602) Q Consensus 311 ~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~ 390 (602) ++++++ ++++|+||+|++++++++||++|||||.+||..++++++|+|++.+.|+++||+|+++.||++||++ T Consensus 250 ------~~~~l~-~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~k 322 (412) T protein:vir:26 250 ------EIEPLP-KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRK 322 (412) T ss_pred ------eEEEcC-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 566665 4568999999999999999999999999999988899999999999999999999999999999999 Q ss_pred cCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCC Q lcl|NC_021537. 391 IHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDG 470 (602) Q Consensus 391 Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~ 470 (602) |++..+...+++|+||.+++++. |.+.+++++++++++|++|+||+|+++|+||+|||+ ..+++.++.++....... T Consensus 323 Ll~~~~~~~~~~~~fd~~~l~~~--d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD-~~~~~~n~~~~~~~~~~~ 399 (412) T protein:vir:26 323 LLTKTDREKNRYFKFNVKSYLRA--DSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD-KPLISGDLYPIDTPLELR 399 (412) T ss_pred cCCcccccCcceEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-eeeecccccccccchhhc Confidence 99998888889999999999876 778888999999999999999999999999999864 334556665553221111 Q ss_pred C--cCcccccccc Q lcl|NC_021537. 471 D--AEAMLTRSKA 481 (602) Q Consensus 471 ~--~~~~~~~~~~ 481 (602) . .++....... 
T Consensus 400 ~~~~gG~~n~~e~ 412 (412) T protein:vir:26 400 KSLKGGDKNVNES 412 (412) T ss_pred ccccCCCCCcCCC Confidence 1 1111000000 No 40 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=100.00 E-value=6.2e-78 Score=443.85 Aligned_cols=454 Identities=17% Similarity=0.158 Sum_probs=301.8 Q ss_pred CCCCcc-------cccccchhhhcccC--ccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhcc-----------CceEE Q lcl|NC_021537. 1 MSKAEE-------TTQLDERHIATDVG--RGIQPPYNPETLAAFQELNETHQACIRKKSRYEAG-----------YGFEI 60 (602) Q Consensus 1 ~~k~~~-------~~~~~~~~~~~~~~--~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~-----------~~~~i 60 (602) ++|+.+ +..+....|+..++ ..+.|++++..+.+.+..||+|++||++++++||+ ++|.+ T Consensus 39 ~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~i 118 (547) T protein:vir:63 39 ISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEV 118 (547) T ss_pred HHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCcee Confidence 444433 22333344443333 23455666766666667789999999999999996 35666 Q ss_pred EEecCCC-CcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCc Q lcl|NC_021537. 61 VAHPSAD-EPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPA 139 (602) Q Consensus 61 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p 139 (602) ..+.... ...+.....+.+..+++++++. 
.++..+|+.+|+++++.+++++||+|++++|+.+|++++|+|||| T Consensus 119 r~k~~~~~~~~~~~~~~~~l~~~l~~pn~~-----~~p~~~s~~~f~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p 193 (547) T protein:vir:63 119 RLKDLDKKPTSHDEATIKRIESFIEKTGVD-----NDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDP 193 (547) T ss_pred EecccccccChhhHHHHHHHHHHHHhhCCC-----CCCccchHHHHHHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecC Confidence 5554322 2233344455666677666542 233457999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEec Q lcl|NC_021537. 140 ATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLP 219 (602) Q Consensus 140 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r 219 (602) .+|++..+..+... ..+..|+|+. .++....++++||||+| T Consensus 194 ~~V~~~~~~~g~~~---------~~~~~y~~~~------------------------------~~~~~~~~~~~eiih~r 234 (547) T protein:vir:63 194 TTIFFATTADGKIP---------DNGNRFVQVI------------------------------DQKIVATFNAREMAFAV 234 (547) T ss_pred ceeEEEECCccccc---------cCceEEEEEc------------------------------CCcEEEEeccccEEEec Confidence 99998654432111 1112233322 23445678999999999 Q ss_pred CCCCC---CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc-cCCHHHHHHHHHHHHH-hhcccccCc Q lcl|NC_021537. 220 NPSPL---ALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG-TLSEDSKEDLRNLMDN-LKGSRYRTA 294 (602) Q Consensus 220 ~~~~~---~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~~~~~~l~~~~~~-~~g~~nag~ 294 (602) .++.. .+.||+||+..+..+|..+.++++++.++|+||++|+|||++++. .+++++.+++++.|+. 
+.|..|+|+ T Consensus 235 ~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk 314 (547) T protein:vir:63 235 RNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQ 314 (547) T ss_pred ccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCcccccc Confidence 76543 367899999999999999999999999999999999999998753 5899999999999976 567789999 Q ss_pred ceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc----------CCccCH Q lcl|NC_021537. 295 ILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST----------SNRANS 364 (602) Q Consensus 295 ~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~----------~~~sn~ 364 (602) ++++.+ .+++|++++ ++++|+||+|++++++++||++|||||++||+.+. .|+||+ T Consensus 315 ~~vl~~-------------~g~~~~~l~-~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~ 380 (547) T protein:vir:63 315 IPVVSA-------------EDVKFVNMT-PSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNS 380 (547) T ss_pred cccccC-------------CCceEEEcC-CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhH Confidence 866532 235677776 56789999999999999999999999999997544 378999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCC Q lcl|NC_021537. 365 KEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDL 444 (602) Q Consensus 365 e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl 444 (602) +++.+.|+++||+||+++||++||.+|++..+ ..++|+|+..+..+. 
..+++ +.+++.+|+||+||+|+++|| T Consensus 381 e~~~~~~~~~tL~P~~~~ie~~ln~~L~~~~~--~~~~~~f~~~~~~~~----~~~~~-~~~~~~~g~lT~NE~R~~~gl 453 (547) T protein:vir:63 381 AEKNQASKNKGLQPLLGFIEDFINKHIVAEFG--DKYTFQFVGGDIKSE----LESVK-ILAEKAKVAMTVNEVRKELNL 453 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcccccC--CceEEEeeccccccH----HHHHH-HHHHHhCCCcCHHHHHHHhCC Confidence 99999999999999999999999999997654 357777776554332 22333 345778999999999999999 Q ss_pred CC-CCCCcccccccccccc-ccccccCCCcC--cc---------------cccccccccccccccccccccccccccchh Q lcl|NC_021537. 445 AP-FEDDRGDMTLSEFEAE-FGADASDGDAE--AM---------------LTRSKAAPPLENKIGERDSVDVDVSKDPIE 505 (602) Q Consensus 445 ~p-~~~g~~d~~~~~~~~~-~~~~~~~~~~~--~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~ 505 (602) +| ++|| |.++.+.... .+...+....+ .+ .++...+|.......+.........+ T Consensus 454 ~P~~egG--D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~---- 527 (547) T protein:vir:63 454 PGDVIGG--DIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDK---- 527 (547) T ss_pred CCCCCCC--ceeecccccccccccccccCCccccchhhccccccccCCCCCCCCCCCCCCcccCCCcCccccccCc---- Confidence 98 5766 4444333322 22111110000 00 00000000000000000000000000 Q ss_pred hhhcchhhhhhheecccccEEEEEEecccCCcce Q lcl|NC_021537. 506 QTTFSSSNLDEGLYDFGERELYLSFKRESGQNSL 539 (602) Q Consensus 506 ~~~v~ss~~~~~~yd~~~~~l~~~f~~~~~~~~~ 539 (602) +.++-..-|+. =.|.++.. . 
T Consensus 528 ----~~~~~~~~~~~-------~~~~~~~~---~ 547 (547) T protein:vir:63 528 ----DNANAGKQGMK-------GDKPNDWQ---T 547 (547) T ss_pred ----cccchhhhhcC-------CCCccccC---C Confidence 00111111111 01211100 0 No 41 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=100.00 E-value=4e-78 Score=444.90 Aligned_cols=397 Identities=15% Similarity=0.120 Sum_probs=299.1 Q ss_pred CCCCcccccccc----hhhhcccCccccCC----CCHH-HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCccc Q lcl|NC_021537. 1 MSKAEETTQLDE----RHIATDVGRGIQPP----YNPE-TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDE 71 (602) Q Consensus 1 ~~k~~~~~~~~~----~~~~~~~~~~i~p~----~~~~-~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~ 71 (602) |+|..--+.+-. ..+.....+...+. -+.. .-++.+..+++|++||++||++||++||+++.+.+.. T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~~---- 76 (409) T protein:vir:94 1 MAKENIVTRIKKKLIDNWIDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVV---- 76 (409) T ss_pred CcccccchhhhhHHhhhhhcCCcccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeeccccc---- Confidence 555443222221 11111111111110 0111 1233455689999999999999999999998653321 Q ss_pred chhhHHHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 72 GGESYQTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) .|+.+.++ .+||+.||+.+||+.++.+++++||+|++++|+.+|++++|+||+|++|++..+... 
T Consensus 77 --------------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~ 142 (409) T protein:vir:94 77 --------------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS 142 (409) T ss_pred --------------chhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCC Confidence 13344444 579999999999999999999999999999999999999999999999986543211 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGV 230 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~ 230 (602) +..+ +.+...+|....++++||||+|++++.++++|+ T Consensus 143 --------------~~~~-----------------------------y~~~~~~g~~~~~~~~dvih~r~~~~~~~~~G~ 179 (409) T protein:vir:94 143 --------------RELY-----------------------------YSIHAATGNKLIVHNMDMLHFKHIVASNMVQGI 179 (409) T ss_pred --------------cEEE-----------------------------EEEEcCCceEEEEccccEEEecCCCCCCccccc Confidence 0001 122233456678999999999998889999999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccc Q lcl|NC_021537. 231 PDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDG 310 (602) Q Consensus 231 spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~ 310 (602) ||+..+..++....++++++ ++.++..++++++. +..+++++.+++++.|++..+ |+++++++++|+ T Consensus 180 s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~-~~~l~~e~~~~~~~~~~~~~~--~~g~~~vl~~g~-------- 246 (409) T protein:vir:94 180 SPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKY-GSNVGKEKRQQVLEDFKQYYE--ENGGILFQEPGV-------- 246 (409) T ss_pred cHHHHHHHHHHHHHHHHHHH--HHhcCCCCeeEEec-CCCCCHHHHHHHHHHHHHHhh--cCCCeeecCCCc-------- Confidence 99999999999999998875 45555555566655 456899999999999988654 677888876654 Q ss_pred ccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021537. 
311 GSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKI 390 (602) Q Consensus 311 ~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~ 390 (602) ++++++ ++++|+||+|.+++++++||++|||||++||..++++++|+|++.+.|+++||+|+++.||++||++ T Consensus 247 ------~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~ 319 (409) T protein:vir:94 247 ------EIEPLP-KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRK 319 (409) T ss_pred ------eEEEcC-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 556665 4568999999999999999999999999999998999999999999999999999999999999999 Q ss_pred cCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCC Q lcl|NC_021537. 391 IHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDG 470 (602) Q Consensus 391 Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~ 470 (602) |++..+...+++|+||.+++++. |.+.+++++++++++|+||+||+|+++|+||+|+|+ ..+++.++.++....... T Consensus 320 Ll~~~~~~~~~~i~fd~~~ll~~--d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD-~~~~~~n~~~~~~~~~~~ 396 (409) T protein:vir:94 320 LLTKTDREKNRYFKFNVKSYLRA--DSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD-KPLISGDLYPIDTPLELR 396 (409) T ss_pred hCCcccccCcceEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-eEeecccccccccchhhc Confidence 99998887889999999999877 778888999999999999999999999999998864 334555665554221111 Q ss_pred C--cCcccccccccccccc Q lcl|NC_021537. 471 D--AEAMLTRSKAAPPLEN 487 (602) Q Consensus 471 ~--~~~~~~~~~~~~~~~~ 487 (602) . .++...... . 
T Consensus 397 ~~~kGG~~n~~e------~ 409 (409) T protein:vir:94 397 KSLKGGDKNVNE------S 409 (409) T ss_pred ccccCCCCCcCC------C Confidence 1 111000000 0 No 42 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=100.00 E-value=1.9e-77 Score=441.24 Aligned_cols=401 Identities=13% Similarity=0.129 Sum_probs=290.0 Q ss_pred CCCCccccccc--chh---hhccc----CccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCccc Q lcl|NC_021537. 1 MSKAEETTQLD--ERH---IATDV----GRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDE 71 (602) Q Consensus 1 ~~k~~~~~~~~--~~~---~~~~~----~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~ 71 (602) ..|.. ++.+. ... +.... +..++ .++... +-.++.|++||++||++||++|++++.... T Consensus 29 f~~~e-~r~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~----al~~~~V~acv~~Ia~~iA~lpl~~~~~~~------ 96 (441) T protein:vir:98 29 FYKNE-KRDLQYNEDDLQMMVQTLPGFQGTKLR-QYKDIE----AIRHSDIFTAVMMIASDLARMPIRVTVNGQ------ 96 (441) T ss_pred ccccc-cccccCCCcchHHHHHHhhcccccCcc-ccchhh----hhccHHHHHHHHHHHHhhccCceEEecCCc------ Confidence 11111 11111 000 00000 01111 122221 234678999999999999999999974211 Q ss_pred chhhHHHHHHhhhccchhhhh-hccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 
72 GGESYQTVRDFWYGSDSRWQI-GPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~l-~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) ....|+.+.+ +.+||+.||+.+||+.++.+++++||||++++|+.+|+|++|+||+|+.|++..+..+ T Consensus 97 -----------~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g 165 (441) T protein:vir:98 97 -----------INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKLDARG 165 (441) T ss_pred -----------ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEECCCC Confidence 0112444544 4689999999999999999999999999999999999999999999999987554321 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGV 230 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~ 230 (602) ..++++.. + .....+..+.++++||||+|+++ .++++|+ T Consensus 166 ---------------~~~~~~~~-------------------~------~~~~~~~~~~~~~~dviHir~~~-~dg~~G~ 204 (441) T protein:vir:98 166 ---------------RLYYFHQR-------------------I------DSNGNNIERNVKFEDMLDIKFYS-LDGINGL 204 (441) T ss_pred ---------------cEEEEEEE-------------------e------ccCcceeeEEEccccEEEeccCC-CCCcccc Confidence 11111100 0 00112345689999999999875 6789999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-hhcccccCcceeccCCccceeccc Q lcl|NC_021537. 
231 PDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-LKGSRYRTAILEVEEFVDDHGLGD 309 (602) Q Consensus 231 spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~g~~~~~~~~ 309 (602) ||+..+..+|..+.++++++.++|+||++|+|||++++...++++++++++.|++ +.|..|+|+++++++|++ T Consensus 205 spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~------ 278 (441) T protein:vir:98 205 SLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMT------ 278 (441) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCce------ Confidence 9999999999999999999999999999999999999876678999999999976 556789999999877654 Q ss_pred cccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021537. 310 GGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYK 389 (602) Q Consensus 310 ~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~ 389 (602) |+|++ ++++|+||+|.+++++++||++|||||++||... .+ ++.+++...|. +||+||++.||++||+ T Consensus 279 --------~~~l~-~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~-~~-~s~~q~~~~y~-~tl~P~~~~ie~~ln~ 346 (441) T protein:vir:98 279 --------FDQLE-VDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET-AN-MSITDANLDYL-STLKPYITCVCAELNF 346 (441) T ss_pred --------EEEcc-CChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC-CC-ccHHHHHHHHH-HHHHHHHHHHHHHHHh Confidence 55554 4678999999999999999999999999999643 23 35567766665 6999999999999999 Q ss_pred hcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccc-ccccccccccccc Q lcl|NC_021537. 390 IIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMT-LSEFEAEFGADAS 468 (602) Q Consensus 390 ~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~-~~~~~~~~~~~~~ 468 (602) +|++.. .+++++||.+.+++. 
|.+.+++++++++++|+||+||+|+++||||++||+.+.+ ++.+++++....+ T Consensus 347 ~L~~~~---~~~~~~fd~~~llr~--d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~~~~~~n~~~~~~~~~ 421 (441) T protein:vir:98 347 KFNDEY---VNREFKFDTTEIRVV--DEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDE 421 (441) T ss_pred hccccc---cCceEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccccc Confidence 998764 367899999999877 7778889999999999999999999999999999987664 4445554432111 Q ss_pred CCCcCcccccccccccccccccc Q lcl|NC_021537. 469 DGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) ........+ .........++ T Consensus 422 ~q~~~~~~~---~~~~kgGe~ne 441 (441) T protein:vir:98 422 YQMNKSRAT---DKKLKGGEENE 441 (441) T ss_pred ccccccccc---ccccCCCCCCC Confidence 110000000 00000000111 No 43 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=100.00 E-value=2.3e-77 Score=440.69 Aligned_cols=410 Identities=11% Similarity=0.073 Sum_probs=290.8 Q ss_pred CCCCcc-cccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHH Q lcl|NC_021537. 1 MSKAEE-TTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTV 79 (602) Q Consensus 1 ~~k~~~-~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~ 79 (602) +=++.. .....--.+. 
...++.|...-.-+...+..++.|++||++||++||++||+++.+..+...+ T Consensus 3 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~g~~~~~~Al~~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~~~--------- 71 (417) T protein:vir:38 3 LFRGLATEVDPHWADHL--LDSGVIPSFRGGYLGISALRNSDVLTAVSIVSGDVSRFPLVITDSSTDEVID--------- 71 (417) T ss_pred cccccccCCCccchhhh--cccccccccCCceechhhcccHHHHHHHHHHHHhhccCeeEEEEcCCcceec--------- Confidence 111110 0000000000 0001111111001112234578899999999999999999998754332111 Q ss_pred HHhhhccchhhh-hhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC-CceEEEEEeCcccccccccccccccccch Q lcl|NC_021537. 80 RDFWYGSDSRWQ-IGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD-GTPVGLAHVPAATVRVRKTTTTIEREDGE 157 (602) Q Consensus 80 ~~~~~~~~~~~~-l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~-G~~~~L~~l~p~~v~~~~~~~~~~~~~~~ 157 (602) .++... ++.+||++||+.+||+.++.+++++||+|++++|+.. |.|..|+|++|+.|++..... T Consensus 72 ------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~-------- 137 (417) T protein:vir:38 72 ------LANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDP-------- 137 (417) T ss_pred ------cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCC-------- Confidence 123333 4468999999999999999999999999999999875 679999999999998643211 Q ss_pred hhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEE-ecCceeEEechhHEEEecCCCCCCCcccccHHHHH Q lcl|NC_021537. 158 EVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVA-SDAGELKNGPANELIFLPNPSPLALYYGVPDWVAA 236 (602) Q Consensus 158 ~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~ 236 (602) +..++ ++. 
..++....++++||||||+++ .++++|+||+.++ T Consensus 138 -------~~~~y-----------------------------~~~~~~~~~~~~~~~~dviH~r~~~-~d~~~G~s~l~~~ 180 (417) T protein:vir:38 138 -------DNIIY-----------------------------RFTPYNSSMQKVCGFEDVIHWKFFS-YDTIMGRSPLLSL 180 (417) T ss_pred -------CeEEE-----------------------------EEEEcCCcEEEEecCcceEEecCCC-CCCccccCHHHHH Confidence 11111 111 223445678999999999875 6889999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccccccccc Q lcl|NC_021537. 237 MQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNI 316 (602) Q Consensus 237 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~ 316 (602) .++|..+.++++++.++|+||++|++||+.++ .+++++.+++++.|++.+++.|+|+++++++|+ T Consensus 181 ~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~-~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~-------------- 245 (417) T protein:vir:38 181 GDEIGLQESGVSTLQKFFKSGLKGSIIKAKES-RLSAEARQKIREDFERAQAGADAGSPIIVDATM-------------- 245 (417) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCcEEEEeCC-CCCHHHHHHHHHHHHHHhcccccCCceeccCCc-------------- Confidence 99999999999999999999999999999875 589999999999999888878999999987765 Q ss_pred ccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccc Q lcl|NC_021537. 317 ELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDAL 396 (602) Q Consensus 317 ~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~ 396 (602) +|++++ ++++|+||+|++++++++||++|||||++||. 
.++++|++++.+.|+++||+|+++.||++||.+|+++.+ T Consensus 246 ~~~~l~-~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~--~~~~s~~e~~~~~~~~~tl~P~~~~ie~~l~~~Ll~~~~ 322 (417) T protein:vir:38 246 DYQPLE-VDTNVLNLINSNNYSTAQIAKALRVPAYRLAQ--NSPNQSVKQLADDYIRNDLPFYFEPITSEFELKLLDDAQ 322 (417) T ss_pred eEEEcc-CCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCC--CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhcChhh Confidence 455554 45689999999999999999999999999984 568999999999999999999999999999999998876 Q ss_pred cccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccc-ccccccccccccCCCcCcc Q lcl|NC_021537. 397 DVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTL-SEFEAEFGADASDGDAEAM 475 (602) Q Consensus 397 ~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~-~~~~~~~~~~~~~~~~~~~ 475 (602) . .+++++||.+.+...+ ...+++++++|+||+||+|+++|+||+++|++|.+. +.+++......+....... T Consensus 323 ~-~~~~~~fd~~~l~~~~------~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~~~~~ 395 (417) T protein:vir:38 323 R-HQYCIGFDTKSVNGLP------IADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQAEHAA 395 (417) T ss_pred c-ccceEEechhhhhHHH------HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeeccccccccccccccccccc Confidence 5 4789999988764332 123678999999999999999999999999887654 4455554422221111110 Q ss_pred cccccccccccccccccccccccc Q lcl|NC_021537. 
476 LTRSKAAPPLENKIGERDSVDVDV 499 (602) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~ 499 (602) .....+.+...+ ++.......+ T Consensus 396 ~~kgg~~~~~~~--~~~~~~~~~~ 417 (417) T protein:vir:38 396 ELKGGDTNAKGN--QNGSGTNANS 417 (417) T ss_pred ccCCCCCCCCCC--CcCCCCcCCC Confidence 000000000000 0000000000 No 44 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=100.00 E-value=4.3e-77 Score=439.26 Aligned_cols=403 Identities=12% Similarity=0.105 Sum_probs=290.0 Q ss_pred CCCCcccccccc---------hhhh----------cccCccccC---CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCce Q lcl|NC_021537. 1 MSKAEETTQLDE---------RHIA----------TDVGRGIQP---PYNPETLAAFQELNETHQACIRKKSRYEAGYGF 58 (602) Q Consensus 1 ~~k~~~~~~~~~---------~~~~----------~~~~~~i~p---~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~ 58 (602) -.|.++.+++-. |... ....++..- .++.. -+..++.|++||++||++||++|| T Consensus 14 ~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~al~~~~V~~cv~~Ia~~iA~lp~ 89 (441) T protein:vir:79 14 KSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDI----EAIRHSDIFTAVMMIASDLARMPI 89 (441) T ss_pred cccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchh----hhhccHHHHHHHHHHHHhhccCce Confidence 222222222110 0000 000000000 12111 123467899999999999999999 Q ss_pred EEEEecCCCCcccchhhHHHHHHhhhccchhhhh-hccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEe Q lcl|NC_021537. 59 EIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQI-GPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHV 137 (602) Q Consensus 59 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l 137 (602) +++.... 
....|+++.+ +.+||+.||+.+||+.++.+++++||||++++|+..|+|++|+|| T Consensus 90 ~~~~~~~-----------------~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i 152 (441) T protein:vir:79 90 RVTVNGQ-----------------INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFR 152 (441) T ss_pred eeecCcc-----------------ccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEE Confidence 9874211 1112444544 468999999999999999999999999999999999999999999 Q ss_pred CcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEE Q lcl|NC_021537. 138 PAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIF 217 (602) Q Consensus 138 ~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH 217 (602) +|+.|++..+..+ ..++.+.. + .....+..+.++++|||| T Consensus 153 ~~~~v~v~~d~~g---------------~~~~~~~~-------------------~------~~~~~~~~~~~~~~dvih 192 (441) T protein:vir:79 153 KTSEIELKSDARG---------------RLYYFHQR-------------------I------DSNGNNIERNVKFEDMLD 192 (441) T ss_pred cCceeEEEECCCc---------------cEEEEEEE-------------------e------ccCCceeEEEEccccEEE Confidence 9999987554321 11111100 0 001123456899999999 Q ss_pred ecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-hhcccccCcce Q lcl|NC_021537. 
218 LPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-LKGSRYRTAIL 296 (602) Q Consensus 218 ~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~~g~~nag~~~ 296 (602) +|+++ .++++|+||+..+..+|..+.++++++.++|+||++|+|||++++...++++++++++.|++ +.|..|+|+++ T Consensus 193 ~k~~~-~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~ 271 (441) T protein:vir:79 193 IKFYS-LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVV 271 (441) T ss_pred eccCC-CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcce Confidence 99764 78899999999999999999999999999999999999999999876788999999999976 55678999999 Q ss_pred eccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHH Q lcl|NC_021537. 297 EVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGII 376 (602) Q Consensus 297 ~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l 376 (602) ++++|++ |+|++ ++++|+||+|++++++++||++|||||.+||... .++ +.+++...| .+|| T Consensus 272 vl~~G~~--------------~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~-s~~q~~~~~-~~tl 333 (441) T protein:vir:79 272 VLDESMT--------------FDQLE-VDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET-ANM-SITDANLDY-LSTL 333 (441) T ss_pred ecCCCce--------------EEEcc-CChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC-CCc-cHHHHHHHH-HHHH Confidence 9877654 55555 4568999999999999999999999999999643 333 456665555 5699 Q ss_pred HHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccc- Q lcl|NC_021537. 377 EPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMT- 455 (602) Q Consensus 377 ~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~- 455 (602) +|+++.||++||++|+++. .+++++||++.+++. 
|.+.+++++++++++|+||+||+|+++||||++||+.+.+ T Consensus 334 ~P~~~~ie~eln~kl~~~~---~~~~~~fd~~~llr~--D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~ 408 (441) T protein:vir:79 334 KPYITCVCAELNFKFNDEY---VNREFKFDTTEIRVV--DEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHR 408 (441) T ss_pred HHHHHHHHHHHhhhccccc---cCceEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEe Confidence 9999999999999998764 467899999999877 7788889999999999999999999999999999987665 Q ss_pred cccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 456 LSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 456 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) ++.+++++....+........+ ..+......++ T Consensus 409 ~~~n~~~~~~~~~~~~~~~~~~---~~~~kgGe~~e 441 (441) T protein:vir:79 409 VDLNHVNIELVDEYQMNKSRAT---DKKLKGGEENE 441 (441) T ss_pred eccccccccccccccccccccc---ccccCCCCCCC Confidence 3445554432211110000000 00000011111 No 45 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=100.00 E-value=4.3e-77 Score=439.26 Aligned_cols=403 Identities=12% Similarity=0.105 Sum_probs=290.0 Q ss_pred CCCCcccccccc---------hhhh----------cccCccccC---CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCce Q lcl|NC_021537. 1 MSKAEETTQLDE---------RHIA----------TDVGRGIQP---PYNPETLAAFQELNETHQACIRKKSRYEAGYGF 58 (602) Q Consensus 1 ~~k~~~~~~~~~---------~~~~----------~~~~~~i~p---~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~ 58 (602) -.|.++.+++-. |... ....++..- .++.. 
-+..++.|++||++||++||++|| T Consensus 14 ~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~al~~~~V~~cv~~Ia~~iA~lp~ 89 (441) T protein:vir:94 14 KSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDI----EAIRHSDIFTAVMMIASDLARMPI 89 (441) T ss_pred cccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchh----hhhccHHHHHHHHHHHHhhccCce Confidence 222222222110 0000 000000000 12111 123467899999999999999999 Q ss_pred EEEEecCCCCcccchhhHHHHHHhhhccchhhhh-hccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEe Q lcl|NC_021537. 59 EIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQI-GPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHV 137 (602) Q Consensus 59 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l 137 (602) +++.... ....|+++.+ +.+||+.||+.+||+.++.+++++||||++++|+..|+|++|+|| T Consensus 90 ~~~~~~~-----------------~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i 152 (441) T protein:vir:94 90 RVTVNGQ-----------------INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFR 152 (441) T ss_pred eeecCcc-----------------ccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEE Confidence 9874211 1112444544 468999999999999999999999999999999999999999999 Q ss_pred CcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEE Q lcl|NC_021537. 138 PAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIF 217 (602) Q Consensus 138 ~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH 217 (602) +|+.|++..+..+ ..++.+.. + .....+..+.++++|||| T Consensus 153 ~~~~v~v~~d~~g---------------~~~~~~~~-------------------~------~~~~~~~~~~~~~~dvih 192 (441) T protein:vir:94 153 KTSEIELKSDARG---------------RLYYFHQR-------------------I------DSNGNNIERNVKFEDMLD 192 (441) T ss_pred cCceeEEEECCCc---------------cEEEEEEE-------------------e------ccCCceeEEEEccccEEE Confidence 9999987554321 11111100 0 001123456899999999 Q ss_pred ecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-hhcccccCcce Q lcl|NC_021537. 
218 LPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-LKGSRYRTAIL 296 (602) Q Consensus 218 ~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~~g~~nag~~~ 296 (602) +|+++ .++++|+||+..+..+|..+.++++++.++|+||++|+|||++++...++++++++++.|++ +.|..|+|+++ T Consensus 193 ~k~~~-~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~ 271 (441) T protein:vir:94 193 IKFYS-LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVV 271 (441) T ss_pred eccCC-CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcce Confidence 99764 78899999999999999999999999999999999999999999876788999999999976 55678999999 Q ss_pred eccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHH Q lcl|NC_021537. 297 EVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGII 376 (602) Q Consensus 297 ~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l 376 (602) ++++|++ |+|++ ++++|+||+|++++++++||++|||||.+||... .++ +.+++...| .+|| T Consensus 272 vl~~G~~--------------~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~-s~~q~~~~~-~~tl 333 (441) T protein:vir:94 272 VLDESMT--------------FDQLE-VDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET-ANM-SITDANLDY-LSTL 333 (441) T ss_pred ecCCCce--------------EEEcc-CChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC-CCc-cHHHHHHHH-HHHH Confidence 9877654 55555 4568999999999999999999999999999643 333 456665555 5699 Q ss_pred HHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccc- Q lcl|NC_021537. 377 EPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMT- 455 (602) Q Consensus 377 ~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~- 455 (602) +|+++.||++||++|+++. .+++++||++.+++. 
|.+.+++++++++++|+||+||+|+++||||++||+.+.+ T Consensus 334 ~P~~~~ie~eln~kl~~~~---~~~~~~fd~~~llr~--D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~ 408 (441) T protein:vir:94 334 KPYITCVCAELNFKFNDEY---VNREFKFDTTEIRVV--DEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHR 408 (441) T ss_pred HHHHHHHHHHHhhhccccc---cCceEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEe Confidence 9999999999999998764 467899999999877 7788889999999999999999999999999999987665 Q ss_pred cccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 456 LSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 456 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) ++.+++++....+........+ ..+......++ T Consensus 409 ~~~n~~~~~~~~~~~~~~~~~~---~~~~kgGe~~e 441 (441) T protein:vir:94 409 VDLNHVNIELVDEYQMNKSRAT---DKKLKGGEENE 441 (441) T ss_pred eccccccccccccccccccccc---ccccCCCCCCC Confidence 3445554432211110000000 00000011111 No 46 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=100.00 E-value=9.2e-77 Score=437.44 Aligned_cols=397 Identities=14% Similarity=0.095 Sum_probs=291.7 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHH-HHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPET-LAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTV 79 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~-l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~ 79 (602) .++..+.+.... ..+.. ..+..+.+.... -...+..+++|++||++||++||++||+++.+.+.... 
T Consensus 8 f~~~~~~~~~~~-~~~~~-~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~---------- 75 (409) T protein:vir:84 8 FSGPSEERTLTK-ISGIP-SPAEDWAMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDNVRI---------- 75 (409) T ss_pred hcCCCccccccc-ccccc-cccchhhccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCccc---------- Confidence 222222221111 11000 000001000000 11223457899999999999999999999876543211 Q ss_pred HHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEe-eCCCCceEEEEEeCcccccccccccccccccch Q lcl|NC_021537. 80 RDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEIL-VEGDGTPVGLAHVPAATVRVRKTTTTIEREDGE 157 (602) Q Consensus 80 ~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~-r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~ 157 (602) ..|++..++ .+||+.||+.+|++.++.+++++||+|+++. ++..|++.+|+||+|.+|++....... T Consensus 76 -----~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~------ 144 (409) T protein:vir:84 76 -----PVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDED------ 144 (409) T ss_pred -----ccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCc------ Confidence 124455555 5899999999999999999999999999986 688899999999999999865322110 Q ss_pred hhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHH Q lcl|NC_021537. 158 EVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAM 237 (602) Q Consensus 158 ~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~ 237 (602) +. +++ .++..+| ++++++||||++.+++.+.++|+||+..+. T Consensus 145 -------~~-~~~----------------------------~~~~~~g--~~~~~~dvih~~~~~~~~~~~G~s~i~~~~ 186 (409) T protein:vir:84 145 -------GD-WIE----------------------------PVYRIDG--KVVPNHRIMHIKRYPVAGCALGMSPIEKAA 186 (409) T ss_pred -------ce-EEE----------------------------EEecCCc--eEEchhhEEEecCCCCCcccccccHHHHHH Confidence 00 000 0011112 458899999999998888889999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccc Q lcl|NC_021537. 
238 QTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIE 317 (602) Q Consensus 238 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~ 317 (602) .+|....++++++.++|+||++|+|+|++++ .+++++.+++++.|.+.. .|+|+++++++|+ + T Consensus 187 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~--~n~g~~~vl~~g~--------------~ 249 (409) T protein:vir:84 187 SAIGLGLAAERYGLRWFRDSANPSGILSSDA-DLTPDQVKQTQKQWIQSH--HNRRLPAVMSAGI--------------K 249 (409) T ss_pred HHHHHHHHHHHHHHHHHhcCCCccEEEecCC-CCCHHHHHHHHHHHHHHh--ccCCCeeecCCCc--------------e Confidence 9999999999999999999999999999875 589999999999996543 5778888887654 4 Q ss_pred cccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCC--ccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcc Q lcl|NC_021537. 318 LEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSN--RANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDA 395 (602) Q Consensus 318 ~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~ 395 (602) |++++ ++++|+||+|++++++++||++|||||++||+.+.++ +||+|++.+.|+++||.||++.||++||++|. T Consensus 250 ~~~~~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~L~--- 325 (409) T protein:vir:84 250 WQSVS-ITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRCIEQALDTFLP--- 325 (409) T ss_pred EEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--- Confidence 55565 4568999999999999999999999999999877665 48899999999999999999999999999873 Q ss_pred ccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCcc Q lcl|NC_021537. 396 LDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAM 475 (602) Q Consensus 396 ~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~ 475 (602) .+++++|+++.+++. |.+.+++++.+++++|+||+||+|+++|+||+|||+ ..+.+.+++.++......+..+. 
T Consensus 326 ---~g~~i~fd~~~l~~~--d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD-~~~~~~n~~~~~~~~~~~~~~~~ 399 (409) T protein:vir:84 326 ---RGQFVKFNVDGLMRG--DVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGD-IHLQPMNFVPLGYVPPEEPAQEP 399 (409) T ss_pred ---CCCeEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-eeeecccccccccCCccccCcCC Confidence 367899999999876 778888999999999999999999999999998864 44555666655533222111111 Q ss_pred cccccccccccccccc Q lcl|NC_021537. 476 LTRSKAAPPLENKIGE 491 (602) Q Consensus 476 ~~~~~~~~~~~~~~~~ 491 (602) .+......++ T Consensus 400 ------~~~~~~~gn~ 409 (409) T protein:vir:84 400 ------QPNSATEGNK 409 (409) T ss_pred ------CCCCccCCCC Confidence 1111111111 No 47 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=100.00 E-value=1.1e-75 Score=431.44 Aligned_cols=469 Identities=16% Similarity=0.118 Sum_probs=298.3 Q ss_pred CCCCcccccccchhhhccc--CccccCCCCHHHHHHHHhhhHHHHHHHHHHH-----------HhhccCceEEEEecCCC Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDV--GRGIQPPYNPETLAAFQELNETHQACIRKKS-----------RYEAGYGFEIVAHPSAD 67 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~--~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia-----------~~ia~~~~~i~~~~~~~ 67 (602) ..|+..+++.....+...+ ...+.|+.++..+.+....+++|++||++++ .+|+++||+|+.++.+. T Consensus 54 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~ 133 (574) T protein:vir:80 54 KTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEA 133 (574) T ss_pred hcccccchhhhhccccccccCcCccCCcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCC Confidence 2233333333333333322 2355667776554444445677666666655 56678999998876543 Q ss_pred CcccchhhHHHHHHhhhccchhhhhhc----cCCcc-CCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccc Q lcl|NC_021537. 
68 EPDEGGESYQTVRDFWYGSDSRWQIGP----EGTAM-STPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATV 142 (602) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~l~~----~pn~~-~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v 142 (602) ... .+. ....|+++.++. .||+. +|+.+|++.++.+++++||+|++++|+.+|+|++|+||+|.+| T Consensus 134 ~~~--~~~-------~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V 204 (574) T protein:vir:80 134 EPT--SHD-------IANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTI 204 (574) T ss_pred Ccc--chh-------hhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCcee Confidence 211 000 111233343332 23443 5899999999999999999999999999999999999999999 Q ss_pred cccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCC Q lcl|NC_021537. 143 RVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPS 222 (602) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~ 222 (602) ++..+..+... ..+..|+|+. .++....++++||||++.+. T Consensus 205 ~v~~d~~~~~~---------~~~~~y~~~~------------------------------~g~~~~~~~~~eiih~~~~~ 245 (574) T protein:vir:80 205 FLATNGEGKLI---------KNGERFVQVI------------------------------DNRIVAKFNERELAFAVRNP 245 (574) T ss_pred EEEEcCccccc---------cCceEEEEEe------------------------------CCceEEEEccccEEEEeccC Confidence 98765543211 1223344433 23455678999999999764 Q ss_pred CC---CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecc-ccCCHHHHHHHHHHHHH-hhcccccCccee Q lcl|NC_021537. 223 PL---ALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTG-GTLSEDSKEDLRNLMDN-LKGSRYRTAILE 297 (602) Q Consensus 223 ~~---~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~~~~~~~~~~l~~~~~~-~~g~~nag~~~~ 297 (602) .. 
++.||+|||.++..+|..+.++++++.++|+||++|+|||++++ ..+++++.+++++.|++ +.|..|+|++++ T Consensus 246 ~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~v 325 (574) T protein:vir:80 246 RADIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPV 325 (574) T ss_pred CCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccccccccee Confidence 33 46799999999999999999999999999999999999999875 45899999999999976 567789999765 Q ss_pred ccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccC----------CccCHHHH Q lcl|NC_021537. 298 VEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTS----------NRANSKEQ 367 (602) Q Consensus 298 ~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~----------~~sn~e~~ 367 (602) +.+ .+++|++++ ++++|+||+|++++++++||++|||||++||+.+.+ |++|+|++ T Consensus 326 l~~-------------~G~~~~~l~-~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~ 391 (574) T protein:vir:80 326 VSA-------------EDVKFVNMT-PSANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEK 391 (574) T ss_pred ecC-------------CCceEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHH Confidence 532 235677775 566899999999999999999999999999986543 57999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCC Q lcl|NC_021537. 368 TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPF 447 (602) Q Consensus 368 ~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~ 447 (602) .+.|+++||+|++.+||++||++|++..+ .+++++|+..++... +... 
+ +..++.+|+||+||+|+++||+|+ T Consensus 392 ~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~--~~~~~~f~~~d~~~~--~~~~--~-~~~~~~~G~lT~NE~R~~lgl~Pi 464 (574) T protein:vir:80 392 MQASQNKGLQPLLRFIEDTVNTYIVAEFG--EKYQFQFRGGDLSAQ--LDKL--K-IIEQEGKVFRTVNEIRHDKGLEPI 464 (574) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhcC--CceEEEecccchhhH--HHHH--H-HHHHHhCCccCHHHHHHHhCCCCC Confidence 99999999999999999999999998765 468899998876543 2221 2 245788999999999999999999 Q ss_pred CCCccccccccccccccccccCCCcCcccc-----------------ccccccccc-cccccc-ccccccccccchhhhh Q lcl|NC_021537. 448 EDDRGDMTLSEFEAEFGADASDGDAEAMLT-----------------RSKAAPPLE-NKIGER-DSVDVDVSKDPIEQTT 508 (602) Q Consensus 448 ~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~-~~~~~~-~~~~~~~~~~~m~~~~ 508 (602) +||+ ..+.+.++...+...+....+.+.. ++..+|... ....+. ........-+...+.. T Consensus 465 ~gGD-~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~~~~~~~~~~~~~ 543 (574) T protein:vir:80 465 KGGD-VILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQDEQQGLNGKSKKVNG 543 (574) T ss_pred CCCC-EeeeccceeecccccccccCCccchhccccccccccCCCCCCCCCCCCCCccccccchhhhhhhhhccchhhhcC Confidence 8864 2333444444432222111111000 000000000 000000 0000000000001111 Q ss_pred cchhhhhhheecccccEEEEEEecccCCcceee Q lcl|NC_021537. 509 FSSSNLDEGLYDFGERELYLSFKRESGQNSLYV 541 (602) Q Consensus 509 v~ss~~~~~~yd~~~~~l~~~f~~~~~~~~~y~ 541 (602) ...+.+..-||-..++---- ... .++|.--. 
T Consensus 544 ~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~ 574 (574) T protein:vir:80 544 KVDDNVGKDGQLKSEENTNS-TKH-GTDGIKKE 574 (574) T ss_pred Cccccccccccccccccccc-ccc-cCccccCC Confidence 11111222222111100000 000 00111111 No 48 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=100.00 E-value=9.7e-76 Score=431.83 Aligned_cols=400 Identities=11% Similarity=0.018 Sum_probs=294.8 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) .+|...........+..-.++.....+... -|..++.|++||++||++||++||+++.+++.. T Consensus 4 f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~Al~~~~V~~~i~~Ia~~iA~lp~~~~~~~g~~------------- 66 (406) T protein:vir:97 4 FQPLGTSKVSYDDYISSVLAGDVSQKYLGV----SALKNSDILTATSIIAGDIARFPLVKKDVNGDI------------- 66 (406) T ss_pred ccccCCCCCCcchHHHHHhcCCCCcccccc----hhhccHHHHHHHHHHHHhhhhCeeEEEecCccc------------- Confidence 444333333333333332232222223222 233468899999999999999999776433211 Q ss_pred Hhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC-CCceEEEEEeCcccccccccccccccccchh Q lcl|NC_021537. 81 DFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG-DGTPVGLAHVPAATVRVRKTTTTIEREDGEE 158 (602) Q Consensus 81 ~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~-~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~ 158 (602) ...|+.+.++ .+||+.||+.+||+.++.++++.||||++++|+. .|++.+|+||+|+.|++..+.. 
T Consensus 67 ---~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~--------- 134 (406) T protein:vir:97 67 ---IHDEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDN--------- 134 (406) T ss_pred ---cccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCC--------- Confidence 1135556666 5899999999999999999999999999999985 6899999999999998643321 Q ss_pred hhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHH Q lcl|NC_021537. 159 VENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQ 238 (602) Q Consensus 159 ~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~ 238 (602) +..++++ ....++....++++||||||+++ .++++|+||+.++.. T Consensus 135 ------~~~~y~~----------------------------~~~~~~~~~~~~~~evih~r~~~-~dg~~G~spi~~~~~ 179 (406) T protein:vir:97 135 ------HEIVYTF----------------------------TDMLTAKQVKCFAHDVIHWKFFS-HDTILGRSPLLSLGD 179 (406) T ss_pred ------ceEEEEE----------------------------EecCCceEEEEccccEEEecCCC-CCCcccccHHHHHHH Confidence 1111111 01234566789999999999764 788999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccccccccccc Q lcl|NC_021537. 239 TMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIEL 318 (602) Q Consensus 239 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~ 318 (602) +|..+.++++++.++|+||+.|++++. ++..+++++.+++++.|++..++.|+|++++++.|++ | T Consensus 180 ~i~~~~a~~~~~~~~f~ng~~~~~i~~-~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~--------------~ 244 (406) T protein:vir:97 180 EIDLQTGGINTLIKFFKDGFSSGILTM-KGAQLSGDARQRARQEFEKMREGSVGGSPLVFDSTME--------------Y 244 (406) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEe-cCCCCCHHHHHHHHHHHHHHhcccccCceeecCCCce--------------E Confidence 999999999999999999998876655 4567899999999999999888889999999876654 5 Q ss_pred ccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccc Q lcl|NC_021537. 
319 EPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDV 398 (602) Q Consensus 319 ~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~ 398 (602) ++++ .++.|+||+|++++++++||++|||||.+||. .++++|++++...|++.||+||++.||++||++|+++.+. T Consensus 245 ~~l~-~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~--~~~~~~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~- 320 (406) T protein:vir:97 245 TPLE-IDTNVLQLITSNNFSTAQIAKALRVPSYKLGV--NSPNQSVAQLMEDYVTNDLPFYFDAITSELGLKTLNDKDR- 320 (406) T ss_pred EEcc-CCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCC--CCCcchHHHHHHHHHHHHHHHHHHHHHHHHhhhhcChhhc- Confidence 5554 45689999999999999999999999999985 4578899999999999999999999999999999988664 Q ss_pred cceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccc-cccccccccccCCCcCcccc Q lcl|NC_021537. 399 DEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLS-EFEAEFGADASDGDAEAMLT 477 (602) Q Consensus 399 ~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~-~~~~~~~~~~~~~~~~~~~~ 477 (602) .+++++|+++..+. .+++.+.+++++|+||+||+|+++|++|++++++|.+.. .+++++.......+...... T Consensus 321 ~~~~i~fd~~~~~~------~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~ 394 (406) T protein:vir:97 321 RLYHIEFDTRSVTG------RNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLNYVFLDKKEEYQDKVGIKG 394 (406) T ss_pred cceeEEEecCccch------hhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccCccchhccccccccccccc Confidence 56889999876432 334567789999999999999999999999988777654 45555543221111111100 Q ss_pred ccccccccccccccc Q lcl|NC_021537. 478 RSKAAPPLENKIGER 492 (602) Q Consensus 478 ~~~~~~~~~~~~~~~ 492 (602) . ..+...+. ++. 
T Consensus 395 ~-gg~~~~~~--~~~ 406 (406) T protein:vir:97 395 K-GGEVNAEE--DKS 406 (406) T ss_pred C-CCCCCCCC--CCC Confidence 0 00000000 000 No 49 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=100.00 E-value=1.8e-75 Score=430.34 Aligned_cols=401 Identities=12% Similarity=0.099 Sum_probs=288.4 Q ss_pred CCCCccccc-ccchhhhcccCcccc---CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhH Q lcl|NC_021537. 1 MSKAEETTQ-LDERHIATDVGRGIQ---PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESY 76 (602) Q Consensus 1 ~~k~~~~~~-~~~~~~~~~~~~~i~---p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~ 76 (602) -+|...+.. .+...+.....++.. ..++.. . +-.++.|++||++||++||++||+++.... T Consensus 7 ~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~-al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~----------- 71 (416) T protein:vir:45 7 NEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDI---E-AIRHSDIFTAVMMIASDLARMPIRVTVNGQ----------- 71 (416) T ss_pred cccccccCCCcchhHHHHHhccccccCccccchh---h-hhcchHHHHHHHHHHHhhccCceEEecCcc----------- Confidence 222111110 000001110111111 112221 1 223577899999999999999999874211 Q ss_pred HHHHHhhhccchhhh-hhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccccccc Q lcl|NC_021537. 77 QTVRDFWYGSDSRWQ-IGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERED 155 (602) Q Consensus 77 ~~~~~~~~~~~~~~~-l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~ 155 (602) ....|+.+. 
|+.+||+.||+.+||+.++.+++++||||++++|+..|++++|+||+|+.|++..+..+ T Consensus 72 ------~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g----- 140 (416) T protein:vir:45 72 ------INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARG----- 140 (416) T ss_pred ------ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCc----- Confidence 011244444 44689999999999999999999999999999999999999999999999987544321 Q ss_pred chhhhhcccCceeEE--EEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHH Q lcl|NC_021537. 156 GEEVENIESGHGYVQ--VRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDW 233 (602) Q Consensus 156 ~~~~~~~~~~~~~~q--i~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl 233 (602) ..++. .+.+ ...+..+.+++++|||+|+++ .++++|+||+ T Consensus 141 ----------~~~~~~~~~~~---------------------------~~~~~~~~~~~~evihir~~~-~d~~~G~s~i 182 (416) T protein:vir:45 141 ----------RLYYFHQRIDS---------------------------NGNNIERNVKFEDMLDIKFYS-LDGINGLSLL 182 (416) T ss_pred ----------cEEEEEEEecC---------------------------CCceeEEEEccccEEEeccCC-CCCccccCHH Confidence 11111 0000 112334689999999999765 6889999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-hhcccccCcceeccCCccceecccccc Q lcl|NC_021537. 
234 VAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-LKGSRYRTAILEVEEFVDDHGLGDGGS 312 (602) Q Consensus 234 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~g~~~~~~~~~~~ 312 (602) ..+.++|..+.++++++.++|+||++|++||++++...++++++++++.|++ +.|..|+|+++++++|++ T Consensus 183 ~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~--------- 253 (416) T protein:vir:45 183 DTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMT--------- 253 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecCCCce--------- Confidence 9999999999999999999999999999999999877788999999999976 456689999999877654 Q ss_pred ccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021537. 313 DVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIH 392 (602) Q Consensus 313 ~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll 392 (602) |++++ ++++|+||+|++++++++||++|||||.++|... .+ ++.+++...| .+||+|+++.||++||++|+ T Consensus 254 -----~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~-~~~~~~~~~~-~~~l~P~~~~ie~~ln~~l~ 324 (416) T protein:vir:45 254 -----FDQLE-VDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET-AN-MSITDANLDY-LSTLKPYITCVCAELNFKFN 324 (416) T ss_pred -----eEecc-CCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-CC-ccHHHHHHHH-HHHHHHHHHHHHHHHhhhcc Confidence 55554 4568999999999999999999999999999643 23 3456665555 56999999999999999998 Q ss_pred CccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccc-cccccccccccccCCC Q lcl|NC_021537. 393 QDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMT-LSEFEAEFGADASDGD 471 (602) Q Consensus 393 ~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~-~~~~~~~~~~~~~~~~ 471 (602) +.. .+++++|+++.+++. |.+.+++++++++++|+||+||+|+++|+||++||+.+++ ++.+++++....+... 
T Consensus 325 ~~~---~~~~~~f~~~~l~~~--D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~ 399 (416) T protein:vir:45 325 DEY---VNREFKFDTTEIRVV--DEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQM 399 (416) T ss_pred ccc---cCceEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccCc Confidence 765 367899999999876 7788889999999999999999999999999999988765 4445554432111000 Q ss_pred cCcccccccccccccccccc Q lcl|NC_021537. 472 AEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~ 491 (602) .. ......+-.....+| T Consensus 400 ~~---~~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 400 NK---SRATDKKLKGGEENE 416 (416) T ss_pred cc---ccccccccCCCCCCC Confidence 00 000000000111111 No 50 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=100.00 E-value=1.8e-75 Score=430.34 Aligned_cols=401 Identities=12% Similarity=0.099 Sum_probs=288.4 Q ss_pred CCCCccccc-ccchhhhcccCcccc---CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhH Q lcl|NC_021537. 1 MSKAEETTQ-LDERHIATDVGRGIQ---PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESY 76 (602) Q Consensus 1 ~~k~~~~~~-~~~~~~~~~~~~~i~---p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~ 76 (602) -+|...+.. .+...+.....++.. ..++.. . +-.++.|++||++||++||++||+++.... T Consensus 7 ~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~-al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~----------- 71 (416) T protein:vir:81 7 NEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDI---E-AIRHSDIFTAVMMIASDLARMPIRVTVNGQ----------- 71 (416) T ss_pred cccccccCCCcchhHHHHHhccccccCccccchh---h-hhcchHHHHHHHHHHHhhccCceEEecCcc----------- Confidence 222111110 000001110111111 112221 1 223577899999999999999999874211 Q ss_pred HHHHHhhhccchhhh-hhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccccccc Q lcl|NC_021537. 
77 QTVRDFWYGSDSRWQ-IGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERED 155 (602) Q Consensus 77 ~~~~~~~~~~~~~~~-l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~ 155 (602) ....|+.+. |+.+||+.||+.+||+.++.+++++||||++++|+..|++++|+||+|+.|++..+..+ T Consensus 72 ------~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g----- 140 (416) T protein:vir:81 72 ------INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARG----- 140 (416) T ss_pred ------ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCc----- Confidence 011244444 44689999999999999999999999999999999999999999999999987544321 Q ss_pred chhhhhcccCceeEE--EEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHH Q lcl|NC_021537. 156 GEEVENIESGHGYVQ--VRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDW 233 (602) Q Consensus 156 ~~~~~~~~~~~~~~q--i~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl 233 (602) ..++. .+.+ ...+..+.+++++|||+|+++ .++++|+||+ T Consensus 141 ----------~~~~~~~~~~~---------------------------~~~~~~~~~~~~evihir~~~-~d~~~G~s~i 182 (416) T protein:vir:81 141 ----------RLYYFHQRIDS---------------------------NGNNIERNVKFEDMLDIKFYS-LDGINGLSLL 182 (416) T ss_pred ----------cEEEEEEEecC---------------------------CCceeEEEEccccEEEeccCC-CCCccccCHH Confidence 11111 0000 112334689999999999765 6889999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-hhcccccCcceeccCCccceecccccc Q lcl|NC_021537. 
234 VAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-LKGSRYRTAILEVEEFVDDHGLGDGGS 312 (602) Q Consensus 234 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~g~~~~~~~~~~~ 312 (602) ..+.++|..+.++++++.++|+||++|++||++++...++++++++++.|++ +.|..|+|+++++++|++ T Consensus 183 ~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~--------- 253 (416) T protein:vir:81 183 DTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMT--------- 253 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecCCCce--------- Confidence 9999999999999999999999999999999999877788999999999976 456689999999877654 Q ss_pred ccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021537. 313 DVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIH 392 (602) Q Consensus 313 ~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll 392 (602) |++++ ++++|+||+|++++++++||++|||||.++|... .+ ++.+++...| .+||+|+++.||++||++|+ T Consensus 254 -----~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~-~~~~~~~~~~-~~~l~P~~~~ie~~ln~~l~ 324 (416) T protein:vir:81 254 -----FDQLE-VDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET-AN-MSITDANLDY-LSTLKPYITCVCAELNFKFN 324 (416) T ss_pred -----eEecc-CCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-CC-ccHHHHHHHH-HHHHHHHHHHHHHHHhhhcc Confidence 55554 4568999999999999999999999999999643 23 3456665555 56999999999999999998 Q ss_pred CccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccc-cccccccccccccCCC Q lcl|NC_021537. 393 QDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMT-LSEFEAEFGADASDGD 471 (602) Q Consensus 393 ~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~-~~~~~~~~~~~~~~~~ 471 (602) +.. .+++++|+++.+++. |.+.+++++++++++|+||+||+|+++|+||++||+.+++ ++.+++++....+... 
T Consensus 325 ~~~---~~~~~~f~~~~l~~~--D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~ 399 (416) T protein:vir:81 325 DEY---VNREFKFDTTEIRVV--DEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQM 399 (416) T ss_pred ccc---cCceEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccCc Confidence 765 367899999999876 7788889999999999999999999999999999988765 4445554432111000 Q ss_pred cCcccccccccccccccccc Q lcl|NC_021537. 472 AEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~ 491 (602) .. ......+-.....+| T Consensus 400 ~~---~~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 400 NK---SRATDKKLKGGEENE 416 (416) T ss_pred cc---ccccccccCCCCCCC Confidence 00 000000000111111 No 51 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=100.00 E-value=6.5e-75 Score=427.30 Aligned_cols=461 Identities=16% Similarity=0.134 Sum_probs=290.5 Q ss_pred CCCCccccccc-chhhh----cc--c---CccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhcc-----------CceE Q lcl|NC_021537. 1 MSKAEETTQLD-ERHIA----TD--V---GRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAG-----------YGFE 59 (602) Q Consensus 1 ~~k~~~~~~~~-~~~~~----~~--~---~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~-----------~~~~ 59 (602) ++|+.+.++.- ...|. .. + ...+.|++++..+.+...+|++|++||++++++||. ++|. T Consensus 46 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~ 125 (563) T protein:vir:99 46 LTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFE 125 (563) T ss_pred HHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccce Confidence 66655544422 11121 11 0 125788999877777767789999999999999995 3344 Q ss_pred EEEecCCCC-cccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEe--eCCCCceEEEEE Q lcl|NC_021537. 
60 IVAHPSADE-PDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEIL--VEGDGTPVGLAH 136 (602) Q Consensus 60 i~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~--r~~~G~~~~L~~ 136 (602) |..+..... ..+.....+.+..++..+.+ ..+|+ ++|+.+|+++++.+++++||+|++++ |+..|++++|+| T Consensus 126 i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~----~~~p~-~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~p 200 (563) T protein:vir:99 126 VRLRDLDAEPGRKEKEEMKRIEDFIVNTGK----DKDVD-RDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIA 200 (563) T ss_pred eEEeecCCCcchhhhhhhHHHHHHhhhcCC----CCCCC-cchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEE Confidence 443332211 11112222223333332221 12222 47999999999999999999999876 788899999999 Q ss_pred eCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEE Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELI 216 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~evi 216 (602) |+|++|++..+..+.. ......|+|+. .++....+++++|| T Consensus 201 l~p~~V~v~~~~~g~~---------~~~~~~y~~~~------------------------------~g~~~~~~~~~evI 241 (563) T protein:vir:99 201 VDPSTIFYATDKKGKI---------IKGGKRFVQVV------------------------------DKRVVASFTSRELA 241 (563) T ss_pred eCCceeEEEECCCCce---------eccceeEEEEe------------------------------CCceeEEecCcceE Confidence 9999999865543211 11222233332 23345678899988 Q ss_pred EecCCCC---CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc-cCCHHHHHHHHHHHHH-hhcccc Q lcl|NC_021537. 217 FLPNPSP---LALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG-TLSEDSKEDLRNLMDN-LKGSRY 291 (602) Q Consensus 217 H~r~~~~---~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~~~~~~l~~~~~~-~~g~~n 291 (602) |++.+.. ..+.||+||+.++..+|....++++++.++|+||++|+|||++++. 
.+++++.+++++.|++ +.|..| T Consensus 242 ~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~n 321 (563) T protein:vir:99 242 MGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGING 321 (563) T ss_pred EEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccc Confidence 7654433 3378999999999999999999999999999999999999999864 5899999999999987 567789 Q ss_pred cCcc-eeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccC----------- Q lcl|NC_021537. 292 RTAI-LEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTS----------- 359 (602) Q Consensus 292 ag~~-~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~----------- 359 (602) +|++ +++++| ++|++++ ++++|+||+|++++++++||++|||||++||+.+.+ T Consensus 322 agk~~~vl~~G--------------~~~~~l~-~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~ 386 (563) T protein:vir:99 322 SWQIPVVMADD--------------IKFVNMT-PTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTL 386 (563) T ss_pred cccceEEcCCC--------------ceEEecc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccch Confidence 9997 566555 4566665 456899999999999999999999999999987654 Q ss_pred CccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHH--HHHHHhCCcccHHH Q lcl|NC_021537. 360 NRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQR--VRAMRLAGVGTVNE 437 (602) Q Consensus 360 ~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~--~~~~~~~G~~T~NE 437 (602) +++|++++.+.|+++||+||++.||++||++|++..+ .+++++|...+ .+.+.+. 
+..++++|+||+|| T Consensus 387 ~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~--~~~~~~f~r~D-------~~~~~e~~~~~~~~~~G~lT~NE 457 (563) T protein:vir:99 387 NEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYG--DKYTFQFVGGD-------TKSATDKLNILKLETQIFKTVNE 457 (563) T ss_pred hhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcc--cccEEEeccCC-------HHHHHHHHHHHHHhcCCccCHHH Confidence 5588999999999999999999999999999998754 35666664332 2233333 34578999999999 Q ss_pred HHHHhCCCCCCCCcccccccccc-ccccccccCCCc--Cccc----------ccccccccc-cccccccccccccccccc Q lcl|NC_021537. 438 AREELDLAPFEDDRGDMTLSEFE-AEFGADASDGDA--EAML----------TRSKAAPPL-ENKIGERDSVDVDVSKDP 503 (602) Q Consensus 438 ~R~~~Gl~p~~~g~~d~~~~~~~-~~~~~~~~~~~~--~~~~----------~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 503 (602) +|+++||+|++||+ .++.+++ ...+...+.... .... ..+.+.++. .........+......+. T Consensus 458 ~R~~~gl~Pi~gGD--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 535 (563) T protein:vir:99 458 AREEQGKKPIEGGD--IILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQI 535 (563) T ss_pred HHHHhCCCCCCCcc--eeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCcccccccccc Confidence 99999999999874 4443333 222211111000 0000 000000000 000000000000000000 Q ss_pred hhhhhcchhhhhhheecccccEEEEEEecccCCcceeeecc Q lcl|NC_021537. 504 IEQTTFSSSNLDEGLYDFGERELYLSFKRESGQNSLYVYVD 544 (602) Q Consensus 504 m~~~~v~ss~~~~~~yd~~~~~l~~~f~~~~~~~~~y~y~~ 544 (602) -......|....+-+ +--..+-.--|+ . 
T Consensus 536 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~------------~ 563 (563) T protein:vir:99 536 KGDDNVYRTQTSNKG-QGRKGEKSSDFK------------H 563 (563) T ss_pred ccccccccccCcccc-ccccCcCccccc------------C Confidence 001111111110000 000000000111 1 No 52 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=100.00 E-value=6.5e-75 Score=427.30 Aligned_cols=461 Identities=16% Similarity=0.134 Sum_probs=290.5 Q ss_pred CCCCccccccc-chhhh----cc--c---CccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhcc-----------CceE Q lcl|NC_021537. 1 MSKAEETTQLD-ERHIA----TD--V---GRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAG-----------YGFE 59 (602) Q Consensus 1 ~~k~~~~~~~~-~~~~~----~~--~---~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~-----------~~~~ 59 (602) ++|+.+.++.- ...|. .. + ...+.|++++..+.+...+|++|++||++++++||. ++|. T Consensus 46 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~ 125 (563) T protein:vir:95 46 LTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFE 125 (563) T ss_pred HHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccce Confidence 66655544422 11121 11 0 125788999877777767789999999999999995 3344 Q ss_pred EEEecCCCC-cccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEe--eCCCCceEEEEE Q lcl|NC_021537. 60 IVAHPSADE-PDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEIL--VEGDGTPVGLAH 136 (602) Q Consensus 60 i~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~--r~~~G~~~~L~~ 136 (602) |..+..... 
..+.....+.+..++..+.+ ..+|+ ++|+.+|+++++.+++++||+|++++ |+..|++++|+| T Consensus 126 i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~----~~~p~-~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~p 200 (563) T protein:vir:95 126 VRLRDLDAEPGRKEKEEMKRIEDFIVNTGK----DKDVD-RDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIA 200 (563) T ss_pred eEEeecCCCcchhhhhhhHHHHHHhhhcCC----CCCCC-cchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEE Confidence 443332211 11112222223333332221 12222 47999999999999999999999876 788899999999 Q ss_pred eCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEE Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELI 216 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~evi 216 (602) |+|++|++..+..+.. ......|+|+. .++....+++++|| T Consensus 201 l~p~~V~v~~~~~g~~---------~~~~~~y~~~~------------------------------~g~~~~~~~~~evI 241 (563) T protein:vir:95 201 VDPSTIFYATDKKGKI---------IKGGKRFVQVV------------------------------DKRVVASFTSRELA 241 (563) T ss_pred eCCceeEEEECCCCce---------eccceeEEEEe------------------------------CCceeEEecCcceE Confidence 9999999865543211 11222233332 23345678899988 Q ss_pred EecCCCC---CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc-cCCHHHHHHHHHHHHH-hhcccc Q lcl|NC_021537. 217 FLPNPSP---LALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG-TLSEDSKEDLRNLMDN-LKGSRY 291 (602) Q Consensus 217 H~r~~~~---~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~~~~~~l~~~~~~-~~g~~n 291 (602) |++.+.. ..+.||+||+.++..+|....++++++.++|+||++|+|||++++. 
.+++++.+++++.|++ +.|..| T Consensus 242 ~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~n 321 (563) T protein:vir:95 242 MGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGING 321 (563) T ss_pred EEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccc Confidence 7654433 3378999999999999999999999999999999999999999864 5899999999999987 567789 Q ss_pred cCcc-eeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccC----------- Q lcl|NC_021537. 292 RTAI-LEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTS----------- 359 (602) Q Consensus 292 ag~~-~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~----------- 359 (602) +|++ +++++| ++|++++ ++++|+||+|++++++++||++|||||++||+.+.+ T Consensus 322 agk~~~vl~~G--------------~~~~~l~-~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~ 386 (563) T protein:vir:95 322 SWQIPVVMADD--------------IKFVNMT-PTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTL 386 (563) T ss_pred cccceEEcCCC--------------ceEEecc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccch Confidence 9997 566555 4566665 456899999999999999999999999999987654 Q ss_pred CccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHH--HHHHHhCCcccHHH Q lcl|NC_021537. 360 NRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQR--VRAMRLAGVGTVNE 437 (602) Q Consensus 360 ~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~--~~~~~~~G~~T~NE 437 (602) +++|++++.+.|+++||+||++.||++||++|++..+ .+++++|...+ .+.+.+. 
+..++++|+||+|| T Consensus 387 ~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~--~~~~~~f~r~D-------~~~~~e~~~~~~~~~~G~lT~NE 457 (563) T protein:vir:95 387 NEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYG--DKYTFQFVGGD-------TKSATDKLNILKLETQIFKTVNE 457 (563) T ss_pred hhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcc--cccEEEeccCC-------HHHHHHHHHHHHHhcCCccCHHH Confidence 5588999999999999999999999999999998754 35666664332 2233333 34578999999999 Q ss_pred HHHHhCCCCCCCCcccccccccc-ccccccccCCCc--Cccc----------ccccccccc-cccccccccccccccccc Q lcl|NC_021537. 438 AREELDLAPFEDDRGDMTLSEFE-AEFGADASDGDA--EAML----------TRSKAAPPL-ENKIGERDSVDVDVSKDP 503 (602) Q Consensus 438 ~R~~~Gl~p~~~g~~d~~~~~~~-~~~~~~~~~~~~--~~~~----------~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 503 (602) +|+++||+|++||+ .++.+++ ...+...+.... .... ..+.+.++. .........+......+. T Consensus 458 ~R~~~gl~Pi~gGD--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 535 (563) T protein:vir:95 458 AREEQGKKPIEGGD--IILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQI 535 (563) T ss_pred HHHHhCCCCCCCcc--eeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCcccccccccc Confidence 99999999999874 4443333 222211111000 0000 000000000 000000000000000000 Q ss_pred hhhhhcchhhhhhheecccccEEEEEEecccCCcceeeecc Q lcl|NC_021537. 504 IEQTTFSSSNLDEGLYDFGERELYLSFKRESGQNSLYVYVD 544 (602) Q Consensus 504 m~~~~v~ss~~~~~~yd~~~~~l~~~f~~~~~~~~~y~y~~ 544 (602) -......|....+-+ +--..+-.--|+ . 
T Consensus 536 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~------------~ 563 (563) T protein:vir:95 536 KGDDNVYRTQTSNKG-QGRKGEKSSDFK------------H 563 (563) T ss_pred ccccccccccCcccc-ccccCcCccccc------------C Confidence 001111111110000 000000000111 1 No 53 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=100.00 E-value=6.4e-76 Score=432.82 Aligned_cols=513 Identities=15% Similarity=0.102 Sum_probs=318.2 Q ss_pred CCCCcccccc-------cchhhhc--ccCccccCCCCH------HHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021537. 1 MSKAEETTQL-------DERHIAT--DVGRGIQPPYNP------ETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPS 65 (602) Q Consensus 1 ~~k~~~~~~~-------~~~~~~~--~~~~~i~p~~~~------~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~ 65 (602) ++|....... ..+-+.. .....+....++ ..+.+.+..++.|++||++||++||++|++++++.+ T Consensus 71 ~kk~~i~~pfkkk~~~~~~d~f~~s~es~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsLPlklYrr~e 150 (945) T protein:vir:10 71 LKKEKIIVPYNHQEPPFKFNLFEYSPESLMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSKELEIYKHIE 150 (945) T ss_pred HHhhcccccccccccchhhhhhhccCccceecccccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccCceEEEEecc Confidence 2333322210 0111111 111111111222 245677778999999999999999999999998765 Q ss_pred CCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHH----HHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccc Q lcl|NC_021537. 66 ADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEE----VLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAAT 141 (602) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~----~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~ 141 (602) ++........ 
....|+.+.++.+||+.||+.+ |++.++.+++++||+|++++|+.+|++++|+||||++ T Consensus 151 dG~~~~~~kk-------~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii~L~pLdPs~ 223 (945) T protein:vir:10 151 DKHVNYYLKR-------IRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLVAITPVDGTT 223 (945) T ss_pred cCcccccccc-------cccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcc Confidence 5433221111 2234677788889999999887 6677889999999999999999999999999999999 Q ss_pred ccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEE-EecC Q lcl|NC_021537. 142 VRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELI-FLPN 220 (602) Q Consensus 142 v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~evi-H~r~ 220 (602) |++..+..+... ..|.+. ..++....++++++| |++. T Consensus 224 Vti~~ddDG~~~------------y~Yv~~------------------------------idG~~~~~v~a~DvIlhirn 261 (945) T protein:vir:10 224 IKPILSEDTGIV------------VGYVQE------------------------------VDGAIVAHFDKRDVVLFRQN 261 (945) T ss_pred eEEEEcCCCcEE------------EEEEEe------------------------------cCCceEEEecCCceEEEecc Confidence 987543321100 001111 122334567788866 5566 Q ss_pred CCCCCC--cccccHHHHHHHHHHHHHHHHHHHHHHHH-hcCCCceEEEecc---------ccCCHHHHHHHHHHHHHhhc Q lcl|NC_021537. 221 PSPLAL--YYGVPDWVAAMQTMGADQAAKEWNHDVFD-NLGIPHYAVKVTG---------GTLSEDSKEDLRNLMDNLKG 288 (602) Q Consensus 221 ~~~~~~--~~G~spl~~~~~~i~~~~~~~~~~~~~f~-ng~~p~gil~~~~---------~~~~~~~~~~l~~~~~~~~g 288 (602) +++.+. .+|+||+.++.+++..+.+++++++++|. 
||++|+|+|++++ +.+++++.+++++.|++..+ T Consensus 262 ~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~s 341 (945) T protein:vir:10 262 LTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMM 341 (945) T ss_pred CCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhC Confidence 665443 36999999999999999999999999995 7889999998753 45799999999999998877 Q ss_pred ccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHH Q lcl|NC_021537. 289 SRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQT 368 (602) Q Consensus 289 ~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~ 368 (602) +.|.|+++++++|+ ++++++ ++++|+||+|++++++++||++|||||++||+.+++++||++++. T Consensus 342 G~NnG~piVLdeGm--------------ef~pLs-~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~st~SNiEqq~ 406 (945) T protein:vir:10 342 GDYTQVPILSGGKF--------------TWIDFK-GKRRDMQFKELAEFVARKICAVYQVSPQDVGILEGSNKATAEVMA 406 (945) T ss_pred CcccccceecCCCc--------------eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcchHHHHH Confidence 78888877776654 555664 456899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCC Q lcl|NC_021537. 369 REFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFE 448 (602) Q Consensus 369 ~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~ 448 (602) ..|+++||+|++++||++||++|+.... ..+++++|+..++. 
|.+.+++++++++++|+||+||+|+++|+||++ T Consensus 407 ~~Fv~~tL~Pil~~IEqeLNrkLl~~~e-g~~i~fdFd~ldl~----D~ksraEal~kli~sGiLTiNEvRe~lGLpPIe 481 (945) T protein:vir:10 407 SLTKAKGLEPLMATISKGFDEVVSEFRN-EKDIKLWFKEDDLE----KERDWWNIIQGQLNTGFRSINEARMEKGLEPVP 481 (945) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccc-CceeEEEecchhcc----CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 9999999999999999999999876543 34567777766654 345677889999999999999999999999999 Q ss_pred CCccccccccccccccccccC--CCcCccccccccc-ccccccccccccccccccc-cchhhhhcchhhhhhheeccccc Q lcl|NC_021537. 449 DDRGDMTLSEFEAEFGADASD--GDAEAMLTRSKAA-PPLENKIGERDSVDVDVSK-DPIEQTTFSSSNLDEGLYDFGER 524 (602) Q Consensus 449 ~g~~d~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~m~~~~v~ss~~~~~~yd~~~~ 524 (602) ||+.......+..+....... +..+++......+ +..+............... ..++-....-..+.+..+..-.. T Consensus 482 GGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~psE~kda~~e~~~~l~~~~~~~a~e~i~~ 561 (945) T protein:vir:10 482 WGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSVPSEQKNAGLEVLRNLFKSLDANASENLKQ 561 (945) T ss_pred CcceeeeccccccccccccccccCCCCcccccCCCCCCCCCCCCCCCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHH Confidence 875432222233332211111 1111111111111 1111111111111111111 11111111223333433343343 Q ss_pred EEEEEEecccCCcceeeeccCCHHHHHHHhCCCccch-hhhhhhcccccccccccchhcccCCCCCChhh--cCCcccc- Q lcl|NC_021537. 525 ELYLSFKRESGQNSLYVYVDVPAAVWSALVSAPSAGS-YHYSEIRLQYGYLEVTNNHERLPEGPTPDPGE--APEDVPS- 600 (602) Q Consensus 525 ~l~~~f~~~~~~~~~y~y~~v~~~~~~~~~~a~s~g~-~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~- 600 (602) .+ ||.+...+| --.+.-...+. |.|- =..+.|+|.- +.--++..-|--. 
--.-|-+ T Consensus 562 ~~--e~~~~~~~~-------~~~~~~~~~~~--~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~ 621 (945) T protein:vir:10 562 VI--ELTNDDNYL-------KEKELLTRVLK--SVGLDSVSEFIENNS---------QTDVEVSAKDILSFKYNSLVEDE 621 (945) T ss_pred HH--hhcCCCchh-------HHHHHHHHHHH--HhhhHHHHHHHhcCC---------ccceeechhhhhhhhhhhhcccc Confidence 33 776533222 11111111111 1110 1223444321 1000000000000 0001111 Q ss_pred -------cC Q lcl|NC_021537. 601 -------DI 602 (602) Q Consensus 601 -------~~ 602 (602) || T Consensus 622 ~~~~~~~~~ 630 (945) T protein:vir:10 622 TIYATEKDI 630 (945) T ss_pred ceeecchhh Confidence 12 No 54 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=100.00 E-value=1.2e-75 Score=431.29 Aligned_cols=365 Identities=16% Similarity=0.149 Sum_probs=279.4 Q ss_pred CCCCcccccccchhh---h-cccCcc----ccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccc Q lcl|NC_021537. 1 MSKAEETTQLDERHI---A-TDVGRG----IQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEG 72 (602) Q Consensus 1 ~~k~~~~~~~~~~~~---~-~~~~~~----i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~ 72 (602) ++.+..+.+...+.. . +..+++ ......... .+.+..+++|++||++||++||++||++++..+.. T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t-~~~~~~~~~v~acV~~Ia~~iA~lpl~~~~~~~~~----- 106 (409) T protein:vir:83 33 VEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQ-DKLRTLIDVAWACIDLNASVLSSMPIYRMRNGRII----- 106 (409) T ss_pred eeccCCCcchhhhhcccccccccccccccccccCccccc-hhhHhhhHHHHHHHHHHHHhhccCceEEeeCCccc----- Confidence 222222222222111 1 111111 111111112 23344578999999999999999999988532110 Q ss_pred hhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEE-eeCCCCceEEEEEeCccccccccccccc Q lcl|NC_021537. 
73 GESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEI-LVEGDGTPVGLAHVPAATVRVRKTTTTI 151 (602) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i-~r~~~G~~~~L~~l~p~~v~~~~~~~~~ 151 (602) ....+.+..+||+.||+.+|++.++.++++ ||+|+++ .|+.+|.+++|+||+|+.|++..+.. T Consensus 107 -------------~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~~~~L~pl~p~~v~v~~~~~-- 170 (409) T protein:vir:83 107 -------------DSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGYPIRFRVVPPWLVNVELKKG-- 170 (409) T ss_pred -------------cchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEECCCCcEEEEEEECCcceEEEEcCC-- Confidence 112234567899999999999999999887 9999985 48899999999999999987643321 Q ss_pred ccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCccccc Q lcl|NC_021537. 152 EREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVP 231 (602) Q Consensus 152 ~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~s 231 (602) +..++++. ....++||||+|.+++.+++||+| T Consensus 171 -------------g~~~y~~~-----------------------------------~~~~~~eiiHir~~~~~~~~~G~s 202 (409) T protein:vir:83 171 -------------ARREYRIG-----------------------------------GLNVTDEILHIRYQGNTADAHGHG 202 (409) T ss_pred -------------ceEEEEEc-----------------------------------cccCccceEEeCCCCCCCCccccc Confidence 11112210 012357899999998899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccc Q lcl|NC_021537. 232 DWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGG 311 (602) Q Consensus 232 pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~ 311 (602) |+..+..+|....++++++.++|+||++|+|+|++++ .+++++.+++++.|++..+ .|+|+++++++|.++. 
T Consensus 203 pi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~~-~ls~e~~~~~~~~~~~~~~-~nag~~~il~~g~~~~------ 274 (409) T protein:vir:83 203 PLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVER-RLSETEAVDLMDRWIESRS-KYAGHPALVTGGATLN------ 274 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecCC-CCCHHHHHHHHHHHHHhhC-CccCccceecCCcccc------ Confidence 9999999999999999999999999999999999876 5899999999999987553 4888888887765432 Q ss_pred cccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccc---cCCccCHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021537. 312 SDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTS---TSNRANSKEQTREFAKGIIEPEQAKFSARLY 388 (602) Q Consensus 312 ~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~---~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln 388 (602) ++++ ++++||||+|++++++++||++|||||++||+.+ +.+|||+|++.+.|+++||.||+++||++|| T Consensus 275 -------~~~~-~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~ 346 (409) T protein:vir:83 275 -------QAKS-MSAQDLSLMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALD 346 (409) T ss_pred -------cccC-CCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2333 4678999999999999999999999999999654 3479999999999999999999999999999 Q ss_pred hhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccc Q lcl|NC_021537. 389 KIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEA 461 (602) Q Consensus 389 ~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~ 461 (602) ++|++.. .+++|+.+++++. |.+.+++++++++++|+||+||+|+++||||++||+. 
+ +...+ T Consensus 347 ~~Ll~~~-----~~~~f~~~~llr~--d~~~r~~~~~~~~~~G~lT~NE~R~~~glpp~~ggd~--l-~~~gv 409 (409) T protein:vir:83 347 RWALPSP-----QHLELNRDDYTRP--SLVERATAYKIMIEAGVMEPNEARAMERLHSEAAAVR--L-SGGGV 409 (409) T ss_pred HhhCCCC-----cEEEeehhhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcc--c-CCCCC Confidence 9999763 4799999999877 7778899999999999999999999999999998752 1 11111 No 55 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=100.00 E-value=8.8e-73 Score=415.62 Aligned_cols=465 Identities=16% Similarity=0.157 Sum_probs=292.3 Q ss_pred CCCCcccccc-cchhhh---cccCccccCCCCHH---HH----HHHHhhhHHHHHHHHHHHHhhcc-----------Cce Q lcl|NC_021537. 1 MSKAEETTQL-DERHIA---TDVGRGIQPPYNPE---TL----AAFQELNETHQACIRKKSRYEAG-----------YGF 58 (602) Q Consensus 1 ~~k~~~~~~~-~~~~~~---~~~~~~i~p~~~~~---~l----~~~~~~~~~v~~cI~~ia~~ia~-----------~~~ 58 (602) ++|+.++++. +...+. +..-+...+|+... .+ +.++ .||+|++||++||++||+ ++| T Consensus 45 ~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~-~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~ 123 (576) T protein:vir:96 45 LNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFG-NNPILNAIILTRSNQVAMYCQPSRYNERGLGF 123 (576) T ss_pred hccccCCccchhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhh-cCHHHHHHHHHHHHHHHhhhhhhhhccccccc Confidence 8888876664 233321 11113556665433 22 2333 479999999999999996 567 Q ss_pred EEEEecCCCCc-ccchhhHHHHHHhhhccchhhhhhccCCc-cCCHHHHHHHHHHHHHhcCCeEEEEeeC--CCCceEEE Q lcl|NC_021537. 59 EIVAHPSADEP-DEGGESYQTVRDFWYGSDSRWQIGPEGTA-MSTPEEVLELGRQDYHGIGWAALEILVE--GDGTPVGL 134 (602) Q Consensus 59 ~i~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~pn~-~~t~~~~~~~~~~d~l~~Gna~~~i~r~--~~G~~~~L 134 (602) .|..+...... 
.+.....+.+..+++ .++..|++ .+|+.+|++.++.|++++||+|++++++ +.|++++| T Consensus 124 ~i~lk~~~~~~~~~~~~~~~~l~~~l~------~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~L 197 (576) T protein:vir:96 124 EVRMRDLDAEPGKKEKEEIKRIENFIL------NTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDKF 197 (576) T ss_pred eeEEecCcCccchhhhHhhhhHHhhHh------hccCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCceEEE Confidence 77665544321 111112222223332 23333333 4799999999999999999999999855 46789999 Q ss_pred EEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhH Q lcl|NC_021537. 135 AHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANE 214 (602) Q Consensus 135 ~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~e 214 (602) +||+|.+|++..+..+... .....|+| ...++....+++++ T Consensus 198 ~pl~p~~V~v~~~~dg~~~---------~~~~~~~~------------------------------~~~~~~~~~~~~~d 238 (576) T protein:vir:96 198 IAVDPSTIFYATDKNGKII---------KGGKRFVQ------------------------------VINKKVVASFTSRE 238 (576) T ss_pred EEeCCceeEEEECCCCcee---------eeeeEEEE------------------------------ecCCceEEEecccc Confidence 9999999998655432110 00111222 12334566889999 Q ss_pred EEEecCCCCCC---CcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc-cCCHHHHHHHHHHHHH-hhcc Q lcl|NC_021537. 215 LIFLPNPSPLA---LYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG-TLSEDSKEDLRNLMDN-LKGS 289 (602) Q Consensus 215 viH~r~~~~~~---~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~~~~~~l~~~~~~-~~g~ 289 (602) |||++.+...+ +.||+||+.++..+|....++++++.++|+||++|+|||++++. .+++++.+++++.|++ +.|. 
T Consensus 239 ii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~ 318 (576) T protein:vir:96 239 MAMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGI 318 (576) T ss_pred eEEEeecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc Confidence 99876543333 67999999999999999999999999999999999999999764 5899999999999986 5677 Q ss_pred cccCcc-eeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccC--------- Q lcl|NC_021537. 290 RYRTAI-LEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTS--------- 359 (602) Q Consensus 290 ~nag~~-~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--------- 359 (602) .|+|++ +++++| ++|++++ ++++|+||+|++++++++||++|||||++||+.+.+ T Consensus 319 ~nag~~p~vl~~G--------------~~~~~ls-~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~ 383 (576) T protein:vir:96 319 NGSWQVPVVMADD--------------IKFVNMT-PTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGN 383 (576) T ss_pred cccccceeecCCC--------------ceEEecc-CChhhHHHHHHHHHhHHHHHHHhCCCHHHcccccccccccccccc Confidence 899985 666554 5677775 566899999999999999999999999999987544 Q ss_pred --CccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHH--HHHHhCCcccH Q lcl|NC_021537. 
360 --NRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRV--RAMRLAGVGTV 435 (602) Q Consensus 360 --~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~--~~~~~~G~~T~ 435 (602) ||+|+|++.+.|+++||+||+++||++||.+|++..+ .+++++|+..+ .+.+++.+ ..++.+|+||+ T Consensus 384 s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~--~~~~~~f~r~d-------~~~~~e~~~~~~~~~~G~lT~ 454 (576) T protein:vir:96 384 TLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISEYS--DKYVFQFVGGD-------TKSELDKIKILQEEVKTYKTV 454 (576) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcc--CceEEEeccCC-------HHHHHHHHHHHHHHhcCccCH Confidence 7899999999999999999999999999999998754 45667765433 22233333 34567899999 Q ss_pred HHHHHHhCCCCCCCCcccccccc-ccccccccccCCCcCcccccc----------------ccccccccccccccccccc Q lcl|NC_021537. 436 NEAREELDLAPFEDDRGDMTLSE-FEAEFGADASDGDAEAMLTRS----------------KAAPPLENKIGERDSVDVD 498 (602) Q Consensus 436 NE~R~~~Gl~p~~~g~~d~~~~~-~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~ 498 (602) ||+|+++||+|++||+ .++.+ ++...+...+.+..+....+. +..+..+...+.+...... T Consensus 455 NE~R~~~gl~piegGD--~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~~ 532 (576) T protein:vir:96 455 NEARKEKGLKPIEGGD--VLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGRESNDPT 532 (576) T ss_pred HHHHHHhCCCCCCCcc--eeccccccccccccccCCCCCCccccccccccccccCCCCCCCCCCCCCCCcccccccccCC Confidence 9999999999999875 33333 222222111110000000000 0000000000000000000 Q ss_pred -----ccccchhhhhcchhhhhhheecccccEEEEEEecccCCcceeeeccC Q lcl|NC_021537. 
499 -----VSKDPIEQTTFSSSNLDEGLYDFGERELYLSFKRESGQNSLYVYVDV 545 (602) Q Consensus 499 -----~~~~~m~~~~v~ss~~~~~~yd~~~~~l~~~f~~~~~~~~~y~y~~v 545 (602) ..+..+...+- +..|.++-...+- ..|.|...--.|.+- T Consensus 533 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~ 576 (576) T protein:vir:96 533 KIDSPVGTDGQLKDQD---NVKSQEGSNKGQG-----TKGKGNEKPSDFKNN 576 (576) T ss_pred CCCCccccccccCCCC---ccccccccccccc-----ccccCCCCcccccCC Confidence 01112221111 1111111111000 000000011112221 No 56 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=100.00 E-value=2e-73 Score=419.15 Aligned_cols=386 Identities=12% Similarity=0.070 Sum_probs=285.5 Q ss_pred CCCCcc-cccccchhhhcccCccccCCCC-----HHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEE-TTQLDERHIATDVGRGIQPPYN-----PETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~-~~~~~~~~~~~~~~~~i~p~~~-----~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) =+|+.. .++......+. ......++.- ...... +..+++|++||++||++||++||+++++.++..+.. T Consensus 18 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~~cI~~ia~~ia~~~~~~~~~~~~~~~~~--- 92 (413) T protein:vir:96 18 NKRSPTEESKAKDEIPKA-PQVVMTLPNFFKELISDGYTK-LSDSPEVRMAVDCIADLVSNMTIQLMQNGETGDKRI--- 92 (413) T ss_pred cCCCcchhhhhhcccccc-ccccccchhhHhhhccchhHH-HhhchHHHHHHHHHHHhhccCceEEEEecCCCcccc--- Confidence 111110 11111111100 0000011100 011122 345799999999999999999999998765432211 Q ss_pred hHHHHHHhhhccchhhh-hhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC-ceEEEEEeCcccccccccccccc Q lcl|NC_021537. 75 SYQTVRDFWYGSDSRWQ-IGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG-TPVGLAHVPAATVRVRKTTTTIE 152 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~-l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G-~~~~L~~l~p~~v~~~~~~~~~~ 152 (602) .|+... ++.+||+.||+.+||+.++.+++++||||++++|+.+| .+.+|+|++|++|++..+.. 
T Consensus 93 -----------~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~~~~--- 158 (413) T protein:vir:96 93 -----------KNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPISPYKVTFNVSDD--- 158 (413) T ss_pred -----------ccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEecCceeEEEEcCC--- Confidence 234444 44689999999999999999999999999999999887 57899999999998643211 Q ss_pred cccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecC-CCCCCCccccc Q lcl|NC_021537. 153 REDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPN-PSPLALYYGVP 231 (602) Q Consensus 153 ~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~-~~~~~~~~G~s 231 (602) ..++++ .. . ..+++++||||||. +++.++++|+| T Consensus 159 -------------~~~y~~-----------------------------~~-~--~~~~~~~evih~k~~~~~~~~~~G~s 193 (413) T protein:vir:96 159 -------------DLDYSI-----------------------------TF-D--NKEYDPSTLLHFVLNPSIERPFIGTG 193 (413) T ss_pred -------------eEEEEE-----------------------------ee-c--CcEEchhhEEEEeccCCCCCcccccc Confidence 111111 00 1 13578999999995 56778899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-hhcccccCcceeccCCccceecccc Q lcl|NC_021537. 232 DWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-LKGSRYRTAILEVEEFVDDHGLGDG 310 (602) Q Consensus 232 pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~g~~~~~~~~~ 310 (602) |+.++..+|....++++++.++|+||++|+++|++++ .+++++.+++++.|++ +.|..|+|++++++.|... T Consensus 194 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~------ 266 (413) T protein:vir:96 194 YKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDS-DSDELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVN------ 266 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-CCCHHHHHHHHHHHHHHhcCccccCceeeecCCccc------ Confidence 9999999999999999999999999999999999875 5899999999999976 5567899999988765432 Q ss_pred ccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021537. 
311 GSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKI 390 (602) Q Consensus 311 ~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~ 390 (602) ++.+...+++|+||+|++++++++||++|||||.+||..+ +.+++..+|+++||+||++.||++||++ T Consensus 267 -------~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~-----~~~~~~~~~~~~~l~P~~~~ie~~ln~~ 334 (413) T protein:vir:96 267 -------VQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVGT-----YNKDEFNNFINTKIMSIAQVIQQTYNKL 334 (413) T ss_pred -------ccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCc-----chHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 2223334568999999999999999999999999998532 3477888999999999999999999999 Q ss_pred cCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCC Q lcl|NC_021537. 391 IHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDG 470 (602) Q Consensus 391 Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~ 470 (602) |+++ +++++|+.+.+++. |.+.+++++.+++++|+||+||+|+++|++|+|+|+ ..+++.+++++....... T Consensus 335 ll~~-----~~~~~fd~~~ll~~--d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~~gd-~~~~~~n~~~~~~~~~~~ 406 (413) T protein:vir:96 335 IVEE-----DMYFSLNPRSLYNY--SLTEMVSAGAQMTQLNALRRNEFRNWVGMPPDAEMD-DLLVLENYLQQKDLVNQK 406 (413) T ss_pred hCCC-----CcEEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-eeeecccccchhhccccc Confidence 9864 57899999999876 777788999999999999999999999999998864 344566666554322222 Q ss_pred CcCcccc Q lcl|NC_021537. 
471 DAEAMLT 477 (602) Q Consensus 471 ~~~~~~~ 477 (602) ...+..| T Consensus 407 ~~~~~dt 413 (413) T protein:vir:96 407 KLIQDET 413 (413) T ss_pred CCCCCCC Confidence 2112212 No 57 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=100.00 E-value=4.6e-73 Score=417.15 Aligned_cols=413 Identities=16% Similarity=0.136 Sum_probs=285.7 Q ss_pred CCCCc------------ccccccc----------hhhhcccCccccC-CCCHHH-HHHHHhhhHHHHHHHHHHHHhhccC Q lcl|NC_021537. 1 MSKAE------------ETTQLDE----------RHIATDVGRGIQP-PYNPET-LAAFQELNETHQACIRKKSRYEAGY 56 (602) Q Consensus 1 ~~k~~------------~~~~~~~----------~~~~~~~~~~i~p-~~~~~~-l~~~~~~~~~v~~cI~~ia~~ia~~ 56 (602) -.+++ +...... +-..+-.++..++ +.+... -.+.+..+++|++||++||++||++ T Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~~~v~~~i~~Ia~~ia~l 89 (466) T protein:vir:81 10 TRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLATQAYQANGPVFACMLVRQLVFSSV 89 (466) T ss_pred ccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccchhhhhccHHHHHHHHHHHHhhccC Confidence 11111 0000000 0000000111111 111111 1334556799999999999999999 Q ss_pred ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC-------- Q lcl|NC_021537. 57 GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD-------- 128 (602) Q Consensus 57 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~-------- 128 (602) ||+++.+.+.+... ...++.+.|+.+||+.||+.+||+.++.+++++||||++++|+.. 
T Consensus 90 p~~~~~~~~~~~~~-------------~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~ 156 (466) T protein:vir:81 90 RFRWQRLRDGKPSD-------------TFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWV 156 (466) T ss_pred ceEEEEecCCceee-------------ccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccC Confidence 99998765432211 124667788999999999999999999999999999999999765 Q ss_pred CceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeE Q lcl|NC_021537. 129 GTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELK 208 (602) Q Consensus 129 G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 208 (602) |.+++|+||+|..|++..+...... ..|....++. ..++... T Consensus 157 g~~~~l~~l~~~~v~~~~~~~~~~~------------~~y~~~~~~~--------------------------~~~~~~~ 198 (466) T protein:vir:81 157 DVVVEERMVRGGRGELGGGQLGWRK------------VGYLYTEGGR--------------------------QSGNESV 198 (466) T ss_pred cceeEEEEecCcceEEEEcCCCceE------------EEEEEEecCc--------------------------cccccee Confidence 4589999999999987543322100 0011101100 1123456 Q ss_pred EechhHEEEecCC-CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-h Q lcl|NC_021537. 
209 NGPANELIFLPNP-SPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-L 286 (602) Q Consensus 209 ~~~~~eviH~r~~-~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~ 286 (602) +++++||||||.+ ++.++++|+||+..+.++|....++++++.++|+||++|++||++++ .+++++.+++++.|++ + T Consensus 199 ~~~~~dviHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~ 277 (466) T protein:vir:81 199 GFLAEDVVHFAPIPDPLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNP-MADPAAVKKWADEVNSKH 277 (466) T ss_pred eeccccEEEEcCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC-CCCHHHHHHHHHHHHHHh Confidence 7999999999975 57899999999999999999999999999999999999999999875 5899999999999976 5 Q ss_pred hcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcc---ccCCccC Q lcl|NC_021537. 287 KGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVT---STSNRAN 363 (602) Q Consensus 287 ~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~---~~~~~sn 363 (602) .|..|+|+++++++|+ ++++++ ++++|+||+|++++++++||++|||||++||+. ..++|+| T Consensus 278 ~g~~n~g~~~vl~~g~--------------~~~~l~-~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn 342 (466) T protein:vir:81 278 AGVDNAWKNLNLYPGA--------------DADVVG-SNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYSN 342 (466) T ss_pred cCccccccceEcCCCc--------------eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCcccccc Confidence 6778999999987665 455665 456899999999999999999999999999975 3578999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHH-------HHHHHHHhCCcccHH Q lcl|NC_021537. 364 SKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAE-------QRVRAMRLAGVGTVN 436 (602) Q Consensus 364 ~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~-------~~~~~~~~~G~~T~N 436 (602) +|++.+.|+++||.|++++||++||++|++..++ ..++++|+..++++. 
|.+.++ +.++.++++|+ |+| T Consensus 343 ~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~-~~~~~~f~~~~llr~--d~~~r~~~~~~~~~~~~~~~~~g~-t~n 418 (466) T protein:vir:81 343 YGQARRRLADGTAHPLWQNLSGCIGHVMPDMGPD-VRLWYDADDVPFLRE--DEKDAADIQKVRAETINTLITAGY-EPE 418 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccC-cceEEEecchhhhcc--CHHHHHHHHHHHHHHHHHHHHcCC-Chh Confidence 9999999999999999999999999999987665 457899999999877 444443 34778899995 999 Q ss_pred HHHHHhCCCCCCCCccccccccccccccccccCCCcCccccccccccccccccc Q lcl|NC_021537. 437 EAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIG 490 (602) Q Consensus 437 E~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (602) |+|+.+ ++|+...+......... ....+...+...+.......++..+ T Consensus 419 E~r~~~-----~~gd~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 419 SVVAAV-----NSGDLRLLKHTGLTSVQ-LLPPGVSASASSDTPTSGGADDNGN 466 (466) T ss_pred hccccc-----cCCccccccCCCcchhh-hcccccccccCCCCcccCCCCcCCC Confidence 999643 34432211111111111 1111111111000000000000000 No 58 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=100.00 E-value=2.2e-73 Score=418.88 Aligned_cols=347 Identities=14% Similarity=0.112 Sum_probs=275.6 Q ss_pred hccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCce Q lcl|NC_021537. 53 EAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTP 131 (602) Q Consensus 53 ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~ 131 (602) ||++||+++++.+.. .|+++.++. 
+||++||+.+||+.++.+++++||||++++|+..|++ T Consensus 1 ia~lp~~~~~~~~~~------------------~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~ 62 (348) T protein:vir:93 1 MASLPLKMYEDYKVV------------------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQP 62 (348) T ss_pred CcccceEeEecCcCc------------------ccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcE Confidence 999999998643211 255666665 8999999999999999999999999999999999999 Q ss_pred EEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEec Q lcl|NC_021537. 132 VGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGP 211 (602) Q Consensus 132 ~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 211 (602) ++|+||+|++|++..+.... . ..+.+...+|..++++ T Consensus 63 ~~L~~l~~~~v~~~~~~~~~--------------~-----------------------------~~y~~~~~~g~~~~~~ 99 (348) T protein:vir:93 63 SKLFLLNPDVVEMLIENQSR--------------E-----------------------------LYYSIHAATGNKLIVH 99 (348) T ss_pred EEEEEEcCCceEEEEeCCCc--------------E-----------------------------EEEEEEcCCCeEEEEc Confidence 99999999999865432110 0 0012233445667899 Q ss_pred hhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccc Q lcl|NC_021537. 
212 ANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRY 291 (602) Q Consensus 212 ~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~n 291 (602) ++||||||++++.++++|+||+..+..++....++++++ ++.++..++++++ .+..+++++.+++++.|++..+ | T Consensus 100 ~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~-~~~~l~~e~~~~~~~~~~~~~~--n 174 (348) T protein:vir:93 100 NMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLK-YGSNVSTEKRQQVLEDFKQYYE--E 174 (348) T ss_pred cccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHH--HHhcCCCceeEEe-cCCCCCHHHHHHHHHHHHHHhh--c Confidence 999999999888999999999999999999999998886 3333334445555 4567999999999999988763 6 Q ss_pred cCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHH Q lcl|NC_021537. 292 RTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREF 371 (602) Q Consensus 292 ag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f 371 (602) +++++++++|+ +|++++ ++++|+||+|++++++++||++|||||.+||..++++++|+|++.+.| T Consensus 175 ~~~~~vl~~g~--------------~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~~~ 239 (348) T protein:vir:93 175 NGGILFQEPGV--------------EIEPLP-KKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFY 239 (348) T ss_pred CCCeeecCCCc--------------eEEEcC-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHH Confidence 77888876654 556665 456899999999999999999999999999999899999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCc Q lcl|NC_021537. 372 AKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDR 451 (602) Q Consensus 372 ~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~ 451 (602) +++||+|+++.|+++||++|++..+...+++|+|+.+.+++. 
|.+.+++++.+++++|++|+||+|+++|++|+|||+ T Consensus 240 ~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~--d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD 317 (348) T protein:vir:93 240 LQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRA--DSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD 317 (348) T ss_pred HHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhcc--CHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcC Confidence 999999999999999999999998888899999999999876 778888999999999999999999999999999863 Q ss_pred cccccccccccccccccCCCcCcccccccccccccc Q lcl|NC_021537. 452 GDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLEN 487 (602) Q Consensus 452 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (602) ..+++.++.++..........+....... +. T Consensus 318 -~~~~~~n~~~~~~~~~~~~~~~gg~~n~~----~~ 348 (348) T protein:vir:93 318 -KPLISGDLYPIDTPLELRKSLKGGDKNVN----ES 348 (348) T ss_pred -eEeecccccccccchhhcccccCCCCCcC----CC Confidence 44556666655432111110000000000 00 No 59 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=100.00 E-value=3.9e-72 Score=412.04 Aligned_cols=444 Identities=14% Similarity=0.081 Sum_probs=283.1 Q ss_pred CCCCcccccccchh-h------hccc-Ccc----ccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhc-------------c Q lcl|NC_021537. 1 MSKAEETTQLDERH-I------ATDV-GRG----IQPPYNPETLAAFQELNETHQACIRKKSRYEA-------------G 55 (602) Q Consensus 1 ~~k~~~~~~~~~~~-~------~~~~-~~~----i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia-------------~ 55 (602) |.|+-.....--+. + .++. |.. 
+....++..|++.+.+++++++||+++++.|+ + T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~ 111 (535) T protein:vir:10 32 VNKAIRPGRASARDTVDGIDIADGNVAGQYSVASISDVLSTKKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVG 111 (535) T ss_pred HHhhhhhhhhhhhccccccccccCCcccccccCccccccCHHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCc Confidence 22222211110000 0 0110 111 11224677777777778888887777777665 5 Q ss_pred CceEEEEecCCCCcccchhhHHHHHHhhhccchhhh-hhccCCccCCHH----HHHHHHHHHHHhcC-CeEEEEeeCCCC Q lcl|NC_021537. 56 YGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQ-IGPEGTAMSTPE----EVLELGRQDYHGIG-WAALEILVEGDG 129 (602) Q Consensus 56 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~~pn~~~t~~----~~~~~~~~d~l~~G-na~~~i~r~~~G 129 (602) ++++++.++...... . ....|++.. |..+||++|++. +|+++++.|++++| ++|++++|+..| T Consensus 112 ~~i~l~~~~~~~~~~-~----------~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G 180 (535) T protein:vir:10 112 FKVELKDATKVMSKA-Q----------IKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQINIERIFKNDSN 180 (535) T ss_pred ceeEEEeccCCCcch-h----------hhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCceEEEEEECCCC Confidence 566665443221111 1 112244444 445799988765 46777788877665 789999999999 Q ss_pred ceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEE Q lcl|NC_021537. 130 TPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKN 209 (602) Q Consensus 130 ~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 209 (602) +|++|+||+|.+|++..+.... ..+..|+++ ..++.... 
T Consensus 181 ~~~~L~~l~p~~V~v~~d~~~~-----------~~~~~~~~~------------------------------~~~~~~~~ 219 (535) T protein:vir:10 181 ELDHFNAVDASKVVISYSPRSK-----------DQPRKFEQF------------------------------VSETKSVK 219 (535) T ss_pred cEEEEEEeCCceeEEEEcCccc-----------cCceEEEEE------------------------------ecCceeEE Confidence 9999999999999875443211 011122222 23345567 Q ss_pred echhHEEEecCCCCC---CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecc---ccCCHHHHHHHHHHH Q lcl|NC_021537. 210 GPANELIFLPNPSPL---ALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTG---GTLSEDSKEDLRNLM 283 (602) Q Consensus 210 ~~~~eviH~r~~~~~---~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~~~~~~~~~~l~~~~ 283 (602) ++++||||++.+++. ++.||+||+.++..+|..+.++++|+.++|+||++|+|||++++ ..+++++.+++++.| T Consensus 220 ~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~ 299 (535) T protein:vir:10 220 FSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQW 299 (535) T ss_pred ECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHH Confidence 999999999986543 46789999999999999999999999999999999999999975 358999999999999 Q ss_pred HH-hhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCcc Q lcl|NC_021537. 284 DN-LKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRA 362 (602) Q Consensus 284 ~~-~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s 362 (602) ++ +.|..|+|+++++.+ .+++|++++. 
+++|+||+|++++++++||++|||||++||+.+++||+ T Consensus 300 ~~~~~G~~nag~~~vl~~-------------~g~~~~~l~~-~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~s 365 (535) T protein:vir:10 300 TSQGSGLGGAWKIPILAA-------------KDAKFVNMTQ-NSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGST 365 (535) T ss_pred HHHhcCcccccccccccC-------------CCceEEecCC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccc Confidence 76 557789999877653 1356666664 56899999999999999999999999999999888776 Q ss_pred C------------HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhC Q lcl|NC_021537. 363 N------------SKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLA 430 (602) Q Consensus 363 n------------~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~ 430 (602) | +|++...|+++||+||+++||++||++|++..+. + ++|+++.+++. |.+.++++++.+ .+ T Consensus 366 n~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~~~--~--~~f~f~~l~~~--d~~~r~~~~~~~-~~ 438 (535) T protein:vir:10 366 GKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMRYVDT--D--YRFSFTLGDAQ--DKLQEEQVWKLK-LA 438 (535) T ss_pred cchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccCC--e--EEEEecccccc--CHHHHHHHHHHH-Hc Confidence 4 5677788999999999999999999999976542 3 45555666666 455566766644 46 Q ss_pred CcccHHHHHHHhCCCCCCCCcccccc--ccccccccc--cccCCCcCc-ccc--ccccccccccccccccccccc-c--c Q lcl|NC_021537. 431 GVGTVNEAREELDLAPFEDDRGDMTL--SEFEAEFGA--DASDGDAEA-MLT--RSKAAPPLENKIGERDSVDVD-V--S 500 (602) Q Consensus 431 G~~T~NE~R~~~Gl~p~~~g~~d~~~--~~~~~~~~~--~~~~~~~~~-~~~--~~~~~~~~~~~~~~~~~~~~~-~--~ 500 (602) |+||+||+|+++||||++||+..... ..+....+. ++..++..+ ... .....+...+........+.. . . 
T Consensus 439 g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~ 518 (535) T protein:vir:10 439 NGYFINEYRKDHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPL 518 (535) T ss_pred CCCCHHHHHHHhCCCCCCCccccccccchhhcccccccccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCC Confidence 88999999999999999988643211 112211111 111111110 000 000000000010101111111 1 1 Q ss_pred ccchhhhhcchhhhhhheecccc Q lcl|NC_021537. 501 KDPIEQTTFSSSNLDEGLYDFGE 523 (602) Q Consensus 501 ~~~m~~~~v~ss~~~~~~yd~~~ 523 (602) .++++-..++. --|..+ T Consensus 519 ~~~~~~~~~~~------~~~~~~ 535 (535) T protein:vir:10 519 PKPSESDDVSN------NEDADT 535 (535) T ss_pred CcCCCCCcccc------ccccCC Confidence 11111111111 012222 No 60 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=100.00 E-value=1.3e-71 Score=409.20 Aligned_cols=392 Identities=15% Similarity=0.089 Sum_probs=288.2 Q ss_pred CCCCcccccccchhhhcccCccccC-CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQP-PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTV 79 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p-~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~ 79 (602) -++.............+..+....+ ..+.. .+..+++|++||++||++||+++|+++++.+.+.... T Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~-------- 77 (406) T protein:vir:95 10 TKRKSKIRADTGYVGLFMSGEDVSFLVPGYV----RLSDNPEVRMAVHKIADLISSMTIYLMQNTEDGDIRI-------- 77 (406) T ss_pred ccccccccccchhhhhhccCcccCccccCHH----HHhhcHHHHHHHHHHHHhhccCceEEEEecCCcceee-------- Confidence 1222212111111111111111111 12222 2345799999999999999999999987765432211 Q ss_pred HHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCe--EEEEeeCCCCceEEEEEeCcccccccccccccccccch Q lcl|NC_021537. 
80 RDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWA--ALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGE 157 (602) Q Consensus 80 ~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna--~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~ 157 (602) ..+..+.++.+||+.||+.+||+.++.++++.|++ |+.+.|+..|++.+||||+|.+|++..+..+ T Consensus 78 -----~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~~~~------- 145 (406) T protein:vir:95 78 -----RNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDTPDG------- 145 (406) T ss_pred -----cchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEcCCe------- Confidence 12344567789999999999999999999999665 6677899999999999999999987543211 Q ss_pred hhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecC-CCCCCCcccccHHHHH Q lcl|NC_021537. 158 EVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPN-PSPLALYYGVPDWVAA 236 (602) Q Consensus 158 ~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~-~~~~~~~~G~spl~~~ 236 (602) | + +..++ .+|+++||||+|. .++.++++|+||+..+ T Consensus 146 ----------~-~------------------------------~~~~~--~~~~~~evih~~~~~~~~~~~~G~s~i~~~ 182 (406) T protein:vir:95 146 ----------Y-Q------------------------------VLYGG--QTFNYDEVLHFIYNPDPERPYIGRGYRVVL 182 (406) T ss_pred ----------E-E------------------------------EEecc--EEEchhHEEEeeccCCCCCCccccCHHHHH Confidence 1 0 00111 3688999999996 5778889999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-hhcccccCcceeccCCccceeccccccccc Q lcl|NC_021537. 237 MQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-LKGSRYRTAILEVEEFVDDHGLGDGGSDVN 315 (602) Q Consensus 237 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~g~~~~~~~~~~~~~~ 315 (602) ..++....++++++.++|+||++|+++|++++ .+++++.+++++.|.+ +.|..|+++++++..+.. 
T Consensus 183 ~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~-~l~~e~~~~~~~~~~~~~~g~~n~~~~~v~~~~~~------------ 249 (406) T protein:vir:95 183 KDIADNLKQATATKKSFMSGKYMPSLIVKVDA-ATAELSSEEGRNAVFKKYLQATEAGQPWIIPAELL------------ 249 (406) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHHHHHHHHHHhccccccCCceeecCCCc------------ Confidence 99999999999999999999999999999876 4799999999999965 667789999888765421 Q ss_pred cccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcc Q lcl|NC_021537. 316 IELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDA 395 (602) Q Consensus 316 ~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~ 395 (602) +++++..++++|+||+|++++++++||++|||||++||..+ +.+++..+|++.||+|+++.||++||++|+++. T Consensus 250 -~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~-----~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~ 323 (406) T protein:vir:95 250 -EVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIGE-----FNRDEYNNFINSTILPIAKGIEQELTRKLLISP 323 (406) T ss_pred -cccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-----chHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 23344445568999999999999999999999999998532 457888999999999999999999999998763 Q ss_pred ccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCcc Q lcl|NC_021537. 396 LDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAM 475 (602) Q Consensus 396 ~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~ 475 (602) +++++|+++.+++. |.+.+++.+.+++++|+||+||+|+++|++|+|+|+ ..+++.++.++..........+. T Consensus 324 ----~~~~~fd~~~l~~~--d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~gd-~~~~~~n~~~~~~~~~~~~~k~g 396 (406) T protein:vir:95 324 ----DLYFKFNPRSLYAY--DLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEGLS-ELVILENYIPLDKIGDQSKLKGG 396 (406) T ss_pred ----CcEEEeechhhhcC--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-eeeeccCccchhhcccccccCCC Confidence 56899999999776 677788999999999999999999999999998864 44456666655432221111111 Q ss_pred cccccccccccccccc Q lcl|NC_021537. 
476 LTRSKAAPPLENKIGE 491 (602) Q Consensus 476 ~~~~~~~~~~~~~~~~ 491 (602) .... ++...+ T Consensus 397 ~~~~------~~~~~~ 406 (406) T protein:vir:95 397 DNSG------ADGQTD 406 (406) T ss_pred CCCC------CCCCCC Confidence 1110 000011 No 61 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=100.00 E-value=7.7e-72 Score=410.45 Aligned_cols=388 Identities=12% Similarity=0.045 Sum_probs=280.4 Q ss_pred CCCCcccccccchhhhcccC-ccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVG-RGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTV 79 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~-~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~ 79 (602) .+++...+.+....+..... +.....++... +..++.|++||++||++||++||++. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~----al~~~~V~~~v~~ia~~ia~~p~~~~------------------ 64 (397) T protein:vir:38 7 NKSHSQGFSLNDPDWVNFLTGGEAQKYVSADT----ALKNSDIFSLIMQLSGDLAMVRYTSE------------------ 64 (397) T ss_pred hhcccCcccCCchhhhhhhcCCcCCceechHH----hhccHHHHHHHHHHHHHHhhCccccc------------------ Confidence 22222222222111111101 00011133322 33478999999999999999998642 Q ss_pred HHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhh Q lcl|NC_021537. 
80 RDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEV 159 (602) Q Consensus 80 ~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~ 159 (602) .+..+.++.+||+.||+.+||+.++.+++++||||++++|+.+|++++|+||+|++|++..+..+ T Consensus 65 ------~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~--------- 129 (397) T protein:vir:38 65 ------SDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDG--------- 129 (397) T ss_pred ------ccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC--------- Confidence 13345688899999999999999999999999999999999999999999999999987543321 Q ss_pred hhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHH Q lcl|NC_021537. 160 ENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQT 239 (602) Q Consensus 160 ~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~ 239 (602) +..++++.. .....+..++++++||||++.+++.+.+||+||+.++..+ T Consensus 130 -----~~~~y~~~~--------------------------~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~ 178 (397) T protein:vir:38 130 -----SGLIYNINF--------------------------DEPAIGYMENVPAADVIHIRLLSKNGGKTGISPLSALINE 178 (397) T ss_pred -----ceEEEEEEe--------------------------ccccccceeEecCccEEEecCCCCCCccccccHHHHHHHH Confidence 011111100 0112345678999999999999988889999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccc Q lcl|NC_021537. 240 MGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELE 319 (602) Q Consensus 240 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~ 319 (602) |....++++++.++|+||++|+++|++++. 
+++++.+++++.|+...++.|+++++++++|+ +|+ T Consensus 179 i~~~~~~~~~~~~~f~ng~~~~~il~~~~~-~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~--------------~~~ 243 (397) T protein:vir:38 179 QQIKDASNELTLKALKQSVTASAVLTIQKG-GLLDAETRIARSKEISKQIHNSDGPVVIDALE--------------DYK 243 (397) T ss_pred HHHHHHHHHHHHHHHhccCCccEEEEeCCC-CCHHHHHHHHHHHHHHhcccccCCceecCCCc--------------eEE Confidence 999999999999999999999999999864 78889999999999988889999998887654 455 Q ss_pred cccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccccc Q lcl|NC_021537. 320 PIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVD 399 (602) Q Consensus 320 pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~ 399 (602) +++ +++.|+||+|++++++++||++|||||.+||..+++ ++|++++ ..|+.+||+|+++.|+++||++|+++.+ T Consensus 244 ~l~-~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~-~~~~e~~-~~~~~~~l~P~~~~ie~~ln~~l~~~~~--- 317 (397) T protein:vir:38 244 PLE-VKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQ-QSSITQI-SGQYAKSLNRYVQAIVGELNDKLHANIS--- 317 (397) T ss_pred ecC-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc-ccHHHHH-HHHHHHHHHHHHHHHHHHHHHhccChhc--- Confidence 555 456799999999999999999999999999987654 4677654 5678899999999999999999998643 Q ss_pred ceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccccccccccc-ccccCCCcCccc-c Q lcl|NC_021537. 400 EWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFG-ADASDGDAEAML-T 477 (602) Q Consensus 400 ~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~-~~~~~~~~~~~~-~ 477 (602) +.++|.+ +. |.+.+++++++++++|+||+||+|+++|++|+++++....-........ ...++++..+.. 
+ T Consensus 318 -~~~~~~~----~~--d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 390 (397) T protein:vir:38 318 -ANIRFAI----DA--MGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKEPQQAIQLIQQEGGENDGNNSD 390 (397) T ss_pred -ccccccc----cC--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccccccccccccccccccccCCCCCCCCC Confidence 3444433 23 5667788999999999999999999999999987752111111111111 111111111111 1 Q ss_pred ccccccc Q lcl|NC_021537. 478 RSKAAPP 484 (602) Q Consensus 478 ~~~~~~~ 484 (602) ++..+|. T Consensus 391 e~~~~~~ 397 (397) T protein:vir:38 391 ERGSDPE 397 (397) T ss_pred CCCCCCC Confidence 1111111 No 62 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=100.00 E-value=8.6e-72 Score=410.19 Aligned_cols=388 Identities=17% Similarity=0.140 Sum_probs=282.0 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) +.|-. ..+.-.+.... .. -.+...+....+.+..+++|++||++||+.||++||+++.+.......... T Consensus 8 ~~~~~-~~~~~~~~~~~-~~--~~~~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~~~~~~~~------- 76 (403) T protein:vir:10 8 TEKLN-PGQRIIRDMEP-VS--HRTNRKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGDKYNIVTYANGV------- 76 (403) T ss_pred hhccc-hhhhhhhcccc-cc--cccCCcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEeeccccccccccc------- Confidence 22221 11111111100 00 011122222334555689999999999999999999998664432211110 Q ss_pred Hhhhccchhh-hhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhh Q lcl|NC_021537. 
81 DFWYGSDSRW-QIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEV 159 (602) Q Consensus 81 ~~~~~~~~~~-~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~ 159 (602) ..++++ .|+.+||+.||+.+|++.++.+++++||||+++.+ ..|++||++.|++..+.... T Consensus 77 ----~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~~------~~l~~l~~~~~~v~~~~~~~-------- 138 (403) T protein:vir:10 77 ----KTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWDG------TSLYHVPAALMQVEADANKF-------- 138 (403) T ss_pred ----ccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEeC------ceeEeecCcceEEEEcCCce-------- Confidence 012333 35558999999999999999999999999988743 25899999988765432110 Q ss_pred hhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCC----CCCCcccccHHHH Q lcl|NC_021537. 160 ENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPS----PLALYYGVPDWVA 235 (602) Q Consensus 160 ~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~----~~~~~~G~spl~~ 235 (602) .+. +.. +....+++++|||++..+ +.++++|+||+.+ T Consensus 139 -------~~~------------------------------~~~--~~~~~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~ 179 (403) T protein:vir:10 139 -------IKK------------------------------FIF--NNQINYRVDEIIFIKDNSYVCGTNSQISGQSRVAT 179 (403) T ss_pred -------EEE------------------------------EEe--cCceeecccceEEecccccccCCCCCcccccHHHH Confidence 000 000 112357789999999654 3578999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH-hhcccccCcceeccCCccceecccccccc Q lcl|NC_021537. 
236 AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN-LKGSRYRTAILEVEEFVDDHGLGDGGSDV 314 (602) Q Consensus 236 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~g~~~~~~~~~~~~~ 314 (602) +..++....++++++.++|+||++|++||+.++ .+++++.+++++.|++ +.|..|+|+++++++|++ T Consensus 180 ~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~----------- 247 (403) T protein:vir:10 180 VIDSLEKRSKMLNFKEKFLDNGTVIGLILETDE-ILNKKLRERKQEELQLDYNPSTGQSSVLILDGGMK----------- 247 (403) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHHHHHHHHHHhCCcccCcceeecCCCce----------- Confidence 999999999999999999999999999999875 5899999999999986 456789999999887654 Q ss_pred ccccccccc-cchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC Q lcl|NC_021537. 315 NIELEPIGA-REDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQ 393 (602) Q Consensus 315 ~~~~~pl~~-~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~ 393 (602) +++++. .+++|+||+|++++++++||++|||||++||. ++++|++++.+.|+++||.||++.|+++||++| T Consensus 248 ---~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~---~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~~L-- 319 (403) T protein:vir:10 248 ---AKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDG---GNNANIRPNIELFYYMTIIPMLNKLTSSLTFFF-- 319 (403) T ss_pred ---eEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCC---CCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhc-- Confidence 455543 35679999999999999999999999999974 578899999999999999999999999999988 Q ss_pred ccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccc-cCCCc Q lcl|NC_021537. 394 DALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADA-SDGDA 472 (602) Q Consensus 394 ~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~-~~~~~ 472 (602) +++++|+++.+.....|.+.+++++++++++|+||+||+|+++|++|+++++++.+..+........+ .+++. 
T Consensus 320 ------~~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~~d~~~~p~n~~~~~~~~~~~e~ 393 (403) T protein:vir:10 320 ------GYKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQMNKIRIPANVAGSATGVSGQEG 393 (403) T ss_pred ------CceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccccccccccccccccCCCCcC Confidence 34677888766444557888899999999999999999999999999998888877655443322222 22222 Q ss_pred Cccccccccc Q lcl|NC_021537. 473 EAMLTRSKAA 482 (602) Q Consensus 473 ~~~~~~~~~~ 482 (602) +++.+....+ T Consensus 394 ~~~~~~~~g~ 403 (403) T protein:vir:10 394 GRPKGSTEGD 403 (403) T ss_pred CCCCCCcCCC Confidence 2221111111 No 63 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=100.00 E-value=1.6e-71 Score=408.65 Aligned_cols=391 Identities=15% Similarity=0.093 Sum_probs=281.6 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) -+|+..........+.+..+... ...+. ...+ ..+|+|++||++||++||++|++++++.+.+... T Consensus 8 ~~k~~~~~~~~~~~~~~~~~~~~-~~~~~--~~~~-~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~~---------- 73 (403) T protein:vir:80 8 RRKTRSEPTNAISWFLTQEAYDT-LAIPG--YTRL-SDNPEVRMAVHKIAELISSMTIHLMQNTDNGDIR---------- 73 (403) T ss_pred cccccccccchhhhhcccccccc-cccch--hhhh-hhhHHHHHHHHHHHHhhhhCceEEEEecCCceee---------- Confidence 22322211111111211111000 01111 1233 3468999999999999999999998765443211 Q ss_pred Hhhhccchhhh-hhccCCccCCHHHHHHHHHHHHHh--cCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccch Q lcl|NC_021537. 
81 DFWYGSDSRWQ-IGPEGTAMSTPEEVLELGRQDYHG--IGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGE 157 (602) Q Consensus 81 ~~~~~~~~~~~-l~~~pn~~~t~~~~~~~~~~d~l~--~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~ 157 (602) ..++... ++.+||+.||+.+||+.++.++++ +||||+++.|+..|++.+||||+|+.|++..+..++ T Consensus 74 ----~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g~------ 143 (403) T protein:vir:80 74 ----IKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTGY------ 143 (403) T ss_pred ----cCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCce------ Confidence 1234444 445899999999999999999998 588999999999999999999999999764332210 Q ss_pred hhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecC-CCCCCCcccccHHHHH Q lcl|NC_021537. 158 EVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPN-PSPLALYYGVPDWVAA 236 (602) Q Consensus 158 ~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~-~~~~~~~~G~spl~~~ 236 (602) .+ +. . ...++++|||||+. +++.++++|+||+..+ T Consensus 144 ------------~~-----------------------------~y-~--~~~~~~~eiih~~~~~~~~~~~~G~s~~~~~ 179 (403) T protein:vir:80 144 ------------QI-----------------------------WY-Q--GKAYNYDEVLHFIVNPDPEKPYMGRGYRVVL 179 (403) T ss_pred ------------EE-----------------------------EE-e--ecccchhhEEEEeccCCCcCccccccHHHHH Confidence 00 00 0 12478899999994 6778889999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHH-HhhcccccCcceeccCCccceeccccccccc Q lcl|NC_021537. 237 MQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMD-NLKGSRYRTAILEVEEFVDDHGLGDGGSDVN 315 (602) Q Consensus 237 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~-~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~ 315 (602) ..++....++++++.++|+||++|++||++++. +++++.+++++.|. .+.+..|+|++++++.+.. .. 
T Consensus 180 ~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~----------~~ 248 (403) T protein:vir:80 180 KDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAA-TAELSSEEGRNAVFKKYLEASEAGQPWIIPAELL----------DV 248 (403) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-CChHHHHHHHHHHHHHHhhhhhcCCeeeeccccc----------cc Confidence 999999999999999999999999999998764 67777788888874 5667889999888765421 11 Q ss_pred cccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcc Q lcl|NC_021537. 316 IELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDA 395 (602) Q Consensus 316 ~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~ 395 (602) .+++++ +.+|+||+|.+++++.+||++|||||++||..+ ..++...+|+++||.|+++.||++||++|+++. T Consensus 249 ~~~~~l---~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-----~~~~~~~~f~~~~l~P~~~~ie~~l~~kll~~~ 320 (403) T protein:vir:80 249 EQVKPL---SLKDLAIHETVELDKRTVAGIFGVPAFLLGVGK-----YDKDEYNNFINSTILPIAKGIEQELTRKLLISP 320 (403) T ss_pred ceeccC---CHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC-----ccHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC Confidence 234444 467999999999999999999999999998532 223455679999999999999999999998764 Q ss_pred ccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCcc Q lcl|NC_021537. 396 LDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAM 475 (602) Q Consensus 396 ~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~ 475 (602) +++++|+.+.+++. |.+.+++++.+++++|+||+||+|+++||||+++|+ ..+++.+++++....+.....+. 
T Consensus 321 ----~~~~~f~~~~ll~~--d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd-~~~~~~n~~pl~~~~~~~~~k~g 393 (403) T protein:vir:80 321 ----DLYFKFNPRSLYAY--DLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLS-ELVILENYIPLDKIGDQNKLKGG 393 (403) T ss_pred ----CcEEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eEeecccccchhhccchhhccCC Confidence 47899999999876 777788999999999999999999999999999864 45556666665422221111111 Q ss_pred cccccccccccccccc Q lcl|NC_021537. 476 LTRSKAAPPLENKIGE 491 (602) Q Consensus 476 ~~~~~~~~~~~~~~~~ 491 (602) ..... ....+ T Consensus 394 e~~~~------~~~~~ 403 (403) T protein:vir:80 394 EKGGA------DGQTD 403 (403) T ss_pred CCCCC------CCCCC Confidence 10000 00001 No 64 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=100.00 E-value=9.6e-71 Score=404.44 Aligned_cols=369 Identities=10% Similarity=0.031 Sum_probs=279.7 Q ss_pred CCCCccccc---ccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHH Q lcl|NC_021537. 1 MSKAEETTQ---LDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQ 77 (602) Q Consensus 1 ~~k~~~~~~---~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~ 77 (602) -.|.+.... .....+....++...+.++... +..+++|++||++||++||++||+++.+ T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~----al~~~~v~~~i~~ia~~ia~~p~~v~~~-------------- 70 (385) T protein:vir:10 9 FNKRKAKNMVYPSNPAFFTTTVGGMQLSYVSALS----ALQNTNVYSVINRIASDVASAHFKTENT-------------- 70 (385) T ss_pred cccccccccccccchhhhhhhccccCccccCHHH----hhccHHHHHHHHHHHHHHhhCceeeecc-------------- Confidence 112221111 1111122212222223344433 3346889999999999999999998632 Q ss_pred HHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccch Q lcl|NC_021537. 
78 TVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGE 157 (602) Q Consensus 78 ~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~ 157 (602) ....++.+||+.||+.+||+.++.+++++||||++++|+ +.+++|+++.+|++..+... T Consensus 71 ----------~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~----~~~~~p~~~~~v~~~~~~~~------- 129 (385) T protein:vir:10 71 ----------ATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMG------- 129 (385) T ss_pred ----------chhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCCceEEEEEcCCc------- Confidence 223467799999999999999999999999999999875 46788888777765432211 Q ss_pred hhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCC--CCCcccccHHHH Q lcl|NC_021537. 158 EVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSP--LALYYGVPDWVA 235 (602) Q Consensus 158 ~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~--~~~~~G~spl~~ 235 (602) .+++ .....++..++++++||||||.+++ .++++|+||+.. T Consensus 130 ---------~~~~----------------------------~~~~~~~~~~~~~~~eiihik~~~~~~~~~~~G~s~i~~ 172 (385) T protein:vir:10 130 ---------IVYT----------------------------VLESNDRPQMVLRQDQMLHFRLMPDPQYRYLIGRSPLES 172 (385) T ss_pred ---------eEEE----------------------------EEEcCCceEEEEccccEEEeccCCCCcccccccccHHHH Confidence 0000 1122345567899999999998654 557899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccc Q lcl|NC_021537. 
236 AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVN 315 (602) Q Consensus 236 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~ 315 (602) +..+|....++++++.++|+||++|+++|++++...++++.+++++.|++..++.|+|+++++++|+ T Consensus 173 ~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~------------- 239 (385) T protein:vir:10 173 LQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPDGF------------- 239 (385) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCCCc------------- Confidence 9999999999999999999999999999999987778999999999999988889999999987665 Q ss_pred cccccccccchHHHHHH-HHHHhhHHHHHHHhcCChHHhhcc--ccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021537. 316 IELEPIGAREDLDMEFQ-AFRERNEHEIAKVHGVPPVLINVT--STSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIH 392 (602) Q Consensus 316 ~~~~pl~~~~~~d~qf~-e~~~~~~~~Ia~~fgVPp~~lg~~--~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll 392 (602) ++++++ +++.|+||+ |++++++++||++|||||++||.. ++++++|+|++... +.+||.|+++.|+++||.+|+ T Consensus 240 -~~~~l~-~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~-~~~~l~P~~~~ie~~l~~~l~ 316 (385) T protein:vir:10 240 -DYTQLE-MKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKAT-YLANLNSYVNPIVDELRLKMN 316 (385) T ss_pred -eEEecC-CChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHH-HHHHHHHHHHHHHHHHHHhhC Confidence 455555 456789975 999999999999999999999864 45678999877555 467999999999999999997 Q ss_pred CccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCc Q lcl|NC_021537. 393 QDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDA 472 (602) Q Consensus 393 ~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~ 472 (602) +. .++|+.+.+++. |.+.+++++++++++|+||+||+|+++|++|+++++++....+..... +++. 
T Consensus 317 ~~-------~~~f~~~~ll~~--d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~~~~~~~~~~~-----~g~~ 382 (385) T protein:vir:10 317 AP-------DLELDIKDMLDV--DDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKPLTTQVK-----GGDE 382 (385) T ss_pred Cc-------eEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCccccCcccccC-----CCCC Confidence 53 488999998877 777888999999999999999999999999999877665543322211 1111 Q ss_pred Ccc Q lcl|NC_021537. 473 EAM 475 (602) Q Consensus 473 ~~~ 475 (602) ++- T Consensus 383 ~dn 385 (385) T protein:vir:10 383 GDN 385 (385) T ss_pred CCC Confidence 111 No 65 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=100.00 E-value=1.6e-69 Score=397.81 Aligned_cols=367 Identities=10% Similarity=0.052 Sum_probs=274.2 Q ss_pred CCCCcccccc---cchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHH Q lcl|NC_021537. 1 MSKAEETTQL---DERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQ 77 (602) Q Consensus 1 ~~k~~~~~~~---~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~ 77 (602) -.|....... ...-+....++.-...++.. .+..+++|++||++||++||++||+++.+ T Consensus 9 ~~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~----~~l~~~~v~~~i~~ia~~ia~~~~~~~~~-------------- 70 (383) T protein:vir:10 9 FSKRNAKNMVYPSNPAFFTTTVGGMQLSYVSAL----SALQNTNVYSVINRIASDVSSAHFKTENT-------------- 70 (383) T ss_pred cccccccccccccchhhhhhhccCccccccchh----HhhcchHHHHHHHHHHHhhccCceeeccc-------------- Confidence 2222211111 11111110111000112222 23346889999999999999999988632 Q ss_pred HHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccch Q lcl|NC_021537. 
78 TVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGE 157 (602) Q Consensus 78 ~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~ 157 (602) ....++.+||+.||+.+||+.++.++++.||||++++++ +.+++|+++.+|++..+... T Consensus 71 ----------~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~----~~~~~p~~~~~v~~~~~~~~------- 129 (383) T protein:vir:10 71 ----------ATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMG------- 129 (383) T ss_pred ----------chhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCcceEEEEEcCCc------- Confidence 223467799999999999999999999999999999875 45677777666654322110 Q ss_pred hhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCC--CCCcccccHHHH Q lcl|NC_021537. 158 EVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSP--LALYYGVPDWVA 235 (602) Q Consensus 158 ~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~--~~~~~G~spl~~ 235 (602) .++. .....++..++|+++||||||.+++ .++.+|+||+.+ T Consensus 130 ---------~~~~----------------------------~~~~~~~~~~~~~~~evih~r~~~~~~~~~~~G~s~l~~ 172 (383) T protein:vir:10 130 ---------IVYT----------------------------VLESNDRPKMVLRQDQMLHFRLMPDPQYRYLIGRSPLES 172 (383) T ss_pred ---------eEEE----------------------------EEEcCCceEEEEcccceEEeccCCCCcccccccccHHHH Confidence 0000 1122345677899999999997654 456899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccc Q lcl|NC_021537. 
236 AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVN 315 (602) Q Consensus 236 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~ 315 (602) +...|....++++++.++|+||++|+++|++++...++++.+++++.|++..++.|+|+++++++|+ T Consensus 173 ~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~------------- 239 (383) T protein:vir:10 173 LQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPDGF------------- 239 (383) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCCCc------------- Confidence 9999999999999999999999999999999987778999999999999988888999999887665 Q ss_pred cccccccccchHHHHHH-HHHHhhHHHHHHHhcCChHHhhccc--cCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021537. 316 IELEPIGAREDLDMEFQ-AFRERNEHEIAKVHGVPPVLINVTS--TSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIH 392 (602) Q Consensus 316 ~~~~pl~~~~~~d~qf~-e~~~~~~~~Ia~~fgVPp~~lg~~~--~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll 392 (602) +|++++ +++.|+||+ +++++++++||++|||||++||..+ +.+++|++++...| .+||+|+++.||++|+++|+ T Consensus 240 -~~~~l~-~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~~~-~~~l~P~~~~ie~~l~~~l~ 316 (383) T protein:vir:10 240 -DYTQLE-MKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATY-LANLNSYVNPIVDELRLKMN 316 (383) T ss_pred -eEEecC-CChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHHHH-HHHHHHHHHHHHHHHHHhhC Confidence 455555 455789975 9999999999999999999999644 56789999887655 56999999999999999996 Q ss_pred CccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCc Q lcl|NC_021537. 393 QDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDA 472 (602) Q Consensus 393 ~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~ 472 (602) .. +++|+.+.+++. |.+.+++++.+++++|+||+||+|+++|++|+++++.+....... ..++++. 
T Consensus 317 ~~-------~~~f~~~~l~~~--d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~-----~~~gGd~ 382 (383) T protein:vir:10 317 AP-------DLELDIKDMLDV--DDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKPLTN-----ETKGGDD 382 (383) T ss_pred Cc-------eEEeechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCcccccCCCcc-----cCCCCCC Confidence 43 588999998766 777788999999999999999999999999999887544322111 1111111 Q ss_pred C Q lcl|NC_021537. 473 E 473 (602) Q Consensus 473 ~ 473 (602) + T Consensus 383 e 383 (383) T protein:vir:10 383 K 383 (383) T ss_pred C Confidence 1 No 66 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=100.00 E-value=8.6e-70 Score=399.22 Aligned_cols=513 Identities=13% Similarity=0.089 Sum_probs=293.0 Q ss_pred CCCCcccccccch---hhhcc-cC--ccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEETTQLDER---HIATD-VG--RGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~~~~~~~---~~~~~-~~--~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) -.|.+......-| ...+. .| ++++||+|+..|.++++.||+|++||+++|++|++++|.++.+.+..... T Consensus 52 ~~~~d~~~~~~~r~g~~~~~~~~g~~~~~epp~d~~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~~~~---- 127 (648) T protein:vir:79 52 SAKRDPKMSLVKRIGLAIMDGGGGGRDFEEPEFDFNEITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNAVEY---- 127 (648) T ss_pred cccccchhHHHHHhHHHHHhhcCCccccccCCcCHHHHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCccchh---- Confidence 1222322222222 11111 12 36899999999999999999999999999999999999997764332110 Q ss_pred hHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCc---------------eEEEEEeCc Q lcl|NC_021537. 
75 SYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGT---------------PVGLAHVPA 139 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~---------------~~~L~~l~p 139 (602) .+...++.+||+.+|+.+||+.++.+++++||||++++|+.+|. +..|+||+| T Consensus 128 ------------~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p 195 (648) T protein:vir:79 128 ------------IRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNL 195 (648) T ss_pred ------------hHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecC Confidence 11122456899999999999999999999999999999999884 367888888 Q ss_pred ccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEec Q lcl|NC_021537. 140 ATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLP 219 (602) Q Consensus 140 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r 219 (602) .+|++..+..+. ...|.+ ....++....+++++||||| T Consensus 196 ~~v~v~~d~~g~-------------~~~Y~y-----------------------------~~~g~~~~~~~~~~dIIHik 233 (648) T protein:vir:79 196 ASMKVKRDKFGM-------------IKGWQQ-----------------------------EQEGQDKPQKFKPEDIVHIY 233 (648) T ss_pred ceeEEEEcCCCc-------------eeeeEE-----------------------------EecCCceeEEecCccEEEEc Confidence 888764332110 000100 01123455678999999999 Q ss_pred CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceecc Q lcl|NC_021537. 220 NPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVE 299 (602) Q Consensus 220 ~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~ 299 (602) .+++.+++||+||+.++..+|....++++++.++|+||++|+++|+++.+....++.+++.+.+.+...+. 
.+.+ T Consensus 234 ~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~~~e~~k~~~e~~~~~~~~~-----~i~g 308 (648) T protein:vir:79 234 YKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQEGFGAEEGEVDLVRGEVENM-----DVEG 308 (648) T ss_pred cCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCccchHHHHHHHHHHHHhcccc-----cccc Confidence 98889999999999999999999999999999999999999999998644444455555555554332221 1222 Q ss_pred CCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHH Q lcl|NC_021537. 300 EFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPE 379 (602) Q Consensus 300 ~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~ 379 (602) +++.+.. +.+.+. ++++|+||++++++++++||++|||||++||+.+++++++++++...| ..++.|+ T Consensus 309 g~v~~~~---------~~i~~~--~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~~~-~~~i~~l 376 (648) T protein:vir:79 309 GMVTTER---------VNISSI--ASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASRSTGDNLSSDF-KDRIKAL 376 (648) T ss_pred cccccce---------eecccc--CCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccchHHHHHHHHH-HHHHHHH Confidence 2222221 223332 245799999999999999999999999999999889999988776655 6678887 Q ss_pred HHHHHHHHhh----hcCCccc----cccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCc Q lcl|NC_021537. 380 QAKFSARLYK----IIHQDAL----DVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDR 451 (602) Q Consensus 380 ~~~ie~~ln~----~Ll~~~~----~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~ 451 (602) +..++..++. .++.... ...+++++|+++++++. 
|.+.+++.+.+++++|+||+||+|+++||+|+++|+ T Consensus 377 ~~~i~~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~--D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~ 454 (648) T protein:vir:79 377 QKVMATFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMD--SKIKLENQAVFLYEHNAISEDEMRELIGRDPVDDGE 454 (648) T ss_pred HHHHHHHHHHHHHHHHhhhhhccccccccceEEEeecccchh--hHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 7666554443 3322211 12346789999888766 677778889999999999999999999999999887 Q ss_pred cccccccccccccccccC-CCcCcccccccccccccccccccccccccccccchhhhhcchhhhhhheeccc-----ccE Q lcl|NC_021537. 452 GDMTLSEFEAEFGADASD-GDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFG-----ERE 525 (602) Q Consensus 452 ~d~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~-----~~~ 525 (602) +...+..+.......... ...+.+..........+.+..+.............+..+-.-++-.+++|+.. ++. T Consensus 455 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~e~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 534 (648) T protein:vir:79 455 GRAKMHLQMVTIAQATALAALAPTPAGGSSASASGDKKKKATDNKTKPTNQHGTKTSPKKQTNGRHVRYMQEMLLEYTTL 534 (648) T ss_pred CccccccccccchhccccccCCCCCCCCCCCCccccccccccCCCCCCCCCCCcCCCCccccchhhhhhhhhhhhcchhh Confidence 654444333332211111 00000000000000000000000000000000000000111122222222211 111 Q ss_pred EEEEEecccCCcceeeeccCCH---HHHHHHhCCCccchhhhhhhcc-------c--ccccccccchhcccCC--CCCCh Q lcl|NC_021537. 526 LYLSFKRESGQNSLYVYVDVPA---AVWSALVSAPSAGSYHYSEIRL-------Q--YGYLEVTNNHERLPEG--PTPDP 591 (602) Q Consensus 526 l~~~f~~~~~~~~~y~y~~v~~---~~~~~~~~a~s~g~~~~~~i~~-------~--~~~~~~~~~~~~~~~~--~~~~~ 591 (602) -++ .+ ++-..|.-.++-+ .+=..|+-.+ |+|...--+- + -++.+-.++. -+. 
+|=+- T Consensus 535 ~~~-~~---~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~ 605 (648) T protein:vir:79 535 NEA-IK---ALIERYYQYGSKEHLKSINGSLMYTE--GRLLELTTQYWGEEVTEKVRIPFHRMTENL---REEVMSTIDK 605 (648) T ss_pred hHH-Hh---hHHHHHHHHhHHHHHHhhhhhheecc--chhHHHHHHHhhhhhhceeeeeHHHHHHHH---HHHHHhhhhh Confidence 110 00 0111122222211 1111222211 4443322110 0 0001100000 000 00000 Q ss_pred hhcCCccc--ccC Q lcl|NC_021537. 592 GEAPEDVP--SDI 602 (602) Q Consensus 592 ~~~~~~~~--~~~ 602 (602) .+ +|- ++| T Consensus 606 ~~---~~~~~~~~ 615 (648) T protein:vir:79 606 VE---GVAEASDI 615 (648) T ss_pred hh---hhHHHHHH Confidence 00 000 000 No 67 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=100.00 E-value=2.5e-69 Score=396.69 Aligned_cols=381 Identities=9% Similarity=-0.002 Sum_probs=273.9 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) ..+..++.......++..... -.-+++. ..+..+++|++||++||++||++||+++.+++.. T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~-~~~~vt~----~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~~------------- 72 (394) T protein:vir:62 11 LFKKAEKRGYLDNVLGKSIRY-SGVYVTD----SNILQSSDVYELLQDISNQMVLADIVVEDEFGNE------------- 72 (394) T ss_pred ccCCCCchhhhhhhhhccccc-CccccCh----hhhhccHHHHHHHHHHHHhhcccceEEEcCCCcc------------- Confidence 222222221111112111000 0001222 2234568899999999999999999998643211 Q ss_pred HhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhhh Q lcl|NC_021537. 
81 DFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEVE 160 (602) Q Consensus 81 ~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~ 160 (602) ...|+.+.|+.+||+.||+.+||+.++.+++++||+|+++.++..|.+ + .|.+..+. T Consensus 73 ---~~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~~~------~--~~~~~~~~------------ 129 (394) T protein:vir:62 73 ---IKDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIHLA------S--NVFTELDD------------ 129 (394) T ss_pred ---cchhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceeecc------c--cceEEECC------------ Confidence 113667788999999999999999999999999999999875543321 1 12211100 Q ss_pred hcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHHH Q lcl|NC_021537. 161 NIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTM 240 (602) Q Consensus 161 ~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i 240 (602) ..+ +.+..+ .++|+++||||+|.++ .++++|+||+..+..+| T Consensus 130 -----~~~------------------------------~~~~~~--~~~~~~~eiih~r~~~-~d~~~G~s~~~~~~~~i 171 (394) T protein:vir:62 130 -----NLV------------------------------EHFNIG--GHEIPPCMIRHVKNIG-ADHLRGKGILDLGRDTL 171 (394) T ss_pred -----ceE------------------------------EEEeeC--CEEechhheEEecCcC-CCCccccChHHHHHHHH Confidence 000 011111 2568999999999886 68899999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCceEEEecccc-CCHHHHHHHHHHHHH-hhcccccCcceeccCCccceecccccccccccc Q lcl|NC_021537. 241 GADQAAKEWNHDVFDNLGIPHYAVKVTGGT-LSEDSKEDLRNLMDN-LKGSRYRTAILEVEEFVDDHGLGDGGSDVNIEL 318 (602) Q Consensus 241 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~~~~~~~~l~~~~~~-~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~ 318 (602) ....++++++.++|+||++|+++|++++.. 
.++++.+++++.|++ +.|..|+|++++++.|.++ ++ T Consensus 172 ~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~------------~~ 239 (394) T protein:vir:62 172 EGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLESIDEARSVKMIPLGKGY------------SI 239 (394) T ss_pred HHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccccccCceeEeeCCCce------------eE Confidence 999999999999999999999999997653 356677888999965 5677899999988766533 44 Q ss_pred ccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccc Q lcl|NC_021537. 319 EPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDV 398 (602) Q Consensus 319 ~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~ 398 (602) ++++ +++.|+||+|+++++.++||++|||||.+||. .++||+|++.+.|+++||+|+++.||++||++|+++.++ T Consensus 240 ~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~---~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kll~~~~~- 314 (394) T protein:vir:62 240 DTLK-SPLDDEKTLAYLNVYKKDLGKFLGINVDTYTE---LIKEDIEKAMMYIHNKAVRPIMKNFEDHLSLLFYAQNSG- 314 (394) T ss_pred EecC-CCcchHHHHHHHHHHHHHHHHHhCCCHHHcCC---CCCcCHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCcccc- Confidence 4554 35679999999999999999999999999984 457899999999999999999999999999999988665 Q ss_pred cceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccc-ccccccccccccCCCcCcccc Q lcl|NC_021537. 399 DEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTL-SEFEAEFGADASDGDAEAMLT 477 (602) Q Consensus 399 ~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~-~~~~~~~~~~~~~~~~~~~~~ 477 (602) .+++|+||...+++. +. +++++.+++++|+||+||+|+++|++|+++++++.+. +.++.+.+......+. 
T Consensus 315 ~~~~~~fd~~~~~~~--~~--~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~~~~~n~~~~~~~~~~~~~----- 385 (394) T protein:vir:62 315 KRIKFKINILDFVTY--SN--KTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAIYISNDVTEIGKKEATDGS----- 385 (394) T ss_pred CceEEEechhhhcCH--HH--HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeccccccccccccccccc----- Confidence 568899998888766 33 4567889999999999999999999999888777654 4444444322111100 Q ss_pred ccccccccccccc Q lcl|NC_021537. 478 RSKAAPPLENKIG 490 (602) Q Consensus 478 ~~~~~~~~~~~~~ 490 (602) ....+.+ .+ T Consensus 386 ~kgge~~----en 394 (394) T protein:vir:62 386 LGGGEEN----EN 394 (394) T ss_pred CCCCCCC----CC Confidence 0000000 00 No 68 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=100.00 E-value=8.7e-69 Score=393.72 Aligned_cols=378 Identities=13% Similarity=0.058 Sum_probs=276.4 Q ss_pred CCCCcccccccchhhhcc-cCccccCCCCHH-HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATD-VGRGIQPPYNPE-TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQT 78 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~-~~~~i~p~~~~~-~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~ 78 (602) .+++.++.+.....+... ...+.++-..-. .-.+.+..+++|++||++||++||++|++++.+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~--------------- 71 (386) T protein:vir:48 7 TNLATESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASRK--------------- 71 (386) T ss_pred ccccccccccccccccccccchhcccccCCceechhhhhcchHHHHHHHHHHHhhccCceeeccc--------------- Confidence 333322222111111111 111111100000 001223357899999999999999999998742 Q ss_pred HHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchh Q lcl|NC_021537. 
79 VRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEE 158 (602) Q Consensus 79 ~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~ 158 (602) ....++.+||+.||+.+|++.++.+++++||+|++++|+..|++++|+||+|++|++..+..+ T Consensus 72 ---------~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~-------- 134 (386) T protein:vir:48 72 ---------QLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNK-------- 134 (386) T ss_pred ---------hhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCC-------- Confidence 123467799999999999999999999999999999999999999999999999987543321 Q ss_pred hhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHH Q lcl|NC_021537. 159 VENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQ 238 (602) Q Consensus 159 ~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~ 238 (602) +..++++.... ...+..+.++++||||+|.+++.++++|+||+..+.. T Consensus 135 ------~~~~y~~~~~~--------------------------~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~ 182 (386) T protein:vir:48 135 ------DGIYYNITFDD--------------------------PRIPPKQHVPQGDVLHFKLLSVDGGLTSVSPLMALSR 182 (386) T ss_pred ------ceEEEEEEecC--------------------------ccccceeEecCccEEEecCCCCCCceeeccHHHHHHH Confidence 01111111000 0123456899999999999998888999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccccccccccc Q lcl|NC_021537. 239 TMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIEL 318 (602) Q Consensus 239 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~ 318 (602) ++....++++++.++|+||++|+++|+.++. +++++.+++++.|.. 
+..|+|+++++++|+ +| T Consensus 183 ~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~-~~~e~~~~~~~~~~~--~~~n~g~~~vl~~g~--------------~~ 245 (386) T protein:vir:48 183 ELNIQKASDKLTLNSLKNALNANGILKIKGG-GLLDFKTKLSRSRQA--MKQMQGGPLVLDDLE--------------EF 245 (386) T ss_pred HHHHHHHHHHHHHHHHhccCCcceEEEeCCC-CCHHHHHHHHHHHHH--hhcCCCCceecCCCc--------------eE Confidence 9999999999999999999999999999864 788888888888865 446788888887665 45 Q ss_pred ccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccc Q lcl|NC_021537. 319 EPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDV 398 (602) Q Consensus 319 ~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~ 398 (602) ++++ ++++|+||+|++++++++||++|||||.+||. .+++++++++.+.|++.||+|+++.||++||++|++.. T Consensus 246 ~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~--~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~--- 319 (386) T protein:vir:48 246 TPLE-IKSNVSQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSLDLYNKAVSRYLRPFLSELSQKLSCDV--- 319 (386) T ss_pred EEcC-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC--CCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchh--- Confidence 5554 45689999999999999999999999999996 45788999999999999999999999999999998653 Q ss_pred cceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCccc Q lcl|NC_021537. 399 DEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAML 476 (602) Q Consensus 399 ~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~ 476 (602) +++.....+. |....+..+.+++++|++|+||+|+++|++|+++++.......+..+ .++++.++.. 
T Consensus 320 -----~~~~~~~~~~--d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~~~~~~~~~~~~~----~~gGd~~~~~ 386 (386) T protein:vir:48 320 -----DADILPAVDP--TGSNSVSRINSMVKSGTLAQNQGLYILQQAEILPKELPEGENPNKTT----LKGGEINGED 386 (386) T ss_pred -----hcchhhhhcc--ChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCCccchhhcCCCCCc----cCCCCCCCCC Confidence 3444333333 44455667889999999999999999999998876533322222111 1111111111 No 69 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=100.00 E-value=1.7e-68 Score=392.06 Aligned_cols=359 Identities=12% Similarity=0.038 Sum_probs=260.1 Q ss_pred CCCCc---ccccccc-------hhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcc Q lcl|NC_021537. 1 MSKAE---ETTQLDE-------RHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPD 70 (602) Q Consensus 1 ~~k~~---~~~~~~~-------~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~ 70 (602) .++.. ++..... ..+.....+.-.-.+++ ..+-.+++|++||++||++||++|++++.+.. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~----~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~----- 79 (392) T protein:vir:74 9 INQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSA----RAALRNSDLFSIILQLSSDLAIVKINAEKKKN----- 79 (392) T ss_pred hhcccCcccccccccccccCchhhhhhhccCCCCcccch----hhhhcchHHHHHHHHHHHhhccCceeeccchh----- Confidence 22211 1111100 00000000000001122 22335789999999999999999999875321 Q ss_pred cchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 
71 EGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) ..++.+||+.||+.+||+.++.+++++||||++++|+.+|++++|+||+|++|++..+..+ T Consensus 80 -------------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~ 140 (392) T protein:vir:74 80 -------------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYE 140 (392) T ss_pred -------------------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC Confidence 1266789999999999999999999999999999999999999999999999987543321 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGV 230 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~ 230 (602) +..++++... ....+....++++||||++.+++.+.++|+ T Consensus 141 --------------~~~~y~~~~~--------------------------~~~~~~~~~~~~~evih~~~~~~~~~~~G~ 180 (392) T protein:vir:74 141 --------------NGMYYNITFD--------------------------DPKIEPILQAPQSDLIHMKLLSIDGGKTGI 180 (392) T ss_pred --------------ceEEEEEEec--------------------------CCccceeEEEcCccEEEecCCCCCCccccc Confidence 1111111000 011234578999999999999877779999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccC-CHHHHHHHHHHHHHhhcccccCcceeccCCccceeccc Q lcl|NC_021537. 231 PDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTL-SEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGD 309 (602) Q Consensus 231 spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~-~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~ 309 (602) ||+.++..+|....++++++.++|+||++|+++|++++... ++++++. 
..+.+.|..|+|+++++++|+ T Consensus 181 s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~---~~~~~~~~~n~g~~~vl~~g~------- 250 (392) T protein:vir:74 181 SPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKAS---RSRSFMKRSRSGGPVVLDDLE------- 250 (392) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHH---HHHHHhccccCCCeeecCCCc------- Confidence 99999999999999999999999999999999999986533 3333332 334567888999999887665 Q ss_pred cccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021537. 310 GGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYK 389 (602) Q Consensus 310 ~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~ 389 (602) +|++++ ++++|+||+|++++++++||++|||||++||+.++.+ +.+++.++|+++||.|+++.|+++||+ T Consensus 251 -------~~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~~~e~~~~~~~~~l~p~~~~ie~~l~~ 320 (392) T protein:vir:74 251 -------EFTALE-IKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ--SSIQQISGMYASALNRYLRPAISELEY 320 (392) T ss_pred -------eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc--cHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455554 3467999999999999999999999999999765433 445668899999999999999999999 Q ss_pred hcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHh---CCC-----------CCCCCccccc Q lcl|NC_021537. 390 IIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREEL---DLA-----------PFEDDRGDMT 455 (602) Q Consensus 390 ~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~---Gl~-----------p~~~g~~d~~ 455 (602) +|++. ++|+...+.+. |.+.+++.+.+++++|++|+||+|+++ |+. |+++|+...+ T Consensus 321 ~l~~~--------~~~~~~~~~~~--d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~r~~enl~~~~~Gd~~~p 390 (392) T protein:vir:74 321 KLSDH--------ISVNMRPAIDP--LGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQSNEP 390 (392) T ss_pred hccch--------hcccchhhhcC--CHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCccccchhcCCCCCCCCCCCCC Confidence 99754 45777666655 556677889999999999999999987 333 2222222111 Q ss_pred cc Q lcl|NC_021537. 
456 LS 457 (602) Q Consensus 456 ~~ 457 (602) ++ T Consensus 391 ~p 392 (392) T protein:vir:74 391 VP 392 (392) T ss_pred CC Confidence 11 No 70 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=100.00 E-value=2.2e-67 Score=385.99 Aligned_cols=370 Identities=12% Similarity=0.049 Sum_probs=260.0 Q ss_pred CCCCccc--ccccchhhhcccCccc-c-------CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcc Q lcl|NC_021537. 1 MSKAEET--TQLDERHIATDVGRGI-Q-------PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPD 70 (602) Q Consensus 1 ~~k~~~~--~~~~~~~~~~~~~~~i-~-------p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~ 70 (602) ..|...+ .+.....+.......+ + -.++. +.+..+++|++||++||++||++|++++.+.. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~----~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~----- 79 (392) T protein:vir:10 9 INQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSA----RAALRNSDLFSIILQLSSDLAIVKINAEKKKN----- 79 (392) T ss_pred hhcccccccccccccccccCchhhhhhhhcCCCCceech----HHhhccHHHHHHHHHHHHhhccCceeeccchh----- Confidence 2221111 1111111111000000 0 01222 22334689999999999999999999874321 Q ss_pred cchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 
71 EGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) ..++.+||+.||+.+||+.++.+++++||||++++|+.+|++++|+||+|++|++..+..+ T Consensus 80 -------------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~ 140 (392) T protein:vir:10 80 -------------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYE 140 (392) T ss_pred -------------------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC Confidence 1266789999999999999999999999999999999999999999999999987543221 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGV 230 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~ 230 (602) +..++++... ...++....++++||||+|++++.+.++|+ T Consensus 141 --------------~~~~y~~~~~--------------------------~~~~~~~~~~~~~eiih~~~~~~~~~~~G~ 180 (392) T protein:vir:10 141 --------------NGMYYNITFD--------------------------DPKIEPILQAPQSDLIHMKLLSIDGGKTGI 180 (392) T ss_pred --------------ceEEEEEEec--------------------------CcccceeEEEccccEEEecCCCCCCccccc Confidence 1111111100 011234578999999999999887779999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecccc-CCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccc Q lcl|NC_021537. 231 PDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGT-LSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGD 309 (602) Q Consensus 231 spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~ 309 (602) ||+.++..+|....++++++.++|+||++|+++|++++.. .++++++. 
..+.+.+..|+|+++++++|+ T Consensus 181 s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~---~~~~~~~~~~~g~~~vl~~g~------- 250 (392) T protein:vir:10 181 SPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKAS---RSRSFMKRSRSGGPVVLDDLE------- 250 (392) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHH---HHHHHhccccCCCeeecCCCc------- Confidence 9999999999999999999999999999999999998653 33333332 234566788999999887664 Q ss_pred cccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021537. 310 GGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYK 389 (602) Q Consensus 310 ~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~ 389 (602) +|++++ ++++|+||++++++++++||++|||||++||+.++. ++.+++.++|+++||.|+++.|+++||. T Consensus 251 -------~~~~l~-~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~--~~~~~~~~~f~~~~l~P~~~~ie~~l~~ 320 (392) T protein:vir:10 251 -------EFTALE-IKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQ--QSSIQQISGMYASALNRYLRPAISELEY 320 (392) T ss_pred -------eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc--ccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455665 346799999999999999999999999999975433 3445678899999999999999999999 Q ss_pred hcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHh---CCCCCCCCcccccccccccccccc Q lcl|NC_021537. 390 IIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREEL---DLAPFEDDRGDMTLSEFEAEFGAD 466 (602) Q Consensus 390 ~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~---Gl~p~~~g~~d~~~~~~~~~~~~~ 466 (602) +|++. ++|+...+.+. |.+.+++.+.+++++|++|+||+|+++ |+.|.+.. . ..+..+. T Consensus 321 ~L~~~--------~~~d~~~~~~~--d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r---~--~e~l~~~--- 382 (392) T protein:vir:10 321 KLSDH--------ISVNMRPAIDP--LGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLP---A--PENTNKK--- 382 (392) T ss_pred hcccc--------ccccchhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccc---h--hcCCCCC--- Confidence 99754 45666666554 556677888999999999999999987 55432111 0 0011000 Q ss_pred ccCCCcCcccccccccc Q lcl|NC_021537. 
467 ASDGDAEAMLTRSKAAP 483 (602) Q Consensus 467 ~~~~~~~~~~~~~~~~~ 483 (602) .+++. ..+.| T Consensus 383 -~~Gd~------~~p~p 392 (392) T protein:vir:10 383 -TTGQS------NEPVP 392 (392) T ss_pred -CCCCC------CCCCC Confidence 00000 00001 No 71 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=100.00 E-value=2.2e-67 Score=385.99 Aligned_cols=370 Identities=12% Similarity=0.049 Sum_probs=260.0 Q ss_pred CCCCccc--ccccchhhhcccCccc-c-------CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcc Q lcl|NC_021537. 1 MSKAEET--TQLDERHIATDVGRGI-Q-------PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPD 70 (602) Q Consensus 1 ~~k~~~~--~~~~~~~~~~~~~~~i-~-------p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~ 70 (602) ..|...+ .+.....+.......+ + -.++. +.+..+++|++||++||++||++|++++.+.. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~----~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~----- 79 (392) T protein:vir:39 9 INQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSA----RAALRNSDLFSIILQLSSDLAIVKINAEKKKN----- 79 (392) T ss_pred hhcccccccccccccccccCchhhhhhhhcCCCCceech----HHhhccHHHHHHHHHHHHhhccCceeeccchh----- Confidence 2221111 1111111111000000 0 01222 22334689999999999999999999874321 Q ss_pred cchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 
71 EGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) ..++.+||+.||+.+||+.++.+++++||||++++|+.+|++++|+||+|++|++..+..+ T Consensus 80 -------------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~ 140 (392) T protein:vir:39 80 -------------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYE 140 (392) T ss_pred -------------------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC Confidence 1266789999999999999999999999999999999999999999999999987543221 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGV 230 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~ 230 (602) +..++++... ...++....++++||||+|++++.+.++|+ T Consensus 141 --------------~~~~y~~~~~--------------------------~~~~~~~~~~~~~eiih~~~~~~~~~~~G~ 180 (392) T protein:vir:39 141 --------------NGMYYNITFD--------------------------DPKIEPILQAPQSDLIHMKLLSIDGGKTGI 180 (392) T ss_pred --------------ceEEEEEEec--------------------------CcccceeEEEccccEEEecCCCCCCccccc Confidence 1111111100 011234578999999999999887779999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecccc-CCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccc Q lcl|NC_021537. 231 PDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGT-LSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGD 309 (602) Q Consensus 231 spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~ 309 (602) ||+.++..+|....++++++.++|+||++|+++|++++.. .++++++. 
..+.+.+..|+|+++++++|+ T Consensus 181 s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~---~~~~~~~~~~~g~~~vl~~g~------- 250 (392) T protein:vir:39 181 SPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKAS---RSRSFMKRSRSGGPVVLDDLE------- 250 (392) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHH---HHHHHhccccCCCeeecCCCc------- Confidence 9999999999999999999999999999999999998653 33333332 234566788999999887664 Q ss_pred cccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021537. 310 GGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYK 389 (602) Q Consensus 310 ~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~ 389 (602) +|++++ ++++|+||++++++++++||++|||||++||+.++. ++.+++.++|+++||.|+++.|+++||. T Consensus 251 -------~~~~l~-~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~--~~~~~~~~~f~~~~l~P~~~~ie~~l~~ 320 (392) T protein:vir:39 251 -------EFTALE-IKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQ--QSSIQQISGMYASALNRYLRPAISELEY 320 (392) T ss_pred -------eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc--ccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455665 346799999999999999999999999999975433 3445678899999999999999999999 Q ss_pred hcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHh---CCCCCCCCcccccccccccccccc Q lcl|NC_021537. 390 IIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREEL---DLAPFEDDRGDMTLSEFEAEFGAD 466 (602) Q Consensus 390 ~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~---Gl~p~~~g~~d~~~~~~~~~~~~~ 466 (602) +|++. ++|+...+.+. |.+.+++.+.+++++|++|+||+|+++ |+.|.+.. . ..+..+. T Consensus 321 ~L~~~--------~~~d~~~~~~~--d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r---~--~e~l~~~--- 382 (392) T protein:vir:39 321 KLSDH--------ISVNMRPAIDP--LGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLP---A--PENTNKK--- 382 (392) T ss_pred hcccc--------ccccchhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccc---h--hcCCCCC--- Confidence 99754 45666666554 556677888999999999999999987 55432111 0 0011000 Q ss_pred ccCCCcCcccccccccc Q lcl|NC_021537. 
467 ASDGDAEAMLTRSKAAP 483 (602) Q Consensus 467 ~~~~~~~~~~~~~~~~~ 483 (602) .+++. ..+.| T Consensus 383 -~~Gd~------~~p~p 392 (392) T protein:vir:39 383 -TTGQS------NEPVP 392 (392) T ss_pred -CCCCC------CCCCC Confidence 00000 00001 No 72 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=100.00 E-value=1.2e-67 Score=387.54 Aligned_cols=346 Identities=16% Similarity=0.128 Sum_probs=263.8 Q ss_pred CCCCcccccccchhhhcccCcccc-CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQ-PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTV 79 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~-p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~ 79 (602) .+|....+....-.+....+.... -.++... +-.++.|++||++||++||++|+. T Consensus 7 f~~r~~~~~~~~~~~~~~~~~~~~~~~v~~~~----al~~~av~~cv~~ia~~ia~~p~~-------------------- 62 (359) T protein:vir:10 7 FERRSSITPNNYYPFMVQNGSIVPNSLVDATE----ALKNSDLYAVTSLISSDIAGTRFI-------------------- 62 (359) T ss_pred hhccccCCCCcchhhhhccccccCCcccCHHH----hhcchHHHHHHHHHHHhhhcCccc-------------------- Confidence 333322111111111111111111 1133332 234678999999999999999972 Q ss_pred HHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhh Q lcl|NC_021537. 80 RDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEV 159 (602) Q Consensus 80 ~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~ 159 (602) ..+....|+.+||+.||+.+||+.++.+++++||+|++++|+.+|++.+|+||+|++|++..+.. 
T Consensus 63 -----~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~---------- 127 (359) T protein:vir:10 63 -----GNQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDD---------- 127 (359) T ss_pred -----cchHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCC---------- Confidence 01334567889999999999999999999999999999999999999999999999998643211 Q ss_pred hhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCC----CCCCcccccHHHH Q lcl|NC_021537. 160 ENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPS----PLALYYGVPDWVA 235 (602) Q Consensus 160 ~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~----~~~~~~G~spl~~ 235 (602) ..++++ ....++...+++++||||||.++ +.++++|+||+.+ T Consensus 128 ------~~~y~~----------------------------~~~~~~~~~~~~~~evih~~~~~~~~~~~dg~~G~spi~~ 173 (359) T protein:vir:10 128 ------TLTYEV----------------------------NQFDDYPSAKYNASEMIHVKIMAYGVDTLHNLVGHSPLES 173 (359) T ss_pred ------eEEEEE----------------------------EecCCceEEEEcccceEEeccCCCCCCccCccccccHHHH Confidence 111111 01123456789999999999765 3578999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccc Q lcl|NC_021537. 236 AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVN 315 (602) Q Consensus 236 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~ 315 (602) +...+....+++++..++|+||++|+|+|+++++.+++++.+++++.|++.+|+.|+|+++++++|+ T Consensus 174 ~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~~g~------------- 240 (359) T protein:vir:10 174 LTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFEKANGGNNSGRVMVLDQSA------------- 240 (359) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCceecCCCc------------- Confidence 9999999999999999999999999999999877899999999999999999999999999987664 Q ss_pred cccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccc--cCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC Q lcl|NC_021537. 
316 IELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTS--TSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQ 393 (602) Q Consensus 316 ~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~--~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~ 393 (602) +++|++ ++++|+||+|++++++++||++|||||++||..+ .+++++++++...|+..+|.||...|+..|++++.. T Consensus 241 -~~~~l~-~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~p~~~~l~~~l~~~~~~ 318 (359) T protein:vir:10 241 -DFSTVS-INADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIEPLISELRIKCDSSIGV 318 (359) T ss_pred -ceeeec-CCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 456665 4678999999999999999999999999998754 346777888888888888999888888888776543 Q ss_pred ccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCC Q lcl|NC_021537. 394 DALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFE 448 (602) Q Consensus 394 ~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~ 448 (602) .. ++.++|+. + .....+.+++++|+||+||+|+++|++|+- T Consensus 319 ~~----~~~~~~d~--------~--~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 319 DM----SPITDYSN--------S--VFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred cc----hhhhhcCH--------H--HHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 22 22333331 1 122346789999999999999999999986 No 73 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=100.00 E-value=2.3e-67 Score=385.96 Aligned_cols=367 Identities=11% Similarity=0.032 Sum_probs=278.5 Q ss_pred CCCCcccccccchhhhcccC-cccc-----CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVG-RGIQ-----PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~-~~i~-----p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) ..++........+.+..... ..+. ..++. .. 
+..+++|++||++||++||++||+++.+.. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~---~~-al~~~~V~~~i~~Ia~~ia~l~~~~~~~~~--------- 73 (384) T protein:vir:49 7 TNLATESPPSNQDSFFDITDPEFLDALNGSEWVSA---ET-ALKNSDLFSIISQLSNDLATAKITTSRKQL--------- 73 (384) T ss_pred cccCcccccccchhhccccchhhcccccCCceech---hh-hhccHHHHHHHHHHHHHHhhCceeeecchh--------- Confidence 11111111111111110000 0010 01222 22 234788999999999999999999874321 Q ss_pred hHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccc Q lcl|NC_021537. 75 SYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERE 154 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~ 154 (602) ..++.+||+.||+.+|++.++.+++++||+|++++|+..|++++|+||+|++|++..+... T Consensus 74 ---------------~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~---- 134 (384) T protein:vir:49 74 ---------------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQ---- 134 (384) T ss_pred ---------------hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCC---- Confidence 1266789999999999999999999999999999999999999999999999987543211 Q ss_pred cchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHH Q lcl|NC_021537. 155 DGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWV 234 (602) Q Consensus 155 ~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~ 234 (602) +..++++... ....+..++++++||||+|.+++.+.++|+||+. T Consensus 135 ----------~~~~y~~~~~--------------------------~~~~~~~~~~~~~eVih~~~~~~~~~~~G~s~i~ 178 (384) T protein:vir:49 135 ----------NGLYYNITFD--------------------------DPRIPPKQHVPQGDILHFRLLSVDGGLTSVSPLM 178 (384) T ss_pred ----------ceEEEEEEec--------------------------CccccceeEecCccEEEecCCCCCCceeeccHHH Confidence 1111111000 0123456789999999999998888899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccccccc Q lcl|NC_021537. 
235 AAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDV 314 (602) Q Consensus 235 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~ 314 (602) ++...+....++++++.++|+||++|+++|++++....++.. ++.++...+..|+|+++++++|++ T Consensus 179 ~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~---~~~~~~~~~~~n~~~~~vl~~g~~----------- 244 (384) T protein:vir:49 179 ALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKT---KQSRSRQAMKQMQGGPLVLDDLED----------- 244 (384) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHH---HHHHHHHhcccCCccceecCCCce----------- Confidence 999999999999999999999999999999998765444332 334455677889999999877654 Q ss_pred ccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021537. 315 NIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRANSKEQTREFAKGIIEPEQAKFSARLYKIIH 392 (602) Q Consensus 315 ~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll 392 (602) |++++ ++++|+||+|++++++++||++|||||++||..+. ++++++++....|++.+|.||+..|+++|+.+|. T Consensus 245 ---~~~l~-~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~~l~ 320 (384) T protein:vir:49 245 ---FTPLE-IKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLRPFVSELSKKLSCEVD 320 (384) T ss_pred ---EEEcc-CChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhh Confidence 55554 45679999999999999999999999999997543 4567788999999999999999999999999874 Q ss_pred ---CccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccc Q lcl|NC_021537. 393 ---QDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTL 456 (602) Q Consensus 393 ---~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~ 456 (602) ....+..+++++|+.+.+++. +.+++.++...+...|+++ ||+|+.+|++|++||+.+... 
T Consensus 321 ~~~~~~~~~~~~~~~~~~~~l~~~--~~~t~~e~~~~l~~~g~~~-ne~r~~~~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 321 ADILPAVDPTGSNYIGLINSMVKT--GTLAQNQGLYVLQQAEILP-KDLPEGETDSTLKGGETNEQY 384 (384) T ss_pred hhhhhhhhccchHHHHHHHHHhhc--CcccHHHHHHHHhhCCCCC-hhHHHHcCCCCCCCCCCCCCC Confidence 333444567889999988765 6778889999999999986 999999999999999876544 No 74 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=100.00 E-value=4.8e-66 Score=378.69 Aligned_cols=384 Identities=14% Similarity=0.081 Sum_probs=267.2 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) -++...+..... ...+..+ ....+..+++|++||++||++||++||+++.+... T Consensus 8 f~~~~~~~~~~~----~~~~~~v--------~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~-------------- 61 (395) T protein:vir:10 8 FKTRKDITYMLD----LDMIEDL--------SQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRI-------------- 61 (395) T ss_pred hccCcccccccc----chhcccc--------chhhhhhhHHHHHHHHHHHHhhccceeEeccCCcc-------------- Confidence 122211111111 1111111 12334567999999999999999999998753210 Q ss_pred Hhhhccchhhh-hhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhh Q lcl|NC_021537. 81 DFWYGSDSRWQ-IGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEV 159 (602) Q Consensus 81 ~~~~~~~~~~~-l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~ 159 (602) ..++... ++.+||+.||+.+||+.++.++++.|++|+++.++. | ++++++..+++.... 
T Consensus 62 ----~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-~----~~~~~~~~~~~~~~~----------- 121 (395) T protein:vir:10 62 ----QKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-E----LLIADSFYREEYALY----------- 121 (395) T ss_pred ----ccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-C----eEecCCccceeEeec----------- Confidence 1133344 456899999999999999999999999988665442 2 455554444321100 Q ss_pred hhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHH Q lcl|NC_021537. 160 ENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQT 239 (602) Q Consensus 160 ~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~ 239 (602) ...+.+ ......+...+++++||||+|.+++.+..+|.||+..+..+ T Consensus 122 -----~~~~~~----------------------------~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~ 168 (395) T protein:vir:10 122 -----DDIFKD----------------------------VTVKDYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKI 168 (395) T ss_pred -----CcceeE----------------------------EEEcCceeeeeeccccEEEEccCCCCcccccchHHHHHHHH Confidence 000111 01112233467899999999999888899999999999888 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCc--ceeccCCccceeccccccccccc Q lcl|NC_021537. 240 MGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTA--ILEVEEFVDDHGLGDGGSDVNIE 317 (602) Q Consensus 240 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~--~~~~~~g~~~~~~~~~~~~~~~~ 317 (602) +.... +.|.+|+.++++|++++..+++++++++++.|++..++.++++ ++++++|+++.++.....+ T Consensus 169 ~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~---- 237 (395) T protein:vir:10 169 FGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKN---- 237 (395) T ss_pred HHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccc---- Confidence 76544 3577888999999998888999999999999998877655544 4456777666554322211 Q ss_pred cccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccc Q lcl|NC_021537. 
318 LEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALD 397 (602) Q Consensus 318 ~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~ 397 (602) ++..++||+|++++++++||++|||||++|| ++++|++++.++|+++||+|+++.||++||++|+++.+. T Consensus 238 ------~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~----~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~ 307 (395) T protein:vir:10 238 ------SNMPFSELSELMRDAIKNVALMIGIPPGLIY----GETADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQSMY 307 (395) T ss_pred ------cchhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhh Confidence 2345679999999999999999999999996 688999999999999999999999999999999988765 Q ss_pred ccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccc-cccccccccccCCCcCccc Q lcl|NC_021537. 398 VDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLS-EFEAEFGADASDGDAEAML 476 (602) Q Consensus 398 ~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~-~~~~~~~~~~~~~~~~~~~ 476 (602) ..+ ++|+++.+++. |.+.+++++.+++++|+||+||+|+++|+||+++|++|.+.. .++.++. .....+..... T Consensus 308 ~~~--~~f~~~~l~~~--D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~-~~~~~~~~~~~ 382 (395) T protein:vir:10 308 LKD--TRIEIVGVNKK--DPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEKAN-SGENDEKEKDE 382 (395) T ss_pred ccc--ceecchhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccccccc-ccccccCcccc Confidence 444 57888888766 777888999999999999999999999999999998887654 4444433 22222222221 Q ss_pred ccccccccccccc Q lcl|NC_021537. 
477 TRSKAAPPLENKI 489 (602) Q Consensus 477 ~~~~~~~~~~~~~ 489 (602) .........++.+ T Consensus 383 ~~~kgg~~~~~g~ 395 (395) T protein:vir:10 383 NTLKGGDEDESGD 395 (395) T ss_pred cccCCCCCCCCCC Confidence 1111111111111 No 75 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=100.00 E-value=4.8e-66 Score=378.69 Aligned_cols=384 Identities=14% Similarity=0.081 Sum_probs=267.2 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) -++...+..... ...+..+ ....+..+++|++||++||++||++||+++.+... T Consensus 8 f~~~~~~~~~~~----~~~~~~v--------~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~-------------- 61 (395) T protein:vir:10 8 FKTRKDITYMLD----LDMIEDL--------SQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRI-------------- 61 (395) T ss_pred hccCcccccccc----chhcccc--------chhhhhhhHHHHHHHHHHHHhhccceeEeccCCcc-------------- Confidence 122211111111 1111111 12334567999999999999999999998753210 Q ss_pred Hhhhccchhhh-hhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhh Q lcl|NC_021537. 81 DFWYGSDSRWQ-IGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEV 159 (602) Q Consensus 81 ~~~~~~~~~~~-l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~ 159 (602) ..++... ++.+||+.||+.+||+.++.++++.|++|+++.++. | ++++++..+++.... 
T Consensus 62 ----~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-~----~~~~~~~~~~~~~~~----------- 121 (395) T protein:vir:10 62 ----QKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-E----LLIADSFYREEYALY----------- 121 (395) T ss_pred ----ccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-C----eEecCCccceeEeec----------- Confidence 1133344 456899999999999999999999999988665442 2 455554444321100 Q ss_pred hhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHH Q lcl|NC_021537. 160 ENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQT 239 (602) Q Consensus 160 ~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~ 239 (602) ...+.+ ......+...+++++||||+|.+++.+..+|.||+..+..+ T Consensus 122 -----~~~~~~----------------------------~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~ 168 (395) T protein:vir:10 122 -----DDIFKD----------------------------VTVKDYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKI 168 (395) T ss_pred -----CcceeE----------------------------EEEcCceeeeeeccccEEEEccCCCCcccccchHHHHHHHH Confidence 000111 01112233467899999999999888899999999999888 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCc--ceeccCCccceeccccccccccc Q lcl|NC_021537. 240 MGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTA--ILEVEEFVDDHGLGDGGSDVNIE 317 (602) Q Consensus 240 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~--~~~~~~g~~~~~~~~~~~~~~~~ 317 (602) +.... +.|.+|+.++++|++++..+++++++++++.|++..++.++++ ++++++|+++.++.....+ T Consensus 169 ~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~---- 237 (395) T protein:vir:10 169 FGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKN---- 237 (395) T ss_pred HHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccc---- Confidence 76544 3577888999999998888999999999999998877655544 4456777666554322211 Q ss_pred cccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccc Q lcl|NC_021537. 
318 LEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALD 397 (602) Q Consensus 318 ~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~ 397 (602) ++..++||+|++++++++||++|||||++|| ++++|++++.++|+++||+|+++.||++||++|+++.+. T Consensus 238 ------~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~----~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~ 307 (395) T protein:vir:10 238 ------SNMPFSELSELMRDAIKNVALMIGIPPGLIY----GETADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQSMY 307 (395) T ss_pred ------cchhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhh Confidence 2345679999999999999999999999996 688999999999999999999999999999999988765 Q ss_pred ccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccc-cccccccccccCCCcCccc Q lcl|NC_021537. 398 VDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLS-EFEAEFGADASDGDAEAML 476 (602) Q Consensus 398 ~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~-~~~~~~~~~~~~~~~~~~~ 476 (602) ..+ ++|+++.+++. |.+.+++++.+++++|+||+||+|+++|+||+++|++|.+.. .++.++. .....+..... T Consensus 308 ~~~--~~f~~~~l~~~--D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~-~~~~~~~~~~~ 382 (395) T protein:vir:10 308 LKD--TRIEIVGVNKK--DPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEKAN-SGENDEKEKDE 382 (395) T ss_pred ccc--ceecchhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccccccc-ccccccCcccc Confidence 444 57888888766 777888999999999999999999999999999998887654 4444433 22222222221 Q ss_pred ccccccccccccc Q lcl|NC_021537. 
477 TRSKAAPPLENKI 489 (602) Q Consensus 477 ~~~~~~~~~~~~~ 489 (602) .........++.+ T Consensus 383 ~~~kgg~~~~~g~ 395 (395) T protein:vir:10 383 NTLKGGDEDESGD 395 (395) T ss_pred cccCCCCCCCCCC Confidence 1111111111111 No 76 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=100.00 E-value=4.8e-66 Score=378.69 Aligned_cols=384 Identities=14% Similarity=0.081 Sum_probs=267.2 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) -++...+..... ...+..+ ....+..+++|++||++||++||++||+++.+... T Consensus 8 f~~~~~~~~~~~----~~~~~~v--------~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~-------------- 61 (395) T protein:vir:95 8 FKTRKDITYMLD----LDMIEDL--------SQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRI-------------- 61 (395) T ss_pred hccCcccccccc----chhcccc--------chhhhhhhHHHHHHHHHHHHhhccceeEeccCCcc-------------- Confidence 122211111111 1111111 12334567999999999999999999998753210 Q ss_pred Hhhhccchhhh-hhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhh Q lcl|NC_021537. 81 DFWYGSDSRWQ-IGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEV 159 (602) Q Consensus 81 ~~~~~~~~~~~-l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~ 159 (602) ..++... ++.+||+.||+.+||+.++.++++.|++|+++.++. | ++++++..+++.... 
T Consensus 62 ----~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-~----~~~~~~~~~~~~~~~----------- 121 (395) T protein:vir:95 62 ----QKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-E----LLIADSFYREEYALY----------- 121 (395) T ss_pred ----ccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-C----eEecCCccceeEeec----------- Confidence 1133344 456899999999999999999999999988665442 2 455554444321100 Q ss_pred hhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHH Q lcl|NC_021537. 160 ENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQT 239 (602) Q Consensus 160 ~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~ 239 (602) ...+.+ ......+...+++++||||+|.+++.+..+|.||+..+..+ T Consensus 122 -----~~~~~~----------------------------~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~ 168 (395) T protein:vir:95 122 -----DDIFKD----------------------------VTVKDYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKI 168 (395) T ss_pred -----CcceeE----------------------------EEEcCceeeeeeccccEEEEccCCCCcccccchHHHHHHHH Confidence 000111 01112233467899999999999888899999999999888 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCc--ceeccCCccceeccccccccccc Q lcl|NC_021537. 240 MGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTA--ILEVEEFVDDHGLGDGGSDVNIE 317 (602) Q Consensus 240 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~--~~~~~~g~~~~~~~~~~~~~~~~ 317 (602) +.... +.|.+|+.++++|++++..+++++++++++.|++..++.++++ ++++++|+++.++.....+ T Consensus 169 ~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~---- 237 (395) T protein:vir:95 169 FGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKN---- 237 (395) T ss_pred HHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccc---- Confidence 76544 3577888999999998888999999999999998877655544 4456777666554322211 Q ss_pred cccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccc Q lcl|NC_021537. 
318 LEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALD 397 (602) Q Consensus 318 ~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~ 397 (602) ++..++||+|++++++++||++|||||++|| ++++|++++.++|+++||+|+++.||++||++|+++.+. T Consensus 238 ------~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~----~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~ 307 (395) T protein:vir:95 238 ------SNMPFSELSELMRDAIKNVALMIGIPPGLIY----GETADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQSMY 307 (395) T ss_pred ------cchhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhh Confidence 2345679999999999999999999999996 688999999999999999999999999999999988765 Q ss_pred ccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccc-cccccccccccCCCcCccc Q lcl|NC_021537. 398 VDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLS-EFEAEFGADASDGDAEAML 476 (602) Q Consensus 398 ~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~-~~~~~~~~~~~~~~~~~~~ 476 (602) ..+ ++|+++.+++. |.+.+++++.+++++|+||+||+|+++|+||+++|++|.+.. .++.++. .....+..... T Consensus 308 ~~~--~~f~~~~l~~~--D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~-~~~~~~~~~~~ 382 (395) T protein:vir:95 308 LKD--TRIEIVGVNKK--DPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEKAN-SGENDEKEKDE 382 (395) T ss_pred ccc--ceecchhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccccccc-ccccccCcccc Confidence 444 57888888766 777888999999999999999999999999999998887654 4444433 22222222221 Q ss_pred ccccccccccccc Q lcl|NC_021537. 
477 TRSKAAPPLENKI 489 (602) Q Consensus 477 ~~~~~~~~~~~~~ 489 (602) .........++.+ T Consensus 383 ~~~kgg~~~~~g~ 395 (395) T protein:vir:95 383 NTLKGGDEDESGD 395 (395) T ss_pred cccCCCCCCCCCC Confidence 1111111111111 No 77 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=100.00 E-value=1.9e-65 Score=375.44 Aligned_cols=370 Identities=12% Similarity=0.072 Sum_probs=266.7 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHH----HHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETL----AAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESY 76 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l----~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~ 76 (602) +-|.... ....++++.+ ...+..+++|++||++||++||++||+++++.... T Consensus 7 ~f~~~~~---------------~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~--------- 62 (385) T protein:vir:95 7 VFKRHSE---------------LSWMYDLEFLQDKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTKE--------- 62 (385) T ss_pred hhccCcc---------------cccccchhhhhccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCccc--------- Confidence 2221100 0000111111 23344578999999999999999999998643211 Q ss_pred HHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccccccc Q lcl|NC_021537. 77 QTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERED 155 (602) Q Consensus 77 ~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~ 155 (602) .|+...++ .+||+.||+.+||+.++.++++.||||+++.+++ |.+..++++.+..+.+.. 
T Consensus 63 ---------~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~-~~~~~~~~~~~~~~~~~~--------- 123 (385) T protein:vir:95 63 ---------KGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEG-HFFVADDFEKEDELGLYS--------- 123 (385) T ss_pred ---------cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEecCC-Ceeecccccccccccccc--------- Confidence 13444445 5899999999999999999999999999877654 333333333322221100 Q ss_pred chhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHH Q lcl|NC_021537. 156 GEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVA 235 (602) Q Consensus 156 ~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~ 235 (602) ..|.++. ....+..+.++++||||+|.+++.+..+|.||+.. T Consensus 124 ----------~~~~~~~----------------------------~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~~~~ 165 (385) T protein:vir:95 124 ----------HRFTNVL----------------------------VNDFEFKRVFTMDDVIYLKYNNQKLDAFSLGLFED 165 (385) T ss_pred ----------ccceeee----------------------------ecccceeeeeccccEEEecCCCCCcccccchHHHH Confidence 0111110 01123346789999999999988888899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEecc-ccCCHHHHHHHHHHHHHhh-cc-cccCcceeccCCccceecccccc Q lcl|NC_021537. 236 AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTG-GTLSEDSKEDLRNLMDNLK-GS-RYRTAILEVEEFVDDHGLGDGGS 312 (602) Q Consensus 236 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~~~~~~~~~~l~~~~~~~~-g~-~nag~~~~~~~g~~~~~~~~~~~ 312 (602) +...+....++.. +++.|+++|++++ ..+++++.+++++.|++.. |. .+.++++++++|+++.++... T Consensus 166 ~~~~i~~~~~~~~-------~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~~g~~~~~l~~~-- 236 (385) T protein:vir:95 166 YGEIFGRMIDLQM-------LNNQIRGILKVDATKFYNKEKQKELQAYIDTLFDAFQNNTIAVVPLTEGLAYEEHSNR-- 236 (385) T ss_pred HHHHHHHHHHHHH-------hcCCCceEEEeCCccCCCHHHHHHHHHHHHHHhhhhhhcCCceEEcCCCceeEeeccc-- Confidence 9998876655432 2345789998864 4579999999999998764 44 345667888888776655321 Q ss_pred ccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_021537. 
313 DVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIH 392 (602) Q Consensus 313 ~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll 392 (602) +....++.|+||+|++++++.+||++|||||++|+ ++++|++++...|++.||+|+++.||++||++|+ T Consensus 237 -------~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~----~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~L~ 305 (385) T protein:vir:95 237 -------GAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVL----GEMADLEKTIESYLQFCINPLLRKIEAELNSKFF 305 (385) T ss_pred -------ccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCCcCHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 11235678999999999999999999999999994 6899999999999999999999999999999999 Q ss_pred CccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccc-cccccccccccCCC Q lcl|NC_021537. 393 QDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLS-EFEAEFGADASDGD 471 (602) Q Consensus 393 ~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~-~~~~~~~~~~~~~~ 471 (602) ++.+. .+++++|+.+.+++. |.+.+++++.+++++|+||+||+|+++|++|++++++|.+.. .++..++ ..++++ T Consensus 306 ~~~~~-~~~~~~fd~~~l~~~--D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd~~~~~~n~~~~~-~~kgge 381 (385) T protein:vir:95 306 YQDEY-LNDDMHIKVVGIDKR--DPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELDKFIITKNLQSAD-AFKGGE 381 (385) T ss_pred Chhhc-ccceEEEechhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceecc-cccCCC Confidence 98776 456899999999876 777888999999999999999999999999998877777654 4444443 222222 Q ss_pred cCcc Q lcl|NC_021537. 472 AEAM 475 (602) Q Consensus 472 ~~~~ 475 (602) ..+. 
T Consensus 382 ~~~e 385 (385) T protein:vir:95 382 SNEE 385 (385) T ss_pred CCCC Confidence 1111 No 78 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=100.00 E-value=8.9e-65 Score=371.72 Aligned_cols=374 Identities=11% Similarity=0.041 Sum_probs=271.3 Q ss_pred CCCCcccccccchhhhcccC-ccccC-----CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVG-RGIQP-----PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~-~~i~p-----~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) .++..++.......+..... ....+ .++.. .+..+++|++||++||++||++|++++.+.. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~----~al~~~~v~~~i~~ia~~ia~~p~~~~~~~~--------- 73 (386) T protein:vir:49 7 TNLATESPPINQESFFDIADSDFLASLNSSEWVSAE----NALKNSDLFSIISQLSNDLATAKITTSRKQL--------- 73 (386) T ss_pred hccCCCCcccchhhhhhhhhccccccccCCceechh----hhhccHHHHHHHHHHHHHhhhCceeeccchh--------- Confidence 33333222222222211111 11111 12222 2334789999999999999999999875321 Q ss_pred hHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccc Q lcl|NC_021537. 75 SYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERE 154 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~ 154 (602) ..++.+||+.||+.+||+.++.+++++||||++++|+.+|++++|+||+|++|++..+.... 
T Consensus 74 ---------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~~--- 135 (386) T protein:vir:49 74 ---------------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN--- 135 (386) T ss_pred ---------------hhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCCc--- Confidence 12667899999999999999999999999999999999999999999999999876433210 Q ss_pred cchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHH Q lcl|NC_021537. 155 DGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWV 234 (602) Q Consensus 155 ~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~ 234 (602) ..++++.- ....++..+.++++||||+|.+++.++++|+||+. T Consensus 136 -----------~~~y~~~~--------------------------~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~ 178 (386) T protein:vir:49 136 -----------GLYYNITF--------------------------DDPHIAPKQHVPQNDILHFRLLSVDGGLTSVSPLM 178 (386) T ss_pred -----------eEEEEEEE--------------------------cCccccceeEEccccEEEecCCCCCCccccccHHH Confidence 01111100 00123456789999999999999888899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccccccc Q lcl|NC_021537. 235 AAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDV 314 (602) Q Consensus 235 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~ 314 (602) ++..+|....++++++.++|+||++|+++|++++. +++++.+++++.|+. +..|+|+++++++|+ T Consensus 179 ~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~-~~~~~~~~~~~~~~~--~~~n~g~~~vl~~g~------------ 243 (386) T protein:vir:49 179 ALGREFNIQKASDKLTISALKNALNANGILKIKGG-GLLDFKTKVSRSRQA--MKQMQGGPLVLDDLE------------ 243 (386) T ss_pred HHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCC-CChHHHHHHHHHHHH--hccCCCCceecCCCc------------ Confidence 99999999999999999999999999999999865 677777778887765 447889999887665 Q ss_pred ccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc Q lcl|NC_021537. 
315 NIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQD 394 (602) Q Consensus 315 ~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~ 394 (602) +|++++ +++.|+||+|++++++++||++|||||.+||.. ..++++.++ .+.|+..+|+|+++.|+++|+++|+. T Consensus 244 --~~~~l~-~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~-~~~~~~~~~-~~~~~~~~i~~~l~~i~~~~~~~l~~- 317 (386) T protein:vir:49 244 --DFTPLE-IKSNVAQLLSQADWTTGQFAKVYGIPESIVGGD-GDQQSSLEM-IYNIYFKSVSRYLRPFVSEMSKKLSC- 317 (386) T ss_pred --eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCccchHHH-HHHHHHHHHHHHHHHHHHHHHHHhcc- Confidence 455564 456799999999999999999999999999964 345666654 46788999999999999999999863 Q ss_pred cccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCc Q lcl|NC_021537. 395 ALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEA 474 (602) Q Consensus 395 ~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~ 474 (602) +++|+...+++. |.+.++..+.+++++|++|+||+|++++..++..++.......+. ...++++.++ T Consensus 318 -------~~~~~~~~~~~~--d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~~~~~~~~~~----~~~~gGd~~~ 384 (386) T protein:vir:49 318 -------EVDVDISPAVDP--TGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKELPDGKNPNR----TSLKGGEINE 384 (386) T ss_pred -------hhcccchhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCcCcchhccCC----CCCCCCCCCC Confidence 367787777665 556677888999999999999999999766553222111100000 1111111111 Q ss_pred cc Q lcl|NC_021537. 475 ML 476 (602) Q Consensus 475 ~~ 476 (602) .. 
T Consensus 385 ~~ 386 (386) T protein:vir:49 385 QD 386 (386) T ss_pred CC Confidence 11 No 79 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=100.00 E-value=2.1e-63 Score=364.23 Aligned_cols=370 Identities=12% Similarity=0.050 Sum_probs=260.3 Q ss_pred CCCCcccccccchhhh----cccCccccC--CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEETTQLDERHIA----TDVGRGIQP--PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~----~~~~~~i~p--~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) .+|...++....+.+. ....+.... .++. .. +..+++|++||++||++||++||+++.+.. T Consensus 4 f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~---~~-~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~--------- 70 (382) T protein:vir:48 4 FNLATESPPDNQGGFFDVVDSDFLASLKGNEWVSA---ET-ALRNSDLFSIINQLSNDLATVKLITSRKKL--------- 70 (382) T ss_pred ccccccCCcccccccccchhhhccccccCCcccch---Hh-hhccHHHHHHHHHHHHhhccCceeeecchh--------- Confidence 2221111111111100 000111111 1222 22 234688999999999999999999875321 Q ss_pred hHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccc Q lcl|NC_021537. 
75 SYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERE 154 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~ 154 (602) ..|+.+||+.||+.+|++.++.+++++||||++++|+.+|++++|+||+|++|++..+..+ T Consensus 71 ---------------~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~---- 131 (382) T protein:vir:48 71 ---------------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNK---- 131 (382) T ss_pred ---------------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC---- Confidence 1267789999999999999999999999999999999999999999999999987543321 Q ss_pred cchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHH Q lcl|NC_021537. 155 DGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWV 234 (602) Q Consensus 155 ~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~ 234 (602) +..++++..+. ...+..+.++++||||+|.+++.+.++|+||+. T Consensus 132 ----------~~~~y~~~~~~--------------------------~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~ 175 (382) T protein:vir:48 132 ----------DGIYYNITFDD--------------------------PRIPPKQHVPQNDVLHFRLLSVDGGMTSVSPLM 175 (382) T ss_pred ----------CeEEEEEEecC--------------------------ccccceeEEcCccEEEecCCCCCCccccccHHH Confidence 01111111100 112445789999999999999888899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccccccc Q lcl|NC_021537. 235 AAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDV 314 (602) Q Consensus 235 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~ 314 (602) ++..+|....++++++.++|+||++|+++|++++. +++++.+++++.|.. 
+..|+|+++++++|++ T Consensus 176 ~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~e~~~~~~~~~~~--~~~n~g~~~vl~~g~~----------- 241 (382) T protein:vir:48 176 ALSRELDIQKASGNLTINSLKNALNANGILKIKGG-GLLDFKTKLSRSRQA--MKQMQGGPLVLDDLED----------- 241 (382) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC-CChHHHHHHHHHHHh--hccCCCCeeEcCCCce----------- Confidence 99999999999999999999999999999999864 677788888777765 4467889988876654 Q ss_pred ccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc Q lcl|NC_021537. 315 NIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQD 394 (602) Q Consensus 315 ~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~ 394 (602) |++++ ++++|+||+|++++++++||++|||||.+||..++ .++.+++.+.|++.||+|+++.|+++||++|+++ T Consensus 242 ---~~~l~-~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~--~~~~~~~~~~~~~~~l~p~~~~i~~~l~~~l~~~ 315 (382) T protein:vir:48 242 ---FTPLE-IKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGD--QQSSLEMSSDLYSKAVSRYLRPFLSELSQKLSCD 315 (382) T ss_pred ---EEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCh Confidence 55554 45679999999999999999999999999997544 3467788899999999999999999999999877 Q ss_pred cccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCC---CCccccccccccccccccccCCC Q lcl|NC_021537. 395 ALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFE---DDRGDMTLSEFEAEFGADASDGD 471 (602) Q Consensus 395 ~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~---~g~~d~~~~~~~~~~~~~~~~~~ 471 (602) .+... ...++. +.......+.+++++|++|+||+|+.++..++. ...++.+. ...++++ T Consensus 316 ~~~~~--~~~~~~--------~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~~~--------~~~~GGd 377 (382) T protein:vir:48 316 VDADI--FPAVDP--------TGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGENPN--------STLKGGE 377 (382) T ss_pred hhhhh--hhhhcc--------chhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhcCC--------CCCCCCC Confidence 54321 112221 222233446778999999999999988533221 11111100 0111112 Q ss_pred cCccc Q lcl|NC_021537. 
472 AEAML 476 (602) Q Consensus 472 ~~~~~ 476 (602) ..+.. T Consensus 378 ~~~~~ 382 (382) T protein:vir:48 378 EDGQD 382 (382) T ss_pred CCCCC Confidence 11111 No 80 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=100.00 E-value=2.5e-63 Score=363.75 Aligned_cols=378 Identities=13% Similarity=0.071 Sum_probs=252.3 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) ..|...+..+.+ +..+..........+..+++|++||++||++||++||+++.+.+. . T Consensus 11 ~~~~~~~~~~~~---------~~~~~~~~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~~------------~- 68 (395) T protein:vir:40 11 FNEEQRTLNLTD---------TVWCSIPSEKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKGEE------------V- 68 (395) T ss_pred hccccccccccc---------chhhccccccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCCcc------------c- Confidence 222222222111 112222223334455568899999999999999999998753211 0 Q ss_pred Hhhhccchhhh-hhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhh Q lcl|NC_021537. 81 DFWYGSDSRWQ-IGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEV 159 (602) Q Consensus 81 ~~~~~~~~~~~-l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~ 159 (602) .++... |+.+||+.||+.+||+.++.+++++||||+++.++.. ++.++ .++..... 
T Consensus 69 -----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~~------~~~~~-~~~~~~~~----------- 125 (395) T protein:vir:40 69 -----RKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEYI------YVADS-FTKNDKSL----------- 125 (395) T ss_pred -----cchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCce------eecCC-cccccccc----------- Confidence 123333 4458999999999999999999999999999887642 22221 11110000 Q ss_pred hhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHH Q lcl|NC_021537. 160 ENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQT 239 (602) Q Consensus 160 ~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~ 239 (602) ....|.++... ..+..++++++||||+|+.+.....++.+.+..+... T Consensus 126 ----~~~~~~~v~~~----------------------------~~~~~~~~~~~evih~r~~~~~~~~~~~~l~~~~~~~ 173 (395) T protein:vir:40 126 ----YENTYTEVTLK----------------------------DLTLKKEFKESEVLHLTLNNESIKSIIDGFYLLYGDL 173 (395) T ss_pred ----ccceeeeeeec----------------------------CceeeeeeccccEEEeecCCCCccccchhHHHHHHHH Confidence 00111111100 0012346899999999976544333444444443333 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhh-cc-cccCcceeccCCccceeccccccccccc Q lcl|NC_021537. 240 MGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLK-GS-RYRTAILEVEEFVDDHGLGDGGSDVNIE 317 (602) Q Consensus 240 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~-g~-~nag~~~~~~~g~~~~~~~~~~~~~~~~ 317 (602) +... .+...+.++..+.++++.+ ..+++++.+++++.|++.. +. .|+++++++++|++ T Consensus 174 ~~~~-----~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~g~~-------------- 233 (395) T protein:vir:40 174 LTAA-----VNKYKKLNSRKIIVKLKAM-FGQTPEAEEKLRLMLSERMKKFLAEGDSALPVEDGME-------------- 233 (395) T ss_pred HHHH-----HHHHHhcCCCCceEEEecc-cCCCHHHHHHHHHHHHHHHHHhhccCCceeecCCCce-------------- Confidence 3222 2233344554554444443 4589999999999998754 42 57888888776654 Q ss_pred cccccccchHHHHHHHHHHhhH---HHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc Q lcl|NC_021537. 
318 LEPIGAREDLDMEFQAFRERNE---HEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQD 394 (602) Q Consensus 318 ~~pl~~~~~~d~qf~e~~~~~~---~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~ 394 (602) |++++ +++.|+||+|++++.. ++||++|||||.+|| ++++|++++...|+++||.|++++||++||++||+. T Consensus 234 ~~~l~-~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~----~~~sn~e~~~~~f~~~~L~P~~~~ie~~l~~kLl~~ 308 (395) T protein:vir:40 234 IDELA-GDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAK----GDTVGLSEQVNSFLMFSINPIAEMFTDEGNRKFYGR 308 (395) T ss_pred EEecc-CChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCh Confidence 55554 4567999999998874 799999999999996 678999999999999999999999999999999999 Q ss_pred cccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccc-cccccccccccCCCcC Q lcl|NC_021537. 395 ALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLS-EFEAEFGADASDGDAE 473 (602) Q Consensus 395 ~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~-~~~~~~~~~~~~~~~~ 473 (602) .+...+++|+||++.+++. |.+.+++++.+++++|+||+||+|+++|++|++++++|.+.. .++++.+...... .+ T Consensus 309 ~~~~~g~~i~fd~~~ll~~--d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~~-kg 385 (395) T protein:vir:40 309 DSVLERTYMKLDTTRIKVQ--DIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQERFVTKNYAPLGENEEDL-KG 385 (395) T ss_pred hhhcCCceEEEechhhhcc--CHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCceeeecccccccccccccc-CC Confidence 8888899999999999877 778888999999999999999999999999999988887644 4445444221111 11 Q ss_pred cccccccccccccc Q lcl|NC_021537. 474 AMLTRSKAAPPLEN 487 (602) Q Consensus 474 ~~~~~~~~~~~~~~ 487 (602) +...+... 
++ T Consensus 386 ge~~~~~~----~~ 395 (395) T protein:vir:40 386 GDINENKG----DS 395 (395) T ss_pred CCCCCCcC----CC Confidence 11000000 00 No 81 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=100.00 E-value=8.1e-63 Score=361.00 Aligned_cols=366 Identities=14% Similarity=0.091 Sum_probs=252.9 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) +-|....... ...... ... ++ ...+..++.|++||++||++||++||+++.+.. T Consensus 7 l~~~~~~~~~---~~~~~~---~~~-~~----~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~~--------------- 60 (376) T protein:vir:78 7 LFKRNKEIEW---MWDLDF---LED-KT----TKVYLKKMALNTCVKHIARTIAKSDFRLKNGET--------------- 60 (376) T ss_pred hhccCCcccc---ccchhh---ccc-cc----hhhhhhhHHHHHHHHHHHHhhcccceeeccccc--------------- Confidence 2121111000 011111 111 11 112334688999999999999999999874321 Q ss_pred Hhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhh Q lcl|NC_021537. 81 DFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEV 159 (602) Q Consensus 81 ~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~ 159 (602) ...|++..++ .+||+.||+.+||+.++.++++.||+|+++.|+..|.+..++++.+..+.... 
T Consensus 61 ---~~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~~~------------- 124 (376) T protein:vir:78 61 ---SVRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFPDV------------- 124 (376) T ss_pred ---cccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceeeee------------- Confidence 1124455444 58999999999999999999999999999999999999999998776543210 Q ss_pred hhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHH Q lcl|NC_021537. 160 ENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQT 239 (602) Q Consensus 160 ~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~ 239 (602) |.++. ....+....++++||||+|........++.+.+..+... T Consensus 125 --------~~~~~----------------------------~~~~~~~~~~~~~evih~~~~~~~~~~~~~~~~~~~~~~ 168 (376) T protein:vir:78 125 --------FEGVT----------------------------VKDYRYNRNFSMDDVIFLEYGNERLSAFTDGMFEDYGEL 168 (376) T ss_pred --------eeeee----------------------------eecceeeeeeccccEEEeccCCCCchhhhhHHHHHHHHH Confidence 11110 001122356899999999976543322333332222222 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcc--cccCcceeccCCccceeccccccccccc Q lcl|NC_021537. 240 MGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGS--RYRTAILEVEEFVDDHGLGDGGSDVNIE 317 (602) Q Consensus 240 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~--~nag~~~~~~~g~~~~~~~~~~~~~~~~ 317 (602) +. .....++.+++.+.+++......+++++.+++++.|++..++ .+++.++++++|+++.++.....+ T Consensus 169 ~~------~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~---- 238 (376) T protein:vir:78 169 FG------KMIRAQMRNFQIRGAVNFKMAGVADKDKQTKLQEYIDKVYASFNNNEIAIVPQLEGFNYEEFGTTSVN---- 238 (376) T ss_pred HH------HHHHHHHhcCCCceeEEEccCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceEEeeccCccc---- Confidence 21 222233334443333333334568999999999999876544 456678888888877766433221 Q ss_pred cccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccc Q lcl|NC_021537. 
318 LEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALD 397 (602) Q Consensus 318 ~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~ 397 (602) ++.+|+||+|++++++++||++|||||.+|| ++++|+|++.+.|+++||.|+++.||++||++|+++.+ T Consensus 239 ------~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~----~~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~- 307 (376) T protein:vir:78 239 ------NSQSFDEVKKLRKEMIDYVASILGIPSSLLH----GDMADLSNNMKAYMEYCIDPLTKKLEDELNAKLFTFSE- 307 (376) T ss_pred ------cchhHHHHHHHHHHHHHHHHHHhCCCHHHhC----CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhhhCCccc- Confidence 2346789999999999999999999999996 58899999999999999999999999999999998753 Q ss_pred ccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccc-ccccccccccccCC Q lcl|NC_021537. 398 VDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTL-SEFEAEFGADASDG 470 (602) Q Consensus 398 ~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~-~~~~~~~~~~~~~~ 470 (602) ++++|++..+++. |.+.+++++++++++|++|+||+|+++|+||+++|++|.+. +.++++++.-.+.| T Consensus 308 ---~~~~~~~~~ll~~--d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~d~~~~~~n~~~~~~~~e~g 376 (376) T protein:vir:78 308 ---FLAGEHIKIIHKK--DIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPELDKYLITKNYQSADEGGEDG 376 (376) T ss_pred ---ceecccchhhccc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeeccCceehhccccCC Confidence 5677777777655 77888999999999999999999999999999999877665 45555544222221 No 82 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=100.00 E-value=1.1e-62 Score=360.25 Aligned_cols=319 Identities=19% Similarity=0.252 Sum_probs=256.4 Q ss_pred CCC-----------------------Cccc--ccccchh--hhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhh Q lcl|NC_021537. 
1 MSK-----------------------AEET--TQLDERH--IATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYE 53 (602) Q Consensus 1 ~~k-----------------------~~~~--~~~~~~~--~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~i 53 (602) |.| .... .-+.+-. +....+++++||+++..|+++.+.|+.+.+||..+.+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~epp~~~~~La~l~~~n~~h~~~i~~k~N~l 80 (348) T protein:vir:26 1 MTEQLIHSHTTDGTESKSVYSFDPNPEPVDTNSWMTRYCELFYNDFDDYWEPPISLKGLAEIANANGYHGSLLKARANYV 80 (348) T ss_pred CCccccchhhccccCCceEEEecCCCeeecCcchHHHHHHHHhcCCCccccCCCCHHHHHHHHhhhhhhhhhHhhhhhHH Confidence 110 0000 0011111 222456799999999999999999999999999988776 Q ss_pred ccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEE Q lcl|NC_021537. 54 AGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVG 133 (602) Q Consensus 54 a~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~ 133 (602) ++. -.||+.+|..+|++. +.|++++||||++++|+..|++++ T Consensus 81 ~~~-------------------------------------~~Pn~~~t~~~f~~~-~~d~ll~Gnay~~~~rn~~G~~~~ 122 (348) T protein:vir:26 81 AGR-------------------------------------FMNGGGLPMYKMNSA-CWDYFGLGMSAFVKIRSYLKNVIA 122 (348) T ss_pred hhc-------------------------------------ccCCCCCCHHHHHHH-HHHHHhcCCeEEEEEEcCCCcEEE Confidence 641 147889999999764 579999999999999999999999 Q ss_pred EEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechh Q lcl|NC_021537. 134 LAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPAN 213 (602) Q Consensus 134 L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 213 (602) |+|||+.+|++..+. . 
|++ +..++..+.|+++ T Consensus 123 L~~l~~~~v~~~~d~-----------------~-~~~------------------------------~~~~g~~~~f~~~ 154 (348) T protein:vir:26 123 LEPLPMVHMRKRKNG-----------------D-FVQ------------------------------LLRNNEQKVFKAK 154 (348) T ss_pred EEEecCceeEeeecC-----------------c-EEE------------------------------EEecCeEEEEcCc Confidence 999999999864321 1 111 1224556789999 Q ss_pred HEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccC Q lcl|NC_021537. 214 ELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRT 293 (602) Q Consensus 214 eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag 293 (602) +|||+|.+++.+++||+||+.++++++....+++.|++++|+||++|++||+++++.+++++.+++++.|++.+|..|++ T Consensus 155 dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~ 234 (348) T protein:vir:26 155 DVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYATDPNLSEADEKALKEKIASSKGIGNFR 234 (348) T ss_pred cEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccc Confidence 99999999999999999999999999999999999999999999999999999988899999999999999989999999 Q ss_pred cceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccc--cCCccCHHHHHHHH Q lcl|NC_021537. 294 AILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTS--TSNRANSKEQTREF 371 (602) Q Consensus 294 ~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~--~~~~sn~e~~~~~f 371 (602) +++++.++ +.+.+++++|++.++ +|+||++.+++++++||++|||||.++|+.. 
+++++|++++.+.| T Consensus 235 ~~~vl~~~---------g~~~Gi~~~pis~~~-~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~~~~f 304 (348) T protein:vir:26 235 SMFVNIPN---------GKEKGIQLIPVGDIA-TKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKVSQVY 304 (348) T ss_pred ceeEEcCC---------CCccceeEEEccCCh-hHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHHH Confidence 98887544 345688999998654 6899999999999999999999999999754 46899999999999 Q ss_pred HHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHH Q lcl|NC_021537. 372 AKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRV 424 (602) Q Consensus 372 ~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~ 424 (602) +++||.|+++.||++||++|..+. +++++|+++...+. .+.. ++ T Consensus 305 ~~~~l~P~~~~ie~~ln~~l~~~~----~~~~~fdl~~~~e~-~~~~----a~ 348 (348) T protein:vir:26 305 DFYEVIPVCKRFMDAVNNDPEIPD----NLKLKFNLNPGVES-ANGS----AV 348 (348) T ss_pred HHHHHHHHHHHHHHHHhhhhCCCC----ccEEEEecCccccc-chhh----cC Confidence 999999999999999999876432 45677776532211 1111 11 No 83 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=100.00 E-value=1.2e-62 Score=360.09 Aligned_cols=314 Identities=18% Similarity=0.246 Sum_probs=258.4 Q ss_pred CCCCc------------------cccc------------cc-----chhhhcccCccccCCCCHHHHHHHHhhhHHHHHH Q lcl|NC_021537. 1 MSKAE------------------ETTQ------------LD-----ERHIATDVGRGIQPPYNPETLAAFQELNETHQAC 45 (602) Q Consensus 1 ~~k~~------------------~~~~------------~~-----~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~c 45 (602) |+|.. .+.+ +. 
+-.-.+..|++++||+++..|+++.+.|+++.+| T Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~fg~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~La~~~~~~~~h~s~ 105 (376) T protein:vir:10 26 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSA 105 (376) T ss_pred chhccCCCcccchhhhhHhhhccCcceeEEEEcCCceeccCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHHhhhh Confidence 11100 0000 11 1112233467999999999999999999999999 Q ss_pred HHHHHHhhccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEee Q lcl|NC_021537. 46 IRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILV 125 (602) Q Consensus 46 I~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r 125 (602) |..+++.+++. -+||+.+|..+|++ ++.|++++||||++++| T Consensus 106 l~~k~n~l~~~-------------------------------------~~Pnp~lT~~~f~~-~v~d~ll~Gnay~~~~r 147 (376) T protein:vir:10 106 LFFKANVLAST-------------------------------------FRPHRWLSRHAFER-WALDFLTFGNGYLERRR 147 (376) T ss_pred HHHHhHHHHhc-------------------------------------cCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEE Confidence 99988766541 14788999999975 56799999999999999 Q ss_pred CCCCceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCc Q lcl|NC_021537. 126 EGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAG 205 (602) Q Consensus 126 ~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 205 (602) +..|++++|+||+|.+|++..+. ..|+++. .++ T Consensus 148 n~~G~~~~L~pl~~~~vr~~~d~-----------------~~~~~~~------------------------------~~~ 180 (376) T protein:vir:10 148 NMVGGTLRLEPALAKYVRRKADF-----------------NGFVYVN------------------------------GWQ 180 (376) T ss_pred CCCCCEEEEEEeCCcceEEEeeC-----------------CeEEEEE------------------------------cCC Confidence 99999999999999999865432 1233321 234 Q ss_pred eeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH Q lcl|NC_021537. 
206 ELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN 285 (602) Q Consensus 206 ~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~ 285 (602) ....|+++||||+|.+++.+++||+|++.+++.++....+++.|+.++|+||++|++||++++..+++++.+++++.|++ T Consensus 181 ~~~~~~~~eViHir~~~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~d~~l~~e~~~~lr~~~~~ 260 (376) T protein:vir:10 181 ERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKN 260 (376) T ss_pred eEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHH Confidence 55678999999999999999999999999999999999999999999999999999999999888999999999999999 Q ss_pred hhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccC Q lcl|NC_021537. 286 LKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRAN 363 (602) Q Consensus 286 ~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn 363 (602) .+|..|+++++++..+ +.+.+++++|++.+ ++|+||+|++++++++||++|||||.++|+.++ ++++| T Consensus 261 ~~G~~N~~~~~vl~~~---------g~~~Gi~~~pls~~-~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn 330 (376) T protein:vir:10 261 AKGPGNFRNVFMYAPG---------GKKDGIQLIPVSEV-AAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGT 330 (376) T ss_pred hcCccccCceeEecCC---------CCccceEEEEccCC-HHHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCccc Confidence 9999999998887543 34568899999865 578999999999999999999999999998764 46999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHH Q lcl|NC_021537. 364 SKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKM 419 (602) Q Consensus 364 ~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~ 419 (602) +|++.+.|+++||.|+++.|++ +|.+|.. ..++|+..++++. |++. 
T Consensus 331 ~eq~~~~f~~~~L~Pl~~~iee-ln~~L~~-------~~~~F~~~~Llr~--d~ka 376 (376) T protein:vir:10 331 PDTAARVFGRNEIRPLQARFAE-LNDWLGE-------EVVRFDDYEIPPA--PVAA 376 (376) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHhhccc-------cccccChhHhhcc--cccC Confidence 9999999999999999999985 7777732 2489999999877 4433 No 84 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=100.00 E-value=1.3e-62 Score=359.93 Aligned_cols=315 Identities=18% Similarity=0.204 Sum_probs=259.5 Q ss_pred CCCCccc-------------c----------cccch----hh-h-cccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_021537. 1 MSKAEET-------------T----------QLDER----HI-A-TDVGRGIQPPYNPETLAAFQELNETHQACIRKKSR 51 (602) Q Consensus 1 ~~k~~~~-------------~----------~~~~~----~~-~-~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~ 51 (602) |+|..+. . -++.+ .+ . ...|+|++||+++..|+++.+.|+++.+||.+.++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~~~~~h~~~i~~k~n 80 (346) T protein:vir:10 1 MKKQLRKNLTQNDRLQPQAQTEIFSFGDPIPVLDRADILNYLECSAMYEKWYNPPMSFDGLAKSLRSSTHHESAIITKAN 80 (346) T ss_pred CCcccCCCCCcccccccccCeEEEecCCcceecCchhHHHHHHHhhcCCceEecCCCHHHHHHHHHhhhhcchhhhhhhh Confidence 2222100 0 00111 11 1 24577999999999999999999999999987654 Q ss_pred hhccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCce Q lcl|NC_021537. 
52 YEAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTP 131 (602) Q Consensus 52 ~ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~ 131 (602) .+ ..++.+||+.||+.+|++ ++.|++++||||++++|+..|++ T Consensus 81 ~l------------------------------------~~l~~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~i~r~~~G~~ 123 (346) T protein:vir:10 81 IL------------------------------------LSTCEVDSRYLSRRDLSS-FVKDYLVFGNAYFEVVRNRLGQV 123 (346) T ss_pred hH------------------------------------HHHHhCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcE Confidence 32 234557999999999986 56899999999999999999999 Q ss_pred EEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEec Q lcl|NC_021537. 132 VGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGP 211 (602) Q Consensus 132 ~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 211 (602) ++|+||+|.+|++..+... |.+ .+...+|....|+ T Consensus 124 ~~L~pl~~~~v~~~~~~~~-----------------~~~----------------------------~~~~~~g~~~~~~ 158 (346) T protein:vir:10 124 QRIESPLAKYVRKGLEAGQ-----------------FYY----------------------------VPQRFDHQEHEFA 158 (346) T ss_pred EEEEEecCCceEEEEcCCe-----------------EEE----------------------------EEEccCCeEEEEe Confidence 9999999999987543221 111 1122345677899 Q ss_pred hhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccc Q lcl|NC_021537. 
212 ANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRY 291 (602) Q Consensus 212 ~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~n 291 (602) +++|||+|.+++.+++||+||+..++.++....++++++.++|+||++|++||++++..+++++.+++++.|++.+|..| T Consensus 159 ~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~i~~~~~~~~g~~n 238 (346) T protein:vir:10 159 KGSIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVENIRQQLKQSKGVGN 238 (346) T ss_pred cccEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCccc Confidence 99999999999899999999999999999999999999999999999999999999888999999999999999999999 Q ss_pred cCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccCHHHHHH Q lcl|NC_021537. 292 RTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRANSKEQTR 369 (602) Q Consensus 292 ag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~ 369 (602) +++++++.+|. .+.+++++|++.+ +.|+||++++++++++||++|||||.+||+..+ ++++|+|++.+ T Consensus 239 ~~~~~vl~~~~---------~~~gi~~~pis~~-~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~ 308 (346) T protein:vir:10 239 FKNLFVHAPNG---------KKDGIQIIPIADV-SAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAE 308 (346) T ss_pred cCceeEecCCC---------CccceeEEecCCC-hhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHH Confidence 99999887654 3567899999864 578999999999999999999999999998754 46999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhH Q lcl|NC_021537. 370 EFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQ 415 (602) Q Consensus 370 ~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~ 415 (602) .|++++|.|++++||+ +|.+|.. 
..++|+..++++.++ T Consensus 309 ~f~~~~l~P~~~~iee-~n~~L~~-------e~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 309 VFFITEIEPLQERLKE-FNQWLGQ-------EVIKFKPSKLLQRTQ 346 (346) T ss_pred HHHHHHHHHHHHHHHH-HHhhccc-------ceeeechhhhcccCC Confidence 9999999999999986 6666632 258999999988765 No 85 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=100.00 E-value=2.8e-62 Score=358.02 Aligned_cols=362 Identities=10% Similarity=0.006 Sum_probs=250.1 Q ss_pred CCCCcccccccchhhhcc-cCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATD-VGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTV 79 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~-~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~ 79 (602) ..|... +.+.+... ......+ .-...+.++++|++||++||++||++||+++++.+.+...... T Consensus 4 f~~~~~----~~~~~~~~~~~~~~~~-----~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~------ 68 (378) T protein:vir:94 4 FGKVVS----FSRGKLNNDTQRVTAW-----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTL------ 68 (378) T ss_pred cccchh----cccccccCCcceeeee-----ccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccc------ Confidence 111110 00111000 0000111 1123456789999999999999999999988876655432111 Q ss_pred HHhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC-CCceEEEEEeCcccccccccccccccccch Q lcl|NC_021537. 80 RDFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEG-DGTPVGLAHVPAATVRVRKTTTTIEREDGE 157 (602) Q Consensus 80 ~~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~-~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~ 157 (602) .....|+++.++. +||+.||+.+||+.++.+++++||+|++++++. .|+++.|+|.. 
T Consensus 69 --~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l~p~~------------------- 127 (378) T protein:vir:94 69 --ISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDLLFAD------------------- 127 (378) T ss_pred --cccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEEEecC------------------- Confidence 1123467777776 799999999999999999999999999987654 46665554321 Q ss_pred hhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHH Q lcl|NC_021537. 158 EVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAM 237 (602) Q Consensus 158 ~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~ 237 (602) ..++++++||||+|.+ .++..|+||+..+. T Consensus 128 ------------------------------------------------~~~~~~~~diiH~~~~--~~~~~g~s~l~~~~ 157 (378) T protein:vir:94 128 ------------------------------------------------DKKEYKPEELVRLTSP--FYINEDTSILDNAL 157 (378) T ss_pred ------------------------------------------------CeeEeeeeeeEEecCc--CCccchhHHHHHHH Confidence 1134678899999954 56778999999888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHH----HHhhcccccCcceeccCCccceeccccccc Q lcl|NC_021537. 238 QTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLM----DNLKGSRYRTAILEVEEFVDDHGLGDGGSD 313 (602) Q Consensus 238 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~----~~~~g~~nag~~~~~~~g~~~~~~~~~~~~ 313 (602) +.+.. .+++ +.++|+|++++ .+++++.+++++.| +...++.|+|+++++++|++ T Consensus 158 ~~i~~----------~~~~-~~~~gil~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~---------- 215 (378) T protein:vir:94 158 ASIQT----------KLEQ-GKLRGLLKINA-FLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTE---------- 215 (378) T ss_pred HHHHH----------HHhc-ccccceeeeCC-cCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCce---------- Confidence 77643 2333 46889999875 46766555555544 44567788889998877654 Q ss_pred cccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC Q lcl|NC_021537. 
314 VNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQ 393 (602) Q Consensus 314 ~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~ 393 (602) |++++ .+++|+|+ +.++++.++||++|||||.+|+ ++ +++++..+|+++||.||+++||++||++|++ T Consensus 216 ----~~~l~-~~~~~~~~-~~~~~~~~~Ia~~fgVP~~~l~----~~--~se~~~~~f~~~tL~P~~~~ie~~l~~~Ll~ 283 (378) T protein:vir:94 216 ----IVELK-KDYSVLNK-DEIDLIKSELLTGYFMNENILL----GT--ASQEQQIYFYNSTIIPLLIQLEKELTYKLIS 283 (378) T ss_pred ----EEEcc-CChhhhhH-HHHHHHHHHHHHHhCCCHHHhc----CC--hHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 55554 34578997 5678999999999999999994 33 3478899999999999999999999999999 Q ss_pred ccccccce------EEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccc Q lcl|NC_021537. 394 DALDVDEW------TIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADA 467 (602) Q Consensus 394 ~~~~~~~~------~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~ 467 (602) +.++..++ .++|+.+.+++. |.+.+++++++++++|+||+||+|+++||||+|||+ ..+++.++++..... T Consensus 284 ~~er~~g~~~~~~~~~~f~~~~l~~~--d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD-~~~~~~n~~~~~~~~ 360 (378) T protein:vir:94 284 TNRRRVVKGNLYYERIIVDNQLFKFA--TLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD-VYIANLNAVAVKNLS 360 (378) T ss_pred hhHhhhhhhcccccceeecchhhhhc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eeeecccccccccch Confidence 87654443 478999998877 777888999999999999999999999999999864 445566666554222 Q ss_pred cCCCcCcccccccccccccccccc Q lcl|NC_021537. 468 SDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) ..... .... 
.+.+...++ T Consensus 361 ~~~~~-~~~~-----~~~~e~~n~ 378 (378) T protein:vir:94 361 DLQGS-RKDV-----TSTDETNNQ 378 (378) T ss_pred hhcCC-cCCC-----CCCCCCCCC Confidence 11111 1100 011111111 No 86 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=100.00 E-value=9.8e-62 Score=355.07 Aligned_cols=382 Identities=13% Similarity=0.101 Sum_probs=260.3 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) ..|.. ..+.... .+..+. ......+..+++|++||++||++||++||+++.+.+... T Consensus 8 ~~~~~--~~~~~~~----~~~~~~-----~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~------------ 64 (395) T protein:vir:98 8 SFKKS--GTLSDDD----SGSTTS-----EKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLTE------------ 64 (395) T ss_pred cCCCc--ccccccc----cchhhh-----hhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCccc------------ Confidence 22211 1110000 011111 112233445789999999999999999999986432211 Q ss_pred Hhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhh Q lcl|NC_021537. 81 DFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEV 159 (602) Q Consensus 81 ~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~ 159 (602) ..++.+.++ .+||+.||+.+||+.++.+++++||||++++++..+ ++++..++... 
T Consensus 65 ----~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~------~~~~~~~~~~~------------- 121 (395) T protein:vir:98 65 ----NQKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGI------YVADSFTQDKK------------- 121 (395) T ss_pred ----ccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCce------ecCCccccccc------------- Confidence 123444444 489999999999999999999999999999987532 22222221100 Q ss_pred hhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHH Q lcl|NC_021537. 160 ENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQT 239 (602) Q Consensus 160 ~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~ 239 (602) ..+..|.++.. .......+++++||||+|+.++....++.+++...... T Consensus 122 ---~~~~~~~~~~~----------------------------~~~~~~~~~~~~evih~k~~~~~~~~~~~~~~~~~~~~ 170 (395) T protein:vir:98 122 ---ISGSQFKVSRV----------------------------QGQTYEKTFTFDQVIYLKNDNSDLMSKVESLWEEYGEL 170 (395) T ss_pred ---ccCcccceeee----------------------------cCceeeeEecCccEEEecCCCCCccccccchhhhHHHH Confidence 00001111100 00112357889999999987765555666666655555 Q ss_pred HHHHHHH--HHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcc--cccCcceeccCCccceeccccccccc Q lcl|NC_021537. 240 MGADQAA--KEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGS--RYRTAILEVEEFVDDHGLGDGGSDVN 315 (602) Q Consensus 240 i~~~~~~--~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~--~nag~~~~~~~g~~~~~~~~~~~~~~ 315 (602) +...... .....+++.++..+.+++.......++++.+..++++++..++ .+.++++++++|+++.++.... T Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~---- 246 (395) T protein:vir:98 171 LGHVINNQKIANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFKRTVEKIRTESVVGIPVTANTNYEEYGSKN---- 246 (395) T ss_pred HHHHHHHHHHHHHHHHhhccccccccccccccCCcHHHHHHHHHHHHHHHhhhhcCCcceeecCCCceeEeccccc---- Confidence 5443333 3445678888888888887776667788888888888876665 4566777788887776654322 Q ss_pred cccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcc Q lcl|NC_021537. 
316 IELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDA 395 (602) Q Consensus 316 ~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~ 395 (602) .....+.|+||++++++++++||++|||||++|| ++++|.|++.+.|+++||.|+++.||++||++|+++. T Consensus 247 -----~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~----~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~kll~~~ 317 (395) T protein:vir:98 247 -----TGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH----GDIADNQKNYELLLEGPIESLITNIVDGLEYAIFDKS 317 (395) T ss_pred -----ccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChh Confidence 1224567899999999999999999999999996 6899999999999999999999999999999999988 Q ss_pred ccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccc-ccccccccccccCCCcCc Q lcl|NC_021537. 396 LDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTL-SEFEAEFGADASDGDAEA 474 (602) Q Consensus 396 ~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~-~~~~~~~~~~~~~~~~~~ 474 (602) +...+++ |+++.+++. |.+.+++++++++++|++|+||+|+++|+||++++++|.++ +.+++++... +++..+ T Consensus 318 ~~~~g~~--f~~~~l~~~--d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~gD~~~~~~n~~~~~~~--gge~~~ 391 (395) T protein:vir:98 318 ETLQGSF--IKVTGLKNY--DLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLGKVLYMTKNYESVLER--GGEVDE 391 (395) T ss_pred hhcCcce--eeehhhhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceecccc--cCCCCC Confidence 7766665 555677766 67778899999999999999999999999999998777654 4455554321 111100 Q ss_pred ccccccccccccc Q lcl|NC_021537. 475 MLTRSKAAPPLEN 487 (602) Q Consensus 475 ~~~~~~~~~~~~~ 487 (602) .. . 
+ T Consensus 392 ~~----~-----~ 395 (395) T protein:vir:98 392 EV----E-----T 395 (395) T ss_pred CC----C-----C Confidence 00 0 0 No 87 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=100.00 E-value=7.3e-62 Score=355.77 Aligned_cols=362 Identities=10% Similarity=0.005 Sum_probs=248.7 Q ss_pred CCCCcccccccchhhh-cccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIA-TDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTV 79 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~-~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~ 79 (602) .-|... +.+.+. ++......+ .-..++.++++|++||++||++||++||+++++.+.+....... T Consensus 4 f~~~~~----~~~~~~~~~~~~~~~~-----~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~----- 69 (378) T protein:vir:16 4 FGKVVS----FSRGKLNNDTQRVTAW-----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLI----- 69 (378) T ss_pred chhhhh----hhcccccCCcceeeec-----ccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEccccccccccc----- Confidence 222110 000000 000000110 01234567899999999999999999999988766543221111 Q ss_pred HHhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC-CceEEEEEeCcccccccccccccccccch Q lcl|NC_021537. 80 RDFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD-GTPVGLAHVPAATVRVRKTTTTIEREDGE 157 (602) Q Consensus 80 ~~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~-G~~~~L~~l~p~~v~~~~~~~~~~~~~~~ 157 (602) ....|+++.++. +||+.||+.+||+.++.++++.||+|++++|+.. |++..|+|.. 
T Consensus 70 ---~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l~~~~------------------- 127 (378) T protein:vir:16 70 ---SMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFAD------------------- 127 (378) T ss_pred ---ccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecC------------------- Confidence 123466777776 7999999999999999999999999999988753 5554443321 Q ss_pred hhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHH Q lcl|NC_021537. 158 EVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAM 237 (602) Q Consensus 158 ~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~ 237 (602) ...+|+++||||+|. +.++..|.|++..+. T Consensus 128 ------------------------------------------------~~~~~~~~diih~r~--~~~~~~~~s~l~~~~ 157 (378) T protein:vir:16 128 ------------------------------------------------DKKEYKPEELVRLTS--PFYINEDTSILDNAL 157 (378) T ss_pred ------------------------------------------------CeeEecccceEEecC--ccCccchhHHHHHHH Confidence 123567899999995 356678899988887 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHH----HHHHHHHhhcccccCcceeccCCccceeccccccc Q lcl|NC_021537. 238 QTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKED----LRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSD 313 (602) Q Consensus 238 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~----l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~ 313 (602) ..+.. .+. ++.++|+|+.++ .+++++.++ +++.++...++.|+|+++++++|++ T Consensus 158 ~~i~~----------~~~-~~~~~g~l~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~---------- 215 (378) T protein:vir:16 158 ASIQT----------KLE-QGKLRGLLKINA-FLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTE---------- 215 (378) T ss_pred HHHHH----------HHh-cCccceeeEeCC-cCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCce---------- Confidence 66532 233 456889999875 456665544 4444455567889999999877665 Q ss_pred cccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC Q lcl|NC_021537. 
314 VNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQ 393 (602) Q Consensus 314 ~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~ 393 (602) |++++ ++++|+|+. .+++++++||++|||||.+|+ + ++++++..+|+++||.||++.||++||++|++ T Consensus 216 ----~~~l~-~~~~~~~~~-~~~~~~~~Ia~~fgVPp~~l~----g--~~~e~~~~~f~~~tl~P~~~~ie~~l~~kLl~ 283 (378) T protein:vir:16 216 ----IVELK-KDYSVLNKD-EIDLIKSELLTGYFMNENILL----G--TASQEQQIYFYNSTIIPLLIQLEKELTYKLIS 283 (378) T ss_pred ----EEEcc-CChhhhhHH-HHHHHHHHHHHHhCCCHHHhc----C--CchHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 44554 345789974 568999999999999999994 2 34578999999999999999999999999999 Q ss_pred ccccccce------EEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccc Q lcl|NC_021537. 394 DALDVDEW------TIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADA 467 (602) Q Consensus 394 ~~~~~~~~------~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~ 467 (602) +.++..++ .++|+.+.+++. |.+.+++++.+++++|+||+||+|+++|+||+|||+ ..+++.+++++.... T Consensus 284 ~~e~~~~~~~~~~~~~~f~~~~l~~~--d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD-~~~~~~n~~~~~~~~ 360 (378) T protein:vir:16 284 TNRRRVVKGNLYYERIIVDNQLFKFA--TLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD-VYIANLNAVAVKNLS 360 (378) T ss_pred hhhhhhhhhcccccceeeccchhhhc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eEeeccccccccchh Confidence 87654432 478999998877 777788999999999999999999999999999864 445566666554222 Q ss_pred cCCCcCcccccccccccccccccc Q lcl|NC_021537. 468 SDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) ......... 
.+.+...+| T Consensus 361 ~~~~~~~~~------~~~~e~~ne 378 (378) T protein:vir:16 361 DLQGSRKDV------TSTDETNNQ 378 (378) T ss_pred hhcCccCCC------CCCCCCCCC Confidence 211111110 111111111 No 88 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=100.00 E-value=3.2e-62 Score=357.73 Aligned_cols=362 Identities=10% Similarity=-0.006 Sum_probs=249.0 Q ss_pred CCCCcccccccchhhh-cccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIA-TDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTV 79 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~-~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~ 79 (602) .-|... +.+... ++......+ .-...+.++++|++||++||++||++||+++++.+.+....... T Consensus 4 f~~~~~----f~~~~~~~~~~~~~~~-----~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~----- 69 (378) T protein:vir:93 4 FGKVVS----FSRGKLNNDTQRVTAW-----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLI----- 69 (378) T ss_pred chhhhh----hhccccCCCcceeeec-----ccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEccccccccccc----- Confidence 111110 000000 000000000 01234557889999999999999999999988765543221111 Q ss_pred HHhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC-CceEEEEEeCcccccccccccccccccch Q lcl|NC_021537. 80 RDFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD-GTPVGLAHVPAATVRVRKTTTTIEREDGE 157 (602) Q Consensus 80 ~~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~-G~~~~L~~l~p~~v~~~~~~~~~~~~~~~ 157 (602) ....|+++.++. +||++||+.+||+.++.+++++||+|++++++.. |++..|+|.. 
T Consensus 70 ---~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~l~~~~------------------- 127 (378) T protein:vir:93 70 ---SMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFAD------------------- 127 (378) T ss_pred ---ccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecC------------------- Confidence 123466777776 7999999999999999999999999999887643 5555443310 Q ss_pred hhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHH Q lcl|NC_021537. 158 EVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAM 237 (602) Q Consensus 158 ~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~ 237 (602) ...+++++||||+|. +.++..|.|++..+. T Consensus 128 ------------------------------------------------~~~~~~~~diih~r~--~~~~~~~~s~l~~~~ 157 (378) T protein:vir:93 128 ------------------------------------------------DKKEYKTEELVRLTS--PFYINEDTSILDNAL 157 (378) T ss_pred ------------------------------------------------CeeEeccceeEEecC--ccccchhhHHHHHHH Confidence 123577899999995 456678999988877 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHH----HHhhcccccCcceeccCCccceeccccccc Q lcl|NC_021537. 238 QTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLM----DNLKGSRYRTAILEVEEFVDDHGLGDGGSD 313 (602) Q Consensus 238 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~----~~~~g~~nag~~~~~~~g~~~~~~~~~~~~ 313 (602) ..+. .+|.+| .++|+|++++ .+++++.+++++.| +...++.|+++++++++|++ T Consensus 158 ~~i~----------~~~~~~-~~~g~l~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~---------- 215 (378) T protein:vir:93 158 ASIQ----------TKLEQG-KLRGLLKINA-FLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTE---------- 215 (378) T ss_pred HHHH----------HHHhcC-cccceeeeCC-cCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCce---------- Confidence 6553 344554 6899999875 46776655555554 44567778888998876654 Q ss_pred cccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC Q lcl|NC_021537. 
314 VNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQ 393 (602) Q Consensus 314 ~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~ 393 (602) |++++ .+++|+|+ +.++++.++||++|||||.+|+ .++++++...|++.||.|+++.||++||++|++ T Consensus 216 ----~~~l~-~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~------g~~~e~~~~~f~~~tl~P~~~~ie~~l~~kLl~ 283 (378) T protein:vir:93 216 ----IVELK-KDYSVLNK-DEIDLIKSELLTGYFMNENILL------GTATQEQQIYFYNSTIIPLLIQLEKELTYKLIS 283 (378) T ss_pred ----EEEcc-CChhhhhH-HHHHHHHHHHHHHhCCCHHHhc------CCcHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 55554 34578997 6678999999999999999993 234578999999999999999999999999999 Q ss_pred ccccccce------EEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccc Q lcl|NC_021537. 394 DALDVDEW------TIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADA 467 (602) Q Consensus 394 ~~~~~~~~------~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~ 467 (602) +.++..++ .++|+++.+++. |.+.+++++++++++|+||+||+|+++||||+|||+ ..+++.+.++..... T Consensus 284 ~~er~~~~~~~~~~~~~fd~~~l~~~--d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD-~~~~~~n~~~~~~~~ 360 (378) T protein:vir:93 284 TNRRRVVKGNLYYERIIVDNQLFKFA--TLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD-VYIANLNAVAVKNLS 360 (378) T ss_pred hhHhhhhhhcccccceeeccchhhhc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eeeeccccccccchh Confidence 87655443 478999998877 777888999999999999999999999999999864 344555665543221 Q ss_pred cCCCcCcccccccccccccccccc Q lcl|NC_021537. 468 SDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) ..... ... 
+++.++..++ T Consensus 361 ~~~~~-~~~-----~~~~~e~~n~ 378 (378) T protein:vir:93 361 DLQGS-RKD-----VTSTDETNNQ 378 (378) T ss_pred hhcCc-cCC-----CCCCCCCCCC Confidence 11111 110 1111111111 No 89 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=100.00 E-value=4.5e-62 Score=356.92 Aligned_cols=314 Identities=18% Similarity=0.242 Sum_probs=258.0 Q ss_pred CCCC------------------ccccc------------ccch-----hhhcccCccccCCCCHHHHHHHHhhhHHHHHH Q lcl|NC_021537. 1 MSKA------------------EETTQ------------LDER-----HIATDVGRGIQPPYNPETLAAFQELNETHQAC 45 (602) Q Consensus 1 ~~k~------------------~~~~~------------~~~~-----~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~c 45 (602) |+|. +.+.+ +..+ .-.+..|++++||+++..|+++.+.|+++.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~~ 80 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSA 80 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHhhhhh Confidence 1111 00011 1111 12233467999999999999999999999999 Q ss_pred HHHHHHhhccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEee Q lcl|NC_021537. 46 IRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILV 125 (602) Q Consensus 46 I~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r 125 (602) |..+++.+++. 
-+||+.+|..+|+ +++.|++++||||++++| T Consensus 81 l~~k~n~l~~~-------------------------------------~~Pnp~~t~~~f~-~~v~d~ll~Gnay~~~~r 122 (351) T protein:vir:79 81 LFFKANVLAST-------------------------------------FRPHRWLSRHAFE-RWALDFLTFGNGYLERRR 122 (351) T ss_pred hhhhhhHHhhc-------------------------------------ccCCCCCCHHHHH-HHHHHHHhcCCeEEEEEE Confidence 98887766541 1488899999996 567899999999999999 Q ss_pred CCCCceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCc Q lcl|NC_021537. 126 EGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAG 205 (602) Q Consensus 126 ~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 205 (602) +..|++++|+||+|.+|++..+. ..|+++ ..++ T Consensus 123 ~~~G~~~~L~~l~~~~v~~~~~~-----------------~~~~~~------------------------------~~~g 155 (351) T protein:vir:79 123 NMVGGTLRLEPALAKYVRRKADF-----------------SGFVYV------------------------------NGWQ 155 (351) T ss_pred CCCCCEEEEEEeCCcceeeeecC-----------------CeEEEE------------------------------ecCc Confidence 99999999999999999864332 122222 2234 Q ss_pred eeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH Q lcl|NC_021537. 
206 ELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN 285 (602) Q Consensus 206 ~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~ 285 (602) ...+|+++||||+|.+++.+++||+|++.+++.++....+++.|+.++|+||++|++||++++..+++++.+++++.|++ T Consensus 156 ~~~~~~~~eIihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~~~~ 235 (351) T protein:vir:79 156 ERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKN 235 (351) T ss_pred eEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHH Confidence 56689999999999999999999999999999999999999999999999999999999999888999999999999999 Q ss_pred hhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccC Q lcl|NC_021537. 286 LKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRAN 363 (602) Q Consensus 286 ~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn 363 (602) .+|..|+++++++..+ +.+.+++++|++.+ ++|+||++++++++++||++|||||.++|+.++ ++++| T Consensus 236 ~~G~~N~~~~~v~~~~---------g~~~gi~~~pl~~~-~~d~ef~e~k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n 305 (351) T protein:vir:79 236 AKGPGNFRNVFMYAPG---------GKKDGIQLIPVSEV-AAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGT 305 (351) T ss_pred hcCccccCceeEecCC---------CCccceEEEEcCCC-hhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCccc Confidence 9999999999886543 34568899999865 578999999999999999999999999998764 56899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHH Q lcl|NC_021537. 364 SKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKM 419 (602) Q Consensus 364 ~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~ 419 (602) +|++.+.|+++||.|+++.||+ +|.+|. ...++|+..++++. |++. 
T Consensus 306 ~e~~~~~f~~~~l~Pl~~~ie~-ln~~lg-------~~~~~F~~~~llr~--d~~a 351 (351) T protein:vir:79 306 PDTAARVFGRNEIRPLQARFAE-LNDWLG-------DEVVTFDDYEIPPA--PVAA 351 (351) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHhhcC-------cceeeeChhhhccc--cccC Confidence 9999999999999999999985 777662 12479999988877 4332 No 90 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=100.00 E-value=2.8e-62 Score=358.02 Aligned_cols=311 Identities=20% Similarity=0.244 Sum_probs=256.9 Q ss_pred CCCCc------------cccc----------ccch-----hhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhh Q lcl|NC_021537. 1 MSKAE------------ETTQ----------LDER-----HIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYE 53 (602) Q Consensus 1 ~~k~~------------~~~~----------~~~~-----~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~i 53 (602) |+|.. .... ++.+ --....+++++||+++..|+++.+.|+++.+||..+++.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~s~i~~k~n~l 80 (340) T protein:vir:98 1 MSKRKPRKAVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLRSAVHHSSPIYVKRNVL 80 (340) T ss_pred CCCCCCCccccccccCccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHHhccccchhhhhhhhHH Confidence 22211 0000 1111 1223346799999999999999999999999999988776 Q ss_pred ccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEE Q lcl|NC_021537. 54 AGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVG 133 (602) Q Consensus 54 a~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~ 133 (602) ++. 
-+||+.+|..+|+ +++.|++++||||++++||..|++++ T Consensus 81 ~~~-------------------------------------~~Pn~~lt~~~f~-~~~~d~ll~Gnay~~~~rn~~G~~~~ 122 (340) T protein:vir:98 81 AST-------------------------------------YIPHPLLSRQDFS-RFALDYLVFGNAFLEQRHSVTGQLIK 122 (340) T ss_pred hhc-------------------------------------cCCCCCCCHHHHH-HHHHHHHhcCCeEEEEEECCCCcEEE Confidence 641 1478899999986 56679999999999999999999999 Q ss_pred EEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechh Q lcl|NC_021537. 134 LAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPAN 213 (602) Q Consensus 134 L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 213 (602) |+|+++.+|++..+. ..|+++ ..++....|+++ T Consensus 123 L~pl~~~~vr~~~~~-----------------~~~~~~------------------------------~~~~~~~~~~~~ 155 (340) T protein:vir:98 123 LLTSPAKYTRRGVDD-----------------SVFWFV------------------------------ENFTQPHEFAPD 155 (340) T ss_pred EEEeCCceEEEcccC-----------------cEEEEE------------------------------ecCCeEEEEccc Confidence 999999999864321 223332 223456678999 Q ss_pred HEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccC Q lcl|NC_021537. 
214 ELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRT 293 (602) Q Consensus 214 eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag 293 (602) ||||+|.+++.+++||+|++..+++++....+++.|+.++|+||++|++||.+++..+++++.+++++.|++.+|..|++ T Consensus 156 eViHir~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~ 235 (340) T protein:vir:98 156 TVFHLLEPDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVTDPAQSATDVESLRDAMRNSKGLGNFK 235 (340) T ss_pred cEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccC Confidence 99999999999999999999999999999999999999999999999999999988899999999999999999999999 Q ss_pred cceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccCHHHHHHHH Q lcl|NC_021537. 294 AILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRANSKEQTREF 371 (602) Q Consensus 294 ~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~f 371 (602) +++++.++ +.+.+++++|++.+ ++|+||++++++++++||++|||||.++|+.++ ++++|+|++.+.| T Consensus 236 ~~~vl~~~---------g~~~g~~~~pls~~-~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f 305 (340) T protein:vir:98 236 NLFFYSPN---------GKPDGIKIVPLSEV-ATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKVAKVF 305 (340) T ss_pred ceeEecCC---------CCccceEEEEcCCC-hhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHH Confidence 99887543 34568899999865 578999999999999999999999999998654 5689999999999 Q ss_pred HHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchh Q lcl|NC_021537. 
372 AKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPE 414 (602) Q Consensus 372 ~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~ 414 (602) +++||.|+++.||+ +|.+|..+ .++|+..++++.+ T Consensus 306 ~~~~l~Pl~~~iee-~n~~L~~e-------~~rF~~~~l~~~d 340 (340) T protein:vir:98 306 VRNELSPLQDRFRE-VNDWLGME-------VIRFKEYTLDNPE 340 (340) T ss_pred HHHHHHHHHHHHHH-HHhccccc-------ccccCccccccCC Confidence 99999999999985 78877432 2678888888764 No 91 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=100.00 E-value=5.4e-62 Score=356.49 Aligned_cols=314 Identities=18% Similarity=0.241 Sum_probs=258.1 Q ss_pred CCCC------------------ccccc------------ccch-----hhhcccCccccCCCCHHHHHHHHhhhHHHHHH Q lcl|NC_021537. 1 MSKA------------------EETTQ------------LDER-----HIATDVGRGIQPPYNPETLAAFQELNETHQAC 45 (602) Q Consensus 1 ~~k~------------------~~~~~------------~~~~-----~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~c 45 (602) |+|. +.+.+ +..+ .-.+..|++++||+++..|+++.+.|+.+.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~~ 80 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSA 80 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecCCCCHHHHHHHHhhhHhhhhh Confidence 1111 00011 1111 12233467999999999999999999999999 Q ss_pred HHHHHHhhccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEee Q lcl|NC_021537. 46 IRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILV 125 (602) Q Consensus 46 I~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r 125 (602) |..+++.+++. 
-+||+.+|..+|+ +++.|++++||||++++| T Consensus 81 l~~k~n~l~~~-------------------------------------~~Pn~~~t~~~f~-~~~~d~ll~Gnay~~~~r 122 (351) T protein:vir:78 81 LFFKANVLAST-------------------------------------FRPHRWLSRHAFE-RWALDFLTFGNGYLERRR 122 (351) T ss_pred hhhhhhHHhhc-------------------------------------ccCCCCCCHHHHH-HHHHHHHhcCCeEEEEEE Confidence 99887776541 1478899999996 466799999999999999 Q ss_pred CCCCceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCc Q lcl|NC_021537. 126 EGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAG 205 (602) Q Consensus 126 ~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 205 (602) +..|++++|+||++.+|++..+.. .|+++ ..++ T Consensus 123 n~~G~~~~L~pl~~~~v~~~~~~~-----------------~~~~~------------------------------~~~~ 155 (351) T protein:vir:78 123 NMVGGTLRLEPALAKYVRRKADFS-----------------GFVYV------------------------------NGWQ 155 (351) T ss_pred CCCCCEEEEEEecCcceEEeeeCC-----------------eEEEE------------------------------ecCC Confidence 999999999999999998654321 22222 1234 Q ss_pred eeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH Q lcl|NC_021537. 
206 ELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN 285 (602) Q Consensus 206 ~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~ 285 (602) ...+|+++||||+|.+++.+++||+|++..+++++....++..|+.++|+||++|++||+++++.+++++.+++++.|++ T Consensus 156 ~~~~~~~~eVihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~~~~ls~e~~~~lr~~~~~ 235 (351) T protein:vir:78 156 ERHEFAPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKN 235 (351) T ss_pred eEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHH Confidence 56789999999999999999999999999999999999999999999999999999999999888999999999999999 Q ss_pred hhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccC Q lcl|NC_021537. 286 LKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRAN 363 (602) Q Consensus 286 ~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn 363 (602) .+|..|+++++++..+ +.+.+++++|++.+ +.|+||+|++++++++||++|||||.++|+.++ ++++| T Consensus 236 ~~G~~N~~~~~v~~~~---------g~~~g~k~~pls~~-~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn 305 (351) T protein:vir:78 236 AKGPGNFRNVFMYAPG---------GKKDGIQLIPVSEV-AAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGT 305 (351) T ss_pred hcCcccccceeeecCC---------CCccceeEEEcCCC-hhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCccc Confidence 9999999999887543 34568899999865 578999999999999999999999999998765 56899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHH Q lcl|NC_021537. 364 SKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKM 419 (602) Q Consensus 364 ~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~ 419 (602) +|++.+.|+++||.|+++.||+ +|.+|.. ..|+|+..++++.+. +. 
T Consensus 306 ~e~~~~~f~~~~l~P~~~~iee-~n~~l~~-------~~~~F~~~~Llr~d~--ka 351 (351) T protein:vir:78 306 PDTAARVFGRNEIRPLQARFAE-LNDWLGD-------EVVRFDDYEIPPAPV--AA 351 (351) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHhhcCc-------cceecChhhhccccc--cC Confidence 9999999999999999999986 6666532 248999999987743 33 No 92 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=100.00 E-value=1.1e-61 Score=354.77 Aligned_cols=373 Identities=13% Similarity=0.100 Sum_probs=244.7 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) +.+.. .+......+ +..+.. .....+..+++|++||++||++||++||+++.+.+.. T Consensus 7 ~~~~~-~~~~~~~~~----~~~~~~-----~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~------------- 63 (395) T protein:vir:96 7 FSFKK-SGTLSDDDS----GSTTSE-----KLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLT------------- 63 (395) T ss_pred hcCCC-Ccccccccc----ccchhh-----hcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCccc------------- Confidence 22211 111111111 111111 1123344578999999999999999999998642211 Q ss_pred Hhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchhh Q lcl|NC_021537. 81 DFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEV 159 (602) Q Consensus 81 ~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~ 159 (602) ...++...++ .+||++||+.+||+.++.++++.||||+++.|+..+. +.+ ..+... 
T Consensus 64 ---~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~-----~~~-~~~~~~-------------- 120 (395) T protein:vir:96 64 ---ENQKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGIY-----VAD-AFTQDK-------------- 120 (395) T ss_pred ---cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCcee-----cCC-cccccc-------------- Confidence 1124455555 4899999999999999999999999999999875432 111 111100 Q ss_pred hhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHH Q lcl|NC_021537. 160 ENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQT 239 (602) Q Consensus 160 ~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~ 239 (602) ...+..|.++..+ .......++++||||||..++....++.+++..+... T Consensus 121 --~~~~~~~~~v~~~----------------------------~~~~~~~~~~~dvih~k~~~~~~~~~~~~~~~~~~~~ 170 (395) T protein:vir:96 121 --KLSGNKFKVSRVQ----------------------------GQTYEKIFTFDQVIYLKNDNSDLMLKVESLWEEYGEL 170 (395) T ss_pred --ccccceeeeeeec----------------------------cceeeeEeccCceEEecccCCccccccccccchHHHH Confidence 0001112221100 0112356899999999987765555555544443333 Q ss_pred HHH------HHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcc--cccCcceeccCCccceeccccc Q lcl|NC_021537. 240 MGA------DQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGS--RYRTAILEVEEFVDDHGLGDGG 311 (602) Q Consensus 240 i~~------~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~--~nag~~~~~~~g~~~~~~~~~~ 311 (602) +.. ...+.++..++|.+|+.+.+++...+.. ..+..+++|++..+. .++++++++++|++ T Consensus 171 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~-------- 238 (395) T protein:vir:96 171 LGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGGR----QPKSDKDFFKRTIEKIRTESVVGIPVTANTN-------- 238 (395) T ss_pred HHHHHHHHHHHHHHHHHhhhcccccccceeeccCchh----hHHHHHHHHHHHHHHhhcCCcceEEccCCce-------- Confidence 322 2334467889999999999998876543 334455555554433 34555666666654 Q ss_pred cccccccccccccchHHHHHHHHHHhh------HHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 
312 SDVNIELEPIGAREDLDMEFQAFRERN------EHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSA 385 (602) Q Consensus 312 ~~~~~~~~pl~~~~~~d~qf~e~~~~~------~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~ 385 (602) +++++ +++.|+|+++.+++. +++||++|||||++|| ++++|+|++.+.|+++||.||++.||+ T Consensus 239 ------~~~l~-~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~----~~~sn~e~~~~~f~~~~L~P~~~~ie~ 307 (395) T protein:vir:96 239 ------YEEYG-SKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH----GDIADNQKNYELLLEGPIESLITNIVD 307 (395) T ss_pred ------eEecc-cChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCCccHHHHHHHHHHHHHHHHHHHHHH Confidence 44554 345678887777665 5899999999999996 688999999999999999999999999 Q ss_pred HHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccc-cccccccc Q lcl|NC_021537. 386 RLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTL-SEFEAEFG 464 (602) Q Consensus 386 ~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~-~~~~~~~~ 464 (602) +|+++|+++.+...+++ |+++.+++. |.+.+++++++++++|++|+||+|+++|+||++++++|.+. +.|+++.. T Consensus 308 ~l~~~Ll~~~e~~~~~~--f~~~~l~~~--d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~~~gD~~~~~~N~~~~~ 383 (395) T protein:vir:96 308 GLEYAIFDKSETLEGSF--IKVTGLKNY--DLFSISSQADKLISSGFVFIDEVREEIGLPELPDGLGKVLYMTKNYESVL 383 (395) T ss_pred HHHhhcCChhhhcCcee--Eeecchhcc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceech Confidence 99999999877666655 555667766 77788899999999999999999999999999998888655 44555543 Q ss_pred ccccCCCcCcccccc Q lcl|NC_021537. 465 ADASDGDAEAMLTRS 479 (602) Q Consensus 465 ~~~~~~~~~~~~~~~ 479 (602) .. +++.++. .+. 
T Consensus 384 ~~--gge~~~~-~~~ 395 (395) T protein:vir:96 384 ER--GGEVDEE-VET 395 (395) T ss_pred hc--cCCCCCC-CCC Confidence 21 1111000 000 No 93 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=100.00 E-value=2.5e-61 Score=352.87 Aligned_cols=318 Identities=19% Similarity=0.218 Sum_probs=258.4 Q ss_pred CCCCccc------------------------ccccchhh-hcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhcc Q lcl|NC_021537. 1 MSKAEET------------------------TQLDERHI-ATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAG 55 (602) Q Consensus 1 ~~k~~~~------------------------~~~~~~~~-~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~ 55 (602) |+|-... ..++.... ..+.+++++||+++..|+++.+.|+.+.+||..+++.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~~~~~~~~h~~~i~~k~n~l~~ 80 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLSEITASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRANMVSA 80 (345) T ss_pred CCccccccchhhhcCCCceEEEeecCCcccchhhcccceeeecCCccccCCCCHHHHHHHhhcchhhcchhhhhhhHHhh Confidence 2221110 11111111 1235679999999999999999999999999888876653 Q ss_pred CceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEE Q lcl|NC_021537. 56 YGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLA 135 (602) Q Consensus 56 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~ 135 (602) . 
-+||+.+|..+|++ ++.|++++||||++++|+..|++++|+ T Consensus 81 ~-------------------------------------~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~i~rn~~G~~~~L~ 122 (345) T protein:vir:37 81 T-------------------------------------YEGGKALSKMEMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLV 122 (345) T ss_pred c-------------------------------------cCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEE Confidence 1 14788999999965 567999999999999999999999999 Q ss_pred EeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHE Q lcl|NC_021537. 136 HVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANEL 215 (602) Q Consensus 136 ~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ev 215 (602) |++|.+|++..+. ..++++.. ......+...+|+++|| T Consensus 123 pl~~~~vr~~~d~-----------------~~~~~~~~-------------------------~~~~~~g~~~~~~~~eV 160 (345) T protein:vir:37 123 PLSSLYLRVHKDG-----------------GYSYLMKK-------------------------SLYDTAQEIYRYDAKDI 160 (345) T ss_pred EecCceeEEeecC-----------------CeeEEEee-------------------------eeeccCceEEEEccccE Confidence 9999999875432 22333221 11223456678999999 Q ss_pred EEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcc Q lcl|NC_021537. 
216 IFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAI 295 (602) Q Consensus 216 iH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~ 295 (602) ||+|.+++.+++||+|++..+++++....++++|+.++|+||++|++||.++++.+++++.+++++.|++.+|+.|.+.+ T Consensus 161 iHir~~~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t~~~l~~e~~~~lk~~~~~~~g~~n~~~~ 240 (345) T protein:vir:37 161 IFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIARKISESKGVGNFRSM 240 (345) T ss_pred EEEcCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHhcCccccCce Confidence 99999999999999999999999999999999999999999999999999998889999999999999998888887766 Q ss_pred eeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccCHHHHHHHHHH Q lcl|NC_021537. 296 LEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRANSKEQTREFAK 373 (602) Q Consensus 296 ~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~f~~ 373 (602) ++..++ +...+++++|++.++ .|+||++++++++++||++|||||.++|+.++ ++++|+|++.+.|++ T Consensus 241 ~i~~~~---------g~~~G~~~~pl~~~~-~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~~~~f~~ 310 (345) T protein:vir:37 241 FVNIAG---------GHPDGLKVIPIGDTG-TKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHY 310 (345) T ss_pred eEecCC---------CCccceeEEEccCCh-hHHHHHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHHHHHHHHH Confidence 655433 234578999998754 68999999999999999999999999998654 569999999999999 Q ss_pred HHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcc Q lcl|NC_021537. 
374 GIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQ 412 (602) Q Consensus 374 ~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~ 412 (602) +||.|++++|++++|+.+ +...++.++|+..++++ T Consensus 311 ~~l~P~~~~ie~~ln~~~----e~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 311 DEVMPLQEIIAETINQDP----EIKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHHHhhhhh----ccCCcceEEECchhhcC Confidence 999999999999999743 12246789999988876 No 94 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=100.00 E-value=1.1e-61 Score=354.76 Aligned_cols=323 Identities=18% Similarity=0.208 Sum_probs=255.0 Q ss_pred CCCCcccccc------------cch-----hhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_021537. 1 MSKAEETTQL------------DER-----HIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAH 63 (602) Q Consensus 1 ~~k~~~~~~~------------~~~-----~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~ 63 (602) -.|+..+.+. +.+ .-.+..+++++||+++..|+++++.++.+.+|+... T Consensus 27 ~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~~~~~~~h~~~~~~~-------------- 92 (368) T protein:vir:79 27 EHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLARSFRAAAHHSSAVYVK-------------- 92 (368) T ss_pred hhhccccCceEEEEcCCceeecchhhHHHHHHHHhccchhccCcCHHHHHHHHhhccccchhhhhh-------------- Confidence 0010001110 011 112333568889999988888888777766554332 Q ss_pred cCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccc Q lcl|NC_021537. 
64 PSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVR 143 (602) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~ 143 (602) ++...++.+||+.||+.+|++ ++.|++++||||++++|+..|++++|+||+|.+|+ T Consensus 93 -----------------------~n~l~l~~~Pn~~~t~~~f~~-l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~ 148 (368) T protein:vir:79 93 -----------------------RNILVSTFIPHPLLSRATFER-LVLDWQVFGNAYLERRENVLGGTIRLDTPLAKYVR 148 (368) T ss_pred -----------------------cchhhhhcCCCcCCCHHHHHH-HHHHHhhcCCeEEEEEEcCCCCEEEEEEeCcccce Confidence 233345668999999999975 78899999999999999999999999999999998 Q ss_pred ccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCC Q lcl|NC_021537. 144 VRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSP 223 (602) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~ 223 (602) +..+. ..|+++ ..++...+|++++|||+|.+++ T Consensus 149 ~~~~~-----------------~~~~~~------------------------------~~~~~~~~~~~~dIihir~~~~ 181 (368) T protein:vir:79 149 RGLDL-----------------NTYFFV------------------------------QNWQQPYTFAAGSVFHLQEPDI 181 (368) T ss_pred eeccC-----------------CEEEEE------------------------------ecCCeEEEEccccEEEecCCCC Confidence 65432 223222 1234566899999999999999 Q ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCcc Q lcl|NC_021537. 
224 LALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVD 303 (602) Q Consensus 224 ~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~ 303 (602) .+++||+||+.+++.++....+++.|+.++|+||++|++||++++..+++++.+++++.|++.+|..|+++++++.++ T Consensus 182 ~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~~~l~~e~~~~lk~~~~~~~G~~N~g~~~vl~~~-- 259 (368) T protein:vir:79 182 NQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTDAAQKQEDVDTLREAMKSAKGPGNFRNLFMYAPN-- 259 (368) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCCcccCceeEecCC-- Confidence 999999999999999999999999999999999999999999998889999999999999999999999999987543 Q ss_pred ceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccCHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 304 DHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRANSKEQTREFAKGIIEPEQA 381 (602) Q Consensus 304 ~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~f~~~~l~P~~~ 381 (602) +.+.+++++|++. +++|+||+|++++++++||++|||||.+||+.++ ++++|+|++.+.|+++||.|+++ T Consensus 260 -------g~~~g~~~~pls~-~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~ 331 (368) T protein:vir:79 260 -------GKKDGIQLLPVSE-VAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFGDVEKAAMVFARNEVKPLQD 331 (368) T ss_pred -------CCccceeEEEcCC-CHHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHHHHHHHHHHHHH Confidence 3466889999985 5579999999999999999999999999998765 45899999999999999999999 Q ss_pred HHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhC Q lcl|NC_021537. 382 KFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLA 430 (602) Q Consensus 382 ~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~ 430 (602) .|+ ++|.+|.. ..++|+...+++. |.+.++.. 
....+ T Consensus 332 ~ie-~ln~~l~~-------e~~rF~~~~l~~~--D~~a~a~~--~~rsa 368 (368) T protein:vir:79 332 RLL-AINDWIGD-------EVVRFAPYALGGH--DQPAAAPG--GQRSA 368 (368) T ss_pred HHH-HHHhccCc-------ceeeechhHhhcc--cccccCCc--ccccC Confidence 998 57877632 2588999888877 55544431 11111 No 95 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=100.00 E-value=3.5e-61 Score=351.99 Aligned_cols=310 Identities=19% Similarity=0.207 Sum_probs=253.3 Q ss_pred CCCCcc---------------------ccc------------ccch-----hhhcccCccccCCCCHHHHHHHHhhhHHH Q lcl|NC_021537. 1 MSKAEE---------------------TTQ------------LDER-----HIATDVGRGIQPPYNPETLAAFQELNETH 42 (602) Q Consensus 1 ~~k~~~---------------------~~~------------~~~~-----~~~~~~~~~i~p~~~~~~l~~~~~~~~~v 42 (602) |+|... ... +..+ ...+..+++++||+++..|+++.+.|+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~la~~~~~~~~h 80 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSMEGLAKSVGSSVYL 80 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHHHHHHHHhhhhhh Confidence 222110 000 1111 12233467999999999999999999999 Q ss_pred HHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEE Q lcl|NC_021537. 43 QACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALE 122 (602) Q Consensus 43 ~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~ 122 (602) .+||..+.+.+++. 
-+||+.+|..+|++ ++.|++++||||++ T Consensus 81 ~~~l~~k~n~l~~~-------------------------------------~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~ 122 (350) T protein:vir:11 81 QSGLKFKRNMLAKT-------------------------------------FIPHRLLSRATFEQ-FSLDWLTFGSAYLE 122 (350) T ss_pred ccchhhhhhhhhhc-------------------------------------ccCCCCCCHHHHHH-HHHHHHhcCCeEEE Confidence 99998876654431 15888999999975 67799999999999 Q ss_pred EeeCCCCceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEe Q lcl|NC_021537. 123 ILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVAS 202 (602) Q Consensus 123 i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 202 (602) ++|+..|++++|+||||.+|++..+. ..|+++ . T Consensus 123 ~~rn~~G~~~~L~~l~~~~vr~~~~~-----------------~~~~~~------------------------------~ 155 (350) T protein:vir:11 123 QPRSRLGTRMPLQAPLAKYMRRGTDL-----------------ETFYQV------------------------------R 155 (350) T ss_pred EEEcCCCCEEEEEEeCCceeEeeecC-----------------CeEEEE------------------------------e Confidence 99999999999999999999864332 223322 2 Q ss_pred cCceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHH Q lcl|NC_021537. 203 DAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNL 282 (602) Q Consensus 203 ~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~ 282 (602) .++...+|+++||||+|.+++.+++||+||+.+++.++....+++.|+.++|+||++|++||++++..+++++.+++++. T Consensus 156 ~~~~~~~~~~~eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~~~ls~e~~~~l~~~ 235 (350) T protein:vir:11 156 SWKDEHEFEKGSVIQLREADINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMTDAAQNEEDIDALRTA 235 (350) T ss_pred eCCeEEEECcccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHH Confidence 23456789999999999999999999999999999999999999999999999999999999999888999999999999 Q ss_pred HHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CC Q lcl|NC_021537. 
283 MDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SN 360 (602) Q Consensus 283 ~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~ 360 (602) |++.+|..|+++++++.++. .+.+++++|++.+ ++|+||+|++++++++||++|||||.++|+..+ ++ T Consensus 236 ~~~~~G~~N~~~~~v~~~~g---------~~~g~~~~pl~~~-~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~ 305 (350) T protein:vir:11 236 LKTAKGPGNFRNLFVYAPNG---------KKEGIQLIPVSEV-AAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGG 305 (350) T ss_pred HHHhcCccccCceeeecCCC---------CccceEEEEcCCC-hhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCC Confidence 99999999999998876543 3568899999865 578999999999999999999999999998755 56 Q ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchh Q lcl|NC_021537. 361 RANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGA 410 (602) Q Consensus 361 ~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~ 410 (602) ++|+|++.+.|+++||.|+++.||+ +|.+|..+.. . ..+|+..++ T Consensus 306 ~sn~e~~~~~f~~~~L~P~~~~ie~-ln~~l~~~~~---~-F~~~~~~~l 350 (350) T protein:vir:11 306 FGSISDAAAVWASLELAPMQTRLQQ-VNEMIGEEVV---R-FAQFDAPGL 350 (350) T ss_pred cCCHHHHHHHHHHHHHHHHHHHHHH-HHhhcCcccc---c-cCcccccCC Confidence 9999999999999999999999985 7878754321 1 235677666 No 96 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=100.00 E-value=3.5e-61 Score=352.00 Aligned_cols=318 Identities=17% Similarity=0.207 Sum_probs=257.0 Q ss_pred CCCCcc-----------------------cccccchh--hhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhcc Q lcl|NC_021537. 1 MSKAEE-----------------------TTQLDERH--IATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAG 55 (602) Q Consensus 1 ~~k~~~-----------------------~~~~~~~~--~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~ 55 (602) |+|-.. ...+.+-. 
+..+.|+|++||+++..|+++.+.|+.+.+||...++.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~l~~~~~~h~~~i~~k~n~l~~ 80 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLNEISASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRANMVSS 80 (345) T ss_pred CCCCccccchhhcccCcceeEEeecCCcccccchhhhhhhhcCCccccCCCCCHHHHHHHhhcccccccceeeechHHHh Confidence 211110 00011111 11245679999999999999999999999999776554432 Q ss_pred CceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEE Q lcl|NC_021537. 56 YGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLA 135 (602) Q Consensus 56 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~ 135 (602) .-+||+.+|..+|++ ++.|++++||||++++|+..|++++|+ T Consensus 81 -------------------------------------~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~ 122 (345) T protein:vir:37 81 -------------------------------------LYEGGKALSRMDMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLV 122 (345) T ss_pred -------------------------------------hccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEE Confidence 125889999999975 567999999999999999999999999 Q ss_pred EeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHE Q lcl|NC_021537. 136 HVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANEL 215 (602) Q Consensus 136 ~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ev 215 (602) ||+|.+|++..+.. .++++.. .....+|...+|+++|| T Consensus 123 pl~~~~vr~~~d~~-----------------~~~~~~~-------------------------~~~~~~g~~~~~~~~dV 160 (345) T protein:vir:37 123 PLSSLYLRVRKDGG-----------------YSYLMKK-------------------------SLYDTAQEIYRYDAKDI 160 (345) T ss_pred EEcCceeEEEEeCC-----------------eeEEEEE-------------------------eEecCCceEEEEccccE Confidence 99999998754322 2222211 11123456778999999 Q ss_pred EEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcc Q lcl|NC_021537. 
216 IFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAI 295 (602) Q Consensus 216 iH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~ 295 (602) ||+|.+++.+++||+|++.+++.++....++++|+.++|+||++|++||+++++.+++++.+++++.|++.+|..|.+++ T Consensus 161 ihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d~~l~~e~~~~lk~~~~~~~g~~n~~~~ 240 (345) T protein:vir:37 161 IFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIARKISESKGVGNFRSM 240 (345) T ss_pred EEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecCCCCCHHHHHHHHHHHHHhcCcccccce Confidence 99999999999999999999999999999999999999999999999999998889999999999999999999999988 Q ss_pred eeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccCHHHHHHHHHH Q lcl|NC_021537. 296 LEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRANSKEQTREFAK 373 (602) Q Consensus 296 ~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~f~~ 373 (602) +++.++ +.+.+++++|++.+ ++|+||++++++++++||++|||||.++|+..+ ++++|+|++.+.|++ T Consensus 241 ~i~~p~---------g~~~G~~~~pls~~-~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~~f~~ 310 (345) T protein:vir:37 241 FVNIAN---------GHPDGLKVIPIGDT-GTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHY 310 (345) T ss_pred EEEcCC---------CcccceEEEEccCC-hhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHHHHHH Confidence 876543 34668899999865 468999999999999999999999999998654 578999999999999 Q ss_pred HHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcc Q lcl|NC_021537. 374 GIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQ 412 (602) Q Consensus 374 ~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~ 412 (602) +||.|+++.|++++|+.+. ...+..++|+..++.. 
T Consensus 311 ~~l~P~~~~ie~~ln~~~~----~~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 311 DEVMPLQEIIAETINQDPE----IKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHHHhhhhcc----CCCcceEEecchhhcC Confidence 9999999999999997542 1235678898777654 No 97 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=100.00 E-value=3.8e-61 Score=351.81 Aligned_cols=312 Identities=19% Similarity=0.208 Sum_probs=249.2 Q ss_pred CCCCccc------------------------------ccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_021537. 1 MSKAEET------------------------------TQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKS 50 (602) Q Consensus 1 ~~k~~~~------------------------------~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia 50 (602) |+|.... .-+.+.--....|.|++||+++..|+++.+.|+.+.+||..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~s~i~~k~ 80 (344) T protein:vir:56 1 MSKKKGKTPQPAAKTMTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPIYVKR 80 (344) T ss_pred CCCCCCCCCchhhHHhhcCCCceEEEEcCCceeecCcchhhhHHHhhhcCccccCCCCHHHHHHHHhhhhhhCccceehh Confidence 2221110 0011111223446799999999999999999999999998877 Q ss_pred HhhccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCc Q lcl|NC_021537. 51 RYEAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGT 130 (602) Q Consensus 51 ~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~ 130 (602) +.+++. 
-+||+.+|..+| ++++.|++++||||++++|+..|+ T Consensus 81 n~l~~~-------------------------------------~~Pnp~~t~~~f-~~~~~d~ll~Gnay~~~~rn~~G~ 122 (344) T protein:vir:56 81 NILAST-------------------------------------FIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGK 122 (344) T ss_pred hhHHhh-------------------------------------cCCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCc Confidence 766541 158899999999 678899999999999999999999 Q ss_pred eEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEe Q lcl|NC_021537. 131 PVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNG 210 (602) Q Consensus 131 ~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 210 (602) +++|+|+++.+|++..+. ..|+++ ..+|....+ T Consensus 123 ~~~L~pl~~~~v~~~~~~-----------------~~~~~~------------------------------~~~g~~~~~ 155 (344) T protein:vir:56 123 VIRLETSPAKYTRRGVEE-----------------DVYWWV------------------------------PSFNEPTAF 155 (344) T ss_pred EEEEEEeCCceeEEeecC-----------------CEEEEE------------------------------ecCCeEEEE Confidence 999999999999864322 223222 234556789 Q ss_pred chhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhccc Q lcl|NC_021537. 211 PANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSR 290 (602) Q Consensus 211 ~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~ 290 (602) ++++|||+|.+++.+++||+||+.+++.++....++++|+.++|+||++|++||+++++.+++++.+++++.|++.+|. 
T Consensus 156 ~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d~~ls~e~~~~lk~~~~~~~g~- 234 (344) T protein:vir:56 156 APGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGR- 234 (344) T ss_pred cCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCC- Confidence 9999999999999999999999999999999999999999999999999999999998889999999999999987654 Q ss_pred ccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccCHHHHH Q lcl|NC_021537. 291 YRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRANSKEQT 368 (602) Q Consensus 291 nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~ 368 (602) |+++++++. .+++...+++++|++.+ +.|+||+|++++++++||++|||||.++|+.++ ++++|+|++. T Consensus 235 ~~~r~l~l~--------~p~g~~~G~~~~pis~~-~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~ 305 (344) T protein:vir:56 235 NNFKNLFLY--------APQGKADGIKIIPLSEV-ATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVA 305 (344) T ss_pred CCccceEEe--------cCCCCccceeEEEcCCC-hHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHH Confidence 667777653 23344568899999865 568999999999999999999999999998664 5699999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHH Q lcl|NC_021537. 369 REFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDA 417 (602) Q Consensus 369 ~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~ 417 (602) +.|+++||.||++.||+ +|.+|..+. ++|+.-.+... 
|+ T Consensus 306 ~~f~~~tL~Pl~~~ie~-~n~~l~~~~-------~~F~~y~l~~~--~~ 344 (344) T protein:vir:56 306 KVFVRNELIPLQDRIRE-INGWIGQEV-------IRFKNYSLDTD--NG 344 (344) T ss_pred HHHHHHHHHHHHHHHHH-HHhhhcccc-------ccCCCcccccc--CC Confidence 99999999999999985 777775332 44544444322 22 No 98 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=100.00 E-value=1.1e-60 Score=349.26 Aligned_cols=308 Identities=19% Similarity=0.257 Sum_probs=250.1 Q ss_pred CCCCcccc-------------------cccch----h---hhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhc Q lcl|NC_021537. 1 MSKAEETT-------------------QLDER----H---IATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEA 54 (602) Q Consensus 1 ~~k~~~~~-------------------~~~~~----~---~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia 54 (602) |+|-.+.+ -++.+ . +....|++++||+++..|+++.+.|+++++|+..+.+.++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~pP~~~~~La~l~~~~~~h~~~L~~k~N~~~ 80 (337) T protein:vir:78 1 MTKRQQQPAQAAASSPRPSVVFSMPEAIDPTAWMTDYTGVFYNPYGEYYQPPIDRKGLAKVARANAHHGAILMARRNMVA 80 (337) T ss_pred CCCcccCcccccccCceeEEEecCcccccCcchhHhhhhhhhccCcceecCCCCHHHHHHHhhcchhhhhHHHhhhcccc Confidence 33221100 01111 1 1224477999999999999999999999999988766443 Q ss_pred cCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEE Q lcl|NC_021537. 
55 GYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGL 134 (602) Q Consensus 55 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L 134 (602) + .+++..+++++++.|++++||||++++||..|++++| T Consensus 81 ~------------------------------------------~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L 118 (337) T protein:vir:78 81 G------------------------------------------RFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGL 118 (337) T ss_pred c------------------------------------------cCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEE Confidence 3 1222346788899999999999999999999999999 Q ss_pred EEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhH Q lcl|NC_021537. 135 AHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANE 214 (602) Q Consensus 135 ~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~e 214 (602) +||||.+|++..+.. +++ ...++....|+++| T Consensus 119 ~pl~~~~v~~~~d~~------------------~~~------------------------------~~~~~~~~~~~~~e 150 (337) T protein:vir:78 119 HPLSSVYLRRREDGC------------------FVY------------------------------LQQGKPNLIYRPDD 150 (337) T ss_pred EEeCCceeEeeeCCe------------------EEE------------------------------EEcCCceEEECCcc Confidence 999999998654311 111 11234456789999 Q ss_pred EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCc Q lcl|NC_021537. 
215 LIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTA 294 (602) Q Consensus 215 viH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~ 294 (602) |||+|.+++.+++||+||+..+++++....++++++.++|+||++|++||++++..+++++.+++++.|++.+|..|.++ T Consensus 151 IiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~~~lk~~~~~~~G~~n~~~ 230 (337) T protein:vir:78 151 VIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNMDDDTEEEMKEMIANSKGVGNFRS 230 (337) T ss_pred EEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHhcCcccccc Confidence 99999999999999999999999999999999999999999999999999999888999999999999999999999999 Q ss_pred ceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccc---cCCccCHHHHHHHH Q lcl|NC_021537. 295 ILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTS---TSNRANSKEQTREF 371 (602) Q Consensus 295 ~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~---~~~~sn~e~~~~~f 371 (602) ++++.+| +.+.+++++|++.+ +.|+||++++++++++||++|||||.++|+.. .++++|+|++.+.| T Consensus 231 ~~v~~~~---------g~~~Gi~~~pis~~-~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~~~f 300 (337) T protein:vir:78 231 MFVNIPD---------GKPDGIKLIPVGDI-ATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYDATY 300 (337) T ss_pred eEEEcCC---------CCccceeEEEcCCC-hhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHHHHH Confidence 8876544 34678899999865 57899999999999999999999999999754 45788999999999 Q ss_pred HHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhc Q lcl|NC_021537. 372 AKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAE 411 (602) Q Consensus 372 ~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~ 411 (602) +++||.|+++.||+++|.+|++... 
..+|+|+...++ T Consensus 301 ~~~~L~P~~~~ie~~~n~~ll~~~~---~~~f~~~~~~~~ 337 (337) T protein:vir:78 301 ARNEVLPLCELVQDAINSAGLPRAL---WVTFRETIGAAV 337 (337) T ss_pred HHHHHHHHHHHHHHHHhhhcCChhh---ceeccccccccC Confidence 9999999999999999998886543 234556555554 No 99 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=100.00 E-value=1.6e-60 Score=348.38 Aligned_cols=312 Identities=18% Similarity=0.208 Sum_probs=247.0 Q ss_pred CCCC-------------cccccc-----------------cchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_021537. 1 MSKA-------------EETTQL-----------------DERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKS 50 (602) Q Consensus 1 ~~k~-------------~~~~~~-----------------~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia 50 (602) |+|. +.+.+. .+.--....|.|++||+++..|+++.+.|+.+.+||..++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i~~k~ 80 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPISFTGLAKSLRAAVHHSSPIYVKR 80 (344) T ss_pred CCcccCCCCCchHHhhcCCcCcEEEEEcCCceeecCCcchhHHHHhhhcCccccCCCCHHHHHHHHHhhhhhccchhhhh Confidence 1111 111111 0111223446799999999999999999999999999877 Q ss_pred HhhccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCc Q lcl|NC_021537. 51 RYEAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGT 130 (602) Q Consensus 51 ~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~ 130 (602) +.+++. 
-+||+.+|..+| ++++.|++++||||++++|+..|+ T Consensus 81 n~l~~~-------------------------------------~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i~rn~~G~ 122 (344) T protein:vir:60 81 NILAST-------------------------------------FIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGK 122 (344) T ss_pred hHHHhh-------------------------------------ccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCc Confidence 665541 158889999998 678899999999999999999999 Q ss_pred eEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEe Q lcl|NC_021537. 131 PVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNG 210 (602) Q Consensus 131 ~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 210 (602) +++|+|||+.+|++..+. ..|+++ ..++....| T Consensus 123 ~~~L~~l~~~~vr~~~~~-----------------~~~~~v------------------------------~~~~~~~~~ 155 (344) T protein:vir:60 123 VIRLETSPAKYTRRGVEE-----------------DVYWWV------------------------------PSFNEPTAF 155 (344) T ss_pred EEEEEEcCcceEEEeecC-----------------CeEEEE------------------------------ccCCeEEEE Confidence 999999999999864322 223332 223456689 Q ss_pred chhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhccc Q lcl|NC_021537. 211 PANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSR 290 (602) Q Consensus 211 ~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~ 290 (602) +++||||+|.+++.+++||+||+..++.++....+++.|+.++|+||++|++||++++..+++++.+++++.|++.+|. 
T Consensus 156 ~~~eIiHir~~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~~ls~e~~~~ik~~~~~~~g~- 234 (344) T protein:vir:60 156 APGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGR- 234 (344) T ss_pred cCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHHhcCC- Confidence 9999999999999999999999999999999999999999999999999999999998889999999999999987765 Q ss_pred ccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccCHHHHH Q lcl|NC_021537. 291 YRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRANSKEQT 368 (602) Q Consensus 291 nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~ 368 (602) ++++.+++. .+++...+++++|++.+ +.|+||+|++++++++||++|||||.++|+.++ ++++|+|++. T Consensus 235 ~~~r~~~l~--------~p~g~~~g~~~~pis~~-~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~ 305 (344) T protein:vir:60 235 NNFKNLFLY--------APQGKADGIKIIPLSEV-ATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVA 305 (344) T ss_pred CCCcceEEe--------cCCCCccceeEEEcCCC-hhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHH Confidence 555555542 22334567899999865 468999999999999999999999999998664 4599999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHH Q lcl|NC_021537. 369 REFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDA 417 (602) Q Consensus 369 ~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~ 417 (602) +.|+++||.||++.|| +||.+|... 
.++|+..++..- |+ T Consensus 306 ~~f~~~~L~Pl~~~~e-~ln~~lg~~-------~i~F~~~~l~~~--d~ 344 (344) T protein:vir:60 306 KVFVRNELIPLQDRIR-EINGWLGQE-------VIRFKNYSLDTD--NG 344 (344) T ss_pred HHHHHHHHHHHHHHHH-HHHHhcCCc-------ccccCccccCCC--CC Confidence 9999999999999998 588887422 134544443322 22 No 100 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=100.00 E-value=6.4e-60 Score=345.09 Aligned_cols=312 Identities=19% Similarity=0.222 Sum_probs=246.9 Q ss_pred CCCCc-------------cccc------------ccch-----hhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_021537. 1 MSKAE-------------ETTQ------------LDER-----HIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKS 50 (602) Q Consensus 1 ~~k~~-------------~~~~------------~~~~-----~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia 50 (602) |+|.. .+.+ ++.+ --+...|.|++||+++..|+++.+.|+.+.+||..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i~~k~ 80 (344) T protein:vir:20 1 MSKKKGKTPQPAAKTMTASGPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPIYVKR 80 (344) T ss_pred CCcccCCCCcchhhhhhccCCceEEEEcCCceEecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhhhhCccceehh Confidence 22110 0000 1111 1223446799999999999999999999999998877 Q ss_pred HhhccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCc Q lcl|NC_021537. 51 RYEAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGT 130 (602) Q Consensus 51 ~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~ 130 (602) +.+++. 
-+||+.+|..+| ++++.|++++||||++++|+..|+ T Consensus 81 n~l~~~-------------------------------------~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~i~rn~~G~ 122 (344) T protein:vir:20 81 NILAST-------------------------------------FIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGK 122 (344) T ss_pred hhHHHh-------------------------------------ccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCc Confidence 655541 147889999998 678899999999999999999999 Q ss_pred eEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEe Q lcl|NC_021537. 131 PVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNG 210 (602) Q Consensus 131 ~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 210 (602) +++|+|+++.+|++..+. ..|+++ ..++....| T Consensus 123 ~~~L~pl~~~~vr~~~~~-----------------~~~~~~------------------------------~~~~~~~~~ 155 (344) T protein:vir:20 123 VIRLETSPAKYTRRGVEE-----------------DVYWWV------------------------------PSFNEPTAF 155 (344) T ss_pred EEEEEEcCCceeEeeecC-----------------CEEEEE------------------------------ccCCeEEEE Confidence 999999999999864322 223332 223456789 Q ss_pred chhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhccc Q lcl|NC_021537. 211 PANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSR 290 (602) Q Consensus 211 ~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~ 290 (602) +++||||+|.+++.+++||+||+..++.++....+++.|+.++|+||++|++||+++++.+++++.+++++.|++.+|. 
T Consensus 156 ~~~eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d~~l~~e~~~~ik~~~~~~~g~- 234 (344) T protein:vir:20 156 APGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGR- 234 (344) T ss_pred cCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHHhcCC- Confidence 9999999999999999999999999999999999999999999999999999999998889999999999999887765 Q ss_pred ccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc--CCccCHHHHH Q lcl|NC_021537. 291 YRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST--SNRANSKEQT 368 (602) Q Consensus 291 nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~ 368 (602) +.++.+++. .+++...+++++|++.++ .|+||+|++++++++||++|||||.++|+.++ ++++|+|++. T Consensus 235 ~n~r~l~l~--------~p~g~~~gi~~~pis~~~-~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~ 305 (344) T protein:vir:20 235 NNFKNLFLY--------APQGKADGIKIIPLSEVA-TKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVA 305 (344) T ss_pred CCccceEEe--------cCCCCccceeEEEcCCCh-hHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHH Confidence 556655542 223345688999998654 68999999999999999999999999998654 5699999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhH Q lcl|NC_021537. 369 REFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQ 415 (602) Q Consensus 369 ~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~ 415 (602) +.|++++|.|+++.|| ++|.+|... 
.++|+..++..-++ T Consensus 306 ~~f~~~~l~P~~~~~e-~in~~lg~~-------~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 306 KVFVRNELIPLQDRIR-EINGWLGQE-------VIRFKNYSLDTDND 344 (344) T ss_pred HHHHHHHHHHHHHHHH-HHHHhcCCc-------ccccCccccccCCC Confidence 9999999999999998 577777432 24455444422211 No 101 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=100.00 E-value=1.1e-58 Score=338.25 Aligned_cols=363 Identities=11% Similarity=-0.012 Sum_probs=240.2 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) +.|-...... ... ++......+ . -...+.++++|++||++||++||++||+++++.+.+...... T Consensus 4 f~k~~~~~~~--~~~-~~~~~~~~~-~----~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~------- 68 (378) T protein:vir:85 4 FGKVVSFSRG--KLN-NDTQRVTAW-Q----NEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTL------- 68 (378) T ss_pred hhhhhhhhhc--ccc-cCCcceeee-e----ccchhhhhHHHHHHHHHHHHhHhhCceeEEEEeccccccccc------- Confidence 2221110000 000 000000000 0 023456789999999999999999999999877654432110 Q ss_pred Hhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEee-CCCCceEEEEEeCcccccccccccccccccchh Q lcl|NC_021537. 81 DFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEILV-EGDGTPVGLAHVPAATVRVRKTTTTIEREDGEE 158 (602) Q Consensus 81 ~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r-~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~ 158 (602) .....|++..++. +||+.||+.+||+.++.++++.||||+++++ +..|++..+++.. 
T Consensus 69 -~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~~-------------------- 127 (378) T protein:vir:85 69 -ISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLLFAN-------------------- 127 (378) T ss_pred -cccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEEecC-------------------- Confidence 1122456666665 7999999999999999999999999998654 4455544332210 Q ss_pred hhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHH Q lcl|NC_021537. 159 VENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQ 238 (602) Q Consensus 159 ~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~ 238 (602) ..+.+.++||||++.+...++ +.+.+..+.. T Consensus 128 -----------------------------------------------~~~~~~~~dvih~~~~~~~~~--~~~~~~~a~~ 158 (378) T protein:vir:85 128 -----------------------------------------------DKKEYKPEELVRLVSPFYINE--DTSILDNALA 158 (378) T ss_pred -----------------------------------------------CCEEEcccceEEEecCcCccc--hhhHHHHHHH Confidence 112456789999985432332 3344444433 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHH----HHhhcccccCcceeccCCccceecccccccc Q lcl|NC_021537. 239 TMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLM----DNLKGSRYRTAILEVEEFVDDHGLGDGGSDV 314 (602) Q Consensus 239 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~----~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~ 314 (602) .+ ...++ ++.++|+|+.++ .+++++.+.+++.| +...++.|+++++++++|+++. T Consensus 159 ~~----------~~~~~-~~~~~g~l~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~--------- 217 (378) T protein:vir:85 159 SI----------QTKLE-QGKLRGLLKINA-FLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIV--------- 217 (378) T ss_pred HH----------HHHHh-cCCcceEEEeCC-cCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEE--------- Confidence 32 22334 457899999876 47777766665555 4456778899999988776544 Q ss_pred ccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc Q lcl|NC_021537. 
315 NIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQD 394 (602) Q Consensus 315 ~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~ 394 (602) +++ ++++++++ +.++++.++||++|||||.+|+ .++++++...|+++||.||++.||++||++|+++ T Consensus 218 -----~l~-~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~------~s~~e~~~~~f~~~tL~P~~~~ie~~l~~kLl~~ 284 (378) T protein:vir:85 218 -----ELK-KDYSVLNK-DEIELIKSELLTGYFMNENILL------GTATQEQQIYFYNSTIIPLLIQLEKELTYKLIST 284 (378) T ss_pred -----ecc-CChhhhhH-HHHHHHHHHHHHHhCCCHHHhc------CCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCh Confidence 444 34567886 6678999999999999999994 2445888999999999999999999999999998 Q ss_pred cccccce------EEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccccccccccccccc Q lcl|NC_021537. 395 ALDVDEW------TIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADAS 468 (602) Q Consensus 395 ~~~~~~~------~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~ 468 (602) .++..++ +++|+.+.+++. |.+.+++++.+++++|+||+||+|+++|+||++||+ ..+++.|.++++.... T Consensus 285 ~er~~~~~~~~~~~~~f~~~~l~~~--d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~gGD-~~~~~~N~~~~~~~~~ 361 (378) T protein:vir:85 285 NRRRVVKGNLYYERIIVDNQLFKFA--TLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD-IYIANLNAVAVKNLSD 361 (378) T ss_pred hhhhhhhhccccceeeecchhhhhc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eEeecccccccccchh Confidence 7665443 477888888876 777888999999999999999999999999999864 3445666665543222 Q ss_pred CCCcCcccccccccccccccccc Q lcl|NC_021537. 469 DGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) .... ....... 
+...++ T Consensus 362 ~~~~-~~~~~~~-----~e~~n~ 378 (378) T protein:vir:85 362 LQGS-RKDVAST-----DETNNQ 378 (378) T ss_pred hcCc-cCCCCCC-----CCCCCC Confidence 2111 1111111 111111 No 102 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=100.00 E-value=3.2e-58 Score=335.76 Aligned_cols=363 Identities=10% Similarity=-0.022 Sum_probs=241.7 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVR 80 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~ 80 (602) +-|-..-. ++.+..+ ..|-.... =...+.++++|++||++||++||++|++++++...+....... T Consensus 4 f~~~~~~~-----~~~~~~~--~~~~~~~~-~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~------ 69 (378) T protein:vir:94 4 FGKVVSFS-----RGKLNND--TQRVTAWQ-NEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLI------ 69 (378) T ss_pred hHHhHhhh-----hcccccC--cceeeeee-cchhhhhhHHHHHHHHHHHHhHhhCceeeeeeccccccccccc------ Confidence 11111100 0100000 01100000 0133556789999999999999999999988766544321111 Q ss_pred Hhhhccchhhhhhc-cCCccCCHHHHHHHHHHHHHhcCCeEEEEe-eCCCCceEEEEEeCcccccccccccccccccchh Q lcl|NC_021537. 81 DFWYGSDSRWQIGP-EGTAMSTPEEVLELGRQDYHGIGWAALEIL-VEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEE 158 (602) Q Consensus 81 ~~~~~~~~~~~l~~-~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~-r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~ 158 (602) ....|++..++. +||+.||+.+||+.++.++++.||||++.+ ++..|++..+++.. 
T Consensus 70 --~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~~-------------------- 127 (378) T protein:vir:94 70 --SMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDLLFAN-------------------- 127 (378) T ss_pred --ccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEEEEec-------------------- Confidence 123356666665 799999999999999999999999999855 45556654433210 Q ss_pred hhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHH Q lcl|NC_021537. 159 VENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQ 238 (602) Q Consensus 159 ~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~ 238 (602) ..+++++++|||++.+...+. +.+++..+.. T Consensus 128 -----------------------------------------------~~~~~~~~dvih~~~~~~~~~--~~~~~~~~~~ 158 (378) T protein:vir:94 128 -----------------------------------------------DKKEYKPEELVRLTSPFYINE--DTSILDNALA 158 (378) T ss_pred -----------------------------------------------CcEEechhceeeecCcCCccc--chhHHHHHHH Confidence 013578899999996543332 4556665554 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHH----HHHHHHHHHhhcccccCcceeccCCccceecccccccc Q lcl|NC_021537. 239 TMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSK----EDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDV 314 (602) Q Consensus 239 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~----~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~ 314 (602) .+. ..+++ +.++|+|+.++ .+++++. +++++.|++..++.|+++++++++|+++ T Consensus 159 ~~~----------~~~~~-~~~~g~l~~~~-~l~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~---------- 216 (378) T protein:vir:94 159 SIQ----------TKLEQ-GKLRGLLKINA-FLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEI---------- 216 (378) T ss_pred HHH----------HHHhh-CCcccceeeCC-cCCHHHHHHHHHHHHHHHHHhhcccccccceeccCCceE---------- Confidence 432 22333 46789999876 4665544 5555566666778888999998877654 Q ss_pred ccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc Q lcl|NC_021537. 
315 NIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQD 394 (602) Q Consensus 315 ~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~ 394 (602) ++++ ++++|+|+ +.++++.++||++|||||.+|+ +++ ++++...|+++||.||++.||++||++|+++ T Consensus 217 ----~~l~-~~~~~~~~-~~~~~~~~~Ia~~fgvPp~~l~----g~~--~e~~~~~f~~~tl~P~~~~ie~~l~~~Ll~~ 284 (378) T protein:vir:94 217 ----VELK-KDYSVLNK-DEIDLIKSELLTGYFMNENILL----GTA--TQEQQIYFYNSTIIPLLIQLEKELTYKLIST 284 (378) T ss_pred ----EEcc-CChHHhhH-HHHHHHHHHHHHHhCCCHHHhc----CCc--hHHHHHHHHHHHHHHHHHHHHHHHHhhcCCh Confidence 4444 35678896 6678999999999999999994 333 3788899999999999999999999999988 Q ss_pred cccccce------EEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccccccccccccccc Q lcl|NC_021537. 395 ALDVDEW------TIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADAS 468 (602) Q Consensus 395 ~~~~~~~------~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~ 468 (602) .++..++ .++|+.+.+++. |.+.+++++.+++++|+||+||+|+++|+||++||+ ..+++.+++++..... T Consensus 285 ~e~~~g~~~~~~~~~~f~~~~l~~~--d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~ggd-~~~~~~n~~~~~~~~~ 361 (378) T protein:vir:94 285 NRRRVVKGNLYYERIIVDNQLFKFA--TLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD-VYIANLNAVAVKNLSD 361 (378) T ss_pred hHhhhhhhhcccceeEeecchhhhc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eeeecccccchhcchh Confidence 6654443 477999998876 778889999999999999999999999999999864 4455556665542222 Q ss_pred CCCcCcccccccccccccccccc Q lcl|NC_021537. 469 DGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) ....... 
....+ ...++ T Consensus 362 ~~~~~~~-~~~~~-----e~~n~ 378 (378) T protein:vir:94 362 LQGNRKD-VTSTD-----ETNNQ 378 (378) T ss_pred cccccCC-CCCCC-----CCCCC Confidence 2111111 01111 11111 No 103 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=100.00 E-value=4.4e-58 Score=335.05 Aligned_cols=277 Identities=13% Similarity=0.123 Sum_probs=232.9 Q ss_pred hccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhh-hccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCce Q lcl|NC_021537. 53 EAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQI-GPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTP 131 (602) Q Consensus 53 ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~ 131 (602) ||++||+++.+.+.. .+++..+ +.+||+.||+.+||+.++.+++++||||++++|+.+|++ T Consensus 1 ia~l~~~~~~~~~~~------------------~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~ 62 (278) T protein:vir:78 1 MASLPLKMYEDYKVV------------------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQP 62 (278) T ss_pred CccceeEEEecCccc------------------ccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcE Confidence 999999998643221 1334444 458999999999999999999999999999999999999 Q ss_pred EEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEec Q lcl|NC_021537. 132 VGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGP 211 (602) Q Consensus 132 ~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 211 (602) ++|+||+|++|++..+..+. 
..+++ +...+|...+++ T Consensus 63 ~~l~~l~~~~v~v~~~~~~~--------------~~~y~-----------------------------~~~~~g~~~~~~ 99 (278) T protein:vir:78 63 SKLFLLNPDVVEMLIENQSR--------------ELYYS-----------------------------IHAATGNKLIVH 99 (278) T ss_pred EEEEEECCceeEEEEcCCCc--------------eEEEE-----------------------------EEcCCceEEEEc Confidence 99999999999876443210 11111 223345677899 Q ss_pred hhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccc Q lcl|NC_021537. 212 ANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRY 291 (602) Q Consensus 212 ~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~n 291 (602) ++||||+|.+++.++++|+||+.++..++....++++++...|.+ .|+++++.++ .+++++.+++++.|++.. .+ T Consensus 100 ~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~i~~~~~-~l~~e~~~~~~~~~~~~~--~~ 174 (278) T protein:vir:78 100 NMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQK--PDSFMLKYGS-NVGKEKRQQVLEDFKQYY--EE 174 (278) T ss_pred cccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhcC--CCcEEEEeCC-CCCHHHHHHHHHHHHHHh--cc Confidence 999999999989999999999999999999999999987655544 5788887654 689999999999998755 36 Q ss_pred cCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHH Q lcl|NC_021537. 292 RTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREF 371 (602) Q Consensus 292 ag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f 371 (602) +|+++++++|+ ++++++ +++.|+||.|+++++.++||++|||||.++|..+++|++|++++.+.| T Consensus 175 ~g~~~vl~~g~--------------~~~~l~-~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~~~ 239 (278) T protein:vir:78 175 NGGILFQEPGV--------------EIEPLP-KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFY 239 (278) T ss_pred CCCceecCCCc--------------eEEEcc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHH Confidence 78888887654 455555 356799999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchh Q lcl|NC_021537. 
372 AKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGA 410 (602) Q Consensus 372 ~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~ 410 (602) +++||+|+++.|+++||++|+++.+...+++|+||.+.+ T Consensus 240 ~~~~l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 240 LQHTLLPIVKQYEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhHhcCCceEEEecccC Confidence 999999999999999999999998888899999999987 No 104 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=100.00 E-value=2.5e-46 Score=270.63 Aligned_cols=217 Identities=14% Similarity=0.236 Sum_probs=175.9 Q ss_pred hhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHH Q lcl|NC_021537. 159 VENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQ 238 (602) Q Consensus 159 ~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~ 238 (602) +.+..+|..+++.. ...+...|...+++++||||+|.+++.+++||+||+.+++. T Consensus 1 ~r~~~dg~~~y~~~-------------------------~~~~~~~g~~~~~~~~eilH~r~~~~~~~~~Glspi~~a~~ 55 (219) T protein:vir:98 1 MRVCKDGNYKYLMK-------------------------KSLYDTKSEIYEYNKNDVIFIKLYDPMQQVYGSPDYVGGIT 55 (219) T ss_pred CceeecCeEEEEEe-------------------------cceecCCceeEEeccccEEEecCCCCCCCcceecHHHHHHH Confidence 22222222111111 01112346678999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccccccccccc Q lcl|NC_021537. 239 TMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIEL 318 (602) Q Consensus 239 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~ 318 (602) ++....++++|+.+||+||++|+|||+++++.+++++++++++.|++.+|..|++.++++.+|. 
...+++| T Consensus 56 ~i~~~~aa~~~~~~~f~Ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg---------~~~G~~~ 126 (219) T protein:vir:98 56 SALLNSDATIFRRRYYSNGAHMGFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGG---------HPDGLKV 126 (219) T ss_pred HHHHHHHHHHHHHHHHhcCCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCC---------CccceeE Confidence 9999999999999999999999999999988899999999999999888988887777664432 2346899 Q ss_pred ccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccc--cCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccc Q lcl|NC_021537. 319 EPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTS--TSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDAL 396 (602) Q Consensus 319 ~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~--~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~ 396 (602) +|++. +++|+||+|++++++.+||++|||||++||+.+ +++++|+|++.+.|+++||+||+++||++||++++.+. T Consensus 127 ~~~~~-~~~d~qfle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~~~~~- 204 (219) T protein:vir:98 127 IPIGD-TGQKDEFANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDYEIKS- 204 (219) T ss_pred EEccC-CHHHHHHHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCC- Confidence 99984 567999999999999999999999999999764 46799999999999999999999999999998755443 Q ss_pred cccceEEEeccchhcchh Q lcl|NC_021537. 397 DVDEWTIDFELRGAEQPE 414 (602) Q Consensus 397 ~~~~~~~~f~~~~~~~~~ 414 (602) +.+++|+.....+.. T Consensus 205 ---~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 205 ---ALKVNFKQPEKRDKN 219 (219) T ss_pred ---ccEEeecCcccccCC Confidence 346788766554442 No 105 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=100.00 E-value=2.9e-41 Score=242.83 Aligned_cols=238 Identities=11% Similarity=0.023 Sum_probs=171.1 Q ss_pred CCCCcc-cccccc----hhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhh Q lcl|NC_021537. 
1 MSKAEE-TTQLDE----RHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGES 75 (602) Q Consensus 1 ~~k~~~-~~~~~~----~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~ 75 (602) .+|... ...... ..+.+-.+ ..-..++. +-+..+++|++||++||++||++||+++++.... T Consensus 7 ~~~r~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~----~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~-------- 73 (251) T protein:vir:46 7 NEKRDLQYNEDDLQMMVQTLPSFQG-TKLRQYKD----IEAIRHSDIFTAVMMIASDLARMPIRVTVNGQIN-------- 73 (251) T ss_pred ccccccCCCccchhhhhhhhccccC-cCcceech----hhhhccHHHHHHHHHHHHhHhhCceEEeeCcccc-------- Confidence 111111 000000 00111000 00111332 2234578899999999999999999997532111 Q ss_pred HHHHHHhhhccchhhhhh-ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccc Q lcl|NC_021537. 76 YQTVRDFWYGSDSRWQIG-PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERE 154 (602) Q Consensus 76 ~~~~~~~~~~~~~~~~l~-~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~ 154 (602) ..|+++.++ .+||+.+|+.+||+.++.+++++||||++++|+.+|++++|+||+|++|++..+..+ T Consensus 74 ---------~~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g---- 140 (251) T protein:vir:46 74 ---------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARG---- 140 (251) T ss_pred ---------ccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCceEEEEECCCC---- Confidence 125556555 689999999999999999999999999999999999999999999999987654321 Q ss_pred cchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHH Q lcl|NC_021537. 155 DGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWV 234 (602) Q Consensus 155 ~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~ 234 (602) ..++.+.. ......+....++++||||+|.++ .++++|+||+. 
T Consensus 141 -----------~~~~~~~~-------------------------~~~~~~g~~~~~~~~diiH~r~~~-~dg~~G~spi~ 183 (251) T protein:vir:46 141 -----------RLYYFHQR-------------------------IDSNGNNIERNVKFEDMLDIKFYS-LDGINGLSLLD 183 (251) T ss_pred -----------cEEEEEEE-------------------------eccCCcceeEEECCccEEEecCcC-CCCeeecCHHH Confidence 11111000 001123556789999999999885 68899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-hcccccCcceeccCCccc Q lcl|NC_021537. 235 AAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-KGSRYRTAILEVEEFVDD 304 (602) Q Consensus 235 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-~g~~nag~~~~~~~g~~~ 304 (602) ++..+|..+.++++++.++|+||++|+|+|++++...++++++++++.|++. .|.+|+|++.+ |++- T Consensus 184 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~e~~~~~~~~~~~~~~g~~n~g~~~~---gm~~ 251 (251) T protein:vir:46 184 TLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFPKVLVELNKLGKLSY---SMNQ 251 (251) T ss_pred HHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccccccc---ccCC Confidence 9999999999999999999999999999999998766888899999999775 55689998664 3221 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.95 E-value=1.6e-27 Score=167.53 Aligned_cols=400 Identities=13% Similarity=0.092 Sum_probs=224.5 Q ss_pred CCCCcccccccchhhhcccC-----ccccC----CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCccc Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVG-----RGIQP----PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDE 71 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~-----~~i~p----~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~ 71 (602) |...+-=. .+....| .+.++ ..+...|.++++.|+.++++|+++++++.+-+|.|.-. +. 
T Consensus 1 ~~~~D~~~-----~~~~~~g~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~------d~ 69 (437) T protein:vir:52 1 MKFFDGIK-----SLALKLGSKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSN------DL 69 (437) T ss_pred CchhhhhH-----hHHhcCCCccccceeecCccccccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecC------CC Confidence 44433200 0111111 11222 25678899999999999999999999999999998631 11 Q ss_pred chhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC---------CceEEEEEeCcccc Q lcl|NC_021537. 72 GGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD---------GTPVGLAHVPAATV 142 (602) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~---------G~~~~L~~l~p~~v 142 (602) ..+..+.+...+.+.+ ..+.+...++..-++|.|++.+++++. |.+..|.++|+..| T Consensus 70 ~~~~~~~~~~~~~~l~--------------~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v 135 (437) T protein:vir:52 70 NSKQLDLFTKFERSLK--------------LRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKI 135 (437) T ss_pred CHHHHHHHHHHHHhhc--------------HHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhc Confidence 2223334444333322 233444555556689999999998763 67788888888877 Q ss_pred cccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCC- Q lcl|NC_021537. 143 RVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNP- 221 (602) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~- 221 (602) .+...... .+.. ..| -.+..+.+ ..++....+.++.||||... T Consensus 136 ~~~~~~~~------dp~s-----~~f------------------------g~p~~y~v-~~~~~~~~iH~SRii~~~~~~ 179 (437) T protein:vir:52 136 SPTGTKDD------DVLS-----PNF------------------------GRYSEYSI-LGGSQSITVHHSRLIILNAND 179 (437) T ss_pred cccccccc------cccc-----ccc------------------------CcceEEEE-ecCCcceeEccceeEEecCcc Confidence 64321100 0000 111 11111111 12233457889999999643 Q ss_pred --CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecc--ccCCHHHHHHHHHHHHHhhcccccCccee Q lcl|NC_021537. 
222 --SPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTG--GTLSEDSKEDLRNLMDNLKGSRYRTAILE 297 (602) Q Consensus 222 --~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~~~~~~~~~~l~~~~~~~~g~~nag~~~~ 297 (602) .+.+.++|+|.++.+...|.....+.......+.+...+ ++++++ ..++....+.+.+.++......+.+.+++ T Consensus 180 ~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 257 (437) T protein:vir:52 180 APLSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKID--IFKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLL 257 (437) T ss_pred CCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCC--ceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEE Confidence 345678999999999999999988888888877776544 344443 12333233344444444333344456666 Q ss_pred ccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHH---- Q lcl|NC_021537. 298 VEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAK---- 373 (602) Q Consensus 298 ~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~---- 373 (602) +..+.++..... +.+ .+.+.......+||++++||..+|...+.++.++.+...+.|+. T Consensus 258 ~d~~~~~e~~~~------------~~s-----gl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~yyd~i~~ 320 (437) T protein:vir:52 258 LDAENEYDRKEL------------TFT-----GLKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQNYHEAIRR 320 (437) T ss_pred EcCCcceEEEec------------CcC-----CHHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHHHHHHHHH Confidence 665544443321 111 12355667788999999999987754455667887878888876 Q ss_pred ---HHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHH---HHHHHHHHHHHHhCCcccHHHHHHHhC---- Q lcl|NC_021537. 374 ---GIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQD---AKMAEQRVRAMRLAGVGTVNEAREELD---- 443 (602) Q Consensus 374 ---~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d---~~~~~~~~~~~~~~G~~T~NE~R~~~G---- 443 (602) ..|+|+++.+-..|-...+.. ...+++++|+--..+...+. .++.++++++++++|+++++|+|+++. 
T Consensus 321 ~Qe~~l~p~le~l~~~i~~~~~g~--~~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~r~~L~~~g~ 398 (437) T protein:vir:52 321 LQETRLRPIFEIIDPLICNELFGG--LPADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQIANELRESGL 398 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCC--CCCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCC Confidence 356777777766665544332 23467888862222221221 245677899999999999999999873 Q ss_pred CCCCCCCccccccccccccccccccCCCcCccccccccccccccccc Q lcl|NC_021537. 444 LAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIG 490 (602) Q Consensus 444 l~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (602) ++.+++.+... .........+.+..+.......+. .+.+ T Consensus 399 ~~~i~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~------~~~~ 437 (437) T protein:vir:52 399 FANISAEHIEE--LKNADEFAGNFEEPEKMEGAQVQN------SEDQ 437 (437) T ss_pred CCCCCcccccc--ccCCCCCCCccCCCCCCCCCCCCC------CCCC Confidence 33343322110 001111111111000000000000 0000 No 107 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.95 E-value=3.3e-28 Score=171.24 Aligned_cols=440 Identities=12% Similarity=0.050 Sum_probs=239.5 Q ss_pred CCCCc-----ccccccchhhhcccC---ccccCCCCH------------HHHHHHHhhhHHHHHHHHHHHHhhccC-ceE Q lcl|NC_021537. 1 MSKAE-----ETTQLDERHIATDVG---RGIQPPYNP------------ETLAAFQELNETHQACIRKKSRYEAGY-GFE 59 (602) Q Consensus 1 ~~k~~-----~~~~~~~~~~~~~~~---~~i~p~~~~------------~~l~~~~~~~~~v~~cI~~ia~~ia~~-~~~ 59 (602) ++... ...+.....-+...+ ++..+..++ ...|++++||+++..+|+.+.+++.|. ++. 
T Consensus 11 ~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~ggi~ 90 (502) T protein:vir:79 11 FSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLEERVVGKNGII 90 (502) T ss_pred cChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhccCCcee Confidence 11000 000000000000000 111122221 334788999999999999999999997 788 Q ss_pred EEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC-------CceE Q lcl|NC_021537. 60 IVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD-------GTPV 132 (602) Q Consensus 60 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~-------G~~~ 132 (602) +.++....... ..+.+...++..+..|.........++++++.+.+++.++..|++|+.+++... +.+. T Consensus 91 ~~~~~~~~~~~----~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~g~~~~l 166 (502) T protein:vir:79 91 VEPHPVLRNGA----IARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTPSAGVHF 166 (502) T ss_pred eeeccCCCChh----HHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCCCcccce Confidence 87776543222 223344445555555655556677899999999999999999999999876543 2367 Q ss_pred EEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEech Q lcl|NC_021537. 133 GLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPA 212 (602) Q Consensus 133 ~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 212 (602) .|..|+|+.|.... ..+- ....|..|-. 
.|+ ...|.+...+|.+ + ...+...+|+ T Consensus 167 ~lq~iepd~l~~~~-~~~~---------~i~~GVe~d~--~Gr-~~aY~i~~~hPgd--------~----~~~~~~rvpA 221 (502) T protein:vir:79 167 WLEALEPDFIPMTS-DESN---------RLNQGVFVDD--WGR-PEKYLVYKSRPVS--------G----RQMETKEVDA 221 (502) T ss_pred EEEEecchhcCCCC-CCCC---------eeEeeeEECC--CCc-eEEEEEeecCCCC--------C----cccceeEech Confidence 99999999985322 1111 1112221100 011 1111111222221 1 1234578999 Q ss_pred hHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhccccc Q lcl|NC_021537. 213 NELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYR 292 (602) Q Consensus 213 ~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~na 292 (602) ++|||+......++.+|+|++..++..+.......+....--+-.+...++|+.+.+.... . ...+.... T Consensus 222 ~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~--~--------~~~~~~~~ 291 (502) T protein:vir:79 222 ERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYE--P--------DGNGSKEN 291 (502) T ss_pred hheEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccc--c--------ccCCCCCc Confidence 9999999998899999999999998888776666666555555677778888765321100 0 00111111 Q ss_pred CcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hccccCCccCHHHHHHHH Q lcl|NC_021537. 293 TAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTSTSNRANSKEQTREF 371 (602) Q Consensus 293 g~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~sn~e~~~~~f 371 (602) .....+++|..+.. +..|.+++.++... 
....|.++.+.....||+.+|||.+.| |+.+ +|||++.+.+..| T Consensus 292 ~~~~~l~pG~i~~~-----L~pGe~i~~~~p~~-p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s-~nySs~R~~~~e~ 364 (502) T protein:vir:79 292 ERELTIQPGIIYDD-----LKPGEEIGMVKSDR-PNPNLETFRNGQLRAVAAGSRLSFSSTARNYN-GTYSAQRQELVES 364 (502) T ss_pred cccccccCCccccc-----cCCCceeeeeCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc-chHHHHHHHHHHH Confidence 12223334432222 22233444433222 235788999999999999999997766 5654 5999987665544 Q ss_pred -----------HHHHHHHHHHH-HHHHHhhhcCCc--cc-cccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHH Q lcl|NC_021537. 372 -----------AKGIIEPEQAK-FSARLYKIIHQD--AL-DVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVN 436 (602) Q Consensus 372 -----------~~~~l~P~~~~-ie~~ln~~Ll~~--~~-~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~N 436 (602) ....++|+.+. ++.++-...++. +. ...-+.++|..... ...|..+.+++...++++|++|.- T Consensus 365 ~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~--~~iDP~Ke~~a~~~~i~~Gl~t~~ 442 (502) T protein:vir:79 365 TDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVM--PWIDPVKEAEAWKIQIRGGAATES 442 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCc--cccChHHHHHHHHHHHHcCCCCHH Confidence 33456665554 344444333321 11 11122334443333 344666677778889999999999 Q ss_pred HHHHHhCCCCCCCCc--------cccccccccccccccccCCCcCccccccccccccccccccc Q lcl|NC_021537. 437 EAREELDLAPFEDDR--------GDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGER 492 (602) Q Consensus 437 E~R~~~Gl~p~~~g~--------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (602) |+-++.|.+|-+.-. .+.+-......++.++...+.... ..+++......|. 
T Consensus 443 ~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~----~~e~~~~~~~~e~ 502 (502) T protein:vir:79 443 DWVRAGGRNPDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATK----RQEPQHTDDQSEE 502 (502) T ss_pred HHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCC----CCCCCCCCCCCCC Confidence 999999998743211 000000111111111111111111 1111111111111 No 108 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.95 E-value=2.3e-28 Score=172.13 Aligned_cols=461 Identities=11% Similarity=-0.023 Sum_probs=243.5 Q ss_pred CCCCcccc-----cc---cchhhh-----cccCccccCCCCH------------HHHHHHHhhhHHHHHHHHHHHHhhcc Q lcl|NC_021537. 1 MSKAEETT-----QL---DERHIA-----TDVGRGIQPPYNP------------ETLAAFQELNETHQACIRKKSRYEAG 55 (602) Q Consensus 1 ~~k~~~~~-----~~---~~~~~~-----~~~~~~i~p~~~~------------~~l~~~~~~~~~v~~cI~~ia~~ia~ 55 (602) |+.-+-+. .+ .....+ ....+|..+..++ ...|++++||+++.+||+.+.++|.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG 80 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVG 80 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhC Confidence 22111111 00 000000 0111222222222 33568899999999999999999999 Q ss_pred CceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhc----cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC-Cc Q lcl|NC_021537. 56 YGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGP----EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD-GT 130 (602) Q Consensus 56 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~----~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~-G~ 130 (602) .||++..+.+......+.+..+.+...++..+..|.... .....+|+.++.+.+++.++..|++|+.+.+... |. 
T Consensus 81 ~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~g~ 160 (530) T protein:vir:38 81 SFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSDSTR 160 (530) T ss_pred CCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeeccCCCC Confidence 999998877654333344445555555555555554432 3456789999999999999999999999887643 32 Q ss_pred --eEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeeccccc-ccccceeeecccceEEecCcee Q lcl|NC_021537. 131 --PVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDR-YGDDKRFVDKETGEVASDAGEL 207 (602) Q Consensus 131 --~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~-~~~~~~~~~~~~g~~~~~~~~~ 207 (602) +..|..|+|+.|....+.. ++ ..+..|..|-. .|+.. .|.+... ++.+ .+......... T Consensus 161 ~~~~~lq~ie~d~l~~~~~~~-----~~---~~i~~GIe~d~--~Gr~~-aY~i~~~~~~~~-------~~~~~~~~~~~ 222 (530) T protein:vir:38 161 LFRTQFKMVSPKRVSNPNNIG-----DT---RNCRAGVKIND--SGAAL-GYYVSDDGYPGW-------MAQNWTYIPRE 222 (530) T ss_pred ccceEEEEechhhcCCCCCCC-----CC---CeeEeeeEECC--CCceE-EEEEeeccCCCc-------cccccceeeee Confidence 6789999999986432210 01 11122222111 11111 1111000 0100 00000011123 Q ss_pred EEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccC----------CHHHHH Q lcl|NC_021537. 208 KNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTL----------SEDSKE 277 (602) Q Consensus 208 ~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~----------~~~~~~ 277 (602) ..+++.+|||+....+.++.+|+|++..++..+.......+....--+-.+.-.++|+...+.. ..+... 
T Consensus 223 ~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~ 302 (530) T protein:vir:38 223 LPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQS 302 (530) T ss_pred eccChhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccc Confidence 5688999999999988899999999999988887766666665555555667777777543211 111111 Q ss_pred HHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hcc Q lcl|NC_021537. 278 DLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVT 356 (602) Q Consensus 278 ~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~ 356 (602) .+........+..+. ..+.+++|. +. .+..|-+++.++.. ....+|.++.+.+...||+.+|||.+.| |+. T Consensus 303 ~~~~~~~~~~~~~~~-~~~~l~pG~-i~-----~L~pGe~i~~~~p~-~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~ 374 (530) T protein:vir:38 303 KLTGWLGEMAAYYSA-APVRLGGAR-VP-----HLLPGDSLNLQSAQ-DTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNY 374 (530) T ss_pred cccccchhhhhcccc-cceeccCce-ee-----ecCCCCeeeeeCCC-CCCCCHHHHHHHHHHHHHhhcCCCHHHHhccc Confidence 122111111111111 112223332 11 12223333333322 1235788999999999999999998866 888 Q ss_pred ccCCccCHHHHHHHHHH-----------HHHHHHHH-HHHHHHhhhcCCccc-cc--------cceEEEeccchhcchhH Q lcl|NC_021537. 357 STSNRANSKEQTREFAK-----------GIIEPEQA-KFSARLYKIIHQDAL-DV--------DEWTIDFELRGAEQPEQ 415 (602) Q Consensus 357 ~~~~~sn~e~~~~~f~~-----------~~l~P~~~-~ie~~ln~~Ll~~~~-~~--------~~~~~~f~~~~~~~~~~ 415 (602) ++.|||++.+.+..|.+ ..++|+.. +++.++....++-.. .. .-..++|..... +.. 
T Consensus 375 s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~--~~i 452 (530) T protein:vir:38 375 SQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGR--MAI 452 (530) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCc--ccc Confidence 88999998776555543 33455554 345555544333111 00 011244444433 344 Q ss_pred HHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcc----ccccccccccccccccCCCcCccccccccccccccccc Q lcl|NC_021537. 416 DAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRG----DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIG 490 (602) Q Consensus 416 d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~----d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (602) |..+.+++...++++|+.|.-++.++.|.+|-+--.. .............++......+. .....+++...... T Consensus 453 DP~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~-~~~~~~~~d~~~~a 530 (530) T protein:vir:38 453 DGLKEVQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGV-KKSNEEEQDGARAA 530 (530) T ss_pred ChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCC-CCCCCCCCCCCCCC Confidence 6677777788999999999999999999877432110 00000000001111111000000 00000000000000 No 109 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.94 E-value=5.2e-28 Score=170.15 Aligned_cols=461 Identities=12% Similarity=0.008 Sum_probs=241.4 Q ss_pred CCCCccccccc--chhh--hcccC----ccccCCCCH------------HHHHHHHhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_021537. 1 MSKAEETTQLD--ERHI--ATDVG----RGIQPPYNP------------ETLAAFQELNETHQACIRKKSRYEAGYGFEI 60 (602) Q Consensus 1 ~~k~~~~~~~~--~~~~--~~~~~----~~i~p~~~~------------~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i 60 (602) +...+.-+... ..+. 
+...+ +|..+.-++ ...|++++||+++..||+.+.++|.|.||++ T Consensus 9 ~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~~ 88 (533) T protein:vir:34 9 LLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDHIVGSFFRL 88 (533) T ss_pred hhcccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhCCCcee Confidence 11111111100 0000 01111 122222222 3356889999999999999999999999999 Q ss_pred EEecCCCCcccchhhHHHHHHhhhccchhhhhhc----cCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC-C--ceEE Q lcl|NC_021537. 61 VAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGP----EGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD-G--TPVG 133 (602) Q Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~----~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~-G--~~~~ 133 (602) ..+.+......+.+..+.+...++..+..|.-.. ......|+.++.+.+++.++..|++|+.+.+... | .+.. T Consensus 89 ~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~g~~~~~~ 168 (533) T protein:vir:34 89 SHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTSSSRLFRTQ 168 (533) T ss_pred eeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeeccCCCCccceE Confidence 8876543333333444555555555555554332 3455679999999999999999999999876544 2 2678 Q ss_pred EEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccc-cccccceeeecccceEEecCceeEEech Q lcl|NC_021537. 134 LAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGD-RYGDDKRFVDKETGEVASDAGELKNGPA 212 (602) Q Consensus 134 L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~-~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 212 (602) |..|+|+.|....+.. ++ ..+..|..|-. .|+. ..|.+.. .++.+. 
+...........+++ T Consensus 169 lq~ie~d~l~~~~~~~-----~~---~~i~~GIe~d~--~Gr~-~aY~i~~~~~~~~~-------~~~~~~~~~~~~v~a 230 (533) T protein:vir:34 169 FRMVSPKRISNPNNTG-----DS---RNCRAGVQIND--SGAA-LGYYVSEDGYPGWM-------PQKWTWIPRELPGGR 230 (533) T ss_pred EEEechhhcCCCCCCC-----CC---CceEeeeEECC--CCCe-EEEEEeecCCCCcc-------ccccceeeeeeccCh Confidence 9999999987432211 00 11122221111 1111 1111100 011100 000000112345789 Q ss_pred hHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccC----------CHHHHHHHHHH Q lcl|NC_021537. 213 NELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTL----------SEDSKEDLRNL 282 (602) Q Consensus 213 ~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~----------~~~~~~~l~~~ 282 (602) .+|||+....+.++.+|+|++..++..+.......+....--+-.+.-.++|+.+.+.. ..+..+.+... T Consensus 231 ~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (533) T protein:vir:34 231 ASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGW 310 (533) T ss_pred hHeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCccccccccccc Confidence 99999999998999999999999988887766666666555566677778887543211 11111111111 Q ss_pred HHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hccccCCc Q lcl|NC_021537. 283 MDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTSTSNR 361 (602) Q Consensus 283 ~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~ 361 (602) .....+ ...+..+.+++|. +.. +..|-+++.++.. 
.....|.++.+.+...||+.+|||.+.| |+.+++|| T Consensus 311 ~~~~~~-~~~~~~~~l~pG~-i~~-----L~pGe~i~~~~~~-~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nY 382 (533) T protein:vir:34 311 IGEIAA-YYAAAPVRLGGAK-VPH-----LMPGDSLNLQTAQ-DTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSY 382 (533) T ss_pred chhhhh-ccCcceeeccCce-eee-----cCCCCeeeecCCC-CCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccH Confidence 111100 0111122233332 111 2223333333222 2245788999999999999999997765 78888999 Q ss_pred cCHHHHHHHH-----------HHHHHHHHHHH-HHHHHhhhcCC-ccc------cccc--eEEEeccchhcchhHHHHHH Q lcl|NC_021537. 362 ANSKEQTREF-----------AKGIIEPEQAK-FSARLYKIIHQ-DAL------DVDE--WTIDFELRGAEQPEQDAKMA 420 (602) Q Consensus 362 sn~e~~~~~f-----------~~~~l~P~~~~-ie~~ln~~Ll~-~~~------~~~~--~~~~f~~~~~~~~~~d~~~~ 420 (602) |++.+.+..| ....++|+... ++.++-...++ +.. .... ..++|..... +..|..+. T Consensus 383 SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~--~~iDP~Ke 460 (533) T protein:vir:34 383 STARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGR--MAIDGLKE 460 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCc--cccChHHH Confidence 9987655444 33445666654 44444443332 110 0001 1344444433 34477777 Q ss_pred HHHHHHHHhCCcccHHHHHHHhCCCCCCCCcc----ccccccccccccccccCCCcCccccccccccccccccccccccc Q lcl|NC_021537. 421 EQRVRAMRLAGVGTVNEAREELDLAPFEDDRG----DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVD 496 (602) Q Consensus 421 ~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~----d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (602) +++...++++|++|.-|+-++.|.+|-+.-.. .............++......+... ...+++.+.. +. 
T Consensus 461 ~~a~~~~i~~G~~s~~~~~a~~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~-~~~~~~~~~~------~~ 533 (533) T protein:vir:34 461 VQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQ-STEEEKSDSR------AA 533 (533) T ss_pred HHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCC-CCCCCcccCC------CC Confidence 77888999999999999999999887432110 0000000011111111100000000 0000000000 00 No 110 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.94 E-value=2.8e-28 Score=171.65 Aligned_cols=445 Identities=11% Similarity=-0.003 Sum_probs=240.1 Q ss_pred CCCCcccccccch------------------hh-----hcccCccc-cCC-CCH------------HHHHHHHhhhHHHH Q lcl|NC_021537. 1 MSKAEETTQLDER------------------HI-----ATDVGRGI-QPP-YNP------------ETLAAFQELNETHQ 43 (602) Q Consensus 1 ~~k~~~~~~~~~~------------------~~-----~~~~~~~i-~p~-~~~------------~~l~~~~~~~~~v~ 43 (602) |..+.....+.++ .| +....+|. .|+ .++ ...|++++||+++. T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~ 80 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAK 80 (505) T ss_pred CCCCccccchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHH Confidence 1111111110000 00 00111122 121 121 33468899999999 Q ss_pred HHHHHHHHhhcc-CceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhh--ccCCccCCHHHHHHHHHHHHHhcCCeE Q lcl|NC_021537. 44 ACIRKKSRYEAG-YGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIG--PEGTAMSTPEEVLELGRQDYHGIGWAA 120 (602) Q Consensus 44 ~cI~~ia~~ia~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~pn~~~t~~~~~~~~~~d~l~~Gna~ 120 (602) .+|+.+.++|.| .||.+..+.+......+.+ +...++..+..|... 
.+....+|++++.+.+++.++..|++| T Consensus 81 ~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~----~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f 156 (505) T protein:vir:96 81 RFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDR----ANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVL 156 (505) T ss_pred HHHHHHHHHhcCCCcceeeecCCcccccccHH----HHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceE Confidence 999999999998 7999888765432222222 333344444445332 234567899999999999999999999 Q ss_pred EEEeeCCCC-ceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccce Q lcl|NC_021537. 121 LEILVEGDG-TPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGE 199 (602) Q Consensus 121 ~~i~r~~~G-~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 199 (602) +.+.+...+ .+..|..|+|+.|....+... .++ ..+..|..|-. .| ....|.+...+|.+..... T Consensus 157 ~~~~~~~~~~~~~~lqliepd~l~~~~n~~~---~~~---~~i~~GIe~d~--~G-r~~aY~i~~~hPgd~~~~~----- 222 (505) T protein:vir:96 157 VREHRGYPNKWGYALQILECDRLDLNYNADL---QNG---NRIRMSIELDA--WE-RPVAYHLLVNHPGDNSYCY----- 222 (505) T ss_pred EEEeecCCCCcceEEEEechhhcCCCCCccc---CCc---CeEEeceEECC--CC-ceEEEEEeecCCCcccccc----- Confidence 988765433 467899999999864322110 000 11122222111 01 1112222222232211110 Q ss_pred EEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHH Q lcl|NC_021537. 
200 VASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDL 279 (602) Q Consensus 200 ~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l 279 (602) .........+|+.+|||+......++.+|+|.+..++..+.......+....-.+=.+...++|+...+...+...+ T Consensus 223 -~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~-- 299 (505) T protein:vir:96 223 -HYAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPED-- 299 (505) T ss_pred -ccccccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCcccc-- Confidence 01123456789999999999999999999999999988877666666665555555677778887644332221111 Q ss_pred HHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hcccc Q lcl|NC_021537. 280 RNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTST 358 (602) Q Consensus 280 ~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~ 358 (602) .....+..+++|+ +. .+..|-+++.++... ....|.++.+.+.+.||+.+|||.+.| |+.++ T Consensus 300 ----------~~~~~~~~l~pG~-i~-----~L~pGe~i~~~~~~~-p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~ 362 (505) T protein:vir:96 300 ----------DQGEIVEEVEAGT-YQ-----LLPYGIRFKEHKIDH-PHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEG 362 (505) T ss_pred ----------ccCccccccCCce-ee-----ecCCCCeeeeeCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhccccc Confidence 0011112222222 11 122333444443322 236789999999999999999997765 78888 Q ss_pred CCccCHHHHHHHH-----------HHHHHHHHHHH-HHHHHhhhcCCccccccc--eEEEeccchhcchhHHHHHHHHHH Q lcl|NC_021537. 359 SNRANSKEQTREF-----------AKGIIEPEQAK-FSARLYKIIHQDALDVDE--WTIDFELRGAEQPEQDAKMAEQRV 424 (602) Q Consensus 359 ~~~sn~e~~~~~f-----------~~~~l~P~~~~-ie~~ln~~Ll~~~~~~~~--~~~~f~~~~~~~~~~d~~~~~~~~ 424 (602) .|||++.+.+..| ....++|+... ++.++-...++-...... ..+.|.... ....|..+.+++. 
T Consensus 363 ~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~--~~~iDP~Ke~~a~ 440 (505) T protein:vir:96 363 VNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRG--WDWVDPAKDSKAH 440 (505) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCC--ccccChHHHHHHH Confidence 9999987765544 34567776664 455554443321111111 234444433 3444777777888 Q ss_pred HHHHhCCcccHHHHHHHhCCCCCCCCcc----ccccccccccccccccCCCcCcccccccccccccc Q lcl|NC_021537. 425 RAMRLAGVGTVNEAREELDLAPFEDDRG----DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLEN 487 (602) Q Consensus 425 ~~~~~~G~~T~NE~R~~~Gl~p~~~g~~----d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (602) ..++++|+.|+-|+-++.|.+|-+--+. .............+... .....++..+.++.++ T Consensus 441 ~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~--~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 441 SESIKNRTRSRSSIIRAAGDDPEDVFDEIAWEEQLMRDKGVNPTPPEQE--SKDATTDEEDDSASDD 505 (505) T ss_pred HHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCC--CCCCCCCCCCCCCCCC Confidence 8999999999999988899887432110 00000000000100000 0000000001001111 No 111 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.94 E-value=4.4e-27 Score=165.09 Aligned_cols=465 Identities=11% Similarity=-0.029 Sum_probs=242.8 Q ss_pred CCCCcccccccchhhhc---ccCccccCCCCH------------HHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021537. 1 MSKAEETTQLDERHIAT---DVGRGIQPPYNP------------ETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPS 65 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~---~~~~~i~p~~~~------------~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~ 65 (602) +..+......+. 
-.++ ...+|..+.-++ ...|++++||+++..+|+.+.++|.|.||.+..+.+ T Consensus 18 ~~~~~~~~~~y~-gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~Gi~~~~~~~ 96 (553) T protein:vir:63 18 EQSASLGGGGLE-GASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQRDSIVGAQYRLNSMPD 96 (553) T ss_pred hhhhhhhccccc-ccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhccCCceeeeccc Confidence 111111111000 0011 111222222222 334688999999999999999999999999988765 Q ss_pred CCCc-ccchhhHHHHHHhhhccchhhhhh----ccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC-C--ceEEEEEe Q lcl|NC_021537. 66 ADEP-DEGGESYQTVRDFWYGSDSRWQIG----PEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD-G--TPVGLAHV 137 (602) Q Consensus 66 ~~~~-~~~~~~~~~~~~~~~~~~~~~~l~----~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~-G--~~~~L~~l 137 (602) .... ..+.+..+.+...++..+..|.-. .......+++++.+.+++.++..|++|+.+.+... | .+..|..| T Consensus 97 ~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~~~~~~~~lq~i 176 (553) T protein:vir:63 97 INVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATAEWDRAANRPYATCFQMV 176 (553) T ss_pred hhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEeeeccCCCCcccceEEEe Confidence 4321 123344455555555555555432 23456789999999999999999999998876543 2 25689999 Q ss_pred CcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEE Q lcl|NC_021537. 138 PAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIF 217 (602) Q Consensus 138 ~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH 217 (602) +|+.|....+... 
+ ..+..|..| -.......|.+...+|.+..........+ ........+++.+||| T Consensus 177 e~drl~~~~~~~~-----~---~~i~~GVE~---d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~-~r~~~~~~v~a~~vlH 244 (553) T protein:vir:63 177 STDRLSNPYQQLD-----T---PTLRRGVQY---DKRGRPQGYWIQVAHPGDLYQMAPDMYKW-KFVQQSKPWGRRQVIH 244 (553) T ss_pred chhhcCCCCCCCC-----C---CeeEeeeEE---CCCCceEEEEeeccCCCccccccccccce-eeeccccccChhHhee Confidence 9999875432210 0 111222211 11111222222233333321111111100 1112335688999999 Q ss_pred ecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh----------- Q lcl|NC_021537. 218 LPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL----------- 286 (602) Q Consensus 218 ~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~----------- 286 (602) +......++.+|+|++..++..+......++....--+=.+.-.++|+.+.+ ++...+.+......- T Consensus 245 ~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (553) T protein:vir:63 245 ILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELP--PEFIHSQMSGGSPNADMVGIFGKYMD 322 (553) T ss_pred cccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCC--hhhhhhhccccccccccccccccccc Confidence 9999889999999999999888776666666555555556677778775432 222222221111000 Q ss_pred --hcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hccccCCccC Q lcl|NC_021537. 287 --KGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTSTSNRAN 363 (602) Q Consensus 287 --~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~sn 363 (602) ....+...+..+++|. + ..+..+-+++.++... 
....|.++.+.+...||+.+|||.+.| |+.++.|||+ T Consensus 323 ~~~~~~~~~~~~~l~pG~-i-----~~L~pGe~i~~~~p~~-p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS 395 (553) T protein:vir:63 323 ALKAYVGGANNIQIDGAK-I-----PHLFPGTKLNLKPMGT-PGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSS 395 (553) T ss_pred ccccccccccceeecCce-e-----eecCCCCeeeecCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHH Confidence 0000001112222221 1 1122233333333221 235789999999999999999997755 8888899999 Q ss_pred HHHHHHHH-----------HHHHHHHHHHHH-HHHHhhhcCC-cccccc-----------ceEEEeccchhcchhHHHHH Q lcl|NC_021537. 364 SKEQTREF-----------AKGIIEPEQAKF-SARLYKIIHQ-DALDVD-----------EWTIDFELRGAEQPEQDAKM 419 (602) Q Consensus 364 ~e~~~~~f-----------~~~~l~P~~~~i-e~~ln~~Ll~-~~~~~~-----------~~~~~f~~~~~~~~~~d~~~ 419 (602) +.+.+..| ....++|+.+.| +.++-...++ +..... -..++|.... ....|..+ T Consensus 396 ~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~--~~~iDP~K 473 (553) T protein:vir:63 396 IQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGAS--QGQIDQLK 473 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCC--ccccChHH Confidence 87655544 334556655543 4444332221 111000 0123343333 33447777 Q ss_pred HHHHHHHHHhCCcccHHHHHHHhCCCCCCCCc-----c---ccccccccccccccccCCCcCcccccccccccc--cccc Q lcl|NC_021537. 420 AEQRVRAMRLAGVGTVNEAREELDLAPFEDDR-----G---DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPL--ENKI 489 (602) Q Consensus 420 ~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~-----~---d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 489 (602) .+++...++++|+.|+-|+.++.|.+|-+--. . +.+-......+......+. ...+....+++. ..+. T Consensus 474 e~~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 551 (553) T protein:vir:63 474 ETQAAVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGR--DAATGIAEDPAAAQTSQQ 551 (553) T ss_pred HHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCc--ccCCCCCCCCCCCCcccc Confidence 77888899999999999999988987743211 0 0000000111111111000 011111111100 0111 Q ss_pred cc Q lcl|NC_021537. 
490 GE 491 (602) Q Consensus 490 ~~ 491 (602) .| T Consensus 552 ~e 553 (553) T protein:vir:63 552 GE 553 (553) T ss_pred cC Confidence 11 No 112 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.94 E-value=8.4e-27 Score=163.55 Aligned_cols=476 Identities=13% Similarity=0.065 Sum_probs=245.6 Q ss_pred CCCCcccccccchh----h-h---cccCccccCCCCH------------HHHHHHHhhhHHHHHHHHHHHHhhcc-CceE Q lcl|NC_021537. 1 MSKAEETTQLDERH----I-A---TDVGRGIQPPYNP------------ETLAAFQELNETHQACIRKKSRYEAG-YGFE 59 (602) Q Consensus 1 ~~k~~~~~~~~~~~----~-~---~~~~~~i~p~~~~------------~~l~~~~~~~~~v~~cI~~ia~~ia~-~~~~ 59 (602) ++....-+-...|. | + +....+..++.+. ...|++++||+++..+|+.+.++|.| .++. T Consensus 11 ~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~~G~~ 90 (548) T protein:vir:95 11 LAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLEERVVGGSGIG 90 (548) T ss_pred cchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhccCccccc Confidence 21110000000000 0 0 0001112222222 34578899999999999999999998 5777 Q ss_pred EEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC-----C--ceE Q lcl|NC_021537. 60 IVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD-----G--TPV 132 (602) Q Consensus 60 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~-----G--~~~ 132 (602) |.++.... +.+..+.+...++..+..|..........|++++.+.+++.++..|++|+.+.+... | .+. 
T Consensus 91 i~p~~l~~----d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~g~~~~~ 166 (548) T protein:vir:95 91 VEPLPLRL----DGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTFATSVPF 166 (548) T ss_pred eeeeecCC----CHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccCCcccce Confidence 77665322 223334445555555666665556677889999999999999999999999886542 2 367 Q ss_pred EEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEech Q lcl|NC_021537. 133 GLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPA 212 (602) Q Consensus 133 ~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 212 (602) .|..|+|+.|....+... .....|..|- .......|.+-..+|.+. ...........+|+ T Consensus 167 ~lqliepd~l~~~~~~~~---------~~i~~GIE~D---~~Grp~aY~i~~~hPgd~--------~~~~~~~~~~rvpA 226 (548) T protein:vir:95 167 ALELLEPDYLPFSYNNLS---------KGIVQGIERD---TWRRKRAYHLLKDHPGNL--------QTLGGSLAVKRVEA 226 (548) T ss_pred EEEEechhhcCCCCCCCC---------CceeeeeEEC---CCCceEEEEEeecCCCcc--------cccccccceeeech Confidence 899999999864322211 1122222110 001111122222222221 11122334678999 Q ss_pred hHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhccccc Q lcl|NC_021537. 213 NELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYR 292 (602) Q Consensus 213 ~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~na 292 (602) ++|||+......++.+|+|.+..++..+......++....--+=.+...++|+.+.+.. ...+ .+.... 
T Consensus 227 ~~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~--~~~~---------~~~~~~ 295 (548) T protein:vir:95 227 ERIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDS--YTVE---------PGKDRK 295 (548) T ss_pred hHheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcc--ccCC---------CCcccc Confidence 99999999988999999999999988877766666665555555677778887654321 1000 011122 Q ss_pred CcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hccccCCccCHHHHHHHH Q lcl|NC_021537. 293 TAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTSTSNRANSKEQTREF 371 (602) Q Consensus 293 g~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~sn~e~~~~~f 371 (602) .....+++|+.+..+. .|-+++.++.. .....|.++.+.+...||+.+|||.+.| |+.+ +|||++.+.+..| T Consensus 296 ~~~~~~~pG~iv~~L~-----pGe~i~~~~p~-~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s-~nYSS~R~~l~e~ 368 (548) T protein:vir:95 296 NRTIPIAPGMVFDDLE-----PGEDVGMIESN-RPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYD-GTYSAQRQELVEG 368 (548) T ss_pred cccccccCCccccccC-----CCceeeecCCC-CCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc-hhHHHHHHHHHHH Confidence 2233344444332222 22333333322 1235789999999999999999997766 6654 6999987765544 Q ss_pred H-----------HHHHHHHHHH-HHHHHhhhcCC--ccccc-cceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHH Q lcl|NC_021537. 372 A-----------KGIIEPEQAK-FSARLYKIIHQ--DALDV-DEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVN 436 (602) Q Consensus 372 ~-----------~~~l~P~~~~-ie~~ln~~Ll~--~~~~~-~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~N 436 (602) + ...++|+... ++.++-...++ .+... ..+.++|... 
.....|..+.+++...++++|+.|.- T Consensus 369 ~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P--~~~~iDP~Kea~A~~~~i~~Gl~T~~ 446 (548) T protein:vir:95 369 WLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGP--VMPWINPMHEANAWELLVKAGFADEA 446 (548) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecC--CccccChHHHHHHHHHHHHcCCCCHH Confidence 3 3456665554 44444433322 11111 1223444333 33445777777888899999999999 Q ss_pred HHHHHhCCCCCCCCc--------cccccccccccccc----cccCCCcCcc---cccccccccccc-ccccccccccccc Q lcl|NC_021537. 437 EAREELDLAPFEDDR--------GDMTLSEFEAEFGA----DASDGDAEAM---LTRSKAAPPLEN-KIGERDSVDVDVS 500 (602) Q Consensus 437 E~R~~~Gl~p~~~g~--------~d~~~~~~~~~~~~----~~~~~~~~~~---~~~~~~~~~~~~-~~~~~~~~~~~~~ 500 (602) |+-++.|.+|-+--. .+.+-......... +...+.++.+ .......+..+. ....+..+.+..+ T Consensus 447 ~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (548) T protein:vir:95 447 EVARARGRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVP 526 (548) T ss_pred HHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhccccccccccchhHHhhccCCCCCcCC Confidence 998888987743110 00000000000000 0000000000 000000011110 0011233333332 Q ss_pred ccchhhhhcchhhhhhheec--ccccEE Q lcl|NC_021537. 501 KDPIEQTTFSSSNLDEGLYD--FGEREL 526 (602) Q Consensus 501 ~~~m~~~~v~ss~~~~~~yd--~~~~~l 526 (602) -+...- .|+ .+|-| +.+-.- T Consensus 527 ~~~~~~---~~~---~~~~~~~~~~~~~ 548 (548) T protein:vir:95 527 GPDFPN---ESN---NGGADGQPSNPDP 548 (548) T ss_pred CCCCCc---ccc---cCCCCCCCCCCCC Confidence 221110 111 11111 000000 No 113 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.92 E-value=2.3e-26 Score=161.12 Aligned_cols=441 Identities=11% Similarity=0.026 Sum_probs=222.7 Q ss_pred CCCCccc--ccccchhhhcccCc-cccC-CCCH------------HHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021537. 
1 MSKAEET--TQLDERHIATDVGR-GIQP-PYNP------------ETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHP 64 (602) Q Consensus 1 ~~k~~~~--~~~~~~~~~~~~~~-~i~p-~~~~------------~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~ 64 (602) .+.+... ......+-+...++ +-.+ ..++ ...|++++||+++..||+.+.+++.|.||.+..+. T Consensus 9 ~a~~~~~~~~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG~Gi~p~~~~ 88 (495) T protein:vir:10 9 QSLASGLLVPVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVGNGLTPRWRM 88 (495) T ss_pred cccchhhhhHHHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCCCcccccCC Confidence 1111000 00000000001111 1011 1121 33568899999999999999999999998776654 Q ss_pred CCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeC--CCC--ceEEEEEeCcc Q lcl|NC_021537. 65 SADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVE--GDG--TPVGLAHVPAA 140 (602) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~--~~G--~~~~L~~l~p~ 140 (602) ++ +.+...++..+..|.-.......+++.++.+.+++.++..|++|+.+.+. ..| .+..|..|+|+ T Consensus 89 ~~----------~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqliepd 158 (495) T protein:vir:10 89 KE----------QELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQIIEPD 158 (495) T ss_pred ch----------HHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEechh Confidence 32 12333344444455444455667899999999999999999999987754 333 47899999999 Q ss_pred cccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecC Q lcl|NC_021537. 141 TVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPN 220 (602) Q Consensus 141 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~ 220 (602) .|....+.... ..+ ..+..|..|-. .|+ ...|.+...+|. ++.+.........+|+++|||+.. 
T Consensus 159 ~l~~~~~~~~~--~~g---~~i~~GIe~d~--~Gr-~vaY~i~~~hpg--------d~~~~~~~~~~~rvpA~~vlH~f~ 222 (495) T protein:vir:10 159 MLASDIPDETL--PSG---GYVKGGIRFSN--GGK-RKAYCFYRNHPA--------ESSLIGDPVDTVWIKAEHVLHVTV 222 (495) T ss_pred hcCCCCCCCCC--CCC---CEEEeceEECC--CCc-eEEEEEeecCCC--------cccccccccceeeechhheEeccc Confidence 98632211000 000 01112221110 011 111111111221 122222233557799999999974 Q ss_pred CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhh----cccccCcce Q lcl|NC_021537. 221 PSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLK----GSRYRTAIL 296 (602) Q Consensus 221 ~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~----g~~nag~~~ 296 (602) ...++.+|+|.+..+. .+......++....--+-.+...++|+.+.+. +......-..-.... ..-..|.+. T Consensus 223 -~r~gQ~RGis~la~i~-~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~--~~~~~~~~~~~~~~~~~~~~~l~pG~i~ 298 (495) T protein:vir:10 223 -LTVRSDAGAPWFQLLL-RLNELDQYEDAELVRKKTAALFAAFIQEATAD--STGGPTIGQPKRSKGGKRITGLNPGTLQ 298 (495) T ss_pred -cCCCcccCcchhHHHH-HHHHhhHHHHHHHHHHHHhhhheeeeecCCCc--cccccccCccccccCcccceecCCceee Confidence 5678999999765443 35444444444444444456667777754321 111100000000000 001123333 Q ss_pred eccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hccccCCccCHHHHHHHHH--- Q lcl|NC_021537. 297 EVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTSTSNRANSKEQTREFA--- 372 (602) Q Consensus 297 ~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~sn~e~~~~~f~--- 372 (602) .+.+|. +++.++... ....|.++.+.+...||+.+|||.+.| |+.+++|||++.+.+..|. 
T Consensus 299 ~L~pGe--------------~i~~~~p~~-p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~ 363 (495) T protein:vir:10 299 YLQPGQ--------------EVKFSNPAD-VGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLC 363 (495) T ss_pred ecCCCC--------------eeeeeCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHH Confidence 333333 333333221 235688999999999999999998866 8888999999876554443 Q ss_pred ---------HHHHHHHHHH-HHHHHhhhcC--Cccccccc--eEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHH Q lcl|NC_021537. 373 ---------KGIIEPEQAK-FSARLYKIIH--QDALDVDE--WTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEA 438 (602) Q Consensus 373 ---------~~~l~P~~~~-ie~~ln~~Ll--~~~~~~~~--~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~ 438 (602) .+.++|+.+. ++.++-...+ +.+..... ..++|.... ....|..+.+++...++++|++|+-|+ T Consensus 364 ~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~--~~~vDP~Ke~~A~~~~i~~G~~s~~~~ 441 (495) T protein:vir:10 364 QQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPR--WEEVDPLKKHLADLGDVRAGFAPISDK 441 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCC--ccccChHHHHHHHHHHHHcCCCCHHHH Confidence 3345565554 3444433222 22211111 123443333 334477777778889999999999999 Q ss_pred HHHhCCCCCCCCcc----ccccccccccccccccCCCcCccccccccccccccc Q lcl|NC_021537. 439 REELDLAPFEDDRG----DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENK 488 (602) Q Consensus 439 R~~~Gl~p~~~g~~----d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (602) -++.|++|-+--.. 
.............++......+.......++..+++ T Consensus 442 ~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 442 QAERGYDMEELFDMISDANQLIDEYDLRLDSDPRYVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred HHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCcCCCccCCCCCCCCCCCCCC Confidence 98999987431110 000000001111111110000110001110000011 No 114 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.90 E-value=6e-23 Score=142.43 Aligned_cols=421 Identities=9% Similarity=0.032 Sum_probs=207.6 Q ss_pred CCCCcc-----cccccc---hhhhcccC---------ccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_021537. 1 MSKAEE-----TTQLDE---RHIATDVG---------RGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAH 63 (602) Q Consensus 1 ~~k~~~-----~~~~~~---~~~~~~~~---------~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~ 63 (602) +...+. ...... ..++...+ -+..+.+...+|.++++.|+.++++|+++++++.+-+|+|.-. T Consensus 58 ~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~ 137 (537) T protein:vir:10 58 MPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIGHQMCALIATHWLVNKACSQMPRDAMRKGYKIISD 137 (537) T ss_pred cccccccccchhccccccchhhhhhhccccccchhhhhccccCCccHHHHHHHHhCchhhhhhhhhhHHhhcCCceeecC Confidence 111110 111111 11211111 1223345556788888999999999999999999999998643 Q ss_pred cCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeC-CCC------------- Q lcl|NC_021537. 64 PSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVE-GDG------------- 129 (602) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~-~~G------------- 129 (602) ++++ .+.+..+.+...+.+.+ .+..|.. .++...++|.+++.+.-. 
.++ T Consensus 138 ~~~~---~~~~~~~~l~~~~~~l~-------------~~~~l~~-a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~ 200 (537) T protein:vir:10 138 DGNE---LDPKDAKFIDRYDRAFN-------------IKKHAIQ-FVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVM 200 (537) T ss_pred Cccc---ccHHHHHHHHHHHHHhh-------------HHHHHHH-HHHhcccccceEEEEeecCcCCccccccccccccc Confidence 3222 22233344443333221 1233444 444445689998877542 222 Q ss_pred --ceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCcee Q lcl|NC_021537. 130 --TPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGEL 207 (602) Q Consensus 130 --~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 207 (602) ....|..|+|..+.+.... +....|..+.+-.+..+.+ . . T Consensus 201 kg~~k~l~vidp~~~~~~~~~---------------------------------~~~~dp~sp~fg~P~~y~v---~--g 242 (537) T protein:vir:10 201 PGAYKGIVQIDPYWCAPLLDA---------------------------------QASSNPVSMHFYEPTYWLI---N--G 242 (537) T ss_pred ccceeEEEEechhhcccccch---------------------------------hhhccCCccccCCceeeee---c--C Confidence 2234444454444321100 0000001111111111111 1 1 Q ss_pred EEechhHEEEecCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHH Q lcl|NC_021537. 208 KNGPANELIFLPNPS------PLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRN 281 (602) Q Consensus 208 ~~~~~~eviH~r~~~------~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~ 281 (602) ..+.++.||||.... +..+++|+|.++.+...|.....+.......+........-+.......++++....-+ T Consensus 243 ~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~r~~ 322 (537) T protein:vir:10 243 KKYHRSHLAIYINDEVVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQFDETMS 322 (537) T ss_pred eEecceeEEEecCCCCchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHHHHHHHH Confidence 356788899986442 33457899999999999988888888887777776655333322222234444333333 Q ss_pred HHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHH-hhccccCC Q lcl|NC_021537. 
282 LMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVL-INVTSTSN 360 (602) Q Consensus 282 ~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~ 360 (602) .++..+ +|.+.+++-..+.++.... .+++. +.+++....+.||.+.+||... +|....+- T Consensus 323 ~~~~~r--~n~g~~~id~e~e~~e~~~----------~~lsg-------l~~~l~~~~~~iAa~~~IP~t~L~G~sp~Gl 383 (537) T protein:vir:10 323 WWTATR--DNYQVRVVDKDNEDVVQID----------TTLND-------LDKVIMNQYQLVCAIARTPAPKMLGTVPTGF 383 (537) T ss_pred HHHhhc--CCcceeEecCCCceeEEEe----------ccCCC-------HHHHHHHHHHHHHhhhCCCceeeccCCcccc Confidence 344333 3444444332234333322 12221 2356667778899999999885 46554455 Q ss_pred ccCHHHHHHHHHHH------HHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHH-----HHHHHHHHHHHHh Q lcl|NC_021537. 361 RANSKEQTREFAKG------IIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQD-----AKMAEQRVRAMRL 429 (602) Q Consensus 361 ~sn~e~~~~~f~~~------~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d-----~~~~~~~~~~~~~ 429 (602) .|+.+.....|+.. .|.|.++.+.+.+-+..+.. ..+|.++|+ .+..+... .+..++++++++. T Consensus 384 natGe~D~~~yyd~I~~~Qe~l~p~l~~l~~ll~~~~~~~---~~~~~i~f~--pL~~~s~kEkAei~~~~a~a~~~~~~ 458 (537) T protein:vir:10 384 NSTGDYEEASYHEECESTQDDMRPLIDRHHQLVCRSHLRK---RIRVKVEFP--PMDAPKESERADTFLKKMQAAKLAFE 458 (537) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---CcceEEEeC--CCCCCCHHHHHHHHHHHHHHHHHHHH Confidence 56667666666643 47888888877776655432 345666665 44333222 2345578899999 Q ss_pred CCcccHHHHHHHhCCCCCCCCccccccccccccc---cccccCCCcC---cccccc--cccccccccccccccccccccc Q lcl|NC_021537. 430 AGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEF---GADASDGDAE---AMLTRS--KAAPPLENKIGERDSVDVDVSK 501 (602) Q Consensus 430 ~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~---~~~~~~~~~~---~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 501 (602) +|++++||+|+.++.+|..+.. ........... ..+....+.+ .+.... ....+......+...+. 
.+.+ T Consensus 459 ~G~i~~~Evr~~L~~~~~~g~~-~l~~~~~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-a~~~ 536 (537) T protein:vir:10 459 MGAVDGVDVNEYLRMDPTLGFT-SITPAMRPTDAEDIDVDDEGKPVRIIEDQPAPSEMFGATSSGESANDPRDSG-AAFE 536 (537) T ss_pred cCCCCHHHHHHHHhccCccccc-cccCCCChhhhhcccCCccCCcCCCCCCCCCccccCCCCccccccCCCccCc-cccC Confidence 9999999999999887654321 11100000000 0000000000 000000 00000000000000000 0001 Q ss_pred c Q lcl|NC_021537. 502 D 502 (602) Q Consensus 502 ~ 502 (602) + T Consensus 537 ~ 537 (537) T protein:vir:10 537 D 537 (537) T ss_pred C Confidence 0 No 115 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.89 E-value=7.8e-23 Score=141.78 Aligned_cols=435 Identities=9% Similarity=0.021 Sum_probs=210.7 Q ss_pred CCCCcc------cccccchh---hh-----------cccCccccC-CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_021537. 1 MSKAEE------TTQLDERH---IA-----------TDVGRGIQP-PYNPETLAAFQELNETHQACIRKKSRYEAGYGFE 59 (602) Q Consensus 1 ~~k~~~------~~~~~~~~---~~-----------~~~~~~i~p-~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~ 59 (602) ++.+.. ........ +. .....+..+ .++..+|.++++.|+.++.+|+++++++..-+|+ T Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~r~~Vd~~aed~~r~~~~ 114 (532) T protein:vir:94 35 LATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSWPGFPTLALLAQLPEYRTMHETPADECVRAWGK 114 (532) T ss_pred hhhhhhhcccccccccccccccccccccccCcccccccccccccccccchHHHHHHHHcCchhhhhhccchHHHhhCCce Confidence 111000 00000000 10 000011112 2455677788888999999999999999999999 Q ss_pred EEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC---------- Q lcl|NC_021537. 60 IVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG---------- 129 (602) Q Consensus 60 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G---------- 129 (602) |.-..+.+. ..+....+...+.+.. 
..+.+...++...++|.+++.+.....| T Consensus 115 i~~~~~~~~---~~~~~~~i~~~~~~l~--------------v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~ 177 (532) T protein:vir:94 115 ITCSSKDEL---AADKATRITQKLEQYN--------------VRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLL 177 (532) T ss_pred EeeCCcccc---chHHHHHHHHHHHhhh--------------HHHHHHHHHHhhhcccceEEEEEeccCCcccccccccc Confidence 965333222 1223333333332221 1234444555567899999887654333 Q ss_pred ---------ceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceE Q lcl|NC_021537. 130 ---------TPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEV 200 (602) Q Consensus 130 ---------~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 200 (602) .+..|.+++|..|.+...... .+.....--+.++++. T Consensus 178 l~~~~I~~g~~~~l~vld~~~v~p~~~~~~------dp~sp~fg~P~~y~v~---------------------------- 223 (532) T protein:vir:94 178 LSPSFVQRGCLIGFATIEPMWLSPNAYNAT------DPTLPSFYKPDSWIAT---------------------------- 223 (532) T ss_pred ccccccccceeeEEEeechheecccccccc------cccccccCCceeEEEc---------------------------- Confidence 234555566555544321100 0000000001111111 Q ss_pred EecCceeEEechhHEEEecCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCH Q lcl|NC_021537. 201 ASDAGELKNGPANELIFLPNPS------PLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSE 273 (602) Q Consensus 201 ~~~~~~~~~~~~~eviH~r~~~------~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~ 273 (602) . ...+.++.||||.... +...++|+|.+..++..|.....+............... +++. ...++. T Consensus 224 ---~--g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v--~k~~~a~~ls~ 296 (532) T protein:vir:94 224 ---S--GKKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMTN--LATDMAQLLAP 296 (532) T ss_pred ---c--CeeeccceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCce--eeechHHhhcc Confidence 1 1357788899996442 234568999999999999888888777777655544332 2332 123454 Q ss_pred HHHHHHHHHHHHhhc-ccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHH Q lcl|NC_021537. 
274 DSKEDLRNLMDNLKG-SRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVL 352 (602) Q Consensus 274 ~~~~~l~~~~~~~~g-~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~ 352 (602) +..+.+.+.++.... .+|.+.+++.....++..... +++ . +.+......+.||++.+||..+ T Consensus 297 ~~~~~~~~r~~~~~~~~~n~g~~~id~~~e~~e~~~~----------~ls--g-----l~~~l~~~~~~iAaa~~IP~t~ 359 (532) T protein:vir:94 297 GGAQSLDARLQLFNLYRDNRNIGALDKGTEEIQQTNT----------PLS--G-----LDSLQAQSQEQMAAVSHIPLVK 359 (532) T ss_pred hhHHHHHHHHHHHHhhcCCccceEEcCCCceeEEEec----------ccC--C-----HHHHHHHHHHHHHhHhCCCeee Confidence 555666666654332 234343333212222222211 111 1 2345566778999999999885 Q ss_pred h-hccccCCccCHHHHHHHHHH-------HHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHH---HHHH Q lcl|NC_021537. 353 I-NVTSTSNRANSKEQTREFAK-------GIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDA---KMAE 421 (602) Q Consensus 353 l-g~~~~~~~sn~e~~~~~f~~-------~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~---~~~~ 421 (602) | |....+-.|+.+.....|+. ..|.|+++.+-+.|-...+.. ...+++|+|+--..+...+.+ +..+ T Consensus 360 LfG~sp~GlnstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~--~~~d~~~~f~pL~~~s~kEkAei~~~~a 437 (532) T protein:vir:94 360 LLGITPNGLNASSDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQ--IDPGLAWEWSPLMELDDKELAEVRQLNA 437 (532) T ss_pred eecCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CCCCceEEeCCCCCCCHHHHHHHHHHHH Confidence 5 65433333555666666665 347788888777776544322 234677887621111111111 3456 Q ss_pred HHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccccccccccc---cccc-CCCcCcccccccccccccccccccccccc Q lcl|NC_021537. 422 QRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFG---ADAS-DGDAEAMLTRSKAAPPLENKIGERDSVDV 497 (602) Q Consensus 422 ~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~---~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (602) +++++++.+|++++||+|++++.+|..+..............- .... ....+....+....++.+....+...... 
T Consensus 438 ~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 517 (532) T protein:vir:94 438 STDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSEDDQTDNQPD 517 (532) T ss_pred HHHHHHHhcCCCCHHHHHHHHhcCCccccccccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCCCCCCCCccC Confidence 7888999999999999999999988765433221111100000 0000 00000000000000111111111111001 Q ss_pred cccccchhhhhcchh Q lcl|NC_021537. 498 DVSKDPIEQTTFSSS 512 (602) Q Consensus 498 ~~~~~~m~~~~v~ss 512 (602) ....+.-...||.-. T Consensus 518 ~~~~~~~~~~~~~~~ 532 (532) T protein:vir:94 518 AQADPAQNDQPVGNR 532 (532) T ss_pred CCccccccCCCcCCC Confidence 111111111222111 No 116 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=99.87 E-value=1.4e-21 Score=134.90 Aligned_cols=433 Identities=13% Similarity=0.041 Sum_probs=231.4 Q ss_pred CCCCccccc-ccchhhhc----ccCccccCC---------CCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021537. 1 MSKAEETTQ-LDERHIAT----DVGRGIQPP---------YNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSA 66 (602) Q Consensus 1 ~~k~~~~~~-~~~~~~~~----~~~~~i~p~---------~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~ 66 (602) +++-+.++. ....+.++ ..++.++.. -.....+++.+..+.|.+|++.+...|.+++|+|.+-.+. T Consensus 2 ~~~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~w~v~p~~~~ 81 (469) T protein:vir:10 2 TERVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTPWRIRANGAS 81 (469) T ss_pred CCcccCCCCccchhhhhhcccccchhhccccccccccccccchHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCCC Confidence 222222111 11122221 112222211 1122345666678999999999999999999999864321 Q ss_pred CCcccchhhHHHHHHhhhccchhh---hhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC-----Cc--eEEEEE Q lcl|NC_021537. 
67 DEPDEGGESYQTVRDFWYGSDSRW---QIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD-----GT--PVGLAH 136 (602) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~---~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~-----G~--~~~L~~ 136 (602) .+..+.+...+....... ....+-....+|.+++..++.+.+.+|.++.|+++... |. +..|.+ T Consensus 82 ------~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~~l~~ 155 (469) T protein:vir:10 82 ------DEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLRKLAP 155 (469) T ss_pred ------HHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeeeeeee Confidence 111121222111111000 00111122457888998888888899999999998643 32 455666 Q ss_pred eCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEE Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELI 216 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~evi 216 (602) .|+.++.- |...-++......+... .....+..+..++....+|+...| T Consensus 156 rp~~~i~~-----------------------~~~~~~~~l~~~~~~~~--------~~~~~~~~~~~~~~~~~lp~~k~i 204 (469) T protein:vir:10 156 RPQWTISK-----------------------FNVAPDGGLESIEQIAP--------PARTRGSLYVANIAPPEIPVNRLV 204 (469) T ss_pred cCccccee-----------------------eeeccCCceeeeeecCc--------ccccccccccCCCCccccccCcEE Confidence 66554421 00000000000000000 000011111222334567777777 Q ss_pred EecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcce Q lcl|NC_021537. 
217 FLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAIL 296 (602) Q Consensus 217 H~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~ 296 (602) ++++....+.++|.|.+..|...........++...|...-++|--+.+.+.+ .++++++.+.+...++.++.+++.+ T Consensus 205 ~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~-a~~~ek~~l~~a~~~~~~g~~a~~i- 282 (469) T protein:vir:10 205 VYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSA-TDEDEVRKMAALARSVRGGINAGVG- 282 (469) T ss_pred EEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCC-CCHHHHHHHHHHHHHHhcCCceEEE- Confidence 77766667778999999999999999999999999999999999888877643 5788888888888888766565544 Q ss_pred eccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHH Q lcl|NC_021537. 297 EVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGII 376 (602) Q Consensus 297 ~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l 376 (602) ++.|++...+ .-+ .+ ...|.++.++..++|+.+.--. .+.....+++++..+..... ..+.+ T Consensus 283 -ip~~~~ie~~------------ea~-g~--~~~~~~li~~~d~~Isk~iLG~-tlTs~~~gGS~a~~~vh~ev-~~d~~ 344 (469) T protein:vir:10 283 -LAQGQILELL------------GVS-GN--LPDIRRAIEGHDRSIALSGLAH-FLNLDGKGGSYALASVLEDP-FTQAV 344 (469) T ss_pred -ccCCceEEEe------------ecC-CC--chHHHHHHHHHHHHHHHHHhcc-cccccCccchhhHHHHHHHH-HHHHH Confidence 3444443322 211 11 2357888888889998877432 22222234556555544443 46678 Q ss_pred HHHHHHHHHHHhhhcCCccccc----cc--eEEEeccchhcchhHHHHHHHHHHHHHHhCCcc-----cHHHHHHHhCCC Q lcl|NC_021537. 377 EPEQAKFSARLYKIIHQDALDV----DE--WTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVG-----TVNEAREELDLA 445 (602) Q Consensus 377 ~P~~~~ie~~ln~~Ll~~~~~~----~~--~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~-----T~NE~R~~~Gl~ 445 (602) +-.++.++..||+.|+.+.-.. .. .+|+|+. .. 
.+.+..+++++++++.|++ +.+.+|+.+|+| T Consensus 345 ~sDa~~i~~tln~~li~~l~~lN~g~~~~~P~~~~~~--~e---~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip 419 (469) T protein:vir:10 345 HAYATSICRIANQHIIEDLVDINFGVDTPAPVLTFDP--IG---SRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLP 419 (469) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecC--CC---CcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCCC Confidence 8899999999998776542211 11 2455543 22 2345567889999999995 456789999998 Q ss_pred CCCCCccccccccccccccccccCCCcCcccccccccccccccccccccccccccccchhhhhcchhhhhhheecc Q lcl|NC_021537. 446 PFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDF 521 (602) Q Consensus 446 p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~ 521 (602) +-.+++.............+......... ....+.......+. + ..+ -|+ T Consensus 420 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~--~-----~~l----~da 469 (469) T protein:vir:10 420 SELNDTPSAEPEEPAAVPNQSAAPARTRS---------------SGNADARARAPKAD--Q-----GVL----FDA 469 (469) T ss_pred CCCCCcccccchhcccCCCCCccccccCC---------------CCCcccccccCCCh--H-----Hhh----ccC Confidence 66554321110001111111100000000 00000000000000 0 000 011 No 117 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.85 E-value=2.4e-20 Score=128.12 Aligned_cols=406 Identities=11% Similarity=0.060 Sum_probs=206.0 Q ss_pred CCCCcccccccchh-------hhcccC-----------ccccC-CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_021537. 1 MSKAEETTQLDERH-------IATDVG-----------RGIQP-PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIV 61 (602) Q Consensus 1 ~~k~~~~~~~~~~~-------~~~~~~-----------~~i~p-~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~ 61 (602) |.+-.+.++...+. |-...| .+..| .++...|.++++.|+.++++|+++++++.+-||+|. 
T Consensus 1 ~~~~~~a~~~~~~~~a~~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~r~g~~i~ 80 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDMVRAGWSLK 80 (461) T ss_pred CccchhhhhhhhhhhhhhhhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHhhcCCeeee Confidence 44433333222210 111111 12222 258899999999999999999999999999999885 Q ss_pred EecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC-CC----------- Q lcl|NC_021537. 62 AHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG-DG----------- 129 (602) Q Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~-~G----------- 129 (602) -. +.+..+.+..++.+.+ ..+-+...++...++|.+++.+.... +. T Consensus 81 ~~--------~~~~~~~~~~~~~~l~--------------~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~ 138 (461) T protein:vir:80 81 TD--------NKEMKKNIESKWRKLK--------------TKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDP 138 (461) T ss_pred cC--------CHHHHHHHHHHHHHhh--------------HHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCccc Confidence 21 1223333333332211 12344455566778999999876422 11 Q ss_pred -ceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEE-ecCcee Q lcl|NC_021537. 130 -TPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVA-SDAGEL 207 (602) Q Consensus 130 -~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-~~~~~~ 207 (602) .+.+|.+|.+....... .......+.....--+.++++....... .+.+. ..+... T Consensus 139 ~~~~~~~~l~~~~~~~i~----~~~~~~dp~sp~fg~P~~y~i~~~~~~~------------------~~~~~~~~~~~~ 196 (461) T protein:vir:80 139 KTIKSIPYINTFNTQKVT----QLYLNQDMFSEHFGEVEFFEVNRVSQLG------------------EEILSGTTASTS 196 (461) T ss_pred ccccceeEEEeccccccc----hhhhcccCcCcccccceEEEEecccccc------------------ccccccccCccc Confidence 11122222221111000 0000000111111112222222111000 00001 123345 Q ss_pred EEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc-cCCHHHHHHHHHHHHHh Q lcl|NC_021537. 
208 KNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG-TLSEDSKEDLRNLMDNL 286 (602) Q Consensus 208 ~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~~~~~~l~~~~~~~ 286 (602) +.+.++.||||.+....+..+|.|.++.+...|.....+......+..+...+ ++++++. .+..+....+.+.++.+ T Consensus 197 ~~iH~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~~~~ 274 (461) T protein:vir:80 197 EQIHRSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFK--VYKTDDIDALNKDDKANLTAMLDFM 274 (461) T ss_pred eEEccccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCC--ceecchHHhhhchHHHHHHHHHHHh Confidence 78899999999988878888999999999999988888888887777665544 4454431 22333334455555544 Q ss_pred hcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hccccCCccCHH Q lcl|NC_021537. 287 KGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTSTSNRANSK 365 (602) Q Consensus 287 ~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~sn~e 365 (602) .+ |.+ ++++....++.... .+.+ .+.+..+.....||++-+||...| |.. .+..|+.+ T Consensus 275 ~~--~~g-~~~~d~~e~~e~~~------------~~ls-----gl~~~l~~~~~~iaa~s~iP~t~L~G~s-~g~~asge 333 (461) T protein:vir:80 275 FR--TEA-LAIIKGDEQLTKES------------TNVS-----GMKDLLDYGWDYLAGAVRMPKTVLKGQE-AGTLTGAQ 333 (461) T ss_pred cC--Cce-EEEEcCCcceEEEe------------cCcC-----CHHHHHHHHHHHHhhhhcCCeeeeeccc-CCccccch Confidence 43 223 33444333332221 1111 234666778889999999999766 554 46667777 Q ss_pred HHHHHHHHH-------HHHHHHHHHHHHHhhhcCCccc----cccceEEEeccchhcchhH--H---HHHHHHHHHHHHh Q lcl|NC_021537. 366 EQTREFAKG-------IIEPEQAKFSARLYKIIHQDAL----DVDEWTIDFELRGAEQPEQ--D---AKMAEQRVRAMRL 429 (602) Q Consensus 366 ~~~~~f~~~-------~l~P~~~~ie~~ln~~Ll~~~~----~~~~~~~~f~~~~~~~~~~--d---~~~~~~~~~~~~~ 429 (602) .....|+.. .++|+++.+-..|-...+.... ...+|.++|+ .+..+.. . .+..++++++++. 
T Consensus 334 ~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~--~L~~~s~kekAe~~~~~a~a~~~~~~ 411 (461) T protein:vir:80 334 YDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFN--PLWNLDSKTDAEVRKLTAEADQIYIV 411 (461) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeC--CCCCCCHHHHHHHHHHHHHHHHHHHh Confidence 777777553 3566666665555433221111 1124566665 3332222 2 2345678999999 Q ss_pred CCcccHHHHHHHh-C---CCCCCCCccccccccccccccccccCCCcCccccccccccccccccc Q lcl|NC_021537. 430 AGVGTVNEAREEL-D---LAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIG 490 (602) Q Consensus 430 ~G~~T~NE~R~~~-G---l~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (602) +|+++++|+|+.+ + ++|... + .+.++...+.... ....+..++... T Consensus 412 ~g~is~~e~r~~l~~~~~~~~~~~----------~--~~~~~~~~~~~~~---~~~~~~~e~~~g 461 (461) T protein:vir:80 412 NGVLDPDEVKETRFGRFGLENSSK----------F--SGDSAEIDKLAKL---VYDAYAKKNADG 461 (461) T ss_pred cCCCCHHHHHHHHHHhcCCCCCcc----------C--CCCCchhhhhhhh---ccccccccCCCC Confidence 9999999999865 3 322110 0 0000000000000 000000000000 No 118 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.84 E-value=1.2e-20 Score=129.89 Aligned_cols=386 Identities=12% Similarity=0.107 Sum_probs=194.8 Q ss_pred CCCCcccccccc---hhhhcccCc-----cccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccc Q lcl|NC_021537. 1 MSKAEETTQLDE---RHIATDVGR-----GIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEG 72 (602) Q Consensus 1 ~~k~~~~~~~~~---~~~~~~~~~-----~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~ 72 (602) |.|...+....+ ..|....+. +..+.++...|.++++.|+.++++|+++|++..+-+|+|.-.. 
T Consensus 5 m~~~~~~~~~~D~~~~~~~~~~g~~~~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~-------- 76 (435) T protein:vir:79 5 MSDKVKAITKEDGYNEIFGSKDGTFRPNAFYMQRAAFKALSQFYEEDGMARRIVDVIPEEMVTPGFKVDGVK-------- 76 (435) T ss_pred cccccccchhhcchhhhhcccccccccCcccCCcCCHHHHHHHHhcCchhhhhhccchHHhhcCCceecCCC-------- Confidence 444432211111 113322221 1234467889999999999999999999999999999874210 Q ss_pred hhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEee-CC---------CCceEEEEEeCcccc Q lcl|NC_021537. 73 GESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILV-EG---------DGTPVGLAHVPAATV 142 (602) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r-~~---------~G~~~~L~~l~p~~v 142 (602) ..+++...+.+.. ..+.+...++...++|.+++.+.. +. +|....|.++++..| T Consensus 77 --~~~~~~~~~~~l~--------------~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i 140 (435) T protein:vir:79 77 --NEKSFKSRWDELR--------------LNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQI 140 (435) T ss_pred --hHHHHHHHHHHhh--------------HHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhc Confidence 1122332222211 123444455556788998887764 22 233445555665555 Q ss_pred cccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCc-eeEEechhHEEEecCC Q lcl|NC_021537. 143 RVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAG-ELKNGPANELIFLPNP 221 (602) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~eviH~r~~ 221 (602) .+..... .|..+.+-.+..+++...++ ..+.+.++.||||... T Consensus 141 ~~~~~~~------------------------------------dp~sp~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~ 184 (435) T protein:vir:79 141 TIHERET------------------------------------NARSVRYGEPKLYKISPGGDIPEFFVHYSRICIIDGE 184 (435) T ss_pred cchhhcc------------------------------------CCcccccCcceEEEEecCCCCCceEEcceeEEEecCC Confidence 4321100 01111111222223322222 3567888999999632 Q ss_pred ------CCCCCcccccHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecc--ccC-CHHHHHHHHHHHHHhhcc-c Q lcl|NC_021537. 
222 ------SPLALYYGVPDW-VAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTG--GTL-SEDSKEDLRNLMDNLKGS-R 290 (602) Q Consensus 222 ------~~~~~~~G~spl-~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~~~-~~~~~~~l~~~~~~~~g~-~ 290 (602) .+.+.++|.|++ +.+...|.....+.......+....... +++++ ..+ +.+....+.+.+...... . T Consensus 185 ~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v--~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~ 262 (435) T protein:vir:79 185 RVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAV--WKARDLALMCDDEEGRYAARLRLAQVDDESG 262 (435) T ss_pred cchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcc--ccchhHHHhhcCccchHHHHHHHHHHHHhcC Confidence 345678999998 5788888888888888777666554432 33332 111 111222222222222111 1 Q ss_pred ccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hccccCCccCHHHHHH Q lcl|NC_021537. 291 YRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTSTSNRANSKEQTR 369 (602) Q Consensus 291 nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~sn~e~~~~ 369 (602) +-+.+++...+.++.... .+.+ .+.+......++||++.+||...| |....+-.|+.+.... T Consensus 263 ~~~~~~i~~~~e~~e~~~------------~~ls-----gl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~ 325 (435) T protein:vir:79 263 VGKAIGIDATDEEYEVLN------------SDVS-----GVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALE 325 (435) T ss_pred CCCceeEecCCcceEEEe------------cccC-----CHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHH Confidence 223334433322333221 1111 124566777889999999998766 5544333455666666 Q ss_pred HHHHH-------HHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHH---HHHHHHHHHHHHhCCcccHHHHH Q lcl|NC_021537. 370 EFAKG-------IIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQD---AKMAEQRVRAMRLAGVGTVNEAR 439 (602) Q Consensus 370 ~f~~~-------~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d---~~~~~~~~~~~~~~G~~T~NE~R 439 (602) .|+.. .++|.++.+-..+ ....+|+++|+--..+.-.+. 
.+..++++++++.+|+++++|+| T Consensus 326 ~yyd~i~~~Qe~~l~p~l~~l~~li--------~~s~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r 397 (435) T protein:vir:79 326 TFYKLIDRKRVEDYKPILEFLLPFM--------ISETEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKETR 397 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh--------hcCCCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHH Confidence 77653 2344433332221 122577888763222222111 24567788999999999999999 Q ss_pred HHh-CCCC---CCCCccccccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 440 EEL-DLAP---FEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 440 ~~~-Gl~p---~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) +.+ ...+ +.++..+.+ .++...+ .++..+...++ T Consensus 398 ~~L~~~~~~~~~~~~~~~~~---------~~~~d~~---------~~~~~e~g~~~ 435 (435) T protein:vir:79 398 DTLRSICPDLKIMDNDNIEL---------PEPEDLD---------PEPGQEGGLNK 435 (435) T ss_pred HHHHHhccccCCCCcccccC---------CccccCC---------CCCCCCCCCCC Confidence 877 2221 111110100 0000000 00000111111 No 119 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.84 E-value=2e-20 Score=128.57 Aligned_cols=386 Identities=14% Similarity=0.137 Sum_probs=197.4 Q ss_pred CCCCcccccccchhhhcc-cCcccc--CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATD-VGRGIQ--PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQ 77 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~-~~~~i~--p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~ 77 (602) |-|++--... -.+.. .+.... ...++..|.++++.|+.++++|+++|+++.+-+|+|.-. 
+ ..+ T Consensus 1 ~~~~D~~~n~---~~gg~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~---~-------~~~ 67 (422) T protein:vir:10 1 MVKTDSYANI---FLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGI---D-------DEP 67 (422) T ss_pred CccchhhHHH---HcCCCCCccccCcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCccccCC---C-------HHH Confidence 6555532211 11111 011111 125788999999999999999999999999999998421 0 111 Q ss_pred HHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEee-C---------CCCceEEEEEeCccccccccc Q lcl|NC_021537. 78 TVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILV-E---------GDGTPVGLAHVPAATVRVRKT 147 (602) Q Consensus 78 ~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r-~---------~~G~~~~L~~l~p~~v~~~~~ 147 (602) ++...+.+. ...+.+...++...++|.+++.+.. + ..|....|.++++..|.+... T Consensus 68 ~~~~~~~~l--------------~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~ 133 (422) T protein:vir:10 68 AFWSRWDDL--------------EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTR 133 (422) T ss_pred HHHHHHHHh--------------hHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhc Confidence 222222221 1233444555566788999988775 2 234455666666666654221 Q ss_pred ccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecC-ceeEEechhHEEEecCC----- Q lcl|NC_021537. 148 TTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDA-GELKNGPANELIFLPNP----- 221 (602) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~-~~~~~~~~~eviH~r~~----- 221 (602) .. .|..+.+-.+..+++...+ +....+.++.||||... T Consensus 134 ~~------------------------------------dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~ 177 (422) T protein:vir:10 134 EE------------------------------------NPRNARFGEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNV 177 (422) T ss_pred cc------------------------------------CccccccCcceEEEEecCCCCcceeeccceeEEeCCCCchhh Confidence 10 0111111122222222222 23357778889999543 Q ss_pred -CCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCceEEEecc--ccC-CHHHHHHHHHHHHHhhc-ccccCcc Q lcl|NC_021537. 
222 -SPLALYYGVPDWVA-AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTG--GTL-SEDSKEDLRNLMDNLKG-SRYRTAI 295 (602) Q Consensus 222 -~~~~~~~G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~~~-~~~~~~~l~~~~~~~~g-~~nag~~ 295 (602) .+.+.++|.|++.. +...|.....+.......|...... ++++++ ..+ +......+.+.++.... ..+.+.+ T Consensus 178 ~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~--v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~ 255 (422) T protein:vir:10 178 MRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAI 255 (422) T ss_pred hcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhccc--cccchhHHHhcCCccchHHHHHHHHHHHHhcCCccce Confidence 34566799999986 6788888888888877777665543 333332 111 12222222233322221 1222333 Q ss_pred eeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hccccCCccCHHHHHHHHHHH Q lcl|NC_021537. 296 LEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTSTSNRANSKEQTREFAKG 374 (602) Q Consensus 296 ~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~sn~e~~~~~f~~~ 374 (602) ++...+.++..... +.+ .+.+......++||++.+||...| |....+-.|+.+...+.|+.. T Consensus 256 ~l~~~~e~~e~~~~------------~ls-----gl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~ 318 (422) T protein:vir:10 256 GIDAESEEYSVLNS------------DIG-----GIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALETFHKL 318 (422) T ss_pred eEecCCcceEEEec------------ccC-----ChHHHHHHHHHHHHhhhCCCeeeeccCCcccccccchHHHHHHHHH Confidence 44333333333221 111 134566777889999999998866 554333235566666677652 Q ss_pred -------HHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHH---HHHHHHHHHHHHhCCcccHHHHHHHhCC Q lcl|NC_021537. 375 -------IIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQD---AKMAEQRVRAMRLAGVGTVNEAREELDL 444 (602) Q Consensus 375 -------~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d---~~~~~~~~~~~~~~G~~T~NE~R~~~Gl 444 (602) .+.|.++.+-..+ ....+|+++|+--..+.-.+. 
.+..++++++++.+|+++++|+|+.+-- T Consensus 319 i~~~Qe~~l~p~l~~l~~~i--------~~s~~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~e~r~~L~~ 390 (422) T protein:vir:10 319 VDRKRNAELLPILEFLIPFI--------VNAEEWSVEFNPLAQESSKDKAEILEKNVNSIAALIAAGAMDIDEARDTLRT 390 (422) T ss_pred HHHHHHHHHHHHHHHHHHHh--------cccCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHhhh Confidence 2344433332221 223478888873222222211 2446788899999999999999998832 Q ss_pred CCCCCCccccccccccccccccccCCCcCccccccccccccc Q lcl|NC_021537. 445 APFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLE 486 (602) Q Consensus 445 ~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (602) .....+..+ ...+ ...+..+...++..+|+.+ T Consensus 391 ~~~~~~~~~-----~~~~-----~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 391 IAPEVKIND-----GSVE-----TEVTISETSNDPLEVPTDD 422 (422) T ss_pred hcccccCCC-----CCCc-----cccchhhcCCCCCCCCCCC Confidence 221111100 0000 0000000001111111111 No 120 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.83 E-value=4.2e-19 Score=121.34 Aligned_cols=506 Identities=11% Similarity=0.038 Sum_probs=210.4 Q ss_pred CCCCcccc------ccc-c---hhhh------------cccCccccC-CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCc Q lcl|NC_021537. 1 MSKAEETT------QLD-E---RHIA------------TDVGRGIQP-PYNPETLAAFQELNETHQACIRKKSRYEAGYG 57 (602) Q Consensus 1 ~~k~~~~~------~~~-~---~~~~------------~~~~~~i~p-~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~ 57 (602) ++.-.... ... . ..+. 
.....++.+ -+.-.+|..+++.|+.++++|++++++..+-+ T Consensus 59 ~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~alY~~~~l~rkiVd~pAeDa~R~g 138 (765) T protein:vir:96 59 VKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAIISQHWLVDKACSMSGEDAARNG 138 (765) T ss_pred CCcccCcccceeccccccccccchHHHhhhccCccchhhHHHhhhcccCCccHHHHHHHHhCchhhhhhhcchHHhhcCC Confidence 11000000 000 0 0000 000011111 23335678888899999999999999999999 Q ss_pred eEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC-CCceEEEEE Q lcl|NC_021537. 58 FEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG-DGTPVGLAH 136 (602) Q Consensus 58 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~-~G~~~~L~~ 136 (602) |+|.-. .++...+..+.+...+.+.. ..+.+...++..-++|.+|+.+.-+. ++.... .| T Consensus 139 ~~I~~~----~~e~~~~~~~~l~~~~~rl~--------------v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~-~P 199 (765) T protein:vir:96 139 WELKSD----GRKLSDEQSALIARRDMEFR--------------VKDNLVELNRFKNVFGVRIALFVVESDDPDYYE-KP 199 (765) T ss_pred ceeecC----ccccCHHHHHHHHHHHHHhh--------------HHHHHHHHHHHhhhceeeEEEEEecccCcchhh-cc Confidence 998632 11222333444444443321 23445555666678999998765432 222111 23 Q ss_pred eCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeec----ccccccccceeeecccceEEecCceeEEech Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGE----AGDRYGDDKRFVDKETGEVASDAGELKNGPA 212 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~----~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 212 (602) |++..|..... .+..+++ ..|... 
.....|..+.+-.+..+.+ .+ ..+.+ T Consensus 200 L~~~~I~kg~~-------------------kgl~vld--p~~~~~~~v~e~~~Dp~sp~fg~P~~y~i---~g--~~IH~ 253 (765) T protein:vir:96 200 FNPDGIAPGSY-------------------KGISQID--PYWAMPQLTAESTADPSAEHFYEPDFWII---SG--KKYHR 253 (765) T ss_pred cccccccccee-------------------eEEEEec--hhhcccccchhccccccccccCcceeeee---cC--ceecc Confidence 43333321100 0000000 000000 0000011111111111111 11 35667 Q ss_pred hHEEEecCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc--cCCHHHHHHHHHHHH Q lcl|NC_021537. 213 NELIFLPNPS------PLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG--TLSEDSKEDLRNLMD 284 (602) Q Consensus 213 ~eviH~r~~~------~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~--~~~~~~~~~l~~~~~ 284 (602) +.||||.... +...++|+|.++.++..|.....+......++...... ++++... ..++++...-.+.+. T Consensus 254 SRli~~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~--v~k~~~~~~l~~~~~l~~r~~~~~ 331 (765) T protein:vir:96 254 SHLVVVRGPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTS--TIHVDVEKAIANEDAFNARLAFWI 331 (765) T ss_pred ceEEEecCCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeeechHhhhccHHHHHHHHHHHH Confidence 8899986543 34556899999999999999888888777777776554 3333322 123333222222233 Q ss_pred HhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hccccCCccC Q lcl|NC_021537. 285 NLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTSTSNRAN 363 (602) Q Consensus 285 ~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~sn 363 (602) ..+ +|.+ ++++..+.++..+. 
++.+ .+.+.+....++||++.+||...| |..-.|-.++ T Consensus 332 ~~r--~n~g-~~~id~ee~~e~~s------------~~ls-----gl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnAT 391 (765) T protein:vir:96 332 ANR--DNHG-VKVIGIDETMEQFD------------TNLS-----DFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNAT 391 (765) T ss_pred Hhc--CCce-eEEecCCcceeEEe------------cccC-----CHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCc Confidence 333 2333 34444443333322 1111 123556667789999999997554 6543454566 Q ss_pred HHHHHHHHHH-------HHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHH---HHHHHHHHHHHhCCcc Q lcl|NC_021537. 364 SKEQTREFAK-------GIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDA---KMAEQRVRAMRLAGVG 433 (602) Q Consensus 364 ~e~~~~~f~~-------~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~---~~~~~~~~~~~~~G~~ 433 (602) .+...+.|+. ..|.|.++.+-..|-.. .....+++++|+--..+.-.+.+ +..++++++++.+|++ T Consensus 392 Ge~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~s----~~i~~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvi 467 (765) T protein:vir:96 392 GEHETISYHEELESIQEHIFDPLLERHYLLLAKS----ESIDVQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVV 467 (765) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCC Confidence 6777777766 23555555544443321 22334677777633222222222 3356788999999999 Q ss_pred cHHHHHHHhCCCCC------CCCcccccccccccc-ccccccCC-----CcCcccccccccccccccccccccccccccc Q lcl|NC_021537. 434 TVNEAREELDLAPF------EDDRGDMTLSEFEAE-FGADASDG-----DAEAMLTRSKAAPPLENKIGERDSVDVDVSK 501 (602) Q Consensus 434 T~NE~R~~~Gl~p~------~~g~~d~~~~~~~~~-~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (602) +++|+|+.++.+|. ++.+.+. .+...+ ..++.... ..........+.+. 
...............+ T Consensus 468 s~dEvR~~L~~~~~~g~~~l~d~~~e~--~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~a~p~-~~eg~~~~~~~~p~~~ 544 (765) T protein:vir:96 468 SPDEVRERLRDDPRSGYNRLTDDQAET--EPGMSPENLAELEKAGAQSAKAKGEAERAEAQAG-AVEGAGDPVPAAPRGT 544 (765) T ss_pred CHHHHHHHHhccccCCCCCCCcccccc--ccCCCccccccccCCCcccccccCccccccCCCC-ccCCCCcccccCCccc Confidence 99999999865543 2211110 000000 00000000 00000000000000 0000000000000011 Q ss_pred cchhhhhcchhhhhhheecccccEEEEEEecccCCcceeeeccCCHHHHHHHh-C-CCccchhhhhhhcccccccc-ccc Q lcl|NC_021537. 502 DPIEQTTFSSSNLDEGLYDFGERELYLSFKRESGQNSLYVYVDVPAAVWSALV-S-APSAGSYHYSEIRLQYGYLE-VTN 578 (602) Q Consensus 502 ~~m~~~~v~ss~~~~~~yd~~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~~~~-~-a~s~g~~~~~~i~~~~~~~~-~~~ 578 (602) .+..+..+..+...+. .+. ..+|...-.... . .++.+++-.......++-.. .++ T Consensus 545 ~p~~~~~~~~~g~~~~--~p~--------------------~~~p~~~~~~~~~~~~~~~~~~~~~a~~~g~~v~~~~~~ 602 (765) T protein:vir:96 545 KPLAKAAEEGAGEAAT--PPS--------------------RPNPRAELRNLLSDLLSKLEALDDAQAPDGVDIEQDDAP 602 (765) T ss_pred CCccccccccCccccC--ccc--------------------cccccccchhcccchhhhhhccccccccCCCCCCCCccc Confidence 1111111111110000 000 011111000000 0 01222222222222221100 112 Q ss_pred chhc--ccCCCCCChhhcCCcc---cccC Q lcl|NC_021537. 579 NHER--LPEGPTPDPGEAPEDV---PSDI 602 (602) Q Consensus 579 ~~~~--~~~~~~~~~~~~~~~~---~~~~ 602 (602) ++++ .|..++|++..++.+- |..- T Consensus 603 a~~~a~~ps~a~~~~~~~~~~~~~~P~~~ 631 (765) T protein:vir:96 603 GLKRTSKPSVSGMEPSVFSSNRIVGPRDH 631 (765) T ss_pred hhhhhhccccCCCCCcccCCCCCCCCccc Confidence 2111 1222333333332221 1111 No 121 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.82 E-value=5.2e-19 Score=120.84 Aligned_cols=504 Identities=11% Similarity=0.022 Sum_probs=214.1 Q ss_pred CCCCcc-------------cccccch----hhh-cc-cCccccCC-CCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_021537. 
1 MSKAEE-------------TTQLDER----HIA-TD-VGRGIQPP-YNPETLAAFQELNETHQACIRKKSRYEAGYGFEI 60 (602) Q Consensus 1 ~~k~~~-------------~~~~~~~----~~~-~~-~~~~i~p~-~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i 60 (602) +..+.. ...+... .+. +. ...++.+. +.-.+|.++++.|+.++.+|+++++++.+-+|+| T Consensus 88 ~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gyql~alY~~~~larkiVd~pAeDatR~g~~I 167 (862) T protein:vir:99 88 VRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGHQACALIAQHWLVDKACSLAGEDAIRNGWHL 167 (862) T ss_pred cchhhhhhhhhhhhcchhhhhhccccccccccccchhccccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCceE Confidence 100000 0000000 000 00 01122221 2224577788889999999999999999999999 Q ss_pred EEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeC-CCC---------- Q lcl|NC_021537. 61 VAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVE-GDG---------- 129 (602) Q Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~-~~G---------- 129 (602) .-..+.+ +.+.+..+.+...+.+.. ..+-+...++..-++|.+++.++.+ .++ T Consensus 168 ~~~~d~~--e~~~e~~~~ie~~~~rL~--------------v~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e 231 (862) T protein:vir:99 168 KSLGEGE--EIDEESLEKFKAIDVEFK--------------VKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPD 231 (862) T ss_pred eecCccc--ccCHHHHHHHHHHHHHhh--------------HHHHHHHHHHhcccccceEEEEEecCcCchhhhcCcCcc Confidence 7433222 112233444444443322 1233334444455778777765532 222 Q ss_pred -----ceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecC Q lcl|NC_021537. 130 -----TPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDA 204 (602) Q Consensus 130 -----~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 204 (602) .+..|..|+|.++.+.... .. ...+....| -.+..+.+ . 
T Consensus 232 ~I~kG~lkgl~vlDp~w~~p~~v~----~~-----~~Dp~sp~y------------------------GkP~~y~I---~ 275 (862) T protein:vir:99 232 GITPGSYRGISQIDPYWMMPMLTA----ES-----TADPSSQFF------------------------YEPEFWII---S 275 (862) T ss_pred cccccceeEEEEechhhhcccccc----cc-----ccccccccc------------------------CCceeeee---c Confidence 2344445555444321100 00 000000011 11111111 1 Q ss_pred ceeEEechhHEEEecCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc--cCCHHHH Q lcl|NC_021537. 205 GELKNGPANELIFLPNPS------PLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG--TLSEDSK 276 (602) Q Consensus 205 ~~~~~~~~~eviH~r~~~------~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~--~~~~~~~ 276 (602) + ..+.++.||||.... +...++|+|.++.+...|.....+......++.+.... ++++.+. ..+++.. T Consensus 276 g--~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~--v~ktd~l~~l~~ed~l 351 (862) T protein:vir:99 276 G--QKYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTT--AIHTDTAKAIANEDKF 351 (862) T ss_pred C--eeeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeechhHhhhccHHHH Confidence 1 245677788886443 34456899999999999999988888888887775543 3344332 1222222 Q ss_pred HHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHH-hhc Q lcl|NC_021537. 277 EDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVL-INV 355 (602) Q Consensus 277 ~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~-lg~ 355 (602) ..-.+.++..+ +|.| ++++..+.++..+. ++.+. +.+.+....++||++.+||... +|. 
T Consensus 352 ~~r~~~~~~~r--dN~G-i~liD~eEe~e~ls------------~slSG-----L~dll~~~~q~IAaas~IP~tiLfGq 411 (862) T protein:vir:99 352 IQRLMFWVRYR--DNHA-VKVLGTDETMEQFD------------TSLAD-----FDAVIMGQYQLVASIAKTPATKLLGT 411 (862) T ss_pred HHHHHHHHhcc--Ccce-eEEecCCCceeEEe------------cccCC-----hHHHHHHHHHHHHhhhCCCceeeccc Confidence 21112233332 3333 44454443333222 11111 2345566677999999999884 566 Q ss_pred cccCCccCHHHHHHHHHH-------HHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHH---HHHHHHHH Q lcl|NC_021537. 356 TSTSNRANSKEQTREFAK-------GIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDA---KMAEQRVR 425 (602) Q Consensus 356 ~~~~~~sn~e~~~~~f~~-------~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~---~~~~~~~~ 425 (602) .-.|..++.+.....|+. .-|+|+++++...+...+. ...++.++|+--..+.-.+.+ +..+++++ T Consensus 412 spaGlnATGE~D~~nYyD~I~s~QE~~L~P~LerL~~li~~~lg----~~~d~~ieFnpL~~~sekEkAEi~kk~Aea~~ 487 (862) T protein:vir:99 412 APKGFNSTGEFETISYHEELESIQEHVYMPFLQRHYLISRLSLG----IQHEIDVVMEPVASMTAQQQADLNKTKAEGGK 487 (862) T ss_pred CcccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CCCcceEEeCCCCCCCHHHHHHHHHHHHHHHH Confidence 545666777777777776 3477888888877765543 224677777632222222222 23457788 Q ss_pred HHHhCCcccHHHHHHHh------CCCCCCCCccccccccccccccccccCCCcCccccccccccccc-cccccccccccc Q lcl|NC_021537. 426 AMRLAGVGTVNEAREEL------DLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLE-NKIGERDSVDVD 498 (602) Q Consensus 426 ~~~~~G~~T~NE~R~~~------Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 498 (602) +++.+|+++++|+|+++ |++.+++.+.+.-. .. ..+...+. .........++..+ ............ 
T Consensus 488 ~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~--~~-~~e~~~~~---e~~g~a~~~ap~de~~aga~~~~~e~d 561 (862) T protein:vir:99 488 VLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETP--GA-SPENLAAY---QKAGAAQETASAKETQAGAAVTTAEGD 561 (862) T ss_pred HHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccC--CC-Cccccccc---ccCCcccccccccccccccCCccccCC Confidence 99999999999999976 45555433221100 00 00000000 00000000000000 000000000000 Q ss_pred ccccchh-hhhcchhhhhhheecccccEEEEEEecccCCcceeeeccCCHHHHHHHhCCCccchhhhhhhcccccccccc Q lcl|NC_021537. 499 VSKDPIE-QTTFSSSNLDEGLYDFGERELYLSFKRESGQNSLYVYVDVPAAVWSALVSAPSAGSYHYSEIRLQYGYLEVT 577 (602) Q Consensus 499 ~~~~~m~-~~~v~ss~~~~~~yd~~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~~~~~a~s~g~~~~~~i~~~~~~~~~~ 577 (602) .+..++. ............+..+....-.= .-...-++++.+.+.|..-... .++|.-..|..- . .. T Consensus 562 ~~~~p~~~~~~~g~~~~~t~~~~a~~p~~~~--~~~~~~~~~~e~~~~~~~~~~~------v~~~~~~~~~~~-~--~~- 629 (862) T protein:vir:99 562 QPNVQMVPSMKPGQMVGPEVGITAPMPEDDA--PVAGVVAKLAELQQAQMGAVTG------VLARLVEQLDRM-H--DR- 629 (862) T ss_pred cccccccCCCCCCCccccccccccCCCcccc--ccCcccccchhhhcCcchhhcc------hhhhhHHHHHhh-h--hh- Confidence 0000000 00000000011111110000000 0000012344555544222111 112222222100 0 00 Q ss_pred cchhcccCCC------CCChhhcCCcccccC Q lcl|NC_021537. 578 NNHERLPEGP------TPDPGEAPEDVPSDI 602 (602) Q Consensus 578 ~~~~~~~~~~------~~~~~~~~~~~~~~~ 602 (602) .++++.+.+. |..|+-+-.--|.+- T Consensus 630 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 660 (862) T protein:vir:99 630 TIAEGADIGQYDASGRTVKPGTIATIRPSVS 660 (862) T ss_pred hhhhhcchhhhccccccccccccCCCCCccc Confidence 1122222111 111100000012222 No 122 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.81 E-value=9.8e-20 Score=124.79 Aligned_cols=387 Identities=14% Similarity=0.113 Sum_probs=193.8 Q ss_pred CCCcccccccchhhhcccCccccCC---CCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHH Q lcl|NC_021537. 
2 SKAEETTQLDERHIATDVGRGIQPP---YNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQT 78 (602) Q Consensus 2 ~k~~~~~~~~~~~~~~~~~~~i~p~---~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~ 78 (602) -|.-+.-.+ ..-++....+-..|+ .+...|.++++.|+.++++|+++|+++.+-+|+|.-. +..++ T Consensus 1 ~~~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~----------~~~~~ 69 (427) T protein:vir:10 1 MKIVKHDGY-NDIFNGGADGSPKPFFMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKMSGV----------KDEKE 69 (427) T ss_pred CCccccchH-HHHhhcCCCCcccCccccCchHHHHHHHHcCchhhhhhccchHHhhcCCccccCc----------cHHHH Confidence 111111111 111211111111222 2445788899999999999999999999999998421 01122 Q ss_pred HHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEee----------CCCCceEEEEEeCcccccccccc Q lcl|NC_021537. 79 VRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILV----------EGDGTPVGLAHVPAATVRVRKTT 148 (602) Q Consensus 79 ~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r----------~~~G~~~~L~~l~p~~v~~~~~~ 148 (602) +...+.+.+ ..+-+..+++...++|.+++.+.- +..|.+..|.++++..|++.... T Consensus 70 ~~~~~~~l~--------------~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~ 135 (427) T protein:vir:10 70 FKSLWDSYK--------------LDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRV 135 (427) T ss_pred HHHHHHHhh--------------HHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccc Confidence 333232211 123444555666788999987753 23466777888877777653211 Q ss_pred cccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecC-ceeEEechhHEEEecCC------ Q lcl|NC_021537. 149 TTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDA-GELKNGPANELIFLPNP------ 221 (602) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~-~~~~~~~~~eviH~r~~------ 221 (602) . . |..+.+-.+..+++.... ...+.+.++.||||... 
T Consensus 136 ~------------d------------------------p~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~ 179 (427) T protein:vir:10 136 T------------N------------------------ARSPRYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQA 179 (427) T ss_pred c------------C------------------------ccccccCcceEEEEecCCCCcceEEccccEEEecCCCchhhh Confidence 0 0 001111112222222211 23367888889999643 Q ss_pred CCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc--cC-CHHHHHHHHHHHHHhh-cccccCcce Q lcl|NC_021537. 222 SPLALYYGVPDWVA-AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG--TL-SEDSKEDLRNLMDNLK-GSRYRTAIL 296 (602) Q Consensus 222 ~~~~~~~G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~--~~-~~~~~~~l~~~~~~~~-g~~nag~~~ 296 (602) .+.+.++|.|++.. +...|.....+.......|...... ++++++- .+ +.+....+.+.+.... ...+-+.++ T Consensus 180 ~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~ 257 (427) T protein:vir:10 180 RKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQA--VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIG 257 (427) T ss_pred cccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccc--cccchhHHHHhcCccchHHHHHHHHHHHHhcCccccee Confidence 34567899999864 6677887777777777766665433 3344321 11 1111112222222211 112233344 Q ss_pred eccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hccccCCccCHHHHHHHHHHH- Q lcl|NC_021537. 297 EVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTSTSNRANSKEQTREFAKG- 374 (602) Q Consensus 297 ~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~sn~e~~~~~f~~~- 374 (602) +...+.++..... ++ + .+.+......++||++.+||...| |....+-.|+.+.....|+.. 
T Consensus 258 l~~~~e~~e~~~~----------~l--s-----gl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i 320 (427) T protein:vir:10 258 IDAETEEYDVLNS----------DI--S-----GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV 320 (427) T ss_pred eecCCCceeEEec----------cc--C-----ChHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHH Confidence 4333333332221 11 1 123556667789999999998866 544333335556666666653 Q ss_pred ------HHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHH---HHHHHHHHHHHhCCcccHHHHHHHh--- Q lcl|NC_021537. 375 ------IIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDA---KMAEQRVRAMRLAGVGTVNEAREEL--- 442 (602) Q Consensus 375 ------~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~---~~~~~~~~~~~~~G~~T~NE~R~~~--- 442 (602) .|.|.++.+-..+ ....+|+++|+--..+.-.+.+ ++.++++++++++|+++++|+|+.+ T Consensus 321 ~~~Qe~~l~p~l~~l~~~i--------~~s~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~~L~~~ 392 (427) T protein:vir:10 321 DRKREEDYRPLLEFLLPFI--------VDEEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSI 392 (427) T ss_pred HHHHHHHHHHHHHHHHHHh--------hcCCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhh Confidence 2445444433322 1224788888743333323322 3567889999999999999999877 Q ss_pred -CCCCCCCCccccccccccccccccccCCCcCccccccccccccccccc Q lcl|NC_021537. 443 -DLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIG 490 (602) Q Consensus 443 -Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (602) +.+.+.++. + ... ..........+.+... . 
...+ T Consensus 393 ~~~~~~~~~~-~-------~~~-e~~~~~~e~~p~~~e~--~---~d~~ 427 (427) T protein:vir:10 393 APEFKLKDGN-N-------INI-REPEETTEPEPGLGEK--L---EDEN 427 (427) T ss_pred hccccCCCCc-c-------ccc-cccchhcCCCCCCCCC--C---CCCC Confidence 333332211 0 000 0000000000000000 0 0000 No 123 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=99.79 E-value=1e-18 Score=119.15 Aligned_cols=472 Identities=13% Similarity=0.123 Sum_probs=240.3 Q ss_pred CCCCc----cccccc-c-hhhhcccCccccCC-----------CCHHHHHHHH----hhhHHHHHHHHHHHHhhccCceE Q lcl|NC_021537. 1 MSKAE----ETTQLD-E-RHIATDVGRGIQPP-----------YNPETLAAFQ----ELNETHQACIRKKSRYEAGYGFE 59 (602) Q Consensus 1 ~~k~~----~~~~~~-~-~~~~~~~~~~i~p~-----------~~~~~l~~~~----~~~~~v~~cI~~ia~~ia~~~~~ 59 (602) |..++ ++..+. . +.+.....+++.|. -|+.....++ +..+.|.+|++.+...|.+++|. T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~l~~Rk~av~~~~w~ 91 (526) T protein:vir:99 12 IRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSKRKRAILGLDWA 91 (526) T ss_pred cccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCCCce Confidence 32221 122221 1 22333333333331 1222222232 35899999999999999999999 Q ss_pred EEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC---ceEEEEE Q lcl|NC_021537. 60 IVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG---TPVGLAH 136 (602) Q Consensus 60 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G---~~~~L~~ 136 (602) |.+..+... .+....+.++..+. 
...+|.+++..++ +.+.+|-+++|+++..+| .+..|.+ T Consensus 92 I~p~~~~~~--~~~~~a~~v~~~l~-------------~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l~~ 155 (526) T protein:vir:99 92 VEPPRNASA--AEKADADYLHELLL-------------DLEGLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHH 155 (526) T ss_pred EecCCCCCH--HHHHHHHHHHHHHh-------------cccCHHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEEeee Confidence 986533221 11122222222211 1124677777666 467899999999987654 3667888 Q ss_pred eCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhH-E Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANE-L 215 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~e-v 215 (602) +++.+++...+... ...+.. ....-..+++.. | T Consensus 156 r~~~~f~~~~~~~~-----------------~l~~~~-----------------------------~~~~g~~l~~~k~i 189 (526) T protein:vir:99 156 RPQSWFQLNPEDQN-----------------ELRLRD-----------------------------NSPAGEALQPFGWI 189 (526) T ss_pred ecccceeeccCCCc-----------------EEEecC-----------------------------CCCCceeecCCCeE Confidence 88776654332110 000000 000111233333 5 Q ss_pred EEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcc Q lcl|NC_021537. 216 IFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAI 295 (602) Q Consensus 216 iH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~ 295 (602) +|. +....+.++|.+.+..+...........++...|....|+|--+.+++.+ .++++++.+.+.+.++.. ++ . 
T Consensus 190 ~~~-~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~-a~~~ek~~L~~av~~i~~--d~--~ 263 (526) T protein:vir:99 190 IHR-PRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPG-TADEEKATLLRAVTGLGH--AA--A 263 (526) T ss_pred EEe-ecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCC-CCHHHHHHHHHHHHHHhh--Cc--E Confidence 554 44456778999999999999999999999999999999999888887644 477888888888877643 22 3 Q ss_pred eeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhc---cccCCccCHHHHHHHHH Q lcl|NC_021537. 296 LEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINV---TSTSNRANSKEQTREFA 372 (602) Q Consensus 296 ~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~---~~~~~~sn~e~~~~~f~ 372 (602) ++++.|++ ++|...+... -.-|.++.++..++|+.++ +-..+... ...++++..+.... .. T Consensus 264 ~iiP~~~~------------ie~~ea~~~~--~~~f~~li~~~d~~Isk~i-LGqtlTs~~~~g~~gS~a~g~vh~~-v~ 327 (526) T protein:vir:99 264 GIIPETMA------------IDFQQAAQGS--SEPFLAMMRQSEDAISKAV-LGGTLTSTTSQSGGGAFALGQVHNE-VR 327 (526) T ss_pred EEecCCce------------eEEeecCCCC--HHHHHHHHHHHHHHHHHHH-hhhhhccccccCcchhhhHHHHHHH-HH Confidence 44444443 3444433222 2347888888999998875 11222221 12234444333333 33 Q ss_pred HHHHHHHHHHHHHHHhhhcCCcccccc---------ceEEEeccchhcchhHHHHHHHHHHHHHHhCCc-ccHHHHHHHh Q lcl|NC_021537. 373 KGIIEPEQAKFSARLYKIIHQDALDVD---------EWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGV-GTVNEAREEL 442 (602) Q Consensus 373 ~~~l~P~~~~ie~~ln~~Ll~~~~~~~---------~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~-~T~NE~R~~~ 442 (602) .+.++--++.++..||+.|+.+.-... .-+++|+... 
..|.+..+++++++++.|+ ++..++|+.+ T Consensus 328 ~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e----~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~ 403 (526) T protein:vir:99 328 HDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLRE----QADITSMAQSIPALVNVGLEIPSAWVYDKL 403 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCC----cccHHHHHHHHHHHHhCCCccCHHHHHHHh Confidence 556778888999999987764322211 1245666544 3355667889999999998 8889999999 Q ss_pred CCCCCCCCccccccccccccccccccCCCcCccccccccc-ccccccccccccccc-cccccchh--hhhcchhhhhhh- Q lcl|NC_021537. 443 DLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAA-PPLENKIGERDSVDV-DVSKDPIE--QTTFSSSNLDEG- 517 (602) Q Consensus 443 Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~m~--~~~v~ss~~~~~- 517 (602) |+|.-.+++ +.+......... ....+............ .+............. ....+.+. ..++. ..+.+. T Consensus 404 Gip~~~~~e-~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~~~~~~~~~~~~~l~~i~-~~l~~~~ 480 (526) T protein:vir:99 404 GIPQPAKNE-PVLRSAAQPAIL-SRQHGQRVAALATIVGPRYGDQQALDKALADLPAKDMQNQANDLLAPLL-EAVNRGD 480 (526) T ss_pred CCCCCCCcc-cccCCCCCCccc-ccccccccccccccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhcC Confidence 996543332 221111000000 00000000000000000 000000000000000 00000000 01111 111111 Q ss_pred eecccccEEEEEEecccCCcceeeeccCCHHHHHHHhCCCccchhhhhhhcccccccccccchhcccC Q lcl|NC_021537. 518 LYDFGERELYLSFKRESGQNSLYVYVDVPAAVWSALVSAPSAGSYHYSEIRLQYGYLEVTNNHERLPE 585 (602) Q Consensus 518 ~yd~~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~~~~~a~s~g~~~~~~i~~~~~~~~~~~~~~~~~~ 585 (602) -|+ +|.. ...=-|-+.+...+.++|.. ..|..++.++|.-+ -+. 
++ T Consensus 481 s~e--------e~~~----~L~~l~~~ld~~~l~~~l~~----a~~~A~l~Gr~~~~-----~e~-~~ 526 (526) T protein:vir:99 481 SET--------ELLG----ALAEAFPDMDDSALTDALHR----LLFAADTWGRLHGN-----LDR-ID 526 (526) T ss_pred CHH--------HHHH----HHHHHhccCCHHHHHHHHHH----HHHHHHHhhhhhhh-----hcc-cC Confidence 122 2221 11113456777777766643 23555555553211 000 00 No 124 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=99.78 E-value=2.7e-18 Score=116.90 Aligned_cols=474 Identities=12% Similarity=0.091 Sum_probs=241.1 Q ss_pred CCCCccccc----ccc--hhhhcccCccccCC-----------CCHHHHHHHH----hhhHHHHHHHHHHHHhhccCceE Q lcl|NC_021537. 1 MSKAEETTQ----LDE--RHIATDVGRGIQPP-----------YNPETLAAFQ----ELNETHQACIRKKSRYEAGYGFE 59 (602) Q Consensus 1 ~~k~~~~~~----~~~--~~~~~~~~~~i~p~-----------~~~~~l~~~~----~~~~~v~~cI~~ia~~ia~~~~~ 59 (602) +++++.++. +.. +.+.....+++.|. -|+..+..++ +..+.|.+|++.+...|.+++|. T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~l~~Rk~av~~~~w~ 91 (528) T protein:vir:10 12 LRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFAEMSKRKRAVLGLDWT 91 (528) T ss_pred cccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhcCCce Confidence 333322211 111 22333333333331 1222222232 35889999999999999999999 Q ss_pred EEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC---ceEEEEE Q lcl|NC_021537. 60 IVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG---TPVGLAH 136 (602) Q Consensus 60 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G---~~~~L~~ 136 (602) |.+..+... .+ +....+++... . 
....|.+++..++ +.+.+|.+++|+++..+| .+..|.+ T Consensus 92 I~p~~~~~~--~~----~~~a~~v~~~l------~---~~~~f~~~i~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~~~~ 155 (528) T protein:vir:10 92 IEPPRNASA--AE----KADAEYLHELL------L---DLEGIEDLMLDCM-DGVGHGYSAIELDWSLQGREWLPQAFDH 155 (528) T ss_pred EecCCCCCH--HH----HHHHHHHHHHH------h---CCccHHHHHHHHH-hhhhhcceeEEEEEeecCCceeEEEeee Confidence 987533221 11 12222222211 0 1123566665544 367799999999986543 3667888 Q ss_pred eCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEE Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELI 216 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~evi 216 (602) +|+.++++..+... ...+..+ ...-..+++...| T Consensus 156 r~~~~f~~~~~~~~-----------------~l~~~~~-----------------------------~~~g~~l~~~k~i 189 (528) T protein:vir:10 156 RPQSWFQLNPDDQD-----------------ELRLRDN-----------------------------SIAGEVLQPFGWI 189 (528) T ss_pred ecccceeeccCCCc-----------------EEeccCC-----------------------------CCCceeecCCCeE Confidence 88776654332110 0000000 0011223444434 Q ss_pred EecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcce Q lcl|NC_021537. 217 FLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAIL 296 (602) Q Consensus 217 H~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~ 296 (602) ..++....+.++|.+.+..+...........++...|....|+|--+.+++.+ .++++++.|.+.+.++... 
+ .+ T Consensus 190 v~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~-a~~~ek~~L~~al~~i~~~--~--~~ 264 (528) T protein:vir:10 190 MHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPPG-TPDEEKVTLLRAVTGLGHA--A--AG 264 (528) T ss_pred EEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCC-CCHHHHHHHHHHHHHHhhC--c--EE Confidence 44455556778999999999999999999999999999999999888887644 5778888888887766432 2 23 Q ss_pred eccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhc---cccCCccCHHHHHHHHHH Q lcl|NC_021537. 297 EVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINV---TSTSNRANSKEQTREFAK 373 (602) Q Consensus 297 ~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~---~~~~~~sn~e~~~~~f~~ 373 (602) +++.|++ ++|...+.... .-|.++.++..++|+.+.-= ..+... ...++++-.+... .... T Consensus 265 iiP~~~~------------ie~~ea~~~~~--~~f~~li~~~d~~Isk~iLG-qtlTs~~~~g~~gS~Alg~vh~-~v~~ 328 (528) T protein:vir:10 265 IIPESMS------------IDFQEASKGSA--EPFMAMMRWCDDSMSKAILG-GTLTSQTSESGGGAYALGQVHN-EVRH 328 (528) T ss_pred EecCCce------------eEEeecCCCCh--hHHHHHHHHHHHHHHHHHhh-hhhhccccccccchhhhHHHHH-HHHH Confidence 4444433 34444332222 34788888888999887622 233222 1223444333333 3345 Q ss_pred HHHHHHHHHHHHHHhhhcCCcccccc---------ceEEEeccchhcchhHHHHHHHHHHHHHHhCCc-ccHHHHHHHhC Q lcl|NC_021537. 374 GIIEPEQAKFSARLYKIIHQDALDVD---------EWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGV-GTVNEAREELD 443 (602) Q Consensus 374 ~~l~P~~~~ie~~ln~~Ll~~~~~~~---------~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~-~T~NE~R~~~G 443 (602) +.++-.++.++..||+.|+.+.-... 
..+++|+...- .|.+..++++++++..|+ ++..++|+.+| T Consensus 329 di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~----eDl~~~a~~~~~L~~~G~~i~~~~i~e~~g 404 (528) T protein:vir:10 329 DLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLKDR----ADLAAMATSLPPLVKLGVQVPVNWVQEQLG 404 (528) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCCc----ccHHHHHHHHHHHHhCCCCCCHHHHHHHhC Confidence 67788888999999987754322111 12566666543 355667888999999998 89999999999 Q ss_pred CCCCCCCccccccccccccccccccCCCc-Ccccccccc-ccccccccccccccccc-ccccchhh--hhcchhhhhhhe Q lcl|NC_021537. 444 LAPFEDDRGDMTLSEFEAEFGADASDGDA-EAMLTRSKA-APPLENKIGERDSVDVD-VSKDPIEQ--TTFSSSNLDEGL 518 (602) Q Consensus 444 l~p~~~g~~d~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~~~m~~--~~v~ss~~~~~~ 518 (602) +|.-.+++ +.+............+.... ....+.... .........+....... ...+.+.. .++ -..+...+ T Consensus 405 ip~p~~~e-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~l~~i-~~~l~~~~ 482 (528) T protein:vir:10 405 IPLPANGE-AVLGDQAGAGIAQLSRRPGPRIAALAQVIGPRYRDQEALDQVLASLPAQDMQNQADSLVAPL-LDVISRGG 482 (528) T ss_pred CCCCCCCc-ccccCCCcccccccCcccccccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhcC Confidence 96544332 22222111111111110000 000000000 00000000000000000 00000000 011 11111111 Q ss_pred -ecccccEEEEEEecccCCcceeeeccCCHHHHHHHhCCCccchhhhhhhcccccccccccchhcccC Q lcl|NC_021537. 519 -YDFGERELYLSFKRESGQNSLYVYVDVPAAVWSALVSAPSAGSYHYSEIRLQYGYLEVTNNHERLPE 585 (602) Q Consensus 519 -yd~~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~~~~~a~s~g~~~~~~i~~~~~~~~~~~~~~~~~~ 585 (602) |+ +|.. ...=-|-+.....+.++|.. ..|..++.+++.-+. +. 
++ T Consensus 483 s~e--------e~~~----~L~~l~~~~d~~~l~~~l~~----a~~~A~l~G~~~~~~-----e~-~~ 528 (528) T protein:vir:10 483 SEA--------ELLG----ALAEAFPDMDDSALADALHR----LLFVADTWGRLNGTL-----DR-ID 528 (528) T ss_pred CHH--------HHHH----HHHHHhhcCCHHHHHHHHHH----HHHHHHHhhhhhccc-----cc-cC Confidence 22 2221 00113446777777776644 335666666633111 00 00 No 125 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=99.78 E-value=1.7e-18 Score=117.99 Aligned_cols=472 Identities=13% Similarity=0.115 Sum_probs=242.4 Q ss_pred CCCC----cccccccc--hhhhcccCccccCCC-----------CHHHHHHHH----hhhHHHHHHHHHHHHhhccCceE Q lcl|NC_021537. 1 MSKA----EETTQLDE--RHIATDVGRGIQPPY-----------NPETLAAFQ----ELNETHQACIRKKSRYEAGYGFE 59 (602) Q Consensus 1 ~~k~----~~~~~~~~--~~~~~~~~~~i~p~~-----------~~~~l~~~~----~~~~~v~~cI~~ia~~ia~~~~~ 59 (602) ++.. .++..+.. +.++....+++.|.. |+.....++ +..+.|.+|++.+...|.+++|. T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s~l~~Rk~av~~~~w~ 91 (526) T protein:vir:79 12 IRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSKRKRAILGLDWA 91 (526) T ss_pred cCccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCCCce Confidence 2222 12222222 334444444444321 222222232 35899999999999999999999 Q ss_pred EEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC---ceEEEEE Q lcl|NC_021537. 60 IVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG---TPVGLAH 136 (602) Q Consensus 60 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G---~~~~L~~ 136 (602) |.+..+... .+....+.++..+. ...+|.+++..++. 
-+.+|.+++|+++..+| .+..|.+ T Consensus 92 I~p~~~~~~--~~~~~a~~v~~~l~-------------~~~~~~~~i~~~ld-A~~~G~s~~Ei~w~~~~g~~~~~~l~~ 155 (526) T protein:vir:79 92 VEPPRNASA--AEKADADYLHELLL-------------DLEGLEDLLLDALD-GIGHGYSCIELEWALQGREWMPLAFHH 155 (526) T ss_pred EecCCCCCh--HHHHHHHHHHHHHh-------------cccCHHHHHHHHHh-hhhhcceeEEEEEeecCCceeEEEeee Confidence 987533221 11122222222211 11246677776655 67799999999987654 3667777 Q ss_pred eCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEE Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELI 216 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~evi 216 (602) .|+.+.+...+... ...... ....-..+++...| T Consensus 156 r~~~~F~~~~~~~~-----------------~l~~~~-----------------------------~~~~g~~l~~~k~i 189 (526) T protein:vir:79 156 RPQSWFQLNPEDQN-----------------ELRLRD-----------------------------NSPAGEALQPFGWI 189 (526) T ss_pred ecccceEeccCCCc-----------------EEEecC-----------------------------CCCCceeecCCceE Confidence 77766654322110 000000 00111233444333 Q ss_pred EecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcce Q lcl|NC_021537. 217 FLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAIL 296 (602) Q Consensus 217 H~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~ 296 (602) ..++....+.++|.+.+..+...........++...|...-|+|--+.+++.+ .++++++.+.+.+.++.. 
++ .+ T Consensus 190 v~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~~-a~~~ek~~L~~av~~i~~--da--~~ 264 (526) T protein:vir:79 190 IHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPG-TADEEKATLLRAVTGLGH--AA--AG 264 (526) T ss_pred EEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCC-CCHHHHHHHHHHHHHHhc--Cc--EE Confidence 33455556778999999999999999888999999999999999888887644 477788888888777643 22 34 Q ss_pred eccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhh---ccccCCccCHHHHHHHHHH Q lcl|NC_021537. 297 EVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLIN---VTSTSNRANSKEQTREFAK 373 (602) Q Consensus 297 ~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg---~~~~~~~sn~e~~~~~f~~ 373 (602) +++.|++ ++|...+... -.-|.++.++..++|+.+. +-..+.. ....++++..+..... .. T Consensus 265 iiP~~~~------------ie~~ea~~~~--~~~f~~li~~~d~~Isk~i-LGqtlTs~~~~g~~gS~a~g~vh~~v-~~ 328 (526) T protein:vir:79 265 IIPETMA------------IDFQQAAQGS--SEPFLAMMRQSEDAISKAV-LGGTLTSTTSQSGGGAFALGQVHNEV-RH 328 (526) T ss_pred EecCCce------------eEEeecCCCC--HHHHHHHHHHHHHHHHHHH-hhhhhccccccCcchhhhhHHHHHHH-HH Confidence 4445443 3444433222 2358888889999998875 1122222 1223344444433333 45 Q ss_pred HHHHHHHHHHHHHHhhhcCCcccccc---------ceEEEeccchhcchhHHHHHHHHHHHHHHhCCc-ccHHHHHHHhC Q lcl|NC_021537. 374 GIIEPEQAKFSARLYKIIHQDALDVD---------EWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGV-GTVNEAREELD 443 (602) Q Consensus 374 ~~l~P~~~~ie~~ln~~Ll~~~~~~~---------~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~-~T~NE~R~~~G 443 (602) +.+.-.++.++..||+.|+.+.-... .-+++|+... 
..|.+..++.++++++.|+ ++..++|+.+| T Consensus 329 di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e----~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~g 404 (526) T protein:vir:79 329 DILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLRE----QADITSMAQSIPALVNVGLEIPSAWVYDKLG 404 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCC----cccHHHHHHHHHHHHhCCCcCCHHHHHHHhC Confidence 66788889999999987764432211 1245665543 3366677889999999998 78888999999 Q ss_pred CCCCCCCccccccccccccccccccCCCcCccccccccc-ccccccccccc----cccccccccchhhhhcchhhhhh-h Q lcl|NC_021537. 444 LAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAA-PPLENKIGERD----SVDVDVSKDPIEQTTFSSSNLDE-G 517 (602) Q Consensus 444 l~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~----~~~~~~~~~~m~~~~v~ss~~~~-~ 517 (602) +|.-.++ .+.+...... .......+............ .+......+.. .........++ ..++.. .+.+ . T Consensus 405 ip~~~~~-e~~l~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~~~~~~~~~~~~~-~~~i~~-~~~~~~ 480 (526) T protein:vir:79 405 IPQPAKN-EPVLRPAAQP-AILSRQHGQRVAALATIVGPRYGDQQALDKALADLPAKDMQNQANDL-LAPLLD-AVNRGD 480 (526) T ss_pred CCCCCCc-hhhccccCCc-cccccccccccccccccccccCchhhHHHHHHHHHHHHHHHHHHHHH-HHHHHH-HHHhcC Confidence 9543322 2222111100 00000000000000000000 00000000000 00001111111 011111 1211 1 Q ss_pred eecccccEEEEEEecccCCcceeeeccCCHHHHHHHhCCCccchhhhhhhcccccccccccchhcccC Q lcl|NC_021537. 518 LYDFGERELYLSFKRESGQNSLYVYVDVPAAVWSALVSAPSAGSYHYSEIRLQYGYLEVTNNHERLPE 585 (602) Q Consensus 518 ~yd~~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~~~~~a~s~g~~~~~~i~~~~~~~~~~~~~~~~~~ 585 (602) -|+.-...|. =-|-+.+...+.++|.. ..|..++.+++.-+ -+. 
++ T Consensus 481 s~ee~~~~L~------------~l~~~ld~~~l~~~l~~----a~~~A~l~Gr~~~~-----~e~-~~ 526 (526) T protein:vir:79 481 SETELLGALA------------EAFPDMDDSALTDALHR----LLFAADTWGRLHGN-----LDR-ID 526 (526) T ss_pred CHHHHHHHHH------------HHhccCCHHHHHHHHHH----HHHHHHHhhhhhhh-----hcc-cC Confidence 1332111111 13346777777776643 33556666653311 000 11 No 126 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=99.75 E-value=1.3e-17 Score=113.09 Aligned_cols=439 Identities=10% Similarity=0.069 Sum_probs=210.1 Q ss_pred CCCCccc-ccccchhhhcc---c-----CccccC-------CCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021537. 1 MSKAEET-TQLDERHIATD---V-----GRGIQP-------PYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHP 64 (602) Q Consensus 1 ~~k~~~~-~~~~~~~~~~~---~-----~~~i~p-------~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~ 64 (602) |+..+++ +-++-.+.+.. . +.+.+. +-.....+++. ..+.|.+|++.+...|.+++|.|.+-. T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr~~~~~~ly~~m~-~D~hi~s~l~~Rk~av~~~~w~v~p~~ 79 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALRFPESIKTFQLMM-RDPAVAASVNIIKMFVRKVNWRFVPPK 79 (488) T ss_pred CCCccccCCCCCHHHHHHHHHHhhccccchhhccchhhhcccchHHHHHHHh-hChHHHHHHHHHHHHHhcCCceEecCC Confidence 5444432 22222222211 1 112221 11223445554 489999999999999999999998653 Q ss_pred CCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC-------------Cc- Q lcl|NC_021537. 65 SADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD-------------GT- 130 (602) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~-------------G~- 130 (602) +........+ ...++..+.. +-..+|.+++..++ |.+.+|-+++|+++... |. 
T Consensus 80 ~~~~d~~~~~----~a~~v~~~l~--------~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~ 146 (488) T protein:vir:95 80 GKEQDPKMLE----RADFFNSLMD--------DMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLI 146 (488) T ss_pred CCchhHHHHH----HHHHHHHHHh--------ccCccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCee Confidence 3322111111 1222222111 11235777887775 57889999999999643 21 Q ss_pred -eEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEE Q lcl|NC_021537. 131 -PVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKN 209 (602) Q Consensus 131 -~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 209 (602) +..|.+.|+.+++- |.....++.... . .. .+.......... +......... T Consensus 147 ~~~~i~~Rpq~~~~~-----------------------f~~d~d~~l~~~-~-~~-~~~~~~~~~~~~--~~~~~~~~~~ 198 (488) T protein:vir:95 147 GWAKLPIRNQSTLDK-----------------------WYFDEDFRRVTG-V-RQ-NLRNVSHIAGAI--NLGERPLTRK 198 (488) T ss_pred eeeeeeecCcccccc-----------------------eeeccCCCceee-c-cc-cccccccccccc--cccccccccc Confidence 23333333322110 000000000000 0 00 000000000000 0001122345 Q ss_pred echhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc---cCCHHHHHHHHHHHHH- Q lcl|NC_021537. 210 GPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG---TLSEDSKEDLRNLMDN- 285 (602) Q Consensus 210 ~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~---~~~~~~~~~l~~~~~~- 285 (602) +|+...|+.++....+.+||.+.+..+............+...|....+.|--+.+.+-. 
..+++....+.+...+ T Consensus 199 lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i 278 (488) T protein:vir:95 199 LPRAKFMLFKYDDEYGNPEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTV 278 (488) T ss_pred ccccceEEEeecCCCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHH Confidence 677776666665556778999999999999988888888888888876555444444311 1233334433333332 Q ss_pred ---hhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCcc Q lcl|NC_021537. 286 ---LKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRA 362 (602) Q Consensus 286 ---~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s 362 (602) ..++..++. +++.|+.... ....++++-++.....-..|.++.++.-++|+.+.--.---++...+|+++ T Consensus 279 ~~~~~~~~~ag~--iiP~g~~~~~-----k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~A 351 (488) T protein:vir:95 279 VNDMIANDRAGL--IWPRYIDPDT-----KEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFS 351 (488) T ss_pred HHHhhccchhhe--eecccccccc-----chhhhhhhccccccCCchhHHHHHHHHHHHHHHHHhccccccccCcchhhh Confidence 333333443 3344433221 111223333332222223467777888888888764321112222334555 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccc-----ccc-ceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccH- Q lcl|NC_021537. 363 NSKEQTREFAKGIIEPEQAKFSARLYKIIHQDAL-----DVD-EWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTV- 435 (602) Q Consensus 363 n~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~-----~~~-~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~- 435 (602) ..+.... .....+.-.++.+++.||+.|+.+.- ... .-+|+|+... 
..|.+..+++++++++.|+.-+ T Consensus 352 l~~vh~e-v~~~i~~aDa~~i~~tln~~li~~l~~~Nfg~~~~~P~~~~~~~e----~~Dl~~~ae~~~~L~~~G~~i~~ 426 (488) T protein:vir:95 352 LADSKTS-LLAMSVDILLKQIKNVINRDLVAQTYALNMWDDEEHVQITYDDIE----TPDLEAIGSYIQKTVAVGALEVD 426 (488) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecCcC----hhhHHHHHHHHHHHHhCCCcccc Confidence 4443333 33567788899999999987775431 111 1245665433 3366677899999999999876 Q ss_pred ----HHHHHHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccccccccccc Q lcl|NC_021537. 436 ----NEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDV 499 (602) Q Consensus 436 ----NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (602) +.+|+.+|+|+-+++. ..... .. ...++..+.... ......+..........+.+... T Consensus 427 ~~~~~~i~e~~gip~~~~~e-~~~~~--~~-~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~a~~~~~ 488 (488) T protein:vir:95 427 KELSNKLREHIGLPPADESQ-PVSEK--LS-PNSQSRSGDGYK--TAGEGTAKTPSAKDPSTANKANK 488 (488) T ss_pred HHHHHHHHHHhCCCCCCCCc-ccccc--CC-CCCCCCCCcccC--CCcccCCcccccccchhhhhccC Confidence 4589999998654332 21111 11 111111111000 00000000000000000000000 No 127 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=99.75 E-value=3.9e-17 Score=110.56 Aligned_cols=409 Identities=11% Similarity=0.036 Sum_probs=214.5 Q ss_pred CCCCccccccc-----------------c-hhhhcc-cCccccCCC--------CHHHHHHHHhhhHHHHHHHHHHHHhh Q lcl|NC_021537. 1 MSKAEETTQLD-----------------E-RHIATD-VGRGIQPPY--------NPETLAAFQELNETHQACIRKKSRYE 53 (602) Q Consensus 1 ~~k~~~~~~~~-----------------~-~~~~~~-~~~~i~p~~--------~~~~l~~~~~~~~~v~~cI~~ia~~i 53 (602) |+|+.++.+-. . ..+.+. +.+.+.+.. 
++....++.+ .+.|.+|++.+...| T Consensus 1 m~kk~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~-D~hi~s~l~~Rk~av 79 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLS-DGTVKNALNYIFGRI 79 (448) T ss_pred CCCCCCCCcccCCcccccchhhhhhhccchhhhcccccccccccchhHhhccccchHHHHHHhh-ChHHHHHHHHHHHHH Confidence 76666544211 0 111111 112221111 2233445544 899999999999999 Q ss_pred ccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC--CCc- Q lcl|NC_021537. 54 AGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG--DGT- 130 (602) Q Consensus 54 a~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~--~G~- 130 (602) .+++|.|.+..+. ..+ +.+..+++.+..... -.....+|.+++..++ |.+.+|.+++|+++.. +|. T Consensus 80 ~~~~w~v~p~~~~---~~d----~~~ae~v~~~l~~~~---~~~~~~~f~~~i~~~l-da~~~G~s~~Eivw~~~~dg~~ 148 (448) T protein:vir:77 80 RSAKWYVEPASTD---PED----IAIAAFIHAQLGIDD---ASVGKYPFGRLFAIYE-NAYIYGMAAGEIVLTLGADGKL 148 (448) T ss_pred hcCCceEecCCCC---HHH----HHHHHHHHHHhhchh---hhhccCCHHHHHHHHH-HhhhhcceeEEEEEeecCCCce Confidence 9999999763221 111 222233332211000 0112346888888774 6889999999999853 454 Q ss_pred -eEEEEEeCcccccccccccccccccchhhhhcccCceeEEEE-cCCcceeecccccccccceeeecccceEEecCceeE Q lcl|NC_021537. 131 -PVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVR-QGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELK 208 (602) Q Consensus 131 -~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 208 (602) +..|.+.++.+++- +++. .++.+.....+ ...|. ..+.... 
T Consensus 149 ~~~~l~~r~~~~~~~------------------------f~~~~~~~l~~~~~~~-----------~~~~~--~~~~~~~ 191 (448) T protein:vir:77 149 ILDKIVPIHPFNIDE------------------------VLYDEEGGPKALKLSG-----------EVKGG--SQFVNGL 191 (448) T ss_pred eeccccccCCCccce------------------------eeeecCCceEEEecCC-----------ccccc--ccCCCcc Confidence 33555555543320 0000 00000000000 00000 0111223 Q ss_pred EechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc-cCCHHHHHHHHHHHHHhh Q lcl|NC_021537. 209 NGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG-TLSEDSKEDLRNLMDNLK 287 (602) Q Consensus 209 ~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~~~~~~l~~~~~~~~ 287 (602) .+|..-++|.+.. ..+.++|.+.+..|...........++...|...-++|--+.+.+.+ ..++++++.+.+...++. T Consensus 192 ~lP~~~~i~~~~~-~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~ 270 (448) T protein:vir:77 192 EIPIWKTVVFLHN-DDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFV 270 (448) T ss_pred ccccceEEEEecC-CcCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHh Confidence 5577778888754 45678999999999999999999999999999999999888887643 345677888888888877 Q ss_pred cccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH Q lcl|NC_021537. 288 GSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ 367 (602) Q Consensus 288 g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~ 367 (602) ++.+++. +++.|++..-+.. +... ..+.++.++.-++|+.+..-. .+.-..+.++++...+. 
T Consensus 271 ~g~~a~~--iiP~g~~ie~~ea------------~~~~---~~~~~~i~~~d~~Isk~iLGq-tlTs~~~~g~~~~~~~~ 332 (448) T protein:vir:77 271 QKPRHGI--ILPDDWKFDTVDL------------KSAM---PDAIPYLTYHDAGIARALGID-FNTVQLNMGVQAVNIGE 332 (448) T ss_pred cCCceEE--EecCCceEEEEec------------CCCc---cCHHHHHHHHHHHHHHHHhcc-ccccccccchhhhhhhh Confidence 6666653 3455554443322 1111 224466677778888876432 22212222333333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCCcc-----ccccc-eEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHH Q lcl|NC_021537. 368 TREFAKGIIEPEQAKFSARLYKIIHQDA-----LDVDE-WTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREE 441 (602) Q Consensus 368 ~~~f~~~~l~P~~~~ie~~ln~~Ll~~~-----~~~~~-~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~ 441 (602) ......+.++-.++.+++.||+.|+.+. ..... -+|.|+..+. .|.+..++++++++ +-.|+. T Consensus 333 ~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~~e~----eDl~~~a~~~~~l~-------~~~~~~ 401 (448) T protein:vir:77 333 FVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFEMEER----NDFSAAANLMGMLI-------NAVKDS 401 (448) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCCCh----hhHHHHHHHhHHHH-------HHHHHH Confidence 2344566778888999999998877543 11122 2567765543 36666778888775 458899 Q ss_pred hCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccccccccccccccchhh Q lcl|NC_021537. 442 LDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQ 506 (602) Q Consensus 442 ~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~ 506 (602) +|+|.-.++..+. ..... +...+. 
.+ .+++ ..+..++.+....-..+ T Consensus 402 ~~ip~~~~~~~~~----~~~~~--~~~~~~-----~~--~~~~-----~~~~~~~~~~~~~r~~~ 448 (448) T protein:vir:77 402 EDIPTELKALIDA----LPSKM--RRALGV-----VD--EVRE-----AVRQPADSRYLYTRRRR 448 (448) T ss_pred hcCCccCCcCCCC----Cchhc--ccccCC-----CC--CCCc-----hhhcchhhHHHHhhhcC Confidence 9986422211010 00000 000000 00 0000 00000111110000000 No 128 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.74 E-value=4.1e-17 Score=110.45 Aligned_cols=453 Identities=12% Similarity=0.101 Sum_probs=234.1 Q ss_pred CCCCcccccccchh-------hhcccCccccCCCCH---------HHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021537. 1 MSKAEETTQLDERH-------IATDVGRGIQPPYNP---------ETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHP 64 (602) Q Consensus 1 ~~k~~~~~~~~~~~-------~~~~~~~~i~p~~~~---------~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~ 64 (602) ++.++..+. .... +.....+.+-|..++ ....++. .++.|.+|++.+...|.+++|.|.+-. T Consensus 13 ~~~~~~~~~-~~~~ia~~~~~~~~~~~~~~~p~~~~il~~~~~~~~~y~~m~-~D~~i~s~l~~Rk~av~~~~w~i~~~~ 90 (491) T protein:vir:79 13 VKFGEPDKS-LSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVYRELR-ADAHVGGCVRRRKAAVKALEWGLDRGK 90 (491) T ss_pred ccccccchh-HHHHHhhhccccccccccccCcchhHHHhhccCCHHHHHHHh-hChHHHHHHHHHHHHHhCCCcEEecCC Confidence 333332211 1112 222223344444343 3334544 589999999999999999999998643 Q ss_pred CCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC---ceEEEEEeCccc Q lcl|NC_021537. 65 SADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG---TPVGLAHVPAAT 141 (602) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G---~~~~L~~l~p~~ 141 (602) +.+ +..+.+... +. 
...+.+++..++ +.+.+|.+++|+++...| .|..|.++|+.+ T Consensus 91 ~~~------~~a~~i~e~----------l~----~~~~~~~i~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~ 149 (491) T protein:vir:79 91 AKS------RVAKSIADV----------FA----DLDLSRIATEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADW 149 (491) T ss_pred CCH------HHHHHHHHH----------Hh----cCCHHHHHHHHH-HhhhhcceeEEEEEeecCCeeeEEeeeeecccc Confidence 221 111222221 11 235777777665 577899999999986654 356788888877 Q ss_pred ccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCC Q lcl|NC_021537. 142 VRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNP 221 (602) Q Consensus 142 v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~ 221 (602) +++..+... . +... ........+++...|++++. T Consensus 150 f~~d~~~~l-----------------~--l~~~---------------------------~~~~~g~~lp~~k~i~~~~~ 183 (491) T protein:vir:79 150 FVYDPENQL-----------------R--FRSK---------------------------EHWVQGEELPARKFLVPRQE 183 (491) T ss_pred eeeccCCce-----------------E--Eeec---------------------------CCCCCceeecCCCeEEEEec Confidence 764332100 0 0000 00011234455556666666 Q ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCC Q lcl|NC_021537. 
222 SPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEF 301 (602) Q Consensus 222 ~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g 301 (602) ...+.++|.|.+..+...........++...|....++|-.+.+++.+ .++++++.+.+.+.++.+ +++ ++++.| T Consensus 184 ~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~-a~~~ek~~l~~al~~~~~--~a~--~viP~~ 258 (491) T protein:vir:79 184 ATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRS-ASDAETNLLLDRLEDMVQ--DAV--AVIPDD 258 (491) T ss_pred CCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCC-CCHHHHHHHHHHHHHHhc--CeE--EEecCC Confidence 556778999999999999999999999999999999999888887643 577778888777776532 222 344444 Q ss_pred ccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 302 VDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQA 381 (602) Q Consensus 302 ~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~ 381 (602) ++. +|...+.....-..|.++.++..++|+.+.-= -.+... .+++++..+..... ....++-.++ T Consensus 259 ~~i------------e~~ea~~~~g~~~~y~~li~~~d~~Isk~iLG-qtlTt~-~~gs~a~~~vh~~v-~~~i~~~D~~ 323 (491) T protein:vir:79 259 SSI------------EIKEAAGKSGSADVYERLLHFCRGEVSIALLG-QNQTTE-ATSTRASAQAGLEV-TDDIRDGDKA 323 (491) T ss_pred cee------------EEEeccCCCCChhHHHHHHHHHHHHHHHHHhh-hhhccC-cccchhhHHHHHHH-HHHHHHHHHH Confidence 333 33322211111123777778888888876521 111111 34556655544443 3556677788 Q ss_pred HHHHHHhhhcCCccc-----cccceEEEeccchhcchhHHHHHHHHHHHHHHhCCc-ccHHHHHHHhCCCCCCCCccccc Q lcl|NC_021537. 382 KFSARLYKIIHQDAL-----DVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGV-GTVNEAREELDLAPFEDDRGDMT 455 (602) Q Consensus 382 ~ie~~ln~~Ll~~~~-----~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~-~T~NE~R~~~Gl~p~~~g~~d~~ 455 (602) .++..||+ |+.+.- .... ++|.+..... +.+..++.++++++.|+ ++.+++|+.+|+|+-+.+..... 
T Consensus 324 ~i~~tln~-li~~l~~~N~~~~~~--p~f~~~e~ee---~~~~~a~~~~~L~~~G~~i~~~~~~e~~Gip~~~~~e~~~~ 397 (491) T protein:vir:79 324 IVVEAMNM-LIRWICDLNFDGAAR--PVFDMWEQEQ---VDEIQAGRDEKLTRAGARFTPAYFKRAYNLQDGDLDERPLP 397 (491) T ss_pred HHHHHHHH-HHHHHHHhcCCCCCc--ceEeecCcCc---hhHHHHHHHHHHHhCCCccCHHHHHHHhCCCCCCCCccccC Confidence 88888885 443221 1222 3444443322 22456788999999988 78888999999986544332111 Q ss_pred cccccccccccccCCCcCcccccccccccccccccccccccccccccchhhhhcchhhhhhhe-ecccccEEEEEEeccc Q lcl|NC_021537. 456 LSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGL-YDFGERELYLSFKRES 534 (602) Q Consensus 456 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~-yd~~~~~l~~~f~~~~ 534 (602) ...... ... ...... .. ...+..+....+...........++- .++. ..|.+.+ |+ +|.. T Consensus 398 ~~~~~~-~~~-~~~~~~-~~----~~~~~~d~~~~~~~~~~~~~~~~~~~-~~i~-~~l~~~~s~~--------e~~~-- 458 (491) T protein:vir:79 398 VSAVDA-VGA-ASFAEF-EA----PDQDALDAALNALSARDLNADAQALV-APLL-KRIANGASAD--------ELLG-- 458 (491) T ss_pred cCcccc-ccc-cccccc-CC----CCCcchHHHHHHHHHHHHHHHHHHHH-HHHH-HHHHhcCCHH--------HHHH-- Confidence 111000 000 000000 00 00000000000000000010100000 1111 1122211 22 2221 Q ss_pred CCcceeeeccCCHHHHHHHhCCCccchhhhhhhcccccc Q lcl|NC_021537. 
535 GQNSLYVYVDVPAAVWSALVSAPSAGSYHYSEIRLQYGY 573 (602) Q Consensus 535 ~~~~~y~y~~v~~~~~~~~~~a~s~g~~~~~~i~~~~~~ 573 (602) ...=.|-+.+...+.++|..-- |.+++.++..- T Consensus 459 --~L~~l~~~~d~~~l~~~l~~a~----~~A~l~Gr~~a 491 (491) T protein:vir:79 459 --MLAELYPSLDTDALQERLARAI----FVANLWGRLHA 491 (491) T ss_pred --HHHHHhhcCCHHHHHHHHHHHH----HHHHHhhhccC Confidence 1012345677777776663321 34444443211 No 129 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=99.74 E-value=7.1e-18 Score=114.60 Aligned_cols=455 Identities=12% Similarity=0.042 Sum_probs=238.3 Q ss_pred CCCCcccccc----cchhhhcccCccccCC-CC---------HHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021537. 1 MSKAEETTQL----DERHIATDVGRGIQPP-YN---------PETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSA 66 (602) Q Consensus 1 ~~k~~~~~~~----~~~~~~~~~~~~i~p~-~~---------~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~ 66 (602) |+|..-+++. ..+.+...+.+...|| .+ +...+.+.+ .+.|.+|++.+...|.+++|.|.+..+ T Consensus 1 v~~~~l~~e~at~~~~~d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~~-D~~i~s~l~~rk~av~~~~w~i~p~~~- 78 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRPFISGLQVPNDSILQRRGGNDLRVYEEILS-DAQVKTVWGQRQLAVVSREWKVEAGGD- 78 (488) T ss_pred CCccchhHHHHHHHhhhhhhccccCCCCCCChHHHHhhccCCHHHHHHHhh-ChHHHHHHHHHHHHHhcCCceEEcCCC- Confidence 7775533332 2222322222222332 12 233455544 789999999999999999999986432 Q ss_pred CCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC---ceEEEEEeCccccc Q lcl|NC_021537. 67 DEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG---TPVGLAHVPAATVR 143 (602) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G---~~~~L~~l~p~~v~ 143 (602) +..+.+ ...+++.. +. 
...|.++++.++ +.+.+|.+++|+++..+| .+..|.++|+.+++ T Consensus 79 --~~~~~~----~ae~v~~~------l~----~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~ 141 (488) T protein:vir:99 79 --RPIDQA----AAEHLEQQ------LQ----RVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRFR 141 (488) T ss_pred --ChHHHH----HHHHHHHH------Hh----CCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeeccccee Confidence 112222 22222221 11 235788888776 468899999999996544 35678888877665 Q ss_pred ccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechh--HEEEecCC Q lcl|NC_021537. 144 VRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPAN--ELIFLPNP 221 (602) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~--eviH~r~~ 221 (602) ...+... . +.... .......+|.. =|+|.... T Consensus 142 ~d~~~~l-----------------~--~~~~~---------------------------~~~~g~~lp~~~~~i~~~~~~ 175 (488) T protein:vir:99 142 YDQDGGL-----------------R--LLTPN---------------------------NMFEGEPCPAPYFWHFSTGAD 175 (488) T ss_pred ecCCCce-----------------E--EeccC---------------------------CCCCccccccCceEEEEeecC Confidence 4322110 0 00000 00001122211 14554433 Q ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCC Q lcl|NC_021537. 222 SPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEF 301 (602) Q Consensus 222 ~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g 301 (602) . .+.++|.|.+..|...........++...|....++|-.+.+++....++++++.+.+.+.++... 
+ .++++.| T Consensus 176 ~-~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~l~~av~~~~~~--~--~~viP~~ 250 (488) T protein:vir:99 176 N-DDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAKLLAALHAIQTD--S--AIIMPAG 250 (488) T ss_pred C-CCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHHHHHHHHHHhcC--c--EEEecCC Confidence 3 567899999999999999999999999999999999988877764345778888888777765422 2 2344444 Q ss_pred ccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 302 VDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQA 381 (602) Q Consensus 302 ~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~ 381 (602) ++ ++|...+.... ..|.++.++..++|+.+.-= ..+.+...+++++..+..... ..+.++-.++ T Consensus 251 ~~------------ie~~ea~~~~~--~~~~~li~~~d~~Isk~iLG-qtlts~~~~Gs~a~~~vh~~v-~~d~~~aDa~ 314 (488) T protein:vir:99 251 MQ------------AELLEAGRSGT--ADYKTLHDTMDATIAKVGLG-QVASTQGTPGRLGNDDLQADV-RLDLVKADAD 314 (488) T ss_pred ce------------eEEeecCCCCh--HHHHHHHHHHHHHHHHHHhh-hhhcccccccchhhHHHHHHH-HHHHHHHHHH Confidence 33 33443332222 35788888889999887521 233344444556665554443 4667888999 Q ss_pred HHHHHHhhhcCCccccc-----cceEEEeccchhcchhHHHHHHHHHHHHHHhC-Cc-ccHHHHHHHhCCCCCCCCcccc Q lcl|NC_021537. 382 KFSARLYKIIHQDALDV-----DEWTIDFELRGAEQPEQDAKMAEQRVRAMRLA-GV-GTVNEAREELDLAPFEDDRGDM 454 (602) Q Consensus 382 ~ie~~ln~~Ll~~~~~~-----~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~-G~-~T~NE~R~~~Gl~p~~~g~~d~ 454 (602) .++..||+.|+.+.-.. ..-++.|+...- .|.+..+++++++++. |+ ++..++|+.+|+|+-..++ +. 
T Consensus 315 ~i~~tln~~li~~l~~~N~~~~~~p~~~~~~~e~----edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip~~~~~~-~~ 389 (488) T protein:vir:99 315 LICESFNLGPARWLTEWNFPGAQPPRVYRVIEEP----EDITAKAERDEKVFRMSGFRPTRGYVQETYGVEVESTQA-EA 389 (488) T ss_pred HHHHHHHHHHHHHHHHhCcCCcCCceeEecCCCc----ccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCCCccccc-cc Confidence 99999998776532221 112456655443 3556678889999986 64 6777899999998754332 21 Q ss_pred ccccccccccccccCCCcCcccccccccccccccccccccccccccccchhhhhcchhhhhh-heecccccEEEEEEecc Q lcl|NC_021537. 455 TLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDE-GLYDFGERELYLSFKRE 533 (602) Q Consensus 455 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~-~~yd~~~~~l~~~f~~~ 533 (602) ....... +..... . ..........+..+..........++ + -..+.+ .-|+.-...|. T Consensus 390 ~~~~~~~------~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~-----i-~~~l~~a~s~ee~~~~L~------ 448 (488) T protein:vir:99 390 TAPTPST------EFAEGD-Q--PSDPAAAMAPQLAEAMQPVVGNWTTQ-----L-RTLIEQASSLEDLRERLL------ 448 (488) T ss_pred ccCCCcc------cCCCCC-C--CCCchHHHHHHHHHHHHHHHHHHHHH-----H-HHHHHhcCCHHHHHHHHH------ Confidence 1111000 000000 0 00000000000000000000000011 1 111111 11221111111 Q ss_pred cCCcceeeeccCCHHH-----HHHHhCCCccchhhh-hhhccccccccc Q lcl|NC_021537. 534 SGQNSLYVYVDVPAAV-----WSALVSAPSAGSYHY-SEIRLQYGYLEV 576 (602) Q Consensus 534 ~~~~~~y~y~~v~~~~-----~~~~~~a~s~g~~~~-~~i~~~~~~~~~ 576 (602) .+ |.+.+... 
.+.|..|+-.|++=- ..++++ ++| T Consensus 449 ----~l--~~~~d~~~l~~~l~~a~~~a~l~G~~~~~~e~~~~---~~~ 488 (488) T protein:vir:99 449 ----DL--APQLSLDQYAQAMAEGLEAAHLAGRNDVQEELDGR---EQI 488 (488) T ss_pred ----HH--hccCCHHHHHHHHHHHHHHHHHhhhhhHhhhhccc---CCC Confidence 11 23344443 334445555555522 233332 222 No 130 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.73 E-value=7.3e-17 Score=109.04 Aligned_cols=454 Identities=12% Similarity=0.118 Sum_probs=232.8 Q ss_pred CCCCcccc----cccchhhhccc-CccccCC-CC---------HHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021537. 1 MSKAEETT----QLDERHIATDV-GRGIQPP-YN---------PETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPS 65 (602) Q Consensus 1 ~~k~~~~~----~~~~~~~~~~~-~~~i~p~-~~---------~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~ 65 (602) +.+.+..+ ....++-..+. ..+..|+ .+ .....++. .++.|.+|++.+...|.+++|.|.+-.+ T Consensus 13 ~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~~~~~~iLr~~~~~~~~y~~m~-~D~~i~s~l~~Rk~av~~~~w~i~~~~~ 91 (491) T protein:vir:10 13 VTFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVYRELR-ADAHVGGCVRRRKAAVKALEWGLDRGKA 91 (491) T ss_pred cCcccCChHHHHHHHhhhcccccccccCCccchHHHHHhcCCCHHHHHHHh-hChHHHHHHHHHHHHHhCCCcEEecCCC Confidence 44333222 22222211111 1222332 22 23334443 5899999999999999999999976432 Q ss_pred CCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC---ceEEEEEeCcccc Q lcl|NC_021537. 66 ADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG---TPVGLAHVPAATV 142 (602) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G---~~~~L~~l~p~~v 142 (602) . .+..+.+...+ . 
...|.+++..++ |.+.+|.+++|+++...| .+..|.++|+.++ T Consensus 92 ~------~~~~e~v~e~l----------~----~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f 150 (491) T protein:vir:10 92 K------SRVAKSIADVF----------A----DLDLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWF 150 (491) T ss_pred C------HHHHHHHHHHH----------h----cCCHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeecccce Confidence 1 11112222211 1 235778888776 578899999999997654 3667888888777 Q ss_pred cccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCC Q lcl|NC_021537. 143 RVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPS 222 (602) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~ 222 (602) ++..+... .+...+ .......+++...|++++.. T Consensus 151 ~~d~~~~l-----------------~~~~~~-----------------------------~~~~g~~l~~~k~i~~~~~~ 184 (491) T protein:vir:10 151 VYDPENQL-----------------RFRSKD-----------------------------HWMQGEELPARKFLVPRQEA 184 (491) T ss_pred eeccCCce-----------------EEecCC-----------------------------CCCCcceecCCCEEEEEecC Confidence 65332110 000000 00111234555556666555 Q ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCc Q lcl|NC_021537. 223 PLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFV 302 (602) Q Consensus 223 ~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~ 302 (602) ..+.+||.|.+..+...........++...|...-++|--+.+++.+ .++++++.+.+.+.++.. 
++ .++++.|+ T Consensus 185 ~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~-a~~~ek~~l~~al~~~~~--~a--~~viP~~~ 259 (491) T protein:vir:10 185 TYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRS-ASDGEKNLLLDCLEDMVQ--DA--VAVVPDDS 259 (491) T ss_pred CCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCC-CCHHHHHHHHHHHHHHhc--Cc--EEEecCCc Confidence 56778999999999999999999999999999999999888887643 577788888887777643 22 23444444 Q ss_pred cceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 303 DDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAK 382 (602) Q Consensus 303 ~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ 382 (602) +. +|...+.....-.-|.++.++..++|+.+.-= -.+... .+++++..+..... ....++-.++. T Consensus 260 ~i------------e~~ea~~~~g~~~~y~~li~~~d~~Isk~iLG-qtlTt~-~~gs~a~~~vh~~v-~~di~~~D~~~ 324 (491) T protein:vir:10 260 SI------------EIKEAAGKTGSADVYERLLHFCRGEVSIALLG-QNQTTE-ATSTRASAQAGLEV-TDDIRDGDKAV 324 (491) T ss_pred ee------------EEEecCCCCCChhHHHHHHHHHHHHHHHHHhh-hhcccC-cccchhHHHHHHHH-HHHHHHHHHHH Confidence 33 33322211111123777778888888877421 111211 34455555444433 35566777888 Q ss_pred HHHHHhhhcCCcc-----ccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCc-ccHHHHHHHhCCCCCCCCcccccc Q lcl|NC_021537. 383 FSARLYKIIHQDA-----LDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGV-GTVNEAREELDLAPFEDDRGDMTL 456 (602) Q Consensus 383 ie~~ln~~Ll~~~-----~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~-~T~NE~R~~~Gl~p~~~g~~d~~~ 456 (602) ++..||+ |+.+. ......+|+|.. .. .+.+..++.++++++.|+ ++..++|+.+|+|+-+.+...... 
T Consensus 325 i~~tln~-li~~l~~~N~~~~~~p~f~~~~--~~---e~~~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~~~~~~~~~~~ 398 (491) T protein:vir:10 325 VSEAMNM-LIRWICDLNFDGADRPVFDMWE--QE---QVDEIQAGRDQKLTQAGARFTPAYFKRAYNLQDGDLDERPLPV 398 (491) T ss_pred HHHHHHH-HHHHHHHhcCCCCCcceEEecC--cC---chhHHHHHHHHHHHhCCCcCCHHHHHHHhCCCCCCcCcccccc Confidence 8888885 44321 122233445543 22 233567788999999998 777889999999765443221111 Q ss_pred ccccccccccccCCCcCcccccccccccccccccccccccccccccchhhhhcchhhhhhhe-ecccccEEEEEEecccC Q lcl|NC_021537. 457 SEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGL-YDFGERELYLSFKRESG 535 (602) Q Consensus 457 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~-yd~~~~~l~~~f~~~~~ 535 (602) ..+..... .. ...... ...+..+................++ ..++. ..+.+.+ |+ +|.. T Consensus 399 ~~~~~~~~-~~----~~~~~~--~~~~~~d~~~~~~~~~~~~~~~~~~-~~~i~-~~l~~~~s~~--------e~~~--- 458 (491) T protein:vir:10 399 SAVDTVGA-AS----FAEFEA--PDQDALDAALNTLSARDLNADAQAL-VAPLL-KRIANGASAD--------ELLG--- 458 (491) T ss_pred CCCCCccc-cc----ccccCC--CCCCchHHHHHHHHHHHHHHHHHHH-HHHHH-HHHHhcCCHH--------HHHH--- Confidence 11110000 00 000000 0000000000000000111111111 01111 1122211 22 2211 Q ss_pred CcceeeeccCCHHHHHHHhCCCccchhhhhhhcccccc Q lcl|NC_021537. 
536 QNSLYVYVDVPAAVWSALVSAPSAGSYHYSEIRLQYGY 573 (602) Q Consensus 536 ~~~~y~y~~v~~~~~~~~~~a~s~g~~~~~~i~~~~~~ 573 (602) ...=.|-+.+...+.++|..-- |.+++.++..- T Consensus 459 -~L~~l~~~~d~~~l~~~l~~a~----~~A~l~G~~~a 491 (491) T protein:vir:10 459 -MLAELYPSLDADALQERLARAI----FVANLWGRLHA 491 (491) T ss_pred -HHHHHhhcCCHHHHHHHHHHHH----HHHHHhhhccC Confidence 0011334666666665553221 33344333111 No 131 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=99.72 E-value=3.1e-16 Score=105.58 Aligned_cols=451 Identities=11% Similarity=0.048 Sum_probs=220.7 Q ss_pred CCCCcccccccc-hhhhcccCccccCC-----------CCHHHHHHH----HhhhHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021537. 1 MSKAEETTQLDE-RHIATDVGRGIQPP-----------YNPETLAAF----QELNETHQACIRKKSRYEAGYGFEIVAHP 64 (602) Q Consensus 1 ~~k~~~~~~~~~-~~~~~~~~~~i~p~-----------~~~~~l~~~----~~~~~~v~~cI~~ia~~ia~~~~~i~~~~ 64 (602) +.+.+..+-... +.+.....+++.|. -|+.....+ .+..+.|.+|++.+...|.+++|.|.+.. T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L~~dm~~~D~hi~s~l~~Rk~av~~~~w~I~p~~ 96 (512) T protein:vir:19 17 EMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADLAFDMEEKDTHLFSELSKRRLAIQALEWRIAPAR 96 (512) T ss_pred ccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCCCceEecCC Confidence 222221111111 22333333333321 122222222 23578999999999999999999998753 Q ss_pred CCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC---CceEEEEEeCccc Q lcl|NC_021537. 65 SADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD---GTPVGLAHVPAAT 141 (602) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~---G~~~~L~~l~p~~ 141 (602) +.+ ..+.+..+.+...+.. ..+|.+++..++ |.+.+|-+++|+++... 
..|..|.++|+.+ T Consensus 97 ~~~--~~~~~~a~~v~~~l~~-------------~~~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~~ 160 (512) T protein:vir:19 97 DAS--AQEKKDADMLNEYLHD-------------AAWFEDALFDAG-DAILKGYSMQEIEWGWLGKMRVPVALHHRDPAL 160 (512) T ss_pred CCC--HHHHHHHHHHHHHHhc-------------CCCHHHHHHHHH-hhhhhcceeeeeEeeeeCCceeeeeeeeecccc Confidence 321 1111222222222211 124677777665 47789999999998543 3577888888877 Q ss_pred ccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCC Q lcl|NC_021537. 142 VRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNP 221 (602) Q Consensus 142 v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~ 221 (602) ++...+.... .....+ ...-..+++...|..++. T Consensus 161 f~~~~~~~~~-----------------lr~~~~-----------------------------~~~G~~l~~~k~i~~~~~ 194 (512) T protein:vir:19 161 FCANPDNLNE-----------------LRLRDA-----------------------------SYHGLELQPFGWFMHRAK 194 (512) T ss_pred ceeccCCCcE-----------------EEecCC-----------------------------CCCceeecCCceEEEecc Confidence 7644321100 000000 001112334434444444 Q ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCC Q lcl|NC_021537. 222 SPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEF 301 (602) Q Consensus 222 ~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g 301 (602) ...+.++|.+.+..+...........++...|...-++|--+-+++.+ .++++++.|.+.+.++.. 
++ .++++.| T Consensus 195 ~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~-a~~~ek~~L~~al~~~~~--~a--~~iiP~~ 269 (512) T protein:vir:19 195 SRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPTG-STNREKATLMQAVMDIGR--RA--GGIIPMG 269 (512) T ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCCC-CCHHHHHHHHHHHHHHhh--Cc--EEEecCC Confidence 456778999999999999999999999999999999999888777643 577788888888777642 22 3344444 Q ss_pred ccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hccccCCccCHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 302 VDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTSTSNRANSKEQTREFAKGIIEPEQ 380 (602) Q Consensus 302 ~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~sn~e~~~~~f~~~~l~P~~ 380 (602) ++ ++|...+.. ...-|.++.++..++|+.+.--. .+. +..++++++..+.... .....+.-.+ T Consensus 270 ~~------------ie~~ea~~~--~~~~y~~li~~~d~~Isk~iLGq-tlTs~~g~~Gs~a~~~vh~e-v~~di~~aDa 333 (512) T protein:vir:19 270 MT------------LDFQSAADG--QSDPFMAMIGWAEKAISKAILGG-TLTTEAGDKGARSLGEVHDE-VRREIRNADV 333 (512) T ss_pred ce------------EEEeecCCC--CHHHHHHHHHHHHHHHHHHHhhh-hhcccccccchhhHHHHHHH-HHHHHHHHHH Confidence 33 334333222 22458888899999999873111 111 1122334554443333 3466778899 Q ss_pred HHHHHHHhhhcCCcccccc---------ceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCc Q lcl|NC_021537. 381 AKFSARLYKIIHQDALDVD---------EWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDR 451 (602) Q Consensus 381 ~~ie~~ln~~Ll~~~~~~~---------~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~ 451 (602) +.++..||+.|+.+.-... 
--+++|+...- .|.+..++.+.++...--++..++|+.+|+|.-.+++ T Consensus 334 ~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~e~----eDl~~~a~~~~~l~~G~~i~~~~i~e~~Gip~~~~~e 409 (512) T protein:vir:19 334 GQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTSEA----GDITALSDAIPKLAAGMRIPVSWIQEKLHIPQPVGDE 409 (512) T ss_pred HHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCCh----hhHHHHHHHHHHHhcCCCCCHHHHHHHhCCCCCCCcc Confidence 9999999988776432111 12466765543 3556667777777655556778899999996433332 Q ss_pred cccccc-cccccccccccCCCcCcccccccccccccccccccccccccccccchhh--hhcchhhhhhheecccccEEEE Q lcl|NC_021537. 452 GDMTLS-EFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQ--TTFSSSNLDEGLYDFGERELYL 528 (602) Q Consensus 452 ~d~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~--~~v~ss~~~~~~yd~~~~~l~~ 528 (602) +.+.. +.....+........ .... + .+.... +...........+.. .++ -..+.+..|+.-...|.= T Consensus 410 -~~~~~~~~~~~~~~~~~~~~~----~~~~--~-~~~~~d-~~~~~~~~~~~~~~~~~~~i-~~~~~~~s~ee~~~~L~~ 479 (512) T protein:vir:19 410 -AVFTIQPVVPDNGSQKEAALS----AEDI--P-QEDDID-RMGVSPEDWQRSVDPLLKPV-IFSVLKDGPEAAMNKAAS 479 (512) T ss_pred -ccccCCCcccccccccccccc----ccCC--C-chhhHh-HHhhhHHHHHHHHHHHHHHH-HHHHHhCCHHHHHHHHHH Confidence 22211 111111100000000 0000 0 000000 000000000000000 000 011111122211110100 Q ss_pred EEecccCC-------------cceeeeccCCHHH Q lcl|NC_021537. 529 SFKRESGQ-------------NSLYVYVDVPAAV 549 (602) Q Consensus 529 ~f~~~~~~-------------~~~y~y~~v~~~~ 549 (602) -|- .... ..+.=|.+|-.+. 
T Consensus 480 l~~-~ld~~~l~~~l~~a~~~A~l~G~~~~~~e~ 512 (512) T protein:vir:19 480 LYP-QMDDAELIDMLTRAIFVADIWGRLDAAADH 512 (512) T ss_pred Hhc-cCCHHHHHHHHHHHHHHHHHhhhhhhhccC Confidence 000 0000 0011111111111 No 132 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=99.69 E-value=1.2e-15 Score=102.43 Aligned_cols=409 Identities=11% Similarity=0.028 Sum_probs=212.5 Q ss_pred CCCCccccc----------------ccc--hhhh-cccCccccCCCC--------HHHHHHHHhhhHHHHHHHHHHHHhh Q lcl|NC_021537. 1 MSKAEETTQ----------------LDE--RHIA-TDVGRGIQPPYN--------PETLAAFQELNETHQACIRKKSRYE 53 (602) Q Consensus 1 ~~k~~~~~~----------------~~~--~~~~-~~~~~~i~p~~~--------~~~l~~~~~~~~~v~~cI~~ia~~i 53 (602) |+|+.+..+ +.. ..+. ..+++...+..+ +...+++. ..+.|.+|++.+...| T Consensus 1 m~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~-~D~hi~s~l~~Rk~av 79 (448) T protein:vir:79 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKML-SDGTVKNALNYIFGRI 79 (448) T ss_pred CCCCCCCCccccCcccccccccchhhhhhhhhhcccccccccccchhHhhccccchHHHHHHh-hChHHHHHHHHHHHHH Confidence 555444221 000 1111 112222222222 22344554 4899999999999999 Q ss_pred ccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC--CCc- Q lcl|NC_021537. 54 AGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG--DGT- 130 (602) Q Consensus 54 a~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~--~G~- 130 (602) .+++|.|.+..+. ..+....+.+...+..+... ....+|.+++..++. .+.+|.+++|+++.. +|. 
T Consensus 80 ~~~~w~v~p~~~~---~~~~~~ae~v~~~l~~~~~~-------~~~~~f~~~~~~~ld-a~~~G~s~~Eivw~~~~~g~~ 148 (448) T protein:vir:79 80 RSAKWYVEPASTD---PEDIAIAAFIHAQLGIDDAS-------VGKYPFGRLFAIYEN-AYIYGMAAGEIVLTLGADGKL 148 (448) T ss_pred hcCCceEecCCCC---HHHHHHHHHHHHHhhhhhhh-------hccCCHHHHHHHHHH-hhhhcceeEEEEeeecCCCce Confidence 9999999753221 12222222222222111110 113467777766544 678999999999853 454 Q ss_pred -eEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEE Q lcl|NC_021537. 131 -PVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKN 209 (602) Q Consensus 131 -~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 209 (602) +..|.+.++.+++- |.....++.......+ ...+. ..+..... T Consensus 149 ~~~~l~~r~~~~~~~-----------------------f~~~~d~~l~~~~~~~-----------~~~~~--~~~~~~~~ 192 (448) T protein:vir:79 149 ILDKIVPIHPFNIDE-----------------------VLYDEEGGPKALKLSG-----------EVKGG--SQFVSGLE 192 (448) T ss_pred ecccccccCCccccc-----------------------eeeecCCceEEeecCC-----------ccccc--ccCCCccc Confidence 33455555543320 0000000000000000 00000 00111234 Q ss_pred echhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc-cCCHHHHHHHHHHHHHhhc Q lcl|NC_021537. 210 GPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG-TLSEDSKEDLRNLMDNLKG 288 (602) Q Consensus 210 ~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~~~~~~l~~~~~~~~g 288 (602) +|..-++|.... 
..+.++|.+.+..|...........++...|...-++|--+.+.+.+ ..++++++.+.++..+..+ T Consensus 193 lP~~~~i~~~~~-~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~ 271 (448) T protein:vir:79 193 IPIWKTVVFLHN-DDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQ 271 (448) T ss_pred cccceEEEEecC-ccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhc Confidence 577778888654 45678999999999999999999999999999999999888887643 3456777788888888776 Q ss_pred ccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHH Q lcl|NC_021537. 289 SRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQT 368 (602) Q Consensus 289 ~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~ 368 (602) +.+++.+ ++.|++..-+ ..+... ..+.++.++..++|+.+..= ..+-...++++++.....- T Consensus 272 g~~a~~i--iP~~~~ie~~------------ea~~~~---~~~~~~i~~~d~~Isk~iLG-qtlTs~~~~g~~~~~~~~~ 333 (448) T protein:vir:79 272 KPRHGII--LPDDWKFDTV------------DLKSAM---PDAIPYLTYHDAGIARALGI-DFNTVQLNMGVQAINIGEF 333 (448) T ss_pred CCceEEE--ecCCceEEEE------------ecCCCc---ccHHHHHHHHHHHHHHHHhh-hhhccccccchhhhhhhhH Confidence 6666543 4555543332 222111 22446777778888776632 1222122223333332222 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcCCcccc-----ccc-eEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHh Q lcl|NC_021537. 369 REFAKGIIEPEQAKFSARLYKIIHQDALD-----VDE-WTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREEL 442 (602) Q Consensus 369 ~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~-----~~~-~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~ 442 (602) .....+.++--++.++..||+.|+.+.-. ... 
-+|+|+..+ ..|.+..++++++++..+-...+-.|+.+ T Consensus 334 ~~v~~~~~~aDa~~i~~tln~~li~~l~~lNfg~~~~~P~~~f~~~e----~~Dl~~~a~~~~~l~~~~~~~~~~~~~~~ 409 (448) T protein:vir:79 334 VSLTQQTIISLQREFASAVNLYLIPKLVLPNWPSATRFPRLTFEMEE----RNDFSAAANLMGMLINAVKDSEDIPTELK 409 (448) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCCcEEEecCCC----hHHHHHHHHHhhhhhccchhhHHHHHHhh Confidence 23445667888899999999887754311 112 256676543 33667778889998887655545567778 Q ss_pred CCC-CCCCCccccccccccccccccccCCCcCcccccccccccccccccccccc Q lcl|NC_021537. 443 DLA-PFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSV 495 (602) Q Consensus 443 Gl~-p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (602) |+| |.++.. .......+ +.. .....|+..--.=.+... T Consensus 410 ~~p~~~~~~~-------~~a~~~~~-~~~-------~~~~~~~~~~~~~~~~~~ 448 (448) T protein:vir:79 410 ALIDALPSKM-------RRALGVVD-EVR-------EAVRQPADSRYLYTRRRR 448 (448) T ss_pred cCCCCCCCcc-------ccccCCCC-ccc-------ccccCCccccchhhcccC Confidence 876 222211 00000000 000 000000000000000011 No 133 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=99.67 E-value=4.8e-16 Score=104.54 Aligned_cols=389 Identities=12% Similarity=0.029 Sum_probs=207.6 Q ss_pred CCCCcccccccchhhhccc------CccccCC-------CCH----HHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDV------GRGIQPP-------YNP----ETLAAFQELNETHQACIRKKSRYEAGYGFEIVAH 63 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~------~~~i~p~-------~~~----~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~ 63 (602) |.-+... ...+++.+.. .+++.|. -++ ...+++.+..+.|.+|++.+...|.+++|+|.+. 
T Consensus 5 ~~~~p~~--~~~~~~~~~~~~~~~~~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~w~V~p~ 82 (446) T protein:vir:98 5 VRNAPTP--AIRRRTIYAMEHLGLATSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKVGPYQHG 82 (446) T ss_pred ccCCCch--hhhhhhhhccccchhhcccCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCCceecCc Confidence 3222211 1111111111 1122221 122 3345666778999999999999999999999752 Q ss_pred cCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC-Cc--eEE----EEE Q lcl|NC_021537. 64 PSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD-GT--PVG----LAH 136 (602) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~-G~--~~~----L~~ 136 (602) +.+..+.+...++. +.+ ++....+.|.+.+|.++.|+++... |. |.. +.. T Consensus 83 --------~~~~a~~v~~~l~~--------------~~~-~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~ 139 (446) T protein:vir:98 83 --------DKRIKKFIDDQLRN--------------RAK-TWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVN 139 (446) T ss_pred --------cHHHHHHHHHHHhh--------------cCc-hhHHHHHHHHHhhCceeeeEEEeecccccccchhhccccc Confidence 11222222222211 111 3333446788899999999998643 21 111 111 Q ss_pred eCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEE Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELI 216 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~evi 216 (602) ..|..++...+... ....... ....|...+.... ..+ ....++. ......+....+|....+ T Consensus 140 ~~~~~~r~~~~~~~------~~~~~~~--~~~~~~~~~~~~~------~~~--~~~~~~~--~~~~~~g~~~~iP~~kfi 201 (446) T protein:vir:98 140 YHPLQVMLIANDNG------RIVDGDT--VTASQYKSGYWVP------LPP--YRIGDPP--KKVDVVGSHVRLPSHKRL 201 (446) T ss_pred cccccceeeeccCC------ccccccc--cchhhcccccccC------ccc--chhhhhh--hhcccCcccccccccceE Confidence 11111111000000 0000000 0000000000000 000 0000000 001122334568999999 Q ss_pred EecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCC-----HHHHHH---H-HHHHHHhh Q lcl|NC_021537. 
217 FLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLS-----EDSKED---L-RNLMDNLK 287 (602) Q Consensus 217 H~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~-----~~~~~~---l-~~~~~~~~ 287 (602) |+++....+.+||.|.+..+...........++...|...-++|--+-+.+.+... ++..+. + ++.++.+. T Consensus 202 ~~~~~~~~~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~ 281 (446) T protein:vir:98 202 FINYNTKGNNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALR 281 (446) T ss_pred EEEecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHH Confidence 99888777889999999999999999999999999999999999888888644321 111111 1 22334443 Q ss_pred ccc-ccCcce---eccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccc--cCCc Q lcl|NC_021537. 288 GSR-YRTAIL---EVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTS--TSNR 361 (602) Q Consensus 288 g~~-nag~~~---~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~--~~~~ 361 (602) ... .++.++ .++.|++.. +..-+... -..|.++.++..++|+.+.....-.+|... .+++ T Consensus 282 ~~~~da~~ii~~~~~P~g~eie------------~~ea~~~~--~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ 347 (446) T protein:vir:98 282 RLSTDSGLVLTQLSKEQPVQVG------------ALTTGNNF--SDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTG 347 (446) T ss_pred hccccceeeeecccCCCCceEE------------eeccccCC--hhhHHHHHHHHHHHHHHHHhcccccccccccccchh Confidence 222 222222 113343332 22222221 234888889999999999877654455432 3444 Q ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcccccc-----------ceEEEeccchhcchhHHHHHHHHHHHHHHhC Q lcl|NC_021537. 362 ANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVD-----------EWTIDFELRGAEQPEQDAKMAEQRVRAMRLA 430 (602) Q Consensus 362 sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~-----------~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~ 430 (602) +-.+..... ..+.++-.++.+++.||+.|+.+.-... .-+++|++.+ ..|.+..+++++++++. 
T Consensus 348 ala~vh~~V-~~d~~~aDa~~i~~tln~~Li~~l~~lNf~~~~~~~~~~~~~~~~~~~e----~eDl~~~a~~~~~L~~~ 422 (446) T protein:vir:98 348 RASEIQLEL-FDGKINSIFDTVIHAFTEQVIGNLIRLNFDPALYPLASNTGYITRLPGR----ATDLAALVEAIKQMHDM 422 (446) T ss_pred hhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccceeccCC----hhhHHHHHHHHHHHHhC Confidence 444444443 3566788899999999988765432111 0123455443 34677788999999999 Q ss_pred CcccH---HHHHHHhCCCCCCCCcc Q lcl|NC_021537. 431 GVGTV---NEAREELDLAPFEDDRG 452 (602) Q Consensus 431 G~~T~---NE~R~~~Gl~p~~~g~~ 452 (602) |.+++ +.+|+.+|+|+-. ++. T Consensus 423 G~~~p~~~~~ire~~giP~~~-~~~ 446 (446) T protein:vir:98 423 GFLVDGDKDHIRSITGLPDAI-SST 446 (446) T ss_pred CccccccHHHHHHHhCcCCCC-CCC Confidence 99875 4499999996543 222 No 134 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=99.48 E-value=3e-13 Score=89.26 Aligned_cols=388 Identities=11% Similarity=0.070 Sum_probs=167.4 Q ss_pred CCCCcccccccchh--hh---cccC--------c-cccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021537. 1 MSKAEETTQLDERH--IA---TDVG--------R-GIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSA 66 (602) Q Consensus 1 ~~k~~~~~~~~~~~--~~---~~~~--------~-~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~ 66 (602) |..+.+...+.-.+ +. ...| . ++....+...|.++++.|.+++.||+.+++.+-.-...|....+. T Consensus 9 ~~~~~~~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr~~~ia~~iVd~~~d~~~~~~~~i~~g~~~ 88 (449) T protein:vir:10 9 VNHALNDARMARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYRRGGIAHGAVEKLVGKCWQTNPEIIEGDDA 88 (449) T ss_pred HhhhcchhHHHHHHHHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHhcCchhHHHHHhhhhhhhhcCcccccCccc Confidence 33333322211111 11 1111 0 123346788999999999999999999998763322223221111 Q ss_pred CCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEee-CCCC---------ceEEEEE Q lcl|NC_021537. 
67 DEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILV-EGDG---------TPVGLAH 136 (602) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r-~~~G---------~~~~L~~ 136 (602) ........+...+++.+. ..-|..+.....+ -.++|-+++.+.- ++.+ .+..|.+ T Consensus 89 ----~~~~~~~~~e~~~~~l~~----------~~~~~~l~ea~~~-~rl~Gga~i~i~v~d~~~l~~Pl~~~~~i~~i~v 153 (449) T protein:vir:10 89 ----DDSEDETSWEKKSKQVFT----------NRLWRSFAEADRR-RLVGRYAGILLHIRDEKDWNLPATKGRGLQKVSV 153 (449) T ss_pred ----cchhhhHHHHHHHHHHHH----------HHHHHHHHHHHHh-hhccCcEEEEEEecCCCCCCcccccCcceeeEEe Confidence 111111111111111100 0012223333333 3467877776543 3221 1222222 Q ss_pred eCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEe----cCceeEEech Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVAS----DAGELKNGPA 212 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~----~~~~~~~~~~ 212 (602) +....+.+... ...|..+.+-.++.+++.. ..+....+.+ T Consensus 154 ~~~~~i~~~~~------------------------------------~~dp~sp~yg~P~~y~v~~~~~g~~~~~~~iH~ 197 (449) T protein:vir:10 154 SWAGSLKVAEW------------------------------------DTGINSKTYGQPKLWKYTERLPNGSSRRVDIHP 197 (449) T ss_pred eccccCChhhh------------------------------------hcCCCCCCCCCceEEEEeeeccCCCccceeecc Confidence 22222221100 0011111222222232221 1123346788 Q ss_pred hHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHH-HHHHHHHHhcC-----------CCceEEEeccccCCHHHHHHHH Q lcl|NC_021537. 213 NELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAK-EWNHDVFDNLG-----------IPHYAVKVTGGTLSEDSKEDLR 280 (602) Q Consensus 213 ~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~-~~~~~~f~ng~-----------~p~gil~~~~~~~~~~~~~~l~ 280 (602) +.||||-.. +.-|.|.++.+.+.+.....+. .+...+++|-. ...++.... +...++..+++. 
T Consensus 198 SRl~~~~~~----~~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~-~~~~e~~~~~~~ 272 (449) T protein:vir:10 198 DRVFILGDY----SEDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLY-GVSIDELQDKFN 272 (449) T ss_pred ceeEeecCC----CCCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHh-hCCchHHHHHHH Confidence 889988432 3348888888877553332221 22222222211 111111111 112333344454 Q ss_pred HHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hccccC Q lcl|NC_021537. 281 NLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTSTS 359 (602) Q Consensus 281 ~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~ 359 (602) +..+.+..+.+ ..+ +..+.++.... .+++ .+.+......++||++-+||...| |..- + T Consensus 273 ~~~~~~~~~~~--~~~-i~~~~d~~~~~----------~~~s-------gl~d~l~~~~q~iaaa~~IP~t~L~Gqsp-~ 331 (449) T protein:vir:10 273 EVAGEINRGND--VLM-TTQGATVTPLV----------TSVA-------DPTATYNVNLQTAAAGVDIPTRILIGNQQ-A 331 (449) T ss_pred HHHHHHhccch--hee-ecCCcceEEEe----------cccC-------ChhHHHHHHHHHHHHHhCCCeeeeeccCc-c Confidence 44444332222 122 22222222211 1111 223455667788999999997655 5544 3 Q ss_pred CccCHHHHHHHHHHH------HHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHH---HHHHHHHHHHHhC Q lcl|NC_021537. 360 NRANSKEQTREFAKG------IIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDA---KMAEQRVRAMRLA 430 (602) Q Consensus 360 ~~sn~e~~~~~f~~~------~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~---~~~~~~~~~~~~~ 430 (602) ..++.+ -...|+.. -|+|.++.+-+.|-..-+.. 
...+|.|+|+--.-+.-++.+ +..+++.++++++ T Consensus 332 glnst~-D~~nyyd~i~~~Q~~l~p~le~l~~~l~~s~~g~--~~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~a 408 (449) T protein:vir:10 332 ERSSTE-DQKYFNARCQSRRVDLSFEIEDFCDKLIELKIID--AVAKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGS 408 (449) T ss_pred ccccch-hHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCC--CCCceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHc Confidence 333333 33445432 36677766666554332211 123577777633222222222 2336677788877 Q ss_pred C---cccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCcccccccc Q lcl|NC_021537. 431 G---VGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKA 481 (602) Q Consensus 431 G---~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (602) | +++.+|+|+.+|++|..+...+ ....++....++..+ T Consensus 409 g~~~~~~~~EiR~~~~~~~~~~~~~~-------------~e~~de~~~~~d~~a 449 (449) T protein:vir:10 409 GDNPAFSREEIRTAAGYDNDDEEPLG-------------EEDGDEEDKATDSAA 449 (449) T ss_pred cccCCcCHHHHHHHhcccCCCCCCCC-------------CCCCccccccCCcCC Confidence 7 9999999999999886432100 000000000000000 No 135 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=99.38 E-value=1.4e-12 Score=85.51 Aligned_cols=325 Identities=12% Similarity=-0.009 Sum_probs=157.1 Q ss_pred EEEEeeCCCCc---eEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecc Q lcl|NC_021537. 120 ALEILVEGDGT---PVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKE 196 (602) Q Consensus 120 ~~~i~r~~~G~---~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~ 196 (602) +.|+++...|. |..|.+.|+.++.-- ...-++....... 
T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f-----------------------~~~~~~~l~~~~~--------------- 42 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRF-----------------------DVAPDGGLVAIEQ--------------- 42 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeee-----------------------eeccCCceeEEEe--------------- Confidence 78898876543 556767776544310 0000000000000 Q ss_pred cceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecccc-C---- Q lcl|NC_021537. 197 TGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGT-L---- 271 (602) Q Consensus 197 ~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~---- 271 (602) ....+.....+|....|++++....+.+||.+.+..|...........++...|...-+.|--+.+.+.+. . T Consensus 43 ---~~~~g~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d 119 (355) T protein:vir:78 43 ---WGVFGKATVRIPVDRLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARD 119 (355) T ss_pred ---cCCCCCCcceeccCCEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccch Confidence 00011223456666667666665567789999999999999999999999999999875554444443211 1 Q ss_pred -------CHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHH Q lcl|NC_021537. 272 -------SEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAK 344 (602) Q Consensus 272 -------~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~ 344 (602) +++.++.+.........+.+++. +++.|+++.-+. -+ .....|.++.++..++|+. T Consensus 120 ~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~--iip~g~~ie~~e------------a~---g~~~~~~~~i~~~d~~Isk 182 (355) T protein:vir:78 120 TARAEQWLNDQKEEGLQLAKEFRAGEAAGG--YIPHGANFTLTG------------VQ---GKLPEMDGPIRYHDEQIAR 182 (355) T ss_pred hhhHHHHHHHHHHHHHHHHHHhhCCcceeE--eecCCceEEEee------------cC---CCcccHHHHHHHHHHHHHH Confidence 22334445555555544444443 344444433222 11 1223466777888889988 Q ss_pred HhcCChHHhhcc--ccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccc-----ccc-ceEEEeccchhcchhHH Q lcl|NC_021537. 
345 VHGVPPVLINVT--STSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDAL-----DVD-EWTIDFELRGAEQPEQD 416 (602) Q Consensus 345 ~fgVPp~~lg~~--~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~-----~~~-~~~~~f~~~~~~~~~~d 416 (602) ++.-. .+.... ..++++-.+... ....+.+.-.++.+++.||+.|+...- ... ..+|+|+. .. .+ T Consensus 183 ~iLGq-tlTs~~~~~gGS~Alg~vh~-~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~P~~~~~~--~~---~~ 255 (355) T protein:vir:78 183 AVLAH-FLTLGGDKSTGSYALGDTFA-SFFTGSLNAVMKHIADVTQQHVVEDLVDQNWGPEEPAPRLVPAQ--LG---KE 255 (355) T ss_pred HHhhh-hhccccCCccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecC--cC---hh Confidence 87443 232211 224445444433 334667788889999999987765321 111 12455543 22 23 Q ss_pred HHHHHHHHHHHHhCCcccHHH-----HHHHhCCCCCCCCcccccccccccc-ccccccCCCcCccccccccccccccccc Q lcl|NC_021537. 417 AKMAEQRVRAMRLAGVGTVNE-----AREELDLAPFEDDRGDMTLSEFEAE-FGADASDGDAEAMLTRSKAAPPLENKIG 490 (602) Q Consensus 417 ~~~~~~~~~~~~~~G~~T~NE-----~R~~~Gl~p~~~g~~d~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (602) .+..+++++++++.|++.+++ +|+.+|+|.-+.++ +......... .......++.........+ ........ T Consensus 256 ~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a-~~~~a~~~ 333 (355) T protein:vir:78 256 QPVTAEAIRALVECGAFTADPELEKDLRARYGLPAPAERD-DGADAAAAKAAGRRRAKRLPGQRQGAALPS-RSPRADPP 333 (355) T ss_pred HHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCC-cccCCccccccccccccccCCccccccccc-cCCCCCCh Confidence 345678899999999987754 69999996433332 2111111100 1111111100000000000 00000000 Q ss_pred ccccc-cccccccchhhhhcchhhhhhhe Q lcl|NC_021537. 491 ERDSV-DVDVSKDPIEQTTFSSSNLDEGL 518 (602) Q Consensus 491 ~~~~~-~~~~~~~~m~~~~v~ss~~~~~~ 518 (602) .+... ..+...+.. 
+ .....| T Consensus 334 ~~~~~~~~~~~~~~~-~------~~~~~~ 355 (355) T protein:vir:78 334 RRRGPLRRRPRHPAH-R------RCAPDG 355 (355) T ss_pred hhhHHHHHHhhcccc-C------CCCCCC Confidence 00000 001011000 0 011111 No 136 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=99.24 E-value=3.5e-10 Score=72.43 Aligned_cols=509 Identities=13% Similarity=0.088 Sum_probs=209.1 Q ss_pred CCCCcccccccchhhhcccCccccCC-----CCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEE------------e Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPP-----YNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVA------------H 63 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~-----~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~------------~ 63 (602) -.+.+-+-.++ |.........+- +=.-.|+.+++ .|-.++|+.++++.+..-..++.. . T Consensus 83 ~~~~~~~~~~~---~~~~~~~~l~~~~~~~F~Gy~~la~laQ-~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~ 158 (698) T protein:vir:10 83 RERRAASYALD---FNGTSMDALSFVTSSGFPGFPTLVLLAQ-LPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAA 158 (698) T ss_pred cccchhhhhhc---ccccccccchhhhccCcchHHHHHHHhh-ccchhhHHHHHHHHhhcccceeccccchhhhhhcccc Confidence 11111111111 111111111111 11245666666 577899999999988776444321 1 Q ss_pred cCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC---------------- Q lcl|NC_021537. 64 PSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG---------------- 127 (602) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~---------------- 127 (602) ..+..+..+.++.+++...+++.+. 
.+-++..+..--+||-+.+.+.-++ T Consensus 159 ~~~~~~~~d~dqi~~L~~e~erl~V--------------~~~l~eai~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I 224 (698) T protein:vir:10 159 GGNAASTSDGDQLKQINDEIERLRI--------------RDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTV 224 (698) T ss_pred cccccccccHHHHHHHHHHHHHHHH--------------HHHHHHHHHhcccccceEEEEEeecCccccccccccccccc Confidence 1122223334566666665555432 2233334444456777765554322 Q ss_pred -CCceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCce Q lcl|NC_021537. 128 -DGTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGE 206 (602) Q Consensus 128 -~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 206 (602) +|....|..|+|..|.+..... ..+.....-.+.|++|.+ . T Consensus 225 ~kGslKGL~ViDp~~vtP~~~n~------~dP~spdfgkP~~y~V~G--------------------------------~ 266 (698) T protein:vir:10 225 PKGSFQGLRVVEPYWVTPNNYNS------INPVADDFYKPSTWWMIG--------------------------------S 266 (698) T ss_pred cCccceeeeeecccccccchhhh------ccchhhccCCCceEEEec--------------------------------c Confidence 2334446666666665532211 011111111222222221 1 Q ss_pred eEEechhHEEEecCC------CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHH Q lcl|NC_021537. 207 LKNGPANELIFLPNP------SPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLR 280 (602) Q Consensus 207 ~~~~~~~eviH~r~~------~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~ 280 (602) ++..+.++.|... .+...+.|+|-.+.+...+................-.. .++.+--...++......+. 
T Consensus 267 --~IH~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~-~~l~~dla~aL~~g~~~~l~ 343 (698) T protein:vir:10 267 --EVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSV-SGILMDLAQALTPGANVDLS 343 (698) T ss_pred --eecceeEEEecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHhhH-HHHHHHHHHhcCChhhHHHH Confidence 2334444434322 12234579999999988887766665555554433221 11110000011111111122 Q ss_pred ---HHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hcc Q lcl|NC_021537. 281 ---NLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVT 356 (602) Q Consensus 281 ---~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~ 356 (602) +.++.++ +|.|.+++-...-++.... .+|+-.. +......+.||.+-+||...| |.+ T Consensus 344 ~R~eli~~~R--sn~G~~llDk~~Eefeq~s----------t~lSGLd-------dVi~qf~q~VAgaa~IPltkLfGqS 404 (698) T protein:vir:10 344 MRAELINRYR--DNRNILFLDKATEEFFQFN----------TPLSGLD-------ALQAQAQEQMSAVSHIPLIKLLGIT 404 (698) T ss_pred HHHHHHHHhc--CccceEEEecCCcceEEEe----------cCcCCHH-------HHHHHHHHHHHhhhcCchhhhhccC Confidence 3344554 4445444431233333322 2233322 333344579999999997765 554 Q ss_pred ccCCccCHHHHHHHHHHH-------HHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHH--H---HHHHHHH Q lcl|NC_021537. 357 STSNRANSKEQTREFAKG-------IIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQD--A---KMAEQRV 424 (602) Q Consensus 357 ~~~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d--~---~~~~~~~ 424 (602) -.|=.++.|.-..+||.. -|+|.++++-+.|-+..+.. ....+.++|+ .+..+... + ++.++.. 
T Consensus 405 PkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~--idp~i~~~fn--PL~qmtd~EkAeI~~k~A~~d 480 (698) T protein:vir:10 405 PTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGA--VDPSIKWQWN--ALRELDDLEVAEARYKQAQSD 480 (698) T ss_pred CcccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CCCcceEEeC--CCCCcCHHHHHHHHhhhhHHH Confidence 444346667666777653 57888888877776665433 2244555555 33333221 1 3345566 Q ss_pred HHHHhCCcccHHHHHHHhCCCCCCCCc------ccccccc--cccc-----ccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 425 RAMRLAGVGTVNEAREELDLAPFEDDR------GDMTLSE--FEAE-----FGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 425 ~~~~~~G~~T~NE~R~~~Gl~p~~~g~------~d~~~~~--~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) +.++..|+++++|+|.++.-+|- ++. .|.+..+ +.+. .+....+++.+.+.++...-+.. ... T Consensus 481 ~~~~~~gvI~~~evr~rL~~d~~-s~Y~~~~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~ 556 (698) T protein:vir:10 481 VLYVQEQVIRPDQVAARLNTEPD-GPYAGKLDANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGARAGA---TAP 556 (698) T ss_pred HHHHHhcCCCHHHHHHHHhccCC-CccccccCCcccCCCCCCCcchHHHhhhcCCcCCCCcccccccccccCCC---CCC Confidence 77899999999999998854431 111 1111000 1110 01112222222211111111100 000 Q ss_pred cccccccccccchhhhhcchh-hhhhheecccccEEEEEEecc--cCCcceeeeccCCH-----HHHHHHhCCCccchhh Q lcl|NC_021537. 492 RDSVDVDVSKDPIEQTTFSSS-NLDEGLYDFGERELYLSFKRE--SGQNSLYVYVDVPA-----AVWSALVSAPSAGSYH 563 (602) Q Consensus 492 ~~~~~~~~~~~~m~~~~v~ss-~~~~~~yd~~~~~l~~~f~~~--~~~~~~y~y~~v~~-----~~~~~~~~a~s~g~~~ 563 (602) ...+.......+-...+..++ ....+-+..+.++|-++=..+ .-+|+.-+.=.-|+ +++++-- -.|.|. 
T Consensus 557 ~~~~~~~~~~~~~~~~~~~~~~~a~giv~~~g~~vLL~~r~~g~W~lPgG~ie~GEt~~~aa~RE~~EEtG---~~~~~~ 633 (698) T protein:vir:10 557 PAAANVNANANPREAGAQDAAMRAAGIVFRAGDKVLLMKRPAGDWGLPAGKVEDGETPEEAARRETLEETG---HAGDYV 633 (698) T ss_pred cccccccCCCCccccCcccceeeEEEEEEEcCCeEEEEEecCCCcccCccccCCCCCHHHHHHHHHHhhcc---cccchh Confidence 001111111111111111222 112334555677887743211 11222222222222 2333211 112222 Q ss_pred hhhhc--ccccccccccc---hhcccCCCC----CChhhcCCcccccC Q lcl|NC_021537. 564 YSEIR--LQYGYLEVTNN---HERLPEGPT----PDPGEAPEDVPSDI 602 (602) Q Consensus 564 ~~~i~--~~~~~~~~~~~---~~~~~~~~~----~~~~~~~~~~~~~~ 602 (602) ...+. +.|-+.-+.+. .-++..-.. =+ |+.+|.-+ T Consensus 634 l~~~g~~de~~~~f~ad~~p~~~~l~dEh~~~~Wfd----pdeLP~pL 677 (698) T protein:vir:10 634 LAPLGKYDEFFHAFVADVNPFDVELNDEHTAFDWFD----PDELPHPL 677 (698) T ss_pred hhcccccceEEEEEEEEecCcceeeccccccccccC----hHhccccc Confidence 21111 00000001111 001100000 02 33344322 No 137 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=99.23 E-value=4.3e-10 Score=71.93 Aligned_cols=512 Identities=11% Similarity=0.033 Sum_probs=210.2 Q ss_pred CCCCccc--ccccc-----------------hhhhcccCccccCC-----CCHHHHHHHHhhhHHHHHHHHHHHHhhccC Q lcl|NC_021537. 
1 MSKAEET--TQLDE-----------------RHIATDVGRGIQPP-----YNPETLAAFQELNETHQACIRKKSRYEAGY 56 (602) Q Consensus 1 ~~k~~~~--~~~~~-----------------~~~~~~~~~~i~p~-----~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~ 56 (602) |+.|..- .+.++ -.|.........+- +=.-.|+.+++ .|-.++|+.++++.+..- T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ-~~eyr~~~~~ia~e~~R~ 139 (695) T protein:vir:36 61 VEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQ-LPEYRAMHEVLADECIRT 139 (695) T ss_pred cCCCcccccceeceecccccCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhh-ccchhhHHHHHHHHhhcc Confidence 3333210 00000 00111111111111 11245666666 577899999999988776 Q ss_pred ceEEEE------------ecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEe Q lcl|NC_021537. 57 GFEIVA------------HPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEIL 124 (602) Q Consensus 57 ~~~i~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~ 124 (602) ..++.. ...+.....+.++.+++...+++.+. .+-++..++.--+||-+.+.+. T Consensus 140 w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V--------------~~~l~eaik~aRlfGGa~~~i~ 205 (695) T protein:vir:36 140 WGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRI--------------RDAVRTTVIHDQAFGRAHPYFK 205 (695) T ss_pred cceecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHH--------------HHHHHHHHHhhccccceEEEEE Confidence 444321 11122223334566666666554332 2333444455556777776654 Q ss_pred eCC-----------------CCceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeeccccccc Q lcl|NC_021537. 125 VEG-----------------DGTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYG 187 (602) Q Consensus 125 r~~-----------------~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~ 187 (602) -++ +|....|..|+|.+|.+...... .+.....-.+.|++|. 
T Consensus 206 i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~------dP~spdfgkP~~y~V~--------------- 264 (695) T protein:vir:36 206 IKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSI------NPVADDFYKPSTWWMI--------------- 264 (695) T ss_pred eccCccccccccccccccccCcceeeeEeecccccccchhhhc------cchhhccCCCceEEEe--------------- Confidence 433 23344466666666655322110 0111111112222221 Q ss_pred ccceeeecccceEEecCceeEEechhHEEEecCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_021537. 188 DDKRFVDKETGEVASDAGELKNGPANELIFLPNPS------PLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPH 261 (602) Q Consensus 188 ~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~------~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~ 261 (602) | .++..+.++.|.... +...+.|+|..+.+...+................-. .. T Consensus 265 -----------------G--~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~-v~ 324 (695) T protein:vir:36 265 -----------------G--TEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFS-VS 324 (695) T ss_pred -----------------c--eEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhh-HH Confidence 1 123334444343221 223468999999988888776666555555443321 11 Q ss_pred eEEEecc-ccC---CHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHh Q lcl|NC_021537. 262 YAVKVTG-GTL---SEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRER 337 (602) Q Consensus 262 gil~~~~-~~~---~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~ 337 (602) ++ ++.. ..+ ...+....-+.++.++ +|.|.+++-...-++.... .+|+-.. +.... T Consensus 325 ~l-k~dla~aL~~g~~~~l~~R~eli~~~R--sn~G~~llDk~~Eefeq~s----------tslSGLd-------dVi~q 384 (695) T protein:vir:36 325 GI-LMDLAQALMPGANVDLSMRAELINRYR--DNRNILFLDKATEEFFQFN----------TPLSGLD-------ALQAQ 384 (695) T ss_pred HH-HHHHHHhhcChhHHHHHHHHHHHHHhc--CccceEEEecCCcceEEEe----------cccCCHH-------HHHHH Confidence 11 1000 011 1111222223445554 4444444421233333321 2333332 23333 Q ss_pred hHHHHHHHhcCChHHh-hccccCCccCHHHHHHHHHHH-------HHHHHHHHHHHHHhhhcCCccccccceEEEeccch Q lcl|NC_021537. 
338 NEHEIAKVHGVPPVLI-NVTSTSNRANSKEQTREFAKG-------IIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRG 409 (602) Q Consensus 338 ~~~~Ia~~fgVPp~~l-g~~~~~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~ 409 (602) ..+.||.+-+||...| |.+-.|=.++.|.-..+|+.. -|+|.++.+-+.|-+..+... ...+.++|+ . T Consensus 385 f~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i--dpdi~~~fn--P 460 (695) T protein:vir:36 385 AQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV--DPSIKWQWN--A 460 (695) T ss_pred HHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CCcceEEeC--C Confidence 4579999999997765 554444346667666677653 578888888777766554332 244555555 3 Q ss_pred hcchhHH--H---HHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcc------cccccc--ccccccccccCCCcCccc Q lcl|NC_021537. 410 AEQPEQD--A---KMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRG------DMTLSE--FEAEFGADASDGDAEAML 476 (602) Q Consensus 410 ~~~~~~d--~---~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~------d~~~~~--~~~~~~~~~~~~~~~~~~ 476 (602) +..+... + ++.++..+.++..|+++++|+|.++.-+|- ++.. |.+..+ +........-.+..++.. T Consensus 461 L~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~-s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~ 539 (695) T protein:vir:36 461 LRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPD-GPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGD 539 (695) T ss_pred CCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCC-cccccccccccCCCcCccchhhhhHhhhcCcccccc Confidence 3333221 1 334566778899999999999999876542 1111 111000 000000000000000000 Q ss_pred ccccccccccccccccccccccccccchhhhhcchh-hhhhheecccccEEEEEEec--ccCCcceeeeccCCH-----H Q lcl|NC_021537. 477 TRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSS-NLDEGLYDFGERELYLSFKR--ESGQNSLYVYVDVPA-----A 548 (602) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss-~~~~~~yd~~~~~l~~~f~~--~~~~~~~y~y~~v~~-----~ 548 (602) +.....+.. ........+.+.....+-.-..+++. ....+-+..+.++|-++=.. 
+.-+|+.-+.-.-|+ + T Consensus 540 ~~~~~~~~~-g~~~~~~v~~~~~~~~~~~ag~~~~~~~aag~v~~~~g~vLl~kr~~g~W~lPgG~vE~gEt~~~aa~RE 618 (695) T protein:vir:36 540 TGAPGGARA-GATAPPTVANVNANVNPREAGAQDAAMRAAGAVYVVDGKVLLMKRPAGDWGLPAGKVEGNETPEEAARRE 618 (695) T ss_pred cCCCCcccc-cccCCCcccccccccCccccCCCCccceeeEEEEEeCCEEEEEEecCCCccCCccccCCCCCHHHHHHHH Confidence 000000000 00000000111111100000111111 11233355568888875422 122333333333332 2 Q ss_pred HHHHHhC-----CCccchhhhhhhcccccccccccchhcccCCCCCChh-----hcCCcccccC Q lcl|NC_021537. 549 VWSALVS-----APSAGSYHYSEIRLQYGYLEVTNNHERLPEGPTPDPG-----EAPEDVPSDI 602 (602) Q Consensus 549 ~~~~~~~-----a~s~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 602 (602) +.++.-= .-..|+| ...|.+-.. .. ....+...+.. =-|+++|..+ T Consensus 619 ~~EEtGl~~~~el~~~g~~-----~~~~~~f~~-~~--e~~~~~l~dEh~~~~Wf~pdeLP~pL 674 (695) T protein:vir:36 619 TREETGYDHDGELVPLGKF-----DGFFHAFVA-HL--EPFDVELNDEHTAFDWFNPDELPHPL 674 (695) T ss_pred HHHHhCCccccceeeeeee-----cceEEEEEE-ee--cccCcccCchhhhcccCChhhcCccc Confidence 2332110 0122221 111111000 00 00000001000 1245555444 No 138 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.23 E-value=4.9e-11 Score=77.09 Aligned_cols=426 Identities=15% Similarity=0.137 Sum_probs=173.4 Q ss_pred CCCCcc-----------------cccccc-hhhhcccCc-cc--cCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_021537. 1 MSKAEE-----------------TTQLDE-RHIATDVGR-GI--QPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFE 59 (602) Q Consensus 1 ~~k~~~-----------------~~~~~~-~~~~~~~~~-~i--~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~ 59 (602) +...++ ...+.. .++.. |. .+ -+.--+..++++.-...+.+.||+..++.+...||. 
T Consensus 5 ~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~--G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~ 82 (484) T protein:vir:77 5 LQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYE--SERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQELEGFR 82 (484) T ss_pred ccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHh--ccccchhcccccchhHHhhhhhcCcHHHHHHHHHhhhccCcee Confidence 111110 011111 11111 11 11 011112444555445678889999999988777775 Q ss_pred EEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCce-------E Q lcl|NC_021537. 60 IVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTP-------V 132 (602) Q Consensus 60 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~-------~ 132 (602) +- +. .+..+.+..++.. -.+......+..+.+++|.||+.+.++.+|.+ . T Consensus 83 ~~---~~------~~~~~~l~~i~~~--------------N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~ 139 (484) T protein:vir:77 83 LG---GA------DKADEQLWDWWQA--------------NDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVP 139 (484) T ss_pred cC---Cc------chhHHHHHHHHHh--------------cCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccc Confidence 31 11 1111222222211 23567788899999999999999988887754 2 Q ss_pred EEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEech Q lcl|NC_021537. 133 GLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPA 212 (602) Q Consensus 133 ~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 212 (602) .|..++|..+.+..+...-.. .-+..++....+...+.... ..++........+|.+.........+.. T Consensus 140 ~i~~~~p~~~~~~~D~~~~~~---------~~a~~~~~~~~~~~~~~~~~--y~~~~~~~~~~~~~~~~~~~~~~~~~g~ 208 (484) T protein:vir:77 140 IIRVEPPTNLYAQIDPRTRQV---------MRAIRAIEDEEGNEVIGATL--YLPNNTVIWNREDGQWVQVANVAHNLEM 208 (484) T ss_pred eEEEeccceeEEEecCCCCce---------EEEEEEEEeecCCcEEEEEE--EecCeEEEEEecCCceEeeccccCCCCC Confidence 577788888865543221110 00111111111111110000 0111111122222222222112222333 Q ss_pred hHEEEecCCCCCCCcccccHHHHHH-HHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHH--HHHHHHHhhcc Q lcl|NC_021537. 
213 NELIFLPNPSPLALYYGVPDWVAAM-QTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKED--LRNLMDNLKGS 289 (602) Q Consensus 213 ~eviH~r~~~~~~~~~G~spl~~~~-~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~--l~~~~~~~~g~ 289 (602) =.|+||.+....+..+|.|.+.-.. ..++....+..-........+.|..+|. |....+...+. -...|+.. T Consensus 209 vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~~~--- 283 (484) T protein:vir:77 209 VPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLF--GVKGEELGVDPETGQTLFDAY--- 283 (484) T ss_pred cceEEeccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHh--CCCcchhcccccccchhhhhh--- Confidence 3478998776677789999775322 2233332222222222233344444443 21111111110 00111111 Q ss_pred cccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHH Q lcl|NC_021537. 290 RYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTR 369 (602) Q Consensus 290 ~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~ 369 (602) .++++.+++ .+.++..+...+. --|++..+..+..|+..-++|++.+|.... |-++.++... T Consensus 284 --~~~~~~~~~-------------~~~~~~q~~~~~~--e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~-n~~Sg~Al~~ 345 (484) T protein:vir:77 284 --LARILAFED-------------HESKAQQFSAAEL--RNFVDALDALDRKAAAYTGLPPYYLSFSSE-NPASAEAIRS 345 (484) T ss_pred --hhhhcccCC-------------CCceeEeecCCCh--HHHHHHHHHHHHHHhcccCCCHHHhccccC-cchHHHHHHH Confidence 122333221 1223333332221 137788888889999999999999975432 3233333221 Q ss_pred HH-------------HHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCC--ccc Q lcl|NC_021537. 370 EF-------------AKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAG--VGT 434 (602) Q Consensus 370 ~f-------------~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G--~~T 434 (602) .+ +...|.-.+..+....+..-. .....+..+.| .+.... 
+....++++.+++.+| +++ T Consensus 346 ~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~~--~~~~~~i~v~w--~~~~~~--s~~~~ad~~~kl~~~g~gi~s 419 (484) T protein:vir:77 346 SESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGGDI--PPEYYRMESIW--RDPSTP--TYAAKADAATKLYNNGQGVIP 419 (484) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc--ccccccceEEe--cCCCCC--CHHHHHHHHHHHHhccCCCCC Confidence 11 111121111111111111000 00112234444 332222 2334567788888876 888 Q ss_pred HHHHHHHhCCCCCCCCcccccccc--------ccccccccccCCCcCccccccccccccccccccccccc Q lcl|NC_021537. 435 VNEAREELDLAPFEDDRGDMTLSE--------FEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVD 496 (602) Q Consensus 435 ~NE~R~~~Gl~p~~~g~~d~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (602) ..-+++++|+.+-+-......-.. .....+.+.+.+..+.. .+++......+...+. T Consensus 420 ~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 420 KERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDN-----PETPEPQPNPAEEAAA 484 (484) T ss_pred HHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCC-----CCcccccCCCccccCC Confidence 888999998854321111100000 00000111111100000 0000000000000011 No 139 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.22 E-value=1.8e-11 Score=79.47 Aligned_cols=429 Identities=14% Similarity=0.098 Sum_probs=173.8 Q ss_pred CCCCcccccccchhhhccc-Cc-cc--cCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDV-GR-GI--QPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESY 76 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~-~~-~i--~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~ 76 (602) +.+.=+......+....-+ |. .+ -+.--+..++++.....+...||+.++..+...||.+- ++ .+.. 
T Consensus 21 l~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~---~~------~~~~ 91 (485) T protein:vir:10 21 MVSAFEDSTQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQAVEGFRFG---DA------DEAD 91 (485) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhhcccceecC---CC------chhH Confidence 1111111111111111101 11 11 11112344455544456889999999998876676531 11 1111 Q ss_pred HHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCc-------eEEEEEeCccccccccccc Q lcl|NC_021537. 77 QTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGT-------PVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 77 ~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~-------~~~L~~l~p~~v~~~~~~~ 149 (602) +.+..++.. -.+..+...+..+.+++|.||+.+.++..+. ...+..++|..+.+..+.. T Consensus 92 ~~~~~i~~~--------------N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~D~~ 157 (485) T protein:vir:10 92 EELWQWWQA--------------NNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYAEIDPR 157 (485) T ss_pred HHHHHHHHh--------------cCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEEEEcCC Confidence 223333221 2456777889999999999999988775432 2357788888876554322 Q ss_pred ccccccchhhhhcccCceeEEEEcCCcceeecccccccc-cceeeecccceEEecCceeEEechhHEEEecCCCCCCCcc Q lcl|NC_021537. 150 TIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGD-DKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYY 228 (602) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~ 228 (602) .... ..+..+++...+....... .|.. 
.........+.+.........+..=.|+||.+.....+.+ T Consensus 158 ~~~~---------~~~~~~~~~~~~~~~~~~~---~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~ 225 (485) T protein:vir:10 158 IGRV---------SKAIRVAYDAEGNEIQAAT---LYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLY 225 (485) T ss_pred CCce---------eEEEEEEEeeCCCeEEEEE---EEeCCeEEEEEEcCCceEEeccccCCCCcccEEEeccccccCCCC Confidence 1100 0011111111111111100 1111 0111111112222222222334445688888776677789 Q ss_pred cccHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHH--HHHHHHHHhhcccccCcceeccCCccce Q lcl|NC_021537. 229 GVPDWVA-AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKE--DLRNLMDNLKGSRYRTAILEVEEFVDDH 305 (602) Q Consensus 229 G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~--~l~~~~~~~~g~~nag~~~~~~~g~~~~ 305 (602) |.|.+.. +...++....+..-........+.|..+|+ |..+.....+ .-...|+. ..++++.+++ T Consensus 226 G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~~-----~~~~i~~~~~----- 293 (485) T protein:vir:10 226 GTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIF--GIKPEEIGVDPETGQTLFDA-----YLARILAFED----- 293 (485) T ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHh--cCCcccccccccccchhhhh-----cccceeccCC----- Confidence 9997654 223333333222222223333344444433 2111111000 00011111 1123333221 Q ss_pred eccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHH-------------HH Q lcl|NC_021537. 306 GLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTRE-------------FA 372 (602) Q Consensus 306 ~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~-------------f~ 372 (602) .+.+|..+...+. -.|++.++..+.+|+..=++|+..+|.... |.++.++.... 
.+ T Consensus 294 --------~d~k~~q~~~~~~--~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~-n~~Sg~Al~~~~~~l~~k~~~k~~~f 362 (485) T protein:vir:10 294 --------AEGKIQQFSAAEL--ANFTNALDQIAKQVAAYTGLPPQYLSTAAD-NPASAEAIRAAESRLIKKVERKNSIF 362 (485) T ss_pred --------CCceEEeecccch--HHHHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHHHH Confidence 1123322222221 137788888888999999999999875432 33333322111 11 Q ss_pred HHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCC--cccHHHHHHHhCCCCCCCC Q lcl|NC_021537. 373 KGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAG--VGTVNEAREELDLAPFEDD 450 (602) Q Consensus 373 ~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~Gl~p~~~g 450 (602) ...|+.+++.+....+ ........+.+++.+.+.... +....++++.+++.+| +++..-+++++|+.+-+-. T Consensus 363 ~~~l~~~~~l~~~~~~----~~~~~~~~~~i~v~w~~~~~~--~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~ 436 (485) T protein:vir:10 363 GGAWEEAMRLAYRMMK----GGDVPPDMLRMETVWRDPSTP--TYAAKADAASKLYNGGTGVIPRERARKDMGYSIAERE 436 (485) T ss_pred HHHHHHHHHHHHHHhC----CCCCcccceeeeEEecCCCCC--CHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHH Confidence 2222222222211111 011111112333333332222 3344567888888866 8888889999998653211 Q ss_pred cccccccccc----cccc--ccccCCCcCcccccccccccccccccccccccccc Q lcl|NC_021537. 451 RGDMTLSEFE----AEFG--ADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDV 499 (602) Q Consensus 451 ~~d~~~~~~~----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (602) .....-.... .... ..+..+..+.....+..+++... 
+..+.+ T Consensus 437 ~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~ 485 (485) T protein:vir:10 437 EMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAPAPKPAALE------SGGDAA 485 (485) T ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCccccccCcCCC------CCCCCC Confidence 1110000000 0000 00000000011111111111000 001111 No 140 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=99.19 E-value=6.9e-10 Score=70.82 Aligned_cols=509 Identities=11% Similarity=0.038 Sum_probs=209.9 Q ss_pred CCCCcccccccchhhhcccCccccCC-----CCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEE------------e Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPP-----YNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVA------------H 63 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~-----~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~------------~ 63 (602) -.+.+-+-.++ |.........+- +=.-.|+.+++ .|-.++|+.++++.+..-..++.. . T Consensus 83 ~~~~~~~~~~~---~~~~~~~~l~~~~~~~F~Gy~~la~laQ-~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~ 158 (695) T protein:vir:78 83 RERRAASYALD---FNGTSMDALSFVTSSGFPGFPTLVLLAQ-LPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAA 158 (695) T ss_pred cccchhhhhhc---ccccccccchhhhccCcchHHHHHHHhh-ccchhhHHHHHHHHhhcccceeccccchhhhhhcccc Confidence 11111111111 111111111111 11245666666 577899999999988776444321 1 Q ss_pred cCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC---------------- Q lcl|NC_021537. 64 PSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG---------------- 127 (602) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~---------------- 127 (602) ..+..+..+.++.+++...+++.+. 
.+-++..++.--+||-+.+.+.-++ T Consensus 159 ~~~~~~~~d~dqi~~L~~e~erL~V--------------~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I 224 (695) T protein:vir:78 159 GGNAASTSDGDQLKQINDEIERLRI--------------RDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTV 224 (695) T ss_pred cccccccccHHHHHHHHHHHHHHHH--------------HHHHHHHHHhhccccceEEEEEeccCccccccccccccccc Confidence 1122222333566666665554432 2233444455556777776654433 Q ss_pred -CCceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCce Q lcl|NC_021537. 128 -DGTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGE 206 (602) Q Consensus 128 -~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 206 (602) +|....|..|+|..|.+...... .+.....-.+.|++|. | T Consensus 225 ~kGslKGl~ViDp~~vtP~~~n~~------dP~spdfgkP~~y~V~--------------------------------G- 265 (695) T protein:vir:78 225 PKGSFQGLRVVEPYWVTPNNYNSI------NPVADDFYKPSTWWMI--------------------------------G- 265 (695) T ss_pred cCcceeeeEeecccccccchhhhc------cchhhccCCCceEEEe--------------------------------c- Confidence 23344466666666655322110 0111111112222221 1 Q ss_pred eEEechhHEEEecCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecc-ccCC-HH--HH Q lcl|NC_021537. 207 LKNGPANELIFLPNPS------PLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTG-GTLS-ED--SK 276 (602) Q Consensus 207 ~~~~~~~eviH~r~~~------~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~~~~-~~--~~ 276 (602) .++..+.++.|.... +...+.|+|..+.+...+................-. ..++. +.- ..+. .. +. 
T Consensus 266 -~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~-v~~lk-~dla~~L~~g~~~~l 342 (695) T protein:vir:78 266 -TEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFS-VSGIL-MDLAQALMPGANVDL 342 (695) T ss_pred -eEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhh-hHHHH-HHHHHhhcChhHHHH Confidence 123334444343221 223468999999998888776666655555543322 22221 100 0111 11 12 Q ss_pred HHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hc Q lcl|NC_021537. 277 EDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NV 355 (602) Q Consensus 277 ~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~ 355 (602) ...-+.++.++ +|.|.+++-...-++.... .+|+-.. +......+.||.+-+||...| |. T Consensus 343 ~~R~eli~~~R--sn~G~~llDk~~Eefeq~s----------tslSGLd-------dVi~qf~q~VAgaa~IPltkLfGq 403 (695) T protein:vir:78 343 SMRAELINRYR--DNRNILFLDKATEEFFQFN----------TPLSGLD-------ALQAQAQEQMSAVSHIPLIKLLGI 403 (695) T ss_pred HHHHHHHHHhc--CccceEEEecCCcceEEEe----------cccCCHH-------HHHHHHHHHHHhhhcCchhhhhcc Confidence 21223345554 4444444421233333321 2333332 233334579999999997765 55 Q ss_pred cccCCccCHHHHHHHHHHH-------HHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHH--H---HHHHHH Q lcl|NC_021537. 356 TSTSNRANSKEQTREFAKG-------IIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQD--A---KMAEQR 423 (602) Q Consensus 356 ~~~~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d--~---~~~~~~ 423 (602) +-.|=.++.|.-..+|+.. -|+|.++.+-+.|-+..+... ...+.++|+ .+..+... + ++.++. 
T Consensus 404 SPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i--dpdi~~~fn--PL~qmtd~EkAeI~~k~A~~ 479 (695) T protein:vir:78 404 TPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV--DPSIKWQWN--ALRELDDLEVAESRYKQAQS 479 (695) T ss_pred CCccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CCcceEEeC--CCCCcCHHHHHHHHhhhhHH Confidence 4444346667666677653 578888888777766554332 234555555 33333221 1 334566 Q ss_pred HHHHHhCCcccHHHHHHHhCCCCCCCCcc------cccccc--ccccccccccCCCcCcccccccccccccccccccccc Q lcl|NC_021537. 424 VRAMRLAGVGTVNEAREELDLAPFEDDRG------DMTLSE--FEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSV 495 (602) Q Consensus 424 ~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~------d~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (602) .+.++..|+++++|+|.++.-+|- ++.. |.+..+ +........-.+..++..+.....+....... ...+ T Consensus 480 d~~~~~~gvI~~~evr~rL~~d~~-s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~-~~~~ 557 (695) T protein:vir:78 480 DVLYVQEQVIRPDQVAARLNTEPD-GPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAP-PTVA 557 (695) T ss_pred HHHHHHhcCCCHHHHHHHHhcCCC-cccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCCCCCCCCCC-Ccee Confidence 778899999999999999876542 1111 111100 00000000000001111111111110000000 0000 Q ss_pred cccccccchhhhhcchh-hhhhheecccccEEEEEEec--ccCCcceeeeccCCH-----HHHHHHhC-----CCccchh Q lcl|NC_021537. 496 DVDVSKDPIEQTTFSSS-NLDEGLYDFGERELYLSFKR--ESGQNSLYVYVDVPA-----AVWSALVS-----APSAGSY 562 (602) Q Consensus 496 ~~~~~~~~m~~~~v~ss-~~~~~~yd~~~~~l~~~f~~--~~~~~~~y~y~~v~~-----~~~~~~~~-----a~s~g~~ 562 (602) .+...-.+-+-..+++. ....+-+..+.++|-++=.. 
+.-+|+.-+.-.-|+ ++.++.-= .-..|+| T Consensus 558 ~~~~~~~~~~ag~~~~~~~aag~v~~~~g~vLl~kr~~g~W~lPgG~vE~gEt~~~aa~RE~~EEtGl~~~~el~~~g~~ 637 (695) T protein:vir:78 558 NVNANVKPREAGAQDAAMRAAGAVYVVDGKVLLMKRPAGDWGLPAGKVEGNETPEEAARRETREETGYDHDGELVPLGKF 637 (695) T ss_pred eeeccccccccCCCCcccceeEEEEEeCCEEEEEEecCCCccCCccccCCCCCHHHHHHHHHHHHhCCccccceeeeeee Confidence 00000000000111111 11233345568888875422 122333333333332 22332210 0122221 Q ss_pred hhhhhcccccccccccchhcccCCCCCChh-----hcCCcccccC Q lcl|NC_021537. 563 HYSEIRLQYGYLEVTNNHERLPEGPTPDPG-----EAPEDVPSDI 602 (602) Q Consensus 563 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 602 (602) ...|.+-.. .. ....+...+.. =-|+++|..+ T Consensus 638 -----~~~~~~f~~-~~--e~~~~~l~dEh~~~~Wf~pdeLP~pL 674 (695) T protein:vir:78 638 -----DGFFHAFVA-HL--EPFDVELNDEHTAFDWFNPDELPHPL 674 (695) T ss_pred -----cceEEEEEE-ee--cccCcccCchhhhcccCChhhcCccc Confidence 111110000 00 00000001000 0244455444 No 141 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=99.18 E-value=7.7e-10 Score=70.55 Aligned_cols=511 Identities=11% Similarity=0.043 Sum_probs=210.0 Q ss_pred CCCCcccccc---cc-------------hhhhcccCc----cccCC-----CCHHHHHHHHhhhHHHHHHHHHHHHhhcc Q lcl|NC_021537. 1 MSKAEETTQL---DE-------------RHIATDVGR----GIQPP-----YNPETLAAFQELNETHQACIRKKSRYEAG 55 (602) Q Consensus 1 ~~k~~~~~~~---~~-------------~~~~~~~~~----~i~p~-----~~~~~l~~~~~~~~~v~~cI~~ia~~ia~ 55 (602) ++.+. |-+| +. ..++.+.++ ...+- +=.-.|+.+++ .|-.++|+.++++.+.. 
T Consensus 60 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ-~~eyr~~~~~ia~e~~R 137 (694) T protein:vir:10 60 AEPSP-SLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQ-LPEYRAMHEVLADECIR 137 (694) T ss_pred CCCCc-chhhhhhccccccCCCccccchhhhhhccCcccccchhhhhccCcchHHHHHHHhh-ccchhhHHHHHHHHhhc Confidence 22221 1100 00 011111111 11110 11245666666 57789999999998877 Q ss_pred CceEEEE------------ecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEE Q lcl|NC_021537. 56 YGFEIVA------------HPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEI 123 (602) Q Consensus 56 ~~~~i~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i 123 (602) -..++.. ...+..+..+.++.+++...+++.+. .+-++..++.--+||-+.+.+ T Consensus 138 ~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V--------------~~~l~eaik~aRlfGGa~~~i 203 (694) T protein:vir:10 138 TWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRI--------------RDAVRTTVIHDQAFGRAHPYF 203 (694) T ss_pred ccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHH--------------HHHHHHHHHhhccccceEEEE Confidence 6444321 11122223333566666665555432 223344445555677777665 Q ss_pred eeCC-----------------CCceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccc Q lcl|NC_021537. 124 LVEG-----------------DGTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRY 186 (602) Q Consensus 124 ~r~~-----------------~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~ 186 (602) .-++ +|....|..|+|..|.+...... .+.....-.+.|++|. T Consensus 204 ~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~------dP~spdfgkP~~y~V~-------------- 263 (694) T protein:vir:10 204 KIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSI------NPVADDFYKPSTWWMI-------------- 263 (694) T ss_pred EeecCccccccccccccccccCcceeeeEeecccccccchhhhc------cchhhccCCCceEEEe-------------- Confidence 4333 23344466666666655432110 0111111112222221 Q ss_pred cccceeeecccceEEecCceeEEechhHEEEecCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_021537. 
187 GDDKRFVDKETGEVASDAGELKNGPANELIFLPNPS------PLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIP 260 (602) Q Consensus 187 ~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~------~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p 260 (602) | .++..+.++.|.... +...+.|+|....+...+................-. . T Consensus 264 ------------------G--~~IH~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~-v 322 (694) T protein:vir:10 264 ------------------G--TEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFS-V 322 (694) T ss_pred ------------------c--eEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhh-h Confidence 1 123334444343221 223468999999988888776666655555543322 1 Q ss_pred ceEEEecc-ccCC-HH--HHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHH Q lcl|NC_021537. 261 HYAVKVTG-GTLS-ED--SKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRE 336 (602) Q Consensus 261 ~gil~~~~-~~~~-~~--~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~ 336 (602) .++. +.- ..+. .. +....-+.++.++ +|.|.+++-...-++.... .+|+-.. +... T Consensus 323 ~~lk-~dla~~L~~g~~~~l~~R~eli~~~R--sn~G~~llDk~~Eefeq~s----------tslSGLd-------dVi~ 382 (694) T protein:vir:10 323 SGIL-MDLAQALMPGANVDLSMRAELINRYR--DNRNILFLDKATEEFFQFN----------TPLSGLD-------ALQA 382 (694) T ss_pred HHHH-HHHHHhhcChhHHHHHHHHHHHHHhc--CccceEEEecCCcceEEEe----------cccCCHH-------HHHH Confidence 2211 100 0111 11 1221223345554 4444444421233333321 2333332 2333 Q ss_pred hhHHHHHHHhcCChHHh-hccccCCccCHHHHHHHHHHH-------HHHHHHHHHHHHHhhhcCCccccccceEEEeccc Q lcl|NC_021537. 337 RNEHEIAKVHGVPPVLI-NVTSTSNRANSKEQTREFAKG-------IIEPEQAKFSARLYKIIHQDALDVDEWTIDFELR 408 (602) Q Consensus 337 ~~~~~Ia~~fgVPp~~l-g~~~~~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~ 408 (602) ...+.||.+-+||...| |.+-.|=.++.|.-..+|+.. -|+|.++.+-+.|-+..+... 
...+.++|+ T Consensus 383 qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i--dp~i~~~fn-- 458 (694) T protein:vir:10 383 QAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV--DPSIKWQWN-- 458 (694) T ss_pred HHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CCcceEEeC-- Confidence 34579999999997765 554444346667666677653 578888888777766554332 244555555 Q ss_pred hhcchhHH--H---HHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcc------cccccc--ccccccccccCCCcCcc Q lcl|NC_021537. 409 GAEQPEQD--A---KMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRG------DMTLSE--FEAEFGADASDGDAEAM 475 (602) Q Consensus 409 ~~~~~~~d--~---~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~------d~~~~~--~~~~~~~~~~~~~~~~~ 475 (602) .+..+... + ++.++..+.++..|+++++|+|.++.-+|- ++.. |.+..+ +........-.+..++. T Consensus 459 PL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~-s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~ 537 (694) T protein:vir:10 459 ALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPD-GPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGG 537 (694) T ss_pred CCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCC-cccccccccccCCCcCccchhhhhHhhhcCccccc Confidence 33333221 1 334566778899999999999999876542 1111 111000 00000000000000011 Q ss_pred cccccccccccccccccccccccccccchhhhhcchh-hhhhheecccccEEEEEEec--ccCCcceeeeccCCH----- Q lcl|NC_021537. 476 LTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSS-NLDEGLYDFGERELYLSFKR--ESGQNSLYVYVDVPA----- 547 (602) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss-~~~~~~yd~~~~~l~~~f~~--~~~~~~~y~y~~v~~----- 547 (602) .+.....+... .......+.+.....+-.-..+++. ....+-+..+.++|-++=.. 
+.-+|+.-+.-.-|+ T Consensus 538 ~~~~~~~~~~g-~~~~~~v~~~~~~~~~~~ag~~~~~~~~ag~v~~~~g~vLl~kr~~g~W~lPgG~vE~gEt~~~a~~R 616 (694) T protein:vir:10 538 DTGAPGGARAG-ATAPPTVANVNANVNPREAGAQDAAMRAAGAVYVVDGKVLLMKRPAGDWGLPAGKVEGNETPEEAARR 616 (694) T ss_pred ccCCCCccccc-ccCCCcccccccccCccccCCCCccceeeEEEEEeCCEEEEEEecCCCccCCccccCCCCCHHHHHHH Confidence 00000000000 0000000111111100000111111 12233355568888875421 122233333333332 Q ss_pred HHHHHHhC-----CCccchhhhhhhcccccccccccchhcccCCCCCChh-----hcCCcccccC Q lcl|NC_021537. 548 AVWSALVS-----APSAGSYHYSEIRLQYGYLEVTNNHERLPEGPTPDPG-----EAPEDVPSDI 602 (602) Q Consensus 548 ~~~~~~~~-----a~s~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 602 (602) ++.++.-= .-..|+| ...|.+-.. .. ....+...+.. =-|+++|..+ T Consensus 617 E~~EEtGl~~~~el~~~~~~-----~~~~h~f~~-~~--e~~~v~l~dEh~~~~Wf~pdeLP~pL 673 (694) T protein:vir:10 617 ETREETGYDHDGELVPLGKF-----DGFFHAFVA-HL--EPFDVELNDEHTAFDWFNPDELPHPL 673 (694) T ss_pred HHHHHhCCccccceeeeeee-----cceEEEEEE-ee--cccCcccCchhhhcccCChhhcCccc Confidence 22222110 0112221 111111000 00 00000001000 0244555444 No 142 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.17 E-value=1.4e-10 Score=74.58 Aligned_cols=438 Identities=13% Similarity=0.038 Sum_probs=172.8 Q ss_pred CCCCcc-----------cccccchhhh-cccCc----cccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021537. 1 MSKAEE-----------TTQLDERHIA-TDVGR----GIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHP 64 (602) Q Consensus 1 ~~k~~~-----------~~~~~~~~~~-~~~~~----~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~ 64 (602) |.+.+. ..+...+... +.-|. 
.+.+ --+..++++.-...+.+.||+.+++.+.--||.+- T Consensus 18 l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~-~~p~~~~~~~~v~n~~~~iVd~~a~rl~~~Gf~~~--- 93 (504) T protein:vir:99 18 LNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGN-LIPPEYLRTATVLGWSAKAVDTLARRCNLESFVWP--- 93 (504) T ss_pred CCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccchhccc-cccHHHHHHhhccCcHHHHHHHHHhhhccceeeCC--- Confidence 111110 0010111110 00011 1111 11344555544567888899999998877777531 Q ss_pred CCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceE-EEEEeCccccc Q lcl|NC_021537. 65 SADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPV-GLAHVPAATVR 143 (602) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~-~L~~l~p~~v~ 143 (602) +.+ ...+.+...+. .-++......+..+.+++|.||+.+..+.+|++. .+.+++|..+. T Consensus 94 d~~------~~~~~l~~i~~--------------~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~ 153 (504) T protein:vir:99 94 DGD------YGSIGGPDVWD--------------ENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQAT 153 (504) T ss_pred CCC------hhhHHHHHHHH--------------hcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeE Confidence 111 11112222221 1234567778899999999999999888888764 67788999887 Q ss_pred ccccccccccccchhhhhcccCceeEEE-EcCCcceeecccccccccceeee-cccceEEecCceeEEechhHEEEecCC Q lcl|NC_021537. 144 VRKTTTTIEREDGEEVENIESGHGYVQV-RQGRRRYFGEAGDRYGDDKRFVD-KETGEVASDAGELKNGPANELIFLPNP 221 (602) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~qi-~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~~~~~~~~~~eviH~r~~ 221 (602) +..+........ +..++.. ..|...++..+ .+....... ...+.+.... ....+. -.|++|.+. 
T Consensus 154 ~iyD~~~~~~~~---------a~~~~~~d~~g~~~~~~~y---~~~~~~~~~~~~~~~~~~~~-~~~~~g-vPvV~~~n~ 219 (504) T protein:vir:99 154 GEWNSRRNAMDS---------LLSITSRDAEGHPTGIALY---EDGVTVTADMDDDGDWHADV-RTHKLG-VPVEVLPYK 219 (504) T ss_pred EEEeCCCCceeE---------EEEEEEecCCCeEEEEEEE---cCCcEEEEEEcCCceeeecc-ccCCCC-cceEEeccc Confidence 554322111000 0001100 00000111111 111111111 1111111110 000111 138888877 Q ss_pred CCCCCcccccHHH----HHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCH------HHHHHHHHHHHHhhcccc Q lcl|NC_021537. 222 SPLALYYGVPDWV----AAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSE------DSKEDLRNLMDNLKGSRY 291 (602) Q Consensus 222 ~~~~~~~G~spl~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~------~~~~~l~~~~~~~~g~~n 291 (602) ...+..+|.|.+. .+...+.....-......||. .|.-+|. |..+.+ .....++... T Consensus 220 ~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a---~p~r~i~--G~~~~~~~~~d~~~~~~~~~~~-------- 286 (504) T protein:vir:99 220 PREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYS---FPQLILL--GADAKNFRNKDGSMKPAWQIAL-------- 286 (504) T ss_pred ccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhc---chhhhhc--cCCccccccccccccchhhhhh-------- Confidence 6667788988653 333333322222222233322 3333332 211110 0111122211 Q ss_pred cCcceeccCCccceeccccccccccccccccccchHHH-HHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHH Q lcl|NC_021537. 292 RTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDM-EFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTRE 370 (602) Q Consensus 292 ag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~-qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~ 370 (602) ++++.++...+..... ..+.++..+... ++ .|++.++..+..|++.=++|++.+|+.+..|.++.++.... 
T Consensus 287 -~~i~~~~~~~~~~~~~----~~~~~~~q~~~~---~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~ 358 (504) T protein:vir:99 287 -ARVFALPDDEDEPDAA----RARADVKQFPAS---SPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIAS 358 (504) T ss_pred -hhhhcCCCcccccccc----CccceeeecCCC---ChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHH Confidence 1233333221111100 112233333322 23 38889999999999999999999998776666665543211 Q ss_pred HH--HHHHHHHHHHHHHHHhhh------cCCcc--ccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCccc--HHH- Q lcl|NC_021537. 371 FA--KGIIEPEQAKFSARLYKI------IHQDA--LDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGT--VNE- 437 (602) Q Consensus 371 f~--~~~l~P~~~~ie~~ln~~------Ll~~~--~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T--~NE- 437 (602) .. ...+.-..+.|...+.+. +.... .....+.++..+.+.... +....++++.+++.+|.+. ..| T Consensus 359 ~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~--s~a~~aDa~~Kl~~ag~~l~~~~~~ 436 (504) T protein:vir:99 359 REDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKFRSPLYL--SKAAQADAGAKMLGAGPEWLKETEV 436 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEecCCCcc--CHHHHHHHHHHHHhhccccccchHH Confidence 11 111111222233333221 10000 011112333333333222 3344678888899988643 233 Q ss_pred HHHHhCCCCCCCCc--ccccccc----ccccccccccCCCcCcccccccccccccccccccccccccccccc Q lcl|NC_021537. 438 AREELDLAPFEDDR--GDMTLSE----FEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDP 503 (602) Q Consensus 438 ~R~~~Gl~p~~~g~--~d~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (602) ..+++|+.+-+-.. .+.--.. .......+...+..+.....+..+++.. +...+.-+=.... 
T Consensus 437 l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~~----~~~~~~~~p~~~~ 504 (504) T protein:vir:99 437 GLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQGAGEPPAN----EPPAALGRPTLVG 504 (504) T ss_pred HHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCCCCCC----CCCccCCCcccCC Confidence 45677886532110 0000000 0000011111111111101111111100 0000000000000 No 143 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.17 E-value=2.9e-10 Score=72.91 Aligned_cols=377 Identities=10% Similarity=-0.024 Sum_probs=167.4 Q ss_pred CCCCcccccccchhhhcccCc-----cccCCCCHHHHHHHHh-hhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGR-----GIQPPYNPETLAAFQE-LNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~-----~i~p~~~~~~l~~~~~-~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) +.+.=...+...++...-+.+ .+.+. -|..++...+ ...+...+|+.+++.+.=-||+. .+. T Consensus 9 L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~-~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~----------~d~- 76 (409) T protein:vir:94 9 LRFKLSVHKRRAEMRYDQYAMKYVDRFKGIT-IPQALSQQYRSILGWCAKGVDSLADRLVFREFEN----------DDF- 76 (409) T ss_pred HHHHHHHHhHHHHHHHHHhcccCchhhcChh-hhHHHHHHHhhhcchhHHHHHHhHhhcccCcccC----------Cch- Confidence 111000011111111111111 11111 1234443322 23678889999888666555531 010 Q ss_pred hHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccc Q lcl|NC_021537. 75 SYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERE 154 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~ 154 (602) .+...+. .-++......+..+.+++|.+|+.+..+.+|+| .+.+++|..+.+..+...-. 
T Consensus 77 ---~l~~i~~--------------~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~-~i~~~sp~~~~~i~D~~~~~-- 136 (409) T protein:vir:94 77 ---TVNEIFE--------------ENNPDIFFDSAVLSSLIASCSFTYISKGENDAV-RLQVIEAVNATGIIDPITGL-- 136 (409) T ss_pred ---HHHHHHH--------------hcChhHHHHHHHHHHHHhcceeEEEecCCCCce-EEEEeccceEEEEEecCCCc-- Confidence 1111111 123456677888899999999999998888876 68888998887554332110 Q ss_pred cchhhhhcccCceeEEEE-cCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHH Q lcl|NC_021537. 155 DGEEVENIESGHGYVQVR-QGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDW 233 (602) Q Consensus 155 ~~~~~~~~~~~~~~~qi~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl 233 (602) ...++.+. .+.........-..++.........+.+..... .+.-=.|+||.+....+..+|.|.+ T Consensus 137 ----------~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n---~~g~vPvV~f~n~~~~~~~~G~s~I 203 (409) T protein:vir:94 137 ----------LTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIAN---PTGHPLLVPIIHRPDAVRPFGRSRI 203 (409) T ss_pred ----------eeeeEEEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeC---CCCCcceEEeccccccccccCcccc Confidence 01111111 111111100000111111111111121111111 1122237888876666778999865 Q ss_pred H----HHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccc Q lcl|NC_021537. 234 V----AAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGD 309 (602) Q Consensus 234 ~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~ 309 (602) . .+...+.....-......| .+.|.-++. |-..+.+..+.++.... 
+++.++...+ T Consensus 204 ~e~v~~l~da~~r~~~~~~~~~e~---~a~pqr~i~--G~d~d~~~~~~~~~~~~---------~i~~~~~d~d------ 263 (409) T protein:vir:94 204 TRSGMYWQSNAKRTLERADVTAEF---YSFPQKYVT--GLSDDAEPMETWKATVS---------SMLQFTKDED------ 263 (409) T ss_pred chhHHHHHHHHHHHHHHHHHHHHH---hcChhheeE--ecCCCCcccchhhhhHH---------HhhcCCCCCC------ Confidence 2 3333333333222233333 344555543 21111122233333222 2333322111 Q ss_pred cccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHH--HHHHHHHHHHHHHH Q lcl|NC_021537. 310 GGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAK--GIIEPEQAKFSARL 387 (602) Q Consensus 310 ~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~--~~l~P~~~~ie~~l 387 (602) ..+.++..+...+. ..|++.++..+.++|+.-++|++.+|.... |-++.++....... ....-..+.|.+.+ T Consensus 264 ---g~~~~v~q~~~~~l--~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~-NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~ 337 (409) T protein:vir:94 264 ---GDKPTLGQFTQPSM--SPFTEQLRTAAAGFAGETGLTLDDLGFVSD-NPSSVEAIKASHENLRLAGRKAQRSLGAGL 337 (409) T ss_pred ---CCCceEEecCCCCh--hHHHHHHHHHHHHHhhhcCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01223333333222 138899999999999999999999986543 33443332211100 00111111122112 Q ss_pred hhh------cCCccc--cccceEEEeccchhcchhH-HHHHHHHHHHHHHhCC--cccHHHHHHHhCCCCCC Q lcl|NC_021537. 388 YKI------IHQDAL--DVDEWTIDFELRGAEQPEQ-DAKMAEQRVRAMRLAG--VGTVNEAREELDLAPFE 448 (602) Q Consensus 388 n~~------Ll~~~~--~~~~~~~~f~~~~~~~~~~-d~~~~~~~~~~~~~~G--~~T~NE~R~~~Gl~p~~ 448 (602) .+. +..... ....+.+++.+........ 
.....++++.+++.+| ++.-+-+++++|+..-+ T Consensus 338 ~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 338 LNVAYLAACLRDDAPYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred HHHHHHHHHHhCCCCccccccccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 111 111100 0111223333333322222 1233568899999999 55668899999996543 No 144 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.15 E-value=8.6e-11 Score=75.77 Aligned_cols=405 Identities=13% Similarity=0.081 Sum_probs=172.4 Q ss_pred ccCCCCHHHHHHHHh--hhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccC Q lcl|NC_021537. 23 IQPPYNPETLAAFQE--LNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMS 100 (602) Q Consensus 23 i~p~~~~~~l~~~~~--~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~ 100 (602) +-|+=....++.+.+ ...+.+.||+.+++.+...+|... + .+..+.+..++.. - T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~~---d-------~~~~~~~~~i~~~--------------N 56 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLLALGVTGP---D-------GEPDTRASRWWQA--------------N 56 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhccCceecC---C-------CchHHHHHHHHHh--------------c Confidence 444433344444332 235889999999998777776431 1 1111222222211 2 Q ss_pred CHHHHHHHHHHHHHhcCCeEEEEeeCCCCc------eEEEEEeCcccccccccccccccccchhh-hhcccCceeEEEEc Q lcl|NC_021537. 101 TPEEVLELGRQDYHGIGWAALEILVEGDGT------PVGLAHVPAATVRVRKTTTTIEREDGEEV-ENIESGHGYVQVRQ 173 (602) Q Consensus 101 t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~------~~~L~~l~p~~v~~~~~~~~~~~~~~~~~-~~~~~~~~~~qi~~ 173 (602) ++......+..+.+++|.+|+.+.++.+|. ...+..++|.++.+..+...-...-.... ....++..+..+.. 
T Consensus 57 ~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~~~~~ 136 (434) T protein:vir:98 57 RLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYARVFF 136 (434) T ss_pred ChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEEEEEE Confidence 345677788999999999999888765442 22477789988876554322111100000 00001111111110 Q ss_pred CCcceeecccccccccceeeecccceEEe----cCceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 174 GRRRYFGEAGDRYGDDKRFVDKETGEVAS----DAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEW 249 (602) Q Consensus 174 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~----~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~ 249 (602) ....+.+.......... ......+.. .......+..=.|+||++....+. .|+|.++.....++....+..- T Consensus 137 ~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~-~g~sd~e~vi~liDa~~~~~s~ 212 (434) T protein:vir:98 137 DDTSFPYRTRERTGARL---PWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGE-DPEPEFAGVLDIQDRVNLGILN 212 (434) T ss_pred eCcEEEEEEeecccccc---ccccccceecccccccccCCCCccceEEeccCCCcCc-CCcchhhhHHHHHHHHHHHHHH Confidence 00000000000000000 000000000 011112233445788876654444 6999988887777776666555 Q ss_pred HHHHHHhcCCCceEEEeccccCCHH--HHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchH Q lcl|NC_021537. 250 NHDVFDNLGIPHYAVKVTGGTLSED--SKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDL 327 (602) Q Consensus 250 ~~~~f~ng~~p~gil~~~~~~~~~~--~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~ 327 (602) ..+...-.+.|..+|+ |..+.+. ........++.+... .++++.+++ .+.++..+..... 
T Consensus 213 ~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~~~~~~--~~~i~~~~~-------------~~~~~~q~~~~~~- 274 (434) T protein:vir:98 213 RMAASRFSGFRQKWIK--GHKFAKRTDPATGMTVVDQPFVPS--PSAVWASEG-------------ENTQFGQLDATDL- 274 (434) T ss_pred HHHHHHHhcchhhhhc--CCCcccccccccccchhhhhhhcc--ccccccCCC-------------CCceEEEecCcch- Confidence 5555555556655553 2111111 111111112222111 122222221 1223333332221 Q ss_pred HHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHH----HHHHHHHHHhhh------cCCcccc Q lcl|NC_021537. 328 DMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEP----EQAKFSARLYKI------IHQDALD 397 (602) Q Consensus 328 d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P----~~~~ie~~ln~~------Ll~~~~~ 397 (602) -.|++.++..+..|+..=++|++.+|.. .+.++.++.. +....|.- ..+.|...|.+. +...... T Consensus 275 -~~~~~~l~~~i~~~~~~~~~p~~~~~~~--~~n~Sg~Al~--~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~~~ 349 (434) T protein:vir:98 275 -SGFLKEHASDVRDMLTISQTPTYLYATD--LVNISADTIG--ALDILHVAKVREHIASFSEGLESVLALAAAQAGVPED 349 (434) T ss_pred -HHHHHHHHHHHHHHhcccCCCHHHhccc--cCChHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCChh Confidence 2377888888999999999999999742 2223333221 11111111 112222222211 1111111 Q ss_pred ccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccccccccc----ccccccCCCcC Q lcl|NC_021537. 398 VDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAE----FGADASDGDAE 473 (602) Q Consensus 398 ~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~----~~~~~~~~~~~ 473 (602) .....++|.-... . .....++++.+++..|+ +..-+++++|+++-+ ..+........ .....+.++.. T Consensus 350 ~~~~~v~w~~~~~--~--s~~~~ada~~kl~~~g~-~~e~~~~~lg~~~~e---~~r~~~e~~~~~~~~~~~~~~~~~~~ 421 (434) T protein:vir:98 350 YTEAEVRWANPAH--V--TMAVKADAATKLKSIGY-PLDVIAEELDESPAR---VRRIVAGAASQALLAASLLPAPGAPS 421 (434) T ss_pred heeeeEEecCCCC--C--CHHHHHHHHHHHHhcCC-cHHHHHHhCCCCHHH---HHHHHHHHHHHHHHHHhhhccCCCCC Confidence 2234455543322 2 33446778888888785 666778888875521 11111000000 00000011100 Q ss_pred ccccccccccccccccccccccc Q lcl|NC_021537. 
474 AMLTRSKAAPPLENKIGERDSVD 496 (602) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~ 496 (602) +...+. . +.. ... T Consensus 422 ~g~~~~-~----~~~-----~dg 434 (434) T protein:vir:98 422 AGNVPD-S----GGA-----VDG 434 (434) T ss_pred CCCCCc-c----cCC-----CCC Confidence 000000 0 000 000 No 145 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.11 E-value=2.7e-10 Score=73.08 Aligned_cols=426 Identities=14% Similarity=0.087 Sum_probs=172.1 Q ss_pred CCCCcccccccchhhhcccCccc-cCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGI-QPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTV 79 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i-~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~ 79 (602) ..+-..-..+. +++... ...- .+.--+..++++.-...+...+|+..+..+...||.+- +. .+..+.+ T Consensus 26 ~~~~~r~~~~~-~YY~G~-~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~---~~------~~~~~~l 94 (485) T protein:vir:24 26 EDQNQNLRSNT-SYYEAE-RRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQAVEGFRLG---DA------DEADEEL 94 (485) T ss_pred HHHHHHHHHHH-HHHhcc-CchhhcCcccchhhhhhhhccchHHHHHHHHhhhhccCceecC---CC------chhHHHH Confidence 11111111111 111100 0000 01111233444444456888899999888877787532 11 1111222 Q ss_pred HHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCce-------EEEEEeCcccccccccccccc Q lcl|NC_021537. 80 RDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTP-------VGLAHVPAATVRVRKTTTTIE 152 (602) Q Consensus 80 ~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~-------~~L~~l~p~~v~~~~~~~~~~ 152 (602) ..++.. -.+..+...+..+.+++|.||+.+-++.++.. ..+.+++|..+.+..+...-. 
T Consensus 95 ~~i~~~--------------N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D~~~~~ 160 (485) T protein:vir:24 95 WQWWQA--------------NNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEIDPRIGR 160 (485) T ss_pred HHHHHh--------------cChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEeeCCcCc Confidence 322211 13567788899999999999999888765432 267888888886554322111 Q ss_pred cccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccH Q lcl|NC_021537. 153 REDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPD 232 (602) Q Consensus 153 ~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~sp 232 (602) ... +..++.-..+........ ..++.........|.+.........+..=.|+||++.....+.+|.|. T Consensus 161 ~~~---------~~~~~~~~~~~~~~~~~~--y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~ 229 (485) T protein:vir:24 161 PAK---------AIRVAYDAEGNEIQAATL--YTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSE 229 (485) T ss_pred eeE---------EEEEEEeecCCeEEEEEE--EcCCcEEEEEecCCceEeecccccCCCcccEEEeccCcccCCcCCccc Confidence 000 000111001110100000 001111111122222322222233445556899987766777899998 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHH--HHHHHHHHhhcccccCcceeccCCccceeccc Q lcl|NC_021537. 233 WVA-AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKE--DLRNLMDNLKGSRYRTAILEVEEFVDDHGLGD 309 (602) Q Consensus 233 l~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~--~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~ 309 (602) +.. +...++....+..-........+.|..+|. |..+.....+ .-...|+. 
..++++.+++ T Consensus 230 i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~~-----~~~~i~~~~~--------- 293 (485) T protein:vir:24 230 ITPELRSMTDAAARILMLMQATAELMGVPQRLIF--GIKPEEIGVDPETGQTLFDA-----YLARILAFED--------- 293 (485) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhc--cCCccccccccccccchhhh-----cccceeccCC--------- Confidence 764 333344333333333333344445555443 2111110000 00011111 1222333221 Q ss_pred cccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHH-------------HHHHHHHH Q lcl|NC_021537. 310 GGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQT-------------REFAKGII 376 (602) Q Consensus 310 ~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~-------------~~f~~~~l 376 (602) .+.++..+...+. -.|++.++..+.+++..=++|+..+|.... |.++.++.. +..+...| T Consensus 294 ----~~~~~~q~~~~~~--e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~-n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l 366 (485) T protein:vir:24 294 ----AEGKIQQFSAAEL--ANFTNALDQIAKQVAAYTGLPPQYLSTAAD-NPASAEAIRAAESRLIKKVERKNAIFGGAW 366 (485) T ss_pred ----CCceEEeecccch--HHHHHHHHHHHHHHhcccCCCHHHhccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1122322222211 136777777888888888999999874432 222322211 11112233 Q ss_pred HHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCC--cccHHHHHHHhCCCCCCCCcccc Q lcl|NC_021537. 377 EPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAG--VGTVNEAREELDLAPFEDDRGDM 454 (602) Q Consensus 377 ~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~Gl~p~~~g~~d~ 454 (602) +-+++.+....+..- ......+..+.|.-. ... +....++.+.+++.+| +++..-+++++|+.+-+-..... 
T Consensus 367 ~~~~~l~~~~~~~~~--~~~d~~~i~v~f~~~--~~~--s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~ 440 (485) T protein:vir:24 367 EEAMRLAYRLMKGGD--VPPDMLRMETVWRDP--STP--TYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRR 440 (485) T ss_pred HHHHHHHHHHhcCCC--CccccceeeEEecCC--CCC--CHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHH Confidence 333332222111100 011122344555322 211 2233456677777765 77877788888885432111111 Q ss_pred ccccc--------cccccccccCCCcCcccccccccccccccccccccccccc Q lcl|NC_021537. 455 TLSEF--------EAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDV 499 (602) Q Consensus 455 ~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (602) ..... ....+...+.++.+.....+..++.. .....+ T Consensus 441 ~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~--------~~~~~a 485 (485) T protein:vir:24 441 WDEEEAAMGLGLLGTMVDADPTVPGSPNPTPAPKPQPAI--------EGGDSA 485 (485) T ss_pred HHHHHhhhhhhHHHhhcccCCCCCCCCCCCCCCCCccCC--------CCCCCC Confidence 00000 00001111111100000000000000 001111 No 146 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.07 E-value=3.5e-10 Score=72.41 Aligned_cols=419 Identities=11% Similarity=0.001 Sum_probs=174.1 Q ss_pred CCCCcc-----------cccccchhhh-cccCc----cccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021537. 1 MSKAEE-----------TTQLDERHIA-TDVGR----GIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHP 64 (602) Q Consensus 1 ~~k~~~-----------~~~~~~~~~~-~~~~~----~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~ 64 (602) |.+.+. ..+...+... +.-|. 
.+.+ --|..++.+.....|...||+.+++.+.--||.+- T Consensus 12 l~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~-~~p~~~r~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~--- 87 (474) T protein:vir:81 12 LSNDENALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGT-LIPPQYFNLGLVLGWTGKAVDALARRCNLEGFVWP--- 87 (474) T ss_pred CChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccc-cccHHHHHHHhhcChHHHHHHHHHhhhcccceECC--- Confidence 222211 0110111111 10111 1111 11345565544567889999999998888887541 Q ss_pred CCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCce-EEEEEeCccccc Q lcl|NC_021537. 65 SADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTP-VGLAHVPAATVR 143 (602) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~-~~L~~l~p~~v~ 143 (602) +.+. .+ ..+...+. .-++......+..+.+++|.+|+.+..+.+|.+ ..+.+++|..+. T Consensus 88 d~~~--~~----~~l~~iw~--------------~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~ 147 (474) T protein:vir:81 88 DGDL--DS----LGGTEVVD--------------DNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEAT 147 (474) T ss_pred CCCc--cc----hHHHHHHH--------------hcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEE Confidence 1111 11 11111121 113456677888899999999999988777764 467888999887 Q ss_pred ccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeeccc--ceEEecCceeEEechhHEEEecCC Q lcl|NC_021537. 144 VRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKET--GEVASDAGELKNGPANELIFLPNP 221 (602) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~--g~~~~~~~~~~~~~~~eviH~r~~ 221 (602) +..|...-... .++.....+..+......-..++....+.... +.+.... ....+. -.|++|.+. 
T Consensus 148 ~~~D~~~~~~~-----------~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~-~~~~~g-vPvV~~~n~ 214 (474) T protein:vir:81 148 GEWNRRRRGLN-----------NLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDR-DEHVYG-VPAQVLPYK 214 (474) T ss_pred EEEeCCCCcce-----------eeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeecc-CCCCCC-cceEEeccc Confidence 65443211110 11111111111100000000111111111111 1111110 011111 237888877 Q ss_pred CCCCCcccccHHH----HHHHHHHHHHHHHHHHHHHHHhcCCCceEEE-eccccCCH---HHHHHHHHHHHHhhcccccC Q lcl|NC_021537. 222 SPLALYYGVPDWV----AAMQTMGADQAAKEWNHDVFDNLGIPHYAVK-VTGGTLSE---DSKEDLRNLMDNLKGSRYRT 293 (602) Q Consensus 222 ~~~~~~~G~spl~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~-~~~~~~~~---~~~~~l~~~~~~~~g~~nag 293 (602) ...+..+|.|.+. .+...+.....-......| .+.|.-++. .......+ .....++.... T Consensus 215 ~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~---~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~--------- 282 (474) T protein:vir:81 215 PAPKRPFGQSRITKPMMGLQDAGVRELARREGHMDV---FSYPEFWLLGADESALKNADGTIKSVWEARLG--------- 282 (474) T ss_pred ccccCcCCccccchhHHHHHHHHHHHHHHHHHHHHH---hcchhheeecCChhhcccccccccchhhhhHH--------- Confidence 6677778988652 3333333222222223333 334444442 11000010 11122222222 Q ss_pred cceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHH Q lcl|NC_021537. 294 AILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFA 372 (602) Q Consensus 294 ~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~ 372 (602) +++.++.+.+... ....+.++-.+... +++ |++.++..+..||+.=++|++.+|+.+..|-++++....... 
T Consensus 283 ~i~~~~~d~d~~~----~~~~~~~~~q~~~a---~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~ 355 (474) T protein:vir:81 283 RIKGLPDDADADI----PQLARADVKQFPAA---SPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQY 355 (474) T ss_pred HHhcCCCcccccc----cccccccccccCCC---ChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHH Confidence 2222222211100 01112233333332 333 889999999999999999999999876566666554322221 Q ss_pred HH--HHHHHHHHHHHHHhhh----cCCccc------cccceEEEeccchhcchhHHHHHHHHHHHHHHhCCccc--HHHH Q lcl|NC_021537. 373 KG--IIEPEQAKFSARLYKI----IHQDAL------DVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGT--VNEA 438 (602) Q Consensus 373 ~~--~l~P~~~~ie~~ln~~----Ll~~~~------~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T--~NE~ 438 (602) .. ...-..+.|...+.+. +.-... ....+.++..+.+.... .....++++.+++.+|..- ..=+ T Consensus 356 ~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~--s~a~~aDa~~Kl~~a~~~~~~~~~~ 433 (474) T protein:vir:81 356 ELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYL--SKSAQADAGMKQLAAVPWLAETEVG 433 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEecCCCcc--CHHHHHHHHHHHHhcccCCCcHHHH Confidence 11 1111122222222211 111000 00112333333332222 2234567888888887433 3446 Q ss_pred HHHhCCCCCCCCc-c-ccccccccccccccccCCCcCccccc Q lcl|NC_021537. 439 REELDLAPFEDDR-G-DMTLSEFEAEFGADASDGDAEAMLTR 478 (602) Q Consensus 439 R~~~Gl~p~~~g~-~-d~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (602) ++++|+.+-+-.. . ++--......+.. 
......++...+ T Consensus 434 ~~~lg~t~~~i~~~~~~~~~~~~~~~~~~-l~~~~~~~~~aq 474 (474) T protein:vir:81 434 LELIGLTPQQARRAMADKRRVQGRGTLQA-LIDRSNNGATAQ 474 (474) T ss_pred HhhcCCCHHHHHHHHHHHHHHhHHHHHHH-HHhcCCCCCCCC Confidence 8888987532110 0 0000000000000 000000000000 No 147 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.05 E-value=4.1e-10 Score=72.05 Aligned_cols=433 Identities=14% Similarity=0.108 Sum_probs=173.0 Q ss_pred CCCCcc---------------ccccc----chhhhcccCcccc-CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_021537. 1 MSKAEE---------------TTQLD----ERHIATDVGRGIQ-PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEI 60 (602) Q Consensus 1 ~~k~~~---------------~~~~~----~~~~~~~~~~~i~-p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i 60 (602) |.++++ ..... .+++... ...-. +.--+..+++..-...+...||+.+++.+.-.||.+ T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~-~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~Gf~~ 79 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNELKSSKAYYDAE-RRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQELEGFRI 79 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhhccceec Confidence 111111 00000 1111110 01100 111123333332335688899999998877667765 Q ss_pred EEecCCCCcc-cchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC--------CCce Q lcl|NC_021537. 61 VAHPSADEPD-EGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG--------DGTP 131 (602) Q Consensus 61 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~--------~G~~ 131 (602) -...+..... .+.+..+.+..++. .-.+......+..+.+++|.||+.+.++. .|. 
T Consensus 80 ~~~~~~~~~~~~d~~~~~~l~~i~~--------------~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~- 144 (488) T protein:vir:23 80 PSANGEEPESGGENDPASELWDWWQ--------------ANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEV- 144 (488) T ss_pred cCCcccccccccchhHHHHHHHHHH--------------hcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCc- Confidence 4322222111 11122222222221 12456777888999999999999876542 222 Q ss_pred EEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEec Q lcl|NC_021537. 132 VGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGP 211 (602) Q Consensus 132 ~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 211 (602) ..+..++|..+.+..+...- ...-+..+++-..+...+.... ..++.........|.+.........+. T Consensus 145 ~~i~~~~p~~~~~~~d~~~~---------~~~~~~~~~~~~~~~~~~~~~~--y~~~~~~~~~~~~~~~~~~~~~~h~~g 213 (488) T protein:vir:23 145 PLIRVEPPTALYAEVDPRTR---------KVLYAIRAIYGADGNEIVSATL--YLPDTTMTWLRAEGEWEAPTSTPHGLE 213 (488) T ss_pred ceEEEeccceeEEEEecCCC---------ceEEEEEEEEecCCCcEEEEEE--EecCcEEEEEecCCceEeccccccCCC Confidence 24667788877655432111 0111111222111111111110 011111111122222222222223344 Q ss_pred hhHEEEecCCCCCCCcccccHHHHH-HHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHH--HHHHHHHHhhc Q lcl|NC_021537. 212 ANELIFLPNPSPLALYYGVPDWVAA-MQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKE--DLRNLMDNLKG 288 (602) Q Consensus 212 ~~eviH~r~~~~~~~~~G~spl~~~-~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~--~l~~~~~~~~g 288 (602) .=.|+||++.....+.+|.|.+... ...++....+..-......-.+.|..+|+ |..+++...+ .-...|+.. 
T Consensus 214 ~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~~~-- 289 (488) T protein:vir:23 214 MVPVIPISNRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIF--GAKPEELGINAETGQRMFDAY-- 289 (488) T ss_pred CcceEEeccccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHh--CCCcccccccccccchhhhhh-- Confidence 4457899877667778999977532 22233322222222222222333433332 2111111110 001111111 Q ss_pred ccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHH Q lcl|NC_021537. 289 SRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQT 368 (602) Q Consensus 289 ~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~ 368 (602) .++++.+++|.+ .++..+...+. -.|++.++..+.+|+..=++|+..+|.... |.++.++.. T Consensus 290 ---~~~v~~~~~g~~------------~~~~q~~~~~~--~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n~~Sg~Al~ 351 (488) T protein:vir:23 290 ---MARILAFEGGEG------------AHAEQFSAAEL--RNFVDALDALDRKAASYSGLPPQYLSSSSD-NPASAEAIK 351 (488) T ss_pred ---hhhhccCCCCCC------------ceeEecCCCCh--HHHHHHHHHHHHHHhcccCCCHHHhccccC-cchHHHHHH Confidence 223444444322 12222222221 237788888889999999999999875432 223333221 Q ss_pred H-------------HHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCC--cc Q lcl|NC_021537. 369 R-------------EFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAG--VG 433 (602) Q Consensus 369 ~-------------~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G--~~ 433 (602) . ..+...|+-++..+...++..-.+ ....+..++|.-... . 
+....++++.+++.+| ++ T Consensus 352 ~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~--~~~~~i~v~f~~~~~--~--s~~~~ada~~kl~~~g~~~~ 425 (488) T protein:vir:23 352 AAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGGDIP--TEYYRMETVWRDPST--P--TYAAKADAAAKLFANGAGLI 425 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc--hhhccceEEecCCCC--C--CHHHHHHHHHHHHhcccccC Confidence 1 111222222222222221211000 111234455532221 1 2233456677787765 78 Q ss_pred cHHHHHHHhCCCCCCCCccccccccc--------cccccccccCCCcCcccccccccccccccccccccc Q lcl|NC_021537. 434 TVNEAREELDLAPFEDDRGDMTLSEF--------EAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSV 495 (602) Q Consensus 434 T~NE~R~~~Gl~p~~~g~~d~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (602) +..-+++++|+-+-+-...+...... ....+.....+..+...+...+++ +...+ T Consensus 426 s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------e~~~a 488 (488) T protein:vir:23 426 PRERGWVDMGYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEAPVGEPPAP-------EPDAA 488 (488) T ss_pred CHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccCCCCCCCCCCCC-------CCCCC Confidence 88888999987442211111100000 000011111111111111111111 11111 No 148 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.05 E-value=1.1e-09 Score=69.64 Aligned_cols=428 Identities=15% Similarity=0.102 Sum_probs=171.7 Q ss_pred CCCCcccccccchhhhcccCc--ccc--CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGR--GIQ--PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESY 76 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~--~i~--p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~ 76 (602) +.+.=..+.........-+.+ -|. +...+..++++.-...+...||+.++..+...||.+- +. .... 
T Consensus 21 l~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~~~g~~~~---~~------~~~~ 91 (486) T protein:vir:42 21 MISAFEDASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQAVEGFRLG---DA------DEAD 91 (486) T ss_pred HHHHHHHHHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhhcccceecC---CC------chhH Confidence 111100111111111100111 111 1112344454433456888999999988877776531 11 1111 Q ss_pred HHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCc-------eEEEEEeCccccccccccc Q lcl|NC_021537. 77 QTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGT-------PVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 77 ~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~-------~~~L~~l~p~~v~~~~~~~ 149 (602) +.+..++.. -.+......+..+.+++|.||+.+.++..|. ...+..++|..+.+..+.. T Consensus 92 ~~~~~i~~~--------------N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d~~ 157 (486) T protein:vir:42 92 EELWQWWQA--------------NNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEIDPR 157 (486) T ss_pred HHHHHHHHh--------------cChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEeCC Confidence 222222221 1345667788999999999999887765332 2367778888887654432 Q ss_pred ccccccchhhhhcccCceeEEEEcCCcceeeccccccccc-ceeeecccceEEecCceeEEechhHEEEecCCCCCCCcc Q lcl|NC_021537. 150 TIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDD-KRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYY 228 (602) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~-~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~ 228 (602) .-. ..-+..++....+..... 
...|..+ ........|.+.........+..=.|++|++.....+.+ T Consensus 158 ~~~---------~~~~~~~~~~~~~~~~~~---~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~ 225 (486) T protein:vir:42 158 INR---------VSKAIRVAYDKEGNEIQA---ATLYTPMETIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLY 225 (486) T ss_pred CCC---------eEEEEEEEEecCCCeEEE---EEEEcCCcEEEEEecCCcEEeecceecCCCCceEEEeccccccCCCC Confidence 110 001111111111111110 1111111 111111222222222223334444688888766667789 Q ss_pred cccHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHH--HHHHHHHHhhcccccCcceeccCCccce Q lcl|NC_021537. 229 GVPDWVA-AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKE--DLRNLMDNLKGSRYRTAILEVEEFVDDH 305 (602) Q Consensus 229 G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~--~l~~~~~~~~g~~nag~~~~~~~g~~~~ 305 (602) |.|.+.. +...++....+..-........+.|..+|+ |..+.....+ .-...|+. ..++++.++.+ T Consensus 226 G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~---- 294 (486) T protein:vir:42 226 GTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIF--GIKPEEIGVDSETGQTLFDA-----YLARILAFEDA---- 294 (486) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhh--cCCccccccccccccchhhh-----hhchhcccCCC---- Confidence 9997653 222233333222222223333344444443 2111110000 00011111 12233333211 Q ss_pred eccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHH-------------HHH Q lcl|NC_021537. 306 GLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTR-------------EFA 372 (602) Q Consensus 306 ~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~-------------~f~ 372 (602) +.+|..+...+. -.|++..+..+.+++..=++|+..+|.... |-++.++... 
..+ T Consensus 295 ---------~~~~~q~~~~~~--e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~-n~~Sg~Al~~~~~~l~~ka~~~~~~f 362 (486) T protein:vir:42 295 ---------EGKIQQFSAAEL--ANFTNALDQIAKQVAAYTGLPPQYLSTAAD-NPASAEAIRAAESRLIKKVERKNLMF 362 (486) T ss_pred ---------CceEEeecccCH--HHHHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHHHH Confidence 123332222221 137788888888899989999998875432 2233322211 111 Q ss_pred HHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhC--CcccHHHHHHHhCCCCCCCC Q lcl|NC_021537. 373 KGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLA--GVGTVNEAREELDLAPFEDD 450 (602) Q Consensus 373 ~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~--G~~T~NE~R~~~Gl~p~~~g 450 (602) ...|+-+++.+....+..-. . .......++|.-... . +....++++.+++++ |+++..-+++++|+.+-+-. T Consensus 363 ~~~l~~~~~l~~~~~~~~~~-~-~d~~~i~v~w~~~~~--~--s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~ 436 (486) T protein:vir:42 363 GGAWEEAMRIAYRIMKGGDV-P-PDMLRMETVWRDPST--P--TYAAKADAATKLYGNGQGVIPRERARIDMGYSVKERE 436 (486) T ss_pred HHHHHHHHHHHHHHhcCCCc-c-ccceeeeEEecCCCC--C--CHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHH Confidence 22333333322222111100 0 011234455533221 1 223356778888876 77888888988888543211 Q ss_pred ccccc--------cccccccccccccCCCcCccccccccccccccccccccccc Q lcl|NC_021537. 451 RGDMT--------LSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVD 496 (602) Q Consensus 451 ~~d~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (602) ..... ........+.....+..+.+..+...++....... +. 
T Consensus 437 e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 486 (486) T protein:vir:42 437 EMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIESSGG----DA 486 (486) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCCCCC----CC Confidence 11110 00000000111111111110000011111011000 00 No 149 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=99.04 E-value=1.1e-09 Score=69.80 Aligned_cols=446 Identities=12% Similarity=0.032 Sum_probs=190.4 Q ss_pred CCCCcccccccchhhhcccCccccCCCC-HHHHHHHHhhhHHHHHHHHHHHHhhccC-----ceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYN-PETLAAFQELNETHQACIRKKSRYEAGY-----GFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~-~~~l~~~~~~~~~v~~cI~~ia~~ia~~-----~~~i~~~~~~~~~~~~~~ 74 (602) |.++......+.-..+.-+++.+.-... +...|.++..+|-|..||+.|++.+.-. |..|... .++.+. T Consensus 38 i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~~~pEVd~AideIvneaiv~d~~~~pV~v~l~----~~e~s~- 112 (533) T protein:vir:58 38 IPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDYTDPLISTVLDIIADECTIPNENGNIVDVVTK----DIELAK- 112 (533) T ss_pred ccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhccCcchhhHHHhhhceeeEecCCCceeEeecc----cccccH- Confidence 2222111111111111122222221111 3555677766799999999999887632 3333211 111222 Q ss_pred hHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeC-CCCceEEEEEeCccccccccccccccc Q lcl|NC_021537. 75 SYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVE-GDGTPVGLAHVPAATVRVRKTTTTIER 153 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~-~~G~~~~L~~l~p~~v~~~~~~~~~~~ 153 (602) .++.-+.+ + +++..=-..+++.|.+.|..|..++-+ .++.+.+|.+|||..|+..... 
T Consensus 113 ---~iK~kI~~------l-------ldf~~~~~~~fR~WYVDGriy~Hkiik~~k~GI~elr~lDPr~i~~vr~~----- 171 (533) T protein:vir:58 113 ---AILSYLDY------V-------INIEKNAYPIIRNMIKYGDMFLHILEKGSDGTIEKFQVVSPYIFSKRYNP----- 171 (533) T ss_pred ---HHHHHHHH------H-------hcchhhhhHHHHhhhhcceeEEEeccCCcccchhhheecCCeeeEEEEee----- Confidence 22221111 1 122222334455677789999998743 4566889999999999753211 Q ss_pred ccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCC-CCCCCcccccH Q lcl|NC_021537. 154 EDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNP-SPLALYYGVPD 232 (602) Q Consensus 154 ~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~-~~~~~~~G~sp 232 (602) ..+ .+++.+...+ ......+....+|.+.|+|+..- ...++.+++|- T Consensus 172 ------------------~t~--~eyyvy~~~~------------~~~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisy 219 (533) T protein:vir:58 172 ------------------ETD--TWYYVITDVY------------RNVVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSY 219 (533) T ss_pred ------------------ccc--eEEEeecccc------------cccccCccccccchhheeeeeeccccCCCCceehh Confidence 111 1122221111 11123344578899999999743 44566799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccC-CHHHHHHHHHHHHHhhc----ccccCcceeccCCc----- Q lcl|NC_021537. 233 WVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTL-SEDSKEDLRNLMDNLKG----SRYRTAILEVEEFV----- 302 (602) Q Consensus 233 l~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~-~~~~~~~l~~~~~~~~g----~~nag~~~~~~~g~----- 302 (602) |..|.+.+.....++...--|=-.-+.-+-|+.+.=+.+ ...+.+-++.....++. 
..+.|.+.-.-.-+ T Consensus 220 LhkAiKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sM 299 (533) T protein:vir:58 220 LESARAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESI 299 (533) T ss_pred hhHHHHHHHHHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhh Confidence 999977776666666655443222222233443322222 33333344444444332 11122221000000 Q ss_pred -cceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccC-HHHHHHHHHHHHHHHHH Q lcl|NC_021537. 303 -DDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRAN-SKEQTREFAKGIIEPEQ 380 (602) Q Consensus 303 -~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn-~e~~~~~f~~~~l~P~~ 380 (602) ..-++.--.-..+.+++.|.-.+.-+| +-.++..+.+.++++||.+.++..++.+.++ +.-.-.-| ...|.-+. T Consensus 300 lEDyWLpRReGgrgTEI~TLpGg~lgem---eDV~YF~kkLy~ALnVP~sRl~~e~~fgr~~eItRDEiKF-~KFI~rLR 375 (533) T protein:vir:58 300 LKDYFIPRRGDRRAVEIDILQGSKVDLA---EDVEYMLNRLISALKVPKAFIGYEGDVNAKNTLATQDIKF-NNTIKRIQ 375 (533) T ss_pred HhhhcccccCCCccceeeecCCCCCCcH---HHHHHHHHHHHHHhCCCeeecCCCCCCccchhhhHHHHHH-HHHHHHHH Confidence 000000000011223333332222223 4456678999999999999998655544443 21112224 56788899 Q ss_pred HHHHHHHhhhcCCccc-cccceEEEeccchhcchhHHHHHHHHHHHHHHh-CCc------------ccH-----HHHHHH Q lcl|NC_021537. 381 AKFSARLYKIIHQDAL-DVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRL-AGV------------GTV-----NEAREE 441 (602) Q Consensus 381 ~~ie~~ln~~Ll~~~~-~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~-~G~------------~T~-----NE~R~~ 441 (602) .+|.+.|...|+.... ...+|.+.|..+.-..-..+.+...+++..+-. .++ ||- .|.-+. 
T Consensus 376 ~rF~~ll~~qLilk~iit~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~tdei~~q~e~ie~ 455 (533) T protein:vir:58 376 GFFVEELERMVRMNKEFADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIPYDLKPQEEVAEA 455 (533) T ss_pred HHHHHHHhcccccccCcchhheeeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCChhhhHHHHHHHH Confidence 9999999887765432 224577777766544433444444443332211 011 121 111122 Q ss_pred hCCCCC-CC--Ccccccc---ccccccccccccCCCcCcccccccccccccc----cccc-cccccccccccchhhhh Q lcl|NC_021537. 442 LDLAPF-ED--DRGDMTL---SEFEAEFGADASDGDAEAMLTRSKAAPPLEN----KIGE-RDSVDVDVSKDPIEQTT 508 (602) Q Consensus 442 ~Gl~p~-~~--g~~d~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~-~~~~~~~~~~~~m~~~~ 508 (602) .+.+++ +. .+++... .+....+...+.........+....+..... +..+ .....-....++|.... T Consensus 456 E~~~~~~~~~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~~~~~~~~p~~~ 533 (533) T protein:vir:58 456 AGGGGLFDTGGFGEETTPADFLGERGSPIESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEETGGGEEELPFPEEE 533 (533) T ss_pred hhcCCCCCCCCcccccCCcccCccccCcccCCCChhhHhcccCCcccccccccccccchhhhhhcCCcccCCCCCCCC Confidence 222211 11 1111111 0111111111111111111111111111000 0000 00000011111111111 No 150 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=98.98 E-value=2.6e-09 Score=67.68 Aligned_cols=419 Identities=10% Similarity=0.030 Sum_probs=170.3 Q ss_pred CCCCccc---------------ccccchhhhcccCcccc--CCCCH----HHHHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_021537. 1 MSKAEET---------------TQLDERHIATDVGRGIQ--PPYNP----ETLAAFQELNETHQACIRKKSRYEAGYGFE 59 (602) Q Consensus 1 ~~k~~~~---------------~~~~~~~~~~~~~~~i~--p~~~~----~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~ 59 (602) +.+.+-. ..+....-.+.....|. ++... ..+.... .+.+...||+.++..+...+|. 
T Consensus 9 l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~-~~n~~~~iVd~~~~~l~~~gf~ 87 (479) T protein:vir:99 9 LSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLS-RKPWMGLMVNSFAQQLIVDGYR 87 (479) T ss_pred CChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHh-hcCcHHHHHHHHHhhccccccc Confidence 1111100 00100000111111111 11111 1222222 2357888999988877655653 Q ss_pred EEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEee-----CCCCceEEE Q lcl|NC_021537. 60 IVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILV-----EGDGTPVGL 134 (602) Q Consensus 60 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r-----~~~G~~~~L 134 (602) +- +. +....+..++.. | .+......+..+.+++|.||+.+-. +..|.+ .+ T Consensus 88 ~~---d~-------~~~~~~~~i~~~-----------N---~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~-~i 142 (479) T protein:vir:99 88 KT---GT-------NENAKGWDTWRL-----------N---QMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVA-RI 142 (479) T ss_pred CC---Cc-------hhhHHHHHHHHh-----------c---ChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCce-EE Confidence 21 11 111122222211 1 2456677888999999999987764 334443 57 Q ss_pred EEeCcccccccccccccccccchhhhhcccCceeEEEEcC-CcceeecccccccccceeeecccceEEecCceeEEechh Q lcl|NC_021537. 135 AHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQG-RRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPAN 213 (602) Q Consensus 135 ~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 213 (602) ..++|..+.+..+...... ...|..-.+. ...+++. .....+.....+.+.........+..= T Consensus 143 ~~~~p~~~~~iydd~~~~~-----------~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~h~~g~v 206 (479) T protein:vir:99 143 KCIDPRDAFAIWEDPYWDE-----------WPKYLLERQPNGQYWWWT-----EEDYSIFEFKQGKFIYRETVSHDYGHI 206 (479) T ss_pred EEechhheEEEecCCcccc-----------eeeEEEeecCceeEEEEe-----cceEEEEEecCCceeeccccccCCCCc Confidence 7788888875432211000 0011111110 0111111 011112222223222222222233344 Q ss_pred HEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccC Q lcl|NC_021537. 
214 ELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRT 293 (602) Q Consensus 214 eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag 293 (602) .|+||++.... ..+|.|.++.....++....+..-..+.+.-.+.|..+|. |..+.++.... ...|.- ..+ T Consensus 207 Pvv~f~n~~~~-~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~-~~~~~~-----~~~ 277 (479) T protein:vir:99 207 PFVRYVNVMDL-RGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWAT--GLMLPEGANAD-QEKMRF-----AQE 277 (479) T ss_pred ceEEeecCCCc-CcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhc--CCCcccccccc-hhcccc-----ccc Confidence 57888866443 3479999888777777766655555555566666665553 22121111100 000110 112 Q ss_pred cceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHH---- Q lcl|NC_021537. 294 AILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTR---- 369 (602) Q Consensus 294 ~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~---- 369 (602) +++..++ .+.++..+...+. -.+.+.++..+.+|+..=++|++.+|..++ ++.++... T Consensus 278 ~i~~~~~-------------~~~~~~q~~~~~~--~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n---~Sg~Al~~~~~~ 339 (479) T protein:vir:99 278 SMLISQN-------------EKASFGAIPAAPL--DGLLNAYKESLLEFLALAQLPPHIAGQIVN---VAADALAAGTRQ 339 (479) T ss_pred cceeecC-------------CCceEEEecccch--HHHHHHHHHHHHHHhccCCCCHHHcccccc---hHHHHHHHHHHH Confidence 2322221 1122322222111 236677788888999888999999986433 22222111 Q ss_pred ---------HHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHH Q lcl|NC_021537. 370 ---------EFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEARE 440 (602) Q Consensus 370 ---------~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~ 440 (602) ..+..+|+-+++.+.... ........+.+++.+.+..-. 
+....++++.+++.+|+++...+.+ T Consensus 340 l~~ka~~~~~~f~~al~~~~~l~~~~~-----~~~~~~~~~~i~~~w~~~~~~--s~~~~ad~~~kl~~ag~is~et~l~ 412 (479) T protein:vir:99 340 TMQKLFEKQATWKASHNQTMRLVNKIE-----GRTEEATDLDFTITWQDVTIQ--SLAQFADAWAKMVESLKIPAEGVWD 412 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHc-----CCCccccceeeeEEecCCCCC--CHHHHHHHHHHHHhcCCCCHHHHHH Confidence 111222222222222111 111111223444444333222 2234567788899999999988887 Q ss_pred Hh-CCCCCCCCccccc------cccccccc--cccc--cCCCcCcccc-cccccccccccccccccccccccccchhhhh Q lcl|NC_021537. 441 EL-DLAPFEDDRGDMT------LSEFEAEF--GADA--SDGDAEAMLT-RSKAAPPLENKIGERDSVDVDVSKDPIEQTT 508 (602) Q Consensus 441 ~~-Gl~p~~~g~~d~~------~~~~~~~~--~~~~--~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~ 508 (602) ++ |+++-+-...... ........ +.++ +.+..++... ++....+.+. ++ T Consensus 413 ~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~------------ 473 (479) T protein:vir:99 413 MIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTGEP-------AS------------ 473 (479) T ss_pred hcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCcch-------hc------------ Confidence 77 7754210000000 00000000 0000 0000000000 0000000000 11 Q ss_pred cchhhhhhhe Q lcl|NC_021537. 509 FSSSNLDEGL 518 (602) Q Consensus 509 v~ss~~~~~~ 518 (602) |.++- | T Consensus 474 ~~~~~----~ 479 (479) T protein:vir:99 474 LNKSG----A 479 (479) T ss_pred cCCCC----C Confidence 11110 0 No 151 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=98.93 E-value=1.2e-08 Score=63.99 Aligned_cols=373 Identities=10% Similarity=-0.011 Sum_probs=165.5 Q ss_pred CCCCcccccccchhhhcccCc-----cccCCCCHHHHHHHHh-hhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 
1 MSKAEETTQLDERHIATDVGR-----GIQPPYNPETLAAFQE-LNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~-----~i~p~~~~~~l~~~~~-~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) +.+.=...+...++...-+.+ .+.+- -|..++...+ ...+...+|+.+++.+.=-||+. + + T Consensus 9 L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~-~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~----~------d-- 75 (409) T protein:vir:16 9 LRFKLSVHKRRAEMRYEQYAMKHVDRFKGIT-IPQALSQQYRSILGWCAKGVDSLADRLVFREFEN----D------D-- 75 (409) T ss_pred HHHHHHHHhHHHHHHHHHHhccCchhhcchh-hhHHHHHHHhhhcChhHHHHHHhHhhcccccccC----c------c-- Confidence 111100011111111111111 11111 1233432222 23678889999888666556531 0 1 Q ss_pred hHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccc Q lcl|NC_021537. 75 SYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERE 154 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~ 154 (602) ..+...+. .-++......+..+.+++|.+|+.+..+.+|+| .+.+++|..+....+...... T Consensus 76 --~~l~~i~~--------------~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~-~i~~~sP~~~~~i~D~~~~~~- 137 (409) T protein:vir:16 76 --FTVNEIFE--------------ENNPDIFFDSTVLSALIASCSFTYISKGENDAV-RLQVIEATNATGIIDPITGLL- 137 (409) T ss_pred --hHHHHHHH--------------hcChhHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEeecccccc- Confidence 01111111 123456677888899999999999998888875 788899988875543321110 Q ss_pred cchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCce----eEEechhHEEEecCCCCCCCcccc Q lcl|NC_021537. 155 DGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGE----LKNGPANELIFLPNPSPLALYYGV 230 (602) Q Consensus 155 ~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~----~~~~~~~eviH~r~~~~~~~~~G~ 230 (602) . .++........+......-..+ ..+..+...++. ...+..=.|++|.+....+..+|. 
T Consensus 138 --------~--~a~~~~~~d~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~~~~~~G~ 200 (409) T protein:vir:16 138 --------T--EGYAVLERDENNNVVLEAHFLP-------DRTDYYYRDSRNNISIANPTGNPLLVPIIHRPDAVRPFGR 200 (409) T ss_pred --------e--eeeEEEEecCCCceEEEEEEec-------CcEEEEEecCccccceecCCCCcceEEecccccccccCCc Confidence 0 1111111111111000000011 111111111111 111222248888877666778999 Q ss_pred cHH----HHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCcccee Q lcl|NC_021537. 231 PDW----VAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHG 306 (602) Q Consensus 231 spl----~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~ 306 (602) |.+ ..+...+.....-......|| +.|.-++. |-..+.+..+.++... ++++.++...+ T Consensus 201 seI~~~v~~l~da~~r~~~~~~~~~e~~---a~pqr~i~--G~d~d~~~~~~~~~~~---------~~i~~~~~d~~--- 263 (409) T protein:vir:16 201 SRITRSGMYWQSNAKRTLERADVTAEFY---SFPQKYVT--GLSDDAEPMETWKATV---------SSMLQFTKDED--- 263 (409) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHh---cChhheeE--ecCCCCCccchhhhhh---------hHhhccCCCCC--- Confidence 855 334444443333333344443 44555543 2111111222222211 23333332111 Q ss_pred ccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHH---HHHHHHHHHHHHHHH Q lcl|NC_021537. 307 LGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQT---REFAKGIIEPEQAKF 383 (602) Q Consensus 307 ~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~---~~f~~~~l~P~~~~i 383 (602) ..+.++..+...+.. .|++.++..+..+|+.=++|++.+|.... |-++.++.. ..+.. 
...-..+.| T Consensus 264 ------g~~~~v~q~~~~~l~--~~~~~l~~~~~~~a~~s~lP~~~lg~~~~-NpsSa~Ai~a~~~~L~~-ka~~k~~~f 333 (409) T protein:vir:16 264 ------GDKPTLGQFTQPSMS--PFTEQLRTAAAGFAGETGLTLDDLGFVSD-NPSSVEAIKASHENLRL-AGRKAQRSL 333 (409) T ss_pred ------CCCceEEecCCCChh--HHHHHHHHHHHHHhhhcCCCHHHcccccC-chhHHHHHHHHHHHHHH-HHHHHHHHH Confidence 112233333333221 48999999999999999999999986543 334433321 11111 111111112 Q ss_pred HHHHhhh----cC--Ccc-cc-ccceEEEeccchhcchhH-HHHHHHHHHHHHHhCCc-c-cHHHHHHHhCCCCCC Q lcl|NC_021537. 384 SARLYKI----IH--QDA-LD-VDEWTIDFELRGAEQPEQ-DAKMAEQRVRAMRLAGV-G-TVNEAREELDLAPFE 448 (602) Q Consensus 384 e~~ln~~----Ll--~~~-~~-~~~~~~~f~~~~~~~~~~-d~~~~~~~~~~~~~~G~-~-T~NE~R~~~Gl~p~~ 448 (602) ...+.+. +. ... +. ...+.+++.+.+...... .....++++.+++.+|. + .-+-+++++|+..-+ T Consensus 334 g~~l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 334 GAGLLNVAYLAACLRDDVPYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred HHHHHHHHHHHHHHhcCCCccchhhccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 2221111 10 000 00 011223333333322221 22345788899999873 3 345679999996543 No 152 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.92 E-value=1.4e-08 Score=63.65 Aligned_cols=427 Identities=12% Similarity=0.031 Sum_probs=168.8 Q ss_pred CCC---Ccccccccc--hhhhcccCccccCC-----------CCHHHH--HHHHhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021537. 1 MSK---AEETTQLDE--RHIATDVGRGIQPP-----------YNPETL--AAFQELNETHQACIRKKSRYEAGYGFEIVA 62 (602) Q Consensus 1 ~~k---~~~~~~~~~--~~~~~~~~~~i~p~-----------~~~~~l--~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~ 62 (602) ++| ..+...+.- +++... --|.-. ...... .++ .+++.+.+|+..+..+.+-|+.+.. 
T Consensus 34 i~~~i~~~~~~~~~~~~~YY~g~--~~i~~~~~~~~~~~~~~~~~~~~~~~ri--~~n~~~~ivd~~~~yl~g~~~~~~~ 109 (503) T protein:vir:59 34 IQKLIDEHNPEPLLKGVRYYMCE--NDIEKKRRTYYDAAGQQLVDDTKTNNRT--SHAWHKLFVDQKTQYLVGEPVTFTS 109 (503) T ss_pred HHHHHHhhcHHHHHHHHHHhccc--cchhhccchhccccccccccccccccee--ecchHHHHHHHHHhhhhcCCeeecc Confidence 111 111111100 111100 001000 000000 012 2567889999999999999987632 Q ss_pred ecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccc Q lcl|NC_021537. 63 HPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATV 142 (602) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v 142 (602) . +.+..+.++.+. ...+......+.++.+.+|.+|+.+-.+.+|++ .+..++|..+ T Consensus 110 ~--------d~~~~~~l~~~~---------------~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~-~i~~~~p~~~ 165 (503) T protein:vir:59 110 D--------NKTLLEYVNELA---------------DDDFDDILNETVKNMSNKGIEYWHPFVDEEGEF-DYVIFPAEEM 165 (503) T ss_pred C--------cHHHHHHHHHHH---------------hcCHHHHHHHHHHHHhhCCeEEEEEeecCCCce-EEEEEcccee Confidence 1 111112222111 014566777788999999999999999888875 5888999887 Q ss_pred cccccccc-cccccchhhhhc----ccCceeEEEEcCCcceeecccccc--cccceeeecccceEEecCceeEEechhHE Q lcl|NC_021537. 143 RVRKTTTT-IEREDGEEVENI----ESGHGYVQVRQGRRRYFGEAGDRY--GDDKRFVDKETGEVASDAGELKNGPANEL 215 (602) Q Consensus 143 ~~~~~~~~-~~~~~~~~~~~~----~~~~~~~qi~~~~~~~~~~~~~~~--~~~~~~~~~~~g~~~~~~~~~~~~~~~ev 215 (602) -+..+... -...-....... .....++.+......+.+...... ........... 
......+....+..=.| T Consensus 166 ~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~vPi 244 (503) T protein:vir:59 166 IVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPR-PHMTKGGQAIGWGRVPI 244 (503) T ss_pred EEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccc-cceeecceeccCCccce Confidence 65433211 000000000000 000112222222221111111000 00000000000 00001111222223335 Q ss_pred EEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcc Q lcl|NC_021537. 216 IFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAI 295 (602) Q Consensus 216 iH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~ 295 (602) +.++.. ..|.|.+..+...++....+..-..+.+...+.|-.+++ |....+ .......+. ..++ T Consensus 245 v~~~nn-----~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~--g~~~~~--~~~~~~~~~-------~~~~ 308 (503) T protein:vir:59 245 IPFKNN-----EEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLK--NYDGEN--PKEFTANLR-------YHSV 308 (503) T ss_pred EEecCC-----CCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEee--cCCccc--cchhhhhhh-------cccc Confidence 555432 368888888777777666555555556677777766554 321111 111111111 1122 Q ss_pred eeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHH--------- Q lcl|NC_021537. 296 LEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKE--------- 366 (602) Q Consensus 296 ~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--------- 366 (602) +.++++.+ .++ +.... ....+....+...+.|...-++|..-.+.. 
.++- +..+ T Consensus 309 ~~~~~~~~------------~~~--l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~~-Sg~Ai~~~~~~l~ 371 (503) T protein:vir:59 309 IKVSGDGG------------VDT--LRAEI-PVDSAAKELERIQDELYKSAQAVDNSPETI-GGGA-TGPALENLYALLD 371 (503) T ss_pred eeccCCCc------------cee--EeccC-CHHHHHHHHHHHHHHHHHHhcccCCCcccc-cccc-cHHHHHHHHHHHH Confidence 32322211 111 11111 122344555555666655555553221111 1221 2111 Q ss_pred ----HHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHh Q lcl|NC_021537. 367 ----QTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREEL 442 (602) Q Consensus 367 ----~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~ 442 (602) .....+...|+-++..+...++..-.........+.+.|...-.. +....++.+.+++.+|+++...+.+++ T Consensus 372 ~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~----d~~~~~~~~~kl~~~GiiS~et~l~~l 447 (503) T protein:vir:59 372 LKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQ----NDSEIVQSLVQGVTGGIMSKETAVARN 447 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCCCC----CHHHHHHHHHHHHhCCCCchHHHHHhC Confidence 122223444444444444444432222212223356666443222 334456788899999999999999887 Q ss_pred CCCCCCCCccc--cccccccccc-cccccCCCcCccccccccccc--ccccccccccc Q lcl|NC_021537. 443 DLAPFEDDRGD--MTLSEFEAEF-GADASDGDAEAMLTRSKAAPP--LENKIGERDSV 495 (602) Q Consensus 443 Gl~p~~~g~~d--~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 495 (602) +. +++...+ +.-....... ......+...+........+. 
.........++ T Consensus 448 ~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 448 PF--VQDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPNAGAAESGGAGQVS 503 (503) T ss_pred CC--CCCHHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCCCCcccCCCCCCcC Confidence 65 3332211 1100000000 000000011111111110000 00011111111 No 153 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=98.89 E-value=2.6e-09 Score=67.66 Aligned_cols=408 Identities=11% Similarity=-0.016 Sum_probs=170.7 Q ss_pred CCCCcccccccch---hhhcccCccc--cCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhh Q lcl|NC_021537. 1 MSKAEETTQLDER---HIATDVGRGI--QPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGES 75 (602) Q Consensus 1 ~~k~~~~~~~~~~---~~~~~~~~~i--~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~ 75 (602) +.+.=.......+ ++.-. ...+ .+.--+..++++.-.+.+...||+..++.+...+|..- +.+ T Consensus 12 l~~~~~~~~~r~~~l~~Yy~G-~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~g~~~~----------d~~- 79 (441) T protein:vir:80 12 MYDRIQRLSSWHCCIEGYYEG-SNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLDWLGWTNG----------DGY- 79 (441) T ss_pred HHHHHHHHHHHHHHHHHHHhc-CCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhccccccCC----------ChH- Confidence 1100000000000 11100 0011 01111223333333456788888888887755555311 111 Q ss_pred HHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccccccc Q lcl|NC_021537. 76 YQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERED 155 (602) Q Consensus 76 ~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~ 155 (602) .+..++. .-++..+...+..+.+++|.||+.+.++.+|.+ .+..++|..+.+..+........ 
T Consensus 80 --~l~~i~~--------------~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~ 142 (441) T protein:vir:80 80 --GLDGVYA--------------ANRLATASCDVHLDALIFGLSFVAIIPHGDGTV-SVRPQSPKNCTGKFSADGSRLDA 142 (441) T ss_pred --HHHHHHH--------------hcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCce-EEEEEccceEEEEEeCCCCceeE Confidence 1222111 124677888899999999999999999888887 58889999887654332211100 Q ss_pred chh-hhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHH Q lcl|NC_021537. 156 GEE-VENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWV 234 (602) Q Consensus 156 ~~~-~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~ 234 (602) ... .........++.+..... ...+.....+.+.........+..=.|+||++....+..+|.|.+. T Consensus 143 ~~~~~~~~~~~~~~~~vy~~~~------------~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~ 210 (441) T protein:vir:80 143 GLVVQQTCDPEVVEAELLLPDV------------IVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEIT 210 (441) T ss_pred EEEEEEEecCceEEEEEEecCe------------EEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccch Confidence 000 000001111111110000 0011111122222212222334444588998776677789999765 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccc Q lcl|NC_021537. 235 A-AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSD 313 (602) Q Consensus 235 ~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~ 313 (602) . +...++.......-........+.|-.+|+ |..+++...+. ++. ..++++.++.+.+. 
T Consensus 211 ~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~--G~~~~~~~~~~----~~~-----~~~~i~~~~~~~~~--------- 270 (441) T protein:vir:80 211 RSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT--GVSADEFSQPG----WVL-----SMASVWAVDKDDDG--------- 270 (441) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcCceeeee--cCCccccccch----hhh-----cccccccCCCCCCC--------- Confidence 4 333344333333333333444455544443 43333322221 111 12233333322111 Q ss_pred cccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHH-------------HHHHHHHHHHH Q lcl|NC_021537. 314 VNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTR-------------EFAKGIIEPEQ 380 (602) Q Consensus 314 ~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~-------------~f~~~~l~P~~ 380 (602) ...++..+...+. -.|++.++..+..|+..-++|+..+|.... +-++.++... ..+...|+-.+ T Consensus 271 ~~~~~~~~~~~~~--~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~ 347 (441) T protein:vir:80 271 DTPNVGSFPVNSP--TPYSDQMRLLAQLTAGEAAVPERYFGFITS-NPPSGEALAAEESRLVKRAERRQTSFGQGWLSVG 347 (441) T ss_pred CcceeEecCccch--HHHHHHHHHHHHHHhcccCCCHHHhccCCC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1123322222111 237788888899999999999999886443 2223222211 11122222222 Q ss_pred HHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccH--HHHHHHhCCCCCCCCcccccccc Q lcl|NC_021537. 381 AKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTV--NEAREELDLAPFEDDRGDMTLSE 458 (602) Q Consensus 381 ~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~--NE~R~~~Gl~p~~~g~~d~~~~~ 458 (602) +.+...++...-... ......+.|... ... +....++++.+++.+|+++. .-+++.+|+.+-+- .+.... 
T Consensus 348 ~l~~~~~~~~~~~~~-~~~~i~~~f~~~--~~~--~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e~---~~~~~e 419 (441) T protein:vir:80 348 FLAAKALDSRVDEAD-FFGDVGLRWRDA--STP--TRAATADAVTKLVGAGILPADSRTVLEMLGLDDVQV---EAVMRH 419 (441) T ss_pred HHHHHHhcCCCcccc-cceeeeEEeCCC--CCc--CHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHHHH---HHHHHH Confidence 222222221111110 112334445433 222 33446677888999998654 34677777643211 110000 Q ss_pred ccccccccccCCCcCcccccccccccccccccccccccc Q lcl|NC_021537. 459 FEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDV 497 (602) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (602) . .-..+.. ... ....+.....+ T Consensus 420 ~--~e~~~~~-----~~~----------~~~~~~~~~~~ 441 (441) T protein:vir:80 420 R--AESSDPL-----AVL----------AGAISRQTNEV 441 (441) T ss_pred H--HHHHHHH-----HHH----------hhhhhcccccC Confidence 0 0000000 000 00000000000 No 154 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=98.89 E-value=5.4e-09 Score=65.91 Aligned_cols=426 Identities=10% Similarity=0.020 Sum_probs=174.6 Q ss_pred CCCC---cccccc---c--chhhhcccCccccCCCC-HHH--HHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCc Q lcl|NC_021537. 1 MSKA---EETTQL---D--ERHIATDVGRGIQPPYN-PET--LAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEP 69 (602) Q Consensus 1 ~~k~---~~~~~~---~--~~~~~~~~~~~i~p~~~-~~~--l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~ 69 (602) |+|- -.+.++ . .+.+...-......+.. ... 
..++ ..++...+|+..+..+.+-|+++...++.+ T Consensus 45 i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki--~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~-- 120 (502) T protein:vir:48 45 LKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRA--VHNYGRMISKFKTGYLAGNPIRVEYDDNED-- 120 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccccccee--ecchHHHHHHHHhhhhcccCeeEecCCccc-- Confidence 1111 001110 0 01111100011111100 000 1122 246778899999999999998876432211 Q ss_pred ccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccc Q lcl|NC_021537. 70 DEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~ 149 (602) .....+.+..++ ..-.+......+..+.+.+|.||+.+.++.+|.+ .+..++|..+.+.-+.. T Consensus 121 --~~~~~~~l~~~~--------------~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~~~p~~~~~vydd~ 183 (502) T protein:vir:48 121 --NSQNDDAIKRIG--------------RINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDET-RIKRLSPLETFVIYDNS 183 (502) T ss_pred --hhHHHHHHHHHH--------------hhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCce-EEEEEcccceEEEEcCC Confidence 111111122221 1224567888899999999999999989888875 57788888886543321 Q ss_pred c-cccc---cchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCC Q lcl|NC_021537. 150 T-IERE---DGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLA 225 (602) Q Consensus 150 ~-~~~~---~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~ 225 (602) . .... ..........+..++.+......+.+.. .+.+.........+..=.|++++.. 
T Consensus 184 ~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~~~--------------~~~~~~~~~~~~~~g~vPvv~~~nn---- 245 (502) T protein:vir:48 184 LEDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYTLDA--------------SDSFNEISVTPHAFGTVPITEFLNN---- 245 (502) T ss_pred CCCceEEEEEEEEEeecCCcEEEEEEEeCCeEEEEEe--------------CCceeeccceecCCCccceEEecCC---- Confidence 0 0000 0000000011111121211111111100 0000000111111222236666532 Q ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccce Q lcl|NC_021537. 226 LYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDH 305 (602) Q Consensus 226 ~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~ 305 (602) ..|+|.+..+...++....+..-..+.+.....|-.+++-......++....+++. + .+.+....... T Consensus 246 -~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~----------~-~~~~~~~~~~~ 313 (502) T protein:vir:48 246 -ADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRT----------R-LMQLKPPKSAD 313 (502) T ss_pred -CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhc----------c-eeecccccccc Confidence 36889898888777777666666667777777776665432221122222222111 1 11111100000 Q ss_pred eccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------------HHHHH Q lcl|NC_021537. 306 GLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TREFA 372 (602) Q Consensus 306 ~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~f~ 372 (602) +...+.+++.++... .+..+....+...+.|+..-++|+...+... ++- +.++. 
....+ T Consensus 314 -----~~~~~~d~~~l~~~~-~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~-~n~-Sg~Alk~~~~~l~~k~~~~~~~~ 385 (502) T protein:vir:48 314 -----GKEGTVKAEYLTKSY-DVSGAEAYKTRLNKDIHVFTNTPDMSDNHFS-GNA-SGEALKYKLFGLDQDRVDTQSQF 385 (502) T ss_pred -----ccccCcceeEeeecC-CHHHHHHHHHHHHHHHHHHhCCCCcCccccc-cCc-hHHHHHHHHHHHHHHHHHHHHHH Confidence 011112222222111 1234556678888999999999875543321 222 22221 11223 Q ss_pred HHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcc Q lcl|NC_021537. 373 KGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRG 452 (602) Q Consensus 373 ~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~ 452 (602) ...|+-+++.+...++..--.......+..+.|...-.. +....++++.++ .|+++..-+.+++++ +++... T Consensus 386 ~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~----d~~e~a~~~~kl--~g~iS~et~l~~l~~--v~D~~~ 457 (502) T protein:vir:48 386 TQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPK----SLYEQVSILNDL--GGQVSQETALSLSGL--VENPTE 457 (502) T ss_pred HHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCc----CHHHHHHHHHHH--hccCcHHHHHHhCCC--CCCHHH Confidence 334444444444444432111111223445666433221 334455667766 589998888888865 232211 Q ss_pred --ccccccccc-cc-cccccCCCcCcccccccccccccccccccccccccccc Q lcl|NC_021537. 453 --DMTLSEFEA-EF-GADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSK 501 (602) Q Consensus 453 --d~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (602) .++...... .. 
.......+..+...+...+.+.+ ......+ T Consensus 458 E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~--------~~~~~~~ 502 (502) T protein:vir:48 458 ELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKETHTD--------DFERVYE 502 (502) T ss_pred HHHHHHHHHHhhhhhcccccccccccccCCCccCCCCc--------CcCCCCC Confidence 111000000 00 00000000000000000000000 0000000 No 155 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.87 E-value=9.9e-10 Score=69.96 Aligned_cols=406 Identities=12% Similarity=0.054 Sum_probs=158.7 Q ss_pred CCCCcccccccchh---hhcccCccc-cCCCCHHHHHHHHh--hhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEETTQLDERH---IATDVGRGI-QPPYNPETLAAFQE--LNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~~~~~~~~---~~~~~~~~i-~p~~~~~~l~~~~~--~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) +.+.=+......+. +...-.... -+.--+..++...+ .+.+...+|+..+..+.+-|+.+....+ .+ T Consensus 13 l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~d-------~~ 85 (456) T protein:vir:79 13 LTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSAD-------SD 85 (456) T ss_pred HHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecCCCCC-------cc Confidence 11110000001111 111000111 01111222333221 2358899999999999999987532111 11 Q ss_pred hHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccc Q lcl|NC_021537. 75 SYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERE 154 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~ 154 (602) ..+.+..++.. -.+..+.+.+..+.+++|.||+.+-.+.+|.+ .+..++|..+.+..+...... 
T Consensus 86 ~~~~~~~~~~~--------------n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~-~i~~~~p~~~~~i~d~~~~~~- 149 (456) T protein:vir:79 86 LALRARRIWRD--------------NRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWR- 149 (456) T ss_pred HHHHHHHHHHh--------------cChhHHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEeccceeEEEEcCCCCCc- Confidence 11222222211 13456777899999999999998888888887 578889988876543211100 Q ss_pred cchhhhhcccCceeEEEEcCCcceeeccc-ccccccc--eeee-cccc-eEEecCceeE-------EechhHEEEecCCC Q lcl|NC_021537. 155 DGEEVENIESGHGYVQVRQGRRRYFGEAG-DRYGDDK--RFVD-KETG-EVASDAGELK-------NGPANELIFLPNPS 222 (602) Q Consensus 155 ~~~~~~~~~~~~~~~qi~~~~~~~~~~~~-~~~~~~~--~~~~-~~~g-~~~~~~~~~~-------~~~~~eviH~r~~~ 222 (602) ..-...|+...++...+...+. +...... .+.. ..+. ......+... .+..-.|+++. T Consensus 150 -------~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~--- 219 (456) T protein:vir:79 150 -------IRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ--- 219 (456) T ss_pred -------eEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEec--- Confidence 0000111111111100000000 0000000 0000 0000 0000011000 00111122221 Q ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCH----HHHHHH--HHHHHHhhcccccCcce Q lcl|NC_021537. 223 PLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSE----DSKEDL--RNLMDNLKGSRYRTAIL 296 (602) Q Consensus 223 ~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~----~~~~~l--~~~~~~~~g~~nag~~~ 296 (602) ...|+|.++.....++....+..-........+.|..++. |...+. +.-+.+ .+.|.. 
..+.++ T Consensus 220 ---N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~--G~~~~~~~~d~~g~~i~~~~~~~~-----~~~~~~ 289 (456) T protein:vir:79 220 ---NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK--SSEHRLPKVDENGNAIDYASIFEA-----APGALW 289 (456) T ss_pred ---CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHh--cCCcccccccccccccchhhhhhh-----hccccc Confidence 2357777776655554433332222222222223322221 111110 000000 011111 112233 Q ss_pred eccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccC--HHHHHHHH--- Q lcl|NC_021537. 297 EVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRAN--SKEQTREF--- 371 (602) Q Consensus 297 ~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn--~e~~~~~f--- 371 (602) .++++.+ +..+...+. -.|.+.++..+.+|++.-++|+..+|... +|-|. ++.....+ T Consensus 290 ~~~~~~~--------------~~q~~~~~~--~~~~~~l~~~i~~i~~~t~~p~~~~~~~~-~N~Sg~Al~~~~~~l~~k 352 (456) T protein:vir:79 290 ELPPGVD--------------IWESQTNDF--TPMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFLFK 352 (456) T ss_pred cCCCCcc--------------eeeecccCh--HHHHHHHHHHHHHHHhhcCCChhHhcccc-cCcHHHHHHHHHHHHHHH Confidence 3333322 222222221 23788899999999999999999997432 22221 11111111 Q ss_pred -------HHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCC Q lcl|NC_021537. 372 -------AKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDL 444 (602) Q Consensus 372 -------~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl 444 (602) +..+|+-+++.+.. +.... ......+.|.-... . +....++++.+++.+|+++..-+++++|+ T Consensus 353 ~~~~~~~f~~~l~~~~~l~~~-----~~g~~-~~~~i~v~w~~~~~--~--s~~~~ada~~kl~~~G~~~~~~~~~~lg~ 422 (456) T protein:vir:79 353 CEDRLSIAKIGLEAILVKALQ-----IEGES-VEDTVDVSFESPDR--V--TLGEKYSAASLAKAAGESWASIRRNILNY 422 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHH-----hcCCC-ccccceEEeCCCCC--c--CHHHHHHHHHHHHhcCCChHHHHHhcCCC Confidence 11222222221111 11111 11234455533221 1 23446778888999999999888899998 Q ss_pred CCCCCC--ccccccccccccccccccCCCcCccccccccccccc Q lcl|NC_021537. 
445 APFEDD--RGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLE 486 (602) Q Consensus 445 ~p~~~g--~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (602) .+-+-. +.++.-.......+.-.+.+++++. . T Consensus 423 ~~~~i~~~e~~r~~~e~~~~~~~~~~~~~~~~~----------~ 456 (456) T protein:vir:79 423 NADQIKQDDLDRAREQITLFAGNPVQRPQEDGS----------R 456 (456) T ss_pred CHHHHHHHHHHHHHHHHHHHhhhHhhcCCCCCC----------C Confidence 653211 0111000000000000000000000 0 No 156 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.86 E-value=4.6e-09 Score=66.31 Aligned_cols=412 Identities=12% Similarity=0.058 Sum_probs=163.3 Q ss_pred CCCCcc------------cccccchhhhccc-Cc-cc--cCCCCHHHHHHHHh--hhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021537. 1 MSKAEE------------TTQLDERHIATDV-GR-GI--QPPYNPETLAAFQE--LNETHQACIRKKSRYEAGYGFEIVA 62 (602) Q Consensus 1 ~~k~~~------------~~~~~~~~~~~~~-~~-~i--~p~~~~~~l~~~~~--~~~~v~~cI~~ia~~ia~~~~~i~~ 62 (602) |..... ......+....-+ |. .+ -|+--+..++.+.+ .+.+...+|+..+..+.+-|+.+.. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 111110 0000111111001 11 11 12112233333211 2568899999999999999987532 Q ss_pred ecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccc Q lcl|NC_021537. 63 HPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATV 142 (602) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v 142 (602) ..+. +....+..++.. 
-....+...+..+.+++|.||..+-.+.+|.+ .+..++|..+ T Consensus 81 ~~d~-------~~~~~~~~i~~~--------------N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~-~i~~~~p~~~ 138 (456) T protein:vir:10 81 SADS-------DLALRARRIWRD--------------NRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETM 138 (456) T ss_pred CCCc-------chHHHHHHHHHh--------------cChhhHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEEcccee Confidence 1111 111222222211 13455667788999999999998888888876 4677888888 Q ss_pred cccccccccccccchhhhhcccCceeEEEEcCCcceeecc-----------ccccccc-ceeeecccceEEecCceeEEe Q lcl|NC_021537. 143 RVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEA-----------GDRYGDD-KRFVDKETGEVASDAGELKNG 210 (602) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~-----------~~~~~~~-~~~~~~~~g~~~~~~~~~~~~ 210 (602) .+..+...... ..-...|+...++...+...+ ...+... ........+.+...+.....+ T Consensus 139 ~~i~d~~~~~~--------~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (456) T protein:vir:10 139 VVSVDPLQPWR--------IRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTG 210 (456) T ss_pred EEEEcCCCCcc--------eEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCC Confidence 76544321100 000111111111111111100 0000000 000000111111111100000 Q ss_pred chhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecccc--CCHHHHHH--HHHHHHHh Q lcl|NC_021537. 211 PANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGT--LSEDSKED--LRNLMDNL 286 (602) Q Consensus 211 ~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~--~~~~~~~~--l~~~~~~~ 286 (602) ..-.|+++ .+ ..|+|.++.....++....+..-........+.|..++.-.... ..++.-.. ....|+.. 
T Consensus 211 ~~~pvv~~--~N----~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~ 284 (456) T protein:vir:10 211 SPPPVVVY--QN----PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAA 284 (456) T ss_pred CceeEEEe--cC----CCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhh Confidence 01112222 12 35788877776666554443333222223333333333211000 00000000 01112111 Q ss_pred hcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHH Q lcl|NC_021537. 287 KGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKE 366 (602) Q Consensus 287 ~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~ 366 (602) .++++.++++.+ +..+...+. -.|.+.++..+.+|++.=++|+..+|... +|- +.++ T Consensus 285 -----~~~~~~~~~~~~--------------~~q~~~~~~--~~~~~~l~~~i~~~~~~s~~p~~~~~~~~-~N~-Sg~A 341 (456) T protein:vir:10 285 -----PGALWELPPGVD--------------IWESQANDF--TPMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQ-SAEG 341 (456) T ss_pred -----ccccccCCCCcc--------------eEEecccCh--hHHHHHHHHHHHHHHhccCCChHHhcccc-cCh-HHHH Confidence 123333333332 222222111 23788899999999999999999997532 222 2222 Q ss_pred H---HHHHHHHHHHHHHHHHHHHHhhhcC----Ccc-ccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHH Q lcl|NC_021537. 367 Q---TREFAKGIIEPEQAKFSARLYKIIH----QDA-LDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEA 438 (602) Q Consensus 367 ~---~~~f~~~~l~P~~~~ie~~ln~~Ll----~~~-~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~ 438 (602) . ...+.. -..-..+.|...|.+.+. ... ....+..+.|.-... . 
+....++++.+++.+|+++..-+ T Consensus 342 i~~~~~~l~~-k~~~~~~~f~~~l~~~~rl~~~~~g~~~~~~~~v~w~~~~~--~--~~~~~ada~~kl~~~gi~~~~~~ 416 (456) T protein:vir:10 342 AHNIEKGFLF-KCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDR--V--TLGEKYSAASLAKAAGESWASIR 416 (456) T ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCC--c--CHHHHHHHHHHHHHcCCChHHHH Confidence 1 111111 111111222222211110 000 111234455533221 2 23445678888999999999889 Q ss_pred HHHhCCCCCCCC--ccccccccccccccccccCCCcCccccccccccccc Q lcl|NC_021537. 439 REELDLAPFEDD--RGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLE 486 (602) Q Consensus 439 R~~~Gl~p~~~g--~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (602) ++++|+.+-+-. +.++.-.......+...+.++.++. + T Consensus 417 ~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~----------~ 456 (456) T protein:vir:10 417 RNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGS----------R 456 (456) T ss_pred HhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCC----------C Confidence 999998653111 1111000000000000000000000 0 No 157 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.86 E-value=4.6e-09 Score=66.31 Aligned_cols=412 Identities=12% Similarity=0.058 Sum_probs=163.3 Q ss_pred CCCCcc------------cccccchhhhccc-Cc-cc--cCCCCHHHHHHHHh--hhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021537. 1 MSKAEE------------TTQLDERHIATDV-GR-GI--QPPYNPETLAAFQE--LNETHQACIRKKSRYEAGYGFEIVA 62 (602) Q Consensus 1 ~~k~~~------------~~~~~~~~~~~~~-~~-~i--~p~~~~~~l~~~~~--~~~~v~~cI~~ia~~ia~~~~~i~~ 62 (602) |..... ......+....-+ |. .+ -|+--+..++.+.+ .+.+...+|+..+..+.+-|+.+.. 
T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 111110 0000111111001 11 11 12112233333211 2568899999999999999987532 Q ss_pred ecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccc Q lcl|NC_021537. 63 HPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATV 142 (602) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v 142 (602) ..+. +....+..++.. -....+...+..+.+++|.||..+-.+.+|.+ .+..++|..+ T Consensus 81 ~~d~-------~~~~~~~~i~~~--------------N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~-~i~~~~p~~~ 138 (456) T protein:vir:10 81 SADS-------DLALRARRIWRD--------------NRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETM 138 (456) T ss_pred CCCc-------chHHHHHHHHHh--------------cChhhHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEEcccee Confidence 1111 111222222211 13455667788999999999998888888876 4677888888 Q ss_pred cccccccccccccchhhhhcccCceeEEEEcCCcceeecc-----------ccccccc-ceeeecccceEEecCceeEEe Q lcl|NC_021537. 143 RVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEA-----------GDRYGDD-KRFVDKETGEVASDAGELKNG 210 (602) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~-----------~~~~~~~-~~~~~~~~g~~~~~~~~~~~~ 210 (602) .+..+...... ..-...|+...++...+...+ ...+... 
........+.+...+.....+ T Consensus 139 ~~i~d~~~~~~--------~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (456) T protein:vir:10 139 VVSVDPLQPWR--------IRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTG 210 (456) T ss_pred EEEEcCCCCcc--------eEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCC Confidence 76544321100 000111111111111111100 0000000 000000111111111100000 Q ss_pred chhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecccc--CCHHHHHH--HHHHHHHh Q lcl|NC_021537. 211 PANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGT--LSEDSKED--LRNLMDNL 286 (602) Q Consensus 211 ~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~--~~~~~~~~--l~~~~~~~ 286 (602) ..-.|+++ .+ ..|+|.++.....++....+..-........+.|..++.-.... ..++.-.. ....|+.. T Consensus 211 ~~~pvv~~--~N----~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~ 284 (456) T protein:vir:10 211 SPPPVVVY--QN----PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAA 284 (456) T ss_pred CceeEEEe--cC----CCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhh Confidence 01112222 12 35788877776666554443333222223333333333211000 00000000 01112111 Q ss_pred hcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHH Q lcl|NC_021537. 287 KGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKE 366 (602) Q Consensus 287 ~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~ 366 (602) .++++.++++.+ +..+...+. -.|.+.++..+.+|++.=++|+..+|... 
+|- +.++ T Consensus 285 -----~~~~~~~~~~~~--------------~~q~~~~~~--~~~~~~l~~~i~~~~~~s~~p~~~~~~~~-~N~-Sg~A 341 (456) T protein:vir:10 285 -----PGALWELPPGVD--------------IWESQANDF--TPMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQ-SAEG 341 (456) T ss_pred -----ccccccCCCCcc--------------eEEecccCh--hHHHHHHHHHHHHHHhccCCChHHhcccc-cCh-HHHH Confidence 123333333332 222222111 23788899999999999999999997532 222 2222 Q ss_pred H---HHHHHHHHHHHHHHHHHHHHhhhcC----Ccc-ccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHH Q lcl|NC_021537. 367 Q---TREFAKGIIEPEQAKFSARLYKIIH----QDA-LDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEA 438 (602) Q Consensus 367 ~---~~~f~~~~l~P~~~~ie~~ln~~Ll----~~~-~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~ 438 (602) . ...+.. -..-..+.|...|.+.+. ... ....+..+.|.-... . +....++++.+++.+|+++..-+ T Consensus 342 i~~~~~~l~~-k~~~~~~~f~~~l~~~~rl~~~~~g~~~~~~~~v~w~~~~~--~--~~~~~ada~~kl~~~gi~~~~~~ 416 (456) T protein:vir:10 342 AHNIEKGFLF-KCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDR--V--TLGEKYSAASLAKAAGESWASIR 416 (456) T ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCC--c--CHHHHHHHHHHHHHcCCChHHHH Confidence 1 111111 111111222222211110 000 111234455533221 2 23445678888999999999889 Q ss_pred HHHhCCCCCCCC--ccccccccccccccccccCCCcCccccccccccccc Q lcl|NC_021537. 439 REELDLAPFEDD--RGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLE 486 (602) Q Consensus 439 R~~~Gl~p~~~g--~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (602) ++++|+.+-+-. +.++.-.......+...+.++.++. 
+ T Consensus 417 ~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~----------~ 456 (456) T protein:vir:10 417 RNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGS----------R 456 (456) T ss_pred HhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCC----------C Confidence 999998653111 1111000000000000000000000 0 No 158 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=98.85 E-value=7.5e-09 Score=65.11 Aligned_cols=429 Identities=10% Similarity=0.027 Sum_probs=175.1 Q ss_pred CCCCc---cccc---ccc--hhhhcccCccccCCCCH-HH--HHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCc Q lcl|NC_021537. 1 MSKAE---ETTQ---LDE--RHIATDVGRGIQPPYNP-ET--LAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEP 69 (602) Q Consensus 1 ~~k~~---~~~~---~~~--~~~~~~~~~~i~p~~~~-~~--l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~ 69 (602) |+|.= ...+ +.. +.+...-.....++... .. ..++ ..++...+|+..+..+.+-|+++...++.+ T Consensus 44 i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri--~~n~~k~Ivd~~~~yl~g~p~~~~~~~~~~-- 119 (501) T protein:vir:96 44 LKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRA--VHNYGRMISKFKTGYLAGNPIRVEYDDNDD-- 119 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCccccCcccccccee--ecchHHHHHHHHhhhhcccCeeEeeCCccc-- Confidence 11110 0010 000 11111001111111111 11 1122 256788899999999999898875432211 Q ss_pred ccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccc Q lcl|NC_021537. 70 DEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~ 149 (602) .......+..++. .-.+......+..+.+.+|.||+.+.++.+|.+ .+..++|..+.+..+.. 
T Consensus 120 --~~~~~~~l~~~~~--------------~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~~~p~~~~~v~d~~ 182 (501) T protein:vir:96 120 --NSQNDDAIKRIGR--------------INDLDSLNRTLIRDLSQTGRAYEVIYRSEYDET-RIKRLSPLETFVIYDNS 182 (501) T ss_pred --hhHHHHHHHHHHH--------------hcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCce-EEEEEccceeEEEEcCC Confidence 1111222222221 234567788899999999999999999888875 57788998887554321 Q ss_pred ccccccchhhhhcccCceeEEEEcCCcceeecccccccccce-eeecccceEEecCceeEEechhHEEEecCCCCCCCcc Q lcl|NC_021537. 150 TIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKR-FVDKETGEVASDAGELKNGPANELIFLPNPSPLALYY 228 (602) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~-~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~ 228 (602) .. + ...-+..|+......... .+...+..+.. ..... +.+.........+..=.|++|+.. .. T Consensus 183 ~~----~----~~~~~v~~~~~~~~~~~~--~~~~vyt~~~i~~~~~~-~~~~~~~~~~~~~g~vPvv~~~nn-----~~ 246 (501) T protein:vir:96 183 LE----D----NSIAAVRYYNRGTLQSAK--DVVEIYTDEHIYTLDAS-DDFNEISVTTHAFGTVPITEYLNN-----ID 246 (501) T ss_pred CC----C----ceEEEEEEEEeecCCCcE--EEEEEEcCCcEEEEeeC-CCceeccccccCCCccceEEecCC-----cc Confidence 10 0 000111122211111100 00000100000 00000 000000011111222236776532 36 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecc Q lcl|NC_021537. 229 GVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLG 308 (602) Q Consensus 229 G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~ 308 (602) |.|.+..+...++....+..-..+.+...+.|-.+++-.......+....++. .+.+.+...+. .. 
T Consensus 247 g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~----------~~~~~~~~~~~-~~--- 312 (501) T protein:vir:96 247 GIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKR----------TRLMQLKPPKS-AD--- 312 (501) T ss_pred CCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhh----------cCeeeeccccc-cc--- Confidence 88988888777777666666666666766777666542211111111111111 11111111110 00 Q ss_pred ccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------------HHHHHHHH Q lcl|NC_021537. 309 DGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TREFAKGI 375 (602) Q Consensus 309 ~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~f~~~~ 375 (602) ....+.+++| ++.. ..+..+....+...+.|...-++|....+... ++-|. ++. ....+..+ T Consensus 313 ~~~~~~~~~~--l~~~-~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg-~Al~~~~~~l~~ka~~~~~~~~~~ 387 (501) T protein:vir:96 313 GKEGTVKAEY--LTKS-YDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFS-GNTSG-EALKYKLFGLDQDRVDTQSQFTKG 387 (501) T ss_pred ccccCcceee--Eecc-CCHHHHHHHHHHHHHHHHHHhCCcccCccccc-ccchH-HHHHHHHHHHHHHHHHHHHHHHHH Confidence 0011122222 2211 12344667778888889888899865554322 22222 211 11223334 Q ss_pred HHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccc-- Q lcl|NC_021537. 376 IEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGD-- 453 (602) Q Consensus 376 l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d-- 453 (602) |+-+++.+...++..--..........+.|...-.. +....++.+.++ .|+++..-+.++++. 
+++...+ T Consensus 388 l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~----n~~e~ad~~~kl--~g~iS~et~~~~l~~--v~D~~~E~~ 459 (501) T protein:vir:96 388 LKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPK----SLNEQVSILTGL--GGQVSQETALSLSGL--VESPNEELD 459 (501) T ss_pred HHHHHHHHHHHHHhcccccccccccceEEeCCCCCc----CHHHHHHHHHHH--hccCchHHHHHhCCC--CCCHHHHHH Confidence 444444444443322111111223355666543222 334455667776 488998888888764 3322211 Q ss_pred cccccccc-cc-cccccCCCcCcccccccccccccccccccc Q lcl|NC_021537. 454 MTLSEFEA-EF-GADASDGDAEAMLTRSKAAPPLENKIGERD 493 (602) Q Consensus 454 ~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (602) +......- .. ....+..+..+..++...+.+.+.+..+.. T Consensus 460 ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 460 KINKEMSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFEREYE 501 (501) T ss_pred HHHHHHHHhhccccccchhhcccccCCcCCCCCCCccccccC Confidence 11100000 00 000000011111111111111111111111 No 159 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=98.84 E-value=2.7e-08 Score=62.05 Aligned_cols=431 Identities=14% Similarity=0.098 Sum_probs=165.5 Q ss_pred CCCCcccccccchhhhcccCc--cc--cCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGR--GI--QPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESY 76 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~--~i--~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~ 76 (602) +.+.-..+.........-+.+ -+ .+.--+..++++.-...+...||+..+..+...+|.+- ++ .+.. 
T Consensus 11 L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~---~d------~~~~ 81 (480) T protein:vir:78 11 LQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS---ED------SEGL 81 (480) T ss_pred HHHHHHHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhccCceecC---CC------chhH Confidence 111110011010111000000 01 00111233333323346788899999998877776431 11 1112 Q ss_pred HHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEee------CCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 77 QTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILV------EGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 77 ~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r------~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) +.+..++.. -.+......+..+.+++|.||+.+.+ +.+|.+ .+..++|..+.+..+... T Consensus 82 ~~l~~i~~~--------------N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~i~D~~~ 146 (480) T protein:vir:78 82 EELWNWWQA--------------NDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRN 146 (480) T ss_pred HHHHHHHHh--------------cCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCee-EEEEEcccceEEEEcCCC Confidence 223332211 13566778889999999999988765 234544 578899998876554321 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccc-cccceee---ecccceEEecC-ceeEEechhHEEEecCCCCCC Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRY-GDDKRFV---DKETGEVASDA-GELKNGPANELIFLPNPSPLA 225 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~-~~~~~~~---~~~~g~~~~~~-~~~~~~~~~eviH~r~~~~~~ 225 (602) ... . .-...|+.-......+ .....| ++..... ......+.... 
.....+..=.|+||++....+ T Consensus 147 ~~~----~----~~~i~~~~~~d~~~~~--~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~ 216 (480) T protein:vir:78 147 TRR----V----TRAVRLYTTRDDVAVP--DRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) T ss_pred ccc----e----EEEEEEEEeecCCcce--EEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccC Confidence 100 0 0001111111110000 000000 0000000 00000001100 011123334688998776677 Q ss_pred CcccccHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccc Q lcl|NC_021537. 226 LYYGVPDWVA-AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDD 304 (602) Q Consensus 226 ~~~G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~ 304 (602) ..+|.|.+.- +...++.......-..+...-.+.|..+|+ |..+.+...+.-...+... .++++.+++ T Consensus 217 ~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~-----~~~~~~~~~---- 285 (480) T protein:vir:78 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDIY-----YGRILTLAS---- 285 (480) T ss_pred CccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhh--CCCccccccccccchhhhh-----hhhhccCCC---- Confidence 7899997764 333343333332222233333444544443 3222111111000111111 112222221 Q ss_pred eeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHH------------- Q lcl|NC_021537. 305 HGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREF------------- 371 (602) Q Consensus 305 ~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f------------- 371 (602) .+.+|..+...+. -.|++..+..+.+|+..=++|+..+|.... 
|.++.++....+ T Consensus 286 ---------~~~~~~~~~~~~~--~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~-n~~Sg~Al~~~~~~l~~k~~~~~~~ 353 (480) T protein:vir:78 286 ---------EAAKISEFKAAEL--RNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRIVKMAERKGRI 353 (480) T ss_pred ---------CCceEEecCccCH--HHHHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHHH Confidence 1122333322221 137788888999999999999999985332 223332221111 Q ss_pred HHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCC--cccHHHHHHHhCCCCCCC Q lcl|NC_021537. 372 AKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAG--VGTVNEAREELDLAPFED 449 (602) Q Consensus 372 ~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~Gl~p~~~ 449 (602) +...|+-.++.+... ...........+++.+.+.... +....++.+.+++.+| +++..-+++++|+.+-+- T Consensus 354 f~~~l~~~~rl~~~~-----~~~~~~~~~~~i~v~w~~~~~~--s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~ 426 (480) T protein:vir:78 354 FGGAWERAMRIAMQI-----MGREVTEEYTRLETVWRDPSTP--TVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR 426 (480) T ss_pred HHHHHHHHHHHHHHH-----cCCCccccceeeeEEecCCCCC--CHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHH Confidence 111122222222111 1111111122333333322211 2223455666777766 667666788888864321 Q ss_pred Cccccc--------cccccccccccccCCCcCccccccccccccccccccccccccc Q lcl|NC_021537. 450 DRGDMT--------LSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVD 498 (602) Q Consensus 450 g~~d~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (602) ...... .........+++...+.+... +. 
++..++..........+ T Consensus 427 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 427 EQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT-ET--KTETQTSPSGFNRTKTR 480 (480) T ss_pred HHHHHHHHHHHHHHHHHhhccccCCCccccCCCCC-CC--CCccCCCcccCCCcCCC Confidence 110000 011111111111111110000 00 01111111111111111 No 160 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.81 E-value=1.2e-08 Score=63.93 Aligned_cols=390 Identities=12% Similarity=0.053 Sum_probs=165.4 Q ss_pred CCCCc--------ccccccchhhhcccCc-----cccCCCCHHHHHHHHhh-hHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021537. 1 MSKAE--------ETTQLDERHIATDVGR-----GIQPPYNPETLAAFQEL-NETHQACIRKKSRYEAGYGFEIVAHPSA 66 (602) Q Consensus 1 ~~k~~--------~~~~~~~~~~~~~~~~-----~i~p~~~~~~l~~~~~~-~~~v~~cI~~ia~~ia~~~~~i~~~~~~ 66 (602) |.... ...+...+....-+.+ .+.+ --+..++.+.+. ..+...+|+.+++.+.=.||..- T Consensus 1 m~~~~i~~L~~~~~~~~~r~~~~~~yy~g~~~~~~~~~-~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~----- 74 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGVDKRYRYYAMDDRDDTRSI-VMPNNVREMYRSVLEWTAKGVDSLADRIIFREFTND----- 74 (422) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhhcCc-cccHHHHHHHHhhcchhHHHHHHHHhccccceeeCC----- Confidence 11100 0000111111111111 1111 113445544432 36788899999887666666421 Q ss_pred CCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC-CCceEEEEEeCccccccc Q lcl|NC_021537. 67 DEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG-DGTPVGLAHVPAATVRVR 145 (602) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~-~G~~~~L~~l~p~~v~~~ 145 (602) +. .+.. .... | ++......+..+.+++|.||+.+.++. +|.| .+.+++|..+... 
T Consensus 75 -----d~----~l~~----------~w~~-N---~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p-~i~~~sp~~~~~i 130 (422) T protein:vir:97 75 -----DF----NAWE----------IFKA-N---NPDIFFDTAIQSALIASCCFVYIMPGAEDGLP-KMQVIEASKATGI 130 (422) T ss_pred -----ch----hHHH----------HHHh-c---ChHHHHHHHHHHHHHhcceeEEEeeCCCCCee-EEEEechhhEEEE Confidence 11 0111 1111 1 345566678889999999999998874 5665 6888899988765 Q ss_pred ccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCC Q lcl|NC_021537. 146 KTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLA 225 (602) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~ 225 (602) .|...... . .++...-....+.... .-.+++...+.-...+....... .+..=.|+||.+..... T Consensus 131 ~D~~~~~~---------~--~a~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~---~~g~vPvv~~~n~~~~~ 195 (422) T protein:vir:97 131 LDPTTFLL---------T--EGYAILESDSNGNPTL-EAYFTDKDIWYYPKKGKPYNIKN---PTGHPLLVPIIHRPDAV 195 (422) T ss_pred EeCCCCcc---------e--eeEEEEEecCCCcEEE-EEEEcCceEEEEcCCCccccccC---CCCCcceEEecccCCCc Confidence 43321110 0 1111111111110000 00011111111111111111111 11222478888776677 Q ss_pred CcccccHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccc Q lcl|NC_021537. 226 LYYGVPDW-VAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDD 304 (602) Q Consensus 226 ~~~G~spl-~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~ 304 (602) ..+|.|.+ +.....++..............-.+.|.-++. |-..+....+.++.. 
.++++.++...+ T Consensus 196 ~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~--G~d~d~~~~~~~~~~---------~~~i~~~~~de~- 263 (422) T protein:vir:97 196 RPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVL--GMDPDAKPMEKWRAT---------VSTLLEISKDED- 263 (422) T ss_pred cccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--ccCcccccCchhhhh---------hhhhhccCCCCC- Confidence 78999865 23322222222222222222222334444442 211111112222221 223444332211 Q ss_pred eeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHH---HHHHHHHHHHHHH Q lcl|NC_021537. 305 HGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQT---REFAKGIIEPEQA 381 (602) Q Consensus 305 ~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~---~~f~~~~l~P~~~ 381 (602) ..++++..+...+.. .|++.++..+..|++.=++|++.+|.... |-++.++.. ..+.. .+.-..+ T Consensus 264 --------~~~~~v~q~~~~~l~--~~~~~l~~~~~~~a~~s~lP~~~lg~~~~-NpsSa~Ai~a~~~~L~~-ka~~k~~ 331 (422) T protein:vir:97 264 --------GDKPTVGQFTTASMA--PFMEHLKMYASLFAGGSGLTLDDLGFPSD-NPSSVESIKAAHENLRA-AGRKAQR 331 (422) T ss_pred --------CCcceeeecCCCChh--HHHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHHHHHHHHHHHH-HHHHHHH Confidence 012334333333321 38999999999999999999999997553 333333321 11111 1111111 Q ss_pred HHHHHHhhh------cCCccc----cccceEEEeccchhcchhH-HHHHHHHHHHHHHhC--CcccHHHHHHHhCCCCCC Q lcl|NC_021537. 382 KFSARLYKI------IHQDAL----DVDEWTIDFELRGAEQPEQ-DAKMAEQRVRAMRLA--GVGTVNEAREELDLAPFE 448 (602) Q Consensus 382 ~ie~~ln~~------Ll~~~~----~~~~~~~~f~~~~~~~~~~-d~~~~~~~~~~~~~~--G~~T~NE~R~~~Gl~p~~ 448 (602) .|...+.+. +..... ...+..++|. ....... .....++++.+++.+ |++..+-+++++|+...+ T Consensus 332 ~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~~w~--p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~ 409 (422) T protein:vir:97 332 SFSSGFLNVAYIAVCLRDEFPYLRNQFMDTVIKWE--PLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGAD 409 (422) T ss_pred HHHHHHHHHHHHHHHHhcCCcccchhhccceEEEc--cCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCCCchh Confidence 222222111 101000 1112344553 2221111 122345777888887 788888899999995432 Q ss_pred CCccccccccccccccccccCC Q lcl|NC_021537. 
449 DDRGDMTLSEFEAEFGADASDG 470 (602) Q Consensus 449 ~g~~d~~~~~~~~~~~~~~~~~ 470 (602) . ..... . ...+.+ T Consensus 410 ~---~~~~~---~---~~~~d~ 422 (422) T protein:vir:97 410 K---PIPAI---T---EVTTDG 422 (422) T ss_pred H---HHHHH---H---hhhccC Confidence 1 11000 0 000111 No 161 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=98.81 E-value=1.8e-08 Score=63.09 Aligned_cols=432 Identities=9% Similarity=0.015 Sum_probs=173.1 Q ss_pred CCCC---cccccc-cchhhh-cccC--ccccCCC-C-HHH--HHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCc Q lcl|NC_021537. 1 MSKA---EETTQL-DERHIA-TDVG--RGIQPPY-N-PET--LAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEP 69 (602) Q Consensus 1 ~~k~---~~~~~~-~~~~~~-~~~~--~~i~p~~-~-~~~--l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~ 69 (602) |+|. -.+.+. ..+... +-.| ..|..+- . ... ..++ ..++...+|+..+..+.+-|+.+...+... T Consensus 44 l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki--~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~-- 119 (501) T protein:vir:27 44 LKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRA--VHNYGRMISKFKTGYLAGNPIRVEYDDNDN-- 119 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccCcccccccee--ccchHHHHHHHHhhhhcccCeeEecCCccc-- Confidence 1111 001110 011100 0011 1121111 0 001 1122 247788899999999999888775332211 Q ss_pred ccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccc Q lcl|NC_021537. 70 DEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~ 149 (602) .......+..++ ..-.+......+..+.+.+|.+|+.+.++.+|++ .+..++|..+.+..+.. 
T Consensus 120 --~~~~~~~l~~~~--------------~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~-~i~~~~p~~~~~v~d~~ 182 (501) T protein:vir:27 120 --NSQNDDTIKRIG--------------RINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDET-RIKRLNPLETFVIYDNS 182 (501) T ss_pred --hHHHHHHHHHHH--------------HhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCce-EEEEEccceeEEEecCC Confidence 111111122211 1224567888899999999999999999888875 57788888886543321 Q ss_pred ccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCccc Q lcl|NC_021537. 150 TIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYG 229 (602) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G 229 (602) .. ....-+..|+......... .+...+..+..+.-...+.+.........+.-=.|++++.. ..| T Consensus 183 ~~--------~~~~~~ir~~~~~~~~~~~--~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn-----~~g 247 (501) T protein:vir:27 183 LE--------DNSIAAVRYYNRGTLQNAK--DVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPITEFLNN-----VDG 247 (501) T ss_pred CC--------CceEEEEEEEEeeecCCcE--EEEEEEeCCeEEEEEeCCceeeccccccCCCcccEEEecCC-----CCC Confidence 10 0001111222221111110 00001111000000000000000111111222236666543 368 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccc Q lcl|NC_021537. 230 VPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGD 309 (602) Q Consensus 230 ~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~ 309 (602) +|.+..+...++....+..-..+.+.....|-.+++-.......+....++. .+.+.+...+. .. 
T Consensus 248 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~----------~~~~~~~~~~~-~~---- 312 (501) T protein:vir:27 248 IGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKR----------TRLMQLKPPKS-AD---- 312 (501) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhh----------cCceeeccccc-cc---- Confidence 8988888777777666666666666666666555542211112222222211 11111111111 00 Q ss_pred cccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------------HHHHHHHHH Q lcl|NC_021537. 310 GGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TREFAKGII 376 (602) Q Consensus 310 ~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~f~~~~l 376 (602) +...+.+++.++... .+..+....+...+.|+..-++|....+... +|-| ..+. ....+...| T Consensus 313 -~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~S-g~Al~~~~~~l~~ka~~~~~~~~~~l 388 (501) T protein:vir:27 313 -GKEGTVKAEYLTKSY-DVSGAEAYKTRLNRDIHIFTNIPDMSDTNFS-GNTS-GEALKYKLFGLDQDRVDTQSQFTQGL 388 (501) T ss_pred -CCCCCcceeeeeccC-CHHHHHHHHHHHHHHHHHHhCCcccCccccc-cCch-HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 011122222222111 2334567778888899999999865443221 2222 2211 112223344 Q ss_pred HHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccc--c Q lcl|NC_021537. 377 EPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGD--M 454 (602) Q Consensus 377 ~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d--~ 454 (602) +-+++.+...++..-........+..+.|...-.. 
+....++++.++ .|+++..-+.+++++ +++...+ + T Consensus 389 ~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~----n~~e~ad~~~kl--~g~iS~et~l~~l~~--v~D~~~E~er 460 (501) T protein:vir:27 389 KRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPK----SLNEQVSILTGL--GGQVSQETALSLSGL--VESPNEELDK 460 (501) T ss_pred HHHHHHHHHHHhhcccccccccccceEEeCCCCCc----CHHHHHHHHHHH--hccCcHHHHHHhCCC--CCCHHHHHHH Confidence 44444444443322111111223345666443222 233345666665 589998888888754 3322111 1 Q ss_pred ccccccccccccccCCCcCcccccccccccccccccccccccccccc Q lcl|NC_021537. 455 TLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSK 501 (602) Q Consensus 455 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (602) ...... ........++..+......+.+.......+ ..+.+ T Consensus 461 i~~E~~-e~~~~~~~~~~~~~~~~~~d~~~~~~~d~~-----e~~~~ 501 (501) T protein:vir:27 461 INKEVS-EIDFKGYSNDFNEHVGKYTDEVKETHTDDF-----ERAYE 501 (501) T ss_pred HHHHHH-hhhHhhhcCccccccccccCCCCCCccccc-----cccCC Confidence 110000 000000000000000000000000000000 00000 No 162 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.76 E-value=5.9e-08 Score=60.23 Aligned_cols=432 Identities=14% Similarity=0.114 Sum_probs=165.3 Q ss_pred CCCCcccccccchhhhcccCccccCCC-CHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPY-NPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTV 79 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~-~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~ 79 (602) ..+.+.-..+. +++... ...-..+. 
-+..+++..-...+...||+..+..+...+|.+- + +.+..+.+ T Consensus 16 ~~~~~r~~~l~-~Yy~G~-~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~---~------d~~~~~~l 84 (480) T protein:vir:78 16 ARDLPNLLEAE-AYRNGT-RRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS---E------DSEGLEEL 84 (480) T ss_pred HHHHHHHHHHH-HHHhcc-ccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCceecC---C------CchhHHHH Confidence 11111111111 111100 00000011 1223333322346788899999998877776431 1 11112223 Q ss_pred HHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeC------CCCceEEEEEeCccccccccccccccc Q lcl|NC_021537. 80 RDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVE------GDGTPVGLAHVPAATVRVRKTTTTIER 153 (602) Q Consensus 80 ~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~------~~G~~~~L~~l~p~~v~~~~~~~~~~~ 153 (602) ..++.. -.+......+..+.+++|.||+.+-+. .+|.+ .+..++|..+.+..+.... T Consensus 85 ~~i~~~--------------N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~-~i~~~~p~~~~~~~D~~~~-- 147 (480) T protein:vir:78 85 WNWWQA--------------NDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNT-- 147 (480) T ss_pred HHHHHh--------------cCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCee-EEEEEcccceEEEEcCCCc-- Confidence 332211 134567788899999999999887652 34443 5778889888765442210 Q ss_pred ccchhhhhcccCceeEEEEcCCcceeeccccccc-ccceeeecc---cceEEec-CceeEEechhHEEEecCCCCCCCcc Q lcl|NC_021537. 154 EDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYG-DDKRFVDKE---TGEVASD-AGELKNGPANELIFLPNPSPLALYY 228 (602) Q Consensus 154 ~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~-~~~~~~~~~---~g~~~~~-~~~~~~~~~~eviH~r~~~~~~~~~ 228 (602) . ...-...|+.-..+.... .+...|. +........ ...+... 
......+..=.|+||++....+..+ T Consensus 148 --~----~~~~~i~~~~~~~~~~~~--~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~ 219 (480) T protein:vir:78 148 --R----RVTRAVRLYTTRDDVAVP--DRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRY 219 (480) T ss_pred --c----ceEEEEEEEEeecCCCce--EEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccCCcc Confidence 0 000001111101110000 0000000 000000000 0000110 0111223444688898776667789 Q ss_pred cccHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceec Q lcl|NC_021537. 229 GVPDWVA-AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGL 307 (602) Q Consensus 229 G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~ 307 (602) |.|.+.- +...++....+..-......-.+.|..+|+ |..+.+...+.-...|.... ++++.++ | T Consensus 220 G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~~-----~~~~~~~-~------ 285 (480) T protein:vir:78 220 GRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDIYY-----GRILTLA-S------ 285 (480) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh--cCCccccccccccchhhhhh-----hhhccCC-C------ Confidence 9997764 333444333333333333333445554443 32222211111111111111 1122211 1 Q ss_pred cccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHH--HHHHHHHHHHHH Q lcl|NC_021537. 308 GDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAK--GIIEPEQAKFSA 385 (602) Q Consensus 308 ~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~--~~l~P~~~~ie~ 385 (602) .+.++..+...+. -.|++..+..+.+|+..=++|+..+|.... |.++.++....+.. .-..-..+.|.. 
T Consensus 286 ------~~~~~~~~~~~~~--~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n~~Sg~Alk~~~~~l~~ka~~~~~~f~~ 356 (480) T protein:vir:78 286 ------EAAKISEFKAAEL--RNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRIVKMAERKGRIFGG 356 (480) T ss_pred ------CCceEEecCccCH--HHHHHHHHHHHHHHhcccCCChHHhccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1123333332221 126777888888999999999999985432 32333332211111 001111111111 Q ss_pred HHhhh------cCCcc--ccccceEEEeccchhcchhHHHHHHHHHHHHHHhCC--cccHHHHHHHhCCCCCCCCccccc Q lcl|NC_021537. 386 RLYKI------IHQDA--LDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAG--VGTVNEAREELDLAPFEDDRGDMT 455 (602) Q Consensus 386 ~ln~~------Ll~~~--~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~Gl~p~~~g~~d~~ 455 (602) .|.+. +.... ....+..++|. +.... +....++.+.+++.+| +++..-+++.+|+.+-+-...+.. T Consensus 357 ~l~~~~~l~~~~~g~~~~~~~~~i~v~f~--~~~~~--s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~ 432 (480) T protein:vir:78 357 AWERAMRIAMQIMGREVTEEYTRLETVWR--DPSTP--TVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDW 432 (480) T ss_pred HHHHHHHHHHHHcCCCccccceeeeEEec--CCCCC--CHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHH Confidence 11111 11111 11123344453 22211 2223445566666655 677777888888864321110000 Q ss_pred --------cccccccccccccCCCcCccccccccccccccccccccccccc Q lcl|NC_021537. 456 --------LSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVD 498 (602) Q Consensus 456 --------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (602) +........+++...+.+... 
...++.++.......+..+ T Consensus 433 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 433 DKQETEDMIDTLYSTTKAQADATPKPTVT---ETKTETQTSPSGFNRTKTR 480 (480) T ss_pred HHHHHHHHHHHhhccccccCCCCCCCCCC---CCCCccccccCCCCcccCC Confidence 000000000111111101000 0011111111111111111 No 163 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=98.74 E-value=1.7e-08 Score=63.18 Aligned_cols=490 Identities=13% Similarity=0.042 Sum_probs=218.1 Q ss_pred CCCCccccccc--chhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCc-----ccch Q lcl|NC_021537. 1 MSKAEETTQLD--ERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEP-----DEGG 73 (602) Q Consensus 1 ~~k~~~~~~~~--~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~-----~~~~ 73 (602) -+=.+.++++. .+.+++..+..-.-..-.+.. .+++.-+-++-.|.-+++.++.+-+....-+.+.+. +++. T Consensus 22 r~L~aAs~~~~dpg~~~~~~~g~~~~~~WQ~eAW-~~~d~v~Elry~vgW~~~s~sr~rL~as~idpDtg~ptg~iee~~ 100 (631) T protein:vir:10 22 RALTAASQPLPDPSQVFSKSTGISRNSDWQTDAW-EAVDLVGELRYYVGWRASSCSRCRLVASELDENTGLPTGGISEDN 100 (631) T ss_pred hhhhhhhccccchhhhhhhhcCCcccchhhHHHH-HHHHhhhhHHHHhhhhhhhhceeeeEeeeeccCCCCCccccccCC Confidence 00011122332 233333322110101111221 233444677788888999999988766555433111 1111 Q ss_pred hhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEe-eCCCCc----------eEEEEEeCcccc Q lcl|NC_021537. 74 ESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEIL-VEGDGT----------PVGLAHVPAATV 142 (602) Q Consensus 74 ~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~-r~~~G~----------~~~L~~l~p~~v 142 (602) ....++.....+ =+...+...++++.+..++-+-|++|+-++ +..+|. 
.-+++.+....| T Consensus 101 ~~~~~v~~~~~~---------i~gG~lgQ~~llkrl~~~ltV~GE~wiv~l~~p~~~~~~~pd~~~r~~~~W~~vt~~ei 171 (631) T protein:vir:10 101 TEGERVREIVSK---------IADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEI 171 (631) T ss_pred chhHHHHHHHHh---------cCCCcchHHHHHHHHHhheecccceEEEEEeccCcCCCCCcccccccccceeeccHHHH Confidence 112233332221 133456788999999999999999999764 333321 223444455444 Q ss_pred cccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEec-hhHEEEec-- Q lcl|NC_021537. 143 RVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGP-ANELIFLP-- 219 (602) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~-~~eviH~r-- 219 (602) +...... +..+ ....|+ ...|- ..+ +-|| T Consensus 172 ~~~~~g~---------------g~~v----------~lp~g~----------------------~h~~~~~~D-~l~RiW 203 (631) T protein:vir:10 172 KKSNKGS---------------GTNI----------VLPTGE----------------------EHEFVKGTD-IIFRVW 203 (631) T ss_pred hcccCcc---------------ccee----------ecCCCC----------------------ccceecCCc-eEEEee Confidence 3221110 0000 000011 00010 111 2233 Q ss_pred CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccC--------------------CHHHHHHH Q lcl|NC_021537. 220 NPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTL--------------------SEDSKEDL 279 (602) Q Consensus 220 ~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~--------------------~~~~~~~l 279 (602) .+.|....+--||+.+++..+....-......+..+.-.+-.|+|.+|.... +.-+..+| T Consensus 204 ~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaakSRl~gnGvlflP~els~P~~~~~~~~~~g~~v~~~~g~pa~~~l 283 (631) T protein:vir:10 204 IPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQL 283 (631) T ss_pred CCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeEeccccccCCCCCCCCCcCCccCCccccchhHHHH Confidence 3445555677899999988887766666666666666666677776664321 11244444 Q ss_pred HHHHHH-----hh--cccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHH Q lcl|NC_021537. 
280 RNLMDN-----LK--GSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVL 352 (602) Q Consensus 280 ~~~~~~-----~~--g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~ 352 (602) .+.+=+ +. +...+..+++.. .++..-. +++-|...+..+.--+.+++..+..+|+.+-|||+. T Consensus 284 ~~~l~q~a~tai~De~S~aA~vPii~~--------~p~E~i~--~i~hlkf~~ei~e~aiktR~daI~RlA~glDi~pE~ 353 (631) T protein:vir:10 284 TDMLFQVAETAVEDEDSQAAFIPVIAG--------VPGEQIK--DVKHIRFDNEITEVAIKTRNDAIARLAMGLDVSPER 353 (631) T ss_pred HHHHHHHHhhhhcCCCCccceeeeeEe--------echHHhc--CeeEEeecCchhHHHHhhHHHHHHHHHhccCCchhh Confidence 444321 11 122222222221 1111111 334444456667777899999999999999998774 Q ss_pred -hhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcc-----ccccceEEEeccchhcchhHHHHHHHHHHHH Q lcl|NC_021537. 353 -INVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDA-----LDVDEWTIDFELRGAEQPEQDAKMAEQRVRA 426 (602) Q Consensus 353 -lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~-----~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~ 426 (602) ||..+++|-=++=+....-++-.|.|.+..|+++|++.+|.+. -+..+|-+-||.+.+.. |.....+++ . T Consensus 354 LLGlGsd~NHWsAWqI~dedVrlHI~P~l~lic~AlT~q~Lrp~Le~eGvDp~kYvvW~DaS~Lt~---dPdr~deA~-q 429 (631) T protein:vir:10 354 LLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDPSQLTI---DPDKSDEAK-F 429 (631) T ss_pred heeccCCccceEEEEecccceeeecchHHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCccccc---CCCCcHHHH-H Confidence 5776566532222222223466799999999999998776432 23356788899887643 222223343 4 Q ss_pred HHhCCcccHHHHHHHhCCCCCCCCcc----------------cccccccccccc-ccccC--------CCcCcccccccc Q lcl|NC_021537. 427 MRLAGVGTVNEAREELDLAPFEDDRG----------------DMTLSEFEAEFG-ADASD--------GDAEAMLTRSKA 481 (602) Q Consensus 427 ~~~~G~~T~NE~R~~~Gl~p~~~g~~----------------d~~~~~~~~~~~-~~~~~--------~~~~~~~~~~~~ 481 (602) +.+.|.||-...|+.+|+.--.+.+- +-.+.+.+.++. .+.+. 
...++.+....+ T Consensus 430 a~drGAIt~eAlrk~lGf~eDd~yd~~t~e~~~~~a~~av~~dpaLip~lApl~~~~~~~v~~P~~~a~~~~g~ed~~~~ 509 (631) T protein:vir:10 430 AYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAGVLKQIEFPQQQAIDSGGNEDTSDA 509 (631) T ss_pred HHHcCCcCHHHHHHHhcCchhcccCcCchHHHHHHHHHHhhcccCcchhhHHHHHHHhhhccCCCCCCCCCCCCCccccc Confidence 78899999999999999854222110 000112222211 11111 111111111111 Q ss_pred cccccccccccccccccccccchhhhhcchhhhhhheecccccEEEEEEecccCCcceeeeccCCHHHHHHHhCCCccch Q lcl|NC_021537. 482 APPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFGERELYLSFKRESGQNSLYVYVDVPAAVWSALVSAPSAGS 561 (602) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~~~~~a~s~g~ 561 (602) + +.++...+.+..+..... +. .+... .+ |-.-++.+|--|.-. T Consensus 510 ~-~~~~g~~epdt~d~~p~~---~~---a~~~~-~i---------------------------v~llv~RALelAGkR-- 552 (631) T protein:vir:10 510 D-DLDDGEQEPDTEDDDDGT---QK---AGLET-GI---------------------------VDLMVDRALELVGKR-- 552 (631) T ss_pred c-ccccCCCCCCCCCCCCcc---cc---ccchH-HH---------------------------HHHHHHHHHHhhcch-- Confidence 1 111111111111111000 00 00000 00 112223333333211 Q ss_pred hhhhhhcccccccccccc--hhcccCCCCCChhhcC------Ccccc-cC Q lcl|NC_021537. 562 YHYSEIRLQYGYLEVTNN--HERLPEGPTPDPGEAP------EDVPS-DI 602 (602) Q Consensus 562 ~~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~~~------~~~~~-~~ 602 (602) -.++.=..+|+|.-|... 
|..+..+ | .++++ +.+=+ ++ T Consensus 553 l~~r~r~~~ar~~~v~~he~H~~~~Pv--~-~~ev~rli~gwd~~ld~~~ 599 (631) T protein:vir:10 553 RRGRDRETLARLSGVRERDYHRYMDPV--P-ESEVDRLMSGWDSALDDKI 599 (631) T ss_pred hcCCcccchhHHhcccccccccccCCC--C-HHHHHHHHHHHHHHHHHHH Confidence 001111111222222111 1111100 0 01111 00000 01 No 164 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.74 E-value=7.2e-08 Score=59.75 Aligned_cols=430 Identities=12% Similarity=0.023 Sum_probs=171.3 Q ss_pred CCCCcccccccc-----hhhhcccCcccc-CC----CCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcc Q lcl|NC_021537. 1 MSKAEETTQLDE-----RHIATDVGRGIQ-PP----YNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPD 70 (602) Q Consensus 1 ~~k~~~~~~~~~-----~~~~~~~~~~i~-p~----~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~ 70 (602) -.+-..+.+... +.+...-...+. +. -++..-+.+ .......+++..|..+.+-|..|.-. T Consensus 28 ~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~--~~n~~k~i~~~~a~~l~~~p~~i~~~------- 98 (496) T protein:vir:38 28 HKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQL--SMNLPKVTAKYMSKLLFNEKVKINID------- 98 (496) T ss_pred cCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCcccccee--ecchHHHHHHHHhhhhhCCcceEeeC------- Confidence 011111111111 111100000000 00 000000111 12566788899999988877766421 Q ss_pred cchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 71 EGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) +....+.+...+. ...+...+..++.+...+|.+|+.+..|.+|.+ .+.+++|..+-+..+... 
T Consensus 99 -d~~~~e~l~~~~~--------------~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i~~v~~~~~~P~~~~~~ 162 (496) T protein:vir:38 99 -DKAAEEFVLNVLK--------------TNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KVSFATADCMYPLSNDSE 162 (496) T ss_pred -ChHHHHHHHHHHh--------------ccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcE-EEEEEcccceEEEEecCC Confidence 1122222222221 234667778888899999999999999887774 678889888765433221 Q ss_pred cccccchhhhh-cccCceeEEEE--c---CCcceeec-cccccccc-ceeeecccceEEec---CceeEEechhHEEEec Q lcl|NC_021537. 151 IEREDGEEVEN-IESGHGYVQVR--Q---GRRRYFGE-AGDRYGDD-KRFVDKETGEVASD---AGELKNGPANELIFLP 219 (602) Q Consensus 151 ~~~~~~~~~~~-~~~~~~~~qi~--~---~~~~~~~~-~~~~~~~~-~~~~~~~~g~~~~~---~~~~~~~~~~eviH~r 219 (602) -.. ....+.. ...+..|.++- . +.....+. +....... ..-+... .++.. .+....++.--+.|++ T Consensus 163 ~~~-~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~--~~~~~~~~~~~~~~~~~~~f~~~~ 239 (496) T protein:vir:38 163 NVD-ECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLT--LLFDDIEPVVPLPDFTRPTFIYIK 239 (496) T ss_pred cEE-EEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCcccccc--ccccccccceeecCCCcceEEEec Confidence 100 0000000 11222222211 0 00000000 00000000 0000000 00000 0001111222355565 Q ss_pred CCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcc Q lcl|NC_021537. 220 NPS----PLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAI 295 (602) Q Consensus 220 ~~~----~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~ 295 (602) .+- ..+..+|+|.+..+...++....+..-..+-|..| .+..++ +... .+.. ....|. ... 
T Consensus 240 ~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~-~~~i~v--~~~~-----l~~~----~~~~g~---~~~ 304 (496) T protein:vir:38 240 PNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLG-KKKVLV--PSSF-----VKTA----VNLDGS---TTQ 304 (496) T ss_pred CCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhc-ccceec--chHH-----hhcc----CCCCCc---ccc Confidence 431 23456899999999888877665544444455543 333222 2110 0000 000000 000 Q ss_pred eeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------- Q lcl|NC_021537. 296 LEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------- 367 (602) Q Consensus 296 ~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------- 367 (602) .....-..+........+.+-.++.++. ....-++.+..+...++|+..-|++|..+|...++. +|+.+. T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~-~tAtei~~~~~~l~ 382 (496) T protein:vir:38 305 YFDSTDEAFFLYQGDQDDNGKAIKDISV-EIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGL-KTATEVVSEKSETY 382 (496) T ss_pred CCCCccceEEEeecCCCcccccceeecc-ccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCcccc-chHHHHHHHHHHHH Confidence 0000000011111111111111222221 112345778888888999999999999998765443 233322 Q ss_pred -----HHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHh Q lcl|NC_021537. 368 -----TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREEL 442 (602) Q Consensus 368 -----~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~ 442 (602) ....++.+|+.++..+.+..+..............+.|.+.+.... 
|....++.+.+++.+|+|+...++..+ T Consensus 383 ~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~--d~~~~~~~~~~~~~~GiiS~et~l~~~ 460 (496) T protein:vir:38 383 QTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQ--DEDTTINRYTNAKNQGMIPLKIALQRA 460 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCC--CHHHHHHHHHHHHhcCCCCHHHHHHhc Confidence 1112233444444444433332222112222333444444433323 333345567788899999988887654 Q ss_pred -CCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccccc Q lcl|NC_021537. 443 -DLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERD 493 (602) Q Consensus 443 -Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (602) |. .+++...-+......-..+....+..+.. .+.+ T Consensus 461 ~~~---~d~ea~~el~ri~~E~~~~~~~~d~~~~~-------------~~~e 496 (496) T protein:vir:38 461 WNI---TEAEADEWAEMLAKEKQAEMPNNDMNGIF-------------GEEE 496 (496) T ss_pred CCC---ChHHHHHHHHHHHHhhhccCccccccCCC-------------CCCC Confidence 43 22222111111000000000000000000 0000 No 165 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=98.73 E-value=7.8e-08 Score=59.56 Aligned_cols=410 Identities=11% Similarity=0.029 Sum_probs=174.5 Q ss_pred CCC-Ccccccccc--hhhhcccC--ccccCCCCHH----------------HHHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_021537. 1 MSK-AEETTQLDE--RHIATDVG--RGIQPPYNPE----------------TLAAFQELNETHQACIRKKSRYEAGYGFE 59 (602) Q Consensus 1 ~~k-~~~~~~~~~--~~~~~~~~--~~i~p~~~~~----------------~l~~~~~~~~~v~~cI~~ia~~ia~~~~~ 59 (602) +.+ ......+.. +++-.... ..+..+.... .-.+++ +++...+|+..+..+.|-|+. 
T Consensus 24 i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~--~n~~~~ivd~~~~yl~g~pv~ 101 (474) T protein:vir:10 24 IESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN--NSFDSEIVDTRVGYLHGVPVT 101 (474) T ss_pred HHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc--cchHHHHHHhHhhheecccee Confidence 100 000001100 01100000 0011000000 000222 567788899999999898887 Q ss_pred EEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCc Q lcl|NC_021537. 60 IVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPA 139 (602) Q Consensus 60 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p 139 (602) +....+.+ .+.+..+.+..++. ...+......+..+.+.+|.||..+..+.+|++ .+..++| T Consensus 102 ~~~~~~~~---~~e~~~~~l~~~~~--------------~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~~~i~p 163 (474) T protein:vir:10 102 YDLDENAE---KNEKLKKFITNFAI--------------RNSVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RIKNIDP 163 (474) T ss_pred EeeCCCCc---chHHHHHHHHHHHh--------------hcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EEEEEcc Confidence 65422211 11122222222221 124566778888999999999999888888874 6788888 Q ss_pred ccccccccccccccccchhhhhcccCceeEEEEcCCcceeeccccccccccee--eecccceEEecCceeEEechhHEEE Q lcl|NC_021537. 140 ATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRF--VDKETGEVASDAGELKNGPANELIF 217 (602) Q Consensus 140 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~~~~~~~~~~~~~~eviH 217 (602) ..+-+..+... .. 
.-+..|+....+.....+.....+.....+ .....+.+...+.....+..=.|+| T Consensus 164 ~~~~~v~d~~~-~~---------~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 233 (474) T protein:vir:10 164 YNVIFVGDNIL-EP---------TYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFG 233 (474) T ss_pred cceEEEEcCCC-ce---------EEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEE Confidence 88765443221 10 011222222222221111111111111000 0000000011111111222334677 Q ss_pred ecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCccee Q lcl|NC_021537. 218 LPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILE 297 (602) Q Consensus 218 ~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~ 297 (602) |++. ..|.|.+......++....+..-..+.+...+.|-.+++ |..++++....++ ..+.+.+ T Consensus 234 ~~n~-----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~--g~~~~~~~~~~~~----------~~~~i~~ 296 (474) T protein:vir:10 234 VPNN-----KEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR--GMGMSEEMIQETQ----------KSGAFEL 296 (474) T ss_pred ecCC-----CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--cCCCCchhhhhhh----------hcceeEe Confidence 7643 368888887777776666555555555565556655553 4334443333221 1233333 Q ss_pred ccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH---------- Q lcl|NC_021537. 298 VEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ---------- 367 (602) Q Consensus 298 ~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~---------- 367 (602) .+.+.+. ++ ++... .+..+....+...+.|...-++|..-.+... ++-|. ++. 
T Consensus 297 ~~~~~~~------------~~--l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg-~Al~~~~~~l~~k 359 (474) T protein:vir:10 297 FDKDMDV------------KY--LTKDV-NDTMIENHLDRIEKNIMRFAKSVNFNSDEFN-GNVPI-IGMKLKLMALENK 359 (474) T ss_pred cCCCCce------------eE--EeccC-CHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchH-HHHHHHHHHHHHH Confidence 3333222 11 11111 1345667778888899998888864433221 22221 211 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHhhhcCC-ccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhC Q lcl|NC_021537. 368 ---TREFAKGIIEPEQAKFSARLYKIIHQ-DALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELD 443 (602) Q Consensus 368 ---~~~f~~~~l~P~~~~ie~~ln~~Ll~-~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~G 443 (602) ....+..+|+-.++.+...++.+-.. ......+..+.|...-.. +....++.+.++ .|+++..-+.++++ T Consensus 360 ~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~----d~~e~a~~~~kl--~g~iS~et~~~~l~ 433 (474) T protein:vir:10 360 CMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPV----NKLEESQVLINL--KGQVSERTRLGQSQ 433 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCC----CHHHHHHHHHHH--hccCchHHHHHhCC Confidence 11233444555555555544432111 111123445556433221 334455667766 48999988888887 Q ss_pred CCCCCCCcc--cccccccccccc--ccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 444 LAPFEDDRG--DMTLSEFEAEFG--ADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 444 l~p~~~g~~--d~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) .- ++... ++.-........ .+...++..+...... 
.+ T Consensus 434 ~v--~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~~---------s~ 474 (474) T protein:vir:10 434 LV--DDVDYELDEMEKESLEFNDKLPDIDEGDANDKSQNNQ---------SE 474 (474) T ss_pred CC--CCHHHHHHHHHHHHHHHHhhcccccCCCcCCCCcccc---------CC Confidence 53 22211 111000000000 0000000000000000 00 No 166 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=98.73 E-value=7.8e-08 Score=59.56 Aligned_cols=410 Identities=11% Similarity=0.029 Sum_probs=174.5 Q ss_pred CCC-Ccccccccc--hhhhcccC--ccccCCCCHH----------------HHHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_021537. 1 MSK-AEETTQLDE--RHIATDVG--RGIQPPYNPE----------------TLAAFQELNETHQACIRKKSRYEAGYGFE 59 (602) Q Consensus 1 ~~k-~~~~~~~~~--~~~~~~~~--~~i~p~~~~~----------------~l~~~~~~~~~v~~cI~~ia~~ia~~~~~ 59 (602) +.+ ......+.. +++-.... ..+..+.... .-.+++ +++...+|+..+..+.|-|+. T Consensus 24 i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~--~n~~~~ivd~~~~yl~g~pv~ 101 (474) T protein:vir:94 24 IESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN--NSFDSEIVDTRVGYLHGVPVT 101 (474) T ss_pred HHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc--cchHHHHHHhHhhheecccee Confidence 100 000001100 01100000 0011000000 000222 567788899999999898887 Q ss_pred EEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCc Q lcl|NC_021537. 60 IVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPA 139 (602) Q Consensus 60 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p 139 (602) +....+.+ .+.+..+.+..++. 
...+......+..+.+.+|.||..+..+.+|++ .+..++| T Consensus 102 ~~~~~~~~---~~e~~~~~l~~~~~--------------~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~~~i~p 163 (474) T protein:vir:94 102 YDLDENAE---KNEKLKKFITNFAI--------------RNSVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RIKNIDP 163 (474) T ss_pred EeeCCCCc---chHHHHHHHHHHHh--------------hcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EEEEEcc Confidence 65422211 11122222222221 124566778888999999999999888888874 6788888 Q ss_pred ccccccccccccccccchhhhhcccCceeEEEEcCCcceeeccccccccccee--eecccceEEecCceeEEechhHEEE Q lcl|NC_021537. 140 ATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRF--VDKETGEVASDAGELKNGPANELIF 217 (602) Q Consensus 140 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~~~~~~~~~~~~~~eviH 217 (602) ..+-+..+... .. .-+..|+....+.....+.....+.....+ .....+.+...+.....+..=.|+| T Consensus 164 ~~~~~v~d~~~-~~---------~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 233 (474) T protein:vir:94 164 YNVIFVGDNIL-EP---------TYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFG 233 (474) T ss_pred cceEEEEcCCC-ce---------EEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEE Confidence 88765443221 10 011222222222221111111111111000 0000000011111111222334677 Q ss_pred ecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCccee Q lcl|NC_021537. 218 LPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILE 297 (602) Q Consensus 218 ~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~ 297 (602) |++. 
..|.|.+......++....+..-..+.+...+.|-.+++ |..++++....++ ..+.+.+ T Consensus 234 ~~n~-----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~--g~~~~~~~~~~~~----------~~~~i~~ 296 (474) T protein:vir:94 234 VPNN-----KEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR--GMGMSEEMIQETQ----------KSGAFEL 296 (474) T ss_pred ecCC-----CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--cCCCCchhhhhhh----------hcceeEe Confidence 7643 368888887777776666555555555565556655553 4334443333221 1233333 Q ss_pred ccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH---------- Q lcl|NC_021537. 298 VEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ---------- 367 (602) Q Consensus 298 ~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~---------- 367 (602) .+.+.+. ++ ++... .+..+....+...+.|...-++|..-.+... ++-|. ++. T Consensus 297 ~~~~~~~------------~~--l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg-~Al~~~~~~l~~k 359 (474) T protein:vir:94 297 FDKDMDV------------KY--LTKDV-NDTMIENHLDRIEKNIMRFAKSVNFNSDEFN-GNVPI-IGMKLKLMALENK 359 (474) T ss_pred cCCCCce------------eE--EeccC-CHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchH-HHHHHHHHHHHHH Confidence 3333222 11 11111 1345667778888899998888864433221 22221 211 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHhhhcCC-ccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhC Q lcl|NC_021537. 368 ---TREFAKGIIEPEQAKFSARLYKIIHQ-DALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELD 443 (602) Q Consensus 368 ---~~~f~~~~l~P~~~~ie~~ln~~Ll~-~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~G 443 (602) ....+..+|+-.++.+...++.+-.. ......+..+.|...-.. 
+....++.+.++ .|+++..-+.++++ T Consensus 360 ~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~----d~~e~a~~~~kl--~g~iS~et~~~~l~ 433 (474) T protein:vir:94 360 CMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPV----NKLEESQVLINL--KGQVSERTRLGQSQ 433 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCC----CHHHHHHHHHHH--hccCchHHHHHhCC Confidence 11233444555555555544432111 111123445556433221 334455667766 48999988888887 Q ss_pred CCCCCCCcc--cccccccccccc--ccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 444 LAPFEDDRG--DMTLSEFEAEFG--ADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 444 l~p~~~g~~--d~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) .- ++... ++.-........ .+...++..+...... .+ T Consensus 434 ~v--~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~~---------s~ 474 (474) T protein:vir:94 434 LV--DDVDYELDEMEKESLEFNDKLPDIDEGDANDKSQNNQ---------SE 474 (474) T ss_pred CC--CCHHHHHHHHHHHHHHHHhhcccccCCCcCCCCcccc---------CC Confidence 53 22211 111000000000 0000000000000000 00 No 167 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.72 E-value=2.5e-08 Score=62.26 Aligned_cols=425 Identities=11% Similarity=0.055 Sum_probs=160.2 Q ss_pred CCCC--------------cccccccc--hhhhcccCccccCCCCHHHHHHHHh--hhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021537. 1 MSKA--------------EETTQLDE--RHIATDVGRGIQPPYNPETLAAFQE--LNETHQACIRKKSRYEAGYGFEIVA 62 (602) Q Consensus 1 ~~k~--------------~~~~~~~~--~~~~~~~~~~i~p~~~~~~l~~~~~--~~~~v~~cI~~ia~~ia~~~~~i~~ 62 (602) +... .....+.. 
+++...-.-..-|.--+..++.+.+ .+.+.+.||+..+..+.-.+|.+- T Consensus 23 ~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l~~~gf~~~- 101 (501) T protein:vir:25 23 MSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNLSVVGYRNA- 101 (501) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhhcccceecC- Confidence 0000 00011111 1111100000011111223333322 135788899988887766666431 Q ss_pred ecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccc Q lcl|NC_021537. 63 HPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATV 142 (602) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v 142 (602) +. .....+..++. .-++......+..+.+++|.||+.+.++..|. .+..++|..+ T Consensus 102 --d~-------~~~~~l~~i~~--------------~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~~--~i~~~sp~~~ 156 (501) T protein:vir:25 102 --LA-------KENDPAWEMWQ--------------RNRMDARQAEVHRPALTYGASYVTVTPTDEGP--VFRTRSPRQI 156 (501) T ss_pred --Cc-------cchHHHHHHHH--------------hcChhHHHHHHHHHHhhcCceEEEEecCCCCC--eEEEeccccE Confidence 11 01111222221 12345667788999999999999998887774 4556788887 Q ss_pred cccc-cccccc----cccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEec-------------C Q lcl|NC_021537. 143 RVRK-TTTTIE----REDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASD-------------A 204 (602) Q Consensus 143 ~~~~-~~~~~~----~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~-------------~ 204 (602) .+.. +..... ...............+..+......+.+..... .......+.+... . 
T Consensus 157 ~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (501) T protein:vir:25 157 LAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEV-----VLGDAGGGQATQQPVNVREVTDVIEHG 231 (501) T ss_pred EEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCce-----eeeeccccccccccccccccccccccc Confidence 6432 111000 000000000000111111111111100000000 0000000000000 0 Q ss_pred ceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHH Q lcl|NC_021537. 205 GELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMD 284 (602) Q Consensus 205 ~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~ 284 (602) .....++.=.|+||++....+ .+|.|.++.....++................+.|..++. |-..++ .+. |+ T Consensus 232 ~~~~~~~~vPiv~f~N~~~~~-~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~--G~~~~~--~~~----~~ 302 (501) T protein:vir:25 232 ATFEGKPVCPVVRFVNGRDAD-DMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVIS--GWTGSK--AEV----LK 302 (501) T ss_pred cccCCccceeeEeccCccccC-ccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHh--CCCCCc--cch----hh Confidence 011123333578887654333 368887766554444444433333333333444544332 211111 111 11 Q ss_pred HhhcccccCcceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhccccCCccC Q lcl|NC_021537. 285 NLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTSTSNRAN 363 (602) Q Consensus 285 ~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn 363 (602) . ..++++.++++ +.++..+. ..+++ |++.++..+..|++.=++|+..+|.... 
| .+ T Consensus 303 ~-----~~~~i~~~~~~-------------~~~~~q~~---~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~-N-~S 359 (501) T protein:vir:25 303 A-----SALRVWTFEDP-------------EVKAQAFP---PASVEPYNLILEEMLQHVAMVAQISPAQVTGKMI-N-VS 359 (501) T ss_pred h-----cccceeccCCC-------------CceEEEec---ccChHHHHHHHHHHHHHHHhhcCCChhhhccccC-C-hH Confidence 1 12344444321 11222222 22333 8889999999999999999999874332 2 22 Q ss_pred HHHHHHHHHHHHH----HHHHHHHHHHHhhh------cCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcc Q lcl|NC_021537. 364 SKEQTREFAKGII----EPEQAKFSARLYKI------IHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVG 433 (602) Q Consensus 364 ~e~~~~~f~~~~l----~P~~~~ie~~ln~~------Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~ 433 (602) .++.. +....| .-..+.|...|.+. +.........+.+++.+.+.... +....++++.+++..|+ T Consensus 360 g~Al~--~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~--s~~~~ada~~kl~~~gi- 434 (501) T protein:vir:25 360 AEALA--AAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDTAADSGAEVLWRDTEAR--SFGAVVDGITKLASAGI- 434 (501) T ss_pred HHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccceeeeEEecCCCCC--CHHHHHHHHHHHHhcCC- Confidence 23221 111111 11112222222211 11111111223344444433322 33456788888888886 Q ss_pred cHHHHH-HHhCCCCCCCC-ccccc-----cccccccccccccCCCcCcccccccccccccccccccccc Q lcl|NC_021537. 434 TVNEAR-EELDLAPFEDD-RGDMT-----LSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSV 495 (602) Q Consensus 434 T~NE~R-~~~Gl~p~~~g-~~d~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (602) +...+. .+.|+.+-+-. +.+.. ........+.++.+............++....... 
..+ T Consensus 435 s~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~g~ 501 (501) T protein:vir:25 435 PIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGVNGN--GGA 501 (501) T ss_pred CHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCccccccccCCCC--CCC Confidence 444443 34587642100 00000 00000011111111111111110001000000000 000 No 168 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=98.71 E-value=5.6e-09 Score=65.83 Aligned_cols=501 Identities=13% Similarity=0.055 Sum_probs=211.5 Q ss_pred CC--CC-----------cccccccch--hhhcccCccccCCCC--HHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_021537. 1 MS--KA-----------EETTQLDER--HIATDVGRGIQPPYN--PETLAAFQELNETHQACIRKKSRYEAGYGFEIVAH 63 (602) Q Consensus 1 ~~--k~-----------~~~~~~~~~--~~~~~~~~~i~p~~~--~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~ 63 (602) |. |+ +.++++..- .+... ..+++.-+ .+. =.+++.-+-++-.|.-+++.++.+-+....- T Consensus 9 ~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks--~~~~~~~~WQ~eA-W~~~d~v~Elry~vgW~~~s~Sr~rL~as~i 85 (629) T protein:vir:86 9 VRRPKSEPVSTRQRALVAASQPVENPGKAFRKA--MGSSTRTDWQEDA-WKAYDAVGELRYYVGWRSSSASRVRLIASAI 85 (629) T ss_pred eecCCCCChhhhhhhhhhhhhccccccchhhhh--cCCCchhhhhHHH-HHHHHhhhhHHHHhhhhhhhhceeeeEeeee Confidence 11 11 111222111 11110 11121111 111 1334445777888889999999988766555 Q ss_pred cCCCCc-----ccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC------ceE Q lcl|NC_021537. 64 PSADEP-----DEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG------TPV 132 (602) Q Consensus 64 ~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G------~~~ 132 (602) +.+.+. +++.....++...+... 
....+-..++++.+..++-+-|++|+-++--..| .++ T Consensus 86 dpDtg~ptg~i~e~~~~~~~v~~~v~~i---------~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~ 156 (629) T protein:vir:86 86 DPDTGLPTGSIDEDDRVGARVQQIVNQI---------AGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPV 156 (629) T ss_pred cCCCCCCccccCCCchhHHHHHHHHHhh---------cCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcch Confidence 433221 11111112222222221 1223567899999999999999999987633222 222 Q ss_pred E-EEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEec Q lcl|NC_021537. 133 G-LAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGP 211 (602) Q Consensus 133 ~-L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 211 (602) . ++.|-+.-|+-... + +.+.. +.+..-...+ T Consensus 157 ~eW~~vt~~ei~~~~~-----------------~-------------------------~~i~l------P~g~~~e~~~ 188 (629) T protein:vir:86 157 PEWLALTPEEVRASEK-----------------K-------------------------TIIEL------PTGDKHEFRD 188 (629) T ss_pred hhheeechHHhhhccC-----------------c-------------------------eeeEc------CCCCcceeeC Confidence 2 22222222210000 0 00000 0111111122 Q ss_pred hhHEEEecC--CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecccc-C------------C---- Q lcl|NC_021537. 212 ANELIFLPN--PSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGT-L------------S---- 272 (602) Q Consensus 212 ~~eviH~r~--~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~------------~---- 272 (602) ..+++ ||. +.|....+--||+.+++..+....-......+..+.-.+-.|+|.++... + . T Consensus 189 ~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p 267 (629) T protein:vir:86 189 GLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAP 267 (629) T ss_pred CCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeeeccCcccCccCCCCCCCCCCcccc Confidence 22333 553 44455567789999998887666555555555555555556665554321 0 0 Q ss_pred ----HHHHHHHHHHHHH-----hh--cccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHH Q lcl|NC_021537. 
273 ----EDSKEDLRNLMDN-----LK--GSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHE 341 (602) Q Consensus 273 ----~~~~~~l~~~~~~-----~~--g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~ 341 (602) .-+.+.|.+.+-+ +. +...+...++.. .++..-. +++-|...+..+.--+.+++..+.. T Consensus 268 ~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~--------~P~E~i~--~i~hlkf~~ei~e~aiktR~daI~R 337 (629) T protein:vir:86 268 PILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAA--------APGELIK--NVTHLKFDNQVTEVAIKTRNDAIAR 337 (629) T ss_pred cccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEe--------echHHhc--CeeEEeecCchhHHHHhhHHHHHHH Confidence 1133334444432 11 122222222221 1111111 3344444566677778999999999 Q ss_pred HHHHhcCChH-HhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcc-----ccccceEEEeccchhcchhH Q lcl|NC_021537. 342 IAKVHGVPPV-LINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDA-----LDVDEWTIDFELRGAEQPEQ 415 (602) Q Consensus 342 Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~-----~~~~~~~~~f~~~~~~~~~~ 415 (602) +|+.+-|||+ +||..+++|-=++=+....-++-.|.|.+..|+++|++.+|.+. -+..+|-+-||.+.+.. T Consensus 338 lA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp~Le~eGiDp~kYvvW~DaS~Lt~--- 414 (629) T protein:vir:86 338 LAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQVLRTVLMREGIDPNAYVVWHDASQLTV--- 414 (629) T ss_pred HHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCccccc--- Confidence 9999999877 45776566532222222223466799999999999998776432 23356788899887643 Q ss_pred HHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCc-----------cccc-ccccc----ccccccccCCCcCcccccc Q lcl|NC_021537. 416 DAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDR-----------GDMT-LSEFE----AEFGADASDGDAEAMLTRS 479 (602) Q Consensus 416 d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~-----------~d~~-~~~~~----~~~~~~~~~~~~~~~~~~~ 479 (602) |.....+++ .+.+.|.||-...|+.+|+.--.+.+ .+.. ..+.. .+...+..... .+++. 
T Consensus 415 dPd~~deA~-~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~Li~~~a~l~~~~a~~~---~P~~~ 490 (629) T protein:vir:86 415 DPDKTDEAR-DAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNLLPTLAVLIPELADVE---FPTPT 490 (629) T ss_pred CCCCcHHHH-HHHHcCCcCHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcchhhhhhhhhhhhcccc---cCccC Confidence 222223343 47889999999999999985422211 0000 00110 00100000000 00110 Q ss_pred cccccc-cccccccccccccccccchhhhhcchhhhhhheecccccEEEEEEecccCCcceeeeccCCHHHHHHHhCCCc Q lcl|NC_021537. 480 KAAPPL-ENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFGERELYLSFKRESGQNSLYVYVDVPAAVWSALVSAPS 558 (602) Q Consensus 480 ~~~~~~-~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~~~~~a~s 558 (602) ...++. +++.++..........++-+..+ .+...-..+-++. + +. -|-.-++.+|--|.- T Consensus 491 ~~~pp~~e~~~~dE~sga~~~~ep~te~d~-~~~~a~~aa~~~~-------~-----~a------~V~llv~RALelAGk 551 (629) T protein:vir:86 491 VALPPAEEQDGDEEASGASRREEPDTEDDA-GTDDSDQASLDSR-------E-----TA------MVEALVFRALELAGK 551 (629) T ss_pred CCCCccccCCCcccccCCCcCCCCCCCCCC-cccccCCCCCCCc-------H-----HH------HHHHHHHHHHHhcCC Confidence 111110 00000000000000000000000 0000000000000 0 00 033334444444444 Q ss_pred cchhhhhhhccccccccccc--chhcccCCCCCChhhcCCc----ccccC Q lcl|NC_021537. 559 AGSYHYSEIRLQYGYLEVTN--NHERLPEGPTPDPGEAPED----VPSDI 602 (602) Q Consensus 559 ~g~~~~~~i~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~----~~~~~ 602 (602) +++ .+. .+|.|+.|.. 
.|..+.-++......+-++ +-.++ T Consensus 552 R~r--~r~--~~a~~r~v~~he~h~~l~Pv~~~~v~rli~gwd~~ld~~~ 597 (629) T protein:vir:86 552 RSR--TRS--LPYELRQLSDRELVRRLEPVRREHVADLIRGWDSMLEERA 597 (629) T ss_pred cCC--Chh--hHHHHhccChhhcceecCCCChHHHHHHHHHHHHHHHHHH Confidence 331 011 1223333311 1111111100000000000 00011 No 169 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=98.69 E-value=5.8e-09 Score=65.73 Aligned_cols=498 Identities=13% Similarity=0.062 Sum_probs=212.5 Q ss_pred CC--CC-----------cccccccch--hhhcccCccccCCCC--HHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_021537. 1 MS--KA-----------EETTQLDER--HIATDVGRGIQPPYN--PETLAAFQELNETHQACIRKKSRYEAGYGFEIVAH 63 (602) Q Consensus 1 ~~--k~-----------~~~~~~~~~--~~~~~~~~~i~p~~~--~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~ 63 (602) |. |+ +.++++..- .+... ..+++.-+ .+. =.+++.-+-++-.|.-+++.++.+-+....- T Consensus 9 ~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks--~~~~~~~~WQ~eA-W~~~d~v~Elry~vgW~~~s~Sr~rL~as~i 85 (629) T protein:vir:99 9 VRRPKSEPVSTRQRALVAASQPVENPGKAFRKA--MGSSTRTDWQDDA-WKAYDAVGELRYYVGWRSSSASRVRLIASAI 85 (629) T ss_pred eecCCCCChhhhhhhhhhhhhcccccchhhhhh--cCCCchhhhhHHH-HHHHHhhhhHHHHhhhhhhhhceeeeEeeee Confidence 10 11 111222111 11110 11121111 111 1334445777888889999999988766555 Q ss_pred cCCCCc-----ccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC------ceE Q lcl|NC_021537. 64 PSADEP-----DEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG------TPV 132 (602) Q Consensus 64 ~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G------~~~ 132 (602) +.+.+. +++.....++...+... 
....+-..++++.+..++-+-|++|+-++--..| .++ T Consensus 86 dpDtg~ptg~i~e~~~~~~~v~~~v~~i---------~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~ 156 (629) T protein:vir:99 86 DPDTGLPTGSIDEDDRVGARVQQIVNQI---------AGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPV 156 (629) T ss_pred cCCCCCCccccCCCchhHHHHHHHHHhh---------cCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcch Confidence 433221 11111112222222221 1223567899999999999999999987733222 222 Q ss_pred E-EEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEec Q lcl|NC_021537. 133 G-LAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGP 211 (602) Q Consensus 133 ~-L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 211 (602) . ++.|-+.-|+-... + +.+... .+..-...+ T Consensus 157 ~eW~~vt~~ei~~~~~-----------------~-------------------------~~i~lP------~g~~~e~~~ 188 (629) T protein:vir:99 157 PEWLALTPEEVRASEK-----------------K-------------------------TIIELP------TGDKHEFRD 188 (629) T ss_pred hhheeechHHhhhccC-----------------c-------------------------eeEEcC------CCCccceeC Confidence 2 22222222211000 0 000000 011111112 Q ss_pred hhHEEEecC--CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecccc-C------------C---- Q lcl|NC_021537. 212 ANELIFLPN--PSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGT-L------------S---- 272 (602) Q Consensus 212 ~~eviH~r~--~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~------------~---- 272 (602) ..+++ ||. +.|....+--||+.+++..+....-......+..+.-.+-.|+|.++... + . T Consensus 189 ~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p 267 (629) T protein:vir:99 189 GLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAP 267 (629) T ss_pred CCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeEeccCcccCccCCCCCCCCCCcccc Confidence 23333 553 34455567789999998887666655555555555555556665554321 0 0 Q ss_pred ----HHHHHHHHHHHHH-----hh--cccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHH Q lcl|NC_021537. 
273 ----EDSKEDLRNLMDN-----LK--GSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHE 341 (602) Q Consensus 273 ----~~~~~~l~~~~~~-----~~--g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~ 341 (602) .-+.+.|.+.+-+ +. +...+...++.. .++..-. +++-|...+..+.--+.+++..+.. T Consensus 268 ~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~--------~P~E~i~--~i~hlkf~~ei~e~aiktR~daI~R 337 (629) T protein:vir:99 268 PILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAA--------APGELIK--NVTHLKFDNQVTEVAIKTRNDAIAR 337 (629) T ss_pred cccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEe--------echHHhc--CeeEEeecCchhHHHHhhHHHHHHH Confidence 1133334444432 11 122222222221 1111111 3344444566677778999999999 Q ss_pred HHHHhcCChH-HhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcc-----ccccceEEEeccchhcchhH Q lcl|NC_021537. 342 IAKVHGVPPV-LINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDA-----LDVDEWTIDFELRGAEQPEQ 415 (602) Q Consensus 342 Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~-----~~~~~~~~~f~~~~~~~~~~ 415 (602) +|+.+-|||+ +||..+++|-=++=+....-++-.|.|.+..|+++|++.+|.+. -+..+|-+-||.+.+.. T Consensus 338 lA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp~Le~eGiDp~kYvvW~DaS~Lt~--- 414 (629) T protein:vir:99 338 LAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQVLRTVLMREGIDPNAYVVWHDASQLTV--- 414 (629) T ss_pred HHhccCCchhhheeccCCccceEEEEecccceeeecchhHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCccccc--- Confidence 9999999877 45776566532222222223466799999999999998776432 23356788899887643 Q ss_pred HHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCc-----------cccc-ccccc----ccccccccCCCcCcccccc Q lcl|NC_021537. 416 DAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDR-----------GDMT-LSEFE----AEFGADASDGDAEAMLTRS 479 (602) Q Consensus 416 d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~-----------~d~~-~~~~~----~~~~~~~~~~~~~~~~~~~ 479 (602) |.....+++ .+.+.|.||-...|+.+|+.--.+.+ .+.. ..+.. ++...+..... .+++. 
T Consensus 415 dPd~~deA~-~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~Li~~~a~l~~~~a~~~---~P~~~ 490 (629) T protein:vir:99 415 DPDKTDEAR-DAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNLLPTLAVLIPELADVE---FPTPT 490 (629) T ss_pred CCCCcHHHH-HHHHcCCccHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcchhhhhhhhhhhhcccc---cCccC Confidence 222223343 47889999999999999985422211 0000 00110 00100000000 00111 Q ss_pred cccccc-cccccccccccccccccchhhhhcchhhhhhheecccccEEEEEEecccCCcceeeeccCCHHHHHHHhCCCc Q lcl|NC_021537. 480 KAAPPL-ENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFGERELYLSFKRESGQNSLYVYVDVPAAVWSALVSAPS 558 (602) Q Consensus 480 ~~~~~~-~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~~~~~a~s 558 (602) ...++. +++.++..........++-+..+ .+...-..+-++. + +. -|-.-++.+|--|.- T Consensus 491 ~~~pp~~e~~~~dE~sga~~~~ep~te~d~-~~~~a~~aa~~~~-------~-----~a------~V~llv~RALelAGk 551 (629) T protein:vir:99 491 VALPPAEEQDGDEEASGASRREEPDTEDDA-GTDDSDQASLDSR-------E-----TA------MVEALVFRALELAGK 551 (629) T ss_pred CCCCccccCCCcccccCCCcCCCCCCCCCC-cccccCCCCCCCc-------H-----HH------HHHHHHHHHHHhcCC Confidence 111110 00000000000000000000000 0000000000000 0 00 033344455554544 Q ss_pred cchhhhhhhcccccccccccc--hhcccCCCCCChhhcCC---c----c-----------cccC Q lcl|NC_021537. 559 AGSYHYSEIRLQYGYLEVTNN--HERLPEGPTPDPGEAPE---D----V-----------PSDI 602 (602) Q Consensus 559 ~g~~~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~---~----~-----------~~~~ 602 (602) +++ .+. .+|.|+.|... |..+..++ ..+++. 
+ + +..| T Consensus 552 R~r--~r~--~~ar~r~v~~he~h~~l~Pv~---~~~i~rli~gwd~~ld~~~~~~Lg~d~~~l 608 (629) T protein:vir:99 552 RSR--TRS--LPYELRQLSDRELVRRLEPVR---REHVADLIRGWDSMLEERAVQALNMNIPGI 608 (629) T ss_pred cCC--Chh--hHHHHhcCchhhceeecCCCC---HHHHHHHHHHHHHHHHHHHHHHhCCCHHHH Confidence 332 011 12333333211 11111110 111110 0 0 0000 No 170 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=98.65 E-value=8.1e-08 Score=59.46 Aligned_cols=516 Identities=10% Similarity=0.045 Sum_probs=212.9 Q ss_pred CCC------------------CcccccccchhhhcccCccccCCCC--HHHHHHHHhhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_021537. 1 MSK------------------AEETTQLDERHIATDVGRGIQPPYN--PETLAAFQELNETHQACIRKKSRYEAGYGFEI 60 (602) Q Consensus 1 ~~k------------------~~~~~~~~~~~~~~~~~~~i~p~~~--~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i 60 (602) +.. .+.+.++....-... ..++.+.-+ .+. =.+++.-|-++-.|.-+++.++.+-+.. T Consensus 4 ~rPk~~p~~p~~~~~arrr~LtaAsa~l~~~~~~~~-kt~~~~~~~WQ~eA-W~~~d~vpELry~vgW~~~a~SR~rL~a 81 (646) T protein:vir:10 4 LKPKSAPPEPFGAEVARRIALAGATAQVDLGASSSW-KTWKFGNKDWQTEG-WRLYDIIPEHHFLAGRIGDSVAQARLYV 81 (646) T ss_pred cCCCCCCCCcccccccchhhhhhccccccCCCccee-ecCCCcchhhhHHH-HHHHhhhhhHhhHhhhhhhhhceeeeee Confidence 110 111223322211000 011222111 111 1334444678888899999999998877 Q ss_pred EEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEE---eeCCCCceEEEEEe Q lcl|NC_021537. 61 VAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEI---LVEGDGTPVGLAHV 137 (602) Q Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i---~r~~~G~~~~L~~l 137 (602) ...++. ++....-....+........ ....-..++++.+..++-+-|++|+.. 
.....+.--.++.+ T Consensus 82 seiddt-G~~tg~v~~~~v~~iv~~~~---------Gg~~gQ~qlLkr~~~~ltV~GE~wiv~~~~~~~~~~~~~~W~vv 151 (646) T protein:vir:10 82 TEVDDT-GEETGEVQDERIKRLAAVPL---------GTGSQRDDNLRLAGLDLAVGGECWIVGEGAATSPEAAEGSWFVV 151 (646) T ss_pred eeecCC-CCCcCccchHHHHHHhhhhc---------cchhhHHHHHHHHHhheecccceEEeeccccCCCCCCccceeee Confidence 655533 33222222233333322211 112335789999999999999999842 11111111123333 Q ss_pred CcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEE Q lcl|NC_021537. 138 PAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIF 217 (602) Q Consensus 138 ~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH 217 (602) -...|...... .++. .. ... .++..+.....++ . T Consensus 152 t~~Ev~~tg~~--------------------~~i~--------~p-----~~~------------~g~~~v~~~~~d~-l 185 (646) T protein:vir:10 152 TGSAISRTGDE--------------------IAVR--------RP-----QQR------------GGSKLVLVDGQDI-L 185 (646) T ss_pred cHHHhccCCCe--------------------eeee--------cC-----ccC------------CCCCcceecCCce-E Confidence 33333211000 0000 00 000 0011111222233 2 Q ss_pred ecC--CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecccc------CCHHHHHHHHHHHHH---- Q lcl|NC_021537. 218 LPN--PSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGT------LSEDSKEDLRNLMDN---- 285 (602) Q Consensus 218 ~r~--~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~------~~~~~~~~l~~~~~~---- 285 (602) ||. +.|....+--||+.+++.++....-......+..+.-.+-.|+|.+|.+. -++.....|...+-+ T Consensus 186 vRiW~P~Prr~~epDSpvra~l~~l~Ei~~lt~~I~aaakSRL~GnGvLfvP~e~s~p~~~~~~a~~~~l~~~l~qaa~t 265 (646) T protein:vir:10 186 IRCWRPHPNDTDQADSFTRSAIVPLREIELLTKREFAELDSRLTGAGIMFLPEGVDFPRGEEDPAGLAGFMAYLQRAAAA 265 (646) T ss_pred EEEecCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHHHHhcCceeeeccccccCCCCCCCcchhHHHHHHHHHHHh Confidence 343 34455567789999998888776666666666666666777787776432 122233344444322 Q ss_pred -hh--cccccCcceeccCCccceecccccc-ccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHH-hhccccCC Q lcl|NC_021537. 
286 -LK--GSRYRTAILEVEEFVDDHGLGDGGS-DVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVL-INVTSTSN 360 (602) Q Consensus 286 -~~--g~~nag~~~~~~~g~~~~~~~~~~~-~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~ 360 (602) +. +...+...++... ++.. ..--+++.+...+..+.--+.+++..+..+|+.+-|||+. ||.. ++| T Consensus 266 Ai~De~S~aA~vPiia~~--------P~E~i~~~~~ik~l~f~~eite~aiktR~daI~RlA~glDIppE~LLGlg-d~N 336 (646) T protein:vir:10 266 SMADQSRASAMVPIMATI--------PNEMMEHLDKIKPLTFWSELSAEITPMKDKAIARLASSAEIPGEVLTGIG-DAN 336 (646) T ss_pred hhcCCCCccceeeeEEee--------ChHHHhhhhcceeeccCchhhHHHhhhHHHHHHHHHhccCCchhheeecc-ccc Confidence 11 1222222222211 1100 0001445555666667777899999999999999998875 4765 454 Q ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccc------cccceEEEeccchhcchhHHHHHHHHHHHHHHhCCccc Q lcl|NC_021537. 361 RANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDAL------DVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGT 434 (602) Q Consensus 361 ~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~------~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T 434 (602) .=++=+....-++ .|.|.+..|+++|+..+|.+.. +..+|-+-||.+.+.. |.....+++ .+.+.|.|| T Consensus 337 HWtAWqI~de~vr-HI~P~l~~ic~AlT~~~Lrp~Le~eGi~dp~kyvvW~DaS~Lt~---~pd~~deA~-qa~drGAIt 411 (646) T protein:vir:10 337 HWTAWLISDEGIR-WIRGYLGLIADALTRGFLRRALESMGVTNPERYAFAFDTSTLAS---KPNRLDEAI-QLHERNLIK 411 (646) T ss_pred eeeeeeeccccch-hhhhHHHHHHHHHHhhHHHHHHHHcCCCChhHeEEeecCccccc---CCCCcHHHH-HHHHcCCcc Confidence 3222122222234 6999999999999988765322 2345788899887643 222223343 478899999 Q ss_pred HHHHHHHhCCCCCCCCccccc----------ccccc-------ccccccc-cCCC-cC--ccccccc-cccccccccccc Q lcl|NC_021537. 435 VNEAREELDLAPFEDDRGDMT----------LSEFE-------AEFGADA-SDGD-AE--AMLTRSK-AAPPLENKIGER 492 (602) Q Consensus 435 ~NE~R~~~Gl~p~~~g~~d~~----------~~~~~-------~~~~~~~-~~~~-~~--~~~~~~~-~~~~~~~~~~~~ 492 (602) -...|+.+|+.--++...++. ..+.. ...+... +... .+ .+.++.. +++..+....+. 
T Consensus 412 ~eAlrk~~Gf~~dd~pt~~E~~~~~~~~~v~~~P~Lil~P~~qa~~~~P~~~~~~lpp~~~~~~dg~~~~~e~~g~~~~~ 491 (646) T protein:vir:10 412 DEEVVKAGAFSVDQMPTVQERAVQILLGLVKTQPDLILDPAIQAALGLPAVQSVGLPPTAAQRTDGDLDDDESEGAPNGG 491 (646) T ss_pred HHHHHHHhcccccccCChHHHHHHHHHHHhcCCccccccchhhccccCCCcCccccCCcccccccCCCCChhhcCCCCCC Confidence 999999999853322211000 00100 0000000 0000 00 0000000 000000000000 Q ss_pred ccccccccccchhhhhcchhhhhhheecccccEEEEEEeccc-CCcceeeeccCCHHHHHHHhCCCccchhhhhhhcccc Q lcl|NC_021537. 493 DSVDVDVSKDPIEQTTFSSSNLDEGLYDFGERELYLSFKRES-GQNSLYVYVDVPAAVWSALVSAPSAGSYHYSEIRLQY 571 (602) Q Consensus 493 ~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~~~l~~~f~~~~-~~~~~y~y~~v~~~~~~~~~~a~s~g~~~~~~i~~~~ 571 (602) ......-..+ -..+. ....+.+ ++--+|+--. .+...|. .-|-..++..|--|.-.=+- .+.. ++ T Consensus 492 E~~~~pda~~---~~a~~-~~~~~r~------~~~~~~~~~~~~p~a~~~-aav~l~v~RAL~lAG~Rlrt-~~~~--~a 557 (646) T protein:vir:10 492 EAPDQPDADE---ARAIT-AALDRRI------ALAARPVLALPSPEAVFN-ASAKLMILRALELAGGRLTT-PAER--RG 557 (646) T ss_pred ccCCCCCCCc---ccccc-ccccccc------hhhhhhhhccccchhHHH-HHHHHHHHHHHHhccccccC-chhh--hH Confidence 0000000000 00000 0000100 1111111000 0000000 11222233333333222110 0001 11 Q ss_pred cccccccc--hhcccCCCCCChh-----------hcCCcccccC Q lcl|NC_021537. 572 GYLEVTNN--HERLPEGPTPDPG-----------EAPEDVPSDI 602 (602) Q Consensus 572 ~~~~~~~~--~~~~~~~~~~~~~-----------~~~~~~~~~~ 602 (602) .|+.|... |..+.-+...... +|--.++-|. 
T Consensus 558 ~~r~vp~he~h~~l~Pv~~~~~~rl~~G~wd~~~~v~~~lg~D~ 601 (646) T protein:vir:10 558 RWSDVPRHELHHHVGPITPDKARRVTEGAWNHVAVAAADLGVDA 601 (646) T ss_pred HhhcCChhhceeecCCCChhhHHHHHhcccccHHHHHHhcCCCh Confidence 22222111 1111111000000 1111111111 No 171 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=98.61 E-value=6.7e-09 Score=65.40 Aligned_cols=514 Identities=13% Similarity=0.089 Sum_probs=211.8 Q ss_pred CC--CCcc-----------ccccc--chhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021537. 1 MS--KAEE-----------TTQLD--ERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPS 65 (602) Q Consensus 1 ~~--k~~~-----------~~~~~--~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~ 65 (602) |. |+.. ++.+. .+.+++..++.=.-..-.+.. .+++.-+-++-.|.-+++.++.+-+....-+. T Consensus 9 ~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW-~~~d~v~Elry~vgW~~~s~sr~rL~as~idp 87 (639) T protein:vir:10 9 VRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAW-DFSESIGELSYYVSWRANSCSRTTLIPSAIDP 87 (639) T ss_pred eecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhh-hhhhhhhhHHHHhhhhhhhhceeeeEeeeecc Confidence 11 1100 01110 122222211110000011111 33344466777888899999988876655443 Q ss_pred CCCcc------cchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEe-eCCCCc------eE Q lcl|NC_021537. 66 ADEPD------EGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEIL-VEGDGT------PV 132 (602) Q Consensus 66 ~~~~~------~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~-r~~~G~------~~ 132 (602) +.+.. ++......+...+.+ + ....+-..++++.+..++-+-|.+|+-++ |..++. +. 
T Consensus 88 Dtg~PtG~V~~E~d~~~~~v~~~v~~------i---agG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~ 158 (639) T protein:vir:10 88 DTGLPTGEVDIEEDPDAQTVADYVKG------I---ADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPR 158 (639) T ss_pred ccCCCCCccccccccCcchHHHHHHh------h---cCccchHHHHHHHHHhheecccceEEEEEEecCccccCcccccc Confidence 32211 111111122222111 1 12235668899999999999999998654 344432 22 Q ss_pred EEEE-eCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEec Q lcl|NC_021537. 133 GLAH-VPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGP 211 (602) Q Consensus 133 ~L~~-l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 211 (602) +-|+ +-...|+.. +..+.++. ..+| ...+|. T Consensus 159 ~~W~vvs~~Ei~~~-------------------~~~~~~i~----------------------lPdG-------~~he~~ 190 (639) T protein:vir:10 159 ARWYAVTREEIKSK-------------------AGETAEIS----------------------LPDG-------KTHEFN 190 (639) T ss_pred cceeeeeHHHhccc-------------------CCCeeEee----------------------cCCC-------CCcccc Confidence 2222 222222210 11111110 0011 000111 Q ss_pred h--hHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccC------------------ Q lcl|NC_021537. 212 A--NELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTL------------------ 271 (602) Q Consensus 212 ~--~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~------------------ 271 (602) - +-++-+=.+.|....+--||+.+++..+....-......+..+.-.+-.|+|.+|.... T Consensus 191 ~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~ 270 (639) T protein:vir:10 191 RDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGA 270 (639) T ss_pred CCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeeeccccCCCCccccccccccccCcc Confidence 0 11222223445555677899999988876666555555555555555666665543210 Q ss_pred ------CHHHHHHHHHHHHH-----hh--cccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhh Q lcl|NC_021537. 
272 ------SEDSKEDLRNLMDN-----LK--GSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERN 338 (602) Q Consensus 272 ------~~~~~~~l~~~~~~-----~~--g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~ 338 (602) +..+.+.|...+-+ +. +...+...++... + .+.--+++.|...+..+.--+.+++.. T Consensus 271 ~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~--------p--~E~l~~ikhl~f~~ei~e~aiktR~da 340 (639) T protein:vir:10 271 PVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV--------A--AEHLEKVQHIKFGNEVTEVEIKTRIDA 340 (639) T ss_pred cccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEee--------c--hHHhcCeeeeeecCchhHHHHhhHHHH Confidence 01123334444322 11 1122222222211 1 112225666776777778889999999 Q ss_pred HHHHHHHhcCChHH-hhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcc-----ccccceEEEeccchhcc Q lcl|NC_021537. 339 EHEIAKVHGVPPVL-INVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDA-----LDVDEWTIDFELRGAEQ 412 (602) Q Consensus 339 ~~~Ia~~fgVPp~~-lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~-----~~~~~~~~~f~~~~~~~ 412 (602) +..+|+.+-|||+. ||. +++|-=++=+....-++-.|.|.+..|+++|+..+|.+. -+..+|-+-||.+.+.. T Consensus 341 I~RlA~glDi~pE~LLGl-~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~ 419 (639) T protein:vir:10 341 ITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTS 419 (639) T ss_pred HHHHHhccCCchhheeec-ccccceEEEEecccceeeecchhHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCccccc Confidence 99999999998875 576 445432221222223466799999999999998776432 23456788899887643 Q ss_pred hhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccc----------------ccccccccc-ccccCCCcCcc Q lcl|NC_021537. 413 PEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMT----------------LSEFEAEFG-ADASDGDAEAM 475 (602) Q Consensus 413 ~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~----------------~~~~~~~~~-~~~~~~~~~~~ 475 (602) |.....+++ .+.+.|.||-.-.|+.+|+.--++-+-+.. +.+...+.. 
...+.-+.+ T Consensus 420 ---dPd~~deA~-qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~~V~~~P~li~~~apl~~P~lq~~e~p-- 493 (639) T protein:vir:10 420 ---DPDLSDEAV-EAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFP-- 493 (639) T ss_pred ---CCCCcHHHH-HHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHHHHhcCCcchhhhhhhccCccceecccC-- Confidence 222223443 478899999888899999853321110000 001111111 111110100 Q ss_pred ccccccccccccccc-----------ccccccccccccchh----hhhcc--hhhhhhheecccccEEEEEEecccCCcc Q lcl|NC_021537. 476 LTRSKAAPPLENKIG-----------ERDSVDVDVSKDPIE----QTTFS--SSNLDEGLYDFGERELYLSFKRESGQNS 538 (602) Q Consensus 476 ~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~m~----~~~v~--ss~~~~~~yd~~~~~l~~~f~~~~~~~~ 538 (602) +.+...+..+.+.. +.............. -..|. .-.+.....-++.+.+-++ .... T Consensus 494 -tp~~a~~~a~~~~~~de~~ga~~~~ePdte~~~~~~~a~~~~~~a~~v~a~~llv~RALelAGkRr~~~~-----~r~~ 567 (639) T protein:vir:10 494 -QPANAIESTREDEEDDEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRFKVN-----DAAL 567 (639) T ss_pred -CCCCCCCCCCCCCCcccccCCCCCcCCCcccccCCccccCcCchhHHHHHHHHHHHHHHHhhcccccCCC-----Chhh Confidence 00111110000000 000000000000000 00000 0001111111111111110 0122 Q ss_pred eeeeccCCHHHHHHHh-CCCccchhhhhhhcccccccccccchhcccCCCCCChhhcCCcccccC Q lcl|NC_021537. 
539 LYVYVDVPAAVWSALV-SAPSAGSYHYSEIRLQYGYLEVTNNHERLPEGPTPDPGEAPEDVPSDI 602 (602) Q Consensus 539 ~y~y~~v~~~~~~~~~-~a~s~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 602 (602) -++|.+||++.|---| ..++ .-..+-|.+ ..-+- .+.......-|++.+---|-.-+ T Consensus 568 ~a~~r~vp~he~H~~l~Pv~~--~~~~rli~g---wd~~l--d~~~~a~lg~D~~~lr~~v~~~v 625 (639) T protein:vir:10 568 KTKLRDVPAHEYHRVLPPVRS--SEIPRLIAG---WDTAL--EDEVVASLGLDNEKLRNAVLATV 625 (639) T ss_pred HHHhhcCChhHceeecCCCCh--HHHHHHHHH---HHhHH--HHHHHHHhCCCHHHHHHHHHHHH Confidence 3555666655432111 0000 000000000 00000 00000001111111000000000 No 172 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=98.61 E-value=6.7e-09 Score=65.40 Aligned_cols=514 Identities=13% Similarity=0.089 Sum_probs=211.8 Q ss_pred CC--CCcc-----------ccccc--chhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021537. 1 MS--KAEE-----------TTQLD--ERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPS 65 (602) Q Consensus 1 ~~--k~~~-----------~~~~~--~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~ 65 (602) |. |+.. ++.+. .+.+++..++.=.-..-.+.. .+++.-+-++-.|.-+++.++.+-+....-+. T Consensus 9 ~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW-~~~d~v~Elry~vgW~~~s~sr~rL~as~idp 87 (639) T protein:vir:97 9 VRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAW-DFSESIGELSYYVSWRANSCSRTTLIPSAIDP 87 (639) T ss_pred eecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhh-hhhhhhhhHHHHhhhhhhhhceeeeEeeeecc Confidence 11 1100 01110 122222211110000011111 33344466777888899999988876655443 Q ss_pred CCCcc------cchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEe-eCCCCc------eE Q lcl|NC_021537. 66 ADEPD------EGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEIL-VEGDGT------PV 132 (602) Q Consensus 66 ~~~~~------~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~-r~~~G~------~~ 132 (602) +.+.. 
++......+...+.+ + ....+-..++++.+..++-+-|.+|+-++ |..++. +. T Consensus 88 Dtg~PtG~V~~E~d~~~~~v~~~v~~------i---agG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~ 158 (639) T protein:vir:97 88 DTGLPTGEVDIEEDPDAQTVADYVKG------I---ADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPR 158 (639) T ss_pred ccCCCCCccccccccCcchHHHHHHh------h---cCccchHHHHHHHHHhheecccceEEEEEEecCccccCcccccc Confidence 32211 111111122222111 1 12235668899999999999999998654 344432 22 Q ss_pred EEEE-eCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEec Q lcl|NC_021537. 133 GLAH-VPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGP 211 (602) Q Consensus 133 ~L~~-l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 211 (602) +-|+ +-...|+.. +..+.++. ..+| ...+|. T Consensus 159 ~~W~vvs~~Ei~~~-------------------~~~~~~i~----------------------lPdG-------~~he~~ 190 (639) T protein:vir:97 159 ARWYAVTREEIKSK-------------------AGETAEIS----------------------LPDG-------KTHEFN 190 (639) T ss_pred cceeeeeHHHhccc-------------------CCCeeEee----------------------cCCC-------CCcccc Confidence 2222 222222210 11111110 0011 000111 Q ss_pred h--hHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccC------------------ Q lcl|NC_021537. 212 A--NELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTL------------------ 271 (602) Q Consensus 212 ~--~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~------------------ 271 (602) - +-++-+=.+.|....+--||+.+++..+....-......+..+.-.+-.|+|.+|.... 
T Consensus 191 ~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~ 270 (639) T protein:vir:97 191 RDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGA 270 (639) T ss_pred CCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeeeccccCCCCccccccccccccCcc Confidence 0 11222223445555677899999988876666555555555555555666665543210 Q ss_pred ------CHHHHHHHHHHHHH-----hh--cccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhh Q lcl|NC_021537. 272 ------SEDSKEDLRNLMDN-----LK--GSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERN 338 (602) Q Consensus 272 ------~~~~~~~l~~~~~~-----~~--g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~ 338 (602) +..+.+.|...+-+ +. +...+...++... + .+.--+++.|...+..+.--+.+++.. T Consensus 271 ~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~--------p--~E~l~~ikhl~f~~ei~e~aiktR~da 340 (639) T protein:vir:97 271 PVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV--------A--AEHLEKVQHIKFGNEVTEVEIKTRIDA 340 (639) T ss_pred cccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEee--------c--hHHhcCeeeeeecCchhHHHHhhHHHH Confidence 01123334444322 11 1122222222211 1 112225666776777778889999999 Q ss_pred HHHHHHHhcCChHH-hhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcc-----ccccceEEEeccchhcc Q lcl|NC_021537. 339 EHEIAKVHGVPPVL-INVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDA-----LDVDEWTIDFELRGAEQ 412 (602) Q Consensus 339 ~~~Ia~~fgVPp~~-lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~-----~~~~~~~~~f~~~~~~~ 412 (602) +..+|+.+-|||+. ||. +++|-=++=+....-++-.|.|.+..|+++|+..+|.+. -+..+|-+-||.+.+.. 
T Consensus 341 I~RlA~glDi~pE~LLGl-~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~ 419 (639) T protein:vir:97 341 ITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTS 419 (639) T ss_pred HHHHHhccCCchhheeec-ccccceEEEEecccceeeecchhHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCccccc Confidence 99999999998875 576 445432221222223466799999999999998776432 23456788899887643 Q ss_pred hhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccc----------------ccccccccc-ccccCCCcCcc Q lcl|NC_021537. 413 PEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMT----------------LSEFEAEFG-ADASDGDAEAM 475 (602) Q Consensus 413 ~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~----------------~~~~~~~~~-~~~~~~~~~~~ 475 (602) |.....+++ .+.+.|.||-.-.|+.+|+.--++-+-+.. +.+...+.. ...+.-+.+ T Consensus 420 ---dPd~~deA~-qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~~V~~~P~li~~~apl~~P~lq~~e~p-- 493 (639) T protein:vir:97 420 ---DPDLSDEAV-EAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAMYAPLLSSQLAGIEFP-- 493 (639) T ss_pred ---CCCCcHHHH-HHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHHHHhcCCcchhhhhhhccCccceecccC-- Confidence 222223443 478899999888899999853321110000 001111111 111110100 Q ss_pred ccccccccccccccc-----------ccccccccccccchh----hhhcc--hhhhhhheecccccEEEEEEecccCCcc Q lcl|NC_021537. 476 LTRSKAAPPLENKIG-----------ERDSVDVDVSKDPIE----QTTFS--SSNLDEGLYDFGERELYLSFKRESGQNS 538 (602) Q Consensus 476 ~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~m~----~~~v~--ss~~~~~~yd~~~~~l~~~f~~~~~~~~ 538 (602) +.+...+..+.+.. +.............. -..|. .-.+.....-++.+.+-++ .... T Consensus 494 -tp~~a~~~a~~~~~~de~~ga~~~~ePdte~~~~~~~a~~~~~~a~~v~a~~llv~RALelAGkRr~~~~-----~r~~ 567 (639) T protein:vir:97 494 -QPANAIESTREDEEDDEDSGARQQREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRFKVN-----DAAL 567 (639) T ss_pred -CCCCCCCCCCCCCCcccccCCCCCcCCCcccccCCccccCcCchhHHHHHHHHHHHHHHHhhcccccCCC-----Chhh Confidence 00111110000000 000000000000000 00000 0001111111111111110 0122 Q ss_pred eeeeccCCHHHHHHHh-CCCccchhhhhhhcccccccccccchhcccCCCCCChhhcCCcccccC Q lcl|NC_021537. 
539 LYVYVDVPAAVWSALV-SAPSAGSYHYSEIRLQYGYLEVTNNHERLPEGPTPDPGEAPEDVPSDI 602 (602) Q Consensus 539 ~y~y~~v~~~~~~~~~-~a~s~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 602 (602) -++|.+||++.|---| ..++ .-..+-|.+ ..-+- .+.......-|++.+---|-.-+ T Consensus 568 ~a~~r~vp~he~H~~l~Pv~~--~~~~rli~g---wd~~l--d~~~~a~lg~D~~~lr~~v~~~v 625 (639) T protein:vir:97 568 KTKLRDVPAHEYHRVLPPVRS--SEIPRLIAG---WDTAL--EDEVVASLGLDNEKLRNAVLATV 625 (639) T ss_pred HHHhhcCChhHceeecCCCCh--HHHHHHHHH---HHhHH--HHHHHHHhCCCHHHHHHHHHHHH Confidence 3555666655432111 0000 000000000 00000 00000001111111000000000 No 173 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=98.56 E-value=2.8e-07 Score=56.54 Aligned_cols=406 Identities=13% Similarity=0.070 Sum_probs=164.3 Q ss_pred CCCCcc-------------------------------cccccchhhhccc-Cc-cc--cCC-C----CHHHHHHHHh-hh Q lcl|NC_021537. 1 MSKAEE-------------------------------TTQLDERHIATDV-GR-GI--QPP-Y----NPETLAAFQE-LN 39 (602) Q Consensus 1 ~~k~~~-------------------------------~~~~~~~~~~~~~-~~-~i--~p~-~----~~~~l~~~~~-~~ 39 (602) |-.+-+ .+....+....-+ |. -| .+. + .....+...+ .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~ 80 (472) T protein:vir:93 1 MYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMIT 80 (472) T ss_pred CCCCCCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhcccccccccccccccc Confidence 000000 0000000000000 00 00 000 0 0001110001 14 Q ss_pred HHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCe Q lcl|NC_021537. 40 ETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWA 119 (602) Q Consensus 40 ~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna 119 (602) ++.+.+|+..+..+.+-|+.+... +.+..+.+..++. 
..+......+..+.+.+|.| T Consensus 81 n~~~~ivd~~~~~l~g~~~~~~~~--------d~~~~~~l~~~~~---------------n~~~~~~~~~~~~~~~~G~~ 137 (472) T protein:vir:93 81 NFHANLVDQKVSYIVGKPIAFKHT--------DDEVVKRIDEVLG---------------NRFDDKLHSVLTGASNKGIE 137 (472) T ss_pred chHHHHHHHHhhhhcccCeeeccC--------ChHHHHHHHHHHh---------------ccHHHHHHHHHHHHhhcCeE Confidence 788889999999998888776321 1112222222211 12456666778899999999 Q ss_pred EEEEeeCCCCceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccce Q lcl|NC_021537. 120 ALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGE 199 (602) Q Consensus 120 ~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 199 (602) |+.+..+.+|++ .+..++|..+.+..+.... ....-+..++...+.....++.....+ ......+. T Consensus 138 ~~~v~~d~d~~~-~i~~~~p~~~~~i~d~~~~--------~~~~~~ir~~~~~~~~~~~~~~~~~~~-----~~~~~~~~ 203 (472) T protein:vir:93 138 WLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEH--------EELEAFIRMYKLENETKVEYWDKVTVN-----YYVYENGS 203 (472) T ss_pred EEEEEECCCCce-EEEEEcccceEEEEcCCCC--------CceEEEEEEEEeecceeEEEEecCeEE-----EEEEecCe Confidence 999988888875 5778899888765332110 000011111111111111111000000 00000000 Q ss_pred EEec----------CceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccc Q lcl|NC_021537. 200 VASD----------AGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGG 269 (602) Q Consensus 200 ~~~~----------~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 269 (602) .... ......+..=.|++|+.. ..|.|.+......++....+..-..+.+...+.|-.+++ |. 
T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn-----~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~--g~ 276 (472) T protein:vir:93 204 LIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN-----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT--NY 276 (472) T ss_pred eeecccccccccccccccCCCCCcceEEecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEee--cC Confidence 0000 000111222236666542 368898888777777666665556666677777766654 32 Q ss_pred cCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCC Q lcl|NC_021537. 270 TLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVP 349 (602) Q Consensus 270 ~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVP 349 (602) . .+........+. ..+++.++.+. +.++ ++. +..+..+....+...+.|+..-++| T Consensus 277 ~--~~~~~~~~~~~~-------~~~~~~~~~~~------------~~~~--l~~-~~~~~~~~~~~~~l~~~i~~~s~~p 332 (472) T protein:vir:93 277 D--DQELPEFKRLLR-------YYGAIKVSDNG------------GVDT--IQV-EVPVENSKKYLDELYQKIMLFGQAV 332 (472) T ss_pred C--cccchhhHHHHh-------hccccccCCCC------------ccee--Eee-cCCHHHHHHHHHHHHHHHHHHhCCC Confidence 1 111111111111 11222222221 1222 111 1124566778888889999999998 Q ss_pred hHHhhccccCCccCHHHHH-------------HHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHH Q lcl|NC_021537. 350 PVLINVTSTSNRANSKEQT-------------REFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQD 416 (602) Q Consensus 350 p~~lg~~~~~~~sn~e~~~-------------~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d 416 (602) ..-.+..+ ++ .+.++.. ...+...|+-+++.+...++. .....+..+.|...... 
+ T Consensus 333 ~~~~~~~~-~n-~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~-----~~~~~~i~v~f~~~~p~----~ 401 (472) T protein:vir:93 333 DFSSDKFG-SA-PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-----KGEHKDVDISFNYNKVA----N 401 (472) T ss_pred CCCccccc-cC-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----CcccceeeEEeCCCCCC----C Confidence 65443221 22 1222211 111222333333333333321 12234455666543322 2 Q ss_pred HHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcc--ccccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 417 AKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRG--DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 417 ~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~--d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) ....++.+.++ .|+++..-+.+++++- ++... ++..... .............++......+.+ ++...+ T Consensus 402 ~~~~~~~~~k~--~giis~et~l~~l~~~--~d~~~E~~ri~~E~-~~~~~~~~~~~~~~~d~~~~~~~~-~~~~~e 472 (472) T protein:vir:93 402 TELQVQTAQQS--MGIVSHETVLENHPFV--EDLQAELERIEQEQ-MEYNKQLPNLDDGGADGAQQQERS-NNKESE 472 (472) T ss_pred HHHHHHHHHHH--hccCchHHHHHhCCCC--CCHHHHHHHHHHHH-HHHHHhccCcCcccCCCCCCCCCC-CcccCC Confidence 33345666665 5899988888888652 22111 1110000 000000000000000000000000 000000 No 174 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=98.55 E-value=3e-07 Score=56.36 Aligned_cols=390 Identities=11% Similarity=0.045 Sum_probs=167.5 Q ss_pred CCCCcccccccchhhhccc-CccccCCCCHHHHHHHHh-hhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHHH Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDV-GRGIQPPYNPETLAAFQE-LNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQT 78 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~-~~~i~p~~~~~~l~~~~~-~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~~ 78 (602) +..=+.--+...+++...- -..+.+-+ |..++...+ ...+...+|+.+++.+.=-||+. + +. . 
T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~-p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~~----~------d~----~ 65 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITI-PAHIRAKYQAVLGWAAKGVDSLADRLIFRAFAN----D------DF----N 65 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhc-cHHHHhHHHhhcchhHHHHHHhHhhhccccccC----C------Cc----h Confidence 1111100011111111100 00011111 233442222 24688889999988776666631 1 10 0 Q ss_pred HHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccccccchh Q lcl|NC_021537. 79 VRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEE 158 (602) Q Consensus 79 ~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~ 158 (602) +...+ ..-++......+..+.+++|.||+.+..+.+|.| .+.+++|..+....|..... + T Consensus 66 l~~i~--------------~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~-~i~~~sP~~~~~i~Dp~~~~-----~ 125 (410) T protein:vir:95 66 VTEIF--------------DRNNPDIFFDSAILSALIGSCSFVYISKGEDDEV-RLQVIESSNATGVIDPITGL-----L 125 (410) T ss_pred HHHHH--------------hhcChHHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEEeCCCCc-----e Confidence 11111 1124556677888999999999999988888875 68889998887655432111 1 Q ss_pred hhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHH----H Q lcl|NC_021537. 159 VENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDW----V 234 (602) Q Consensus 159 ~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl----~ 234 (602) .. ++.....+.........-..++.........+.+.. ...+..=.|+||.+....+..+|.|.+ . 
T Consensus 126 ~~------al~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~ 195 (410) T protein:vir:95 126 VE------GYAVLARDDYNRPTLEAYFEPNATHFIPKDGEPYSV----TNETGIPLLVPVIHRPDAVRPFGRSRITRAGM 195 (410) T ss_pred EE------EEEEEEecCCCeEEEEEEEeCCcEEEEeeCCccccc----cCCCCCcceEEecccccCCccCCccccchhHH Confidence 00 110111111111000000011111111111111100 011222357888766556677998854 3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccccccc Q lcl|NC_021537. 235 AAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDV 314 (602) Q Consensus 235 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~ 314 (602) .+.+.+.....-......|| +.|.-++. |-..+....+.++.. .++++.++...+ .. T Consensus 196 ~l~da~~r~~~~~~~~~e~~---a~pqr~i~--G~d~d~~~~~~~~~~---------~~~i~~~~~~~~---------~~ 252 (410) T protein:vir:95 196 YYQKYAKRTLERADITAEFY---SWPQKYIL--GLDPDAEPMEKWKAT---------VSSLLTISSSDK---------GV 252 (410) T ss_pred HHHHHHHHHHHHHHHHHHHh---cchhheee--ccCCCCCcCchhhhh---------hhhheeccCCCC---------CC Confidence 33343333333333344443 44544443 211111212222211 233444432211 01 Q ss_pred ccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHH---HHHHHHHHHHHHHHHHHHHhhh- Q lcl|NC_021537. 315 NIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQT---REFAKGIIEPEQAKFSARLYKI- 390 (602) Q Consensus 315 ~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~---~~f~~~~l~P~~~~ie~~ln~~- 390 (602) ..+|..+...+.. .|++.++..+..||+.=++|++.+|.... |-++.++.. ..+.. ...-..+.|...+.+. 
T Consensus 253 ~~~v~q~~~~~l~--~~~~~l~~l~~~~a~~s~lP~~~lg~~~~-NpsSa~Al~a~~~~L~~-ka~~k~~~fg~~l~~~~ 328 (410) T protein:vir:95 253 KPSVGQFTTASMS--PFTEQLRTAAAGFAGEMGLTLDDLGFVSD-NPSSVEAIKASHENLRL-AGRKAQRSLGAGLLNVA 328 (410) T ss_pred cceEEecCCCChH--HHHHHHHHHHHHHhhhcCCCHHHhccccC-chhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Confidence 2233333332221 38999999999999999999999986543 333433321 11111 1111112222222211 Q ss_pred -----cCCccc--cccceEEEeccchhcchhH-HHHHHHHHHHHHHhC--CcccHHHHHHHhCCCCCCCCcccccccccc Q lcl|NC_021537. 391 -----IHQDAL--DVDEWTIDFELRGAEQPEQ-DAKMAEQRVRAMRLA--GVGTVNEAREELDLAPFEDDRGDMTLSEFE 460 (602) Q Consensus 391 -----Ll~~~~--~~~~~~~~f~~~~~~~~~~-d~~~~~~~~~~~~~~--G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~ 460 (602) +..... ....+.++..+..+.++.. .....++++.+++.+ |+++..-+++++|+.+-+ ... ... T Consensus 329 rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~--~~~-~~~--- 402 (410) T protein:vir:95 329 YVAACLRDEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLTGIAGDM--SAK-PVV--- 402 (410) T ss_pred HHHHHHhcCCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhcCCChHH--HHH-HHH--- Confidence 101110 1112233333321111111 223366788888887 777777899999995421 100 000 Q ss_pred ccccccccCCC Q lcl|NC_021537. 461 AEFGADASDGD 471 (602) Q Consensus 461 ~~~~~~~~~~~ 471 (602) ..+.+.+. T Consensus 403 ---~e~~~~g~ 410 (410) T protein:vir:95 403 ---SEGGSNGE 410 (410) T ss_pred ---HHHHhCCC Confidence 00000000 No 175 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=98.55 E-value=3e-07 Score=56.36 Aligned_cols=413 Identities=11% Similarity=0.013 Sum_probs=171.0 Q ss_pred CCC---Cccc------ccccchhhhcccCccccCCCCHH-HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcc Q lcl|NC_021537. 
1 MSK---AEET------TQLDERHIATDVGRGIQPPYNPE-TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPD 70 (602) Q Consensus 1 ~~k---~~~~------~~~~~~~~~~~~~~~i~p~~~~~-~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~ 70 (602) +++ ...+ +.+. +++... ......+.... .-.++ ..++...+|+..+..+.+-|+.+...+++ T Consensus 30 i~~~i~~~~~~~~~~~~~l~-~Yy~g~-~~i~~~~~~~~~~~~ki--~~n~~~~Ivd~~~~~l~g~p~~~~~~~d~---- 101 (470) T protein:vir:99 30 LLGFIAYNETVLKPRYRENM-KLYLGK-HKILTAPEKETGADNRI--VVNSAKYVVDVYNGYFCGIEPKLALLNDS---- 101 (470) T ss_pred HHHHHHHHHHhhHHHHHHHH-HHhccc-cccccCcccccCCccee--ecchHHHHHHHHhhhhccCCeeEeeCCch---- Confidence 111 0001 0110 111100 01111111100 00111 24678889999999999888876432211 Q ss_pred cchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 71 EGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) .....+..++. ...+......+..+.+.+|.+|+.+..+.+|++ .+..++|..+.+..+... T Consensus 102 ---~~~~~l~~~~~--------------~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~-~i~~~~p~~~~~i~d~~~ 163 (470) T protein:vir:99 102 ---SKIDEIARWNR--------------QENFFDTINEISKQCDIFGRSIASIYQGEDARP-HLMYSSPNHAFIIYDDTV 163 (470) T ss_pred ---hHHHHHHHHHH--------------hcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeE-EEEEEccceeEEEEcCCC Confidence 11122222221 235567788899999999999999988888876 578889988876543321 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeeccccccccccee--eecccce-EEecCceeEEechhHEEEecCCCCCCCc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRF--VDKETGE-VASDAGELKNGPANELIFLPNPSPLALY 227 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~--~~~~~g~-~~~~~~~~~~~~~~eviH~r~~~~~~~~ 227 (602) ... ..-+..|+....+... ..+...+..+..+ .....+. +.........+..=.|++++.. . 
T Consensus 164 ~~~--------~~~~vr~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~ 228 (470) T protein:vir:99 164 QRQ--------PLAFVHYQIDNSNNWT--DAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFEN-----E 228 (470) T ss_pred Ccc--------eEEEEEEEEEecCCee--EEEEEEEecCeEEEEEecccccccccccccccCCCccceEeecCC-----C Confidence 100 0001111111111100 0000001000000 0000000 0000111112222235565532 3 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHH-HHHHHHHHHHhhcccccCcceeccCCcccee Q lcl|NC_021537. 228 YGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDS-KEDLRNLMDNLKGSRYRTAILEVEEFVDDHG 306 (602) Q Consensus 228 ~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~-~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~ 306 (602) .|+|.+..+...++....+.....+.+...+.|-.+++ |..+.++. -+.+.. +.. .+++.+... T Consensus 229 ~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~~g~~~~~-~~~-------~~~~~~~~~----- 293 (470) T protein:vir:99 229 ERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMI--GFKLPEDDEGNPKFD-FKN-------NRVLYVSQL----- 293 (470) T ss_pred CCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCcccccccchhhh-hhh-------cceeeecCC----- Confidence 68888888777777766666666666677777766664 32222221 111111 110 111111100 Q ss_pred ccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------------HHHHHH Q lcl|NC_021537. 307 LGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TREFAK 373 (602) Q Consensus 307 ~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~f~~ 373 (602) ....+. ++..++.. ..+..+....+...+.|+..-++|+...+... ++- +.++. ....+. 
T Consensus 294 --~~~~~~--~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~-Sg~Ai~~~~~~l~~k~~~~~~~~~ 366 (470) T protein:vir:99 294 --DPDTNP--QIGFIAKP-DADQMQENLIQHLTDFIFMMAMVPNIQDKNFA-GNS-SGVALQYKLFAMKNKADSKERKFD 366 (470) T ss_pred --CCCCCC--cceEEeec-CChHHHHHHHHHHHHHHHHHhCCccccccccc-cCc-hHHHHHHHHHHHHHHHHHHHHHHH Confidence 000111 12222211 12344566778889999999999975443221 222 22221 112223 Q ss_pred HHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccc Q lcl|NC_021537. 374 GIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGD 453 (602) Q Consensus 374 ~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d 453 (602) .+|+-+++.+...++..-... ....+..+.|...-.. +....++.+.++ .|+++...++++++.-. +..+.+ T Consensus 367 ~~l~~~~~li~~~~~~~~~~~-~~~~~i~v~f~~~~p~----~~~e~a~~~~kl--~giis~et~l~~l~~vd-~~~E~e 438 (470) T protein:vir:99 367 KSLMQLYRIVLATLFNNKQDQ-ELWSELDFKFTRNLPE----DMASAIDNAKNA--EGIVSKKTQLGMIPDIE-PDAEMK 438 (470) T ss_pred HHHHHHHHHHHHHHhccCCcc-cccccceEEeCCCCCc----CHHHHHHHHHHH--hccCCHHHHHHhCCCCC-HHHHHH Confidence 344444444444433322111 1223445666433221 334445666666 48999888888876521 111111 Q ss_pred ccccccccccc-ccccCCCcCccccccccccccccc Q lcl|NC_021537. 454 MTLSEFEAEFG-ADASDGDAEAMLTRSKAAPPLENK 488 (602) Q Consensus 454 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 488 (602) ++......... 
......+......++.+ +.+ T Consensus 439 ri~~E~~~~~~~~~~~~~~~d~~~~d~~~----ee~ 470 (470) T protein:vir:99 439 QIAKEKADAIKQTQQLSMPIDILKRDNNA----EEE 470 (470) T ss_pred HHHHHHHHHHHHHHhhcCCCCcCCCCCCc----cCC Confidence 11111000000 00000000111111000 000 No 176 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=98.54 E-value=3.2e-07 Score=56.16 Aligned_cols=417 Identities=11% Similarity=0.060 Sum_probs=164.1 Q ss_pred CCCCccccccc-chhhh-cccCc---cccCCCCH-----HHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcc Q lcl|NC_021537. 1 MSKAEETTQLD-ERHIA-TDVGR---GIQPPYNP-----ETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPD 70 (602) Q Consensus 1 ~~k~~~~~~~~-~~~~~-~~~~~---~i~p~~~~-----~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~ 70 (602) |++-. +.... .+... +..|. ...+.... ..-.++ .+++...+|+..+..+.+-|+.+... T Consensus 39 i~~~~-~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki--~~n~~~~ivd~~~~~l~g~~~~~~~~------- 108 (481) T protein:vir:10 39 ISRHQ-TEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRA--VHNYAKYVSRFIVGYLTGNPITITHQ------- 108 (481) T ss_pred HHHHH-HHHHHHHHHHHHHhcCCCcccccCcccccccccccccee--ecchHHHHHHHHHhhhccCCceEecC------- Confidence 22211 11100 00100 00111 00110000 000112 24677889999999998888766432 Q ss_pred cchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 71 EGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) +....+.+..++.. ..+..+...+..+.+++|.+|+.+.++.+|++ .+..++|..+.+..+... 
T Consensus 109 -d~~~~~~l~~~~~~--------------n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~-~i~~~~p~~~~~v~d~~~ 172 (481) T protein:vir:10 109 -DNQTNDKIIELNDL--------------NDADEVNSDLALNLSIYGRAYEIVYRDFEDRD-TFKVLDPKSTFVVYDQTL 172 (481) T ss_pred -ChhHHHHHHHHHHh--------------cChhHHHHHHHHHHHhcCeEEEEEEeCCCCeE-EEEEEcccceEEEEcCCC Confidence 11222233332221 24567888899999999999999989888876 578889988865433211 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeeccccccc-ccceeeecccceEEecCceeEEechhHEEEecCCCCCCCccc Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYG-DDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYG 229 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~-~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G 229 (602) .. ...-+..++........... ....|. ..........+.+.........+..=.|+|++.. .+| T Consensus 173 ~~--------~~~~~i~~~~~~~~~~~~~~-~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g 238 (481) T protein:vir:10 173 DK--------KVVAGVRYFEKQDKDKVPVQ-HVEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPIIEYLND-----QFK 238 (481) T ss_pred CC--------ceEEEEEEEEEeeCCCceEE-EEEEEecCeEEEEEecCCceeecccccccCCceeEEEeecC-----CCC Confidence 00 00001111111111111100 000010 0000000011111000000111122236776542 368 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccc Q lcl|NC_021537. 230 VPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGD 309 (602) Q Consensus 230 ~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~ 309 (602) .|.+..+...++.......-..+.+...+.|-.+++-. ...+++....++. ++.+.+..+...... 
T Consensus 239 ~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~-~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~-- 304 (481) T protein:vir:10 239 QGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGN-VDLDSEDAKAFRD-----------ANMIHLEPGTNANGS-- 304 (481) T ss_pred CCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecC-cCCCccchhhhhh-----------ccceeccccccccCC-- Confidence 88777666666554444344444455555665555421 1123322222221 011111111111000 Q ss_pred cccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHH-------------HHHHHHH Q lcl|NC_021537. 310 GGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTR-------------EFAKGII 376 (602) Q Consensus 310 ~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~-------------~f~~~~l 376 (602) ....++++ ++.. ..+..+.+..+...+.|...-++|....+..+ ++- +.++... ..+..++ T Consensus 305 -~~~~~~~~--l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~-Sg~Al~~~~~~l~~k~~~~~~~~~~~l 378 (481) T protein:vir:10 305 -EGKAEVKY--VYKQ-YDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFS-GVQ-SGESMKYKLFGLEQVRAIKERLFKKGL 378 (481) T ss_pred -CCCcceeE--Eeec-CCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccc-HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01112222 1111 12355677888888999999999976655332 222 2222111 1111222 Q ss_pred HHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCc--ccc Q lcl|NC_021537. 377 EPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDR--GDM 454 (602) Q Consensus 377 ~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~--~d~ 454 (602) +-+++.+...++..-... .......+.|...-. . +....++.+.++ .|+++...+.+++++ +++.. 
.++ T Consensus 379 ~~~~~li~~~~~~~~~~~-~~~~~i~v~f~~~~~--~--~~~~~a~~~~kl--~g~is~et~~~~l~~--i~d~~~E~~r 449 (481) T protein:vir:10 379 MKRYKLLLNNVNLTGLKQ-HNYAELTITFTPNLP--K--SMMESINAFNAL--SGGVSESTRLSLLDF--IDNPKEELEK 449 (481) T ss_pred HHHHHHHHHHHhccCCCc-cccceeeEEeCCCCC--c--CHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHH Confidence 222222222222211111 112234556643322 1 334455667666 488998888888765 22211 111 Q ss_pred ccccccccccccccCCCcCcccccccccccccccc Q lcl|NC_021537. 455 TLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKI 489 (602) Q Consensus 455 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (602) ....... ..........+... ....+..+.+. T Consensus 450 i~~E~~~-~~~~~~~~~~~~~~--~~~~~~dd~~g 481 (481) T protein:vir:10 450 MQEEEAQ-REKQADKRGYGEAF--ENHLNVDDSNG 481 (481) T ss_pred HHHHHHH-HHhhhhhccCCccC--CCCCCCCCCCC Confidence 1000000 00000000000000 00000000000 No 177 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=98.53 E-value=3.5e-07 Score=55.99 Aligned_cols=410 Identities=12% Similarity=-0.013 Sum_probs=162.0 Q ss_pred CCCCcc---cccccchhhhcccCc--cc--cCC-------CCHH-HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021537. 1 MSKAEE---TTQLDERHIATDVGR--GI--QPP-------YNPE-TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPS 65 (602) Q Consensus 1 ~~k~~~---~~~~~~~~~~~~~~~--~i--~p~-------~~~~-~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~ 65 (602) ++|--+ ......+....-+.+ -| .+. .+.. .-.++ ..++...+|+..+..+.+-|+.+... 
T Consensus 32 i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki--~~n~~~~Ivd~~~~~l~g~p~~~~~~-- 107 (474) T protein:vir:95 32 IIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRI--TTNFHQNLVDQKVSYVASKPVTYSCE-- 107 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccCchhcccccccccccccccccccee--ccchHHHHHHHHHhhhccCCceeccC-- Confidence 000000 000000000000000 00 000 0000 00111 24677889999999999988876321 Q ss_pred CCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccc Q lcl|NC_021537. 66 ADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVR 145 (602) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~ 145 (602) +.+..+.+..++. ..+......+..+.+.+|.||+.+.++.+|++ .+..++|..+-+. T Consensus 108 ------d~~~~~~l~~~~~---------------n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v 165 (474) T protein:vir:95 108 ------DESVLKIIHDVLD---------------TRWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPAEQAIPI 165 (474) T ss_pred ------chHHHHHHHHHHh---------------ccHHHHHHHHHHHHhhcCcEEEEEEecCCCce-EEEEEcccceEEE Confidence 1112222222111 13455667778899999999999888888875 5777888777644 Q ss_pred ccccc-cccccchhhhhcccCceeEEEEcCCcceeecccc--cccccceeeecccceEEecCceeEEechhHEEEecCCC Q lcl|NC_021537. 146 KTTTT-IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGD--RYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPS 222 (602) Q Consensus 146 ~~~~~-~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~--~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~ 222 (602) .+... ........ .-...+..++.+......+.+.... ..+.... ...+ .........+..=.|++++.. 
T Consensus 166 ~d~~~~~~~~~~i~-~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~---~~~~--~~~~~~~~~~g~iPvv~~~nn- 238 (474) T protein:vir:95 166 WVDKEREELKSFIR-YYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYY---GANH--IQSHFSNGNWGRVPFIAFKNN- 238 (474) T ss_pred EcCCCCCceEEEEE-EEEEcCeeEEEEEeCCeEEEEEEcCCcccccccc---Cccc--ccccccccCCCccceEeecCC- Confidence 32210 00000000 0000111111111111111111000 0000000 0000 000000111222236666543 Q ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCc Q lcl|NC_021537. 223 PLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFV 302 (602) Q Consensus 223 ~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~ 302 (602) ..|.|.+..+...++....+..-..+.+.....|-.+++ |....+ .+.+.... ...+++.++++. T Consensus 239 ----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--g~~~~~--~~~~~~~~-------~~~~~i~~~~~~ 303 (474) T protein:vir:95 239 ----PEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILK--GYEGQD--LEEFMRGL-------KYYKAINVDGDG 303 (474) T ss_pred ----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCccc--chhhhhhh-------hccceeeccCCC Confidence 358888888777777666555555556666666755554 322221 11111111 123334333332 Q ss_pred cceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------------HH Q lcl|NC_021537. 303 DDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TR 369 (602) Q Consensus 303 ~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~ 369 (602) +. ++ ++.. .....+....+...+.|+..-++|....+-. .++- +..+. .. 
T Consensus 304 ~~------------~~--l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~-Sg~Alk~~~~~l~~k~~~k~ 366 (474) T protein:vir:95 304 GV------------ET--IQVE-VPVSSTKEYIDLMRAYIMEFGQGVDFQTDKF-GSAP-SGIALKFLYGNLDLKANKLK 366 (474) T ss_pred ce------------eE--Eeec-CCHHHHHHHHHHHHHHHHHHhCCcccccccc-cccc-hHHHHHHHHHHHHHHHHHHH Confidence 21 11 1111 1234566777888899999999985322211 1221 21211 11 Q ss_pred HHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCC Q lcl|NC_021537. 370 EFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFED 449 (602) Q Consensus 370 ~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~ 449 (602) ..+...|+-+++.|...+.. ........+.|+..... +.. +.++.++++|+++...+.+++++ +++ T Consensus 367 ~~~~~~l~~~~~li~~~~g~-----~~d~~~i~v~f~~~~p~----d~~---e~a~~~~~~g~iS~et~i~~l~~--v~d 432 (474) T protein:vir:95 367 NKATVAIQELIGFIIDFNNL-----KMDVKDIEISFNFNRMM----NDA---EQSQIIAQSQYLSRETLVKSSPL--VDD 432 (474) T ss_pred HHHHHHHHHHHHHHHHHhCC-----CcccceeeEEeccCCCc----CHH---HHHHHHHhcCCCchHHHHHhCCC--CCC Confidence 22233444444444333321 12334456667544322 222 22345667899999888888765 222 Q ss_pred Ccc--ccccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 450 DRG--DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 450 g~~--d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) ... ++..-... ....+.......+.......++..+.. 
.+ T Consensus 433 ~~~E~~ri~~E~~-~~~~~~~~~~~~~~d~~~~~~~~~~~~-~~ 474 (474) T protein:vir:95 433 YKAELERIEQEQM-EYNKQLPNLDDGGADGAQQQERSNDKE-SE 474 (474) T ss_pred HHHHHHHHHHHHH-HHHhcccccccccCCCCcCCCCCccCC-CC Confidence 211 11110000 000000000000000000000000000 00 No 178 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=98.49 E-value=4.7e-07 Score=55.29 Aligned_cols=401 Identities=10% Similarity=0.030 Sum_probs=168.8 Q ss_pred CCCCcccccccch---hhhcccCccc-cCCC-CHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhh Q lcl|NC_021537. 1 MSKAEETTQLDER---HIATDVGRGI-QPPY-NPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGES 75 (602) Q Consensus 1 ~~k~~~~~~~~~~---~~~~~~~~~i-~p~~-~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~ 75 (602) +.+.=++.....+ ++...-...+ .++. ......++. +++.+.+|+..+..+.+-|..+... +... T Consensus 9 ~i~~~~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~--~n~~~~ivd~~~~~l~g~~~~~~~~--------~~~~ 78 (429) T protein:vir:98 9 LIQKHRSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDNRLV--VNFAKYIVDTFNGYFIGVPVQTSHE--------NKQV 78 (429) T ss_pred HHHHHHHHHHHHHHHHHHhccccccccccccccCCCcceee--cchHHHHHHHHhhhhcccCceeecC--------ChHH Confidence 1000000100110 0110000111 1111 111112222 4688889999999999988776421 1111 Q ss_pred HHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccccccc Q lcl|NC_021537. 76 YQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIERED 155 (602) Q Consensus 76 ~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~ 155 (602) .+.+..++. ...+......+.++.+.+|.+|+.+..+.+|++ .+..++|..+.+.-+.... 
T Consensus 79 ~~~l~~~~~--------------~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~~~~~~p~~~~~v~dd~~~---- 139 (429) T protein:vir:98 79 SNYLELLDG--------------YNDQDDNNAELSKICSIYGHGYELVFNDENAEA-GITYLTPLEAFIVYDDSIR---- 139 (429) T ss_pred HHHHHHHHh--------------hcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcE-EEEEEcccceEEEEeCCCC---- Confidence 222222221 123456778888999999999999999888875 5778888888654332110 Q ss_pred chhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHH Q lcl|NC_021537. 156 GEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVA 235 (602) Q Consensus 156 ~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~ 235 (602) .... -+..|+...++.. ...+.. ...........+.+.........+..=.|+|++.. ..|.|.+.. T Consensus 140 ~~~~----~~i~~~~~~~~~~--~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~sd~e~ 206 (429) T protein:vir:98 140 QKPL----FAVRYFYNKGGVL--EGSYSD--ASNITYFKDGEKGIEIGESEPHPFDGVPMIEYVEN-----EERQSLLAS 206 (429) T ss_pred CceE----EEEEEEEecCceE--EEEEEe--CceEEEEEecCCceEecccccccCCccceEEecCC-----CCCCCcHHH Confidence 0000 0011111111100 000000 00000000000101111111111222246777643 368898888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccc Q lcl|NC_021537. 236 AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVN 315 (602) Q Consensus 236 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~ 315 (602) +...++....+..-..+.+...+.|-.+++ |...+++....++. 
.+++.+..+ ++.+.+ T Consensus 207 v~~liD~~d~~~s~~~~~~~~~~~p~~~i~--g~~~~~~~~~~~~~-----------~~~~~~~~~--------~~~~~~ 265 (429) T protein:vir:98 207 VVTLINAFNKAISEKANDVEYFADAYLKIL--GAELDDETLKSLRD-----------TRIINLKDT--------DAQQLT 265 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCCCcchhhhHhh-----------CceeeccCC--------CCCCcc Confidence 777777766666666666677777766664 43344433322211 122222211 011122 Q ss_pred cccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHH-------------HHHHHHHHHHHHHH Q lcl|NC_021537. 316 IELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQT-------------REFAKGIIEPEQAK 382 (602) Q Consensus 316 ~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~-------------~~f~~~~l~P~~~~ 382 (602) .++ ++... .+..+....+...+.|+..-++|..-. ...+| ++.++.. +..+...|+-+++. T Consensus 266 ~~~--l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~--~~~gn-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~l 339 (429) T protein:vir:98 266 VEF--LQKPD-ADATQEHLLDRLENLIFRTAMVANISD--ESFGT-ASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKL 339 (429) T ss_pred eeE--EeecC-CHHHHHHHHHHHHHHHHHHhCccccCc--ccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222 21111 133456677888899999989885322 22222 1222211 11222333333333 Q ss_pred HHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccc--ccccccc Q lcl|NC_021537. 383 FSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGD--MTLSEFE 460 (602) Q Consensus 383 ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d--~~~~~~~ 460 (602) +...++..-. .....+..++|...... +....++.+.++ +|+++..-+.++++.- ++...+ +.--... T Consensus 340 i~~~~~~~~~--~~d~~~i~v~f~~~~p~----~~~~~a~~~~kl--~g~is~et~~~~l~~v--~d~~~E~~ri~~E~~ 409 (429) T protein:vir:98 340 IASYPTSKIG--PKDWIGIKYKFTRNLPA----NLLEESQIAGNL--AGIVSEETQVGVLSIV--ENPQKEIERKNSDKS 409 (429) T ss_pred HHHHhccCCC--ccccccceEEeCCCCCc----CHHHHHHHHHHH--hccCchHHHHHhCCCC--CCHHHHHHHHHHHHH Confidence 3333332111 11223445666533221 333345666665 6899987788888652 222111 1100000 Q ss_pred ccccccccCCCcCccccccccc Q lcl|NC_021537. 
461 AEFGADASDGDAEAMLTRSKAA 482 (602) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~ 482 (602) . ..+.+.+...+..++...+ T Consensus 410 ~--~~~~~~~~~~~~~~~~~~~ 429 (429) T protein:vir:98 410 T--LISRQAGGLNGQNTTTILE 429 (429) T ss_pred H--HHHHHHhhhcCCCCCCCCC Confidence 0 0001111111111111111 No 179 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=98.48 E-value=5e-07 Score=55.14 Aligned_cols=408 Identities=10% Similarity=0.021 Sum_probs=169.7 Q ss_pred CCCCc-cccccc--chhhhcccCccccCCCCH-HHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhH Q lcl|NC_021537. 1 MSKAE-ETTQLD--ERHIATDVGRGIQPPYNP-ETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESY 76 (602) Q Consensus 1 ~~k~~-~~~~~~--~~~~~~~~~~~i~p~~~~-~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~ 76 (602) +.|=. +...+. .+++...-.-.-.++.+. ....++. .++...+|+..+..+.+-|+.+... +.+.. T Consensus 26 i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~--~n~~~~ivd~~~~~l~g~~~~~~~~--------d~~~~ 95 (453) T protein:vir:39 26 MEKHRLEVARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLT--VNFTKYIVDTFTGYFNGIPVKKSHS--------DKETL 95 (453) T ss_pred HHHHHHHHHHHHHHHHHhhccCchhcCCCccccCccceee--cchHHHHHHHHhhhhcccCceeccC--------ChHHH Confidence 11100 000000 011111000001111111 1122332 4688889999999998888776421 11122 Q ss_pred HHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccc-cccc Q lcl|NC_021537. 77 QTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTI-ERED 155 (602) Q Consensus 77 ~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~-~~~~ 155 (602) +.+..++.. 
-.+......+..+.+.+|.||+.+.++.+|.+ .+..++|..+.+..+...- ...- T Consensus 96 ~~l~~i~~~--------------N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~ 160 (453) T protein:vir:39 96 SKLQEFDNL--------------NDMEDEESELAKMACIYGRAFELLYQNEETQT-NVIYNTPENMFMVYDDTIKQEPLF 160 (453) T ss_pred HHHHHHHHh--------------cChhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEecCCCCCeEEE Confidence 233332221 23456777888999999999999999988875 5677888888654332110 0000 Q ss_pred chhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHH Q lcl|NC_021537. 156 GEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVA 235 (602) Q Consensus 156 ~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~ 235 (602) ............|+.+......+.+ ....+.+.........+..=.|++|+.. ..|+|.++. T Consensus 161 ~ir~~~~~~~~~~~~~yt~~~i~~~-------------~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~sd~e~ 222 (453) T protein:vir:39 161 AVRYGYDDDYKLYGEVYTKETTYAL-------------NGTMGFYNMTEQAPNPFDDLPVVEFYFN-----EERMSIFES 222 (453) T ss_pred EEEEEEeCCeEEEEEEEeCCeEEEE-------------EecCCceeeecccccCCCceeEEEecCC-----CCCCcchhh Confidence 0000000111111111111111111 1111111100001111222236666542 368888887 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccc Q lcl|NC_021537. 236 AMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVN 315 (602) Q Consensus 236 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~ 315 (602) ....++....+..-..+.+...+.|-.+++ |..++++....++. .+++.+.++.. 
...+.+ T Consensus 223 v~~liDa~~~~~s~~~~~~~~~~~p~~~~~--g~~~~~~~~~~~~~-----------~~~~~~~~~~~------~~~~~~ 283 (453) T protein:vir:39 223 VISLVNAFNKAISEKANDVDYFSDQYLTFL--GAAVEEEDLKNIRS-----------NRVINYYGESS------EAKNVD 283 (453) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhCceeeee--cCCCCchhhhhhhh-----------cceeeecCCCC------CCCCCc Confidence 776666655555555555566666766654 43445444433221 11111111110 001112 Q ss_pred cccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------------HHHHHHHHHHHHHHH Q lcl|NC_021537. 316 IELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TREFAKGIIEPEQAK 382 (602) Q Consensus 316 ~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~f~~~~l~P~~~~ 382 (602) +++ ++. +..+..+....+...+.|+..-++|..-.+.. ++- +.++. .+..+..+|+.+++. T Consensus 284 ~~~--lt~-~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--gn~-Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~l 357 (453) T protein:vir:39 284 VKF--LEK-PDSDSQTENLLDRLTKLIFQTTMVANISDESF--GSS-SGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKL 357 (453) T ss_pred eeE--Eee-cCCHHHHHHHHHHHHHHHHHHhCCcccccccc--cCC-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222 221 11234566677788888888888874322211 221 11211 112233445555554 Q ss_pred HHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccc--ccccccc Q lcl|NC_021537. 383 FSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGD--MTLSEFE 460 (602) Q Consensus 383 ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d--~~~~~~~ 460 (602) +...++..- ......+..+.|...... +....++++.++ +|+++..-+.++++.- ++...+ +.....- T Consensus 358 i~~~~~~~~--~~~~~~~i~v~f~~~~p~----~~~~~a~~~~kl--~g~is~et~l~~l~~v--~D~~~E~~ri~~E~~ 427 (453) T protein:vir:39 358 YCELSTNVS--NKEAWKDIEYTFTRNEPK----DIKEQAETANIL--MGITSQETALSVISVI--PDVQAEMEKIKKEEA 427 (453) T ss_pred HHHHHhccC--CccccccceEEeCCCCCc----CHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHH Confidence 444433221 111223445667543322 334445666666 6889998888888652 222111 1111000 Q ss_pred ccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 
461 AEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) . .....+....+....+.... ....| T Consensus 428 ~-~~~~~~~~~~~~~~~~~~~~----~~~~e 453 (453) T protein:vir:39 428 S-TAIFDKDKQPSEKGTDTVVP----ETNEE 453 (453) T ss_pred H-HHHHHHhccCCCCCCCCCCC----CcCCC Confidence 0 00000000000000000000 00000 No 180 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.48 E-value=5e-07 Score=55.12 Aligned_cols=416 Identities=11% Similarity=0.012 Sum_probs=170.9 Q ss_pred CCCCcccccccch---hhhcccCccc-cCCC--CHH-HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccch Q lcl|NC_021537. 1 MSKAEETTQLDER---HIATDVGRGI-QPPY--NPE-TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGG 73 (602) Q Consensus 1 ~~k~~~~~~~~~~---~~~~~~~~~i-~p~~--~~~-~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~ 73 (602) |++--.......+ ++...-...+ .+.. +.. .-.++ .+++...+|+..+..+.+-|.++...+. .+. T Consensus 2 ~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki--~~n~~~~ivd~~~~~l~g~~~~~~~~~~-----~~~ 74 (440) T protein:vir:95 2 LAAFLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRV--RHKWGGYISSFATGYVIGNPVSIGVMEG-----GSA 74 (440) T ss_pred hhhHHHHHHHHHHHHHHHhccCCcccccccccccccCCccee--ecchHHHHHHhhhhheeccCceEeeCCC-----ccH Confidence 1111111110111 1110000001 1100 000 00112 2467778888888888888877643211 112 Q ss_pred hhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccccc Q lcl|NC_021537. 74 ESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIER 153 (602) Q Consensus 74 ~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~~ 153 (602) +....+..++. ..........+.++.+++|.+|+.+..+.+|++ .+..++|..+.+..+...... 
T Consensus 75 ~~~~~l~~~~~--------------~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~ 139 (440) T protein:vir:95 75 DQLSTIKDIEW--------------QNDINALNSDLAFDASVYGRAYEYHFRDKDKVD-RVVLISPLEMFVIRDLTVEQN 139 (440) T ss_pred HHHHHHHHHHH--------------hcCHhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCc Confidence 22222322222 124566777888999999999999999888876 477788888876543321100 Q ss_pred ccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHH Q lcl|NC_021537. 154 EDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDW 233 (602) Q Consensus 154 ~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl 233 (602) ...-..........++.+......+.+... ...++.+.........+..=.|+|+++. ..|.|.+ T Consensus 140 ~~~~i~~~~~~~~~~~~vyt~~~~~~~~~~----------~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~sd~ 204 (440) T protein:vir:95 140 IIAAVHLPIYADKVNMTVYTKDKVITYKPY----------SNNSVRLVVDDVKKHSYNDVPVVEWWNN-----RFRMGDY 204 (440) T ss_pred eEEEEEEEEecCceEEEEEeCCeEEEEEEe----------cCCccceeecceeeccCceeeEEEeeCC-----CCCCCch Confidence 000000000011111221111111111000 0011111111111122222346777643 2578888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecc--ccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccc Q lcl|NC_021537. 234 VAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTG--GTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGG 311 (602) Q Consensus 234 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~ 311 (602) +.....++....+.....+.....+.|-.+++-.. ...+++....+++... +.+. .+.... 
..+ T Consensus 205 e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~----------~~~~-~~~~~~---~~~ 270 (440) T protein:vir:95 205 ESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANM----------LFLK-TGISTT---GQQ 270 (440) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhccc----------eecc-cccccc---cCC Confidence 77776666655555555555566666766654321 2224444443332111 1110 000000 000 Q ss_pred cccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------------HHHHHHHHHHH Q lcl|NC_021537. 312 SDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TREFAKGIIEP 378 (602) Q Consensus 312 ~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~f~~~~l~P 378 (602) .+.++++ ++... .+..+....+...+.|+..-++|..-.+... ++- +.++. .+..+...++. T Consensus 271 ~~~~~~~--lt~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~ 345 (440) T protein:vir:95 271 TTADASY--IYKQY-DVNGTEAYKNRLANDIHRFSRIPNLDDDRFN-STS-SGIALLYKMIGLEQVRKDKETYFTKALRR 345 (440) T ss_pred CCcceeE--EeecC-CHHHHHHHHHHHHHHHHHHhCCccccccccc-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1112222 21111 1344667788888999999999875443221 221 22211 11223344555 Q ss_pred HHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccccc Q lcl|NC_021537. 379 EQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSE 458 (602) Q Consensus 379 ~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~ 458 (602) +++.+...++..-.. ........++|...... +....++.+.++ .|+++..-+.++++.-..+.. ..+.... 
T Consensus 346 ~~~li~~~~~~~~~~-~~~~~~v~i~f~~~~p~----~~~~~ad~~~kl--~g~iS~et~~~~l~~~d~~~E-~~ri~~E 417 (440) T protein:vir:95 346 RYELISNIHKAINGP-VIEANKLTFTFHPNIPQ----DVWTEIKAYIEA--GGEISQETLMENASFTDYKTE-HSRILKQ 417 (440) T ss_pred HHHHHHHHHhhcCCc-ccccccceEEeCCCCCC----CHHHHHHHHHHH--hccCcHHHHHHhCCCCCcHHH-HHHHHHH Confidence 555444444322111 11223445666543322 334455667776 688998777777764211100 0111000 Q ss_pred cccccc-ccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 459 FEAEFG-ADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 459 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) ...... .....++..+. ....| T Consensus 418 ~~~~~~~~~~~~~~~~~~-----------~~~~e 440 (440) T protein:vir:95 418 GGSSDLEIGQIVGDADVG-----------QADTE 440 (440) T ss_pred HHHhhhhHHhhccCCCCC-----------CcCCC Confidence 000000 00000000000 00000 No 181 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.44 E-value=6.6e-07 Score=54.48 Aligned_cols=430 Identities=11% Similarity=0.043 Sum_probs=172.2 Q ss_pred CCCCc------ccccccc-----hhhh-cccCccccCCCC----HHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021537. 1 MSKAE------ETTQLDE-----RHIA-TDVGRGIQPPYN----PETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHP 64 (602) Q Consensus 1 ~~k~~------~~~~~~~-----~~~~-~~~~~~i~p~~~----~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~ 64 (602) ++++- .+.+... +.+. .....+-.+... ...-+.+ .......+++..|+-+.+-|..+.-. T Consensus 22 ~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~--s~n~~~~iv~~~a~~l~~ep~~i~~~- 98 (499) T protein:vir:80 22 LKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQL--SMNLPKVTAKYMSKLLFNEKVKINID- 98 (499) T ss_pred hhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCcccccee--ecchHHHHHHHHHHhhhCCcceEeeC- Confidence 22111 0111111 1111 111111111000 0000111 12456778888898888876665321 Q ss_pred CCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccc Q lcl|NC_021537. 
65 SADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRV 144 (602) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~ 144 (602) +....+.+...+ ....+...++.++.+.+..|.+|+.+..|.+|++ .+.+++|..+-+ T Consensus 99 -------d~~~~e~l~~~~--------------~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i~~v~a~~~~P 156 (499) T protein:vir:80 99 -------DETAEEFVLNVL--------------KTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KVSFATADCMYP 156 (499) T ss_pred -------CHHHHHHHHHHH--------------hhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcE-EEEEEcCCceEE Confidence 111222222211 1223566777788888999999999999988774 578899988776 Q ss_pred cccccccccccchhhhhcc-cCceeEEEEc----CC-cc-eee---cccccccccc-eeeecccceEEecCceeEE---e Q lcl|NC_021537. 145 RKTTTTIEREDGEEVENIE-SGHGYVQVRQ----GR-RR-YFG---EAGDRYGDDK-RFVDKETGEVASDAGELKN---G 210 (602) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~-~~~~~~qi~~----~~-~~-~~~---~~~~~~~~~~-~~~~~~~g~~~~~~~~~~~---~ 210 (602) .....+. ......+.... .+..|.++-. +. .+ +.- .+........ .-+...+ +......... + T Consensus 157 i~~d~~~-~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~--~~~~~~~~~~~~~~ 233 (499) T protein:vir:80 157 LSNDSEN-VDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKL--LFNDIEPVVPLPSL 233 (499) T ss_pred EEecCCC-eEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhh--hccCcCCceeecCC Confidence 4332221 00000000001 1111111100 00 00 000 0000000000 0000000 0000000111 1 Q ss_pred chhHEEEecCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh Q lcl|NC_021537. 211 PANELIFLPNPSP----LALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL 286 (602) Q Consensus 211 ~~~eviH~r~~~~----~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~ 286 (602) ..--+.||+.+-+ .+.++|+|.+..+...++.....-.-..+-|..|.. ..++ +...+ +.. .+. 
T Consensus 234 ~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~-~i~v--~~~~l-----~~~----~~~ 301 (499) T protein:vir:80 234 TRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKK-KVLV--PSSFV-----KTA----VNL 301 (499) T ss_pred CccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhccc-ceec--chhhh-----hcc----CCC Confidence 2223667765422 245689999999988887766655555555665432 2222 11110 000 000 Q ss_pred hcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHH Q lcl|NC_021537. 287 KGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKE 366 (602) Q Consensus 287 ~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~ 366 (602) .| .+ .......-..+........+.+-.++.++... .+-++.+..+...++|...-|+++..+|...++.. |+.+ T Consensus 302 ~g-~~--~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i-r~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~-TAte 376 (499) T protein:vir:80 302 DG-ST--TQYFDSTDEAFFLYQGEQDDNGKAIKDISVEI-RSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLK-TATE 376 (499) T ss_pred CC-Cc--ccCCCcccceeeEeeccCCCCcCceeEecCcC-ChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccch-hHHH Confidence 00 00 00000000011111111111111122222122 23456677788889999999999999997655432 3332 Q ss_pred HH-------------HHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcc Q lcl|NC_021537. 367 QT-------------REFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVG 433 (602) Q Consensus 367 ~~-------------~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~ 433 (602) .. +..++.+|..++..+....+...+..........+.+++++-... 
|.+..++...+++.+|+| T Consensus 377 i~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~--d~~~~~~~~~~~~~~Gi~ 454 (499) T protein:vir:80 377 VVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQ--DEDTTINRYTTAKNQGMI 454 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCC--CHHHHHHHHHHHHHcCCC Confidence 21 112233344444443333222222222222333455444443333 333444567788999999 Q ss_pred cHHHHHHHh-CCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccccc Q lcl|NC_021537. 434 TVNEAREEL-DLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERD 493 (602) Q Consensus 434 T~NE~R~~~-Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (602) +...++... |. ++++.+..+.........+...++..+. .++.. T Consensus 455 S~et~l~~~~~~---~d~ea~~el~~i~~E~~~~~~~~d~~g~-------------~ge~e 499 (499) T protein:vir:80 455 PLKIALQRAWNI---TEAEADEWAEMLAKEKQAEIPNNDMTGI-------------FGEEE 499 (499) T ss_pred CHHHHHhhcCCC---ChHHHHHHHHHHHHHhhcCCCCCCcccc-------------CCCCC Confidence 999887654 43 2222221111110000000000000000 00000 No 182 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=98.41 E-value=8e-07 Score=54.02 Aligned_cols=407 Identities=11% Similarity=0.041 Sum_probs=170.2 Q ss_pred CCCCcccc-------------cc-cch---hhhcccCccc-cCCCCHH-HHHHHHhhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_021537. 1 MSKAEETT-------------QL-DER---HIATDVGRGI-QPPYNPE-TLAAFQELNETHQACIRKKSRYEAGYGFEIV 61 (602) Q Consensus 1 ~~k~~~~~-------------~~-~~~---~~~~~~~~~i-~p~~~~~-~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~ 61 (602) +.|..+.+ .+ ..+ .+...--... .++.+.. ...++. .++...+|+..+..+.+-|+.+. 
T Consensus 11 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~--~n~~~~ivd~~~~~l~g~~~~~~ 88 (452) T protein:vir:36 11 FSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLA--VNFTKYIVDTFTGYFNGIPVKKS 88 (452) T ss_pred cCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCccccccCccceee--cchHHHHHHHHhhhhcccCceee Confidence 11000000 00 000 1110000011 1111111 112222 46788889999999999887764 Q ss_pred EecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccc Q lcl|NC_021537. 62 AHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAAT 141 (602) Q Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~ 141 (602) ..+ ....+.+..++. .-.+......+.++.+.+|.+|..+.++.+|.+ .+..++|.. T Consensus 89 ~~d--------~~~~~~l~~~~~--------------~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~ 145 (452) T protein:vir:36 89 HSD--------KEILTKLQEFDN--------------LNDMEDEESELAKMACIYGRAFEFLYQDEDTQT-NVVYNSPEN 145 (452) T ss_pred cCC--------hhHHHHHHHHHh--------------hcChhHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccc Confidence 211 111222222221 123556777888999999999999988888875 577888888 Q ss_pred ccccccccc-cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecC Q lcl|NC_021537. 142 VRVRKTTTT-IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPN 220 (602) Q Consensus 142 v~~~~~~~~-~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~ 220 (602) +.+..+... ....-.........+..++.+......+.+.. ..+.+....+....+..=.|++++. 
T Consensus 146 ~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~-------------~~~~~~~~~~~~~~~g~iPvv~~~n 212 (452) T protein:vir:36 146 MFMVYDDTVKQEPLFAVRYGVDEDKKLQGEVYTLLETIKISG-------------ENDEISFGEGTYNPYPDLPVVEFYF 212 (452) T ss_pred eEEEEcCCCCCceEEEEEEEEecCceEEEEEEecCeEEEEEE-------------cCCceEEecceeccCCcccEEEecC Confidence 865433211 00000000001111222222222211111110 0111111111111222223666654 Q ss_pred CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccC Q lcl|NC_021537. 221 PSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEE 300 (602) Q Consensus 221 ~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~ 300 (602) . ..|.|.+......++....+..-..+.+...+.|-.++ +|..++++....++. .+++.+.. T Consensus 213 ~-----~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~--~g~~~~~~~~~~~~~-----------~~~~~~~~ 274 (452) T protein:vir:36 213 N-----EERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTF--LGAAVEEEDLKNIRS-----------NRVINYYA 274 (452) T ss_pred C-----CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEe--ecCCcCchhhhhhhh-----------cceEEecC Confidence 3 25888887776666666555555555566666665555 454445443333221 11222222 Q ss_pred CccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH------------- Q lcl|NC_021537. 301 FVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ------------- 367 (602) Q Consensus 301 g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~------------- 367 (602) +.. ....++++ ++. +..+..+....+...+.|+..-++|..-.+ ..++- +.++. 
T Consensus 275 ~~~-------~~~~~~~~--l~~-~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~--~~gn~-Sg~Al~~~~~~l~~k~~~ 341 (452) T protein:vir:36 275 DGE-------GKNVDVKF--LEK-PDSDSQTENLLDRLTKLIFQTTMVANISDE--SFGSS-SGVSLAYKLQAMSNLALS 341 (452) T ss_pred CCC-------ccCCccee--Eee-cCCHHHHHHHHHHHHHHHHHHhCccccCcc--cccCC-cHHHHHHHHHHHHHHHHH Confidence 110 01112222 111 112345667778888899888889853222 22222 22221 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCC Q lcl|NC_021537. 368 TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPF 447 (602) Q Consensus 368 ~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~ 447 (602) ....+..+|+.+++.+...++..- ......+..+.|...-.. +....++.+.++ .|+++..-+.++++.- T Consensus 342 ~~~~~~~~l~~~~~li~~~~~~~~--~~~~~~~i~i~f~~~~p~----d~~~~a~~~~k~--~g~iS~et~~~~~~~~-- 411 (452) T protein:vir:36 342 FQRKFQSSLNSRYKLFCELSTNVS--NKDSWKDIEYTFTRNEPK----DIKEQAETANIL--MGITSQETALSVISVI-- 411 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHhccC--CccccccceEEeCCCCCc----CHHHHHHHHHHH--hccCChHHHHHhCCCC-- Confidence 112233344444444444333221 111223445667543222 333345666665 6889988888888652 Q ss_pred CCCccc--cccccccccccccccCCCcCccccccccccccccccccc Q lcl|NC_021537. 448 EDDRGD--MTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGER 492 (602) Q Consensus 448 ~~g~~d--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (602) ++...+ +.-...-. .....+....+........ 
....++ T Consensus 412 ~d~~~E~~ri~~E~~~-~~~~~~~~~~~~~~~~~~~-----~~~~~e 452 (452) T protein:vir:36 412 PDVQAEMEKIKKEEAS-TAIFDKDKQPSEKGTDTVV-----SETNEE 452 (452) T ss_pred CCHHHHHHHHHHHHHH-HHHHHhhccCCCCcccccC-----ccccCC Confidence 222111 11000000 0000000000000000000 000000 No 183 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=98.39 E-value=8.8e-07 Score=53.79 Aligned_cols=403 Identities=12% Similarity=0.064 Sum_probs=165.2 Q ss_pred CCCCc-ccccccc--hhhhcccCccc-cCC-C----CHHHH---HHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCC Q lcl|NC_021537. 1 MSKAE-ETTQLDE--RHIATDVGRGI-QPP-Y----NPETL---AAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADE 68 (602) Q Consensus 1 ~~k~~-~~~~~~~--~~~~~~~~~~i-~p~-~----~~~~l---~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~ 68 (602) +.+-. .-..+.. +++... -... .+. + ..... .++ ..++.+.+|+..+..+.+-|+.+... T Consensus 44 i~~~~~~~~r~~~l~~YY~g~-~~i~~~~~~~~~~~~~~~~~~~~ki--~~n~~k~Ivd~~~~~l~G~p~~~~~~----- 115 (483) T protein:vir:12 44 IKQHLEKLPEISIGQEYYEQR-PDIVKEPKPVDATGAVDPLKPDDRM--ITNFHANLVDQKVSYIVGKPIAFKHT----- 115 (483) T ss_pred HHHHHHHHHHHHHHHHHhccc-ccccccccccccccccccccccccc--ccchHHHHHHHHhhhhcccCceeccC----- Confidence 11100 0000000 011000 0000 010 0 00011 112 24788889999999998888776321 Q ss_pred cccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccc Q lcl|NC_021537. 69 PDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTT 148 (602) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~ 148 (602) +.+..+.+..++. | ........+..+.+.+|.+|+.+-.+.+|++ .+..++|..+-+..+. 
T Consensus 116 ---d~~~~~~l~~~~~------------n---~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~-~i~~~~p~~~~~v~d~ 176 (483) T protein:vir:12 116 ---DDEVVKRIDEVLG------------N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTD 176 (483) T ss_pred ---ChHHHHHHHHHHh------------c---cHHHHHHHHHHHHhhCCeEEEEEEEcCCCce-EEEEEcccceEEEEcC Confidence 1112222222211 1 2345566778899999999999999988875 5888999888664332 Q ss_pred cccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEec----------CceeEEechhHEEEe Q lcl|NC_021537. 149 TTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASD----------AGELKNGPANELIFL 218 (602) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~----------~~~~~~~~~~eviH~ 218 (602) ... ....-+..|+...+.....++.....+ ......+..... ......+..=.|++| T Consensus 177 ~~~--------~~~~~~ir~~~~~~~~~~~~y~~~~v~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 243 (483) T protein:vir:12 177 KEH--------EELEAFIRMYKLENETKVEYWDKVTVN-----YYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPF 243 (483) T ss_pred CCC--------CceEEEEEEEEeecceEEEEEecCeEE-----EEEEeCCeeeecccccccccccccccCCCCccceEEe Confidence 110 000111111111111111111100000 000000000000 000011112225555 Q ss_pred cCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceec Q lcl|NC_021537. 219 PNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEV 298 (602) Q Consensus 219 r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~ 298 (602) +.. ..|.|.++.....++....+..-..+.+...+.|-.+++ |....+ .......+. 
..+++.+ T Consensus 244 ~nn-----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~--g~~~~~--~~~~~~~~~-------~~~~~~~ 307 (483) T protein:vir:12 244 KNN-----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT--NYDDQE--LPEFKRLLR-------YYGAIKV 307 (483) T ss_pred cCC-----CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCccc--chhHHHhhh-------hcccccc Confidence 532 368888888777777666655556666666677766554 321111 111111111 1122222 Q ss_pred cCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH----------- Q lcl|NC_021537. 299 EEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ----------- 367 (602) Q Consensus 299 ~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~----------- 367 (602) +.+.+ +++ ++. +..+..+....+...+.|+..-++|....+..+ ++- +.++. T Consensus 308 ~~~~~------------~~~--l~~-~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~-Sg~Al~~~~~~l~~k~ 370 (483) T protein:vir:12 308 SDNGG------------VDT--IQV-EVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAP-SGVALEFLYTNLNLKA 370 (483) T ss_pred CCCCc------------ceE--Eee-cCCHHHHHHHHHHHHHHHHHHhCCCCCCccccc-cCc-HHHHHHHHHHHHHHHH Confidence 22221 122 111 112345667777888888888888864332211 121 22221 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCC Q lcl|NC_021537. 368 --TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLA 445 (602) Q Consensus 368 --~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~ 445 (602) ....+..+|+-+++.+...++. .....+..+.|...... +....++.+.++ .|+++..-++++++.- T Consensus 371 ~~~~~~f~~~l~~~~~li~~~~~~-----~~~~~~i~v~f~~~~p~----~~~~~a~~~~kl--~GiiS~et~~~~~~~v 439 (483) T protein:vir:12 371 DKLARKAKVAIQELLWFVFEHFDI-----KGEHKDVDISFNYNKVA----NTELQVQTAQQS--MGIVSHETVLENHPFV 439 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcC-----CCccceeeEEeCCCCCC----CHHHHHHHHHHH--hccCchHHHHHhCCCC Confidence 1122333444444444443332 12234456666544322 333345666666 5899998888888652 Q ss_pred CCCCCcc--ccccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 
446 PFEDDRG--DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 446 p~~~g~~--d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) ++... ++...... ............+.......+ ..++...| T Consensus 440 --~d~~~E~~ri~~E~~-~~~~~~~~~~~~~~d~~~~~~-~~~~~e~e 483 (483) T protein:vir:12 440 --EDLQAELERIEQEQM-EYNKQLPNLDDGGADGAQQQE-RSNNKESE 483 (483) T ss_pred --CCHHHHHHHHHHHHH-HHHhhcccccccccCCcccCC-CCCcccCC Confidence 22211 11100000 000000000000000000000 00011111 No 184 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=98.39 E-value=9.1e-07 Score=53.70 Aligned_cols=407 Identities=9% Similarity=0.006 Sum_probs=166.6 Q ss_pred CCCCcccccccchhhhccc-C--ccc-cCCCCH-HHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhh Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDV-G--RGI-QPPYNP-ETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGES 75 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~-~--~~i-~p~~~~-~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~ 75 (602) +.+-. ...........-+ | ... .++..+ ....++. .++...+|+..+..+.+-|+.+... +... T Consensus 26 i~~~~-~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~--~n~~~~ivd~~~~~l~g~~~~~~~~--------d~~~ 94 (453) T protein:vir:73 26 MKKHQ-EEVERYEYLGNMYKGIMEISSQKAKDSWKPDNRLT--NNFAKYIVDTFVGYFNGIPIKKTHD--------DKSV 94 (453) T ss_pred HHHHH-HHHHHHHHHHHHhccccchhcCCCCCccCccceee--cchHHHHHHHhhhhhcccCceeecC--------ChHH Confidence 11111 1111110000000 0 000 011111 0111222 4677888999998888888776421 1112 Q ss_pred HHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccc-cccc Q lcl|NC_021537. 76 YQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTT-IERE 154 (602) Q Consensus 76 ~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~-~~~~ 154 (602) .+.+..++. ...+......+..+.+.+|.+|+.+.++.+|.+ .+..++|..+-+..+... -... 
T Consensus 95 ~~~l~~~~~--------------~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~dd~~~~~~~ 159 (453) T protein:vir:73 95 LEAMQLFDN--------------LNDMEDEESELAKIACVYGRAYELMYQNESTES-EVIYCSPLNVFMVYDDSIKQKPL 159 (453) T ss_pred HHHHHHHHH--------------hcChhHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEcccceEEEEeCCCCceeE Confidence 222322221 123456777888999999999999999888876 567788887755432211 0000 Q ss_pred cchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHH Q lcl|NC_021537. 155 DGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWV 234 (602) Q Consensus 155 ~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~ 234 (602) -.........+..++.+......+.+. ...+.+.........+..=.|++|+.. ..|.|.+. T Consensus 160 ~~i~~~~~~~~~~~~~vyt~~~i~~~~-------------~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~s~~~ 221 (453) T protein:vir:73 160 FAVYYGFDEEGNLSGTVYTLLETISIT-------------GKAGEVKFGESTYNVYSDLPIVEYNFN-----EERQSIFE 221 (453) T ss_pred EEEEEEEecCceEEEEEEeCCeEEEEE-------------ecCCceEEccceeccCCceeEEEecCC-----CCCCcchh Confidence 000001111222222222211111110 111111111111111222236666542 36888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceecccccccc Q lcl|NC_021537. 235 AAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDV 314 (602) Q Consensus 235 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~ 314 (602) .....++....+..-..+.....+.|-.+++ |..++++....++..---.......+.. +....+. 
T Consensus 222 ~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~--g~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~ 287 (453) T protein:vir:73 222 PVHSLINSYNKVTSEKANDVEYFSDQYLVFL--GAEVDEEDAKNIKDNRLINFFDKNSNGQ------------GTNAAKV 287 (453) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhccceeeee--cCCCCchhhhcccccccccccccccccc------------cccccCc Confidence 7777776655555555555556666666554 4444544444332210000000000000 0001111 Q ss_pred ccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------------HHHHHHHHHHHHHH Q lcl|NC_021537. 315 NIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TREFAKGIIEPEQA 381 (602) Q Consensus 315 ~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~f~~~~l~P~~~ 381 (602) +++| ++. +..+..+....+...+.|+..-++|.. +....++ ++.++. ....+...|+-+++ T Consensus 288 d~~~--l~~-~~~~~~~~~~~~~l~~~I~~~s~~p~~--~~~~~gn-~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~ 361 (453) T protein:vir:73 288 DVKF--LDK-PDSDVQTENLLNRLERSIFQFTMAANI--SDENFGN-SSGVALAYKLQAMSNLALSFQRKFQSALNRRYS 361 (453) T ss_pred eeEE--eee-cCCHHHHHHHHHHHHHHHHHHhCCccc--CcccccC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222 221 112345567777888888888888752 2222122 222221 11222334444444 Q ss_pred HHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccc--cccccc Q lcl|NC_021537. 382 KFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGD--MTLSEF 459 (602) Q Consensus 382 ~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d--~~~~~~ 459 (602) .+...++..-. .....+..++|+..-.. +....++.+.+++ |+++..-+.++++. +++...+ ++.... T Consensus 362 li~~~~~~~~~--~~~~~~i~v~f~~~~p~----~~~~~a~~~~k~~--giis~et~~~~~~~--~~d~~~E~~ri~~E~ 431 (453) T protein:vir:73 362 LWSSLSTNASN--KDAWKDIEYTFTRNEPK----DIKEQAETANILK--GITSEETALSVISV--IPDVQAEMEKIKKKK 431 (453) T ss_pred HHHHHHhccCC--ccccccceEEeCCCCCC----CHHHHHHHHHHHh--ccCcHHHHHHhCCC--CCCHHHHHHHHHHHH Confidence 44333332211 11223456666543222 3344556677764 88888777777765 2221111 100000 Q ss_pred cccccccccCCCcCcccccccccccccccc Q lcl|NC_021537. 
460 EAEFGADASDGDAEAMLTRSKAAPPLENKI 489 (602) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (602) ......+ +. ....+..+..... T Consensus 432 ~~~~~~~-~~-------~~~~~~~~~~~~~ 453 (453) T protein:vir:73 432 LLQLSLT-RT-------SNLVRMKQMRGNL 453 (453) T ss_pred HHHHHHH-Hh-------ccCCcchhhhcCC Confidence 0000000 00 0000000000000 No 185 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=98.36 E-value=1.1e-06 Score=53.36 Aligned_cols=411 Identities=13% Similarity=-0.011 Sum_probs=163.3 Q ss_pred CCCC---cccccccc----hhhhcccCcccc------CCCCHHH---HHHHHhhhHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021537. 1 MSKA---EETTQLDE----RHIATDVGRGIQ------PPYNPET---LAAFQELNETHQACIRKKSRYEAGYGFEIVAHP 64 (602) Q Consensus 1 ~~k~---~~~~~~~~----~~~~~~~~~~i~------p~~~~~~---l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~ 64 (602) ++|- -+.+.... +++...- .... +...... ..++ ..++...+|+..+..+.+-|+.+... T Consensus 32 i~~~i~~~~~~~~~~~~~~~YY~g~~-~i~~~~~~~~~~~~~~~~~~~~ki--~~n~~k~Ivd~~~~~l~g~p~~~~~~- 107 (474) T protein:vir:97 32 IVRLIDDHRKQLDKITVGQRYYDKDN-DIVKQMKKVDVHGNIDYDKPDWRI--TTNFHQNLVDQKVSYVASKPVTYSCE- 107 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccc-chhcccchhccccccccccCccee--ecchHHHHHHHHHhhhhcCCceeccC- Confidence 1000 00000000 1111000 0000 0000000 0111 14677889999999999988876321 Q ss_pred CCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccc Q lcl|NC_021537. 
65 SADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRV 144 (602) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~ 144 (602) +....+.+..++ ...+......+.++.+.+|.||+.+.++.+|.+ .+..++|..+-+ T Consensus 108 -------d~~~~~~l~~~~---------------~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~ 164 (474) T protein:vir:97 108 -------DENVLKVIHDVL---------------DTRWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPAEQAIP 164 (474) T ss_pred -------cHHHHHHHHHHH---------------hccHHHHHHHHHHHHhhcCceEEEEEecCCCee-EEEEEcccceEE Confidence 111122222211 123456667778899999999999988888874 577888888875 Q ss_pred cccccc-cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCC Q lcl|NC_021537. 145 RKTTTT-IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSP 223 (602) Q Consensus 145 ~~~~~~-~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~ 223 (602) ..+... ....-..... ...+..++.+......+.+.......... ...... ..........+..=.|++|+.. T Consensus 165 v~d~~~~~~~~~~ir~~-~~~~~~~~~~yt~~~~~~y~~~~~~~~~~-~~~~~~--~~~~~~~~~~~g~vPvv~~~nn-- 238 (474) T protein:vir:97 165 IWVDKEREELKSFIRYY-KFNNEEKVEFWTDTTVTYYVLENGGLIPD-YYYGAN--HVQSHFSNGNWGRVPFIAFKNN-- 238 (474) T ss_pred EEcCCCCCceEEEEEEE-EecCeEEEEEEeCCeEEEEEEcCCccccc-cccCcC--cccccccccCCCccceEEecCC-- Confidence 533211 0000000000 00111111111111111111000000000 000000 0000000111222236666543 Q ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCcc Q lcl|NC_021537. 224 LALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVD 303 (602) Q Consensus 224 ~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~ 303 (602) .+|+|.+..+...++....+.....+.+...+.|-.+++ |...++ .+.+.... 
...+++.++++.+ T Consensus 239 ---~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~--g~~~~~--~~~~~~~~-------~~~~~i~~~~~~~ 304 (474) T protein:vir:97 239 ---PEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILK--GYEGED--LEEFMRGL-------KYYKAINVDGDGG 304 (474) T ss_pred ---cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCccc--chhhhhhh-------hccceeeccCCCc Confidence 368898888877777766655555566666666666554 322221 11111111 1223344433322 Q ss_pred ceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------------HHH Q lcl|NC_021537. 304 DHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TRE 370 (602) Q Consensus 304 ~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~ 370 (602) . ++ ++.. .....+....+...+.|...-++|..-.+-. .++- +..+. ... T Consensus 305 ~------------~~--l~~~-~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~-Sg~Al~~~~~~l~~k~~~k~~ 367 (474) T protein:vir:97 305 V------------ET--IQVE-VPVSSTKEYIDLMRVYIMEFGQGVDFQTDKF-GSAP-SGIALKFLYGNLDLKANKLKN 367 (474) T ss_pred e------------eE--Eeec-CCHHHHHHHHHHHHHHHHHHhCccccCcccc-cccc-HHHHHHHHHHHHHHHHHHHHH Confidence 1 22 1111 1123455667777788888888875322111 1221 22211 112 Q ss_pred HHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCC Q lcl|NC_021537. 371 FAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDD 450 (602) Q Consensus 371 f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g 450 (602) .+..+|+-++..+...++. ........+.|+.... ..+. +.++.++++|+++..-++++++. +++. T Consensus 368 ~~~~~l~~~~~li~~~~~~-----~~d~~~i~v~f~~~~p---~~~~----e~a~~~~~~g~iS~et~l~~l~~--v~D~ 433 (474) T protein:vir:97 368 KATVAIQELISFIIDFNNL-----KTDVKDIEISFNFNRM---MNDA----EQSQIIAQSQYLSRETLVKSSPL--VDDY 433 (474) T ss_pred HHHHHHHHHHHHHHHHhCC-----CcccceeeEEeccCcc---cCHH----HHHHHHHHcCCCCHHHHHHhCCC--CCCH Confidence 2333444444444433321 1223445666754332 2122 22345566899999889988865 3322 Q ss_pred cc--ccccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 
451 RG--DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 451 ~~--d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) .. ++.--... ............+...... ....++...| T Consensus 434 ~~E~eri~~E~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~e 474 (474) T protein:vir:97 434 KAELERIEQEQM-EYNKQLPNLDDGGADGAQQ-QEGSNNKESE 474 (474) T ss_pred HHHHHHHHHHHH-HHHhhccccCCCCCCCccc-CCCCcccccC Confidence 11 11100000 0000000000000000000 0000111111 No 186 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=98.36 E-value=1.1e-06 Score=53.36 Aligned_cols=411 Identities=13% Similarity=-0.011 Sum_probs=163.3 Q ss_pred CCCC---cccccccc----hhhhcccCcccc------CCCCHHH---HHHHHhhhHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021537. 1 MSKA---EETTQLDE----RHIATDVGRGIQ------PPYNPET---LAAFQELNETHQACIRKKSRYEAGYGFEIVAHP 64 (602) Q Consensus 1 ~~k~---~~~~~~~~----~~~~~~~~~~i~------p~~~~~~---l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~ 64 (602) ++|- -+.+.... +++...- .... +...... ..++ ..++...+|+..+..+.+-|+.+... T Consensus 32 i~~~i~~~~~~~~~~~~~~~YY~g~~-~i~~~~~~~~~~~~~~~~~~~~ki--~~n~~k~Ivd~~~~~l~g~p~~~~~~- 107 (474) T protein:vir:94 32 IVRLIDDHRKQLDKITVGQRYYDKDN-DIVKQMKKVDVHGNIDYDKPDWRI--TTNFHQNLVDQKVSYVASKPVTYSCE- 107 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccc-chhcccchhccccccccccCccee--ecchHHHHHHHHHhhhhcCCceeccC- Confidence 1000 00000000 1111000 0000 0000000 0111 14677889999999999988876321 Q ss_pred CCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccc Q lcl|NC_021537. 
65 SADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRV 144 (602) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~ 144 (602) +....+.+..++ ...+......+.++.+.+|.||+.+.++.+|.+ .+..++|..+-+ T Consensus 108 -------d~~~~~~l~~~~---------------~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~ 164 (474) T protein:vir:94 108 -------DENVLKVIHDVL---------------DTRWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPAEQAIP 164 (474) T ss_pred -------cHHHHHHHHHHH---------------hccHHHHHHHHHHHHhhcCceEEEEEecCCCee-EEEEEcccceEE Confidence 111122222211 123456667778899999999999988888874 577888888875 Q ss_pred cccccc-cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCC Q lcl|NC_021537. 145 RKTTTT-IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSP 223 (602) Q Consensus 145 ~~~~~~-~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~ 223 (602) ..+... ....-..... ...+..++.+......+.+.......... ...... ..........+..=.|++|+.. T Consensus 165 v~d~~~~~~~~~~ir~~-~~~~~~~~~~yt~~~~~~y~~~~~~~~~~-~~~~~~--~~~~~~~~~~~g~vPvv~~~nn-- 238 (474) T protein:vir:94 165 IWVDKEREELKSFIRYY-KFNNEEKVEFWTDTTVTYYVLENGGLIPD-YYYGAN--HVQSHFSNGNWGRVPFIAFKNN-- 238 (474) T ss_pred EEcCCCCCceEEEEEEE-EecCeEEEEEEeCCeEEEEEEcCCccccc-cccCcC--cccccccccCCCccceEEecCC-- Confidence 533211 0000000000 00111111111111111111000000000 000000 0000000111222236666543 Q ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCcc Q lcl|NC_021537. 224 LALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVD 303 (602) Q Consensus 224 ~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~ 303 (602) .+|+|.+..+...++....+.....+.+...+.|-.+++ |...++ .+.+.... 
...+++.++++.+ T Consensus 239 ---~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~--g~~~~~--~~~~~~~~-------~~~~~i~~~~~~~ 304 (474) T protein:vir:94 239 ---PEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILK--GYEGED--LEEFMRGL-------KYYKAINVDGDGG 304 (474) T ss_pred ---cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCccc--chhhhhhh-------hccceeeccCCCc Confidence 368898888877777766655555566666666666554 322221 11111111 1223344433322 Q ss_pred ceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------------HHH Q lcl|NC_021537. 304 DHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TRE 370 (602) Q Consensus 304 ~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~ 370 (602) . ++ ++.. .....+....+...+.|...-++|..-.+-. .++- +..+. ... T Consensus 305 ~------------~~--l~~~-~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~-Sg~Al~~~~~~l~~k~~~k~~ 367 (474) T protein:vir:94 305 V------------ET--IQVE-VPVSSTKEYIDLMRVYIMEFGQGVDFQTDKF-GSAP-SGIALKFLYGNLDLKANKLKN 367 (474) T ss_pred e------------eE--Eeec-CCHHHHHHHHHHHHHHHHHHhCccccCcccc-cccc-HHHHHHHHHHHHHHHHHHHHH Confidence 1 22 1111 1123455667777788888888875322111 1221 22211 112 Q ss_pred HHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCC Q lcl|NC_021537. 371 FAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDD 450 (602) Q Consensus 371 f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g 450 (602) .+..+|+-++..+...++. ........+.|+.... ..+. +.++.++++|+++..-++++++. +++. T Consensus 368 ~~~~~l~~~~~li~~~~~~-----~~d~~~i~v~f~~~~p---~~~~----e~a~~~~~~g~iS~et~l~~l~~--v~D~ 433 (474) T protein:vir:94 368 KATVAIQELISFIIDFNNL-----KTDVKDIEISFNFNRM---MNDA----EQSQIIAQSQYLSRETLVKSSPL--VDDY 433 (474) T ss_pred HHHHHHHHHHHHHHHHhCC-----CcccceeeEEeccCcc---cCHH----HHHHHHHHcCCCCHHHHHHhCCC--CCCH Confidence 2333444444444433321 1223445666754332 2122 22345566899999889988865 3322 Q ss_pred cc--ccccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 
451 RG--DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 451 ~~--d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) .. ++.--... ............+...... ....++...| T Consensus 434 ~~E~eri~~E~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~e 474 (474) T protein:vir:94 434 KAELERIEQEQM-EYNKQLPNLDDGGADGAQQ-QEGSNNKESE 474 (474) T ss_pred HHHHHHHHHHHH-HHHhhccccCCCCCCCccc-CCCCcccccC Confidence 11 11100000 0000000000000000000 0000111111 No 187 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=98.36 E-value=1.1e-06 Score=53.28 Aligned_cols=412 Identities=12% Similarity=-0.030 Sum_probs=159.0 Q ss_pred CCCCcc--cccc-cchhhh-cccCc-cccC--CC-------CHH-HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021537. 1 MSKAEE--TTQL-DERHIA-TDVGR-GIQP--PY-------NPE-TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPS 65 (602) Q Consensus 1 ~~k~~~--~~~~-~~~~~~-~~~~~-~i~p--~~-------~~~-~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~ 65 (602) ++|--+ ...+ ...... +..|. -|.. .. +.. .-.++ .+++...+|+..+..+.|-|+.+... T Consensus 32 i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki--~~n~~k~Iv~~~~~yl~g~p~~~~~~-- 107 (474) T protein:vir:96 32 IIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRI--TTNFHQNLVDQKVSYVAGKPVTYAHD-- 107 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhccccccccccccc--ccchHHHHHHhhhhhhcccCceeccC-- Confidence 000000 0000 000000 00000 0000 00 000 00011 24677888999999998888876421 Q ss_pred CCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccc Q lcl|NC_021537. 66 ADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVR 145 (602) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~ 145 (602) +....+.+..++. ..+......+..+.+.+|.||..+-++.+|.+ .+..++|..+-+. 
T Consensus 108 ------~~~~~~~l~~~~~---------------n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v 165 (474) T protein:vir:96 108 ------DDKVLDVIHQVLD---------------TRWDNKLIDILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPI 165 (474) T ss_pred ------ChHHHHHHHHHHh---------------ccHHHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEE Confidence 1111222222110 13566677788999999999999989888875 6777888887654 Q ss_pred ccccc-cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCC Q lcl|NC_021537. 146 KTTTT-IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPL 224 (602) Q Consensus 146 ~~~~~-~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~ 224 (602) .+... ....-... .....+..++.+......+.+....... ........ ...........+..=.|++++.. T Consensus 166 ~d~~~~~~~~a~ir-~~~~~~~~~~~vy~~~~i~~~~~~~~~~-~~~~~~~~--~~~~~~~~~~~~~~vPvv~~~nn--- 238 (474) T protein:vir:96 166 WTDKEREQLNAFIR-IFTFNGETKVEYWTAETVTYYVYENGGL-IPDFYYGD--EHIQTHFSTGSWERVPFIAFKNN--- 238 (474) T ss_pred EcCCCCCceEEEEE-EEeecCeeEEEEEeCCeEEEEEEcCCce-eecccccc--ccccCcccccCCCccceEEecCC--- Confidence 32211 00000000 0000111111111111111110000000 00000000 00000001112222236666543 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccc Q lcl|NC_021537. 225 ALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDD 304 (602) Q Consensus 225 ~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~ 304 (602) ..|.|.+......++....+..-..+.+...+.|-.+++ |.... 
........+ ...+++.++++.+ T Consensus 239 --~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--g~~~~--~~~~~~~~~-------~~~~~i~~~~~~~- 304 (474) T protein:vir:96 239 --PEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR--GYEGE--DLSEFMEGL-------KYYKAINVSSDGG- 304 (474) T ss_pred --CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--CCCcc--cccchhhhh-------hccceeeccCCCc- Confidence 368888887777777666555555555666666655543 32111 111111111 1223333333321 Q ss_pred eeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------------HHHH Q lcl|NC_021537. 305 HGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TREF 371 (602) Q Consensus 305 ~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~f 371 (602) +++ ++. +..+..+....+...+.|...-++|.....-. .++ .+..+. .... T Consensus 305 -----------~~~--l~~-~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n-~Sg~Alk~~~~~l~~k~~~~~~~ 368 (474) T protein:vir:96 305 -----------VET--IQV-EVPVASTKEYLDMMRAYIVEFGQGVDFQTDKF-GSA-TSGIALKFLYTNLNLKANKLKNK 368 (474) T ss_pred -----------eeE--Eec-cCCHHHHHHHHHHHHHHHHHHhCCcCcccccc-ccc-cHHHHHHHHHHHHHHHHHHHHHH Confidence 222 111 11234566777888888999889885432211 122 121211 1112 Q ss_pred HHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCc Q lcl|NC_021537. 372 AKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDR 451 (602) Q Consensus 372 ~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~ 451 (602) +...|+-+++.+...++. ........+.|+..... +....+ +.+.++|+++...++++++.- ++.. 
T Consensus 369 ~~~~l~~~~~~i~~~~g~-----~~d~~~i~i~f~~~~p~----~~~e~a---~~~~~~giiS~et~~~~lp~v--~D~~ 434 (474) T protein:vir:96 369 ANVALQELMQFILDFNKI-----KLDAKEIEITFNFNVMV----NDLEQS---QIGAQSQYLSKETLVRHHPWV--DDPK 434 (474) T ss_pred HHHHHHHHHHHHHHHhCC-----CcccceeeEEecCCCcc----CHHHHH---HHHHHcCCCChHHHHHhCCCC--CCHH Confidence 223333333333332221 11223455666543322 222222 345568999999999888652 2221 Q ss_pred c--ccccccccccccccccCCCcCcccccccccccccccccccc Q lcl|NC_021537. 452 G--DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERD 493 (602) Q Consensus 452 ~--d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (602) . ++.-.... ......+.....+... ..++ .+....+.. T Consensus 435 ~E~eri~~E~~-~~~~~~~~~~~~~~~~--~~~~-~~~~~~e~~ 474 (474) T protein:vir:96 435 AELERLDEEQL-ELNKQLPNLDDGGADG--AQQQ-QQSENNQSK 474 (474) T ss_pred HHHHHHHHHHH-HHHhhccccccccCCC--CCCc-CCCCccccC Confidence 1 11100000 0000000000000000 0000 000000000 No 188 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=98.36 E-value=1.1e-06 Score=53.28 Aligned_cols=412 Identities=12% Similarity=-0.030 Sum_probs=159.0 Q ss_pred CCCCcc--cccc-cchhhh-cccCc-cccC--CC-------CHH-HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021537. 1 MSKAEE--TTQL-DERHIA-TDVGR-GIQP--PY-------NPE-TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPS 65 (602) Q Consensus 1 ~~k~~~--~~~~-~~~~~~-~~~~~-~i~p--~~-------~~~-~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~ 65 (602) ++|--+ ...+ ...... +..|. -|.. .. +.. .-.++ .+++...+|+..+..+.|-|+.+... 
T Consensus 32 i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki--~~n~~k~Iv~~~~~yl~g~p~~~~~~-- 107 (474) T protein:vir:95 32 IIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRI--TTNFHQNLVDQKVSYVAGKPVTYAHD-- 107 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhccccccccccccc--ccchHHHHHHhhhhhhcccCceeccC-- Confidence 000000 0000 000000 00000 0000 00 000 00011 24677888999999998888876421 Q ss_pred CCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccc Q lcl|NC_021537. 66 ADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVR 145 (602) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~ 145 (602) +....+.+..++. ..+......+..+.+.+|.||..+-++.+|.+ .+..++|..+-+. T Consensus 108 ------~~~~~~~l~~~~~---------------n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v 165 (474) T protein:vir:95 108 ------DDKVLDVIHQVLD---------------TRWDNKLIDILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPI 165 (474) T ss_pred ------ChHHHHHHHHHHh---------------ccHHHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEE Confidence 1111222222110 13566677788999999999999989888875 6777888887654 Q ss_pred ccccc-cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEecCCCCC Q lcl|NC_021537. 146 KTTTT-IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLPNPSPL 224 (602) Q Consensus 146 ~~~~~-~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~ 224 (602) .+... ....-... .....+..++.+......+.+....... ........ ...........+..=.|++++.. 
T Consensus 166 ~d~~~~~~~~a~ir-~~~~~~~~~~~vy~~~~i~~~~~~~~~~-~~~~~~~~--~~~~~~~~~~~~~~vPvv~~~nn--- 238 (474) T protein:vir:95 166 WTDKEREQLNAFIR-IFTFNGETKVEYWTAETVTYYVYENGGL-IPDFYYGD--EHIQTHFSTGSWERVPFIAFKNN--- 238 (474) T ss_pred EcCCCCCceEEEEE-EEeecCeeEEEEEeCCeEEEEEEcCCce-eecccccc--ccccCcccccCCCccceEEecCC--- Confidence 32211 00000000 0000111111111111111110000000 00000000 00000001112222236666543 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccc Q lcl|NC_021537. 225 ALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDD 304 (602) Q Consensus 225 ~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~ 304 (602) ..|.|.+......++....+..-..+.+...+.|-.+++ |.... ........+ ...+++.++++.+ T Consensus 239 --~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--g~~~~--~~~~~~~~~-------~~~~~i~~~~~~~- 304 (474) T protein:vir:95 239 --PEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR--GYEGE--DLSEFMEGL-------KYYKAINVSSDGG- 304 (474) T ss_pred --CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--CCCcc--cccchhhhh-------hccceeeccCCCc- Confidence 368888887777777666555555555666666655543 32111 111111111 1223333333321 Q ss_pred eeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------------HHHH Q lcl|NC_021537. 305 HGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TREF 371 (602) Q Consensus 305 ~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~f 371 (602) +++ ++. +..+..+....+...+.|...-++|.....-. .++ .+..+. .... 
T Consensus 305 -----------~~~--l~~-~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n-~Sg~Alk~~~~~l~~k~~~~~~~ 368 (474) T protein:vir:95 305 -----------VET--IQV-EVPVASTKEYLDMMRAYIVEFGQGVDFQTDKF-GSA-TSGIALKFLYTNLNLKANKLKNK 368 (474) T ss_pred -----------eeE--Eec-cCCHHHHHHHHHHHHHHHHHHhCCcCcccccc-ccc-cHHHHHHHHHHHHHHHHHHHHHH Confidence 222 111 11234566777888888999889885432211 122 121211 1112 Q ss_pred HHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCc Q lcl|NC_021537. 372 AKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDR 451 (602) Q Consensus 372 ~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~ 451 (602) +...|+-+++.+...++. ........+.|+..... +....+ +.+.++|+++...++++++.- ++.. T Consensus 369 ~~~~l~~~~~~i~~~~g~-----~~d~~~i~i~f~~~~p~----~~~e~a---~~~~~~giiS~et~~~~lp~v--~D~~ 434 (474) T protein:vir:95 369 ANVALQELMQFILDFNKI-----KLDAKEIEITFNFNVMV----NDLEQS---QIGAQSQYLSKETLVRHHPWV--DDPK 434 (474) T ss_pred HHHHHHHHHHHHHHHhCC-----CcccceeeEEecCCCcc----CHHHHH---HHHHHcCCCChHHHHHhCCCC--CCHH Confidence 223333333333332221 11223455666543322 222222 345568999999999888652 2221 Q ss_pred c--ccccccccccccccccCCCcCcccccccccccccccccccc Q lcl|NC_021537. 452 G--DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERD 493 (602) Q Consensus 452 ~--d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (602) . ++.-.... ......+.....+... ..++ .+....+.. 
T Consensus 435 ~E~eri~~E~~-~~~~~~~~~~~~~~~~--~~~~-~~~~~~e~~ 474 (474) T protein:vir:95 435 AELERLDEEQL-ELNKQLPNLDDGGADG--AQQQ-QQSENNQSK 474 (474) T ss_pred HHHHHHHHHHH-HHHhhccccccccCCC--CCCc-CCCCccccC Confidence 1 11100000 0000000000000000 0000 000000000 No 189 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=98.32 E-value=1.3e-06 Score=52.80 Aligned_cols=414 Identities=13% Similarity=0.023 Sum_probs=163.5 Q ss_pred CCCCcc--------------c--cccc--chhhhcccCcccc-C--------CCCHHHHHHHHhhhHHHHHHHHHHHHhh Q lcl|NC_021537. 1 MSKAEE--------------T--TQLD--ERHIATDVGRGIQ-P--------PYNPETLAAFQELNETHQACIRKKSRYE 53 (602) Q Consensus 1 ~~k~~~--------------~--~~~~--~~~~~~~~~~~i~-p--------~~~~~~l~~~~~~~~~v~~cI~~ia~~i 53 (602) +....+ . ..+. .+.+... ..... + ......-.+++ +++...+|+..+..+ T Consensus 20 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~-~~i~~~~~~~~~~~~~~~~~~~~ki~--~n~~~~ivd~~~~~l 96 (478) T protein:vir:10 20 IKPKYETQEEMILRLVREHKENIDNITMGERYYNHH-PDILDAPPKRDVNGDYDETKPDWRMY--TNYHQNLVDQKVAYA 96 (478) T ss_pred HhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCC-Cchhccccccccccccccccccceec--cchHHHHHHHHHhhh Confidence 000000 0 0000 0001000 00000 0 00000001122 357788999999999 Q ss_pred ccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEE Q lcl|NC_021537. 54 AGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVG 133 (602) Q Consensus 54 a~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~ 133 (602) .+-|..+.. + +.+..+.+..++ . .++......+.++.+.+|.+|+.+..+.+|++ . 
T Consensus 97 ~g~~~~~~~--~------~d~~~~~l~~~~----------~-----n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~-~ 152 (478) T protein:vir:10 97 VANPVTFGV--D------NDKALKQIQHTL----------N-----HKWDDKLVDILTAASNKGIEWVQPYVDEEGEF-K 152 (478) T ss_pred ccCCeeeec--C------ChHHHHHHHHHH----------h-----cCHHHHHHHHHHHHHhcCeEEEEEEecCCCee-E Confidence 888877632 1 111122222211 1 13566777788999999999999888888875 5 Q ss_pred EEEeCcccccccccccc-cccccchhhhhcccCceeEEEEcCCcceeecc--cccccccceeeecccceEEecCceeEEe Q lcl|NC_021537. 134 LAHVPAATVRVRKTTTT-IEREDGEEVENIESGHGYVQVRQGRRRYFGEA--GDRYGDDKRFVDKETGEVASDAGELKNG 210 (602) Q Consensus 134 L~~l~p~~v~~~~~~~~-~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~--~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 210 (602) +..++|..+.+..+... ....-....+ ...+..++.+......+.+.. +.............. .+...+.....+ T Consensus 153 ~~~~~p~~~~~i~d~~~~~~~~~~v~~~-~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 230 (478) T protein:vir:10 153 TFRVPAEQAVPIWTNKERDELQAFIRVY-ELDGAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQ-PHYYQGNKLMSW 230 (478) T ss_pred EEEEcccceEEEEcCCCCCceEEEEEEE-EecCceEEEEEeCCeEEEEEEcCCeeeccccccccccc-cceecccccccC Confidence 77888888875433211 0000000000 011112222211111111110 000000000000000 000111112233 Q ss_pred chhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhccc Q lcl|NC_021537. 211 PANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSR 290 (602) Q Consensus 211 ~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~ 290 (602) ..=.|+||+.. .+|.|.+......++....+..-..+.+...+.|-.+++ |....+. 
.......+ T Consensus 231 ~~vPvv~~~n~-----~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~--g~~~~~~--~~~~~~~~------ 295 (478) T protein:vir:10 231 GRVPFIPFKNN-----PQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILK--GYEGEDM--KDFMHNLK------ 295 (478) T ss_pred CccceEEeccC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeee--cCCcccc--chhhhhhh------ Confidence 33347777643 378888887777776666555555555666666655543 3222211 11111111 Q ss_pred ccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH--- Q lcl|NC_021537. 291 YRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ--- 367 (602) Q Consensus 291 nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~--- 367 (602) ..+++.+.+. . ..+.++. +. +.....+....+...+.|...-++|..-.+.. .++- +.++. T Consensus 296 -~~~~~~~~~~--------~--~~~~~~l--~~-~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~-Sg~Al~~~ 359 (478) T protein:vir:10 296 -YYKAISVAGE--------S--GSGVDTI--KV-EVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKF-GNSP-SGIALKFM 359 (478) T ss_pred -hcceEEecCC--------C--CCcceEE--ee-cCChHHHHHHHHHHHHHHHHHhCccccCcccc-cccc-HHHHHHHH Confidence 1122222100 0 0111221 11 11234566777888888888888875332211 1121 22211 Q ss_pred ----------HHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHH Q lcl|NC_021537. 368 ----------TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNE 437 (602) Q Consensus 368 ----------~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE 437 (602) ....+..+|+-+++.+...+. .........+.|+..-.. +....++.+.++ +|+++... 
T Consensus 360 ~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g-----~~~~~~~i~i~f~~~~p~----d~~e~a~~~~kl--~g~iS~et 428 (478) T protein:vir:10 360 YSNLDLKANKLKNKTLTALQELLQYIIDFYR-----LDVKVQDIEITFNFNVMV----NELENSQIAMNS--TGLLSKET 428 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCcccccceEEecCCCCC----CHHHHHHHHHHH--hCCCChHH Confidence 111222233333333322221 112223456666543322 333445666665 78999988 Q ss_pred HHHHhCCCCCCCCcc--ccccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 438 AREELDLAPFEDDRG--DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 438 ~R~~~Gl~p~~~g~~--d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) +++++++- ++... ++.-... ........... .+...+...+ .++...+ T Consensus 429 ~~~~l~~v--~D~~~E~~ri~~E~-~~~~~~~~~~~-~~~~~~~~~~--~~~~~~~ 478 (478) T protein:vir:10 429 ILSNHAWV--EDPVAEMERIEQEN-IELNQQLPDIE-EGLNGEQQRQ--SENNQPE 478 (478) T ss_pred HHHhCCCC--CCHHHHHHHHHHHH-HHHHhhccccc-cccCCCCCCC--CCCCCCC Confidence 99988753 22211 1111000 00000000000 0000000000 0000000 No 190 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=98.31 E-value=1.5e-06 Score=52.58 Aligned_cols=405 Identities=12% Similarity=0.060 Sum_probs=163.3 Q ss_pred CCCCc-ccccccc--hhhhcccCcc-ccCC-CC----HHHH---HHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCC Q lcl|NC_021537. 1 MSKAE-ETTQLDE--RHIATDVGRG-IQPP-YN----PETL---AAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADE 68 (602) Q Consensus 1 ~~k~~-~~~~~~~--~~~~~~~~~~-i~p~-~~----~~~l---~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~ 68 (602) +.|-. .-..+.. +++... ... -.+. +. .... .++ ..++...+|+..+..+.|-|+.+... 
T Consensus 53 i~~~~~~~~r~~~l~~YY~g~-~~I~~~~~~~~~~~~~~~~~~~~ri--~~n~~k~Ivd~~~~yl~G~p~~~~~~----- 124 (492) T protein:vir:94 53 IKQHLEKLPEISIGQEYYEQR-PDIVKEPKPVDATGAVDPLKPDDRM--ITNFHANLVDQKVSYIVGKPIAFKHT----- 124 (492) T ss_pred HHHHHHHHHHHHHHHHHhccc-ccccccccccccccccccccccccc--ccchHHHHHHHHHhhhcccCceeccC----- Confidence 00000 0000000 001000 000 0010 00 0000 011 24677888999998888888766321 Q ss_pred cccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccc Q lcl|NC_021537. 69 PDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTT 148 (602) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~ 148 (602) +.+..+.+..++. .........+..+.+.+|.+|..+-.+.+|++ .+..++|..+.+..+. T Consensus 125 ---d~~~~~~l~~~~~---------------n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~~~~~~p~~~~~v~d~ 185 (492) T protein:vir:94 125 ---DDEVVKRIDEVLG---------------NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTD 185 (492) T ss_pred ---chHHHHHHHHHHh---------------ccHHHHHHHHHHHHhhCCeEEEEEEecCCCce-EEEEEcccceEEEEcC Confidence 1112222222111 12455667788899999999999988888875 5778899887654321 Q ss_pred cccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecC----------ceeEEechhHEEEe Q lcl|NC_021537. 149 TTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDA----------GELKNGPANELIFL 218 (602) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~----------~~~~~~~~~eviH~ 218 (602) .... ...-+..|+........+.+.....+ .++ ...+...... 
.....+..=.|+++ T Consensus 186 ~~~~--------~~~a~ir~~~~~~~~~~~~y~~~~v~----~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 252 (492) T protein:vir:94 186 KEHE--------ELEAFIRMYKLENETKVEYWDKVTVN----YYV-YENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPF 252 (492) T ss_pred CCCC--------ceEEEEEEEeeccceeEEEEecCeEE----EEE-EecCeeeeccccccccccccccccCCCccceEEe Confidence 1100 00011112211111111111000000 000 0000000000 00011111224555 Q ss_pred cCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceec Q lcl|NC_021537. 219 PNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEV 298 (602) Q Consensus 219 r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~ 298 (602) +.. ..|+|.++.....++....+..-.++.+...+.|-.+++ |. +.+....+...+. ..+++.+ T Consensus 253 ~nn-----~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--g~--~~~~~~~~~~~~~-------~~~~~~~ 316 (492) T protein:vir:94 253 KNN-----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLK--NY--DDQELPEFKRLLR-------YYGAIKV 316 (492) T ss_pred cCC-----CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cC--CcccchhhHHHHh-------hccceec Confidence 432 368898988877777777666666667777777766654 31 1111111211111 1122223 Q ss_pred cCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH----------- Q lcl|NC_021537. 299 EEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ----------- 367 (602) Q Consensus 299 ~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~----------- 367 (602) +.+.+ .++ ++. +..+..+....+...+.|+..-++|..-.+..+ ++ .+.++. 
T Consensus 317 ~~~~~------------~~~--l~~-~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n-~Sg~Al~~~~~~l~~k~ 379 (492) T protein:vir:94 317 SDNGG------------VDT--IQV-EVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SA-PSGVALEFLYTNLNLKA 379 (492) T ss_pred CCCCc------------cee--Eec-cCCHHHHHHHHHHHHHHHHHHhCCcCCCccccc-cC-chHHHHHHHHHHHHHHH Confidence 22221 122 110 111234566777778888888888753332111 22 122221 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCC Q lcl|NC_021537. 368 --TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLA 445 (602) Q Consensus 368 --~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~ 445 (602) ....+...|+-+++.+...++. .....++.+.|+..-.. +....++++.++ .|+++..-++++++.- T Consensus 380 ~~k~~~f~~~l~~~~~li~~~~~~-----~~~~~~i~v~f~~~~p~----~~~e~~~~~~kl--~giiS~et~~~~l~~v 448 (492) T protein:vir:94 380 DKLARKAKVAIQELLWFVFEHFDI-----KGEHKDVDISFNYNKVA----NTELQVQTAQQS--MGIVSHETVLENHPFV 448 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcC-----CcccceeeEEecCCCCC----CHHHHHHHHHHH--hccCchHHHHHhCCCC Confidence 1112223344444444333332 12234456666544332 233345666666 4899988888888753 Q ss_pred CCCCCccccccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 446 PFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 446 p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) +-+..+.++...... ....+.+.....+..+.... 
...++...| T Consensus 449 ~d~~~E~eri~~E~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~e~e 492 (492) T protein:vir:94 449 EDLQAELERIEQEQM-EYNKQLPNLDDGGADSAQQQ-ERSNNKESE 492 (492) T ss_pred CCHHHHHHHHHHHHH-HHHhhccccccccCCCCccc-cCCccccCC Confidence 211111111110000 00000000000000000000 001111111 No 191 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=98.30 E-value=4.6e-07 Score=55.35 Aligned_cols=511 Identities=13% Similarity=0.076 Sum_probs=211.3 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHH------HHH--------HHHhhhHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPE------TLA--------AFQELNETHQACIRKKSRYEAGYGFEIVAHPSA 66 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~------~l~--------~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~ 66 (602) =.|.... ++.-+-...-++|...+. .++ .+++.-+-++-.|.-+++.++.+-+....-+.+ T Consensus 11 rpk~~p~-----~r~l~aasqp~~P~~~~~~~~~g~~~~~~WQ~eAW~~~d~VgElryyvgW~~ss~Sr~rL~as~idpD 85 (629) T protein:vir:10 11 RPKGSPA-----RRSLTAASQPMEPGRTPSRQVAGTVVRTSWQNEAWECMDLVGELRYYVGWRASSCSRVELIASELDPD 85 (629) T ss_pred cCCCccc-----eeeeccccCCCCcchhhchhhhhhhhhhhhhHHHHHHHHhhhhHHHHhhhhhhhheeeeEEEeeecCC Confidence 0011100 111111111122222220 011 122223556666777888888887765544433 Q ss_pred CCc-cc----chhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCc----eE-EEEE Q lcl|NC_021537. 67 DEP-DE----GGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGT----PV-GLAH 136 (602) Q Consensus 67 ~~~-~~----~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~----~~-~L~~ 136 (602) .+. .. +...-..+...+.. + ....+-..++++.+..++-+-|..|+.++--..+. +. .++. 
T Consensus 86 tg~ptg~i~ed~p~~~~v~~~v~~------i---agG~lGqaqLlkr~~~~ltV~GE~~i~il~~~~~~pd~~~r~~W~v 156 (629) T protein:vir:10 86 TGKPTGGIRDDDPDGLRFLEIVKT------M---AGGPLGQAQLQKRAAECLTVPGEHRICLLDQGDKNPDGSVRHNWYV 156 (629) T ss_pred CCCCccccccCchhHHHHHHHHHH------h---cCccchHHHHHHHHHhheeccCceEEEEeecCCCCCCcccccceee Confidence 221 11 11111111111111 1 12345668899999999999999999887544442 22 2222 Q ss_pred eCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEE Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELI 216 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~evi 216 (602) |....|+. +|..+..+ ...+|....|.-+.=+ T Consensus 157 Vt~~Ei~~-------------------kg~g~~~i-----------------------------~lpdg~~he~~~~~D~ 188 (629) T protein:vir:10 157 VTNDEVKN-------------------KGAGKTDI-----------------------------ELPDGTIHEYSKGRDV 188 (629) T ss_pred ecHHHhcc-------------------ccCceeEE-----------------------------EcCCCceeeeeCCCee Confidence 23222221 01111111 0011111222111112 Q ss_pred Eec--CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecccc-C--------------------CH Q lcl|NC_021537. 217 FLP--NPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGT-L--------------------SE 273 (602) Q Consensus 217 H~r--~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~--------------------~~ 273 (602) -|| .+.|.....--||+.+++.++....-..+...+..+.-.+-.|+|.++... + +. T Consensus 189 l~RvW~P~Prr~~e~DSpvra~l~~lrEi~r~tk~i~~aakSRL~gnGvlflP~e~slp~~~ap~~~~~Pg~~~p~~~g~ 268 (629) T protein:vir:10 189 MFRVWNPRPRRAKEPDSPVRACLDSLREIIRTTKKIRNASKSRLIGNGVVFLPQELSLPRATAPVADNQPGAPVPIVDGV 268 (629) T ss_pred EEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHhHHhhCceeEeccCcccccccCCCCCCCCcccccccCCC Confidence 233 334445557789999998877666555555555555555556665544211 0 11 Q ss_pred HHHHHHHHHHHH-----hhc--ccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHh Q lcl|NC_021537. 
274 DSKEDLRNLMDN-----LKG--SRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVH 346 (602) Q Consensus 274 ~~~~~l~~~~~~-----~~g--~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~f 346 (602) ...+.|...+-+ +.. ...+...++.. .++ +.--+++.|...+..+.--+.+++..+..+|+.+ T Consensus 269 aa~d~l~~~l~q~a~aAi~De~S~aA~vPiia~--------vP~--E~l~~ikhLkf~~eite~~iktR~daI~RlAmgl 338 (629) T protein:vir:10 269 AAADELSNLLFQTAAAAVDDEDSQAALIPLLAT--------VPG--EHLQKIFHLKIGNEITEVEIKTRNDAIARLAMGL 338 (629) T ss_pred cchHHHHHHHHHHHHhhhcCCCCccceeeeEEe--------ech--HHhcCeeeeeecCchhHHHHhhHHHHHHHHHhcc Confidence 233444444422 111 11222222211 111 1222556666667777778899999999999999 Q ss_pred cCChHH-hhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCcc-----ccccceEEEeccchhcchhHHHHHH Q lcl|NC_021537. 347 GVPPVL-INVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDA-----LDVDEWTIDFELRGAEQPEQDAKMA 420 (602) Q Consensus 347 gVPp~~-lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~-----~~~~~~~~~f~~~~~~~~~~d~~~~ 420 (602) -|||+. ||..+++|-=++=+....=++-.|.|.+..|+++|++.+|... -+..+|-+-||.+.+.- |.... T Consensus 339 DispErLLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~Ait~~~Lrp~L~~eGiDp~~Yvvw~DaS~Lt~---dPd~~ 415 (629) T protein:vir:10 339 DVSPERLLGLGSNSNHWSAWQIGDEDVQLHIKPVMEVLCAAIYREVLVATLRAEGIDPDRYVLWYDASGLTV---DPDKT 415 (629) T ss_pred CCChhheeeccCCccceeeEEecccceeeecchHHHHHHHHHHhHHHHHHHHHhCCCHHHhEeeecCccccc---CCCCc Confidence 998775 5776566533322222223466799999999999988766432 23356788898887642 22222 Q ss_pred HHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccc------------cc----ccccccccc-cCCCcCcccccccccc Q lcl|NC_021537. 421 EQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTL------------SE----FEAEFGADA-SDGDAEAMLTRSKAAP 483 (602) Q Consensus 421 ~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~------------~~----~~~~~~~~~-~~~~~~~~~~~~~~~~ 483 (602) .+++ .+.+.|.||-...|+.+|+.--++.+-+.+. .+ ...+.+.+. 
+.-+.+ .+.....+ T Consensus 416 deA~-~a~drGaIt~eAlRr~lG~~~dd~y~~~t~~~~q~~A~~~v~~~P~Li~~~apll~~~l~~i~~P--~p~~a~~~ 492 (629) T protein:vir:10 416 DEAT-AAKEQGAITHEAYRRYLGLADEDGYDLETLEGAQAWARDAIVADPSLIKVLAPLLTDELAEIDWP--EPPAALPP 492 (629) T ss_pred HHHH-HHHHcCCccHHHHHHHhccccccCCCcCCcHHHHHHHHHHhcCCCchhhhhhhhcCCcccccccc--CCCCcCCC Confidence 2333 4788999999999999998543321101100 00 111111010 111111 11111111 Q ss_pred cccccccccccccccc--cccchhhhhcchhh----------hhhheecccccEEEEEEecccCCcceeeeccCCHHHHH Q lcl|NC_021537. 484 PLENKIGERDSVDVDV--SKDPIEQTTFSSSN----------LDEGLYDFGERELYLSFKRESGQNSLYVYVDVPAAVWS 551 (602) Q Consensus 484 ~~~~~~~~~~~~~~~~--~~~~m~~~~v~ss~----------~~~~~yd~~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~ 551 (602) ..+.+..+++...... .++.-+.-+..+.. +.....-++.+-+.. ...-.-++|.+||++.|- T Consensus 493 ~~~~~~~~E~~~~~~e~~~e~dA~~a~~~~~~aa~~~A~rllv~RALelAGkRl~~~-----rdR~~~ar~~~vp~he~h 567 (629) T protein:vir:10 493 GEDDQADEEQDTTGSEPSTEDDAEAAARISSVADMVLAERLLTVRALGLAGKRRVNT-----NDRAQKARLAGIAPHDYH 567 (629) T ss_pred CCcccCccccCCCCCCcCCCcchhhcccCCchhhHHHHHHHHHHHHHHHccccccCC-----CchhhHHHhhcCChhhce Confidence 1111111111100000 00000111111111 111111111111111 001123666666665432 Q ss_pred HHh-CCCccchhhhhhhcccccccccccc-hhcccCCCCCChhhcCCcccc---cC Q lcl|NC_021537. 552 ALV-SAPSAGSYHYSEIRLQYGYLEVTNN-HERLPEGPTPDPGEAPEDVPS---DI 602 (602) Q Consensus 552 ~~~-~a~s~g~~~~~~i~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~---~~ 602 (602) --| ...+. -..+-|.+ .. .. .+.......-|++.++--+-. 
.+ T Consensus 568 ~~l~Pv~~~--~v~rli~g-wd-----~~l~~~~~a~lg~D~~~~~~~~sav~~~v 615 (629) T protein:vir:10 568 RVMGPVADA--DIPRLIAG-WD-----EGLEEEALALLGVDSRRTEALRSAVRAQI 615 (629) T ss_pred eecCCCChh--HHHHHHHh-hh-----hHHHHHHHHHhCCChhhhHHHHHHHHHHH Confidence 110 00000 00000000 00 00 000000011111111100000 00 No 192 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=98.30 E-value=1.5e-06 Score=52.44 Aligned_cols=406 Identities=11% Similarity=0.014 Sum_probs=166.1 Q ss_pred CCCCcccccccchhhhccc-Cc-ccc--C-------CCCH-HHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCC Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDV-GR-GIQ--P-------PYNP-ETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADE 68 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~-~~-~i~--p-------~~~~-~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~ 68 (602) +.+-. +.....+....-+ |. -|. + ..++ ..-.+++ .++...+|+..+..+.+-|..+... T Consensus 35 i~~~~-~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~--~n~~~~Ivd~~~~~l~g~p~~~~~~----- 106 (474) T protein:vir:96 35 INDHK-PKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRMF--TNYHQNLVDQKVAYAVANPVTFSSD----- 106 (474) T ss_pred HHHHH-HHHHHHHHHHHHhccCCcchhccchhcccccccccccchhcc--cchHHHHHHhhhhhhcccCceeecC----- Confidence 11100 0000111110000 00 000 0 0000 0011222 4677889999999999988876421 Q ss_pred cccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccc Q lcl|NC_021537. 69 PDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTT 148 (602) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~ 148 (602) +.+..+.+..++. ..+......+..+...+|.+|+.+-++.+|++ .+..++|..+-+.-+. 
T Consensus 107 ---d~~~~~~l~~~~~---------------n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~-~i~~~~p~~~~~v~d~ 167 (474) T protein:vir:96 107 ---DDKSLKTIQEVLN---------------HKWDDKLVDILTAASNKGIEWLQPYIDENGEF-KTFRVPAEQAIPIWTN 167 (474) T ss_pred ---chHHHHHHHHHHh---------------cCHHHHHHHHHHHHHhcCeeEEEEEecCCCce-EEEEEcccceEEEEcC Confidence 1122233333221 12344556677889999999998888888875 5888999888755332 Q ss_pred ccc-ccccchhhhhcccCceeEEEEcCCcceeecc--cccccccceee-ecccceEEecCceeEEechhHEEEecCCCCC Q lcl|NC_021537. 149 TTI-EREDGEEVENIESGHGYVQVRQGRRRYFGEA--GDRYGDDKRFV-DKETGEVASDAGELKNGPANELIFLPNPSPL 224 (602) Q Consensus 149 ~~~-~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~--~~~~~~~~~~~-~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~ 224 (602) ... ...-.... ....+..++.+......+.+.. +.......... ....+.. .......+..=.|+||++. T Consensus 168 ~~~~~~~~~vr~-~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~iPvv~~~nn--- 241 (474) T protein:vir:96 168 KERDTLKAFIRY-YRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYY--VGNKRVSWGRVPFIPFKNN--- 241 (474) T ss_pred CCCCceEEEEEE-EeecCceEEEEEeCCeEEEEEecCCceeecccccccccccccc--ccccccCCCceeEEEeccC--- Confidence 111 00000000 0011111221111111111111 00000000000 0000000 0111122223346777653 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceecc-CCcc Q lcl|NC_021537. 225 ALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVE-EFVD 303 (602) Q Consensus 225 ~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~-~g~~ 303 (602) ..|.|.+......++....+..-..+.+...+.|-.+++ |....+ ...+. ... 
...+++.++ .|.+ T Consensus 242 --~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~--g~~~~~--~~~~~---~~~----~~~~~i~~~~~~~~ 308 (474) T protein:vir:96 242 --PQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILK--GYEGQD--LDEFM---RNL----KYYKAINVDGDGSG 308 (474) T ss_pred --CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee--cCCccc--ccchh---hhh----hcCceEEecCCCCc Confidence 368898888877777766666666677777777755554 322111 11111 111 122333332 1211 Q ss_pred ceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------------HHH Q lcl|NC_021537. 304 DHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TRE 370 (602) Q Consensus 304 ~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~ 370 (602) +++ ++.. .....+...++...+.|+..-++|..-.+-. .++ .+..+. .+. T Consensus 309 ------------~~~--l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n-~Sg~Al~~~~~~l~~k~~~k~~ 371 (474) T protein:vir:96 309 ------------VDT--IQIE-VPVQSSKEYLDMLRDYVIEFGQGVDFQQDKF-GNS-PSGIALKFMYSNLDLKANKLKN 371 (474) T ss_pred ------------eeE--Eeec-CChHHHHHHHHHHHHHHHHHhCCcccccccc-ccc-cHHHHHHHHHHHHHHHHHHHHH Confidence 111 1111 1123456777888899999999986432211 122 122221 111 Q ss_pred HHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCC Q lcl|NC_021537. 371 FAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDD 450 (602) Q Consensus 371 f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g 450 (602) .+...|+-+++.|...+. .........+.|+..... .+ . +.++.+..+|+++...++++++. +++. 
T Consensus 372 ~~~~~l~~~~~~i~~~~~-----~~~~~~~i~i~f~~~~p~---~~-~---e~~~~~~~ag~iS~et~~~~~~~--v~d~ 437 (474) T protein:vir:96 372 KTLTALQELLQYIIDFYK-----LNIKVQDVEITFNFNVMV---NE-L---EQSQIGVQSQYLSKETVVTNHPW--VDDP 437 (474) T ss_pred HHHHHHHHHHHHHHHHhC-----CCcccceeeEEeccCCCc---CH-H---HHHHHHHhcCCCchHHHHHhCCC--CCCH Confidence 222333333333333221 111223445666544322 12 2 22344667899999999988754 3332 Q ss_pred ccc--ccccccccccccc--ccCCCcCcccccccccccccccccccccc Q lcl|NC_021537. 451 RGD--MTLSEFEAEFGAD--ASDGDAEAMLTRSKAAPPLENKIGERDSV 495 (602) Q Consensus 451 ~~d--~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (602) ..+ +..-.. ...... ...++..+...+ .++.+. T Consensus 438 ~~E~~ri~~E~-~e~~~~~~~~~~~~~~~~~d-----------~~~e~~ 474 (474) T protein:vir:96 438 VAELERIEQDN-IDFNKQLPPLEGDANGRAQD-----------NESETN 474 (474) T ss_pred HHHHHHHHHHH-HHHHhcccccccccccccCC-----------CcccCC Confidence 211 110000 000000 000111111000 111111 No 193 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=98.30 E-value=1.6e-06 Score=52.42 Aligned_cols=406 Identities=12% Similarity=0.053 Sum_probs=165.5 Q ss_pred CCCC-cccccccc-hhhhcccCcc-ccCC-CC----HHHH---HHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCc Q lcl|NC_021537. 1 MSKA-EETTQLDE-RHIATDVGRG-IQPP-YN----PETL---AAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEP 69 (602) Q Consensus 1 ~~k~-~~~~~~~~-~~~~~~~~~~-i~p~-~~----~~~l---~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~ 69 (602) +.+- .....+.. ..+...-... -.+. ++ .... .++ .+++...+|+..+..+.+-|+.+... 
T Consensus 53 i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri--~~n~~k~Ivd~~~~yl~g~p~~~~~~------ 124 (492) T protein:vir:97 53 IKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRM--ITNFHANLVDQKVSYIVGKPIAFKHT------ 124 (492) T ss_pred HHHHHHHHHHHHHHHHHhcccCcccccccccccccccccccccccc--ccchHHHHHHHHhhhhcccCceeccC------ Confidence 1110 00000000 0111000000 0110 00 0000 112 25788899999999999888776321 Q ss_pred ccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccc Q lcl|NC_021537. 70 DEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~ 149 (602) +.+..+.+..++. .........+..+.+.+|.||..+..+.+|++ .+..++|..+.+..+.. T Consensus 125 --d~~~~~~l~~~~~---------------n~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~-~~~~~~p~~~~~i~d~~ 186 (492) T protein:vir:97 125 --DDEVVKRIDEVLG---------------NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDK 186 (492) T ss_pred --chHHHHHHHHHHh---------------ccHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEcCC Confidence 1112222222211 12345666788899999999999999888875 57788998886653321 Q ss_pred ccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEec----------CceeEEechhHEEEec Q lcl|NC_021537. 150 TIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASD----------AGELKNGPANELIFLP 219 (602) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~----------~~~~~~~~~~eviH~r 219 (602) .. + ...-+..|+...+....+++.....+. .....+..... 
......+..=.|++|+ T Consensus 187 ~~----~----~~~~~vr~~~~~~~~~~~~y~~~~v~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 253 (492) T protein:vir:97 187 EH----E----ELEAFIRMYKLENETKVEYWDKVTVNY-----YVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK 253 (492) T ss_pred CC----C----ceEEEEEEEeeccceeEEEEecCeEEE-----EEEecCeeeecccccccccccccccCCCCCcceEEec Confidence 10 0 000111111111111111111000000 00000000000 0000112222356665 Q ss_pred CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceecc Q lcl|NC_021537. 220 NPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVE 299 (602) Q Consensus 220 ~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~ 299 (602) .. ..|+|.++.....++....+..-..+.+...+.|-.+++ |. +.+........+. ..+++.++ T Consensus 254 nn-----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~--g~--~~~~~~~~~~~~~-------~~~~~~~~ 317 (492) T protein:vir:97 254 NN-----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLK--NY--DDQELPEFKRLLR-------YYGAIKVS 317 (492) T ss_pred CC-----CCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeee--cC--CcccchhHHHHHh-------hccceecC Confidence 42 268888888877777766666666666677777766554 31 1111111111111 11222222 Q ss_pred CCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH------------ Q lcl|NC_021537. 300 EFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ------------ 367 (602) Q Consensus 300 ~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~------------ 367 (602) .+.+ .++ ++. +..+..+....+...+.|+..-++|....+..+ ++ .+.++. 
T Consensus 318 ~~~~------------~~~--l~~-~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~-~n-~Sg~Al~~~~~~l~~ka~ 380 (492) T protein:vir:97 318 DNGG------------VDT--IQV-EVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SA-PSGVALEFLYTNLNLKAD 380 (492) T ss_pred CCCc------------cee--Eec-cCCHHHHHHHHHHHHHHHHHHhCCCCCCccccc-cC-cHHHHHHHHHHHHHHHHH Confidence 2211 122 111 112345677778888889888888864332211 12 122221 Q ss_pred -HHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCC Q lcl|NC_021537. 368 -TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAP 446 (602) Q Consensus 368 -~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p 446 (602) ....+...|+.+++.+...++. ........+.|...... +....++++.++ .|+++..-+.++++.-. T Consensus 381 ~~~~~f~~~l~~~~~li~~~~~~-----~~~~~~i~v~f~~~~p~----~~~e~a~~~~kl--~G~iS~et~l~~l~~v~ 449 (492) T protein:vir:97 381 KLARKAKVAIQELLWFVFEHFDI-----KGEHKDVDISFNYNKVA----NTELQVQTAQQS--MGIVSHETVLENHPFVE 449 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHhcC-----CcccceeeEEecCCCCC----CHHHHHHHHHHH--hccCchHHHHHhCCCCC Confidence 1112233444444444433321 12234455666543322 233345666666 58999888888887522 Q ss_pred CCCCccccccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 447 FEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 447 ~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) -+..+.++.-.... ......+.....+....... 
...+++.++ T Consensus 450 d~~~Eleri~~E~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~~~e 492 (492) T protein:vir:97 450 DLQAELERIEQEQT-EYNKQLPNLDDGGADSAQQQ-ERSNNKESE 492 (492) T ss_pred CHHHHHHHHHHHHH-HHHHhhhccccCCCCCCccc-ccccccccC Confidence 11111111110000 00000000000000000000 000011111 No 194 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=98.28 E-value=1.7e-06 Score=52.20 Aligned_cols=411 Identities=10% Similarity=-0.041 Sum_probs=165.8 Q ss_pred CCCCcccccc-cchhh-hcccCc-cc-c-CCC---------CHHH-HHHHHhhhHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021537. 1 MSKAEETTQL-DERHI-ATDVGR-GI-Q-PPY---------NPET-LAAFQELNETHQACIRKKSRYEAGYGFEIVAHPS 65 (602) Q Consensus 1 ~~k~~~~~~~-~~~~~-~~~~~~-~i-~-p~~---------~~~~-l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~ 65 (602) |++--++... ..++. ....|. -| . |.. +... -.++ .+++...+|+..+..+.+-|..+... T Consensus 28 i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki--~~~~~~~Ivd~~~~~l~g~p~~~~~~-- 103 (479) T protein:vir:79 28 IEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKA--INNYHKLLVDQKVGYSVGNPIVFNAD-- 103 (479) T ss_pred HHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCccee--ecchHHHHHHHHHhhhhcCCceeccC-- Confidence 1111100000 00010 000010 00 0 000 0000 0011 25677888999999999888776421 Q ss_pred CCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccc Q lcl|NC_021537. 66 ADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVR 145 (602) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~ 145 (602) +....+.+..+. ...+......+..+.+.+|.+|..+..+.+|++ .+..++|..+.+. 
T Consensus 104 ------~~~~~~~~~~~~---------------~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v 161 (479) T protein:vir:79 104 ------DDNLTKLLNDLL---------------GEEFDDTITELYLNASNKGVEWLHPYINRKGEF-KYVIIPAEEAIPI 161 (479) T ss_pred ------CHHHHHHHHHHH---------------hcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEccceeEEE Confidence 111111111111 013566777788999999999999988888875 5788899888655 Q ss_pred cccccc-ccccchhhhhc--c--cCceeEEEEcCCcceeeccccccc-ccc-----eeeecccceEEecCceeEEechhH Q lcl|NC_021537. 146 KTTTTI-EREDGEEVENI--E--SGHGYVQVRQGRRRYFGEAGDRYG-DDK-----RFVDKETGEVASDAGELKNGPANE 214 (602) Q Consensus 146 ~~~~~~-~~~~~~~~~~~--~--~~~~~~qi~~~~~~~~~~~~~~~~-~~~-----~~~~~~~g~~~~~~~~~~~~~~~e 214 (602) .+.... ........+.. . ....++.+......+.+....... ... .........+.........+..=. T Consensus 162 ~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP 241 (479) T protein:vir:79 162 WDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVP 241 (479) T ss_pred EeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCccc Confidence 322110 00000000000 0 001112111111111111100000 000 000000000000001111222335 Q ss_pred EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCc Q lcl|NC_021537. 215 LIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTA 294 (602) Q Consensus 215 viH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~ 294 (602) |++++.. 
.+|.|.+......++....+..-..+.+...+.|-.+++--++...++....+ ..++ T Consensus 242 vv~~~nn-----~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~-----------~~~~ 305 (479) T protein:vir:79 242 FIPFKNN-----EKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFIDNI-----------RYYK 305 (479) T ss_pred EEEecCC-----CCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchhhh-----------hhcc Confidence 6777543 36888888877777766666555566667777776665421111122211111 1223 Q ss_pred ceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH------- Q lcl|NC_021537. 295 ILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ------- 367 (602) Q Consensus 295 ~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~------- 367 (602) ++.++++.+. ++. +. +..+..+....+...+.|...-++|..-.+. .++-| .++. T Consensus 306 ~i~~~~~~~~------------~~l--~~-~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~gn~S-g~Ai~~~~~~l 367 (479) T protein:vir:79 306 SIKVDGGGGV------------DKL--EI-NIPVEAKKELLDRLEKNIIIFGQGVNPESQN--TGDKS-GVALKFLYSLL 367 (479) T ss_pred ceecCCCCcc------------eEE--ec-cCCHHHHHHHHHHHHHHHHHHhCcccccccc--ccchh-HHHHHHHHHHH Confidence 3333333221 221 11 1123445677777888888888887643322 22222 2211 Q ss_pred ------HHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHH Q lcl|NC_021537. 368 ------TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREE 441 (602) Q Consensus 368 ------~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~ 441 (602) ....+...|+-+++.+...++..-.. .....+..+.|...-.. 
+.+..++.+.++ .|+++...+.++ T Consensus 368 ~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~-~~~~~~i~i~f~~~~p~----~~~~~a~~~~kl--~g~iS~et~l~~ 440 (479) T protein:vir:79 368 DLKCSKTEKKFKKAIRELLWFVCEYLKISGNK-SYDYKTVQITFNHSMII----NEAEKIDMAAKS--TGIVSDETIVSN 440 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-ccccccceEEeCCCCCc----CHHHHHHHHHHH--hccCcHHHHHHh Confidence 11222334444444444443322111 11223445566443222 333345666666 589998888888 Q ss_pred hCCCCCCCCccc--cccccccccccccccCCCcCcccccccccc Q lcl|NC_021537. 442 LDLAPFEDDRGD--MTLSEFEAEFGADASDGDAEAMLTRSKAAP 483 (602) Q Consensus 442 ~Gl~p~~~g~~d--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (602) ++. +++...+ +.--................+.. ..+. T Consensus 441 l~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~---~~e~ 479 (479) T protein:vir:79 441 HPW--VEDVNDELERLKKQEDTQKEYDDLIPNNQDGV---IDET 479 (479) T ss_pred CCC--CCCHHHHHHHHHHHHHHHHHHHhccCcccCCC---cCcC Confidence 765 2222111 11000000000000000000000 0000 No 195 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=98.19 E-value=2.9e-06 Score=50.91 Aligned_cols=432 Identities=9% Similarity=-0.031 Sum_probs=173.9 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHH---------------------------hhhHHHHHHHHHHHHhh Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQ---------------------------ELNETHQACIRKKSRYE 53 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~---------------------------~~~~~v~~cI~~ia~~i 53 (602) |.+.-+.+.+- .+...+.+..+++.+.++. 
..-.....+++.+|+-| T Consensus 14 ~~~~~~~~~~~------~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv 87 (522) T protein:vir:47 14 GRYYMQTSNLN------SILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTASKKIASLV 87 (522) T ss_pred HHHHhhcccch------hccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHHHHHhhhh Confidence 11111111100 0011111222222222111 11144566777777777 Q ss_pred ccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEE Q lcl|NC_021537. 54 AGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVG 133 (602) Q Consensus 54 a~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~ 133 (602) .+=+-.+.-. +....+.+...+. ...+...++..+...+..|.+++.+..+. |+ +. T Consensus 88 ~~e~~~i~v~--------d~~~~~~l~~~l~--------------~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~-~~ 143 (522) T protein:vir:47 88 YNEQATITTK--------NEILQKFLDDMLT--------------NDRFNKNFERYLESCLALGGLAMRPYIDG-DK-VR 143 (522) T ss_pred cCCcceeecC--------ChHHHHHHHHHHh--------------hcchHHHHHHHHHHhhccCCEEEEEEEcC-Cc-eE Confidence 6644443210 1111122222111 23456677778888888899999888874 33 56 Q ss_pred EEEeCcccccccccccccccccchh---hhhcccCceeEEEE---cCCcc-----eeecccccccccceeee----cccc Q lcl|NC_021537. 134 LAHVPAATVRVRKTTTTIEREDGEE---VENIESGHGYVQVR---QGRRR-----YFGEAGDRYGDDKRFVD----KETG 198 (602) Q Consensus 134 L~~l~p~~v~~~~~~~~~~~~~~~~---~~~~~~~~~~~qi~---~~~~~-----~~~~~~~~~~~~~~~~~----~~~g 198 (602) +.++++..+-|.........+--.. .........|+... .+... .+...+..+........ 
...| T Consensus 144 i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG 223 (522) T protein:vir:47 144 VAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLG 223 (522) T ss_pred EEEEcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccC Confidence 7888888776642221111000000 00000111111100 00000 00000000000000000 0000 Q ss_pred eE--------EecCceeEEe---chhHEEEecCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCce- Q lcl|NC_021537. 199 EV--------ASDAGELKNG---PANELIFLPNPSP----LALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHY- 262 (602) Q Consensus 199 ~~--------~~~~~~~~~~---~~~eviH~r~~~~----~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~g- 262 (602) .- +........+ +.--..||+.+.+ .+.++|+|.+..+...++.....-.-...-|+.|...-. T Consensus 224 ~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v 303 (522) T protein:vir:47 224 QRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIV 303 (522) T ss_pred ccccccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeec Confidence 00 0000000111 1122446765422 135689999999998887766655555555666554211 Q ss_pred ---EEEeccccCCHHHHHHHHHHHHHhhcccccCcceecc-CCccceeccccccccccccccccccchHHHHHHHHHHhh Q lcl|NC_021537. 263 ---AVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVE-EFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERN 338 (602) Q Consensus 263 ---il~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~-~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~ 338 (602) +++......+ +.. .....+. ...-+..+.... ..+-.++.++ ..-.+-++....+.. 
T Consensus 304 ~~~~l~~~~~~~~----------------g~~-~~~~~fd~~~~~f~~~~~~~-~~~~~i~~~~-~~ir~e~~~~~~~~~ 364 (522) T protein:vir:47 304 PEHLTQRQYQRPD----------------GTI-DFRPRFDVEQNVYMQIGGSS-MDAGGITDLT-SPIRANDYILAISEG 364 (522) T ss_pred chHHhccCCCCCC----------------ccc-ccccccCcccceEeecCCCC-CCCCcceeec-cccChHHHHHHHHHH Confidence 1221110000 000 0000000 000111111111 1111233332 223566888889999 Q ss_pred HHHHHHHhcCChHHhhccccCCccCHHHH-------------HHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEe Q lcl|NC_021537. 339 EHEIAKVHGVPPVLINVTSTSNRANSKEQ-------------TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDF 405 (602) Q Consensus 339 ~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f 405 (602) .+.|+...|+++..+|+..++ -.|+.+. .+..++.+|..++..+....+..-+........+.+.+ T Consensus 365 l~~i~~~~gls~~tf~~~~~~-~kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v 443 (522) T protein:vir:47 365 LKLFEMQIGVSSGMFTFDGQG-MKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISV 443 (522) T ss_pred HHHHHHHhCCCccccCccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEE Confidence 999999999999999876553 3344433 22334445555555554433321111112223455666 Q ss_pred ccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHh-CCCCCCCCccccccccccccc-cccccCCCcCcccccccccc Q lcl|NC_021537. 406 ELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREEL-DLAPFEDDRGDMTLSEFEAEF-GADASDGDAEAMLTRSKAAP 483 (602) Q Consensus 406 ~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~-Gl~p~~~g~~d~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 483 (602) ++++-...+ .+...+...+++.+|+|+.-+++.+. |+ .+++...-+....... ...+...+..+.. .+ T Consensus 444 ~f~D~i~~D--~~~~~~~~~~~v~aG~~s~e~~i~~~~g~---~eeea~~el~ri~~E~~~~~~~~~~~~~~~-----~~ 513 (522) T protein:vir:47 444 NLDDGVFTD--RHAELDYWAKMVAAGFSTKKRAIGKTLNI---SGVEAEKELNAINSELLPMNDAELAIYGMH-----DQ 513 (522) T ss_pred EcCCCCCCC--HHHHHHHHHHHHhcCCCCHHHHHHhcCCC---ChHHHHHHHHHHHHhhccCCCCCCCCCCCC-----Cc Confidence 666554443 33444567788999999999988764 43 3332222111111100 0001111111111 11 Q ss_pred ccccccccc Q lcl|NC_021537. 
484 PLENKIGER 492 (602) Q Consensus 484 ~~~~~~~~~ 492 (602) +.+..+.+. T Consensus 514 ~~~~~d~~~ 522 (522) T protein:vir:47 514 NEEKADDKG 522 (522) T ss_pred ccccCCCCC Confidence 111111111 No 196 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.16 E-value=3.4e-06 Score=50.59 Aligned_cols=429 Identities=11% Similarity=0.035 Sum_probs=171.2 Q ss_pred CCCCcccccccc-----------------hhhhcccCcccc-CCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021537. 1 MSKAEETTQLDE-----------------RHIATDVGRGIQ-PPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVA 62 (602) Q Consensus 1 ~~k~~~~~~~~~-----------------~~~~~~~~~~i~-p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~ 62 (602) +.|+ -++..+ +.+...-..++. .+.+...-.+....-...+.+++..|+-+.+=+..+.- T Consensus 20 ~~~~--~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i~~~~A~lv~~e~~~i~v 97 (508) T protein:vir:15 20 VTGS--LSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTAARRIASVVFNEKAEIHV 97 (508) T ss_pred cccc--hHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHHHHHHHhhhhCCCceEEe Confidence 1111 010000 011100001110 00111111111111245567778888877765544432 Q ss_pred ecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccc Q lcl|NC_021537. 63 HPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATV 142 (602) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v 142 (602) .++ ....+.+...+. ...+...++..+.+.+..|.+++.+..+.. 
-+.+.+++|..+ T Consensus 98 ~~~-------~~~~e~l~~il~--------------~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~--~~~i~~v~ad~~ 154 (508) T protein:vir:15 98 KDN-------NEADKFLNDVLE--------------DNDFKNKFEEALEKGVALGGFAMRPYIDGN--HIKIAWVRADQF 154 (508) T ss_pred CCc-------hHHHHHHHHHHH--------------hccHHHHHHHHHHHHhhcCceEEEEEEeCC--eeEEEEEcCCee Confidence 111 111112222111 123456667778888999999998888743 357888988887 Q ss_pred cccccccccccccchhhhhc----ccCceeEE---EEc----CCcceeec-ccccccccc-eeeecccceEEecCceeEE Q lcl|NC_021537. 143 RVRKTTTTIEREDGEEVENI----ESGHGYVQ---VRQ----GRRRYFGE-AGDRYGDDK-RFVDKETGEVASDAGELKN 209 (602) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~----~~~~~~~q---i~~----~~~~~~~~-~~~~~~~~~-~~~~~~~g~~~~~~~~~~~ 209 (602) -+......... ........ .....|+. ... +....-+. |........ .-+...+..-+........ T Consensus 155 ~P~~~d~~~~~-~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~ 233 (508) T protein:vir:15 155 YPLQSNTNDIS-EAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVT 233 (508) T ss_pred EEEEEcCCCeE-EEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceE Confidence 65322222110 00000000 00111111 000 00000000 000000000 0000000000000000001 Q ss_pred ---echhHEEEecCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHH Q lcl|NC_021537. 
210 ---GPANELIFLPNPSP----LALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNL 282 (602) Q Consensus 210 ---~~~~eviH~r~~~~----~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~ 282 (602) ++---..||+.+-+ .+.++|+|.+..+...++.....-.....-|+.| .+..++ +...+..+ T Consensus 234 ~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~-~~~i~v--~~~~l~~d-------- 302 (508) T protein:vir:15 234 ISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLG-QKHIAV--QPGMLRFD-------- 302 (508) T ss_pred ecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhc-ccceee--chHHhcCC-------- Confidence 11122456664322 1356899999999988887766666566666544 333333 22111100 Q ss_pred HHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCcc Q lcl|NC_021537. 283 MDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRA 362 (602) Q Consensus 283 ~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s 362 (602) .+ +...+-....-+..+.. ..+.+..++.++ ..-.+-++.+..+...+.|....|++|..+|+..++. . T Consensus 303 -------~~-~~~~~~~~~~~~~~~~~-~~~~~~~i~~~~-~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~-~ 371 (508) T protein:vir:15 303 -------DE-HKPTFDTEQNVYVGVLS-DDNNGLGVKDMT-TPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGV-K 371 (508) T ss_pred -------CC-CccccCCCCeeEEeccC-CCCCCCceeEee-cccChHHHHHHHHHHHHHHHHHhCCCchhcccccCcc-c Confidence 00 00111111111111111 111222233332 1224557888899999999999999999998765543 3 Q ss_pred CHHHH-------------HHHHHHHHHHHHHHHHHHHHhhh-cCCc-------cccccceEEEeccchhcchhHHHHHHH Q lcl|NC_021537. 363 NSKEQ-------------TREFAKGIIEPEQAKFSARLYKI-IHQD-------ALDVDEWTIDFELRGAEQPEQDAKMAE 421 (602) Q Consensus 363 n~e~~-------------~~~f~~~~l~P~~~~ie~~ln~~-Ll~~-------~~~~~~~~~~f~~~~~~~~~~d~~~~~ 421 (602) |+.+. .+..++.+|..++..|....+.. +... ......+.+.+++++-.-.+.++ .. 
T Consensus 372 TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~--~~ 449 (508) T protein:vir:15 372 TATEVVSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDK--QL 449 (508) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHH--HH Confidence 44332 11223334444444433332211 1111 01122345556666554443333 34 Q ss_pred HHHHHHHhCCcccHHHHHHHh-CCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 422 QRVRAMRLAGVGTVNEAREEL-DLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 422 ~~~~~~~~~G~~T~NE~R~~~-Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) +...+++.+|+|+..+++... |+ .+++.+..+.... .+.......+......+ ....| T Consensus 450 ~~~~~~v~aGi~s~e~~i~~~~g~---~deea~~el~ri~----~E~~~~~~~~~~~~~~~-----g~~ge 508 (508) T protein:vir:15 450 EEDAKVLAIGALSKQTFLQRNYGM---TDEQAAEELAKIQ----SEAPTDTFEGGRSAILN-----GGDGE 508 (508) T ss_pred HHHHHHHhcCCCCHHHHHHhcCCC---ChHHHHHHHHHHH----HhccccCccccccccCC-----CCCCC Confidence 556788999999999988654 43 3332221111110 00000000000000000 00011 No 197 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=98.14 E-value=2.5e-07 Score=56.79 Aligned_cols=186 Identities=17% Similarity=0.169 Sum_probs=83.3 Q ss_pred EEEecc--ccC--CHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhh Q lcl|NC_021537. 263 AVKVTG--GTL--SEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERN 338 (602) Q Consensus 263 il~~~~--~~~--~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~ 338 (602) |+++++ ..+ ++.+....-+.+...++.. +.+.+...+-++..... +|+ .+.+..... 
T Consensus 1 V~k~~~l~~~~~~~~~~~~~r~~~~~~~~~~~--~~~~ld~~~e~~e~~~~----------~ls-------Gl~d~l~~~ 61 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAARLRLAQVDNNSGVG--QAIGIDADSEEYNVLNS----------DIG-------GIDTFLSQK 61 (201) T ss_pred CccchHHHHHhcCChHHHHHHHHHHHHhhhhh--hhheeecCCcceeeeec----------CcC-------ChHHHHHHH Confidence 444332 011 1112211112223333321 22222222233333221 121 122445566 Q ss_pred HHHHHHHhcCChHHh-hccccCCccCHHHHHHHHHHH-------HHHHHHHHHHHHHhhhcCCccccccceEEEeccchh Q lcl|NC_021537. 339 EHEIAKVHGVPPVLI-NVTSTSNRANSKEQTREFAKG-------IIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGA 410 (602) Q Consensus 339 ~~~Ia~~fgVPp~~l-g~~~~~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~ 410 (602) .+.||++-+||...| |..-.+=.++.+.-...||.. .|+|.++++-.. .....+|.|+|+--.. T Consensus 62 ~~~iaa~s~iP~t~LfG~sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~~--------~~~~~~~~~~f~pL~~ 133 (201) T protein:vir:10 62 FDRIVALSGIHEIILKGKNVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLPF--------IVTEQEWSVEFNPLSQ 133 (201) T ss_pred HHHHHhHhcCchhhhcCCCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHh--------hcCCCCceEeeCCCCC Confidence 789999999997766 543333234556566666653 345555443332 2233578888875433 Q ss_pred cchhHHH---HHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccc Q lcl|NC_021537. 411 EQPEQDA---KMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLEN 487 (602) Q Consensus 411 ~~~~~d~---~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (602) +.-++.+ ++.++++++++.+|+++++|+|+.+--.+..+..++-. +.... 
.......+.+.+.++ T Consensus 134 ~s~kekAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~-----~~~~~-------~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 134 VSDKDKSEILEKNVNSVAALIAAGIIDADEARDTLRAISTEVKIGEGS-----IQTEV-------VINESEDPLDVSANN 201 (201) T ss_pred CCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCcCCCCCCC-----CCccc-------cccccCCCCCCCCCC Confidence 3333322 34567888999999999999999875444332211100 00000 000000000001111 No 198 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=98.13 E-value=4e-06 Score=50.16 Aligned_cols=423 Identities=11% Similarity=0.019 Sum_probs=157.5 Q ss_pred CCCCccc--ccccc-hhhhcccCccccCCCCHH---HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEET--TQLDE-RHIATDVGRGIQPPYNPE---TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~--~~~~~-~~~~~~~~~~i~p~~~~~---~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) |.+=... ..+.. +.+...-......+.... .-+++. .++.+.+|+..+..+.+-|..+... +.. T Consensus 24 i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~--~n~~~~iv~~~~~~l~g~~~~~~~~--------d~~ 93 (489) T protein:vir:99 24 ISRFKAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIA--SDFAKYITVFEQGYMLGVPVEYKNE--------NKD 93 (489) T ss_pred HHHHHHHHHHHHHHHHHHhcccCccccccccccccCCcceee--cchHHHHHHHHhhhhccCCceeecC--------Chh Confidence 1110000 00000 001110011111111111 011232 4677888999998888888776421 111 Q ss_pred hHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEee----CCCCceEEEEEeCcccccccccccc Q lcl|NC_021537. 75 SYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILV----EGDGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r----~~~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) ..+.+..++. ...+..+...+..+.+++|.+|..+.. +.+|+ ..+..++|..+.+..+... 
T Consensus 94 ~~~~l~~~~~--------------~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~-~~i~~~~p~~~~~v~dd~~ 158 (489) T protein:vir:99 94 LQAAIDLMSV--------------RNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTE-VKLYQLPAEQTFVIYDDTY 158 (489) T ss_pred HHHHHHHHHh--------------hcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcc-eEEEEEcccceEEEEcCCC Confidence 1122222111 123456677888999999999987654 33333 5688888888765432111 Q ss_pred cccccchhhhhcccCceeEEEEcCCcceeecccccccccce--eeecc--cceEEecCceeEEechhHEEEecCCCCCCC Q lcl|NC_021537. 151 IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKR--FVDKE--TGEVASDAGELKNGPANELIFLPNPSPLAL 226 (602) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~--~~~~~--~g~~~~~~~~~~~~~~~eviH~r~~~~~~~ 226 (602) . . ...-+..++++..+... .......|-.+.. +.... ...+.........+..=.|+||++.. T Consensus 159 ~----~----~~~~~i~~~~~~~~~~~-~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~---- 225 (489) T protein:vir:99 159 Q----R----NSLMAVHFYDIDYGSGK-RKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPVNEYANNE---- 225 (489) T ss_pred C----C----ceEEEEEEEEEecCCCc-eEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeEEEeecCC---- Confidence 0 0 00111122222211110 0000111110000 00000 00000000001112222477887532 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHh-------hcccccCcceecc Q lcl|NC_021537. 227 YYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNL-------KGSRYRTAILEVE 299 (602) Q Consensus 227 ~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~-------~g~~nag~~~~~~ 299 (602) .|.|.+......++....+..-..+.....+.|-.++ .|.....+........+.-. ......++++.+. 
T Consensus 226 -~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i--~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (489) T protein:vir:99 226 -ERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVI--AGNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILD 302 (489) T ss_pred -CCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhh--ccCCcccccchhhhhhcccccccccccccccccceeeeec Confidence 4677666655555544444333333333344443333 23222222222222222110 0111223333333 Q ss_pred CCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHh-hccccCCccCHHHH----------- Q lcl|NC_021537. 300 EFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLI-NVTSTSNRANSKEQ----------- 367 (602) Q Consensus 300 ~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~sn~e~~----------- 367 (602) .+..... .+. +.+.|+... .+..+....+...+.|...-++|..-. +.. ++- +.++. T Consensus 303 ~~~~~~~-----~~~--~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~--~n~-Sg~Al~~~~~~l~~k~ 371 (489) T protein:vir:99 303 DNPNPNG-----VKP--QAYFLKKEY-DTAGSEAYKNRLVADILRFTFTPDTQDMKFS--GVQ-SGESMKYKLMASDNYR 371 (489) T ss_pred cccCccc-----ccc--ceeeeeecC-ChHHHHHHHHHHHHHHHHHhCCccccccccc--ccc-hHHHHHHHHHHHHHHH Confidence 3222111 111 222222111 123345666778888888888885322 221 222 22221 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhhcCC--ccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhC Q lcl|NC_021537. 368 --TREFAKGIIEPEQAKFSARLYKIIHQ--DALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELD 443 (602) Q Consensus 368 --~~~f~~~~l~P~~~~ie~~ln~~Ll~--~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~G 443 (602) ....+...|+-+++.+...++..-.. ......+..+.|+..-.. 
+....++++.++ .|+|+...+.++++ T Consensus 372 ~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~----d~~~~~~~~~kl--~giis~et~~~~l~ 445 (489) T protein:vir:99 372 EKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQ----NDNEIVTAAQNL--YGIVSDQTIFEILN 445 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCc----CHHHHHHHHHHH--hccCCHHHHHHhcC Confidence 11222334444444444443321111 011112345666543322 233345566666 48999988888874 Q ss_pred CCCCCCCccc----cccccccc--cccccccCCCcCcccccccccc Q lcl|NC_021537. 444 LAPFEDDRGD----MTLSEFEA--EFGADASDGDAEAMLTRSKAAP 483 (602) Q Consensus 444 l~p~~~g~~d----~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 483 (602) . +.+.+.. +....... ........++..+...+....| T Consensus 446 ~--v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 446 T--VTGVDAEAELKRLKEEADKKQSLPEPRLVGDASGQEEPTAEKP 489 (489) T ss_pred C--CCchhHHHHHHHHHHHHHHHhccccccccCCCCCCcCCCCCCC Confidence 3 2211111 11000000 0000000111111111111111 No 199 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=97.93 E-value=1e-05 Score=47.93 Aligned_cols=424 Identities=10% Similarity=-0.005 Sum_probs=171.1 Q ss_pred CCCC---cccccc-cc---hhhhcccCcc-ccCCCCHH---HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCc Q lcl|NC_021537. 1 MSKA---EETTQL-DE---RHIATDVGRG-IQPPYNPE---TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEP 69 (602) Q Consensus 1 ~~k~---~~~~~~-~~---~~~~~~~~~~-i~p~~~~~---~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~ 69 (602) |++. -.+.+. .. .++...-... ..+...+. .-.+++ .++...+|+..+..+.+-|+++... 
T Consensus 45 i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~--~n~~k~Iv~~~~~yl~g~p~~~~~~------ 116 (511) T protein:vir:93 45 VSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVA--HDYASYISDFINGYFLGNPIQYQDD------ 116 (511) T ss_pred HHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceee--cchHHHHHHHHhhhhcccCeeeccC------ Confidence 1111 001110 00 0111000000 11111110 011222 3677888999999898888776321 Q ss_pred ccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccc Q lcl|NC_021537. 70 DEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~ 149 (602) +.+..+.+..++. .-.+......+..+.+++|.||..+.++.+|++ .+..++|..+-+.-+.. T Consensus 117 --d~~~~~~l~~~~~--------------~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~-~i~~~~p~~~~~vydd~ 179 (511) T protein:vir:93 117 --DKDVLEVIEAFND--------------LNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNT 179 (511) T ss_pred --ChHHHHHHHHHHh--------------hcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCC Confidence 1111222222221 224567778889999999999999999888875 57788888886543221 Q ss_pred ccccccchhhhhcccCceeEEEEcCCc--ceeecccccccccceeeecccceEEecCc------------eeEEechhHE Q lcl|NC_021537. 150 TIEREDGEEVENIESGHGYVQVRQGRR--RYFGEAGDRYGDDKRFVDKETGEVASDAG------------ELKNGPANEL 215 (602) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~qi~~~~~--~~~~~~~~~~~~~~~~~~~~~g~~~~~~~------------~~~~~~~~ev 215 (602) .. + ...-+..|++...... .........|..+. 
..++...++ ....+..=.| T Consensus 180 ~~----~----~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~------i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 245 (511) T protein:vir:93 180 IE----R----NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG------VYRYLTSRTNGLKLTPRENGFESHSFERMPI 245 (511) T ss_pred CC----C----ceEEEEEEEEeeeccccccceEEEEEEEeCCc------EEEEEecCCCccccccccccccccCCCccce Confidence 10 0 0011112222111000 00000000000000 000101000 0011112236 Q ss_pred EEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcc Q lcl|NC_021537. 216 IFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAI 295 (602) Q Consensus 216 iH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~ 295 (602) ++|+.. ..|.|.++.+...++....+..-.++.+...+.|-.+++-. ...+.+.....++ +++ T Consensus 246 v~~~nn-----~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~-~~~~~~~~~~~~~-----------~~~ 308 (511) T protein:vir:93 246 TEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN-LNLDPVEVRKQKE-----------ANV 308 (511) T ss_pred EEecCC-----CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecC-cccCchhhccccc-----------ccc Confidence 666542 36888888887777776666555566666666665555421 1122222221111 111 Q ss_pred eeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------- Q lcl|NC_021537. 296 LEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------- 367 (602) Q Consensus 296 ~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------- 367 (602) +.+..+..... .......+.+++.|+... .+..+....+...+.|...-++|..-.+..+ +|-| ..+. 
T Consensus 309 ~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~-~n~S-g~Al~~~~~~l~ 384 (511) T protein:vir:93 309 LFLEPTVYADS-EGRETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQS-GEAMKYKLFGLE 384 (511) T ss_pred eeccccccccc-ccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCccccccccc-ccch-HHHHHHHHHHHH Confidence 11111110000 000111122333333111 2344567778888999999999865443222 2222 2211 Q ss_pred -----HHHHHHHHHHHHHHHHHHHHhhhcCCc-cccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHH Q lcl|NC_021537. 368 -----TREFAKGIIEPEQAKFSARLYKIIHQD-ALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREE 441 (602) Q Consensus 368 -----~~~f~~~~l~P~~~~ie~~ln~~Ll~~-~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~ 441 (602) ....+..+|+-.++.+...++..--.. ........+.|...-.. +....++.+.++ .|+++..-++++ T Consensus 385 ~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~----n~~e~~~~~~kl--~g~iS~et~~~~ 458 (511) T protein:vir:93 385 QRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPK----SLIEELKAYIDS--GGKISQTTLMSL 458 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCCCC----CHHHHHHHHHHH--hccCchHHHHHh Confidence 112233444444444444443321111 11223345666433222 233345666666 589998888888 Q ss_pred hCCCCCCCCcc--cccccccccccc--ccccCCCcCccccccccccccccccccc Q lcl|NC_021537. 442 LDLAPFEDDRG--DMTLSEFEAEFG--ADASDGDAEAMLTRSKAAPPLENKIGER 492 (602) Q Consensus 442 ~Gl~p~~~g~~--d~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (602) +++- ++... ++.--....... .........+.......+...+....+. 
T Consensus 459 l~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 459 FSFF--QDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCC--CCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcccccccccC Confidence 7652 22221 111100000000 0000001111110111111111111111 No 200 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=97.93 E-value=1e-05 Score=47.91 Aligned_cols=415 Identities=13% Similarity=0.013 Sum_probs=160.2 Q ss_pred CCCCcccc-------------cc-cchhhhcccCc---cccCCCC-------HHHH--HHHHhhhHHHHHHHHHHHHhhc Q lcl|NC_021537. 1 MSKAEETT-------------QL-DERHIATDVGR---GIQPPYN-------PETL--AAFQELNETHQACIRKKSRYEA 54 (602) Q Consensus 1 ~~k~~~~~-------------~~-~~~~~~~~~~~---~i~p~~~-------~~~l--~~~~~~~~~v~~cI~~ia~~ia 54 (602) +....++. .+ .......-+.+ ...-+.. .... .++ .+++...+|+..+..+. T Consensus 20 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki--~~n~~k~ivd~~~~yl~ 97 (478) T protein:vir:10 20 IKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRM--YTNYHQNLVDQKVAYAV 97 (478) T ss_pred hhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhccccccccccccee--ccchHHHHHHHHhhhhc Confidence 11111000 00 00000000000 0000000 0000 011 24678889999999999 Q ss_pred cCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEE Q lcl|NC_021537. 55 GYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGL 134 (602) Q Consensus 55 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L 134 (602) +-|..+.. + +.+..+.+..++. 
..+......+.++...+|.+|+.+-.+.+|++ .+ T Consensus 98 g~p~~~~~--~------~~~~~~~l~~~~~---------------n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~~ 153 (478) T protein:vir:10 98 ANPVTFGV--D------NDKALKQIQHTLN---------------HKWDDKLVDILTAASNKGIEWVQPYVDEEGEF-KT 153 (478) T ss_pred ccCceeec--C------ChHHHHHHHHHHh---------------ccHHHHHHHHHHHHhhCCeEEEEEEecCCCce-EE Confidence 98887632 1 1112222222111 13556667778899999999999888888875 67 Q ss_pred EEeCccccccccccc-ccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccc--eEEecCceeEEec Q lcl|NC_021537. 135 AHVPAATVRVRKTTT-TIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETG--EVASDAGELKNGP 211 (602) Q Consensus 135 ~~l~p~~v~~~~~~~-~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~~~~~~~~~~~~ 211 (602) ..++|..+.+..+.. .....-....+ ...+..++.+......+.+......... ..+...++ .+...++....+. T Consensus 154 ~~~~p~~~~~v~d~~~~~~~~~~ir~~-~~~~~~~~~~y~~~~i~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g 231 (478) T protein:vir:10 154 FRVPAEQAVPIWTNKERDELQAFIRVY-ELDGAERVEYWTKDDVTFYELKEGQLIP-DFYRSEDHIQPHYYQGNKLMSWG 231 (478) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEE-eeeCceEEEEEeCCcEEEEEecCCeeec-cccccccccccceecccccccCC Confidence 788888876543211 10000000000 0111111111111111111100000000 00000000 0000111112222 Q ss_pred hhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccc Q lcl|NC_021537. 212 ANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRY 291 (602) Q Consensus 212 ~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~n 291 (602) .=.|++++.. ..|.|.+......++....+..-..+.+...+.|-.+++ |....+ .......++. 
T Consensus 232 ~vPvv~~~n~-----~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~--g~~~~~--~~~~~~~~~~------ 296 (478) T protein:vir:10 232 RVPFIPFKNN-----PQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILK--GYEGED--MKDFMHNLKY------ 296 (478) T ss_pred cceEEEeccC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeee--cCCccc--ccchhhhhhh------ Confidence 3346777643 368888888777776666555555555555556644443 321111 1111111111 Q ss_pred cCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH---- Q lcl|NC_021537. 292 RTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ---- 367 (602) Q Consensus 292 ag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~---- 367 (602) .+++.+... ...+.++. +. +..+..+.+..+...+.|...-++|..-.+-. .++- +..+. T Consensus 297 -~~~~~~~~~----------~~~~~~~l--~~-~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~-Sg~Ai~~~~ 360 (478) T protein:vir:10 297 -YKAISVAGE----------SGSGVDTI--KV-EVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKF-GNSP-SGIALKFMY 360 (478) T ss_pred -CceeEecCC----------CCCcceEE--ee-cCCHHHHHHHHHHHHHHHHHHhCCcCcCcccc-ccch-HHHHHHHHH Confidence 112222110 01112221 11 11234567778888889999888885322211 1221 22211 Q ss_pred ---------HHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHH Q lcl|NC_021537. 368 ---------TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEA 438 (602) Q Consensus 368 ---------~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~ 438 (602) ....+..+|+-+++.+...++ ......+..+.|+..-.. +....++.+.++ .|+++...+ T Consensus 361 ~~l~~k~~~~~~~~~~~l~~~~~li~~~~~-----~~~d~~~i~i~f~~~~p~----~~~e~~~~~~~~--~g~iS~et~ 429 (478) T protein:vir:10 361 SNLDLKANKLKNKTLTALQELLQYIIDFYR-----LDVRVQDIEITFNFNVMV----NELENSQIAMNS--TGLLSKETI 429 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCcccccceEEeCCCCCC----CHHHHHHHHHHH--hCCCChHHH Confidence 111122233333333332222 122233456677544332 223344555554 689998778 Q ss_pred HHHhCCCCCCCCccc--cccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 
439 REELDLAPFEDDRGD--MTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 439 R~~~Gl~p~~~g~~d--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) .++++. +++...+ +.......... +.....+..++... ++.+....| T Consensus 430 i~~~~~--v~d~~~E~~ri~~E~~~~~~---~~~~~~~~~~d~~~-~~~~d~~~e 478 (478) T protein:vir:10 430 LGNHSW--VQDPVAEMERIEQENIELNQ---QLPDIEEGLNDEQQ-RQSEDNQSE 478 (478) T ss_pred HHhCCC--CCCHHHHHHHHHHHHHHHHH---hccccCCCCccccc-ccCcCCCCC Confidence 887764 2222111 11000000000 00000011111000 000111111 No 201 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=97.92 E-value=1.1e-05 Score=47.80 Aligned_cols=422 Identities=12% Similarity=0.012 Sum_probs=162.3 Q ss_pred CCCCccc----ccccchhhhcccCccccCCCCHHH--HHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEET----TQLDERHIATDVGRGIQPPYNPET--LAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~----~~~~~~~~~~~~~~~i~p~~~~~~--l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) +.|=... +.+. +++... -.....+..... -.+++ .++...+|+..+..+.+-|+.+... +.+ T Consensus 25 i~~~~~~~~~~~~l~-~Yy~g~-~~i~~~~~~~~~~~~~ki~--~n~~~~Iv~~~~~~l~g~p~~~~~~--------~~~ 92 (499) T protein:vir:10 25 IRELQNRKKRLDKLS-DYYNGK-QEIEKHEFDNATVEAANVM--VNHAKYITDMNVGFMTGNPVKYVAE--------KGK 92 (499) T ss_pred HHHHHHHHHHHHHHH-HHhccc-cchhcCCcCcCCCCcceee--cchHHHHHHHHhhhhcccCceeecC--------Chh Confidence 1111100 0110 111100 001111111110 11222 4577889999999999988776431 111 Q ss_pred hHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCce----------------EEEEEeC Q lcl|NC_021537. 75 SYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTP----------------VGLAHVP 138 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~----------------~~L~~l~ 138 (602) ..+.+..++. 
...+..+...+..+.+.+|.+|..+-.+.+|.+ ..+..++ T Consensus 93 ~~~~l~~~~~--------------~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~ 158 (499) T protein:vir:10 93 NIDDILEVFN--------------QIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVID 158 (499) T ss_pred HHHHHHHHHh--------------hcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEEc Confidence 1222222211 123556778888999999999999888887753 3466677 Q ss_pred cccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccc-cc-eeeecccce----EEecCceeEEech Q lcl|NC_021537. 139 AATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGD-DK-RFVDKETGE----VASDAGELKNGPA 212 (602) Q Consensus 139 p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~-~~-~~~~~~~g~----~~~~~~~~~~~~~ 212 (602) |..+-+..+...- ....-+..|++..+........+...|.. .. .+.....+. ..........+.. T Consensus 159 p~~~~~v~~d~~~--------~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 230 (499) T protein:vir:10 159 PRATVVVCDDTVE--------HDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGA 230 (499) T ss_pred ccceEEEecCCCC--------cceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCc Confidence 7665433221100 00001111222221110000000000000 00 000000000 0000000111222 Q ss_pred hHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhccccc Q lcl|NC_021537. 213 NELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYR 292 (602) Q Consensus 213 ~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~na 292 (602) =.|++|+.. ..|.|.+..+...++....+..-..+.+...+.|-.+++ |..++++. .... .+ +. 
T Consensus 231 vPvv~~~n~-----~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~--G~~~~~~~-~~~~-~~-------~~ 294 (499) T protein:vir:10 231 VPIIEFRNN-----EERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTF--GFGLGDDK-DDIQ-RL-------KR 294 (499) T ss_pred cceEEecCC-----CCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCcccccc-chhh-hh-------hh Confidence 246777643 357888887777777666655555566666666766654 33222211 1100 00 11 Q ss_pred Ccceec--cCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHH---- Q lcl|NC_021537. 293 TAILEV--EEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKE---- 366 (602) Q Consensus 293 g~~~~~--~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~---- 366 (602) +.+..+ +.+. +++.|+... ....+....+...+.|...-++|..-.+.. .++-|. .+ T Consensus 295 ~~~~~~~~~~~~--------------d~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~gn~Sg-~Al~~~ 357 (499) T protein:vir:10 295 GAIEAPPREEGA--------------DIEWLTKSF-DETQVNLLSQSIENDIHKISYVPNMNDEKF-MGNVSG-EAMKFK 357 (499) T ss_pred cceeccCCCCCC--------------cceEEeccC-CHHHHHHHHHHHHHHHHHHhCcccCCchhh-cccchH-HHHHHH Confidence 112211 1111 111222111 123455666777778877777774222111 122121 11 Q ss_pred ---------HHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHH Q lcl|NC_021537. 367 ---------QTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNE 437 (602) Q Consensus 367 ---------~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE 437 (602) .....+..+++-+++.+...++.. .......+..+.|...-.. 
+....++.+.++ .|+++..- T Consensus 358 ~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~--~~~~d~~~i~i~f~~~~p~----n~~e~~~~~~kl--~g~iS~et 429 (499) T protein:vir:10 358 LFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIK--GANDDASGCKISLVANIPS----NLSDVVNNVKNA--DGIIPRKY 429 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--CCccccccceEEeCCCCCC----CHHHHHHHHHHH--hccCChHH Confidence 112223334444444444443321 1111223446666544322 334455666666 68999988 Q ss_pred HHHHhCCCCCCCCc--cccccccc--------cccccccccCCCcCcccccccccccccccccccccccccccccchhhh Q lcl|NC_021537. 438 AREELDLAPFEDDR--GDMTLSEF--------EAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQT 507 (602) Q Consensus 438 ~R~~~Gl~p~~~g~--~d~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~ 507 (602) ++++++. +++.. .++.-... ....+..+..+..++...+. +..........++ +.... T Consensus 430 ~~~~l~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~-~~~~~----- 497 (499) T protein:vir:10 430 TYSWLPD--VDNPQDVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDS----SENDKEAGSNHNQ-SHRTR----- 497 (499) T ss_pred HHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCccc----CCCCCCCcccccc-CCCCC----- Confidence 8888765 22221 11111000 00001111111111110000 0000000000000 00100 Q ss_pred hc Q lcl|NC_021537. 508 TF 509 (602) Q Consensus 508 ~v 509 (602) .| T Consensus 498 ~~ 499 (499) T protein:vir:10 498 AV 499 (499) T ss_pred CC Confidence 01 No 202 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=97.89 E-value=1.3e-05 Score=47.43 Aligned_cols=424 Identities=10% Similarity=-0.003 Sum_probs=168.5 Q ss_pred CCC---Cccccc---cc-chhhhcccCccc-cCCCCHH---HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCc Q lcl|NC_021537. 1 MSK---AEETTQ---LD-ERHIATDVGRGI-QPPYNPE---TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEP 69 (602) Q Consensus 1 ~~k---~~~~~~---~~-~~~~~~~~~~~i-~p~~~~~---~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~ 69 (602) ++| .-.+.+ +. ..++...-...+ .+..... .-.++. 
.++...+|+..+..+.+-|+.+... T Consensus 45 i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~--~n~~k~Ivd~~~~yl~g~p~~~~~~------ 116 (512) T protein:vir:97 45 VSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVA--HDYASYISDFINGYFLGNPIQCQDD------ 116 (512) T ss_pred HHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceee--cchHHHHHHHHhhhhcccCceeccC------ Confidence 111 000000 00 001110000111 1111110 011222 3677888999999888888876421 Q ss_pred ccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccc Q lcl|NC_021537. 70 DEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~ 149 (602) +.+..+.+..++. .-.+......+..+.+++|.+|..+.++.+|++ .+..++|..+.+..+.. T Consensus 117 --d~~~~~~l~~~~~--------------~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~-~i~~~~p~~~~~iyd~~ 179 (512) T protein:vir:97 117 --DKDVLEAIEAFND--------------LNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNT 179 (512) T ss_pred --ChHHHHHHHHHHh--------------hcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEcccceEEEEcCC Confidence 1112222322221 224567778889999999999999999888875 57888998887653322 Q ss_pred ccccccchhhhhcccCceeEEEEcCCcc--eeecccccccccceeeecccceEEecCce------------eEEechhHE Q lcl|NC_021537. 150 TIEREDGEEVENIESGHGYVQVRQGRRR--YFGEAGDRYGDDKRFVDKETGEVASDAGE------------LKNGPANEL 215 (602) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~qi~~~~~~--~~~~~~~~~~~~~~~~~~~~g~~~~~~~~------------~~~~~~~ev 215 (602) .. ....-+..|+........ ........|..+ ..+++...++. 
...+..=.| T Consensus 180 ~~--------~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~------~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 245 (512) T protein:vir:97 180 IE--------RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSH------GVYRYLTSRTNGLKLTPRENGFESHSFERMPI 245 (512) T ss_pred CC--------CceEEEEEEEEeeeccccccceEEEEEEEeCC------cEEEEEecCCCcccccccccccccccCcccce Confidence 10 000111122221111000 000000000000 00111111100 011122235 Q ss_pred EEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHH-HhhcccccCc Q lcl|NC_021537. 216 IFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMD-NLKGSRYRTA 294 (602) Q Consensus 216 iH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~-~~~g~~nag~ 294 (602) ++|+.. ..|.|.+..+...++....+..-.++.+...+.|-.+++-.. ..+.+.....+...- .......... T Consensus 246 v~~~nn-----~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (512) T protein:vir:97 246 TEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL-NLDPVEVRKQKEANVLFLEPTVYENR 319 (512) T ss_pred EeecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCc-cCCchhhhhhhhcccccccccchhhc Confidence 666542 368888888887777776666656666666666666654211 122222222211110 0000000000 Q ss_pred ceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH------- Q lcl|NC_021537. 295 ILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ------- 367 (602) Q Consensus 295 ~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~------- 367 (602) ... ...+.+.+++-++.. ..+..+..+.+...+.|...-++|..-.+... ++- +.++. 
T Consensus 320 ~~~------------~~~~~~~d~~~l~~~-~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-gn~-Sg~Al~~~~~~l 384 (512) T protein:vir:97 320 DTG------------IETEGSVDGGYIYKQ-YDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQ-SGEAMKYKLFGL 384 (512) T ss_pred ccc------------cCCCCCcceEEEeec-CCHHHHHHHHHHHHHHHHHHhCCcccCccccc-ccc-hHHHHHHHHHHH Confidence 000 001111222222211 12344567778888899888899875443222 221 22221 Q ss_pred ------HHHHHHHHHHHHHHHHHHHHhhhcCCc-cccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHH Q lcl|NC_021537. 368 ------TREFAKGIIEPEQAKFSARLYKIIHQD-ALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEARE 440 (602) Q Consensus 368 ------~~~f~~~~l~P~~~~ie~~ln~~Ll~~-~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~ 440 (602) ....+..+|+-.++.+...++..--.. ........+.|...-.. +....++.+.++ .|+++..-+++ T Consensus 385 ~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~----~~~e~~~~~~kl--~giiS~et~~~ 458 (512) T protein:vir:97 385 EQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPK----SLIEELKAYIDS--GGKISQTTLMS 458 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCc----CHHHHHHHHHHH--hccCchHHHHH Confidence 111222333333333333333211100 11122345666433222 223345666666 48999888888 Q ss_pred HhCCCCCCCCcc--ccccccccccc--cccccCCCcCccccccccccccccccccc Q lcl|NC_021537. 441 ELDLAPFEDDRG--DMTLSEFEAEF--GADASDGDAEAMLTRSKAAPPLENKIGER 492 (602) Q Consensus 441 ~~Gl~p~~~g~~--d~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (602) +++. +++... +++-....... .......+..+.......+...+....+. 
T Consensus 459 ~l~~--v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 459 LFSF--FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred hCCC--CCCHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCccccccccC Confidence 8865 222211 11100000000 00000001111111111111111111111 No 203 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=97.82 E-value=1.7e-05 Score=46.75 Aligned_cols=429 Identities=10% Similarity=0.013 Sum_probs=164.8 Q ss_pred CCCC---cccccc-c----chhhhcccCcc-ccCCCCHHH---HHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCC Q lcl|NC_021537. 1 MSKA---EETTQL-D----ERHIATDVGRG-IQPPYNPET---LAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADE 68 (602) Q Consensus 1 ~~k~---~~~~~~-~----~~~~~~~~~~~-i~p~~~~~~---l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~ 68 (602) |++. -.+.+. . .+.+... ... ..+...... -.+++ .++...+|+..+..+.+-|..+... T Consensus 45 i~~~i~~~~~~~~~r~~~l~~Yy~g~-~~i~~~~~~~~~~~~~~~ki~--~n~~k~Iv~~~~~yl~g~p~~~~~~----- 116 (511) T protein:vir:99 45 VSKYIEHHMDYQRPRLKVLSDYYEGK-TKNLVELTRRKEEYMADNRVA--HDYASYISDFINGYFLGNPIQYQDD----- 116 (511) T ss_pred HHHHHHHHHHhhHHHHHHHHHHhccc-CccccccCcccccccCcceee--cchHHHHHHHHHhhhcccCceeecC----- Confidence 1100 000000 0 0111100 000 111111111 11222 3566788888888888888776421 Q ss_pred cccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccc Q lcl|NC_021537. 69 PDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTT 148 (602) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~ 148 (602) +.+..+.+..++. .-.+......+..+.+++|.+|..+.++.+|++ .+..++|..+-+.-+. 
T Consensus 117 ---d~~~~~~l~~~~~--------------~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~-~i~~~~p~~~~~vyd~ 178 (511) T protein:vir:99 117 ---DKDVLEAIEAFND--------------LNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDN 178 (511) T ss_pred ---chHHHHHHHHHHh--------------hcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcC Confidence 1112222222221 123556777888999999999999999888874 6788888887654322 Q ss_pred cccccccchhhhhcccCceeEEEEcCC--cceeecccccccccc--eeeecccceEE----ecCceeEEechhHEEEecC Q lcl|NC_021537. 149 TTIEREDGEEVENIESGHGYVQVRQGR--RRYFGEAGDRYGDDK--RFVDKETGEVA----SDAGELKNGPANELIFLPN 220 (602) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~qi~~~~--~~~~~~~~~~~~~~~--~~~~~~~g~~~----~~~~~~~~~~~~eviH~r~ 220 (602) ... . ...-+..|+.+.... ......+...|..+. .+.....+... ........+..=.|++|+. T Consensus 179 ~~~----~----~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 250 (511) T protein:vir:99 179 TIE----R----NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN 250 (511) T ss_pred CCC----C----ceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecC Confidence 110 0 000111222211100 000000000000000 00000000000 0000011122223677764 Q ss_pred CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccC Q lcl|NC_021537. 221 PSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEE 300 (602) Q Consensus 221 ~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~ 300 (602) . ..|.|.+..+...++....+..-..+.+...+.|-.+++-. ...+.+.....++ ++++.... 
T Consensus 251 n-----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~-~~~~~~~~~~~~~-----------~~~~~~~~ 313 (511) T protein:vir:99 251 N-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN-LNLDPVEVRKQKE-----------ANVLFLEP 313 (511) T ss_pred C-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccC-cccCchhhccccc-----------ccceeccc Confidence 3 26888888777777765555555555555555554444321 1122222221111 01111100 Q ss_pred CccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH------------- Q lcl|NC_021537. 301 FVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ------------- 367 (602) Q Consensus 301 g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~------------- 367 (602) ... ..........+.+++.|+.. ..+..+....+...+.|+..-++|....+..+ +|-| ..+. T Consensus 314 ~~~-~~~~~~~~~~~~d~~~l~~~-~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-gn~S-g~Alk~~~~~l~~ka~~ 389 (511) T protein:vir:99 314 TVY-ADSEGRETEGSVDGGYIYKQ-YDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQS-GEAMKYKLFGLEQRTKT 389 (511) T ss_pred ccc-cccccccCCCCcceeEEeec-CCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccch-HHHHHHHHHHHHHHHHH Confidence 000 00000011112233333321 12345667788888999999999875443222 2222 2211 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCC-ccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCC Q lcl|NC_021537. 368 TREFAKGIIEPEQAKFSARLYKIIHQ-DALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAP 446 (602) Q Consensus 368 ~~~f~~~~l~P~~~~ie~~ln~~Ll~-~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p 446 (602) ....+..+|.-.++.+...++..--. .........+.|...... +....++.+.++ .|+++..-++++++. 
T Consensus 390 k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~----n~~e~~~~~~kl--~GiiS~et~l~~l~~-- 461 (511) T protein:vir:99 390 KEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPK----SLIEELKAYIDS--GGKISQTTLMSLFSF-- 461 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCc----CHHHHHHHHHHH--hccCCHHHHHHhCCC-- Confidence 11122233333333333333321100 111122445666543222 233345666666 489998888888754 Q ss_pred CCCCccc--cccccccccc--cccccCCCcCccccccccccccccccccc Q lcl|NC_021537. 447 FEDDRGD--MTLSEFEAEF--GADASDGDAEAMLTRSKAAPPLENKIGER 492 (602) Q Consensus 447 ~~~g~~d--~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (602) +++...+ +.--...... .......+..........+....+...++ T Consensus 462 v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 462 FQDPELEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDSTKDSIDKKE 511 (511) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCCCcCcccccC Confidence 3322111 1100000000 00000000001000000000111111111 No 204 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=97.80 E-value=1.8e-05 Score=46.56 Aligned_cols=427 Identities=11% Similarity=0.038 Sum_probs=160.0 Q ss_pred CCCCcccccccchhhh-cccCc--cc-c-CCC--CH-HHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccc Q lcl|NC_021537. 1 MSKAEETTQLDERHIA-TDVGR--GI-Q-PPY--NP-ETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEG 72 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~-~~~~~--~i-~-p~~--~~-~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~ 72 (602) +.+-........++.. +-.|. .+ . +.. +. 
..-.++ ..++.+.+|+..+..+.|-|+++...+ T Consensus 31 i~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki--~~n~~~~Iv~~~~~~l~G~p~~~~~~d-------- 100 (506) T protein:vir:94 31 ITHHFNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRA--THSFAKYIADFQTSYSVGNPINVKLPD-------- 100 (506) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCccccccccccccccCCccee--ecchHHHHHHHhhhhhcccCceeecCc-------- Confidence 2111111110011111 00011 11 0 100 00 001122 246778899999999988887764321 Q ss_pred hhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccc Q lcl|NC_021537. 73 GESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIE 152 (602) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~ 152 (602) ....+.+..++. ...+......+..+.+.+|.||+.+..+.+|++ .+..++|..+-+..+.... T Consensus 101 ~~~~~~l~~~~~--------------~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~-~i~~~~p~~~~~v~dd~~~- 164 (506) T protein:vir:94 101 DGSNSGFDTFNK--------------ANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEE-HLAKLDPLDTFVIYSTDVD- 164 (506) T ss_pred chHHHHHHHHHh--------------ccCHhHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEecCCCC- Confidence 111122222221 124556677788899999999999999888865 5777888888654332110 Q ss_pred cccchhhhhcccCceeEEEEcCCcc---eeecccccccccc-eeeecccceEEecCceeEEechhHEEEecCCCCCCCcc Q lcl|NC_021537. 153 REDGEEVENIESGHGYVQVRQGRRR---YFGEAGDRYGDDK-RFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYY 228 (602) Q Consensus 153 ~~~~~~~~~~~~~~~~~qi~~~~~~---~~~~~~~~~~~~~-~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~ 228 (602) . ...-+..|++....... ....+...+.... .......+.+....+....+..=.|++++... . 
T Consensus 165 ---~----~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~ 232 (506) T protein:vir:94 165 ---P----KPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITTFPVVEFKNSN-----F 232 (506) T ss_pred ---C----ceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCccceeccccccCCccceEEecCCC-----C Confidence 0 00001111111111000 0000000000000 00000000000001111122222466665432 4 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecc---------------------ccCCHHHHHHHHHHHHHhh Q lcl|NC_021537. 229 GVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTG---------------------GTLSEDSKEDLRNLMDNLK 287 (602) Q Consensus 229 G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---------------------~~~~~~~~~~l~~~~~~~~ 287 (602) |+|.+......++....+..-..+.......|-.+++-.. .....+..+.+ ... T Consensus 233 ~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~- 307 (506) T protein:vir:94 233 RLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELI----KEM- 307 (506) T ss_pred CCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHH----hhh- Confidence 6666665555554443333323222222222222221100 00011111111 111 Q ss_pred cccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHH- Q lcl|NC_021537. 288 GSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKE- 366 (602) Q Consensus 288 g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~- 366 (602) ...+.+.+.++....... ...+.++ |+.. ..+..+....+.....|...-++|..-.+.. 
.++-| ..+ T Consensus 308 ---~~~~~~~~~~~~~~~~~~---~~~d~~~--l~~~-~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~S-g~Ai 376 (506) T protein:vir:94 308 ---KDANMLLLKSGMTVNGTQ---TSVDAKY--INKT-YDVVGSEAYKKRVAGDIHKFSHTPDLTDENF-ASNSS-GVAM 376 (506) T ss_pred ---hhcCeeeecccccccCcc---cccccee--eeec-CCHHHHHHHHHHHHHHHHHHhCccccccccc-cccch-HHHH Confidence 112233333332222111 1112222 2211 1234566778888899999999986432211 12222 221 Q ss_pred ------------HHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCccc Q lcl|NC_021537. 367 ------------QTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGT 434 (602) Q Consensus 367 ------------~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T 434 (602) ..+..+...|+.+++.+...++..=-..........+.|+..-.. +....++.+.++ .|+++ T Consensus 377 k~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~----d~~e~a~~~~kl--~g~iS 450 (506) T protein:vir:94 377 QYKVLGTVELASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPA----DNISQIKALVQA--GATLP 450 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCc----CHHHHHHHHHHH--hccCC Confidence 122233344555554444444321000011122345666544332 333445666666 58999 Q ss_pred HHHHHHHhCCCCCCCCccc--cccccccc-cccccccCCCcCcccccccccccccccccccccccccccccchh Q lcl|NC_021537. 435 VNEAREELDLAPFEDDRGD--MTLSEFEA-EFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDPIE 505 (602) Q Consensus 435 ~NE~R~~~Gl~p~~~g~~d--~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~ 505 (602) ...++++++. +++...+ +..-.... ....+........... ...... ..+++. 
T Consensus 451 ~et~~~~lp~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~-----~~~~~~-----------~~~e~~ 506 (506) T protein:vir:94 451 QKYLYQQLPG--VTNPQDIVDMMKEQSANGDYSFDQNGVISNDGQT-----NTTATQ-----------TDEEVR 506 (506) T ss_pred hHHHHHhCCC--CCCHHHHHHHHHHHHHHHhhcchhhcCCCcccCc-----cccccc-----------cccCCC Confidence 9999988754 3322211 11000000 0000000000000000 000000 011111 No 205 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=97.79 E-value=1.9e-05 Score=46.44 Aligned_cols=405 Identities=12% Similarity=-0.007 Sum_probs=155.5 Q ss_pred CCCCcccccc-----------------cchhh-hcccCc--cccCCC--------CH-HHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_021537. 1 MSKAEETTQL-----------------DERHI-ATDVGR--GIQPPY--------NP-ETLAAFQELNETHQACIRKKSR 51 (602) Q Consensus 1 ~~k~~~~~~~-----------------~~~~~-~~~~~~--~i~p~~--------~~-~~l~~~~~~~~~v~~cI~~ia~ 51 (602) |+.......+ ..... ....|. .+.-+. ++ ..-.+++ .++...+|+..+. T Consensus 17 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~--~n~~~~Iv~~~~~ 94 (468) T protein:vir:96 17 VEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRMY--TNYHQNLVDQKVA 94 (468) T ss_pred eecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccc--cchHHHHHHHHHh Confidence 1110000000 00000 000000 000000 00 0001222 4577788888888 Q ss_pred hhccCceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCce Q lcl|NC_021537. 52 YEAGYGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTP 131 (602) Q Consensus 52 ~ia~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~ 131 (602) .+.+-|..+... +.+..+.+..++. 
.++......+..+...+|.+|+.+-.+.+|.+ T Consensus 95 ~l~g~p~~~~~~--------d~~~~~~l~~~~~---------------n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~ 151 (468) T protein:vir:96 95 YAVANPVTYGTE--------DEKSLKTIQEVLN---------------HKWDDKLVDILTAASNKGVEWIQPYVDEQGEF 151 (468) T ss_pred hhccCCceeccC--------ChHHHHHHHHHHh---------------cCHHHHHHHHHHHHhhcCeEEEEEEEcCCCce Confidence 888877765321 1112222222211 13455666788999999999998888888864 Q ss_pred EEEEEeCcccccccccccc-cccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccc--eEEecCceeE Q lcl|NC_021537. 132 VGLAHVPAATVRVRKTTTT-IEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETG--EVASDAGELK 208 (602) Q Consensus 132 ~~L~~l~p~~v~~~~~~~~-~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~~~~~~~~~ 208 (602) .+..++|..+-+..+... ....-....+ ...+..++.+......+.+........ ........+ .......... T Consensus 152 -~i~~~~p~~~~~v~~~~~~~~~~~~ir~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 228 (468) T protein:vir:96 152 -KTFRVPAEQAIPIWTNKERDELKAFIRLY-ELDGGERVEYWTANDVTFYELKDGQLI-PDYYQGEEHVQAHYYVGNKSM 228 (468) T ss_pred -EEEEEcccceEEEEcCCCCCceEEEEEEE-EecCceEEEEEeCCeEEEEEEcCCcee-ecccccccccccceeeccccc Confidence 677888888764422110 0000000000 001111111111111111110000000 000000000 0000011112 Q ss_pred EechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhc Q lcl|NC_021537. 209 NGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKG 288 (602) Q Consensus 209 ~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g 288 (602) .+..=.|++|+.. ..|.|.+......++....+..-..+.+...+.|-.+++ |....+. +.+ ..... 
T Consensus 229 ~~~~iPvv~~~n~-----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--g~~~~~~--~~~---~~~~~- 295 (468) T protein:vir:96 229 SWNRVPFIPFKNN-----PQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLK--GYEGEDL--EEF---MYNLK- 295 (468) T ss_pred cCCcccEEEecCC-----CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCcccc--chh---hhhhh- Confidence 2223346666543 358888887777776666655555566666677755554 3222211 111 11111 Q ss_pred ccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH- Q lcl|NC_021537. 289 SRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ- 367 (602) Q Consensus 289 ~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~- 367 (602) .++++.+.+.. ..++++.... + .+..+....+...+.|...-++|....+ ...++- +.++. T Consensus 296 ---~~~~i~~~~d~----------~~~~~~l~~~-~--~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~-Sg~Alk 357 (468) T protein:vir:96 296 ---YYKAINVDGDG----------SGGVDTIQID-V--PVQSAKEYLDMLRDYVIEFGQGVDFQQD-KFGNSP-SGIALK 357 (468) T ss_pred ---cCceEEecCCC----------CCcceEEeec-C--ChHHHHHHHHHHHHHHHHHhCccccccc-ccccch-HHHHHH Confidence 11222221100 0112221111 1 1344566777788888888888753221 111221 22221 Q ss_pred ------------HHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccH Q lcl|NC_021537. 368 ------------TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTV 435 (602) Q Consensus 368 ------------~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~ 435 (602) ....+...|+-+++.+...+. .........+.|+..-..+ +. +.++++.++|+++. T Consensus 358 ~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g-----~~~d~~~i~i~f~~~~p~d---~~----e~a~~~~~~g~iS~ 425 (468) T protein:vir:96 358 FMYSNLDLKANKLKNKTLTALQELLQYIIDFYK-----LSIKVQDVEITFNFNVMVN---EL----EQSQIGVNSQYLSK 425 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCcccceeeEEecCCCCcC---HH----HHHHHHHhcCCCch Confidence 111122223333333322221 1112234556665443221 22 22345567899998 Q ss_pred HHHHHHhCCCCCCCCccc--cccccccccccccccCCCcCccccccccccc Q lcl|NC_021537. 
436 NEAREELDLAPFEDDRGD--MTLSEFEAEFGADASDGDAEAMLTRSKAAPP 484 (602) Q Consensus 436 NE~R~~~Gl~p~~~g~~d--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (602) ..++++++. +++...+ +.-....... +.+. .. ......+|. T Consensus 426 et~i~~l~~--v~D~~~E~~ri~~E~~~~~--~~~~-~~---~~~~~~~~~ 468 (468) T protein:vir:96 426 ETVVTNHPW--VDDPVAEMERIDQEELALP--SIEE-GL---NGKENNEPT 468 (468) T ss_pred HHHHHhCCC--CCCHHHHHHHHHHHHHHHH--HHhh-cc---CCCCCCCCC Confidence 888888754 2222111 1100000000 0000 00 000001111 No 206 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=97.77 E-value=2.1e-05 Score=46.28 Aligned_cols=430 Identities=11% Similarity=0.016 Sum_probs=173.9 Q ss_pred CCCC----------cccccccc-----hhhhcccCccccCC-CCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021537. 1 MSKA----------EETTQLDE-----RHIATDVGRGIQPP-YNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHP 64 (602) Q Consensus 1 ~~k~----------~~~~~~~~-----~~~~~~~~~~i~p~-~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~ 64 (602) +.|+ +-+..-.. +.....-..++... .+...-.+-...-.....+++.+|+-+.+=+-.|.-. T Consensus 20 ~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i~~~~A~ll~~e~~~i~~~- 98 (505) T protein:vir:79 20 MTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLASAKLASLIFNEQCQVTVS- 98 (505) T ss_pred chhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHHHHHHHhhhcCCCceeecC- Confidence 1111 11111000 11110000011100 0000000000111455778888888887755444311 Q ss_pred CCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccc Q lcl|NC_021537. 65 SADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRV 144 (602) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~ 144 (602) +....+.+...+. ...+...++..+.+.+..|.+++.+..+. |. 
+.+.+++|..+-+ T Consensus 99 -------d~~~~e~l~~i~~--------------~n~f~~~~~~~~e~a~a~G~~~~k~~~D~-~~-~~i~~v~ad~~~P 155 (505) T protein:vir:79 99 -------DETANDFLDDVFQ--------------QNDFYTTFEEKLEEWIALGSGCVRPYVDS-GK-IKLAWATADQVYP 155 (505) T ss_pred -------ChHHHHHHHHHHH--------------hccHHHHHHHHHHHHhhcCCeEEEEEEeC-Cc-eEEEEEcCCeeEE Confidence 1122222222221 12356677888888899999999888774 33 5688888888765 Q ss_pred cccccccccccchhhhhc----ccCceeEE---EE---cCCcceee-cccccccccc-eeeecccceEEecCceeE---E Q lcl|NC_021537. 145 RKTTTTIEREDGEEVENI----ESGHGYVQ---VR---QGRRRYFG-EAGDRYGDDK-RFVDKETGEVASDAGELK---N 209 (602) Q Consensus 145 ~~~~~~~~~~~~~~~~~~----~~~~~~~q---i~---~~~~~~~~-~~~~~~~~~~-~~~~~~~g~~~~~~~~~~---~ 209 (602) ....+.. ......+..+ .+...|+. .. ++.+.... .|........ .-+...+...+..-.... . T Consensus 156 ~~~d~~~-~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g 234 (505) T protein:vir:79 156 LQADTNQ-VNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITG 234 (505) T ss_pred EEEcCCC-eEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecC Confidence 4322211 0000000000 00011110 00 00000000 0000000000 000000000000000001 1 Q ss_pred echhHEEEecCCCCC----CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH Q lcl|NC_021537. 210 GPANELIFLPNPSPL----ALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN 285 (602) Q Consensus 210 ~~~~eviH~r~~~~~----~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~ 285 (602) ++---+.||+.+.+. ..++|+|.+..+...++.....-.-..+-|..|... |.++...+ +.... 
T Consensus 235 ~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~---i~v~~~~l--------~~~~~- 302 (505) T protein:vir:79 235 LKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRR---LIVPAEWL--------KTGSS- 302 (505) T ss_pred CCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccc---eeechHHh--------cccCC- Confidence 222235677654222 346899999999988877666655555666665443 11221110 00000 Q ss_pred hhcc-cccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCH Q lcl|NC_021537. 286 LKGS-RYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANS 364 (602) Q Consensus 286 ~~g~-~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~ 364 (602) ..|. ...+..++.....-+........+.+ ++.++. .-.+-++.+..+...++|+..-|+++..+|+..++.. |+ T Consensus 303 ~~~~~~~~~~~~fd~~~~~y~~~~~~~~~~~--i~~~~~-~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~-TA 378 (505) T protein:vir:79 303 YGGQASETHPPMFDPDETVYQAMYGDASEVG--FHDATS-PIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQ-TA 378 (505) T ss_pred CCcccccccccCCCccceeeeeccCCCCCCc--eEEecc-cCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccc-hH Confidence 0000 00001111111111111111111222 223321 2235567888889999999999999999997655432 33 Q ss_pred HHH-------------HHHHHHHHHHHHHHHHHHHHhhhcCCc------cccccceEEEeccchhcchhHHHHHHHHHHH Q lcl|NC_021537. 365 KEQ-------------TREFAKGIIEPEQAKFSARLYKIIHQD------ALDVDEWTIDFELRGAEQPEQDAKMAEQRVR 425 (602) Q Consensus 365 e~~-------------~~~f~~~~l~P~~~~ie~~ln~~Ll~~------~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~ 425 (602) .+. .+..++.+|..++..+........+.. ......+.+.|++++-.-.+.++ ..+... 
T Consensus 379 tei~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~--~~~~~~ 456 (505) T protein:vir:79 379 TEVVTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQES--KRAADL 456 (505) T ss_pred HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHH--HHHHHH Confidence 332 122234455555555444333221111 11223456666666655443333 345567 Q ss_pred HHHhCCcccHHHHHHHh-CCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 426 AMRLAGVGTVNEAREEL-DLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 426 ~~~~~G~~T~NE~R~~~-Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) +++.+|+|+..+++... |+ ++++...-+... ...... ..++. .....+ T Consensus 457 ~~v~~Gi~s~e~~l~~~~~~---~eeea~~el~ri----~~E~~~-~~p~~----------~~~gg~ 505 (505) T protein:vir:79 457 QAVQAQVMPKKQFLMRNYGL---DEEEADEWLAQI----DAENST-AEPEF----------NQFGGD 505 (505) T ss_pred HHHHcCCCCHHHHHHhcCCC---ChHHHHHHHHHH----HHhccc-cCCCc----------hhccCC Confidence 88999999999888764 43 332222111111 000000 00000 000000 No 207 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=97.76 E-value=2.2e-05 Score=46.14 Aligned_cols=439 Identities=9% Similarity=-0.044 Sum_probs=169.5 Q ss_pred CC------CCcccccccc-----hhhhcccCccccCC-CCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCC Q lcl|NC_021537. 1 MS------KAEETTQLDE-----RHIATDVGRGIQPP-YNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADE 68 (602) Q Consensus 1 ~~------k~~~~~~~~~-----~~~~~~~~~~i~p~-~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~ 68 (602) ++ +-+.+..... +....+...++.+. .+...-.+-...-.+...+.+.+|+-|..=+-.|.-.+.+.. 
T Consensus 23 ~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~~~~A~Ll~~e~~~i~v~d~~~~ 102 (517) T protein:vir:98 23 LKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSADVLSGLVFNEQCEVYVSDAKDE 102 (517) T ss_pred hhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHHHHhhhhhcCCcceEEecccccc Confidence 11 1111110000 00000000011100 000000000000133345566666666553433332211110 Q ss_pred c---ccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccc Q lcl|NC_021537. 69 P---DEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVR 145 (602) Q Consensus 69 ~---~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~ 145 (602) . .......+.+...+. ...+...++..+.+.+..|.+++.+..+. |. +.+.++++..+-+. T Consensus 103 ~~~~~~~~~~~e~l~~i~~--------------~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~-~~I~~v~ad~~~Pl 166 (517) T protein:vir:98 103 EKKDNSFKTAHEFIQHVFQ--------------HNKFIKNLSDYLEPTFALGGLTVRPYVDN-GE-IEFSWALANAFYPL 166 (517) T ss_pred cccccchhHHHHHHHHHHH--------------hccHHHHHHHHHHHHhhhCCEEEEEEEeC-Ce-eEEEEEcCCeeEEE Confidence 0 000111111222111 12356677778888888999999888874 33 46888999888663 Q ss_pred cccccccccc---chhhhhcccCce-eEEEEcCCcceeecccccc-c-ccceee--------ecccce------EEecCc Q lcl|NC_021537. 146 KTTTTIERED---GEEVENIESGHG-YVQVRQGRRRYFGEAGDRY-G-DDKRFV--------DKETGE------VASDAG 205 (602) Q Consensus 146 ~~~~~~~~~~---~~~~~~~~~~~~-~~qi~~~~~~~~~~~~~~~-~-~~~~~~--------~~~~g~------~~~~~~ 205 (602) .........- ........+... |.++-. .+++... . ..+.+. +...|. ++.... 
T Consensus 167 ~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~------H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~l~ 240 (517) T protein:vir:98 167 RSNSNGISEGVMKSVTTKVIGNKTVYYTLLEF------HEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEELYEGMQ 240 (517) T ss_pred EecCCCeEEEEEEEEEEEeecCCceEEEEEEE------EecCceeccCCcEEEEEEEEecCCCccccccccccccccCCC Confidence 3221110000 000000001111 111100 0000000 0 000000 000010 000000 Q ss_pred eeEEec---hhHEEEecCCCCC----CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHH Q lcl|NC_021537. 206 ELKNGP---ANELIFLPNPSPL----ALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKED 278 (602) Q Consensus 206 ~~~~~~---~~eviH~r~~~~~----~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~ 278 (602) ....++ ---+.||+.+-+. +.++|+|.+..+...++.....-.....-|..|.+. |.++...+.. T Consensus 241 ~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~---i~vp~~~l~~----- 312 (517) T protein:vir:98 241 EKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRT---VFVSDVMLRT----- 312 (517) T ss_pred cceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcc---eecChhhhcc----- Confidence 111111 1124466654222 356899999999888877666555555566665543 2223221100 Q ss_pred HHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc Q lcl|NC_021537. 279 LRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST 358 (602) Q Consensus 279 l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 358 (602) ..+ .++...+. ........+..+.......+ ++.++ ..-.+-++.+..+...+.|+...|+++..+|+... 
T Consensus 313 ---~~~--~~g~~~~~-~~d~~~~~y~~~~~~~~~~~--i~~~~-~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~ 383 (517) T protein:vir:98 313 ---VPD--ESGMPPPQ-VFDPDVNVYKSIRMGTDEEF--VKDVT-HDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGR 383 (517) T ss_pred ---ccC--CCCcccCC-CCCcccceeeeccCCCCCCc--eeeec-cccchHHHHHHHHHHHHHHHHHhCCCccccccccc Confidence 000 00000000 00001111111111111112 22222 22345578888999999999999999999998765 Q ss_pred CCccCHHHHHHH--HHHHHHHHHHHHHHHHHhhhc-----------CCccccccceEEEeccchhcchhHHHHHHHHHHH Q lcl|NC_021537. 359 SNRANSKEQTRE--FAKGIIEPEQAKFSARLYKII-----------HQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVR 425 (602) Q Consensus 359 ~~~sn~e~~~~~--f~~~~l~P~~~~ie~~ln~~L-----------l~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~ 425 (602) +. .|+.+.... -.-.++.-+...++.+|...+ +........+.+.+++++-... |.+...+... T Consensus 384 ~~-kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~--D~~~~~~~~~ 460 (517) T protein:vir:98 384 SM-KTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQ--DRSALLRFYG 460 (517) T ss_pred cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCC--CHHHHHHHHH Confidence 43 344432111 111233334444443333211 1111222344566666655444 3344455677 Q ss_pred HHHhCCcccHHHHHHHh-CCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 426 AMRLAGVGTVNEAREEL-DLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 426 ~~~~~G~~T~NE~R~~~-Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) +++.+|+|+.-+++.++ |+. +++.+..+....... 
....+.+........-+..+ | T Consensus 461 ~~v~aG~ms~~~~i~~~~g~~---eeeA~~e~~~i~~E~---~~~~~~~~~~~~~~~~~gd~----e 517 (517) T protein:vir:98 461 QAKTFGFIPTVEAIQRIFKVP---KKTAEQWLEEIRKDQ---IELDPVTISQRAQKRMFGDE----E 517 (517) T ss_pred HHHhcCCCCHHHHHHHhCCCC---hHHHHHHHHHHHHhc---cccCCCCccccccCCCCCCC----C Confidence 89999999999987665 653 222222221111100 00111111000000000000 0 No 208 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=97.70 E-value=2.7e-05 Score=45.64 Aligned_cols=410 Identities=12% Similarity=0.021 Sum_probs=158.9 Q ss_pred CCCCcccccccc-hhhhcccCccccCC---------------CCHHH--HHHHHhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021537. 1 MSKAEETTQLDE-RHIATDVGRGIQPP---------------YNPET--LAAFQELNETHQACIRKKSRYEAGYGFEIVA 62 (602) Q Consensus 1 ~~k~~~~~~~~~-~~~~~~~~~~i~p~---------------~~~~~--l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~ 62 (602) -+++..-..... ..+.-.-......+ .++.. -.++ .+++...+|+..+..+.|-|..+.. T Consensus 15 ~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki--~~n~~~~Ivd~~~~yl~G~p~~~~~ 92 (471) T protein:vir:10 15 VKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRI--SHNWHQLLLDQKKAYALTYPPTFDV 92 (471) T ss_pred HHHHHHHHHHHHHHHHhccccccccccchhhhhccccccccccccccccccee--ccchhHHHHHhhhhhhcccCceecc Confidence 011110000000 00000000000000 00000 0012 2457788899888888888877532 Q ss_pred ecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC-CceEEEEEeCccc Q lcl|NC_021537. 63 HPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD-GTPVGLAHVPAAT 141 (602) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~-G~~~~L~~l~p~~ 141 (602) + +.+..+.+..++. ..+......+..+...+|.+|..+.++.. |+ ..+..++|.. 
T Consensus 93 --~------~~~~~~~l~~~~~---------------n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~-~~~~~~~p~~ 148 (471) T protein:vir:10 93 --D------DKKVNDMIVDVLG---------------DDYERISKQLCVNAGNAGIAWLHVWKDASDNS-FRYACVDSKE 148 (471) T ss_pred --C------ChHHHHHHHHHHh---------------cCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCe-eEEEEEcccc Confidence 1 1122222222211 13456677788899999999999988854 55 5688889988 Q ss_pred cccccccccc-ccc---cchhhhh--cccCceeEEEEcCCcceeeccccccccc-------ceeeecccceEEecCceeE Q lcl|NC_021537. 142 VRVRKTTTTI-ERE---DGEEVEN--IESGHGYVQVRQGRRRYFGEAGDRYGDD-------KRFVDKETGEVASDAGELK 208 (602) Q Consensus 142 v~~~~~~~~~-~~~---~~~~~~~--~~~~~~~~qi~~~~~~~~~~~~~~~~~~-------~~~~~~~~g~~~~~~~~~~ 208 (602) +-+.-+.... ... ....... ......++.+......+.+......... .......++.+........ T Consensus 149 ~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (471) T protein:vir:10 149 VIPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKH 228 (471) T ss_pred eEEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccC Confidence 7654332110 000 0000000 0001111111111111111110000000 0000001111111111111 Q ss_pred EechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhc Q lcl|NC_021537. 209 NGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKG 288 (602) Q Consensus 209 ~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g 288 (602) .+..=.|+|+++.. .|.|.+......++....+..-.++.+...+.|-.+++-.+....++.... + . 
T Consensus 229 ~~g~iPvv~~~n~~-----~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~----~---~- 295 (471) T protein:vir:10 229 DFGLVPFIPFKNNE-----IETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLED----L---K- 295 (471) T ss_pred CCCceeEEEeccCC-----CCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHH----h---h- Confidence 22223467776532 578888777776666665555555556666666455442111112221111 1 1 Q ss_pred ccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH- Q lcl|NC_021537. 289 SRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ- 367 (602) Q Consensus 289 ~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~- 367 (602) ..+.+.+.... .....++++. +... .+..+....+...+.|...-++|..-.. ..+|-|. .+. T Consensus 296 ---~~~~i~~~~~~-------~~~~~~~~~l--~~~~-~~~~~~~~~~~l~~~I~~~s~tp~~~~~--~~gn~Sg-~Alk 359 (471) T protein:vir:10 296 ---RYKMIKMDNDG-------MGDQSGVTTI--AIDI-PTEARNLILERTKKQIFISGQGVNPETD--KLGNSSG-VALK 359 (471) T ss_pred ---cCCeEEecCCC-------CccCccceEE--eecC-ChHHHHHHHHHHHHHHHHHhCCcCCCcc--cccCccH-HHHH Confidence 11112211110 0111222222 1111 1234567777888888888888754222 1122221 111 Q ss_pred ------------HHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccH Q lcl|NC_021537. 368 ------------TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTV 435 (602) Q Consensus 368 ------------~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~ 435 (602) .+..+..+|+-+++.+...++. ....++.+.|...-.. +....++.+.++ .|+++. T Consensus 360 ~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~------~d~~~i~i~f~~~~p~----n~~e~~~~~~kl--~g~iS~ 427 (471) T protein:vir:10 360 FLYSLLELKAGNMETQFRSGYATLVKMILKHLGL------SDKLKIKQTWTRNSIN----NDTEMAQVVSTL--ATITSR 427 (471) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------CCCceeEEEeCCCCCC----CHHHHHHHHHHH--hccCch Confidence 1112222333333333332221 1224456666554332 233345566665 689998 Q ss_pred HHHHHHhCCCCCCCCccc--cccccccccccccccCCCcCcccccccccccccccccccc Q lcl|NC_021537. 
436 NEAREELDLAPFEDDRGD--MTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERD 493 (602) Q Consensus 436 NE~R~~~Gl~p~~~g~~d--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (602) .-++++++. +++...+ +.--.... ...... +..+. ..++ +.+ T Consensus 428 et~~~~~p~--v~D~~~E~eri~~E~~~-~~~~~~--~~~~~----~~~~-------e~~ 471 (471) T protein:vir:10 428 ENVAKSNPI--VEDWQDELRLQKAEQEG-RSEKLY--DMEEV----EHES-------EVE 471 (471) T ss_pred HHHHHhCCC--CCCHHHHHHHHHHHHHH-HHhccc--ccCCC----CCcc-------ccC Confidence 888888754 3222111 00000000 000000 00000 0000 000 No 209 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=97.62 E-value=3.6e-05 Score=44.93 Aligned_cols=426 Identities=10% Similarity=-0.005 Sum_probs=170.2 Q ss_pred CCCC---cccccc-cc---hhhhcccCccc-cCCCCHHH---HHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCc Q lcl|NC_021537. 1 MSKA---EETTQL-DE---RHIATDVGRGI-QPPYNPET---LAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEP 69 (602) Q Consensus 1 ~~k~---~~~~~~-~~---~~~~~~~~~~i-~p~~~~~~---l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~ 69 (602) |++. -.+.+. .. .++...-...+ .+...+.. -.+++ ..+...+|+..+..+.|-|..+... T Consensus 45 i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~--~n~~k~Iv~~~~~yl~g~p~~~~~~------ 116 (511) T protein:vir:96 45 VSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVA--HDYASYISDFINGYFLGNPIQYQDD------ 116 (511) T ss_pred HHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceee--cchHHHHHHHHHhhhccCCceeecC------ Confidence 1110 000000 00 01110000011 11111111 01222 3577788898998888888876421 Q ss_pred ccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccc Q lcl|NC_021537. 70 DEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~ 149 (602) +.+..+.+..++. .-.+......+..+.+++|.+|..+-++.+|++ .+..++|..+.+.-+.. 
T Consensus 117 --~~~~~~~l~~~~~--------------~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~-~i~~~~p~~~~~vydd~ 179 (511) T protein:vir:96 117 --DKDVLEAIEAFND--------------LNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNT 179 (511) T ss_pred --chHHHHHHHHHHh--------------hcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCC Confidence 1112222222221 124566778888999999999999988888864 67888888887543321 Q ss_pred ccccccchhhhhcccCceeEEEEcCCc--ceeecccccccccceeeecccceEEecCc------------eeEEechhHE Q lcl|NC_021537. 150 TIEREDGEEVENIESGHGYVQVRQGRR--RYFGEAGDRYGDDKRFVDKETGEVASDAG------------ELKNGPANEL 215 (602) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~qi~~~~~--~~~~~~~~~~~~~~~~~~~~~g~~~~~~~------------~~~~~~~~ev 215 (602) .. . ...-+..|+....... .....+...|..+. .+.+...++ ....+..=.| T Consensus 180 ~~----~----~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~------i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv 245 (511) T protein:vir:96 180 IE----R----NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG------VYRYLTSRTNGLKLTPRENGFESHSFERMPI 245 (511) T ss_pred CC----C----ceEEEEEEEEeeeccccccceEEEEEEEeCCc------EEEEEecCCCcccccccccccccccCCceee Confidence 10 0 0001112222111100 00000000000000 000100000 0011112235 Q ss_pred EEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcc Q lcl|NC_021537. 216 IFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAI 295 (602) Q Consensus 216 iH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~ 295 (602) ++|+.. ..|+|.++.+...++....+..-..+.+...+.|-.+++-. ...+.+...... 
.+++ T Consensus 246 v~~~nn-----~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~-~~~~~~~~~~~~-----------~~~~ 308 (511) T protein:vir:96 246 TEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN-LNLDPVEVRKQK-----------EANV 308 (511) T ss_pred EEecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecC-ccCCchhhcccc-----------cccc Confidence 666542 36888888887777776666555566666666665555421 112222221111 1111 Q ss_pred eeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH-------- Q lcl|NC_021537. 296 LEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ-------- 367 (602) Q Consensus 296 ~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------- 367 (602) +.+....... ........+.++..|+... .+..+....+.+.+.|...-++|..-.+... ++-| ..+. T Consensus 309 ~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~-~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-~n~S-g~Al~~~~~~l~ 384 (511) T protein:vir:96 309 LFLEPTVYAD-SEGRETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQS-GEAMKYKLFGLE 384 (511) T ss_pred eecccccccc-cccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCccccccccc-ccch-HHHHHHHHHHHH Confidence 1111110000 0001111122333333211 2345677888888999999999875443222 2222 2221 Q ss_pred -----HHHHHHHHHHHHHHHHHHHHhhhcCCc-cccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHH Q lcl|NC_021537. 368 -----TREFAKGIIEPEQAKFSARLYKIIHQD-ALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREE 441 (602) Q Consensus 368 -----~~~f~~~~l~P~~~~ie~~ln~~Ll~~-~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~ 441 (602) ....+..+|+-.++.|...++.+--.. ........+.|...-.. 
+....++.+.++ .|+++...+.++ T Consensus 385 ~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~----n~~e~~~~~~kl--~G~iS~et~l~~ 458 (511) T protein:vir:96 385 QRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPK----SLIEELKAYIDS--GGKISQTTLMSL 458 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCC----CHHHHHHHHHHH--hccCChHHHHHh Confidence 112233344444444444443221111 11223445666533222 233345566665 689999888888 Q ss_pred hCCCCCCCCcccccccccccccc--ccccCCCcCcccccccccccccccccccc Q lcl|NC_021537. 442 LDLAPFEDDRGDMTLSEFEAEFG--ADASDGDAEAMLTRSKAAPPLENKIGERD 493 (602) Q Consensus 442 ~Gl~p~~~g~~d~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (602) +++-.-+..+.++..-....... ........ ....+...++..++...+.. T Consensus 459 l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 459 FSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP-RDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhhccccCC-CCCCCCCCCCcccccccccC Confidence 86522111111111100000000 00000000 00000000010011000100 No 210 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=97.62 E-value=3.7e-05 Score=44.90 Aligned_cols=422 Identities=10% Similarity=0.012 Sum_probs=168.2 Q ss_pred CCCC-----cccccccchhhhcccCccc-cCCCCHH---HHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCccc Q lcl|NC_021537. 1 MSKA-----EETTQLDERHIATDVGRGI-QPPYNPE---TLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDE 71 (602) Q Consensus 1 ~~k~-----~~~~~~~~~~~~~~~~~~i-~p~~~~~---~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~ 71 (602) +.+- +.-+++. +.+... ...+ .+..... ...+++ .++...+|+..+..+.+-|+++... 
T Consensus 49 i~~~~~~~~~r~~~l~-~Yy~g~-~~i~~~~~~~~~~~~~~~ki~--~n~~k~Iv~~~~~yl~g~p~~~~~~-------- 116 (511) T protein:vir:10 49 IEHHMDYQRPRLKVLS-DYYEGK-TKNLVELTRRKEEYMADNRVA--HDYASYISDFINGYFLGNPIQYQDD-------- 116 (511) T ss_pred HHHHHHhhHHHHHHHH-HHhccc-CccccccCcccccccCcceee--cchHHHHHHHHhhhhcccCceeecC-------- Confidence 1110 0000111 111100 0111 1111110 011222 4677888888888888888876421 Q ss_pred chhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccc Q lcl|NC_021537. 72 GGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTI 151 (602) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~ 151 (602) +.+..+.+..++. .-.+......+..+.+++|.||..+.++.+|++ .+..++|..+-+.-+.... T Consensus 117 d~~~~~~l~~~~~--------------~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~-~i~~~~p~~~~~vydd~~~ 181 (511) T protein:vir:10 117 DKDVLEAIEAFND--------------LNDVESHNRSLGLDLSIYGKAYEIMIRNQDDET-RLYKSDAMSTFVIYDNTIE 181 (511) T ss_pred chHHHHHHHHHHh--------------hcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCC Confidence 1111122222221 123556777888999999999999988888874 6778888887654332110 Q ss_pred ccccchhhhhcccCceeEEEEcCC--cceeecccccccccceeeecccceEEecCc------------eeEEechhHEEE Q lcl|NC_021537. 152 EREDGEEVENIESGHGYVQVRQGR--RRYFGEAGDRYGDDKRFVDKETGEVASDAG------------ELKNGPANELIF 217 (602) Q Consensus 152 ~~~~~~~~~~~~~~~~~~qi~~~~--~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~------------~~~~~~~~eviH 217 (602) . ...-+..|+...... ......+...|..+. 
.+.+...++ ....+..=.|++ T Consensus 182 -~-------~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~------i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~ 247 (511) T protein:vir:10 182 -R-------NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG------VYRYLTSRTNGLKLTPRENGFESHSFERMPITE 247 (511) T ss_pred -C-------ceEEEEEEEEeeecccCccceEEEEEEEeCCc------EEEEEecCCCcccccccccccccccCcceeEEE Confidence 0 000111222111100 000000000000000 000100000 011111223566 Q ss_pred ecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCccee Q lcl|NC_021537. 218 LPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILE 297 (602) Q Consensus 218 ~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~ 297 (602) |+.. ..|.|.++.+...++....+..-..+.+...+.|-.+++-. ...+.+.....++ ++++. T Consensus 248 f~nn-----~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~-~~~~~~~~~~~~~-----------~~~~~ 310 (511) T protein:vir:10 248 FSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN-LNLDPVEVRKQKE-----------ANVLF 310 (511) T ss_pred ecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecc-ccCCchhhccchh-----------cccee Confidence 6542 26888888887777766665555556666666665555421 1122222221111 11111 Q ss_pred ccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH---------- Q lcl|NC_021537. 298 VEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ---------- 367 (602) Q Consensus 298 ~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~---------- 367 (602) +..+..... .......+.+++.|+. +..+..+....+...+.|+..-++|..-.+... +|-| ..+. 
T Consensus 311 ~~~~~~~~~-~~~~~~~~~d~~~l~~-~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~S-g~Al~~~~~~l~~k 386 (511) T protein:vir:10 311 LEPTVYADS-EGRETEGSVDGGYIYK-QYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQS-GEAMKYKLFGLEQR 386 (511) T ss_pred ccccccccc-ccccCCCCcceeEEee-cCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccch-HHHHHHHHHHHHHH Confidence 111110000 0001111223333331 112345667788888899998899875433221 2222 2221 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHhhhcCCc-cccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhC Q lcl|NC_021537. 368 ---TREFAKGIIEPEQAKFSARLYKIIHQD-ALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELD 443 (602) Q Consensus 368 ---~~~f~~~~l~P~~~~ie~~ln~~Ll~~-~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~G 443 (602) ....+..+|+-.++.+...+...--.. ........+.|...-.. +....++.+.++ .|+++..-+.++++ T Consensus 387 ~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~----d~~~~~~~~~kl--~G~iS~et~~~~l~ 460 (511) T protein:vir:10 387 TKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPK----SLIEELKAYIDS--GGKISQTTLMSLFS 460 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCc----CHHHHHHHHHHH--hccCcHHHHHHhCC Confidence 122233344444444444433221111 11122345555433222 333345666666 48899888888886 Q ss_pred CCCCCCCccc--cccccccccc--cccccCCCcCccccccccccccccccccc Q lcl|NC_021537. 444 LAPFEDDRGD--MTLSEFEAEF--GADASDGDAEAMLTRSKAAPPLENKIGER 492 (602) Q Consensus 444 l~p~~~g~~d--~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (602) + +++...+ ++-....... .......+..........+...+....+. 
T Consensus 461 ~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 461 F--FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred C--CCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcccCcccccC Confidence 5 3322111 1100000000 00000000000000000000001111111 No 211 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=97.57 E-value=4.4e-05 Score=44.48 Aligned_cols=423 Identities=9% Similarity=-0.009 Sum_probs=166.9 Q ss_pred CCCC---cccccc-----cchhhhcccCccc-cCCCCHHH---HHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCC Q lcl|NC_021537. 1 MSKA---EETTQL-----DERHIATDVGRGI-QPPYNPET---LAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADE 68 (602) Q Consensus 1 ~~k~---~~~~~~-----~~~~~~~~~~~~i-~p~~~~~~---l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~ 68 (602) |+|- -.+.+. ..+++... ...+ .+...... -.+++ .++...+|+..+..+.+-|..+.. + T Consensus 45 i~~~i~~~~~~~~~r~~~l~~Yy~g~-~~il~~~~~~~~~~~~~~ki~--~n~~k~Iv~~~~~yl~g~p~~~~~--~--- 116 (511) T protein:vir:78 45 VSKYIEHHMDYQRPRLKVLSDYYEGK-TKNLVELTRRKEEYMADNRVA--HDYASYISDFINGYFLGNPIQYQD--D--- 116 (511) T ss_pred HHHHHHHHHHhhhHHHHHHHHHhhcc-CccccccCcccccccCcceee--cchHHHHHHHHhhhhcccCceeec--C--- Confidence 1111 001110 00111110 0111 11110100 01222 367778888888888888877642 1 Q ss_pred cccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccc Q lcl|NC_021537. 69 PDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTT 148 (602) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~ 148 (602) +.+..+.+..++.. -.+..+...+..+.+++|.||..+-++.+|++ .+..++|..+.+..+. 
T Consensus 117 ---d~~~~~~l~~~~~~--------------n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~v~dd 178 (511) T protein:vir:78 117 ---DKDVLEAIEAFNDL--------------NDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFIIYDN 178 (511) T ss_pred ---chHHHHHHHHHHhh--------------cChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEEEEcC Confidence 11122223322211 23456777888999999999999988888874 6788888888654332 Q ss_pred cccccccchhhhhcccCceeEEEEcCCcc--eeecccccccccceeeecccceEEecCce------------eEEechhH Q lcl|NC_021537. 149 TTIEREDGEEVENIESGHGYVQVRQGRRR--YFGEAGDRYGDDKRFVDKETGEVASDAGE------------LKNGPANE 214 (602) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~qi~~~~~~--~~~~~~~~~~~~~~~~~~~~g~~~~~~~~------------~~~~~~~e 214 (602) ... ....-+..|++....... ....+...|..+ ..+.+...++. ...+..=. T Consensus 179 ~~~--------~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~------~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 244 (511) T protein:vir:78 179 TVE--------RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSH------GVYRYLTNRTNGLKLTPRENSFESHSFERMP 244 (511) T ss_pred CCC--------CceEEEEEEEEeeeccccccceEEEEEEEeCC------cEEEEEecCCCcccccccccccccCcCcccc Confidence 110 000111122222111100 000000011000 00111111110 01111223 Q ss_pred EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCc Q lcl|NC_021537. 215 LIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTA 294 (602) Q Consensus 215 viH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~ 294 (602) |++|+.. ..|.|.++.+...++....+..-..+.+...+.|-.+++-. ...+.+....... 
++ T Consensus 245 vv~~~n~-----~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~-~~~~~~~~~~~~~-----------~~ 307 (511) T protein:vir:78 245 ITEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN-LNLDPVEVRKQKE-----------AN 307 (511) T ss_pred eEEecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecC-ccCCchhhccccc-----------cc Confidence 5555432 36888888877777766655555555556556665555421 1122222221111 11 Q ss_pred ceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH------- Q lcl|NC_021537. 295 ILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ------- 367 (602) Q Consensus 295 ~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~------- 367 (602) ++....+..... .......+.+++.|+. +..+..+....+...+.|+..-++|....+... ++- +.++. T Consensus 308 ~~~~~~~~~~~~-~~~~~~~~~~~~~l~~-~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~-Sg~Al~~~~~~l 383 (511) T protein:vir:78 308 VLFLEPTVYVDA-EGRETEGSVDGGYIYK-QYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQ-SGEAMKYKLFGL 383 (511) T ss_pred ceeccccceecc-ccccCCCCcceeEEee-cCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccc-HHHHHHHHHHHH Confidence 111111100000 0001111222222321 112345677788888999999999875443322 222 22211 Q ss_pred ------HHHHHHHHHHHHHHHHHHHHhhhcCC-ccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHH Q lcl|NC_021537. 368 ------TREFAKGIIEPEQAKFSARLYKIIHQ-DALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEARE 440 (602) Q Consensus 368 ------~~~f~~~~l~P~~~~ie~~ln~~Ll~-~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~ 440 (602) ....+..+|+-.++.+...+...--. ......+..+.|...-.. 
+....++.+.++ .|+++..-+.+ T Consensus 384 ~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~----n~~e~~d~~~kl--~G~iS~et~l~ 457 (511) T protein:vir:78 384 EQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPK----SLIEELKAYIDS--GGKISQTTLMS 457 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCc----CHHHHHHHHHHH--hccCChHHHHH Confidence 11222334444444444433321110 011122345666443222 233345667666 48999888888 Q ss_pred HhCCCCCCCCcc--ccccccccccc--cccccCCCcCccccccccccccccccccc Q lcl|NC_021537. 441 ELDLAPFEDDRG--DMTLSEFEAEF--GADASDGDAEAMLTRSKAAPPLENKIGER 492 (602) Q Consensus 441 ~~Gl~p~~~g~~--d~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (602) ++++ +++... +++-....... .......+..+.......+...+....+. T Consensus 458 ~l~~--v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 458 LFSF--FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred hCCC--CCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 8754 332211 11110000000 00000000000000000000001111111 No 212 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=97.57 E-value=4.4e-05 Score=44.48 Aligned_cols=423 Identities=9% Similarity=-0.009 Sum_probs=166.9 Q ss_pred CCCC---cccccc-----cchhhhcccCccc-cCCCCHHH---HHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCC Q lcl|NC_021537. 1 MSKA---EETTQL-----DERHIATDVGRGI-QPPYNPET---LAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADE 68 (602) Q Consensus 1 ~~k~---~~~~~~-----~~~~~~~~~~~~i-~p~~~~~~---l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~ 68 (602) |+|- -.+.+. ..+++... ...+ .+...... -.+++ .++...+|+..+..+.+-|..+.. 
+ T Consensus 45 i~~~i~~~~~~~~~r~~~l~~Yy~g~-~~il~~~~~~~~~~~~~~ki~--~n~~k~Iv~~~~~yl~g~p~~~~~--~--- 116 (511) T protein:vir:96 45 VSKYIEHHMDYQRPRLKVLSDYYEGK-TKNLVELTRRKEEYMADNRVA--HDYASYISDFINGYFLGNPIQYQD--D--- 116 (511) T ss_pred HHHHHHHHHHhhhHHHHHHHHHhhcc-CccccccCcccccccCcceee--cchHHHHHHHHhhhhcccCceeec--C--- Confidence 1111 001110 00111110 0111 11110100 01222 367778888888888888877642 1 Q ss_pred cccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccc Q lcl|NC_021537. 69 PDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTT 148 (602) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~ 148 (602) +.+..+.+..++.. -.+..+...+..+.+++|.||..+-++.+|++ .+..++|..+.+..+. T Consensus 117 ---d~~~~~~l~~~~~~--------------n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~v~dd 178 (511) T protein:vir:96 117 ---DKDVLEAIEAFNDL--------------NDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFIIYDN 178 (511) T ss_pred ---chHHHHHHHHHHhh--------------cChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEEEEcC Confidence 11122223322211 23456777888999999999999988888874 6788888888654332 Q ss_pred cccccccchhhhhcccCceeEEEEcCCcc--eeecccccccccceeeecccceEEecCce------------eEEechhH Q lcl|NC_021537. 149 TTIEREDGEEVENIESGHGYVQVRQGRRR--YFGEAGDRYGDDKRFVDKETGEVASDAGE------------LKNGPANE 214 (602) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~qi~~~~~~--~~~~~~~~~~~~~~~~~~~~g~~~~~~~~------------~~~~~~~e 214 (602) ... ....-+..|++....... ....+...|..+ ..+.+...++. ...+..=. 
T Consensus 179 ~~~--------~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~------~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 244 (511) T protein:vir:96 179 TVE--------RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSH------GVYRYLTNRTNGLKLTPRENSFESHSFERMP 244 (511) T ss_pred CCC--------CceEEEEEEEEeeeccccccceEEEEEEEeCC------cEEEEEecCCCcccccccccccccCcCcccc Confidence 110 000111122222111100 000000011000 00111111110 01111223 Q ss_pred EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCc Q lcl|NC_021537. 215 LIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTA 294 (602) Q Consensus 215 viH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~ 294 (602) |++|+.. ..|.|.++.+...++....+..-..+.+...+.|-.+++-. ...+.+....... ++ T Consensus 245 vv~~~n~-----~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~-~~~~~~~~~~~~~-----------~~ 307 (511) T protein:vir:96 245 ITEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN-LNLDPVEVRKQKE-----------AN 307 (511) T ss_pred eEEecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecC-ccCCchhhccccc-----------cc Confidence 5555432 36888888877777766655555555556556665555421 1122222221111 11 Q ss_pred ceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH------- Q lcl|NC_021537. 295 ILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ------- 367 (602) Q Consensus 295 ~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~------- 367 (602) ++....+..... .......+.+++.|+. +..+..+....+...+.|+..-++|....+... ++- +.++. 
T Consensus 308 ~~~~~~~~~~~~-~~~~~~~~~~~~~l~~-~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~-Sg~Al~~~~~~l 383 (511) T protein:vir:96 308 VLFLEPTVYVDA-EGRETEGSVDGGYIYK-QYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQ-SGEAMKYKLFGL 383 (511) T ss_pred ceeccccceecc-ccccCCCCcceeEEee-cCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccc-HHHHHHHHHHHH Confidence 111111100000 0001111222222321 112345677788888999999999875443322 222 22211 Q ss_pred ------HHHHHHHHHHHHHHHHHHHHhhhcCC-ccccccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHH Q lcl|NC_021537. 368 ------TREFAKGIIEPEQAKFSARLYKIIHQ-DALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEARE 440 (602) Q Consensus 368 ------~~~f~~~~l~P~~~~ie~~ln~~Ll~-~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~ 440 (602) ....+..+|+-.++.+...+...--. ......+..+.|...-.. +....++.+.++ .|+++..-+.+ T Consensus 384 ~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~----n~~e~~d~~~kl--~G~iS~et~l~ 457 (511) T protein:vir:96 384 EQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPK----SLIEELKAYIDS--GGKISQTTLMS 457 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCc----CHHHHHHHHHHH--hccCChHHHHH Confidence 11222334444444444433321110 011122345666443222 233345667666 48999888888 Q ss_pred HhCCCCCCCCcc--ccccccccccc--cccccCCCcCccccccccccccccccccc Q lcl|NC_021537. 441 ELDLAPFEDDRG--DMTLSEFEAEF--GADASDGDAEAMLTRSKAAPPLENKIGER 492 (602) Q Consensus 441 ~~Gl~p~~~g~~--d~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (602) ++++ +++... +++-....... .......+..+.......+...+....+. 
T Consensus 458 ~l~~--v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 458 LFSF--FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred hCCC--CCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 8754 332211 11110000000 00000000000000000000001111111 No 213 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=97.55 E-value=4.7e-05 Score=44.32 Aligned_cols=406 Identities=10% Similarity=-0.016 Sum_probs=163.3 Q ss_pred CCCCcccccccc------hhhhcccCcccc-C--CCC--------HH--HHHHHHhhhHHHHHHHHHHHHhhccCceEEE Q lcl|NC_021537. 1 MSKAEETTQLDE------RHIATDVGRGIQ-P--PYN--------PE--TLAAFQELNETHQACIRKKSRYEAGYGFEIV 61 (602) Q Consensus 1 ~~k~~~~~~~~~------~~~~~~~~~~i~-p--~~~--------~~--~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~ 61 (602) +.+--...+... .++...-..... + ++. .. .-.+++ +++...+|+..+..+.|-|..+. T Consensus 10 i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~--~n~~k~Iv~~~~~yl~G~p~~~~ 87 (470) T protein:vir:10 10 IQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIP--SNFYQLLVDQEAGYVASVFPDID 87 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccc--cchHHHHHHhhhhheeccceeee Confidence 111111000000 001000000000 0 000 00 001222 46777889999999999898764 Q ss_pred EecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccc Q lcl|NC_021537. 62 AHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAAT 141 (602) Q Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~ 141 (602) ..+ ....+.+..++. .++...+..+..+...+|.+|..+-++.+|++ .+..++|.. 
T Consensus 88 ~~d--------~~~~~~l~~~~~---------------~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~-~~~~~~p~~ 143 (470) T protein:vir:10 88 VGK--------DADNKKIIDVLG---------------DDRALTLNGLLVDSSNAGRAWLHYWIDEDGNF-RYGIIQPDQ 143 (470) T ss_pred cCc--------hHHHHHHHHHHh---------------hhHHHHHHHHHHHHhhcCeeEEEEEecCCCce-EEEEEcccc Confidence 321 111222322221 12445556677899999999999999988875 577788888 Q ss_pred ccccccccccccccchhhhhcccCceeEEEEcCC--------------cceeeccccccc--ccc-eeeecccceEEec- Q lcl|NC_021537. 142 VRVRKTTTTIEREDGEEVENIESGHGYVQVRQGR--------------RRYFGEAGDRYG--DDK-RFVDKETGEVASD- 203 (602) Q Consensus 142 v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~--------------~~~~~~~~~~~~--~~~-~~~~~~~g~~~~~- 203 (602) +-+.-+.... .....+..|+...... ..+.+....... ... ............. T Consensus 144 ~~~v~d~~~~--------~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (470) T protein:vir:10 144 ITPIYATTLD--------NKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYET 215 (470) T ss_pred eEEEEcCCCC--------CceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceecccccccccccccccccc Confidence 7655332110 0001111222211111 111111000000 000 0000000000000 Q ss_pred ---CceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHH Q lcl|NC_021537. 204 ---AGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLR 280 (602) Q Consensus 204 ---~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~ 280 (602) ......+.-=.|+||+... .|.|.+......++....+..-..+.+...+.|-.+++-.+....++... 
T Consensus 216 ~~~~~~~~~~g~vPvv~~~nn~-----~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~--- 287 (470) T protein:vir:10 216 GQSNTLKHNFGRVPFIEFSKNK-----YRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMN--- 287 (470) T ss_pred ccccccccCCCeeeEEEeecCC-----CCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhh--- Confidence 0001112222467776542 68898888877777766666666666666666666654221111122111 Q ss_pred HHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCC Q lcl|NC_021537. 281 NLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSN 360 (602) Q Consensus 281 ~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~ 360 (602) .+.. .+ .+.+... ..+...++++. +... .+..+...++...+.|...-++|.. +....++ T Consensus 288 -~~~~------~~-~i~~~~~-------~~~~~~~~~~l--t~~~-~~~~~~~~~~~L~~~I~~~s~~p~~--~~~~~gn 347 (470) T protein:vir:10 288 -DLRK------YK-SIKINNT-------GNGDNSGVDKL--QIDI-PVEARDDALKITRKNIFLFGQGIDP--ANFESSN 347 (470) T ss_pred -hhhh------cC-eEeccCC-------CCCcCceeEEE--eecC-ChHHHHHHHHHHHHHHHHHhCCCCC--Ccccccc Confidence 1111 11 1111110 00111222221 1111 1234566777788888888888753 2222222 Q ss_pred ccCHHH-------------HHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHHH Q lcl|NC_021537. 361 RANSKE-------------QTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAM 427 (602) Q Consensus 361 ~sn~e~-------------~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~ 427 (602) -| ..+ ..+..+..+|+-+++.|...++. .........+.|+..-.. 
+....++.+.++ T Consensus 348 ~S-g~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~----~~~d~~~i~i~f~~~~p~----d~~e~~~~~~~~ 418 (470) T protein:vir:10 348 AS-GVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF----SDADKRHISQHWTRTKVE----DSLTKAQIVSTV 418 (470) T ss_pred ch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----cCcccceeeEEeccCCCC----CHHHHHHHHHHH Confidence 22 111 12222334444444444443332 111223445666544332 333345556555 Q ss_pred HhCCcccHHHHHHHhCCCCCCCCccc--cccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 428 RLAGVGTVNEAREELDLAPFEDDRGD--MTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 428 ~~~G~~T~NE~R~~~Gl~p~~~g~~d--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) +|+++..-++++++. +++...+ +.--..........+..+.. +.+....+ T Consensus 419 --~g~iS~et~l~~~p~--v~D~~~E~eri~~E~~e~~~~~~~~~~~~----------~~~~dde~ 470 (470) T protein:vir:10 419 --ANYSSKEAVAKANPI--VDDWQQELKDLAKDKEENDPYSNQADELN----------GKGVNDEQ 470 (470) T ss_pred --hccCcHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhhccccccC----------CCCCCCCC Confidence 689998888888764 3332211 11000000000000000000 00000000 No 214 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=97.54 E-value=4.8e-05 Score=44.27 Aligned_cols=461 Identities=10% Similarity=0.038 Sum_probs=170.3 Q ss_pred CCCCccc--ccccchhhhcccCccccCCCC-----------HHHHHHHHhhhHHHHHHHHHHHHhhccC-----ceEEEE Q lcl|NC_021537. 1 MSKAEET--TQLDERHIATDVGRGIQPPYN-----------PETLAAFQELNETHQACIRKKSRYEAGY-----GFEIVA 62 (602) Q Consensus 1 ~~k~~~~--~~~~~~~~~~~~~~~i~p~~~-----------~~~l~~~~~~~~~v~~cI~~ia~~ia~~-----~~~i~~ 62 (602) ..|++.. ...++-......+++....++ +...|.++ .+|-|..+|+-|.+.+.-. |..|.. 
T Consensus 16 ~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~~pV~i~L 94 (558) T protein:vir:10 16 STSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMA-LHPEADGAIEDVVNEAIVSDLYDSPVEVEL 94 (558) T ss_pred ccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCCceEEEEe Confidence 1221111 111111111112222222222 23345554 4788899999998877632 332322 Q ss_pred ecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC---CceEEEEEeCc Q lcl|NC_021537. 63 HPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD---GTPVGLAHVPA 139 (602) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~---G~~~~L~~l~p 139 (602) +..+.+....+.+....... ..++ +-..+..++ .+.|.+.|..|..++-|.+ ..+.+|.+||| T Consensus 95 ----d~~~~s~~iK~kI~eEF~~I---l~ll---~F~~~~~e~----fR~WYVDgRiyfHKiid~k~pk~GI~ELr~lDP 160 (558) T protein:vir:10 95 ----SNLNASNTLKKKIREEFRYI---KEMM---DFDKKSHEI----FRNWYVDGRVFYLKVIDTKNPQEGIQDLRYIDP 160 (558) T ss_pred ----cccCcchHHHHHHHHHHHHH---HHHh---ccchhhhHH----HhhheeeeEEEEEEEEeCCCccccceeeeeeCc Confidence 12222322333333332221 1121 122334444 4456778999999987643 35889999999 Q ss_pred ccccccccccccccccchhhhhcccCceeEEEEcC-------Ccceeecccc--cccccceeeecccceEEecCceeEEe Q lcl|NC_021537. 140 ATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQG-------RRRYFGEAGD--RYGDDKRFVDKETGEVASDAGELKNG 210 (602) Q Consensus 140 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~-------~~~~~~~~~~--~~~~~~~~~~~~~g~~~~~~~~~~~~ 210 (602) ..|+...... . ...+++....+.+. ....++.|.. 
.++- ..+|.+ ..+....+ T Consensus 161 r~i~~Vr~i~-~---------~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~------~~~~~~--~~~~~vkI 222 (558) T protein:vir:10 161 LKIKFIRQEK-R---------KPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPT------GMVGQM--GGKNSIKI 222 (558) T ss_pred ccceeeeeec-c---------ccccccceeeeecccceeeccceeEeeeecCCccccc------ccceee--cCCCceee Confidence 9997533211 0 11112222222221 1111222211 1111 111221 12233566 Q ss_pred chhHEEEecC-CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCHHHHHHHHHHHHHhhc Q lcl|NC_021537. 211 PANELIFLPN-PSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSEDSKEDLRNLMDNLKG 288 (602) Q Consensus 211 ~~~eviH~r~-~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~~~~~~l~~~~~~~~g 288 (602) +.+-|.+..- .-..++-+=+|-|..|.+.+.....++...--|=-.-+.-+-|..+. |......+.+-+++.+..++. T Consensus 223 ~~dAI~y~hSGL~d~~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KN 302 (558) T protein:vir:10 223 AKDSITMCTSGLVDRNKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRN 302 (558) T ss_pred chhheeeecccceecCCCeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccc Confidence 6655544431 11112223356677777766666655554443322222233333332 222223333334443333321 Q ss_pred c----------cccCcceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhccc Q lcl|NC_021537. 289 S----------RYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINVTS 357 (602) Q Consensus 289 ~----------~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~ 357 (602) . .+..+.+..-+ .-++.--.-..+.+++.|.-. 
+.-+| +=.++..+...++++||.+.|+..+ T Consensus 303 klVYDa~TGev~ddrk~msMlE---DyWLpRReGgrgTEItTLpGgqnLgem---~DV~YF~kKLy~aLnVP~SRl~~e~ 376 (558) T protein:vir:10 303 KLVYDANTGEVRDDRKFMSMME---DFWLPRREGGRGTEITTLPGGQNLGEL---SDVDYFQKKLYRALGVPESRIAAEG 376 (558) T ss_pred eEEEeccCceecccchhhhhHh---hhcccccCCCCccceeeccccCCcchH---HHHHHHHHHHHHHhCCCccccCCCC Confidence 0 00000000000 000000000011122222111 11123 3335677899999999999997544 Q ss_pred cCCccCHHH---HHHHHHHHHHHHHHHHHHHHHhhhcCCc---------cc---cccceEEEeccchhcchhHHHHHHHH Q lcl|NC_021537. 358 TSNRANSKE---QTREFAKGIIEPEQAKFSARLYKIIHQD---------AL---DVDEWTIDFELRGAEQPEQDAKMAEQ 422 (602) Q Consensus 358 ~~~~sn~e~---~~~~f~~~~l~P~~~~ie~~ln~~Ll~~---------~~---~~~~~~~~f~~~~~~~~~~d~~~~~~ 422 (602) ..+...+.+ .-.-| ...|.-+..+|...|...|-.. .+ .....++.|..+.-.....+.+...+ T Consensus 377 ~f~~Gr~~EItRDEiKF-~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~ 455 (558) T protein:vir:10 377 GFNLGRSSEILRDELKF-AKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEG 455 (558) T ss_pred cccccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHH Confidence 434322211 12233 4456777777666665444322 11 11234555555543333334444333 Q ss_pred HHHHHHh-CC----cccHHHHHH-HhCC------------------CCCCCCccccccccccccccccccCCCcCccccc Q lcl|NC_021537. 423 RVRAMRL-AG----VGTVNEARE-ELDL------------------APFEDDRGDMTLSEFEAEFGADASDGDAEAMLTR 478 (602) Q Consensus 423 ~~~~~~~-~G----~~T~NE~R~-~~Gl------------------~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (602) ++..+-. .+ .++.+=+++ .|.+ +-+++++...++.....+.++++... .... 
T Consensus 456 Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~----~~~~ 531 (558) T protein:vir:10 456 RLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAME----GMGE 531 (558) T ss_pred HHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCchhc----cCCC Confidence 3332211 11 113222221 2222 11222211111110000000010000 0011 Q ss_pred ccccccccccccccccccccccccchhhhhcchhhhhhheecccccEEEE Q lcl|NC_021537. 479 SKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFGERELYL 528 (602) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~~~l~~ 528 (602) +..++..+....+..+ .||.+...+++ T Consensus 532 ~~~~~~~~~~~~~~~~-----------------------~~~~~~~~~~~ 558 (558) T protein:vir:10 532 QPVDPDLEAQAQAVDA-----------------------QYSKDTKKAEL 558 (558) T ss_pred CCcccccccchhhhhh-----------------------hhhhhhhhhcC Confidence 1111111111111111 12222211221 No 215 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=97.47 E-value=6e-05 Score=43.71 Aligned_cols=435 Identities=11% Similarity=0.025 Sum_probs=171.0 Q ss_pred CCCCccc---------cccc-ch-----hhhcccCc-cccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021537. 1 MSKAEET---------TQLD-ER-----HIATDVGR-GIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHP 64 (602) Q Consensus 1 ~~k~~~~---------~~~~-~~-----~~~~~~~~-~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~ 64 (602) ....++. +++. +. .... ++. |-.++..-..-+.+ +.+....+++.+|+-|.+-+-.|.-.. 
T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~w~~~~~~~~~~~~~--~~~l~~~i~~~~A~ll~~e~~~i~v~~ 91 (518) T protein:vir:78 15 LNGKPNGSEPELIPKYLPLVPDNQKEWSKDSY-LTSLWAQGYVPTVHDKLM--NSGTGNEIVVVAAEYISGKPLSIDVTG 91 (518) T ss_pred hcCCCCccchhccHHHhhhcccchhhhhhhhh-hhhhcccCCCCccccccc--cCChHHHHHHHHHHhhcCCCceEEecC Confidence 1111100 0000 00 0000 011 11111110111222 335667889999999987655543221 Q ss_pred CCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccc Q lcl|NC_021537. 65 SADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRV 144 (602) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~ 144 (602) .+..+ +....+.+...+.. ..+...++..+.+.+..|.+++.+..+ +|+ ..+.++++..+-+ T Consensus 92 ~~~~d--~e~~~~~l~~il~~--------------n~f~~~~~~~~e~a~a~G~~~~k~~~d-~~~-~~i~~v~ad~~~P 153 (518) T protein:vir:78 92 VNGSK--DENLTKQLKEALRI--------------DNFDSKSVKIVELAGGSGVSAVKINIL-NGR-PSISVHSSSQFWI 153 (518) T ss_pred ccccC--cHHHHHHHHHHHHh--------------ccHHHHHHHHHHHhhccCceEEEEEEE-CCe-eEEEEEcCCeeEE Confidence 11111 11112222222211 235566677788888899999887776 355 4788888888766 Q ss_pred ccccc---ccccccchhhhhcccCceeEEEEcC-------------Ccceeecc-----ccccccc-ceeeecccceEEe Q lcl|NC_021537. 145 RKTTT---TIEREDGEEVENIESGHGYVQVRQG-------------RRRYFGEA-----GDRYGDD-KRFVDKETGEVAS 202 (602) Q Consensus 145 ~~~~~---~~~~~~~~~~~~~~~~~~~~qi~~~-------------~~~~~~~~-----~~~~~~~-~~~~~~~~g~~~~ 202 (602) ..... ...+...... ......|.++-.. .....+.. +...+.. 
.......+....+ T Consensus 154 ~~~~g~~~~~~f~~~~~~--~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~ 231 (518) T protein:vir:78 154 DFKNNEPFRFNFFEEIPT--SNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHT 231 (518) T ss_pred EeecCcEEEEEEEEEeec--CCcceeEEEEEeeccccccceeecccceeEEEEEeeecCccccccccccccccccccccc Confidence 42211 0111100000 0000011111000 00000000 0000000 0000000000011 Q ss_pred cCce-eE---Ee-chhHEEEecCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCH Q lcl|NC_021537. 203 DAGE-LK---NG-PANELIFLPNPSP----LALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSE 273 (602) Q Consensus 203 ~~~~-~~---~~-~~~eviH~r~~~~----~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~ 273 (602) .+.. .. +. +.--+.|++...+ .+.++|+|.+..+...++.....-.-...-|+.|. +..++ +... T Consensus 232 ~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~-~~i~v--~~~~--- 305 (518) T protein:vir:78 232 NDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTK-TKIAA--SERM--- 305 (518) T ss_pred ccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCC-ceeee--chhH--- Confidence 1110 00 00 1112344443211 23468999999999888877766666666676643 33332 2211 Q ss_pred HHHHHHHHHHHH-hhcccccCcceeccCC-ccceecccc---ccccccccccccccchHHHHHHHHHHhhHHHHHHHhcC Q lcl|NC_021537. 274 DSKEDLRNLMDN-LKGSRYRTAILEVEEF-VDDHGLGDG---GSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGV 348 (602) Q Consensus 274 ~~~~~l~~~~~~-~~g~~nag~~~~~~~g-~~~~~~~~~---~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgV 348 (602) ++. ..++. .+....+..+ .-+..+... +.+..-.++.++. 
.-.+.++.+..+...+.|....|+ T Consensus 306 ---------l~~~~~~~~-~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~-~Ir~e~~~~~~~~~l~~~~~~~G~ 374 (518) T protein:vir:78 306 ---------FRKKVNKST-DKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQG-DFRDGSYRETMEYFAQKAVSKSGY 374 (518) T ss_pred ---------hccCCCCCC-CccccccCCCCceEEEecCcCCCCCccccceeeeec-ccChHHHHHHHHHHHHHHHHhhCC Confidence 110 00110 0111111111 111111100 0011111233322 224567888899999999999999 Q ss_pred ChHHhhccccCCccCHHHH-------------HHHHHHHHHHHHHHHHHHHHhhhcCC--ccccccceEEEeccchhcch Q lcl|NC_021537. 349 PPVLINVTSTSNRANSKEQ-------------TREFAKGIIEPEQAKFSARLYKIIHQ--DALDVDEWTIDFELRGAEQP 413 (602) Q Consensus 349 Pp~~lg~~~~~~~sn~e~~-------------~~~f~~~~l~P~~~~ie~~ln~~Ll~--~~~~~~~~~~~f~~~~~~~~ 413 (602) +|..+|... +. .++.+. ....++.+|.-++..+...+...... .......+.+.+++++.... T Consensus 375 s~~tfg~~~-~~-~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~ 452 (518) T protein:vir:78 375 NPATFNLGN-RE-VKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSV 452 (518) T ss_pred ChhhcCccc-cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCC Confidence 999888642 22 222211 11222233333333333322221111 11112234566666665545 Q ss_pred hHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCcccccccccccc--ccccccCCCcCccccccc Q lcl|NC_021537. 414 EQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAE--FGADASDGDAEAMLTRSK 480 (602) Q Consensus 414 ~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 480 (602) |.+..++..++++.+|+|+..++.+++... ..+.+.+.-+...... ....+...+..+..+.+. 
T Consensus 453 --D~~~~~~~~~~~v~aGimS~e~~i~~~~~~-~~deea~~e~~ri~~E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 453 --NLNELSSTLNNMNSALAMSVEEKVKLIHPK-WEDEEIQAEVKRIYLENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred --CHHHHHHHHHHHHhcCCCCHHHHHHHhCCC-CCHHHHHHHHHHHHHHhcccCCCCCccccCCCCCCC Confidence 444555677889999999999977665322 2222221111000000 000011111111111111 No 216 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=97.47 E-value=6.1e-05 Score=43.70 Aligned_cols=432 Identities=11% Similarity=-0.002 Sum_probs=168.9 Q ss_pred CCCCc------ccccccc-----------hhhh----cccCccccC---CCCHHHHHHHHhhhHHHHHHHHHHHHhhccC Q lcl|NC_021537. 1 MSKAE------ETTQLDE-----------RHIA----TDVGRGIQP---PYNPETLAAFQELNETHQACIRKKSRYEAGY 56 (602) Q Consensus 1 ~~k~~------~~~~~~~-----------~~~~----~~~~~~i~p---~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~ 56 (602) +.|.. +..+..+ .++. .--|.+-.. -.++..-.+-...-.....+++.+|+-+.+- T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv~~e 90 (500) T protein:vir:98 11 VTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAAKKIASLVFNE 90 (500) T ss_pred HHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHHHHHhhhhcCC Confidence 11100 0000000 0000 000000000 0000000000011145566777777777765 Q ss_pred ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEE Q lcl|NC_021537. 57 GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAH 136 (602) Q Consensus 57 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~ 136 (602) +-.+.-. +....+.+...+ ....+...++..+.+.+..|.+++.+..+. |. 
+.+.+ T Consensus 91 ~~~i~~~--------d~~~~~~l~~il--------------~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~-~~I~~ 146 (500) T protein:vir:98 91 QAEIKVD--------DDAANEFISETL--------------KNDRFNKNFERYLESCLALGGLAMRPYVDG-DK-VRVAF 146 (500) T ss_pred cceEecC--------ChHHHHHHHHHH--------------hhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEE Confidence 4443210 111112222211 122456677788888889999999888874 33 46888 Q ss_pred eCcccccccccccccccccchh---hhhcccCceeEE-E--E---cCCccee--ecccccccccc-eeeecccceEEecC Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEE---VENIESGHGYVQ-V--R---QGRRRYF--GEAGDRYGDDK-RFVDKETGEVASDA 204 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~---~~~~~~~~~~~q-i--~---~~~~~~~--~~~~~~~~~~~-~~~~~~~g~~~~~~ 204 (602) ++|..+-+...........-.. ......+..|+. + . ++..+.+ ..|-....... .-+...+ ++... T Consensus 147 v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~--~~~~l 224 (500) T protein:vir:98 147 VQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSE--VYKDL 224 (500) T ss_pred EcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCccccccc--ccCCc Confidence 9998877643221111000000 000000111110 0 0 0110000 00000000000 0000000 00000 Q ss_pred cee---EEechhHEEEecCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHH Q lcl|NC_021537. 205 GEL---KNGPANELIFLPNPSP----LALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKE 277 (602) Q Consensus 205 ~~~---~~~~~~eviH~r~~~~----~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~ 277 (602) ... ..++---..||+.+.+ .+.++|+|.+..+...++.....-.-..+-|..|.. ..+ ++...+... 
T Consensus 225 ~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~-~i~--v~~~~l~~~--- 298 (500) T protein:vir:98 225 KDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQR-RVA--VPESLTALT--- 298 (500) T ss_pred CcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcc-eee--echHHhccc--- Confidence 001 1112223556664422 235689999999998887766665555566665443 222 221110000 Q ss_pred HHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccc Q lcl|NC_021537. 278 DLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTS 357 (602) Q Consensus 278 ~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~ 357 (602) .. ..++...+....-....-+..+... .+.+-.++.++ ....+-++.+..+...++|+...|+++..+|+.. T Consensus 299 -----~~-~~~g~~~~~~~~d~~~~~~~~~~~~-~~~~~~i~~~~-~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~ 370 (500) T protein:vir:98 299 -----VR-TTDGDVVPRPRFESDQNVYIRMGGR-DLDSSAIQDLT-TPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDG 370 (500) T ss_pred -----CC-CCCccccCCcccCCCcceEEEcCCC-CCcCcceeEec-cccChHHHHHHHHHHHHHHHHHhCCCccccccCc Confidence 00 0000000000000000111111111 11111233332 1223557888889999999999999999998766 Q ss_pred cCCccCHHHH-------------HHHHHHHHHHHHHHHHHHHHhh-hcCCccccccceEEEeccchhcchhHHHHHHHHH Q lcl|NC_021537. 358 TSNRANSKEQ-------------TREFAKGIIEPEQAKFSARLYK-IIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQR 423 (602) Q Consensus 358 ~~~~sn~e~~-------------~~~f~~~~l~P~~~~ie~~ln~-~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~ 423 (602) ++. .|+.+. .+..++.+|+-++..+.+..+. .+. .......+.+.+++++-... |.+...+. 
T Consensus 371 ~g~-~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~-~~~~~~~~~v~v~f~d~i~~--d~~~~~~~ 446 (500) T protein:vir:98 371 KSM-KTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLY-QSEVPSMDNISISLDDGVFT--DRDAELDY 446 (500) T ss_pred Ccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-CCCCCCCcceEEEeCCCCCC--CHHHHHHH Confidence 543 344332 1222334444444444432221 111 11112334455555544333 33344456 Q ss_pred HHHHHhCCcccHHHHHHHh-CCCCCCCCccccccccccccccccccCCCcCccccccccccccc Q lcl|NC_021537. 424 VRAMRLAGVGTVNEAREEL-DLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLE 486 (602) Q Consensus 424 ~~~~~~~G~~T~NE~R~~~-Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (602) ..+++.+|+|+.-+++.+. |+ ++++....+..... ............++.-. + T Consensus 447 ~~~~v~aGi~s~~~~i~~~~g~---~eeea~~~l~~i~~---E~~~~~~~~~~~~~~~g----~ 500 (500) T protein:vir:98 447 WIKVVNAGFGTREMAIQKVLNV---TEEKAQEIAAEINT---GIVDEINQQRTDTHLYG----E 500 (500) T ss_pred HHHHHHcCCCCHHHHHHhcCCC---CHHHHHHHHHHHHH---hccccCCCCCccccccC----C Confidence 7789999999999998654 54 22222221111110 00000000000000000 0 No 217 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=97.47 E-value=6.1e-05 Score=43.70 Aligned_cols=432 Identities=11% Similarity=-0.002 Sum_probs=168.9 Q ss_pred CCCCc------ccccccc-----------hhhh----cccCccccC---CCCHHHHHHHHhhhHHHHHHHHHHHHhhccC Q lcl|NC_021537. 1 MSKAE------ETTQLDE-----------RHIA----TDVGRGIQP---PYNPETLAAFQELNETHQACIRKKSRYEAGY 56 (602) Q Consensus 1 ~~k~~------~~~~~~~-----------~~~~----~~~~~~i~p---~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~ 56 (602) +.|.. +..+..+ .++. .--|.+-.. 
-.++..-.+-...-.....+++.+|+-+.+- T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv~~e 90 (500) T protein:vir:30 11 VTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAAKKIASLVFNE 90 (500) T ss_pred HHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHHHHHhhhhcCC Confidence 11100 0000000 0000 000000000 0000000000011145566777777777765 Q ss_pred ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEE Q lcl|NC_021537. 57 GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAH 136 (602) Q Consensus 57 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~ 136 (602) +-.+.-. +....+.+...+ ....+...++..+.+.+..|.+++.+..+. |. +.+.+ T Consensus 91 ~~~i~~~--------d~~~~~~l~~il--------------~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~-~~I~~ 146 (500) T protein:vir:30 91 QAEIKVD--------DDAANEFISETL--------------KNDRFNKNFERYLESCLALGGLAMRPYVDG-DK-VRVAF 146 (500) T ss_pred cceEecC--------ChHHHHHHHHHH--------------hhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEE Confidence 4443210 111112222211 122456677788888889999999888874 33 46888 Q ss_pred eCcccccccccccccccccchh---hhhcccCceeEE-E--E---cCCccee--ecccccccccc-eeeecccceEEecC Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEE---VENIESGHGYVQ-V--R---QGRRRYF--GEAGDRYGDDK-RFVDKETGEVASDA 204 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~---~~~~~~~~~~~q-i--~---~~~~~~~--~~~~~~~~~~~-~~~~~~~g~~~~~~ 204 (602) ++|..+-+...........-.. ......+..|+. + . ++..+.+ ..|-....... .-+...+ ++... 
T Consensus 147 v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~--~~~~l 224 (500) T protein:vir:30 147 VQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSE--VYKDL 224 (500) T ss_pred EcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCccccccc--ccCCc Confidence 9998877643221111000000 000000111110 0 0 0110000 00000000000 0000000 00000 Q ss_pred cee---EEechhHEEEecCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHH Q lcl|NC_021537. 205 GEL---KNGPANELIFLPNPSP----LALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKE 277 (602) Q Consensus 205 ~~~---~~~~~~eviH~r~~~~----~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~ 277 (602) ... ..++---..||+.+.+ .+.++|+|.+..+...++.....-.-..+-|..|.. ..+ ++...+... T Consensus 225 ~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~-~i~--v~~~~l~~~--- 298 (500) T protein:vir:30 225 KDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQR-RVA--VPESLTALT--- 298 (500) T ss_pred CcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcc-eee--echHHhccc--- Confidence 001 1112223556664422 235689999999998887766665555566665443 222 221110000 Q ss_pred HHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccc Q lcl|NC_021537. 278 DLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTS 357 (602) Q Consensus 278 ~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~ 357 (602) .. ..++...+....-....-+..+... .+.+-.++.++ ....+-++.+..+...++|+...|+++..+|+.. 
T Consensus 299 -----~~-~~~g~~~~~~~~d~~~~~~~~~~~~-~~~~~~i~~~~-~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~ 370 (500) T protein:vir:30 299 -----VR-TTDGDVVPRPRFESDQNVYIRMGGR-DLDSSAIQDLT-TPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDG 370 (500) T ss_pred -----CC-CCCccccCCcccCCCcceEEEcCCC-CCcCcceeEec-cccChHHHHHHHHHHHHHHHHHhCCCccccccCc Confidence 00 0000000000000000111111111 11111233332 1223557888889999999999999999998766 Q ss_pred cCCccCHHHH-------------HHHHHHHHHHHHHHHHHHHHhh-hcCCccccccceEEEeccchhcchhHHHHHHHHH Q lcl|NC_021537. 358 TSNRANSKEQ-------------TREFAKGIIEPEQAKFSARLYK-IIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQR 423 (602) Q Consensus 358 ~~~~sn~e~~-------------~~~f~~~~l~P~~~~ie~~ln~-~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~ 423 (602) ++. .|+.+. .+..++.+|+-++..+.+..+. .+. .......+.+.+++++-... |.+...+. T Consensus 371 ~g~-~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~-~~~~~~~~~v~v~f~d~i~~--d~~~~~~~ 446 (500) T protein:vir:30 371 KSM-KTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLY-QSEVPSMDNISISLDDGVFT--DRDAELDY 446 (500) T ss_pred Ccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-CCCCCCCcceEEEeCCCCCC--CHHHHHHH Confidence 543 344332 1222334444444444432221 111 11112334455555544333 33344456 Q ss_pred HHHHHhCCcccHHHHHHHh-CCCCCCCCccccccccccccccccccCCCcCccccccccccccc Q lcl|NC_021537. 424 VRAMRLAGVGTVNEAREEL-DLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLE 486 (602) Q Consensus 424 ~~~~~~~G~~T~NE~R~~~-Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (602) ..+++.+|+|+.-+++.+. |+ ++++....+..... ............++.-. 
+ T Consensus 447 ~~~~v~aGi~s~~~~i~~~~g~---~eeea~~~l~~i~~---E~~~~~~~~~~~~~~~g----~ 500 (500) T protein:vir:30 447 WIKVVNAGFGTREMAIQKVLNV---TEEKAQEIAAEINT---GIVDEINQQRTDTHLYG----E 500 (500) T ss_pred HHHHHHcCCCCHHHHHHhcCCC---CHHHHHHHHHHHHH---hccccCCCCCccccccC----C Confidence 7789999999999998654 54 22222221111110 00000000000000000 0 No 218 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=97.27 E-value=0.00011 Score=42.28 Aligned_cols=448 Identities=12% Similarity=0.100 Sum_probs=171.7 Q ss_pred CCCCccc---ccccchh--hhc-ccCc--cccCCCC-H----HHHHHHHhhhHHHHHHHHHHHHhhccC-----ceEEEE Q lcl|NC_021537. 1 MSKAEET---TQLDERH--IAT-DVGR--GIQPPYN-P----ETLAAFQELNETHQACIRKKSRYEAGY-----GFEIVA 62 (602) Q Consensus 1 ~~k~~~~---~~~~~~~--~~~-~~~~--~i~p~~~-~----~~l~~~~~~~~~v~~cI~~ia~~ia~~-----~~~i~~ 62 (602) ..|++.. ...++.. .+. .++. .+++.+. - ...|.++. +|-|..||+-|.+.+.-. |..|.. T Consensus 16 ~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~-~pEvd~Av~eIVneaiv~d~~~~pV~i~L 94 (537) T protein:vir:10 16 VPKGPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVL-NPECDSAVDDVVNETICGNFDDVPISIDL 94 (537) T ss_pred cccCCcccCCCcccccceeecccccccccccccccchHHHHHHHHHHHhh-ccchhhHHHHhhcceeEecCCCceEEEEe Confidence 1111110 0001100 000 1111 1233222 2 33445553 688899999988877632 323322 Q ss_pred ecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC---CceEEEEEeCc Q lcl|NC_021537. 63 HPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD---GTPVGLAHVPA 139 (602) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~---G~~~~L~~l~p 139 (602) . .++.+....+.+....... 
..++ +-..+..+++ +.|.+.|..|..++-|.+ ..+.+|.+||| T Consensus 95 d----~~~~s~~iK~kI~eEF~~I---l~ll---~F~~~~~e~f----R~WYVDgRi~fhKiid~k~pk~GI~ELr~lDP 160 (537) T protein:vir:10 95 H----NLKQSEKIKKLIRSEFDEI---LRLL---DFDNRAYEIF----RRWYVDGRLFFHKVIDPKKPRQGLVELRYVDP 160 (537) T ss_pred c----ccccchHHHHHHHHHHHHH---HHHh---ccchhhhHHH----hhheeeeEEEEEEEEeCCCccccceeeeeeCC Confidence 1 1222222223333322211 1121 1123344444 456778999999987643 35889999999 Q ss_pred ccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEEec Q lcl|NC_021537. 140 ATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIFLP 219 (602) Q Consensus 140 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r 219 (602) ..|+.......-..+. ......+.++..+...+| .| ++ .| +....+....++.+-|.+.. T Consensus 161 r~i~~vR~i~~~~~~~------~~~~~~~~~v~~~~~eyf-~y------np------~g-~~~~~~~~vkI~~dAI~y~h 220 (537) T protein:vir:10 161 RKIRKVTEYEAKRPEA------LRTQDLNQQLTQQSASYF-LY------NP------KG-LKNSTNQGMKIAPDSIAYCH 220 (537) T ss_pred ccceeeEeecccCCcc------ceEEecceeeeeccccee-ee------cc------cc-ccccCCCceeccHhheeeec Confidence 9997433211100000 000111122222211111 00 00 01 11123345677775544443 Q ss_pred -CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCHHHHHHHHHHHHHhhcc-------- Q lcl|NC_021537. 220 -NPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSEDSKEDLRNLMDNLKGS-------- 289 (602) Q Consensus 220 -~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~~~~~~l~~~~~~~~g~-------- 289 (602) +.-..+.-+.+|-|..|.+.+.....++...--|=-.-+.-+-|..+. |......+.+-+++.+..++.. 
T Consensus 221 SGl~d~n~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TG 300 (537) T protein:vir:10 221 SGIQDLNKNMVLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTG 300 (537) T ss_pred ccceeCCCCeeeeeehhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCc Confidence 122244567788888888887777666665544322233333344333 2222333333344444433210 Q ss_pred --cccCcceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCH-- Q lcl|NC_021537. 290 --RYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANS-- 364 (602) Q Consensus 290 --~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~-- 364 (602) .+..+.+..-. .-++.--.-..+.+++.|.-. +.-+| +=.++..+...++++||.+.|+..+..+...+ T Consensus 301 ev~ddrk~msMlE---DyWLPRReGgrgTEItTLpGgqnlgem---~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~E 374 (537) T protein:vir:10 301 EIKDDKKFMSMLE---DFWLPRREGGRGTEISTLPGGQNLGEL---EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAE 374 (537) T ss_pred eecccchhhhhhh---hhcccccCCCcccceeeccccCCcChH---HHHHHHHHHHHHHhCCCccccCCCCcccccccch Confidence 00000000000 000000000012222222211 11233 33456778999999999999975444333222 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHhhhcCCc---------cc---cccceEEEeccchhcchhHHHHHHHHHHHHHHhC- Q lcl|NC_021537. 365 -KEQTREFAKGIIEPEQAKFSARLYKIIHQD---------AL---DVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLA- 430 (602) Q Consensus 365 -e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~---------~~---~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~- 430 (602) .-.-.-| ...|.-+..+|...|...|-.. .+ .....++.|..+.-.....+.+...+++..+-.. 
T Consensus 375 ItRDEiKF-~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~d 453 (537) T protein:vir:10 375 ITRDEVKF-QKFIARLRKRFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMD 453 (537) T ss_pred hhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhh Confidence 1122233 4456677777666665443322 11 1123455555554333333444433333322111 Q ss_pred ----CcccHHHHHH-HhCC------------------CCCCCCccccccccccccccccccCCCcCcccccc-ccccccc Q lcl|NC_021537. 431 ----GVGTVNEARE-ELDL------------------APFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRS-KAAPPLE 486 (602) Q Consensus 431 ----G~~T~NE~R~-~~Gl------------------~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 486 (602) -.++.+=+|+ .|.+ +-+++++...-...+... ..+..+....+..+. ..+++.. T Consensus 454 pyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 531 (537) T protein:vir:10 454 PYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGD--EEPVPEGGEEPQTDPNSAVSPAD 531 (537) T ss_pred hhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccccccccCCCC--cccCCCCCCCcccCCccCCCCCC Confidence 1112222221 1221 112222111111100000 000000000111000 1111111 Q ss_pred cccccc Q lcl|NC_021537. 487 NKIGER 492 (602) Q Consensus 487 ~~~~~~ 492 (602) +...+. T Consensus 532 ~~~~~~ 537 (537) T protein:vir:10 532 QKRGEL 537 (537) T ss_pred ccCCCC Confidence 111111 No 219 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=96.88 E-value=0.00028 Score=40.04 Aligned_cols=431 Identities=11% Similarity=0.064 Sum_probs=173.6 Q ss_pred CCCCcccccc----cch------hhhcccCc-cccCCCC------HHHH----HHHHhhhHHHHHHHHHHHHhhccC--- Q lcl|NC_021537. 1 MSKAEETTQL----DER------HIATDVGR-GIQPPYN------PETL----AAFQELNETHQACIRKKSRYEAGY--- 56 (602) Q Consensus 1 ~~k~~~~~~~----~~~------~~~~~~~~-~i~p~~~------~~~l----~~~~~~~~~v~~cI~~ia~~ia~~--- 56 (602) +++...|-.. ++. 
......++ .+.+-.+ -..| |.++. +|-|..||+-|.+.+.-. T Consensus 16 ~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~-~pEvd~Av~eIvne~iv~d~~ 94 (511) T protein:vir:56 16 EKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVKELIKSYRALAE-YHEVDDAIQEIVDEAIVYEND 94 (511) T ss_pred ccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchHHHHHHHHHHhh-ccchhhHHHHhhcceeEecCC Confidence 2222111111 111 00111122 1222222 2333 44443 688899999988877632 Q ss_pred --ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEE Q lcl|NC_021537. 57 --GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGL 134 (602) Q Consensus 57 --~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L 134 (602) |..|.. +..+.+....+.+....... ..++ +-..+..+++ +.|.+.|..|..++-|.+..+.+| T Consensus 95 ~~pV~l~l----d~~~~s~~iK~kI~eeF~~I---l~ll---~F~~~~~~~f----R~WYVDgRi~fHkiid~k~GI~eL 160 (511) T protein:vir:56 95 KEVVWLNL----DNTDFSENIKAKINEEFDRV---VSLL---QMRKHGYKWF----RKWYVDSRIYFHKILDKDNNIIEL 160 (511) T ss_pred CceEEEEe----cccCcchHHHHHHHHHHHHH---HHHh---ccchhhhHHH----hhhhhcceEEEEEEeccccceeeh Confidence 333322 11222222223333322211 1121 1223344444 456677999999998887789999 Q ss_pred EEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhH Q lcl|NC_021537. 135 AHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANE 214 (602) Q Consensus 135 ~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~e 214 (602) .+|||..|+....... ...++ +.++.+. ..++.|.......+..... .........++.+. 
T Consensus 161 r~lDPr~i~~vr~i~~-~~~~~------------~~v~~~~-~ey~~Y~~~~~~~~~~~~~-----~~~~~~~vkI~~da 221 (511) T protein:vir:56 161 RPLNPMKMELVREIQK-ETIDG------------VEVVKGT-LEYYVYKQSDYKMPSWMSA-----TNRAQTSFRIPKDA 221 (511) T ss_pred hhcCcccchhhhhhhc-ccccc------------cccccce-eeeeEecCCCcccCccccc-----ccccccceeechhh Confidence 9999999986433211 11111 1111111 1111111100000000000 00112446899999 Q ss_pred EEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccC-CHHHHHHHHHHHHHhhc-- Q lcl|NC_021537. 215 LIFLPNP---SPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTL-SEDSKEDLRNLMDNLKG-- 288 (602) Q Consensus 215 viH~r~~---~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~-~~~~~~~l~~~~~~~~g-- 288 (602) |.|..-- .+.+..+.+|-|..|.+.+.....++...--|=-.-+.-+-|..+.=+.+ ...+.+-+++.+..++. T Consensus 222 I~y~hSGL~d~~~~~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNkl 301 (511) T protein:vir:56 222 IVFAHSGLMRGCADDPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRV 301 (511) T ss_pred eeeecccceeccCCCCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceE Confidence 9666522 23555678898999988877776666655443222333333443332222 33333334444333221 Q ss_pred --------ccccCcceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc- Q lcl|NC_021537. 289 --------SRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST- 358 (602) Q Consensus 289 --------~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~- 358 (602) ..+..+.+..-+. -++.--.-..+.+++.|.-. 
+.-+| +=.++..+...++++||.+.|+..++ T Consensus 302 VYDa~TGev~ddrk~msMlED---yWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kKLy~aLnVP~SRl~~e~q~ 375 (511) T protein:vir:56 302 VYDTQTGQVKNTTNAMSMLED---YYLPRREGSKGTEVSTLPGGQSLGDI---EDVLYFNRKLYKAMRIPTSRAASEDQT 375 (511) T ss_pred EEeccCceeccchhhhhhHhh---hcccccCCCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCcccccCCCCc Confidence 0011111100000 00000000012222222211 11233 33456778999999999999964322 Q ss_pred CCcc-----CHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc---------cc---cccceEEEeccchhcchhHHHHHHH Q lcl|NC_021537. 359 SNRA-----NSKEQTREFAKGIIEPEQAKFSARLYKIIHQD---------AL---DVDEWTIDFELRGAEQPEQDAKMAE 421 (602) Q Consensus 359 ~~~s-----n~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~---------~~---~~~~~~~~f~~~~~~~~~~d~~~~~ 421 (602) ++++ .+.-.-.-| ...|.-+...|...|...|-.+ .+ .....++.|..+.-.....+.+... T Consensus 376 ~~f~~Gr~~EItRDEiKF-~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~ 454 (511) T protein:vir:56 376 GGINFGQGAEITRDELKF-TKFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILN 454 (511) T ss_pred cccccccchhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHH Confidence 2221 122222233 4456677777666655443322 11 1123455555554333333444433 Q ss_pred HHHHHHHh-CC----cccHHHHH-HHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccc Q lcl|NC_021537. 422 QRVRAMRL-AG----VGTVNEAR-EELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKI 489 (602) Q Consensus 422 ~~~~~~~~-~G----~~T~NE~R-~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (602) +++..+-. .+ .++.+=++ ..|.+.-.+-...+ ......... +.-+ +.+... 
T Consensus 455 ~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDeei~~~~-------k~I~~E~k~-----~~~~-----~~e~~f 511 (511) T protein:vir:56 455 SRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQITAMQ-------SEIDEEETN-----PRFQ-----QDDQGF 511 (511) T ss_pred HHHHHHHHhcchhccccchHHHHHHHhccCHHHHHHHH-------HHHHHhhcC-----CCCC-----CcccCC Confidence 33332211 11 11333232 22333110000000 000000000 0000 000001 No 220 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=96.63 E-value=0.00046 Score=38.89 Aligned_cols=440 Identities=8% Similarity=-0.003 Sum_probs=162.5 Q ss_pred CCCCcccccc----cchhhhcccCccccCC-----------CCH-HHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021537. 1 MSKAEETTQL----DERHIATDVGRGIQPP-----------YNP-ETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHP 64 (602) Q Consensus 1 ~~k~~~~~~~----~~~~~~~~~~~~i~p~-----------~~~-~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~ 64 (602) +.+-..+++. ....+...-......+ -+. ..=.++. +.+...+|+..+..+.|-|.++...+ T Consensus 21 i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~--~nf~k~Ivd~~~~yl~G~Pv~~~~~d 98 (537) T protein:vir:78 21 ITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKIS--HGFFTELVDQLAQYLLSNGVEVKVKD 98 (537) T ss_pred HHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccc--cchHHHHHHHHhhhhcccCceeecCc Confidence 1111100000 0001110000000000 000 0001222 34667788999999999998765321 Q ss_pred CCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccc Q lcl|NC_021537. 65 SADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRV 144 (602) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~ 144 (602) . .+.+..+.+..+. 
...+......+..++..+|.||..+-++.+|.+ .+..++|..+-+ T Consensus 99 ~-----~~~e~~~~l~~~~---------------~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~-~~~~i~p~~~~p 157 (537) T protein:vir:78 99 E-----DNTQLDEILQEYF---------------DEDFQATIDTLVTNASKKGFEGIFARTTSEGKL-KFQTVDGLTLIP 157 (537) T ss_pred c-----hhHHHHHHHHHHh---------------hccHHHHHHHHHHHHhhcCeeEEEeeecCCCce-EEEEEccceeEE Confidence 1 1111112222111 123445667778889999999999988988865 577888887755 Q ss_pred cccccccccccchhhhhcc------------cCceeEEEEcCCcceeeccccccc-ccce-----eeecccce------- Q lcl|NC_021537. 145 RKTTTTIEREDGEEVENIE------------SGHGYVQVRQGRRRYFGEAGDRYG-DDKR-----FVDKETGE------- 199 (602) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~------------~~~~~~qi~~~~~~~~~~~~~~~~-~~~~-----~~~~~~g~------- 199 (602) ..+.+. ... ...+.+ ....++.+......+.+....... .... ...+.... T Consensus 158 v~d~~~-~~~---~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 233 (537) T protein:vir:78 158 VFDDYG-VLK---MIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEEST 233 (537) T ss_pred EEcCCC-Cce---eEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeeccccc Confidence 433221 000 000000 001111121111111111100000 0000 00000000 Q ss_pred ---EE-ec--CceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCC- Q lcl|NC_021537. 200 ---VA-SD--AGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLS- 272 (602) Q Consensus 200 ---~~-~~--~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~- 272 (602) .. .. ......|..=.|++|+.. ..|+|.|......++....+..-.++.+.....|-.+| .|..+. 
T Consensus 234 ~~~~~~~~~~~~~~~~~g~iPvv~f~nn-----~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi--~g~~~~~ 306 (537) T protein:vir:78 234 DADFEDTDGYQVLGRSYSKFPFQLLYNN-----KDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVV--KGFSGDS 306 (537) T ss_pred cccccccccccccccCCcceeEEEeccC-----ccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeee--ecCCCcc Confidence 00 00 000011222235555543 36889888888888777777666677766655554444 443222 Q ss_pred -HHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChH Q lcl|NC_021537. 273 -EDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPV 351 (602) Q Consensus 273 -~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~ 351 (602) ++.... ++. .+.+.+.+.+.+ +++. +...+ +......++.+.+.|...-.+|. T Consensus 307 ~~~~~~~----l~~------~~~i~v~~d~~~------------v~~l--~~~~~-~~~~e~~ld~L~~~I~~~s~~~~- 360 (537) T protein:vir:78 307 TDKLRQN----IKA------KKMIGVNGDNAG------------MEIQ--TVSIP-YEARKAKMDIDVENIYRSGMGFN- 360 (537) T ss_pred chhHHHH----Hhh------cCceeecCCCCc------------eeEE--EecCC-HHHHHHHHHHHHHHHHHhcCCCC- Confidence 221221 111 111112111111 1111 11111 12233445555555554433332 Q ss_pred HhhccccCCccCHH------------HHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHH Q lcl|NC_021537. 352 LINVTSTSNRANSK------------EQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKM 419 (602) Q Consensus 352 ~lg~~~~~~~sn~e------------~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~ 419 (602) ......||-|... +.....+..+|+-.++.|...++.+-... .......+.|...-.. +... 
T Consensus 361 -~~~~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~-~d~~~i~i~f~~~~P~----n~~e 434 (537) T protein:vir:78 361 -STAVGDGNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGE-YDSNDICFEIEPHVLA----NELD 434 (537) T ss_pred -CccccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc-cccceeeEEeccCCCC----CHHH Confidence 1222222322211 11223334455555555555544321111 1223455666544322 3344 Q ss_pred HHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccc-------cccccc--cccccCCCc-------Ccccccccccc Q lcl|NC_021537. 420 AEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLS-------EFEAEF--GADASDGDA-------EAMLTRSKAAP 483 (602) Q Consensus 420 ~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~-------~~~~~~--~~~~~~~~~-------~~~~~~~~~~~ 483 (602) .++.+.+++..|+++..-+.+.+++ +++.+...... ...... ..+.+..+. ....+.....+ T Consensus 435 ~a~~~~~l~~~giiS~eT~l~~~p~--vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (537) T protein:vir:78 435 IATTRKTEAETEALKIGNIMTVAPR--IGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQP 512 (537) T ss_pred HHHHHHHHHhcCcchHHHHHHhCCC--CCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCC Confidence 5567788899999998888887754 33221110000 000000 000000000 00000000000 Q ss_pred cccccccccccccccccccchhhhhcc Q lcl|NC_021537. 484 PLENKIGERDSVDVDVSKDPIEQTTFS 510 (602) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~m~~~~v~ 510 (602) +.+. .+........+...-.-.|+. T Consensus 513 ~~d~--~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:78 513 PVDP--NQPVADPNVVPPTDPNAVPQT 537 (537) T ss_pred CCCc--cCCCCCCCCCCCCCCccCCCC Confidence 0000 000000000000000001111 No 221 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=96.12 E-value=0.00098 Score=37.08 Aligned_cols=391 Identities=9% Similarity=-0.023 Sum_probs=153.7 Q ss_pred CCCC----c-ccccc--cchhhhcccCccccC--------CCCH-HHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021537. 
1 MSKA----E-ETTQL--DERHIATDVGRGIQP--------PYNP-ETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHP 64 (602) Q Consensus 1 ~~k~----~-~~~~~--~~~~~~~~~~~~i~p--------~~~~-~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~ 64 (602) +.|- . .-..+ ..+++... ...... ..+. ..-.+++ +++.+.+|+..+..+.|-|+.+...+ T Consensus 6 i~~~i~~~~~~~~r~~~~~~YY~g~-~~i~~~~~~~~~~~~~~~~~~~~ki~--~n~~~~Ivd~~~~yl~G~p~~~~~~~ 82 (451) T protein:vir:10 6 IRAIISADAARRQEILQAKSYYYNK-NDILKKGVVVQNRDENPLRNADNRIS--HNFHEILVDEKASYMFTYPVLFDIDN 82 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccc-Cccccccccccccccccccccccccc--cchHHHHHHhhhhheecccceeecCC Confidence 1110 0 00000 00001000 000000 0000 0001222 46778889999999988887654211 Q ss_pred CCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC-------ceEEEEEe Q lcl|NC_021537. 65 SADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG-------TPVGLAHV 137 (602) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G-------~~~~L~~l 137 (602) +....+.+..++ ...+......+.++.+.+|.||..+-++.+. ....+..+ T Consensus 83 -------~~~~~~~~~~~~---------------~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i 140 (451) T protein:vir:10 83 -------NKELNEKVTDVL---------------GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVV 140 (451) T ss_pred -------cHHHHHHHHHHh---------------ccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEE Confidence 111112222111 1245667777889999999999988877541 23457778 Q ss_pred CcccccccccccccccccchhhhhcccCceeEE------------------EEcCCcceeecccccccccceeeecccce Q lcl|NC_021537. 138 PAATVRVRKTTTTIEREDGEEVENIESGHGYVQ------------------VRQGRRRYFGEAGDRYGDDKRFVDKETGE 199 (602) Q Consensus 138 ~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~q------------------i~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 199 (602) +|..+-+..+... . +. ...+.+|+. +......+.+...... ..+. 
T Consensus 141 ~p~~~~~vydd~~-~---~~----~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~---------~~~~ 203 (451) T protein:vir:10 141 NTEEIIPIYRNGI-E---RE----LEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVS---------CCGS 203 (451) T ss_pred cccceEEEEcCCC-C---Cc----eEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccC---------cccc Confidence 8877754432210 0 00 001111221 2111111111110000 0000 Q ss_pred EEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHH Q lcl|NC_021537. 200 VASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDL 279 (602) Q Consensus 200 ~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l 279 (602) .....+....+..=.|++++.. ..|.|.++.....++....+..-.++.+...+.|-.+++--+...+++....+ T Consensus 204 ~~~~~~~~~~~g~vPvv~~~nn-----~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 278 (451) T protein:vir:10 204 QIEHITVQHRFNSVPFVEFSNN-----IKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKEL 278 (451) T ss_pred ccccccccCCCCeeeEEEeccC-----CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHH Confidence 0000111111222235666542 25778887777777666655555555555555554444321111222222221 Q ss_pred HHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccC Q lcl|NC_021537. 280 RNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTS 359 (602) Q Consensus 280 ~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 359 (602) +. .+++.+..... ....+.++. +... .+..+....+...+.|...-++|.. 
.....+ T Consensus 279 ~~-----------~~~i~~~~~~~-------~~~~~~~~l--~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~--~~~~~g 335 (451) T protein:vir:10 279 KR-----------YKTIKTETDSE-------GDSGGLKTM--QIEI-PTEARKIILEILKKQIYESGQGLQQ--DTENFG 335 (451) T ss_pred hh-----------CCeEEecCcCC-------ccCCcceEE--eecC-CHHHHHHHHHHHHHHHHHHhCcccc--cccccc Confidence 11 11222211100 011122221 1111 1344567788888899999998852 222222 Q ss_pred CccCHHHH-------------HHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHHHHHHHHHHH Q lcl|NC_021537. 360 NRANSKEQ-------------TREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRA 426 (602) Q Consensus 360 ~~sn~e~~-------------~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~ 426 (602) | ++..+. ....+..+|+-+++.+...++. .......+.|+..-.. +....++.+.+ T Consensus 336 n-~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~------~d~~~i~i~f~~~~p~----n~~e~~~~~~k 404 (451) T protein:vir:10 336 N-ASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGV------TDYKKIQQTYTRNMMS----NDLEDADIATK 404 (451) T ss_pred c-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC------CCccceeEEecCCCCC----CHHHHHHHHHH Confidence 2 222221 1111222333333333333221 1234556677554322 33345566777 Q ss_pred HHhCCcccHHHHHHHhCCCCCCCCcccc-ccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 427 MRLAGVGTVNEAREELDLAPFEDDRGDM-TLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 427 ~~~~G~~T~NE~R~~~Gl~p~~~g~~d~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) + .|+++..-+.++++.- ++..... .+...-....... .+..+. 
..+ T Consensus 405 l--~g~iS~et~~~~~p~v--~d~~~e~~~~~ee~~~~~~~~-~~~~~~--------------~~~ 451 (451) T protein:vir:10 405 S--VGIIPTKIILRHHPWV--DDVEEAEKLYLEEKKIQASKV-SDDYNN--------------FTE 451 (451) T ss_pred H--hccCchHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHH-HhhcCC--------------CCC Confidence 6 4889987788777542 2111000 0000000000000 000000 000 No 222 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=96.12 E-value=0.00098 Score=37.08 Aligned_cols=346 Identities=9% Similarity=0.003 Sum_probs=119.4 Q ss_pred CceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhc--cCCccCCH------HHH---HHHHHHHHHhcCCeEEEEe Q lcl|NC_021537. 56 YGFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGP--EGTAMSTP------EEV---LELGRQDYHGIGWAALEIL 124 (602) Q Consensus 56 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~--~pn~~~t~------~~~---~~~~~~d~l~~Gna~~~i~ 124 (602) |+| +.+......... ...... .....+. ++. ......+. ..+ +..+..++ .-+-+.+. T Consensus 1 Mgl--f~~~~~~~~~~~-~~~~~~---~~~~~~~--~~~~~~~~~~v~~~~al~~~~V~~~i~~Ia~~i---a~l~~~~~ 69 (384) T protein:vir:49 1 MPI--FNITNLATESPP-SNQDSF---FDITDPE--FLDALNGSEWVSAETALKNSDLFSIISQLSNDL---ATAKITTS 69 (384) T ss_pred Ccc--ccccccCccccc-ccchhh---ccccchh--hcccccCCceechhhhhccHHHHHHHHHHHHHH---hhCceeee Confidence 443 211111000000 000000 0000000 000 00011111 112 22222221 11222222 Q ss_pred eCCCCceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecc-cceEEec Q lcl|NC_021537. 125 VEGDGTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKE-TGEVASD 203 (602) Q Consensus 125 r~~~G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~ 203 (602) +.... .-..++++.++...++..........|++|+.+..+..+.......+.+..+.+.... .+... 
T Consensus 70 ~~~~~---------~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~-- 138 (384) T protein:vir:49 70 RKQLQ---------GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNGLY-- 138 (384) T ss_pred cchhh---------hhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEE-- Confidence 21111 1245788999999999999999999999999998877654444333333333322111 11100 Q ss_pred CceeEEechh-----HEEEecCCCCCCCcccccH---HH--HHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCH Q lcl|NC_021537. 204 AGELKNGPAN-----ELIFLPNPSPLALYYGVPD---WV--AAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSE 273 (602) Q Consensus 204 ~~~~~~~~~~-----eviH~r~~~~~~~~~G~sp---l~--~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~ 273 (602) ..+... .-..++.....+ +.+.++ +. +.+..+...........++..+ T Consensus 139 ----y~~~~~~~~~~~~~~~~~~eVih-~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~----------------- 196 (384) T protein:vir:49 139 ----YNITFDDPRIPPKQHVPQGDILH-FRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLN----------------- 196 (384) T ss_pred ----EEEEecCccccceeEecCccEEE-ecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHH----------------- Confidence 011100 011111100000 011111 11 1111111111111111111111 Q ss_pred HHHHHHHHHHHHhhcccccCcceeccCCccceecc------ccccccccccccccc---cchHHHHHHHH--HHhhHHHH Q lcl|NC_021537. 274 DSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLG------DGGSDVNIELEPIGA---REDLDMEFQAF--RERNEHEI 342 (602) Q Consensus 274 ~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~------~~~~~~~~~~~pl~~---~~~~d~qf~e~--~~~~~~~I 342 (602) -+..+...+.++..+++....... .......-++-.+.. ..+-.+...+. .+.....+ T Consensus 197 -----------~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~ 265 (384) T protein:vir:49 197 -----------ALKNALNANGILKIKGGGLLDFKTKQSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTT 265 (384) T ss_pred -----------HHhccCCCceEEEeCCCCChHHHHHHHHHHHhcccCCccceecCCCceEEEccCChhhHHHHHHHHHHH Confidence 112122222222222111100000 000000000000100 00000000000 00000000 Q ss_pred HHHhcCChHHhhcc----ccCCccC-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcchhHHH Q lcl|NC_021537. 
343 AKVHGVPPVLINVT----STSNRAN-SKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQPEQDA 417 (602) Q Consensus 343 a~~fgVPp~~lg~~----~~~~~sn-~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~~~d~ 417 (602) ..+. ...|.. .....++ ..+....++..+|+|.+..|..+|+.+|..... ++.....+. +. T Consensus 266 ~~Ia----~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~~l~--------~~~~~~~~~--~~ 331 (384) T protein:vir:49 266 GQFA----KVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLRPFVSELSKKLSCEVD--------ADILPAVDP--TG 331 (384) T ss_pred HHHH----HHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhh--------hhhhhhhhc--cc Confidence 1110 112221 1111111 234567788899999999999999999865432 222111111 11 Q ss_pred HHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCccc Q lcl|NC_021537. 418 KMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAML 476 (602) Q Consensus 418 ~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~ 476 (602) ......+..++.+|++|+||+|++++..++...+.... .+..+ ..++++++.= T Consensus 332 ~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~ne~r~~--~~~~p----~~gGd~~~~~ 384 (384) T protein:vir:49 332 SNYIGLINSMVKTGTLAQNQGLYVLQQAEILPKDLPEG--ETDST----LKGGETNEQY 384 (384) T ss_pred hHHHHHHHHHhhcCcccHHHHHHHHhhCCCCChhHHHH--cCCCC----CCCCCCCCCC Confidence 22223356789999999999999997766543221111 11111 1112211111 No 223 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=96.10 E-value=0.001 Score=37.02 Aligned_cols=471 Identities=12% Similarity=0.071 Sum_probs=167.4 Q ss_pred CCCCccccccc-chhhhcccCccccC---------CCCH----HHHHHHHhhhHHHHHHHHHHHHhhccC-----ceEEE Q lcl|NC_021537. 1 MSKAEETTQLD-ERHIATDVGRGIQP---------PYNP----ETLAAFQELNETHQACIRKKSRYEAGY-----GFEIV 61 (602) Q Consensus 1 ~~k~~~~~~~~-~~~~~~~~~~~i~p---------~~~~----~~l~~~~~~~~~v~~cI~~ia~~ia~~-----~~~i~ 61 (602) ..|+....... ++.++.-.+++... 
+-+- ...|.++ .+|-|..+|+-|.+.+.-. |..|. T Consensus 14 ~~~~~S~vpp~~~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma-~~pEVd~Av~eIVneaIv~d~~~~pV~vd 92 (564) T protein:vir:10 14 GQKGQSPVPPNDEASVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMS-LHPEVDSAIDEIVNEFVVNDGDDKPVEVD 92 (564) T ss_pred cCCCCCcccCCcCCChhhhhccccceeeecccccchhhHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCCceEEEE Confidence 11111111111 11111111111111 1121 2234454 4788899999888875522 33222 Q ss_pred EecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC---CceEEEEEeC Q lcl|NC_021537. 62 AHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD---GTPVGLAHVP 138 (602) Q Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~---G~~~~L~~l~ 138 (602) . +.++.+....+.+....... ..++ +-..+..+++ +.|.+.|..|..++-|.+ ..+.+|.+|| T Consensus 93 L----~~~~~s~siK~kI~eEF~~I---l~ll---~F~~~~~e~f----R~WYVDgRi~fHkiid~~~pk~GI~eLr~lD 158 (564) T protein:vir:10 93 L----QNLEIGSGVKKKIRDEFNRI---LRMM---NFNVNAHEII----RNWYVDGRSHYHKVIDLDNPKKGILELRYID 158 (564) T ss_pred e----cccCcchHHHHHHHHHHHHH---HHHh---ccchhhhHHH----hhhhhcceEEEEEEeeCCChhhhhhhhhhhc Confidence 1 12223333223333322221 1121 1223444454 456677999999876532 2388999999 Q ss_pred cccccccccccccccccchhhhhcccCce-eEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechhHEEE Q lcl|NC_021537. 139 AATVRVRKTTTTIEREDGEEVENIESGHG-YVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPANELIF 217 (602) Q Consensus 139 p~~v~~~~~~~~~~~~~~~~~~~~~~~~~-~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~eviH 217 (602) |..|+.......-....+..+ ..+.. ++........+.+.+... .. 
.....+|......+....++.+.|.| T Consensus 159 Pr~i~~vr~i~~~~~~~~~~v---~k~~~~~~~y~~~~Eyy~Ynp~~~-~g---~~~~~~~~~~~~~~~~ikI~~daI~y 231 (564) T protein:vir:10 159 SLKIRKVRQKLKDVDPNRKEI---EKGTALQYDYGDFIEYYIYNPKGF-AG---NIPMVTGSMDWSNQEGIKIASDAIAQ 231 (564) T ss_pred ccceeeeeeecccccccccee---eeeeeeeccccccccceeeccccc-cC---cccccccccccccccceeechhhcce Confidence 997774432211100001000 00100 000000011111111000 00 00111222223334567888998888 Q ss_pred ecC-CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCHHHHHHHHHHHHHhhcc------ Q lcl|NC_021537. 218 LPN-PSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSEDSKEDLRNLMDNLKGS------ 289 (602) Q Consensus 218 ~r~-~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~~~~~~l~~~~~~~~g~------ 289 (602) ..- .-..++-.=+|-|..|.+.+.....++...--|=-.-+.-+-|..+. |......+.+-+++.+..++.- T Consensus 232 ~hSGL~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~ 311 (564) T protein:vir:10 232 STSGLMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQ 311 (564) T ss_pred ecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEecc Confidence 863 11223333456677777776666655555443322222333333332 2222233333344433333210 Q ss_pred ----cccCcceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc-CC--c Q lcl|NC_021537. 290 ----RYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST-SN--R 361 (602) Q Consensus 290 ----~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~~--~ 361 (602) .+..+.+..-+ .-++.--.-..+.+++.|.-. +.-+| +=.++..+...++++||.+.|+...+ .+ . 
T Consensus 312 TGevrddrk~msMlE---DyWLPRReGgrgTEItTLpGgqnLgem---~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr 385 (564) T protein:vir:10 312 TGEIRDDKKHMSMLE---DFWLPRREGGRGTEITTLPGGQNLGEL---KDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGK 385 (564) T ss_pred CceecccchhhhhHh---hhcccccCCCcccceeeccccCCcchH---HHHHHHHHHHHHHhCCCcccccCCCceeeccc Confidence 00000000000 000000000011222222111 11123 33356778999999999999975421 22 2 Q ss_pred cC-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc---------cc---cccceEEEeccchhcchhHHHHHHHHHHHHHH Q lcl|NC_021537. 362 AN-SKEQTREFAKGIIEPEQAKFSARLYKIIHQD---------AL---DVDEWTIDFELRGAEQPEQDAKMAEQRVRAMR 428 (602) Q Consensus 362 sn-~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~---------~~---~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~ 428 (602) ++ +.-.-.-| ...|.-+..+|...|...|-.. .+ .....++.|..+.-.....+.+...+++..+- T Consensus 386 ~~EItRDEiKF-~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~ 464 (564) T protein:vir:10 386 STEILRDELKF-TKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLAT 464 (564) T ss_pred ccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHH Confidence 22 22222233 4456777777666665443322 11 11234555655543333334444333333321 Q ss_pred hC-C----cccHHHHHH-HhC----------------------CCCCCCCccccccccccccccccccCCCcCccccccc Q lcl|NC_021537. 429 LA-G----VGTVNEARE-ELD----------------------LAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSK 480 (602) Q Consensus 429 ~~-G----~~T~NE~R~-~~G----------------------l~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (602) .. + .++.+=+|+ .|. .+|.+...++-....+.+ -+++.+.. ..... T Consensus 465 ~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~---~~p~~~~~---~~~~~ 538 (564) T protein:vir:10 465 QMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQA---FAPELQAA---QDDLA 538 (564) T ss_pred HhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCc---CCcchhhh---ccccc Confidence 11 1 112222211 111 122211111100000000 00000000 00000 Q ss_pred ccccccccccccccccccccccchhhhhcchhhhhhheecccc Q lcl|NC_021537. 
481 AAPPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFGE 523 (602) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~ 523 (602) .++..... ..+++++--+..- -|+.. T Consensus 539 ~~~~~~~~--------~~a~~~~~~~~~~---------~~~~~ 564 (564) T protein:vir:10 539 AEREIKKL--------NSAPKPPPSQQSK---------SQSNK 564 (564) T ss_pred cccChhhh--------ccCCCCCCCCCCc---------CcCCC Confidence 00000000 0000000000000 00000 No 224 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=96.06 E-value=0.0011 Score=36.89 Aligned_cols=452 Identities=10% Similarity=-0.021 Sum_probs=154.9 Q ss_pred CCCCcccccccchhhhcccCccccCC-----CCH--HHHHHHHhhhHHHHHHHHHHHHhhccC-----ce-EEEEecCC- Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPP-----YNP--ETLAAFQELNETHQACIRKKSRYEAGY-----GF-EIVAHPSA- 66 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~-----~~~--~~l~~~~~~~~~v~~cI~~ia~~ia~~-----~~-~i~~~~~~- 66 (602) ..+..+.++..+.. ..+...++-|. -+. ..+.+.. +++...|++.+|..+.+. +| ++...+.. T Consensus 16 ~~~l~~~R~~~e~~-w~e~~~y~lP~~~~~~~~~~~~~~~~~~--dst~~~a~~~Las~l~~~ltP~~~WFrl~~~d~~~ 92 (522) T protein:vir:94 16 YDRLKNGRQPYETR-AQNCAAVTIPSLFPKESDNSSTEYTTPW--QAVGARCLNNLAAKLMLALFPQSPWMRLTVSEYEA 92 (522) T ss_pred HHHHHHHhhHHHHH-HHHHHHHhcccccCCCCCcccccccccc--cccHHHHHHHHHHHHHhhcCCCCcccccccchhhh Confidence 11000000000100 00000111110 000 0111222 234456777777777652 22 22211100 Q ss_pred CCcccchhhHHHHHHhhhcc-chhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceE--EEEEeCccccc Q lcl|NC_021537. 67 DEPDEGGESYQTVRDFWYGS-DSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPV--GLAHVPAATVR 143 (602) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~-~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~--~L~~l~p~~v~ 143 (602) +....+......++..+..+ ...+..+. ..+++.-+..+..|+.++||+++++..+..|.+. 
..+||..-.| T Consensus 93 ~~~~~~~~~~~~v~~~L~~ve~~~~~~~~----~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~~~pl~~y~v- 167 (522) T protein:vir:94 93 KTLSQDSEAAARVDEGLAMVERVLMAYME----TNSFRVPLFEALKQLIVSGNCLLYIPEPEQGTYSPMRMYRLVSYVV- 167 (522) T ss_pred hccCcccchhHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhhCcEeEeeeccCCCceeeEEEEEcceEEE- Confidence 00001111112222222221 11122222 3457777788899999999999998888777654 4556543333 Q ss_pred ccccccccc--cc--cchhhhhcc-c-CceeEEEEcC--Ccceeecccccccccce--eeecccceEEecCceeEEechh Q lcl|NC_021537. 144 VRKTTTTIE--RE--DGEEVENIE-S-GHGYVQVRQG--RRRYFGEAGDRYGDDKR--FVDKETGEVASDAGELKNGPAN 213 (602) Q Consensus 144 ~~~~~~~~~--~~--~~~~~~~~~-~-~~~~~qi~~~--~~~~~~~~~~~~~~~~~--~~~~~~g~~~~~~~~~~~~~~~ 213 (602) ..+..+.. .. .......+. . ...+. .+. ....+..+...+|..-+ +.....|...........|... T Consensus 168 -~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~--~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~ 244 (522) T protein:vir:94 168 -QRDAFGNILQIVTIDKVAFSALPEDVKSQLN--ADDYEPDTELEVYTHIYRQDDEYLRYEEVEGIEVTGTDGSYPLTAC 244 (522) T ss_pred -eeCCCcCeEEEeeeeeccHHhcchHHHHHHh--cccCCccceEEEEEEEEeeCCceeEEeeccCceecccCCCCccccC Confidence 33322211 00 000000000 0 00000 000 00001111111111110 0011111111111011123334 Q ss_pred HEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccC Q lcl|NC_021537. 214 ELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRT 293 (602) Q Consensus 214 eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag 293 (602) ..+.+|.....+..||.||...++..+.......+.......-...|..++. +++........ .++.+. 
T Consensus 245 P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~-~~g~~~~~~~~---------~~~~g~- 313 (522) T protein:vir:94 245 PYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVN-PNGITQPRRLN---------KAATGE- 313 (522) T ss_pred CceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-ccccccchhee---------ccCCce- Confidence 5677777777788999999999999999999998888888888888876654 33322222111 111110 Q ss_pred cceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHH--- Q lcl|NC_021537. 294 AILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTR--- 369 (602) Q Consensus 294 ~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~--- 369 (602) + ..+ ...++...++...+ |.+ ..+..+.....|-.+|-+.. ++.- ++..-++++... T Consensus 314 --~-v~g-----------~~~~v~~~~~~~~~--~~~~~~~~i~~~~~rI~~af~~~~--~~~~-~~~r~TAtEV~~r~~ 374 (522) T protein:vir:94 314 --F-VAG-----------RVEDINFLQLTKGQ--DFTIAKSVADAIEQRLGWAFLLNS--AVQR-NAERVTAEEIRYVAG 374 (522) T ss_pred --e-ecC-----------Ccccceeeeccccc--chhHHHHHHHHHHHHHHHHHhhhh--hccC-CCccccHHHHHHHHH Confidence 0 000 11122222322111 222 13455667788888886652 2221 122223333211 Q ss_pred -----------HHHHHHHHHHHHHHHHHH-hhhcCCccccccceEEEeccchhcchhH---HHHHHHHHHHHHHhCCccc Q lcl|NC_021537. 370 -----------EFAKGIIEPEQAKFSARL-YKIIHQDALDVDEWTIDFELRGAEQPEQ---DAKMAEQRVRAMRLAGVGT 434 (602) Q Consensus 370 -----------~f~~~~l~P~~~~ie~~l-n~~Ll~~~~~~~~~~~~f~~~~~~~~~~---d~~~~~~~~~~~~~~G~~T 434 (602) .+....|.|++.+.-..+ ...++++... ..+.+++ .+.+.... +......+++.+.. +. 
T Consensus 375 E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~-~~v~v~~--~s~La~~qr~~~~~~l~~~~~~ia~---l~ 448 (522) T protein:vir:94 375 ELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPK-EAVEPTV--STGLEALGRGQDLEKLTQAVNMMTG---LQ 448 (522) T ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCc-ccEEeeE--ecHHHHHHHHHHHHHHHHHHHHHHh---cc Confidence 223344566655543333 3335544322 2344444 22222111 11111112211110 11 Q ss_pred HHHHHHHhCCCCCCCCccccccccccccccccccCCCcCcccccccc--------cccccccccccccccc-cccccchh Q lcl|NC_021537. 435 VNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKA--------APPLENKIGERDSVDV-DVSKDPIE 505 (602) Q Consensus 435 ~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~-~~~~~~m~ 505 (602) |. +.. ..++ .+..+...-...|.++..--....+..+.. .........+...+.. ....+.|. T Consensus 449 P~-~~~----~~id---~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~q~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 520 (522) T protein:vir:94 449 PL-SQD----PDIN---LPTLKLRLLNALGIDTAGLLLTQDEKIQRMAEQSSQQAVVQGASAAGANMGAAVGQGAGEDMA 520 (522) T ss_pred ch-hhh----hcCC---HHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccchhhh Confidence 10 000 0010 011110000011111100000000000000 0000000000000000 00000000 Q ss_pred hhhcchh Q lcl|NC_021537. 506 QTTFSSS 512 (602) Q Consensus 506 ~~~v~ss 512 (602) +. T Consensus 521 -----~~ 522 (522) T protein:vir:94 521 -----QA 522 (522) T ss_pred -----cC Confidence 00 No 225 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=95.44 E-value=0.0021 Score=35.30 Aligned_cols=441 Identities=10% Similarity=0.073 Sum_probs=165.9 Q ss_pred CCCCccccc--c-------cchh-hhc--ccCc--cccCCCC-----HHHHHHHHhhhHHHHHHHHHHHHhhccC----- Q lcl|NC_021537. 1 MSKAEETTQ--L-------DERH-IAT--DVGR--GIQPPYN-----PETLAAFQELNETHQACIRKKSRYEAGY----- 56 (602) Q Consensus 1 ~~k~~~~~~--~-------~~~~-~~~--~~~~--~i~p~~~-----~~~l~~~~~~~~~v~~cI~~ia~~ia~~----- 56 (602) +++.+.+.+ + ++.. +.+ .++. 
.+++.+. +...|.++. +|-|..+|+-|.+.+.-. T Consensus 9 i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~-~pEvd~Av~eIVneaiv~d~~~~ 87 (533) T protein:vir:10 9 LERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVL-QPECDSAVDDIVNETICGNFDDV 87 (533) T ss_pred cccccccccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHHHHHhh-ccchhhHHHHhhcceeeecCCCc Confidence 111111111 1 1100 100 0111 1223222 233455553 688889999988877632 Q ss_pred ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC---CCceEE Q lcl|NC_021537. 57 GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG---DGTPVG 133 (602) Q Consensus 57 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~---~G~~~~ 133 (602) |..|.. +.++.+....+.+....... ..++ +-..+..++++ .|.+.|..|..++-|. ...+.+ T Consensus 88 pV~i~L----d~~~~s~~iK~kI~eEF~~I---l~ll---~F~~~~~e~fR----~WYVDgRi~fHkiid~~~pk~GI~E 153 (533) T protein:vir:10 88 PVSVEL----SNLKVSDKIKKLIREEFGEI---LRLL---DFENRSYEIFR----RWYVDGRLFYHKVIDPDNPQGGLIE 153 (533) T ss_pred eEEEEe----cccccchHHHHHHHHHHHHH---HHHh---ccchhhhHHHh----hhhhcceEEEEEEecCCCcccccee Confidence 222221 12223333233333322221 1121 12233444544 5667799999987664 346889 Q ss_pred EEEeCcccccccccccccccccchhhhhcccCcee----EEEEcCCcceeecccccccccceeeecccceEEecCceeEE Q lcl|NC_021537. 134 LAHVPAATVRVRKTTTTIEREDGEEVENIESGHGY----VQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKN 209 (602) Q Consensus 134 L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~----~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 209 (602) |.+|||..|+......... .++..+ .++..+ ...++.| ++. |. ....+.... 
T Consensus 154 Lr~lDPr~i~~vr~i~~~~----------~~~~~~~~~~~~v~~~-~~eyf~Y------np~------g~-~~~~~~~vk 209 (533) T protein:vir:10 154 LRYIDPRKIRKINETEQKR----------PEQLRGLPLNQQLSPK-SAEYFLY------DPK------GL-KNSTTQGLK 209 (533) T ss_pred eeeccccceeeeeeeeccC----------CCccceeecchhhhcc-ceeeeee------ccc------cc-cccCCCcee Confidence 9999999998633211100 001000 000110 0001111 010 11 111233456 Q ss_pred echhHEEEec-CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCHHHHHHHHHHHHHhh Q lcl|NC_021537. 210 GPANELIFLP-NPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSEDSKEDLRNLMDNLK 287 (602) Q Consensus 210 ~~~~eviH~r-~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~~~~~~l~~~~~~~~ 287 (602) ++.+-|.+.. +.-+.++-.=+|-|..|.+.+.....++...--|=-.-+.-+-|..+. |......+.+-+++.+..++ T Consensus 210 I~~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~K 289 (533) T protein:vir:10 210 IAPDSICYVHSGIMDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYR 289 (533) T ss_pred cchhheeeeeccceeCCCCceeccchHhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcc Confidence 7775544433 112222223357777777777666666555443322223333343332 22223333333444443332 Q ss_pred cc----cccC------cceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhcc Q lcl|NC_021537. 288 GS----RYRT------AILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINVT 356 (602) Q Consensus 288 g~----~nag------~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~ 356 (602) .. ...| +.+..-. .-++.--.-..+.+++.|.-. +.-+| +=.++..+...++++||.+.|+.. 
T Consensus 290 NklVYDa~TGev~ddrk~msMlE---DyWLPRReGgrgTEItTLpGgqnLgem---~DV~YF~kKLY~aLnVP~SRl~~e 363 (533) T protein:vir:10 290 NKLVYDANTGEIKDDKKFMSMLE---DFWLPRREGGRGTEITTLPGGQNLGEL---EDVKYFQKKLYKSLNVPGSRLETE 363 (533) T ss_pred ceEEEeccCceecccchhhhhHh---hhcccccCCCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCccccCCC Confidence 10 0000 0000000 000000000012222222211 11223 334567789999999999998754 Q ss_pred ccCCccC---HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc---------cc---cccceEEEeccchhcchhHHHHHHH Q lcl|NC_021537. 357 STSNRAN---SKEQTREFAKGIIEPEQAKFSARLYKIIHQD---------AL---DVDEWTIDFELRGAEQPEQDAKMAE 421 (602) Q Consensus 357 ~~~~~sn---~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~---------~~---~~~~~~~~f~~~~~~~~~~d~~~~~ 421 (602) +..+... +.-.-.-| ...|.-+..+|...|...|-.. .+ .....++.|..+.-.....+.+... T Consensus 364 ~~f~~Gr~~EItRDEiKF-~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~ 442 (533) T protein:vir:10 364 TTFNVGRAAEITRDEVKF-QKFVARLRKRFSELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRN 442 (533) T ss_pred CcccccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHH Confidence 4333322 22222233 4456777777666665444322 11 1123455565554333333444444 Q ss_pred HHHHHHHhC-C----cccHHHHHH-HhCC------------------CCCCCCccccccccccccccccccCCCcCcc-c Q lcl|NC_021537. 422 QRVRAMRLA-G----VGTVNEARE-ELDL------------------APFEDDRGDMTLSEFEAEFGADASDGDAEAM-L 476 (602) Q Consensus 422 ~~~~~~~~~-G----~~T~NE~R~-~~Gl------------------~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~-~ 476 (602) +++..+-.. + .++.+=+|+ .|.+ +-+++++.++-... .+.++..+..+.. . T Consensus 443 ~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~~~~~~~----~~~~~~~~~~~~~~~ 518 (533) T protein:vir:10 443 ERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAAEMDPAM----AAGDPDAGGAPAEEV 518 (533) T ss_pred HHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcchhhHHh----cCCCCCcCCcccccC Confidence 443332211 1 113322221 2222 11222221111000 0111111111111 0 Q ss_pred cccccccccccccccc Q lcl|NC_021537. 
477 TRSKAAPPLENKIGER 492 (602) Q Consensus 477 ~~~~~~~~~~~~~~~~ 492 (602) .+..++|..+. ..+- T Consensus 519 ~~~~~~~~~~~-~~~~ 533 (533) T protein:vir:10 519 APEGPDPSDER-KAEF 533 (533) T ss_pred CCCCCCcchhh-ccCC Confidence 11111111100 0000 No 226 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=95.42 E-value=0.0021 Score=35.26 Aligned_cols=431 Identities=10% Similarity=0.052 Sum_probs=168.9 Q ss_pred CC-CCcccc---cccchh-h-----hcccCccc------cCCC-CH----HHHHHHHhhhHHHHHHHHHHHHhhccC--- Q lcl|NC_021537. 1 MS-KAEETT---QLDERH-I-----ATDVGRGI------QPPY-NP----ETLAAFQELNETHQACIRKKSRYEAGY--- 56 (602) Q Consensus 1 ~~-k~~~~~---~~~~~~-~-----~~~~~~~i------~p~~-~~----~~l~~~~~~~~~v~~cI~~ia~~ia~~--- 56 (602) ++ |+.... .-++.. + ...+++.. ++.+ +- ...|.++ .+|-|..||+-|.+.+.-. T Consensus 22 ~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~ 100 (516) T protein:vir:10 22 LKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLI-NNPEVERAVANIVNEAIVYERG 100 (516) T ss_pred hcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCC Confidence 11 111100 000000 0 01112222 2212 22 3345554 4788899999998877632 Q ss_pred --ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeC-CCCceEE Q lcl|NC_021537. 57 --GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVE-GDGTPVG 133 (602) Q Consensus 57 --~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~-~~G~~~~ 133 (602) |..|.. +..+.+....+.+....... 
..++ +-..+..++++ .|.+.|..|..++-+ .+..+.+ T Consensus 101 ~~pV~l~L----~~~~~s~~ik~kI~eeF~~I---l~ll---~F~~~~~~~fR----~WYVDgRi~fhKiid~~k~GI~E 166 (516) T protein:vir:10 101 HKVVSLDL----DDTDFGSNVKEKILEEFDEV---CRLL---DASRKLDTLFR----RWYVDSRIFFHKIMPNPKKGIAE 166 (516) T ss_pred CceEEEEe----cccCcchHHHHHHHHHHHHH---HHHh---ccchhhhHHHh----hhhhcceEEEEEEecCcccccee Confidence 333322 11222222223333322221 1111 12234444544 566779999996655 3456889 Q ss_pred EEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechh Q lcl|NC_021537. 134 LAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPAN 213 (602) Q Consensus 134 L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 213 (602) |.+|||..|+....... ....+..+. ...+..|.|..+.-. ....|..+.. +....++.+ T Consensus 167 lr~lDPr~i~~vR~i~~-~~~~~~~v~-----------~~~~e~~~Y~~~~~~-------~~~~g~~~~~-~~~ikI~~d 226 (516) T protein:vir:10 167 LRRLDPRFMEYYREIVT-SDIGGTTIV-----------KGYREFFIYTTGNEG-------YSYNGRIFEP-NTRIKIPRS 226 (516) T ss_pred eeeeCCcceeeEeeecc-cccccchhh-----------hhhhheeeeccCccc-------cccccceeCC-Ccceeechh Confidence 99999999985433211 111111111 000111111111100 0111222222 234567777 Q ss_pred HEEEec--CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCHHHHHHHHHHHHHhhcc- Q lcl|NC_021537. 214 ELIFLP--NPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSEDSKEDLRNLMDNLKGS- 289 (602) Q Consensus 214 eviH~r--~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~~~~~~l~~~~~~~~g~- 289 (602) -|.|.. ..+..++.+ +|-|..|.+.+.....++...--|=-.-+.-+-|..+. |......+.+-+++.+..++.. 
T Consensus 227 AI~y~hSGL~d~~~~~i-~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNkl 305 (516) T protein:vir:10 227 AVVYASSGLMDCSDRGI-IGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRV 305 (516) T ss_pred heeeecccceeCCCCce-eeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCcee Confidence 776665 233233333 77788887777666666665443322223333343332 2222333333344444333210 Q ss_pred ---cccC------cceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhccccC Q lcl|NC_021537. 290 ---RYRT------AILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTS 359 (602) Q Consensus 290 ---~nag------~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 359 (602) .+.| +.+..-+. -++.--....+.+++.|.-. +.-+| +=.++..+...++++||.+.|+..++. T Consensus 306 vYDa~TGev~ddrk~msMlED---yWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~~ 379 (516) T protein:vir:10 306 VYDSNTGTVKNQKRNLSMTED---YWLMRRDGKSVTEVSSLPGAQTMGDM---DDVRWFNKKLYEALRIPLSRIPRDDGG 379 (516) T ss_pred EEeCCCCeeccchhhhhhHhh---hcccccCCCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCcccccCCCCc Confidence 0011 11100000 00000000011222222211 11233 334567789999999999999754443 Q ss_pred Cc----cC-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc---------cc---cccceEEEeccchhcchhHHHHHHHH Q lcl|NC_021537. 360 NR----AN-SKEQTREFAKGIIEPEQAKFSARLYKIIHQD---------AL---DVDEWTIDFELRGAEQPEQDAKMAEQ 422 (602) Q Consensus 360 ~~----sn-~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~---------~~---~~~~~~~~f~~~~~~~~~~d~~~~~~ 422 (602) +. 
++ +..--.-| ...|.-+...|...|...|-.+ .+ .....++.|..+.-.....+.+...+ T Consensus 380 ~~~~Gr~~EItRDEiKF-~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~ 458 (516) T protein:vir:10 380 MVIGGQDTAITRDELDF-RKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRL 458 (516) T ss_pred eeeccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHH Confidence 32 21 11111233 4456777777666655443322 11 11234555555543333334444333 Q ss_pred HHHHHH-----hCCcccHHHHHH-HhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccc Q lcl|NC_021537. 423 RVRAMR-----LAGVGTVNEARE-ELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKI 489 (602) Q Consensus 423 ~~~~~~-----~~G~~T~NE~R~-~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (602) ++..+- -..+++.+=+++ .|.+.-.+-.. ...........+-. ..|..+... T Consensus 459 R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~-------e~k~I~~E~~~~~~--------~~p~~~~~f 516 (516) T protein:vir:10 459 RVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQ-------EEKQIEQEAGIKRF--------QNPENEDDF 516 (516) T ss_pred HHHHHHHhhhhhccccchHHHHHHHhcCCHhhHHH-------HHHHHHHhhhCCCC--------CCCCccccC Confidence 333221 123444444433 34442111000 00000000000000 000000111 No 227 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=95.42 E-value=0.0021 Score=35.26 Aligned_cols=431 Identities=10% Similarity=0.052 Sum_probs=168.9 Q ss_pred CC-CCcccc---cccchh-h-----hcccCccc------cCCC-CH----HHHHHHHhhhHHHHHHHHHHHHhhccC--- Q lcl|NC_021537. 1 MS-KAEETT---QLDERH-I-----ATDVGRGI------QPPY-NP----ETLAAFQELNETHQACIRKKSRYEAGY--- 56 (602) Q Consensus 1 ~~-k~~~~~---~~~~~~-~-----~~~~~~~i------~p~~-~~----~~l~~~~~~~~~v~~cI~~ia~~ia~~--- 56 (602) ++ |+.... .-++.. + ...+++.. ++.+ +- ...|.++ .+|-|..||+-|.+.+.-. 
T Consensus 22 ~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~ 100 (516) T protein:vir:10 22 LKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLI-NNPEVERAVANIVNEAIVYERG 100 (516) T ss_pred hcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCC Confidence 11 111100 000000 0 01112222 2212 22 3345554 4788899999998877632 Q ss_pred --ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeC-CCCceEE Q lcl|NC_021537. 57 --GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVE-GDGTPVG 133 (602) Q Consensus 57 --~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~-~~G~~~~ 133 (602) |..|.. +..+.+....+.+....... ..++ +-..+..++++ .|.+.|..|..++-+ .+..+.+ T Consensus 101 ~~pV~l~L----~~~~~s~~ik~kI~eeF~~I---l~ll---~F~~~~~~~fR----~WYVDgRi~fhKiid~~k~GI~E 166 (516) T protein:vir:10 101 HKVVSLDL----DDTDFGSNVKEKILEEFDEV---CRLL---DASRKLDTLFR----RWYVDSRIFFHKIMPNPKKGIAE 166 (516) T ss_pred CceEEEEe----cccCcchHHHHHHHHHHHHH---HHHh---ccchhhhHHHh----hhhhcceEEEEEEecCcccccee Confidence 333322 11222222223333322221 1111 12234444544 566779999996655 3456889 Q ss_pred EEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechh Q lcl|NC_021537. 134 LAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPAN 213 (602) Q Consensus 134 L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 213 (602) |.+|||..|+....... ....+..+. ...+..|.|..+.-. ....|..+.. 
+....++.+ T Consensus 167 lr~lDPr~i~~vR~i~~-~~~~~~~v~-----------~~~~e~~~Y~~~~~~-------~~~~g~~~~~-~~~ikI~~d 226 (516) T protein:vir:10 167 LRRLDPRFMEYYREIVT-SDIGGTTIV-----------KGYREFFIYTTGNEG-------YSYNGRIFEP-NTRIKIPRS 226 (516) T ss_pred eeeeCCcceeeEeeecc-cccccchhh-----------hhhhheeeeccCccc-------cccccceeCC-Ccceeechh Confidence 99999999985433211 111111111 000111111111100 0111222222 234567777 Q ss_pred HEEEec--CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCHHHHHHHHHHHHHhhcc- Q lcl|NC_021537. 214 ELIFLP--NPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSEDSKEDLRNLMDNLKGS- 289 (602) Q Consensus 214 eviH~r--~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~~~~~~l~~~~~~~~g~- 289 (602) -|.|.. ..+..++.+ +|-|..|.+.+.....++...--|=-.-+.-+-|..+. |......+.+-+++.+..++.. T Consensus 227 AI~y~hSGL~d~~~~~i-~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNkl 305 (516) T protein:vir:10 227 AVVYASSGLMDCSDRGI-IGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRV 305 (516) T ss_pred heeeecccceeCCCCce-eeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCcee Confidence 776665 233233333 77788887777666666665443322223333343332 2222333333344444333210 Q ss_pred ---cccC------cceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhccccC Q lcl|NC_021537. 290 ---RYRT------AILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTS 359 (602) Q Consensus 290 ---~nag------~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 359 (602) .+.| +.+..-+. -++.--....+.+++.|.-. +.-+| +=.++..+...++++||.+.|+..++. 
T Consensus 306 vYDa~TGev~ddrk~msMlED---yWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~~ 379 (516) T protein:vir:10 306 VYDSNTGTVKNQKRNLSMTED---YWLMRRDGKSVTEVSSLPGAQTMGDM---DDVRWFNKKLYEALRIPLSRIPRDDGG 379 (516) T ss_pred EEeCCCCeeccchhhhhhHhh---hcccccCCCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCcccccCCCCc Confidence 0011 11100000 00000000011222222211 11233 334567789999999999999754443 Q ss_pred Cc----cC-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc---------cc---cccceEEEeccchhcchhHHHHHHHH Q lcl|NC_021537. 360 NR----AN-SKEQTREFAKGIIEPEQAKFSARLYKIIHQD---------AL---DVDEWTIDFELRGAEQPEQDAKMAEQ 422 (602) Q Consensus 360 ~~----sn-~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~---------~~---~~~~~~~~f~~~~~~~~~~d~~~~~~ 422 (602) +. ++ +..--.-| ...|.-+...|...|...|-.+ .+ .....++.|..+.-.....+.+...+ T Consensus 380 ~~~~Gr~~EItRDEiKF-~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~ 458 (516) T protein:vir:10 380 MVIGGQDTAITRDELDF-RKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRL 458 (516) T ss_pred eeeccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHH Confidence 32 21 11111233 4456777777666655443322 11 11234555555543333334444333 Q ss_pred HHHHHH-----hCCcccHHHHHH-HhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccc Q lcl|NC_021537. 423 RVRAMR-----LAGVGTVNEARE-ELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKI 489 (602) Q Consensus 423 ~~~~~~-----~~G~~T~NE~R~-~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (602) ++..+- -..+++.+=+++ .|.+.-.+-.. ...........+-. ..|..+... 
T Consensus 459 R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~-------e~k~I~~E~~~~~~--------~~p~~~~~f 516 (516) T protein:vir:10 459 RVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQ-------EEKQIEQEAGIKRF--------QNPENEDDF 516 (516) T ss_pred HHHHHHHhhhhhccccchHHHHHHHhcCCHhhHHH-------HHHHHHHhhhCCCC--------CCCCccccC Confidence 333221 123444444433 34442111000 00000000000000 000000111 No 228 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=95.36 E-value=0.00033 Score=39.70 Aligned_cols=273 Identities=13% Similarity=0.041 Sum_probs=115.2 Q ss_pred CCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccc----cccccccchhhhhcccCceeEEEEcCC Q lcl|NC_021537. 100 STPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTT----TTIEREDGEEVENIESGHGYVQVRQGR 175 (602) Q Consensus 100 ~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~----~~~~~~~~~~~~~~~~~~~~~qi~~~~ 175 (602) |+...+-++. .| -.| ..+..-+|.+--..... +-+...++....... ..|+.-..|+ T Consensus 1 ~~~~~~~~~~-~~-----~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 61 (279) T protein:vir:40 1 MSLFNLSRRA-ED-----VSF-----------STFTVQDPTTDLLLGKLLGLVSYFDNVDYSEASKLE--DLFYWALQGK 61 (279) T ss_pred Ccccccchhh-cc-----cce-----------eeeeecCcchhHHHHHHHHHHHHhhcccchhhhhhh--hhhhhhhccc Confidence 1111111110 00 000 01111111110000000 000000111100000 1112222222 Q ss_pred cceeecccccccc-cceeeecccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021537. 176 RRYFGEAGDRYGD-DKRFVDKETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVF 254 (602) Q Consensus 176 ~~~~~~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f 254 (602) ..+-..+|.+.-. .....++-..-+....-+.+++|-.+|.-| .++.+|+-+- ....-+++... 
-...+ + T Consensus 62 ~~~~~~~~~~~~~~~~~~~d~fn~~vr~~~~~~vtVP~~Dv~Ii-----eNPlv~v~~e-e~~kM~~la~n--ai~~K-L 132 (279) T protein:vir:40 62 EVYRVWYGGFKYYAQRVNADQFNIVVREPNRREVTIRTNDYEML-----LNPFYGANPQ-RFGVMFGMASN--GIGRR-L 132 (279) T ss_pred eeehhhhhhHHHHHhhcCcchhhhheecCCcceeEeecchhhhh-----hcchheeccc-hhhHHHHHHHh--hhhhh-h Confidence 2222222221100 000011111112212223345555554433 3444665443 22222322211 12223 3 Q ss_pred HhcCCCceEEEeccccCCHHHHHHHHHHHHHhh-cccccCcceeccCCccceeccccccccccccccccccchHHHHHHH Q lcl|NC_021537. 255 DNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLK-GSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQA 333 (602) Q Consensus 255 ~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~-g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e 333 (602) .+.+..+++|+.+...-.++.+++.+..++++. ++++-+.+.+++.|.+..++.+..+ .++..|. T Consensus 133 D~~~qIk~fIKTd~d~glee~kekaR~rIk~mlalAk~~nGityid~~ddItQL~kDYS----------tslk~di---- 198 (279) T protein:vir:40 133 DSQAQIKIYWKTKVSSGLKEVWDRIRERLTQQQQLAREFNGVSVIGSDDDIKQIQPDYS----------GSLQNDA---- 198 (279) T ss_pred cccceeeeEEecCcchhHHHHHHHHHHHHHHHHHHHHhcCCeeeecCCceeEeeccccc----------cccHHHH---- Confidence 677778899988755556777888887776644 4555577788888877777665432 2333344 Q ss_pred HHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccccceEEEeccchhcch Q lcl|NC_021537. 334 FRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKIIHQDALDVDEWTIDFELRGAEQP 413 (602) Q Consensus 334 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~~~f~~~~~~~~ 413 (602) ++.+.+.++.||||..++- + +..|.+...|+..+|.|++++.+..|-. .. ++.+.|-. T Consensus 199 --e~lkS~l~Sq~GinekIL~----G--sAtE~q~iAyy~rtVePILkQyek~liY----~~----E~fv~y~t------ 256 (279) T protein:vir:40 199 --NLAIEIALSEYGMPRELLY----G--QSNEVTIIAFAIQKVLPLLKQHDKNIIF----NQ----ENFVAYIS------ 256 (279) T ss_pred --HHHHHHHHhhcCCchhhcc----c--cCchhhhhhHHHhhHHHHHHHhcccccc----hh----hhhhhhhe------ Confidence 4566889999999988872 2 2337788999999999999997764321 11 11111110 Q ss_pred hHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCC Q lcl|NC_021537. 
414 EQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDD 450 (602) Q Consensus 414 ~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g 450 (602) +-...|.+ |-.-...+-+|+.++ T Consensus 257 ------------tta~gg~~--~s~~~~~~~~~~~~~ 279 (279) T protein:vir:40 257 ------------TTAKGGAI--ESKSSKRDSEPVGND 279 (279) T ss_pred ------------ecccCccc--ccccccccCCCCCCC Confidence 11112221 111122334555433 No 229 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=95.35 E-value=0.0022 Score=35.10 Aligned_cols=433 Identities=11% Similarity=0.069 Sum_probs=167.4 Q ss_pred CCCCcc---cccccch-----h---hhcccCccccCCC-------C----HHHHHHHHhhhHHHHHHHHHHHHhhccC-- Q lcl|NC_021537. 1 MSKAEE---TTQLDER-----H---IATDVGRGIQPPY-------N----PETLAAFQELNETHQACIRKKSRYEAGY-- 56 (602) Q Consensus 1 ~~k~~~---~~~~~~~-----~---~~~~~~~~i~p~~-------~----~~~l~~~~~~~~~v~~cI~~ia~~ia~~-- 56 (602) -.|+.. +..-++. . ..+.+|+...-.+ + +...|.++ .+|-|..||+-|.+.+.-. T Consensus 24 ~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~ 102 (521) T protein:vir:81 24 KDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGLM-NNHEVENAVQNIVNDAIVFEE 102 (521) T ss_pred ccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHHh-hccchhhHHHHhhcceeEecC Confidence 111111 1111111 0 1111222211111 1 13345554 4788899999998877632 Q ss_pred ---ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC--CCce Q lcl|NC_021537. 57 ---GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG--DGTP 131 (602) Q Consensus 57 ---~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~--~G~~ 131 (602) |..|... ..+.+....+.+....... ..++ +-..+..+++ +.|.+.|..|+.++-+. 
...+ T Consensus 103 ~~~pV~l~L~----~~~~s~~iK~kI~eeF~~I---l~ll---~F~~~~~~~f----R~WYVDgRi~fhkiid~~pk~GI 168 (521) T protein:vir:81 103 GHEVVSLNLE----ATGFSESVKERIHEEFKDL---LNTI---QFDRRGQDMF----RRWYVDSRIFFHKIIGKNPKDGI 168 (521) T ss_pred CCceEEEEec----ccccchHHHHHHHHHHHHH---HHHh---ccchhhhHHH----hhhhhcceEEEEEEEcCCccccc Confidence 3333221 1222222223333322211 1121 1223344444 45677899999999553 3468 Q ss_pred EEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEec Q lcl|NC_021537. 132 VGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGP 211 (602) Q Consensus 132 ~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 211 (602) .+|.+|||..|+.......... .+. ..+...+..|+|..+.... ...|... ..+....++ T Consensus 169 ~Elr~lDPr~i~~vr~i~k~~~----------~~~--~v~~~~~e~f~Y~~~~~~~-------~~~g~~~-~~~~~vkI~ 228 (521) T protein:vir:81 169 VELRQLDPRNLEYVREIITEDT----------PEG--KIYKATKEYFIYTVGNSSY-------CAGGQVF-SPNSRVKIP 228 (521) T ss_pred eeeeeeCCcceeeeeeeccccc----------Ccc--ceecceeeeeeeecCCccc-------cccceee-cCCcceeec Confidence 9999999999985433221111 010 0011112223332221110 0011111 223345666 Q ss_pred hhHEEEec-CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCHHHHHHHHHHHHHhhc- Q lcl|NC_021537. 212 ANELIFLP-NPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSEDSKEDLRNLMDNLKG- 288 (602) Q Consensus 212 ~~eviH~r-~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~~~~~~l~~~~~~~~g- 288 (602) .+-|.+.. +.-+.++-.=+|-|..|.+.+.....++...--|=-.-+.-+-|..+. |......+.+-+++.+..++. 
T Consensus 229 ~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNk 308 (521) T protein:vir:81 229 RSAITYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNR 308 (521) T ss_pred hhheeeeeccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCce Confidence 66655543 122222223357778887777766666665544322333333444443 223333444445555544432 Q ss_pred ---c------cccCcceeccCCccceeccccccccccccccccc-cchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc Q lcl|NC_021537. 289 ---S------RYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGA-REDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST 358 (602) Q Consensus 289 ---~------~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~-~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 358 (602) . .+..+.+..-+. -++.--.-..+.+++.|.- .+.-+| +=.++..+...++++||.+.|+..++ T Consensus 309 lvYDa~TGev~ddrk~msMlED---yWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~ 382 (521) T protein:vir:81 309 VVYDASTGKLKNQQANLSMTED---YWLQRRDGKAITDVTTLPGASGMSDI---DDIRYFNRKLYEALRVPLSRSNLSDA 382 (521) T ss_pred eEeecccccccccccccchhhh---hcccccCCCcccceeecccCCCCChH---HHHHHHHHHHHHHhCCccccccCCCC Confidence 0 111111111000 0000000011222322221 122233 33456778999999999999964433 Q ss_pred CCc----cC-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc----c--------ccccceEEEeccchhcchhHHHHHHH Q lcl|NC_021537. 359 SNR----AN-SKEQTREFAKGIIEPEQAKFSARLYKIIHQD----A--------LDVDEWTIDFELRGAEQPEQDAKMAE 421 (602) Q Consensus 359 ~~~----sn-~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~----~--------~~~~~~~~~f~~~~~~~~~~d~~~~~ 421 (602) +++ ++ +.-.-.-| ...|.-+...|...|...|-.+ . ......++.|..+.-.....+.+... 
T Consensus 383 ~~~~~Gr~~EItRDEiKF-~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~ 461 (521) T protein:vir:81 383 NMVIGGDGSEITRDELEF-SKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILE 461 (521) T ss_pred cceeccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHH Confidence 322 21 22222233 4456666666665554433221 1 11122455555544333333333333 Q ss_pred HHHHHHHh-----CCcccHHHHH-HHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccccccc Q lcl|NC_021537. 422 QRVRAMRL-----AGVGTVNEAR-EELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSV 495 (602) Q Consensus 422 ~~~~~~~~-----~G~~T~NE~R-~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (602) +++..+-. .-.++.+=++ ..|.+.-.+- . ............+-.. T Consensus 462 ~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei---~----~~~k~I~~E~~~~~~~---------------------- 512 (521) T protein:vir:81 462 RRIGLIERITPYIGKYFSNQTVMRDILKYTDDQM---D----TEKKQIEEEANDPRFK---------------------- 512 (521) T ss_pred HHHHHHHHhhhhhccccchHHHHHHHhccCHHHH---H----HHHHHHHHHhhCCCCC---------------------- Confidence 33322211 0111222222 2222211000 0 0000000000000000 Q ss_pred cccccccchhhhhc Q lcl|NC_021537. 496 DVDVSKDPIEQTTF 509 (602) Q Consensus 496 ~~~~~~~~m~~~~v 509 (602) .+.-++..+ T Consensus 513 -----~p~~~~~~f 521 (521) T protein:vir:81 513 -----QTPDEIEDF 521 (521) T ss_pred -----CCcccccCC Confidence 000000000 No 230 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=95.23 E-value=0.0025 Score=34.87 Aligned_cols=431 Identities=10% Similarity=0.050 Sum_probs=170.4 Q ss_pred CCCCccc---ccccchh------hhcccCcccc-------CCC-CH----HHHHHHHhhhHHHHHHHHHHHHhhccC--- Q lcl|NC_021537. 
1 MSKAEET---TQLDERH------IATDVGRGIQ-------PPY-NP----ETLAAFQELNETHQACIRKKSRYEAGY--- 56 (602) Q Consensus 1 ~~k~~~~---~~~~~~~------~~~~~~~~i~-------p~~-~~----~~l~~~~~~~~~v~~cI~~ia~~ia~~--- 56 (602) ..++... ..-++.. ..+-+++++. +.+ +- ...|.++ .+|-|..||+-|.+.+.-. T Consensus 25 ~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~ma-~~pEvd~Av~eIvneaiv~d~~ 103 (521) T protein:vir:10 25 SDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSLS-KYHEVDNAIDEIINDAIVQEDN 103 (521) T ss_pred ccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHHHh-hccchhhHHHhhhcceEEecCC Confidence 0111100 0001100 0010111111 111 22 3345554 4788899999998877632 Q ss_pred --ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC---CCce Q lcl|NC_021537. 57 --GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG---DGTP 131 (602) Q Consensus 57 --~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~---~G~~ 131 (602) |..|.. +..+.+....+.+....... ..++ +-..+..+++ +.|.+.|..|..++-|. ...+ T Consensus 104 ~~pV~i~L----d~~~~s~~iK~kI~eeF~~I---l~ll---~F~~~~~~~f----R~WYVDgRi~fHkiid~~~pk~GI 169 (521) T protein:vir:10 104 RDTVYLDL----DKTDWNESVKEMVREEFRTI---LKLL---KFEREGKRHF----RRWYVDSRIYFHKMIDPARPKDGI 169 (521) T ss_pred CceEEEEe----cCcccchHHHHHHHHHHHHH---HHHh---ccchhhhHHH----hhheeeeeEEEEEEeeCCCccccc Confidence 333322 11222222222232222211 1121 1223344444 45677899999987653 3358 Q ss_pred EEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEec Q lcl|NC_021537. 132 VGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGP 211 (602) Q Consensus 132 ~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 211 (602) .+|.+|||..|+......+... +....+...+..+.|.... 
...+......+....++ T Consensus 170 ~Elr~lDPr~i~~vr~i~k~~~------------~~~~v~~~~~e~f~Y~~~~----------~~~~~~~g~~~~~vkI~ 227 (521) T protein:vir:10 170 KELRLLDPRNVEYYRVNLKSNE------------NGNDVYKGVKEFFTYGATE----------DNRYNISGNSNNLVQIP 227 (521) T ss_pred eeeeeeCCcceeeeeeecCCCC------------CcchhhccceeeeeeccCC----------CceecCCCCCCcceeec Confidence 8999999999974432222110 1111111111111111000 00011111234456788 Q ss_pred hhHEEEec-CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccC-CHHHHHHHHHHHHHhhc- Q lcl|NC_021537. 212 ANELIFLP-NPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTL-SEDSKEDLRNLMDNLKG- 288 (602) Q Consensus 212 ~~eviH~r-~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~-~~~~~~~l~~~~~~~~g- 288 (602) .+.|.|.. +.-..++.+.+|-|..|.+.+.....++...--|=-.-+.-+-|..+.=+.+ ...+.+-+++.+..++. T Consensus 228 ~daI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNk 307 (521) T protein:vir:10 228 IDAIVYSHSGKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNR 307 (521) T ss_pred hhheeeecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCce Confidence 87777665 1223455688898999988877776666655443222333333443332222 33333334443333221 Q ss_pred ---------ccccCcceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc Q lcl|NC_021537. 289 ---------SRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST 358 (602) Q Consensus 289 ---------~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 358 (602) ..+..+.+..-+. -++.--....+.+++.|.-. 
+.-+| +=.++..+...++++||.+.|+...+ T Consensus 308 lVYDa~TGev~ddrk~msMlED---yWLpRReGgrgTEI~TLpggqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~ 381 (521) T protein:vir:10 308 VVYDSSTGKVKNSSNNLAMTED---YWLMRRDGKATTEVSTLPGAQSMGEM---DDVRWFNRKLYESMKIPLSRLPQEGA 381 (521) T ss_pred EEEeccCceeccchhhhhhHhh---hcccccCCCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCccccCCCCC Confidence 0011111100000 00000000012222222211 11233 33456778999999999999865422 Q ss_pred -CC--ccC-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc---------cc---cccceEEEeccchhcchhHHHHHHHH Q lcl|NC_021537. 359 -SN--RAN-SKEQTREFAKGIIEPEQAKFSARLYKIIHQD---------AL---DVDEWTIDFELRGAEQPEQDAKMAEQ 422 (602) Q Consensus 359 -~~--~sn-~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~---------~~---~~~~~~~~f~~~~~~~~~~d~~~~~~ 422 (602) -+ .++ +.-.-.-| ...|.-+...|...|...|-.+ .+ .....++.|..+.-.....+.+...+ T Consensus 382 ~f~~Gr~~EItRDEikF-~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~ 460 (521) T protein:vir:10 382 GVTFGAGNDITRDELQF-TKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILER 460 (521) T ss_pred ceecccccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHH Confidence 22 222 22222233 4456777777666655443322 11 11234555555543333334444333 Q ss_pred HHHHHHh-------CCcccHHHHH-HHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccc Q lcl|NC_021537. 423 RVRAMRL-------AGVGTVNEAR-EELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKI 489 (602) Q Consensus 423 ~~~~~~~-------~G~~T~NE~R-~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (602) ++..+-. .-+++.+=++ ..|.+.-.+-...+ .........+--..+..+ .... 
T Consensus 461 R~~~l~~~dp~~yvGky~s~dyi~k~ILr~tDeeik~~~-------k~I~~E~~~~~~~~p~~e-------~~df 521 (521) T protein:vir:10 461 RVNLVQTLASAEVTGKYLSHEYVMKNILRMSDEDIKTER-------EKIDGELKDSVYKNPEDP-------MEEF 521 (521) T ss_pred HHHHHHhhcCccccccccchHHHHHHHhcCCHhHHHHHH-------HHHHHhhhCCCCCCCcch-------hhcC Confidence 3332211 1133333333 23343211000000 000000000000000000 0011 No 231 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=94.94 E-value=0.0031 Score=34.32 Aligned_cols=432 Identities=10% Similarity=0.036 Sum_probs=171.2 Q ss_pred CCCCcccccccchhh----------hcccCccc-------cCCC-CH----HHHHHHHhhhHHHHHHHHHHHHhhccC-- Q lcl|NC_021537. 1 MSKAEETTQLDERHI----------ATDVGRGI-------QPPY-NP----ETLAAFQELNETHQACIRKKSRYEAGY-- 56 (602) Q Consensus 1 ~~k~~~~~~~~~~~~----------~~~~~~~i-------~p~~-~~----~~l~~~~~~~~~v~~cI~~ia~~ia~~-- 56 (602) +++...|-....... ...+++.. ++.+ +- ...|.++ .+|-|..||+-|.+.+.-. T Consensus 28 ~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaIv~~~ 106 (524) T protein:vir:98 28 LKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQLINTYRGIM-SYPEVENAVSEIIDDAIVNEQ 106 (524) T ss_pred hcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHHHHHHHHHh-hccchhhHHHhhhcceeEecC Confidence 111111111111000 01122222 2222 12 3334554 4788899999888877522 Q ss_pred ---ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCc--e Q lcl|NC_021537. 57 ---GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGT--P 131 (602) Q Consensus 57 ---~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~--~ 131 (602) |..|.. +..+.+....+.+....... ..++ +-..+..+++ +.+.+.|..|+.++-+.+.. 
+ T Consensus 107 ~~~pV~l~L----~~~~~s~~iK~kI~eeF~~I---l~ll---~F~~~~~~~f----R~WYVDgRi~fhkiid~~~~kGI 172 (524) T protein:vir:98 107 GKDIITMDL----AKTNFSKAIQDKIVEEFDNV---LNIY---DFDNMGARLF----RDWYVDSRIYFHKIMHKDESKGI 172 (524) T ss_pred CCceEEEEe----cccccchHHHHHHHHHHHHH---HHHh---ccchhhhHHH----hhhhhcceeEEEEEEcCCCCcce Confidence 333322 11222222223333322211 1121 1223344444 45667899999999665433 8 Q ss_pred EEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEec Q lcl|NC_021537. 132 VGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGP 211 (602) Q Consensus 132 ~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 211 (602) .+|.+|||..|+.....-.-....+.. .+......|.|..+... ...+|.+...++ ...++ T Consensus 173 ~ELr~lDPr~i~~vr~~~~~~~~~~~~-----------v~~~~~e~f~Y~~~~~~-------~~~~g~~~~~~~-~ikI~ 233 (524) T protein:vir:98 173 RELRQLDPRCMELIRESITETLDGGVK-----------VFRGYREFFVYSAPKAG-------YTYNGQIYQANQ-KIKIP 233 (524) T ss_pred eeeeeeCCccceeeeeccccccccchh-----------hccceeeeeeeccCCCc-------cccccceecCCC-ceeec Confidence 899999999997532211110011110 11111122222211110 111233333333 47899 Q ss_pred hhHEEEecC--CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCHHHHHHHHHHHHHhhc Q lcl|NC_021537. 212 ANELIFLPN--PSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSEDSKEDLRNLMDNLKG 288 (602) Q Consensus 212 ~~eviH~r~--~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~~~~~~l~~~~~~~~g 288 (602) .+.|.|..- .+..+++ +|-|..|.+.+.....++...--|=-.-+.-+-|..+. |......+.+-+++.+..++. 
T Consensus 234 ~dAIvy~hSGL~d~~~~i--isyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kN 311 (524) T protein:vir:98 234 RSAIVYAHSGLEDCSNNI--IGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKN 311 (524) T ss_pred hhheeeeccCcccCCCCe--eeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCc Confidence 999999862 2222222 47777787777666666665543322233333444443 222333444445555544431 Q ss_pred ----------ccccCcceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhccc Q lcl|NC_021537. 289 ----------SRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINVTS 357 (602) Q Consensus 289 ----------~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~ 357 (602) ..+..+.+..-+. -++.--....+.+++.|.-. +.-+| +=.++..+...++++||.+.|+..+ T Consensus 312 klvYDa~TGevrddrk~msMlED---yWLpRReGgrgTEItTLpggqnlgem---~DV~YF~kkLy~aLnVP~sRl~~~~ 385 (524) T protein:vir:98 312 RVVYDARTGTVKNQQNNLSMTED---YWLMRRDGKAITEVSTLPGGQNFSDM---DDIKWFNRKLYEALRVPLSRMPRDD 385 (524) T ss_pred eeEeeccCceeeccccccchhhh---hcccccCCCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCceeccCCC Confidence 0111111111000 00000000112222222211 11233 3345677899999999999986432 Q ss_pred c-CC--ccC-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc----c--------ccccceEEEeccchhcchhHHHHHHH Q lcl|NC_021537. 358 T-SN--RAN-SKEQTREFAKGIIEPEQAKFSARLYKIIHQD----A--------LDVDEWTIDFELRGAEQPEQDAKMAE 421 (602) Q Consensus 358 ~-~~--~sn-~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~----~--------~~~~~~~~~f~~~~~~~~~~d~~~~~ 421 (602) + -+ .++ +.-.-.-| ...|.-+...|...|...|-.+ . ......++.|..+.-.....+.+... 
T Consensus 386 ~~f~~Gr~~EItRDEiKF-~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~ 464 (524) T protein:vir:98 386 GGMQIGGGGEITRDELKF-SKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILE 464 (524) T ss_pred CccccccccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHH Confidence 1 12 222 22222233 4456666666665554433221 1 11122455555554333333444433 Q ss_pred HHHHHHHh-----CCcccHHHHH-HHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccc Q lcl|NC_021537. 422 QRVRAMRL-----AGVGTVNEAR-EELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKI 489 (602) Q Consensus 422 ~~~~~~~~-----~G~~T~NE~R-~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (602) +++..+-. .-.++.+=++ ..|.+.-.+-. ............+--..+..+. ... T Consensus 465 ~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~-------~~~k~I~~E~k~~~~~~p~~e~-------~~f 524 (524) T protein:vir:98 465 RRLNLMSQVEGVVGKYVSHKYIMKEILRMSDEDID-------EQAKLIEEESKEERFKNPEAEE-------ENF 524 (524) T ss_pred HHHHHHHHhccccccccchHHHHHHHhccCHHHHH-------HHHHHHHHHHhCCCCcCCcccc-------ccC Confidence 33332211 1123333332 23333110000 0000000011110000000000 001 No 232 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=94.85 E-value=0.0033 Score=34.16 Aligned_cols=438 Identities=12% Similarity=0.051 Sum_probs=175.8 Q ss_pred CCcccccccchh-hhcccCcc---ccCCCCHHHHHH------HHhhhHHHHHHH-------------HHHHHhhccCceE Q lcl|NC_021537. 3 KAEETTQLDERH-IATDVGRG---IQPPYNPETLAA------FQELNETHQACI-------------RKKSRYEAGYGFE 59 (602) Q Consensus 3 k~~~~~~~~~~~-~~~~~~~~---i~p~~~~~~l~~------~~~~~~~v~~cI-------------~~ia~~ia~~~~~ 59 (602) -..+.+|-.... |-.. ... 
..|+.|-..|++ +..|+.--.+.| .--+..|.+-..+ T Consensus 1 ~~~~~~~~~~~~~~~~g-~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~~~~~ 79 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAG-EANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLIEAKMR 79 (527) T ss_pred CCccccccCCCcCcCCc-cccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhhCCcce Confidence 222222221111 2110 112 245555544442 222221000000 0001112222222 Q ss_pred EEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC---CCceEEEEE Q lcl|NC_021537. 60 IVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG---DGTPVGLAH 136 (602) Q Consensus 60 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~---~G~~~~L~~ 136 (602) +.-..-.-..+.. .+.+...+ ...-...++.....+.-++-++.|.+.+.+.+|. .|.-+++.. T Consensus 80 ~~~~g~~~~~~~~---~e~v~~~l----------r~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~ 146 (527) T protein:vir:10 80 FLGQGLKWEFSKK---DAKVDDAI----------KVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHE 146 (527) T ss_pred eeccCccccccch---hHHHHHHH----------HHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEee Confidence 2111100001111 11111111 1111123455666777788899999999999984 455678999 Q ss_pred eCcccccccccccccccccchh-h--hhcccCce--eEEEEcCCcceeecccc--------------cccc--cce---- Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEE-V--ENIESGHG--YVQVRQGRRRYFGEAGD--------------RYGD--DKR---- 191 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~-~--~~~~~~~~--~~qi~~~~~~~~~~~~~--------------~~~~--~~~---- 191 (602) +||.++-+..+........+.. + ...+.... +.-.+ ...|.++.++ .... +.. 
T Consensus 147 ~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar--~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e 224 (527) T protein:vir:10 147 VDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCAR--VQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPE 224 (527) T ss_pred cCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehh--hhhhhhhcCcccccccCcceeeeeceeeccccccccc Confidence 9998887665554333322221 0 01111000 00000 0001111100 0000 000 Q ss_pred -eeecccceEEecCcee----EEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEe Q lcl|NC_021537. 192 -FVDKETGEVASDAGEL----KNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKV 266 (602) Q Consensus 192 -~~~~~~g~~~~~~~~~----~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~ 266 (602) -++..+......+... ..+.-=.|+||++..+.+..+|.|-++-++..+........-......-++.|-.+++ T Consensus 225 ~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~t- 303 (527) T protein:vir:10 225 SPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATD- 303 (527) T ss_pred cccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeec- Confidence 0001111111111111 1122234899998888999999999987776666654444444444555666655543 Q ss_pred ccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHh Q lcl|NC_021537. 267 TGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVH 346 (602) Q Consensus 267 ~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~f 346 (602) +...- + ..|..+ .+.+.+|+-+.- +.+-++.-++... 
.=..|...+....+.|+..= T Consensus 304 -g~~~v-d-----------~~G~~~---~~~VgPG~iweL------~e~ak~~~v~~~~-~la~~~~h~~~L~~~l~~vA 360 (527) T protein:vir:10 304 -SAPPR-D-----------SRGNMV---PWTISPLGMVEH------GQNNKIYRVNGVA-SLEPSQTHMTKAEEAMQQTK 360 (527) T ss_pred -ccccc-c-----------ccCCcC---ccccCCceeEec------CCCcceeeccchh-hhHHHHHHHHHHHHHHHHhh Confidence 21111 0 011111 111122222211 1112333333221 22347777888889999999 Q ss_pred cCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHH----------------------HHhhhcCCccccccceEEE Q lcl|NC_021537. 347 GVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSA----------------------RLYKIIHQDALDVDEWTID 404 (602) Q Consensus 347 gVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~----------------------~ln~~Ll~~~~~~~~~~~~ 404 (602) ++|.+-+|..+.++-.+ .-++ .-.++|++.+.+. +.....+..........+. T Consensus 361 ~~PavA~G~vD~s~~~S-G~AL----eL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~iv 435 (527) T protein:vir:10 361 GIPDIAVGVVDAAVAES-GIAL----DLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTIT 435 (527) T ss_pred cCCeeeeccccCCcCcH-HHHH----HHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEE Confidence 99999999666544222 1111 2223444432111 1111111111111123444 Q ss_pred eccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCC-CCCCCccccc-ccc-------ccccc--cccccCCCcC Q lcl|NC_021537. 405 FELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLA-PFEDDRGDMT-LSE-------FEAEF--GADASDGDAE 473 (602) Q Consensus 405 f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~-p~~~g~~d~~-~~~-------~~~~~--~~~~~~~~~~ 473 (602) |- ..+-. |.+...+.+.+++.+|+++.-=+-++|+-- -+++++.+.- +.. -.+.+ ....+.++.. T Consensus 436 f~--p~lP~--D~~avie~v~tL~~aGi~S~~tAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~ 511 (527) T protein:vir:10 436 FR--DPKPV--NSEKRFNQLLQLWEAGLIPAKKLTEELSKIMGFELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQ 511 (527) T ss_pred ec--ccCCC--CHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCChHHHHHHHHHHHHHHhHHhhhhcCchhhhhcccc Confidence 43 33322 445566778899999999999988877211 1344433311 000 00000 0111111111 Q ss_pred ccccccccccccccccc Q lcl|NC_021537. 
474 AMLTRSKAAPPLENKIG 490 (602) Q Consensus 474 ~~~~~~~~~~~~~~~~~ 490 (602) +...+..++ ..+.... T Consensus 512 g~~~~~~d~-~~~~~~~ 527 (527) T protein:vir:10 512 GIPDEEDDQ-ALNGQPL 527 (527) T ss_pred CCCCCCccc-ccCCCCC Confidence 111111111 0011100 No 233 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=94.74 E-value=0.0036 Score=33.97 Aligned_cols=438 Identities=12% Similarity=0.050 Sum_probs=175.7 Q ss_pred CCcccccccchh-hhcccCcc---ccCCCCHHHHHH------HHhhhHHHHHHH-------------HHHHHhhccCceE Q lcl|NC_021537. 3 KAEETTQLDERH-IATDVGRG---IQPPYNPETLAA------FQELNETHQACI-------------RKKSRYEAGYGFE 59 (602) Q Consensus 3 k~~~~~~~~~~~-~~~~~~~~---i~p~~~~~~l~~------~~~~~~~v~~cI-------------~~ia~~ia~~~~~ 59 (602) -..+.+|-.... |-.. ... ..|+.|-..|++ +..|+.--.+.| .--+..|.+-..+ T Consensus 1 ~~~~~~~~~~~~~~~~g-~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~~~~~ 79 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAG-EANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLIEAKMR 79 (527) T ss_pred CCccccccCCCcCcCCc-cccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhhCCcce Confidence 222222221111 2110 112 245555544442 222221000000 0001112222222 Q ss_pred EEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC---CCceEEEEE Q lcl|NC_021537. 60 IVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG---DGTPVGLAH 136 (602) Q Consensus 60 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~---~G~~~~L~~ 136 (602) +.-..-.-..+.. .+.+...+ ...-...++.....+.-++-++.|.+.+.+.+|. .|.-+++.. 
T Consensus 80 ~~~~g~~~~~~~~---~e~v~~~l----------r~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~ 146 (527) T protein:vir:10 80 FLGQGLKWEFSKK---DAKVDDAI----------RVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHE 146 (527) T ss_pred eeccCccccccch---hHHHHHHH----------HHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEee Confidence 2111100001111 11111111 1111123455666777788899999999999984 455678999 Q ss_pred eCcccccccccccccccccchh-h--hhcccCce--eEEEEcCCcceeecccc--------------cccc--cce---- Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDGEE-V--ENIESGHG--YVQVRQGRRRYFGEAGD--------------RYGD--DKR---- 191 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~~~-~--~~~~~~~~--~~qi~~~~~~~~~~~~~--------------~~~~--~~~---- 191 (602) +||.++-+..+........+.. + ...+.... +.-.+ ...|.++.++ .... +.. T Consensus 147 ~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar--~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e 224 (527) T protein:vir:10 147 VDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCAR--VQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPE 224 (527) T ss_pred cCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehh--hhhhhhhcCcccccccCcceeeeeceeeccccccccc Confidence 9998887665554333322221 0 01111000 00000 0001111100 0000 000 Q ss_pred -eeecccceEEecCcee----EEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEe Q lcl|NC_021537. 192 -FVDKETGEVASDAGEL----KNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKV 266 (602) Q Consensus 192 -~~~~~~g~~~~~~~~~----~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~ 266 (602) -++..+......+... 
..+.-=.|+||++..+.+..+|.|-++-++..+........-......-++.|-.+++ T Consensus 225 ~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~t- 303 (527) T protein:vir:10 225 SPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATD- 303 (527) T ss_pred cccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeec- Confidence 0001111111111111 1122234899998888999999999987776666554444444444555666655543 Q ss_pred ccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHh Q lcl|NC_021537. 267 TGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVH 346 (602) Q Consensus 267 ~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~f 346 (602) +...- + ..|..+ .+.+.+|+-+.- +.+-++.-++... .=..|...+....+.|+..= T Consensus 304 -g~~~v-d-----------~~G~~~---~~~VgPG~iweL------~e~ak~~~v~~~~-~la~~~~h~~~L~~~l~~vA 360 (527) T protein:vir:10 304 -SAPPR-D-----------SRGNMV---PWTISPLGMVEH------GQNNKIYRVNGVA-SLEPSQTHMNKAEEAMQQTK 360 (527) T ss_pred -ccccc-c-----------ccCCcC---ccccCCceeEec------CCCcceeeccchh-hhHHHHHHHHHHHHHHHHhh Confidence 21111 0 011111 111122222211 1112333333221 22447777888889999999 Q ss_pred cCChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHH----------------------HHhhhcCCccccccceEEE Q lcl|NC_021537. 347 GVPPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSA----------------------RLYKIIHQDALDVDEWTID 404 (602) Q Consensus 347 gVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~----------------------~ln~~Ll~~~~~~~~~~~~ 404 (602) ++|.+-+|..+.++-.+ .-++ .-.++|++.+.+. +.....+..........+. 
T Consensus 361 ~~PavA~G~vD~s~~~S-G~AL----eL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~iv 435 (527) T protein:vir:10 361 GIPDIAVGVVDAAVAES-GIAL----DLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTIT 435 (527) T ss_pred cCCeeeeccccCCcCcH-HHHH----HHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEE Confidence 99999999666544222 1111 2223444432111 1111111111111123444 Q ss_pred eccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCC-CCCCCccccc-ccc-------ccccc--cccccCCCcC Q lcl|NC_021537. 405 FELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLA-PFEDDRGDMT-LSE-------FEAEF--GADASDGDAE 473 (602) Q Consensus 405 f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~-p~~~g~~d~~-~~~-------~~~~~--~~~~~~~~~~ 473 (602) |- ..+-. |.+...+.+.+++.+|+++.-=+-++|+-- -+++++.+.- +.. -.+.+ ....+.++.. T Consensus 436 f~--p~lP~--D~~avie~v~tL~~aGiiS~etAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~ 511 (527) T protein:vir:10 436 FR--DPKPV--NNEKRFAQLLELWEAGLIPAKKLTEELSKIMGFELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQ 511 (527) T ss_pred ec--ccCCC--CHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCchHHHHHHHHHHHHHHhHHhhhhcCchhhhhcccc Confidence 43 33322 444556778899999999999988877211 1344433311 000 00000 0111111111 Q ss_pred ccccccccccccccccc Q lcl|NC_021537. 474 AMLTRSKAAPPLENKIG 490 (602) Q Consensus 474 ~~~~~~~~~~~~~~~~~ 490 (602) +...+..++ ..+.... T Consensus 512 g~~~~~~d~-~~~~~~~ 527 (527) T protein:vir:10 512 GIPDEEDDQ-ALNGQPL 527 (527) T ss_pred CCCCCCccc-ccCCCCC Confidence 111111111 0011100 No 234 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=94.45 E-value=0.0044 Score=33.51 Aligned_cols=433 Identities=12% Similarity=0.083 Sum_probs=166.7 Q ss_pred CCCCcccccccc------------hhhhcc-----cCccccCCC-CH----HHHHHHHhhhHHHHHHHHHHHHhhccC-- Q lcl|NC_021537. 
1 MSKAEETTQLDE------------RHIATD-----VGRGIQPPY-NP----ETLAAFQELNETHQACIRKKSRYEAGY-- 56 (602) Q Consensus 1 ~~k~~~~~~~~~------------~~~~~~-----~~~~i~p~~-~~----~~l~~~~~~~~~v~~cI~~ia~~ia~~-- 56 (602) .+....|-.... ..+.+. +-+.+++.+ +- ...|.++ .+|-|..||+-|.+.+.-. T Consensus 26 ~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~ 104 (524) T protein:vir:10 26 INNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYRNLM-NNYEVDNAVQEIVSDAIVYED 104 (524) T ss_pred hccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHHHHh-hccchhhHHHHhhcceeEecC Confidence 110000100000 011100 001123221 22 3334554 4788899999988877632 Q ss_pred ---ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC---CCc Q lcl|NC_021537. 57 ---GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG---DGT 130 (602) Q Consensus 57 ---~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~---~G~ 130 (602) |..|.. +..+.+....+.+....... ..++ +-..+..+++ +.|.+.|..|..++-|. ... T Consensus 105 ~~~pV~l~L----d~~~~s~siK~kI~eeF~~I---l~ll---~F~~~~~~~f----R~WYVDgRi~fHkiid~~~pk~G 170 (524) T protein:vir:10 105 DKEVVALNL----DGTDFSQSIKDKILAEFSEV---LNLL---NFQRKGTDHF----QRWYVDSRIFFHKIINPKKMKDG 170 (524) T ss_pred CCceEEEEe----cccCcchHHHHHHHHHHHHH---HHHh---ccchhhhHHH----hhheeeceEEEEEEeeCCCcccc Confidence 333322 12222233223333322221 1121 1223344444 45677899999987653 335 Q ss_pred eEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEe Q lcl|NC_021537. 131 PVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNG 210 (602) Q Consensus 131 ~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 210 (602) +.+|.+|||..|+......+ +.. .++..+..-+..+.+.++ ... 
....|.+ ...+....+ T Consensus 171 I~Elr~lDPr~i~~vr~i~~-~~~-----------~~~~vi~~~~e~f~Y~~~--~~~-----~~~~~~~-~~~~~~ikI 230 (524) T protein:vir:10 171 VQELRRLDPRQVQYIREIVT-RME-----------DGVKIVDGYREFFVYDTG--HES-----YCADGRI-YSAGTKVKI 230 (524) T ss_pred ceeeeeeCCccceeeeeecc-cCc-----------ccchhhcchhhheeecCC--Ccc-----cccCcce-ecCCcceec Confidence 88999999999975322111 011 111111111222222211 100 0111222 234456789 Q ss_pred chhHEEEecC-CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCHHHHHHHHHHHHHhhc Q lcl|NC_021537. 211 PANELIFLPN-PSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSEDSKEDLRNLMDNLKG 288 (602) Q Consensus 211 ~~~eviH~r~-~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~~~~~~l~~~~~~~~g 288 (602) +.+.|.|..- .-+.++-.=+|-|..|.+.+.....++...--|=-.-+.-+-|..+. |......+.+-+++.+..++. T Consensus 231 ~~dAIvy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kN 310 (524) T protein:vir:10 231 PRAAVVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKN 310 (524) T ss_pred chhheeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCc Confidence 9999999862 22223323356677777776666656555443322223333333332 222223333334443333211 Q ss_pred c----------cccCcceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhccc Q lcl|NC_021537. 289 S----------RYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINVTS 357 (602) Q Consensus 289 ~----------~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~ 357 (602) . .+..+.+..-+. -++.--.-..+.+++.|.-. +.-+| +=.++..+...++++||.+.|+... 
T Consensus 311 KlvYDa~TGev~ddrk~msMlED---yWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~ 384 (524) T protein:vir:10 311 RVVYDASTGKIKNQQHNMSMTED---YWLQRRDGKAVTEVDTMPGATGMSDM---DDVLYFRTALYRALRIPESRIPSES 384 (524) T ss_pred eeEEeccCCeeccchhhhhhHhh---hcccccCCCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCchhccCCC Confidence 0 011111100000 00000000011222222111 11223 3345677899999999999995433 Q ss_pred cCC--c--cC-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc---------cc---cccceEEEeccchhcchhHHHHHH Q lcl|NC_021537. 358 TSN--R--AN-SKEQTREFAKGIIEPEQAKFSARLYKIIHQD---------AL---DVDEWTIDFELRGAEQPEQDAKMA 420 (602) Q Consensus 358 ~~~--~--sn-~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~---------~~---~~~~~~~~f~~~~~~~~~~d~~~~ 420 (602) ++. . ++ +.-.-.-| ...|.-+...|...|...|-.+ .+ .....++.|..+.-.....+.+.. T Consensus 385 ~~~f~~gr~~EItRDEiKF-~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil 463 (524) T protein:vir:10 385 NSGVMFDAGTAITRDELKF-AKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIM 463 (524) T ss_pred CccccccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHH Confidence 222 2 22 22222233 4456677776666655443322 11 112345555555433333344443 Q ss_pred HHHHHHHHh-----CCcccHHHHH-HHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccc Q lcl|NC_021537. 421 EQRVRAMRL-----AGVGTVNEAR-EELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKI 489 (602) Q Consensus 421 ~~~~~~~~~-----~G~~T~NE~R-~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (602) .+++..+-. .-.++.+=++ ..|.+.-.+- . ............+-...+..+ .+.. 
T Consensus 464 ~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei---~----~~~k~I~~E~k~~~~~~~~~~-------~~~f 524 (524) T protein:vir:10 464 ERRINMLTMAEPFIGKYISHQTAMKDFLQMTDEEI---N----QEAKQIEEESKEARFQNPDEE-------EEDF 524 (524) T ss_pred HHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHH---H----HHHHHHHHHhhcCCCCCCChh-------hhcC Confidence 333332211 1111222222 2333311000 0 000000000000000000000 0001 No 235 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=93.44 E-value=0.0075 Score=32.22 Aligned_cols=432 Identities=11% Similarity=0.066 Sum_probs=164.6 Q ss_pred CCCCcccccccchh-h-------hcccCcc-------ccCCC-CH----HHHHHHHhhhHHHHHHHHHHHHhhccC---- Q lcl|NC_021537. 1 MSKAEETTQLDERH-I-------ATDVGRG-------IQPPY-NP----ETLAAFQELNETHQACIRKKSRYEAGY---- 56 (602) Q Consensus 1 ~~k~~~~~~~~~~~-~-------~~~~~~~-------i~p~~-~~----~~l~~~~~~~~~v~~cI~~ia~~ia~~---- 56 (602) .+.++ .+..++.. + ++.+++. +++.+ +- ...|.++ .+|-|..||+-|.+.+.-. T Consensus 29 ~S~~~-p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~ 106 (524) T protein:vir:72 29 VSITA-PKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYRNLM-NNYEVDNAVSEIVSDAIVYEDDT 106 (524) T ss_pred ccccC-ccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCC Confidence 12222 11111111 0 0112222 22222 22 3334554 4788899999988877632 Q ss_pred -ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC---CceE Q lcl|NC_021537. 57 -GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD---GTPV 132 (602) Q Consensus 57 -~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~---G~~~ 132 (602) |..|.. +..+.+....+.+....... ..++ +-..+..+++ +.|.+.|..|..++-|.+ .... 
T Consensus 107 ~pV~l~L----~~~~~s~~iK~kI~eeF~~I---l~ll---~F~~~~~~~f----R~WYVDgRi~fhKiid~k~pk~GI~ 172 (524) T protein:vir:72 107 EVVALNL----DKSKFSPKIKNMMLDEFSDV---LNHL---SFQRKGSDHF----RRWYVDSRIFFHKIIDPKRPKEGIK 172 (524) T ss_pred ceEEEEe----cCcCcchHHHHHHHHHHHHH---HHHh---ccchhhhHHH----hhheeeeEEEEEEEEeCCCccccce Confidence 333322 11122222222222222111 1111 1223344444 456778999999987643 3588 Q ss_pred EEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEech Q lcl|NC_021537. 133 GLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPA 212 (602) Q Consensus 133 ~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 212 (602) +|.+|||..|+....... ....+ +..+...+..+.|..+-- ....+|.. +..+....++. T Consensus 173 Elr~lDPr~i~~vr~i~~-~~~~~-----------~~vi~~~~e~f~Y~~~~~-------~y~~~g~~-~~~~~~ikI~~ 232 (524) T protein:vir:72 173 ELRRLDPRQVQYVREIIT-ETEAG-----------TKIVKGYKEYFIYDTAHE-------SYACDGRM-YEAGTKIKIPK 232 (524) T ss_pred eeeeeCCccceeeeeecc-CCCcc-----------chhhcchhhheeeccCcc-------ccccCccc-cCCCcceecch Confidence 999999999975322111 01111 111111111122211100 01122222 12344567777 Q ss_pred hHEEEecC-CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCHHHHHHHHHHHHHhhcc- Q lcl|NC_021537. 213 NELIFLPN-PSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSEDSKEDLRNLMDNLKGS- 289 (602) Q Consensus 213 ~eviH~r~-~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~~~~~~l~~~~~~~~g~- 289 (602) +-|.|..- .-+.++-.=+|-|..|.+.+.....++...--|=-.-+.-+-|..+. |......+.+-+++.+..++.. 
T Consensus 233 dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNkl 312 (524) T protein:vir:72 233 AAVVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRV 312 (524) T ss_pred hheeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCcee Confidence 77766652 12222223356677777766666555554443322223333343332 2222333333344443333210 Q ss_pred ---cccC------cceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc- Q lcl|NC_021537. 290 ---RYRT------AILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST- 358 (602) Q Consensus 290 ---~nag------~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~- 358 (602) .+.| +.+..-+. -++.--....+.+++.|.-. +.-+| +=.++..+...++++||.+.|..... T Consensus 313 vYDa~TGev~ddrk~msMlED---yWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~d~~~ 386 (524) T protein:vir:72 313 VYDASTGKIKNQQHNMSMTED---YWLQRRDGKAVTEVDTLPGADNTGNM---EDIRWFRQALYMALRVPLSRIPQDQQG 386 (524) T ss_pred EEeCCCCeeccchhhhhhHhh---hcccccCCCcccceeeccccCCcChH---HHHHHHHHHHHHHhCCchhhcCCCCCc Confidence 0011 10000000 00000000011222222211 11233 33456778999999999998832211 Q ss_pred -CC--ccC-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc---------cc---cccceEEEeccchhcchhHHHHHHHH Q lcl|NC_021537. 
359 -SN--RAN-SKEQTREFAKGIIEPEQAKFSARLYKIIHQD---------AL---DVDEWTIDFELRGAEQPEQDAKMAEQ 422 (602) Q Consensus 359 -~~--~sn-~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~---------~~---~~~~~~~~f~~~~~~~~~~d~~~~~~ 422 (602) -+ .++ +.-.-.-| ...|.-+...|...|...|-.+ .+ .....++.|..+.-.....+.+...+ T Consensus 387 ~f~~gr~~EItRDEikF-~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~ 465 (524) T protein:vir:72 387 GVMFDSGTSITRDELTF-AKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILER 465 (524) T ss_pred cccccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHH Confidence 12 122 22222233 4456777777666665443322 11 11234555655543333334444333 Q ss_pred HHHHHHh-----CCcccHHHHH-HHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccc Q lcl|NC_021537. 423 RVRAMRL-----AGVGTVNEAR-EELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKI 489 (602) Q Consensus 423 ~~~~~~~-----~G~~T~NE~R-~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (602) ++..+-. .-.++.+=++ ..|.+.-.+- . ............+-...+..+ .+.. T Consensus 466 R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei---~----~~~k~I~~E~k~~~~~~~~~~-------~~~f 524 (524) T protein:vir:72 466 RINMLTMAEPFIGKYISHRTAMKDILQMTDEEI---E----QEAKQIEEESKEARFQDPDQE-------QEDF 524 (524) T ss_pred HHHHHHHhhhhhcccchhHHHHHHHhccCHHHH---H----HHHHHHHHHhhcCCCCCCchh-------hhcC Confidence 3332211 1111222222 2333311000 0 000000000000000000000 0011 No 236 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=93.22 E-value=0.0083 Score=31.98 Aligned_cols=432 Identities=11% Similarity=0.067 Sum_probs=164.5 Q ss_pred CCCCcccccccchh-h-------hcccCcc-------ccCCC-CH----HHHHHHHhhhHHHHHHHHHHHHhhccC---- Q lcl|NC_021537. 
1 MSKAEETTQLDERH-I-------ATDVGRG-------IQPPY-NP----ETLAAFQELNETHQACIRKKSRYEAGY---- 56 (602) Q Consensus 1 ~~k~~~~~~~~~~~-~-------~~~~~~~-------i~p~~-~~----~~l~~~~~~~~~v~~cI~~ia~~ia~~---- 56 (602) .+.++ .+..++.. + ++.+++. +++.+ +- ...|.++ .+|-|..||+-|.+.+.-. T Consensus 29 ~S~~~-p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~ 106 (524) T protein:vir:10 29 VSITA-PKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYRNLM-NNYEVDNAVSEIVSDAIVYEDDT 106 (524) T ss_pred ccccC-ccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCC Confidence 12222 11111111 0 0112222 22222 22 3334554 4788899999988877632 Q ss_pred -ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC---CceE Q lcl|NC_021537. 57 -GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD---GTPV 132 (602) Q Consensus 57 -~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~---G~~~ 132 (602) |..|.. +..+.+....+.+....... ..++ +-..+..+++ +.|.+.|..|..++-|.+ .... T Consensus 107 ~pV~l~L----~~~~~s~~iK~kI~eeF~~I---l~ll---~F~~~~~~~f----R~WYVDgRi~fhKiid~k~pk~GI~ 172 (524) T protein:vir:10 107 EVVALNL----DKSKFSPKIKNMMLDEFNDV---LNHL---SFQRKGSDHF----RRWYVDSRIFFHKIIDPKRPKEGIK 172 (524) T ss_pred ceEEEEe----cCcCcchHHHHHHHHHHHHH---HHHh---ccchhhhHHH----hhheeeeEEEEEEEeeCCCccccce Confidence 333322 11122222222222222111 1111 1223344444 456778999999987643 3588 Q ss_pred EEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEech Q lcl|NC_021537. 133 GLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPA 212 (602) Q Consensus 133 ~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 212 (602) +|.+|||..|+....... ....+ +..+...+..+.|..+-- ....+|.. +..+....++. 
T Consensus 173 Elr~lDPr~i~~vr~i~~-~~~~~-----------~~vi~~~~e~f~Y~~~~~-------~y~~~g~~-~~~~~~ikI~~ 232 (524) T protein:vir:10 173 ELRRLDPRQVQYVREIIT-ETEAG-----------TKIVKGYKEYFIYDTAHE-------SYACDGRM-YEAGTKIKIPK 232 (524) T ss_pred eeeeeCCccceeeeeecc-CCCcc-----------chhhcchhhheeeccCcc-------ccccCccc-cCCCcceecch Confidence 999999999975322111 01111 111111111122211100 01122222 12344567777 Q ss_pred hHEEEecC-CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCHHHHHHHHHHHHHhhcc- Q lcl|NC_021537. 213 NELIFLPN-PSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSEDSKEDLRNLMDNLKGS- 289 (602) Q Consensus 213 ~eviH~r~-~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~~~~~~l~~~~~~~~g~- 289 (602) +-|.|..- .-+.++-.=+|-|..|.+.+.....++...--|=-.-+.-+-|..+. |......+.+-+++.+..++.. T Consensus 233 dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNkl 312 (524) T protein:vir:10 233 AAIVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRV 312 (524) T ss_pred hheeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCcee Confidence 77766652 12222223356677777776666655555443322223333343332 2222333333344443333210 Q ss_pred ---cccC------cceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc- Q lcl|NC_021537. 290 ---RYRT------AILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST- 358 (602) Q Consensus 290 ---~nag------~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~- 358 (602) .+.| +.+..-+. -++.--....+.+++.|.-. +.-+| +=.++..+...++++||.+.|..... 
T Consensus 313 vYDa~TGev~ddrk~msMlED---yWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~d~~~ 386 (524) T protein:vir:10 313 VYDASTGKIKNQQHNMSMTED---YWLQRRDGKAVTEVDTLPGADNTGNM---EDVRWFRQALYMALRVPLSRIPQDQQG 386 (524) T ss_pred EEeCCCCeeccchhhhhhHhh---hcccccCCCcccceeeccccCCcChH---HHHHHHHHHHHHHhCCchhhcCCCCCc Confidence 0011 10000000 00000000011222222211 11233 33456778999999999998832211 Q ss_pred -CC--ccC-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc---------cc---cccceEEEeccchhcchhHHHHHHHH Q lcl|NC_021537. 359 -SN--RAN-SKEQTREFAKGIIEPEQAKFSARLYKIIHQD---------AL---DVDEWTIDFELRGAEQPEQDAKMAEQ 422 (602) Q Consensus 359 -~~--~sn-~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~---------~~---~~~~~~~~f~~~~~~~~~~d~~~~~~ 422 (602) -+ .++ +.-.-.-| ...|.-+...|...|...|-.+ .+ .....++.|..+.-.....+.+...+ T Consensus 387 ~f~~gr~~EItRDEikF-~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~ 465 (524) T protein:vir:10 387 GVMFDSGTSITRDELTF-AKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILER 465 (524) T ss_pred cccccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHH Confidence 12 122 22222233 4456777777666665443322 11 11234555655543333334444333 Q ss_pred HHHHHHh-----CCcccHHHHH-HHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccc Q lcl|NC_021537. 423 RVRAMRL-----AGVGTVNEAR-EELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKI 489 (602) Q Consensus 423 ~~~~~~~-----~G~~T~NE~R-~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (602) ++..+-. .-.++.+=++ ..|.+.-.+- . ............+-...+..+ .... 
T Consensus 466 R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei---~----~~~k~I~~E~k~~~~~~~~~~-------~~~f 524 (524) T protein:vir:10 466 RINMLTMAEPFIGKYISHRTAMKDILQMTDEEI---E----QEAKQIEEESKEARFQDPDQE-------QEDF 524 (524) T ss_pred HHHHHHHhhhhhcccchhHHHHHHHhccCHHHH---H----HHHHHHHHHhhcCCCCCCchh-------hhcC Confidence 3332211 1111222222 2333311000 0 000000000000000000000 0011 No 237 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=92.94 E-value=0.0094 Score=31.70 Aligned_cols=433 Identities=11% Similarity=0.083 Sum_probs=163.8 Q ss_pred CCCCccc---ccccc-hh-hh-c------ccC-------ccccCCC-CH----HHHHHHHhhhHHHHHHHHHHHHhhccC Q lcl|NC_021537. 1 MSKAEET---TQLDE-RH-IA-T------DVG-------RGIQPPY-NP----ETLAAFQELNETHQACIRKKSRYEAGY 56 (602) Q Consensus 1 ~~k~~~~---~~~~~-~~-~~-~------~~~-------~~i~p~~-~~----~~l~~~~~~~~~v~~cI~~ia~~ia~~ 56 (602) .++...| ...++ .. +. + .++ +.+++.+ +- ...|.++ .+|-|..||+-|.+.+.-. T Consensus 24 ~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~ 102 (523) T protein:vir:68 24 EKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYRNLM-TNYEVDNAVSEIVSDAIVY 102 (523) T ss_pred hhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHHHHh-hccchhhHHHHhhcceeee Confidence 1111111 11111 00 00 0 001 1133322 22 3345554 4788899999988877632 Q ss_pred -----ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC--- Q lcl|NC_021537. 57 -----GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD--- 128 (602) Q Consensus 57 -----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~--- 128 (602) |..|.. +..+.+....+.+....... 
..++ +-..+..+++ +.|.+.|..|..++-|.+ T Consensus 103 d~~~~pV~i~L----d~~~~s~~iK~kI~eeF~~I---l~ll---~F~~~~~~~f----R~WYVDgRi~fhKiid~k~pk 168 (523) T protein:vir:68 103 EDDTEVVSINL----DNTKFSPNIKSMMLDEFNEV---LNHL---SFQRKGSDHF----RRWYVDSRIFFHKIIDPKRPK 168 (523) T ss_pred cCCCceEEEEe----cccccchHHHHHHHHHHHHH---HHHh---ccchhhhHHH----HhheeeeEEEEEEEeeCCCcc Confidence 222221 12222333223333322211 1111 1223344444 456778999999987643 Q ss_pred CceEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeE Q lcl|NC_021537. 129 GTPVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELK 208 (602) Q Consensus 129 G~~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 208 (602) ....+|.+|||..|+.......- ...+ +..+...+..+.|. ..+ .....+|.+.. .+... T Consensus 169 ~GI~Elr~lDPr~i~~vr~i~~~-~~~g-----------~~vi~~~~e~f~Y~--~~~-----~~~~~~g~~~~-~~~~i 228 (523) T protein:vir:68 169 EGIKELRRLDPRQVQYVREVITT-TEAG-----------VKIVKGYKEYFIYD--TSH-----ESYACDGRIYE-AGTKI 228 (523) T ss_pred ccceeeeeeCCcceeEEEeecCC-CCcc-----------hhhhhhhhhheeec--ccc-----ccccccccccC-CCcce Confidence 35889999999999753221110 0111 11111111111111 111 00112233222 23456 Q ss_pred EechhHEEEecC-CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccC-CHHHHHHHHHHHHHh Q lcl|NC_021537. 
209 NGPANELIFLPN-PSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTL-SEDSKEDLRNLMDNL 286 (602) Q Consensus 209 ~~~~~eviH~r~-~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~-~~~~~~~l~~~~~~~ 286 (602) .++.+-|.|..- .-+.++-.=+|-|..|.+.+.....++...--|=-.-+.-+-|..+.=+.+ ...+.+-+++.+..+ T Consensus 229 kI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~ 308 (523) T protein:vir:68 229 KIPKAAIVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTM 308 (523) T ss_pred ecchhheeeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhh Confidence 777777766652 122222233566777777766666555554433222233333433322222 333333344433332 Q ss_pred hcc----------cccCcceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhc Q lcl|NC_021537. 287 KGS----------RYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINV 355 (602) Q Consensus 287 ~g~----------~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~ 355 (602) +.. .+..+.+..-+. -++.--....+.+++.|.-. +.-+| +=.++..+...++++||.+.|.. T Consensus 309 kNKlvYDa~TGev~ddrk~msMlED---yWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~ 382 (523) T protein:vir:68 309 KNRIAYDATTGKIKNQQHIMSMTED---YWLQRRDGKAVTEVDTLPGADNTGNM---EDVRWFRNALYMALRIPITRIPS 382 (523) T ss_pred cceeEEeccCCeeccchhhhhhHhh---hcccccCCCcccceeeccccCCcChH---HHHHHHHHHHHHHhCCcceeecC Confidence 210 011111100000 00000000011222222211 11233 33456778999999999988843 Q ss_pred c-ccCC--ccC-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc---------cc---cccceEEEeccchhcchhHHHHH Q lcl|NC_021537. 356 T-STSN--RAN-SKEQTREFAKGIIEPEQAKFSARLYKIIHQD---------AL---DVDEWTIDFELRGAEQPEQDAKM 419 (602) Q Consensus 356 ~-~~~~--~sn-~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~---------~~---~~~~~~~~f~~~~~~~~~~d~~~ 419 (602) . ++.+ .++ +.-.-.-| ...|.-+...|...|...|-.+ .+ .....++.|..+.-.....+.+. 
T Consensus 383 ~~~~f~~Gr~~EItRDEikF-~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Ei 461 (523) T protein:vir:68 383 DQGGIQFDAGTSITRDELSF-GKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEI 461 (523) T ss_pred CCcceecccccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHH Confidence 2 2122 222 22222233 4456777777666655443322 11 11234555555543333334444 Q ss_pred HHHHHHHHHh-----CCcccHHHHH-HHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccc Q lcl|NC_021537. 420 AEQRVRAMRL-----AGVGTVNEAR-EELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKI 489 (602) Q Consensus 420 ~~~~~~~~~~-----~G~~T~NE~R-~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (602) ..+++..+-. .-.++.+=++ ..|.+.-.+- . ............+-...+..+ .... T Consensus 462 l~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei---~----~~~kqI~~E~k~~~~~~p~~e-------~~~f 523 (523) T protein:vir:68 462 LERRINMLQMAEPFIGKYISHRTAMKDILQMSDEEI---E----QEAKQIEEESKEARFQDPDQE-------QEDF 523 (523) T ss_pred HHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHH---H----HHHHHHHHHhhcCCCCCCchh-------hhcC Confidence 3333332211 1111222222 2333311000 0 000000000000000000000 0011 No 238 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=92.51 E-value=0.011 Score=31.30 Aligned_cols=463 Identities=13% Similarity=0.049 Sum_probs=158.9 Q ss_pred CCCCcccc--cccchh--h---hcccCccccCCC-----CHHH--HHHHHhhhHHHHHHHHHHHHhhccC-------ceE Q lcl|NC_021537. 1 MSKAEETT--QLDERH--I---ATDVGRGIQPPY-----NPET--LAAFQELNETHQACIRKKSRYEAGY-------GFE 59 (602) Q Consensus 1 ~~k~~~~~--~~~~~~--~---~~~~~~~i~p~~-----~~~~--l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~ 59 (602) |+|..... ++..++ + ..+...++-|.. +... ..+.. .++...|++.+|..+.+. 
.|+ T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~--dst~~~a~~~Laa~l~~~ltpp~~~WF~ 78 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPW--QSVGSKGVNVLASKLMLSLFPVNTSFFK 78 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccc--cccHHHHHHHHHHHHHHhhcCCCCcccc Confidence 44443211 000000 0 000111222211 1111 11222 234456777777777642 333 Q ss_pred EEEecCC-CCcccchhhHHHHHHhhhccc-hhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEe Q lcl|NC_021537. 60 IVAHPSA-DEPDEGGESYQTVRDFWYGSD-SRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHV 137 (602) Q Consensus 60 i~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l 137 (602) +...+.. +....+.+.+..++..+..+. ..+..+. ..+++.-+..+..|+.++||+.+++-.+ +.+++|| T Consensus 79 l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~----~snf~~~~~~~~~~L~~~G~a~ly~~~~----~~~~~pl 150 (555) T protein:vir:17 79 LQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIA----ESSDRVHLEMAMKHLIVTGNALLYQGKK----NLKLYPL 150 (555) T ss_pred cccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhHCeEEEEecCC----ceeEEEc Confidence 3322111 000111122222332222211 1122222 3357777778889999999999877443 3567777 Q ss_pred Cccccccccccccccccc--chh----hhhc----------------ccCceeE-EEEcCCcceeecccccccccceeee Q lcl|NC_021537. 138 PAATVRVRKTTTTIERED--GEE----VENI----------------ESGHGYV-QVRQGRRRYFGEAGDRYGDDKRFVD 194 (602) Q Consensus 138 ~p~~v~~~~~~~~~~~~~--~~~----~~~~----------------~~~~~~~-qi~~~~~~~~~~~~~~~~~~~~~~~ 194 (602) ..-.|............. ... ...+ .+..... ..+..+. .........+..+. 
T Consensus 151 ~~y~v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~----~~~~~~~~v~t~~~ 226 (555) T protein:vir:17 151 DRFVVSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRD----KGKSNDALVYTYVC 226 (555) T ss_pred CeEEEeeCCCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccc----cCCCcceeEeeccc Confidence 543333222211110000 000 0000 0000000 0000000 00000000001112 Q ss_pred cccceEE--e-cCcee-------EEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEE Q lcl|NC_021537. 195 KETGEVA--S-DAGEL-------KNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAV 264 (602) Q Consensus 195 ~~~g~~~--~-~~~~~-------~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil 264 (602) ...+.++ . .++.. ..|.....+.+|.....+..||.||...++..+.......+.......-...|..++ T Consensus 227 ~~~~~~~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv 306 (555) T protein:vir:17 227 RKDGQVKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMV 306 (555) T ss_pred ccCCeeEEEEecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceee Confidence 2222111 1 11111 122233567778777778899999999999999998888888887777777787666 Q ss_pred EeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHH Q lcl|NC_021537. 265 KVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIA 343 (602) Q Consensus 265 ~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia 343 (602) . +++........ .++.+ .+..+. 
..++ .|+......|.+ ..+..+.....|- T Consensus 307 ~-~~g~~~~~~l~---------~~~~g----~v~~g~-----------~~~v--~~~~~~~~~~~~~~~~~i~~~~~~I~ 359 (555) T protein:vir:17 307 S-PSATTKPQNLA---------LAANG----AIIQGR-----------PDDV--SVVQANKAADFRTVLEMIQKLEQRIS 359 (555) T ss_pred c-cccccCcceee---------cCCCc----eeecCC-----------cccc--eeeeccccchhhHHHHHHHHHHHHHH Confidence 3 22222221110 01100 011111 1111 122111112222 1233445566777 Q ss_pred HHhcCChHHhhccccCCccCHHHHHH--------------HHHHHHHHHHHHHHHHHHhh-hcCCccc-cccceEEEecc Q lcl|NC_021537. 344 KVHGVPPVLINVTSTSNRANSKEQTR--------------EFAKGIIEPEQAKFSARLYK-IIHQDAL-DVDEWTIDFEL 407 (602) Q Consensus 344 ~~fgVPp~~lg~~~~~~~sn~e~~~~--------------~f~~~~l~P~~~~ie~~ln~-~Ll~~~~-~~~~~~~~f~~ 407 (602) .+|-+ ++. .++..-++++... .+....|.|++.+.-..+.+ .++++.- ...+..+.--+ T Consensus 360 ~aFm~----~~~-~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~~l 434 (555) T protein:vir:17 360 DAFLM----LQV-RQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVAGL 434 (555) T ss_pred HHHhh----cCC-CCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceeehH Confidence 77753 222 2222224433211 12334556666544444433 3444321 11223333333 Q ss_pred chhcchhHHHHHHHHHHHHHHhCCc-------ccHHHH----HHHhCCCCCCCCcccccccccc----ccccccc---cC Q lcl|NC_021537. 408 RGAEQPEQDAKMAEQRVRAMRLAGV-------GTVNEA----REELDLAPFEDDRGDMTLSEFE----AEFGADA---SD 469 (602) Q Consensus 408 ~~~~~~~~d~~~~~~~~~~~~~~G~-------~T~NE~----R~~~Gl~p~~~g~~d~~~~~~~----~~~~~~~---~~ 469 (602) ..+.+. .+......+++.+...+- +..+++ -+.+|.+|..-=..+.-+.... ....++. +. 
T Consensus 435 ~~l~r~-~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa 513 (555) T protein:vir:17 435 WGVGRG-QDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQA 513 (555) T ss_pred HHHHHH-HHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333322 233333345544433221 222222 2334554431000000000000 0000000 00 Q ss_pred CCc-Ccccccccccccccccccccccccccccccchhhhhcchhhhhhhe Q lcl|NC_021537. 470 GDA-EAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGL 518 (602) Q Consensus 470 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~ 518 (602) +.. +....++ . ..........+....+.+..-.|..-.+.. T Consensus 514 ~~~~~~~~~~~----~----~~~~~~~~~~a~~~~~a~~~~~~~~~~~~~ 555 (555) T protein:vir:17 514 GQLAKTPMAEQ----A----MQLIQQQQEGAQDAGAAESETSSAEAQAGA 555 (555) T ss_pred HHHHhhhhhhh----H----HhccccchhhhhHHHHHHhhcCCcccccCC Confidence 000 0000000 0 000000000000111111111111000000 No 239 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=92.17 E-value=0.013 Score=31.01 Aligned_cols=451 Identities=9% Similarity=0.003 Sum_probs=155.0 Q ss_pred CCCCccc-ccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC-------ceEEEEecCCCCcccc Q lcl|NC_021537. 1 MSKAEET-TQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY-------GFEIVAHPSADEPDEG 72 (602) Q Consensus 1 ~~k~~~~-~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~i~~~~~~~~~~~~ 72 (602) +.+=--. ...+.....+.......-++| ++...|++.+|..+.+. .|++...+.. 
..+ T Consensus 30 ~~~~~lP~~~~~~~~~~~~~~~~~~~~~d-----------st~~~a~~~Las~l~~~ltpp~~~WF~l~~~d~~--~~~- 95 (556) T protein:vir:73 30 LSDFINPRGSRFLTSDVNRDDRRNTKIVD-----------PTGSMAQRILSSGMMSGITSPARPWFKLATPDPD--MMD- 95 (556) T ss_pred HHHHhccccCCcCCCCCCcchhhcCcccc-----------chHHHHHHHHHHHHHHhhcCCCCcccccccCccc--ccc- Confidence 0000000 000000000000001112232 23345566666555431 3343322211 111 Q ss_pred hhhHHHHHHhhhccc-hhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccccc Q lcl|NC_021537. 73 GESYQTVRDFWYGSD-SRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTI 151 (602) Q Consensus 73 ~~~~~~~~~~~~~~~-~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~ 151 (602) ...++..+..+. ..+..+. .-+++.-+..+..|+.++||+.+++..+.. +.+++.+++...+-+..+..+. T Consensus 96 ---~~~v~~~L~~ve~~~~~~l~----~snf~~~~~~~~~~L~~~G~a~l~~~~~~~-~~~r~~~~~l~~~~~~~d~~G~ 167 (556) T protein:vir:73 96 ---YGPVKIWLEVVQRRMNEVFN----KSNLYQSLPVMYASLGTFGTGAMAVMEDDQ-DVIRTMPFPIGSYYLANSPRGS 167 (556) T ss_pred ---hHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhhCceeeeeeecCC-ceEEEEEeecceeEEeeCCCCC Confidence 112222222211 1222222 235777778889999999999999877654 4466666666666555544432 Q ss_pred ccccch--------------------hhh-hcccCc--eeEEEEcCCcceeecccccccc--cceeeecccceEEecCc- Q lcl|NC_021537. 152 EREDGE--------------------EVE-NIESGH--GYVQVRQGRRRYFGEAGDRYGD--DKRFVDKETGEVASDAG- 205 (602) Q Consensus 152 ~~~~~~--------------------~~~-~~~~~~--~~~qi~~~~~~~~~~~~~~~~~--~~~~~~~~~g~~~~~~~- 205 (602) ...-.+ .+. ....++ ..+.+..- .+.-..+.+. +.....-..+++..... 
T Consensus 168 vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~----V~pr~~~~~~~~~~~~~p~~s~~~~~~~~~ 243 (556) T protein:vir:73 168 VDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHC----ITPNVNRDSGKMDSKNKPYRSVYFESGGDS 243 (556) T ss_pred eEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEE----EeccccccccccCcccceEEEEEEEecCCC Confidence 111000 000 000110 01111100 0000000000 00000000011111100 Q ss_pred ----eeEEechhHEEEecCCCCCCCccccc-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHH Q lcl|NC_021537. 206 ----ELKNGPANELIFLPNPSPLALYYGVP-DWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLR 280 (602) Q Consensus 206 ----~~~~~~~~eviH~r~~~~~~~~~G~s-pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~ 280 (602) ....|.....+.+|.....+..||.| |...++..+.......+.......-...|...+. ....... T Consensus 244 ~~vl~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~--~~~~~~~------ 315 (556) T protein:vir:73 244 DKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMVAP--TSLKNQR------ 315 (556) T ss_pred ceecccCCcccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCceecc--ccccccc------ Confidence 11123345567778777788899999 8999988888888888877766666667766542 2211100 Q ss_pred HHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHH-hhccccC Q lcl|NC_021537. 281 NLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVL-INVTSTS 359 (602) Q Consensus 281 ~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~ 359 (602) ++- .++|..+.....+. . 
.++|+...++.-....+..+.....|-.+|-.+..+ ++.-+ + T Consensus 316 --~~~------------~pgg~~~~~~~~~~--~--~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~-~ 376 (556) T protein:vir:73 316 --VSL------------LPGDVTYLDVISGQ--D--GFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFMMLQNIN-T 376 (556) T ss_pred --eee------------ccCccccccCCCCc--c--ceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhhhhccCC-C Confidence 011 11111111111000 0 133443232211223355677889999999876432 33322 2 Q ss_pred CccCHHHHH--------------HHHHHHHHHHHHHHHHHHHhh-hcCCccc-cccceEEEeccchhcchhHHHHHHH-- Q lcl|NC_021537. 360 NRANSKEQT--------------REFAKGIIEPEQAKFSARLYK-IIHQDAL-DVDEWTIDFELRGAEQPEQDAKMAE-- 421 (602) Q Consensus 360 ~~sn~e~~~--------------~~f~~~~l~P~~~~ie~~ln~-~Ll~~~~-~~~~~~~~f~~~~~~~~~~d~~~~~-- 421 (602) ..-++++.. ..+....|.|++.+.-..+.+ .+||+.. ...+-.++....+.+....+..... T Consensus 377 ~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i 456 (556) T protein:vir:73 377 RSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSL 456 (556) T ss_pred CCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHH Confidence 222333321 122333455655544444433 2444321 1122233333333332222221111 Q ss_pred ----HHHHHHHhCC-----cccHHHHH----HHhCCCCCCCCccccccccccccccccccCCCcCccccccccccc-ccc Q lcl|NC_021537. 422 ----QRVRAMRLAG-----VGTVNEAR----EELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPP-LEN 487 (602) Q Consensus 422 ----~~~~~~~~~G-----~~T~NE~R----~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 487 (602) +.+..+...+ .+..+++- +.+|.|+- .+.+.-.+.. ..++....+ ... 
T Consensus 457 ~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~------~irs~eev~~------------~rq~r~~~qq~~~ 518 (556) T protein:vir:73 457 SQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPT------VIVPQEQVQG------------IREERAKQAQAAQ 518 (556) T ss_pred HHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChh------hcCCHHHHHH------------HHHHHHHHHHHHH Confidence 1111110000 01122221 12233210 0000000000 000000000 000 Q ss_pred ccc-ccccccccccccchhh-hhcc-hhhhhhheeccccc Q lcl|NC_021537. 488 KIG-ERDSVDVDVSKDPIEQ-TTFS-SSNLDEGLYDFGER 524 (602) Q Consensus 488 ~~~-~~~~~~~~~~~~~m~~-~~v~-ss~~~~~~yd~~~~ 524 (602) +.. ....+...+...+... .+.. .......|- + .+ T Consensus 519 ~~~~~~~a~~~~~~~~~~~~~~~~~l~~~~~~~g~-~-~~ 556 (556) T protein:vir:73 519 AMAMGQAAAQGAKTLSETQTSDPSALTAIANAAGA-P-QQ 556 (556) T ss_pred HHHHHHHHHHHHHHhhhccCCCHHHHHHHHHhhcC-C-CC Confidence 000 0000000000000000 0000 001112221 1 11 No 240 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=91.96 E-value=0.013 Score=30.85 Aligned_cols=427 Identities=12% Similarity=0.047 Sum_probs=158.4 Q ss_pred CCCCcccccccchhhhc-----ccCccccCCCCHHHHHHHHhhhHH---------------HHHHHHHHHHhhccC---c Q lcl|NC_021537. 1 MSKAEETTQLDERHIAT-----DVGRGIQPPYNPETLAAFQELNET---------------HQACIRKKSRYEAGY---G 57 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~-----~~~~~i~p~~~~~~l~~~~~~~~~---------------v~~cI~~ia~~ia~~---~ 57 (602) ++..=+|-.--..+|.+ ..+.++---++.++|..... ||. .-+-|--+=+-|-++ . 
T Consensus 31 ~~~~y~ty~~~~~~f~~gfv~~~~~ng~i~~v~~~~l~~~f~-npd~~~~~i~~l~~y~yi~~~~v~ql~~li~~lp~l~ 109 (525) T protein:vir:10 31 LERQYNTYDDVVDAFIDGFVMDLCNNGKIKTVNLDTLQLWFN-NPDKYINNIVNLLTYYYIIDGNVFQLYDLIFSLPPLD 109 (525) T ss_pred hhhhcchhhhHHHHHHHHHHHHhhcCCceeeeeHHHHHhhhc-ChHHHHHHHHHHHHHhhhhcchHHHHHHHHHhcCCcc Confidence 11111110000011110 01111222233444332221 111 111122222333333 3 Q ss_pred eEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEe Q lcl|NC_021537. 58 FEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHV 137 (602) Q Consensus 58 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l 137 (602) ++|..-.+. ....+....++..++..- -..++-+-++..+... |..+++|.= T Consensus 110 y~i~~~~~~---k~~~~~~s~~n~~l~k~i-------------~hk~ltrdll~q~a~~------------gtlig~wlg 161 (525) T protein:vir:10 110 YQIKVLKRD---KDYKEDLSTINLYLEKKI-------------QHKQLTRDLLVQLAHS------------GTLIGTWLG 161 (525) T ss_pred eeehhhhhc---cchhhHHHHHHHHHHHhH-------------HHHHHHHHHHHHhhcc------------CceeEeeec Confidence 444322111 122333334443333210 0112222222222222 344444432 Q ss_pred CcccccccccccccccccchhhhhcccCceeEEEEcCCcceeeccccc---------ccc----------cceeeecccc Q lcl|NC_021537. 138 PAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDR---------YGD----------DKRFVDKETG 198 (602) Q Consensus 138 ~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~---------~~~----------~~~~~~~~~g 198 (602) +...-- ...+....+...+....| .++-+++- .||.++.+. .|. ++..-++..+ T Consensus 162 ~~~~py----~~vf~~~kyvfp~~r~~g-~~v~vid~--~~f~~~~~~~r~~~~~~lsp~i~~~~y~~~~~~~~~~~~~~ 234 (525) T protein:vir:10 162 SKREPY----FNVFNNLKYVFPYGRAKG-KMVAVIDL--QWFDEMSELERKLTFENLSPLITENKYKKWKEYNGENEDAL 234 (525) T ss_pred CCCCcc----hhhhhhhhhhccccccCC-ceEEEEeh--HHhhhhhHHHHHHHHHhhchhhhhhhhhHHhhcccccchhh Confidence 211100 000000000000001111 11111111 122222111 110 0000111111 Q ss_pred eEEecCceeEEechhHEEEecCCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEecccc-----CC Q lcl|NC_021537. 
199 EVASDAGELKNGPANELIFLPNPSPLAL-YYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGT-----LS 272 (602) Q Consensus 199 ~~~~~~~~~~~~~~~eviH~r~~~~~~~-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-----~~ 272 (602) ....+|.+.++|.|...+... -.|.|....++..|.......+...+....=..|-.+|++.+.. +. T Consensus 235 -------r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l~dI~hk~klrd~EqsIA~kii~a~avLk~gg~~gn~mk~p 307 (525) T protein:vir:10 235 -------RYIMLPISKTLVARIHTLSRNQRLGIPYGTQTLFDIQHKQKLRDLEQSIADKIIKAMAVLKFRGKDDNDSKVK 307 (525) T ss_pred -------eeeecccceeEEeeecccccCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHhhhhheeeeeccccCccccCc Confidence 124688999999998765444 45999888888888887777777666666666777888875432 23 Q ss_pred HHHHHHHHHHHHHhh--cc-cccCcce-eccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcC Q lcl|NC_021537. 273 EDSKEDLRNLMDNLK--GS-RYRTAIL-EVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGV 348 (602) Q Consensus 273 ~~~~~~l~~~~~~~~--g~-~nag~~~-~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgV 348 (602) +..+.++-+..+..- |- ...|.+. .++..+++.--.....+.++ |.+ -.+.....|-.|+|+ T Consensus 308 ~~~kqkil~gVk~aleK~~kdK~Gi~vi~~Pdfa~~efp~ik~~~~gl-----------Dg~---K~d~I~~DI~~A~Gl 373 (525) T protein:vir:10 308 ESAKRKVLAGVKRALEKGVKDKNGIACIAMPDFATFEFPEIKNGDKTL-----------DPK---KYDSIDNDITNATGI 373 (525) T ss_pred hHHHHHHHHHHHHHHhcccccccCeEEEeccceeecccccccCcccCC-----------Cch---hhhhhhhhhhhhhcc Confidence 333333333332211 11 1123222 22222222210000001111 111 123345789999999 Q ss_pred ChHHhhccccCCccCHHHHHHHHHHHHHHHHHHHHHHHHhhh---cCCccccccceEEEeccchhcchhHHHHHHHHHHH Q lcl|NC_021537. 349 PPVLINVTSTSNRANSKEQTREFAKGIIEPEQAKFSARLYKI---IHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVR 425 (602) Q Consensus 349 Pp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~---Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~ 425 (602) +..+++. +++||+++.-.+..||+ -+.-+++.|+++.++. +|+. +....|.+.+|-++.... 
+...+.+- T Consensus 374 S~sL~nG-dggNyAtaslnld~fyk-kigVm~e~Iee~y~kL~d~Vl~~-~k~~nyifnydkd~pi~~----kkk~d~LI 446 (525) T protein:vir:10 374 SQVLTNG-TKGNYASAKLNLDVFYK-KIGVMLEIIEEIYNQLIDIILGE-EKGCNYIFQYNKDTPIER----EKKLDTLI 446 (525) T ss_pred ceeeecC-CCCceeeeeeeHHHHHH-HHHHHHHHHHHHHHHHHhhhcCc-ccCcceEEecCCCchhhh----hhhhhhhh Confidence 9998853 46789988777778876 4667777787655543 3333 333455555555544333 32233444 Q ss_pred HHHhCCcccHHHHHHHhCCCCCCCC-----------cc--ccccccccccccccccCCCcCccccccccccccccccccc Q lcl|NC_021537. 426 AMRLAGVGTVNEAREELDLAPFEDD-----------RG--DMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGER 492 (602) Q Consensus 426 ~~~~~G~~T~NE~R~~~Gl~p~~~g-----------~~--d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (602) ++...|+... -+....|+.--+.- .. .++....++-.|.+...-..+.. .+.+..+.... T Consensus 447 kL~d~g~s~k-~vldl~gis~e~y~E~s~yEtE~lkl~EKi~pp~~~~v~SGk~~n~iG~P~~----dd~~~~dati~-- 519 (525) T protein:vir:10 447 KLEAQGYSAK-YVLDILGISSEEYFEESIYEIEKLKLREKIMPPLNTNVLSGKDGNDIGSPKL----DDSDSSDATIE-- 519 (525) T ss_pred hhhccchhhh-hhhhhhccCcchHHHHHHHHHHHHHHhhhccccccceeeeccccccccCCcc----CCCcchhhhhh-- Confidence 5556666432 12223333211100 00 01111111222211111000000 00011111110 Q ss_pred ccccccccccch Q lcl|NC_021537. 493 DSVDVDVSKDPI 504 (602) Q Consensus 493 ~~~~~~~~~~~m 504 (602) ..+... T Consensus 520 ------s~~~~~ 525 (525) T protein:vir:10 520 ------SKERGV 525 (525) T ss_pred ------hhhcCC Confidence 000000 No 241 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=91.52 E-value=0.015 Score=30.51 Aligned_cols=446 Identities=9% Similarity=-0.020 Sum_probs=157.0 Q ss_pred CCCCcccccccc------------hhh---hcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC------ceE Q lcl|NC_021537. 
1 MSKAEETTQLDE------------RHI---ATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY------GFE 59 (602) Q Consensus 1 ~~k~~~~~~~~~------------~~~---~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~------~~~ 59 (602) ..+..+-++..+ +.| +...++....++| ++...|++.+|..+.+. .|+ T Consensus 18 ~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~d-----------st~~~a~~~Laa~l~~~ltP~~~WF~ 86 (535) T protein:vir:15 18 YDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQ-----------AVGARGLNNLASKLMLALFPMQSWMK 86 (535) T ss_pred HHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccccccccccc-----------ccHHHHHHHHHHHHHHhhcCCCcccc Confidence 000000000000 000 0000111111222 23345666666666542 222 Q ss_pred EEEecCC-CCcccchhhHHHHHHhhhccc-hhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC-ceEEEEE Q lcl|NC_021537. 60 IVAHPSA-DEPDEGGESYQTVRDFWYGSD-SRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG-TPVGLAH 136 (602) Q Consensus 60 i~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G-~~~~L~~ 136 (602) +...+.. +....+......++..+..+. ..+..+. ..+++.-+..+..|+.++||+.+++..+..+ .....+| T Consensus 87 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~----~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~p 162 (535) T protein:vir:15 87 LTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIE----SNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYR 162 (535) T ss_pred cccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEE Confidence 2211100 000001111122222222221 1222222 3457777888899999999999988765433 3455666 Q ss_pred eCcccccccccccccccccc--hhhhhcc-cCceeEE-----EEcCCcceeecccccccccceeeecccce--EE-ecCc Q lcl|NC_021537. 137 VPAATVRVRKTTTTIEREDG--EEVENIE-SGHGYVQ-----VRQGRRRYFGEAGDRYGDDKRFVDKETGE--VA-SDAG 205 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~~~--~~~~~~~-~~~~~~q-----i~~~~~~~~~~~~~~~~~~~~~~~~~~g~--~~-~~~~ 205 (602) |.--.|.............. ....... ....... -.......++. ..+ .+...+. 
++ .-.+ T Consensus 163 l~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~--~v~------~~~~~~~~~~~~e~~g 234 (535) T protein:vir:15 163 LSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYT--HVY------LDEESGDYLKYEEVED 234 (535) T ss_pred cCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEE--EEE------EecCCCcEEEEEEeeC Confidence 65434432222111111000 0000000 0000000 00000000000 000 0111111 00 0011 Q ss_pred ee-------EEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHH Q lcl|NC_021537. 206 EL-------KNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKED 278 (602) Q Consensus 206 ~~-------~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~ 278 (602) .. ..|.....+..|.....+..||.||...++..+.......+.......-...|..++. +++........ T Consensus 235 ~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~-~~g~~~~~~l~- 312 (535) T protein:vir:15 235 VEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVN-PAGITQPRRLT- 312 (535) T ss_pred ccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec-ccccccchhcc- Confidence 11 1122334677787777788999999999999999988888888888777778876653 22222221110 Q ss_pred HHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcccc Q lcl|NC_021537. 279 LRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTST 358 (602) Q Consensus 279 l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 358 (602) .++.+. + +.+ ...++...++...+.-+. ..+..+.....|..+|-+.. +... 
+ T Consensus 313 --------~~~~g~---~-v~g-----------~~~~v~~~~~~~~~~~~~-~~~~i~~~~~~I~~af~~~~--~~~~-~ 365 (535) T protein:vir:15 313 --------KAQTGD---F-VPG-----------RREDIDFLQLEKQADFTV-AKAVSDQIEARLSYAFMLNS--AVQR-T 365 (535) T ss_pred --------cCCcee---e-ecC-----------CcccceeeecccccchhH-HHHHHHHHHHHHHHHHhhhh--cccC-C Confidence 111111 0 000 111222222222211111 23445667788888885441 2111 2 Q ss_pred CCccCHHHHHH--------------HHHHHHHHHHHHHHHHHHhh-hcCCccccccceEEEeccchhcchh--HHHHHHH Q lcl|NC_021537. 359 SNRANSKEQTR--------------EFAKGIIEPEQAKFSARLYK-IIHQDALDVDEWTIDFELRGAEQPE--QDAKMAE 421 (602) Q Consensus 359 ~~~sn~e~~~~--------------~f~~~~l~P~~~~ie~~ln~-~Ll~~~~~~~~~~~~f~~~~~~~~~--~d~~~~~ 421 (602) +..-++++... .+....|.|++.+.-..+.+ .++++.-. ..+.++|. +.+.... .+..... T Consensus 366 ~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~-~~v~~~yi-s~La~aqr~~~~~~l~ 443 (535) T protein:vir:15 366 GERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPK-EAVEPTIS-TGLEAIGRGQDLDKLE 443 (535) T ss_pred CccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCc-cceeEEEe-cHHHHHHHHHHHHHHH Confidence 22234443211 23344566666655444433 35554322 23556653 2221111 1222222 Q ss_pred HHHHHHHhCCcccHH---------HH----HHHhCCCCCCCCccccccccccccccccccCCCcCccccccccccccccc Q lcl|NC_021537. 422 QRVRAMRLAGVGTVN---------EA----REELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENK 488 (602) Q Consensus 422 ~~~~~~~~~G~~T~N---------E~----R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (602) ++++.+.. +.|. ++ .+.+|.|+.. ......+.+.-- .+........+.-.+ T Consensus 444 ~~~~~la~---~~P~~ld~~id~d~~~~~~a~~~Gvp~~~-----------i~~~~eev~~~~--~q~~~~~~~~~~a~~ 507 (535) T protein:vir:15 444 RCISAWAA---LAPMQGDPDINLAVIKLRIANAIGIDTSG-----------ILLTDEQKQALM--MQDAAQTGIENAAAT 507 (535) T ss_pred HHHHHHHh---cChhhhhccCCHHHHHHHHHHHcCCChhh-----------hcCCHHHHHHHH--HHHHHHHHHHHHHHH Confidence 22222111 1111 11 1112222110 000000000000 000000000000000 Q ss_pred ccccccccccccccchhhhhcchhhhhhheeccc Q lcl|NC_021537. 
489 IGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFG 522 (602) Q Consensus 489 ~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~ 522 (602) ......+. ....+.. -...+..+|-+.. T Consensus 508 ~g~~~~~~--~~~~p~~----~~~~~~~~g~~~~ 535 (535) T protein:vir:15 508 GGAGVGAL--ATSSPEA----MQGAAAQAGLDAT 535 (535) T ss_pred HHhhccch--hccChHH----HHHHHhccCCCCC Confidence 00000000 0111111 1234456665544 No 242 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=91.23 E-value=0.017 Score=30.31 Aligned_cols=433 Identities=10% Similarity=0.050 Sum_probs=169.7 Q ss_pred CCCCccccc----ccchh--------hhcccCccccCCCCH-----------HHHHHHHhhhHHHHHHHHHHHHhhccC- Q lcl|NC_021537. 1 MSKAEETTQ----LDERH--------IATDVGRGIQPPYNP-----------ETLAAFQELNETHQACIRKKSRYEAGY- 56 (602) Q Consensus 1 ~~k~~~~~~----~~~~~--------~~~~~~~~i~p~~~~-----------~~l~~~~~~~~~v~~cI~~ia~~ia~~- 56 (602) +++...|-. -++.. ..+.+|+.....++. ...|.++ .+|-|..||+-|.+.+.-. T Consensus 23 ~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d 101 (521) T protein:vir:65 23 IKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGLM-NNHEVENAVQNIVNDAIVFE 101 (521) T ss_pred hccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHHh-hccchhhHHHHhhcceeEec Confidence 111111111 01100 011123222222221 2334554 4788899999988877632 Q ss_pred ----ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeC--CCCc Q lcl|NC_021537. 57 ----GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVE--GDGT 130 (602) Q Consensus 57 ----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~--~~G~ 130 (602) |..|... ..+.+....+.+....... ..++ +-..+..+++ +.+.+.|..|+.++-+ .... 
T Consensus 102 ~~~~pV~l~L~----~~~~s~~iK~kI~eeF~~I---l~ll---~F~~~~~~~f----R~WYVDgRi~fhkiid~~pk~G 167 (521) T protein:vir:65 102 EGHEVVSLNLE----ATGFSESVKERIHEEFKDL---LNTI---QFDRRGQDMF----RRWYVDSRIFFHKIIGKNPKDG 167 (521) T ss_pred CCCceEEEEec----ccccchHHHHHHHHHHHHH---HHHh---ccchhhhHHH----hhhhhcceeEEEEEEcCCcccc Confidence 3333221 1222222222333222211 1121 1223344444 4566779999999955 3356 Q ss_pred eEEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEe Q lcl|NC_021537. 131 PVGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNG 210 (602) Q Consensus 131 ~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 210 (602) +.+|.+|||..|+........... +. ..+...+..|+|..+.... ...|... ..+....+ T Consensus 168 I~ELr~lDPr~i~~vr~i~k~~~~----------~~--~v~~~~~e~f~Y~~~~~~~-------~~~g~~~-~~~~~vkI 227 (521) T protein:vir:65 168 IVELRQLDPRNLEYVREIITEDTP----------EG--KIYKATKEYFIYTVGNSSY-------CAGGQVF-SPNSRVKI 227 (521) T ss_pred ceeeeeeCCcceeeeeeecccccC----------Cc--ceecceeeeeeeecCCcce-------eccceee-cCCcceee Confidence 899999999999854432221111 10 0011112222222211110 0111111 12234566 Q ss_pred chhHEEEec-CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCHHHHHHHHHHHHHhhc Q lcl|NC_021537. 211 PANELIFLP-NPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSEDSKEDLRNLMDNLKG 288 (602) Q Consensus 211 ~~~eviH~r-~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~~~~~~l~~~~~~~~g 288 (602) +.+-|.+.. +.-+.++-.=+|-|..|.+.+.....++...--|=-.-+.-+-|..+. |......+.+-+++.+..++. 
T Consensus 228 ~~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kN 307 (521) T protein:vir:65 228 PRSAITYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKN 307 (521) T ss_pred chhheeeeeccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCc Confidence 666655543 122223323357778887777766666665544322333333444443 223333444445555544432 Q ss_pred ----c------cccCcceeccCCccceeccccccccccccccccc-cchHHHHHHHHHHhhHHHHHHHhcCChHHhhccc Q lcl|NC_021537. 289 ----S------RYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGA-REDLDMEFQAFRERNEHEIAKVHGVPPVLINVTS 357 (602) Q Consensus 289 ----~------~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~-~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~ 357 (602) . .+..+.+..-+. -++.--....+.+++.|.- .+.-+| +=.++..+...++++||.+.++..+ T Consensus 308 klvYDa~TGev~ddrk~msMlED---yWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~ 381 (521) T protein:vir:65 308 RVVYDASTGKLKNQQANLSMTED---YWLQRRDGKAITDVTTLPGASGMSDI---DDIRYFNRKLYEALRVPLSRSNLSD 381 (521) T ss_pred eeEeecccccccccccccchhhh---hcccccCCCCccceeecccCCCcChH---HHHHHHHHHHHHHhCCCceeccCCC Confidence 0 111111111000 0000000011222322221 122233 3345677899999999999886544 Q ss_pred cCCc----cC-HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCc----c--------ccccceEEEeccchhcchhHHHHHH Q lcl|NC_021537. 358 TSNR----AN-SKEQTREFAKGIIEPEQAKFSARLYKIIHQD----A--------LDVDEWTIDFELRGAEQPEQDAKMA 420 (602) Q Consensus 358 ~~~~----sn-~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~----~--------~~~~~~~~~f~~~~~~~~~~d~~~~ 420 (602) ++++ ++ +.-.-.-| ...|.-+...|...|...|-.+ . ......++.|..+.-.....+.+.. 
T Consensus 382 ~~~~~~gr~~EItRDEiKF-~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil 460 (521) T protein:vir:65 382 ANMVIGGDGSEITRDELEF-SKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEIL 460 (521) T ss_pred CcceeccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHH Confidence 4332 21 22222233 4456667766665554433221 1 1112245555555433333344443 Q ss_pred HHHHHHHHh-----CCcccHHHHH-HHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccc Q lcl|NC_021537. 421 EQRVRAMRL-----AGVGTVNEAR-EELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKI 489 (602) Q Consensus 421 ~~~~~~~~~-----~G~~T~NE~R-~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (602) .+++..+-. .-.++.+=++ ..|.+.-.+- . ............+-...+.. ..... T Consensus 461 ~~R~~~l~~~dpyvGky~S~dyi~k~ILr~tDeei---~----~~~k~I~~E~~~~~~~~p~~-------~~~~f 521 (521) T protein:vir:65 461 ERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQM---D----TEKKQIEEEANDPRFKQTPD-------EIEDF 521 (521) T ss_pred HHHHHHHHHhhhhhccccchHHHHHHHhccCHHHH---H----HHHHHHHHhhhCCCCCCCcc-------cccCC Confidence 333332211 1112333232 2333311000 0 00000000000000000000 00001 No 243 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=90.64 E-value=0.02 Score=29.92 Aligned_cols=457 Identities=12% Similarity=0.036 Sum_probs=156.4 Q ss_pred CCCCcccccccchhhhcc--cCc-cccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC-------ceEEEEecCCCCcc Q lcl|NC_021537. 1 MSKAEETTQLDERHIATD--VGR-GIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY-------GFEIVAHPSADEPD 70 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~--~~~-~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~i~~~~~~~~~~ 70 (602) +.+---+. ..+-+..+ .++ ...-++| ++...|++.+|..+.+. .|++...+. ... 
T Consensus 31 ~~~~~lP~--~~~~~~~~~~~~~~~~~~~~d-----------st~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~--~l~ 95 (555) T protein:vir:98 31 ISDYLLPR--AGRFFVQDRNRGEKRHNNILD-----------NTGTRALRVLAAGMMAGMTSPARPWFRLTTSIP--ELD 95 (555) T ss_pred HHHHhCcc--cccccCCCCCcchhccccccc-----------ccHHHHHHHHHHHHHHhhcCCCCcccccccCcc--ccc Confidence 00000000 00000000 000 0111222 23345666666665532 333332211 111 Q ss_pred cchhhHHHHHHhhhcc-chhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccc Q lcl|NC_021537. 71 EGGESYQTVRDFWYGS-DSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 71 ~~~~~~~~~~~~~~~~-~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~ 149 (602) +. ..++..+..+ ...+..+. ..+++.-+..+..|+.++||+.+++..+.. ..+.+..+|...+-+..+.. T Consensus 96 e~----~~v~~~L~~ve~~~~~~l~----~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~-~~~rf~~~pl~~~~v~~d~~ 166 (555) T protein:vir:98 96 ES----AAVKAWLANVTRLMLMIFA----KSNTYRALHSMYEELGAFGTASSIVLPDFD-AVVYHHSLTAGEYAIAADNQ 166 (555) T ss_pred ch----HHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhhCceEEEEecCCC-ceEEEEEeecceeEEeeCCC Confidence 11 1222222211 11222332 245677777788999999999999887754 45566666666665555444 Q ss_pred ccccccch--------------------hhh---hcccCceeEEEEcCCcceeeccccccccc--ceeeecccceEEec- Q lcl|NC_021537. 150 TIEREDGE--------------------EVE---NIESGHGYVQVRQGRRRYFGEAGDRYGDD--KRFVDKETGEVASD- 203 (602) Q Consensus 150 ~~~~~~~~--------------------~~~---~~~~~~~~~qi~~~~~~~~~~~~~~~~~~--~~~~~~~~g~~~~~- 203 (602) +....-.+ .+. .......++.|+.- .+.-.++.+.. .....-..+++... 
T Consensus 167 G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~~~~~~~p~~s~~~~~~~ 242 (555) T protein:vir:98 167 GRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDPSKRDDRNMAWKSVYFEPGA 242 (555) T ss_pred CCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCcCCCCccccceEEEEEEecc Confidence 42211000 000 00000011111000 00000000000 00000000111100 Q ss_pred Cc----eeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHH Q lcl|NC_021537. 204 AG----ELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDL 279 (602) Q Consensus 204 ~~----~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l 279 (602) ++ ....|.....+.+|.....+..||.||...++..+.......+.......-...|...+.- +..... T Consensus 243 d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~-~~~~~~------ 315 (555) T protein:vir:98 243 DETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPV-SAKNQD------ 315 (555) T ss_pred CCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecc-cccccc------ Confidence 01 1112344556777777777889999999999999988888888777666666666665432 111110 Q ss_pred HHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhcccc Q lcl|NC_021537. 280 RNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTST 358 (602) Q Consensus 280 ~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 358 (602) .+-. ++|..+...+... . .+.|+-.... 
|.+ ..+..+.....|-.+|-.+..+.....+ T Consensus 316 ---~~~~------------pgg~~~v~~g~~~--d--~~~~~~~~~~-d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~ 375 (555) T protein:vir:98 316 ---ISTV------------PGGLSYVDAAAPN--G--GIRTAFEVNL-DLSHLLADIVDVRERIKASFYADLFLMLANGT 375 (555) T ss_pred ---ceec------------cccccccccCCCC--c--ceeccccccc-chHHHHHHHHHHHHHHHHHhhcchhhhccCCC Confidence 0111 1111111110000 0 0111111111 222 2345677888999999776444322223 Q ss_pred CCccCHHHHHH--------------HHHHHHHHHHHHHHHHHHhh-hcCCcc-ccccceEEEeccchhcchhHHHHHHHH Q lcl|NC_021537. 359 SNRANSKEQTR--------------EFAKGIIEPEQAKFSARLYK-IIHQDA-LDVDEWTIDFELRGAEQPEQDAKMAEQ 422 (602) Q Consensus 359 ~~~sn~e~~~~--------------~f~~~~l~P~~~~ie~~ln~-~Ll~~~-~~~~~~~~~f~~~~~~~~~~d~~~~~~ 422 (602) +..-++++.+. .+....|.|++.+.-..+.+ .++|+. ....+..++......+....+.... . T Consensus 376 ~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~-~ 454 (555) T protein:vir:98 376 NPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIAT-N 454 (555) T ss_pred CCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHH-H Confidence 33334443211 22234456665544444433 244432 1122333443333333222221111 1 Q ss_pred HHHHHHhCCcccHHHHHHHhCCCCC--CCCccccccccccccccccccCCCcCcccc---ccccc--ccccccccccccc Q lcl|NC_021537. 423 RVRAMRLAGVGTVNEAREELDLAPF--EDDRGDMTLSEFEAEFGADASDGDAEAMLT---RSKAA--PPLENKIGERDSV 495 (602) Q Consensus 423 ~~~~~~~~G~~T~NE~R~~~Gl~p~--~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~--~~~~~~~~~~~~~ 495 (602) .+.. ..+-+-...+..|- +--+.+..+...-...|.+..---..+... .+..+ +....+....+.. 
T Consensus 455 ~i~~-------~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~ 527 (555) T protein:vir:98 455 SVDR-------FVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGA 527 (555) T ss_pred HHHH-------HHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111 01111111222220 000001110000000111100000000000 00000 0000000000000 Q ss_pred ---cccccccchhhhhcchhhhhhheec Q lcl|NC_021537. 496 ---DVDVSKDPIEQTTFSSSNLDEGLYD 520 (602) Q Consensus 496 ---~~~~~~~~m~~~~v~ss~~~~~~yd 520 (602) ++-...+...+.....---.-.||- T Consensus 528 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:98 528 DTAAKLGSVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHhcccccCcchhHHHHHhhhccCC Confidence 0000000000100011111223443 No 244 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=90.64 E-value=0.02 Score=29.92 Aligned_cols=457 Identities=12% Similarity=0.036 Sum_probs=156.4 Q ss_pred CCCCcccccccchhhhcc--cCc-cccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC-------ceEEEEecCCCCcc Q lcl|NC_021537. 1 MSKAEETTQLDERHIATD--VGR-GIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY-------GFEIVAHPSADEPD 70 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~--~~~-~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~i~~~~~~~~~~ 70 (602) +.+---+. ..+-+..+ .++ ...-++| ++...|++.+|..+.+. .|++...+. ... T Consensus 31 ~~~~~lP~--~~~~~~~~~~~~~~~~~~~~d-----------st~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~--~l~ 95 (555) T protein:vir:10 31 ISDYLLPR--AGRFFVQDRNRGEKRHNNILD-----------NTGTRALRVLAAGMMAGMTSPARPWFRLTTSIP--ELD 95 (555) T ss_pred HHHHhCcc--cccccCCCCCcchhccccccc-----------ccHHHHHHHHHHHHHHhhcCCCCcccccccCcc--ccc Confidence 00000000 00000000 000 0111222 23345666666665532 333332211 111 Q ss_pred cchhhHHHHHHhhhcc-chhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccc Q lcl|NC_021537. 
71 EGGESYQTVRDFWYGS-DSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 71 ~~~~~~~~~~~~~~~~-~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~ 149 (602) +. ..++..+..+ ...+..+. ..+++.-+..+..|+.++||+.+++..+.. ..+.+..+|...+-+..+.. T Consensus 96 e~----~~v~~~L~~ve~~~~~~l~----~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~-~~~rf~~~pl~~~~v~~d~~ 166 (555) T protein:vir:10 96 ES----AAVKAWLANVTRLMLMIFA----KSNTYRALHSMYEELGAFGTASSIVLPDFD-AVVYHHSLTAGEYAIAADNQ 166 (555) T ss_pred ch----HHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhhCceEEEEecCCC-ceEEEEEeecceeEEeeCCC Confidence 11 1222222211 11222332 245677777788999999999999887754 45566666666665555444 Q ss_pred ccccccch--------------------hhh---hcccCceeEEEEcCCcceeeccccccccc--ceeeecccceEEec- Q lcl|NC_021537. 150 TIEREDGE--------------------EVE---NIESGHGYVQVRQGRRRYFGEAGDRYGDD--KRFVDKETGEVASD- 203 (602) Q Consensus 150 ~~~~~~~~--------------------~~~---~~~~~~~~~qi~~~~~~~~~~~~~~~~~~--~~~~~~~~g~~~~~- 203 (602) +....-.+ .+. .......++.|+.- .+.-.++.+.. .....-..+++... T Consensus 167 G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~~~~~~~p~~s~~~~~~~ 242 (555) T protein:vir:10 167 GRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDPSKRDDRNMAWKSVYFEPGA 242 (555) T ss_pred CCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCcCCCCccccceEEEEEEecc Confidence 42211000 000 00000011111000 00000000000 00000000111100 Q ss_pred Cc----eeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHH Q lcl|NC_021537. 204 AG----ELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDL 279 (602) Q Consensus 204 ~~----~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l 279 (602) ++ ....|.....+.+|.....+..||.||...++..+.......+.......-...|...+.- +..... 
T Consensus 243 d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~-~~~~~~------ 315 (555) T protein:vir:10 243 DETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPV-SAKNQD------ 315 (555) T ss_pred CCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecc-cccccc------ Confidence 01 1112344556777777777889999999999999988888888777666666666665432 111110 Q ss_pred HHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhcccc Q lcl|NC_021537. 280 RNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTST 358 (602) Q Consensus 280 ~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 358 (602) .+-. ++|..+...+... . .+.|+-.... |.+ ..+..+.....|-.+|-.+..+.....+ T Consensus 316 ---~~~~------------pgg~~~v~~g~~~--d--~~~~~~~~~~-d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~ 375 (555) T protein:vir:10 316 ---ISTV------------PGGLSYVDAAAPN--G--GIRTAFEVNL-DLSHLLADIVDVRERIKASFYADLFLMLANGT 375 (555) T ss_pred ---ceec------------cccccccccCCCC--c--ceeccccccc-chHHHHHHHHHHHHHHHHHhhcchhhhccCCC Confidence 0111 1111111110000 0 0111111111 222 2345677888999999776444322223 Q ss_pred CCccCHHHHHH--------------HHHHHHHHHHHHHHHHHHhh-hcCCcc-ccccceEEEeccchhcchhHHHHHHHH Q lcl|NC_021537. 359 SNRANSKEQTR--------------EFAKGIIEPEQAKFSARLYK-IIHQDA-LDVDEWTIDFELRGAEQPEQDAKMAEQ 422 (602) Q Consensus 359 ~~~sn~e~~~~--------------~f~~~~l~P~~~~ie~~ln~-~Ll~~~-~~~~~~~~~f~~~~~~~~~~d~~~~~~ 422 (602) +..-++++.+. .+....|.|++.+.-..+.+ .++|+. ....+..++......+....+.... . 
T Consensus 376 ~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~-~ 454 (555) T protein:vir:10 376 NPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIAT-N 454 (555) T ss_pred CCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHH-H Confidence 33334443211 22234456665544444433 244432 1122333443333333222221111 1 Q ss_pred HHHHHHhCCcccHHHHHHHhCCCCC--CCCccccccccccccccccccCCCcCcccc---ccccc--ccccccccccccc Q lcl|NC_021537. 423 RVRAMRLAGVGTVNEAREELDLAPF--EDDRGDMTLSEFEAEFGADASDGDAEAMLT---RSKAA--PPLENKIGERDSV 495 (602) Q Consensus 423 ~~~~~~~~G~~T~NE~R~~~Gl~p~--~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~--~~~~~~~~~~~~~ 495 (602) .+.. ..+-+-...+..|- +--+.+..+...-...|.+..---..+... .+..+ +....+....+.. T Consensus 455 ~i~~-------~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~ 527 (555) T protein:vir:10 455 SVDR-------FVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGA 527 (555) T ss_pred HHHH-------HHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111 01111111222220 000001110000000111100000000000 00000 0000000000000 Q ss_pred ---cccccccchhhhhcchhhhhhheec Q lcl|NC_021537. 496 ---DVDVSKDPIEQTTFSSSNLDEGLYD 520 (602) Q Consensus 496 ---~~~~~~~~m~~~~v~ss~~~~~~yd 520 (602) ++-...+...+.....---.-.||- T Consensus 528 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 528 DTAAKLGSVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHhcccccCcchhHHHHHhhhccCC Confidence 0000000000100011111223443 No 245 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=90.64 E-value=0.02 Score=29.92 Aligned_cols=457 Identities=12% Similarity=0.036 Sum_probs=156.4 Q ss_pred CCCCcccccccchhhhcc--cCc-cccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC-------ceEEEEecCCCCcc Q lcl|NC_021537. 
1 MSKAEETTQLDERHIATD--VGR-GIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY-------GFEIVAHPSADEPD 70 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~--~~~-~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~i~~~~~~~~~~ 70 (602) +.+---+. ..+-+..+ .++ ...-++| ++...|++.+|..+.+. .|++...+. ... T Consensus 31 ~~~~~lP~--~~~~~~~~~~~~~~~~~~~~d-----------st~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~--~l~ 95 (555) T protein:vir:10 31 ISDYLLPR--AGRFFVQDRNRGEKRHNNILD-----------NTGTRALRVLAAGMMAGMTSPARPWFRLTTSIP--ELD 95 (555) T ss_pred HHHHhCcc--cccccCCCCCcchhccccccc-----------ccHHHHHHHHHHHHHHhhcCCCCcccccccCcc--ccc Confidence 00000000 00000000 000 0111222 23345666666665532 333332211 111 Q ss_pred cchhhHHHHHHhhhcc-chhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccccc Q lcl|NC_021537. 71 EGGESYQTVRDFWYGS-DSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTT 149 (602) Q Consensus 71 ~~~~~~~~~~~~~~~~-~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~ 149 (602) +. ..++..+..+ ...+..+. ..+++.-+..+..|+.++||+.+++..+.. ..+.+..+|...+-+..+.. T Consensus 96 e~----~~v~~~L~~ve~~~~~~l~----~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~-~~~rf~~~pl~~~~v~~d~~ 166 (555) T protein:vir:10 96 ES----AAVKAWLANVTRLMLMIFA----KSNTYRALHSMYEELGAFGTASSIVLPDFD-AVVYHHSLTAGEYAIAADNQ 166 (555) T ss_pred ch----HHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhhCceEEEEecCCC-ceEEEEEeecceeEEeeCCC Confidence 11 1222222211 11222332 245677777788999999999999887754 45566666666665555444 Q ss_pred ccccccch--------------------hhh---hcccCceeEEEEcCCcceeeccccccccc--ceeeecccceEEec- Q lcl|NC_021537. 150 TIEREDGE--------------------EVE---NIESGHGYVQVRQGRRRYFGEAGDRYGDD--KRFVDKETGEVASD- 203 (602) Q Consensus 150 ~~~~~~~~--------------------~~~---~~~~~~~~~qi~~~~~~~~~~~~~~~~~~--~~~~~~~~g~~~~~- 203 (602) +....-.+ .+. .......++.|+.- .+.-.++.+.. .....-..+++... 
T Consensus 167 G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~~~~~~~p~~s~~~~~~~ 242 (555) T protein:vir:10 167 GRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDPSKRDDRNMAWKSVYFEPGA 242 (555) T ss_pred CCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCcCCCCccccceEEEEEEecc Confidence 42211000 000 00000011111000 00000000000 00000000111100 Q ss_pred Cc----eeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHH Q lcl|NC_021537. 204 AG----ELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDL 279 (602) Q Consensus 204 ~~----~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l 279 (602) ++ ....|.....+.+|.....+..||.||...++..+.......+.......-...|...+.- +..... T Consensus 243 d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~-~~~~~~------ 315 (555) T protein:vir:10 243 DETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPV-SAKNQD------ 315 (555) T ss_pred CCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecc-cccccc------ Confidence 01 1112344556777777777889999999999999988888888777666666666665432 111110 Q ss_pred HHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhcccc Q lcl|NC_021537. 280 RNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTST 358 (602) Q Consensus 280 ~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 358 (602) .+-. ++|..+...+... . .+.|+-.... 
|.+ ..+..+.....|-.+|-.+..+.....+ T Consensus 316 ---~~~~------------pgg~~~v~~g~~~--d--~~~~~~~~~~-d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~ 375 (555) T protein:vir:10 316 ---ISTV------------PGGLSYVDAAAPN--G--GIRTAFEVNL-DLSHLLADIVDVRERIKASFYADLFLMLANGT 375 (555) T ss_pred ---ceec------------cccccccccCCCC--c--ceeccccccc-chHHHHHHHHHHHHHHHHHhhcchhhhccCCC Confidence 0111 1111111110000 0 0111111111 222 2345677888999999776444322223 Q ss_pred CCccCHHHHHH--------------HHHHHHHHHHHHHHHHHHhh-hcCCcc-ccccceEEEeccchhcchhHHHHHHHH Q lcl|NC_021537. 359 SNRANSKEQTR--------------EFAKGIIEPEQAKFSARLYK-IIHQDA-LDVDEWTIDFELRGAEQPEQDAKMAEQ 422 (602) Q Consensus 359 ~~~sn~e~~~~--------------~f~~~~l~P~~~~ie~~ln~-~Ll~~~-~~~~~~~~~f~~~~~~~~~~d~~~~~~ 422 (602) +..-++++.+. .+....|.|++.+.-..+.+ .++|+. ....+..++......+....+.... . T Consensus 376 ~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~-~ 454 (555) T protein:vir:10 376 NPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIAT-N 454 (555) T ss_pred CCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHH-H Confidence 33334443211 22234456665544444433 244432 1122333443333333222221111 1 Q ss_pred HHHHHHhCCcccHHHHHHHhCCCCC--CCCccccccccccccccccccCCCcCcccc---ccccc--ccccccccccccc Q lcl|NC_021537. 423 RVRAMRLAGVGTVNEAREELDLAPF--EDDRGDMTLSEFEAEFGADASDGDAEAMLT---RSKAA--PPLENKIGERDSV 495 (602) Q Consensus 423 ~~~~~~~~G~~T~NE~R~~~Gl~p~--~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~--~~~~~~~~~~~~~ 495 (602) .+.. ..+-+-...+..|- +--+.+..+...-...|.+..---..+... .+..+ +....+....+.. 
T Consensus 455 ~i~~-------~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~ 527 (555) T protein:vir:10 455 SVDR-------FVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGA 527 (555) T ss_pred HHHH-------HHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111 01111111222220 000001110000000111100000000000 00000 0000000000000 Q ss_pred ---cccccccchhhhhcchhhhhhheec Q lcl|NC_021537. 496 ---DVDVSKDPIEQTTFSSSNLDEGLYD 520 (602) Q Consensus 496 ---~~~~~~~~m~~~~v~ss~~~~~~yd 520 (602) ++-...+...+.....---.-.||- T Consensus 528 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 528 DTAAKLGSVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHhcccccCcchhHHHHHhhhccCC Confidence 0000000000100011111223443 No 246 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=90.56 E-value=0.02 Score=29.88 Aligned_cols=455 Identities=9% Similarity=-0.010 Sum_probs=159.7 Q ss_pred CCCCcccccccc------------hhh---hcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC-----c-eE Q lcl|NC_021537. 1 MSKAEETTQLDE------------RHI---ATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY-----G-FE 59 (602) Q Consensus 1 ~~k~~~~~~~~~------------~~~---~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-----~-~~ 59 (602) ..+..+-++..+ +.| +...++....++| ++...|++.+|..+.+. + |+ T Consensus 18 ~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~d-----------st~~~a~~~Laa~l~~~ltP~~~WF~ 86 (535) T protein:vir:33 18 YDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQ-----------AVGARGLNNLASKLMLALFPMQSWMK 86 (535) T ss_pred HHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccccccccccc-----------ccHHHHHHHHHHHHHHhhcCCCcccc Confidence 000000000000 000 0000111111222 23345666666666542 2 22 Q ss_pred EEEecCC-CCcccchhhHHHHHHhhhccc-hhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC-ceEEEEE Q lcl|NC_021537. 
60 IVAHPSA-DEPDEGGESYQTVRDFWYGSD-SRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG-TPVGLAH 136 (602) Q Consensus 60 i~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G-~~~~L~~ 136 (602) +...+.. +...........++..+..+. ..+..+. ..+++.-+..+..|+.++||+.+++..+..+ .....+| T Consensus 87 l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~----~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~p 162 (535) T protein:vir:33 87 LTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIE----SNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYR 162 (535) T ss_pred cccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEE Confidence 2111100 000001111112222222211 1122222 3457777788899999999999998765432 3445566 Q ss_pred eCccccccccccccc--ccccc--hhhhhcc-cCceeEE---EE--cCCcceeecccccccccceeeecccceEEe---c Q lcl|NC_021537. 137 VPAATVRVRKTTTTI--EREDG--EEVENIE-SGHGYVQ---VR--QGRRRYFGEAGDRYGDDKRFVDKETGEVAS---D 203 (602) Q Consensus 137 l~p~~v~~~~~~~~~--~~~~~--~~~~~~~-~~~~~~q---i~--~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~---~ 203 (602) |..-.|. .+..+. ..... ....... ....... .. ......++. . .+.+...+.+.+ - T Consensus 163 l~~~~v~--~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~--~------v~~~~~~~~~~~~~~~ 232 (535) T protein:vir:33 163 LSSYVVQ--RDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYT--H------VYLDEESGDYLKYEEV 232 (535) T ss_pred cCeeEEe--eCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEE--E------EEeeCCCCcEEEEEEE Confidence 5443333 332221 10000 0000000 0000000 00 000000000 0 000111111110 0 Q ss_pred Ccee-------EEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHH Q lcl|NC_021537. 204 AGEL-------KNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSK 276 (602) Q Consensus 204 ~~~~-------~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~ 276 (602) .+.. ..|.....+..|.....+..||.||...++..+.......+.......-...|..++. +++....... 
T Consensus 233 ~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~-~~g~~~~~~~ 311 (535) T protein:vir:33 233 EDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVN-PAGITQPRRL 311 (535) T ss_pred eCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec-cccccchhhc Confidence 1111 1133344677787777788999999999999999988888888888777778876653 2322222111 Q ss_pred HHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhcc Q lcl|NC_021537. 277 EDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVT 356 (602) Q Consensus 277 ~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~ 356 (602) . .++.+. + +.+ ...++...++...+.-+. ..+..+.....|..+|-+.. +... T Consensus 312 ~---------~~~~g~---~-v~g-----------~~~~v~~~~~~~~~~~~~-~~~~i~~~~~~I~~af~~~~--~~~~ 364 (535) T protein:vir:33 312 T---------KAQTGD---F-VPG-----------RREDIDFLQLEKQADFTV-AKAVSDQIEARLSYAFMLNS--AVQR 364 (535) T ss_pred c---------cCCcee---e-ecC-----------CcccceeeecccccchhH-HHHHHHHHHHHHHHHHhhhh--cccC Confidence 0 111111 0 000 111222222222211111 23445667788888885441 2111 Q ss_pred ccCCccCHHHHHH--------------HHHHHHHHHHHHHHHHHHhh-hcCCccccccceEEEeccchhcchh--HHHHH Q lcl|NC_021537. 357 STSNRANSKEQTR--------------EFAKGIIEPEQAKFSARLYK-IIHQDALDVDEWTIDFELRGAEQPE--QDAKM 419 (602) Q Consensus 357 ~~~~~sn~e~~~~--------------~f~~~~l~P~~~~ie~~ln~-~Ll~~~~~~~~~~~~f~~~~~~~~~--~d~~~ 419 (602) ++..-++++... .+....|.|++.+.-..+.+ .++++.-. ..+.++|. +.+.... .+... 
T Consensus 365 -~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~-~~v~~~yi-s~La~aqr~~~~~~ 441 (535) T protein:vir:33 365 -TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPK-EAVEPTIS-TGLEAIGRGQDLDK 441 (535) T ss_pred -CCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCc-cceeEEEe-cHHHHHHHHHHHHH Confidence 222234443211 23344566666655444433 35554322 24556653 2222111 11121 Q ss_pred HHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCC-cCcccccccccccccc-cccccccccc Q lcl|NC_021537. 420 AEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGD-AEAMLTRSKAAPPLEN-KIGERDSVDV 497 (602) Q Consensus 420 ~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~~~~~~~~~ 497 (602) ..++++.+.. +.|..+-..++ .+..+...-...|.++..-- ..++............ .........- T Consensus 442 l~~~~~~la~---~~P~~~d~~id--------~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~ 510 (535) T protein:vir:33 442 LERCISAWAA---LAPMQGDPDIN--------LAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVENAAAAGGA 510 (535) T ss_pred HHHHHHHHHh---hChhhhhccCC--------HHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 2222222111 11110000011 01111000000111100000 0000000000000000 0000000000 Q ss_pred cccccchhhhhcchhhhhhheeccc Q lcl|NC_021537. 498 DVSKDPIEQTTFSSSNLDEGLYDFG 522 (602) Q Consensus 498 ~~~~~~m~~~~v~ss~~~~~~yd~~ 522 (602) .....++...+.-....+.+|-|.. T Consensus 511 ~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 511 GVGALATSSPEAMQGAAAKAGLNAT 535 (535) T ss_pred hhcchhhcCChhHHHHHHhccCCCC Confidence 0111122222333455667777644 No 247 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=90.54 E-value=0.02 Score=29.86 Aligned_cols=450 Identities=11% Similarity=-0.046 Sum_probs=148.9 Q ss_pred CCCCccccc------ccchhhhcccCccccCC---CC----HHHHHHHHhhhHHHHHHHHHHHHhhccC-------ceEE Q lcl|NC_021537. 
1 MSKAEETTQ------LDERHIATDVGRGIQPP---YN----PETLAAFQELNETHQACIRKKSRYEAGY-------GFEI 60 (602) Q Consensus 1 ~~k~~~~~~------~~~~~~~~~~~~~i~p~---~~----~~~l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~i 60 (602) |++.....= ..+.+ ..+...++-|- .+ -..+.+.. +++...|++.+|.-+.+. .|++ T Consensus 1 mk~~~~~~~~~lkr~~~e~~-w~e~a~~tlP~~~~~~~~~~~~~~~~~~--dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) T protein:vir:78 1 MKSTAAMLWEKLRDGSVEQR-AIEFAKTTLPYLMVDPMSGSRGVVEHDF--QSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) T ss_pred ChhHHHHHHHHHhccchHHH-HHHHHHhhccccccCCCCcccccccCcc--cchHHHHHHHHHHHHHHhhcCCCCccccc Confidence 333221100 00000 00001111110 00 01111122 234456777777777642 2333 Q ss_pred EEecCCCC-cccchhhHHHHHHhhhccc-hhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeC Q lcl|NC_021537. 61 VAHPSADE-PDEGGESYQTVRDFWYGSD-SRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVP 138 (602) Q Consensus 61 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~ 138 (602) ...+.... ..........++..+..+- ..+..+. ..+++.-+..+..|+.++||+.+++.. +|.....+||. T Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~----~snf~~~~~~~~~~L~~~G~a~l~~~~--~~~~~~~~pl~ 151 (510) T protein:vir:78 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLF----QNASLAVLTQVIKLLIVTGNALLYRNS--DEATVVAWSLR 151 (510) T ss_pred CCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhhCeEEEEEeC--CCCeEEEEEcc Confidence 22111100 0001111222333322221 1122222 235677777888899999999887653 45567788885 Q ss_pred cccccccccccccccc--cchhh----hhcccC--ceeEEEEcCCcceeecccccccccceeeecccceEEecC---cee Q lcl|NC_021537. 139 AATVRVRKTTTTIERE--DGEEV----ENIESG--HGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDA---GEL 207 (602) Q Consensus 139 p~~v~~~~~~~~~~~~--~~~~~----~~~~~~--~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~---~~~ 207 (602) .-.|............ ..... ..+..- .....-.......++. ..++.+-........++.-++ +.. 
T Consensus 152 ~y~v~~d~~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~--~V~~~~~~~~~~~sv~~e~dg~~i~~~ 229 (510) T protein:vir:78 152 SYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYT--HVQRRKGTAMDYAEMYHEIDGVRVGET 229 (510) T ss_pred eeEEeeCCCcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEE--EEEeecCCCCcEEEEEEEecCeeeccc Confidence 5444333222111100 00000 000000 0000000000000100 001100000000000111111 111 Q ss_pred EEec--hhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHH Q lcl|NC_021537. 208 KNGP--ANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDN 285 (602) Q Consensus 208 ~~~~--~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~ 285 (602) ..++ ....+-+|.....+..||.||...++..+.......+.....-.....|..++. +++...... T Consensus 230 ~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~-p~g~~~~~~---------- 298 (510) T protein:vir:78 230 GRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD-EAKGAVVDD---------- 298 (510) T ss_pred cccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccC-Cccccchhh---------- Confidence 1221 123455666666778999999999999998888888777766555566654443 333222221 Q ss_pred hhcccccCcceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhccccCCccCH Q lcl|NC_021537. 286 LKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTSTSNRANS 364 (602) Q Consensus 286 ~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~ 364 (602) +..+.+ |. ++.+..+ ++....+. ...|.+ ..+..+.....|..+|=+. +.. 
-++..-|+ T Consensus 299 l~~~~~-g~--~v~g~~~-----------~v~~~~~~--~~~d~~~~~~~i~~~~~rI~~aF~~~---l~~-~~~~rvTA 358 (510) T protein:vir:78 299 YQDAEM-GD--YVPGGAE-----------AVRAYERG--DYNKMAAIQQSLQAVVVRLNQAFMYG---ANQ-RDAERVTA 358 (510) T ss_pred hccCCC-ce--eecCCcc-----------cccccccC--cccchHHHHHHHHHHHHHHHHHHhhc---ccc-CCCCCcCH Confidence 111111 11 1111111 11111111 111222 1244566777888887332 111 12222355 Q ss_pred HHHHH--------------HHHHHHHHHHHHHHHHHHhh-hcCCccc-cccceEEEeccchhcchhHHHHHHHHHHHHHH Q lcl|NC_021537. 365 KEQTR--------------EFAKGIIEPEQAKFSARLYK-IIHQDAL-DVDEWTIDFELRGAEQPEQDAKMAEQRVRAMR 428 (602) Q Consensus 365 e~~~~--------------~f~~~~l~P~~~~ie~~ln~-~Ll~~~~-~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~ 428 (602) ++.+. .+....|.|++.+.-..+.+ .+++... ......+++ .+.+- ..++......+.+.+- T Consensus 359 tEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~-is~La-raq~~~~l~~~~q~l~ 436 (510) T protein:vir:78 359 EEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETG-LPALS-RSAAVQSMLNASQVIA 436 (510) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccceeeec-ccHHH-HHHHHHHHHHHHHHHH Confidence 43221 22233456666554444433 3332222 112222222 22222 2223322222222222 Q ss_pred hCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcC-cccc---cc----ccccccccccccccccccccc Q lcl|NC_021537. 429 LAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAE-AMLT---RS----KAAPPLENKIGERDSVDVDVS 500 (602) Q Consensus 429 ~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~-~~~~---~~----~~~~~~~~~~~~~~~~~~~~~ 500 (602) ..| .+.++...++. |..+...-...|.++..--.. +... .+ ....+..+.-.....+++... T Consensus 437 ~~~--~~~q~~~~id~--------d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~~~~~ 506 (510) T protein:vir:78 437 GLA--PIAQLDPRISL--------PKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNA 506 (510) T ss_pred Hhc--ChhhhhhcCCH--------HHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 111 11121111110 111000000011111000000 0000 00 000000000001111111112 Q ss_pred ccch Q lcl|NC_021537. 
501 KDPI 504 (602) Q Consensus 501 ~~~m 504 (602) ...| T Consensus 507 ~~g~ 510 (510) T protein:vir:78 507 LAGV 510 (510) T ss_pred CCCC Confidence 2222 No 248 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=90.43 E-value=0.021 Score=29.79 Aligned_cols=448 Identities=10% Similarity=-0.062 Sum_probs=149.9 Q ss_pred CCCCccc------ccccchhhhcccCccccCC-----C--CHHHHHHHHhhhHHHHHHHHHHHHhhccC-------ceEE Q lcl|NC_021537. 1 MSKAEET------TQLDERHIATDVGRGIQPP-----Y--NPETLAAFQELNETHQACIRKKSRYEAGY-------GFEI 60 (602) Q Consensus 1 ~~k~~~~------~~~~~~~~~~~~~~~i~p~-----~--~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~i 60 (602) |++.... .+..+.. ..+...++-|- - +-..+.+.. +++...|++.+|.-+.+. .|++ T Consensus 1 mk~~~~~~~~~lkR~~~e~~-w~e~a~~tlP~~~~~~~~~~~~~~~~~~--dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) T protein:vir:63 1 MKTTAAMLWEKLRDGSVEQR-AIEFAKTTLPYLMVDPMSGSRGVVEHDF--QSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) T ss_pred ChhHHHHHHHHHhccchHHH-HHHHHHhhccccCCCCCCccccccCCCc--cchHHHHHHHHHHHHHhhhcCCCCccccc Confidence 2221110 0000000 00011111110 0 001111222 344557777777777642 2333 Q ss_pred EEecCCCCc-ccchhhHHHHHHhhhccch-hhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeC Q lcl|NC_021537. 61 VAHPSADEP-DEGGESYQTVRDFWYGSDS-RWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVP 138 (602) Q Consensus 61 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~ 138 (602) ...+..... ..+......++..+..+-. .+..+. ..+++.-+..+..|+.++||+.+++. .+|.....+||. 
T Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~----~snf~~~~~~~~~~Li~~G~a~l~~~--~~~~~~~~~pl~ 151 (510) T protein:vir:63 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLF----QNASLAVLTQVIKLLIVTGNALLYRD--SDAATVVAWSLR 151 (510) T ss_pred CCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhhCeEEEEEc--CCCcEEEEEEcc Confidence 222111000 0011112223332222211 122222 24567777788889999999988764 456667777775 Q ss_pred ccccccccccccccc--ccchhhhhcc-cCceeE---EEEc--CCcceeecccccccccceeeecccceEEec-Ccee-- Q lcl|NC_021537. 139 AATVRVRKTTTTIER--EDGEEVENIE-SGHGYV---QVRQ--GRRRYFGEAGDRYGDDKRFVDKETGEVASD-AGEL-- 207 (602) Q Consensus 139 p~~v~~~~~~~~~~~--~~~~~~~~~~-~~~~~~---qi~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~-~~~~-- 207 (602) .-.|........... .....+.... +..... .... .....++.. .++.+.. +.....++.. +|.. T Consensus 152 ~y~v~~d~~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~--V~~~~~~--~~~~~sv~~e~dg~~~~ 227 (510) T protein:vir:63 152 SYAVRRDATGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTH--VQRKKGT--AMEYAELYHEIDGVRVG 227 (510) T ss_pred eeEEeeCCCcCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEE--EEeecCC--CceEEEEEEEecCceec Confidence 444433222211111 0000000000 000000 0000 000000000 0000000 0000111111 1111 Q ss_pred --EE--echhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHH Q lcl|NC_021537. 208 --KN--GPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLM 283 (602) Q Consensus 208 --~~--~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~ 283 (602) .. +.....+-+|.....+..||.||...++..+.......+.....-.....|..++. +++...... 
T Consensus 228 ~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~-p~g~~~~~~-------- 298 (510) T protein:vir:63 228 KEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD-EAKGAVVDD-------- 298 (510) T ss_pred cccccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccC-cccccchhh-------- Confidence 11 12233555666666778999999999999999988888877776666666665543 333222221 Q ss_pred HHhhcccccCcceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhccccCCcc Q lcl|NC_021537. 284 DNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTSTSNRA 362 (602) Q Consensus 284 ~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s 362 (602) +..+.+ |.+ +.+. ..++....+. ...|.+ ..+..+.....|-.+|=+. +.. -++..- T Consensus 299 --~~~~~~-g~~--v~g~-----------~~~v~~~~~~--~~~d~~~~~~~i~~~~~rI~~af~~~---l~~-~~~~rv 356 (510) T protein:vir:63 299 --YQDAEM-GDY--VPGG-----------AEAVRAYERG--DYNKMAAIQQSLQAVVVRLNQAFMYG---ANQ-RDAERV 356 (510) T ss_pred --hccCCC-cee--ecCC-----------cccceeeecC--cccchHHHHHHHHHHHHHHHHHHHhh---ccc-CCCCCc Confidence 111111 111 1111 1111111111 111222 1345566777888888322 111 122223 Q ss_pred CHHHHHH--------------HHHHHHHHHHHHHHHHHHhh-hcCCcccc-ccceEEEeccchhcchhHHHHHHHHHHHH Q lcl|NC_021537. 363 NSKEQTR--------------EFAKGIIEPEQAKFSARLYK-IIHQDALD-VDEWTIDFELRGAEQPEQDAKMAEQRVRA 426 (602) Q Consensus 363 n~e~~~~--------------~f~~~~l~P~~~~ie~~ln~-~Ll~~~~~-~~~~~~~f~~~~~~~~~~d~~~~~~~~~~ 426 (602) |+++.+. .+....|.|++.+.-..+.+ .|++.... .....++ ..+.+...++......+.+. 
T Consensus 357 TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~--~is~Laraq~~~~l~~~~q~ 434 (510) T protein:vir:63 357 TAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIET--GLPALSRSAAVQSMLNASQV 434 (510) T ss_pred CHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCchhcccceec--chhHHHHHHHHHHHHHHHHH Confidence 5543222 12233456666554333333 33322221 1111122 12222222232222222222 Q ss_pred HHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCcccc--------ccccccccccccccccccccc Q lcl|NC_021537. 427 MRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLT--------RSKAAPPLENKIGERDSVDVD 498 (602) Q Consensus 427 ~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~ 498 (602) +-..|- +.++-.. ++ .|..+...-...|.++..--....+- .+........+.......+.. T Consensus 435 l~~~~~--~aq~~~~-----id---~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~~~~a~~~~ 504 (510) T protein:vir:63 435 IAGLAP--IAQLDPR-----IS---LPKMMDTIWAAFSVDTSQFYKSADELQAEAEQQRQQAAQAQAAQETLLEGASDMT 504 (510) T ss_pred HHHhcC--chhhhcc-----CC---HHHHHHHHHHHhCCChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 211111 1111110 10 01111000001111111000000000 000000000011111122222 Q ss_pred ccccch Q lcl|NC_021537. 499 VSKDPI 504 (602) Q Consensus 499 ~~~~~m 504 (602) .....| T Consensus 505 ~~~~g~ 510 (510) T protein:vir:63 505 NALAGV 510 (510) T ss_pred ccccCC Confidence 222222 No 249 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=85.46 E-value=0.053 Score=27.59 Aligned_cols=456 Identities=9% Similarity=-0.027 Sum_probs=141.5 Q ss_pred CCCCcccccccch---hhh-cccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC-------ceEEEEecCCC-C Q lcl|NC_021537. 1 MSKAEETTQLDER---HIA-TDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY-------GFEIVAHPSAD-E 68 (602) Q Consensus 1 ~~k~~~~~~~~~~---~~~-~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~i~~~~~~~-~ 68 (602) ..+..+.++..+. .++ +.....+.+.-+-....++. 
.++...|++.+|..+.+. .|++...+... . T Consensus 20 ~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~--dstg~~a~~~LAa~l~~~ltpp~~~WF~L~~~~~~~~~ 97 (516) T protein:vir:96 20 WEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGW--QGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAQGEKV 97 (516) T ss_pred HHHHHHHhhHHHHHHHHHHHhhcccccCCCCCccccCCcc--cchHHHHHHHHHHHHHhhhcCCCCcccccccChhHHhh Confidence 1100100000000 000 00000000000001111122 234456777777776642 33332221100 0 Q ss_pred cccchhhHHHHHHhhhccch-hhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccc Q lcl|NC_021537. 69 PDEGGESYQTVRDFWYGSDS-RWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKT 147 (602) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~-~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~ 147 (602) -+........++..+..+.. .+..+. ..+++.-+..+..|+.++|||.+++- .++ ....+||..-.|..... T Consensus 98 ~~~~~~~~~~v~~~L~~ve~~~~~~l~----~snf~~~~~~~~~~L~~~G~a~l~~d--~~~-~~~~~pl~~y~v~~d~~ 170 (516) T protein:vir:96 98 LNQRGLKKTELATIFAQVETRAMKELE----QRQFRPAVVEAFKHLIVAGSCMLYKP--SKG-AISAIPMHHYVVNRDTN 170 (516) T ss_pred ccccCchhHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhHCeEeEEec--CCC-CEEEEEcCeEEEeeCCC Confidence 01111112223332222211 122222 23567777788889999999998774 333 36677875544433332 Q ss_pred ccccccccchhhhhcccCcee----------EEEEcCCcceeecccccccccceeeecccceEEecCcee----EE--ec Q lcl|NC_021537. 148 TTTIEREDGEEVENIESGHGY----------VQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGEL----KN--GP 211 (602) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~----------~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~----~~--~~ 211 (602) ...........+...--+..| ..........++....+.++... ..+...++.. .. |. 
T Consensus 171 G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~------~~~~~~d~~~~~~es~~~~~ 244 (516) T protein:vir:96 171 GDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFW------ELKQSADDIPVGKVSKIKSE 244 (516) T ss_pred CCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCcee------EEEEEeCceeeccccccccc Confidence 211111000000000000000 00000000111111111111100 0111111111 11 12 Q ss_pred hhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccc Q lcl|NC_021537. 212 ANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRY 291 (602) Q Consensus 212 ~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~n 291 (602) ....+-+|.....+..||.||...++..+.......+.......-...|..++. +++..... ++..+.+ T Consensus 245 e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~~~----------~l~~~~~ 313 (516) T protein:vir:96 245 KLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIR-PGAQTDVD----------HFVNSGT 313 (516) T ss_pred cCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccC-cccccchh----------hhccCCC Confidence 234466676666778999999999999998888888877766666666655443 22222221 1111111 Q ss_pred cCcceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHH Q lcl|NC_021537. 292 RTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTRE 370 (602) Q Consensus 292 ag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~ 370 (602) |. +..+.. ..+....+. +..|.+ ..+..+.....|..+|-+.. +..- ++..=++++.... 
T Consensus 314 -g~--i~~g~~-----------~~v~~~q~~--~~~d~~~~~~~i~~~~~rI~~af~~~~--l~~r-~~~rvTAtEV~~r 374 (516) T protein:vir:96 314 -GE--VVTGVE-----------EDIHIVQLG--KYADLTPISAVLEVYTRRIGVVFMMET--MTRR-DAERVTAVEIQRD 374 (516) T ss_pred -ce--eecCCc-----------ccceeeecC--cccchhHHHHHHHHHHHHHHHHHhhhh--hccC-CCccccHHHHHHH Confidence 11 111111 111111111 111222 22455667788888886543 2211 2222244432211 Q ss_pred --HHHHHHHHHHHHHHHHHhhhcC-----CccccccceEEEeccchhcchh---HHHHHHHHHHHHHHhCCcccH-HHHH Q lcl|NC_021537. 371 --FAKGIIEPEQAKFSARLYKIIH-----QDALDVDEWTIDFELRGAEQPE---QDAKMAEQRVRAMRLAGVGTV-NEAR 439 (602) Q Consensus 371 --f~~~~l~P~~~~ie~~ln~~Ll-----~~~~~~~~~~~~f~~~~~~~~~---~d~~~~~~~~~~~~~~G~~T~-NE~R 439 (602) =....|.|.+..+.++|=.-|+ .-....+.-.++-.....+... .+......+++. +. .++-. -++. T Consensus 375 ~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~p~lp~~~v~~~~vs~l~~l~r~~~~~~i~~~~~~-i~-~~~~~~p~v~ 452 (516) T protein:vir:96 375 ALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGESFTSDLVDPVIITGIEALGRMAELDKLANFAQY-MS-LPLQWPEPVL 452 (516) T ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCCCCccccccceeechHHHHHHHHHHHHHHHHHHH-HH-HHhcCChhHH Confidence 1223455555554444311111 0000011111111111111111 111111111111 10 00000 1111 Q ss_pred HHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccccccccccccccchhhhhcchh Q lcl|NC_021537. 440 EELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSS 512 (602) Q Consensus 440 ~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss 512 (602) ..++. +..+...-...|.+..---.++...... 
....+.+..+..........+.+....-..+ T Consensus 453 d~id~--------d~~~~~~a~~~Gvp~~~irs~eev~~~~-~~~~~~q~~~~~a~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:96 453 AAVKW--------PDYMDWVRGQISAELPFLKSAEEMAQEQ-EAQMQAQQAQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred hcCCH--------HHHHHHHHHHhCCCccccCCHHHHHHHH-HHHHHHHHHHHHHHHhhhhhhHHhhcccccC Confidence 11111 1110000000011000000000000000 0000000000000000000000000000000 No 250 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=82.79 E-value=0.074 Score=26.79 Aligned_cols=432 Identities=10% Similarity=0.046 Sum_probs=162.5 Q ss_pred CC-CCcccc---cccchh------hhcccCccc------cCCCC-----HHHHHHHHhhhHHHHHHHHHHHHhhccC--- Q lcl|NC_021537. 1 MS-KAEETT---QLDERH------IATDVGRGI------QPPYN-----PETLAAFQELNETHQACIRKKSRYEAGY--- 56 (602) Q Consensus 1 ~~-k~~~~~---~~~~~~------~~~~~~~~i------~p~~~-----~~~l~~~~~~~~~v~~cI~~ia~~ia~~--- 56 (602) ++ |+.... .-++.. -+..+++.+ ++.+. +...|.++. +|-|..||+-|.+.+.-. T Consensus 22 ~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~-~pEvd~Av~eIvneaiv~d~~ 100 (516) T protein:vir:10 22 LKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLTN-NPEVERAVANIVNEAVVYEKG 100 (516) T ss_pred hcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhhh-ccchhHHHHHhhcceeEecCC Confidence 21 111100 001100 011122222 22221 233455554 688888999888877632 Q ss_pred --ceEEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeC-CCCceEE Q lcl|NC_021537. 57 --GFEIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVE-GDGTPVG 133 (602) Q Consensus 57 --~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~-~~G~~~~ 133 (602) |..|. -+..+.+....+.+....... 
..++ +-..+..++++ .|.+.|..|..++-+ .+..+.+ T Consensus 101 ~~pV~l~----l~~~e~s~sik~kI~eeF~~I---l~ll---~F~~~~~~~fR----~WYVDgRi~fhKiid~~k~GI~e 166 (516) T protein:vir:10 101 HKVVSLD----LDDTEFSSSIKDKILEEFDEI---CRLL---DASRKLDTLFR----RWYIDSRIFFHKIMPNPKEGIVE 166 (516) T ss_pred CceEEEE----ecccccchHHHHHHHHHHHHH---HHHh---ccchhhhHHHH----hhhhcceEEEEEEecCcccceee Confidence 33332 111222222223333322221 1111 12234445544 566779999996655 3556899 Q ss_pred EEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEecCceeEEechh Q lcl|NC_021537. 134 LAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELKNGPAN 213 (602) Q Consensus 134 L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 213 (602) |.+|||..|+........ ...+..+... -..|+....+...+. ..|..+. -+....++.+ T Consensus 167 lr~lDPr~i~~vR~i~~~-~~~~~~v~~~--~~e~~~Y~~~~~~~~----------------~~g~~~~-~~~~ikI~~d 226 (516) T protein:vir:10 167 LRRLDPRHVEYYREIVTS-DVGGTSVVKG--YREFFVYTTGNEGYA----------------YNGRLFE-PNTRIKIPRS 226 (516) T ss_pred eeeeCCcceeeEEeeecc-cCcchhhhhc--eeeeeeeecCcccee----------------ccccccC-CCCceecchh Confidence 999999999854322111 1111111100 001111111111110 0111111 1223456666 Q ss_pred HEEEec-CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEec-cccCCHHHHHHHHHHHHHhhcc-- Q lcl|NC_021537. 214 ELIFLP-NPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVT-GGTLSEDSKEDLRNLMDNLKGS-- 289 (602) Q Consensus 214 eviH~r-~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~~~~~~l~~~~~~~~g~-- 289 (602) -|.+.. +.-+.++-.=+|-|..|.+.+.....++...--|=-.-+.-+-|..+. |......+.+-+++.+..++.. 
T Consensus 227 aI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklv 306 (516) T protein:vir:10 227 AIVYAHSGLQDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVV 306 (516) T ss_pred heeeeecCcccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeE Confidence 555443 111122112267777777777666666555443322223333343332 2222333333344444333210 Q ss_pred --cccC------cceeccCCccceecccccccccccccccccc-chHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCC Q lcl|NC_021537. 290 --RYRT------AILEVEEFVDDHGLGDGGSDVNIELEPIGAR-EDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSN 360 (602) Q Consensus 290 --~nag------~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~-~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~ 360 (602) .+.| +.+..-+. -++.--.-..+.+++.|.-. +.-+| +=.++..+...++++||.+.|+..++.+ T Consensus 307 YDa~TGev~ddrk~msMlED---yWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~SRl~~e~~~~ 380 (516) T protein:vir:10 307 YDSNTGTVKNQKRNLSMTED---YWLMRRDGKSVTEVTSLPGAQTMGEM---DDVRWFNKKLYEALRIPLSRMPRDDGGM 380 (516) T ss_pred EeCCCCeeccchhhhhhHhh---hcccccCCCcccceeeccccCCcChH---HHHHHHHHHHHHHhCCCcccccCCCCce Confidence 0011 11100000 00000000011222222211 11223 3345677899999999999997544433 Q ss_pred c--c-CHHH--HHHHHHHHHHHHHHHHHHHHHh----hhcCCc-----cc---cccceEEEeccchhcchhHHHHHHHHH Q lcl|NC_021537. 361 R--A-NSKE--QTREFAKGIIEPEQAKFSARLY----KIIHQD-----AL---DVDEWTIDFELRGAEQPEQDAKMAEQR 423 (602) Q Consensus 361 ~--s-n~e~--~~~~f~~~~l~P~~~~ie~~ln----~~Ll~~-----~~---~~~~~~~~f~~~~~~~~~~d~~~~~~~ 423 (602) . + +.|- --.-| ...|.-+..+|...|. ..|+.. 
.+ .....++.|..+.-.....+.+...++ T Consensus 381 ~~~Gr~~EItRDEiKF-~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R 459 (516) T protein:vir:10 381 VIGGQDMAITRDELDF-RKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQR 459 (516) T ss_pred eeccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHH Confidence 2 1 2221 11233 4456666665554443 322221 11 112345555555433333344443333 Q ss_pred HHHHH-----hCCcccHHHHHH-HhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccc Q lcl|NC_021537. 424 VRAMR-----LAGVGTVNEARE-ELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKI 489 (602) Q Consensus 424 ~~~~~-----~~G~~T~NE~R~-~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (602) +..+- -..+++.+=+++ .|.+.-.+-...+ .........+-- .+ |..+... T Consensus 460 l~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~-------k~I~~E~~~~~~----~~----p~~e~~f 516 (516) T protein:vir:10 460 VDALSQIEPYVGKYVSHDYVMKNILQMTDEQIAQEE-------KQIEKEANVKRF----QN----PENEDDF 516 (516) T ss_pred HHHHHHhhhhhccccchHHHHHHHhcCCHhHHHHHH-------HHHHHhhhCCCC----CC----CCccccC Confidence 33221 123444444433 3444211100000 000000000000 00 0000111 No 251 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=82.52 E-value=0.076 Score=26.71 Aligned_cols=452 Identities=9% Similarity=0.026 Sum_probs=141.5 Q ss_pred CCCCcccccccc---hhhh-cccCccccCCC-CHHHHHHHHhhhHHHHHHHHHHHHhhccC-------ceEEEEecCC-C Q lcl|NC_021537. 1 MSKAEETTQLDE---RHIA-TDVGRGIQPPY-NPETLAAFQELNETHQACIRKKSRYEAGY-------GFEIVAHPSA-D 67 (602) Q Consensus 1 ~~k~~~~~~~~~---~~~~-~~~~~~i~p~~-~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~i~~~~~~-~ 67 (602) ..+..+.++..+ +.++ +.... .-|+- +-....+.. +++...|++.+|..+.+. .|++...+.. . 
T Consensus 16 ~~~Lk~~R~~~e~~w~e~~~~~lP~-~~~~~~~~~~~~~~~--dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~~~~l~ 92 (517) T protein:vir:10 16 YEQLVGKRSPFLSRAENYSRFTLPY-LMADVNDDLSSQNAW--QDDGASATNFLSNKLSQVLFPAQRSFFRIDLTPEGIK 92 (517) T ss_pred HHHHHHhhhHHHHHHHHHHHHhccc-cccCCCCCccccccc--cchHHHHHHHHHHHHHHhhcCCCCccccccCCHHHHH Confidence 000000000000 0000 00000 00000 000001111 234456777777766532 3333221110 0 Q ss_pred CcccchhhHHHHHHhhhccc-hhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccc Q lcl|NC_021537. 68 EPDEGGESYQTVRDFWYGSD-SRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRK 146 (602) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~-~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~ 146 (602) ....+.+....++..+..+- ..+..+ ...+++.-+..+..|+.++|||.+++ +..+..+..+||..-.|.... T Consensus 93 ~~~~~~~~~~~v~~~L~~ve~~~~~~l----~~snf~~~~~~~~~~L~~~G~a~ly~--~~~~~~~~~~pl~~y~v~~d~ 166 (517) T protein:vir:10 93 QLDNEAMTQSTAQKLLSDVEKAAMLYG----ESLQFRPAVVEAFKHLIVTGNVMMYH--PDKTSPIQAVPLHHYCVRRDN 166 (517) T ss_pred hhccCcchHHHHHHHHHHHHHHHHHHH----HhcCcHHHHHHHHHHHHhHCeEEEEE--eCCCCcEEEEEcCeEEEeeCC Confidence 00111112222333222221 111222 23467777888889999999998764 334456778888554443332 Q ss_pred cccccccc--cchhh----hhcccC----ceeEEEEcCCcceeecccccccccceee-ecccceEEecCceeEEechhHE Q lcl|NC_021537. 147 TTTTIERE--DGEEV----ENIESG----HGYVQVRQGRRRYFGEAGDRYGDDKRFV-DKETGEVASDAGELKNGPANEL 215 (602) Q Consensus 147 ~~~~~~~~--~~~~~----~~~~~~----~~~~qi~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~~~~~~~~~~~~~~ev 215 (602) ........ ..... ..+... .............++......++....+ ....+......++ ..+..... 
T Consensus 167 ~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~d~~~~~~~s~-y~~~e~P~ 245 (517) T protein:vir:10 167 NGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDGKYLIRQSADDVPVGKEST-VTEDKSPF 245 (517) T ss_pred CcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCCCceEEEEEeCceeeccccc-cccccCCe Confidence 22111110 00000 000000 0000000000011111000000000000 0011111100111 01223345 Q ss_pred EEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcc Q lcl|NC_021537. 216 IFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAI 295 (602) Q Consensus 216 iH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~ 295 (602) +-+|.....+..||.||...++..+.......+.....-.-...|..++. +++..... ++..+.+ |. T Consensus 246 ~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~-~~~~~~~~----------~l~~~~~-g~- 312 (517) T protein:vir:10 246 LILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVK-PGSYTDIN----------QFVEGGS-GA- 312 (517) T ss_pred eeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccC-cccccchh----------hccCCCc-cc- Confidence 66676666778999999999999998888887777776666666666543 22222211 1111111 00 Q ss_pred eeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHH--HHH Q lcl|NC_021537. 296 LEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTR--EFA 372 (602) Q Consensus 296 ~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~--~f~ 372 (602) +..+. ..++...++.. ..|.+ ..+..+.....|-.+|-+.. +...+ +..-++++.+. .=. 
T Consensus 313 -~~~g~-----------~~~v~~~~~~~--~~d~~~~~~~i~~~~~rI~~af~~~~--l~~~~-~~rvTAtEV~~r~~E~ 375 (517) T protein:vir:10 313 -VLHGV-----------EGDIHIVQLGK--YADYTPIQAVLNDYRQRIGRVFMMEA--MTRRD-AERVTAYEIQRDAMLV 375 (517) T ss_pred -cccCC-----------cccceeeeccc--ccchhHHHHHHHHHHHHHHHHHhhhh--hhccC-CccccHHHHHHHHHHH Confidence 01100 01111111111 11222 23455667788889986653 22111 22224443221 112 Q ss_pred HHHHHHHHHHHHHHH-----hhhc--CCccccccceEEEec--cchhcchhHHHHHHHHHHHHHHhCC--------cccH Q lcl|NC_021537. 373 KGIIEPEQAKFSARL-----YKII--HQDALDVDEWTIDFE--LRGAEQPEQDAKMAEQRVRAMRLAG--------VGTV 435 (602) Q Consensus 373 ~~~l~P~~~~ie~~l-----n~~L--l~~~~~~~~~~~~f~--~~~~~~~~~d~~~~~~~~~~~~~~G--------~~T~ 435 (602) ...|.|.+..+.++| .+.+ +..........+++. +..+.+ ..+......+++..-... .+.. T Consensus 376 ~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~~~~v~~~~~s~la~l~r-~~~~~~i~~~~~~i~~~a~~~~~~~~~id~ 454 (517) T protein:vir:10 376 EQSLGGVYSLFATTFQGPLARWFMNGISSILTSKNVSPTILTGIEALGR-MAELDKLGTFNGYVSMTAQWPEPLQQAIKW 454 (517) T ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcCCCCccceeeccHHHHHH-HHHHHHHHHHHHHHHHhhcCChHHHhcCCH Confidence 234555555543331 1111 111111112222221 111111 112222222211110000 0011 Q ss_pred HHH----HHHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccccccccccccccchhhhhcch Q lcl|NC_021537. 436 NEA----REELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSS 511 (602) Q Consensus 436 NE~----R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~s 511 (602) +++ .+.+|.|+- .+.+. +.......+....+. .......-.......+.+ T Consensus 455 d~~~~~~a~~~Gvp~~------~irs~---------------~ev~~~~~~~~~~~~-----~~~~~~~ag~~~~~~~~~ 508 (517) T protein:vir:10 455 PDFTDWVQGQISANFP------FFKTQ---------------DELNAEAQAQQEQEA-----TKYAAEQAGKAIPDMVKN 508 (517) T ss_pred HHHHHHHHHHhCCChh------hcCCH---------------HHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHhC Confidence 111 112222210 00000 000000000000000 000000000000000000 Q ss_pred hhh-hhheeccccc Q lcl|NC_021537. 
512 SNL-DEGLYDFGER 524 (602) Q Consensus 512 s~~-~~~~yd~~~~ 524 (602) -.. -.+| + T Consensus 509 ~~~~~~~~-----~ 517 (517) T protein:vir:10 509 GQINPQGG-----Q 517 (517) T ss_pred CCCCCCCC-----C Confidence 000 0000 0 No 252 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=80.36 E-value=0.096 Score=26.17 Aligned_cols=452 Identities=9% Similarity=-0.041 Sum_probs=149.3 Q ss_pred CCCCcccccccchhhh-----cccCccccCCC---------CHHHHHHHHhhhHHHHHHHHHHHHhhccC-------ceE Q lcl|NC_021537. 1 MSKAEETTQLDERHIA-----TDVGRGIQPPY---------NPETLAAFQELNETHQACIRKKSRYEAGY-------GFE 59 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~-----~~~~~~i~p~~---------~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~ 59 (602) |+-.+.=.++..++-. .+...++-|.. .-..+.++. +++...|++.+|..+.+. .|+ T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~--dstg~~a~~~LAa~l~~~ltpp~~~WF~ 78 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPW--QSVGAKCCVTLAAKLMLAVLPPQTSFFK 78 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccc--cchHHHHHHHHHHHHHHhhcCCCCcccc Confidence 3322222222221100 01111122210 001112222 244556777777776642 233 Q ss_pred EEEecCCCCcccchhhHHHHHHhhhccc-hhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeC Q lcl|NC_021537. 60 IVAHPSADEPDEGGESYQTVRDFWYGSD-SRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVP 138 (602) Q Consensus 60 i~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~ 138 (602) +...+.........+....++..+..+. ..+..+. ..+++.-+..+..|+.++|||.+++..+. ...+||. 
T Consensus 79 l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~----~snf~~~~~~~~~~L~~~G~a~ly~~~~~----~~~~pl~ 150 (522) T protein:vir:10 79 LQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIA----ASNDRVAVHQALKHLIVGGNALIFMGKDG----LKTFPLT 150 (522) T ss_pred ccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhHCceeEEEcCCC----ceEEEcc Confidence 3322211111111111222333222221 1222222 34577778888999999999998875442 3455654 Q ss_pred ccccccccccccccccc--chh----hhhcccCceeEEEEc----CCcceeecccccccc----cceeeecccceEEecC Q lcl|NC_021537. 139 AATVRVRKTTTTIERED--GEE----VENIESGHGYVQVRQ----GRRRYFGEAGDRYGD----DKRFVDKETGEVASDA 204 (602) Q Consensus 139 p~~v~~~~~~~~~~~~~--~~~----~~~~~~~~~~~qi~~----~~~~~~~~~~~~~~~----~~~~~~~~~g~~~~~~ 204 (602) .-.|............. ... ...+.....=..+.. .....++. ..+|. .+.+.....+...... T Consensus 151 ~y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~--~v~p~~~~~~~~~~~~~~~~~~~~~ 228 (522) T protein:vir:10 151 RYVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYT--YVKLDKSSGRWVWHQEAFDKIIPDS 228 (522) T ss_pred eEEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEE--EEEeeccCCceEEEEccCCcccccc Confidence 33333222111110000 000 000000000000000 00000000 00000 0000000000000000 Q ss_pred ceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHH Q lcl|NC_021537. 205 GELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMD 284 (602) Q Consensus 205 ~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~ 284 (602) .....+.....+-+|.....+..||.||...++..+.......+.......-...|..++. +++....... 
T Consensus 229 ~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~-~~~~~~~~~l-------- 299 (522) T protein:vir:10 229 RSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVS-PSSTTKPATI-------- 299 (522) T ss_pred ccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeec-cccccccccc-------- Confidence 0000112223455566666777999999999999999988888888877777777776653 2222222111 Q ss_pred HhhcccccCcceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhccccCCccC Q lcl|NC_021537. 285 NLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTSTSNRAN 363 (602) Q Consensus 285 ~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn 363 (602) ..++.+. +.. | ...++...++.. ..|.+ ..+..+..+..|..+|- +....++..-+ T Consensus 300 -~~~~~~~----~v~-g----------~~~~v~~~~~~~--~~d~~~~~~~i~~~~~ri~~aFl-----~~~~~d~~rvT 356 (522) T protein:vir:10 300 -AKAGNGA----IVQ-G----------RPEDVAVIQVGK--TADFSTAANMATAIEKRLLEAFL-----VMNVRNAERVT 356 (522) T ss_pred -cCCCCcc----eec-C----------CCccceeecccc--cccchHHHHHHHHHHHHHHHHHh-----hccCCCCCCCC Confidence 0111111 000 1 111122222221 11222 23445566777777773 22222222334 Q ss_pred HHHHHH--------------HHHHHHHHHHHHHHHHHHhh-hcCCccccccceEEEeccchhcchhHHHHHHHHHHHHHH Q lcl|NC_021537. 364 SKEQTR--------------EFAKGIIEPEQAKFSARLYK-IIHQDALDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMR 428 (602) Q Consensus 364 ~e~~~~--------------~f~~~~l~P~~~~ie~~ln~-~Ll~~~~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~ 428 (602) +++... 
.+....|.|++.+.-..+.+ .+|++......--......+.+.-.++.+....+++.+- T Consensus 357 AtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~~v~~is~Laraq~~~~l~~~~~~i~ 436 (522) T protein:vir:10 357 AEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPTIVAGVNALGRGQDRESLTAFVGTIA 436 (522) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccccccccccchhHHHHHHHHHHHHHHHHHHH Confidence 443211 12344556666655554443 344432211100111112222222223333333333221 Q ss_pred hC-C------cccHHHH----HHHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccc--c--- Q lcl|NC_021537. 429 LA-G------VGTVNEA----REELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE--R--- 492 (602) Q Consensus 429 ~~-G------~~T~NE~----R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~--- 492 (602) .. | .+..+++ -..+|.|+. +......+.+ ...++..+.....+..+ . T Consensus 437 ~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~-----------~ivrt~eev~------~~~q~~q~~~~~~~~~~~a~~~~ 499 (522) T protein:vir:10 437 QTLGPEALMQYLNPLEAIKRLAAAQGIDVL-----------NLVKTEQQLA------EEQQAAQQQAAQQSLVDQAGQMT 499 (522) T ss_pred HhhCchhhhhcCCHHHHHHHHHHHhCCChh-----------hhcCCHHHHH------HHHHHHHHHHHHHHHHHHHHHHh Confidence 10 0 0111111 112232210 0000000000 00000000000000000 0 Q ss_pred --ccccccccccchhhhhcchhhhhhheec Q lcl|NC_021537. 
493 --DSVDVDVSKDPIEQTTFSSSNLDEGLYD 520 (602) Q Consensus 493 --~~~~~~~~~~~m~~~~v~ss~~~~~~yd 520 (602) ...+-.+.++.| ..+.+.+-+ T Consensus 500 ~~~~~~~~~~~~~~-------~~~~~~~~~ 522 (522) T protein:vir:10 500 GSPLMDPTKNPQLM-------DEEQPPMEE 522 (522) T ss_pred cccccCccccHHHH-------HHhCCCCCC Confidence 000000000000 111111111 No 253 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=79.45 E-value=0.1 Score=25.96 Aligned_cols=453 Identities=10% Similarity=-0.040 Sum_probs=142.3 Q ss_pred CCCCcccccccchhhhcccCccccC-----CCCHHHHHHHHhhhHHHHHHHHHHHHhhccC-------ceEEEEecCC-C Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQP-----PYNPETLAAFQELNETHQACIRKKSRYEAGY-------GFEIVAHPSA-D 67 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p-----~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~i~~~~~~-~ 67 (602) ..+..+.++..+.. ..+...++-| .-+-....++. .++...|++.+|..+.+. .|++...+.. . T Consensus 20 ~~~L~~~R~~~e~~-w~e~a~~~lP~~~~~~~~~~~~~~~~--dstg~~a~~~LAa~l~~~ltpp~~~WF~L~~~d~~~~ 96 (516) T protein:vir:10 20 WEKFSTKRSSFLDR-AKHYSKLTLPYLMNDKGDNETSQNGW--QGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAQGEK 96 (516) T ss_pred HHHHHHhhhHHHHH-HHHHHHhhcccccCCCCCcccccccc--cchHHHHHHHHHHHHHhhhcCCCCccccccCChhhHh Confidence 11000001000100 0000001111 00001111222 234457777777776642 3333221110 0 Q ss_pred CcccchhhHHHHHHhhhccc-hhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccc Q lcl|NC_021537. 68 EPDEGGESYQTVRDFWYGSD-SRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRK 146 (602) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~-~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~ 146 (602) .-.........++..+..+. ..+..+. ..+++.-+..+..|+.++|||.+++. .++ +.+.+||..-.|.... 
T Consensus 97 ~~~~~~~~~~~v~~~L~~ve~~~~~~l~----~snf~~~~~~~~~~L~~~G~a~l~~d--~~~-~~~~~pl~~y~v~~d~ 169 (516) T protein:vir:10 97 VLNQRGLKKTELATIFAQVETRAMKELE----QRQFRPAVVEAFKHLIVAGSCMLYKP--SKG-AISAIPMHHYVVNRDT 169 (516) T ss_pred hhhccCchhHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhHCeEeEEec--CCC-CeEEEEcCeEEEeeCC Confidence 00111111222333222221 1122222 34667777788889999999987763 333 3567777544443322 Q ss_pred cccccccccchhhhhcccCceeEEE----------EcCCcceeecccccccccceeeecccceEEecCceeE----E--e Q lcl|NC_021537. 147 TTTTIEREDGEEVENIESGHGYVQV----------RQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGELK----N--G 210 (602) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~qi----------~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~----~--~ 210 (602) ............+...--...|-.. .......++....+.+..... .+...++... . | T Consensus 170 ~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~~~~~~------~~~~~d~~~~~~~s~~~~ 243 (516) T protein:vir:10 170 NGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLGEGFWE------LKQSADDIPVGKVSKIKS 243 (516) T ss_pred CCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecCCCceE------EEEeeCceeecccccccc Confidence 2211111000000000000000000 000000000000000000000 0000111110 1 2 Q ss_pred chhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhccc Q lcl|NC_021537. 211 PANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSR 290 (602) Q Consensus 211 ~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~ 290 (602) .....+-+|.....+..||.||...++..+.......+.......-...|..++. +++..... ++.-+. 
T Consensus 244 ~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~~~----------~l~~~~ 312 (516) T protein:vir:10 244 EKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIR-PGAQTDVD----------HFVNSG 312 (516) T ss_pred ccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccC-cccccchh----------hhccCC Confidence 2234466666666778999999999999998888888877776666666655543 22222211 111111 Q ss_pred ccCcceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHH Q lcl|NC_021537. 291 YRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTR 369 (602) Q Consensus 291 nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~ 369 (602) + |.+ ..+. ..++....+. +..|.+ ..+..+.....|..+|-+.....-.. ..=++++... T Consensus 313 ~-g~~--~~g~-----------~~~v~~~q~~--~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~---~rvTAtEV~~ 373 (516) T protein:vir:10 313 T-GEV--VTGV-----------EEDIHIVQLG--KYADLTPISAVLEVYTRRIGVVFMMETMTRRDA---ERVTAVEIQR 373 (516) T ss_pred C-cee--ecCC-----------cccceeeecC--cccchHHHHHHHHHHHHHHHHHHhhhhhhccCC---ccccHHHHHH Confidence 1 111 1111 1111111111 111222 23455667788888887754222222 2224443221 Q ss_pred --HHHHHHHHHHHHHHHHHH---------hhhcCCccccccceE-EEeccchhcchhHHHHHHHHHHHHHHhCCccc-HH Q lcl|NC_021537. 370 --EFAKGIIEPEQAKFSARL---------YKIIHQDALDVDEWT-IDFELRGAEQPEQDAKMAEQRVRAMRLAGVGT-VN 436 (602) Q Consensus 370 --~f~~~~l~P~~~~ie~~l---------n~~Ll~~~~~~~~~~-~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T-~N 436 (602) .=....|.|.+..+..+| +..+-.......+.. +.+ .+.+.-.++......+++. +. 
.++- +- T Consensus 374 r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~~p~~P~~lv~~~~v~~--i~~L~raq~~~~i~~~~q~-i~-~~~q~~p 449 (516) T protein:vir:10 374 DALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGDSFTSDLVDPVIITG--IEALGRMAELDKLANFAQY-MS-LPLQWPE 449 (516) T ss_pred HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhCCCCChhhcCcceehh--HHHHHHHHHHHHHHHHHHH-HH-HHhcCCh Confidence 112234555555444443 211111111111111 111 1111111222222111111 10 0000 01 Q ss_pred HHHHHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccccccccccccccchhhhhcchh Q lcl|NC_021537. 437 EAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSS 512 (602) Q Consensus 437 E~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss 512 (602) ++...++. +..+.......|.+..---.++............++ ..-........++.+-......+ T Consensus 450 ~v~d~id~--------d~~~~~~a~~~gvp~~~irs~eev~~~r~~~~~~q~-~~~~~~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:10 450 PVLAAVKW--------PDYMDWVRGQISAELPFLKSAEEMEQEQEAQMQAQQ-AQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred HHHhhcCH--------HHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHH-HHHHHHHhhhcccchhhhhhhcC Confidence 11111111 000000000001000000000000000000000000 00000000000111111111111 No 254 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=78.84 E-value=0.11 Score=25.83 Aligned_cols=451 Identities=9% Similarity=-0.012 Sum_probs=154.1 Q ss_pred CCCCcccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC-------ceEEEEecCCCCcccch Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY-------GFEIVAHPSADEPDEGG 73 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~i~~~~~~~~~~~~~ 73 (602) +-..- ..+.....+.......-++| ++...|++.+|..+.+. .|++...+. 
...+ T Consensus 34 ~lP~~---~~~~~~~~~~~~~~~~~~~d-----------st~~~a~~~Las~l~~~ltpp~~~WF~l~~~d~--~~~e-- 95 (559) T protein:vir:95 34 INPRG---SRFLTSEVNRNDRRNTRIID-----------STGTMAARTLASGMMSGITSPARPWFRLATPDP--EMMD-- 95 (559) T ss_pred hcccc---CCcCCCCCCccccccccccc-----------chHHHHHHHHHHHHHHhhcCCCCcccccccCCc--cccc-- Confidence 10000 00000000000001111232 23345566666655431 333322221 1111 Q ss_pred hhHHHHHHhhhccc-hhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccccccc Q lcl|NC_021537. 74 ESYQTVRDFWYGSD-SRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTTTTIE 152 (602) Q Consensus 74 ~~~~~~~~~~~~~~-~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~~~~~ 152 (602) ...++..+..+. ..+..+.+ .+++.-+..+..|+.++||+.+++..+.. +.+++.++|...+-+..+..+.. T Consensus 96 --~~~v~~~L~~ve~~~~~~l~~----snf~~~~~~~~~~L~~~Gta~l~~~~d~~-~~~r~~~~~l~~~~v~~d~~G~v 168 (559) T protein:vir:95 96 --YGPVKLWLEAVQNRMNDMFNK----SNLYQSLPQLYGSLGTYSTGAMAVLDDDE-DIIRTMPFPIGSYYLANSPRGSV 168 (559) T ss_pred --hHHHHHHHHHHHHHHHHHHHh----cCcHHHHHHHHHHHHhhCceeeEeecCCC-ceeEEEEeecCeEEEeeCCCCCe Confidence 122222222211 12223332 35666677788999999999998876653 45666666666665555544321 Q ss_pred cccch--------------------hhh-hcccC--ceeEEEEcCCcceeecccccccccceeeec--ccceEEecCc-- Q lcl|NC_021537. 153 REDGE--------------------EVE-NIESG--HGYVQVRQGRRRYFGEAGDRYGDDKRFVDK--ETGEVASDAG-- 205 (602) Q Consensus 153 ~~~~~--------------------~~~-~~~~~--~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~--~~g~~~~~~~-- 205 (602) ..-.+ .+. ....+ ..++.++.- .+.-.+..+......+. ..+++...+. 
T Consensus 169 d~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~~~~~~~pf~s~~~e~~~~~~ 244 (559) T protein:vir:95 169 DTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHS----VYPNIDRDTSKLDSKNKPFKSVYYEVGGDND 244 (559) T ss_pred EEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEE----EeccccccccccccccceEEEEEEEecCCCc Confidence 11000 000 00000 001111000 00000000000000000 0011111000 Q ss_pred ---eeEEechhHEEEecCCCCCCCccccc-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHH Q lcl|NC_021537. 206 ---ELKNGPANELIFLPNPSPLALYYGVP-DWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRN 281 (602) Q Consensus 206 ---~~~~~~~~eviH~r~~~~~~~~~G~s-pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~ 281 (602) ....|.....+-+|.....+..||.| |...++..+.......+...........|...+ ++...... T Consensus 245 ~~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v--~~~~~~~~------- 315 (559) T protein:vir:95 245 KLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSLKNQR------- 315 (559) T ss_pred eeeecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceec--cccccccc------- Confidence 01112223345556665667789999 899998888888888887777777777776554 22211100 Q ss_pred HHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHH-HHHHHhhHHHHHHHhcCChHHh-hccccC Q lcl|NC_021537. 282 LMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEF-QAFRERNEHEIAKVHGVPPVLI-NVTSTS 359 (602) Q Consensus 282 ~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf-~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~ 359 (602) .+ ..++|..+.....+. . .++|+-..+. +.++ .+..+.....|-.+|-..+.++ +.. 
++ T Consensus 316 -~~------------l~pgg~~~~~~~~~~--~--~i~p~~~~~~-~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r-~~ 376 (559) T protein:vir:95 316 -AS------------LLPGDITYIDQITGQ--D--GFRPAYLVNP-STADLVADIQDTRQIINSAYFVDLFMMLQNI-NT 376 (559) T ss_pred -ee------------eeccceeeeCCCCCc--c--cceeeccccc-chHHHHHHHHHHHHHHHHHhhhhhHHHhhcC-CC Confidence 00 111222211111100 0 1233322222 3333 2335667899999998875433 221 22 Q ss_pred CccCHHHHH--------------HHHHHHHHHHHHHHHHHHHhh-hcCCccc-cccceEEEeccchhcchhHHHHHHH-- Q lcl|NC_021537. 360 NRANSKEQT--------------REFAKGIIEPEQAKFSARLYK-IIHQDAL-DVDEWTIDFELRGAEQPEQDAKMAE-- 421 (602) Q Consensus 360 ~~sn~e~~~--------------~~f~~~~l~P~~~~ie~~ln~-~Ll~~~~-~~~~~~~~f~~~~~~~~~~d~~~~~-- 421 (602) ..-++++.+ ..+....|.|++.+.-..+.+ .++|+.. ...+-.++......+....+..... T Consensus 377 ~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i 456 (559) T protein:vir:95 377 RSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSL 456 (559) T ss_pred CCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHH Confidence 222433321 122334566666555444443 3444321 1122223333333332212111111 Q ss_pred ----HHHHHHHhCC-----cccHHHHH----HHhCCCCCCCCcccccccccccc-ccccccCCCcCcccccccccccccc Q lcl|NC_021537. 422 ----QRVRAMRLAG-----VGTVNEAR----EELDLAPFEDDRGDMTLSEFEAE-FGADASDGDAEAMLTRSKAAPPLEN 487 (602) Q Consensus 422 ----~~~~~~~~~G-----~~T~NE~R----~~~Gl~p~~~g~~d~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (602) +.+..+...+ .+..+++- +.+|.|+- .+.+.-.+. ..++-+. .+...+.. .. T Consensus 457 ~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~------~irs~~ev~~~rqqr~~----~qq~~q~~----~~ 522 (559) T protein:vir:95 457 ASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPT------VIVPQEQVEQARQQRAQ----QQQQQQMM----AM 522 (559) T ss_pred HHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchh------hcCCHHHHHHHHHHHHH----HHHHHHHH----HH Confidence 1111111100 01222221 22333210 000000000 0000000 00000000 00 Q ss_pred cccccccccc--cccccchhhhhcchhhhhhheeccccc Q lcl|NC_021537. 
488 KIGERDSVDV--DVSKDPIEQTTFSSSNLDEGLYDFGER 524 (602) Q Consensus 488 ~~~~~~~~~~--~~~~~~m~~~~v~ss~~~~~~yd~~~~ 524 (602) .......++. .+....++...--...+...| ...+ T Consensus 523 ~~~aa~~~~~~~~~~~~~~~~l~~~~~~~~~~~--~~~~ 559 (559) T protein:vir:95 523 GMAAAQGVKTLSEAKTSDPSVLSAMANAVSGQG--GQSQ 559 (559) T ss_pred HHHHHHhhhccccccCCChhHHHHHHHhhcCcc--ccCC Confidence 0000000000 000000000000011111111 1122 No 255 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=77.29 E-value=0.13 Score=25.51 Aligned_cols=451 Identities=9% Similarity=-0.037 Sum_probs=143.3 Q ss_pred CCCCcccccccc---hhhh-cccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC-------ceEEEEecCCC-C Q lcl|NC_021537. 1 MSKAEETTQLDE---RHIA-TDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY-------GFEIVAHPSAD-E 68 (602) Q Consensus 1 ~~k~~~~~~~~~---~~~~-~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~i~~~~~~~-~ 68 (602) ..+..+.++..+ +.++ +.....+.+.-+-....+.. +++...|++.+|.-+.+. .|++...+... . T Consensus 19 ~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~--dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~ 96 (515) T protein:vir:70 19 WEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGW--QGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKV 96 (515) T ss_pred HHHHHHhhhHHHHHHHHHHHHhcccccCCCCCcccccccc--cchHHHHHHHHHHHHHHhhcCCCCcccccccChhhhhc Confidence 000000000000 0000 00000000000111111111 234456777777766532 33332221110 0 Q ss_pred cccchhhHHHHHHhhhccc-hhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCccccccccc Q lcl|NC_021537. 69 PDEGGESYQTVRDFWYGSD-SRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKT 147 (602) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~-~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~ 147 (602) ..........++..+..+. ..+..+. ..+++.-+..+..|+.++|||.+++. .++ +.+.+||..-.|..... 
T Consensus 97 l~~~~~~~~~v~~~l~~ve~~~~~~l~----~snf~~~~~~~~~~L~~~G~a~l~~d--~~~-~~~~~pl~~y~v~~d~~ 169 (515) T protein:vir:70 97 LDDRGLKKTQLATIFARVETTAMKALE----QRQFRPAIVEVFKHLIVAGNCLLYKP--SKG-AMSAVPMHHYVVNRDTN 169 (515) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHH----hcCchHHHHHHHHHHHhHCeEEEEEe--CCC-CeEEEEcCeEEEeeCCC Confidence 1111112222333222221 1122222 34677777888889999999998873 333 25677775444332222 Q ss_pred cccccccc--chhhh----hcccCc----eeEEEEcCCcceeecccccccccceeee-cccceEEecCceeEEechhHEE Q lcl|NC_021537. 148 TTTIERED--GEEVE----NIESGH----GYVQVRQGRRRYFGEAGDRYGDDKRFVD-KETGEVASDAGELKNGPANELI 216 (602) Q Consensus 148 ~~~~~~~~--~~~~~----~~~~~~----~~~qi~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~~~~~~~~~~evi 216 (602) ........ ..... .+.... ............++....+.+.....+. ...+......++ ..+.....+ T Consensus 170 G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~~~e~d~~~~~~es~-y~~~e~P~~ 248 (515) T protein:vir:70 170 GDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKESR-IKSEKLPFI 248 (515) T ss_pred cCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCCCceEEEEecCceeeccccc-cccccCCce Confidence 21111100 00000 000000 0000000000011110000110000000 000110000000 011223345 Q ss_pred EecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcce Q lcl|NC_021537. 217 FLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAIL 296 (602) Q Consensus 217 H~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~ 296 (602) -+|.....+..||.||...++..+...+...+.......-...|..++. +++..... 
.+..+.+ |.+ T Consensus 249 ~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~-~~g~~~~~----------~l~~~~~-g~i- 315 (515) T protein:vir:70 249 PLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIR-PGSQTDVD----------HFVNSGT-GEV- 315 (515) T ss_pred eeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeC-cccccchh----------hccccCC-cee- Confidence 5566666778999999999999999988888888777677777766653 23222221 1111111 111 Q ss_pred eccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHH--HHHH Q lcl|NC_021537. 297 EVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTR--EFAK 373 (602) Q Consensus 297 ~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~--~f~~ 373 (602) ..+. ...+...++. +..|.+ ..+..+.....|..+|-+........+ .-++++... .=.. T Consensus 316 -v~g~-----------~~~v~~~~~~--~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~---rvTAtEV~~r~~E~~ 378 (515) T protein:vir:70 316 -ITGV-----------AEDIHIVQLG--KYADLTPISAVLEVYTRRIGVIFMMETMTRRDAE---RVTAVEIQRDALEIE 378 (515) T ss_pred -ecCC-----------cccceeeecC--cccchhHHHHHHHHHHHHHHHHHhhhhhhccCCc---cccHHHHHHHHHHHH Confidence 1111 1111111111 111222 124456677889888877543332222 224433221 1122 Q ss_pred HHHHHHHHHHHHHHhhhc--------CCccccccceEEEec-cchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCC Q lcl|NC_021537. 374 GIIEPEQAKFSARLYKII--------HQDALDVDEWTIDFE-LRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDL 444 (602) Q Consensus 374 ~~l~P~~~~ie~~ln~~L--------l~~~~~~~~~~~~f~-~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl 444 (602) ..|.|.+..+.++|=.-| +++.-. ....+.+. ....+.-..+......+++ .+..-..-+-++...++. 
T Consensus 379 ~~LGpv~srL~~Ell~Pli~r~~~~~~p~~P~-~~v~~~~vs~l~~L~r~q~~~~i~~~~q-~i~~~~~~~p~~~~~id~ 456 (515) T protein:vir:70 379 QNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTS-ELVDPVIVTGIEALGRMAELDKLANFAQ-YMSLPQTWPEPAQRAIRW 456 (515) T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHhhCCCCCh-hhcccceehhHHHHHHHHHHHHHHHHHH-HHHHHhccChhHHhhCCH Confidence 345555555444432211 222111 01112110 1011111112111111111 111000001111111111 Q ss_pred CCCCCCccccccccccccccccccCCCcCccccccccccccccccc----------ccccccccccccchhhh Q lcl|NC_021537. 445 APFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIG----------ERDSVDVDVSKDPIEQT 507 (602) Q Consensus 445 ~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~m~~~ 507 (602) +..+.. .....+.+.. -............+.. ....+......++|..- T Consensus 457 --------d~~~~~-~a~~~g~p~~-----~~rs~eev~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) T protein:vir:70 457 --------GDYMDW-VRGQISAELP-----FLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) T ss_pred --------HHHHHH-HHHHhCCCcc-----ccCCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccchhhhhccC Confidence 000000 0000000000 0000000000000000 00001111111111111 No 256 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=76.07 E-value=0.14 Score=25.27 Aligned_cols=474 Identities=9% Similarity=-0.047 Sum_probs=153.0 Q ss_pred CCCCcccccccch---hhh-cccCccccCCCC--HHHHHHHHhhhHHHHHHHHHHHHhhccC-----c-eEEEEecCC-C Q lcl|NC_021537. 1 MSKAEETTQLDER---HIA-TDVGRGIQPPYN--PETLAAFQELNETHQACIRKKSRYEAGY-----G-FEIVAHPSA-D 67 (602) Q Consensus 1 ~~k~~~~~~~~~~---~~~-~~~~~~i~p~~~--~~~l~~~~~~~~~v~~cI~~ia~~ia~~-----~-~~i~~~~~~-~ 67 (602) ..+..+.++..+. .++ +.......+.-+ -..+.++.+ ++...|++.+|..+.+. + |++...+.. 
+ T Consensus 17 ~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~d--st~~~a~~~Laa~l~~~ltP~~~WFrl~~~d~~~~ 94 (536) T protein:vir:21 17 YERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQ--AVGARGLNNLASKLMLALFPMQTWMRLTISEYEAK 94 (536) T ss_pred HHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccccccccccc--ccHHHHHHHHHHHHHHhhcCCCcccccccChhhhh Confidence 0000000000000 000 000000000000 001112221 23345666666665542 2 222111111 0 Q ss_pred CcccchhhHHHHHHhhhcc-chhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCce--EEEEEeCcccccc Q lcl|NC_021537. 68 EPDEGGESYQTVRDFWYGS-DSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTP--VGLAHVPAATVRV 144 (602) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~-~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~--~~L~~l~p~~v~~ 144 (602) ...........++..+..+ ...+..+. ..+++.-+..+..|+.++||+.+++..+..+.+ ...+||..-.|.. T Consensus 95 ~~~~~~~~~~~v~~~L~~ve~~~~~~l~----~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~f~~~pl~~~~v~~ 170 (536) T protein:vir:21 95 QLLSDPDGLAKVDEGLSMVERIIMNYIE----SNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQR 170 (536) T ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEcCeEEEee Confidence 0001111222232222221 11222332 245677777888999999999999877665443 4566664444433 Q ss_pred ccccccccccc--chh----hhhcccCceeEEEEc--CCcceeeccccccccccee--eecccceEEecCceeEEechhH Q lcl|NC_021537. 145 RKTTTTIERED--GEE----VENIESGHGYVQVRQ--GRRRYFGEAGDRYGDDKRF--VDKETGEVASDAGELKNGPANE 214 (602) Q Consensus 145 ~~~~~~~~~~~--~~~----~~~~~~~~~~~qi~~--~~~~~~~~~~~~~~~~~~~--~~~~~g~~~~~~~~~~~~~~~e 214 (602) ........... ... ...+..-..-..... .....++......++...+ .....|...........|.... 
T Consensus 171 d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~v~~~~g~~~f~~~P 250 (536) T protein:vir:21 171 DAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEEVEGMEVQGSDGTYPKEACP 250 (536) T ss_pred CCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEeccCCeeeccccCccccccCC Confidence 22221111100 000 000000000000000 0000000000000000000 0000111111111111233445 Q ss_pred EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCc Q lcl|NC_021537. 215 LIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTA 294 (602) Q Consensus 215 viH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~ 294 (602) .+.+|.....+..||.||...++..+.......+.......-...|...+. +++........ .++. |. T Consensus 251 ~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~~~~~~---------~~~~--g~ 318 (536) T protein:vir:21 251 YIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQPRRLT---------KAQT--GD 318 (536) T ss_pred eeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccC-cccccchhhhc---------cCCC--cc Confidence 677887777788999999999999888888777766665444455544443 33322222111 1111 11 Q ss_pred ceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHH---- Q lcl|NC_021537. 295 ILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTR---- 369 (602) Q Consensus 295 ~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~---- 369 (602) + +.+. ..++...++.... |.+ ..+..+.....|-.+|-+.. +.. .++..-++++... 
T Consensus 319 ~--v~g~-----------~~~v~~~~~~~~~--~~~~~~~~i~~~~~rI~~af~~~~--l~~-~~~~r~TAtEV~~r~~E 380 (536) T protein:vir:21 319 F--VTGR-----------PEDISFLQLEKQA--DFTVAKAVSDAIEARLSFAFMLNS--AVQ-RTGERVTAEEIRYVASE 380 (536) T ss_pred e--ecCC-----------cccceeeeccccc--cchHHHHHHHHHHHHHHHHHhhhh--ccc-CCCCCccHHHHHHHHHH Confidence 1 1110 0111122222111 111 13445667788888885532 211 1222234433211 Q ss_pred ----------HHHHHHHHHHHHHHHHHH-hhhcCCccccccceEEEec--cchhcchhHHHHHHHHHHHHHHhCCcccHH Q lcl|NC_021537. 370 ----------EFAKGIIEPEQAKFSARL-YKIIHQDALDVDEWTIDFE--LRGAEQPEQDAKMAEQRVRAMRLAGVGTVN 436 (602) Q Consensus 370 ----------~f~~~~l~P~~~~ie~~l-n~~Ll~~~~~~~~~~~~f~--~~~~~~~~~d~~~~~~~~~~~~~~G~~T~N 436 (602) .+....|.|++.+.-..+ ...++++-.. ..+.+++. +..+.+. .+.+....+++.+... .| T Consensus 381 ~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~-~~v~~~~vs~l~~l~r~-~~~~~l~~~~~~la~~---~P- 454 (536) T protein:vir:21 381 LEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPK-EAVEPTISTGLEAIGRG-QDLDKLERCVTAWAAL---AP- 454 (536) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCh-hhccceEEecHHHHHHH-HHHHHHHHHHHHHHhh---ch- Confidence 233445666665544444 3335543221 11233332 2122111 1222222232222111 11 Q ss_pred HHHHHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccccccccccccc--cchhhhhcchhhh Q lcl|NC_021537. 437 EAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSK--DPIEQTTFSSSNL 514 (602) Q Consensus 437 E~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~m~~~~v~ss~~ 514 (602) |+.. +.++ .+..+...-...|.++..--....+-.+........+..+...+..-..+ ..+.....-+... T Consensus 455 e~ld----~~id---~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 527 (536) T protein:vir:21 455 MRDD----PDIN---LAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAA 527 (536) T ss_pred hhhc----ccCC---HHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHhhh Confidence 1100 0011 01111000000111110000000000000000000000000000000000 0000111123344 Q ss_pred hhheecccc Q lcl|NC_021537. 
515 DEGLYDFGE 523 (602) Q Consensus 515 ~~~~yd~~~ 523 (602) .++|-.++- T Consensus 528 ~~~g~~~~~ 536 (536) T protein:vir:21 528 DSVGLQPGI 536 (536) T ss_pred hccccCCCC Confidence 555555554 No 257 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=75.28 E-value=0.15 Score=25.12 Aligned_cols=454 Identities=11% Similarity=0.048 Sum_probs=154.2 Q ss_pred CCCCcccccccchhhhcccC-ccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC-------ceEEEEecCCCCcccc Q lcl|NC_021537. 1 MSKAEETTQLDERHIATDVG-RGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY-------GFEIVAHPSADEPDEG 72 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~-~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-------~~~i~~~~~~~~~~~~ 72 (602) +.+---... .+.+....+ +...+ ....++.+ ++...|++.+|..+.+. .|++...+. ...+. T Consensus 27 ~~~~~lP~~--~~~~~~~~~~~~~~~----~~~~~i~d--st~~~a~~~Las~L~~~ltPp~~~WF~l~~~d~--~~~~~ 96 (547) T protein:vir:10 27 IRKYIMPMR--SDFFSDLRSEGSINW----NQNREVFD--STAGDGLETLSSSLHGSLTSPATKWFELAFRDK--ELNSD 96 (547) T ss_pred HHHHhcccc--cccccCCCCCccccc----cccccccc--chHHHHHHHHHHHHHHhhcCCCCcccccccCCc--cccch Confidence 000000000 000000000 00011 11122221 33445666666665532 333332221 11111 Q ss_pred hhhHHHHHHhhhcc-chhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC-CCceEEEEEeCcccccccccccc Q lcl|NC_021537. 73 GESYQTVRDFWYGS-DSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG-DGTPVGLAHVPAATVRVRKTTTT 150 (602) Q Consensus 73 ~~~~~~~~~~~~~~-~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~-~G~~~~L~~l~p~~v~~~~~~~~ 150 (602) . .++..+..+ ...+..+.+ .+++.-+..+..|+.++||+.+++..+. 
.+..+.+..+|...+-+..+..+ T Consensus 97 ~----~v~~~L~~ve~~i~~~l~~----snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl~~~~v~~d~~G 168 (547) T protein:vir:10 97 D----ECRKWLENATHDVYSALQD----SNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPIQDSYFEEDSRG 168 (547) T ss_pred H----HHHHHHHHHHHHHHHHHHh----cCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEeecceEEEeeCCCc Confidence 1 222222221 122233333 3466777778899999999999988764 23345555555555544444333 Q ss_pred cccccch--------hhhh--------------cccCcee---EEE---EcCCc-ceeecc------cccccccceeeec Q lcl|NC_021537. 151 IEREDGE--------EVEN--------------IESGHGY---VQV---RQGRR-RYFGEA------GDRYGDDKRFVDK 195 (602) Q Consensus 151 ~~~~~~~--------~~~~--------------~~~~~~~---~qi---~~~~~-~~~~~~------~~~~~~~~~~~~~ 195 (602) ....-.+ .... ..+...+ +.+ +..+. +..... ....|....++.. T Consensus 169 ~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~p~~s~~~e~ 248 (547) T protein:vir:10 169 QVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTVLAPTERPFGKKWILK 248 (547) T ss_pred CeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccceeeccccceeEEEEEe Confidence 2110000 0000 0011100 000 00000 000000 0000000000000 Q ss_pred ccceEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHH Q lcl|NC_021537. 196 ETGEVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDS 275 (602) Q Consensus 196 ~~g~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~ 275 (602) ..+... +....|.....+.+|.....+..||.||...++..+.......+.......-...|.+++. 
+++...+ T Consensus 249 ~~~~~~---l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~-~~g~~~~-- 322 (547) T protein:vir:10 249 EGAVQL---GEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVT-ERGLISD-- 322 (547) T ss_pred cCceee---eecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceecc-ccccccc-- Confidence 000000 0111234456788888877888999999999999998888888877777777777776543 2222221 Q ss_pred HHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHH-HHHHHhhHHHHHHHhcCChHHhh Q lcl|NC_021537. 276 KEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEF-QAFRERNEHEIAKVHGVPPVLIN 354 (602) Q Consensus 276 ~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf-~e~~~~~~~~Ia~~fgVPp~~lg 354 (602) ++-. .|.+.+....- .++|+.... |.+. .+..+.....|-.+|-+....+- T Consensus 323 -------~~~~-----pgg~~~~~~~~--------------~v~pl~~~~--~~~~~~~~i~~~~~rI~~af~~d~~~~~ 374 (547) T protein:vir:10 323 -------IDLG-----ASGLTVVRDME--------------SMKPFESRA--RFDVSSIQLTDLRSAVRRIYYVDQLQMK 374 (547) T ss_pred -------ceec-----CCeeeecCCcc--------------cceeeeccc--chHHHHHHHHHHHHHHHHHhhhhhhhcC Confidence 1100 11111111110 123332221 2222 35566778889999987654432 Q ss_pred ccccCCccCHHHHHH--------------HHHHHHHHHHHHHHHHHHhh-hcCCccccc----cceEEEeccchhcchhH Q lcl|NC_021537. 355 VTSTSNRANSKEQTR--------------EFAKGIIEPEQAKFSARLYK-IIHQDALDV----DEWTIDFELRGAEQPEQ 415 (602) Q Consensus 355 ~~~~~~~sn~e~~~~--------------~f~~~~l~P~~~~ie~~ln~-~Ll~~~~~~----~~~~~~f~~~~~~~~~~ 415 (602) ++..-++++... .+....|.|++.+.-..+.+ .++++.... .+..++....+.+..-. 
T Consensus 375 ---~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq 451 (547) T protein:vir:10 375 ---DSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQ 451 (547) T ss_pred ---CCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHH Confidence 222234433211 22334566666544333333 354432110 12223333333322222 Q ss_pred HHHHHHHHHHHHHhCCcccHHHHHHHhCCCCC--CCCccccccccccccccccccCCCcCcccccccccccccccccccc Q lcl|NC_021537. 416 DAKMAEQRVRAMRLAGVGTVNEAREELDLAPF--EDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERD 493 (602) Q Consensus 416 d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~--~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (602) +.... ..+...++ -+-.+.+..|- +--+.+..+...-...|.+..---..+... ..-....+.+....+ T Consensus 452 ~~~~~-~~i~~~~~-------~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~-~~r~qr~~~~q~~~q 522 (547) T protein:vir:10 452 KIDQA-ASIERWAG-------STAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVT-SIRKNRSQTQQKAEQ 522 (547) T ss_pred HHHHH-HHHHHHHH-------HHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHH-HHHHHHHHHHHHHHH Confidence 11111 11111111 01111111110 000000000000000010000000000000 000000000000000 Q ss_pred cccccccccchhhhhcchhhhhhheeccccc Q lcl|NC_021537. 494 SVDVDVSKDPIEQTTFSSSNLDEGLYDFGER 524 (602) Q Consensus 494 ~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~~ 524 (602) .+........|+........+. .++ T Consensus 523 aa~~~~~g~~m~~~~~~~a~~~------~~~ 547 (547) T protein:vir:10 523 AAIAEAEGNAMEAQGKGQAALK------ENQ 547 (547) T ss_pred HHHHHHHHHHHHhhcCcccchh------ccC Confidence 0000111111111110000000 011 No 258 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=73.91 E-value=0.16 Score=24.88 Aligned_cols=474 Identities=9% Similarity=-0.043 Sum_probs=152.3 Q ss_pred CCCCcccccccch---hhh-cccCccccCCCC--HHHHHHHHhhhHHHHHHHHHHHHhhccC-----c-eEEEEecCC-C Q lcl|NC_021537. 
1 MSKAEETTQLDER---HIA-TDVGRGIQPPYN--PETLAAFQELNETHQACIRKKSRYEAGY-----G-FEIVAHPSA-D 67 (602) Q Consensus 1 ~~k~~~~~~~~~~---~~~-~~~~~~i~p~~~--~~~l~~~~~~~~~v~~cI~~ia~~ia~~-----~-~~i~~~~~~-~ 67 (602) ..+..+.++..+. .++ +.......+.-+ -..+.++.+ ++...|++.+|..+.+. + |++...+.. + T Consensus 17 ~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~d--st~~~a~~~Laa~l~~~ltP~~~WFrl~~~d~~~~ 94 (536) T protein:vir:10 17 YERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQ--AVGARGLNNLASKLMLALFPMQTWMRLTISEYEAK 94 (536) T ss_pred HHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccccccccccc--ccHHHHHHHHHHHHHhhhcCCCcccccccChhhhh Confidence 0000000000000 000 000000000000 001111221 23345666666665542 2 222111111 0 Q ss_pred CcccchhhHHHHHHhhhcc-chhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCce--EEEEEeCcccccc Q lcl|NC_021537. 68 EPDEGGESYQTVRDFWYGS-DSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTP--VGLAHVPAATVRV 144 (602) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~-~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~--~~L~~l~p~~v~~ 144 (602) ...........++..+..+ ...+..+. ..+++.-+..+..|+.++||+.+++..+..+.+ ...+||..-.|.. T Consensus 95 ~~~~~~~~~~~v~~~L~~ve~~~~~~l~----~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~~~~~pl~~~~v~~ 170 (536) T protein:vir:10 95 QLLSDPDGLAKVDEGLSMVERIIMNYIE----SNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQR 170 (536) T ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEcCeEEEee Confidence 0001111222232222221 11222332 245677777888999999999999877665443 4566665444433 Q ss_pred ccccccccccc--chh----hhhcccCceeEEEEcCCcceeecccccccc--cc--eeeecccceEEecCceeEEechhH Q lcl|NC_021537. 145 RKTTTTIERED--GEE----VENIESGHGYVQVRQGRRRYFGEAGDRYGD--DK--RFVDKETGEVASDAGELKNGPANE 214 (602) Q Consensus 145 ~~~~~~~~~~~--~~~----~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~--~~--~~~~~~~g~~~~~~~~~~~~~~~e 214 (602) ........... ... ...+..-..-..........+..+...++. .. .+.....|...........|.... 
T Consensus 171 d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~e~~g~~v~~~~g~~~f~~~P 250 (536) T protein:vir:10 171 DAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYEEVEGMEVQGSDGTYPKEACP 250 (536) T ss_pred CCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEEeecCccccccccccccccCC Confidence 22221111100 000 000000000000000000000000000000 00 000000111110001111233445 Q ss_pred EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCc Q lcl|NC_021537. 215 LIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTA 294 (602) Q Consensus 215 viH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~ 294 (602) .+.+|.....+..||.||...++..+.......+.......-...|...+. +++........ .++. |. T Consensus 251 ~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~~~~~~---------~~~~--g~ 318 (536) T protein:vir:10 251 YIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQPRRLT---------KAQT--GD 318 (536) T ss_pred ceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccC-cccccchhhhc---------cCCC--cc Confidence 677787777788999999999999888888777766665444455554443 33322222111 1111 11 Q ss_pred ceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHH---- Q lcl|NC_021537. 295 ILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTR---- 369 (602) Q Consensus 295 ~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~---- 369 (602) + +.+. ..++...++.... |.+ ..+..+.....|-.+|-+.. +.. .++..-++++... 
T Consensus 319 ~--v~g~-----------~~~v~~~~~~~~~--~~~~~~~~i~~~~~rI~~af~~~~--l~~-~~~~r~TAtEV~~r~~E 380 (536) T protein:vir:10 319 F--VTGR-----------PEDISFLQLEKQA--DFTVAKAVSDAIEARLSFAFMLNS--AVQ-RTGERVTAEEIRYVASE 380 (536) T ss_pred e--ecCC-----------cccceeeeccccc--cchHHHHHHHHHHHHHHHHHhhhh--ccc-CCCCCccHHHHHHHHHH Confidence 1 1110 0111122222111 111 13445667788888885542 211 1222234433211 Q ss_pred ----------HHHHHHHHHHHHHHHHHH-hhhcCCccccccceEEEec--cchhcchhHHHHHHHHHHHHHHhCCcccHH Q lcl|NC_021537. 370 ----------EFAKGIIEPEQAKFSARL-YKIIHQDALDVDEWTIDFE--LRGAEQPEQDAKMAEQRVRAMRLAGVGTVN 436 (602) Q Consensus 370 ----------~f~~~~l~P~~~~ie~~l-n~~Ll~~~~~~~~~~~~f~--~~~~~~~~~d~~~~~~~~~~~~~~G~~T~N 436 (602) .+....|.|++.+.-..+ ...++++--. ..+.+++. +..+.+. .+.+....+++.+... .| T Consensus 381 ~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~-~~v~~~~vs~l~~l~r~-~~~~~l~~~~~~la~~---~P- 454 (536) T protein:vir:10 381 LEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPK-EAVEPTISTGLEAIGRG-QDLDKLERCVTAWAAL---AP- 454 (536) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCh-hhccceEEecHHHHHHH-HHHHHHHHHHHHHHhh---ch- Confidence 233445666665544444 3335543221 11233332 2122111 1222222223222111 11 Q ss_pred HHHHHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccccccccccccc--cchhhhhcchhhh Q lcl|NC_021537. 437 EAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVDVSK--DPIEQTTFSSSNL 514 (602) Q Consensus 437 E~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~m~~~~v~ss~~ 514 (602) |+.. +.++ .+..+...-...|.++..--....+-.+........+..+...+..-..+ ..+.....-+..+ T Consensus 455 ~~ld----~~id---~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 527 (536) T protein:vir:10 455 MRDD----PDIN---LAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAA 527 (536) T ss_pred hhhc----ccCC---HHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHhhh Confidence 1100 0011 01111000000111110000000000000000000000000000000000 0000011123344 Q ss_pred hhheecccc Q lcl|NC_021537. 
515 DEGLYDFGE 523 (602) Q Consensus 515 ~~~~yd~~~ 523 (602) .++|-.++- T Consensus 528 ~~~g~~~~~ 536 (536) T protein:vir:10 528 DSVGLQPGI 536 (536) T ss_pred hccccCCCC Confidence 455555554 No 259 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=64.46 E-value=0.3 Score=23.45 Aligned_cols=453 Identities=10% Similarity=-0.013 Sum_probs=144.6 Q ss_pred CCCCcccccccc------------hhh---hcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC------c-e Q lcl|NC_021537. 1 MSKAEETTQLDE------------RHI---ATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY------G-F 58 (602) Q Consensus 1 ~~k~~~~~~~~~------------~~~---~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~------~-~ 58 (602) ..+..+-++..+ +.+ +........-++| ++...|++.+|..+.+. + | T Consensus 18 ~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~d-----------st~~~a~~~LAa~L~~~ltpp~~~WF 86 (532) T protein:vir:99 18 YNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQ-----------SIGARGLNNLASKLMLALFPVGSSFF 86 (532) T ss_pred HHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchhhcccccc-----------chHHHHHHHHHHHHHHhhcCCCCccc Confidence 000000000000 000 0100111122232 23345666666666532 3 3 Q ss_pred EEEEecCCC-CcccchhhHHHHHHhhhcc-chhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC--CCc--eE Q lcl|NC_021537. 59 EIVAHPSAD-EPDEGGESYQTVRDFWYGS-DSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG--DGT--PV 132 (602) Q Consensus 59 ~i~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~--~G~--~~ 132 (602) ++...+..- ...........++..+..+ ...+..+. ..+++.-+..+..|+.++|||.+++..+. .++ .. 
T Consensus 87 ~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~----~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~f 162 (532) T protein:vir:99 87 KLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME----SNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAP 162 (532) T ss_pred cccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhHCcEeEEecccccccCcccce Confidence 333221110 0000111122233322222 11222222 34577777888899999999999876542 122 44 Q ss_pred EEEEeCccccccccccccccccc--c--------hhhhhcccCceeEEEEcCCcceeecccccccccceeeecccceEEe Q lcl|NC_021537. 133 GLAHVPAATVRVRKTTTTIERED--G--------EEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVAS 202 (602) Q Consensus 133 ~L~~l~p~~v~~~~~~~~~~~~~--~--------~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 202 (602) ..+||..-.|............. . .......++. +.. .......++......++...+. .++. T Consensus 163 ~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~-~~~-~p~~~v~v~~~v~~~~~~~~~~-----~~~~ 235 (532) T protein:vir:99 163 KLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEDAQ-GDQ-NPSEEVTIYTHVYRDPEAMVFR-----SYQE 235 (532) T ss_pred EEEEcCeEEEeeCCCCCeeeEeeeeeecHHhcChHHHHHhhccc-ccc-CCCcceEEEEEEEecCCCCeeE-----EEEe Confidence 55666433332222111110000 0 0000000000 000 0000001111000000000000 0111 Q ss_pred cCcee-----EE--echhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHH Q lcl|NC_021537. 203 DAGEL-----KN--GPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDS 275 (602) Q Consensus 203 ~~~~~-----~~--~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~ 275 (602) -.|.. .. +.....+-+|.....+..||.||...++..+.......+.......-...|..++. +++...... 
T Consensus 236 ~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~~~~ 314 (532) T protein:vir:99 236 IDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVN-PNGVTQIRR 314 (532) T ss_pred ecCceecccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceec-cccccchhh Confidence 11111 11 11223455566666777999999999999998888888877776666666665553 333222221 Q ss_pred HHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhh Q lcl|NC_021537. 276 KEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLIN 354 (602) Q Consensus 276 ~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg 354 (602) .. .++.+. + +.+ ...++...++...+ |.+ ..+..+.....|..+|-+.. +. T Consensus 315 ~~---------~~~~g~--~--v~g-----------~~~~i~~~~~~~~~--~~~~~~~~i~~~~~rI~~af~~~~--~~ 366 (532) T protein:vir:99 315 VA---------KANTGD--F--VAG-----------RKQDVEVFQLEKYN--DFQVAKATADDIEKRLSYAFMLNS--AV 366 (532) T ss_pred hc---------cCCCcc--e--ecC-----------Ccccceeeeccccc--chhHHHHHHHHHHHHHHHHHhhhh--cc Confidence 11 111110 1 111 01112222222111 111 13445667788888885432 11 Q ss_pred ccccCCccCHHHHHH--------------HHHHHHHHHHHHHHHHHHh-hhcCCcccc-ccceEEEeccchhcchhHHHH Q lcl|NC_021537. 355 VTSTSNRANSKEQTR--------------EFAKGIIEPEQAKFSARLY-KIIHQDALD-VDEWTIDFELRGAEQPEQDAK 418 (602) Q Consensus 355 ~~~~~~~sn~e~~~~--------------~f~~~~l~P~~~~ie~~ln-~~Ll~~~~~-~~~~~~~f~~~~~~~~~~d~~ 418 (602) . .++..-++++... .+....|.|++.+.-..+. ..+|++... ..+..+.-..+.+ ...++.. 
T Consensus 367 ~-~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~iv~~is~L-araq~~~ 444 (532) T protein:vir:99 367 Q-RGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEAL-GRGHDLN 444 (532) T ss_pred c-CCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhcccceeecchHH-HHHHHHH Confidence 1 1222234443221 1233445566555444443 335443211 1111221122222 2222333 Q ss_pred HHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCccccccccccccccccccccccccc Q lcl|NC_021537. 419 MAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVDVD 498 (602) Q Consensus 419 ~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (602) ....+++.+.. +.+ ++...+++ +..+...-...|.++..--.....-..........+ .. .... T Consensus 445 ~l~~~~~~laq---~~p-~~~d~id~--------d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~~-~~---~~a~ 508 (532) T protein:vir:99 445 KLNVFIDYMIK---LAG-LQDDDINL--------LDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAA-GM---VTAG 508 (532) T ss_pred HHHHHHHHHHh---hcc-hhhhhCCH--------HHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHHH-HH---HHHH Confidence 33333332211 001 01111111 100000000001000000000000000000000000 00 0000 Q ss_pred ccccchhhhhcchhhhhhheeccccc Q lcl|NC_021537. 499 VSKDPIEQTTFSSSNLDEGLYDFGER 524 (602) Q Consensus 499 ~~~~~m~~~~v~ss~~~~~~yd~~~~ 524 (602) .....+..+.-...+-...|=| ++ T Consensus 509 ~~~~~~~~~~~~~~~~~~~~~~--~~ 532 (532) T protein:vir:99 509 QQMGAAGGQAAAAMMQQQAGMP--TQ 532 (532) T ss_pred HHHHHHHHHhcchhHHhhcCCC--CC Confidence 0000000000000011111111 11 No 260 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=59.40 E-value=0.39 Score=22.81 Aligned_cols=448 Identities=11% Similarity=-0.003 Sum_probs=161.5 Q ss_pred CCCCcc-cccccchhhhc-ccCccccCCCCHHHHHH---HHhhhHHHHHHHHHHHHhhc------cCceEEEEecCCCCc Q lcl|NC_021537. 
1 MSKAEE-TTQLDERHIAT-DVGRGIQPPYNPETLAA---FQELNETHQACIRKKSRYEA------GYGFEIVAHPSADEP 69 (602) Q Consensus 1 ~~k~~~-~~~~~~~~~~~-~~~~~i~p~~~~~~l~~---~~~~~~~v~~cI~~ia~~ia------~~~~~i~~~~~~~~~ 69 (602) .-|+.+ +.+-+....++ -.+..+++|.|-.++-. -+..+|++.++.++..+..- |.-+-|++... .+ T Consensus 57 ~~~~~~~t~~~D~~~~g~~~~~~~~~~pr~R~qiY~~~eeM~~~p~Ia~AlniHVtaALggde~TGd~vfI~p~~~--~~ 134 (569) T protein:vir:10 57 GGKPGDSGMAGDGLVDGSRFIFDEVQLPEDRLQRYPLLEEMAVYSTIATALNIHITHALSFDKKTGQTFSIVPVHN--GN 134 (569) T ss_pred ccCccccchhhhhHHHHHHHHhhhccCchhHHHHHHHHHHHhcCchhhhhhhhhhheeecccccccceEEEEeecC--CC Confidence 344444 33333333332 23456789887655432 23357888888887765443 22344544322 22 Q ss_pred ccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEE---eCcccccccc Q lcl|NC_021537. 70 DEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAH---VPAATVRVRK 146 (602) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~---l~p~~v~~~~ 146 (602) ..+.+..+++...+... +..+ .-.....+..++..||.+|+++--+.+-.++.|+- .-|.-|++ T Consensus 135 ~a~~daakai~~el~~d--l~~~---------iNr~~~~lA~~~~aFGdsYaRiY~~~~~GV~dl~~s~yt~PsfIqp-- 201 (569) T protein:vir:10 135 DSDYDAAQALCGELMND--IGRT---------INKEVAGWAFIMSVFGVAYVRPYAKEGIGITSFECSYYTLPSFIKE-- 201 (569) T ss_pred CCcchHHHHHHHHHHHH--HHHH---------HHHHhhHHHHHHHhhhhhheeeeccCCceeEEEEecccccccccch-- Confidence 23333333444444331 1111 23445677888999999999887655444555432 22223321 Q ss_pred cccccccccchhhhhcccCceeE--EEEcC-Ccceeecccccccc-cceeeecccceEEecCceeEEechhHEEEecCCC Q lcl|NC_021537. 147 TTTTIEREDGEEVENIESGHGYV--QVRQG-RRRYFGEAGDRYGD-DKRFVDKETGEVASDAGELKNGPANELIFLPNPS 222 (602) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~--qi~~~-~~~~~~~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~~~~~eviH~r~~~ 222 (602) |+.+.. -.+|. ...+. +...+.....+.+- -|++.+-.-..-+..+.....+..++.=|.. 
T Consensus 202 ------FE~g~~------tvGF~~~~~~~~~~ti~~l~p~qm~rmKmPrm~~i~q~~~v~~g~~~~~L~~d~~~~~P--- 266 (569) T protein:vir:10 202 ------FEVSGN------LAGFSGDYLKDASGKMVFADPWAIIPMKIPYWRPKSNLMPVHTGHKAYSLLDNPEERTP--- 266 (569) T ss_pred ------hhhcCc------eEEeecccCCccccceeeechhhhhhhcccceeeccccchhhhhhhheeeccccccccc--- Confidence 111000 00000 00000 00000000000000 0111100000000000000011111111111 Q ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHH-----------HHHHHHH-hhccc Q lcl|NC_021537. 223 PLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKED-----------LRNLMDN-LKGSR 290 (602) Q Consensus 223 ~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~-----------l~~~~~~-~~g~~ 290 (602) ..-..+|-|-+..+.+.......+-....+---+..+-..+|.+....+++.+... -++.++. ..|++ T Consensus 267 i~psn~GgSFL~~ae~pf~~l~~Al~sL~~qri~dSv~~~~Itlnm~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~ 346 (569) T protein:vir:10 267 IETQNYGTSLLEYAYEPYMNLRSAIRSLKATRFNASKIDRIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGAN 346 (569) T ss_pred ccchhhhhHHHHHHHhHHHHHHHHHHhccchhhHHHHHhHHhhccccCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCc Confidence 01234898888888655433332222111111112223344555444466655532 2223322 33333 Q ss_pred cc-----CcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccc-------- Q lcl|NC_021537. 291 YR-----TAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTS-------- 357 (602) Q Consensus 291 na-----g~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~-------- 357 (602) +- +.+.+.+.+. . 
...++..-.+-+..+ +|-.-+..+..|.++|+.+.+||..+ T Consensus 347 ~~~~~~~H~LPv~gekq----~---~~tvDt~~~~A~~~g------IEdvM~~~R~LagaLGlD~SMlGwAD~LsGGLGe 413 (569) T protein:vir:10 347 NMPTVTNTLLPIMGDGK----G---QMTIDTQTIQADING------IEDILTYMRQLAAALGLDYTLLGWADQMSGGLGE 413 (569) T ss_pred cccccceeeeeeecCcc----c---cccccccccccCccc------HHHHHHHHHHHHhhhccchhHhhHHHHhcccccc Confidence 21 1222222221 1 111222212211112 34445678899999999999998532 Q ss_pred cCCccCHHH-H-HHHHHHHHHHHHHHHHHH-HHhhh---cCCccccccceEEEeccchhcchhH--H-HHHHH------- Q lcl|NC_021537. 358 TSNRANSKE-Q-TREFAKGIIEPEQAKFSA-RLYKI---IHQDALDVDEWTIDFELRGAEQPEQ--D-AKMAE------- 421 (602) Q Consensus 358 ~~~~sn~e~-~-~~~f~~~~l~P~~~~ie~-~ln~~---Ll~~~~~~~~~~~~f~~~~~~~~~~--d-~~~~~------- 421 (602) ++-+-++-| + +...+++.+.-++..+-+ .+..| .+++.. .-|.++|+....-...+ + .+.++ T Consensus 414 GG~frtSaQaa~RS~~iRqa~~e~in~iidiH~~fKYgevf~~~d--rP~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~ 491 (569) T protein:vir:10 414 GGFLRTAIQAAMRASWIQQGVEEFIQRAIDIHLAFKYGKVYPEGD--RPYKIEFHSVNTALQQEHNDNRDSQANYATIVT 491 (569) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCcccCCCC--cceEEEeccchHHHHHHHHhHHHHHHHHHHHHH Confidence 222222222 2 233444444444433322 12222 333322 24788887542111111 1 11111 Q ss_pred HHHHHHHhCCcccHHHH--HHH----hCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccccccc Q lcl|NC_021537. 422 QRVRAMRLAGVGTVNEA--REE----LDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSV 495 (602) Q Consensus 422 ~~~~~~~~~G~~T~NE~--R~~----~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (602) +.+..+.++..+-.||. |.+ +|+ +++..+.++... .+.|+.++..- .. T Consensus 492 Q~la~l~e~n~Lg~de~~m~y~l~d~~~~---De~~~e~l~ae~--------------------~akp~DEe~~~---~~ 545 (569) T protein:vir:10 492 QILDAVSNNSVLANSDAFKRYLFSDVLEI---DEKISEALVNEL--------------------KAKSEDDDHLM---DS 545 (569) T ss_pred HHHHHhhhcccccccHHHHHHHHHHHhhc---chhHHHHHHhhc--------------------CCCcchhHHHH---HH Confidence 22222233333333332 111 121 111111110000 00011111110 01 Q ss_pred cccccccchhhhhcchhhhhhheeccc Q lcl|NC_021537. 
496 DVDVSKDPIEQTTFSSSNLDEGLYDFG 522 (602) Q Consensus 496 ~~~~~~~~m~~~~v~ss~~~~~~yd~~ 522 (602) -.+.+..+..+ ....+-.-|-|-+ T Consensus 546 ~~~~~~~~~~~---~~~~~~~~~~~~~ 569 (569) T protein:vir:10 546 IIKTPPQELAQ---ILESVFKEGNDND 569 (569) T ss_pred HhcCChHHHHH---HHHHHhhccCCCC Confidence 11111111111 1111112222322 No 261 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=58.55 E-value=0.41 Score=22.70 Aligned_cols=464 Identities=10% Similarity=0.034 Sum_probs=152.0 Q ss_pred CCCCccc-------ccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC-----c-eEEEEecCCC Q lcl|NC_021537. 1 MSKAEET-------TQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY-----G-FEIVAHPSAD 67 (602) Q Consensus 1 ~~k~~~~-------~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~-----~-~~i~~~~~~~ 67 (602) ..|..+. +++.+-++++.......+.. + -+-.-.++.++.||+.+..++... . +++.+..+. T Consensus 40 ~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~-~---~rs~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~- 114 (651) T protein:vir:80 40 EETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNA-D---WRHKITTGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPG- 114 (651) T ss_pred hhhHHHHHHhhcccHHHHHhhccccccccCCCCC-C---CCccccChhHHHHHHHHHHHHHHhhcCCCceeEeccCCch- Confidence 1121111 11111112211111111100 0 000012466677776555444432 1 334332211 Q ss_pred CcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCC----------------Cc- Q lcl|NC_021537. 68 EPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGD----------------GT- 130 (602) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~----------------G~- 130 (602) +......+.++..+. +++ ....+......++.|.++.|+|++.+.++.. |. 
T Consensus 115 --d~a~~~~~~~~~~~~-----~~l-----~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~ 182 (651) T protein:vir:80 115 --QDNLLVSRLIKRYVQ-----DKL-----TEGKFRAAYANFLRQLLITGNSVLALPWRVETAEVKKKVQVRTPLFEDEP 182 (651) T ss_pred --hHHHHHHHHHHHHHH-----HHh-----hccCcHHHHHHHHHhhcccCceEEEEeecceeeeeehheecccccccccc Confidence 111112222332221 111 2334777888889999999999887765421 11 Q ss_pred -------------eEEEEEeCcccccccccccccccccchhhhhc-----------ccCceeEEE-----Ec-------- Q lcl|NC_021537. 131 -------------PVGLAHVPAATVRVRKTTTTIEREDGEEVENI-----------ESGHGYVQV-----RQ-------- 173 (602) Q Consensus 131 -------------~~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~qi-----~~-------- 173 (602) -..+..|||..+-+.+.-... .+...+.+. ..|. |.-+ +. T Consensus 183 ~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~--~d~~~v~~~~~t~~~l~~l~~~g~-~~~~~~~~~~~~~~~~~~~ 259 (651) T protein:vir:80 183 TFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDP--NRGAFIRKLTKTKADILNLLSEGY-YYGVDPLDVVEHKCKDTSD 259 (651) T ss_pred ceeeeccceeeeceeEEEEecHHHeeecCCCcCc--cccceeeeeeeeHHHHHHHHhccc-ccchhhHHHHhhhcccccc Confidence 124677787777655432211 111110000 0000 0000 00 Q ss_pred -------------------CCcceeecccccccccceeeeccc-----ceEEecCceeE---Eec---hhHEEEecCCCC Q lcl|NC_021537. 174 -------------------GRRRYFGEAGDRYGDDKRFVDKET-----GEVASDAGELK---NGP---ANELIFLPNPSP 223 (602) Q Consensus 174 -------------------~~~~~~~~~~~~~~~~~~~~~~~~-----g~~~~~~~~~~---~~~---~~eviH~r~~~~ 223 (602) .+.+.+++.. ...+... ..+...++... ..+ ....+|++.... T Consensus 260 ~~~~~~~~~~~~d~~~~~~~~~v~v~E~~-------~~~d~e~~~~~~~~v~~~g~~il~~~~~~~~~~~Pf~~~~~~~~ 332 (651) T protein:vir:80 260 TKQDMLSTFQGVTTSLWSPHQNVELLEYW-------GDIHLENKTYHDVVVTIMGNEVLRFEQNPYWCGRPFVIGTYIPT 332 (651) T ss_pred CCccccccccCCCccccccccceEEEEEE-------EEeeccCCceEEEEEEEcCcEEecccccCCCCCCCeeeecceec Confidence 0000001000 0000000 01111111111 111 124566666666 Q ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCcc Q lcl|NC_021537. 
224 LALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVD 303 (602) Q Consensus 224 ~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~ 303 (602) .+..||.|++..++........+.+.........+.|.+++. +++...++... . .-|.++......+ T Consensus 333 ~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~-~d~~~~~~~l~-------~-----~pg~vi~~~~~~~ 399 (651) T protein:vir:80 333 ARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLR-SDGLLQPEDVY-------T-----EPGKVFLVSDHGD 399 (651) T ss_pred CccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEec-CCccccHHHhh-------c-----CCCceEEecCCCC Confidence 677899999999988877777776666655565666666553 34444433221 0 1122333222222 Q ss_pred ceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCC-ccCHHH--HHHHHHHHHHHHHH Q lcl|NC_021537. 304 DHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSN-RANSKE--QTREFAKGIIEPEQ 380 (602) Q Consensus 304 ~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~sn~e~--~~~~f~~~~l~P~~ 380 (602) ...+. +-........ .........+-..+||+....|....+. .-|+.+ +...-....|++++ T Consensus 400 ~~~l~-----------~~~~~~~~~~---~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~ 465 (651) T protein:vir:80 400 LQPLA-----------NQSSNFSITY---QESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIH 465 (651) T ss_pred ceeec-----------cCcccchhHH---HHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHH Confidence 11111 1000111112 3445566788889999988877644321 112221 11122233444444 Q ss_pred HHHHHHH------------hhhcCCc--------------ccc--ccceEEEeccchhcch--hHHHHHHHHHHHHHHhC Q lcl|NC_021537. 381 AKFSARL------------YKIIHQD--------------ALD--VDEWTIDFELRGAEQP--EQDAKMAEQRVRAMRLA 430 (602) Q Consensus 381 ~~ie~~l------------n~~Ll~~--------------~~~--~~~~~~~f~~~~~~~~--~~d~~~~~~~~~~~~~~ 430 (602) ..+.+++ -+..-.+ ... ..+....++...+... ........+ +..+... 
T Consensus 466 ~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~-l~~~~q~ 544 (651) T protein:vir:80 466 KHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIED-RLTFIQA 544 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHH-HHHHHHh Confidence 4444432 1111000 000 0122233333221111 111111111 1112221 Q ss_pred CcccH------------HHHHHHhCCCCCCCCccccccccccccccccccCCCcCccccccccc-ccccccccccc---c Q lcl|NC_021537. 431 GVGTV------------NEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAA-PPLENKIGERD---S 494 (602) Q Consensus 431 G~~T~------------NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~---~ 494 (602) +...+ -+..+..|++ +. +.++..... .......+......... .....+..+.. . T Consensus 545 ~~~~p~~~~~~~~~~~~~~l~~~~g~~---~~--~~~l~~~~q----~~~~~~~~~~~~q~~~~~~~a~~~~~~~~~~~~ 615 (651) T protein:vir:80 545 VAQVPEMGQLVDYKRILVDLLQHWGFE---EP--EAYLKQQDQ----QAPANPQEALLSQAKDVGGQAMSNMLQNQLQAD 615 (651) T ss_pred hccCCccchhhhHHHHHHHHHHHcCCC---Cc--HHhcCCCcc----chhhhhhHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 11111 1122233432 11 111111000 00000000000000000 00000000000 0 Q ss_pred ccccccccchhhhh-c-chhhhhhheecccccEEEE Q lcl|NC_021537. 495 VDVDVSKDPIEQTT-F-SSSNLDEGLYDFGERELYL 528 (602) Q Consensus 495 ~~~~~~~~~m~~~~-v-~ss~~~~~~yd~~~~~l~~ 528 (602) .......+.+.+.. . 
-...+.+.-=+-..+.|.- T Consensus 616 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 651 (651) T protein:vir:80 616 GGTQMMSEMYGTPNADQMQQELMATTPNVSEQQLTQ 651 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 00000000000000 0 0000000000000111111 No 262 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=54.46 E-value=0.5 Score=22.22 Aligned_cols=438 Identities=13% Similarity=0.037 Sum_probs=146.5 Q ss_pred CCCCcccc-cccc--hhhhcccCc-cccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC------ce-EEEEecCCCCc Q lcl|NC_021537. 1 MSKAEETT-QLDE--RHIATDVGR-GIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY------GF-EIVAHPSADEP 69 (602) Q Consensus 1 ~~k~~~~~-~~~~--~~~~~~~~~-~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~------~~-~i~~~~~~~~~ 69 (602) +.+---+. ..+. ..-....++ ...-++| ++...|++.+|..+.+. +| ++... +... T Consensus 33 ~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~d-----------stg~~a~~~LAs~l~~~ltpp~~~wF~l~~~--~~~~ 99 (549) T protein:vir:10 33 VIDYLMPRLDKFGQLPRPDSEKGRERSQKMFD-----------STAPLALRNFVAAMDSMITPATQLWHRLKTG--NDAL 99 (549) T ss_pred HHHHhccccccccccCCCCCCccccccccccc-----------chHHHHHHHHHHHHHhhccCCCCccccccCC--ccch Confidence 00000000 0000 000000000 1111232 23335666666555431 33 33221 1111 Q ss_pred ccchhhHHHHHHhhhccc-hhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccccc Q lcl|NC_021537. 70 DEGGESYQTVRDFWYGSD-SRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRKTT 148 (602) Q Consensus 70 ~~~~~~~~~~~~~~~~~~-~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~~~ 148 (602) .+. ..++..+..+. ..+..+. ...-+++.-+..+..|+.++||+.+++..+.. +.+.+..+|-..+-+..+. 
T Consensus 100 ~e~----~~v~~~l~~ve~~~~~~~~--~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~-~~~~f~~~pl~~~~v~~d~ 172 (549) T protein:vir:10 100 NEI----ASVKAYLQGVVRTLFAARY--RWQGGFVTQMGATYQSIGLFGPGALMIEHDVG-KGIVYRNVPMQRLWFAENN 172 (549) T ss_pred hhh----hHHHHHHHHHHHHHHHHHh--hhhcChHHHHHHHHHHHHhhcceeeEEeecCC-CeeEEEEEEcCeEEEeeCC Confidence 111 11222222111 1111110 11235666777788999999999999887653 3444444433444333333 Q ss_pred cccccccch--------hhhhc--------------ccCceeEEEEcCCcceeecccccccc---cceeeecc---cceE Q lcl|NC_021537. 149 TTIEREDGE--------EVENI--------------ESGHGYVQVRQGRRRYFGEAGDRYGD---DKRFVDKE---TGEV 200 (602) Q Consensus 149 ~~~~~~~~~--------~~~~~--------------~~~~~~~qi~~~~~~~~~~~~~~~~~---~~~~~~~~---~g~~ 200 (602) .+....-.+ ....+ .+....+.++. ..+|. ++...+.. ...+ T Consensus 173 ~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~----------~V~pr~~~~~~~~~~~~~pf~sv 242 (549) T protein:vir:10 173 SGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYH----------AVEPRADRDPRKLDGRNMQFASY 242 (549) T ss_pred CCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEE----------EeecCCCCCccccccccCceEEE Confidence 322111000 00000 00011111110 00000 00000000 0011 Q ss_pred EecCceeE-----EechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHH Q lcl|NC_021537. 201 ASDAGELK-----NGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDS 275 (602) Q Consensus 201 ~~~~~~~~-----~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~ 275 (602) +...+... .|.....+-+|.....+..||.||...++..+.......+.......-...|.+++.- ++.+.+.. 
T Consensus 243 ~~e~~~~~il~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~-~g~~~~~~ 321 (549) T protein:vir:10 243 WLDEGRDRIVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANE-DGVLDGFD 321 (549) T ss_pred EEEecCCEeeccCCcccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecc-ccccccce Confidence 11111111 1222334555666666779999999999999999888888887777777777776532 22222211 Q ss_pred HHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHhh Q lcl|NC_021537. 276 KEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLIN 354 (602) Q Consensus 276 ~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~lg 354 (602) + ..|+ ..+...+.++ ...++|+.... |.+ ..+..+.....|-.+|-+....+- T Consensus 322 ---l------~pgg------------~~~~~~~~~~---~~~~~pl~~~~--~~~~~~~~i~~~~~rI~~af~~d~~~~~ 375 (549) T protein:vir:10 322 ---L------RSGA------------LNWGGLNDKG---EEMVKPLLTGK--QAQIGIEFAQDTRQTINQWFYVTLFQIL 375 (549) T ss_pred ---e------ccCC------------ccccccCCCC---ccceeeecccc--chhHHHHHHHHHHHHHHHHHhhhhhhhh Confidence 0 1111 1111111111 12344443322 222 224466778899999987653321 Q ss_pred ccccCCccCHHHHHH--------------HHHHHHHHHHHHHHHHHHhh-hcCCccccc---cceEEEeccchhcchhHH Q lcl|NC_021537. 355 VTSTSNRANSKEQTR--------------EFAKGIIEPEQAKFSARLYK-IIHQDALDV---DEWTIDFELRGAEQPEQD 416 (602) Q Consensus 355 ~~~~~~~sn~e~~~~--------------~f~~~~l~P~~~~ie~~ln~-~Ll~~~~~~---~~~~~~f~~~~~~~~~~d 416 (602) .++..-++++... .+....|.|++.+.-..+.+ .++|+.... 
.+..++....+.+...++ T Consensus 376 --~~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~ 453 (549) T protein:vir:10 376 --VDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMR 453 (549) T ss_pred --cCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHH Confidence 1222234443211 12233455655543333332 344432111 233333333333322222 Q ss_pred HHHH---HHHHHHHHh-CCc-------ccHHHH----HHHhCCCCCCCCccccccccccccccccccCCCcCcccccccc Q lcl|NC_021537. 417 AKMA---EQRVRAMRL-AGV-------GTVNEA----REELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKA 481 (602) Q Consensus 417 ~~~~---~~~~~~~~~-~G~-------~T~NE~----R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (602) .... .+.++.... +++ +..+++ -+.+|.|+. .+.+.- + ...... T Consensus 454 ~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~------~irs~e------e---------v~~~r~ 512 (549) T protein:vir:10 454 AGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVE------AMSTDE------E---------LQAQQA 512 (549) T ss_pred HHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCcc------ccCCHH------H---------HHHHHH Confidence 1111 111111000 000 111111 112233220 000000 0 000000 Q ss_pred cccccccccccccccccccccchhhhhcchhhhhhheecccccE Q lcl|NC_021537. 482 APPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFGERE 525 (602) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~~~ 525 (602) .- ...+..+...+........ .... 
|....+ +...++ T Consensus 513 ~~-~~qqq~~~~~~~a~~a~~~--a~~~-~~~~ta---~~~~~~ 549 (549) T protein:vir:10 513 AE-AQAAQMQQMLAAAPVAAGA--IKDL-SDAQTA---AQTARV 549 (549) T ss_pred HH-HHHHHHHHHHHHHHHHHHH--HHhh-hhhcCC---CcccCC Confidence 00 0000000000000000000 0000 000000 000111 No 263 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=51.98 E-value=0.57 Score=21.94 Aligned_cols=435 Identities=10% Similarity=0.008 Sum_probs=143.7 Q ss_pred CCCCccccccc----chhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC------c-eEEEEecCC--C Q lcl|NC_021537. 1 MSKAEETTQLD----ERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY------G-FEIVAHPSA--D 67 (602) Q Consensus 1 ~~k~~~~~~~~----~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~------~-~~i~~~~~~--~ 67 (602) .....+-.... ....+........-++| ++...|++.+|..+.+. + |++...+.. . T Consensus 20 e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~d-----------stg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~l~~ 88 (542) T protein:vir:78 20 LDMARRCAALTLPYLLTEDGHASGGRLQQPYQ-----------SLGSKGVNALSSKLMLSLFPIQTSFFKLQINDAEIAS 88 (542) T ss_pred HHHHHHHHHHhccccCCCCCCccccccccccc-----------chHHHHHHHHHHHHHHhhcCCCCccccccCCHHHHHh Confidence 00000000000 00000000111112232 22335666666665532 2 333322111 0 Q ss_pred CcccchhhHHHHHHhhhcc-chhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeCcccccccc Q lcl|NC_021537. 68 EPDEGGESYQTVRDFWYGS-DSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVPAATVRVRK 146 (602) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~-~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~p~~v~~~~ 146 (602) ....+.+....++..+..+ ...+..+. ..+++.-+..+..|+.++|||.+++..+ +...+||..-.|.... 
T Consensus 89 ~~~~~~~~~~~v~~~L~~ve~~~~~~l~----~snf~~~~~~~~~~L~~~G~a~l~~~~~----~~~~~pl~~y~v~~d~ 160 (542) T protein:vir:78 89 VPELTPEVRSEIDMNLSKMEKMVMQQIA----ESSDRVQLTAAMKHLIVTGNVLVFAGKK----TLKVYPLDRYVIERDG 160 (542) T ss_pred hccCChhhHHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhhCeEEEEecCC----CceEEecceeEEeeCC Confidence 0011111112222222211 11112222 3356777778889999999999887543 3556666543443322 Q ss_pred cccccccccchh-----h-hhc---------------ccCcee--EEEEcCCc-ceeecccccccccceeeecccceEEe Q lcl|NC_021537. 147 TTTTIEREDGEE-----V-ENI---------------ESGHGY--VQVRQGRR-RYFGEAGDRYGDDKRFVDKETGEVAS 202 (602) Q Consensus 147 ~~~~~~~~~~~~-----~-~~~---------------~~~~~~--~qi~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~ 202 (602) ............ + ..+ .....| ++.+..+. ...+.... .+.....++. T Consensus 161 ~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~--------~~~~~~s~~~ 232 (542) T protein:vir:78 161 DGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCK--------LVDGQHRWHQ 232 (542) T ss_pred CCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccc--------cCCCeEEEEE Confidence 211111100000 0 000 000000 00000000 00000000 0000011111 Q ss_pred c-Cce-------eEEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHH Q lcl|NC_021537. 203 D-AGE-------LKNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSED 274 (602) Q Consensus 203 ~-~~~-------~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~ 274 (602) . .|. ...|.....+-.|.....+..||.||...++..+.......+.......-...|..++. +++..... 
T Consensus 233 e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~-~~g~~~~~ 311 (542) T protein:vir:78 233 ECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVS-PSATTKPQ 311 (542) T ss_pred EeccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec-cccccchh Confidence 0 111 11222334455666666778999999999999999988888888887777777776553 22222222 Q ss_pred HHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHH-HHHHHHhhHHHHHHHhcCChHHh Q lcl|NC_021537. 275 SKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDME-FQAFRERNEHEIAKVHGVPPVLI 353 (602) Q Consensus 275 ~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~q-f~e~~~~~~~~Ia~~fgVPp~~l 353 (602) ... .++.+. + +.+ ...++...++.... |.+ ..+..+.....|-.+|-+- T Consensus 312 ~~~---------~~~~g~--i--v~g-----------~~~~v~~~~~~~~~--~~~~~~~~i~~~~~rI~~aFl~~---- 361 (542) T protein:vir:78 312 SLA---------RAGTGA--I--IQG-----------RAEDVSVVQANKGA--DFRTVQEMIRDLSQRISDAFLIL---- 361 (542) T ss_pred hcc---------cCCCce--e--ecC-----------Cccceeeeeccccc--chhHHHHHHHHHHHHHHHHhccc---- Confidence 110 111110 1 110 01112222222111 222 2345566777888888432 Q ss_pred hccccCCccCHHHHHH--------------HHHHHHHHHHHHHHHHHH-hhhcCCccccccceEEEeccchhcchh---H Q lcl|NC_021537. 354 NVTSTSNRANSKEQTR--------------EFAKGIIEPEQAKFSARL-YKIIHQDALDVDEWTIDFELRGAEQPE---Q 415 (602) Q Consensus 354 g~~~~~~~sn~e~~~~--------------~f~~~~l~P~~~~ie~~l-n~~Ll~~~~~~~~~~~~f~~~~~~~~~---~ 415 (602) ...++..-++++... .+....|.|++.+.-..+ ...++++.-. ..+.+++ ...+... . 
T Consensus 362 -~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~-~lv~~~~--~s~La~~~r~~ 437 (542) T protein:vir:78 362 -NVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPK-GLVMPTV--VAGLGGVGRGE 437 (542) T ss_pred -ccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCch-hceeeee--echHHHHHHHH Confidence 112233334443221 122333445444332222 2334444322 1233443 2222111 1 Q ss_pred HHHHHHHHHHHHHhC-C------cccHHHH----HHHhCCCCC---CCCcc-ccccc--------cccccccccccCCCc Q lcl|NC_021537. 416 DAKMAEQRVRAMRLA-G------VGTVNEA----REELDLAPF---EDDRG-DMTLS--------EFEAEFGADASDGDA 472 (602) Q Consensus 416 d~~~~~~~~~~~~~~-G------~~T~NE~----R~~~Gl~p~---~~g~~-d~~~~--------~~~~~~~~~~~~~~~ 472 (602) +......+++..-.. | .+..+++ .+.+|.|+. ...+. ..... .....-++..+.... T Consensus 438 ~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~~~~ 517 (542) T protein:vir:78 438 DRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAKSPI 517 (542) T ss_pred HHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 222222222221110 0 0112222 223454431 11100 00000 000000000000000 Q ss_pred CcccccccccccccccccccccccccccccchhhhhcchhhhhhheecccccEE Q lcl|NC_021537. 473 EAMLTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSSSNLDEGLYDFGEREL 526 (602) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~ss~~~~~~yd~~~~~l 526 (602) .+....+. .++.+.+... 
|+ .|-| | T Consensus 518 ~~~~~~~~-~a~~~~~~~~----------------~~-------~~~~-----~ 542 (542) T protein:vir:78 518 GEKMMQQI-NAPGQEAPAG----------------PQ-------TGED-----L 542 (542) T ss_pred ccchhhhc-CCCCcCCCCC----------------Cc-------cccc-----C Confidence 00000000 0000000000 00 0000 0 No 264 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=50.93 E-value=0.6 Score=21.82 Aligned_cols=417 Identities=11% Similarity=0.026 Sum_probs=140.2 Q ss_pred CCCCcccccc-----cc-hhhhcccCccccCC--------CCHHHHH-HHHh--hhHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_021537. 1 MSKAEETTQL-----DE-RHIATDVGRGIQPP--------YNPETLA-AFQE--LNETHQACIRKKSRYEAGYGFEIVAH 63 (602) Q Consensus 1 ~~k~~~~~~~-----~~-~~~~~~~~~~i~p~--------~~~~~l~-~~~~--~~~~v~~cI~~ia~~ia~~~~~i~~~ 63 (602) ..+....=++ .+ +.+.. .|..+-|. .+-.... ++.+ -.++.+..++.+...+..-+..+. T Consensus 11 y~~~~~~W~~ird~~~G~~~~r~-~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~vf~k~p~~~-- 87 (501) T protein:vir:95 11 LGKLLPLYYLIRDAIAGEPTVKG-ARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQVFMRDPVVK-- 87 (501) T ss_pred HHHHHHHHHHHHHHhcChHHHHh-cccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhhhhcCCccee-- Confidence 0000000000 00 00000 00011111 1111111 1111 123444445444444443332220 Q ss_pred cCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCc----e-------- Q lcl|NC_021537. 64 PSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGT----P-------- 131 (602) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~----~-------- 131 (602) .-..+..++...+. ...+..+|.+.+....+.+|-+++.+-....+. . 
T Consensus 88 -----------~p~~l~~l~~d~D~---------~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~ 147 (501) T protein:vir:95 88 -----------VPALLNPLVANATG---------SGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGR 147 (501) T ss_pred -----------CcHHHHHHHhccCC---------CCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhcc Confidence 11223333333332 245789999999999999999999997643321 0 Q ss_pred --EEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecc-cc---------- Q lcl|NC_021537. 132 --VGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKE-TG---------- 198 (602) Q Consensus 132 --~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~-~g---------- 198 (602) -.|..+.|..|- .+.... ..+...-........++..++. |+......++++... .| T Consensus 148 ~rPy~~~~~~~~Ii--nW~~~~--v~g~~~l~~v~l~E~~~~~d~~------f~~~~~~q~RvL~~~~~g~~~~~v~r~~ 217 (501) T protein:vir:95 148 IRPTLYVYSPTEII--NWRTTD--RGAEEVLSLVVLFETWCAADDG------FEMKTSGQFRVLRLDEEGYYVHEIWREP 217 (501) T ss_pred CCcEEEEecHhhhc--Ccceec--cCCceeeeEEEEEEEEeecCCC------cccceeEEEEEEeeCCCceEEEEEEEec Confidence 124455554441 000000 0000000000000001111110 000000000010000 00 Q ss_pred -------------------eEEe--cCcee-EEechhHEEEecCCCCCCCcccccHHHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_021537. 199 -------------------EVAS--DAGEL-KNGPANELIFLPNPSPLALYYGVPDWVAAMQ-TMGADQAAKEWNHDVFD 255 (602) Q Consensus 199 -------------------~~~~--~~~~~-~~~~~~eviH~r~~~~~~~~~G~spl~~~~~-~i~~~~~~~~~~~~~f~ 255 (602) .+.. .++.. ..+| ++.+ .....+...|.||+..+.. .+...+....+. ..+. T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IP---fv~~-~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~-~~l~ 292 (501) T protein:vir:95 218 QPTKADGSKIPKGNYQQYVVYKPTDAQGKRLTEIP---FMFI-GSENNDSNPDNPNFYDLASLNMAHYRNSADYE-ESCY 292 (501) T ss_pred CCcccCcceecCCcccccceeeeeccCCCcCCeee---EEEE-ecCCCCCCCCccchHHHHHHHHHHHhhhhHHH-HHHH Confidence 0111 11111 1222 2222 1223344577888887764 333333333333 3445 Q ss_pred hcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHH Q lcl|NC_021537. 
256 NLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFR 335 (602) Q Consensus 256 ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~ 335 (602) ..+.|..+++ | ++++..+.... ....=+.++ .+.++.|.+ .+|...+.... . .+.+ T Consensus 293 ~~~~P~l~i~--G--~~~~~~~~~~~--~~i~~G~~~--~~~lP~~~~------------~~~ie~~~~~i-~---~~~l 348 (501) T protein:vir:95 293 IVGQPTPVLI--G--LTEEWVTNVLK--GSVNFGSRG--GIPLPVGAD------------AKLLQASENTM-L---KEAM 348 (501) T ss_pred Hcccceeeee--C--CcccccccCCC--Cceeecccc--cccCCCCCc------------eeEEecChhhH-H---HHHH Confidence 5667777664 2 12221110000 000001111 122333322 22322221111 1 2233 Q ss_pred HhhHHHHHHHhcCChHHhhccccCCccCHHHHHHHH--HHHHHHHHHHHHHHHHhhhcCC--cccc--ccceEEEeccch Q lcl|NC_021537. 336 ERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTREF--AKGIIEPEQAKFSARLYKIIHQ--DALD--VDEWTIDFELRG 409 (602) Q Consensus 336 ~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f--~~~~l~P~~~~ie~~ln~~Ll~--~~~~--~~~~~~~f~~~~ 409 (602) +...+++..+ | ..++... ..+ .+.++....+ ....|.-++..+++++++.|-- .... ..+..|+.+.+ T Consensus 349 ~~l~~~m~~~-G--a~ll~~~-~~~-~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~~~~~~~v~i~~d- 422 (501) T protein:vir:95 349 DTKERQMVAL-G--AKLVEQK-EVQ-RTATEAELEAASEGSTLSSATKNVSAAFEWALKWAARWVGQADSGVKFELNTD- 422 (501) T ss_pred HHHHHHHHHH-H--HhhccCC-ccc-hhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecc- Confidence 3333444332 3 1223211 112 2233322222 2345677777788877765431 1111 12344544432 Q ss_pred hcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccc---cccccccccccccccCCCcCccccccccccccc Q lcl|NC_021537. 410 AEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGD---MTLSEFEAEFGADASDGDAEAMLTRSKAAPPLE 486 (602) Q Consensus 410 ~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (602) +.....+... .+++.+++.+|.++....++.+-.--+.+.+.+ ..+-.... ..+........+.+....+. 
T Consensus 423 f~~~~~~~~~-~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~--~~~~~~~~~~~~~~~~gg~~--- 496 (501) T protein:vir:95 423 FDIARMTPDE-RRSLVEEWQKGAITFEEMRTGLRKAGVATEDDSKAKEKIAKDTA--EAMALATPANVPGDGSGGDN--- 496 (501) T ss_pred cccccCCHHH-HHHHHHHHhCCCCcHHHHHHHHHhCCCCChhHHHHHHHHHhhhc--CcccccccCCCCCCCccccc--- Confidence 1111223332 355678899999999999876633222221111 00000000 00000000000000000000 Q ss_pred ccccc Q lcl|NC_021537. 487 NKIGE 491 (602) Q Consensus 487 ~~~~~ 491 (602) -..++ T Consensus 497 ~~~~~ 501 (501) T protein:vir:95 497 VGNSE 501 (501) T ss_pred ccCCC Confidence 00011 No 265 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=44.83 E-value=0.79 Score=21.14 Aligned_cols=427 Identities=12% Similarity=0.060 Sum_probs=142.4 Q ss_pred CCCCcccccc-----cc-hhhhcccCccccCCCC-----------HHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_021537. 1 MSKAEETTQL-----DE-RHIATDVGRGIQPPYN-----------PETLAAFQELNETHQACIRKKSRYEAGYGFEIVAH 63 (602) Q Consensus 1 ~~k~~~~~~~-----~~-~~~~~~~~~~i~p~~~-----------~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~ 63 (602) ..+....=++ .+ +.+.. .+..+-|.++ ......-+.-.++.+..++.++..+...+..+. T Consensus 42 y~a~~~~W~~ird~~~G~~~~r~-~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~-- 118 (535) T protein:vir:80 42 FGEMLPKWRKIMDCLSGQEAIKA-KREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPIRQ-- 118 (535) T ss_pred HHHHHHHHHHHHHHhcChHHHHh-cccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCccee-- Confidence 0000000000 00 00000 0111112111 111111122234555556555554444332210 Q ss_pred cCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCce------------ Q lcl|NC_021537. 64 PSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTP------------ 131 (602) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~------------ 131 (602) .-..+..++...+. 
...+..+|.+.+....+.+|-+++.+.....|.. T Consensus 119 -----------~p~~l~~l~~d~D~---------~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~r 178 (535) T protein:vir:80 119 -----------LPPALEAIVEDIDG---------EGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYR 178 (535) T ss_pred -----------ccHHHHHHHhccCC---------CCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCC Confidence 11233334443332 2457899999999999999999999987655432 Q ss_pred EEEEEeCcccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeec-ccc------------ Q lcl|NC_021537. 132 VGLAHVPAATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDK-ETG------------ 198 (602) Q Consensus 132 ~~L~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~-~~g------------ 198 (602) --|..+.|..|- .+.... ..|...-.+......++..++. |+......++++.+ .+| T Consensus 179 Py~~~y~ae~Ii--nW~~~~--v~G~~~Lt~v~lrE~~~~~dd~------f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~ 248 (535) T protein:vir:80 179 PTITLVHPTSII--NWRTKL--VGGKSVISLVVIQENVLAQDDG------FETTYVQQWRVLQLNAEGNYQVERWRRETQ 248 (535) T ss_pred cEEEEechhhcc--Cccccc--cCCccceeEEEEEEEEEecCCC------cccceeEEEEEEEecCCceEEEEEEEeecC Confidence 224444554441 111000 0000000000001111111110 01111111111111 000 Q ss_pred --------eEEecCceeEEechhHEEEecCCCCCCCcccccHHHHHHH-HHHHHHHHHHHHHHHHHhcCCCceEEEeccc Q lcl|NC_021537. 199 --------EVASDAGELKNGPANELIFLPNPSPLALYYGVPDWVAAMQ-TMGADQAAKEWNHDVFDNLGIPHYAVKVTGG 269 (602) Q Consensus 199 --------~~~~~~~~~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~-~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 269 (602) .++........+..=.++.+. ....+...|.||+..+.. .+...+....+. 
..+...+.|..+++ | T Consensus 249 ~~~~~~~~~~~~~~~g~~~l~~IPfv~~~-~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~-~il~~~~~P~l~i~--G- 323 (535) T protein:vir:80 249 EEMYYSYSKHVPTDGNGNPFKEIPFQFIG-PLDNNADIDHPPLLDLCEVNIGHYRNSADYE-EMAFVAGQPTAFFT--G- 323 (535) T ss_pred CccccccceeecccCCCcccCeeEEEEee-cCCCCCCCCccchHHHHHHHHHHhhchhHHH-HHHHHhcCceeeee--c- Confidence 011100000111111122221 233455688889887765 344434333333 34455567766665 2 Q ss_pred cCCHHHHHHHHHHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCC Q lcl|NC_021537. 270 TLSEDSKEDLRNLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVP 349 (602) Q Consensus 270 ~~~~~~~~~l~~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVP 349 (602) ++++..+...+ -....-+.++ .+.++.+.+..-. ++++-+ -.. +.++...+++++ .|. T Consensus 324 -~~~~~~~~~~~-~~~i~iG~~~--~~~lP~~~~~~~~---------e~~~~~----~a~---~~l~~~e~qM~~-lGa- 381 (535) T protein:vir:80 324 -LTKDWVEDVFK-DFKVHLGSRA--IIPLPQGATAGIL---------QITPNS----VPF---EAMTHKESQMIA-MGA- 381 (535) T ss_pred -CchhhhhcCCC-CcceEecCcc--cccCCCCCCccee---------eeccch----hHH---HHHHHHHHHHHH-HHH- Confidence 11111000000 0001111111 1223333222111 111111 112 222333333333 221 Q ss_pred hHHhhccccCCccCHHHHHHHH--HHHHHHHHHHHHHHHHhhhcCCcc--cc----ccceEEEeccchhcchhHHHHHHH Q lcl|NC_021537. 350 PVLINVTSTSNRANSKEQTREF--AKGIIEPEQAKFSARLYKIIHQDA--LD----VDEWTIDFELRGAEQPEQDAKMAE 421 (602) Q Consensus 350 p~~lg~~~~~~~sn~e~~~~~f--~~~~l~P~~~~ie~~ln~~Ll~~~--~~----~~~~~~~f~~~~~~~~~~d~~~~~ 421 (602) .++. ...++. ++++....+ ....|.-++..+++++++.|---. .. ...+.|+.+.+=. ....+.+. . 
T Consensus 382 -~ll~-~~~~~~-Ta~~a~~~~~~~~S~L~~~a~~le~al~~aL~~~A~w~G~~~~~~~~~i~~n~dF~-~~~ld~~~-~ 456 (535) T protein:vir:80 382 -NLLV-KSGGNR-TFGEAQQEEASEQSILSACTKNVSMAFRKALRWANQFQTGIVNDETVEYNLNTDFP-AARLTPNE-R 456 (535) T ss_pred -Hhhc-cCcccc-cHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCceEEEeccccc-cccCCHHH-H Confidence 1121 112222 222222222 223566677777777766543211 11 1123344332211 11113332 3 Q ss_pred HHHHHHHhCCcccHHHHHHHhCCCCCCCCc---ccc--cccc---ccccccccccCCCcCcccccccccccccccccccc Q lcl|NC_021537. 422 QRVRAMRLAGVGTVNEAREELDLAPFEDDR---GDM--TLSE---FEAEFGADASDGDAEAMLTRSKAAPPLENKIGERD 493 (602) Q Consensus 422 ~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~---~d~--~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (602) +++.+++++|.|+....++.+..--+-+++ .+. .+.. .....++.......+++++..... .+...++.. T Consensus 457 ~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~--~~~~~~~~~ 534 (535) T protein:vir:80 457 AELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNN--GNGGGNQAG 534 (535) T ss_pred HHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCcccC--CccccccCC Confidence 445678899999998887766322121111 110 0100 000011111111111111111111 111111111 Q ss_pred c Q lcl|NC_021537. 494 S 494 (602) Q Consensus 494 ~ 494 (602) . T Consensus 535 ~ 535 (535) T protein:vir:80 535 N 535 (535) T ss_pred C Confidence 1 No 266 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=42.42 E-value=0.89 Score=20.87 Aligned_cols=456 Identities=13% Similarity=0.065 Sum_probs=171.8 Q ss_pred CCCCcccccccchh-hhcccCcc-----ccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchh Q lcl|NC_021537. 1 MSKAEETTQLDERH-IATDVGRG-----IQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGE 74 (602) Q Consensus 1 ~~k~~~~~~~~~~~-~~~~~~~~-----i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~ 74 (602) |.-.+..+--..+- .....+.. |.+.-| -+.+ ..|..+..|++.+. 
+-+-|.++.--...+. .. T Consensus 23 V~~~D~~RlaaY~ly~d~y~n~~~el~~il~G~d---r~~~--~~ps~r~~V~~~~~-~Lg~~~~~~Ve~~~~d----e~ 92 (563) T protein:vir:74 23 VDENDKNRVRAYDLYENIYLNSAETLKLVLRGDD---SVPI--LMPSGRKIVEAVHR-FLGVGFDYLVEPDMGD----EG 92 (563) T ss_pred CCHHHHHHHHHHHHHHHhhcCchhhhhhhcCCCc---eeee--ccchHHHHHHHHHH-hcCCCcEEecCccccC----cc Confidence 44333221100010 01111111 111111 0111 23455667777554 4466666532211111 11 Q ss_pred hHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC---CCceEEEEEeCccccccccccccc Q lcl|NC_021537. 75 SYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG---DGTPVGLAHVPAATVRVRKTTTTI 151 (602) Q Consensus 75 ~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~---~G~~~~L~~l~p~~v~~~~~~~~~ 151 (602) ..+.+.. .+...-....+.....+..++-++.|.+.+.+.+|. .|.-+++..+||.++.+..+.... T Consensus 93 ~~~avq~----------~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~rv~~vDP~~~fp~~dpd~v 162 (563) T protein:vir:74 93 IRQSLNA----------YFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERISVDEVDPRQIFLIEDGSTV 162 (563) T ss_pred hHHHHHH----------HHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCceEeecCCceeeeccCCCCc Confidence 1112211 111122234556667777788889999999999984 566778899999988765444332 Q ss_pred ccccc-hhhhhcc---cCceeEEEEcCCcceeecccc------cccccce-----eeeccc-----------ceEEe-cC Q lcl|NC_021537. 152 EREDG-EEVENIE---SGHGYVQVRQGRRRYFGEAGD------RYGDDKR-----FVDKET-----------GEVAS-DA 204 (602) Q Consensus 152 ~~~~~-~~~~~~~---~~~~~~qi~~~~~~~~~~~~~------~~~~~~~-----~~~~~~-----------g~~~~-~~ 204 (602) .-... .....++ +...++ -+...|.++.++ .+-.+.. ..+..+ +.+.. 
.+ T Consensus 163 ~g~~~v~v~~~~~~pdd~~~~~---~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~~~~~~~~~~~~~~~~~d 239 (563) T protein:vir:74 163 VGFHMVDIVQDFRSPDDPSKKL---ARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAISDEQARRKEQVRSAQHD 239 (563) T ss_pred ccceeeecccCCCCCcchhccc---eeeeeeeeeeCCCCCccceeeeccchhccccccccCccchhhhcccchhhhhhhh Confidence 11100 0000000 000000 001111111100 0000000 000000 10000 01 Q ss_pred ceeE----EechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHH Q lcl|NC_021537. 205 GELK----NGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLR 280 (602) Q Consensus 205 ~~~~----~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~ 280 (602) ++.. .+.-=.|.|+++..+.+..+|.|-++-++..+.....+..-......-.+.|-.++....+ .+....... T Consensus 240 ~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p--~d~~~g~~~ 317 (563) T protein:vir:74 240 EEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNASAP--VDPNTGELT 317 (563) T ss_pred chhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEeccccc--ccccccccc Confidence 1111 1111237888888888999999999999888877766666666666667777666542211 110110000 Q ss_pred HHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHH-HHHHHhcCChHHhhccccC Q lcl|NC_021537. 281 NLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEH-EIAKVHGVPPVLINVTSTS 359 (602) Q Consensus 281 ~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~-~Ia~~fgVPp~~lg~~~~~ 359 (602) ...+.+|.-+.-..+. . .-.+..++.+.+ =..+..-+++... 
.|+..=++|..-+|..+.+ T Consensus 318 --------------~w~vgpG~i~El~~~~-~--~g~l~~v~g~~~-l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~ 379 (563) T protein:vir:74 318 --------------DWNIGPMQIVEIAGNR-N--DNYFERVSGVQD-VSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVT 379 (563) T ss_pred --------------ccccCCceeEeccCCc-c--ccceeeecchhh-hHHHHHHHHHHHHHHHHhhccCcceeecccccc Confidence 0111122111111100 0 001222222111 0111222233333 5667778999888844433 Q ss_pred C-ccCHHHHHHHHHHHHHHHHHH--------------HHHHHHhhhcCCcccc---------------ccc-eEEEeccc Q lcl|NC_021537. 360 N-RANSKEQTREFAKGIIEPEQA--------------KFSARLYKIIHQDALD---------------VDE-WTIDFELR 408 (602) Q Consensus 360 ~-~sn~e~~~~~f~~~~l~P~~~--------------~ie~~ln~~Ll~~~~~---------------~~~-~~~~f~~~ 408 (602) + -|.+. +.-.|+|+.. .+-..+...||+--++ ..+ ..+...+. T Consensus 380 ~~~SGiA------LeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~ 453 (563) T protein:vir:74 380 SAESGIS------LELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFA 453 (563) T ss_pred cccchhh------hhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeC Confidence 2 12211 1122333333 1111111111111000 000 11222333 Q ss_pred hhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCCCCccc-------------cccccccccccccccCCCcCcc Q lcl|NC_021537. 409 GAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFEDDRGD-------------MTLSEFEAEFGADASDGDAEAM 475 (602) Q Consensus 409 ~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~~g~~d-------------~~~~~~~~~~~~~~~~~~~~~~ 475 (602) +.+-. |.....+-+.+++.+|+++..-+-++|+--..+.++.+ +++.+.........+....++. T Consensus 454 p~~P~--d~~~vv~~~~tl~~aGiiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~ 531 (563) T protein:vir:74 454 DPMPV--NKTQVTQDTLLLQQAHLILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGA 531 (563) T ss_pred CCCCc--cHHHHHHHHHHHHHcCchhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCC Confidence 33333 33334455678999999999999777732222222211 1111111111111111111111 Q ss_pred cccccccccccccccccccccccccccchhhhhcch Q lcl|NC_021537. 
476 LTRSKAAPPLENKIGERDSVDVDVSKDPIEQTTFSS 511 (602) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~v~s 511 (602) ..++.++. ......-..-.. -.+...+.+.+- T Consensus 532 ~~~~~dd~---g~p~~~~~~~~~-~~~~~~~~~~~~ 563 (563) T protein:vir:74 532 GEQQFDDQ---GNPIDQFGNPVE-IPPDVTQVPLSP 563 (563) T ss_pred Cccccccc---CCchhHcCCccc-CCccccccCCCC Confidence 11111111 000000001111 111111111111 No 267 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=37.50 E-value=1.1 Score=20.32 Aligned_cols=399 Identities=11% Similarity=0.064 Sum_probs=147.8 Q ss_pred CC---------CC-ccccccc----c-hhhhcccCccccCCCCH---H----HHHHHHhhhHHHHHHHHHHHHhhccCce Q lcl|NC_021537. 1 MS---------KA-EETTQLD----E-RHIATDVGRGIQPPYNP---E----TLAAFQELNETHQACIRKKSRYEAGYGF 58 (602) Q Consensus 1 ~~---------k~-~~~~~~~----~-~~~~~~~~~~i~p~~~~---~----~l~~~~~~~~~v~~cI~~ia~~ia~~~~ 58 (602) |- +. +.=+... + +.+. ..+.-.-|++.. . .|.+ +.-.++.+..++.++..+..-+. T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r-~~g~~YLpk~~~E~~~~Y~~rl~r-A~~~n~~~~t~~~~~G~vf~k~p 78 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVK-KKGVRFLPKLSGQTDDMYNAYKQR-ALFYSITSKTLSALSGMVLDQPP 78 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHH-cCCcccCCCCCCCCHHHHHHHHhh-ccCCchHHHHHHHHhchhhcCCc Confidence 00 00 0000000 0 1111 011111222221 1 1111 22245667777777766666555 Q ss_pred EEEEecCCCCcccchhhHHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCceEEEEEeC Q lcl|NC_021537. 59 EIVAHPSADEPDEGGESYQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGTPVGLAHVP 138 (602) Q Consensus 59 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~~~~L~~l~ 138 (602) .+. . ....+.+ .. 
-....+..+|.+++....+.+|-+++.+-+...|.---|..++ T Consensus 79 ~~~------~----p~~l~~~-------------~~-D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~ 134 (452) T protein:vir:94 79 VIT------H----PDAMSKY-------------FE-DQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYT 134 (452) T ss_pred eec------c----cHHHHHH-------------Hh-cccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEec Confidence 431 0 0111111 00 1235688999999999999999999999988777533456666 Q ss_pred cccccccccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeecccc--eE---EecCce------- Q lcl|NC_021537. 139 AATVRVRKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETG--EV---ASDAGE------- 206 (602) Q Consensus 139 p~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~---~~~~~~------- 206 (602) |..|- + +.... .+... +.......++.++... |+.-....+++.....| .+ ...++. T Consensus 135 ~~~Ii-~-W~~~~---~g~l~--~v~lre~~~~~d~~d~----f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~ 203 (452) T protein:vir:94 135 TENIL-N-WEEDE---DGRLL--MVVLREFYTVRDTADR----YVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKT 203 (452) T ss_pred hhhhc-C-ccccc---cCCee--EEEEEEEEEEecCCCc----ccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccc Confidence 66653 1 11100 01000 0000111111111110 00000001111100011 00 001111 Q ss_pred ---------eEEechhHEEEecCCCCCCCcccccHHHHHHH-HHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHH Q lcl|NC_021537. 207 ---------LKNGPANELIFLPNPSPLALYYGVPDWVAAMQ-TMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSK 276 (602) Q Consensus 207 ---------~~~~~~~eviH~r~~~~~~~~~G~spl~~~~~-~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~ 276 (602) ...+|. +.+ .....+...|.||+..+.. .+...+....+. ..+...+.|-.+++ +. 
+++ T Consensus 204 ~~~~~~~~~l~~IP~---v~~-~~~~~~~~~~~pPLl~LA~ln~~hy~~~sd~~-~~l~~~~~P~l~~~--g~--~~~-- 272 (452) T protein:vir:94 204 STIQNVGVTMDYIPF---FCI-TPSGLSMTPAKPPMIDIVDINYSHYRTSADLE-HGRHFTGLPTPWIT--GA--ESQ-- 272 (452) T ss_pred eeecCCCcccceeEE---EEE-cCCCCCCCCCccchHHHHHHHHHHhcchhHHH-HHHHHcccceeEee--cC--cCC-- Confidence 111221 111 1222345688999887765 344444444433 44555667766654 21 110 Q ss_pred HHHHHHHHHhhcccccCcceeccC-CccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhc Q lcl|NC_021537. 277 EDLRNLMDNLKGSRYRTAILEVEE-FVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINV 355 (602) Q Consensus 277 ~~l~~~~~~~~g~~nag~~~~~~~-g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~ 355 (602) +.-..|..+ .+.++. |.+. .|...+- +...+ ..+.++...+++ ...|. .++-. T Consensus 273 ------~~i~iG~~~---~~~lpe~~~~~------------~yie~~g-~~i~~-~~~~l~~le~~m-~~~Ga--~ll~~ 326 (452) T protein:vir:94 273 ------STMHIGSTK---AWVIPEVAAKV------------GFLEFTG-QGLQS-LEKALSEKQAQL-ASLSA--RLIDN 326 (452) T ss_pred ------CceEecccc---cccCCCCCCcc------------eEEccCc-hhHHH-HHHHHHHHHHHH-HHHHH--Hhhcc Confidence 011122222 223331 3222 2222211 11111 112222222222 11121 22211 Q ss_pred cccCCccCHHHHHHHH--HHHHHHHHHHHHHHHHhhhcCC--cc-ccccceEEEeccchhcchhHHHHHHHHHHHHHHhC Q lcl|NC_021537. 356 TSTSNRANSKEQTREF--AKGIIEPEQAKFSARLYKIIHQ--DA-LDVDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLA 430 (602) Q Consensus 356 ~~~~~~sn~e~~~~~f--~~~~l~P~~~~ie~~ln~~Ll~--~~-~~~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~ 430 (602) ...+ ..+.++....+ .+..|.-+...+++++++.|-- .. ....+..|+.+.+=.. ...+.. 
..+++.+++.+ T Consensus 327 ~~~~-~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w~g~~~~~~v~~n~dF~~-~~~~~~-~~~al~~~~~~ 403 (452) T protein:vir:94 327 STRG-SEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDMESMGGTLNIKLNSAFLD-SKLTAA-ELKAWVEAYLS 403 (452) T ss_pred CCCc-chHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCceEEEecccccc-ccCCHH-HHHHHHHHHhc Confidence 1111 11223322222 2356677777777777654321 11 1112334443332111 111233 23455678999 Q ss_pred CcccHHHHHHHh---CCCCCCCCccccccccccccccccccCCCcCcccccccccccccc Q lcl|NC_021537. 431 GVGTVNEAREEL---DLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLEN 487 (602) Q Consensus 431 G~~T~NE~R~~~---Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (602) |.++....++.+ |....+... +..... ....+..+ .+++.++... . T Consensus 404 G~is~~t~~~~L~~~gvl~~~~e~-~~i~~E-~~~~~~~~----~~~~~~~~~~-----~ 452 (452) T protein:vir:94 404 GGISKEIYIHALKVGKVLPPPGES-MGVIPD-PPAPEPSP----SNTPPNPSSK-----A 452 (452) T ss_pred CCCcHHHHHHHHHhCCCCCCccCH-HHHHHH-hhccCccc----CCCCCCCccC-----C Confidence 999998888877 553322221 111110 00001000 1111111000 0 No 268 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=37.05 E-value=1.1 Score=20.27 Aligned_cols=454 Identities=11% Similarity=0.011 Sum_probs=145.8 Q ss_pred CCCCcccccccc---------------hhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccC------ceE Q lcl|NC_021537. 1 MSKAEETTQLDE---------------RHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGY------GFE 59 (602) Q Consensus 1 ~~k~~~~~~~~~---------------~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~------~~~ 59 (602) ..+..+.++..+ ...+.........++| ++...|++.+|..+.+. 
.|+ T Consensus 19 ~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~d-----------st~~~a~~~Laa~l~~~ltP~~~WF~ 87 (535) T protein:vir:94 19 YDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQ-----------AVGARGLNNLASKLMLALFPMQTWMK 87 (535) T ss_pred HHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCccc-----------ccHHHHHHHHHHHHHhhhcCCCCccc Confidence 000000000000 0000000001111222 23345666666655542 223 Q ss_pred EEEecCCC-CcccchhhHHHHHHhhhccc-hhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCC-CCceEEEEE Q lcl|NC_021537. 60 IVAHPSAD-EPDEGGESYQTVRDFWYGSD-SRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEG-DGTPVGLAH 136 (602) Q Consensus 60 i~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~-~G~~~~L~~ 136 (602) +...+... .-.........++..+..+. ..+..+ ...+++.-+..+..|+.++||+.+++..+. .+.....+| T Consensus 88 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~----~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~p 163 (535) T protein:vir:94 88 LTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYI----ESNSYRVTLFETLKQLVVAGNALLYIPEPEGTYNPMKLYR 163 (535) T ss_pred cccChhhhhccccchhHHHHHHHHHHHHHHHHHHHH----HhcCcHHHHHHHHHHHHhhCcEeEeeccCcCcccceEEEE Confidence 32211110 00001111222333332221 111122 234577777788899999999999887653 334556666 Q ss_pred eCcccccccccccccccc--cchhhhhcccC--ceeE-EEEcCCcceeecccccccccceeeecccc--e-EEecCcee- Q lcl|NC_021537. 137 VPAATVRVRKTTTTIERE--DGEEVENIESG--HGYV-QVRQGRRRYFGEAGDRYGDDKRFVDKETG--E-VASDAGEL- 207 (602) Q Consensus 137 l~p~~v~~~~~~~~~~~~--~~~~~~~~~~~--~~~~-qi~~~~~~~~~~~~~~~~~~~~~~~~~~g--~-~~~~~~~~- 207 (602) |..-.|............ .......+... .... .-.......+..+...++ +...+ . ++...|.. 
T Consensus 164 l~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v~~------~~~~~~~~~~~e~~g~~~ 237 (535) T protein:vir:94 164 LSSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYL------DEESGEYLKYEEIDGVEV 237 (535) T ss_pred cCeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEEEe------eCCCCcEEEEEEecCeee Confidence 654333322211110000 00000000000 0000 000000000000000001 11111 0 11111111 Q ss_pred ------EEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHH Q lcl|NC_021537. 208 ------KNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRN 281 (602) Q Consensus 208 ------~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~ 281 (602) ..|.....+.+|.....+..||.||...++..+.......+.....-.-...|..++. +++...... T Consensus 238 ~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~~~~------ 310 (535) T protein:vir:94 238 EGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVN-PAGITQVRR------ 310 (535) T ss_pred ccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc-cccccchhh------ Confidence 1233445777887777788999999999999988888777766665555555555443 333222221 Q ss_pred HHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCc Q lcl|NC_021537. 282 LMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNR 361 (602) Q Consensus 282 ~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~ 361 (602) +..+.+ |.+ +.+ ...++...++...+.-+. -.+..+.....|..+|-+.. +.. .++.. 
T Consensus 311 ----~~~~~~-g~~--v~g-----------~~~~v~~~~~~~~~~~~~-~~~~i~~~~~rI~~af~~~~--~~~-~d~~r 368 (535) T protein:vir:94 311 ----LTKAQT-GDF--VSG-----------RPEDISFLQLEKAADFSV-ARAVSEQIEGRLSYAFMLNS--AVQ-RTGER 368 (535) T ss_pred ----cccCCC-cee--ecC-----------CcccceeeecccccchhH-HHHHHHHHHHHHHHHHhHhh--hcc-CCCCC Confidence 111111 111 111 111122223322111111 13445667788888884321 211 12222 Q ss_pred cCHHHHHH--------------HHHHHHHHHHHHHHHHHH-hhhcCCccccccceEEEeccchhcch--hHHHHHHHHHH Q lcl|NC_021537. 362 ANSKEQTR--------------EFAKGIIEPEQAKFSARL-YKIIHQDALDVDEWTIDFELRGAEQP--EQDAKMAEQRV 424 (602) Q Consensus 362 sn~e~~~~--------------~f~~~~l~P~~~~ie~~l-n~~Ll~~~~~~~~~~~~f~~~~~~~~--~~d~~~~~~~~ 424 (602) -++++... .+....|.|++.+.-..+ ...+|++-.. ..+.+++. ..+-.. ..+.+....++ T Consensus 369 vTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~~p~-~~v~~~~v-s~la~l~r~~~~~~l~~~~ 446 (535) T protein:vir:94 369 VTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPELPK-EAVEPTIS-TGMEALGRGQDLDKLERCI 446 (535) T ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCh-hhccceEe-ehHHHHHHHHHHHHHHHHH Confidence 24443211 123344566655544443 3345543211 12334431 212111 11222222233 Q ss_pred HHHHhCCcccHHHHHH-HhCCCCCCCCccccccccccccccccccC----CCcCcccccccccccc-ccccccc-ccccc Q lcl|NC_021537. 425 RAMRLAGVGTVNEARE-ELDLAPFEDDRGDMTLSEFEAEFGADASD----GDAEAMLTRSKAAPPL-ENKIGER-DSVDV 497 (602) Q Consensus 425 ~~~~~~G~~T~NE~R~-~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~ 497 (602) +.+.. +.| |+.. .++. +..+...-...|.+... .+.-.+...+...... ...-.+. ..... T Consensus 447 ~~laq---~~P-~~ld~~id~--------d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~ 514 (535) T protein:vir:94 447 AAWSA---LAP-MQGDPDINI--------ATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGT 514 (535) T ss_pred HHHHh---hCh-HHhhhcCCH--------HHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 22211 112 1111 1111 10000000000100000 0000000000000000 0000000 00000 Q ss_pred cccccchhhhhcchhhhhhheeccc Q lcl|NC_021537. 
498 DVSKDPIEQTTFSSSNLDEGLYDFG 522 (602) Q Consensus 498 ~~~~~~m~~~~v~ss~~~~~~yd~~ 522 (602) -+...+..|. .. ....|--|. T Consensus 515 ~~~~~~~~~~--~~--~~~~g~~~~ 535 (535) T protein:vir:94 515 MATASPENMK--AA--AAQAGMAPN 535 (535) T ss_pred ccccChHHHH--HH--HHHhccCCC Confidence 0000000000 00 011121222 No 269 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=33.05 E-value=1.4 Score=19.81 Aligned_cols=462 Identities=9% Similarity=-0.026 Sum_probs=155.8 Q ss_pred CCCCcccccccch---hhh-cccCccccCCC--CHHHHHHHHhhhHHHHHHHHHHHHhhccC------ceEEEEecCC-C Q lcl|NC_021537. 1 MSKAEETTQLDER---HIA-TDVGRGIQPPY--NPETLAAFQELNETHQACIRKKSRYEAGY------GFEIVAHPSA-D 67 (602) Q Consensus 1 ~~k~~~~~~~~~~---~~~-~~~~~~i~p~~--~~~~l~~~~~~~~~v~~cI~~ia~~ia~~------~~~i~~~~~~-~ 67 (602) ..+..+.++..+. .++ +.....+.+.- .-..+.++. +++...|++.+|..+.+. .|++...+.. + T Consensus 18 ~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~--dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~ 95 (543) T protein:vir:88 18 YERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPW--QAVGARGLNNLSAKVMLALFPLQSWMKLKVSEWQAK 95 (543) T ss_pred HHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccc--cchHHHHHHHHHHHHHHhhcCCCcccccccChHHHh Confidence 1111111111110 000 00000010000 001111222 234456777777766642 1222111100 0 Q ss_pred CcccchhhHHHHHHhhhccc-hhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCC----ceEEEEEeCcccc Q lcl|NC_021537. 68 EPDEGGESYQTVRDFWYGSD-SRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDG----TPVGLAHVPAATV 142 (602) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~-~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G----~~~~L~~l~p~~v 142 (602) ....+......++..+..+- ..+..+. ..+++.-+..+..|+.++||+.+++..+... 
.+...+||..-.| T Consensus 96 ~~~~~~~~~~~v~~~L~~ve~~~~~~~~----~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~~~~~~~pl~~y~v 171 (543) T protein:vir:88 96 QLVSDPSQLAVVEQGLGMVERILMSYME----ANSYRVTLFELIRQLALAGTALIYLPPPDASSNSYNPMKLYTLHNHVV 171 (543) T ss_pred cccCChhhHHHHHHHHHHHHHHHHHHHH----hcCcHHHHHHHHHHHHhhCceeeeeccCccccceecceEEeEcceEEE Confidence 00001112222333332211 1222222 3457777778889999999999988765421 2345566644333 Q ss_pred ccccccccccc--ccchh-----------hhhc--ccCceeEEEEcCCcceeecccccccccceeeecccceEEecCcee Q lcl|NC_021537. 143 RVRKTTTTIER--EDGEE-----------VENI--ESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDKETGEVASDAGEL 207 (602) Q Consensus 143 ~~~~~~~~~~~--~~~~~-----------~~~~--~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 207 (602) ........... ..... +... .+....+.++.- ++....+. ...+.....+......... T Consensus 172 ~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~--V~pr~~~~----~~~~~~~~~~~~v~~~~~~ 245 (543) T protein:vir:88 172 QRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTH--IYIDDESG----DFLSYQEIEGVEVDGSDGQ 245 (543) T ss_pred eeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEE--EEeecCCC----cccccccccCeeeecCCCc Confidence 32221111100 00000 0000 000000111000 00000000 0000011112222111111 Q ss_pred EEechhHEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhh Q lcl|NC_021537. 208 KNGPANELIFLPNPSPLALYYGVPDWVAAMQTMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLK 287 (602) Q Consensus 208 ~~~~~~eviH~r~~~~~~~~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~ 287 (602) ..+.....+.+|.....+..||.||...++..+.......+.......-...|..++. +++........ . 
T Consensus 246 ~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~-~~g~~~~~~~~---------~ 315 (543) T protein:vir:88 246 YPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVN-PNGITQVRRLV---------K 315 (543) T ss_pred cccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec-cccccchhhcc---------c Confidence 1123345677787777788999999999999999989888888888777788876653 22222221110 1 Q ss_pred cccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHH Q lcl|NC_021537. 288 GSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQ 367 (602) Q Consensus 288 g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~ 367 (602) ++.+. ++. | ...++...++.....-+. ..+..+.....|-.+|-+.. +.. .++..-++++. T Consensus 316 ~~~g~----~v~-g----------~~~~v~~~~~~~~~~~~~-~~~~i~~~~~rI~~af~~~~--~~~-~~~~r~TAtEV 376 (543) T protein:vir:88 316 AQTGD----FVA-G----------RKADIEFLQLEKTADFTV-AKSVADAIEARLSYVFMLNS--AVQ-RSGERVTAEEI 376 (543) T ss_pred CCCce----eec-C----------CCCcceeeecccccchhH-HHHHHHHHHHHHHHHHhhhh--hcc-CCCCcccHHHH Confidence 11111 010 0 111122222221111111 23455667788888886542 211 12222344432 Q ss_pred HH--------------HHHHHHHHHHHHHHHHHHhh-hcCCccccccceEEEecc--chhcchhHHHHHHHHHHHHHHhC Q lcl|NC_021537. 368 TR--------------EFAKGIIEPEQAKFSARLYK-IIHQDALDVDEWTIDFEL--RGAEQPEQDAKMAEQRVRAMRLA 430 (602) Q Consensus 368 ~~--------------~f~~~~l~P~~~~ie~~ln~-~Ll~~~~~~~~~~~~f~~--~~~~~~~~d~~~~~~~~~~~~~~ 430 (602) .. .+....|.|++.+.-..+.+ .+|++... ..+.+++.. ..+.+ ..+......+++..-.. 
T Consensus 377 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~-~~v~~~~vs~l~~l~r-~~~~~~l~~~~~~v~~~ 454 (543) T protein:vir:88 377 RYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQ-EAVEPTVTTGAEALGR-GQDLDKLTQFLNAVATV 454 (543) T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCch-hceeeeEEecHHHHHH-HHHHHHHHHHHHHHHhc Confidence 21 22334456666554444433 35543322 234555532 12221 12323222333221110 Q ss_pred Cc------ccHHHHH----HHhCCCCCC---CCc-cccccccccccccccccCCCcCccccccccccccccccccccccc Q lcl|NC_021537. 431 GV------GTVNEAR----EELDLAPFE---DDR-GDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGERDSVD 496 (602) Q Consensus 431 G~------~T~NE~R----~~~Gl~p~~---~g~-~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (602) +- +..+++- +.+|.+|.. .++ -......-........+......... .......+..+... T Consensus 455 ~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~-- 529 (543) T protein:vir:88 455 SQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVA---AQATASPEAMESAM-- 529 (543) T ss_pred cchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchh---hhhccChHHHHHHh-- Confidence 00 1122221 223443310 000 00000000000000000000000000 00000000000000 Q ss_pred ccccccchhhhhcchhhh Q lcl|NC_021537. 497 VDVSKDPIEQTTFSSSNL 514 (602) Q Consensus 497 ~~~~~~~m~~~~v~ss~~ 514 (602) ....|...| .|..+ T Consensus 530 ---~~~~~~~~p-~~~~~ 543 (543) T protein:vir:88 530 ---DTAGVQPGP-IATQV 543 (543) T ss_pred ---hhcCCCCCC-CCCCC Confidence 000011111 11111 No 270 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=27.78 E-value=1.8 Score=19.17 Aligned_cols=408 Identities=10% Similarity=-0.015 Sum_probs=147.8 Q ss_pred CCCC---cccccccchhhhcccCccccCCCCHHHHHHHHhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhhHH Q lcl|NC_021537. 
1 MSKA---EETTQLDERHIATDVGRGIQPPYNPETLAAFQELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGESYQ 77 (602) Q Consensus 1 ~~k~---~~~~~~~~~~~~~~~~~~i~p~~~~~~l~~~~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~~~ 77 (602) +-|- +...+.+.+.-.. ...+.-.|.-..+++ +.-.++.+.+++.++..+..-+..+.. ..-. T Consensus 49 LPk~~~~~~~~~~d~~y~~~--~~~~~~~y~~~~~~r-A~~~n~~~~tl~~l~G~vfrk~p~~~~-----------~~~~ 114 (488) T protein:vir:96 49 LPNLGAIPPEAKTDPKVTAL--AAKIEKDWEDLTWRL-ANYVNIVNPTMNAITGAVMRREPEFDT-----------MDNP 114 (488) T ss_pred CCCCCCccccccCcchhhhh--hccchhhhHhhhhhc-cccCchhHHHHHHhcchhhccCceecc-----------CCcH Confidence 1110 0000000000000 000000000001111 112356677777776666665544321 0011 Q ss_pred HHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCc----------eEEEEEeCccccccccc Q lcl|NC_021537. 78 TVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGT----------PVGLAHVPAATVRVRKT 147 (602) Q Consensus 78 ~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~----------~~~L~~l~p~~v~~~~~ 147 (602) .+..++...+. ...+..+|.+++....+.+|-+++.+.....+. ---|..+.|..|- .+ T Consensus 115 ~l~~l~~d~D~---------~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ade~~~~~rPy~~~~~a~~Ii--nW 183 (488) T protein:vir:96 115 VLIGLRDNIDG---------KGNGIDQECKQALNALQWGSRCGWLVRSHPESATMADWNKGKKLPTAAFYDALHII--DW 183 (488) T ss_pred HHHHHHhccCC---------CCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCHHHHHHhcCCcEEEEechhhhc--Cc Confidence 23334443332 346789999999999999999999988764332 1134444554441 11 Q ss_pred ccccccccchhhhhcccCceeEEEEcCCcc---eeecccccccccceee----ecccceEEecCceeEEechhHEEEecC Q lcl|NC_021537. 148 TTTIEREDGEEVENIESGHGYVQVRQGRRR---YFGEAGDRYGDDKRFV----DKETGEVASDAGELKNGPANELIFLPN 220 (602) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~---~~~~~~~~~~~~~~~~----~~~~g~~~~~~~~~~~~~~~eviH~r~ 220 (602) .... ..|...-........+...++... ....+..+.+....+. +..++.+++.+.....+..=.++++.. 
T Consensus 184 ~~~~--v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~~~~~~e~~~~~~g~~~l~~IP~v~~~~ 261 (488) T protein:vir:96 184 EVEY--IDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTDDEYSDEWTPVLINSKQSDTIPFFLASS 261 (488) T ss_pred ceec--cCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEecCCcccceEeecCCCcccCeeEEEEEec Confidence 0000 000000000000001111111000 0000000000000000 001111222111111111112333322 Q ss_pred CCCCCCcccccHHHHHHH-HHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHHHHHHHhhcccccCcceecc Q lcl|NC_021537. 221 PSPLALYYGVPDWVAAMQ-TMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLRNLMDNLKGSRYRTAILEVE 299 (602) Q Consensus 221 ~~~~~~~~G~spl~~~~~-~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~~~~~~~~g~~nag~~~~~~ 299 (602) ...+...|.||+..+.. .+...+....+....+ ..+.|..++...+ .+++..+.... .|..-..+..... T Consensus 262 -~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~-~~~~p~lv~~~~~--~~~~~~~~~~~-----~g~~~~~~~~~~~ 332 (488) T protein:vir:96 262 -QSNEWCIDSTPLTSLAEISLSIYVMNAYSNKAMI-LANEAKWMVDMGD--MNKTMASEMNP-----LGFTLAGRMPYYV 332 (488) T ss_pred -CCCCCCCCCCchHHHHHHHHHHHhhhhHHHHHHH-hcCCceeeeccCC--CCccccccccc-----ceeeecccccccc Confidence 22445678888887754 5666666666655444 5556777664333 23332221110 0110001111100 Q ss_pred CCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCCccCHHHHHHH--HHHHHHH Q lcl|NC_021537. 300 EFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSNRANSKEQTRE--FAKGIIE 377 (602) Q Consensus 300 ~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~--f~~~~l~ 377 (602) +. ..++|...+... ...+.++...++++. .| ..++.. +++ .+.++.... .....|. 
T Consensus 333 ~~------------g~~~~~e~~~~~----l~~~~l~~l~~qm~~-~G--a~l~~~--~~~-~Ta~~~~~~~~~~~S~L~ 390 (488) T protein:vir:96 333 KN------------GDVKVIQAQFSP----ETENKVEKLFEQAVK-VG--ASLFTQ--QSN-ETATGAAIRSGSSTASMA 390 (488) T ss_pred cC------------CceeecCCchhH----HHHHHHHHHHHHHHH-Hh--HhhccC--CCc-chHHHHHHHHHHhhHHHH Confidence 00 112222221110 012223333333322 22 122211 122 234433332 2345677 Q ss_pred HHHHHHHHHHhhhcCCccc--c-------ccceEEEeccchhcchhHHHHHHHHHHHHHHhCCcccHHHHHHHhCCCCCC Q lcl|NC_021537. 378 PEQAKFSARLYKIIHQDAL--D-------VDEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVGTVNEAREELDLAPFE 448 (602) Q Consensus 378 P~~~~ie~~ln~~Ll~~~~--~-------~~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~p~~ 448 (602) -+...+++++++.|---.. . .....|+.+.+=.. ...|... .+++.+++.+|.|+....++.+-.--+- T Consensus 391 ~~a~~le~al~~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~-~~ld~~~-~~al~~~~~~G~Is~~t~~~~L~~~gvl 468 (488) T protein:vir:96 391 TLGNNVEDTVRNMLRFIMRYFEGTNLYVNPDELVFKLNRDYFD-VEVNPQM-LQVAYAAMMEGNLPQVSWFELLKRARVV 468 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCCCCcCccceEEEeccCCCC-ccCCHHH-HHHHHHHHhcCCCCHHHHHHHHHhCCcC Confidence 8888888888776542111 1 01233443322111 1113332 3456678899999988887665331111 Q ss_pred CCccccccccccccccccccCCCcCc Q lcl|NC_021537. 449 DDRGDMTLSEFEAEFGADASDGDAEA 474 (602) Q Consensus 449 ~g~~d~~~~~~~~~~~~~~~~~~~~~ 474 (602) +++.+.- ......+..+.+- T Consensus 469 ~~d~~~e------~~~~~ie~~g~~~ 488 (488) T protein:vir:96 469 RGDMSKE------EFDEHIAELGFGM 488 (488) T ss_pred CccCCHH------HHHHHHhhcCCCC Confidence 1110000 0000000000000 No 271 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=23.61 E-value=2.3 Score=18.62 Aligned_cols=412 Identities=11% Similarity=0.034 Sum_probs=145.1 Q ss_pred CCCCcccccccchhhhcccCccccCC--CCHHHH-HHH--HhhhHHHHHHHHHHHHhhccCceEEEEecCCCCcccchhh Q lcl|NC_021537. 
1 MSKAEETTQLDERHIATDVGRGIQPP--YNPETL-AAF--QELNETHQACIRKKSRYEAGYGFEIVAHPSADEPDEGGES 75 (602) Q Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~i~p~--~~~~~l-~~~--~~~~~~v~~cI~~ia~~ia~~~~~i~~~~~~~~~~~~~~~ 75 (602) +-.-..++.... .+...++ .+-..- +++ +.-.++.+..++.++..+..-+..+. . T Consensus 31 ~~~G~~~~~~r~-------~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~-------------~ 90 (489) T protein:vir:78 31 ALAGELVSYLRN-------VGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGSVMRKEPEIN-------------I 90 (489) T ss_pred HhcCcccccccC-------CCCCCCCCCCChHHHHHHHhccccCChHHHHHHHHhchhhcCCccee-------------c Confidence 000000001000 0011111 111111 111 11235556666666665555443321 0 Q ss_pred HHHHHHhhhccchhhhhhccCCccCCHHHHHHHHHHHHHhcCCeEEEEeeCCCCc-----------eEEEEEeCcccccc Q lcl|NC_021537. 76 YQTVRDFWYGSDSRWQIGPEGTAMSTPEEVLELGRQDYHGIGWAALEILVEGDGT-----------PVGLAHVPAATVRV 144 (602) Q Consensus 76 ~~~~~~~~~~~~~~~~l~~~pn~~~t~~~~~~~~~~d~l~~Gna~~~i~r~~~G~-----------~~~L~~l~p~~v~~ 144 (602) -..+..++...+. ...+..+|.+.+....+.+|-+++.+.....|. ---|..+.|..|- T Consensus 91 p~~l~~l~~d~D~---------~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~~~Ii- 160 (489) T protein:vir:78 91 PKELEYLLKNADG---------SGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYTTENIV- 160 (489) T ss_pred cHHHHHHHhccCC---------CCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEechhhhc- Confidence 1223333433332 346789999999999999999999998765542 1124445555441 Q ss_pred cccccccccccchhhhhcccCceeEEEEcCCcceeecccccccccceeeec--------------ccce-------EEec Q lcl|NC_021537. 145 RKTTTTIEREDGEEVENIESGHGYVQVRQGRRRYFGEAGDRYGDDKRFVDK--------------ETGE-------VASD 203 (602) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~qi~~~~~~~~~~~~~~~~~~~~~~~~--------------~~g~-------~~~~ 203 (602) .+... ...|...-........+.+... .. .|+......++++.. ..|. +... 
T Consensus 161 -nW~~~--~v~G~~~Lt~v~lrE~~~~~d~-~~---~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~~~~~~~ 233 (489) T protein:vir:78 161 -NWRLT--RVGSVNRVTMVVLRETWEYNEP-GN---EFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDVVEIYPD 233 (489) T ss_pred -Cceee--eeCCccceeEEEEEEeEEeecC-CC---CccceeEEEEEEEecCCCcceEEEEEEeecCCcccceeeEEecc Confidence 00000 0000000000000000001000 00 000000011111110 0000 0011 Q ss_pred CceeEEechhHEEEec-C-CCCCCCcccccHHHHHHH-HHHHHHHHHHHHHHHHHhcCCCceEEEeccccCCHHHHHHHH Q lcl|NC_021537. 204 AGELKNGPANELIFLP-N-PSPLALYYGVPDWVAAMQ-TMGADQAAKEWNHDVFDNLGIPHYAVKVTGGTLSEDSKEDLR 280 (602) Q Consensus 204 ~~~~~~~~~~eviH~r-~-~~~~~~~~G~spl~~~~~-~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~l~ 280 (602) .|+ ..-..|-|- . ....+...|.||+..+.. .+...+....+ ...+...+.|..+++-. ...+++..+... T Consensus 234 ~g~----~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~-~~~l~~~~~P~l~i~G~-d~~~~~~~~~~~ 307 (489) T protein:vir:78 234 LGE----SLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADN-EESSFVVGQPTLFIYPG-ENLTPQAFKEAN 307 (489) T ss_pred CCC----CccCeeeEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHH-HHHHHHcccceeeeecC-ccCCcccccccC Confidence 111 011112221 1 122344578889887765 34444444443 34455667777776521 112332222111 Q ss_pred HHHHHhhcccccCcceeccCCccceeccccccccccccccccccchHHHHHHHHHHhhHHHHHHHhcCChHHhhccccCC Q lcl|NC_021537. 281 NLMDNLKGSRYRTAILEVEEFVDDHGLGDGGSDVNIELEPIGAREDLDMEFQAFRERNEHEIAKVHGVPPVLINVTSTSN 360 (602) Q Consensus 281 ~~~~~~~g~~nag~~~~~~~g~~~~~~~~~~~~~~~~~~pl~~~~~~d~qf~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~ 360 (602) . ....-+.+++ +.++.+.+. ++...+..+ +. .+.++....+++ ..| ..++. 
.++ T Consensus 308 ~--~~i~~g~~~~--~~lp~~~~~------------~~ie~~~~~---~~-r~~l~~le~qm~-~lG--a~l~~---~~~ 361 (489) T protein:vir:78 308 P--NGIKFGSRRG--HNLGYGGSA------------QLIQAGENN---LA-RQNMLDKEQQAI-QIG--AQLIT---PTQ 361 (489) T ss_pred c--cceeeCCccc--ccCCCCCCc------------ceeccCcch---HH-HHHHHHHHHHHH-HHh--hhhcc---CCc Confidence 0 0111112222 222222211 111111111 11 122222222222 222 12331 111 Q ss_pred ccCHHHHHH--HHHHHHHHHHHHHHHHHHhhhcCC--ccccc---cceEEEeccchhcchhHHHHHHHHHHHHHHhCCcc Q lcl|NC_021537. 361 RANSKEQTR--EFAKGIIEPEQAKFSARLYKIIHQ--DALDV---DEWTIDFELRGAEQPEQDAKMAEQRVRAMRLAGVG 433 (602) Q Consensus 361 ~sn~e~~~~--~f~~~~l~P~~~~ie~~ln~~Ll~--~~~~~---~~~~~~f~~~~~~~~~~d~~~~~~~~~~~~~~G~~ 433 (602) .-+.++... ......|.-++..+++++++.|-- ...+. .+..|+.+.+=. ....+++. .+++..++++|.| T Consensus 362 ~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~~~~~~~~i~~n~dF~-~~~~d~~~-~~al~~~~~~G~i 439 (489) T protein:vir:78 362 QITAQSARIQRGADTSVMATIARNVSQAYTDALRWVAVMLGKPEDTEVEFRLNMDFF-LEPMTAQD-RAAWMADINAGLL 439 (489) T ss_pred chhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCceEEEeecccC-cccCCHHH-HHHHHHHHhcCCC Confidence 233333322 233456777888888888766432 11111 122332222211 11113332 3445678899999 Q ss_pred cHHHHHHHhCCCCCCCCccccccccccccccccccCCCcCcccccccccccccccccc Q lcl|NC_021537. 434 TVNEAREELDLAPFEDDRGDMTLSEFEAEFGADASDGDAEAMLTRSKAAPPLENKIGE 491 (602) Q Consensus 434 T~NE~R~~~Gl~p~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (602) +....++.+-.--+.+...+.... .+..++ .+.......+.|+..++..+ T Consensus 440 s~~t~~~~L~~~gv~d~~~e~~~~----ei~~~~----~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 440 PATAYYAALRKAGVTDWTDADIKD----AVADQP----LPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred CHHHHHHHHHhCCCCCccHHHHHH----HHhhcC----CCcccCCcccCCCCcccccC Confidence 988887765332121111110000 000000 00001111111111111111 Done!