Query lcl|NC_018285.1_cdsid_YP_006561264.1 [gene=Ssal_phage00050] [protein=portal protein] [protein_id=YP_006561264.1] [location=28391..29542]
Match_columns 383
No_of_seqs 129 out of 1019
Neff 9.9
Searched_HMMs 1612
Date Thu Nov 7 13:05:31 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_43 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_43_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:4854 Length: 386 # 100.0 5.5E-97 3E-100 548.3 42.4 383 1-383 1-386 (386)
2 protein:vir:4828 Length: 382 # 100.0 1.7E-95 1E-98 540.2 42.0 382 1-383 1-382 (382)
3 protein:vir:4952 Length: 386 # 100.0 1.7E-93 1E-96 529.2 41.5 383 1-383 1-386 (386)
4 protein:vir:7407 Length: 392 # 100.0 2.5E-89 1.5E-92 506.3 41.7 380 1-383 3-391 (392)
5 protein:vir:4995 Length: 384 # 100.0 8.5E-89 5.3E-92 503.4 41.3 381 1-383 1-384 (384)
6 protein:vir:3843 Length: 397 # 100.0 8.7E-89 5.4E-92 503.4 40.4 378 1-383 1-388 (397)
7 protein:vir:1023 Length: 392 # 100.0 5.2E-88 3.2E-91 499.1 42.1 380 1-383 3-391 (392)
8 protein:vir:3989 Length: 392 # 100.0 5.2E-88 3.2E-91 499.1 42.1 380 1-383 3-391 (392)
9 protein:vir:98396 Length: 441 100.0 1.3E-86 8.2E-90 491.4 38.9 379 1-381 26-441 (441)
10 protein:vir:1380 Length: 422 # 100.0 3.1E-86 1.9E-89 489.4 39.9 378 1-382 1-422 (422)
11 protein:vir:81095 Length: 416 100.0 2.8E-86 1.7E-89 489.6 38.9 378 1-381 1-416 (416)
12 protein:vir:4598 Length: 416 # 100.0 2.8E-86 1.7E-89 489.6 38.9 378 1-381 1-416 (416)
13 protein:vir:9408 Length: 441 # 100.0 2.9E-86 1.8E-89 489.5 38.7 379 1-381 26-441 (441)
14 protein:vir:79984 Length: 441 100.0 2.9E-86 1.8E-89 489.5 38.7 379 1-381 26-441 (441)
15 protein:vir:102080 Length: 429 100.0 1.2E-85 7.3E-89 486.2 40.6 379 1-383 1-421 (429)
16 protein:vir:81152 Length: 411 100.0 5.9E-86 3.7E-89 487.8 38.7 373 1-379 1-411 (411)
17 protein:vir:4509 Length: 424 # 100.0 2.6E-85 1.6E-88 484.3 40.9 375 1-383 16-424 (424)
18 protein:vir:105002 Length: 432 100.0 2.7E-85 1.7E-88 484.2 40.0 379 1-383 1-424 (432)
19 protein:vir:107605 Length: 432 100.0 2.7E-85 1.7E-88 484.2 40.0 379 1-383 1-424 (432)
20 protein:vir:102855 Length: 432 100.0 2.7E-85 1.7E-88 484.2 40.0 379 1-383 1-424 (432)
21 protein:vir:93610 Length: 454 100.0 2.1E-85 1.3E-88 484.8 38.4 379 3-383 1-431 (454)
22 protein:vir:81072 Length: 432 100.0 3.7E-85 2.3E-88 483.4 39.4 376 1-383 7-428 (432)
23 protein:vir:8418 Length: 409 # 100.0 7.5E-85 4.6E-88 481.8 40.9 377 1-383 1-409 (409)
24 protein:vir:97060 Length: 432 100.0 6.6E-85 4.1E-88 482.1 39.2 376 1-383 7-427 (432)
25 protein:vir:10362 Length: 432 100.0 9.4E-85 5.8E-88 481.2 39.2 376 1-383 7-428 (432)
26 protein:vir:4454 Length: 414 # 100.0 1.6E-84 1E-87 479.9 40.4 377 1-383 1-410 (414)
27 protein:vir:5737 Length: 419 # 100.0 1.4E-84 8.8E-88 480.3 39.4 374 1-383 1-407 (419)
28 protein:vir:2683 Length: 412 # 100.0 2.5E-84 1.5E-87 478.9 39.9 375 1-383 1-411 (412)
29 protein:vir:96980 Length: 409 100.0 2.4E-84 1.5E-87 479.0 39.8 376 1-383 4-408 (409)
30 protein:vir:100249 Length: 431 100.0 2.4E-84 1.5E-87 479.1 39.6 375 1-383 1-427 (431)
31 protein:vir:100150 Length: 437 100.0 2.4E-84 1.5E-87 479.0 39.5 377 1-383 1-430 (437)
32 protein:vir:102118 Length: 409 100.0 5.3E-84 3.3E-87 477.1 39.5 372 1-379 1-409 (409)
33 protein:vir:6240 Length: 457 # 100.0 5E-84 3.1E-87 477.2 39.1 381 1-383 1-450 (457)
34 protein:vir:105064 Length: 421 100.0 9.9E-84 6.2E-87 475.6 39.2 374 1-383 1-411 (421)
35 protein:vir:93943 Length: 409 100.0 1.5E-83 9.2E-87 474.7 39.9 376 1-383 4-408 (409)
36 protein:vir:1884 Length: 424 # 100.0 2.7E-83 1.7E-86 473.3 39.2 375 1-383 14-421 (424)
37 protein:vir:7853 Length: 518 # 100.0 4.1E-83 2.5E-86 472.3 39.6 381 1-383 1-414 (518)
38 protein:vir:189 Length: 424 # 100.0 3.2E-83 2E-86 472.9 38.8 375 1-383 14-421 (424)
39 protein:vir:94426 Length: 409 100.0 4.8E-83 3E-86 471.9 39.7 376 1-383 4-408 (409)
40 protein:vir:1326 Length: 457 # 100.0 6.7E-83 4.2E-86 471.1 39.9 381 1-383 1-447 (457)
41 protein:vir:1266 Length: 416 # 100.0 7.7E-83 4.7E-86 470.8 40.1 374 2-383 1-416 (416)
42 protein:vir:101648 Length: 518 100.0 1E-82 6.3E-86 470.1 39.7 380 1-383 1-414 (518)
43 protein:vir:3868 Length: 417 # 100.0 3.3E-82 2.1E-85 467.3 39.5 374 1-383 1-406 (417)
44 protein:vir:483 Length: 413 # 100.0 4.9E-82 3E-85 466.4 39.7 376 1-383 1-402 (413)
45 protein:vir:9702 Length: 406 # 100.0 4.5E-82 2.8E-85 466.6 38.1 373 1-383 1-403 (406)
46 protein:vir:4337 Length: 434 # 100.0 6.1E-82 3.8E-85 465.8 38.3 376 1-383 3-434 (434)
47 protein:vir:1431 Length: 419 # 100.0 2.3E-81 1.4E-84 462.7 38.2 372 2-383 1-408 (419)
48 protein:vir:80333 Length: 419 100.0 2.4E-81 1.5E-84 462.6 38.1 374 1-383 1-408 (419)
49 protein:vir:81218 Length: 423 100.0 9.4E-81 5.8E-84 459.3 40.0 382 1-383 1-420 (423)
50 protein:vir:8317 Length: 409 # 100.0 1.7E-80 1E-83 458.0 38.3 358 1-379 1-409 (409)
51 protein:vir:1082 Length: 359 # 100.0 4.8E-80 3E-83 455.4 37.4 352 1-359 1-359 (359)
52 protein:vir:100187 Length: 385 100.0 1.9E-79 1.2E-82 452.1 39.2 371 1-382 1-385 (385)
53 protein:vir:8100 Length: 466 # 100.0 8.5E-79 5.3E-82 448.6 39.5 381 1-382 1-466 (466)
54 protein:vir:94666 Length: 723 100.0 7.5E-79 4.7E-82 448.9 38.8 365 1-383 1-405 (723)
55 protein:vir:100882 Length: 383 100.0 1.1E-78 6.7E-82 448.0 38.5 369 1-380 1-383 (383)
56 protein:vir:101647 Length: 460 100.0 1.6E-78 1E-81 447.0 36.7 381 1-382 2-460 (460)
57 protein:vir:95378 Length: 406 100.0 5.8E-77 3.6E-80 438.6 38.3 369 1-383 1-402 (406)
58 protein:vir:80134 Length: 403 100.0 2.3E-76 1.4E-79 435.3 36.9 366 1-383 1-399 (403)
59 protein:vir:104259 Length: 403 100.0 7.4E-76 4.6E-79 432.5 38.0 367 1-383 1-397 (403)
60 protein:vir:960 Length: 413 # 100.0 1E-75 6.3E-79 431.7 37.8 367 1-379 13-413 (413)
61 protein:vir:102727 Length: 945 100.0 3.8E-75 2.4E-78 428.6 39.1 381 1-383 62-530 (945)
62 protein:vir:6210 Length: 394 # 100.0 4.9E-75 3.1E-78 428.0 36.6 361 1-382 1-394 (394)
63 protein:vir:9359 Length: 348 # 100.0 9.3E-74 5.8E-77 421.0 36.0 320 58-383 1-347 (348)
64 protein:vir:95965 Length: 385 100.0 6.7E-71 4.1E-74 405.3 34.1 354 1-382 1-385 (385)
65 protein:vir:78310 Length: 376 100.0 7.9E-71 4.9E-74 404.9 33.2 354 1-381 1-376 (376)
66 protein:vir:9507 Length: 395 # 100.0 4.3E-70 2.6E-73 400.9 36.5 354 1-383 1-394 (395)
67 protein:vir:101289 Length: 395 100.0 4.3E-70 2.6E-73 400.9 36.5 354 1-383 1-394 (395)
68 protein:vir:100650 Length: 395 100.0 4.3E-70 2.6E-73 400.9 36.5 354 1-383 1-394 (395)
69 protein:vir:4194 Length: 540 # 100.0 8.3E-70 5.2E-73 399.3 36.2 375 1-383 6-430 (540)
70 protein:vir:9641 Length: 395 # 100.0 6.8E-70 4.2E-73 399.8 33.1 362 1-383 1-393 (395)
71 protein:vir:80644 Length: 551 100.0 3.4E-68 2.1E-71 390.4 38.0 378 1-383 5-509 (551)
72 protein:vir:4089 Length: 395 # 100.0 1.3E-68 8E-72 392.8 34.8 362 1-383 1-392 (395)
73 protein:vir:80796 Length: 574 100.0 4.9E-68 3.1E-71 389.6 37.1 375 1-383 27-512 (574)
74 protein:vir:4156 Length: 542 # 100.0 8.7E-68 5.4E-71 388.2 37.0 378 1-383 6-432 (542)
75 protein:vir:63755 Length: 547 100.0 3.5E-67 2.2E-70 384.9 37.9 379 1-383 1-505 (547)
76 protein:vir:98643 Length: 395 100.0 6.9E-68 4.3E-71 388.8 33.8 361 1-383 1-393 (395)
77 protein:vir:96579 Length: 576 100.0 9E-67 5.6E-70 382.7 38.0 382 1-383 32-496 (576)
78 protein:vir:3153 Length: 467 # 100.0 1.2E-66 7.5E-70 382.0 37.3 346 38-383 1-444 (467)
79 protein:vir:100691 Length: 535 100.0 1.6E-66 1E-69 381.3 36.2 378 1-383 34-496 (535)
80 protein:vir:1661 Length: 378 # 100.0 4.3E-65 2.6E-68 373.5 33.7 326 1-382 1-378 (378)
81 protein:vir:99312 Length: 563 100.0 2.2E-64 1.4E-67 369.6 37.2 379 1-383 43-512 (563)
82 protein:vir:95599 Length: 563 100.0 2.2E-64 1.4E-67 369.6 37.2 379 1-383 43-512 (563)
83 protein:vir:93867 Length: 378 100.0 7.5E-65 4.6E-68 372.1 32.8 326 1-382 1-378 (378)
84 protein:vir:94002 Length: 378 100.0 1.2E-64 7.6E-68 371.0 33.7 326 1-382 1-378 (378)
85 protein:vir:94869 Length: 378 100.0 2.8E-63 1.8E-66 363.5 33.2 326 1-382 1-378 (378)
86 protein:vir:858 Length: 378 # 100.0 3.9E-62 2.4E-65 357.2 33.8 327 1-383 1-378 (378)
87 protein:vir:99452 Length: 651 100.0 3E-62 1.9E-65 357.9 32.8 383 1-383 1-527 (651)
88 protein:vir:79772 Length: 648 100.0 1.8E-61 1.1E-64 353.6 36.1 378 1-383 34-495 (648)
89 protein:vir:78641 Length: 278 100.0 4.3E-59 2.7E-62 340.6 29.9 269 58-338 1-278 (278)
90 protein:vir:79150 Length: 368 100.0 3.3E-52 2.1E-55 302.8 28.7 330 1-346 1-368 (368)
91 protein:vir:103971 Length: 376 100.0 1E-51 6.2E-55 300.2 30.8 308 1-345 26-376 (376)
92 protein:vir:98567 Length: 340 100.0 1.4E-51 8.9E-55 299.3 30.1 303 1-327 1-340 (340)
93 protein:vir:100328 Length: 346 100.0 2.6E-51 1.6E-54 297.9 30.7 312 1-328 1-346 (346)
94 protein:vir:267 Length: 348 # 100.0 2.2E-51 1.4E-54 298.3 29.6 311 1-335 1-348 (348)
95 protein:vir:79207 Length: 351 100.0 2.6E-51 1.6E-54 297.9 29.7 308 1-345 1-351 (351)
96 protein:vir:5691 Length: 344 # 100.0 4.6E-51 2.8E-54 296.6 30.4 314 1-328 1-344 (344)
97 protein:vir:6058 Length: 344 # 100.0 7.3E-51 4.5E-54 295.5 29.4 311 1-328 1-344 (344)
98 protein:vir:78191 Length: 351 100.0 2.3E-50 1.4E-53 292.7 29.8 308 1-345 1-351 (351)
99 protein:vir:78749 Length: 337 100.0 2.4E-50 1.5E-53 292.6 29.5 308 1-339 1-337 (337)
100 protein:vir:2013 Length: 344 # 100.0 6.6E-50 4.1E-53 290.2 30.7 311 1-329 1-344 (344)
101 protein:vir:1150 Length: 350 # 100.0 2.9E-50 1.8E-53 292.2 28.6 316 1-338 1-350 (350)
102 protein:vir:4698 Length: 251 # 100.0 2.1E-50 1.3E-53 292.9 25.5 238 1-239 1-251 (251)
103 protein:vir:3780 Length: 345 # 100.0 3.1E-49 1.9E-52 286.5 30.0 320 1-325 1-345 (345)
104 protein:vir:3743 Length: 345 # 100.0 7.4E-49 4.6E-52 284.5 30.7 308 1-325 1-345 (345)
105 protein:vir:98853 Length: 219 100.0 3E-41 1.9E-44 242.7 21.4 202 123-327 1-219 (219)
106 protein:vir:5249 Length: 437 # 99.9 1.8E-24 1.1E-27 150.8 29.6 371 1-382 1-437 (437)
107 protein:vir:79647 Length: 435 99.9 3.6E-23 2.2E-26 143.6 28.3 373 1-380 1-435 (435)
108 protein:vir:107742 Length: 537 99.8 2.7E-20 1.7E-23 127.9 28.0 371 1-383 25-512 (537)
109 protein:vir:94049 Length: 532 99.8 2.9E-20 1.8E-23 127.7 25.7 373 1-383 33-513 (532)
110 protein:vir:80040 Length: 461 99.8 6.9E-20 4.3E-23 125.6 26.5 377 1-383 1-460 (461)
111 protein:vir:99563 Length: 862 99.8 3.7E-19 2.3E-22 121.7 27.1 368 1-383 101-544 (862)
112 protein:vir:104338 Length: 422 99.8 5.3E-19 3.3E-22 120.8 27.9 363 1-383 1-422 (422)
113 protein:vir:79538 Length: 502 99.8 6.4E-19 3.9E-22 120.3 27.5 383 1-383 1-502 (502)
114 protein:vir:107662 Length: 427 99.8 6.7E-19 4.2E-22 120.2 27.3 369 1-383 1-427 (427)
115 protein:vir:96068 Length: 765 99.7 7.2E-18 4.5E-21 114.6 27.2 371 1-383 37-516 (765)
116 protein:vir:108215 Length: 469 99.7 4.1E-16 2.5E-19 104.9 34.1 381 3-383 1-452 (469)
117 protein:vir:95542 Length: 548 99.7 1.7E-17 1E-20 112.5 25.2 382 1-383 1-496 (548)
118 protein:vir:96738 Length: 505 99.7 4.2E-17 2.6E-20 110.4 24.8 381 1-383 8-504 (505)
119 protein:vir:389 Length: 530 # 99.7 3E-16 1.8E-19 105.7 28.5 383 1-383 1-525 (530)
120 protein:vir:99853 Length: 488 99.6 2.3E-15 1.4E-18 100.9 30.0 358 4-383 1-407 (488)
121 protein:vir:79063 Length: 491 99.6 8.7E-15 5.4E-18 97.7 32.3 362 1-383 3-418 (491)
122 protein:vir:107880 Length: 491 99.6 1.2E-14 7.7E-18 96.8 33.0 362 1-383 3-418 (491)
123 protein:vir:103860 Length: 528 99.6 2.3E-15 1.5E-18 100.8 28.3 368 1-383 1-449 (528)
124 protein:vir:10321 Length: 495 99.6 3.7E-15 2.3E-18 99.7 29.1 381 1-383 1-494 (495)
125 protein:vir:99232 Length: 526 99.6 1.1E-14 6.7E-18 97.2 30.5 368 1-383 1-447 (526)
126 protein:vir:3420 Length: 533 # 99.6 4.9E-15 3E-18 99.0 27.7 383 1-383 3-525 (533)
127 protein:vir:6382 Length: 553 # 99.6 1E-14 6.4E-18 97.3 28.8 382 1-383 2-551 (553)
128 protein:vir:79233 Length: 526 99.6 3.7E-14 2.3E-17 94.2 30.5 366 1-383 1-447 (526)
129 protein:vir:1986 Length: 512 # 99.5 3.7E-14 2.3E-17 94.2 26.7 362 1-383 1-438 (512)
130 protein:vir:79511 Length: 448 99.5 5.6E-13 3.5E-16 87.7 29.5 373 1-383 1-439 (448)
131 protein:vir:106716 Length: 698 99.4 3E-13 1.9E-16 89.2 22.3 370 1-383 67-545 (698)
132 protein:vir:78589 Length: 695 99.4 5E-13 3.1E-16 88.0 23.2 370 1-383 67-544 (695)
133 protein:vir:101541 Length: 694 99.4 5.3E-13 3.3E-16 87.9 23.1 369 1-383 66-543 (694)
134 protein:vir:3648 Length: 695 # 99.4 5.4E-13 3.4E-16 87.8 22.8 370 1-383 67-544 (695)
135 protein:vir:77981 Length: 448 99.4 1.7E-11 1.1E-14 79.6 30.5 369 1-383 1-431 (448)
136 protein:vir:98816 Length: 446 99.4 4.4E-12 2.7E-15 82.9 26.4 355 1-362 3-446 (446)
137 protein:vir:78161 Length: 355 99.2 1.9E-10 1.2E-13 73.9 26.6 281 101-383 1-322 (355)
138 protein:vir:105782 Length: 449 99.1 1.2E-10 7.2E-14 75.0 22.8 362 1-381 23-449 (449)
139 protein:vir:95254 Length: 488 99.0 4.1E-09 2.5E-12 66.6 29.7 376 1-383 1-469 (488)
140 protein:vir:98444 Length: 434 99.0 5.5E-09 3.4E-12 65.9 25.9 347 26-383 1-430 (434)
141 protein:vir:7768 Length: 484 # 98.9 9.4E-09 5.8E-12 64.6 24.0 370 1-383 14-470 (484)
142 protein:vir:106491 Length: 646 98.7 4.9E-08 3E-11 60.7 23.6 373 1-383 1-484 (646)
143 protein:vir:5839 Length: 533 # 98.7 4.3E-08 2.6E-11 61.0 23.2 379 1-383 4-506 (533)
144 protein:vir:99916 Length: 504 98.6 1.6E-07 9.8E-11 57.9 25.0 368 1-383 23-482 (504)
145 protein:vir:8654 Length: 629 # 98.6 1.1E-08 7E-12 64.2 17.4 371 1-383 1-513 (629)
146 protein:vir:2427 Length: 485 # 98.6 1.7E-07 1.1E-10 57.7 24.8 369 1-383 6-471 (485)
147 protein:vir:99088 Length: 629 98.6 1.3E-08 8.2E-12 63.8 17.1 371 1-383 1-513 (629)
148 protein:vir:79703 Length: 505 98.6 2.1E-07 1.3E-10 57.2 27.2 372 1-378 1-505 (505)
149 protein:vir:94742 Length: 409 98.6 2.3E-07 1.4E-10 57.0 25.6 337 1-359 3-409 (409)
150 protein:vir:102426 Length: 631 98.5 8.9E-08 5.5E-11 59.3 19.7 376 1-383 1-508 (631)
151 protein:vir:7987 Length: 456 # 98.5 1.5E-07 9.1E-11 58.0 19.9 365 1-383 7-456 (456)
152 protein:vir:1587 Length: 508 # 98.4 6.8E-07 4.2E-10 54.4 24.0 370 1-380 1-508 (508)
153 protein:vir:9568 Length: 410 # 98.4 8.5E-07 5.3E-10 53.9 25.5 347 1-382 1-410 (410)
154 protein:vir:104082 Length: 485 98.4 9.7E-07 6E-10 53.5 23.5 368 1-383 8-483 (485)
155 protein:vir:99072 Length: 479 98.4 1.1E-06 6.9E-10 53.2 23.2 364 1-383 15-463 (479)
156 protein:vir:4898 Length: 502 # 98.3 1.2E-06 7.2E-10 53.1 25.1 377 1-383 46-492 (502)
157 protein:vir:97900 Length: 639 98.3 1.6E-07 9.8E-11 57.9 16.6 375 1-383 1-509 (639)
158 protein:vir:107517 Length: 639 98.3 1.6E-07 9.8E-11 57.9 16.6 375 1-383 1-509 (639)
159 protein:vir:1634 Length: 409 # 98.3 1.5E-06 9.6E-10 52.4 24.9 337 1-359 3-409 (409)
160 protein:vir:102602 Length: 456 98.3 5.1E-07 3.2E-10 55.1 18.3 367 1-383 7-456 (456)
161 protein:vir:105819 Length: 456 98.3 5.1E-07 3.2E-10 55.1 18.3 367 1-383 7-456 (456)
162 protein:vir:9751 Length: 422 # 98.3 1.7E-06 1E-09 52.3 26.4 350 1-376 4-422 (422)
163 protein:vir:106027 Length: 629 98.3 8.6E-07 5.3E-10 53.8 19.4 375 1-383 1-500 (629)
164 protein:vir:103219 Length: 201 98.2 1.4E-07 8.5E-11 58.2 14.1 174 204-383 1-201 (201)
165 protein:vir:95806 Length: 440 98.2 2.3E-06 1.5E-09 51.5 25.4 367 3-382 1-440 (440)
166 protein:vir:2732 Length: 501 # 98.2 2.4E-06 1.5E-09 51.4 25.3 376 1-383 38-491 (501)
167 protein:vir:9306 Length: 511 # 98.2 3E-06 1.8E-09 50.9 25.4 375 1-383 46-501 (511)
168 protein:vir:2341 Length: 488 # 98.2 3.1E-06 2E-09 50.8 23.6 369 1-383 10-485 (488)
169 protein:vir:80959 Length: 499 98.1 3.8E-06 2.4E-09 50.3 25.0 372 1-383 16-499 (499)
170 protein:vir:80680 Length: 441 98.1 3.8E-06 2.4E-09 50.3 24.0 361 1-383 6-440 (441)
171 protein:vir:106639 Length: 481 98.1 4.1E-06 2.5E-09 50.1 26.8 375 1-383 6-480 (481)
172 protein:vir:4223 Length: 486 # 98.1 4.4E-06 2.7E-09 50.0 23.9 366 1-383 15-477 (486)
173 protein:vir:105889 Length: 474 98.1 4.4E-06 2.8E-09 49.9 27.7 366 1-383 4-469 (474)
174 protein:vir:94101 Length: 474 98.1 4.4E-06 2.8E-09 49.9 27.7 366 1-383 4-469 (474)
175 protein:vir:96494 Length: 501 98.1 5.6E-06 3.5E-09 49.4 25.6 378 1-383 38-495 (501)
176 protein:vir:97171 Length: 512 98.0 6.6E-06 4.1E-09 49.0 25.2 377 1-383 46-501 (512)
177 protein:vir:5961 Length: 503 # 98.0 6.8E-06 4.2E-09 48.9 28.9 370 1-383 31-487 (503)
178 protein:vir:38 Length: 496 # N 98.0 7.9E-06 4.9E-09 48.5 25.3 373 1-383 16-496 (496)
179 protein:vir:99522 Length: 470 98.0 8.6E-06 5.3E-09 48.4 27.8 368 1-383 31-469 (470)
180 protein:vir:2500 Length: 501 # 98.0 9.1E-06 5.6E-09 48.2 25.4 360 1-383 33-485 (501)
181 protein:vir:78907 Length: 518 98.0 9.2E-06 5.7E-09 48.2 26.9 372 1-382 1-518 (518)
182 protein:vir:96240 Length: 511 97.9 1E-05 6.5E-09 47.9 26.1 378 1-383 46-503 (511)
183 protein:vir:78805 Length: 511 97.9 1.1E-05 6.7E-09 47.8 24.9 375 1-383 46-499 (511)
184 protein:vir:96366 Length: 511 97.9 1.1E-05 6.7E-09 47.8 24.9 375 1-383 46-499 (511)
185 protein:vir:99781 Length: 511 97.9 1.1E-05 6.8E-09 47.8 25.4 375 1-383 46-503 (511)
186 protein:vir:103951 Length: 511 97.9 1.3E-05 7.9E-09 47.4 26.3 375 1-383 46-503 (511)
187 protein:vir:1236 Length: 483 # 97.9 1.5E-05 9.1E-09 47.1 27.1 363 1-383 41-474 (483)
188 protein:vir:95113 Length: 474 97.8 2E-05 1.3E-08 46.3 27.8 361 1-383 33-465 (474)
189 protein:vir:4782 Length: 522 # 97.8 2.1E-05 1.3E-08 46.2 27.0 373 1-383 1-516 (522)
190 protein:vir:93747 Length: 472 97.8 2.2E-05 1.4E-08 46.1 27.0 363 1-383 30-469 (472)
191 protein:vir:97336 Length: 492 97.7 2.3E-05 1.4E-08 46.0 26.0 363 1-383 50-482 (492)
192 protein:vir:9871 Length: 429 # 97.6 3.8E-05 2.3E-08 44.8 25.5 360 1-383 7-427 (429)
193 protein:vir:78537 Length: 480 97.6 4.1E-05 2.5E-08 44.7 24.7 372 1-383 1-467 (480)
194 protein:vir:9815 Length: 500 # 97.6 4.6E-05 2.8E-08 44.4 24.0 374 1-383 1-496 (500)
195 protein:vir:3028 Length: 500 # 97.6 4.6E-05 2.8E-08 44.4 24.0 374 1-383 1-496 (500)
196 protein:vir:8184 Length: 474 # 97.6 4.6E-05 2.9E-08 44.3 24.9 365 1-381 17-474 (474)
197 protein:vir:4073 Length: 279 # 97.5 1.7E-06 1.1E-09 52.2 9.2 258 42-356 1-279 (279)
198 protein:vir:97376 Length: 320 97.5 3.3E-06 2.1E-09 50.6 10.5 309 1-382 1-320 (320)
199 protein:vir:3964 Length: 453 # 97.4 7.1E-05 4.4E-08 43.3 27.6 367 1-383 19-453 (453)
200 protein:vir:78227 Length: 480 97.4 7.6E-05 4.7E-08 43.2 24.3 368 1-383 1-467 (480)
201 protein:vir:94805 Length: 492 97.3 0.0001 6.3E-08 42.5 27.2 363 1-383 50-487 (492)
202 protein:vir:97447 Length: 474 97.3 0.0001 6.5E-08 42.4 27.8 358 1-383 33-464 (474)
203 protein:vir:94498 Length: 474 97.3 0.0001 6.5E-08 42.4 27.8 358 1-383 33-464 (474)
204 protein:vir:94546 Length: 506 97.2 0.00013 8.1E-08 41.9 21.7 370 1-383 28-499 (506)
205 protein:vir:79043 Length: 479 97.0 0.00024 1.5E-07 40.4 28.2 363 1-383 25-478 (479)
206 protein:vir:96839 Length: 474 96.9 0.00024 1.5E-07 40.4 26.8 358 1-383 32-472 (474)
207 protein:vir:733 Length: 453 # 96.8 0.00031 1.9E-07 39.8 25.3 373 1-383 23-452 (453)
208 protein:vir:3609 Length: 452 # 96.7 0.00044 2.7E-07 39.0 28.2 365 1-383 6-452 (452)
209 protein:vir:105292 Length: 478 96.6 0.00046 2.8E-07 38.9 27.1 361 1-383 32-478 (478)
210 protein:vir:95899 Length: 474 96.6 0.00047 2.9E-07 38.8 26.4 365 1-383 33-464 (474)
211 protein:vir:96266 Length: 474 96.6 0.00047 2.9E-07 38.8 26.4 365 1-383 33-464 (474)
212 protein:vir:4995 Length: 384 # 96.4 0.00021 1.3E-07 40.7 11.7 297 1-367 30-384 (384)
213 protein:vir:98883 Length: 517 96.3 0.0008 4.9E-07 37.6 25.7 374 1-383 16-509 (517)
214 protein:vir:107112 Length: 478 96.2 0.00083 5.1E-07 37.5 27.9 364 1-383 32-478 (478)
215 protein:vir:106571 Length: 499 95.5 0.0019 1.2E-06 35.5 27.6 369 1-383 22-477 (499)
216 protein:vir:102950 Length: 471 95.3 0.0024 1.5E-06 35.0 27.8 364 1-383 1-471 (471)
217 protein:vir:96179 Length: 468 94.3 0.0047 2.9E-06 33.4 28.3 357 1-382 32-468 (468)
218 protein:vir:102668 Length: 547 94.2 0.0052 3.2E-06 33.1 23.5 336 1-383 7-478 (547)
219 protein:vir:94709 Length: 522 92.4 0.012 7.2E-06 31.2 24.3 354 1-383 13-457 (522)
220 protein:vir:2198 Length: 536 # 90.6 0.02 1.3E-05 29.9 21.3 361 1-383 14-477 (536)
221 protein:vir:10447 Length: 536 90.6 0.02 1.3E-05 29.9 21.2 357 1-383 14-477 (536)
222 protein:vir:105461 Length: 470 90.4 0.021 1.3E-05 29.8 28.5 355 1-382 7-470 (470)
223 protein:vir:9922 Length: 489 # 90.0 0.023 1.5E-05 29.5 25.4 368 1-383 21-482 (489)
224 protein:vir:3361 Length: 535 # 88.9 0.029 1.8E-05 29.0 23.6 353 1-383 15-462 (535)
225 protein:vir:97265 Length: 513 86.6 0.045 2.8E-05 28.0 30.0 364 3-383 1-492 (513)
226 protein:vir:95149 Length: 501 85.6 0.052 3.2E-05 27.6 25.2 367 1-383 1-501 (501)
227 protein:vir:1538 Length: 535 # 85.2 0.055 3.4E-05 27.5 23.4 351 1-383 15-462 (535)
228 protein:vir:101806 Length: 516 82.7 0.075 4.6E-05 26.8 23.9 374 1-383 4-515 (516)
229 protein:vir:101189 Length: 516 82.7 0.075 4.6E-05 26.8 23.9 374 1-383 4-515 (516)
230 protein:vir:8883 Length: 543 # 82.6 0.076 4.7E-05 26.7 21.6 356 1-383 15-467 (543)
231 protein:vir:100598 Length: 516 80.8 0.091 5.7E-05 26.3 24.1 374 1-383 4-515 (516)
232 protein:vir:98265 Length: 524 78.4 0.11 7.1E-05 25.7 24.7 375 1-383 4-523 (524)
233 protein:vir:94956 Length: 452 77.9 0.12 7.4E-05 25.6 29.8 359 1-383 1-450 (452)
234 protein:vir:94572 Length: 535 77.1 0.13 7.9E-05 25.5 21.7 354 1-383 16-465 (535)
235 protein:vir:6596 Length: 521 # 69.5 0.22 0.00014 24.2 25.7 378 1-383 8-520 (521)
236 protein:vir:104500 Length: 537 68.0 0.24 0.00015 23.9 25.3 376 1-383 3-522 (537)
237 protein:vir:78083 Length: 537 67.0 0.26 0.00016 23.8 29.0 364 1-383 8-510 (537)
238 protein:vir:5665 Length: 511 # 66.9 0.26 0.00016 23.8 23.4 376 1-383 1-510 (511)
239 protein:vir:99672 Length: 532 65.6 0.28 0.00017 23.6 23.4 352 1-383 15-468 (532)
240 protein:vir:80165 Length: 651 65.1 0.29 0.00018 23.5 17.0 329 1-383 202-590 (651)
241 protein:vir:81017 Length: 521 64.3 0.3 0.00019 23.4 25.4 378 1-383 8-520 (521)
242 protein:vir:103330 Length: 517 60.7 0.37 0.00023 23.0 17.4 352 1-383 13-477 (517)
243 protein:vir:78942 Length: 510 55.7 0.47 0.00029 22.4 23.3 345 1-383 1-449 (510)
244 protein:vir:103765 Length: 549 54.1 0.51 0.00032 22.2 23.9 339 1-383 13-479 (549)
245 protein:vir:78393 Length: 489 53.3 0.53 0.00033 22.1 27.8 366 1-383 22-486 (489)
246 protein:vir:80453 Length: 535 51.8 0.57 0.00035 21.9 25.0 367 1-383 32-519 (535)
247 protein:vir:6322 Length: 510 # 51.8 0.57 0.00036 21.9 21.7 347 1-383 6-452 (510)
248 protein:vir:106282 Length: 521 51.1 0.59 0.00037 21.8 25.1 378 1-383 6-520 (521)
249 protein:vir:7017 Length: 515 # 50.7 0.6 0.00037 21.8 21.8 362 1-383 16-510 (515)
250 protein:vir:108049 Length: 524 50.3 0.61 0.00038 21.7 23.8 379 1-383 8-523 (524)
251 protein:vir:106999 Length: 564 46.9 0.72 0.00045 21.4 24.2 374 1-383 2-530 (564)
252 protein:vir:95315 Length: 559 46.7 0.73 0.00045 21.3 23.3 341 1-383 10-475 (559)
253 protein:vir:104892 Length: 558 45.6 0.76 0.00047 21.2 25.7 378 1-383 2-535 (558)
254 protein:vir:94599 Length: 641 45.1 0.78 0.00048 21.2 17.1 363 1-383 30-546 (641)
255 protein:vir:96988 Length: 516 36.8 1.2 0.00071 20.2 21.6 347 1-383 17-453 (516)
256 protein:vir:96783 Length: 488 33.9 1.3 0.00082 19.9 26.9 357 1-374 14-488 (488)
257 protein:vir:102330 Length: 451 32.6 1.4 0.00088 19.8 26.4 356 1-378 7-451 (451)
258 protein:vir:1785 Length: 555 # 30.3 1.6 0.00098 19.5 23.3 345 1-383 1-470 (555)
259 protein:vir:6896 Length: 523 # 29.0 1.7 0.0011 19.3 24.2 375 1-383 1-522 (523)
260 protein:vir:107404 Length: 555 27.9 1.8 0.0011 19.2 23.2 336 1-383 11-475 (555)
261 protein:vir:98506 Length: 555 27.9 1.8 0.0011 19.2 23.2 336 1-383 11-475 (555)
262 protein:vir:107822 Length: 555 27.9 1.8 0.0011 19.2 23.2 336 1-383 11-475 (555)
263 protein:vir:78696 Length: 542 27.3 1.9 0.0012 19.1 22.8 345 1-383 1-460 (542)
264 protein:vir:103177 Length: 533 26.9 1.9 0.0012 19.1 25.1 380 1-383 1-522 (533)
265 protein:vir:7208 Length: 524 # 26.0 2 0.0012 18.9 23.7 379 1-383 1-523 (524)
266 protein:vir:105641 Length: 516 25.2 2.1 0.0013 18.8 23.2 348 1-383 17-453 (516)
267 protein:vir:103458 Length: 524 24.8 2.1 0.0013 18.8 23.7 379 1-383 1-523 (524)

No 1
>protein:vir:4854 Length: 386 # NCBI annotation: putative portal
protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515
Probab=100.00 E-value=5.5e-97 Score=548.30 Aligned_cols=383 Identities=88% Similarity=1.339 Sum_probs=365.3

Q ss_pred             CchhhhhhcCCcc---cccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhhccCC
Q lcl|NC_018285.    1 MPIFNLATESPPN---NQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVDNP   77 (383)
Q Consensus         1 Mglf~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~P   77 (383)
                      ||||++.++.+..   ...++....++.+++.+.++.+++.++|+++|+|++||++||++||++|++++++..+.|+.+|
T Consensus         1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~p   80 (386)
T protein:vir:48    1 MPIFNITNLATESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASRKQLQGIIDNP   80 (386)
T ss_pred             CcccccccccccccccccccccccccchhcccccCCceechhhhhcchHHHHHHHHHHHhhccCceeeccchhHHHhhcC
Confidence            9999998765543   3445556667778888889999999999999999999999999999999999999999999999

Q ss_pred             CccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecccceE
Q lcl|NC_018285.   78 SNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDIL  157 (383)
Q Consensus        78 N~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvi  157 (383)
                      |++||+++||+.++.+++++||||++|+|+.+|++++|+||+|++|++..+.++..++|++..++...+..+.++++|||
T Consensus        81 N~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evi  160 (386)
T protein:vir:48   81 SNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDDPRIPPKQHVPQGDVL  160 (386)
T ss_pred             CCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEEecCccccceeEecCccEE
Confidence            99999999999999999999999999999999999999999999999999988889999999888777788899999999

Q ss_pred             EeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecC
Q lcl|NC_018285.  158 HFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLD  237 (383)
Q Consensus       158 h~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~  237 (383)
                      |+|++++++.++|.||+..+..++....++++++.++|+||++|+++++.++.+++|+++++++.|..+.+++|+++||+
T Consensus       161 h~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~  240 (386)
T protein:vir:48  161 HFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVLD  240 (386)
T ss_pred             EecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHhhcCCCCceecC
Confidence            99999999989999999999999999999999999999999999999999999999999999999999999999999999

Q ss_pred             CCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhh
Q lcl|NC_018285.  238 DLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCDVD  317 (383)
Q Consensus       238 ~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~e  317 (383)
                      +|++|++++.+++|+||+|++++++++||++|||||.+||+.+++++++++.++|++.||.|+++.|+++|+++|+++++
T Consensus       241 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~  320 (386)
T protein:vir:48  241 DLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSLDLYNKAVSRYLRPFLSELSQKLSCDVD  320 (386)
T ss_pred             CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhh
Confidence            99999999999999999999999999999999999999999888999999999999999999999999999999999999

Q ss_pred             ccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCCC
Q lcl|NC_018285.  318 ADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETNGQD  383 (383)
Q Consensus       318 ~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~d  383 (383)
                      +++...++.+.+.+++++++++++|++|+||+|+++|++|++++|++..+.++..+.+|||++++|
T Consensus       321 ~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~~~~~~~~~~~~~~~gGd~~~~~  386 (386)
T protein:vir:48  321 ADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILPKELPEGENPNKTTLKGGEINGED  386 (386)
T ss_pred             cchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCCccchhhcCCCCCccCCCCCCCCC
Confidence            999999999999999999999999999999999999999999999999999999999999999999

No 2
>protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630
Probab=100.00 E-value=1.7e-95 Score=540.20 Aligned_cols=382 Identities=89% Similarity=1.364 Sum_probs=364.8

Q ss_pred             CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhhccCCCcc
Q lcl|NC_018285.    1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVDNPSNS   80 (383)
Q Consensus         1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~PN~~   80 (383)
                      ||||++++++++.....+.+.....+++++.++..++.++|+++++|++||++||++||++|++++++..+.|+.+||++
T Consensus         1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~L~~~PN~~   80 (382)
T protein:vir:48    1 MPIFNLATESPPDNQGGFFDVVDSDFLASLKGNEWVSAETALRNSDLFSIINQLSNDLATVKLITSRKKLQGIVDNPSNN   80 (382)
T ss_pred             CccccccccCCcccccccccchhhhccccccCCcccchHhhhccHHHHHHHHHHHHhhccCceeeecchhhhhhhhcCCC
Confidence            99999999888887777777777778888888999999999999999999999999999999999999999999999999

Q ss_pred             CCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecccceEEec
Q lcl|NC_018285.   81 ANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDILHFR  160 (383)
Q Consensus        81 ~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~  160 (383)
                      ||+++||+.++.+++++||||++++|+.+|++++|+||+|++|++..+.+++.++|++..++...+..+.++++||||++
T Consensus        81 ~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evih~~  160 (382)
T protein:vir:48   81 ANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDDPRIPPKQHVPQNDVLHFR  160 (382)
T ss_pred             CCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCeEEEEEEecCccccceeEEcCccEEEec
Confidence            99999999999999999999999999999999999999999999999988899999999888777788899999999999

Q ss_pred             cCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCc
Q lcl|NC_018285.  161 LLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLE  240 (383)
Q Consensus       161 ~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~  240 (383)
                      ++++++.++|.||+.++..++....++++++.++|+||+.|++++++++.+++|+++++++.|..+.+++|+++|+++|+
T Consensus       161 ~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~  240 (382)
T protein:vir:48  161 LLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVLDDLE  240 (382)
T ss_pred             CCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhhccCCCCeeEcCCCc
Confidence            99999989999999999999999999999999999999999999999999999999999999999999999999999999

Q ss_pred             eeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccc
Q lcl|NC_018285.  241 DFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCDVDADI  320 (383)
Q Consensus       241 ~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~e~~~  320 (383)
                      +|++++.++.|+||+|.+++++++||++|||||.+||+..++++++++.++|++.||.|++++|+++|+++|++++++++
T Consensus       241 ~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~~~~~~~~~~~l~p~~~~i~~~l~~~l~~~~~~~~  320 (382)
T protein:vir:48  241 DFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQSSLEMSSDLYSKAVSRYLRPFLSELSQKLSCDVDADI  320 (382)
T ss_pred             eEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhhhh
Confidence            99999999999999999999999999999999999999888888999999999999999999999999999999999999

Q ss_pred             hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCCC
Q lcl|NC_018285.  321 FPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETNGQD  383 (383)
Q Consensus       321 ~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~d  383 (383)
                      ...++.+...+++++++++++|++|+||+|++++..++.+++++..+++ .++++|||++++|
T Consensus       321 ~~~~~~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~~-~~~~~GGd~~~~~  382 (382)
T protein:vir:48  321 FPAVDPTGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGENP-NSTLKGGEEDGQD  382 (382)
T ss_pred             hhhhccchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhcC-CCCCCCCCCCCCC
Confidence            9999999999999999999999999999999999999999999998764 3578999999999

No 3
>protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075
Probab=100.00 E-value=1.7e-93 Score=529.24 Aligned_cols=383 Identities=87% Similarity=1.294 Sum_probs=359.2

Q ss_pred             CchhhhhhcCCcc---cccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhhccCC
Q lcl|NC_018285.    1 MPIFNLATESPPN---NQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVDNP   77 (383)
Q Consensus         1 Mglf~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~P   77 (383)
                      ||||+++++++..   ....+.+...+.+.+.+.++..++.++||++|+|++||++||++||++|++++++..+.|+.+|
T Consensus         1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~P   80 (386)
T protein:vir:49    1 MPIFNITNLATESPPINQESFFDIADSDFLASLNSSEWVSAENALKNSDLFSIISQLSNDLATAKITTSRKQLQGIVDNP   80 (386)
T ss_pred             CchhhhhccCCCCcccchhhhhhhhhccccccccCCceechhhhhccHHHHHHHHHHHHHhhhCceeeccchhhhhhhcc
Confidence            9999998765543   2333444445566677778889999999999999999999999999999999999999999999

Q ss_pred             CccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecccceE
Q lcl|NC_018285.   78 SNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDIL  157 (383)
Q Consensus        78 N~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvi  157 (383)
                      |++||+++||+.++.+++++||||++|+|+.+|++++|+||+|++|++..+.+++..+|.+...+...+..+.++++|||
T Consensus        81 N~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evi  160 (386)
T protein:vir:49   81 SNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFDDPHIAPKQHVPQNDIL  160 (386)
T ss_pred             CCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEEEcCccccceeEEccccEE
Confidence            99999999999999999999999999999999999999999999999999988899999998887777788899999999

Q ss_pred             EeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecC
Q lcl|NC_018285.  158 HFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLD  237 (383)
Q Consensus       158 h~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~  237 (383)
                      |++++++++.++|+||+.++..++....++++++.++|+||+.|++++++++..++++++++++.|..+.+++|+++|++
T Consensus       161 h~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~  240 (386)
T protein:vir:49  161 HFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLD  240 (386)
T ss_pred             EecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHHHhccCCCCceecC
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999

Q ss_pred             CCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhh
Q lcl|NC_018285.  238 DLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCDVD  317 (383)
Q Consensus       238 ~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~e  317 (383)
                      +|++|++++.++.|+||+|++++++++||++|||||++||+..+++++.++.++|+..+|.|++++|+++|+++|+++++
T Consensus       241 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~~~i~~~l~~i~~~~~~~l~~~~~  320 (386)
T protein:vir:49  241 DLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYNIYFKSVSRYLRPFVSEMSKKLSCEVD  320 (386)
T ss_pred             CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhc
Confidence            99999999999999999999999999999999999999998776667777888999999999999999999999999999

Q ss_pred             ccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCCC
Q lcl|NC_018285.
318 ADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) +++...++.|..++++.+++++++|++|+||+|++++..++.++|++..++++..+.||||+|++| T Consensus 321 ~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~~~~~~~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:49 321 VDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKELPDGKNPNRTSLKGGEINEQD 386 (386) T ss_pred ccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCcCcchhccCCCCCCCCCCCCCC Confidence 999999999999999999999999999999999999999999999999998888999999999999 No 4 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=100.00 E-value=2.5e-89 Score=506.34 Aligned_cols=380 Identities=54% Similarity=0.899 Sum_probs=337.7 Q ss_pred CchhhhhhcCCccccccc-cc----ccchhhcccc--cCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGF-FD----ITDPEFLATL--NGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGI 73 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~-~~----~~~~~~~~~~--~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l 73 (383) ||||+++++++....... .. ..++.+++.+ .++..|+.++||++++|++||++||++||++|++++++..+.| T Consensus 3 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~~~l 82 (392) T protein:vir:74 3 LPILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKNQGI 82 (392) T ss_pred chhhhhhhcccCcccccccccccccCchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchhhhh Confidence 999998887654322111 11 1223333322 3577889999999999999999999999999999999999999 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecc Q lcl|NC_018285. 
74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQ 153 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 153 (383) +.+||++||+++||+.++.+++++||||++++|+.+|++++|+||+|++|++..+.+++.++|++...++..+..+.+++ T Consensus 83 ~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 162 (392) T protein:vir:74 83 IDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQ 162 (392) T ss_pred hhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCCccceeEEEcC Confidence 99999999999999999999999999999999999999999999999999999998899999999988877777889999 Q ss_pred cceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCC--HHHHHHHHHHHHHhhcCCc Q lcl|NC_018285. 154 SDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGL--LDFKTKVSRSRQAMKQMQG 231 (383) Q Consensus 154 ~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~--~e~~~~~~~~~~~~~~~~g 231 (383) +||||++++++++.++|+||+.++..+|....++++++.++|+||+.|++++++++... +++++++.+.+ .+..|+| T Consensus 163 ~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~~~-~~~~n~g 241 (392) T protein:vir:74 163 SDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSF-MKRSRSG 241 (392) T ss_pred ccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHH-hccccCC Confidence 99999999999988899999999999999999999999999999999999999987643 34444444444 3567889 Q ss_pred ceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 
232 GPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQK 311 (383) Q Consensus 232 ~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~ 311 (383) +++||++|++|++++.+++|+||+|++++++++||++|||||++||+.+++++++++.++|+.+||.|++++|+++|+++ T Consensus 242 ~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e~~~~~~~~~l~p~~~~ie~~l~~~ 321 (392) T protein:vir:74 242 GPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAISELEYK 321 (392) T ss_pred CeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999888788889999999999999999999999999 Q ss_pred hcchhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 312 LSCDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 312 l~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) |++++++|....++.|...+++.+++++++|++|+||+|+++...|++++|+|+.++++ +++|||.++-= T Consensus 322 l~~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~r~~enl~--~~~~Gd~~~p~ 391 (392) T protein:vir:74 322 LSDHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTN--KKTTGQSNEPV 391 (392) T ss_pred ccchhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCccccchhcCCC--CCCCCCCCCCC Confidence 99999999999999999999999999999999999999999999999999999998765 67888753222 No 5 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=100.00 E-value=8.5e-89 Score=503.42 Aligned_cols=381 Identities=87% Similarity=1.306 Sum_probs=341.3 Q ss_pred CchhhhhhcCCcc---cccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhhccCC Q lcl|NC_018285. 
1 MPIFNLATESPPN---NQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVDNP 77 (383) Q Consensus 1 Mglf~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~P 77 (383) ||||++....++. ....+.+..++.+++.+.++.+++.++||++++|++||++||++||++|++++++..+.|+.+| T Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~~~~~l~~~P 80 (384) T protein:vir:49 1 MPIFNITNLATESPPSNQDSFFDITDPEFLDALNGSEWVSAETALKNSDLFSIISQLSNDLATAKITTSRKQLQGIVDNP 80 (384) T ss_pred CccccccccCcccccccchhhccccchhhcccccCCceechhhhhccHHHHHHHHHHHHHHhhCceeeecchhhhhhhcc Confidence 9999987655443 3345556667778888889999999999999999999999999999999999999999999999 Q ss_pred CccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecccceE Q lcl|NC_018285. 78 SNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDIL 157 (383) Q Consensus 78 N~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvi 157 (383) |++||+++||+.++.+++++||||++++|+.+|++++|+||+|++|++..+.+++.++|++...++..+..+.++++||| T Consensus 81 N~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eVi 160 (384) T protein:vir:49 81 SNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFDDPRIPPKQHVPQGDIL 160 (384) T ss_pred CCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCccccceeEecCccEE Confidence 99999999999999999999999999999999999999999999999999888889999999988877888999999999 Q ss_pred EeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecC Q lcl|NC_018285. 
158 HFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLD 237 (383) Q Consensus 158 h~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~ 237 (383) |++++++++.++|+||+.++...+....++++++.++|+||+.|++++++++..+++++++..+.+..+.+|+|+++|++ T Consensus 161 h~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~ 240 (384) T protein:vir:49 161 HFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRSRQAMKQMQGGPLVLD 240 (384) T ss_pred EecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhcccCCccceecC Confidence 99999999889999999999999999999999999999999999999999999988888887788888889999999999 Q ss_pred CCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhh Q lcl|NC_018285. 238 DLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCDVD 317 (383) Q Consensus 238 ~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~e 317 (383) +|++|++++.++.|+|++|.+++++++||++|||||++||+..+++++.++.++++..++.|.+.+|.++++..|..+++ T Consensus 241 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~~l~ 320 (384) T protein:vir:49 241 DLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLRPFVSELSKKLSCEVD 320 (384) T ss_pred CCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhh Confidence 99999999999999999999999999999999999999998655444445555566666666666666666666666666 Q ss_pred ccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
318 ADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) .++....+.+...++++++.++++|++|++|+|++++..|+++||+|+.+++ .|++|||++++= T Consensus 321 ~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~ne~r~~~~~--~p~~gGd~~~~~ 384 (384) T protein:vir:49 321 ADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILPKDLPEGETD--STLKGGETNEQY 384 (384) T ss_pred hhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCChhHHHHcCC--CCCCCCCCCCCC Confidence 6777777888899999999999999999999999999999999999998765 588999998877 No 6 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=100.00 E-value=8.7e-89 Score=503.35 Aligned_cols=378 Identities=46% Similarity=0.776 Sum_probs=337.2 Q ss_pred CchhhhhhcCCcccccccccccchhh---cccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhhccCC Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEF---LATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVDNP 77 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~P 77 (383) ||||++.++..... +..++.+ +....++.+|+.++||++++|++||++||++||++|+++.++..+.|+.+| T Consensus 1 M~~f~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~v~~~~al~~~~V~~~v~~ia~~ia~~p~~~~~~~~~~l~~~P 75 (397) T protein:vir:38 1 MPLLKLNKSHSQGF-----SLNDPDWVNFLTGGEAQKYVSADTALKNSDIFSLIMQLSGDLAMVRYTSESDRSQSIISNP 75 (397) T ss_pred CcchhhhhcccCcc-----cCCchhhhhhhcCCcCCceechHHhhccHHHHHHHHHHHHHHhhCcccccccHHHHHHhcC Confidence 99999866432211 1112222 233346778999999999999999999999999999999999999999999 Q ss_pred CccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecccceE Q lcl|NC_018285. 
78 SNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDIL 157 (383) Q Consensus 78 N~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvi 157 (383) |++||+++||+.++.+++++||||++++|+.+|++++|+||+|++|++..+.++..++|++.......+..+.++++||| T Consensus 76 N~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eii 155 (397) T protein:vir:38 76 SVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDGSGLIYNINFDEPAIGYMENVPAADVI 155 (397) T ss_pred CCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEeccccccceeEecCccEE Confidence 99999999999999999999999999999999999999999999999999989999999999888777888899999999 Q ss_pred EeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHH--hhcCCcceee Q lcl|NC_018285. 158 HFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQA--MKQMQGGPLV 235 (383) Q Consensus 158 h~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~--~~~~~g~~~v 235 (383) |++++++++.++|.||+.++...+....++++++.++|+||++|+++++.++.+++++++++++.|+. +..|+|+++| T Consensus 156 h~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~v 235 (397) T protein:vir:38 156 HIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVV 235 (397) T ss_pred EecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCcee Confidence 99999999988999999999999999999999999999999999999999999999999998887753 4467899999 Q ss_pred cCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch Q lcl|NC_018285. 
236 LDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCD 315 (383) Q Consensus 236 l~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~ 315 (383) +++|++|++++.++.|+||+|.+++++++||++|||||.+||+..+.+++.++.+.|+.+||+|+++.|+++||++|+++ T Consensus 236 l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~ie~~ln~~l~~~ 315 (397) T protein:vir:38 236 IDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSITQISGQYAKSLNRYVQAIVGELNDKLHAN 315 (397) T ss_pred cCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhccCh Confidence 99999999999999999999999999999999999999999986665566677788999999999999999999999999 Q ss_pred hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCC-----CCCCCCCCCCCCC Q lcl|NC_018285. 316 VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPN-----RTILKGGETNGQD 383 (383) Q Consensus 316 ~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~-----~~~~~ggd~~~~d 383 (383) +++++...++.|...+++.+++++++|++|+||+|+++|++|++++|.+..+..+ ....+||++++++ T Consensus 316 ~~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (397) T protein:vir:38 316 ISANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKEPQQAIQLIQQEGGENDGNN 388 (397) T ss_pred hcccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccccccccccccccccccccCCCCCCC Confidence 9999988999999999999999999999999999999999999988876544322 2244677766665 No 7 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=100.00 E-value=5.2e-88 Score=499.10 Aligned_cols=380 Identities=54% Similarity=0.902 Sum_probs=336.3 Q ss_pred CchhhhhhcCCcccc----cccccc-cchhhcccc--cCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhh Q lcl|NC_018285. 
1 MPIFNLATESPPNNQ----GGFFDI-TDPEFLATL--NGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGI 73 (383) Q Consensus 1 Mglf~~~~~~~~~~~----~~~~~~-~~~~~~~~~--~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l 73 (383) ||||++++++...+. ..+... .+..+++.+ ..+..|+.+.||++++|++||++||++||++|++++++..+.| T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~l 82 (392) T protein:vir:10 3 LPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKNQGI 82 (392) T ss_pred chhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchhhhH Confidence 999999876543322 111111 122222222 2467789999999999999999999999999999999999999 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecc Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQ 153 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 153 (383) +.+||++||+++||+.++.+++++||||++++|+.+|++++|+||+|++|++..+.+++.++|+++..++..+....+++ T Consensus 83 ~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 162 (392) T protein:vir:10 83 IDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQ 162 (392) T ss_pred hhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCcccceeEEEcc Confidence 99999999999999999999999999999999999999999999999999999988899999999988877777889999 Q ss_pred cceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCC--CHHHHHHHHHHHHHhhcCCc Q lcl|NC_018285. 154 SDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGG--LLDFKTKVSRSRQAMKQMQG 231 (383) Q Consensus 154 ~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~--~~e~~~~~~~~~~~~~~~~g 231 (383) +||||++++++++.++|+||+.++..++....++++++.++|+||++|++++++++.. 
++++++++.+.+ .+..++| T Consensus 163 ~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~-~~~~~~g 241 (392) T protein:vir:10 163 SDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSF-MKRSRSG 241 (392) T ss_pred ccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHH-hccccCC Confidence 9999999999998889999999999999999999999999999999999999998764 333444444433 3457889 Q ss_pred ceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 232 GPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQK 311 (383) Q Consensus 232 ~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~ 311 (383) +++|+++|++|++++.+++|+||+|.+++++++||++|||||++||+.+++++++++.++|+.+||.|++++|+++|+++ T Consensus 242 ~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~ 321 (392) T protein:vir:10 242 GPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAISELEYK 321 (392) T ss_pred CeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999998888788888999999999999999999999999 Q ss_pred hcchhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
312 LSCDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 312 l~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) |++++++|....++.|...+++.+++++++|++|+||+|+++...|++++|+|..++++ |.+|||.++-- T Consensus 322 L~~~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~--~~~~Gd~~~p~ 391 (392) T protein:vir:10 322 LSDHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTN--KKTTGQSNEPV 391 (392) T ss_pred ccccccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCC--CCCCCCCCCCC Confidence 99999999999999999999999999999999999999999999999999999998765 67788753333 No 8 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=100.00 E-value=5.2e-88 Score=499.10 Aligned_cols=380 Identities=54% Similarity=0.902 Sum_probs=336.3 Q ss_pred CchhhhhhcCCcccc----cccccc-cchhhcccc--cCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQ----GGFFDI-TDPEFLATL--NGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGI 73 (383) Q Consensus 1 Mglf~~~~~~~~~~~----~~~~~~-~~~~~~~~~--~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l 73 (383) ||||++++++...+. ..+... .+..+++.+ ..+..|+.+.||++++|++||++||++||++|++++++..+.| T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~l 82 (392) T protein:vir:39 3 LPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKNQGI 82 (392) T ss_pred chhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchhhhH Confidence 999999876543322 111111 122222222 2467789999999999999999999999999999999999999 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecc Q lcl|NC_018285. 
74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQ 153 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 153 (383) +.+||++||+++||+.++.+++++||||++++|+.+|++++|+||+|++|++..+.+++.++|+++..++..+....+++ T Consensus 83 ~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 162 (392) T protein:vir:39 83 IDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQ 162 (392) T ss_pred hhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCcccceeEEEcc Confidence 99999999999999999999999999999999999999999999999999999988899999999988877777889999 Q ss_pred cceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCC--CHHHHHHHHHHHHHhhcCCc Q lcl|NC_018285. 154 SDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGG--LLDFKTKVSRSRQAMKQMQG 231 (383) Q Consensus 154 ~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~--~~e~~~~~~~~~~~~~~~~g 231 (383) +||||++++++++.++|+||+.++..++....++++++.++|+||++|++++++++.. ++++++++.+.+ .+..++| T Consensus 163 ~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~-~~~~~~g 241 (392) T protein:vir:39 163 SDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSF-MKRSRSG 241 (392) T ss_pred ccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHH-hccccCC Confidence 9999999999998889999999999999999999999999999999999999998764 333444444433 3457889 Q ss_pred ceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 
232 GPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQK 311 (383) Q Consensus 232 ~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~ 311 (383) +++|+++|++|++++.+++|+||+|.+++++++||++|||||++||+.+++++++++.++|+.+||.|++++|+++|+++ T Consensus 242 ~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~ 321 (392) T protein:vir:39 242 GPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAISELEYK 321 (392) T ss_pred CeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999998888788888999999999999999999999999 Q ss_pred hcchhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 312 LSCDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 312 l~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) |++++++|....++.|...+++.+++++++|++|+||+|+++...|++++|+|..++++ |.+|||.++-- T Consensus 322 L~~~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~--~~~~Gd~~~p~ 391 (392) T protein:vir:39 322 LSDHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTN--KKTTGQSNEPV 391 (392) T ss_pred ccccccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCC--CCCCCCCCCCC Confidence 99999999999999999999999999999999999999999999999999999998765 67788753333 No 9 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=100.00 E-value=1.3e-86 Score=491.41 Aligned_cols=379 Identities=20% Similarity=0.243 Sum_probs=311.9 Q ss_pred CchhhhhhcCCcccccccccccchhhccc-ccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--------hh Q lcl|NC_018285. 
1 MPIFNLATESPPNNQGGFFDITDPEFLAT-LNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--------MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--------~~ 71 (383) ||+|.+..++...............+.+. ..++..++.+.||++++|++||++||++||++|+++++.. .+ T Consensus 26 ~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~acv~~Ia~~iA~lpl~~~~~~~~~~~~~~~~ 105 (441) T protein:vir:98 26 VGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVN 105 (441) T ss_pred cccccccccccccCCCcchHHHHHHhhcccccCccccchhhhhccHHHHHHHHHHHHhhccCceEEecCCcccccchHHH Confidence 88887655442221111111101111111 2245678999999999999999999999999999998743 34 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceee Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHV 151 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~ 151 (383) .|+.+||++||+++||+.++.+++++||||++|+|+.+|+|++|+||+|++|++..+.++...++....+....+..+.+ T Consensus 106 lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~ 185 (441) T protein:vir:98 106 LLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKLDARGRLYYFHQRIDSNGNNIERNV 185 (441) T ss_pred HHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEECCCCcEEEEEEEeccCcceeeEEE Confidence 57889999999999999999999999999999999999999999999999999998776655554444444445566889 Q ss_pred cccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCC-HHHHHHHHHHHHH---hh Q lcl|NC_018285. 152 PQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGL-LDFKTKVSRSRQA---MK 227 (383) Q Consensus 152 ~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~-~e~~~~~~~~~~~---~~ 227 (383) +++||||+|+++.++ ++|+||+..+..++....++++++.++|+||+.|++++++++.++ +++++++++.|.. +. 
T Consensus 186 ~~~dviHir~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~G~ 264 (441) T protein:vir:98 186 KFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGT 264 (441) T ss_pred ccccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCc Confidence 999999999887665 789999999999999999999999999999999999999999874 6777888888854 34 Q ss_pred cCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 228 QMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSE 307 (383) Q Consensus 228 ~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~ 307 (383) +|+|+++||++|++|++++.+++|+||+|.+++++++||++|||||++||...++++.+++...| .+||+|++++|+++ T Consensus 265 ~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~y-~~tl~P~~~~ie~~ 343 (441) T protein:vir:98 265 KQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDY-LSTLKPYITCVCAE 343 (441) T ss_pred cccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHH-HHHHHHHHHHHHHH Confidence 67899999999999999999999999999999999999999999999999866656666655444 56999999999999 Q ss_pred HHHhhcch-----hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHH-hCCCC---------- Q lcl|NC_018285. 308 LSQKLSCD-----VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKG-ENPNR---------- 371 (383) Q Consensus 308 l~~~l~~~-----~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~-~~~~~---------- 371 (383) |+++|+++ ++||...+.+.|...+++.++.++++|++|+||+|+++|++|++++|.... -+.+. 
T Consensus 344 ln~~L~~~~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~~~~~~n~~~~~~~~~~q 423 (441) T protein:vir:98 344 LNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQ 423 (441) T ss_pred HHhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccc Confidence 99999864 678888899999999999999999999999999999999999988773221 11111 Q ss_pred --------CCCCCCCCCC Q lcl|NC_018285. 372 --------TILKGGETNG 381 (383) Q Consensus 372 --------~~~~ggd~~~ 381 (383) ...+|||.|+ T Consensus 424 ~~~~~~~~~~~kgGe~ne 441 (441) T protein:vir:98 424 MNKSRATDKKLKGGEENE 441 (441) T ss_pred cccccccccccCCCCCCC Confidence 2457888777 No 10 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=100.00 E-value=3.1e-86 Score=489.36 Aligned_cols=378 Identities=18% Similarity=0.219 Sum_probs=322.5 Q ss_pred CchhhhhhcCCcccc---cc-----cccccchhhcccc--cCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch- Q lcl|NC_018285. 1 MPIFNLATESPPNNQ---GG-----FFDITDPEFLATL--NGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~---~~-----~~~~~~~~~~~~~--~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~- 69 (383) ||||+++++++.... .. ...+++..++..+ ..+..++.+.||++++|++||++||++||++|+++++.. T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~~~ 80 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGKLSLKIYKDKE 80 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEecCc Confidence 999999876544221 11 1122333333322 235568999999999999999999999999999998644 Q ss_pred -------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-----eeEEE Q lcl|NC_018285. 
70 -------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-----GLYYN 137 (383) Q Consensus 70 -------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-----~~~y~ 137 (383) .+.|+.+||++||+++||+.++.+++++||||++|+|+.+|+|++|+||+|++|++..++++. .++|. T Consensus 81 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~~~~~~~~y~ 160 (422) T protein:vir:13 81 EYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFLSSLSKVWYV 160 (422) T ss_pred ccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcceeccceEEEE Confidence 234778999999999999999999999999999999999999999999999999999887653 45566 Q ss_pred EeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHH Q lcl|NC_018285. 138 VTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKT 217 (383) Q Consensus 138 ~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~ 217 (383) +...+ +....++++||||++.+.+.++++|.||+..+..++....++++++.++|+||++|++++++++.+++++++ T Consensus 161 ~~~~~---g~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~ 237 (422) T protein:vir:13 161 VTDKN---GKEHKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKK 237 (422) T ss_pred EEeCC---CeEEEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHH Confidence 55433 456789999999999876677789999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHH Q lcl|NC_018285. 218 KVSRSRQAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNV 292 (383) Q Consensus 218 ~~~~~~~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~ 292 (383) ++++.|... 
.+|+|+++|+++|++|++++.++.|+||+|.+++++++||++|||||++||+ .+++++.+++..+| T Consensus 238 ~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~f 317 (422) T protein:vir:13 238 IFKKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKDF 317 (422) T ss_pred HHHHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHH Confidence 999988654 4578999999999999999999999999999999999999999999999996 45678889999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchh Q lcl|NC_018285. 293 YSKAVARYLRPFLSELSQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELP 364 (383) Q Consensus 293 ~~~~l~P~~~~i~~~l~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~ 364 (383) +++||.|++++|+++|+++|+++ ++||.....+.|..++++.+.+++++|++|+||+|+++|++|++++|.. T Consensus 318 ~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~ 397 (422) T protein:vir:13 318 YVTTLQSSLTVYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPVEGGDRL 397 (422) T ss_pred HHHHHHHHHHHHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCee Confidence 99999999999999999999864 4567777888899999999999999999999999999999999887744 Q ss_pred HHhCCCCC--------CCCCCCCCCC Q lcl|NC_018285. 365 KGENPNRT--------ILKGGETNGQ 382 (383) Q Consensus 365 ~~~~~~~~--------~~~ggd~~~~ 382 (383) .. ..|.. 
-.+|||.+|+ T Consensus 398 ~~-~~n~~~l~~~~~~~~~~g~~~g~ 422 (422) T protein:vir:13 398 LV-NGNMIPIEMAGEQYKKGGEKGGK 422 (422) T ss_pred ee-ccCccchhhcccccccCCCcCCC Confidence 32 22332 2468888887 No 11 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=100.00 E-value=2.8e-86 Score=489.61 Aligned_cols=378 Identities=20% Similarity=0.258 Sum_probs=312.9 Q ss_pred CchhhhhhcCCcccc-cccccccchhhcc-cccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--------h Q lcl|NC_018285. 1 MPIFNLATESPPNNQ-GGFFDITDPEFLA-TLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--------M 70 (383) Q Consensus 1 Mglf~~~~~~~~~~~-~~~~~~~~~~~~~-~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--------~ 70 (383) ||||++..++..... .....+... +.+ ....+..++..+||++++|++||++||+++|++|+++++.. . T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~~~~~ 79 (416) T protein:vir:81 1 MGIFYKNEKRDLQYNEDDLQMMVQT-LPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIV 79 (416) T ss_pred CCcccccccccccCCCcchhHHHHH-hccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCccccccchHH Confidence 999997665433211 111111111 112 22356678999999999999999999999999999998633 3 Q ss_pred hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCccccccee Q lcl|NC_018285. 71 QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQH 150 (383) Q Consensus 71 ~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~ 150 (383) +.|+.+||++||+++||+.++.+++++||||++|+|+.+|+|++||||+|++|++..+.++...++....+....+..+. 
T Consensus 80 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~ 159 (416) T protein:vir:81 80 NLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERN 159 (416) T ss_pred HHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEecCCCceeEEE Confidence 35778999999999999999999999999999999999999999999999999999877665555544555445556678 Q ss_pred ecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCC-HHHHHHHHHHHHHh--- Q lcl|NC_018285. 151 VPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGL-LDFKTKVSRSRQAM--- 226 (383) Q Consensus 151 ~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~-~e~~~~~~~~~~~~--- 226 (383) ++++||||+|+++.++ ++|+||+.++..+++...++++++.++|+||+.|++++++++.++ +++++++++.|... T Consensus 160 ~~~~evihir~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g 238 (416) T protein:vir:81 160 VKFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSG 238 (416) T ss_pred EccccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcC Confidence 9999999999876654 789999999999999999999999999999999999999998874 66778888887643 Q ss_pred hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 
227 KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLS 306 (383) Q Consensus 227 ~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~ 306 (383) ..++|+++||++|++|++++.+++|+||+|.+++++++||++|||||++||....+++.++ ...++.+||+|++++|++ T Consensus 239 ~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~-~~~~~~~~l~P~~~~ie~ 317 (416) T protein:vir:81 239 TKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITD-ANLDYLSTLKPYITCVCA 317 (416) T ss_pred ccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHH-HHHHHHHHHHHHHHHHHH Confidence 4678999999999999999999999999999999999999999999999997655555444 455667799999999999 Q ss_pred HHHHhhcch-----hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHH-hCCC---------- Q lcl|NC_018285. 307 ELSQKLSCD-----VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKG-ENPN---------- 370 (383) Q Consensus 307 ~l~~~l~~~-----~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~-~~~~---------- 370 (383) +|+++|+++ ++||...+.+.|...+++.++.++++|++|+||+|+++|++|++++|.... -+.+ T Consensus 318 ~ln~~l~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~ 397 (416) T protein:vir:81 318 ELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEY 397 (416) T ss_pred HHhhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccccccccccccc Confidence 999999863 678888888999999999999999999999999999999999988763221 1111 Q ss_pred --------CCCCCCCCCCC Q lcl|NC_018285. 
371 --------RTILKGGETNG 381 (383) Q Consensus 371 --------~~~~~ggd~~~ 381 (383) ..+.+|||.|+ T Consensus 398 ~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 398 QMNKSRATDKKLKGGEENE 416 (416) T ss_pred CcccccccccccCCCCCCC Confidence 12457999888 No 12 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=100.00 E-value=2.8e-86 Score=489.61 Aligned_cols=378 Identities=20% Similarity=0.258 Sum_probs=312.9 Q ss_pred CchhhhhhcCCcccc-cccccccchhhcc-cccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--------h Q lcl|NC_018285. 1 MPIFNLATESPPNNQ-GGFFDITDPEFLA-TLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--------M 70 (383) Q Consensus 1 Mglf~~~~~~~~~~~-~~~~~~~~~~~~~-~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--------~ 70 (383) ||||++..++..... .....+... +.+ ....+..++..+||++++|++||++||+++|++|+++++.. . T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~~~~~ 79 (416) T protein:vir:45 1 MGIFYKNEKRDLQYNEDDLQMMVQT-LPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIV 79 (416) T ss_pred CCcccccccccccCCCcchhHHHHH-hccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCccccccchHH Confidence 999997665433211 111111111 112 22356678999999999999999999999999999998633 3 Q ss_pred hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCccccccee Q lcl|NC_018285. 71 QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQH 150 (383) Q Consensus 71 ~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~ 150 (383) +.|+.+||++||+++||+.++.+++++||||++|+|+.+|+|++||||+|++|++..+.++...++....+....+..+. 
T Consensus 80 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~ 159 (416) T protein:vir:45 80 NLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERN 159 (416) T ss_pred HHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEecCCCceeEEE Confidence 35778999999999999999999999999999999999999999999999999999877665555544555445556678 Q ss_pred ecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCC-HHHHHHHHHHHHHh--- Q lcl|NC_018285. 151 VPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGL-LDFKTKVSRSRQAM--- 226 (383) Q Consensus 151 ~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~-~e~~~~~~~~~~~~--- 226 (383) ++++||||+|+++.++ ++|+||+.++..+++...++++++.++|+||+.|++++++++.++ +++++++++.|... T Consensus 160 ~~~~evihir~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g 238 (416) T protein:vir:45 160 VKFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSG 238 (416) T ss_pred EccccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcC Confidence 9999999999876654 789999999999999999999999999999999999999998874 66778888887643 Q ss_pred hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 
227 KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLS 306 (383) Q Consensus 227 ~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~ 306 (383) ..++|+++||++|++|++++.+++|+||+|.+++++++||++|||||++||....+++.++ ...++.+||+|++++|++ T Consensus 239 ~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~-~~~~~~~~l~P~~~~ie~ 317 (416) T protein:vir:45 239 TKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITD-ANLDYLSTLKPYITCVCA 317 (416) T ss_pred ccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHH-HHHHHHHHHHHHHHHHHH Confidence 4678999999999999999999999999999999999999999999999997655555444 455667799999999999 Q ss_pred HHHHhhcch-----hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHH-hCCC---------- Q lcl|NC_018285. 307 ELSQKLSCD-----VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKG-ENPN---------- 370 (383) Q Consensus 307 ~l~~~l~~~-----~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~-~~~~---------- 370 (383) +|+++|+++ ++||...+.+.|...+++.++.++++|++|+||+|+++|++|++++|.... -+.+ T Consensus 318 ~ln~~l~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~ 397 (416) T protein:vir:45 318 ELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEY 397 (416) T ss_pred HHhhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccccccccccccc Confidence 999999863 678888888999999999999999999999999999999999988763221 1111 Q ss_pred --------CCCCCCCCCCC Q lcl|NC_018285. 
371 --------RTILKGGETNG 381 (383) Q Consensus 371 --------~~~~~ggd~~~ 381 (383) ..+.+|||.|+ T Consensus 398 ~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 398 QMNKSRATDKKLKGGEENE 416 (416) T ss_pred CcccccccccccCCCCCCC Confidence 12457999888 No 13 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=100.00 E-value=2.9e-86 Score=489.53 Aligned_cols=379 Identities=20% Similarity=0.248 Sum_probs=309.8 Q ss_pred CchhhhhhcCCcc-cccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--------hh Q lcl|NC_018285. 1 MPIFNLATESPPN-NQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--------MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--------~~ 71 (383) ||||.+..++... +......+..........++..++.+.||++++|++||++||++||++|++++++. .+ T Consensus 26 ~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~ 105 (441) T protein:vir:94 26 VGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVN 105 (441) T ss_pred cccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHHHHhhccCceeeecCccccccchHHH Confidence 8888755443221 11111111110001112235568899999999999999999999999999998643 33 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceee Q lcl|NC_018285. 
72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHV 151 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~ 151 (383) .|+.+||++||+++||+.++.+++++||||++|+|+.+|+|++|+||+|++|++..+.++...++....+....+..+.+ T Consensus 106 lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~ 185 (441) T protein:vir:94 106 LLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNV 185 (441) T ss_pred HHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEeccCCceeEEEE Confidence 57789999999999999999999999999999999999999999999999999988776655444444444445566789 Q ss_pred cccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCC-CHHHHHHHHHHHHHh---h Q lcl|NC_018285. 152 PQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGG-LLDFKTKVSRSRQAM---K 227 (383) Q Consensus 152 ~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~-~~e~~~~~~~~~~~~---~ 227 (383) +++||||+|+++.++ ++|+||+.++..+|+...++++++.++|+||++|+++|++++.+ ++++++++++.|... . T Consensus 186 ~~~dvih~k~~~~dg-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~ 264 (441) T protein:vir:94 186 KFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGT 264 (441) T ss_pred ccccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCc Confidence 999999999876665 78999999999999999999999999999999999999999987 467788888888654 4 Q ss_pred cCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 228 QMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSE 307 (383) Q Consensus 228 ~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~ 307 (383) .|+|+++||++|++|++++.+++|+||+|.+++++++||++|||||++||....+++.+++. 
.++.+||+|++++|+++ T Consensus 265 ~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~-~~~~~tl~P~~~~ie~e 343 (441) T protein:vir:94 265 KQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDAN-LDYLSTLKPYITCVCAE 343 (441) T ss_pred cccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHH-HHHHHHHHHHHHHHHHH Confidence 67899999999999999999999999999999999999999999999999765555655554 45567999999999999 Q ss_pred HHHhhcc-----hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHH-hCCCC---------- Q lcl|NC_018285. 308 LSQKLSC-----DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKG-ENPNR---------- 371 (383) Q Consensus 308 l~~~l~~-----~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~-~~~~~---------- 371 (383) |+++|++ +++||.....+.|...+++.++.++++|++|+||+|+++|++|++++|.... -+.+. T Consensus 344 ln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~ 423 (441) T protein:vir:94 344 LNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQ 423 (441) T ss_pred HhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccc Confidence 9999985 4678888889999999999999999999999999999999999988773221 11122 Q ss_pred --------CCCCCCCCCC Q lcl|NC_018285. 372 --------TILKGGETNG 381 (383) Q Consensus 372 --------~~~~ggd~~~ 381 (383) .+.+|||.++ T Consensus 424 ~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 424 MNKSRATDKKLKGGEENE 441 (441) T ss_pred cccccccccccCCCCCCC Confidence 2346888887 No 14 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=100.00 E-value=2.9e-86 Score=489.53 Aligned_cols=379 Identities=20% Similarity=0.248 Sum_probs=309.8 Q ss_pred CchhhhhhcCCcc-cccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--------hh Q lcl|NC_018285. 
1 MPIFNLATESPPN-NQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--------MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--------~~ 71 (383) ||||.+..++... +......+..........++..++.+.||++++|++||++||++||++|++++++. .+ T Consensus 26 ~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~ 105 (441) T protein:vir:79 26 VGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVN 105 (441) T ss_pred cccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHHHHhhccCceeeecCccccccchHHH Confidence 8888755443221 11111111110001112235568899999999999999999999999999998643 33 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceee Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHV 151 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~ 151 (383) .|+.+||++||+++||+.++.+++++||||++|+|+.+|+|++|+||+|++|++..+.++...++....+....+..+.+ T Consensus 106 lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~ 185 (441) T protein:vir:79 106 LLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNV 185 (441) T ss_pred HHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEeccCCceeEEEE Confidence 57789999999999999999999999999999999999999999999999999988776655444444444445566789 Q ss_pred cccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCC-CHHHHHHHHHHHHHh---h Q lcl|NC_018285. 152 PQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGG-LLDFKTKVSRSRQAM---K 227 (383) Q Consensus 152 ~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~-~~e~~~~~~~~~~~~---~ 227 (383) +++||||+|+++.++ ++|+||+.++..+|+...++++++.++|+||++|+++|++++.+ ++++++++++.|... . 
T Consensus 186 ~~~dvih~k~~~~dg-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~ 264 (441) T protein:vir:79 186 KFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGT 264 (441) T ss_pred ccccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCc Confidence 999999999876665 78999999999999999999999999999999999999999987 467788888888654 4 Q ss_pred cCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 228 QMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSE 307 (383) Q Consensus 228 ~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~ 307 (383) .|+|+++||++|++|++++.+++|+||+|.+++++++||++|||||++||....+++.+++. .++.+||+|++++|+++ T Consensus 265 ~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~-~~~~~tl~P~~~~ie~e 343 (441) T protein:vir:79 265 KQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDAN-LDYLSTLKPYITCVCAE 343 (441) T ss_pred cccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHH-HHHHHHHHHHHHHHHHH Confidence 67899999999999999999999999999999999999999999999999765555655554 45567999999999999 Q ss_pred HHHhhcc-----hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHH-hCCCC---------- Q lcl|NC_018285. 308 LSQKLSC-----DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKG-ENPNR---------- 371 (383) Q Consensus 308 l~~~l~~-----~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~-~~~~~---------- 371 (383) |+++|++ +++||.....+.|...+++.++.++++|++|+||+|+++|++|++++|.... -+.+. 
T Consensus 344 ln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~ 423 (441) T protein:vir:79 344 LNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQ 423 (441) T ss_pred HhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccc Confidence 9999985 4678888889999999999999999999999999999999999988773221 11122 Q ss_pred --------CCCCCCCCCC Q lcl|NC_018285. 372 --------TILKGGETNG 381 (383) Q Consensus 372 --------~~~~ggd~~~ 381 (383) .+.+|||.++ T Consensus 424 ~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 424 MNKSRATDKKLKGGEENE 441 (441) T ss_pred cccccccccccCCCCCCC Confidence 2346888887 No 15 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=100.00 E-value=1.2e-85 Score=486.20 Aligned_cols=379 Identities=20% Similarity=0.199 Sum_probs=319.4 Q ss_pred Cchhhhhhc--CCcccccccccccc---hhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch------ Q lcl|NC_018285. 1 MPIFNLATE--SPPNNQGGFFDITD---PEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ------ 69 (383) Q Consensus 1 Mglf~~~~~--~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~------ 69 (383) ||||+++++ ++........+... ..+++...++.+++.++||++++|++||++||++||++|+++++.. T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l~~~~~~~~~~~~~~ 80 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYGIQR 80 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcCCCCcceechhhhhccHHHHHHHHHHHHhhccCceEEEEecCCceee Confidence 999998764 22221111111112 2234555567789999999999999999999999999999998643 Q ss_pred ------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCce-----eEEEE Q lcl|NC_018285. 
70 ------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNG-----LYYNV 138 (383) Q Consensus 70 ------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~-----~~y~~ 138 (383) .+.|+.+||++||+++||+.++.+++++||||+++.|+.+|++++|||++|++|++..++.+.. .+|.+ T Consensus 81 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~~~~~~~~~ 160 (429) T protein:vir:10 81 GTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWYVV 160 (429) T ss_pred ccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEEEEE Confidence 2346789999999999999999999999999999999999999999999999999988764422 22333 Q ss_pred eecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHH Q lcl|NC_018285. 139 TFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTK 218 (383) Q Consensus 139 ~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~ 218 (383) . ..+..+.++++||||++++.+.++++|+||+..+..+++...++++++.++|+||+.|+++++.++.+++|++++ T Consensus 161 ~----~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~ 236 (429) T protein:vir:10 161 N----TGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKV 236 (429) T ss_pred c----cCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHH Confidence 2 234567899999999998777778999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHH Q lcl|NC_018285. 219 VSRSRQAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVY 293 (383) Q Consensus 219 ~~~~~~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~ 293 (383) +++.|+.. 
..|+|+++|+++|++|++++.++.|+|++|.+++++++||++|||||++||+ .+++++.+++..+|+ T Consensus 237 ~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~f~ 316 (429) T protein:vir:10 237 FRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFY 316 (429) T ss_pred HHHHHHHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHH Confidence 99988654 4678999999999999999999999999999999999999999999999985 456788899999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhH Q lcl|NC_018285. 294 SKAVARYLRPFLSELSQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPK 365 (383) Q Consensus 294 ~~~l~P~~~~i~~~l~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~ 365 (383) +.||.|++++|+++||++|+++ ++||....++.|..++++.+++++++|++|+||+|+++|++|++++|... T Consensus 317 ~~~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~ 396 (429) T protein:vir:10 317 TDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLL 396 (429) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeee Confidence 9999999999999999999864 56778888889999999999999999999999999999999998877544 Q ss_pred Hh-------CCCCCCCCCCCCCCCC Q lcl|NC_018285. 366 GE-------NPNRTILKGGETNGQD 383 (383) Q Consensus 366 ~~-------~~~~~~~~ggd~~~~d 383 (383) .. 
..+....+|||++++- T Consensus 397 ~~~n~~~~d~~~~~~~k~g~~~~~~ 421 (429) T protein:vir:10 397 VNGNMLPIDMAGQAYLKGGDTNGEV 421 (429) T ss_pred ecccccchhhccccccCCCCCCCCC Confidence 32 1122335677764433 No 16 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=100.00 E-value=5.9e-86 Score=487.84 Aligned_cols=373 Identities=17% Similarity=0.180 Sum_probs=319.1 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch----------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ----------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~----------- 69 (383) ||||+++++.. ..++...+.+++.+.. +.++..++.+.||++++|++||++||++||++|+++++.. T Consensus 1 MG~~~~~~~~~-~~~~~~~~~~~~~~~~-~~g~~~~~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~ 78 (411) T protein:vir:81 1 MGWWSRLTRFF-RPRNETVDMTNPLLLQ-WLGVDPDTPRNQLSEATYFACLKILSESLGKLPLKMYQKTERGIVKSDREE 78 (411) T ss_pred CchHHHHHhhc-cCcccccccchHHHHH-HhcCcccChhhhhccHHHHHHHHHHHHhHhhCceeEEEecCCceeeecccH Confidence 99999986422 2223334444555443 3466778999999999999999999999999999998643 Q ss_pred -hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-----eeEEEEeecCc Q lcl|NC_018285. 70 -MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-----GLYYNVTFDDP 143 (383) Q Consensus 70 -~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-----~~~y~~~~~~~ 143 (383) .+.|+.+||++||+++||+.++.+++++||||++++|+ .|++.+|||++|+.|++..++.+. ..+|.+.. . 
T Consensus 79 l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~--~ 155 (411) T protein:vir:81 79 LYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-GPQLQALWILPSQYVTIVVDDRGLLGEKNAIWYRYND--P 155 (411) T ss_pred HHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCceEEEEEECCceEEEEEcCcccccccceEEEEEEe--c Confidence 23467899999999999999999999999999999998 589999999999999998876542 33444443 3 Q ss_pred ccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHH Q lcl|NC_018285. 144 RIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSR 223 (383) Q Consensus 144 ~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~ 223 (383) ..+....++++||||+|.+.+.++++|+||+.++..++....++++++.++|+||+.|+++++.++.+++++++++++.| T Consensus 156 ~~g~~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~ 235 (411) T protein:vir:81 156 YDGKMYVFRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLVKGF 235 (411) T ss_pred CCceEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHH Confidence 34566789999999999776666789999999999999999999999999999999999999999999999999999998 Q ss_pred HH---hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc--ccCcCHHHHHHHHHHHHHH Q lcl|NC_018285. 224 QA---MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ--GDQQSSLEMSSNVYSKAVA 298 (383) Q Consensus 224 ~~---~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~--~~~~~~~e~~~~~~~~~l~ 298 (383) .. +.+|+|+++|+++|++|++++.++.|+||+|.+++++++||++|||||++||+. ++++|.+++.++|+.+||. 
T Consensus 236 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~l~ 315 (411) T protein:vir:81 236 EQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNLAFYVDTLL 315 (411) T ss_pred HHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHHHHHHHHHH Confidence 65 445789999999999999999999999999999999999999999999999864 5677888889999999999 Q ss_pred HHHHHHHHHHHHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCC Q lcl|NC_018285. 299 RYLRPFLSELSQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPN 370 (383) Q Consensus 299 P~~~~i~~~l~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~ 370 (383) |++++|+++|+++|++. ++||+...++.|..++++.+.+++++|++|+||+|+++|++|++++|..... .+ T Consensus 316 P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~ggD~~~~~-~n 394 (411) T protein:vir:81 316 YVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYGNNLMAN-GN 394 (411) T ss_pred HHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeec-cC Confidence 99999999999999864 5678888889999999999999999999999999999999999888755432 23 Q ss_pred CCC--------CCCCCC Q lcl|NC_018285. 371 RTI--------LKGGET 379 (383) Q Consensus 371 ~~~--------~~ggd~ 379 (383) +.| .+|||+ T Consensus 395 ~~pl~~~~~~~~kgGd~ 411 (411) T protein:vir:81 395 YIPLSMLGANYGKGGDS 411 (411) T ss_pred ccchhhhhhhhccCCCC Confidence 333 357777 No 17 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=100.00 E-value=2.6e-85 Score=484.30 Aligned_cols=375 Identities=15% Similarity=0.184 Sum_probs=316.6 Q ss_pred CchhhhhhcCCccccccc-ccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch---------- Q lcl|NC_018285. 
1 MPIFNLATESPPNNQGGF-FDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ---------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~---------- 69 (383) +-||+++++++....+.. .......+.+.+.++.+|+.++||++++|++||++||++||++|+++++.. T Consensus 16 ~~~~~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~lp~~v~~~~~~~~~~~~~~ 95 (424) T protein:vir:45 16 RVLLDALFRSKSLENPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQMPLHVMRRHKGKVEPARDH 95 (424) T ss_pred hHHHHhhccccCCCCCccccchhhhhhhccccCCceechHHhhccHHHHHHHHHHHHHHhhCceEEEEecCCceeecccc Confidence 778888887654322111 111112234556678899999999999999999999999999999998642 Q ss_pred --hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCccccc Q lcl|NC_018285. 70 --MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPP 147 (383) Q Consensus 70 --~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~ 147 (383) .+.|+.+||++||+++||+.++.+++++||||++|+|+..|++++|+|++|+.|++... ++...|++.... . T Consensus 96 ~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~~--~~~~~y~~~~~~----~ 169 (424) T protein:vir:45 96 PAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNT--GGRYTYGLYNEY----G 169 (424) T ss_pred hHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEEc--CCeEEEEEEecC----c Confidence 23467899999999999999999999999999999999999999999999999998764 456677765433 2 Q ss_pred ceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhh Q lcl|NC_018285. 148 KQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMK 227 (383) Q Consensus 148 ~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~ 227 (383) ...++++||||+|++++++ .+|+||+..+..+|+...++++++.++|+||++|+++++.++.+++|+++++++.|.... 
T Consensus 170 ~~~~~~~eVih~r~~~~d~-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~ 248 (424) T protein:vir:45 170 AFAISPDDMIHIRALGNNQ-KMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLKDQWQKAS 248 (424) T ss_pred eEEECcccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHh Confidence 4579999999999988765 789999999999999999999999999999999999999999999999999998886432 Q ss_pred ----cCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 228 ----QMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSKAVARYL 301 (383) Q Consensus 228 ----~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~~l~P~~ 301 (383) .|+|+++|+++|++|++++.++.|+||+|.+++++++||++|||||++||+ .++++|.+++.+.|+++||.|++ T Consensus 249 ~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~ 328 (424) T protein:vir:45 249 QALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMPWV 328 (424) T ss_pred ccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHH Confidence 478999999999999999999999999999999999999999999999996 45678889999999999999999 Q ss_pred HHHHHHHHHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCC Q lcl|NC_018285. 302 RPFLSELSQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTI 373 (383) Q Consensus 302 ~~i~~~l~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~ 373 (383) ++|+++||++|+++ ++||....++.|..++++.+++++++|++|+||+|+++|++|++++|..... 
+|..+ T Consensus 329 ~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~ggD~~~~~-~n~~~ 407 (424) T protein:vir:45 329 TNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVEGLDEMLVS-VNAAN 407 (424) T ss_pred HHHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeec-ccccc Confidence 99999999999875 5688888889999999999999999999999999999999999987755432 22221 Q ss_pred -------CCCCCCCCCC Q lcl|NC_018285. 374 -------LKGGETNGQD 383 (383) Q Consensus 374 -------~~ggd~~~~d 383 (383) .+..+.+++| T Consensus 408 ~~~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 408 PAGDFKPPKNDEGKTNE 424 (424) T ss_pred cccccCCCCCCCCCCCC Confidence 1222222222 No 18 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=100.00 E-value=2.7e-85 Score=484.23 Aligned_cols=379 Identities=20% Similarity=0.216 Sum_probs=318.8 Q ss_pred Cchhhhhh-----cCCcccccccccccchh---hcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--- Q lcl|NC_018285. 1 MPIFNLAT-----ESPPNNQGGFFDITDPE---FLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--- 69 (383) Q Consensus 1 Mglf~~~~-----~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--- 69 (383) ||||+++. .++........+...+. +++...++.+++.++|+++++|++||++||++||++|+++++.. T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 99999973 23322222222222222 34445567889999999999999999999999999999998643 Q ss_pred ---------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-----eeE Q lcl|NC_018285. 
70 ---------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-----GLY 135 (383) Q Consensus 70 ---------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-----~~~ 135 (383) .+.|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|++|++..++.+. ..+ T Consensus 81 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~ 160 (432) T protein:vir:10 81 IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMW 160 (432) T ss_pred eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEE Confidence 234667999999999999999999999999999999999999999999999999998875432 233 Q ss_pred EEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHH Q lcl|NC_018285. 136 YNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDF 215 (383) Q Consensus 136 y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~ 215 (383) |.+. .++..+.++++||||+|++.+.++++|+||+..+..++....++++++.++|+||+.|+++++.++.+++++ T Consensus 161 y~~~----~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~ 236 (432) T protein:vir:10 161 YVVN----TGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDA 236 (432) T ss_pred EEEe----cCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHH Confidence 3333 234567899999999998777777999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHH Q lcl|NC_018285. 216 KTKVSRSRQAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSS 290 (383) Q Consensus 216 ~~~~~~~~~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~ 290 (383) ++++++.|... ..|+|+++|+++|++|++++.++.|+||++.+++++++||++|||||++||. .+++++.+++.. 
T Consensus 237 ~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~ 316 (432) T protein:vir:10 237 KKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQ 316 (432) T ss_pred HHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH Confidence 99999988654 4678999999999999999999999999999999999999999999999985 566788899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc Q lcl|NC_018285. 291 NVYSKAVARYLRPFLSELSQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE 362 (383) Q Consensus 291 ~~~~~~l~P~~~~i~~~l~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d 362 (383) +|+++||+|++++|+++||++|+++ ++||...+.+.|..++++.+.+++++|++|+||+|+++|++|++++| T Consensus 317 ~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD 396 (432) T protein:vir:10 317 QFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD 396 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 9999999999999999999999864 56777788889999999999999999999999999999999998887 Q ss_pred hhHHhC-------CCCCCCCCCCCCCCC Q lcl|NC_018285. 363 LPKGEN-------PNRTILKGGETNGQD 383 (383) Q Consensus 363 ~~~~~~-------~~~~~~~ggd~~~~d 383 (383) ...... 
.+....+|||++++- T Consensus 397 ~~~~~~n~~~~~~~~~~~~k~~~~~~~~ 424 (432) T protein:vir:10 397 RLLVNGNMLPIDMAGQAYLKGGDTNGEV 424 (432) T ss_pred eEeecccccchhhccccccCCCCCCCCC Confidence 554321 122234677754432 No 19 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=100.00 E-value=2.7e-85 Score=484.23 Aligned_cols=379 Identities=20% Similarity=0.216 Sum_probs=318.8 Q ss_pred Cchhhhhh-----cCCcccccccccccchh---hcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--- Q lcl|NC_018285. 1 MPIFNLAT-----ESPPNNQGGFFDITDPE---FLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--- 69 (383) Q Consensus 1 Mglf~~~~-----~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--- 69 (383) ||||+++. .++........+...+. +++...++.+++.++|+++++|++||++||++||++|+++++.. T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 99999973 23322222222222222 34445567889999999999999999999999999999998643 Q ss_pred ---------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-----eeE Q lcl|NC_018285. 70 ---------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-----GLY 135 (383) Q Consensus 70 ---------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-----~~~ 135 (383) .+.|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|++|++..++.+. 
..+ T Consensus 81 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~ 160 (432) T protein:vir:10 81 IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMW 160 (432) T ss_pred eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEE Confidence 234667999999999999999999999999999999999999999999999999998875432 233 Q ss_pred EEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHH Q lcl|NC_018285. 136 YNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDF 215 (383) Q Consensus 136 y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~ 215 (383) |.+. .++..+.++++||||+|++.+.++++|+||+..+..++....++++++.++|+||+.|+++++.++.+++++ T Consensus 161 y~~~----~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~ 236 (432) T protein:vir:10 161 YVVN----TGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDA 236 (432) T ss_pred EEEe----cCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHH Confidence 3333 234567899999999998777777999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHH Q lcl|NC_018285. 216 KTKVSRSRQAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSS 290 (383) Q Consensus 216 ~~~~~~~~~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~ 290 (383) ++++++.|... ..|+|+++|+++|++|++++.++.|+||++.+++++++||++|||||++||. .+++++.+++.. 
T Consensus 237 ~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~ 316 (432) T protein:vir:10 237 KKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQ 316 (432) T ss_pred HHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH Confidence 99999988654 4678999999999999999999999999999999999999999999999985 566788899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc Q lcl|NC_018285. 291 NVYSKAVARYLRPFLSELSQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE 362 (383) Q Consensus 291 ~~~~~~l~P~~~~i~~~l~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d 362 (383) +|+++||+|++++|+++||++|+++ ++||...+.+.|..++++.+.+++++|++|+||+|+++|++|++++| T Consensus 317 ~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD 396 (432) T protein:vir:10 317 QFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD 396 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 9999999999999999999999864 56777788889999999999999999999999999999999998887 Q ss_pred hhHHhC-------CCCCCCCCCCCCCCC Q lcl|NC_018285. 363 LPKGEN-------PNRTILKGGETNGQD 383 (383) Q Consensus 363 ~~~~~~-------~~~~~~~ggd~~~~d 383 (383) ...... 
.+....+|||++++- T Consensus 397 ~~~~~~n~~~~~~~~~~~~k~~~~~~~~ 424 (432) T protein:vir:10 397 RLLVNGNMLPIDMAGQAYLKGGDTNGEV 424 (432) T ss_pred eEeecccccchhhccccccCCCCCCCCC Confidence 554321 122234677754432 No 20 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=100.00 E-value=2.7e-85 Score=484.23 Aligned_cols=379 Identities=20% Similarity=0.216 Sum_probs=318.8 Q ss_pred Cchhhhhh-----cCCcccccccccccchh---hcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--- Q lcl|NC_018285. 1 MPIFNLAT-----ESPPNNQGGFFDITDPE---FLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--- 69 (383) Q Consensus 1 Mglf~~~~-----~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--- 69 (383) ||||+++. .++........+...+. +++...++.+++.++|+++++|++||++||++||++|+++++.. T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 99999973 23322222222222222 34445567889999999999999999999999999999998643 Q ss_pred ---------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-----eeE Q lcl|NC_018285. 70 ---------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-----GLY 135 (383) Q Consensus 70 ---------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-----~~~ 135 (383) .+.|+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|++|++..++.+. 
..+ T Consensus 81 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~ 160 (432) T protein:vir:10 81 IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMW 160 (432) T ss_pred eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEE Confidence 234667999999999999999999999999999999999999999999999999998875432 233 Q ss_pred EEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHH Q lcl|NC_018285. 136 YNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDF 215 (383) Q Consensus 136 y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~ 215 (383) |.+. .++..+.++++||||+|++.+.++++|+||+..+..++....++++++.++|+||+.|+++++.++.+++++ T Consensus 161 y~~~----~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~ 236 (432) T protein:vir:10 161 YVVN----TGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDA 236 (432) T ss_pred EEEe----cCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHH Confidence 3333 234567899999999998777777999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHH Q lcl|NC_018285. 216 KTKVSRSRQAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSS 290 (383) Q Consensus 216 ~~~~~~~~~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~ 290 (383) ++++++.|... ..|+|+++|+++|++|++++.++.|+||++.+++++++||++|||||++||. .+++++.+++.. 
T Consensus 237 ~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~ 316 (432) T protein:vir:10 237 KKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQ 316 (432) T ss_pred HHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH Confidence 99999988654 4678999999999999999999999999999999999999999999999985 566788899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc Q lcl|NC_018285. 291 NVYSKAVARYLRPFLSELSQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE 362 (383) Q Consensus 291 ~~~~~~l~P~~~~i~~~l~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d 362 (383) +|+++||+|++++|+++||++|+++ ++||...+.+.|..++++.+.+++++|++|+||+|+++|++|++++| T Consensus 317 ~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD 396 (432) T protein:vir:10 317 QFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGD 396 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 9999999999999999999999864 56777788889999999999999999999999999999999998887 Q ss_pred hhHHhC-------CCCCCCCCCCCCCCC Q lcl|NC_018285. 363 LPKGEN-------PNRTILKGGETNGQD 383 (383) Q Consensus 363 ~~~~~~-------~~~~~~~ggd~~~~d 383 (383) ...... 
.+....+|||++++- T Consensus 397 ~~~~~~n~~~~~~~~~~~~k~~~~~~~~ 424 (432) T protein:vir:10 397 RLLVNGNMLPIDMAGQAYLKGGDTNGEV 424 (432) T ss_pred eEeecccccchhhccccccCCCCCCCCC Confidence 554321 122234677754432 No 21 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=100.00 E-value=2.1e-85 Score=484.85 Aligned_cols=379 Identities=17% Similarity=0.244 Sum_probs=317.0 Q ss_pred hhhhhhcCCccccc-------cccc---ccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--- Q lcl|NC_018285. 3 IFNLATESPPNNQG-------GFFD---ITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--- 69 (383) Q Consensus 3 lf~~~~~~~~~~~~-------~~~~---~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--- 69 (383) +|+.+++.+..... .|.. .....+.+.+.++..|+.++||++++|++||++||++||++|+++|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~g 80 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEAVLSFHAVFACISLISQDIAKMRLRLMQTDAQG 80 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEeccCC Confidence 77777664432221 1111 1122345666778899999999999999999999999999999998743 Q ss_pred ---------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEee Q lcl|NC_018285. 70 ---------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTF 140 (383) Q Consensus 70 ---------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~ 140 (383) ...|+.+||++||+++||+.++.+++++||||++|+|+.+|++++|+||+|++|++..++++ .++|++.. 
T Consensus 81 ~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g-~~~y~~~~ 159 (454) T protein:vir:93 81 IRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADDG-EVFYRITP 159 (454) T ss_pred ccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCCC-cEEEEEEe Confidence 24588999999999999999999999999999999999999999999999999999887655 56677765 Q ss_pred cCc-ccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHH Q lcl|NC_018285. 141 DDP-RIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKV 219 (383) Q Consensus 141 ~~~-~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~ 219 (383) ... ..+....++++||||++.....++++|+||+..+..++....++++++.++|+||++|++++++++.+++|+++++ T Consensus 160 ~~~~~~~~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~ 239 (454) T protein:vir:93 160 DRNCGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKL 239 (454) T ss_pred ccccccceeEEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHH Confidence 432 2234567999999999976666778999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhh--cCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHH Q lcl|NC_018285. 220 SRSRQAMK--QMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSK 295 (383) Q Consensus 220 ~~~~~~~~--~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~ 295 (383) ++.|+... .|+|+++||++|++|++++.+++|+||+|++++++++||++|||||++||. .+++++.+++.++|++. 
T Consensus 240 ~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~ 319 (454) T protein:vir:93 240 KSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQ 319 (454) T ss_pred HHHHHHHhcccccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHH Confidence 99987654 578999999999999999999999999999999999999999999999996 35677888999999999 Q ss_pred HHHHHHHHHHHHHHHhhcch----hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCC Q lcl|NC_018285. 296 AVARYLRPFLSELSQKLSCD----VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNR 371 (383) Q Consensus 296 ~l~P~~~~i~~~l~~~l~~~----~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~ 371 (383) ||.|+++.|+++||++|++. ++|+...+++.|...+++.+.+++++|++|+||+|+++|++|++++|....... + T Consensus 320 ~l~P~~~~ie~~ln~~L~~~~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~~-~ 398 (454) T protein:vir:93 320 CLQTLIESIELLLDEALETGENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQ-N 398 (454) T ss_pred HHHHHHHHHHHHHHHhhcCCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccC-c Confidence 99999999999999999864 678888888999999999999999999999999999999999988775433221 1 Q ss_pred CCCC------CCC---------------CCCCC Q lcl|NC_018285. 
372 TILK------GGE---------------TNGQD 383 (383) Q Consensus 372 ~~~~------ggd---------------~~~~d 383 (383) .+++ +.+ ..++| T Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 431 (454) T protein:vir:93 399 YSLEALSRRDAREDPFASSGKTASVPQAVAASD 431 (454) T ss_pred cchHhhhccCcccCCCCCCccCCCCCCCCCCCC Confidence 1110 000 00001 No 22 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=100.00 E-value=3.7e-85 Score=483.45 Aligned_cols=376 Identities=16% Similarity=0.211 Sum_probs=310.8 Q ss_pred Cchhhhhhc---CCc-ccccccccc-----c-chhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch- Q lcl|NC_018285. 1 MPIFNLATE---SPP-NNQGGFFDI-----T-DPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ- 69 (383) Q Consensus 1 Mglf~~~~~---~~~-~~~~~~~~~-----~-~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~- 69 (383) ||||++++. ++. ...++..+. . .......+.++..|+.++||++++|++||++||++||++|+++|+.. T Consensus 7 mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:81 7 LGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTP 86 (432) T ss_pred cchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhCceeeEEecC Confidence 999999543 211 111111111 1 11112234567889999999999999999999999999999997532 Q ss_pred -----------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEE Q lcl|NC_018285. 
70 -----------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNV 138 (383) Q Consensus 70 -----------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~ 138 (383) .+.|+.+||++||+++||+.++.+++++||||+++.|+ +|++++||||+|++|++..+.++ ...|++ T Consensus 87 ~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v~~~~~g-~~~y~~ 164 (432) T protein:vir:81 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDPKG-NTAYRY 164 (432) T ss_pred CcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEECCCC-cEEEEE Confidence 24567899999999999999999999999999999986 59999999999999999887654 566776 Q ss_pred eecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHH Q lcl|NC_018285. 139 TFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTK 218 (383) Q Consensus 139 ~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~ 218 (383) +..+ +..+.++++||+|+|+++.++ ++|+||+..+..+|....++++++.++|+||++|+++++.++.++++++++ T Consensus 165 ~~~~---g~~~~~~~~~iih~r~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~ 240 (432) T protein:vir:81 165 RRTD---GQMIDIPKQQIWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDS 240 (432) T ss_pred EecC---ceEEEEccccEEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHH Confidence 5543 456789999999999887776 789999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc--c---CcCHHHHHHHHH Q lcl|NC_018285. 219 VSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG--D---QQSSLEMSSNVY 293 (383) Q Consensus 219 ~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~--~---~~~~~e~~~~~~ 293 (383) +++.+. +..++|+++||++|++|++++.+++|+||+|.+++++++||++|||||++||... 
+ +++.+++.+.|+ T Consensus 241 ~~~~~~-~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~ 319 (432) T protein:vir:81 241 FAKKVS-GSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFL 319 (432) T ss_pred HHHHHh-hhhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHH Confidence 998885 4568899999999999999999999999999999999999999999999998632 2 356788899999 Q ss_pred HHHHHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHH Q lcl|NC_018285. 294 SKAVARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKG 366 (383) Q Consensus 294 ~~~l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~ 366 (383) .+||.|+++.|+++|+++|++. ++||+..+++.|..++++.+++++++|++|+||+|+++|++|+++++.... T Consensus 320 ~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g~~~~~~ 399 (432) T protein:vir:81 320 TMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVLT 399 (432) T ss_pred HHHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceEe Confidence 9999999999999999999864 668888888999999999999999999999999999999999986653322 Q ss_pred hCCCCCC------------CCCCCCCCCC Q lcl|NC_018285. 
367 ENPNRTI------------LKGGETNGQD 383 (383) Q Consensus 367 ~~~~~~~------------~~ggd~~~~d 383 (383) -+.+..| .+|++++.+| T Consensus 400 ~~~~~~pl~~~~~~~~~~~~~~~~n~~~~ 428 (432) T protein:vir:81 400 VQSAMVPLDSIGLQASPEPASGLGNQQQD 428 (432) T ss_pred ecCcccchhhhccCCCCCCCCCCCCcccc Confidence 1222222 1222222222 No 23 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=100.00 E-value=7.5e-85 Score=481.80 Aligned_cols=377 Identities=16% Similarity=0.219 Sum_probs=318.1 Q ss_pred CchhhhhhcCCccccccc--ccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGF--FDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--------- 69 (383) ||||+++++++...+... .++..+ ...+..++..++.++||++++|++||++||++||++|+++++.. T Consensus 1 Mgl~~~~f~~~~~~~~~~~~~~~~~~-~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~ 79 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLTKISGIPSP-AEDWAMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDNVRIPVSP 79 (409) T ss_pred CchhhhhhcCCCcccccccccccccc-cchhhccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcccccch Confidence 999999888765433222 111111 12233467788999999999999999999999999999998743 Q ss_pred -hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEe-ecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCccccc Q lcl|NC_018285. 70 -MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRW-RNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPP 147 (383) Q Consensus 70 -~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~-r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~ 147 (383) .+.|+.+||++||+++||+.++.+++++||+|++|. ++..|+|++||||+|++|++....+....+|.+.+.. . 
T Consensus 80 l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~~~~~~~~~~----~ 155 (409) T protein:vir:84 80 APKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDGDWIEPVYRI----D 155 (409) T ss_pred HHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcceEEEEEecC----C Confidence 234778999999999999999999999999999986 6788999999999999999887655544444433322 1 Q ss_pred ceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhh Q lcl|NC_018285. 148 KQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMK 227 (383) Q Consensus 148 ~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~ 227 (383) .+.++++||||++++++++.++|+||+..+..++....++++++.++|+||++|+++|+.++.+++|+++++++.|..+. T Consensus 156 g~~~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~ 235 (409) T protein:vir:84 156 GKVVPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDADLTPDQVKQTQKQWIQSH 235 (409) T ss_pred ceEEchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHh Confidence 35689999999999999988899999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc--c--CcCHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 228 QMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG--D--QQSSLEMSSNVYSKAVARYLRP 303 (383) Q Consensus 228 ~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~--~--~~~~~e~~~~~~~~~l~P~~~~ 303 (383) .|+|+++|+++|++|++++.++.|+||+|.+++++++||++|||||++||... + +++.+++.++|+.+||.|+++. 
T Consensus 236 ~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ 315 (409) T protein:vir:84 236 HNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRC 315 (409) T ss_pred ccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999998632 2 3567888899999999999999 Q ss_pred HHHHHHHhhcc--hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCC-------- Q lcl|NC_018285. 304 FLSELSQKLSC--DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTI-------- 373 (383) Q Consensus 304 i~~~l~~~l~~--~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~-------- 373 (383) |+++|+++|.. .++||...+++.|..++++.+.+++++|++|+||+|+++|++|++++|.... ..|..+ T Consensus 316 ie~~l~~~L~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD~~~~-~~n~~~~~~~~~~~ 394 (409) T protein:vir:84 316 IEQALDTFLPRGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGDIHLQ-PMNFVPLGYVPPEE 394 (409) T ss_pred HHHHHHHhccCCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeee-cccccccccCCccc Confidence 99999999854 3688888899999999999999999999999999999999999988775432 122222 Q ss_pred -----CCCCCCCCCC Q lcl|NC_018285. 374 -----LKGGETNGQD 383 (383) Q Consensus 374 -----~~ggd~~~~d 383 (383) .+.+++++++ T Consensus 395 ~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 395 PAQEPQPNSATEGNK 409 (409) T ss_pred cCcCCCCCCccCCCC Confidence 1233333333 No 24 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=100.00 E-value=6.6e-85 Score=482.11 Aligned_cols=376 Identities=16% Similarity=0.205 Sum_probs=311.9 Q ss_pred CchhhhhhcC--C--cccccccc--cccc----hhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch- Q lcl|NC_018285. 
1 MPIFNLATES--P--PNNQGGFF--DITD----PEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ- 69 (383) Q Consensus 1 Mglf~~~~~~--~--~~~~~~~~--~~~~----~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~- 69 (383) ||+|++++.. + +....+.. ...+ ..+...+.++..|+.++||++++|++||++||++||++|+++|+.. T Consensus 7 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~v~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:97 7 LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAVAAMPLMMYMRTP 86 (432) T ss_pred CchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHHHHHHHHhhccCceEEEEecC Confidence 9999986432 1 11111111 1111 1122334568889999999999999999999999999999998643 Q ss_pred -----------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEE Q lcl|NC_018285. 70 -----------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNV 138 (383) Q Consensus 70 -----------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~ 138 (383) .+.|+.+||++||+++||+.++.+++++||||++++|+ +|++.+||||+|+.|++..+.++ .+.|++ T Consensus 87 ~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~v~v~~~~~g-~~~y~~ 164 (432) T protein:vir:97 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKG-NTAYRY 164 (432) T ss_pred CCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEEcCCC-cEEEEE Confidence 24567899999999999999999999999999999997 58999999999999999887654 567776 Q ss_pred eecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHH Q lcl|NC_018285. 
139 TFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTK 218 (383) Q Consensus 139 ~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~ 218 (383) ...+ +..+.++++||||+|+++.++ ++|+||+..+...+....+++++..++|+||++|++++++++.+++|++++ T Consensus 165 ~~~~---g~~~~~~~~~iih~r~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~ 240 (432) T protein:vir:97 165 RRTD---GQMIDIPRQQIWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDS 240 (432) T ss_pred EecC---ceEEEEccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCCCCHHHHHH Confidence 6443 456789999999999887665 789999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc--c---CcCHHHHHHHHH Q lcl|NC_018285. 219 VSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG--D---QQSSLEMSSNVY 293 (383) Q Consensus 219 ~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~--~---~~~~~e~~~~~~ 293 (383) +++.|. +..++|+++||++|++|++++.++.|+||+|++++++++||++|||||++||... + +++.+++.+.|+ T Consensus 241 ~~~~~~-~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~f~ 319 (432) T protein:vir:97 241 FSKKVS-GSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFL 319 (432) T ss_pred HHHHHh-hhhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHHHHH Confidence 988875 4568899999999999999999999999999999999999999999999998532 2 356788899999 Q ss_pred HHHHHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHH Q lcl|NC_018285. 294 SKAVARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKG 366 (383) Q Consensus 294 ~~~l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~ 366 (383) .+||.|+++.|+++|+++|++. ++||...+++.|..++++.+.+++++|++|+||+|+++|++|+++++.... 
T Consensus 320 ~~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g~~~~~~ 399 (432) T protein:vir:97 320 TMTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVLT 399 (432) T ss_pred HHHHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceEe Confidence 9999999999999999999864 568888888999999999999999999999999999999999987653332 Q ss_pred hCCCCCC----------C-CCCCCCCCC Q lcl|NC_018285. 367 ENPNRTI----------L-KGGETNGQD 383 (383) Q Consensus 367 ~~~~~~~----------~-~ggd~~~~d 383 (383) .+.+..| . .+|+.++++ T Consensus 400 ~~~~~~pl~~~~~~~~~~~~~~~~~~~~ 427 (432) T protein:vir:97 400 VQSAMVPLDSIGLQASPEPASGLGNQQQ 427 (432) T ss_pred ecccccchhhhcccCCCCCCCCCCCccc Confidence 2222221 1 233333333 No 25 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=100.00 E-value=9.4e-85 Score=481.25 Aligned_cols=376 Identities=16% Similarity=0.212 Sum_probs=309.9 Q ss_pred CchhhhhhcC---C-ccccccccc-----ccc-hhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch- Q lcl|NC_018285. 1 MPIFNLATES---P-PNNQGGFFD-----ITD-PEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ- 69 (383) Q Consensus 1 Mglf~~~~~~---~-~~~~~~~~~-----~~~-~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~- 69 (383) ||+|++++.. + +....+..+ ... ..+...+.++..|+.++||++++|++||++||++||++|+++|+.. 
T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:10 7 LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTP 86 (432) T ss_pred cchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhhhhCceeEEEecC Confidence 9999985432 1 111111111 111 1122334567889999999999999999999999999999998542 Q ss_pred -----------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEE Q lcl|NC_018285. 70 -----------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNV 138 (383) Q Consensus 70 -----------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~ 138 (383) .+.|+.+||++||+++||+.++.+++++||||++++|+ +|++.+||||+|++|++..+.++ ...|++ T Consensus 87 ~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~v~v~~~~~g-~~~y~~ 164 (432) T protein:vir:10 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKG-NTAYRY 164 (432) T ss_pred CCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEEcCCC-cEEEEE Confidence 23467899999999999999999999999999999997 59999999999999999887655 466666 Q ss_pred eecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHH Q lcl|NC_018285. 
139 TFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTK 218 (383) Q Consensus 139 ~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~ 218 (383) ...+ +..+.++++||||+++++.++ ++|+||+..+..++....++++++.++|+||++|++++++++.+++|++++ T Consensus 165 ~~~~---g~~~~~~~~~iih~~~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~ 240 (432) T protein:vir:10 165 RRTD---GQMIDIPKQQIWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDS 240 (432) T ss_pred EecC---ceEEEEcCccEEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHH Confidence 5443 456789999999999887665 789999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc--c---CcCHHHHHHHHH Q lcl|NC_018285. 219 VSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG--D---QQSSLEMSSNVY 293 (383) Q Consensus 219 ~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~--~---~~~~~e~~~~~~ 293 (383) +++.|. +..++|+++||++|++|++++.+++|+||+|++++++++||++|||||++||... + +++.+++.+.|+ T Consensus 241 ~~~~~~-~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~f~ 319 (432) T protein:vir:10 241 FAKKVS-GSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFL 319 (432) T ss_pred HHHHHh-hhhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHHHHH Confidence 998886 4568899999999999999999999999999999999999999999999999632 2 356788899999 Q ss_pred HHHHHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHH Q lcl|NC_018285. 294 SKAVARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKG 366 (383) Q Consensus 294 ~~~l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~ 366 (383) ++||.|+++.|+++|+++|++. ++||....++.|..++++.+++++++|++|+||+|+++|++|+++++.... 
T Consensus 320 ~~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g~~~~~~ 399 (432) T protein:vir:10 320 SMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVLT 399 (432) T ss_pred HHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceEe Confidence 9999999999999999999864 568888888999999999999999999999999999999999986543221 Q ss_pred hCCCCC------------CCCCCCCCCCC Q lcl|NC_018285. 367 ENPNRT------------ILKGGETNGQD 383 (383) Q Consensus 367 ~~~~~~------------~~~ggd~~~~d 383 (383) -+.+.. +..|.+++.+| T Consensus 400 ~~~~~~pl~~~~~~~~~~~~~~~~~~~~~ 428 (432) T protein:vir:10 400 VQSAMVPLDSIGLQASPEPASGLGNQQQD 428 (432) T ss_pred ecCcccchhhhcccCCCCCCCCCCCcccc Confidence 121211 22222222222 No 26 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=100.00 E-value=1.6e-84 Score=479.93 Aligned_cols=377 Identities=14% Similarity=0.187 Sum_probs=316.4 Q ss_pred CchhhhhhcCCcccc-cccccccchh-hcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQ-GGFFDITDPE-FLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--------- 69 (383) ||||+++++++.... .....+.+.. ....+.++..|+.+.|+++++|++||++||++||++|+++++.. 
T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~~~~~ 80 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLKQRATG 80 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCccccCCceechhhhhccHHHHHHHHHHHHHhccCceEEEEecCCceeeccc Confidence 999999887654322 1111111111 11223467888999999999999999999999999999998643 Q ss_pred ---hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccc Q lcl|NC_018285. 70 ---MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIP 146 (383) Q Consensus 70 ---~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~ 146 (383) .+.|+.+||++||+++||++++.+++++||||++++|+ .|++++|+||+|+.|++..+.++ ...|++...+ + T Consensus 81 ~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~-~~~y~~~~~~---g 155 (414) T protein:vir:44 81 ERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSSW-EPVYQVTFPD---G 155 (414) T ss_pred chHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCCC-cEEEEEEecC---c Confidence 24577899999999999999999999999999999886 59999999999999998876554 4566666544 3 Q ss_pred cceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHh Q lcl|NC_018285. 147 PKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAM 226 (383) Q Consensus 147 ~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~ 226 (383) ..+.++++||||+++++.++ ++|.||+..+..+++...++++++.++|+||++|+++++.++.+++|+++++++.|... 
T Consensus 156 ~~~~~~~~evih~~~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~ 234 (414) T protein:vir:44 156 STDVLSQEDIWHVRTLTLDG-LVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEER 234 (414) T ss_pred eEEEEccccEEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHH Confidence 46789999999999887665 78999999999999999999999999999999999999999999999999999988654 Q ss_pred ---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 227 ---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSKAVARYL 301 (383) Q Consensus 227 ---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~~l~P~~ 301 (383) .+|+|+++|+++|++|++++.++.|+||+|.+++++++||++|||||++||+ .++++|.+++.+.|+++||+|++ T Consensus 235 ~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~~l~P~~ 314 (414) T protein:vir:44 235 HTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYL 314 (414) T ss_pred hcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHH Confidence 4578999999999999999999999999999999999999999999999996 45678889999999999999999 Q ss_pred HHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHh-CCCCCC Q lcl|NC_018285. 302 RPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGE-NPNRTI 373 (383) Q Consensus 302 ~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~-~~~~~~ 373 (383) ++|+++||++|+++ ++||.....+.|...+++.+++++++|++|+||+|+++|++|++++|..... 
+....+ T Consensus 315 ~~ie~~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~ 394 (414) T protein:vir:44 315 TRIEQRINTGLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMTTKP 394 (414) T ss_pred HHHHHHHHhhcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceecccccccccC Confidence 99999999999865 4677888888999999999999999999999999999999999988864432 111111 Q ss_pred ------CCCCCCCCCC Q lcl|NC_018285. 374 ------LKGGETNGQD 383 (383) Q Consensus 374 ------~~ggd~~~~d 383 (383) .+++|+..+| T Consensus 395 ~~~~~~~~~~~~~~~d 410 (414) T protein:vir:44 395 SDGSKAGKQKDNANAD 410 (414) T ss_pred CccccCCCCCCCCCCC Confidence 1233333334 No 27 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=100.00 E-value=1.4e-84 Score=480.26 Aligned_cols=374 Identities=14% Similarity=0.138 Sum_probs=318.7 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch----------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ----------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~----------- 69 (383) |+||+++++++......+...........+.++..++.++||++++|++||++||++||++|+++++.. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~ 80 (419) T protein:vir:57 1 MFIPQFWKGRPSENRVNWQVVPGGMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLYRRTENGGREIAFDH 80 (419) T ss_pred CcchhhhccCCccccccccccccccccccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCceeccccc Confidence 999999998887766666544333334556678899999999999999999999999999999998633 Q ss_pred --hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCccccc Q lcl|NC_018285. 
70 --MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPP 147 (383) Q Consensus 70 --~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~ 147 (383) .+.|+.+||++||+++||+.++.+++++||||++|+|+.+|+|++||||+|++|++..+.++ .++|.+... T Consensus 81 ~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~g-~~~y~~~~~------ 153 (419) T protein:vir:57 81 PLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPDG-MPYYDIPSI------ 153 (419) T ss_pred hHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCCc-eEEEEEcCC------ Confidence 23366899999999999999999999999999999999999999999999999999887654 456665322 Q ss_pred ceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC----CCCHHHHHHHHHHH Q lcl|NC_018285. 148 KQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG----GGLLDFKTKVSRSR 223 (383) Q Consensus 148 ~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~----~~~~e~~~~~~~~~ 223 (383) ...++.++|+|+++++.++ ++|+||+..+...+....++++++.++|+||++|+++|+.++ .+++++.+++++.| T Consensus 154 ~~~~~~~~vih~r~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~ 232 (419) T protein:vir:57 154 GEILPMRMVHHIKSFSLDG-YIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKW 232 (419) T ss_pred ceEEchhhEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHH Confidence 3468999999999887665 789999999999999999999999999999999999999864 45788888898888 Q ss_pred HH---hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHHHHH Q lcl|NC_018285. 224 QA---MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSKAVA 298 (383) Q Consensus 224 ~~---~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~~l~ 298 (383) .. 
+..|+|+++|+++|++|++++.+++|+||+|++++++++||++|||||++||+ .+++++.+++.++|+++||+ T Consensus 233 ~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~ 312 (419) T protein:vir:57 233 TERYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGLQYVIYTML 312 (419) T ss_pred HHHhccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHH Confidence 65 34578999999999999999999999999999999999999999999999996 45677889999999999999 Q ss_pred HHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCC Q lcl|NC_018285. 299 RYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNR 371 (383) Q Consensus 299 P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~ 371 (383) |+++.|+++|+++|++. ++||....++.|..++++.+..++++|++|+||+|+++|++|++++|..... +|. T Consensus 313 P~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~-~n~ 391 (419) T protein:vir:57 313 AILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGGDKYLTP-LNM 391 (419) T ss_pred HHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeec-ccc Confidence 99999999999999864 6788888889999999999999999999999999999999999988755432 222 Q ss_pred CCC---CC-CCCCCCC Q lcl|NC_018285. 372 TIL---KG-GETNGQD 383 (383) Q Consensus 372 ~~~---~g-gd~~~~d 383 (383) .+. ++ |...++. 
T Consensus 392 ~~~~~~~~~~~~~~~~ 407 (419) T protein:vir:57 392 VDSKALTGIGKATPQQ 407 (419) T ss_pred ccccccccccCCCccc Confidence 222 22 2222211 No 28 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=100.00 E-value=2.5e-84 Score=478.94 Aligned_cols=375 Identities=18% Similarity=0.192 Sum_probs=320.5 Q ss_pred Cchhhh--hhcCCcc-----c-ccccccccchhhcccc-cCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch-- Q lcl|NC_018285. 1 MPIFNL--ATESPPN-----N-QGGFFDITDPEFLATL-NGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ-- 69 (383) Q Consensus 1 Mglf~~--~~~~~~~-----~-~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~-- 69 (383) |.||++ +..+... + .......++ +.++. .++..++.++|+++|+|++||++||++||++|+++++.. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~~ 78 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWIDQSTSKLYD--FSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV 78 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhccccccccc--ccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeecccc Confidence 999966 2221110 0 011111222 22222 245568899999999999999999999999999998643 Q ss_pred -----hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcc Q lcl|NC_018285. 
70 -----MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPR 144 (383) Q Consensus 70 -----~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~ 144 (383) .+.|+.+||++||+++||+.++.+++++||||++|+|+.+|++++|+||+|++|++..+.+++.++|.+...+ T Consensus 79 ~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~-- 156 (412) T protein:vir:26 79 VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAAT-- 156 (412) T ss_pred ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCC-- Confidence 2357789999999999999999999999999999999999999999999999999999888888888887554 Q ss_pred cccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHH Q lcl|NC_018285. 145 IPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQ 224 (383) Q Consensus 145 ~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~ 224 (383) +....|+++||||+|++++.++++|+||+.++...+....+++++. ++.++..++++++.++.+++++++++++.|. T Consensus 157 -g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~~--~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~ 233 (412) T protein:vir:26 157 -GNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNVGKEKRQQVLEDFK 233 (412) T ss_pred -ceEEEEccccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecCCCCCHHHHHHHHHHHH Confidence 3466799999999999888888999999999999999999998884 6666667788889999999999999999999 Q ss_pred HhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc--ccCcCHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 225 AMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ--GDQQSSLEMSSNVYSKAVARYLR 302 (383) Q Consensus 225 ~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~--~~~~~~~e~~~~~~~~~l~P~~~ 302 (383) ...+++|+++|+++|++|++++.++.|+||+|.+++++++||++|||||.+||+. 
+++++.+++.+.|+.+||.|+++ T Consensus 234 ~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~ 313 (412) T protein:vir:26 234 QYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVK 313 (412) T ss_pred HHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHH Confidence 9889999999999999999999999999999999999999999999999999963 46778899999999999999999 Q ss_pred HHHHHHHHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCC--- Q lcl|NC_018285. 303 PFLSELSQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNR--- 371 (383) Q Consensus 303 ~i~~~l~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~--- 371 (383) +|+++|+++|+++ ++||...+.+.|..++++.+++++++|++|+||+|+++|++|++++|..... .|. T Consensus 314 ~ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~-~n~~~~ 392 (412) T protein:vir:26 314 QYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLIS-GDLYPI 392 (412) T ss_pred HHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeec-cccccc Confidence 9999999999864 5677888888999999999999999999999999999999999988765542 122 Q ss_pred -------CCCCCCCCCCCC Q lcl|NC_018285. 372 -------TILKGGETNGQD 383 (383) Q Consensus 372 -------~~~~ggd~~~~d 383 (383) ...+|||.|++| T Consensus 393 ~~~~~~~~~~~gG~~n~~e 411 (412) T protein:vir:26 393 DTPLELRKSLKGGDKNVNE 411 (412) T ss_pred ccchhhcccccCCCCCcCC Confidence 246899999999 No 29 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=100.00 E-value=2.4e-84 Score=478.98 Aligned_cols=376 Identities=19% Similarity=0.191 Sum_probs=320.8 Q ss_pred CchhhhhhcCCc-ccccccc-cccchhhcccc-cCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch-------h Q lcl|NC_018285. 
1 MPIFNLATESPP-NNQGGFF-DITDPEFLATL-NGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ-------M 70 (383) Q Consensus 1 Mglf~~~~~~~~-~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~-------~ 70 (383) |+||+|+++.-- ....... +..+ +.++. .++..++.+.|+++++|++||++||++||++|+++++.. . T Consensus 4 ~~~~~~~k~~~~~~~~~~~~~~~~~--~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~~~~~~~l~ 81 (409) T protein:vir:96 4 ENIVTRIKKKLIDNWIDQSASKLYD--FSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVVNTEVS 81 (409) T ss_pred ccchhhhhhHHhhhhhccccccccc--cccccCccccccchhhHhhhHHHHHHHHHHHHhhhhCceEEeecccccchhHH Confidence 899999876521 1111111 1111 11111 234557889999999999999999999999999998643 2 Q ss_pred hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCccccccee Q lcl|NC_018285. 71 QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQH 150 (383) Q Consensus 71 ~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~ 150 (383) +.|+.+||++||+++||+.++.+++++||||++|+|+.+|++++|||++|++|++..+.++..++|.+...+ +..+. T Consensus 82 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~---g~~~~ 158 (409) T protein:vir:96 82 DLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAAT---GNKLI 158 (409) T ss_pred HHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCC---ceEEE Confidence 357789999999999999999999999999999999999999999999999999999888888888876544 34678 Q ss_pred ecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCC Q lcl|NC_018285. 151 VPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQ 230 (383) Q Consensus 151 ~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~ 230 (383) ++++||||+|++++.++++|+||+..+..++....+++++. 
++.++..++++++.++.+++++++++++.|....+++ T Consensus 159 ~~~~evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~ 236 (409) T protein:vir:96 159 VHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNVSTEKRQQVLEDFKQYYEEN 236 (409) T ss_pred EccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCceeEEecCCCCCHHHHHHHHHHHHHHhhcC Confidence 99999999998877888999999999999999999988874 5555556678888889999999999999999989999 Q ss_pred cceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 231 GGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSKAVARYLRPFLSEL 308 (383) Q Consensus 231 g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l 308 (383) |+++++++|++|++++.++.|+||+|.+++++++||++|||||++||+ .+++++.+++.+.|+++||.|++++|+++| T Consensus 237 g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~l 316 (409) T protein:vir:96 237 GGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEF 316 (409) T ss_pred CCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999999999999999996 346778899999999999999999999999 Q ss_pred HHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhC---------CCC Q lcl|NC_018285. 309 SQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGEN---------PNR 371 (383) Q Consensus 309 ~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~---------~~~ 371 (383) +++|+++ ++||.....+.|..++++.+++++++|++|+||+|+++|++|++++|...... ... 
T Consensus 317 ~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~ 396 (409) T protein:vir:96 317 NRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELR 396 (409) T ss_pred HhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcceeeecccccccccchhhc Confidence 9999864 56777788889999999999999999999999999999999998888654321 112 Q ss_pred CCCCCCCCCCCC Q lcl|NC_018285. 372 TILKGGETNGQD 383 (383) Q Consensus 372 ~~~~ggd~~~~d 383 (383) ...+|||+|++| T Consensus 397 ~~~~gG~~n~~e 408 (409) T protein:vir:96 397 KSLKGGDKNVNE 408 (409) T ss_pred ccccCCCCCcCC Confidence 246899999999 No 30 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=100.00 E-value=2.4e-84 Score=479.06 Aligned_cols=375 Identities=14% Similarity=0.176 Sum_probs=309.7 Q ss_pred CchhhhhhcCCcc-----cc----------------cccccccchhh----cccccCCceechhhhhccHHHHHHHHHHH Q lcl|NC_018285. 1 MPIFNLATESPPN-----NQ----------------GGFFDITDPEF----LATLNGSEWVSAETALKNSDLFSIISQLS 55 (383) Q Consensus 1 Mglf~~~~~~~~~-----~~----------------~~~~~~~~~~~----~~~~~~~~~~~~~~a~~~~~v~~~i~~ia 55 (383) ||||+++++++.. .+ ..+.+..++.+ .+...++..++.++||++++|++||++|| T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~ci~~Ia 80 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMAVLRCVTLIS 80 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHHHHHHHHHHH Confidence 9999998763211 00 00111112221 12234567789999999999999999999 Q ss_pred HhhhhCceeeecch-----------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeE Q lcl|NC_018285. 
56 NDLATAKLTTSRKQ-----------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVS 124 (383) Q Consensus 56 ~~ia~~p~~~~~~~-----------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~ 124 (383) ++||++|+++++.+ .+.|+.+||++||+++||+.++.+++++||||++|+|+. |++++|+|++|++|+ T Consensus 81 ~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~L~pl~~~~v~ 159 (431) T protein:vir:10 81 GTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG-NRPIRLIPMDRGSAK 159 (431) T ss_pred HhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CceEEEEEEcCceeE Confidence 99999999998753 234678999999999999999999999999999999985 899999999999999 Q ss_pred EEEcCCCceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCccee Q lcl|NC_018285. 125 FNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGI 204 (383) Q Consensus 125 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i 204 (383) +..+.++ .++|++...+ +..+.|+++||||+|+++.++ ++|+||+..+..+|....+++++..++|+||++|+++ T Consensus 160 ~~~~~~~-~~~y~~~~~~---g~~~~~~~~dViHir~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 234 (431) T protein:vir:10 160 GRLTSTW-QIVYDYTTPT---GDKIELPAREVFHLRDLSIDG-VSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGA 234 (431) T ss_pred EEEcCCC-eEEEEEEeCC---ceEEEEchhhEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEE Confidence 9876554 5667766443 346789999999999887665 7899999999999999999999999999999999999 Q ss_pred EeecCCCCHHHHHHHHHHHHHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc-- Q lcl|NC_018285. 205 LKIKGGGLLDFKTKVSRSRQAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ-- 279 (383) Q Consensus 205 ~~~~~~~~~e~~~~~~~~~~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~-- 279 (383) +++++.+++|+.+++++.|... .+|+|+++|+++|++|++++.++.|+||+|++++++++||++|||||++||+. 
T Consensus 235 l~~~~~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~ 314 (431) T protein:vir:10 235 IEVPKELSDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDT 314 (431) T ss_pred EecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCC Confidence 9999999999999999998653 46889999999999999999999999999999999999999999999999963 Q ss_pred ccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCC----CcCHHH Q lcl|NC_018285. 280 GDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSG----TLAQNQ 348 (383) Q Consensus 280 ~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g----~~t~nE 348 (383) ++++|.+++.++|+++||.|++++|+++||++|+++ ++||...+++.|..++++.+++++..| ++|+|| T Consensus 315 ~t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE 394 (431) T protein:vir:10 315 SWGSGIEQLAIFFIQYGLSHWFVSWEQAAARAFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNE 394 (431) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHH Confidence 557888999999999999999999999999999864 578888889999999999999988655 599999 Q ss_pred HHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 349 GLYILQQAEILPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 349 ~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) +|+++|++|+++++-.....+.+ ..+.+.+.+. 
T Consensus 395 ~R~~~gl~p~~~~~gD~~~~p~n--~~~~~~~~~~ 427 (431) T protein:vir:10 395 VREMLDLPRADDPVADQLRNPMT--QKQKGSGDEP 427 (431) T ss_pred HHHHhCCCCCCCccccceecccc--cccCCCCCCC Confidence 99999999997633223322221 2222221122 No 31 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=100.00 E-value=2.4e-84 Score=479.04 Aligned_cols=377 Identities=18% Similarity=0.231 Sum_probs=313.3 Q ss_pred Cc-----hhhhhhcCCcccccccccccchhhcccc-----cCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch- Q lcl|NC_018285. 1 MP-----IFNLATESPPNNQGGFFDITDPEFLATL-----NGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ- 69 (383) Q Consensus 1 Mg-----lf~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~- 69 (383) |+ +|++++.+-..+.+...+.+++.++..+ ..+..|+.++||++++|++||++||++||++|+++++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~~~~~~~~ 80 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVPISLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPLNLYQTKP 80 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCcccCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCceeEEEEcC Confidence 87 4444443323333444455455443322 356778999999999999999999999999999998643 Q ss_pred ------------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEE Q lcl|NC_018285. 70 ------------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYN 137 (383) Q Consensus 70 ------------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~ 137 (383) .+.|+.+||++||+++||+.++.+++++||||++|+|+. |++++||||+|+.|++..+.++ ..+|. 
T Consensus 81 ~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-g~~~~L~~l~p~~v~i~~~~~g-~~~y~ 158 (437) T protein:vir:10 81 DGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA-GVLIGLELMLPQRTTVKRLTSG-ALQYT 158 (437) T ss_pred CCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEEEEEEEcCcceEEEECCCC-eEEEE Confidence 234678999999999999999999999999999999984 9999999999999999887654 55666 Q ss_pred EeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHH Q lcl|NC_018285. 138 VTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKT 217 (383) Q Consensus 138 ~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~ 217 (383) +...+ +....++++||||+|+++.++ ++|+||+.++..++....++++++.++|+||++|+++++.++.+++++++ T Consensus 159 ~~~~~---g~~~~~~~~dIih~r~~~~d~-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~ 234 (437) T protein:vir:10 159 YRNVD---GTVSTLAEDDVFHVRGFSLDG-LMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRA 234 (437) T ss_pred EEecC---ceEEEEccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHH Confidence 55433 346789999999999887655 78999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHH---hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc----cCcCHHHHHH Q lcl|NC_018285. 218 KVSRSRQA---MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG----DQQSSLEMSS 290 (383) Q Consensus 218 ~~~~~~~~---~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~----~~~~~~e~~~ 290 (383) ++++.|.. +..|+|+++|+++|++|++++.++.|+||+|++++++++||++|||||++||... 
.+++.+++.+ T Consensus 235 ~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~ 314 (437) T protein:vir:10 235 EIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTL 314 (437) T ss_pred HHHHHHHHHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHH Confidence 99999865 3467899999999999999999999999999999999999999999999998643 2367788899 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcch Q lcl|NC_018285. 291 NVYSKAVARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKEL 363 (383) Q Consensus 291 ~~~~~~l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~ 363 (383) .|+++||.|++..|+++|+++|++. ++||....++.|..++++.+..++++|++|+||+|+++|++|+++++. T Consensus 315 ~f~~~tl~P~~~~ie~~l~~kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~~ 394 (437) T protein:vir:10 315 GFLTFTLRPWLTRIEQAARRSLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNAA 394 (437) T ss_pred HHHHHHHHHHHHHHHHHHHhhccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcc Confidence 9999999999999999999999864 568888888999999999999999999999999999999999987653 Q ss_pred hHHhCCCCCC----------------CCCCCCCCCC Q lcl|NC_018285. 
364 PKGENPNRTI----------------LKGGETNGQD 383 (383) Q Consensus 364 ~~~~~~~~~~----------------~~ggd~~~~d 383 (383) ...-+.+..| .+|||.++++ T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (437) T protein:vir:10 395 VLTVQSALLPIDKLGEHTTATAAQDALKAWLYQEEK 430 (437) T ss_pred eEeecCcccchhhccCcCCCcchhccccccCCCCCC Confidence 3221212221 1234443333 No 32 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=100.00 E-value=5.3e-84 Score=477.14 Aligned_cols=372 Identities=19% Similarity=0.200 Sum_probs=317.2 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch----------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ----------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~----------- 69 (383) |- |++.+++++...+. .+.....+++...++.+++.++||++++|++||++||++||++|+++++.. T Consensus 1 m~-f~~~~~~~~~~~~~-~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~~~~~l 78 (409) T protein:vir:10 1 ML-FRKGFKNQSQEISI-DDKKILEWLGINPSETYVNGKSCLKQATVFGCIRILSDNISKLPIKIYQKKDGIKRVPDHYL 78 (409) T ss_pred Cc-ccccccCcCCCCCC-ChHHHHHHhcCCcCcceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCeeeccCchH Confidence 87 55666555543221 111123345556678889999999999999999999999999999998643 Q ss_pred hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-----eeEEEEeecCcc Q lcl|NC_018285. 70 MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-----GLYYNVTFDDPR 144 (383) Q Consensus 70 ~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-----~~~y~~~~~~~~ 144 (383) .+.|+.+||++||+++||+.++.+++++||||++++|+.+|++++||||+|++|++..++++. .++|.+.... 
T Consensus 79 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~~~~~~-- 156 (409) T protein:vir:10 79 EYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSENNVWYLYTDDL-- 156 (409) T ss_pred HHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccccccceEEEEEEeCC-- Confidence 234678999999999999999999999999999999999999999999999999998865442 3455554332 Q ss_pred cccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHH Q lcl|NC_018285. 145 IPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQ 224 (383) Q Consensus 145 ~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~ 224 (383) +..+.++++||||+|++++++ ++|+||+..+..+++...++++++.++|+||+.|+++++.++.+++++++++++.|. T Consensus 157 -g~~~~~~~~evih~r~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~ 234 (409) T protein:vir:10 157 -GQRHKFMSDEILHFKGLTADG-LAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFE 234 (409) T ss_pred -ceeEEeccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHH Confidence 456789999999999988765 789999999999999999999999999999999999999999999999999999886 Q ss_pred Hh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHHHHHH Q lcl|NC_018285. 225 AM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSKAVAR 299 (383) Q Consensus 225 ~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~~l~P 299 (383) .. 
..|+|+++|+++|++|++++.++.|+||+|++++++++||++|||||.+||+ .+++++.+++.++|+++||.| T Consensus 235 ~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~~f~~~~l~P 314 (409) T protein:vir:10 235 RMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNREFYIDTLQS 314 (409) T ss_pred HHhccccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHHHHHHHHHHH Confidence 54 4578999999999999999999999999999999999999999999999985 456788899999999999999 Q ss_pred HHHHHHHHHHHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCC Q lcl|NC_018285. 300 YLRPFLSELSQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNR 371 (383) Q Consensus 300 ~~~~i~~~l~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~ 371 (383) ++++|+++||++|++. ++||.....+.|..++++.+++++++|++|+||+|+++|++|++++|..... .|. T Consensus 315 ~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~~~~~-~n~ 393 (409) T protein:vir:10 315 ILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDVLLIN-GNM 393 (409) T ss_pred HHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeec-cCc Confidence 9999999999999853 5678888888999999999999999999999999999999999888765432 344 Q ss_pred CCC--------CCCCC Q lcl|NC_018285. 372 TIL--------KGGET 379 (383) Q Consensus 372 ~~~--------~ggd~ 379 (383) .|+ +|||. 
T Consensus 394 ~~~~~~~~~~~kgGe~ 409 (409) T protein:vir:10 394 IPVKMAGEQYSKGGEK 409 (409) T ss_pred cchhhccccccccCCC Confidence 444 34444 No 33 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=100.00 E-value=5e-84 Score=477.25 Aligned_cols=381 Identities=14% Similarity=0.189 Sum_probs=310.7 Q ss_pred CchhhhhhcCCcccc-----cccccccchh---hcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--- Q lcl|NC_018285. 1 MPIFNLATESPPNNQ-----GGFFDITDPE---FLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~-----~~~~~~~~~~---~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--- 69 (383) ||||++++++..... +...+...+. +.+.+.+++.|+.++||++++|++||++||++||++|+++++.. T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGT 80 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEEEEecCCc Confidence 999999876543321 1111112222 23344567899999999999999999999999999999998643 Q ss_pred --------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc---eeEEEE Q lcl|NC_018285. 70 --------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN---GLYYNV 138 (383) Q Consensus 70 --------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~---~~~y~~ 138 (383) ...|+.+||++||+++||+.++.+++++||||++|.++ .|++.+|+||+|++|++....... 
..+|.+ T Consensus 81 ~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y 159 (457) T protein:vir:62 81 RKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDGLRRKVFEAY 159 (457) T ss_pred cccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCCccceeEEEE Confidence 23578899999999999999999999999999999665 689999999999999987754433 233334 Q ss_pred eecC-cccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHH Q lcl|NC_018285. 139 TFDD-PRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKT 217 (383) Q Consensus 139 ~~~~-~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~ 217 (383) .+.. +.......|+++||||||++++++.++|+||+.++..+|....++++++.++|+||++|+++|++++.+++|+++ T Consensus 160 ~~~~~g~~~~~~~~~~~eiih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~ 239 (457) T protein:vir:62 160 DIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGLA 239 (457) T ss_pred EEccCCceeEEEeeCccceEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHHH Confidence 3332 222334578999999999999998899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc----CcCHHHHHH Q lcl|NC_018285. 218 KVSRSRQAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD----QQSSLEMSS 290 (383) Q Consensus 218 ~~~~~~~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~----~~~~~e~~~ 290 (383) ++++.|+.. ..|+|+++||++|++|++++.+++|+||+|++++++++||++|||||++||.... 
++|.+++.+ T Consensus 240 ~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~ 319 (457) T protein:vir:62 240 RAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNI 319 (457) T ss_pred HHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHH Confidence 999998654 4578999999999999999999999999999999999999999999999996432 356788899 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCc-- Q lcl|NC_018285. 291 NVYSKAVARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPK-- 361 (383) Q Consensus 291 ~~~~~~l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~-- 361 (383) +|+.+||.|++++|+++||++|+++ ++||+..+.+.|..++++.+.+++++|++|+||+|+++|++|++++ T Consensus 320 ~f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~ 399 (457) T protein:vir:62 320 AFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLG 399 (457) T ss_pred HHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCc Confidence 9999999999999999999999875 5678888888999999999999999999999999999999999876 Q ss_pred chhHHhCCCCC---------CC-------------------C--CCCCCCCC Q lcl|NC_018285. 362 ELPKGENPNRT---------IL-------------------K--GGETNGQD 383 (383) Q Consensus 362 d~~~~~~~~~~---------~~-------------------~--ggd~~~~d 383 (383) |..... +|.. +. 
+ +|+.++.+ T Consensus 400 D~~~~~-~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 450 (457) T protein:vir:62 400 EKYRVP-LNLGEIGEEPEPEPAPAPPAIDPPAEEPADDEEPDNAEGDPDEGE 450 (457) T ss_pred ceeeec-cccccccccccccccCCCccCCCCccCCCCCCCCCCCCCCCcccc Confidence 322211 0110 00 0 11111111 No 34 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=100.00 E-value=9.9e-84 Score=475.63 Aligned_cols=374 Identities=15% Similarity=0.175 Sum_probs=311.4 Q ss_pred CchhhhhhcCCcccccc--cccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGG--FFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--------- 69 (383) |.++++++++...-... |..+........+.++..|+.++||++++|++||++||++||++|+++++.+ T Consensus 1 m~~~~~~~~~~~~~s~~~~w~~~~~~~~~~~~~~g~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~~~~~~ 80 (421) T protein:vir:10 1 MFIPQMFEGKKRSVSGGGFWEAMLGGVRSSHSKAGVMITPETALALSAVRACVTLLAESVAQLPVELYRRDKNGGRQRAT 80 (421) T ss_pred CCCcchhcccccccCcchhhHHHhhhhccCcccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCceeecc Confidence 99999887665332211 1111111112334567889999999999999999999999999999998633 Q ss_pred ----hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCccc Q lcl|NC_018285. 70 ----MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRI 145 (383) Q Consensus 70 ----~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~ 145 (383) .+.|+.+||++||+++||+.++.+++++||||++|+|+.+|+|++||||+|++|++..+.++ .++|.+... 
T Consensus 81 ~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~g-~~~y~~~~~---- 155 (421) T protein:vir:10 81 DHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPDG-MPYYEIPEI---- 155 (421) T ss_pred cchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCCc-eEEEEEcCC---- Confidence 23477899999999999999999999999999999999999999999999999999887654 455655322 Q ss_pred ccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCC----CHHHHHHHHH Q lcl|NC_018285. 146 PPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGG----LLDFKTKVSR 221 (383) Q Consensus 146 ~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~----~~e~~~~~~~ 221 (383) ...++.+||||+++++.++ ++|.||+..+..++....++++++.++|+||++|+++|+.++.. ++|+++++++ T Consensus 156 --g~~~~~~eiih~~~~~~d~-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~ 232 (421) T protein:vir:10 156 --GETLPMRMMHHVKVFSLDG-YIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLA 232 (421) T ss_pred --CcEEchhhEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHH Confidence 2368999999999987665 78999999999999999999999999999999999999988654 8889999998 Q ss_pred HHHHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHHH Q lcl|NC_018285. 222 SRQAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSKA 296 (383) Q Consensus 222 ~~~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~~ 296 (383) .|... ..|+|+++||++|++|++++.+++|+||+|.+++++++||++|||||++||. 
.++++|.+++.+.|+++| T Consensus 233 ~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~t 312 (421) T protein:vir:10 233 KWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEHQGLQFVMYT 312 (421) T ss_pred HHHHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHHHHHHHHHHH Confidence 88654 4578899999999999999999999999999999999999999999999985 456778899999999999 Q ss_pred HHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCC Q lcl|NC_018285. 297 VARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENP 369 (383) Q Consensus 297 l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~ 369 (383) |.|++++|+++||++|++. ++||.....+.|..++++.+++++++|++|+||+|+++|++|++++|.... .. T Consensus 313 l~P~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~-~~ 391 (421) T protein:vir:10 313 LLAWLKRHEGALQRDLLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGDKYLT-PL 391 (421) T ss_pred HHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeee-cc Confidence 9999999999999999864 678888888999999999999999999999999999999999988875432 12 Q ss_pred CCC----CCCCC--CCCCCC Q lcl|NC_018285. 370 NRT----ILKGG--ETNGQD 383 (383) Q Consensus 370 ~~~----~~~gg--d~~~~d 383 (383) +.. +.+|+ +.+.+. T Consensus 392 n~~~~~~~~~~~~~~~~~~~ 411 (421) T protein:vir:10 392 NMVDSAQIIPGDKKPTAQQM 411 (421) T ss_pred ccccccccccCCCCcccccC Confidence 211 11121 111111 No 35 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=100.00 E-value=1.5e-83 Score=474.69 Aligned_cols=376 Identities=18% Similarity=0.175 Sum_probs=318.8 Q ss_pred CchhhhhhcCCcc-cc-cccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch-------hh Q lcl|NC_018285. 
1 MPIFNLATESPPN-NQ-GGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ-------MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~-------~~ 71 (383) =+|+.|++..-.. +. ......+++..+. ..++..++.++|+++++|++||++||++||++|+++++.. .+ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~~~~~~~~~~ 82 (409) T protein:vir:93 4 ENIVTRIKKKLIDNWIDQSTSKLYDFSPWK-NRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVVNTEVSD 82 (409) T ss_pred cchhhhhhhhhhhhhhcccccccccccccc-CccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeeccccccchHHH Confidence 3566665442111 00 1111222221111 1234557889999999999999999999999999998643 34 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceee Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHV 151 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~ 151 (383) .|+.+||++||+++||+.++.+++++||||+++.|+.+|++++||||+|++|++..+.++..++|.+...+ +..+.+ T Consensus 83 lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~---g~~~~~ 159 (409) T protein:vir:93 83 LLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAAT---GNKLIV 159 (409) T ss_pred HHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCC---ceEEEE Confidence 57789999999999999999999999999999999999999999999999999999888888888887654 346789 Q ss_pred cccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCc Q lcl|NC_018285. 152 PQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQG 231 (383) Q Consensus 152 ~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g 231 (383) +++||||+|++++.++++|+||+.++..++....+++++. 
++.++..++++++.++.+++++++++++.|+...+++| T Consensus 160 ~~~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g 237 (409) T protein:vir:93 160 HNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENG 237 (409) T ss_pred ccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhhcCC Confidence 9999999998877788999999999999999999998884 66666677888899999999999999999998889999 Q ss_pred ceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 232 GPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSKAVARYLRPFLSELS 309 (383) Q Consensus 232 ~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~ 309 (383) +++|+++|++|++++.++.|+||+|.+++++++||++|||||++||+ .+++++.+++.+.|+..||.|++++|+++|+ T Consensus 238 ~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~ 317 (409) T protein:vir:93 238 GILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFN 317 (409) T ss_pred CeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999996 3467788999999999999999999999999 Q ss_pred Hhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCC---------- Q lcl|NC_018285. 310 QKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNR---------- 371 (383) Q Consensus 310 ~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~---------- 371 (383) ++|+++ ++||...+++.|..++++.+++++++|++|+||+|+++|++|++++|..... .|. 
T Consensus 318 ~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~-~n~~~~~~~~~~~ 396 (409) T protein:vir:93 318 RKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLIS-GDLYPIDTPLELR 396 (409) T ss_pred hhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeec-ccccccccchhhc Confidence 999865 5677778888999999999999999999999999999999999888865532 222 Q ss_pred CCCCCCCCCCCC Q lcl|NC_018285. 372 TILKGGETNGQD 383 (383) Q Consensus 372 ~~~~ggd~~~~d 383 (383) ...+|||+|++| T Consensus 397 ~~~~gG~~n~~e 408 (409) T protein:vir:93 397 KSLKGGDKNVNE 408 (409) T ss_pred ccccCCCCCcCC Confidence 245799999999 No 36 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=100.00 E-value=2.7e-83 Score=473.26 Aligned_cols=375 Identities=13% Similarity=0.143 Sum_probs=312.5 Q ss_pred CchhhhhhcCCc---ccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch-------- Q lcl|NC_018285. 1 MPIFNLATESPP---NNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ-------- 69 (383) Q Consensus 1 Mglf~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~-------- 69 (383) =|||++++.-.. .......+...+.....+.++..|+.++||++++|++||++||++||++|+++++.. T Consensus 14 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~ 93 (424) T protein:vir:18 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) T ss_pred CchHHHHHhhhcccccccccccccccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceEEEEeecCCceeee Confidence 567776653211 111111112222223334567789999999999999999999999999999998632 Q ss_pred ------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCc Q lcl|NC_018285. 
70 ------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDP 143 (383) Q Consensus 70 ------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~ 143 (383) .+.|+.+||++||+++||+.++.+++++||||++|+|+.+|++++|||++|++|++..+ ++..+|.+..+ T Consensus 94 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V~v~~~--~~~~~y~~~~~-- 169 (424) T protein:vir:18 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV--GKKVVYRYQRD-- 169 (424) T ss_pred ccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEc--CCeEEEEEEeC-- Confidence 23467899999999999999999999999999999999999999999999999998764 45677777654 Q ss_pred ccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC-CCHHHHHHHHHH Q lcl|NC_018285. 144 RIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG-GLLDFKTKVSRS 222 (383) Q Consensus 144 ~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~-~~~e~~~~~~~~ 222 (383) +..+.|+++||||+|++++++ ++|+||+..+..+++...++++++.++|+||+.|++++++++. +++++++++++. T Consensus 170 --g~~~~~~~~eIih~r~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~ 246 (424) T protein:vir:18 170 --SEYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) T ss_pred --CeEEEeccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHHHHH Confidence 345689999999999988765 7899999999999999999999999999999999999999875 789999999998 Q ss_pred HHHhh--cCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc----cCcCHHHHHHHHHHHH Q lcl|NC_018285. 223 RQAMK--QMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG----DQQSSLEMSSNVYSKA 296 (383) Q Consensus 223 ~~~~~--~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~----~~~~~~e~~~~~~~~~ 296 (383) |+... .++|+++||++|++|++++.+++|+||+|++++++++||++|||||++||... 
.+++.+++...|+++| T Consensus 247 ~~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~t 326 (424) T protein:vir:18 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) T ss_pred HHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHH Confidence 86543 57889999999999999999999999999999999999999999999999632 2367888999999999 Q ss_pred HHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCC Q lcl|NC_018285. 297 VARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENP 369 (383) Q Consensus 297 l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~ 369 (383) |.|++++|+++|+++|++. ++||+...++.|..++++.+.+++++|++|+||+|+++|++|++++|.... .. T Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGD~~~~-~~ 405 (424) T protein:vir:18 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMR-QS 405 (424) T ss_pred HHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeee-cc Confidence 9999999999999999975 568888889999999999999999999999999999999999988886544 33 Q ss_pred CCCCCC--CCCCCCCC Q lcl|NC_018285. 370 NRTILK--GGETNGQD 383 (383) Q Consensus 370 ~~~~~~--ggd~~~~d 383 (383) +..|+. |-..+++| T Consensus 406 n~~~l~~~~~~~~p~~ 421 (424) T protein:vir:18 406 QYVPITDLGTNKEPRN 421 (424) T ss_pred CccchHhhhccCCCcc Confidence 444432 22223333 No 37 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=100.00 E-value=4.1e-83 Score=472.27 Aligned_cols=381 Identities=15% Similarity=0.160 Sum_probs=315.2 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCc------eechhhhhccHHHHHHHHHHHHhhhhCceeeecch----- Q lcl|NC_018285. 
1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSE------WVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ----- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~----- 69 (383) |=|=+...-.-+... +...+....+++.+..+. .+....|+++++|++||++||++||++|+++++.+ T Consensus 1 ~~~~~~~~~~~p~~~-~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~~~~~~~ 79 (518) T protein:vir:78 1 MLLANGQTLSAPAMA-ELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) T ss_pred CcccCceeeccchhh-hhhhhhhhcccccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEEEcCCccc Confidence 544333222111111 111122222222222332 33456688999999999999999999999998643 Q ss_pred ------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCc Q lcl|NC_018285. 70 ------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDP 143 (383) Q Consensus 70 ------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~ 143 (383) ...|+.+||++||+++||+.++.+++++||+|++|.|+.+|++++||||+|++|++..+.+.+..+|++....+ T Consensus 80 ~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~~~~~y~~~~~~~ 159 (518) T protein:vir:78 80 EEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) T ss_pred cccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCCCEEEEEEEecCC Confidence 24588999999999999999999999999999999999999999999999999999998888889999888777 Q ss_pred ccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHH Q lcl|NC_018285. 
144 RIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSR 223 (383) Q Consensus 144 ~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~ 223 (383) ..+..+.|+++||||+|++++++..+|+||+.++..+|....++++++.++|+||++|+++|+.++.+++++++++++.| T Consensus 160 ~~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~~~~k~~~ 239 (518) T protein:vir:78 160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQQRLREQF 239 (518) T ss_pred ccceeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHH Confidence 77777889999999999999998788999999999999999999999999999999999999999999999999999998 Q ss_pred HHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHHHHH Q lcl|NC_018285. 224 QAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSKAVA 298 (383) Q Consensus 224 ~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~~l~ 298 (383) +.. ..|+|+++||++|++|++++.+++|+||+|.+++++++||++|||||++||. .++++|.+++...|+++||. T Consensus 240 ~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~f~~~tL~ 319 (518) T protein:vir:78 240 DRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMA 319 (518) T ss_pred HHHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHH Confidence 654 3678999999999999999999999999999999999999999999999995 45677889999999999999 Q ss_pred HHHHHHHHHHHHhhcch------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcC--CcchhHHhCCC Q lcl|NC_018285. 299 RYLRPFLSELSQKLSCD------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEIL--PKELPKGENPN 370 (383) Q Consensus 299 P~~~~i~~~l~~~l~~~------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~--~~d~~~~~~~~ 370 (383) |++++|+++||++|++. ++||+..+++.|...+++.+++++++|++|+||+|+++|++|++ ++|..... 
.+ T Consensus 320 P~~~~ie~eln~~L~~~~~~~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~-~n 398 (518) T protein:vir:78 320 IPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYAN-SA 398 (518) T ss_pred HHHHHHHHHHHHhhcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeec-cc Confidence 99999999999999864 56888888899999999999999999999999999999999986 34432221 12 Q ss_pred CCCC---CCCCCCCCC Q lcl|NC_018285. 371 RTIL---KGGETNGQD 383 (383) Q Consensus 371 ~~~~---~ggd~~~~d 383 (383) ..|+ .+|...+++ T Consensus 399 ~~pl~~~~~~~~~g~~ 414 (518) T protein:vir:78 399 LQPLGATPDGAVEGEE 414 (518) T ss_pred ceecccccccccCCCC Confidence 2221 111111111 No 38 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=100.00 E-value=3.2e-83 Score=472.88 Aligned_cols=375 Identities=13% Similarity=0.153 Sum_probs=312.5 Q ss_pred Cchhhhhhc---CCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch-------- Q lcl|NC_018285. 1 MPIFNLATE---SPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ-------- 69 (383) Q Consensus 1 Mglf~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~-------- 69 (383) =|||++++. .+..........+.+.....+.++..|+.++||++++|++||++||++||++|+++|+.. T Consensus 14 ~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~vy~~~~~~~~~~~ 93 (424) T protein:vir:18 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) T ss_pred CchHHHHHhhccccccccccchhhccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceEEEEeccCCceeee Confidence 466666543 111111111112222223345567889999999999999999999999999999998632 Q ss_pred ------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCc Q lcl|NC_018285. 
70 ------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDP 143 (383) Q Consensus 70 ------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~ 143 (383) .+.|+.+||++||+++||+.++.+++++||||++|+|+.+|++++|||++|++|++..+ ++..+|++..+ T Consensus 94 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~--~~~~~y~~~~~-- 169 (424) T protein:vir:18 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV--GKKVVYRYQRD-- 169 (424) T ss_pred ccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEc--CCeEEEEEEeC-- Confidence 23467899999999999999999999999999999999999999999999999998764 45677777654 Q ss_pred ccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC-CCHHHHHHHHHH Q lcl|NC_018285. 144 RIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG-GLLDFKTKVSRS 222 (383) Q Consensus 144 ~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~-~~~e~~~~~~~~ 222 (383) +....|+++||||+|+++.++ ++|+||+..+..+|..+.++++++.++|+||+.|+++++.++. +++++++++++. T Consensus 170 --g~~~~~~~~eVihir~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~ 246 (424) T protein:vir:18 170 --SEYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) T ss_pred --CeEEEeccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHHHHHHH Confidence 345689999999999988765 7899999999999999999999999999999999999999875 789999999988 Q ss_pred HHHhh--cCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc----cCcCHHHHHHHHHHHH Q lcl|NC_018285. 223 RQAMK--QMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG----DQQSSLEMSSNVYSKA 296 (383) Q Consensus 223 ~~~~~--~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~----~~~~~~e~~~~~~~~~ 296 (383) |+... .++|+++||++|++|++++.++.|+||+|++++++++||++|||||++||+.. 
.+++.+++..+|+.+| T Consensus 247 ~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~t 326 (424) T protein:vir:18 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) T ss_pred HHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHH Confidence 86533 57889999999999999999999999999999999999999999999999642 2367788999999999 Q ss_pred HHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCC Q lcl|NC_018285. 297 VARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENP 369 (383) Q Consensus 297 l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~ 369 (383) |.|++++|+++||++|++. ++||+...++.|..++++.+++++++|++|+||+|+++|++|++++|.... .. T Consensus 327 l~P~~~~ie~~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~-~~ 405 (424) T protein:vir:18 327 LQPYISRWENSIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGDVAMR-QA 405 (424) T ss_pred HHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeee-cc Confidence 9999999999999999865 567888889999999999999999999999999999999999988886654 34 Q ss_pred CCCCCC--CCCCCCCC Q lcl|NC_018285. 370 NRTILK--GGETNGQD 383 (383) Q Consensus 370 ~~~~~~--ggd~~~~d 383 (383) +..|+. |-+.+++| T Consensus 406 n~~~l~~~~~~~~~~~ 421 (424) T protein:vir:18 406 QYVPITDLGTNKEPRN 421 (424) T ss_pred CccchhhhhccCCccc Confidence 444432 22233333 No 39 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=100.00 E-value=4.8e-83 Score=471.89 Aligned_cols=376 Identities=19% Similarity=0.188 Sum_probs=319.0 Q ss_pred CchhhhhhcCCcc-ccc-ccccccchhhccc-ccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch-------h Q lcl|NC_018285. 
1 MPIFNLATESPPN-NQG-GFFDITDPEFLAT-LNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ-------M 70 (383) Q Consensus 1 Mglf~~~~~~~~~-~~~-~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~-------~ 70 (383) =+|++|+++.-.. +.. ......+ +.++ ..++..++.+.|+++++|++||++||++||++|+++++.. . T Consensus 4 ~~~~~~~k~~~~~~~~~~~~~~~~~--~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~ 81 (409) T protein:vir:94 4 ENIVTRIKKKLIDNWIDQSASKLYD--FSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVVNTEVS 81 (409) T ss_pred cccchhhhhHHhhhhhcCCcccccc--cccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeecccccchhHH Confidence 2466666553211 111 1111111 1121 1234557889999999999999999999999999998643 2 Q ss_pred hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCccccccee Q lcl|NC_018285. 71 QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQH 150 (383) Q Consensus 71 ~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~ 150 (383) +.|+.+||++||+++||+.++.+++++||||++|+|+.+|+|++||||+|++|++..+.++..++|.+...+ +..+. T Consensus 82 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~---g~~~~ 158 (409) T protein:vir:94 82 DLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAAT---GNKLI 158 (409) T ss_pred HHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCC---ceEEE Confidence 357789999999999999999999999999999999999999999999999999999888888888887554 34678 Q ss_pred ecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCC Q lcl|NC_018285. 151 VPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQ 230 (383) Q Consensus 151 ~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~ 230 (383) ++++||||+|++++.++++|+||+.++..++....+++++. 
++.++..++++++.++.+++++++++++.|....+++ T Consensus 159 ~~~~dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~ 236 (409) T protein:vir:94 159 VHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEEN 236 (409) T ss_pred EccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCeeEEecCCCCCHHHHHHHHHHHHHHhhcC Confidence 99999999998878788999999999999999999998885 5666666788899999999999999999999989999 Q ss_pred cceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc--ccCcCHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 231 GGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ--GDQQSSLEMSSNVYSKAVARYLRPFLSEL 308 (383) Q Consensus 231 g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~--~~~~~~~e~~~~~~~~~l~P~~~~i~~~l 308 (383) |+++|+++|++|++++.+++|+||+|.+++++++||++|||||++||+. +++++.+++.+.|+++||.|+++.|+++| T Consensus 237 g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l 316 (409) T protein:vir:94 237 GGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEF 316 (409) T ss_pred CCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999964 46778899999999999999999999999 Q ss_pred HHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCC---------CC Q lcl|NC_018285. 309 SQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENP---------NR 371 (383) Q Consensus 309 ~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~---------~~ 371 (383) +++|+++ ++||....++.|..++++.+++++++|++|+||+|+++|++|++++|....... .. 
T Consensus 317 n~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~ 396 (409) T protein:vir:94 317 NRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELR 396 (409) T ss_pred HHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeEeecccccccccchhhc Confidence 9999864 567777888899999999999999999999999999999999998886554211 11 Q ss_pred CCCCCCCCCCCC Q lcl|NC_018285. 372 TILKGGETNGQD 383 (383) Q Consensus 372 ~~~~ggd~~~~d 383 (383) ...+|||+|++| T Consensus 397 ~~~kGG~~n~~e 408 (409) T protein:vir:94 397 KSLKGGDKNVNE 408 (409) T ss_pred ccccCCCCCcCC Confidence 246899999999 No 40 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=100.00 E-value=6.7e-83 Score=471.09 Aligned_cols=381 Identities=14% Similarity=0.171 Sum_probs=308.7 Q ss_pred CchhhhhhcCCcccc-----cccccccchhh---cccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchh-- Q lcl|NC_018285. 1 MPIFNLATESPPNNQ-----GGFFDITDPEF---LATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQM-- 70 (383) Q Consensus 1 Mglf~~~~~~~~~~~-----~~~~~~~~~~~---~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~-- 70 (383) ||||++++++..... ....+..++.+ .+...+++.|+.++||++++|++||++||++||++|+++++... T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGS 80 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 999999876543321 11111222332 34455688999999999999999999999999999999997542 Q ss_pred ---------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc---eeEEEE Q lcl|NC_018285. 
71 ---------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN---GLYYNV 138 (383) Q Consensus 71 ---------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~---~~~y~~ 138 (383) ..++..||..||+++||+.++.+++++||||++|.++ .|+|++||||+|++|++......+ ..++.+ T Consensus 81 ~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y 159 (457) T protein:vir:13 81 RKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLRRKVFEAY 159 (457) T ss_pred ccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCccceeEEEE Confidence 2233444447999999999999999999999999776 589999999999999998765443 233334 Q ss_pred eecC-cccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHH Q lcl|NC_018285. 139 TFDD-PRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKT 217 (383) Q Consensus 139 ~~~~-~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~ 217 (383) .+.. +.......|+++||||++++++++.++|+||+..+..+|....++++++.++|+||++|+++|++++.+++|+++ T Consensus 160 ~~~~~~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~ 239 (457) T protein:vir:13 160 DIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLA 239 (457) T ss_pred EEecCCceeeEEeeCccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHH Confidence 3332 222344568999999999999999899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc----CcCHHHHHH Q lcl|NC_018285. 218 KVSRSRQAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD----QQSSLEMSS 290 (383) Q Consensus 218 ~~~~~~~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~----~~~~~e~~~ 290 (383) ++++.|... .+|+|+++||++|++|++++.++.|+||+|++++++++||++|||||++||.... +++.+++.. 
T Consensus 240 ~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~ 319 (457) T protein:vir:13 240 RAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNI 319 (457) T ss_pred HHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHH Confidence 999988654 4678999999999999999999999999999999999999999999999986432 356788899 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCc-- Q lcl|NC_018285. 291 NVYSKAVARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPK-- 361 (383) Q Consensus 291 ~~~~~~l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~-- 361 (383) +|+.+||.|++++|+++|+++|+++ ++||+..+.+.|..++++.+.+++++|++|+||+|+++|++|++++ T Consensus 320 ~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~ 399 (457) T protein:vir:13 320 AFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLG 399 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCccccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcc Confidence 9999999999999999999999875 5688888889999999999999999999999999999999999875 Q ss_pred chhHHhCCCCCC---------------------------CCCCCCCCCC Q lcl|NC_018285. 362 ELPKGENPNRTI---------------------------LKGGETNGQD 383 (383) Q Consensus 362 d~~~~~~~~~~~---------------------------~~ggd~~~~d 383 (383) |..... 
.|..+ .++|..++++ T Consensus 400 d~~~~~-~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~ 447 (457) T protein:vir:13 400 EKYRVP-LNLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEG 447 (457) T ss_pred cceeec-cccccccccccccccCCCCCCCCCccccCCCCCCCCCCcccc Confidence 432211 11000 0111111111 No 41 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=100.00 E-value=7.7e-83 Score=470.77 Aligned_cols=374 Identities=18% Similarity=0.266 Sum_probs=318.1 Q ss_pred chhhhhhcCCcccccccccccchhh---cc--cccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch------- Q lcl|NC_018285. 2 PIFNLATESPPNNQGGFFDITDPEF---LA--TLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ------- 69 (383) Q Consensus 2 glf~~~~~~~~~~~~~~~~~~~~~~---~~--~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~------- 69 (383) =||+++++++.... .....+.+.+ ++ .+.++..|+.++|+++++|++||++||++||++|+++++.. T Consensus 1 m~~~~~f~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~~~~~~ 79 (416) T protein:vir:12 1 MLLERMFEKRSGSS-DHEDGFNNILLNMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKLPIHTYKRTDGGIERK 79 (416) T ss_pred CccchhcccccCcc-ccCccchhHHHHhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCccccc Confidence 36688776654332 2222222222 22 23456789999999999999999999999999999998643 Q ss_pred -----hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcc Q lcl|NC_018285. 
70 -----MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPR 144 (383) Q Consensus 70 -----~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~ 144 (383) ...|+.+||++||+++||+.++.+++++||||++|.|+..|++.+||||+|++|++..+.+++.++|.+..++ T Consensus 80 ~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~g-- 157 (416) T protein:vir:12 80 PEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPTTGMLWYQTVLNG-- 157 (416) T ss_pred cccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCCCcEEEEEEecCC-- Confidence 2347889999999999999999999999999999999999999999999999999999888888888876543 Q ss_pred cccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHH Q lcl|NC_018285. 145 IPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQ 224 (383) Q Consensus 145 ~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~ 224 (383) ..+.++++||||++++++++ ++|.||+.++..++....++++++.++|+||+.|+++++.++.+++|+++++++.|+ T Consensus 158 --~~~~~~~~eiih~~~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~ 234 (416) T protein:vir:12 158 --KAIELYDYEVLHFKGLSTDG-IHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVRKEWK 234 (416) T ss_pred --eEEEecCccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHH Confidence 45789999999999887665 789999999999999999999999999999999999999999999999999999997 Q ss_pred HhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 225 AMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSKAVARYLR 302 (383) Q Consensus 225 ~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~~l~P~~~ 302 (383) .. 
.++++++|+++|++|++++.++.|+||+|.+++++++||++|||||++||+ .+++++.+++.++|+.+||.|+++ T Consensus 235 ~~-~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~ 313 (416) T protein:vir:12 235 RV-NKVENIAIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIEYVRNTLQPWIV 313 (416) T ss_pred HH-hcCCCeeecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHHHHHHHHH Confidence 54 467899999999999999999999999999999999999999999999986 456778899999999999999999 Q ss_pred HHHHHHHHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCC-- Q lcl|NC_018285. 303 PFLSELSQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRT-- 372 (383) Q Consensus 303 ~i~~~l~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~-- 372 (383) +|+++|+++|++. ++||.....+.|..++++.+++++++|++|+||+|+++|++|++++|.... ..|.. T Consensus 314 ~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~~~-~~n~~~~ 392 (416) T protein:vir:12 314 NFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGDKYIS-SLNYVFL 392 (416) T ss_pred HHHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeee-ccccccc Confidence 9999999999853 567888888899999999999999999999999999999999988775432 11111 Q ss_pred --------C-----CCCCCCCCCC Q lcl|NC_018285. 
373 --------I-----LKGGETNGQD 383 (383) Q Consensus 373 --------~-----~~ggd~~~~d 383 (383) + .+|||++++= T Consensus 393 ~~~~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 393 DFLEEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred cccchhhccccccccCCCCCcCCC Confidence 1 2455532222 No 42 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=100.00 E-value=1e-82 Score=470.10 Aligned_cols=380 Identities=15% Similarity=0.149 Sum_probs=313.3 Q ss_pred CchhhhhhcCCcccc-cccccccchhhcccccCC------ceechhhhhccHHHHHHHHHHHHhhhhCceeeecch---- Q lcl|NC_018285. 1 MPIFNLATESPPNNQ-GGFFDITDPEFLATLNGS------EWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ---- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~------~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~---- 69 (383) |=|-+.-. ..++. .+........+...+..+ ..+....|+++++|++||++||++||++|+++++.. T Consensus 1 ~~~~~~~~--~~~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~~~~~~ 78 (518) T protein:vir:10 1 MLLANGQT--LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) T ss_pred CcccCcee--ecCchhhhhhhhhhcccccccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEEEcCCCc Confidence 54433211 11111 011111111111111122 233456688999999999999999999999998753 Q ss_pred -------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecC Q lcl|NC_018285. 70 -------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDD 142 (383) Q Consensus 70 -------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~ 142 (383) ...|+.+||++||+++||+.++.+++++||||++++|+.+|+|++|+||+|++|++..+.+.+..+|.+.... 
T Consensus 79 ~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~~~~~~y~~~~~~ 158 (518) T protein:vir:10 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGA 158 (518) T ss_pred eeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCCCCEEEEEEEecC Confidence 3458899999999999999999999999999999999999999999999999999999888888899888777 Q ss_pred cccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHH Q lcl|NC_018285. 143 PRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRS 222 (383) Q Consensus 143 ~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~ 222 (383) +..+..+.|+++||||+|++++++..+|+||+.++..+|....++++++.++|+||++|+++++.++.+++++++++++. T Consensus 159 ~~~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~k~~ 238 (518) T protein:vir:10 159 GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQ 238 (518) T ss_pred CccceEEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHH Confidence 66667788999999999999999888999999999999999999999999999999999999999999999999999999 Q ss_pred HHHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHHHH Q lcl|NC_018285. 223 RQAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSKAV 297 (383) Q Consensus 223 ~~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~~l 297 (383) |+.. ..|+|+++||++|++|++++.+++|+||+|.+++++++||++|||||++||. 
.++++|.+++.+.|+++|| T Consensus 239 ~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL 318 (518) T protein:vir:10 239 FDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTM 318 (518) T ss_pred HHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHH Confidence 8654 4688999999999999999999999999999999999999999999999995 4567888999999999999 Q ss_pred HHHHHHHHHHHHHhhcch------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcC--CcchhHHhCC Q lcl|NC_018285. 298 ARYLRPFLSELSQKLSCD------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEIL--PKELPKGENP 369 (383) Q Consensus 298 ~P~~~~i~~~l~~~l~~~------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~--~~d~~~~~~~ 369 (383) .|++++|+++||++|++. ++||+..+++.|...+++.++.++++|++|+||+|+++|++|++ ++|..... . T Consensus 319 ~P~l~~ie~~ln~~L~~~~~~~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~-~ 397 (518) T protein:vir:10 319 AIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYAN-S 397 (518) T ss_pred HHHHHHHHHHHHHhhcccccCCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeec-c Confidence 999999999999999864 67888888899999999999999999999999999999999986 34533221 1 Q ss_pred CCCCC---CCCCCCCCC Q lcl|NC_018285. 
370 NRTIL---KGGETNGQD 383 (383) Q Consensus 370 ~~~~~---~ggd~~~~d 383 (383) |..|+ ..|...+++ T Consensus 398 n~~pl~~~~~~~~~g~~ 414 (518) T protein:vir:10 398 ALQPLGATPDGAVEGEE 414 (518) T ss_pred cceecccccccccCCCC Confidence 12221 111111111 No 43 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=100.00 E-value=3.3e-82 Score=467.29 Aligned_cols=374 Identities=21% Similarity=0.340 Sum_probs=302.0 Q ss_pred CchhhhhhcCCccccccccc-ccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch---------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFD-ITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ---------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~---------- 69 (383) ||||+.+... ....|.. +.++.+.+++. +.+ +..+||++++||+||++||++||++|+++++.. T Consensus 1 m~~~~~~~~~---~~~~~~~~~~~~~~~~~~~-g~~-~~~~Al~~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~ 75 (417) T protein:vir:38 1 MKLFRGLATE---VDPHWADHLLDSGVIPSFR-GGY-LGISALRNSDVLTAVSIVSGDVSRFPLVITDSSTDEVIDLANI 75 (417) T ss_pred CccccccccC---CCccchhhhcccccccccC-Cce-echhhcccHHHHHHHHHHHHhhccCeeEEEEcCCcceeccchH Confidence 9999653322 1222222 12333333333 333 345799999999999999999999999998643 Q ss_pred hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-CceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccc Q lcl|NC_018285. 70 MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-GRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPK 148 (383) Q Consensus 70 ~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~ 148 (383) .+.|+.+||++||+++||+.++.+++++||||++|+|+.. |.|..|+|++|++|++...+.+ .+.|++...+ ++.. 
T Consensus 76 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~-~~~y~~~~~~--~~~~ 152 (417) T protein:vir:38 76 EYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPD-NIIYRFTPYN--SSMQ 152 (417) T ss_pred HHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCC-eEEEEEEEcC--CcEE Confidence 2347789999999999999999999999999999999864 6799999999999998776544 5667766544 3345 Q ss_pred eeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhh- Q lcl|NC_018285. 149 QHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMK- 227 (383) Q Consensus 149 ~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~- 227 (383) ..++++||||||+++.++ ++|+||+.++..+|....++++++.++|+||++|+++++.++.+++++++++++.|+... T Consensus 153 ~~~~~~dviH~r~~~~d~-~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~~~~~~ 231 (417) T protein:vir:38 153 KVCGFEDVIHWKFFSYDT-IMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKESRLSAEARQKIREDFERAQA 231 (417) T ss_pred EEecCcceEEecCCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCHHHHHHHHHHHHHHhc Confidence 679999999999886654 789999999999999999999999999999999999999999999999999999886543 Q ss_pred -cCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 
228 -QMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLS 306 (383) Q Consensus 228 -~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~ 306 (383) .|+|+++|+++|++|++++.++.|+||+|.+++++++||++|||||++||..+++++.+++.++|+.+||.|++++|++ T Consensus 232 g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~s~~e~~~~~~~~~tl~P~~~~ie~ 311 (417) T protein:vir:38 232 GADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLAQNSPNQSVKQLADDYIRNDLPFYFEPITS 311 (417) T ss_pred ccccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCCCCcchhHHHHHHHHHHHHHHHHHHHHHH Confidence 4789999999999999999999999999999999999999999999999988889999999999999999999999999 Q ss_pred HHHHhhcchhhcc-chhhhccCH--HHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHh-C-------------- Q lcl|NC_018285. 307 ELSQKLSCDVDAD-IFPAVDPTG--ANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGE-N-------------- 368 (383) Q Consensus 307 ~l~~~l~~~~e~~-~~~~~~~~~--~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~-~-------------- 368 (383) +|+++|++..+.. ....++.+. ......+.+++++|++|+||+|+++|++|+++++.++.. . T Consensus 312 ~l~~~Ll~~~~~~~~~~~fd~~~l~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~ 391 (417) T protein:vir:38 312 EFELKLLDDAQRHQYCIGFDTKSVNGLPIADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQA 391 (417) T ss_pred HHHhhhcChhhcccceEEechhhhhHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeeccccccccccccccc Confidence 9999999764321 111111111 122356778999999999999999999999887543321 1 Q ss_pred CCCCCCCCCCCCCCC Q lcl|NC_018285. 
369 PNRTILKGGETNGQD 383 (383) Q Consensus 369 ~~~~~~~ggd~~~~d 383 (383) ......+|||+++++ T Consensus 392 ~~~~~~kgg~~~~~~ 406 (417) T protein:vir:38 392 EHAAELKGGDTNAKG 406 (417) T ss_pred ccccccCCCCCCCCC Confidence 112234788876666 No 44 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=100.00 E-value=4.9e-82 Score=466.35 Aligned_cols=376 Identities=14% Similarity=0.199 Sum_probs=310.4 Q ss_pred CchhhhhhcCCcccccccccccchhhc-ccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch---------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFL-ATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ---------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~---------- 69 (383) |.|-+.+.++.......+..+.+.... ..+.++..|+.+.|+++++|++||++||+++|++|+++++.. T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~~ 80 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYDTYTGKRISSQRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTLKTRVVDE 80 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcceeeccc Confidence 544333222222221122222221111 123456788999999999999999999999999999998643 Q ss_pred --hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCccccc Q lcl|NC_018285. 70 --MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPP 147 (383) Q Consensus 70 --~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~ 147 (383) .+.|+.+||++||+++||+.++.+++++||||++++|+ .|+|++||||+|++|++..+.++ ...|++...+ +. 
T Consensus 81 ~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~-~~~y~~~~~~---g~ 155 (413) T protein:vir:48 81 RLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQW-QPVYQVTFPD---GS 155 (413) T ss_pred HHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCCc-eEEEEEEecC---ce Confidence 23466899999999999999999999999999999986 58999999999999999887654 4566665544 34 Q ss_pred ceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHh- Q lcl|NC_018285. 148 KQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAM- 226 (383) Q Consensus 148 ~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~- 226 (383) ...++++||||++++++++ ++|+||+..+..++....++++++.++|+||+.|+++++.++.+++|+++++++.|... T Consensus 156 ~~~~~~~evih~~~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~ 234 (413) T protein:vir:48 156 VDVLTQDEIWHVRTLTLDG-LVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERH 234 (413) T ss_pred EEEEccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHh Confidence 5679999999999887765 78999999999999999999999999999999999999999999999999999988654 Q ss_pred --hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 227 --KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSKAVARYLR 302 (383) Q Consensus 227 --~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~~l~P~~~ 302 (383) ..|+|+++|+++|++|++++.++.|+||+|.+++++++||++|||||++||. 
.++++|.+++...|++.||.|+++ T Consensus 235 ~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~P~~~ 314 (413) T protein:vir:48 235 TGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLT 314 (413) T ss_pred cCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHHHHHH Confidence 4678999999999999999999999999999999999999999999999996 456778899999999999999999 Q ss_pred HHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCC Q lcl|NC_018285. 303 PFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILK 375 (383) Q Consensus 303 ~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ 375 (383) +|+++||++|+++ ++||.....+.|...+++.+++++++|++|+||+|+++|++|++++|.... ..+..+.+ T Consensus 315 ~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~ggD~~~~-~~n~~~~~ 393 (413) T protein:vir:48 315 RIEQRINTGLVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLT-PMNMTTSP 393 (413) T ss_pred HHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeec-cccccccc Confidence 9999999999864 678888888999999999999999999999999999999999998886554 23333321 Q ss_pred -CCCCCCCC Q lcl|NC_018285. 376 -GGETNGQD 383 (383) Q Consensus 376 -ggd~~~~d 383 (383) .|+.++++ T Consensus 394 ~~~~~~~~~ 402 (413) T protein:vir:48 394 SAGDDNGKK 402 (413) T ss_pred cccccCCCC Confidence 22222111 No 45 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=100.00 E-value=4.5e-82 Score=466.57 Aligned_cols=373 Identities=23% Similarity=0.328 Sum_probs=303.9 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch---------hh Q lcl|NC_018285. 
1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ---------MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~---------~~ 71 (383) ||||+.....+... ..+ ...+++.. ....++..+||++++|++||++||++||++|+++++.+ .+ T Consensus 1 m~~f~~~~~~~~~~-~~~----~~~~~~~~-~~~~~~~~~Al~~~~V~~~i~~Ia~~iA~lp~~~~~~~g~~~~~~~~~~ 74 (406) T protein:vir:97 1 MSFFQPLGTSKVSY-DDY----ISSVLAGD-VSQKYLGVSALKNSDILTATSIIAGDIARFPLVKKDVNGDIIHDEDINY 74 (406) T ss_pred CccccccCCCCCCc-chH----HHHHhcCC-CCcccccchhhccHHHHHHHHHHHHhhhhCeeEEEecCccccccchHHH Confidence 99998654332221 111 11122221 22334556799999999999999999999999988643 23 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecC-CCceeEEEEeccceeEEEEcCCCceeEEEEeecCccccccee Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRND-NGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQH 150 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~-~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~ 150 (383) .|+.+||++||+++||+.++.+++++||||++|+|+. .|++.+|+|++|+.|++...++ +.++|++... ..+..+. T Consensus 75 lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~-~~~~y~~~~~--~~~~~~~ 151 (406) T protein:vir:97 75 LLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDN-HEIVYTFTDM--LTAKQVK 151 (406) T ss_pred HhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCC-ceEEEEEEec--CCceEEE Confidence 4668999999999999999999999999999999985 6899999999999999887654 4667776533 3456678 Q ss_pred ecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhh--c Q lcl|NC_018285. 151 VPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMK--Q 228 (383) Q Consensus 151 ~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~--~ 228 (383) ++++||||+|+++.++ +.|+||+.++..++....++++++.++|+||+.|++++..++.+++++++++++.|+... . 
T Consensus 152 ~~~~evih~r~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~ 230 (406) T protein:vir:97 152 CFAHDVIHWKFFSHDT-ILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGAQLSGDARQRARQEFEKMREGS 230 (406) T ss_pred EccccEEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecCCCCCHHHHHHHHHHHHHHhccc Confidence 9999999999876554 779999999999999999999999999999999999999988999999999999886544 4 Q ss_pred CCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 229 MQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSEL 308 (383) Q Consensus 229 ~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l 308 (383) |+|+++||++|++|++++.+++|+||+|.+++++++||++|||||++||+.+++++++++.+.|+.+||+|++++|+++| T Consensus 231 n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~l 310 (406) T protein:vir:97 231 VGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQSVAQLMEDYVTNDLPFYFDAITSEL 310 (406) T ss_pred ccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77899999999999999999999999999999999999999999999999888899999999999999999999999999 Q ss_pred HHhhcchhhc---cchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHh-------------CCC Q lcl|NC_018285. 309 SQKLSCDVDA---DIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILP--KELPKGE-------------NPN 370 (383) Q Consensus 309 ~~~l~~~~e~---~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~-------------~~~ 370 (383) +++|++..+. .+...++.+.......+.+++++|++|+||+|+++|++|+++ +|..... +.. T Consensus 311 ~~kll~~~~~~~~~i~fd~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~ 390 (406) T protein:vir:97 311 GLKTLNDKDRRLYHIEFDTRSVTGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLNYVFLDKKEEYQDKV 390 (406) T ss_pred hhhhcChhhccceeEEEecCccchhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccCccchhccccccccc Confidence 9999975321 111122344455567778899999999999999999999865 4533321 111 Q ss_pred CCCCCCCCCCCCC Q lcl|NC_018285. 
371 RTILKGGETNGQD 383 (383) Q Consensus 371 ~~~~~ggd~~~~d 383 (383) ....+|||+++++ T Consensus 391 ~~~~~gg~~~~~~ 403 (406) T protein:vir:97 391 GIKGKGGEVNAEE 403 (406) T ss_pred ccccCCCCCCCCC Confidence 1234799988888 No 46 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=100.00 E-value=6.1e-82 Score=465.82 Aligned_cols=376 Identities=17% Similarity=0.234 Sum_probs=309.0 Q ss_pred Cchhhhhh---cCCcc----cccccccccchhhcc-----cccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecc Q lcl|NC_018285. 1 MPIFNLAT---ESPPN----NQGGFFDITDPEFLA-----TLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRK 68 (383) Q Consensus 1 Mglf~~~~---~~~~~----~~~~~~~~~~~~~~~-----~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~ 68 (383) =.|++.+. ..+.. +.+...+.+++.++. .+.++..|+.++||++++|++||++||++||++|+++|+. T Consensus 3 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~~al~~~~V~~~i~~ia~~ia~lp~~~~~~ 82 (434) T protein:vir:43 3 KSLGKVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRESSSGKKVTVDKAMKLSAVWACVRLISTSVAGLPLGVYER 82 (434) T ss_pred cchhhhhhhcccccchhhhcccccccccCchHHHHHHhcCCccCCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEE Confidence 11222222 22211 112223344444432 2335778999999999999999999999999999999864 Q ss_pred h-------------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeE Q lcl|NC_018285. 
69 Q-------------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLY 135 (383) Q Consensus 69 ~-------------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~ 135 (383) + .+.|+.+||++||+++||+.++.+++++||+|++|.++ +|+|++|+||+|++|++..+.++ ..+ T Consensus 83 ~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p~~v~~~~~~~g-~~~ 160 (434) T protein:vir:43 83 KADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPAALDFLLPSRVDLECDENG-RLK 160 (434) T ss_pred cCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEcCCC-eEE Confidence 3 23467899999999999999999999999999999876 69999999999999999887665 455 Q ss_pred EEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHH Q lcl|NC_018285. 136 YNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDF 215 (383) Q Consensus 136 y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~ 215 (383) |++...+ +..+.++++||||+++++.++ ++|+||+..+..++....++++++.++|+||++|+++++.++.+++++ T Consensus 161 y~~~~~~---g~~~~~~~~eVih~~~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~ 236 (434) T protein:vir:43 161 YFYTTKK---GARREIERTNMLHIPAFTLDG-RIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQPAQ 236 (434) T ss_pred EEEEecC---ceEEEEccccEEEecCcCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCCHHH Confidence 5555433 456789999999999886665 789999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHH--hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc----cCcCHHHHH Q lcl|NC_018285. 216 KTKVSRSRQA--MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG----DQQSSLEMS 289 (383) Q Consensus 216 ~~~~~~~~~~--~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~----~~~~~~e~~ 289 (383) ++++++.|+. +..|+|+++|+++|++|++++.++.|+||+|.+++++++||++|||||++||... ++++.+++. 
T Consensus 237 ~~~~r~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~ 316 (434) T protein:vir:43 237 REEFREYVKSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQM 316 (434) T ss_pred HHHHHHHHHHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHH Confidence 9999888864 3467899999999999999999999999999999999999999999999998532 256778889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc Q lcl|NC_018285. 290 SNVYSKAVARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE 362 (383) Q Consensus 290 ~~~~~~~l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d 362 (383) ..|+.+||.|++.+|+++||++|++. ++||+...++.|..++++.+.+++++|++|+||+|+++|++|++++| T Consensus 317 ~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD 396 (434) T protein:vir:43 317 LAFLTFSISSITNQIQQCVNKRLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPGGD 396 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 99999999999999999999999864 67888888899999999999999999999999999999999998887 Q ss_pred hhHHhCCCCCCCCC-CC-----------------CCCCC Q lcl|NC_018285. 363 LPKGENPNRTILKG-GE-----------------TNGQD 383 (383) Q Consensus 363 ~~~~~~~~~~~~~g-gd-----------------~~~~d 383 (383) .... ..|..|++. 
|+ .+++| T Consensus 397 ~~~~-~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 397 ILTV-QSNLVPIDQLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred eEee-ccCccchhhhhccCCCcchhhhhhccCCCCCCCC Confidence 5543 234444321 11 11222 No 47 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=100.00 E-value=2.3e-81 Score=462.67 Aligned_cols=372 Identities=15% Similarity=0.149 Sum_probs=305.5 Q ss_pred chhhhhhcCCccc-ccccccccchhh-cccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch---------- Q lcl|NC_018285. 2 PIFNLATESPPNN-QGGFFDITDPEF-LATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ---------- 69 (383) Q Consensus 2 glf~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~---------- 69 (383) =||++...+.... .....++....+ ...+.++..||.++||++++|++||++||++||++|+++++.. T Consensus 1 ~~~~r~~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~ 80 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMSAGGWVSALLGSSRSDSGQVVTPASALALTVLQNCVTLLAESIAQLPIELYERSGEDRKPATDH 80 (419) T ss_pred CcccccccccccccccCcchhhHHhhcCCCccCCcccchHHhhccHHHHHHHHHHHHhhccCceEEEEecCCcccccccc Confidence 2445443332221 111111111111 1234567889999999999999999999999999999998643 Q ss_pred --hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCccccc Q lcl|NC_018285. 
70 --MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPP 147 (383) Q Consensus 70 --~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~ 147 (383) ...|+.+||++||+++||+.++.+++++||||++|+|+.+|+|++||||+|++|++..+.++ ..+|++...+ T Consensus 81 ~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~-~~~y~~~~~~----- 154 (419) T protein:vir:14 81 PLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDL-KPVYRVRGSD----- 154 (419) T ss_pred HHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCc-eEEEEEccCc----- Confidence 23467899999999999999999999999999999999999999999999999999886655 4556654322 Q ss_pred ceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCC----CHHHHHHHHHHH Q lcl|NC_018285. 148 KQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGG----LLDFKTKVSRSR 223 (383) Q Consensus 148 ~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~----~~e~~~~~~~~~ 223 (383) .++.++|+|+++++.++ ++|+||+..+..++....++++++.++|+||+.|+++|+.++.. ++++.+++++.| T Consensus 155 --~~~~~~i~h~~~~~~dg-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~ 231 (419) T protein:vir:14 155 --PMPQRLVHHVRWMSING-YTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGW 231 (419) T ss_pred --ccchhheeEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHH Confidence 37889999999887665 78999999999999999999999999999999999999998765 477888888888 Q ss_pred HHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHHHHH Q lcl|NC_018285. 224 QAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSKAVA 298 (383) Q Consensus 224 ~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~~l~ 298 (383) +.. ..|+|+++|+++|++|++++.++.|+||+|++++++++||++|||||++||. .+++++.+++.+.|+++||. 
T Consensus 232 ~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~f~~~~L~ 311 (419) T protein:vir:14 232 NAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLL 311 (419) T ss_pred HHHhcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHH Confidence 654 4577899999999999999999999999999999999999999999999985 46678889999999999999 Q ss_pred HHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCC Q lcl|NC_018285. 299 RYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNR 371 (383) Q Consensus 299 P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~ 371 (383) |++++|+++|+++|++. ++||...+.+.|...+++.+++++++|++|+||+|+++|++|++++|..... .|. T Consensus 312 P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~~~~-~n~ 390 (419) T protein:vir:14 312 PWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSP-MNM 390 (419) T ss_pred HHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeec-ccc Confidence 99999999999999864 6788888889999999999999999999999999999999999988865432 222 Q ss_pred CC------CCCCCCCCCC Q lcl|NC_018285. 372 TI------LKGGETNGQD 383 (383) Q Consensus 372 ~~------~~ggd~~~~d 383 (383) .+ .++|+.++.+ T Consensus 391 ~~~~~~~~~~~~~~~~~~ 408 (419) T protein:vir:14 391 VDASKPQQLPVGKSEPTK 408 (419) T ss_pred ccccccccccCCCCCCcc Confidence 21 1223332222 No 48 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=100.00 E-value=2.4e-81 Score=462.60 Aligned_cols=374 Identities=14% Similarity=0.138 Sum_probs=309.0 Q ss_pred CchhhhhhcCCcccccccccccchhh-cccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchh--------- Q lcl|NC_018285. 
1 MPIFNLATESPPNNQGGFFDITDPEF-LATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQM--------- 70 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~--------- 70 (383) |-|.++.++......+...++....+ ...+.++..|+.++||++++|++||++||++||++|+++++... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~v~~~~al~~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~~~~~~~ 80 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSGGWVSALLGSARSEAGQVVTPASALSLTVLQNCVTLLAESIAQLPVELYERSGDDRKPATDH 80 (419) T ss_pred CCcccccccccCcCCCCcchhhHHhhcccccccCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEecCCCccccccc Confidence 76655433321111111111111111 22345678899999999999999999999999999999986432 Q ss_pred ---hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCccccc Q lcl|NC_018285. 71 ---QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPP 147 (383) Q Consensus 71 ---~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~ 147 (383) +.|+.+||++||+++||+.++.+++++||||++|+|+.+|+|++||||+|++|++..+.++ ..+|++.. T Consensus 81 ~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~-~~~y~~~~------- 152 (419) T protein:vir:80 81 PLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDL-KPMYRVAG------- 152 (419) T ss_pred HHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCc-eEEEEEcC------- Confidence 3467899999999999999999999999999999999999999999999999999887654 45555532 Q ss_pred ceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC----CCHHHHHHHHHHH Q lcl|NC_018285. 148 KQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG----GLLDFKTKVSRSR 223 (383) Q Consensus 148 ~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~----~~~e~~~~~~~~~ 223 (383) ...++.++|+|+++++.++ ++|+||+..+..+|....++++++.++|+||+.|+++|+.++. 
.++++.+++++.| T Consensus 153 ~~~~~~~~i~h~~~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~ 231 (419) T protein:vir:80 153 ADPLPQRLVHHVRWMSING-YTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGW 231 (419) T ss_pred ccccchhheEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHH Confidence 1248899999999887665 7899999999999999999999999999999999999998754 3677888888888 Q ss_pred HHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHHHHHHHH Q lcl|NC_018285. 224 QAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNVYSKAVA 298 (383) Q Consensus 224 ~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~~~~~l~ 298 (383) +.. ..|+|+++|+++|++|++++.++.|+||+|.+++++++||++|||||++||+ .+++++.+++.+.|+.+||. T Consensus 232 ~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~l~ 311 (419) T protein:vir:80 232 NAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLL 311 (419) T ss_pred HHHhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHH Confidence 654 4577999999999999999999999999999999999999999999999986 45677889999999999999 Q ss_pred HHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhC--- Q lcl|NC_018285. 299 RYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGEN--- 368 (383) Q Consensus 299 P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~--- 368 (383) |+++.|+++|+++|++. ++||.....+.|..++++.+++++++|++|+||+|+++|++|++++|...... 
T Consensus 312 P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gGD~~~~~~n~~ 391 (419) T protein:vir:80 312 PWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSPMNMV 391 (419) T ss_pred HHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccc Confidence 99999999999999863 67888888899999999999999999999999999999999998887654321 Q ss_pred --CCCCCCCCCCCCCCC Q lcl|NC_018285. 369 --PNRTILKGGETNGQD 383 (383) Q Consensus 369 --~~~~~~~ggd~~~~d 383 (383) ....+.++|+.++.+ T Consensus 392 ~~~~~~~~~~~~~~~~~ 408 (419) T protein:vir:80 392 DASKPQPIPMGKTEPTK 408 (419) T ss_pred cccccccccCCCCCchh Confidence 111234566666655 No 49 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=100.00 E-value=9.4e-81 Score=459.33 Aligned_cols=382 Identities=13% Similarity=0.103 Sum_probs=309.8 Q ss_pred CchhhhhhcCCcccccccc-cccchhhcccccCCcee-chhhhhccHHHHHHHHHHHHhhhhCceeeecch--------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFF-DITDPEFLATLNGSEWV-SAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--------- 69 (383) ||||+++..++........ .+....+.+....+..+ +...++++|+|++||++||++||++|+++++.. T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~~~~~ 80 (423) T protein:vir:81 1 MGFLQKLGLAPSVVATPEPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGGRERVR 80 (423) T ss_pred CchhHhhccccccccCccccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCceeeec Confidence 9999998766654332221 22222222333323333 344467899999999999999999999998532 Q ss_pred ---hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC--CceeEEEEeccceeEEEEcCCC-ceeEEEEeecCc Q lcl|NC_018285. 
70 ---MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN--GRDMKWEYLRPSQVSFNRLDNQ-NGLYYNVTFDDP 143 (383) Q Consensus 70 ---~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~--g~~~~l~~l~~~~v~~~~~~~~-~~~~y~~~~~~~ 143 (383) ...|+.+||++||+++||+.++.+++++||||++|.|+.. +.+..|+|++++.+++....++ +.+.|++..... T Consensus 81 ~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~~~~~Y~~~~~~~ 160 (423) T protein:vir:81 81 EGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGWGSLDYIIIESGD 160 (423) T ss_pred cchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCCcceEEEEEEecC Confidence 1246779999999999999999999999999999999753 4667888888888887665443 567788776655 Q ss_pred ccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-----CCCHHHHHH Q lcl|NC_018285. 144 RIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-----GGLLDFKTK 218 (383) Q Consensus 144 ~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-----~~~~e~~~~ 218 (383) ..+....++++||||+|.+++++.++|+||+..+..+++...++++++.++|+||+.|+++|+.+. .+++|++++ T Consensus 161 ~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~ 240 (423) T protein:vir:81 161 NDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTR 240 (423) T ss_pred CCceEEEEcccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHH Confidence 667778899999999999999998899999999999999999999999999999999999998764 478999999 Q ss_pred HHHHHHH----hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc--cccCcCHHHHHHHH Q lcl|NC_018285. 219 VSRSRQA----MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSSNV 292 (383) Q Consensus 219 ~~~~~~~----~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~--~~~~~~~~e~~~~~ 292 (383) +++.|+. +.+++|+++||++|++|++++.+++|+||+|.+++++++||++|||||++||. 
.++++|.+++.+.| T Consensus 241 ~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~~~~f 320 (423) T protein:vir:81 241 FMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREFRKAL 320 (423) T ss_pred HHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHH Confidence 9888864 34678999999999999999999999999999999999999999999999996 45677889999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcch---------hhccchhhhccCHHHHHHHHHHHHh-CCCcCHHHHHHHhhcCCcCCcc Q lcl|NC_018285. 293 YSKAVARYLRPFLSELSQKLSCD---------VDADIFPAVDPTGANYISRINSMVK-SGTLAQNQGLYILQQAEILPKE 362 (383) Q Consensus 293 ~~~~l~P~~~~i~~~l~~~l~~~---------~e~~~~~~~~~~~~~~~~~~~~l~~-~g~~t~nE~r~~lg~~~~~~~d 362 (383) +.+||.|+++.|+++|+++|+++ ++||...+++.|.+.+++.+..++. .|++|+||+|+++|++|++++| T Consensus 321 ~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~~gGD 400 (423) T protein:vir:81 321 YGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSIDGGD 400 (423) T ss_pred HHHHHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCCCCcc Confidence 99999999999999999999975 4567777888899999999998874 6999999999999999999888 Q ss_pred hhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 363 LPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 363 ~~~~~~~~~~~~~ggd~~~~d 383 (383) ..... .|..+....+..+++ T Consensus 401 ~~~~p-~n~~~~~~~~~~~~~ 420 (423) T protein:vir:81 401 DLARP-LNTEFGDSEDAPGEE 420 (423) T ss_pred eeecc-cccccCccCCCCCCC Confidence 65432 233332222211111 No 50 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=100.00 E-value=1.7e-80 Score=457.97 Aligned_cols=358 Identities=13% Similarity=0.092 Sum_probs=302.8 Q ss_pred CchhhhhhcCCcc----cc---------------------------cccccccch-hhcc--cccCCceechhhhhccHH Q lcl|NC_018285. 
1 MPIFNLATESPPN----NQ---------------------------GGFFDITDP-EFLA--TLNGSEWVSAETALKNSD 46 (383) Q Consensus 1 Mglf~~~~~~~~~----~~---------------------------~~~~~~~~~-~~~~--~~~~~~~~~~~~a~~~~~ 46 (383) ||||++++.-+.. ++ ..+...+.. .+.. ...++..++.+.|+++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDV 80 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHH Confidence 9999998873110 00 011111100 0111 123567789999999999 Q ss_pred HHHHHHHHHHhhhhCceeeecch------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEE-eecCCCceeEEEEec Q lcl|NC_018285. 47 LFSIISQLSNDLATAKLTTSRKQ------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYR-WRNDNGRDMKWEYLR 119 (383) Q Consensus 47 v~~~i~~ia~~ia~~p~~~~~~~------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i-~r~~~g~~~~l~~l~ 119 (383) |++||++||++||++|+++++.. ...|+.+||++||+++||+.++.+|++ ||+|+++ .|+.+|+|++|+||+ T Consensus 81 v~acV~~Ia~~iA~lpl~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~~~~L~pl~ 159 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMRNGRIIDSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGYPIRFRVVP 159 (409) T ss_pred HHHHHHHHHHhhccCceEEeeCCccccchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEECCCCcEEEEEEEC Confidence 99999999999999999998743 345788999999999999999999987 9999875 588999999999999 Q ss_pred cceeEEEEcCCCceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_018285. 120 PSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNAL 199 (383) Q Consensus 120 ~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~ 199 (383) |+.|++..+.+ +..+|++.. 
...++||||+|++++.+.++|+||+..+..++....++++++.++|+||+ T Consensus 160 p~~v~v~~~~~-g~~~y~~~~---------~~~~~eiiHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga 229 (409) T protein:vir:83 160 PWLVNVELKKG-ARREYRIGG---------LNVTDEILHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGG 229 (409) T ss_pred CcceEEEEcCC-ceEEEEEcc---------ccCccceEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 99999988755 456666532 13358999999988888889999999999999999999999999999999 Q ss_pred CcceeEeecCCCCHHHHHHHHHHHHHhh-cCCcceeecCCCcee-eecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhc Q lcl|NC_018285. 200 NANGILKIKGGGLLDFKTKVSRSRQAMK-QMQGGPLVLDDLEDF-TPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVG 277 (383) Q Consensus 200 ~~~~i~~~~~~~~~e~~~~~~~~~~~~~-~~~g~~~vl~~g~~~-~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg 277 (383) +|++++++++.+++|+++++++.|+... +|+|+++++.+|+++ ++++++++|+||+|++++++++||++|||||++|| T Consensus 230 ~p~gil~~~~~ls~e~~~~~~~~~~~~~~~nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg 309 (409) T protein:vir:83 230 VPLYWLGVERRLSETEAVDLMDRWIESRSKYAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVG 309 (409) T ss_pred CcceEeecCCCCCHHHHHHHHHHHHHhhCCccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcc Confidence 9999999999999999999999986543 588999999999987 57899999999999999999999999999999998 Q ss_pred cc-----ccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch---hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHH Q lcl|NC_018285. 278 GQ-----GDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCD---VDADIFPAVDPTGANYISRINSMVKSGTLAQNQG 349 (383) Q Consensus 278 ~~-----~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~---~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~ 349 (383) .. .+++|.+++..+|+++||.|++++|+++|+++|++. 
++|+...+++.|..++++.++.++++|++|+||+ T Consensus 310 ~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~lT~NE~ 389 (409) T protein:vir:83 310 LPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRWALPSPQHLELNRDDYTRPSLVERATAYKIMIEAGVMEPNEA 389 (409) T ss_pred CCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEeehhhhhccCHHHHHHHHHHHHhCCCcCHHHH Confidence 52 346788999999999999999999999999999865 7788888899999999999999999999999999 Q ss_pred HHHhhcCCcCCcchhHHhCCCCCCCCCCCC Q lcl|NC_018285. 350 LYILQQAEILPKELPKGENPNRTILKGGET 379 (383) Q Consensus 350 r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~ 379 (383) |+++|++|++++| .++||++ T Consensus 390 R~~~glpp~~ggd----------~l~~~gv 409 (409) T protein:vir:83 390 RAMERLHSEAAAV----------RLSGGGV 409 (409) T ss_pred HHHhCCCCCCCCc----------ccCCCCC Confidence 9999999987665 2346666 No 51 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=100.00 E-value=4.8e-80 Score=455.42 Aligned_cols=352 Identities=27% Similarity=0.442 Sum_probs=300.7 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhhccCCCcc Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVDNPSNS 80 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~PN~~ 80 (383) ||||+.++++ ....+. 
..+......+.+.++..|+.++||++++|++||++||++||++|++ ..+..+.|+.+||++ T Consensus 1 M~~~~~f~~r-~~~~~~-~~~~~~~~~~~~~~~~~v~~~~al~~~av~~cv~~ia~~ia~~p~~-~~~~~~~L~~~PN~~ 77 (359) T protein:vir:10 1 MSILNPFERR-SSITPN-NYYPFMVQNGSIVPNSLVDATEALKNSDLYAVTSLISSDIAGTRFI-GNQVFTSVLNNPSHL 77 (359) T ss_pred Ccccchhhcc-ccCCCC-cchhhhhccccccCCcccCHHHhhcchHHHHHHHHHHHhhhcCccc-cchHHHHHhhccccc Confidence 9999876543 222211 1111111134455678899999999999999999999999999995 456778899999999 Q ss_pred CCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecccceEEec Q lcl|NC_018285. 81 ANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDILHFR 160 (383) Q Consensus 81 ~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~ 160 (383) ||+++||+.++.+++++||||++|+|+.+|+|.+|+|++|++|++...++ ..+|.+.... ++..+.++++|||||| T Consensus 78 ~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~--~~~y~~~~~~--~~~~~~~~~~evih~~ 153 (359) T protein:vir:10 78 TNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDD--TLTYEVNQFD--DYPSAKYNASEMIHVK 153 (359) T ss_pred CCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCC--eEEEEEEecC--CceEEEEcccceEEec Confidence 99999999999999999999999999999999999999999999877543 4667665432 3456789999999999 Q ss_pred cCCC----CccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHHHHHHHHHHHHHhh--cCCcce Q lcl|NC_018285. 161 LLSV----DGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTKVSRSRQAMK--QMQGGP 233 (383) Q Consensus 161 ~~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e~~~~~~~~~~~~~--~~~g~~ 233 (383) .++. .++++|+||+.++..++....+++++..++|+||++|+++++.++ .+++++++++++.|+... 
.|+|++ T Consensus 154 ~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~n~g~~ 233 (359) T protein:vir:10 154 IMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFEKANGGNNSGRV 233 (359) T ss_pred cCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCc Confidence 8753 356789999999999999999999999999999999999999975 689999999999886544 678999 Q ss_pred eecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018285. 234 LVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLS 313 (383) Q Consensus 234 ~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~ 313 (383) +||++|++|++++.++.|+||+|.+++++++||++|||||++||+.+++.++.++.++++..+|.|.+.+|+++|+.+|. T Consensus 234 ~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~p~~~~l~~~l~ 313 (359) T protein:vir:10 234 MVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIEPLISELRIKCD 313 (359) T ss_pred eecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999876655566666777788888888888888888888 Q ss_pred chhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcC Q lcl|NC_018285. 
314 CDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEIL 359 (383) Q Consensus 314 ~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~ 359 (383) ..++++....++.+...+...+.+++++|++|+||+|+++|++|++ T Consensus 314 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 314 SSIGVDMSPITDYSNSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred hhhcccchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 8888887777777888888889999999999999999999999998 No 52 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=100.00 E-value=1.9e-79 Score=452.13 Aligned_cols=371 Identities=24% Similarity=0.414 Sum_probs=308.9 Q ss_pred CchhhhhhcCCcccccccccccchhhc---ccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhhccCC Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFL---ATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVDNP 77 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~P 77 (383) ||||+++...+.......... ++.++ ........++.++|+++++|++||++||++||++|++++++..+.|+.+| T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v~~~~~~~ll~~P 79 (385) T protein:vir:10 1 MGLLTPRNFNKRKAKNMVYPS-NPAFFTTTVGGMQLSYVSALSALQNTNVYSVINRIASDVASAHFKTENTATLNRLESP 79 (385) T ss_pred Cccccchhccccccccccccc-chhhhhhhccccCccccCHHHhhccHHHHHHHHHHHHHHhhCceeeeccchhhhhhcC Confidence 999997643332222222111 11121 12223567899999999999999999999999999999999999999999 Q ss_pred CccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecccceE Q lcl|NC_018285. 78 SNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDIL 157 (383) Q Consensus 78 N~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvi 157 (383) |++||+++||++++.+++++||||++++|+ +.+++|+++.+|++..+ ....+|.+.... 
++..+.|+++||| T Consensus 80 N~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~----~~~~~p~~~~~v~~~~~--~~~~~~~~~~~~--~~~~~~~~~~eii 151 (385) T protein:vir:10 80 SSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPG--NMGIVYTVLESN--DRPQMVLRQDQML 151 (385) T ss_pred CCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCCceEEEEEc--CCceEEEEEEcC--CceEEEEccccEE Confidence 999999999999999999999999999875 46788888888876654 344556655443 3456789999999 Q ss_pred EeccCCCC--ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCC-CHHHHHHHHHHHHHhh--cCCcc Q lcl|NC_018285. 158 HFRLLSVD--GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGG-LLDFKTKVSRSRQAMK--QMQGG 232 (383) Q Consensus 158 h~~~~~~~--~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~-~~e~~~~~~~~~~~~~--~~~g~ 232 (383) |||+++++ +..+|+||+..+..++....++++++.++|+||++|++++++++.+ ++++++++++.|+... .++|+ T Consensus 152 hik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~ 231 (385) T protein:vir:10 152 HFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGR 231 (385) T ss_pred EeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCC Confidence 99987654 3678999999999999999999999999999999999999999876 4678888888886544 46789 Q ss_pred eeecCCCceeeecccChhhHHHH-HHHHHHHHHHHHHhcCCHHHhccc----ccCcCHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 233 PLVLDDLEDFTPLEIKSNVAQLL-KQADWTTGQFAKVYGIPENVVGGQ----GDQQSSLEMSSNVYSKAVARYLRPFLSE 307 (383) Q Consensus 233 ~~vl~~g~~~~~~~~~~~d~~~~-e~~~~~~~~Ia~~~gVpp~~lg~~----~~~~~~~e~~~~~~~~~l~P~~~~i~~~ 307 (383) ++|+++|++|++++.++.|+|++ |.+++++++||++|||||++||+. +++++. 
|+...++..||.|+++.|+++ T Consensus 232 ~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~-eq~~~~~~~~l~P~~~~ie~~ 310 (385) T protein:vir:10 232 LMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNI-DQIKATYLANLNSYVNPIVDE 310 (385) T ss_pred ccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccH-HHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999974 999999999999999999999863 223444 566777778999999999999 Q ss_pred HHHhhcc-hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCC Q lcl|NC_018285. 308 LSQKLSC-DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETNGQ 382 (383) Q Consensus 308 l~~~l~~-~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~ 382 (383) |+++|++ +++|+....++.|...+++.+++++++|++|+||+|+++|++|+++++...... +...++|||++++ T Consensus 311 l~~~l~~~~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~~~~~-~~~~~~~g~~~dn 385 (385) T protein:vir:10 311 LRLKMNAPDLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKP-LTTQVKGGDEGDN 385 (385) T ss_pred HHHhhCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCccccC-cccccCCCCCCCC Confidence 9999985 689999999999999999999999999999999999999999998877655543 3346789999888 No 53 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=100.00 E-value=8.5e-79 Score=448.59 Aligned_cols=381 Identities=12% Similarity=0.074 Sum_probs=304.4 Q ss_pred CchhhhhhcCCccc-c------------------cccccccchhhccc---------ccCCceechhhhhccHHHHHHHH Q lcl|NC_018285. 1 MPIFNLATESPPNN-Q------------------GGFFDITDPEFLAT---------LNGSEWVSAETALKNSDLFSIIS 52 (383) Q Consensus 1 Mglf~~~~~~~~~~-~------------------~~~~~~~~~~~~~~---------~~~~~~~~~~~a~~~~~v~~~i~ 52 (383) ||||+++.+..... + +...+.+++..... 
..++..|+.++|+++++|++||+ T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~~~v~~~i~ 80 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLATQAYQANGPVFACML 80 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccchhhhhccHHHHHHHH Confidence 99999998764321 0 01111222222111 12466789999999999999999 Q ss_pred HHHHhhhhCceeeecch-----------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC--------Ccee Q lcl|NC_018285. 53 QLSNDLATAKLTTSRKQ-----------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN--------GRDM 113 (383) Q Consensus 53 ~ia~~ia~~p~~~~~~~-----------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~--------g~~~ 113 (383) +||++||++|+++++.. ...|+.+||++||+++||+.++.+++++||||++|+|+.. |.++ T Consensus 81 ~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g~~~ 160 (466) T protein:vir:81 81 VRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVDVVV 160 (466) T ss_pred HHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCccee Confidence 99999999999998643 3458899999999999999999999999999999999765 4589 Q ss_pred EEEEeccceeEEEEcCCCc-eeEEEEeecCc-ccccceeecccceEEeccC-CCCccccCcchHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 114 KWEYLRPSQVSFNRLDNQN-GLYYNVTFDDP-RIPPKQHVPQSDILHFRLL-SVDGGLTSVSPLMALGRELDIQKASDKL 190 (383) Q Consensus 114 ~l~~l~~~~v~~~~~~~~~-~~~y~~~~~~~-~~~~~~~~~~~dvih~~~~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 190 (383) +|+|++|++|++....++. ...|.++.... 
.....+.++++||||||.+ ++.++++|+||+..+..+|....+++++ T Consensus 161 ~l~~l~~~~v~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~dviHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~ 240 (466) T protein:vir:81 161 EERMVRGGRGELGGGQLGWRKVGYLYTEGGRQSGNESVGFLAEDVVHFAPIPDPLASYRGMSWLTPILREIRADQAMSKH 240 (466) T ss_pred EEEEecCcceEEEEcCCCceEEEEEEEecCcccccceeeeccccEEEEcCCCCcccccccccHHHHHHHHHHHHHHHHHH Confidence 9999999999999877664 34566654432 2335578999999999965 4567789999999999999999999999 Q ss_pred HHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHH Q lcl|NC_018285. 191 TLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAK 267 (383) Q Consensus 191 ~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~ 267 (383) +.++|+||+.|+++++.++.+++|+++++++.|... .+|+|+++||++|++|++++.+++|+||+|++++++++||+ T Consensus 241 ~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~ 320 (466) T protein:vir:81 241 QAKFFDNGATVNLVIKHNPMADPAAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRIAA 320 (466) T ss_pred HHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999988654 45789999999999999999999999999999999999999 Q ss_pred HhcCCHHHhcc-----cccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------hhhccchhhhccCHHHHHHH- Q lcl|NC_018285. 268 VYGIPENVVGG-----QGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSC-------DVDADIFPAVDPTGANYISR- 334 (383) Q Consensus 268 ~~gVpp~~lg~-----~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~-------~~e~~~~~~~~~~~~~~~~~- 334 (383) +|||||++||. .+++++.+++.+.|+++||.|++++|+++|+++|++ +++|+...+++.|...+.+. 
T Consensus 321 ~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~llr~d~~~r~~~~ 400 (466) T protein:vir:81 321 AAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPFLREDEKDAADIQ 400 (466) T ss_pred HhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCcceEEEecchhhhccCHHHHHHHH Confidence 99999999984 356778899999999999999999999999999986 35677778888887665433 Q ss_pred ------HHHHHhCCCcCHHHHHHHhhcCC---cCCcchhHH----------hCCCCCCCCCCCCCCC Q lcl|NC_018285. 335 ------INSMVKSGTLAQNQGLYILQQAE---ILPKELPKG----------ENPNRTILKGGETNGQ 382 (383) Q Consensus 335 ------~~~l~~~g~~t~nE~r~~lg~~~---~~~~d~~~~----------~~~~~~~~~ggd~~~~ 382 (383) +..++++|+ |+||+|+.+.... +.+.++... ...+....+|||+|++ T Consensus 401 ~~~~~~~~~~~~~g~-t~nE~r~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 401 KVRAETINTLITAGY-EPESVVAAVNSGDLRLLKHTGLTSVQLLPPGVSASASSDTPTSGGADDNGN 466 (466) T ss_pred HHHHHHHHHHHHcCC-ChhhccccccCCccccccCCCcchhhhcccccccccCCCCcccCCCCcCCC Confidence 667888895 9999998653211 111111100 1122223467877777 No 54 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=100.00 E-value=7.5e-79 Score=448.89 Aligned_cols=365 Identities=14% Similarity=0.121 Sum_probs=294.4 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch---------hh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ---------MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~---------~~ 71 (383) |. .-+...+++.. |. 
......++.+.+|++++|++||++||++||++|+++++.+ .+ T Consensus 1 ~~-------~~~~~~g~~~~-----~~--~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~~~~l~~ 66 (723) T protein:vir:94 1 MT-------TFPSGAGGWNA-----WS--ADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDGELDELHPLSQ 66 (723) T ss_pred Cc-------ccccCCCcccc-----cc--ccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccchhhHHHH Confidence 11 11111111111 11 1134456788899999999999999999999999998754 23 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecC---CCceeEEEEeccceeEEEEcCCCc------eeEEEEeecC Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRND---NGRDMKWEYLRPSQVSFNRLDNQN------GLYYNVTFDD 142 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~---~g~~~~l~~l~~~~v~~~~~~~~~------~~~y~~~~~~ 142 (383) .|+.+||++||+++||+.++.+|+++||+|++++|++ .|.|.+|+|++++.+.+....+.. ...|.+...+ T Consensus 67 lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~~~~~~ 146 (723) T protein:vir:94 67 LWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYVIERTD 146 (723) T ss_pred HHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEEEEecC Confidence 4667999999999999999999999999999999754 589999999999888776554432 3344444333 Q ss_pred cccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHH Q lcl|NC_018285. 143 PRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRS 222 (383) Q Consensus 143 ~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~ 222 (383) +....++++||||||.+++.++++|+||+..+..+|....++++++.++|+||++|++||+.+ .+++++++++++. 
T Consensus 147 ---G~~~~~~~~dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~-~l~~e~~~~~~~~ 222 (723) T protein:vir:94 147 ---GVRVPVLADEMLWLRFSDPYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLG-DMDEQTFTKTVAA 222 (723) T ss_pred ---ceeEEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-CCCHHHHHHHHHH Confidence 456789999999999998888899999999999999999999999999999999999999986 5899999999998 Q ss_pred HHH---hhcCCcceeecC----------CCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHH Q lcl|NC_018285. 223 RQA---MKQMQGGPLVLD----------DLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMS 289 (383) Q Consensus 223 ~~~---~~~~~g~~~vl~----------~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~ 289 (383) |.. +.+|+|+++||+ .|++|++++.+++|+||+|++++++++||++|||||++|++.++++|.+++. T Consensus 223 ~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st~sN~e~~~ 302 (723) T protein:vir:94 223 FRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGSTYENQAEAK 302 (723) T ss_pred HHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCCcccHHHHH Confidence 854 457889999885 5899999999999999999999999999999999999999888889999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchh------hccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcch Q lcl|NC_018285. 290 SNVYSKAVARYLRPFLSELSQKLSCDV------DADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKEL 363 (383) Q Consensus 290 ~~~~~~~l~P~~~~i~~~l~~~l~~~~------e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~ 363 (383) +.|+.+||.|+++.|+++||++|++++ +||....++.|...++..+..++++|++|+||+|+++|++|++++|. 
T Consensus 303 ~~f~~~tL~P~~~~ie~~ln~~Ll~~~g~~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~ 382 (723) T protein:vir:94 303 AAVWTETLIPQMEVMASITDLQLLPDIGWTVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIG 382 (723) T ss_pred HHHHHHHHHHHHHHHHHHHhHhhcccccCceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcc Confidence 999999999999999999999999764 34555567889999999999999999999999999999999988772 Q ss_pred hHHhC---CCCCCCCCCCCCCCC Q lcl|NC_018285. 364 PKGEN---PNRTILKGGETNGQD 383 (383) Q Consensus 364 ~~~~~---~~~~~~~ggd~~~~d 383 (383) ...-. .+..|.+......+| T Consensus 383 ~~~~~p~~~~~a~~~~~~p~~~e 405 (723) T protein:vir:94 383 QMTLTPYRAQFAPAPAPAPAVEE 405 (723) T ss_pred cceeccccccccCCCCCCccchh Confidence 21101 011111111111111 No 55 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=100.00 E-value=1.1e-78 Score=448.04 Aligned_cols=369 Identities=24% Similarity=0.398 Sum_probs=307.1 Q ss_pred CchhhhhhcCCcccccccccccchhhcc---cccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhhccCC Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLA---TLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVDNP 77 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~P 77 (383) ||||++..-.+...+.. ....++.++. 
...++.+++.++|+++++|++||++||+++|++|++++++....|+.+| T Consensus 1 Mg~~~~~~~~k~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ll~~P 79 (383) T protein:vir:10 1 MGLLTPKNFSKRNAKNM-VYPSNPAFFTTTVGGMQLSYVSALSALQNTNVYSVINRIASDVSSAHFKTENTATLNRLESP 79 (383) T ss_pred CCccccccccccccccc-ccccchhhhhhhccCccccccchhHhhcchHHHHHHHHHHHhhccCceeecccchhhhhhCC Confidence 99999753222222211 1111222221 2234667899999999999999999999999999999999999999999 Q ss_pred CccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecccceE Q lcl|NC_018285. 78 SNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDIL 157 (383) Q Consensus 78 N~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvi 157 (383) |++||+++||+.++.+++++||||++++|+ +.+++|+++.+|++..+. ...+|.+.... .+..+.|+++||| T Consensus 80 N~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~----~~~~~p~~~~~v~~~~~~--~~~~~~~~~~~--~~~~~~~~~~evi 151 (383) T protein:vir:10 80 SSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGN--MGIVYTVLESN--DRPKMVLRQDQML 151 (383) T ss_pred CCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCcceEEEEEcC--CceEEEEEEcC--CceEEEEcccceE Confidence 999999999999999999999999999875 466778888777766543 34556555443 3456789999999 Q ss_pred EeccCCCCc--cccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCC-CHHHHHHHHHHHHHhh--cCCcc Q lcl|NC_018285. 158 HFRLLSVDG--GLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGG-LLDFKTKVSRSRQAMK--QMQGG 232 (383) Q Consensus 158 h~~~~~~~~--~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~-~~e~~~~~~~~~~~~~--~~~g~ 232 (383) |+|++++++ ..+|+||+.++...+....++++++.++|+||++|++++++++.+ ++++++++++.|+... 
.|+|+ T Consensus 152 h~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~ 231 (383) T protein:vir:10 152 HFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGR 231 (383) T ss_pred EeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCC Confidence 999876543 568999999999999999999999999999999999999999877 5778888888886543 57899 Q ss_pred eeecCCCceeeecccChhhHHH-HHHHHHHHHHHHHHhcCCHHHhccc----ccCcCHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 233 PLVLDDLEDFTPLEIKSNVAQL-LKQADWTTGQFAKVYGIPENVVGGQ----GDQQSSLEMSSNVYSKAVARYLRPFLSE 307 (383) Q Consensus 233 ~~vl~~g~~~~~~~~~~~d~~~-~e~~~~~~~~Ia~~~gVpp~~lg~~----~~~~~~~e~~~~~~~~~l~P~~~~i~~~ 307 (383) ++|+++|++|++++.++.|+|+ .|++++++++||++|||||++||+. .++++.+ +...++..||.|+++.|+++ T Consensus 232 ~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~e-q~~~~~~~~l~P~~~~ie~~ 310 (383) T protein:vir:10 232 LMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNID-QIKATYLANLNSYVNPIVDE 310 (383) T ss_pred ccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHH-HHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999997 5999999999999999999999863 2334444 55556678999999999999 Q ss_pred HHHhhcc-hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCC Q lcl|NC_018285. 308 LSQKLSC-DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETN 380 (383) Q Consensus 308 l~~~l~~-~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~ 380 (383) |+++|+. +++|+....++.|...+++.+++++++|++|+||+|+++|++|++++|.+.... 
+..+.+|||++ T Consensus 311 l~~~l~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~-~~~~~~gGd~e 383 (383) T protein:vir:10 311 LRLKMNAPDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKP-LTNETKGGDDK 383 (383) T ss_pred HHHhhCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCcccccCC-CcccCCCCCCC Confidence 9999974 689999999999999999999999999999999999999999999988877654 45688999999 No 56 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=100.00 E-value=1.6e-78 Score=447.05 Aligned_cols=381 Identities=12% Similarity=0.079 Sum_probs=298.2 Q ss_pred CchhhhhhcCCcccc----cccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchh------ Q lcl|NC_018285. 1 MPIFNLATESPPNNQ----GGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQM------ 70 (383) Q Consensus 1 Mglf~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~------ 70 (383) =.++.++.++..... ..|..+..+.+.+...++..++...|+++|+|++||++||+++|++|+++++... T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g~~~~ 81 (460) T protein:vir:10 2 ANRIIRALRELTGLDNKFNDAFIKYIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKDTKAYQQ 81 (460) T ss_pred chhHHHHHhhhhccCCCchHHHHHhhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccCCccchh Confidence 345555544322211 1122222223333334566788999999999999999999999999999986432 Q ss_pred ---------------------------------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecC----CCcee Q lcl|NC_018285. 71 ---------------------------------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRND----NGRDM 113 (383) Q Consensus 71 ---------------------------------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~----~g~~~ 113 (383) ..|+.+||++||+++||+.++.+++++||||++|+|+. .|+|. 
T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~~G~~~ 161 (460) T protein:vir:10 82 LNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGINAGVPS 161 (460) T ss_pred hhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCccCceeE Confidence 23788999999999999999999999999999999964 47899 Q ss_pred EEEEeccceeEEEEcCCCceeEEEEee---cCcccccceeecccceEEeccCCCC-----ccccCcchHHHHHHHHHHHH Q lcl|NC_018285. 114 KWEYLRPSQVSFNRLDNQNGLYYNVTF---DDPRIPPKQHVPQSDILHFRLLSVD-----GGLTSVSPLMALGRELDIQK 185 (383) Q Consensus 114 ~l~~l~~~~v~~~~~~~~~~~~y~~~~---~~~~~~~~~~~~~~dvih~~~~~~~-----~~~~G~s~~~~~~~~i~~~~ 185 (383) +||||+|++|++..++++....|.+.. ....++..+.++++||||||+++++ ++++|+||+..+..+|.... T Consensus 162 ~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~ 241 (460) T protein:vir:10 162 QMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQN 241 (460) T ss_pred EEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEecCCCCcccccCccccccHHHHHHHHHHHHH Confidence 999999999999998887666554321 1122355688999999999987765 35789999999999999999 Q ss_pred HHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHH Q lcl|NC_018285. 186 ASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTT 262 (383) Q Consensus 186 ~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~ 262 (383) ++++++.++|+||+.|+++++.++.+++++++++++.|... 
.+|+|+++++++|++|++++.++.|+||+|.+++++ T Consensus 242 ~~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~ 321 (460) T protein:vir:10 242 STIDNNVKTMQNGGVFGFIHGGSTGLTQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYDQ 321 (460) T ss_pred HHHHHHHHHHhcCCCcceeeecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHHH Confidence 99999999999999999999999999999999999998754 457899999999999999999999999999999999 Q ss_pred HHHHHHhcCCHHHhccc----ccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccchhh--hccCH----HHHH Q lcl|NC_018285. 263 GQFAKVYGIPENVVGGQ----GDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCDVDADIFPA--VDPTG----ANYI 332 (383) Q Consensus 263 ~~Ia~~~gVpp~~lg~~----~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~e~~~~~~--~~~~~----~~~~ 332 (383) ++||++|||||++||.. .++++.+++.+.|+.+||.|++++|+++||++|+++.+.+.... ++.+. .... T Consensus 322 ~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~d~ 401 (460) T protein:vir:10 322 KAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQTDM 401 (460) T ss_pred HHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHHHH Confidence 99999999999999853 34678899999999999999999999999999998654322111 11111 1122 Q ss_pred HHHHHHHhCCCcCHHHHHHHhhcCCcC--CcchhHHhCCCCCCCC--------CCCCCCC Q lcl|NC_018285. 333 SRINSMVKSGTLAQNQGLYILQQAEIL--PKELPKGENPNRTILK--------GGETNGQ 382 (383) Q Consensus 333 ~~~~~l~~~g~~t~nE~r~~lg~~~~~--~~d~~~~~~~~~~~~~--------ggd~~~~ 382 (383) .....++++|++|+||+|+++|++|++ ++|..... 
.|..+++ ++++..| T Consensus 402 ~~~~~~~~~g~~T~NE~R~~~g~~pi~~~~gD~~~~~-~n~~~~~~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 402 VAMASWLNTIPVTPNEIRIAMKYETLNQDGMDIVFMP-SNKVRIDDVSNNLIDSAFNQNQ 460 (460) T ss_pred HHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCCeeeec-ccccchhhcccccCCCcccCCC Confidence 334568899999999999999999985 35644332 3444433 2222222 No 57 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=100.00 E-value=5.8e-77 Score=438.55 Aligned_cols=369 Identities=14% Similarity=0.110 Sum_probs=303.8 Q ss_pred CchhhhhhcCCcccccccccccchhhc-ccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch---------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFL-ATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ---------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~---------- 69 (383) ||||+++++................+. +.......++...|+++++|++||++||++||++|+++++.. T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~ 80 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKIRADTGYVGLFMSGEDVSFLVPGYVRLSDNPEVRMAVHKIADLISSMTIYLMQNTEDGDIRIRNE 80 (406) T ss_pred CcchhhhccccccccccccchhhhhhccCcccCccccCHHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeecch Confidence 999999865433222111111111111 222344556788999999999999999999999999998543 Q ss_pred -hhhhccCCCccCCHHHHHHHHHHHHHHcCCe--EEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccc Q lcl|NC_018285. 
70 -MQGIVDNPSNSANRFNFYQSIFAQMLLGGEA--FAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIP 146 (383) Q Consensus 70 -~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a--~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~ 146 (383) .+.|+.+||++||+++||++++.+++++|++ |+++.|+.+|++++||||+|++|++..+.++ |++.++ T Consensus 81 ~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~~~~----~~~~~~----- 151 (406) T protein:vir:95 81 LSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDTPDG----YQVLYG----- 151 (406) T ss_pred HHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEcCCe----EEEEec----- Confidence 3458899999999999999999999999765 6667899999999999999999999887664 344332 Q ss_pred cceeecccceEEecc-CCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHH Q lcl|NC_018285. 147 PKQHVPQSDILHFRL-LSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQA 225 (383) Q Consensus 147 ~~~~~~~~dvih~~~-~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~ 225 (383) .+.|+++||||+++ +++.+.++|.||+..+..++....++.+++.++|+||+.|+++++.++.+++++++++++.|.. T Consensus 152 -~~~~~~~evih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~ 230 (406) T protein:vir:95 152 -GQTFNYDEVLHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFK 230 (406) T ss_pred -cEEEchhHEEEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHH Confidence 24699999999995 4556678899999999999999999999999999999999999999999999999999988865 Q ss_pred h---hcCCcceeecC-CCceeeec-ccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHH Q lcl|NC_018285. 226 M---KQMQGGPLVLD-DLEDFTPL-EIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARY 300 (383) Q Consensus 226 ~---~~~~g~~~vl~-~g~~~~~~-~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~ 300 (383) . ..++|+++|++ +|.+++++ +.+++|+||+|.+++++++||++|||||++||.. 
.+++++..+|+++||.|+ T Consensus 231 ~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~---~~~~~~~~~~~~~~l~P~ 307 (406) T protein:vir:95 231 KYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIG---EFNRDEYNNFINSTILPI 307 (406) T ss_pred HhccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC---CchHHHHHHHHHHHHHHH Confidence 3 45778888775 45677776 4699999999999999999999999999999853 345778889999999999 Q ss_pred HHHHHHHHHHhhcch----hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCC----- Q lcl|NC_018285. 301 LRPFLSELSQKLSCD----VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNR----- 371 (383) Q Consensus 301 ~~~i~~~l~~~l~~~----~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~----- 371 (383) +++|+++|+++|+++ ++||+....+.|..++++.+..++++|++|+||+|+++|++|++++|..... .+. T Consensus 308 ~~~ie~~l~~~l~~~~~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~gd~~~~~-~n~~~~~~ 386 (406) T protein:vir:95 308 AKGIEQELTRKLLISPDLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEGLSELVIL-ENYIPLDK 386 (406) T ss_pred HHHHHHHHHHhcCCCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeec-cCccchhh Confidence 999999999999875 6778888888999999999999999999999999999999999887754432 222 Q ss_pred ----CCCCCCCCCCCC Q lcl|NC_018285. 372 ----TILKGGETNGQD 383 (383) Q Consensus 372 ----~~~~ggd~~~~d 383 (383) ...+|||+++++ T Consensus 387 ~~~~~~~k~g~~~~~~ 402 (406) T protein:vir:95 387 IGDQSKLKGGDNSGAD 402 (406) T ss_pred cccccccCCCCCCCCC Confidence 234799998888 No 58 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=100.00 E-value=2.3e-76 Score=435.28 Aligned_cols=366 Identities=14% Similarity=0.131 Sum_probs=293.1 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechh-hhhccHHHHHHHHHHHHhhhhCceeeecch---------- Q lcl|NC_018285. 
1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAE-TALKNSDLFSIISQLSNDLATAKLTTSRKQ---------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~---------- 69 (383) ||||++++++..........+ ...+.......++.. .+.++|+|++||++||++||++|+++++.. T Consensus 1 Mg~~~~f~~k~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~~~~~~ 77 (403) T protein:vir:80 1 MGLFNFFRRKTRSEPTNAISW---FLTQEAYDTLAIPGYTRLSDNPEVRMAVHKIAELISSMTIHLMQNTDNGDIRIKNE 77 (403) T ss_pred Ccccccccccccccccchhhh---hcccccccccccchhhhhhhhHHHHHHHHHHHHhhhhCceEEEEecCCceeecCCh Confidence 999997765432211111111 001111112222332 345789999999999999999999998643 Q ss_pred -hhhhccCCCccCCHHHHHHHHHHHHHHc--CCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccc Q lcl|NC_018285. 70 -MQGIVDNPSNSANRFNFYQSIFAQMLLG--GEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIP 146 (383) Q Consensus 70 -~~~l~~~PN~~~t~~~f~~~~~~~~~l~--G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~ 146 (383) .+.|+.+||++||+++||+.++.++++. ||||+++.|+..|++.+||||+|+.|++..+.++..++|. T Consensus 78 ~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g~~~~y~--------- 148 (403) T protein:vir:80 78 LSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTGYQIWYQ--------- 148 (403) T ss_pred HHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCceEEEEe--------- Confidence 2457789999999999999999999984 7899999999999999999999999999888776544432 Q ss_pred cceeecccceEEecc-CCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHH Q lcl|NC_018285. 147 PKQHVPQSDILHFRL-LSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQA 225 (383) Q Consensus 147 ~~~~~~~~dvih~~~-~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~ 225 (383) .+.++++||||++. +++.++++|+||+..+..++....++++++.++|+||+.|+++++.++.+++++.++.++.|.. 
T Consensus 149 -~~~~~~~eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~ 227 (403) T protein:vir:80 149 -GKAYNYDEVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFK 227 (403) T ss_pred -ecccchhhEEEEeccCCCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHHHH Confidence 13588999999994 5677778999999999999999999999999999999999999999999998888888777643 Q ss_pred ---hhcCCcceeecCCC-ceeeecc-cChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHH Q lcl|NC_018285. 226 ---MKQMQGGPLVLDDL-EDFTPLE-IKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARY 300 (383) Q Consensus 226 ---~~~~~g~~~vl~~g-~~~~~~~-~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~ 300 (383) +..++|++++++.+ .+++++. ++++|+|++|.+++++.+||++|||||++||.. ...+++..+|+.+||.|+ T Consensus 228 ~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~---~~~~~~~~~f~~~~l~P~ 304 (403) T protein:vir:80 228 KYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVG---KYDKDEYNNFINSTILPI 304 (403) T ss_pred HHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCC---CccHHHHHHHHHHHHHHH Confidence 44678888888655 4555544 588999999999999999999999999999853 223455678999999999 Q ss_pred HHHHHHHHHHhhcch----hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCC---- Q lcl|NC_018285. 301 LRPFLSELSQKLSCD----VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRT---- 372 (383) Q Consensus 301 ~~~i~~~l~~~l~~~----~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~---- 372 (383) +++|+++|+++|+++ ++||....++.|..++++.+.+++++|++|+||+|+++|++|++++|.... ..+.. 
T Consensus 305 ~~~ie~~l~~kll~~~~~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd~~~~-~~n~~pl~~ 383 (403) T protein:vir:80 305 AKGIEQELTRKLLISPDLYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLSELVI-LENYIPLDK 383 (403) T ss_pred HHHHHHHHHHhccCCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEee-cccccchhh Confidence 999999999999875 678888888999999999999999999999999999999999988774332 23333 Q ss_pred -----CCCCCCCCCCC Q lcl|NC_018285. 373 -----ILKGGETNGQD 383 (383) Q Consensus 373 -----~~~ggd~~~~d 383 (383) ..+|||.+++| T Consensus 384 ~~~~~~~k~ge~~~~~ 399 (403) T protein:vir:80 384 IGDQNKLKGGEKGGAD 399 (403) T ss_pred ccchhhccCCCCCCCC Confidence 24788888777 No 59 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=100.00 E-value=7.4e-76 Score=432.49 Aligned_cols=367 Identities=14% Similarity=0.122 Sum_probs=293.5 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch----------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ----------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~----------- 69 (383) |||++|+..+.........+. .............+.+.++++++|++||++||++||++|+++++.. T Consensus 1 mg~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~~~~~~~~~~ 78 (403) T protein:vir:10 1 MGFKSWITEKLNPGQRIIRDM--EPVSHRTNRKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGDKYNIVTYANGVKT 78 (403) T ss_pred Ccchhhhhhccchhhhhhhcc--cccccccCCcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEeeccccccccccccc Confidence 999999975432111111111 0111111122224668889999999999999999999999987431 Q ss_pred ---hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccc Q lcl|NC_018285. 
70 ---MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIP 146 (383) Q Consensus 70 ---~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~ 146 (383) .+.|+.+||++||+++||+.++.+++++||||+++.+ .++++++++.|++..+.++. .|.+...+ T Consensus 79 ~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~~------~~l~~l~~~~~~v~~~~~~~--~~~~~~~~---- 146 (403) T protein:vir:10 79 KTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWDG------TSLYHVPAALMQVEADANKF--IKKFIFNN---- 146 (403) T ss_pred chHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEeC------ceeEeecCcceEEEEcCCce--EEEEEecC---- Confidence 2457889999999999999999999999999988753 25899999999987765433 23332222 Q ss_pred cceeecccceEEeccCCC----CccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHH Q lcl|NC_018285. 147 PKQHVPQSDILHFRLLSV----DGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRS 222 (383) Q Consensus 147 ~~~~~~~~dvih~~~~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~ 222 (383) ...++.+||+|++.+++ .++++|+||+.++..+++...++++++.++|+||++|+++++.++.+++++++++++. T Consensus 147 -~~~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~ 225 (403) T protein:vir:10 147 -QINYRVDEIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDEILNKKLRERKQEE 225 (403) T ss_pred -ceeecccceEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHH Confidence 34588899999996543 3568899999999999999999999999999999999999999999999999999998 Q ss_pred HHH---hhcCCcceeecCCCceeeeccc--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHH Q lcl|NC_018285. 223 RQA---MKQMQGGPLVLDDLEDFTPLEI--KSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAV 297 (383) Q Consensus 223 ~~~---~~~~~g~~~vl~~g~~~~~~~~--~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l 297 (383) |.. +.+|+|+++||++|++|++++. 
++.|+||+|.+++++++||++|||||++||+ ++++|.+++.+.|+.+|| T Consensus 226 ~~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~-~~~sn~e~~~~~f~~~tl 304 (403) T protein:vir:10 226 LQLDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDG-GNNANIRPNIELFYYMTI 304 (403) T ss_pred HHHHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCC-CCCcCHHHHHHHHHHHHH Confidence 865 4567899999999999999975 6789999999999999999999999999985 467889999999999999 Q ss_pred HHHHHHHHHHHHHhhcchhhccchhh--hccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHh---C--CC Q lcl|NC_018285. 298 ARYLRPFLSELSQKLSCDVDADIFPA--VDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGE---N--PN 370 (383) Q Consensus 298 ~P~~~~i~~~l~~~l~~~~e~~~~~~--~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~---~--~~ 370 (383) .|++++|+++|+++|..++++|.... ++.|...+++.+++++++|++|+||+|+++|++|++..+..... + .. T Consensus 305 ~P~~~~ie~~l~~~L~~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~~d~~~~p~n~~~~ 384 (403) T protein:vir:10 305 IPMLNKLTSSLTFFFGYKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQMNKIRIPANVAGS 384 (403) T ss_pred HHHHHHHHHHHHHhcCceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccccccccccccc Confidence 99999999999999998888887644 77888999999999999999999999999999998643322211 1 12 Q ss_pred CCCCCCCCCCCCC Q lcl|NC_018285. 371 RTILKGGETNGQD 383 (383) Q Consensus 371 ~~~~~ggd~~~~d 383 (383) ..+.+|||.++.+ T Consensus 385 ~~~~~~~e~~~~~ 397 (403) T protein:vir:10 385 ATGVSGQEGGRPK 397 (403) T ss_pred cccCCCCcCCCCC Confidence 2345666654444 No 60 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=100.00 E-value=1e-75 Score=431.72 Aligned_cols=367 Identities=14% Similarity=0.133 Sum_probs=295.8 Q ss_pred CchhhhhhcCCcccccccccccchhh---cccccCCceec-hhhhhccHHHHHHHHHHHHhhhhCceeeecch------- Q lcl|NC_018285. 
1 MPIFNLATESPPNNQGGFFDITDPEF---LATLNGSEWVS-AETALKNSDLFSIISQLSNDLATAKLTTSRKQ------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~-~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~------- 69 (383) |+||++.++............+.+.. .+.+......+ ...++++++|++||++||++||++|+++++.. T Consensus 13 m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~~~~~~~~~~~~~~~~ 92 (413) T protein:vir:96 13 LKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKELISDGYTKLSDSPEVRMAVDCIADLVSNMTIQLMQNGETGDKRI 92 (413) T ss_pred CCccccCCCcchhhhhhccccccccccccchhhHhhhccchhHHHhhchHHHHHHHHHHHhhccCceEEEEecCCCcccc Confidence 99998754322111111111111100 01111111112 23478899999999999999999999998643 Q ss_pred ----hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCC-ceeEEEEeccceeEEEEcCCCceeEEEEeecCcc Q lcl|NC_018285. 70 ----MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNG-RDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPR 144 (383) Q Consensus 70 ----~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g-~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~ 144 (383) .+.|+.+||++||+++||+.++.+++++||||++++|+.+| ++.+|||++|++|++..+. +.++|.+...+ T Consensus 93 ~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~~~--~~~~y~~~~~~-- 168 (413) T protein:vir:96 93 KNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPISPYKVTFNVSD--DDLDYSITFDN-- 168 (413) T ss_pred ccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEecCceeEEEEcC--CeEEEEEeecC-- Confidence 23477899999999999999999999999999999999887 5789999999999988754 45677776544 Q ss_pred cccceeecccceEEecc-CCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHH Q lcl|NC_018285. 145 IPPKQHVPQSDILHFRL-LSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSR 223 (383) Q Consensus 145 ~~~~~~~~~~dvih~~~-~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~ 223 (383) ..++++||||||. 
+++.++++|.||+.++..++....++++++.++|+||++|++++++++.+++++++++++.| T Consensus 169 ----~~~~~~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~ 244 (413) T protein:vir:96 169 ----KEYDPSTLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDSDELSDEEGRENF 244 (413) T ss_pred ----cEEchhhEEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHH Confidence 3688999999995 56777889999999999999999999999999999999999999999999999999999998 Q ss_pred HHh---hcCCcceeecCCCc-eeeec-ccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHH Q lcl|NC_018285. 224 QAM---KQMQGGPLVLDDLE-DFTPL-EIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVA 298 (383) Q Consensus 224 ~~~---~~~~g~~~vl~~g~-~~~~~-~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~ 298 (383) +.. ..++|+++|++.|. ++..+ +.+++|+||+|.+++++++||++|||||.+||.. .+++++..+|+..||+ T Consensus 245 ~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~---~~~~~~~~~~~~~~l~ 321 (413) T protein:vir:96 245 EEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVG---TYNKDEFNNFINTKIM 321 (413) T ss_pred HHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC---cchHHHHHHHHHHHHH Confidence 664 35789999987665 44555 4689999999999999999999999999999853 2457788899999999 Q ss_pred HHHHHHHHHHHHhhcch---hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCC--- Q lcl|NC_018285. 299 RYLRPFLSELSQKLSCD---VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRT--- 372 (383) Q Consensus 299 P~~~~i~~~l~~~l~~~---~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~--- 372 (383) |+++.|+++||++|+++ ++||....++.|..++++.+++++++|++|+||+|+++|++|++++|.... ..|.. 
T Consensus 322 P~~~~ie~~ln~~ll~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~~gd~~~~-~~n~~~~~ 400 (413) T protein:vir:96 322 SIAQVIQQTYNKLIVEEDMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPDAEMDDLLV-LENYLQQK 400 (413) T ss_pred HHHHHHHHHHHHhhCCCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeee-cccccchh Confidence 99999999999999873 788888899999999999999999999999999999999999988775442 22333 Q ss_pred ------CCCCCCC Q lcl|NC_018285. 373 ------ILKGGET 379 (383) Q Consensus 373 ------~~~ggd~ 379 (383) +.+|||+ T Consensus 401 ~~~~~~~~~~~dt 413 (413) T protein:vir:96 401 DLVNQKKLIQDET 413 (413) T ss_pred hcccccCCCCCCC Confidence 3445555 No 61 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=100.00 E-value=3.8e-75 Score=428.57 Aligned_cols=381 Identities=13% Similarity=0.122 Sum_probs=291.3 Q ss_pred Cchhhh------------hhcCCccccccccccc--chhhccc-----ccCCceechhhhhccHHHHHHHHHHHHhhhhC Q lcl|NC_018285. 1 MPIFNL------------ATESPPNNQGGFFDIT--DPEFLAT-----LNGSEWVSAETALKNSDLFSIISQLSNDLATA 61 (383) Q Consensus 1 Mglf~~------------~~~~~~~~~~~~~~~~--~~~~~~~-----~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~ 61 (383) .=||+. ++++.+.+.......+ ...+.+. ..++..++.+.|+++++|++||++||++||++ T Consensus 62 ~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~es~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsL 141 (945) T protein:vir:10 62 IIIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYSPESLMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSK 141 (945) T ss_pred eeeehhhhHHHhhcccccccccccchhhhhhhccCccceecccccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccC Confidence 223332 1111111111111110 0111111 11244577889999999999999999999999 Q ss_pred ceeeecch-----------------hhhhccCCCccCCHHH----HHHHHHHHHHHcCCeEEEEeecCCCceeEEEEecc Q lcl|NC_018285. 
62 KLTTSRKQ-----------------MQGIVDNPSNSANRFN----FYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRP 120 (383) Q Consensus 62 p~~~~~~~-----------------~~~l~~~PN~~~t~~~----f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~ 120 (383) |+++|++. ...|+.+||++||+++ |+++++.+++++||+|+++.|+.+|+|++|+|++| T Consensus 142 PlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii~L~pLdP 221 (945) T protein:vir:10 142 ELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLVAITPVDG 221 (945) T ss_pred ceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECC Confidence 99998642 2346679999999988 56678899999999999999999999999999999 Q ss_pred ceeEEEEcCCCceeEEEEeecCcccccceeecccceE-EeccCCCCccc--cCcchHHHHHHHHHHHHHHHHHHHHHHh- Q lcl|NC_018285. 121 SQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDIL-HFRLLSVDGGL--TSVSPLMALGRELDIQKASDKLTLNSLK- 196 (383) Q Consensus 121 ~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvi-h~~~~~~~~~~--~G~s~~~~~~~~i~~~~~~~~~~~~~~~- 196 (383) ++|++..++++...++.....++ .....++++|+| |++++++++.. +|+||+.++..++....++++++.++|. T Consensus 222 s~Vti~~ddDG~~~y~Yv~~idG--~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~Fsk 299 (945) T protein:vir:10 222 TTIKPILSEDTGIVVGYVQEVDG--AIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRK 299 (945) T ss_pred cceEEEEcCCCcEEEEEEEecCC--ceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999988777654432222221 234467777755 67778777644 5999999999999999999999999995 Q ss_pred ccCCcceeEeec----------CCCCHHHHHHHHHHHHHhh--cCCcceeecCCCceeeecccChhhHHHHHHHHHHHHH Q lcl|NC_018285. 197 NALNANGILKIK----------GGGLLDFKTKVSRSRQAMK--QMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQ 264 (383) Q Consensus 197 ng~~~~~i~~~~----------~~~~~e~~~~~~~~~~~~~--~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~ 264 (383) ||++|+++++++ +.+++++.+++++.|+... 
.++|+++|+++|++|++++.++.|+||+|++++++++ T Consensus 300 NGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVLdeGmef~pLs~s~~DaQfLEsrkfs~ee 379 (945) T protein:vir:10 300 GGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILSGGKFTWIDFKGKRRDMQFKELAEFVARK 379 (945) T ss_pred CCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCcccccceecCCCceEEEccCChhHHHHHHHHHHHHHH Confidence 788999999864 4578999999999997654 4667899999999999999999999999999999999 Q ss_pred HHHHhcCCHHHhccc--ccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch-----hhccchhhhccCHHHHHHHHHH Q lcl|NC_018285. 265 FAKVYGIPENVVGGQ--GDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCD-----VDADIFPAVDPTGANYISRINS 337 (383) Q Consensus 265 Ia~~~gVpp~~lg~~--~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~-----~e~~~~~~~~~~~~~~~~~~~~ 337 (383) ||++|||||++||.. +++++++++...|+++||+|++++|+++||++|+.. ++++.......+..++++.++. T Consensus 380 IArAFGVPP~lLG~~e~st~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~eg~~i~fdFd~ldl~D~ksraEal~k 459 (945) T protein:vir:10 380 ICAVYQVSPQDVGILEGSNKATAEVMASLTKAKGLEPLMATISKGFDEVVSEFRNEKDIKLWFKEDDLEKERDWWNIIQG 459 (945) T ss_pred HHHHhCCCHHHcccCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccCceeEEEecchhccCHHHHHHHHHH Confidence 999999999999963 456788888999999999999999999999998753 2334444445667888999999 Q ss_pred HHhCCCcCHHHHHHHhhcCCcCCcchhHHhCC-------------------------CCCCCCCCCCCCCC Q lcl|NC_018285. 338 MVKSGTLAQNQGLYILQQAEILPKELPKGENP-------------------------NRTILKGGETNGQD 383 (383) Q Consensus 338 l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~-------------------------~~~~~~ggd~~~~d 383 (383) ++++|++|+||+|+++|++|++++|....... 
+.+..+||++++++ T Consensus 460 li~sGiLTiNEvRe~lGLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns 530 (945) T protein:vir:10 460 QLNTGFRSINEARMEKGLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENS 530 (945) T ss_pred HHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccccccccCCCCcccccCCCCCCCCCCCCCCCCC Confidence 99999999999999999999988876532110 11112455555544 No 62 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=100.00 E-value=4.9e-75 Score=427.96 Aligned_cols=361 Identities=15% Similarity=0.100 Sum_probs=283.3 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--------hhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--------MQG 72 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--------~~~ 72 (383) ||||++++++.........+..+......+.++..++.+.|+++++|++||++||++||++|+++++.+ ... T Consensus 1 MGl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~~~~~~~~~~ 80 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEKRGYLDNVLGKSIRYSGVYVTDSNILQSSDVYELLQDISNQMVLADIVVEDEFGNEIKDDIALQ 80 (394) T ss_pred CchhhhhhhhccCCCCchhhhhhhhhcccccCccccChhhhhccHHHHHHHHHHHHhhcccceEEEcCCCcccchhhHHH Confidence 999999875533222221122222123334466789999999999999999999999999999999754 345 Q ss_pred hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeec Q lcl|NC_018285. 73 IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVP 152 (383) Q Consensus 73 l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~ 152 (383) |+.+||++||+++||+.++.+++++||+|++|.++..+.+ ..|++..++.+ .|.+... 
.+.|+ T Consensus 81 Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~~~--------~~~~~~~~~~~---~~~~~~~------~~~~~ 143 (394) T protein:vir:62 81 ILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIHLA--------SNVFTELDDNL---VEHFNIG------GHEIP 143 (394) T ss_pred HhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceeecc--------ccceEEECCce---EEEEeeC------CEEec Confidence 8899999999999999999999999999999976544332 23444444332 2233221 35799 Q ss_pred ccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCC--HHHHHHHHHHHHH---hh Q lcl|NC_018285. 153 QSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGL--LDFKTKVSRSRQA---MK 227 (383) Q Consensus 153 ~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~--~e~~~~~~~~~~~---~~ 227 (383) ++||||+|+++.++ ++|+||+..+..+|....++++++.++|+||+.|+++++.++.++ +++++++++.|.. +. T Consensus 144 ~~eiih~r~~~~d~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~ 222 (394) T protein:vir:62 144 PCMIRHVKNIGADH-LRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLESI 222 (394) T ss_pred hhheEEecCcCCCC-ccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccc Confidence 99999999887665 789999999999999999999999999999999999999998765 4456777777754 44 Q ss_pred cCCcceeecCCCc--eeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 228 QMQGGPLVLDDLE--DFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFL 305 (383) Q Consensus 228 ~~~g~~~vl~~g~--~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~ 305 (383) .++|+++|++.|. ++++++.++.|+||+|.+++++++||++|||||.+||+. 
++++.+++.++|+..||.|++++|+ T Consensus 223 ~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-~~sn~e~~~~~~~~~~l~P~~~~ie 301 (394) T protein:vir:62 223 DEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTEL-IKEDIEKAMMYIHNKAVRPIMKNFE 301 (394) T ss_pred cccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCC-CCcCHHHHHHHHHHHHHHHHHHHHH Confidence 6778998887765 666889999999999999999999999999999999864 5678899999999999999999999 Q ss_pred HHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHhC-------- Q lcl|NC_018285. 306 SELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILP--KELPKGEN-------- 368 (383) Q Consensus 306 ~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~~-------- 368 (383) ++|+++|++. ++||.....+. ...+..+.+++++|++|+||+|+++|++|+++ +|...... T Consensus 302 ~~l~~kll~~~~~~~~~~~fd~~~~~~~--~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~~~~~n~~~~~~~ 379 (394) T protein:vir:62 302 DHLSLLFYAQNSGKRIKFKINILDFVTY--SNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAIYISNDVTEIGKK 379 (394) T ss_pred HHHhhhhcCccccCceEEEechhhhcCH--HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeccccccccccc Confidence 9999999874 44555444443 45667788999999999999999999999853 33222111 Q ss_pred -CCCCCCCCCCCCCC Q lcl|NC_018285. 369 -PNRTILKGGETNGQ 382 (383) Q Consensus 369 -~~~~~~~ggd~~~~ 382 (383) ....+.+|||++++ T Consensus 380 ~~~~~~~kgge~~en 394 (394) T protein:vir:62 380 EATDGSLGGGEENEN 394 (394) T ss_pred ccccccCCCCCCCCC Confidence 12234579998777 No 63 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=100.00 E-value=9.3e-74 Score=420.96 Aligned_cols=320 Identities=18% Similarity=0.204 Sum_probs=285.8 Q ss_pred hhhCceeeecch-------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCC Q lcl|NC_018285. 
58 LATAKLTTSRKQ-------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDN 130 (383) Q Consensus 58 ia~~p~~~~~~~-------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~ 130 (383) ||++|+++++.. .+.|+.+||++||+++||+.++.+++++||||++++|+..|+|++|+||+|++|++..+.+ T Consensus 1 ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v~~~~~~~ 80 (348) T protein:vir:93 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (348) T ss_pred CcccceEeEecCcCcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCceEEEEeCC Confidence 999999998643 2356789999999999999999999999999999999999999999999999999999888 Q ss_pred CceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC Q lcl|NC_018285. 131 QNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG 210 (383) Q Consensus 131 ~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~ 210 (383) +..++|.+...+ +..+.++++||||+|++++.+.++|+||+.++..++....++++++ ++.++..++++++.++. T Consensus 81 ~~~~~y~~~~~~---g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~ 155 (348) T protein:vir:93 81 SRELYYSIHAAT---GNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSN 155 (348) T ss_pred CcEEEEEEEcCC---CeEEEEccccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHH--HHhcCCCceeEEecCCC Confidence 888888887554 3456799999999999888888999999999999999999998886 44455556788889999 Q ss_pred CCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc--ccCcCHHHH Q lcl|NC_018285. 211 GLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ--GDQQSSLEM 288 (383) Q Consensus 211 ~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~--~~~~~~~e~ 288 (383) +++|+++++++.|....+|+|+++|+++|++|++++.+++|+||+|.+++++++||++|||||.+||+. 
+++++.+++ T Consensus 156 l~~e~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~ 235 (348) T protein:vir:93 156 VSTEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEEL 235 (348) T ss_pred CCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH Confidence 999999999999999999999999999999999999999999999999999999999999999999964 457788999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCC Q lcl|NC_018285. 289 SSNVYSKAVARYLRPFLSELSQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILP 360 (383) Q Consensus 289 ~~~~~~~~l~P~~~~i~~~l~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~ 360 (383) .++|+.+||+|+++.|+++|+++|++. ++||...+.+.|..++++.+++++++|++|+||+|+++|++|+++ T Consensus 236 ~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~g 315 (348) T protein:vir:93 236 NRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEG 315 (348) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 999999999999999999999999865 557777888889999999999999999999999999999999998 Q ss_pred cchhHHhCCCC----------CCCCCCCCCCCC Q lcl|NC_018285. 361 KELPKGENPNR----------TILKGGETNGQD 383 (383) Q Consensus 361 ~d~~~~~~~~~----------~~~~ggd~~~~d 383 (383) +|..... .+. 
...+|||+|++| T Consensus 316 gD~~~~~-~n~~~~~~~~~~~~~~~gg~~n~~~ 347 (348) T protein:vir:93 316 GDKPLIS-GDLYPIDTPLELRKSLKGGDKNVNE 347 (348) T ss_pred cCeEeec-ccccccccchhhcccccCCCCCcCC Confidence 8865542 122 235799999988 No 64 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=100.00 E-value=6.7e-71 Score=405.31 Aligned_cols=354 Identities=14% Similarity=0.129 Sum_probs=282.4 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch-------hhhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ-------MQGI 73 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~-------~~~l 73 (383) ||||++++++....... .+. .....++.+.|+++++|++||++||+++|++|+++++.. .+.| T Consensus 1 Mg~f~~~f~~~~~~~~~----~~~------~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~~~~l~~lL 70 (385) T protein:vir:95 1 MGLFDSVFKRHSELSWM----YDL------EFLQDKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTKEKGTLYYLL 70 (385) T ss_pred CchhhhhhccCcccccc----cch------hhhhccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCccccchHHHHH Confidence 99999999765433211 111 123346778999999999999999999999999998644 2346 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecc Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQ 153 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 153 (383) +.+||++||+++||+.++.+++++||||+++.++. +.+..++++.+..+..... .+|.+...+. 
+..+.+++ T Consensus 71 ~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~-~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~--~~~~~~~~ 142 (385) T protein:vir:95 71 NVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEG-HFFVADDFEKEDELGLYSH-----RFTNVLVNDF--EFKRVFTM 142 (385) T ss_pred hcccCcCCCHHHHHHHHHHHHhhcCceEEEEecCC-Ceeeccccccccccccccc-----cceeeeeccc--ceeeeecc Confidence 78999999999999999999999999999887765 4555666666665544332 2333333322 34567999 Q ss_pred cceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC--CCCHHHHHHHHHHHHHhhc--- Q lcl|NC_018285. 154 SDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG--GGLLDFKTKVSRSRQAMKQ--- 228 (383) Q Consensus 154 ~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~--~~~~e~~~~~~~~~~~~~~--- 228 (383) +||||+|++++++..+|.||+..+...+....++.. +++.|+++++.++ .+++++++++++.|....+ T Consensus 143 ~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~-------~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~~g~~ 215 (385) T protein:vir:95 143 DDVIYLKYNNQKLDAFSLGLFEDYGEIFGRMIDLQM-------LNNQIRGILKVDATKFYNKEKQKELQAYIDTLFDAFQ 215 (385) T ss_pred ccEEEecCCCCCcccccchHHHHHHHHHHHHHHHHH-------hcCCCceEEEeCCccCCCHHHHHHHHHHHHHHhhhhh Confidence 999999999988888999999999998876554422 2344788888864 4788888888888865432 Q ss_pred -CCcceeecCCCceeeeccc------ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 229 -MQGGPLVLDDLEDFTPLEI------KSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYL 301 (383) Q Consensus 229 -~~g~~~vl~~g~~~~~~~~------~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~ 301 (383) +.++++++++|++|++++. 
++.|+||+|.+++++++||++|||||++|+ ++++|.+++.++|+++||.|++ T Consensus 216 ~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~--~~~sn~e~~~~~~~~~~l~P~~ 293 (385) T protein:vir:95 216 NNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVL--GEMADLEKTIESYLQFCINPLL 293 (385) T ss_pred hcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhc--CCCcCHHHHHHHHHHHHHHHHH Confidence 3456889999999999875 677999999999999999999999999996 4678899999999999999999 Q ss_pred HHHHHHHHHhhcch-------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcC--CcchhHHhCCCCC Q lcl|NC_018285. 302 RPFLSELSQKLSCD-------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEIL--PKELPKGENPNRT 372 (383) Q Consensus 302 ~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~--~~d~~~~~~~~~~ 372 (383) ++|+++|+++|++. ++||+...++.|..++++.+.+++++|++|+||+|+++|++|++ ++|..... .|.. T Consensus 294 ~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd~~~~~-~n~~ 372 (385) T protein:vir:95 294 RKIEAELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELDKFIIT-KNLQ 372 (385) T ss_pred HHHHHHHHhhcCChhhcccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeec-ccce Confidence 99999999999874 56788888899999999999999999999999999999999984 45654432 2333 Q ss_pred C---CCCCCCCCC Q lcl|NC_018285. 373 I---LKGGETNGQ 382 (383) Q Consensus 373 ~---~~ggd~~~~ 382 (383) + .+|||++++ T Consensus 373 ~~~~~kgge~~~e 385 (385) T protein:vir:95 373 SADAFKGGESNEE 385 (385) T ss_pred ecccccCCCCCCC Confidence 3 479999988 No 65 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=100.00 E-value=7.9e-71 Score=404.91 Aligned_cols=354 Identities=12% Similarity=0.077 Sum_probs=274.9 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch-------hhhh Q lcl|NC_018285. 
1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ-------MQGI 73 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~-------~~~l 73 (383) ||||++++++...... ...+.....++.+.|+++++|++||++||+++|++|+++++.. .+.| T Consensus 1 Mg~f~~l~~~~~~~~~----------~~~~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~~l~~ll 70 (376) T protein:vir:78 1 MGFFSELFKRNKEIEW----------MWDLDFLEDKTTKVYLKKMALNTCVKHIARTIAKSDFRLKNGETSVRDKLYYKL 70 (376) T ss_pred CchhhhhhccCCcccc----------ccchhhccccchhhhhhhHHHHHHHHHHHHhhcccceeeccccccccchHHHHH Confidence 9999999876543211 1112234557889999999999999999999999999998644 3457 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecc Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQ 153 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 153 (383) +.+||++||+++||+.++.+++++||||+++.|+..|.+.+++|+.+..+.... ++.+...++ +....+++ T Consensus 71 ~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~--~~~~~~~~ 141 (376) T protein:vir:78 71 NIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFPDV-------FEGVTVKDY--RYNRNFSM 141 (376) T ss_pred hhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceeeee-------eeeeeeecc--eeeeeecc Confidence 789999999999999999999999999999999999999999999887654322 233333322 12356999 Q ss_pred cceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhc----C Q lcl|NC_018285. 154 SDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQ----M 229 (383) Q Consensus 154 ~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~----~ 229 (383) +||||+|+...++...+.++...+...+.. 
......+.++.++.++++.++.+++++++++++.|+...+ + T Consensus 142 ~evih~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~ 216 (376) T protein:vir:78 142 DDVIFLEYGNERLSAFTDGMFEDYGELFGK-----MIRAQMRNFQIRGAVNFKMAGVADKDKQTKLQEYIDKVYASFNNN 216 (376) T ss_pred ccEEEeccCCCCchhhhhHHHHHHHHHHHH-----HHHHHHhcCCCceeEEEccCCCCCHHHHHHHHHHHHHHhcccccc Confidence 999999976555433333333333222211 1122344455566666777788999999999998865443 3 Q ss_pred CcceeecCCCceeeecccChhhH-----HHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 230 QGGPLVLDDLEDFTPLEIKSNVA-----QLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPF 304 (383) Q Consensus 230 ~g~~~vl~~g~~~~~~~~~~~d~-----~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i 304 (383) .++++++++|++|++++.++.|+ ||+|++++++++||++|||||++||+ ++++.+++.++|+++||.|++++| T Consensus 217 ~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~--~~s~~e~~~~~f~~~~l~P~~~~i 294 (376) T protein:vir:78 217 EIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHG--DMADLSNNMKAYMEYCIDPLTKKL 294 (376) T ss_pred CcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCC--CCCCHHHHHHHHHHHHHHHHHHHH Confidence 45578899999999999988664 99999999999999999999999974 678889999999999999999999 Q ss_pred HHHHHHhhcch----hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc--hhHHhCCCCCCCCCCC Q lcl|NC_018285. 305 LSELSQKLSCD----VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE--LPKGENPNRTILKGGE 378 (383) Q Consensus 305 ~~~l~~~l~~~----~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d--~~~~~~~~~~~~~ggd 378 (383) +++|+++|++. +++++...++.|..++++.+.+++++|++|+||+|+++|++|+++++ ... 
-..|..|++.|+ T Consensus 295 e~~l~~kll~~~~~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~d~~~-~~~n~~~~~~~~ 373 (376) T protein:vir:78 295 EDELNAKLFTFSEFLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPELDKYL-ITKNYQSADEGG 373 (376) T ss_pred HHHHHhhhCCcccceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceee-eccCceehhccc Confidence 99999999975 34566677788999999999999999999999999999999998774 333 356777776455 Q ss_pred CCC Q lcl|NC_018285. 379 TNG 381 (383) Q Consensus 379 ~~~ 381 (383) .|| T Consensus 374 e~g 376 (376) T protein:vir:78 374 EDG 376 (376) T ss_pred cCC Confidence 555 No 66 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=100.00 E-value=4.3e-70 Score=400.90 Aligned_cols=354 Identities=14% Similarity=0.129 Sum_probs=278.0 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch-------hhhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ-------MQGI 73 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~-------~~~l 73 (383) ||||+++++++.... ++..+..+..++.+.|+++++|++||++||+++|++|+++++.. .+.| T Consensus 1 Mg~f~~lf~~~~~~~----------~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll 70 (395) T protein:vir:95 1 MSILEKIFKTRKDIT----------YMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKL 70 (395) T ss_pred CchhhhhhccCcccc----------ccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHH Confidence 999999987754321 12223456678889999999999999999999999999998643 3457 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecc Q lcl|NC_018285. 
74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQ 153 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 153 (383) +.+||++||+++||+.++.++++.|++|+++.++.. ++++++..++.....+. .++.+..... +..+.+++ T Consensus 71 ~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~--~~~~~~~~~~--~~~~~~~~ 141 (395) T protein:vir:95 71 NIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKE-----LLIADSFYREEYALYDD--IFKDVTVKDY--TYQRTFTM 141 (395) T ss_pred HhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCC-----eEecCCccceeEeecCc--ceeEEEEcCc--eeeeeecc Confidence 789999999999999999999999999988765532 45555555544433222 2333333322 34567999 Q ss_pred cceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC-CCHHHHHHHHHHHHHhhc--CC Q lcl|NC_018285. 154 SDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG-GLLDFKTKVSRSRQAMKQ--MQ 230 (383) Q Consensus 154 ~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~-~~~e~~~~~~~~~~~~~~--~~ 230 (383) +||||++++++.+..+|.||+..+...+.... +.|.+++.++++|+.++. +++++++++++.|+.... ++ T Consensus 142 ~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~ 214 (395) T protein:vir:95 142 QEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNK 214 (395) T ss_pred ccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccc Confidence 99999999888888899999999887776543 356778888999988766 688888888888865443 33 Q ss_pred cc--eeecCCCceeeecccChhhH-----HHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 
231 GG--PLVLDDLEDFTPLEIKSNVA-----QLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRP 303 (383) Q Consensus 231 g~--~~vl~~g~~~~~~~~~~~d~-----~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~ 303 (383) ++ ++++++|++|++++.++.++ ||+|++++++++||++|||||++||+ ++++.+++.++|+++||.|++++ T Consensus 215 ~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~--~~sn~e~~~~~~~~~~l~P~~~~ 292 (395) T protein:vir:95 215 NQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG--ETADLEKNTLVFEKFCLTPLLKK 292 (395) T ss_pred cCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC--cccCHHHHHHHHHHHHHHHHHHH Confidence 33 56689999999999988765 89999999999999999999999974 67889999999999999999999 Q ss_pred HHHHHHHhhcch------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCc--chhHHhCCCCC--- Q lcl|NC_018285. 304 FLSELSQKLSCD------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPK--ELPKGENPNRT--- 372 (383) Q Consensus 304 i~~~l~~~l~~~------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~--d~~~~~~~~~~--- 372 (383) |+++|+++|++. ++|+.....+.|..++++.++.++++|++|+||+|+++|++|++++ |..... .|.. T Consensus 293 ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~-~n~~~~~ 371 (395) T protein:vir:95 293 IQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLIT-KNYEKAN 371 (395) T ss_pred HHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeec-ccccccc Confidence 999999999874 4567777788899999999999999999999999999999999876 433221 1111 Q ss_pred ------------CCCCCCCCCCC Q lcl|NC_018285. 
373 ------------ILKGGETNGQD 383 (383) Q Consensus 373 ------------~~~ggd~~~~d 383 (383) ..+|||++++- T Consensus 372 ~~~~~~~~~~~~~~kgg~~~~~g 394 (395) T protein:vir:95 372 SGENDEKEKDENTLKGGDEDESG 394 (395) T ss_pred ccccccCcccccccCCCCCCCCC Confidence 23566664443 No 67 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=100.00 E-value=4.3e-70 Score=400.90 Aligned_cols=354 Identities=14% Similarity=0.129 Sum_probs=278.0 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch-------hhhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ-------MQGI 73 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~-------~~~l 73 (383) ||||+++++++.... ++..+..+..++.+.|+++++|++||++||+++|++|+++++.. .+.| T Consensus 1 Mg~f~~lf~~~~~~~----------~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll 70 (395) T protein:vir:10 1 MSILEKIFKTRKDIT----------YMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKL 70 (395) T ss_pred CchhhhhhccCcccc----------ccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHH Confidence 999999987754321 12223456678889999999999999999999999999998643 3457 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecc Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQ 153 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 153 (383) +.+||++||+++||+.++.++++.|++|+++.++.. ++++++..++.....+. .++.+..... 
+..+.+++ T Consensus 71 ~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~--~~~~~~~~~~--~~~~~~~~ 141 (395) T protein:vir:10 71 NIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKE-----LLIADSFYREEYALYDD--IFKDVTVKDY--TYQRTFTM 141 (395) T ss_pred HhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCC-----eEecCCccceeEeecCc--ceeEEEEcCc--eeeeeecc Confidence 789999999999999999999999999988765532 45555555544433222 2333333322 34567999 Q ss_pred cceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC-CCHHHHHHHHHHHHHhhc--CC Q lcl|NC_018285. 154 SDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG-GLLDFKTKVSRSRQAMKQ--MQ 230 (383) Q Consensus 154 ~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~-~~~e~~~~~~~~~~~~~~--~~ 230 (383) +||||++++++.+..+|.||+..+...+.... +.|.+++.++++|+.++. +++++++++++.|+.... ++ T Consensus 142 ~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~ 214 (395) T protein:vir:10 142 QEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNK 214 (395) T ss_pred ccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccc Confidence 99999999888888899999999887776543 356778888999988766 688888888888865443 33 Q ss_pred cc--eeecCCCceeeecccChhhH-----HHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 
231 GG--PLVLDDLEDFTPLEIKSNVA-----QLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRP 303 (383) Q Consensus 231 g~--~~vl~~g~~~~~~~~~~~d~-----~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~ 303 (383) ++ ++++++|++|++++.++.++ ||+|++++++++||++|||||++||+ ++++.+++.++|+++||.|++++ T Consensus 215 ~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~--~~sn~e~~~~~~~~~~l~P~~~~ 292 (395) T protein:vir:10 215 NQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG--ETADLEKNTLVFEKFCLTPLLKK 292 (395) T ss_pred cCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC--cccCHHHHHHHHHHHHHHHHHHH Confidence 33 56689999999999988765 89999999999999999999999974 67889999999999999999999 Q ss_pred HHHHHHHhhcch------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCc--chhHHhCCCCC--- Q lcl|NC_018285. 304 FLSELSQKLSCD------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPK--ELPKGENPNRT--- 372 (383) Q Consensus 304 i~~~l~~~l~~~------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~--d~~~~~~~~~~--- 372 (383) |+++|+++|++. ++|+.....+.|..++++.++.++++|++|+||+|+++|++|++++ |..... .|.. T Consensus 293 ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~-~n~~~~~ 371 (395) T protein:vir:10 293 IQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLIT-KNYEKAN 371 (395) T ss_pred HHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeec-ccccccc Confidence 999999999874 4567777788899999999999999999999999999999999876 433221 1111 Q ss_pred ------------CCCCCCCCCCC Q lcl|NC_018285. 
373 ------------ILKGGETNGQD 383 (383) Q Consensus 373 ------------~~~ggd~~~~d 383 (383) ..+|||++++- T Consensus 372 ~~~~~~~~~~~~~~kgg~~~~~g 394 (395) T protein:vir:10 372 SGENDEKEKDENTLKGGDEDESG 394 (395) T ss_pred ccccccCcccccccCCCCCCCCC Confidence 23566664443 No 68 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=100.00 E-value=4.3e-70 Score=400.90 Aligned_cols=354 Identities=14% Similarity=0.129 Sum_probs=278.0 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch-------hhhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ-------MQGI 73 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~-------~~~l 73 (383) ||||+++++++.... ++..+..+..++.+.|+++++|++||++||+++|++|+++++.. .+.| T Consensus 1 Mg~f~~lf~~~~~~~----------~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll 70 (395) T protein:vir:10 1 MSILEKIFKTRKDIT----------YMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKL 70 (395) T ss_pred CchhhhhhccCcccc----------ccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHH Confidence 999999987754321 12223456678889999999999999999999999999998643 3457 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecc Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQ 153 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 153 (383) +.+||++||+++||+.++.++++.|++|+++.++.. ++++++..++.....+. .++.+..... 
+..+.+++ T Consensus 71 ~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~--~~~~~~~~~~--~~~~~~~~ 141 (395) T protein:vir:10 71 NIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKE-----LLIADSFYREEYALYDD--IFKDVTVKDY--TYQRTFTM 141 (395) T ss_pred HhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCC-----eEecCCccceeEeecCc--ceeEEEEcCc--eeeeeecc Confidence 789999999999999999999999999988765532 45555555544433222 2333333322 34567999 Q ss_pred cceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC-CCHHHHHHHHHHHHHhhc--CC Q lcl|NC_018285. 154 SDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG-GLLDFKTKVSRSRQAMKQ--MQ 230 (383) Q Consensus 154 ~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~-~~~e~~~~~~~~~~~~~~--~~ 230 (383) +||||++++++.+..+|.||+..+...+.... +.|.+++.++++|+.++. +++++++++++.|+.... ++ T Consensus 142 ~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~ 214 (395) T protein:vir:10 142 QEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNK 214 (395) T ss_pred ccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccc Confidence 99999999888888899999999887776543 356778888999988766 688888888888865443 33 Q ss_pred cc--eeecCCCceeeecccChhhH-----HHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 
231 GG--PLVLDDLEDFTPLEIKSNVA-----QLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRP 303 (383) Q Consensus 231 g~--~~vl~~g~~~~~~~~~~~d~-----~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~ 303 (383) ++ ++++++|++|++++.++.++ ||+|++++++++||++|||||++||+ ++++.+++.++|+++||.|++++ T Consensus 215 ~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~--~~sn~e~~~~~~~~~~l~P~~~~ 292 (395) T protein:vir:10 215 NQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG--ETADLEKNTLVFEKFCLTPLLKK 292 (395) T ss_pred cCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC--cccCHHHHHHHHHHHHHHHHHHH Confidence 33 56689999999999988765 89999999999999999999999974 67889999999999999999999 Q ss_pred HHHHHHHhhcch------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCc--chhHHhCCCCC--- Q lcl|NC_018285. 304 FLSELSQKLSCD------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPK--ELPKGENPNRT--- 372 (383) Q Consensus 304 i~~~l~~~l~~~------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~--d~~~~~~~~~~--- 372 (383) |+++|+++|++. ++|+.....+.|..++++.++.++++|++|+||+|+++|++|++++ |..... .|.. T Consensus 293 ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~-~n~~~~~ 371 (395) T protein:vir:10 293 IQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLIT-KNYEKAN 371 (395) T ss_pred HHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeec-ccccccc Confidence 999999999874 4567777788899999999999999999999999999999999876 433221 1111 Q ss_pred ------------CCCCCCCCCCC Q lcl|NC_018285. 
373 ------------ILKGGETNGQD 383 (383) Q Consensus 373 ------------~~~ggd~~~~d 383 (383) ..+|||++++- T Consensus 372 ~~~~~~~~~~~~~~kgg~~~~~g 394 (395) T protein:vir:10 372 SGENDEKEKDENTLKGGDEDESG 394 (395) T ss_pred ccccccCcccccccCCCCCCCCC Confidence 23566664443 No 69 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=100.00 E-value=8.3e-70 Score=399.30 Aligned_cols=375 Identities=13% Similarity=0.054 Sum_probs=273.1 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceec----hhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhhccC Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVS----AETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVDN 76 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~ 76 (383) |.+.+.-+-.+-.....+..+....+..+. ..+++ ++.+..+++|++||++||++||++|++++......+... T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~pp~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~~i~~~~~~~~~~l 83 (540) T protein:vir:41 6 LSIKSLEKYRAIKGDTDSQALKEDRFEEYV--EPKVHPLVLLSLLQVNPYHASACSIKANDILRTGYLIDGDDGGVEELL 83 (540) T ss_pred cChhhccchhhhhccccccccccCCCCccc--cCCCCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCceEecCccchhhhc Confidence 555442111111111111111111111111 01122 234456889999999999999999999998887777778 Q ss_pred CCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-------ceeEEEEee------cCc Q lcl|NC_018285. 77 PSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-------NGLYYNVTF------DDP 143 (383) Q Consensus 77 PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-------~~~~y~~~~------~~~ 143 (383) ||++||+++||++++.+++++||||++++|+.+|+|++|+||+|++|++..+... ....|...+ ... 
T Consensus 84 pN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 163 (540) T protein:vir:41 84 RACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGSRYMQTWDGIHVTYFKDYRYEGEVNPD 163 (540) T ss_pred cCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCceeEeeecCceeeeeecccccceeecc Confidence 9999999999999999999999999999999999999999999999998765432 112221111 111 Q ss_pred ccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHH-------- Q lcl|NC_018285. 144 RIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDF-------- 215 (383) Q Consensus 144 ~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~-------- 215 (383) .+...+.++++||||+|.+++.++++|+||+.++..++....++++++.++|+||++|+++|++++.+++++ T Consensus 164 ~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~ 243 (540) T protein:vir:41 164 NGEDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEP 243 (540) T ss_pred ccccceeecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHH Confidence 223456799999999999988888999999999999999999999999999999999999999988765543 Q ss_pred --HHHHHHHHHH----hhcCCcceeecC------CCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc---- Q lcl|NC_018285. 216 --KTKVSRSRQA----MKQMQGGPLVLD------DLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ---- 279 (383) Q Consensus 216 --~~~~~~~~~~----~~~~~g~~~vl~------~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~---- 279 (383) ++++++.|.. ..+|+|+++||+ +|++|++++.+++|+||+|++++++++||++|||||++||.. 
T Consensus 244 ~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~ 323 (540) T protein:vir:41 244 TGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGP 323 (540) T ss_pred HHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCC Confidence 2334444433 245789999984 799999999999999999999999999999999999999853 Q ss_pred ccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchh------hccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_018285. 280 GDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCDV------DADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYIL 353 (383) Q Consensus 280 ~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~------e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~l 353 (383) .++++.+++.+.|+.+||.|++++|+++||++|+++. +|+....++. +.++.++.++++|++|+||+|+.+ T Consensus 324 ~n~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~~~~i~f~~~~ll~~---D~~~~~~~lv~~G~lT~NE~Re~L 400 (540) T protein:vir:41 324 LGGNFAEVARRTYYESVVRPQQEIVSSVLTDFIQLKLDPGARFVFNEEILMES---EFVHNYALLVQCGVLTPSEVREKL 400 (540) T ss_pred CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecchhhcch---HHHHHHHHHHhCCCCCHHHHHHHh Confidence 2356888889999999999999999999999998754 3444444443 455667889999999999999865 Q ss_pred -hcCCcCCcchhHHh--CCCCCCCCCCCCCCCC Q lcl|NC_018285. 354 -QQAEILPKELPKGE--NPNRTILKGGETNGQD 383 (383) Q Consensus 354 -g~~~~~~~d~~~~~--~~~~~~~~ggd~~~~d 383 (383) |.+|. +| ..+. 
+.....+.+++.+++- T Consensus 401 ~g~e~g--dd-~~l~p~n~~~~~~~~~~~~~~~ 430 (540) T protein:vir:41 401 FGLDGG--PD-MFMVPSSIGKSAMKRQKRNYEK 430 (540) T ss_pred CcCcCC--Cc-ccccccccccccccccccccCC Confidence 55442 22 2211 1111122222211111 No 70 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=100.00 E-value=6.8e-70 Score=399.78 Aligned_cols=362 Identities=10% Similarity=0.086 Sum_probs=274.9 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch---------hh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ---------MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~---------~~ 71 (383) ||||++++.++...... . .....-..++.+.||++++|++||++||++||++|+++++.. .+ T Consensus 1 Mgl~d~~~~~~~~~~~~------~---~~~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~~~~~~~~ 71 (395) T protein:vir:96 1 MGILDFFSFKKSGTLSD------D---DSGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTENQKDWLY 71 (395) T ss_pred CcchhhhcCCCCccccc------c---ccccchhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCccccccchHHH Confidence 99999987765332111 1 111122346778899999999999999999999999998753 23 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceee Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHV 151 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~ 151 (383) .|+.+||++||+++||+.++.+++++||||+++.|+..+.+...++.... 
-....++.+.+.++ ...+.+ T Consensus 72 lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~v~~~~~--~~~~~~ 141 (395) T protein:vir:96 72 WINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGIYVADAFTQDKK--------LSGNKFKVSRVQGQ--TYEKIF 141 (395) T ss_pred HHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCceecCCccccccc--------cccceeeeeeeccc--eeeeEe Confidence 46689999999999999999999999999999998865433322221111 11122333333322 234679 Q ss_pred cccceEEeccCCCCccccCcchHHHHHHHHHHH------HHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHH Q lcl|NC_018285. 152 PQSDILHFRLLSVDGGLTSVSPLMALGRELDIQ------KASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQA 225 (383) Q Consensus 152 ~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~------~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~ 225 (383) +++||+|+|+++++...++.+++......+... ..+.++..+++++++.+.++++.++...++..++..+.+.. T Consensus 142 ~~~dvih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (395) T protein:vir:96 142 TFDQVIYLKNDNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGGRQPKSDKDFFKRTIE 221 (395) T ss_pred ccCceEEecccCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCchhhHHHHHHHHHHHHH Confidence 999999999877665555555444443333332 33446788999999999999999988888777777666654 Q ss_pred h-hcCCcceeecCCCceeeecccChhhHHHHHHHHHH------HHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHH Q lcl|NC_018285. 226 M-KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWT------TGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVA 298 (383) Q Consensus 226 ~-~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~------~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~ 298 (383) . ..+.++++++++|++|++++.++.|+|+++.+++. .++||++|||||++|| +++++.+++.++|+++||. 
T Consensus 222 ~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~--~~~sn~e~~~~~f~~~~L~ 299 (395) T protein:vir:96 222 KIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH--GDIADNQKNYELLLEGPIE 299 (395) T ss_pred HhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc--CCCccHHHHHHHHHHHHHH Confidence 4 34566788999999999999999999988887665 5899999999999997 4678899999999999999 Q ss_pred HHHHHHHHHHHHhhcchh------hccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHH-hCC Q lcl|NC_018285. 299 RYLRPFLSELSQKLSCDV------DADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILP--KELPKG-ENP 369 (383) Q Consensus 299 P~~~~i~~~l~~~l~~~~------e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~-~~~ 369 (383) |++++|+++|+++|+++. +|+.....+.|..++++.+..++++|++|+||+|+++|++|+++ +|.... .|. T Consensus 300 P~~~~ie~~l~~~Ll~~~e~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~~~gD~~~~~~N~ 379 (395) T protein:vir:96 300 SLITNIVDGLEYAIFDKSETLEGSFIKVTGLKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELPDGLGKVLYMTKNY 379 (395) T ss_pred HHHHHHHHHHHhhcCChhhhcCceeEeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccc Confidence 999999999999999753 35666777889999999999999999999999999999999976 554333 222 Q ss_pred CCCCCCCCCCCCCC Q lcl|NC_018285. 370 NRTILKGGETNGQD 383 (383) Q Consensus 370 ~~~~~~ggd~~~~d 383 (383) .+...+|||+++++ T Consensus 380 ~~~~~~gge~~~~~ 393 (395) T protein:vir:96 380 ESVLERGGEVDEEV 393 (395) T ss_pred eechhccCCCCCCC Confidence 23344799988877 No 71 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=100.00 E-value=3.4e-68 Score=390.44 Aligned_cols=378 Identities=10% Similarity=0.056 Sum_probs=272.5 Q ss_pred Cchhhhhhc-CC-ccc---------------------------ccccccccchhhc-ccccC-----Cceech------- Q lcl|NC_018285. 
1 MPIFNLATE-SP-PNN---------------------------QGGFFDITDPEFL-ATLNG-----SEWVSA------- 38 (383) Q Consensus 1 Mglf~~~~~-~~-~~~---------------------------~~~~~~~~~~~~~-~~~~~-----~~~~~~------- 38 (383) ||||+++.. ++ ... .....-.+.+... -.+.. ..+.+. T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~~l~~~~ 84 (551) T protein:vir:80 5 LGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVL 84 (551) T ss_pred hhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceecCcccccCccccChhHHHHHH Confidence 999999863 11 000 0000000001100 00000 011111 Q ss_pred hhhhccHHHHHHHHHHHHhhhhC-----------ceeeecch---------------hhhhccCCCcc-----CCHHHHH Q lcl|NC_018285. 39 ETALKNSDLFSIISQLSNDLATA-----------KLTTSRKQ---------------MQGIVDNPSNS-----ANRFNFY 87 (383) Q Consensus 39 ~~a~~~~~v~~~i~~ia~~ia~~-----------p~~~~~~~---------------~~~l~~~PN~~-----~t~~~f~ 87 (383) +.+..+|+|++||+.||++||++ ++.+.-.+ ...++.+||+. +|+.+|+ T Consensus 85 ~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f~ 164 (551) T protein:vir:80 85 KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFV 164 (551) T ss_pred HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHHHHH Confidence 22345799999999999999984 34332111 12356788887 4888999 Q ss_pred HHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCce----eEEEEeecCcccccceeecccceEEeccCC Q lcl|NC_018285. 88 QSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNG----LYYNVTFDDPRIPPKQHVPQSDILHFRLLS 163 (383) Q Consensus 88 ~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~----~~y~~~~~~~~~~~~~~~~~~dvih~~~~~ 163 (383) ++++.+++++||||++++|+.+|+|++||||+|++|++..+.++.. 
.+|.+...+ +....|+++||||+++++ T Consensus 165 ~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~~g---~~~~~~~~~eiiH~~~n~ 241 (551) T protein:vir:80 165 KKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQ---KIVATFNAREMAFAVRNP 241 (551) T ss_pred HHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccccccCceEEEEEeCC---cEEEEEcccceEEecccC Confidence 9999999999999999999999999999999999999988776643 233333222 335679999999999654 Q ss_pred CC---ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC--CCHHHHHHHHHHHHHh---hcCCcceee Q lcl|NC_018285. 164 VD---GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG--GLLDFKTKVSRSRQAM---KQMQGGPLV 235 (383) Q Consensus 164 ~~---~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~--~~~e~~~~~~~~~~~~---~~~~g~~~v 235 (383) .. +..+|+||+.++..+|....++++++.++|+||++|+++|++++. +++++.+++++.|... ..|+|++++ T Consensus 242 ~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~v 321 (551) T protein:vir:80 242 RSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPV 321 (551) T ss_pred CCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcccc Confidence 33 356799999999999999999999999999999999999998754 7899999999998654 468888765 Q ss_pred c-CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc------------cCcCHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 236 L-DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG------------DQQSSLEMSSNVYSKAVARYLR 302 (383) Q Consensus 236 l-~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~------------~~~~~~e~~~~~~~~~l~P~~~ 302 (383) + ++|++|++++.++.|+||+|++++++++||++|||||++||... 
+++|.+++...|+..||+|+++ T Consensus 322 l~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~ 401 (551) T protein:vir:80 322 VSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLG 401 (551) T ss_pred ccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHH Confidence 5 68999999999999999999999999999999999999998532 3567888889999999999999 Q ss_pred HHHHHHHHhhcchhhccc----hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCC-cCCcchhHHhCCCCCC---- Q lcl|NC_018285. 303 PFLSELSQKLSCDVDADI----FPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAE-ILPKELPKGENPNRTI---- 373 (383) Q Consensus 303 ~i~~~l~~~l~~~~e~~~----~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~-~~~~d~~~~~~~~~~~---- 373 (383) .|+++||++|+++++.++ ......+..+. ..+.+++.+|++|+||+|+++|++| ++++|...... +-.+ T Consensus 402 ~ie~~ln~~L~~~~~~~~~f~f~~~~~~~~~~~-~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~-~~~~~~~~ 479 (551) T protein:vir:80 402 FIEDFINKHIVAEFGDKYTFQFVGGDIKSELES-VKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGV-IVQRIGQL 479 (551) T ss_pred HHHHHHHhhhccccCCceEEEeeccChhhHHHH-HHHHHHHhcCCcCHHHHHHHhCCCCCCCCCceeeccc-cccccccc Confidence 999999999998654332 22222222333 3345677789999999999999988 67777543110 0000 Q ss_pred --------------------CCCCCCCCCC Q lcl|NC_018285. 374 --------------------LKGGETNGQD 383 (383) Q Consensus 374 --------------------~~ggd~~~~d 383 (383) ..|++.++.. 
T Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (551) T protein:vir:80 480 MQQEQFEHEKQQSNLQMLQEQTGNRVSTDV 509 (551) T ss_pred ccccCcchhhhhhccccccCcCCCCCCCCC Confidence 0011111100 No 72 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=100.00 E-value=1.3e-68 Score=392.79 Aligned_cols=362 Identities=15% Similarity=0.114 Sum_probs=269.4 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch-------hhhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ-------MQGI 73 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~-------~~~l 73 (383) ||||++++..... ........+... +.....++.+.|+++++|++||++||+++|++|+++++++ .+.| T Consensus 1 Mg~~~~~~~~~~~-~~~~~~~~~~~~---~~~~~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~~~~~~~~~lL 76 (395) T protein:vir:40 1 MGFKSWVSGFFNE-EQRTLNLTDTVW---CSIPSEKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKGEEVRKKNWYMF 76 (395) T ss_pred CchHHHHHhhhcc-cccccccccchh---hccccccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCCccccchHHHHH Confidence 9999998754322 112222222221 2234456788999999999999999999999999998753 3458 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecc Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQ 153 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 153 (383) +.+||++||+++||+.++.+++++||||+++.++.. ++.++. ...........++.+..++. 
...+.|++ T Consensus 77 ~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~~------~~~~~~--~~~~~~~~~~~~~~v~~~~~--~~~~~~~~ 146 (395) T protein:vir:40 77 NVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEYI------YVADSF--TKNDKSLYENTYTEVTLKDL--TLKKEFKE 146 (395) T ss_pred HhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCce------eecCCc--cccccccccceeeeeeecCc--eeeeeecc Confidence 889999999999999999999999999999987642 222221 12121111222233333322 23467999 Q ss_pred cceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHh----hcC Q lcl|NC_018285. 154 SDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAM----KQM 229 (383) Q Consensus 154 ~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~----~~~ 229 (383) +||||+|+.+..+...+.+.+......+... .....+.++..+.++++.+..+++++++++++.|... ..+ T Consensus 147 ~evih~r~~~~~~~~~~~~l~~~~~~~~~~~-----~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (395) T protein:vir:40 147 SEVLHLTLNNESIKSIIDGFYLLYGDLLTAA-----VNKYKKLNSRKIIVKLKAMFGQTPEAEEKLRLMLSERMKKFLAE 221 (395) T ss_pred ccEEEeecCCCCccccchhHHHHHHHHHHHH-----HHHHHhcCCCCceEEEecccCCCHHHHHHHHHHHHHHHHHhhcc Confidence 9999999765544333434434333333222 2333455666677777778889999988888877543 356 Q ss_pred CcceeecCCCceeeecccChhhHHHHHHHHHHH---HHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 230 QGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTT---GQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLS 306 (383) Q Consensus 230 ~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~---~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~ 306 (383) +++++|+++|++|++++.++.|+||+|.+++.. 
++||++|||||++||+ ++++.+++.+.|+.+||.|++++|++ T Consensus 222 ~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~--~~sn~e~~~~~f~~~~L~P~~~~ie~ 299 (395) T protein:vir:40 222 GDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKG--DTVGLSEQVNSFLMFSINPIAEMFTD 299 (395) T ss_pred CCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC--CCcCHHHHHHHHHHHHHHHHHHHHHH Confidence 788999999999999999999999999998764 7999999999999974 67888999999999999999999999 Q ss_pred HHHHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHhC------CC Q lcl|NC_018285. 307 ELSQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILP--KELPKGEN------PN 370 (383) Q Consensus 307 ~l~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~~------~~ 370 (383) +|+++|++. ++||...+.+.|..++++.+.+++++|++|+||+|+++|++|+++ +|...... .. T Consensus 300 ~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~ 379 (395) T protein:vir:40 300 EGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQERFVTKNYAPLGEN 379 (395) T ss_pred HHHHhcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCceeeecccccccccc Confidence 999999874 567778888999999999999999999999999999999999965 55433221 12 Q ss_pred CCCCCCCCCCCCC Q lcl|NC_018285. 371 RTILKGGETNGQD 383 (383) Q Consensus 371 ~~~~~ggd~~~~d 383 (383) ....+|||+++++ T Consensus 380 ~~~~kgge~~~~~ 392 (395) T protein:vir:40 380 EEDLKGGDINENK 392 (395) T ss_pred ccccCCCCCCCCc Confidence 2245899998888 No 73 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=100.00 E-value=4.9e-68 Score=389.58 Aligned_cols=375 Identities=10% Similarity=0.057 Sum_probs=274.0 Q ss_pred Cch------------------------------------hhhh--hcCCcccccccccccchhhcccccCCceechhhhh Q lcl|NC_018285. 
1 MPI------------------------------------FNLA--TESPPNNQGGFFDITDPEFLATLNGSEWVSAETAL 42 (383) Q Consensus 1 Mgl------------------------------------f~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 42 (383) |++ .+.. .+.++..+ .+.+ -+..+..+..+..+..-.++ T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~--~~~~l~~~~~~~iv~~~i~~ 103 (574) T protein:vir:80 27 MHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIR-NSQD--LHKTLKKFGNNIILNAIINT 103 (574) T ss_pred cccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccC-Cccc--HHHHHHhhccChhHHHHHHH Confidence 222 1111 00010000 0000 01122222333344555667 Q ss_pred ccHHHHHHHHHHHHhhhhCceeeecchh---------------hhhc----cCCCccC-CHHHHHHHHHHHHHHcCCeEE Q lcl|NC_018285. 43 KNSDLFSIISQLSNDLATAKLTTSRKQM---------------QGIV----DNPSNSA-NRFNFYQSIFAQMLLGGEAFA 102 (383) Q Consensus 43 ~~~~v~~~i~~ia~~ia~~p~~~~~~~~---------------~~l~----~~PN~~~-t~~~f~~~~~~~~~l~G~a~~ 102 (383) ...+|++|+.+|+.++|++|++|+..+. ..|+ ..|||++ |+.+|++.++.+++++||+|+ T Consensus 104 ~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi 183 (574) T protein:vir:80 104 RSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNF 183 (574) T ss_pred HHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEE Confidence 7888999999999999999999985421 1233 2466665 788999999999999999999 Q ss_pred EEeecCCCceeEEEEeccceeEEEEcCCCc-----eeEEEEeecCcccccceeecccceEEeccCCCC---ccccCcchH Q lcl|NC_018285. 103 YRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-----GLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVD---GGLTSVSPL 174 (383) Q Consensus 103 ~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-----~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~---~~~~G~s~~ 174 (383) +++|+.+|+|++||||+|.+|++..+.++. ..+|.+ .. ++....++++||||++++... 
.+.+|+||+ T Consensus 184 ~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~~~-~~---g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi 259 (574) T protein:vir:80 184 EKVFDKDGNFIKFDTVDPTTIFLATNGEGKLIKNGERFVQV-ID---NRIVAKFNERELAFAVRNPRADIEVGQYGYPEL 259 (574) T ss_pred EEEECCCCcEEEEEEEcCceeEEEEcCccccccCceEEEEE-eC---CceEEEEccccEEEEeccCCCCcccccccccHH Confidence 999999999999999999999998876553 223332 22 234677999999999965443 346799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC--CCCHHHHHHHHHHHHHh---hcCCcce-eecCCCceeeecccC Q lcl|NC_018285. 175 MALGRELDIQKASDKLTLNSLKNALNANGILKIKG--GGLLDFKTKVSRSRQAM---KQMQGGP-LVLDDLEDFTPLEIK 248 (383) Q Consensus 175 ~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~--~~~~e~~~~~~~~~~~~---~~~~g~~-~vl~~g~~~~~~~~~ 248 (383) .++..+|....++++++.++|+||++|+++|++++ .+++++.+++++.|... ..|+|++ +++++|++|++++.+ T Consensus 260 ~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s 339 (574) T protein:vir:80 260 EIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPS 339 (574) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCC Confidence 99999999999999999999999999999999864 37999999999999654 4578886 556889999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc------------cCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchh Q lcl|NC_018285. 249 SNVAQLLKQADWTTGQFAKVYGIPENVVGGQG------------DQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCDV 316 (383) Q Consensus 249 ~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~------------~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~ 316 (383) +.|+||+|++++++++||++|||||++||... 
++++.+++.+.|+..||+|+++.|+++||++|++.+ T Consensus 340 ~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~ 419 (574) T protein:vir:80 340 ANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAEF 419 (574) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhc Confidence 99999999999999999999999999998532 246788889999999999999999999999999865 Q ss_pred hccchhhhc-cCHHH--HHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCC------------------- Q lcl|NC_018285. 317 DADIFPAVD-PTGAN--YISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTIL------------------- 374 (383) Q Consensus 317 e~~~~~~~~-~~~~~--~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~------------------- 374 (383) +.+....+. .+... ....+..++.+|++|+||+|+++|++|++++|..... .+..++ T Consensus 420 ~~~~~~~f~~~d~~~~~~~~~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~-~n~~~~~~~~~~~~~~~~~~~~~~~ 498 (574) T protein:vir:80 420 GEKYQFQFRGGDLSAQLDKLKIIEQEGKVFRTVNEIRHDKGLEPIKGGDVILNG-VHIQAIGQALQEEQLEYQRSQDRLN 498 (574) T ss_pred CCceEEEecccchhhHHHHHHHHHHHhCCccCHHHHHHHhCCCCCCCCCEeeec-cceeecccccccccCCccchhcccc Confidence 533222221 12222 2233445788999999999999999999887755321 111111 Q ss_pred -----CCCCCCCCC Q lcl|NC_018285. 375 -----KGGETNGQD 383 (383) Q Consensus 375 -----~ggd~~~~d 383 (383) .|++++..+ T Consensus 499 ~~~~~~~~~~~~~~ 512 (574) T protein:vir:80 499 RLLELSGGDVEQPE 512 (574) T ss_pred ccccccCCCCCCCC Confidence 111111110 No 74 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=100.00 E-value=8.7e-68 Score=388.23 Aligned_cols=378 Identities=12% Similarity=0.051 Sum_probs=273.2 Q ss_pred CchhhhhhcCCcc-cccccccccchhhcccccCCceechh----hhhccHHHHHHHHHHHHhhhhCceeeecchhhhh-c Q lcl|NC_018285. 
1 MPIFNLATESPPN-NQGGFFDITDPEFLATLNGSEWVSAE----TALKNSDLFSIISQLSNDLATAKLTTSRKQMQGI-V 74 (383) Q Consensus 1 Mglf~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~----~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l-~ 74 (383) |.|-+....++-. ...+...+....+...+ ..+++.. .+..+++|++||++||++||++|+++++.....+ . T Consensus 6 ~~i~s~~~~~~i~~~~~~s~~~~~~~~~~~~--~pp~~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~~~~~~~~~l~~ 83 (542) T protein:vir:41 6 LSIRSLEKYKAIKREEVESQALGETRFEEYV--EPKVNPLVLLSLLQVNPYHASACSIKANDIIRTGYILEGDDEGVVDE 83 (542) T ss_pred ccccccccchhhhhccccccccccccCCccc--cCCCCHHHHHHHHhhcHHHHHHHHHHHHHHhhCceeeecccchhhhh Confidence 3333322211111 11110000011111111 1123332 2345789999999999999999999987665443 5 Q ss_pred cCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCce-------eEEEEeecC----- Q lcl|NC_018285. 75 DNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNG-------LYYNVTFDD----- 142 (383) Q Consensus 75 ~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~-------~~y~~~~~~----- 142 (383) ..||++||+++|+++++.+++++||||++++|+.+|++.+|+||+|++|++..+.+... ..|...+.. T Consensus 84 ~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~~~~~~~~~~~~~~~~~y~~~~~~~ 163 (542) T protein:vir:41 84 FIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGSRYRQTWDGVNITHFKDYRYEGEIN 163 (542) T ss_pred hcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCCeeEeeecCCcceeEEeeccccccc Confidence 56999999999999999999999999999999999999999999999999987644321 111111111 Q ss_pred -cccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC----------C Q lcl|NC_018285. 143 -PRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG----------G 211 (383) Q Consensus 143 -~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~----------~ 211 (383) ..+.....++++||||+|.+++.++++|+||+.++..++....++++++.++|+||++|++||++++. 
+ T Consensus 164 ~~~g~~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~ 243 (542) T protein:vir:41 164 PETGEDQDSVGANELVFIHIPSPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDG 243 (542) T ss_pred ccccccccccCcccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCcccccccccccc Confidence 11223456889999999999888889999999999999999999999999999999999999998754 4 Q ss_pred CHHHHHHHHHHHHH----hhcCCcceeecC------CCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc- Q lcl|NC_018285. 212 LLDFKTKVSRSRQA----MKQMQGGPLVLD------DLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG- 280 (383) Q Consensus 212 ~~e~~~~~~~~~~~----~~~~~g~~~vl~------~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~- 280 (383) ++++++++++.|+. ..+|+|+++||+ +|++|++++.++.|+||+|.+++++++||++|||||.+||... T Consensus 244 ~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~ 323 (542) T protein:vir:41 244 NPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADT 323 (542) T ss_pred CHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCC Confidence 56777888877754 345788999984 7999999999999999999999999999999999999999642 Q ss_pred ---cCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccchhhhccC---HHHHHHHHHHHHhCCCcCHHHHHHHh- Q lcl|NC_018285. 
281 ---DQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCDVDADIFPAVDPT---GANYISRINSMVKSGTLAQNQGLYIL- 353 (383) Q Consensus 281 ---~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~e~~~~~~~~~~---~~~~~~~~~~l~~~g~~t~nE~r~~l- 353 (383) ++++.+++.+.|+++||.|+++.|+++||++|+++++.+....+..+ ..+.++.++.++++|++|+||+|+.+ T Consensus 324 ~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~~ll~~d~~~~~~~~v~~GilT~NE~Re~L~ 403 (542) T protein:vir:41 324 GPLGGNFAEVTRRTYYESVVRPQQNIISSILTDFFQVKFNPKTRFKFNDETLLESDSVRNCALLVQSGVLTPAEARERLF 403 (542) T ss_pred cccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCceEEEecchhhcchHHHHHHHHHHhCCCCCHHHHHHhhC Confidence 23577888999999999999999999999999876544332222211 12345667889999999999999864 Q ss_pred hcCCcCCcchhHHh--CCCCCCCCCCCCCCCC Q lcl|NC_018285. 354 QQAEILPKELPKGE--NPNRTILKGGETNGQD 383 (383) Q Consensus 354 g~~~~~~~d~~~~~--~~~~~~~~ggd~~~~d 383 (383) |.+| ++...+. +.....+++++.+.+. T Consensus 404 g~~p---gdd~~l~p~~~~~~~~~~~~~n~~~ 432 (542) T protein:vir:41 404 GLDG---GPDIFMVPSKGAAKSVKRQERNYEK 432 (542) T ss_pred CCCC---CCccccccccccccccccCCcCCCC Confidence 5444 3322221 1111223344443332 No 75 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=100.00 E-value=3.5e-67 Score=384.90 Aligned_cols=379 Identities=10% Similarity=0.045 Sum_probs=272.5 Q ss_pred CchhhhhhcC-Cccc------cc-----------------cc-----ccccchhh------cccccCCceec-------h Q lcl|NC_018285. 1 MPIFNLATES-PPNN------QG-----------------GF-----FDITDPEF------LATLNGSEWVS-------A 38 (383) Q Consensus 1 Mglf~~~~~~-~~~~------~~-----------------~~-----~~~~~~~~------~~~~~~~~~~~-------~ 38 (383) ||||.++.+. +... +. .. ...+.+.+ .+......+.+ . 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~ 80 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVL 80 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHH Confidence 9999998652 1000 00 00 00000000 00000111111 1 Q ss_pred hhhhccHHHHHHHHHHHHhhhhC-------------ceeeecch-------------hhhhccCCCccC-----CHHHHH Q lcl|NC_018285. 39 ETALKNSDLFSIISQLSNDLATA-------------KLTTSRKQ-------------MQGIVDNPSNSA-----NRFNFY 87 (383) Q Consensus 39 ~~a~~~~~v~~~i~~ia~~ia~~-------------p~~~~~~~-------------~~~l~~~PN~~~-----t~~~f~ 87 (383) +.+..+|.|++||+.||+.||++ ++++.... ...++.+||+++ |+.+|+ T Consensus 81 ~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~ 160 (547) T protein:vir:63 81 KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFV 160 (547) T ss_pred HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHH Confidence 22346799999999999999974 23333211 123566788764 889999 Q ss_pred HHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCce----eEEEEeecCcccccceeecccceEEeccCC Q lcl|NC_018285. 88 QSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNG----LYYNVTFDDPRIPPKQHVPQSDILHFRLLS 163 (383) Q Consensus 88 ~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~----~~y~~~~~~~~~~~~~~~~~~dvih~~~~~ 163 (383) ++++.+++++||+|++++|+.+|+|++||||+|++|++..+.++.. .+|..... ++....|+++||||+++++ T Consensus 161 ~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~---~~~~~~~~~~eiih~r~n~ 237 (547) T protein:vir:63 161 KKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVID---QKIVATFNAREMAFAVRNP 237 (547) T ss_pred HHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccccccCceEEEEEcC---CcEEEEeccccEEEecccC Confidence 9999999999999999999999999999999999999988766532 23333322 2345679999999999765 Q ss_pred CCc---cccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC--CCHHHHHHHHHHHHHh---hcCCcceee Q lcl|NC_018285. 
164 VDG---GLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG--GLLDFKTKVSRSRQAM---KQMQGGPLV 235 (383) Q Consensus 164 ~~~---~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~--~~~e~~~~~~~~~~~~---~~~~g~~~v 235 (383) ..+ ..+|+||+.++...|....++++++.++|+||++|+++|++++. +++++++++++.|... ..|+|+++| T Consensus 238 ~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~v 317 (547) T protein:vir:63 238 RSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPV 317 (547) T ss_pred CCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCccccccccc Confidence 433 46799999999999999999999999999999999999998754 7899999999998654 468888765 Q ss_pred c-CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc------------cCcCHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 236 L-DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG------------DQQSSLEMSSNVYSKAVARYLR 302 (383) Q Consensus 236 l-~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~------------~~~~~~e~~~~~~~~~l~P~~~ 302 (383) + ++|++|+++++++.|+||+|++++++++||++|||||++||... +++|.+++.+.|+..||.|+++ T Consensus 318 l~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~ 397 (547) T protein:vir:63 318 VSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLG 397 (547) T ss_pred ccCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHH Confidence 5 68999999999999999999999999999999999999998532 4567888899999999999999 Q ss_pred HHHHHHHHhhcchhhccc----hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCC-cCCcchhHHhC--------- Q lcl|NC_018285. 303 PFLSELSQKLSCDVDADI----FPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAE-ILPKELPKGEN--------- 368 (383) Q Consensus 303 ~i~~~l~~~l~~~~e~~~----~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~-~~~~d~~~~~~--------- 368 (383) .|+++||++|+++++.++ ......+..+ ...+..++.+|++|+||+|+++|++| ++++|...... 
T Consensus 398 ~ie~~ln~~L~~~~~~~~~~~f~~~~~~~~~~-~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~ 476 (547) T protein:vir:63 398 FIEDFINKHIVAEFGDKYTFQFVGGDIKSELE-SVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLM 476 (547) T ss_pred HHHHHHHhhcccccCCceEEEeeccccccHHH-HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCceeecccccccccccc Confidence 999999999997654222 2122223233 33455678889999999999999988 56777543110 Q ss_pred ----C---------CCC-CCCCCCCCCCC Q lcl|NC_018285. 369 ----P---------NRT-ILKGGETNGQD 383 (383) Q Consensus 369 ----~---------~~~-~~~ggd~~~~d 383 (383) . +.. ...|++.++.+ T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (547) T protein:vir:63 477 QQEQFEHEKQQSNLQMLQEQTGNRVSTDV 505 (547) T ss_pred cccCCccccchhhccccccccCCCCCCCC Confidence 0 000 00122211111 No 76 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=100.00 E-value=6.9e-68 Score=388.80 Aligned_cols=361 Identities=10% Similarity=0.061 Sum_probs=270.1 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch---------hh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ---------MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~---------~~ 71 (383) ||||+++..++...... .........++.+.||++++|++||++||++||++|+++++.. .+ T Consensus 1 MGlf~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~~~~~~~~ 71 (395) T protein:vir:98 1 MGILDFFSFKKSGTLSD---------DDSGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLTENQKDWLY 71 (395) T ss_pred CcchhhhcCCCcccccc---------cccchhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCcccccchHHH Confidence 99999997654322111 0111122345778899999999999999999999999998753 23 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceee Q lcl|NC_018285. 
72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHV 151 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~ 151 (383) .|+.+||++||+++||+.++.+++++||||++++++..+. +++..+.... .....++.+.+..+ ...+.+ T Consensus 72 lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~~------~~~~~~~~~~--~~~~~~~~~~~~~~--~~~~~~ 141 (395) T protein:vir:98 72 WINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGIY------VADSFTQDKK--ISGSQFKVSRVQGQ--TYEKTF 141 (395) T ss_pred HHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCcee------cCCccccccc--ccCcccceeeecCc--eeeeEe Confidence 5778999999999999999999999999999999876432 2222222211 11122333333332 224679 Q ss_pred cccceEEeccCCCCccccCcchHHHHHHHHHHHHHH--HHHHHHHHhccCCcceeEeecCCCC-HHHHHHHHHHHH---- Q lcl|NC_018285. 152 PQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKAS--DKLTLNSLKNALNANGILKIKGGGL-LDFKTKVSRSRQ---- 224 (383) Q Consensus 152 ~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~--~~~~~~~~~ng~~~~~i~~~~~~~~-~e~~~~~~~~~~---- 224 (383) +++||||+|+.+.++..++.+++......+...... .....+++.++..+.+++..+.... +++.+..++.++ T Consensus 142 ~~~evih~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (395) T protein:vir:98 142 TFDQVIYLKNDNSDLMSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFKRTVE 221 (395) T ss_pred cCccEEEecCCCCCccccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccccccCCcHHHHHHHHHHHHHHHh Confidence 999999999887776666666666666655544433 3345678888888888887766543 333333333332 Q ss_pred HhhcCCcceeecCCCceeeeccc------ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHH Q lcl|NC_018285. 225 AMKQMQGGPLVLDDLEDFTPLEI------KSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVA 298 (383) Q Consensus 225 ~~~~~~g~~~vl~~g~~~~~~~~------~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~ 298 (383) ....+.++++++++|++|++++. ++.++||.+.+++++++||++|||||++|| +++++.+++.++|+++||. 
T Consensus 222 ~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~--~~~sn~e~~~~~f~~~tl~ 299 (395) T protein:vir:98 222 KIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH--GDIADNQKNYELLLEGPIE 299 (395) T ss_pred hhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhc--CCcccHHHHHHHHHHHHHH Confidence 22345566888999999999985 467789999999999999999999999997 4678899999999999999 Q ss_pred HHHHHHHHHHHHhhcchh------hccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHhCCC Q lcl|NC_018285. 299 RYLRPFLSELSQKLSCDV------DADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILP--KELPKGENPN 370 (383) Q Consensus 299 P~~~~i~~~l~~~l~~~~------e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~~~~ 370 (383) |++++|+++|+++|+++. +|+.....+.|..++++.+.+++++|++|+||+|+++|++|+++ +|.... ..| T Consensus 300 P~~~~ie~~l~~kll~~~~~~~g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~gD~~~~-~~n 378 (395) T protein:vir:98 300 SLITNIVDGLEYAIFDKSETLQGSFIKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLGKVLYM-TKN 378 (395) T ss_pred HHHHHHHHHHHHhcCChhhhcCcceeeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeee-ccc Confidence 999999999999999853 35666778899999999999999999999999999999999976 554443 345 Q ss_pred CCCC--CCCCCCCCC Q lcl|NC_018285. 371 RTIL--KGGETNGQD 383 (383) Q Consensus 371 ~~~~--~ggd~~~~d 383 (383) ..|+ +|||+++++ T Consensus 379 ~~~~~~~gge~~~~~ 393 (395) T protein:vir:98 379 YESVLERGGEVDEEV 393 (395) T ss_pred ceecccccCCCCCCC Confidence 5555 599998888 No 77 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=100.00 E-value=9e-67 Score=382.67 Aligned_cols=382 Identities=12% Similarity=0.050 Sum_probs=268.1 Q ss_pred CchhhhhhcCC--c-ccc-cccccccchhhcccccCC-------cee--------chhhhhccHHHHHHHHHHHHhhhhC Q lcl|NC_018285. 
1 MPIFNLATESP--P-NNQ-GGFFDITDPEFLATLNGS-------EWV--------SAETALKNSDLFSIISQLSNDLATA 61 (383) Q Consensus 1 Mglf~~~~~~~--~-~~~-~~~~~~~~~~~~~~~~~~-------~~~--------~~~~a~~~~~v~~~i~~ia~~ia~~ 61 (383) =..|..+.++. . ... +.......|. .....++ .++ ..+....+|.|++||++||++||++ T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~a~~~p~-~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~ 110 (576) T protein:vir:96 32 QANIRNIEEKSKELNKSLYGKQQAYAEPF-LEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMY 110 (576) T ss_pred hHHHHHhhhhhhhhccccCCccchhhcce-eeeeecCCCccccCcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhh Confidence 22333331110 0 000 0000001111 0000000 000 0112235788999999999999973 Q ss_pred -------------ceeeecchh-----------------hhhccCCCcc-CCHHHHHHHHHHHHHHcCCeEEEEeec--C Q lcl|NC_018285. 62 -------------KLTTSRKQM-----------------QGIVDNPSNS-ANRFNFYQSIFAQMLLGGEAFAYRWRN--D 108 (383) Q Consensus 62 -------------p~~~~~~~~-----------------~~l~~~PN~~-~t~~~f~~~~~~~~~l~G~a~~~i~r~--~ 108 (383) ++++++.+. ..++..||++ +|+.+||+.++.+++++||||++++++ . T Consensus 111 ~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~ 190 (576) T protein:vir:96 111 CQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKN 190 (576) T ss_pred hhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCC Confidence 344443221 1123345555 589999999999999999999999854 4 Q ss_pred CCceeEEEEeccceeEEEEcCCCceeEEEEee-cCcccccceeecccceEEecc-CCCC--ccccCcchHHHHHHHHHHH Q lcl|NC_018285. 109 NGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTF-DDPRIPPKQHVPQSDILHFRL-LSVD--GGLTSVSPLMALGRELDIQ 184 (383) Q Consensus 109 ~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~-~~~~~~~~~~~~~~dvih~~~-~~~~--~~~~G~s~~~~~~~~i~~~ 184 (383) .|++++||||+|++|++..+.++....+...+ ....+.....++++||||++. ++++ .+.+|+||+.++..+|... 
T Consensus 191 ~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~dii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~ 270 (576) T protein:vir:96 191 ATTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREMAMGIRNPRTELSSSGYGLSEVEIAMKQFIAY 270 (576) T ss_pred CCceEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccceEEEeecCCCCcccCcccccHHHHHHHHHHHH Confidence 57899999999999999998877654432211 112234556789999987654 4333 2567999999999999999 Q ss_pred HHHHHHHHHHHhccCCcceeEeecC--CCCHHHHHHHHHHHHHh---hcCCcc-eeecCCCceeeecccChhhHHHHHHH Q lcl|NC_018285. 185 KASDKLTLNSLKNALNANGILKIKG--GGLLDFKTKVSRSRQAM---KQMQGG-PLVLDDLEDFTPLEIKSNVAQLLKQA 258 (383) Q Consensus 185 ~~~~~~~~~~~~ng~~~~~i~~~~~--~~~~e~~~~~~~~~~~~---~~~~g~-~~vl~~g~~~~~~~~~~~d~~~~e~~ 258 (383) .++++++.++|+||++|++||+.++ .+++++++++++.|+.. ..|+|+ ++|+++|++|++++.++.|+||+|++ T Consensus 271 ~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle~~ 350 (576) T protein:vir:96 271 NNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQFEKWL 350 (576) T ss_pred HHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhhHHHHHHH Confidence 9999999999999999999999876 46899999999999764 357788 58999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCHHHhcccc-------------cCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccch-hhh Q lcl|NC_018285. 259 DWTTGQFAKVYGIPENVVGGQG-------------DQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCDVDADIF-PAV 324 (383) Q Consensus 259 ~~~~~~Ia~~~gVpp~~lg~~~-------------~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~e~~~~-~~~ 324 (383) ++++++||++|||||++||... ++++++++.+.|+.+||+|+++.|+++|+++|+++++.+.. .+. 
T Consensus 351 ~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~f~ 430 (576) T protein:vir:96 351 TYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISEYSDKYVFQFV 430 (576) T ss_pred HHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhccCceEEEec Confidence 9999999999999999998632 45788999999999999999999999999999987654332 233 Q ss_pred ccCHHHHHHHHH--HHHhCCCcCHHHHHHHhhcCCcCCcchhHHhC----CCCCCC-CCCCCCCCC Q lcl|NC_018285. 325 DPTGANYISRIN--SMVKSGTLAQNQGLYILQQAEILPKELPKGEN----PNRTIL-KGGETNGQD 383 (383) Q Consensus 325 ~~~~~~~~~~~~--~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~----~~~~~~-~ggd~~~~d 383 (383) +.|...+.+.++ .++.+|++|+||+|+++|++|++++|...... .+.... ++.+.+.++ T Consensus 431 r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~ 496 (576) T protein:vir:96 431 GGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQK 496 (576) T ss_pred cCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCcccc Confidence 456555555444 34567999999999999999998888543211 000011 111111111 No 78 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=100.00 E-value=1.2e-66 Score=381.96 Aligned_cols=346 Identities=12% Similarity=0.030 Sum_probs=272.4 Q ss_pred hhh-hhccHHHHHHHHHHHHhhhhCceeeecchh----------------hhhccCCCccC--------CHHHHHHHHHH Q lcl|NC_018285. 38 AET-ALKNSDLFSIISQLSNDLATAKLTTSRKQM----------------QGIVDNPSNSA--------NRFNFYQSIFA 92 (383) Q Consensus 38 ~~~-a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~----------------~~l~~~PN~~~--------t~~~f~~~~~~ 92 (383) .+. +-.+++|++||++||++||++|++++.+.. ..+..+||+.| ++.+||+.++. 
T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 222 234799999999999999999999973321 13456688755 66789999999 Q ss_pred HHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-------ceeEEEE------------------eecCccccc Q lcl|NC_018285. 93 QMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-------NGLYYNV------------------TFDDPRIPP 147 (383) Q Consensus 93 ~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-------~~~~y~~------------------~~~~~~~~~ 147 (383) +++++||||++++|+.+|+|++|+||+|++|++..+... ...+|.+ ....+..+. T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVDADDGSTGT 160 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceeeeeeeeccccccc Confidence 999999999999999999999999999999998765432 1111111 111223445 Q ss_pred ceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHHHHHHHHHHHHHh Q lcl|NC_018285. 148 KQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTKVSRSRQAM 226 (383) Q Consensus 148 ~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e~~~~~~~~~~~~ 226 (383) .+.++++||||+|.+++.+.++|+||+.++..++....++++++.++|+||+.|++++++++ .+++++++++++.|... 
T Consensus 161 ~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~ 240 (467) T protein:vir:31 161 SVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDN 240 (467) T ss_pred eeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhh Confidence 67899999999999988888999999999999999999999999999999999999999865 68999999999888653 Q ss_pred h--------------cCCcceeecCCCceeeec-------c-cChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc--c- Q lcl|NC_018285. 227 K--------------QMQGGPLVLDDLEDFTPL-------E-IKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG--D- 281 (383) Q Consensus 227 ~--------------~~~g~~~vl~~g~~~~~~-------~-~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~--~- 281 (383) . .+++++++++.|++++++ + .+++|+||++++++++++||++|||||.+||... + T Consensus 241 ~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~ 320 (467) T protein:vir:31 241 NEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGAF 320 (467) T ss_pred hcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCc Confidence 3 466778888877655544 3 3678999999999999999999999999999632 3 Q ss_pred CcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch--------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_018285. 282 QQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCD--------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYIL 353 (383) Q Consensus 282 ~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~--------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~l 353 (383) +++.+++.+.|++.||.|+++.|+++||++|++. 
++|+.....+.|...++..+..++++|++|+||+|+++ T Consensus 321 ~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~ 400 (467) T protein:vir:31 321 STDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEF 400 (467) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 3567888999999999999999999999999864 56788888889999999999999999999999999999 Q ss_pred hcCCcCCcchhHHh------CCCCCCC--------CCCCCCCCC Q lcl|NC_018285. 354 QQAEILPKELPKGE------NPNRTIL--------KGGETNGQD 383 (383) Q Consensus 354 g~~~~~~~d~~~~~------~~~~~~~--------~ggd~~~~d 383 (383) |++|+++++..... +.+..+. +++++..++ T Consensus 401 Gl~pi~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (467) T protein:vir:31 401 GFEPFPEEHVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDRADE 444 (467) T ss_pred CCCCCCcccccCCcccccccccccCCCCcccCcCCCCCCCcccc Confidence 99998654421100 0011111 111111111 No 79 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=100.00 E-value=1.6e-66 Score=381.28 Aligned_cols=378 Identities=10% Similarity=0.010 Sum_probs=269.9 Q ss_pred CchhhhhhcCCcccccc---------c-------ccccchhhcccccCCcee--chhhhhccHHHHHHHHHHHHhhhhCc Q lcl|NC_018285. 
1 MPIFNLATESPPNNQGG---------F-------FDITDPEFLATLNGSEWV--SAETALKNSDLFSIISQLSNDLATAK 62 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~---------~-------~~~~~~~~~~~~~~~~~~--~~~~a~~~~~v~~~i~~ia~~ia~~p 62 (383) |.+...-..++....+- + ...+-..++..+.+...+ .-+++....++++||.++++.++++| T Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~ 113 (535) T protein:vir:10 34 KAIRPGRASARDTVDGIDIADGNVAGQYSVASISDVLSTKKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFK 113 (535) T ss_pred hhhhhhhhhhhccccccccccCCcccccccCccccccCHHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcce Confidence 22221111110000000 0 000001111111111111 22455677889999999999999999 Q ss_pred eeeecch--------------hhhhccCCCccCCHHH----HHHHHHHHHHHc-CCeEEEEeecCCCceeEEEEecccee Q lcl|NC_018285. 63 LTTSRKQ--------------MQGIVDNPSNSANRFN----FYQSIFAQMLLG-GEAFAYRWRNDNGRDMKWEYLRPSQV 123 (383) Q Consensus 63 ~~~~~~~--------------~~~l~~~PN~~~t~~~----f~~~~~~~~~l~-G~a~~~i~r~~~g~~~~l~~l~~~~v 123 (383) +++++.+ .+.|..+||++|++++ |+++++.+++++ |++|++|+|+..|+|++||||+|++| T Consensus 114 i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V 193 (535) T protein:vir:10 114 VELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKV 193 (535) T ss_pred eEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCcee Confidence 9998643 1347789999998875 556667776655 57999999999999999999999999 Q ss_pred EEEEcCCCc---eeEEEEeecCcccccceeecccceEEeccCCCC---ccccCcchHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_018285. 124 SFNRLDNQN---GLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVD---GGLTSVSPLMALGRELDIQKASDKLTLNSLKN 197 (383) Q Consensus 124 ~~~~~~~~~---~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~---~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 197 (383) ++..+.++. ..+|.+. . ++....|+++||||+++++.. 
++.+|+||+.++..+|....++++++.++|+| T Consensus 194 ~v~~d~~~~~~~~~~~~~~-~---~~~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~n 269 (535) T protein:vir:10 194 VISYSPRSKDQPRKFEQFV-S---ETKSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQ 269 (535) T ss_pred EEEEcCccccCceEEEEEe-c---CceeEEECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 998765443 3333332 2 234567999999999976543 35679999999999999999999999999999 Q ss_pred cCCcceeEeecCC----CCHHHHHHHHHHHHHh---hcCCcceeec-CCCceeeecccChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 198 ALNANGILKIKGG----GLLDFKTKVSRSRQAM---KQMQGGPLVL-DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVY 269 (383) Q Consensus 198 g~~~~~i~~~~~~----~~~e~~~~~~~~~~~~---~~~~g~~~vl-~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~ 269 (383) |++|+++|++++. +++++++++++.|+.. ..|+|+++|+ ++|++|++++.++.|+||+|++++++++||++| T Consensus 270 g~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~af 349 (535) T protein:vir:10 270 GGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIF 349 (535) T ss_pred cCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHh Confidence 9999999999764 7789999999998654 4577887665 579999999999999999999999999999999 Q ss_pred cCCHHHhcccc--cC------------cCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhh----ccchhhhccCHHHH Q lcl|NC_018285. 270 GIPENVVGGQG--DQ------------QSSLEMSSNVYSKAVARYLRPFLSELSQKLSCDVD----ADIFPAVDPTGANY 331 (383) Q Consensus 270 gVpp~~lg~~~--~~------------~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~e----~~~~~~~~~~~~~~ 331 (383) ||||++||... 
++ ++.+++...|++.||.|+++.|+++||++|+++++ |+.....+.+...+ T Consensus 350 gVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~~~~~~f~f~~l~~~d~~~r 429 (535) T protein:vir:10 350 QMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMRYVDTDYRFSFTLGDAQDKLQE 429 (535) T ss_pred CCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccCCeEEEEeccccccCHHHH Confidence 99999999632 22 23456667899999999999999999999998644 44445556666666 Q ss_pred HHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhC----C--------CCCCC----CCCCCCCCC Q lcl|NC_018285. 332 ISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGEN----P--------NRTIL----KGGETNGQD 383 (383) Q Consensus 332 ~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~----~--------~~~~~----~ggd~~~~d 383 (383) ...+. +..+|++|+||+|+++|++|++++|++.... + +..|. .|...++++ T Consensus 430 ~~~~~-~~~~g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 496 (535) T protein:vir:10 430 EQVWK-LKLANGYFINEYRKDHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSSDDSGSTLGERE 496 (535) T ss_pred HHHHH-HHHcCCCCHHHHHHHhCCCCCCCccccccccchhhcccccccccccCCCCCCCccccCCccc Confidence 55554 4456779999999999999999888654211 0 00010 111111111 No 80 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=100.00 E-value=4.3e-65 Score=373.48 Aligned_cols=326 Identities=16% Similarity=0.147 Sum_probs=247.7 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch----------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ----------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~----------- 69 (383) ||||+++.+....... .+......+ .+...+++.++|++||++||++||++|+++++.. 
T Consensus 1 Mg~f~~~~~~~~~~~~-----~~~~~~~~~-----~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~ 70 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLN-----NDTQRVTAW-----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLIS 70 (378) T ss_pred Cccchhhhhhhccccc-----CCcceeeec-----ccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccccccccc Confidence 9999998642211110 011111111 2334567889999999999999999999987431 Q ss_pred ------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-CceeEEEEeccceeEEEEcCCCceeEEEEeecC Q lcl|NC_018285. 70 ------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-GRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDD 142 (383) Q Consensus 70 ------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~ 142 (383) .+.|+.+||++||+++||+.++.+++++||||++++|+.. |.+..++|.. T Consensus 71 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l~~~~----------------------- 127 (378) T protein:vir:16 71 MAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFAD----------------------- 127 (378) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecC----------------------- Confidence 2346679999999999999999999999999999988754 5555544321 Q ss_pred cccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHH Q lcl|NC_018285. 143 PRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRS 222 (383) Q Consensus 143 ~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~ 222 (383) ..+.|+++||||+|++ .+...|.|++..+...+. ..+++ +.++++++.++.+++++.+++++. T Consensus 128 ----~~~~~~~~diih~r~~--~~~~~~~s~l~~~~~~i~----------~~~~~-~~~~g~l~~~~~l~~~~~~~~~~~ 190 (378) T protein:vir:16 128 ----DKKEYKPEELVRLTSP--FYINEDTSILDNALASIQ----------TKLEQ-GKLRGLLKINAFLDIDNTQEYREK 190 (378) T ss_pred ----CeeEecccceEEecCc--cCccchhHHHHHHHHHHH----------HHHhc-CccceeeEeCCcCCHHHHHHHHHH Confidence 1235778999999964 334568888887776553 23444 468999999999988876666555 Q ss_pred HHH------hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHH Q lcl|NC_018285. 
223 RQA------MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKA 296 (383) Q Consensus 223 ~~~------~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~ 296 (383) |.. +.+++|+++||++|++|++++.++.++|+. .+++++++||++|||||.+|++ ++++++..+|+.+| T Consensus 191 ~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~-~~~~~~~~Ia~~fgVPp~~l~g----~~~e~~~~~f~~~t 265 (378) T protein:vir:16 191 ALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKD-EIDLIKSELLTGYFMNENILLG----TASQEQQIYFYNST 265 (378) T ss_pred HHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHH-HHHHHHHHHHHHhCCCHHHhcC----CchHHHHHHHHHHH Confidence 533 334778999999999999999999999975 5679999999999999999964 34578889999999 Q ss_pred HHHHHHHHHHHHHHhhcch--------------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc Q lcl|NC_018285. 297 VARYLRPFLSELSQKLSCD--------------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE 362 (383) Q Consensus 297 l~P~~~~i~~~l~~~l~~~--------------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d 362 (383) |.|++++|+++|+++|+++ ++|+...+.+.|..++++.+.+++++|++|+||+|+++|++|++++| T Consensus 266 l~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD 345 (378) T protein:vir:16 266 IIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred HHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 9999999999999999975 45677778889999999999999999999999999999999998877 Q ss_pred hhHHhCCCCCC--------------CCCCCCCCC Q lcl|NC_018285. 363 LPKGENPNRTI--------------LKGGETNGQ 382 (383) Q Consensus 363 ~~~~~~~~~~~--------------~~ggd~~~~ 382 (383) ..... 
.|..+ .+|+|++++ T Consensus 346 ~~~~~-~n~~~~~~~~~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 346 VYIAN-LNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred eEeec-cccccccchhhhcCccCCCCCCCCCCCC Confidence 44321 12221 234444444 No 81 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=100.00 E-value=2.2e-64 Score=369.59 Aligned_cols=379 Identities=12% Similarity=0.070 Sum_probs=267.7 Q ss_pred CchhhhhhcCCc-ccccccccccc--hhhcccccCCc-eec----hhhhhccHHHHHHHHHHHHhhhh------------ Q lcl|NC_018285. 1 MPIFNLATESPP-NNQGGFFDITD--PEFLATLNGSE-WVS----AETALKNSDLFSIISQLSNDLAT------------ 60 (383) Q Consensus 1 Mglf~~~~~~~~-~~~~~~~~~~~--~~~~~~~~~~~-~~~----~~~a~~~~~v~~~i~~ia~~ia~------------ 60 (383) +.++.+..++.. +....+....+ ..+........ ..+ .+..-.++.|.+||+.+|+.||. T Consensus 43 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~ 122 (563) T protein:vir:99 43 YQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGL 122 (563) T ss_pred HHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccc Confidence 555555432211 11111111100 01111000000 011 11222367788999999988885 Q ss_pred -Cceeeecchh-------------hhhc----cCCCcc-CCHHHHHHHHHHHHHHcCCeEEEEe--ecCCCceeEEEEec Q lcl|NC_018285. 61 -AKLTTSRKQM-------------QGIV----DNPSNS-ANRFNFYQSIFAQMLLGGEAFAYRW--RNDNGRDMKWEYLR 119 (383) Q Consensus 61 -~p~~~~~~~~-------------~~l~----~~PN~~-~t~~~f~~~~~~~~~l~G~a~~~i~--r~~~g~~~~l~~l~ 119 (383) +|+++++.+. 
..++ ..||++ +|+.+|+++++.+++++||+|++++ |+..|+|++||||+ T Consensus 123 ~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~ 202 (563) T protein:vir:99 123 GFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVD 202 (563) T ss_pred cceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeC Confidence 5777665331 0111 223443 5889999999999999999999866 77889999999999 Q ss_pred cceeEEEEcCCCceeE----EEEeecCcccccceeecccceEEe-ccCCCC--ccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 120 PSQVSFNRLDNQNGLY----YNVTFDDPRIPPKQHVPQSDILHF-RLLSVD--GGLTSVSPLMALGRELDIQKASDKLTL 192 (383) Q Consensus 120 ~~~v~~~~~~~~~~~~----y~~~~~~~~~~~~~~~~~~dvih~-~~~~~~--~~~~G~s~~~~~~~~i~~~~~~~~~~~ 192 (383) |++|++..+.++.... |.+...+ .....++++|+||+ ++++++ .+.+|+||+.++..+|....++++++. T Consensus 203 p~~V~v~~~~~g~~~~~~~~y~~~~~g---~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~ 279 (563) T protein:vir:99 203 PSTIFYATDKKGKIIKGGKRFVQVVDK---RVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFND 279 (563) T ss_pred CceeEEEECCCCceeccceeEEEEeCC---ceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHH Confidence 9999999887765432 3333322 33557889997755 455443 356799999999999999999999999 Q ss_pred HHHhccCCcceeEeecCC--CCHHHHHHHHHHHHHhh---cCCcce-eecCCCceeeecccChhhHHHHHHHHHHHHHHH Q lcl|NC_018285. 193 NSLKNALNANGILKIKGG--GLLDFKTKVSRSRQAMK---QMQGGP-LVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFA 266 (383) Q Consensus 193 ~~~~ng~~~~~i~~~~~~--~~~e~~~~~~~~~~~~~---~~~g~~-~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia 266 (383) ++|+||+.|+++|++++. +++++++++++.|.... 
.|+|++ +|+++|++|++++.+++|+||+|++++++++|| T Consensus 280 ~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia 359 (563) T protein:vir:99 280 RFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIIS 359 (563) T ss_pred HHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHH Confidence 999999999999998764 68999999999997643 577885 789999999999999999999999999999999 Q ss_pred HHhcCCHHHhcccc-------------cCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccch-hhhccCHHHHH Q lcl|NC_018285. 267 KVYGIPENVVGGQG-------------DQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCDVDADIF-PAVDPTGANYI 332 (383) Q Consensus 267 ~~~gVpp~~lg~~~-------------~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~e~~~~-~~~~~~~~~~~ 332 (383) ++|||||++||... ++++.+++.+.|+..||.|+++.|+++||++|+++++.+.. .+.+.|...+. T Consensus 360 ~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~f~r~D~~~~~ 439 (563) T protein:vir:99 360 ALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYGDKYTFQFVGGDTKSAT 439 (563) T ss_pred HHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcccccEEEeccCCHHHHH Confidence 99999999998531 23577888899999999999999999999999987553322 23345555554 Q ss_pred HH--HHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCC---------------------CCCCCCCCCC Q lcl|NC_018285. 333 SR--INSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTI---------------------LKGGETNGQD 383 (383) Q Consensus 333 ~~--~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~---------------------~~ggd~~~~d 383 (383) +. +..++.+|++|+||+|+++|++|++++|...... 
+..+ .++++.++.+ T Consensus 440 e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (563) T protein:vir:99 440 DKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDAS-FLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDND 512 (563) T ss_pred HHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeeccc-ccccccccccccCCCccccchhhhhcccccCCCCC Confidence 43 3457889999999999999999999887543211 1000 0011111111 No 82 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=100.00 E-value=2.2e-64 Score=369.59 Aligned_cols=379 Identities=12% Similarity=0.070 Sum_probs=267.7 Q ss_pred CchhhhhhcCCc-ccccccccccc--hhhcccccCCc-eec----hhhhhccHHHHHHHHHHHHhhhh------------ Q lcl|NC_018285. 1 MPIFNLATESPP-NNQGGFFDITD--PEFLATLNGSE-WVS----AETALKNSDLFSIISQLSNDLAT------------ 60 (383) Q Consensus 1 Mglf~~~~~~~~-~~~~~~~~~~~--~~~~~~~~~~~-~~~----~~~a~~~~~v~~~i~~ia~~ia~------------ 60 (383) +.++.+..++.. +....+....+ ..+........ ..+ .+..-.++.|.+||+.+|+.||. T Consensus 43 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~ 122 (563) T protein:vir:95 43 YQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGL 122 (563) T ss_pred HHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccc Confidence 555555432211 11111111100 01111000000 011 11222367788999999988885 Q ss_pred -Cceeeecchh-------------hhhc----cCCCcc-CCHHHHHHHHHHHHHHcCCeEEEEe--ecCCCceeEEEEec Q lcl|NC_018285. 61 -AKLTTSRKQM-------------QGIV----DNPSNS-ANRFNFYQSIFAQMLLGGEAFAYRW--RNDNGRDMKWEYLR 119 (383) Q Consensus 61 -~p~~~~~~~~-------------~~l~----~~PN~~-~t~~~f~~~~~~~~~l~G~a~~~i~--r~~~g~~~~l~~l~ 119 (383) +|+++++.+. 
..++ ..||++ +|+.+|+++++.+++++||+|++++ |+..|+|++||||+ T Consensus 123 ~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~ 202 (563) T protein:vir:95 123 GFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVD 202 (563) T ss_pred cceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeC Confidence 5777665331 0111 223443 5889999999999999999999866 77889999999999 Q ss_pred cceeEEEEcCCCceeE----EEEeecCcccccceeecccceEEe-ccCCCC--ccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 120 PSQVSFNRLDNQNGLY----YNVTFDDPRIPPKQHVPQSDILHF-RLLSVD--GGLTSVSPLMALGRELDIQKASDKLTL 192 (383) Q Consensus 120 ~~~v~~~~~~~~~~~~----y~~~~~~~~~~~~~~~~~~dvih~-~~~~~~--~~~~G~s~~~~~~~~i~~~~~~~~~~~ 192 (383) |++|++..+.++.... |.+...+ .....++++|+||+ ++++++ .+.+|+||+.++..+|....++++++. T Consensus 203 p~~V~v~~~~~g~~~~~~~~y~~~~~g---~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~ 279 (563) T protein:vir:95 203 PSTIFYATDKKGKIIKGGKRFVQVVDK---RVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFND 279 (563) T ss_pred CceeEEEECCCCceeccceeEEEEeCC---ceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHH Confidence 9999999887765432 3333322 33557889997755 455443 356799999999999999999999999 Q ss_pred HHHhccCCcceeEeecCC--CCHHHHHHHHHHHHHhh---cCCcce-eecCCCceeeecccChhhHHHHHHHHHHHHHHH Q lcl|NC_018285. 193 NSLKNALNANGILKIKGG--GLLDFKTKVSRSRQAMK---QMQGGP-LVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFA 266 (383) Q Consensus 193 ~~~~ng~~~~~i~~~~~~--~~~e~~~~~~~~~~~~~---~~~g~~-~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia 266 (383) ++|+||+.|+++|++++. +++++++++++.|.... 
.|+|++ +|+++|++|++++.+++|+||+|++++++++|| T Consensus 280 ~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia 359 (563) T protein:vir:95 280 RFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIIS 359 (563) T ss_pred HHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHH Confidence 999999999999998764 68999999999997643 577885 789999999999999999999999999999999 Q ss_pred HHhcCCHHHhcccc-------------cCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccch-hhhccCHHHHH Q lcl|NC_018285. 267 KVYGIPENVVGGQG-------------DQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCDVDADIF-PAVDPTGANYI 332 (383) Q Consensus 267 ~~~gVpp~~lg~~~-------------~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~e~~~~-~~~~~~~~~~~ 332 (383) ++|||||++||... ++++.+++.+.|+..||.|+++.|+++||++|+++++.+.. .+.+.|...+. T Consensus 360 ~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~f~r~D~~~~~ 439 (563) T protein:vir:95 360 ALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYGDKYTFQFVGGDTKSAT 439 (563) T ss_pred HHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcccccEEEeccCCHHHHH Confidence 99999999998531 23577888899999999999999999999999987553322 23345555554 Q ss_pred HH--HHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCC---------------------CCCCCCCCCC Q lcl|NC_018285. 333 SR--INSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTI---------------------LKGGETNGQD 383 (383) Q Consensus 333 ~~--~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~---------------------~~ggd~~~~d 383 (383) +. +..++.+|++|+||+|+++|++|++++|...... 
+..+ .++++.++.+ T Consensus 440 e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (563) T protein:vir:95 440 DKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDAS-FLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDND 512 (563) T ss_pred HHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeeccc-ccccccccccccCCCccccchhhhhcccccCCCCC Confidence 43 3457889999999999999999999887543211 1000 0011111111 No 83 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=100.00 E-value=7.5e-65 Score=372.15 Aligned_cols=326 Identities=16% Similarity=0.157 Sum_probs=248.1 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch----------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ----------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~----------- 69 (383) ||||++++.-...... .+......+ .+...+++.++|++||++||++||++|+++++.. T Consensus 1 Mg~f~~~~~f~~~~~~-----~~~~~~~~~-----~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~ 70 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLN-----NDTQRVTAW-----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLIS 70 (378) T ss_pred CccchhhhhhhccccC-----CCcceeeec-----ccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccccccccccc Confidence 9999998642211110 011111111 2334567888999999999999999999987532 Q ss_pred ------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-CceeEEEEeccceeEEEEcCCCceeEEEEeecC Q lcl|NC_018285. 70 ------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-GRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDD 142 (383) Q Consensus 70 ------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~ 142 (383) .+.|+.+||++||+++||+.++.+++++||||++++++.. |++..++|.. 
T Consensus 71 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~l~~~~----------------------- 127 (378) T protein:vir:93 71 MAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFAD----------------------- 127 (378) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecC----------------------- Confidence 2345679999999999999999999999999999887643 5555554321 Q ss_pred cccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHH Q lcl|NC_018285. 143 PRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRS 222 (383) Q Consensus 143 ~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~ 222 (383) ..+.++++||||+|++ .+...|.|++..+...+. .++++| .++++++.++.+++++.+++++. T Consensus 128 ----~~~~~~~~diih~r~~--~~~~~~~s~l~~~~~~i~----------~~~~~~-~~~g~l~~~~~l~~~~~~~~~~~ 190 (378) T protein:vir:93 128 ----DKKEYKTEELVRLTSP--FYINEDTSILDNALASIQ----------TKLEQG-KLRGLLKINAFLDIDNTQEYREK 190 (378) T ss_pred ----CeeEeccceeEEecCc--cccchhhHHHHHHHHHHH----------HHHhcC-cccceeeeCCcCCHHHHHHHHHH Confidence 1235788999999964 344568888887766553 345555 58999999999998877766665 Q ss_pred HHH------hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHH Q lcl|NC_018285. 223 RQA------MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKA 296 (383) Q Consensus 223 ~~~------~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~ 296 (383) |.. 
+..++|+++++++|++|++++.++.|+|+ +.+++++++||++|||||.+|++ ++++++..+|+.+| T Consensus 191 ~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~g----~~~e~~~~~f~~~t 265 (378) T protein:vir:93 191 ALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG----TATQEQQIYFYNST 265 (378) T ss_pred HHHHHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC----CcHHHHHHHHHHHH Confidence 543 23467889999999999999999999997 66789999999999999999964 34578889999999 Q ss_pred HHHHHHHHHHHHHHhhcch--------------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc Q lcl|NC_018285. 297 VARYLRPFLSELSQKLSCD--------------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE 362 (383) Q Consensus 297 l~P~~~~i~~~l~~~l~~~--------------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d 362 (383) |.|++++|+++|+++|+++ ++||...+.+.|..++++.+.+++++|++|+||+|+++|++|++++| T Consensus 266 l~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD 345 (378) T protein:vir:93 266 IIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred HHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 9999999999999999975 45677788889999999999999999999999999999999999887 Q ss_pred hhHHhCCCCCC---------CC-----CCCCCCC Q lcl|NC_018285. 363 LPKGENPNRTI---------LK-----GGETNGQ 382 (383) Q Consensus 363 ~~~~~~~~~~~---------~~-----ggd~~~~ 382 (383) ..... 
.|..| .+ ++|++++ T Consensus 346 ~~~~~-~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 346 VYIAN-LNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred eeeec-cccccccchhhhcCccCCCCCCCCCCCC Confidence 54432 22222 12 2222222 No 84 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=100.00 E-value=1.2e-64 Score=370.96 Aligned_cols=326 Identities=16% Similarity=0.147 Sum_probs=248.7 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch----------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ----------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~----------- 69 (383) ||||+++.+........ +......+ .+...+++.++|++||++||++||++|+++++.. T Consensus 1 Mg~f~~~~~~~~~~~~~-----~~~~~~~~-----~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~~~ 70 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNN-----DTQRVTAW-----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLIS 70 (378) T ss_pred CCccccchhcccccccC-----Ccceeeee-----ccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccccc Confidence 99999985422111111 11111111 2233567888999999999999999999986422 Q ss_pred ------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecC-CCceeEEEEeccceeEEEEcCCCceeEEEEeecC Q lcl|NC_018285. 70 ------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRND-NGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDD 142 (383) Q Consensus 70 ------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~-~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~ 142 (383) ++.|+.+||++||+++||+.++.+++++||||++++++. .|+++.++|.. 
T Consensus 71 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l~p~~----------------------- 127 (378) T protein:vir:94 71 MAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDLLFAD----------------------- 127 (378) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEEEecC----------------------- Confidence 234667899999999999999999999999999987654 57776665521 Q ss_pred cccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHH Q lcl|NC_018285. 143 PRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRS 222 (383) Q Consensus 143 ~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~ 222 (383) ..+.++++||||++++ .+...|.||+..+...+.. .+++ +.++++++.++.+++++.+++++. T Consensus 128 ----~~~~~~~~diiH~~~~--~~~~~g~s~l~~~~~~i~~----------~~~~-~~~~gil~~~~~l~~~~~~~~~~~ 190 (378) T protein:vir:94 128 ----DKKEYKPEELVRLTSP--FYINEDTSILDNALASIQT----------KLEQ-GKLRGLLKINAFLDIDNTQEYREK 190 (378) T ss_pred ----CeeEeeeeeeEEecCc--CCccchhHHHHHHHHHHHH----------HHhc-ccccceeeeCCcCCHHHHHHHHHH Confidence 1235778999999964 3446799999888876643 2344 458999999999998877666655 Q ss_pred HHH------hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHH Q lcl|NC_018285. 223 RQA------MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKA 296 (383) Q Consensus 223 ~~~------~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~ 296 (383) |.. 
+..++|+++||++|++|++++.++.++|+ +.++++.++||++|||||.+|++ ++++++..+|+.+| T Consensus 191 ~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVP~~~l~~----~~se~~~~~f~~~t 265 (378) T protein:vir:94 191 ALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG----TASQEQQIYFYNST 265 (378) T ss_pred HHHHHHHhhcccccccceecCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC----ChHHHHHHHHHHHH Confidence 532 23567889999999999999999999997 56789999999999999999964 34578889999999 Q ss_pred HHHHHHHHHHHHHHhhcch--------------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc Q lcl|NC_018285. 297 VARYLRPFLSELSQKLSCD--------------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE 362 (383) Q Consensus 297 l~P~~~~i~~~l~~~l~~~--------------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d 362 (383) |.|++++|+++|+++|+++ ++|+...+.+.|..++++.+.+++++|++|+||+|+++|++|++++| T Consensus 266 L~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD 345 (378) T protein:vir:94 266 IIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred HHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 9999999999999999975 45777778889999999999999999999999999999999999887 Q ss_pred hhHHhCCCCCC--------------CCCCCCCCC Q lcl|NC_018285. 363 LPKGENPNRTI--------------LKGGETNGQ 382 (383) Q Consensus 363 ~~~~~~~~~~~--------------~~ggd~~~~ 382 (383) ..... 
+|..| .+++|++++ T Consensus 346 ~~~~~-~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 346 VYIAN-LNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred eeeec-ccccccccchhhcCCcCCCCCCCCCCCC Confidence 54332 12222 123333333 No 85 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=100.00 E-value=2.8e-63 Score=363.49 Aligned_cols=326 Identities=16% Similarity=0.160 Sum_probs=244.7 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch----------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ----------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~----------- 69 (383) ||||+++..-... ... .+.......+....+++.++|++||++||++||++|+++++.. T Consensus 1 M~if~~~~~~~~~---~~~-------~~~~~~~~~~~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~ 70 (378) T protein:vir:94 1 MNLFGKVVSFSRG---KLN-------NDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLIS 70 (378) T ss_pred CchhHHhHhhhhc---ccc-------cCcceeeeeecchhhhhhHHHHHHHHHHHHhHhhCceeeeeecccccccccccc Confidence 9999998742211 000 0111122234555678889999999999999999999987532 Q ss_pred ------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEe-ecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecC Q lcl|NC_018285. 70 ------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRW-RNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDD 142 (383) Q Consensus 70 ------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~-r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~ 142 (383) .+.|+.+||++||+++||+.++.+++++||||++++ ++.+|.+..+++. 
.+ T Consensus 71 ~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~----------------------~~ 128 (378) T protein:vir:94 71 MAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDLLFA----------------------ND 128 (378) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEEEEe----------------------cC Confidence 245678999999999999999999999999999854 4556665544332 11 Q ss_pred cccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHH Q lcl|NC_018285. 143 PRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRS 222 (383) Q Consensus 143 ~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~ 222 (383) .+.++++||+|++.+...+ .+.+++..+...+. ..+++ +.++++++.++.+++++.+++++. T Consensus 129 -----~~~~~~~dvih~~~~~~~~--~~~~~~~~~~~~~~----------~~~~~-~~~~g~l~~~~~l~~~~~~~~~e~ 190 (378) T protein:vir:94 129 -----KKEYKPEELVRLTSPFYIN--EDTSILDNALASIQ----------TKLEQ-GKLRGLLKINAFLDIDNTQEYREK 190 (378) T ss_pred -----cEEechhceeeecCcCCcc--cchhHHHHHHHHHH----------HHHhh-CCcccceeeCCcCCHHHHHHHHHH Confidence 2458889999999665433 24566665554332 23334 468899999999988776665554 Q ss_pred HHH------hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHH Q lcl|NC_018285. 223 RQA------MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKA 296 (383) Q Consensus 223 ~~~------~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~ 296 (383) |.. ...++|+++||++|++|++++.++.++|+ +.++++.++||++|||||++|++. 
.++++..+|+.+| T Consensus 191 ~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgvPp~~l~g~----~~e~~~~~f~~~t 265 (378) T protein:vir:94 191 ALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLGT----ATQEQQIYFYNST 265 (378) T ss_pred HHHHHHHhhcccccccceeccCCceEEEccCChHHhhH-HHHHHHHHHHHHHhCCCHHHhcCC----chHHHHHHHHHHH Confidence 432 33567889999999999999999999996 667899999999999999999643 3367788999999 Q ss_pred HHHHHHHHHHHHHHhhcch--------------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc Q lcl|NC_018285. 297 VARYLRPFLSELSQKLSCD--------------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE 362 (383) Q Consensus 297 l~P~~~~i~~~l~~~l~~~--------------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d 362 (383) |.|++++|+++|+++|+++ ++|+...+.+.|..++++.+.+++++|++|+||+|+++|++|++++| T Consensus 266 l~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~ggd 345 (378) T protein:vir:94 266 IIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred HHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 9999999999999999864 34677788889999999999999999999999999999999998877 Q ss_pred hhHHhCCCCCC--------------CCCCCCCCC Q lcl|NC_018285. 363 LPKGENPNRTI--------------LKGGETNGQ 382 (383) Q Consensus 363 ~~~~~~~~~~~--------------~~ggd~~~~ 382 (383) ..... 
.|..+ .+|+|++++ T Consensus 346 ~~~~~-~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 346 VYIAN-LNAVAVKNLSDLQGNRKDVTSTDETNNQ 378 (378) T ss_pred eeeec-ccccchhcchhcccccCCCCCCCCCCCC Confidence 54432 22222 234444444 No 86 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=100.00 E-value=3.9e-62 Score=357.23 Aligned_cols=327 Identities=14% Similarity=0.141 Sum_probs=240.3 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch----------- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ----------- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~----------- 69 (383) ||||+++........ . .+.......++...+++.++|++||++||++||++|+++++.. T Consensus 1 M~~f~k~~~~~~~~~---~-------~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~~ 70 (378) T protein:vir:85 1 MNLFGKVVSFSRGKL---N-------NDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLIS 70 (378) T ss_pred Cchhhhhhhhhhccc---c-------cCCcceeeeeccchhhhhHHHHHHHHHHHHhHhhCceeEEEEeccccccccccc Confidence 999998753211100 0 0001112234455678999999999999999999999987532 Q ss_pred ------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEe-ecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecC Q lcl|NC_018285. 70 ------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRW-RNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDD 142 (383) Q Consensus 70 ------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~-r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~ 142 (383) .+.|+.+||++||+++||+.++.+++++||||++++ ++.+|.+..+++. 
.+ T Consensus 71 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~----------------------~~ 128 (378) T protein:vir:85 71 MAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLLFA----------------------ND 128 (378) T ss_pred cccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEEec----------------------CC Confidence 234678999999999999999999999999999865 4555655433321 11 Q ss_pred cccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHH Q lcl|NC_018285. 143 PRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRS 222 (383) Q Consensus 143 ~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~ 222 (383) .+.+.++||||++.+... ..+.+.+..+...+ ...+++ +.|+++++.++.+++++.+++++. T Consensus 129 -----~~~~~~~dvih~~~~~~~--~~~~~~~~~a~~~~----------~~~~~~-~~~~g~l~~~~~l~~~~~~~~~~~ 190 (378) T protein:vir:85 129 -----KKEYKPEELVRLVSPFYI--NEDTSILDNALASI----------QTKLEQ-GKLRGLLKINAFLDIDNTQEYREK 190 (378) T ss_pred -----CEEEcccceEEEecCcCc--cchhhHHHHHHHHH----------HHHHhc-CCcceEEEeCCcCCHHHHHHHHHH Confidence 235778999999854222 22334444333322 233444 478999999999998887777666 Q ss_pred HHH------hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHH Q lcl|NC_018285. 223 RQA------MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKA 296 (383) Q Consensus 223 ~~~------~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~ 296 (383) |.. 
+..++|+++||++|++|++++.++.++++ +.++++.++||++|||||++|++ ++++++..+|+.+| T Consensus 191 ~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~~----s~~e~~~~~f~~~t 265 (378) T protein:vir:85 191 ALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIELIKSELLTGYFMNENILLG----TATQEQQIYFYNST 265 (378) T ss_pred HHHHHHHhhcccccccceecCCCceEEeccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC----CchHHHHHHHHHHH Confidence 533 33577899999999999999999999996 66789999999999999999964 34577888999999 Q ss_pred HHHHHHHHHHHHHHhhcch--------------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc Q lcl|NC_018285. 297 VARYLRPFLSELSQKLSCD--------------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE 362 (383) Q Consensus 297 l~P~~~~i~~~l~~~l~~~--------------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d 362 (383) |.|++++|+++|+++|+++ ++|+.....+.|..++++.+.+++++|++|+||+|+++|++|++++| T Consensus 266 L~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~gGD 345 (378) T protein:vir:85 266 IIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred HHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 9999999999999999874 34666777888999999999999999999999999999999999887 Q ss_pred hhHHhCCCCCCC-------------CCCCCCCCC Q lcl|NC_018285. 363 LPKGENPNRTIL-------------KGGETNGQD 383 (383) Q Consensus 363 ~~~~~~~~~~~~-------------~ggd~~~~d 383 (383) ..... 
.|..++ +++|.+.++ T Consensus 346 ~~~~~-~N~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:85 346 IYIAN-LNAVAVKNLSDLQGSRKDVASTDETNNQ 378 (378) T ss_pred eEeec-ccccccccchhhcCccCCCCCCCCCCCC Confidence 54321 122221 112222222 No 87 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=100.00 E-value=3e-62 Score=357.88 Aligned_cols=383 Identities=13% Similarity=0.070 Sum_probs=284.6 Q ss_pred CchhhhhhcCCccc------------ccccccccchhhcccccCCc----eechhhhhc-cHHHHHHHHHHHHhhhhCce Q lcl|NC_018285. 1 MPIFNLATESPPNN------------QGGFFDITDPEFLATLNGSE----WVSAETALK-NSDLFSIISQLSNDLATAKL 63 (383) Q Consensus 1 Mglf~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~----~~~~~~a~~-~~~v~~~i~~ia~~ia~~p~ 63 (383) |.=-++-.+....- .....-+....+....+.-. +......+. ++++++||+.++++||++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~L~~~~e~~~~~~~~i~~~~~~iag~g~ 80 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAKSPNSTQIPDHRIQSHNVGVNPPYNPDRLAAFLELNETLATGIRKKSRYEVGFGF 80 (651) T ss_pred CCCccceeeeeEEEeecccccccccccccccccchhhhcccCCCCCCCCCHHHHHHHHhcChHHHHHHHHHhhhhhccCc Confidence 43222111111000 00111111111211111111 112233334 88999999999999999999 Q ss_pred eeecch--------------h-----------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEe Q lcl|NC_018285. 64 TTSRKQ--------------M-----------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYL 118 (383) Q Consensus 64 ~~~~~~--------------~-----------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l 118 (383) .+.-.. . 
......+|+.+++.++++.++.|++.+||+|+.++++..|+|++++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~pv~L~~l 160 (651) T protein:vir:99 81 DLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGRPVGLAYV 160 (651) T ss_pred eeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccchhhhhhc Confidence 864210 0 112344688999999999999999999999999999999999999999 Q ss_pred ccceeEEEEcCCCc--------------------------------eeEEEEe--------------------------- Q lcl|NC_018285. 119 RPSQVSFNRLDNQN--------------------------------GLYYNVT--------------------------- 139 (383) Q Consensus 119 ~~~~v~~~~~~~~~--------------------------------~~~y~~~--------------------------- 139 (383) +++.+++..+.... ..++.+. T Consensus 161 p~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~~~ 240 (651) T protein:vir:99 161 PARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIRYREDEES 240 (651) T ss_pred ChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEEeccCcce Confidence 99887654321100 0000000 Q ss_pred -------------ecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEe Q lcl|NC_018285. 140 -------------FDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILK 206 (383) Q Consensus 140 -------------~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~ 206 (383) +.....+....++++||||||.+++.++++|+||+..+..++..+.++++++.++|+||++|+++|+ T Consensus 241 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~ 320 (651) T protein:vir:99 241 EREPIFVDRETGDVTTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDTIPRMVIK 320 (651) T ss_pred eeeeecccceeeeEEEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 0000112234688999999999988778899999999999999999999999999999999999999 Q ss_pred ecC-CCCHHHHHHHHHHHHHhhcCCcceeecCC-----------CceeeecccCh-hhHHHHHHHHHHHHHHHHHhcCCH Q lcl|NC_018285. 
207 IKG-GGLLDFKTKVSRSRQAMKQMQGGPLVLDD-----------LEDFTPLEIKS-NVAQLLKQADWTTGQFAKVYGIPE 273 (383) Q Consensus 207 ~~~-~~~~e~~~~~~~~~~~~~~~~g~~~vl~~-----------g~~~~~~~~~~-~d~~~~e~~~~~~~~Ia~~~gVpp 273 (383) +++ .+++++++++++.|+.+.+|+|+++||+. |++|++++.++ +|+||+|++++++++||++||||| T Consensus 321 ~~~~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~afgVPp 400 (651) T protein:vir:99 321 VTGGELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHEIAKVLEVPP 400 (651) T ss_pred ecCCCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHHhCCCH Confidence 976 48999999999999999999999998865 89999999976 599999999999999999999999 Q ss_pred HHhcc--cccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch----------hhccchhhhccCHHHHHHHHHHHHhC Q lcl|NC_018285. 274 NVVGG--QGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCD----------VDADIFPAVDPTGANYISRINSMVKS 341 (383) Q Consensus 274 ~~lg~--~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~----------~e~~~~~~~~~~~~~~~~~~~~l~~~ 341 (383) ++||. .+++++++++.+.|+.+||+|+++.|+++||++|++. ++|+....++.|...+++.+..++++ T Consensus 401 ~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~~~e~~~~~i~~ 480 (651) T protein:vir:99 401 VKIGVTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGADQPKQEAQLAEQRVRAMRLA 480 (651) T ss_pred HHhccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCceEEEEeccchhhhccHHHHHHHHHHHHhC Confidence 99996 3557788999999999999999999999999999874 34566667788999999999999999 Q ss_pred CCcCHHHHHHHhhcCCcCC--cchhHHh---CCCCCCCCCCCCCCCC Q lcl|NC_018285. 342 GTLAQNQGLYILQQAEILP--KELPKGE---NPNRTILKGGETNGQD 383 (383) Q Consensus 342 g~~t~nE~r~~lg~~~~~~--~d~~~~~---~~~~~~~~ggd~~~~d 383 (383) |++|+||+|+++|++|+++ ++..... .....+.+|||++++. 
T Consensus 481 G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge~~~~~ 527 (651) T protein:vir:99 481 GVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGETEAVH 527 (651) T ss_pred CCcCHHHHHHHhCCCCCCCccccccccccccccccccccCCCCcccc Confidence 9999999999999999864 2322110 1111245677766544 No 88 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=100.00 E-value=1.8e-61 Score=353.61 Aligned_cols=378 Identities=9% Similarity=0.012 Sum_probs=264.3 Q ss_pred CchhhhhhcCCcccccccccccchhhc----------ccccCC-----ceech----hhhhccHHHHHHHHHHHHhhhhC Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFL----------ATLNGS-----EWVSA----ETALKNSDLFSIISQLSNDLATA 61 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~-----~~~~~----~~a~~~~~v~~~i~~ia~~ia~~ 61 (383) |+|=......|+..........++... ....++ -+++. +.+..+|.|++||++||++||++ T Consensus 34 ~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~~~epp~d~~~l~~l~~~np~V~~aI~iia~~ia~l 113 (648) T protein:vir:79 34 MQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRDFEEPEFDFNEITSAYNTEGYVRQAVDKYIEMMFKA 113 (648) T ss_pred cccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcCCccccccCCcCHHHHHHHHhcChHHHHHHHHHHHHHhhC Confidence 666443333222211111111111110 000011 11121 22346999999999999999999 Q ss_pred ceeeecchh--------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCc---------------eeEEEEe Q lcl|NC_018285. 62 KLTTSRKQM--------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGR---------------DMKWEYL 118 (383) Q Consensus 62 p~~~~~~~~--------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~---------------~~~l~~l 118 (383) |+.+...+. ..+..+||++||+++||+.++.+++++||||++++|+.+|. 
+.++||| T Consensus 114 ~~~i~~~~~~~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl 193 (648) T protein:vir:79 114 DWDFVSKNPNAVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPL 193 (648) T ss_pred cceEEecCCccchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEee Confidence 999875442 24567999999999999999999999999999999998883 4789999 Q ss_pred ccceeEEEEcCCCceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_018285. 119 RPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNA 198 (383) Q Consensus 119 ~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 198 (383) +|.+|++..++++....|.+...+ ++..+.++++||||||.+++.+.++|+||+.++..+|....+++++..++|+|| T Consensus 194 ~p~~v~v~~d~~g~~~~Y~y~~~g--~~~~~~~~~~dIIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NG 271 (648) T protein:vir:79 194 NLASMKVKRDKFGMIKGWQQEQEG--QDKPQKFKPEDIVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRN 271 (648) T ss_pred cCceeEEEEcCCCceeeeEEEecC--CceeEEecCccEEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 999999998888777777765433 344567999999999987777778999999999999999999999999999999 Q ss_pred CCcceeEeecC-CCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecc----cChhhHHHHHHHHHHHHHHHHHhcCCH Q lcl|NC_018285. 199 LNANGILKIKG-GGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLE----IKSNVAQLLKQADWTTGQFAKVYGIPE 273 (383) Q Consensus 199 ~~~~~i~~~~~-~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~----~~~~d~~~~e~~~~~~~~Ia~~~gVpp 273 (383) ++|+++++++. ....++.+++++.+.....+ ..+.++++++..+. 
.+++|+||++++++++++||++||||| T Consensus 272 a~P~gil~~~~~~~~~e~~k~~~e~~~~~~~~---~~i~gg~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP 348 (648) T protein:vir:79 272 LHPLWHVKVGLEQEGFGAEEGEVDLVRGEVEN---MDVEGGMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSE 348 (648) T ss_pred CCccEEEEeCCCccchHHHHHHHHHHHHhccc---ccccccccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCH Confidence 99999999863 34445555555666554433 23334444443332 366899999999999999999999999 Q ss_pred HHhcccc-cCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------------hhhccchhhhccCHHHHHHHHH Q lcl|NC_018285. 274 NVVGGQG-DQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSC----------------DVDADIFPAVDPTGANYISRIN 336 (383) Q Consensus 274 ~~lg~~~-~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~----------------~~e~~~~~~~~~~~~~~~~~~~ 336 (383) ++||... ++.++.++...++..++.|+...++..++..+.. .++|+.....+.|....+..+. T Consensus 349 ~lLG~~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~ 428 (648) T protein:vir:79 349 LMMGRGGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAV 428 (648) T ss_pred hHcccCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccceEEEeecccchhhHHHHHHHHH Confidence 9999643 3445566666777888888877776665543321 1355555666677777888888 Q ss_pred HHHhCCCcCHHHHHHHhhcCCcCCcchhH-HhCC-------------CCCCCCCCCCCC------CC Q lcl|NC_018285. 337 SMVKSGTLAQNQGLYILQQAEILPKELPK-GENP-------------NRTILKGGETNG------QD 383 (383) Q Consensus 337 ~l~~~g~~t~nE~r~~lg~~~~~~~d~~~-~~~~-------------~~~~~~ggd~~~------~d 383 (383) .++++|++|+||+|+++|++|+++++-+. .... ...+..++..+. 
.+ T Consensus 429 ~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~e 495 (648) T protein:vir:79 429 FLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALAALAPTPAGGSSASASGDKKKKA 495 (648) T ss_pred HHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccccchhccccccCCCCCCCCCCCCccccccccc Confidence 99999999999999999999997654221 1000 000111111000 00 No 89 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=100.00 E-value=4.3e-59 Score=340.59 Aligned_cols=269 Identities=18% Similarity=0.242 Sum_probs=236.0 Q ss_pred hhhCceeeecch-------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCC Q lcl|NC_018285. 58 LATAKLTTSRKQ-------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDN 130 (383) Q Consensus 58 ia~~p~~~~~~~-------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~ 130 (383) ||++|+++++.. .+.|+.+||++||+.+||+.++.+++++||||++++|+.+|++++|+|++|++|++..+++ T Consensus 1 ia~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~~~~~ 80 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (278) T ss_pred CccceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEEEcCC Confidence 999999998643 3457889999999999999999999999999999999999999999999999999999988 Q ss_pred CceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC Q lcl|NC_018285. 131 QNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG 210 (383) Q Consensus 131 ~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~ 210 (383) +..++|.+...+ +..+.++++||||++++++.+.++|.||+.++..++....++++++...+.+ .|+++++.++. 
T Consensus 81 ~~~~~y~~~~~~---g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~i~~~~~~ 155 (278) T protein:vir:78 81 SRELYYSIHAAT---GNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQK--PDSFMLKYGSN 155 (278) T ss_pred CceEEEEEEcCC---ceEEEEccccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhcC--CCcEEEEeCCC Confidence 888888886544 4467899999999999888888999999999999999999999887655544 57999999999 Q ss_pred CCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc--ccCcCHHHH Q lcl|NC_018285. 211 GLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ--GDQQSSLEM 288 (383) Q Consensus 211 ~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~--~~~~~~~e~ 288 (383) +++|+++++++.|+...+++|+++++++|++|++++.++.|+|+.|.+++++++||++|||||.+||+. +++++.+++ T Consensus 156 l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~ 235 (278) T protein:vir:78 156 VGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEEL 235 (278) T ss_pred CCHHHHHHHHHHHHHHhccCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH Confidence 999999999999999999999999999999999999999999999999999999999999999999964 567788999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcchhhccchhhhccCHHHHHHHHHHH Q lcl|NC_018285. 289 SSNVYSKAVARYLRPFLSELSQKLSCDVDADIFPAVDPTGANYISRINSM 338 (383) Q Consensus 289 ~~~~~~~~l~P~~~~i~~~l~~~l~~~~e~~~~~~~~~~~~~~~~~~~~l 338 (383) .+.|+..||+|+++.|+++||++|+++.+.... 
..+.++++.| T Consensus 236 ~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~~g-------~~~~f~~~~l 278 (278) T protein:vir:78 236 NRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKI-------GILNLTLNLI 278 (278) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCChhHhcCC-------ceEEEecccC Confidence 999999999999999999999999986543211 1111111111 No 90 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=100.00 E-value=3.3e-52 Score=302.80 Aligned_cols=330 Identities=9% Similarity=0.020 Sum_probs=228.2 Q ss_pred CchhhhhhcCCcc---cccccc------cccchhhcccccCCcee--c-hhh------------hhccHHHHHHHHHHHH Q lcl|NC_018285. 1 MPIFNLATESPPN---NQGGFF------DITDPEFLATLNGSEWV--S-AET------------ALKNSDLFSIISQLSN 56 (383) Q Consensus 1 Mglf~~~~~~~~~---~~~~~~------~~~~~~~~~~~~~~~~~--~-~~~------------a~~~~~v~~~i~~ia~ 56 (383) |+=-++-..++.. .+.... ..+...-...+..+.++ . ... ....|.-+.|+..+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~~~ 80 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLARSFR 80 (368) T ss_pred CCccccccchhccCcccccccccCcchhhccccCceEEEEcCCceeecchhhHHHHHHHHhccchhccCcCHHHHHHHHh Confidence 6543321111100 000000 00000001111112211 1 100 1111222223222222 Q ss_pred hhhhC-ceeeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeE Q lcl|NC_018285. 57 DLATA-KLTTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLY 135 (383) Q Consensus 57 ~ia~~-p~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~ 135 (383) .-+.- .....+++...+..+||++||+++|++ ++.+++++||||++++|+..|++++|+|+++.+|++..+. .. 
+ T Consensus 81 ~~~~h~~~~~~~~n~l~l~~~Pn~~~t~~~f~~-l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~--~~-~ 156 (368) T protein:vir:79 81 AAAHHSSAVYVKRNILVSTFIPHPLLSRATFER-LVLDWQVFGNAYLERRENVLGGTIRLDTPLAKYVRRGLDL--NT-Y 156 (368) T ss_pred hccccchhhhhhcchhhhhcCCCcCCCHHHHHH-HHHHHhhcCCeEEEEEEcCCCCEEEEEEeCcccceeeccC--CE-E Confidence 21100 000123344567789999999999975 6789999999999999999999999999999999876543 22 2 Q ss_pred EEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHH Q lcl|NC_018285. 136 YNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLD 214 (383) Q Consensus 136 y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e 214 (383) |.+. . .+..+.|+++||||+|.+++.++++|+||+.++..++.+..+++.+..++|+||++|++||..++ .+++| T Consensus 157 ~~~~-~---~~~~~~~~~~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~~~l~~e 232 (368) T protein:vir:79 157 FFVQ-N---WQQPYTFAAGSVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTDAAQKQE 232 (368) T ss_pred EEEe-c---CCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHH Confidence 2222 1 24567899999999999998888999999999999999999999999999999999999998875 68999 Q ss_pred HHHHHHHHHHH--hhcCCcceeec-----CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc----ccCc Q lcl|NC_018285. 215 FKTKVSRSRQA--MKQMQGGPLVL-----DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ----GDQQ 283 (383) Q Consensus 215 ~~~~~~~~~~~--~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~----~~~~ 283 (383) +.+++++.|+. +..|+|+++|+ ++|++|++++.++.|+||+|++++++++||++|||||.+||.. 
++++ T Consensus 233 ~~~~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~s 312 (368) T protein:vir:79 233 DVDTLREAMKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFG 312 (368) T ss_pred HHHHHHHHHHHhcCCcccCceeEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCCccc Confidence 99999998864 45688899988 5789999999999999999999999999999999999999963 2357 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch-hhccchhhhccCHHHHHHHHHHHHhCCCcCH Q lcl|NC_018285. 284 SSLEMSSNVYSKAVARYLRPFLSELSQKLSCD-VDADIFPAVDPTGANYISRINSMVKSGTLAQ 346 (383) Q Consensus 284 ~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~-~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~ 346 (383) |.+++.+.|+++||.|+++.|+ ++|.+|..+ ++|+.....+.|....+.... .+. T Consensus 313 n~e~~~~~f~~~~l~Pl~~~ie-~ln~~l~~e~~rF~~~~l~~~D~~a~a~~~~-------rsa 368 (368) T protein:vir:79 313 DVEKAAMVFARNEVKPLQDRLL-AINDWIGDEVVRFAPYALGGHDQPAAAPGGQ-------RSA 368 (368) T ss_pred cHHHHHHHHHHHHHHHHHHHHH-HHHhccCcceeeechhHhhcccccccCCccc-------ccC Confidence 8899999999999999999998 688888654 345544444444333221100 000 No 91 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=100.00 E-value=1e-51 Score=300.19 Aligned_cols=308 Identities=14% Similarity=0.110 Sum_probs=223.5 Q ss_pred CchhhhhhcCCcc--cc-----c-----ccccccchh----------hcccccCCc----eechhh---hh-ccHHHHHH Q lcl|NC_018285. 1 MPIFNLATESPPN--NQ-----G-----GFFDITDPE----------FLATLNGSE----WVSAET---AL-KNSDLFSI 50 (383) Q Consensus 1 Mglf~~~~~~~~~--~~-----~-----~~~~~~~~~----------~~~~~~~~~----~~~~~~---a~-~~~~v~~~ 50 (383) |+--+.-.+.... +. . ..++..+|. +...+..+. +++... 
++ .++...+| T Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~fg~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~La~~~~~~~~h~s~ 105 (376) T protein:vir:10 26 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSA 105 (376) T ss_pred chhccCCCcccchhhhhHhhhccCcceeEEEEcCCceeccCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHHhhhh Confidence 6543322111000 00 0 000100110 001111111 111111 11 12223344 Q ss_pred HHHHHHhhhhCceeeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCC Q lcl|NC_018285. 51 ISQLSNDLATAKLTTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDN 130 (383) Q Consensus 51 i~~ia~~ia~~p~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~ 130 (383) |.+.++.+++ ..+||+.||+.+|++. +.+++++||||++++|+.+|++++|+||+|.+|++..+.+ T Consensus 106 l~~k~n~l~~-------------~~~Pnp~lT~~~f~~~-v~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~ 171 (376) T protein:vir:10 106 LFFKANVLAS-------------TFRPHRWLSRHAFERW-ALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFN 171 (376) T ss_pred HHHHhHHHHh-------------ccCCCCCCCHHHHHHH-HHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceEEEeeCC Confidence 4444443322 3479999999999855 5789999999999999999999999999999999887654 Q ss_pred CceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC- Q lcl|NC_018285. 131 QNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG- 209 (383) Q Consensus 131 ~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~- 209 (383) . .+| ... 
.+....|+++||||++.+++.+.++|+|++.++..++.+..+++.++.++|+||++|++||..++ T Consensus 172 ~--~~~--~~~---~~~~~~~~~~eViHir~~~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~d~ 244 (376) T protein:vir:10 172 G--FVY--VNG---WQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDA 244 (376) T ss_pred e--EEE--EEc---CCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCC Confidence 2 222 221 23567899999999999998888999999999999999999999999999999999999999875 Q ss_pred CCCHHHHHHHHHHHHH--hhcCCcceeec-----CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc--- Q lcl|NC_018285. 210 GGLLDFKTKVSRSRQA--MKQMQGGPLVL-----DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ--- 279 (383) Q Consensus 210 ~~~~e~~~~~~~~~~~--~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~--- 279 (383) .+++|+.+++++.|+. +..|.++++|+ ++|++|++++.++.|+||+|++++++++||++|||||.++|.. T Consensus 245 ~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~ 324 (376) T protein:vir:10 245 AQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSN 324 (376) T ss_pred CCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCC Confidence 6899999999998865 45677888887 4689999999999999999999999999999999999999963 Q ss_pred -ccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch-hhccchhhhccCHHHHHHHHHHHHhCCCcC Q lcl|NC_018285. 280 -GDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCD-VDADIFPAVDPTGANYISRINSMVKSGTLA 345 (383) Q Consensus 280 -~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~-~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t 345 (383) +++++.+++.+.|+.+||.|+++.|+ ++|.+|..+ ++|+.. 
.|++++..+ T Consensus 325 t~~~sn~eq~~~~f~~~~L~Pl~~~ie-eln~~L~~~~~~F~~~---------------~Llr~d~ka 376 (376) T protein:vir:10 325 SGGFGTPDTAARVFGRNEIRPLQARFA-ELNDWLGEEVVRFDDY---------------EIPPAPVAA 376 (376) T ss_pred CCCcccHHHHHHHHHHHHHHHHHHHHH-HHHhhccccccccChh---------------HhhcccccC Confidence 34678899999999999999999998 588887554 344443 344444433 No 92 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=100.00 E-value=1.4e-51 Score=299.32 Aligned_cols=303 Identities=12% Similarity=0.114 Sum_probs=225.2 Q ss_pred CchhhhhhcCCccccc-c-----cccccchhh----------cccccCCc----eechhhh---h-ccHHHHHHHHHHHH Q lcl|NC_018285. 1 MPIFNLATESPPNNQG-G-----FFDITDPEF----------LATLNGSE----WVSAETA---L-KNSDLFSIISQLSN 56 (383) Q Consensus 1 Mglf~~~~~~~~~~~~-~-----~~~~~~~~~----------~~~~~~~~----~~~~~~a---~-~~~~v~~~i~~ia~ 56 (383) |+ ++.+.+...... . .++..+|.. .+.+..+. +++.... + .++...+||.+.++ T Consensus 1 m~--~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~s~i~~k~n 78 (340) T protein:vir:98 1 MS--KRKPRKAVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLRSAVHHSSPIYVKRN 78 (340) T ss_pred CC--CCCCCccccccccCccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHHhccccchhhhhhhh Confidence 76 332211111110 0 011111100 00111111 0111110 0 01112334443333 Q ss_pred hhhhCceeeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEE Q lcl|NC_018285. 57 DLATAKLTTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYY 136 (383) Q Consensus 57 ~ia~~p~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y 136 (383) .+++ .-+||++||..+|++ ++.+++++||||++++|+..|++++|+|+++.+|++..+. 
..+| T Consensus 79 ~l~~-------------~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~~~---~~~~ 141 (340) T protein:vir:98 79 VLAS-------------TYIPHPLLSRQDFSR-FALDYLVFGNAFLEQRHSVTGQLIKLLTSPAKYTRRGVDD---SVFW 141 (340) T ss_pred HHhh-------------ccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceEEEcccC---cEEE Confidence 3322 348999999999965 5579999999999999999999999999999999886543 3445 Q ss_pred EEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHHH Q lcl|NC_018285. 137 NVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLDF 215 (383) Q Consensus 137 ~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e~ 215 (383) .+.. .+....|+++||||+|.+++.++++|+|++.++..++.+..+++.++.++|+||++|++|+.+++ .+++++ T Consensus 142 ~~~~----~~~~~~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~~~~ls~e~ 217 (340) T protein:vir:98 142 FVEN----FTQPHEFAPDTVFHLLEPDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVTDPAQSATD 217 (340) T ss_pred EEec----CCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHH Confidence 5443 23467899999999999998888999999999999999999999999999999999999999875 689999 Q ss_pred HHHHHHHHHH--hhcCCcceeec-----CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc----ccCcC Q lcl|NC_018285. 216 KTKVSRSRQA--MKQMQGGPLVL-----DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ----GDQQS 284 (383) Q Consensus 216 ~~~~~~~~~~--~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~----~~~~~ 284 (383) .+++++.|+. +..|.++++|+ ++|++|++++.++.|+||++++++++++||++|||||.++|.. 
+++++ T Consensus 218 ~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn 297 (340) T protein:vir:98 218 VESLRDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGD 297 (340) T ss_pred HHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCcccc Confidence 9999998865 45577888887 5789999999999999999999999999999999999999963 33578 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcch-hhccchhhhccC Q lcl|NC_018285. 285 SLEMSSNVYSKAVARYLRPFLSELSQKLSCD-VDADIFPAVDPT 327 (383) Q Consensus 285 ~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~-~e~~~~~~~~~~ 327 (383) .+++.+.|+.+||.|+++.|++ +|.+|..+ ++|+.....+.| T Consensus 298 ~e~~~~~f~~~~l~Pl~~~iee-~n~~L~~e~~rF~~~~l~~~d 340 (340) T protein:vir:98 298 VEKVAKVFVRNELSPLQDRFRE-VNDWLGMEVIRFKEYTLDNPE 340 (340) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHhcccccccccCccccccCC Confidence 8999999999999999999985 88888766 577766666666 No 93 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=100.00 E-value=2.6e-51 Score=297.88 Aligned_cols=312 Identities=10% Similarity=0.066 Sum_probs=227.5 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHH----------------HHHhh---h-- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQ----------------LSNDL---A-- 59 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~----------------ia~~i---a-- 59 (383) |+=..+-............. ...+.-+.+ +..|...++..|+.. 
+|+.+ + T Consensus 1 m~~~~~~~~~~~~~~~~~~~------~~~~~~~~p---~~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~~~~~h 71 (346) T protein:vir:10 1 MKKQLRKNLTQNDRLQPQAQ------TEIFSFGDP---IPVLDRADILNYLECSAMYEKWYNPPMSFDGLAKSLRSSTHH 71 (346) T ss_pred CCcccCCCCCcccccccccC------eEEEecCCc---ceecCchhHHHHHHHhhcCCceEecCCCHHHHHHHHHhhhhc Confidence 65443211110000000000 000111111 001111111111111 12111 1 Q ss_pred hCceeeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEe Q lcl|NC_018285. 60 TAKLTTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVT 139 (383) Q Consensus 60 ~~p~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~ 139 (383) +-.+++.++....++.+||++||+.+|++ ++.+++++||||++++|+..|++++|+|++|..|++..+.++ .+|.+. T Consensus 72 ~~~i~~k~n~l~~l~~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~~v~~~~~~~~--~~~~~~ 148 (346) T protein:vir:10 72 ESAIITKANILLSTCEVDSRYLSRRDLSS-FVKDYLVFGNAYFEVVRNRLGQVQRIESPLAKYVRKGLEAGQ--FYYVPQ 148 (346) T ss_pred chhhhhhhhhHHHHHhCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEecCCceEEEEcCCe--EEEEEE Confidence 12233334444567789999999999987 567899999999999999999999999999999998775543 333332 Q ss_pred ecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHHHHHH Q lcl|NC_018285. 140 FDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTK 218 (383) Q Consensus 140 ~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e~~~~ 218 (383) .. 
++..+.|+++||||+|.+++.+.++|+|++.++..++.+..+++.+..++|+||++|++|+..++ .+++|+.++ T Consensus 149 ~~---~g~~~~~~~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~ 225 (346) T protein:vir:10 149 RF---DHQEHEFAKGSIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVEN 225 (346) T ss_pred cc---CCeEEEEecccEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHH Confidence 22 24567899999999999998888999999999999999999999999999999999999999865 689999999 Q ss_pred HHHHHHH--hhcCCcceeecCC-----CceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc----ccCcCHHH Q lcl|NC_018285. 219 VSRSRQA--MKQMQGGPLVLDD-----LEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ----GDQQSSLE 287 (383) Q Consensus 219 ~~~~~~~--~~~~~g~~~vl~~-----g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~----~~~~~~~e 287 (383) +++.|+. +..|+|+++|+.+ |+++++++.++.|+||+|.+++++++||++|||||.+||.. +++++.++ T Consensus 226 i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~ 305 (346) T protein:vir:10 226 IRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVAD 305 (346) T ss_pred HHHHHHHhcCccccCceeEecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHH Confidence 9998864 3457788888854 78999999999999999999999999999999999999953 34678889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcch-hhccchhhhccCH Q lcl|NC_018285. 288 MSSNVYSKAVARYLRPFLSELSQKLSCD-VDADIFPAVDPTG 328 (383) Q Consensus 288 ~~~~~~~~~l~P~~~~i~~~l~~~l~~~-~e~~~~~~~~~~~ 328 (383) +.+.|+.++|.|+++.|++ ++.+|+.+ ++|+....++.+. 
T Consensus 306 ~~~~f~~~~l~P~~~~iee-~n~~L~~e~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 306 AAEVFFITEIEPLQERLKE-FNQWLGQEVIKFKPSKLLQRTQ 346 (346) T ss_pred HHHHHHHHHHHHHHHHHHH-HHhhcccceeeechhhhcccCC Confidence 9999999999999999985 77777765 5777666666555 No 94 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=100.00 E-value=2.2e-51 Score=298.32 Aligned_cols=311 Identities=8% Similarity=0.019 Sum_probs=225.2 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhC---ceeee------cch-- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA---KLTTS------RKQ-- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~---p~~~~------~~~-- 69 (383) |. .....+.... +.... ..+.. ++. ++..|..+.+..|+.++.+..... |+... +.. T Consensus 1 ~~--~~~~~~~~~~--~~~~~---~~~~~--~~~---p~~~~~~~~~~~~~~~~~~~~~~~~epp~~~~~La~l~~~n~~ 68 (348) T protein:vir:26 1 MT--EQLIHSHTTD--GTESK---SVYSF--DPN---PEPVDTNSWMTRYCELFYNDFDDYWEPPISLKGLAEIANANGY 68 (348) T ss_pred CC--ccccchhhcc--ccCCc---eEEEe--cCC---CeeecCcchHHHHHHHHhcCCCccccCCCCHHHHHHHHhhhhh Confidence 32 1000000000 00000 00000 101 222344445556666655443322 33211 000 Q ss_pred --------hhh--hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEe Q lcl|NC_018285. 70 --------MQG--IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVT 139 (383) Q Consensus 70 --------~~~--l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~ 139 (383) .+. ..-+||++||+.+|++. +.+++++||||++++|+..|+|++|+|+++.+|++..+.+ +|.+. 
T Consensus 69 h~~~i~~k~N~l~~~~~Pn~~~t~~~f~~~-~~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~v~~~~d~~----~~~~~ 143 (348) T protein:vir:26 69 HGSLLKARANYVAGRFMNGGGLPMYKMNSA-CWDYFGLGMSAFVKIRSYLKNVIALEPLPMVHMRKRKNGD----FVQLL 143 (348) T ss_pred hhhhHhhhhhHHhhcccCCCCCCHHHHHHH-HHHHHhcCCeEEEEEEcCCCcEEEEEEecCceeEeeecCc----EEEEE Confidence 011 13469999999999765 5799999999999999999999999999999998876432 33333 Q ss_pred ecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHHHHHH Q lcl|NC_018285. 140 FDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTK 218 (383) Q Consensus 140 ~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e~~~~ 218 (383) .. +..+.|+++||||+|.+++.+.++|+|++.++..++.+..+++.++.++|+||++|++|+..++ .+++|++++ T Consensus 144 ~~----g~~~~f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~~~~ 219 (348) T protein:vir:26 144 RN----NEQKVFKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYATDPNLSEADEKA 219 (348) T ss_pred ec----CeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHH Confidence 22 3567899999999999998888999999999999999999999999999999999999998765 699999999 Q ss_pred HHHHHHHh--hcCCcceeec-----CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc----cccCcCHHH Q lcl|NC_018285. 219 VSRSRQAM--KQMQGGPLVL-----DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG----QGDQQSSLE 287 (383) Q Consensus 219 ~~~~~~~~--~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~----~~~~~~~~e 287 (383) +++.|+.. .+|.++++|+ +.|+++++++.++.|+||++.+++++++||++|||||.++|. 
++++++.++ T Consensus 220 lk~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~ 299 (348) T protein:vir:26 220 LKEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLK 299 (348) T ss_pred HHHHHHHhcCcccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHH Confidence 99988653 4577888888 788999999999999999999999999999999999999995 245678899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcch----hhccchhhhccCHHHHHHHH Q lcl|NC_018285. 288 MSSNVYSKAVARYLRPFLSELSQKLSCD----VDADIFPAVDPTGANYISRI 335 (383) Q Consensus 288 ~~~~~~~~~l~P~~~~i~~~l~~~l~~~----~e~~~~~~~~~~~~~~~~~~ 335 (383) +.+.|+.++|.|+++.|+++||++|... ++||+....+.+... .+ T Consensus 300 ~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~fdl~~~~e~~~~~---a~ 348 (348) T protein:vir:26 300 VSQVYDFYEVIPVCKRFMDAVNNDPEIPDNLKLKFNLNPGVESANGS---AV 348 (348) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhCCCCccEEEEecCcccccchhh---cC Confidence 9999999999999999999999998632 334433222211111 11 No 95 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=100.00 E-value=2.6e-51 Score=297.87 Aligned_cols=308 Identities=13% Similarity=0.117 Sum_probs=220.7 Q ss_pred CchhhhhhcCC-cc-cc----------cccccccchh----------hcccccCCce----echhh---hh-ccHHHHHH Q lcl|NC_018285. 1 MPIFNLATESP-PN-NQ----------GGFFDITDPE----------FLATLNGSEW----VSAET---AL-KNSDLFSI 50 (383) Q Consensus 1 Mglf~~~~~~~-~~-~~----------~~~~~~~~~~----------~~~~~~~~~~----~~~~~---a~-~~~~v~~~ 50 (383) |+=-+.-.+.. +. +. ...++..+|. +...+..+.+ ++... 
++ .++...+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~~ 80 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSA 80 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHhhhhh Confidence 54322111100 00 00 0000000110 0000111110 11110 00 11222233 Q ss_pred HHHHHHhhhhCceeeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCC Q lcl|NC_018285. 51 ISQLSNDLATAKLTTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDN 130 (383) Q Consensus 51 i~~ia~~ia~~p~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~ 130 (383) |.+.++.++ ..-+||+.||+.+|+ +++.+++++||||++++|+..|.+++|+|++|++|++..+.+ T Consensus 81 l~~k~n~l~-------------~~~~Pnp~~t~~~f~-~~v~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~ 146 (351) T protein:vir:79 81 LFFKANVLA-------------STFRPHRWLSRHAFE-RWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFS 146 (351) T ss_pred hhhhhhHHh-------------hcccCCCCCCHHHHH-HHHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceeeeecCC Confidence 333333222 135799999999996 466899999999999999999999999999999999877654 Q ss_pred CceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC- Q lcl|NC_018285. 131 QNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG- 209 (383) Q Consensus 131 ~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~- 209 (383) + |.+... 
.+....|+++||||+|.+++.+.++|+|++.++..++.+..+++.+..++|+||++|++|+..++ T Consensus 147 ~----~~~~~~---~g~~~~~~~~eIihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~il~~~~~ 219 (351) T protein:vir:79 147 G----FVYVNG---WQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDA 219 (351) T ss_pred e----EEEEec---CceEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCC Confidence 3 222222 24567899999999999999889999999999999999999999999999999999999999875 Q ss_pred CCCHHHHHHHHHHHHH--hhcCCcceeec-----CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc--- Q lcl|NC_018285. 210 GGLLDFKTKVSRSRQA--MKQMQGGPLVL-----DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ--- 279 (383) Q Consensus 210 ~~~~e~~~~~~~~~~~--~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~--- 279 (383) .+++|+.+++++.|+. +.+|.++++|+ ++|+++++++.++.|+||+|++++++++||++|||||.++|.. T Consensus 220 ~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~llGi~~~~ 299 (351) T protein:vir:79 220 AQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSN 299 (351) T ss_pred CCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCC Confidence 6899999999998865 34677888887 5789999999999999999999999999999999999999963 Q ss_pred -ccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch-hhccchhhhccCHHHHHHHHHHHHhCCCcC Q lcl|NC_018285. 280 -GDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCD-VDADIFPAVDPTGANYISRINSMVKSGTLA 345 (383) Q Consensus 280 -~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~-~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t 345 (383) +++++.+++.+.|+.+||.|+++.|++ +|.+|..+ ++|+... 
+++++..+ T Consensus 300 t~~~~n~e~~~~~f~~~~l~Pl~~~ie~-ln~~lg~~~~~F~~~~---------------llr~d~~a 351 (351) T protein:vir:79 300 SGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGDEVVTFDDYE---------------IPPAPVAA 351 (351) T ss_pred CCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcCcceeeeChhh---------------hccccccC Confidence 346788999999999999999999975 88877654 3444443 33333333 No 96 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=100.00 E-value=4.6e-51 Score=296.56 Aligned_cols=314 Identities=11% Similarity=0.116 Sum_probs=219.7 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCc--ee-chh------hhhcc------HHHHHHHHHHH--HhhhhCce Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSE--WV-SAE------TALKN------SDLFSIISQLS--NDLATAKL 63 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~------~a~~~------~~v~~~i~~ia--~~ia~~p~ 63 (383) |+=-+. +++.+.........+.. ..+..+. +| +.. ....+ |.-+.++-.+. +..-+-++ T Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~-~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~s~i 76 (344) T protein:vir:56 1 MSKKKG---KTPQPAAKTMTASAPKM-EAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPI 76 (344) T ss_pred CCCCCC---CCCchhhHHhhcCCCce-EEEEcCCceeecCcchhhhHHHhhhcCccccCCCCHHHHHHHHhhhhhhCccc Confidence 543222 22211111100000000 0000011 11 000 00000 11011111110 11111122 Q ss_pred eeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCc Q lcl|NC_018285. 64 TTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDP 143 (383) Q Consensus 64 ~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~ 143 (383) ...+ +.-...-+||++||+.+| ++++.+++++||||++++|+..|++++|+|+++.+|++..+.+ .+|.+.. 
T Consensus 77 ~~k~-n~l~~~~~Pnp~~t~~~f-~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~---~~~~~~~--- 148 (344) T protein:vir:56 77 YVKR-NILASTFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED---VYWWVPS--- 148 (344) T ss_pred eehh-hhHHhhcCCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEEeecCC---EEEEEec--- Confidence 2221 111124589999999999 6778899999999999999999999999999999999876543 2333322 Q ss_pred ccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHHHHHHHHHH Q lcl|NC_018285. 144 RIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTKVSRS 222 (383) Q Consensus 144 ~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e~~~~~~~~ 222 (383) .+....|+++||||++.+++.++++|+||+.++..++.+..+++.+..++|+||++|++|+..++ .+++|+++++++. T Consensus 149 -~g~~~~~~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d~~ls~e~~~~lk~~ 227 (344) T protein:vir:56 149 -FNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLREN 227 (344) T ss_pred -CCeEEEEcCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHH Confidence 24567899999999999998888999999999999999999999999999999999999999875 6899999999999 Q ss_pred HHHhh-cCCcceeec------CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc----ccCcCHHHHHHH Q lcl|NC_018285. 223 RQAMK-QMQGGPLVL------DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ----GDQQSSLEMSSN 291 (383) Q Consensus 223 ~~~~~-~~~g~~~vl------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~----~~~~~~~e~~~~ 291 (383) |+... .+++++++| ++|++|++++.++.|+||+|++++++++||++|||||.++|.. +++++.+++.+. 
T Consensus 228 ~~~~~g~~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~~~ 307 (344) T protein:vir:56 228 MVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKV 307 (344) T ss_pred HHHhcCCCCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHH Confidence 97533 567888887 4799999999999999999999999999999999999999953 335678999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcch-hhccchhhhccCH Q lcl|NC_018285. 292 VYSKAVARYLRPFLSELSQKLSCD-VDADIFPAVDPTG 328 (383) Q Consensus 292 ~~~~~l~P~~~~i~~~l~~~l~~~-~e~~~~~~~~~~~ 328 (383) |+.+||.|+++.|++ +|.+|+.+ ++|+-......|. T Consensus 308 f~~~tL~Pl~~~ie~-~n~~l~~~~~~F~~y~l~~~~~ 344 (344) T protein:vir:56 308 FVRNELIPLQDRIRE-INGWIGQEVIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHHHHHHH-HHhhhccccccCCCccccccCC Confidence 999999999999985 88888754 3444332222222 No 97 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=100.00 E-value=7.3e-51 Score=295.46 Aligned_cols=311 Identities=13% Similarity=0.124 Sum_probs=219.0 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHh--------------------hhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSND--------------------LAT 60 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~--------------------ia~ 60 (383) |+=-++-...+.. ...... ..-...+..+.+.. .+....++.|+.+..+. ..+ T Consensus 1 m~~~~~~~~~~~~-~~~~~~---~~~~~~~~f~~p~~---v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~ 73 (344) T protein:vir:60 1 MSKKKGKTLQPAA-KKMTAS---APKMEAFTFGEPVP---VLDRRDILDYVECISNGRWYEPPISFTGLAKSLRAAVHHS 73 (344) T ss_pred CCcccCCCCCchH-HhhcCC---cCcEEEEEcCCcee---ecCCcchhHHHHhhhcCccccCCCCHHHHHHHHHhhhhhc Confidence 4322211101100 000000 00000011111100 01111111222111111 111 Q ss_pred CceeeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEee Q lcl|NC_018285. 
61 AKLTTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTF 140 (383) Q Consensus 61 ~p~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~ 140 (383) -++...+ +.-...-+||+.||+.+| +.++.+++++||||++++|+..|+|++|+|+++.+|++..+.+ .+|.+.. T Consensus 74 ~~i~~k~-n~l~~~~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~~l~~~~vr~~~~~~---~~~~v~~ 148 (344) T protein:vir:60 74 SPIYVKR-NILASTFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED---VYWWVPS 148 (344) T ss_pred cchhhhh-hHHHhhccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCcceEEEeecCC---eEEEEcc Confidence 1222211 111124489999999999 5778999999999999999999999999999999999876543 2344332 Q ss_pred cCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHHHHHHH Q lcl|NC_018285. 141 DDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTKV 219 (383) Q Consensus 141 ~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e~~~~~ 219 (383) .+....|+++||||++.+++.++++|+||+.++..++.+..+++.+..++|+||++|++|+..++ .+++|+.+++ T Consensus 149 ----~~~~~~~~~~eIiHir~~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~~ls~e~~~~i 224 (344) T protein:vir:60 149 ----FNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEML 224 (344) T ss_pred ----CCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHH Confidence 23567899999999999998888999999999999999999999999999999999999999875 6999999999 Q ss_pred HHHHHHhh-cCCcceeec------CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc----ccCcCHHHH Q lcl|NC_018285. 220 SRSRQAMK-QMQGGPLVL------DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ----GDQQSSLEM 288 (383) Q Consensus 220 ~~~~~~~~-~~~g~~~vl------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~----~~~~~~~e~ 288 (383) ++.|+... .++++++++ .+|++|++++.++.|+||+|++++++++||++|||||.++|.. 
+++++.+++ T Consensus 225 k~~~~~~~g~~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~ 304 (344) T protein:vir:60 225 RENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKV 304 (344) T ss_pred HHHHHHhcCCCCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHH Confidence 99996543 466777776 4689999999999999999999999999999999999999953 235688999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcch-hhccchhhhccCH Q lcl|NC_018285. 289 SSNVYSKAVARYLRPFLSELSQKLSCD-VDADIFPAVDPTG 328 (383) Q Consensus 289 ~~~~~~~~l~P~~~~i~~~l~~~l~~~-~e~~~~~~~~~~~ 328 (383) .+.|+.++|.|+++.|+ +||.+|..+ ++|+.......|. T Consensus 305 ~~~f~~~~L~Pl~~~~e-~ln~~lg~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 305 AKVFVRNELIPLQDRIR-EINGWLGQEVIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHHHHHHHHH-HHHHhcCCcccccCccccCCCCC Confidence 99999999999999998 599998764 4454333333333 No 98 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=100.00 E-value=2.3e-50 Score=292.73 Aligned_cols=308 Identities=13% Similarity=0.122 Sum_probs=220.7 Q ss_pred CchhhhhhcCC-cc-cc----------cccccccchh----------hcccccCCce----echhh---hh-ccHHHHHH Q lcl|NC_018285. 1 MPIFNLATESP-PN-NQ----------GGFFDITDPE----------FLATLNGSEW----VSAET---AL-KNSDLFSI 50 (383) Q Consensus 1 Mglf~~~~~~~-~~-~~----------~~~~~~~~~~----------~~~~~~~~~~----~~~~~---a~-~~~~v~~~ 50 (383) |+=-+.-.+.. +. +. ...++..+|. +...+..+.+ ++... 
++ .++...+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~~ 80 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSA 80 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecCCCCHHHHHHHHhhhHhhhhh Confidence 54322111100 00 00 0000101110 0011111111 11111 01 12222333 Q ss_pred HHHHHHhhhhCceeeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCC Q lcl|NC_018285. 51 ISQLSNDLATAKLTTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDN 130 (383) Q Consensus 51 i~~ia~~ia~~p~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~ 130 (383) |.+.++.+++ .-+||+.||+++|++ ++.+++++||||++++|+..|++++|+|+++.+|++..+.+ T Consensus 81 l~~k~n~l~~-------------~~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~ 146 (351) T protein:vir:78 81 LFFKANVLAS-------------TFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFS 146 (351) T ss_pred hhhhhhHHhh-------------cccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEecCcceEEeeeCC Confidence 3333333321 347999999999975 55789999999999999999999999999999999887654 Q ss_pred CceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC- Q lcl|NC_018285. 131 QNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG- 209 (383) Q Consensus 131 ~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~- 209 (383) + |.+... 
.+....|+++||||++.+++.+.++|+|++.++..++.+..+++.++.++|+||++|++|+..++ T Consensus 147 ~----~~~~~~---~~~~~~~~~~eVihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~~~ 219 (351) T protein:vir:78 147 G----FVYVNG---WQERHEFAPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDA 219 (351) T ss_pred e----EEEEec---CCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCC Confidence 3 222222 24567899999999999998888999999999999999999999999999999999999999875 Q ss_pred CCCHHHHHHHHHHHHH--hhcCCcceeec-----CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc--- Q lcl|NC_018285. 210 GGLLDFKTKVSRSRQA--MKQMQGGPLVL-----DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ--- 279 (383) Q Consensus 210 ~~~~e~~~~~~~~~~~--~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~--- 279 (383) .+++|+.+++++.|+. +..|+++++|+ ++|+++++++.++.|+||+|++++++++||++|||||.++|.. T Consensus 220 ~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~ 299 (351) T protein:vir:78 220 AQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSN 299 (351) T ss_pred CCCHHHHHHHHHHHHHhcCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCC Confidence 6899999999998854 45678888887 4689999999999999999999999999999999999999963 Q ss_pred -ccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch-hhccchhhhccCHHHHHHHHHHHHhCCCcC Q lcl|NC_018285. 280 -GDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCD-VDADIFPAVDPTGANYISRINSMVKSGTLA 345 (383) Q Consensus 280 -~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~-~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t 345 (383) +++++.+++.+.|+.++|.|+++.|++ ++.+|..+ ++|+... 
+++++..+ T Consensus 300 t~~~sn~e~~~~~f~~~~l~P~~~~iee-~n~~l~~~~~~F~~~~---------------Llr~d~ka 351 (351) T protein:vir:78 300 SGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGDEVVRFDDYE---------------IPPAPVAA 351 (351) T ss_pred CCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcCccceecChhh---------------hccccccC Confidence 336788999999999999999999985 77777544 4444443 44443333 No 99 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=100.00 E-value=2.4e-50 Score=292.60 Aligned_cols=308 Identities=10% Similarity=0.007 Sum_probs=227.6 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhh---Cceeee------cch-- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLAT---AKLTTS------RKQ-- 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~---~p~~~~------~~~-- 69 (383) |+=-+. ++....+. ... ..+.-+. ++..|....+..|+.+..+.++. .|+... +.. T Consensus 1 m~~~~~---~~~~~~~~-~~~------~~~~~~~---p~~~~~~~~~~~~~~~~~~~~~~~~~pP~~~~~La~l~~~~~~ 67 (337) T protein:vir:78 1 MTKRQQ---QPAQAAAS-SPR------PSVVFSM---PEAIDPTAWMTDYTGVFYNPYGEYYQPPIDRKGLAKVARANAH 67 (337) T ss_pred CCCccc---Cccccccc-Cce------eEEEecC---cccccCcchhHhhhhhhhccCcceecCCCCHHHHHHHhhcchh Confidence 543221 11111101 000 0111111 12223444555566655554442 344221 111 Q ss_pred -hhhhccCCCccCCHH----HHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcc Q lcl|NC_018285. 70 -MQGIVDNPSNSANRF----NFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPR 144 (383) Q Consensus 70 -~~~l~~~PN~~~t~~----~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~ 144 (383) ...|..+||..++.+ +++++++.+++++||||++++|+..|+|++|+|+++.+|++..+. ..+|... 
T Consensus 68 h~~~L~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl~~~~v~~~~d~---~~~~~~~----- 139 (337) T protein:vir:78 68 HGAILMARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPLSSVYLRRREDG---CFVYLQQ----- 139 (337) T ss_pred hhhHHHhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEeCCceeEeeeCC---eEEEEEc----- Confidence 225778999877654 688999999999999999999999999999999999999877542 2333221 Q ss_pred cccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHHHHHHHHHHH Q lcl|NC_018285. 145 IPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTKVSRSR 223 (383) Q Consensus 145 ~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e~~~~~~~~~ 223 (383) .+....|+++||||+|.+++.++++|+|++.++..++.+..+++++..++|+||++|++|+..++ .+++++.+++++.| T Consensus 140 ~~~~~~~~~~eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~~~lk~~~ 219 (337) T protein:vir:78 140 GKPNLIYRPDDVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNMDDDTEEEMKEMI 219 (337) T ss_pred CCceEEECCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHH Confidence 23457899999999999998888999999999999999999999999999999999999999876 68999999999988 Q ss_pred HHh--hcCCcceeec-----CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc-----ccCcCHHHHHHH Q lcl|NC_018285. 224 QAM--KQMQGGPLVL-----DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ-----GDQQSSLEMSSN 291 (383) Q Consensus 224 ~~~--~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~-----~~~~~~~e~~~~ 291 (383) +.. .+|.++++|+ +.|++|++++.++.|+||+|++++++++||++|||||.++|.. ++++|.+++.+. 
T Consensus 220 ~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~~~ 299 (337) T protein:vir:78 220 ANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYDAT 299 (337) T ss_pred HHhcCcccccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHHHH Confidence 653 4567778777 5789999999999999999999999999999999999999852 345578889999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcchhhccchhhhccCHHHHHHHHHHHH Q lcl|NC_018285. 292 VYSKAVARYLRPFLSELSQKLSCDVDADIFPAVDPTGANYISRINSMV 339 (383) Q Consensus 292 ~~~~~l~P~~~~i~~~l~~~l~~~~e~~~~~~~~~~~~~~~~~~~~l~ 339 (383) |+.+||.|+++.|++++|+++++..++. ++..++.+++ T Consensus 300 f~~~~L~P~~~~ie~~~n~~ll~~~~~~----------~f~~~~~~~~ 337 (337) T protein:vir:78 300 YARNEVLPLCELVQDAINSAGLPRALWV----------TFRETIGAAV 337 (337) T ss_pred HHHHHHHHHHHHHHHHHhhhcCChhhce----------eccccccccC Confidence 9999999999999999999887754321 1112222222 No 100 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=100.00 E-value=6.6e-50 Score=290.21 Aligned_cols=311 Identities=12% Similarity=0.123 Sum_probs=218.1 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCc--ee-chhhhhccHHHHHHHH---------------HH--HHhhhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSE--WV-SAETALKNSDLFSIIS---------------QL--SNDLAT 60 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~a~~~~~v~~~i~---------------~i--a~~ia~ 60 (383) |+=-+. +++.+.........+. ...+..+. +| +.+. 
..+..-|++ .+ |+...+ T Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~-~~~~~f~~p~~v~~~~~---~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~ 73 (344) T protein:vir:20 1 MSKKKG---KTPQPAAKTMTASGPK-MEAFTFGEPVPVLDRRD---ILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHS 73 (344) T ss_pred CCcccC---CCCcchhhhhhccCCc-eEEEEcCCceEecCcch---hhhhhhhhhcCceecCCCCHHHHHHHHhhhhhhC Confidence 543222 2221111100000000 00011111 11 1110 001111111 11 111111 Q ss_pred CceeeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEee Q lcl|NC_018285. 61 AKLTTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTF 140 (383) Q Consensus 61 ~p~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~ 140 (383) -++...+ +.-...-+||+.||+.+| ++++.+++++||||++++|+..|++++|+|+++.+|++..+.+ .+|.+.. T Consensus 74 ~~i~~k~-n~l~~~~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~~~~~---~~~~~~~ 148 (344) T protein:vir:20 74 SPIYVKR-NILASTFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED---VYWWVPS 148 (344) T ss_pred ccceehh-hhHHHhccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCCceeEeeecCC---EEEEEcc Confidence 2222222 111123489999999999 5778999999999999999999999999999999999876543 2333322 Q ss_pred cCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHHHHHHH Q lcl|NC_018285. 
141 DDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTKV 219 (383) Q Consensus 141 ~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e~~~~~ 219 (383) .+..+.|+++||||++.+++.++++|+||+.++..++.+..+++.+..++|+||++|++|+.+++ .+++|+.+++ T Consensus 149 ----~~~~~~~~~~eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d~~l~~e~~~~i 224 (344) T protein:vir:20 149 ----FNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEML 224 (344) T ss_pred ----CCeEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHH Confidence 24567899999999999998888999999999999999999999999999999999999999864 6899999999 Q ss_pred HHHHHHhh-cCCcceeec------CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc----ccCcCHHHH Q lcl|NC_018285. 220 SRSRQAMK-QMQGGPLVL------DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ----GDQQSSLEM 288 (383) Q Consensus 220 ~~~~~~~~-~~~g~~~vl------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~----~~~~~~~e~ 288 (383) ++.|+... .++++++++ ++|++|++++.++.|+||+|++++++++||++|||||.++|.. +++++.+++ T Consensus 225 k~~~~~~~g~~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e~~ 304 (344) T protein:vir:20 225 RENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKV 304 (344) T ss_pred HHHHHHhcCCCCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHH Confidence 99996543 466777776 4689999999999999999999999999999999999999953 335678999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcch-hhccchhhhccCHH Q lcl|NC_018285. 289 SSNVYSKAVARYLRPFLSELSQKLSCD-VDADIFPAVDPTGA 329 (383) Q Consensus 289 ~~~~~~~~l~P~~~~i~~~l~~~l~~~-~e~~~~~~~~~~~~ 329 (383) .+.|+.++|.|+++.|+ +++.+|..+ ++|+.... 
+.+.+ T Consensus 305 ~~~f~~~~l~P~~~~~e-~in~~lg~~~i~F~~~~l-~~~d~ 344 (344) T protein:vir:20 305 AKVFVRNELIPLQDRIR-EINGWLGQEVIRFKNYSL-DTDND 344 (344) T ss_pred HHHHHHHHHHHHHHHHH-HHHHhcCCcccccCcccc-ccCCC Confidence 99999999999999998 588888654 34443222 22222 No 101 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=100.00 E-value=2.9e-50 Score=292.20 Aligned_cols=316 Identities=11% Similarity=0.036 Sum_probs=219.8 Q ss_pred CchhhhhhcCCcc--cccccc-cccc-hhhcccccCCce---echhhhhccHHHHHHHH---------HHHHhhh----- Q lcl|NC_018285. 1 MPIFNLATESPPN--NQGGFF-DITD-PEFLATLNGSEW---VSAETALKNSDLFSIIS---------QLSNDLA----- 59 (383) Q Consensus 1 Mglf~~~~~~~~~--~~~~~~-~~~~-~~~~~~~~~~~~---~~~~~a~~~~~v~~~i~---------~ia~~ia----- 59 (383) |+=-++..++.+. +..... .... ..-...+..+.+ ++.+..+....++.|-+ -||+.+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~la~~~~~~~~h 80 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSMEGLAKSVGSSVYL 80 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHHHHHHHHhhhhhh Confidence 6544332222111 000000 0000 000001111111 11111111112221111 1111110 Q ss_pred hCceeeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEe Q lcl|NC_018285. 60 TAKLTTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVT 139 (383) Q Consensus 60 ~~p~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~ 139 (383) +-++...+ +.-....+||++||+++|++ ++.+++++||||++++|+..|++++|+|+++.+|++..+.+ .+|.+. 
T Consensus 81 ~~~l~~k~-n~l~~~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~vr~~~~~~---~~~~~~ 155 (350) T protein:vir:11 81 QSGLKFKR-NMLAKTFIPHRLLSRATFEQ-FSLDWLTFGSAYLEQPRSRLGTRMPLQAPLAKYMRRGTDLE---TFYQVR 155 (350) T ss_pred ccchhhhh-hhhhhcccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCCEEEEEEeCCceeEeeecCC---eEEEEe Confidence 01111111 11112358999999999976 56799999999999999999999999999999999876543 344443 Q ss_pred ecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHHHHHH Q lcl|NC_018285. 140 FDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTK 218 (383) Q Consensus 140 ~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e~~~~ 218 (383) . .+....|+++||||++.+++.+.++|+||+.++..++.+..+++.+..++|+||++|++|+..++ .+++|+.++ T Consensus 156 ~----~~~~~~~~~~eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~~~ls~e~~~~ 231 (350) T protein:vir:11 156 S----WKDEHEFEKGSVIQLREADINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMTDAAQNEEDIDA 231 (350) T ss_pred e----CCeEEEECcccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHH Confidence 2 24567899999999999999989999999999999999999999999999999999999999875 689999999 Q ss_pred HHHHHHH--hhcCCcceeec-----CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc----ccCcCHHH Q lcl|NC_018285. 219 VSRSRQA--MKQMQGGPLVL-----DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ----GDQQSSLE 287 (383) Q Consensus 219 ~~~~~~~--~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~----~~~~~~~e 287 (383) +++.|+. +..|+|+++|+ +.|+++++++.++.|+||+|++++++++||++|||||.++|.. 
+++++.++ T Consensus 232 l~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~ 311 (350) T protein:vir:11 232 LRTALKTAKGPGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGGFGSISD 311 (350) T ss_pred HHHHHHHhcCccccCceeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcCCHHH Confidence 9998865 34677888887 4689999999999999999999999999999999999999953 34678899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcchh-hccchhhhccCHHHHHHHHHHH Q lcl|NC_018285. 288 MSSNVYSKAVARYLRPFLSELSQKLSCDV-DADIFPAVDPTGANYISRINSM 338 (383) Q Consensus 288 ~~~~~~~~~l~P~~~~i~~~l~~~l~~~~-e~~~~~~~~~~~~~~~~~~~~l 338 (383) +.+.|+.++|.|+++.|++ +|.+|+.++ +|+- + ++..| T Consensus 312 ~~~~f~~~~L~P~~~~ie~-ln~~l~~~~~~F~~---~---------~~~~l 350 (350) T protein:vir:11 312 AAAVWASLELAPMQTRLQQ-VNEMIGEEVVRFAQ---F---------DAPGL 350 (350) T ss_pred HHHHHHHHHHHHHHHHHHH-HHhhcCccccccCc---c---------cccCC Confidence 9999999999999999984 888887652 2221 1 11111 No 102 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=100.00 E-value=2.1e-50 Score=292.90 Aligned_cols=238 Identities=18% Similarity=0.245 Sum_probs=189.8 Q ss_pred CchhhhhhcCCcccccccccccchhhccc-ccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--------hh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLAT-LNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--------MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--------~~ 71 (383) ||||++..++...............+.+. ...+..++.+.||++|+|++||++||++||++|+++++.. 
.+ T Consensus 1 MglF~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~~~~~~~~ 80 (251) T protein:vir:46 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPSFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVN 80 (251) T ss_pred CCccccccccccCCCccchhhhhhhhccccCcCcceechhhhhccHHHHHHHHHHHHhHhhCceEEeeCccccccchHHH Confidence 99998776554332222211111111121 1245678999999999999999999999999999998743 24 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceee Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHV 151 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~ 151 (383) .|+.+||+.||+++||+.++.+++++||||++++|+.+|+|++|+||+|++|++..++++...++.........+..+.+ T Consensus 81 ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~g~~~~~ 160 (251) T protein:vir:46 81 LLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNV 160 (251) T ss_pred HHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCceEEEEECCCCcEEEEEEEeccCCcceeEEE Confidence 57789999999999999999999999999999999999999999999999999998877655444444444445667899 Q ss_pred cccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCC-CHHHHHHHHHHHHHh---h Q lcl|NC_018285. 152 PQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGG-LLDFKTKVSRSRQAM---K 227 (383) Q Consensus 152 ~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~-~~e~~~~~~~~~~~~---~ 227 (383) +++||||+|+++.++ ++|+||+.++..+|....++++++.++|+||++|++++++++.+ ++++++++++.|... . 
T Consensus 161 ~~~diiH~r~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~e~~~~~~~~~~~~~~g~ 239 (251) T protein:vir:46 161 KFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFPKVLVEL 239 (251) T ss_pred CCccEEEecCcCCCC-eeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCc Confidence 999999999987664 78999999999999999999999999999999999999999987 456678888888654 3 Q ss_pred cCCcceeecCCC Q lcl|NC_018285. 228 QMQGGPLVLDDL 239 (383) Q Consensus 228 ~~~g~~~vl~~g 239 (383) +|+|++++..+- T Consensus 240 ~n~g~~~~gm~~ 251 (251) T protein:vir:46 240 NKLGKLSYSMNQ 251 (251) T ss_pred ccccccccccCC Confidence 566776653222 No 103 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=100.00 E-value=3.1e-49 Score=286.51 Aligned_cols=320 Identities=9% Similarity=0.003 Sum_probs=217.8 Q ss_pred CchhhhhhcCCccccc----ccccccchhhcccccCCceec---hhhhhccHHHHHHHHHH--HHhhhhCceeeecchhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQG----GFFDITDPEFLATLNGSEWVS---AETALKNSDLFSIISQL--SNDLATAKLTTSRKQMQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~---~~~a~~~~~v~~~i~~i--a~~ia~~p~~~~~~~~~ 71 (383) |+-............+ ..++..+|.... ...+..+. ......-|.-+.++-.+ |+...+-.+.+.+ +.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~-~~~y~~~~~~~~~~~~epp~~~~~la~l~~~~~~h~~~i~~k~-n~l 78 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLNEISASP-ALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRA-NMV 78 (345) T ss_pred CCCCccccchhhcccCcceeEEeecCCccccc-chhhhhhhhcCCccccCCCCCHHHHHHHhhcccccccceeeec-hHH Confidence 5543322111100000 001111111000 00000000 00011111111111111 1111111222222 222 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceee Q lcl|NC_018285. 
72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHV 151 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~ 151 (383) ...-+||+.||+++|++ ++.+++++||||++++|+..|++++|+|++++.|++..+.+.....+... ....+....| T Consensus 79 ~~~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~~~~~~~~~~--~~~~g~~~~~ 155 (345) T protein:vir:37 79 SSLYEGGKALSRMDMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVRKDGGYSYLMKKSL--YDTAQEIYRY 155 (345) T ss_pred HhhccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEEeCCeeEEEEEeE--ecCCceEEEE Confidence 23457999999999985 55789999999999999999999999999999999877654322221111 1223456789 Q ss_pred cccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCCHHHHHHHHHHHHHh--hc Q lcl|NC_018285. 152 PQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGLLDFKTKVSRSRQAM--KQ 228 (383) Q Consensus 152 ~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~~e~~~~~~~~~~~~--~~ 228 (383) +++||||+|.+++.+.++|+|++.++..++.+..+++.++.++|+||++|++||..+ ..+++|+.+++++.|+.. .. T Consensus 156 ~~~dVihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d~~l~~e~~~~lk~~~~~~~g~~ 235 (345) T protein:vir:37 156 DAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIARKISESKGVG 235 (345) T ss_pred ccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecCCCCCHHHHHHHHHHHHHhcCcc Confidence 999999999999888899999999999999999999999999999999999999986 468999999999988653 45 Q ss_pred CCcceeec-----CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc----ccCcCHHHHHHHHHHHHHHH Q lcl|NC_018285. 229 MQGGPLVL-----DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ----GDQQSSLEMSSNVYSKAVAR 299 (383) Q Consensus 229 ~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~----~~~~~~~e~~~~~~~~~l~P 299 (383) |.++++++ +.|++|++++.++.|+||.|++++++++||++|||||.++|.. 
+++++.+++.+.|++++|.| T Consensus 236 n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~~f~~~~l~P 315 (345) T protein:vir:37 236 NFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDEVMP 315 (345) T ss_pred cccceEEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHHHHHHHHHHH Confidence 66777776 5799999999999999999999999999999999999999953 34567899999999999999 Q ss_pred HHHHHHHHHHHhhc-c---hhhccchhhhc Q lcl|NC_018285. 300 YLRPFLSELSQKLS-C---DVDADIFPAVD 325 (383) Q Consensus 300 ~~~~i~~~l~~~l~-~---~~e~~~~~~~~ 325 (383) +++.|++++|+.+. + .++|+.....+ T Consensus 316 ~~~~ie~~ln~~~~~~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 316 LQEIIAETINQDPEIKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHhhhhccCCCcceEEecchhhcC Confidence 99999999997431 1 12232211111 No 104 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=100.00 E-value=7.4e-49 Score=284.47 Aligned_cols=308 Identities=9% Similarity=0.029 Sum_probs=216.7 Q ss_pred CchhhhhhcCCccccc-----ccccccchh------hcccc--cCCce----echhh---hh-ccHHHHHHHHHHHHhhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQG-----GFFDITDPE------FLATL--NGSEW----VSAET---AL-KNSDLFSIISQLSNDLA 59 (383) Q Consensus 1 Mglf~~~~~~~~~~~~-----~~~~~~~~~------~~~~~--~~~~~----~~~~~---a~-~~~~v~~~i~~ia~~ia 59 (383) |+=...-.. ...... ..++..++. +.+.+ ..+.+ ++... .+ .++...+||...++.+ T Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~~~~~~~~h~~~i~~k~n~l- 78 (345) T protein:vir:37 1 MKTNVKTDN-KKGIVIAPINDRTFSLSEITASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRANMV- 78 (345) T ss_pred CCccccccc-hhhhcCCCceEEEeecCCcccchhhcccceeeecCCccccCCCCHHHHHHHhhcchhhcchhhhhhhHH- Confidence 544322111 000000 011111111 01110 01111 11110 01 1222333443333333 Q ss_pred hCceeeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEe Q lcl|NC_018285. 
60 TAKLTTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVT 139 (383) Q Consensus 60 ~~p~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~ 139 (383) ...-+||+.||+.+|++ ++.+++++||||++++|+..|++++|+|++|.+|++..+.+. .++... T Consensus 79 ------------~~~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~~d~~~--~~~~~~ 143 (345) T protein:vir:37 79 ------------SATYEGGKALSKMEMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVHKDGGY--SYLMKK 143 (345) T ss_pred ------------hhccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEecCceeEEeecCCe--eEEEee Confidence 22448999999999975 557899999999999999999999999999999998765432 222111 Q ss_pred ecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHHHHHH Q lcl|NC_018285. 140 FDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTK 218 (383) Q Consensus 140 ~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e~~~~ 218 (383) ......+....|+++||||++.+++.+.++|+|++.++..++.+..+++.++.++|+||++|++|+..++ .+++|+.++ T Consensus 144 ~~~~~~g~~~~~~~~eViHir~~~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t~~~l~~e~~~~ 223 (345) T protein:vir:37 144 SLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEE 223 (345) T ss_pred eeeccCceEEEEccccEEEEcCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHH Confidence 1112234567899999999999998888999999999999999999999999999999999999998765 689999999 Q ss_pred HHHHHHHhh--cCCcceeec-----CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc----ccCcCHHH Q lcl|NC_018285. 219 VSRSRQAMK--QMQGGPLVL-----DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ----GDQQSSLE 287 (383) Q Consensus 219 ~~~~~~~~~--~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~----~~~~~~~e 287 (383) +++.|+... .|.++++++ ++|+++++++.++.|+||.+++++++++||++|||||.++|.. 
+++++.++ T Consensus 224 lk~~~~~~~g~~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~ 303 (345) T protein:vir:37 224 IARKISESKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLK 303 (345) T ss_pred HHHHHHHhcCccccCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHH Confidence 999987644 344445554 4679999999999999999999999999999999999999953 34678899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhc-c---hhhccchhhhc Q lcl|NC_018285. 288 MSSNVYSKAVARYLRPFLSELSQKLS-C---DVDADIFPAVD 325 (383) Q Consensus 288 ~~~~~~~~~l~P~~~~i~~~l~~~l~-~---~~e~~~~~~~~ 325 (383) +.+.|+++||.|+++.|++++|+.+- + .++|+.....+ T Consensus 304 ~~~~f~~~~l~P~~~~ie~~ln~~~e~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 304 YREVYHYDEVMPLQEIIAETINQDPEIKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhccCCcceEEECchhhcC Confidence 99999999999999999999997431 1 13344333333 No 105 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=100.00 E-value=3e-41 Score=242.71 Aligned_cols=202 Identities=7% Similarity=0.006 Sum_probs=161.7 Q ss_pred eEEEEcCCCceeEEEEeecC-cccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCc Q lcl|NC_018285. 123 VSFNRLDNQNGLYYNVTFDD-PRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNA 201 (383) Q Consensus 123 v~~~~~~~~~~~~y~~~~~~-~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~ 201 (383) |++..+ +.++|.+.... 
...+....++++||+|+|.+++.++++|+||+.++..++....++++|+.++|+||++| T Consensus 1 ~r~~~d---g~~~y~~~~~~~~~~g~~~~~~~~eilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng~~p 77 (219) T protein:vir:98 1 MRVCKD---GNYKYLMKKSLYDTKSEIYEYNKNDVIFIKLYDPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNGAHM 77 (219) T ss_pred Cceeec---CeEEEEEecceecCCceeEEeccccEEEecCCCCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 333322 23444443221 12345678999999999999988889999999999999999999999999999999999 Q ss_pred ceeEeecC-CCCHHHHHHHHHHHHH--hhcCCcceeec-----CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCH Q lcl|NC_018285. 202 NGILKIKG-GGLLDFKTKVSRSRQA--MKQMQGGPLVL-----DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPE 273 (383) Q Consensus 202 ~~i~~~~~-~~~~e~~~~~~~~~~~--~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp 273 (383) ++||++++ .+++++++++++.|+. +..|+++++|+ +.|++|++++.+++|+||+|++++++.+||++||||| T Consensus 78 ~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp 157 (219) T protein:vir:98 78 GFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPP 157 (219) T ss_pred ceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCH Confidence 99998876 6999999999999865 33455566665 5689999999999999999999999999999999999 Q ss_pred HHhcc----cccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch----hhccchhhhccC Q lcl|NC_018285. 274 NVVGG----QGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCD----VDADIFPAVDPT 327 (383) Q Consensus 274 ~~lg~----~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~----~e~~~~~~~~~~ 327 (383) ++||. +++++|.+++.+.|+.+||.|+++.|+++||++++.. 
++|+.....+.+ T Consensus 158 ~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~~~~~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 158 GLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDYEIKSALKVNFKQPEKRDKN 219 (219) T ss_pred HHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCccEEeecCcccccCC Confidence 99984 3457789999999999999999999999999875422 233322222222 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.91 E-value=1.8e-24 Score=150.76 Aligned_cols=371 Identities=13% Similarity=0.018 Sum_probs=220.7 Q ss_pred CchhhhhhcCCcccccccccccchh-hcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchh--h---hhc Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPE-FLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQM--Q---GIV 74 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~--~---~l~ 74 (383) |.+.+.+.+- ....+...+..... ..+....+..+ ...+.+++.++++|+.+|+++.+.++++...+. . .+. T Consensus 1 ~~~~D~~~~~-~~~~g~~~~~~~~~~~~~~~~~~~~l-~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~~~~~~~~~ 78 (437) T protein:vir:52 1 MKFFDGIKSL-ALKLGSKQEQTYYSPSLSLTDDLVQL-EALWRDNWIANKVCIKRPEDMVRNWREIYSNDLNSKQLDLFT 78 (437) T ss_pred CchhhhhHhH-HhcCCCccccceeecCccccccHHHH-HHHHHhCchhhHHhhcchHHhhcCCceEecCCCCHHHHHHHH Confidence 8888876542 11111111100000 01111111111 122346888999999999999999999864321 1 122 Q ss_pred cCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC---------CceeEEEEeccceeEEEEcC--------CCceeEEE Q lcl|NC_018285. 75 DNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN---------GRDMKWEYLRPSQVSFNRLD--------NQNGLYYN 137 (383) Q Consensus 75 ~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~---------g~~~~l~~l~~~~v~~~~~~--------~~~~~~y~ 137 (383) ..-+.. ...+-+..++.+.-++|.|++++++++. |.+..+.+++++.|+..... .+....|. 
T Consensus 79 ~~~~~l-~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~p~~y~ 157 (437) T protein:vir:52 79 KFERSL-KLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPNFGRYSEYS 157 (437) T ss_pred HHHHhh-cHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhccccccccccccccccCcceEEE Confidence 221111 2234444555666689999999998763 67888999999888743221 23445666 Q ss_pred EeecCcccccceeecccceEEeccC---CCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC---CC Q lcl|NC_018285. 138 VTFDDPRIPPKQHVPQSDILHFRLL---SVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG---GG 211 (383) Q Consensus 138 ~~~~~~~~~~~~~~~~~dvih~~~~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~---~~ 211 (383) +.. ....+.+-++.||||.+. .+.+.+.|.|.++.+.+.|.....+.......+.+...+. +++++ .+ T Consensus 158 v~~----~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v--~k~~~l~~~l 231 (437) T protein:vir:52 158 ILG----GSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDI--FKIAGLSDKI 231 (437) T ss_pred Eec----CCcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCc--eecchHHHHh Confidence 542 123456888899999743 2345678999999999999999999999998887765543 34432 23 Q ss_pred CHHHHHHHHHHH--HHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccC--cCHHH Q lcl|NC_018285. 212 LLDFKTKVSRSR--QAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQ--QSSLE 287 (383) Q Consensus 212 ~~e~~~~~~~~~--~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~--~~~~e 287 (383) .....+.+.+.+ .....+.+++++++.+.+|+.++.+..+ +.+.......+||++.+||..+|.+.+.. 
++.++ T Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~e~~~~~~sg--l~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~ 309 (437) T protein:vir:52 232 AAGMENEVASVISAVQEIKSATNSLLLDAENEYDRKELTFTG--LKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDE 309 (437) T ss_pred cCCcHHHHHHHHHHHHHhcCCCceEEEcCCcceEEEecCcCC--HHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHH Confidence 222222232222 2233566889999999999999888754 45677788899999999999999765332 34455 Q ss_pred HHHHHHH-------HHHHHHHHHHHHHHHHhhcc----hhhccchhhhccCHHHH-------HHHHHHHHhCCCcCHHHH Q lcl|NC_018285. 288 MSSNVYS-------KAVARYLRPFLSELSQKLSC----DVDADIFPAVDPTGANY-------ISRINSMVKSGTLAQNQG 349 (383) Q Consensus 288 ~~~~~~~-------~~l~P~~~~i~~~l~~~l~~----~~e~~~~~~~~~~~~~~-------~~~~~~l~~~g~~t~nE~ 349 (383) ..+.|+. ..+.|.++.+-+.|-+..+. ++.+...++...+..+. ++.+..++++|+++++|+ T Consensus 310 D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~ 389 (437) T protein:vir:52 310 DIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGLPADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQI 389 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHH Confidence 5666664 45777777777766655543 23332233334443333 345667889999999999 Q ss_pred HHHhhcCC----cCCcchhHHhCCCC------CCCC-----CCCCCCC Q lcl|NC_018285. 350 LYILQQAE----ILPKELPKGENPNR------TILK-----GGETNGQ 382 (383) Q Consensus 350 r~~lg~~~----~~~~d~~~~~~~~~------~~~~-----ggd~~~~ 382 (383) |++|...+ ++..++...++... 
++.+ .++.++| T Consensus 390 r~~L~~~g~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 390 ANELRESGLFANISAEHIEELKNADEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred HHHHHhcCCCCCCCccccccccCCCCCCCccCCCCCCCCCCCCCCCCC Confidence 99986544 22233222222111 0011 1111111 No 107 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.88 E-value=3.6e-23 Score=143.61 Aligned_cols=373 Identities=11% Similarity=0.042 Sum_probs=214.1 Q ss_pred CchhhhhhcCCccccccccccc---chhhcccccCCceech---hh-hhccHHHHHHHHHHHHhhhhCceeeecchhhhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDIT---DPEFLATLNGSEWVSA---ET-ALKNSDLFSIISQLSNDLATAKLTTSRKQMQGI 73 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~---~~-a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l 73 (383) ||+|-+.++++.....++.+.. ++.....+..+..++. .. +.+++.+.++|+.+|+++.+..+++........ T Consensus 1 ~~~~m~~~~~~~~~~D~~~~~~~~~~g~~~~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~~~~ 80 (435) T protein:vir:79 1 MGVFMSDKVKAITKEDGYNEIFGSKDGTFRPNAFYMQRAAFKALSQFYEEDGMARRIVDVIPEEMVTPGFKVDGVKNEKS 80 (435) T ss_pred CCcccccccccchhhcchhhhhcccccccccCcccCCcCCHHHHHHHHhcCchhhhhhccchHHhhcCCceecCCChHHH Confidence 9999988888776665554421 1111111111111222 12 347888999999999999999998875433222 Q ss_pred cc-CCCccCCHHHHHHHHHHHHHHcCCeEEEEee-cC---------CCceeEEEEeccceeEEEEcC-------CCceeE Q lcl|NC_018285. 74 VD-NPSNSANRFNFYQSIFAQMLLGGEAFAYRWR-ND---------NGRDMKWEYLRPSQVSFNRLD-------NQNGLY 135 (383) Q Consensus 74 ~~-~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r-~~---------~g~~~~l~~l~~~~v~~~~~~-------~~~~~~ 135 (383) +. +-... ...+-+..++.+..++|.|++++.. +. .|.+..+.++++.+|++.... .+.+.. 
T Consensus 81 ~~~~~~~l-~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~~dp~sp~fg~P~~ 159 (435) T protein:vir:79 81 FKSRWDEL-RLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHERETNARSVRYGEPKL 159 (435) T ss_pred HHHHHHHh-hHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchhhccCCcccccCcceE Confidence 11 11222 2334445555666689998888774 32 244557888888887654321 233455 Q ss_pred EEEeecCcccccceeecccceEEeccCC------CCccccCcchH-HHHHHHHHHHHHHHHHHHHHHhccCCcceeEe-e Q lcl|NC_018285. 136 YNVTFDDPRIPPKQHVPQSDILHFRLLS------VDGGLTSVSPL-MALGRELDIQKASDKLTLNSLKNALNANGILK-I 207 (383) Q Consensus 136 y~~~~~~~~~~~~~~~~~~dvih~~~~~------~~~~~~G~s~~-~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~-~ 207 (383) |.+...+ ....+.+-++.||||.+.. +.+.++|.|++ +.+.+.|.....+.......+........-++ . T Consensus 160 y~v~~~~--~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l 237 (435) T protein:vir:79 160 YKISPGG--DIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDL 237 (435) T ss_pred EEEecCC--CCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhH Confidence 6654222 2334567788899986432 33456799998 57889999999999888887766544332221 1 Q ss_pred cCCCC-HHHHHHHHHH---HHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccC- Q lcl|NC_018285. 208 KGGGL-LDFKTKVSRS---RQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQ- 282 (383) Q Consensus 208 ~~~~~-~e~~~~~~~~---~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~- 282 (383) ...+. .+......+. +.....+.+.+++.+++.+|+.++.+..+ +.+..+....+||++.|||..+|.+.+.. 
T Consensus 238 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~~~lsg--l~~~~~~~~~~iaaa~~IP~t~L~G~s~~g 315 (435) T protein:vir:79 238 ALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLNSDVSG--VPEFLQEKIDRIVALTGIHEIIIKNKNTGG 315 (435) T ss_pred HHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEecccCC--HHHHHHHHHHHHHhhhCCCeeeeccCCccc Confidence 11111 1122222222 23333444556666667789988887754 57778888999999999999888654332 Q ss_pred --cCHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcchhhccchhhhccCHHHH-------HHHHHHHHhCCCcCH Q lcl|NC_018285. 283 --QSSLEMSSNVYSK-------AVARYLRPFLSELSQKLSCDVDADIFPAVDPTGANY-------ISRINSMVKSGTLAQ 346 (383) Q Consensus 283 --~~~~e~~~~~~~~-------~l~P~~~~i~~~l~~~l~~~~e~~~~~~~~~~~~~~-------~~~~~~l~~~g~~t~ 346 (383) ++.++..+.|+.. .+.|.++.+-+.+-.. +++.+...++...+..+. ++.+..++++|++++ T Consensus 316 lnstgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li~~s--~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~ 393 (435) T protein:vir:79 316 VSASQNTALETFYKLIDRKRVEDYKPILEFLLPFMISE--TEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINL 393 (435) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC--CCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCH Confidence 2233444455542 3445444443332211 333333334444444433 345667889999999 Q ss_pred HHHHHHhh----cCCcCCcc---hhHHhCCC-CCCCCCCCCC Q lcl|NC_018285. 347 NQGLYILQ----QAEILPKE---LPKGENPN-RTILKGGETN 380 (383) Q Consensus 347 nE~r~~lg----~~~~~~~d---~~~~~~~~-~~~~~ggd~~ 380 (383) +|+|+.+. ..++.+++ ++..+..+ ....||||+. 
T Consensus 394 ~e~r~~L~~~~~~~~~~~~~~~~~~~~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 394 KETRDTLRSICPDLKIMDNDNIELPEPEDLDPEPGQEGGLNK 435 (435) T ss_pred HHHHHHHHHhccccCCCCcccccCCccccCCCCCCCCCCCCC Confidence 99999872 23333332 22222222 2234566666 No 108 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.82 E-value=2.7e-20 Score=127.89 Aligned_cols=371 Identities=9% Similarity=-0.048 Sum_probs=204.3 Q ss_pred Cchhhhh--------hc---------CCc-ccccccccccchhh-----------cccccCC------------ceec-- Q lcl|NC_018285. 1 MPIFNLA--------TE---------SPP-NNQGGFFDITDPEF-----------LATLNGS------------EWVS-- 37 (383) Q Consensus 1 Mglf~~~--------~~---------~~~-~~~~~~~~~~~~~~-----------~~~~~~~------------~~~~-- 37 (383) |++|..- .. +++ .......+.+.+.+ ++...+. ..+. T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 104 (537) T protein:vir:10 25 VGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIGH 104 (537) T ss_pred cCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCccH Confidence 8887420 00 011 00000000000000 0000000 0011 Q ss_pred --hhhhhccHHHHHHHHHHHHhhhhCceeeecchh--------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeec Q lcl|NC_018285. 38 --AETALKNSDLFSIISQLSNDLATAKLTTSRKQM--------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRN 107 (383) Q Consensus 38 --~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~--------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~ 107 (383) ...+.+++.+..+|+.+|+++.+-++++...+. ..+....+.......|.+ ++.+..++|.+++++... 
T Consensus 105 ~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~~-a~~~~rlyG~~~i~i~v~ 183 (537) T protein:vir:10 105 QMCALIATHWLVNKACSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAIQ-FVRKGRIFGIRIALFKVD 183 (537) T ss_pred HHHHHHHhCchhhhhhhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHHH-HHHhcccccceEEEEeec Confidence 112346888999999999999998888754322 223333444444444444 445555789998887642 Q ss_pred -CC---------------CceeEEEEeccceeEEEEc----------CCCceeEEEEeecCcccccceeecccceEEecc Q lcl|NC_018285. 108 -DN---------------GRDMKWEYLRPSQVSFNRL----------DNQNGLYYNVTFDDPRIPPKQHVPQSDILHFRL 161 (383) Q Consensus 108 -~~---------------g~~~~l~~l~~~~v~~~~~----------~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~ 161 (383) .+ |....|..++|.+++.... +.+.+..|.+. .+.+-++.|+||.+ T Consensus 184 ~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v~--------g~~iH~SRli~f~g 255 (537) T protein:vir:10 184 SPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLIN--------GKKYHRSHLAIYIN 255 (537) T ss_pred CcCCcccccccccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeeec--------CeEecceeEEEecC Confidence 22 2345677788877765321 11223344432 23567889999875 Q ss_pred CCCC------ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCC-HHHHHHHHHHHHHhhcCCccee Q lcl|NC_018285. 162 LSVD------GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGL-LDFKTKVSRSRQAMKQMQGGPL 234 (383) Q Consensus 162 ~~~~------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~-~e~~~~~~~~~~~~~~~~g~~~ 234 (383) .... ..+.|.|.++.+...|.....+.......+.........+.....+. +++..+..+.+..+.+|. 
+++ T Consensus 256 ~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~n~-g~~ 334 (537) T protein:vir:10 256 DEVVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQFDETMSWWTATRDNY-QVR 334 (537) T ss_pred CCCchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHHHHHHHHHHHhhcCCc-cee Confidence 4322 34579999999999999999999998888888776544444333333 333333334444444444 456 Q ss_pred ecCC-CceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc-c--CcCHHHHHHHHHH------HHHHHHHHHH Q lcl|NC_018285. 235 VLDD-LEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG-D--QQSSLEMSSNVYS------KAVARYLRPF 304 (383) Q Consensus 235 vl~~-g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~-~--~~~~~e~~~~~~~------~~l~P~~~~i 304 (383) +++. +.+|+.++.+... +.+........||++.|||..+|.+.+ . +++.++..+.|+. ..|.|.++.+ T Consensus 335 ~id~e~e~~e~~~~~lsg--l~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I~~~Qe~l~p~l~~l 412 (537) T protein:vir:10 335 VVDKDNEDVVQIDTTLND--LDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEECESTQDDMRPLIDRH 412 (537) T ss_pred EecCCCceeEEEeccCCC--HHHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6665 6899988876654 456778888899999999999775433 2 2333444455553 2478888888 Q ss_pred HHHHHHhhcc-hhhc--cchhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhhcCC------cCCcchhHHhC Q lcl|NC_018285. 305 LSELSQKLSC-DVDA--DIFPAVDPTGANYI-------SRINSMVKSGTLAQNQGLYILQQAE------ILPKELPKGEN 368 (383) Q Consensus 305 ~~~l~~~l~~-~~e~--~~~~~~~~~~~~~~-------~~~~~l~~~g~~t~nE~r~~lg~~~------~~~~d~~~~~~ 368 (383) .+.+.+..+. ..++ ....+...+..+++ +.+..++++|+++++|+|+.|+..+ +.++.-....+ T Consensus 413 ~~ll~~~~~~~~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e 492 (537) T protein:vir:10 413 HQLVCRSHLRKRIRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAE 492 (537) T ss_pred HHHHHHhcCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCCCChhhhh Confidence 7777665543 2222 22334445555544 3467788999999999999998754 21110000000 Q ss_pred CCCC-----CCCCCCCCCCC Q lcl|NC_018285. 
369 PNRT-----ILKGGETNGQD 383 (383) Q Consensus 369 ~~~~-----~~~ggd~~~~d 383 (383) .... +.++.+..+.. T Consensus 493 ~~~~~~~~~~~~~~~~~~~~ 512 (537) T protein:vir:10 493 DIDVDDEGKPVRIIEDQPAP 512 (537) T ss_pred cccCCccCCcCCCCCCCCCc Confidence 0000 00111111000 No 109 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.81 E-value=2.9e-20 Score=127.69 Aligned_cols=373 Identities=11% Similarity=-0.003 Sum_probs=202.9 Q ss_pred CchhhhhhcCCc----cccccccccc--chhhc--------ccccCCceec----hhhhhccHHHHHHHHHHHHhhhhCc Q lcl|NC_018285. 1 MPIFNLATESPP----NNQGGFFDIT--DPEFL--------ATLNGSEWVS----AETALKNSDLFSIISQLSNDLATAK 62 (383) Q Consensus 1 Mglf~~~~~~~~----~~~~~~~~~~--~~~~~--------~~~~~~~~~~----~~~a~~~~~v~~~i~~ia~~ia~~p 62 (383) +++-....-.+. ..+....+.. +..+. ..+.....+. ...+.+++.++.+|+.+|+++.+-. T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~r~~Vd~~aed~~r~~ 112 (532) T protein:vir:94 33 LGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSWPGFPTLALLAQLPEYRTMHETPADECVRAW 112 (532) T ss_pred hhhhhhhhhcccccccccccccccccccccccCcccccccccccccccccchHHHHHHHHcCchhhhhhccchHHHhhCC Confidence 222111000000 0000000000 00000 0000000111 1223468889999999999999988 Q ss_pred eeeecchhh--------hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCC-------------------ceeEE Q lcl|NC_018285. 63 LTTSRKQMQ--------GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNG-------------------RDMKW 115 (383) Q Consensus 63 ~~~~~~~~~--------~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g-------------------~~~~l 115 (383) +++...... .+...-... 
...+-+..++.+..++|.+++++....+| .+..| T Consensus 113 ~~i~~~~~~~~~~~~~~~i~~~~~~l-~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l 191 (532) T protein:vir:94 113 GKITCSSKDELAADKATRITQKLEQY-NVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGF 191 (532) T ss_pred ceEeeCCccccchHHHHHHHHHHHhh-hHHHHHHHHHHhhhcccceEEEEEeccCCccccccccccccccccccceeeEE Confidence 888542211 122222222 22334444556666899999887654322 34578 Q ss_pred EEeccceeEEEEcC--------CCceeEEEEeecCcccccceeecccceEEeccCCCC------ccccCcchHHHHHHHH Q lcl|NC_018285. 116 EYLRPSQVSFNRLD--------NQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVD------GGLTSVSPLMALGREL 181 (383) Q Consensus 116 ~~l~~~~v~~~~~~--------~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~------~~~~G~s~~~~~~~~i 181 (383) .+++|.+|+..... .+....|.+. ..+.+-++.|+||.+.... ..+.|.|.++.+...| T Consensus 192 ~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~-------~g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l 264 (532) T protein:vir:94 192 ATIEPMWLSPNAYNATDPTLPSFYKPDSWIAT-------SGKKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYV 264 (532) T ss_pred EeechheecccccccccccccccCCceeEEEc-------cCeeeccceEEEecCCCchhhhccccccccccHHHHHHHHH Confidence 88888888764322 1122333331 1235678889999754322 3357999999999999 Q ss_pred HHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHH--HhhcCCcceeecCC-CceeeecccChhhHHHHHHH Q lcl|NC_018285. 182 DIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQ--AMKQMQGGPLVLDD-LEDFTPLEIKSNVAQLLKQA 258 (383) Q Consensus 182 ~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~--~~~~~~g~~~vl~~-g~~~~~~~~~~~d~~~~e~~ 258 (383) .....+......+..........+.....++.+..+++.+.+. ....+..++++++. +.+|+.++.+..+ +.+.. 
T Consensus 265 ~~~~~t~~~~~~l~~~~~~~v~k~~~a~~ls~~~~~~~~~r~~~~~~~~~n~g~~~id~~~e~~e~~~~~lsg--l~~~l 342 (532) T protein:vir:94 265 DNWLRTRQSVSDTVKQFSMTNLATDMAQLLAPGGAQSLDARLQLFNLYRDNRNIGALDKGTEEIQQTNTPLSG--LDSLQ 342 (532) T ss_pred HHHHHHHHHHHHHHHhcCCceeeechHHhhcchhHHHHHHHHHHHHhhcCCccceEEcCCCceeEEEecccCC--HHHHH Confidence 9999988888887766554433332223345555566655543 22344455677765 5789988877654 56677 Q ss_pred HHHHHHHHHHhcCCHHHhccccc-C--cCHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcc----hhhccchhhh Q lcl|NC_018285. 259 DWTTGQFAKVYGIPENVVGGQGD-Q--QSSLEMSSNVYS-------KAVARYLRPFLSELSQKLSC----DVDADIFPAV 324 (383) Q Consensus 259 ~~~~~~Ia~~~gVpp~~lg~~~~-~--~~~~e~~~~~~~-------~~l~P~~~~i~~~l~~~l~~----~~e~~~~~~~ 324 (383) +....+||++.|||..+|.+.+. + ++.++..+.|+. ..+.|+++.+.+.|-+..+. ++.+...++. T Consensus 343 ~~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~~d~~~~f~pL~ 422 (532) T protein:vir:94 343 AQSQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQIDPGLAWEWSPLM 422 (532) T ss_pred HHHHHHHHhHhCCCeeeeecCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCceEEeCCCC Confidence 88888999999999998755332 2 222334444543 45788888887777665543 2333333333 Q ss_pred ccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhhcCCcCCc--------chhHH-----hC--C-CCC--------- Q lcl|NC_018285. 325 DPTGANY-------ISRINSMVKSGTLAQNQGLYILQQAEILPK--------ELPKG-----EN--P-NRT--------- 372 (383) Q Consensus 325 ~~~~~~~-------~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~--------d~~~~-----~~--~-~~~--------- 372 (383) ..+..+. ++.+..++.+|+++.+|+|+.++..|..+- ++... ++ . ..+ T Consensus 423 ~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (532) T protein:vir:94 423 ELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPN 502 (532) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCCccccccccccccccccccchhhhhcccccCCCCCCCCCCC Confidence 4444443 344567889999999999999987763211 00000 00 0 000 Q ss_pred CCCCCCCCCCC Q lcl|NC_018285. 
373 ILKGGETNGQD 383 (383) Q Consensus 373 ~~~ggd~~~~d 383 (383) |.+.++.++.| T Consensus 503 ~~~~~~~d~~~ 513 (532) T protein:vir:94 503 PQPDSEDDQTD 513 (532) T ss_pred CCCCCCCCCCC Confidence 00111111111 No 110 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.80 E-value=6.9e-20 Score=125.62 Aligned_cols=377 Identities=13% Similarity=0.059 Sum_probs=212.1 Q ss_pred CchhhhhhcCCcccc-ccccc---------ccchhhcccccCCceechh----hhhccHHHHHHHHHHHHhhhhCceeee Q lcl|NC_018285. 1 MPIFNLATESPPNNQ-GGFFD---------ITDPEFLATLNGSEWVSAE----TALKNSDLFSIISQLSNDLATAKLTTS 66 (383) Q Consensus 1 Mglf~~~~~~~~~~~-~~~~~---------~~~~~~~~~~~~~~~~~~~----~a~~~~~v~~~i~~ia~~ia~~p~~~~ 66 (383) |+=.+....+..... .+..+ ..+...++.......++.. .+.+++.+..+|+.+|+.+.+-++++. T Consensus 1 ~~~~~~a~~~~~~~~a~~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~r~g~~i~ 80 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDMVRAGWSLK 80 (461) T ss_pred CccchhhhhhhhhhhhhhhhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHhhcCCeeee Confidence 655544332221110 00000 0111111111111112322 223577788999999999998888775 Q ss_pred cchhh---hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-------------CceeEEEEec---cceeEEEE Q lcl|NC_018285. 67 RKQMQ---GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-------------GRDMKWEYLR---PSQVSFNR 127 (383) Q Consensus 67 ~~~~~---~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-------------g~~~~l~~l~---~~~v~~~~ 127 (383) ..+.. .+...-+... ..+-+..++.+..++|.|++++...+. +.+..+.+|. +..+.... 
T Consensus 81 ~~~~~~~~~~~~~~~~l~-~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~~~~l~~~~~~~i~~~~ 159 (461) T protein:vir:80 81 TDNKEMKKNIESKWRKLK-TKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKSIPYINTFNTQKVTQLY 159 (461) T ss_pred cCCHHHHHHHHHHHHHhh-HHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccceeEEEeccccccchhh Confidence 43321 1222222222 233445556667799999998864221 1122333332 22222111 Q ss_pred ---c----CCCceeEEEEeec---------CcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 128 ---L----DNQNGLYYNVTFD---------DPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLT 191 (383) Q Consensus 128 ---~----~~~~~~~y~~~~~---------~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 191 (383) + +.+.+..|++... +......+.+-++.|||+.+....+..+|.|.++.+.+.|.....+.... T Consensus 160 ~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~ 239 (461) T protein:vir:80 160 LNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSV 239 (461) T ss_pred hcccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHH Confidence 1 1234445555321 11223346788899999998887778889999999999999999999999 Q ss_pred HHHHhccCCcceeEeecC--CCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 192 LNSLKNALNANGILKIKG--GGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVY 269 (383) Q Consensus 192 ~~~~~ng~~~~~i~~~~~--~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~ 269 (383) ..+..+...+ +++.++ .+..+....+.+.+.... 
+..++++++.+-+|+.++.+..+ +.+..+.....||++- T Consensus 240 ~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~~~~~-~~~g~~~~d~~e~~e~~~~~lsg--l~~~l~~~~~~iaa~s 314 (461) T protein:vir:80 240 GQILYDFAFK--VYKTDDIDALNKDDKANLTAMLDFMF-RTEALAIIKGDEQLTKESTNVSG--MKDLLDYGWDYLAGAV 314 (461) T ss_pred HHHHHHhCCC--ceecchHHhhhchHHHHHHHHHHHhc-CCceEEEEcCCcceEEEecCcCC--HHHHHHHHHHHHhhhh Confidence 8888776554 344443 333344444444444333 34557888999999988887654 5677888899999999 Q ss_pred cCCHHHhcccccCc--CHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcc----------hhhccchhhhccCHHH Q lcl|NC_018285. 270 GIPENVVGGQGDQQ--SSLEMSSNVYS-------KAVARYLRPFLSELSQKLSC----------DVDADIFPAVDPTGAN 330 (383) Q Consensus 270 gVpp~~lg~~~~~~--~~~e~~~~~~~-------~~l~P~~~~i~~~l~~~l~~----------~~e~~~~~~~~~~~~~ 330 (383) +||..+|.+.+.+. +.++..+.|+. ..+.|+++.+.+.+-+..+. ++.+...++..++..+ T Consensus 315 ~iP~t~L~G~s~g~~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~ke 394 (461) T protein:vir:80 315 RMPKTVLKGQEAGTLTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKT 394 (461) T ss_pred cCCeeeeecccCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHH Confidence 99999886644433 33444555543 35677777776666554432 1222223333444444 Q ss_pred HH-------HHHHHHHhCCCcCHHHHHHHhh-cCCcC-----CcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 331 YI-------SRINSMVKSGTLAQNQGLYILQ-QAEIL-----PKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 331 ~~-------~~~~~l~~~g~~t~nE~r~~lg-~~~~~-----~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) .+ +.+..++++|+++++|+|+.+. 
.-+.+ +++-+..+.......+++..+..| T Consensus 395 kAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 460 (461) T protein:vir:80 395 DAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDAYAKKNAD 460 (461) T ss_pred HHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhccccccccCCC Confidence 43 4466788999999999999773 32222 112122222222233344444444 No 111 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.78 E-value=3.7e-19 Score=121.66 Aligned_cols=368 Identities=7% Similarity=-0.056 Sum_probs=197.6 Q ss_pred CchhhhhhcCCcccccccccccc----hhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchh------ Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITD----PEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQM------ 70 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~------ 70 (383) +||-+.....-.........+.. ....+.+.+. .+ ...+.+++.+..+|+.+|+++.+-.+++..... T Consensus 101 Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gy-ql-~alY~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~ 178 (862) T protein:vir:99 101 DDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGH-QA-CALIAQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDE 178 (862) T ss_pred hcchhhhhhccccccccccccchhccccccccCcccH-HH-HHHHHhCchhhhhhhhhhHHHhhCCceEeecCcccccCH Confidence 11111111000000000000000 0000111111 11 123457888999999999999999888864211 Q ss_pred ---hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeec-CC---------------CceeEEEEeccceeEEEE---- Q lcl|NC_018285. 71 ---QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRN-DN---------------GRDMKWEYLRPSQVSFNR---- 127 (383) Q Consensus 71 ---~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~-~~---------------g~~~~l~~l~~~~v~~~~---- 127 (383) ..+...-... ...+-+..++.+.-++|.+++++..+ .+ |.+..|.+|+|.+++... 
T Consensus 179 e~~~~ie~~~~rL-~v~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~ 257 (862) T protein:vir:99 179 ESLEKFKAIDVEF-KVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAES 257 (862) T ss_pred HHHHHHHHHHHHh-hHHHHHHHHHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhcccccccc Confidence 1122111122 12333344555566788887776532 12 234677788877765421 Q ss_pred cCC------CceeEEEEeecCcccccceeecccceEEeccCCCC------ccccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 128 LDN------QNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVD------GGLTSVSPLMALGRELDIQKASDKLTLNSL 195 (383) Q Consensus 128 ~~~------~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 195 (383) ..+ +....|.+. ...+-++-||||...... ..+.|.|.++.+...|.....+......++ T Consensus 258 ~~Dp~sp~yGkP~~y~I~--------g~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll 329 (862) T protein:vir:99 258 TADPSSQFFYEPEFWIIS--------GQKYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLA 329 (862) T ss_pred cccccccccCCceeeeec--------CeeeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 223334332 123556777777654322 235699999999999999999999999988 Q ss_pred hccCCcceeEeecCCCCHH-HHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_018285. 196 KNALNANGILKIKGGGLLD-FKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPEN 274 (383) Q Consensus 196 ~ng~~~~~i~~~~~~~~~e-~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~ 274 (383) .+......-+..-..+..+ ...+-.+.+..+. +..++++++.+-+|+.++.+..+ +.+.......+||++.+||.. 
T Consensus 330 ~ka~l~v~ktd~l~~l~~ed~l~~r~~~~~~~r-dN~Gi~liD~eEe~e~ls~slSG--L~dll~~~~q~IAaas~IP~t 406 (862) T protein:vir:99 330 MNKRTTAIHTDTAKAIANEDKFIQRLMFWVRYR-DNHAVKVLGTDETMEQFDTSLAD--FDAVIMGQYQLVASIAKTPAT 406 (862) T ss_pred HHhccceeechhHhhhccHHHHHHHHHHHHhcc-CcceeEEecCCCceeEEecccCC--hHHHHHHHHHHHHhhhCCCce Confidence 8866543333222233322 2222122233333 44558889999999988887754 456667778899999999999 Q ss_pred Hhcccc-c--CcCHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcch--hhccchhhhccCHHHHH-------HHH Q lcl|NC_018285. 275 VVGGQG-D--QQSSLEMSSNVYS-------KAVARYLRPFLSELSQKLSCD--VDADIFPAVDPTGANYI-------SRI 335 (383) Q Consensus 275 ~lg~~~-~--~~~~~e~~~~~~~-------~~l~P~~~~i~~~l~~~l~~~--~e~~~~~~~~~~~~~~~-------~~~ 335 (383) +|.+.+ + +++.++..+.||. ..|.|+++.+...+..++..+ +.+....+...+..+.+ +.+ T Consensus 407 iLfGqspaGlnATGE~D~~nYyD~I~s~QE~~L~P~LerL~~li~~~lg~~~d~~ieFnpL~~~sekEkAEi~kk~Aea~ 486 (862) T protein:vir:99 407 KLLGTAPKGFNSTGEFETISYHEELESIQEHVYMPFLQRHYLISRLSLGIQHEIDVVMEPVASMTAQQQADLNKTKAEGG 486 (862) T ss_pred eecccCcccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcceEEeCCCCCCCHHHHHHHHHHHHHHH Confidence 775533 3 3344555555654 457888888887776665432 33333344444444443 445 Q ss_pred HHHHhCCCcCHHHHHHHhhcCC------cCCcchhHH-----hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 336 NSMVKSGTLAQNQGLYILQQAE------ILPKELPKG-----ENPNRTILKGGETNGQD 383 (383) Q Consensus 336 ~~l~~~g~~t~nE~r~~lg~~~------~~~~d~~~~-----~~~~~~~~~ggd~~~~d 383 (383) ..++++|+++++|+|++|...+ ++..++... ++.. .+.+.|+...+. 
T Consensus 487 ~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~-~~e~~g~a~~~a 544 (862) T protein:vir:99 487 KVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLA-AYQKAGAAQETA 544 (862) T ss_pred HHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCccccc-ccccCCcccccc Confidence 6788999999999999874322 222222110 0000 011111111111 No 112 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.78 E-value=5.3e-19 Score=120.80 Aligned_cols=363 Identities=13% Similarity=0.032 Sum_probs=200.4 Q ss_pred CchhhhhhcCCcccccccccccchh-hcccccCCceechh-hhhccHHHHHHHHHHHHhhhhCceeeecchhh-hhccCC Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPE-FLATLNGSEWVSAE-TALKNSDLFSIISQLSNDLATAKLTTSRKQMQ-GIVDNP 77 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~-~l~~~P 77 (383) |--.+.+.+--. +..++. ..+......+.... .+.+++.+.++|+.+|+++.+..+++...+.. .+..+- T Consensus 1 ~~~~D~~~n~~~-------gg~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~~~~~~~~~~~ 73 (422) T protein:vir:10 1 MVKTDSYANIFL-------GGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRW 73 (422) T ss_pred CccchhhHHHHc-------CCCCCccccCcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCccccCCCHHHHHHHHH Confidence 333332221100 000000 00110111111112 23468889999999999999988888755433 222222 Q ss_pred CccCCHHHHHHHHHHHHHHcCCeEEEEeecC----------CCceeEEEEeccceeEEEEc-------CCCceeEEEEee Q lcl|NC_018285. 78 SNSANRFNFYQSIFAQMLLGGEAFAYRWRND----------NGRDMKWEYLRPSQVSFNRL-------DNQNGLYYNVTF 140 (383) Q Consensus 78 N~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~----------~g~~~~l~~l~~~~v~~~~~-------~~~~~~~y~~~~ 140 (383) ... ...+-+..++.+..++|.|++++...+ .|.+..+.++++.+|++... +.+.+..|.+.. 
T Consensus 74 ~~l-~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~~fg~P~~y~v~~ 152 (422) T protein:vir:10 74 DDL-EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNARFGEPLTYRITT 152 (422) T ss_pred HHh-hHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcccCccccccCcceEEEEec Confidence 222 233444555566678999999887522 35566888888888875432 223455666543 Q ss_pred cCcccccceeecccceEEeccCC------CCccccCcchHHH-HHHHHHHHHHHHHHHHHHHhccCCcceeEeecC---C Q lcl|NC_018285. 141 DDPRIPPKQHVPQSDILHFRLLS------VDGGLTSVSPLMA-LGRELDIQKASDKLTLNSLKNALNANGILKIKG---G 210 (383) Q Consensus 141 ~~~~~~~~~~~~~~dvih~~~~~------~~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~---~ 210 (383) .. .+..+.+-++.||||.+.. +...++|.|++.. +.+.|.....+.......+....... +++++ . T Consensus 153 ~~--~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v--~~~~~l~~~ 228 (422) T protein:vir:10 153 NE--SDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQAV--WKAKGLAEL 228 (422) T ss_pred CC--CCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc--ccchhHHHh Confidence 22 2333566677899986543 3345679999986 67889999999888888777655443 33332 1 Q ss_pred CC-HHHHHHHHHH---HHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc-C--c Q lcl|NC_018285. 211 GL-LDFKTKVSRS---RQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD-Q--Q 283 (383) Q Consensus 211 ~~-~e~~~~~~~~---~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~-~--~ 283 (383) .. .+......+. +.....+.+.+++.+++.+|+.++.+..+ +.+.......+||++.|||..+|.+.+. 
+ + T Consensus 229 ~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsg--l~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glna 306 (422) T protein:vir:10 229 CDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKNKNVGGVSS 306 (422) T ss_pred cCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEecccCC--hHHHHHHHHHHHHhhhCCCeeeeccCCcccccc Confidence 11 1122222222 22333444556666677899999888765 6777888899999999999998865433 2 2 Q ss_pred CHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcchhhccchhhhccCHHH-------HHHHHHHHHhCCCcCHHHH Q lcl|NC_018285. 284 SSLEMSSNVYS-------KAVARYLRPFLSELSQKLSCDVDADIFPAVDPTGAN-------YISRINSMVKSGTLAQNQG 349 (383) Q Consensus 284 ~~~e~~~~~~~-------~~l~P~~~~i~~~l~~~l~~~~e~~~~~~~~~~~~~-------~~~~~~~l~~~g~~t~nE~ 349 (383) +.++..+.|+. ..+.|.++.+-+.+-.. .++.+...++..++..+ .++.+..++++|+++++|+ T Consensus 307 tgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~~s--~~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~e~ 384 (422) T protein:vir:10 307 SQNTALETFHKLVDRKRNAELLPILEFLIPFIVNA--EEWSVEFNPLAQESSKDKAEILEKNVNSIAALIAAGAMDIDEA 384 (422) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHH Confidence 33344445554 34555555544333211 12222222333334333 3455667889999999999 Q ss_pred HHHhhc----CCcC----CcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 350 LYILQQ----AEIL----PKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 350 r~~lg~----~~~~----~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) |+.|.. .++. +.|+...+..+. 
..++.++| T Consensus 385 r~~L~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~d 422 (422) T protein:vir:10 385 RDTLRTIAPEVKINDGSVETEVTISETSND----PLEVPTDD 422 (422) T ss_pred HHHhhhhcccccCCCCCCccccchhhcCCC----CCCCCCCC Confidence 998843 2222 222222222211 12333333 No 113 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.78 E-value=6.4e-19 Score=120.35 Aligned_cols=383 Identities=10% Similarity=0.040 Sum_probs=219.5 Q ss_pred CchhhhhhcC--------Ccccc---cccccccchhhcccccCCce----e----------chhhhhccHHHHHHHHHHH Q lcl|NC_018285. 1 MPIFNLATES--------PPNNQ---GGFFDITDPEFLATLNGSEW----V----------SAETALKNSDLFSIISQLS 55 (383) Q Consensus 1 Mglf~~~~~~--------~~~~~---~~~~~~~~~~~~~~~~~~~~----~----------~~~~a~~~~~v~~~i~~ia 55 (383) |+|+++...- +...+ ..+..........++..... + +..-+..++.+..+|+.+. T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 9999984221 11111 11211111111111111111 0 1122457888999999777 Q ss_pred HhhhhC-ceeee----cch--------------hhhhccCC--CccCCHHHHHHHHHHHHHHcCCeEEEEeecCC----- Q lcl|NC_018285. 56 NDLATA-KLTTS----RKQ--------------MQGIVDNP--SNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN----- 109 (383) Q Consensus 56 ~~ia~~-p~~~~----~~~--------------~~~l~~~P--N~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~----- 109 (383) +.+-.. -+.+. ..+ ...+.+.| +-.++.+.+...++..++..|++|+.+++... 
T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~ 160 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTP 160 (502) T ss_pred HhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCC Confidence 666543 23221 110 01122222 23478899999999999999999999876443 Q ss_pred --CceeEEEEeccceeE------------EEEcCCCceeEEEEeecCccc---ccceeecccceEEeccCCCCccccCcc Q lcl|NC_018285. 110 --GRDMKWEYLRPSQVS------------FNRLDNQNGLYYNVTFDDPRI---PPKQHVPQSDILHFRLLSVDGGLTSVS 172 (383) Q Consensus 110 --g~~~~l~~l~~~~v~------------~~~~~~~~~~~y~~~~~~~~~---~~~~~~~~~dvih~~~~~~~~~~~G~s 172 (383) +.+..|..|+|+++. +..+..+..+.|.+.-..+.. ...+.+++++|+|+..+...+..+|+| T Consensus 161 g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~~~r~gQ~RGis 240 (502) T protein:vir:79 161 SAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRRLHQMRGTS 240 (502) T ss_pred CcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhheEEeecccCCccccCCc Confidence 346789999998875 234456677788876543322 334679999999999888888899999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHH-HHHHhhcCCccee-ecCCCceeeecccChh Q lcl|NC_018285. 173 PLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSR-SRQAMKQMQGGPL-VLDDLEDFTPLEIKSN 250 (383) Q Consensus 173 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~-~~~~~~~~~g~~~-vl~~g~~~~~~~~~~~ 250 (383) .+..+...+............-.+-.+...++|+.+..........-.. .-....-..|.++ .|..|.+++..+.+.. 
T Consensus 241 ~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p 320 (502) T protein:vir:79 241 LLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKENERELTIQPGIIYDDLKPGEEIGMVKSDRP 320 (502) T ss_pred hHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCCCCCccccccccCCccccccCCCceeeeeCCCCC Confidence 9999999998888888888777777788888888653211110000000 0000011245555 5899999998887766 Q ss_pred hHHHHHHHHHHHHHHHHHhcCCHHHhccc--ccCcCHHHHHH-----------HHHHHHHHHHHHHHHHH-HHHhhcc-- Q lcl|NC_018285. 251 VAQLLKQADWTTGQFAKVYGIPENVVGGQ--GDQQSSLEMSS-----------NVYSKAVARYLRPFLSE-LSQKLSC-- 314 (383) Q Consensus 251 d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~--~~~~~~~e~~~-----------~~~~~~l~P~~~~i~~~-l~~~l~~-- 314 (383) ...|.+..+...+.||+.+|||.+.|.+. .+|++...... .|....++|+.+.+.++ +-...++ T Consensus 321 ~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p 400 (502) T protein:vir:79 321 NPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLP 400 (502) T ss_pred CCCHHHHHHHHHHHHHhhcCCCHHHHhccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCC Confidence 66788889999999999999999999642 33333322221 23333444444433222 1111111 Q ss_pred h-hhccc--------hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc-----hhHHh--CCCCCC----- Q lcl|NC_018285. 315 D-VDADI--------FPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE-----LPKGE--NPNRTI----- 373 (383) Q Consensus 315 ~-~e~~~--------~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d-----~~~~~--~~~~~~----- 373 (383) . .+.+. -.....|+.+.+.....++++|++|+-|+-+..|..+-.--+ ...++ ++.... T Consensus 401 ~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~ 480 (502) T protein:vir:79 401 RDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDENRKLDLVFDTDPASD 480 (502) T ss_pred CCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCC Confidence 0 01110 011124667777777889999999999999888866521111 11111 111110 Q ss_pred ------------CCCCCCCCCC Q lcl|NC_018285. 
374 ------------LKGGETNGQD 383 (383) Q Consensus 374 ------------~~ggd~~~~d 383 (383) .+.++.+.+| T Consensus 481 ~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 481 KGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred CCCCCCCCCCCCCCCCCCCCCC Confidence 0111122222 No 114 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.78 E-value=6.7e-19 Score=120.20 Aligned_cols=369 Identities=12% Similarity=0.044 Sum_probs=203.4 Q ss_pred Cchhhh--hhcCCcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhhccCCC Q lcl|NC_018285. 1 MPIFNL--ATESPPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVDNPS 78 (383) Q Consensus 1 Mglf~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~PN 78 (383) |+.|+. +..--.....+. ..+ +.... .+..+ ...+.+++.+.++|+.+|+++.+..+++........+..-- T Consensus 1 ~~~~~~d~~~~~~~~~~~~~---~~~-~~~~~-~~~~l-~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~~~~~~~~~ 74 (427) T protein:vir:10 1 MKIVKHDGYNDIFNGGADGS---PKP-FFMSD-ASYHV-GSFYNDNATAKRIVDVIPEEMVTAGFKMSGVKDEKEFKSLW 74 (427) T ss_pred CCccccchHHHHhhcCCCCc---ccC-ccccC-chHHH-HHHHHcCchhhhhhccchHHhhcCCccccCccHHHHHHHHH Confidence 777764 000000000000 001 01111 11111 12244688899999999999999998887543322221111 Q ss_pred ccCCHHHHHHHHHHHHHHcCCeEEEEeec----------CCCceeEEEEeccceeEEEEcC-------CCceeEEEEeec Q lcl|NC_018285. 79 NSANRFNFYQSIFAQMLLGGEAFAYRWRN----------DNGRDMKWEYLRPSQVSFNRLD-------NQNGLYYNVTFD 141 (383) Q Consensus 79 ~~~t~~~f~~~~~~~~~l~G~a~~~i~r~----------~~g~~~~l~~l~~~~v~~~~~~-------~~~~~~y~~~~~ 141 (383) ......+-+..++.+..++|.+++++..+ ..|.+..+.+++++++++.... .+.+.+|.+... 
T Consensus 75 ~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~s~~fg~P~~y~v~~~ 154 (427) T protein:vir:10 75 DSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTNARSPRYGEPEIYKVSPG 154 (427) T ss_pred HHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccccCccccccCcceEEEEecC Confidence 11223344556667777899999988542 2466789999998887665432 234555665422 Q ss_pred CcccccceeecccceEEeccCC------CCccccCcchHH-HHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC---C Q lcl|NC_018285. 142 DPRIPPKQHVPQSDILHFRLLS------VDGGLTSVSPLM-ALGRELDIQKASDKLTLNSLKNALNANGILKIKGG---G 211 (383) Q Consensus 142 ~~~~~~~~~~~~~dvih~~~~~------~~~~~~G~s~~~-~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~---~ 211 (383) .....+.+-++.|+||.+.. +...++|.|++. .+...|.....+.......+......- +++++- + T Consensus 155 --~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v--~k~~~l~~~~ 230 (427) T protein:vir:10 155 --DNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAV--WKVKGLAEMC 230 (427) T ss_pred --CCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhcccc--ccchhHHHHh Confidence 22334567788899986542 234567999985 577888888888888888776654433 333321 1 Q ss_pred C-HHHHHHHHHH---HHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc-C--cC Q lcl|NC_018285. 212 L-LDFKTKVSRS---RQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD-Q--QS 284 (383) Q Consensus 212 ~-~e~~~~~~~~---~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~-~--~~ 284 (383) + .+......+. +....++.+.+++.+.+.+|+.++.+... +.+.......+||++.+||..+|.+.+. 
+ ++ T Consensus 231 ~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsg--l~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnst 308 (427) T protein:vir:10 231 DDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSAS 308 (427) T ss_pred cCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEecccCC--hHHHHHHHHHHHHhhhCCCeeeeccCCccccccc Confidence 1 1111122222 22334455666666777899988887754 5677788889999999999998865433 2 22 Q ss_pred HHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcchhhccchhhhccCHHH-------HHHHHHHHHhCCCcCHHHHH Q lcl|NC_018285. 285 SLEMSSNVYS-------KAVARYLRPFLSELSQKLSCDVDADIFPAVDPTGAN-------YISRINSMVKSGTLAQNQGL 350 (383) Q Consensus 285 ~~e~~~~~~~-------~~l~P~~~~i~~~l~~~l~~~~e~~~~~~~~~~~~~-------~~~~~~~l~~~g~~t~nE~r 350 (383) .++..+.|+. ..+.|.++.+-+.+-.. .++.+...++...+..+ .++.+..++++|+++++|+| T Consensus 309 gd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~~s--~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r 386 (427) T protein:vir:10 309 QNTALETFYKLVDRKREEDYRPLLEFLLPFIVDE--EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEAR 386 (427) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC--CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHH Confidence 2344445554 34556555554433211 22222222333333333 34556678899999999999 Q ss_pred HHhh----cCCcCCcc-h--hHHhC-CCCCCCCCCCCCCCC Q lcl|NC_018285. 351 YILQ----QAEILPKE-L--PKGEN-PNRTILKGGETNGQD 383 (383) Q Consensus 351 ~~lg----~~~~~~~d-~--~~~~~-~~~~~~~ggd~~~~d 383 (383) +.|. ..++.+++ + ...+. 
...+|..|-+.+++| T Consensus 387 ~~L~~~~~~~~~~~~~~~~~e~~~~~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 387 DTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLEDEN 427 (427) T ss_pred HHHHhhhccccCCCCccccccccchhcCCCCCCCCCCCCCC Confidence 9873 44443321 1 11111 111222222223333 No 115 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.74 E-value=7.2e-18 Score=114.55 Aligned_cols=371 Identities=9% Similarity=-0.053 Sum_probs=195.9 Q ss_pred Cc----hhhhhhcCCccc--c----cc-cc---ccc-------chhhcccccC----Cce-----------e----chhh Q lcl|NC_018285. 1 MP----IFNLATESPPNN--Q----GG-FF---DIT-------DPEFLATLNG----SEW-----------V----SAET 40 (383) Q Consensus 1 Mg----lf~~~~~~~~~~--~----~~-~~---~~~-------~~~~~~~~~~----~~~-----------~----~~~~ 40 (383) |. ++++..+....+ + .. .. .+. .+.+.....+ ... + -... T Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~al 116 (765) T protein:vir:96 37 MIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAI 116 (765) T ss_pred chhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhhHHHhhhcccCCccHHHHHH Confidence 33 333322111000 0 00 00 000 0001000000 000 0 0112 Q ss_pred hhccHHHHHHHHHHHHhhhhCceeeecchh-------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecC-C--- Q lcl|NC_018285. 41 ALKNSDLFSIISQLSNDLATAKLTTSRKQM-------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRND-N--- 109 (383) Q Consensus 41 a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~-------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~-~--- 109 (383) +.+++.+..+|+.+|+++.+-.+++..... ..+...-.. ....+-+..++.+..++|.+|+++..+. 
+ T Consensus 117 Y~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e~~~~~~~~l~~~~~r-l~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~ 195 (765) T protein:vir:96 117 ISQHWLVDKACSMSGEDAARNGWELKSDGRKLSDEQSALIARRDME-FRVKDNLVELNRFKNVFGVRIALFVVESDDPDY 195 (765) T ss_pred HHhCchhhhhhhcchHHhhcCCceeecCccccCHHHHHHHHHHHHH-hhHHHHHHHHHHHhhhceeeEEEEEecccCcch Confidence 346888999999999999998888754221 111111111 1234445556677778999988775431 2 Q ss_pred ------------CceeEEEEeccceeEEEE----c-C-----CCceeEEEEeecCcccccceeecccceEEeccCCC--- Q lcl|NC_018285. 110 ------------GRDMKWEYLRPSQVSFNR----L-D-----NQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSV--- 164 (383) Q Consensus 110 ------------g~~~~l~~l~~~~v~~~~----~-~-----~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~--- 164 (383) |....|..++|.++.... . + .+....|.+. ...+-++.||||..... T Consensus 196 l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i~--------g~~IH~SRli~~~g~~lpd~ 267 (765) T protein:vir:96 196 YEKPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWIIS--------GKKYHRSHLVVVRGPQPPDI 267 (765) T ss_pred hhccccccccccceeeEEEEechhhcccccchhccccccccccCcceeeeec--------CceeccceEEEecCCCchhh Confidence 233456666665554421 1 1 1112233322 12355677888865442 Q ss_pred ---CccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCH-HHHHHHHHHHHHhhcCCcceeecCCCc Q lcl|NC_018285. 165 ---DGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLL-DFKTKVSRSRQAMKQMQGGPLVLDDLE 240 (383) Q Consensus 165 ---~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~-e~~~~~~~~~~~~~~~~g~~~vl~~g~ 240 (383) ...+.|.|.++.+...|.....+......++........-+.....+.. ++..+-.+.+... .+..++++++.+. 
T Consensus 268 lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~r~~~~~~~-r~n~g~~~id~ee 346 (765) T protein:vir:96 268 LKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIANEDAFNARLAFWIAN-RDNHGVKVIGIDE 346 (765) T ss_pred hccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeeechHhhhccHHHHHHHHHHHHHh-cCCceeEEecCCc Confidence 2345799999999999999999999988888886655443333333322 2222222233333 3445688899999 Q ss_pred eeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc-cC--cCHHHHHHHHHH-------HHHHHHHHHHHHHHHH Q lcl|NC_018285. 241 DFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG-DQ--QSSLEMSSNVYS-------KAVARYLRPFLSELSQ 310 (383) Q Consensus 241 ~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~-~~--~~~~e~~~~~~~-------~~l~P~~~~i~~~l~~ 310 (383) +|+.++.+..+ +.+.......+||++.+||..+|.+.+ ++ ++.++..+.||. ..+.|.++.+-+.|-. T Consensus 347 ~~e~~s~~lsg--l~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~ 424 (765) T protein:vir:96 347 TMEQFDTNLSD--FDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELESIQEHIFDPLLERHYLLLAK 424 (765) T ss_pred ceeEEecccCC--HHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999887754 567778889999999999998886543 33 333444555554 4456666655555443 Q ss_pred hhc--chhhccchhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhhcCCc------CCcchhHHhCCCCCC-- Q lcl|NC_018285. 311 KLS--CDVDADIFPAVDPTGANYI-------SRINSMVKSGTLAQNQGLYILQQAEI------LPKELPKGENPNRTI-- 373 (383) Q Consensus 311 ~l~--~~~e~~~~~~~~~~~~~~~-------~~~~~l~~~g~~t~nE~r~~lg~~~~------~~~d~~~~~~~~~~~-- 373 (383) .-. +++.+....+...+..+.+ +.+..++.+|+++++|+|+.|+..+. +.+++.......... T Consensus 425 s~~i~~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~~ 504 (765) T protein:vir:96 425 SESIDVQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPENLA 504 (765) T ss_pred hcCCCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccccCCCccccc Confidence 211 1233333334444544443 44667889999999999999876552 212211111110000 Q ss_pred --CCCCCCCCCC Q lcl|NC_018285. 
374 --LKGGETNGQD 383 (383) Q Consensus 374 --~~ggd~~~~d 383 (383) .+++++++.. T Consensus 505 ~~~~~~~~~~~~ 516 (765) T protein:vir:96 505 ELEKAGAQSAKA 516 (765) T ss_pred cccCCCcccccc Confidence 0111111000 No 116 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=99.72 E-value=4.1e-16 Score=104.94 Aligned_cols=381 Identities=10% Similarity=0.051 Sum_probs=228.9 Q ss_pred hhhhhhcCCcccccccc---cccch-hhc------ccccCCcee-chhhhh-ccHHHHHHHHHHHHhhhhCceeeecchh Q lcl|NC_018285. 3 IFNLATESPPNNQGGFF---DITDP-EFL------ATLNGSEWV-SAETAL-KNSDLFSIISQLSNDLATAKLTTSRKQM 70 (383) Q Consensus 3 lf~~~~~~~~~~~~~~~---~~~~~-~~~------~~~~~~~~~-~~~~a~-~~~~v~~~i~~ia~~ia~~p~~~~~~~~ 70 (383) .-.+.....+..+.+.. .+.+. ..+ +.+.++..+ ..+..+ +.+.|.+|++.+...|.+++|+|.-... T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~w~v~p~~~ 80 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTPWRIRANGA 80 (469) T ss_pred CCCcccCCCCccchhhhhhcccccchhhccccccccccccccchHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCC Confidence 11111111111111111 00100 001 011111111 123333 5889999999999999999999964321 Q ss_pred h---------hhccC----------C--CccCCHHHHHHHHHHHHHHcCCeEEEEeecC-----CCc--eeEEEEeccce Q lcl|NC_018285. 71 Q---------GIVDN----------P--SNSANRFNFYQSIFAQMLLGGEAFAYRWRND-----NGR--DMKWEYLRPSQ 122 (383) Q Consensus 71 ~---------~l~~~----------P--N~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~-----~g~--~~~l~~l~~~~ 122 (383) . .|... + +-..++.+++..++.+.+.+|-++.++++.. +|. 
+..|.+.|+.+ T Consensus 81 ~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~~l~~rp~~~ 160 (469) T protein:vir:10 81 SDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLRKLAPRPQWT 160 (469) T ss_pred CHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeeeeeeecCccc Confidence 1 11110 1 1134678888888888888999999999753 343 56677777766 Q ss_pred eE-EEEcCCCceeEEEEeecC--------cccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 123 VS-FNRLDNQNGLYYNVTFDD--------PRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLN 193 (383) Q Consensus 123 v~-~~~~~~~~~~~y~~~~~~--------~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 193 (383) +. +..+.+++.+.++..... .......+++....|+.++....+..+|.|.+..+......-....++... T Consensus 161 i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~ 240 (469) T protein:vir:10 161 ISKFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAA 240 (469) T ss_pred ceeeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHH Confidence 53 334445555554432111 011234567788877777776677788999999999999898999999999 Q ss_pred HHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCH Q lcl|NC_018285. 
194 SLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPE 273 (383) Q Consensus 194 ~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp 273 (383) +.+..|.|--+.+.+...++++++.+.+.......+....++++.|++++-++.+.....+.+..++.-++|+.++--.- T Consensus 241 f~EryG~P~~vgky~~~a~~~ek~~l~~a~~~~~~g~~a~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~t 320 (469) T protein:vir:10 241 TAERNGMGIPVGTASSATDEDEVRKMAALARSVRGGINAGVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIALSGLAHF 320 (469) T ss_pred HHHHcCCcceEEecCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEeecCCCchHHHHHHHHHHHHHHHHHhccc Confidence 99999999999999988888888888887776654444566788898887776665555688888888888887652111 Q ss_pred HHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch---hhccc--------hhhhccCHHHHHHHHHHHHhCC Q lcl|NC_018285. 274 NVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSCD---VDADI--------FPAVDPTGANYISRINSMVKSG 342 (383) Q Consensus 274 ~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~---~e~~~--------~~~~~~~~~~~~~~~~~l~~~g 342 (383) --.+..+..+...+.........+.-.++.|+..||+.|+.. +.|.. ....+.+....++.+..|+..| T Consensus 321 lTs~~~gGS~a~~~vh~ev~~d~~~sDa~~i~~tln~~li~~l~~lN~g~~~~~P~~~~~~~e~~~~~~a~~i~~l~~~G 400 (469) T protein:vir:10 321 LNLDGKGGSYALASVLEDPFTQAVHAYATSICRIANQHIIEDLVDINFGVDTPAPVLTFDPIGSRQDLTAAAVKLLYDAG 400 (469) T ss_pred ccccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecCCCCcHHHHHHHHHHHHhcC Confidence 001111111122344456677788888999999999877653 11110 1122344556678888999999 Q ss_pred C-----cCHHHHHHHhhcCCcCCcchh---HHhCCCCCCCCCCCC---CCCC Q lcl|NC_018285. 343 T-----LAQNQGLYILQQAEILPKELP---KGENPNRTILKGGET---NGQD 383 (383) Q Consensus 343 ~-----~t~nE~r~~lg~~~~~~~d~~---~~~~~~~~~~~ggd~---~~~d 383 (383) + .+.+.+|+.+|.+....++-. ...+.......+++. 
.+++ T Consensus 401 ~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (469) T protein:vir:10 401 VFDDDPAVKRAIRQRFNLPSELNDTPSAEPEEPAAVPNQSAAPARTRSSGNA 452 (469) T ss_pred CccCccccHHHHHHHhCCCCCCCCcccccchhcccCCCCCccccccCCCCCc Confidence 8 456789999998864433211 111111111111111 1111 No 117 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.71 E-value=1.7e-17 Score=112.54 Aligned_cols=382 Identities=13% Similarity=0.026 Sum_probs=216.1 Q ss_pred CchhhhhhcCC--c------c---cccccccccchhhcccccCCcee--------------chhhhhccHHHHHHHHHHH Q lcl|NC_018285. 1 MPIFNLATESP--P------N---NQGGFFDITDPEFLATLNGSEWV--------------SAETALKNSDLFSIISQLS 55 (383) Q Consensus 1 Mglf~~~~~~~--~------~---~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~a~~~~~v~~~i~~ia 55 (383) |+||+++..-- . . ....+...........+...... +..-+..++.+..||+.+. T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 99999864311 0 0 01112222111111111111000 1111346788889999886 Q ss_pred Hhhhh---Cceee--ecch---h-----------hhhccCCC--ccCCHHHHHHHHHHHHHHcCCeEEEEeecCC----- Q lcl|NC_018285. 56 NDLAT---AKLTT--SRKQ---M-----------QGIVDNPS--NSANRFNFYQSIFAQMLLGGEAFAYRWRNDN----- 109 (383) Q Consensus 56 ~~ia~---~p~~~--~~~~---~-----------~~l~~~PN--~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~----- 109 (383) +.+-. +.++- ...+ . ..+..+|. -.++.+.+...++..++..|++|+.+.+... 
T Consensus 81 ~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~ 160 (548) T protein:vir:95 81 ERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTF 160 (548) T ss_pred HhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccC Confidence 66553 22221 1111 0 11222332 3478999999999999999999999876432 Q ss_pred --CceeEEEEeccceeEEE-------------EcCCCceeEEEEeecCcc-------cccceeecccceEEeccCCCCcc Q lcl|NC_018285. 110 --GRDMKWEYLRPSQVSFN-------------RLDNQNGLYYNVTFDDPR-------IPPKQHVPQSDILHFRLLSVDGG 167 (383) Q Consensus 110 --g~~~~l~~l~~~~v~~~-------------~~~~~~~~~y~~~~~~~~-------~~~~~~~~~~dvih~~~~~~~~~ 167 (383) ..+..|..|+|+++..- .+..+..+.|.+....+. ....+.+++++|+|+..+...+. T Consensus 161 g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA~~VlHif~~~r~gQ 240 (548) T protein:vir:95 161 ATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEAERIIHIAYRKRIGQ 240 (548) T ss_pred CcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccceeeechhHheecccccCCcc Confidence 24678899998887432 234455677877654332 12346799999999998888888 Q ss_pred ccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhh-cCCccee-ecCCCceeeec Q lcl|NC_018285. 168 LTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMK-QMQGGPL-VLDDLEDFTPL 245 (383) Q Consensus 168 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~-~~~g~~~-vl~~g~~~~~~ 245 (383) .+|+|.+..+...+............-.+=.+...++|+.+....... ........... -..|.++ .|..|.+++.. 
T Consensus 241 ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~-~~~~~~~~~~~~~~pG~iv~~L~pGe~i~~~ 319 (548) T protein:vir:95 241 NRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTV-EPGKDRKNRTIPIAPGMVFDDLEPGEDVGMI 319 (548) T ss_pred ccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccC-CCCcccccccccccCCccccccCCCceeeec Confidence 999999999999999888888888777777777788887653321110 00000000011 1235444 58899999888 Q ss_pred ccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc--ccCcCHHHHHH-----------HHHHHHHHHHHHHHHHH-HHHh Q lcl|NC_018285. 246 EIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ--GDQQSSLEMSS-----------NVYSKAVARYLRPFLSE-LSQK 311 (383) Q Consensus 246 ~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~--~~~~~~~e~~~-----------~~~~~~l~P~~~~i~~~-l~~~ 311 (383) +.+.....|.+..+...+.||+.+|||.+.|.+. .+|++...... .|....++|+.+.+.++ +-.- T Consensus 320 ~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G 399 (548) T protein:vir:95 320 ESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDGTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLAR 399 (548) T ss_pred CCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Confidence 8775666889999999999999999999999642 23333322221 23333444443333222 1111 Q ss_pred hc--ch-hhc----cch----hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCc-----chhHHh--CCCCC- Q lcl|NC_018285. 312 LS--CD-VDA----DIF----PAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPK-----ELPKGE--NPNRT- 372 (383) Q Consensus 312 l~--~~-~e~----~~~----~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~-----d~~~~~--~~~~~- 372 (383) .+ +. .+. ... ...-.|+.+.+.....++++|++|+-|+-...|..+-.-- |....+ ++... T Consensus 400 ~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~~~~ 479 (548) T protein:vir:95 400 KERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAGLVFSS 479 (548) T ss_pred CcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCC Confidence 11 10 010 000 0112567777777788999999999999998886652111 111111 11110 Q ss_pred -----CCCCCCCCCC-C Q lcl|NC_018285. 
373 -----ILKGGETNGQ-D 383 (383) Q Consensus 373 -----~~~ggd~~~~-d 383 (383) +.+++++... + T Consensus 480 ~~~~~~~~~~~~~~~~~ 496 (548) T protein:vir:95 480 DAYHQLVKSGMDPVEAV 496 (548) T ss_pred cccccccccccCCCCch Confidence 1111111111 1 No 118 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.68 E-value=4.2e-17 Score=110.36 Aligned_cols=381 Identities=12% Similarity=-0.010 Sum_probs=220.8 Q ss_pred CchhhhhhcCCcc----c----ccccccccchhh-ccc-----ccCC-cee----------chhhhhccHHHHHHHHHHH Q lcl|NC_018285. 1 MPIFNLATESPPN----N----QGGFFDITDPEF-LAT-----LNGS-EWV----------SAETALKNSDLFSIISQLS 55 (383) Q Consensus 1 Mglf~~~~~~~~~----~----~~~~~~~~~~~~-~~~-----~~~~-~~~----------~~~~a~~~~~v~~~i~~ia 55 (383) |.++++....... . ...+........ .++ ..+. ..+ +..-+..++.+..+|+.+. T Consensus 8 ~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 87 (505) T protein:vir:96 8 PSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQLLK 87 (505) T ss_pred cchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 8888886431110 0 011111110000 000 0010 000 1122357888999999777 Q ss_pred Hhhhh-Cceeeecc----------h--------hhhhccCCCc----cCCHHHHHHHHHHHHHHcCCeEEEEeecCC-Cc Q lcl|NC_018285. 56 NDLAT-AKLTTSRK----------Q--------MQGIVDNPSN----SANRFNFYQSIFAQMLLGGEAFAYRWRNDN-GR 111 (383) Q Consensus 56 ~~ia~-~p~~~~~~----------~--------~~~l~~~PN~----~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-g~ 111 (383) +.+-. ..++..-. . ...+...+|. .++.+++...++..++..|++|+.+++... .. 
T Consensus 88 ~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~ 167 (505) T protein:vir:96 88 NNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGYPNKW 167 (505) T ss_pred HHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEeecCCCCc Confidence 66654 44443211 0 1122344553 367899999999999999999998876433 34 Q ss_pred eeEEEEeccceeEEE----------------EcCCCceeEEEEeecCcc---------cccceeecccceEEeccCCCCc Q lcl|NC_018285. 112 DMKWEYLRPSQVSFN----------------RLDNQNGLYYNVTFDDPR---------IPPKQHVPQSDILHFRLLSVDG 166 (383) Q Consensus 112 ~~~l~~l~~~~v~~~----------------~~~~~~~~~y~~~~~~~~---------~~~~~~~~~~dvih~~~~~~~~ 166 (383) +..|..|+|+++..- .+..+..+.|.+....+. ....+.+++++|+|+..+...+ T Consensus 168 ~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r~g 247 (505) T protein:vir:96 168 GYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYERVPADEIIHTFVPWRPH 247 (505) T ss_pred ceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccccccCHhHhhhhhcccCCc Confidence 678888888887432 234455677877543321 1234558899999999888888 Q ss_pred cccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCC-CHHHHHHHHHHHHHhhcCCcceeecCCCceeeec Q lcl|NC_018285. 167 GLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGG-LLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPL 245 (383) Q Consensus 167 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~-~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~ 245 (383) ..+|+|.+..+...+.......+....-.+=.+...++|+.+... .+.....-.. 
....-..|.+..|..|.+++.+ T Consensus 248 Q~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~--~~~~l~pG~i~~L~pGe~i~~~ 325 (505) T protein:vir:96 248 QNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDDQGE--IVEEVEAGTYQLLPYGIRFKEH 325 (505) T ss_pred cccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCccccccCc--cccccCCceeeecCCCCeeeee Confidence 999999999999999888888888877777777788888875432 1111000000 0011235677889999999998 Q ss_pred ccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc---ccCcCHHHHH-----------HHHHHHHHHHHHHHHHHH-HHH Q lcl|NC_018285. 246 EIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ---GDQQSSLEMS-----------SNVYSKAVARYLRPFLSE-LSQ 310 (383) Q Consensus 246 ~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~---~~~~~~~e~~-----------~~~~~~~l~P~~~~i~~~-l~~ 310 (383) +.+....+|.+..+...+.||+.+|||.+.|-+. .+|++..... ..|....++|+.+.+.++ +-. T Consensus 326 ~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~ 405 (505) T protein:vir:96 326 KIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLT 405 (505) T ss_pred CCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 8886667889999999999999999999988532 2333332222 223344555544443322 211 Q ss_pred hhcc--hhhcc----ch----hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc-----hhHHhCCCC---- Q lcl|NC_018285. 311 KLSC--DVDAD----IF----PAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE-----LPKGENPNR---- 371 (383) Q Consensus 311 ~l~~--~~e~~----~~----~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d-----~~~~~~~~~---- 371 (383) ..++ ....+ .. 
...-.|+.+.+.....++++|++|+-|+-...|..+-.--+ ....+..+- T Consensus 406 G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~ 485 (505) T protein:vir:96 406 QALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDPEDVFDEIAWEEQLMRDKGVNPTP 485 (505) T ss_pred CCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCC Confidence 1111 11111 00 11125677777777889999999999998888876521111 111111111 Q ss_pred --CCCCCC-----CCCCCC Q lcl|NC_018285. 372 --TILKGG-----ETNGQD 383 (383) Q Consensus 372 --~~~~gg-----d~~~~d 383 (383) .....+ +.+.+| T Consensus 486 ~~~~~~~~~~~~~~~~~~d 504 (505) T protein:vir:96 486 PEQESKDATTDEEDDSASD 504 (505) T ss_pred CCCCCCCCCCCCCCCCCCC Confidence 011111 112222 No 119 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.68 E-value=3e-16 Score=105.71 Aligned_cols=383 Identities=10% Similarity=0.004 Sum_probs=216.4 Q ss_pred CchhhhhhcC--Cc--ccccccccc-c-chhhcccccCCce-----------e---chhhhhccHHHHHHHHHHHHhhhh Q lcl|NC_018285. 1 MPIFNLATES--PP--NNQGGFFDI-T-DPEFLATLNGSEW-----------V---SAETALKNSDLFSIISQLSNDLAT 60 (383) Q Consensus 1 Mglf~~~~~~--~~--~~~~~~~~~-~-~~~~~~~~~~~~~-----------~---~~~~a~~~~~v~~~i~~ia~~ia~ 60 (383) |+.-..+.-. +. ...+..... + .....++...... + +..-+..++.+.+||+.+.+.+-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG 80 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVG 80 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhC Confidence 5443322111 10 000000000 0 0111111111100 0 112235688899999999888877 Q ss_pred Cceeeecc-----------hh-----------hhhccCCCc------cCCHHHHHHHHHHHHHHcCCeEEEEeecCC-C- Q lcl|NC_018285. 
61 AKLTTSRK-----------QM-----------QGIVDNPSN------SANRFNFYQSIFAQMLLGGEAFAYRWRNDN-G- 110 (383) Q Consensus 61 ~p~~~~~~-----------~~-----------~~l~~~PN~------~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-g- 110 (383) ..+++.-. .. ..+...|+. .++.+++.+.++..++..|++|+.+++... | T Consensus 81 ~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~g~ 160 (530) T protein:vir:38 81 SFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSDSTR 160 (530) T ss_pred CCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeeccCCCC Confidence 77765421 01 112234442 478899999999999999999999986543 3 Q ss_pred -ceeEEEEeccceeEEE--------------EcCCCceeEEEEeecCcc-c--c------cceeecccceEEeccCCCCc Q lcl|NC_018285. 111 -RDMKWEYLRPSQVSFN--------------RLDNQNGLYYNVTFDDPR-I--P------PKQHVPQSDILHFRLLSVDG 166 (383) Q Consensus 111 -~~~~l~~l~~~~v~~~--------------~~~~~~~~~y~~~~~~~~-~--~------~~~~~~~~dvih~~~~~~~~ 166 (383) .+..|..|+|+++... .+..+..+.|.+...... . . ....+++++|+|+..+...+ T Consensus 161 ~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r~g 240 (530) T protein:vir:38 161 LFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEPMEDG 240 (530) T ss_pred ccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhHeEeeccccCCC Confidence 3578888988877431 244555677777543211 1 0 12446677999999888888 Q ss_pred cccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCC-----------HHHHHHHH------HHHHH---h Q lcl|NC_018285. 167 GLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGL-----------LDFKTKVS------RSRQA---M 226 (383) Q Consensus 167 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~-----------~e~~~~~~------~~~~~---~ 226 (383) ..+|+|.+..+...+.......+....-.+-.+.-.++|+.+.... .++...+. ..+.. . 
T Consensus 241 Q~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (530) T protein:vir:38 241 QTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPV 320 (530) T ss_pred cccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcccccce Confidence 8999999999999998888888887777777777777877543211 11111111 01111 0 Q ss_pred hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc-c--ccCcCHHHHH-----------HHH Q lcl|NC_018285. 227 KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG-Q--GDQQSSLEMS-----------SNV 292 (383) Q Consensus 227 ~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~-~--~~~~~~~e~~-----------~~~ 292 (383) .-..|.+..|..|.+++..+.+-...+|.+..+...+.||+.+|||.+.|-+ - .+|++..... ..| T Consensus 321 ~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~ 400 (530) T protein:vir:38 321 RLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRKFV 400 (530) T ss_pred eccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 1235667789999999988877666788899999999999999999998843 2 2333332222 223 Q ss_pred HHHHHHHHHHHHHHH-HHHhhcc-----hhhccc--h----------hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhh Q lcl|NC_018285. 293 YSKAVARYLRPFLSE-LSQKLSC-----DVDADI--F----------PAVDPTGANYISRINSMVKSGTLAQNQGLYILQ 354 (383) Q Consensus 293 ~~~~l~P~~~~i~~~-l~~~l~~-----~~e~~~--~----------~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg 354 (383) ....+.|+.+.+.++ +..-.++ .+++.. . ...-.|+.+.+.....++++|++|+-++-...| T Consensus 401 ~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G 480 (530) T protein:vir:38 401 ASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRG 480 (530) T ss_pred HHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcC Confidence 333444544443222 2221111 011100 0 011246777777778899999999999998888 Q ss_pred cCCcCCc-----chhHHh--CCCCC------CCC---CCCCCCCC Q lcl|NC_018285. 
355 QAEILPK-----ELPKGE--NPNRT------ILK---GGETNGQD 383 (383) Q Consensus 355 ~~~~~~~-----d~~~~~--~~~~~------~~~---ggd~~~~d 383 (383) ..+-.-- |....+ ++... +.. ..+.+++| T Consensus 481 ~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d 525 (530) T protein:vir:38 481 DDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGVKKSNEEEQD 525 (530) T ss_pred CCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCCCCCCCCCCC Confidence 6652111 111111 12111 111 11222222 No 120 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=99.65 E-value=2.3e-15 Score=100.88 Aligned_cols=358 Identities=10% Similarity=-0.010 Sum_probs=210.8 Q ss_pred hhhh-hcCCcccccccccccc----------hhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchh-- Q lcl|NC_018285. 4 FNLA-TESPPNNQGGFFDITD----------PEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQM-- 70 (383) Q Consensus 4 f~~~-~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~-- 70 (383) .++- .....+...+..+... +..+-...++..-..+..++.+.|.+|++.+...|.+++|.+.-... T Consensus 1 v~~~~l~~e~at~~~~~d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~~~w~i~p~~~~~ 80 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRPFISGLQVPNDSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVVSREWKVEAGGDRP 80 (488) T ss_pred CCccchhHHHHHHHhhhhhhccccCCCCCCChHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhcCCceEEcCCCCh Confidence 1110 0000000111111110 11111111121222344567889999999999999999999963221 Q ss_pred ---------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-C--ceeEEEEeccceeEEEEcCCCceeEEEE Q lcl|NC_018285. 71 ---------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-G--RDMKWEYLRPSQVSFNRLDNQNGLYYNV 138 (383) Q Consensus 71 ---------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-g--~~~~l~~l~~~~v~~~~~~~~~~~~y~~ 138 (383) +..+.+ ..+.++++.+. +.+++|-++.++++..+ | .|..+.+.++.++.+..+ + ...+.. 
T Consensus 81 ~~~~~ae~v~~~l~~----~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~~--~-~l~~~~ 152 (488) T protein:vir:99 81 IDQAAAEHLEQQLQR----VGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRFRYDQD--G-GLRLLT 152 (488) T ss_pred HHHHHHHHHHHHHhC----CCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccceeecCC--C-ceEEec Confidence 112333 35677777775 56789999999998543 3 466888999988776432 2 233322 Q ss_pred eecCcccccceeeccc-c-eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHHH Q lcl|NC_018285. 139 TFDDPRIPPKQHVPQS-D-ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLDF 215 (383) Q Consensus 139 ~~~~~~~~~~~~~~~~-d-vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e~ 215 (383) . ++. ....+++.. . ++|. +....+..+|.|.+..+......-....+....|.+..|.|-.+.+.+. ..++++ T Consensus 153 ~-~~~--~~g~~lp~~~~~i~~~-~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~e 228 (488) T protein:vir:99 153 P-NNM--FEGEPCPAPYFWHFST-GADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPED 228 (488) T ss_pred c-CCC--CCccccccCceEEEEe-ecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHH Confidence 1 111 123445433 3 3333 3444556789999999999888888999999999999999999999875 567888 Q ss_pred HHHHHHHHHHhhcCCcceeecCCCceeeecc--cChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc-----ccCc-CHHH Q lcl|NC_018285. 216 KTKVSRSRQAMKQMQGGPLVLDDLEDFTPLE--IKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ-----GDQQ-SSLE 287 (383) Q Consensus 216 ~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~--~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~-----~~~~-~~~e 287 (383) ++++.+.......+++.+ ++.|++++-++ .... 
..|.+..++.-++|+.+ +||.+ .+++ ...+ T Consensus 229 k~~l~~av~~~~~~~~~v--iP~~~~ie~~ea~~~~~-~~~~~li~~~d~~Isk~------iLGqtlts~~~~Gs~a~~~ 299 (488) T protein:vir:99 229 KAKLLAALHAIQTDSAII--MPAGMQAELLEAGRSGT-ADYKTLHDTMDATIAKV------GLGQVASTQGTPGRLGNDD 299 (488) T ss_pred HHHHHHHHHHHhcCcEEE--ecCCceeEEeecCCCCh-HHHHHHHHHHHHHHHHH------HhhhhhcccccccchhhHH Confidence 888888877776655544 45555554433 3222 24677778888888877 34421 1112 2233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcchh---hcc--------chhhhccCHHHHHHHHHHHHhC-CC-cCHHHHHHHhh Q lcl|NC_018285. 288 MSSNVYSKAVARYLRPFLSELSQKLSCDV---DAD--------IFPAVDPTGANYISRINSMVKS-GT-LAQNQGLYILQ 354 (383) Q Consensus 288 ~~~~~~~~~l~P~~~~i~~~l~~~l~~~~---e~~--------~~~~~~~~~~~~~~~~~~l~~~-g~-~t~nE~r~~lg 354 (383) .........+.-.++.|++.||+.|+..+ .|. .......|...+++.+.++++. |+ ++..++|+++| T Consensus 300 vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~~~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~G 379 (488) T protein:vir:99 300 LQADVRLDLVKADADLICESFNLGPARWLTEWNFPGAQPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYG 379 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCCcCCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcC Confidence 34456677888889999999998776431 111 1112234556778888999985 64 78999999999 Q ss_pred cCCcCCcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 355 QAEILPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 355 ~~~~~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) .++-..++............++ ..+... 
T Consensus 380 ip~~~~~~~~~~~~~~~~~~~~-~~~~~~ 407 (488) T protein:vir:99 380 VEVESTQAEATAPTPSTEFAEG-DQPSDP 407 (488) T ss_pred CCCcccccccccCCCcccCCCC-CCCCCc Confidence 8864333211111111111111 111111 No 121 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.64 E-value=8.7e-15 Score=97.68 Aligned_cols=362 Identities=9% Similarity=-0.039 Sum_probs=212.8 Q ss_pred Cchhhh----hhc---CCcccc--ccc-ccccchhhcccc---------cCCceechhhhhccHHHHHHHHHHHHhhhhC Q lcl|NC_018285. 1 MPIFNL----ATE---SPPNNQ--GGF-FDITDPEFLATL---------NGSEWVSAETALKNSDLFSIISQLSNDLATA 61 (383) Q Consensus 1 Mglf~~----~~~---~~~~~~--~~~-~~~~~~~~~~~~---------~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~ 61 (383) =+|++- ++. +++... ... ..++.....+.+ .++..--.+..++.+.|.+|++.+...|.++ T Consensus 3 ~~i~~~~g~~~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~p~~~~il~~~~~~~~~y~~m~~D~~i~s~l~~Rk~av~~~ 82 (491) T protein:vir:79 3 KGLWVSPTEFVKFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADAHVGGCVRRRKAAVKAL 82 (491) T ss_pred CeeeCCCCCcccccccchhHHHHHhhhccccccccccccCcchhHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhCC Confidence 111111 000 000000 000 000000000000 1111113344567899999999999999999 Q ss_pred ceeeecchhh--------hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-C--ceeEEEEeccceeEEEEcCC Q lcl|NC_018285. 62 KLTTSRKQMQ--------GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-G--RDMKWEYLRPSQVSFNRLDN 130 (383) Q Consensus 62 p~~~~~~~~~--------~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-g--~~~~l~~l~~~~v~~~~~~~ 130 (383) +|.+...... 
.++.+ +...++++.+ .+.+++|-++.++++..+ | .|..+.+.++.++.+..+ T Consensus 83 ~w~i~~~~~~~~~a~~i~e~l~~----~~~~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~~-- 155 (491) T protein:vir:79 83 EWGLDRGKAKSRVAKSIADVFAD----LDLSRIATEM-LDAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYDPE-- 155 (491) T ss_pred CcEEecCCCCHHHHHHHHHHHhc----CCHHHHHHHH-HHhhhhcceeEEEEEeecCCeeeEEeeeeecccceeeccC-- Confidence 9999743221 12333 2456666666 457789999999987654 3 356899999998876442 Q ss_pred CceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC Q lcl|NC_018285. 131 QNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG 210 (383) Q Consensus 131 ~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~ 210 (383) + ...++... + .....++++...++.++....+..+|.|.+..+......-....++...|.+..|.|-.+.+.+.. T Consensus 156 ~-~l~l~~~~-~--~~~g~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~ 231 (491) T protein:vir:79 156 N-QLRFRSKE-H--WVQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRS 231 (491) T ss_pred C-ceEEeecC-C--CCCceeecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCC Confidence 2 33333221 1 223456888888877776666778899999999999988899999999999999999999999988 Q ss_pred CCHHHHHHHHHHHHHhhcCCcceeecCCCceeee--cccCh-hhHHHHHHHHHHHHHHHHHhcCCHHHhcccc----cCc Q lcl|NC_018285. 211 GLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTP--LEIKS-NVAQLLKQADWTTGQFAKVYGIPENVVGGQG----DQQ 283 (383) Q Consensus 211 ~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~--~~~~~-~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~----~~~ 283 (383) .++++++++.+.......+++.+ ++.|++++- .+... .-..|.+..++.-++|+.++ ||.+. 
+++ T Consensus 232 a~~~ek~~l~~al~~~~~~a~~v--iP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i------LGqtlTt~~~gs 303 (491) T protein:vir:79 232 ASDAETNLLLDRLEDMVQDAVAV--IPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIAL------LGQNQTTEATST 303 (491) T ss_pred CCHHHHHHHHHHHHHHhcCeEEE--ecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHH------hhhhhccCcccc Confidence 88999988888887776665544 555555544 33222 22236666777777777754 55321 111 Q ss_pred -CHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch-hh----------ccchhhhccCHHHHHHHHHHHHhCCC-cCHHHHH Q lcl|NC_018285. 284 -SSLEMSSNVYSKAVARYLRPFLSELSQKLSCD-VD----------ADIFPAVDPTGANYISRINSMVKSGT-LAQNQGL 350 (383) Q Consensus 284 -~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~-~e----------~~~~~~~~~~~~~~~~~~~~l~~~g~-~t~nE~r 350 (383) ...+.........+.-.++.+++.||+ |+.. +. |.... ........++.+.++...|+ ++..++| T Consensus 304 ~a~~~vh~~v~~~i~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~e-~ee~~~~~a~~~~~L~~~G~~i~~~~~~ 381 (491) T protein:vir:79 304 RASAQAGLEVTDDIRDGDKAIVVEAMNM-LIRWICDLNFDGAARPVFDMWE-QEQVDEIQAGRDEKLTRAGARFTPAYFK 381 (491) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEeecC-cCchhHHHHHHHHHHHhCCCccCHHHHH Confidence 122333445566677778888888885 4422 22 11111 11122446677888999887 6999999 Q ss_pred HHhhcCCcCCcchhHHhCCCCC----CCCCCCCCCCC Q lcl|NC_018285. 351 YILQQAEILPKELPKGENPNRT----ILKGGETNGQD 383 (383) Q Consensus 351 ~~lg~~~~~~~d~~~~~~~~~~----~~~ggd~~~~d 383 (383) +.+|.+.-..++.......+.. 
........+++ T Consensus 382 e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 418 (491) T protein:vir:79 382 RAYNLQDGDLDERPLPVSAVDAVGAASFAEFEAPDQD 418 (491) T ss_pred HHhCCCCCCCCccccCcCcccccccccccccCCCCCc Confidence 9999875333332111000100 11111112222 No 122 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.63 E-value=1.2e-14 Score=96.82 Aligned_cols=362 Identities=11% Similarity=0.011 Sum_probs=214.4 Q ss_pred Cchhhhhhc-------CC-----ccccccccc-----cc--ch-hhcccccCCceechhhhhccHHHHHHHHHHHHhhhh Q lcl|NC_018285. 1 MPIFNLATE-------SP-----PNNQGGFFD-----IT--DP-EFLATLNGSEWVSAETALKNSDLFSIISQLSNDLAT 60 (383) Q Consensus 1 Mglf~~~~~-------~~-----~~~~~~~~~-----~~--~~-~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~ 60 (383) =+|++---+ ++ ...+....+ .+ .. ..+-. .++..--.+..++.+.|.+|++.+...|.+ T Consensus 3 ~~i~~~~g~p~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~~~~~~iLr~-~~~~~~~y~~m~~D~~i~s~l~~Rk~av~~ 81 (491) T protein:vir:10 3 KGLWVSPTEFVTFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKA-LGKDIRVYRELRADAHVGGCVRRRKAAVKA 81 (491) T ss_pred CceeCCCCCccCcccCChHHHHHHHhhhcccccccccCCccchHHHHHh-cCCCHHHHHHHhhChHHHHHHHHHHHHHhC Confidence 011111000 00 000000000 00 00 00000 111111234456788999999999999999 Q ss_pred Cceeeecchh--------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCC---ceeEEEEeccceeEEEEcC Q lcl|NC_018285. 61 AKLTTSRKQM--------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNG---RDMKWEYLRPSQVSFNRLD 129 (383) Q Consensus 61 ~p~~~~~~~~--------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g---~~~~l~~l~~~~v~~~~~~ 129 (383) ++|.|..... ..++.+ ....++++.+. 
+.+++|.++.++++..+| .|..+.++++.++.+..+ T Consensus 82 ~~w~i~~~~~~~~~~e~v~e~l~~----~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~~- 155 (491) T protein:vir:10 82 LEWGLDRGKAKSRVAKSIADVFAD----LDLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYDPE- 155 (491) T ss_pred CCcEEecCCCCHHHHHHHHHHHhc----CCHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeecccceeeccC- Confidence 9999974221 122333 34677777775 677899999999986543 366899999998876432 Q ss_pred CCceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC Q lcl|NC_018285. 130 NQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG 209 (383) Q Consensus 130 ~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~ 209 (383) +...+... + ......++++...|+.++....+..+|.|.+..+......-....+....|.+..|.|-.+.+.+. T Consensus 156 --~~l~~~~~-~--~~~~g~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~ 230 (491) T protein:vir:10 156 --NQLRFRSK-D--HWMQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPR 230 (491) T ss_pred --CceEEecC-C--CCCCcceecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCC Confidence 23444321 1 123345678887777777666677889999999999998889999999999999999999999998 Q ss_pred CCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeec--ccChhhH-HHHHHHHHHHHHHHHHhcCCHHHhccc----ccC Q lcl|NC_018285. 210 GGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPL--EIKSNVA-QLLKQADWTTGQFAKVYGIPENVVGGQ----GDQ 282 (383) Q Consensus 210 ~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~--~~~~~d~-~~~e~~~~~~~~Ia~~~gVpp~~lg~~----~~~ 282 (383) ..++++++++.+...+...+++ ++++.|++++-+ +.+.... 
-|.+..++.-++|+.+ +||.+ +++ T Consensus 231 ~a~~~ek~~l~~al~~~~~~a~--~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~------iLGqtlTt~~~g 302 (491) T protein:vir:10 231 SASDGEKNLLLDCLEDMVQDAV--AVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIA------LLGQNQTTEATS 302 (491) T ss_pred CCCHHHHHHHHHHHHHHhcCcE--EEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHH------HhhhhcccCccc Confidence 8899999989888887766655 445566555444 3322222 3666677777777775 44432 111 Q ss_pred c-CHHHHHHHHHHHHHHHHHHHHHHHHHHhhcch-hhccc---------hhhhccCHHHHHHHHHHHHhCCC-cCHHHHH Q lcl|NC_018285. 283 Q-SSLEMSSNVYSKAVARYLRPFLSELSQKLSCD-VDADI---------FPAVDPTGANYISRINSMVKSGT-LAQNQGL 350 (383) Q Consensus 283 ~-~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~-~e~~~---------~~~~~~~~~~~~~~~~~l~~~g~-~t~nE~r 350 (383) + ...+.........+.-.++.++..||+ |+.. +.++. ......+....++.+.+|...|+ ++..+++ T Consensus 303 s~a~~~vh~~v~~di~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~~~~e~~~~~a~~~~~L~~~G~~i~~~~i~ 381 (491) T protein:vir:10 303 TRASAQAGLEVTDDIRDGDKAVVSEAMNM-LIRWICDLNFDGADRPVFDMWEQEQVDEIQAGRDQKLTQAGARFTPAYFK 381 (491) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEEecCcCchhHHHHHHHHHHHhCCCcCCHHHHH Confidence 2 122333445566677778888888875 4421 11110 00111223556778889998887 6999999 Q ss_pred HHhhcCCcCCcchhHHhCCCCC--CCC--CCCCCCCC Q lcl|NC_018285. 351 YILQQAEILPKELPKGENPNRT--ILK--GGETNGQD 383 (383) Q Consensus 351 ~~lg~~~~~~~d~~~~~~~~~~--~~~--ggd~~~~d 383 (383) +++|.+.-..++.......+.. +.. 
..+..+++ T Consensus 382 e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 418 (491) T protein:vir:10 382 RAYNLQDGDLDERPLPVSAVDTVGAASFAEFEAPDQD 418 (491) T ss_pred HHhCCCCCCcCccccccCCCCCcccccccccCCCCCC Confidence 9999875333332111111111 111 11122222 No 123 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=99.63 E-value=2.3e-15 Score=100.79 Aligned_cols=368 Identities=12% Similarity=0.033 Sum_probs=212.7 Q ss_pred Cchh----hhhhcCCccccccccccc--chhhc-------------ccc---cCCceec----hhhhh-ccHHHHHHHHH Q lcl|NC_018285. 1 MPIF----NLATESPPNNQGGFFDIT--DPEFL-------------ATL---NGSEWVS----AETAL-KNSDLFSIISQ 53 (383) Q Consensus 1 Mglf----~~~~~~~~~~~~~~~~~~--~~~~~-------------~~~---~~~~~~~----~~~a~-~~~~v~~~i~~ 53 (383) |+-+ .+-.+.+...+....... ...+. ..+ .++.... .+..+ +.+.|.+|++. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~l~~ 80 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFAEMSK 80 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 4322 221111111110000000 00010 000 0111110 11111 58889999999 Q ss_pred HHHhhhhCceeeecchh-----hh----hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-C--ceeEEEEeccc Q lcl|NC_018285. 54 LSNDLATAKLTTSRKQM-----QG----IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-G--RDMKWEYLRPS 121 (383) Q Consensus 54 ia~~ia~~p~~~~~~~~-----~~----l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-g--~~~~l~~l~~~ 121 (383) +...|.+++|.|.-... .. +...-+..-.+.+++..+ .+.+++|-++.++++..+ | .|..+.+.++. 
T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~ 159 (528) T protein:vir:10 81 RKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLDC-MDGVGHGYSAIELDWSLQGREWLPQAFDHRPQS 159 (528) T ss_pred HHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHHH-HhhhhhcceeEEEEEeecCCceeEEEeeeeccc Confidence 99999999999974211 11 111111111244454444 556689999999987543 3 46788899998 Q ss_pred eeEEEEcCCCceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCc Q lcl|NC_018285. 122 QVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNA 201 (383) Q Consensus 122 ~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~ 201 (383) ++.+..+.. ..++... + ......+++...++.++....+..+|.+.+..+.-....-....++...|.+..|.| T Consensus 160 ~f~~~~~~~---~~l~~~~-~--~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P 233 (528) T protein:vir:10 160 WFQLNPDDQ---DELRLRD-N--SIAGEVLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLP 233 (528) T ss_pred ceeeccCCC---cEEeccC-C--CCCceeecCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCC Confidence 887654322 2222211 1 123456777776666666667778899999999998888889999999999999999 Q ss_pred ceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccCh-hhHHHHHHHHHHHHHHHHHhcCCHHHhcccc Q lcl|NC_018285. 202 NGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKS-NVAQLLKQADWTTGQFAKVYGIPENVVGGQG 280 (383) Q Consensus 202 ~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~-~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~ 280 (383) -.+.+.+...++++++.+.+.......+++.+ ++.|++++=+..+. .-..|.+..++.-++|+.+. ||.+. 
T Consensus 234 ~~igky~~~a~~~ek~~L~~al~~i~~~~~~i--iP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i------LGqtl 305 (528) T protein:vir:10 234 IRLGKYPPGTPDEEKVTLLRAVTGLGHAAAGI--IPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAI------LGGTL 305 (528) T ss_pred eEEEecCCCCCHHHHHHHHHHHHHHhhCcEEE--ecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHH------hhhhh Confidence 99999998888999999988888777766544 45555554433221 12246777788888887764 44211 Q ss_pred --------cCcCH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcch---hhcc------------chhhhccCHHHHHHHHH Q lcl|NC_018285. 281 --------DQQSS-LEMSSNVYSKAVARYLRPFLSELSQKLSCD---VDAD------------IFPAVDPTGANYISRIN 336 (383) Q Consensus 281 --------~~~~~-~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~---~e~~------------~~~~~~~~~~~~~~~~~ 336 (383) .+++. .+.......+.+.--++.++..||+.|+.. +.|. .......|...+++.+. T Consensus 306 Ts~~~~g~~gS~Alg~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~ 385 (528) T protein:vir:10 306 TSQTSESGGGAYALGQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMATSLP 385 (528) T ss_pred hccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCCcccHHHHHHHHH Confidence 12222 222345566777888899999999877542 1221 01112234456788888 Q ss_pred HHHhCCC-cCHHHHHHHhhcCCcCCcchhHHhCCCCCC-----C-----------CCCCCCCCC Q lcl|NC_018285. 337 SMVKSGT-LAQNQGLYILQQAEILPKELPKGENPNRTI-----L-----------KGGETNGQD 383 (383) Q Consensus 337 ~l~~~g~-~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~-----~-----------~ggd~~~~d 383 (383) ++...|+ ++..++|+++|.+....+|..........+ . 
.+....+++ T Consensus 386 ~L~~~G~~i~~~~i~e~~gip~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (528) T protein:vir:10 386 PLVKLGVQVPVNWVQEQLGIPLPANGEAVLGDQAGAGIAQLSRRPGPRIAALAQVIGPRYRDQE 449 (528) T ss_pred HHHhCCCCCCHHHHHHHhCCCCCCCCcccccCCCcccccccCcccccccccccccccccccccc Confidence 9999998 899999999998654333321111000000 0 001111111 No 124 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.63 E-value=3.7e-15 Score=99.67 Aligned_cols=381 Identities=10% Similarity=0.010 Sum_probs=206.5 Q ss_pred CchhhhhhcCCcc------cccccccccchhhcccccC---Ccee----------chhhhhccHHHHHHHHHHHHhhhhC Q lcl|NC_018285. 1 MPIFNLATESPPN------NQGGFFDITDPEFLATLNG---SEWV----------SAETALKNSDLFSIISQLSNDLATA 61 (383) Q Consensus 1 Mglf~~~~~~~~~------~~~~~~~~~~~~~~~~~~~---~~~~----------~~~~a~~~~~v~~~i~~ia~~ia~~ 61 (383) |.|++.....+.. ..+.+........+..+.. ...+ +..-+..++.+..||+.+.+.+-.. T Consensus 1 m~~~~~~~~a~~~~~~~~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG~ 80 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLVPVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVGN 80 (495) T ss_pred CCcccccccccchhhhhHHHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCC Confidence 9998885432211 1111211111110111110 0001 1112356888899999888887555 Q ss_pred ceeeecc-hh-----------hhhccCCC--ccCCHHHHHHHHHHHHHHcCCeEEEEeecC--CC--ceeEEEEecccee Q lcl|NC_018285. 62 KLTTSRK-QM-----------QGIVDNPS--NSANRFNFYQSIFAQMLLGGEAFAYRWRND--NG--RDMKWEYLRPSQV 123 (383) Q Consensus 62 p~~~~~~-~~-----------~~l~~~PN--~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~--~g--~~~~l~~l~~~~v 123 (383) .++..-. .. ..+.++|. -.++.+.+...++..++..|++|+.++... 
.| .+..|..|+|+++ T Consensus 81 Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqliepd~l 160 (495) T protein:vir:10 81 GLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQIIEPDML 160 (495) T ss_pred CcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEechhhc Confidence 5544311 11 11223332 347889999999999999999999887542 33 4678899998887 Q ss_pred EEE-----------------EcCCCceeEEEEeecCccc-------ccceeecccceEEeccCCCCccccCcchHHHHHH Q lcl|NC_018285. 124 SFN-----------------RLDNQNGLYYNVTFDDPRI-------PPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGR 179 (383) Q Consensus 124 ~~~-----------------~~~~~~~~~y~~~~~~~~~-------~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~ 179 (383) ..- .+..+..+.|.+....+.. ...+.+++++|+|+.. ...+...|+|.+.. .. T Consensus 161 ~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~-~r~gQ~RGis~la~-i~ 238 (495) T protein:vir:10 161 ASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTV-LTVRSDAGAPWFQL-LL 238 (495) T ss_pred CCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEeccc-cCCCcccCcchhHH-HH Confidence 521 1234456777775443322 1245699999999964 45678899997654 44 Q ss_pred HHHHHHHHHHHHHHHHhccCCcceeEeecCCCCH--HHHHHHHHH---HHHhhcCCcceeecCCCceeeecccChhhHHH Q lcl|NC_018285. 180 ELDIQKASDKLTLNSLKNALNANGILKIKGGGLL--DFKTKVSRS---RQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQL 254 (383) Q Consensus 180 ~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~--e~~~~~~~~---~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~ 254 (383) .+.-.....+....-.+-.+...++|+.+..... +........ 
-....-..|.+..|..|.+++.++.+..-..| T Consensus 239 ~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~ 318 (495) T protein:vir:10 239 RLNELDQYEDAELVRKKTAALFAAFIQEATADSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNPADVGTTY 318 (495) T ss_pred HHHHhhHHHHHHHHHHHHhhhheeeeecCCCccccccccCccccccCcccceecCCceeeecCCCCeeeeeCCCCCCCCH Confidence 4665555555555555556667777775432110 000000000 00111245678889999999988877666678 Q ss_pred HHHHHHHHHHHHHHhcCCHHHhccc---ccCcCHHHHH----HH--------HHHHHHHHHHHHHHH-HHHHhhc--chh Q lcl|NC_018285. 255 LKQADWTTGQFAKVYGIPENVVGGQ---GDQQSSLEMS----SN--------VYSKAVARYLRPFLS-ELSQKLS--CDV 316 (383) Q Consensus 255 ~e~~~~~~~~Ia~~~gVpp~~lg~~---~~~~~~~e~~----~~--------~~~~~l~P~~~~i~~-~l~~~l~--~~~ 316 (383) .+..+...+.||+.+|||.+.|-+. .+|++..... +. +....++|+.+.+.+ ++-.-.+ +.+ T Consensus 319 ~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~ 398 (495) T protein:vir:10 319 EPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDY 398 (495) T ss_pred HHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCc Confidence 8888999999999999999998532 2333332222 11 222333443333322 2111111 111 Q ss_pred hc------cch----hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCc-----chhHHh--CCCCC----CC- Q lcl|NC_018285. 317 DA------DIF----PAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPK-----ELPKGE--NPNRT----IL- 374 (383) Q Consensus 317 e~------~~~----~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~-----d~~~~~--~~~~~----~~- 374 (383) .- ... ...-.|+.+.+.....++++|++|+-|+-...|..+-.-- |...++ ++... .. 
T Consensus 399 ~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~~~~ 478 (495) T protein:vir:10 399 LQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDMEELFDMISDANQLIDEYDLRLDSDPRYVN 478 (495) T ss_pred hhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCcCC Confidence 00 000 1112467777777788999999999999988887652111 111111 11110 00 Q ss_pred -CCC------CCCCCC Q lcl|NC_018285. 375 -KGG------ETNGQD 383 (383) Q Consensus 375 -~gg------d~~~~d 383 (383) .|- +.+++| T Consensus 479 ~~~~~~~~~~~~~~~~ 494 (495) T protein:vir:10 479 GSGAEQKSVMEAALNN 494 (495) T ss_pred CccCCCCCCCCCCCCC Confidence 011 111111 No 125 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=99.61 E-value=1.1e-14 Score=97.16 Aligned_cols=368 Identities=11% Similarity=0.029 Sum_probs=212.7 Q ss_pred CchhhhhhcCCcc----cccccc---------------cccchhhcccc---cCCceec----hhhhh-ccHHHHHHHHH Q lcl|NC_018285. 1 MPIFNLATESPPN----NQGGFF---------------DITDPEFLATL---NGSEWVS----AETAL-KNSDLFSIISQ 53 (383) Q Consensus 1 Mglf~~~~~~~~~----~~~~~~---------------~~~~~~~~~~~---~~~~~~~----~~~a~-~~~~v~~~i~~ 53 (383) |+-+=-...+|-. .+.... +++...+-..+ .++.... .+..+ +.+.|.+|++. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~l~~ 80 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSK 80 (526) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 4322111111110 000000 00000000000 1111110 11112 57899999999 Q ss_pred HHHhhhhCceeeecchh---------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCC---ceeEEEEeccc Q lcl|NC_018285. 
54 LSNDLATAKLTTSRKQM---------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNG---RDMKWEYLRPS 121 (383) Q Consensus 54 ia~~ia~~p~~~~~~~~---------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g---~~~~l~~l~~~ 121 (383) +...|.+++|.|.-... ..+...-+..-.+.+++..+. +.+.+|-++.++++..+| .|..+.+.++. T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~ 159 (526) T protein:vir:99 81 RKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQS 159 (526) T ss_pred HHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEEeeeeccc Confidence 99999999999973211 111111111123566666665 577899999999976543 46789999999 Q ss_pred eeEEEEcCCCceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCc Q lcl|NC_018285. 122 QVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNA 201 (383) Q Consensus 122 ~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~ 201 (383) ++.+..+.. ..+.++ ++ .....++++...+..++....+..+|.+.+..+.-....-....+....|.+..|.| T Consensus 160 ~f~~~~~~~-~~l~~~---~~--~~~g~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P 233 (526) T protein:vir:99 160 WFQLNPEDQ-NELRLR---DN--SPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLP 233 (526) T ss_pred ceeeccCCC-cEEEec---CC--CCCceeecCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCc Confidence 888755432 223222 11 123456777765555566666778899999999988888888999999999999999 Q ss_pred ceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccC-hhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc Q lcl|NC_018285. 202 NGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIK-SNVAQLLKQADWTTGQFAKVYGIPENVVGGQG 280 (383) Q Consensus 202 ~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~-~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~ 280 (383) -.+.+.+...++++++.+.+.......+++ ++++.|++++-+..+ ..-..|.+..++.-++|+.++ ||.+. 
T Consensus 234 ~~igky~~~a~~~ek~~L~~av~~i~~d~~--~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i------LGqtl 305 (526) T protein:vir:99 234 IRLGKYPPGTADEEKATLLRAVTGLGHAAA--GIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV------LGGTL 305 (526) T ss_pred eEEEecCCCCCHHHHHHHHHHHHHHhhCcE--EEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH------hhhhh Confidence 999999988889999888888877766654 445566555444332 122346777788888888764 33211 Q ss_pred --------cCcC-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcch---hhcc------------chhhhccCHHHHHHHHH Q lcl|NC_018285. 281 --------DQQS-SLEMSSNVYSKAVARYLRPFLSELSQKLSCD---VDAD------------IFPAVDPTGANYISRIN 336 (383) Q Consensus 281 --------~~~~-~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~---~e~~------------~~~~~~~~~~~~~~~~~ 336 (383) .+++ ..+.......+.+.--++.+++.||+.|+.. +.+. .......|...+++.+. T Consensus 306 Ts~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~ 385 (526) T protein:vir:99 306 TSTTSQSGGGAFALGQVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIP 385 (526) T ss_pred ccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHH Confidence 1111 1222344566677778889999998877532 1111 01112344556788889 Q ss_pred HHHhCCC-cCHHHHHHHhhcCCcCCcchhHH-hCCC-CCCC-CCCC-----------CCCCC Q lcl|NC_018285. 337 SMVKSGT-LAQNQGLYILQQAEILPKELPKG-ENPN-RTIL-KGGE-----------TNGQD 383 (383) Q Consensus 337 ~l~~~g~-~t~nE~r~~lg~~~~~~~d~~~~-~~~~-~~~~-~ggd-----------~~~~d 383 (383) +|...|+ ++..++++++|.+.-..+|.-.. ...+ ..+. ++.. 
..+++ T Consensus 386 ~L~~~G~~i~~~~i~e~~Gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (526) T protein:vir:99 386 ALVNVGLEIPSAWVYDKLGIPQPAKNEPVLRSAAQPAILSRQHGQRVAALATIVGPRYGDQQ 447 (526) T ss_pred HHHhCCCccCHHHHHHHhCCCCCCCcccccCCCCCCcccccccccccccccccccccCcchh Confidence 9999987 79999999999865333331110 0000 0000 0100 01111 No 126 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.60 E-value=4.9e-15 Score=99.04 Aligned_cols=383 Identities=8% Similarity=0.015 Sum_probs=214.5 Q ss_pred CchhhhhhcCCcc-c---cccccc-c--cchh---hcccccCCc-ee----------chhhhhccHHHHHHHHHHHHhhh Q lcl|NC_018285. 1 MPIFNLATESPPN-N---QGGFFD-I--TDPE---FLATLNGSE-WV----------SAETALKNSDLFSIISQLSNDLA 59 (383) Q Consensus 1 Mglf~~~~~~~~~-~---~~~~~~-~--~~~~---~~~~~~~~~-~~----------~~~~a~~~~~v~~~i~~ia~~ia 59 (383) |.-+-.+...... . ...+.. . .... |.+...+.. .+ +..-+..++.+..||+.+.+.+- T Consensus 3 ~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvV 82 (533) T protein:vir:34 3 TPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDHIV 82 (533) T ss_pred CchhhhhhcccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhh Confidence 2222211110000 0 000000 0 0011 111111100 00 11223468889999998888876 Q ss_pred hCceeeecc-----------hh-----------hhhccCCCc------cCCHHHHHHHHHHHHHHcCCeEEEEeecCC-C Q lcl|NC_018285. 60 TAKLTTSRK-----------QM-----------QGIVDNPSN------SANRFNFYQSIFAQMLLGGEAFAYRWRNDN-G 110 (383) Q Consensus 60 ~~p~~~~~~-----------~~-----------~~l~~~PN~------~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-g 110 (383) ...+++.-. .. ..+.+.|+. .++.+++...++..++..|++|+.+.+... 
| T Consensus 83 G~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~g 162 (533) T protein:vir:34 83 GSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTSSS 162 (533) T ss_pred CCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeeccCCC Confidence 666665421 01 112234432 468899999999999999999999876543 2 Q ss_pred --ceeEEEEeccceeEEE--------------EcCCCceeEEEEeecCcccc---------cceeecccceEEeccCCCC Q lcl|NC_018285. 111 --RDMKWEYLRPSQVSFN--------------RLDNQNGLYYNVTFDDPRIP---------PKQHVPQSDILHFRLLSVD 165 (383) Q Consensus 111 --~~~~l~~l~~~~v~~~--------------~~~~~~~~~y~~~~~~~~~~---------~~~~~~~~dvih~~~~~~~ 165 (383) .+..|..|+|+++..- .+..+..+.|.+........ ....+++++|+|+..+... T Consensus 163 ~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f~~~r~ 242 (533) T protein:vir:34 163 RLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEPVED 242 (533) T ss_pred CccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChhHeeeeccccCC Confidence 3568888888776432 23445567777754321110 1234678899999988888 Q ss_pred ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCC-----------CHHHHHHHHH------HHHHh-- Q lcl|NC_018285. 166 GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGG-----------LLDFKTKVSR------SRQAM-- 226 (383) Q Consensus 166 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~-----------~~e~~~~~~~------~~~~~-- 226 (383) +..+|+|.+..+...+.......+....-.+-.+.-.++|+.+... ..+....+.. .+... T Consensus 243 gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (533) T protein:vir:34 243 GQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAP 322 (533) T ss_pred CcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcce Confidence 8899999999999999888888888777777777778888754221 1111111111 11111 Q ss_pred -hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc---ccCcCHHHHH-----------HH Q lcl|NC_018285. 
227 -KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ---GDQQSSLEMS-----------SN 291 (383) Q Consensus 227 -~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~---~~~~~~~e~~-----------~~ 291 (383) .-..|.+..|..|.+++.++.+-...+|.+..+...+.||+.+|||.+.|-+. .+|++..... .. T Consensus 323 ~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~ 402 (533) T protein:vir:34 323 VRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKF 402 (533) T ss_pred eeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHH Confidence 11356777899999999888876667888999999999999999999988532 2333332111 22 Q ss_pred HHHHHHHHHHHHHHHH-HHHhhcc--h---hhccch------------hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_018285. 292 VYSKAVARYLRPFLSE-LSQKLSC--D---VDADIF------------PAVDPTGANYISRINSMVKSGTLAQNQGLYIL 353 (383) Q Consensus 292 ~~~~~l~P~~~~i~~~-l~~~l~~--~---~e~~~~------------~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~l 353 (383) |....+.|+.+.+.++ +-...++ . +++... ...-.|+.+.+.....++++|++|+-|+-... T Consensus 403 ~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~ 482 (533) T protein:vir:34 403 VASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKR 482 (533) T ss_pred HHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc Confidence 3333445554443332 2111111 0 111110 01124667777777889999999999999988 Q ss_pred hcCCcCCcc-----hhHHhC--CCCC--C----CCCCCCCCCC Q lcl|NC_018285. 354 QQAEILPKE-----LPKGEN--PNRT--I----LKGGETNGQD 383 (383) Q Consensus 354 g~~~~~~~d-----~~~~~~--~~~~--~----~~ggd~~~~d 383 (383) |..+-.--+ ....+. +... 
+ ..|...++++ T Consensus 483 G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~ 525 (533) T protein:vir:34 483 GDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEE 525 (533) T ss_pred CCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCC Confidence 876521111 111111 1111 1 1111111111 No 127 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.60 E-value=1e-14 Score=97.28 Aligned_cols=382 Identities=7% Similarity=-0.044 Sum_probs=213.1 Q ss_pred Cchhhhhhc----CCcc-----ccccccccc--chhhcccccCCce----e----------chhhhhccHHHHHHHHHHH Q lcl|NC_018285. 1 MPIFNLATE----SPPN-----NQGGFFDIT--DPEFLATLNGSEW----V----------SAETALKNSDLFSIISQLS 55 (383) Q Consensus 1 Mglf~~~~~----~~~~-----~~~~~~~~~--~~~~~~~~~~~~~----~----------~~~~a~~~~~v~~~i~~ia 55 (383) |.--.+... .++. ....+...+ ...+.++...... + +..-+..++.+..+|+.+. T Consensus 2 ~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 81 (553) T protein:vir:63 2 TKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQR 81 (553) T ss_pred cchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 111111111 0000 001111111 1111111111110 0 1122356888899999888 Q ss_pred HhhhhCceeeecc------------hh-----------hhhccCCC------ccCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_018285. 56 NDLATAKLTTSRK------------QM-----------QGIVDNPS------NSANRFNFYQSIFAQMLLGGEAFAYRWR 106 (383) Q Consensus 56 ~~ia~~p~~~~~~------------~~-----------~~l~~~PN------~~~t~~~f~~~~~~~~~l~G~a~~~i~r 106 (383) +.+-...+++.-. .. 
..+.+.|| -.++.+.+...++..++..|++|+.+++ T Consensus 82 ~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~ 161 (553) T protein:vir:63 82 DSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATAEW 161 (553) T ss_pred HhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEeee Confidence 7777666765311 01 11233343 3467889999999999999999998876 Q ss_pred cCC-C--ceeEEEEeccceeEEEE--------------cCCCceeEEEEeecCcccc--------------cceeecccc Q lcl|NC_018285. 107 NDN-G--RDMKWEYLRPSQVSFNR--------------LDNQNGLYYNVTFDDPRIP--------------PKQHVPQSD 155 (383) Q Consensus 107 ~~~-g--~~~~l~~l~~~~v~~~~--------------~~~~~~~~y~~~~~~~~~~--------------~~~~~~~~d 155 (383) ... | .+..|..|+|+++..-. +..+..+.|.+....+... ....+++++ T Consensus 162 ~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~~~~v~a~~ 241 (553) T protein:vir:63 162 DRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQSKPWGRRQ 241 (553) T ss_pred ccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccceeeeccccccChhH Confidence 432 2 35678889888874322 3445567777654433211 123578999 Q ss_pred eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHH--------------- Q lcl|NC_018285. 156 ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVS--------------- 220 (383) Q Consensus 156 vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~--------------- 220 (383) |||+..+...+..+|+|.+..++..+............-.+=.+...++|+.+... +...+.+. 
T Consensus 242 vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 320 (553) T protein:vir:63 242 VIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELPP-EFIHSQMSGGSPNADMVGIFGKY 320 (553) T ss_pred heecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCh-hhhhhhccccccccccccccccc Confidence 99999888888899999999999999888888888777777777777888765321 11111110 Q ss_pred -H---HHHH----hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc---ccCcCHHHHH Q lcl|NC_018285. 221 -R---SRQA----MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ---GDQQSSLEMS 289 (383) Q Consensus 221 -~---~~~~----~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~---~~~~~~~e~~ 289 (383) . .... ..-..|.+..|..|.+++..+.+-...+|.+..+...+.||+.+|||.+.|-+. .+|++..... T Consensus 321 ~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~ 400 (553) T protein:vir:63 321 MDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGI 400 (553) T ss_pred ccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHH Confidence 0 0000 011356788899999999888876666888999999999999999999988532 2333332221 Q ss_pred -----------HHHHHHHHHHHHHHHHHH-HHHhhc--chh----hc--------------cchhhhccCHHHHHHHHHH Q lcl|NC_018285. 290 -----------SNVYSKAVARYLRPFLSE-LSQKLS--CDV----DA--------------DIFPAVDPTGANYISRINS 337 (383) Q Consensus 290 -----------~~~~~~~l~P~~~~i~~~-l~~~l~--~~~----e~--------------~~~~~~~~~~~~~~~~~~~ 337 (383) ..|....++|+.+.+.++ +-...+ +.. .+ ---...-.|+.+-+..... T Consensus 401 ~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~ 480 (553) T protein:vir:63 401 AMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVM 480 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHH Confidence 123334445544443332 111111 100 00 0000112567777777888 Q ss_pred HHhCCCcCHHHHHHHhhcCCcCCc-----chhHHh--CCCCC-----C------CC-------CCCCCCCC Q lcl|NC_018285. 
338 MVKSGTLAQNQGLYILQQAEILPK-----ELPKGE--NPNRT-----I------LK-------GGETNGQD 383 (383) Q Consensus 338 l~~~g~~t~nE~r~~lg~~~~~~~-----d~~~~~--~~~~~-----~------~~-------ggd~~~~d 383 (383) ++++|++|+-|+-...|..+-.-- |...++ ++... + .+ ..+++++| T Consensus 481 ~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (553) T protein:vir:63 481 RIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTSQQ 551 (553) T ss_pred HHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCCCCCCcccc Confidence 999999999999998886652111 111111 11100 0 00 01111111 No 128 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=99.58 E-value=3.7e-14 Score=94.24 Aligned_cols=366 Identities=12% Similarity=0.052 Sum_probs=213.1 Q ss_pred Cc-hhhhhhcCCcccc----ccc---------------ccccchhhcccc---cCCceec----hhhhh-ccHHHHHHHH Q lcl|NC_018285. 1 MP-IFNLATESPPNNQ----GGF---------------FDITDPEFLATL---NGSEWVS----AETAL-KNSDLFSIIS 52 (383) Q Consensus 1 Mg-lf~~~~~~~~~~~----~~~---------------~~~~~~~~~~~~---~~~~~~~----~~~a~-~~~~v~~~i~ 52 (383) |+ |++. ..+|.... ... .+++...+...+ ..+.... .+..+ +.+.|.+|++ T Consensus 1 ~~~~~d~-~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s~l~ 79 (526) T protein:vir:79 1 MAQIVDV-YGNPIRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMS 79 (526) T ss_pred CCeeeCC-CCCccCccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHH Confidence 33 2221 11111000 000 001000000000 1111110 11112 5788999999 Q ss_pred HHHHhhhhCceeeecchh-----h----hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCC---ceeEEEEecc Q lcl|NC_018285. 53 QLSNDLATAKLTTSRKQM-----Q----GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNG---RDMKWEYLRP 120 (383) Q Consensus 53 ~ia~~ia~~p~~~~~~~~-----~----~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g---~~~~l~~l~~ 120 (383) .+-..|.+++|.|.-... . .+...-+....+.+++..+.. 
.+.+|-++.++++..+| .|..+.+.++ T Consensus 80 ~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~ld-A~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~ 158 (526) T protein:vir:79 80 KRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDALD-GIGHGYSCIELEWALQGREWMPLAFHHRPQ 158 (526) T ss_pred HHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHHHh-hhhhcceeEEEEEeecCCceeEEEeeeecc Confidence 999999999999963211 1 111111111235666666644 66899999999976543 4678889999 Q ss_pred ceeEEEEcCCCceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_018285. 121 SQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALN 200 (383) Q Consensus 121 ~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~ 200 (383) .++.+..+.. ..+.++ ++ .....++++...++.++....+..+|.+.+..+.-....-....++...|.+..|. T Consensus 159 ~~F~~~~~~~-~~l~~~---~~--~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~ 232 (526) T protein:vir:79 159 SWFQLNPEDQ-NELRLR---DN--SPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGL 232 (526) T ss_pred cceEeccCCC-cEEEec---CC--CCCceeecCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCC Confidence 8887655432 223222 11 12345677776666666666777889999999988888888899999999999999 Q ss_pred cceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccC-hhhHHHHHHHHHHHHHHHHHhcCCHHHhccc Q lcl|NC_018285. 201 ANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIK-SNVAQLLKQADWTTGQFAKVYGIPENVVGGQ 279 (383) Q Consensus 201 ~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~-~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~ 279 (383) |-.+.+.+...++++++.+.+.......+++ ++++.|++++=+..+ .....|.+..++.-++|+.+. 
||.+ T Consensus 233 P~~igky~~~a~~~ek~~L~~av~~i~~da~--~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i------LGqt 304 (526) T protein:vir:79 233 PIRLGKYPPGTADEEKATLLRAVTGLGHAAA--GIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV------LGGT 304 (526) T ss_pred ceEEEecCCCCCHHHHHHHHHHHHHHhcCcE--EEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH------hhhh Confidence 9999999988888888888888877766554 455666555544432 222346777788888888763 3321 Q ss_pred c--------cCcC-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcch---hhcc------------chhhhccCHHHHHHHH Q lcl|NC_018285. 280 G--------DQQS-SLEMSSNVYSKAVARYLRPFLSELSQKLSCD---VDAD------------IFPAVDPTGANYISRI 335 (383) Q Consensus 280 ~--------~~~~-~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~---~e~~------------~~~~~~~~~~~~~~~~ 335 (383) . .+++ ..+.......+.+.--++.++..||+.|+.. +.|. .......|...+++.+ T Consensus 305 lTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~ 384 (526) T protein:vir:79 305 LTSTTSQSGGGAFALGQVHNEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSI 384 (526) T ss_pred hccccccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHH Confidence 1 1111 1223345566777788999999999877542 1111 0111223446678888 Q ss_pred HHHHhCCC-cCHHHHHHHhhcCCcCCcchhHHh--CCCCC-CC-C-----------CCCCCCCC Q lcl|NC_018285. 336 NSMVKSGT-LAQNQGLYILQQAEILPKELPKGE--NPNRT-IL-K-----------GGETNGQD 383 (383) Q Consensus 336 ~~l~~~g~-~t~nE~r~~lg~~~~~~~d~~~~~--~~~~~-~~-~-----------ggd~~~~d 383 (383) .+|+..|+ ++..++++++|.+....++ .... ..+.. +. 
+ +-...+++ T Consensus 385 ~~L~~~G~~i~~~~i~e~~gip~~~~~e-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (526) T protein:vir:79 385 PALVNVGLEIPSAWVYDKLGIPQPAKNE-PVLRPAAQPAILSRQHGQRVAALATIVGPRYGDQQ 447 (526) T ss_pred HHHHhCCCcCCHHHHHHHhCCCCCCCch-hhccccCCccccccccccccccccccccccCchhh Confidence 89999887 7999999999986433332 1110 00000 00 0 00011111 No 129 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=99.53 E-value=3.7e-14 Score=94.23 Aligned_cols=362 Identities=12% Similarity=0.032 Sum_probs=213.5 Q ss_pred Cc-hhhhhhcCCc----cccccccc--ccchhhcc-------------cc---cCCce-----echhhhhccHHHHHHHH Q lcl|NC_018285. 1 MP-IFNLATESPP----NNQGGFFD--ITDPEFLA-------------TL---NGSEW-----VSAETALKNSDLFSIIS 52 (383) Q Consensus 1 Mg-lf~~~~~~~~----~~~~~~~~--~~~~~~~~-------------~~---~~~~~-----~~~~~a~~~~~v~~~i~ 52 (383) |+ |++. ..+|. ..+..... .....+.. .+ .++.. +-.+.-++.+.|.+|++ T Consensus 1 m~~~~d~-~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L~~dm~~~D~hi~s~l~ 79 (512) T protein:vir:19 1 MGRILDI-SGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADLAFDMEEKDTHLFSELS 79 (512) T ss_pred CcceeCC-CCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHH Confidence 43 2221 11111 00000000 00001100 00 01111 11122236888999999 Q ss_pred HHHHhhhhCceeeecc---h--hh--------hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeec---CCCceeEEE Q lcl|NC_018285. 53 QLSNDLATAKLTTSRK---Q--MQ--------GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRN---DNGRDMKWE 116 (383) Q Consensus 53 ~ia~~ia~~p~~~~~~---~--~~--------~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~---~~g~~~~l~ 116 (383) .+-..|.+++|.|.-. + .. .|...| ...+++..+. +.+.+|-++.++++. +...|..+. 
T Consensus 80 ~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~----~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~~~~~ 154 (512) T protein:vir:19 80 KRRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAA----WFEDALFDAG-DAILKGYSMQEIEWGWLGKMRVPVALH 154 (512) T ss_pred HHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCC----CHHHHHHHHH-hhhhhcceeeeeEeeeeCCceeeeeee Confidence 9999999999999632 1 11 122333 3566666664 577899999999874 334577899 Q ss_pred EeccceeEEEEcCCCceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 117 YLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLK 196 (383) Q Consensus 117 ~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ 196 (383) +.++.++.+..+. ...+.++ ++ .....++++...++.++....+..+|.+.+..+.-....-....+....|.+ T Consensus 155 ~r~~~~f~~~~~~-~~~lr~~---~~--~~~G~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E 228 (512) T protein:vir:19 155 HRDPALFCANPDN-LNELRLR---DA--SYHGLELQPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLE 228 (512) T ss_pred eeccccceeccCC-CcEEEec---CC--CCCceeecCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999988765533 2233322 11 1234567777766666666677788999999999998888999999999999 Q ss_pred ccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccC-hhhHHHHHHHHHHHHHHHHHhcCCHHH Q lcl|NC_018285. 
197 NALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIK-SNVAQLLKQADWTTGQFAKVYGIPENV 275 (383) Q Consensus 197 ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~-~~d~~~~e~~~~~~~~Ia~~~gVpp~~ 275 (383) ..|.|-.+.+.+...++++++.+.+.......+++ ++++.|++++-+..+ .....|.+..++..++|+.+ + T Consensus 229 ~yG~P~~igky~~~a~~~ek~~L~~al~~~~~~a~--~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~------i 300 (512) T protein:vir:19 229 IYGLPMRVGKYPTGSTNREKATLMQAVMDIGRRAG--GIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKA------I 300 (512) T ss_pred HcCCCeeEEecCCCCCHHHHHHHHHHHHHHhhCcE--EEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHH------H Confidence 99999999999988888888888888887766554 445666665544332 22234777788888888877 3 Q ss_pred hcccc------cCcC-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcchh---hccc------hh------hhccCHHHHHH Q lcl|NC_018285. 276 VGGQG------DQQS-SLEMSSNVYSKAVARYLRPFLSELSQKLSCDV---DADI------FP------AVDPTGANYIS 333 (383) Q Consensus 276 lg~~~------~~~~-~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~---e~~~------~~------~~~~~~~~~~~ 333 (383) ||.+. ++++ ..+.......+.+.-.++.++..||+.|+..+ .|.. .. ....|....+. T Consensus 301 LGqtlTs~~g~~Gs~a~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~e~eDl~~~a~ 380 (512) T protein:vir:19 301 LGGTLTTEAGDKGARSLGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTSEAGDITALSD 380 (512) T ss_pred hhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCChhhHHHHHH Confidence 55321 1111 22334556677888889999999998876532 2210 00 11123344555 Q ss_pred HHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCC--------CC-CCCCC Q lcl|NC_018285. 334 RINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKG--------GE-TNGQD 383 (383) Q Consensus 334 ~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~g--------gd-~~~~d 383 (383) .+..+..+--++..++++++|.+....+|. 
.....+..+..+ .+ ...++ T Consensus 381 ~~~~l~~G~~i~~~~i~e~~Gip~~~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (512) T protein:vir:19 381 AIPKLAAGMRIPVSWIQEKLHIPQPVGDEA-VFTIQPVVPDNGSQKEAALSAEDIPQED 438 (512) T ss_pred HHHHHhcCCCCCHHHHHHHhCCCCCCCccc-cccCCCccccccccccccccccCCCchh Confidence 566665444569999999999864333321 111111111101 11 11111 No 130 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=99.49 E-value=5.6e-13 Score=87.74 Aligned_cols=373 Identities=11% Similarity=0.024 Sum_probs=203.0 Q ss_pred CchhhhhhcCCcccccccccccc--------hhh-----ccc--------ccCCcee-chhhhhccHHHHHHHHHHHHhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITD--------PEF-----LAT--------LNGSEWV-SAETALKNSDLFSIISQLSNDL 58 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~--------~~~-----~~~--------~~~~~~~-~~~~a~~~~~v~~~i~~ia~~i 58 (383) |---.+..+ +-.+.++....+. ..+ .+. +.+...+ -.+..+..+.|.+|++.+-..| T Consensus 1 m~k~~~k~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~l~~Rk~av 79 (448) T protein:vir:79 1 MAKRGRKPK-ELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLSDGTVKNALNYIFGRI 79 (448) T ss_pred CCCCCCCCc-cccCcccccccccchhhhhhhhhhcccccccccccchhHhhccccchHHHHHHhhChHHHHHHHHHHHHH Confidence 543222211 1011111110000 000 000 0000011 2344456788999999999999 Q ss_pred hhCceeeecchh-----------hhhccCCCc---cCCHHHHHHHHHHHHHHcCCeEEEEeec--CCCc--eeEEEEecc Q lcl|NC_018285. 59 ATAKLTTSRKQM-----------QGIVDNPSN---SANRFNFYQSIFAQMLLGGEAFAYRWRN--DNGR--DMKWEYLRP 120 (383) Q Consensus 59 a~~p~~~~~~~~-----------~~l~~~PN~---~~t~~~f~~~~~~~~~l~G~a~~~i~r~--~~g~--~~~l~~l~~ 120 (383) .+++|.|.-... ...+..+.. ..++.+++..+ .+.+++|-+++++++. 
.+|+ +..+.+.++ T Consensus 80 ~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~~-lda~~~G~s~~Eivw~~~~~g~~~~~~l~~r~~ 158 (448) T protein:vir:79 80 RSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLILDKIVPIHP 158 (448) T ss_pred hcCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHHH-HHhhhhcceeEEEEeeecCCCceecccccccCC Confidence 999999962111 112222322 23455666555 5567899999999974 3564 445666676 Q ss_pred ceeEEE-EcCCCceeEEEEeecCc----ccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 121 SQVSFN-RLDNQNGLYYNVTFDDP----RIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSL 195 (383) Q Consensus 121 ~~v~~~-~~~~~~~~~y~~~~~~~----~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 195 (383) .+++.. .+.++. ..+....... .......++..-++|..+ ...+..+|.+.+..+.-....-....+....|. T Consensus 159 ~~~~~f~~~~d~~-l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~-~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~ 236 (448) T protein:vir:79 159 FNIDEVLYDEEGG-PKALKLSGEVKGGSQFVSGLEIPIWKTVVFLH-NDDGSFTGQSALRAAVPHWLAKRALILLINHGL 236 (448) T ss_pred ccccceeeecCCc-eEEeecCCcccccccCCCccccccceEEEEec-CccCCcccchhHHHHHHHHHHHHHHHHHHHHHH Confidence 643322 222232 2222111110 111234567788888865 455667899999999998888889999999999 Q ss_pred hccCCcceeEeecCCCC--HHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCH Q lcl|NC_018285. 196 KNALNANGILKIKGGGL--LDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPE 273 (383) Q Consensus 196 ~ng~~~~~i~~~~~~~~--~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp 273 (383) +..|.|--+.+.+...+ +++++.+.+.......+....++++.|++++-++.......+.+..++.-++|+.+. 
T Consensus 237 E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk~i---- 312 (448) T protein:vir:79 237 ERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARAL---- 312 (448) T ss_pred HHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCcccHHHHHHHHHHHHHHHH---- Confidence 99999999999886544 455556665555443333334668888887766554444456667777777877654 Q ss_pred HHhcccc-----cCcCHHH-H-HHHHHHHHHHHHHHHHHHHHHHhhcchh-hccc-----------hhhhccCHHHHHHH Q lcl|NC_018285. 274 NVVGGQG-----DQQSSLE-M-SSNVYSKAVARYLRPFLSELSQKLSCDV-DADI-----------FPAVDPTGANYISR 334 (383) Q Consensus 274 ~~lg~~~-----~~~~~~e-~-~~~~~~~~l~P~~~~i~~~l~~~l~~~~-e~~~-----------~~~~~~~~~~~~~~ 334 (383) ||.+. .++.... . ......+.+.--++.+++.||+.|+..+ .++. ......|...+++. T Consensus 313 --LGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lNfg~~~~~P~~~f~~~e~~Dl~~~a~~ 390 (448) T protein:vir:79 313 --GIDFNTVQLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPSATRFPRLTFEMEERNDFSAAANL 390 (448) T ss_pred --hhhhhccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCCcEEEecCCChHHHHHHHHH Confidence 34211 1111111 1 1234456667788899999998876532 1110 00112233445666 Q ss_pred HHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
335 INSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 335 ~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) +..++..+-...+-.|+.++.+...+++.+........+.+++...+.+ T Consensus 391 ~~~l~~~~~~~~~~~~~~~~~p~~~~~~~~~a~~~~~~~~~~~~~~~~~ 439 (448) T protein:vir:79 391 MGMLINAVKDSEDIPTELKALIDALPSKMRRALGVVDEVREAVRQPADS 439 (448) T ss_pred hhhhhccchhhHHHHHHhhcCCCCCCCccccccCCCCcccccccCCccc Confidence 6777766544444467777776444443332222222222333333333 No 131 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=99.40 E-value=3e-13 Score=89.23 Aligned_cols=370 Identities=12% Similarity=0.015 Sum_probs=190.8 Q ss_pred CchhhhhhcCCcc-----ccccccccc-----ch--hhcc--cccCCceechhhhhccHHHHHHHHHHHHhhhhCceeee Q lcl|NC_018285. 1 MPIFNLATESPPN-----NQGGFFDIT-----DP--EFLA--TLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTS 66 (383) Q Consensus 1 Mglf~~~~~~~~~-----~~~~~~~~~-----~~--~~~~--~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~ 66 (383) .+|-..+.-.... .+.....+. .. .|.. .|.|+ -......+.|.+++|+..||+.+.+-=+.+. T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy--~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~ 144 (698) T protein:vir:10 67 LRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGF--PTLVLLAQLPEYRAMHEVLADECIRTWGEAI 144 (698) T ss_pred ccccccceeccccCCccccchhhhhhcccccccccchhhhccCcchH--HHHHHHhhccchhhHHHHHHHHhhcccceec Confidence 3332222111100 000000000 00 0100 01111 1233345788899999999998865522222 Q ss_pred cchh-----------------------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-------------- Q lcl|NC_018285. 67 RKQM-----------------------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-------------- 109 (383) Q Consensus 67 ~~~~-----------------------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-------------- 109 (383) .... +.|..+-.... 
-.+-+...+.+--++|-+.+++..+++ T Consensus 145 ~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~-V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~ 223 (698) T protein:vir:10 145 GGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLR-IRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYT 223 (698) T ss_pred cccchhhhhhcccccccccccccHHHHHHHHHHHHHHH-HHHHHHHHHHhcccccceEEEEEeecCcccccccccccccc Confidence 1100 11211112222 223333444556678888766654332 Q ss_pred ---CceeEEEEeccceeEEEEcC--------CCceeEEEEeecCcccccceeecccceEEeccCCC------CccccCcc Q lcl|NC_018285. 110 ---GRDMKWEYLRPSQVSFNRLD--------NQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSV------DGGLTSVS 172 (383) Q Consensus 110 ---g~~~~l~~l~~~~v~~~~~~--------~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~------~~~~~G~s 172 (383) |....|..++|.+|+..... .+.+.+|++.. ..+-.+-++.|..... ...+.|.| T Consensus 224 I~kGslKGL~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G--------~~IH~SRL~~~vg~pvpd~LKp~y~f~G~S 295 (698) T protein:vir:10 224 VPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIG--------SEVHATRLHTIVSRPVGDMLKPTYSFAGIS 295 (698) T ss_pred ccCccceeeeeecccccccchhhhccchhhccCCCceEEEec--------ceecceeEEEecCCCchhhhcchhccCCcc Confidence 33455888888888765422 22333444421 1233344443433221 12356999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHH--HHHHhhcCCcceeecC-CCceeeecccCh Q lcl|NC_018285. 173 PLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSR--SRQAMKQMQGGPLVLD-DLEDFTPLEIKS 249 (383) Q Consensus 173 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~--~~~~~~~~~g~~~vl~-~g~~~~~~~~~~ 249 (383) ..+.+...+.....+........+.-.........-..++......+.+ .+-....+..+.++++ .+-+|+..+.+. 
T Consensus 296 v~q~~~e~V~~~~rT~~~v~~Li~~~~~~~l~~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~st~l 375 (698) T protein:vir:10 296 MTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMDLAQALTPGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPL 375 (698) T ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHHhcCChhhHHHHHHHHHHHHhcCccceEEEecCCcceEEEecCc Confidence 9999999999988888887777665333222222222222222222332 2223344455577778 578999888766 Q ss_pred hhHHHHHHHHHHHHHHHHHhcCCHHHhccccc---CcCHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcchh--- Q lcl|NC_018285. 250 NVAQLLKQADWTTGQFAKVYGIPENVVGGQGD---QQSSLEMSSNVY-------SKAVARYLRPFLSELSQKLSCDV--- 316 (383) Q Consensus 250 ~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~---~~~~~e~~~~~~-------~~~l~P~~~~i~~~l~~~l~~~~--- 316 (383) .. +.+....+...||.+-+||...|-+.+. +++.+.-.+.|| +..|+|.++.+-+.+-+..+..+ T Consensus 376 SG--LddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp~ 453 (698) T protein:vir:10 376 SG--LDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPS 453 (698) T ss_pred CC--HHHHHHHHHHHHHhhhcCchhhhhccCCcccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCc Confidence 44 5566677788999999999888865432 223333334444 45789999998888887776543 Q ss_pred -hccchhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhhcCC---c----CCcchh---------HHhCCCCC Q lcl|NC_018285. 317 -DADIFPAVDPTGANYIS-------RINSMVKSGTLAQNQGLYILQQAE---I----LPKELP---------KGENPNRT 372 (383) Q Consensus 317 -e~~~~~~~~~~~~~~~~-------~~~~l~~~g~~t~nE~r~~lg~~~---~----~~~d~~---------~~~~~~~~ 372 (383) .|...++-+++..++++ ....++..|+++++|+|++|...+ + +.+|.+ .....-.. T Consensus 454 i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~~~~ 533 (698) T protein:vir:10 454 IKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADDDIDGVLTYVQR 533 (698) T ss_pred ceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCCcchHHHhhhcC Confidence 33333444555555543 234567899999999999987542 2 111211 11000011 Q ss_pred CCCCCCCCC-CC Q lcl|NC_018285. 
373 ILKGGETNG-QD 383 (383) Q Consensus 373 ~~~ggd~~~-~d 383 (383) ..+||++.. ++ T Consensus 534 ~~~~~~~~~~~~ 545 (698) T protein:vir:10 534 MAEGGDTGAPTA 545 (698) T ss_pred CcCCCCcccccc Confidence 234555433 22 No 132 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=99.39 E-value=5e-13 Score=88.01 Aligned_cols=370 Identities=11% Similarity=0.019 Sum_probs=187.5 Q ss_pred CchhhhhhcCCcc-----ccccccccc-----ch--hhcc--cccCCceechhhhhccHHHHHHHHHHHHhhhhCceeee Q lcl|NC_018285. 1 MPIFNLATESPPN-----NQGGFFDIT-----DP--EFLA--TLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTS 66 (383) Q Consensus 1 Mglf~~~~~~~~~-----~~~~~~~~~-----~~--~~~~--~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~ 66 (383) .+|-..+.-.... .+.....+. .. .|.. .|.|+ -......+.|.+++|+..||+.+.+-=+.+. T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy--~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~ 144 (695) T protein:vir:78 67 LRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGF--PTLVLLAQLPEYRAMHEVLADECIRTWGEAI 144 (695) T ss_pred cccceeceeccccCCccccchhhhhhcccccccccchhhhccCcchH--HHHHHHhhccchhhHHHHHHHHhhcccceec Confidence 3332221111100 000000000 00 0100 01111 1233345788899999999998865522222 Q ss_pred cchh-----------------------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-------------- Q lcl|NC_018285. 67 RKQM-----------------------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-------------- 109 (383) Q Consensus 67 ~~~~-----------------------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-------------- 109 (383) .... +.|..+-... 
.-.+-+...+.+--++|-+.+++..+++ T Consensus 145 ~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL-~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~ 223 (695) T protein:vir:78 145 GGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERL-RIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYT 223 (695) T ss_pred cccchhhhhhcccccccccccccHHHHHHHHHHHHHH-HHHHHHHHHHHhhccccceEEEEEeccCcccccccccccccc Confidence 1100 1121111222 2223334445566678888777655332 Q ss_pred ---CceeEEEEeccceeEEEEcC--------CCceeEEEEeecCcccccceeecccceEEeccCCC------CccccCcc Q lcl|NC_018285. 110 ---GRDMKWEYLRPSQVSFNRLD--------NQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSV------DGGLTSVS 172 (383) Q Consensus 110 ---g~~~~l~~l~~~~v~~~~~~--------~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~------~~~~~G~s 172 (383) |....|..++|.+|+..... .+.+.+|++.. +.+-.+-++.|..... ...+.|+| T Consensus 224 I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G--------~kIH~SRL~~f~g~plPd~LKp~y~~~GiS 295 (695) T protein:vir:78 224 VPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIG--------TEVHATRLHTIVSRPVGDMLKPTYSFAGIS 295 (695) T ss_pred ccCcceeeeEeecccccccchhhhccchhhccCCCceEEEec--------eEEeeeeEEEecCCCchhhhhcccccCccc Confidence 33455888888888775422 22333444421 1233344443433221 12357999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHH--HHHHhhcCCcceeecC-CCceeeecccCh Q lcl|NC_018285. 173 PLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSR--SRQAMKQMQGGPLVLD-DLEDFTPLEIKS 249 (383) Q Consensus 173 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~--~~~~~~~~~g~~~vl~-~g~~~~~~~~~~ 249 (383) ..+.+...+............+...-.........-..+.......+.. .+-....+..++++++ ..-+|+..+.+. 
T Consensus 296 v~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~stsl 375 (695) T protein:vir:78 296 MTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPL 375 (695) T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHhhhhHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEecCCcceEEEeccc Confidence 9999999999888888887777665333222111112222222222222 2223334445577778 478999888766 Q ss_pred hhHHHHHHHHHHHHHHHHHhcCCHHHhccccc---CcCHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcchh--- Q lcl|NC_018285. 250 NVAQLLKQADWTTGQFAKVYGIPENVVGGQGD---QQSSLEMSSNVY-------SKAVARYLRPFLSELSQKLSCDV--- 316 (383) Q Consensus 250 ~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~---~~~~~e~~~~~~-------~~~l~P~~~~i~~~l~~~l~~~~--- 316 (383) .. +.+....+...||.+-+||...|-+.+. +++.+.-.+.|| +..|+|.++.+-+.+-+..|..+ T Consensus 376 SG--LddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpd 453 (695) T protein:vir:78 376 SG--LDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPS 453 (695) T ss_pred CC--HHHHHHHHHHHHHhhhcCchhhhhccCCccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCc Confidence 44 5566677788999999999888865432 222333333444 45789999998888887776543 Q ss_pred -hccchhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhhcCC---c----CCcc---------hhHHhCCCCC Q lcl|NC_018285. 317 -DADIFPAVDPTGANYIS-------RINSMVKSGTLAQNQGLYILQQAE---I----LPKE---------LPKGENPNRT 372 (383) Q Consensus 317 -e~~~~~~~~~~~~~~~~-------~~~~l~~~g~~t~nE~r~~lg~~~---~----~~~d---------~~~~~~~~~~ 372 (383) .|...++-+++..++++ ....++..|+++++|+|+++..++ + +.+| +......... T Consensus 454 i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~ 533 (695) T protein:vir:78 454 IKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQR 533 (695) T ss_pred ceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcC Confidence 33333444555555443 334567899999999999987543 1 1111 1111111111 Q ss_pred CCCCCCCCCCC Q lcl|NC_018285. 
373 ILKGGETNGQD 383 (383) Q Consensus 373 ~~~ggd~~~~d 383 (383) ..+|+|..+-- T Consensus 534 ~~~~~~~~~~~ 544 (695) T protein:vir:78 534 LAEGGDTGAPG 544 (695) T ss_pred cccccccCCCC Confidence 11122221100 No 133 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=99.39 E-value=5.3e-13 Score=87.90 Aligned_cols=369 Identities=12% Similarity=0.017 Sum_probs=187.0 Q ss_pred CchhhhhhcCCcccc-----cc-cc-----cccchh--hcc--cccCCceechhhhhccHHHHHHHHHHHHhhhhCceee Q lcl|NC_018285. 1 MPIFNLATESPPNNQ-----GG-FF-----DITDPE--FLA--TLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTT 65 (383) Q Consensus 1 Mglf~~~~~~~~~~~-----~~-~~-----~~~~~~--~~~--~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~ 65 (383) -+| ...+.-++... .. .. ...... |.. .|.|+ -......+.|.+++|+..||+.+.+-=+.+ T Consensus 66 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy--~~la~laQ~~eyr~~~~~ia~e~~R~w~~~ 142 (694) T protein:vir:10 66 LRL-ARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGF--PTLVLLAQLPEYRAMHEVLADECIRTWGEA 142 (694) T ss_pred hhh-hhhccccccCCCccccchhhhhhccCcccccchhhhhccCcchH--HHHHHHhhccchhhHHHHHHHHhhccccee Confidence 222 11111111100 00 00 000000 100 11111 123334578889999999999886552222 Q ss_pred ecchh-----------------------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC------------- Q lcl|NC_018285. 66 SRKQM-----------------------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN------------- 109 (383) Q Consensus 66 ~~~~~-----------------------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~------------- 109 (383) ..... +.|..+-... 
.-.+-+...+.+--++|-+.+++..+++ T Consensus 143 ~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl-~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~ 221 (694) T protein:vir:10 143 IGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERL-RIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPY 221 (694) T ss_pred ccccchhhhhhcccccccccccccHHHHHHHHHHHHHH-HHHHHHHHHHHhhccccceEEEEEeecCccccccccccccc Confidence 21100 1122111222 2223334445566678888777654332 Q ss_pred ----CceeEEEEeccceeEEEEcC--------CCceeEEEEeecCcccccceeecccceEEeccCCC------CccccCc Q lcl|NC_018285. 110 ----GRDMKWEYLRPSQVSFNRLD--------NQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSV------DGGLTSV 171 (383) Q Consensus 110 ----g~~~~l~~l~~~~v~~~~~~--------~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~------~~~~~G~ 171 (383) |....|..++|.+|+..... .+.+.+|++.. +.+-.+-++.|..... ...+.|+ T Consensus 222 ~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G--------~~IH~SRL~~f~g~plPd~LKp~y~~~G~ 293 (694) T protein:vir:10 222 TVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIG--------TEVHATRLHTIVSRPVGDMLKPTYSFAGI 293 (694) T ss_pred cccCcceeeeEeecccccccchhhhccchhhccCCCceEEEec--------eEEeeeeEEEecCCCchhhhhcccccCcc Confidence 33455888888888775422 22333444421 1233344443433221 1235799 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHH--HHHHhhcCCcceeecC-CCceeeecccC Q lcl|NC_018285. 172 SPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSR--SRQAMKQMQGGPLVLD-DLEDFTPLEIK 248 (383) Q Consensus 172 s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~--~~~~~~~~~g~~~vl~-~g~~~~~~~~~ 248 (383) |..+.+...+............+...-.........-..+.......+.. 
.+-....+..++++++ ..-+|+..+.+ T Consensus 294 Sv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~sts 373 (694) T protein:vir:10 294 SMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTP 373 (694) T ss_pred cHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEecCCcceEEEecc Confidence 99999999999888888887777655333222111112222222222222 2223334445577778 47899988876 Q ss_pred hhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc---CcCHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcchh-- Q lcl|NC_018285. 249 SNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD---QQSSLEMSSNVY-------SKAVARYLRPFLSELSQKLSCDV-- 316 (383) Q Consensus 249 ~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~---~~~~~e~~~~~~-------~~~l~P~~~~i~~~l~~~l~~~~-- 316 (383) ... +.+....+...||.+-+||...|-+.+. +++.+.-.+.|| +..|+|.++.+-+.+-+..|..+ T Consensus 374 lSG--LddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp 451 (694) T protein:vir:10 374 LSG--LDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDP 451 (694) T ss_pred cCC--HHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC Confidence 644 5566677788999999999888865432 222332333444 45789999998888887776543 Q ss_pred --hccchhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhhcCC---c----CCcc---------hhHHhCCCC Q lcl|NC_018285. 317 --DADIFPAVDPTGANYIS-------RINSMVKSGTLAQNQGLYILQQAE---I----LPKE---------LPKGENPNR 371 (383) Q Consensus 317 --e~~~~~~~~~~~~~~~~-------~~~~l~~~g~~t~nE~r~~lg~~~---~----~~~d---------~~~~~~~~~ 371 (383) .|...++-+++..++++ ....++..|+++++|+|+++...+ + +.+| +........ T Consensus 452 ~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~ 531 (694) T protein:vir:10 452 SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQ 531 (694) T ss_pred cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhc Confidence 33333444555555443 334567899999999999987542 1 1111 111111111 Q ss_pred CCCCCCCCCCCC Q lcl|NC_018285. 
372 TILKGGETNGQD 383 (383) Q Consensus 372 ~~~~ggd~~~~d 383 (383) ...+|+|.++-- T Consensus 532 ~~~~~~~~~~~~ 543 (694) T protein:vir:10 532 RLAEGGDTGAPG 543 (694) T ss_pred CcccccccCCCC Confidence 111122221100 No 134 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=99.38 E-value=5.4e-13 Score=87.83 Aligned_cols=370 Identities=11% Similarity=0.023 Sum_probs=187.1 Q ss_pred CchhhhhhcCCcc-----ccccccccc-----ch--hhcc--cccCCceechhhhhccHHHHHHHHHHHHhhhhCceeee Q lcl|NC_018285. 1 MPIFNLATESPPN-----NQGGFFDIT-----DP--EFLA--TLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTS 66 (383) Q Consensus 1 Mglf~~~~~~~~~-----~~~~~~~~~-----~~--~~~~--~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~ 66 (383) .+|-..+.-.... .+.....+. .. .|.. .+.|+ -......+.|.+++|+..||+.+.+-=+.+. T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy--~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~ 144 (695) T protein:vir:36 67 LRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGF--PTLVLLAQLPEYRAMHEVLADECIRTWGEAI 144 (695) T ss_pred cccceeceecccccCccccchhhhhhcccccccccchhhhccCcchH--HHHHHHhhccchhhHHHHHHHHhhcccceec Confidence 3332222111100 000000000 00 0100 01111 1233345788899999999998865522222 Q ss_pred cch-----------------------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-------------- Q lcl|NC_018285. 67 RKQ-----------------------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-------------- 109 (383) Q Consensus 67 ~~~-----------------------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-------------- 109 (383) ... .+.|..+-.. 
..-.+-+...+.+--++|-+.+++..+++ T Consensus 145 ~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~er-L~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~ 223 (695) T protein:vir:36 145 GGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIER-LRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYT 223 (695) T ss_pred ccchhhhhhccccccccccccCchHHHHHHHHHHHH-HHHHHHHHHHHHhhccccceEEEEEeccCcccccccccccccc Confidence 110 0112111111 22223334455666678888777655332 Q ss_pred ---CceeEEEEeccceeEEEEcC--------CCceeEEEEeecCcccccceeecccceEEeccCCC------CccccCcc Q lcl|NC_018285. 110 ---GRDMKWEYLRPSQVSFNRLD--------NQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSV------DGGLTSVS 172 (383) Q Consensus 110 ---g~~~~l~~l~~~~v~~~~~~--------~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~------~~~~~G~s 172 (383) |....|..++|.+|+..... .+.+.+|++.. +.+-.+-++.|..... ...+.|+| T Consensus 224 I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G--------~kIH~SRL~~f~g~plPd~LKp~y~~~GiS 295 (695) T protein:vir:36 224 VPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIG--------TEVHATRLHTIVSRPVGDMLKPTYSFAGIS 295 (695) T ss_pred ccCcceeeeEeecccccccchhhhccchhhccCCCceEEEec--------eEEeeeeEEEecCCCchhhhhcccccCccc Confidence 33455888888888775422 22333444421 1233344443433221 12357999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHH--HHHHhhcCCcceeecC-CCceeeecccCh Q lcl|NC_018285. 173 PLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSR--SRQAMKQMQGGPLVLD-DLEDFTPLEIKS 249 (383) Q Consensus 173 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~--~~~~~~~~~g~~~vl~-~g~~~~~~~~~~ 249 (383) ..+.+...+............+.+.-.........-..+.......+.. .+-....+..++++++ ..-+|+..+.+. 
T Consensus 296 v~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~stsl 375 (695) T protein:vir:36 296 MTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPL 375 (695) T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHhhhHHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEecCCcceEEEeccc Confidence 9999999999888888877777655332222111111222222222222 2223334445577778 478999888766 Q ss_pred hhHHHHHHHHHHHHHHHHHhcCCHHHhccccc---CcCHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcchh--- Q lcl|NC_018285. 250 NVAQLLKQADWTTGQFAKVYGIPENVVGGQGD---QQSSLEMSSNVY-------SKAVARYLRPFLSELSQKLSCDV--- 316 (383) Q Consensus 250 ~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~---~~~~~e~~~~~~-------~~~l~P~~~~i~~~l~~~l~~~~--- 316 (383) .. +.+....+...||.+-+||...|-+.+. +++.+.-.+.|| +..|+|.++.+-+.+-+..|..+ T Consensus 376 SG--LddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpd 453 (695) T protein:vir:36 376 SG--LDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPS 453 (695) T ss_pred CC--HHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCc Confidence 44 5566677788999999999888865432 222333333444 45789999998888887776543 Q ss_pred -hccchhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhhcCC---c----CCcc---------hhHHhCCCCC Q lcl|NC_018285. 317 -DADIFPAVDPTGANYIS-------RINSMVKSGTLAQNQGLYILQQAE---I----LPKE---------LPKGENPNRT 372 (383) Q Consensus 317 -e~~~~~~~~~~~~~~~~-------~~~~l~~~g~~t~nE~r~~lg~~~---~----~~~d---------~~~~~~~~~~ 372 (383) .|...++-+++..++++ ....++..|+++++|+|+++..++ + +.+| +......... T Consensus 454 i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~ 533 (695) T protein:vir:36 454 IKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQR 533 (695) T ss_pred ceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcC Confidence 33333444555555443 334567899999999999987543 1 1111 1111111111 Q ss_pred CCCCCCCCCCC Q lcl|NC_018285. 
373 ILKGGETNGQD 383 (383) Q Consensus 373 ~~~ggd~~~~d 383 (383) ..+|+|..+-- T Consensus 534 ~~~~~~~~~~~ 544 (695) T protein:vir:36 534 LAEGGDTGAPG 544 (695) T ss_pred cccccccCCCC Confidence 11122221100 No 135 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=99.37 E-value=1.7e-11 Score=79.58 Aligned_cols=369 Identities=14% Similarity=0.056 Sum_probs=197.8 Q ss_pred CchhhhhhcCCc--ccccccccccc-------------hhhccc----------ccCCceechhhhhccHHHHHHHHHHH Q lcl|NC_018285. 1 MPIFNLATESPP--NNQGGFFDITD-------------PEFLAT----------LNGSEWVSAETALKNSDLFSIISQLS 55 (383) Q Consensus 1 Mglf~~~~~~~~--~~~~~~~~~~~-------------~~~~~~----------~~~~~~~~~~~a~~~~~v~~~i~~ia 55 (383) |.-=+ +++. .+.++...... ..+.+. ..+... -.+..+..+.|.+|++.+- T Consensus 1 m~kk~---~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~-ly~~m~~D~hi~s~l~~Rk 76 (448) T protein:vir:77 1 MAKRG---RKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLL-VYHKMLSDGTVKNALNYIF 76 (448) T ss_pred CCCCC---CCCcccCCcccccchhhhhhhccchhhhcccccccccccchhHhhccccchH-HHHHHhhChHHHHHHHHHH Confidence 55433 2222 11111111000 000000 011111 1344566788999999999 Q ss_pred HhhhhCceeeecchh-----------hhhccCC---CccCCHHHHHHHHHHHHHHcCCeEEEEeec--CCCc--eeEEEE Q lcl|NC_018285. 56 NDLATAKLTTSRKQM-----------QGIVDNP---SNSANRFNFYQSIFAQMLLGGEAFAYRWRN--DNGR--DMKWEY 117 (383) Q Consensus 56 ~~ia~~p~~~~~~~~-----------~~l~~~P---N~~~t~~~f~~~~~~~~~l~G~a~~~i~r~--~~g~--~~~l~~ 117 (383) ..|.+++|.|.-... ...+..+ ....++.+++..+ .+.+.+|-+++++++. .+|. 
+..+.+ T Consensus 77 ~av~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~dg~~~~~~l~~ 155 (448) T protein:vir:77 77 GRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLILDKIVP 155 (448) T ss_pred HHHhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecCCCceeeccccc Confidence 999999999963111 0112222 1234566777777 5788999999999974 3564 446666 Q ss_pred eccceeEEEEcCCCceeEEEEeecCc----ccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 118 LRPSQVSFNRLDNQNGLYYNVTFDDP----RIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLN 193 (383) Q Consensus 118 l~~~~v~~~~~~~~~~~~y~~~~~~~----~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 193 (383) .++.+++....+..+.+.++...... .......+|..-++|..+ ...+..+|.|.+..+.-....-....+.... T Consensus 156 r~~~~~~~f~~~~~~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~-~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~ 234 (448) T protein:vir:77 156 IHPFNIDEVLYDEEGGPKALKLSGEVKGGSQFVNGLEIPIWKTVVFLH-NDDGSFTGQSALRAAVPHWLAKRALILLINH 234 (448) T ss_pred cCCCccceeeeecCCceEEEecCCcccccccCCCccccccceEEEEec-CCcCCcccchHHHHHHHHHHHHHhhHHHHHH Confidence 67654433222222223333221111 111234567888888865 4456678999999999888888888899999 Q ss_pred HHhccCCcceeEeecCCCC--HHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018285. 
194 SLKNALNANGILKIKGGGL--LDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGI 271 (383) Q Consensus 194 ~~~ng~~~~~i~~~~~~~~--~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gV 271 (383) |.+..|.|--+.+.+...+ +++++.+.+.......+....++++.|++++-++.+.....+.+..++.-++|+.+..- T Consensus 235 f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLG 314 (448) T protein:vir:77 235 GLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGI 314 (448) T ss_pred HHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCccCHHHHHHHHHHHHHHHHhc Confidence 9999999999999876543 45566666655544333333466788887765555444444666777777888776532 Q ss_pred CHHHhcccccCcCHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcchh-hccc-----hh------hhccCHHHHHHHHHHH Q lcl|NC_018285. 272 PENVVGGQGDQQSSL-EMSSNVYSKAVARYLRPFLSELSQKLSCDV-DADI-----FP------AVDPTGANYISRINSM 338 (383) Q Consensus 272 pp~~lg~~~~~~~~~-e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~-e~~~-----~~------~~~~~~~~~~~~~~~l 338 (383) .---.+..+..+... ..........+.-.++.|++.||+.|+..+ .++. .+ ....|...+++.+.++ T Consensus 315 qtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~~e~eDl~~~a~~~~~l 394 (448) T protein:vir:77 315 DFNTVQLNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFEMEERNDFSAAANLMGML 394 (448) T ss_pred cccccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCCChhhHHHHHHHhHHH Confidence 210011111111222 222234556677788899999998886532 1110 00 0112333445555555 Q ss_pred HhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 339 VKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 339 ~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) + +-.|+.+|.+.-. 
++..........+..|-+....+ T Consensus 395 ~-------~~~~~~~~ip~~~-~~~~~~~~~~~~~~~~~~~~~~~ 431 (448) T protein:vir:77 395 I-------NAVKDSEDIPTEL-KALIDALPSKMRRALGVVDEVRE 431 (448) T ss_pred H-------HHHHHHhcCCccC-CcCCCCCchhcccccCCCCCCCc Confidence 4 4577787765421 11111110011111121111111 No 136 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=99.36 E-value=4.4e-12 Score=82.86 Aligned_cols=355 Identities=11% Similarity=0.072 Sum_probs=201.2 Q ss_pred Cchhhhhhc---CCccccccccccc------chhhcccccCCce---e-chhhhh-ccHHHHHHHHHHHHhhhhCceeee Q lcl|NC_018285. 1 MPIFNLATE---SPPNNQGGFFDIT------DPEFLATLNGSEW---V-SAETAL-KNSDLFSIISQLSNDLATAKLTTS 66 (383) Q Consensus 1 Mglf~~~~~---~~~~~~~~~~~~~------~~~~~~~~~~~~~---~-~~~~a~-~~~~v~~~i~~ia~~ia~~p~~~~ 66 (383) |-.-+.-.. +.......-.+.+ |+.+- . .++.. + -.+..+ +.+.|.+|++.+...|.++++.|. T Consensus 3 ~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~D~~lr-~-~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~w~V~ 80 (446) T protein:vir:98 3 MEVRNAPTPAIRRRTIYAMEHLGLATSYLSEDGGYK-R-AGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKVGPYQ 80 (446) T ss_pred ccccCCCchhhhhhhhhccccchhhcccCCcchHhh-h-cCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCCceec Confidence 222111000 0000000000000 11000 0 11110 0 123334 478999999999999999999997 Q ss_pred cchhh------hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-C--ceeE----EEEeccceeEEEEcCCCce Q lcl|NC_018285. 67 RKQMQ------GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-G--RDMK----WEYLRPSQVSFNRLDNQNG 133 (383) Q Consensus 67 ~~~~~------~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-g--~~~~----l~~l~~~~v~~~~~~~~~~ 133 (383) -.+.. .++... ..++....+.+.+.+|-++.++++... | .|.. +....|..+....+.+... 
T Consensus 81 p~~~~~a~~v~~~l~~~-----~~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~~~r~~~~~~~~~ 155 (446) T protein:vir:98 81 HGDKRIKKFIDDQLRNR-----AKTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPLQVMLIANDNGRI 155 (446) T ss_pred CccHHHHHHHHHHHhhc-----CchhHHHHHHHHHhhCceeeeEEEeecccccccchhhccccccccccceeeeccCCcc Confidence 43211 111111 123444456788899999999997532 2 1211 1112222222222211111 Q ss_pred eE-EEEe---------------------ecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 134 LY-YNVT---------------------FDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLT 191 (383) Q Consensus 134 ~~-y~~~---------------------~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 191 (383) .. ...+ ......+....+|...++|+++....+..+|.|.+..+.-....-....+.. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~~~~~~~p~G~gLlr~~~w~~~fK~~~~~~w 235 (446) T protein:vir:98 156 VDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINYNTKGNNPWGTSCLTSVLDYSIFKRAFRDMM 235 (446) T ss_pred ccccccchhhcccccccCcccchhhhhhhhcccCcccccccccceEEEEecCCCCCccccchHHHHHHHHHHHHhhHHHH Confidence 00 0000 0001122345678888888888777777899999999999888889999999 Q ss_pred HHHHhccCCcceeEeecCCCCHHHH-------------HHHHHHHHHhhcCCccee---ecCCCceeeecccChhh-HHH Q lcl|NC_018285. 192 LNSLKNALNANGILKIKGGGLLDFK-------------TKVSRSRQAMKQMQGGPL---VLDDLEDFTPLEIKSNV-AQL 254 (383) Q Consensus 192 ~~~~~ng~~~~~i~~~~~~~~~e~~-------------~~~~~~~~~~~~~~g~~~---vl~~g~~~~~~~~~~~d-~~~ 254 (383) ..+.+..|.|--+.+.+...+++++ +++.+.......+++.++ +++.|++++-++..... 
..+ T Consensus 236 ~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~~~~ 315 (446) T protein:vir:98 236 LIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNFSDSF 315 (446) T ss_pred HHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCCceEEeeccccCChhhH Confidence 9999999999999998765443322 123333333333444433 34889888765443222 247 Q ss_pred HHHHHHHHHHHHHHhcCCHHHhccc-cc-CcCH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcchh-hccc---------- Q lcl|NC_018285. 255 LKQADWTTGQFAKVYGIPENVVGGQ-GD-QQSS-LEMSSNVYSKAVARYLRPFLSELSQKLSCDV-DADI---------- 320 (383) Q Consensus 255 ~e~~~~~~~~Ia~~~gVpp~~lg~~-~~-~~~~-~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~-e~~~---------- 320 (383) .+..++.-++|+.+....--.+|.. ++ ++++ .+.......+.++--++.|.+.||+.|+..+ .++. T Consensus 316 ~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~aDa~~i~~tln~~Li~~l~~lNf~~~~~~~~~~ 395 (446) T protein:vir:98 316 ERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGKINSIFDTVIHAFTEQVIGNLIRLNFDPALYPLASN 395 (446) T ss_pred HHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccc Confidence 7888888899999887765555532 11 2222 3333456667788889999999999886432 1110 Q ss_pred --hh----hhccCHHHHHHHHHHHHhCCCcCH---HHHHHHhhcCCcCCcc Q lcl|NC_018285. 321 --FP----AVDPTGANYISRINSMVKSGTLAQ---NQGLYILQQAEILPKE 362 (383) Q Consensus 321 --~~----~~~~~~~~~~~~~~~l~~~g~~t~---nE~r~~lg~~~~~~~d 362 (383) .. 
.-..|...+++.+.+|++.|+.++ +.+|+.+|.+...+.- T Consensus 396 ~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~~~ 446 (446) T protein:vir:98 396 TGYITRLPGRATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAISST 446 (446) T ss_pred cccceeccCChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCCCC Confidence 00 112355667788889999998764 5599999987654332 No 137 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=99.19 E-value=1.9e-10 Score=73.86 Aligned_cols=281 Identities=9% Similarity=0.004 Sum_probs=161.9 Q ss_pred EEEEeecCC-C--ceeEEEEeccceeE-EEEcCCCceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHH Q lcl|NC_018285. 101 FAYRWRNDN-G--RDMKWEYLRPSQVS-FNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMA 176 (383) Q Consensus 101 ~~~i~r~~~-g--~~~~l~~l~~~~v~-~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~ 176 (383) +.+|++... | .|..|.+.++.++. +..+.++....++.... .+.....++....|+.++....+..+|.+.+.. T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~--~g~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~ 78 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLVAIEQWGV--FGKATVRIPVDRLVVFVNEREGANWLGQSLLRQ 78 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCceeEEEecCC--CCCCcceeccCCEEEEEeCCCCCCccchhhHHH Confidence 788887543 3 36778888887665 33445555554443322 222345788888887776666677889999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC--CCHH-----------HHHHHHHHHHHhhcCCcceeecCCCceee Q lcl|NC_018285. 177 LGRELDIQKASDKLTLNSLKNALNANGILKIKGG--GLLD-----------FKTKVSRSRQAMKQMQGGPLVLDDLEDFT 243 (383) Q Consensus 177 ~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~--~~~e-----------~~~~~~~~~~~~~~~~g~~~vl~~g~~~~ 243 (383) +.-....-....++...+.+..+.|--+.+.+.. 
...+ .++.+.........+....++++.|++++ T Consensus 79 ~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~ie 158 (355) T protein:vir:78 79 AYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANFT 158 (355) T ss_pred HHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecCCceEE Confidence 9998888899999999999988555555554422 2111 12222222222222323467789998887 Q ss_pred ecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc-Cc-CHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchh---hc Q lcl|NC_018285. 244 PLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD-QQ-SSLEMSSNVYSKAVARYLRPFLSELSQKLSCDV---DA 318 (383) Q Consensus 244 ~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~-~~-~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~---e~ 318 (383) -++.......+.+..++.-++|+.++.-.---.+..+. ++ ...+.......+.+.-.++.|.+.||+.|+..+ .| T Consensus 159 ~~ea~g~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~ 238 (355) T protein:vir:78 159 LTGVQGKLPEMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHIADVTQQHVVEDLVDQNW 238 (355) T ss_pred EeecCCCcccHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 77665555567777888888888776333111111111 22 223344566677788888999999998776531 11 Q ss_pred cc--------hhhhccCHHHHHHHHHHHHhCCCcCHH-----HHHHHhhcCCcCCcchhHH---h-CCCCCCCCCCCC-- Q lcl|NC_018285. 319 DI--------FPAVDPTGANYISRINSMVKSGTLAQN-----QGLYILQQAEILPKELPKG---E-NPNRTILKGGET-- 379 (383) Q Consensus 319 ~~--------~~~~~~~~~~~~~~~~~l~~~g~~t~n-----E~r~~lg~~~~~~~d~~~~---~-~~~~~~~~ggd~-- 379 (383) .- ......+....++.+.+++..|+.-.+ .+|+.+|.+.-..++-... . .......+.|+. 
T Consensus 239 ~~~~~~P~~~~~~~~~~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (355) T protein:vir:78 239 GPEEPAPRLVPAQLGKEQPVTAEAIRALVECGAFTADPELEKDLRARYGLPAPAERDDGADAAAAKAAGRRRAKRLPGQR 318 (355) T ss_pred CCCCCCCEEEecCcChhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCCcccCCccccccccccccccCCcc Confidence 10 111233445567888899999987654 4789999864222211100 0 000111111111 Q ss_pred CCCC Q lcl|NC_018285. 380 NGQD 383 (383) Q Consensus 380 ~~~d 383 (383) ...+ T Consensus 319 ~~~~ 322 (355) T protein:vir:78 319 QGAA 322 (355) T ss_pred cccc Confidence 0111 No 138 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=99.13 E-value=1.2e-10 Score=75.04 Aligned_cols=362 Identities=11% Similarity=0.028 Sum_probs=170.1 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechh---hhh-ccHHHHHHHHHHHHhhhh-Cceeeecchhhh--- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAE---TAL-KNSDLFSIISQLSNDLAT-AKLTTSRKQMQG--- 72 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~a~-~~~~v~~~i~~ia~~ia~-~p~~~~~~~~~~--- 72 (383) |+|.+-...-- ..+ +..+..+.....++.+ .+. .+..+..+|+.+++.+-. .|..+...+... T Consensus 23 d~l~~~~~glg-~~r--------~~~~~~~g~~~~~~~~~l~~~Yr~~~ia~~iVd~~~d~~~~~~~~i~~g~~~~~~~~ 93 (449) T protein:vir:10 23 MGLMVPTMGLD-NKR--------HSAWCEYGFPELVTYENLYSLYRRGGIAHGAVEKLVGKCWQTNPEIIEGDDADDSED 93 (449) T ss_pred HHHHHHHhcCC-ccc--------chhhhhcCCcccCCHHHHHHHHhcCchhHHHHHhhhhhhhhcCcccccCccccchhh Confidence 44444322110 000 0011111111112222 222 356678899999987632 222221111100 Q ss_pred ---hccCCCccCCHHHHHHHH---HHHHHHcCCeEEEEee-cCC---------CceeEEEEeccceeEEEE-------cC Q lcl|NC_018285. 73 ---IVDNPSNSANRFNFYQSI---FAQMLLGGEAFAYRWR-NDN---------GRDMKWEYLRPSQVSFNR-------LD 129 (383) Q Consensus 73 ---l~~~PN~~~t~~~f~~~~---~~~~~l~G~a~~~i~r-~~~---------g~~~~l~~l~~~~v~~~~-------~~ 129 (383) +..+-... ....+|..+ ...-.++|-+.+++.. +.. 
+.+..+.|+....+++.. .. T Consensus 94 ~~~~e~~~~~l-~~~~~~~~l~ea~~~~rl~Gga~i~i~v~d~~~l~~Pl~~~~~i~~i~v~~~~~i~~~~~~~dp~sp~ 172 (449) T protein:vir:10 94 ETSWEKKSKQV-FTNRLWRSFAEADRRRLVGRYAGILLHIRDEKDWNLPATKGRGLQKVSVSWAGSLKVAEWDTGINSKT 172 (449) T ss_pred hHHHHHHHHHH-HHHHHHHHHHHHHHhhhccCcEEEEEEecCCCCCCcccccCcceeeEEeeccccCChhhhhcCCCCCC Confidence 00000000 111333332 2333467877777653 332 234556666554444321 12 Q ss_pred CCceeEEEEeec-CcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHH-HHHHHHHhccCC------- Q lcl|NC_018285. 130 NQNGLYYNVTFD-DPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASD-KLTLNSLKNALN------- 200 (383) Q Consensus 130 ~~~~~~y~~~~~-~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~-~~~~~~~~ng~~------- 200 (383) .+.+.+|++... .+.....+.+-++-|+||-.. +.-|.|.++.+...+.....+. .+...+++|-.+ T Consensus 173 yg~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~~----~~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~ 248 (449) T protein:vir:10 173 YGQPKLWKYTERLPNGSSRRVDIHPDRVFILGDY----SEDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFE 248 (449) T ss_pred CCCceEEEEeeeccCCCccceeeccceeEeecCC----CCCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhh Confidence 344566666532 122233345667778887432 2337888888887654333332 223333333211 Q ss_pred ----cceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHh Q lcl|NC_018285. 
201 ----ANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVV 276 (383) Q Consensus 201 ----~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~l 276 (383) ..++....+...++..+++.........+.+ .+.++.+-+|+.++.++.+ +.+..+....+||++-|||...| T Consensus 249 ~~~~~~~l~~~~~~~~e~~~~~~~~~~~~~~~~~~-~~~i~~~~d~~~~~~~~sg--l~d~l~~~~q~iaaa~~IP~t~L 325 (449) T protein:vir:10 249 KEIDFTNLASLYGVSIDELQDKFNEVAGEINRGND-VLMTTQGATVTPLVTSVAD--PTATYNVNLQTAAAGVDIPTRIL 325 (449) T ss_pred hhhhhhhhhHHhhCCchHHHHHHHHHHHHHhccch-heeecCCcceEEEecccCC--hhHHHHHHHHHHHHHhCCCeeee Confidence 1111111122233344444433333333333 3456677789988888765 45566777888999999999888 Q ss_pred cccccCc-CHHHHHHHHHH------HHHHHHHHHHHHHHHHhhc----chhhccchhhhccCHHHHHH-------HHHHH Q lcl|NC_018285. 277 GGQGDQQ-SSLEMSSNVYS------KAVARYLRPFLSELSQKLS----CDVDADIFPAVDPTGANYIS-------RINSM 338 (383) Q Consensus 277 g~~~~~~-~~~e~~~~~~~------~~l~P~~~~i~~~l~~~l~----~~~e~~~~~~~~~~~~~~~~-------~~~~l 338 (383) -+.+... |+.+-.+.|+. +-|.|.++.+-+.|-+.-+ +++.+...++..++..+.++ .+..+ T Consensus 326 ~Gqsp~glnst~D~~nyyd~i~~~Q~~l~p~le~l~~~l~~s~~g~~~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~ 405 (449) T protein:vir:10 326 IGNQQAERSSTEDQKYFNARCQSRRVDLSFEIEDFCDKLIELKIIDAVAKKAVIWDDLNEQTGTEKLTNAKTMGEINQTM 405 (449) T ss_pred eccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCceeEEeCCCCCCCHHHHHHHHHHHHHHHHHH Confidence 6543321 22222333442 2366777777666654433 22333334455555555543 33456 Q ss_pred HhCC---CcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCC Q lcl|NC_018285. 339 VKSG---TLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETNG 381 (383) Q Consensus 339 ~~~g---~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~ 381 (383) +.+| +++.+|+|+.+|++|..+.+.+.-. .....++.|... 
T Consensus 406 ~~ag~~~~~~~~EiR~~~~~~~~~~~~~~~e~--~de~~~~~d~~a 449 (449) T protein:vir:10 406 LGSGDNPAFSREEIRTAAGYDNDDEEPLGEED--GDEEDKATDSAA 449 (449) T ss_pred HHccccCCcCHHHHHHHhcccCCCCCCCCCCC--CccccccCCcCC Confidence 6666 8999999999999885433221110 000011111111 No 139 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=99.04 E-value=4.1e-09 Score=66.57 Aligned_cols=376 Identities=10% Similarity=0.006 Sum_probs=189.8 Q ss_pred CchhhhhhcCC-c--------ccccccccccchhhcccccC-CceechhhhhccHHHHHHHHHHHHhhhhCceeeecch- Q lcl|NC_018285. 1 MPIFNLATESP-P--------NNQGGFFDITDPEFLATLNG-SEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ- 69 (383) Q Consensus 1 Mglf~~~~~~~-~--------~~~~~~~~~~~~~~~~~~~~-~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~- 69 (383) |.=-......- + ....++.........+-+.+ ...-..+..+..+.|.+|++.+-..|.+++|+|.-.. T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr~~~~~~ly~~m~~D~hi~s~l~~Rk~av~~~~w~v~p~~~ 80 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALRFPESIKTFQLMMRDPAVAASVNIIKMFVRKVNWRFVPPKG 80 (488) T ss_pred CCCccccCCCCCHHHHHHHHHHhhccccchhhccchhhhcccchHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCC Confidence 11100000000 0 00000000000000000000 0111244556788999999999999999999996221 Q ss_pred ---h-------hhhccC-CCccCCHHHHHHHHHHHHHHcCCeEEEEeecC-------------CCc--eeEEEEecccee Q lcl|NC_018285. 70 ---M-------QGIVDN-PSNSANRFNFYQSIFAQMLLGGEAFAYRWRND-------------NGR--DMKWEYLRPSQV 123 (383) Q Consensus 70 ---~-------~~l~~~-PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~-------------~g~--~~~l~~l~~~~v 123 (383) . ..+... -|-..++.++++.+. +.+.+|-+.+++++.. +|+ +..+.+.++.+. 
T Consensus 81 ~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~i~~Rpq~~~ 159 (488) T protein:vir:95 81 KEQDPKMLERADFFNSLMDDMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAKLPIRNQSTL 159 (488) T ss_pred CchhHHHHHHHHHHHHHHhccCccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCeeeeeeeeecCcccc Confidence 1 111111 122235667777775 5779999999998853 232 445555555322 Q ss_pred EE-EEcCCCceeE-EEEeec----------CcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 124 SF-NRLDNQNGLY-YNVTFD----------DPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLT 191 (383) Q Consensus 124 ~~-~~~~~~~~~~-y~~~~~----------~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 191 (383) +. ..+.++.... .+.... .+.......++....++.++....+..+|.+.+..+.-....-....++. T Consensus 160 ~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w 239 (488) T protein:vir:95 160 DKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPWKYKVQIEEYE 239 (488) T ss_pred cceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecCCCCccchhhHHHHHHHHHHHHHHHHHHH Confidence 11 1122222211 110000 00112334577777666666666677889999999988888778888888 Q ss_pred HHHHhccCCcceeEeecC----CCCHHHHHHHHHHHHHhhcC----CcceeecCCCceee---------ecccC-hhhHH Q lcl|NC_018285. 192 LNSLKNALNANGILKIKG----GGLLDFKTKVSRSRQAMKQM----QGGPLVLDDLEDFT---------PLEIK-SNVAQ 253 (383) Q Consensus 192 ~~~~~ng~~~~~i~~~~~----~~~~e~~~~~~~~~~~~~~~----~g~~~vl~~g~~~~---------~~~~~-~~d~~ 253 (383) ..+.+..+.|--+...+. ..+++++..+.+.......+ ....++++.|+... -++.. ..-.. 
T Consensus 240 ~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l~~~~~~~~~~ 319 (488) T protein:vir:95 240 AVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSLVSRQGAKAYD 319 (488) T ss_pred HHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeeccccccccchhhhhhhccccccCCchh Confidence 888887655555555532 23344455555554443322 22234566665432 22221 11223 Q ss_pred HHHHHHHHHHHHHHHhcCCHHHhcccc------cCcC-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcchh-hcc------ Q lcl|NC_018285. 254 LLKQADWTTGQFAKVYGIPENVVGGQG------DQQS-SLEMSSNVYSKAVARYLRPFLSELSQKLSCDV-DAD------ 319 (383) Q Consensus 254 ~~e~~~~~~~~Ia~~~gVpp~~lg~~~------~~~~-~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~-e~~------ 319 (383) +.+..++.-++|+.+. ||.+. .+++ ..+.........+.--++.|.+.||+.|++.+ .++ T Consensus 320 ~~~li~~~d~~Isk~i------LGqtLT~~~~~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~~Nfg~~~~ 393 (488) T protein:vir:95 320 TGSIIDRYSKQIMMAF------MSDVLAMGQSKYGSFSLADSKTSLLAMSVDILLKQIKNVINRDLVAQTYALNMWDDEE 393 (488) T ss_pred HHHHHHHHHHHHHHHH------hccccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC Confidence 5556666667777664 54311 1122 22334456677788889999999999877532 111 Q ss_pred -----chhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhhcCCcCCcchhHHhCC-CCCCCCC-CCCCCCC Q lcl|NC_018285. 320 -----IFPAVDPTGANYISRINSMVKSGTLAQ-----NQGLYILQQAEILPKELPKGENP-NRTILKG-GETNGQD 383 (383) Q Consensus 320 -----~~~~~~~~~~~~~~~~~~l~~~g~~t~-----nE~r~~lg~~~~~~~d~~~~~~~-~~~~~~g-gd~~~~d 383 (383) .......|...+++.+.+|+..|+.-. +.+|+.+|.+.-..++....... 
...+..| +....++ T Consensus 394 ~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 469 (488) T protein:vir:95 394 HVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADESQPVSEKLSPNSQSRSGDGYKTAGE 469 (488) T ss_pred ccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCccccccCCCCCCCCCCcccCCCcc Confidence 011112344567788889999998764 56899999886433332111111 1111111 0011111 No 140 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.97 E-value=5.5e-09 Score=65.87 Aligned_cols=347 Identities=8% Similarity=-0.044 Sum_probs=158.3 Q ss_pred hcccccC--CceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEE Q lcl|NC_018285. 26 FLATLNG--SEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAY 103 (383) Q Consensus 26 ~~~~~~~--~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~ 103 (383) +++.-.. ..... +.+ ...-..-+|+.+++.+---.|++-+........+-...-........+..+++.+|.||+. T Consensus 1 ~l~~~~~~~~~~~~-~~~-v~n~~~~ivd~~~~~l~~~gf~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~ 78 (434) T protein:vir:98 1 MLPKNAEQAFLDFQ-RKA-RTNFCGLIANASVHRLLALGVTGPDGEPDTRASRWWQANRLDSRQKLVWRMAMAQSAGYML 78 (434) T ss_pred CCCCCccHHHHHhh-hhh-hccchHHHHHHHHhhhccCceecCCCchHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEE Confidence 1110000 00000 000 0112233555555544333444433222211111111123344556778889999999999 Q ss_pred EeecCCCce------eEEEEeccceeEEEEcCCCceeEEEEee----cCccc--------------------c------- Q lcl|NC_018285. 104 RWRNDNGRD------MKWEYLRPSQVSFNRLDNQNGLYYNVTF----DDPRI--------------------P------- 146 (383) Q Consensus 104 i~r~~~g~~------~~l~~l~~~~v~~~~~~~~~~~~y~~~~----~~~~~--------------------~------- 146 (383) +.++.++.. ..+.+++|.++.+..+...+.+.+-+.+ .++.. . 
T Consensus 79 v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (434) T protein:vir:98 79 VGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYARVFFDDTSFPYRTRERTGARLPWGPD 158 (434) T ss_pred EecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEEEEEEeCcEEEEEEeeccccccccccc Confidence 987655432 2366788888877665432211111000 00000 0 Q ss_pred -----------cceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC--CCCH Q lcl|NC_018285. 147 -----------PKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG--GGLL 213 (383) Q Consensus 147 -----------~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~--~~~~ 213 (383) ....+..-.|+||.+....+. .|.|-+......++....+.-......+-.+.|..+++... ...+ T Consensus 159 ~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~-~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~ 237 (434) T protein:vir:98 159 SWVYTGTADSGDVHDLGGMQLVEFARMPDLGE-DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAKRTD 237 (434) T ss_pred cceecccccccccCCCCccceEEeccCCCcCc-CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccc Confidence 000122223566654432222 58888877777777766666555555555566665554211 1111 Q ss_pred HHHHHHHHHHHHhhcCCcceeecC-CCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHH Q lcl|NC_018285. 214 DFKTKVSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNV 292 (383) Q Consensus 214 e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~ 292 (383) + .......+.......+++++++ ++.++.++..+.-+ .+++..+..+.+|+..=++|++.+|+..++.+.+... 
+ T Consensus 238 ~-~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~Sg~Al~--~ 313 (434) T protein:vir:98 238 P-ATGMTVVDQPFVPSPSAVWASEGENTQFGQLDATDLS-GFLKEHASDVRDMLTISQTPTYLYATDLVNISADTIG--A 313 (434) T ss_pred c-ccccchhhhhhhccccccccCCCCCceEEEecCcchH-HHHHHHHHHHHHHhcccCCCHHHhccccCChHHHHHH--H Confidence 1 1111112222223334555554 45777666554333 3777788889999999999999999754444333222 2 Q ss_pred HHHHHHHHH----HHHHHHHHHhh--cch----------hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcC Q lcl|NC_018285. 293 YSKAVARYL----RPFLSELSQKL--SCD----------VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQA 356 (383) Q Consensus 293 ~~~~l~P~~----~~i~~~l~~~l--~~~----------~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~ 356 (383) ....+.-.+ +.+.+.|.+-+ ... ++.......-.+..+.+..+.++...|+ +...+++.+|.. T Consensus 314 ~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g~-~~e~~~~~lg~~ 392 (434) T protein:vir:98 314 LDILHVAKVREHIASFSEGLESVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVKADAATKLKSIGY-PLDVIAEELDES 392 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhheeeeEEecCCCCCCHHHHHHHHHHHHhcCC-cHHHHHHhCCCC Confidence 122222222 22222222211 000 1111112223455677777888887775 677777777654 Q ss_pred CcCCcchhHHh-------------C-CCCCCCCCCCCCCCC Q lcl|NC_018285. 357 EILPKELPKGE-------------N-PNRTILKGGETNGQD 383 (383) Q Consensus 357 ~~~~~d~~~~~-------------~-~~~~~~~ggd~~~~d 383 (383) + .++.+.+ . 
....+..|.+.++++ T Consensus 393 ~---~e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 430 (434) T protein:vir:98 393 P---ARVRRIVAGAASQALLAASLLPAPGAPSAGNVPDSGG 430 (434) T ss_pred H---HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCcccC Confidence 3 3332211 0 111122222322222 No 141 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.87 E-value=9.4e-09 Score=64.60 Aligned_cols=370 Identities=8% Similarity=-0.024 Sum_probs=157.9 Q ss_pred Cchhhhhhc----CC--cccccccccccchhhcccccCCceechhh---hhccHHHHHHHHHHHHhhhhCceeeecch-h Q lcl|NC_018285. 1 MPIFNLATE----SP--PNNQGGFFDITDPEFLATLNGSEWVSAET---ALKNSDLFSIISQLSNDLATAKLTTSRKQ-M 70 (383) Q Consensus 1 Mglf~~~~~----~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---a~~~~~v~~~i~~ia~~ia~~p~~~~~~~-~ 70 (383) ..+..++.+ .. ......+...... .... +..+..+. ...+.-..-+|+.++..+--..+.+-+.. . T Consensus 14 ~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~--i~~~--~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~~~ 89 (484) T protein:vir:77 14 EKAREEMLNLFTERTQDLGDNTAYYESERR--PDAV--GVTVPQQMQKLLAHVGYPRLYIDAIAARQELEGFRLGGADKA 89 (484) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccc--chhc--ccccchhHHhhhhhcCcHHHHHHHHHhhhccCceecCCcchh Confidence 111111110 00 0000000000000 0000 00011100 01111223345545443333344443321 1 Q ss_pred hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCce-------eEEEEeccceeEEEEcCCCceeEE--EEe-- Q lcl|NC_018285. 71 QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRD-------MKWEYLRPSQVSFNRLDNQNGLYY--NVT-- 139 (383) Q Consensus 71 ~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~-------~~l~~l~~~~v~~~~~~~~~~~~y--~~~-- 139 (383) .....+-...-........+..+++.+|.||+.+.++.+|.+ ..+.+++|..+....+.....+.+ ++. 
T Consensus 90 ~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~a~~~~~~ 169 (484) T protein:vir:77 90 DEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYAQIDPRTRQVMRAIRAIED 169 (484) T ss_pred HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEEEecCCCCceEEEEEEEEe Confidence 111111222223455667888999999999999998887754 246778888887666543222111 110 Q ss_pred ---------------------ecCcccc--cce--eecccceEEeccCCCCccccCcchHH----HHHHHHHHHHHHHHH Q lcl|NC_018285. 140 ---------------------FDDPRIP--PKQ--HVPQSDILHFRLLSVDGGLTSVSPLM----ALGRELDIQKASDKL 190 (383) Q Consensus 140 ---------------------~~~~~~~--~~~--~~~~~dvih~~~~~~~~~~~G~s~~~----~~~~~i~~~~~~~~~ 190 (383) ...+... ... .+..-.|++|.+....+...|.|.+. .+.+.+.....-... T Consensus 170 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~ 249 (484) T protein:vir:77 170 EEGNEVIGATLYLPNNTVIWNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQA 249 (484) T ss_pred ecCCcEEEEEEEecCeEEEEEecCCceEeeccccCCCCCcceEEeccccccCccCCcccchHHHHHHHHHHHHHHHHHHH Confidence 0110000 000 11122356666544445566877654 333333333222222 Q ss_pred HHHHHhccCCcceeEeecCCCCHHHH--HHHHHHHHHhhcCCcceeecC-CCceeeecccChhhHHHHHHHHHHHHHHHH Q lcl|NC_018285. 191 TLNSLKNALNANGILKIKGGGLLDFK--TKVSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVAQLLKQADWTTGQFAK 267 (383) Q Consensus 191 ~~~~~~ng~~~~~i~~~~~~~~~e~~--~~~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~ 267 (383) ...++ +.|..++..- ...+... ..-...+. ...++++.++ ++.++.++..+.-+ .+++..+....+|+. 
T Consensus 250 ~~~~~---a~p~~~i~G~-~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~q~~~~~~e-~~~~~l~~~i~~~s~ 321 (484) T protein:vir:77 250 TAELM---GVPQRLLFGV-KGEELGVDPETGQTLFD---AYLARILAFEDHESKAQQFSAAELR-NFVDALDALDRKAAA 321 (484) T ss_pred HHHhh---hhhHHHHhCC-Ccchhcccccccchhhh---hhhhhhcccCCCCceeEeecCCChH-HHHHHHHHHHHHHhc Confidence 33333 3444444321 1111000 00011111 1234555555 46788776655433 377888888899999 Q ss_pred HhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhh------cc------h-------hhccchhhhccCH Q lcl|NC_018285. 268 VYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKL------SC------D-------VDADIFPAVDPTG 328 (383) Q Consensus 268 ~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l------~~------~-------~e~~~~~~~~~~~ 328 (383) .=++|++.+|+...+..+.++.+. ....+.-.++..+..|...| +. . ++.......-.+. T Consensus 322 ~~~~p~~~fg~~~~n~~Sg~Al~~-~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~ 400 (484) T protein:vir:77 322 YTGLPPYYLSFSSENPASAEAIRS-SESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGGDIPPEYYRMESIWRDPSTPTY 400 (484) T ss_pred ccCCCHHHhccccCcchHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccccccceEEecCCCCCCH Confidence 999999999976554333333321 11111111222112121111 00 0 1111111223455 Q ss_pred HHHHHHHHHHHhCC--CcCHHHHHHHhhcCCcCCcchhHHh------------C-CCCCCCCCCCCCCCC Q lcl|NC_018285. 329 ANYISRINSMVKSG--TLAQNQGLYILQQAEILPKELPKGE------------N-PNRTILKGGETNGQD 383 (383) Q Consensus 329 ~~~~~~~~~l~~~g--~~t~nE~r~~lg~~~~~~~d~~~~~------------~-~~~~~~~ggd~~~~d 383 (383) .+.+..+.+++++| +++..-+++++|..+-+-.++.+.. . 
....+..||+.+.+| T Consensus 401 ~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (484) T protein:vir:77 401 AAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPE 470 (484) T ss_pred HHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCC Confidence 66777888888865 8899888888865432112221110 0 011123344444444 No 142 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=98.73 E-value=4.9e-08 Score=60.67 Aligned_cols=373 Identities=15% Similarity=0.104 Sum_probs=196.8 Q ss_pred CchhhhhhcCCcccccc----cccccch---------hhcccccCCceechhhhh----ccHHHHHHHHHHHHhhhhCce Q lcl|NC_018285. 1 MPIFNLATESPPNNQGG----FFDITDP---------EFLATLNGSEWVSAETAL----KNSDLFSIISQLSNDLATAKL 63 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~----~~~~~~~---------~~~~~~~~~~~~~~~~a~----~~~~v~~~i~~ia~~ia~~p~ 63 (383) |.|++= +..|+.++.. ...++.. .+.-...++..-....|. .+|.++-.+..|+++|+++.+ T Consensus 1 ~~~~rP-k~~p~~p~~~~~arrr~LtaAsa~l~~~~~~~~kt~~~~~~~WQ~eAW~~~d~vpELry~vgW~~~a~SR~rL 79 (646) T protein:vir:10 1 MALLKP-KSAPPEPFGAEVARRIALAGATAQVDLGASSSWKTWKFGNKDWQTEGWRLYDIIPEHHFLAGRIGDSVAQARL 79 (646) T ss_pred CcccCC-CCCCCCcccccccchhhhhhccccccCCCcceeecCCCcchhhhHHHHHHHhhhhhHhhHhhhhhhhhceeee Confidence 888653 2333322200 0011100 000111111111222222 246778888899999999999 Q ss_pred eeecchhhh-------------hccCCC-ccCCHHHHHHHHHHHHHHcCCeEEEE----eecCCCceeEEEEeccceeEE Q lcl|NC_018285. 64 TTSRKQMQG-------------IVDNPS-NSANRFNFYQSIFAQMLLGGEAFAYR----WRNDNGRDMKWEYLRPSQVSF 125 (383) Q Consensus 64 ~~~~~~~~~-------------l~~~PN-~~~t~~~f~~~~~~~~~l~G~a~~~i----~r~~~g~~~~l~~l~~~~v~~ 125 (383) ..-+-+... +...+= --....++++.+..++-+-|++|+.. .-..+++ -..+++-.+.|.. 
T Consensus 80 ~aseiddtG~~tg~v~~~~v~~iv~~~~Gg~~gQ~qlLkr~~~~ltV~GE~wiv~~~~~~~~~~~~-~~W~vvt~~Ev~~ 158 (646) T protein:vir:10 80 YVTEVDDTGEETGEVQDERIKRLAAVPLGTGSQRDDNLRLAGLDLAVGGECWIVGEGAATSPEAAE-GSWFVVTGSAISR 158 (646) T ss_pred eeeeecCCCCCcCccchHHHHHHhhhhccchhhHHHHHHHHHhheecccceEEeeccccCCCCCCc-cceeeecHHHhcc Confidence 876543211 111111 11234678999999999999999874 1112221 2233444444421 Q ss_pred EEcCCCceeEEEEeecCc-ccccceeecccceEEec--cCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcc Q lcl|NC_018285. 126 NRLDNQNGLYYNVTFDDP-RIPPKQHVPQSDILHFR--LLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNAN 202 (383) Q Consensus 126 ~~~~~~~~~~y~~~~~~~-~~~~~~~~~~~dvih~~--~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~ 202 (383) . +. .+.+..... .+.....++..+++ || .+++.......||+.+++..+.-.....+...+..+.-..-. T Consensus 159 --t--g~--~~~i~~p~~~~g~~~v~~~~~d~l-vRiW~P~Prr~~epDSpvra~l~~l~Ei~~lt~~I~aaakSRL~Gn 231 (646) T protein:vir:10 159 --T--GD--EIAVRRPQQRGGSKLVLVDGQDIL-IRCWRPHPNDTDQADSFTRSAIVPLREIELLTKREFAELDSRLTGA 231 (646) T ss_pred --C--CC--eeeeecCccCCCCCcceecCCceE-EEEecCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHHHHhcC Confidence 1 11 223322221 12334456666764 45 455666678899999999999999999999888888888888 Q ss_pred eeEeecCCCC-------HHHHHHHHHHHH----Hhhc-----CCcceeecCC-Cc------eeeecccChh-hHHHHHHH Q lcl|NC_018285. 203 GILKIKGGGL-------LDFKTKVSRSRQ----AMKQ-----MQGGPLVLDD-LE------DFTPLEIKSN-VAQLLKQA 258 (383) Q Consensus 203 ~i~~~~~~~~-------~e~~~~~~~~~~----~~~~-----~~g~~~vl~~-g~------~~~~~~~~~~-d~~~~e~~ 258 (383) |++-+|..++ +.....+...+. .... .+--|+++.. |. +++.+++... 
+.--+.++ T Consensus 232 GvLfvP~e~s~p~~~~~~a~~~~l~~~l~qaa~tAi~De~S~aA~vPiia~~P~E~i~~~~~ik~l~f~~eite~aiktR 311 (646) T protein:vir:10 232 GIMFLPEGVDFPRGEEDPAGLAGFMAYLQRAAAASMADQSRASAMVPIMATIPNEMMEHLDKIKPLTFWSELSAEITPMK 311 (646) T ss_pred ceeeeccccccCCCCCCCcchhHHHHHHHHHHHhhhcCCCCccceeeeEEeeChHHHhhhhcceeeccCchhhHHHhhhH Confidence 8888775432 122233333332 2222 2223455432 11 3344444322 22347899 Q ss_pred HHHHHHHHHHhcCCHHHhcccccCcC--HHHHHHHHHHHHHHHHHHHHHHHHHHhhcchh----hc-c---chhhhccCH Q lcl|NC_018285. 259 DWTTGQFAKVYGIPENVVGGQGDQQS--SLEMSSNVYSKAVARYLRPFLSELSQKLSCDV----DA-D---IFPAVDPTG 328 (383) Q Consensus 259 ~~~~~~Ia~~~gVpp~~lg~~~~~~~--~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~~----e~-~---~~~~~~~~~ 328 (383) +..+..+|..+-|||+.|-+.++.+- .-+-...-+. .|.|.+..|.++|++.++... .+ | ....++.+. T Consensus 312 ~daI~RlA~glDIppE~LLGlgd~NHWtAWqI~de~vr-HI~P~l~~ic~AlT~~~Lrp~Le~eGi~dp~kyvvW~DaS~ 390 (646) T protein:vir:10 312 DKAIARLASSAEIPGEVLTGIGDANHWTAWLISDEGIR-WIRGYLGLIADALTRGFLRRALESMGVTNPERYAFAFDTST 390 (646) T ss_pred HHHHHHHHhccCCchhheeeccccceeeeeeeccccch-hhhhHHHHHHHHHHhhHHHHHHHHcCCCChhHeEEeecCcc Confidence 99999999999999998865544221 1111122233 599999999999999877431 11 1 111122221 Q ss_pred H----HHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcch---------------------hHH---------h--CCCCC Q lcl|NC_018285. 329 A----NYISRINSMVKSGTLAQNQGLYILQQAEILPKEL---------------------PKG---------E--NPNRT 372 (383) Q Consensus 329 ~----~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~---------------------~~~---------~--~~~~~ 372 (383) . .+.+....+.+.|.+|-...|+.+|+.--+.... +.. . .++.. T Consensus 391 Lt~~pd~~deA~qa~drGAIt~eAlrk~~Gf~~dd~pt~~E~~~~~~~~~v~~~P~Lil~P~~qa~~~~P~~~~~~lpp~ 470 (646) T protein:vir:10 391 LASKPNRLDEAIQLHERNLIKDEEVVKAGAFSVDQMPTVQERAVQILLGLVKTQPDLILDPAIQAALGLPAVQSVGLPPT 470 (646) T ss_pred cccCCCCcHHHHHHHHcCCccHHHHHHHhcccccccCChHHHHHHHHHHHhcCCccccccchhhccccCCCcCccccCCc Confidence 1 1224455678899999999999988653211110 011 0 01111 Q ss_pred CCCCCC---CCCCC Q lcl|NC_018285. 
373 ILKGGE---TNGQD 383 (383) Q Consensus 373 ~~~ggd---~~~~d 383 (383) ..+.+| +++++ T Consensus 471 ~~~~~dg~~~~~e~ 484 (646) T protein:vir:10 471 AAQRTDGDLDDDES 484 (646) T ss_pred ccccccCCCCChhh Confidence 111112 11111 No 143 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=98.73 E-value=4.3e-08 Score=61.00 Aligned_cols=379 Identities=13% Similarity=0.151 Sum_probs=181.3 Q ss_pred Cchhhhhhc---------------CCcccccc-ccc--ccch-hh---cccccCCcee--------chhhhhccHHHHHH Q lcl|NC_018285. 1 MPIFNLATE---------------SPPNNQGG-FFD--ITDP-EF---LATLNGSEWV--------SAETALKNSDLFSI 50 (383) Q Consensus 1 Mglf~~~~~---------------~~~~~~~~-~~~--~~~~-~~---~~~~~~~~~~--------~~~~a~~~~~v~~~ 50 (383) |++|.+.-. .|....+. ... ...+ +. .+..-++..- ..+.|+.+|.|..| T Consensus 4 ~~~w~~~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~~~pEVd~A 83 (533) T protein:vir:58 4 LEKYKKLNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDYTDPLISTV 83 (533) T ss_pred cchhhhhhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhccCcchhhH Confidence 444443211 11111111 111 1111 00 0111112111 23445788999999 Q ss_pred HHHHHHhhhhC-----ceeeecchh---hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeec-CCCceeEEEEeccc Q lcl|NC_018285. 51 ISQLSNDLATA-----KLTTSRKQM---QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRN-DNGRDMKWEYLRPS 121 (383) Q Consensus 51 i~~ia~~ia~~-----p~~~~~~~~---~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~-~~g~~~~l~~l~~~ 121 (383) |+.|++.+.-+ |+.+.-.+. ..+..+-...++...--+..+..|++.|..|..++.+ .++-+.+|.+|+|. 
T Consensus 84 ideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~lldf~~~~~~~fR~WYVDGriy~Hkiik~~k~GI~elr~lDPr 163 (533) T protein:vir:58 84 LDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVINIEKNAYPIIRNMIKYGDMFLHILEKGSDGTIEKFQVVSPY 163 (533) T ss_pred HHhhhceeeEecCCCceeEeecccccccHHHHHHHHHHhcchhhhhHHHHhhhhcceeEEEeccCCcccchhhheecCCe Confidence 99999886532 333321111 0111111123333333346678889999999988743 44567899999999 Q ss_pred eeEEEEcCCCceeEEEEee---cCcccccceeecccceEEeccCCCC-ccccCcchHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_018285. 122 QVSFNRLDNQNGLYYNVTF---DDPRIPPKQHVPQSDILHFRLLSVD-GGLTSVSPLMALGRELDIQKASDKLTLNSLKN 197 (383) Q Consensus 122 ~v~~~~~~~~~~~~y~~~~---~~~~~~~~~~~~~~dvih~~~~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 197 (383) .|+.++.......+|-+.. ..+.....+.++.+.|+|+..-..+ ...+++|-|..+.+.+....-++....-+.-. T Consensus 164 ~i~~vr~~~t~~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~NQLkmiEDAlVIYRis 243 (533) T protein:vir:58 164 IFSKRYNPETDTWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWNQLRLMEDALMLYRVV 243 (533) T ss_pred eeEEEEeeccceEEEeecccccccccCccccccchhheeeeeeccccCCCCceehhhhHHHHHHHHHHHHHHHHHHHhhc Confidence 9998887655555554431 2233344577889999999754222 34568899999988888877777776655544 Q ss_pred cCCcceeEeec-CCC-CHHHHHHHHHH---HHH--hhcCC-cce------e----ec----------CCCceeeecccCh Q lcl|NC_018285. 198 ALNANGILKIK-GGG-LLDFKTKVSRS---RQA--MKQMQ-GGP------L----VL----------DDLEDFTPLEIKS 249 (383) Q Consensus 198 g~~~~~i~~~~-~~~-~~e~~~~~~~~---~~~--~~~~~-g~~------~----vl----------~~g~~~~~~~~~~ 249 (383) -+.-+-++-.+ +.+ ...+.+.++.. +++ .++++ |.+ + .+ +.|.+++.+... 
T Consensus 244 RAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpGg- 322 (533) T protein:vir:58 244 RSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQGS- 322 (533) T ss_pred CChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecCCC- Confidence 44333343332 232 22232222221 111 11222 222 1 11 235777777653 Q ss_pred hhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHH--HHHHHHHHHHHHHHHHHHHHHhhcch-----hhccchh Q lcl|NC_018285. 250 NVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMS--SNVYSKAVARYLRPFLSELSQKLSCD-----VDADIFP 322 (383) Q Consensus 250 ~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~--~~~~~~~l~P~~~~i~~~l~~~l~~~-----~e~~~~~ 322 (383) . +.-++-..+..+.+..+++||.+.|+.........+-. +.=+...|.-+-..+.+.|.+.|... -++.. T Consensus 323 ~-lgemeDV~YF~kkLy~ALnVP~sRl~~e~~fgr~~eItRDEiKF~KFI~rLR~rF~~ll~~qLilk~iit~eew~~-- 399 (533) T protein:vir:58 323 K-VDLAEDVEYMLNRLISALKVPKAFIGYEGDVNAKNTLATQDIKFNNTIKRIQGFFVEELERMVRMNKEFADQDFRL-- 399 (533) T ss_pred C-CCcHHHHHHHHHHHHHHhCCCeeecCCCCCCccchhhhHHHHHHHHHHHHHHHHHHHHHhcccccccCcchhheee-- Confidence 2 44456667889999999999999998543322222221 22245566677777778887776532 11111 Q ss_pred hhccCH--------HHHHHHHH----------------HHHh--CCCcCHHHHHHHhhcCCcCC-cc----hh----HHh Q lcl|NC_018285. 323 AVDPTG--------ANYISRIN----------------SMVK--SGTLAQNQGLYILQQAEILP-KE----LP----KGE 367 (383) Q Consensus 323 ~~~~~~--------~~~~~~~~----------------~l~~--~g~~t~nE~r~~lg~~~~~~-~d----~~----~~~ 367 (383) .+..|. .-....++ ..++ -.+++..|.-+..+..|+-+ ++ +. ..+ T Consensus 400 ~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~tdei~~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~ 479 (533) T protein:vir:58 400 VMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIPYDLKPQEEVAEAAGGGGLFDTGGFGEETTPADFLGE 479 (533) T ss_pred eeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCChhhhHHHHHHHHhhcCCCCCCCCcccccCCcccCcc Confidence 011110 00111111 1111 12333333333333333211 11 00 000 Q ss_pred CCCCCCCC-----------CCCCCCCC Q lcl|NC_018285. 
368 NPNRTILK-----------GGETNGQD 383 (383) Q Consensus 368 ~~~~~~~~-----------ggd~~~~d 383 (383) ...+...+ +++..+.+ T Consensus 480 ~~~p~~~~~~~~~~~~~~~~~~~~~~~ 506 (533) T protein:vir:58 480 RGSPIESPRGRTEFDFGTEGGEELGGE 506 (533) T ss_pred ccCcccCCCChhhHhcccCCccccccc Confidence 00000000 00000000 No 144 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.64 E-value=1.6e-07 Score=57.87 Aligned_cols=368 Identities=10% Similarity=-0.003 Sum_probs=145.6 Q ss_pred CchhhhhhcCCcccc------cccccccchhhcccccCCceechhhh-h--ccHHHHHHHHHHHHhhhhCceeeecchhh Q lcl|NC_018285. 1 MPIFNLATESPPNNQ------GGFFDITDPEFLATLNGSEWVSAETA-L--KNSDLFSIISQLSNDLATAKLTTSRKQMQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~a-~--~~~~v~~~i~~ia~~ia~~p~~~~~~~~~ 71 (383) +-+++.+.+...... ..+...... ...+ +..+..+-. + ...-..-+|+.+|+.+---.|.+-+.... T Consensus 23 ~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~--i~~~--~~~~p~~~~~~~~v~n~~~~iVd~~a~rl~~~Gf~~~d~~~~ 98 (504) T protein:vir:99 23 VDKVNGLYQQLVDRTPRNLLRASFYDGKYA--IRQI--GNLIPPEYLRTATVLGWSAKAVDTLARRCNLESFVWPDGDYG 98 (504) T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHhcccc--chhc--cccccHHHHHHhhccCcHHHHHHHHHhhhccceeeCCCCChh Confidence 222222221111000 000000000 0000 000111100 0 01111224455554332223333221111 Q ss_pred -hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCcee-EEEEeccceeEEEEcCCCceeEEEE--ee--cCccc Q lcl|NC_018285. 72 -GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDM-KWEYLRPSQVSFNRLDNQNGLYYNV--TF--DDPRI 145 (383) Q Consensus 72 -~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~-~l~~l~~~~v~~~~~~~~~~~~y~~--~~--~~~~~ 145 (383) .-+.+-...-+.......+..+.+++|.||+.|..+.+|.+. .+.+++|.++....++....+.+-+ .. .++.. 
T Consensus 99 ~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~iyD~~~~~~~~a~~~~~~d~~g~~ 178 (504) T protein:vir:99 99 SIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATGEWNSRRNAMDSLLSITSRDAEGHP 178 (504) T ss_pred hHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeEEEEeCCCCceeEEEEEEEecCCCeE Confidence 111111111223345667888999999999999998888764 5678899988877765333222111 10 11000 Q ss_pred ccceeecccc------------------------eEEeccCCCCccccCcchH----HHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_018285. 146 PPKQHVPQSD------------------------ILHFRLLSVDGGLTSVSPL----MALGRELDIQKASDKLTLNSLKN 197 (383) Q Consensus 146 ~~~~~~~~~d------------------------vih~~~~~~~~~~~G~s~~----~~~~~~i~~~~~~~~~~~~~~~n 197 (383) .....+.++. |++|.+....+..+|.|.+ ..+.+.+.....-......+|.. T Consensus 179 ~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~gvPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~ 258 (504) T protein:vir:99 179 TGIALYEDGVTVTADMDDDGDWHADVRTHKLGVPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSF 258 (504) T ss_pred EEEEEEcCCcEEEEEEcCCceeeeccccCCCCcceEEecccccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcc Confidence 0011122222 4444433222344576643 33444443333333333344333 Q ss_pred cCCcceeEee-c-CCCCHHHHHHHHHHHHHhhcCCcceeecC----------CCceeeecccChhhHHHHHHHHHHHHHH Q lcl|NC_018285. 198 ALNANGILKI-K-GGGLLDFKTKVSRSRQAMKQMQGGPLVLD----------DLEDFTPLEIKSNVAQLLKQADWTTGQF 265 (383) Q Consensus 198 g~~~~~i~~~-~-~~~~~e~~~~~~~~~~~~~~~~g~~~vl~----------~g~~~~~~~~~~~d~~~~e~~~~~~~~I 265 (383) |...+.. . .....+... ....|... 
.++++.++ ...++-++....-+ .|++..+....+| T Consensus 259 ---p~r~i~G~~~~~~~~~d~~-~~~~~~~~---~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~-~~~~~l~~~i~~~ 330 (504) T protein:vir:99 259 ---PQLILLGADAKNFRNKDGS-MKPAWQIA---LARVFALPDDEDEPDAARARADVKQFPASSPQ-PHIEMLEQIAMMF 330 (504) T ss_pred ---hhhhhccCCcccccccccc-ccchhhhh---hhhhhcCCCccccccccCccceeeecCCCChH-HHHHHHHHHHHHH Confidence 3333321 1 111111100 00111110 11222222 23555555444322 4788899999999 Q ss_pred HHHhcCCHHHhccccc-CcCHHHHHHH---HHHHHHHHHHHHHHHHHHHhh--cchh--------------hccchhhhc Q lcl|NC_018285. 266 AKVYGIPENVVGGQGD-QQSSLEMSSN---VYSKAVARYLRPFLSELSQKL--SCDV--------------DADIFPAVD 325 (383) Q Consensus 266 a~~~gVpp~~lg~~~~-~~~~~e~~~~---~~~~~l~P~~~~i~~~l~~~l--~~~~--------------e~~~~~~~~ 325 (383) +..=++|++.||..+. +..+.+..+. =+...+.-..+.+.+.|.+.+ .-.+ +.......- T Consensus 331 a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~ 410 (504) T protein:vir:99 331 SGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKFRSPLY 410 (504) T ss_pred HhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEecCCCc Confidence 9999999999996443 3333333321 111122222333333332211 1011 011111222 Q ss_pred cCHHHHHHHHHHHHhCCCcC--H-HHHHHHhhcCCcCCcchhHHh-------------CC-CCCCCCCCCCCCCC Q lcl|NC_018285. 326 PTGANYISRINSMVKSGTLA--Q-NQGLYILQQAEILPKELPKGE-------------NP-NRTILKGGETNGQD 383 (383) Q Consensus 326 ~~~~~~~~~~~~l~~~g~~t--~-nE~r~~lg~~~~~~~d~~~~~-------------~~-~~~~~~ggd~~~~d 383 (383) .+..+.+..+.+++..|... . .-.++++|. 
++.++.+.+ .+ ...+.++++..+++ T Consensus 411 ~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~---~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 482 (504) T protein:vir:99 411 LSKAAQADAGAKMLGAGPEWLKETEVGLELLGL---TPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQD 482 (504) T ss_pred cCHHHHHHHHHHHHhhccccccchHHHHhhcCC---CHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCC Confidence 44556677788888877532 2 334444544 333332111 01 11122233322222 No 145 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=98.64 E-value=1.1e-08 Score=64.16 Aligned_cols=371 Identities=13% Similarity=0.112 Sum_probs=195.2 Q ss_pred Cc-----hhhhhhcCCccccccc-----ccccchhhcccccCCceechhh-----h---hc-cHHHHHHHHHHHHhhhhC Q lcl|NC_018285. 1 MP-----IFNLATESPPNNQGGF-----FDITDPEFLATLNGSEWVSAET-----A---LK-NSDLFSIISQLSNDLATA 61 (383) Q Consensus 1 Mg-----lf~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~-----a---~~-~~~v~~~i~~ia~~ia~~ 61 (383) |. +-.+-+..|+..+... .-+.++..... ....++.+. | +. ++.++-.+..|+++|+++ T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~--ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~ 78 (629) T protein:vir:86 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFR--KAMGSSTRTDWQEDAWKAYDAVGELRYYVGWRSSSASRV 78 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhccccccchhh--hhcCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhcee Confidence 43 2222222222211000 01111111110 001122222 2 22 667788899999999999 Q ss_pred ceeeecchhh-----hhccCCCcc---------------CCHHHHHHHHHHHHHHcCCeEEEEeecC------CCceeEE Q lcl|NC_018285. 62 KLTTSRKQMQ-----GIVDNPSNS---------------ANRFNFYQSIFAQMLLGGEAFAYRWRND------NGRDMKW 115 (383) Q Consensus 62 p~~~~~~~~~-----~l~~~PN~~---------------~t~~~f~~~~~~~~~l~G~a~~~i~r~~------~g~~~~l 115 (383) .+..-+-+.+ .-.+.|++. +...++++.+..++-+-|++|+.+.... 
++.++.- T Consensus 79 rL~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~e 158 (629) T protein:vir:86 79 RLIASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVPE 158 (629) T ss_pred eeEeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchhh Confidence 9987653321 112233333 3457899999999999999999987322 3334444 Q ss_pred EE-eccceeEEEEcCCCceeEEEEeecCcccccceeecccceEEec--cCCCCccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 116 EY-LRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDILHFR--LLSVDGGLTSVSPLMALGRELDIQKASDKLTL 192 (383) Q Consensus 116 ~~-l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~--~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 192 (383) |+ |-++-|.- ..++ . -+.. +.+......+..+++ +| .+++....+..||+.+++..+.-.....+... T Consensus 159 W~~vt~~ei~~---~~~~-~--~i~l--P~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~ 229 (629) T protein:vir:86 159 WLALTPEEVRA---SEKK-T--IIEL--PTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIA 229 (629) T ss_pred heeechHHhhh---ccCc-e--eeEc--CCCCcceeeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHH Confidence 43 33333321 1111 1 1111 122233344555555 55 55666667889999999999998888888888 Q ss_pred HHHhccCCcceeEeecCCCC----------------------HHHHHHHHHHHHH----hhc-----CCcceeecC---- Q lcl|NC_018285. 193 NSLKNALNANGILKIKGGGL----------------------LDFKTKVSRSRQA----MKQ-----MQGGPLVLD---- 237 (383) Q Consensus 193 ~~~~ng~~~~~i~~~~~~~~----------------------~e~~~~~~~~~~~----~~~-----~~g~~~vl~---- 237 (383) +..+.-..-.|++-++..++ .....++...+.+ ... .+--|+++. 
T Consensus 230 aaakSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E 309 (629) T protein:vir:86 230 NASKSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGE 309 (629) T ss_pred HHHHHHHhhCceeeeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechH Confidence 87777777777765543211 1133444444432 222 222344442 Q ss_pred --CCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCH---HHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 238 --DLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSS---LEMSSNVYSKAVARYLRPFLSELSQKL 312 (383) Q Consensus 238 --~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~---~e~~~~~~~~~l~P~~~~i~~~l~~~l 312 (383) ++++.-.+.....+ --+.+++..+..+|..+.|||+.|-+.++.+|- -+-...-+...|.|.+..|.++|++.+ T Consensus 310 ~i~~i~hlkf~~ei~e-~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~ 388 (629) T protein:vir:86 310 LIKNVTHLKFDNQVTE-VAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQV 388 (629) T ss_pred HhcCeeEEeecCchhH-HHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhH Confidence 23344444333333 347899999999999999999988655433331 111123345679999999999999987 Q ss_pred cchh----hccc---hhhhccCHH----HHHHHHHHHHhCCCcCHHHHHHHhhcCC---c--CCcc-------------- Q lcl|NC_018285. 313 SCDV----DADI---FPAVDPTGA----NYISRINSMVKSGTLAQNQGLYILQQAE---I--LPKE-------------- 362 (383) Q Consensus 313 ~~~~----e~~~---~~~~~~~~~----~~~~~~~~l~~~g~~t~nE~r~~lg~~~---~--~~~d-------------- 362 (383) +... .+|- ...++.+.. .+.+....+.+.|.+|-...|+.+|... + +..| T Consensus 389 Lrp~Le~eGiDp~kYvvW~DaS~Lt~dPd~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P 468 (629) T protein:vir:86 389 LRTVLMREGIDPNAYVVWHDASQLTVDPDKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDP 468 (629) T ss_pred HHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCcCHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCc Confidence 7431 1111 111121111 1223455678899999999999998754 2 2222 Q ss_pred -hh--------HHhC--CC-----CCCC--CCCC------CCCCC Q lcl|NC_018285. 
363 -LP--------KGEN--PN-----RTIL--KGGE------TNGQD 383 (383) Q Consensus 363 -~~--------~~~~--~~-----~~~~--~ggd------~~~~d 383 (383) +. ..-. ++ .+|. .+|+ .+++| T Consensus 469 ~Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE~sga~~~~e 513 (629) T protein:vir:86 469 NLLPTLAVLIPELADVEFPTPTVALPPAEEQDGDEEASGASRREE 513 (629) T ss_pred chhhhhhhhhhhhcccccCccCCCCCccccCCCcccccCCCcCCC Confidence 00 0000 00 0011 1122 22222 No 146 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=98.63 E-value=1.7e-07 Score=57.66 Aligned_cols=369 Identities=9% Similarity=-0.018 Sum_probs=152.5 Q ss_pred Cc-------------hhhhhhcCCcc--cccccccccchhhcccccCCceechhh---hhccHHHHHHHHHHHHhhhhCc Q lcl|NC_018285. 1 MP-------------IFNLATESPPN--NQGGFFDITDPEFLATLNGSEWVSAET---ALKNSDLFSIISQLSNDLATAK 62 (383) Q Consensus 1 Mg-------------lf~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~---a~~~~~v~~~i~~ia~~ia~~p 62 (383) +| |.+....+.+. ....+...... .... +..+..+. ...+.-...+|+..+.-+.-.+ T Consensus 6 ~~~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~--i~~~--~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g 81 (485) T protein:vir:24 6 PGQEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAERR--PEAI--GVTVPVQMQSLLAHVGYPRLYVDSIAERQAVEG 81 (485) T ss_pred CCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc--hhhc--CcccchhhhhhhhccchHHHHHHHHhhhhccCc Confidence 11 11111111000 00001000000 0000 00011100 0011122334444444443345 Q ss_pred eeeecch-----hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCce-------eEEEEeccceeEEEEcCC Q lcl|NC_018285. 63 LTTSRKQ-----MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRD-------MKWEYLRPSQVSFNRLDN 130 (383) Q Consensus 63 ~~~~~~~-----~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~-------~~l~~l~~~~v~~~~~~~ 130 (383) +.+-+.. .+.++.+ | ....+...+..+++.+|.||+.+.++.++.+ ..+.+++|..+.+..+.. 
T Consensus 82 ~~~~~~~~~~~~l~~i~~~-N---~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D~~ 157 (485) T protein:vir:24 82 FRLGDADEADEELWQWWQA-N---NLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEIDPR 157 (485) T ss_pred eecCCCchhHHHHHHHHHh-c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEeeCC Confidence 5544321 1122221 1 3445677888999999999999988765432 257788888887766543 Q ss_pred CceeE--EEEee-----------------------cCcccc--cce--eecccceEEeccCCCCccccCcchHHH-HHHH Q lcl|NC_018285. 131 QNGLY--YNVTF-----------------------DDPRIP--PKQ--HVPQSDILHFRLLSVDGGLTSVSPLMA-LGRE 180 (383) Q Consensus 131 ~~~~~--y~~~~-----------------------~~~~~~--~~~--~~~~~dvih~~~~~~~~~~~G~s~~~~-~~~~ 180 (383) ..... |.+.. .++... ... .+..--|+||++....+..+|.|.+.. +... T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l 237 (485) T protein:vir:24 158 IGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSM 237 (485) T ss_pred cCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEEeccCcccCCcCCcccchhhHHHH Confidence 22111 11110 010000 000 111223466654433444568876542 3333 Q ss_pred HHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHH--HHHHHHHHHHhhcCCcceeecC-CCceeeecccChhhHHHHHH Q lcl|NC_018285. 181 LDIQKASDKLTLNSLKNALNANGILKIKGGGLLDF--KTKVSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVAQLLKQ 257 (383) Q Consensus 181 i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~--~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~~~~e~ 257 (383) ++....+.--......-.+.|..++..-. ..... ...-...+ ....+.+.+++ ++.++..+..+..+ .+++. 
T Consensus 238 iDa~~~~~s~~~~~~~~~a~p~~~i~G~~-~~~~~~~~~~~~~~~---~~~~~~i~~~~~~~~~~~q~~~~~~e-~~~~~ 312 (485) T protein:vir:24 238 TDAAARILMLMQATAELMGVPQRLIFGIK-PEEIGVDPETGQTLF---DAYLARILAFEDAEGKIQQFSAAELA-NFTNA 312 (485) T ss_pred HHHHHHHHHHHHHHHHhhcchhhhhccCC-ccccccccccccchh---hhcccceeccCCCCceEEeecccchH-HHHHH Confidence 33332222222223333344555554211 11000 00001111 12334555554 56777666554433 46777 Q ss_pred HHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------h-------hhc Q lcl|NC_018285. 258 ADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSC------------D-------VDA 318 (383) Q Consensus 258 ~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~------------~-------~e~ 318 (383) .+..+.+++..=++|++.+|+...+..+.++.+ +....+.-.+...+..|...|-. . +++ T Consensus 313 l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~-~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v 391 (485) T protein:vir:24 313 LDQIAKQVAAYTGLPPQYLSTAADNPASAEAIR-AAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGDVPPDMLRMET 391 (485) T ss_pred HHHHHHHHhcccCCCHHHhccccCcchHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCccccceeeE Confidence 788888888888999999987655433333332 22222222222222222221110 0 011 Q ss_pred cchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhhcCCcCCcchhHH------------hCCC-CCCCCCCCCCCCC Q lcl|NC_018285. 319 DIFPAVDPTGANYISRINSMVKSG--TLAQNQGLYILQQAEILPKELPKG------------ENPN-RTILKGGETNGQD 383 (383) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~l~~~g--~~t~nE~r~~lg~~~~~~~d~~~~------------~~~~-~~~~~ggd~~~~d 383 (383) ......-.+..+.+..+.+++.+| +++..-+++.+|..+-.-.++.+. 
+.++ ..+..+|+.++.+ T Consensus 392 ~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e 471 (485) T protein:vir:24 392 VWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTP 471 (485) T ss_pred EecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCC Confidence 111112245566677777888755 788888887776533111111110 0011 1111111111111 No 147 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=98.61 E-value=1.3e-08 Score=63.77 Aligned_cols=371 Identities=13% Similarity=0.112 Sum_probs=195.0 Q ss_pred Cc-----hhhhhhcCCccccccc-----ccccchhhcccccCCceechhh-----h---hc-cHHHHHHHHHHHHhhhhC Q lcl|NC_018285. 1 MP-----IFNLATESPPNNQGGF-----FDITDPEFLATLNGSEWVSAET-----A---LK-NSDLFSIISQLSNDLATA 61 (383) Q Consensus 1 Mg-----lf~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~-----a---~~-~~~v~~~i~~ia~~ia~~ 61 (383) |. +-.+-+..|+..+... .-+.++..... ....++.+. | +. ++.++-.+..|+++|+++ T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~--ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~ 78 (629) T protein:vir:99 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFR--KAMGSSTRTDWQDDAWKAYDAVGELRYYVGWRSSSASRV 78 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhcccccchhhh--hhcCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhcee Confidence 43 2222222222211000 01111111110 001122222 2 22 667788889999999999 Q ss_pred ceeeecchhh-----hhccCCCcc---------------CCHHHHHHHHHHHHHHcCCeEEEEeecC------CCceeEE Q lcl|NC_018285. 62 KLTTSRKQMQ-----GIVDNPSNS---------------ANRFNFYQSIFAQMLLGGEAFAYRWRND------NGRDMKW 115 (383) Q Consensus 62 p~~~~~~~~~-----~l~~~PN~~---------------~t~~~f~~~~~~~~~l~G~a~~~i~r~~------~g~~~~l 115 (383) .+..-+-+.+ .-.+.|++. +...++++.+..++-+-|++|+.+.... 
++.++.- T Consensus 79 rL~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~e 158 (629) T protein:vir:99 79 RLIASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVPE 158 (629) T ss_pred eeEeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchhh Confidence 9987653321 112233333 3457899999999999999999987322 3334343 Q ss_pred EE-eccceeEEEEcCCCceeEEEEeecCcccccceeecccceEEec--cCCCCccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 116 EY-LRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDILHFR--LLSVDGGLTSVSPLMALGRELDIQKASDKLTL 192 (383) Q Consensus 116 ~~-l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~--~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 192 (383) |+ |-++-|.- ..++ . -+.. +.+......+..|++ +| .+++....+..||+.+++..+.-.....+... T Consensus 159 W~~vt~~ei~~---~~~~-~--~i~l--P~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~ 229 (629) T protein:vir:99 159 WLALTPEEVRA---SEKK-T--IIEL--PTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIA 229 (629) T ss_pred heeechHHhhh---ccCc-e--eEEc--CCCCccceeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHH Confidence 33 33333321 1111 1 1111 112233344555554 55 45566667889999999999998888888888 Q ss_pred HHHhccCCcceeEeecCCCC----------------------HHHHHHHHHHHHH----hhc-----CCcceeecC---- Q lcl|NC_018285. 193 NSLKNALNANGILKIKGGGL----------------------LDFKTKVSRSRQA----MKQ-----MQGGPLVLD---- 237 (383) Q Consensus 193 ~~~~ng~~~~~i~~~~~~~~----------------------~e~~~~~~~~~~~----~~~-----~~g~~~vl~---- 237 (383) +..+.-..-.|++-++..++ .....++...+.+ ... .+--|+++. 
T Consensus 230 aaakSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E 309 (629) T protein:vir:99 230 NASKSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGE 309 (629) T ss_pred HHHHHHHhhCceeEeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechH Confidence 88777777777765543211 1133444444432 222 222344442 Q ss_pred --CCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCH---HHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 238 --DLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSS---LEMSSNVYSKAVARYLRPFLSELSQKL 312 (383) Q Consensus 238 --~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~---~e~~~~~~~~~l~P~~~~i~~~l~~~l 312 (383) ++++.-.+.....+ --+.+++..+..+|..+.|||+.|-+.++.+|- -+-...-+...|.|.+..|.++|++.+ T Consensus 310 ~i~~i~hlkf~~ei~e-~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~ 388 (629) T protein:vir:99 310 LIKNVTHLKFDNQVTE-VAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQV 388 (629) T ss_pred HhcCeeEEeecCchhH-HHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchhHHHHHHHHHhhH Confidence 23344343333333 347899999999999999999988655433331 111123345679999999999999987 Q ss_pred cchh----hccc---hhhhccCHH----HHHHHHHHHHhCCCcCHHHHHHHhhcCC---c--CCcc-------------- Q lcl|NC_018285. 313 SCDV----DADI---FPAVDPTGA----NYISRINSMVKSGTLAQNQGLYILQQAE---I--LPKE-------------- 362 (383) Q Consensus 313 ~~~~----e~~~---~~~~~~~~~----~~~~~~~~l~~~g~~t~nE~r~~lg~~~---~--~~~d-------------- 362 (383) +... .+|- ...++.+.. .+.+....+.+.|.+|-...|+.+|... + +..| T Consensus 389 Lrp~Le~eGiDp~kYvvW~DaS~Lt~dPd~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P 468 (629) T protein:vir:99 389 LRTVLMREGIDPNAYVVWHDASQLTVDPDKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDP 468 (629) T ss_pred HHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCc Confidence 7431 1111 111121111 1223455678899999999999998754 2 2222 Q ss_pred -hh--------HHhC--CC-----CCCC--CCCC------CCCCC Q lcl|NC_018285. 
363 -LP--------KGEN--PN-----RTIL--KGGE------TNGQD 383 (383) Q Consensus 363 -~~--------~~~~--~~-----~~~~--~ggd------~~~~d 383 (383) +. ..-. ++ .+|. .+|+ .+++| T Consensus 469 ~Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE~sga~~~~e 513 (629) T protein:vir:99 469 NLLPTLAVLIPELADVEFPTPTVALPPAEEQDGDEEASGASRREE 513 (629) T ss_pred chhhhhhhhhhhhcccccCccCCCCCccccCCCcccccCCCcCCC Confidence 00 0000 00 0011 1122 22222 No 148 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.60 E-value=2.1e-07 Score=57.19 Aligned_cols=372 Identities=10% Similarity=0.060 Sum_probs=172.7 Q ss_pred CchhhhhhcC--C---ccc-cccc--------ccccc------hhhcccccCC-ceech---------hhhhccHHHHHH Q lcl|NC_018285. 1 MPIFNLATES--P---PNN-QGGF--------FDITD------PEFLATLNGS-EWVSA---------ETALKNSDLFSI 50 (383) Q Consensus 1 Mglf~~~~~~--~---~~~-~~~~--------~~~~~------~~~~~~~~~~-~~~~~---------~~a~~~~~v~~~ 50 (383) ||+|+++++- + ... .... ..++. ..+.....+. ..+.. +.-.....-... T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLA 80 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHH Confidence 9999997531 1 000 0010 01110 0000000000 00110 011112233445 Q ss_pred HHHHHHhhhhCceee--ecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEc Q lcl|NC_018285. 51 ISQLSNDLATAKLTT--SRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRL 128 (383) Q Consensus 51 i~~ia~~ia~~p~~~--~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~ 128 (383) ++.+|+-+..-|..+ .+......+.+--..-......+..+.+....|.+++.+..+. |. +.+.+++|..+-+... 
T Consensus 81 ~~~~A~ll~~e~~~i~~~d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~-~~-~~i~~v~ad~~~P~~~ 158 (505) T protein:vir:79 81 SAKLASLIFNEQCQVTVSDETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVDS-GK-IKLAWATADQVYPLQA 158 (505) T ss_pred HHHHHhhhcCCCceeecCChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEeC-Cc-eEEEEEcCCeeEEEEE Confidence 566666665544433 3322222222222222345555667788888999988887763 43 3455566665544322 Q ss_pred CCCc-----------------eeEE-------------EEeec-----C-cccccc---------------ee---eccc Q lcl|NC_018285. 129 DNQN-----------------GLYY-------------NVTFD-----D-PRIPPK---------------QH---VPQS 154 (383) Q Consensus 129 ~~~~-----------------~~~y-------------~~~~~-----~-~~~~~~---------------~~---~~~~ 154 (383) +.+. ..+| ++... + ...|.. .. ++.. T Consensus 159 d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p 238 (505) T protein:vir:79 159 DTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHP 238 (505) T ss_pred cCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCcc Confidence 1111 0111 11100 0 000100 01 1112 Q ss_pred ceEEeccCCCC----ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeE-----eecCCCCHHHHHHHHHHHHH Q lcl|NC_018285. 155 DILHFRLLSVD----GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGIL-----KIKGGGLLDFKTKVSRSRQA 225 (383) Q Consensus 155 dvih~~~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~-----~~~~~~~~e~~~~~~~~~~~ 225 (383) -+.||+.+.++ ....|.|.+..+...+...+........-|+.|.. ..++ ........+........+.. T Consensus 239 ~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~~~~~~~~~~~~~~fd~ 317 (505) T protein:vir:79 239 LFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQR-RLIVPAEWLKTGSSYGGQASETHPPMFDP 317 (505) T ss_pred eEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhccc-ceeechHHhcccCCCCcccccccccCCCc Confidence 34566654333 23469999999998888877776666666766543 3222 22222111100000000000 Q ss_pred hhcCC-cceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcC-HHHHH--H----------- Q lcl|NC_018285. 
226 MKQMQ-GGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQS-SLEMS--S----------- 290 (383) Q Consensus 226 ~~~~~-g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~-~~e~~--~----------- 290 (383) .... ..+..-+++..++.++....+-++.+..+...++|+...|+++..+|...++.. ..|.. . T Consensus 318 -~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~~~ 396 (505) T protein:vir:79 318 -DETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQTRSSYI 396 (505) T ss_pred -cceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHHHHHHH Confidence 0000 000011234568888888888889999999999999999999999986443322 22221 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhc-c-------------hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcC Q lcl|NC_018285. 291 NVYSKAVARYLRPFLSELSQKLS-C-------------DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQA 356 (383) Q Consensus 291 ~~~~~~l~P~~~~i~~~l~~~l~-~-------------~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~ 356 (383) ..+..+|..++..|........+ + ++.++....+-.|.........+++.+|++++-+++... . T Consensus 397 ~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~e~~l~~~--~ 474 (505) T protein:vir:79 397 TQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMPKKQFLMRN--Y 474 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhc--C Confidence 12223333333333222111100 0 111222223345566777888889999999999887663 5 Q ss_pred CcCCcch----hHHhC-----CCCCCCCCCC Q lcl|NC_018285. 357 EILPKEL----PKGEN-----PNRTILKGGE 378 (383) Q Consensus 357 ~~~~~d~----~~~~~-----~~~~~~~ggd 378 (383) +++..|+ .+.+. 
.+....-||| T Consensus 475 ~~~eeea~~el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 475 GLDEEEADEWLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred CCChHHHHHHHHHHHHhccccCCCchhccCC Confidence 5555443 22221 2222345777 No 149 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.59 E-value=2.3e-07 Score=56.99 Aligned_cols=337 Identities=11% Similarity=-0.005 Sum_probs=153.9 Q ss_pred CchhhhhhcCCcccc------cccccccchhhcccccCCceechhhh--hc--cHHHHHHHHHHHHhhhhCceeeecchh Q lcl|NC_018285. 1 MPIFNLATESPPNNQ------GGFFDITDPEFLATLNGSEWVSAETA--LK--NSDLFSIISQLSNDLATAKLTTSRKQM 70 (383) Q Consensus 1 Mglf~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~a--~~--~~~v~~~i~~ia~~ia~~p~~~~~~~~ 70 (383) ..+++++.++..... ..+.....+ ... -+..+..+-. .+ ..-..-+|+-+|..+.=-.|..-+... T Consensus 3 ~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~--~~~--~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~~d~~l 78 (409) T protein:vir:94 3 EKGIGYLRFKLSVHKRRAEMRYDQYAMKYV--DRF--KGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFENDDFTV 78 (409) T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHhcccCc--hhh--cChhhhHHHHHHHhhhcchhHHHHHHhHhhcccCcccCCchHH Confidence 333344433211111 111110000 000 0111111100 01 112223444444433212222222222 Q ss_pred hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeE--EEEeecCcccc-- Q lcl|NC_018285. 71 QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLY--YNVTFDDPRIP-- 146 (383) Q Consensus 71 ~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~--y~~~~~~~~~~-- 146 (383) +.++.+ -........+..+.+++|.||+.+..+.+|.| .+.+++|.++....+.....+. |++...+.... 
T Consensus 79 ~~i~~~----N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~-~i~~~sp~~~~~i~D~~~~~~~~a~~~~~~d~~~~~~ 153 (409) T protein:vir:94 79 NEIFEE----NNPDIFFDSAVLSSLIASCSFTYISKGENDAV-RLQVIEAVNATGIIDPITGLLTEGYAVLERDENNNVV 153 (409) T ss_pred HHHHHh----cChhHHHHHHHHHHHHhcceeEEEecCCCCce-EEEEeccceEEEEEecCCCceeeeEEEEEecCCCceE Confidence 223222 12234455777888999999999999989977 6788999988887765443322 22221111100 Q ss_pred cceeeccc----------------------ceEEeccCCCCccccCcchH----HHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_018285. 147 PKQHVPQS----------------------DILHFRLLSVDGGLTSVSPL----MALGRELDIQKASDKLTLNSLKNALN 200 (383) Q Consensus 147 ~~~~~~~~----------------------dvih~~~~~~~~~~~G~s~~----~~~~~~i~~~~~~~~~~~~~~~ng~~ 200 (383) ....+.++ -|++|.+..-.+..+|.|.+ ..+.+.+.....-......++.+ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~--- 230 (409) T protein:vir:94 154 LEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSF--- 230 (409) T ss_pred EEEEEecCcEEEEEecCceeEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcC--- Confidence 00111112 24444433223445677744 44445544444444444455444 Q ss_pred cceeEee-cCCCCHHHHHHHHHHHHHhhcCCcceeecC-----CCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_018285. 201 ANGILKI-KGGGLLDFKTKVSRSRQAMKQMQGGPLVLD-----DLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPEN 274 (383) Q Consensus 201 ~~~i~~~-~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~-----~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~ 274 (383) |.-.+.. +... +... .|.. 
..++++.++ .+.++.++..+.-+ .|++..+....++|..=++|++ T Consensus 231 pqr~i~G~d~d~--~~~~----~~~~---~~~~i~~~~~d~dg~~~~v~q~~~~~l~-~~~~~l~~~~~~~a~~t~lP~~ 300 (409) T protein:vir:94 231 PQKYVTGLSDDA--EPME----TWKA---TVSSMLQFTKDEDGDKPTLGQFTQPSMS-PFTEQLRTAAAGFAGETGLTLD 300 (409) T ss_pred hhheeEecCCCC--cccc----hhhh---hHHHhhcCCCCCCCCCceEEecCCCChh-HHHHHHHHHHHHHhhhcCCCHH Confidence 5444432 1111 1111 1211 123344443 23556555443322 4889999999999999999999 Q ss_pred HhcccccCcCHHHHHHHHH---HHHHHHHHHHHHHHHHHh--hc----ch----------hhccchhhhccCH---HHHH Q lcl|NC_018285. 275 VVGGQGDQQSSLEMSSNVY---SKAVARYLRPFLSELSQK--LS----CD----------VDADIFPAVDPTG---ANYI 332 (383) Q Consensus 275 ~lg~~~~~~~~~e~~~~~~---~~~l~P~~~~i~~~l~~~--l~----~~----------~e~~~~~~~~~~~---~~~~ 332 (383) .+|+.+++..+.+..++-. .....-..+.+.+.+.+. |. .. .+....+.+..+. ...+ T Consensus 301 ~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~~a 380 (409) T protein:vir:94 301 DLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQFRKTKPKWEPLFEADASMLSLIG 380 (409) T ss_pred HhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccccccceEEeccCCCcchHHHHHHH Confidence 9998766533343332111 111111122222222221 10 00 0111112223333 3445 Q ss_pred HHHHHHHhCC--CcCHHHHHHHhhcCCcC Q lcl|NC_018285. 
333 SRINSMVKSG--TLAQNQGLYILQQAEIL 359 (383) Q Consensus 333 ~~~~~l~~~g--~~t~nE~r~~lg~~~~~ 359 (383) ..+.+|+++| ++...-+++++|+..-+ T Consensus 381 Da~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 381 DGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred HHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 6677888888 77788899988877543 No 150 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=98.53 E-value=8.9e-08 Score=59.25 Aligned_cols=376 Identities=13% Similarity=0.115 Sum_probs=199.1 Q ss_pred Cch------hhhhhcCCccccccccc----ccchhhcccccCCce---echhhhh----ccHHHHHHHHHHHHhhhhCce Q lcl|NC_018285. 1 MPI------FNLATESPPNNQGGFFD----ITDPEFLATLNGSEW---VSAETAL----KNSDLFSIISQLSNDLATAKL 63 (383) Q Consensus 1 Mgl------f~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~---~~~~~a~----~~~~v~~~i~~ia~~ia~~p~ 63 (383) |.- -.+-+..++..+..... +.++.-...+..+.. -....|. .++.++-.+..|+++|+++.+ T Consensus 1 ~~a~~~lr~~rrpkg~~~a~~r~L~aAs~~~~dpg~~~~~~~g~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~rL 80 (631) T protein:vir:10 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCRL 80 (631) T ss_pred CCcccceeeeecCCCCCccchhhhhhhhccccchhhhhhhhcCCcccchhhHHHHHHHHhhhhHHHHhhhhhhhhceeee Confidence 432 22222222222111111 112211111111111 1122222 246778888999999999999 Q ss_pred eeecchhh-------------------h-hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEe-ecCCC---------cee Q lcl|NC_018285. 64 TTSRKQMQ-------------------G-IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRW-RNDNG---------RDM 113 (383) Q Consensus 64 ~~~~~~~~-------------------~-l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~-r~~~g---------~~~ 113 (383) ..-+-+.+ . ...-+.--+...++++.+..++-+-|++|+.+. |..+| ++. 
T Consensus 81 ~as~idpDtg~ptg~iee~~~~~~~v~~~~~~i~gG~lgQ~~llkrl~~~ltV~GE~wiv~l~~p~~~~~~~pd~~~r~~ 160 (631) T protein:vir:10 81 VASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRTR 160 (631) T ss_pred EeeeeccCCCCCccccccCCchhHHHHHHHHhcCCCcchHHHHHHHHHhheecccceEEEEEeccCcCCCCCcccccccc Confidence 87543311 1 112344556778999999999999999999875 32321 233 Q ss_pred -EEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecccceE-EeccCCCCccccCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 114 -KWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDIL-HFRLLSVDGGLTSVSPLMALGRELDIQKASDKLT 191 (383) Q Consensus 114 -~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvi-h~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 191 (383) ++++|....|......++ ..+....+. ........|++ .+=.+++....+..||+.+++..+.-.....+.. T Consensus 161 ~~W~~vt~~ei~~~~~g~g--~~v~lp~g~----~h~~~~~~D~l~RiW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i 234 (631) T protein:vir:10 161 QEWYAVSKEEIKKSNKGSG--TNIVLPTGE----EHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTI 234 (631) T ss_pred cceeeccHHHHhcccCccc--ceeecCCCC----ccceecCCceEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHH Confidence 334444444432221211 222222111 11223333433 2325666666788999999999999999998888 Q ss_pred HHHHhccCCcceeEeecCCCCH---------------------HHHHHHHHHHH----Hhhc-----CCcceeecC---- Q lcl|NC_018285. 192 LNSLKNALNANGILKIKGGGLL---------------------DFKTKVSRSRQ----AMKQ-----MQGGPLVLD---- 237 (383) Q Consensus 192 ~~~~~ng~~~~~i~~~~~~~~~---------------------e~~~~~~~~~~----~~~~-----~~g~~~vl~---- 237 (383) .+..+.-..-.|++-+|..++= .+..++...+. .... .+--|+++. T Consensus 235 ~aaakSRl~gnGvlflP~els~P~~~~~~~~~~g~~v~~~~g~pa~~~l~~~l~q~a~tai~De~S~aA~vPii~~~p~E 314 (631) T protein:vir:10 235 ANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGE 314 (631) T ss_pred HHHHHHHHhhCceeEeccccccCCCCCCCCCcCCccCCccccchhHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechH Confidence 8888888888888877654331 13444444332 1122 222344442 Q ss_pred --CCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCH---HHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 
238 --DLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSS---LEMSSNVYSKAVARYLRPFLSELSQKL 312 (383) Q Consensus 238 --~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~---~e~~~~~~~~~l~P~~~~i~~~l~~~l 312 (383) ++++.-.+.....+ --+.+++..+..+|..+.|||+.|-+.++.+|- -+-...-+.-.|.|.+..|.++|++.+ T Consensus 315 ~i~~i~hlkf~~ei~e-~aiktR~daI~RlA~glDi~pE~LLGlGsd~NHWsAWqI~dedVrlHI~P~l~lic~AlT~q~ 393 (631) T protein:vir:10 315 QIKDVKHIRFDNEITE-VAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQI 393 (631) T ss_pred HhcCeeEEeecCchhH-HHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhH Confidence 23344443333333 347899999999999999999988655433331 111123345679999999999999988 Q ss_pred cchh----hccc---hhhhccCHH----HHHHHHHHHHhCCCcCHHHHHHHhhcCC---cC--Ccc-------------- Q lcl|NC_018285. 313 SCDV----DADI---FPAVDPTGA----NYISRINSMVKSGTLAQNQGLYILQQAE---IL--PKE-------------- 362 (383) Q Consensus 313 ~~~~----e~~~---~~~~~~~~~----~~~~~~~~l~~~g~~t~nE~r~~lg~~~---~~--~~d-------------- 362 (383) +... .+|- ...++.+.. .+.+....+.+.|.+|-...|+.+|... ++ +.| T Consensus 394 Lrp~Le~eGvDp~kYvvW~DaS~Lt~dPdr~deA~qa~drGAIt~eAlrk~lGf~eDd~yd~~t~e~~~~~a~~av~~dp 473 (631) T protein:vir:10 394 LRVTLAREGIDPSKYVVWYDPSQLTIDPDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDP 473 (631) T ss_pred HHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCcCHHHHHHHhcCchhcccCcCchHHHHHHHHHHhhccc Confidence 7431 1111 111122111 1223455678899999999999998654 11 111 Q ss_pred ------hhH----HhC--CCCC--CCCCCCCCCCC Q lcl|NC_018285. 363 ------LPK----GEN--PNRT--ILKGGETNGQD 383 (383) Q Consensus 363 ------~~~----~~~--~~~~--~~~ggd~~~~d 383 (383) .+. ... 
++.+ +..+|+.+..+ T Consensus 474 aLip~lApl~~~~~~~v~~P~~~a~~~~g~ed~~~ 508 (631) T protein:vir:10 474 TLIPMLAPLIAGVLKQIEFPQQQAIDSGGNEDTSD 508 (631) T ss_pred CcchhhHHHHHHHhhhccCCCCCCCCCCCCCcccc Confidence 010 111 1111 11233332222 No 151 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.49 E-value=1.5e-07 Score=58.05 Aligned_cols=365 Identities=11% Similarity=0.030 Sum_probs=159.0 Q ss_pred CchhhhhhcCC----c--ccccccccccchhhcccccCCceechhh---hhccHHHHHHHHHHHHhhhhCceeeecchh- Q lcl|NC_018285. 1 MPIFNLATESP----P--NNQGGFFDITDPEFLATLNGSEWVSAET---ALKNSDLFSIISQLSNDLATAKLTTSRKQM- 70 (383) Q Consensus 1 Mglf~~~~~~~----~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~---a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~- 70 (383) +.+.+++.+.. + .....+.....+ . ........-..+. -....-...+|+..+.-+-.-|+.+...+. T Consensus 7 ~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~-i-~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~d~ 84 (456) T protein:vir:79 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAP-L-PELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADS 84 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCC-h-hhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecCCCCCc Confidence 22222221111 0 000011100000 0 0000000000000 011123355677777776667887643221 Q ss_pred ------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCce-----eEEEEe Q lcl|NC_018285. 71 ------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNG-----LYYNVT 139 (383) Q Consensus 71 ------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~-----~~y~~~ 139 (383) ..++.+ | ....+.+.+..+++.+|.||+.+-++.+|.+ .+..++|..+.+..++.... +.|... 
T Consensus 85 ~~~~~~~~~~~~-n---~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~-~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~ 159 (456) T protein:vir:79 85 DLALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRD 159 (456) T ss_pred cHHHHHHHHHHh-c---ChhHHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEeccceeEEEEcCCCCCceEEEEEEEEe Confidence 122222 2 3345567788999999999999989889987 47788888887766542211 111100 Q ss_pred ecC-----------ccc------------c-------cceeecccceEE-------eccCCCCccccCcchHHHHHHHHH Q lcl|NC_018285. 140 FDD-----------PRI------------P-------PKQHVPQSDILH-------FRLLSVDGGLTSVSPLMALGRELD 182 (383) Q Consensus 140 ~~~-----------~~~------------~-------~~~~~~~~dvih-------~~~~~~~~~~~G~s~~~~~~~~i~ 182 (383) .+. +.. . ........++-| +...+ ..|.|-+......++ T Consensus 160 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N----~~~~gd~e~v~~liD 235 (456) T protein:vir:79 160 LDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN----PDGMGEVEPHIDIIN 235 (456) T ss_pred cCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEecC----CCCCchhhhhHHHHH Confidence 000 000 0 000000001111 11111 236666665554444 Q ss_pred HHHHHHHHHHHHHhccCCcceeEeecCC---CCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHH Q lcl|NC_018285. 183 IQKASDKLTLNSLKNALNANGILKIKGG---GLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQAD 259 (383) Q Consensus 183 ~~~~~~~~~~~~~~ng~~~~~i~~~~~~---~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~ 259 (383) ....+.--........+.|..++..... 
..++.-..+ ..........+.++.++++.++..+..+.-+ .+.+..+ T Consensus 236 ~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i-~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~~-~~~~~l~ 313 (456) T protein:vir:79 236 RINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAI-DYASIFEAAPGALWELPPGVDIWESQTNDFT-PMLSAIK 313 (456) T ss_pred HHHHHHHHHHHHHHHHhhHHHHHhcCCccccccccccccc-chhhhhhhhccccccCCCCcceeeecccChH-HHHHHHH Confidence 4333322222222222333333322110 001110111 1111112233556677888888766544332 3788889 Q ss_pred HHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----------chh-----hccchhhh Q lcl|NC_018285. 260 WTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLS----------CDV-----DADIFPAV 324 (383) Q Consensus 260 ~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~----------~~~-----e~~~~~~~ 324 (383) ....+|+..-++|++.+|+.+.+.+.+ +.+ +....+.-.+...+..|...|- ... +....+.. T Consensus 314 ~~i~~i~~~t~~p~~~~~~~~~N~Sg~-Al~-~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~~~~~i~v~w~~~~ 391 (456) T protein:vir:79 314 EHIRQLSSATKTPLPMLMPDSANQSAE-GAH-NIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPD 391 (456) T ss_pred HHHHHHHhhcCCChhHhcccccCcHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceEEeCCCC Confidence 999999999999999999755444332 221 1111111122222222222111 111 11111112 Q ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhH------Hh---CCCCCCCCCCCCCCCC Q lcl|NC_018285. 
325 DPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPK------GE---NPNRTILKGGETNGQD 383 (383) Q Consensus 325 ~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~------~~---~~~~~~~~ggd~~~~d 383 (383) -.+..+.++.+.++...|+.+...+++++|..+ .++.+ .+ .+...++..++.++.- T Consensus 392 ~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~---~~i~~~e~~r~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:79 392 RVTLGEKYSAASLAKAAGESWASIRRNILNYNA---DQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred CcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCH---HHHHHHHHHHHHHHHHHHhhhHhhcCCCCCCC Confidence 234466677778888899999988888876544 22211 11 1111123333333333 No 152 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.43 E-value=6.8e-07 Score=54.40 Aligned_cols=370 Identities=12% Similarity=0.089 Sum_probs=167.5 Q ss_pred CchhhhhhcCC------cccccccccccc-------hhh-------cccccC------Ccee----chhhhhccHHHHHH Q lcl|NC_018285. 1 MPIFNLATESP------PNNQGGFFDITD-------PEF-------LATLNG------SEWV----SAETALKNSDLFSI 50 (383) Q Consensus 1 Mglf~~~~~~~------~~~~~~~~~~~~-------~~~-------~~~~~~------~~~~----~~~~a~~~~~v~~~ 50 (383) ||+|.+++.-- -........+++ +.. .-...+ .... ..+.-........+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i 80 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTA 80 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHH Confidence 99999875420 000000000000 000 000000 0000 00111122233445 Q ss_pred HHHHHHhhhhCce--eeecch-hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEE Q lcl|NC_018285. 
51 ISQLSNDLATAKL--TTSRKQ-MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNR 127 (383) Q Consensus 51 i~~ia~~ia~~p~--~~~~~~-~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~ 127 (383) ++.+|+-+..-|. .+...+ .+..+.+--..-......+..+.+.+..|.+++.+..+.++ +.+.++++..+-+.. T Consensus 81 ~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~--~~i~~v~ad~~~P~~ 158 (508) T protein:vir:15 81 ARRIASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDGNH--IKIAWVRADQFYPLQ 158 (508) T ss_pred HHHHHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeCCe--eEEEEEcCCeeEEEE Confidence 6666666655443 332222 22211111111223444556677888899999888776532 345556666544322 Q ss_pred cCCCc-----------------eeEEE--------------Ee---ecCc---cccccee---ec--------------- Q lcl|NC_018285. 128 LDNQN-----------------GLYYN--------------VT---FDDP---RIPPKQH---VP--------------- 152 (383) Q Consensus 128 ~~~~~-----------------~~~y~--------------~~---~~~~---~~~~~~~---~~--------------- 152 (383) .+.+. ..+|. +. +... ..|..+. ++ T Consensus 159 ~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~ 238 (508) T protein:vir:15 159 SNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQ 238 (508) T ss_pred EcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCCC Confidence 11111 11111 00 0000 0011110 00 Q ss_pred ccceEEeccCCCC----ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecC-CCCHHHHHHHHHHHHHhh Q lcl|NC_018285. 153 QSDILHFRLLSVD----GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTKVSRSRQAMK 227 (383) Q Consensus 153 ~~dvih~~~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~-~~~~e~~~~~~~~~~~~~ 227 (383) .--..||+.+-++ ....|.|.+..+...+...+........-|+. +.+..++...- ..+++....+.. . 
T Consensus 239 ~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~-~~~~i~v~~~~l~~d~~~~~~~~~-----~ 312 (508) T protein:vir:15 239 RPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRL-GQKHIAVQPGMLRFDDEHKPTFDT-----E 312 (508) T ss_pred cceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHh-cccceeechHHhcCCCCCccccCC-----C Confidence 0113455543322 23569999999998888887777777777754 44444442110 001111000000 0 Q ss_pred cCCcceee--cCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCc-CHHHHH---H----------H Q lcl|NC_018285. 228 QMQGGPLV--LDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ-SSLEMS---S----------N 291 (383) Q Consensus 228 ~~~g~~~v--l~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~-~~~e~~---~----------~ 291 (383) ......+- -++|..++.++....+-++.+..+...+.|....|++|..+|...++. +..+.. + . T Consensus 313 ~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~~~~ 392 (508) T protein:vir:15 313 QNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSSYLT 392 (508) T ss_pred CeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHHHHH Confidence 00001111 134456777888877788999999999999999999999998654433 222221 1 1 Q ss_pred HHHHHHHHHHHHHHHHHHHh-hcc---------------hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhc Q lcl|NC_018285. 292 VYSKAVARYLRPFLSELSQK-LSC---------------DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQ 355 (383) Q Consensus 292 ~~~~~l~P~~~~i~~~l~~~-l~~---------------~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~ 355 (383) .+..+|..++..|....+.. +.. ++.++....+-.|.........+++.+|++++-+++... T Consensus 393 ~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s~e~~i~~~-- 470 (508) T protein:vir:15 393 MVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSKQTFLQRN-- 470 (508) T ss_pred HHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhc-- Confidence 22223333322222211110 000 011222233345667777888899999999999988653 Q ss_pred CCcCCcch----hHHhCCC---------CCCCCCCCCC Q lcl|NC_018285. 
356 AEILPKEL----PKGENPN---------RTILKGGETN 380 (383) Q Consensus 356 ~~~~~~d~----~~~~~~~---------~~~~~ggd~~ 380 (383) .+++..|+ .+.+.-. ..+..|||.+ T Consensus 471 ~g~~deea~~el~ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 471 YGMTDEQAAEELAKIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred CCCChHHHHHHHHHHHHhccccCccccccccCCCCCCC Confidence 55555443 2221111 1122233333 No 153 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=98.40 E-value=8.5e-07 Score=53.86 Aligned_cols=347 Identities=12% Similarity=0.019 Sum_probs=156.1 Q ss_pred CchhhhhhcCCcccccccccccchhhcccccCCceechhhh----hccHHHHHHHHHHHHhhhhCceeeecchhhhhccC Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEFLATLNGSEWVSAETA----LKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVDN 76 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a----~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~ 76 (383) |.++.- +......+.....+ .... +..+..+.. +...-..-+|+-+|..+.=-.|..-+...+.++. T Consensus 1 l~~~~~----r~~~~~~yY~g~~~--~~~~--~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~~~d~~l~~i~~- 71 (410) T protein:vir:95 1 MNLYQS----RVNLRYKHYAMQHY--EAPT--GITIPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFANDDFNVTEIFD- 71 (410) T ss_pred CCcchh----hHHHHHHHhcCCCC--cccc--chhccHHHHhHHHhhcchhHHHHHHhHhhhccccccCCCchHHHHHh- Confidence 333221 11111111111000 0000 001111100 0111222234444443321223222222233332 Q ss_pred CCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEe--ecCccc--ccceeec Q lcl|NC_018285. 77 PSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVT--FDDPRI--PPKQHVP 152 (383) Q Consensus 77 PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~--~~~~~~--~~~~~~~ 152 (383) .-.......++..+.+++|.||+.|..+.+|.| .+.+++|.++....+.....+.+-+. .....+ .....+. 
T Consensus 72 ---~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~-~i~~~sP~~~~~i~Dp~~~~~~~al~~~~~~~~~~~~~~~~~~ 147 (410) T protein:vir:95 72 ---RNNPDIFFDSAILSALIGSCSFVYISKGEDDEV-RLQVIESSNATGVIDPITGLLVEGYAVLARDDYNRPTLEAYFE 147 (410) T ss_pred ---hcChHHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEEeCCCCceEEEEEEEEecCCCeEEEEEEEe Confidence 223344555777899999999999999888876 57889999988877654443332211 111000 0111122 Q ss_pred cc---------------------ceEEeccCCCCccccCcc----hHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEee Q lcl|NC_018285. 153 QS---------------------DILHFRLLSVDGGLTSVS----PLMALGRELDIQKASDKLTLNSLKNALNANGILKI 207 (383) Q Consensus 153 ~~---------------------dvih~~~~~~~~~~~G~s----~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~ 207 (383) ++ -|++|.+..-.+..+|.| ++..+.+.+.....-......++.+ |.-.+.. T Consensus 148 ~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~---pqr~i~G 224 (410) T protein:vir:95 148 PNATHFIPKDGEPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSW---PQKYILG 224 (410) T ss_pred CCcEEEEeeCCccccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcc---hhheeec Confidence 22 244554332223446766 3455555555555444455555543 5444432 Q ss_pred cCCCCHHHHHHHHHHHHHhhcCCcceeecCC-----CceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccC Q lcl|NC_018285. 208 KGGGLLDFKTKVSRSRQAMKQMQGGPLVLDD-----LEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQ 282 (383) Q Consensus 208 ~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~-----g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~ 282 (383) -. .+.+... .|. ...++++.++. 
+.++-+++...-+ .|++..+....+||..=++|++.||..+++ T Consensus 225 ~d-~d~~~~~----~~~---~~~~~i~~~~~~~~~~~~~v~q~~~~~l~-~~~~~l~~l~~~~a~~s~lP~~~lg~~~~N 295 (410) T protein:vir:95 225 LD-PDAEPME----KWK---ATVSSLLTISSSDKGVKPSVGQFTTASMS-PFTEQLRTAAAGFAGEMGLTLDDLGFVSDN 295 (410) T ss_pred cC-CCCCcCc----hhh---hhhhhheeccCCCCCCcceEEecCCCChH-HHHHHHHHHHHHHhhhcCCCHHHhccccCc Confidence 11 0111111 111 22244555543 3566555443332 488999999999999999999999987665 Q ss_pred cCHHHHHHHH---HHHHHHHHHHHHHHHHHHh----hc--chh----------hccchhhhcc---CHHHHHHHHHHHHh Q lcl|NC_018285. 283 QSSLEMSSNV---YSKAVARYLRPFLSELSQK----LS--CDV----------DADIFPAVDP---TGANYISRINSMVK 340 (383) Q Consensus 283 ~~~~e~~~~~---~~~~l~P~~~~i~~~l~~~----l~--~~~----------e~~~~~~~~~---~~~~~~~~~~~l~~ 340 (383) ..+++..+.- +...+.-..+.+.+.+.+. +. ... +....+..+. +....++.+.+|++ T Consensus 296 psSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~ 375 (410) T protein:vir:95 296 PSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQ 375 (410) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHH Confidence 3333333211 1111112222222222221 10 100 0001111122 33455666667777 Q ss_pred C--CCcCHHHHHHHhhcCCcCCcchhHH-hCCCCCCCCCCCCCCC Q lcl|NC_018285. 341 S--GTLAQNQGLYILQQAEILPKELPKG-ENPNRTILKGGETNGQ 382 (383) Q Consensus 341 ~--g~~t~nE~r~~lg~~~~~~~d~~~~-~~~~~~~~~ggd~~~~ 382 (383) + |+.+..-+++.+|+.+ .++.+. ..-. 
..+|+ T Consensus 376 a~~g~~~~~~~~~~lg~~~---~~~~~~~~~e~-------~~~g~ 410 (410) T protein:vir:95 376 ALPGYINAETIRDLTGIAG---DMSAKPVVSEG-------GSNGE 410 (410) T ss_pred hccCCccHHHHHHhcCCCh---HHHHHHHHHHH-------HhCCC Confidence 6 7888888999998753 233222 1100 11112 No 154 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.37 E-value=9.7e-07 Score=53.54 Aligned_cols=368 Identities=8% Similarity=-0.009 Sum_probs=149.7 Q ss_pred Cc-------hhhhhhcCCcc------cccccccccchhhcccccCCceechhh-hh--ccHHHHHHHHHHHHhhhhCcee Q lcl|NC_018285. 1 MP-------IFNLATESPPN------NQGGFFDITDPEFLATLNGSEWVSAET-AL--KNSDLFSIISQLSNDLATAKLT 64 (383) Q Consensus 1 Mg-------lf~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~-a~--~~~~v~~~i~~ia~~ia~~p~~ 64 (383) |. +.+.+.+.... ....+...... ....... +..+. .+ ...-...+|+.++..+---++. T Consensus 8 ~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~--i~~~~~~--~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~ 83 (485) T protein:vir:10 8 QEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERR--PEAIGVT--VPIQMQSLLAHVGYPRLYVDSIAERQAVEGFR 83 (485) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--chhcCCC--CChhhhhhhhhcCcHHHHHHHHHhhhccccee Confidence 11 11111111000 00000000000 0000000 00000 00 0112233455555444222343 Q ss_pred eecch-hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCc-------eeEEEEeccceeEEEEcCCCceeE- Q lcl|NC_018285. 65 TSRKQ-MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGR-------DMKWEYLRPSQVSFNRLDNQNGLY- 135 (383) Q Consensus 65 ~~~~~-~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~-------~~~l~~l~~~~v~~~~~~~~~~~~- 135 (383) +-+.. ......+-...-....+...+..+++.+|.||+.+.++..+. ...+.+++|..+.+..+.....+. 
T Consensus 84 ~~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~ 163 (485) T protein:vir:10 84 FGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYAEIDPRIGRVSK 163 (485) T ss_pred cCCCchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEEEEcCCCCceeE Confidence 32211 111111111122345666788899999999999998875432 235777888887776654332211 Q ss_pred -EEEeecC--cccccceeecc-------------------------cceEEeccCCCCccccCcchHH----HHHHHHHH Q lcl|NC_018285. 136 -YNVTFDD--PRIPPKQHVPQ-------------------------SDILHFRLLSVDGGLTSVSPLM----ALGRELDI 183 (383) Q Consensus 136 -y~~~~~~--~~~~~~~~~~~-------------------------~dvih~~~~~~~~~~~G~s~~~----~~~~~i~~ 183 (383) +.+.... +.......+.. -.|++|.+....+..+|.|-+. .+...+.. T Consensus 164 ~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~ 243 (485) T protein:vir:10 164 AIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAAAR 243 (485) T ss_pred EEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccEEEeccccccCCCCCccchhHHHHHHHHHHHH Confidence 1111110 00000011111 2345555433334456777543 33333333 Q ss_pred HHHHHHHHHHHHhccCCcceeEeecC--CC-CHHHHHHHHHHHHHhhcCCcceeecC-CCceeeecccChhhHHHHHHHH Q lcl|NC_018285. 184 QKASDKLTLNSLKNALNANGILKIKG--GG-LLDFKTKVSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVAQLLKQAD 259 (383) Q Consensus 184 ~~~~~~~~~~~~~ng~~~~~i~~~~~--~~-~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~~~~e~~~ 259 (383) ..+-......+| +.|..+++... .. .++... 
...+ ....++++.++ ++.++.++..+.-+ .+++..+ T Consensus 244 ~~s~~~~~~~~~---a~p~~~i~G~~~~~~~~~~~~~--~~~~---~~~~~~i~~~~~~d~k~~q~~~~~~~-~~~~~l~ 314 (485) T protein:vir:10 244 ILMLMQATAELM---GVPQRLIFGIKPEEIGVDPETG--QTLF---DAYLARILAFEDAEGKIQQFSAAELA-NFTNALD 314 (485) T ss_pred HHHHHHHHHHhh---cchHHHHhcCCccccccccccc--chhh---hhcccceeccCCCCceEEeecccchH-HHHHHHH Confidence 333222223333 33444443211 00 000000 0111 12234556554 56777766554433 3778888 Q ss_pred HHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHH----HHHHHHHHHHHhh--cch-------------hhccc Q lcl|NC_018285. 260 WTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVAR----YLRPFLSELSQKL--SCD-------------VDADI 320 (383) Q Consensus 260 ~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P----~~~~i~~~l~~~l--~~~-------------~e~~~ 320 (383) ....+|+..=++|++.+|+...+..+.++.+. ....+.- ..+.+...|.+.+ ... ++... T Consensus 315 ~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~-~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w 393 (485) T protein:vir:10 315 QIAKQVAAYTGLPPQYLSTAADNPASAEAIRA-AESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGGDVPPDMLRMETVW 393 (485) T ss_pred HHHHHHhcccCCCHHHhccccCchhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCcccceeeeEEe Confidence 88999999999999999876544333333221 1111111 1122222221110 000 11111 Q ss_pred hhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhhcCCcCCcchhHH------------hCCCCCCCCCCCC------- Q lcl|NC_018285. 321 FPAVDPTGANYISRINSMVKSG--TLAQNQGLYILQQAEILPKELPKG------------ENPNRTILKGGET------- 379 (383) Q Consensus 321 ~~~~~~~~~~~~~~~~~l~~~g--~~t~nE~r~~lg~~~~~~~d~~~~------------~~~~~~~~~ggd~------- 379 (383) ....-.+..+.+..+.+|+.+| +++...+++.+|..+-+-.++.+. ..++. +.+|.++ T Consensus 394 ~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 472 (485) T protein:vir:10 394 RDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTMVD-PNPTVPGSPSPAPA 472 (485) T ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhc-cCCCCCCCCCcccc Confidence 1222345566777788888866 889999998877544211111110 01111 1111111 Q ss_pred -------CCCC Q lcl|NC_018285. 
380 -------NGQD 383 (383) Q Consensus 380 -------~~~d 383 (383) ++.| T Consensus 473 ~~~~~~~~~~~ 483 (485) T protein:vir:10 473 PKPAALESGGD 483 (485) T ss_pred ccCcCCCCCCC Confidence 1111 No 155 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=98.35 E-value=1.1e-06 Score=53.24 Aligned_cols=364 Identities=9% Similarity=-0.030 Sum_probs=153.3 Q ss_pred Cc-----hhhhhhcCCc--ccccccccccchhhcccccCCceechhhhh----ccHHHHHHHHHHHHhhhhCceeeecch Q lcl|NC_018285. 1 MP-----IFNLATESPP--NNQGGFFDITDPEFLATLNGSEWVSAETAL----KNSDLFSIISQLSNDLATAKLTTSRKQ 69 (383) Q Consensus 1 Mg-----lf~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~----~~~~v~~~i~~ia~~ia~~p~~~~~~~ 69 (383) .. |+....++.+ .....+..... ... .......-.....+ .+.-...+|+.+++.+---.|.+-+.. T Consensus 15 ~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~-~i~-~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~gf~~~d~~ 92 (479) T protein:vir:99 15 AKYLETKVFPKMNTECERLDDFEAWTKNGQ-EVP-DLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQLIVDGYRKTGTN 92 (479) T ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHhcCC-ccc-ccccccCChhHHHHHHHhhcCcHHHHHHHHHhhcccccccCCCch Confidence 11 1111111110 00001100000 000 00000000011111 111233355555554332233333222 Q ss_pred hh----hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEee-----cCCCceeEEEEeccceeEEEEcCCCce--eEEEE Q lcl|NC_018285. 70 MQ----GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWR-----NDNGRDMKWEYLRPSQVSFNRLDNQNG--LYYNV 138 (383) Q Consensus 70 ~~----~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r-----~~~g~~~~l~~l~~~~v~~~~~~~~~~--~~y~~ 138 (383) .. .++.. | ........+..+++.+|.||+++.+ ++.|.+ .+..++|..+....++.... 
..|.+ T Consensus 93 ~~~~~~~i~~~-N---~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~-~i~~~~p~~~~~iydd~~~~~~~~~~~ 167 (479) T protein:vir:99 93 ENAKGWDTWRL-N---QMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVA-RIKCIDPRDAFAIWEDPYWDEWPKYLL 167 (479) T ss_pred hhHHHHHHHHh-c---ChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCce-EEEEechhheEEEecCCcccceeeEEE Confidence 21 12222 2 2234556778888999999998874 344544 46677888877665433221 11221 Q ss_pred eecCc----------------cccc-----cee--ecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 139 TFDDP----------------RIPP-----KQH--VPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSL 195 (383) Q Consensus 139 ~~~~~----------------~~~~-----~~~--~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 195 (383) ..... ..+. ..+ +..--|++|++.... ...|.|-+..+...++.......-..... T Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~-~~~g~sd~e~v~~liDa~~~~~s~~~~~~ 246 (479) T protein:vir:99 168 ERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDL-RGVCYGDVEPLVTVAKAIDKTGLDILLVQ 246 (479) T ss_pred eecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeecCCCc-CcCCcchhHHHHHHHHHHHHHHHHHHHHH Confidence 11100 0000 011 122235666643222 23588877776666666655555555555 Q ss_pred hccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceee-cCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_018285. 196 KNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLV-LDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPEN 274 (383) Q Consensus 196 ~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~v-l~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~ 274 (383) .-.+.|..++..-.....+.... ..+. -..++++. -++++++..+..... 
..+.+..+....+|+..=++|++ T Consensus 247 ~~~a~p~~~i~G~~~~~~~~~~~--~~~~---~~~~~i~~~~~~~~~~~q~~~~~~-~~~~~~l~~~i~~i~~~t~~p~~ 320 (479) T protein:vir:99 247 HHQSFQIRWATGLMLPEGANADQ--EKMR---FAQESMLISQNEKASFGAIPAAPL-DGLLNAYKESLLEFLALAQLPPH 320 (479) T ss_pred HHhhchhhhhcCCCcccccccch--hccc---cccccceeecCCCceEEEecccch-HHHHHHHHHHHHHHhccCCCCHH Confidence 55566665554321111111110 0111 11233433 456677766553322 23677777888889988899999 Q ss_pred HhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----------ch----hhccch----hhhccCHHHHHHHHH Q lcl|NC_018285. 275 VVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLS----------CD----VDADIF----PAVDPTGANYISRIN 336 (383) Q Consensus 275 ~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~----------~~----~e~~~~----~~~~~~~~~~~~~~~ 336 (383) .+|..++ .+. ++.+ +....+.-.++..+..|...|- .. ...++. ...-.+..+.+..+. T Consensus 321 ~~g~~~n-~Sg-~Al~-~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~~~~~~i~~~w~~~~~~s~~~~ad~~~ 397 (479) T protein:vir:99 321 IAGQIVN-VAA-DALA-AGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTEEATDLDFTITWQDVTIQSLAQFADAWA 397 (479) T ss_pred Hcccccc-hHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHH Confidence 9985432 222 1211 1112221222222222111110 00 001111 111245567778888 Q ss_pred HHHhCCCcCHHHHHHHhhcCCcCCcchhHH--------------hCCCCCCC-------CCCCCCCCC Q lcl|NC_018285. 337 SMVKSGTLAQNQGLYILQQAEILPKELPKG--------------ENPNRTIL-------KGGETNGQD 383 (383) Q Consensus 337 ~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~--------------~~~~~~~~-------~ggd~~~~d 383 (383) +|+++|+++.-.+.+++ +++++.++.+. +.+...+. 
.+|+++.++ T Consensus 398 kl~~ag~is~et~l~~l--~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (479) T protein:vir:99 398 KMVESLKIPAEGVWDMI--PNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQ 463 (479) T ss_pred HHHhcCCCCHHHHHHhc--CCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCC Confidence 89999999998887766 33433332211 11111111 112222222 No 156 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=98.35 E-value=1.2e-06 Score=53.14 Aligned_cols=377 Identities=11% Similarity=0.005 Sum_probs=163.3 Q ss_pred CchhhhhhcC---CcccccccccccchhhcccccC--CceechhhhhccHHHHHHHHHHHHhhhhCceeeecch------ Q lcl|NC_018285. 1 MPIFNLATES---PPNNQGGFFDITDPEFLATLNG--SEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ------ 69 (383) Q Consensus 1 Mglf~~~~~~---~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~------ 69 (383) .++.+.-... +-.....+.............. ......+ +.+.-....|+..+.-+-+-|+++.-.+ T Consensus 46 ~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~k--i~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~ 123 (502) T protein:vir:48 46 KNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKR--AVHNYGRMISKFKTGYLAGNPIRVEYDDNEDNSQ 123 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccce--eecchHHHHHHHHhhhhcccCeeEecCCccchhH Confidence 1111110000 0000000000000000000000 0000011 1122233455556655556677654322 Q ss_pred hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-c-ee---EE-EEeecCc Q lcl|NC_018285. 70 MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-N-GL---YY-NVTFDDP 143 (383) Q Consensus 70 ~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~-~~---~y-~~~~~~~ 143 (383) ....+.+....-........+..++..+|.||+.+.++.+|.+ .+..++|..+.+..++.. . .. .| ....... 
T Consensus 124 ~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~ 202 (502) T protein:vir:48 124 NDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDET-RIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQN 202 (502) T ss_pred HHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEEeecCC Confidence 1122333333345566778889999999999999999988876 467788888877765432 1 11 11 1111111 Q ss_pred ccccceeecccceEEeccC----------CC---------CccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCccee Q lcl|NC_018285. 144 RIPPKQHVPQSDILHFRLL----------SV---------DGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGI 204 (383) Q Consensus 144 ~~~~~~~~~~~dvih~~~~----------~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i 204 (383) .......+.+..++++... ++ .+...|.|.+..+...++....+.....+.+.-.+.|-.+ T Consensus 203 ~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv 282 (502) T protein:vir:48 203 AKDVVEIYTNQHIYTLDASDSFNEISVTPHAFGTVPITEFLNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILA 282 (502) T ss_pred cEEEEEEEeCCeEEEEEeCCceeeccceecCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceee Confidence 0011112333333332211 00 1123578888877777777777766667777777777777 Q ss_pred EeecCCCCH-HHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCc Q lcl|NC_018285. 205 LKIKGGGLL-DFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ 283 (383) Q Consensus 205 ~~~~~~~~~-e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~ 283 (383) ++....... +....+++.+.......+..-..+.+.++.-++....+..+....+...+.|+..=++|+...+..+.+. 
T Consensus 283 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~ 362 (502) T protein:vir:48 283 IYGDLALPQGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNA 362 (502) T ss_pred eecCcccccccchhhhhhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCc Confidence 665433222 2222222221111111111111233445544554444445666778888899999999976554322222 Q ss_pred CHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhc-chh-----hccchhhhccCHHHHHHHHHHHHhCCC Q lcl|NC_018285. 284 SSLEMSS--------------NVYSKAVARYLRPFLSELSQKLS-CDV-----DADIFPAVDPTGANYISRINSMVKSGT 343 (383) Q Consensus 284 ~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~-~~~-----e~~~~~~~~~~~~~~~~~~~~l~~~g~ 343 (383) +. .+.+ ..+...+.-.++.+...++..-. ..+ ++...+..-.+..+.+..+.++ .|+ T Consensus 363 Sg-~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~ 439 (502) T protein:vir:48 363 SG-EALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDL--GGQ 439 (502) T ss_pred hH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHH--hcc Confidence 22 2221 12222333333333222221110 011 1111222234456666666666 488 Q ss_pred cCHHHHHHHhhcCCcCCcchhHHh------CCCCCC-------CCCCCCCCCC Q lcl|NC_018285. 
344 LAQNQGLYILQQAEILPKELPKGE------NPNRTI-------LKGGETNGQD 383 (383) Q Consensus 344 ~t~nE~r~~lg~~~~~~~d~~~~~------~~~~~~-------~~ggd~~~~d 383 (383) ++...+.+.++.-.=+..|+.+++ .....+ ..|+|..+++ T Consensus 440 iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~ 492 (502) T protein:vir:48 440 VSQETALSLSGLVENPTEELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKET 492 (502) T ss_pred CcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhcccccccccccccCCCccCC Confidence 999888888743210112333221 001111 1122221111 No 157 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=98.34 E-value=1.6e-07 Score=57.87 Aligned_cols=375 Identities=12% Similarity=0.082 Sum_probs=191.9 Q ss_pred Cc-----hhhhhhcCCccccccccccc-----chh-hcccccCCc--eechhhhh----ccHHHHHHHHHHHHhhhhCce Q lcl|NC_018285. 1 MP-----IFNLATESPPNNQGGFFDIT-----DPE-FLATLNGSE--WVSAETAL----KNSDLFSIISQLSNDLATAKL 63 (383) Q Consensus 1 Mg-----lf~~~~~~~~~~~~~~~~~~-----~~~-~~~~~~~~~--~~~~~~a~----~~~~v~~~i~~ia~~ia~~p~ 63 (383) |. +-.+-+..|++.+....... ++. .+..+.++. .-....|. .++.++-.+..++++++++.+ T Consensus 1 ma~~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~rL 80 (639) T protein:vir:97 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSWRANSCSRTTL 80 (639) T ss_pred CCccceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhhhhhhhceeee Confidence 43 22222333322221111100 110 011110110 11112222 246778889999999999999 Q ss_pred eeecchhhh------h--ccC-------------CCccCCHHHHHHHHHHHHHHcCCeEEEEe-ecCCC------ceeEE Q lcl|NC_018285. 64 TTSRKQMQG------I--VDN-------------PSNSANRFNFYQSIFAQMLLGGEAFAYRW-RNDNG------RDMKW 115 (383) Q Consensus 64 ~~~~~~~~~------l--~~~-------------PN~~~t~~~f~~~~~~~~~l~G~a~~~i~-r~~~g------~~~~l 115 (383) ..-+-+.+- + -+. 
-.--+...++++.+..++-+-|++|+.++ |.+++ .+.+- T Consensus 81 ~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~~~ 160 (639) T protein:vir:97 81 IPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (639) T ss_pred EeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCcccccccc Confidence 875433110 1 011 11224557889999999999999998765 33433 24555 Q ss_pred EEec-cceeEEEEcCCCceeEEEEeecCcccccceeeccc-ce-EEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 116 EYLR-PSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQS-DI-LHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTL 192 (383) Q Consensus 116 ~~l~-~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~-dv-ih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 192 (383) |++- ..-|. ...++... +...++ ...+|... ++ +.+=.+++.......||+.+++..+.-.....+... T Consensus 161 W~vvs~~Ei~---~~~~~~~~--i~lPdG---~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~ 232 (639) T protein:vir:97 161 WYAVTREEIK---SKAGETAE--ISLPDG---KTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (639) T ss_pred eeeeeHHHhc---ccCCCeeE--eecCCC---CCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHH Confidence 5543 22222 11222221 222111 11122222 33 223356666667889999999999999888888888 Q ss_pred HHHhccCCcceeEeecCCCCH-------------------------HHHHHHHHHHHH----hhc-----CCcceeecCC Q lcl|NC_018285. 193 NSLKNALNANGILKIKGGGLL-------------------------DFKTKVSRSRQA----MKQ-----MQGGPLVLDD 238 (383) Q Consensus 193 ~~~~ng~~~~~i~~~~~~~~~-------------------------e~~~~~~~~~~~----~~~-----~~g~~~vl~~ 238 (383) +..+.-..-.|++-+|..++- .+.+.+...+.+ ... .+--|+++.. 
T Consensus 233 aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~ 312 (639) T protein:vir:97 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (639) T ss_pred HHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEee Confidence 888877777788766543221 123334333322 222 2223444422 Q ss_pred ----CceeeecccChhh-HHHHHHHHHHHHHHHHHhcCCHHHhcccccCcC--HHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 239 ----LEDFTPLEIKSNV-AQLLKQADWTTGQFAKVYGIPENVVGGQGDQQS--SLEMSSNVYSKAVARYLRPFLSELSQK 311 (383) Q Consensus 239 ----g~~~~~~~~~~~d-~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~--~~e~~~~~~~~~l~P~~~~i~~~l~~~ 311 (383) .-+++.+.+..+- ---+.+++..+..+|..+.|||+.|-+.++.+- .-+-...-+...|.|.+..|.++|++. T Consensus 313 p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~~ 392 (639) T protein:vir:97 313 AAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYND 392 (639) T ss_pred chHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEecccceeeecchhHHHHHHHHHhh Confidence 2344555554322 234789999999999999999998865444221 111122334567999999999999998 Q ss_pred hcchh----hccc---hhhhccCHH----HHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc------------------ Q lcl|NC_018285. 312 LSCDV----DADI---FPAVDPTGA----NYISRINSMVKSGTLAQNQGLYILQQAEILPKE------------------ 362 (383) Q Consensus 312 l~~~~----e~~~---~~~~~~~~~----~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d------------------ 362 (383) ++... .+|- ...++.+.. .+.+....+++.|.+|-.-.|+.+|...-.+-+ T Consensus 393 ~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dPd~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~~V~~~ 472 (639) T protein:vir:97 393 ILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKN 472 (639) T ss_pred HHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHHHHhcCC Confidence 77431 1221 111121111 122345567788999999899988865321111 Q ss_pred -------h----hHHh--CCCCCC---CCCCCCCCCC Q lcl|NC_018285. 
363 -------L----PKGE--NPNRTI---LKGGETNGQD 383 (383) Q Consensus 363 -------~----~~~~--~~~~~~---~~ggd~~~~d 383 (383) . +..+ ..+.++ .+|.++++.| T Consensus 473 P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~d 509 (639) T protein:vir:97 473 PELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDD 509 (639) T ss_pred cchhhhhhhccCccceecccCCCCCCCCCCCCCCCcc Confidence 0 0001 111111 1122221111 No 158 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=98.34 E-value=1.6e-07 Score=57.87 Aligned_cols=375 Identities=12% Similarity=0.082 Sum_probs=191.9 Q ss_pred Cc-----hhhhhhcCCccccccccccc-----chh-hcccccCCc--eechhhhh----ccHHHHHHHHHHHHhhhhCce Q lcl|NC_018285. 1 MP-----IFNLATESPPNNQGGFFDIT-----DPE-FLATLNGSE--WVSAETAL----KNSDLFSIISQLSNDLATAKL 63 (383) Q Consensus 1 Mg-----lf~~~~~~~~~~~~~~~~~~-----~~~-~~~~~~~~~--~~~~~~a~----~~~~v~~~i~~ia~~ia~~p~ 63 (383) |. +-.+-+..|++.+....... ++. .+..+.++. .-....|. .++.++-.+..++++++++.+ T Consensus 1 ma~~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~rL 80 (639) T protein:vir:10 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSWRANSCSRTTL 80 (639) T ss_pred CCccceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhhhhhhhceeee Confidence 43 22222333322221111100 110 011110110 11112222 246778889999999999999 Q ss_pred eeecchhhh------h--ccC-------------CCccCCHHHHHHHHHHHHHHcCCeEEEEe-ecCCC------ceeEE Q lcl|NC_018285. 64 TTSRKQMQG------I--VDN-------------PSNSANRFNFYQSIFAQMLLGGEAFAYRW-RNDNG------RDMKW 115 (383) Q Consensus 64 ~~~~~~~~~------l--~~~-------------PN~~~t~~~f~~~~~~~~~l~G~a~~~i~-r~~~g------~~~~l 115 (383) ..-+-+.+- + -+. 
-.--+...++++.+..++-+-|++|+.++ |.+++ .+.+- T Consensus 81 ~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~~~ 160 (639) T protein:vir:10 81 IPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRAR 160 (639) T ss_pred EeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCcccccccc Confidence 875433110 1 011 11224557889999999999999998765 33433 24555 Q ss_pred EEec-cceeEEEEcCCCceeEEEEeecCcccccceeeccc-ce-EEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 116 EYLR-PSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQS-DI-LHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTL 192 (383) Q Consensus 116 ~~l~-~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~-dv-ih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 192 (383) |++- ..-|. ...++... +...++ ...+|... ++ +.+=.+++.......||+.+++..+.-.....+... T Consensus 161 W~vvs~~Ei~---~~~~~~~~--i~lPdG---~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~ 232 (639) T protein:vir:10 161 WYAVTREEIK---SKAGETAE--ISLPDG---KTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (639) T ss_pred eeeeeHHHhc---ccCCCeeE--eecCCC---CCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHH Confidence 5543 22222 11222221 222111 11122222 33 223356666667889999999999999888888888 Q ss_pred HHHhccCCcceeEeecCCCCH-------------------------HHHHHHHHHHHH----hhc-----CCcceeecCC Q lcl|NC_018285. 193 NSLKNALNANGILKIKGGGLL-------------------------DFKTKVSRSRQA----MKQ-----MQGGPLVLDD 238 (383) Q Consensus 193 ~~~~ng~~~~~i~~~~~~~~~-------------------------e~~~~~~~~~~~----~~~-----~~g~~~vl~~ 238 (383) +..+.-..-.|++-+|..++- .+.+.+...+.+ ... .+--|+++.. 
T Consensus 233 aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~ 312 (639) T protein:vir:10 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (639) T ss_pred HHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEee Confidence 888877777788766543221 123334333322 222 2223444422 Q ss_pred ----CceeeecccChhh-HHHHHHHHHHHHHHHHHhcCCHHHhcccccCcC--HHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 239 ----LEDFTPLEIKSNV-AQLLKQADWTTGQFAKVYGIPENVVGGQGDQQS--SLEMSSNVYSKAVARYLRPFLSELSQK 311 (383) Q Consensus 239 ----g~~~~~~~~~~~d-~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~--~~e~~~~~~~~~l~P~~~~i~~~l~~~ 311 (383) .-+++.+.+..+- ---+.+++..+..+|..+.|||+.|-+.++.+- .-+-...-+...|.|.+..|.++|++. T Consensus 313 p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~~ 392 (639) T protein:vir:10 313 AAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYND 392 (639) T ss_pred chHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccceEEEEecccceeeecchhHHHHHHHHHhh Confidence 2344555554322 234789999999999999999998865444221 111122334567999999999999998 Q ss_pred hcchh----hccc---hhhhccCHH----HHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc------------------ Q lcl|NC_018285. 312 LSCDV----DADI---FPAVDPTGA----NYISRINSMVKSGTLAQNQGLYILQQAEILPKE------------------ 362 (383) Q Consensus 312 l~~~~----e~~~---~~~~~~~~~----~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d------------------ 362 (383) ++... .+|- ...++.+.. .+.+....+++.|.+|-.-.|+.+|...-.+-+ T Consensus 393 ~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dPd~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~~V~~~ 472 (639) T protein:vir:10 393 ILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKN 472 (639) T ss_pred HHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHHHHhcCC Confidence 77431 1221 111121111 122345567788999999899988865321111 Q ss_pred -------h----hHHh--CCCCCC---CCCCCCCCCC Q lcl|NC_018285. 
363 -------L----PKGE--NPNRTI---LKGGETNGQD 383 (383) Q Consensus 363 -------~----~~~~--~~~~~~---~~ggd~~~~d 383 (383) . +..+ ..+.++ .+|.++++.| T Consensus 473 P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~d 509 (639) T protein:vir:10 473 PELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDD 509 (639) T ss_pred cchhhhhhhccCccceecccCCCCCCCCCCCCCCCcc Confidence 0 0001 111111 1122221111 No 159 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=98.30 E-value=1.5e-06 Score=52.43 Aligned_cols=337 Identities=10% Similarity=-0.025 Sum_probs=151.2 Q ss_pred CchhhhhhcCCcccc------cccccccchhhcccccCCceechhhh--hc--cHHHHHHHHHHHHhhhhCceeeecchh Q lcl|NC_018285. 1 MPIFNLATESPPNNQ------GGFFDITDPEFLATLNGSEWVSAETA--LK--NSDLFSIISQLSNDLATAKLTTSRKQM 70 (383) Q Consensus 1 Mglf~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~a--~~--~~~v~~~i~~ia~~ia~~p~~~~~~~~ 70 (383) ..+++.+.++..... ..+.....+ .... +..+..+-. ++ ..-..-+|+-+|..+.=-.|..-+... T Consensus 3 ~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~--~~~~--~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~~d~~l 78 (409) T protein:vir:16 3 EKGIGYLRFKLSVHKRRAEMRYEQYAMKHV--DRFK--GITIPQALSQQYRSILGWCAKGVDSLADRLVFREFENDDFTV 78 (409) T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHhccCc--hhhc--chhhhHHHHHHHhhhcChhHHHHHHhHhhcccccccCcchHH Confidence 233344333211111 011100000 0000 001111110 01 111222444444433222222222222 Q ss_pred hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeE--EEEeecCccc--- Q lcl|NC_018285. 71 QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLY--YNVTFDDPRI--- 145 (383) Q Consensus 71 ~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~--y~~~~~~~~~--- 145 (383) +.++.+ -.......++..+.+++|.||+.+..+.+|.| .+.+++|.++....+.....+. 
|.+...+..+ T Consensus 79 ~~i~~~----N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~-~i~~~sP~~~~~i~D~~~~~~~~a~~~~~~d~~~~~~ 153 (409) T protein:vir:16 79 NEIFEE----NNPDIFFDSTVLSALIASCSFTYISKGENDAV-RLQVIEATNATGIIDPITGLLTEGYAVLERDENNNVV 153 (409) T ss_pred HHHHHh----cChhHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEeecccccceeeeEEEEecCCCceE Confidence 233322 23334455777888999999999999888876 6778888888777654332211 1111111000 Q ss_pred -------------------cccee--ecccceEEeccCCCCccccCcch----HHHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_018285. 146 -------------------PPKQH--VPQSDILHFRLLSVDGGLTSVSP----LMALGRELDIQKASDKLTLNSLKNALN 200 (383) Q Consensus 146 -------------------~~~~~--~~~~dvih~~~~~~~~~~~G~s~----~~~~~~~i~~~~~~~~~~~~~~~ng~~ 200 (383) ....+ +..--|++|.+..-.+..+|.|- +..+.+.+.....-......++.+ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~--- 230 (409) T protein:vir:16 154 LEAHFLPDRTDYYYRDSRNNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSF--- 230 (409) T ss_pred EEEEEecCcEEEEEecCccccceecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcC--- Confidence 00011 11112455543322234567773 455555555555555555556544 Q ss_pred cceeEee-cCCCCHHHHHHHHHHHHHhhcCCcceeecC-----CCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_018285. 201 ANGILKI-KGGGLLDFKTKVSRSRQAMKQMQGGPLVLD-----DLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPEN 274 (383) Q Consensus 201 ~~~i~~~-~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~-----~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~ 274 (383) |.-.+.. +..... .+ .|. 
...++++.++ .+.++.+++.+.-+ .|++..+....++|..=++|++ T Consensus 231 pqr~i~G~d~d~~~--~~----~~~---~~~~~i~~~~~d~~g~~~~v~q~~~~~l~-~~~~~l~~~~~~~a~~s~lP~~ 300 (409) T protein:vir:16 231 PQKYVTGLSDDAEP--ME----TWK---ATVSSMLQFTKDEDGDKPTLGQFTQPSMS-PFTEQLRTAAAGFAGETGLTLD 300 (409) T ss_pred hhheeEecCCCCCc--cc----hhh---hhhhHhhccCCCCCCCCceEEecCCCChh-HHHHHHHHHHHHHhhhcCCCHH Confidence 5444432 211111 11 121 1124455443 23566555444333 4899999999999999999999 Q ss_pred HhcccccCcCHHHHHHH---HHHHHHHHHHHHHHHHHHHh----hc--chh----------hccchhhhccC---HHHHH Q lcl|NC_018285. 275 VVGGQGDQQSSLEMSSN---VYSKAVARYLRPFLSELSQK----LS--CDV----------DADIFPAVDPT---GANYI 332 (383) Q Consensus 275 ~lg~~~~~~~~~e~~~~---~~~~~l~P~~~~i~~~l~~~----l~--~~~----------e~~~~~~~~~~---~~~~~ 332 (383) .+|..+++-.+++..++ -+.....-.-+.+.+.+.+. +. ..+ +....+....+ ....+ T Consensus 301 ~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~a~~a 380 (409) T protein:vir:16 301 DLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQFSKTKPKWEPLFEADASMLSLIG 380 (409) T ss_pred HcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhccceEEecCCCCcchhhHHHHH Confidence 99987665333333321 11111111122222222211 11 000 11111111222 34556 Q ss_pred HHHHHHHhCC--CcCHHHHHHHhhcCCcC Q lcl|NC_018285. 333 SRINSMVKSG--TLAQNQGLYILQQAEIL 359 (383) Q Consensus 333 ~~~~~l~~~g--~~t~nE~r~~lg~~~~~ 359 (383) +.+.+|+.+| ++...-+++++|+..-+ T Consensus 381 Da~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 381 DGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred HHHHHHHhhcccccchhHHHHhccCCCCC Confidence 6677788775 44566678888766533 No 160 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.29 E-value=5.1e-07 Score=55.08 Aligned_cols=367 Identities=11% Similarity=0.017 Sum_probs=161.7 Q ss_pred CchhhhhhcCC----c--ccccccccccch-hhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchh--- Q lcl|NC_018285. 
1 MPIFNLATESP----P--NNQGGFFDITDP-EFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQM--- 70 (383) Q Consensus 1 Mglf~~~~~~~----~--~~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~--- 70 (383) ..+.+++.... + .....+...... ...+.......-....-..+.-...+|+..+.-+-.-|+.+..... T Consensus 7 ~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~~~d~~~ 86 (456) T protein:vir:10 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDL 86 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCCCCCcch Confidence 22222221110 0 000001000000 0000000000000000011223345677777766677887643221 Q ss_pred ----hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCce-----eEEEEeec Q lcl|NC_018285. 71 ----QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNG-----LYYNVTFD 141 (383) Q Consensus 71 ----~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~-----~~y~~~~~ 141 (383) ..++.+ | ....+...+..+++.+|.||+.+.++.+|.+. +..++|..+.+..+..... +.|....+ T Consensus 87 ~~~~~~i~~~-N---~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~-i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~d 161 (456) T protein:vir:10 87 ALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWRIRAAMRWWRDLD 161 (456) T ss_pred HHHHHHHHHh-c---ChhhHHHHHHHHHhhcCeeEEEEeeCCCCceE-EEEEccceeEEEEcCCCCcceEEEEEEEEecC Confidence 122222 2 33445567788999999999999998888763 6678888887766543321 11111000 Q ss_pred Ccc------------------------cccceeecc-------------cceEEeccCCCCccccCcchHHHHHHHHHHH Q lcl|NC_018285. 142 DPR------------------------IPPKQHVPQ-------------SDILHFRLLSVDGGLTSVSPLMALGRELDIQ 184 (383) Q Consensus 142 ~~~------------------------~~~~~~~~~-------------~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~ 184 (383) +.. ......... ..+......+ ..|.|.++.....++.. 
T Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N----~~g~gd~e~vi~liDa~ 237 (456) T protein:vir:10 162 AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN----PDGMGEVEPHIDIINRI 237 (456) T ss_pred CceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecC----CCCCchhhhhHHHHHHH Confidence 000 000000000 0000011111 24677777666655554 Q ss_pred HHHHHHHHHHHhccCCcceeEeecCC---CCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHH Q lcl|NC_018285. 185 KASDKLTLNSLKNALNANGILKIKGG---GLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWT 261 (383) Q Consensus 185 ~~~~~~~~~~~~ng~~~~~i~~~~~~---~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~ 261 (383) ..+.--........+.|..++..... ..++.-..+.. ........+.++.++++.++..++...-+ .+.+..+.. T Consensus 238 ~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~q~~~~~~~-~~~~~l~~~ 315 (456) T protein:vir:10 238 NRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDY-ASIFEAAPGALWELPPGVDIWESQANDFT-PMLSAIKEH 315 (456) T ss_pred HHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccch-hhhhhhhccccccCCCCcceEEecccChh-HHHHHHHHH Confidence 44433333333333334444432110 01111111110 01112233556677889988776544322 478888999 Q ss_pred HHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----------chhh-----ccchhhhcc Q lcl|NC_018285. 262 TGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLS----------CDVD-----ADIFPAVDP 326 (383) Q Consensus 262 ~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~----------~~~e-----~~~~~~~~~ 326 (383) +.+|++.=++|++.+|+.+.+.+. ++.+ +....+.-.+...+..|...|- ...+ ....+..-. 
T Consensus 316 i~~~~~~s~~p~~~~~~~~~N~Sg-~Ai~-~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~~~~~~v~w~~~~~~ 393 (456) T protein:vir:10 316 IRQLSSATKTPLPMLMPDSANQSA-EGAH-NIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRV 393 (456) T ss_pred HHHHHhccCCChHHhcccccChHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCc Confidence 999999999999999975544433 2222 1111222222222222222111 1111 111111224 Q ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchh------HHhC---CCCCCCCCCCCCCCC Q lcl|NC_018285. 327 TGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELP------KGEN---PNRTILKGGETNGQD 383 (383) Q Consensus 327 ~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~------~~~~---~~~~~~~ggd~~~~d 383 (383) +..+.+..+.++...|+.+..-+++++|..+ .++. ..+. +...+++.++.|+.- T Consensus 394 ~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~---~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 394 TLGEKYSAASLAKAAGESWASIRRNILNYNA---DQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred CHHHHHHHHHHHHHcCCChHHHHHhhCCCCH---HHHHHHHHHHHHHHHHHHhhhhhhcCCCCCCC Confidence 4566777778888899999999988877543 2221 1111 111222223322222 No 161 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.29 E-value=5.1e-07 Score=55.08 Aligned_cols=367 Identities=11% Similarity=0.017 Sum_probs=161.7 Q ss_pred CchhhhhhcCC----c--ccccccccccch-hhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchh--- Q lcl|NC_018285. 1 MPIFNLATESP----P--NNQGGFFDITDP-EFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQM--- 70 (383) Q Consensus 1 Mglf~~~~~~~----~--~~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~--- 70 (383) ..+.+++.... + .....+...... ...+.......-....-..+.-...+|+..+.-+-.-|+.+..... 
T Consensus 7 ~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~~~d~~~ 86 (456) T protein:vir:10 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDL 86 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCCCCCcch Confidence 22222221110 0 000001000000 0000000000000000011223345677777766677887643221 Q ss_pred ----hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCce-----eEEEEeec Q lcl|NC_018285. 71 ----QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNG-----LYYNVTFD 141 (383) Q Consensus 71 ----~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~-----~~y~~~~~ 141 (383) ..++.+ | ....+...+..+++.+|.||+.+.++.+|.+. +..++|..+.+..+..... +.|....+ T Consensus 87 ~~~~~~i~~~-N---~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~-i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~d 161 (456) T protein:vir:10 87 ALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWRIRAAMRWWRDLD 161 (456) T ss_pred HHHHHHHHHh-c---ChhhHHHHHHHHHhhcCeeEEEEeeCCCCceE-EEEEccceeEEEEcCCCCcceEEEEEEEEecC Confidence 122222 2 33445567788999999999999998888763 6678888887766543321 11111000 Q ss_pred Ccc------------------------cccceeecc-------------cceEEeccCCCCccccCcchHHHHHHHHHHH Q lcl|NC_018285. 142 DPR------------------------IPPKQHVPQ-------------SDILHFRLLSVDGGLTSVSPLMALGRELDIQ 184 (383) Q Consensus 142 ~~~------------------------~~~~~~~~~-------------~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~ 184 (383) +.. ......... ..+......+ ..|.|.++.....++.. T Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N----~~g~gd~e~vi~liDa~ 237 (456) T protein:vir:10 162 AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN----PDGMGEVEPHIDIINRI 237 (456) T ss_pred CceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecC----CCCCchhhhhHHHHHHH Confidence 000 000000000 0000011111 24677777666655554 Q ss_pred HHHHHHHHHHHhccCCcceeEeecCC---CCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHH Q lcl|NC_018285. 
185 KASDKLTLNSLKNALNANGILKIKGG---GLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWT 261 (383) Q Consensus 185 ~~~~~~~~~~~~ng~~~~~i~~~~~~---~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~ 261 (383) ..+.--........+.|..++..... ..++.-..+.. ........+.++.++++.++..++...-+ .+.+..+.. T Consensus 238 ~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~q~~~~~~~-~~~~~l~~~ 315 (456) T protein:vir:10 238 NRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDY-ASIFEAAPGALWELPPGVDIWESQANDFT-PMLSAIKEH 315 (456) T ss_pred HHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccch-hhhhhhhccccccCCCCcceEEecccChh-HHHHHHHHH Confidence 44433333333333334444432110 01111111110 01112233556677889988776544322 478888999 Q ss_pred HHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----------chhh-----ccchhhhcc Q lcl|NC_018285. 262 TGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLS----------CDVD-----ADIFPAVDP 326 (383) Q Consensus 262 ~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~----------~~~e-----~~~~~~~~~ 326 (383) +.+|++.=++|++.+|+.+.+.+. ++.+ +....+.-.+...+..|...|- ...+ ....+..-. T Consensus 316 i~~~~~~s~~p~~~~~~~~~N~Sg-~Ai~-~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~~~~~~v~w~~~~~~ 393 (456) T protein:vir:10 316 IRQLSSATKTPLPMLMPDSANQSA-EGAH-NIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRV 393 (456) T ss_pred HHHHHhccCCChHHhcccccChHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCc Confidence 999999999999999975544433 2222 1111222222222222222111 1111 111111224 Q ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchh------HHhC---CCCCCCCCCCCCCCC Q lcl|NC_018285. 327 TGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELP------KGEN---PNRTILKGGETNGQD 383 (383) Q Consensus 327 ~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~------~~~~---~~~~~~~ggd~~~~d 383 (383) +..+.+..+.++...|+.+..-+++++|..+ .++. ..+. 
+...+++.++.|+.- T Consensus 394 ~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~---~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 394 TLGEKYSAASLAKAAGESWASIRRNILNYNA---DQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred CHHHHHHHHHHHHHcCCChHHHHHhhCCCCH---HHHHHHHHHHHHHHHHHHhhhhhhcCCCCCCC Confidence 4566777778888899999999988877543 2221 1111 111222223322222 No 162 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.29 E-value=1.7e-06 Score=52.25 Aligned_cols=350 Identities=11% Similarity=0.024 Sum_probs=154.7 Q ss_pred CchhhhhhcCCcccc------cccccccchhhcccccCCceechhh-hhcc---HHHHHHHHHHHHhhhhCceeeecchh Q lcl|NC_018285. 1 MPIFNLATESPPNNQ------GGFFDITDPEFLATLNGSEWVSAET-ALKN---SDLFSIISQLSNDLATAKLTTSRKQM 70 (383) Q Consensus 1 Mglf~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~-a~~~---~~v~~~i~~ia~~ia~~p~~~~~~~~ 70 (383) |++ +.+.+...... ..+.....+ .... +..+..+- .+.. .-..-+|+-+|..+-=-.|.+-+... T Consensus 4 ~~i-~~L~~~~~~~~~r~~~~~~yy~g~~~--~~~~--~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~d~~l 78 (422) T protein:vir:97 4 MGM-GYLRRKLALFKTGVDKRYRYYAMDDR--DDTR--SIVMPNNVREMYRSVLEWTAKGVDSLADRIIFREFTNDDFNA 78 (422) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHhcCCC--hhhc--CccccHHHHHHHHhhcchhHHHHHHHHhccccceeeCCchhH Confidence 333 33332211111 111110000 0000 01111110 1111 11122344444422212233322223 Q ss_pred hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecC-CCceeEEEEeccceeEEEEcCCCceeEE--EEe-ecCcccc Q lcl|NC_018285. 71 QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRND-NGRDMKWEYLRPSQVSFNRLDNQNGLYY--NVT-FDDPRIP 146 (383) Q Consensus 71 ~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~-~g~~~~l~~l~~~~v~~~~~~~~~~~~y--~~~-~~~~~~~ 146 (383) +.++.+ |. ......++..+.+++|.||+.+.++. +|.| .+.+++|.++....+.....+.. .+. .+..... 
T Consensus 79 ~~~w~~-N~---ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p-~i~~~sp~~~~~i~D~~~~~~~~a~~~~~~~~~~~~ 153 (422) T protein:vir:97 79 WEIFKA-NN---PDIFFDTAIQSALIASCCFVYIMPGAEDGLP-KMQVIEASKATGILDPTTFLLTEGYAILESDSNGNP 153 (422) T ss_pred HHHHHh-cC---hHHHHHHHHHHHHHhcceeEEEeeCCCCCee-EEEEechhhEEEEEeCCCCcceeeEEEEEecCCCcE Confidence 333332 22 23344467788899999999998874 5665 58888999988877654332211 111 1000000 Q ss_pred c-ceeecc---------------------cceEEeccCCCCccccCcchH----HHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_018285. 147 P-KQHVPQ---------------------SDILHFRLLSVDGGLTSVSPL----MALGRELDIQKASDKLTLNSLKNALN 200 (383) Q Consensus 147 ~-~~~~~~---------------------~dvih~~~~~~~~~~~G~s~~----~~~~~~i~~~~~~~~~~~~~~~ng~~ 200 (383) . ...++. --|++|.+....+..+|.|.+ ..+.+.+.....-......++.. T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~--- 230 (422) T protein:vir:97 154 TLEAYFTDKDIWYYPKKGKPYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSF--- 230 (422) T ss_pred EEEEEEcCceEEEEcCCCccccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcc--- Confidence 0 001111 124555443333445677754 34444444433333334444433 Q ss_pred cceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCC-----CceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHH Q lcl|NC_018285. 201 ANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDD-----LEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENV 275 (383) Q Consensus 201 ~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~-----g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~ 275 (383) |.-.+..-. .+..... .|. ...++++.++. +.++.+++.+.-+ .|++..+....+++..=++|++. 
T Consensus 231 pqr~i~G~d-~d~~~~~----~~~---~~~~~i~~~~~de~~~~~~v~q~~~~~l~-~~~~~l~~~~~~~a~~s~lP~~~ 301 (422) T protein:vir:97 231 PQKYVLGMD-PDAKPME----KWR---ATVSTLLEISKDEDGDKPTVGQFTTASMA-PFMEHLKMYASLFAGGSGLTLDD 301 (422) T ss_pred hhhhhcccC-cccccCc----hhh---hhhhhhhccCCCCCCCcceeeecCCCChh-HHHHHHHHHHHHHhcccCCCHHH Confidence 444443211 0111111 111 12234555532 3466555444433 48899999999999999999999 Q ss_pred hcccccCcCHHHHHHH---HHHHHHHHHHHHHHHHHHHhh------cch----------hhccchhhhccCHH---HHHH Q lcl|NC_018285. 276 VGGQGDQQSSLEMSSN---VYSKAVARYLRPFLSELSQKL------SCD----------VDADIFPAVDPTGA---NYIS 333 (383) Q Consensus 276 lg~~~~~~~~~e~~~~---~~~~~l~P~~~~i~~~l~~~l------~~~----------~e~~~~~~~~~~~~---~~~~ 333 (383) +|..+++..+.+..+. -+...+.-..+.+.+.+.+.+ ... +++...+.+..+.. ..++ T Consensus 302 lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aD 381 (422) T protein:vir:97 302 LGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADANMLTLVGD 381 (422) T ss_pred hccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhccceEEEccCCCCChHHHHHHHH Confidence 9987665333333321 111112222222222222211 010 11111122234433 3445 Q ss_pred HHHHHHhC--CCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCC Q lcl|NC_018285. 334 RINSMVKS--GTLAQNQGLYILQQAEILPKELPKGENPNRTILKG 376 (383) Q Consensus 334 ~~~~l~~~--g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~g 376 (383) .+.+++++ |+++..-+++++|..+ ...++.+.+.. 
...| T Consensus 382 a~~Kl~~a~~~~~~~~~~~~~lg~~~-~~~~~~~~~~~---~~d~ 422 (422) T protein:vir:97 382 GAIKLNQAIPGFMDADVIRDLTGVKG-ADKPIPAITEV---TTDG 422 (422) T ss_pred HHHHHHhhccccccHHHHHHHcCCCc-hhHHHHHHHhh---hccC Confidence 56677777 7889999999998754 23444444432 3444 No 163 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=98.28 E-value=8.6e-07 Score=53.84 Aligned_cols=375 Identities=11% Similarity=0.062 Sum_probs=191.6 Q ss_pred CchhhhhhcCCccccccccccc------chhhccc-ccCCce---echhhhh----ccHHHHHHHHHHHHhhhhCceeee Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDIT------DPEFLAT-LNGSEW---VSAETAL----KNSDLFSIISQLSNDLATAKLTTS 66 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~------~~~~~~~-~~~~~~---~~~~~a~----~~~~v~~~i~~ia~~ia~~p~~~~ 66 (383) |.--+--..+|+........+. +|..... ...+.. -....|. .++.++-.+..++++++++.+..- T Consensus 1 ma~~~lrv~rrpk~~p~~r~l~aasqp~~P~~~~~~~~~g~~~~~~WQ~eAW~~~d~VgElryyvgW~~ss~Sr~rL~as 80 (629) T protein:vir:10 1 MAASTLRVSRRPKGSPARRSLTAASQPMEPGRTPSRQVAGTVVRTSWQNEAWECMDLVGELRYYVGWRASSCSRVELIAS 80 (629) T ss_pred CCccceeEEecCCCccceeeeccccCCCCcchhhchhhhhhhhhhhhhHHHHHHHHhhhhHHHHhhhhhhhheeeeEEEe Confidence 4332211112222111111111 1111110 001110 1112222 246677778889999999998775 Q ss_pred cchhhh--------------------hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCC----ceeEEEEe-ccc Q lcl|NC_018285. 67 RKQMQG--------------------IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNG----RDMKWEYL-RPS 121 (383) Q Consensus 67 ~~~~~~--------------------l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g----~~~~l~~l-~~~ 121 (383) .-+.+. 
...--.--+...++++.+..++-+-|+.|+.++-...+ .+..-|++ ..+ T Consensus 81 ~idpDtg~ptg~i~ed~p~~~~v~~~v~~iagG~lGqaqLlkr~~~~ltV~GE~~i~il~~~~~~pd~~~r~~W~vVt~~ 160 (629) T protein:vir:10 81 ELDPDTGKPTGGIRDDDPDGLRFLEIVKTMAGGPLGQAQLQKRAAECLTVPGEHRICLLDQGDKNPDGSVRHNWYVVTND 160 (629) T ss_pred eecCCCCCCccccccCchhHHHHHHHHHHhcCccchHHHHHHHHHhheeccCceEEEEeecCCCCCCcccccceeeecHH Confidence 432110 11112223556788999999999999999998754443 33433333 223 Q ss_pred eeEEEEcCCCceeEEEEeecCcccccceee-cccceE-EeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_018285. 122 QVSFNRLDNQNGLYYNVTFDDPRIPPKQHV-PQSDIL-HFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNAL 199 (383) Q Consensus 122 ~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~-~~~dvi-h~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~ 199 (383) -|. ....+.. .+...++ ...+| ...|++ .+=++.+.......||+.+++..+.-.....+...+..+.-. T Consensus 161 Ei~---~kg~g~~--~i~lpdg---~~he~~~~~D~l~RvW~P~Prr~~e~DSpvra~l~~lrEi~r~tk~i~~aakSRL 232 (629) T protein:vir:10 161 EVK---NKGAGKT--DIELPDG---TIHEYSKGRDVMFRVWNPRPRRAKEPDSPVRACLDSLREIIRTTKKIRNASKSRL 232 (629) T ss_pred Hhc---cccCcee--EEEcCCC---ceeeeeCCCeeEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHhHH Confidence 222 1111111 2333322 12223 233433 232566666677899999999999988888888888877777 Q ss_pred CcceeEeecCCCC----------------------HHHHHHHHHHHHH----hhcC-----CcceeecCC----Cceeee Q lcl|NC_018285. 200 NANGILKIKGGGL----------------------LDFKTKVSRSRQA----MKQM-----QGGPLVLDD----LEDFTP 244 (383) Q Consensus 200 ~~~~i~~~~~~~~----------------------~e~~~~~~~~~~~----~~~~-----~g~~~vl~~----g~~~~~ 244 (383) .-.|++-++..++ ....+.+...+.+ .... +--|+++.. --+++. 
T Consensus 233 ~gnGvlflP~e~slp~~~ap~~~~~Pg~~~p~~~g~aa~d~l~~~l~q~a~aAi~De~S~aA~vPiia~vP~E~l~~ikh 312 (629) T protein:vir:10 233 IGNGVVFLPQELSLPRATAPVADNQPGAPVPIVDGVAAADELSNLLFQTAAAAVDDEDSQAALIPLLATVPGEHLQKIFH 312 (629) T ss_pred hhCceeEeccCcccccccCCCCCCCCcccccccCCCcchHHHHHHHHHHHHhhhcCCCCccceeeeEEeechHHhcCeee Confidence 7777765543211 0123334433322 2222 222444321 134455 Q ss_pred cccChhhH-HHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHH---HHHHHHHHHHHHHHHHHHHHHHhhcchh---- Q lcl|NC_018285. 245 LEIKSNVA-QLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEM---SSNVYSKAVARYLRPFLSELSQKLSCDV---- 316 (383) Q Consensus 245 ~~~~~~d~-~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~---~~~~~~~~l~P~~~~i~~~l~~~l~~~~---- 316 (383) +.+..+-. --+.+++..+..+|..+.|||+.|-+.++++|--.. ...-+...|.|.+..|.++|++.++... T Consensus 313 Lkf~~eite~~iktR~daI~RlAmglDispErLLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~Ait~~~Lrp~L~~e 392 (629) T protein:vir:10 313 LKIGNEITEVEIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVQLHIKPVMEVLCAAIYREVLVATLRAE 392 (629) T ss_pred eeecCchhHHHHhhHHHHHHHHHhccCCChhheeeccCCccceeeEEecccceeeecchHHHHHHHHHHhHHHHHHHHHh Confidence 55533222 247899999999999999999988655433331111 1233456789999999999998877431 Q ss_pred hccc---hhhhccCHH----HHHHHHHHHHhCCCcCHHHHHHHhhcCCcCC-----cc-hh------------------- Q lcl|NC_018285. 317 DADI---FPAVDPTGA----NYISRINSMVKSGTLAQNQGLYILQQAEILP-----KE-LP------------------- 364 (383) Q Consensus 317 e~~~---~~~~~~~~~----~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~-----~d-~~------------------- 364 (383) .+|- ...++.+.. .+.+....+...|.+|-...|+.+|...-.+ .+ .. T Consensus 393 GiDp~~Yvvw~DaS~Lt~dPd~~deA~~a~drGaIt~eAlRr~lG~~~dd~y~~~t~~~~q~~A~~~v~~~P~Li~~~ap 472 (629) T protein:vir:10 393 GIDPDRYVLWYDASGLTVDPDKTDEATAAKEQGAITHEAYRRYLGLADEDGYDLETLEGAQAWARDAIVADPSLIKVLAP 472 (629) T ss_pred CCCHHHhEeeecCcccccCCCCcHHHHHHHHcCCccHHHHHHHhccccccCCCcCCcHHHHHHHHHHhcCCCchhhhhhh Confidence 1111 111121111 1224455677899999999999998654221 11 11 Q ss_pred H----Hh--CCCCC---CCCCCCCCCCC Q lcl|NC_018285. 
365 K----GE--NPNRT---ILKGGETNGQD 383 (383) Q Consensus 365 ~----~~--~~~~~---~~~ggd~~~~d 383 (383) . +. ..+.+ ..+|++.++.+ T Consensus 473 ll~~~l~~i~~P~p~~a~~~~~~~~~~~ 500 (629) T protein:vir:10 473 LLTDELAEIDWPEPPAALPPGEDDQADE 500 (629) T ss_pred hcCCccccccccCCCCcCCCCCcccCcc Confidence 0 00 01111 11233221111 No 164 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=98.24 E-value=1.4e-07 Score=58.21 Aligned_cols=174 Identities=10% Similarity=0.070 Sum_probs=86.8 Q ss_pred eEeecCC---CC--HHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc Q lcl|NC_018285. 204 ILKIKGG---GL--LDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG 278 (383) Q Consensus 204 i~~~~~~---~~--~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~ 278 (383) ++++++- +. .++..+..+.+....++.+.+.+...+-+|+.++.+.. .+.+........||++-|||...|-+ T Consensus 1 V~k~~~l~~~~~~~~~~~~~r~~~~~~~~~~~~~~~ld~~~e~~e~~~~~ls--Gl~d~l~~~~~~iaa~s~iP~t~LfG 78 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAARLRLAQVDNNSGVGQAIGIDADSEEYNVLNSDIG--GIDTFLSQKFDRIVALSGIHEIILKG 78 (201) T ss_pred CccchHHHHHhcCChHHHHHHHHHHHHhhhhhhhheeecCCcceeeeecCcC--ChHHHHHHHHHHHHhHhcCchhhhcC Confidence 5554431 11 11111111222233333344555556678888877764 35567778888999999999988865 Q ss_pred ccc-Cc--CHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcchhhccchhhhccCHHHH-------HHHHHHHHhC Q lcl|NC_018285. 279 QGD-QQ--SSLEMSSNVYS-------KAVARYLRPFLSELSQKLSCDVDADIFPAVDPTGANY-------ISRINSMVKS 341 (383) Q Consensus 279 ~~~-~~--~~~e~~~~~~~-------~~l~P~~~~i~~~l~~~l~~~~e~~~~~~~~~~~~~~-------~~~~~~l~~~ 341 (383) .+. +- +.+.-.+.||. ..+.|.++.+-+-+.. -+++.+...++...+..+. 
++.+..++.+ T Consensus 79 ~sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~~~--~~~~~~~f~pL~~~s~kekAei~~~~a~a~~~~~~~ 156 (201) T protein:vir:10 79 KNVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLPFIVT--EQEWSVEFNPLSQVSDKDKSEILEKNVNSVAALIAA 156 (201) T ss_pred CCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC--CCCceEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHc Confidence 433 22 22223344443 4456665555432211 1222333334444444433 3456678899 Q ss_pred CCcCHHHHHHHhhcCCc---CC-cchhHHhCCCCCCCC-CCCCCCCC Q lcl|NC_018285. 342 GTLAQNQGLYILQQAEI---LP-KELPKGENPNRTILK-GGETNGQD 383 (383) Q Consensus 342 g~~t~nE~r~~lg~~~~---~~-~d~~~~~~~~~~~~~-ggd~~~~d 383 (383) |+++++|+|+.|...+. .+ +.++... .....+ +-+++++| T Consensus 157 g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~--~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 157 GIIDADEARDTLRAISTEVKIGEGSIQTEV--VINESEDPLDVSANN 201 (201) T ss_pred CCCCHHHHHHHHHhcCCcCCCCCCCCCccc--cccccCCCCCCCCCC Confidence 99999999999865442 11 1111111 101111 11222222 No 165 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.23 E-value=2.3e-06 Score=51.45 Aligned_cols=367 Identities=9% Similarity=-0.020 Sum_probs=157.8 Q ss_pred hhhhhhcCCcccc---cccccccchhhc--ccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeee--cchhh---h Q lcl|NC_018285. 3 IFNLATESPPNNQ---GGFFDITDPEFL--ATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTS--RKQMQ---G 72 (383) Q Consensus 3 lf~~~~~~~~~~~---~~~~~~~~~~~~--~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~--~~~~~---~ 72 (383) |+..+...+...- ..+.......+. .........+.+ +.+.-....|+..+.-+-+-|+++. +.... . 
T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~k--i~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~~~ 78 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYR--VRHKWGGYISSFATGYVIGNPVSIGVMEGGSADQLS 78 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCcccccccccccccCCcce--eecchHHHHHHhhhhheeccCceEeeCCCccHHHHH Confidence 3333332211000 000000000000 000000000011 1122233344444444444455542 21111 1 Q ss_pred hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-eeEEEEee-cCccccccee Q lcl|NC_018285. 73 IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-GLYYNVTF-DDPRIPPKQH 150 (383) Q Consensus 73 l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-~~~y~~~~-~~~~~~~~~~ 150 (383) .+.+-............+..+.+.+|.+|+.+.++.+|++ .+..++|..+.+..++... .+.+.+.+ .......... T Consensus 79 ~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~~~~~~v 157 (440) T protein:vir:95 79 TIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVD-RVVLISPLEMFVIRDLTVEQNIIAAVHLPIYADKVNMTV 157 (440) T ss_pred HHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEE Confidence 1222112223444556777889999999999999988877 4667888888887765432 11111100 0000001111 Q ss_pred ecccce----------------------------EEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcc Q lcl|NC_018285. 
151 VPQSDI----------------------------LHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNAN 202 (383) Q Consensus 151 ~~~~dv----------------------------ih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~ 202 (383) +..+.+ +++++ ...|.|-+..+...+.....+.....+..+..+.|- T Consensus 158 yt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~ 232 (440) T protein:vir:95 158 YTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWN-----NRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAM 232 (440) T ss_pred EeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcce Confidence 222222 33322 124677777766666666665555555566666777 Q ss_pred eeEeec---CCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc Q lcl|NC_018285. 203 GILKIK---GGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ 279 (383) Q Consensus 203 ~i~~~~---~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~ 279 (383) .+++.. ...+++....+++...............+.+.+...++.+.....+....+...+.|+..-++|..-.+.. T Consensus 233 ~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~ 312 (440) T protein:vir:95 233 LLVKGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRF 312 (440) T ss_pred eeeecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc Confidence 776653 22344544444433221111111111123333344344444445566777888899999999997554432 Q ss_pred ccCcCHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhhcch-----hhccchhhhccCHHHHHHHHHHHHh Q lcl|NC_018285. 280 GDQQSSLEMS--------------SNVYSKAVARYLRPFLSELSQKLSCD-----VDADIFPAVDPTGANYISRINSMVK 340 (383) Q Consensus 280 ~~~~~~~e~~--------------~~~~~~~l~P~~~~i~~~l~~~l~~~-----~e~~~~~~~~~~~~~~~~~~~~l~~ 340 (383) +.+.+. .+. +..+...+...++.|...++..-... 
+++......-.+..+.+..+.++ T Consensus 313 ~~n~Sg-~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl-- 389 (440) T protein:vir:95 313 NSTSSG-IALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEANKLTFTFHPNIPQDVWTEIKAYIEA-- 389 (440) T ss_pred cccchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccceEEeCCCCCCCHHHHHHHHHHH-- Confidence 222221 111 12223333333333333222211111 11112222234456666777676 Q ss_pred CCCcCHHHHHHHhhcCCcCC-cchhHHh--------CCCC--CCCCCCCCCCC Q lcl|NC_018285. 341 SGTLAQNQGLYILQQAEILP-KELPKGE--------NPNR--TILKGGETNGQ 382 (383) Q Consensus 341 ~g~~t~nE~r~~lg~~~~~~-~d~~~~~--------~~~~--~~~~ggd~~~~ 382 (383) .|+++.-.+.++++. +++ .|+-++. .... ....||..+++ T Consensus 390 ~g~iS~et~~~~l~~--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 390 GGEISQETLMENASF--TDYKTEHSRILKQGGSSDLEIGQIVGDADVGQADTE 440 (440) T ss_pred hccCcHHHHHHhCCC--CCcHHHHHHHHHHHHHhhhhHHhhccCCCCCCcCCC Confidence 588998888777643 322 2222111 0000 01112222222 No 166 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=98.22 E-value=2.4e-06 Score=51.40 Aligned_cols=376 Identities=10% Similarity=0.023 Sum_probs=161.3 Q ss_pred Cchhhhhhc-------C---CcccccccccccchhhcccccC--CceechhhhhccHHHHHHHHHHHHhhhhCceeeecc Q lcl|NC_018285. 1 MPIFNLATE-------S---PPNNQGGFFDITDPEFLATLNG--SEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRK 68 (383) Q Consensus 1 Mglf~~~~~-------~---~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~ 68 (383) +..+..+.+ . +......+.............. ....+.+ +..+-....|+..+.-+-+-|+++... 
T Consensus 38 ~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~k--i~~n~~k~Ivd~~~~yl~g~p~~~~~~ 115 (501) T protein:vir:27 38 VNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRKDREMADKR--AVHNYGRMISKFKTGYLAGNPIRVEYD 115 (501) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccCccccccce--eccchHHHHHHHHhhhhcccCeeEecC Confidence 111111110 0 0000000000000000000000 0000011 122334445566665555556665432 Q ss_pred hh------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-c-e---eEEE Q lcl|NC_018285. 69 QM------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-N-G---LYYN 137 (383) Q Consensus 69 ~~------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~-~---~~y~ 137 (383) +. ...+.+-............+..++..+|.||+.+.++.+|.+ .+..++|..+.+..++.. . . +.|. T Consensus 116 d~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~ 194 (501) T protein:vir:27 116 DNDNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDET-RIKRLNPLETFVIYDNSLEDNSIAAVRYY 194 (501) T ss_pred CccchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCce-EEEEEccceeEEEecCCCCCceEEEEEEE Confidence 21 112222222335556777888999999999999999988876 467788888877765432 1 1 1111 Q ss_pred E-eecCcccccceeecccceEEeccC----------C---------CCccccCcchHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_018285. 138 V-TFDDPRIPPKQHVPQSDILHFRLL----------S---------VDGGLTSVSPLMALGRELDIQKASDKLTLNSLKN 197 (383) Q Consensus 138 ~-~~~~~~~~~~~~~~~~dvih~~~~----------~---------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 197 (383) . ............+.++.+.++... 
+ ......|.|.+..+...++....+.......+.- T Consensus 195 ~~~~~~~~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~ 274 (501) T protein:vir:27 195 NRGTLQNAKDVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPITEFLNNVDGIGDYETELYLIDLYDSAESDTANHMSD 274 (501) T ss_pred EeeecCCcEEEEEEEeCCeEEEEEeCCceeeccccccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHH Confidence 1 001110001111222222222110 0 0112357788887777777766666666666666 Q ss_pred cCCcceeEeecCCC-CHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHh Q lcl|NC_018285. 198 ALNANGILKIKGGG-LLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVV 276 (383) Q Consensus 198 g~~~~~i~~~~~~~-~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~l 276 (383) ...|-.+++..... .++....++..........+.....+.++++.-++....+..+....+...+.|+..-++|..-. T Consensus 275 ~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~ 354 (501) T protein:vir:27 275 MADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSD 354 (501) T ss_pred hcCceeeeecCccCCcccchhhhhhcCceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCc Confidence 66666666543222 22222333322211112222233344555665555555555667777888889999999987555 Q ss_pred cccccCcCHHHHH-------------HHHHHHHHHHHHHHHHHHHHHhhc-ch-----hhccchhhhccCHHHHHHHHHH Q lcl|NC_018285. 277 GGQGDQQSSLEMS-------------SNVYSKAVARYLRPFLSELSQKLS-CD-----VDADIFPAVDPTGANYISRINS 337 (383) Q Consensus 277 g~~~~~~~~~e~~-------------~~~~~~~l~P~~~~i~~~l~~~l~-~~-----~e~~~~~~~~~~~~~~~~~~~~ 337 (383) +..+.+.+..... +..+...+.-.++.+...++..-. .. 
+++...+.+-.+..+.+..+.+ T Consensus 355 ~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~k 434 (501) T protein:vir:27 355 TNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTG 434 (501) T ss_pred cccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHH Confidence 4322222221111 112222333333322222221100 01 1111122222345556666666 Q ss_pred HHhCCCcCHHHHHHHhhcCCcCC--cchhHHh-------------CCCCCCCCCCCCCCCC Q lcl|NC_018285. 338 MVKSGTLAQNQGLYILQQAEILP--KELPKGE-------------NPNRTILKGGETNGQD 383 (383) Q Consensus 338 l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~-------------~~~~~~~~ggd~~~~d 383 (383) + .|+++...+++.++. ++. .|+.+++ .++....+++|...++ T Consensus 435 l--~g~iS~et~l~~l~~--v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~~~~~ 491 (501) T protein:vir:27 435 L--GGQVSQETALSLSGL--VESPNEELDKINKEVSEIDFKGYSNDFNEHVGKYTDEVKET 491 (501) T ss_pred H--hccCcHHHHHHhCCC--CCCHHHHHHHHHHHHHhhhHhhhcCccccccccccCCCCCC Confidence 5 588998888777632 221 2222210 0111111222221111 No 167 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=98.18 E-value=3e-06 Score=50.89 Aligned_cols=375 Identities=10% Similarity=0.006 Sum_probs=161.3 Q ss_pred CchhhhhhcCC-c--ccccccccccchhhc-ccccC-CceechhhhhccHHHHHHHHHHHHhhhhCceeeecc--hhhhh Q lcl|NC_018285. 1 MPIFNLATESP-P--NNQGGFFDITDPEFL-ATLNG-SEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRK--QMQGI 73 (383) Q Consensus 1 Mglf~~~~~~~-~--~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~--~~~~l 73 (383) ..|.+...... + .....+.....+.+. ..... ......+ +.++-....|+..+.-+-+-|+++... ..... 
T Consensus 46 ~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~ 123 (511) T protein:vir:93 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQYQDDDKDVLEV 123 (511) T ss_pred HHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcce--eecchHHHHHHHHhhhhcccCeeeccCChHHHHH Confidence 11211110000 0 000000000000000 00000 0000111 112223334444555455556665322 22223 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC--cee---EEE-EeecCcccc- Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ--NGL---YYN-VTFDDPRIP- 146 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~--~~~---~y~-~~~~~~~~~- 146 (383) +..-+...........+..++..+|.||.++.++.+|.+ .+..++|..+.+..++.. ... .|. .....+... T Consensus 124 l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~ 202 (511) T protein:vir:93 124 IEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDED 202 (511) T ss_pred HHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccc Confidence 333333335566667888899999999999999988876 477788888877665432 111 111 110000000 Q ss_pred ---cceeecccceEEeccCC-------------------------CCccccCcchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_018285. 147 ---PKQHVPQSDILHFRLLS-------------------------VDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNA 198 (383) Q Consensus 147 ---~~~~~~~~dvih~~~~~-------------------------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 198 (383) ....+.++.+.+++... ......|.|-+..+...++....+..-..+.+... 
T Consensus 203 ~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~ 282 (511) T protein:vir:93 203 EVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (511) T ss_pred eEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHh Confidence 01123333333332110 00112477877777777776666655555666666 Q ss_pred CCcceeEeecCCCCHHHHHHHHHHHHH-h---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_018285. 199 LNANGILKIKGGGLLDFKTKVSRSRQA-M---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPEN 274 (383) Q Consensus 199 ~~~~~i~~~~~~~~~e~~~~~~~~~~~-~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~ 274 (383) +.|-.+++.......+...+..+.... . ....+...-.+++.++.-++....+..+....+...+.|+..-++|.. T Consensus 283 ~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~ 362 (511) T protein:vir:93 283 NDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 362 (511) T ss_pred hCcceeeecCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 667666665433343333322221110 0 001111122344555555555444555677778888899999999875 Q ss_pred HhcccccCcCHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHH Q lcl|NC_018285. 275 VVGGQGDQQSSLEMS--------------SNVYSKAVARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYIS 333 (383) Q Consensus 275 ~lg~~~~~~~~~e~~--------------~~~~~~~l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~ 333 (383) -.+..+.+.+ ..+. +..+...|.-.++.|...++..-... +++...+..-.+..+.+. 
T Consensus 363 ~~~~~~~n~S-g~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~ 441 (511) T protein:vir:93 363 KDDNFSGTQS-GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELK 441 (511) T ss_pred ccccccccch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHH Confidence 5543222222 2221 12233333333333333222211111 111111222234555666 Q ss_pred HHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHh--------C--CCCCCCCCCCCCC--CC Q lcl|NC_018285. 334 RINSMVKSGTLAQNQGLYILQQAEILP--KELPKGE--------N--PNRTILKGGETNG--QD 383 (383) Q Consensus 334 ~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~--------~--~~~~~~~ggd~~~--~d 383 (383) .+.++ .|+++.-.+++.++. ++. .|+.+++ . .+....+++.+++ .+ T Consensus 442 ~~~kl--~g~iS~et~~~~l~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (511) T protein:vir:93 442 AYIDS--GGKISQTTLMSLFSF--FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDD 501 (511) T ss_pred HHHHH--hccCchHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCC Confidence 66666 589999888887643 221 2322211 0 0111111111111 11 No 168 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.17 E-value=3.1e-06 Score=50.75 Aligned_cols=369 Identities=9% Similarity=-0.018 Sum_probs=145.2 Q ss_pred CchhhhhhcCCcc------cccccccccchhhcccccCCceechhhh---hccHHHHHHHHHHHHhhhhCceeeecc--- Q lcl|NC_018285. 1 MPIFNLATESPPN------NQGGFFDITDPEFLATLNGSEWVSAETA---LKNSDLFSIISQLSNDLATAKLTTSRK--- 68 (383) Q Consensus 1 Mglf~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~a---~~~~~v~~~i~~ia~~ia~~p~~~~~~--- 68 (383) ..+.+.+.+.... ....+..... .+ .. .+..+..+.. +...-...+|+.+++.+---.|.+-.. 
T Consensus 10 ~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~-~i-~~--~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~Gf~~~~~~~~ 85 (488) T protein:vir:23 10 EKLRDQLLDAFENKQNELKSSKAYYDAER-RP-DA--IGLAVPLDMRKYLAHVGYPRTYVDAIAERQELEGFRIPSANGE 85 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccc-ch-hh--cCcccchhhhhhhhhcchHHHHHHHHHHhhhccceeccCCccc Confidence 2233322211100 0001100000 00 00 0011111110 111122334555554332222322111 Q ss_pred --------hhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecC--------CCceeEEEEeccceeEEEEcCCCc Q lcl|NC_018285. 69 --------QMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRND--------NGRDMKWEYLRPSQVSFNRLDNQN 132 (383) Q Consensus 69 --------~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~--------~g~~~~l~~l~~~~v~~~~~~~~~ 132 (383) +....+.+--..-........+..+++.+|.||+.+.++. .|.+ .+.+++|..+.+..++..+ T Consensus 86 ~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~-~i~~~~p~~~~~~~d~~~~ 164 (488) T protein:vir:23 86 EPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVP-LIRVEPPTALYAEVDPRTR 164 (488) T ss_pred ccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcc-eEEEeccceeEEEEecCCC Confidence 0001111111112345556677889999999999886542 2322 3566777777666543222 Q ss_pred eeEEEE----eecCccccccee-------------------------ecccceEEeccCCCCccccCcchHH----HHHH Q lcl|NC_018285. 133 GLYYNV----TFDDPRIPPKQH-------------------------VPQSDILHFRLLSVDGGLTSVSPLM----ALGR 179 (383) Q Consensus 133 ~~~y~~----~~~~~~~~~~~~-------------------------~~~~dvih~~~~~~~~~~~G~s~~~----~~~~ 179 (383) ...+-+ ..+++....... +..-.|++|++.......+|.|-+. .+.. T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~D 244 (488) T protein:vir:23 165 KVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTD 244 (488) T ss_pred ceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcceEEeccccccCCcCCccchhhhHHHHHH Confidence 211111 000000000001 1111246665443334456777654 2233 Q ss_pred HHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHH--HHHHHHHHHhhcCCcceeecCCC--ceeeecccChhhHHHH Q lcl|NC_018285. 
180 ELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFK--TKVSRSRQAMKQMQGGPLVLDDL--EDFTPLEIKSNVAQLL 255 (383) Q Consensus 180 ~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~--~~~~~~~~~~~~~~g~~~vl~~g--~~~~~~~~~~~d~~~~ 255 (383) .+....+-......++ +.|..+++.- ...+... ..-...+. ...++++.+++| .++.++..++.+ .++ T Consensus 245 a~~~~~s~~~~~~~~~---a~p~~~i~G~-~~~~~~~~~~~~~~~~~---~~~~~v~~~~~g~~~~~~q~~~~~~~-~~~ 316 (488) T protein:vir:23 245 AAAQILMNMQGTANLM---AIPQRLIFGA-KPEELGINAETGQRMFD---AYMARILAFEGGEGAHAEQFSAAELR-NFV 316 (488) T ss_pred HHHHHHHHHHHHHHHh---hhHHHHHhCC-Ccccccccccccchhhh---hhhhhhccCCCCCCceeEecCCCChH-HHH Confidence 3332222222222222 2343333311 1111000 00011111 123456666655 556555544333 477 Q ss_pred HHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----------chh----h-ccc Q lcl|NC_018285. 256 KQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLS----------CDV----D-ADI 320 (383) Q Consensus 256 e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~----------~~~----e-~~~ 320 (383) +..+..+.+|+..=++|++.+|+...+..+.++.+. ....+.-.+...+..|...|- ... + .++ T Consensus 317 ~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~-~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i 395 (488) T protein:vir:23 317 DALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKA-AESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGGDIPTEYYRM 395 (488) T ss_pred HHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcchhhccc Confidence 888888999999999999999876554333333322 111111112222222221110 000 0 011 Q ss_pred ----hhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhhcCCcCCcchhH------------HhCC---CCCCCCCCCC Q lcl|NC_018285. 321 ----FPAVDPTGANYISRINSMVKSG--TLAQNQGLYILQQAEILPKELPK------------GENP---NRTILKGGET 379 (383) Q Consensus 321 ----~~~~~~~~~~~~~~~~~l~~~g--~~t~nE~r~~lg~~~~~~~d~~~------------~~~~---~~~~~~ggd~ 379 (383) ....-.+..+.+..+.+++++| +++...+++.+|..+-+-.++.+ .+.+ ...+-..|+. 
T Consensus 396 ~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 475 (488) T protein:vir:23 396 ETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEA 475 (488) T ss_pred eEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccCCCC Confidence 1111234556677788888865 78999898988754321111111 0000 0011111222 Q ss_pred ------CCCC Q lcl|NC_018285. 380 ------NGQD 383 (383) Q Consensus 380 ------~~~d 383 (383) ++++ T Consensus 476 ~~~~~~~~e~ 485 (488) T protein:vir:23 476 PVGEPPAPEP 485 (488) T ss_pred CCCCCCCCCC Confidence 2222 No 169 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.14 E-value=3.8e-06 Score=50.31 Aligned_cols=372 Identities=10% Similarity=0.010 Sum_probs=168.6 Q ss_pred CchhhhhhcC---Cccccc--------cccccc---chhhccc--ccCCceechhhhhccHHHHHHHHHHHHhhhhCcee Q lcl|NC_018285. 1 MPIFNLATES---PPNNQG--------GFFDIT---DPEFLAT--LNGSEWVSAETALKNSDLFSIISQLSNDLATAKLT 64 (383) Q Consensus 1 Mglf~~~~~~---~~~~~~--------~~~~~~---~~~~~~~--~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~ 64 (383) |+..+.+... +....+ .+..+- .+.+... ...+.... +.-+...-....++..|+-+..-|.. T Consensus 16 ~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~-~~~~s~n~~~~iv~~~a~~l~~ep~~ 94 (499) T protein:vir:80 16 MGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVN-RRQLSMNLPKVTAKYMSKLLFNEKVK 94 (499) T ss_pred hccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccc-cceeecchHHHHHHHHHHhhhCCcce Confidence 4443222211 000000 000000 0001000 00000001 11122233344566677766665554 Q ss_pred ee--cchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCce---e----- Q lcl|NC_018285. 65 TS--RKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNG---L----- 134 (383) Q Consensus 65 ~~--~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~---~----- 134 (383) +. 
+......+.+-...-........++......|.+|+.+..|.+|++ .+.+++|..+-+...+.+.. . T Consensus 95 i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i~~v~a~~~~Pi~~d~~~~~~~~f~~~~ 173 (499) T protein:vir:80 95 INIDDETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KVSFATADCMYPLSNDSENVDECLIANSF 173 (499) T ss_pred EeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcE-EEEEEcCCceEEEEecCCCeEEEEEEEEE Confidence 43 3323332222222233455566677788889999999999988875 35667776665433322211 0 Q ss_pred ---------------------EEEEee-----cCc-cccccee----------------ecccceEEeccCCCC----cc Q lcl|NC_018285. 135 ---------------------YYNVTF-----DDP-RIPPKQH----------------VPQSDILHFRLLSVD----GG 167 (383) Q Consensus 135 ---------------------~y~~~~-----~~~-~~~~~~~----------------~~~~dvih~~~~~~~----~~ 167 (383) .|++.. .+. ..|..+. +..--+.|++.+-++ .. T Consensus 174 ~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~s 253 (499) T protein:vir:80 174 HKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTS 253 (499) T ss_pred eecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCCccceEeecCCccccccCCC Confidence 111110 000 0011100 011114556554222 23 Q ss_pred ccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEe-----ecCCCCHHHHHHHHHHHHHhhcCCcceee---cCCC Q lcl|NC_018285. 168 LTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILK-----IKGGGLLDFKTKVSRSRQAMKQMQGGPLV---LDDL 239 (383) Q Consensus 168 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~-----~~~~~~~e~~~~~~~~~~~~~~~~g~~~v---l~~g 239 (383) ..|.|.+..+...+...........+-|+.+ ....++. .....+.+....+... ....+.+. 
-+++ T Consensus 254 plG~S~~~~~~~lid~lD~~~s~~~~e~~~~-~~~i~v~~~~l~~~~~~~g~~~~~~~~~-----~~~~~~~~~~~~~~~ 327 (499) T protein:vir:80 254 PLGISVYANALDTLKTLDLMFDSYYQEFKLG-KKKVLVPSSFVKTAVNLDGSTTQYFDST-----DEAFFLYQGEQDDNG 327 (499) T ss_pred ccCCchHhhHHHHHHHHHHHHHHHHHHHHhc-ccceecchhhhhccCCCCCCcccCCCcc-----cceeeEeeccCCCCc Confidence 4689999888888887777666666666653 3333331 1111110000000000 00001111 1223 Q ss_pred ceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcC-HHHHH--HHHHHHHHHHHHHHHHHHHHHh---h- Q lcl|NC_018285. 240 EDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQS-SLEMS--SNVYSKAVARYLRPFLSELSQK---L- 312 (383) Q Consensus 240 ~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~-~~e~~--~~~~~~~l~P~~~~i~~~l~~~---l- 312 (383) ..++.++....+-++.+..+...++|....|+++..+|...++.. ..+.. ..-...++.-..+.++..|..- + T Consensus 328 ~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il 407 (499) T protein:vir:80 328 KAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSIL 407 (499) T ss_pred CceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 357777777777778888889999999999999999996544332 22221 1111112222233333332221 0 Q ss_pred --------cc-------hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcch----hHH--hC--- Q lcl|NC_018285. 313 --------SC-------DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKEL----PKG--EN--- 368 (383) Q Consensus 313 --------~~-------~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~----~~~--~~--- 368 (383) +. .+.++.....-.|..+.+....+++.+|++++-+++... .+++..++ .+. +. T Consensus 408 ~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~d~ea~~el~~i~~E~~~~ 485 (499) T protein:vir:80 408 EVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRA--WNITEAEADEWAEMLAKEKQAE 485 (499) T ss_pred HHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhc--CCCChHHHHHHHHHHHHHhhcC Confidence 00 112222233345667777888899999999999887653 33443332 221 11 Q ss_pred CCCCCCCCCCCCCCC Q lcl|NC_018285. 
369 PNRTILKGGETNGQD 383 (383) Q Consensus 369 ~~~~~~~ggd~~~~d 383 (383) .+. +..+|-.++.| T Consensus 486 ~~~-~d~~g~~ge~e 499 (499) T protein:vir:80 486 IPN-NDMTGIFGEEE 499 (499) T ss_pred CCC-CCccccCCCCC Confidence 121 12244333334 No 170 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=98.14 E-value=3.8e-06 Score=50.30 Aligned_cols=361 Identities=11% Similarity=0.013 Sum_probs=147.8 Q ss_pred CchhhhhhcCCccc------ccccccccchhhcccccCCceechh-hhhc--cHHHHHHHHHHHHhhhhCceeeecc-hh Q lcl|NC_018285. 1 MPIFNLATESPPNN------QGGFFDITDPEFLATLNGSEWVSAE-TALK--NSDLFSIISQLSNDLATAKLTTSRK-QM 70 (383) Q Consensus 1 Mglf~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~-~a~~--~~~v~~~i~~ia~~ia~~p~~~~~~-~~ 70 (383) .-+.+++.+..... ...+...... . ... +.....+ ...+ +.-..-+|+..+.-+--..|.+-+. .. T Consensus 6 ~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~-i-~~~--~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~g~~~~d~~~l 81 (441) T protein:vir:80 6 LALIEGMYDRIQRLSSWHCCIEGYYEGSNR-V-RDL--GVAIPPELQRVQTVVSWPGIAVDALEERLDWLGWTNGDGYGL 81 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-c-hhc--CcccchhhhhhhhhcchHHHHHHHHHhhhccccccCCChHHH Confidence 11111111110000 0000000000 0 000 0000000 0011 1112234444443331112222111 11 Q ss_pred hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCcee--EEEEeecC-c---- Q lcl|NC_018285. 71 QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGL--YYNVTFDD-P---- 143 (383) Q Consensus 71 ~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~--~y~~~~~~-~---- 143 (383) +.++.. -+.......+..+++++|.||+.+.++.+|.+ .+..++|..+.+..+...... .+.+.... . 
T Consensus 82 ~~i~~~----n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~ 156 (441) T protein:vir:80 82 DGVYAA----NRLATASCDVHLDALIFGLSFVAIIPHGDGTV-SVRPQSPKNCTGKFSADGSRLDAGLVVQQTCDPEVVE 156 (441) T ss_pred HHHHHh----cCHHHHHHHHHHHHhhcCeeEEEEEeCCCCce-EEEEEccceEEEEEeCCCCceeEEEEEEEEecCceEE Confidence 222221 24566777888999999999999999999987 578889998887665433211 11111000 0 Q ss_pred ---------------ccccc-------eeecccceEEeccCCCCccccCcchHHH-HHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_018285. 144 ---------------RIPPK-------QHVPQSDILHFRLLSVDGGLTSVSPLMA-LGRELDIQKASDKLTLNSLKNALN 200 (383) Q Consensus 144 ---------------~~~~~-------~~~~~~dvih~~~~~~~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~~~ng~~ 200 (383) ..+.. ..+..-.|+||.+....+..+|.|-+.. +...++......--......-.+. T Consensus 157 ~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~ 236 (441) T protein:vir:80 157 AELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAY 236 (441) T ss_pred EEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcC Confidence 00000 0112223566654433445567775432 222333322222222233333444 Q ss_pred cceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCC-----ceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHH Q lcl|NC_018285. 201 ANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDL-----EDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENV 275 (383) Q Consensus 201 ~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g-----~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~ 275 (383) |..+++.- ...++.... + ....++++.++.+ .++..+..+..+ .+++..+....+|+..-++|++. 
T Consensus 237 ~~~~i~G~-~~~~~~~~~----~---~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~ 307 (441) T protein:vir:80 237 PQRWVTGV-SADEFSQPG----W---VLSMASVWAVDKDDDGDTPNVGSFPVNSPT-PYSDQMRLLAQLTAGEAAVPERY 307 (441) T ss_pred ceeeeecC-Cccccccch----h---hhcccccccCCCCCCCCcceeEecCccchH-HHHHHHHHHHHHHhcccCCCHHH Confidence 55555421 122211111 1 1122444444332 344444333222 36777888899999999999999 Q ss_pred hcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--------------------hhhccchhhhccCHHHHHHHH Q lcl|NC_018285. 276 VGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSC--------------------DVDADIFPAVDPTGANYISRI 335 (383) Q Consensus 276 lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~--------------------~~e~~~~~~~~~~~~~~~~~~ 335 (383) +|+.+++..+.++.+. ....+.-.+...+..|...|-. .+++......-.+..+.+..+ T Consensus 308 ~g~~~~~~~Sg~Al~~-~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~ 386 (441) T protein:vir:80 308 FGFITSNPPSGEALAA-EESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFFGDVGLRWRDASTPTRAATADAV 386 (441) T ss_pred hccCCCcchHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccceeeeEEeCCCCCcCHHHHHHHH Confidence 9876654333333321 1111222222222222221110 001111112224456677777 Q ss_pred HHHHhCCCc--CHHHHHHHhhcCCcCCcchhHHhCC-----CCC-CCCCCCC-CCCC Q lcl|NC_018285. 336 NSMVKSGTL--AQNQGLYILQQAEILPKELPKGENP-----NRT-ILKGGET-NGQD 383 (383) Q Consensus 336 ~~l~~~g~~--t~nE~r~~lg~~~~~~~d~~~~~~~-----~~~-~~~ggd~-~~~d 383 (383) .+++.+|+. +...+++.+|. .+.++.+++.- +.. ...|... 
...+ T Consensus 387 ~kl~~~g~~~~s~~~~~~~l~~---~~~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~ 440 (441) T protein:vir:80 387 TKLVGAGILPADSRTVLEMLGL---DDVQVEAVMRHRAESSDPLAVLAGAISRQTNE 440 (441) T ss_pred HHHHhcCcccccHHHHHHhCCC---CHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 888888865 44456666554 33444332110 000 0001111 1111 No 171 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=98.12 E-value=4.1e-06 Score=50.15 Aligned_cols=375 Identities=12% Similarity=0.069 Sum_probs=156.1 Q ss_pred CchhhhhhcCCcccc---cccccccchh-----------------------hccc----c------cC-Cceechhhhhc Q lcl|NC_018285. 1 MPIFNLATESPPNNQ---GGFFDITDPE-----------------------FLAT----L------NG-SEWVSAETALK 43 (383) Q Consensus 1 Mglf~~~~~~~~~~~---~~~~~~~~~~-----------------------~~~~----~------~~-~~~~~~~~a~~ 43 (383) |++++-++....... +.......+. +.+- . .. ....+. -+. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~--ki~ 83 (481) T protein:vir:10 6 INNINTKFSPLANDDFVVSDLAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADH--RAV 83 (481) T ss_pred eehhchhcccccCceeeeecchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccc--eee Confidence 444443322211000 0000000000 0000 0 00 000000 122 Q ss_pred cHHHHHHHHHHHHhhhhCceeeec--chhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccc Q lcl|NC_018285. 44 NSDLFSIISQLSNDLATAKLTTSR--KQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPS 121 (383) Q Consensus 44 ~~~v~~~i~~ia~~ia~~p~~~~~--~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~ 121 (383) ++-....|+..+.-+.+-|+.+.- ......+...........+...+..+.+.+|.||+.+.++.+|++ .+..++|. 
T Consensus 84 ~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~-~i~~~~p~ 162 (481) T protein:vir:10 84 HNYAKYVSRFIVGYLTGNPITITHQDNQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRD-TFKVLDPK 162 (481) T ss_pred cchHHHHHHHHHhhhccCCceEecCChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeE-EEEEEccc Confidence 333445666666655566655432 222222222222234556778888999999999999999998876 47778888 Q ss_pred eeEEEEcCCCc-ee-----EEEEeecCcccc-cceeecccceEEeccC-----------C---------CCccccCcchH Q lcl|NC_018285. 122 QVSFNRLDNQN-GL-----YYNVTFDDPRIP-PKQHVPQSDILHFRLL-----------S---------VDGGLTSVSPL 174 (383) Q Consensus 122 ~v~~~~~~~~~-~~-----~y~~~~~~~~~~-~~~~~~~~dvih~~~~-----------~---------~~~~~~G~s~~ 174 (383) .+.+..++... .+ +|......+... ....+.++.+.+++.. + ......|.|-+ T Consensus 163 ~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~ 242 (481) T protein:vir:10 163 STFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPIIEYLNDQFKQGDF 242 (481) T ss_pred ceEEEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccccCCceeEEEeecCCCCCCch Confidence 88776654321 11 111111111000 0112233333333211 0 00113467766 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceee--cCCCceeeecccChhhH Q lcl|NC_018285. 175 MALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLV--LDDLEDFTPLEIKSNVA 252 (383) Q Consensus 175 ~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~v--l~~g~~~~~~~~~~~d~ 252 (383) ..+...+.............+.-.+.|..++......+++....++..... ....+... .+.+.++.-++...... 
T Consensus 243 ~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~l~~~~~~~ 320 (481) T protein:vir:10 243 ENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANMI--HLEPGTNANGSEGKAEVKYVYKQYDVA 320 (481) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccce--eccccccccCCCCCcceeEEeecCCHH Confidence 655555554444433334444445566666654333333333333221110 00111111 12233444444444455 Q ss_pred HHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHH-H--HHHHHHHHHHHHHHHHHH-----------hhcch--- Q lcl|NC_018285. 253 QLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSN-V--YSKAVARYLRPFLSELSQ-----------KLSCD--- 315 (383) Q Consensus 253 ~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~-~--~~~~l~P~~~~i~~~l~~-----------~l~~~--- 315 (383) .+.+..+...+.|+..-++|....+..+.+.+ ..+.+. + ....+.-..+.+...|.+ .-... T Consensus 321 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S-g~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~ 399 (481) T protein:vir:10 321 GVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQS-GESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHNY 399 (481) T ss_pred HHHHHHHHHHHHHHHHhCCccccccccccccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccc Confidence 67777888889999999999766653322222 222211 0 111111111112222211 10011 Q ss_pred --hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHh-------------CCCCCCCCCCCCC Q lcl|NC_018285. 316 --VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGE-------------NPNRTILKGGETN 380 (383) Q Consensus 316 --~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~-------------~~~~~~~~ggd~~ 380 (383) ++....+..-.+..+.+..+.++ .|+++.-.+.+.++.-.=+..|+.+++ ..+.....+++.+ T Consensus 400 ~~i~v~f~~~~~~~~~~~a~~~~kl--~g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~d 477 (481) T protein:vir:10 400 AELTITFTPNLPKSMMESINAFNAL--SGGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKRGYGEAFENHLNVD 477 (481) T ss_pred ceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhccCCccCCCCCCCC Confidence 11111222234455666666665 488998888877643210012322211 0111111122222 Q ss_pred CCC Q lcl|NC_018285. 
381 GQD 383 (383) Q Consensus 381 ~~d 383 (383) +.+ T Consensus 478 d~~ 480 (481) T protein:vir:10 478 DSN 480 (481) T ss_pred CCC Confidence 222 No 172 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.11 E-value=4.4e-06 Score=49.98 Aligned_cols=366 Identities=9% Similarity=-0.008 Sum_probs=147.9 Q ss_pred CchhhhhhcCCcc------cccccccccchhhcccccCCceechhh-h--hccHHHHHHHHHHHHhhhhCceeeecch-- Q lcl|NC_018285. 1 MPIFNLATESPPN------NQGGFFDITDPEFLATLNGSEWVSAET-A--LKNSDLFSIISQLSNDLATAKLTTSRKQ-- 69 (383) Q Consensus 1 Mglf~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~-a--~~~~~v~~~i~~ia~~ia~~p~~~~~~~-- 69 (383) -.+...+.+.... ....+...... . ... +..+..+. - ....-..-+|+.++..+--..+.+-... T Consensus 15 ~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~-i-~~~--~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~~~g~~~~~~~~~ 90 (486) T protein:vir:42 15 AVVREEMISAFEDASKDLASNTSYYDAERR-P-EAI--GVTVPREMQQLLAHVGYPRLYVDSVAERQAVEGFRLGDADEA 90 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCc-c-hhc--ccccchhHhhhhhccchHHHHHHHHHhhhcccceecCCCchh Confidence 1112221111000 00001000000 0 000 00010000 0 0111223345555544433444443221 Q ss_pred ---hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCce-------eEEEEeccceeEEEEcCCCceeE--EE Q lcl|NC_018285. 70 ---MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRD-------MKWEYLRPSQVSFNRLDNQNGLY--YN 137 (383) Q Consensus 70 ---~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~-------~~l~~l~~~~v~~~~~~~~~~~~--y~ 137 (383) .+.++.+ | ........+..+++.+|.||+.+.++..|.. ..+.+++|..+.+..+.....+. 
++ T Consensus 91 ~~~~~~i~~~-N---~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d~~~~~~~~~~~ 166 (486) T protein:vir:42 91 DEELWQWWQA-N---NLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEIDPRINRVSKAIR 166 (486) T ss_pred HHHHHHHHHh-c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEeCCCCCeEEEEE Confidence 1122222 2 2344556788999999999999987654322 35677888887776653322111 11 Q ss_pred EeecC--cccccceeeccc-------------------------ceEEeccCCCCccccCcchHH----HHHHHHHHHHH Q lcl|NC_018285. 138 VTFDD--PRIPPKQHVPQS-------------------------DILHFRLLSVDGGLTSVSPLM----ALGRELDIQKA 186 (383) Q Consensus 138 ~~~~~--~~~~~~~~~~~~-------------------------dvih~~~~~~~~~~~G~s~~~----~~~~~i~~~~~ 186 (383) +.... +.......+.++ .|++|.+..-.+..+|.|-+. .+.+.+....+ T Consensus 167 ~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s 246 (486) T protein:vir:42 167 VAYDKEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILM 246 (486) T ss_pred EEEecCCCeEEEEEEEcCCcEEEEEecCCcEEeecceecCCCCceEEEeccccccCCCCCcccchhhHHHHHHHHHHHHH Confidence 11110 000000011111 244454432334456777544 33444443333 Q ss_pred HHHHHHHHHhccCCcceeEeecC--CCCHHHHHHHHHHHHHhhcCCcceeecC-CCceeeecccChhhHHHHHHHHHHHH Q lcl|NC_018285. 187 SDKLTLNSLKNALNANGILKIKG--GGLLDFKTKVSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVAQLLKQADWTTG 263 (383) Q Consensus 187 ~~~~~~~~~~ng~~~~~i~~~~~--~~~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~~~~e~~~~~~~ 263 (383) - .....+-.+.|..+++... ....+. .+-...| ....+++++++ ++.++.++.....+ .+++..+.... 
T Consensus 247 ~---~~~~~e~~a~p~~~i~G~~~~~~~~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~q~~~~~~e-~~~~~l~~~i~ 318 (486) T protein:vir:42 247 L---MQATAELMGVPQRLIFGIKPEEIGVDS-ETGQTLF---DAYLARILAFEDAEGKIQQFSAAELA-NFTNALDQIAK 318 (486) T ss_pred H---HHHHHHhhcchHHHhhcCCcccccccc-ccccchh---hhhhchhcccCCCCceEEeecccCHH-HHHHHHHHHHH Confidence 2 2222222333444444211 111000 0000111 12235555554 56777666544333 36788888889 Q ss_pred HHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhc------------ch-------hhccchhhh Q lcl|NC_018285. 264 QFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLS------------CD-------VDADIFPAV 324 (383) Q Consensus 264 ~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~------------~~-------~e~~~~~~~ 324 (383) +++..=++|++.+|+...+..+.++.+. ....+.-.+...+..|...|- .. ++....... T Consensus 319 ~~s~~~~~p~~~fg~~~~n~~Sg~Al~~-~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~ 397 (486) T protein:vir:42 319 QVAAYTGLPPQYLSTAADNPASAEAIRA-AESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDVPPDMLRMETVWRDPS 397 (486) T ss_pred HHhcccCCCHHHhccccCchhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeeEEecCCC Confidence 9999999999999876544333333221 111111111222221111110 00 011111122 Q ss_pred ccCHHHHHHHHHHHHhC--CCcCHHHHHHHhhcCCcCCcchhHH------------hCCCC-C------CCCCCCCCCCC Q lcl|NC_018285. 325 DPTGANYISRINSMVKS--GTLAQNQGLYILQQAEILPKELPKG------------ENPNR-T------ILKGGETNGQD 383 (383) Q Consensus 325 ~~~~~~~~~~~~~l~~~--g~~t~nE~r~~lg~~~~~~~d~~~~------------~~~~~-~------~~~ggd~~~~d 383 (383) -.+..+.+..+.+++++ |+++..-+++.+|..+-.-.++.+. ..++. . 
+..+++..+++ T Consensus 398 ~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (486) T protein:vir:42 398 TPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQP 477 (486) T ss_pred CCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCc Confidence 24556677778888875 7888888888776533111111110 00110 0 11111111111 No 173 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=98.11 E-value=4.4e-06 Score=49.93 Aligned_cols=366 Identities=12% Similarity=0.020 Sum_probs=162.4 Q ss_pred CchhhhhhcCCccc-------------c------ccc-cc------c-cchhh---cccccCC--ceec--hhhhhccHH Q lcl|NC_018285. 1 MPIFNLATESPPNN-------------Q------GGF-FD------I-TDPEF---LATLNGS--EWVS--AETALKNSD 46 (383) Q Consensus 1 Mglf~~~~~~~~~~-------------~------~~~-~~------~-~~~~~---~~~~~~~--~~~~--~~~a~~~~~ 46 (383) |+++.-+..+.... . ..+ .. . ..+.. ...-.+. .... +..=+.++- T Consensus 4 ~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~ 83 (474) T protein:vir:10 4 YKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSF 83 (474) T ss_pred HHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccch Confidence 23322222111000 0 000 00 0 00000 0000000 0000 000011223 Q ss_pred HHHHHHHHHHhhhhCceeeecch----h---hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEec Q lcl|NC_018285. 47 LFSIISQLSNDLATAKLTTSRKQ----M---QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLR 119 (383) Q Consensus 47 v~~~i~~ia~~ia~~p~~~~~~~----~---~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~ 119 (383) ....|+..+.-+-+-|+++.-.. . 
...+.+-............+..++..+|.||..+.++.+|++ .+..++ T Consensus 84 ~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~~~i~ 162 (474) T protein:vir:10 84 DSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RIKNID 162 (474) T ss_pred HHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EEEEEc Confidence 33445555555555576654211 1 112222222234555667788999999999999999988875 567788 Q ss_pred cceeEEEEcCCCceeE---EEEeecCcccc---cceeecccc--------------------------eEEeccCCCCcc Q lcl|NC_018285. 120 PSQVSFNRLDNQNGLY---YNVTFDDPRIP---PKQHVPQSD--------------------------ILHFRLLSVDGG 167 (383) Q Consensus 120 ~~~v~~~~~~~~~~~~---y~~~~~~~~~~---~~~~~~~~d--------------------------vih~~~~~~~~~ 167 (383) |..+-+..++.+.... |.......... ....+.... |+++++ . T Consensus 163 p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~ 237 (474) T protein:vir:10 163 PYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPN-----N 237 (474) T ss_pred ccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecC-----C Confidence 8877666554332211 11100000000 000111111 233322 2 Q ss_pred ccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeeccc Q lcl|NC_018285. 168 LTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEI 247 (383) Q Consensus 168 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~ 247 (383) ..|.|-+..+...+.....+.....+.+...+.|-.+++.- .++++....++ ..+.+.+.+++.+++-+.. 
T Consensus 238 ~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~-~~~~~~~~~~~--------~~~~i~~~~~~~~~~~l~~ 308 (474) T protein:vir:10 238 KEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGM-GMSEEMIQETQ--------KSGAFELFDKDMDVKYLTK 308 (474) T ss_pred CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccC-CCCchhhhhhh--------hcceeEecCCCCceeEEec Confidence 34777777766666666655555555555556666666532 23333222221 1244555566666666655 Q ss_pred ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHH-------------HHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_018285. 248 KSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEM-------------SSNVYSKAVARYLRPFLSELSQKLSC 314 (383) Q Consensus 248 ~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~-------------~~~~~~~~l~P~~~~i~~~l~~~l~~ 314 (383) ...+..+.+..+...+.|+..-++|..-.+..+.+.+.... .+..+...+.-.++.|...++.+-.. T Consensus 309 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~ 388 (474) T protein:vir:10 309 DVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYN 388 (474) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 55555667778888899999999987554422222222111 11233334444444444433332111 Q ss_pred -------hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHh-------CCCCCCCCCCCCC Q lcl|NC_018285. 315 -------DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGE-------NPNRTILKGGETN 380 (383) Q Consensus 315 -------~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~-------~~~~~~~~ggd~~ 380 (383) ++++......-.+..+.+..+.++ .|+++...+.++++.-.=+..|+.+++ .......+|+.++ T Consensus 389 ~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~ 466 (474) T protein:vir:10 389 LDDDSYLNLIFKFTRNIPVNKLEESQVLINL--KGQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGDAND 466 (474) T ss_pred CCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCCcCC Confidence 111122222234556677777776 489999888888753220012222211 1111122233332 Q ss_pred CCC Q lcl|NC_018285. 
381 GQD 383 (383) Q Consensus 381 ~~d 383 (383) +++ T Consensus 467 ~~~ 469 (474) T protein:vir:10 467 KSQ 469 (474) T ss_pred CCc Confidence 222 No 174 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=98.11 E-value=4.4e-06 Score=49.93 Aligned_cols=366 Identities=12% Similarity=0.020 Sum_probs=162.4 Q ss_pred CchhhhhhcCCccc-------------c------ccc-cc------c-cchhh---cccccCC--ceec--hhhhhccHH Q lcl|NC_018285. 1 MPIFNLATESPPNN-------------Q------GGF-FD------I-TDPEF---LATLNGS--EWVS--AETALKNSD 46 (383) Q Consensus 1 Mglf~~~~~~~~~~-------------~------~~~-~~------~-~~~~~---~~~~~~~--~~~~--~~~a~~~~~ 46 (383) |+++.-+..+.... . ..+ .. . ..+.. ...-.+. .... +..=+.++- T Consensus 4 ~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~ 83 (474) T protein:vir:94 4 YKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSF 83 (474) T ss_pred HHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccch Confidence 23322222111000 0 000 00 0 00000 0000000 0000 000011223 Q ss_pred HHHHHHHHHHhhhhCceeeecch----h---hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEec Q lcl|NC_018285. 47 LFSIISQLSNDLATAKLTTSRKQ----M---QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLR 119 (383) Q Consensus 47 v~~~i~~ia~~ia~~p~~~~~~~----~---~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~ 119 (383) ....|+..+.-+-+-|+++.-.. . 
...+.+-............+..++..+|.||..+.++.+|++ .+..++ T Consensus 84 ~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~~~i~ 162 (474) T protein:vir:94 84 DSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RIKNID 162 (474) T ss_pred HHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EEEEEc Confidence 33445555555555576654211 1 112222222234555667788999999999999999988875 567788 Q ss_pred cceeEEEEcCCCceeE---EEEeecCcccc---cceeecccc--------------------------eEEeccCCCCcc Q lcl|NC_018285. 120 PSQVSFNRLDNQNGLY---YNVTFDDPRIP---PKQHVPQSD--------------------------ILHFRLLSVDGG 167 (383) Q Consensus 120 ~~~v~~~~~~~~~~~~---y~~~~~~~~~~---~~~~~~~~d--------------------------vih~~~~~~~~~ 167 (383) |..+-+..++.+.... |.......... ....+.... |+++++ . T Consensus 163 p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~ 237 (474) T protein:vir:94 163 PYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPN-----N 237 (474) T ss_pred ccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecC-----C Confidence 8877666554332211 11100000000 000111111 233322 2 Q ss_pred ccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeeccc Q lcl|NC_018285. 168 LTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEI 247 (383) Q Consensus 168 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~ 247 (383) ..|.|-+..+...+.....+.....+.+...+.|-.+++.- .++++....++ ..+.+.+.+++.+++-+.. 
T Consensus 238 ~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~-~~~~~~~~~~~--------~~~~i~~~~~~~~~~~l~~ 308 (474) T protein:vir:94 238 KEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGM-GMSEEMIQETQ--------KSGAFELFDKDMDVKYLTK 308 (474) T ss_pred CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccC-CCCchhhhhhh--------hcceeEecCCCCceeEEec Confidence 34777777766666666655555555555556666666532 23333222221 1244555566666666655 Q ss_pred ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHH-------------HHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_018285. 248 KSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEM-------------SSNVYSKAVARYLRPFLSELSQKLSC 314 (383) Q Consensus 248 ~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~-------------~~~~~~~~l~P~~~~i~~~l~~~l~~ 314 (383) ...+..+.+..+...+.|+..-++|..-.+..+.+.+.... .+..+...+.-.++.|...++.+-.. T Consensus 309 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~ 388 (474) T protein:vir:94 309 DVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYN 388 (474) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 55555667778888899999999987554422222222111 11233334444444444433332111 Q ss_pred -------hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHh-------CCCCCCCCCCCCC Q lcl|NC_018285. 315 -------DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGE-------NPNRTILKGGETN 380 (383) Q Consensus 315 -------~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~-------~~~~~~~~ggd~~ 380 (383) ++++......-.+..+.+..+.++ .|+++...+.++++.-.=+..|+.+++ .......+|+.++ T Consensus 389 ~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~ 466 (474) T protein:vir:94 389 LDDDSYLNLIFKFTRNIPVNKLEESQVLINL--KGQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGDAND 466 (474) T ss_pred CCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCCcCC Confidence 111122222234556677777776 489999888888753220012222211 1111122233332 Q ss_pred CCC Q lcl|NC_018285. 
381 GQD 383 (383) Q Consensus 381 ~~d 383 (383) +++ T Consensus 467 ~~~ 469 (474) T protein:vir:94 467 KSQ 469 (474) T ss_pred CCc Confidence 222 No 175 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=98.06 E-value=5.6e-06 Score=49.38 Aligned_cols=378 Identities=10% Similarity=0.010 Sum_probs=160.1 Q ss_pred Cch---hhhh----hcC---CcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch- Q lcl|NC_018285. 1 MPI---FNLA----TES---PPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ- 69 (383) Q Consensus 1 Mgl---f~~~----~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~- 69 (383) +.. ..++ ... +......+.................-.+..-+.++-....|+..+.-+-+-|+++.-.+ T Consensus 38 ~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~ 117 (501) T protein:vir:96 38 VNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDN 117 (501) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhcccCeeEeeCCc Confidence 111 1111 000 00000000000000000000000000000011233344455555555555566654221 Q ss_pred -----hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-cee----EEE-E Q lcl|NC_018285. 70 -----MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-NGL----YYN-V 138 (383) Q Consensus 70 -----~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~~~----~y~-~ 138 (383) ...++.+.............+..+++.+|.||+.+.++.+|.+ .+..++|..+.+..++.. +.+ .|. . 
T Consensus 118 ~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 196 (501) T protein:vir:96 118 DDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDET-RIKRLSPLETFVIYDNSLEDNSIAAVRYYNR 196 (501) T ss_pred cchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEe Confidence 1222333333345566777888999999999999999988876 467788888877766432 111 111 1 Q ss_pred eecCcccccceeecccceEEeccC----------C---------CCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_018285. 139 TFDDPRIPPKQHVPQSDILHFRLL----------S---------VDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNAL 199 (383) Q Consensus 139 ~~~~~~~~~~~~~~~~dvih~~~~----------~---------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~ 199 (383) ....+.......+.++.+.++..- + ......|.|.+..+...++....+.....+.+...+ T Consensus 197 ~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~ 276 (501) T protein:vir:96 197 GTLQSAKDVVEIYTDEHIYTLDASDDFNEISVTTHAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMA 276 (501) T ss_pred ecCCCcEEEEEEEcCCcEEEEeeCCCceeccccccCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhc Confidence 011000001111222222222110 0 001134788887777777766666666666666666 Q ss_pred CcceeEeecCCCC-HHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc Q lcl|NC_018285. 200 NANGILKIKGGGL-LDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGG 278 (383) Q Consensus 200 ~~~~i~~~~~~~~-~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~ 278 (383) .|-.+++...... .+....++..+.......+.....+.+.+..-++....+..+....+...+.|+..-++|..-.+. 
T Consensus 277 ~~~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~ 356 (501) T protein:vir:96 277 DAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTN 356 (501) T ss_pred CceeeeecccccCcccchhhhhhcCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCccc Confidence 6766665432222 222222222111111111212222344454445544445556777788888899988998766553 Q ss_pred cccCcCHHHHH-------------HHHHHHHHHHHHHHHHHHHHHhhc-ch-----hhccchhhhccCHHHHHHHHHHHH Q lcl|NC_018285. 279 QGDQQSSLEMS-------------SNVYSKAVARYLRPFLSELSQKLS-CD-----VDADIFPAVDPTGANYISRINSMV 339 (383) Q Consensus 279 ~~~~~~~~e~~-------------~~~~~~~l~P~~~~i~~~l~~~l~-~~-----~e~~~~~~~~~~~~~~~~~~~~l~ 339 (383) .+.+.+..... +..+...+.-.++.+...++..-. .. +++......-.+..+.+..+.++. T Consensus 357 ~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~ 436 (501) T protein:vir:96 357 FSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLG 436 (501) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh Confidence 22222221111 112222333333222222221100 01 111112223345566666666663 Q ss_pred hCCCcCHHHHHHHhhcCCcCC--cchhHHh----C--CCC-----CCCCC------CCCCCCC Q lcl|NC_018285. 340 KSGTLAQNQGLYILQQAEILP--KELPKGE----N--PNR-----TILKG------GETNGQD 383 (383) Q Consensus 340 ~~g~~t~nE~r~~lg~~~~~~--~d~~~~~----~--~~~-----~~~~g------gd~~~~d 383 (383) |+++...+.+.++. ++. .|+.+++ . .+. 
.+..| .|.+.+| T Consensus 437 --g~iS~et~~~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~ 495 (501) T protein:vir:96 437 --GQVSQETALSLSGL--VESPNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDEVKETHTDD 495 (501) T ss_pred --ccCchHHHHHhCCC--CCCHHHHHHHHHHHHHHhhccccccchhhcccccCCcCCCCCCCc Confidence 78998888777642 221 2222211 0 000 01111 1111111 No 176 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=98.03 E-value=6.6e-06 Score=49.00 Aligned_cols=377 Identities=10% Similarity=0.006 Sum_probs=161.2 Q ss_pred CchhhhhhcCC-c--ccccccccccchhhcccccCCc--eechhhhhccHHHHHHHHHHHHhhhhCceeeecch--hhhh Q lcl|NC_018285. 1 MPIFNLATESP-P--NNQGGFFDITDPEFLATLNGSE--WVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--MQGI 73 (383) Q Consensus 1 Mglf~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~--~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--~~~l 73 (383) .++.+.....+ + .....+.......+.-...... ....+ +...-....|+..+.-+-+-|+++...+ .... T Consensus 46 ~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~k--i~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~ 123 (512) T protein:vir:97 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQCQDDDKDVLEA 123 (512) T ss_pred HHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcce--eecchHHHHHHHHhhhhcccCceeccCChHHHHH Confidence 12222111000 0 0000000000000000000000 00001 1122233345555555555666653222 2223 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-e----eEE-EEeecCccc-- Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-G----LYY-NVTFDDPRI-- 145 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-~----~~y-~~~~~~~~~-- 145 (383) +..-+...........+..++..+|.||+.+.++.+|.+ .+..++|..+.+..++... . +.| ......+.. 
T Consensus 124 l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~-~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~ 202 (512) T protein:vir:97 124 IEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDED 202 (512) T ss_pred HHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccc Confidence 333333334556667788899999999999999988876 4778888888777654321 1 111 111000000 Q ss_pred -c-cceeecccceEEeccCC-------------------------CCccccCcchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_018285. 146 -P-PKQHVPQSDILHFRLLS-------------------------VDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNA 198 (383) Q Consensus 146 -~-~~~~~~~~dvih~~~~~-------------------------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 198 (383) . ....+.++.+.+++... ......|.|-+..+...++....+..-..+.+... T Consensus 203 ~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~ 282 (512) T protein:vir:97 203 EVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (512) T ss_pred eEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEeecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHh Confidence 0 01123333343332100 00112477877777777777766665556666666 Q ss_pred CCcceeEeecCCCCHHHHHHHHHHHHHhhc-----CCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCH Q lcl|NC_018285. 199 LNANGILKIKGGGLLDFKTKVSRSRQAMKQ-----MQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPE 273 (383) Q Consensus 199 ~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~-----~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp 273 (383) +.|-.+++.....+.+.............. +.....-.++|.+++-++.......+....+...+.|+..-++|. 
T Consensus 283 ~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~ 362 (512) T protein:vir:97 283 NDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPN 362 (512) T ss_pred cCceeeeecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcc Confidence 777777665443443333332222211111 111111234555665555554555566777888888988888887 Q ss_pred HHhcccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHH Q lcl|NC_018285. 274 NVVGGQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYI 332 (383) Q Consensus 274 ~~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~ 332 (383) .-.+..+.+.+ ..+.+ ..+...|.-.++.|...+...-... +++...+.+-.+..+.+ T Consensus 363 ~~~~~~~gn~S-g~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~ 441 (512) T protein:vir:97 363 MKDDNFSGTQS-GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEEL 441 (512) T ss_pred cCcccccccch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHH Confidence 65543222222 22221 1222222222222222222111011 11111122223445566 Q ss_pred HHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHh--------C-C-CCCCCCCCCC-CCCC Q lcl|NC_018285. 333 SRINSMVKSGTLAQNQGLYILQQAEILPKELPKGE--------N-P-NRTILKGGET-NGQD 383 (383) Q Consensus 333 ~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~--------~-~-~~~~~~ggd~-~~~d 383 (383) ..+.++ .|+++.-.++++++.-.=+..|+.+++ . . 
+....+++.+ ++++ T Consensus 442 ~~~~kl--~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (512) T protein:vir:97 442 KAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQD 501 (512) T ss_pred HHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCC Confidence 666666 489999888887743210012222111 0 0 1111111111 1111 No 177 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.02 E-value=6.8e-06 Score=48.91 Aligned_cols=370 Identities=11% Similarity=0.094 Sum_probs=157.4 Q ss_pred CchhhhhhcC----Ccccccccccccc-----hh-hcccccC----CceechhhhhccHHHHHHHHHHHHhhhhCceeee Q lcl|NC_018285. 1 MPIFNLATES----PPNNQGGFFDITD-----PE-FLATLNG----SEWVSAETALKNSDLFSIISQLSNDLATAKLTTS 66 (383) Q Consensus 1 Mglf~~~~~~----~~~~~~~~~~~~~-----~~-~~~~~~~----~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~ 66 (383) .-+++++-++ +......+..... +. ....... ....+.+ +.++-....|+..+.-+-.-|+++. T Consensus 31 ~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~r--i~~n~~~~ivd~~~~yl~g~~~~~~ 108 (503) T protein:vir:59 31 TTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNR--TSHAWHKLFVDQKTQYLVGEPVTFT 108 (503) T ss_pred HHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccccce--eecchHHHHHHHHHhhhhcCCeeec Confidence 1111111100 0000000000000 00 0000000 0000011 1233345566767776666676643 Q ss_pred cch--hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-ce----e-EEEE Q lcl|NC_018285. 67 RKQ--MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-NG----L-YYNV 138 (383) Q Consensus 67 ~~~--~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~~----~-~y~~ 138 (383) ..+ ....+..-.. -........+..++..+|.+|+.+.++.+|++ .+..++|..+....++.. .. + +|.. 
T Consensus 109 ~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~-~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~ 186 (503) T protein:vir:59 109 SDNKTLLEYVNELAD-DDFDDILNETVKNMSNKGIEYWHPFVDEEGEF-DYVIFPAEEMIVVYKDNTRRDILFALRYYSY 186 (503) T ss_pred cCcHHHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEeecCCCce-EEEEEccceeEEEEeCCCCCceEEEEEEEEE Confidence 222 1111111111 13455666788899999999999999988876 477888888877665432 11 1 1111 Q ss_pred eecCcccc-cceeecccceEEeccC----------------------------------CCCccccCcchHHHHHHHHHH Q lcl|NC_018285. 139 TFDDPRIP-PKQHVPQSDILHFRLL----------------------------------SVDGGLTSVSPLMALGRELDI 183 (383) Q Consensus 139 ~~~~~~~~-~~~~~~~~dvih~~~~----------------------------------~~~~~~~G~s~~~~~~~~i~~ 183 (383) ....+... ....+.+..+.++... .......|.|-+..+...++. T Consensus 187 ~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa 266 (503) T protein:vir:59 187 KGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDN 266 (503) T ss_pred ecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEecCCCCCCcchhhhHHHHHH Confidence 11110000 1112222222222110 001123477777777766666 Q ss_pred HHHHHHHHHHHHhccCCcceeEeecCCCC-HHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHH Q lcl|NC_018285. 184 QKASDKLTLNSLKNALNANGILKIKGGGL-LDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTT 262 (383) Q Consensus 184 ~~~~~~~~~~~~~ng~~~~~i~~~~~~~~-~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~ 262 (383) ...+.......+...+.|-.+++.-.... ++....+. .++++.++++.+.+.+..+.....+.+..+... 
T Consensus 267 ~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~ 337 (503) T protein:vir:59 267 YDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANLR---------YHSVIKVSGDGGVDTLRAEIPVDSAAKELERIQ 337 (503) T ss_pred HHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhhh---------cccceeccCCCcceeEeccCCHHHHHHHHHHHH Confidence 66555555555666667766665322211 11111111 133455555544444444433444455555555 Q ss_pred HHHHHHhcCC---HHHhcccccCcCH----------HHHHHHHHHHHHHHHHHHHHHHHHHhhcch------hhccchhh Q lcl|NC_018285. 263 GQFAKVYGIP---ENVVGGQGDQQSS----------LEMSSNVYSKAVARYLRPFLSELSQKLSCD------VDADIFPA 323 (383) Q Consensus 263 ~~Ia~~~gVp---p~~lg~~~~~~~~----------~e~~~~~~~~~l~P~~~~i~~~l~~~l~~~------~e~~~~~~ 323 (383) +.|+..-++| +..+++..++... .+..+..+...|.-+++.|...++..-... +.+..... T Consensus 338 ~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~ 417 (503) T protein:vir:59 338 DELYKSAQAVDNSPETIGGGATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRT 417 (503) T ss_pred HHHHHHhcccCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCC Confidence 5665555555 3333332221111 111122333344433333333333211111 11112223 Q ss_pred hccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHh--------CCCCC--CCCCCCCCCCC Q lcl|NC_018285. 324 VDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGE--------NPNRT--ILKGGETNGQD 383 (383) Q Consensus 324 ~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~--------~~~~~--~~~ggd~~~~d 383 (383) .-.+..+.+..+.+++.+|+++...+.++++.-.=+.-|+.+++ ..... 
+..|.++++++ T Consensus 418 ~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 487 (503) T protein:vir:59 418 RIQNDSEIVQSLVQGVTGGIMSKETAVARNPFVQDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEED 487 (503) T ss_pred CCCCHHHHHHHHHHHHhCCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcC Confidence 33566778888889999999999999888643210012322221 11111 11122222222 No 178 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=97.99 E-value=7.9e-06 Score=48.55 Aligned_cols=373 Identities=11% Similarity=0.013 Sum_probs=165.1 Q ss_pred Cchhhhhhc---CCcccccc--------ccccc---chhhccc--ccCCceechhhhhccHHHHHHHHHHHHhhhhCcee Q lcl|NC_018285. 1 MPIFNLATE---SPPNNQGG--------FFDIT---DPEFLAT--LNGSEWVSAETALKNSDLFSIISQLSNDLATAKLT 64 (383) Q Consensus 1 Mglf~~~~~---~~~~~~~~--------~~~~~---~~~~~~~--~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~ 64 (383) |+.-+-++. .+....+. +..+- .+.+... ...+.... ..-+...-....++..|+-+..-|.. T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~-~~~~~~n~~k~i~~~~a~~l~~~p~~ 94 (496) T protein:vir:38 16 MGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVN-RRQLSMNLPKVTAKYMSKLLFNEKVK 94 (496) T ss_pred hccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccc-cceeecchHHHHHHHHhhhhhCCcce Confidence 333221111 11100000 00000 0000000 00000000 11122233445677777777666665 Q ss_pred ee--cchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCce---eE---- Q lcl|NC_018285. 65 TS--RKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNG---LY---- 135 (383) Q Consensus 65 ~~--~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~---~~---- 135 (383) +. +......+.+-...-......+.++.+...+|.+|+.+..|.+|.+ .+.+++|..+-+...+.+.. 
.+ T Consensus 95 i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i~~v~~~~~~P~~~~~~~~~~~~f~~~~ 173 (496) T protein:vir:38 95 INIDDKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KVSFATADCMYPLSNDSENVDECVIANSF 173 (496) T ss_pred EeeCChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcE-EEEEEcccceEEEEecCCcEEEEEEEEEE Confidence 43 2222222222222234555667778899999999999999988875 35666666655433322211 00 Q ss_pred ------E-------------EEee-----cCc-ccccce-------------ee---cccceEEeccCCC----CccccC Q lcl|NC_018285. 136 ------Y-------------NVTF-----DDP-RIPPKQ-------------HV---PQSDILHFRLLSV----DGGLTS 170 (383) Q Consensus 136 ------y-------------~~~~-----~~~-~~~~~~-------------~~---~~~dvih~~~~~~----~~~~~G 170 (383) | .+.. .+. ..+..+ .+ +.--+.|++.+-+ .....| T Consensus 174 ~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G 253 (496) T protein:vir:38 174 HKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLG 253 (496) T ss_pred EeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccceeecCCCcceEEEecCCcccccccCCcCC Confidence 1 0000 000 000000 00 1111334443221 233569 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEe-----ecCCCCHHHHHHHHHHHHHhhcCCccee---ecCCCcee Q lcl|NC_018285. 171 VSPLMALGRELDIQKASDKLTLNSLKNALNANGILK-----IKGGGLLDFKTKVSRSRQAMKQMQGGPL---VLDDLEDF 242 (383) Q Consensus 171 ~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~-----~~~~~~~e~~~~~~~~~~~~~~~~g~~~---vl~~g~~~ 242 (383) .|.+..+...+.....+.....+-++. +.+..++. .......+... .+.. ....-... -.+++..+ T Consensus 254 ~Sd~~~~~~lid~ld~~~s~~~~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~----~~~~-~~~~~~~~~~~~~~~~~~i 327 (496) T protein:vir:38 254 ISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAVNLDGSTTQ----YFDS-TDEAFFLYQGDQDDNGKAI 327 (496) T ss_pred CchHhhHHHHHHHHHHHHHHHHHHHhh-cccceecchHHhhccCCCCCcccc----CCCC-ccceEEEeecCCCcccccc Confidence 999988888887776665555555554 34444431 11110000000 0000 00000011 11233456 Q ss_pred eecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCc-CHHHHH---HH----------HHHHHHHHHHHHHHHHH Q lcl|NC_018285. 
243 TPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ-SSLEMS---SN----------VYSKAVARYLRPFLSEL 308 (383) Q Consensus 243 ~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~-~~~e~~---~~----------~~~~~l~P~~~~i~~~l 308 (383) +.++.....-++.+..+....+|+..-|+||..+|...++. +..+.. +. .+..+|..+++.+.+-. T Consensus 328 ~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~ 407 (496) T protein:vir:38 328 KDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVG 407 (496) T ss_pred eeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77777766667888888999999999999999998654433 222221 11 12223333333332211 Q ss_pred HHhhc------ch--hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchh----HHhCCCC----C Q lcl|NC_018285. 309 SQKLS------CD--VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELP----KGENPNR----T 372 (383) Q Consensus 309 ~~~l~------~~--~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~----~~~~~~~----~ 372 (383) +.... .. +.+......-.+..+.+..+.+++.+|+++.-.+++.. ++++..++. +.+.-.. . T Consensus 408 ~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~--~~~~d~ea~~el~ri~~E~~~~~~~ 485 (496) T protein:vir:38 408 KFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRA--WNITEAEADEWAEMLAKEKQAEMPN 485 (496) T ss_pred HHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCChHHHHHHHHHHHHhhhccCcc Confidence 11000 11 11111223334567778888899999999988887643 344444432 2211111 1 Q ss_pred CCCCCCCCCCC Q lcl|NC_018285. 
373 ILKGGETNGQD 383 (383) Q Consensus 373 ~~~ggd~~~~d 383 (383) +..||-.+++| T Consensus 486 ~d~~~~~~~~e 496 (496) T protein:vir:38 486 NDMNGIFGEEE 496 (496) T ss_pred ccccCCCCCCC Confidence 11122222222 No 179 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=97.97 E-value=8.6e-06 Score=48.36 Aligned_cols=368 Identities=9% Similarity=0.014 Sum_probs=161.9 Q ss_pred CchhhhhhcCC---cccccccccccchhhcccc-cCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhh---hh Q lcl|NC_018285. 1 MPIFNLATESP---PNNQGGFFDITDPEFLATL-NGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQ---GI 73 (383) Q Consensus 1 Mglf~~~~~~~---~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~---~l 73 (383) -.+.+....+. ......+..... .+.... .... .+.+ +.++-....|+..+.-+-+-|+++.-.+.. .. T Consensus 31 ~~~i~~~~~~~~~~~~~l~~Yy~g~~-~i~~~~~~~~~-~~~k--i~~n~~~~Ivd~~~~~l~g~p~~~~~~~d~~~~~~ 106 (470) T protein:vir:99 31 LGFIAYNETVLKPRYRENMKLYLGKH-KILTAPEKETG-ADNR--IVVNSAKYVVDVYNGYFCGIEPKLALLNDSSKIDE 106 (470) T ss_pred HHHHHHHHHhhHHHHHHHHHHhcccc-ccccCcccccC-Ccce--eecchHHHHHHHHhhhhccCCeeEeeCCchhHHHH Confidence 11111100000 000000000000 000000 0000 0011 122233344555555544556655322211 11 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCce-e--EEEEe-ecCcccc--c Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNG-L--YYNVT-FDDPRIP--P 147 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~-~--~y~~~-~~~~~~~--~ 147 (383) +.+-............+..+.+.+|.+|+.+.++.+|++ .+..++|..+.+..++.... . +.++. ....... . 
T Consensus 107 l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~-~i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~~ 185 (470) T protein:vir:99 107 IARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARP-HLMYSSPNHAFIIYDDTVQRQPLAFVHYQIDNSNNWTDAY 185 (470) T ss_pred HHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeE-EEEEEccceeEEEEcCCCCcceEEEEEEEEEecCCeeEEE Confidence 222222335566777888999999999999999988886 47778888887776554321 1 11111 1100000 0 Q ss_pred ceeecccceEEeccC----------------------CCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeE Q lcl|NC_018285. 148 KQHVPQSDILHFRLL----------------------SVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGIL 205 (383) Q Consensus 148 ~~~~~~~dvih~~~~----------------------~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~ 205 (383) ...+..+.+.+++.. .......|.|-+..+...++....+.......+...+.|-.++ T Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i 265 (470) T protein:vir:99 186 GVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYM 265 (470) T ss_pred EEEEecCeEEEEEecccccccccccccccCCCccceEeecCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeee Confidence 011122222221110 0111234777777777777766666666666666667777776 Q ss_pred eecCCCCHHHHHHHHHHHHHhhcCCcceeec-----CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc Q lcl|NC_018285. 206 KIKGGGLLDFKTKVSRSRQAMKQMQGGPLVL-----DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG 280 (383) Q Consensus 206 ~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl-----~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~ 280 (383) ..-....++.-+ ....+. . 
.+++.+ +.+.++..++.......+.+..+...+.|+..-++|+...+..+ T Consensus 266 ~g~~~~~~~~g~-~~~~~~---~--~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~ 339 (470) T protein:vir:99 266 IGFKLPEDDEGN-PKFDFK---N--NRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFA 339 (470) T ss_pred ecCCcccccccc-hhhhhh---h--cceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccc Confidence 653222221111 111111 1 222222 23445555555555555667778888999999999975554322 Q ss_pred cCcCHHHHH-------------HHHHHHHHHHHHHHHHHHHHHhhcch-----hhccchhhhccCHHHHHHHHHHHHhCC Q lcl|NC_018285. 281 DQQSSLEMS-------------SNVYSKAVARYLRPFLSELSQKLSCD-----VDADIFPAVDPTGANYISRINSMVKSG 342 (383) Q Consensus 281 ~~~~~~e~~-------------~~~~~~~l~P~~~~i~~~l~~~l~~~-----~e~~~~~~~~~~~~~~~~~~~~l~~~g 342 (383) .+.+..... +..+...|.-.++.+...+..+-... +++...+..-.+..+.+..+.++. | T Consensus 340 ~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~--g 417 (470) T protein:vir:99 340 GNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNAE--G 417 (470) T ss_pred cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHHh--c Confidence 222221111 11222233333332222222111111 111122222345566666666664 8 Q ss_pred CcCHHHHHHHhhcCCcCCc-chhHH------------hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 343 TLAQNQGLYILQQAEILPK-ELPKG------------ENPNRTILKGGETNGQD 383 (383) Q Consensus 343 ~~t~nE~r~~lg~~~~~~~-d~~~~------------~~~~~~~~~ggd~~~~d 383 (383) +++...+++.++. +++. 
|+.++ ......+..++|.+++| T Consensus 418 iis~et~l~~l~~--vd~~~E~eri~~E~~~~~~~~~~~~~~~d~~~~d~~~ee 469 (470) T protein:vir:99 418 IVSKKTQLGMIPD--IEPDAEMKQIAKEKADAIKQTQQLSMPIDILKRDNNAEE 469 (470) T ss_pred cCCHHHHHHhCCC--CCHHHHHHHHHHHHHHHHHHHHhhcCCCCcCCCCCCccC Confidence 8999888887643 3322 22211 11222334444555555 No 180 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=97.96 E-value=9.1e-06 Score=48.23 Aligned_cols=360 Identities=10% Similarity=-0.034 Sum_probs=143.2 Q ss_pred CchhhhhhcCCcc--cccccccccchhhcccccCCceechh-hhh----ccHHHHHHHHHHHHhhhhCceeeecchhhhh Q lcl|NC_018285. 1 MPIFNLATESPPN--NQGGFFDITDPEFLATLNGSEWVSAE-TAL----KNSDLFSIISQLSNDLATAKLTTSRKQMQGI 73 (383) Q Consensus 1 Mglf~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~-~a~----~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l 73 (383) =.|+.....+.+. ....+.....+ ....... ...+ .++ .+.-..-+|+.++..+---.|.+-+...... T Consensus 33 ~~l~~~~~~~~~rl~~l~~YY~G~~~--~~~~~~~--~~~~~~~~~~~~v~n~~~~ivd~~a~~l~~~gf~~~d~~~~~~ 108 (501) T protein:vir:25 33 ADMWRLHISERQWLDRIYEYTKGLRG--RPEVPEG--ASDEVKELAKLSVKNVLSLVRDSFAQNLSVVGYRNALAKENDP 108 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCC--chhcccc--CChhhhhhHhhhhcChHHHHHHHHHhhhcccceecCCccchHH Confidence 0233222111110 00001000000 0000000 0000 000 0111223444444433223344333222221 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-c-----eeEEEEeecC-cccc Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-N-----GLYYNVTFDD-PRIP 146 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~-----~~~y~~~~~~-~~~~ 146 (383) ..+-............+..+++.+|.||+.+.++.+|. .+..++|..+.....+.. . .+.|...... .... 
T Consensus 109 l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~~--~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~ 186 (501) T protein:vir:25 109 AWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEGP--VFRTRSPRQILAVYADPSVDAWPQYALETWVAQKDAKPHR 186 (501) T ss_pred HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCC--eEEEeccccEEEEEecCCCCcceeEEEEEEeeccccCcce Confidence 12222222234555677888999999999999988874 355678888876543221 1 1111111000 0000 Q ss_pred ccee----------------------------------------------ecccceEEeccCCCCccccCcchHHHHHHH Q lcl|NC_018285. 147 PKQH----------------------------------------------VPQSDILHFRLLSVDGGLTSVSPLMALGRE 180 (383) Q Consensus 147 ~~~~----------------------------------------------~~~~dvih~~~~~~~~~~~G~s~~~~~~~~ 180 (383) .... |..--|+|+.+..... -.|.|-++.+... T Consensus 187 ~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~~~-~~g~sdie~v~~l 265 (501) T protein:vir:25 187 RGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRDAD-DMIVGEVAPLILL 265 (501) T ss_pred eEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccceeeEeccCccccC-ccccchhhhhHHH Confidence 0000 1111245554322112 2467755544433 Q ss_pred HHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecC-CCceeeecccChhhHHHHHHHH Q lcl|NC_018285. 181 LDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVAQLLKQAD 259 (383) Q Consensus 181 i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~~~~e~~~ 259 (383) ++..+.+.-.......-.+.|..++..-. .+.... |. ...+++++++ ++.++..+....-+ .+.+..+ T Consensus 266 ~Da~~~~~s~~~~~~e~~a~p~~~i~G~~---~~~~~~----~~---~~~~~i~~~~~~~~~~~q~~~~~~~-~~~~~l~ 334 (501) T protein:vir:25 266 QQAINSVNFDRLIVSRFGANPQRVISGWT---GSKAEV----LK---ASALRVWTFEDPEVKAQAFPPASVE-PYNLILE 334 (501) T ss_pred HHHHHHHHHHHHHHHHhhccHHHHHhCCC---CCccch----hh---hcccceeccCCCCceEEEecccChH-HHHHHHH Confidence 33333333333333333333433332211 111111 11 2235566665 46676655433222 3788889 Q ss_pred HHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhc------chhh--------ccc----h Q lcl|NC_018285. 
260 WTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLS------CDVD--------ADI----F 321 (383) Q Consensus 260 ~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~------~~~e--------~~~----~ 321 (383) ....+|+..=++|++.+|+.+++.+.+ +. .+....|.-.+...+..|...|- ..++ +++ . T Consensus 335 ~~i~~i~~~s~~P~~~~~~~~~N~Sg~-Al-~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~ 412 (501) T protein:vir:25 335 EMLQHVAMVAQISPAQVTGKMINVSAE-AL-AAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDTAADSGAEVLWR 412 (501) T ss_pred HHHHHHHhhcCCChhhhccccCChHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccceeeeEEec Confidence 999999999999999999766655333 22 12122222222222222222211 1110 111 1 Q ss_pred hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHh-------------C-CCCCCCCCCCCCCCC Q lcl|NC_018285. 322 PAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGE-------------N-PNRTILKGGETNGQD 383 (383) Q Consensus 322 ~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~-------------~-~~~~~~~ggd~~~~d 383 (383) ...-.+..+.+..+.++...|+ +.-.+..+ ++++++.++.+.+ . .+..+..+.+.++++ T Consensus 413 ~~~~~s~~~~ada~~kl~~~gi-s~et~~~~--~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (501) T protein:vir:25 413 DTEARSFGAVVDGITKLASAGI-PIEHLLSM--VPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQA 485 (501) T ss_pred CCCCCCHHHHHHHHHHHHhcCC-CHHHHHHH--cCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCC Confidence 1122355677777778887775 44333322 2344443322110 0 111111111111111 No 181 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=97.96 E-value=9.2e-06 Score=48.20 Aligned_cols=372 Identities=11% Similarity=0.014 Sum_probs=167.4 Q ss_pred CchhhhhhcCCccccc--------ccc----ccc-ch-------hhcc-cccCCce--echhhhhccHHHHHHHHHHHHh Q lcl|NC_018285. 
1 MPIFNLATESPPNNQG--------GFF----DIT-DP-------EFLA-TLNGSEW--VSAETALKNSDLFSIISQLSND 57 (383) Q Consensus 1 Mglf~~~~~~~~~~~~--------~~~----~~~-~~-------~~~~-~~~~~~~--~~~~~a~~~~~v~~~i~~ia~~ 57 (383) ||+|..+++....+.. +.. ... +. .+.. .|....+ +. ..-+..+.-..+++.+|+- T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~-~~~~~~~l~~~i~~~~A~l 79 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVH-DKLMNSGTGNEIVVVAAEY 79 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccc-cccccCChHHHHHHHHHHh Confidence 9999998764332211 110 000 00 0000 1111100 11 1112233344567777777 Q ss_pred hhhCcee--eecc---hhhhhccCCCcc---CCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcC Q lcl|NC_018285. 58 LATAKLT--TSRK---QMQGIVDNPSNS---ANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLD 129 (383) Q Consensus 58 ia~~p~~--~~~~---~~~~l~~~PN~~---~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~ 129 (383) +..-|.. |... +...+...-+.. -.........+.+.+..|.+++.+..+ +|++ .+.+++++.+-+...+ T Consensus 80 l~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-~~~~-~i~~v~ad~~~P~~~~ 157 (518) T protein:vir:78 80 ISGKPLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL-NGRP-SISVHSSSQFWIDFKN 157 (518) T ss_pred hcCCCceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE-CCee-EEEEEcCCeeEEEeec Confidence 7665544 3211 112222222221 122333445566677788888776665 3553 4566666655443211 Q ss_pred C--------------CceeEE-------------------------EEeecCcccc---ccee--------ec------- Q lcl|NC_018285. 130 N--------------QNGLYY-------------------------NVTFDDPRIP---PKQH--------VP------- 152 (383) Q Consensus 130 ~--------------~~~~~y-------------------------~~~~~~~~~~---~~~~--------~~------- 152 (383) . ....+| ....++.... .... .. 
T Consensus 158 g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~ 237 (518) T protein:vir:78 158 NEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQLN 237 (518) T ss_pred CcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccccccccccccccccccccCccc Confidence 0 001111 1110100000 0000 00 Q ss_pred --------ccceEEeccCCCC----ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEee-----cCCC-CHH Q lcl|NC_018285. 153 --------QSDILHFRLLSVD----GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKI-----KGGG-LLD 214 (383) Q Consensus 153 --------~~dvih~~~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~-----~~~~-~~e 214 (383) .--+.|+++..++ +...|+|.+..+...+...+........-|+. +.+..++.. .... ... T Consensus 238 ~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~ 316 (518) T protein:vir:78 238 HSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKVNKSTDK 316 (518) T ss_pred eeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCCCCCCCc Confidence 0012333432222 23459999999999988888887777777766 445544421 1111 000 Q ss_pred HHHHHHHHHHHhhcCCccee--ecCCCc----eeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHH Q lcl|NC_018285. 215 FKTKVSRSRQAMKQMQGGPL--VLDDLE----DFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEM 288 (383) Q Consensus 215 ~~~~~~~~~~~~~~~~g~~~--vl~~g~----~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~ 288 (383) ..-.+.. ..+.-..+ -.++|. .++.++....+.++.+..+...++|....|++|..+|......+..|. T Consensus 317 ~~~~fd~-----~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TATei 391 (518) T protein:vir:78 317 EEWSMNV-----DEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREVKATEI 391 (518) T ss_pred cccccCC-----CCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccccHHHH Confidence 0000000 00000000 012222 367778888888999999999999999999999999753222222222 Q ss_pred H-------------HHHHHHHHHHHHHHHHHHHHHhhcc----------hhhccchhhhccCHHHHHHHHHHHHhCCCcC Q lcl|NC_018285. 
289 S-------------SNVYSKAVARYLRPFLSELSQKLSC----------DVDADIFPAVDPTGANYISRINSMVKSGTLA 345 (383) Q Consensus 289 ~-------------~~~~~~~l~P~~~~i~~~l~~~l~~----------~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t 345 (383) . +..+..+|.-++..+.+.+...... ++.++....+-.|..+.+....+++.+|+|+ T Consensus 392 ~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aGimS 471 (518) T protein:vir:78 392 WSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSALAMS 471 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhcCCCC Confidence 1 1112222222222222211111000 0112223334456677777888899999999 Q ss_pred HHHHHHHhhcCCcCCcch----hHHh--C-----CCCCCCCCCCCCCC Q lcl|NC_018285. 346 QNQGLYILQQAEILPKEL----PKGE--N-----PNRTILKGGETNGQ 382 (383) Q Consensus 346 ~nE~r~~lg~~~~~~~d~----~~~~--~-----~~~~~~~ggd~~~~ 382 (383) +.++.+++. +++++.++ .+++ + ..+.++.|-+..++ T Consensus 472 ~e~~i~~~~-~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 472 VEEKVKLIH-PKWEDEEIQAEVKRIYLENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred HHHHHHHhC-CCCCHHHHHHHHHHHHHHhcccCCCCCccccCCCCCCC Confidence 999877653 23444332 2211 1 11112223333333 No 182 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=97.93 E-value=1e-05 Score=47.89 Aligned_cols=378 Identities=9% Similarity=-0.013 Sum_probs=162.3 Q ss_pred CchhhhhhcC---CcccccccccccchhhcccccCC--ceechhhhhccHHHHHHHHHHHHhhhhCceeeecc--hhhhh Q lcl|NC_018285. 1 MPIFNLATES---PPNNQGGFFDITDPEFLATLNGS--EWVSAETALKNSDLFSIISQLSNDLATAKLTTSRK--QMQGI 73 (383) Q Consensus 1 Mglf~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~--~~~~l 73 (383) ..+.+..... +......+.....+.+.-..... ...+.+ +.+.-....|+..+.-+-+-|+++.-. ..... 
T Consensus 46 ~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~ 123 (511) T protein:vir:96 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQYQDDDKDVLEA 123 (511) T ss_pred HHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcce--eecchHHHHHHHHHhhhccCCceeecCchHHHHH Confidence 1121111000 00000000000000000000000 000011 112223334455555555566665322 22233 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-e----eEEE-EeecCccccc Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-G----LYYN-VTFDDPRIPP 147 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-~----~~y~-~~~~~~~~~~ 147 (383) +..-+...........+..++..+|.||.++-++.+|.+ .+.+++|..+.+..++... . +.|. .....+.... T Consensus 124 l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~ 202 (511) T protein:vir:96 124 IEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDED 202 (511) T ss_pred HHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccc Confidence 333333334556667788899999999999999988875 5777888888877654321 1 1111 1000000000 Q ss_pred ----ceeecccceEEeccCC-------------------------CCccccCcchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_018285. 148 ----KQHVPQSDILHFRLLS-------------------------VDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNA 198 (383) Q Consensus 148 ----~~~~~~~dvih~~~~~-------------------------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 198 (383) ...+.+..+.++.... ......|.|-+..+...++....+..-..+.+... 
T Consensus 203 ~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~ 282 (511) T protein:vir:96 203 EVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (511) T ss_pred eEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHh Confidence 0122333333321100 00112477877777777776666665666666666 Q ss_pred CCcceeEeecCCCCHHHHHHHHHHHHH----hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_018285. 199 LNANGILKIKGGGLLDFKTKVSRSRQA----MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPEN 274 (383) Q Consensus 199 ~~~~~i~~~~~~~~~e~~~~~~~~~~~----~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~ 274 (383) +.|-.+++.....+.++..+..+.... .....+...-.+++.++.-++.......+....+...+.|+..-++|.. T Consensus 283 ~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~ 362 (511) T protein:vir:96 283 NDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 362 (511) T ss_pred hCceeeeecCccCCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 667776665444443333222221110 0011111112234555555555545556677778888999999999876 Q ss_pred HhcccccCcCHHHH-------------HHHHHHHHHHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHHH Q lcl|NC_018285. 275 VVGGQGDQQSSLEM-------------SSNVYSKAVARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYISR 334 (383) Q Consensus 275 ~lg~~~~~~~~~e~-------------~~~~~~~~l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~~ 334 (383) -.+..+.+.+.... .+..+...+.-.++.|...+..+-... +++...+.+-.+..+.+.. 
T Consensus 363 ~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~ 442 (511) T protein:vir:96 363 KDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKA 442 (511) T ss_pred ccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHH Confidence 55432222221111 112233333333333333333221111 1111122223445566666 Q ss_pred HHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHh------------CCCCCCCCC--CCCCCCC Q lcl|NC_018285. 335 INSMVKSGTLAQNQGLYILQQAEILPKELPKGE------------NPNRTILKG--GETNGQD 383 (383) Q Consensus 335 ~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~------------~~~~~~~~g--gd~~~~d 383 (383) +.++ .|+++.-.+++.++.-.=+..|+.+++ .....+... +++++++ T Consensus 443 ~~kl--~G~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (511) T protein:vir:96 443 YIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDT 503 (511) T ss_pred HHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCcc Confidence 6665 689999888887743210012222211 111111111 1111111 No 183 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=97.92 E-value=1.1e-05 Score=47.82 Aligned_cols=375 Identities=8% Similarity=-0.013 Sum_probs=156.0 Q ss_pred CchhhhhhcC-Cc--ccccccccccchhhcccccCC--ceechhhhhccHHHHHHHHHHHHhhhhCceeeecc--hhhhh Q lcl|NC_018285. 1 MPIFNLATES-PP--NNQGGFFDITDPEFLATLNGS--EWVSAETALKNSDLFSIISQLSNDLATAKLTTSRK--QMQGI 73 (383) Q Consensus 1 Mglf~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~--~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~--~~~~l 73 (383) ..+.+..... .+ .....+.....+.+.-..... .....+ +.+.-..-.|+..+.-+-+-|+++... ..... 
T Consensus 46 ~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~ 123 (511) T protein:vir:78 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQYQDDDKDVLEA 123 (511) T ss_pred HHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcce--eecchHHHHHHHHhhhhcccCceeecCchHHHHH Confidence 1111111000 00 000001000000000000000 000011 111223334454555444556554322 22222 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc--ee---E-EEEeecCcccc- Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN--GL---Y-YNVTFDDPRIP- 146 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~--~~---~-y~~~~~~~~~~- 146 (383) +..-+..-....+...+..++..+|.||.++-++.+|.+ .+..++|..+.+..++... .. . |......+... T Consensus 124 l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~ 202 (511) T protein:vir:78 124 IEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDED 202 (511) T ss_pred HHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccc Confidence 222232334445666788899999999999999988875 4677888888777654321 11 1 11110000000 Q ss_pred ---cceeecccceEEeccCC-------------------------CCccccCcchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_018285. 147 ---PKQHVPQSDILHFRLLS-------------------------VDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNA 198 (383) Q Consensus 147 ---~~~~~~~~dvih~~~~~-------------------------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 198 (383) ....+.++.+.++.... ......|.|-+..+...++....+..-..+.+... 
T Consensus 203 ~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~ 282 (511) T protein:vir:78 203 EVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (511) T ss_pred eEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHh Confidence 01123333343332110 00112477777777777776665555555555555 Q ss_pred CCcceeEeecCCCCHHHHHHHHHHHHHhhcC----CcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_018285. 199 LNANGILKIKGGGLLDFKTKVSRSRQAMKQM----QGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPEN 274 (383) Q Consensus 199 ~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~----~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~ 274 (383) +.|-.+++.....+.++.....+........ .....-.+.+.+..-++.......+....+...+.|+..-++|.. T Consensus 283 ~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~ 362 (511) T protein:vir:78 283 NDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 362 (511) T ss_pred hcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 6666666654444443333222211100000 000001122334433444444455677778888899999999876 Q ss_pred HhcccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcc-------hhhccchhhhccCHHHHHH Q lcl|NC_018285. 275 VVGGQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKLSC-------DVDADIFPAVDPTGANYIS 333 (383) Q Consensus 275 ~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~~-------~~e~~~~~~~~~~~~~~~~ 333 (383) -.+..+.+.+ ..+.+ ..+...+.-.++.|...+...-.. .+++...+.+-.+..+.+. 
T Consensus 363 ~~~~~~~n~S-g~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d 441 (511) T protein:vir:78 363 KDDNFSGTQS-GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELK 441 (511) T ss_pred cccccccccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHH Confidence 5543222222 22221 122233333333333222211110 0111112222344556666 Q ss_pred HHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHhC----------CCCCCCCCCCCCCCC Q lcl|NC_018285. 334 RINSMVKSGTLAQNQGLYILQQAEILP--KELPKGEN----------PNRTILKGGETNGQD 383 (383) Q Consensus 334 ~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~~----------~~~~~~~ggd~~~~d 383 (383) .+.++ .|+++.-.+.+.++. ++. .|+.+++. .+....++++++++. T Consensus 442 ~~~kl--~G~iS~et~l~~l~~--v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 499 (511) T protein:vir:78 442 AYIDS--GGKISQTTLMSLFSF--FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQ 499 (511) T ss_pred HHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCC Confidence 66666 488999888877642 221 23322110 111111122111111 No 184 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=97.92 E-value=1.1e-05 Score=47.82 Aligned_cols=375 Identities=8% Similarity=-0.013 Sum_probs=156.0 Q ss_pred CchhhhhhcC-Cc--ccccccccccchhhcccccCC--ceechhhhhccHHHHHHHHHHHHhhhhCceeeecc--hhhhh Q lcl|NC_018285. 1 MPIFNLATES-PP--NNQGGFFDITDPEFLATLNGS--EWVSAETALKNSDLFSIISQLSNDLATAKLTTSRK--QMQGI 73 (383) Q Consensus 1 Mglf~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~--~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~--~~~~l 73 (383) ..+.+..... .+ .....+.....+.+.-..... .....+ +.+.-..-.|+..+.-+-+-|+++... ..... 
T Consensus 46 ~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~ 123 (511) T protein:vir:96 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQYQDDDKDVLEA 123 (511) T ss_pred HHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcce--eecchHHHHHHHHhhhhcccCceeecCchHHHHH Confidence 1111111000 00 000001000000000000000 000011 111223334454555444556554322 22222 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc--ee---E-EEEeecCcccc- Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN--GL---Y-YNVTFDDPRIP- 146 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~--~~---~-y~~~~~~~~~~- 146 (383) +..-+..-....+...+..++..+|.||.++-++.+|.+ .+..++|..+.+..++... .. . |......+... T Consensus 124 l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~ 202 (511) T protein:vir:96 124 IEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDED 202 (511) T ss_pred HHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccc Confidence 222232334445666788899999999999999988875 4677888888777654321 11 1 11110000000 Q ss_pred ---cceeecccceEEeccCC-------------------------CCccccCcchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_018285. 147 ---PKQHVPQSDILHFRLLS-------------------------VDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNA 198 (383) Q Consensus 147 ---~~~~~~~~dvih~~~~~-------------------------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 198 (383) ....+.++.+.++.... ......|.|-+..+...++....+..-..+.+... 
T Consensus 203 ~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~ 282 (511) T protein:vir:96 203 EVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (511) T ss_pred eEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHh Confidence 01123333343332110 00112477777777777776665555555555555 Q ss_pred CCcceeEeecCCCCHHHHHHHHHHHHHhhcC----CcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_018285. 199 LNANGILKIKGGGLLDFKTKVSRSRQAMKQM----QGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPEN 274 (383) Q Consensus 199 ~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~----~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~ 274 (383) +.|-.+++.....+.++.....+........ .....-.+.+.+..-++.......+....+...+.|+..-++|.. T Consensus 283 ~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~ 362 (511) T protein:vir:96 283 NDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 362 (511) T ss_pred hcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 6666666654444443333222211100000 000001122334433444444455677778888899999999876 Q ss_pred HhcccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcc-------hhhccchhhhccCHHHHHH Q lcl|NC_018285. 275 VVGGQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKLSC-------DVDADIFPAVDPTGANYIS 333 (383) Q Consensus 275 ~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~~-------~~e~~~~~~~~~~~~~~~~ 333 (383) -.+..+.+.+ ..+.+ ..+...+.-.++.|...+...-.. .+++...+.+-.+..+.+. 
T Consensus 363 ~~~~~~~n~S-g~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d 441 (511) T protein:vir:96 363 KDDNFSGTQS-GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELK 441 (511) T ss_pred cccccccccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHH Confidence 5543222222 22221 122233333333333222211110 0111112222344556666 Q ss_pred HHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHhC----------CCCCCCCCCCCCCCC Q lcl|NC_018285. 334 RINSMVKSGTLAQNQGLYILQQAEILP--KELPKGEN----------PNRTILKGGETNGQD 383 (383) Q Consensus 334 ~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~~----------~~~~~~~ggd~~~~d 383 (383) .+.++ .|+++.-.+.+.++. ++. .|+.+++. .+....++++++++. T Consensus 442 ~~~kl--~G~iS~et~l~~l~~--v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 499 (511) T protein:vir:96 442 AYIDS--GGKISQTTLMSLFSF--FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQ 499 (511) T ss_pred HHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCC Confidence 66666 488999888877642 221 23322110 111111122111111 No 185 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=97.92 E-value=1.1e-05 Score=47.77 Aligned_cols=375 Identities=9% Similarity=0.007 Sum_probs=155.0 Q ss_pred CchhhhhhcCC-c--ccccccccccchhhcc-cccCC-ceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--hhhh Q lcl|NC_018285. 1 MPIFNLATESP-P--NNQGGFFDITDPEFLA-TLNGS-EWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--MQGI 73 (383) Q Consensus 1 Mglf~~~~~~~-~--~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--~~~l 73 (383) .++.++....+ + .....+.....+.+.- ..... .....+ +.+.-....|+..+.-+-+-|+++...+ .... 
T Consensus 46 ~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~ 123 (511) T protein:vir:99 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQYQDDDKDVLEA 123 (511) T ss_pred HHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcce--eecchHHHHHHHHHhhhcccCceeecCchHHHHH Confidence 12222111100 0 0000000000000000 00000 000001 1112223344444544445566653222 2222 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-e----eEE-EEeecCcccc- Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-G----LYY-NVTFDDPRIP- 146 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-~----~~y-~~~~~~~~~~- 146 (383) +..-+..-........+..++..+|.||.++.++.+|.+ .+..++|..+.+..++... . +.| ......+... T Consensus 124 l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~-~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~ 202 (511) T protein:vir:99 124 IEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDED 202 (511) T ss_pred HHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccc Confidence 222222224455666788899999999999999988875 5777888888776654321 1 111 1100000000 Q ss_pred ---cceeecccceEEeccCC-------------------------CCccccCcchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_018285. 147 ---PKQHVPQSDILHFRLLS-------------------------VDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNA 198 (383) Q Consensus 147 ---~~~~~~~~dvih~~~~~-------------------------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 198 (383) ....+.+..+.+++... ......|.|.+..+...++....+..-..+.+... 
T Consensus 203 ~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~ 282 (511) T protein:vir:99 203 EVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (511) T ss_pred eEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHh Confidence 00123333333332100 00112477777776666666555555555555555 Q ss_pred CCcceeEeecCCCCHHHHHHHHHHHHH----hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_018285. 199 LNANGILKIKGGGLLDFKTKVSRSRQA----MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPEN 274 (383) Q Consensus 199 ~~~~~i~~~~~~~~~e~~~~~~~~~~~----~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~ 274 (383) +.|-.+++.....+.+...+..+.... .....+...-.++|.++..++....+..+....+...+.|+..-++|.. T Consensus 283 ~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~ 362 (511) T protein:vir:99 283 NDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 362 (511) T ss_pred hchhhhhccCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 556666554333343333322221110 0011111122344555555555555556677778888899999999876 Q ss_pred HhcccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcc-------hhhccchhhhccCHHHHHH Q lcl|NC_018285. 275 VVGGQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKLSC-------DVDADIFPAVDPTGANYIS 333 (383) Q Consensus 275 ~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~~-------~~e~~~~~~~~~~~~~~~~ 333 (383) -.+..+.+.+ ..+.+ ..+...+.-.++.|...+...--. .+++......-.+..+.+. 
T Consensus 363 ~~~~~~gn~S-g~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~ 441 (511) T protein:vir:99 363 KDDNFSGTQS-GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELK 441 (511) T ss_pred ccccccccch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHH Confidence 5543222222 22221 122222222222222222211100 1111111222234455666 Q ss_pred HHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHh------------CCCC--CCCCCCCCCCCC Q lcl|NC_018285. 334 RINSMVKSGTLAQNQGLYILQQAEILP--KELPKGE------------NPNR--TILKGGETNGQD 383 (383) Q Consensus 334 ~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~------------~~~~--~~~~ggd~~~~d 383 (383) .+.++ .|+++.-.+++.++. ++. .|+.+++ .... .+...++.++++ T Consensus 442 ~~~kl--~GiiS~et~l~~l~~--v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (511) T protein:vir:99 442 AYIDS--GGKISQTTLMSLFSF--FQDPELEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDST 503 (511) T ss_pred HHHHH--hccCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCCC Confidence 66665 489999888887632 221 2222211 0000 011111111111 No 186 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=97.88 E-value=1.3e-05 Score=47.41 Aligned_cols=375 Identities=9% Similarity=0.007 Sum_probs=160.5 Q ss_pred CchhhhhhcC-Cc--ccccccccccchhhcccccCC-c-eechhhhhccHHHHHHHHHHHHhhhhCceeeecch--hhhh Q lcl|NC_018285. 1 MPIFNLATES-PP--NNQGGFFDITDPEFLATLNGS-E-WVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--MQGI 73 (383) Q Consensus 1 Mglf~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--~~~l 73 (383) .++.+..... .+ .....+.....+.+.-..... . ....+ +.+.-....|+..+.-+-+-|+++...+ .... 
T Consensus 46 ~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~ 123 (511) T protein:vir:10 46 SKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQYQDDDKDVLEA 123 (511) T ss_pred HHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcce--eecchHHHHHHHHhhhhcccCceeecCchHHHHH Confidence 1121111000 00 000000000000000000000 0 00011 1112233344444444445566654222 2222 Q ss_pred ccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-e----eEE-EEeecCccc-- Q lcl|NC_018285. 74 VDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-G----LYY-NVTFDDPRI-- 145 (383) Q Consensus 74 ~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-~----~~y-~~~~~~~~~-- 145 (383) +..-+..-........+..++..+|.||.++.++.+|.+ .+..++|..+.+..++... . +.| ......... T Consensus 124 l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~ 202 (511) T protein:vir:10 124 IEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDED 202 (511) T ss_pred HHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccc Confidence 222222234445566788899999999999999988875 4677888888776654321 1 111 110000000 Q ss_pred -cc-ceeecccceEEeccCC-------------------------CCccccCcchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_018285. 146 -PP-KQHVPQSDILHFRLLS-------------------------VDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNA 198 (383) Q Consensus 146 -~~-~~~~~~~dvih~~~~~-------------------------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 198 (383) .. ...+.++.+.++.... ....-.|.|-+..+...++....+..-..+.+... 
T Consensus 203 ~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~ 282 (511) T protein:vir:10 203 EVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (511) T ss_pred eEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHh Confidence 00 0122333333321100 00112477877777777776666555555566666 Q ss_pred CCcceeEeecCCCCHHHHHHHHHHHHH----hhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_018285. 199 LNANGILKIKGGGLLDFKTKVSRSRQA----MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPEN 274 (383) Q Consensus 199 ~~~~~i~~~~~~~~~e~~~~~~~~~~~----~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~ 274 (383) +.|-.+++.....+.+...+..+.... .....+...-.++|.++.-++....+..+....+...+.|+..-++|.. T Consensus 283 ~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~ 362 (511) T protein:vir:10 283 NDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 362 (511) T ss_pred hCceeeeeccccCCchhhccchhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 667776665444444333332222111 1111111122344555555555555556677778888889998899875 Q ss_pred HhcccccCcCHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhhcch-------hhccchhhhccCHHHHHH Q lcl|NC_018285. 275 VVGGQGDQQSSLEMS--------------SNVYSKAVARYLRPFLSELSQKLSCD-------VDADIFPAVDPTGANYIS 333 (383) Q Consensus 275 ~lg~~~~~~~~~e~~--------------~~~~~~~l~P~~~~i~~~l~~~l~~~-------~e~~~~~~~~~~~~~~~~ 333 (383) -.+..+.+.+ ..+. +..+...|.-.++.|...+...-... +++...+.+-.+..+.+. 
T Consensus 363 ~~~~~~~n~S-g~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~ 441 (511) T protein:vir:10 363 KDDNFSGTQS-GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELK 441 (511) T ss_pred ccccccccch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHH Confidence 4432222222 2221 12233333333333333332211111 111122223345566666 Q ss_pred HHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHh------------CCCCCC--CCCCCCCCCC Q lcl|NC_018285. 334 RINSMVKSGTLAQNQGLYILQQAEILP--KELPKGE------------NPNRTI--LKGGETNGQD 383 (383) Q Consensus 334 ~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~------------~~~~~~--~~ggd~~~~d 383 (383) .+.++ .|+++.-.+++.++. ++. .|+.+++ .....+ ..++++++++ T Consensus 442 ~~~kl--~G~iS~et~~~~l~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (511) T protein:vir:10 442 AYIDS--GGKISQTTLMSLFSF--FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDT 503 (511) T ss_pred HHHHH--hccCcHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcc Confidence 77776 488999888887743 221 2322211 011111 1111111111 No 187 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=97.85 E-value=1.5e-05 Score=47.09 Aligned_cols=363 Identities=11% Similarity=0.023 Sum_probs=150.7 Q ss_pred CchhhhhhcCCc--ccccccccccchhhc---ccccCC--ceechhhhhccHHHHHHHHHHHHhhhhCceeeecchh--h Q lcl|NC_018285. 1 MPIFNLATESPP--NNQGGFFDITDPEFL---ATLNGS--EWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQM--Q 71 (383) Q Consensus 1 Mglf~~~~~~~~--~~~~~~~~~~~~~~~---~~~~~~--~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~--~ 71 (383) -.+.+....+.. .....+.......+. .....+ ....+..-+.++-....|+..+.-+-+-|+++...+. . 
T Consensus 41 ~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~~d~~~~ 120 (483) T protein:vir:12 41 VRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVV 120 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhhhcccCceeccCChHHH Confidence 111111000000 000000000000000 000000 0000000012333444566666666566666532221 1 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-ceeEE--EE-eecCccccc Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-NGLYY--NV-TFDDPRIPP 147 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~~~~y--~~-~~~~~~~~~ 147 (383) ..+..-..+ ........+..++..+|.||+.+-.+.+|++ .+..++|..+.+..++.. ..+.+ ++ .... ... T Consensus 121 ~~l~~~~~n-~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~--~~~ 196 (483) T protein:vir:12 121 KRIDEVLGN-RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN--ETK 196 (483) T ss_pred HHHHHHHhc-cHHHHHHHHHHHHhhCCeEEEEEEEcCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeec--ceE Confidence 111111111 2234445667888999999999999999876 477788888877665322 11111 11 1000 000 Q ss_pred ceeecccc-----------------------------------eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 148 KQHVPQSD-----------------------------------ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTL 192 (383) Q Consensus 148 ~~~~~~~d-----------------------------------vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 192 (383) ...+.... |+++++ ...|.|-+..+...++....+..... 
T Consensus 197 ~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~ 271 (483) T protein:vir:12 197 VEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLS 271 (483) T ss_pred EEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecC-----CCCCCCchhhHHHHHHHHHHHHHHHH Confidence 11111122 222221 12477777776666666665555555 Q ss_pred HHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_018285. 193 NSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIP 272 (383) Q Consensus 193 ~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVp 272 (383) +.+...+.|..+++.-..... ....... ...+++.++.+.+...+..+..+..+....+...+.|+..-++| T Consensus 272 ~~~~~~~~~~lv~~g~~~~~~---~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p 343 (483) T protein:vir:12 272 NTFKDSNELTYVLTNYDDQEL---PEFKRLL-----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAV 343 (483) T ss_pred HHHHHhcCceeeeecCCcccc---hhHHHhh-----hhccccccCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCC Confidence 555656667666653222111 1111111 11234444555444444444455566777788888898888888 Q ss_pred HHHhcccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhh-cchhhccchhhhccCHHHHHHHHHH Q lcl|NC_018285. 273 ENVVGGQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKL-SCDVDADIFPAVDPTGANYISRINS 337 (383) Q Consensus 273 p~~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l-~~~~e~~~~~~~~~~~~~~~~~~~~ 337 (383) ..-.+..+.+.+ ..+.+ ..+...+...++.+...++.+. ..++++...+..-.+..+.+..+.+ T Consensus 344 ~~~~~~~~~n~S-g~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k 422 (483) T protein:vir:12 344 DFSSDKFGSAPS-GVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQ 422 (483) T ss_pred CCCccccccCcH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccceeeEEeCCCCCCCHHHHHHHHHH Confidence 644432222222 22221 1222223332222222221110 0011111122223455666666666 Q ss_pred HHhCCCcCHHHHHHHhhcCCcCCcchhH--------HhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
338 MVKSGTLAQNQGLYILQQAEILPKELPK--------GENPNRTILKGGETNGQD 383 (383) Q Consensus 338 l~~~g~~t~nE~r~~lg~~~~~~~d~~~--------~~~~~~~~~~ggd~~~~d 383 (383) + .|+++...+++.++.-.=+..|+.+ .+..+..+..+.|...++ T Consensus 423 l--~GiiS~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~ 474 (483) T protein:vir:12 423 S--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQ 474 (483) T ss_pred H--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccccccCCcccC Confidence 6 5899998888876432100122222 111211111111111122 No 188 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=97.77 E-value=2e-05 Score=46.29 Aligned_cols=361 Identities=13% Similarity=0.040 Sum_probs=153.5 Q ss_pred CchhhhhhcCCcc--cccccccccchhh----cccccCCceec-hhhhhccHHHHHHHHHHHHhhhhCceeeecch--hh Q lcl|NC_018285. 1 MPIFNLATESPPN--NQGGFFDITDPEF----LATLNGSEWVS-AETALKNSDLFSIISQLSNDLATAKLTTSRKQ--MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~--~~~~~~~~~~~~~----~~~~~~~~~~~-~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--~~ 71 (383) ..+.+...++... ....+.......+ ........... +..-+.++-....|+..+.-+-+-|+++.-.+ .. T Consensus 33 ~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~ 112 (474) T protein:vir:95 33 IRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDESVL 112 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCceeccCchHHH Confidence 1111111100000 0000000000000 00000000000 00001123333456666665556666653222 11 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-cee---EEEEeecCccccc Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-NGL---YYNVTFDDPRIPP 147 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~~~---~y~~~~~~~~~~~ 147 (383) ..+..-.. .........+..+...+|.||+.+.++.+|++ .+..++|..+-+..++.. +.+ .+.+..... .. 
T Consensus 113 ~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~--~~ 188 (474) T protein:vir:95 113 KIIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNE--EK 188 (474) T ss_pred HHHHHHHh-ccHHHHHHHHHHHHhhcCcEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEEcCe--eE Confidence 11111111 12344456677889999999999999888876 466778887776654321 111 111111100 01 Q ss_pred ceeecccc-----------------------------------eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 148 KQHVPQSD-----------------------------------ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTL 192 (383) Q Consensus 148 ~~~~~~~d-----------------------------------vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 192 (383) ...+.... |+++++ ...|.|-+..+...++....+..... T Consensus 189 ~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~ 263 (474) T protein:vir:95 189 VEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSLIDAIDKRLSDAQ 263 (474) T ss_pred EEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecC-----CCCCCCcHHHHHHHHHHHHHHHHHHH Confidence 11122222 233322 13467777766666666555555555 Q ss_pred HHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_018285. 193 NSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIP 272 (383) Q Consensus 193 ~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVp 272 (383) +.++..+.|-.+++.-...+.+. +... 
....+++.++++.+...++.+.....+.+..+...+.|+..-++| T Consensus 264 ~~~~~~~~p~lv~~g~~~~~~~~---~~~~-----~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 335 (474) T protein:vir:95 264 NMFDESVELIYILKGYEGQDLEE---FMRG-----LKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGV 335 (474) T ss_pred HHHHHhcCceeeeecCCcccchh---hhhh-----hhccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCc Confidence 55566666766655322211111 1111 122456666666666555555555667777888889999999998 Q ss_pred HHHhcccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcchhhccchhhhccCHHHHHHHHHHH Q lcl|NC_018285. 273 ENVVGGQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKLSCDVDADIFPAVDPTGANYISRINSM 338 (383) Q Consensus 273 p~~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~~~~e~~~~~~~~~~~~~~~~~~~~l 338 (383) ..-.+..+.+.+ ..+.+ ..+...+...++.|.+.+.... ...++.+. +.+..+...++.++.+ T Consensus 336 ~~~~~~~~~n~S-g~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~-d~~~i~v~-f~~~~p~d~~e~a~~~ 412 (474) T protein:vir:95 336 DFQTDKFGSAPS-GIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLKM-DVKDIEIS-FNFNRMMNDAEQSQII 412 (474) T ss_pred ccccccccccch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-ccceeeEE-eccCCCcCHHHHHHHH Confidence 633221111122 21211 1222233333333322221110 00011111 1122233445556667 Q ss_pred HhCCCcCHHHHHHHhhcCCcCC--cchhHH--------hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 339 VKSGTLAQNQGLYILQQAEILP--KELPKG--------ENPNRTILKGGETNGQD 383 (383) Q Consensus 339 ~~~g~~t~nE~r~~lg~~~~~~--~d~~~~--------~~~~~~~~~ggd~~~~d 383 (383) .+.|+++...+.+.++. ++. 
.|+.++ +........|+|.++++ T Consensus 413 ~~~g~iS~et~i~~l~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~ 465 (474) T protein:vir:95 413 AQSQYLSRETLVKSSPL--VDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQ 465 (474) T ss_pred HhcCCCchHHHHHhCCC--CCCHHHHHHHHHHHHHHHHhcccccccccCCCCcCC Confidence 78899999888887643 221 122221 11211222223322222 No 189 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=97.76 E-value=2.1e-05 Score=46.17 Aligned_cols=373 Identities=9% Similarity=0.046 Sum_probs=167.2 Q ss_pred CchhhhhhcCCc-----ccccccccccc-------hh-------hcccccC----------CceechhhhhccHHHHHHH Q lcl|NC_018285. 1 MPIFNLATESPP-----NNQGGFFDITD-------PE-------FLATLNG----------SEWVSAETALKNSDLFSII 51 (383) Q Consensus 1 Mglf~~~~~~~~-----~~~~~~~~~~~-------~~-------~~~~~~~----------~~~~~~~~a~~~~~v~~~i 51 (383) ||+|.+++.--. ........+++ +. +..+..+ ......+.-+....-..++ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTAS 80 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHH Confidence 999988643111 00011111110 00 0000000 0000111122223334455 Q ss_pred HHHHHhhhhCceee--ecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcC Q lcl|NC_018285. 52 SQLSNDLATAKLTT--SRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLD 129 (383) Q Consensus 52 ~~ia~~ia~~p~~~--~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~ 129 (383) +.+|+-+..-|..+ .+......+.+--..-......+..+......|.+++.+..+. |. 
+.+.++++..+-+...+ T Consensus 81 ~~~A~lv~~e~~~i~v~d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~-~~i~~v~ad~~~P~~~~ 158 (522) T protein:vir:47 81 KKIASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYIDG-DK-VRVAFIQAPVFFPLESN 158 (522) T ss_pred HHHhhhhcCCcceeecCChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcC-Cc-eEEEEEcCCceEEEEEc Confidence 66666665544433 2222222222111122334445566677777888888777764 33 23444444443332111 Q ss_pred -----------------CCceeEEE-------------------------Eee----c-C-cccccce------------ Q lcl|NC_018285. 130 -----------------NQNGLYYN-------------------------VTF----D-D-PRIPPKQ------------ 149 (383) Q Consensus 130 -----------------~~~~~~y~-------------------------~~~----~-~-~~~~~~~------------ 149 (383) +....+|. +.. . + ...|..+ T Consensus 159 ~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~ 238 (522) T protein:vir:47 159 TQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLE 238 (522) T ss_pred CCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccccccccccccCCC Confidence 11111111 100 0 0 0001100 Q ss_pred ---ee---cccceEEeccCCCC----ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcce----eEeecCCCC-HH Q lcl|NC_018285. 150 ---HV---PQSDILHFRLLSVD----GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANG----ILKIKGGGL-LD 214 (383) Q Consensus 150 ---~~---~~~dvih~~~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~----i~~~~~~~~-~e 214 (383) .+ +..-+.||+.+-++ +..+|+|.+..+...+...+........-|+-|...-. +++...... .+ T Consensus 239 ~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~ 318 (522) T protein:vir:47 239 PVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQYQRPDGT 318 (522) T ss_pred CceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHhccCCCCCCcc Confidence 00 01113466554332 34569999999998888777666666666665553211 122211111 00 Q ss_pred --HHHHHH--HH-HHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCc-CHHHH Q lcl|NC_018285. 
215 --FKTKVS--RS-RQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ-SSLEM 288 (383) Q Consensus 215 --~~~~~~--~~-~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~-~~~e~ 288 (383) ....+. +. +..... -.+.+-+++.++....+-++.+..+...+.|+...|+++..+|...+.. +..|. T Consensus 319 ~~~~~~fd~~~~~f~~~~~------~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi 392 (522) T protein:vir:47 319 IDFRPRFDVEQNVYMQIGG------SSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEI 392 (522) T ss_pred cccccccCcccceEeecCC------CCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHH Confidence 000000 00 111100 0123345777788778889999999999999999999999998644322 22222 Q ss_pred H-------------HHHHHHHHHHHHHHHHHHHHH-hhc-----c--hhhccchhhhccCHHHHHHHHHHHHhCCCcCHH Q lcl|NC_018285. 289 S-------------SNVYSKAVARYLRPFLSELSQ-KLS-----C--DVDADIFPAVDPTGANYISRINSMVKSGTLAQN 347 (383) Q Consensus 289 ~-------------~~~~~~~l~P~~~~i~~~l~~-~l~-----~--~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~n 347 (383) . +..+..+|..++..+.+..+. .++ . ++.++....+-.|.........+++.+|++++- T Consensus 393 ~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG~~s~e 472 (522) T protein:vir:47 393 VSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAGFSTKK 472 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHH Confidence 1 122333333333333322221 011 0 112233333456667777888899999999999 Q ss_pred HHHHHhhcCCcCCcch----hHH--hCCCCCC----CCCCCCCCCC Q lcl|NC_018285. 348 QGLYILQQAEILPKEL----PKG--ENPNRTI----LKGGETNGQD 383 (383) Q Consensus 348 E~r~~lg~~~~~~~d~----~~~--~~~~~~~----~~ggd~~~~d 383 (383) +++.++ .+++..++ .+. 
++....| .-||++..++ T Consensus 473 ~~i~~~--~g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~ 516 (522) T protein:vir:47 473 RAIGKT--LNISGVEAEKELNAINSELLPMNDAELAIYGMHDQNEE 516 (522) T ss_pred HHHHhc--CCCChHHHHHHHHHHHHhhccCCCCCCCCCCCCCcccc Confidence 988764 45554432 222 1111111 1122221111 No 190 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=97.75 E-value=2.2e-05 Score=46.12 Aligned_cols=363 Identities=11% Similarity=0.014 Sum_probs=151.0 Q ss_pred CchhhhhhcCCc--ccccccccccchhhc---ccccCCc--eechhhhhccHHHHHHHHHHHHhhhhCceeeecch--hh Q lcl|NC_018285. 1 MPIFNLATESPP--NNQGGFFDITDPEFL---ATLNGSE--WVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~--~~~~~~~~~~~~~~~---~~~~~~~--~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--~~ 71 (383) -.+.+....+.. .....+.......+. .....+. ..-...-+..+-....|+..+.-+-+-|+++...+ .. T Consensus 30 ~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~ 109 (472) T protein:vir:93 30 VRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVV 109 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhhhcccCeeeccCChHHH Confidence 111111000000 000000000000000 0000000 00000001223444566666666656666653222 11 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-ceeEEE--E-eecCccccc Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-NGLYYN--V-TFDDPRIPP 147 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~~~~y~--~-~~~~~~~~~ 147 (383) ..+..-..+ ........+..+++.+|.||+.+..+.+|++ .+..++|..+.+..++.. ..+.+. + ..... .. 
T Consensus 110 ~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~-~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~--~~ 185 (472) T protein:vir:93 110 KRIDEVLGN-RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE--TK 185 (472) T ss_pred HHHHHHHhc-cHHHHHHHHHHHHhhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeecc--ee Confidence 111111111 2334455677889999999999999988876 477788888877765321 111110 0 00000 00 Q ss_pred ceeeccc-----------------------------------ceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 148 KQHVPQS-----------------------------------DILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTL 192 (383) Q Consensus 148 ~~~~~~~-----------------------------------dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 192 (383) ...+... .|+++++ ...|.|-+..+...++....+..... T Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~s~~e~v~~liDa~~~~~s~~~ 260 (472) T protein:vir:93 186 VEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLS 260 (472) T ss_pred EEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHH Confidence 0011111 1333322 13477877777777766665555555 Q ss_pred HHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_018285. 193 NSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIP 272 (383) Q Consensus 193 ~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVp 272 (383) +.+...+.|-.+++.-..... ....... 
...+++.++.+.+...++....+..+....+...+.|+..-++| T Consensus 261 ~~~~~~~~~~~~~~g~~~~~~---~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 332 (472) T protein:vir:93 261 NTFKDSNELTYVLTNYDDQEL---PEFKRLL-----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAV 332 (472) T ss_pred HHHHHhcCceeEeecCCcccc---hhhHHHH-----hhccccccCCCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCC Confidence 566666777666653222111 1111111 12234445555555445545555667778888888999999998 Q ss_pred HHHhcccccCcCHHHHHHH--------------HHHHHHHHHHHHHHHHHHHhh-cchhhccchhhhccCHHHHHHHHHH Q lcl|NC_018285. 273 ENVVGGQGDQQSSLEMSSN--------------VYSKAVARYLRPFLSELSQKL-SCDVDADIFPAVDPTGANYISRINS 337 (383) Q Consensus 273 p~~lg~~~~~~~~~e~~~~--------------~~~~~l~P~~~~i~~~l~~~l-~~~~e~~~~~~~~~~~~~~~~~~~~ 337 (383) ..-.+..+.+.+ ..+.+. .+...+.-.++.+...++... ..++++......-.+..+.+..+.+ T Consensus 333 ~~~~~~~~~n~S-g~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~~~~~~k 411 (472) T protein:vir:93 333 DFSSDKFGSAPS-GVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQ 411 (472) T ss_pred CCCccccccCch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeCCCCCCCHHHHHHHHHH Confidence 654432222222 222211 112222222222222111110 0011111112222344555666666 Q ss_pred HHhCCCcCHHHHHHHhhcCCcCCcchhH--------HhCCCCCCCC------CCCCCCCC Q lcl|NC_018285. 338 MVKSGTLAQNQGLYILQQAEILPKELPK--------GENPNRTILK------GGETNGQD 383 (383) Q Consensus 338 l~~~g~~t~nE~r~~lg~~~~~~~d~~~--------~~~~~~~~~~------ggd~~~~d 383 (383) + .|+++.-.+.+.++.-.=+..|+.+ .+.....+.. ++|.+++. 
T Consensus 412 ~--~giis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~ 469 (472) T protein:vir:93 412 S--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNK 469 (472) T ss_pred H--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccCcCcccCCCCCCCCCCCcc Confidence 5 5889988888776432100112222 1112111111 11111111 No 191 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=97.74 E-value=2.3e-05 Score=45.99 Aligned_cols=363 Identities=11% Similarity=0.010 Sum_probs=151.9 Q ss_pred CchhhhhhcCCcc--cccccccccchhhc---ccccCCcee--chhhhhccHHHHHHHHHHHHhhhhCceeeecchh--h Q lcl|NC_018285. 1 MPIFNLATESPPN--NQGGFFDITDPEFL---ATLNGSEWV--SAETALKNSDLFSIISQLSNDLATAKLTTSRKQM--Q 71 (383) Q Consensus 1 Mglf~~~~~~~~~--~~~~~~~~~~~~~~---~~~~~~~~~--~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~--~ 71 (383) .++.+....+... ....+.....+.+. .....+... .+..-+.++-...+|+..+.-+-+-|+++...+. . T Consensus 50 ~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~ 129 (492) T protein:vir:97 50 VRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVV 129 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHhhhhcccCceeccCchHHH Confidence 1111111111000 00000000000000 000000000 0000012233444666666655566666532221 1 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-ceeEE--EEeecCcccccc Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-NGLYY--NVTFDDPRIPPK 148 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~~~~y--~~~~~~~~~~~~ 148 (383) ..+..-..+ ........+..+++.+|.||.++.++.+|++ .+..++|..+.+..++.. +.+.+ ++..... .... 
T Consensus 130 ~~l~~~~~n-~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~-~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~-~~~~ 206 (492) T protein:vir:97 130 KRIDEVLGN-RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN-ETKV 206 (492) T ss_pred HHHHHHHhc-cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeecc-ceeE Confidence 111111111 2334445677889999999999999988876 477788888877765322 11111 1100000 0001 Q ss_pred eeecccc-----------------------------------eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 149 QHVPQSD-----------------------------------ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLN 193 (383) Q Consensus 149 ~~~~~~d-----------------------------------vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 193 (383) ..+.... |+++++ ...|.|-+..+...++....+..-..+ T Consensus 207 ~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~ 281 (492) T protein:vir:97 207 EYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSN 281 (492) T ss_pred EEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecC-----CCCCCCchHhHHHHHHHHHHHHHHHHH Confidence 1111112 222221 124778787777777766665555566 Q ss_pred HHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCH Q lcl|NC_018285. 194 SLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPE 273 (383) Q Consensus 194 ~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp 273 (383) .+...+.|-.+++.-.... ........ ...+++.++.+.+...+.....+..+....+...+.|+..-++|. 
T Consensus 282 ~~~~~~~~~l~~~g~~~~~---~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~ 353 (492) T protein:vir:97 282 TFKDSNELTYVLKNYDDQE---LPEFKRLL-----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVD 353 (492) T ss_pred HHHHhccceeeeecCCccc---chhHHHHH-----hhccceecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCC Confidence 6666666766665322211 11111111 112344455554444444444555667777888888888888886 Q ss_pred HHhcccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhc-chhhccchhhhccCHHHHHHHHHHH Q lcl|NC_018285. 274 NVVGGQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKLS-CDVDADIFPAVDPTGANYISRINSM 338 (383) Q Consensus 274 ~~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~-~~~e~~~~~~~~~~~~~~~~~~~~l 338 (383) .-.+..+.+. +..+.+ ..+...+...++.+...++.+-- .++++...+..-.+..+.+..+.++ T Consensus 354 ~~~~~~~~n~-Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl 432 (492) T protein:vir:97 354 FSSDKFGSAP-SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQS 432 (492) T ss_pred CCccccccCc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeEEecCCCCCCHHHHHHHHHHH Confidence 4433211112 222221 11222222222222221111000 0111111222223455666666665 Q ss_pred HhCCCcCHHHHHHHhhcCCcCCcchhHH--------hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 339 VKSGTLAQNQGLYILQQAEILPKELPKG--------ENPNRTILKGGETNGQD 383 (383) Q Consensus 339 ~~~g~~t~nE~r~~lg~~~~~~~d~~~~--------~~~~~~~~~ggd~~~~d 383 (383) .|+++...+.+.++.-.=+..|+.++ +...... 
.+|.+++++ T Consensus 433 --~G~iS~et~l~~l~~v~d~~~Eleri~~E~~~~~~~~~~~~-~~~~~~~~~ 482 (492) T protein:vir:97 433 --MGIVSHETVLENHPFVEDLQAELERIEQEQTEYNKQLPNLD-DGGADSAQQ 482 (492) T ss_pred --hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccc-cCCCCCCcc Confidence 58999888888775322011233221 1111111 122222221 No 192 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=97.61 E-value=3.8e-05 Score=44.83 Aligned_cols=360 Identities=8% Similarity=-0.004 Sum_probs=155.9 Q ss_pred CchhhhhhcCCcc--cccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch--hhhhccC Q lcl|NC_018285. 1 MPIFNLATESPPN--NQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--MQGIVDN 76 (383) Q Consensus 1 Mglf~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--~~~l~~~ 76 (383) ..|.+........ ....+.......+..........+.+ +.++-....|+..+.-+-+-|+++.-.+ ....+.. T Consensus 7 ~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~k--i~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~l~~ 84 (429) T protein:vir:98 7 SELIQKHRSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDNR--LVVNFAKYIVDTFNGYFIGVPVQTSHENKQVSNYLEL 84 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccccccccccccCCCcce--eecchHHHHHHHHhhhhcccCceeecCChHHHHHHHH Confidence 2222221111000 00000000000000000000000011 1233445566766666666676653322 2222222 Q ss_pred CCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-eeEE--EEeecCcccccceeec- Q lcl|NC_018285. 77 PSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-GLYY--NVTFDDPRIPPKQHVP- 152 (383) Q Consensus 77 PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-~~~y--~~~~~~~~~~~~~~~~- 152 (383) -............+..+++.+|.||+.+.++.+|++ .+..++|..+.+..++... ...+ ++...... .....+. 
T Consensus 85 ~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~-~~~~~~~~ 162 (429) T protein:vir:98 85 LDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEA-GITYLTPLEAFIVYDDSIRQKPLFAVRYFYNKGG-VLEGSYSD 162 (429) T ss_pred HHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcE-EEEEEcccceEEEEeCCCCCceEEEEEEEEecCc-eEEEEEEe Confidence 222234455667788899999999999999999986 4667888888776554221 1111 01000000 0000111 Q ss_pred -------------------------ccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEee Q lcl|NC_018285. 153 -------------------------QSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKI 207 (383) Q Consensus 153 -------------------------~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~ 207 (383) .-.|+++++ ...|.|-+..+...++....+.....+.....+.|-.+++. T Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g 237 (429) T protein:vir:98 163 ASNITYFKDGEKGIEIGESEPHPFDGVPMIEYVE-----NEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILG 237 (429) T ss_pred CceEEEEEecCCceEecccccccCCccceEEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Confidence 112333332 23577877777777776666666666666667777777664 Q ss_pred cCCCCHHHHHHHHHHHHHhhcCCcceeecC----CCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCc Q lcl|NC_018285. 208 KGGGLLDFKTKVSRSRQAMKQMQGGPLVLD----DLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ 283 (383) Q Consensus 208 ~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~----~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~ 283 (383) - ..+++....++. ++++.++ .+.+...+..+.....+.+..+...+.|+..-++|..-.+..+ +. 
T Consensus 238 ~-~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-n~ 306 (429) T protein:vir:98 238 A-ELDDETLKSLRD---------TRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVANISDESFG-TA 306 (429) T ss_pred C-CCCcchhhhHhh---------CceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccc-cc Confidence 2 223322222211 2333332 1223333333333444566678888899998888853332211 12 Q ss_pred CHHHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcc----hhhccchhhhccCHHHHHHHHHHHHhCCCcC Q lcl|NC_018285. 284 SSLEMSSN--------------VYSKAVARYLRPFLSELSQKLSC----DVDADIFPAVDPTGANYISRINSMVKSGTLA 345 (383) Q Consensus 284 ~~~e~~~~--------------~~~~~l~P~~~~i~~~l~~~l~~----~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t 345 (383) +..+.+. .+...+.-.++.+..-++..-.. .+++...+..-.+..+.+..+.++ +|+++ T Consensus 307 -Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is 383 (429) T protein:vir:98 307 -SGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIKYKFTRNLPANLLEESQIAGNL--AGIVS 383 (429) T ss_pred -hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHH--hccCc Confidence 2222211 11112222222221111110000 011111222234555666666665 68999 Q ss_pred HHHHHHHhhcCCcCCcchhHHhC------CCCCCCCCCCCCCCC Q lcl|NC_018285. 346 QNQGLYILQQAEILPKELPKGEN------PNRTILKGGETNGQD 383 (383) Q Consensus 346 ~nE~r~~lg~~~~~~~d~~~~~~------~~~~~~~ggd~~~~d 383 (383) ...+.++++.-+=+.-|+.+++. 
.+....-+++++++| T Consensus 384 ~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 427 (429) T protein:vir:98 384 EETQVGVLSIVENPQKEIERKNSDKSTLISRQAGGLNGQNTTTI 427 (429) T ss_pred hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhcCCCCCCC Confidence 98888777432200112222111 011122244555555 No 193 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=97.59 E-value=4.1e-05 Score=44.65 Aligned_cols=372 Identities=11% Similarity=0.001 Sum_probs=146.9 Q ss_pred Cch----hhhhhcC----Cc--ccccccccccchhhcccccCCceechhhh---hccHHHHHHHHHHHHhhhhCceeeec Q lcl|NC_018285. 1 MPI----FNLATES----PP--NNQGGFFDITDPEFLATLNGSEWVSAETA---LKNSDLFSIISQLSNDLATAKLTTSR 67 (383) Q Consensus 1 Mgl----f~~~~~~----~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~a---~~~~~v~~~i~~ia~~ia~~p~~~~~ 67 (383) |.- ...+.+. .+ .....+...... .... +..+..+.+ +...-..-+|+..++-+---++.+-. T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~--i~~~--~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~ 76 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRR--LKTI--GIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc--chhc--ccccchhhhhhhhhcchHHHHHHHHHhhhccCceecCC Confidence 321 1111100 00 000000000000 0000 000111000 01112233444444443223343322 Q ss_pred ch-hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEee------cCCCceeEEEEeccceeEEEEcCCCc--e---eE Q lcl|NC_018285. 68 KQ-MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWR------NDNGRDMKWEYLRPSQVSFNRLDNQN--G---LY 135 (383) Q Consensus 68 ~~-~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r------~~~g~~~~l~~l~~~~v~~~~~~~~~--~---~~ 135 (383) .. ....+.+-...-........+..+++.+|.||+.+.+ +.+|.+ .+.+++|..+.+..+.... . +. 
T Consensus 77 d~~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~i~D~~~~~~~~~~i~ 155 (480) T protein:vir:78 77 DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) T ss_pred CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCee-EEEEEcccceEEEEcCCCccceEEEEE Confidence 11 1111111111113345567788999999999998875 345555 4777888888777654221 1 11 Q ss_pred EEEeecC----------------------ccc------cccee--ecccceEEeccCCCCccccCcchHHH-HHHHHHHH Q lcl|NC_018285. 136 YNVTFDD----------------------PRI------PPKQH--VPQSDILHFRLLSVDGGLTSVSPLMA-LGRELDIQ 184 (383) Q Consensus 136 y~~~~~~----------------------~~~------~~~~~--~~~~dvih~~~~~~~~~~~G~s~~~~-~~~~i~~~ 184 (383) |....++ ... +...+ +..--|+||.+....+..+|.|-+.. +...++.. T Consensus 156 ~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~ 235 (480) T protein:vir:78 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) T ss_pred EEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccchhHHHHHHHHHH Confidence 1110000 000 00000 11223566655443444567775542 33333332 Q ss_pred HHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecC-CCceeeecccChhhHHHHHHHHHHHH Q lcl|NC_018285. 185 KASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVAQLLKQADWTTG 263 (383) Q Consensus 185 ~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~~~~e~~~~~~~ 263 (383) ..+.-.......-.+.|..++... ...+...+.-...+. ...++++.++ ++.++..+.....+ .+++..+..+. 
T Consensus 236 ~~~~s~~~~~~~~~a~p~~~i~G~-~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l~~~i~ 310 (480) T protein:vir:78 236 SRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLD---IYYGRILTLASEAAKISEFKAAELR-NFAEEMEVFRK 310 (480) T ss_pred HHHHHHHHHHHHhhcchhhhhhCC-Cccccccccccchhh---hhhhhhccCCCCCceEEecCccCHH-HHHHHHHHHHH Confidence 222222222222233454444321 111110010001111 1123444443 45677665554433 37788888899 Q ss_pred HHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-----------chhhc---cc----hhhhc Q lcl|NC_018285. 264 QFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLS-----------CDVDA---DI----FPAVD 325 (383) Q Consensus 264 ~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~-----------~~~e~---~~----~~~~~ 325 (383) +|+..=++|++.+|+...+..+.++.+ +....|.-.+...+..|...|- ..... ++ ....- T Consensus 311 ~~~~~~~~p~~~fg~~~~n~~Sg~Al~-~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~ 389 (480) T protein:vir:78 311 EAASITGLPPQYLSSSSENPASAEAII-ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPST 389 (480) T ss_pred HHhcccCCCHHHhccccCchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCC Confidence 999999999999997554333333322 1112222222222222221111 01100 11 11123 Q ss_pred cCHHHHHHHHHHHHhCC--CcCHHHHHHHhhcCCcCCcchhH---H------hCCC--------CCCCC-CCCCCCCC Q lcl|NC_018285. 326 PTGANYISRINSMVKSG--TLAQNQGLYILQQAEILPKELPK---G------ENPN--------RTILK-GGETNGQD 383 (383) Q Consensus 326 ~~~~~~~~~~~~l~~~g--~~t~nE~r~~lg~~~~~~~d~~~---~------~~~~--------~~~~~-ggd~~~~d 383 (383) .+..+.+..+.+++.+| +++..-+++++|..+-+-.++.. . ..++ ..+.+ +||..+.. 
T Consensus 390 ~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (480) T protein:vir:78 390 PTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTET 467 (480) T ss_pred CCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCccccCCCCCCCCCcc Confidence 45566777788888765 67877778777654311111110 0 0011 11111 22221111 No 194 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=97.56 E-value=4.6e-05 Score=44.37 Aligned_cols=374 Identities=7% Similarity=0.037 Sum_probs=169.2 Q ss_pred CchhhhhhcC--Ccc---ccccc--------ccccch---------hh-cccccCC------ceechhhhhccHHHHHHH Q lcl|NC_018285. 1 MPIFNLATES--PPN---NQGGF--------FDITDP---------EF-LATLNGS------EWVSAETALKNSDLFSII 51 (383) Q Consensus 1 Mglf~~~~~~--~~~---~~~~~--------~~~~~~---------~~-~~~~~~~------~~~~~~~a~~~~~v~~~i 51 (383) ||+|.++++= +.. ...+. ..++.. .+ .+-+... .....+.-.....-...+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 9999987541 100 00011 111100 00 0000000 000011112223334455 Q ss_pred HHHHHhhhhCce--eeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcC Q lcl|NC_018285. 52 SQLSNDLATAKL--TTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLD 129 (383) Q Consensus 52 ~~ia~~ia~~p~--~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~ 129 (383) +.+|+-+..-|. .+.+......+.+--..-......+..+...+..|.+++.+..+. 
|.+ .+.++++..+-+...+ T Consensus 81 ~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~-~I~~v~ad~~~P~~~d 158 (500) T protein:vir:98 81 KKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKV-RVAFVQAPVFLPLQSN 158 (500) T ss_pred HHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeeEEEEEc Confidence 666666655443 333332332222222222344555666777888899888887764 333 4566666665443221 Q ss_pred CC-----------------ceeEE--------------EEe----ec-C-cccccce-------------e---ecccce Q lcl|NC_018285. 130 NQ-----------------NGLYY--------------NVT----FD-D-PRIPPKQ-------------H---VPQSDI 156 (383) Q Consensus 130 ~~-----------------~~~~y--------------~~~----~~-~-~~~~~~~-------------~---~~~~dv 156 (383) .. ...+| .+. .. + ...+..+ . ++..-+ T Consensus 159 ~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f 238 (500) T protein:vir:98 159 TQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIF 238 (500) T ss_pred CCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccE Confidence 11 11111 000 00 0 0001000 0 001113 Q ss_pred EEeccCCCC----ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeE-----eecCCC-CHHHHHHHHHHHHHh Q lcl|NC_018285. 157 LHFRLLSVD----GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGIL-----KIKGGG-LLDFKTKVSRSRQAM 226 (383) Q Consensus 157 ih~~~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~-----~~~~~~-~~e~~~~~~~~~~~~ 226 (383) .|++.+.++ +...|.|.+..+...+...+........-++.|. ...++ ...... +.+.... ..+. . 
T Consensus 239 ~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~-~~i~v~~~~l~~~~~~~~g~~~~~--~~~d-~ 314 (500) T protein:vir:98 239 TYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQ-RRVAVPESLTALTVRTTDGDVVPR--PRFE-S 314 (500) T ss_pred EEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCc-ceeeechHHhcccCCCCCccccCC--cccC-C Confidence 455543332 3356999999999888888777777777776644 33333 111111 1000000 0000 0 Q ss_pred hcCCcceee--cCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCc-CHHHHH-------------H Q lcl|NC_018285. 227 KQMQGGPLV--LDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ-SSLEMS-------------S 290 (383) Q Consensus 227 ~~~~g~~~v--l~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~-~~~e~~-------------~ 290 (383) .+.--..+- .+++..++.++....+-++.+..+...++|+...|+++..+|..+++. ++.|.. + T Consensus 315 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~ 394 (500) T protein:vir:98 315 DQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIV 394 (500) T ss_pred CcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHH Confidence 000000000 123345777777777888999999999999999999999998654432 222221 1 Q ss_pred HHHHHHHHHHHHHHHHHHHH-hhc-----c--hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc Q lcl|NC_018285. 291 NVYSKAVARYLRPFLSELSQ-KLS-----C--DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE 362 (383) Q Consensus 291 ~~~~~~l~P~~~~i~~~l~~-~l~-----~--~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d 362 (383) ..+..+|..++..|.+.... .++ . ++.++....+-.|....+....+++.+|++++-+++.++ .+++..+ T Consensus 395 ~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~--~g~~eee 472 (500) T protein:vir:98 395 ALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKV--LNVTEEK 472 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhc--CCCCHHH Confidence 12233333333333221111 011 0 111222333445667777888899999999999998764 3444444 Q ss_pred h----hHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
363 L----PKGENPNRTILKGGETNGQD 383 (383) Q Consensus 363 ~----~~~~~~~~~~~~ggd~~~~d 383 (383) + .+.+.-. .+.-|-.++.+| T Consensus 473 a~~~l~~i~~E~-~~~~~~~~~~~~ 496 (500) T protein:vir:98 473 AQEIAAEINTGI-VDEINQQRTDTH 496 (500) T ss_pred HHHHHHHHHHhc-cccCCCCCcccc Confidence 3 2222110 011011111122 No 195 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=97.56 E-value=4.6e-05 Score=44.37 Aligned_cols=374 Identities=7% Similarity=0.037 Sum_probs=169.2 Q ss_pred CchhhhhhcC--Ccc---ccccc--------ccccch---------hh-cccccCC------ceechhhhhccHHHHHHH Q lcl|NC_018285. 1 MPIFNLATES--PPN---NQGGF--------FDITDP---------EF-LATLNGS------EWVSAETALKNSDLFSII 51 (383) Q Consensus 1 Mglf~~~~~~--~~~---~~~~~--------~~~~~~---------~~-~~~~~~~------~~~~~~~a~~~~~v~~~i 51 (383) ||+|.++++= +.. ...+. ..++.. .+ .+-+... .....+.-.....-...+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 9999987541 100 00011 111100 00 0000000 000011112223334455 Q ss_pred HHHHHhhhhCce--eeecchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcC Q lcl|NC_018285. 52 SQLSNDLATAKL--TTSRKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLD 129 (383) Q Consensus 52 ~~ia~~ia~~p~--~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~ 129 (383) +.+|+-+..-|. .+.+......+.+--..-......+..+...+..|.+++.+..+. 
|.+ .+.++++..+-+...+ T Consensus 81 ~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~-~I~~v~ad~~~P~~~d 158 (500) T protein:vir:30 81 KKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKV-RVAFVQAPVFLPLQSN 158 (500) T ss_pred HHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeeEEEEEc Confidence 666666655443 333332332222222222344555666777888899888887764 333 4566666665443221 Q ss_pred CC-----------------ceeEE--------------EEe----ec-C-cccccce-------------e---ecccce Q lcl|NC_018285. 130 NQ-----------------NGLYY--------------NVT----FD-D-PRIPPKQ-------------H---VPQSDI 156 (383) Q Consensus 130 ~~-----------------~~~~y--------------~~~----~~-~-~~~~~~~-------------~---~~~~dv 156 (383) .. ...+| .+. .. + ...+..+ . ++..-+ T Consensus 159 ~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f 238 (500) T protein:vir:30 159 TQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIF 238 (500) T ss_pred CCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccE Confidence 11 11111 000 00 0 0001000 0 001113 Q ss_pred EEeccCCCC----ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeE-----eecCCC-CHHHHHHHHHHHHHh Q lcl|NC_018285. 157 LHFRLLSVD----GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGIL-----KIKGGG-LLDFKTKVSRSRQAM 226 (383) Q Consensus 157 ih~~~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~-----~~~~~~-~~e~~~~~~~~~~~~ 226 (383) .|++.+.++ +...|.|.+..+...+...+........-++.|. ...++ ...... +.+.... ..+. . 
T Consensus 239 ~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~-~~i~v~~~~l~~~~~~~~g~~~~~--~~~d-~ 314 (500) T protein:vir:30 239 TYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQ-RRVAVPESLTALTVRTTDGDVVPR--PRFE-S 314 (500) T ss_pred EEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCc-ceeeechHHhcccCCCCCccccCC--cccC-C Confidence 455543332 3356999999999888888777777777776644 33333 111111 1000000 0000 0 Q ss_pred hcCCcceee--cCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCc-CHHHHH-------------H Q lcl|NC_018285. 227 KQMQGGPLV--LDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ-SSLEMS-------------S 290 (383) Q Consensus 227 ~~~~g~~~v--l~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~-~~~e~~-------------~ 290 (383) .+.--..+- .+++..++.++....+-++.+..+...++|+...|+++..+|..+++. ++.|.. + T Consensus 315 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~ 394 (500) T protein:vir:30 315 DQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIV 394 (500) T ss_pred CcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHH Confidence 000000000 123345777777777888999999999999999999999998654432 222221 1 Q ss_pred HHHHHHHHHHHHHHHHHHHH-hhc-----c--hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc Q lcl|NC_018285. 291 NVYSKAVARYLRPFLSELSQ-KLS-----C--DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE 362 (383) Q Consensus 291 ~~~~~~l~P~~~~i~~~l~~-~l~-----~--~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d 362 (383) ..+..+|..++..|.+.... .++ . ++.++....+-.|....+....+++.+|++++-+++.++ .+++..+ T Consensus 395 ~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~--~g~~eee 472 (500) T protein:vir:30 395 ALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKV--LNVTEEK 472 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhc--CCCCHHH Confidence 12233333333333221111 011 0 111222333445667777888899999999999998764 3444444 Q ss_pred h----hHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
363 L----PKGENPNRTILKGGETNGQD 383 (383) Q Consensus 363 ~----~~~~~~~~~~~~ggd~~~~d 383 (383) + .+.+.-. .+.-|-.++.+| T Consensus 473 a~~~l~~i~~E~-~~~~~~~~~~~~ 496 (500) T protein:vir:30 473 AQEIAAEINTGI-VDEINQQRTDTH 496 (500) T ss_pred HHHHHHHHHHhc-cccCCCCCcccc Confidence 3 2222110 011011111122 No 196 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=97.55 E-value=4.6e-05 Score=44.34 Aligned_cols=365 Identities=10% Similarity=0.013 Sum_probs=148.9 Q ss_pred CchhhhhhcCCcc------cccccccccchhhcccccCCceechhh-h--hccHHHHHHHHHHHHhhhhCceeeecch-- Q lcl|NC_018285. 1 MPIFNLATESPPN------NQGGFFDITDPEFLATLNGSEWVSAET-A--LKNSDLFSIISQLSNDLATAKLTTSRKQ-- 69 (383) Q Consensus 1 Mglf~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~-a--~~~~~v~~~i~~ia~~ia~~p~~~~~~~-- 69 (383) +.+...+.+.... ....|.....+ .... +..+..+- . ....-..-||+-+|+.+.--.|.+-+.. T Consensus 17 ~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~--~~~~--~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~d~~~~ 92 (474) T protein:vir:81 17 NALINGLLAQIENLRWKNLLRTSYYENKRT--IQYV--GTLIPPQYFNLGLVLGWTGKAVDALARRCNLEGFVWPDGDLD 92 (474) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHhccCCC--hhhc--cccccHHHHHHHhhcChHHHHHHHHHhhhcccceECCCCCcc Confidence 3233322221110 01111111000 0000 00011110 0 1111223355555554433344432211 Q ss_pred ---hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCce-eEEEEeccceeEEEEcCCCceeEEEEe---ec- Q lcl|NC_018285. 70 ---MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRD-MKWEYLRPSQVSFNRLDNQNGLYYNVT---FD- 141 (383) Q Consensus 70 ---~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~-~~l~~l~~~~v~~~~~~~~~~~~y~~~---~~- 141 (383) .+.++. .-.......++..+.+++|.||+.|..+.+|.+ ..+.+++|.++....+.....+.+-+. 
.+ T Consensus 93 ~~~l~~iw~----~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~~D~~~~~~~~al~~~~~~~ 168 (474) T protein:vir:81 93 SLGGTEVVD----DNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGEWNRRRRGLNNLLSIIDKDK 168 (474) T ss_pred chHHHHHHH----hcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEEEeCCCCcceeeeEEEEEcC Confidence 112221 112234455677888999999999998777764 457788898888766543332111110 00 Q ss_pred Ccccccceeeccc-------------------------ceEEeccCCCCccccCcchH----HHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 142 DPRIPPKQHVPQS-------------------------DILHFRLLSVDGGLTSVSPL----MALGRELDIQKASDKLTL 192 (383) Q Consensus 142 ~~~~~~~~~~~~~-------------------------dvih~~~~~~~~~~~G~s~~----~~~~~~i~~~~~~~~~~~ 192 (383) ++.......+.++ .|++|.+...-...+|.|.+ ..+.+.+.....-..... T Consensus 169 ~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~ 248 (474) T protein:vir:81 169 EGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELARREGHM 248 (474) T ss_pred CCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcceEEecccccccCcCCccccchhHHHHHHHHHHHHHHHHHHH Confidence 0000000011111 24444433222334577743 444444444444444444 Q ss_pred HHHhccCCcceeEee-cC-CCCHHH---HHHHHHHHH---HhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHH Q lcl|NC_018285. 193 NSLKNALNANGILKI-KG-GGLLDF---KTKVSRSRQ---AMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQ 264 (383) Q Consensus 193 ~~~~ng~~~~~i~~~-~~-~~~~e~---~~~~~~~~~---~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~ 264 (383) .|+.. |.-.+.. .. ...+++ ...++.... 
....+..+-.....+.++-++....-+ .|++..+....+ T Consensus 249 e~~a~---pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~-~~~~~l~~~~~~ 324 (474) T protein:vir:81 249 DVFSY---PEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAASPD-AHWSDINGLAKL 324 (474) T ss_pred HHhcc---hhheeecCChhhcccccccccchhhhhHHHHhcCCCcccccccccccccccccCCCChh-HHHHHHHHHHHH Confidence 44443 4444432 11 111111 111111110 011111111112234566555544322 378889999999 Q ss_pred HHHHhcCCHHHhcccc-cCcCHHHHHHHH---HHHHHHHHHHHHHHHHHHhhc------chhhcc--------c----hh Q lcl|NC_018285. 265 FAKVYGIPENVVGGQG-DQQSSLEMSSNV---YSKAVARYLRPFLSELSQKLS------CDVDAD--------I----FP 322 (383) Q Consensus 265 Ia~~~gVpp~~lg~~~-~~~~~~e~~~~~---~~~~l~P~~~~i~~~l~~~l~------~~~e~~--------~----~~ 322 (383) +|..=+||++.||..+ ++..+++..++- +.....-..+.+.+.|.+.+- ..+..+ + .+ T Consensus 325 ~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d 404 (474) T protein:vir:81 325 FAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRD 404 (474) T ss_pred HHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEecC Confidence 9999999999999543 343333333211 111122222222222222111 111111 1 11 Q ss_pred hhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhhcCCcCCcchhHHh-------------CCCCCCCCCCCCCC Q lcl|NC_018285. 323 AVDPTGANYISRINSMVKSG--TLAQNQGLYILQQAEILPKELPKGE-------------NPNRTILKGGETNG 381 (383) Q Consensus 323 ~~~~~~~~~~~~~~~l~~~g--~~t~nE~r~~lg~~~~~~~d~~~~~-------------~~~~~~~~ggd~~~ 381 (383) ....+....++.+.++.++| +.+..-+++++|+.+ .++.+.+ .+...-. +|.+.. 
T Consensus 405 ~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~---~~i~~~~~~~~~~~~~~~~~~l~~~~~-~~~~aq 474 (474) T protein:vir:81 405 PRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTP---QQARRAMADKRRVQGRGTLQALIDRSN-NGATAQ 474 (474) T ss_pred CCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCH---HHHHHHHHHHHHHhHHHHHHHHHhcCC-CCCCCC Confidence 22345566677777888876 455566677776553 3332211 0100011 111111 No 197 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=97.53 E-value=1.7e-06 Score=52.19 Aligned_cols=258 Identities=12% Similarity=0.091 Sum_probs=125.9 Q ss_pred hccHHHHHHHHHHHHhhhhCceeeecchhhhhcc--------CCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCcee Q lcl|NC_018285. 42 LKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVD--------NPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDM 113 (383) Q Consensus 42 ~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~--------~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~ 113 (383) |..- .++..|++++--.+.|.+...+.|+. .-|...+-..-++.++.+. +.|--...+ +-+.-+. T Consensus 1 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-~~~~~~~- 73 (279) T protein:vir:40 1 MSLF----NLSRRAEDVSFSTFTVQDPTTDLLLGKLLGLVSYFDNVDYSEASKLEDLFYWA-LQGKEVYRV-WYGGFKY- 73 (279) T ss_pred Cccc----ccchhhcccceeeeeecCcchhHHHHHHHHHHHHhhcccchhhhhhhhhhhhh-hccceeehh-hhhhHHH- Confidence 1110 11223445544455555544333221 2344444433334443332 344322111 1110000 Q ss_pred EEEEeccceeEEEEcCCCceeEEEEeecCcccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 114 KWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLN 193 (383) Q Consensus 114 ~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 193 (383) -+.+|. .+..+.+. + . .....+++|-.|+..|-++ .+|.-+ +....-+++.. +.... 
T Consensus 74 -----~~~~~~---~d~fn~~v---r--~-~~~~~vtVP~~Dv~IieNP-----lv~v~~-ee~~kM~~la~---nai~~ 130 (279) T protein:vir:40 74 -----YAQRVN---ADQFNIVV---R--E-PNRREVTIRTNDYEMLLNP-----FYGANP-QRFGVMFGMAS---NGIGR 130 (279) T ss_pred -----HHhhcC---cchhhhhe---e--c-CCcceeEeecchhhhhhcc-----hheecc-chhhHHHHHHH---hhhhh Confidence 000000 00000000 0 0 1112345555666655443 344443 23333333332 22333 Q ss_pred HHhccCCcceeEeecCC-CCHHHHHHHHHHHHHh---hcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 194 SLKNALNANGILKIKGG-GLLDFKTKVSRSRQAM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVY 269 (383) Q Consensus 194 ~~~ng~~~~~i~~~~~~-~~~e~~~~~~~~~~~~---~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~ 269 (383) -+.+.+..+++++.+-. ..++.+++.+....++ .++-+++.+++.|-++++++.+-.-+ ..+-.++.+.+.+..+ T Consensus 131 KLD~~~qIk~fIKTd~d~glee~kekaR~rIk~mlalAk~~nGityid~~ddItQL~kDYSts-lk~die~lkS~l~Sq~ 209 (279) T protein:vir:40 131 RLDSQAQIKIYWKTKVSSGLKEVWDRIRERLTQQQQLAREFNGVSVIGSDDDIKQIQPDYSGS-LQNDANLAIEIALSEY 209 (279) T ss_pred hhcccceeeeEEecCcchhHHHHHHHHHHHHHHHHHHHHhcCCeeeecCCceeEeeccccccc-cHHHHHHHHHHHHhhc Confidence 33677778888888755 4555555555554443 34446899999999999998865544 3455578889999999 Q ss_pred cCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHH------HHhhcchhhccchhhhccCHHHHHHHHHHHHhCCC Q lcl|NC_018285. 270 GIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSEL------SQKLSCDVDADIFPAVDPTGANYISRINSMVKSGT 343 (383) Q Consensus 270 gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l------~~~l~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~ 343 (383) |||..+|-+ +.++++..+|+..+|.|++++.+..| +.+.++. -.++|. T Consensus 210 GinekIL~G----sAtE~q~iAyy~rtVePILkQyek~liY~~E~fv~y~tt----------------------ta~gg~ 263 (279) T protein:vir:40 210 GMPRELLYG----QSNEVTIIAFAIQKVLPLLKQHDKNIIFNQENFVAYIST----------------------TAKGGA 263 (279) T ss_pred CCchhhccc----cCchhhhhhHHHhhHHHHHHHhcccccchhhhhhhhhee----------------------cccCcc Confidence 999999955 44588899999999999999977633 3333322 011111 Q ss_pred cCH-HHHHH--HhhcC Q lcl|NC_018285. 
344 LAQ-NQGLY--ILQQA 356 (383) Q Consensus 344 ~t~-nE~r~--~lg~~ 356 (383) +.. .-.|. -.|.. T Consensus 264 ~~s~~~~~~~~~~~~~ 279 (279) T protein:vir:40 264 IESKSSKRDSEPVGND 279 (279) T ss_pred cccccccccCCCCCCC Confidence 100 00000 01111 No 198 >protein:vir:97376 Length: 320 # NCBI annotation: putative portal protein # Family: family:all:11744 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762589;genbank:gi:115304290;genbank:GeneID:5130579 Probab=97.51 E-value=3.3e-06 Score=50.63 Aligned_cols=309 Identities=17% Similarity=0.184 Sum_probs=144.7 Q ss_pred CchhhhhhcCCcccccccccccchhh---cccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhhccCC Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEF---LATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGIVDNP 77 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~~~P 77 (383) ||+|+..++.--.+.-. .++..... -+++.+... +.+-+-+|+.||.- +-.|+..++.| T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~-~~~~~~~~~~~---- 62 (320) T protein:vir:97 1 MGIFNFKKRETLTPELK-ESIIRQVTIEDESPFTGTTD------------FNVRNEVAESIATY-LGAYKTSAKRL---- 62 (320) T ss_pred CCccccccccccChhHH-hhhhheeeeccCCCcccccc------------cchhhHHHHHHHHH-hhhhcccccee---- Confidence 99999754332111100 00000000 011112111 12333444544421 01122122222 Q ss_pred CccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCccc-ccceeecccce Q lcl|NC_018285. 78 SNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRI-PPKQHVPQSDI 156 (383) Q Consensus 78 N~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~-~~~~~~~~~dv 156 (383) .-..+--.|++.++.+.+..-..|++....- |+.+ .+.+++. 
+-.....++..+.-+ .....++-.|+ T Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~------~~~~~~~----~~~~~~~~~~~D~FN~~V~mtvpfyD~ 131 (320) T protein:vir:97 63 SLLTNNPSFLRRLVKHALHNKTTYVYKSPTY-GWLI------TDSMTIE----GLRARLTFTLPDPFNSAVTMTVPFYDV 131 (320) T ss_pred eeeeCCHHHHHHHHHHhhcccceEEeeCCcc-ceee------ecceeee----eeeeeEEEecCcccceeEEEEeeeech Confidence 2223334689999999998888888764322 3221 1222221 111111222211111 01111111121 Q ss_pred EEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC-CCHHHHHHHHHHHHHhhcC---Ccc Q lcl|NC_018285. 157 LHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG-GLLDFKTKVSRSRQAMKQM---QGG 232 (383) Q Consensus 157 ih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~-~~~e~~~~~~~~~~~~~~~---~g~ 232 (383) -.+ +++++|..+- ....-++.+ .....+-+.|.+.....++.+-+ .-+|-.++......++.+- -.+ T Consensus 132 ~IL-----dnpl~gv~tq-e~gkM~g~a---~~~v~kkL~~~~~IKafi~Tdid~GLee~kD~~~~kIk~mq~~A~~~nG 202 (320) T protein:vir:97 132 GII-----DSPLVEVDTE-EANKMLEAA---YSAVMKKLHNTGAIKAFISSDIDVGLEKMKEESDSKIKAMLATAELLSG 202 (320) T ss_pred hhh-----hhhhcccChH-HhhHHHHHH---hhhhhhhccccceeEEEEecccchhHHHHHHHHHHHHHHHHHHHHHhcC Confidence 111 3456777774 333333333 33444556677778888887654 3355555555444433332 246 Q ss_pred eeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHH---HHHH Q lcl|NC_018285. 233 PLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFL---SELS 309 (383) Q Consensus 233 ~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~---~~l~ 309 (383) .-+++.|-+++++..+-.-+. ..-.+..+...+.-|+||..+|-++++ +++.-+|+..++.|+++++. -+|. 
T Consensus 203 ~T~i~~~dDI~Qi~pDYS~sn-~~D~~l~~t~alS~y~m~~~IL~GsAt----e~~~Iaf~~~~V~PLL~Q~~~~Ek~Lv 277 (320) T protein:vir:97 203 YTYIQRGDDVTQMMPDYTTSN-VTDFAAMRTFAASQLSVSDKILDGSAT----DGEKVAVMFRFVEPILEQFREYEPSLI 277 (320) T ss_pred cccccCCcceeeecccccccc-hhHHHHHHHHHHhhcCCchhhccccCC----cceeeehhhHhHHHHHHHhhhcCccee Confidence 777888888888766433322 222345566788899999999976543 45566899999999999973 3333 Q ss_pred HhhcchhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCC Q lcl|NC_018285. 310 QKLSCDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETNGQ 382 (383) Q Consensus 310 ~~l~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~ 382 (383) .+|..+ .+ +.=+--+|.+..|-+- ++ +. +. .+...+|||+.+- T Consensus 278 y~m~~E--------------~F---Vs~mtTGG~l~S~~~~------~~--~~----~~-~~~~~~~~~~~~~ 320 (320) T protein:vir:97 278 YAMRDE--------------FF---VSFMTTGGMLNSNRVD------GW--GK----EK-APNESKGGDVGDV 320 (320) T ss_pred eeeccc--------------ee---eeeeecCceeeccccc------cc--cc----cc-CCccccCCcccCC Confidence 322111 01 1111125555444221 11 10 11 1123457777655 No 199 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=97.42 E-value=7.1e-05 Score=43.34 Aligned_cols=367 Identities=10% Similarity=-0.006 Sum_probs=149.9 Q ss_pred CchhhhhhcC------CcccccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecc--hhhh Q lcl|NC_018285. 1 MPIFNLATES------PPNNQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRK--QMQG 72 (383) Q Consensus 1 Mglf~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~--~~~~ 72 (383) ..++.++.++ +......+.....+...-........+.+ +..+-....|+..+.-+-+-|+.+.-. .... 
T Consensus 19 ~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~k--i~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~ 96 (453) T protein:vir:39 19 NEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTKDLWKPDNR--LTVNFTKYIVDTFTGYFNGIPVKKSHSDKETLS 96 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCccccCccce--eecchHHHHHHHHhhhhcccCceeccCChHHHH Confidence 1111111100 00000000000000000000000000111 112233445555655555566655322 2222 Q ss_pred hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCce-eE--EEEeecCcccccce Q lcl|NC_018285. 73 IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNG-LY--YNVTFDDPRIPPKQ 149 (383) Q Consensus 73 l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~-~~--y~~~~~~~~~~~~~ 149 (383) .+.+.............+..+.+.+|.||+.+.++.+|.+ .+..++|..+....++.... .. .++........... T Consensus 97 ~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~ 175 (453) T protein:vir:39 97 KLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQT-NVIYNTPENMFMVYDDTIKQEPLFAVRYGYDDDYKLYGE 175 (453) T ss_pred HHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEecCCCCCeEEEEEEEEEeCCeEEEEE Confidence 2222222234455667788999999999999999998876 46667888887776543321 11 11111111000011 Q ss_pred eecccc-------------------------eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCccee Q lcl|NC_018285. 150 HVPQSD-------------------------ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGI 204 (383) Q Consensus 150 ~~~~~d-------------------------vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i 204 (383) .+.++. 
|+++++ ...|.|-+..+...++....+..-..+.....+.|..+ T Consensus 176 ~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~ 250 (453) T protein:vir:39 176 VYTKETTYALNGTMGFYNMTEQAPNPFDDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLT 250 (453) T ss_pred EEeCCeEEEEEecCCceeeecccccCCCceeEEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceee Confidence 112222 233322 12477777666666655555444444455555667666 Q ss_pred EeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcC Q lcl|NC_018285. 205 LKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQS 284 (383) Q Consensus 205 ~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~ 284 (383) ++.. .+.++....++........ + ....+.+.++..++.+.....+.+..+...+.|+..-++|..-.+..++ . T Consensus 251 ~~g~-~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn-~- 324 (453) T protein:vir:39 251 FLGA-AVEEEDLKNIRSNRVINYY--G-ESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGS-S- 324 (453) T ss_pred eecC-CCCchhhhhhhhcceeeec--C-CCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccC-C- Confidence 6542 3333333332221110000 0 0011233333333333344556667777888888888887433322111 1 Q ss_pred HHHHH--------------HHHHHHHHHHHHHHHHHHHHHhhc----chhhccchhhhccCHHHHHHHHHHHHhCCCcCH Q lcl|NC_018285. 285 SLEMS--------------SNVYSKAVARYLRPFLSELSQKLS----CDVDADIFPAVDPTGANYISRINSMVKSGTLAQ 346 (383) Q Consensus 285 ~~e~~--------------~~~~~~~l~P~~~~i~~~l~~~l~----~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~ 346 (383) +..+. +..+...+...++.+...++..-. .++++......-.+..+.+..+.++ +|+++. 
T Consensus 325 Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~ 402 (453) T protein:vir:39 325 SGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAWKDIEYTFTRNEPKDIKEQAETANIL--MGITSQ 402 (453) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHH--hccCCh Confidence 11111 122233333333333332221100 0111212222334455666666665 588999 Q ss_pred HHHHHHhhcCCcCCcchhHHhC----------CCCCCCCC--CCCC--CCC Q lcl|NC_018285. 347 NQGLYILQQAEILPKELPKGEN----------PNRTILKG--GETN--GQD 383 (383) Q Consensus 347 nE~r~~lg~~~~~~~d~~~~~~----------~~~~~~~g--gd~~--~~d 383 (383) -.+.+.++.-.=+..|+.+++. ......+| ++.+ ++| T Consensus 403 et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 403 ETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVPETNEE 453 (453) T ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCcCCC Confidence 8888877432100122222110 01111222 1111 111 No 200 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=97.40 E-value=7.6e-05 Score=43.18 Aligned_cols=368 Identities=11% Similarity=0.006 Sum_probs=146.8 Q ss_pred Cchh--------hhhhcCCc--ccccccccccchhhcccccCCceechhhh---hccHHHHHHHHHHHHhhhhCceeeec Q lcl|NC_018285. 1 MPIF--------NLATESPP--NNQGGFFDITDPEFLATLNGSEWVSAETA---LKNSDLFSIISQLSNDLATAKLTTSR 67 (383) Q Consensus 1 Mglf--------~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~a---~~~~~v~~~i~~ia~~ia~~p~~~~~ 67 (383) |+-. +...++.+ .....+...... .... +..+..+.+ +...-...+|+..+..+--.++.+-. 
T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~--i~~~--~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~ 76 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRR--LKTI--GIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc--cccc--ccccchhHhhhhhhcchHHHHHHHHHhhhccCceecCC Confidence 3311 11111100 000001000000 0000 001111100 01112234555555544333444332 Q ss_pred ch-h-hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeec------CCCceeEEEEeccceeEEEEcCCC-ce----e Q lcl|NC_018285. 68 KQ-M-QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRN------DNGRDMKWEYLRPSQVSFNRLDNQ-NG----L 134 (383) Q Consensus 68 ~~-~-~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~------~~g~~~~l~~l~~~~v~~~~~~~~-~~----~ 134 (383) +. . ..+.. .-..-........+..+++.+|.||+.+-+. .+|.+ .+.+++|..+.+..+... .. + T Consensus 77 d~~~~~~l~~-i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~-~i~~~~p~~~~~~~D~~~~~~~~~~i 154 (480) T protein:vir:78 77 DSEGLEELWN-WWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAV 154 (480) T ss_pred CchhHHHHHH-HHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCee-EEEEEcccceEEEEcCCCccceEEEE Confidence 21 1 11111 1111133456677889999999999988763 34554 466788888877665321 11 1 Q ss_pred EEEEeec----------------------Cccc------ccce--eecccceEEeccCCCCccccCcchHHH-HHHHHHH Q lcl|NC_018285. 135 YYNVTFD----------------------DPRI------PPKQ--HVPQSDILHFRLLSVDGGLTSVSPLMA-LGRELDI 183 (383) Q Consensus 135 ~y~~~~~----------------------~~~~------~~~~--~~~~~dvih~~~~~~~~~~~G~s~~~~-~~~~i~~ 183 (383) .|....+ .+.. .... .+..-.|++|.+....+..+|.|-+.. +...++. 
T Consensus 155 ~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da 234 (480) T protein:vir:78 155 RLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDA 234 (480) T ss_pred EEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCcccchhhHHHHHHH Confidence 1100000 0000 0000 022223566654433344567776543 3333333 Q ss_pred HHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeec-CCCceeeecccChhhHHHHHHHHHHH Q lcl|NC_018285. 184 QKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVL-DDLEDFTPLEIKSNVAQLLKQADWTT 262 (383) Q Consensus 184 ~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl-~~g~~~~~~~~~~~d~~~~e~~~~~~ 262 (383) .....-.......-.+.|..++... ...+...+.-...+. ...+.++.+ ++++++..+.....+ .+++..+..+ T Consensus 235 ~~~~~s~~~~~~~~~a~p~~~i~G~-~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l~~~i 309 (480) T protein:vir:78 235 ASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLD---IYYGRILTLASEAAKISEFKAAELR-NFAEEMEVFR 309 (480) T ss_pred HHHHHHHHHHHHHhhcchhhhhhcC-Cccccccccccchhh---hhhhhhccCCCCCceEEecCccCHH-HHHHHHHHHH Confidence 2222222222222234454444321 111111110011111 111334444 345677666654333 3677778888 Q ss_pred HHHHHHhcCCHHHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhc------ch------------hhccchhhh Q lcl|NC_018285. 263 GQFAKVYGIPENVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLS------CD------------VDADIFPAV 324 (383) Q Consensus 263 ~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~------~~------------~e~~~~~~~ 324 (383) .+|+..=++|+..+|+.+.+..+.++.+. ....|.-.+...+..|...|- .. +++...... 
T Consensus 310 ~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~-~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~~~~~~~~i~v~f~~~~ 388 (480) T protein:vir:78 310 KEAASITGLPPQYLSSSSENPASAEAIIA-TDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPS 388 (480) T ss_pred HHHhcccCCChHHhccccCcchHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCC Confidence 99999999999999976544333333322 112222222222222221111 00 011111111 Q ss_pred ccCHHHHHHHHHHHHhCC--CcCHHHHHHHhhcCCcCCcchhHH------------hCCCCC----CC-----CCCCCCC Q lcl|NC_018285. 325 DPTGANYISRINSMVKSG--TLAQNQGLYILQQAEILPKELPKG------------ENPNRT----IL-----KGGETNG 381 (383) Q Consensus 325 ~~~~~~~~~~~~~l~~~g--~~t~nE~r~~lg~~~~~~~d~~~~------------~~~~~~----~~-----~ggd~~~ 381 (383) -.+..+.+..+.+++.+| +++...+++.+|..+ +++.++ +..... +. .+||+.+ T Consensus 389 ~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~---d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 465 (480) T protein:vir:78 389 TPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTA---TQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKT 465 (480) T ss_pred CCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCH---hHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCCCCCCCCC Confidence 245556677777887765 778877787776543 222111 111111 00 1222211 Q ss_pred CC Q lcl|NC_018285. 382 QD 383 (383) Q Consensus 382 ~d 383 (383) +. T Consensus 466 ~~ 467 (480) T protein:vir:78 466 ET 467 (480) T ss_pred cc Confidence 11 No 201 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=97.30 E-value=0.0001 Score=42.48 Aligned_cols=363 Identities=10% Similarity=0.007 Sum_probs=149.0 Q ss_pred CchhhhhhcCC--cccccccccccchhhc---ccccCCc--eechhhhhccHHHHHHHHHHHHhhhhCceeeecchh--- Q lcl|NC_018285. 1 MPIFNLATESP--PNNQGGFFDITDPEFL---ATLNGSE--WVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQM--- 70 (383) Q Consensus 1 Mglf~~~~~~~--~~~~~~~~~~~~~~~~---~~~~~~~--~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~--- 70 (383) .+|.+....+. ......+.......+. .....+. ...+..-+.++-....|+..+.-+-+-|+++...+. 
T Consensus 50 ~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~~~d~~~~ 129 (492) T protein:vir:94 50 VRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVV 129 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHhhhcccCceeccCchHHH Confidence 11111111000 0000000000000000 0000000 000000112333444566666666566666532221 Q ss_pred ---hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-ceeEE--E-EeecCc Q lcl|NC_018285. 71 ---QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-NGLYY--N-VTFDDP 143 (383) Q Consensus 71 ---~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~~~~y--~-~~~~~~ 143 (383) +.++. | ........+..+++.+|.||+++-.+.+|++ .+..++|..+.+..++.. ..+.+ + +..... T Consensus 130 ~~l~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~ 203 (492) T protein:vir:94 130 KRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE 203 (492) T ss_pred HHHHHHHh--c---cHHHHHHHHHHHHhhCCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeeccc Confidence 11221 2 2334556778899999999999999988876 477788888877654321 11111 1 110100 Q ss_pred ccccceeecccceEEeccC---------------------CC---------CccccCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 144 RIPPKQHVPQSDILHFRLL---------------------SV---------DGGLTSVSPLMALGRELDIQKASDKLTLN 193 (383) Q Consensus 144 ~~~~~~~~~~~dvih~~~~---------------------~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 193 (383) .....+....+.++... 
++ .+...|.|-+..+...++....+..-..+ T Consensus 204 --~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~sd~e~v~~liDa~d~~~S~~~~ 281 (492) T protein:vir:94 204 --TKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSN 281 (492) T ss_pred --eeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHH Confidence 00111111122211100 00 01124778787777777776666666666 Q ss_pred HHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCH Q lcl|NC_018285. 194 SLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPE 273 (383) Q Consensus 194 ~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp 273 (383) .+...+.|-.+++.-... ......... ...+++.++.+.+...+........+....+...+.|+..-++|. T Consensus 282 ~~~~~~~p~lv~~g~~~~---~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~ 353 (492) T protein:vir:94 282 TFKDSNELTYVLKNYDDQ---ELPEFKRLL-----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVD 353 (492) T ss_pred HHHHhcCceeeeecCCcc---cchhhHHHH-----hhccceecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcC Confidence 666667776666532221 111111111 113344455554444444444445556666777788888888875 Q ss_pred HHhcccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhc-chhhccchhhhccCHHHHHHHHHHH Q lcl|NC_018285. 274 NVVGGQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKLS-CDVDADIFPAVDPTGANYISRINSM 338 (383) Q Consensus 274 ~~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~-~~~e~~~~~~~~~~~~~~~~~~~~l 338 (383) .-.+..+.+. 
+.++.+ ..+...+.-.++.+...++.+-- .++++...+..-.+..+.+..+.++ T Consensus 354 ~~~~~~~~n~-Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~~~~~~kl 432 (492) T protein:vir:94 354 FSSDKFGSAP-SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQS 432 (492) T ss_pred CCccccccCc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeEEecCCCCCCHHHHHHHHHHH Confidence 3332111112 222221 11222222222222222111100 0111111222223455666666665 Q ss_pred HhCCCcCHHHHHHHhhcCCcCCcchhHH--------hCCCCCCCCCCCC-----CCCC Q lcl|NC_018285. 339 VKSGTLAQNQGLYILQQAEILPKELPKG--------ENPNRTILKGGET-----NGQD 383 (383) Q Consensus 339 ~~~g~~t~nE~r~~lg~~~~~~~d~~~~--------~~~~~~~~~ggd~-----~~~d 383 (383) .|+++...++++++.-.=+..|+.++ +...... .++.+ ++.+ T Consensus 433 --~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~-~~~~~~~~~~~~~~ 487 (492) T protein:vir:94 433 --MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLD-DGGADSAQQQERSN 487 (492) T ss_pred --hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccc-cccCCCCccccCCc Confidence 48999988888774322011232221 1111111 11111 1111 No 202 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=97.29 E-value=0.0001 Score=42.41 Aligned_cols=358 Identities=13% Similarity=0.051 Sum_probs=152.8 Q ss_pred CchhhhhhcCCc--ccccccccccchh----hcccccCCceec-hhhhhccHHHHHHHHHHHHhhhhCceeeecch--hh Q lcl|NC_018285. 1 MPIFNLATESPP--NNQGGFFDITDPE----FLATLNGSEWVS-AETALKNSDLFSIISQLSNDLATAKLTTSRKQ--MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~--~~~~~~~~~~~~~----~~~~~~~~~~~~-~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--~~ 71 (383) ..+.+....+.. .....+....... ......+..... ...=+.++-....|+..+.-+-+-|+++.-.+ .. 
T Consensus 33 ~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~ 112 (474) T protein:vir:97 33 VRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENVL 112 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCcHHHH Confidence 111111100000 0000000000000 000000000000 00001123344456666666666676654322 22 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-ceeEE---EEeecCccccc Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-NGLYY---NVTFDDPRIPP 147 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~~~~y---~~~~~~~~~~~ 147 (383) ..+..-+. .........+..++..+|.||+.+.++.+|.+ .+..++|..+-+..++.. ..+.+ .+...+. .. T Consensus 113 ~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~--~~ 188 (474) T protein:vir:97 113 KVIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNE--EK 188 (474) T ss_pred HHHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEecCCCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCe--EE Confidence 22211111 12345556677889999999999999998875 466788888877765421 11111 0110000 00 Q ss_pred ceeecccc-----------------------------------eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 148 KQHVPQSD-----------------------------------ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTL 192 (383) Q Consensus 148 ~~~~~~~d-----------------------------------vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 192 (383) ...+.... |+++++ ...|.|-+..+...++....+..... 
T Consensus 189 ~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~~~s~~~ 263 (474) T protein:vir:97 189 VEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDKRLSDAQ 263 (474) T ss_pred EEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecC-----CcCCCCcHHHHHHHHHHHHHHHHHHH Confidence 01111111 333332 12477777777777776665555555 Q ss_pred HHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_018285. 193 NSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIP 272 (383) Q Consensus 193 ~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVp 272 (383) +.+...+.|..+++.-.....+ .+... ....+++.+++|.+...++.......+.+..+...+.|...-++| T Consensus 264 ~~~~~~~~~~lv~~g~~~~~~~---~~~~~-----~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p 335 (474) T protein:vir:97 264 NMFDESVELIYILKGYEGEDLE---EFMRG-----LKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGV 335 (474) T ss_pred HHHHHhcCceeeeecCCcccch---hhhhh-----hhccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcc Confidence 5555566666665542221111 11111 113456666666665555555455556667777788888888888 Q ss_pred HHHhcccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcchhhc-cchh-hhccCHHHHHHHHH Q lcl|NC_018285. 273 ENVVGGQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKLSCDVDA-DIFP-AVDPTGANYISRIN 336 (383) Q Consensus 273 p~~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~~~~e~-~~~~-~~~~~~~~~~~~~~ 336 (383) ..-.+..+.+. +..+.+ ..+...+..+++.|..-++. ..+. ++.. +.+..+....+.++ T Consensus 336 ~~~~~~~~~n~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~----~~d~~~i~v~f~~~~p~~~~e~a~ 410 (474) T protein:vir:97 336 DFQTDKFGSAP-SGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL----KTDVKDIEISFNFNRMMNDAEQSQ 410 (474) T ss_pred ccCcccccccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CcccceeeEEeccCcccCHHHHHH Confidence 53322111112 222211 12223333333333222211 1111 1111 11222233445556 Q ss_pred HHHhCCCcCHHHHHHHhhcCCcCC--cchhHH--------hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
337 SMVKSGTLAQNQGLYILQQAEILP--KELPKG--------ENPNRTILKGGETNGQD 383 (383) Q Consensus 337 ~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~--------~~~~~~~~~ggd~~~~d 383 (383) .+.+.|+++.-.++++++. ++. .|+.+. +..+.. ..+|.+.+++ T Consensus 411 ~~~~~g~iS~et~l~~l~~--v~D~~~E~eri~~E~~~~~~~~~~~-~~~~~~~~~~ 464 (474) T protein:vir:97 411 IIAQSQYLSRETLVKSSPL--VDDYKAELERIEQEQMEYNKQLPNL-DDGGADGAQQ 464 (474) T ss_pred HHHHcCCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHhhcccc-CCCCCCCccc Confidence 6777899999999888743 221 222221 111111 1122221111 No 203 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=97.29 E-value=0.0001 Score=42.41 Aligned_cols=358 Identities=13% Similarity=0.051 Sum_probs=152.8 Q ss_pred CchhhhhhcCCc--ccccccccccchh----hcccccCCceec-hhhhhccHHHHHHHHHHHHhhhhCceeeecch--hh Q lcl|NC_018285. 1 MPIFNLATESPP--NNQGGFFDITDPE----FLATLNGSEWVS-AETALKNSDLFSIISQLSNDLATAKLTTSRKQ--MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~--~~~~~~~~~~~~~----~~~~~~~~~~~~-~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--~~ 71 (383) ..+.+....+.. .....+....... ......+..... ...=+.++-....|+..+.-+-+-|+++.-.+ .. T Consensus 33 ~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~ 112 (474) T protein:vir:94 33 VRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENVL 112 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCcHHHH Confidence 111111100000 0000000000000 000000000000 00001123344456666666666676654322 22 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-ceeEE---EEeecCccccc Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-NGLYY---NVTFDDPRIPP 147 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~~~~y---~~~~~~~~~~~ 147 (383) ..+..-+. .........+..++..+|.||+.+.++.+|.+ .+..++|..+-+..++.. ..+.+ .+...+. .. 
T Consensus 113 ~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~--~~ 188 (474) T protein:vir:94 113 KVIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNE--EK 188 (474) T ss_pred HHHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEecCCCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCe--EE Confidence 22211111 12345556677889999999999999998875 466788888877765421 11111 0110000 00 Q ss_pred ceeecccc-----------------------------------eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 148 KQHVPQSD-----------------------------------ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTL 192 (383) Q Consensus 148 ~~~~~~~d-----------------------------------vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 192 (383) ...+.... |+++++ ...|.|-+..+...++....+..... T Consensus 189 ~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~~~s~~~ 263 (474) T protein:vir:94 189 VEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDKRLSDAQ 263 (474) T ss_pred EEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecC-----CcCCCCcHHHHHHHHHHHHHHHHHHH Confidence 01111111 333332 12477777777777776665555555 Q ss_pred HHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_018285. 193 NSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIP 272 (383) Q Consensus 193 ~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVp 272 (383) +.+...+.|..+++.-.....+ .+... 
....+++.+++|.+...++.......+.+..+...+.|...-++| T Consensus 264 ~~~~~~~~~~lv~~g~~~~~~~---~~~~~-----~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p 335 (474) T protein:vir:94 264 NMFDESVELIYILKGYEGEDLE---EFMRG-----LKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGV 335 (474) T ss_pred HHHHHhcCceeeeecCCcccch---hhhhh-----hhccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCcc Confidence 5555566666665542221111 11111 113456666666665555555455556667777788888888888 Q ss_pred HHHhcccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcchhhc-cchh-hhccCHHHHHHHHH Q lcl|NC_018285. 273 ENVVGGQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKLSCDVDA-DIFP-AVDPTGANYISRIN 336 (383) Q Consensus 273 p~~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~~~~e~-~~~~-~~~~~~~~~~~~~~ 336 (383) ..-.+..+.+. +..+.+ ..+...+..+++.|..-++. ..+. ++.. +.+..+....+.++ T Consensus 336 ~~~~~~~~~n~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~----~~d~~~i~v~f~~~~p~~~~e~a~ 410 (474) T protein:vir:94 336 DFQTDKFGSAP-SGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL----KTDVKDIEISFNFNRMMNDAEQSQ 410 (474) T ss_pred ccCcccccccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CcccceeeEEeccCcccCHHHHHH Confidence 53322111112 222211 12223333333333222211 1111 1111 11222233445556 Q ss_pred HHHhCCCcCHHHHHHHhhcCCcCC--cchhHH--------hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 337 SMVKSGTLAQNQGLYILQQAEILP--KELPKG--------ENPNRTILKGGETNGQD 383 (383) Q Consensus 337 ~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~--------~~~~~~~~~ggd~~~~d 383 (383) .+.+.|+++.-.++++++. ++. .|+.+. +..+.. 
..+|.+.+++ T Consensus 411 ~~~~~g~iS~et~l~~l~~--v~D~~~E~eri~~E~~~~~~~~~~~-~~~~~~~~~~ 464 (474) T protein:vir:94 411 IIAQSQYLSRETLVKSSPL--VDDYKAELERIEQEQMEYNKQLPNL-DDGGADGAQQ 464 (474) T ss_pred HHHHcCCCCHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHhhcccc-CCCCCCCccc Confidence 6777899999999888743 221 222221 111111 1122221111 No 204 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=97.21 E-value=0.00013 Score=41.87 Aligned_cols=370 Identities=10% Similarity=0.028 Sum_probs=145.6 Q ss_pred CchhhhhhcCC-c--ccccccccccchhhcccc---cCCceechhhhhccHHHHHHHHHHHHhhhhCceeee--cchhhh Q lcl|NC_018285. 1 MPIFNLATESP-P--NNQGGFFDITDPEFLATL---NGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTS--RKQMQG 72 (383) Q Consensus 1 Mglf~~~~~~~-~--~~~~~~~~~~~~~~~~~~---~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~--~~~~~~ 72 (383) .+|.++...++ + .....+............ ......+.+ +..+-....|+..+.-+-+-|+++. +..... T Consensus 28 ~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~k--i~~n~~~~Iv~~~~~~l~G~p~~~~~~d~~~~~ 105 (506) T protein:vir:94 28 MKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHR--ATHSFAKYIADFQTSYSVGNPINVKLPDDGSNS 105 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccCCcce--eecchHHHHHHHhhhhhcccCceeecCcchHHH Confidence 22222211100 0 000000000000000000 000000011 1223344455555555545565543 222323 Q ss_pred hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-e----eE-EEEeecCcccc Q lcl|NC_018285. 73 IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-G----LY-YNVTFDDPRIP 146 (383) Q Consensus 73 l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-~----~~-y~~~~~~~~~~ 146 (383) .+.+-............+..+++.+|.||+.+.++.+|++ .+..++|..+.+..++... . +. |.......... 
T Consensus 106 ~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~-~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~ 184 (506) T protein:vir:94 106 GFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEE-HLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQV 184 (506) T ss_pred HHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEecCCCCCceEEEEEEEeeeeccCCce Confidence 3333222334455566778889999999999999988876 4667888888776654221 1 11 11000000000 Q ss_pred -----cceeeccc-------------------------ceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 147 -----PKQHVPQS-------------------------DILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLK 196 (383) Q Consensus 147 -----~~~~~~~~-------------------------dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ 196 (383) ....+... .|+++++. -.|.|.+......++....+.-...+... T Consensus 185 ~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~~~sd~e~~~~liDa~d~~~S~~~~~~~ 259 (506) T protein:vir:94 185 STINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITTFPVVEFKNS-----NFRLGDFENVLPLIDLYDAAQSDTANYMT 259 (506) T ss_pred eEEEEEEEEEeCceEEEeccccCccceeccccccCCccceEEecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHH Confidence 00001111 12333221 13555555555444444333322222222 Q ss_pred ccCCcceeEeecCCC---------------------CHHHHHHHHHHH-H---HhhcCCcceeecCCCceeeecccChhh Q lcl|NC_018285. 197 NALNANGILKIKGGG---------------------LLDFKTKVSRSR-Q---AMKQMQGGPLVLDDLEDFTPLEIKSNV 251 (383) Q Consensus 197 ng~~~~~i~~~~~~~---------------------~~e~~~~~~~~~-~---~~~~~~g~~~vl~~g~~~~~~~~~~~d 251 (383) ..+.|-.+++..... ..+......+.. . -.....+.+...+.+.++.-++.+..+ T Consensus 260 ~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~ 339 (506) T protein:vir:94 260 DLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDV 339 (506) T ss_pred HhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCH Confidence 222233333211000 001111111111 1 111111222223344455555665566 Q ss_pred HHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHH-------------HHHHHHHHHHHHHHHHHHHHHHHhhcc--h- Q lcl|NC_018285. 
252 AQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLE-------------MSSNVYSKAVARYLRPFLSELSQKLSC--D- 315 (383) Q Consensus 252 ~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e-------------~~~~~~~~~l~P~~~~i~~~l~~~l~~--~- 315 (383) ..+....+.....|+..-++|..-.+..+.+.+... ..+..+...+...++.|...++.. .. . T Consensus 340 ~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~-~~~~~~ 418 (506) T protein:vir:94 340 VGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSI-HGDWTF 418 (506) T ss_pred HHHHHHHHHHHHHHHHHhCccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCcccc Confidence 667778888899999999999643322111222111 112233344444444443333211 11 1 Q ss_pred ----hhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHhC-----CCCCCCCCCCCCC--- Q lcl|NC_018285. 316 ----VDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILP--KELPKGEN-----PNRTILKGGETNG--- 381 (383) Q Consensus 316 ----~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~~-----~~~~~~~ggd~~~--- 381 (383) +++......-.+..+.+..+.++ .|+++...++++++ .++. .|+.+++. .......++..+. T Consensus 419 d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~lp--~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 494 (506) T protein:vir:94 419 DPQELTFTFRDNLPADNISQIKALVQA--GATLPQKYLYQQLP--GVTNPQDIVDMMKEQSANGDYSFDQNGVISNDGQT 494 (506) T ss_pred ccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCC--CCCCHHHHHHHHHHHHHHHhhcchhhcCCCcccCc Confidence 11112222234455666666665 58999999988863 3332 22222110 0000011111111 Q ss_pred ---CC Q lcl|NC_018285. 
382 ---QD 383 (383) Q Consensus 382 ---~d 383 (383) ++ T Consensus 495 ~~~~~ 499 (506) T protein:vir:94 495 NTTAT 499 (506) T ss_pred ccccc Confidence 11 No 205 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=96.95 E-value=0.00024 Score=40.42 Aligned_cols=363 Identities=12% Similarity=0.045 Sum_probs=157.2 Q ss_pred CchhhhhhcC-Ccc---cccccccccchhh----cccccCCceechhh---hhccHHHHHHHHHHHHhhhhCceeeecch Q lcl|NC_018285. 1 MPIFNLATES-PPN---NQGGFFDITDPEF----LATLNGSEWVSAET---ALKNSDLFSIISQLSNDLATAKLTTSRKQ 69 (383) Q Consensus 1 Mglf~~~~~~-~~~---~~~~~~~~~~~~~----~~~~~~~~~~~~~~---a~~~~~v~~~i~~ia~~ia~~p~~~~~~~ 69 (383) ..+++.+..+ +.. .-..+........ .............. =+.++-....|+..+.-+-+-|+++.-.+ T Consensus 25 ~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l~g~p~~~~~~~ 104 (479) T protein:vir:79 25 VKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLVDQKVGYSVGNPIVFNADD 104 (479) T ss_pred HHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCC Confidence 2222222111 000 0000000000000 00000000000000 01233344456666665656666654222 Q ss_pred --hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc--ee----EEEEeec Q lcl|NC_018285. 70 --MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN--GL----YYNVTFD 141 (383) Q Consensus 70 --~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~--~~----~y~~~~~ 141 (383) ...++..-..+ ........+..+...+|.+|..+..+.+|++. +..++|..+.+..++... .. +|..... 
T Consensus 105 ~~~~~~~~~~~~n-~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~ 182 (479) T protein:vir:79 105 DNLTKLLNDLLGE-EFDDTITELYLNASNKGVEWLHPYINRKGEFK-YVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDI 182 (479) T ss_pred HHHHHHHHHHHhc-CHHHHHHHHHHHHHhcCeEEEEEEeCCCCceE-EEEEccceeEEEEeCCCCCceEEEEEEEEEeec Confidence 11222111111 34555567778899999999999998888764 777888887776543321 11 1111111 Q ss_pred Cccc-ccceeecccc--------------------------------------------eEEeccCCCCccccCcchHHH Q lcl|NC_018285. 142 DPRI-PPKQHVPQSD--------------------------------------------ILHFRLLSVDGGLTSVSPLMA 176 (383) Q Consensus 142 ~~~~-~~~~~~~~~d--------------------------------------------vih~~~~~~~~~~~G~s~~~~ 176 (383) ++.. .....+.... |+++++ ...|.|-+.. T Consensus 183 ~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~sd~~~ 257 (479) T protein:vir:79 183 DGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKN-----NEKCVSDLTF 257 (479) T ss_pred CCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecC-----CCCCCcchhh Confidence 1000 0011122222 233322 1246777777 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC-CCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHH Q lcl|NC_018285. 177 LGRELDIQKASDKLTLNSLKNALNANGILKIKGG-GLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLL 255 (383) Q Consensus 177 ~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~-~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~ 255 (383) +...++....+.....+.+...+.|-.+++.-.. ..++.... ...++++.++++.+++-+..+..+..+. T Consensus 258 v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~---------~~~~~~i~~~~~~~~~~l~~~~~~~~~~ 328 (479) T protein:vir:79 258 YKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFIDN---------IRYYKSIKVDGGGGVDKLEINIPVEAKK 328 (479) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchhh---------hhhccceecCCCCcceEEeccCCHHHHH Confidence 7777766666555555566666667666654221 11221111 1124566666665555555554555567 Q ss_pred HHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhhcch-----h Q lcl|NC_018285. 
256 KQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMS--------------SNVYSKAVARYLRPFLSELSQKLSCD-----V 316 (383) Q Consensus 256 e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~--------------~~~~~~~l~P~~~~i~~~l~~~l~~~-----~ 316 (383) +..+...+.|+..-++|..-.+..++ . +..+. +..+...+.-.++.+...++..-..+ + T Consensus 329 ~~~~~l~~~i~~~s~~p~~~~~~~gn-~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i 406 (479) T protein:vir:79 329 ELLDRLEKNIIIFGQGVNPESQNTGD-K-SGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYDYKTV 406 (479) T ss_pred HHHHHHHHHHHHHhCccccccccccc-h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccc Confidence 77788888888888888654443222 1 11121 11222333333333333222211111 1 Q ss_pred hccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhC-------CCCCCCCCCCCCCCC Q lcl|NC_018285. 317 DADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGEN-------PNRTILKGGETNGQD 383 (383) Q Consensus 317 e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~-------~~~~~~~ggd~~~~d 383 (383) ++......-.+..+.+..+.++ .|+++...+.++++.-.=+..|+.+.+. ........++...+| T Consensus 407 ~i~f~~~~p~~~~~~a~~~~kl--~g~iS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~e 478 (479) T protein:vir:79 407 QITFNHSMIINEAEKIDMAAKS--TGIVSDETIVSNHPWVEDVNDELERLKKQEDTQKEYDDLIPNNQDGVIDE 478 (479) T ss_pred eEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhccCcccCCCcCc Confidence 1111222223455666666665 4899998888877432100123322210 111111122222222 No 206 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=96.95 E-value=0.00024 Score=40.40 Aligned_cols=358 Identities=13% Similarity=0.024 Sum_probs=149.4 Q ss_pred CchhhhhhcCCc--ccccccccccchhhc---ccccCCcee--chhhhhccHHHHHHHHHHHHhhhhCceeeecch--h- Q lcl|NC_018285. 
1 MPIFNLATESPP--NNQGGFFDITDPEFL---ATLNGSEWV--SAETALKNSDLFSIISQLSNDLATAKLTTSRKQ--M- 70 (383) Q Consensus 1 Mglf~~~~~~~~--~~~~~~~~~~~~~~~---~~~~~~~~~--~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--~- 70 (383) =.+.+....+.. .....+.....+... -....+... .+..=+.++-....|+..+.-+-+-|+++.-.+ . T Consensus 32 ~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~ 111 (474) T protein:vir:96 32 IRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRMFTNYHQNLVDQKVAYAVANPVTFSSDDDKSL 111 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchhcccchHHHHHHhhhhhhcccCceeecCchHHH Confidence 001111000000 000000000000000 000000000 000001123334455555555545565543221 1 Q ss_pred hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC--ceeE--EEEeecCcccc Q lcl|NC_018285. 71 QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ--NGLY--YNVTFDDPRIP 146 (383) Q Consensus 71 ~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~--y~~~~~~~~~~ 146 (383) ..|...-+ .........+..++..+|.||+.+.++.+|++. +..++|..+.+..++.. ...+ +.+..... . T Consensus 112 ~~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~-i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~--~ 186 (474) T protein:vir:96 112 KTIQEVLN--HKWDDKLVDILTAASNKGIEWLQPYIDENGEFK-TFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGA--E 186 (474) T ss_pred HHHHHHHh--cCHHHHHHHHHHHHHhcCeeEEEEEecCCCceE-EEEEcccceEEEEcCCCCCceEEEEEEEeecCc--e Confidence 11111111 123344456678888999999999998888764 77788888877765421 1111 11111100 0 Q ss_pred cceeecccc---------------------------------------eEEeccCCCCccccCcchHHHHHHHHHHHHHH Q lcl|NC_018285. 147 PKQHVPQSD---------------------------------------ILHFRLLSVDGGLTSVSPLMALGRELDIQKAS 187 (383) Q Consensus 147 ~~~~~~~~d---------------------------------------vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~ 187 (383) ....+.... 
|+++++ ...|.|-+......++....+ T Consensus 187 ~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~e~v~~liDa~d~~ 261 (474) T protein:vir:96 187 RVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKN-----NPQEMSDLFMYKTIIDAMDKR 261 (474) T ss_pred EEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCceeEEEecc-----CCCCCCcHHHHHHHHHHHHHH Confidence 011111112 233332 124777777777777666666 Q ss_pred HHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecC-CCceeeecccChhhHHHHHHHHHHHHHHH Q lcl|NC_018285. 188 DKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVAQLLKQADWTTGQFA 266 (383) Q Consensus 188 ~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia 266 (383) .....+.+...+.|-.+++.-...+. ...... -...+++.++ .|.+++.++.+.....+.+..+...+.|+ T Consensus 262 ~S~~~~~~~~~~~~~lv~~g~~~~~~---~~~~~~-----~~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~ 333 (474) T protein:vir:96 262 LSDTQNTFDESTELIYILKGYEGQDL---DEFMRN-----LKYYKAINVDGDGSGVDTIQIEVPVQSSKEYLDMLRDYVI 333 (474) T ss_pred HHHHHHHHHHhccceeeeecCCcccc---cchhhh-----hhcCceEEecCCCCceeEEeecCChHHHHHHHHHHHHHHH Confidence 65566666666667666553222111 111111 1124555554 45555555555445566777788889999 Q ss_pred HHhcCCHHHhcccccCcCHHHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcchhhc-cchh-hhccCHHH Q lcl|NC_018285. 267 KVYGIPENVVGGQGDQQSSLEMSSN--------------VYSKAVARYLRPFLSELSQKLSCDVDA-DIFP-AVDPTGAN 330 (383) Q Consensus 267 ~~~gVpp~~lg~~~~~~~~~e~~~~--------------~~~~~l~P~~~~i~~~l~~~l~~~~e~-~~~~-~~~~~~~~ 330 (383) ..-++|..-.+..+.+. +..+.+. .+...+...++.|..-+ ....+. ++.. +.+..+.. 
T Consensus 334 ~~s~~p~~~~~~~~~n~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~----~~~~~~~~i~i~f~~~~p~~ 408 (474) T protein:vir:96 334 EFGQGVDFQQDKFGNSP-SGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFY----KLNIKVQDVEITFNFNVMVN 408 (474) T ss_pred HHhCCcccccccccccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCCcccceeeEEeccCCCcC Confidence 99999865443222222 2222221 22222222222222111 111110 1110 11112223 Q ss_pred HHHHHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHh------CCCCCCCCC-----CCCCCCC Q lcl|NC_018285. 331 YISRINSMVKSGTLAQNQGLYILQQAEILP--KELPKGE------NPNRTILKG-----GETNGQD 383 (383) Q Consensus 331 ~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~------~~~~~~~~g-----gd~~~~d 383 (383) ..+.++.+.++|+++...+++.++. ++. -|+.+.+ .....+.++ .++++++ T Consensus 409 ~~e~~~~~~~ag~iS~et~~~~~~~--v~d~~~E~~ri~~E~~e~~~~~~~~~~~~~~~~~d~~~e 472 (474) T protein:vir:96 409 ELEQSQIGVQSQYLSKETVVTNHPW--VDDPVAELERIEQDNIDFNKQLPPLEGDANGRAQDNESE 472 (474) T ss_pred HHHHHHHHHhcCCCchHHHHHhCCC--CCCHHHHHHHHHHHHHHHHhcccccccccccccCCCccc Confidence 3444555677899999999987643 221 1222211 011112222 1112222 No 207 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=96.83 E-value=0.00031 Score=39.83 Aligned_cols=373 Identities=10% Similarity=0.001 Sum_probs=154.1 Q ss_pred CchhhhhhcCCcc--cccccccccchhhccc--ccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeec--chhhhhc Q lcl|NC_018285. 1 MPIFNLATESPPN--NQGGFFDITDPEFLAT--LNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSR--KQMQGIV 74 (383) Q Consensus 1 Mglf~~~~~~~~~--~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~--~~~~~l~ 74 (383) -.+.+....+... ....+...... .... ...+. .+.+ +..+-....|+..+.-+-.-|+++.. 
......+ T Consensus 23 ~~~i~~~~~~~~r~~~~~~yy~g~~~-i~~~~~~~~~~-~~~k--i~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l 98 (453) T protein:vir:73 23 NDFMKKHQEEVERYEYLGNMYKGIME-ISSQKAKDSWK-PDNR--LTNNFAKYIVDTFVGYFNGIPIKKTHDDKSVLEAM 98 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccc-hhcCCCCCccC-ccce--eecchHHHHHHHhhhhhcccCceeecCChHHHHHH Confidence 1111111110000 00000000000 0000 00000 0111 11222333444444444444555432 2222222 Q ss_pred cCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCce-eEE--EEeecCcccccceee Q lcl|NC_018285. 75 DNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNG-LYY--NVTFDDPRIPPKQHV 151 (383) Q Consensus 75 ~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~-~~y--~~~~~~~~~~~~~~~ 151 (383) ..-............+..+.+.+|.||+.+.++.+|.+ .+..++|..+.+..++.... ..+ ++..+.........+ T Consensus 99 ~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~vy 177 (453) T protein:vir:73 99 QLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTES-EVIYCSPLNVFMVYDDSIKQKPLFAVYYGFDEEGNLSGTVY 177 (453) T ss_pred HHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEcccceEEEEeCCCCceeEEEEEEEEecCceEEEEEE Confidence 22222234445566788899999999999999998877 46677888887766543221 111 111111111111223 Q ss_pred cccceEEeccC-----------C---------CCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCC Q lcl|NC_018285. 152 PQSDILHFRLL-----------S---------VDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGG 211 (383) Q Consensus 152 ~~~dvih~~~~-----------~---------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~ 211 (383) ..+.++++... 
+ ......|.|-+..+...++....+...........+.|..+++.- .+ T Consensus 178 t~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~-~~ 256 (453) T protein:vir:73 178 TLLETISITGKAGEVKFGESTYNVYSDLPIVEYNFNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGA-EV 256 (453) T ss_pred eCCeEEEEEecCCceEEccceeccCCceeEEEecCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecC-CC Confidence 33333333211 0 001124777777666666665555555555555556676666543 33 Q ss_pred CHHHHHHHHHHH--HHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHH Q lcl|NC_018285. 212 LLDFKTKVSRSR--QAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMS 289 (383) Q Consensus 212 ~~e~~~~~~~~~--~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~ 289 (383) .++....++... .......+.....+.+.++.-++....+..+....+.....|+..-++|..-....+ +.+ ..+. T Consensus 257 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~g-n~S-g~Al 334 (453) T protein:vir:73 257 DEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISDENFG-NSS-GVAL 334 (453) T ss_pred CchhhhcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCccccc-Ccc-HHHH Confidence 333333333221 111122222333344555544444444555667777888888888888853222111 122 2222 Q ss_pred H--------------HHHHHHHHHHHHHHHHHHHHhhc----chhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHH Q lcl|NC_018285. 290 S--------------NVYSKAVARYLRPFLSELSQKLS----CDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLY 351 (383) Q Consensus 290 ~--------------~~~~~~l~P~~~~i~~~l~~~l~----~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~ 351 (383) + ..+...+.-.++.+..-++..-. ..+++......-.+..+.+..+.++. 
|+++.-.+.+ T Consensus 335 ~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~--giis~et~~~ 412 (453) T protein:vir:73 335 AYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAWKDIEYTFTRNEPKDIKEQAETANILK--GITSEETALS 412 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCcHHHHHH Confidence 1 12223333333333221111100 01122222233345566677777764 8899887877 Q ss_pred HhhcCCcCCcchhHHhC-----CCCC---CCCCCCCCCCC Q lcl|NC_018285. 352 ILQQAEILPKELPKGEN-----PNRT---ILKGGETNGQD 383 (383) Q Consensus 352 ~lg~~~~~~~d~~~~~~-----~~~~---~~~ggd~~~~d 383 (383) .++.-.=+..|+.+++. .... ...-.|.+-+| T Consensus 413 ~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 452 (453) T protein:vir:73 413 VISVIPDVQAEMEKIKKKKLLQLSLTRTSNLVRMKQMRGN 452 (453) T ss_pred hCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCcchhhhcC Confidence 66432100123222111 0000 00011122222 No 208 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=96.65 E-value=0.00044 Score=39.01 Aligned_cols=365 Identities=9% Similarity=-0.008 Sum_probs=146.6 Q ss_pred Cc-------------hhhhhhcC------Ccccccccccccchhhcccc-cCCceechhhhhccHHHHHHHHHHHHhhhh Q lcl|NC_018285. 1 MP-------------IFNLATES------PPNNQGGFFDITDPEFLATL-NGSEWVSAETALKNSDLFSIISQLSNDLAT 60 (383) Q Consensus 1 Mg-------------lf~~~~~~------~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~ 60 (383) -| ...++.++ +......+...... ..... 
......+.+ +..+-....|+..+.-+-+ T Consensus 6 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~-i~~~~~~~~~~~~~k--i~~n~~~~ivd~~~~~l~g 82 (452) T protein:vir:36 6 PKLMTFSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMA-IDDEPAKDSWKPDNR--LAVNFTKYIVDTFTGYFNG 82 (452) T ss_pred ceeEEcCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccc-cccCccccccCccce--eecchHHHHHHHHhhhhcc Confidence 11 11111000 00000000000000 00000 000000111 1122333344545544444 Q ss_pred Cceeee--cchhhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCc-eeEE- Q lcl|NC_018285. 61 AKLTTS--RKQMQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQN-GLYY- 136 (383) Q Consensus 61 ~p~~~~--~~~~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~-~~~y- 136 (383) -|+++. +......+.+-............+..+.+.+|.||..+.++.+|.+ .+..++|..+....++... ...+ T Consensus 83 ~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~ 161 (452) T protein:vir:36 83 IPVKKSHSDKEILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQT-NVVYNSPENMFMVYDDTVKQEPLFA 161 (452) T ss_pred cCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEEcCCCCCceEEE Confidence 555543 2222222222222234455567788899999999999999988876 4677888888776654321 1111 Q ss_pred -EEeecCcccccceeecccceEEeccC-----------C---------CCccccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 137 -NVTFDDPRIPPKQHVPQSDILHFRLL-----------S---------VDGGLTSVSPLMALGRELDIQKASDKLTLNSL 195 (383) Q Consensus 137 -~~~~~~~~~~~~~~~~~~dvih~~~~-----------~---------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 195 (383) ++............+.++.+.++... 
+ ..+...|.|-+..+...++....+.....+.+ T Consensus 162 i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~ 241 (452) T protein:vir:36 162 VRYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYNPYPDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDV 241 (452) T ss_pred EEEEEecCceEEEEEEecCeEEEEEEcCCceEEecceeccCCcccEEEecCCCCCCcchHHHHHHHHHHHHHHHHHHHHH Confidence 11111000001111222222222110 0 00112467777666666665555555555555 Q ss_pred hccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCC-----CceeeecccChhhHHHHHHHHHHHHHHHHHhc Q lcl|NC_018285. 196 KNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDD-----LEDFTPLEIKSNVAQLLKQADWTTGQFAKVYG 270 (383) Q Consensus 196 ~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~-----g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~g 270 (383) ...+.|-.+++.. ...++....++. ++++.++. +.++.-+..+..+..+....+...+.|+..-+ T Consensus 242 ~~~~~p~~~~~g~-~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~ 311 (452) T protein:vir:36 242 DYFSDQYLTFLGA-AVEEEDLKNIRS---------NRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTM 311 (452) T ss_pred HHhcCceeEeecC-CcCchhhhhhhh---------cceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhC Confidence 5566676666532 233333222211 12222221 22333333333445566777888888988888 Q ss_pred CCHHHhcccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcc----hhhccchhhhccCHHHHH Q lcl|NC_018285. 271 IPENVVGGQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKLSC----DVDADIFPAVDPTGANYI 332 (383) Q Consensus 271 Vpp~~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~~----~~e~~~~~~~~~~~~~~~ 332 (383) +|..-.+..+ +.+ ..+.+ ..+...+...++.|..-++..-.. 
++++......-.+..+.+ T Consensus 312 ~p~~~~~~~g-n~S-g~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~a 389 (452) T protein:vir:36 312 VANISDESFG-SSS-GVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWKDIEYTFTRNEPKDIKEQA 389 (452) T ss_pred ccccCccccc-CCc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHH Confidence 9853332221 122 11211 122233333333332222211000 111111222234455666 Q ss_pred HHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhC---------CCC-CCCCC----CCCCCCC Q lcl|NC_018285. 333 SRINSMVKSGTLAQNQGLYILQQAEILPKELPKGEN---------PNR-TILKG----GETNGQD 383 (383) Q Consensus 333 ~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~---------~~~-~~~~g----gd~~~~d 383 (383) ..+.++ .|+++.-.+.+.++.-.=+..|+.+++. .+. .+.+| .+.+++| T Consensus 390 ~~~~k~--~g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 390 ETANIL--MGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVSETNEE 452 (452) T ss_pred HHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccCCCCcccccCccccCC Confidence 666665 5889988888776432100122222110 111 11111 1111111 No 209 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=96.63 E-value=0.00046 Score=38.89 Aligned_cols=361 Identities=11% Similarity=0.034 Sum_probs=143.7 Q ss_pred CchhhhhhcCCcc--cccccccccchhhc---ccccCCceechh--hhhccHHHHHHHHHHHHhhhhCceeeecch--hh Q lcl|NC_018285. 1 MPIFNLATESPPN--NQGGFFDITDPEFL---ATLNGSEWVSAE--TALKNSDLFSIISQLSNDLATAKLTTSRKQ--MQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~--~~~~~~~~~~~~~~---~~~~~~~~~~~~--~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--~~ 71 (383) ..+.+....+.+. ....+......... .....+...... .=+.++-...+|+..+.-+-+-|+++.-.+ .. 
T Consensus 32 ~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~ 111 (478) T protein:vir:10 32 LRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKAL 111 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCeeeecCChHHH Confidence 1111110000000 00000000000000 000000000000 001122333455555555555566643222 11 Q ss_pred -hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-ceeEE---EEeecCcccc Q lcl|NC_018285. 72 -GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-NGLYY---NVTFDDPRIP 146 (383) Q Consensus 72 -~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~~~~y---~~~~~~~~~~ 146 (383) .|..--+ .........+..++..+|.||+.+..+.+|++ .+..++|..+.+..++.. +.+.+ .+...+.. T Consensus 112 ~~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~-~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~-- 186 (478) T protein:vir:10 112 KQIQHTLN--HKWDDKLVDILTAASNKGIEWVQPYVDEEGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAE-- 186 (478) T ss_pred HHHHHHHh--cCHHHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCce-- Confidence 1111111 13455666778899999999999999888876 466788888877655321 11110 11111000 Q ss_pred cceeecccc---------------------------------------eEEeccCCCCccccCcchHHHHHHHHHHHHHH Q lcl|NC_018285. 147 PKQHVPQSD---------------------------------------ILHFRLLSVDGGLTSVSPLMALGRELDIQKAS 187 (383) Q Consensus 147 ~~~~~~~~d---------------------------------------vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~ 187 (383) ....+..+. |+++++ ...|.|-+..+...++....+ T Consensus 187 ~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----~~~g~sd~~~v~~liDa~~~~ 261 (478) T protein:vir:10 187 RVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFMYKTIIDALDKR 261 (478) T ss_pred EEEEEeCCeEEEEEEcCCeeeccccccccccccceecccccccCCccceEEecc-----CCCCCCcHHHHHHHHHHHHHH Confidence 001111112 333332 234777777666666666555 Q ss_pred HHHHHHHHhccCCcceeEeecCCCC-HHHHHHHHHHHHHhhcCCcceeec--CCCceeeecccChhhHHHHHHHHHHHHH Q lcl|NC_018285. 
188 DKLTLNSLKNALNANGILKIKGGGL-LDFKTKVSRSRQAMKQMQGGPLVL--DDLEDFTPLEIKSNVAQLLKQADWTTGQ 264 (383) Q Consensus 188 ~~~~~~~~~ng~~~~~i~~~~~~~~-~e~~~~~~~~~~~~~~~~g~~~vl--~~g~~~~~~~~~~~d~~~~e~~~~~~~~ 264 (383) .....+.++..+.|-.+++.-.... .+....++ .++++.+ +.|.+..-+........+.+..+...+. T Consensus 262 ~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 332 (478) T protein:vir:10 262 LSDTQNTFDESVELIYILKGYEGEDMKDFMHNLK---------YYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDY 332 (478) T ss_pred HHHHHHHHHHhhCceeeeecCCccccchhhhhhh---------hcceEEecCCCCCcceEEeecCChHHHHHHHHHHHHH Confidence 5555555555566665554322211 11111111 1223323 2333333344443445566777777888 Q ss_pred HHHHhcCCHHHhcccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhh-cchhhccchhhhccCHH Q lcl|NC_018285. 265 FAKVYGIPENVVGGQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKL-SCDVDADIFPAVDPTGA 329 (383) Q Consensus 265 Ia~~~gVpp~~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l-~~~~e~~~~~~~~~~~~ 329 (383) |...-++|..-.+..+.+. +..+.+ ..+...+.-.++.+...+.... ..++++......-.+.. T Consensus 333 i~~~s~~p~~~~~~~~~n~-Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~~~~~~~i~i~f~~~~p~d~~ 411 (478) T protein:vir:10 333 IIEFGQGVDFQQDKFGNSP-SGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVKVQDIEITFNFNVMVNEL 411 (478) T ss_pred HHHHhCccccCcccccccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEecCCCCCCHH Confidence 8888888854433222212 222221 1222222222222211111000 00011111222223455 Q ss_pred HHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhC------CCCCCCC---------CCCCCCCC Q lcl|NC_018285. 330 NYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGEN------PNRTILK---------GGETNGQD 383 (383) Q Consensus 330 ~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~------~~~~~~~---------ggd~~~~d 383 (383) +.+..+.++ +|+++...+++.++.-.=+..|+.+++. ....... 
++|++..| T Consensus 412 e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 412 ENSQIAMNS--TGLLSKETILSNHAWVEDPVAEMERIEQENIELNQQLPDIEEGLNGEQQRQSENNQPE 478 (478) T ss_pred HHHHHHHHH--hCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCCCCCCCCC Confidence 666666665 6899998888887532101122222110 0001111 11111111 No 210 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=96.61 E-value=0.00047 Score=38.84 Aligned_cols=365 Identities=12% Similarity=0.028 Sum_probs=147.9 Q ss_pred CchhhhhhcCCcc--cccccccccchh----hcccccCCceec-hhhhhccHHHHHHHHHHHHhhhhCceeeecchh--h Q lcl|NC_018285. 1 MPIFNLATESPPN--NQGGFFDITDPE----FLATLNGSEWVS-AETALKNSDLFSIISQLSNDLATAKLTTSRKQM--Q 71 (383) Q Consensus 1 Mglf~~~~~~~~~--~~~~~~~~~~~~----~~~~~~~~~~~~-~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~--~ 71 (383) -.+.+....+... ....+....... .-....+....+ +..-+.+.-....|+..+.-+-+-|+++.-.+. . T Consensus 33 ~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~ 112 (474) T protein:vir:95 33 IRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVL 112 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCceeccCChHHH Confidence 1111100000000 000000000000 000000000000 000011222333455555555556666532221 1 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC--ceeEE--EEeecCccccc Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ--NGLYY--NVTFDDPRIPP 147 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~y--~~~~~~~~~~~ 147 (383) ..+..-.. .........+..++..+|.||..+-++.+|.+ .+..++|..+-+..++.. ...++ .+.... ... 
T Consensus 113 ~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~--~~~ 188 (474) T protein:vir:95 113 DVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNG--ETK 188 (474) T ss_pred HHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeecC--eeE Confidence 11111011 13445556778899999999999999988876 566788888877664321 11110 111110 001 Q ss_pred ceeecccceEEeccC------------------------------CCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_018285. 148 KQHVPQSDILHFRLL------------------------------SVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKN 197 (383) Q Consensus 148 ~~~~~~~dvih~~~~------------------------------~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 197 (383) ...+....+.++..- ...+...|.|-+......++....+..-..+.+.. T Consensus 189 ~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~ 268 (474) T protein:vir:95 189 VEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDE 268 (474) T ss_pred EEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 111222222222100 00011346777766666666655554444555555 Q ss_pred cCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhc Q lcl|NC_018285. 198 ALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVG 277 (383) Q Consensus 198 g~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg 277 (383) .+.|-.+++.-+. +......... ...+++.++++.+...++.+..+..+.+..+...+.|+..-++|..-.. 
T Consensus 269 ~~~p~lv~~g~~~---~~~~~~~~~~-----~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~ 340 (474) T protein:vir:95 269 SVELIYILRGYEG---EDLSEFMEGL-----KYYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTD 340 (474) T ss_pred hhcchhhhcCCCc---ccccchhhhh-----hccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccc Confidence 5556555543211 1111111111 1234566666655555555555566777788888899999999854433 Q ss_pred ccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcchhhc-cchh-hhccCHHHHHHHHHHHHhC Q lcl|NC_018285. 278 GQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKLSCDVDA-DIFP-AVDPTGANYISRINSMVKS 341 (383) Q Consensus 278 ~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~~~~e~-~~~~-~~~~~~~~~~~~~~~l~~~ 341 (383) ..+.+. +..+.+ ..+...+...++.+...++ ...+. ++.. +.+..+....+.++.+.+. T Consensus 341 ~~~~n~-Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g----~~~d~~~i~i~f~~~~p~~~~e~a~~~~~~ 415 (474) T protein:vir:95 341 KFGSAT-SGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNK----IKLDAKEIEITFNFNVMVNDLEQSQIGAQS 415 (474) T ss_pred cccccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CCcccceeeEEecCCCccCHHHHHHHHHHc Confidence 222222 222221 1222222222222222111 11110 1111 1122223334445556678 Q ss_pred CCcCHHHHHHHhhcCCcCCcchhHH--------hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 342 GTLAQNQGLYILQQAEILPKELPKG--------ENPNRTILKGGETNGQD 383 (383) Q Consensus 342 g~~t~nE~r~~lg~~~~~~~d~~~~--------~~~~~~~~~ggd~~~~d 383 (383) |+++.-.+++.++.-.=+..|+.++ +...... 
.++.+++++ T Consensus 416 giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~-~~~~~~~~~ 464 (474) T protein:vir:95 416 QYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLD-DGGADGAQQ 464 (474) T ss_pred CCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccc-cccCCCCCC Confidence 9999999988874321011222221 1122111 111111111 No 211 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=96.61 E-value=0.00047 Score=38.84 Aligned_cols=365 Identities=12% Similarity=0.028 Sum_probs=147.9 Q ss_pred CchhhhhhcCCcc--cccccccccchh----hcccccCCceec-hhhhhccHHHHHHHHHHHHhhhhCceeeecchh--h Q lcl|NC_018285. 1 MPIFNLATESPPN--NQGGFFDITDPE----FLATLNGSEWVS-AETALKNSDLFSIISQLSNDLATAKLTTSRKQM--Q 71 (383) Q Consensus 1 Mglf~~~~~~~~~--~~~~~~~~~~~~----~~~~~~~~~~~~-~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~--~ 71 (383) -.+.+....+... ....+....... .-....+....+ +..-+.+.-....|+..+.-+-+-|+++.-.+. . T Consensus 33 ~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~ 112 (474) T protein:vir:96 33 IRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVL 112 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCceeccCChHHH Confidence 1111100000000 000000000000 000000000000 000011222333455555555556666532221 1 Q ss_pred hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC--ceeEE--EEeecCccccc Q lcl|NC_018285. 72 GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ--NGLYY--NVTFDDPRIPP 147 (383) Q Consensus 72 ~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~y--~~~~~~~~~~~ 147 (383) ..+..-.. .........+..++..+|.||..+-++.+|.+ .+..++|..+-+..++.. ...++ .+.... ... 
T Consensus 113 ~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~--~~~ 188 (474) T protein:vir:96 113 DVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNG--ETK 188 (474) T ss_pred HHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeecC--eeE Confidence 11111011 13445556778899999999999999988876 566788888877664321 11110 111110 001 Q ss_pred ceeecccceEEeccC------------------------------CCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_018285. 148 KQHVPQSDILHFRLL------------------------------SVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKN 197 (383) Q Consensus 148 ~~~~~~~dvih~~~~------------------------------~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 197 (383) ...+....+.++..- ...+...|.|-+......++....+..-..+.+.. T Consensus 189 ~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~ 268 (474) T protein:vir:96 189 VEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDE 268 (474) T ss_pred EEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 111222222222100 00011346777766666666655554444555555 Q ss_pred cCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhc Q lcl|NC_018285. 198 ALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVG 277 (383) Q Consensus 198 g~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg 277 (383) .+.|-.+++.-+. +......... ...+++.++++.+...++.+..+..+.+..+...+.|+..-++|..-.. 
T Consensus 269 ~~~p~lv~~g~~~---~~~~~~~~~~-----~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~ 340 (474) T protein:vir:96 269 SVELIYILRGYEG---EDLSEFMEGL-----KYYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTD 340 (474) T ss_pred hhcchhhhcCCCc---ccccchhhhh-----hccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccc Confidence 5556555543211 1111111111 1234566666655555555555566777788888899999999854433 Q ss_pred ccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcchhhc-cchh-hhccCHHHHHHHHHHHHhC Q lcl|NC_018285. 278 GQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKLSCDVDA-DIFP-AVDPTGANYISRINSMVKS 341 (383) Q Consensus 278 ~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~~~~e~-~~~~-~~~~~~~~~~~~~~~l~~~ 341 (383) ..+.+. +..+.+ ..+...+...++.+...++ ...+. ++.. +.+..+....+.++.+.+. T Consensus 341 ~~~~n~-Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g----~~~d~~~i~i~f~~~~p~~~~e~a~~~~~~ 415 (474) T protein:vir:96 341 KFGSAT-SGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNK----IKLDAKEIEITFNFNVMVNDLEQSQIGAQS 415 (474) T ss_pred cccccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CCcccceeeEEecCCCccCHHHHHHHHHHc Confidence 222222 222221 1222222222222222111 11110 1111 1122223334445556678 Q ss_pred CCcCHHHHHHHhhcCCcCCcchhHH--------hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 342 GTLAQNQGLYILQQAEILPKELPKG--------ENPNRTILKGGETNGQD 383 (383) Q Consensus 342 g~~t~nE~r~~lg~~~~~~~d~~~~--------~~~~~~~~~ggd~~~~d 383 (383) |+++.-.+++.++.-.=+..|+.++ +...... 
.++.+++++ T Consensus 416 giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~-~~~~~~~~~ 464 (474) T protein:vir:96 416 QYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLD-DGGADGAQQ 464 (474) T ss_pred CCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccc-cccCCCCCC Confidence 9999999988874321011222221 1122111 111111111 No 212 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=96.43 E-value=0.00021 Score=40.74 Aligned_cols=297 Identities=10% Similarity=-0.041 Sum_probs=103.1 Q ss_pred Cchhhhh---hcCCcccccccccc---cchhh--cccccCCceechhhhhccHH-HHHHHHHHHHhhhhCc------eee Q lcl|NC_018285. 1 MPIFNLA---TESPPNNQGGFFDI---TDPEF--LATLNGSEWVSAETALKNSD-LFSIISQLSNDLATAK------LTT 65 (383) Q Consensus 1 Mglf~~~---~~~~~~~~~~~~~~---~~~~~--~~~~~~~~~~~~~~a~~~~~-v~~~i~~ia~~ia~~p------~~~ 65 (383) ++.+..- .....-..+....+ ..... .+...... .....+..|- -.+..+++...+..+- +.+ T Consensus 30 ~~~~~~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~--~~~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i 107 (384) T protein:vir:49 30 LDALNGSEWVSAETALKNSDLFSIISQLSNDLATAKITTSRK--QLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYR 107 (384) T ss_pred cccccCCceechhhhhccHHHHHHHHHHHHHHhhCceeeecc--hhhhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEE Confidence 1111110 00000000000000 00000 00000000 0000000000 0011122222221111 111 Q ss_pred ecc-hhh--hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecC Q lcl|NC_018285. 66 SRK-QMQ--GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDD 142 (383) Q Consensus 66 ~~~-~~~--~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~ 142 (383) .+. ... .|..-|....+.. ..-.++-..+.+...++..-....++++.|-..+....... 
T Consensus 108 ~r~~~g~~~~L~~l~~~~v~v~---------~~~~~~~~~y~~~~~~~~~~~~~~~~~~eVih~~~~~~~~~-------- 170 (384) T protein:vir:49 108 WRNENGRDMKWEYLRPSQVSFN---------RLDNQNGLYYNITFDDPRIPPKQHVPQGDILHFRLLSVDGG-------- 170 (384) T ss_pred EECCCCcEEEEEEEcCceeEEE---------EcCCCceEEEEEEecCccccceeEecCccEEEecCCCCCCc-------- Confidence 111 111 1111111111000 00001111111111111111112233333322211100000 Q ss_pred cccccceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHH-------------------------HHhc Q lcl|NC_018285. 143 PRIPPKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLN-------------------------SLKN 197 (383) Q Consensus 143 ~~~~~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~-------------------------~~~n 197 (383) +.+.++. .+-...+...........++..+ .+.. T Consensus 171 ----------------~~G~s~i-----~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~ 229 (384) T protein:vir:49 171 ----------------LTSVSPL-----MALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRSRQAM 229 (384) T ss_pred ----------------eeeccHH-----HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhc Confidence 0011100 00001111111111111111111 1111 Q ss_pred cCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHH----HHHhcCCH Q lcl|NC_018285. 198 ALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQF----AKVYGIPE 273 (383) Q Consensus 198 g~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~I----a~~~gVpp 273 (383) +...+.++-+++.. =+.+-+++..... ..+ ..+.....+-.+ ...+|.. T Consensus 230 ~~n~~~~~vl~~g~----------------------~~~~l~~~~~d~q--~~e--~~~~~~~~Ia~~fgVp~~~lg~~- 282 (384) T protein:vir:49 230 KQMQGGPLVLDDLE----------------------DFTPLEIKSNVAQ--LLS--QADWTTGQFAKVYGIPESVVGGE- 282 (384) T ss_pred ccCCccceecCCCc----------------------eEEEccCChhhHH--HHH--HHHHHHHHHHHHhCCCHHHhCCC- Confidence 12222233222211 1111111111111 111 111111111112 2234442 Q ss_pred HHhcccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------hhhccchhhhccCHHHHHHHHHHHHhCC Q lcl|NC_018285. 
274 NVVGGQGDQQSSLEMSSNVYSKAVARYLRPFLSELSQKLSC-----------DVDADIFPAVDPTGANYISRINSMVKSG 342 (383) Q Consensus 274 ~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~i~~~l~~~l~~-----------~~e~~~~~~~~~~~~~~~~~~~~l~~~g 342 (383) -+..+++.+.+++...|+..++.|+++.|+++|++++.. .+++++...++.+...+.+....+.+.| T Consensus 283 --~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g 360 (384) T protein:vir:49 283 --GDKQSSLEMIYNIYFKAVSRFLRPFVSELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAE 360 (384) T ss_pred --CCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCC Confidence 123445567788889999999999999999999998853 2456677777888888888888999999 Q ss_pred CcCHHHHHHHhhcCCcCCcchhHHh Q lcl|NC_018285. 343 TLAQNQGLYILQQAEILPKELPKGE 367 (383) Q Consensus 343 ~~t~nE~r~~lg~~~~~~~d~~~~~ 367 (383) +++ ||+|+.++++|++++|..... T Consensus 361 ~~~-ne~r~~~~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 361 ILP-KDLPEGETDSTLKGGETNEQY 384 (384) T ss_pred CCC-hhHHHHcCCCCCCCCCCCCCC Confidence 987 999999999999987754332 No 213 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=96.28 E-value=0.0008 Score=37.58 Aligned_cols=374 Identities=8% Similarity=-0.026 Sum_probs=155.6 Q ss_pred Cchhhh----hhcCCcccccc--------ccccc--chhhcccccCCcee-chhhhhccHHHHHHHHHHHHhhhhCc--e Q lcl|NC_018285. 1 MPIFNL----ATESPPNNQGG--------FFDIT--DPEFLATLNGSEWV-SAETALKNSDLFSIISQLSNDLATAK--L 63 (383) Q Consensus 1 Mglf~~----~~~~~~~~~~~--------~~~~~--~~~~~~~~~~~~~~-~~~~a~~~~~v~~~i~~ia~~ia~~p--~ 63 (383) |.+|.. +...+....+. |..+. .+.+.... .+... 
..+.-++......+.+.+|+-+..-+ + T Consensus 16 ~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~-~~~~~~~~~~~~sl~~~~~i~~~~A~Ll~~e~~~i 94 (517) T protein:vir:98 16 YALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYI-NSQGKIQERDYMTLNLRKLSADVLSGLVFNEQCEV 94 (517) T ss_pred HHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccc-cccccccccceeecCcHHHHHHHhhhhhcCCcceE Confidence 222211 11111100000 00000 00000000 00000 00011111222233444555443323 3 Q ss_pred eeecchh-----------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCC-- Q lcl|NC_018285. 64 TTSRKQM-----------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDN-- 130 (383) Q Consensus 64 ~~~~~~~-----------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~-- 130 (383) .+.+.+. ...+.+-=.........+..+.+.+..|.+++.+..+. |.+ .+.+++++.+-+...+. T Consensus 95 ~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~~-~I~~v~ad~~~Pl~~~~~~ 172 (517) T protein:vir:98 95 YVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVDN-GEI-EFSWALANAFYPLRSNSNG 172 (517) T ss_pred EecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEeC-Cee-EEEEEcCCeeEEEEecCCC Confidence 3332111 11111111111233444456677777899888887764 332 35556655543321111 Q ss_pred ---------------CceeEE------------------EEee-----c-Ccccccc-------------eeecc---cc Q lcl|NC_018285. 131 ---------------QNGLYY------------------NVTF-----D-DPRIPPK-------------QHVPQ---SD 155 (383) Q Consensus 131 ---------------~~~~~y------------------~~~~-----~-~~~~~~~-------------~~~~~---~d 155 (383) ....+| .+.. . ....|.. ..++. .- T Consensus 173 v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~l~~~~~~~g~~~Pl 252 (517) T protein:vir:98 173 ISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEELYEGMQEKTYIQGLSRPL 252 (517) T ss_pred eEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccccccccccccccCCCcceeECCCCcce Confidence 111111 1100 0 0000110 01111 11 Q ss_pred eEEeccCCCC----ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCC---HHHHHH-HHHHHHHhh Q lcl|NC_018285. 
156 ILHFRLLSVD----GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGL---LDFKTK-VSRSRQAMK 227 (383) Q Consensus 156 vih~~~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~---~e~~~~-~~~~~~~~~ 227 (383) +.|++.+-++ +...|.|.+..+...+...+........-|+-|.. +.++ +...- ++.... ....|. .. T Consensus 253 f~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~-~i~v--p~~~l~~~~~~~g~~~~~~~d-~~ 328 (517) T protein:vir:98 253 FNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQR-TVFV--SDVMLRTVPDESGMPPPQVFD-PD 328 (517) T ss_pred EEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCc-ceec--ChhhhccccCCCCcccCCCCC-cc Confidence 2356554333 33569999999888888877766666666666443 3332 11110 000000 000000 00 Q ss_pred cCCcceeec-CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcC-HHHHHH--HHHHHHHHHHHHH Q lcl|NC_018285. 228 QMQGGPLVL-DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQS-SLEMSS--NVYSKAVARYLRP 303 (383) Q Consensus 228 ~~~g~~~vl-~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~-~~e~~~--~~~~~~l~P~~~~ 303 (383) ...-..+-. +++-.++.++....+-++.+..+...++|+...|+++..+|....+.. ..|... .-.-.++.-+.+. T Consensus 329 ~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi~s~~~~~~~t~~~~~~~ 408 (517) T protein:vir:98 329 VNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATEIVSENDLTYRTRNDHVYE 408 (517) T ss_pred cceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 000000001 122346777777777889999999999999999999999996443332 222211 1111233333333 Q ss_pred HHHHHHHh------------hcc-------hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcch- Q lcl|NC_018285. 304 FLSELSQK------------LSC-------DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKEL- 363 (383) Q Consensus 304 i~~~l~~~------------l~~-------~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~- 363 (383) ++..|..- ++. 
++.++....+-.|....+....+++.+|+|++-+++.++ .+++..++ T Consensus 409 ~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~ms~~~~i~~~--~g~~eeeA~ 486 (517) T protein:vir:98 409 VEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKTFGFIPTVEAIQRI--FKVPKKTAE 486 (517) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHh--CCCChHHHH Confidence 33333221 111 112223334456677788888899999999999998775 23444432 Q ss_pred ---hHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 364 ---PKGENPNRTILKGGETNGQD 383 (383) Q Consensus 364 ---~~~~~~~~~~~~ggd~~~~d 383 (383) ++.+.-+....+.++..+++ T Consensus 487 ~e~~~i~~E~~~~~~~~~~~~~~ 509 (517) T protein:vir:98 487 QWLEEIRKDQIELDPVTISQRAQ 509 (517) T ss_pred HHHHHHHHhccccCCCCcccccc Confidence 33222111111111111111 No 214 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=96.25 E-value=0.00083 Score=37.48 Aligned_cols=364 Identities=11% Similarity=0.027 Sum_probs=143.5 Q ss_pred CchhhhhhcCCc--ccccccccccchhhc---ccccCCceec--hhhhhccHHHHHHHHHHHHhhhhCceeeecchh--h Q lcl|NC_018285. 1 MPIFNLATESPP--NNQGGFFDITDPEFL---ATLNGSEWVS--AETALKNSDLFSIISQLSNDLATAKLTTSRKQM--Q 71 (383) Q Consensus 1 Mglf~~~~~~~~--~~~~~~~~~~~~~~~---~~~~~~~~~~--~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~--~ 71 (383) -.+.+....+.. .....+......... -......... +..=+.++-....|+..+.-+-+-|+++.-.+. . T Consensus 32 ~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~~~~ 111 (478) T protein:vir:10 32 LRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKAL 111 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcccccccccccceeccchHHHHHHHHhhhhcccCceeecCChHHH Confidence 111111100000 000000000000000 0000000000 000011233444566666555556666532221 1 Q ss_pred -hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCC-CceeEE---EEeecCcccc Q lcl|NC_018285. 
72 -GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDN-QNGLYY---NVTFDDPRIP 146 (383) Q Consensus 72 -~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~-~~~~~y---~~~~~~~~~~ 146 (383) .|...-+ .........+..++..+|.+|+.+-.+.+|++ .+..++|..+.+..++. .+.+.+ .+...+. . T Consensus 112 ~~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~--~ 186 (478) T protein:vir:10 112 KQIQHTLN--HKWDDKLVDILTAASNKGIEWVQPYVDEEGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYELDGA--E 186 (478) T ss_pred HHHHHHHh--ccHHHHHHHHHHHHhhCCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeeeCc--e Confidence 1111111 13455556677899999999999988888876 57778888877665432 221111 1111110 0 Q ss_pred cceeecccceEEeccC-------------------------CC---------CccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 147 PKQHVPQSDILHFRLL-------------------------SV---------DGGLTSVSPLMALGRELDIQKASDKLTL 192 (383) Q Consensus 147 ~~~~~~~~dvih~~~~-------------------------~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 192 (383) ....+..+.|.+++.. ++ .....|.|-+..+...++....+..... T Consensus 187 ~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S~~~ 266 (478) T protein:vir:10 187 RVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQ 266 (478) T ss_pred EEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHH Confidence 0111222222222110 00 0112467777766666666555544444 Q ss_pred HHHhccCCcceeEeecCCCC-HHHHHHHHHHHHHhhcCCcceeec--CCCceeeecccChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 193 NSLKNALNANGILKIKGGGL-LDFKTKVSRSRQAMKQMQGGPLVL--DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVY 269 (383) Q Consensus 193 ~~~~ng~~~~~i~~~~~~~~-~e~~~~~~~~~~~~~~~~g~~~vl--~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~ 269 (383) +.+...+.|-.+++.-.... .+....++. 
.+++.+ +.|.++..+..+.....+.+..+...+.|+..- T Consensus 267 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s 337 (478) T protein:vir:10 267 NTFDESVELIYILKGYEGEDMKDFMHNLKY---------YKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFG 337 (478) T ss_pred HHHHHhhCcceeeecCCcccccchhhhhhh---------CceeEecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 44455555655554322111 111111111 223333 233344444444455666777888888999988 Q ss_pred cCCHHHhcccccCcCHHHHHHH--------------HHHHHHHHHHHHHHHHHHHhhc-chhhccchhhhccCHHHHHHH Q lcl|NC_018285. 270 GIPENVVGGQGDQQSSLEMSSN--------------VYSKAVARYLRPFLSELSQKLS-CDVDADIFPAVDPTGANYISR 334 (383) Q Consensus 270 gVpp~~lg~~~~~~~~~e~~~~--------------~~~~~l~P~~~~i~~~l~~~l~-~~~e~~~~~~~~~~~~~~~~~ 334 (383) ++|..-.+..+.+.+ ..+.+. .+...+.-.++-+...++...- .++++......-.+..+.+.. T Consensus 338 ~~p~~~~~~~~~n~S-g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~d~~~i~i~f~~~~p~~~~e~~~~ 416 (478) T protein:vir:10 338 QGVDFQQDKFGNSPS-GIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVRVQDIEITFNFNVMVNELENSQI 416 (478) T ss_pred CCcCcCccccccchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEeCCCCCCCHHHHHHH Confidence 988543322111122 222211 1111222222222111110000 011111112222344555555 Q ss_pred HHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHh------CCCCCCCCCCCC---------CCCC Q lcl|NC_018285. 335 INSMVKSGTLAQNQGLYILQQAEILP--KELPKGE------NPNRTILKGGET---------NGQD 383 (383) Q Consensus 335 ~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~------~~~~~~~~ggd~---------~~~d 383 (383) +.++ +|+++.-.+.+.++. ++. 
.|+.+.+ ........+|++ ++.| T Consensus 417 ~~~~--~g~iS~et~i~~~~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~d~~~e 478 (478) T protein:vir:10 417 AMNS--TGLLSKETILGNHSW--VQDPVAEMERIEQENIELNQQLPDIEEGLNDEQQRQSEDNQSE 478 (478) T ss_pred HHHH--hCCCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhccccCCCCcccccccCcCCCCC Confidence 5554 688998888877642 221 1221110 011112222222 2222 No 215 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=95.53 E-value=0.0019 Score=35.51 Aligned_cols=369 Identities=9% Similarity=0.032 Sum_probs=146.3 Q ss_pred CchhhhhhcCCcc--cccccccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchh---hhhcc Q lcl|NC_018285. 1 MPIFNLATESPPN--NQGGFFDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQM---QGIVD 75 (383) Q Consensus 1 Mglf~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~---~~l~~ 75 (383) ..+.+....+... ....+..................+.+. .+.-....|+..+.-+-+-|+.+.-.+. ..|.. T Consensus 22 ~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki--~~n~~~~Iv~~~~~~l~g~p~~~~~~~~~~~~~l~~ 99 (499) T protein:vir:10 22 NYAIRELQNRKKRLDKLSDYYNGKQEIEKHEFDNATVEAANV--MVNHAKYITDMNVGFMTGNPVKYVAEKGKNIDDILE 99 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccchhcCCcCcCCCCccee--ecchHHHHHHHHhhhhcccCceeecCChhHHHHHHH Confidence 1111211111000 000000000000000000000001111 1222333455555544455655432221 11222 Q ss_pred CCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCce----------------eEEEEeccceeEEEEcCCCce-----e Q lcl|NC_018285. 76 NPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRD----------------MKWEYLRPSQVSFNRLDNQNG-----L 134 (383) Q Consensus 76 ~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~----------------~~l~~l~~~~v~~~~~~~~~~-----~ 134 (383) - ........+...+..+...+|.||.++..+.+|.+ ..+..++|..+-+..++.... 
+ T Consensus 100 ~-~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i 178 (499) T protein:vir:10 100 V-FNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAV 178 (499) T ss_pred H-HhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEEcccceEEEecCCCCcceEEEE Confidence 1 12224445677788899999999999988887743 345667777665555433221 1 Q ss_pred EEEEeecC--cccc-cceeecccceEEecc----------------CCC---------CccccCcchHHHHHHHHHHHHH Q lcl|NC_018285. 135 YYNVTFDD--PRIP-PKQHVPQSDILHFRL----------------LSV---------DGGLTSVSPLMALGRELDIQKA 186 (383) Q Consensus 135 ~y~~~~~~--~~~~-~~~~~~~~dvih~~~----------------~~~---------~~~~~G~s~~~~~~~~i~~~~~ 186 (383) +|....+. .... ....+.++.+.+++. +++ .+...|.|-+..+...++.... T Consensus 179 ~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~d~e~v~~liD~~~~ 258 (499) T protein:vir:10 179 FTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGAVPIIEFRNNEERQGDFEQLISLIDAYNL 258 (499) T ss_pred EEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCccceEEecCCCCCCCchHhHHHHHHHHHH Confidence 11111110 0000 001122222222210 000 0112366767666666666555 Q ss_pred HHHHHHHHHhccCCcceeEeecCCCC-HHHHHHHHHHHHHhhcCCcceee--cCCCceeeecccChhhHHHHHHHHHHHH Q lcl|NC_018285. 187 SDKLTLNSLKNALNANGILKIKGGGL-LDFKTKVSRSRQAMKQMQGGPLV--LDDLEDFTPLEIKSNVAQLLKQADWTTG 263 (383) Q Consensus 187 ~~~~~~~~~~ng~~~~~i~~~~~~~~-~e~~~~~~~~~~~~~~~~g~~~v--l~~g~~~~~~~~~~~d~~~~e~~~~~~~ 263 (383) +.....+.+...+.|-.+++...... .+....+ ..+++.. 
.++|.+++.+........+.+..+...+ T Consensus 259 ~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~---------~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~ 329 (499) T protein:vir:10 259 LQTDRISDKEAFVDALLVTFGFGLGDDKDDIQRL---------KRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIEN 329 (499) T ss_pred HHHHHHHHHHHhcCceeeeecCccccccchhhhh---------hhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHH Confidence 55555555556666776665422111 1111111 1122322 2455555555554444556666677777 Q ss_pred HHHHHhcCCH---HHhcccccCcCH----------HHHHHHHHHHHHHHHHHHHHHHHHHhhc----chhhccchhhhcc Q lcl|NC_018285. 264 QFAKVYGIPE---NVVGGQGDQQSS----------LEMSSNVYSKAVARYLRPFLSELSQKLS----CDVDADIFPAVDP 326 (383) Q Consensus 264 ~Ia~~~gVpp---~~lg~~~~~~~~----------~e~~~~~~~~~l~P~~~~i~~~l~~~l~----~~~e~~~~~~~~~ 326 (383) .|...-++|. ..+++..++-.. ....+..+...+.-.++.+...++..-. ..+++.....+-. T Consensus 330 ~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~i~f~~~~p~ 409 (499) T protein:vir:10 330 DIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGANDDASGCKISLVANIPS 409 (499) T ss_pred HHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCC Confidence 8877777773 222221111100 0111122222333333333322221100 0111111222234 Q ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhC-------------CCCCCCCCCCCCCCC Q lcl|NC_018285. 327 TGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGEN-------------PNRTILKGGETNGQD 383 (383) Q Consensus 327 ~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~-------------~~~~~~~ggd~~~~d 383 (383) +..+.+..+.++ +|+++.-.++++++.-.-+..|+.+++. 
.+..+..+++.+.++ T Consensus 410 n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (499) T protein:vir:10 410 NLSDVVNNVKNA--DGIIPRKYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQD 477 (499) T ss_pred CHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCc Confidence 556667777776 6889998888876432111122222110 111111222221111 No 216 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=95.29 E-value=0.0024 Score=35.00 Aligned_cols=364 Identities=10% Similarity=0.024 Sum_probs=142.9 Q ss_pred Cch------hhhh----hcCCc---------ccccccccc---cchhh---cccccCCceechhhhhccHHHHHHHHHHH Q lcl|NC_018285. 1 MPI------FNLA----TESPP---------NNQGGFFDI---TDPEF---LATLNGSEWVSAETALKNSDLFSIISQLS 55 (383) Q Consensus 1 Mgl------f~~~----~~~~~---------~~~~~~~~~---~~~~~---~~~~~~~~~~~~~~a~~~~~v~~~i~~ia 55 (383) |.+ ...+ .++.. ...+..... ..... ...........+..=+.++-....|+..+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 110 0000 00000 000000000 00000 00000000000000011222333444444 Q ss_pred HhhhhCceeeecch--hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecC-CCceeEEEEeccceeEEEEcCCCc Q lcl|NC_018285. 56 NDLATAKLTTSRKQ--MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRND-NGRDMKWEYLRPSQVSFNRLDNQN 132 (383) Q Consensus 56 ~~ia~~p~~~~~~~--~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~-~g~~~~l~~l~~~~v~~~~~~~~~ 132 (383) .-+-+-|+++.-.+ ....+..-.. .........+...+..+|.||..+.++. +|++ .+..++|..+-+..++... 
T Consensus 81 ~yl~G~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~-~~~~~~p~~~~~i~d~~~~ 158 (471) T protein:vir:10 81 AYALTYPPTFDVDDKKVNDMIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDASDNSF-RYACVDSKEVIPIYSKSLD 158 (471) T ss_pred hhhcccCceeccCChHHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCee-EEEEEcccceEEEEcCCCC Confidence 44445555543222 1111111000 1234445667788999999999998875 5654 5777888888776654321 Q ss_pred --e---eEEEEeec--Ccccc-cceeecccceEEeccCC-------------------------------C--------- Q lcl|NC_018285. 133 --G---LYYNVTFD--DPRIP-PKQHVPQSDILHFRLLS-------------------------------V--------- 164 (383) Q Consensus 133 --~---~~y~~~~~--~~~~~-~~~~~~~~dvih~~~~~-------------------------------~--------- 164 (383) . +.|..... +.... ....+....+.|++... + T Consensus 159 ~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~ 238 (471) T protein:vir:10 159 KKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPF 238 (471) T ss_pred CceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCceeEEEe Confidence 1 11111110 00100 01112233333332100 0 Q ss_pred CccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC-CCHHHHHHHHHHHHHhhcCCcceeec------- Q lcl|NC_018285. 165 DGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG-GLLDFKTKVSRSRQAMKQMQGGPLVL------- 236 (383) Q Consensus 165 ~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~-~~~e~~~~~~~~~~~~~~~~g~~~vl------- 236 (383) .+...|.|-+......++....+.-...+.+...+.|-.+++.... ..++....++. ++++.+ T Consensus 239 ~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~---------~~~i~~~~~~~~~ 309 (471) T protein:vir:10 239 KNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLEDLKR---------YKMIKMDNDGMGD 309 (471) T ss_pred ccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHHhhc---------CCeEEecCCCCcc Confidence 0112366767666666665555544455555555556555554322 12222221111 122222 Q ss_pred CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--------------HHHHHHHHHHHH Q lcl|NC_018285. 
237 DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--------------NVYSKAVARYLR 302 (383) Q Consensus 237 ~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~ 302 (383) +++++|.. .+.....+....+...+.|+..-++|..-....++ .+. .+.+ ..+...+.-.++ T Consensus 310 ~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn-~Sg-~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~ 385 (471) T protein:vir:10 310 QSGVTTIA--IDIPTEARNLILERTKKQIFISGQGVNPETDKLGN-SSG-VALKFLYSLLELKAGNMETQFRSGYATLVK 385 (471) T ss_pred CccceEEe--ecCChHHHHHHHHHHHHHHHHHhCCcCCCcccccC-ccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12344444 33333456667777788888888888543332221 111 1111 122222222222 Q ss_pred HHHHHHHHhhcchhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHh------CCCCCCC Q lcl|NC_018285. 303 PFLSELSQKLSCDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILP--KELPKGE------NPNRTIL 374 (383) Q Consensus 303 ~i~~~l~~~l~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~------~~~~~~~ 374 (383) .+...+...=..++++......-.+..+.+..+.++ +|+++.-.++++++. ++. .|+.+++ .-...+. T Consensus 386 li~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~p~--v~D~~~E~eri~~E~~~~~~~~~~~ 461 (471) T protein:vir:10 386 MILKHLGLSDKLKIKQTWTRNSINNDTEMAQVVSTL--ATITSRENVAKSNPI--VEDWQDELRLQKAEQEGRSEKLYDM 461 (471) T ss_pred HHHHHhccCCCceeEEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCC--CCCHHHHHHHHHHHHHHHHhccccc Confidence 222211110001122222223334556666666665 589999888887643 322 2222211 1111223 Q ss_pred CC-CCCCCCC Q lcl|NC_018285. 
375 KG-GETNGQD 383 (383) Q Consensus 375 ~g-gd~~~~d 383 (383) .| +++++-| T Consensus 462 ~~~~~~~e~~ 471 (471) T protein:vir:10 462 EEVEHESEVE 471 (471) T ss_pred CCCCCccccC Confidence 33 2333333 No 217 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=94.34 E-value=0.0047 Score=33.36 Aligned_cols=357 Identities=11% Similarity=0.007 Sum_probs=141.4 Q ss_pred CchhhhhhcCCcc--cccccccccchhhcc---cccCCc--eechhhhhccHHHHHHHHHHHHhhhhCceeeecc--hhh Q lcl|NC_018285. 1 MPIFNLATESPPN--NQGGFFDITDPEFLA---TLNGSE--WVSAETALKNSDLFSIISQLSNDLATAKLTTSRK--QMQ 71 (383) Q Consensus 1 Mglf~~~~~~~~~--~~~~~~~~~~~~~~~---~~~~~~--~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~--~~~ 71 (383) -.+.+....+... ....+.......+.. ...... ...+..=+.++-....|+..+.-+-+-|+++.-. ... T Consensus 32 ~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~ 111 (468) T protein:vir:96 32 LRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRMYTNYHQNLVDQKVAYAVANPVTYGTEDEKSL 111 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhccCCceeccCChHHH Confidence 1111111000000 000000000000000 000000 0000000112223334444444444455554322 111 Q ss_pred h-hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCC-CceeEE---EEeecCcccc Q lcl|NC_018285. 72 G-IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDN-QNGLYY---NVTFDDPRIP 146 (383) Q Consensus 72 ~-l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~-~~~~~y---~~~~~~~~~~ 146 (383) . |...-+ .+.......+..++..+|.+|+.+..+.+|.+ .+..++|..+-+..++. .+.+.+ .+..+... 
T Consensus 112 ~~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~~~-- 186 (468) T protein:vir:96 112 KTIQEVLN--HKWDDKLVDILTAASNKGVEWIQPYVDEQGEF-KTFRVPAEQAIPIWTNKERDELKAFIRLYELDGGE-- 186 (468) T ss_pred HHHHHHHh--cCHHHHHHHHHHHHhhcCeEEEEEEEcCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCce-- Confidence 1 111111 13345556677899999999999988888865 46677888776665432 111111 11111100 Q ss_pred cceeecccc---------------------------------------eEEeccCCCCccccCcchHHHHHHHHHHHHHH Q lcl|NC_018285. 147 PKQHVPQSD---------------------------------------ILHFRLLSVDGGLTSVSPLMALGRELDIQKAS 187 (383) Q Consensus 147 ~~~~~~~~d---------------------------------------vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~ 187 (383) ....+.... |+++++ ...|.|-+..+...++....+ T Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n-----~~~g~sd~e~v~~liDa~d~~ 261 (468) T protein:vir:96 187 RVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKN-----NPQEVSDLFMYKTIIDAMDKR 261 (468) T ss_pred EEEEEeCCeEEEEEEcCCceeecccccccccccceeeccccccCCcccEEEecC-----CCCCCCchHHHHHHHHHHHHH Confidence 011111222 222221 134677766666666665555 Q ss_pred HHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecC--CCceeeecccChhhHHHHHHHHHHHHHH Q lcl|NC_018285. 188 DKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLD--DLEDFTPLEIKSNVAQLLKQADWTTGQF 265 (383) Q Consensus 188 ~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~--~g~~~~~~~~~~~d~~~~e~~~~~~~~I 265 (383) .....+.++..+.|-.+++.-...+. +.+.... . .++++.++ ++.+.+.++.......+.+..+...+.| T Consensus 262 ~S~~~~~~~~~~~p~lv~~g~~~~~~---~~~~~~~----~-~~~~i~~~~d~~~~~~~l~~~~~~~~~~~~~~~l~~~I 333 (468) T protein:vir:96 262 LSDTQNTFDEATELIYVLKGYEGEDL---EEFMYNL----K-YYKAINVDGDGSGGVDTIQIDVPVQSAKEYLDMLRDYV 333 (468) T ss_pred HHHHHHHHHHhcCceeeeecCCcccc---chhhhhh----h-cCceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHH Confidence 55555555666667666553221111 1111111 1 13344332 3333443444444455666777888888 Q ss_pred HHHhcCCHHHhcccccCcCHHHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcchhhc-cchhh-hccCHH Q lcl|NC_018285. 
266 AKVYGIPENVVGGQGDQQSSLEMSSN--------------VYSKAVARYLRPFLSELSQKLSCDVDA-DIFPA-VDPTGA 329 (383) Q Consensus 266 a~~~gVpp~~lg~~~~~~~~~e~~~~--------------~~~~~l~P~~~~i~~~l~~~l~~~~e~-~~~~~-~~~~~~ 329 (383) +..-++|..-....+.+. +..+.+. .+...++-.++-|... +....+. ++... -+..+. T Consensus 334 ~~~s~~p~~~~~~~~~n~-Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~----~g~~~d~~~i~i~f~~~~p~ 408 (468) T protein:vir:96 334 IEFGQGVDFQQDKFGNSP-SGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDF----YKLSIKVQDVEITFNFNVMV 408 (468) T ss_pred HHHhCcccccccccccch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hCCCcccceeeEEecCCCCc Confidence 888888854322212122 2222211 1122222222222111 1111111 11100 111122 Q ss_pred HHHHHHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHHh-------CCCCCCCCCCCCCCC Q lcl|NC_018285. 330 NYISRINSMVKSGTLAQNQGLYILQQAEILP--KELPKGE-------NPNRTILKGGETNGQ 382 (383) Q Consensus 330 ~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~~-------~~~~~~~~ggd~~~~ 382 (383) ...+.++.+.+.|+++.-.+++.++ .++. .|+.+.+ ........++++++. T Consensus 409 d~~e~a~~~~~~g~iS~et~i~~l~--~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 468 (468) T protein:vir:96 409 NELEQSQIGVNSQYLSKETVVTNHP--WVDDPVAEMERIDQEELALPSIEEGLNGKENNEPT 468 (468) T ss_pred CHHHHHHHHHhcCCCchHHHHHhCC--CCCCHHHHHHHHHHHHHHHHHHhhccCCCCCCCCC Confidence 2344555667789999988888763 3322 2222211 111111222333333 No 218 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=94.18 E-value=0.0052 Score=33.12 Aligned_cols=336 Identities=13% Similarity=0.120 Sum_probs=158.0 Q ss_pred CchhhhhhcCCcccccccccccc---hhh---cccccCCceechh--hhhccHHHHHHHHHHHHhhhhC--c----e--- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITD---PEF---LATLNGSEWVSAE--TALKNSDLFSIISQLSNDLATA--K----L--- 63 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~---~~~---~~~~~~~~~~~~~--~a~~~~~v~~~i~~ia~~ia~~--p----~--- 63 (383) =+-|+.++..|..+...|..+.+ |.. +....+....+.. .-+=.++-..|++.+|+.+-+. 
| | T Consensus 7 ~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~~~WF~l 86 (547) T protein:vir:10 7 VKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPATKWFEL 86 (547) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCCCccccc Confidence 44555566665544444433221 111 1111111111110 0012345555666666555431 2 2 Q ss_pred eeecch------hh-----------hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-CceeEEEEeccceeEE Q lcl|NC_018285. 64 TTSRKQ------MQ-----------GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-GRDMKWEYLRPSQVSF 125 (383) Q Consensus 64 ~~~~~~------~~-----------~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-g~~~~l~~l~~~~v~~ 125 (383) .+.+.. .. ..+.+-| .+.-+..++.+++.+|+|.+++..+.+ ...+.+..++..++-+ T Consensus 87 ~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~sn----f~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl~~~~v 162 (547) T protein:vir:10 87 AFRDKELNSDDECRKWLENATHDVYSALQDSN----FNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPIQDSYF 162 (547) T ss_pred ccCCccccchHHHHHHHHHHHHHHHHHHHhcC----cHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEeecceEEE Confidence 111110 00 1122223 333355667899999999999876542 2344455555555555 Q ss_pred EEcCCCcee-EEE-E----------------------------------------ee--cCcccc--------------c Q lcl|NC_018285. 126 NRLDNQNGL-YYN-V----------------------------------------TF--DDPRIP--------------P 147 (383) Q Consensus 126 ~~~~~~~~~-~y~-~----------------------------------------~~--~~~~~~--------------~ 147 (383) ..+..+... .|+ + .. .++... . T Consensus 163 ~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~p~~ 242 (547) T protein:vir:10 163 EEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTVLAPTERPFG 242 (547) T ss_pred eeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccceeecccccee Confidence 444433211 000 0 00 000000 0 Q ss_pred ceeec--------------ccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCH Q lcl|NC_018285. 
148 KQHVP--------------QSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLL 213 (383) Q Consensus 148 ~~~~~--------------~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~ 213 (383) .+.+. ....+..|....++..||.||...+...+...+.+.+.......-...|.+++..++...+ T Consensus 243 s~~~e~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~ 322 (547) T protein:vir:10 243 KKWILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVTERGLISD 322 (547) T ss_pred EEEEEecCceeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc Confidence 00111 1123444433446778999999999999999999999988888888888877665544332 Q ss_pred HHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--H Q lcl|NC_018285. 214 DFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--N 291 (383) Q Consensus 214 e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~ 291 (383) + ....|++++.+..-.++++...++-....+..+.....|-.+|-+....+-... .-+.+|... . T Consensus 323 -----~-------~~~pgg~~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~-~~TAtEV~~r~~ 389 (547) T protein:vir:10 323 -----I-------DLGASGLTVVRDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQMKDSP-AMTATEVQVRYE 389 (547) T ss_pred -----c-------eecCCeeeecCCcccceeeecccchHHHHHHHHHHHHHHHHHhhhhhhhcCCCc-cccHHHHHHHHH Confidence 0 123566777666667777765543323356667788889999988765443222 223334332 3 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcchhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhH------ Q lcl|NC_018285. 292 VYSKAVARYLRPFLSELSQKLSCDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPK------ 365 (383) Q Consensus 292 ~~~~~l~P~~~~i~~~l~~~l~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~------ 365 (383) =....|-|....+.++|-.-|+.. .+.-|.+.|.+ ++ +|.++-. 
T Consensus 390 E~~~~LG~v~~rl~~E~l~Pli~r------------------~~~il~r~g~l-----------P~-~p~~l~~~~~~~~ 439 (547) T protein:vir:10 390 LMQRLLGPTLGRLENDFLSPMIQR------------------TFNIRFRAGKL-----------GE-LPSKLLESGKAAM 439 (547) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHH------------------HHHHHHhcCCC-----------CC-CchhhhccCcceE Confidence 345677788887777754333211 11122233332 11 1111100 Q ss_pred -------------------HhC-CCC-CCCCCCCCCCCC Q lcl|NC_018285. 366 -------------------GEN-PNR-TILKGGETNGQD 383 (383) Q Consensus 366 -------------------~~~-~~~-~~~~ggd~~~~d 383 (383) +.. ... ..+.+-+-+=-| T Consensus 440 ~v~~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld 478 (547) T protein:vir:10 440 DIVYTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLD 478 (547) T ss_pred EEEeccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhh Confidence 000 000 001111000011 No 219 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=92.39 E-value=0.012 Score=31.20 Aligned_cols=354 Identities=11% Similarity=0.048 Sum_probs=148.4 Q ss_pred CchhhhhhcCCcccccccccc---cchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhh-C-c---e---eeecch Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDI---TDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLAT-A-K---L---TTSRKQ 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~-~-p---~---~~~~~~ 69 (383) =+.|+.++.+|..+...|..+ +.|..+........-...+.+ .++-..|++.+|+.+-+ + | | .+.+.. T Consensus 13 ~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~-dst~~~a~~~Las~l~~~ltP~~~WFrl~~~d~~ 91 (522) T protein:vir:94 13 KAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPW-QAVGARCLNNLAAKLMLALFPQSPWMRLTVSEYE 91 (522) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccc-cccHHHHHHHHHHHHHhhcCCCCcccccccchhh Confidence 333666666554443333322 223222211111110111223 33444566666655543 2 2 1 111100 Q ss_pred hhhhccCCC-----------------c---cCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcC Q lcl|NC_018285. 
70 MQGIVDNPS-----------------N---SANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLD 129 (383) Q Consensus 70 ~~~l~~~PN-----------------~---~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~ 129 (383) ...+...+. . .-+.+.-+..+..+++.+|||.+++..+..|.+..+..++-.++-+..+. T Consensus 92 ~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~~~pl~~y~v~~d~ 171 (522) T protein:vir:94 92 AKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGTYSPMRMYRLVSYVVQRDA 171 (522) T ss_pred hhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCceeeEEEEEcceEEEeeCC Confidence 000000000 0 01233334566788899999999988776666554443333444444443 Q ss_pred CCce--eEEEEee--------------------------------cCccccc-------c-------eeecccceEEecc Q lcl|NC_018285. 130 NQNG--LYYNVTF--------------------------------DDPRIPP-------K-------QHVPQSDILHFRL 161 (383) Q Consensus 130 ~~~~--~~y~~~~--------------------------------~~~~~~~-------~-------~~~~~~dvih~~~ 161 (383) .|.. ++.++.. ....... . ..|...-.+..|. T Consensus 172 ~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw 251 (522) T protein:vir:94 172 FGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDEYLRYEEVEGIEVTGTDGSYPLTACPYIPVRM 251 (522) T ss_pred CcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCceeEEeeccCceecccCCCCccccCCceeeee Confidence 3321 0101000 0000000 0 0111222344443 Q ss_pred CCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecC--CC Q lcl|NC_018285. 162 LSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLD--DL 239 (383) Q Consensus 162 ~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~--~g 239 (383) ...++..||.||...+...+...+.+.+.......-...|..++..++........ 
.+..+.++.+ ++ T Consensus 252 ~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~----------~~~~g~~v~g~~~~ 321 (522) T protein:vir:94 252 VRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLN----------KAATGEFVAGRVED 321 (522) T ss_pred eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchhee----------ccCCceeecCCccc Confidence 34456789999999999999999999999999999999999888766555443221 1111222222 23 Q ss_pred ceeeecccChhhHHH-HHHHHHHHHHHHHHhcCCHHHhc-ccccCcCHHHHHH--HHHHHHHHHHHHHHHHHHHHhhcch Q lcl|NC_018285. 240 EDFTPLEIKSNVAQL-LKQADWTTGQFAKVYGIPENVVG-GQGDQQSSLEMSS--NVYSKAVARYLRPFLSELSQKLSCD 315 (383) Q Consensus 240 ~~~~~~~~~~~d~~~-~e~~~~~~~~Ia~~~gVpp~~lg-~~~~~~~~~e~~~--~~~~~~l~P~~~~i~~~l~~~l~~~ 315 (383) +...++...+ +.+. .+..+.....|-.+|-+.. ++ ..+..-+.+|... .=....|-|.+..+.++|=.-|+.. T Consensus 322 v~~~~~~~~~-~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r 398 (522) T protein:vir:94 322 INFLQLTKGQ-DFTIAKSVADAIEQRLGWAFLLNS--AVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRV 398 (522) T ss_pred ceeeeccccc-chhHHHHHHHHHHHHHHHHHhhhh--hccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH Confidence 4444443322 3332 4556677788999997662 22 1112223333322 2334556666666666654333211 Q ss_pred -h----hccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh-hcCCcCCcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 316 -V----DADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYIL-QQAEILPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 316 -~----e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~l-g~~~~~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) + +....+.+..+ +++-.+.|+-+...+. +...+.. -+-..-... |. 
-.+ .-=| T Consensus 399 ~~~il~r~g~lP~~p~~----------~v~v~~~s~La~~qr~~~~~~l~~-~~~~ia~l~--P~-~~~-~~id 457 (522) T protein:vir:94 399 LMNQLQSAGMIPDLPKE----------AVEPTVSTGLEALGRGQDLEKLTQ-AVNMMTGLQ--PL-SQD-PDIN 457 (522) T ss_pred HHHHHHhcCCCCCCCcc----------cEEeeEecHHHHHHHHHHHHHHHH-HHHHHHhcc--ch-hhh-hcCC Confidence 0 00110000000 0111111111110000 0000000 000000011 10 001 0013 No 220 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=90.56 E-value=0.02 Score=29.88 Aligned_cols=361 Identities=12% Similarity=0.077 Sum_probs=140.8 Q ss_pred CchhhhhhcCCccccccccccc---chhhcccccCCceechhhhhccHHHHHHHHHHHHhhhh-C-cee------eecch Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDIT---DPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLAT-A-KLT------TSRKQ 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~-~-p~~------~~~~~ 69 (383) =+.|+.++.+|..+...|..+. .|..+........-...+.+ .++-..|++.+|+.+-+ + |-. +.+.. T Consensus 14 ~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~~~ltP~~~WFrl~~~d~~ 92 (536) T protein:vir:21 14 KSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPW-QAVGARGLNNLASKLMLALFPMQTWMRLTISEYE 92 (536) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccc-cccHHHHHHHHHHHHHHhhcCCCcccccccChhh Confidence 4556667666544443343322 23222211111110111122 33444556656655443 2 311 11111 Q ss_pred hhh------------------------hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCce--eEEEEecccee Q lcl|NC_018285. 70 MQG------------------------IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRD--MKWEYLRPSQV 123 (383) Q Consensus 70 ~~~------------------------l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~--~~l~~l~~~~v 123 (383) ... 
.+.+-| .+.-+..+..+++.+|||.+++..+..+.+ ...||| .++ T Consensus 93 ~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~f~~~pl--~~~ 166 (536) T protein:vir:21 93 AKQLLSDPDGLAKVDEGLSMVERIIMNYIESNS----YRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRL--SSY 166 (536) T ss_pred hhccccchhhHHHHHHHHHHHHHHHHHHHHhcC----cHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEc--CeE Confidence 110 111222 333344667888899999999876554433 344554 333 Q ss_pred EEEEcCCCce-----------------------------------eEEEEeecCcccccc-----------------eee Q lcl|NC_018285. 124 SFNRLDNQNG-----------------------------------LYYNVTFDDPRIPPK-----------------QHV 151 (383) Q Consensus 124 ~~~~~~~~~~-----------------------------------~~y~~~~~~~~~~~~-----------------~~~ 151 (383) -+..+..|.. ..|........++.. ..| T Consensus 167 ~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~v~~~~g~~~f 246 (536) T protein:vir:21 167 VVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEEVEGMEVQGSDGTYPK 246 (536) T ss_pred EEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEeccCCeeeccccCcccc Confidence 3333332211 011111000000000 011 Q ss_pred cccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCc Q lcl|NC_018285. 152 PQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQG 231 (383) Q Consensus 152 ~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g 231 (383) ...-.+.+|....++..||.||...+...+...+.+.+.......-...|..++..++....+... ....| T Consensus 247 ~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~---------~~~~g 317 (536) T protein:vir:21 247 EACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT---------KAQTG 317 (536) T ss_pred ccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhc---------cCCCc Confidence 222334444444566789999999999999999999888887666666666666654443332111 11112 Q ss_pred ceee-cCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--HHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 
232 GPLV-LDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--NVYSKAVARYLRPFLSEL 308 (383) Q Consensus 232 ~~~v-l~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~~~~~~l~P~~~~i~~~l 308 (383) .++. .++++...++...++-.-..+..+.....|-.+|-+.. +....+..-+.+|... .=....|-|....+.++| T Consensus 318 ~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~El 396 (536) T protein:vir:21 318 DFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQEL 396 (536) T ss_pred ceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhh-cccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 2211 22333444444333222234556677888989886542 1112222223333322 223345556555555554 Q ss_pred HHhhcch-h----hccchhhhccCHH--HHHHHHHHHHhCCCcCHHHHHHHhh-cCCcCCcchhHHh-CCCC--CCCCCC Q lcl|NC_018285. 309 SQKLSCD-V----DADIFPAVDPTGA--NYISRINSMVKSGTLAQNQGLYILQ-QAEILPKELPKGE-NPNR--TILKGG 377 (383) Q Consensus 309 ~~~l~~~-~----e~~~~~~~~~~~~--~~~~~~~~l~~~g~~t~nE~r~~lg-~~~~~~~d~~~~~-~~~~--~~~~gg 377 (383) =.-|+.. + +.+..+.+..+.. ++..-+..+-++. ........++ ...+. +++ +. ..+. ....=. T Consensus 397 l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~--~~~~l~~~~~~la~~~-Pe~--ld~~id~d~~~~~~a 471 (536) T protein:vir:21 397 QLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQ--DLDKLERCVTAWAALA-PMR--DDPDINLAMIKLRIA 471 (536) T ss_pred HHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHH--HHHHHHHHHHHHHhhc-hhh--hcccCCHHHHHHHHH Confidence 3322211 0 0011000000000 0000011111100 0000000000 00010 010 00 0000 000000 Q ss_pred CCCCCC Q lcl|NC_018285. 
378 ETNGQD 383 (383) Q Consensus 378 d~~~~d 383 (383) +..|=| T Consensus 472 ~~~Gv~ 477 (536) T protein:vir:21 472 NAIGID 477 (536) T ss_pred HHcCCC Confidence 001111 No 221 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=90.56 E-value=0.02 Score=29.87 Aligned_cols=357 Identities=14% Similarity=0.122 Sum_probs=140.5 Q ss_pred CchhhhhhcCCcccccccccc---cchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhh-C-cee------eecch Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDI---TDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLAT-A-KLT------TSRKQ 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~-~-p~~------~~~~~ 69 (383) =+.|+.++.+|..+...|..+ +.|..+........-...+.+ .++-..|++.+|+.+-+ + |-. +.+.. T Consensus 14 ~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~~~ltP~~~WFrl~~~d~~ 92 (536) T protein:vir:10 14 KSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPW-QAVGARGLNNLASKLMLALFPMQTWMRLTISEYE 92 (536) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccc-cccHHHHHHHHHHHHHhhhcCCCcccccccChhh Confidence 455666666654444333332 223222211111110111122 33444566666655543 2 311 11111 Q ss_pred hhhhccCC-----------------Cc---cCCHHHHHHHHHHHHHHcCCeEEEEeecCCCce--eEEEEeccceeEEEE Q lcl|NC_018285. 70 MQGIVDNP-----------------SN---SANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRD--MKWEYLRPSQVSFNR 127 (383) Q Consensus 70 ~~~l~~~P-----------------N~---~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~--~~l~~l~~~~v~~~~ 127 (383) ...+...+ .. .-+.+.-+..+..+++.+|||.+++..+..+.+ ...||| .++-+.. 
T Consensus 93 ~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~~~~~pl--~~~~v~~ 170 (536) T protein:vir:10 93 AKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRL--SSYVVQR 170 (536) T ss_pred hhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEc--CeEEEee Confidence 11000000 00 112333344667888899999999876554433 344554 3333333 Q ss_pred cCCCcee-----------------------------------EEEEeecCccccc-----------------ceeecccc Q lcl|NC_018285. 128 LDNQNGL-----------------------------------YYNVTFDDPRIPP-----------------KQHVPQSD 155 (383) Q Consensus 128 ~~~~~~~-----------------------------------~y~~~~~~~~~~~-----------------~~~~~~~d 155 (383) +..|... .|....-....+. ...|...- T Consensus 171 d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~e~~g~~v~~~~g~~~f~~~P 250 (536) T protein:vir:10 171 DAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYEEVEGMEVQGSDGTYPKEACP 250 (536) T ss_pred CCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEEeecCccccccccccccccCC Confidence 3322110 1100000000000 00112223 Q ss_pred eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceee Q lcl|NC_018285. 156 ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLV 235 (383) Q Consensus 156 vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~v 235 (383) .+.+|....++..||.||...+...+...+.+.+.......-...|..++..++........ ....|.++. T Consensus 251 ~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~---------~~~~g~~v~ 321 (536) T protein:vir:10 251 YIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT---------KAQTGDFVT 321 (536) T ss_pred ceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhc---------cCCCcceec Confidence 34444444556789999999999999999999888887766666666666654443332111 111122211 Q ss_pred -cCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--HHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 
236 -LDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--NVYSKAVARYLRPFLSELSQKL 312 (383) Q Consensus 236 -l~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~~~~~~l~P~~~~i~~~l~~~l 312 (383) .++++...++...++-.-..+..+.....|-.+|-+.. +....+..-+.+|... .=....|-|....+.++|=.-| T Consensus 322 g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pl 400 (536) T protein:vir:10 322 GRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPL 400 (536) T ss_pred CCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhh-cccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH Confidence 22333444444333222234556677888989996552 1112222223333322 2233455555555555543322 Q ss_pred cch-h----hccchhhh-----ccC---H---HHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHh-CCCC--CC Q lcl|NC_018285. 313 SCD-V----DADIFPAV-----DPT---G---ANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGE-NPNR--TI 373 (383) Q Consensus 313 ~~~-~----e~~~~~~~-----~~~---~---~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~-~~~~--~~ 373 (383) +.. + +.+..+.+ +.+ . ..+...++++.+ +-..+ ..+. +++ +. ..+. .. T Consensus 401 i~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~--------~~~~l--a~~~-P~~--ld~~id~d~~~ 467 (536) T protein:vir:10 401 VRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLER--------CVTAW--AALA-PMR--DDPDINLAMIK 467 (536) T ss_pred HHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHH--------HHHHH--Hhhc-hhh--hcccCCHHHHH Confidence 211 0 00110000 000 0 011111111110 00000 0010 010 00 0000 00 Q ss_pred CCCCCCCCCC Q lcl|NC_018285. 
374 LKGGETNGQD 383 (383) Q Consensus 374 ~~ggd~~~~d 383 (383) ..=.+..|=| T Consensus 468 ~~~a~~~Gv~ 477 (536) T protein:vir:10 468 LRIANAIGID 477 (536) T ss_pred HHHHHHcCCC Confidence 0000001111 No 222 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=90.42 E-value=0.021 Score=29.79 Aligned_cols=355 Identities=11% Similarity=0.054 Sum_probs=146.9 Q ss_pred CchhhhhhcCCc-------------ccccccccccchhh-c---ccccCCceechhhhhccHHHHHHHHHHHHhhhhCce Q lcl|NC_018285. 1 MPIFNLATESPP-------------NNQGGFFDITDPEF-L---ATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKL 63 (383) Q Consensus 1 Mglf~~~~~~~~-------------~~~~~~~~~~~~~~-~---~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~ 63 (383) ..|.++...... .............. . .........+.+. .+.-...-|+..+.=+-+-|+ T Consensus 7 ~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki--~~n~~k~Iv~~~~~yl~G~p~ 84 (470) T protein:vir:10 7 KKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRI--PSNFYQLLVDQEAGYVASVFP 84 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCccc--ccchHHHHHHhhhhheeccce Confidence 111111110000 00000000000000 0 0000000001111 111222234444444445565 Q ss_pred eeecchh---hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC-c-e---eE Q lcl|NC_018285. 64 TTSRKQM---QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-N-G---LY 135 (383) Q Consensus 64 ~~~~~~~---~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~-~-~---~~ 135 (383) ++.-.+. ..+...-+. +...-...+..++..+|.||.++-++.+|++ .+..++|..+-+..++.. + . +. 
T Consensus 85 ~~~~~d~~~~~~l~~~~~~--~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~-~~~~~~p~~~~~v~d~~~~~~~~a~ir 161 (470) T protein:vir:10 85 DIDVGKDADNKKIIDVLGD--DRALTLNGLLVDSSNAGRAWLHYWIDEDGNF-RYGIIQPDQITPIYATTLDNKLLGILR 161 (470) T ss_pred eeecCchHHHHHHHHHHhh--hHHHHHHHHHHHHhhcCeeEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEE Confidence 5432221 112111111 2334445677888999999999999998876 467788888777765432 1 1 11 Q ss_pred EEEeecCccc---ccceeecccce---------------------------------------------EEeccCCCCcc Q lcl|NC_018285. 136 YNVTFDDPRI---PPKQHVPQSDI---------------------------------------------LHFRLLSVDGG 167 (383) Q Consensus 136 y~~~~~~~~~---~~~~~~~~~dv---------------------------------------------ih~~~~~~~~~ 167 (383) |....+.... .....+....+ +|+++. T Consensus 162 ~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn----- 236 (470) T protein:vir:10 162 SYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKN----- 236 (470) T ss_pred EEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeecC----- Confidence 1111110000 00111222222 333221 Q ss_pred ccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCC-HHHHHHHHHHHHHhhcCCcceeec-------CCC Q lcl|NC_018285. 168 LTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGL-LDFKTKVSRSRQAMKQMQGGPLVL-------DDL 239 (383) Q Consensus 168 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~-~e~~~~~~~~~~~~~~~~g~~~vl-------~~g 239 (383) -.|.|-+..+...++....+.-...+.+...+.|-.+++.-...+ ++....++ ..+.+.+ +++ T Consensus 237 ~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~~---------~~~~i~~~~~~~~~~~~ 307 (470) T protein:vir:10 237 KYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMNDLR---------KYKSIKINNTGNGDNSG 307 (470) T ss_pred CCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhhhh---------hcCeEeccCCCCCcCce Confidence 246777777777776666666555666666666766665432222 22111111 1122222 233 Q ss_pred ceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHH--------------HHHHHHHHHHHHHHHH Q lcl|NC_018285. 
240 EDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMS--------------SNVYSKAVARYLRPFL 305 (383) Q Consensus 240 ~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~--------------~~~~~~~l~P~~~~i~ 305 (383) ++|.....+ ...+....+...+.|+..-++|..-....+ +.+ ..+. +..+..+|.-.++.|. T Consensus 308 ~~~lt~~~~--~~~~~~~~~~L~~~I~~~s~~p~~~~~~~g-n~S-g~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~ 383 (470) T protein:vir:10 308 VDKLQIDIP--VEARDDALKITRKNIFLFGQGIDPANFESS-NAS-GVAIKMLYSHLELKAAKTQTYFEHAINELVRAIM 383 (470) T ss_pred eEEEeecCC--hHHHHHHHHHHHHHHHHHhCCCCCCccccc-cch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455443333 344566667778888888888753222211 111 1111 1233333333333333 Q ss_pred HHHHHhhc--chhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCC--cchhHH--------hCCCCC- Q lcl|NC_018285. 306 SELSQKLS--CDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILP--KELPKG--------ENPNRT- 372 (383) Q Consensus 306 ~~l~~~l~--~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~--~d~~~~--------~~~~~~- 372 (383) ..++..=. .++++.....+-.+..+.+..+.++ +|+++.-.+++.++. ++. .|+.++ ...+.. T Consensus 384 ~~l~~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~--~g~iS~et~l~~~p~--v~D~~~E~eri~~E~~e~~~~~~~~~ 459 (470) T protein:vir:10 384 RYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDLAKDKEENDPYSNQAD 459 (470) T ss_pred HHhcccCcccceeeEEeccCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhhcccc Confidence 32221100 0111222223334556666666665 689999888877643 221 122221 111111 Q ss_pred CC-CCCCCCCC Q lcl|NC_018285. 373 IL-KGGETNGQ 382 (383) Q Consensus 373 ~~-~ggd~~~~ 382 (383) .. 
++|+++++ T Consensus 460 ~~~~~~~dde~ 470 (470) T protein:vir:10 460 ELNGKGVNDEQ 470 (470) T ss_pred ccCCCCCCCCC Confidence 22 23444444 No 223 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=89.96 E-value=0.023 Score=29.52 Aligned_cols=368 Identities=12% Similarity=0.037 Sum_probs=142.9 Q ss_pred CchhhhhhcCC-c--ccccccccccchhhcccccCC-ceechhhhhccHHHHHHHHHHHHhhhhCceeeec--chhhhhc Q lcl|NC_018285. 1 MPIFNLATESP-P--NNQGGFFDITDPEFLATLNGS-EWVSAETALKNSDLFSIISQLSNDLATAKLTTSR--KQMQGIV 74 (383) Q Consensus 1 Mglf~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~-~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~--~~~~~l~ 74 (383) +.+.++...++ . .....+.........-..... .....+ +..+-..-.|+..+.-+-.-|+++.. ......+ T Consensus 21 ~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~k--i~~n~~~~iv~~~~~~l~g~~~~~~~~d~~~~~~l 98 (489) T protein:vir:99 21 KNYISRFKAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNR--IASDFAKYITVFEQGYMLGVPVEYKNENKDLQAAI 98 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccCCcce--eecchHHHHHHHHhhhhccCCceeecCChhHHHHH Confidence 33333222111 0 000000000000000000000 000011 12233344555555555555665432 2222222 Q ss_pred cCCCccCCHHHHHHHHHHHHHHcCCeEEEEee----cCCCceeEEEEeccceeEEEEcCCCc-eeE-----EEEeecCcc Q lcl|NC_018285. 75 DNPSNSANRFNFYQSIFAQMLLGGEAFAYRWR----NDNGRDMKWEYLRPSQVSFNRLDNQN-GLY-----YNVTFDDPR 144 (383) Q Consensus 75 ~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r----~~~g~~~~l~~l~~~~v~~~~~~~~~-~~~-----y~~~~~~~~ 144 (383) .+-........+...+..+++.+|.+|..+.. +.+|+ ..+..++|..+.+..++... ... |......+. 
T Consensus 99 ~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~-~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~ 177 (489) T protein:vir:99 99 DLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTE-VKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDYGSGK 177 (489) T ss_pred HHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcc-eEEEEEcccceEEEEcCCCCCceEEEEEEEEEecCCCc Confidence 22222224445567788889999999987764 34444 35777888887766553321 111 111100000 Q ss_pred cc-cceeecccc----------------------------eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 145 IP-PKQHVPQSD----------------------------ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSL 195 (383) Q Consensus 145 ~~-~~~~~~~~d----------------------------vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 195 (383) .. ....+.++. |+|+++. ..|.|.+..+...++....+.....+.. T Consensus 178 ~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~~~s~~~~v~~liDa~d~~~s~~~~~~ 252 (489) T protein:vir:99 178 RKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPVNEYANN-----EERTGAYESVLDNIDAYDLSQSELANFQ 252 (489) T ss_pred eEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeEEEeecC-----CCCCCchhhhHHHHHHHHHHHHHHHHHH Confidence 00 001111222 3343321 2366666555555554444433333333 Q ss_pred hccCCcceeEeecCCCCHHHHHHHHHHHHH--------hh-cCCcceeecCCC-------ceeeecccChhhHHHHHHHH Q lcl|NC_018285. 196 KNALNANGILKIKGGGLLDFKTKVSRSRQA--------MK-QMQGGPLVLDDL-------EDFTPLEIKSNVAQLLKQAD 259 (383) Q Consensus 196 ~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~--------~~-~~~g~~~vl~~g-------~~~~~~~~~~~d~~~~e~~~ 259 (383) .-.+.|-.+++.-.....+. ......+.. .. ...++++.++.+ .+...+.....+..+....+ T Consensus 253 ~~~~~~~l~i~g~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 331 (489) T protein:vir:99 253 QDSVNALLVIAGNAYTGADE-NDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKN 331 (489) T ss_pred HHhhhhhhhhccCCcccccc-hhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCChHHHHHHHH Confidence 33344444443321111111 111111110 00 112233333322 23333333333444556667 Q ss_pred HHHHHHHHHhcCCHHHhcccccCcCHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhhcc--------hhh Q lcl|NC_018285. 
260 WTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--------------NVYSKAVARYLRPFLSELSQKLSC--------DVD 317 (383) Q Consensus 260 ~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--------------~~~~~~l~P~~~~i~~~l~~~l~~--------~~e 317 (383) ...+.|+..-++|..-....+.+.+ ..+.+ ..+...+.-.++.+...+...-.. +++ T Consensus 332 ~l~~~i~~~s~~p~~~~~~~~~n~S-g~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~ 410 (489) T protein:vir:99 332 RLVADILRFTFTPDTQDMKFSGVQS-GESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTS 410 (489) T ss_pred HHHHHHHHHhCCcccccccccccch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccce Confidence 7888899988888533221111121 12211 122233333333333322211000 111 Q ss_pred ccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc----hhHH-------hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 318 ADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE----LPKG-------ENPNRTILKGGETNGQD 383 (383) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d----~~~~-------~~~~~~~~~ggd~~~~d 383 (383) +......-.+..+.+..+.++ .|++++-.+.+++. .++..| +.++ .... ....+|+.++++ T Consensus 411 v~f~~~~p~d~~~~~~~~~kl--~giis~et~~~~l~--~v~~~d~~~E~~ri~~E~~~~~~~~-~~~~~~~~~~~~ 482 (489) T protein:vir:99 411 IVFTPNLPQNDNEIVTAAQNL--YGIVSDQTIFEILN--TVTGVDAEAELKRLKEEADKKQSLP-EPRLVGDASGQE 482 (489) T ss_pred EEeCCCCCcCHHHHHHHHHHH--hccCCHHHHHHhcC--CCCchhHHHHHHHHHHHHHHHhccc-cccccCCCCCCc Confidence 111122223455566666665 48999998888763 333222 2221 1112 222344443333 No 224 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=88.94 E-value=0.029 Score=28.99 Aligned_cols=353 Identities=14% Similarity=0.100 Sum_probs=146.5 Q ss_pred CchhhhhhcCCccccccccccc---chhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhC--cee------eecch Q lcl|NC_018285. 
1 MPIFNLATESPPNNQGGFFDIT---DPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA--KLT------TSRKQ 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~--p~~------~~~~~ 69 (383) =+.|+.++..|..+...|..+. .|..+..-.....-...+.+ .++-..|++.+|+.+-+. |-. +.+.. T Consensus 15 ~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~ 93 (535) T protein:vir:33 15 KATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPW-QAVGARGLNNLASKLMLALFPMQSWMKLTISEYE 93 (535) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccc-cccHHHHHHHHHHHHHHhhcCCCcccccccChHH Confidence 3566777766654444443322 22222111000000001112 344455666666655442 211 11100 Q ss_pred hhhhccCC-----------------Cc---cCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcC Q lcl|NC_018285. 70 MQGIVDNP-----------------SN---SANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLD 129 (383) Q Consensus 70 ~~~l~~~P-----------------N~---~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~ 129 (383) ...+...+ .. .-+.+.-+..+..+++.+|||.+++..+. +..+.+..++-.++-+..+. T Consensus 94 ~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~-~~~~~f~~~pl~~~~v~~d~ 172 (535) T protein:vir:33 94 AKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPE-GSYNPMKLYRLSSYVVQRDA 172 (535) T ss_pred HhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCC-CCceeeEEEEcCeeEEeeCC Confidence 00000000 00 11233344566788999999999987654 23233333333444343333 Q ss_pred CCcee-----------------------------------EEEEeecCcccccce-----------------eecccceE Q lcl|NC_018285. 130 NQNGL-----------------------------------YYNVTFDDPRIPPKQ-----------------HVPQSDIL 157 (383) Q Consensus 130 ~~~~~-----------------------------------~y~~~~~~~~~~~~~-----------------~~~~~dvi 157 (383) .|... .|....-...++... 
.|...-.+ T Consensus 173 ~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~i 252 (535) T protein:vir:33 173 YGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYI 252 (535) T ss_pred CCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEeeCCCCcEEEEEEEeCccccccccccccccCCce Confidence 32110 011000000011000 01122234 Q ss_pred EeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeec- Q lcl|NC_018285. 158 HFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVL- 236 (383) Q Consensus 158 h~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl- 236 (383) ..|....++..||.||...+...+...+.+.+.......-...|.+++..++........ .+..+.++. T Consensus 253 ~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~----------~~~~g~~v~g 322 (535) T protein:vir:33 253 PVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLT----------KAQTGDFVPG 322 (535) T ss_pred eeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcc----------cCCceeeecC Confidence 444444556789999999999999999999999999988888888887665554432211 111122222 Q ss_pred -CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--HHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018285. 237 -DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--NVYSKAVARYLRPFLSELSQKLS 313 (383) Q Consensus 237 -~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~~~~~~l~P~~~~i~~~l~~~l~ 313 (383) .+++...++...++-.-..+..+..+..|-.+|-+.. +....+..-+.+|... 
.=....|-|....+.++|=.-|+ T Consensus 323 ~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli 401 (535) T protein:vir:33 323 RREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLV 401 (535) T ss_pred CcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHH Confidence 2344555544443222235556777888989986552 1111122223334332 23344566666666665533332 Q ss_pred ch-h----hccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhh-cCCcCCcchhHHhCCCCCCCCCCCCCCC--C Q lcl|NC_018285. 314 CD-V----DADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQ-QAEILPKELPKGENPNRTILKGGETNGQ--D 383 (383) Q Consensus 314 ~~-~----e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg-~~~~~~~d~~~~~~~~~~~~~ggd~~~~--d 383 (383) .. + +.+..+.+..+ +++-.+.|+-++-.+.. ...+. .-+-.+-.. +-+.-.+ | T Consensus 402 ~r~~~il~r~g~lP~~p~~----------~v~~~yis~La~aqr~~~~~~l~-~~~~~la~~------~P~~~d~~id 462 (535) T protein:vir:33 402 RVLLKQLQATSQIPELPKE----------AVEPTISTGLEAIGRGQDLDKLE-RCISAWAAL------APMQGDPDIN 462 (535) T ss_pred HHHHHHHHhcCCCCCCCcc----------ceeEEEecHHHHHHHHHHHHHHH-HHHHHHHhh------ChhhhhccCC Confidence 11 0 00111100000 00111111111100000 00000 000000001 0111111 3 No 225 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=86.57 E-value=0.045 Score=27.99 Aligned_cols=364 Identities=12% Similarity=0.100 Sum_probs=152.9 Q ss_pred hhhhhhcCCccccccccc------ccchhhccc---c-cCCcee------c---hh----hhhccHHHHHHHHHHHHhhh Q lcl|NC_018285. 3 IFNLATESPPNNQGGFFD------ITDPEFLAT---L-NGSEWV------S---AE----TALKNSDLFSIISQLSNDLA 59 (383) Q Consensus 3 lf~~~~~~~~~~~~~~~~------~~~~~~~~~---~-~~~~~~------~---~~----~a~~~~~v~~~i~~ia~~ia 59 (383) +-++-.+........+.. ..+..+.|. . 
.+..++ + ++ .|.-.+.+...++.++..+- T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vf 80 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSGKPF 80 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhhhhh Confidence 111100000000011100 000101000 0 000111 0 11 23334556666777776666 Q ss_pred hCceeeecchhh----hhccCCC-ccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCc------------------eeEEE Q lcl|NC_018285. 60 TAKLTTSRKQMQ----GIVDNPS-NSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGR------------------DMKWE 116 (383) Q Consensus 60 ~~p~~~~~~~~~----~l~~~PN-~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~------------------~~~l~ 116 (383) +-|..+...... .|..... ...+-.+|.+.++...+.+|-|++++.....+. | -+. T Consensus 81 ~k~p~~~~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rP-y~~ 159 (513) T protein:vir:97 81 SEPIKLNEDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRP-YWV 159 (513) T ss_pred hcCcccCcCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCc-eEE Confidence 667666532222 2333333 457789999999999999999999997543321 2 122 Q ss_pred EeccceeEE--------------------EEcCCCc--eeEEEEe-ecCccc----------c---cceeeccc------ Q lcl|NC_018285. 117 YLRPSQVSF--------------------NRLDNQN--GLYYNVT-FDDPRI----------P---PKQHVPQS------ 154 (383) Q Consensus 117 ~l~~~~v~~--------------------~~~~~~~--~~~y~~~-~~~~~~----------~---~~~~~~~~------ 154 (383) .+.|..|-- ....++. ....++. ...+.. . ........ T Consensus 160 ~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e~~~~~~g~~~l~~ 239 (513) T protein:vir:97 160 MIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEEWALADEWATGLNY 239 (513) T ss_pred EecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCccccceEEecCCCCcCCc Confidence 233322200 0011111 1110000 000000 0 00000000 Q ss_pred -ceEEeccCCCCccccCcchHHHHHH-HHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcc Q lcl|NC_018285. 
155 -DILHFRLLSVDGGLTSVSPLMALGR-ELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGG 232 (383) Q Consensus 155 -dvih~~~~~~~~~~~G~s~~~~~~~-~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~ 232 (383) .++.+ +....+...|.||+..+.. .+........+.... ...+.|-.+++.-..... ....-+++. T Consensus 240 IP~v~~-~~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il-~~~~~P~l~~~G~~~~~~----------~~i~iG~~~ 307 (513) T protein:vir:97 240 VPLVTF-YADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHIL-TVSRFPILACSGASGEDS----------DPVVVGPNK 307 (513) T ss_pred eeEEEE-ecCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHH-HhcccceeeeecCCcCCC----------CceEeeccc Confidence 01111 1112344557788776554 445544444444443 344566666653211110 012234456 Q ss_pred eeecCC-C--ceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHH--HHHHHHHHHHHHHHH Q lcl|NC_018285. 233 PLVLDD-L--EDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNV--YSKAVARYLRPFLSE 307 (383) Q Consensus 233 ~~vl~~-g--~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~--~~~~l~P~~~~i~~~ 307 (383) ++.+++ | .+|.+.+.+..... .+..+...+++ ...| ..+|..++.+.+..+....+ ....|.-++..++++ T Consensus 308 ~~~lpe~~~~~~yie~~g~~i~~~-~~~l~~le~qm-~~~G--a~ll~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~a 383 (513) T protein:vir:97 308 VLYNPDPAGRFYYVEHTGQAIAAG-RTDLKDLEEQM-AGYG--AEFLKRKTGGQTATARALDSAEATSDLSAMTGLFEDA 383 (513) T ss_pred cccCCCCCCcceeeccCchhHHHH-HHHHHHHHHHH-HHHH--HHhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666764 4 56666666654433 22223333333 2223 23343222222222222222 344566677788887 Q ss_pred HHHhhcc----------hhhccchhhhccCH--HHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchh----------H Q lcl|NC_018285. 308 LSQKLSC----------DVDADIFPAVDPTG--ANYISRINSMVKSGTLAQNQGLYILQQAEILPKELP----------K 365 (383) Q Consensus 308 l~~~l~~----------~~e~~~~~~~~~~~--~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~----------~ 365 (383) |+..|-- ...+.+...|.... ......+-+++..|.++....++.|.+-++.+.|.. 
+ T Consensus 384 l~~~l~~~a~wlg~~~~~~~v~in~dF~~~~~~~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~ 463 (513) T protein:vir:97 384 LAQALDITADWLRLGPNGGTVELVKDYDLEEMDAPGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEE 463 (513) T ss_pred HHHHHHHHHHHhCCCCCccEEEeccccCcccCCHHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHh Confidence 7765421 12333433333222 233455667788999999999888877666432111 1 Q ss_pred Hh------C-----CCCCCCCCCCCCCCC Q lcl|NC_018285. 366 GE------N-----PNRTILKGGETNGQD 383 (383) Q Consensus 366 ~~------~-----~~~~~~~ggd~~~~d 383 (383) .+ + .+..|-+||+.++.+ T Consensus 464 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 492 (513) T protein:vir:97 464 ISEAMGRAGLDLDPAQKNPPEGGEGEGEG 492 (513) T ss_pred hhhccCCCCccccccCCCCCCCCCCCCCC Confidence 10 0 111122233322222 No 226 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=85.57 E-value=0.052 Score=27.63 Aligned_cols=367 Identities=11% Similarity=0.056 Sum_probs=145.2 Q ss_pred CchhhhhhcCCc--ccccccccccchhhcccc----cCCceec--------------hhh----hhccHHHHHHHHHHHH Q lcl|NC_018285. 1 MPIFNLATESPP--NNQGGFFDITDPEFLATL----NGSEWVS--------------AET----ALKNSDLFSIISQLSN 56 (383) Q Consensus 1 Mglf~~~~~~~~--~~~~~~~~~~~~~~~~~~----~~~~~~~--------------~~~----a~~~~~v~~~i~~ia~ 56 (383) |.=.+. +.|. .....|. ..+..+.|.. .+..++. .+. |.-.+.+...++.+.. T Consensus 1 m~~V~~--~hp~y~~~~~~W~-~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G 77 (501) T protein:vir:95 1 MPNVSF--IRPELGKLLPLYY-LIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVG 77 (501) T ss_pred CCCCCC--CCHHHHHHHHHHH-HHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhh Confidence 441111 0000 0000000 0000000000 0001110 111 2223344444444444 Q ss_pred hhhhCceeeec-chhhhhccCCC-ccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCc---------------eeEEEEec Q lcl|NC_018285. 
57 DLATAKLTTSR-KQMQGIVDNPS-NSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGR---------------DMKWEYLR 119 (383) Q Consensus 57 ~ia~~p~~~~~-~~~~~l~~~PN-~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~---------------~~~l~~l~ 119 (383) .+-+-|..+.- .....|..... ...+-.+|.+.++...+.+|-+++++.....+. | -+..+. T Consensus 78 ~vf~k~p~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~rP-y~~~~~ 156 (501) T protein:vir:95 78 QVFMRDPVVKVPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIRP-TLYVYS 156 (501) T ss_pred hhhcCCcceeCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCCc-EEEEec Confidence 44444433321 11222333332 446789999999999999999999997543211 2 122233 Q ss_pred cceeE-----------------EE---EcCCC------------------ceeEEEEeecCccc-ccce----------- Q lcl|NC_018285. 120 PSQVS-----------------FN---RLDNQ------------------NGLYYNVTFDDPRI-PPKQ----------- 149 (383) Q Consensus 120 ~~~v~-----------------~~---~~~~~------------------~~~~y~~~~~~~~~-~~~~----------- 149 (383) |..|- +. ...++ +...+++....... .... T Consensus 157 ~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~~~~~ 236 (501) T protein:vir:95 157 PTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGNYQQYV 236 (501) T ss_pred HhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCcccccc Confidence 32210 00 00011 11112211111000 0000 Q ss_pred -eec------ccc---eEEeccCCCCccccCcchHHHHHH-HHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHH Q lcl|NC_018285. 150 -HVP------QSD---ILHFRLLSVDGGLTSVSPLMALGR-ELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTK 218 (383) Q Consensus 150 -~~~------~~d---vih~~~~~~~~~~~G~s~~~~~~~-~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~ 218 (383) ..+ .-. ++++ +....+...|.+|+..+.. .+........+. ..+...+.|-.+++.... +.... 
T Consensus 237 ~~~~~~~g~~~l~~IPfv~~-~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~-~~l~~~~~P~l~i~G~~~---~~~~~ 311 (501) T protein:vir:95 237 VYKPTDAQGKRLTEIPFMFI-GSENNDSNPDNPNFYDLASLNMAHYRNSADYE-ESCYIVGQPTPVLIGLTE---EWVTN 311 (501) T ss_pred eeeeeccCCCcCCeeeEEEE-ecCCCCCCCCccchHHHHHHHHHHHhhhhHHH-HHHHHcccceeeeeCCcc---ccccc Confidence 000 000 1111 1222333456777776553 333333333333 334445667666653222 11000 Q ss_pred HHHHHHHhhcCCcceeecCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--HHHHHH Q lcl|NC_018285. 219 VSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--NVYSKA 296 (383) Q Consensus 219 ~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~~~~~~ 296 (383) ..+ ....-+++..+.++.|.++.=+..++.-+. .+..+...+++..+ | ..++.....+.+..+... .--... T Consensus 312 ~~~--~~i~~G~~~~~~lP~~~~~~~ie~~~~~i~-~~~l~~l~~~m~~~-G--a~ll~~~~~~~Ta~~~~~~~~~~~S~ 385 (501) T protein:vir:95 312 VLK--GSVNFGSRGGIPLPVGADAKLLQASENTML-KEAMDTKERQMVAL-G--AKLVEQKEVQRTATEAELEAASEGST 385 (501) T ss_pred CCC--CceeecccccccCCCCCceeEEecChhhHH-HHHHHHHHHHHHHH-H--HhhccCCccchhHHHHHHHHHHHhHH Confidence 000 011113345566766654443333332222 22233333333322 3 233332211122222221 223446 Q ss_pred HHHHHHHHHHHHHHhhc--------ch--hhccchhhhc---cCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcch Q lcl|NC_018285. 297 VARYLRPFLSELSQKLS--------CD--VDADIFPAVD---PTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKEL 363 (383) Q Consensus 297 l~P~~~~i~~~l~~~l~--------~~--~e~~~~~~~~---~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~ 363 (383) |.-++..++++|+..|- .+ .++.+...|. .+... ...+-+++.+|.++..+.++.|...++++.+. 
T Consensus 386 L~~~a~~le~al~~~l~~~a~w~g~~~~~~~v~i~~df~~~~~~~~~-~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~ 464 (501) T protein:vir:95 386 LSSATKNVSAAFEWALKWAARWVGQADSGVKFELNTDFDIARMTPDE-RRSLVEEWQKGAITFEEMRTGLRKAGVATEDD 464 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccccccCCHHH-HHHHHHHHhCCCCcHHHHHHHHHhCCCCChhH Confidence 77778888888776442 11 2233333332 23333 44566788999999999999998888775321 Q ss_pred ----hHH-h---C--------CCCCCCCCCCC-CCCC Q lcl|NC_018285. 364 ----PKG-E---N--------PNRTILKGGET-NGQD 383 (383) Q Consensus 364 ----~~~-~---~--------~~~~~~~ggd~-~~~d 383 (383) .+. + + ....+..|||+ ..+| T Consensus 465 ~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 465 SKAKEKIAKDTAEAMALATPANVPGDGSGGDNVGNSE 501 (501) T ss_pred HHHHHHHHhhhcCcccccccCCCCCCCcccccccCCC Confidence 111 0 0 01112235555 4444 No 227 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=85.19 E-value=0.055 Score=27.50 Aligned_cols=351 Identities=14% Similarity=0.084 Sum_probs=146.5 Q ss_pred CchhhhhhcCCcccccccccc---cchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhC--cee------eecch Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDI---TDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA--KLT------TSRKQ 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~--p~~------~~~~~ 69 (383) =+.|+.++..|..+...|..+ +.|..+..-.....-...+.+ .++-..|++.+|+.+-+. |-. +.+.. 
T Consensus 15 k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~ 93 (535) T protein:vir:15 15 KATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPW-QAVGARGLNNLASKLMLALFPMQSWMKLTISEYE 93 (535) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccc-cccHHHHHHHHHHHHHHhhcCCCcccccccChHH Confidence 346777777665544444332 223222211110000001122 344455666666655431 211 11100 Q ss_pred hhh---------------------hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCC-ceeEEEEeccceeEEEE Q lcl|NC_018285. 70 MQG---------------------IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNG-RDMKWEYLRPSQVSFNR 127 (383) Q Consensus 70 ~~~---------------------l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g-~~~~l~~l~~~~v~~~~ 127 (383) ... +...-+ .-+.+.-+..+..+++.+|||.+++..+..+ .+...||| .++-+.. T Consensus 94 ~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl--~~~~v~~ 170 (535) T protein:vir:15 94 AKQLVGDPDGLAKVDEGLSMVERIIMNYIE-SNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRL--SSYVVQR 170 (535) T ss_pred HhccCCCcchHHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEc--CeeEEee Confidence 000 000011 1223444556678899999999888665432 23444444 3333333 Q ss_pred cCCCce-----------------------------------eEEEEeecCcccccc-----------------eeecccc Q lcl|NC_018285. 128 LDNQNG-----------------------------------LYYNVTFDDPRIPPK-----------------QHVPQSD 155 (383) Q Consensus 128 ~~~~~~-----------------------------------~~y~~~~~~~~~~~~-----------------~~~~~~d 155 (383) +..+.. ..|....-...++.. ..|...- T Consensus 171 d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~~~~~~P 250 (535) T protein:vir:15 171 DAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMP 250 (535) T ss_pred CCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEEEEEeeCccccccccccccccCC Confidence 322211 011110000000000 0111222 Q ss_pred eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceee Q lcl|NC_018285. 
156 ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLV 235 (383) Q Consensus 156 vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~v 235 (383) .+..|....++..||.||...+...+...+.+.+.......-...|.+++..++........ .+..+.++ T Consensus 251 ~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~----------~~~~g~~v 320 (535) T protein:vir:15 251 YIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLT----------KAQTGDFV 320 (535) T ss_pred ceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhcc----------cCCceeee Confidence 34444444556789999999999999999999999999988888888887665554442211 11112222 Q ss_pred c--CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--HHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 236 L--DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--NVYSKAVARYLRPFLSELSQK 311 (383) Q Consensus 236 l--~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~~~~~~l~P~~~~i~~~l~~~ 311 (383) . .+++...++...++-.-..+..+..+..|-.+|-+.. +....+..-+.+|... .=....|-|....+.++|=.- T Consensus 321 ~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~P 399 (535) T protein:vir:15 321 PGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLP 399 (535) T ss_pred cCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHH Confidence 2 2344555544443222234556677888989986552 1111122223334332 233445666666666655333 Q ss_pred hcch-h----hccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhh-cCCcCCcchhHHhCCCCCCCCCCCCCCC--C Q lcl|NC_018285. 312 LSCD-V----DADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQ-QAEILPKELPKGENPNRTILKGGETNGQ--D 383 (383) Q Consensus 312 l~~~-~----e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg-~~~~~~~d~~~~~~~~~~~~~ggd~~~~--d 383 (383) |+.. + +.+..+.+..+ +++-.+.|+-++-.+.. ...+. .-+-.+-.. 
+-+.-.+ | T Consensus 400 li~r~~~il~r~g~lP~~p~~----------~v~~~yis~La~aqr~~~~~~l~-~~~~~la~~------~P~~ld~~id 462 (535) T protein:vir:15 400 LVRVLLKQLQATSQIPELPKE----------AVEPTISTGLEAIGRGQDLDKLE-RCISAWAAL------APMQGDPDIN 462 (535) T ss_pred HHHHHHHHHHhcCCCCCCCcc----------ceeEEEecHHHHHHHHHHHHHHH-HHHHHHHhc------ChhhhhccCC Confidence 3211 0 00111100000 00111111111100000 00000 000000001 0111111 3 No 228 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=82.68 E-value=0.075 Score=26.76 Aligned_cols=374 Identities=11% Similarity=0.136 Sum_probs=164.0 Q ss_pred CchhhhhhcC-----------C------ccccccccc-------ccchhhccc-ccCCcee--------chhhhhccHHH Q lcl|NC_018285. 1 MPIFNLATES-----------P------PNNQGGFFD-------ITDPEFLAT-LNGSEWV--------SAETALKNSDL 47 (383) Q Consensus 1 Mglf~~~~~~-----------~------~~~~~~~~~-------~~~~~~~~~-~~~~~~~--------~~~~a~~~~~v 47 (383) +.||+...+. + +....+... .+..++.+. ......+ ..+..+.+|.| T Consensus 4 ~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEv 83 (516) T protein:vir:10 4 LDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPEV 83 (516) T ss_pred hHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhccch Confidence 2333331110 0 001100000 000111100 0000001 24556789999 Q ss_pred HHHHHHHHHhhhhC-----ceeeecchh--------------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeec- Q lcl|NC_018285. 48 FSIISQLSNDLATA-----KLTTSRKQM--------------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRN- 107 (383) Q Consensus 48 ~~~i~~ia~~ia~~-----p~~~~~~~~--------------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~- 107 (383) ..||+-|.+.+.-+ |+.+.=.+. 
+.+..--|....+ ...+..|++.|..|..++.+ T Consensus 84 d~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~----~~~fR~WYVDgRi~fhKiid~ 159 (516) T protein:vir:10 84 ERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKL----DTLFRRWYVDSRIFFHKIMPN 159 (516) T ss_pred hhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhh----hHHHhhhhhcceEEEEEEecC Confidence 99999999886532 222211110 0111111222223 35567888999999886654 Q ss_pred CCCceeEEEEeccceeEEEEcC---C--Cc------eeEEEEee-------cCc--ccccceeecccceEEec--cCCCC Q lcl|NC_018285. 108 DNGRDMKWEYLRPSQVSFNRLD---N--QN------GLYYNVTF-------DDP--RIPPKQHVPQSDILHFR--LLSVD 165 (383) Q Consensus 108 ~~g~~~~l~~l~~~~v~~~~~~---~--~~------~~~y~~~~-------~~~--~~~~~~~~~~~dvih~~--~~~~~ 165 (383) .+.-+.+|.+|+|..+..++.- + +. ..+|.|.. .+. ..+..+.++.+-|.|.. ..+.+ T Consensus 160 ~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL~d~~ 239 (516) T protein:vir:10 160 PKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGLMDCS 239 (516) T ss_pred ccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccceeCC Confidence 3445899999999998776532 1 11 12222221 110 11123445555555544 22333 Q ss_pred ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCCH-HHHHHHHHHHHH-----hhcCC-c------ Q lcl|NC_018285. 166 GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGLL-DFKTKVSRSRQA-----MKQMQ-G------ 231 (383) Q Consensus 166 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~~-e~~~~~~~~~~~-----~~~~~-g------ 231 (383) +... +|-|..+.+.+....-++....=+.-.-+.-+-+.-.+ |.+.+ .+.+.++..... 
.++++ | T Consensus 240 ~~~i-~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddr 318 (516) T protein:vir:10 240 DRGI-IGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQK 318 (516) T ss_pred CCce-eeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccch Confidence 3333 78888888888888777777665555545444444333 33333 333333332211 11111 1 Q ss_pred ce--------eec---CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc-----CcCHHHHHH-HHHH Q lcl|NC_018285. 232 GP--------LVL---DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD-----QQSSLEMSS-NVYS 294 (383) Q Consensus 232 ~~--------~vl---~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~-----~~~~~e~~~-~~~~ 294 (383) +. +.= +.|.+++.+.....-.+ ++-..+..+.+..+++||.+-|+.... +.+++=.+. .=+. T Consensus 319 k~msMlEDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~ 397 (516) T protein:vir:10 319 RNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGD-MDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFR 397 (516) T ss_pred hhhhhHhhhcccccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHH Confidence 11 111 23567776655433333 233357788999999999999974322 122221222 2233 Q ss_pred HHHHHHHHHHHHHHHH----hhc-----chhhcc-----chhhhccCHH--------HHHHHHHHH--H---hCCCcCHH Q lcl|NC_018285. 295 KAVARYLRPFLSELSQ----KLS-----CDVDAD-----IFPAVDPTGA--------NYISRINSM--V---KSGTLAQN 347 (383) Q Consensus 295 ~~l~P~~~~i~~~l~~----~l~-----~~~e~~-----~~~~~~~~~~--------~~~~~~~~l--~---~~g~~t~n 347 (383) ..|.-+-..+...|.. .|. +.-|++ +...+..|.. -....++.+ + -+..++.+ T Consensus 398 KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~ 477 (516) T protein:vir:10 398 KFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHD 477 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchH Confidence 4444444444444433 332 111221 1111111111 111122211 1 25677777 Q ss_pred HHHHHhhcCCcCCcchhHHh-----CCCCCCCCCCCCCCCC Q lcl|NC_018285. 
348 QGLYILQQAEILPKELPKGE-----NPNRTILKGGETNGQD 383 (383) Q Consensus 348 E~r~~lg~~~~~~~d~~~~~-----~~~~~~~~ggd~~~~d 383 (383) =+++.+-.- +..|+...+ .......+ -..+++| T Consensus 478 yi~k~ILr~--tDeei~~e~k~I~~E~~~~~~~-~p~~~~~ 515 (516) T protein:vir:10 478 YVMKNILQM--TEEQIAQEEKQIEQEAGIKRFQ-NPENEDD 515 (516) T ss_pred HHHHHHhcC--CHhhHHHHHHHHHHhhhCCCCC-CCCcccc Confidence 777754221 223332211 11111111 1122233 No 229 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=82.68 E-value=0.075 Score=26.76 Aligned_cols=374 Identities=11% Similarity=0.136 Sum_probs=164.0 Q ss_pred CchhhhhhcC-----------C------ccccccccc-------ccchhhccc-ccCCcee--------chhhhhccHHH Q lcl|NC_018285. 1 MPIFNLATES-----------P------PNNQGGFFD-------ITDPEFLAT-LNGSEWV--------SAETALKNSDL 47 (383) Q Consensus 1 Mglf~~~~~~-----------~------~~~~~~~~~-------~~~~~~~~~-~~~~~~~--------~~~~a~~~~~v 47 (383) +.||+...+. + +....+... .+..++.+. ......+ ..+..+.+|.| T Consensus 4 ~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEv 83 (516) T protein:vir:10 4 LDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPEV 83 (516) T ss_pred hHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhccch Confidence 2333331110 0 001100000 000111100 0000001 24556789999 Q ss_pred HHHHHHHHHhhhhC-----ceeeecchh--------------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeec- Q lcl|NC_018285. 48 FSIISQLSNDLATA-----KLTTSRKQM--------------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRN- 107 (383) Q Consensus 48 ~~~i~~ia~~ia~~-----p~~~~~~~~--------------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~- 107 (383) ..||+-|.+.+.-+ |+.+.=.+. 
+.+..--|....+ ...+..|++.|..|..++.+ T Consensus 84 d~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~----~~~fR~WYVDgRi~fhKiid~ 159 (516) T protein:vir:10 84 ERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKL----DTLFRRWYVDSRIFFHKIMPN 159 (516) T ss_pred hhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhh----hHHHhhhhhcceEEEEEEecC Confidence 99999999886532 222211110 0111111222223 35567888999999886654 Q ss_pred CCCceeEEEEeccceeEEEEcC---C--Cc------eeEEEEee-------cCc--ccccceeecccceEEec--cCCCC Q lcl|NC_018285. 108 DNGRDMKWEYLRPSQVSFNRLD---N--QN------GLYYNVTF-------DDP--RIPPKQHVPQSDILHFR--LLSVD 165 (383) Q Consensus 108 ~~g~~~~l~~l~~~~v~~~~~~---~--~~------~~~y~~~~-------~~~--~~~~~~~~~~~dvih~~--~~~~~ 165 (383) .+.-+.+|.+|+|..+..++.- + +. ..+|.|.. .+. ..+..+.++.+-|.|.. ..+.+ T Consensus 160 ~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL~d~~ 239 (516) T protein:vir:10 160 PKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGLMDCS 239 (516) T ss_pred ccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccceeCC Confidence 3445899999999998776532 1 11 12222221 110 11123445555555544 22333 Q ss_pred ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCCH-HHHHHHHHHHHH-----hhcCC-c------ Q lcl|NC_018285. 166 GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGLL-DFKTKVSRSRQA-----MKQMQ-G------ 231 (383) Q Consensus 166 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~~-e~~~~~~~~~~~-----~~~~~-g------ 231 (383) +... +|-|..+.+.+....-++....=+.-.-+.-+-+.-.+ |.+.+ .+.+.++..... 
.++++ | T Consensus 240 ~~~i-~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddr 318 (516) T protein:vir:10 240 DRGI-IGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQK 318 (516) T ss_pred CCce-eeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccch Confidence 3333 78888888888888777777665555545444444333 33333 333333332211 11111 1 Q ss_pred ce--------eec---CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc-----CcCHHHHHH-HHHH Q lcl|NC_018285. 232 GP--------LVL---DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD-----QQSSLEMSS-NVYS 294 (383) Q Consensus 232 ~~--------~vl---~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~-----~~~~~e~~~-~~~~ 294 (383) +. +.= +.|.+++.+.....-.+ ++-..+..+.+..+++||.+-|+.... +.+++=.+. .=+. T Consensus 319 k~msMlEDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~ 397 (516) T protein:vir:10 319 RNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGD-MDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFR 397 (516) T ss_pred hhhhhHhhhcccccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHH Confidence 11 111 23567776655433333 233357788999999999999974322 122221222 2233 Q ss_pred HHHHHHHHHHHHHHHH----hhc-----chhhcc-----chhhhccCHH--------HHHHHHHHH--H---hCCCcCHH Q lcl|NC_018285. 295 KAVARYLRPFLSELSQ----KLS-----CDVDAD-----IFPAVDPTGA--------NYISRINSM--V---KSGTLAQN 347 (383) Q Consensus 295 ~~l~P~~~~i~~~l~~----~l~-----~~~e~~-----~~~~~~~~~~--------~~~~~~~~l--~---~~g~~t~n 347 (383) ..|.-+-..+...|.. .|. +.-|++ +...+..|.. -....++.+ + -+..++.+ T Consensus 398 KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~ 477 (516) T protein:vir:10 398 KFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHD 477 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchH Confidence 4444444444444433 332 111221 1111111111 111122211 1 25677777 Q ss_pred HHHHHhhcCCcCCcchhHHh-----CCCCCCCCCCCCCCCC Q lcl|NC_018285. 
348 QGLYILQQAEILPKELPKGE-----NPNRTILKGGETNGQD 383 (383) Q Consensus 348 E~r~~lg~~~~~~~d~~~~~-----~~~~~~~~ggd~~~~d 383 (383) =+++.+-.- +..|+...+ .......+ -..+++| T Consensus 478 yi~k~ILr~--tDeei~~e~k~I~~E~~~~~~~-~p~~~~~ 515 (516) T protein:vir:10 478 YVMKNILQM--TEEQIAQEEKQIEQEAGIKRFQ-NPENEDD 515 (516) T ss_pred HHHHHHhcC--CHhhHHHHHHHHHHhhhCCCCC-CCCcccc Confidence 777754221 223332211 11111111 1122233 No 230 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=82.55 E-value=0.076 Score=26.72 Aligned_cols=356 Identities=15% Similarity=0.107 Sum_probs=146.4 Q ss_pred CchhhhhhcCCcccccccccc---cchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhC--cee------eecch Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDI---TDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA--KLT------TSRKQ 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~--p~~------~~~~~ 69 (383) =+.|+.++..|..+...|..+ +.|..+..-.....-...+ +-.++-..|++.+|+.+-+. |-. +.+.. T Consensus 15 ~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~-~~dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~ 93 (543) T protein:vir:88 15 KAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTT-PWQAVGARGLNNLSAKVMLALFPLQSWMKLKVSEWQ 93 (543) T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccc-cccchHHHHHHHHHHHHHHhhcCCCcccccccChHH Confidence 234566666555444434332 2232221111110000011 22444456666666655432 221 11101 Q ss_pred hhhhccCCCc--------------------cCCHHHHHHHHHHHHHHcCCeEEEEeecCCC----ceeEEEEeccceeEE Q lcl|NC_018285. 70 MQGIVDNPSN--------------------SANRFNFYQSIFAQMLLGGEAFAYRWRNDNG----RDMKWEYLRPSQVSF 125 (383) Q Consensus 70 ~~~l~~~PN~--------------------~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g----~~~~l~~l~~~~v~~ 125 (383) ...+...|.. .-+.+.-+..+..+++.+|||.+++..+... .+...|||.. 
+-+ T Consensus 94 ~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~~~~~~~pl~~--y~v 171 (543) T protein:vir:88 94 AKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASSNSYNPMKLYTLHN--HVV 171 (543) T ss_pred HhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCccccceecceEEeEcce--EEE Confidence 0000000000 1123333456678899999999888654321 2234455532 222 Q ss_pred EEcCCCc----------------------------------eeEEEEee--cCc-ccc-----cceee---------ccc Q lcl|NC_018285. 126 NRLDNQN----------------------------------GLYYNVTF--DDP-RIP-----PKQHV---------PQS 154 (383) Q Consensus 126 ~~~~~~~----------------------------------~~~y~~~~--~~~-~~~-----~~~~~---------~~~ 154 (383) ..+..|. ...|.... .++ ... ....+ ... T Consensus 172 ~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~V~pr~~~~~~~~~~~~~~~~v~~~~~~~~~~e~ 251 (543) T protein:vir:88 172 QRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTHIYIDDESGDFLSYQEIEGVEVDGSDGQYPQDAL 251 (543) T ss_pred eeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEEEEeecCCCcccccccccCeeeecCCCccccccC Confidence 2222221 11111111 000 000 00111 112 Q ss_pred ceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCccee Q lcl|NC_018285. 155 DILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPL 234 (383) Q Consensus 155 dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~ 234 (383) -.+..|....++..||.||...+...+...+.+.+.......-...|..++..++........ .+..+.+ T Consensus 252 P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~----------~~~~g~~ 321 (543) T protein:vir:88 252 PWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRRLV----------KAQTGDF 321 (543) T ss_pred CceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcc----------cCCCcee Confidence 234444334456789999999999999999999999999888888888887666554442211 1111222 Q ss_pred ec--CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--HHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 
235 VL--DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--NVYSKAVARYLRPFLSELSQ 310 (383) Q Consensus 235 vl--~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~~~~~~l~P~~~~i~~~l~~ 310 (383) +. .+++...++...++-.-..+..+..+..|-.+|-+... ....+..-+.+|... .=....|-|....+.++|=. T Consensus 322 v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~-~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~ 400 (543) T protein:vir:88 322 VAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNSA-VQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQL 400 (543) T ss_pred ecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhhh-ccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHH Confidence 22 23445555554333233355667778889989976531 112222223444332 23445666777777666543 Q ss_pred hhcch-h----hccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHH-HhhcCCcCCcchhHHhCCCCCCCCCCCC-CCCC Q lcl|NC_018285. 311 KLSCD-V----DADIFPAVDPTGANYISRINSMVKSGTLAQNQGLY-ILQQAEILPKELPKGENPNRTILKGGET-NGQD 383 (383) Q Consensus 311 ~l~~~-~----e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~-~lg~~~~~~~d~~~~~~~~~~~~~ggd~-~~~d 383 (383) -|+.+ + +.+..+..-.+ .++-.+.|.-.... ..+...+.. -+....... +|. --|. |... T Consensus 401 Pli~r~~~il~r~g~lP~~p~~----------~v~~~~vs~l~~l~r~~~~~~l~~-~~~~v~~~~-~p~-vld~id~d~ 467 (543) T protein:vir:88 401 PIVRVLLNQLQATQQIPNLPQE----------AVEPTVTTGAEALGRGQDLDKLTQ-FLNAVATVS-QLN-GDPDLNVNN 467 (543) T ss_pred HHHHHHHHHHHhcCCCCCCchh----------ceeeeEEecHHHHHHHHHHHHHHH-HHHHHHhcc-chh-hhccCCHHH Confidence 33211 0 00110000000 01111111100000 000000000 000000111 010 0111 1111 No 231 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=80.81 E-value=0.091 Score=26.28 Aligned_cols=374 Identities=12% Similarity=0.134 Sum_probs=159.9 Q ss_pred Cchhhhhhc--------CCcccccccccc--cchh-----------hcccccC------Cc-e----e-chhhhhccHHH Q lcl|NC_018285. 
1 MPIFNLATE--------SPPNNQGGFFDI--TDPE-----------FLATLNG------SE-W----V-SAETALKNSDL 47 (383) Q Consensus 1 Mglf~~~~~--------~~~~~~~~~~~~--~~~~-----------~~~~~~~------~~-~----~-~~~~a~~~~~v 47 (383) +.||+...+ +-......+... .+++ ..+..+. .. + + +.+..+.+|.| T Consensus 4 ~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~pEv 83 (516) T protein:vir:10 4 LDLFKFWDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLTNNPEV 83 (516) T ss_pred hHhcccccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhhhccch Confidence 233333111 000000000000 0100 0111100 00 0 1 34667789999 Q ss_pred HHHHHHHHHhhhhC-----ceeeecchh--------------hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeec- Q lcl|NC_018285. 48 FSIISQLSNDLATA-----KLTTSRKQM--------------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRN- 107 (383) Q Consensus 48 ~~~i~~ia~~ia~~-----p~~~~~~~~--------------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~- 107 (383) ..||+.|.+.+.-+ |+.+.-.+. +.+..--+....+ ...+..|++.|..|..++.+ T Consensus 84 d~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~----~~~fR~WYVDgRi~fhKiid~ 159 (516) T protein:vir:10 84 ERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKL----DTLFRRWYIDSRIFFHKIMPN 159 (516) T ss_pred hHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhh----hHHHHhhhhcceEEEEEEecC Confidence 99999999887533 333321111 0111112222233 35567888999999886654 Q ss_pred CCCceeEEEEeccceeEEEEcC---C--------CceeEEEEeecCc---------ccccceeecccceEEecc--CCCC Q lcl|NC_018285. 108 DNGRDMKWEYLRPSQVSFNRLD---N--------QNGLYYNVTFDDP---------RIPPKQHVPQSDILHFRL--LSVD 165 (383) Q Consensus 108 ~~g~~~~l~~l~~~~v~~~~~~---~--------~~~~~y~~~~~~~---------~~~~~~~~~~~dvih~~~--~~~~ 165 (383) .+.-+.+|.+|+|..+..++.. + +...+|.+...+. .......++.+-|.+... .+.. 
T Consensus 160 ~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~daI~y~hSGl~d~~ 239 (516) T protein:vir:10 160 PKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIPRSAIVYAHSGLQDCS 239 (516) T ss_pred cccceeeeeeeCCcceeeEEeeecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecchhheeeeecCcccCC Confidence 3445899999999998776532 1 1111222221110 111123344444433321 1112 Q ss_pred ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCC-HHHHHHHHHHHHH-----hhcCC-c------ Q lcl|NC_018285. 166 GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGL-LDFKTKVSRSRQA-----MKQMQ-G------ 231 (383) Q Consensus 166 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~-~e~~~~~~~~~~~-----~~~~~-g------ 231 (383) +.. =+|-|..+.+.+....-++....=+.-.-+.-+-+.-.+ |.+. ..+.+.++..... -++++ | T Consensus 240 ~~~-i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ddr 318 (516) T protein:vir:10 240 DRG-IVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQK 318 (516) T ss_pred CCc-eeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccch Confidence 212 257788888888877777777665555545444444333 3333 3333333332211 11111 1 Q ss_pred ceee-c----------CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc-----CcCHHHHHH-HHHH Q lcl|NC_018285. 232 GPLV-L----------DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD-----QQSSLEMSS-NVYS 294 (383) Q Consensus 232 ~~~v-l----------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~-----~~~~~e~~~-~~~~ 294 (383) +.+- + +.|.+++.+.....-.+ ++-..+..+.+..+++||.+-|+.... +.+++=.+. .=+. 
T Consensus 319 k~msMlEDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~ 397 (516) T protein:vir:10 319 RNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGE-MDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFR 397 (516) T ss_pred hhhhhHhhhcccccCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHH Confidence 1111 1 23567776655433333 233357788999999999999974321 222222221 1223 Q ss_pred HHHHHHHH----HHHHHHHHhhcc-----hhhcc-----chhhhccCHH--------HHHHHHHHH--H---hCCCcCHH Q lcl|NC_018285. 295 KAVARYLR----PFLSELSQKLSC-----DVDAD-----IFPAVDPTGA--------NYISRINSM--V---KSGTLAQN 347 (383) Q Consensus 295 ~~l~P~~~----~i~~~l~~~l~~-----~~e~~-----~~~~~~~~~~--------~~~~~~~~l--~---~~g~~t~n 347 (383) ..|.-+-. -+.+.|.+.|.. .-+++ +...+..|.. -....++.+ + -+..++.+ T Consensus 398 KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~ 477 (516) T protein:vir:10 398 KFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHD 477 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchH Confidence 33433333 333444444432 11221 1111111111 111122211 1 25677777 Q ss_pred HHHHHhhcCCcCCcchhHHh-----CCCCCCCCCCCCCCCC Q lcl|NC_018285. 
348 QGLYILQQAEILPKELPKGE-----NPNRTILKGGETNGQD 383 (383) Q Consensus 348 E~r~~lg~~~~~~~d~~~~~-----~~~~~~~~ggd~~~~d 383 (383) =+++.+-.- +..|+...+ .......+- ..++.| T Consensus 478 yi~k~ILr~--tDeei~~~~k~I~~E~~~~~~~~-p~~e~~ 515 (516) T protein:vir:10 478 YVMKNILQM--TDEQIAQEEKQIEKEANVKRFQN-PENEDD 515 (516) T ss_pred HHHHHHhcC--CHhHHHHHHHHHHHhhhCCCCCC-CCcccc Confidence 777754221 223332211 111111110 222233 No 232 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=78.41 E-value=0.11 Score=25.74 Aligned_cols=375 Identities=11% Similarity=0.092 Sum_probs=165.4 Q ss_pred Cchhhhh--hcCC------------cccccccccc--cchh----------hcccccCCcee--------------chhh Q lcl|NC_018285. 1 MPIFNLA--TESP------------PNNQGGFFDI--TDPE----------FLATLNGSEWV--------------SAET 40 (383) Q Consensus 1 Mglf~~~--~~~~------------~~~~~~~~~~--~~~~----------~~~~~~~~~~~--------------~~~~ 40 (383) ||+|+.+ ++.+ ......+..+ .+++ ..+...+..++ ..+. T Consensus 4 ~~~~~~l~~~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~YR~ 83 (524) T protein:vir:98 4 LGFGNVLSFFKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQLINTYRG 83 (524) T ss_pred cchhhHHHHhhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHHHHHHHH Confidence 5666532 1100 0000000000 0000 00000000011 2455 Q ss_pred hhccHHHHHHHHHHHHhhhhC-----ceeeecchh--------------hhhccCCCccCCHHHHHHHHHHHHHHcCCeE Q lcl|NC_018285. 41 ALKNSDLFSIISQLSNDLATA-----KLTTSRKQM--------------QGIVDNPSNSANRFNFYQSIFAQMLLGGEAF 101 (383) Q Consensus 41 a~~~~~v~~~i~~ia~~ia~~-----p~~~~~~~~--------------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~ 101 (383) .+.+|.|..||+-|.+.+.-+ |+.+.=.+. 
+.+..--+....+ ...+..|++.|..| T Consensus 84 ma~~pEvd~Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~----~~~fR~WYVDgRi~ 159 (524) T protein:vir:98 84 IMSYPEVENAVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNIYDFDNMG----ARLFRDWYVDSRIY 159 (524) T ss_pred HhhccchhhHHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhh----hHHHhhhhhcceeE Confidence 678999999999999876422 222211110 0111112222223 45677889999999 Q ss_pred EEEeecCCCc--eeEEEEeccceeEEEEc------CCCce------eEEEEee--cCc-------ccccceeecccceEE Q lcl|NC_018285. 102 AYRWRNDNGR--DMKWEYLRPSQVSFNRL------DNQNG------LYYNVTF--DDP-------RIPPKQHVPQSDILH 158 (383) Q Consensus 102 ~~i~r~~~g~--~~~l~~l~~~~v~~~~~------~~~~~------~~y~~~~--~~~-------~~~~~~~~~~~dvih 158 (383) +.++.+.+.. +.+|.+|+|..+..++. +.+.. .+|.+.. ... ..+....++.+.|.| T Consensus 160 fhkiid~~~~kGI~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAIvy 239 (524) T protein:vir:98 160 FHKIMHKDESKGIRELRQLDPRCMELIRESITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIKIPRSAIVY 239 (524) T ss_pred EEEEEcCCCCcceeeeeeeCCccceeeeeccccccccchhhccceeeeeeeccCCCccccccceecCCCceeechhheee Confidence 9998765543 89999999999876541 11111 1222211 000 112335688888888 Q ss_pred eccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCCH-HHHHHHHHHHHHhh-----c--- Q lcl|NC_018285. 159 FRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGLL-DFKTKVSRSRQAMK-----Q--- 228 (383) Q Consensus 159 ~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~~-e~~~~~~~~~~~~~-----~--- 228 (383) ...--.+..-.=+|-|..+.+.+....-++....=+.-.-+.-+-+.-.+ |.+.+ .+.+.++....... 
+ T Consensus 240 ~hSGL~d~~~~iisyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~T 319 (524) T protein:vir:98 240 AHSGLEDCSNNIIGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDART 319 (524) T ss_pred eccCcccCCCCeeeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccC Confidence 76322221111257888888888887777777665555545444444333 33433 33333333221111 1 Q ss_pred ----CCcceee-c----------CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc-c---CcCHHHHH Q lcl|NC_018285. 229 ----MQGGPLV-L----------DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG-D---QQSSLEMS 289 (383) Q Consensus 229 ----~~g~~~v-l----------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~-~---~~~~~e~~ 289 (383) +..+.+- + +.|.+++.+.....-.+ ++-..+..+.+..+++||.+-|.... . +..++=.. T Consensus 320 Gevrddrk~msMlEDyWLpRReGgrgTEItTLpggqnlge-m~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItR 398 (524) T protein:vir:98 320 GTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQNFSD-MDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITR 398 (524) T ss_pred ceeeccccccchhhhhcccccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCceeccCCCCccccccccchhH Confidence 1112111 1 23567776655433333 23335778899999999999996321 1 12221111 Q ss_pred -HHHHHHHHHHHHHHHHHHHHH----hhcc-----hhhcc-----chhhhccCHH--------HHHHHHHHHH-----hC Q lcl|NC_018285. 290 -SNVYSKAVARYLRPFLSELSQ----KLSC-----DVDAD-----IFPAVDPTGA--------NYISRINSMV-----KS 341 (383) Q Consensus 290 -~~~~~~~l~P~~~~i~~~l~~----~l~~-----~~e~~-----~~~~~~~~~~--------~~~~~~~~l~-----~~ 341 (383) +.=+...|.-+-..+...|.. .|.. .-+++ +...+..|.. -....++.+- -+ T Consensus 399 DEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvG 478 (524) T protein:vir:98 399 DELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVG 478 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccc Confidence 112233444444444444433 3321 11111 1111111111 1111222211 13 Q ss_pred CCcCHHHHHHHh-hcCCcCCcchhHH-----hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
342 GTLAQNQGLYIL-QQAEILPKELPKG-----ENPNRTILKGGETNGQD 383 (383) Q Consensus 342 g~~t~nE~r~~l-g~~~~~~~d~~~~-----~~~~~~~~~ggd~~~~d 383 (383) -.++.+=+++.+ .+ +..|+... +.......+--+.+.+| T Consensus 479 ky~s~dyi~k~ILr~---tDeei~~~~k~I~~E~k~~~~~~p~~e~~~ 523 (524) T protein:vir:98 479 KYVSHKYIMKEILRM---SDEDIDEQAKLIEEESKEERFKNPEAEEEN 523 (524) T ss_pred cccchHHHHHHHhcc---CHHHHHHHHHHHHHHHhCCCCcCCcccccc Confidence 456666666643 32 22232211 11111111122333334 No 233 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=77.95 E-value=0.12 Score=25.64 Aligned_cols=359 Identities=13% Similarity=0.073 Sum_probs=152.4 Q ss_pred CchhhhhhcCCc--ccccccccccchh-------------hcccccCCceechhh----hhccHHHHHHHHHHHHhhhhC Q lcl|NC_018285. 1 MPIFNLATESPP--NNQGGFFDITDPE-------------FLATLNGSEWVSAET----ALKNSDLFSIISQLSNDLATA 61 (383) Q Consensus 1 Mglf~~~~~~~~--~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~----a~~~~~v~~~i~~ia~~ia~~ 61 (383) |..-.+ .|. .....|. ..+.. +++-..+-..-..+. |.-.+.+...++.++..+-+- T Consensus 1 m~V~~~---hp~y~a~~~~W~-~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~k 76 (452) T protein:vir:94 1 MPIETK---HPEYLAYENDWI-DCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVLDQ 76 (452) T ss_pred CCCCCc---CHHHHHHHHHHH-HHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhhcC Confidence 553221 000 0000000 00000 111110000000111 222445566666666666555 Q ss_pred ceeeecchh-hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCC-ceeEEEEeccceeEEEE-cCCC------- Q lcl|NC_018285. 62 KLTTSRKQM-QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNG-RDMKWEYLRPSQVSFNR-LDNQ------- 131 (383) Q Consensus 62 p~~~~~~~~-~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g-~~~~l~~l~~~~v~~~~-~~~~------- 131 (383) |..+.-... ..++. =-...+-.+|.++++...+.+|-|++++.....| +|. +..+.|..|--.. 
+..+ T Consensus 77 ~p~~~~p~~l~~~~~-D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy-~~~~~~~~Ii~W~~~~~g~l~~v~l 154 (452) T protein:vir:94 77 PPVITHPDAMSKYFE-DQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPY-ISVYTTENILNWEEDEDGRLLMVVL 154 (452) T ss_pred CceecccHHHHHHHh-cccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceE-EEEechhhhcCccccccCCeeEEEE Confidence 554432211 12221 1356788999999999999999999999887665 442 2333332221110 0000 Q ss_pred ----------------ceeEEEEe-ecCcc---------cccc------eeeccc----ceEEec--cCCCCccccCcch Q lcl|NC_018285. 132 ----------------NGLYYNVT-FDDPR---------IPPK------QHVPQS----DILHFR--LLSVDGGLTSVSP 173 (383) Q Consensus 132 ----------------~~~~y~~~-~~~~~---------~~~~------~~~~~~----dvih~~--~~~~~~~~~G~s~ 173 (383) ....|++- ...+. .+.. ...... ..|=|- +....+...|.|| T Consensus 155 re~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v~~~~~~~~~~~~~pP 234 (452) T protein:vir:94 155 REFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFFCITPSGLSMTPAKPP 234 (452) T ss_pred EEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeEEEEEcCCCCCCCCCccc Confidence 01111110 00000 0000 000000 011111 1222344568888 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCC-C--ceeeecccCh Q lcl|NC_018285. 174 LMALGR-ELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDD-L--EDFTPLEIKS 249 (383) Q Consensus 174 ~~~~~~-~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~-g--~~~~~~~~~~ 249 (383) +..+.. .+........+. ..+...+.|-.+++.-.... ...-+++.++.+++ | ..|...+.+. 
T Consensus 235 Ll~LA~ln~~hy~~~sd~~-~~l~~~~~P~l~~~g~~~~~------------~i~iG~~~~~~lpe~~~~~~yie~~g~~ 301 (452) T protein:vir:94 235 MIDIVDINYSHYRTSADLE-HGRHFTGLPTPWITGAESQS------------TMHIGSTKAWVIPEVAAKVGFLEFTGQG 301 (452) T ss_pred hHHHHHHHHHHhcchhHHH-HHHHHcccceeEeecCcCCC------------ceEecccccccCCCCCCcceEEccCchh Confidence 776654 444444444443 34444566766665322111 11224556677774 6 4666666666 Q ss_pred hhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCc-CHHHHHH--HHHHHHHHHHHHHHHHHHHHhhc-------c--hhh Q lcl|NC_018285. 250 NVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ-SSLEMSS--NVYSKAVARYLRPFLSELSQKLS-------C--DVD 317 (383) Q Consensus 250 ~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~-~~~e~~~--~~~~~~l~P~~~~i~~~l~~~l~-------~--~~e 317 (383) ..+.. +..+...++. ...|- .++-..+... +.+.... +-.+..|.-++..+++.++..|- . ..+ T Consensus 302 i~~~~-~~l~~le~~m-~~~Ga--~ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w~g~~~~~~ 377 (452) T protein:vir:94 302 LQSLE-KALSEKQAQL-ASLSA--RLIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDMESMGGTLN 377 (452) T ss_pred HHHHH-HHHHHHHHHH-HHHHH--HhhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCceE Confidence 54322 2222222222 11111 2332211111 2221112 22346677778888888876442 1 122 Q ss_pred ccchhhh---ccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcc---hhHHhC-CCCCCCCCC-CCCCCC Q lcl|NC_018285. 318 ADIFPAV---DPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKE---LPKGEN-PNRTILKGG-ETNGQD 383 (383) Q Consensus 318 ~~~~~~~---~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d---~~~~~~-~~~~~~~gg-d~~~~d 383 (383) +.+...| ..+.. ....+-+++.+|.++....++.|...++++.| .+.... 
....+.+++ .-++.+ T Consensus 378 v~~n~dF~~~~~~~~-~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~~i~~E~~~~~~~~~~~~~~~~~ 450 (452) T protein:vir:94 378 IKLNSAFLDSKLTAA-ELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESMGVIPDPPAPEPSPSNTPPNPSS 450 (452) T ss_pred EEeccccccccCCHH-HHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHHHHHHHhhccCcccCCCCCCCcc Confidence 3333322 22333 33445568899999999999999777765422 122111 112232222 222222 No 234 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=77.13 E-value=0.13 Score=25.48 Aligned_cols=354 Identities=12% Similarity=0.078 Sum_probs=143.6 Q ss_pred CchhhhhhcCCccccccccccc---chhhcccccCCceechhhhhccHHHHHHHHHHHHhhhh-C-cee------eecch Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDIT---DPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLAT-A-KLT------TSRKQ 69 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~-~-p~~------~~~~~ 69 (383) =+.|+.++.+|..+...|..+. .|..+..-.....-...+ +=.++-..|++.+|+.+-+ + |-. +.+.. T Consensus 16 ~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~-~~dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~ 94 (535) T protein:vir:94 16 KAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTT-PWQAVGARGLNNLASKLMLALFPMQTWMKLTISEFE 94 (535) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCC-cccccHHHHHHHHHHHHHhhhcCCCCccccccChhh Confidence 4456666666544443343322 222221111110000111 1233444566666655543 2 221 11111 Q ss_pred hhhhccCCC-----------------c---cCCHHHHHHHHHHHHHHcCCeEEEEeecC-CCceeEEEEeccceeEEEEc Q lcl|NC_018285. 70 MQGIVDNPS-----------------N---SANRFNFYQSIFAQMLLGGEAFAYRWRND-NGRDMKWEYLRPSQVSFNRL 128 (383) Q Consensus 70 ~~~l~~~PN-----------------~---~~t~~~f~~~~~~~~~l~G~a~~~i~r~~-~g~~~~l~~l~~~~v~~~~~ 128 (383) ...+...|. . .-+.+.-+..+..+++.+|||.+++..+. .+.....|||. 
++-+..+ T Consensus 95 ~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~--~y~v~~d 172 (535) T protein:vir:94 95 AKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEPEGTYNPMKLYRLS--SYVVQRD 172 (535) T ss_pred hhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCcCcccceEEEEcC--eEEEeeC Confidence 111111111 0 11223334466788899999999886543 23334445543 3333333 Q ss_pred CCCce----------------------------------eEEEEeecCcccccc-----------------eeecccceE Q lcl|NC_018285. 129 DNQNG----------------------------------LYYNVTFDDPRIPPK-----------------QHVPQSDIL 157 (383) Q Consensus 129 ~~~~~----------------------------------~~y~~~~~~~~~~~~-----------------~~~~~~dvi 157 (383) ..|.. ..|....-...++.. ..|...-.+ T Consensus 173 ~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~g~~~~P~~ 252 (535) T protein:vir:94 173 AFGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYLDEESGEYLKYEEIDGVEVEGTDASYPVDACPYI 252 (535) T ss_pred CCCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEEEeeCCCCcEEEEEEecCeeeccccccCccccCCce Confidence 22211 011110000000000 011122334 Q ss_pred EeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceee-- Q lcl|NC_018285. 158 HFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLV-- 235 (383) Q Consensus 158 h~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~v-- 235 (383) ..|....++..||.||..-+...+...+.+.+.......-...|..++..++..+.... .....+.++ T Consensus 253 ~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~----------~~~~~g~~v~g 322 (535) T protein:vir:94 253 PVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRL----------TKAQTGDFVSG 322 (535) T ss_pred eeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhc----------ccCCCceeecC Confidence 44444455678999999999999999999988888777776777777665554433211 111112222 Q ss_pred cCCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhc-ccccCcCHHHHHH--HHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 
236 LDDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVG-GQGDQQSSLEMSS--NVYSKAVARYLRPFLSELSQKL 312 (383) Q Consensus 236 l~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg-~~~~~~~~~e~~~--~~~~~~l~P~~~~i~~~l~~~l 312 (383) ..+++...++...++-....+..+..+..|..+|-+.. +. ..+..-+.+|... .=....|-|....+.++|=.-| T Consensus 323 ~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pl 400 (535) T protein:vir:94 323 RPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNS--AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPM 400 (535) T ss_pred CcccceeeecccccchhHHHHHHHHHHHHHHHHHhHhh--hccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHH Confidence 22344455554433222234556777888988884332 22 1122223344332 2334566666666666654333 Q ss_pred cch-h----hccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhh-cCCcCCcchhHHhCCCCCCCCCCC--CCCCC Q lcl|NC_018285. 313 SCD-V----DADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQ-QAEILPKELPKGENPNRTILKGGE--TNGQD 383 (383) Q Consensus 313 ~~~-~----e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg-~~~~~~~d~~~~~~~~~~~~~ggd--~~~~d 383 (383) +.. + +.+..+.... ++++-.+.|.-+...... ...+. .-+-.+-... | +-.| .|..+ T Consensus 401 i~r~~~il~r~g~lP~~p~----------~~v~~~~vs~la~l~r~~~~~~l~-~~~~~laq~~--P-~~ld~~id~d~ 465 (535) T protein:vir:94 401 VRVLLKQLQATNQIPELPK----------EAVEPTISTGMEALGRGQDLDKLE-RCIAAWSALA--P-MQGDPDINIAT 465 (535) T ss_pred HHHHHHHHHhCCCCCCCCh----------hhccceEeehHHHHHHHHHHHHHH-HHHHHHHhhC--h-HHhhhcCCHHH Confidence 211 0 0011110000 011111111111111000 00000 0000011111 1 1112 12222 No 235 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=69.54 E-value=0.22 Score=24.17 Aligned_cols=378 Identities=13% Similarity=0.103 Sum_probs=161.4 Q ss_pred CchhhhhhcCC--------------ccccccccccc----ch--hhcccccCCcee------------chhhhhccHHHH Q lcl|NC_018285. 
1 MPIFNLATESP--------------PNNQGGFFDIT----DP--EFLATLNGSEWV------------SAETALKNSDLF 48 (383) Q Consensus 1 Mglf~~~~~~~--------------~~~~~~~~~~~----~~--~~~~~~~~~~~~------------~~~~a~~~~~v~ 48 (383) |++|.+...+. +....+....- .+ ...+..+....+ ..+..+.+|.|. T Consensus 8 ~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd 87 (521) T protein:vir:65 8 LARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVE 87 (521) T ss_pred hhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHHhhccchh Confidence 55555432211 00000000000 00 000100001100 235567899999 Q ss_pred HHHHHHHHhhhhC-----ceeeecchh-------hhhccCCC---ccCCHHHHHHHHHHHHHHcCCeEEEEeecCC--Cc Q lcl|NC_018285. 49 SIISQLSNDLATA-----KLTTSRKQM-------QGIVDNPS---NSANRFNFYQSIFAQMLLGGEAFAYRWRNDN--GR 111 (383) Q Consensus 49 ~~i~~ia~~ia~~-----p~~~~~~~~-------~~l~~~PN---~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~--g~ 111 (383) .||+-|.+.+.-+ |+.+.=.+. ..+...-+ ..++...--...+..|++.|..|+.++.+.+ .- T Consensus 88 ~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~pk~G 167 (521) T protein:vir:65 88 NAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIGKNPKDG 167 (521) T ss_pred hHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEcCCcccc Confidence 9999999887532 222211100 00111111 1112222224567788999999999986644 34 Q ss_pred eeEEEEeccceeEEEEcCCC-----------ceeEEEEeecC---------cccccceeecccceEEeccCCCC--cccc Q lcl|NC_018285. 112 DMKWEYLRPSQVSFNRLDNQ-----------NGLYYNVTFDD---------PRIPPKQHVPQSDILHFRLLSVD--GGLT 169 (383) Q Consensus 112 ~~~l~~l~~~~v~~~~~~~~-----------~~~~y~~~~~~---------~~~~~~~~~~~~dvih~~~~~~~--~~~~ 169 (383) +.+|.+|+|..+..++.... ...+|.|...+ ......+.++.+-|.+.. -..- +.-. 
T Consensus 168 I~ELr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~h-SGl~d~~~~~ 246 (521) T protein:vir:65 168 IVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAH-SGLMDCDDKY 246 (521) T ss_pred ceeeeeeCCcceeeeeeecccccCCcceecceeeeeeeecCCcceeccceeecCCcceeechhheeeee-ccceeCCCCe Confidence 89999999999877653211 11122221100 011122344444444332 2111 1112 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCCH-HHHHHHHHHHHHhh-----c-------CCcceee Q lcl|NC_018285. 170 SVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGLL-DFKTKVSRSRQAMK-----Q-------MQGGPLV 235 (383) Q Consensus 170 G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~~-e~~~~~~~~~~~~~-----~-------~~g~~~v 235 (383) =+|-|..+.+.+....-++....=+.-.-+.-+-+.-.+ |.+.+ .+.+.++....... + +..+.+- T Consensus 247 i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~ms 326 (521) T protein:vir:65 247 IIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLS 326 (521) T ss_pred eeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccc Confidence 357888888888887777777665555545444444333 33433 33333333221111 1 1112111 Q ss_pred -c----------CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc-----CcCHHHHH-HHHHHHHHH Q lcl|NC_018285. 236 -L----------DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD-----QQSSLEMS-SNVYSKAVA 298 (383) Q Consensus 236 -l----------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~-----~~~~~e~~-~~~~~~~l~ 298 (383) + +.|.+++.+..-..-.+ ++-..+..+.+..+++||.+-++..++ +..++=.. +.=+...|. 
T Consensus 327 MlEDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~ 405 (521) T protein:vir:65 327 MTEDYWLQRRDGKAITDVTTLPGASGMSD-IDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIR 405 (521) T ss_pred hhhhhcccccCCCCccceeecccCCCcCh-HHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHHH Confidence 1 23567776655332223 233357788999999999999864322 11221111 122333444 Q ss_pred HHHHHHHHHHHH----hhcc-----hhhcc-----chhhhccCHH--------HHHHHHHHHH-----hCCCcCHHHHHH Q lcl|NC_018285. 299 RYLRPFLSELSQ----KLSC-----DVDAD-----IFPAVDPTGA--------NYISRINSMV-----KSGTLAQNQGLY 351 (383) Q Consensus 299 P~~~~i~~~l~~----~l~~-----~~e~~-----~~~~~~~~~~--------~~~~~~~~l~-----~~g~~t~nE~r~ 351 (383) -+-..+...|.. .|.. .-+++ +...+..|.. -....++.+- -+-.++.+=+++ T Consensus 406 rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k 485 (521) T protein:vir:65 406 TLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMR 485 (521) T ss_pred HHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHH Confidence 444444444433 3321 11111 1111111111 1111222111 134556666666 Q ss_pred Hh-hcCCcCCcchhHH-----hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 352 IL-QQAEILPKELPKG-----ENPNRTILKGGETNGQD 383 (383) Q Consensus 352 ~l-g~~~~~~~d~~~~-----~~~~~~~~~ggd~~~~d 383 (383) .+ .+. ..|+... +.......+.-+.+.++ T Consensus 486 ~ILr~t---Deei~~~~k~I~~E~~~~~~~~p~~~~~~ 520 (521) T protein:vir:65 486 DILKYT---DDQMDTEKKQIEEEANDPRFKQTPDEIED 520 (521) T ss_pred HHhccC---HHHHHHHHHHHHHhhhCCCCCCCcccccC Confidence 43 322 2222211 11111111112233333 No 236 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=67.96 E-value=0.24 Score=23.94 Aligned_cols=376 Identities=12% Similarity=0.133 Sum_probs=158.8 Q ss_pred CchhhhhhcC-Ccccccccc----------cccchhhcccc---cCCce-----e-chhhhhccHHHHHHHHHHHHhhhh Q lcl|NC_018285. 
1 MPIFNLATES-PPNNQGGFF----------DITDPEFLATL---NGSEW-----V-SAETALKNSDLFSIISQLSNDLAT 60 (383) Q Consensus 1 Mglf~~~~~~-~~~~~~~~~----------~~~~~~~~~~~---~~~~~-----~-~~~~a~~~~~v~~~i~~ia~~ia~ 60 (383) -.||....++ +........ ......+.+.. .+... + +.+..+.+|.|..||+-|.+.+.- T Consensus 3 ~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv 82 (537) T protein:vir:10 3 QQLFGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECDSAVDDVVNETIC 82 (537) T ss_pred cccccceeecccccccCCcccCCCcccccceeecccccccccccccccchHHHHHHHHHHHhhccchhhHHHHhhcceeE Confidence 3455543222 111111111 00011111111 11100 1 345667899999999999988753 Q ss_pred C-----ceeeecch--------------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCC---CceeEEEEe Q lcl|NC_018285. 61 A-----KLTTSRKQ--------------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDN---GRDMKWEYL 118 (383) Q Consensus 61 ~-----p~~~~~~~--------------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~---g~~~~l~~l 118 (383) + |+.+.=++ .+.++.--+....+ ...+..|++.|..|+.++.|.. .-+.+|.+| T Consensus 83 ~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~----~e~fR~WYVDgRi~fhKiid~k~pk~GI~ELr~l 158 (537) T protein:vir:10 83 GNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRA----YEIFRRWYVDGRLFFHKVIDPKKPRQGLVELRYV 158 (537) T ss_pred ecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhh----hHHHhhheeeeEEEEEEEEeCCCccccceeeeee Confidence 2 22221111 01111112222223 4567788899999998877643 248899999 Q ss_pred ccceeEEEEcC---C--Cce-------e------EEEEeec--CcccccceeecccceEEecc---CCCCccccCcchHH Q lcl|NC_018285. 119 RPSQVSFNRLD---N--QNG-------L------YYNVTFD--DPRIPPKQHVPQSDILHFRL---LSVDGGLTSVSPLM 175 (383) Q Consensus 119 ~~~~v~~~~~~---~--~~~-------~------~y~~~~~--~~~~~~~~~~~~~dvih~~~---~~~~~~~~G~s~~~ 175 (383) +|..+..++.. + ... + +|-+... .......+.++. +.|++-+ .+.++ ...+|-|. 
T Consensus 159 DPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~-dAI~y~hSGl~d~n~-~~i~syLh 236 (537) T protein:vir:10 159 DPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAP-DSIAYCHSGIQDLNK-NMVLSHLH 236 (537) T ss_pred CCccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceeccH-hheeeecccceeCCC-Ceeeeeeh Confidence 99998766531 1 110 0 0111100 001122244555 3444332 12222 34688899 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCC-HHHHHHHHHHHHHh-----hcCC-c------ce-------- Q lcl|NC_018285. 176 ALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGL-LDFKTKVSRSRQAM-----KQMQ-G------GP-------- 233 (383) Q Consensus 176 ~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~-~e~~~~~~~~~~~~-----~~~~-g------~~-------- 233 (383) .+.+.+....-++....=+.-.-+.-+-+.-.+ |.+. ..+.+.++...... ++++ | +. T Consensus 237 kAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyW 316 (537) T protein:vir:10 237 KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFW 316 (537) T ss_pred hhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhhhc Confidence 999988888888877765555555444444333 3333 33333333322111 1111 1 11 Q ss_pred eec---CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCc--CHHHHH--HHHHHHHHHHHHHHHHH Q lcl|NC_018285. 234 LVL---DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ--SSLEMS--SNVYSKAVARYLRPFLS 306 (383) Q Consensus 234 ~vl---~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~--~~~e~~--~~~~~~~l~P~~~~i~~ 306 (383) +.= +.|.+++.+.....-.+ ++-..+..+.++.+++||.+-|+..+..+ ...|-. +.=+...|.-+-..+.. 
T Consensus 317 LPRReGgrgTEItTLpGgqnlge-m~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~ 395 (537) T protein:vir:10 317 LPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSE 395 (537) T ss_pred ccccCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHH Confidence 111 23567776655433223 23335778899999999999997432211 111222 12233344444444444 Q ss_pred HHHH----hhcc-----hhhcc-----chhhhccCHH--------HHHHHHHHHHh-----CCCcCHHHHHHHh-hc--- Q lcl|NC_018285. 307 ELSQ----KLSC-----DVDAD-----IFPAVDPTGA--------NYISRINSMVK-----SGTLAQNQGLYIL-QQ--- 355 (383) Q Consensus 307 ~l~~----~l~~-----~~e~~-----~~~~~~~~~~--------~~~~~~~~l~~-----~g~~t~nE~r~~l-g~--- 355 (383) .|.. .|.. .-+++ +...+..|.. -....++.+-+ +-.++.+=+++.+ .+ T Consensus 396 lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDe 475 (537) T protein:vir:10 396 LFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTES 475 (537) T ss_pred HHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHH Confidence 4433 3321 11211 1111111110 01111111111 2223333333321 11 Q ss_pred --------------CCc-CC-cchhHHh-C-CCCCCC-CCCCCCCCC Q lcl|NC_018285. 
356 --------------AEI-LP-KELPKGE-N-PNRTIL-KGGETNGQD 383 (383) Q Consensus 356 --------------~~~-~~-~d~~~~~-~-~~~~~~-~ggd~~~~d 383 (383) .|+ .+ .+.-.++ + ....++ +||+.-+.| T Consensus 476 eI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (537) T protein:vir:10 476 EIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTD 522 (537) T ss_pred HHHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccC Confidence 121 11 1111110 0 011111 233222222 No 237 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=67.01 E-value=0.26 Score=23.80 Aligned_cols=364 Identities=9% Similarity=-0.001 Sum_probs=139.5 Q ss_pred Cchh---hhhhc--------C-Cc--ccccccccccchhh------ccccc----CCceechhhhhccHHHHHHHHHHHH Q lcl|NC_018285. 1 MPIF---NLATE--------S-PP--NNQGGFFDITDPEF------LATLN----GSEWVSAETALKNSDLFSIISQLSN 56 (383) Q Consensus 1 Mglf---~~~~~--------~-~~--~~~~~~~~~~~~~~------~~~~~----~~~~~~~~~a~~~~~v~~~i~~ia~ 56 (383) |-+. ..+.+ + +. .....+.......+ .+... .....+.+.+.. -..--|+..+. T Consensus 8 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~n--f~k~Ivd~~~~ 85 (537) T protein:vir:78 8 KPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHG--FFTELVDQLAQ 85 (537) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccc--hHHHHHHHHhh Confidence 1110 00000 0 00 00000000000000 00000 000001111111 11123333333 Q ss_pred hhhhCceeeecc--hhhh----hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCC Q lcl|NC_018285. 57 DLATAKLTTSRK--QMQG----IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDN 130 (383) Q Consensus 57 ~ia~~p~~~~~~--~~~~----l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~ 130 (383) =+-+-|+++.-. .... |... +. .........+..++..+|.||.++-++.+|.+ .+..++|..+-++.++. 
T Consensus 86 yl~G~Pv~~~~~d~~~~e~~~~l~~~-~~-~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~-~~~~i~p~~~~pv~d~~ 162 (537) T protein:vir:78 86 YLLSNGVEVKVKDEDNTQLDEILQEY-FD-EDFQATIDTLVTNASKKGFEGIFARTTSEGKL-KFQTVDGLTLIPVFDDY 162 (537) T ss_pred hhcccCceeecCcchhHHHHHHHHHH-hh-ccHHHHHHHHHHHHhhcCeeEEEeeecCCCce-EEEEEccceeEEEEcCC Confidence 344456655321 1111 2211 11 22334456777889999999999999998865 46677787776665543 Q ss_pred Ccee---E-EEEeecCccc------ccceeecccceEEeccCC------------------------------------- Q lcl|NC_018285. 131 QNGL---Y-YNVTFDDPRI------PPKQHVPQSDILHFRLLS------------------------------------- 163 (383) Q Consensus 131 ~~~~---~-y~~~~~~~~~------~~~~~~~~~dvih~~~~~------------------------------------- 163 (383) +... + |......... .....+.++.|.+++... T Consensus 163 ~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 242 (537) T protein:vir:78 163 GVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDG 242 (537) T ss_pred CCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeecccccccccccccc Confidence 3211 1 1110000000 001122333333322100 Q ss_pred ---------------CCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCC-CCHHHHHHHHHHHHHhh Q lcl|NC_018285. 164 ---------------VDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGG-GLLDFKTKVSRSRQAMK 227 (383) Q Consensus 164 ---------------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~-~~~e~~~~~~~~~~~~~ 227 (383) +...-.|.|-+......++....+.-..++.+..-+.|-.+++--+. ..++....++. T Consensus 243 ~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~------ 316 (537) T protein:vir:78 243 YQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKA------ 316 (537) T ss_pred ccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhh------ Confidence 00112467777777777766666666666666555555555543222 12222221111 Q ss_pred cCCcceeec-CCC--ceeeecccChhhHHHHHHHHHHHHHHHHHhcCC---HHHhcccccCcCH----------HHHHHH Q lcl|NC_018285. 
228 QMQGGPLVL-DDL--EDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIP---ENVVGGQGDQQSS----------LEMSSN 291 (383) Q Consensus 228 ~~~g~~~vl-~~g--~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVp---p~~lg~~~~~~~~----------~e~~~~ 291 (383) .+++.+ +.+ ++|..... .+.......+...+.|...-.+| ....| ..++-.. ....+. T Consensus 317 ---~~~i~v~~d~~~v~~l~~~~--~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~g-n~SGvAlk~~~~~l~~ka~~ke~ 390 (537) T protein:vir:78 317 ---KKMIGVNGDNAGMEIQTVSI--PYEARKAKMDIDVENIYRSGMGFNSTAVGDG-NVTNVVIKSRYTLLAMKARKMET 390 (537) T ss_pred ---cCceeecCCCCceeEEEecC--CHHHHHHHHHHHHHHHHHhcCCCCCcccccc-CCcHHHHHHHHhhHHHHHHHHHH Confidence 123323 233 55544333 33333344455555555443333 22211 1111000 111122 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcchh-----hccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhH- Q lcl|NC_018285. 292 VYSKAVARYLRPFLSELSQKLSCDV-----DADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPK- 365 (383) Q Consensus 292 ~~~~~l~P~~~~i~~~l~~~l~~~~-----e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~- 365 (383) .+...|+-.++.|...++.+-...+ ++.....+-.+..+.+..+.++..+|+++...+.+.++. ++..+.-+ T Consensus 391 ~f~~~l~~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~--vdd~e~ek~ 468 (537) T protein:vir:78 391 SLRKVLRWCADMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPR--IGDDETLKL 468 (537) T ss_pred HHHHHHHHHHHHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCC--CCCHHHHHH Confidence 3344444444444444332211111 122222333455677788888889999999888876532 11111000 Q ss_pred --------------------HhCC----CCCCCCCCCCCCCC Q lcl|NC_018285. 366 --------------------GENP----NRTILKGGETNGQD 383 (383) Q Consensus 366 --------------------~~~~----~~~~~~ggd~~~~d 383 (383) .+.. 
...+..+|+++.++ T Consensus 469 ~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (537) T protein:vir:78 469 IAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNAN 510 (537) T ss_pred HHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCC Confidence 0000 00011111111111 No 238 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=66.89 E-value=0.26 Score=23.79 Aligned_cols=376 Identities=11% Similarity=0.105 Sum_probs=171.0 Q ss_pred CchhhhhhcCCc-----ccccccc--cccchh-------hcccccC---Cce------------e-chhhhhccHHHHHH Q lcl|NC_018285. 1 MPIFNLATESPP-----NNQGGFF--DITDPE-------FLATLNG---SEW------------V-SAETALKNSDLFSI 50 (383) Q Consensus 1 Mglf~~~~~~~~-----~~~~~~~--~~~~~~-------~~~~~~~---~~~------------~-~~~~a~~~~~v~~~ 50 (383) |++|.+--.+.. .....+. +..+++ ......+ +.+ + ..+..+.+|.|..| T Consensus 1 ~~~w~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~A 80 (511) T protein:vir:56 1 MKFWTKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVKELIKSYRALAEYHEVDDA 80 (511) T ss_pred CCCccchhhhhhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchHHHHHHHHHHhhccchhhH Confidence 777776322210 0000000 000110 0000001 000 1 34556789999999 Q ss_pred HHHHHHhhhhC-----ceeeecchhh-------hhccCCC---ccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEE Q lcl|NC_018285. 
51 ISQLSNDLATA-----KLTTSRKQMQ-------GIVDNPS---NSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKW 115 (383) Q Consensus 51 i~~ia~~ia~~-----p~~~~~~~~~-------~l~~~PN---~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l 115 (383) |+-|.+.+.-+ |+.+.=.+.+ .+...-+ ..++...--...+..|++.|..|..++.+.+.-+.+| T Consensus 81 v~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~k~GI~eL 160 (511) T protein:vir:56 81 IQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDKDNNIIEL 160 (511) T ss_pred HHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeccccceeeh Confidence 99999887532 2222111100 0111111 1122222224566788899999999988876678999 Q ss_pred EEeccceeEEEEcC-----C------CceeEEEEeecCcc----------cccceeecccceEEeccCCCC---ccccCc Q lcl|NC_018285. 116 EYLRPSQVSFNRLD-----N------QNGLYYNVTFDDPR----------IPPKQHVPQSDILHFRLLSVD---GGLTSV 171 (383) Q Consensus 116 ~~l~~~~v~~~~~~-----~------~~~~~y~~~~~~~~----------~~~~~~~~~~dvih~~~~~~~---~~~~G~ 171 (383) .+|+|..+..++.. + +...+|.|.-.+.. ......++.+.|.|...--.+ +..+.+ T Consensus 161 r~lDPr~i~~vr~i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~daI~y~hSGL~d~~~~~g~i~ 240 (511) T protein:vir:56 161 RPLNPMKMELVREIQKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKDAIVFAHSGLMRGCADDPYII 240 (511) T ss_pred hhcCcccchhhhhhhcccccccccccceeeeeEecCCCcccCcccccccccccceeechhheeeecccceeccCCCCeee Confidence 99999988765421 1 11222333211110 013366888888766533222 223478 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCC-HHHHHHHHHHHHH----h-hcCC-c------ceee-c Q lcl|NC_018285. 172 SPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGL-LDFKTKVSRSRQA----M-KQMQ-G------GPLV-L 236 (383) Q Consensus 172 s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~-~e~~~~~~~~~~~----~-~~~~-g------~~~v-l 236 (383) |-|..+.+.+....-++....=+.-.-+.-+-+.-.+ |.+. ..+.+.++..... . 
++++ | +.+- + T Consensus 241 syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMl 320 (511) T protein:vir:56 241 GYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTNAMSML 320 (511) T ss_pred ccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhH Confidence 9999999999888888887766555555444444333 3333 3333333332211 1 1111 1 1111 1 Q ss_pred ----------CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc---cC--cCHHHHH--HHHHHHHHHH Q lcl|NC_018285. 237 ----------DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQG---DQ--QSSLEMS--SNVYSKAVAR 299 (383) Q Consensus 237 ----------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~---~~--~~~~e~~--~~~~~~~l~P 299 (383) +.|.+++.+.....-.+ ++-..+....+..+++||.+-|+... .. ....|-. +.=+...|.- T Consensus 321 EDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~KFI~R 399 (511) T protein:vir:56 321 EDYYLPRREGSKGTEVSTLPGGQSLGD-IEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFTKFVKR 399 (511) T ss_pred hhhcccccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHHHHHHH Confidence 23567776655433233 23335778899999999999997321 10 1112222 1223334444 Q ss_pred HHHHHHHHHHH----hhc-----chhhcc-----chhhhccCHH--------HHHHHHHHHH-----hCCCcCHHHHHHH Q lcl|NC_018285. 300 YLRPFLSELSQ----KLS-----CDVDAD-----IFPAVDPTGA--------NYISRINSMV-----KSGTLAQNQGLYI 352 (383) Q Consensus 300 ~~~~i~~~l~~----~l~-----~~~e~~-----~~~~~~~~~~--------~~~~~~~~l~-----~~g~~t~nE~r~~ 352 (383) +-..+...|.. .|. +.-|++ +...+..|.. -....++.+- -+-.++.+=+++. T Consensus 400 LR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ 479 (511) T protein:vir:56 400 LQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKN 479 (511) T ss_pred HHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHH Confidence 44444444433 332 111221 1111111111 1111222211 1345566666664 Q ss_pred h-hcCCcCCcchhHHh-----CCCCCCCCCCCCCCCC Q lcl|NC_018285. 
353 L-QQAEILPKELPKGE-----NPNRTILKGGETNGQD 383 (383) Q Consensus 353 l-g~~~~~~~d~~~~~-----~~~~~~~~ggd~~~~d 383 (383) + .+. ..|+...+ ....+.. ...++| T Consensus 480 ILr~t---Deei~~~~k~I~~E~k~~~~---~~~e~~ 510 (511) T protein:vir:56 480 ILRLS---DDQITAMQSEIDEEETNPRF---QQDDQG 510 (511) T ss_pred HhccC---HHHHHHHHHHHHHhhcCCCC---CCcccC Confidence 3 322 22322111 1111111 111222 No 239 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=65.58 E-value=0.28 Score=23.61 Aligned_cols=352 Identities=14% Similarity=0.088 Sum_probs=139.9 Q ss_pred CchhhhhhcCCccccccccccc---chhhcccccCCce-echhhhhccHHHHHHHHHHHHhhhhC------ce---eeec Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDIT---DPEFLATLNGSEW-VSAETALKNSDLFSIISQLSNDLATA------KL---TTSR 67 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~-~~~~~a~~~~~v~~~i~~ia~~ia~~------p~---~~~~ 67 (383) =+.|+.++..|..+..-|..+. .|..+.. .+... -...+.+ .++-..|++.+|+.+-+. || .+.+ T Consensus 15 ~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~-~~~~~~~~~~~~~-dst~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d 92 (532) T protein:vir:99 15 AAAYNRLKNDRGAYETRAEDCATYTIPSVFPS-ATADGSTSYTTPW-QSIGARGLNNLASKLMLALFPVGSSFFKLNVSE 92 (532) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhhhcccCC-CCCcchhhccccc-cchHHHHHHHHHHHHHHhhcCCCCccccccCCH Confidence 3455666665544443333322 2222211 11100 0011122 344555677676665542 22 1111 Q ss_pred chh-------------hhhccC----CCc---cCCHHHHHHHHHHHHHHcCCeEEEEeecC--CCc--eeEEEEecccee Q lcl|NC_018285. 68 KQM-------------QGIVDN----PSN---SANRFNFYQSIFAQMLLGGEAFAYRWRND--NGR--DMKWEYLRPSQV 123 (383) Q Consensus 68 ~~~-------------~~l~~~----PN~---~~t~~~f~~~~~~~~~l~G~a~~~i~r~~--~g~--~~~l~~l~~~~v 123 (383) ... ..++.. -.. .-+.+.-+..+..+++.+|||.+++..+. 
.++ ....||+ .++ T Consensus 93 ~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~f~~~pl--~~y 170 (532) T protein:vir:99 93 LEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKL--HNF 170 (532) T ss_pred HHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccccccCcccceEEEEc--CeE Confidence 111 111100 000 11233344566788899999988875432 122 2333444 223 Q ss_pred EEEEcCCCce-----------------------------------eEEEEeecCcccccc-e--e--------------e Q lcl|NC_018285. 124 SFNRLDNQNG-----------------------------------LYYNVTFDDPRIPPK-Q--H--------------V 151 (383) Q Consensus 124 ~~~~~~~~~~-----------------------------------~~y~~~~~~~~~~~~-~--~--------------~ 151 (383) -+..+..|.. ..|....-...+... . . + T Consensus 171 ~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 250 (532) T protein:vir:99 171 VVERDAYDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPL 250 (532) T ss_pred EEeeCCCCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEEEecCCCCeeEEEEeecCceeccccccccc Confidence 3333322211 011000000000000 0 0 1 Q ss_pred cccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCc Q lcl|NC_018285. 152 PQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQG 231 (383) Q Consensus 152 ~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g 231 (383) ...-.+..|....++..||.||...+...+...+.+.+.......-...|..++..++........ .... T Consensus 251 ~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~----------~~~~ 320 (532) T protein:vir:99 251 DSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVA----------KANT 320 (532) T ss_pred ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhhc----------cCCC Confidence 111223333333456689999999999999999999998888877778888777766554443211 1111 Q ss_pred ceee--cCCCceeeecccChhhHH-HHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--HHHHHHHHHHHHHHHH Q lcl|NC_018285. 
232 GPLV--LDDLEDFTPLEIKSNVAQ-LLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--NVYSKAVARYLRPFLS 306 (383) Q Consensus 232 ~~~v--l~~g~~~~~~~~~~~d~~-~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~~~~~~l~P~~~~i~~ 306 (383) +.++ ..+++...++... .+.+ ..+..+..+..|-.+|-+.. +....+..-+.+|... .=....|-|....+.+ T Consensus 321 g~~v~g~~~~i~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~-~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~ 398 (532) T protein:vir:99 321 GDFVAGRKQDVEVFQLEKY-NDFQVAKATADDIEKRLSYAFMLNS-AVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQ 398 (532) T ss_pred cceecCCcccceeeecccc-cchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHH Confidence 1122 1223333333332 2333 24555677788988885542 1111122223344332 2334556666666666 Q ss_pred HHHHhhcch-h----hccchhhhccCHH--HHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCC Q lcl|NC_018285. 307 ELSQKLSCD-V----DADIFPAVDPTGA--NYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGET 379 (383) Q Consensus 307 ~l~~~l~~~-~----e~~~~~~~~~~~~--~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~ 379 (383) +|=.-|+.. + +.+..+..-.+.. .+...+.. +.+++..+.+- .+. ..+ .+-.+.. -|. T Consensus 399 E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~iv~~is~------Laraq~~~~l~--~~~-~~l--aq~~p~~----~d~ 463 (532) T protein:vir:99 399 ELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEA------LGRGHDLNKLN--VFI-DYM--IKLAGLQ----DDD 463 (532) T ss_pred HHHHHHHHHHHHHHHhcCCCCCCChhhcccceeecchH------HHHHHHHHHHH--HHH-HHH--Hhhcchh----hhh Confidence 654333211 0 0001000000000 00000111 11111111110 000 001 1111110 111 Q ss_pred -CCCC Q lcl|NC_018285. 
380 -NGQD 383 (383) Q Consensus 380 -~~~d 383 (383) |..+ T Consensus 464 id~d~ 468 (532) T protein:vir:99 464 INLLD 468 (532) T ss_pred CCHHH Confidence 1111 No 240 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=65.13 E-value=0.29 Score=23.54 Aligned_cols=329 Identities=13% Similarity=0.083 Sum_probs=123.6 Q ss_pred CchhhhhhcCCccc---cccc---ccccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecchhhhhc Q lcl|NC_018285. 1 MPIFNLATESPPNN---QGGF---FDITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQMQGIV 74 (383) Q Consensus 1 Mglf~~~~~~~~~~---~~~~---~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~l~ 74 (383) -..++.+.- |... ...+ ..++...+......+.+ .......++...+..-...+....+.... T Consensus 202 v~p~~~~~d-p~a~~~~d~~~v~~~~~t~~~l~~l~~~g~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 270 (651) T protein:vir:80 202 LDMFDCFYD-PNVTDPNRGAFIRKLTKTKADILNLLSEGYY-------YGVDPLDVVEHKCKDTSDTKQDMLSTFQG--- 270 (651) T ss_pred ecHHHeeec-CCCcCccccceeeeeeeeHHHHHHHHhcccc-------cchhhHHHHhhhccccccCCccccccccC--- Confidence 222332221 1110 0000 01111111111111111 12233344444433322222211110000 Q ss_pred cCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCCceeEEEEeecCcccccceeeccc Q lcl|NC_018285. 75 DNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNVTFDDPRIPPKQHVPQS 154 (383) Q Consensus 75 ~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~ 154 (383) .+ .+.....+. +..=++|.. .+..|...+.+++ ...+..+. ++. .++ ..... 
T Consensus 271 -~d---~~~~~~~~~-----v~v~E~~~~--~d~e~~~~~~~~v---------~~~g~~il-~~~-~~~------~~~~~ 322 (651) T protein:vir:80 271 -VT---TSLWSPHQN-----VELLEYWGD--IHLENKTYHDVVV---------TIMGNEVL-RFE-QNP------YWCGR 322 (651) T ss_pred -CC---ccccccccc-----eEEEEEEEE--eeccCCceEEEEE---------EEcCcEEe-ccc-ccC------CCCCC Confidence 00 000000000 000012222 2222322221111 00111110 000 000 00112 Q ss_pred ceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCccee Q lcl|NC_018285. 155 DILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPL 234 (383) Q Consensus 155 dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~ 234 (383) ..+|++....++..||.|++..+.......+.+.+.......-.+.|.+++..++..+++.. ....|+++ T Consensus 323 Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l----------~~~pg~vi 392 (651) T protein:vir:80 323 PFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDV----------YTEPGKVF 392 (651) T ss_pred CeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHh----------hcCCCceE Confidence 45677766667788999999999999999999988888888888888888876655554431 12357777 Q ss_pred ecCCCceeeecccChhhHH-HHHHHHHHHHHHHHHhcCCHHHhcccccC---cCHHHH--HHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 235 VLDDLEDFTPLEIKSNVAQ-LLKQADWTTGQFAKVYGIPENVVGGQGDQ---QSSLEM--SSNVYSKAVARYLRPFLSEL 308 (383) Q Consensus 235 vl~~g~~~~~~~~~~~d~~-~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~---~~~~e~--~~~~~~~~l~P~~~~i~~~l 308 (383) +.+.+..+.++...+.+.+ .....+.....+-.++||+...-|..... .+..+. 
...-....+.++++.+.+++ T Consensus 393 ~~~~~~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~ 472 (651) T protein:vir:80 393 LVSDHGDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETS 472 (651) T ss_pred EecCCCCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7776666666654332322 23445666778888999999887743221 122221 11223344555555555443 Q ss_pred HHhhcch--------------hh----------c------cchhhh---ccC------HHHHHHHHHHHHhCCCcC---- Q lcl|NC_018285. 309 SQKLSCD--------------VD----------A------DIFPAV---DPT------GANYISRINSMVKSGTLA---- 345 (383) Q Consensus 309 ~~~l~~~--------------~e----------~------~~~~~~---~~~------~~~~~~~~~~l~~~g~~t---- 345 (383) -.-|+.. ++ + |+...+ ... .......+..+++..... T Consensus 473 l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~ 552 (651) T protein:vir:80 473 LLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQVPEMG 552 (651) T ss_pred HHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHHhhccCCccc Confidence 2222110 00 0 110000 000 011112222223211110 Q ss_pred ----H-HHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 346 ----Q-NQGLYILQQAEILPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 346 ----~-nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) . +-+++++...++...+. .+....... ....+. 
T Consensus 553 ~~~~~~~~~~~l~~~~g~~~~~~-~l~~~~q~~----~~~~~~ 590 (651) T protein:vir:80 553 QLVDYKRILVDLLQHWGFEEPEA-YLKQQDQQA----PANPQE 590 (651) T ss_pred hhhhHHHHHHHHHHHcCCCCcHH-hcCCCccch----hhhhhH Confidence 0 11222232333322111 110000000 011111 No 241 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=64.32 E-value=0.3 Score=23.43 Aligned_cols=378 Identities=12% Similarity=0.085 Sum_probs=160.8 Q ss_pred CchhhhhhcCCcc-----cccccccc--cchh----------------hcccccCCce--------e-chhhhhccHHHH Q lcl|NC_018285. 1 MPIFNLATESPPN-----NQGGFFDI--TDPE----------------FLATLNGSEW--------V-SAETALKNSDLF 48 (383) Q Consensus 1 Mglf~~~~~~~~~-----~~~~~~~~--~~~~----------------~~~~~~~~~~--------~-~~~~a~~~~~v~ 48 (383) |+.|..+..++.. ....+..+ .+++ +.+...+... + ..+..+.+|.|. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd 87 (521) T protein:vir:81 8 LARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVE 87 (521) T ss_pred hHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHHhhccchh Confidence 6555543221100 00000000 0000 0000000000 0 245567899999 Q ss_pred HHHHHHHHhhhhC-----ceeeecchh-------hhhccCCC---ccCCHHHHHHHHHHHHHHcCCeEEEEeecCC--Cc Q lcl|NC_018285. 49 SIISQLSNDLATA-----KLTTSRKQM-------QGIVDNPS---NSANRFNFYQSIFAQMLLGGEAFAYRWRNDN--GR 111 (383) Q Consensus 49 ~~i~~ia~~ia~~-----p~~~~~~~~-------~~l~~~PN---~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~--g~ 111 (383) .||+-|.+.+.-+ |+.+.=.+. 
..+...-+ ..++...--...+..|++.|..|+.++.+.+ .- T Consensus 88 ~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~pk~G 167 (521) T protein:vir:81 88 NAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIGKNPKDG 167 (521) T ss_pred hHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEEcCCcccc Confidence 9999999887532 222211100 00111111 1112222224567788999999999986644 34 Q ss_pred eeEEEEeccceeEEEEcCC-----C------ceeEEEEeec---------CcccccceeecccceEEeccCCCC--cccc Q lcl|NC_018285. 112 DMKWEYLRPSQVSFNRLDN-----Q------NGLYYNVTFD---------DPRIPPKQHVPQSDILHFRLLSVD--GGLT 169 (383) Q Consensus 112 ~~~l~~l~~~~v~~~~~~~-----~------~~~~y~~~~~---------~~~~~~~~~~~~~dvih~~~~~~~--~~~~ 169 (383) +.+|.+|+|..+..++... + ...+|.|... .......+.++.+-|.+.. -..- +.-. T Consensus 168 I~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~h-SGl~d~~~~~ 246 (521) T protein:vir:81 168 IVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAH-SGLMDCDDKY 246 (521) T ss_pred ceeeeeeCCcceeeeeeecccccCccceecceeeeeeeecCCccccccceeecCCcceeechhheeeee-ccceeCCCCe Confidence 8999999999987765321 1 1111222111 0011122344444444332 2111 1112 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCCH-HHHHHHHHHHHHhh-----c-------CCcceee Q lcl|NC_018285. 170 SVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGLL-DFKTKVSRSRQAMK-----Q-------MQGGPLV 235 (383) Q Consensus 170 G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~~-e~~~~~~~~~~~~~-----~-------~~g~~~v 235 (383) =+|-|..+.+.+....-++....=+.-.-+.-+-+.-.+ |.+.+ .+.+.++....... 
+ +..+.+- T Consensus 247 i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~ms 326 (521) T protein:vir:81 247 IIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLS 326 (521) T ss_pred eeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccc Confidence 357888888888887777777665555545444444333 33433 33333333221111 1 1112111 Q ss_pred -c----------CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc-----CcCHHHHH-HHHHHHHHH Q lcl|NC_018285. 236 -L----------DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD-----QQSSLEMS-SNVYSKAVA 298 (383) Q Consensus 236 -l----------~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~-----~~~~~e~~-~~~~~~~l~ 298 (383) + +.|.+++.+..-..-.+ ++-..+..+.+..+++||.+-|+..++ +..++=.. +.=+...|. T Consensus 327 MlEDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~ 405 (521) T protein:vir:81 327 MTEDYWLQRRDGKAITDVTTLPGASGMSD-IDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIR 405 (521) T ss_pred hhhhhcccccCCCcccceeecccCCCCCh-HHHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHH Confidence 1 23567776655332223 233357788999999999999953222 11222111 122333444 Q ss_pred HHHHHHHHHHHH----hhcc-----hhhcc-----chhhhccCHH--------HHHHHHHHHH-----hCCCcCHHHHHH Q lcl|NC_018285. 299 RYLRPFLSELSQ----KLSC-----DVDAD-----IFPAVDPTGA--------NYISRINSMV-----KSGTLAQNQGLY 351 (383) Q Consensus 299 P~~~~i~~~l~~----~l~~-----~~e~~-----~~~~~~~~~~--------~~~~~~~~l~-----~~g~~t~nE~r~ 351 (383) -+-..+...|.. .|.. .-+++ +...+..|.. -....++.+- -+-.++.+=+++ T Consensus 406 rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k 485 (521) T protein:vir:81 406 TRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMR 485 (521) T ss_pred HHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHH Confidence 444444444433 3321 11111 1111111111 1111222111 134456666665 Q ss_pred Hh-hcCCcCCcchhHH-----hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
352 IL-QQAEILPKELPKG-----ENPNRTILKGGETNGQD 383 (383) Q Consensus 352 ~l-g~~~~~~~d~~~~-----~~~~~~~~~ggd~~~~d 383 (383) .+ .+. ..|+... +.......+--+.+.+| T Consensus 486 ~ILr~t---Deei~~~~k~I~~E~~~~~~~~p~~~~~~ 520 (521) T protein:vir:81 486 DILKYT---DDQMDTEKKQIEEEANDPRFKQTPDEIED 520 (521) T ss_pred HHhccC---HHHHHHHHHHHHHHhhCCCCCCCcccccC Confidence 43 322 2222211 11111111111223333 No 242 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=60.68 E-value=0.37 Score=22.96 Aligned_cols=352 Identities=10% Similarity=0.049 Sum_probs=140.7 Q ss_pred CchhhhhhcCCccccccccccc---chhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhC------ce---eeecc Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDIT---DPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA------KL---TTSRK 68 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~------p~---~~~~~ 68 (383) =+.|+.++.+|..+...|..+. .|..+.....+ . ...+.+ .++--.|++.+|+.+-+. || .+.+. T Consensus 13 ~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~-~-~~~~~~-dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~~~ 89 (517) T protein:vir:10 13 PKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDD-L-SSQNAW-QDDGASATNFLSNKLSQVLFPAQRSFFRIDLTPE 89 (517) T ss_pred HHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCC-c-cccccc-cchHHHHHHHHHHHHHHhhcCCCCccccccCCHH Confidence 2335666666654444443332 23222111111 1 112223 334445666666655431 22 11111 Q ss_pred h-------------hhhhcc----CCCc---cCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEc Q lcl|NC_018285. 69 Q-------------MQGIVD----NPSN---SANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRL 128 (383) Q Consensus 69 ~-------------~~~l~~----~PN~---~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~ 128 (383) . ...++. .... .-+.+.-+..+..+++.+|||.+++ +..+.+...|||.. 
+-+..+ T Consensus 90 ~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~--~~~~~~~~~~pl~~--y~v~~d 165 (517) T protein:vir:10 90 GIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYH--PDKTSPIQAVPLHH--YCVRRD 165 (517) T ss_pred HHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEE--eCCCCcEEEEEcCe--EEEeeC Confidence 0 011100 0011 1133444556778899999998775 33344566677632 323333 Q ss_pred CCCce--eEEEEe-----------------------------------ec--Cccccccee-------------ecccce Q lcl|NC_018285. 129 DNQNG--LYYNVT-----------------------------------FD--DPRIPPKQH-------------VPQSDI 156 (383) Q Consensus 129 ~~~~~--~~y~~~-----------------------------------~~--~~~~~~~~~-------------~~~~dv 156 (383) ..|.. ++++.. .. ++....... +...-. T Consensus 166 ~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~d~~~~~~~s~y~~~e~P~ 245 (517) T protein:vir:10 166 NNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDGKYLIRQSADDVPVGKESTVTEDKSPF 245 (517) T ss_pred CCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCCCceEEEEEeCceeeccccccccccCCe Confidence 22211 111100 00 000000000 111223 Q ss_pred EEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeec Q lcl|NC_018285. 157 LHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVL 236 (383) Q Consensus 157 ih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl 236 (383) +..|....++..||.||..-+...+...+.+.+.......-...|..++..++........ .+..+.++- T Consensus 246 ~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~----------~~~~g~~~~ 315 (517) T protein:vir:10 246 LILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFV----------EGGSGAVLH 315 (517) T ss_pred eeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhcc----------CCCcccccc Confidence 3333333456789999999999999999999999888888878888777665544432110 111111111 Q ss_pred C--CCceeeecccChhhHH-HHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--HHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 
237 D--DLEDFTPLEIKSNVAQ-LLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--NVYSKAVARYLRPFLSELSQK 311 (383) Q Consensus 237 ~--~g~~~~~~~~~~~d~~-~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~~~~~~l~P~~~~i~~~l~~~ 311 (383) + +++...++.. ..|.+ ..+..+.....|-.+|-+..-..-. +..-+.+|... .-....|-|.+..+.++|=.- T Consensus 316 g~~~~v~~~~~~~-~~d~~~~~~~i~~~~~rI~~af~~~~l~~~~-~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~P 393 (517) T protein:vir:10 316 GVEGDIHIVQLGK-YADYTPIQAVLNDYRQRIGRVFMMEAMTRRD-AERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGP 393 (517) T ss_pred CCcccceeeeccc-ccchhHHHHHHHHHHHHHHHHHhhhhhhccC-CccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHH Confidence 1 2223333222 22333 3455567788899999766422211 11223333332 223345556555555553221 Q ss_pred hcc-------------hhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhc----CC-----cCCcch-hH-Hh Q lcl|NC_018285. 312 LSC-------------DVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQ----AE-----ILPKEL-PK-GE 367 (383) Q Consensus 312 l~~-------------~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~----~~-----~~~~d~-~~-~~ 367 (383) |+. .++.+....+ ....+...+..+. ..-..++. ++ +..+++ +. .. T Consensus 394 li~r~~~~l~~~l~~~~v~~~~~s~l--a~l~r~~~~~~i~--------~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~ 463 (517) T protein:vir:10 394 LARWFMNGISSILTSKNVSPTILTGI--EALGRMAELDKLG--------TFNGYVSMTAQWPEPLQQAIKWPDFTDWVQG 463 (517) T ss_pred HHHHHHHHhhhhcCCCCccceeeccH--HHHHHHHHHHHHH--------HHHHHHHHhhcCChHHHhcCCHHHHHHHHHH Confidence 111 1111110000 0011111111111 11111110 11 011110 00 11 Q ss_pred CCCCCCCCCCCCCCCC Q lcl|NC_018285. 368 NPNRTILKGGETNGQD 383 (383) Q Consensus 368 ~~~~~~~~ggd~~~~d 383 (383) .+. .|. 
+==.+++| T Consensus 464 ~~G-vp~-~~irs~~e 477 (517) T protein:vir:10 464 QIS-ANF-PFFKTQDE 477 (517) T ss_pred HhC-CCh-hhcCCHHH Confidence 111 010 00001111 No 243 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=55.75 E-value=0.47 Score=22.37 Aligned_cols=345 Identities=9% Similarity=0.048 Sum_probs=135.7 Q ss_pred Cc-----hhhhhhcCCccccccccc---ccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhC------ce--- Q lcl|NC_018285. 1 MP-----IFNLATESPPNNQGGFFD---ITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA------KL--- 63 (383) Q Consensus 1 Mg-----lf~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~------p~--- 63 (383) |+ .|.++++.++ ...|.. ++.|..+..-.....-...+.+ .++-..|++.+|+.+-+. || T Consensus 1 mk~~~~~~~~~lkr~~~--e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~-dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) T protein:vir:78 1 MKSTAAMLWEKLRDGSV--EQRAIEFAKTTLPYLMVDPMSGSRGVVEHDF-QSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) T ss_pred ChhHHHHHHHHHhccch--HHHHHHHHHhhccccccCCCCcccccccCcc-cchHHHHHHHHHHHHHHhhcCCCCccccc Confidence 43 4454543332 222222 2233222211111110111223 334455677676665541 22 Q ss_pred eeecchh-------------hhhcc----CCCccC---CHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEecccee Q lcl|NC_018285. 64 TTSRKQM-------------QGIVD----NPSNSA---NRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQV 123 (383) Q Consensus 64 ~~~~~~~-------------~~l~~----~PN~~~---t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v 123 (383) .+.+... +.++. ...... +.+.-+..++.+++.+|+|.+++.. ++.+...||+.. 
+ T Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~--~~~~~~~~pl~~--y 153 (510) T protein:vir:78 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS--DEATVVAWSLRS--Y 153 (510) T ss_pred CCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEeC--CCCeEEEEEcce--e Confidence 1111110 10000 011111 2233345666788899999887653 344566677643 2 Q ss_pred EEEEcCCCce--eEEEE-------------------------------------------------eecCccccccee-- Q lcl|NC_018285. 124 SFNRLDNQNG--LYYNV-------------------------------------------------TFDDPRIPPKQH-- 150 (383) Q Consensus 124 ~~~~~~~~~~--~~y~~-------------------------------------------------~~~~~~~~~~~~-- 150 (383) -+..+..|.. ++.++ ..++...+.... T Consensus 154 ~v~~d~~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~i~~~~~~~ 233 (510) T protein:vir:78 154 AVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWP 233 (510) T ss_pred EEeeCCCcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeecCCCCcEEEEEEEecCeeeccccccc Confidence 2223322211 00000 001100000000 Q ss_pred ecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCC Q lcl|NC_018285. 151 VPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQ 230 (383) Q Consensus 151 ~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~ 230 (383) +...-.+..|....++..||.||...+...+...+.+.+.......-...|..++..++........ ... T Consensus 234 ~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~l~----------~~~ 303 (510) T protein:vir:78 234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ----------DAE 303 (510) T ss_pred cccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhhhc----------cCC Confidence 1111223333333456789999999999999999999998888777777777777665543332111 111 Q ss_pred cceeecCCCceeeeccc-ChhhHHH-HHHHHHHHHHHHHHhcCCHHHhcc-cccCcCHHHHHH--HHHHHHHHHHHHHHH Q lcl|NC_018285. 
231 GGPLVLDDLEDFTPLEI-KSNVAQL-LKQADWTTGQFAKVYGIPENVVGG-QGDQQSSLEMSS--NVYSKAVARYLRPFL 305 (383) Q Consensus 231 g~~~vl~~g~~~~~~~~-~~~d~~~-~e~~~~~~~~Ia~~~gVpp~~lg~-~~~~~~~~e~~~--~~~~~~l~P~~~~i~ 305 (383) .+.++-+..-.+.++.. +..+.+. .+..+.....|-.+|-+. +.. .+..-+.+|... .=....|-|....+. T Consensus 304 ~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~~---l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~ 380 (510) T protein:vir:78 304 MGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG---ANQRDAERVTAEEVRITAEEAENTLGGTYSLLA 380 (510) T ss_pred CceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHHhhc---cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHH Confidence 12222221122333322 2233332 455567778888888432 221 111124444432 233455666666666 Q ss_pred HHHHHhhcchh-h----ccchh----hhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCC Q lcl|NC_018285. 306 SELSQKLSCDV-D----ADIFP----AVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKG 376 (383) Q Consensus 306 ~~l~~~l~~~~-e----~~~~~----~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~g 376 (383) ++|-.-|+... . ..+.. ..+....+....+....+ .-....+...++.-+.. ... .+ T Consensus 381 ~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~Laraq~--~~~l~~~~q~l~~~~~~----~q~--~~------ 446 (510) T protein:vir:78 381 ENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAA--VQSMLNASQVIAGLAPI----AQL--DP------ 446 (510) T ss_pred HHHHHHHHHHHHHHHHhccCCCCCcccccceeeecccHHHHHHH--HHHHHHHHHHHHHhcCh----hhh--hh------ Confidence 66544333210 0 00000 000000000000000000 00000111111100000 000 00 Q ss_pred CCCCCCC Q lcl|NC_018285. 
377 GETNGQD 383 (383) Q Consensus 377 gd~~~~d 383 (383) .=| T Consensus 447 ----~id 449 (510) T protein:vir:78 447 ----RIS 449 (510) T ss_pred ----cCC Confidence 014 No 244 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=54.14 E-value=0.51 Score=22.18 Aligned_cols=339 Identities=10% Similarity=0.012 Sum_probs=150.1 Q ss_pred CchhhhhhcCCcccccccccccchhh--ccccc---CCceechhhh---hccHHHHHHHHHHHHhhhh-C-----ce--- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPEF--LATLN---GSEWVSAETA---LKNSDLFSIISQLSNDLAT-A-----KL--- 63 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~--~~~~~---~~~~~~~~~a---~~~~~v~~~i~~ia~~ia~-~-----p~--- 63 (383) =+.|+.++..|..+..-|..+.+..+ .+.+. ....-..+.. +=.++--.|++.+|+.+-+ + || T Consensus 13 ~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l~~~ltpp~~~wF~l 92 (549) T protein:vir:10 13 NADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAMDSMITPATQLWHRL 92 (549) T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHHHhhccCCCCccccc Confidence 23445555555544444433321111 11000 0000011110 1233444566666655543 1 22 Q ss_pred eeecchh------hh--------hccCCCc-cCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEc Q lcl|NC_018285. 64 TTSRKQM------QG--------IVDNPSN-SANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRL 128 (383) Q Consensus 64 ~~~~~~~------~~--------l~~~PN~-~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~ 128 (383) .+.+... .. +...-+. .-+.+.-+-.+..+++++|+|.+++..+. 
++.+.+..++-.++-+..+ T Consensus 93 ~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~-~~~~~f~~~pl~~~~v~~d 171 (549) T protein:vir:10 93 KTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDV-GKGIVYRNVPMQRLWFAEN 171 (549) T ss_pred cCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecC-CCeeEEEEEEcCeEEEeeC Confidence 1211110 01 1111111 11223334567789999999999987654 3455555555555555555 Q ss_pred CCCceeE-EE-Eee-------------------------------------cC--c----cccc-----ceeec------ Q lcl|NC_018285. 129 DNQNGLY-YN-VTF-------------------------------------DD--P----RIPP-----KQHVP------ 152 (383) Q Consensus 129 ~~~~~~~-y~-~~~-------------------------------------~~--~----~~~~-----~~~~~------ 152 (383) ..|.... |+ +.. .. . ..+. .+.+. T Consensus 172 ~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~pf~sv~~e~~~~~i 251 (549) T protein:vir:10 172 NSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPRKLDGRNMQFASYWLDEGRDRI 251 (549) T ss_pred CCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCccccccccCceEEEEEEecCCEe Confidence 4443211 10 000 00 0 0000 00000 Q ss_pred -------ccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHH Q lcl|NC_018285. 153 -------QSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQA 225 (383) Q Consensus 153 -------~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~ 225 (383) ..-.+..|....++..||.||...+...+...+.+.+.......-...|.+++...+.++... ... T Consensus 252 l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~-------l~p 324 (549) T protein:vir:10 252 VQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANEDGVLDGFD-------LRS 324 (549) T ss_pred eccCCcccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccce-------ecc Confidence 011122222223566899999999999999999999999988888888888876655444311 111 Q ss_pred hhcCCcceeec--CCCceeeecccChhhHHH-HHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--HHHHHHHHHH Q lcl|NC_018285. 
226 MKQMQGGPLVL--DDLEDFTPLEIKSNVAQL-LKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--NVYSKAVARY 300 (383) Q Consensus 226 ~~~~~g~~~vl--~~g~~~~~~~~~~~d~~~-~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~~~~~~l~P~ 300 (383) +.....+. .+...++|+.... +.+. .+..+..+..|-.+|-+........+..-+.+|... .=....|-|. T Consensus 325 ---gg~~~~~~~~~~~~~~~pl~~~~-~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LGpv 400 (549) T protein:vir:10 325 ---GALNWGGLNDKGEEMVKPLLTGK-QAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPT 400 (549) T ss_pred ---CCccccccCCCCccceeeecccc-chhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHHHHHHHHHhhHH Confidence 11111111 2234566665442 3333 445677788999999887643322222234444432 2344667777 Q ss_pred HHHHHHHHHHhhcchhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcch----------------- Q lcl|NC_018285. 301 LRPFLSELSQKLSCDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKEL----------------- 363 (383) Q Consensus 301 ~~~i~~~l~~~l~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~----------------- 363 (383) ...+.++|=.-|+.+ .+.-|.+.|.+ ++. |.++ T Consensus 401 ~~rl~~E~l~Pli~R------------------~~~il~r~g~l-----------P~~-p~~l~~~~~~~~i~yis~La~ 450 (549) T protein:vir:10 401 LGRTQSELLGPMIAR------------------EVDILAEAGQL-----------PDM-PQELIDAGADVDVEYDSPLNK 450 (549) T ss_pred HHHHHHHHHHHHHHH------------------HHHHHHhcCCC-----------CCC-ChhhhcCCceeEEEeecHHHH Confidence 777776654332211 11112222322 110 1110 Q ss_pred -----------hHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 364 -----------PKGENPNRTILKGGETNGQD 383 (383) Q Consensus 364 -----------~~~~~~~~~~~~ggd~~~~d 383 (383) ..+.... 
++.+-+.+--| T Consensus 451 aq~~~~~~~i~~~~~~~~--~laq~~Pe~ld 479 (549) T protein:vir:10 451 AMRAGEGAAILQWLQQLG--IVSQFDPAAAK 479 (549) T ss_pred HHHHHHHHHHHHHHHHHH--HHhccChhHHh Confidence 0000000 11121111112 No 245 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=53.29 E-value=0.53 Score=22.08 Aligned_cols=366 Identities=10% Similarity=0.011 Sum_probs=149.2 Q ss_pred Cchhhh----hhcCCcccccccccccchhhcccccCCcee-chhh----hhccHHHHHHHHHHHHhhhhCceeeec-chh Q lcl|NC_018285. 1 MPIFNL----ATESPPNNQGGFFDITDPEFLATLNGSEWV-SAET----ALKNSDLFSIISQLSNDLATAKLTTSR-KQM 70 (383) Q Consensus 1 Mglf~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~----a~~~~~v~~~i~~ia~~ia~~p~~~~~-~~~ 70 (383) +.=|.. +...+... ....+.........- ..+. |.-.+.+...++.++..+-+-|..+.- ... T Consensus 22 ~~~W~~ird~~~G~~~~~-------~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~~p~~l 94 (489) T protein:vir:78 22 APKWQKVRHALAGELVSY-------LRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGSVMRKEPEINIPKEL 94 (489) T ss_pred HHHHHHHHHHhcCccccc-------ccCCCCCCCCCCCChHHHHHHHhccccCChHHHHHHHHhchhhcCCcceeccHHH Confidence 000000 00000000 000000000000000 0111 222345555666566555555544421 112 Q ss_pred hhhccCCC-ccCCHHHHHHHHHHHHHHcCCeEEEEeecCCC------------ceeEEEEecccee-------------- Q lcl|NC_018285. 71 QGIVDNPS-NSANRFNFYQSIFAQMLLGGEAFAYRWRNDNG------------RDMKWEYLRPSQV-------------- 123 (383) Q Consensus 71 ~~l~~~PN-~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g------------~~~~l~~l~~~~v-------------- 123 (383) ..|..... ...+-.+|.+.++...+.+|-+++++.....| +|. 
+..+.|..| T Consensus 95 ~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy-~~~~~~~~IinW~~~~v~G~~~L 173 (489) T protein:vir:78 95 EYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPT-IAFYTTENIVNWRLTRVGSVNRV 173 (489) T ss_pred HHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcE-EEEechhhhcCceeeeeCCccce Confidence 23333333 45778999999999999999999999875544 221 222333222 Q ss_pred ---EEEE-----cCCCc-----eeEEEE-------------ee---cCcccccceeeccc------ceEEec--cCCCCc Q lcl|NC_018285. 124 ---SFNR-----LDNQN-----GLYYNV-------------TF---DDPRIPPKQHVPQS------DILHFR--LLSVDG 166 (383) Q Consensus 124 ---~~~~-----~~~~~-----~~~y~~-------------~~---~~~~~~~~~~~~~~------dvih~~--~~~~~~ 166 (383) .+.. +..+. ...|++ .. +.........+.++ ..|=|- +....+ T Consensus 174 t~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~ 253 (489) T protein:vir:78 174 TMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGESLRGVIPFTFIGATNND 253 (489) T ss_pred eEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccceeeEEeccCCCCccCeeeEEEEecCCCC Confidence 1111 00000 000111 00 00000000001000 011111 111223 Q ss_pred cccCcchHHHHHH-HHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeec Q lcl|NC_018285. 167 GLTSVSPLMALGR-ELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPL 245 (383) Q Consensus 167 ~~~G~s~~~~~~~-~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~ 245 (383) ...+.+|+..+.. .+........+. ..+...+.|-.+++.....+++....... 
....-+++..+.++.+.++.-+ T Consensus 254 ~~~~~pPLl~LA~lni~Hy~~ssd~~-~~l~~~~~P~l~i~G~d~~~~~~~~~~~~--~~i~~g~~~~~~lp~~~~~~~i 330 (489) T protein:vir:78 254 ATIDDAPLLPLAELNIGHYRNSADNE-ESSFVVGQPTLFIYPGENLTPQAFKEANP--NGIKFGSRRGHNLGYGGSAQLI 330 (489) T ss_pred CCCCcCchHHHHHHHHHHhhhhhHHH-HHHHHcccceeeeecCccCCcccccccCc--cceeeCCcccccCCCCCCccee Confidence 4457788776554 444444444443 44445566777776544343332221111 0111233445666666544433 Q ss_pred ccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHH--HHHHHHHHHHHHHHHHHHHHHhhcc--------- Q lcl|NC_018285. 246 EIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMS--SNVYSKAVARYLRPFLSELSQKLSC--------- 314 (383) Q Consensus 246 ~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~--~~~~~~~l~P~~~~i~~~l~~~l~~--------- 314 (383) ..+...+. .+..+....+. ..+-..++. ...+.+..+.. ...-+..|.-++..++++|+..|-- T Consensus 331 e~~~~~~~-r~~l~~le~qm---~~lGa~l~~-~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~~~ 405 (489) T protein:vir:78 331 QAGENNLA-RQNMLDKEQQA---IQIGAQLIT-PTQQITAQSARIQRGADTSVMATIARNVSQAYTDALRWVAVMLGKPE 405 (489) T ss_pred ccCcchHH-HHHHHHHHHHH---HHHhhhhcc-CCcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCC Confidence 33333322 12222212221 111122332 11111222221 2334566777888888888775421 Q ss_pred --hhhccchhhhc---cCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCc-chhHHhCCCCCCCC-----CCCCCC-- Q lcl|NC_018285. 315 --DVDADIFPAVD---PTGANYISRINSMVKSGTLAQNQGLYILQQAEILPK-ELPKGENPNRTILK-----GGETNG-- 381 (383) Q Consensus 315 --~~e~~~~~~~~---~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~-d~~~~~~~~~~~~~-----ggd~~~-- 381 (383) +.++.+...|. .+.. ....+-+++++|.++..+.++.|...++.+. +....+.....+.. +||.+. T Consensus 406 ~~~~~i~~n~dF~~~~~d~~-~~~al~~~~~~G~is~~t~~~~L~~~gv~d~~~e~~~~ei~~~~~~~~~~~~g~~~~~~ 484 (489) T protein:vir:78 406 DTEVEFRLNMDFFLEPMTAQ-DRAAWMADINAGLLPATAYYAALRKAGVTDWTDADIKDAVADQPLPVATEVQGEIPQSA 484 (489) T ss_pred CCceEEEeecccCcccCCHH-HHHHHHHHHhcCCCCHHHHHHHHHhCCCCCccHHHHHHHHhhcCCCcccCCcccCCCCc Confidence 12333333332 2222 3445566788999999999999988777532 21111111112222 333322 Q ss_pred CC Q lcl|NC_018285. 
382 QD 383 (383) Q Consensus 382 ~d 383 (383) |+ T Consensus 485 q~ 486 (489) T protein:vir:78 485 QQ 486 (489) T ss_pred cc Confidence 22 No 246 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=51.84 E-value=0.57 Score=21.92 Aligned_cols=367 Identities=10% Similarity=0.056 Sum_probs=144.3 Q ss_pred CchhhhhhcCCc--ccccccccccch-------------hhcccccCCce--e---chhh----hhccHHHHHHHHHHHH Q lcl|NC_018285. 1 MPIFNLATESPP--NNQGGFFDITDP-------------EFLATLNGSEW--V---SAET----ALKNSDLFSIISQLSN 56 (383) Q Consensus 1 Mglf~~~~~~~~--~~~~~~~~~~~~-------------~~~~~~~~~~~--~---~~~~----a~~~~~v~~~i~~ia~ 56 (383) |.-.+. +.|. .....|. ..+. .+++-+..-.. - ..+. |.-.+.+...++.++. T Consensus 32 m~dV~~--~hp~y~a~~~~W~-~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G 108 (535) T protein:vir:80 32 LPNVGY--QRVEFGEMLPKWR-KIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMMG 108 (535) T ss_pred CCCCCc--CCHHHHHHHHHHH-HHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHhc Confidence 442111 0000 0000000 0000 01111100000 0 0111 2233344455554544 Q ss_pred hhhhCceeee-cchhhhhccCCC-ccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCcee------------EEEEeccce Q lcl|NC_018285. 57 DLATAKLTTS-RKQMQGIVDNPS-NSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDM------------KWEYLRPSQ 122 (383) Q Consensus 57 ~ia~~p~~~~-~~~~~~l~~~PN-~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~------------~l~~l~~~~ 122 (383) .+-+-|..+. -.....|..... ...+-.+|.+.++...+.+|-+++++.....|... -+..+.|.. 
T Consensus 109 ~vfrk~p~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~ 188 (535) T protein:vir:80 109 QVFSRDPIRQLPPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTS 188 (535) T ss_pred hhhcCCcceeccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhh Confidence 4444443321 112223333333 44678999999999999999999999765444210 112121111 Q ss_pred e-----------------EEE----EcCCC--cee--EEEE-----------ee---cCc--ccc-cceeeccc------ Q lcl|NC_018285. 123 V-----------------SFN----RLDNQ--NGL--YYNV-----------TF---DDP--RIP-PKQHVPQS------ 154 (383) Q Consensus 123 v-----------------~~~----~~~~~--~~~--~y~~-----------~~---~~~--~~~-~~~~~~~~------ 154 (383) | .+. ..+++ ... .|++ .. ... ... ....++.+ T Consensus 189 IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l 268 (535) T protein:vir:80 189 IINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPF 268 (535) T ss_pred ccCccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCCccccccceeecccCCCccc Confidence 1 110 01111 000 1111 10 000 000 00011111 Q ss_pred c---eEEeccCCCCccccCcchHHHHHH-HHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCC Q lcl|NC_018285. 155 D---ILHFRLLSVDGGLTSVSPLMALGR-ELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQ 230 (383) Q Consensus 155 d---vih~~~~~~~~~~~G~s~~~~~~~-~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~ 230 (383) . ++++ +....+...|.+|+..+.. .+........+. ..+...+.|-.+++.......+.. .+ -....-++ T Consensus 269 ~~IPfv~~-~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~-~il~~~~~P~l~i~G~~~~~~~~~---~~-~~~i~iG~ 342 (535) T protein:vir:80 269 KEIPFQFI-GPLDNNADIDHPPLLDLCEVNIGHYRNSADYE-EMAFVAGQPTAFFTGLTKDWVEDV---FK-DFKVHLGS 342 (535) T ss_pred CeeEEEEe-ecCCCCCCCCccchHHHHHHHHHHhhchhHHH-HHHHHhcCceeeeecCchhhhhcC---CC-CcceEecC Confidence 1 1222 1223344567788766554 344433333333 334445567776664322111100 00 00011234 Q ss_pred cceeecCCCc--eeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHHHH--HHHHHHHHHHHHHH Q lcl|NC_018285. 
231 GGPLVLDDLE--DFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSSNV--YSKAVARYLRPFLS 306 (383) Q Consensus 231 g~~~vl~~g~--~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~~--~~~~l~P~~~~i~~ 306 (383) +..+.++.+. +|..++.+....+.++.+ .++.++ .| ..++.....+....+....+ -...|.-++..+++ T Consensus 343 ~~~~~lP~~~~~~~~e~~~~~~a~~~l~~~---e~qM~~-lG--a~ll~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~ 416 (535) T protein:vir:80 343 RAIIPLPQGATAGILQITPNSVPFEAMTHK---ESQMIA-MG--ANLLVKSGGNRTFGEAQQEEASEQSILSACTKNVSM 416 (535) T ss_pred cccccCCCCCCcceeeeccchhHHHHHHHH---HHHHHH-HH--HHhhccCcccccHHHHHHHHHHHhHHHHHHHHHHHH Confidence 4556677654 445554444433333332 223222 11 11221111111122222222 23456667777777 Q ss_pred HHHHhhc-------c-----hhhccchhhh---ccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcch------hH Q lcl|NC_018285. 307 ELSQKLS-------C-----DVDADIFPAV---DPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKEL------PK 365 (383) Q Consensus 307 ~l~~~l~-------~-----~~e~~~~~~~---~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~------~~ 365 (383) +|+..|- . .+++.+...| ..+.. ....+-+++++|.++..+.++.|..-++...+. .+ T Consensus 417 al~~aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~ld~~-~~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~r 495 (535) T protein:vir:80 417 AFRKALRWANQFQTGIVNDETVEYNLNTDFPAARLTPN-ERAELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGK 495 (535) T ss_pred HHHHHHHHHHHHcCCccCCCceEEEeccccccccCCHH-HHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHH Confidence 7766442 1 1222332222 22333 344455778899999999999998877754321 11 Q ss_pred H-hCCCCCCCCCCCCCC-----CC Q lcl|NC_018285. 366 G-ENPNRTILKGGETNG-----QD 383 (383) Q Consensus 366 ~-~~~~~~~~~ggd~~~-----~d 383 (383) . ......+..+|+++. +. 
T Consensus 496 i~~E~~~~~~~~g~~~d~~~~g~~ 519 (535) T protein:vir:80 496 ATVEFIAKTAAAGKVGDAASGGTN 519 (535) T ss_pred HHhhhhhccccCCCCCCCCCCCCC Confidence 1 111222333443321 11 No 247 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=51.76 E-value=0.57 Score=21.91 Aligned_cols=347 Identities=8% Similarity=0.024 Sum_probs=137.4 Q ss_pred CchhhhhhcCCcccccccc---cccchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhC------ce---eeecc Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFF---DITDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA------KL---TTSRK 68 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~------p~---~~~~~ 68 (383) =+.|.++++.++ ...|. .++.|..+..-.....-...+.+ .++--.|++.+|+.+-+. || .+.+. T Consensus 6 ~~~~~~lkR~~~--e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~-dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~ 82 (510) T protein:vir:63 6 AMLWEKLRDGSV--EQRAIEFAKTTLPYLMVDPMSGSRGVVEHDF-QSAGALLVNNLAAKLARSLFPTGIPFFRSELTDA 82 (510) T ss_pred HHHHHHHhccch--HHHHHHHHHhhccccCCCCCCccccccCCCc-cchHHHHHHHHHHHHHhhhcCCCCcccccCCChH Confidence 234555543332 22222 22233222211111111111122 334456666666665441 22 11110 Q ss_pred h-------------hhhhc----cCCCcc---CCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEc Q lcl|NC_018285. 69 Q-------------MQGIV----DNPSNS---ANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRL 128 (383) Q Consensus 69 ~-------------~~~l~----~~PN~~---~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~ 128 (383) . .+.++ +..... -+.+.-+..+..+++.+|||.+++. .+|.....||+.. 
+-+..+ T Consensus 83 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~--~~~~~~~~~pl~~--y~v~~d 158 (510) T protein:vir:63 83 IRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRD--SDAATVVAWSLRS--YAVRRD 158 (510) T ss_pred HhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEc--CCCcEEEEEEcce--eEEeeC Confidence 0 01100 001111 1233334566688899999987765 4455556666642 222232 Q ss_pred CCCce--e---------------------------------EEEEee-cCccccc----cee-------------ecccc Q lcl|NC_018285. 129 DNQNG--L---------------------------------YYNVTF-DDPRIPP----KQH-------------VPQSD 155 (383) Q Consensus 129 ~~~~~--~---------------------------------~y~~~~-~~~~~~~----~~~-------------~~~~d 155 (383) ..|.. + .|.... .+..+.. ..+ +...- T Consensus 159 ~~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~~~~~~~~~~~e~P 238 (510) T protein:vir:63 159 ATGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGKEGRWPIHLCP 238 (510) T ss_pred CCcCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEEEeecCCCceEEEEEEEecCceeccccccccccCc Confidence 22211 0 110000 0000000 000 11112 Q ss_pred eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceee Q lcl|NC_018285. 156 ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLV 235 (383) Q Consensus 156 vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~v 235 (383) .+..|....++..||.||...+...+...+.+.+.......-...|..++..++........ .+..+.++ T Consensus 239 ~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~~~----------~~~~g~~v 308 (510) T protein:vir:63 239 YIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ----------DAEMGDYV 308 (510) T ss_pred eeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhhc----------cCCCceee Confidence 33333333456789999999999999999999999888877777777777665543332111 11111122 Q ss_pred cC--CCceeeecccChhhHH-HHHHHHHHHHHHHHHhcCCHHHhcc-cccCcCHHHHHH--HHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 
236 LD--DLEDFTPLEIKSNVAQ-LLKQADWTTGQFAKVYGIPENVVGG-QGDQQSSLEMSS--NVYSKAVARYLRPFLSELS 309 (383) Q Consensus 236 l~--~g~~~~~~~~~~~d~~-~~e~~~~~~~~Ia~~~gVpp~~lg~-~~~~~~~~e~~~--~~~~~~l~P~~~~i~~~l~ 309 (383) -+ ++++..++.. ..+.+ ..+..+..+..|-.+|-+. +.. .+..-+.+|... .=....|-|....+.++|- T Consensus 309 ~g~~~~v~~~~~~~-~~d~~~~~~~i~~~~~rI~~af~~~---l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l 384 (510) T protein:vir:63 309 PGGAEAVRAYERGD-YNKMAAIQQSLQAVVVRLNQAFMYG---ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQ 384 (510) T ss_pred cCCcccceeeecCc-ccchHHHHHHHHHHHHHHHHHHHhh---cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHH Confidence 11 2233322222 23333 2455567778888888432 221 111224444432 2334456666666666654 Q ss_pred Hhhcchh-h----ccch----hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhHHhCCCCCCCCCCCCC Q lcl|NC_018285. 310 QKLSCDV-D----ADIF----PAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPKGENPNRTILKGGETN 380 (383) Q Consensus 310 ~~l~~~~-e----~~~~----~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~ 380 (383) .-|+... . ..+. ...+....+....+....+ .-....+...++.-+.. ... .+ ..| T Consensus 385 ~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~Laraq~--~~~l~~~~q~l~~~~~~----aq~--~~-------~id 449 (510) T protein:vir:63 385 SPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAA--VQSMLNASQVIAGLAPI----AQL--DP-------RIS 449 (510) T ss_pred HHHHHHHHHHHHhccCCCCCchhcccceecchhHHHHHHH--HHHHHHHHHHHHHhcCc----hhh--hc-------cCC Confidence 4333210 0 0000 0001110000011110000 00011111111111100 110 01 123 Q ss_pred CCC Q lcl|NC_018285. 
381 GQD 383 (383) Q Consensus 381 ~~d 383 (383) ..+ T Consensus 450 ~d~ 452 (510) T protein:vir:63 450 LPK 452 (510) T ss_pred HHH Confidence 332 No 248 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=51.13 E-value=0.59 Score=21.84 Aligned_cols=378 Identities=11% Similarity=0.118 Sum_probs=168.7 Q ss_pred CchhhhhhcCCc-----------------ccccccc----cccch----hhcccccCCce--------e-chhhhhccHH Q lcl|NC_018285. 1 MPIFNLATESPP-----------------NNQGGFF----DITDP----EFLATLNGSEW--------V-SAETALKNSD 46 (383) Q Consensus 1 Mglf~~~~~~~~-----------------~~~~~~~----~~~~~----~~~~~~~~~~~--------~-~~~~a~~~~~ 46 (383) ++||....+... ....+.. ++..+ ++.+.+.+... + ..+..+.+|. T Consensus 6 l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~ma~~pE 85 (521) T protein:vir:10 6 LKLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSLSKYHE 85 (521) T ss_pred hHHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHHHhhccc Confidence 445554322111 0000000 00000 01111111110 0 3455678999 Q ss_pred HHHHHHHHHHhhhhC-----ceeeecchh-------hhhccCCC---ccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-- Q lcl|NC_018285. 47 LFSIISQLSNDLATA-----KLTTSRKQM-------QGIVDNPS---NSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-- 109 (383) Q Consensus 47 v~~~i~~ia~~ia~~-----p~~~~~~~~-------~~l~~~PN---~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-- 109 (383) |..||+-|.+.+.-+ |+.+.=++. 
..+...-+ ..++...--...+..|++.|..|..++.+.+ T Consensus 86 vd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~p 165 (521) T protein:vir:10 86 VDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSRIYFHKMIDPARP 165 (521) T ss_pred hhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeeEEEEEEeeCCCc Confidence 999999999887533 222210000 00111111 1112222224567788999999998876532 Q ss_pred -CceeEEEEeccceeEEEEcC----CC-------ceeEEEEeec-------CcccccceeecccceEEecc--CCCCccc Q lcl|NC_018285. 110 -GRDMKWEYLRPSQVSFNRLD----NQ-------NGLYYNVTFD-------DPRIPPKQHVPQSDILHFRL--LSVDGGL 168 (383) Q Consensus 110 -g~~~~l~~l~~~~v~~~~~~----~~-------~~~~y~~~~~-------~~~~~~~~~~~~~dvih~~~--~~~~~~~ 168 (383) .-+.+|.+|+|..+..++.. .. ...+|-+... ++.......++.+.|.|... .+.++ . T Consensus 166 k~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~daI~y~hSGL~d~~~-~ 244 (521) T protein:vir:10 166 KDGIKELRLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGATEDNRYNISGNSNNLVQIPIDAIVYSHSGKVDIDG-K 244 (521) T ss_pred cccceeeeeeCCcceeeeeeecCCCCCcchhhccceeeeeeccCCCceecCCCCCCcceeechhheeeecccceeCCC-C Confidence 24899999999998665421 11 1122222110 11222335577766655542 22233 4 Q ss_pred cCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCC-HHHHHHHHHHHHH----h-hcCC-c------ce- Q lcl|NC_018285. 169 TSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGL-LDFKTKVSRSRQA----M-KQMQ-G------GP- 233 (383) Q Consensus 169 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~-~e~~~~~~~~~~~----~-~~~~-g------~~- 233 (383) ..+|-|..+.+.+....-++....=+.-.-+.-+-+.-.+ |.+. ..+.+.++..... . ++++ | +. 
T Consensus 245 ~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~m 324 (521) T protein:vir:10 245 TIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNSSNNL 324 (521) T ss_pred ceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhh Confidence 5789999999999888888887766655555444444333 3333 3333333332211 1 1111 1 11 Q ss_pred -------eec---CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc----CcCHHHHH-HHHHHHHHH Q lcl|NC_018285. 234 -------LVL---DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD----QQSSLEMS-SNVYSKAVA 298 (383) Q Consensus 234 -------~vl---~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~----~~~~~e~~-~~~~~~~l~ 298 (383) +.= +.|.+++.+.....-.+ ++-..+..+.+..+++||.+-|+...+ +..++=.. +.=+...|. T Consensus 325 sMlEDyWLpRReGgrgTEI~TLpggqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItRDEikF~KFI~ 403 (521) T protein:vir:10 325 AMTEDYWLMRRDGKATTEVSTLPGAQSMGE-MDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITRDELQFTKYIR 403 (521) T ss_pred hhHhhhcccccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCccccCCCCCceecccccchhHHHHHHHHHHH Confidence 111 23567776655433333 233357788999999999999975422 11222111 122333444 Q ss_pred HHHHHHHHHHHH----hhc-----chhhcc-----chhhhccCHH--------HHHHHHHHH-------HhCCCcCHHHH Q lcl|NC_018285. 299 RYLRPFLSELSQ----KLS-----CDVDAD-----IFPAVDPTGA--------NYISRINSM-------VKSGTLAQNQG 349 (383) Q Consensus 299 P~~~~i~~~l~~----~l~-----~~~e~~-----~~~~~~~~~~--------~~~~~~~~l-------~~~g~~t~nE~ 349 (383) -+-..+...|.. .|. +.-+++ +...+..|.. -....++.+ +-+-.++.+=+ T Consensus 404 rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi 483 (521) T protein:vir:10 404 GLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHEYV 483 (521) T ss_pred HHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchHHH Confidence 444444444433 332 111221 1111111111 111122211 12335666666 Q ss_pred HHHh-hcCCcCCcchhH-----HhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
350 LYIL-QQAEILPKELPK-----GENPNRTILKGGETNGQD 383 (383) Q Consensus 350 r~~l-g~~~~~~~d~~~-----~~~~~~~~~~ggd~~~~d 383 (383) ++.+ .+. ..|+.. -+.......+--+.+.+| T Consensus 484 ~k~ILr~t---Deeik~~~k~I~~E~~~~~~~~p~~e~~d 520 (521) T protein:vir:10 484 MKNILRMS---DEDIKTEREKIDGELKDSVYKNPEDPMEE 520 (521) T ss_pred HHHHhcCC---HhHHHHHHHHHHHhhhCCCCCCCcchhhc Confidence 6643 332 222211 111111112222334444 No 249 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=50.70 E-value=0.6 Score=21.79 Aligned_cols=362 Identities=9% Similarity=0.043 Sum_probs=144.7 Q ss_pred CchhhhhhcCCcccccccccc---cchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhC------ce---eeecc Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDI---TDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA------KL---TTSRK 68 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~------p~---~~~~~ 68 (383) =+.|+.++.+|..+...|..+ +.|..++.-..+. ...+.+ .++-..|++.+|+.+.+. || .+.+. T Consensus 16 ~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~--~~~~~~-dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~ 92 (515) T protein:vir:70 16 PKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNE--TSQNGW-QGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAK 92 (515) T ss_pred HHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCcc--cccccc-cchHHHHHHHHHHHHHHhhcCCCCcccccccChh Confidence 456666766665544444333 2332222111111 111222 445555677676665541 22 11111 Q ss_pred hhhh---------------------hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEE Q lcl|NC_018285. 69 QMQG---------------------IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNR 127 (383) Q Consensus 69 ~~~~---------------------l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~ 127 (383) .... +...-+ .-+.+.-+..++.+++.+|||.+++.. ++ +...|||.. +-+.. 
T Consensus 93 ~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~G~a~l~~d~--~~-~~~~~pl~~--y~v~~ 166 (515) T protein:vir:70 93 GEKVLDDRGLKKTQLATIFARVETTAMKALE-QRQFRPAIVEVFKHLIVAGNCLLYKPS--KG-AMSAVPMHH--YVVNR 166 (515) T ss_pred hhhccccchhHHHHHHHHHHHHHHHHHHHHH-hcCchHHHHHHHHHHHhHCeEEEEEeC--CC-CeEEEEcCe--EEEee Confidence 0000 111111 113334445667788899999888743 22 245566532 22333 Q ss_pred cCCCcee-------------------------------------EEEEeecCccccccee---------------ecccc Q lcl|NC_018285. 128 LDNQNGL-------------------------------------YYNVTFDDPRIPPKQH---------------VPQSD 155 (383) Q Consensus 128 ~~~~~~~-------------------------------------~y~~~~~~~~~~~~~~---------------~~~~d 155 (383) +..|... .|.............. |...- T Consensus 167 d~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~~~e~d~~~~~~es~y~~~e~P 246 (515) T protein:vir:70 167 DTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKESRIKSEKLP 246 (515) T ss_pred CCCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCCCceEEEEecCceeeccccccccccCC Confidence 3222111 1100000000000000 11112 Q ss_pred eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceee Q lcl|NC_018285. 156 ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLV 235 (383) Q Consensus 156 vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~v 235 (383) .+..|....++..||.||...+...+...+.+.+.......-...|..++..++........ .+..+.++ T Consensus 247 ~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~----------~~~~g~iv 316 (515) T protein:vir:70 247 FIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFV----------NSGTGEVI 316 (515) T ss_pred ceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhcc----------ccCCceee Confidence 23333333456789999999999999999999999998888888888888665554432111 11112222 Q ss_pred cC--CCceeeecccChhhHH-HHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--HHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 
236 LD--DLEDFTPLEIKSNVAQ-LLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--NVYSKAVARYLRPFLSELSQ 310 (383) Q Consensus 236 l~--~g~~~~~~~~~~~d~~-~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~~~~~~l~P~~~~i~~~l~~ 310 (383) -+ +++...++.. ..|.+ ..+..+..+..|-.+|-+......... .-+.+|... .=....|-|.+..+.++|=. T Consensus 317 ~g~~~~v~~~~~~~-~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~-rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~ 394 (515) T protein:vir:70 317 TGVAEDIHIVQLGK-YADLTPISAVLEVYTRRIGVIFMMETMTRRDAE-RVTAVEIQRDALEIEQNMGGVYSLFAMTMQT 394 (515) T ss_pred cCCcccceeeecCc-ccchhHHHHHHHHHHHHHHHHHhhhhhhccCCc-cccHHHHHHHHHHHHHHhhHHHHHHHHHHHH Confidence 22 2233333222 23333 245557778889999977643332221 223333332 23344566666666666543 Q ss_pred hh--------cchhhccchhhhccCH---HHHHHHHHHHHh---------------CCCcCHHHHHHHh-hcCCcCC--- Q lcl|NC_018285. 311 KL--------SCDVDADIFPAVDPTG---ANYISRINSMVK---------------SGTLAQNQGLYIL-QQAEILP--- 360 (383) Q Consensus 311 ~l--------~~~~e~~~~~~~~~~~---~~~~~~~~~l~~---------------~g~~t~nE~r~~l-g~~~~~~--- 360 (383) -| ++..--.+......+. ..+....+++.+ ...+..+++-+.+ ...+++. T Consensus 395 Pli~r~~~~~~p~~P~~~v~~~~vs~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~ 474 (515) T protein:vir:70 395 PIAMWGLQEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFL 474 (515) T ss_pred HHHHHHHHhhCCCCChhhcccceehhHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCcccc Confidence 33 2211000000000011 011111111100 0111111111110 0001000 Q ss_pred ---cchhHHh----------CCCCCCCCCCCCCCCC Q lcl|NC_018285. 361 ---KELPKGE----------NPNRTILKGGETNGQD 383 (383) Q Consensus 361 ---~d~~~~~----------~~~~~~~~ggd~~~~d 383 (383) .|+.... 
.+.....+-.+.--+| T Consensus 475 rs~eev~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~ 510 (515) T protein:vir:70 475 KSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQ 510 (515) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccchhh Confidence 0000000 0000000000000000 No 250 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=50.28 E-value=0.61 Score=21.74 Aligned_cols=379 Identities=11% Similarity=0.106 Sum_probs=164.6 Q ss_pred CchhhhhhcCCc--------cccccccccc--chh------h----cccccCCcee--------------chhhhhccHH Q lcl|NC_018285. 1 MPIFNLATESPP--------NNQGGFFDIT--DPE------F----LATLNGSEWV--------------SAETALKNSD 46 (383) Q Consensus 1 Mglf~~~~~~~~--------~~~~~~~~~~--~~~------~----~~~~~~~~~~--------------~~~~a~~~~~ 46 (383) +.||+...+... .....+.... +++ . .+...++.++ ..+..+.+|. T Consensus 8 ~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pE 87 (524) T protein:vir:10 8 LSFLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYRNLMNNYE 87 (524) T ss_pred HHHhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHHHHhhccc Confidence 333333221100 0000000000 000 0 0000011111 2455678999 Q ss_pred HHHHHHHHHHhhhhC-----ceeeecchhh-------hhccCCC---ccCCHHHHHHHHHHHHHHcCCeEEEEeecCC-- Q lcl|NC_018285. 
47 LFSIISQLSNDLATA-----KLTTSRKQMQ-------GIVDNPS---NSANRFNFYQSIFAQMLLGGEAFAYRWRNDN-- 109 (383) Q Consensus 47 v~~~i~~ia~~ia~~-----p~~~~~~~~~-------~l~~~PN---~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~-- 109 (383) |..||+-|.+.+.-+ |+.+.=.+.+ .+...-+ ..++...--...+..|++.|..|..++.+.+ T Consensus 88 vd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~p 167 (524) T protein:vir:10 88 VDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHKIINPKKM 167 (524) T ss_pred hhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEEEeeCCCc Confidence 999999999886532 2222111100 0111111 1112222223566788999999998876532 Q ss_pred -CceeEEEEeccceeEEEEc-----CCCc------eeEEEEeecC---------cccccceeecccceEEeccCCCC-cc Q lcl|NC_018285. 110 -GRDMKWEYLRPSQVSFNRL-----DNQN------GLYYNVTFDD---------PRIPPKQHVPQSDILHFRLLSVD-GG 167 (383) Q Consensus 110 -g~~~~l~~l~~~~v~~~~~-----~~~~------~~~y~~~~~~---------~~~~~~~~~~~~dvih~~~~~~~-~~ 167 (383) .-+.+|.+|+|..+..++. +.+. ..+|-+...+ ........++.+.|.|...--.+ +. T Consensus 168 k~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~dAIvy~~SGL~d~~~ 247 (524) T protein:vir:10 168 KDGVQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRAAVVYAHSGLLDCCG 247 (524) T ss_pred cccceeeeeeCCccceeeeeecccCcccchhhcchhhheeecCCCcccccCcceecCCcceecchhheeeeccCcccCCC Confidence 2489999999999876432 1111 1112221110 12223456888888887632222 11 Q ss_pred ccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCCH-HHHHHHHHHHHHh-----hc-------CCcce Q lcl|NC_018285. 168 LTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGLL-DFKTKVSRSRQAM-----KQ-------MQGGP 233 (383) Q Consensus 168 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~~-e~~~~~~~~~~~~-----~~-------~~g~~ 233 (383) -.=+|-|..+.+.+....-++....=+.-.-+.-+-+.-.+ |.+.+ .+.+.++...... +. +..+. 
T Consensus 248 ~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~ 327 (524) T protein:vir:10 248 KNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYDASTGKIKNQQHN 327 (524) T ss_pred CceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeccCCeeccchhh Confidence 23467888888888887777777665555545444444333 33333 3333333322111 11 11111 Q ss_pred --------eec---CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc----CcCHHHHH--HHHHHHH Q lcl|NC_018285. 234 --------LVL---DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD----QQSSLEMS--SNVYSKA 296 (383) Q Consensus 234 --------~vl---~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~----~~~~~e~~--~~~~~~~ 296 (383) +.= +.|.+++.+.....-.+ ++-..+..+.+..+++||.+-|+...+ .....|-. +.=+... T Consensus 328 msMlEDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF~KF 406 (524) T protein:vir:10 328 MSMTEDYWLQRRDGKAVTEVDTMPGATGMSD-MDDVLYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDELKFAKW 406 (524) T ss_pred hhhHhhhcccccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCchhccCCCCccccccccchhhHHHHHHHHH Confidence 111 23567776655433233 233357788999999999999953221 11112222 1223334 Q ss_pred HHHHHHHHHHHHHH----hhcc-----hhhcc-----chhhhccCHH--------HHHHHHHHHH-----hCCCcCHHHH Q lcl|NC_018285. 297 VARYLRPFLSELSQ----KLSC-----DVDAD-----IFPAVDPTGA--------NYISRINSMV-----KSGTLAQNQG 349 (383) Q Consensus 297 l~P~~~~i~~~l~~----~l~~-----~~e~~-----~~~~~~~~~~--------~~~~~~~~l~-----~~g~~t~nE~ 349 (383) |.-+-..+...|.. .|.. .-|++ +...+..|.. -....++.+- -+-.++.+=+ T Consensus 407 I~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi 486 (524) T protein:vir:10 407 IRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTA 486 (524) T ss_pred HHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHH Confidence 44444444444433 3321 11221 1111111111 1111222111 1334566666 Q ss_pred HHHh-hcCCcCCcchhHH-----hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
350 LYIL-QQAEILPKELPKG-----ENPNRTILKGGETNGQD 383 (383) Q Consensus 350 r~~l-g~~~~~~~d~~~~-----~~~~~~~~~ggd~~~~d 383 (383) ++.+ .+. ..|+... +.......+--+.+++| T Consensus 487 ~k~ILr~t---Deei~~~~k~I~~E~k~~~~~~~~~~~~~ 523 (524) T protein:vir:10 487 MKDFLQMT---DEEINQEAKQIEEESKEARFQNPDEEEED 523 (524) T ss_pred HHHHhccC---HHHHHHHHHHHHHHhhcCCCCCCChhhhc Confidence 6543 322 2222211 11111111122334444 No 251 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=46.95 E-value=0.72 Score=21.37 Aligned_cols=374 Identities=11% Similarity=0.082 Sum_probs=160.4 Q ss_pred CchhhhhhcCCcc-ccccccccc--------chhhccc---ccCCcee--------chhhhhccHHHHHHHHHHHHhhhh Q lcl|NC_018285. 1 MPIFNLATESPPN-NQGGFFDIT--------DPEFLAT---LNGSEWV--------SAETALKNSDLFSIISQLSNDLAT 60 (383) Q Consensus 1 Mglf~~~~~~~~~-~~~~~~~~~--------~~~~~~~---~~~~~~~--------~~~~a~~~~~v~~~i~~ia~~ia~ 60 (383) =.||....++... ...++..+. ...+.+. ..++... ..+..+.+|.|..||+.|.+.+.- T Consensus 2 ~~lfgf~i~~~~~~~~~S~vpp~~~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~Av~eIVneaIv 81 (564) T protein:vir:10 2 SQLFGFLINEKEGQKGQSPVPPNDEASVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVDSAIDEIVNEFVV 81 (564) T ss_pred cchhcceeeeeccCCCCCcccCCcCCChhhhhccccceeeecccccchhhHHHHHHHHHHHhhccchhhHHHHhhcceeE Confidence 2455554332211 111111111 1111111 1111111 245567899999999999987542 Q ss_pred C-----ceeeecch--------------hhhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecC-C--CceeEEEEe Q lcl|NC_018285. 61 A-----KLTTSRKQ--------------MQGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRND-N--GRDMKWEYL 118 (383) Q Consensus 61 ~-----p~~~~~~~--------------~~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~-~--g~~~~l~~l 118 (383) + |+.+.=.+ .+.++.--|....+ ...+..|++.|..|..++.+. 
+ .-+.+|.+| T Consensus 82 ~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~----~e~fR~WYVDgRi~fHkiid~~~pk~GI~eLr~l 157 (564) T protein:vir:10 82 NDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNA----HEIIRNWYVDGRSHYHKVIDLDNPKKGILELRYI 157 (564) T ss_pred ecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhh----hHHHhhhhhcceEEEEEEeeCCChhhhhhhhhhh Confidence 2 22221100 01111112222233 355678889999999887653 2 138899999 Q ss_pred ccceeEEEEcCC------Cce---------------eEEEEeecCcc-------------cccceeecccceEEecc--C Q lcl|NC_018285. 119 RPSQVSFNRLDN------QNG---------------LYYNVTFDDPR-------------IPPKQHVPQSDILHFRL--L 162 (383) Q Consensus 119 ~~~~v~~~~~~~------~~~---------------~~y~~~~~~~~-------------~~~~~~~~~~dvih~~~--~ 162 (383) +|..++.++... +.. .+|.+...... ......++.+.|.|... . T Consensus 158 DPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~daI~y~hSGL~ 237 (564) T protein:vir:10 158 DSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIASDAIAQSTSGLM 237 (564) T ss_pred cccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeechhhcceecccce Confidence 999887765211 110 12222211110 11235667777776653 2 Q ss_pred CCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCCH-HHHHHHHHHHHHh-----hcCC-c--- Q lcl|NC_018285. 163 SVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGLL-DFKTKVSRSRQAM-----KQMQ-G--- 231 (383) Q Consensus 163 ~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~~-e~~~~~~~~~~~~-----~~~~-g--- 231 (383) +.++. .=+|-|..+.+.+....-++....=+.-.-+.-+-|.-.+ |.+.+ .+.+.++...... 
++++ | T Consensus 238 d~~~~-~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGevr 316 (564) T protein:vir:10 238 DLNKK-MTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIR 316 (564) T ss_pred eCCCC-ceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceec Confidence 22222 2367788888888887777777665555545444444333 33333 3333333322111 1111 1 Q ss_pred ---ce--------eec---CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc----CcCHHHHH-HHH Q lcl|NC_018285. 232 ---GP--------LVL---DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGD----QQSSLEMS-SNV 292 (383) Q Consensus 232 ---~~--------~vl---~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~----~~~~~e~~-~~~ 292 (383) +. +.= +.|.+++.+.....-.+ ++-..+..+.++.+++||.+.|..... +..++=.. +.= T Consensus 317 ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLge-m~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiK 395 (564) T protein:vir:10 317 DDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDELK 395 (564) T ss_pred ccchhhhhHhhhcccccCCCcccceeeccccCCcch-HHHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHHHHH Confidence 11 110 23567776655433333 223357788999999999999975421 12222111 122 Q ss_pred HHHHHHHHHHHHHHHHHH----hhcc-----hhhcc-----chhhhccCHH--------HHHHHHHHHH-----hCCCcC Q lcl|NC_018285. 293 YSKAVARYLRPFLSELSQ----KLSC-----DVDAD-----IFPAVDPTGA--------NYISRINSMV-----KSGTLA 345 (383) Q Consensus 293 ~~~~l~P~~~~i~~~l~~----~l~~-----~~e~~-----~~~~~~~~~~--------~~~~~~~~l~-----~~g~~t 345 (383) +...|.-+-..+...|.. .|.. .-+++ +...+..|.. -....++.+- -+-.++ T Consensus 396 F~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S 475 (564) T protein:vir:10 396 FTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFS 475 (564) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 333444444444444433 3321 11211 1111111111 0111111111 122334 Q ss_pred HHHHHHHh-hc-----------------CCc--CCcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
346 QNQGLYIL-QQ-----------------AEI--LPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 346 ~nE~r~~l-g~-----------------~~~--~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) .+=+++.+ .+ .|+ +|.++. .+.+.+.+|+--.+++ T Consensus 476 ~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~---~~~~~~~~~~~~~p~~ 530 (564) T protein:vir:10 476 TEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVN---MLDDMEKQNQAFAPEL 530 (564) T ss_pred hHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhh---cCCCccCCCCcCCcch Confidence 44443322 11 111 111111 1111222222111111 No 252 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=46.69 E-value=0.73 Score=21.34 Aligned_cols=341 Identities=10% Similarity=0.013 Sum_probs=144.3 Q ss_pred CchhhhhhcCCcccccccccccchh--hcccccCCc-eec---hhhhhccHHHHHHHHHHHHhhhh--Cc----e---ee Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITDPE--FLATLNGSE-WVS---AETALKNSDLFSIISQLSNDLAT--AK----L---TT 65 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-~~~---~~~a~~~~~v~~~i~~ia~~ia~--~p----~---~~ 65 (383) =+.|+.++..|..+...|..+.+.. ....+.+.. ... ..+ +=.++-..|++.+|+.+-+ +| | .+ T Consensus 10 ~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~-~~dst~~~a~~~Las~l~~~ltpp~~~WF~l~~ 88 (559) T protein:vir:95 10 NKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTR-IIDSTGTMAARTLASGMMSGITSPARPWFRLAT 88 (559) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccc-cccchHHHHHHHHHHHHHHhhcCCCCccccccc Confidence 3445555555554444443332211 111111110 000 011 1233444566666655543 12 1 11 Q ss_pred ecch------hh-----------hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEc Q lcl|NC_018285. 66 SRKQ------MQ-----------GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRL 128 (383) Q Consensus 66 ~~~~------~~-----------~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~ 128 (383) .+.. .. ..+.+-| .+.-+..++.+++++||+.+++..+. 
++.+.+.+++..++-+..+ T Consensus 89 ~d~~~~e~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~~L~~~Gta~l~~~~d~-~~~~r~~~~~l~~~~v~~d 163 (559) T protein:vir:95 89 PDPEMMDYGPVKLWLEAVQNRMNDMFNKSN----LYQSLPQLYGSLGTYSTGAMAVLDDD-EDIIRTMPFPIGSYYLANS 163 (559) T ss_pred CCccccchHHHHHHHHHHHHHHHHHHHhcC----cHHHHHHHHHHHHhhCceeeEeecCC-CceeEEEEeecCeEEEeeC Confidence 1111 00 1112222 23334566789999999999887654 3455666676666666555 Q ss_pred CCCceeE-EE-Eeec----------------------Ccccccc---------------------------eeec--cc- Q lcl|NC_018285. 129 DNQNGLY-YN-VTFD----------------------DPRIPPK---------------------------QHVP--QS- 154 (383) Q Consensus 129 ~~~~~~~-y~-~~~~----------------------~~~~~~~---------------------------~~~~--~~- 154 (383) ..+.... |+ +... .+..... +.+. .+ T Consensus 164 ~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~pf~s~~~e~~~~~ 243 (559) T protein:vir:95 164 PRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDN 243 (559) T ss_pred CCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEeccccccccccccccceEEEEEEEecCCC Confidence 4443211 10 0000 0000000 0000 01 Q ss_pred ------------ceEEeccCCCCccccCcc-hHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHH Q lcl|NC_018285. 155 ------------DILHFRLLSVDGGLTSVS-PLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSR 221 (383) Q Consensus 155 ------------dvih~~~~~~~~~~~G~s-~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~ 221 (383) -.+..|....++..||.| |...+...+...+...+.......-...|..++..++..... T Consensus 244 ~~~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~------- 316 (559) T protein:vir:95 244 DKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQRA------- 316 (559) T ss_pred ceeeecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceeccccccccce------- Confidence 012222222356689999 899999999999999999888888888887766443221110 Q ss_pred HHHHhhcCCcceeecC--CC-ceeeecc-cChhhHHHHHHHHHHHHHHHHHhcCCHHH-hccccc-CcCHHHHHH--HHH Q lcl|NC_018285. 
222 SRQAMKQMQGGPLVLD--DL-EDFTPLE-IKSNVAQLLKQADWTTGQFAKVYGIPENV-VGGQGD-QQSSLEMSS--NVY 293 (383) Q Consensus 222 ~~~~~~~~~g~~~vl~--~g-~~~~~~~-~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~-lg~~~~-~~~~~e~~~--~~~ 293 (383) .-.+|++.+.+ .| -.++++. .++.-....+..+.....|-.+|-..+.+ ++.... .-+.+|... .=. T Consensus 317 -----~l~pgg~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~ 391 (559) T protein:vir:95 317 -----SLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEK 391 (559) T ss_pred -----eeeccceeeeCCCCCcccceeecccccchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHH Confidence 01234433332 11 2344432 23322222333466788999999987643 332221 223444332 223 Q ss_pred HHHHHHHHHHHHHHHHHhhcchhhccchhhhccCHHHHHHHHHHHHhCCCcCH------------------HHHHHHhhc Q lcl|NC_018285. 294 SKAVARYLRPFLSELSQKLSCDVDADIFPAVDPTGANYISRINSMVKSGTLAQ------------------NQGLYILQQ 355 (383) Q Consensus 294 ~~~l~P~~~~i~~~l~~~l~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~------------------nE~r~~lg~ 355 (383) ...|-|....+.++|-.-|+.+ .+.-|.+.|.+-+ ..+.+..+. T Consensus 392 ~~~LG~v~~rl~~E~l~Pli~r------------------~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~ 453 (559) T protein:vir:95 392 LLMLGPVLERLNDECLNPLIDR------------------SFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGL 453 (559) T ss_pred HHHhhHHHHHHHHHHHHHHHHH------------------HHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHH Confidence 3556777777766643322211 0111112221100 000000000 Q ss_pred CCcCCcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 356 AEILPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 356 ~~~~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) ..+ .+.+... 
.++.+-+-+--| T Consensus 454 ~~i----~~~~~~~--~~laq~~Pevld 475 (559) T protein:vir:95 454 SSL----ASTVNFI--GQLAQVKPEALD 475 (559) T ss_pred HHH----HHHHHHH--HHHhccChhhhh Confidence 000 0000000 011111111112 No 253 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=45.65 E-value=0.76 Score=21.23 Aligned_cols=378 Identities=14% Similarity=0.143 Sum_probs=152.1 Q ss_pred Cchhhhhhc---CC---------cccccccccccchhhcccccCCce--------e-chhhhhccHHHHHHHHHHHHhhh Q lcl|NC_018285. 1 MPIFNLATE---SP---------PNNQGGFFDITDPEFLATLNGSEW--------V-SAETALKNSDLFSIISQLSNDLA 59 (383) Q Consensus 1 Mglf~~~~~---~~---------~~~~~~~~~~~~~~~~~~~~~~~~--------~-~~~~a~~~~~v~~~i~~ia~~ia 59 (383) -.||....+ +- +.+..+.......++.+...+... + ..+..+.+|.|..||+-|.+.+. T Consensus 2 ~~lfgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai 81 (558) T protein:vir:10 2 AKLFGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEADGAIEDVVNEAI 81 (558) T ss_pred cchhcchhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 123333221 10 111111111111111111111000 0 24556789999999999998865 Q ss_pred hC-----ceeeecchh-------hhhccCCC---ccCCHHHHHHHHHHHHHHcCCeEEEEeecCC---CceeEEEEeccc Q lcl|NC_018285. 60 TA-----KLTTSRKQM-------QGIVDNPS---NSANRFNFYQSIFAQMLLGGEAFAYRWRNDN---GRDMKWEYLRPS 121 (383) Q Consensus 60 ~~-----p~~~~~~~~-------~~l~~~PN---~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~---g~~~~l~~l~~~ 121 (383) -+ |+.+.=++. ..+...-+ ..++...--...+..|++.|..|+.++.|.. .-+.+|.+|+|. 
T Consensus 82 v~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k~pk~GI~ELr~lDPr 161 (558) T protein:vir:10 82 VSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTKNPQEGIQDLRYIDPL 161 (558) T ss_pred EecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCcc Confidence 32 222211100 01111111 1112222224567788999999998877543 248899999999 Q ss_pred eeEEEEcCC---------------Cc-------eeEEEEeecCc---------ccccceeecccceEEeccC---CCCcc Q lcl|NC_018285. 122 QVSFNRLDN---------------QN-------GLYYNVTFDDP---------RIPPKQHVPQSDILHFRLL---SVDGG 167 (383) Q Consensus 122 ~v~~~~~~~---------------~~-------~~~y~~~~~~~---------~~~~~~~~~~~dvih~~~~---~~~~~ 167 (383) .+..++... .+ ..+|.|..... ..+....++.+- |++-+- +.++. T Consensus 162 ~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~dA-I~y~hSGL~d~~~~ 240 (558) T protein:vir:10 162 KIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIAKDS-ITMCTSGLVDRNKN 240 (558) T ss_pred cceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeechhh-eeeecccceecCCC Confidence 986654320 00 11222211100 011112333333 333321 12222 Q ss_pred ccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCCH-HHHHHHHHHHHHh-----hcCC-c------ce Q lcl|NC_018285. 168 LTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGLL-DFKTKVSRSRQAM-----KQMQ-G------GP 233 (383) Q Consensus 168 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~~-e~~~~~~~~~~~~-----~~~~-g------~~ 233 (383) .=+|-|..+.+.+....-++....=+.-.-+.-+-+.-.+ |.+.+ .+.+.++...... ++++ | +. T Consensus 241 -~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~ 319 (558) T protein:vir:10 241 -RVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKF 319 (558) T ss_pred -eeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchh Confidence 2357788888888887777777665555545444444333 33333 3333333322111 1111 1 11 Q ss_pred --------eec---CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCc--CHHHHH--HHHHHHHHH Q lcl|NC_018285. 
234 --------LVL---DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ--SSLEMS--SNVYSKAVA 298 (383) Q Consensus 234 --------~vl---~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~--~~~e~~--~~~~~~~l~ 298 (383) +.= +.|.+++.+.....-.+ ++-..+..+.++.+++||.+.|+..+..+ .+.|-. +.=+...|. T Consensus 320 msMlEDyWLpRReGgrgTEItTLpGgqnLge-m~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~ 398 (558) T protein:vir:10 320 MSMMEDFWLPRREGGRGTEITTLPGGQNLGE-LSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVG 398 (558) T ss_pred hhhHhhhcccccCCCCccceeeccccCCcch-HHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHH Confidence 110 23567776655443333 22335778899999999999997432211 111222 122333444 Q ss_pred HHHHHHHHHHHH----hhcc-----hhhcc-----chhhhccCHH--------HHHHHHHHHH-----hCCCcCHHHHHH Q lcl|NC_018285. 299 RYLRPFLSELSQ----KLSC-----DVDAD-----IFPAVDPTGA--------NYISRINSMV-----KSGTLAQNQGLY 351 (383) Q Consensus 299 P~~~~i~~~l~~----~l~~-----~~e~~-----~~~~~~~~~~--------~~~~~~~~l~-----~~g~~t~nE~r~ 351 (383) -+-..+...|.. .|.. .-+++ +...+..|.. -....++.+- -+-.++.+=+++ T Consensus 399 RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k 478 (558) T protein:vir:10 399 RLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRK 478 (558) T ss_pred HHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHH Confidence 444444444433 3321 11111 1111111110 0111111111 122333333333 Q ss_pred Hh-h-----------------cCCc-C-CcchhHHhCCCCCCCCCCCC-------CCCC Q lcl|NC_018285. 352 IL-Q-----------------QAEI-L-PKELPKGENPNRTILKGGET-------NGQD 383 (383) Q Consensus 352 ~l-g-----------------~~~~-~-~~d~~~~~~~~~~~~~ggd~-------~~~d 383 (383) .+ . ..|+ . |.+..-+.... .| ++||. 
.+.| T Consensus 479 ~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~ 535 (558) T protein:vir:10 479 RVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEP-LP-QEGDPAMEGMGEQPVD 535 (558) T ss_pred HHhccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccc-cC-ccCCchhccCCCCCcc Confidence 21 1 1121 1 11111111000 00 01111 1111 No 254 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=45.11 E-value=0.78 Score=21.17 Aligned_cols=363 Identities=9% Similarity=0.019 Sum_probs=138.5 Q ss_pred CchhhhhhcCCcccccccccccc-----hh----hccc---ccCCceechhhhhccHHHHHHHHHHHHhhhhC--ce--- Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITD-----PE----FLAT---LNGSEWVSAETALKNSDLFSIISQLSNDLATA--KL--- 63 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~-----~~----~~~~---~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~--p~--- 63 (383) -+.|+.++..|......|....+ +. +.+. +.+......+.-+-.+++..+++.|+..+.+. |= T Consensus 30 ~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ki~~~~~~~~~~~l~s~Lm~~~~p~~~w 109 (641) T protein:vir:94 30 ISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHTFEVVETLVAYFKGATFPSDDW 109 (641) T ss_pred HHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccccccchhHHHHHHHHhhHHhhhhcCCCce Confidence 45555565555443333322211 00 0000 00111111122245677888888888777663 21 Q ss_pred -ee--ecch----hh----hhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeec------------CCCce-------- Q lcl|NC_018285. 64 -TT--SRKQ----MQ----GIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRN------------DNGRD-------- 112 (383) Q Consensus 64 -~~--~~~~----~~----~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~------------~~g~~-------- 112 (383) ++ .... +. .++..-+.. 
.+++-...++.+.+.+|++++.+-++ ..|.+ T Consensus 110 f~~~p~~~ed~~~A~~~~~~~~~~l~~~-~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (641) T protein:vir:94 110 FDLKGMVPELADAARVVKQLTKTKLEAA-SIRDIFETYVRNLVLYGVSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVA 188 (641) T ss_pred EEEecCCCChHHHHHHHHHHHHHHHhhc-chHHHHHHHHHHHhhcCceEEEeehhhHHHHhhhhhcccchhhcccccccc Confidence 11 1100 11 112222221 23455567888999999998765432 11211 Q ss_pred -------eEEEEeccceeEEEEcC---CCcee------------------------------------------------ Q lcl|NC_018285. 113 -------MKWEYLRPSQVSFNRLD---NQNGL------------------------------------------------ 134 (383) Q Consensus 113 -------~~l~~l~~~~v~~~~~~---~~~~~------------------------------------------------ 134 (383) +...||+|..+-+.... +.... T Consensus 189 v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~ 268 (641) T protein:vir:94 189 VNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELHELVTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDTS 268 (641) T ss_pred eecccceeeEEecchhheeecCCCCcccccceehhhhHHHHHHHHhcCCCChhhcchhhccccccccccccccccccccc Confidence 12333333322110000 00000 Q ss_pred ---EEEEe--ecCcccc---cceeecccceEE--------------eccCCCCccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 135 ---YYNVT--FDDPRIP---PKQHVPQSDILH--------------FRLLSVDGGLTSVSPLMALGRELDIQKASDKLTL 192 (383) Q Consensus 135 ---~y~~~--~~~~~~~---~~~~~~~~dvih--------------~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 192 (383) .|.+. +...+.. ....+....|+| ++.....+..||.||...+...+.....+.+... T Consensus 269 ~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~l 348 (641) T protein:vir:94 269 GWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRL 348 (641) T ss_pred ccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEecceecCCcccCCChHHHHHHHHHHHHHHHHHHH Confidence 00000 0000000 001112222333 3322234568999999999999999999998888 Q ss_pred HHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeecCCCceeeecccChhhHHH-HHHHHHHHHHHHHHhcC Q lcl|NC_018285. 
193 NSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQL-LKQADWTTGQFAKVYGI 271 (383) Q Consensus 193 ~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~~~~d~~~-~e~~~~~~~~Ia~~~gV 271 (383) ....-...|..++..++..+++.. ..+.|+++..+....+.++.....+.+. .+..+.....|-.+|++ T Consensus 349 d~~~~~~~p~~~~~~~~~~~~~~l----------~~~PG~ii~~~~~~~v~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~ 418 (641) T protein:vir:94 349 DNLVLHINKMWTLVEDGILKREDV----------KAKPGAVFKVAQHGSLQPIDMGRQDFVVTYQEAQVQESSVYRNTST 418 (641) T ss_pred HHHHHHhCCeeeecccccccccee----------eccCCcceeeCCCCcceeecCCccccchhHHHHHHHHHHHHHhhhh Confidence 777777777777766655554222 2345666555444445554322112111 12233444567788998 Q ss_pred CHHHhccccc---CcCHHHHHH--HHHHHHHHHHHHHHHHHHHHhhcch-hhc--------------cc----hhhhccC Q lcl|NC_018285. 272 PENVVGGQGD---QQSSLEMSS--NVYSKAVARYLRPFLSELSQKLSCD-VDA--------------DI----FPAVDPT 327 (383) Q Consensus 272 pp~~lg~~~~---~~~~~e~~~--~~~~~~l~P~~~~i~~~l~~~l~~~-~e~--------------~~----~~~~~~~ 327 (383) ...+-+.... .-+..|... .=...-+.++++.+++++-.-|+.. ++. .. ...+... T Consensus 419 ~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~ 498 (641) T protein:vir:94 419 GPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVS 498 (641) T ss_pred hhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCC Confidence 8654433211 112222221 1122234444444443333222211 000 00 0000001 Q ss_pred HHHHHHHHHHHHhCCCcCHHHH-HHHhhcCCcCCcchhHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 328 GANYISRINSMVKSGTLAQNQG-LYILQQAEILPKELPKGENPNRTILKGGETNGQD 383 (383) Q Consensus 328 ~~~~~~~~~~l~~~g~~t~nE~-r~~lg~~~~~~~d~~~~~~~~~~~~~ggd~~~~d 383 (383) +.+...+++ .+.-|....-+- .+...+..+ .......|. 
=+|.-+-| T Consensus 499 p~~L~~~~~-iv~l~~~q~~~~~~~i~~l~~~-------~~~~a~~P~-v~d~~d~~ 546 (641) T protein:vir:94 499 PEYLHYPYK-FLALGANYVVERERMVTDLLQL-------LDISGRVPQ-IGQSLDYA 546 (641) T ss_pred ccceeeeee-EeecchhHHHHHHHHHHHHHHH-------HHHhhcChh-hhhcCCHH Confidence 111100000 000010000000 000000000 000000000 01111111 No 255 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=36.82 E-value=1.2 Score=20.25 Aligned_cols=347 Identities=9% Similarity=0.032 Sum_probs=142.1 Q ss_pred CchhhhhhcCCcccccccccc---cchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhC------ce-e--eecc Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDI---TDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA------KL-T--TSRK 68 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~------p~-~--~~~~ 68 (383) =+.|+.++..|..+...|..+ +.|.++..-.+... ..+.+ .++--.|++.+|+.+-+. || + +.+. T Consensus 17 ~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~--~~~~~-dstg~~a~~~LAa~l~~~ltpp~~~WF~L~~~~~ 93 (516) T protein:vir:96 17 PKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDNET--SQNGW-QGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAQ 93 (516) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCCccc--cCCcc-cchHHHHHHHHHHHHHhhhcCCCCcccccccChh Confidence 345566666554444444332 23332221111111 11222 344456666666655431 22 1 1110 Q ss_pred hh-------------hh--------hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEE Q lcl|NC_018285. 69 QM-------------QG--------IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNR 127 (383) Q Consensus 69 ~~-------------~~--------l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~ 127 (383) .. +. +...-+. -+.+.-+..++.+++.+|||.+++.. ++ ....|||.. +-+.. 
T Consensus 94 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~-snf~~~~~~~~~~L~~~G~a~l~~d~--~~-~~~~~pl~~--y~v~~ 167 (516) T protein:vir:96 94 GEKVLNQRGLKKTELATIFAQVETRAMKELEQ-RQFRPAVVEAFKHLIVAGSCMLYKPS--KG-AISAIPMHH--YVVNR 167 (516) T ss_pred HHhhccccCchhHHHHHHHHHHHHHHHHHHHh-cCcHHHHHHHHHHHHhHCeEeEEecC--CC-CEEEEEcCe--EEEee Confidence 00 00 1101111 12333345666888899999887643 33 244566532 22222 Q ss_pred cCCCce-------------------------------------eEE-------------EEeecCccccccee--ecccc Q lcl|NC_018285. 128 LDNQNG-------------------------------------LYY-------------NVTFDDPRIPPKQH--VPQSD 155 (383) Q Consensus 128 ~~~~~~-------------------------------------~~y-------------~~~~~~~~~~~~~~--~~~~d 155 (383) +..|.. ..| ....++...+..-. |...- T Consensus 168 d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~d~~~~~~es~~~~~e~P 247 (516) T protein:vir:96 168 DTNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWELKQSADDIPVGKVSKIKSEKLP 247 (516) T ss_pred CCCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCceeEEEEEeCceeeccccccccccCC Confidence 222211 001 00000000000000 11122 Q ss_pred eEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceee Q lcl|NC_018285. 156 ILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLV 235 (383) Q Consensus 156 vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~v 235 (383) .+..|....++..||.||..-+...+...+.+.+.......-...|.+++..++........ ....+.++ T Consensus 248 ~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~----------~~~~g~i~ 317 (516) T protein:vir:96 248 FIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFV----------NSGTGEVV 317 (516) T ss_pred eeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhc----------cCCCceee Confidence 34444334456789999999999999999999998888777777777766665544332111 11112222 Q ss_pred cCCCceeeeccc-ChhhHH-HHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--HHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018285. 
236 LDDLEDFTPLEI-KSNVAQ-LLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--NVYSKAVARYLRPFLSELSQK 311 (383) Q Consensus 236 l~~g~~~~~~~~-~~~d~~-~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~~~~~~l~P~~~~i~~~l~~~ 311 (383) -+..-.+.++.. +..|.+ ..+..+..+..|-.+|-+.....-. +..-+.+|... .=....|-|.+..+.++|=.- T Consensus 318 ~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r~-~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~P 396 (516) T protein:vir:96 318 TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRD-AERVTAVEIQRDALEIEQNMGGVYSLFATTMQSP 396 (516) T ss_pred cCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhccCC-CccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHH Confidence 222222333322 223333 2455677788898888765322211 12223444432 234456777777777775433 Q ss_pred hcchhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhH-HhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 312 LSCDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPK-GENPNRTILKGGETNGQD 383 (383) Q Consensus 312 l~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~-~~~~~~~~~~ggd~~~~d 383 (383) |+...-... .+..- ..+++-.+.+.-....+.. .-..+.. ..... .+.+++.+=-| T Consensus 397 li~r~l~~~----~p~lp------~~~v~~~~vs~l~~l~r~~----~~~~i~~~~~~i~--~~~~~~p~v~d 453 (516) T protein:vir:96 397 VAMWGLLEA----GESFT------SDLVDPVIITGIEALGRMA----ELDKLANFAQYMS--LPLQWPEPVLA 453 (516) T ss_pred HHHHHHHhc----CCCCc------cccccceeechHHHHHHHH----HHHHHHHHHHHHH--HHhcCChhHHh Confidence 321100000 00000 0001111111111110000 0000111 11111 11222222112 No 256 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=33.89 E-value=1.3 Score=19.91 Aligned_cols=357 Identities=10% Similarity=0.011 Sum_probs=153.0 Q ss_pred Cchh---------------------hhhhcCCccccccccc-----ccchhhcccccCCceech-----hhhhccHHHHH Q lcl|NC_018285. 
1 MPIF---------------------NLATESPPNNQGGFFD-----ITDPEFLATLNGSEWVSA-----ETALKNSDLFS 49 (383) Q Consensus 1 Mglf---------------------~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~-----~~a~~~~~v~~ 49 (383) |..- ...+.+....-+.+.. -.++.+..... ...-.. +.|.=.++... T Consensus 14 m~V~~~hp~y~a~~~~W~~~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~-~~~~~y~~~~~~rA~~~n~~~~ 92 (488) T protein:vir:96 14 MLTPIYHPDYLVNAPQWLRNLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAA-KIEKDWEDLTWRLANYVNIVNP 92 (488) T ss_pred ecccccCHHHHHHhhhhhHhhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhc-cchhhhHhhhhhccccCchhHH Confidence 2210 0111000000000000 00000000000 000000 13444566677 Q ss_pred HHHHHHHhhhhCceeeecch---hhhhccCCC-ccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCc-----------eeE Q lcl|NC_018285. 50 IISQLSNDLATAKLTTSRKQ---MQGIVDNPS-NSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGR-----------DMK 114 (383) Q Consensus 50 ~i~~ia~~ia~~p~~~~~~~---~~~l~~~PN-~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~-----------~~~ 114 (383) +++.++..+-+-|..+.... ...+..... ...+-.+|.+.++...+.+|-+++++.....+. |. T Consensus 93 tl~~l~G~vfrk~p~~~~~~~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ade~~~~~rPy- 171 (488) T protein:vir:96 93 TMNAITGAVMRREPEFDTMDNPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESATMADWNKGKKLPT- 171 (488) T ss_pred HHHHhcchhhccCceeccCCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCHHHHHHhcCCcE- Confidence 77777777777776664332 233444443 457789999999999999999999988654322 21 Q ss_pred EEEecccee-----------------EEE---EcCCCc----eeEEEEe-ecCc-----------ccccceeeccc---- Q lcl|NC_018285. 115 WEYLRPSQV-----------------SFN---RLDNQN----GLYYNVT-FDDP-----------RIPPKQHVPQS---- 154 (383) Q Consensus 115 l~~l~~~~v-----------------~~~---~~~~~~----~~~y~~~-~~~~-----------~~~~~~~~~~~---- 154 (383) +..+.|..| .+. ...++. ...+++. ..++ .....+..... 
T Consensus 172 ~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~~~~~~e~~~~~~g~~~l 251 (488) T protein:vir:96 172 AAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTDDEYSDEWTPVLINSKQS 251 (488) T ss_pred EEEechhhhcCcceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEecCCcccceEeecCCCccc Confidence 222222111 110 011110 1111110 1110 00000000000 Q ss_pred ---ceEEeccCCCCccccCcchHHHHHH-HHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCC Q lcl|NC_018285. 155 ---DILHFRLLSVDGGLTSVSPLMALGR-ELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQ 230 (383) Q Consensus 155 ---dvih~~~~~~~~~~~G~s~~~~~~~-~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~ 230 (383) .++++- ....+...|.||+..+.. .+........+....+ ..+.|..++... ..+++....... .+..-. T Consensus 252 ~~IP~v~~~-~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~-~~~~p~lv~~~~-~~~~~~~~~~~~---~g~~~~ 325 (488) T protein:vir:96 252 DTIPFFLAS-SQSNEWCIDSTPLTSLAEISLSIYVMNAYSNKAMI-LANEAKWMVDMG-DMNKTMASEMNP---LGFTLA 325 (488) T ss_pred CeeEEEEEe-cCCCCCCCCCCchHHHHHHHHHHHhhhhHHHHHHH-hcCCceeeeccC-CCCccccccccc---ceeeec Confidence 122221 122344557777766544 6666666666655444 555666665433 234333222211 111111 Q ss_pred c-ceeecCCC-ceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHH--HHHHHHHHHHHHHHHHH Q lcl|NC_018285. 231 G-GPLVLDDL-EDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMS--SNVYSKAVARYLRPFLS 306 (383) Q Consensus 231 g-~~~vl~~g-~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~--~~~~~~~l~P~~~~i~~ 306 (383) . -+...+.| ++|.+.+.+..- .+..+...++.+. .| ..++.-. .+.+.++.. 
...-+..|.-++..+++ T Consensus 326 ~~~~~~~~~g~~~~~e~~~~~l~---~~~l~~l~~qm~~-~G--a~l~~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~le~ 398 (488) T protein:vir:96 326 GRMPYYVKNGDVKVIQAQFSPET---ENKVEKLFEQAVK-VG--ASLFTQQ-SNETATGAAIRSGSSTASMATLGNNVED 398 (488) T ss_pred ccccccccCCceeecCCchhHHH---HHHHHHHHHHHHH-Hh--HhhccCC-CcchHHHHHHHHHHhhHHHHHHHHHHHH Confidence 1 12333334 566665554321 1222222222211 11 1222211 111222222 22345667778888888 Q ss_pred HHHHhhcc---------------hhhccchhhhc---cCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcch----- Q lcl|NC_018285. 307 ELSQKLSC---------------DVDADIFPAVD---PTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKEL----- 363 (383) Q Consensus 307 ~l~~~l~~---------------~~e~~~~~~~~---~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~----- 363 (383) +++..|-- ..++++...|. .+.. ....+.+++.+|.++..+.++.|..-++...|. T Consensus 399 al~~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~~~ld~~-~~~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~~e~~ 477 (488) T protein:vir:96 399 TVRNMLRFIMRYFEGTNLYVNPDELVFKLNRDYFDVEVNPQ-MLQVAYAAMMEGNLPQVSWFELLKRARVVRGDMSKEEF 477 (488) T ss_pred HHHHHHHHHHHHcCCCCCCcCccceEEEeccCCCCccCCHH-HHHHHHHHHhcCCCCHHHHHHHHHhCCcCCccCCHHHH Confidence 88765421 12344443332 2333 345666778899999999999998888764432 Q ss_pred -hHHhCCCCCCC Q lcl|NC_018285. 364 -PKGENPNRTIL 374 (383) Q Consensus 364 -~~~~~~~~~~~ 374 (383) .+++..+. .+ T Consensus 478 ~~~ie~~g~-~~ 488 (488) T protein:vir:96 478 DEHIAELGF-GM 488 (488) T ss_pred HHHHhhcCC-CC Confidence 22222111 11 No 257 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=32.60 E-value=1.4 Score=19.76 Aligned_cols=356 Identities=9% Similarity=0.027 Sum_probs=135.9 Q ss_pred CchhhhhhcCCcc--cccccccccc-----hhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhCceeeecch---h Q lcl|NC_018285. 
1 MPIFNLATESPPN--NQGGFFDITD-----PEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATAKLTTSRKQ---M 70 (383) Q Consensus 1 Mglf~~~~~~~~~--~~~~~~~~~~-----~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~---~ 70 (383) -+|.+....++.. ....+..... ..............+..=+.+.-..-.|+..+.-+-+-|+...-.+ . T Consensus 7 ~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~~~~~~~ 86 (451) T protein:vir:10 7 RAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFDIDNNKEL 86 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceeecCCcHHH Confidence 2222221111000 0000000000 0000000000000000001122333455555555555666543211 1 Q ss_pred hhhccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCC-------ceeEEEEeccceeEEEEcCCC-ce----eEEEE Q lcl|NC_018285. 71 QGIVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNG-------RDMKWEYLRPSQVSFNRLDNQ-NG----LYYNV 138 (383) Q Consensus 71 ~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g-------~~~~l~~l~~~~v~~~~~~~~-~~----~~y~~ 138 (383) ..++..-. ..........+..++..+|.||.++.++.+. ....+..++|..+-+..++.. +. ++|.. T Consensus 87 ~~~~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~~~~~~~~~ir~~~ 165 (451) T protein:vir:10 87 NEKVTDVL-GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNGIERELEAVIRYYI 165 (451) T ss_pred HHHHHHHh-ccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 11221111 1234555667788899999999998887641 123466677777766554322 11 11111 Q ss_pred eecCccc-------ccceeecccceEEeccC---------------CC---------CccccCcchHHHHHHHHHHHHHH Q lcl|NC_018285. 139 TFDDPRI-------PPKQHVPQSDILHFRLL---------------SV---------DGGLTSVSPLMALGRELDIQKAS 187 (383) Q Consensus 139 ~~~~~~~-------~~~~~~~~~dvih~~~~---------------~~---------~~~~~G~s~~~~~~~~i~~~~~~ 187 (383) ...+..+ .....+..+.+.+++.. 
++ ...-.|.|-+..+...++....+ T Consensus 166 ~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~~d~e~v~~liDa~~~~ 245 (451) T protein:vir:10 166 QLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEFSNNIKKQSDLSKYKKILDLYDRV 245 (451) T ss_pred eeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEeccCCCCCCchhhHHHHHHHHHHH Confidence 1000000 00011222223222210 00 00123566666666666555554 Q ss_pred HHHHHHHHhccCCcceeEee-cCCCCHHHHHHHHHHHHHhhcCCcceeecC-------CCceeeecccChhhHHHHHHHH Q lcl|NC_018285. 188 DKLTLNSLKNALNANGILKI-KGGGLLDFKTKVSRSRQAMKQMQGGPLVLD-------DLEDFTPLEIKSNVAQLLKQAD 259 (383) Q Consensus 188 ~~~~~~~~~ng~~~~~i~~~-~~~~~~e~~~~~~~~~~~~~~~~g~~~vl~-------~g~~~~~~~~~~~d~~~~e~~~ 259 (383) ..-..+.+.-.+.|-.+++. .+...++....++. .+++.++ ++++|..... ....+.+..+ T Consensus 246 ~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~---------~~~i~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~ 314 (451) T protein:vir:10 246 MSGFANDLEDIQQIIYILENFGGEDTSEFLKELKR---------YKTIKTETDSEGDSGGLKTMQIEI--PTEARKIILE 314 (451) T ss_pred HHHHHHHHHHhccceeeeecCCcccchhhHHHHhh---------CCeEEecCcCCccCCcceEEeecC--CHHHHHHHHH Confidence 44444444444555555443 22222332222211 1222222 3455544333 3445566777 Q ss_pred HHHHHHHHHhcCCHHHhcccccCcCHHHHHHH--------------HHHHHHHHHHHHHHHHHHHhhcchhhccchhhhc Q lcl|NC_018285. 260 WTTGQFAKVYGIPENVVGGQGDQQSSLEMSSN--------------VYSKAVARYLRPFLSELSQKLSCDVDADIFPAVD 325 (383) Q Consensus 260 ~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~~--------------~~~~~l~P~~~~i~~~l~~~l~~~~e~~~~~~~~ 325 (383) ...+.|...-++|..--...+ +.+ ..+.+. 
.+...+.-.++.+...++..=..++++.....+- T Consensus 315 ~l~~~I~~~s~~p~~~~~~~g-n~S-g~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~d~~~i~i~f~~~~p 392 (451) T protein:vir:10 315 ILKKQIYESGQGLQQDTENFG-NAS-GVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGVTDYKKIQQTYTRNMM 392 (451) T ss_pred HHHHHHHHHhCcccccccccc-ccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccceeEEecCCCC Confidence 888889988888842111111 122 112111 1222222222222221110000011111122222 Q ss_pred cCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCc--chhHH------------hCCCCCCCCCCC Q lcl|NC_018285. 326 PTGANYISRINSMVKSGTLAQNQGLYILQQAEILPK--ELPKG------------ENPNRTILKGGE 378 (383) Q Consensus 326 ~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~--d~~~~------------~~~~~~~~~ggd 378 (383) .+..+.+..+.++ .|+++.-.+.+.++. ++.. +..+. .+++.. +| T Consensus 393 ~n~~e~~~~~~kl--~g~iS~et~~~~~p~--v~d~~~e~~~~~ee~~~~~~~~~~~~~~~----~~ 451 (451) T protein:vir:10 393 SNDLEDADIATKS--VGIIPTKIILRHHPW--VDDVEEAEKLYLEEKKIQASKVSDDYNNF----TE 451 (451) T ss_pred CCHHHHHHHHHHH--hccCchHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhcCCC----CC Confidence 3455666666666 388999888777632 2211 11111 111111 11 No 258 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=30.29 E-value=1.6 Score=19.48 Aligned_cols=345 Identities=11% Similarity=0.087 Sum_probs=140.5 Q ss_pred Cc-----hhhhhhcCCcccccccccc---cchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhC------ce--- Q lcl|NC_018285. 1 MP-----IFNLATESPPNNQGGFFDI---TDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA------KL--- 63 (383) Q Consensus 1 Mg-----lf~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~------p~--- 63 (383) |+ .|+.++.+|..+...|..+ +.|..+........-...+.+ .++-..|++.+|+.+-+. 
|| T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~~~ltpp~~~WF~l 79 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPW-QSVGSKGVNVLASKLMLSLFPVNTSFFKL 79 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccc-cccHHHHHHHHHHHHHHhhcCCCCccccc Confidence 54 4555666554444333332 223222211111000111222 334445666666655431 22 Q ss_pred eeecchhhh---------------------hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccce Q lcl|NC_018285. 64 TTSRKQMQG---------------------IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQ 122 (383) Q Consensus 64 ~~~~~~~~~---------------------l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~ 122 (383) .+.+..... +...-+ .-+.+.-+..+..+++.+||+.+++..+ +...|||.. T Consensus 80 ~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~G~a~ly~~~~----~~~~~pl~~-- 152 (555) T protein:vir:17 80 QINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIA-ESSDRVHLEMAMKHLIVTGNALLYQGKK----NLKLYPLDR-- 152 (555) T ss_pred ccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhHCeEEEEecCC----ceeEEEcCe-- Confidence 222111110 000011 1123344456678889999998876432 244555532 Q ss_pred eEEEEcCCCcee--EEEEe------------------------------------------------------------- Q lcl|NC_018285. 123 VSFNRLDNQNGL--YYNVT------------------------------------------------------------- 139 (383) Q Consensus 123 v~~~~~~~~~~~--~y~~~------------------------------------------------------------- 139 (383) +-+..+..|... +.++. T Consensus 153 y~v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~~~~~~~ 232 (555) T protein:vir:17 153 FVVSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCRKDGQV 232 (555) T ss_pred EEEeeCCCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeecccccCCee Confidence 223333222111 00000 Q ss_pred -----ecCccc-c--cceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCC Q lcl|NC_018285. 
140 -----FDDPRI-P--PKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGG 211 (383) Q Consensus 140 -----~~~~~~-~--~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~ 211 (383) .++... + ....|...-.+..|....++..||.||...+...+...+.+.+.......-...|..++..++.. T Consensus 233 ~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~ 312 (555) T protein:vir:17 233 KWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATT 312 (555) T ss_pred EEEEecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc Confidence 000000 0 00011112234455444567789999999999999999999999998888888888888666554 Q ss_pred CHHHHHHHHHHHHHhhcCCcceeecCCCceeeeccc-ChhhHHH-HHHHHHHHHHHHHHhcCCHHHhccc-ccCcCHHHH Q lcl|NC_018285. 212 LLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEI-KSNVAQL-LKQADWTTGQFAKVYGIPENVVGGQ-GDQQSSLEM 288 (383) Q Consensus 212 ~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~~~~-~~~d~~~-~e~~~~~~~~Ia~~~gVpp~~lg~~-~~~~~~~e~ 288 (383) ..... ..+..+.++.+..-.+.++.. ++.+.+. .+..+..+..|-.+|.+ ++.. +..-+.+|. T Consensus 313 ~~~~l----------~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~----~~~~d~~r~TAtEV 378 (555) T protein:vir:17 313 KPQNL----------ALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLM----LQVRQSERTTATEV 378 (555) T ss_pred Cccee----------ecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhh----cCCCCcccchHHHH Confidence 43221 112223333222222333322 2334442 34445556777778754 2211 111233333 Q ss_pred HH--HHHHHHHHHHHHHHHHHHHHhhcch-h----hccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCc Q lcl|NC_018285. 289 SS--NVYSKAVARYLRPFLSELSQKLSCD-V----DADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPK 361 (383) Q Consensus 289 ~~--~~~~~~l~P~~~~i~~~l~~~l~~~-~----e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~ 361 (383) .. .=....|-|.+..+.++|=.-|+.+ + +....+.+-. .+++..+.+.-++...... +. . 
T Consensus 379 ~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~----------~~v~~~i~~~l~~l~r~~~--~~-~ 445 (555) T protein:vir:17 379 QATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPK----------DLVQPTVVAGLWGVGRGQD--KQ-Q 445 (555) T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCH----------hhhccceeehHHHHHHHHH--HH-H Confidence 22 2334566666666666653322211 0 0011000000 0111112221111100000 00 0 Q ss_pred chhHHhCCCCCCCCCCCC------CCCC Q lcl|NC_018285. 362 ELPKGENPNRTILKGGET------NGQD 383 (383) Q Consensus 362 d~~~~~~~~~~~~~ggd~------~~~d 383 (383) =........ .-+|+. |..+ T Consensus 446 l~~~~~~la---q~~~~p~~~d~id~d~ 470 (555) T protein:vir:17 446 LMEFITTLA---QTMGPEIAMKYINPTE 470 (555) T ss_pred HHHHHHHHH---hhcCchhHhhcCCHHH Confidence 000011110 001221 1111 No 259 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=28.99 E-value=1.7 Score=19.32 Aligned_cols=375 Identities=13% Similarity=0.135 Sum_probs=160.4 Q ss_pred Cc-----hhhhhhcC-----------------Ccccccccccccchhhccc---------ccCCc-e--------e-chh Q lcl|NC_018285. 1 MP-----IFNLATES-----------------PPNNQGGFFDITDPEFLAT---------LNGSE-W--------V-SAE 39 (383) Q Consensus 1 Mg-----lf~~~~~~-----------------~~~~~~~~~~~~~~~~~~~---------~~~~~-~--------~-~~~ 39 (383) |. ||....+. ++....+..........+. +-++. . + ..+ T Consensus 1 m~f~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR 80 (523) T protein:vir:68 1 MKFNILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYR 80 (523) T ss_pred CCCchhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHH Confidence 33 34332220 0111111000000000010 00000 0 0 245 Q ss_pred hhhccHHHHHHHHHHHHhhhhC-----ceeeecchh--------------hhhccCCCccCCHHHHHHHHHHHHHHcCCe Q lcl|NC_018285. 
40 TALKNSDLFSIISQLSNDLATA-----KLTTSRKQM--------------QGIVDNPSNSANRFNFYQSIFAQMLLGGEA 100 (383) Q Consensus 40 ~a~~~~~v~~~i~~ia~~ia~~-----p~~~~~~~~--------------~~l~~~PN~~~t~~~f~~~~~~~~~l~G~a 100 (383) ..+.+|.|..||+-|.+.+.-+ |+.+.=+.. +.++.--+....+ ...+..|++.|.. T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~----~~~fR~WYVDgRi 156 (523) T protein:vir:68 81 NLMTNYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKG----SDHFRRWYVDSRI 156 (523) T ss_pred HHhhccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhh----hHHHHhheeeeEE Confidence 5678999999999999886533 222211100 1111112222233 4556788899999 Q ss_pred EEEEeecCC---CceeEEEEeccceeEEEEc-----CCCc------eeEEEEee--cCc-------ccccceeecccceE Q lcl|NC_018285. 101 FAYRWRNDN---GRDMKWEYLRPSQVSFNRL-----DNQN------GLYYNVTF--DDP-------RIPPKQHVPQSDIL 157 (383) Q Consensus 101 ~~~i~r~~~---g~~~~l~~l~~~~v~~~~~-----~~~~------~~~y~~~~--~~~-------~~~~~~~~~~~dvi 157 (383) |+.++.|.. .-+.+|.+|+|..|..++. +.+. ..+|-+.. ..+ ..+....++.+-|. T Consensus 157 ~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~ 236 (523) T protein:vir:68 157 FFHKIIDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKAAIV 236 (523) T ss_pred EEEEEeeCCCccccceeeeeeCCcceeEEEeecCCCCcchhhhhhhhhheeeccccccccccccccCCCcceecchhhee Confidence 998877633 2488999999999876432 1111 11122211 100 11223445555555 Q ss_pred EeccCCCC-ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCC-HHHHHHHHHHHHHh-----hc- Q lcl|NC_018285. 158 HFRLLSVD-GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGL-LDFKTKVSRSRQAM-----KQ- 228 (383) Q Consensus 158 h~~~~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~-~e~~~~~~~~~~~~-----~~- 228 (383) |....-.+ +.-.=+|-|..+.+.+....-++....=+.-.-+.-+-+.-.+ |.+. ..+.+.++...... +. 
T Consensus 237 y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa 316 (523) T protein:vir:68 237 YAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDA 316 (523) T ss_pred eeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEec Confidence 44321111 1122367788888888887777777665555545444444333 3333 33333333322111 11 Q ss_pred ------CCcce--------eec---CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc-cc---CcCHHH Q lcl|NC_018285. 229 ------MQGGP--------LVL---DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ-GD---QQSSLE 287 (383) Q Consensus 229 ------~~g~~--------~vl---~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~-~~---~~~~~e 287 (383) +..+. +.= +.|.+++.+.....-.+ ++-..+....+..+++||.+-|... +. +..++= T Consensus 317 ~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EI 395 (523) T protein:vir:68 317 TTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGN-MEDVRWFRNALYMALRIPITRIPSDQGGIQFDAGTSI 395 (523) T ss_pred cCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCcceeecCCCcceecccccch Confidence 11111 111 23567776655433333 2333577889999999999999532 21 112211 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHH----hhc-----chhhcc-----chhhhccCHH--------HHHHHHHHHH----- Q lcl|NC_018285. 288 MS-SNVYSKAVARYLRPFLSELSQ----KLS-----CDVDAD-----IFPAVDPTGA--------NYISRINSMV----- 339 (383) Q Consensus 288 ~~-~~~~~~~l~P~~~~i~~~l~~----~l~-----~~~e~~-----~~~~~~~~~~--------~~~~~~~~l~----- 339 (383) .. +.=+...|.-+-..+...|.. .|. +.-|++ +...+..|.. -....++.+- T Consensus 396 tRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpy 475 (523) T protein:vir:68 396 TRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPF 475 (523) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhh Confidence 11 122333444444444444433 332 111221 1111111111 1111222111 Q ss_pred hCCCcCHHHHHHHh-hcCCcCCcchhHH-----hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
340 KSGTLAQNQGLYIL-QQAEILPKELPKG-----ENPNRTILKGGETNGQD 383 (383) Q Consensus 340 ~~g~~t~nE~r~~l-g~~~~~~~d~~~~-----~~~~~~~~~ggd~~~~d 383 (383) -+-.++.+=+++.+ .+. ..|+... +.......+--+.+.+| T Consensus 476 vGky~s~~yi~k~ILr~t---Deei~~~~kqI~~E~k~~~~~~p~~e~~~ 522 (523) T protein:vir:68 476 IGKYISHRTAMKDILQMS---DEEIEQEAKQIEEESKEARFQDPDQEQED 522 (523) T ss_pred hcccchhHHHHHHHhccC---HHHHHHHHHHHHHHhhcCCCCCCchhhhc Confidence 13455666666543 322 2222211 11111111122333344 No 260 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=27.89 E-value=1.8 Score=19.18 Aligned_cols=336 Identities=13% Similarity=0.070 Sum_probs=147.1 Q ss_pred CchhhhhhcCCcccccccccccc---hh---hcccccCCceechhhhhccHHHHHHHHHHHHhhhhC--c----e---ee Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITD---PE---FLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA--K----L---TT 65 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~---~~---~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~--p----~---~~ 65 (383) =+-|+.++..|..+..-|..+.+ |. ++..-.....-...+ +=.++-..|++.+|+.+-+. | | .+ T Consensus 11 ~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~-~~dst~~~a~~~LAa~L~~~ltpp~~~WF~l~~ 89 (555) T protein:vir:10 11 LSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNN-ILDNTGTRALRVLAAGMMAGMTSPARPWFRLTT 89 (555) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccc-cccccHHHHHHHHHHHHHHhhcCCCCccccccc Confidence 34455555555444433333221 11 111000000000011 12344556666666665441 2 1 12 Q ss_pred ecchh------hh--------hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC Q lcl|NC_018285. 66 SRKQM------QG--------IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ 131 (383) Q Consensus 66 ~~~~~------~~--------l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~ 131 (383) .+... .. +...-+ .-+.+.-+..+..+++.+|||.+++..+.. 
..+.+.+++..++-+..+..+ T Consensus 90 ~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~-~~~rf~~~pl~~~~v~~d~~G 167 (555) T protein:vir:10 90 SIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPDFD-AVVYHHSLTAGEYAIAADNQG 167 (555) T ss_pred CcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecCCC-ceEEEEEeecceeEEeeCCCC Confidence 11110 00 111111 112333345677899999999999876653 445566666666655554444 Q ss_pred ceeE-EE-Eeec----------------------Ccccccc---------------------------ee---------- Q lcl|NC_018285. 132 NGLY-YN-VTFD----------------------DPRIPPK---------------------------QH---------- 150 (383) Q Consensus 132 ~~~~-y~-~~~~----------------------~~~~~~~---------------------------~~---------- 150 (383) .... |+ +... .+..... +. T Consensus 168 ~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~v 247 (555) T protein:vir:10 168 RVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRT 247 (555) T ss_pred CEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEEEeccCCccc Confidence 2211 10 0000 0000000 00 Q ss_pred -----ecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHH Q lcl|NC_018285. 151 -----VPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQA 225 (383) Q Consensus 151 -----~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~ 225 (383) |.....+.+|....++..||.||...+...+...+.+.+.......-...|...+...+.... . T Consensus 248 l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~---------~-- 316 (555) T protein:vir:10 248 LRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQD---------I-- 316 (555) T ss_pred cccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccc---------c-- Confidence 111112333333345678999999999999999999999888887777777777655543221 0 Q ss_pred hhcCCcceeecC---CCceeeecccChhhHH-HHHHHHHHHHHHHHHhcCCHHH-hc-ccccCcCHHHHH--HHHHHHHH Q lcl|NC_018285. 
226 MKQMQGGPLVLD---DLEDFTPLEIKSNVAQ-LLKQADWTTGQFAKVYGIPENV-VG-GQGDQQSSLEMS--SNVYSKAV 297 (383) Q Consensus 226 ~~~~~g~~~vl~---~g~~~~~~~~~~~d~~-~~e~~~~~~~~Ia~~~gVpp~~-lg-~~~~~~~~~e~~--~~~~~~~l 297 (383) .-..|++-.+. .+-...++-....|.+ ..+..+..+..|-.+|-.+..+ ++ ..+..-+.+|.. ..-....| T Consensus 317 -~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~L 395 (555) T protein:vir:10 317 -STVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLML 395 (555) T ss_pred -eeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHh Confidence 11223321111 1112233222222333 3455677888999999877433 32 122223444443 23445677 Q ss_pred HHHHHHHHHHHHHhhcchhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcch-------------- Q lcl|NC_018285. 298 ARYLRPFLSELSQKLSCDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKEL-------------- 363 (383) Q Consensus 298 ~P~~~~i~~~l~~~l~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~-------------- 363 (383) -|....+.++|=.-|+.+ .+.-|.+.|.+ ++ +|.++ T Consensus 396 G~v~~rl~~E~l~Pli~r------------------~~~il~r~g~l-----------P~-~P~~l~~~~i~v~yis~La 445 (555) T protein:vir:10 396 GPVLERMHNEILDPLIEL------------------TFQRMVEANIL-----------PP-PPQEMQGVDLNVEFVSMLA 445 (555) T ss_pred hHHHHHHHHHHHHHHHHH------------------HHHHHHhcCCC-----------CC-CchhhcCceeEEEeccHHH Confidence 777777777654333211 11122222222 10 11100 Q ss_pred ------------hHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 364 ------------PKGENPNRTILKGGETNGQD 383 (383) Q Consensus 364 ------------~~~~~~~~~~~~ggd~~~~d 383 (383) ..+... 
.++.+-+-+--| T Consensus 446 ~aq~~~~~~~i~~~l~~i--~~laq~~P~vld 475 (555) T protein:vir:10 446 QAQRAIATNSVDRFVGNL--GAVAGIKPEVLD 475 (555) T ss_pred HHHHHHHHHHHHHHHHHH--HHHhcCChhhhh Confidence 000000 011121111112 No 261 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=27.89 E-value=1.8 Score=19.18 Aligned_cols=336 Identities=13% Similarity=0.070 Sum_probs=147.1 Q ss_pred CchhhhhhcCCcccccccccccc---hh---hcccccCCceechhhhhccHHHHHHHHHHHHhhhhC--c----e---ee Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITD---PE---FLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA--K----L---TT 65 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~---~~---~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~--p----~---~~ 65 (383) =+-|+.++..|..+..-|..+.+ |. ++..-.....-...+ +=.++-..|++.+|+.+-+. | | .+ T Consensus 11 ~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~-~~dst~~~a~~~LAa~L~~~ltpp~~~WF~l~~ 89 (555) T protein:vir:98 11 LSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNN-ILDNTGTRALRVLAAGMMAGMTSPARPWFRLTT 89 (555) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccc-cccccHHHHHHHHHHHHHHhhcCCCCccccccc Confidence 34455555555444433333221 11 111000000000011 12344556666666665441 2 1 12 Q ss_pred ecchh------hh--------hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC Q lcl|NC_018285. 66 SRKQM------QG--------IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ 131 (383) Q Consensus 66 ~~~~~------~~--------l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~ 131 (383) .+... .. +...-+ .-+.+.-+..+..+++.+|||.+++..+.. 
..+.+.+++..++-+..+..+ T Consensus 90 ~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~-~~~rf~~~pl~~~~v~~d~~G 167 (555) T protein:vir:98 90 SIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPDFD-AVVYHHSLTAGEYAIAADNQG 167 (555) T ss_pred CcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecCCC-ceEEEEEeecceeEEeeCCCC Confidence 11110 00 111111 112333345677899999999999876653 445566666666655554444 Q ss_pred ceeE-EE-Eeec----------------------Ccccccc---------------------------ee---------- Q lcl|NC_018285. 132 NGLY-YN-VTFD----------------------DPRIPPK---------------------------QH---------- 150 (383) Q Consensus 132 ~~~~-y~-~~~~----------------------~~~~~~~---------------------------~~---------- 150 (383) .... |+ +... .+..... +. T Consensus 168 ~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~v 247 (555) T protein:vir:98 168 RVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRT 247 (555) T ss_pred CEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEEEeccCCccc Confidence 2211 10 0000 0000000 00 Q ss_pred -----ecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHH Q lcl|NC_018285. 151 -----VPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQA 225 (383) Q Consensus 151 -----~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~ 225 (383) |.....+.+|....++..||.||...+...+...+.+.+.......-...|...+...+.... . T Consensus 248 l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~---------~-- 316 (555) T protein:vir:98 248 LRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQD---------I-- 316 (555) T ss_pred cccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccc---------c-- Confidence 111112333333345678999999999999999999999888887777777777655543221 0 Q ss_pred hhcCCcceeecC---CCceeeecccChhhHH-HHHHHHHHHHHHHHHhcCCHHH-hc-ccccCcCHHHHH--HHHHHHHH Q lcl|NC_018285. 
226 MKQMQGGPLVLD---DLEDFTPLEIKSNVAQ-LLKQADWTTGQFAKVYGIPENV-VG-GQGDQQSSLEMS--SNVYSKAV 297 (383) Q Consensus 226 ~~~~~g~~~vl~---~g~~~~~~~~~~~d~~-~~e~~~~~~~~Ia~~~gVpp~~-lg-~~~~~~~~~e~~--~~~~~~~l 297 (383) .-..|++-.+. .+-...++-....|.+ ..+..+..+..|-.+|-.+..+ ++ ..+..-+.+|.. ..-....| T Consensus 317 -~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~L 395 (555) T protein:vir:98 317 -STVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLML 395 (555) T ss_pred -eeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHh Confidence 11223321111 1112233222222333 3455677888999999877433 32 122223444443 23445677 Q ss_pred HHHHHHHHHHHHHhhcchhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcch-------------- Q lcl|NC_018285. 298 ARYLRPFLSELSQKLSCDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKEL-------------- 363 (383) Q Consensus 298 ~P~~~~i~~~l~~~l~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~-------------- 363 (383) -|....+.++|=.-|+.+ .+.-|.+.|.+ ++ +|.++ T Consensus 396 G~v~~rl~~E~l~Pli~r------------------~~~il~r~g~l-----------P~-~P~~l~~~~i~v~yis~La 445 (555) T protein:vir:98 396 GPVLERMHNEILDPLIEL------------------TFQRMVEANIL-----------PP-PPQEMQGVDLNVEFVSMLA 445 (555) T ss_pred hHHHHHHHHHHHHHHHHH------------------HHHHHHhcCCC-----------CC-CchhhcCceeEEEeccHHH Confidence 777777777654333211 11122222222 10 11100 Q ss_pred ------------hHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 364 ------------PKGENPNRTILKGGETNGQD 383 (383) Q Consensus 364 ------------~~~~~~~~~~~~ggd~~~~d 383 (383) ..+... 
.++.+-+-+--| T Consensus 446 ~aq~~~~~~~i~~~l~~i--~~laq~~P~vld 475 (555) T protein:vir:98 446 QAQRAIATNSVDRFVGNL--GAVAGIKPEVLD 475 (555) T ss_pred HHHHHHHHHHHHHHHHHH--HHHhcCChhhhh Confidence 000000 011121111112 No 262 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=27.89 E-value=1.8 Score=19.18 Aligned_cols=336 Identities=13% Similarity=0.070 Sum_probs=147.1 Q ss_pred CchhhhhhcCCcccccccccccc---hh---hcccccCCceechhhhhccHHHHHHHHHHHHhhhhC--c----e---ee Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDITD---PE---FLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA--K----L---TT 65 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~---~~---~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~--p----~---~~ 65 (383) =+-|+.++..|..+..-|..+.+ |. ++..-.....-...+ +=.++-..|++.+|+.+-+. | | .+ T Consensus 11 ~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~-~~dst~~~a~~~LAa~L~~~ltpp~~~WF~l~~ 89 (555) T protein:vir:10 11 LSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNN-ILDNTGTRALRVLAAGMMAGMTSPARPWFRLTT 89 (555) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccc-cccccHHHHHHHHHHHHHHhhcCCCCccccccc Confidence 34455555555444433333221 11 111000000000011 12344556666666665441 2 1 12 Q ss_pred ecchh------hh--------hccCCCccCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEcCCC Q lcl|NC_018285. 66 SRKQM------QG--------IVDNPSNSANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ 131 (383) Q Consensus 66 ~~~~~------~~--------l~~~PN~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~~~~ 131 (383) .+... .. +...-+ .-+.+.-+..+..+++.+|||.+++..+.. 
..+.+.+++..++-+..+..+ T Consensus 90 ~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~-~~~rf~~~pl~~~~v~~d~~G 167 (555) T protein:vir:10 90 SIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPDFD-AVVYHHSLTAGEYAIAADNQG 167 (555) T ss_pred CcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecCCC-ceEEEEEeecceeEEeeCCCC Confidence 11110 00 111111 112333345677899999999999876653 445566666666655554444 Q ss_pred ceeE-EE-Eeec----------------------Ccccccc---------------------------ee---------- Q lcl|NC_018285. 132 NGLY-YN-VTFD----------------------DPRIPPK---------------------------QH---------- 150 (383) Q Consensus 132 ~~~~-y~-~~~~----------------------~~~~~~~---------------------------~~---------- 150 (383) .... |+ +... .+..... +. T Consensus 168 ~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~v 247 (555) T protein:vir:10 168 RVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRT 247 (555) T ss_pred CEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEEEeccCCccc Confidence 2211 10 0000 0000000 00 Q ss_pred -----ecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHH Q lcl|NC_018285. 151 -----VPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQA 225 (383) Q Consensus 151 -----~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~ 225 (383) |.....+.+|....++..||.||...+...+...+.+.+.......-...|...+...+.... . T Consensus 248 l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~---------~-- 316 (555) T protein:vir:10 248 LRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQD---------I-- 316 (555) T ss_pred cccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccc---------c-- Confidence 111112333333345678999999999999999999999888887777777777655543221 0 Q ss_pred hhcCCcceeecC---CCceeeecccChhhHH-HHHHHHHHHHHHHHHhcCCHHH-hc-ccccCcCHHHHH--HHHHHHHH Q lcl|NC_018285. 
226 MKQMQGGPLVLD---DLEDFTPLEIKSNVAQ-LLKQADWTTGQFAKVYGIPENV-VG-GQGDQQSSLEMS--SNVYSKAV 297 (383) Q Consensus 226 ~~~~~g~~~vl~---~g~~~~~~~~~~~d~~-~~e~~~~~~~~Ia~~~gVpp~~-lg-~~~~~~~~~e~~--~~~~~~~l 297 (383) .-..|++-.+. .+-...++-....|.+ ..+..+..+..|-.+|-.+..+ ++ ..+..-+.+|.. ..-....| T Consensus 317 -~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~L 395 (555) T protein:vir:10 317 -STVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLML 395 (555) T ss_pred -eeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHh Confidence 11223321111 1112233222222333 3455677888999999877433 32 122223444443 23445677 Q ss_pred HHHHHHHHHHHHHhhcchhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcch-------------- Q lcl|NC_018285. 298 ARYLRPFLSELSQKLSCDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKEL-------------- 363 (383) Q Consensus 298 ~P~~~~i~~~l~~~l~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~-------------- 363 (383) -|....+.++|=.-|+.+ .+.-|.+.|.+ ++ +|.++ T Consensus 396 G~v~~rl~~E~l~Pli~r------------------~~~il~r~g~l-----------P~-~P~~l~~~~i~v~yis~La 445 (555) T protein:vir:10 396 GPVLERMHNEILDPLIEL------------------TFQRMVEANIL-----------PP-PPQEMQGVDLNVEFVSMLA 445 (555) T ss_pred hHHHHHHHHHHHHHHHHH------------------HHHHHHhcCCC-----------CC-CchhhcCceeEEEeccHHH Confidence 777777777654333211 11122222222 10 11100 Q ss_pred ------------hHHhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 364 ------------PKGENPNRTILKGGETNGQD 383 (383) Q Consensus 364 ------------~~~~~~~~~~~~ggd~~~~d 383 (383) ..+... 
.++.+-+-+--| T Consensus 446 ~aq~~~~~~~i~~~l~~i--~~laq~~P~vld 475 (555) T protein:vir:10 446 QAQRAIATNSVDRFVGNL--GAVAGIKPEVLD 475 (555) T ss_pred HHHHHHHHHHHHHHHHHH--HHHhcCChhhhh Confidence 000000 011121111112 No 263 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=27.30 E-value=1.9 Score=19.11 Aligned_cols=345 Identities=11% Similarity=0.056 Sum_probs=140.0 Q ss_pred Cc-----hhhhhhcCCccccccccccc---chhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhC------ce--- Q lcl|NC_018285. 1 MP-----IFNLATESPPNNQGGFFDIT---DPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA------KL--- 63 (383) Q Consensus 1 Mg-----lf~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~------p~--- 63 (383) |+ .|+.++..|..+...|..+. .|..+..-.....-...+.+ .++-..|++.+|+.+-+. || T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~-dstg~~a~~~Laa~l~~~ltpp~~~WF~l 79 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPY-QSLGSKGVNALSSKLMLSLFPIQTSFFKL 79 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccccc-cchHHHHHHHHHHHHHHhhcCCCCccccc Confidence 65 44556665544444333322 22222111100000011222 334445666666655431 22 Q ss_pred eeecch--------------hhhhcc----CCCcc---CCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccce Q lcl|NC_018285. 64 TTSRKQ--------------MQGIVD----NPSNS---ANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQ 122 (383) Q Consensus 64 ~~~~~~--------------~~~l~~----~PN~~---~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~ 122 (383) .+.+.. ...++. .-... -+.+.-+..++.+++.+|||.+++..+ +...+||.. 
T Consensus 80 ~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~----~~~~~pl~~-- 153 (542) T protein:vir:78 80 QINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAGKK----TLKVYPLDR-- 153 (542) T ss_pred cCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEecCC----CceEEecce-- Confidence 111110 000110 00011 123333456778889999998876433 233444422 Q ss_pred eEEEEcCCCce---------------------------------------------------------------eEEEEe Q lcl|NC_018285. 123 VSFNRLDNQNG---------------------------------------------------------------LYYNVT 139 (383) Q Consensus 123 v~~~~~~~~~~---------------------------------------------------------------~~y~~~ 139 (383) +-+..+..|.. ..+... T Consensus 154 y~v~~d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~~~~~s~~~e 233 (542) T protein:vir:78 154 YVIERDGDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQHRWHQE 233 (542) T ss_pred eEEeeCCCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccccCCCeEEEEEE Confidence 22222222210 000111 Q ss_pred ecCccc-c--cceeecccceEEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHH Q lcl|NC_018285. 140 FDDPRI-P--PKQHVPQSDILHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFK 216 (383) Q Consensus 140 ~~~~~~-~--~~~~~~~~dvih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~ 216 (383) .++... + ....|.....+..|....++..||.||...+...+...+.+.+.......-...|..++..++....... T Consensus 234 ~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~~~~ 313 (542) T protein:vir:78 234 CDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKPQSL 313 (542) T ss_pred eccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhc Confidence 111000 0 0011222233444444456678999999999999999999999999988888888877776655444221 Q ss_pred HHHHHHHHHhhcCCcceeecCC--CceeeecccChhhHH-HHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--H Q lcl|NC_018285. 
217 TKVSRSRQAMKQMQGGPLVLDD--LEDFTPLEIKSNVAQ-LLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--N 291 (383) Q Consensus 217 ~~~~~~~~~~~~~~g~~~vl~~--g~~~~~~~~~~~d~~-~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~ 291 (383) . .+..+.++.+. ++...++.. +.+.+ ..+..+.....|-.+|-+-. . ..+..-+.+|... . T Consensus 314 ~----------~~~~g~iv~g~~~~v~~~~~~~-~~~~~~~~~~i~~~~~rI~~aFl~~~-~--~d~~rvTAtEV~~r~~ 379 (542) T protein:vir:78 314 A----------RAGTGAIIQGRAEDVSVVQANK-GADFRTVQEMIRDLSQRISDAFLILN-V--RQSERTTATEVREVQM 379 (542) T ss_pred c----------cCCCceeecCCccceeeeeccc-ccchhHHHHHHHHHHHHHHHHhcccc-c--CCcccccHHHHHHHHH Confidence 1 11112222222 233333332 22333 34555677788888885421 1 1111223334332 2 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcch-h----hccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhh-cCCcCCcchhH Q lcl|NC_018285. 292 VYSKAVARYLRPFLSELSQKLSCD-V----DADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQ-QAEILPKELPK 365 (383) Q Consensus 292 ~~~~~l~P~~~~i~~~l~~~l~~~-~----e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg-~~~~~~~d~~~ 365 (383) =....|-|....++++|=.-|+.+ + +.+..+.+-. .+++-.+.|+-++..+.. ...+ + +. T Consensus 380 E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~----------~lv~~~~~s~La~~~r~~~~~~l---~-~~ 445 (542) T protein:vir:78 380 ELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPK----------GLVMPTVVAGLGGVGRGEDRAAL---I-EF 445 (542) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCch----------hceeeeeechHHHHHHHHHHHHH---H-HH Confidence 334566777777766654322211 0 0111110000 011111122111111100 0000 0 00 Q ss_pred HhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 366 GENPNRTILKGGETNGQD 383 (383) Q Consensus 366 ~~~~~~~~~~ggd~~~~d 383 (383) +..... .. 
|+..=.+ T Consensus 446 ~~~i~~-~~--~p~~l~~ 460 (542) T protein:vir:78 446 MQTVGQ-AM--GPEALQQ 460 (542) T ss_pred HHHHHH-hc--CChhHHh Confidence 111111 01 1111012 No 264 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=26.91 E-value=1.9 Score=19.06 Aligned_cols=380 Identities=13% Similarity=0.079 Sum_probs=157.0 Q ss_pred Cc-hhhhhhcC----Ccccccccccccchhh--cccccCCcee--------------chhhhhccHHHHHHHHHHHHhhh Q lcl|NC_018285. 1 MP-IFNLATES----PPNNQGGFFDITDPEF--LATLNGSEWV--------------SAETALKNSDLFSIISQLSNDLA 59 (383) Q Consensus 1 Mg-lf~~~~~~----~~~~~~~~~~~~~~~~--~~~~~~~~~~--------------~~~~a~~~~~v~~~i~~ia~~ia 59 (383) |. ||....++ +..+....-+..+.+. .+....+.++ ..+..+.+|.|..||+-|.+.+. T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai 80 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVNETI 80 (533) T ss_pred CccccccccccccccccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 53 44432221 1111111101111111 0000011111 34566789999999999998865 Q ss_pred hC-----ceeeecchh-------hhhccCCC---ccCCHHHHHHHHHHHHHHcCCeEEEEeecCC---CceeEEEEeccc Q lcl|NC_018285. 60 TA-----KLTTSRKQM-------QGIVDNPS---NSANRFNFYQSIFAQMLLGGEAFAYRWRNDN---GRDMKWEYLRPS 121 (383) Q Consensus 60 ~~-----p~~~~~~~~-------~~l~~~PN---~~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~---g~~~~l~~l~~~ 121 (383) -+ |+.+.=+.. ..+...-+ ..++...--+..+..|++.|..|..++.+.+ .-+.+|.+|+|. 
T Consensus 81 v~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr~lDPr 160 (533) T protein:vir:10 81 CGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELRYIDPR 160 (533) T ss_pred eecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCCCccccceeeeecccc Confidence 32 222211110 00111111 1122222224566788899999998876633 358999999999 Q ss_pred eeEEEEcC-----CCc-------------eeEEEEeecC--cccccceeecccceEEeccCCCC--ccccCcchHHHHHH Q lcl|NC_018285. 122 QVSFNRLD-----NQN-------------GLYYNVTFDD--PRIPPKQHVPQSDILHFRLLSVD--GGLTSVSPLMALGR 179 (383) Q Consensus 122 ~v~~~~~~-----~~~-------------~~~y~~~~~~--~~~~~~~~~~~~dvih~~~~~~~--~~~~G~s~~~~~~~ 179 (383) .|+.++.. ++. ..+|-+...+ ........++.+ .|++-+...- +.-.=+|-|..+.+ T Consensus 161 ~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~vkI~~d-AI~y~hSGl~d~~~~~i~syLhkAiK 239 (533) T protein:vir:10 161 KIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGLKIAPD-SICYVHSGIMDLNKNMTLSHLHKAIK 239 (533) T ss_pred ceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccccCCCceecchh-heeeeeccceeCCCCceeccchHhHH Confidence 98875421 110 0111111110 011222445554 4444332211 11123578888888 Q ss_pred HHHHHHHHHHHHHHHHhccCCcceeEeec-CCCC-HHHHHHHHHHHHHh-----hcCC-c------ce--------eec- Q lcl|NC_018285. 180 ELDIQKASDKLTLNSLKNALNANGILKIK-GGGL-LDFKTKVSRSRQAM-----KQMQ-G------GP--------LVL- 236 (383) Q Consensus 180 ~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~-~e~~~~~~~~~~~~-----~~~~-g------~~--------~vl- 236 (383) .+....-++....=+.-.-+.-+-+.-.+ |.+. ..+.+.++...... ++++ | +. 
+.= T Consensus 240 p~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRR 319 (533) T protein:vir:10 240 AVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRR 319 (533) T ss_pred HHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhccccc Confidence 88887777777665555545444444333 3333 33333333322111 1111 1 11 110 Q ss_pred --CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccCc--CHHHHH--HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018285. 237 --DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ--SSLEMS--SNVYSKAVARYLRPFLSELSQ 310 (383) Q Consensus 237 --~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~--~~~e~~--~~~~~~~l~P~~~~i~~~l~~ 310 (383) +.|.+++.+.....-.+ ++-..+..+.++.+++||.+-|+..+..+ ...|-. +.=+...|.-+-..+...|.. T Consensus 320 eGgrgTEItTLpGgqnLge-m~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~ 398 (533) T protein:vir:10 320 EGGRGTEITTLPGGQNLGE-LEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRFSELFTD 398 (533) T ss_pred CCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 23567776655433223 23335778899999999999997433211 111222 122333444444444444433 Q ss_pred ----hhcc-----hhhcc-----chhhhccCHH--------HHHHHHHHHH-----hCCCcCHHHHHHHh-h-------- Q lcl|NC_018285. 311 ----KLSC-----DVDAD-----IFPAVDPTGA--------NYISRINSMV-----KSGTLAQNQGLYIL-Q-------- 354 (383) Q Consensus 311 ----~l~~-----~~e~~-----~~~~~~~~~~--------~~~~~~~~l~-----~~g~~t~nE~r~~l-g-------- 354 (383) .|.. .-+++ +...+..|.. -....++.+- -+-.++.+=+++.+ . T Consensus 399 ~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~ 478 (533) T protein:vir:10 399 LLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKE 478 (533) T ss_pred HHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHH Confidence 3321 11211 1111111111 0111111111 12233433333321 1 Q ss_pred ---------cCCc-CC--cchhHHhCCCCCCCCCCC----CCCCC Q lcl|NC_018285. 
355 ---------QAEI-LP--KELPKGENPNRTILKGGE----TNGQD 383 (383) Q Consensus 355 ---------~~~~-~~--~d~~~~~~~~~~~~~ggd----~~~~d 383 (383) ..|+ .+ .+.....+. ..|-.||- ..++- T Consensus 479 ~~kqI~~E~k~~~~~~p~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 522 (533) T protein:vir:10 479 IDKQIESEMESGIIADPAAEMDPAMAA-GDPDAGGAPAEEVAPEG 522 (533) T ss_pred HHHHHHHHHhCCCCCCCcchhhHHhcC-CCCCcCCcccccCCCCC Confidence 1222 11 111111111 12222221 11221 No 265 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=25.95 E-value=2 Score=18.93 Aligned_cols=379 Identities=12% Similarity=0.125 Sum_probs=159.2 Q ss_pred Cch-----hhhhhcCCc--------ccccccc--cccch-------------hhcc---cccCCce---------e-chh Q lcl|NC_018285. 1 MPI-----FNLATESPP--------NNQGGFF--DITDP-------------EFLA---TLNGSEW---------V-SAE 39 (383) Q Consensus 1 Mgl-----f~~~~~~~~--------~~~~~~~--~~~~~-------------~~~~---~~~~~~~---------~-~~~ 39 (383) |.| |+...+.+. ....... ...++ .+.+ .+.++.. + ..+ T Consensus 1 m~~~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR 80 (524) T protein:vir:72 1 MKFNVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYR 80 (524) T ss_pred CCCchhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHH Confidence 444 222111100 0000000 00010 0111 1111100 0 245 Q ss_pred hhhccHHHHHHHHHHHHhhhhC-----ceeeecchh-------hhhccCCC---ccCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_018285. 40 TALKNSDLFSIISQLSNDLATA-----KLTTSRKQM-------QGIVDNPS---NSANRFNFYQSIFAQMLLGGEAFAYR 104 (383) Q Consensus 40 ~a~~~~~v~~~i~~ia~~ia~~-----p~~~~~~~~-------~~l~~~PN---~~~t~~~f~~~~~~~~~l~G~a~~~i 104 (383) ..+.+|.|..||+-|.+.+.-+ |+.+.=.+. 
..+...-+ ..++...--...+..|++.|..|+.+ T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhK 160 (524) T protein:vir:72 81 NLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVDSRIFFHK 160 (524) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEE Confidence 5678999999999999886532 222211000 00111111 11112222235567889999999988 Q ss_pred eecCC---CceeEEEEeccceeEEEEc-----CCCc------eeEEEEeec--Cc-------ccccceeecccceEEecc Q lcl|NC_018285. 105 WRNDN---GRDMKWEYLRPSQVSFNRL-----DNQN------GLYYNVTFD--DP-------RIPPKQHVPQSDILHFRL 161 (383) Q Consensus 105 ~r~~~---g~~~~l~~l~~~~v~~~~~-----~~~~------~~~y~~~~~--~~-------~~~~~~~~~~~dvih~~~ 161 (383) +.|.. .-+.+|.+|+|..+..++. +.+. ..+|-|... .+ ..+....++.+-|.|... T Consensus 161 iid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI~y~hS 240 (524) T protein:vir:72 161 IIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAVVYAHS 240 (524) T ss_pred EEeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhheeeeec Confidence 77643 2488999999999876432 1111 111222110 00 112234455555554432 Q ss_pred CCCC-ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCCH-HHHHHHHHHHHH-----hhcCC-c- Q lcl|NC_018285. 162 LSVD-GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGLL-DFKTKVSRSRQA-----MKQMQ-G- 231 (383) Q Consensus 162 ~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~~-e~~~~~~~~~~~-----~~~~~-g- 231 (383) .-.+ +.-.=+|-|..+.+.+....-++....=+.-.-+.-+-+.-.+ |.+.+ .+.+.++..... 
.++++ | T Consensus 241 GL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGe 320 (524) T protein:vir:72 241 GLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGK 320 (524) T ss_pred cceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCe Confidence 1111 1122367788888888887777777665555545444444333 33333 333333332211 11111 2 Q ss_pred -----ce--------eec---CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc--c--c-CcCHHHHH- Q lcl|NC_018285. 232 -----GP--------LVL---DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ--G--D-QQSSLEMS- 289 (383) Q Consensus 232 -----~~--------~vl---~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~--~--~-~~~~~e~~- 289 (383) +. +.= +.|.+++.+.....-.+ ++-..+....+..+++||.+-|... + + +..++=.. T Consensus 321 v~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRD 399 (524) T protein:vir:72 321 IKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGN-MEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRD 399 (524) T ss_pred eccchhhhhhHhhhcccccCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHH Confidence 11 111 23567776655433333 2333577889999999999999321 1 1 11111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHH----hhc-----chhhcc-----chhhhccCHH--------HHHHHHHHHH-----hCC Q lcl|NC_018285. 290 SNVYSKAVARYLRPFLSELSQ----KLS-----CDVDAD-----IFPAVDPTGA--------NYISRINSMV-----KSG 342 (383) Q Consensus 290 ~~~~~~~l~P~~~~i~~~l~~----~l~-----~~~e~~-----~~~~~~~~~~--------~~~~~~~~l~-----~~g 342 (383) +.=+...|.-+-..+...|.. .|. +.-|++ +...+..|.. -....++.+- -+- T Consensus 400 EikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGk 479 (524) T protein:vir:72 400 ELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGK 479 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 122334444444444444433 332 111221 1111111211 1111222111 134 Q ss_pred CcCHHHHHHHh-hcCCcCCcchhHH-----hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 
343 TLAQNQGLYIL-QQAEILPKELPKG-----ENPNRTILKGGETNGQD 383 (383) Q Consensus 343 ~~t~nE~r~~l-g~~~~~~~d~~~~-----~~~~~~~~~ggd~~~~d 383 (383) .++.+=+++.+ .+. ..|+... +.......+--+.+.+| T Consensus 480 y~s~~yi~k~ILr~t---Deei~~~~k~I~~E~k~~~~~~~~~~~~~ 523 (524) T protein:vir:72 480 YISHRTAMKDILQMT---DEEIEQEAKQIEEESKEARFQDPDQEQED 523 (524) T ss_pred cchhHHHHHHHhccC---HHHHHHHHHHHHHHhhcCCCCCCchhhhc Confidence 45666666543 322 2222211 11111111122333344 No 266 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=25.19 E-value=2.1 Score=18.83 Aligned_cols=348 Identities=9% Similarity=-0.000 Sum_probs=141.7 Q ss_pred CchhhhhhcCCcccccccccc---cchhhcccccCCceechhhhhccHHHHHHHHHHHHhhhhC------ce---eeecc Q lcl|NC_018285. 1 MPIFNLATESPPNNQGGFFDI---TDPEFLATLNGSEWVSAETALKNSDLFSIISQLSNDLATA------KL---TTSRK 68 (383) Q Consensus 1 Mglf~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~------p~---~~~~~ 68 (383) =+.|+.++.+|..+...|..+ +.|..+..-..... ..+.+ .++--.|++.+|+.+-+. || .+.+. T Consensus 17 ~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~--~~~~~-dstg~~a~~~LAa~l~~~ltpp~~~WF~L~~~d~ 93 (516) T protein:vir:10 17 PKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNET--SQNGW-QGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAQ 93 (516) T ss_pred HHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCccc--ccccc-cchHHHHHHHHHHHHHhhhcCCCCccccccCChh Confidence 345555655554444333322 23322221111111 11222 344456666666655431 22 11111 Q ss_pred hh-------------hhhcc----CCCc---cCCHHHHHHHHHHHHHHcCCeEEEEeecCCCceeEEEEeccceeEEEEc Q lcl|NC_018285. 69 QM-------------QGIVD----NPSN---SANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRL 128 (383) Q Consensus 69 ~~-------------~~l~~----~PN~---~~t~~~f~~~~~~~~~l~G~a~~~i~r~~~g~~~~l~~l~~~~v~~~~~ 128 (383) .. +.++. .-.. .-+.+.-+..++.+++.+|||.+++. .++ ....|||. 
++-+..+ T Consensus 94 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d--~~~-~~~~~pl~--~y~v~~d 168 (516) T protein:vir:10 94 GEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKP--SKG-AISAIPMH--HYVVNRD 168 (516) T ss_pred hHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEec--CCC-CeEEEEcC--eEEEeeC Confidence 00 00100 0000 11233334566678889999987763 332 24456653 2333333 Q ss_pred CCCce--eEEEEe-----------------------------------ecCccccccee---------------ecccce Q lcl|NC_018285. 129 DNQNG--LYYNVT-----------------------------------FDDPRIPPKQH---------------VPQSDI 156 (383) Q Consensus 129 ~~~~~--~~y~~~-----------------------------------~~~~~~~~~~~---------------~~~~dv 156 (383) ..|.. ++++.. .-.....-... |...-. T Consensus 169 ~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~~~~~~~~~~~d~~~~~~~s~~~~~e~P~ 248 (516) T protein:vir:10 169 TNGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLGEGFWELKQSADDIPVGKVSKIKSEKLPF 248 (516) T ss_pred CCCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecCCCceEEEEeeCceeeccccccccccCCe Confidence 32221 111100 00000000000 111122 Q ss_pred EEeccCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeecCCCCHHHHHHHHHHHHHhhcCCcceeec Q lcl|NC_018285. 157 LHFRLLSVDGGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVL 236 (383) Q Consensus 157 ih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~~~~~~e~~~~~~~~~~~~~~~~g~~~vl 236 (383) +..|....++..||.||..-+...+...+...+.......-...|..++...+........ .+..+.++- T Consensus 249 ~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~----------~~~~g~~~~ 318 (516) T protein:vir:10 249 IPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFV----------NSGTGEVVT 318 (516) T ss_pred eeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhhc----------cCCCceeec Confidence 3333333456789999999999999999999999888887778777777665554432111 111122222 Q ss_pred CCCceeeeccc-ChhhHH-HHHHHHHHHHHHHHHhcCCHHHhcccccCcCHHHHHH--HHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018285. 
237 DDLEDFTPLEI-KSNVAQ-LLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSS--NVYSKAVARYLRPFLSELSQKL 312 (383) Q Consensus 237 ~~g~~~~~~~~-~~~d~~-~~e~~~~~~~~Ia~~~gVpp~~lg~~~~~~~~~e~~~--~~~~~~l~P~~~~i~~~l~~~l 312 (383) +..-.+.++.. +..|.+ ..+..+.....|-.+|-+.....-.. ..-+.+|... .-....|-|.+..+.++|=.-| T Consensus 319 g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~-~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pl 397 (516) T protein:vir:10 319 GVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDA-ERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPV 397 (516) T ss_pred CCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhhhhccCC-ccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHH Confidence 22222333322 223333 24556777888989998764332221 1223344332 2344567777777776654333 Q ss_pred cchhhccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhhcCCcCCcchhH-HhCCCCCCCCCCCCCCCC Q lcl|NC_018285. 313 SCDVDADIFPAVDPTGANYISRINSMVKSGTLAQNQGLYILQQAEILPKELPK-GENPNRTILKGGETNGQD 383 (383) Q Consensus 313 ~~~~e~~~~~~~~~~~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~d~~~-~~~~~~~~~~ggd~~~~d 383 (383) +...-..... . .-..+++-.+.+.-.+ |++.. .-..+.. .+-.. .+-+++.+=-| T Consensus 398 i~r~~~~~~p----~------~P~~lv~~~~v~~i~~---L~raq-~~~~i~~~~q~i~--~~~q~~p~v~d 453 (516) T protein:vir:10 398 AMWGLLEAGD----S------FTSDLVDPVIITGIEA---LGRMA-ELDKLANFAQYMS--LPLQWPEPVLA 453 (516) T ss_pred HHHHHHhhCC----C------CChhhcCcceehhHHH---HHHHH-HHHHHHHHHHHHH--HHhcCChHHHh Confidence 2111000000 0 0001111111111000 00000 0000000 00000 01112111122 No 267 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=24.76 E-value=2.1 Score=18.77 Aligned_cols=379 Identities=12% Similarity=0.125 Sum_probs=159.2 Q ss_pred Cch-----hhhhhcCCc--------ccccccc--cccch-------------hhcc---cccCCce---------e-chh Q lcl|NC_018285. 
1 MPI-----FNLATESPP--------NNQGGFF--DITDP-------------EFLA---TLNGSEW---------V-SAE 39 (383) Q Consensus 1 Mgl-----f~~~~~~~~--------~~~~~~~--~~~~~-------------~~~~---~~~~~~~---------~-~~~ 39 (383) |.| |+...+.+. ....... ...++ .+.+ .+.++.. + ..+ T Consensus 1 m~~~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR 80 (524) T protein:vir:10 1 MKFNVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYR 80 (524) T ss_pred CCCchhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHH Confidence 444 222111100 0000000 00010 0111 1111100 0 245 Q ss_pred hhhccHHHHHHHHHHHHhhhhC-----ceeeecchh-------hhhccCCC---ccCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_018285. 40 TALKNSDLFSIISQLSNDLATA-----KLTTSRKQM-------QGIVDNPS---NSANRFNFYQSIFAQMLLGGEAFAYR 104 (383) Q Consensus 40 ~a~~~~~v~~~i~~ia~~ia~~-----p~~~~~~~~-------~~l~~~PN---~~~t~~~f~~~~~~~~~l~G~a~~~i 104 (383) ..+.+|.|..||+-|.+.+.-+ |+.+.=.+. ..+...-+ ..++...--...+..|++.|..|+.+ T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhK 160 (524) T protein:vir:10 81 NLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVDSRIFFHK 160 (524) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEE Confidence 5678999999999999886532 222211000 00111111 11122222235567889999999988 Q ss_pred eecCC---CceeEEEEeccceeEEEEc-----CCCc------eeEEEEeec--Cc-------ccccceeecccceEEecc Q lcl|NC_018285. 105 WRNDN---GRDMKWEYLRPSQVSFNRL-----DNQN------GLYYNVTFD--DP-------RIPPKQHVPQSDILHFRL 161 (383) Q Consensus 105 ~r~~~---g~~~~l~~l~~~~v~~~~~-----~~~~------~~~y~~~~~--~~-------~~~~~~~~~~~dvih~~~ 161 (383) +.|.. .-+.+|.+|+|..+..++. +.+. ..+|-|... .+ ..+....++.+-|.|... 
T Consensus 161 iid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI~y~hS 240 (524) T protein:vir:10 161 IIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAIVYAHS 240 (524) T ss_pred EeeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhheeeeec Confidence 77633 2488999999999876432 1111 111222111 00 112234455555544432 Q ss_pred CCCC-ccccCcchHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEeec-CCCCH-HHHHHHHHHHHH-----hhcCC-c- Q lcl|NC_018285. 162 LSVD-GGLTSVSPLMALGRELDIQKASDKLTLNSLKNALNANGILKIK-GGGLL-DFKTKVSRSRQA-----MKQMQ-G- 231 (383) Q Consensus 162 ~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i~~~~-~~~~~-e~~~~~~~~~~~-----~~~~~-g- 231 (383) .-.+ +.-.=+|-|..+.+.+....-++....=+.-.-+.-+-+.-.+ |.+.+ .+.+.++..... .++++ | T Consensus 241 GL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGe 320 (524) T protein:vir:10 241 GLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGK 320 (524) T ss_pred cceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCe Confidence 1111 1122367788888888887777777665555545444444333 33333 333333332211 11111 2 Q ss_pred -----ce--------eec---CCCceeeecccChhhHHHHHHHHHHHHHHHHHhcCCHHHhccc--c--c-CcCHHHHH- Q lcl|NC_018285. 232 -----GP--------LVL---DDLEDFTPLEIKSNVAQLLKQADWTTGQFAKVYGIPENVVGGQ--G--D-QQSSLEMS- 289 (383) Q Consensus 232 -----~~--------~vl---~~g~~~~~~~~~~~d~~~~e~~~~~~~~Ia~~~gVpp~~lg~~--~--~-~~~~~e~~- 289 (383) +. +.= +.|.+++.+.....-.+ ++-..+....+..+++||.+-|... + + +..++=.. 
T Consensus 321 v~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRD 399 (524) T protein:vir:10 321 IKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGN-MEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRD 399 (524) T ss_pred eccchhhhhhHhhhcccccCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHH Confidence 11 111 23567776655433333 2333577889999999999999321 1 1 11111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHH----hhc-----chhhcc-----chhhhccCHH--------HHHHHHHHHH-----hCC Q lcl|NC_018285. 290 SNVYSKAVARYLRPFLSELSQ----KLS-----CDVDAD-----IFPAVDPTGA--------NYISRINSMV-----KSG 342 (383) Q Consensus 290 ~~~~~~~l~P~~~~i~~~l~~----~l~-----~~~e~~-----~~~~~~~~~~--------~~~~~~~~l~-----~~g 342 (383) +.=+...|.-+-..+...|.. .|. +.-|++ +...+..|.. -....++.+- -+- T Consensus 400 EikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGk 479 (524) T protein:vir:10 400 ELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGK 479 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 122334444444444444433 332 111221 1111111211 1111222111 134 Q ss_pred CcCHHHHHHHh-hcCCcCCcchhHH-----hCCCCCCCCCCCCCCCC Q lcl|NC_018285. 343 TLAQNQGLYIL-QQAEILPKELPKG-----ENPNRTILKGGETNGQD 383 (383) Q Consensus 343 ~~t~nE~r~~l-g~~~~~~~d~~~~-----~~~~~~~~~ggd~~~~d 383 (383) .++.+=+++.+ .+. ..|+... +.......+--+.+.+| T Consensus 480 y~s~~yi~k~ILr~t---Deei~~~~k~I~~E~k~~~~~~~~~~~~~ 523 (524) T protein:vir:10 480 YISHRTAMKDILQMT---DEEIEQEAKQIEEESKEARFQDPDQEQED 523 (524) T ss_pred cchhHHHHHHHhccC---HHHHHHHHHHHHHHhhcCCCCCCchhhhc Confidence 45666666543 322 2222211 11111111122333344 Done!